Your response is excellent
@Michiel Tack. Here are some additional points you may want to consider adding, if you feel that these do not go into too much detail and could thus water down the strength of your arguments.
Fatigue outcome in the Cochrane review
Busse et al emphasize on the moderate-to-large SMD for the fatigue outcome following GET:
Note the results of the Cochrane review on chronic fatigue syndrome: their results in standard deviation units provide a point estimate of a moderate to large effect (standardize mean difference 0.66) and the lower boundary of the confidence interval (0.31) excludes the threshold – SMD of 0.2 – suggested as a small effect.
You can cite your own commentary to the Cochrane review [1] in which you point out that, when removing the trial by Powell for the purpose of heterogeneity, the SMD increases from -0.66 to -0.44 and that this no longer represents a clinically meaningful improvement on the Chalder Fatigue Scale. And that, combined with the issue of imprecision, the evidence for fatigue post-treatment should be rated as 'low' instead of moderate according to the GRADE handbook.
2) Fatigue post-treatment should be rated as low instead of moderate quality evidence
The certainty of evidence for all outcomes in comparison 1 (exercise therapy versus treatment as usual, relaxation or flexibility) was assessed as low to very low according to the GRADE system. (2) The sole exception is fatigue measured at the end of treatment which was assessed as providing moderate certainty evidence. It is unclear why the certainty of evidence for this outcome wasn’t downgraded for inconsistency and/or imprecision as was the case for physical function measured at the end of treatment.
The meta-analysis of post-treatment fatigue was associated with considerable heterogeneity (I2 = 80%, P< 0.0001). This heterogeneity was mainly caused by one outlier, the trial by Powell et al. If this trial is excluded, heterogeneity is reduced to acceptable levels (I2 = 26%, P = 0.24) but the standardized mean difference (SMD) drops by one third, from -0.66 to -0.44. This corresponds to a 2.3 point instead of 3.4 point reduction when re-expressed on the 33-point Chalder Fatigue Scale, a difference that may no longer be clinically meaningful. A minimal important difference (MID) of 3 points on the Chalder Fatigue Scale has previously been used in an exercise trial for CFS. (3)
Fatigue post-treatment could also be downgraded for imprecision as the confidence interval crosses the line of no clinically significant effect. The 95% confidence interval of the SMD for fatigue (.31-1.10) corresponds to a 1.6 to 5.3 point interval when re-expressed on the 33-point Chalder Fatigue Scale. For continuous outcomes, the GRADE handbook recommends: “Whether you will rate down for imprecision is dependent on the choice of the difference (Δ) you wish to detect and the resulting sample size required.” Given that the authors of this Cochrane review specified a MID of 2.3 for the Chalder Fatigue Scale and that a MID of 3 points or higher has been used for CFS (3) and other chronic conditions (4,5), it seems warranted to downgrade this outcome for imprecision.
I recognize that for both inconsistency and imprecision the case isn’t clear-cut. The GRADE handbook, however, states that if there is a borderline case to downgrade the certainty of evidence for two factors, it is recommended to downgrade for at least one of them. The handbook writes: “If, for instance, reviewers find themselves in a close-call situation with respect to two quality issues (risk of bias and, say, precision), we suggest rating down for at least one of the two.” (2) Therefore the outcome fatigue measured at the end of treatment should preferably be downgraded to low certainty evidence.
On the topic of 'fatigue questionnaires' you may also want to briefly remind the issues with the Chalder Fatigue Scale, namely interpretation problems (questions that are not relevant to fatigue, focus on change in fatigue versus intensity) and the ceiling effect. [1, 2, 3]
Different diagnostic criteria and their effects on fatigue improvement
You can counter the arguments that, in the Cochrane review subgroup analysis on different diagnostic criteria, there were no differences between subgroups on the fatigue outcome, with the Agency for Healthcare Quality Research's addendum to its review of treatments for ME/CFS. [4] It states (bolding mine apart from paragraph titles):
Exercise Therapies
Six trials compared different forms of exercise therapy with control groups. Three trials used the Oxford (Sharpe, 1991) case definition for inclusion, all of which evaluated the effectiveness of graded exercise therapy (GET).
12,
30,
41 Of the three trials using the CDC (Fukuda,1994) case definition, one trial evaluated the effectiveness of GET.
42 The other two trials evaluated other exercise interventions and do not impact this addendum.
43-
45
Graded Exercise Therapy
Four trials evaluated the effectiveness of GET compared with a control group (n=656) (
Table 6,
Figures 3 and
4). Of these, three used the Oxford (Sharpe, 1991) case definition (n=607)
12,
30,
41 while one small trial used the CDC (Fukuda, 1994) case definition (n=49).
42 The results are consistent across trials with improvement in function, fatigue, and global improvement and provided moderate strength of evidence for improved function (4 trials, n=607) and global improvement (3 trials, n=539),
low strength of evidence for reduced fatigue (4 trials, n=607) and decreased work impairment (1 trial, n=480), and insufficient evidence for improved quality of life (no trials) (
Table 7).
By excluding the three trials using the Oxford (Sharpe, 1991) case definition for inclusion, there would be insufficient evidence of the effectiveness of GET on any outcome (1 trial, n=49).
Further, a recent systematic review on the evidence base for physiotherapy in ME/CFS shows that there is no evidence available when considering PEM. [5] This matters when it comes to directness because NICE now requires PEM in their proposed diagnostic criteria, so it can't recommend an intervention for which there is no evidence of effectiveness based on these new criteria.
Methods
A systematic review of randomized controlled trials published over the last two decades was conducted. Studies evaluating physiotherapeutic interventions for adult ME/CFS patients were included. The diagnostic criteria sets were classified into three groups according to the extent to which the importance of PEM was emphasized: chronic fatigue (CF; PEM not mentioned as a criterion), CFS (PEM included as an optional or minor criterion) or ME (PEM is a required symptom). The main results of included studies were synthesized in relation to the classification of the applied diagnostic criteria. In addition, special attention was given to the tolerability of the interventions.
Results
Eighteen RCTs were included in the systematic review: three RCTs with CF patients, 14 RCTs with CFS patients and one RCT covering ME patients with PEM. Intervention effects, if any, seemed to disappear with more narrow case definitions, increasing objectivity of the outcome measures and longer follow-up.
Conclusion
Currently, there is no scientific evidence when it comes to effective physiotherapy for ME patients. Applying treatment that seems effective for CF or CFS patients may have adverse consequences for ME patients and should be avoided.
The meta-epidemiological study on blinding includes few meta-analyses of behavioural interventions with PROM
Moustgaard et al. [6] found that, for the group "Effect of blinding patients in trials with patient reported outcomes" (Ia):
The ROR for lack of blinding of patients was 0.91 (95% credible interval 0.61 to 1.34) in 18 meta-analyses with patient reported outcomes
("An ROR lower than 1 indicated exaggerated effect estimates in trials without blinding")
According to the
appendix (table 4), in this group, 3 meta-analyses (out of 18, 16.7%) studied a behavioural intervention, totalling 50 trials (out of 132, 37.9%) and 19.13% of the weight of the analysis:
-
Neuropsychological rehabilitation for multiple sclerosis, 3 trials, ROR = 1.37 (95% CI: 0.22 - 8.66)
-
Music for stress and anxiety reduction in coronary heart disease patients, 5 trials, ROR = 1.58 (95% CI: 0.42 - 5.85)
- (Not a treatment)
Decision aids for people facing health treatment or screening decisions, 42 trials, ROR = 1.93 (95% CI: 1.50 - 2.48)
The rest only included meta-analyses of drug trials. It is not unreasonable to think that, if more blindable behavioural interventions (e.g. as proposed by Busse et al., "attention control in a trial of CBT") where positive expectations can be repeatedly promoted by the therapists (as in your example with the GET manual) had been included for this group, the effects of performance (and detection) bias would have tipped over the balance towards a lower ROR.
For the case of unblindable behavioural interventions, you'll find this as the first example on the
page about performance bias in the Catalog of Bias of the University of Oxford's Centre for EBM:
Performance bias often occurs in trials where it is not possible to blind participants and/or researchers, such as trials of surgical interventions, nutrition or exercise. For example,
a systematic review of trials of physical activity for women with breast cancer after adjuvant therapy found that all the included trials were at high risk of performance bias because the nature of the intervention (i.e. physical activity) made it impossible to blind trial personnel and participants, and because the main outcomes were subjective.
[1]
https://www.cochranelibrary.com/cds....pub8/detailed-comment/en?messageId=266353165
[2] Kirke KD. Measuring improvement and deterioration in myalgic encephalomyelitis/chronic fatigue syndrome: the pitfalls of the Chalder Fatigue Questionnaire. J R Soc Med. 2021 Feb;114(2):54.
doi: 10.1177/0141076820977843. Epub 2020 Dec 15. PMID: 33319615; PMCID: PMC7879015.
[3]
https://huisartsvink.files.wordpress.com/2018/08/wilshire-mcphee-cfq-cde-critique-for-s4me-final.pdf
[4] Beth Smith ME, Nelson HD, Haney E, et al. Diagnosis and Treatment of Myalgic Encephalomyelitis/Chronic Fatigue Syndrome. Rockville (MD): Agency for Healthcare Research and Quality (US); 2014 Dec. (Evidence Reports/Technology Assessments, No. 219.) July 2016 Addendum. Available from:
https://www.ncbi.nlm.nih.gov/books/NBK379582/
[5] Wormgoor MEA, Rodenburg SC. The evidence base for physiotherapy in myalgic encephalomyelitis/chronic fatigue syndrome when considering post-exertional malaise: a systematic review and narrative synthesis. J Transl Med. 2021 Jan 4;19(1):1.
doi: 10.1186/s12967-020-02683-4. PMID: 33397399; PMCID: PMC7780213.
[6] Moustgaard H, Clayton GL, Jones HE, Boutron I, Jørgensen L, Laursen DRT, Olsen MF, Paludan-Müller A, Ravaud P, Savović J, Sterne JAC, Higgins JPT, Hróbjartsson A. Impact of blinding on estimated treatment effects in randomised clinical trials: meta-epidemiological study. BMJ. 2020 Jan 21;368:l6802.
doi: 10.1136/bmj.l6802. Erratum in: BMJ. 2020 Feb 5;368:m358. PMID: 31964641; PMCID: PMC7190062.