Lucibee
Senior Member (Voting Rights)
Regression to the mean is an interesting possibility.
It's a certainty if they subgroup the data like that! Those with low scores will tend to improve; those with high scores will tend to get worse (before any treatment effects). They don't seem to have selected on PF this time, but that will have happened to a certain extent anyway, by nature of the selection criteria for CFS.
They also said:
The original protocol had only one primary outcome measure, the SF-36 PF. However, when some eligible participants were found to have high SF-36 PF scores at randomisation (because of their illness affecting cognitive or social functions but not physical function), we decided to also include fatigue, using the CFQ, as a co-primary outcome. This decision was made mid-way through trial recruitment...
So it looks like they did have some kind of cut-off originally (at least in principle), to allow themselves room for improvement. In some ways, it's good they weren't selecting patients on the measure they were going to use at outcome. But they would have been using selection measures that were highly correlated with it.
I'm not 100% sure what you're saying: they are saying that because the higher group didn't do as well, that's likely because they were already near the maximum (ceiling effects). There could be situations where this arises.
However, based on the figures here we have enough information to know that the patients in the higher functioning group only approved by a maximum of 2.7 points and may actually have decreased. A final score of around 55 is not near a maximum score unless one is accepting that graded exercise therapy can only bring people up to a very low level which the authors have never conceded.
Unlike us, the authors have the figures. It would be pretty clear if they gave the figures that their interpretation is not valid. But by not giving the figures people could be taken in by it.
But didn't they concede that by saying that there might be a ceiling effect? I think that's where I'm getting confused, because otherwise I don't understand what they haven't admitted to. To anyone who knows the PF scale, the improvement they reported is truly tiny.
I certainly agree that it would be useful to see the data. A nice set of histograms of the baseline and outcome measures in the 2 groups would make things much clearer.