Preprint Replicated blood-based biomarkers for Myalgic Encephalomyelitis not explicable by inactivity, 2024, Beentjes, Ponting et al

A very precise reason.
In this particular case a group has reported some fascinating findings based on hard work and very timely and initiative-driven data gathering. But the data have been presented in the context of what look to me to be spurious arguments. And, importantly, that is linked to the presentation of the data being very difficult to follow and missing some key aspects. SOD3 and complement proteins look very interesting but we have almost no information about where the values lie in relation to normal scatter or whether they occur in the same people, and so on. There is very lengthy mathematical modelling in terms of 'mediation analysis' but if the diagnostic ascertainment problem is as big as we have been led to believe you cannot even begin to build a mediation model. Additionally the causal diagram looks to be back to front, again relating to a concept of a diagnostic 'disease' unity.

And, most important of all, we know that the people involved are headed up by first rate minds at least one of whom we know personally and I can only assume that the analysis reflects lack of familiarity with the practical complexities around diagnostic categories and the way we interpret biological measures in terms of process models. The introduction jumps off with stuff about physical activity, and then settles back into an account of ME/CFS, which presumably should come first, since the issue of physical activity's relevance to ME/CFS is very specific to the history of the disease and its models and management. It looks to me like an enthusiastic draft rather than a paper.

In many other cases there doesn't seem much point in going in to deep discussion because the science looks pretty scrappy in the first place. Nevertheless, I do tend to raise these issues whenever studies claim to be looking for diagnostic markers, and raising them repeatedly in the context of specific studies does seem necessary because people are still making the same mistakes.
I agree with most of this. The paper does need reworking. As you say, we need graphs of actual values (with separate charts for male/female, and adjustments for age if appropriate).

I don't have a background in advanced statistics, so I say the following with much hesitation and willingness to be corrected, but, there are statistical techniques useful for identifying groups and subgroups based on clusters of attributes, things like PCA, random forest. If there are any promising clusters based on differences in individual molecule levels (and perhaps ratios of molecules), then it would be useful to see if a lack of activity can explain the differences. If physical activity does explain some attribute levels, those ones could be removed and classifying statistical techniques like PCA repeated, to see if any identified groups remain. I think we need that order of analysis - identify differences that are probably real differences in an identified group of individuals, and then consider if any of the differences are just the result lifestyle/medicine differences.

A family member did an analysis of a large collection of plants, looking to see if there were sufficient differences to divide the plants into two species, or just subspecies, or no division at all. I mention this, because I find it a useful, concrete way to think about how a differentiating analysis can be done. He had a whole lot of measured attributes including things like flower structure, leaf length and leaf width, as well as ratios of leaf length to leaf width. The classifying analyses showed that the plants divided neatly into two distinct groups. He then needed to consider if things like altitude or latitude could explain the differences, using regressions and spatial analysis, and found there was still a real difference. He then went on to speculate on the biological basis for the collection of attributes in each proposed species. I think the same sort of approach to the analysis, the same order of analysis, could be useful for studies like the one that is the subject of this thread, with physical activity being a possible confounder, like altitude might have been for the differences of my family member's plants.

Regarding the moved posts, we left enough to indicate that the problem had been raised in the context of this study and include some discussion of it. There were 27 posts that went into more detail or repeated points that were moved. I think SOD3 is another good example of where it is useful to split off the discussion. Personally, I would prefer that there is a separate thread for SOD3, rather than having a wide-ranging discussion of it on a study thread that covers many other things. That way, it is easier to go into much more detail and it is easier for people to find and follow the discussion. Threads on studies that have reported findings in ME/CFS cohorts can be linked.
 
If physical activity does explain some attribute levels, those ones could be removed

I don't fully understand mediation analysis but my guess is that it hinges on a correlation. If physical activity correlates to the physiologic measures found then one possibility is that these measures indicate part of the cause of the inactivity - via a process we think o f as the cause of ME/CFS. So maybe they shouldn't be thrown out. One of the things that worries me about the paper is that all causes end up at the blood tests. But the blood tests are only going to turn out to be interesting if they reflect causes of the syndrome, which is not an option on the diagram if I remember rightly.
 
Last edited:
A correlation between physical activity and a molecule doesn't prove much. As you say, the levels of the molecule might be an indication of disease, and the disease might be causing inactivity. But, if the 'ME/CFS' group and the healthy group both have the same relationship between levels of the molecule and activity levels (in terms of a regression line), and the levels of the molecule in the two groups (and indeed the levels of activity) overlap to a significant degree, then I think the molecule doesn't have much use for classification.
 
But, if the 'ME/CFS' group and the healthy group both have the same relationship between levels of the molecule and activity levels (in terms of a regression line),

That's an important point but if the molecule was a mediator of fatigue that synergised with some other factor in ME/CFS maybe you would get the same regression line.

the levels of the molecule in the two groups (and indeed the levels of activity) overlap to a significant degree,

Another good point. I think it reflects another general concern that I have that tests that help us identify groups with common natural history (often called 'diagnostic'!) nearly always fall outside two standard deviations from the mean in a good proportion of probands. There are some exceptions but mostly where there is an unbroken spectrum of risk-bearing factors - like lipids or blood pressure. There are also some oddities like urate but they are few.

If you rely on shifts or correlations in measures within the normal range and require combinations picked out by machine learning etc. then increasingly you put all your eggs in a statistical basket. There may be situations where that works. But if you are doing it to try to make a diagnostic category crisper then you have a Catch22 if you use the fuzzy clinical categorisation to do your statistics on.

My guess is that there is a risk that anything up to 80% of patients picked up on the screening criteria may not have ME/CFS as we like to understand it. ME/CFS is not that common - maybe one person in 500. As many as one person in fifty in the 40-70 age group is likely to acquire a diagnosis like ME or CFS simply because it is easier for the GP to provide an explanation for low grade poor health that does not fit another category. Worse still those 89% will bring with them all sorts of shifts in physiology that will confound a purely statistical approach to discriminatory tests.
 
The sort of scientists we want to take ME/CFS seriously are the sort of scientists who will immediately pick up the irrelevance of discussion of diagnostic tests and the Catch22 of looking for tests that will provide that when there are likely to be serious ascertainment problems in the cohort selection

How does one resolve such ascertainment problems?

Are you saying that current case criteria are unreliable, or that the rigor of their application in cohort selection is lacking? (or are you saying both?)

If it is the former, how does one perform meaningful research? Would this not also confound mechanistic studies?

Are you suggesting instead a detailed case study approach to generate testable hypotheses? (eg: what seemed to happen with Paul Hwang's WASF3 work.) Or no?

From my perspective the most important people to read these threads are scientists doing the work - either the paper in question or similar work. The issue of diagnosis and its relation to gathering physiologic data is absolutely central.

With this in mind please interpret my foregoing questions through the following lens: I am a young (29 year old) scientist whose strongest calling is to make his ME/CFS research rigorous and meaningful. I am earnestly here to listen to your experience.
 
Last edited:
How does one resolve such ascertainment problems?

Are you saying that current case criteria are unreliable, or that the rigor of their application in cohort selection is lacking? (or are you saying both?)

Thanks for the questions. It is complicated but I will try to give some answers.

I guess that you cannot so much 'resolve' the ascertainment problems as work knowing what they are. Ascertainment of diagnosis of RA was always uncertain, although not as bad as ME/CFS, but it didn't get in the way of working out how to treat people.

If you compare a cohort of barn-door typical cases with healthy controls it doesn't matter very much what your case criteria are. And no one set can be claimed to be better than another. I don't think I ever knew the case criteria for RA, any more than I know them for a dog.

The problem comes if 'grey area' cases affect the data. And grey area cases may be commoner than barn door, both for RA and ME/CFS. Are you fatigued more than half the time? The answer to that can depend on all sorts of things. Even something like joint swelling judged by a physician will vary greatly in grey area cases. My juniors were often sceptical about subtle joint swelling that I could recognise because I had seen it just like that for years. They might also score swelling that I knew was not due to the joint.

This particular study relies on reports that are likely to pick up grey area cases of ME/CFS. The study was well worth doing because it made use of a potentially valuable large data resource, but I think it illustrates just how complex confounding factors might be. To give an idea of the problem, let's take RA, fibromyalgia and ME/CFS. In RA for every 1000 people there will be maybe 5 barn door cases and maybe 5 cases who really do have the pathology of RA (which has been reasonably defined for 100 years in terms of chronic lymphocytic infiltration of synovia) but for whom being sure of that is very hard. We even have defined categories of 'probable' and 'possible' RA and we don't include these in studies. There will also be maybe 10-20 cases of people who have been told they have RA at some time but don't. And that is the easy situation. For fibromyalgia studies have suggested that physicians vary up to a hundred fold in their rate of making the diagnosis. That is the worst case. I see ME/CFS as likely in between.

You might say that none of this is too disastrous because if grey area cases are misdiagnosed it will just blunt the statistical significance figures a bit. An important correlation will still show through. But if you factor in the way probands are recruited things get trickier. If you recruit from clinics there may be all sorts of spurious differences between people who come to clinics and people who do not. If you go to a population asking for volunteers you might do better but you still have people acquiring diagnoses by going to a doctor. And if they go to a doctor for other reasons too that may affect how likely they are to get an ME/CFS diagnosis.

On the forum we constantly moan about doctors diagnosing chronic fatigue too widely. Most doctors aren't bothered whether they are diagnosing chronic fatigue NOS or PEM-carrying ME/CFS. The names they use will probably reflect their choice of shirt colours and voting behaviour.

For the great majority of diseases none of this really matters in the end because the clues to disease mechanism and treatment stick out like a sore thumb when you find them. When someone finally bothered to look at smoking and risk of RA the association was barn door. But if you are trawling for statistically significant differences in a large population within the normal range I think the chances of getting spurious associations are 100%.

Genetic studies are much easier because correlation to gene alleles pretty much have to be causal. The causation can be indirect and obscure but it is still causal. I had concerns about using internet calls for recruiting for DecodeME but I now see that those concerns are probably not too great. Gene alleles for metabolic pathways or immune functions are pretty unlikely to be commoner in people who volunteer for research. But spurious associations between volunteering for a Biobank and obtaining an ME or CFS diagnosis are much more plausible, both being behavioural traits relating to one's health status.
 
To be a bit clearer, what I think we need are tests that indicate, however indirectly, however much just a hint of where to look, something about what is going wrong in an illness like RA or ME/CFS. Those test may seem like 'diagnostic tests' and maybe you could call them that at times but if you conceive of their purpose as diagnostic tests, as a scientist, you are likely to start chasing things that don't exist.

It would probably be even more reasonable often to call these tests 'biomarkers' but the discussion makes clear that nobody is quite sure what that term is supposed to mean and I have never found it useful. By and large I think it is almost as misleading as diagnostic test.
 
I don't have a background in advanced statistics, so I say the following with much hesitation and willingness to be corrected, but, there are statistical techniques useful for identifying groups and subgroups based on clusters of attributes, things like PCA, random forest.

TSNE is a good mechanism for visualizing multidimensional data: t-distributed stochastic neighbor embedding - Wikipedia
 
Moderator note

We have read the discussion of whether posts should be moved and taken note of points made. We have decided to leave most of the discussion even though it is a diversion from the research topic and breaches Rule 9: Specific moderation decisions should not be discussed publicly. A few posts restarting the discussion and repeating points already made have been deleted.

Please in future, if you think moderators have made a mistake in moving your post on this or another thread, use the contact moderators button or contact a moderator privately to discuss it.
 
New abstract (paragraphs added by me for improved readability)
Abstract.
Myalgic Encephalomyelitis (ME; sometimes referred to as chronic fatigue syndrome [CFS]) is a relatively common and female-biased disease of unknown pathogenesis that pro- foundly decreases patients’ health-related quality-of-life. ME/CFS diagnosis is hindered by the absence of robustly-defined and specific biomarkers that are easily measured from available sources such as blood, and unaffected by ME/CFS patients’ low level of physical activity. Pre- vious studies of blood biomarkers have not yielded replicated results, perhaps due to low study sample sizes (n < 100).

Here, we use UK Biobank (UKB) data for up to 1,455 ME/CFS cases and 131,303 population controls to discover hundreds of molecular and cellular blood traits that differ significantly between cases and controls. Importantly, 116 of these traits are replicated, as they are significant for both female and male cohorts. Our analysis used semi-parametric efficient estimators, an initial Super Learner fit followed by a one-step correction, three types of mediators, and natural direct and indirect estimands, to decompose the average effect of ME/CFS status on molecular and cellular traits.

Strikingly, these trait differences cannot be explained by ME/CFS cases’ restricted activity. Of 3,237 traits considered, ME/CFS status had a significant effect on only one, via the “Duration of walk” (UKB field 874) mediator. By contrast, ME/CFS status had a significant direct effect on 290 traits (9%). As expected, these effects became more significant with increased stringency of case and control definition. Signifi- cant female and male traits were indicative of chronic inflammation, insulin resistance and liver disease. Individually, significant effects on blood traits, however, were not sufficient to cleanly distinguish cases from controls. Nevertheless, their large number, lack of sex-bias, and strong significance, despite the ‘healthy volunteer’ selection bias of UKB participants, keep alive the future ambition of a blood-based biomarker panel for accurate ME/CFS diagnosis.


The only change from version 1 in the abstract is the use of ME/CFS as the disease name.

There's just this bit that I think still needs more work
Strikingly, these trait differences cannot be explained by ME/CFS cases’ restricted activity. Of 3,237 traits considered, ME/CFS status had a significant effect on only one, via the “Duration of walk” (UKB field 874) mediator. By contrast, ME/CFS status had a significant direct effect on 290 traits (9%).
They say that ME/CFS status had a significant effect on only one trait, and then they say that, by contrast, ME/CFS had a significant direct effect on 290 traits.

Perhaps they meant to say something like that 'ME/CFS status had a significant indirect effect via a mediator related to inactivity on only one trait'? I could be wrong about what they are meaning to say, as I haven't read the new paper, but I don't think the two statements as they are written can be true.

I made a comment about this before - am I misunderstanding something?
 
Last edited:
Perhaps they meant to say something like that 'ME/CFS status had a significant indirect effect via a mediator related to inactivity on only one trait'? I could be wrong about what they are meaning to say, as I haven't read the new paper, but I don't think the two statements as they are written can be true.

Yes, it does come across oddly.

I read it as trying to emphasise that ME/CFS status had a significant effect on only one activity-related trait (the Duration of Walk one), whereas there were direct effects on 290 non-activity related traits.

It doesn't say that at all, so I may be wrong, but I can see why they might want to make that kind of point.
 
I have been looking at the new version. The introduction is much crisper. I find the mathematical modelling hard to relate to the biology I am familiar with but I think I have a reasonable idea what the overall picture is.

Various people have picked out particular proteins they think are interesting. It is good to see that these are independent of inactivity and Chris assures me that the BMI levels are not signficantly different. For me the real prize would be if one or more of these proteins matched a signal on DecodeME in some way.
 
Back
Top Bottom