Use of EEfRT in the NIH study: Deep phenotyping of PI-ME/CFS, 2024, Walitt et al

I believe a more meaningful analysis would have to look at what role a percentage of previously successfully completed hard trials plays in choosing a hard trial in the next round and so forth
I've been trying to calculate this by adding a new column: 'previous_trials_completion_average'. It initiates at value 0.5 (equal chance of completion or not) and then for row i takes the average of the completion rate of the previous (i-1 rows). It calculates these means from the 'Successful_Completion_Yes_is_1' column which has the completion rate of all trials rather than just hard ones. When it hits a new ID participant, it starts over.

Would this be a (rather poor perhaps) implementation of what you mean?
upload_2024-3-13_14-54-13.png
 
Last edited:
I've been trying to calculate this by adding a new column: 'previous_trials_completion_average'. It initiates at value 0.5 (equal chance of completion or not) and then takes the average of the completion rate of the previous (i-1 rows). It calculates these means from the 'Successful_Completion_Yes_is_1' column which has the completion rate of all trials rather than just hard ones. When it hits a new ID participant, it starts over.

Would this be a (rather poor perhaps) implementation of what you mean?
View attachment 21451

Awesome! Yes this would be a implementation of what I mean.

I think a more meaningful interpretation would use some sort of discounting on previous trials (it probably matters less whether you completed or didn't complete a hard trial 20 rounds ago, but one will remember a previous success/failure/effort more). The easiest thing one could do is say something like the previous hard trial counts with 50% and all other previous hard trials make up the remaining 50% (e.g. if your past history for hard trials, starting with your first hard trial and nding with your last hard trial, is failed-completed-failed-completed-failed-completed your rate completion rate would be 2/5*1/2+1/2=0.7 rather than 3/6=0.5), the choice of 50% and how many trials one counts towards being "in the previous hard trial" is rather arbitrary.

Alternatively one could try to do something more elaborate, which should also be easier to implement, is using something like using a discounting factor
ql_0efc9a97d87d1c57296ead43a3df61b9_l3.png
and then have the rate completion on hard trials (which I call r for now) be something like
ql_4fc3d8a3400dc9f9842c65e4e5c4e0f4_l3.png

(i.e. in the above example
ql_755775cf3018e8f8e7a16ddcb22cf53d_l3.png
which for
ql_6c1fb125c11efc881723a93bd190c542_l3.png
yields the thing you implemented. The smaller you choose , the larger you discount moves that lie further in the past (and for gamma=0 and the usual convention of 0^0=1 you get that only the previous hard trial contributes to your next decision). Of course for
ql_616b0e5b0a903f7c94865dce0265bf23_l3.png
one then gets rates smaller than 1 which doesn't reflect the true completion rate for anyone (think of someone that is able to complete all trials but has a rate r smaller than 1), however since we're more interested in comparing people/groups than reflecting true values, I don't currently see why this would matter.

One step higher up would be to try to even use a Markovian model, but I think that might be far too complicated for now and I think starting off with something a bit easier might already tell us if it could be worth of any effort at all.

A priori I would think one should focus on only hard trials and whether those were completed or not, to not dilute rates of those people that do easy because they can't do hard (for example pwME do hard marginally less often so one would artificially dilute their values if easy tasks are included) and since everyone is able to do easy trials there's nothing interesting there.
 
Last edited:
Not sure if this goes here or in the other thread.

The paper noted:
Compared to HVs, PI-ME/CFS participants failed to maintain a moderate grip force even though there was no difference in maximum grip strength or arm muscle mass. This difference in performance correlated with decreased activity of the right temporal-parietal junction, a part of the brain that is focused on determining mismatch between willed action and resultant movement. Mismatch relates to the degree of agency, i.e., the sense of control of the movement. Greater activation in the HVs suggests that they are attending in detail to their slight failures, while the PI-ME/ CFS participants are accomplishing what they are intending. [bolding added]

But this thread describes the high rate of failure of pwME on the EEfRT tests. That suggests pwME are not accomplishing what they are intending. Has that discrepancy been discussed here and/or do the authors explain anywhere how both can be true?

Apologies if this was already discussed.
 
Not sure if this goes here or in the other thread.

The paper noted:


But this thread describes the high rate of failure of pwME on the EEfRT tests. That suggests pwME are not accomplishing what they are intending. Has that discrepancy been discussed here and/or do the authors explain anywhere how both can be true?

Apologies if this was already discussed.
I agree that this is not expressed clearly in the paper. I think that when they say
the PI-ME/CFS participants are accomplishing what they are intending
they are referring to what pwME intend to do when effort preference kicks in, rather than what they intend to do at the beginning of the task.

If you view effort preference as involuntary (as Nath has said in an interview, contradicting the paper) or unconscious (as they write in the paper is an option, should this be subconscious?), then they're saying the pwME initially tries to do what they're asked, their brain stops them (likely coinciding with pain or weakness, though symptoms aren't reported in this paper) and they aim lower.

If you view effort preference as voluntary, ie pacing, then they're saying the pwME either aims lower from the beginning, or initially tries to do what they're asked, encounters a barrier (symptoms, they just can't do it), and then aims lower.
 
That's a great starting point, thanks for getting the discussion going again! I've had to take a quick break, but am still very much looking into the data and analysing it and deciding what response we can write, @andrewkq is still very much involved in this.
Apologies for being absent, my health has taken a turn for the worse so I've had to cut back on how much time I'm spending on this. I've been slowly working on a draft of a letter in the mean time but it probably won't be ready for another week or two. @ME/CFS Skeptic your letter is great and will definitely help speed things up for me. I'm still willing to coordinate getting a final group of co-authors together with @EndME to finalize and submit the letter but it's taking long than I'd initially hoped. If people are eager to get this out the door and want to move forward without my draft I'd understand, just let me know.

@ME/CFS Skeptic I'd feel uncomfortable submitting a letter that has sections written by you without at least acknowledging your contributions, would it be okay with you if we included an acknowledgement section that states something like "We wish to acknowledge intellectual contributions made to this letter by members of the online community Science4ME who wish to remain anonymous at this time."
 
If you view effort preference as voluntary, ie pacing, then they're saying the pwME either aims lower from the beginning, or initially tries to do what they're asked, encounters a barrier (symptoms, they just can't do it), and then aims lower.
The paper would've been less problematic if it said the subjects failed to exert because of symptoms. It instead surmised that pacing, and therefore the fear of symptoms, is responsible.

Deep phenotyping of PI-ME/CFS said:
This pattern suggests the PI-ME/CFS participants were pacing to limit exertion and associated feelings of discomfort
So, anticipation of the feelings of discomfort associated with continued exertion, not the discomfort itself, was holding them back, according to the paper. [edited for clarification]
 
The paper would've been less problematic if it said the subjects failed to exert because of symptoms. It instead surmised that pacing, and therefore the fear of symptoms, is responsible.
Couldn't agree more.

And does it get more dismissive than "discomfort"?
So, anticipation of the feelings of discomfort associated with continued exertion, not the discomfort itself, was holding them back, according to the paper. [edited for clarification]
That discomfort could theoretically be immediate or delayed, but given that we get told nothing about the symptoms patients were experiencing during the tasks, and they later say something similar after referring to the PEM after CPET paper by Stussman et al, I reckon you're right.
Interviews with PI-ME/CFS participants revealed that sustained effort led to post-exertional malaise34. Conscious and unconscious behavioral alterations to pace and avoid discomfort may underlie the differential performance observed35.
 
That's a very interesting reply @Robert 1973, because it allows to put the claims of the publication next to yet another "alternative" explanation, and because of its context.

You noted that the study had a low number of participants and that it appears to have neglected studying post-exertional malaise (PEM). PEM was a criterion for participation in the study. The Q&A link mentioned above provides information on PEM-related findings. The study was designed to examine a small group of well-defined patients that could be studied in depth. Patients were at NIH for a few weeks which limited the number of people that could be seen by researchers. When COVID-19 complicated the ability to bring in new patients, researchers determined that a sufficient number of patients had been studied and moved on to analyzing the data.

Is that really an answer re. the neglect of PEM in the study? 'It was a criterion, now go look at the Q&A where we tell you that we left it out of our main paper discussing the "deep phenotyping" of ME/CFS.'

In 2016 patient advocates were promised by Walitt that they were "going to try to understand exactly what the biology of post-exertional malaise is." (NIH ME/CFS Advocacy Call | National Institutes of Health (NIH))

Instead the "deep phenotyping" paper spends a lot of energy on "understanding the biology" of Walitt's view of ME/CFS, and PEM is absent.

(At the time Walitt said this he was convinced that ME/CFS was a somatoform disorder characterised by a dysfunction of subjective perception, which he self-termed "interoceptive disorder", although in 2016 he was also trial ballooning "central sensitivity syndrome". He is now the director of his second unit for "interoceptive disorders" while he just launched a paper that - based on feeble substantiation but in line with his views- launches the term "effort preference" as an avoidant choice ME/CFS patients make, while his bio says that his deep phenotyping research focusses on "persons with disorders characterized by aversive symptoms that develop after exposures, such as infection." By which he means ME/CFS, Gulf War Illness and Long Covid.)
 
I'm still willing to coordinate getting a final group of co-authors together with @EndME to finalize and submit the letter but it's taking long than I'd initially hoped.
That's great to hear. I don't think there is a strict time limit here so please take your time.

@ME/CFS Skeptic I'd feel uncomfortable submitting a letter that has sections written by you without at least acknowledging your contributions, would it be okay with you if we included an acknowledgement section that states something like "We wish to acknowledge intellectual contributions made to this letter by members of the online community Science4ME who wish to remain anonymous at this time."
An acknowledgement like the one you suggest would be great also for other S4ME members who contributed but may not be able to sign. Most of the arguments were already made by others forum members, I just tried to put them together.

EndMe made some good points about 'task completion' not being a good predictor of 'hard task choices' so that's an area that might need some further work.
 
I fully expect these to be dismissed, but I felt I had to at least try. I completed the following today:

- I filed a suspected ‘Research Misconduct’ with the ‘Agency Intramural Research Integrity Officer (AIRIO)’ that oversees NIH Intramural research (https://t.co/hRdxdhCtfx). They also have three individuals listed with AIRIO to share intramural-related study concerns at NIH with.

- I wrote today to my Regional Manager in the Office for Civil Rights, U.S. Department of Health and Human Services (HHS) to surface my concerns on the study, and what I feel warrants ‘Research Misconduct’ per HHS guidelines.

- I submitted a complaint to the HHS Office of Research Integrity & submitted an ‘Office for Human Research Protections (OHRP) Research Complaint Form’
 
Last edited:
I don't know if this thread has just slowed down or if others are working on a letter behind the scenes. I think a lot of valid points were raised that prove that Walitt et al.'s interpretation of the EEfRT data is seriously flawed.

I have written a first draft for a letter to be sent to the authors, asking for a correction. Pretty much all arguments have already been made by others on this thread so I take no credit for them - just want to help to string them toghether.

EDIT: I unfortunately will not be able to sign or co-sign a letter as (due to personal circumstances) I prefer to remain anonymous at the moment.
I thus have no concrete plan to send the letter. I hope that others who have more statistical knowledge and academic qualifications than me will take this further.

Incorrect interpretation of ME/CFS patients’ effort preference
In a comprehensive study of patients with post-infectious myalgic encephalomyelitis/chronic fatigue syndrome (PI-ME/CFS), Walitt et al. claim that “an alteration of effort preference, rather than physical or central fatigue” is a defining feature of the illness.1 Effort preference was defined as “how much effort a person subjectively wants to exert” and measured using the Effort-Expenditure for Rewards Task (EEfRT). The interpretation of EEfRT results by Walitt and colleagues, however, is highly problematic as it fails to consider that tasks required more effort from patients than from healthy controls.

The EEfRT is a multi-trial experiment that has been used to measure reward motivation in patients with anhedonia, schizophrenia, and other mental disorders. In Walitt et al.’s study of 15 PI-ME/CFS and 17 healthy the EEfRT lasted 15 minutes. In each of the successive trials, participants were instructed to choose between an easy and hard task. Both required several button presses within a limited time frame for successful completion: 30 button presses in 7 seconds for the easy task and 98 button presses in 21 seconds for the hard task. If participants completed the hard task successfully, they had a chance of receiving a higher reward than for completing the easy task. The reward values and probability of receiving it varied across trials and this information was provided to participants before they made their choice. Effort preference was estimated by the proportion of hard task choices. Walitt et al. report that given equal levels and probabilities of reward, healthy controls chose more hard tasks than PI-ME/CFS patients (Odds Ratio, OR = 1.65 [1.03, 2.65], p = 0.04).

The EEfRT requires however that participants can complete the tasks successfully and that the effort needed is equivalent in patients and controls. Treadway et al., the research team that developed the EEfRT and whose protocol was implemented by Walitt et al. with minor modifications, cautioned: “An important requirement for the EEfRT is that it measure individual differences in motivation for rewards, rather than individual differences in ability or fatigue. The task was specifically designed to require a meaningful difference in effort between hard and easy-task choices while still being simple enough to ensure that all subjects were capable of completing either task, and that subjects would not reach a point of exhaustion.” 2

Several techniques have been introduced in the EEfRT literature to ensure that the test measures reward motivation rather than differences in effort or ability. These include individually calibrating the required number of button presses3, controlling for participants’ motoric ability4, and evaluating whether participants had an adequate completion rate.2

Although Walitt et al. implemented four test trials before the EEfRT started, they did not implement measures to ensure that the effort required to complete tasks was similar in patients and controls. Consequently, PI-ME/CFS patients could only complete 67% of the hard tasks successfully compared to 98% in controls. This was a much larger difference (OR = 27.23 [6.33, 117.14], p < 0.0001) than the group difference in choosing hard over easy tasks. This problem was already evident in the four test trials during which PI-ME/CFS patients could only complete 42% of the hard tasks compared to 82% for controls. When we added successful completion rate to the statistical model, the difference in hard task choices was no longer significant (OR = 1.19 [0.79, 1.81]).

Walitt et al. did note that during the EEfRT there was no difference in the decline in button-press rate over time for either group for hard tasks. This might indicate that task-related fatigue did not influence the results. There was however a decline in button-press-rate in PI-ME/CFS patients for easy tasks that was not seen in controls. In addition, these measurements only reflect fatigue induced by the 15-minute button-pressing test, not the symptoms and debility participants already had at the start of the EEfRT.

PI-ME/CFS patients were severely disabled with a mean SF-36 physical function score of 31.8 compared to a score of 97.9 for the control group. Reduced psychomotor function5 and impairments in fingertip dexterity and gross movement of the hand, fingers, and arm6 have been reported in ME/CFS patients. Walitt et al. also found that patients in their cohort were unable to maintain force during a hand grip task.1 It is therefore likely that the EEfRT required more effort from PI-ME/CFS patients than from controls. Walitt et al. also reported a strong correlation (R=0.57) between PI-ME/CFS patients’ hard task choices and the ability to maintain force during the grip test. This supports our conclusion that patients’ EEfRT choices reflected motor ability as well as effort preference. No such correlation was found in healthy controls (R=-0.04).

The fact that the group difference in hard task choices was relatively small compared to the larger difference in completion rate, suggests that patients kept trying to succeed on hard tasks, despite past failures. Figure 1 shows the recorded button presses and completion rate per trial for all 31 participants. Several PI-ME/CFS patients had repeated failed attempts to complete the hard tasks (seen as repeated high red bars on the graph), a pattern that was not seen in controls. These findings are contrary to Walitt et al.’s hypothesis that ME/CFS patients prefer to exert themselves less than healthy controls.

View attachment 21419
Figure 1. Button presses per participant for each of the trials they managed to complete during the 15-minute EEfRT. Successful completions are pictured in green while failed attempts are indicated in red.

In conclusion, the EEfRT data indicates that the button-pressing tasks were more difficult for PI-ME/CFS patients than for controls, not that the former have abnormal effort preferences. In the past, ME/CFS patients have repeatedly been ‘victim blamed’ when behavioral consequences of their illness were incorrectly proposed as the cause of their symptoms.7 Considering the negative impact such misattributions may have, we kindly ask Walitt et al to correct their erroneous account of the EEfRT results.

References
1. Walitt, B. et al. Deep phenotyping of post-infectious myalgic encephalomyelitis/chronic fatigue syndrome. Nat Commun 15, 907 (2024).

2. Treadway, M. T., Buckholtz, J. W., Schwartzman, A. N., Lambert, W. E. & Zald, D. H. Worth the ‘EEfRT’? The Effort Expenditure for Rewards Task as an Objective Measure of Motivation and Anhedonia. PLOS ONE 4, e6598 (2009).

3. Fervaha, G. et al. Incentive motivation deficits in schizophrenia reflect effort computation impairments during cost-benefit decision-making. J Psychiatr Res 47, 1590–1596 (2013).

4. Ohmann, H. A., Kuper, N. & Wacker, J. A low dosage of the dopamine D2-receptor antagonist sulpiride affects effort allocation for reward regardless of trait extraversion. Personal Neurosci 3, e7 (2020).

5. Schrijvers, D. et al. Psychomotor functioning in chronic fatigue syndrome and major depressive disorder: a comparative study. J Affect Disord 115, 46–53 (2009).

6. Sanal-Hayes, N. E. M., Hayes, L. D., Mclaughlin, M., Berry, E. C. J. & Sculthorpe, N. F. People with Long Covid and ME/CFS Exhibit Similarly Impaired Dexterity and Bimanual Coordination: A Case-Case-Control Study. Am J Med S0002-9343(24)00091–3 (2024) doi:10.1016/j.amjmed.2024.02.003.

7. Thoma, M. et al. Why the Psychosomatic View on Myalgic Encephalomyelitis/Chronic Fatigue Syndrome Is Inconsistent with Current Evidence and Harmful to Patients. Medicina 60, 83 (2024).

Wow. I’ve just found this and am digesting it.

Thanks so much for sharing.

FWIW - I’m severe, but have masters in statistics and spent 21 years working with scientists and business teams. My background isn’t medical or research. And I’m rusty on stat details. Happy to collaborate or support as I’m able.

@EndME
 
I have written a first draft for a letter to be sent to the authors, asking for a correction. Pretty much all arguments have already been made by others on this thread so I take no credit for them - just want to help to string them toghether.

wow. wowowowowow. Still digesting but so far it seems like this amounts to :

Walitt et al. recklessly* failed to calibrate equipment for EEfRT test, causing misleading results. This failure is tantamount to falsification of results via manipulation of process and equipment.

Y'all's work is causing eustress.

[*edit: At minimum negligence, but with Walitt’s history and other factors- may be able to show recklessness.]


Edited to add:

Has anyone looked into showing recklessness?
Or are you looking to ask for a correction/retraction versus misconduct?

Here's a quote and link, pointing out reckless is somewhere between intentional/knowing and negligent ("honest mistake"):

Unlike knowing and intentional conduct, however, reckless conduct cannot be easily defined with reference to an everyday standard, nor is “recklessly” defined under Part 93. Participants in research misconduct proceedings typically understand recklessness to fall somewhere between, on the one hand, intentional or knowing conduct (both of which constitute research misconduct) and, on the other hand, conduct that is simply negligent. Negligence, like honest error, is regarded as a state of mind devoid of any intention to deceive and is widely understood to differ from behavior meeting the definition of research misconduct (Committee on Science, Engineering, and Public Policy Citation2009). However, a broad range of conduct can fit between “intentional” acts to fabricate, falsify, or plagiarize, and “negligent” conduct that results in fabrication, falsification, or plagiarism. Institutions and their faculty research misconduct committees often struggle to articulate and apply a satisfactory and consistent recklessness standard to allegations of research misconduct.

https://www.tandfonline.com/doi/full/10.1080/08989621.2023.2256650#:~:text=Part 93 states that a,falsification, fabrication, or plagiarism constitutes
 
Last edited:
I fully expect these to be dismissed, but I felt I had to at least try. I completed the following today:

- I filed a suspected ‘Research Misconduct’ with the ‘Agency Intramural Research Integrity Officer (AIRIO)’ that oversees NIH Intramural research (https://t.co/hRdxdhCtfx). They also have three individuals listed with AIRIO to share intramural-related study concerns at NIH with.

- I wrote today to my Regional Manager in the Office for Civil Rights, U.S. Department of Health and Human Services (HHS) to surface my concerns on the study, and what I feel warrants ‘Research Misconduct’ per HHS guidelines.

- I submitted a complaint to the HHS Office of Research Integrity & submitted an ‘Office for Human Research Protections (OHRP) Research Complaint Form’
Do you mind sharing your complaint?
 
"The EEfRT test was used to assess effort, task-related fatigue, and reward sensitivity."

I’m reviewing some older comments and this still seems relevant. Given standard process for eefrt is specifically for motivation not fatigue & cautions that fatigue can confound results - falsification by design. & Speaks to intent.

They do not provide any justification for removing this participant from the analysis other than saying

Falsifying by data omission.

Just noting these may be relevant & I don’t recall seeing them in write-up above.
 
Do you mind sharing your complaint?

At this time it may be best to keep private, as HHS Office of Integrity has already informed me "The Office of Research Integrity (ORI) handles specific allegations of research misconduct involving data fabrication, falsification and plagiarism on US public health service (PHS) funded research according to 42 CFR Part 93 at §93.103. ORI is not able to assist you with your concerns on the overall study design, test used and conclusions that are subjected to differences in opinion."

AIRIO at NIH is doing the internal audit (I don't have high hopes) at this time
 
Wow. I’ve just found this and am digesting it.

Thanks so much for sharing.

FWIW - I’m severe, but have masters in statistics and spent 21 years working with scientists and business teams. My background isn’t medical or research. And I’m rusty on stat details. Happy to collaborate or support as I’m able.

@EndME

@ME/CFS Skeptic Apologies if this is stated elsewhere, but do you mind if I ask if there are plans to send to the authors or to Nature Comms (this great write-up that you have formulated)? Fighting through difficult cognitive dysfunction today and trying to decipher
 
Last edited:
Back
Top Bottom