There are a couple of groups who have sort of gone down this route. Programms such as LIINC have been setup to track people right from their Covid infection until whatever outcome they arrive at and the recent ME/CFS biobank in Germany is also setup to do something similar.
However, some of those studies have shown that even people that meet the CCC definition at 6 months or at least look very similar to ME/CFS often turn out to look rather different at a later time, as for example seen in Long-term symptom severity and clinical biomarkers in post-COVID-19/chronic fatigue syndrome:results from a prospective observational cohort. This German group surrounding Scheibenbogen is also tracking ME/CFS post EBV, but those studies are probably extremely underpowered to reveal much from an epidemiological stand-point as seen in One-Year Follow-up of Young People with ME/CFS Following Infectious Mononucleosis by Epstein-Barr Virus.
If the studies are anyways severly underpowered from an epidemiological stand-point, for example to figure out how large of a risk factor EBV infection is for ME/CFS, I do agree that it might just be sensible too just look at 1000 ME/CFS patients with a longer disease history and do a deep phenotyping study of those patients.
I do wonder though why there is so much focus on trying to look at signatures that seperate somewhere around 90% of ME/CFS patients from healthy controls and then often do so via something like a random forest classifier that just ends up looking rather random, finding marginal differences in a larger set of people without telling us anything fruitful.
I'm far more interested in a result that shows stark differences between two groups and that tells us something meaningful about pathology even if that result only applies to say 50% of ME/CFS patients. But then I wonder how large the sample set of patients really has to be. Depending on what is done it might sometimes be more important to have someone that look at a problem under a novel angle. There are certainly things wrong with the intramural study by the NIH, however what it did show us that a study expecting somewhere around 90% accuracy from a signature is destined to fail from the get go if it is really supposed to distinguish ME/CFS from HC.
We have a paper submitted to a Nature journal that you will probably be interested in. We're hoping it gets published soon. We looked at common co-morbid conditions in an ME cohort and then compared ME to cohorts of that co-morbidity to see if we could find differences that had more to do with an ME signature than the co-morbidity. We also looked for a diagnostic signature that would separate ME from these comorbid diseases. We got one that was ~75% accurate. So we think there is potential here.