Preprint Dissecting the genetic complexity of myalgic encephalomyelitis/chronic fatigue syndrome via deep learning-powered genome analysis, 2025, Zhang+

Discussion in 'ME/CFS research' started by SNT Gatchaman, Apr 17, 2025.

  1. jnmaciuch

    jnmaciuch Senior Member (Voting Rights)

    Messages:
    670
    Location:
    USA
    I made a quick note about this at some point much earlier in the thread: it looks like they redid the clustering on the scRNA-seq data set, and I was already quite skeptical of the cell type labeling in the original paper.

    Neither this paper nor the original scRNA-seq paper showed feature or violin plots confirming that their clusters are what they say they are. In the text, this paper mentioned CCL5 and GZMB as markers, so I can be reasonably confident that this cluster is cytotoxic T cells based on those findings alone.

    However, I did not see any convincing evidence that they were in fact CD4s. The original paper did not look like it got good separation between their CD4s and CD8s, so it’s possible this cluster could be partly, or mostly, cytotoxic CD8s.

    I’m hoping they might provide cluster markers as additional supplementary material if a reviewer brings up my same points. Though it wasn’t brought up as a critique for the Hanson group paper so I’m pessimistic.
     
    janice, hotblack, MeSci and 5 others like this.
  2. wigglethemouse

    wigglethemouse Senior Member (Voting Rights)

    Messages:
    1,175
    Here is the plot from supplementary figure 6. Is that a "feature plot" or "cluster markers"? The cells are labelled "8. CTL CD4"
    https://www.medrxiv.org/content/medrxiv/early/2025/04/16/2025.04.15.25325899/DC6/embed/media-6.pdf

    This is the first paper that came up when searching CD4 CTL
    CD4 CTL, a Cytotoxic Subset of CD4+ T Cells, Their Differentiation and Function
     
  3. jnmaciuch

    jnmaciuch Senior Member (Voting Rights)

    Messages:
    670
    Location:
    USA
    upload_2025-5-3_18-41-4.png
    Unfortunately that’s just showing their annotations, not the evidence for it. What you’re looking for is something like the attached which shows the expression levels of CD4 and CD8A/B.
    Or a violin plot showing expression levels separated by their cluster labels. I didn’t see any such evidence in any of their supplementals. Cytotoxic CD4s do exist; they just haven’t shown that the cells they claim are cytotoxic CD4s are exclusively CD4s.
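As a rough illustration of the kind of evidence being asked for, here is a minimal sketch that tabulates, per cluster label, the fraction of cells with nonzero counts for CD4, CD8A and CD8B. The data structure, the function name `marker_summary` and the toy cells are all hypothetical and made up for illustration; nothing here comes from the paper's data. A real check would use the authors' count matrix and a plotting tool rather than this stdlib-only table, and zero counts can reflect dropout rather than true absence of expression.

```python
from collections import defaultdict

def marker_summary(cells, markers=("CD4", "CD8A", "CD8B")):
    """Per cluster, return the fraction of cells with nonzero counts for each marker."""
    totals = defaultdict(int)
    positives = defaultdict(lambda: defaultdict(int))
    for cell in cells:
        cluster = cell["cluster"]
        totals[cluster] += 1
        for gene in markers:
            if cell["counts"].get(gene, 0) > 0:
                positives[cluster][gene] += 1
    return {
        cluster: {gene: positives[cluster][gene] / n for gene in markers}
        for cluster, n in totals.items()
    }

# Toy, made-up cells (hypothetical; NOT taken from the paper or its supplements)
toy_cells = [
    {"cluster": "8. CTL CD4", "counts": {"CD4": 3, "GZMB": 5}},
    {"cluster": "8. CTL CD4", "counts": {"CD8A": 2, "GZMB": 4}},
    {"cluster": "8. CTL CD4", "counts": {"CD8A": 1, "CD8B": 1, "CCL5": 6}},
    {"cluster": "8. CTL CD4", "counts": {"CD4": 1, "CCL5": 2}},
]
summary = marker_summary(toy_cells)
```

In this toy case half of the cells labelled "8. CTL CD4" carry CD8A counts, which is the sort of mixed picture that a feature or violin plot would make visible.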
     
    Last edited: May 6, 2025
    hotblack, MeSci, Deanne NZ and 3 others like this.
  4. Jonathan Edwards

    Jonathan Edwards Senior Member (Voting Rights)

    Messages:
    17,230
    Location:
    London, UK
    I think it may be worth remembering that, as far as I know, absolutely nothing informative has been gleaned from looking at T cell subsets in the blood in the known autoimmune diseases. There may be some shifts but nobody has shown them to mean anything relevant to the disease mechanism. Even in the diseases that really are likely to be T cell driven - psoriasis and Reiter's - nothing useful has been found as far as I know. Looking at lymphocyte function using PBMC is probably a waste of time in chronic disorders of immune regulation. AIDS shows itself in CD4 cytopenia but that is something rather different.

    I think it would have been better if the paper focused on the genetics pure and simple and did not try to interpret the findings mixed in with scRNA studies.
     
    janice, Ariel, Kitty and 4 others like this.
  5. wastwater

    wastwater Senior Member (Voting Rights)

    Messages:
    372
    NLGN2 links to a rare disease https://www.malacards.org/card/senior_loken_syndrome_7
    Autosomal recessive genetic disease characterized by progressive wasting of the filtering unit of the kidney, with or without medullary cystic renal disease, and progressive eye disease

    Thin basement membrane disease

    A few of those genes are a target of FOXO1a
     
    Last edited: May 9, 2025 at 6:22 AM
    Kitty and Deanne NZ like this.
  6. Hutan

    Hutan Moderator Staff Member

    Messages:
    32,414
    Location:
    Aotearoa New Zealand
    Forestglip was commenting about this sentence:
    Upthread, I also commented on that sentence:

    C2 means the group of people who got Covid. I don't know what Covid19:_C2_v2_England_controls means, but it doesn't sound like 'people who got Covid-19 really badly'. Figure 5A shows a strong correlation of their gene variation set with both 'Covid19:_C2_v2_England_controls' and 'Covid-19 controls' (both, presumably, very large, very heterogeneous data sets). I don't think a gene set being correlated with a group that 'got Covid' or 'Covid-19 controls' means very much at all. So, assuming that's correct, it is rather misleading to say that their ("ME/CFS") gene variation set correlated with Covid-19 susceptibility.


    It was at this point that I became worried about this paper. It would be great if the researchers could explain what was going on there. Please click on Figure 5A below and see for yourself.

    Screen Shot 2025-04-18 at 4.08.42 pm.png
     
    wigglethemouse, Kitty, Yann04 and 3 others like this.
  7. Kitty

    Kitty Senior Member (Voting Rights)

    Messages:
    8,081
    Location:
    UK
    I don't really know how to read these figures, but it looks as if the strongest associations are with really common things that aren't part of the ME/CFS syndrome—but are female-weighted like ME/CFS, e.g. gall bladder removal, depression, and likelihood of having had surgery?
     
  8. forestglip

    forestglip Senior Member (Voting Rights)

    Messages:
    2,241
    I think the second one should say 'Covid19:C2_v2'. But I agree, I can't think of what it could mean to be associated with both the controls and the cases for COVID (assuming that's what that code stands for)
     
    Deanne NZ, Hutan, hotblack and 2 others like this.
  9. Nightsong

    Nightsong Senior Member (Voting Rights)

    Messages:
    1,143
  10. forestglip

    forestglip Senior Member (Voting Rights)

    Messages:
    2,241
    Oh, so "C2_v2_england_controls" means "COVID-19 positive (controls include untested), only patients from centers in England". So they had a COVID illness but not everyone was confirmed with a lab test? I guess that means both groups were COVID positive and having both associations makes sense.
     
  11. ME/CFS Skeptic

    ME/CFS Skeptic Senior Member (Voting Rights)

    Messages:
    4,356
    Location:
    Belgium
    I think Leptin (LEP in figure 2B) is also showing up again, as in the study by Beentjes et al?
     
    Yann04, SNT Gatchaman, Kitty and 4 others like this.
  12. Hutan

    Hutan Moderator Staff Member

    Messages:
    32,414
    Location:
    Aotearoa New Zealand
    I'm still not sure what the labels mean. Clicking through to Genebass descriptions,

    C2_v2_england_controls
    has the description: COVID-19 positive (controls include untested), only patients from centers in England

    and mentions 11,767 cases and 337,472 controls.

    C2_v2
    is the same as the England one, but presumably includes some more people from elsewhere in the UK.
    It mentions 12,303 cases and 382,538 controls.

    So, I think they are essentially the same thing, I think there is a massive overlap in the people that are included.

    I still don't know if the data that was tested against the Zhang gene variants are from the cases, or the controls, or both. Genebass gives heritability scores for the two datasets - I don't know what they mean, if anything, but the scores are very low (zero for the England sample, 0.01 for the total sample).

    The thing is, even if the cases were used, people's likelihood of getting Covid-19 early on in the pandemic probably didn't have much to do with genetic susceptibility. It had to do with age and occupation and a lot of bad luck. Neither of these data sets appears to be a measure of the severity of the acute infection, so it surely is a bit dubious for Zhang et al. to claim that their gene variant set was correlated with Covid-19 susceptibility.

    Figure 5d seems to show relationships with various long covid and covid samples.
    Covid A2 is severe covid-19; Covid B2 is hospitalised covid-19; Covid C2 is just covid-19.

    Only Long covid19_1 is significant. But it didn't show as significant in Figure 5a, where Covid19 C2 was significant.
    Screen Shot 2025-04-18 at 4.08.30 pm.png

    There's this too, where they don't explain what the correlation is. Is their set of genetic variants associated with the genetic variants of people with longer sleep or shorter sleep?

    This is only a preprint, and perhaps (hopefully) they will tighten the report up before it is published.
     
  13. wigglethemouse

    wigglethemouse Senior Member (Voting Rights)

    Messages:
    1,175
    Good catch! Leptin seems to have come up in quite a few ME/CFS studies if I remember right. MEpedia (link) has some links to those.
     
    Kitty, Deanne NZ and Jacob Richter like this.
  14. forestglip

    forestglip Senior Member (Voting Rights)

    Messages:
    2,241
    I'm not sure I follow. The GWAS returned genes significant between people who have and have not had COVID. Since they're genes, they won't have anything to do with age. Maybe there are genes that affect what occupation they got, but that's still on the causal pathway between genes and getting COVID. I think it's fair to point out that it's one of the highest associations out of 4000 conditions, even if getting to the bottom of why they're associated will have to come later.

    Though I'm not sure this correlation is very interesting anyway, if it turns out there are post-COVID ME/CFS participants in this study, since it's likely the reason some of them have ME/CFS is because they got COVID which would be more likely if they have genes for COVID susceptibility. [Edit: But it'd at least be a sort of replication of the genes for COVID susceptibility in this case.]

    I don't know exactly what this methodology is, but I'm thinking it's something along the lines of seeing if the p-values for these genes in other GWAS were significant. So it would only let them know if the gene is correlated, but not which direction.
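To make that guess concrete, here is a minimal sketch of one common way to test overlap between two gene lists: a one-sided hypergeometric test. This is an assumption for illustration only and is not claimed to be the method used in the preprint; the function name and the example numbers are hypothetical. It shows why such a test is direction-agnostic: it only counts how many genes appear in both lists, with no information about the sign of any effect.

```python
from math import comb

def hypergeom_overlap_pvalue(n_background, n_set_a, n_set_b, n_overlap):
    """P(overlap >= n_overlap) if set B were drawn at random from the background.

    n_background: total number of genes considered (hypothetical value in the examples)
    n_set_a:      size of the first gene list (e.g. genes from one study)
    n_set_b:      size of the second gene list (e.g. significant genes for a trait)
    n_overlap:    number of genes found in both lists
    """
    max_overlap = min(n_set_a, n_set_b)
    total = comb(n_background, n_set_b)
    tail = sum(
        comb(n_set_a, k) * comb(n_background - n_set_a, n_set_b - k)
        for k in range(n_overlap, max_overlap + 1)
    )
    return tail / total

# Tiny example that can be checked by hand: 10 genes, two lists of 3, all 3 shared.
p_small = hypergeom_overlap_pvalue(10, 3, 3, 3)

# Larger made-up example: 20,000 genes, lists of 100 and 200, 5 shared.
p_large = hypergeom_overlap_pvalue(20000, 100, 200, 5)
```

Note that the inputs are only set sizes and an overlap count, so a small p-value here would say the lists share more genes than expected by chance, not whether the variants push risk in the same direction.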
     
    Last edited: May 6, 2025
    hotblack, Kitty and Deanne NZ like this.
  15. forestglip

    forestglip Senior Member (Voting Rights)

    Messages:
    2,241
    I think figures A and B are looking at rare variants and C and D are looking at common variants, so there should be some differences. And I think Fig. D is using genes from a specific other study (ref 38, Lammi 2023) that used data other than the UK Biobank, which was used in Figs. A and B.
     
    Last edited: May 6, 2025
    hotblack, Kitty and Deanne NZ like this.
  16. Hutan

    Hutan Moderator Staff Member

    Messages:
    32,414
    Location:
    Aotearoa New Zealand
    Ah, that makes sense that within the C2 cohorts (ie not in the Zhang study), gene variants differing between the people who had had Covid-19 and those who had not were assessed.

    My point is that whether a person fell into the covid-19 or control basket was mostly due to chance. The controls might include some people with asymptomatic infections who weren't tested and that might have some genetic influence. But, given the numbers in each basket, the overwhelming reason for someone to have a covid-19 infection at that point in the pandemic was bad luck. They were in the wrong place at the wrong time. Genetics probably doesn't have much influence. I suspect that is what those very low heritability scores are telling us.

    So, I think the relationships between the Zhang variants and the variants in these C2 cohorts don't really tell us anything.

    And, I don't think we know if the Zhang variants looked more like the cases or controls.
     
    Last edited: May 6, 2025
    hotblack, Kitty and Deanne NZ like this.
  17. forestglip

    forestglip Senior Member (Voting Rights)

    Messages:
    2,241
    You're saying the genes for COVID in the UK Biobank are essentially mostly random and meaningless? "Bad luck" is randomness, which is what significance tests are used to rule out. If they were split into COVID and control only or mainly based on random chance, then there should be minimal findings from the GWAS. Maybe a few irrelevant genes by chance would pass the threshold, but it's no different from any other disease in any other GWAS.

    Yes, some people in the control group were probably asymptomatic. That doesn't really change the findings. In that case the genes are associated with COVID which is bad enough to cause symptoms.

    Apart from that, I think it would be quite the coincidence for COVID (and depression) to show up in the top 10 out of over 4000 conditions when compared to ME/CFS, considering the connections that can be made between these and ME/CFS, if the genes for COVID in the UK Biobank were effectively random.
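A back-of-envelope version of that argument, as a sketch under strong assumptions: the traits of interest are picked before looking at the results, there are exactly 4000 traits, and under the "random genes" scenario every ranking of traits is equally likely. The function names are made up for illustration. Correlated traits, or deciding after the fact which traits count as connected to ME/CFS, would make the real probability higher than this.

```python
from fractions import Fraction

def prob_in_top_k(n_traits, k):
    """Chance that one pre-chosen trait lands in the top k of a uniformly random ranking."""
    return Fraction(k, n_traits)

def prob_both_in_top_k(n_traits, k):
    """Chance that two pre-chosen traits both land in the top k of a random ranking."""
    return Fraction(k, n_traits) * Fraction(k - 1, n_traits - 1)

p_one = prob_in_top_k(4000, 10)        # one pre-chosen trait in the top 10
p_both = prob_both_in_top_k(4000, 10)  # two pre-chosen traits both in the top 10
```

Under these assumptions a single pre-chosen trait reaches the top 10 with probability 1 in 400, and two pre-chosen traits both do so far less often.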
     
  18. forestglip

    forestglip Senior Member (Voting Rights)

    Messages:
    2,241
    I think one of the most interesting parts of the study will be seeing what genes matched between these participants with ME/CFS and Biobank data on depression and COVID.

    Considering the similarities between ME/CFS and depression, and how tricky the distinction can be for accurate diagnosis (so cohorts will include some misdiagnosed people), seeing depression come out as number one out of >4000 traits is essentially a replication and shows that there are probably actually genes found here that cause ME/CFS and/or depression. Or the same genes cause both conditions. Unless I'm missing something, getting 1st out of 4000 traits would be highly unlikely for such a similar condition if those genes were random.

    Similarly, if these are people with long COVID ME/CFS, finding a high association with COVID is essentially indicating that those COVID susceptibility genes probably actually do cause COVID.
     
    Deanne NZ, Kitty and Eleanor like this.
  19. Hutan

    Hutan Moderator Staff Member

    Messages:
    32,414
    Location:
    Aotearoa New Zealand
    Yes, I probably am saying that the genes for people who were tested and found positive for Covid early in the pandemic are probably fairly random in terms of susceptibility to Covid-19, because, in time, most people would have had symptomatic covid-19. I expect that there is a lot of noise there, relating to why the people were exposed to the virus early in the pandemic and why the people were tested.

    I don't know what the noise is, maybe something to do with occupation (health care?), conscientiousness, possibly even intellect, in that, early in the pandemic, the cases were choosing to get tested. Not everyone would have been choosing to be tested then. Maybe the people in the ME/CFS cohorts are higher than average in conscientiousness and intellect too, because they chose to participate in research? It's important that the distinguishing variants for the Covid-19 severe group didn't match up at all.

    I don't even know if the gene variants from the ME/CFS cohort matched up better with the cases or the controls from that Covid-19 dataset. How would we know that?
     
    hotblack and Deanne NZ like this.
  20. forestglip

    forestglip Senior Member (Voting Rights)

    Messages:
    2,241
    Ok, I see you mean that there may be less interesting reasons on the causal pathway between the genes and being a member of the COVID cohort, not so much the randomness of an invisible hand rolling dice for whether a person will or won't get COVID with no regard for what genes they have. Yeah, maybe.

    It's a good point about severe COVID not being one of the most significant traits if the genes code for immune susceptibility.

    Good question. I don't think that this study tried to answer that, and I don't think I know enough about genetic studies to know how to figure that out.
     
    hotblack, Hutan and Deanne NZ like this.
