Preprint Initial findings from the DecodeME genome-wide association study of myalgic encephalomyelitis/chronic fatigue syndrome, 2025, DecodeMe Collaboration

forestglip · Feb 8, 2026

I posted in another thread, but I think this is notable enough to mention here. In a large GWAS (122,341 European ancestry cases and 729,881 controls) of anxiety-related traits (GAD, panic disorder, social phobia, agoraphobia or specific phobias), MAGMA tissue enrichment was tested.

The four most significant tissues are the same as in DecodeME, and in the same order: Frontal Cortex, Cortex, Anterior Cingulate Cortex BA24, Nucleus Accumbens.

Maybe this indicates that similar brain structures are affected in both types of disorder.

forestglip said:
Supplementary Figure 89: MAGMA tissue expression analysis to test tissue enrichment of 53 specific tissue types for ANX genes (derived
from the main ANX GWAS meta-analsis (Ncases = 122,341, Ncontrols = 729,881)).

Click to expand...

From DecodeMe:

Kitty · Feb 8, 2026

forestglip said:
The four most significant tissues are the same as in DecodeME, and in the same order: Frontal Cortex, Cortex, Anterior Cingulate Cortex BA24, Nucleus Accumbens.

Maybe this indicates that similar brain structures are affected in both types of disorder.

I wonder if we could ask the question the other way round: what traits or disorders are associated with these brain regions, in this order of ranking?

(Possibly not, just thinking aloud.)

forestglip · Feb 8, 2026

Kitty said:
I wonder if we could ask the question the other way round: what traits or disorders are associated with these brain regions, in this order of ranking?

I think it's likely that it's not actually all these brain regions affected in these disorders. Similar genes are expressed in different parts of the brain, so if only one brain region is actually causal (say, frontal cortex) and thus is significant in MAGMA, then I think it's possible that other brain regions will be significant too just from having similar patterns of gene expression to the frontal cortex.

But the main thing of interest, I think, is that the pattern of GWAS genes in the two disorders is so similar that the same top four tissues were significant in both. Though, which, if any (maybe all), of these four tissues are actually relevant, is probably still an open question.

We could look for similar patterns in MAGMA analyses in other disorders, which I did yesterday, and none but anxiety, of the MAGMA plots I previously compiled, look to be quite so similar.

[Edit: Realizing now that the last part is probably all you were asking anyway.]

jnmaciuch · Feb 27, 2026

I made a quick little custom track for looking at DecodeME hits on UCSC genome browser, which allows you to cross-reference with a lot of other databases to find out some more interesting information about top variants beyond just what genes they are near.

[Edit: the magic of the internet is real and apparently all these steps are already embedded in the link I provided. I'll move all the additional instructions to a spoiler just for future reference]

Go to UCSC genome browser

At the bottom of the main window, click the middle button that says "Add custom tracks":

Above the first text box hit "Browse" and upload the file attached at the end of the post (DecodeME_CredibleSet_UCSC_custom_track.txt), then click "Submit":

Click "Go to first annotation" to jump back to the viewer window:

You will end up very zoomed in to the location of the first variant. You can hit "Zoom out" 3x or 10x at the top right a couple times to get a wider view and orient yourself. The DecodeME annotations will default to populating at the top row of the window.

I had to narrow down the number of hits to plot, so this file only includes hits in the 95% credible set as calculated by LocusZoom (i.e. we rarely know exactly which variants are the "causal" ones for disease because nearby ones tend to be inherited together, but we can be 95% confident that the real causal variants are within that "credible set"). Not all "peaks" and genes discussed in the DecodeME results were included in the credible set, so only hits around 6 loci (on chr1, chr6p, chr6q, chr15, chr17, chr20) are in the custom track.

I'm a bit limited in what visual features I could code into the track, but I wanted to give a sense of which hits had the strongest signal within specific areas of interest. Variants are colored by -log10(p-value), scaled for the local maximum according to the viridis color scale:

Meaning that only the green-colored dots from the LocusZoom plot below will be plotted in the UCSC genome browser track, with the highest dot colored yellow and the lowest dot colored dark purple in the UCSC track:

A bit clunky but hopefully still helpful.

The file is formatted as a bedDetail, a tab-delimited file with columns as follows:
chrom (chromosome)
chromStart (1 to the left of the "Pos" in the DecodeME summary stats)
chromEnd ("Pos" in the DecodeME summary stats)
name (name for the site, given as the Ref > Alt alleles)
score (colorscale greyness)
strand (always positive)
thickStart (same as chromStart, just designating a "thick" line)
thickEnd
rgb (RGB color code, according to local relative p-value)
ID (arbitrary row number)
description (additional details about the variant that appear when you click on an element from the track)

It's easy to add new sites to the track as new rows, so long as you keep the same formatting. I would not recommend messing with the file header or the columns.

There's an abundance of different data you can include in your window on UCSC genome browser. At minimum I'd recommend "MANE", "NCBI RefSeq" or "GENCODE V49" under the section "Genes and Gene Predictions" to show you the location of genes.

Change the drop-down menu from "hide" to "dense" to have it appear in your window--the other options just determine how much space the track takes up in your window. You'll need to click the "Refresh" button on the right hand side of the section header to have it show up in your window.

To better see how a DecodeME variant overlaps with info on other tracks, place your cursor all the way at the top of the window and click-and-drag to highlight a region of interest. In the pop-up window, click "Add highlight." Alternatively, you can zoom the window until the site you're interested in fills the whole screen, and click "Highlight" from the list of buttons at the bottom of the window.

Some other tracks I use frequently:
Genes and Gene Predictions
Non-coding RNA - shows regions that don't code for proteins but might code for important regulatory RNA.

Phenotypes, Variants, and Literature
OMIM - a genetics database that can show you if a location in the genome is associated with other diseases/traits/etc.

Expression
GTEx Gene V8 - under genes that code for detectable mRNA, this shows you a handy little barplot with relative steady-state expression levels across tissues (measured from a tissue bank).

Regulation
ENCODE cCREs - highlights cis-regulatory elements, like promoters, enhancers, and CTCF binding sites. Can be useful to tell if a SNV falls within a hotspot where a lot of transcription factor binding and gene regulation happens.

JASPAR Transcription Factors - a database of transcription factor binding motifs. Can tell you if a SNV overlaps the place in the genome where particular transcription factors bind. Note this goes off of the reference genome, so it will not show you if a given SNV adds in a TF binding site that isn't already there in the hg38 reference. Also note this isn't proof that a given transcription factor does bind there in any given cell type, just that it theoretically can.

library(tidyverse)
library(magrittr)
library(data.table)
library(colourvalues)

# Load summary stats
# Inputs are 6 csv files manually exported from jumping to top loci at LocusZoom
files <- list.files(workDir,
pattern = ".csv")

regions <- files %>%
str_split_i(pattern = "_",
i = 2)

summary_stats <- files %>%
map(\(x) read_csv(x)) %>%
set_names(regions)

# Filter to credible set
summary_stats %<>%
map(\(x) x %>%
as.data.table() %>%
.[`Cred. set` == TRUE])

# Bind rows
summary_stats %<>% rbindlist(idcol = "region")

# Rename some columns
summary_stats %<>% setnames(old = c("Chrom", "Pos", "-log<sub>10</sub>(p)", "β", "Alt freq."),
new = c("chrom", "chromEnd", "neglog10pval", "beta", "alt_freq"))

# Format chrom column
summary_stats %<>% .[, chrom := paste0("chr", chrom)]

# Add start position
summary_stats %<>% .[, chromStart := chromEnd - 1]

# Create name
summary_stats %<>% .[, name := paste0(Ref, ">", Alt)]

# Create description
summary_stats %<>% .[, desc := paste0("rsID=",
rsID,
"; Beta=",
signif(beta, 4),
"; -log10(pval)=",
signif(neglog10pval, 3),
"; alt freq=",
alt_freq)]

# Around each peak, scale color values into 8 bins (8 is color limit) and assign RGB code
summary_stats %<>% .[, color_bin := cut_number(neglog10pval, n = 8) %>%
as.numeric(),
by = "region"] %>%
.[, color := colour_values_rgb(color_bin,
include_alpha = F) %>%
apply(MARGIN = 1, \(x) paste(x, collapse = ","))]

# Add ID
summary_stats %<>% .[, ID := 1:nrow(.)]

# Add score
summary_stats %<>% .[, score := 999] %>%
.[, strand := "+"]

# Pull columns for BED file
BED <- summary_stats %>%
.[, c("chrom",
"chromStart",
"chromEnd",
"name",
"score",
"strand",
"chromStart",
"chromEnd",
"color",
"ID",
"desc")]

# Add track header as column names
header <- c("track name=DecodeME",
"type='bedDetail'",
"description='95% credible hits from DecodeME'",
"db=hg38",
"visibility=3",
"itemRgb='On'",
"",
"",
"",
"",
"")

BED %<>% setnames(header)

# Save as tab delimited file
write_delim(BED,
file = file.path(workDir,
"DecodeME_CredibleSet_UCSC_custom_track.txt"),
delim = "\t")

hotblack · Feb 27, 2026

Nice! Thanks @jnmaciuch

Edit: I think the link you shared has your session ID or something at the end of the URL, as it already has your custom track applied!

forestglip · Feb 27, 2026

jnmaciuch said:
Steps:
Go to UCSC genome browser

At the bottom of the main window, click the middle button that says "Add custom tracks:

Very cool.

I notice that the link you provided seems to already have a DecodeME custom track with variants.

jnmaciuch · Feb 27, 2026

forestglip said:
Very cool.

I notice that the link you provided seems to already have a DecodeME custom track with variants.

Oh nice I didn't realize it could save a custom track within a URL. I'll update the link to a version that's a little less busy with a few of my recommended tracks.

hotblack · Feb 27, 2026

jnmaciuch said:
Oh nice I didn't realize it could save a custom track within a URL. I'll update the link to a version that's a little less busy with a few of my recommended tracks.

Really useful to have the BED detail file and instructions anyway so thank you. Learning how to do custom tracks is on my todo list after you pointed out some of the Genome Browser features elsewhere recently, but having an example will help a lot!

Jonathan Edwards · Mar 14, 2026

I have forgotten if we have but have we looked at Ch16p13.3 to see if there is any signal at all near the tryptases? Ditto for IgE wherever it is.

hotblack · Mar 14, 2026

Jonathan Edwards said:
I have forgotten if we have but have we looked at Ch16p13.3 to see if there is any signal at all near the tryptases? Ditto for IgE wherever it is.

LocusZoom is failing to load for me for the locations of tryptase alpha, beta, delta and gamma. Not sure what the issue is, would need to look at the underlying data or try some manual locations I expect. Will try to do so later unless someone else gets time to prod around. There’s not much around epsilon

Not sure what you mean by IgE whatever it is, can you clarify?

Jonathan Edwards · Mar 14, 2026

hotblack said:
Not sure what you mean by IgE whatever it is, can you clarify?

I was meaning IgE genes - wherever they are, which is Ch14 at least for the constant heavy chain gene. I don't know much about the IgE genes.

ME/CFS Science Blog · Mar 14, 2026

There were no signals at the p < 10^-6 level on chromosome 14 or 16.

Jonathan Edwards · Mar 14, 2026

ME/CFS Science Blog said:
There were no signals at the p < 10^-6 level on chromosome 14 or 16.

Yes, I saw that. I wondered if homing in on the tryptase area and IgE there might be some weak signals that might possibly be real and come out with a significant value if a targeted study just looked at those. Or nothing at all. It would likely be a wild goose chase but a solid negative could be important.

ME/CFS Science Blog · Mar 14, 2026

Here's what I got for tryptase-related genes on chromosome 16 (using GRCh38)

And using locuszoom:

Looks like my own code doesn't pick up on IGHE (position: chr14:105,597,691-105,601,728) for some reason but here's the data around that position.

And here's the location on locuszoom with IGHE:

So it looks like there is no signal around these genes.

Hutan · Mar 14, 2026

Exploring common pathogenic association between Epstein Barr virus infection and Long Covid by integrating RNA-Seq and molecular dynamics simulations

abstract said:
We applied the bulk RNA-Seq from LC and EBV-infected peripheral blood mononuclear cells (PBMCs), identified the differentially expressed genes (DEGs) and the Protein–Protein interaction (PPI) network using the STRING database, identified hub genes using the cytoscape plugins CytoHubba and MCODE, and performed enrichment analysis using ClueGO.

abstract said:
Out of 357 common genes, 22 genes (CCL2, CCL20, CDCA2, CEP55, CHI3L1, CKAP2L, DEPDC1, DIAPH3, DLGAP5, E2F8, FGF1, NEK2, PBK, TOP2A, CCL3, CXCL8, DEPDC1, IL6, RETN, MMP2, LCN2, and OLR1) were classified as hub genes

Different Genetic Associations of the IgE Production among Fetus, Infancy and Childhood

Analyses of gene-gene interactions indentified that the combination of NPSR1, rs324981 TT with FGF1, rs2282797 CC had the highest risk (85.7%) of IgE elevation at 1.5 years of age (P = 1.46×10(-4)). The combination of IL13, CYFIP2 and PDE2A was significantly associated with IgE elevation at 3 years of age (P = 5.98×10(-7)), and the combination of CLEC2D, COLEC11 and CCL2 was significantly associated with IgE elevation at 6 years of age (P = 6.65×10(-7)).

So, the two genes bolded are reported as being relevant to Long Covid, an active EBV infection and high IgE in childhood.

A very long shot, probably problems with both studies; I haven't looked into them at all. But, it's a coincidence, so perhaps it's worth checking out the genes.

Sounds as though HLA region is important, I'm looking forward to the DecideME analysis of that region.

V.R.T. · Mar 14, 2026

Hutan said:
Sounds as though HLA region is important, I'm looking forward to the DecideME analysis of that region.

Really hoping that turns up something interesting.

Hutan · Apr 8, 2026

I want to write something about the evidence for CoQ10 supplementation for ME/CFS. Primary deficiencies of CoQ10 result from variations in COQ and PDSS genes. I don't think those genes were identified as being the regions of variations more common in the ME/CFS participants.

So, can we be confident that ME/CFS isn't associated with primary deficiencies of CoQ10? I'm not yet clear on what DecodeME as a GWAS can find, versus the more detailed Whole Genome Sequencing.

Mutations in these nuclear-encoded genes cause primary coenzyme Q10 deficiency, a rare autosomal recessive metabolic disorder.
COQ1 (PDSS1): Chromosome 10 (10p12.1)
COQ1 (PDSS2): Chromosome 6 (6q21)
COQ2: Chromosome 4 (4q21.3)
COQ3: Chromosome 6 (6q16.3)
COQ4: Chromosome 9 (9q34.13)
COQ5: Chromosome 12 (12q24.31)
COQ6: Chromosome 14 (14q24.1)
COQ7: Chromosome 16 (16p13.11-p12.3)
COQ8A (ADCK3): Chromosome 1 (1q42.13)
COQ8B (ADCK4): Chromosome 19
COQ9: Chromosome 16 (16q13)
COQ10A: Chromosome 12 (12q13.3)
COQ10B: Chromosome 2 (2q33.1)

Jonathan Edwards · Apr 8, 2026

Hutan said:
So, can we be confident that ME/CFS isn't associated with primary deficiencies of CoQ10? I'm not yet clear on what DecodeME as a GWAS can find, versus the more detailed Whole Genome Sequencing.

My limited unserstanding is that DecodeME will have only produced a signal for these genes if there are relatively common SNP variants that alter function. If deficient function is only associated with rather rare variants then that would be missed I think.

It raises th point that maybe has not been emphasised so far - that SequenceME has the power to produce reliable negative indicators for a whole host of gene functions that have been popular candidates for disease mechanisms. At the moment I guess it is possible as many as 10%+ of people with ME/CFS have COQ or PDSS variants with altered function - just ones that are too scarce in the general population to include in a GWAS. But if 98% of people with ME/CFS on WGS do not have functional variants of these genes then we can say that they are unlikely to be critical rate limiting steps in disease. The gene product may still be involved in some relevant pathways, just as water and glucose will be, but that is OK.

Hutan · Apr 8, 2026

Thanks @Jonathan Edwards. We've really got to get that SequenceME study well-funded.

ScoutB · Apr 15, 2026

Apologies if this has been answered already: is there any hope of looking for genetic clues in early onset cases of the DecodeME data? Was age-of-onset data collected? (The thinking being that early-onset cases might have a stronger genetic predisposition, or something else distinct going on.)

I was reminded of this question by this line of Simon's excellent new blog post :

Autoimmune vitiligo is an interesting example of a disease with bimodal onset where the age at onset has provided clues into the mechanisms of the disease. A specific genetic association related to the immune system has been demonstrated in the early onset peak, which has an odds ratio of >8 – a very large genetic risk.

Preprint Initial findings from the DecodeME genome-wide association study of myalgic encephalomyelitis/chronic fatigue syndrome, 2025, DecodeMe Collaboration

Moderator

Senior Member (Voting Rights)

Moderator

Senior Member (Voting Rights)

Attachments

Senior Member (Voting Rights)

Moderator

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Moderator

Different Genetic Associations of the IgE Production among Fetus, Infancy and Childhood​

Senior Member (Voting Rights)

Moderator

Senior Member (Voting Rights)

Moderator

Senior Member (Voting Rights)

Different Genetic Associations of the IgE Production among Fetus, Infancy and Childhood