Yeah, mine has always said "Disability". In a video about Trends, they seem to imply that any word under the term other than "Search term" means a Topic.
Interesting that your R2 is higher.
Just to clarify, the trends page you're using says "CFS/ME" and not "Myalgic encephalomyelitis/chronic fatigue syndrome"? Can you link to that page?
And you're doing a correlation against the percentages from the World Population Review website?
Yeah, mine has always said "Disability". In a video about Trends, they seem to imply that any word under the term other than "Search term" means a Topic.
Interesting that your R2 is higher.
Just to clarify, the trends page you're using says "CFS/ME" and not "Myalgic encephalomyelitis/chronic fatigue syndrome"? Can you link to that page?
And you're doing a correlation against the percentages from the World Population Review website?
Will get you that information in a bit;
Meantime I've also found the Australian census has extremely detailed ancestry and place of birth data, combined with some health data. It's not going to be definitive because mecfs is not listed. But I will pre-specify the hypothesis I will set out to test.
If there is a stronger association between English heritage and "other health conditions" than between other heritages and "other health conditions". Putting extra weight on association among females.
And using association with other health conditions, e.g. stroke diabetes mental health, as a baseline.
And after doing any coldly rational bonferroni adjustments I will also probably do some highly motivated data mining!
So then I got a bit systematic, and looked to see what really was most associated with English heritage, controlling for age. Turns out it is early onset arthritis. Also early onset mental health issues. (English heritage is strongly negatively correlated with kidney disease, apparently).
It is fascinating and it certainly seems to stand up the idea that people from certain backgrounds can be more and less prone to certain diseases but it doesn't shed light on me/cfs directly nor does it say why. So I think it is worth leaving there - unless anyone has any hypotheses they would like me to dig into? Feel free to ask!
I think what could be interesting is looking for the best correlations of ME/CFS searches with other searches. For example, if the states that search most for ME/CFS also search most for "mono".
There's no publicly available API for Google Trends, though, so data would have to be downloaded one at a time through the browser. The idea I had was to start by looking at the correlation of ME/CFS searches with searches for 10 random words. For the most correlated search term, find 10 concepts related to that search term, with something like a thesaurus, and test correlation with those to see if any are even better. And keep iterating.
But I think without being able to automate doing that for thousands of search terms, it might be too slow to be fruitful. But maybe still worth testing with some hand selected terms.
I managed to do this, using gtrendsR, as suggested by Murph. I decided to have an AI code basically the whole script, and I am pleasantly surprised by how well it works.
Essentially, the script is testing how well search interest in "chronic fatigue syndrome" correlates to search interest for other terms. I chose the specific search term "chronic fatigue syndrome" (without quotes) because I worry the ME/CFS Topic includes unrelated terms, and searches for the specific term "ME/CFS" are much less common than for "chronic fatigue syndrome" (henceforth referred to as CFS).
I decided to use the scores for metro regions instead of for states because there are more of them (~210 vs 51), making the statistics more precise, and allows comparing a larger variety of regions.
The algorithm used is as follows:
I have a huge list of words as "seeds". The script starts with a seed word, say "elephant", and then tests how well search interest correlates between "elephant" and CFS.
If this initial correlation passes a lenient threshold of p<0.01, then the script identifies 10 related search terms. Google Trends helpfully provides related terms, which are accessible in the gtrendsR results.
For example, for "elephant", here are the first 3 related terms it returns: "white elephant", "baby elephant", "drunk elephant".
Then the script tests correlation of CFS with each of those terms, and identifies the most significant correlation with CFS out of these 10 terms (let's say "baby elephant"), and if that term is also more significant than the "parent" term ("elephant"), then 10 more related terms are retrieved that are related to this new term.
For example, related terms for "baby elephant" include: "baby elephant baby shower", "baby elephants", "baby elephant walk"
Then the script tests correlation of CFS with these 10 new terms, and on and on, until none of the 10 new terms is more significant than its parent term.
Then the script goes on to the next seed word and starts all over. The correlation statistics for each term are saved to a file.
For "uveitis", none of the 10 related terms was more significantly correlated than "uveitis" to CFS, so it stopped there. For "kidney infection", the related term "kidney infection back pain" was even more significant, so it identified the related terms to this new term, of which there was only one, and for that term, there was too little data to run the correlation test.
It takes about 2 to 5 seconds per term, and if retrieving too many results too fast, it starts giving errors.
So far, I've tested the correlation of CFS with around 4800 other terms. I've used some seed words that are totally random words, some that are diseases, and some that are body parts.
Here are the top 50 most significant correlations so far. (I'll attach a file with the full results.)
Search depth of 0 indicates that this was a seed term. 1 means it was a related term of the seed word, and so on.
Notably the correlation of CFS with itself is not 1. This is because the search score data isn't the same every time, as previously discussed. But reassuringly, searches for fibromyalgia are most significantly correlated with CFS, which makes sense.
Among the top correlations are the very confusing "dog food recall", "food recall", and "cross stitch pattern".
Some of the diseases highly correlated in search interest to CFS are "ocular migraine", "plantar warts", "Raynaud's", and "tendonitis".
It is interesting that multiple sclerosis is highly correlated to CFS here, but was not when I tested previously. I think that is because I previously tested the ME/CFS "Topic" vs the multiple sclerosis "Topic". We know that the Multiple Sclerosis Topic includes the short abbreviation "MS", since Mississippi (abbreviation MS) has extremely high scores, so it's possible that searches for "MS" make the Topic scores less true to searches specifically about multiple sclerosis.
Here are plots for a couple of the most significant terms. (Note that the data in the plots may not be identical to that used in the initial correlations because they are based on re-downloaded data.)
Some potential other things to try:
Use states or countries instead of metros.
Set target term to "chronic fatigue" or the ME/CFS Topic.
Test correlations using Spearman's rho instead of Pearson.
I uploaded the script to GitHub if anyone else wants to try experimenting with it.
Now just need to figure out what ME/CFS, migraines, and cross stitching have in common...
Edit: The time span used for all trends data was 2004-01-01 to 2026-03-24.
This site uses cookies to help personalise content, tailor your experience and to keep you logged in if you register.
By continuing to use this site, you are consenting to our use of cookies.