Review Omics-based computational approaches for biomarker identification, prediction, and treatment of Long COVID, 2025, Pinero et al.

SNT Gatchaman

Senior Member (Voting Rights)
Staff member
Omics-based computational approaches for biomarker identification, prediction, and treatment of Long COVID
Sindy Pinero; Xiaomei Li; Junpeng Zhang; Marnie Winter; Sang Hong Lee; Thin Nguyen; Lin Liu; Jiuyong Li; Thuc Duy Le

Long COVID, or post-acute sequelae of COVID-19 (PASC), is a major global health problem, with cumulative estimates suggesting that around 400 million people worldwide have been affected. It is characterized by persistent or new symptoms such as fatigue, cognitive impairment, and breathlessness lasting beyond four weeks after acute infection. Diverse clinical manifestations, chronic course, and incompletely understood pathophysiology—including hypotheses involving viral persistence, immune dysregulation, autoimmunity, endothelial dysfunction, and metabolic reprogramming—impede the development of diagnostic criteria, biomarkers, and targeted therapies.

We conducted a critical review of 101 Long COVID omics studies, focusing on the computational methods used and their methodological quality. Using standardized criteria, we evaluated study design, statistical rigor, reproducibility, and clinical relevance across genomics, epigenomics, transcriptomics, proteomics, metabolomics, and multiomics integration, and mapped these findings onto regulatory and translational frameworks. Despite substantial methodological heterogeneity, convergent biological signals emerged.

Genomic studies implicate risk loci in immune and cardiopulmonary pathways. Epigenomic analyses identify differentially methylated regions in immune and circadian genes. Transcriptomic studies reveal persistent dysregulation of innate immune and coagulation pathways, as well as reproducible molecular endotypes. Proteomic studies consistently show abnormalities in the complement cascade and coagulation, with a small panel of complement proteins showing highly reproducible changes across independent cohorts. Metabolomic studies demonstrate sustained mitochondrial dysfunction and altered cellular bioenergetics for up to two years after infection.

Multiomics integration supports at least two major endotypes, characterized by predominant inflammatory versus metabolic dysregulation, and provides a basis for patient stratification and computational treatment discovery. Machine learning models frequently achieve high classification performance, but are rarely externally validated.

Critical limitations restrict clinical translation. Most studies are underpowered relative to analytical complexity, use heterogeneous case definitions and controls, and report platform-specific signatures with limited overlap. External validation, preregistered analysis plans, and regulatory-aligned assay development are uncommon. To date, no regulatory-approved diagnostic assay or evidence-based therapeutic intervention has directly emerged from these computational findings. Future progress requires harmonized phenotyping protocols, adequately powered longitudinal cohorts with external validation, integration of spatial omics and explainable artificial intelligence, and early engagement with regulatory and health-technology assessment pathways.

This review provides a critical assessment and a translational roadmap, outlining how methodologically robust computational omics can be advanced toward clinically actionable tools for Long COVID.

Web | DOI | Critical Reviews in Clinical Laboratory Sciences | Paywall
 
University of South Australia's press release:

Who is more likely to get long COVID? New study uncovers genetic drivers behind the disease​

Australian scientists have identified the key genetic drivers behind long COVID, revealing why some people continue to experience debilitating symptoms long after their initial infection. The breakthrough, made using large scale biological datasets, could pave the way for targeted treatments and personalised diagnostics.

The team, led by University of South Australia scientists, integrated genetic and molecular data from more than 100 different international studies, identifying 32 causal genes that increase the likelihood of a person developing long COVID, including 13 new genes not previous associated with the disease.

Their findings have been reported in two new scientific papers published in PLOS Computational Biology and Critical Reviews in Clinical Laboratory Sciences.

An estimated 400 million people have been affected by long COVID since 2020, imposing a $1 trillion annual cost to the global economy. Characterised by symptoms like prolonged fatigue, breathlessness, cardiovascular complications and cognitive impairment beyond four weeks, the condition has proved stubbornly difficult to diagnose and treat. Many people have experienced symptoms for weeks, months, and sometimes years after contracting the virus.

Lead author UniSA PhD candidate in Bioinformatics, Sindy Pinero, says large-scale datasets and advanced computational methods can more quickly identify the causes, risk factors, and potential treatment options for long COVID. The methods combine advanced bioinformatics and artificial intelligence to interpret massive biological datasets known as “omics” data – encompassing genomics, proteomics, metabolomics, transcriptomics, and epigenomics. “These findings mark a major step towards a more precise way of diagnosing and treating the condition,” Pinero says. “Long COVID is incredibly complex. It affects multiple organs, shows highly variable symptoms, and has no single final diagnostic marker. However, by using computational models to integrate data from across the world, we can begin to uncover consistent molecular signatures of disease and identify biomarkers that point to new treatment targets.”

The review identifies dozens of genetic, epigenetic, and protein-level biomarkers linked to immune dysfunction, persistent inflammation, and mitochondrial and metabolic abnormalities. Among the key discoveries is a genetic variant in the FOX P4 gene, associated with immune regulation and lung function, that appears to increase people’s susceptibility to long COVID. Researchers also found 71 molecular switches that can turn genes on or off, persisting a year after infection, and more than 1500 altered gene expression profiles tied to immune and neurological disruption.

By integrating these findings using machine learning, the study demonstrates how different layers of biological data can be combined to predict which patients are at risk of long-term complications and how their symptoms may evolve. “This computational framework not only improves our understanding of long COVID but could also accelerate the search for treatments for other post-viral symptoms such as chronic fatigue and fibromyalgia,” according to Assoc Prof Le.

Co-author, UniSA Associate Professor Thuc Le, says that computational science is essential to solving the long COVID puzzle. “Traditional biomedical research can’t keep pace with the complexity of this condition,” Assoc Prof Le says. “By applying artificial intelligence to global datasets, we can identify causal relationships that are invisible in small clinical trials – for example, how specific genes interact with immune pathways to drive persistent inflammation.”

The review also highlights the urgent need for larger, more diverse international datasets and longitudinal studies that follow patients for several years after infection. “Many existing studies are small and inconsistent, which makes it hard to identify reliable biomarkers. Global collaboration and data sharing are the key to producing results that can translate into clinical tools. “This research is not only about long COVID. It represents a blueprint for how global science can use big data, AI and molecular biology to respond to future pandemics and complex chronic diseases.”

‘Integrative Multi-Omics Framework for Causal Gene Discovery in long COVID’ is published in PLOS Computational Biology DOI: 10.1371/journal.pcbi.1013725

‘Omics-based computational approaches for biomarker identification, prediction and treatment of long COVID’ is published in Critical Reviews in Clinical Laboratory Sciences (ILAB). DOI: 10.1080/10408363.2025.2583083
 
Back
Top Bottom