A NETWORK-BASED COMPUTATIONALPIPELINE TO STUDY THE VARIABILITY OFTRANSCRIPTOME PROFILES FOR HUMANDISEASES
DOI:
https://doi.org/10.7494/csci.2025.26.SI.7067Abstract
Machine learning applications to high-throughput data in medicine – one of the
biggest resource for understanding complex disease – so far have been limited.
Here we present a computational approach for assessing the intrinsic variability
in the most prominent data type, transcriptomics data for disease cohorts. Our
study looks at situations, where multiple datasets for the same disease are
available. We leverage concepts of network medicine to assess, how the match
between a biological network and a set of differentially expressed genes varies
across different networks and experiments. Our results show that different
biological networks yield markedly different results. Also, the clustering of
diseases depends strongly on the choice of parameters contained in the data
analysis and network processing.
Downloads
References
References
[1] Abramson J., Adler J., Dunger J., Evans R., Green T., Pritzel A., Ron-
neberger O., et al.: Accurate structure prediction of biomolecular interactions
with AlphaFold 3, Nature, pp. 1–3, 2024. doi: 10.1038/s41586-024-07487-w.
[2] Alon U.: Network motifs: theory and experimental approaches, Nature Reviews
Genetics, vol. 8(6), pp. 450–461, 2007. doi: 10.1038/nrg2102.
[3] Alon U.: Systems medicine: physiological circuits and the dynamics of disease,
CRC Press, 2023. doi: 10.1201/9781003356929.
[4] Barabási A.L.: Network medicine—from obesity to the “diseasome”, 2007.
[5] Barabási A.L.: Network Science, Cambridge University Press, 2016.
[6] Barabási A.L., Gulbahce N., Loscalzo J.: Network medicine: a network-based
approach to human disease, Nature Reviews Genetics, vol. 12(1), pp. 56–68, 2011.
[7] Barabasi A.L., Oltvai Z.N.: Network biology: understanding the cell’s functional
organization, Nature Reviews Genetics, vol. 5(2), 101, 2004. doi: 10.1038/nrg1272.
[8] Best M.G., Sol N., Kooi I., Tannous J., Westerman B.A., Rustenburg F.,
Schellen P., et al.: RNA-Seq of Tumor-Educated Platelets Enables Blood-Based
Pan-Cancer, Multiclass, and Molecular Pathway Cancer Diagnostics, Cancer Cell,
vol. 28(5), pp. 666–676, 2015. doi: 10.1016/j.ccell.2015.09.018.
[9] Brunk E., Sahoo S., Zielinski D.C., Altunkaya A., Dräger A., Mih N., Gatto F.,
et al.: Recon3D enables a three-dimensional view of gene variation in human
metabolism, Nature Biotechnology, vol. 36(3), pp. 272–281, 2018. doi: 10.1038/
nbt.4072.
[10] Cakir E., Lesne A., Hütt M.T.: The economy of chromosomal distances in bacte-
rial gene regulation, npj Systems Biology and Applications, vol. 7(1), p. 49, 2021.
doi: 10.1038/s41540-021-00209-2.
[11] Cannarozzi A.L., Latiano A., Massimino L., Bossa F., Giuliani F., Riva M., Un-
garo F., et al.: Inflammatory bowel disease genomics, transcriptomics, proteomics
and metagenomics meet artificial intelligence, United European Gastroenterology
Journal, 2024. doi: 10.1002/ueg2.12655.
[12] Casamassimi A., Federico A., Rienzo M., Esposito S., Ciccodicola A.: Tran-
scriptome profiling in human diseases: new advances and perspectives, Interna-
tional Journal of Molecular Sciences, vol. 18(8), p. 1652, 2017. doi: 10.3390/
ijms18081652.
[13] Chen K.A., Nishiyama N.C., Kennedy Ng M.M., Shumway A., Joisa C.U.,
Schaner M.R., Lian G., et al.: Linking gene expression to clinical outcomes in
pediatric Crohn’s disease using machine learning, Scientific Reports, vol. 14(1),
2667, 2024. doi: 10.1038/s41598-024-52678-0.
[14] Costa V., Esposito R., Ziviello C., Sepe R., Bim L.V., Cacciola N.A., Decaussin-
Petrucci M., et al.: New somatic mutations and WNK1-B4GALNT3 gene fusion
in papillary thyroid carcinoma, Oncotarget, vol. 6(13), pp. 11242–11251, 2015.
doi: 10.18632/oncotarget.3593.
[15] Cunningham F., Allen J.E., Allen J., Alvarez-Jarreta J., Amode M.R.,
Armean I.M., Austine-Orimoloye O., et al.: Ensembl 2022, Nucleic Acids Re-
search, vol. 50(D1), pp. D988–D995, 2022. doi: 10.1093/NAR/GKAB1049.
[16] Desai N., Neyaz A., Szabolcs A., Shih A.R., Chen J.H., Thapar V., Nie-
man L.T., et al.: Temporal and spatial heterogeneity of host response to SARS-
CoV-2 pulmonary infection, Nature Communications, vol. 11(1), p. 6319, 2020.
doi: 10.1038/s41467-020-20139-7.
[17] Eswaran J., Cyanam D., Mudvari P., Reddy S.D.N., Pakala S.B., Nair S.S.,
Florea L., et al.: Transcriptomic landscape of breast cancers through mRNA
sequencing, Scientific Reports, vol. 2(1), p. 264, 2012. doi: 10.1038/srep00264.
[18] Fröhlich H., Balling R., Beerenwinkel N., Kohlbacher O., Kumar S., Lengauer T.,
Maathuis M.H., et al.: From hype to reality: data science enabling personalized
medicine, BMC Medicine, vol. 16, pp. 1–15, 2018. doi: 10.1186/s12916-018-1122-
7.
[19] Gosak M., Markovič R., Dolenšek J., Rupnik M.S., Marhl M., Stožer A., Perc M.:
Network science of biological systems at different scales: A review, Physics of Life
Reviews, vol. 24, pp. 118–135, 2018. doi: 10.1016/j.plrev.2017.11.003.
20] Haberman Y., Schirmer M., Dexheimer P.J., Karns R., Braun T., Kim M.O.,
Walters T.D., et al.: Age-of-diagnosis dependent ileal immune intensification and
reduced alpha-defensin in older versus younger pediatric Crohn Disease patients
despite already established dysbiosis, Mucosal Immunology, vol. 12(2), pp. 491–
502, 2019. doi: 10.1038/s41385-018-0114-4.
[21] Haberman Y., Tickle T.L., Dexheimer P.J., Kim M.O., Tang D., Karns R., Bal-
dassano R.N., et al.: Erratum: Pediatric Crohn disease patients exhibit specific
ileal transcriptome and microbiome signature (Journal of Clinical Investigation
(2014) 124: 8 (3617-3633) DOI: 10.1172/JCI75436), Journal of Clinical Investi-
gation, vol. 125(3), p. 1363, 2015. doi: 10.1172/JCI79657.
[22] Howell K.J., Kraiczy J., Nayak K.M., Gasparetto M., Ross A., Lee C., Mak T.N.,
et al.: DNA Methylation and Transcription Patterns in Intestinal Epithelial Cells
From Pediatric Patients With Inflammatory Bowel Diseases Differentiate Disease
Subtypes and Associate With Outcome, Gastroenterology, vol. 154(3), pp. 585–
598, 2018. doi: 10.1053/j.gastro.2017.10.007.
[23] Hütt M.T.: Understanding genetic variation – the value of systems biol-
ogy, British Journal of Clinical Pharmacology, vol. 77(4), pp. 597–605, 2014.
doi: 10.1111/bcp.12266.
[24] Ideker T., Krogan N.J.: Differential network biology, Molecular Systems Biology,
vol. 8(1), p. 565, 2012. doi: 10.1038/msb.2011.99.
[25] Ilott N.E., Neyazi M., Arancibia-Cárcamo C.V., Powrie F., Geremia A., Inves-
tigators O.T.G.U., et al.: Tissue-dependent transcriptional and bacterial associ-
ations in primary sclerosing cholangitis-associated inflammatory bowel disease,
Wellcome Open Research, vol. 6, 2021. doi: 10.12688/wellcomeopenres.16901.1.
[26] Jablonski K.P., Carron L., Mozziconacci J., Forné T., Hütt M.T., Lesne A.:
Contribution of 3D genome topological domains to genetic risk of cancers:
a genome-wide computational study, Human Genomics, vol. 16(1), 2, 2022.
doi: 10.1186/s40246-022-00375-2.
[27] Jumper J., Evans R., Pritzel A., Green T., Figurnov M., Ronneberger O., Tun-
yasuvunakool K., et al.: Highly accurate protein structure prediction with Al-
phaFold, Nature, vol. 596(7873), pp. 583–589, 2021. doi: 10.1038/s41586-021-
03819-2.
[28] Ker J., Wang L., Rao J., Lim T.: Deep learning applications in medical
image analysis, IEEE Access, vol. 6, pp. 9375–9389, 2017. doi: 10.1109/
access.2017.2788044.
[29] Kharchenko P., Church G.M., Vitkup D.: Expression dynamics of a cellu-
lar metabolic network, Molecular Systems Biology, vol. 1(1), 2005.0016, 2005.
doi: 10.1038/msb4100023.
[30] Knecht C., Fretter C., Rosenstiel P., Krawczak M., Hütt M.T.: Distinct metabolic
network states manifest in the gene expression profiles of pediatric inflamma-
tory bowel disease patients and controls, Scientific Reports, vol. 6(32584), 2016.
doi: 10.1038/srep32584
[31] Kosmidis K., Jablonski K.P., Muskhelishvili G., Hütt M.T.: Chromosomal origin
of replication coordinates logically distinct types of bacterial genetic regulation,
npj Systems Biology and Applications, vol. 6(1), pp. 1–9, 2020. doi: 10.1038/
s41540-020-0124-1.
[32] Krijger P.H.L., De Laat W.: Regulation of disease-associated gene expression in
the 3D genome, Nature Reviews Molecular Cell Biology, vol. 17(12), pp. 771–782,
2016. doi: 10.1038/nrm.2016.138.
[33] Ktena I., Wiles O., Albuquerque I., Rebuffi S.A., Tanno R., Roy A.G., Azizi S.,
et al.: Generative models improve fairness of medical classifiers under distribution
shifts, Nature Medicine, pp. 1–8, 2024. doi: 10.1038/s41591-024-02838-6.
[34] Loscalzo J., Barabasi A.L.: Systems biology and the future of medicine, Wiley
Interdisciplinary Reviews: Systems Biology and Medicine, vol. 3(6), pp. 619–627,
2011. doi: 10.1002/wsbm.144.
[35] Love M., Anders S., Huber W.: Differential analysis of count data – the DESeq2
package, Genome Biology, vol. 15, 550, 2014. doi: 10.1186/s13059-014-0550-8.
[36] Love M.I., Huber W., Anders S.: Moderated estimation of fold change and dis-
persion for RNA-seq data with DESeq2, Genome Biology, vol. 15, pp. 1–21, 2014.
doi: 10.1186/s13059-014-0550-8.
[37] Ma H., Zeng A.P.: Reconstruction of metabolic networks from genome data and
analysis of their global structure for various organisms, Bioinformatics, vol. 19(2),
pp. 270–277, 2003. doi: 10.1093/bioinformatics/19.2.270.
[38] Ma H.W., Zeng A.P.: The connectivity structure, giant strong component and
centrality of metabolic networks, Bioinformatics, vol. 19(11), pp. 1423–1430,
2003. doi: 10.1093/bioinformatics/btg177.
[39] Marigorta U.M., Denson L.A., Hyams J.S., Mondal K., Prince J., Walters T.D.,
Griffiths A., et al.: Transcriptional risk scores link GWAS to eQTLs and predict
complications in Crohn’s disease, Nature Genetics, vol. 49(10), pp. 1517–1521,
2017. doi: 10.1038/ng.3936.
[40] Marr C., Geertz M., Hütt M.T., Muskhelishvili G.: Dissecting the logical types
of network control in gene expression profiles, BMC Systems Biology, vol. 2(1),
pp. 1–9, 2008. doi: 10.1186/1752-0509-2-18.
[41] Meyer S., Reverchon S., Nasser W., Muskhelishvili G.: Chromosomal organiza-
tion of transcription: in a nutshell, Current Genetics, vol. 64, pp. 555–565, 2018.
[42] Mo A., Marigorta U.M., Arafat D., Chan L.H.K., Ponder L., Jang S.R., Prince J.,
et al.: Disease-specific regulation of gene expression in a comparative analysis of
juvenile idiopathic arthritis and inflammatory bowel disease, Genome Medicine,
vol. 10, 48, 2018. doi: 10.1186/s13073-018-0558-x.
[43] Moncrieffe H., Bennett M.F., Tsoras M., Luyrink L.K., Johnson A.L., Xu H.,
Dare J., et al.: Transcriptional profiles of JIA patient blood with subsequent poor
response to methotrexate, Rheumatology (United Kingdom), vol. 56(9), pp. 1542–
1551, 2017. doi: 10.1093/rheumatology/kex206.
44] Nyczka P., Hütt M.T.: Generative network model of transcriptome patterns in
disease cohorts with tunable signal strength, Physical Review Research, vol. 2(3),
033130, 2020. doi: 10.1103/physrevresearch.2.033130.
[45] Nyczka P., Hütt M.T., Lesne A.: Inferring pattern generators on networks, Phys-
ica A, vol. 566, 125631, 2021. doi: 10.1016/j.physa.2020.125631.
[46] Oughtred R., Rust J., Chang C., Breitkreutz B.J., Stark C., Willems A.,
Boucher L., et al.: The BioGRID database: A comprehensive biomedical re-
source of curated protein, genetic, and chemical interactions, Protein Science,
vol. 30(1), pp. 187–200, 2021. doi: 10.1002/PRO.3978.
[47] Palsson B.Ø.: Systems biology: properties of reconstructed networks, Cambridge
university press, 2006. doi: 10.1017/cbo9780511790515.
[48] Papatheodorou I., Fonseca N.A., Keays M., Tang Y.A., Barrera E., Bazant W.,
Burke M., et al.: Expression Atlas: gene and protein expression across multiple
studies and organisms, Nucleic Acids Research, vol. 46(D1), pp. D246–D251,
2018.
[49] Peck B.C., Weiser M., Lee S.E., Gipson G.R., Iyer V.B., Sartor R.B., Her-
farth H.H., et al.: MicroRNAs classify different disease behavior phenotypes of
Crohn’s disease and may have prognostic utility, Inflammatory Bowel Diseases,
vol. 21(9), pp. 2178–2187, 2015. doi: 10.1097/MIB.0000000000000478.
[50] Peeters J.G., Vervoort S.J., Tan S.C., Mijnheer G., de Roock S., Vastert S.J.,
Nieuwenhuis E.E., et al.: Inhibition of Super-Enhancer Activity in Autoinflam-
matory Site-Derived T Cells Reduces Disease-Associated Gene Expression, Cell
Reports, vol. 12(12), pp. 1986–1996, 2015. doi: 10.1016/j.celrep.2015.08.046.
[51] Peters L.A., Perrigoue J., Mortha A., Iuga A., Song W.M., Neiman E.M.,
Llewellyn S.R., et al.: A functional genomics predictive network model identifies
regulators of inflammatory bowel disease, Nature Genetics, vol. 49(10), pp. 1437–
1449, 2017. doi: 10.1038/ng.3947.
[52] Prudencio M., Belzil V.V., Batra R., Ross C.A., Gendron T.F., Pregent L.J., Mur-
ray M.E., et al.: Distinct brain transcriptome profiles in C9orf72-associated and
sporadic ALS, Nature Neuroscience, vol. 18(8), pp. 1175–1182, 2015. doi: 10.1038/
nn.4065.
[53] Qiao L., Khalilimeybodi A., Linden-Santangeli N.J., Rangamani P.: The evo-
lution of systems biology and systems medicine: From mechanistic models to
uncertainty quantification, arXiv preprint arXiv:240805395, 2024. doi: 10.1146/
annurev-bioeng-102723-065309.
[54] Quraishi M.N., Acharjee A., Beggs A.D., Horniblow R., Tselepis C., Gkoutos G.,
Ghosh S., et al.: A Pilot Integrative Analysis of Colonic Gene Expression,
Gut Microbiota, and Immune Infiltration in Primary Sclerosing Cholangitis-
Inflammatory Bowel Disease: Association of Disease With Bile Acid Pathways,
Journal of Crohn’s and Colitis, vol. 14(7), pp. 935–947, 2020. doi: 10.1093/ecco-
jcc/jjaa021
[55] Rajpurkar P., Lungren M.P.: The current and future state of AI interpretation of
medical images, New England Journal of Medicine, vol. 388(21), pp. 1981–1990,
2023. doi: 10.1056/nejmra2301725.
[56] Ricarte-Filho J.C., Li S., Garcia-Rendueles M.E., Montero-Conde C., Voza F.,
Knauf J.A., Heguy A., et al.: Identification of kinase fusion oncogenes in post-
Chernobyl radiation-induced thyroid cancers, Journal of Clinical Investigation,
vol. 123(11), pp. 4935–4944, 2013. doi: 10.1172/JCI69766.
[57] Sabari B.R., Dall’Agnese A., Young R.A.: Biomolecular condensates in the
nucleus, Trends in Biochemical Sciences, vol. 45(11), pp. 961–977, 2020.
doi: 10.1016/j.tibs.2020.06.007.
[58] Sareen D., O’Rourke J.G., Meera P., Muhammad A.K.M.G., Grant S., Simp-
kinson M., Bell S., et al.: Targeting RNA Foci in iPSC-Derived Motor Neu-
rons from ALS Patients with a C9ORF72 Repeat Expansion, Science Trans-
lational Medicine, vol. 5(208), pp. 208ra149–208ra149, 2013. doi: 10.1126/
scitranslmed.3007529.
[59] Sarkans U., Gostev M., Athar A., Behrangi E., Melnichuk O., Ali A., Minguet J.,
et al.: The BioStudies database-one stop shop for all data supporting a life sci-
ences study, Nucleic Acids Research, vol. 46(D1), pp. D1266–D1270, 2018.
[60] Sastry A.V., Gao Y., Szubin R., Hefner Y., Xu S., Kim D., Choudhary K.S., et al.:
The Escherichia coli transcriptome mostly consists of independently regulated
modules, Nature Communications, vol. 10(1), 5536, 2019.
[61] Schlicht K., Nyczka P., Caliebe A., Freitag-Wolf S., Claringbould A., Franke L.,
Võsa U., et al.: The metabolic network coherence of human transcriptomes is
associated with genetic variation at the cadherin 18 locus, Human Genetics,
vol. 138(4), pp. 375–388, 2019. doi: 10.1007/s00439-019-01994-x.
[62] Sharma A., Lysenko A., Jia S., Boroevich K.A., Tsunoda T.: Advances in AI and
machine learning for predictive medicine, Journal of Human Genetics, pp. 1–11,
2024. doi: 10.1038/s10038-024-01231-y.
[63] Shin Y., Chang Y.C., Lee D.S., Berry J., Sanders D.W., Ronceray P.,
Wingreen N.S., et al.: Liquid nuclear condensates mechanically sense and re-
structure the genome, Cell, vol. 175(6), pp. 1481–1491, 2018. doi: 10.1016/
j.cell.2019.02.025.
[64] Sonnenschein N., Geertz M., Muskhelishvili G., Hütt M.T.: Analog regulation of
metabolic demand, BMC Systems Biology, vol. 5(1), 40, 2011. doi: 10.1186/1752-
0509-5-40.
[65] Sonnenschein N., Golib Dzib J.F., Lesne A., Eilebrecht S., Boulkroun S., Zen-
naro M.C., Benecke A., et al.: A network perspective on metabolic inconsistency,
BMC Systems Biology, vol. 6, pp. 1–13, 2012. doi: 10.1186/1752-0509-6-41
66] Szklarczyk D., Gable A.L., Nastou K.C., Lyon D., Kirsch R., Pyysalo S.,
Doncheva N.T., et al.: The STRING database in 2021: customizable
protein-protein networks, and functional characterization of user-uploaded
gene/measurement sets, Nucleic Acids Research, vol. 49(D1), pp. D605–D612,
2021. doi: 10.1093/NAR/GKAA1074.
[67] Travers A., Muskhelishvili G.: DNA supercoiling – a global transcriptional regula-
tor for enterobacterial growth?, Nature Reviews Microbiology, vol. 3(2), pp. 157–
169, 2005. doi: 10.1038/nrmicro1088.
[68] Travers A., Muskhelishvili G., Thompson J.: DNA information: from digital
code to analogue structure, Philosophical Transactions of the Royal Society A:
Mathematical, Physical and Engineering Sciences, vol. 370(1969), pp. 2960–2986,
2012. doi: 10.1098/rsta.2011.0231.
[69] VanDussen K.L., Stojmirović A., Li K., Liu T.C., Kimes P.K., Muegge B.D.,
Simpson K.F., et al.: Abnormal Small Intestinal Epithelial Microvilli in Pa-
tients With Crohn’s Disease, Gastroenterology, vol. 155(3), pp. 815–828, 2018.
doi: 10.1053/j.gastro.2018.05.028.
[70] Wu M., Chen Y., Xia H., Wang C., Tan C.Y., Cai X., Liu Y., et al.: Tran-
scriptional and proteomic insights into the host response in fatal COVID-19
cases, Proceedings of the National Academy of Sciences of the United States of
America, vol. 117(45), pp. 28336–28343, 2020. doi: 10.1073/PNAS.2018030117/
SUPPL_FILE/PNAS.2018030117.SD01.XLSX.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Computer Science

This work is licensed under a Creative Commons Attribution 4.0 International License.