Media release
From:
Most disease-associated gut bacteria are not associated with disease, machine learning study finds
Many bacterial-linked illnesses, such as inflammatory bowel disease or colorectal cancer, are associated with an altered gut microbiome. But a study published November 13, 2024, in the Cell Press journal Cell reveals that these diseases may be driven by changes in the microbial load—the density of microbes—rather than the relative abundance of individual species. Using a machine learning model, researchers found that changes in the total microbial load—not the disease itself—may be triggering observed changed in gut bacterial species for at least half of what are generally considered bacterial-associated conditions.
“We were surprised to find that many microbial species, previously believed to be associated with disease, were more strongly explained by changes in microbial load,” Peer Bork of the European Molecular Biology Laboratory (EMBL) Heidelberg, one of the senior authors on the study. “This indicates that these species are mainly associated with symptoms like diarrhea and constipation, rather than being directly linked to the disease conditions themselves.”
Microbial density is closely associated with fecal transit time, stool consistency, water content, and pH in the gut. When diseases have general symptoms like diarrhea or constipation, this impacts the microbial density, which, in turn, impacts the diversity of microbes in the gut. Therefore, when we see changes in a certain bacterial species, it may be cause not by a specific disease, but by general symptoms that many diseases share, like diarrhea.
Microbial load has long been recognized as an important factor in microbiome research, but large-scale analysis has been largely limited due to the high cost and labor-intensive nature of experimental methods. The investigators used a machine-learning approach to overcome this limitation. They developed a prediction model for fecal microbial load based on the relative microbiome composition and applied it to a large-scale metagenomic dataset to explore its variation in health and disease.
“Measuring microbial load in fecal samples takes a lot of effort and we were glad to have access to two large metagenomic datasets where the microbial load had been experimentally measured.,” says Michael Kuhn, also of EMBL and the other senior author on the study. “With our approach, we want to generalize these data for the benefit of the larger field and with the tools we provide, microbial load can be predicted for all adult human gut microbiome studies.”
The new datasets the team generated for the research are thousands of metagenomes and experimentally measured microbial load in the EU-funded GALAXY (Gut-and-Liver Axis in Alcoholic Liver Fibrosis) and the Novo Nordisk Foundation’s MicrobLiver projects. They also used metagenomes and microbial load data from a previously public MetaCardis study population. For exploratory datasets, they used tens of thousands of metagenomes from previous studies including deeply phenotyped populations from Japan and Estonia.
The team acknowledges limitations to the work. Because the analysis was based only on associations, they were not able to establish a clear direction of causality, nor could they provide mechanistic insight. Additionally, the method developed only applies to the human gut microbiome: Different training datasets are needed to predict the microbial load in other habitats.
Future research will focus on microbial species that are more directly associated with diseases, independent of microbial load, to better understand their roles in disease etiology and their potential use as biomarkers. Additionally, adapting this prediction model to other environments, such as ocean and soil microbiomes, could provide further insights into microbial ecology on a global scale.