New pipeline maps 129,312 TB genomes for global surveillance
Iñaki Comas led development of UShER-TB, which processed 129,312 genomes to enable real-time phylogenomic analysis and reveal TB transmission and novel lineages.
Tuberculosis remains a major global health challenge, and scientists have been hampered by the sheer scale of genomic data needed to understand how Mycobacterium tuberculosis (MTB) spreads and evolves around the world. To address that challenge, Iñaki Comas and colleagues created UShER-TB, a comprehensive pipeline built specifically for scalable phylogenomic analysis of MTB. Working strictly from genome sequences, the team processed 129,312 MTB genomes to build a single, comprehensive global phylogeny that captures an unprecedented breadth of genetic diversity. Creating such a reference tree enables researchers to place new samples quickly, compare strains across regions, and see patterns that were previously hidden by computational limits. By assembling a resource that is both large and designed for continuous updating, UShER-TB aims to make global-scale genomic context accessible to researchers and public health teams, including those working in settings with limited computing capacity. The platform thus sets the stage for more consistent, rapid comparison of genomes for surveillance and research purposes while overcoming long-standing barriers to scaling up MTB phylogenomics.
UShER-TB is described as a comprehensive pipeline for scalable phylogenomic MTB analysis and was applied to a dataset of 129,312 genomes to construct a global phylogeny. According to the abstract, this effort captured unprecedented genomic diversity and demonstrated high accuracy in transmission cluster reconstruction, meaning the pipeline can reliably group related cases that reflect chains of transmission. The comprehensive phylogeny also facilitated the identification of putative novel lineages and sublineages, expanding the map of MTB diversity, and it allowed successful placement of ancient DNA samples, linking modern and historical genomes. The UShER-TB platform enables real-time phylogenomic analysis of new genomes and, in doing so, can reveal transmission hotspots and introduction patterns at global scales. The authors emphasize that their approach overcomes longstanding computational barriers and provides researchers with efficient tools for TB genomic surveillance, with a particular focus on usefulness in resource-limited settings where the TB burden is highest.
The implications of the UShER-TB work are broad: a single, scalable pipeline and a massive reference tree make it realistic to perform routine, real-time phylogenomic analysis of MTB at global scale. That capability means public health teams could more rapidly place new cases into a global context, detect clusters of recent transmission, and track introductions between regions. The identification of putative novel lineages and sublineages enriches the scientific picture of MTB evolution and diversity, while the successful placement of ancient DNA samples points to a tightened link between genomic studies of the present and of the past. Importantly, the authors highlight that UShER-TB overcomes computational barriers that have limited previous efforts, making comprehensive genomic surveillance more attainable for laboratories with fewer resources. By lowering technical hurdles and enabling continuous updates and real-time placement of genomes, the platform could change how researchers and public health programs monitor and respond to TB globally.
UShER-TB can help public health teams rapidly detect transmission hotspots and track introductions of Mycobacterium tuberculosis across regions. By making large-scale genomic surveillance feasible in resource-limited settings, it promises broader and more equitable use of genomic tools against TB.
Author: Lily Karim