PAPER 19 Jul 2025 Global

Accurate DNA test finds mutations in leprosy bacterium

Conor J. Meehan shows short-read SNP calling is robust and highly accurate for Mycobacterium leprae, supporting molecular epidemiology for leprosy.

Mycobacterium leprae is the bacterium that causes leprosy, a long-standing disease marked by skin lesions and loss of feeling in peripheral nerves. Studying the bacterium’s genome has been difficult because M. leprae cannot be grown on artificial media, which limits classic laboratory approaches. In recent years, improvements in DNA extraction and sequencing technology have opened a new path: reading the bacterium’s DNA directly and using those sequences to track how strains spread and change. But to use those sequences for reliable epidemiology, researchers need a validated pipeline to call single nucleotide polymorphisms (SNPs) — the tiny DNA differences that can link cases together. Conor J. Meehan and colleagues addressed that gap by building a reference “ground truth” of pairwise SNP differences using Minimap2 and three complete genomes of M. leprae. From that foundation they generated simulated short sequencing reads and tested how well a commonly used SNP-calling tool performed. Their goal was to see whether short-read SNP calling could be trusted for M. leprae studies, despite the bacterium’s unusual laboratory challenges.

To evaluate SNP calling, the team first used Minimap2 to create a ground truth set of pairwise SNP differences among three complete M. leprae genomes. They then produced simulated short reads from each of those genomes and used the others as references, mirroring the way short-read sequencing data are compared in real studies. The simulated data allowed the investigators to measure precision and recall — how many real SNPs are found and how many calls are correct — when running the short-read SNP-calling workflow with Snippy. The results were encouraging: SNP calling from short reads proved robust and highly accurate for M. leprae. The authors report that approximately 80% of SNPs were called with false positives only due to ambiguous bases in certain genomes, indicating that many apparent errors were tied to uncertain base calls rather than failures of the pipeline itself. Unlike experience with M. tuberculosis, they found that repeat region masking was unnecessary for M. leprae SNP calling, simplifying the workflow.

These findings suggest that short-read SNP calling can be a reliable tool for molecular epidemiology of leprosy. Because M. leprae cannot be cultured in the lab, the ability to extract DNA from clinical samples and accurately call SNPs from short reads is especially valuable: it makes genomic investigation feasible without growing the bacterium. The demonstration that repeat region masking is not needed for M. leprae reduces an extra analytical step and may speed up analyses. By showing that Snippy-based short-read SNP calling achieves high precision and recall against a Minimap2-derived ground truth, Conor J. Meehan and colleagues provide a tested pipeline that researchers can adopt with greater confidence. That, in turn, could help public health teams use genomic data to trace transmission chains, improve detection of linked cases, and inform leprosy control strategies without relying on challenging culture methods.

Public Health Impact

Validated short-read SNP calling will allow genomic tracking of Mycobacterium leprae even when culture is impossible. This can help detect linked cases and guide leprosy control strategies in the field.

Mycobacterium leprae
SNP calling
molecular epidemiology
Minimap2
Snippy
{% if expert_links_html %}
Featured Experts

Author: Katherine Cox

Read Original Source →