ResearchHub | Open Science Community

Phylogenomics of Mycobacterium africanum reveals a new lineage and a complex evolutionary history

Mireia Coscollá et al.Jun 10, 2020

Abstract Human tuberculosis is caused by members of the Mycobacterium tuberculosis Complex (MTBC). The MTBC comprises several human-adapted lineages known as M. tuberculosis sensu stricto as well as two lineages (L5 and L6) traditionally referred to as M. africanum . Strains of L5 and L6 are largely limited to West Africa for reasons unknown, and little is known on their genomic diversity, phylogeography and evolution. Here, we analyzed the genomes of 365 L5 and 326 L6 strains, plus five related genomes that had not been classified into any of the known MTBC lineages, isolated from patients from 21 African countries. Our population genomic and phylogeographical analyses show that the unclassified genomes belonged to a new group that we propose to name MTBC Lineage 9 (L9). While the most likely ancestral distribution of L9 was predicted to be East Africa, the most likely ancestral distribution for both L5 and L6 was the Eastern part of West Africa. Moreover, we found important differences between L5 and L6 strains with respect to their phylogeographical substructure, genetic diversity and association with drug resistance. In conclusion, our study sheds new light onto the genomic diversity and evolutionary history of M. africanum, and highlights the need to consider the particularities of each MTBC lineage for understanding the ecology and epidemiology of tuberculosis in Africa and globally.

Genetics

Epidemiology

1

Paper

Save

Safety of high-dose amikacin in the first week of all-oral rifampicin-resistant tuberculosis treatment for the prevention of acquired resistance (STAKE): protocol for a single-arm clinical trial

Jihad Snobre et al.Jul 1, 2024

An effective rifampicin-resistant tuberculosis (RR-TB) treatment regimen should include prevention of resistance amplification. While bedaquiline (BDQ) has been recommended in all-oral RR-TB treatment regimen since 2019, resistance is rising at alarming rates. This may be due to BDQ's delayed bactericidal effect, which increases the risk of selecting for resistance to fluoroquinolones and/or BDQ in the first week of treatment when the bacterial load is highest. We aim to strengthen the first week of treatment with the injectable drug amikacin (AMK). To limit the ototoxicity risk while maximising the bactericidal effect, we will evaluate the safety of adding a 30 mg/kg AMK injection on the first and fourth day of treatment.

Epidemiology

Microbiology

0

Paper

Save

A sister lineage of the Mycobacterium tuberculosis complex discovered in the African Great Lakes region

Jean Ngabonziza et al.Jan 21, 2020

The human- and animal-adapted lineages of the Mycobacterium tuberculosis complex (MTBC) are thought to have clonally expanded from a common progenitor in Africa. However, the molecular events that accompanied this emergence remain largely unknown. Here, we describe two MTBC strains isolated from patients with multidrug-resistant tuberculosis, representing an as-yet-unknown lineage, named Lineage 8 (L8), seemingly restricted to the African Great Lakes region. Using genome-based phylogenetic reconstruction, we show that L8 is a sister clade to the known MTBC lineages. Comparison with other complete mycobacterial genomes indicate that the divergence of L8 preceded the loss of the cobF genome region - involved in the cobalamin/vitamin B12 synthesis - and gene interruptions in a subsequent common ancestor shared by all other known MTBC lineages. This discovery further supports an East African origin for the MTBC and provides additional molecular clues on the ancestral genome reduction associated with adaptation to a pathogenic lifestyle.

Genetics

Ecology

1

Paper

Save

Mycobacterium tuberculosiscomplex lineage 5 exhibits high levels of within-lineage genomic diversity and differing gene content compared to the type strain H37Rv

C. Sanoussi et al.Jun 22, 2020

Abstract Pathogens of the Mycobacterium tuberculosis complex (MTBC) are considered monomorphic, with little gene content variation between strains. Nevertheless, several genotypic and phenotypic factors separate the different MTBC lineages (L), especially L5 and L6 (traditionally termed Mycobacterium africanum ), from each other. However, genome variability and gene content especially of L5 and L6 strains have not been fully explored and may be potentially important for pathobiology and current approaches for genomic analysis of MTBC isolates, including transmission studies. We compared the genomes of 358 L5 clinical isolates (including 3 completed genomes and 355 Illumina WGS (whole genome sequenced) isolates) to the L5 complete genomes and H37Rv, and identified multiple genes differentially present or absent between H37Rv and L5 strains. Additionally, considerable gene content variability was found across L5 strains, including a split in the L5.3 sublineage into L5.3.1 and L5.3.2. These gene content differences had a small knock on effect on transmission cluster estimation, with clustering rates influenced by the selection of reference genome, and with potential over-estimation of recent transmission when using H37Rv as the reference genome. Our data show that the use of H37Rv as reference genome results in missing SNPs in genes unique for L5 strains. This potentially leads to an underestimation of the diversity present in the genome of L5 strains and in turn affects the transmission clustering rates. As such, a full capture of the gene diversity, especially for high resolution outbreak analysis, requires a variation of the single H37Rv-centric reference genome mapping approach currently used in most WGS data analysis pipelines. Moreover, the high within-lineage gene content variability suggests that the pan-genome of M. tuberculosis is at least several kilobases larger than previously thought, implying a concatenated or reference-free genome assembly ( de novo ) approach may be needed for particular questions. Data summary Sequence data for the Illumina dataset are available at European Genome-phenome Archive (EGA; https://www.ebi.ac.uk/ega/ ) under the study accession numbers PRJEB38317 and PRJEB38656. Individual runs accession numbers are indicated in Table S8. PacBio raw reads for the L5 Benin genome are available on the ENA accession SAME3170744. The assembled L5 Benin genome is available on NCBI with accession PRJNA641267. To ensure naming conventions of the genes in the three L5 genomes can be followed, we have uploaded these annotated GFF files to figshare at https://doi.org/10.6084/m9.figshare.12911849.v1 . Custom python scripts used in this analysis can be found at https://github.com/conmeehan/pathophy .

Genetics

Epidemiology

22

Paper

Genetics

Epidemiology

0

Save