Novel methods included in SpolLineages tool for fast and precise prediction of Mycobacterium tuberculosis complex spoligotype families - Institut Pasteur de la Guadeloupe Accéder directement au contenu
Article Dans Une Revue Database - The journal of Biological Databases and Curation Année : 2020

Novel methods included in SpolLineages tool for fast and precise prediction of Mycobacterium tuberculosis complex spoligotype families

Résumé

Bioinformatic tools are currently being developed to better understand the Mycobacterium tuberculosis complex (MTBC). Several approaches already exist for the identification of MTBC lineages using classical genotyping methods such as mycobacterial interspersed repetitive units-variable number of tandem DNA repeats and spoligotyping-based families. In the recently released SITVIT2 proprietary database of the Institut Pasteur de la Guadeloupe, a large number of spoligotype families were assigned by either manual curation/expertise or using an in-house algorithm. In this study, we present two complementary data-driven approaches allowing fast and precise family prediction from spoligotyping patterns. The first one is based on data transformation and the use of decision tree classifiers. In contrast, the second one searches for a set of simple rules using binary masks through a specifically designed evolutionary algorithm. The comparison with the three main approaches in the field highlighted the good performances of our contributions and the significant runtime gain. Finally, we propose the 'SpolLineages' software tool (https://github.com/dcouvin/SpolLineages), which implements these approaches for MTBC spoligotype families' identification.

Domaines

Bactériologie
Fichier principal
Vignette du fichier
Database-baaa108.pdf (1.14 Mo) Télécharger le fichier
Origine : Publication financée par une institution

Dates et versions

pasteur-03092701 , version 1 (02-01-2021)

Licence

Paternité

Identifiants

Citer

David Couvin, Wilfried Segretier, Erick Stattner, Nalin Rastogi. Novel methods included in SpolLineages tool for fast and precise prediction of Mycobacterium tuberculosis complex spoligotype families. Database - The journal of Biological Databases and Curation, 2020, 2020, pp.baaa108. ⟨10.1093/database/baaa108⟩. ⟨pasteur-03092701⟩
96 Consultations
65 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More