About

Mycobrowser, the reference annotation resource for Mycobacterium, is no longer maintained, and a large share of MTBC genes are still labelled hypothetical protein. This site takes over that role and re-annotates the MTBC gene set, with two distinctive choices.

Anchored on the ancestral genome

Annotation starts from MTBC0, the imputed ancestral genome of the most recent common ancestor of the MTBC (Harrison et al. 2024), rather than from H37Rv. Each protein shown here is the ancestral MTBC0 sequence; H37Rv remains as the historical anchor (its Rv locus tag and legacy product).

A traceable, graded pipeline

Every gene goes through: the MTBC0 PGAP re-annotation, a Pfam domain scan (hmmscan --cut_ga), the ESM Atlas protein-language-model signal (used only as an exploratory indicator), and a literature check. Each fiche carries an explicit verdict (Resolved, Family assigned, Still unknown), a confidence level, and a Sources section that cites the provenance of every field.

FEMTO-ST Institute, CNRS UMR 6174, Université de Franche-Comté. Content under CC-BY 4.0.