Genes
1051 genes matched.
| Gene | MTBC0 | Legacy (H37Rv) | MTBC0 PGAP / revised | Pfam | Verdict |
|---|---|---|---|---|---|
| Rv3517 Rv3517 |
mtbc0_003733 |
hypothetical protein | Contains AbiEi_1 (PF09407.17) domain(s); putative function inferred from the domain architecture. | AbiEi_1 |
Family assigned |
| Rv3519 Rv3519 |
mtbc0_003735 |
hypothetical protein | Acetoacetate decarboxylase family protein. Pfam: ADC (PF06314.17). | ADC |
Family assigned |
| Rv3521 Rv3521 |
- |
hypothetical protein | Contains OB_ChsH2_C (PF01796.24) domain(s); putative function inferred from the domain architecture. | OB_ChsH2_C |
Family assigned |
| Rv3527 Rv3527 |
mtbc0_003743 |
hypothetical protein | Conserved hypothetical protein; no recognised domain. Function unknown. | Still unknown | |
| Rv3528c Rv3528c |
mtbc0_003744 |
hypothetical protein | No Pfam domain above threshold; Foldseek indicates a fold similar to 7pga-assembly2_D Chimeric carminomycin-4-O-methyltransferase (DnrK) with regions (prob 1.00, TM 0.55). Structure-based, putative. | Family assigned | |
| Rv3529c Rv3529c |
mtbc0_003746 |
hypothetical protein | TLR2-mediated response inhibitor. Pfam: Sulfotransfer_1 (PF00685.34), Sulfotransfer_3 (PF13469.13). | Sulfotransfer_1 Sulfotransfer_3 |
Resolved |
| Rv3531c Rv3531c |
mtbc0_003748 |
hypothetical protein | No Pfam domain above threshold; Foldseek indicates a fold similar to 3u07-assembly2_B Crystal Structure of the VPA0106 protein from Vibrio parahaemol (prob 1.00, TM 0.41). Structure-based, putative. | Family assigned | |
| chsH1 Rv3541c |
mtbc0_003758 |
hypothetical protein | 3-oxo-23%2C24-bisnorchol-4%2C17(20)-dien-22-oyl-CoA hydratase subunit beta ChsH1. | Family assigned | |
| chsH2 Rv3542c |
mtbc0_003759 |
hypothetical protein | 3-oxo-23%2C24-bisnorchol-4%2C17(20)-dien-22-oyl-CoA hydratase subunit alpha ChsH2. Pfam: FAS1_DH_region (PF13452.12), OB_ChsH2_C (PF01796.24). | FAS1_DH_region OB_ChsH2_C |
Family assigned |
| Rv3555c Rv3555c |
mtbc0_003772 |
hypothetical protein | DUF559 domain-containing protein. Pfam: AbiEi_1 (PF09407.17), DUF559 (PF04480.19). | AbiEi_1 DUF559 |
Family assigned |
| Rv3566A Rv3566A |
- |
hypothetical protein | Conserved hypothetical protein; no recognised domain. Function unknown. Foldseek best (non-significant) hit: 8xt2-assembly1_Lg Cryo-EM structure of the human 55S mitoribosome with (prob 0.01, TM 0.22). | Still unknown | |
| Rv3572 Rv3572 |
mtbc0_003791 |
hypothetical protein | Conserved hypothetical protein; no recognised domain. Function unknown. Foldseek best (non-significant) hit: 1f0l-assembly2_B 1.55 ANGSTROM CRYSTAL STRUCTURE OF WILD TYPE DIPHTHER (prob 0.92, TM 0.36). | Still unknown | |
| Rv3577 Rv3577 |
- |
hypothetical protein | Binuclear metallo-beta-lactamase (MBL)-fold metallo-hydrolase of the UPF0173/UlaG family (InterPro IPR050114), substrate unassigned. RefSeq leaves it 'hypothetical protein'. The MBL HxHxDH motif (His74-His76-Asp78-His79) was confirmed as a genuine two-metal centre by co-folding the chain with two metal ions on AlphaFold Server (independent Zn and Fe jobs, top model iPTM 0.98): the ions bind a single bridged binuclear site 3.2-3.4 A apart, partitioned into a three-histidine metal (His74/His76/His137) and an aspartate-plus-two-histidine metal (Asp78/His79/His235), with identical geometry for Zn or Fe. HHpred top hits (>=99.8%) are UPF0173/UlaG metal-dependent hydrolases (UlaG 2wyl; COG2220). The fold-paralogue safeguard withholds the RNase Z (held by Rv2407) and glyoxalase II (Rv0634c/Rv2581c) labels: only the fold-level family is claimed. The six metal ligands are effectively invariant across ~250,724 genomes (most frequent non-synonymous ligand variant 0.0032%). A structural prediction, not a biochemical assay. | Resolved | |
| Rv3594 Rv3594 |
mtbc0_003813 |
hypothetical protein | N-acetylmuramoyl-L-alanine amidase. Pfam: Amidase_2 (PF01510.31), Rv3766_C (PF27131.1). | Amidase_2 Rv3766_C |
Resolved |
| Rv3603c Rv3603c |
mtbc0_003821 |
hypothetical protein | Rossmann-like and DUF2520 domain-containing protein. Pfam: Rossmann-like (PF10727.16), F420_oxidored (PF03807.24), DUF2520 (PF10728.15). | Rossmann-like F420_oxidored DUF2520 |
Family assigned |
| Rv3605c Rv3605c |
mtbc0_003823 |
hypothetical protein | Polytopic integral membrane protein with 4 predicted transmembrane helices (DeepTMHMM). RefSeq leaves it 'hypothetical protein'. A topological feature consistent with a membrane transporter/permease or membrane-embedded enzyme; the transported substrate and molecular function are undetermined. | DUF3180 |
Family assigned |
| Rv3612c Rv3612c |
mtbc0_000273 |
hypothetical protein | Conserved hypothetical protein; no recognised domain. Function unknown. Foldseek best (non-significant) hit: 3j6b-assembly1_T Structure of the yeast mitochondrial large ribosomal (prob 0.11, TM 0.30). | Still unknown | |
| Rv3613c Rv3613c |
mtbc0_003830 |
hypothetical protein | Member of the espACD-Rv3613c-Rv3612c operon required for virulence-critical ESX-1 secretion. RefSeq leaves it of unknown function. This five-gene operon, positively regulated by EspR through a distal enhancer (the espA activating region), is essential for ESX-1 secretion and virulence (Hunt 2012). Molecular function of Rv3613c unfixed but tied to the ESX-1 secretion apparatus. | Family assigned | |
| Rv3626c Rv3626c |
mtbc0_003843 |
hypothetical protein | Zinc-dependent metalloprotease. Pfam: Zincin_2 (PF10103.16). | Zincin_2 |
Resolved |
| dacB Rv3627c |
mtbc0_003844 |
hypothetical protein | D-alanyl-D-alanine carboxypeptidase/D-alanyl-D-alanine-endopeptidase. Pfam: Rv3627c_N (PF23714.2), Peptidase_S13 (PF02113.21). | Rv3627c_N Peptidase_S13 |
Resolved |
| Rv3633 Rv3633 |
mtbc0_003850 |
hypothetical protein | Phytanoyl-CoA dioxygenase family protein. Pfam: PhyH (PF05721.20). | PhyH |
Family assigned |
| Rv3639c Rv3639c |
- |
hypothetical protein | Conserved hypothetical protein; no recognised domain. Function unknown. Foldseek best (non-significant) hit: 8vdu-assembly1_E Crystal structure of hybrid insulin peptide (InsC8-15 (prob 0.01, TM 0.17). | Still unknown | |
| Rv3642c Rv3642c |
mtbc0_003857 |
hypothetical protein | Antitoxin VbhA family protein. Pfam: VbhA (PF18495.7). | VbhA |
Family assigned |
| Rv3643 Rv3643 |
mtbc0_003859 |
hypothetical protein | Tyrosine-recombinase / integrase-family fold (small, 63 aa); candidate site-specific recombination / mobile-element-associated protein RefSeq leaves this locus uncharacterised. | Family assigned | |
| Rv3647c Rv3647c |
mtbc0_003864 |
hypothetical protein | WhiA-like cell-division / homeostasis protein (LAGLIDADG-derived DNA-binding fold). RefSeq leaves it 'hypothetical protein'. Distinct from whiA/Rv3641c. | Family assigned | |
| Rv3651 Rv3651 |
mtbc0_003868 |
hypothetical protein | DUF5628 domain-containing protein. Pfam: Rv3651-like_N (PF18007.8), Rv3651-like_middle (PF18621.8), Rv3651-like_C (PF21043.4). | Rv3651-like_N Rv3651-like_middle Rv3651-like_C |
Family assigned |
| Rv3654c Rv3654c |
- |
hypothetical protein | Contains Tad (PF13400.12) domain(s); putative function inferred from the domain architecture. | Tad |
Family assigned |
| Rv3655c Rv3655c |
mtbc0_003873 |
hypothetical protein | Apoptosis inhibitor. | Resolved | |
| Rv3656c Rv3656c |
mtbc0_003874 |
hypothetical protein | Conserved hypothetical protein; DUF domain(s) DUF4244. Function unknown. | DUF4244 |
Still unknown |
| ssd Rv3660c |
mtbc0_003878 |
hypothetical protein | Septum site-determining protein Ssd. Pfam: Rv3660c_N (PF26563.1). | Rv3660c_N |
Resolved |
| Rv3661 Rv3661 |
mtbc0_003879 |
hypothetical protein | HAD-IB family hydrolase. Pfam: Hydrolase (PF00702.33), HAD (PF12710.14). | Hydrolase HAD |
Family assigned |
| Rv3662c Rv3662c |
mtbc0_003880 |
hypothetical protein | Oxidoreductase. | Resolved | |
| Rv3672c Rv3672c |
mtbc0_003891 |
hypothetical protein | CoA pyrophosphatase. Pfam: NUDIX (PF00293.35). | NUDIX |
Resolved |
| Rv3678A Rv3678A |
- |
hypothetical protein | No Pfam domain above threshold; Foldseek indicates a fold similar to 8i0t-assembly1_C The cryo-EM structure of human Bact-III complex (prob 1.00, TM 0.82). Structure-based, putative. | Family assigned | |
| Rv3678c Rv3678c |
mtbc0_003897 |
hypothetical protein | RidA family protein. Pfam: YjgF_endoribonc (PF14588.12), Ribonuc_L-PSP (PF01042.27). | YjgF_endoribonc Ribonuc_L-PSP |
Family assigned |
| Rv3683 Rv3683 |
mtbc0_003903 |
hypothetical protein | Metallophosphoesterase. Pfam: Metallophos (PF00149.34), Metallophos_2 (PF12850.13). | Metallophos Metallophos_2 |
Resolved |
| Rv3686c Rv3686c |
mtbc0_003907 |
hypothetical protein | Conserved hypothetical protein; no recognised domain. Function unknown. | Still unknown | |
| Rv3688c Rv3688c |
mtbc0_003909 |
hypothetical protein | GatB/YqeY domain-containing protein. Pfam: YqeY (PF09424.16). | YqeY |
Family assigned |
| Rv3691 Rv3691 |
- |
hypothetical protein | Conserved hypothetical protein; DUF domain(s) DUF4350. Function unknown. Foldseek best (non-significant) hit: 8hx9-assembly1_B Crystal structure of 4-amino-4-deoxychorismate syntha (prob 0.77, TM 0.44). | DUF4350 |
Still unknown |
| Rv3698 Rv3698 |
- |
hypothetical protein | Contains GCS2 (PF04107.19) domain(s); putative function inferred from the domain architecture. | GCS2 |
Family assigned |
| Rv3699 Rv3699 |
mtbc0_003921 |
hypothetical protein | Class I SAM-dependent methyltransferase. Pfam: TPMT (PF05724.18), Methyltransf_23 (PF13489.13), Methyltransf_31 (PF13847.13), Methyltransf_25 (PF13649.13), Methyltransf_11 (PF08241.19), Methyltransf_12 (PF08242.19). | TPMT Methyltransf_23 Methyltransf_31 Methyltransf_25 Methyltransf_11 Methyltransf_12 |
Resolved |
| Rv3705A Rv3705A |
- |
hypothetical protein | Effector of the TcrXY acid-sensing two-component system regulon, required for persistent infection. RefSeq leaves it of unknown function. Rv3705A is one of two characterised members of the ~70-gene TcrXY regulon implicated as a key determinant of Mtb survival in vivo by mitigating redox stress at acidic pH (Stupar 2024). Molecular function unfixed. | Family assigned | |
| Rv3705c Rv3705c |
mtbc0_003927 |
hypothetical protein | Sensor domain-containing protein. Pfam: PknH_C (PF14032.13). | PknH_C |
Family assigned |
| Rv3706c Rv3706c |
mtbc0_003929 |
hypothetical protein | Effector of the TcrXY acid-sensing two-component system regulon, required for persistent infection. RefSeq leaves it of unknown function. Rv3706c, with Rv3705A, is a characterised member of the TcrXY regulon implicated in Mtb survival in vivo by mitigating redox stress at acidic pH (Stupar 2024). Molecular function unfixed. | Family assigned | |
| Rv3707c Rv3707c |
- |
hypothetical protein | AraH2: endo-D-arabinofuranase, founding member (with AraH1/Rv1754c) of glycoside hydrolase family GH183 (Pfam DUF4185). Cleaves the D-arabinan core of arabinogalactan / lipoarabinomannan, i.e. mycobacterial cell-wall remodelling/degradation. NOTE: RefSeq/PGAP still annotate this gene as a hypothetical protein; it was characterised experimentally by Behrens et al. 2023. | DUF4185 |
Resolved |
| Rv3714c Rv3714c |
mtbc0_003937 |
hypothetical protein | Contains AbiEi_1 (PF09407.17) domain(s); putative function inferred from the domain architecture. | AbiEi_1 |
Family assigned |
| Rv3716c Rv3716c |
mtbc0_003939 |
hypothetical protein | YbaB/EbfC family nucleoid-associated protein. Pfam: YbaB_DNA_bd (PF02575.22). | YbaB_DNA_bd |
Family assigned |
| Rv3717 Rv3717 |
- |
hypothetical protein | Contains Amidase_3 (PF01520.24) domain(s); putative function inferred from the domain architecture. | Amidase_3 |
Family assigned |
| Rv3718c Rv3718c |
mtbc0_003941 |
hypothetical protein | SRPBCC family protein. Pfam: Polyketide_cyc2 (PF10604.16). | Polyketide_cyc2 |
Family assigned |
| Rv3719 Rv3719 |
- |
hypothetical protein | Contains FAD_binding_4 (PF01565.29) domain(s); putative function inferred from the domain architecture. | FAD_binding_4 |
Family assigned |