Genes
1051 genes matched.
| Gene | MTBC0 | Legacy (H37Rv) | MTBC0 PGAP / revised | Pfam | Verdict |
|---|---|---|---|---|---|
| Rv0979c Rv0979c |
mtbc0_001047 |
hypothetical protein | Conserved hypothetical protein; no recognised domain. Function unknown. Foldseek best (non-significant) hit: 4dqz-assembly1_B Crystal Structure of C-terminal Half of Bacterial Hen (prob 0.20, TM 0.53). | Still unknown | |
| Rv0988 Rv0988 |
mtbc0_001059 |
hypothetical protein | Lipocalin-like domain-containing protein. Pfam: CrtC (PF07143.18), Lipocalin_9 (PF17186.11). | CrtC Lipocalin_9 |
Family assigned |
| Rv0990c Rv0990c |
mtbc0_001061 |
hypothetical protein | SAF domain-containing protein. Pfam: SAF (PF08666.18), ChapFlgA (PF13144.12). | SAF ChapFlgA |
Family assigned |
| Rv0991c Rv0991c |
mtbc0_001062 |
hypothetical protein | FmdB family zinc ribbon protein. Pfam: Zn_ribbon_8 (PF09723.16). | Zn_ribbon_8 |
Family assigned |
| Rv0997 Rv0997 |
- |
hypothetical protein | Conserved hypothetical protein; no recognised domain. Function unknown. | Still unknown | |
| Rv0997a Rv0997a |
- |
hypothetical protein | Conserved hypothetical protein; no recognised domain. Function unknown. Foldseek best (non-significant) hit: 1ueb-assembly2_B Crystal structure of translation elongation factor P (prob 1.00, TM 0.70). | Still unknown | |
| Rv0999 Rv0999 |
- |
hypothetical protein | Lipoprotein / outer-membrane-associated protein (lipoprotein-like beta fold); structurally solved but functionally uncharacterised fold RefSeq leaves this locus uncharacterised. | DUF5642 |
Family assigned |
| Rv1000c Rv1000c |
- |
hypothetical protein | Contains 2OG-FeII_Oxy_2 (PF13532.13) domain(s); putative function inferred from the domain architecture. | 2OG-FeII_Oxy_2 |
Family assigned |
| Rv1006 Rv1006 |
mtbc0_001081 |
hypothetical protein | Glycoside-hydrolase-like fold (GH2, PDB 5T99); putative glycosidase. | Family assigned | |
| Rv1012 Rv1012 |
mtbc0_001087 |
hypothetical protein | Conserved hypothetical protein; no recognised domain. Function unknown. | Still unknown | |
| Rv1025 Rv1025 |
- |
hypothetical protein | Conserved hypothetical protein; DUF domain(s) DUF501. Function unknown. Foldseek best (non-significant) hit: 4xku-assembly1_A E coli BFR variant Y114F (prob 0.01, TM 0.19). | DUF501 |
Still unknown |
| ppx2 Rv1026 |
mtbc0_001102 |
hypothetical protein | Exopolyphosphatase Ppx2. Pfam: Ppx-GppA (PF02541.23). | Ppx-GppA |
Resolved |
| Rv1043c Rv1043c |
mtbc0_001120 |
hypothetical protein | Serine protease. Pfam: Trypsin (PF00089.33), Trypsin_2 (PF13365.13). | Trypsin Trypsin_2 |
Resolved |
| Rv1044 Rv1044 |
mtbc0_001121 |
hypothetical protein | Type IV toxin-antitoxin system AbiEi family antitoxin domain-containing protein. Pfam: AbiEi_4 (PF13338.13). | AbiEi_4 |
Family assigned |
| Rv1045 Rv1045 |
mtbc0_001122 |
hypothetical protein | Nucleotidyl transferase AbiEii/AbiGii toxin family protein. Pfam: AbiEii (PF08843.18). | AbiEii |
Family assigned |
| Rv1048c Rv1048c |
mtbc0_001124 |
hypothetical protein | Conserved hypothetical protein; no recognised domain. Function unknown. Foldseek best (non-significant) hit: 4rgu-assembly1_A-2 Crystal Structure of Putative MarR Family Transcrip (prob 0.95, TM 0.60). | Still unknown | |
| Rv1051c Rv1051c |
- |
hypothetical protein | Contains AbiEii (PF08843.18), HTH_17 (PF12728.14) domain(s); putative function inferred from the domain architecture. | AbiEii HTH_17 |
Family assigned |
| Rv1052 Rv1052 |
- |
hypothetical protein | Nucleotidyltransferase (eggNOG COG2184, COG category D): a polymerase-beta-like nucleotidyltransferase, often associated with DNA repair / cell-division functions. RefSeq leaves it 'hypothetical protein'. The acceptor/substrate is undetermined. | Family assigned | |
| Rv1056 Rv1056 |
mtbc0_001135 |
hypothetical protein | DUF427 domain-containing protein. Pfam: NTP_transf_9 (PF04248.20). | NTP_transf_9 |
Family assigned |
| Rv1057 Rv1057 |
mtbc0_001137 |
hypothetical protein | YncE family protein. | Family assigned | |
| Rv1059 Rv1059 |
mtbc0_001139 |
hypothetical protein | Dihydrodipicolinate reductase. Pfam: DapB_N (PF01113.27), DAP_DH_C (PF19328.5). | DapB_N DAP_DH_C |
Resolved |
| Rv1060 Rv1060 |
mtbc0_001140 |
hypothetical protein | SRPBCC family protein. Pfam: Polyketide_cyc2 (PF10604.16). | Polyketide_cyc2 |
Family assigned |
| Rv1061 Rv1061 |
mtbc0_001141 |
hypothetical protein | Class II glutamine amidotransferase. Pfam: GATase_4 (PF13230.12), GATase_6 (PF13522.12). | GATase_4 GATase_6 |
Resolved |
| Rv1062 Rv1062 |
mtbc0_001142 |
hypothetical protein | Patatin-like phospholipase family protein. Pfam: Patatin (PF01734.28). | Patatin |
Family assigned |
| Rv1065 Rv1065 |
mtbc0_001145 |
hypothetical protein | Cysteine dioxygenase family protein. Pfam: CDO_I (PF05995.19). | CDO_I |
Family assigned |
| Rv1066 Rv1066 |
mtbc0_001146 |
hypothetical protein | Rhodanese-like domain-containing protein. Pfam: Rhodanese (PF00581.26). | Rhodanese |
Family assigned |
| Rv1069c Rv1069c |
mtbc0_001149 |
hypothetical protein | Alpha/beta-hydrolase family protein. Pfam: Abhydrolase_9_N (PF15420.12), Abhydrolase_9 (PF10081.16). | Abhydrolase_9_N Abhydrolase_9 |
Family assigned |
| Rv1073 Rv1073 |
mtbc0_001153 |
hypothetical protein | Nicking-endonuclease-like fold (type-II nicking enzyme V.NaeI-like). RefSeq leaves it 'hypothetical protein'. | Family assigned | |
| Rv1075c Rv1075c |
mtbc0_001155 |
hypothetical protein | SGNH/GDSL hydrolase family protein. Pfam: Lipase_GDSL (PF00657.29), Lipase_GDSL_2 (PF13472.13). | Lipase_GDSL Lipase_GDSL_2 |
Family assigned |
| pra Rv1078 |
mtbc0_001158 |
hypothetical protein | RDD family protein. Pfam: RDD (PF06271.19). | RDD |
Family assigned |
| Rv1083 Rv1083 |
mtbc0_001163 |
hypothetical protein | Conserved hypothetical protein; no recognised domain. Function unknown. | Still unknown | |
| Rv1084 Rv1084 |
mtbc0_001164 |
hypothetical protein | Thioredoxin domain-containing protein. Pfam: Thioredox_DsbH (PF03190.22), Thioredoxin_7 (PF13899.13). | Thioredox_DsbH Thioredoxin_7 |
Family assigned |
| Rv1088a Rv1088a |
- |
hypothetical protein | Conserved hypothetical protein; no recognised domain. Function unknown. | Still unknown | |
| Rv1097c Rv1097c |
mtbc0_001180 |
hypothetical protein | Apa-like fold (fibronectin-binding protein Apa, PDB 5ZX9); putative adhesin / secreted protein. | Family assigned | |
| Rv1100 Rv1100 |
- |
hypothetical protein | Conserved hypothetical protein; DUF domain(s) DUF4245. Function unknown. Foldseek best (non-significant) hit: 2p4b-assembly1_B Crystal structure of E.coli RseB (prob 0.41, TM 0.41). | DUF4245 |
Still unknown |
| Rv1101c Rv1101c |
mtbc0_001184 |
hypothetical protein | AI-2E family transporter. Pfam: AI-2E_transport (PF01594.23). | AI-2E_transport |
Family assigned |
| Rv1109c Rv1109c |
mtbc0_001191 |
hypothetical protein | Lipid droplet-associated protein. Pfam: Rv1109c_N (PF27128.1), DUF8129 (PF26450.2). | Rv1109c_N DUF8129 |
Resolved |
| Rv1111c Rv1111c |
mtbc0_001193 |
hypothetical protein | Polytopic integral membrane protein with 4 predicted transmembrane helices (DeepTMHMM). RefSeq leaves it 'hypothetical protein'. A topological feature consistent with a membrane transporter/permease or membrane-embedded enzyme; the transported substrate and molecular function are undetermined. | DUF6542 |
Family assigned |
| Rv1115 Rv1115 |
mtbc0_001197 |
hypothetical protein | Conserved hypothetical protein; no recognised domain. Function unknown. Foldseek best (non-significant) hit: 8gra-assembly1_G Structure of Type VI secretion system cargo delivery (prob 0.01, TM 0.14). | Still unknown | |
| Rv1116 Rv1116 |
mtbc0_001198 |
hypothetical protein | Conserved hypothetical protein; no recognised domain. Function unknown. | Still unknown | |
| Rv1117 Rv1117 |
mtbc0_001200 |
hypothetical protein | Putative quinol monooxygenase. Pfam: ABM (PF03992.23). | ABM |
Family assigned |
| Rv1118c Rv1118c |
- |
hypothetical protein | Circularly permuted NlpC/P60 (YaeF/YiiX-family) cysteine amidase with a structurally competent Cys234-His97 catalytic dyad (Glu115 a candidate third member). RefSeq leaves this locus 'hypothetical protein'; here it is re-annotated by structure-guided active-site mapping. The catalytic histidine partner of the nucleophile Cys234 is His97 (Sgamma-Ndelta1 = 3.6 A on the ESMFold model, 2.9 A on AlphaFold3), invisible to sequence proximity because the fold is circularly permuted - the three sequence-proximal histidines lie 7-23 A away. HHpred matches the permuted Peptidase_C92 / YaeF-YiiX family (>=99.7%) and lipid-acting permuted members (LRAT, H-REV107), and the 86%-hydrophobic pocket points to an N-acyl-amino-acid / lipoprotein amide substrate rather than a peptidoglycan muropeptide. Distinct from the five canonical peptidoglycan-hydrolase NlpC/P60 enzymes of H37Rv (Rv0024, RipA, RipB, RipD, Rv2190c). Catalytic codons essentially invariant across ~250,724 MTBC genomes. A structural prediction, not a biochemical assay. | Resolved | |
| Rv1119c Rv1119c |
- |
hypothetical protein | Conserved hypothetical protein; no recognised domain. Function unknown. | Still unknown | |
| Rv1120c Rv1120c |
- |
hypothetical protein | Contains Guanylate_cyc (PF00211.26) domain(s); putative function inferred from the domain architecture. | Guanylate_cyc |
Family assigned |
| Rv1125 Rv1125 |
mtbc0_001208 |
hypothetical protein | WS/DGAT domain-containing protein. Pfam: WS_DGAT_C (PF06974.19). | WS_DGAT_C |
Family assigned |
| Rv1126c Rv1126c |
mtbc0_001209 |
hypothetical protein | MarR family transcriptional regulator. | Family assigned | |
| Rv1128c Rv1128c |
- |
hypothetical protein | HNH-family endonuclease / nuclease (modification-dependent restriction-endonuclease or nuclease effector of a defense system); COG defense category V RefSeq leaves this locus uncharacterised. | DUF222 |
Family assigned |
| Rv1132 Rv1132 |
mtbc0_001215 |
hypothetical protein | Polytopic integral membrane protein with 11 predicted transmembrane helices (DeepTMHMM). RefSeq leaves it 'hypothetical protein'. A topological feature consistent with a membrane transporter/permease or membrane-embedded enzyme; the transported substrate and molecular function are undetermined. | DUF3556 |
Family assigned |
| Rv1134 Rv1134 |
mtbc0_001217 |
hypothetical protein | YCII-related domain protein (eggNOG COG3795, nucleotide-metabolism COG category F). RefSeq leaves it 'hypothetical protein'. A defined domain family of still-uncertain biochemical function. | Family assigned | |
| Rv1138a Rv1138a |
- |
hypothetical protein | Conserved hypothetical protein; no recognised domain. Function unknown. Foldseek best (non-significant) hit: 3qxi-assembly1_A Crystal structure of enoyl-CoA hydratase EchA1 from M (prob 0.44, TM 0.54). | Still unknown |