Genes
1051 genes matched.
| Gene | MTBC0 | Legacy (H37Rv) | MTBC0 PGAP / revised | Pfam | Verdict |
|---|---|---|---|---|---|
| Rv0004 Rv0004 | mtbc0_000004 | hypothetical protein | DciA, DNA replication protein. Directly binds DNA and the replicative helicase DnaB and regulates the DnaB-DnaA interaction; functional analogue of the DnaC/DnaI helicase loaders that mycobacteria lack. Essential for viability; its depletion blocks cell-cycle progression. | DciA | Resolved |
| Rv0021c Rv0021c | mtbc0_000026 | hypothetical protein | Nitronate monooxygenase (NMO) family protein; FMN-dependent oxidoreductase of the NMO / 2-nitropropane dioxygenase class. Pfam confirms a tandem NMO (PF03060) + FMN_dh (PF01070) architecture. The precise physiological substrate in M. tuberculosis is not experimentally established. | NMO FMN_dh | Family assigned |
| Rv0025 Rv0025 | mtbc0_000030 | hypothetical protein | Conserved hypothetical protein; DUF4226 domain-containing. Function unknown. | DUF4226 | Still unknown |
| Rv0026 Rv0026 | - | hypothetical protein | Conserved hypothetical protein; DUF domain(s) DUF4226. Function unknown. | DUF4226 | Still unknown |
| Rv0027 Rv0027 | mtbc0_000032 | hypothetical protein | Type VII secretion system (ESX) EspC-family protein. Pfam assigns the T7SS_ESX_EspC domain (PF10824, E=2.2e-31), a more specific call than the PGAP 'ESX-1 secretion-associated protein' label; consistent with a secreted ESX/Esp-associated substrate. No Rv0027-specific experimental characterization was found. | T7SS_ESX_EspC | Family assigned |
| Rv0028 Rv0028 | mtbc0_000033 | hypothetical protein | ESX-1 secretion-associated protein (EspH-like). RefSeq leaves it 'hypothetical protein'. | DUF2694 | Family assigned |
| Rv0029 Rv0029 | mtbc0_000035 | hypothetical protein | Conserved hypothetical protein; tandem DUF5631 + DUF5632 domains. Foldseek finds a significant structural match to Rv3899c (PDB 5IMU), another conserved hypothetical M. tuberculosis protein of unknown function (a proposed vaccine candidate): the fold is thus structurally characterised but the biological function remains unassigned. | DUF5632 DUF5631 | Still unknown |
| Rv0030 Rv0030 | mtbc0_000036 | hypothetical protein | Conserved hypothetical protein; DUF2710 domain-containing. Function unknown. Foldseek yields only weak, near-threshold matches to uncharacterised proteins (e.g. H. pylori HP0035), with no functional assignment. | DUF2710 | Still unknown |
| Rv0034 Rv0034 | mtbc0_000039 | hypothetical protein | NTF2-like / SnoaL-like superfamily protein (Pfam SnoaL_2, PF12680). This alpha+beta cone fold occurs in polyketide cyclases, ketosteroid isomerases and NTF2 domains, often binding a hydrophobic ligand in a deep cavity. The precise physiological role in M. tuberculosis is not established. | SnoaL_2 | Family assigned |
| Rv0036c Rv0036c | mtbc0_000041 | hypothetical protein | Putative DinB/YfiT-like metal-dependent enzyme (Pfam DinB_2 PF12867, with MDMPI_N PF11716). The DinB superfamily is a versatile metal-binding four-helix bundle; PGAP annotates a TIGR03084-family metal-binding protein. Distinct from the Y-family DNA polymerases DinB1/DinB2 (Rv1537/Rv3056); the specific catalytic activity of Rv0036c is not established. | MDMPI_N DinB_2 Wyosine_form | Family assigned |
| Rv0038 Rv0038 | mtbc0_000043 | hypothetical protein | UPF0301 / YqgE/AlgH family protein (Pfam DUF179, PF02622). A widely conserved bacterial family of unknown precise molecular function; the AlgH homolog has been linked to alginate regulation in Pseudomonas, but the activity is uncharacterised. Role in M. tuberculosis unknown. | DUF179 | Family assigned |
| mtc28 Rv0040c | mtbc0_000045 | hypothetical protein | Cell-envelope lipoprotein of the LpqN/LpqT family (Pfam Lpp-LpqN, PF10738). Members are anchored to the mycobacterial envelope; the specific physiological role of this locus is not established here. | Lpp-LpqN | Family assigned |
| Rv0047c Rv0047c | mtbc0_000052 | hypothetical protein | PadR-family transcriptional regulator (Pfam PadR, PF03551). PadR repressors typically control detoxification and multidrug/stress-response genes; the regulon governed by this locus in M. tuberculosis is not established. | PadR | Family assigned |
| Rv0049 Rv0049 | - | hypothetical protein | Conserved hypothetical protein; DUF domain(s) DUF5318. Function unknown. Foldseek best (non-significant) hit: 6ewl-assembly1_A Danio rerio CEP120 first C2 domain (C2A) (prob 0.20, TM 0.40). | DUF5318 | Still unknown |
| Rv0052 Rv0052 | - | hypothetical protein | Contains DJ-1_PfpI (PF01965.31) domain(s); putative function inferred from the domain architecture. | DJ-1_PfpI | Family assigned |
| Rv0057 Rv0057 | mtbc0_000062 | hypothetical protein | Conserved hypothetical protein. No Pfam-A domain above the gathering threshold and no Foldseek structural hit on the ESMFold model: genuinely uncharacterised at both the sequence and the structure level. | | Still unknown |
| darT Rv0059 | mtbc0_000064 | hypothetical protein | DarT (DarT_Mtb), toxin of the DarTG toxin-antitoxin system: a DNA ADP-ribosyltransferase that sequence-specifically modifies thymidines on single-stranded DNA. Unchecked it blocks replication and is bactericidal; it is neutralised and reversed by the cognate antitoxin DarG (Rv0060). The toxin is dispensable for viability. | DarT | Resolved |
| darG Rv0060 | mtbc0_000065 | hypothetical protein | DarG (DarG_Mtb), antitoxin of the DarTG system: a macrodomain DNA ADP-ribosyl-glycohydrolase that removes the ADP-ribose mark deposited on DNA by the toxin DarT (Rv0059) and binds/neutralises it. In M. tuberculosis DarG is essential; its depletion triggers the DNA-damage response and cell death. | Macro | Resolved |
| Rv0061c Rv0061c | - | hypothetical protein | Conserved hypothetical protein; no recognised domain. Function unknown. Foldseek best (non-significant) hit: 5oun-assembly1_A NMR solution structure of the external DII domain of (prob 0.15, TM 0.64). | | Still unknown |
| Rv0063a Rv0063a | - | hypothetical protein | Conserved hypothetical protein; no recognised domain. Function unknown. | | Still unknown |
| Rv0074 Rv0074 | mtbc0_000082 | hypothetical protein | Amidohydrolase-superfamily metalloenzyme (Pfam Amidohydro_1 PF01979 + Amidohydro_3). A TIM-barrel metal-dependent hydrolase; the specific substrate in M. tuberculosis is not established. | Amidohydro_1 Amidohydro_3 | Family assigned |
| Rv0078A Rv0078A | - | hypothetical protein | Contains AbiEii (PF08843.18) domain(s); putative function inferred from the domain architecture. | AbiEii | Family assigned |
| Rv0078B Rv0078B | - | hypothetical protein | Contains Rv0078B (PF18993.7) domain(s); putative function inferred from the domain architecture. | Rv0078B | Family assigned |
| Rv0079 Rv0079 | mtbc0_000089 | hypothetical protein | DATIN (Dormancy-Associated Translation Inhibitor), a DosR-regulon protein. Pfam Ribosom_S30AE_C (PF16321) places it in the ribosome-associated / hibernation-factor family; it is proposed to bind the 30S/70S ribosomal subunits and inhibit or stabilise translation during dormancy. The activity remains a (well-argued) prediction. | Ribosom_S30AE_C | Resolved |
| Rv0080 Rv0080 | mtbc0_000090 | hypothetical protein | Pyridoxamine-5'-phosphate-oxidase (PNPOx) family protein / FMN-binding split-barrel (Pfam Pyridox_ox_2 PF12900). Putative FMN-dependent oxidoreductase; specific reaction not established. | Pyridox_ox_2 | Family assigned |
| Rv0094c Rv0094c | - | hypothetical protein | Conserved hypothetical protein; DUF domain(s) DUF222. Function unknown. | DUF222 | Still unknown |
| Rv0095c Rv0095c | - | hypothetical protein | Conserved hypothetical protein; DUF domain(s) DUF222. Function unknown. | DUF222 | Still unknown |
| Rv0100 Rv0100 | mtbc0_000109 | hypothetical protein | Acyl carrier protein (ACP). Pfam PP-binding (PF00550) is the phosphopantetheine-attachment site: the protein carries acyl intermediates as a 4'-phosphopantetheine thioester for fatty-acid / polyketide biosynthesis. A standalone ACP, likely partnering a biosynthetic cluster in this genomic region. | PP-binding | Resolved |
| Rv0104 Rv0104 | mtbc0_000113 | hypothetical protein | Cyclic-nucleotide-binding (cNMP) domain protein (Pfam cNMP_binding PF00027). A large (504 aa) protein carrying a cNMP regulatory module, likely a cyclic-nucleotide-responsive regulator/effector; the precise function is not established. | cNMP_binding | Family assigned |
| Rv0106 Rv0106 | mtbc0_000116 | hypothetical protein | CobW/P47K-family nucleotide- and metal-binding protein (Pfam cobW PF02492 + CobW_C PF07683), of the COG0523 subfamily of putative metallochaperones (often involved in zinc/cobalt homeostasis). The specific metal and role are not established. | cobW CobW_C | Family assigned |
| Rv0108c Rv0108c | - | hypothetical protein | Small OB-fold / twisted beta-sandwich protein; ambiguous fold-level match (phage head-to-tail joining gpFII-like and/or EF-P/eIF5A OB-fold); function undetermined. RefSeq leaves this locus uncharacterised. | | Family assigned |
| Rv0115a Rv0115a | - | hypothetical protein | Conserved hypothetical protein; no recognised domain. Function unknown. | | Still unknown |
| Rv0121c Rv0121c | mtbc0_000132 | hypothetical protein | F420-dependent / PNPOx-class oxidoreductase (Pfam PNPOx_N PF01243; PGAP TIGR03668 PPOX-class F420-dependent oxidoreductase). Putative deazaflavin (F420)-dependent oxidoreductase, a redox chemistry characteristic of mycobacteria. | PNPOx_N | Family assigned |
| Rv0122 Rv0122 | mtbc0_000133 | hypothetical protein | Conserved hypothetical protein; no Pfam domain above threshold. Foldseek gives a suggestive but non-significant fold similarity to a ribonuclease (prob 0.98, E=5e-2, TM=0.61) with weaker hits to HigBA-type toxin-antitoxin RNases: a possible RNase/toxin-like fold, not conclusive. | | Still unknown |
| Rv0123 Rv0123 | mtbc0_000134 | hypothetical protein | Putative DNA-binding protein. No Pfam domain above threshold, but Foldseek matches antitoxin DNA-binding domains strongly and consistently (CopASO antitoxin prob 0.99 / TM 0.83; ParDE antitoxin; PutA HTH), pointing to a ribbon-helix-helix / antitoxin-type DNA-binding fold. Structure-based, consistent with the PGAP 'DNA-binding protein' call. | | Family assigned |
| Rv0138 Rv0138 | mtbc0_000149 | hypothetical protein | NTF2-like / SnoaL-like superfamily protein (Pfam SnoaL_4 PF13577 + SnoaL_2 PF12680). A cone fold often acting as a polyketide cyclase or hydrophobic-ligand-binding domain; the specific role is not established. | SnoaL_4 SnoaL_2 | Family assigned |
| Rv0140 Rv0140 | mtbc0_000151 | hypothetical protein | Putative nucleotidyltransferase (Pfam NTP_transf_9 PF04248; the family formerly named DUF427). A minimal nucleotidyltransferase fold; the substrate is not established. | NTP_transf_9 | Family assigned |
| Rv0141c Rv0141c | mtbc0_000152 | hypothetical protein | NTF2-like / SnoaL-like superfamily protein (Pfam SnoaL_2 PF12680), as for Rv0138. Cone-fold ligand-binding / cyclase-like domain; specific role not established. | SnoaL_2 | Family assigned |
| Rv0142 Rv0142 | mtbc0_000153 | hypothetical protein | DNA-3-methyladenine glycosylase: a base-excision-repair enzyme that removes alkylated bases (e.g. 3-methyladenine) from DNA, initiating their repair. Assigned by RefSeq/PGAP homology (no Pfam domain above the gathering threshold). | | Resolved |
| Rv0150c Rv0150c | mtbc0_000161 | hypothetical protein | Conserved hypothetical protein. No Pfam domain above threshold and no Foldseek structural hit on the ESMFold model: genuinely uncharacterised at both sequence and structure level. | | Still unknown |
| Rv0157A Rv0157A | - | hypothetical protein | Conserved hypothetical protein; no recognised domain. Function unknown. Foldseek best (non-significant) hit: 8w9z-assembly1_F The cryo-EM structure of the Nicotiana tabacum PEP-PA (prob 0.09, TM 0.77). | | Still unknown |
| Rv0163 Rv0163 | mtbc0_000176 | hypothetical protein | Thioesterase of the hotdog-fold superfamily (Pfam 4HBT_2 PF13279 + 4HBT PF03061). Putative acyl-CoA / acyl-ACP thioesterase; the specific substrate is not established. | 4HBT_2 4HBT | Family assigned |
| TB18.5 Rv0164 | - | hypothetical protein | Contains Polyketide_cyc2 (PF10604.16), Polyketide_cyc (PF03364.26) domain(s); putative function inferred from the domain architecture. | Polyketide_cyc2 Polyketide_cyc | Family assigned |
| Rv0181c Rv0181c | mtbc0_000194 | hypothetical protein | Pirin-family cupin metalloenzyme (Pfam Pirin PF02678 + Pirin_C_2 PF17954). A bicupin, often Fe-dependent (quercetinase / related dioxygenase chemistry); specific substrate not established. | Pirin Pirin_C_2 | Family assigned |
| Rv0184 Rv0184 | mtbc0_000197 | hypothetical protein | Conserved hypothetical protein; tandem DUF2786 + DUF7168 domains (Pfam), both of unknown function. | DUF2786 DUF7168 | Still unknown |
| Rv0185 Rv0185 | mtbc0_000198 | hypothetical protein | Putative metallohydrolase (PGAP TIGR04338 family). No Pfam domain above threshold, but Foldseek weakly matches an M61-type aminopeptidase fold: a possible metal-dependent hydrolase/peptidase, not firmly assigned. | | Family assigned |
| Rv0190 Rv0190 | mtbc0_000204 | hypothetical protein | Metal-sensing transcriptional regulator (Pfam Trns_repr_metal PF02583; CsoR/RcnR-like). Putative metalloregulator controlling metal-homeostasis genes in response to metal ions. | Trns_repr_metal | Family assigned |
| Rv0192 Rv0192 | - | hypothetical protein | Contains Big_10 (PF17964.8), YkuD (PF03734.20) domain(s); putative function inferred from the domain architecture. | Big_10 YkuD | Family assigned |
| Rv0193c Rv0193c | mtbc0_000207 | hypothetical protein | Putative 2-oxoglutarate / Fe(II)-dependent oxygenase. No Pfam domain above threshold, but Foldseek gives a significant match to a proline-hydroxylase (2OG-Fe(II) oxygenase) fold (prob 1.00, E=1e-4, TM=0.57). Structure-based hypothesis. | | Family assigned |
| Rv0201c Rv0201c | mtbc0_000215 | hypothetical protein | XRE/Cro-family helix-turn-helix transcriptional regulator (PGAP). No Pfam domain above threshold, but Foldseek strongly matches DNA-binding folds (prob 1.00, TM=0.81). Putative DNA-binding regulator. | | Family assigned |