Genes

1051 genes matched.

GeneMTBC0Legacy (H37Rv)MTBC0 PGAP / revisedPfamVerdict
Rv0004
Rv0004
mtbc0_000004 hypothetical protein DciA, DNA replication protein. Directly binds DNA and the replicative helicase DnaB and regulates the DnaB-DnaA interaction; functional analogue of the DnaC/DnaI helicase loaders that mycobacteria lack. Essential for viability; its depletion blocks cell-cycle progression. DciA Resolved
Rv0021c
Rv0021c
mtbc0_000026 hypothetical protein Nitronate monooxygenase (NMO) family protein; FMN-dependent oxidoreductase of the NMO / 2-nitropropane dioxygenase class. Pfam confirms a tandem NMO (PF03060) + FMN_dh (PF01070) architecture. The precise physiological substrate in M. tuberculosis is not experimentally established. NMO FMN_dh Family assigned
Rv0025
Rv0025
mtbc0_000030 hypothetical protein Conserved hypothetical protein; DUF4226 domain-containing. Function unknown. DUF4226 Still unknown
Rv0026
Rv0026
- hypothetical protein Conserved hypothetical protein; DUF domain(s) DUF4226. Function unknown. DUF4226 Still unknown
Rv0027
Rv0027
mtbc0_000032 hypothetical protein Type VII secretion system (ESX) EspC-family protein. Pfam assigns the T7SS_ESX_EspC domain (PF10824, E=2.2e-31), a more specific call than the PGAP 'ESX-1 secretion-associated protein' label; consistent with a secreted ESX/Esp-associated substrate. No Rv0027-specific experimental characterization was found. T7SS_ESX_EspC Family assigned
Rv0028
Rv0028
mtbc0_000033 hypothetical protein ESX-1 secretion-associated protein (EspH-like). RefSeq leaves it 'hypothetical protein'. DUF2694 Family assigned
Rv0029
Rv0029
mtbc0_000035 hypothetical protein Conserved hypothetical protein; tandem DUF5631 + DUF5632 domains. Foldseek finds a significant structural match to Rv3899c (PDB 5IMU), another conserved hypothetical M. tuberculosis protein of unknown function (a proposed vaccine candidate): the fold is thus structurally characterised but the biological function remains unassigned. DUF5632 DUF5631 Still unknown
Rv0030
Rv0030
mtbc0_000036 hypothetical protein Conserved hypothetical protein; DUF2710 domain-containing. Function unknown. Foldseek yields only weak, near-threshold matches to uncharacterised proteins (e.g. H. pylori HP0035), with no functional assignment. DUF2710 Still unknown
Rv0034
Rv0034
mtbc0_000039 hypothetical protein NTF2-like / SnoaL-like superfamily protein (Pfam SnoaL_2, PF12680). This alpha+beta cone fold occurs in polyketide cyclases, ketosteroid isomerases and NTF2 domains, often binding a hydrophobic ligand in a deep cavity. The precise physiological role in M. tuberculosis is not established. SnoaL_2 Family assigned
Rv0036c
Rv0036c
mtbc0_000041 hypothetical protein Putative DinB/YfiT-like metal-dependent enzyme (Pfam DinB_2 PF12867, with MDMPI_N PF11716). The DinB superfamily is a versatile metal-binding four-helix bundle; PGAP annotates a TIGR03084-family metal-binding protein. Distinct from the Y-family DNA polymerases DinB1/DinB2 (Rv1537/Rv3056); the specific catalytic activity of Rv0036c is not established. MDMPI_N DinB_2 Wyosine_form Family assigned
Rv0038
Rv0038
mtbc0_000043 hypothetical protein UPF0301 / YqgE/AlgH family protein (Pfam DUF179, PF02622). A widely conserved bacterial family of unknown precise molecular function; the AlgH homolog has been linked to alginate regulation in Pseudomonas, but the activity is uncharacterised. Role in M. tuberculosis unknown. DUF179 Family assigned
mtc28
Rv0040c
mtbc0_000045 hypothetical protein Cell-envelope lipoprotein of the LpqN/LpqT family (Pfam Lpp-LpqN, PF10738). Members are anchored to the mycobacterial envelope; the specific physiological role of this locus is not established here. Lpp-LpqN Family assigned
Rv0047c
Rv0047c
mtbc0_000052 hypothetical protein PadR-family transcriptional regulator (Pfam PadR, PF03551). PadR repressors typically control detoxification and multidrug/stress-response genes; the regulon governed by this locus in M. tuberculosis is not established. PadR Family assigned
Rv0049
Rv0049
- hypothetical protein Conserved hypothetical protein; DUF domain(s) DUF5318. Function unknown. Foldseek best (non-significant) hit: 6ewl-assembly1_A Danio rerio CEP120 first C2 domain (C2A) (prob 0.20, TM 0.40). DUF5318 Still unknown
Rv0052
Rv0052
- hypothetical protein Contains DJ-1_PfpI (PF01965.31) domain(s); putative function inferred from the domain architecture. DJ-1_PfpI Family assigned
Rv0057
Rv0057
mtbc0_000062 hypothetical protein Conserved hypothetical protein. No Pfam-A domain above the gathering threshold and no Foldseek structural hit on the ESMFold model: genuinely uncharacterised at both the sequence and the structure level. Still unknown
darT
Rv0059
mtbc0_000064 hypothetical protein DarT (DarT_Mtb), toxin of the DarTG toxin-antitoxin system: a DNA ADP-ribosyltransferase that sequence-specifically modifies thymidines on single-stranded DNA. Unchecked it blocks replication and is bactericidal; it is neutralised and reversed by the cognate antitoxin DarG (Rv0060). The toxin is dispensable for viability. DarT Resolved
darG
Rv0060
mtbc0_000065 hypothetical protein DarG (DarG_Mtb), antitoxin of the DarTG system: a macrodomain DNA ADP-ribosyl-glycohydrolase that removes the ADP-ribose mark deposited on DNA by the toxin DarT (Rv0059) and binds/neutralises it. In M. tuberculosis DarG is essential; its depletion triggers the DNA-damage response and cell death. Macro Resolved
Rv0061c
Rv0061c
- hypothetical protein Conserved hypothetical protein; no recognised domain. Function unknown. Foldseek best (non-significant) hit: 5oun-assembly1_A NMR solution structure of the external DII domain of (prob 0.15, TM 0.64). Still unknown
Rv0063a
Rv0063a
- hypothetical protein Conserved hypothetical protein; no recognised domain. Function unknown. Still unknown
Rv0074
Rv0074
mtbc0_000082 hypothetical protein Amidohydrolase-superfamily metalloenzyme (Pfam Amidohydro_1 PF01979 + Amidohydro_3). A TIM-barrel metal-dependent hydrolase; the specific substrate in M. tuberculosis is not established. Amidohydro_1 Amidohydro_3 Family assigned
Rv0078A
Rv0078A
- hypothetical protein Contains AbiEii (PF08843.18) domain(s); putative function inferred from the domain architecture. AbiEii Family assigned
Rv0078B
Rv0078B
- hypothetical protein Contains Rv0078B (PF18993.7) domain(s); putative function inferred from the domain architecture. Rv0078B Family assigned
Rv0079
Rv0079
mtbc0_000089 hypothetical protein DATIN (Dormancy-Associated Translation Inhibitor), a DosR-regulon protein. Pfam Ribosom_S30AE_C (PF16321) places it in the ribosome-associated / hibernation-factor family; it is proposed to bind the 30S/70S ribosomal subunits and inhibit or stabilise translation during dormancy. The activity remains a (well-argued) prediction. Ribosom_S30AE_C Resolved
Rv0080
Rv0080
mtbc0_000090 hypothetical protein Pyridoxamine-5'-phosphate-oxidase (PNPOx) family protein / FMN-binding split-barrel (Pfam Pyridox_ox_2 PF12900). Putative FMN-dependent oxidoreductase; specific reaction not established. Pyridox_ox_2 Family assigned
Rv0094c
Rv0094c
- hypothetical protein Conserved hypothetical protein; DUF domain(s) DUF222. Function unknown. DUF222 Still unknown
Rv0095c
Rv0095c
- hypothetical protein Conserved hypothetical protein; DUF domain(s) DUF222. Function unknown. DUF222 Still unknown
Rv0100
Rv0100
mtbc0_000109 hypothetical protein Acyl carrier protein (ACP). Pfam PP-binding (PF00550) is the phosphopantetheine-attachment site: the protein carries acyl intermediates as a 4'-phosphopantetheine thioester for fatty-acid / polyketide biosynthesis. A standalone ACP, likely partnering a biosynthetic cluster in this genomic region. PP-binding Resolved
Rv0104
Rv0104
mtbc0_000113 hypothetical protein Cyclic-nucleotide-binding (cNMP) domain protein (Pfam cNMP_binding PF00027). A large (504 aa) protein carrying a cNMP regulatory module, likely a cyclic-nucleotide-responsive regulator/effector; the precise function is not established. cNMP_binding Family assigned
Rv0106
Rv0106
mtbc0_000116 hypothetical protein CobW/P47K-family nucleotide- and metal-binding protein (Pfam cobW PF02492 + CobW_C PF07683), of the COG0523 subfamily of putative metallochaperones (often involved in zinc/cobalt homeostasis). The specific metal and role are not established. cobW CobW_C Family assigned
Rv0108c
Rv0108c
- hypothetical protein Small OB-fold / twisted beta-sandwich protein; ambiguous fold-level match (phage head-to-tail joining gpFII-like and/or EF-P/eIF5A OB-fold); function undetermined RefSeq leaves this locus uncharacterised. Family assigned
Rv0115a
Rv0115a
- hypothetical protein Conserved hypothetical protein; no recognised domain. Function unknown. Still unknown
Rv0121c
Rv0121c
mtbc0_000132 hypothetical protein F420-dependent / PNPOx-class oxidoreductase (Pfam PNPOx_N PF01243; PGAP TIGR03668 PPOX-class F420-dependent oxidoreductase). Putative deazaflavin (F420)-dependent oxidoreductase, a redox chemistry characteristic of mycobacteria. PNPOx_N Family assigned
Rv0122
Rv0122
mtbc0_000133 hypothetical protein Conserved hypothetical protein; no Pfam domain above threshold. Foldseek gives a suggestive but non-significant fold similarity to a ribonuclease (prob 0.98, E=5e-2, TM=0.61) with weaker hits to HigBA-type toxin-antitoxin RNases: a possible RNase/toxin-like fold, not conclusive. Still unknown
Rv0123
Rv0123
mtbc0_000134 hypothetical protein Putative DNA-binding protein. No Pfam domain above threshold, but Foldseek matches antitoxin DNA-binding domains strongly and consistently (CopASO antitoxin prob 0.99 / TM 0.83; ParDE antitoxin; PutA HTH), pointing to a ribbon-helix-helix / antitoxin-type DNA-binding fold. Structure-based, consistent with the PGAP 'DNA-binding protein' call. Family assigned
Rv0138
Rv0138
mtbc0_000149 hypothetical protein NTF2-like / SnoaL-like superfamily protein (Pfam SnoaL_4 PF13577 + SnoaL_2 PF12680). A cone fold often acting as a polyketide cyclase or hydrophobic-ligand-binding domain; the specific role is not established. SnoaL_4 SnoaL_2 Family assigned
Rv0140
Rv0140
mtbc0_000151 hypothetical protein Putative nucleotidyltransferase (Pfam NTP_transf_9 PF04248; the family formerly named DUF427). A minimal nucleotidyltransferase fold; the substrate is not established. NTP_transf_9 Family assigned
Rv0141c
Rv0141c
mtbc0_000152 hypothetical protein NTF2-like / SnoaL-like superfamily protein (Pfam SnoaL_2 PF12680), as for Rv0138. Cone-fold ligand-binding / cyclase-like domain; specific role not established. SnoaL_2 Family assigned
Rv0142
Rv0142
mtbc0_000153 hypothetical protein DNA-3-methyladenine glycosylase: a base-excision-repair enzyme that removes alkylated bases (e.g. 3-methyladenine) from DNA, initiating their repair. Assigned by RefSeq/PGAP homology (no Pfam domain above the gathering threshold). Resolved
Rv0150c
Rv0150c
mtbc0_000161 hypothetical protein Conserved hypothetical protein. No Pfam domain above threshold and no Foldseek structural hit on the ESMFold model: genuinely uncharacterised at both sequence and structure level. Still unknown
Rv0157A
Rv0157A
- hypothetical protein Conserved hypothetical protein; no recognised domain. Function unknown. Foldseek best (non-significant) hit: 8w9z-assembly1_F The cryo-EM structure of the Nicotiana tabacum PEP-PA (prob 0.09, TM 0.77). Still unknown
Rv0163
Rv0163
mtbc0_000176 hypothetical protein Thioesterase of the hotdog-fold superfamily (Pfam 4HBT_2 PF13279 + 4HBT PF03061). Putative acyl-CoA / acyl-ACP thioesterase; the specific substrate is not established. 4HBT_2 4HBT Family assigned
TB18.5
Rv0164
- hypothetical protein Contains Polyketide_cyc2 (PF10604.16), Polyketide_cyc (PF03364.26) domain(s); putative function inferred from the domain architecture. Polyketide_cyc2 Polyketide_cyc Family assigned
Rv0181c
Rv0181c
mtbc0_000194 hypothetical protein Pirin-family cupin metalloenzyme (Pfam Pirin PF02678 + Pirin_C_2 PF17954). A bicupin, often Fe-dependent (quercetinase / related dioxygenase chemistry); specific substrate not established. Pirin Pirin_C_2 Family assigned
Rv0184
Rv0184
mtbc0_000197 hypothetical protein Conserved hypothetical protein; tandem DUF2786 + DUF7168 domains (Pfam), both of unknown function. DUF2786 DUF7168 Still unknown
Rv0185
Rv0185
mtbc0_000198 hypothetical protein Putative metallohydrolase (PGAP TIGR04338 family). No Pfam domain above threshold, but Foldseek weakly matches an M61-type aminopeptidase fold: a possible metal-dependent hydrolase/peptidase, not firmly assigned. Family assigned
Rv0190
Rv0190
mtbc0_000204 hypothetical protein Metal-sensing transcriptional regulator (Pfam Trns_repr_metal PF02583; CsoR/RcnR-like). Putative metalloregulator controlling metal-homeostasis genes in response to metal ions. Trns_repr_metal Family assigned
Rv0192
Rv0192
- hypothetical protein Contains Big_10 (PF17964.8), YkuD (PF03734.20) domain(s); putative function inferred from the domain architecture. Big_10 YkuD Family assigned
Rv0193c
Rv0193c
mtbc0_000207 hypothetical protein Putative 2-oxoglutarate / Fe(II)-dependent oxygenase. No Pfam domain above threshold, but Foldseek gives a significant match to a proline-hydroxylase (2OG-Fe(II) oxygenase) fold (prob 1.00, E=1e-4, TM=0.57). Structure-based hypothesis. Family assigned
Rv0201c
Rv0201c
mtbc0_000215 hypothetical protein XRE/Cro-family helix-turn-helix transcriptional regulator (PGAP). No Pfam domain above threshold, but Foldseek strongly matches DNA-binding folds (prob 1.00, TM=0.81). Putative DNA-binding regulator. Family assigned