Rv0157A Still unknown · low auto-curated

H37Rv Rv0157A · MTBC0 - · 42 aa · 186495–186623 (-) · RefSeq YP_007408797.1

Annotation: from legacy to revised

Legacy (H37Rv / Mycobrowser)hypothetical protein
MTBC0 PGAP re-annotation
Revised (this work)Conserved hypothetical protein; no recognised domain. Function unknown. Foldseek best (non-significant) hit: 8w9z-assembly1_F The cryo-EM structure of the Nicotiana tabacum PEP-PA (prob 0.09, TM 0.77).

Auto-curated: this verdict and function were generated by rules from PGAP + Pfam + Foldseek and have not been hand-reviewed.

Annotated on the H37Rv protein: this gene has no 1:1 ancestral MTBC0 anchor (PE/PPE, paralogue, IS element, or otherwise unanchored CDS).

Curated reference (UniProt)

UniProt I6WXK8 TrEMBL · unreviewed · Evidence at protein level
UniProt nameConserved protein

UniProt still lists this protein as Conserved protein; the revised annotation above is ahead of the current UniProt record.

Functional vocabulary (eggNOG-mapper, orthology transfer)

COG category S Function unknown
eggNOG descriptionCysteine-rich secretory protein family
Orthologous groupCOG2340

Orthology-based transfer (eggNOG 5.0.2, diamond). EC/KO/GO/CAZy are computed annotations, not manual curation; cross-check against the primary literature before treating a specific reaction as established.

Domains (Pfam, hmmscan --cut_ga)

No Pfam-A domain above the gathering threshold (or not yet scanned).

Structural neighbours (Foldseek on the ESMFold model, exploratory)

ESMFold model confidence: mean pLDDT 60.6 (low). Low-confidence model: the fold may be unreliable, so treat these structural hits with caution.

Best matches against the PDB, ranked by Foldseek homology probability. A high probability / TM-score suggests a shared fold; unless flagged sig (E < 0.01) these are fold hypotheses, not assignments.

TargetProbTME-valueDescription
8w9z-assembly1_F 0.09 0.77 9.9e+00 8w9z-assembly1_F The cryo-EM structure of the Nicotiana tabacum PEP-PAP

Functional interaction network (STRING v12, guilt-by-association)

Closest characterised functional partner: Rv0158 (transcriptional regulator), medium confidence from genomic context alone (score 558 excluding text-mining). This association is the citable seed of a function hypothesis for this hypothetical protein.

PartnerProductScoreNo text-miningChannels (≥400)
Rv0158 transcriptional regulator 558 558 ctx neighborhood:558
Rv1200 integral membrane transport protein 548 541 database:487
Rv1972 Mce associated membrane protein 483 459
Rv1950c hyp hypothetical protein 441 441 coexpression:431
Rv0756c hyp hypothetical protein 439 440 coexpression:430
Rv3881c espB ESX-1 secretion-associated protein EspB 438 439 coexpression:429
Rv0909 antitoxin 438 438 coexpression:428
Rv3903c cpnT hyp hypothetical protein 438 438 coexpression:428
Rv1305 atpE ATP synthase subunit C 430 430 database:418
Rv1362c membrane protein 435 410
Rv1973 Mce associated membrane protein 435 410
Rv3492c Mce associated protein 418 392
Rv1363c membrane protein 417 391
Rv2390c hyp hypothetical protein 416 390
Rv0200 transmembrane protein 416 390

STRING combines evidence channels (neighborhood, fusion, cooccurrence, coexpression, experimental, database, text-mining) into a 0–1000 score. The ctx badge marks edges carried by the genomic-context channels (conserved neighborhood, fusion, phylogenetic co-occurrence), which are independent of orthology and structure and the strongest signal for an unknown gene. The no text-mining column recomputes the score from data alone, so a link that does not depend on the literature is visible. Association is a function hypothesis, not proof: corroborate with the operon context and the primary literature before assigning a function.

Evidence

  • Annotation from H37Rv (no MTBC0 1:1 anchor; H37Rv protein used): hypothetical protein
  • Foldseek best: 8w9z-assembly1_F The cryo-EM structure of the Nicotiana tabacum PEP-PAP (prob 0.09, E=1e+01, TM=0.77)
  • (auto-curated by rules from PGAP + Pfam + Foldseek; not hand-reviewed)

Sources

  • Ancestral sequence & coordinates: Harrison LB et al. (2024), An imputed ancestral reference genome for the MTBC, doi:10.1101/2023.09.07.556366
  • Product annotation: NCBI PGAP on MTBC0; legacy from H37Rv NC_000962.3 (RefSeq YP_007408797.1)
  • Domains: Pfam-A via hmmscan --cut_ga — none above threshold
  • Sequence-level signal: ESM Atlas (EvolutionaryScale × BioHub) — exploratory
  • Controlled vocabulary: eggNOG-mapper 2.1.12 (Cantalapiedra et al. 2021, doi:10.1093/molbev/msab293), eggNOG 5.0 DB (Huerta-Cepas et al. 2019) — OG COG2340
  • Curated reference: UniProt I6WXK8 (TrEMBL, unreviewed; Evidence at protein level)
  • Model confidence: ESMFold per-residue pLDDT (mean 60.6, low)
  • Interaction network: STRING v12.0 (Szklarczyk et al. 2023, doi:10.1093/nar/gkac1000), taxon 83332, CC-BY 4.0 — 19 functional partner(s); context anchor Rv0158
  • Primary literature: none located yet; annotation rests on the domain/homology sources above.

Ancestral MTBC0 protein sequence

>H37Rv|Rv0157A|
MMDPSPDYDVSDEIEFFFRYLTWGLRGVETGDGYPPPAYPPV