yfiH Family assigned · medium auto-curated

H37Rv Rv2149c · MTBC0 - · 250 aa · 2407622–2408374 (-) · RefSeq NP_216665.1

Annotation: from legacy to revised

Legacy (H37Rv / Mycobrowser)laccase domain-containing protein
MTBC0 PGAP re-annotation
Revised (this work)Laccase domain-containing protein. Pfam: Cu-oxidase_4 (PF02578.21).

Auto-curated: this verdict and function were generated by rules from PGAP + Pfam + Foldseek and have not been hand-reviewed.

Annotated on the H37Rv protein: this gene has no 1:1 ancestral MTBC0 anchor (PE/PPE, paralogue, IS element, or otherwise unanchored CDS).

Curated reference (UniProt)

UniProt P9WKD5 SwissProt · reviewed · Evidence at protein level
UniProt namePurine nucleoside phosphorylase Rv2149c
EC (curated) EC 2.4.2.1, EC 2.4.2.28, EC 3.5.4.4
Curated functionPurine nucleoside enzyme that catalyzes the phosphorolysis of adenosine and inosine nucleosides, yielding D-ribose 1-phosphate and the respective free bases, adenine and hypoxanthine. Also catalyzes the phosphorolysis of S-methyl-5'-thioadenosine into adenine and S-methyl-5-thio-alpha-D-ribose 1-phosphate. Also has adenosine deaminase activity.

Functional vocabulary (eggNOG-mapper, orthology transfer)

COG category S Function unknown
Preferred nameyfiH
eggNOG descriptionBelongs to the multicopper oxidase YfiH RL5 family
Orthologous groupCOG1496
KEGG orthology K05810
Gene Ontology (13) GO:0003674, GO:0005488, GO:0005507, GO:0005575, GO:0005618, GO:0005623, GO:0030312, GO:0043167, GO:0043169, GO:0044464, GO:0046872, GO:0046914 +1 more

Orthology-based transfer (eggNOG 5.0.2, diamond). EC/KO/GO/CAZy are computed annotations, not manual curation; cross-check against the primary literature before treating a specific reaction as established.

Domains (Pfam, hmmscan --cut_ga)

PfamAccessioni-EvalueResiduesDescription
Cu-oxidase_4PF02578.21 1.9e-7425–248 Multi-copper polyphenol oxidoreductase laccase

Functional interaction network (STRING v12, guilt-by-association)

Closest characterised functional partner: ftsZ (cell division protein FtsZ), high confidence from genomic context alone (score 886 excluding text-mining).

PartnerProductScoreNo text-miningChannels (≥400)
Rv2148c hyp hypothetical protein 978 960 ctx neighborhood:882 coexpression:672 textmining:496
Rv2150c ftsZ cell division protein FtsZ 899 886 ctx neighborhood:879
Rv2147c sepF cell division protein SepF 968 828 ctx neighborhood:816 textmining:823
Rv2152c murC UDP-N-acetylmuramate--alanine ligase 803 804 ctx neighborhood:734
Rv2154c ftsW lipid II flippase FtsW 915 770 ctx neighborhood:707 textmining:645
Rv2153c murG UDP-N-acetylglucosamine--N-acetylmuramyl-(pentapeptide) pyrophosphoryl-undecaprenol-N-acetylglucosamine transferase 904 767 ctx neighborhood:715 textmining:607
Rv2151c ftsQ cell division protein FtsQ 740 729 ctx neighborhood:715
Rv2156c murX phospho-N-acetylmuramoyl-pentappeptidetransferase 704 677 ctx neighborhood:605
Rv2157c murF UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-alanine ligase 668 667 ctx neighborhood:611
Rv2158c murE UDP-N-acetylmuramoylalanyl-D-glutamate--2,6-diaminopimelate ligase 606 606 ctx neighborhood:604
Rv2146c transmembrane protein 623 598 ctx neighborhood:571
Rv2155c murD UDP-N-acetylmuramoylalanine--D-glutamate ligase 804 575 ctx neighborhood:559 textmining:558
Rv2145c wag31 cell wall synthesis protein Wag31 466 446 ctx neighborhood:423
Rv2160A Rv2160A, len: 211 aa. Conserved hypothetical protein, possibly a TetR-family transcriptional regulator,similar to N-terminal half of AL51266 435 435 ctx neighborhood:420
Rv2159c hyp hypothetical protein 426 426 ctx neighborhood:420

STRING combines evidence channels (neighborhood, fusion, cooccurrence, coexpression, experimental, database, text-mining) into a 0–1000 score. The ctx badge marks edges carried by the genomic-context channels (conserved neighborhood, fusion, phylogenetic co-occurrence), which are independent of orthology and structure and the strongest signal for an unknown gene. The no text-mining column recomputes the score from data alone, so a link that does not depend on the literature is visible. Association is a function hypothesis, not proof: corroborate with the operon context and the primary literature before assigning a function.

Evidence

  • Annotation from H37Rv (no MTBC0 1:1 anchor; H37Rv protein used): laccase domain-containing protein
  • Pfam (hmmscan --cut_ga): Cu-oxidase_4 PF02578.21 (E=2e-74)
  • (auto-curated by rules from PGAP + Pfam + Foldseek; not hand-reviewed)

Sources

  • Ancestral sequence & coordinates: Harrison LB et al. (2024), An imputed ancestral reference genome for the MTBC, doi:10.1101/2023.09.07.556366
  • Product annotation: NCBI PGAP on MTBC0; legacy from H37Rv NC_000962.3 (RefSeq NP_216665.1)
  • Domains: Pfam-A via hmmscan --cut_ga — Cu-oxidase_4 (PF02578.21)
  • Sequence-level signal: ESM Atlas (EvolutionaryScale × BioHub) — exploratory
  • Controlled vocabulary: eggNOG-mapper 2.1.12 (Cantalapiedra et al. 2021, doi:10.1093/molbev/msab293), eggNOG 5.0 DB (Huerta-Cepas et al. 2019) — OG COG1496
  • Curated reference: UniProt P9WKD5 (SwissProt, reviewed; Evidence at protein level)
  • Interaction network: STRING v12.0 (Szklarczyk et al. 2023, doi:10.1093/nar/gkac1000), taxon 83332, CC-BY 4.0 — 26 functional partner(s); context anchor ftsZ
  • Primary literature: none located yet; annotation rests on the domain/homology sources above.

Ancestral MTBC0 protein sequence

>H37Rv|Rv2149c|yfiH
MLASTRHIARGDTGNVSVRIRRVTTTRAGGVSAPPFDTFNLGDHVGDDPAAVAANRARLAAAIGLPGNRVVWMNQVHGDRVELVDQPRNTALDDTDGLVTATPRLALAVVTADCVPVLMADARAGIAAAVHAGRAGAQRGVVVRALEVMLSLGAQVRDISALLGPAVSGRNYEVPAAMADEVEAALPGSRTTTAAGTPGVDLRAGIACQLRDLGVESIDVDPRCTVADPTLFSHRRDAPTGRFASLVWME