Rv3363c Still unknown · low
H37Rv Rv3363c · MTBC0 mtbc0_003578 ·
122 aa · 3800844–3801212 (-) ·
RefSeq NP_217880.1
Annotation: from legacy to revised
| Legacy (H37Rv / Mycobrowser) | hypothetical protein |
|---|---|
| MTBC0 PGAP re-annotation | DUF742 domain-containing protein |
| Revised (this work) | Conserved hypothetical; function unknown. Structure-based hint rejected: non-transferable cross-kingdom fold (human FAM91A1/TBC1D23 Golgi trafficking); function unknown. |
Curated reference (UniProt)
| UniProt |
O50392
TrEMBL · unreviewed
· Predicted
|
|---|---|
| UniProt name | DUF742 domain-containing protein |
Functional vocabulary (eggNOG-mapper, orthology transfer)
| COG category |
K Transcription
|
|---|---|
| eggNOG description | Protein of unknown function (DUF742) |
| Orthologous group | COG1846 |
Orthology-based transfer (eggNOG 5.0.2, diamond). EC/KO/GO/CAZy are computed annotations, not manual curation; cross-check against the primary literature before treating a specific reaction as established.
Domains (Pfam, hmmscan --cut_ga)
| Pfam | Accession | i-Evalue | Residues | Description |
|---|---|---|---|---|
DUF742 | PF05331.18 | 2.6e-36 | 10–122 | Protein of unknown function (DUF742) |
Structural neighbours (Foldseek on the ESMFold model, exploratory)
ESMFold model confidence: mean pLDDT 58.2 (low). Low-confidence model: the fold may be unreliable, so treat these structural hits with caution.
Best matches against the PDB, ranked by Foldseek homology probability. A high probability / TM-score suggests a shared fold; unless flagged sig (E < 0.01) these are fold hypotheses, not assignments.
| Target | Prob | TM | E-value | Description |
|---|---|---|---|---|
8jj9-assembly1_A |
0.99 | 0.68 | 4.5e-02 | 8jj9-assembly1_A Human FAM91A1 N terminal domain in complex with TBC1D23 |
8fw5-assembly1_B |
0.99 | 0.69 | 5.9e-02 | 8fw5-assembly1_B Chimeric HsGATOR1-SpGtr-SpLam complex |
2yu3-assembly1_A |
0.98 | 0.58 | 2.3e-02 | 2yu3-assembly1_A Solution structure of the domain swapped WingedHelix in DNA-directed RNA polymerase III 39 kDa polypeptide |
8fw5-assembly1_C |
0.85 | 0.68 | 4.0e-01 | 8fw5-assembly1_C Chimeric HsGATOR1-SpGtr-SpLam complex |
3k9t-assembly1_A |
0.47 | 0.56 | 5.8e-01 | 3k9t-assembly1_A Crystal structure of putative peptidase (NP_348812.1) from CLOSTRIDIUM ACETOBUTYLICUM at 2.37 A resolution |
7t3c-assembly1_B |
0.44 | 0.59 | 7.1e-01 | 7t3c-assembly1_B GATOR1-RAG-RAGULATOR - Dual Complex |
3tgn-assembly1_B |
0.30 | 0.67 | 2.1e+00 | 3tgn-assembly1_B Crystal Structure of the zinc-dependent MarR Family Transcriptional Regulator AdcR in the Zn(II)-bound State |
6cet-assembly1_N |
0.18 | 0.58 | 2.7e+00 | 6cet-assembly1_N Cryo-EM structure of GATOR1 |
Functional interaction network (STRING v12, guilt-by-association)
Closest characterised functional partner: Rv3362c (ATP/GTP-binding protein), high confidence from genomic context alone (score 995 excluding text-mining). This association is the citable seed of a function hypothesis for this hypothetical protein.
| Partner | Product | Score | No text-mining | Channels (≥400) |
|---|---|---|---|---|
Rv3362c |
ATP/GTP-binding protein | 999 | 995 ctx | neighborhood:882 cooccurence:774 coexpression:824 textmining:892 |
Rv3364c hyp |
hypothetical protein | 997 | 984 ctx | neighborhood:882 cooccurence:774 coexpression:458 textmining:870 |
Rv3365c hyp |
hypothetical protein | 997 | 981 ctx | neighborhood:882 cooccurence:772 textmining:881 |
Rv3361c mfpA hyp |
hypothetical protein | 995 | 967 ctx | neighborhood:882 coexpression:735 textmining:870 |
Rv0860 fadB |
fatty oxidation protein FadB | 797 | 788 | coexpression:647 |
Rv3028c fixB |
electron transfer flavoprotein subunit alpha | 668 | 654 | coexpression:412 experimental:419 |
Rv3029c fixA |
electron transfer flavoprotein subunit beta | 664 | 650 | coexpression:406 experimental:418 |
Rv3153 nuoI |
NADH-quinone oxidoreductase subunit I | 652 | 638 | |
Rv2940c mas |
multifunctional mycocerosic acid synthase | 619 | 588 | database:459 |
Rv3825c pks2 |
phthioceranic/hydroxyphthioceranic acid synthase | 619 | 588 | database:459 |
Rv2048c pks12 |
polyketide synthase | 619 | 588 | database:459 |
Rv1043c hyp |
hypothetical protein | 586 | 587 | database:431 |
Rv2933 ppsC |
phthiocerol synthesis polyketide synthase type I PpsC | 617 | 586 | database:459 |
Rv1527c pks5 |
polyketide synthase | 615 | 584 | database:459 |
Rv0673 echA4 |
enoyl-CoA hydratase EchA4 | 577 | 578 |
STRING combines evidence channels (neighborhood, fusion, cooccurrence, coexpression, experimental, database, text-mining) into a 0–1000 score. The ctx badge marks edges carried by the genomic-context channels (conserved neighborhood, fusion, phylogenetic co-occurrence), which are independent of orthology and structure and the strongest signal for an unknown gene. The no text-mining column recomputes the score from data alone, so a link that does not depend on the literature is visible. Association is a function hypothesis, not proof: corroborate with the operon context and the primary literature before assigning a function.
Evidence
- Structural Foldseek hit not propagated -- non-transferable cross-kingdom fold (human FAM91A1/TBC1D23 Golgi trafficking); function unknown
- Reviewed against literature (extended structural cross-check, 2026-06-02)
Sources
- Ancestral sequence & coordinates: Harrison LB et al. (2024), An imputed ancestral reference genome for the MTBC, doi:10.1101/2023.09.07.556366
- Product annotation: NCBI PGAP on MTBC0; legacy from H37Rv NC_000962.3 (RefSeq NP_217880.1)
- Domains: Pfam-A via hmmscan --cut_ga — DUF742 (PF05331.18)
- Sequence-level signal: ESM Atlas (EvolutionaryScale × BioHub) — exploratory
- Controlled vocabulary: eggNOG-mapper 2.1.12 (Cantalapiedra et al. 2021,
doi:10.1093/molbev/msab293), eggNOG 5.0 DB
(Huerta-Cepas et al. 2019) — OG
COG1846 - Curated reference: UniProt O50392 (TrEMBL, unreviewed; Predicted)
- Model confidence: ESMFold per-residue pLDDT (mean 58.2, low)
- Interaction network: STRING v12.0 (Szklarczyk et al. 2023,
doi:10.1093/nar/gkac1000), taxon 83332, CC-BY 4.0 —
112 functional partner(s); context anchor
Rv3362c - Primary literature: none located yet; annotation rests on the domain/homology sources above.
Ancestral MTBC0 protein sequence
>mtbc0_003578|Rv3363c| MFNPAGDRPKAGLVRPYTLTAGRTGTDVDLPLQAPVQTLPAGPAGRWPAYDMRRRILQLCIGSPSVAEISARLDLPVGVARVLVGDLVTSGYLRVHATLTDRSTRDERHELIGRTLRGLKAL