crgA Resolved · high auto-curated

H37Rv Rv0011c · MTBC0 mtbc0_000015 · 93 aa · 13714–13995 (-) · RefSeq NP_214525.1

Annotation: from legacy to revised

Legacy (H37Rv / Mycobrowser)cell division protein CrgA
MTBC0 PGAP re-annotationcell division protein CrgA
Revised (this work)Cell division protein CrgA. Pfam: CrgA (PF06781.18).

Auto-curated: this verdict and function were generated by rules from PGAP + Pfam + Foldseek and have not been hand-reviewed.

Curated reference (UniProt)

UniProt P9WP57 SwissProt · reviewed · Evidence at protein level
UniProt nameCell division protein CrgA
Curated functionInvolved in cell division. Plays an important role in septal peptidoglycan synthesis and cell shape morphogenesis. May facilitate the recruitment of the peptidoglycan synthesis machinery to poles and septal zones and coordinate peptidoglycan synthesis at these sites.

Functional vocabulary (eggNOG-mapper, orthology transfer)

COG category D Cell cycle control, cell division, chromosome partitioning
Preferred namecrgA
eggNOG descriptionCell division protein CrgA
Orthologous group2B592

Orthology-based transfer (eggNOG 5.0.2, diamond). EC/KO/GO/CAZy are computed annotations, not manual curation; cross-check against the primary literature before treating a specific reaction as established.

Conservation & selection (intra-MTBC, 145 209 strains)

pN/pS 0.0 · strong purifying
Polymorphic sites (≥ 0.1% of strains) 1 synonymous, 0 missense, 0 nonsense, 0 frameshift

pN/pS from segregating SNPs (singletons removed) normalised by possible sites. Low pN/pS = purifying selection (a strong signal that a "hypothetical" is a real, constrained gene). A high pN/pS is ambiguous: relaxed constraint or positive selection (drug resistance, antigenic variation) inflate it; e.g. rpoB/katG/pncA score high here for resistance, not loss of function. A clonal disruption (one allele over a clade) suggests lineage pseudogenisation; a convergent one (many independent alleles) is typical of resistance loss-of-function.

Domains (Pfam, hmmscan --cut_ga)

PfamAccessioni-EvalueResiduesDescription
CrgAPF06781.18 1.1e-271–93 Cell division protein CrgA

Functional interaction network (STRING v12, guilt-by-association)

Closest characterised functional partner: Rv0010c (membrane protein), high confidence from genomic context alone (score 797 excluding text-mining).

PartnerProductScoreNo text-miningChannels (≥400)
Rv0010c membrane protein 903 797 ctx neighborhood:749 textmining:545
Rv0012 membrane protein 784 785 ctx neighborhood:782
Rv0013 trpG anthranilate synthase component II 749 749 ctx neighborhood:747
Rv3311 hyp hypothetical protein 663 663 ctx cooccurence:661
Rv2170 GCN5-like N-acetyltransferase 638 638 ctx cooccurence:638
Rv3259 hyp hypothetical protein 613 613 ctx cooccurence:611
Rv2520c membrane protein 596 597 ctx cooccurence:594
Rv3212 hyp hypothetical protein 542 543 ctx cooccurence:538
Rv0016c pbpA penicillin-binding protein PbpA 901 522 textmining:802
Rv2150c ftsZ cell division protein FtsZ 875 514 experimental:500 textmining:754
Rv2163c pbpB penicillin-binding membrane protein PbpB 900 506 textmining:806
Rv0008c cwsA cell wall synthesis protein CwsA 937 501 experimental:500 textmining:879
Rv3782 glfT1 galactofuranosyl transferase GlfT 485 486 ctx cooccurence:471
Rv1417 membrane protein 485 485 ctx cooccurence:480
Rv1222 rseA anti-sigma E factor RseA 448 449 ctx cooccurence:447

STRING combines evidence channels (neighborhood, fusion, cooccurrence, coexpression, experimental, database, text-mining) into a 0–1000 score. The ctx badge marks edges carried by the genomic-context channels (conserved neighborhood, fusion, phylogenetic co-occurrence), which are independent of orthology and structure and the strongest signal for an unknown gene. The no text-mining column recomputes the score from data alone, so a link that does not depend on the literature is visible. Association is a function hypothesis, not proof: corroborate with the operon context and the primary literature before assigning a function.

Evidence

  • Legacy H37Rv annotation: cell division protein CrgA
  • MTBC0 PGAP product: cell division protein CrgA
  • Pfam (hmmscan --cut_ga): CrgA PF06781.18 (E=1e-27)
  • (auto-curated by rules from PGAP + Pfam + Foldseek; not hand-reviewed)

Sources

  • Ancestral sequence & coordinates: Harrison LB et al. (2024), An imputed ancestral reference genome for the MTBC, doi:10.1101/2023.09.07.556366
  • Product annotation: NCBI PGAP on MTBC0; legacy from H37Rv NC_000962.3 (RefSeq NP_214525.1)
  • Domains: Pfam-A via hmmscan --cut_ga — CrgA (PF06781.18)
  • Sequence-level signal: ESM Atlas (EvolutionaryScale × BioHub) — exploratory
  • Controlled vocabulary: eggNOG-mapper 2.1.12 (Cantalapiedra et al. 2021, doi:10.1093/molbev/msab293), eggNOG 5.0 DB (Huerta-Cepas et al. 2019) — OG 2B592
  • Curated reference: UniProt P9WP57 (SwissProt, reviewed; Evidence at protein level)
  • Intra-MTBC selection: pN/pS and disruption from SPDI variants of 145 209 MTBC strains (this work, local collection vs H37Rv NC_000962.3)
  • Interaction network: STRING v12.0 (Szklarczyk et al. 2023, doi:10.1093/nar/gkac1000), taxon 83332, CC-BY 4.0 — 50 functional partner(s); context anchor Rv0010c
  • Primary literature: none located yet; annotation rests on the domain/homology sources above.

Ancestral MTBC0 protein sequence

>mtbc0_000015|Rv0011c|crgA
MPKSKVRKKNDFTVSAVSRTPMKVKVGPSSVWFVSLFIGLMLIGLIWLMVFQLAAIGSQAPTALNWMAQLGPWNYAIAFAFMITGLLLTMRWH