GCW_03760

UniProt: A0A0F6CLF2
Description: CRISPR-associated protein Csn1
EC number: 3.1.-.-
Annotation score: 3 out of 5
Miscellaneous [CC]: 
Protein existence: Inferred from homology
Catalytic activity: 
Cofactor: COFACTOR: Name=Mg(2+); Xref=ChEBI:CHEBI:18420; Evidence={ECO:0000256|HAMAP-Rule:MF_01480};
Enzyme regulation: 
Function [CC]: FUNCTION: CRISPR (clustered regularly interspaced short palindromic repeat) is an adaptive immune system that provides protection against mobile genetic elements (viruses, transposable elements and conjugative plasmids). CRISPR clusters contain spacers, sequences complementary to antecedent mobile elements, and target invading nucleic acids. CRISPR clusters are transcribed and processed into CRISPR RNA (crRNA). In type II CRISPR systems correct processing of pre-crRNA requires a trans-encoded small RNA (tracrRNA), endogenous ribonuclease 3 (rnc) and this protein. The tracrRNA serves as a guide for ribonuclease 3-aided processing of pre-crRNA. Subsequently Cas9/crRNA/tracrRNA endonucleolytically cleaves linear or circular dsDNA target complementary to the spacer; Cas9 is inactive in the absence of the 2 guide RNAs (gRNA). Cas9 recognizes the protospacer adjacent motif (PAM) in the CRISPR repeat sequences to help distinguish self versus nonself, as targets within the bacterial CRISPR locus do not have PAMs. PAM recognition is also required for catalytic activity. {ECO:0000256|HAMAP-Rule:MF_01480}.
Pathway: 
Active site: ACT_SITE 16 16 For RuvC-like nuclease domain. {ECO:0000256|HAMAP-Rule:MF_01480}.; ACT_SITE 670 670 Proton acceptor for HNH nuclease domain. {ECO:0000256|HAMAP-Rule:MF_01480}.
Binding site: 
Calcium binding: 
DNA binding: 
Metal binding: METAL 16 16 Magnesium 1. {ECO:0000256|HAMAP-Rule:MF_01480}.; METAL 16 16 Magnesium 2. {ECO:0000256|HAMAP-Rule:MF_01480}.; METAL 582 582 Magnesium 1. {ECO:0000256|HAMAP-Rule:MF_01480}.; METAL 586 586 Magnesium 1. {ECO:0000256|HAMAP-Rule:MF_01480}.; METAL 586 586 Magnesium 2. {ECO:0000256|HAMAP-Rule:MF_01480}.; METAL 824 824 Magnesium 2; via pros nitrogen. {ECO:0000256|HAMAP-Rule:MF_01480}.
Nucleotide binding: 
Site: 
Gene names (primary): csn1
Gene names (synonym): cas9
Mass: 148,526 Da
Subunit structure [CC]: SUBUNIT: Monomer. Binds crRNA and tracrRNA. {ECO:0000256|HAMAP-Rule:MF_01480}.
Gene ontology (GO): DNA binding [GO:0003677]; endonuclease activity [GO:0004519]; metal ion binding [GO:0046872]; RNA binding [GO:0003723]; defense response to virus [GO:0051607]; maintenance of CRISPR repeat elements [GO:0043571]
Gene ontology IDs: GO:0003677; GO:0003723; GO:0004519; GO:0043571; GO:0046872; GO:0051607
Chain: 
Signal peptide: 
Domain [CC]: DOMAIN: Has 2 endonuclease domains. The discontinuous RuvC-like domain cleaves the target DNA noncomplementary to crRNA while the HNH nuclease domain cleaves the target DNA complementary to crRNA. {ECO:0000256|HAMAP-Rule:MF_01480}.
Sequence similarities: SIMILARITY: Belongs to the CRISPR-associated Cas9 family. {ECO:0000256|HAMAP-Rule:MF_01480}.; SIMILARITY: Contains 1 HNH Cas9-type domain. {ECO:0000256|HAMAP-Rule:MF_01480}.
Protein families: CRISPR-associated Cas9 family
Coiled coil: 
Domain [FT]: DOMAIN 592 762 HNH Cas9-type. {ECO:0000259|PROSITE:PS51749}.
Motif: 
Region: 
EMBL: CP006916
ProteinModelPortal: 
MEROPS: 
EnsemblBacteria: AHB99924
KO: K09952
UniPathway: 
CDD: 
Gene3D: 
HAMAP: MF_01480
InterPro: IPR028629;IPR033114;IPR003615
PANTHER: 
PIRSF: 
PRINTS: 
PROSITE: PS51749
Pfam: PF13395
ProDom: 
SMART: 
SUPFAM: 
TIGRFAMs: 
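
The Active site and Metal binding fields in this record pack several features into a single line, each written as `KEY start end note. {evidence}.` and joined with `; `. Because a note can itself contain `; ` (as in "Magnesium 2; via pros nitrogen"), a plain split on `; ` would break entries apart. The following is a minimal sketch of one way to read such a field, assuming every entry follows that layout; the names `FEATURE_RE`, `parse_features` and `notes_by_position` are illustrative and not part of any UniProt tooling.

```python
import re
from collections import defaultdict

# Field values copied from the record above.
ACT_SITE = (
    "ACT_SITE 16 16 For RuvC-like nuclease domain. {ECO:0000256|HAMAP-Rule:MF_01480}.; "
    "ACT_SITE 670 670 Proton acceptor for HNH nuclease domain. {ECO:0000256|HAMAP-Rule:MF_01480}."
)
METAL = (
    "METAL 16 16 Magnesium 1. {ECO:0000256|HAMAP-Rule:MF_01480}.; "
    "METAL 16 16 Magnesium 2. {ECO:0000256|HAMAP-Rule:MF_01480}.; "
    "METAL 582 582 Magnesium 1. {ECO:0000256|HAMAP-Rule:MF_01480}.; "
    "METAL 586 586 Magnesium 1. {ECO:0000256|HAMAP-Rule:MF_01480}.; "
    "METAL 586 586 Magnesium 2. {ECO:0000256|HAMAP-Rule:MF_01480}.; "
    "METAL 824 824 Magnesium 2; via pros nitrogen. {ECO:0000256|HAMAP-Rule:MF_01480}."
)

# Assumed entry layout: KEY start end note. {evidence}.
FEATURE_RE = re.compile(
    r"(?P<key>[A-Z_]+) (?P<start>\d+) (?P<end>\d+) "
    r"(?P<note>.*?)\. \{(?P<evidence>[^}]*)\}\."
)


def parse_features(field):
    """Return one dict per feature entry found in a field value."""
    return [
        {
            "key": m.group("key"),
            "start": int(m.group("start")),
            "end": int(m.group("end")),
            "note": m.group("note"),
            "evidence": m.group("evidence"),
        }
        for m in FEATURE_RE.finditer(field)
    ]


def notes_by_position(features):
    """Group feature notes by their start position (1-based residue number)."""
    grouped = defaultdict(list)
    for feature in features:
        grouped[feature["start"]].append(feature["note"])
    return dict(grouped)


metal_sites = notes_by_position(parse_features(METAL))
active_sites = notes_by_position(parse_features(ACT_SITE))
```

Grouping by position makes it easy to see that residue 16 is annotated both as the RuvC-like active site and as a ligand for both magnesium ions.
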
Genomic location: 897216-901026 (+)

>nucleotide sequence
ATGAATAACTCTATAAAATCAAAACCAGAAGTTACAATTGGACTAGACCTGGGTGTGGGT
TCAGTTGGTTGAGCAATTGTTGATAACGAAACTAATATTATTCACCATTTGGGGAGCAGA
CTTTTTAGTCAAGCTAAAACAGCAGAAGATCGTCGAAGTTTTCGTGGTGTTAGACGTTTA
ATTAGAAGAAGAAAATACAAACTAAAGAGATTTGTAAATTTAATCTGAAAATATAATTCT
TATTTCGGATTTAAGAATAAAGAAGATATTCTTAACAACTATCAAGAACAACAAAAGCTT
CACAACACTGTTTTGAACTTAAAATCAGAAGCTCTTAATGCAAAAATTGATCCAAAAGCA
TTGTCTTGAATCTTGCACGACTACTTAAAAAATAGAGGTCATTTTTATGAAGATAATCGT
GATTTTAACGTTTATCCAACAAAAGAATTGGCTAAATACTTTGATAAGTATGGTTATTAT
AAGGGAATCATTGATAGCAAGGAAGACAACGACAATAAACTAGAAGAAGAATTAACAAAA
TACAAATTCTCTAATAAGCATTGATTAGAAGAAGTTAAAAAAGTATTATCAAATCAAACA
GGACTACCAGAAAAATTTAAAGAAGAATATGAATCTTTATTTAGCTATGTAAGAAATTAT
TCTGAAGGTCCTGGTAGTATCAACAGCGTTTCTCCTTACGGTATCTATCATTTAGACGAA
AAAGAAGGTAAAGTTGTTCAAAAGTATAACAACATTTGGGATAAGACGATCGGTAAATGT
AATATTTTCCCCGACGAGTATCGTGCTCCAAAAAATTCGCCGATCGCGATGATCTTTAAT
GAAATTAATGAATTAAGCACAATTAGAAGTTATTCTATTTATTTAACTGGTTGGTTCATT
AATCAAGAATTTAAAAAAGCATATTTAAACAAACTTTTAGATTTGCTTATTAAAACAAAC
GGAGAAAAACCAATCGATGCAAGACAATTCAAAAAACTAAGAGAGGAAACAATTGCTGAA
TCAATTGGTAAAGAAACGCTTAAAGATGTCGAAAATGAAGAAAAACTAGAAAAAGAAGAC
CATAAGTGAAAATTAAAAGGATTAAAGTTAAACACAAATGGTAAAATCCAATATAACGAT
CTTTCTTCGTTAGCAAAATTTGTTCACAAACTAAAACAACACCTTAAGCTTGATTTCTTA
TTAGAAGATCAATACGCAACTCTTGATAAAATAAACTTCTTACAGTCATTATTTGTTTAT
TTAGGTAAACATTTACGTTATAACAACAGAGTTGATTCAGCTAATTTAAAAGAGTTTAGT
GATAGTAATAGACTCTTTGAAAGAGTTCTACAAGAACAAAAAGATGGCTTGTTCAAACTA
TTTGAACAAACAGATAAGGACGATGAAAAGATCTTAGCTCAAACCCACTCTTTATCTACT
AAAGCAATGCTATTAGCCATCACTAGAATGACTAATTTAGATAATGATGAAGATAATCAG
AAAAATAACGATAAAGGGTGGAACTTTGAAGCAATTAAAAACTTCGATCAAAAGTTTATT
GATATCACCAAGACAAACAATAATCTAAGTTTAAAACAAGATAAAAGATATTTAGATGAT
CGATTTATTAATGATGCGATCTTATCACCTGGGGTTAAAAGAATTTTACGTGAGGCAACA
AAAGTATTTAATGCTATCCTCAAGCAATTTTCTCAGGAATATGATGTTACTAAAGTTGTT
ATCGAATTAGCTAGAGAGTTATCTGAAGAAAAAGAACTTGAAAATAATAAAAACTACAAG
AAATTAATTAAGAAAAATAGTGATAAGATCTCAGAGGGTTTAAAAGCTCTAGATATTGCA
GAAGATAAGATCGAAGACATCTTAAAGAGTCCAACTAAAAGCTATAAAGTTTTGCTTTGA
TTACAACAAGATCATATTGACCCTTATTCACAAAAAGAAATTGCTTTTGAGGATATTCTT
ACAAAAACTGAAAAAACTGAAATTGATCATATCATTCCTTATAGTATTTCTTTTGATGAT
TCTAGTAGTAATAAACTACTAGTGTTAGCAGAATCTAACCAAGCTAAATCAAATCAAACA
CCTTATGAATTCATTACTTCAGGAAATGCTGGAATTAAGTGAGAGGATTATGAGGCTTAC
TGTCGTAAATTTAAGGATGGAGATACTTCGTTATTAGATAGCACTCAAAGAAGTAAGAAG
TTTGCAAAGATGATGAAAACTGACACTTCTTCTAAATACGACATTGGGTTCTTGGCAAGA
AATCTTAATGATACAAGATATGCAACGATCGTTTTTAGAGATGCTTTAAAAGATTACGCA
AATAACCATTTAGTTGAAGACAAACCAATGTTTAAAGTGGTGTGTATTAATGGTGGTGTA
ACTTCATTCTTGCGTAAGAATTTCGATAAAAGTTGATATGCAAAAAAAGATCGTGATAAA
AACATTCACCACGCAGTAGATGCTAGTATTATTTCCATCTTCTCAAATAAGACAAAAACT
TTATTTGATCAACTCACACAATTTGCTGATTATAAACTTTTTAAAAACACTGACGGGAGT
TGAAAGAAGATTGATCCCAAAACTGGTGTAGTTACTGAAGTTACGGATGAAAACTGAAAA
CAAATTAGAGTTCGTAACCAAGTCTCTAAGATTGCCGAAGAGATCGACAAATGTATCCAG
GATAGTAATATCGAACGTAAAGCTCGATATTCTAGAAAAATTGAAAATAAAACAAACATT
AGTTTATTTAACGATACGGTTTACTCTGCTAAAAAGGTAGGTTACGACGACCAGATAAAA
AGAAAAAATCTTAAAACACTTGATATTGATGAGAGTGTAGAAGAGAACAAAAATTCTAAG
GTTAAAAAACAATTCGTTTATCGAAAATTAGTTAATGTTTCATTATTGAATAATGATAAG
TTAGCTGATTTGTTTGCTGAAAAAGAAGACATCTTAATGTATCGTGCTAATCCATGGGTT
ATAAATTTAGCTGAACAAATCTTTAACGAATACACTGAAAACAGGAAAATTAAAAGCCAA
AATGTATTTGGAAAATACATGTTAGATCTAACTAAAGAGTTCCCAGAAAAATTTTCGGAA
GCTTTCGTCAAATCGATGTTAAGAAATAAAACTGCAATTATCTATAATGTCGAAAAGAAA
GTAGTTCACCGCATTAAAAGATTAAAGATACTAAGTTCAGAACTAAAAGAAAATAAATTA
AGCAATGTGATCATTAGATCTAAGAACGAAAGTGGAACCAAACTCTCTTATCAAGATACG
ATCAACTCGGTCGCTTTAATGATTATGAGAAGTATTGATCCCACTGCAAAAAAACAATAT
ATCCGTGTTCCATTGAATACACTAAACCTTCATCTAGGAGATCATGATTTTGATCTCCAT
AATATAGATGCTTATTTAAAAAAACCTAAGTTTGTTAAATACTTAAAAGCAAATGAAATT
GGAGATGAATACAAACCTTGAAGGGTTTTAATATCAGGATCGCTTTTAATTCATAAAAGA
GATAAGAAACTTATGTATATCTCGTCTTTCCAAAATTTGAATGATTTAATTGAAATTAAA
AATTTAATTGAAACTGAGTATAAGGAAAACGTTGATAGCGATCCTAAAAAAAAGAAGAAA
GCAAGTCAGATCTTAAGGAGTTTAAGTACTATTTTAAATGACTACATCTTACTTGATGCT
AAAGACAATTTTGATATATTAGGGTTAAGCAAAAACAGAATTGATGAGATTTTAAACAGT
AAGCTTGACCTTGACAAAATAGCAAAATAG


>protein sequence
MNNSIKSKPEVTIGLDLGVGSVGWAIVDNETNIIHHLGSRLFSQAKTAEDRRSFRGVRRL
IRRRKYKLKRFVNLIWKYNSYFGFKNKEDILNNYQEQQKLHNTVLNLKSEALNAKIDPKA
LSWILHDYLKNRGHFYEDNRDFNVYPTKELAKYFDKYGYYKGIIDSKEDNDNKLEEELTK
YKFSNKHWLEEVKKVLSNQTGLPEKFKEEYESLFSYVRNYSEGPGSINSVSPYGIYHLDE
KEGKVVQKYNNIWDKTIGKCNIFPDEYRAPKNSPIAMIFNEINELSTIRSYSIYLTGWFI
NQEFKKAYLNKLLDLLIKTNGEKPIDARQFKKLREETIAESIGKETLKDVENEEKLEKED
HKWKLKGLKLNTNGKIQYNDLSSLAKFVHKLKQHLKLDFLLEDQYATLDKINFLQSLFVY
LGKHLRYNNRVDSANLKEFSDSNRLFERVLQEQKDGLFKLFEQTDKDDEKILAQTHSLST
KAMLLAITRMTNLDNDEDNQKNNDKGWNFEAIKNFDQKFIDITKTNNNLSLKQDKRYLDD
RFINDAILSPGVKRILREATKVFNAILKQFSQEYDVTKVVIELARELSEEKELENNKNYK
KLIKKNSDKISEGLKALDIAEDKIEDILKSPTKSYKVLLWLQQDHIDPYSQKEIAFEDIL
TKTEKTEIDHIIPYSISFDDSSSNKLLVLAESNQAKSNQTPYEFITSGNAGIKWEDYEAY
CRKFKDGDTSLLDSTQRSKKFAKMMKTDTSSKYDIGFLARNLNDTRYATIVFRDALKDYA
NNHLVEDKPMFKVVCINGGVTSFLRKNFDKSWYAKKDRDKNIHHAVDASIISIFSNKTKT
LFDQLTQFADYKLFKNTDGSWKKIDPKTGVVTEVTDENWKQIRVRNQVSKIAEEIDKCIQ
DSNIERKARYSRKIENKTNISLFNDTVYSAKKVGYDDQIKRKNLKTLDIDESVEENKNSK
VKKQFVYRKLVNVSLLNNDKLADLFAEKEDILMYRANPWVINLAEQIFNEYTENRKIKSQ
NVFGKYMLDLTKEFPEKFSEAFVKSMLRNKTAIIYNVEKKVVHRIKRLKILSSELKENKL
SNVIIRSKNESGTKLSYQDTINSVALMIMRSIDPTAKKQYIRVPLNTLNLHLGDHDFDLH
NIDAYLKKPKFVKYLKANEIGDEYKPWRVLISGSLLIHKRDKKLMYISSFQNLNDLIEIK
NLIETEYKENVDSDPKKKKKASQILRSLSTILNDYILLDAKDNFDILGLSKNRIDEILNS
KLDLDKIAK
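
The nucleotide sequence above contains in-frame TGA codons at positions where the protein sequence has W (for example codon 24, in the second sequence line). That pattern is consistent with a genetic code in which TGA encodes tryptophan, such as NCBI translation table 4 (Mycoplasma/Spiroplasma), rather than the standard code, where TGA is a stop. The following is a minimal sketch that translates the first 120 nucleotides both ways; the helper names `codon_table` and `translate` are illustrative, and the table is built by hand so the example needs no external library.

```python
BASES = "TCAG"
# Amino acids of the standard genetic code, in TCAG codon order (TTT, TTC, TTA, ...).
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# First two lines (120 nt) of the nucleotide sequence in this record.
CDS_START = (
    "ATGAATAACTCTATAAAATCAAAACCAGAAGTTACAATTGGACTAGACCTGGGTGTGGGT"
    "TCAGTTGGTTGAGCAATTGTTGATAACGAAACTAATATTATTCACCATTTGGGGAGCAGA"
)


def codon_table(tga_as_trp):
    """Build a codon -> amino acid map; optionally read TGA as Trp (table 4 style)."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AAS))
    if tga_as_trp:
        table["TGA"] = "W"
    return table


def translate(dna, tga_as_trp=True):
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    table = codon_table(tga_as_trp)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


with_trp = translate(CDS_START, tga_as_trp=True)
standard = translate(CDS_START, tga_as_trp=False)
```

With TGA read as tryptophan, `with_trp` matches the first 40 residues of the protein sequence listed above; with the standard code, `standard` has a stop at residue 24 instead.
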

© Fisunov Lab of Proteomics, 2016.