| Identification |
| HMDB Protein ID
| CDBP04657 |
| Secondary Accession Numbers
| Not Available |
| Name
| Cytosine-specific methyltransferase |
| Description
| Not Available |
| Synonyms
|
Not Available
|
| Gene Name
| Not Available |
| Protein Type
| Enzyme |
| Biological Properties |
| General Function
| Involved in DNA binding |
| Specific Function
| S-adenosyl-L-methionine + DNA = S-adenosyl-L- homocysteine + DNA containing 5-methylcytosine |
| GO Classification
|
| Component |
| organelle |
| membrane-bounded organelle |
| intracellular membrane-bounded organelle |
| nucleus |
| Function |
| catalytic activity |
| transition metal ion binding |
| zinc ion binding |
| transferase activity |
| protein binding |
| transcription factor binding |
| nucleic acid binding |
| dna binding |
| s-adenosylmethionine-dependent methyltransferase activity |
| transferase activity, transferring one-carbon groups |
| methyltransferase activity |
| dna (cytosine-5-)-methyltransferase activity |
| ion binding |
| cation binding |
| metal ion binding |
| binding |
| Process |
| dna modification |
| dna alkylation |
| dna methylation |
| metabolic process |
| macromolecule metabolic process |
| cellular macromolecule metabolic process |
| dna metabolic process |
|
| Cellular Location
|
- Cytoplasmic
|
| Pathways
|
Not Available
|
| Gene Properties |
| Chromosome Location
| Not Available |
| Locus
| Not Available |
| SNPs
| Not Available |
| Gene Sequence
|
>4822 bp
GGCCGTCCCGGCCATCTCGCTGCCCGACGATGTCCGCAGGCGGCTCAAAGATTTGGAAAG
AGACAGCTTAACAGAAAAGGAATGTGTGAAGGAGAAATTGAATCTCTTGCACGAATTTCT
GCAAACAGAAATAAAGAATCAGTTATGTGACTTGGAAACCAAATTACGTAAAGAAGAATT
ATCCGAGGAGGGCTACCTGGCTAAAGTCAAATCCCTTTTAAATAAAGATTTGTCCTTGGA
GAACGGTGCTCATGCTTACAACCGGGAAGTGAATGGACGTCTAGAAAACGGGAACCAAGC
AAGAAGTGAAGCCCGTAGAGTGGGAATGGCAGATGCCAACAGCCCCCCCAAACCCCTTTC
CAAACCTCGCACGCCCAGGAGGAGCAAGTCCGATGGAGAGGCTAAGCCTGAACCTTCACC
TAGCCCCAGGATTACAAGGAAAAGCACCAGGCAAACCACCATCACATCTCATTTTGCAAA
GGGCCCTGCCAAACGGAAACCTCAGGAAGAGTCTGAAAGAGCCAAATCGGATGAGTCCAT
CAAGGAAGAAGACAAAGACCAGGATGAGAAGAGACGTAGAGTTACATCCAGAGAACGAGT
TGCTAGACCGCTTCCTGCAGAAGAACCTGAAAGAGCAAAATCAGGAACGCGCACTGAAAA
GGAAGAAGAAAGAGATGAAAAAGAAGAAAAGAGACTCCGAAGTCAAACCAAAGAACCAAC
ACCCAAACAGAAACTGAAGGAGGAGCCGGACAGAGAAGCCAGGGCAGGCGTGCAGGCTGA
CGAGGACGAAGATGGAGACGAGAAAGATGAGAAGAAGCACAGAAGTCAACCCAAAGATCT
AGCTGCCAAACGGAGGCCCGAAGAAAAAGAACCTGAAAAAGTAAATCCACAGATTTCTGA
TGAAAAAGACGAGGATGAAAAGGAGGAGAAGAGACGCAAAACGACCCCCAAAGAACCAAC
GGAGAAAAAAATGGCTCGCGCCAAAACAGTCATGAACTCCAAGACCCACCCTCCCAAGTG
CATTCAGTGCGGGCAGTACCTGGACGACCCTGACCTCAAATATGGGCAGCACCCACCAGA
CGCGGTGGATGAGCCACAGATGCTGACAAATGAGAAGCTGTCCATCTTTGATGCCAACGA
GTCTGGCTTTGAGAGTTATGAGGCGCTTCCCCAGCACAAACTGACCTGCTTCAGTGTGTA
CTGTAAGCACGGTCACCTGTGTCCCATCGACACCGGCCTCATCGAGAAGAATATCGAACT
CTTCTTTTCTGGTTCAGCAAAACCAATCTATGATGATGACCCGTCTCTTGAAGGTGGTGT
TAATGGCAAAAATCTTGGCCCCATAAATGAATGGTGGATCACTGGCTTTGATGGAGGTGA
AAAGGCCCTCATCGGCTTCAGCACCTCATTTGCCGAATACATTCTGATGGATCCCAGTCC
CGAGTATGCGCCCATATTTGGGCTGATGCAGGAGAAGATCTACATCAGCAAGATTGTGGT
GGAGTTCCTGCAGAGCAATTCCGACTCGACCTATGAGGACCTGATCAACAAGATCGAGAC
CACGGTTCCTCCTTCTGGCCTCAACTTGAACCGCTTCACAGAGGACTCCCTCCTGCGACA
CGCGCAGTTTGTGGTGGAGCAGGTGGAGAGTTATGACGAGGCCGGGGACAGTGATGAGCA
GCCCATCTTCCTGACGCCCTGCATGCGGGACCTGATCAAGCTGGCTGGGGTCACGCTGGG
ACAGAGGCGAGCCCAGGCGAGGCGGCAGACCATCAGGCATTCTACCAGGGAGAAGGACAG
GGGACCCACGAAAGCCACCACCACCAAGCTGGTCTACCAGATCTTCGATACTTTCTTCGC
AGAGCAAATTGAAAAGGATGACAGAGAAGACAAGGAGAACGCCTTTAAGCGCCGGCGATG
TGGCGTCTGTGAGGTGTGTCAGCAGCCTGAGTGTGGGAAATGTAAAGCCTGCAAGGACAT
GGTTAAATTTGGTGGCAGTGGACGGAGCAAGCAGGCTTGCCAAGAGCGGAGGTGTCCCAA
TATGGCCATGAAGGAGGCAGATGACGATGAGGAAGTCGATGATAACATCCCAGAGATGCC
GTCACCCAAAAAAATGCACCAGGGGAAGAAGAAGAAACAGAACAAGAATCGCATCTCTTG
GGTCGGAGAAGCCGTCAAGACTGATGGGAAGAAGAGTTACTATAAGAAGGTGTGCATTGA
TGCGGAAACCCTGGAAGTGGGGGACTGTGTCTCTGTTATTCCAGATGATTCCTCAAAACC
GCTGTATCTAGCAAGGGTCACGGCGCTGTGGGAGGACAGCAGCAACGGGCAGATGTTTCA
CGCCCACTGGTTCTGCGCTGGGACAGACACAGTCCTCGGGGCCACGTCGGACCCTCTGGA
GCTGTTCTTGGTGGATGAATGTGAGGACATGCAGCTTTCATATATCCACAGCAAAGTGAA
AGTCATCTACAAAGCCCCCTCCGAAAACTGGGCCATGGAGGGAGGCATGGATCCCGAGTC
CCTGCTGGAGGGGGACGACGGGAAGACCTACTTCTACCAGCTGTGGTATGATCAAGACTA
CGCGAGATTCGAGTCCCCTCCAAAAACCCAGCCAACAGAGGACAACAAGTTCAAATTCTG
TGTGAGCTGTGCCCGTCTGGCTGAGATGAGGCAAAAAGAAATCCCCAGGGTCCTGGAGCA
GCTCGAGGACCTGGATAGCCGGGTCCTCTACTACTCAGCCACCAAGAACGGCATCCTGTA
CCGAGTTGGTGATGGTGTGTACCTGCCCCCTGAGGCCTTCACGTTCAACATCAAGCTGTC
CAGTCCCGTGAAACGCCCACGGAAGGAGCCCGTGGATGAGGACCTGTACCCAGAGCACTA
CCGGAAATACTCCGACTACATCAAAGGCAGCAACCTGGATGCCCCTGAGCCCTACCGAAT
TGGCCGGATCAAAGAGATCTTCTGTCCCAAGAAGAGCAACGGCAGGCCCAATGAGACTGA
CATCAAAATCCGGGTCAACAAGTTCTACAGGCCTGAGAACACCCACAAGTCCACTCCAGC
GAGCTACCACGCAGACATCAACCTGCTCTACTGGAGCGACGAGGAGGCCGTGGTGGACTT
CAAGGCTGTGCAGGGCCGCTGCACCGTGGAGTATGGGGAGGACCTGCCCGAGTGCGTCCA
GGTGTACTCCATGGGCGGCCCCAACCGCTTCTACTTCCTCGAGGCCTATAATGCAAAGAG
CAAAAGCTTTGAAGATCCTCCCAACCATGCCCGTAGCCCTGGAAACAAAGGGAAGGGCAA
GGGAAAAGGGAAGGGCAAGCCCAAGTCCCAAGCCTGTGAGCCGAGCGAGCCAGAGATAGA
GATCAAGCTGCCCAAGCTGCGGACCCTGGATGTGTTTTCTGGCTGCGGGGGGTTGTCGGA
GGGATTCCACCAAGCAGGCATCTCTGACACGCTGTGGGCCATCGAGATGTGGGACCCTGC
GGCCCAGGCGTTCCGGCTGAACAACCCCGGCTCCACAGTGTTCACAGAGGACTGCAACAT
CCTGCTGAAGCTGGTCATGGCTGGGGAGACCACCAACTCCCGCGGCCAGCGGCTGCCCCA
GAAGGGAGACGTGGAGATGCTGTGCGGCGGGCCGCCCTGCCAGGGCTTCAGCGGCATGAA
CCGCTTCAATTCGCGCACCTACTCCAAGTTCAAAAACTCTCTGGTGGTTTCCTTCCTCAG
CTACTGCGACTACTACCGGCCCCGGTTCTTCCTCCTGGAGAATGTCAGGAACTTTGTCTC
CTTCAAGCGCTCCATGGTCCTGAAGCTCACCCTCCGCTGCCTGGTCCGCATGGGCTATCA
GTGCACCTTCGGCGTGCTGCAGGCCGGTCAGTACGGCGTGGCCCAGACTAGGAGGCGGGC
CATCATCCTGGCCGCGGCCCCTGGAGAGAAGCTCCCTCTGTTCCCGGAGCCACTGCACGT
GTTTGCTCCCCGGGCCTGCCAGCTGAGCGTGGTGGTGGATGACAAGAAGTTTGTGAGCAA
CATAACCAGGTTGAGCTCGGGTCCTTTCCGGACCATCACGGTGCGAGACACGATGTCCGA
CCTGCCGGAGGTGCGGAATGGAGCCTCGGCACTGGAGATCTCCTACAACGGGGAGCCTCA
GTCCTGGTTCCAGAGGCAGCTCCGGGGCGCACAGTACCAGCCCATCCTCAGGGACCACAT
CTGTAAGGACATGAGTGCATTGGTGGCTGCCCGCATGCGGCACATCCCCTTGGCCCCAGG
GTCAGACTGGCGCGATCTGCCCAACATCGAGGTGCGGCTCTCAGACGGCACCATGGCCAG
GAAGCTGCGGTATACCCACCATGACAGGAAGAACGGCCGCAGCAGCTCTGGGGCCCTCCG
TGGGGTCTGCTCCTGCGTGGAAGGTGGGTCCTCCGGCAAAGCCTGCGACCCCGCAGCCAG
GCAGTTCAACACCCTCATCCCCTGGTGCCTGCCCCACACCGGGAACCGGCACAACCACTG
GGCTGGCCTCTATGGAAGGCTCGAGTGGGACGGCTTCTTCAGCACAACCGTCACCAACCC
CGAGCCCATGGGCAAGCAGGGCCGCGTGCTCCACCCAGAGCAGCACCGTGTGGTGAGCGT
GCGGGAGTGTGCCCGCTCCCAGGGCTTCCCTGACACCTACCGGCTCTTCGGCAACATCCT
GGACAAGCACCGGCAGGTGGGCAATGCCGTGCCACCGCCCCTGGCCAAAGCCATTGGCTT
GGAGATCAAGCTTTGTATGTTGGCCAAAGCCCGAGAGAGTGCCTCAGCTAAAATAAAGGA
GGAGGAAGCTGCTAAGGACTAG
|
| Protein Properties |
| Number of Residues
| 1606 |
| Molecular Weight
| 182018.2 |
| Theoretical pI
| 7.69 |
| Pfam Domain Function
|
|
| Signals
|
|
|
Transmembrane Regions
|
|
| Protein Sequence
|
>Cytosine-specific methyltransferase
AVPAISLPDDVRRRLKDLERDSLTEKECVKEKLNLLHEFLQTEIKNQLCDLETKLRKEEL
SEEGYLAKVKSLLNKDLSLENGAHAYNREVNGRLENGNQARSEARRVGMADANSPPKPLS
KPRTPRRSKSDGEAKPEPSPSPRITRKSTRQTTITSHFAKGPAKRKPQEESERAKSDESI
KEEDKDQDEKRRRVTSRERVARPLPAEEPERAKSGTRTEKEEERDEKEEKRLRSQTKEPT
PKQKLKEEPDREARAGVQADEDEDGDEKDEKKHRSQPKDLAAKRRPEEKEPEKVNPQISD
EKDEDEKEEKRRKTTPKEPTEKKMARAKTVMNSKTHPPKCIQCGQYLDDPDLKYGQHPPD
AVDEPQMLTNEKLSIFDANESGFESYEALPQHKLTCFSVYCKHGHLCPIDTGLIEKNIEL
FFSGSAKPIYDDDPSLEGGVNGKNLGPINEWWITGFDGGEKALIGFSTSFAEYILMDPSP
EYAPIFGLMQEKIYISKIVVEFLQSNSDSTYEDLINKIETTVPPSGLNLNRFTEDSLLRH
AQFVVEQVESYDEAGDSDEQPIFLTPCMRDLIKLAGVTLGQRRAQARRQTIRHSTREKDR
GPTKATTTKLVYQIFDTFFAEQIEKDDREDKENAFKRRRCGVCEVCQQPECGKCKACKDM
VKFGGSGRSKQACQERRCPNMAMKEADDDEEVDDNIPEMPSPKKMHQGKKKKQNKNRISW
VGEAVKTDGKKSYYKKVCIDAETLEVGDCVSVIPDDSSKPLYLARVTALWEDSSNGQMFH
AHWFCAGTDTVLGATSDPLELFLVDECEDMQLSYIHSKVKVIYKAPSENWAMEGGMDPES
LLEGDDGKTYFYQLWYDQDYARFESPPKTQPTEDNKFKFCVSCARLAEMRQKEIPRVLEQ
LEDLDSRVLYYSATKNGILYRVGDGVYLPPEAFTFNIKLSSPVKRPRKEPVDEDLYPEHY
RKYSDYIKGSNLDAPEPYRIGRIKEIFCPKKSNGRPNETDIKIRVNKFYRPENTHKSTPA
SYHADINLLYWSDEEAVVDFKAVQGRCTVEYGEDLPECVQVYSMGGPNRFYFLEAYNAKS
KSFEDPPNHARSPGNKGKGKGKGKGKPKSQACEPSEPEIEIKLPKLRTLDVFSGCGGLSE
GFHQAGISDTLWAIEMWDPAAQAFRLNNPGSTVFTEDCNILLKLVMAGETTNSRGQRLPQ
KGDVEMLCGGPPCQGFSGMNRFNSRTYSKFKNSLVVSFLSYCDYYRPRFFLLENVRNFVS
FKRSMVLKLTLRCLVRMGYQCTFGVLQAGQYGVAQTRRRAIILAAAPGEKLPLFPEPLHV
FAPRACQLSVVVDDKKFVSNITRLSSGPFRTITVRDTMSDLPEVRNGASALEISYNGEPQ
SWFQRQLRGAQYQPILRDHICKDMSALVAARMRHIPLAPGSDWRDLPNIEVRLSDGTMAR
KLRYTHHDRKNGRSSSGALRGVCSCVEGGSSGKACDPAARQFNTLIPWCLPHTGNRHNHW
AGLYGRLEWDGFFSTTVTNPEPMGKQGRVLHPEQHRVVSVRECARSQGFPDTYRLFGNIL
DKHRQVGNAVPPPLAKAIGLEIKLCMLAKARESASAKIKEEEAAKD
|
| External Links |
| GenBank ID Protein
| 62088406 |
| UniProtKB/Swiss-Prot ID
| Q59FP7 |
| UniProtKB/Swiss-Prot Entry Name
| Q59FP7_HUMAN |
| PDB IDs
|
Not Available |
| GenBank Gene ID
| AB209413 |
| GeneCard ID
| Not Available |
| GenAtlas ID
| Not Available |
| HGNC ID
| HGNC:2976 |
| References |
| General References
| Not Available |