Showing Protein Histone-lysine N-methyltransferase NSD2 (CDBP04214)
Identification | ||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
HMDB Protein ID | CDBP04214 | |||||||||||||||||||||||||||||||||||||||||
Secondary Accession Numbers | Not Available | |||||||||||||||||||||||||||||||||||||||||
Name | Histone-lysine N-methyltransferase NSD2 | |||||||||||||||||||||||||||||||||||||||||
Description | Not Available | |||||||||||||||||||||||||||||||||||||||||
Synonyms |
|
|||||||||||||||||||||||||||||||||||||||||
Gene Name | WHSC1 | |||||||||||||||||||||||||||||||||||||||||
Protein Type | Enzyme | |||||||||||||||||||||||||||||||||||||||||
Biological Properties | ||||||||||||||||||||||||||||||||||||||||||
General Function | Involved in histone-lysine N-methyltransferase activity | |||||||||||||||||||||||||||||||||||||||||
Specific Function | Histone methyltransferase with histone H3 'Lys-27' (H3K27me) methyltransferase activity. Isoform RE-IIBP may act as a transcription regulator that binds DNA and suppresses IL5 transcription through HDAC recruitment. | |||||||||||||||||||||||||||||||||||||||||
GO Classification |
|
|||||||||||||||||||||||||||||||||||||||||
Cellular Location |
|
|||||||||||||||||||||||||||||||||||||||||
Pathways |
|
|||||||||||||||||||||||||||||||||||||||||
Gene Properties | ||||||||||||||||||||||||||||||||||||||||||
Chromosome Location | 4 | |||||||||||||||||||||||||||||||||||||||||
Locus | 4p16.3 | |||||||||||||||||||||||||||||||||||||||||
SNPs | WHSC1 | |||||||||||||||||||||||||||||||||||||||||
Gene Sequence |
>4098 bp ATGGAATTTAGCATCAAGCAGAGTCCCCTTTCTGTTCAGAGTGTTGTAAAGTGCATAAAG ATGAAGCAGGCACCAGAAATCCTCGGCAGTGCCAACGGGAAGACTCCGAGCTGCGAGGTG AACCGCGAGTGTTCTGTGTTCCTCAGCAAAGCCCAGCTCTCCAGTAGCCTGCAGGAGGGG GTCATGCAGAAGTTTAACGGCCACGACGCCCTGCCCTTTATTCCAGCCGACAAGCTGAAA GATCTTACTTCCCGGGTGTTTAATGGAGAACCCGGCGCACACGATGCCAAACTGCGTTTT GAGTCCCAGGAAATGAAAGGGATTGGGACACCCCCTAACACTACCCCTATCAAAAATGGC TCTCCAGAAATTAAGCTGAAAATCACCAAAACATACATGAATGGGAAGCCTCTCTTTGAA TCTTCCATTTGTGGTGACAGTGCTGCTGATGTGTCTCAGTCAGAAGAAAATGGACAAAAA CCAGAAAACAAGGCGAGAAGGAACAGGAAGAGGAGCATAAAATATGACTCCTTGCTGGAG CAGGGCCTTGTCGAAGCAGCTCTTGTGTCTAAGATCTCAAGTCCTTCAGATAAAAAGATT CCAGCTAAGAAAGAGTCTTGTCCAAACACTGGAAGAGACAAAGACCACCTGTTGAAATAC AACGTTGGTGATTTGGTGTGGTCCAAAGTGTCGGGTTACCCTTGGTGGCCTTGCATGGTT TCTGCAGATCCACTCCTTCACAGCTATACCAAACTTAAAGGTCAGAAAAAGAGTGCACGC CAGTATCACGTACAGTTCTTTGGTGACGCCCCAGAAAGAGCTTGGATATTTGAGAAGAGC CTCGTAGCTTTTGAAGGAGAAGGACAGTTTGAAAAATTATGCCAGGAAAGTGCCAAGCAG GCACCCACGAAAGCTGAGAAAATTAAGCTATTGAAACCAATTTCAGGGAAATTGAGGGCC CAGTGGGAAATGGGCATTGTTCAAGCAGAAGAAGCTGCAAGCATGTCAGTGGAGGAGCGG AAAGCCAAGTTCACCTTTCTCTATGTGGGGGACCAGCTTCATCTCAACCCTCAAGTAGCC AAGGAGGCTGGCATTGCTGCAGAGTCTTTGGGAGAAATGGCAGAATCCTCAGGAGTCAGT GAAGAAGCTGCTGAAAACCCCAAGTCTGTGAGAGAAGAGTGCATTCCCATGAAGAGAAGG CGGAGGGCCAAACTGTGTAGCTCTGCAGAGACCCTGGAGAGTCACCCCGACATAGGGAAG AGTACTCCTCAAAAGACGGCAGAGGCTGACCCCAGAAGAGGAGTAGGGTCTCCTCCTGGG AGGAAGAAGACCACAGTCTCCATGCCACGAAGCAGGAAGGGAGATGCAGCATCCCAGTTT TTGGTCTTCTGTCAAAAACACAGGGATGAGGTGGTAGCTGAGCACCCAGATGCTTCAGGT GAGGAGATTGAAGAGCTGCTCAGGTCACAGTGGAGTCTGCTGAGTGAGAAGCAGAGAGCA CGCTACAACACCAAGTTTGCCCTGGTGGCCCCTGTCCAGGCTGAAGAAGACTCTGGTAAT GTAAATGGGAAAAAAAGAAACCACACAAAGAGGATACAGGACCCTACAGAAGATGCTGAA GCTGAGGACACACCCAGGAAAAGACTCAGGACGGACAAGCACAGTCTTCGGAAGAGAGAC ACAATCACTGACAAAACGGCCAGAACAAGCTCTTACAAGGCCATGGAGGCAGCCTCCTCG CTCAAGAGCCAGGCAGCAACGAAAAATCTGTCTGATGCATGTAAACCACTGAAGAAGCGA AATCGGGCTTCCACGGCAGCATCTTCAGCTCTTGGGTTTAGCAAAAGTTCATCTCCTTCT GCATCCTTAACTGAGAATGAGGTCTCGGACAGCCCGGGAGACGAGCCCTCGGAGTCCCCA TACGAAAGTGCAGACGAAACACAAACTGAAGTATCTGTCTCATCCAAAAAGTCTGAGCGA GGAGTGACTGCCAAAAAGGAGTATGTGTGCCAGCTGTGTGAGAAGCCGGGCAGCCTCCTG CTCTGTGAAGGACCCTGCTGCGGAGCTTTCCACCTCGCCTGCCTTGGGCTTTCCCGGAGG CCAGAAGGGAGGTTCACCTGCAGCGAGTGTGCCTCAGGGATTCACTCATGTTTCGTGTGT AAAGAGAGCAAGACAGATGTTAAGCGCTGTGTGGTAACTCAGTGTGGAAAATTTTACCAT GAGGCTTGTGTGAAAAAATACCCTCTGACTGTATTTGAGAGCCGAGGTTTCCGCTGCCCC CTCCACAGCTGTGTGAGCTGCCATGCTTCCAACCCTTCAAACCCAAGGCCGTCAAAAGGT AAAATGATGCGGTGTGTCCGCTGCCCCGTTGCCTATCACAGCGGGGATGCTTGTCTGGCA GCAGGATGCTCAGTGATCGCCTCCAACAGCATCATCTGCACTGCCCACTTCACTGCTCGG AAGGGGAAGCGACACCACGCCCACGTCAACGTGAGCTGGTGCTTCGTGTGCTCCAAAGGG GGGAGCCTTCTGTGCTGTGAGTCCTGCCCAGCGGCCTTCCACCCTGACTGCCTGAACATC GAGATGCCTGACGGCAGCTGGTTCTGCAATGACTGCAGGGCTGGGAAGAAGCTGCACTTC CAGGATATCATTTGGGTGAAACTTGGGAACTACAGATGGTGGCCGGCAGAAGTTTGCCAT CCCAAAAATGTTCCCCCAAATATTCAGAAAATGAAGCACGAGATTGGAGAATTCCCTGTG TTTTTCTTTGGGTCTAAAGATTATTACTGGACGCATCAGGCGCGAGTGTTCCCGTACATG GAGGGGGACCGGGGCAGCCGCTACCAGGGGGTCAGAGGGATCGGAAGAGTCTTCAAAAAC GCACTGCAAGAAGCTGAAGCTCGTTTTCGTGAAATTAAGCTTCAGAGGGAAGCCCGAGAA ACACAGGAGAGCGAGCGCAAGCCCCCACCATACAAGCACATCAAGGTGAATAAGCCTTAC GGGAAAGTCCAGATCTACACAGCGGATATTTCAGAAATCCCTAAGTGCAACTGCAAGCCC ACAGATGAGAATCCTTGTGGCTTTGATTCGGAGTGTCTGAACAGGATGCTGATGTTTGAG TGCCACCCGCAGGTGTGTCCCGCGGGCGAGTTCTGCCAGAACCAGTGCTTCACCAAGCGC CAGTACCCAGAGACCAAGATCATCAAGACAGATGGCAAAGGGTGGGGCCTGGTCGCCAAG AGGGACATCAGAAAGGGAGAATTTGTTAACGAGTACGTTGGGGAGCTGATCGACGAGGAG GAGTGCATGGCGAGAATCAAGCACGCACACGAGAACGACATCACCCACTTCTACATGCTC ACTATAGACAAGGACCGTATAATAGACGCTGGCCCCAAAGGAAACTACTCTCGATTTATG AATCACAGCTGCCAGCCCAACTGTGAGACCCTCAAGTGGACAGTGAATGGGGACACTCGT GTGGGCCTGTTTGCCGTCTGTGACATTCCTGCAGGGACGGAGCTGACTTTTAACTACAAC CTCGATTGTCTGGGCAATGAAAAAACGGTCTGCCGGTGTGGAGCCTCCAATTGCAGTGGA TTCCTCGGGGATAGACCAAAGACCTCGACGACCCTTTCATCAGAGGAAAAGGGCAAAAAG ACCAAGAAGAAAACGAGGCGGCGCAGAGCAAAAGGGGAAGGGAAGAGGCAGTCAGAGGAC GAGTGCTTCCGCTGCGGTGATGGCGGGCAGCTGGTGCTGTGTGACCGCAAGTTCTGCACC AAGGCCTACCACCTGTCCTGCCTGGGCCTTGGCAAGCGGCCCTTCGGGAAGTGGGAATGT CCTTGGCATCATTGTGACGTGTGTGGCAAACCTTCGACTTCATTTTGCCACCTCTGCCCC AATTCGTTCTGTAAGGAGCACCAGGACGGGACAGCCTTCAGCTGCACCCCGGACGGGCGG TCCTACTGCTGTGAGCATGACTTAGGGGCGGCATCGGTCAGAAGCACCAAGACTGAGAAG CCCCCCCCAGAGCCAGGGAAGCCGAAGGGGAAGAGGCGGCGGCGGAGGGGCTGGCGGAGA GTCACAGAGGGCAAATAG |
|||||||||||||||||||||||||||||||||||||||||
Protein Properties | ||||||||||||||||||||||||||||||||||||||||||
Number of Residues | 1365 | |||||||||||||||||||||||||||||||||||||||||
Molecular Weight | 152257.02 | |||||||||||||||||||||||||||||||||||||||||
Theoretical pI | 8.685 | |||||||||||||||||||||||||||||||||||||||||
Pfam Domain Function | ||||||||||||||||||||||||||||||||||||||||||
Signals | Not Available | |||||||||||||||||||||||||||||||||||||||||
Transmembrane Regions | Not Available | |||||||||||||||||||||||||||||||||||||||||
Protein Sequence |
>Probable histone-lysine N-methyltransferase NSD2 MEFSIKQSPLSVQSVVKCIKMKQAPEILGSANGKTPSCEVNRECSVFLSKAQLSSSLQEG VMQKFNGHDALPFIPADKLKDLTSRVFNGEPGAHDAKLRFESQEMKGIGTPPNTTPIKNG SPEIKLKITKTYMNGKPLFESSICGDSAADVSQSEENGQKPENKARRNRKRSIKYDSLLE QGLVEAALVSKISSPSDKKIPAKKESCPNTGRDKDHLLKYNVGDLVWSKVSGYPWWPCMV SADPLLHSYTKLKGQKKSARQYHVQFFGDAPERAWIFEKSLVAFEGEGQFEKLCQESAKQ APTKAEKIKLLKPISGKLRAQWEMGIVQAEEAASMSVEERKAKFTFLYVGDQLHLNPQVA KEAGIAAESLGEMAESSGVSEEAAENPKSVREECIPMKRRRRAKLCSSAETLESHPDIGK STPQKTAEADPRRGVGSPPGRKKTTVSMPRSRKGDAASQFLVFCQKHRDEVVAEHPDASG EEIEELLRSQWSLLSEKQRARYNTKFALVAPVQAEEDSGNVNGKKRNHTKRIQDPTEDAE AEDTPRKRLRTDKHSLRKRDTITDKTARTSSYKAMEAASSLKSQAATKNLSDACKPLKKR NRASTAASSALGFSKSSSPSASLTENEVSDSPGDEPSESPYESADETQTEVSVSSKKSER GVTAKKEYVCQLCEKPGSLLLCEGPCCGAFHLACLGLSRRPEGRFTCSECASGIHSCFVC KESKTDVKRCVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKG KMMRCVRCPVAYHSGDACLAAGCSVIASNSIICTAHFTARKGKRHHAHVNVSWCFVCSKG GSLLCCESCPAAFHPDCLNIEMPDGSWFCNDCRAGKKLHFQDIIWVKLGNYRWWPAEVCH PKNVPPNIQKMKHEIGEFPVFFFGSKDYYWTHQARVFPYMEGDRGSRYQGVRGIGRVFKN ALQEAEARFREIKLQREARETQESERKPPPYKHIKVNKPYGKVQIYTADISEIPKCNCKP TDENPCGFDSECLNRMLMFECHPQVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAK RDIRKGEFVNEYVGELIDEEECMARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFM NHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSG FLGDRPKTSTTLSSEEKGKKTKKKTRRRRAKGEGKRQSEDECFRCGDGGQLVLCDRKFCT KAYHLSCLGLGKRPFGKWECPWHHCDVCGKPSTSFCHLCPNSFCKEHQDGTAFSCTPDGR SYCCEHDLGAASVRSTKTEKPPPEPGKPKGKRRRRRGWRRVTEGK |
|||||||||||||||||||||||||||||||||||||||||
External Links | ||||||||||||||||||||||||||||||||||||||||||
GenBank ID Protein | 109633019 | |||||||||||||||||||||||||||||||||||||||||
UniProtKB/Swiss-Prot ID | O96028 | |||||||||||||||||||||||||||||||||||||||||
UniProtKB/Swiss-Prot Entry Name | NSD2_HUMAN | |||||||||||||||||||||||||||||||||||||||||
PDB IDs | Not Available | |||||||||||||||||||||||||||||||||||||||||
GenBank Gene ID | NM_001042424.2 | |||||||||||||||||||||||||||||||||||||||||
GeneCard ID | WHSC1 | |||||||||||||||||||||||||||||||||||||||||
GenAtlas ID | WHSC1 | |||||||||||||||||||||||||||||||||||||||||
HGNC ID | HGNC:12766 | |||||||||||||||||||||||||||||||||||||||||
References | ||||||||||||||||||||||||||||||||||||||||||
General References | Not Available |