| Identification |
| HMDB Protein ID
| CDBP04214 |
| Secondary Accession Numbers
| Not Available |
| Name
| Histone-lysine N-methyltransferase NSD2 |
| Description
| Not Available |
| Synonyms
|
- Multiple myeloma SET domain-containing protein
- Nuclear SET domain-containing protein 2
- Protein trithorax-5
- Wolf-Hirschhorn syndrome candidate 1 protein
- MMSET
- NSD2
- WHSC1
|
| Gene Name
| WHSC1 |
| Protein Type
| Enzyme |
| Biological Properties |
| General Function
| Involved in histone-lysine N-methyltransferase activity |
| Specific Function
| Histone methyltransferase that methylates 'Lys-27' of histone H3 (H3K27me). Isoform RE-IIBP may act as a transcription regulator that binds DNA and suppresses IL5 transcription through HDAC recruitment.
|
| GO Classification
|
| Biological Process |
| anatomical structure morphogenesis |
| negative regulation of transcription from RNA polymerase II promoter |
| bone development |
| transcription, DNA-dependent |
| atrial septum primum morphogenesis |
| atrial septum secundum morphogenesis |
| membranous septum morphogenesis |
| Cellular Component |
| nucleolus |
| nuclear membrane |
| chromosome |
| cytoplasm |
| Component |
| organelle |
| membrane-bounded organelle |
| intracellular membrane-bounded organelle |
| nucleus |
| Function |
| protein-lysine N-methyltransferase activity |
| histone-lysine N-methyltransferase activity |
| nucleic acid binding |
| DNA binding |
| transferase activity, transferring one-carbon groups |
| methyltransferase activity |
| ion binding |
| cation binding |
| metal ion binding |
| binding |
| catalytic activity |
| transition metal ion binding |
| zinc ion binding |
| transferase activity |
| protein binding |
| protein methyltransferase activity |
| Molecular Function |
| histone-lysine N-methyltransferase activity |
| metal ion binding |
| zinc ion binding |
| chromatin binding |
| DNA binding |
|
| Cellular Location
|
- Isoform 4: Cytoplasm
|
| Pathways
|
| Name | SMPDB/Pathwhiz | KEGG |
| Transcriptional misregulation in cancer | Not Available | |
|
| Gene Properties |
| Chromosome Location
| 4 |
| Locus
| 4p16.3 |
| SNPs
| WHSC1 |
| Gene Sequence
|
>4098 bp
ATGGAATTTAGCATCAAGCAGAGTCCCCTTTCTGTTCAGAGTGTTGTAAAGTGCATAAAG
ATGAAGCAGGCACCAGAAATCCTCGGCAGTGCCAACGGGAAGACTCCGAGCTGCGAGGTG
AACCGCGAGTGTTCTGTGTTCCTCAGCAAAGCCCAGCTCTCCAGTAGCCTGCAGGAGGGG
GTCATGCAGAAGTTTAACGGCCACGACGCCCTGCCCTTTATTCCAGCCGACAAGCTGAAA
GATCTTACTTCCCGGGTGTTTAATGGAGAACCCGGCGCACACGATGCCAAACTGCGTTTT
GAGTCCCAGGAAATGAAAGGGATTGGGACACCCCCTAACACTACCCCTATCAAAAATGGC
TCTCCAGAAATTAAGCTGAAAATCACCAAAACATACATGAATGGGAAGCCTCTCTTTGAA
TCTTCCATTTGTGGTGACAGTGCTGCTGATGTGTCTCAGTCAGAAGAAAATGGACAAAAA
CCAGAAAACAAGGCGAGAAGGAACAGGAAGAGGAGCATAAAATATGACTCCTTGCTGGAG
CAGGGCCTTGTCGAAGCAGCTCTTGTGTCTAAGATCTCAAGTCCTTCAGATAAAAAGATT
CCAGCTAAGAAAGAGTCTTGTCCAAACACTGGAAGAGACAAAGACCACCTGTTGAAATAC
AACGTTGGTGATTTGGTGTGGTCCAAAGTGTCGGGTTACCCTTGGTGGCCTTGCATGGTT
TCTGCAGATCCACTCCTTCACAGCTATACCAAACTTAAAGGTCAGAAAAAGAGTGCACGC
CAGTATCACGTACAGTTCTTTGGTGACGCCCCAGAAAGAGCTTGGATATTTGAGAAGAGC
CTCGTAGCTTTTGAAGGAGAAGGACAGTTTGAAAAATTATGCCAGGAAAGTGCCAAGCAG
GCACCCACGAAAGCTGAGAAAATTAAGCTATTGAAACCAATTTCAGGGAAATTGAGGGCC
CAGTGGGAAATGGGCATTGTTCAAGCAGAAGAAGCTGCAAGCATGTCAGTGGAGGAGCGG
AAAGCCAAGTTCACCTTTCTCTATGTGGGGGACCAGCTTCATCTCAACCCTCAAGTAGCC
AAGGAGGCTGGCATTGCTGCAGAGTCTTTGGGAGAAATGGCAGAATCCTCAGGAGTCAGT
GAAGAAGCTGCTGAAAACCCCAAGTCTGTGAGAGAAGAGTGCATTCCCATGAAGAGAAGG
CGGAGGGCCAAACTGTGTAGCTCTGCAGAGACCCTGGAGAGTCACCCCGACATAGGGAAG
AGTACTCCTCAAAAGACGGCAGAGGCTGACCCCAGAAGAGGAGTAGGGTCTCCTCCTGGG
AGGAAGAAGACCACAGTCTCCATGCCACGAAGCAGGAAGGGAGATGCAGCATCCCAGTTT
TTGGTCTTCTGTCAAAAACACAGGGATGAGGTGGTAGCTGAGCACCCAGATGCTTCAGGT
GAGGAGATTGAAGAGCTGCTCAGGTCACAGTGGAGTCTGCTGAGTGAGAAGCAGAGAGCA
CGCTACAACACCAAGTTTGCCCTGGTGGCCCCTGTCCAGGCTGAAGAAGACTCTGGTAAT
GTAAATGGGAAAAAAAGAAACCACACAAAGAGGATACAGGACCCTACAGAAGATGCTGAA
GCTGAGGACACACCCAGGAAAAGACTCAGGACGGACAAGCACAGTCTTCGGAAGAGAGAC
ACAATCACTGACAAAACGGCCAGAACAAGCTCTTACAAGGCCATGGAGGCAGCCTCCTCG
CTCAAGAGCCAGGCAGCAACGAAAAATCTGTCTGATGCATGTAAACCACTGAAGAAGCGA
AATCGGGCTTCCACGGCAGCATCTTCAGCTCTTGGGTTTAGCAAAAGTTCATCTCCTTCT
GCATCCTTAACTGAGAATGAGGTCTCGGACAGCCCGGGAGACGAGCCCTCGGAGTCCCCA
TACGAAAGTGCAGACGAAACACAAACTGAAGTATCTGTCTCATCCAAAAAGTCTGAGCGA
GGAGTGACTGCCAAAAAGGAGTATGTGTGCCAGCTGTGTGAGAAGCCGGGCAGCCTCCTG
CTCTGTGAAGGACCCTGCTGCGGAGCTTTCCACCTCGCCTGCCTTGGGCTTTCCCGGAGG
CCAGAAGGGAGGTTCACCTGCAGCGAGTGTGCCTCAGGGATTCACTCATGTTTCGTGTGT
AAAGAGAGCAAGACAGATGTTAAGCGCTGTGTGGTAACTCAGTGTGGAAAATTTTACCAT
GAGGCTTGTGTGAAAAAATACCCTCTGACTGTATTTGAGAGCCGAGGTTTCCGCTGCCCC
CTCCACAGCTGTGTGAGCTGCCATGCTTCCAACCCTTCAAACCCAAGGCCGTCAAAAGGT
AAAATGATGCGGTGTGTCCGCTGCCCCGTTGCCTATCACAGCGGGGATGCTTGTCTGGCA
GCAGGATGCTCAGTGATCGCCTCCAACAGCATCATCTGCACTGCCCACTTCACTGCTCGG
AAGGGGAAGCGACACCACGCCCACGTCAACGTGAGCTGGTGCTTCGTGTGCTCCAAAGGG
GGGAGCCTTCTGTGCTGTGAGTCCTGCCCAGCGGCCTTCCACCCTGACTGCCTGAACATC
GAGATGCCTGACGGCAGCTGGTTCTGCAATGACTGCAGGGCTGGGAAGAAGCTGCACTTC
CAGGATATCATTTGGGTGAAACTTGGGAACTACAGATGGTGGCCGGCAGAAGTTTGCCAT
CCCAAAAATGTTCCCCCAAATATTCAGAAAATGAAGCACGAGATTGGAGAATTCCCTGTG
TTTTTCTTTGGGTCTAAAGATTATTACTGGACGCATCAGGCGCGAGTGTTCCCGTACATG
GAGGGGGACCGGGGCAGCCGCTACCAGGGGGTCAGAGGGATCGGAAGAGTCTTCAAAAAC
GCACTGCAAGAAGCTGAAGCTCGTTTTCGTGAAATTAAGCTTCAGAGGGAAGCCCGAGAA
ACACAGGAGAGCGAGCGCAAGCCCCCACCATACAAGCACATCAAGGTGAATAAGCCTTAC
GGGAAAGTCCAGATCTACACAGCGGATATTTCAGAAATCCCTAAGTGCAACTGCAAGCCC
ACAGATGAGAATCCTTGTGGCTTTGATTCGGAGTGTCTGAACAGGATGCTGATGTTTGAG
TGCCACCCGCAGGTGTGTCCCGCGGGCGAGTTCTGCCAGAACCAGTGCTTCACCAAGCGC
CAGTACCCAGAGACCAAGATCATCAAGACAGATGGCAAAGGGTGGGGCCTGGTCGCCAAG
AGGGACATCAGAAAGGGAGAATTTGTTAACGAGTACGTTGGGGAGCTGATCGACGAGGAG
GAGTGCATGGCGAGAATCAAGCACGCACACGAGAACGACATCACCCACTTCTACATGCTC
ACTATAGACAAGGACCGTATAATAGACGCTGGCCCCAAAGGAAACTACTCTCGATTTATG
AATCACAGCTGCCAGCCCAACTGTGAGACCCTCAAGTGGACAGTGAATGGGGACACTCGT
GTGGGCCTGTTTGCCGTCTGTGACATTCCTGCAGGGACGGAGCTGACTTTTAACTACAAC
CTCGATTGTCTGGGCAATGAAAAAACGGTCTGCCGGTGTGGAGCCTCCAATTGCAGTGGA
TTCCTCGGGGATAGACCAAAGACCTCGACGACCCTTTCATCAGAGGAAAAGGGCAAAAAG
ACCAAGAAGAAAACGAGGCGGCGCAGAGCAAAAGGGGAAGGGAAGAGGCAGTCAGAGGAC
GAGTGCTTCCGCTGCGGTGATGGCGGGCAGCTGGTGCTGTGTGACCGCAAGTTCTGCACC
AAGGCCTACCACCTGTCCTGCCTGGGCCTTGGCAAGCGGCCCTTCGGGAAGTGGGAATGT
CCTTGGCATCATTGTGACGTGTGTGGCAAACCTTCGACTTCATTTTGCCACCTCTGCCCC
AATTCGTTCTGTAAGGAGCACCAGGACGGGACAGCCTTCAGCTGCACCCCGGACGGGCGG
TCCTACTGCTGTGAGCATGACTTAGGGGCGGCATCGGTCAGAAGCACCAAGACTGAGAAG
CCCCCCCCAGAGCCAGGGAAGCCGAAGGGGAAGAGGCGGCGGCGGAGGGGCTGGCGGAGA
GTCACAGAGGGCAAATAG
|
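The gene sequence above can be checked against the protein sequence listed in this entry by translating it with the standard genetic code. The following is a minimal sketch, not part of the original record: the helper `translate` and the compact codon-table construction are illustrative assumptions, and only the first and last lines of the 4098 bp sequence are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed in this entry.
first_line = "ATGGAATTTAGCATCAAGCAGAGTCCCCTTTCTGTTCAGAGTGTTGTAAAGTGCATAAAG"
last_line = "GTCACAGAGGGCAAATAG"

n_terminus = translate(first_line)
c_terminus = translate(last_line)

# 4098 bp = 1366 codons; the last codon (TAG) is a stop, leaving 1365 residues.
expected_residues = (4098 - 3) // 3

print(n_terminus, c_terminus, expected_residues)
```

Because 60 is a multiple of 3, each 60-base line of the sequence starts in frame, so individual lines can be translated on their own as done here.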
| Protein Properties |
| Number of Residues
| 1365 |
| Molecular Weight
| 152257.02 |
| Theoretical pI
| 8.685 |
| Pfam Domain Function
|
|
| Signals
| Not Available |
| Transmembrane Regions
| Not Available |
| Protein Sequence
|
>Probable histone-lysine N-methyltransferase NSD2
MEFSIKQSPLSVQSVVKCIKMKQAPEILGSANGKTPSCEVNRECSVFLSKAQLSSSLQEG
VMQKFNGHDALPFIPADKLKDLTSRVFNGEPGAHDAKLRFESQEMKGIGTPPNTTPIKNG
SPEIKLKITKTYMNGKPLFESSICGDSAADVSQSEENGQKPENKARRNRKRSIKYDSLLE
QGLVEAALVSKISSPSDKKIPAKKESCPNTGRDKDHLLKYNVGDLVWSKVSGYPWWPCMV
SADPLLHSYTKLKGQKKSARQYHVQFFGDAPERAWIFEKSLVAFEGEGQFEKLCQESAKQ
APTKAEKIKLLKPISGKLRAQWEMGIVQAEEAASMSVEERKAKFTFLYVGDQLHLNPQVA
KEAGIAAESLGEMAESSGVSEEAAENPKSVREECIPMKRRRRAKLCSSAETLESHPDIGK
STPQKTAEADPRRGVGSPPGRKKTTVSMPRSRKGDAASQFLVFCQKHRDEVVAEHPDASG
EEIEELLRSQWSLLSEKQRARYNTKFALVAPVQAEEDSGNVNGKKRNHTKRIQDPTEDAE
AEDTPRKRLRTDKHSLRKRDTITDKTARTSSYKAMEAASSLKSQAATKNLSDACKPLKKR
NRASTAASSALGFSKSSSPSASLTENEVSDSPGDEPSESPYESADETQTEVSVSSKKSER
GVTAKKEYVCQLCEKPGSLLLCEGPCCGAFHLACLGLSRRPEGRFTCSECASGIHSCFVC
KESKTDVKRCVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKG
KMMRCVRCPVAYHSGDACLAAGCSVIASNSIICTAHFTARKGKRHHAHVNVSWCFVCSKG
GSLLCCESCPAAFHPDCLNIEMPDGSWFCNDCRAGKKLHFQDIIWVKLGNYRWWPAEVCH
PKNVPPNIQKMKHEIGEFPVFFFGSKDYYWTHQARVFPYMEGDRGSRYQGVRGIGRVFKN
ALQEAEARFREIKLQREARETQESERKPPPYKHIKVNKPYGKVQIYTADISEIPKCNCKP
TDENPCGFDSECLNRMLMFECHPQVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAK
RDIRKGEFVNEYVGELIDEEECMARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFM
NHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEKTVCRCGASNCSG
FLGDRPKTSTTLSSEEKGKKTKKKTRRRRAKGEGKRQSEDECFRCGDGGQLVLCDRKFCT
KAYHLSCLGLGKRPFGKWECPWHHCDVCGKPSTSFCHLCPNSFCKEHQDGTAFSCTPDGR
SYCCEHDLGAASVRSTKTEKPPPEPGKPKGKRRRRRGWRRVTEGK
|
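The protein sequence above is given in FASTA layout: a `>` header line followed by sequence lines of 60 residues. A minimal sketch of reading such a record and confirming the residue count follows; the helper `parse_fasta` is a hypothetical name introduced for illustration, and the record in the example is abbreviated to the first and last sequence lines of this entry.

```python
def parse_fasta(text):
    """Split a single FASTA record into (header, sequence)."""
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("expected a '>' header line")
    return lines[0][1:], "".join(lines[1:])


# Abbreviated record: only the first and last sequence lines of this entry.
record = """>Probable histone-lysine N-methyltransferase NSD2
MEFSIKQSPLSVQSVVKCIKMKQAPEILGSANGKTPSCEVNRECSVFLSKAQLSSSLQEG
SYCCEHDLGAASVRSTKTEKPPPEPGKPKGKRRRRRGWRRVTEGK
"""

header, seq = parse_fasta(record)
print(header, len(seq))

# The full entry has 22 lines of 60 residues plus a final line of 45,
# which agrees with the listed Number of Residues.
full_length = 22 * 60 + 45
print(full_length)
```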
| External Links |
| GenBank ID Protein
| 109633019 |
| UniProtKB/Swiss-Prot ID
| O96028 |
| UniProtKB/Swiss-Prot Entry Name
| NSD2_HUMAN |
| PDB IDs
| Not Available |
| GenBank Gene ID
| NM_001042424.2 |
| GeneCard ID
| WHSC1 |
| GenAtlas ID
| WHSC1 |
| HGNC ID
| HGNC:12766 |
| References |
| General References
| Not Available |