| Identification |
| HMDB Protein ID
| CDBP01573 |
| Secondary Accession Numbers
| Not Available |
| Name
| Histone-lysine N-methyltransferase SETD1A |
| Description
| Not Available |
| Synonyms
|
- Lysine N-methyltransferase 2F
- SET domain-containing protein 1A
- Set1/Ash2 histone methyltransferase complex subunit SET1
- hSET1A
|
| Gene Name
| SETD1A |
| Protein Type
| Enzyme |
| Biological Properties |
| General Function
| Involved in nucleotide binding |
| Specific Function
| Histone methyltransferase that specifically methylates 'Lys-4' of histone H3, when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys-9' residue is already methylated. H3 'Lys-4' methylation represents a specific tag for epigenetic transcriptional activation. The non-overalpping localization with SETD1B suggests that SETD1A and SETD1B make non-redundant contributions to the epigenetic control of chromatin structure and gene expression.
|
| GO Classification
|
| Biological Process |
| regulation of transcription, DNA-dependent |
| transcription, DNA-dependent |
| Cellular Component |
| chromosome |
| nuclear speck |
| Set1C/COMPASS complex |
| Function |
| binding |
| nucleotide binding |
| nucleic acid binding |
| Molecular Function |
| RNA binding |
| histone methyltransferase activity (H3-K4 specific) |
| nucleotide binding |
|
| Cellular Location
|
- Nucleus speckle
- Chromosome
|
| Pathways
|
Not Available
|
| Gene Properties |
| Chromosome Location
| 16 |
| Locus
| 16p11.2 |
| SNPs
| SETD1A |
| Gene Sequence
|
>5124 bp
ATGGATCAGGAAGGTGGGGGAGATGGGCAGAAGGCCCCGAGCTTCCAGTGGCGGAACTAC
AAGCTCATCGTGGATCCTGCCTTGGACCCTGCCCTGCGCAGGCCTTCTCAGAAGGTGTAC
CGCTATGATGGAGTCCACTTCAGTGTCAACGACTCAAAGTATATACCAGTCGAAGACCTC
CAAGACCCCCGTTGCCATGTCAGGTCCAAAAACAGAGACTTTTCCCTCCCAGTCCCTAAG
TTTAAGCTGGACGAGTTCTATATTGGACAGATTCCACTGAAGGAAGTGACTTTTGCAAGG
CTGAATGACAACGTGCGGGAGACCTTCCTGAAGGATATGTGCCGTAAGTACGGTGAGGTG
GAAGAGGTAGAGATCCTCCTTCACCCCCGTACGCGCAAGCACCTGGGCCTGGCCCGTGTG
CTCTTCACCAGCACTCGGGGCGCCAAGGAAACGGTCAAAAACCTCCACCTTACCTCCGTC
ATGGGCAACATCATCCATGCCCAGCTTGACATCAAAGGACAACAACGAATGAAATACTAT
GAACTAATTGTCAATGGCTCCTACACCCCTCAGACTGTGCCCACTGGGGGCAAGGCCCTG
AGTGAGAAGTTCCAAGGCTCGGGTGCAGCCACTGAGACGGCCGAATCCCGCCGCCGCTCT
TCCTCTGACACAGCTGCCTACCCAGCAGGCACCACTGCGGTGGGCACTCCTGGCAACGGC
ACCCCCTGCTCCCAGGACACAAGCTTCTCCAGCAGCCGACAAGATACCCCATCTTCCTTT
GGCCAGTTCACACCTCAGTCCTCCCAAGGAACCCCCTACACGTCTCGGGGCAGCACCCCC
TACTCTCAGGACTCTGCCTACTCCAGCAGCACCACTTCAACCTCCTTCAAGCCCCGGCGG
TCAGAGAACAGCTACCAAGATGCCTTTTCCCGCCGCCACTTCTCTGCATCTTCAGCCTCC
ACAACCGCCTCCACGGCCATCGCCGCCACCACTGCAGCCACTGCCTCATCCTCCGCCTCT
TCCTCCTCATTGTCCTCGTCCTCCTCGTCATCCTCTTCCTCCTCGTCCTCTCAGTTTCGT
AGTTCTGATGCAAACTACCCAGCGTATTATGAAAGCTGGAATCGCTACCAGCGCCATACT
TCCTACCCACCACGCCGGGCCACACGGGAGGAACCCCCTGGAGCCCCTTTTGCTGAAAAT
ACAGCTGAGCGCTTCCCACCTTCTTACACCTCCTACCTGCCCCCCGAGCCCAGCCGGCCC
ACCGACCAGGACTACCGGCCTCCTGCCTCAGAGGCTCCACCCCCGGAGCCTCCAGAACCT
GGTGGAGGCGGGGGTGGAGGAGGGCCCAGCCCTGAGAGAGAAGAAGTTCGGACTTCCCCC
CGCCCAGCCTCCCCTGCCCGCTCTGGCTCCCCAGCCCCGGAGACCACCAATGAGAGTGTG
CCCTTCGCCCAGCACAGCAGCCTGGATTCCCGCATCGAGATGCTGCTGAAGGAGCAGCGC
TCCAAGTTTTCCTTCTTGGCCTCTGACACAGAGGAGGAGGAAGAGAACAGCAGCATGGTC
CTTGGGGCCAGAGATACAGGGAGTGAGGTGCCTTCTGGGTCAGGGCATGGGCCCTGCACA
CCCCCTCCGGCCCCAGCTAATTTTGAGGATGTGGCACCTACAGGGAGCGGGGAGCCAGGG
GCTACCCGGGAGTCTCCCAAGGCAAATGGACAGAACCAGGCTTCTCCATGCTCTTCTGGA
GACGACATGGAGATCTCCGACGACGACCGGGGTGGCTCACCCCCTCCGGCCCCGACGCCC
CCTCAGCAGCCTCCGCCACCTCCCCCTCCCCCGCCGCCTCCTCCTCCCTACCTGGCGTCC
CTTCCTCTTGGTTATCCTCCCCACCAACCTGCCTACCTCCTCCCACCCAGACCTGATGGG
CCGCCGCCCCCTGAGTACCCCCCACCTCCTCCACCACCCCCGCACATCTATGACTTTGTG
AACTCCTTGGAGCTCATGGACCGACTTGGGGCTCAGTGGGGAGGGATGCCCATGTCCTTC
CAGATGCAGACCCAGATGTTAACTCGGCTCCATCAGCTGCGGCAGGGCAAGGGATTGATT
GCCGCCTCAGCTGGCCCCCCCGGTGGGGCCTTTGGGGAGGCCTTCCTCCCGTTTCCACCC
CCGCAGGAGGCAGCCTACGGCTTGCCGTATGCTCTATATGCACAGGGGCAGGAGGGCAGA
GGGGCATACTCACGGGAGGCCTACCACCTGCCCATGCCAATGGCAGCCGAGCCCCTGCCC
TCCTCCTCAGTCTCGGGAGAGGAGGCCCGGCTGCCACCCAGGGAAGAAGCAGAGCTGGCA
GAGGGCAAGACCCTCCCGACAGCAGGCACCGTGGGCCGTGTGCTCGCCATGCTGGTCCAG
GAGATGAAGAGCATCATGCAGCGAGACCTCAACCGCAAGATGGTGGAGAACGTGGCCTTC
GGAGCCTTTGACCAGTGGTGGGAGAGCAAGGAGGAGAAGGCCAAGCCATTCCAGAACGCG
GCCAAGCAGCAAGCCAAGGAGGAGGATAAAGAGAAGACGAAGCTGAAGGAGCCTGGCCTG
CTGTCCCTCGTGGACTGGGCCAAGAGCGGGGGCACTACGGGCATCGAGGCTTTCGCCTTT
GGGTCAGGGCTGAGAGGGGCCCTGCGGCTGCCTTCATTCAAGGTAAAGCGGAAAGAGCCA
TCGGAAATTTCCGAGGCCAGTGAGGAAAAGAGGCCTCGTCCCTCCACTCCTGCTGAGGAA
GATGAAGACGACCCTGAACAAGAGAAGGAGGCTGGAGAGCCAGGACGTCCGGGGACCAAG
CCCCCGAAGCGGGACGAAGAGCGAGGCAAGACCCAGGGCAAGCACCGCAAGTCCTTTGCT
CTGGACAGCGAAGGGGAGGAGGCATCCCAGGAGTCCTCCTCGGAGAAGGATGAGGAGGAT
GACGAGGAAGATGAGGAAGATGAAGATCGAGAGGAAGCTGTGGATACCACAAAGAAGGAG
ACAGAGGTGTCGGATGGCGAGGACGAGGAAAGCGATTCGTCTTCCAAATGTTCTCTGTAT
GCTGACTCAGATGGCGAAAATGACAGCACATCAGACTCCGAGAGCAGCAGCTCTTCCAGC
TCCTCATCCTCCTCCTCCTCCTCGTCCTCATCCTCCTCGTCCTCTTCATCCTCTGAGTCC
TCCTCTGAAGATGAAGAGGAAGAGGAGCGGCCAGCAGCCCTTCCCTCAGCCTCCCCGCCC
CCCAGAGAAGTCCCAGTGCCCACGCCAGCACCTGTGGAGGTGCCAGTGCCGGAAAGGGTT
GCAGGCTCCCCAGTCACACCCCTGCCCGAACAGGAGGCGTCTCCAGCAAGGCCTGCAGGC
CCCACGGAGGAGTCACCCCCCAGTGCGCCTCTGCGTCCCCCAGAACCACCTGCTGGGCCC
CCGGCCCCTGCCCCACGCCCCGATGAGCGTCCCTCTTCTCCCATCCCCCTCCTGCCCCCA
CCCAAGAAACGCCGGAAAACTGTCTCCTTCTCTGCCATCGAGGTGGTGCCAGCCCCGGAG
CCCCCTCCAGCCACACCGCCGCAGGCCAAGTTTCCCGGCCCAGCCTCCCGCAAGGCTCCC
CGGGGCGTGGAGCGGACCATCCGCAACCTGCCCCTGGACCACGCATCTCTGGTCAAGAGT
TGGCCCGAGGAGGTGTCCCGAGGAGGCCGGAGCCGGGCTGGAGGCCGAGGCCGCCTCACC
GAGGAAGAGGAGGCTGAGCCAGGGACAGAGGTGGACCTGGCGGTCCTGGCCGACCTGGCC
CTGACCCCTGCCCGGCGCGGGCTGCCTGCCCTGCCTGCTGTTGAAGACTCAGAGGCCACA
GAGACATCGGACGAGGCCGAGCGCCCTAGGCCCCTGCTCAGCCACATCCTCCTGGAGCAC
AACTATGCCCTGGCCGTCAAGCCCACGCCCCCTGCGCCAGCCCTGCGGCCCCCGGAGCCA
GTGCCCGCACCCGCCGCCCTCTTCAGTTCCCCAGCTGATGAGGTCCTGGAGGCCCCCGAG
GTGGTGGTGGCTGAGGCGGAGGAGCCCAAGCCGCAGCAACTGCAGCAGCAGCGGGAGGAG
GGCGAAGAGGAGGGGGAGGAAGAGGGGGAGGAAGAGGAGGAGGAGTCCTCTGACAGCAGC
AGCAGCAGCGATGGGGAGGGCGCCCTCCGGAGGCGCAGCCTCCGCTCCCACGCCCGGCGC
CGCCGCCCTCCGCCCCCACCCCCGCCGCCACCGCCCCGCGCCTACGAGCCACGCAGTGAG
TTTGAACAGATGACCATCCTGTATGACATTTGGAACTCGGGCCTGGACTCAGAGGACATG
AGTTACCTGCGGCTTACGTACGAGCGGCTGCTGCAGCAGACAAGCGGGGCTGACTGGCTC
AACGACACTCACTGGGTCCATCACACAATCACCAACCTGACCACCCCAAAACGCAAGCGG
CGGCCCCAGGATGGGCCCCGGGAGCACCAGACAGGCTCAGCCCGCAGCGAAGGCTACTAC
CCCATCAGCAAGAAGGAGAAGGACAAGTACCTGGACGTGTGCCCAGTCTCGGCCCGGCAG
CTGGAGGGCGTGGACACTCAGGGGACGAACCGCGTGCTGTCCGAGCGCCGGTCCGAGCAG
CGGCGGCTGCTGAGCGCCATCGGTACCTCCGCCATCATGGACAGTGACCTGCTGAAACTC
AACCAGCTCAAGTTCCGGAAGAAGAAGCTCCGATTTGGCCGGAGCCGGATCCACGAGTGG
GGTCTGTTTGCCATGGAACCCATTGCTGCTGACGAGATGGTCATCGAATACGTGGGTCAG
AACATCCGTCAGATGGTGGCCGACATGCGGGAGAAGCGCTACGTGCAGGAGGGCATTGGC
AGCAGCTACCTGTTCCGGGTGGACCACGACACCATCATCGATGCCACCAAGTGTGGCAAC
CTGGCCAGATTCATCAACCACTGCTGCACGCCTAACTGCTACGCCAAGGTCATCACCATC
GAGTCCCAGAAGAAGATCGTGATCTACTCCAAGCAGCCCATTGGCGTGGACGAGGAGATC
ACCTACGACTACAAGTTCCCACTGGAAGACAACAAGATCCCGTGTCTGTGTGGCACAGAG
AGCTGCCGGGGCTCCCTAAACTGA
|
| Protein Properties |
| Number of Residues
| 1707 |
| Molecular Weight
| 186032.16 |
| Theoretical pI
| 5.141 |
| Pfam Domain Function
|
|
| Signals
|
Not Available
|
|
Transmembrane Regions
|
Not Available
|
| Protein Sequence
|
>Histone-lysine N-methyltransferase SETD1A
MDQEGGGDGQKAPSFQWRNYKLIVDPALDPALRRPSQKVYRYDGVHFSVNDSKYIPVEDL
QDPRCHVRSKNRDFSLPVPKFKLDEFYIGQIPLKEVTFARLNDNVRETFLKDMCRKYGEV
EEVEILLHPRTRKHLGLARVLFTSTRGAKETVKNLHLTSVMGNIIHAQLDIKGQQRMKYY
ELIVNGSYTPQTVPTGGKALSEKFQGSGAATETAESRRRSSSDTAAYPAGTTAVGTPGNG
TPCSQDTSFSSSRQDTPSSFGQFTPQSSQGTPYTSRGSTPYSQDSAYSSSTTSTSFKPRR
SENSYQDAFSRRHFSASSASTTASTAIAATTAATASSSASSSSLSSSSSSSSSSSSSQFR
SSDANYPAYYESWNRYQRHTSYPPRRATREEPPGAPFAENTAERFPPSYTSYLPPEPSRP
TDQDYRPPASEAPPPEPPEPGGGGGGGGPSPEREEVRTSPRPASPARSGSPAPETTNESV
PFAQHSSLDSRIEMLLKEQRSKFSFLASDTEEEEENSSMVLGARDTGSEVPSGSGHGPCT
PPPAPANFEDVAPTGSGEPGATRESPKANGQNQASPCSSGDDMEISDDDRGGSPPPAPTP
PQQPPPPPPPPPPPPPYLASLPLGYPPHQPAYLLPPRPDGPPPPEYPPPPPPPPHIYDFV
NSLELMDRLGAQWGGMPMSFQMQTQMLTRLHQLRQGKGLIAASAGPPGGAFGEAFLPFPP
PQEAAYGLPYALYAQGQEGRGAYSREAYHLPMPMAAEPLPSSSVSGEEARLPPREEAELA
EGKTLPTAGTVGRVLAMLVQEMKSIMQRDLNRKMVENVAFGAFDQWWESKEEKAKPFQNA
AKQQAKEEDKEKTKLKEPGLLSLVDWAKSGGTTGIEAFAFGSGLRGALRLPSFKVKRKEP
SEISEASEEKRPRPSTPAEEDEDDPEQEKEAGEPGRPGTKPPKRDEERGKTQGKHRKSFA
LDSEGEEASQESSSEKDEEDDEEDEEDEDREEAVDTTKKETEVSDGEDEESDSSSKCSLY
ADSDGENDSTSDSESSSSSSSSSSSSSSSSSSSSSSSSESSSEDEEEEERPAALPSASPP
PREVPVPTPAPVEVPVPERVAGSPVTPLPEQEASPARPAGPTEESPPSAPLRPPEPPAGP
PAPAPRPDERPSSPIPLLPPPKKRRKTVSFSAIEVVPAPEPPPATPPQAKFPGPASRKAP
RGVERTIRNLPLDHASLVKSWPEEVSRGGRSRAGGRGRLTEEEEAEPGTEVDLAVLADLA
LTPARRGLPALPAVEDSEATETSDEAERPRPLLSHILLEHNYALAVKPTPPAPALRPPEP
VPAPAALFSSPADEVLEAPEVVVAEAEEPKPQQLQQQREEGEEEGEEEGEEEEEESSDSS
SSSDGEGALRRRSLRSHARRRRPPPPPPPPPPRAYEPRSEFEQMTILYDIWNSGLDSEDM
SYLRLTYERLLQQTSGADWLNDTHWVHHTITNLTTPKRKRRPQDGPREHQTGSARSEGYY
PISKKEKDKYLDVCPVSARQLEGVDTQGTNRVLSERRSEQRRLLSAIGTSAIMDSDLLKL
NQLKFRKKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIG
SSYLFRVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEI
TYDYKFPLEDNKIPCLCGTESCRGSLN
|
| External Links |
| GenBank ID Protein
| 55741677 |
| UniProtKB/Swiss-Prot ID
| O15047 |
| UniProtKB/Swiss-Prot Entry Name
| SET1A_HUMAN |
| PDB IDs
|
|
| GenBank Gene ID
| NM_014712.1 |
| GeneCard ID
| SETD1A |
| GenAtlas ID
| SETD1A |
| HGNC ID
| HGNC:29010 |
| References |
| General References
| Not Available |