| Identification |
| HMDB Protein ID
| CDBP01584 |
| Secondary Accession Numbers
| Not Available |
| Name
| Collagen alpha-2(I) chain |
| Description
| Not Available |
| Synonyms
|
- Alpha-2 type I collagen
|
| Gene Name
| COL1A2 |
| Protein Type
| Enzyme |
| Biological Properties |
| General Function
| Involved in extracellular matrix structural constituent |
| Specific Function
| Type I collagen is a member of group I collagen (fibrillar forming collagen) |
| GO Classification
|
| Component |
| extracellular region part |
| collagen |
| extracellular matrix part |
| Function |
| structural molecule activity |
| extracellular matrix structural constituent |
|
| Cellular Location
|
- Secreted
- extracellular space
- extracellular matrix
|
| Pathways
|
Not Available
|
| Gene Properties |
| Chromosome Location
| Chromosome:7 |
| Locus
| 7q22.1 |
| SNPs
| COL1A2 |
| Gene Sequence
|
>4101 bp
ATGCTCAGCTTTGTGGATACGCGGACTTTGTTGCTGCTTGCAGTAACCTTATGCCTAGCA
ACATGCCAATCTTTACAAGAGGAAACTGTAAGAAAGGGCCCAGCCGGAGATAGAGGACCA
CGTGGAGAAAGGGGTCCACCAGGCCCCCCAGGCAGAGATGGTGAAGATGGTCCCACAGGC
CCTCCTGGTCCACCTGGTCCTCCTGGCCCCCCTGGTCTCGGTGGGAACTTTGCTGCTCAG
TATGATGGAAAAGGAGTTGGACTTGGCCCTGGACCAATGGGCTTAATGGGACCTAGAGGC
CCACCTGGTGCAGCTGGAGCCCCAGGCCCTCAAGGTTTCCAAGGACCTGCTGGTGAGCCT
GGTGAACCTGGTCAAACTGGTCCTGCAGGTGCTCGTGGTCCAGCTGGCCCTCCTGGCAAG
GCTGGTGAAGATGGTCACCCTGGAAAACCCGGACGACCTGGTGAGAGAGGAGTTGTTGGA
CCACAGGGTGCTCGTGGTTTCCCTGGAACTCCTGGACTTCCTGGCTTCAAAGGCATTAGG
GGACACAATGGTCTGGATGGATTGAAGGGACAGCCCGGTGCTCCTGGTGTGAAGGGTGAA
CCTGGTGCCCCTGGTGAAAATGGAACTCCAGGTCAAACAGGAGCCCGTGGTCTTCCTGGT
GAGAGAGGACGTGTTGGTGCCCCTGGTCCAGCTGGTGCCCGTGGAAGTGATGGAAGTGTG
GGTCCCGTAGGTCCTGCTGGTCCTAATGGGTCTGCTGGCCCTCCAGGTTTCCCAGGTGCC
CCTGGTCCCAAGGGTGAAATTGGAGCTGTTGGTAACGCTGGTCCTACTGGACCCGCCGGT
CCCCGTGGTGAAGTGGGTCTTCCAGGCCTCTCCGGCCCCGTTGGACCTCCTGGTAATCCT
GGAGCAAACGGCCTTACTGGTGCCAAGGGTGCTGCTGGCCTTCCCGGCGTTGCTGGGGCT
CCCGGCCTCCCTGGACCCCGCGGTATTCCTGGCCCTCCTGGTGCTGCCGGTACTACTGGT
GCCAGAGGACTTGTTGGTGAGCCTGGTCCAGCTGGCTCCAAAGGAGAGAGCGGTAACAAG
GGTGAGCCCGGCTCCGCTGGTCCCCAAGGTCCTCCTGGTCCCAGTGGTGAAGAAGGAAAG
AGAGGCCCTAATGGGGAAGCTGGATCTGCCGGCCCTCCAGGACCTCCTGGGCTGAGAGGT
AGTCCTGGTTCTCGTGGTCTTCCTGGAGCTGATGGCAGAGCTGGCGTCATGGGCCCTCCT
GGTAGTCGTGGTGCAAGTGGCCCTGCTGGAGTCCGAGGACCTAATGGAGATGCTGGTCGC
CCTGGGGAGCCTGGTCTCATGGGACCCAGAGGTCTTCCTGGTTCCCCTGGAAATATCGGC
CCCGCTGGAAAAGAAGGTCCTGTCGGCCTCCCTGGCATCGACGGCAGGCCTGGCCCAATT
GGCCCCGTTGGAGCAAGAGGAGAGCCTGGCAACATTGGATTCCCTGGACCCAAAGGCCCC
ACTGGTGACCCTGGCAAAAACGGTGATAAAGGTCATGCTGGTCTTGCTGGTGCTCGGGGT
GCTCCAGGTCCTGATGGAAACAATGGTGCTCAGGGACCTCCTGGACCACAGGGTGTTCAA
GGTGGAAAAGGTGAACAGGGTCCCGCTGGTCCTCCAGGCTTCCAGGGTCTGCCTGGCCCC
TCAGGTCCCGCTGGTGAAGTTGGCAAACCAGGAGAAAGGGGTCTCCATGGTGAGTTTGGT
CTCCCTGGTCCTGCTGGTCCAAGAGGGGAACGCGGTCCCCCAGGTGAGAGTGGTGCTGCC
GGTCCTACTGGTCCTATTGGAAGCCGAGGTCCTTCTGGACCCCCAGGGCCTGATGGAAAC
AAGGGTGAACCTGGTGTGGTTGGTGCTGTGGGCACTGCTGGTCCATCTGGTCCTAGTGGA
CTCCCAGGAGAGAGGGGTGCTGCTGGCATACCTGGAGGCAAGGGAGAAAAGGGTGAACCT
GGTCTCAGAGGTGAAATTGGTAACCCTGGCAGAGATGGTGCTCGTGGTGCTCATGGTGCT
GTAGGTGCCCCTGGTCCTGCTGGAGCCACAGGTGACCGGGGCGAAGCTGGGGCTGCTGGT
CCTGCTGGTCCTGCTGGTCCTCGGGGAAGCCCTGGTGAACGTGGCGAGGTCGGTCCTGCT
GGCCCCAACGGATTTGCTGGTCCGGCTGGTGCTGCTGGTCAACCGGGTGCTAAAGGAGAA
AGAGGAGGCAAAGGGCCTAAGGGTGAAAACGGTGTTGTTGGTCCCACAGGCCCCGTTGGA
GCTGCTGGCCCAGCTGGTCCAAATGGTCCCCCCGGTCCTGCTGGAAGTCGTGGTGATGGA
GGCCCCCCTGGTATGACTGGTTTCCCTGGTGCTGCTGGACGGACTGGTCCCCCAGGACCC
TCTGGTATTTCTGGCCCTCCTGGTCCCCCTGGTCCTGCTGGGAAAGAAGGGCTTCGTGGT
CCTCGTGGTGACCAAGGTCCAGTTGGCCGAACTGGAGAAGTAGGTGCAGTTGGTCCCCCT
GGCTTCGCTGGTGAGAAGGGTCCCTCTGGAGAGGCTGGTACTGCTGGACCTCCTGGCACT
CCAGGTCCTCAGGGTCTTCTTGGTGCTCCTGGTATTCTGGGTCTCCCTGGCTCGAGAGGT
GAACGTGGTCTACCTGGTGTTGCTGGTGCTGTGGGTGAACCTGGTCCTCTTGGCATTGCC
GGCCCTCCTGGGGCCCGTGGTCCTCCTGGTGCTGTGGGTAGTCCTGGAGTCAACGGTGCT
CCTGGTGAAGCTGGTCGTGATGGCAACCCTGGGAACGATGGTCCCCCAGGTCGCGATGGT
CAACCCGGACACAAGGGAGAGCGCGGTTACCCTGGCAATATTGGTCCCGTTGGTGCTGCA
GGTGCACCTGGTCCTCATGGCCCCGTGGGTCCTGCTGGCAAACATGGAAACCGTGGTGAA
ACTGGTCCTTCTGGTCCTGTTGGTCCTGCTGGTGCTGTTGGCCCAAGAGGTCCTAGTGGC
CCACAAGGCATTCGTGGCGATAAGGGAGAGCCCGGTGAAAAGGGGCCCAGAGGTCTTCCT
GGCTTCAAGGGACACAATGGATTGCAAGGTCTGCCTGGTATCGCTGGTCACCATGGTGAT
CAAGGTGCTCCTGGCTCCGTGGGTCCTGCTGGTCCTAGGGGCCCTGCTGGTCCTTCTGGC
CCTGCTGGAAAAGATGGTCGCACTGGACATCCTGGTACGGTTGGACCTGCTGGCATTCGA
GGCCCTCAGGGTCACCAAGGCCCTGCTGGCCCCCCTGGTCCCCCTGGCCCTCCTGGACCT
CCAGGTGTAAGCGGTGGTGGTTATGACTTTGGTTACGATGGAGACTTCTACAGGGCTGAC
CAGCCTCGCTCAGCACCTTCTCTCAGACCCAAGGACTATGAAGTTGATGCTACTCTGAAG
TCTCTCAACAACCAGATTGAGACCCTTCTTACTCCTGAAGGCTCTAGAAAGAACCCAGCT
CGCACATGCCGTGACTTGAGACTCAGCCACCCAGAGTGGAGCAGCGGTTACTACTGGATT
GACCCCAACCAAGGATGCACTATGGAAGCCATCAAAGTATACTGTGATTTCCCTACCGGC
GAAACCTGTATCCGGGCCCAACCTGAAAACATCCCAGCCAAGAACTGGTATAGGAGCTCC
AAGGACAAGAAACACGTCTGGCTAGGAGAAACTATCAATGCTGGCAGCCAGTTTGAATAT
AATGTTGAAGGAGTGACTTCCAAGGAAATGGCTACCCAACTTGCCTTCATGCGCCTGCTG
GCCAACTATGCCTCTCAGAACATCACCTACCACTGCAAGAACAGCATTGCATACATGGAT
GAGGAGACTGGCAACCTGAAAAAGGCTGTCATTCTACAGGGCTCTAATGATGTTGAACTT
GTTGCTGAGGGCAACAGCAGGTTCACTTACACTGTTCTTGTAGATGGCTGCTCTAAAAAG
ACAAATGAATGGGGAAAGACAATCATTGAATACAAAACAAATAAGCCATCACGCCTGCCC
TTCCTTGATATTGCACCTTTGGACATCGGTGGTGCTGACCATGAATTCTTTGTGGACATT
GGCCCAGTCTGTTTCAAATAA
|
| Protein Properties |
| Number of Residues
| 1366 |
| Molecular Weight
| 129313.6 |
| Theoretical pI
| 9.36 |
| Pfam Domain Function
|
|
| Signals
|
|
|
Transmembrane Regions
|
|
| Protein Sequence
|
>Collagen alpha-2(I) chain
MLSFVDTRTLLLLAVTLCLATCQSLQEETVRKGPAGDRGPRGERGPPGPPGRDGEDGPTG
PPGPPGPPGPPGLGGNFAAQYDGKGVGLGPGPMGLMGPRGPPGAAGAPGPQGFQGPAGEP
GEPGQTGPAGARGPAGPPGKAGEDGHPGKPGRPGERGVVGPQGARGFPGTPGLPGFKGIR
GHNGLDGLKGQPGAPGVKGEPGAPGENGTPGQTGARGLPGERGRVGAPGPAGARGSDGSV
GPVGPAGPIGSAGPPGFPGAPGPKGEIGAVGNAGPAGPAGPRGEVGLPGLSGPVGPPGNP
GANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVGAAGATGARGLVGEPGPAGSKGESGNK
GEPGSAGPQGPPGPSGEEGKRGPNGEAGSAGPPGPPGLRGSPGSRGLPGADGRAGVMGPP
GSRGASGPAGVRGPNGDAGRPGEPGLMGPRGLPGSPGNIGPAGKEGPVGLPGIDGRPGPI
GPAGARGEPGNIGFPGPKGPTGDPGKNGDKGHAGLAGARGAPGPDGNNGAQGPPGPQGVQ
GGKGEQGPPGPPGFQGLPGPSGPAGEVGKPGERGLHGEFGLPGPAGPRGERGPPGESGAA
GPTGPIGSRGPSGPPGPDGNKGEPGVVGAVGTAGPSGPSGLPGERGAAGIPGGKGEKGEP
GLRGEIGNPGRDGARGAPGAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPA
GPNGFAGPAGAAGQPGAKGERGAKGPKGENGVVGPTGPVGAAGPAGPNGPPGPAGSRGDG
GPPGMTGFPGAAGRTGPPGPSGISGPPGPPGPAGKEGLRGPRGDQGPVGRTGEVGAVGPP
GFAGEKGPSGEAGTAGPPGTPGPQGLLGAPGILGLPGSRGERGLPGVAGAVGEPGPLGIA
GPPGARGPPGAVGSPGVNGAPGEAGRDGNPGNDGPPGRDGQPGHKGERGYPGNIGPVGAA
GAPGPHGPVGPAGKHGNRGETGPSGPVGPAGAVGPRGPSGPQGIRGDKGEPGEKGPRGLP
GLKGHNGLQGLPGIAGHHGDQGAPGSVGPAGPRGPAGPSGPAGKDGRTGHPGTVGPAGIR
GPQGHQGPAGPPGPPGPPGPPGVSGGGYDFGYDGDFYRADQPRSAPSLRPKDYEVDATLK
SLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTG
ETCIRAQPENIPAKNWYRSSKDKKHVWLGETINAGSQFEYNVEGVTSKEMATQLAFMRLL
ANYASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKK
TNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGADQEFFVDIGPVCFK
|
| External Links |
| GenBank ID Protein
| 179596 |
| UniProtKB/Swiss-Prot ID
| P08123 |
| UniProtKB/Swiss-Prot Entry Name
| CO1A2_HUMAN |
| PDB IDs
|
Not Available |
| GenBank Gene ID
| J03464 |
| GeneCard ID
| COL1A2 |
| GenAtlas ID
| COL1A2 |
| HGNC ID
| HGNC:2198 |
| References |
| General References
| Not Available |