Identification |
HMDB Protein ID
| CDBP01584 |
Secondary Accession Numbers
| Not Available |
Name
| Collagen alpha-2(I) chain |
Description
| Not Available |
Synonyms
|
- Alpha-2 type I collagen
|
Gene Name
| COL1A2 |
Protein Type
| Enzyme |
Biological Properties |
General Function
| Involved in extracellular matrix structural constituent |
Specific Function
| Type I collagen is a member of group I collagen (fibrillar forming collagen) |
GO Classification
|
Component |
extracellular region part |
collagen |
extracellular matrix part |
Function |
structural molecule activity |
extracellular matrix structural constituent |
|
Cellular Location
|
- Secreted
- extracellular space
- extracellular matrix
|
Pathways
|
Not Available
|
Gene Properties |
Chromosome Location
| Chromosome:7 |
Locus
| 7q22.1 |
SNPs
| COL1A2 |
Gene Sequence
|
>4101 bp
ATGCTCAGCTTTGTGGATACGCGGACTTTGTTGCTGCTTGCAGTAACCTTATGCCTAGCA
ACATGCCAATCTTTACAAGAGGAAACTGTAAGAAAGGGCCCAGCCGGAGATAGAGGACCA
CGTGGAGAAAGGGGTCCACCAGGCCCCCCAGGCAGAGATGGTGAAGATGGTCCCACAGGC
CCTCCTGGTCCACCTGGTCCTCCTGGCCCCCCTGGTCTCGGTGGGAACTTTGCTGCTCAG
TATGATGGAAAAGGAGTTGGACTTGGCCCTGGACCAATGGGCTTAATGGGACCTAGAGGC
CCACCTGGTGCAGCTGGAGCCCCAGGCCCTCAAGGTTTCCAAGGACCTGCTGGTGAGCCT
GGTGAACCTGGTCAAACTGGTCCTGCAGGTGCTCGTGGTCCAGCTGGCCCTCCTGGCAAG
GCTGGTGAAGATGGTCACCCTGGAAAACCCGGACGACCTGGTGAGAGAGGAGTTGTTGGA
CCACAGGGTGCTCGTGGTTTCCCTGGAACTCCTGGACTTCCTGGCTTCAAAGGCATTAGG
GGACACAATGGTCTGGATGGATTGAAGGGACAGCCCGGTGCTCCTGGTGTGAAGGGTGAA
CCTGGTGCCCCTGGTGAAAATGGAACTCCAGGTCAAACAGGAGCCCGTGGTCTTCCTGGT
GAGAGAGGACGTGTTGGTGCCCCTGGTCCAGCTGGTGCCCGTGGAAGTGATGGAAGTGTG
GGTCCCGTAGGTCCTGCTGGTCCTAATGGGTCTGCTGGCCCTCCAGGTTTCCCAGGTGCC
CCTGGTCCCAAGGGTGAAATTGGAGCTGTTGGTAACGCTGGTCCTACTGGACCCGCCGGT
CCCCGTGGTGAAGTGGGTCTTCCAGGCCTCTCCGGCCCCGTTGGACCTCCTGGTAATCCT
GGAGCAAACGGCCTTACTGGTGCCAAGGGTGCTGCTGGCCTTCCCGGCGTTGCTGGGGCT
CCCGGCCTCCCTGGACCCCGCGGTATTCCTGGCCCTCCTGGTGCTGCCGGTACTACTGGT
GCCAGAGGACTTGTTGGTGAGCCTGGTCCAGCTGGCTCCAAAGGAGAGAGCGGTAACAAG
GGTGAGCCCGGCTCCGCTGGTCCCCAAGGTCCTCCTGGTCCCAGTGGTGAAGAAGGAAAG
AGAGGCCCTAATGGGGAAGCTGGATCTGCCGGCCCTCCAGGACCTCCTGGGCTGAGAGGT
AGTCCTGGTTCTCGTGGTCTTCCTGGAGCTGATGGCAGAGCTGGCGTCATGGGCCCTCCT
GGTAGTCGTGGTGCAAGTGGCCCTGCTGGAGTCCGAGGACCTAATGGAGATGCTGGTCGC
CCTGGGGAGCCTGGTCTCATGGGACCCAGAGGTCTTCCTGGTTCCCCTGGAAATATCGGC
CCCGCTGGAAAAGAAGGTCCTGTCGGCCTCCCTGGCATCGACGGCAGGCCTGGCCCAATT
GGCCCCGTTGGAGCAAGAGGAGAGCCTGGCAACATTGGATTCCCTGGACCCAAAGGCCCC
ACTGGTGACCCTGGCAAAAACGGTGATAAAGGTCATGCTGGTCTTGCTGGTGCTCGGGGT
GCTCCAGGTCCTGATGGAAACAATGGTGCTCAGGGACCTCCTGGACCACAGGGTGTTCAA
GGTGGAAAAGGTGAACAGGGTCCCGCTGGTCCTCCAGGCTTCCAGGGTCTGCCTGGCCCC
TCAGGTCCCGCTGGTGAAGTTGGCAAACCAGGAGAAAGGGGTCTCCATGGTGAGTTTGGT
CTCCCTGGTCCTGCTGGTCCAAGAGGGGAACGCGGTCCCCCAGGTGAGAGTGGTGCTGCC
GGTCCTACTGGTCCTATTGGAAGCCGAGGTCCTTCTGGACCCCCAGGGCCTGATGGAAAC
AAGGGTGAACCTGGTGTGGTTGGTGCTGTGGGCACTGCTGGTCCATCTGGTCCTAGTGGA
CTCCCAGGAGAGAGGGGTGCTGCTGGCATACCTGGAGGCAAGGGAGAAAAGGGTGAACCT
GGTCTCAGAGGTGAAATTGGTAACCCTGGCAGAGATGGTGCTCGTGGTGCTCATGGTGCT
GTAGGTGCCCCTGGTCCTGCTGGAGCCACAGGTGACCGGGGCGAAGCTGGGGCTGCTGGT
CCTGCTGGTCCTGCTGGTCCTCGGGGAAGCCCTGGTGAACGTGGCGAGGTCGGTCCTGCT
GGCCCCAACGGATTTGCTGGTCCGGCTGGTGCTGCTGGTCAACCGGGTGCTAAAGGAGAA
AGAGGAGGCAAAGGGCCTAAGGGTGAAAACGGTGTTGTTGGTCCCACAGGCCCCGTTGGA
GCTGCTGGCCCAGCTGGTCCAAATGGTCCCCCCGGTCCTGCTGGAAGTCGTGGTGATGGA
GGCCCCCCTGGTATGACTGGTTTCCCTGGTGCTGCTGGACGGACTGGTCCCCCAGGACCC
TCTGGTATTTCTGGCCCTCCTGGTCCCCCTGGTCCTGCTGGGAAAGAAGGGCTTCGTGGT
CCTCGTGGTGACCAAGGTCCAGTTGGCCGAACTGGAGAAGTAGGTGCAGTTGGTCCCCCT
GGCTTCGCTGGTGAGAAGGGTCCCTCTGGAGAGGCTGGTACTGCTGGACCTCCTGGCACT
CCAGGTCCTCAGGGTCTTCTTGGTGCTCCTGGTATTCTGGGTCTCCCTGGCTCGAGAGGT
GAACGTGGTCTACCTGGTGTTGCTGGTGCTGTGGGTGAACCTGGTCCTCTTGGCATTGCC
GGCCCTCCTGGGGCCCGTGGTCCTCCTGGTGCTGTGGGTAGTCCTGGAGTCAACGGTGCT
CCTGGTGAAGCTGGTCGTGATGGCAACCCTGGGAACGATGGTCCCCCAGGTCGCGATGGT
CAACCCGGACACAAGGGAGAGCGCGGTTACCCTGGCAATATTGGTCCCGTTGGTGCTGCA
GGTGCACCTGGTCCTCATGGCCCCGTGGGTCCTGCTGGCAAACATGGAAACCGTGGTGAA
ACTGGTCCTTCTGGTCCTGTTGGTCCTGCTGGTGCTGTTGGCCCAAGAGGTCCTAGTGGC
CCACAAGGCATTCGTGGCGATAAGGGAGAGCCCGGTGAAAAGGGGCCCAGAGGTCTTCCT
GGCTTCAAGGGACACAATGGATTGCAAGGTCTGCCTGGTATCGCTGGTCACCATGGTGAT
CAAGGTGCTCCTGGCTCCGTGGGTCCTGCTGGTCCTAGGGGCCCTGCTGGTCCTTCTGGC
CCTGCTGGAAAAGATGGTCGCACTGGACATCCTGGTACGGTTGGACCTGCTGGCATTCGA
GGCCCTCAGGGTCACCAAGGCCCTGCTGGCCCCCCTGGTCCCCCTGGCCCTCCTGGACCT
CCAGGTGTAAGCGGTGGTGGTTATGACTTTGGTTACGATGGAGACTTCTACAGGGCTGAC
CAGCCTCGCTCAGCACCTTCTCTCAGACCCAAGGACTATGAAGTTGATGCTACTCTGAAG
TCTCTCAACAACCAGATTGAGACCCTTCTTACTCCTGAAGGCTCTAGAAAGAACCCAGCT
CGCACATGCCGTGACTTGAGACTCAGCCACCCAGAGTGGAGCAGCGGTTACTACTGGATT
GACCCCAACCAAGGATGCACTATGGAAGCCATCAAAGTATACTGTGATTTCCCTACCGGC
GAAACCTGTATCCGGGCCCAACCTGAAAACATCCCAGCCAAGAACTGGTATAGGAGCTCC
AAGGACAAGAAACACGTCTGGCTAGGAGAAACTATCAATGCTGGCAGCCAGTTTGAATAT
AATGTTGAAGGAGTGACTTCCAAGGAAATGGCTACCCAACTTGCCTTCATGCGCCTGCTG
GCCAACTATGCCTCTCAGAACATCACCTACCACTGCAAGAACAGCATTGCATACATGGAT
GAGGAGACTGGCAACCTGAAAAAGGCTGTCATTCTACAGGGCTCTAATGATGTTGAACTT
GTTGCTGAGGGCAACAGCAGGTTCACTTACACTGTTCTTGTAGATGGCTGCTCTAAAAAG
ACAAATGAATGGGGAAAGACAATCATTGAATACAAAACAAATAAGCCATCACGCCTGCCC
TTCCTTGATATTGCACCTTTGGACATCGGTGGTGCTGACCATGAATTCTTTGTGGACATT
GGCCCAGTCTGTTTCAAATAA
|
Protein Properties |
Number of Residues
| 1366 |
Molecular Weight
| 129313.6 |
Theoretical pI
| 9.36 |
Pfam Domain Function
|
|
Signals
|
|
Transmembrane Regions
|
|
Protein Sequence
|
>Collagen alpha-2(I) chain
MLSFVDTRTLLLLAVTLCLATCQSLQEETVRKGPAGDRGPRGERGPPGPPGRDGEDGPTG
PPGPPGPPGPPGLGGNFAAQYDGKGVGLGPGPMGLMGPRGPPGAAGAPGPQGFQGPAGEP
GEPGQTGPAGARGPAGPPGKAGEDGHPGKPGRPGERGVVGPQGARGFPGTPGLPGFKGIR
GHNGLDGLKGQPGAPGVKGEPGAPGENGTPGQTGARGLPGERGRVGAPGPAGARGSDGSV
GPVGPAGPIGSAGPPGFPGAPGPKGEIGAVGNAGPAGPAGPRGEVGLPGLSGPVGPPGNP
GANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVGAAGATGARGLVGEPGPAGSKGESGNK
GEPGSAGPQGPPGPSGEEGKRGPNGEAGSAGPPGPPGLRGSPGSRGLPGADGRAGVMGPP
GSRGASGPAGVRGPNGDAGRPGEPGLMGPRGLPGSPGNIGPAGKEGPVGLPGIDGRPGPI
GPAGARGEPGNIGFPGPKGPTGDPGKNGDKGHAGLAGARGAPGPDGNNGAQGPPGPQGVQ
GGKGEQGPPGPPGFQGLPGPSGPAGEVGKPGERGLHGEFGLPGPAGPRGERGPPGESGAA
GPTGPIGSRGPSGPPGPDGNKGEPGVVGAVGTAGPSGPSGLPGERGAAGIPGGKGEKGEP
GLRGEIGNPGRDGARGAPGAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPA
GPNGFAGPAGAAGQPGAKGERGAKGPKGENGVVGPTGPVGAAGPAGPNGPPGPAGSRGDG
GPPGMTGFPGAAGRTGPPGPSGISGPPGPPGPAGKEGLRGPRGDQGPVGRTGEVGAVGPP
GFAGEKGPSGEAGTAGPPGTPGPQGLLGAPGILGLPGSRGERGLPGVAGAVGEPGPLGIA
GPPGARGPPGAVGSPGVNGAPGEAGRDGNPGNDGPPGRDGQPGHKGERGYPGNIGPVGAA
GAPGPHGPVGPAGKHGNRGETGPSGPVGPAGAVGPRGPSGPQGIRGDKGEPGEKGPRGLP
GLKGHNGLQGLPGIAGHHGDQGAPGSVGPAGPRGPAGPSGPAGKDGRTGHPGTVGPAGIR
GPQGHQGPAGPPGPPGPPGPPGVSGGGYDFGYDGDFYRADQPRSAPSLRPKDYEVDATLK
SLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTG
ETCIRAQPENIPAKNWYRSSKDKKHVWLGETINAGSQFEYNVEGVTSKEMATQLAFMRLL
ANYASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKK
TNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGADQEFFVDIGPVCFK
|
External Links |
GenBank ID Protein
| 179596 |
UniProtKB/Swiss-Prot ID
| P08123 |
UniProtKB/Swiss-Prot Entry Name
| CO1A2_HUMAN |
PDB IDs
|
Not Available |
GenBank Gene ID
| J03464 |
GeneCard ID
| COL1A2 |
GenAtlas ID
| COL1A2 |
HGNC ID
| HGNC:2198 |
References |
General References
| Not Available |