Identification
HMDB Protein ID CDBP01584
Secondary Accession Numbers Not Available
Name Collagen alpha-2(I) chain
Description Not Available
Synonyms
  1. Alpha-2 type I collagen
Gene Name COL1A2
Protein Type Enzyme
Biological Properties
General Function Involved in extracellular matrix structural constituent
Specific Function Type I collagen is a member of group I collagen (fibrillar forming collagen)
GO Classification
Component
extracellular region part
collagen
extracellular matrix part
Function
structural molecule activity
extracellular matrix structural constituent
Cellular Location
  1. Secreted
  2. extracellular space
  3. extracellular matrix
Pathways
Gene Properties
Chromosome Location Chromosome:7
Locus 7q22.1
SNPs COL1A2
Gene Sequence
>4101 bp
ATGCTCAGCTTTGTGGATACGCGGACTTTGTTGCTGCTTGCAGTAACCTTATGCCTAGCA
ACATGCCAATCTTTACAAGAGGAAACTGTAAGAAAGGGCCCAGCCGGAGATAGAGGACCA
CGTGGAGAAAGGGGTCCACCAGGCCCCCCAGGCAGAGATGGTGAAGATGGTCCCACAGGC
CCTCCTGGTCCACCTGGTCCTCCTGGCCCCCCTGGTCTCGGTGGGAACTTTGCTGCTCAG
TATGATGGAAAAGGAGTTGGACTTGGCCCTGGACCAATGGGCTTAATGGGACCTAGAGGC
CCACCTGGTGCAGCTGGAGCCCCAGGCCCTCAAGGTTTCCAAGGACCTGCTGGTGAGCCT
GGTGAACCTGGTCAAACTGGTCCTGCAGGTGCTCGTGGTCCAGCTGGCCCTCCTGGCAAG
GCTGGTGAAGATGGTCACCCTGGAAAACCCGGACGACCTGGTGAGAGAGGAGTTGTTGGA
CCACAGGGTGCTCGTGGTTTCCCTGGAACTCCTGGACTTCCTGGCTTCAAAGGCATTAGG
GGACACAATGGTCTGGATGGATTGAAGGGACAGCCCGGTGCTCCTGGTGTGAAGGGTGAA
CCTGGTGCCCCTGGTGAAAATGGAACTCCAGGTCAAACAGGAGCCCGTGGTCTTCCTGGT
GAGAGAGGACGTGTTGGTGCCCCTGGTCCAGCTGGTGCCCGTGGAAGTGATGGAAGTGTG
GGTCCCGTAGGTCCTGCTGGTCCTAATGGGTCTGCTGGCCCTCCAGGTTTCCCAGGTGCC
CCTGGTCCCAAGGGTGAAATTGGAGCTGTTGGTAACGCTGGTCCTACTGGACCCGCCGGT
CCCCGTGGTGAAGTGGGTCTTCCAGGCCTCTCCGGCCCCGTTGGACCTCCTGGTAATCCT
GGAGCAAACGGCCTTACTGGTGCCAAGGGTGCTGCTGGCCTTCCCGGCGTTGCTGGGGCT
CCCGGCCTCCCTGGACCCCGCGGTATTCCTGGCCCTCCTGGTGCTGCCGGTACTACTGGT
GCCAGAGGACTTGTTGGTGAGCCTGGTCCAGCTGGCTCCAAAGGAGAGAGCGGTAACAAG
GGTGAGCCCGGCTCCGCTGGTCCCCAAGGTCCTCCTGGTCCCAGTGGTGAAGAAGGAAAG
AGAGGCCCTAATGGGGAAGCTGGATCTGCCGGCCCTCCAGGACCTCCTGGGCTGAGAGGT
AGTCCTGGTTCTCGTGGTCTTCCTGGAGCTGATGGCAGAGCTGGCGTCATGGGCCCTCCT
GGTAGTCGTGGTGCAAGTGGCCCTGCTGGAGTCCGAGGACCTAATGGAGATGCTGGTCGC
CCTGGGGAGCCTGGTCTCATGGGACCCAGAGGTCTTCCTGGTTCCCCTGGAAATATCGGC
CCCGCTGGAAAAGAAGGTCCTGTCGGCCTCCCTGGCATCGACGGCAGGCCTGGCCCAATT
GGCCCCGTTGGAGCAAGAGGAGAGCCTGGCAACATTGGATTCCCTGGACCCAAAGGCCCC
ACTGGTGACCCTGGCAAAAACGGTGATAAAGGTCATGCTGGTCTTGCTGGTGCTCGGGGT
GCTCCAGGTCCTGATGGAAACAATGGTGCTCAGGGACCTCCTGGACCACAGGGTGTTCAA
GGTGGAAAAGGTGAACAGGGTCCCGCTGGTCCTCCAGGCTTCCAGGGTCTGCCTGGCCCC
TCAGGTCCCGCTGGTGAAGTTGGCAAACCAGGAGAAAGGGGTCTCCATGGTGAGTTTGGT
CTCCCTGGTCCTGCTGGTCCAAGAGGGGAACGCGGTCCCCCAGGTGAGAGTGGTGCTGCC
GGTCCTACTGGTCCTATTGGAAGCCGAGGTCCTTCTGGACCCCCAGGGCCTGATGGAAAC
AAGGGTGAACCTGGTGTGGTTGGTGCTGTGGGCACTGCTGGTCCATCTGGTCCTAGTGGA
CTCCCAGGAGAGAGGGGTGCTGCTGGCATACCTGGAGGCAAGGGAGAAAAGGGTGAACCT
GGTCTCAGAGGTGAAATTGGTAACCCTGGCAGAGATGGTGCTCGTGGTGCTCATGGTGCT
GTAGGTGCCCCTGGTCCTGCTGGAGCCACAGGTGACCGGGGCGAAGCTGGGGCTGCTGGT
CCTGCTGGTCCTGCTGGTCCTCGGGGAAGCCCTGGTGAACGTGGCGAGGTCGGTCCTGCT
GGCCCCAACGGATTTGCTGGTCCGGCTGGTGCTGCTGGTCAACCGGGTGCTAAAGGAGAA
AGAGGAGGCAAAGGGCCTAAGGGTGAAAACGGTGTTGTTGGTCCCACAGGCCCCGTTGGA
GCTGCTGGCCCAGCTGGTCCAAATGGTCCCCCCGGTCCTGCTGGAAGTCGTGGTGATGGA
GGCCCCCCTGGTATGACTGGTTTCCCTGGTGCTGCTGGACGGACTGGTCCCCCAGGACCC
TCTGGTATTTCTGGCCCTCCTGGTCCCCCTGGTCCTGCTGGGAAAGAAGGGCTTCGTGGT
CCTCGTGGTGACCAAGGTCCAGTTGGCCGAACTGGAGAAGTAGGTGCAGTTGGTCCCCCT
GGCTTCGCTGGTGAGAAGGGTCCCTCTGGAGAGGCTGGTACTGCTGGACCTCCTGGCACT
CCAGGTCCTCAGGGTCTTCTTGGTGCTCCTGGTATTCTGGGTCTCCCTGGCTCGAGAGGT
GAACGTGGTCTACCTGGTGTTGCTGGTGCTGTGGGTGAACCTGGTCCTCTTGGCATTGCC
GGCCCTCCTGGGGCCCGTGGTCCTCCTGGTGCTGTGGGTAGTCCTGGAGTCAACGGTGCT
CCTGGTGAAGCTGGTCGTGATGGCAACCCTGGGAACGATGGTCCCCCAGGTCGCGATGGT
CAACCCGGACACAAGGGAGAGCGCGGTTACCCTGGCAATATTGGTCCCGTTGGTGCTGCA
GGTGCACCTGGTCCTCATGGCCCCGTGGGTCCTGCTGGCAAACATGGAAACCGTGGTGAA
ACTGGTCCTTCTGGTCCTGTTGGTCCTGCTGGTGCTGTTGGCCCAAGAGGTCCTAGTGGC
CCACAAGGCATTCGTGGCGATAAGGGAGAGCCCGGTGAAAAGGGGCCCAGAGGTCTTCCT
GGCTTCAAGGGACACAATGGATTGCAAGGTCTGCCTGGTATCGCTGGTCACCATGGTGAT
CAAGGTGCTCCTGGCTCCGTGGGTCCTGCTGGTCCTAGGGGCCCTGCTGGTCCTTCTGGC
CCTGCTGGAAAAGATGGTCGCACTGGACATCCTGGTACGGTTGGACCTGCTGGCATTCGA
GGCCCTCAGGGTCACCAAGGCCCTGCTGGCCCCCCTGGTCCCCCTGGCCCTCCTGGACCT
CCAGGTGTAAGCGGTGGTGGTTATGACTTTGGTTACGATGGAGACTTCTACAGGGCTGAC
CAGCCTCGCTCAGCACCTTCTCTCAGACCCAAGGACTATGAAGTTGATGCTACTCTGAAG
TCTCTCAACAACCAGATTGAGACCCTTCTTACTCCTGAAGGCTCTAGAAAGAACCCAGCT
CGCACATGCCGTGACTTGAGACTCAGCCACCCAGAGTGGAGCAGCGGTTACTACTGGATT
GACCCCAACCAAGGATGCACTATGGAAGCCATCAAAGTATACTGTGATTTCCCTACCGGC
GAAACCTGTATCCGGGCCCAACCTGAAAACATCCCAGCCAAGAACTGGTATAGGAGCTCC
AAGGACAAGAAACACGTCTGGCTAGGAGAAACTATCAATGCTGGCAGCCAGTTTGAATAT
AATGTTGAAGGAGTGACTTCCAAGGAAATGGCTACCCAACTTGCCTTCATGCGCCTGCTG
GCCAACTATGCCTCTCAGAACATCACCTACCACTGCAAGAACAGCATTGCATACATGGAT
GAGGAGACTGGCAACCTGAAAAAGGCTGTCATTCTACAGGGCTCTAATGATGTTGAACTT
GTTGCTGAGGGCAACAGCAGGTTCACTTACACTGTTCTTGTAGATGGCTGCTCTAAAAAG
ACAAATGAATGGGGAAAGACAATCATTGAATACAAAACAAATAAGCCATCACGCCTGCCC
TTCCTTGATATTGCACCTTTGGACATCGGTGGTGCTGACCATGAATTCTTTGTGGACATT
GGCCCAGTCTGTTTCAAATAA
Protein Properties
Number of Residues 1366
Molecular Weight 129313.6
Theoretical pI 9.36
Pfam Domain Function
Signals
  • ["1-24"]
Transmembrane Regions
  • ["None"]
Protein Sequence
>Collagen alpha-2(I) chain
MLSFVDTRTLLLLAVTLCLATCQSLQEETVRKGPAGDRGPRGERGPPGPPGRDGEDGPTG
PPGPPGPPGPPGLGGNFAAQYDGKGVGLGPGPMGLMGPRGPPGAAGAPGPQGFQGPAGEP
GEPGQTGPAGARGPAGPPGKAGEDGHPGKPGRPGERGVVGPQGARGFPGTPGLPGFKGIR
GHNGLDGLKGQPGAPGVKGEPGAPGENGTPGQTGARGLPGERGRVGAPGPAGARGSDGSV
GPVGPAGPIGSAGPPGFPGAPGPKGEIGAVGNAGPAGPAGPRGEVGLPGLSGPVGPPGNP
GANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVGAAGATGARGLVGEPGPAGSKGESGNK
GEPGSAGPQGPPGPSGEEGKRGPNGEAGSAGPPGPPGLRGSPGSRGLPGADGRAGVMGPP
GSRGASGPAGVRGPNGDAGRPGEPGLMGPRGLPGSPGNIGPAGKEGPVGLPGIDGRPGPI
GPAGARGEPGNIGFPGPKGPTGDPGKNGDKGHAGLAGARGAPGPDGNNGAQGPPGPQGVQ
GGKGEQGPPGPPGFQGLPGPSGPAGEVGKPGERGLHGEFGLPGPAGPRGERGPPGESGAA
GPTGPIGSRGPSGPPGPDGNKGEPGVVGAVGTAGPSGPSGLPGERGAAGIPGGKGEKGEP
GLRGEIGNPGRDGARGAPGAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPA
GPNGFAGPAGAAGQPGAKGERGAKGPKGENGVVGPTGPVGAAGPAGPNGPPGPAGSRGDG
GPPGMTGFPGAAGRTGPPGPSGISGPPGPPGPAGKEGLRGPRGDQGPVGRTGEVGAVGPP
GFAGEKGPSGEAGTAGPPGTPGPQGLLGAPGILGLPGSRGERGLPGVAGAVGEPGPLGIA
GPPGARGPPGAVGSPGVNGAPGEAGRDGNPGNDGPPGRDGQPGHKGERGYPGNIGPVGAA
GAPGPHGPVGPAGKHGNRGETGPSGPVGPAGAVGPRGPSGPQGIRGDKGEPGEKGPRGLP
GLKGHNGLQGLPGIAGHHGDQGAPGSVGPAGPRGPAGPSGPAGKDGRTGHPGTVGPAGIR
GPQGHQGPAGPPGPPGPPGPPGVSGGGYDFGYDGDFYRADQPRSAPSLRPKDYEVDATLK
SLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTG
ETCIRAQPENIPAKNWYRSSKDKKHVWLGETINAGSQFEYNVEGVTSKEMATQLAFMRLL
ANYASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKK
TNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGADQEFFVDIGPVCFK
GenBank ID Protein 179596
UniProtKB/Swiss-Prot ID P08123
UniProtKB/Swiss-Prot Entry Name CO1A2_HUMAN
PDB IDs Not Available
GenBank Gene ID J03464
GeneCard ID COL1A2
GenAtlas ID COL1A2
HGNC ID HGNC:2198
References
General References Not Available