P11087: Collagen alpha-1(I) chain (Mouse)

Protein names
  • Collagen alpha-1(I) chain
  • Alpha-1 type I collagen
  • CO1A1
Gene names Col1a1 (Cola1)
Organism Mus musculus
Protease Family (Not Available)
Protease ID (Not Available)
Chromosome 11

Isoforms

  • Isoform 2 of Collagen alpha-1(I) chain (P11087-2): sequence view | extended view

Sequence

        10         20         30         40         50         60 
MFSFVDLRLL LLLGATALLT HGQEDIPEVS CIHNGLRVPN GETWKPEVCL ICICHNGTAV 
        70         80         90        100        110        120 
CDDVQCNEEL DCPNPQRREG ECCAFCPEEY VSPNSEDVGV EGPKGDPGPQ GPRGPVGPPG 
       130        140        150        160        170        180 
RDGIPGQPGL PGPPGPPGPP GPPGLGGNFA SQMSYGYDEK SAGVSVPGPM GPSGPRGLPG 
       190        200        210        220        230        240 
PPGAPGPQGF QGPPGEPGEP GGSGPMGPRG PPGPPGKNGD DGEAGKPGRP GERGPPGPQG 
       250        260        270        280        290        300 
ARGLPGTAGL PGMKGHRGFS GLDGAKGDAG PAGPKGEPGS PGENGAPGQM GPRGLPGERG 
       310        320        330        340        350        360 
RPGPPGTAGA RGNDGAVGAA GPPGPTGPTG PPGFPGAVGA KGEAGPQGAR GSEGPQGVRG 
       370        380        390        400        410        420 
EPGPPGPAGA AGPAGNPGAD GQPGAKGANG APGIAGAPGF PGARGPSGPQ GPSGPPGPKG 
       430        440        450        460        470        480 
NSGEPGAPGN KGDTGAKGEP GATGVQGPPG PAGEEGKRGA RGEPGPSGLP GPPGERGGPG 
       490        500        510        520        530        540 
SRGFPGADGV AGPKGPSGER GAPGPAGPKG SPGEAGRPGE AGLPGAKGLT GSPGSPGPDG 
       550        560        570        580        590        600 
KTGPPGPAGQ DGRPGPAGPP GARGQAGVMG FPGPKGTAGE PGKAGERGLP GPPGAVGPAG 
       610        620        630        640        650        660 
KDGEAGAQGA PGPAGPAGER GEQGPAGSPG FQGLPGPAGP PGEAGKPGEQ GVPGDLGAPG 
       670        680        690        700        710        720 
PSGARGERGF PGERGVQGPP GPAGPRGNNG APGNDGAKGD TGAPGAPGSQ GAPGLQGMPG 
       730        740        750        760        770        780 
ERGAAGLPGP KGDRGDAGPK GADGSPGKDG ARGLTGPIGP PGPAGAPGDK GEAGPSGPPG 
       790        800        810        820        830        840 
PTGARGAPGD RGEAGPPGPA GFAGPPGADG QPGAKGEPGD TGVKGDAGPP GPAGPAGPPG 
       850        860        870        880        890        900 
PIGNVGAPGP KGPRGAAGPP GATGFPGAAG RVGPPGPSGN AGPPGPPGPV GKEGGKGPRG 
       910        920        930        940        950        960 
ETGPAGRPGE VGPPGPPGPA GEKGSPGADG PAGSPGTPGP QGIAGQRGVV GLPGQRGERG 
       970        980        990       1000       1010       1020 
FPGLPGPSGE PGKQGPSGSS GERGPPGPMG PPGLAGPPGE SGREGSPGAE GSPGRDGAPG 
      1030       1040       1050       1060       1070       1080 
AKGDRGETGP AGPPGAPGAP GAPGPVGPAG KNGDRGETGP AGPAGPIGPA GARGPAGPQG 
      1090       1100       1110       1120       1130       1140 
PRGDKGETGE QGDRGIKGHR GFSGLQGPPG SPGSPGEQGP SGASGPAGPR GPPGSAGSPG 
      1150       1160       1170       1180       1190       1200 
KDGLNGLPGP IGPPGPRGRT GDSGPAGPPG PPGPPGPPGP PSGGYDFSFL PQPPQEKSQD 
      1210       1220       1230       1240       1250       1260 
GGRYYRADDA NVVRDRDLEV DTTLKSLSQQ IENIRSPEGS RKNPARTCRD LKMCHSDWKS 
      1270       1280       1290       1300       1310       1320 
GEYWIDPNQG CNLDAIKVYC NMETGQTCVF PTQPSVPQKN WYISPNPKEK KHVWFGESMT 
      1330       1340       1350       1360       1370       1380 
DGFPFEYGSE GSDPADVAIQ LTFLRLMSTE ASQNITYHCK NSVAYMDQQT GNLKKALLLQ 
      1390       1400       1410       1420       1430       1440 
GSNEIELRGE GNSRFTYSTL VDGCTSHTGT WGKTVIEYKT TKTSRLPIID VAPLDIGAPD 
      1450    
QEFGLDIGPA CFV

Annotation ?

Network neighborhood
[show protein-protein interactions]
 
100
200
300
400
500
600
700
800
900
1000
1100
1200
1300
1400
Chains:
 
 
N-termini:

NNNNNNNNNN
 
C-termini:
CCCCCCC
 
Cleavage sites:
     
 
 
Binding & active sites:
     
 
Sequence variations:
          
 
Modifications:
    
 
(EKSAG(163)|(164)VSVPG)
163,1453

Collagenase 3 (more...)

Evidence ? Source (database) ? Source (laboratory) ? Name ? Directness of identification ? Phys Relevance
inferred from experiment MEROPS Jaenisch R Liu X et al.:A targeted mutation at the know... (M10.013) unknown yes

Targeted features: ?

163,1453,1207,152 CHAIN - Collagen alpha-1(I) chain. (152|1207)
163,1453,167,152 REGION - Nonhelical region (N-terminal). (152|167)
 
(PGPQG(189)|(190)FQGPP)
189,1453

72 kDa type IV collagenase (more...)

Evidence ? Source (database) ? Source (laboratory) ? Name ? Directness of identification ? Phys Relevance
inferred from experiment MEROPS Merops MMP2_MOUSE -> CO1A1_MOUSE @189 unknown none

Targeted features: ?

189,1453,1207,152 CHAIN - Collagen alpha-1(I) chain. (152|1207)
189,1453,1181,168 REGION - Triple-helical region. (168|1181)
 
(PGAKG(528)|(529)LTGSP)
528,1453

72 kDa type IV collagenase (more...)

Evidence ? Source (database) ? Source (laboratory) ? Name ? Directness of identification ? Phys Relevance
inferred from experiment MEROPS Merops MMP2_MOUSE -> CO1A1_MOUSE @528 unknown none

Targeted features: ?

528,1453,1207,152 CHAIN - Collagen alpha-1(I) chain. (152|1207)
528,1453,1181,168 REGION - Triple-helical region. (168|1181)
 
(AGQRG(948)|(949)VVGLP)
948,1453

72 kDa type IV collagenase (more...)

Evidence ? Source (database) ? Source (laboratory) ? Name ? Directness of identification ? Phys Relevance
inferred from experiment MEROPS Merops MMP2_MOUSE -> CO1A1_MOUSE @948 unknown none

Targeted features: ?

948,1453,1030,803 VAR_SEQ - Missing (in isoform 2). (803|1030)
948,1453,1207,152 CHAIN - Collagen alpha-1(I) chain. (152|1207)
948,1453,1181,168 REGION - Triple-helical region. (168|1181)
 
(MGPPG(993)|(994)LAGPP)
993,1453

72 kDa type IV collagenase (more...)

Evidence ? Source (database) ? Source (laboratory) ? Name ? Directness of identification ? Phys Relevance
inferred from experiment MEROPS Overall CM Prudova A et al.:Multiplex N-terminome analysis ... (M10.004) unknown none

Matrix metalloproteinase-9 (more...)

Evidence ? Source (database) ? Source (laboratory) ? Name ? Directness of identification ? Phys Relevance
inferred from experiment MEROPS Overall CM Prudova A et al.:Multiplex N-terminome analysis ... (M10.004) unknown none

Targeted features: ?

993,1453,1030,803 VAR_SEQ - Missing (in isoform 2). (803|1030)
993,1453,1207,152 CHAIN - Collagen alpha-1(I) chain. (152|1207)
993,1453,1181,168 REGION - Triple-helical region. (168|1181)
 
unknown-1MFSFVD...
1,1453

Evidence: (more...)

Evidence ? Source (database) ? Source (laboratory) ? Name ? Directness of identification ? Phys Relevance Confidence
inferred from isoform by sequence similarity TopFIND inferred from TNt71257 indirect unknown (unknown)

Affected feature boundaries: ?

1,1453,22,1 SIGNAL - (1|22)
 
unknown-98VGVEGP...
98,1453

Evidence: (more...)

Evidence ? Source (database) ? Source (laboratory) ? Name ? Directness of identification ? Phys Relevance Confidence
inferred from electronic annotation TISdb inferred from TISdb unknown unknown (unknown)
inferred from isoform by sequence similarity TopFIND inferred from TNt196913 indirect unknown (unknown)
 
unknown-152QMSYGY...
152,1453

Evidence: (more...)

Evidence ? Source (database) ? Source (laboratory) ? Name ? Directness of identification ? Phys Relevance Confidence
inferred from electronic annotation UniProtKB inferred from uniprot unknown unknown 0.0 (unknown)
inferred from isoform by sequence similarity TopFIND inferred from TNt115964 indirect unknown (unknown)

Affected feature boundaries: ?

152,1453,1207,152 CHAIN - Collagen alpha-1(I) chain. (152|1207)
152,1453,152,152 MOD_RES - Pyrrolidone carboxylic acid (By similarity). (152|152)
152,1453,167,152 REGION - Nonhelical region (N-terminal). (152|167)
 
unknown-164VSVPGP...
164,1453

Evidence: (more...)

Evidence ? Source (database) ? Source (laboratory) ? Name ? Directness of identification ? Phys Relevance Confidence
inferred from cleavage TopFIND Inferred from cleavage TC32749 indirect unknown (unknown)
inferred from isoform by sequence similarity TopFIND inferred from TNt145325 indirect unknown (unknown)
 
unknown-190FQGPPG...
190,1453

Evidence: (more...)

Evidence ? Source (database) ? Source (laboratory) ? Name ? Directness of identification ? Phys Relevance Confidence
inferred from cleavage TopFIND Inferred from cleavage TC33328 indirect unknown (unknown)
inferred from isoform by sequence similarity TopFIND inferred from TNt146237 indirect unknown (unknown)
 
unknown-330GPPGFP...
330,1453

Evidence: (more...)

Evidence ? Source (database) ? Source (laboratory) ? Name ? Directness of identification ? Phys Relevance Confidence
inferred from electronic annotation TISdb inferred from TISdb unknown unknown (unknown)
inferred from isoform by sequence similarity TopFIND inferred from TNt199011 indirect unknown (unknown)
 
unknown-438GEPGAT...
438,1453

Evidence: (more...)

Evidence ? Source (database) ? Source (laboratory) ? Name ? Directness of identification ? Phys Relevance Confidence
inferred from electronic annotation TISdb inferred from TISdb unknown unknown (unknown)
inferred from isoform by sequence similarity TopFIND inferred from TNt199493 indirect unknown (unknown)
 
unknown-529LTGSPG...
529,1453

Evidence: (more...)

Evidence ? Source (database) ? Source (laboratory) ? Name ? Directness of identification ? Phys Relevance Confidence
inferred from cleavage TopFIND Inferred from cleavage TC33330 indirect unknown (unknown)
inferred from isoform by sequence similarity TopFIND inferred from TNt153736 indirect unknown (unknown)
 
unknown-949VVGLPG...
949,1453

Evidence: (more...)

Evidence ? Source (database) ? Source (laboratory) ? Name ? Directness of identification ? Phys Relevance Confidence
inferred from cleavage TopFIND Inferred from cleavage TC33332 indirect unknown (unknown)
 
unknown-994LAGPPG...
994,1453

Evidence: (more...)

Evidence ? Source (database) ? Source (laboratory) ? Name ? Directness of identification ? Phys Relevance Confidence
inferred from cleavage TopFIND Inferred from cleavage TC33333 indirect unknown (unknown)
inferred from cleavage TopFIND Inferred from cleavage TC34620 indirect unknown (unknown)
 

Protein C-Termini [export]

...EKSAG163-unknown
163,1453

Evidence: (more...)

Evidence ? Source (database) ? Source (laboratory) ? Name ? Directness of identification ? Phys Relevance Confidence
inferred from cleavage TopFIND Inferred from cleavage TC32749 indirect unknown (unknown)
inferred from isoform by sequence similarity TopFIND inferred from TCt128616 indirect unknown (unknown)
 
...PGPQG189-unknown
189,1453

Evidence: (more...)

Evidence ? Source (database) ? Source (laboratory) ? Name ? Directness of identification ? Phys Relevance Confidence
inferred from cleavage TopFIND Inferred from cleavage TC33328 indirect unknown (unknown)
inferred from isoform by sequence similarity TopFIND inferred from TCt129541 indirect unknown (unknown)
 
...PGAKG528-unknown
528,1453

Evidence: (more...)

Evidence ? Source (database) ? Source (laboratory) ? Name ? Directness of identification ? Phys Relevance Confidence
inferred from cleavage TopFIND Inferred from cleavage TC33330 indirect unknown (unknown)
 
...AGQRG948-unknown
948,1453

Evidence: (more...)

Evidence ? Source (database) ? Source (laboratory) ? Name ? Directness of identification ? Phys Relevance Confidence
inferred from cleavage TopFIND Inferred from cleavage TC33332 indirect unknown (unknown)
 
...MGPPG993-unknown
993,1453

Evidence: (more...)

Evidence ? Source (database) ? Source (laboratory) ? Name ? Directness of identification ? Phys Relevance Confidence
inferred from cleavage TopFIND Inferred from cleavage TC33333 indirect unknown (unknown)
inferred from cleavage TopFIND Inferred from cleavage TC34620 indirect unknown (unknown)
 
...RYYRA1207-unknown
1207,1453

Evidence: (more...)

Evidence ? Source (database) ? Source (laboratory) ? Name ? Directness of identification ? Phys Relevance Confidence
inferred from electronic annotation UniProtKB inferred from uniprot unknown unknown 0.0 (unknown)

Affected feature boundaries: ?

1207,1453,1207,152 CHAIN - Collagen alpha-1(I) chain. (152|1207)
1207,1453,1207,1182 REGION - Nonhelical region (C-terminal). (1182|1207)
 
...PACFV1453-unknown
1453,1453

Evidence: (more...)

Evidence ? Source (database) ? Source (laboratory) ? Name ? Directness of identification ? Phys Relevance Confidence
inferred from isoform by sequence similarity TopFIND inferred from TCt66875 indirect unknown (unknown)

Affected feature boundaries: ?

1453,1453,1453,1218 DOMAIN - Fibrillar collagen NC1. (1218|1453)
1453,1453,1453,1208 PROPEP - C-terminal propeptide. (1208|1453)
 
Filter by evidence: ?

Directness ?

  • unknown
  • indirect

Physiological relevance ?

  • yes
  • none
  • unknown

Evidencecode ?

Method ?

Perturbation ?

Confidence greater than?

  • unknown

Experimental system ?

Certainty of Protease assignment ?

  • unknown

Evidencecode ?

Tissue distribution ?

Specific evidence ?

Derived from database?

Laboratory ?