Q01149: Collagen alpha-2(I) chain (Mouse)

Protein names
  • Collagen alpha-2(I) chain
  • Alpha-2 type I collagen
  • CO1A2
Gene names: Col1a2 (Cola2)
Organism: Mus musculus
Protease Family: (Not Available)
Protease ID: (Not Available)
Chromosome: 6

Sequence

        10         20         30         40         50         60 
MLSFVDTRTL LLLAVTSCLA TCQYLQSGSV RKGPTGDRGP RGQRGPAGPR GRDGVDGPMG 
        70         80         90        100        110        120 
PPGPPGSPGP PGSPAPPGLT GNFAAQYSDK GVSSGPGPMG LMGPRGPPGA VGAPGPQGFQ 
       130        140        150        160        170        180 
GPAGEPGEPG QTGPAGPRGP AGSPGKAGED GHPGKPGRPG ERGVVGPQGA RGFPGTPGLP 
       190        200        210        220        230        240 
GFKGVKGHSG MDGLKGQPGA QGVKGEPGAP GENGTPGQAG ARGLPGERGR VGAPGPAGAR 
       250        260        270        280        290        300 
GSDGSVGPVG PAGPIGSAGP PGFPGAPGPK GELGPVGNPG PAGPAGPRGE VGLPGLSGPV 
       310        320        330        340        350        360 
GPPGNPGTNG LTGAKGATGL PGVAGAPGLP GPRGIPGPAG AAGATGARGL VGEPGPAGSK 
       370        380        390        400        410        420 
GESGNKGEPG SVGAQGPPGP SGEEGKRGSP GEAGSAGPAG PPGLRGSPGS RGLPGADGRA 
       430        440        450        460        470        480 
GVMGPPGNRG STGPAGIRGP NGDAGRPGEP GLMGPRGLPG SPGNVGPSGK EGPVGLPGID 
       490        500        510        520        530        540 
GRPGPIGPAG PRGEAGNIGF PGPKGPSGDP GKPGERGHPG LAGARGAPGP DGNNGAQGPP 
       550        560        570        580        590        600 
GPQGVQGGKG EQGPAGPPGF QGLPGPSGTT GEVGKPGERG LPGEFGLPGP AGPRGERGTP 
       610        620        630        640        650        660 
GESGAAGPSG PIGSRGPSGA PGPDGNKGEA GAVGAPGSAG ASGPGGLPGE RGAAGIPGGK 
       670        680        690        700        710        720 
GEKGETGLRG DTGNTGRDGA RGIPGAVGAP GPAGASGDRG EAGAAGPSGP AGPRGSPGER 
       730        740        750        760        770        780 
GEVGPAGPNG FAGPAGAAGQ PGAKGEKGTK GPKGENGIVG PTGSVGAAGP SGPNGPPGPV 
       790        800        810        820        830        840 
GSRGDGGPPG MTGFPGAAGR TGPPGPSGIA GPPGPPGAAG KEGIRGPRGD QGPVGRTGET 
       850        860        870        880        890        900 
GASGPPGFVG EKGPSGEPGT AGAPGTAGPQ GLLGAPGILG LPGSRGERGL PGIAGALGEP 
       910        920        930        940        950        960 
GPLGISGPPG ARGPPGAVGS PGVNGAPGEA GRDGNPGSDG PPGRDGQPGH KGERGYPGSI 
       970        980        990       1000       1010       1020 
GPTGAAGAPG PHGSVGPAGK HGNRGEPGPA GSVGPVGAVG PRGPSGPQGI RGDKGEPGDK 
      1030       1040       1050       1060       1070       1080 
GHRGLPGLKG YSGLQGLPGL AGLHGDQGAP GPVGPAGPRG PAGPSGPVGK DGRSGQPGPV 
      1090       1100       1110       1120       1130       1140 
GPAGVRGSQG SQGPAGPPGP PGPPGPPGVS GGGYDFGFEG DFYRADQPRS QPSLRPKDYE 
      1150       1160       1170       1180       1190       1200 
VDATLKSLNN QIETLLTPEG SRKNPARTCR DLRLSHPEWN SDYYWIDPNQ GCTMDAIKVY 
      1210       1220       1230       1240       1250       1260 
CDFSTGETCI QAQPVNTPAK NSYSRAQANK HVWLGETING GSQFEYNVEG VSSKEMATQL 
      1270       1280       1290       1300       1310       1320 
AFMRLLANRA SQNITYHCKN SIAYLDEETG SLNKAVLLQG SNDVELVAEG NSRFTYSVLV 
      1330       1340       1350       1360       1370    
DGCSKKTNEW GKTIIEYKTN KPSRLPFLDI APLDIGGADQ EFRVEVGPVC FK
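
The listing above interleaves numeric ruler lines with rows of ten-residue groups. As a minimal sketch (the helper names `parse_sequence_block` and `window` are illustrative and not part of the database), the block can be flattened into one string and indexed by 1-based position. Only the first two rows (residues 1 to 120) are embedded here as an excerpt.

```python
import re

# Excerpt only: the first two sequence rows of the record (residues 1-120),
# with their ruler lines, copied from the listing above.
BLOCK = """
        10         20         30         40         50         60
MLSFVDTRTL LLLAVTSCLA TCQYLQSGSV RKGPTGDRGP RGQRGPAGPR GRDGVDGPMG
        70         80         90        100        110        120
PPGPPGSPGP PGSPAPPGLT GNFAAQYSDK GVSSGPGPMG LMGPRGPPGA VGAPGPQGFQ
"""


def parse_sequence_block(text: str) -> str:
    """Join residue groups into one string, skipping numeric ruler lines."""
    residues = []
    for line in text.splitlines():
        stripped = line.strip()
        # Ruler lines hold only digits and spaces; blank lines are skipped too.
        if not stripped or re.fullmatch(r"[\d\s]+", stripped):
            continue
        residues.append(stripped.replace(" ", ""))
    return "".join(residues)


def window(seq: str, start: int, length: int) -> str:
    """Return `length` residues beginning at 1-based position `start`."""
    return seq[start - 1 : start - 1 + length]


seq = parse_sequence_block(BLOCK)
print(len(seq))            # 120
print(window(seq, 23, 6))  # QYLQSG
print(window(seq, 86, 6))  # QYSDKG
```

The two six-residue windows match the N-termini labelled `unknown-23QYLQSG...` and `unknown-86QYSDKG...` further down in this record.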

Annotation

[Feature map along the sequence, axis ticks every 100 residues from 100 to 1300. Tracks: Chains, N-termini, C-termini, Cleavage sites, Features, Binding & active sites, Sequence variations, Modifications]
 
Cleavage sites

Evidence columns: Evidence | Source (database) | Source (laboratory) | Name | Directness of identification | Phys Relevance
(a "-" marks an empty cell)

(AGPIG(256)|(257)SAGPP)
  Protease: 72 kDa type IV collagenase
  Evidence: inferred from experiment | MEROPS | - | Merops MMP2_MOUSE -> CO1A2_MOUSE @256 | unknown | none
  Targeted features: CHAIN - Collagen alpha-2(I) chain. (86|1108)

(PGPKG(271)|(272)ELGPV)
  Protease: 72 kDa type IV collagenase
  Evidence: inferred from experiment | MEROPS | - | Merops MMP2_MOUSE -> CO1A2_MOUSE @271 | unknown | none
  Targeted features: CHAIN - Collagen alpha-2(I) chain. (86|1108)

(TGPAG(436)|(437)IRGPN)
  Protease: 72 kDa type IV collagenase
  Evidence: inferred from experiment | MEROPS | - | Merops MMP2_MOUSE -> CO1A2_MOUSE @436 | unknown | none
  Targeted features: CHAIN - Collagen alpha-2(I) chain. (86|1108)

(VGPTG(763)|(764)SVGAA)
  Protease: 72 kDa type IV collagenase
  Evidence: inferred from experiment | MEROPS | - | Merops MMP2_MOUSE -> CO1A2_MOUSE @763 | unknown | none
  Targeted features: CHAIN - Collagen alpha-2(I) chain. (86|1108)

(GGPPG(790)|(791)MTGFP)
  Protease: 72 kDa type IV collagenase
  Evidence: inferred from experiment | MEROPS | - | Merops MMP2_MOUSE -> CO1A2_MOUSE @790 | unknown | none
  Targeted features: CHAIN - Collagen alpha-2(I) chain. (86|1108)

(PGPSG(808)|(809)IAGPP)
  Protease: 72 kDa type IV collagenase
  Evidence: inferred from experiment | MEROPS | Overall CM | Prudova A et al.:Multiplex N-terminome analysis ... (M10.004) | unknown | none
  Protease: Matrix metalloproteinase-9
  Evidence: inferred from experiment | MEROPS | Overall CM | Prudova A et al.:Multiplex N-terminome analysis ... (M10.004) | unknown | none
  Targeted features: CHAIN - Collagen alpha-2(I) chain. (86|1108)

(SGPPG(847)|(848)FVGEK)
  Protease: 72 kDa type IV collagenase
  Evidence: inferred from experiment | MEROPS | - | Merops MMP2_MOUSE -> CO1A2_MOUSE @847 | unknown | none
  Targeted features: CHAIN - Collagen alpha-2(I) chain. (86|1108)

(PGLAG(1042)|(1043)LHGDQ)
  Protease: 72 kDa type IV collagenase
  Evidence: inferred from experiment | MEROPS | Overall CM | Prudova A et al.:Multiplex N-terminome analysis ... (M10.004) | unknown | none
  Targeted features: CHAIN - Collagen alpha-2(I) chain. (86|1108)

(PGPPG(1108)|(1109)VSGGG)
  Protease: 72 kDa type IV collagenase
  Evidence: inferred from experiment | MEROPS | - | Merops MMP2_MOUSE -> CO1A2_MOUSE @1108 | unknown | none
  Affected feature boundaries:
    CHAIN - Collagen alpha-2(I) chain. (86|1108)
    PROPEP - C-terminal propeptide (By similarity). (1109|1372)

(ATLKS(1147)|(1148)LNNQI)
  Protease: 72 kDa type IV collagenase
  Evidence: inferred from experiment | MEROPS | - | Merops MMP2_MOUSE -> CO1A2_MOUSE @1147 | unknown | none
  Targeted features:
    DOMAIN - Fibrillar collagen NC1. (1139|1372)
    PROPEP - C-terminal propeptide (By similarity). (1109|1372)
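
Each cleavage site above is written as five residues up to P1, the P1 position, a bar, the P1' position, and five residues from P1' onward, for example `(AGPIG(256)|(257)SAGPP)`. The sketch below rebuilds that label from a sequence fragment; the function name `format_cleavage_site` and its parameters are illustrative assumptions, and the fragment is the row of the sequence listing that covers residues 241 to 300.

```python
def format_cleavage_site(fragment: str, fragment_start: int, p1: int, flank: int = 5) -> str:
    """Render a site as (P5..P1(p1)|(p1+1)P1'..P5') from a sequence fragment.

    fragment_start is the 1-based protein position of fragment[0].
    """
    # Number of fragment residues up to and including P1.
    i = p1 - fragment_start + 1
    if i - flank < 0 or i + flank > len(fragment):
        raise ValueError("fragment does not cover the requested flanks")
    non_prime = fragment[i - flank : i]   # P5..P1
    prime = fragment[i : i + flank]       # P1'..P5'
    return f"({non_prime}({p1})|({p1 + 1}){prime})"


# Residues 241-300, copied from the sequence listing with spaces removed.
FRAGMENT = "GSDGSVGPVGPAGPIGSAGPPGFPGAPGPKGELGPVGNPGPAGPAGPRGEVGLPGLSGPV"

print(format_cleavage_site(FRAGMENT, 241, 256))  # (AGPIG(256)|(257)SAGPP)
print(format_cleavage_site(FRAGMENT, 241, 271))  # (PGPKG(271)|(272)ELGPV)
```

Both outputs reproduce the first two site labels of this record, which is a quick consistency check between the site positions and the sequence numbering.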
 
Protein N-Termini

Evidence columns: Evidence | Source (database) | Source (laboratory) | Name | Directness of identification | Phys Relevance | Confidence
(a "-" marks an empty cell)

unknown-23QYLQSG...
  Evidence: inferred from electronic annotation | UniProtKB | - | inferred from uniprot | unknown | unknown | 0.0 (unknown)
  Affected feature boundaries: PROPEP - N-terminal propeptide (By similarity). (23|85)

unknown-86QYSDKG...
  Evidence: inferred from electronic annotation | UniProtKB | - | inferred from uniprot | unknown | unknown | 0.0 (unknown)
  Affected feature boundaries:
    CHAIN - Collagen alpha-2(I) chain. (86|1108)
    MOD_RES - Pyrrolidone carboxylic acid (By similarity). (86|86)

unknown-257SAGPPG...
  Evidence: inferred from cleavage | TopFIND | - | Inferred from cleavage TC33851 | indirect | unknown | (unknown)

unknown-272ELGPVG...
  Evidence: inferred from cleavage | TopFIND | - | Inferred from cleavage TC33852 | indirect | unknown | (unknown)

unknown-391GEAGSA...
  Evidence: inferred from electronic annotation | TISdb | - | inferred from TISdb | unknown | unknown | (unknown)

unknown-437IRGPNG...
  Evidence: inferred from cleavage | TopFIND | - | Inferred from cleavage TC33853 | indirect | unknown | (unknown)

unknown-560FQGLPG...
  Evidence: - | - | Overall CM. | Kleifeldt O. et al.: Isotopic labeling of terminal amines in complex samples identifies... | direct | likely | 0.0 (unknown)

unknown-764SVGAAG...
  Evidence: inferred from cleavage | TopFIND | - | Inferred from cleavage TC33854 | indirect | unknown | (unknown)

unknown-791MTGFPG...
  Evidence: inferred from cleavage | TopFIND | - | Inferred from cleavage TC33855 | indirect | unknown | (unknown)

unknown-809IAGPPG...
  Evidence: inferred from cleavage | TopFIND | - | Inferred from cleavage TC33856 | indirect | unknown | (unknown)
  Evidence: inferred from cleavage | TopFIND | - | Inferred from cleavage TC34630 | indirect | unknown | (unknown)

unknown-848FVGEKG...
  Evidence: inferred from cleavage | TopFIND | - | Inferred from cleavage TC33857 | indirect | unknown | (unknown)

unknown-1043LHGDQG...
  Evidence: inferred from cleavage | TopFIND | - | Inferred from cleavage TC33858 | indirect | unknown | (unknown)

unknown-1109VSGGGY...
  Evidence: inferred from cleavage | TopFIND | - | Inferred from cleavage TC33859 | indirect | unknown | (unknown)
  Affected feature boundaries: PROPEP - C-terminal propeptide (By similarity). (1109|1372)

unknown-1147SLNNQI...
  Evidence: - | - | Overall CM. | Kleifeldt O. et al.: Isotopic labeling of terminal amines in complex samples identifies... | direct | likely | 0.0 (unknown)

unknown-1148LNNQIE...
  Evidence: inferred from cleavage | TopFIND | - | Inferred from cleavage TC33860 | indirect | unknown | (unknown)
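
In the entries above, an "Affected feature boundaries" line appears only where a terminus position coincides with the start of an annotated feature (23, 86 and 1109). The sketch below encodes that observation. The matching rule is an assumption read off this listing rather than documented database behaviour, and `Feature`, `FEATURES` and both helper functions are illustrative names; the C-terminal variant assumes the mirror-image rule (terminus equals feature end).

```python
from typing import List, NamedTuple


class Feature(NamedTuple):
    kind: str
    label: str
    start: int
    end: int


# Features and coordinates as they appear in this record.
FEATURES = [
    Feature("PROPEP", "N-terminal propeptide", 23, 85),
    Feature("CHAIN", "Collagen alpha-2(I) chain", 86, 1108),
    Feature("MOD_RES", "Pyrrolidone carboxylic acid", 86, 86),
    Feature("PROPEP", "C-terminal propeptide", 1109, 1372),
    Feature("DOMAIN", "Fibrillar collagen NC1", 1139, 1372),
]


def boundaries_at_n_terminus(pos: int, features: List[Feature] = FEATURES) -> List[Feature]:
    """Features whose first residue is the N-terminus position (assumed rule)."""
    return [f for f in features if f.start == pos]


def boundaries_at_c_terminus(pos: int, features: List[Feature] = FEATURES) -> List[Feature]:
    """Features whose last residue is the C-terminus position (assumed rule)."""
    return [f for f in features if f.end == pos]


print([f.kind for f in boundaries_at_n_terminus(86)])    # ['CHAIN', 'MOD_RES']
print([f.kind for f in boundaries_at_n_terminus(1147)])  # []
print([f.kind for f in boundaries_at_c_terminus(1108)])  # ['CHAIN']
```

Under this rule the N-terminus at 1147 touches no boundary even though it lies inside the NC1 domain, which agrees with the entry for 1147 carrying no feature line.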
 

Protein C-Termini

Evidence columns: Evidence | Source (database) | Source (laboratory) | Name | Directness of identification | Phys Relevance | Confidence
(a "-" marks an empty cell)

...FPGTP177-unknown
  Evidence: inferred from electronic annotation | Ensembl | - | inferred from ensembl protein ENSMUSP00000125275 | unknown | unknown | (unknown)

...AGPIG256-unknown
  Evidence: inferred from cleavage | TopFIND | - | Inferred from cleavage TC33851 | indirect | unknown | (unknown)

...PGPKG271-unknown
  Evidence: inferred from cleavage | TopFIND | - | Inferred from cleavage TC33852 | indirect | unknown | (unknown)

...TGPAG436-unknown
  Evidence: inferred from cleavage | TopFIND | - | Inferred from cleavage TC33853 | indirect | unknown | (unknown)

...VGPTG763-unknown
  Evidence: inferred from cleavage | TopFIND | - | Inferred from cleavage TC33854 | indirect | unknown | (unknown)

...GGPPG790-unknown
  Evidence: inferred from cleavage | TopFIND | - | Inferred from cleavage TC33855 | indirect | unknown | (unknown)

...PGPSG808-unknown
  Evidence: inferred from cleavage | TopFIND | - | Inferred from cleavage TC33856 | indirect | unknown | (unknown)
  Evidence: inferred from cleavage | TopFIND | - | Inferred from cleavage TC34630 | indirect | unknown | (unknown)

...SGPPG847-unknown
  Evidence: inferred from cleavage | TopFIND | - | Inferred from cleavage TC33857 | indirect | unknown | (unknown)

...PGLAG1042-unknown
  Evidence: inferred from cleavage | TopFIND | - | Inferred from cleavage TC33858 | indirect | unknown | (unknown)

...PGPPG1108-unknown
  Evidence: inferred from electronic annotation | UniProtKB | - | inferred from uniprot | unknown | unknown | 0.0 (unknown)
  Evidence: inferred from cleavage | TopFIND | - | Inferred from cleavage TC33859 | indirect | unknown | (unknown)
  Affected feature boundaries: CHAIN - Collagen alpha-2(I) chain. (86|1108)

...ATLKS1147-unknown
  Evidence: inferred from cleavage | TopFIND | - | Inferred from cleavage TC33860 | indirect | unknown | (unknown)
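
Many termini in this record carry the evidence "inferred from cleavage": a cut after P1 position p leaves a new C-terminus at p and a new N-terminus at p + 1. The sketch below applies that rule to the positions listed in this record and shows which termini are left over, that is, supported only by other evidence. The variable names are illustrative; the position lists are copied from the entries above.

```python
# P1 positions of the ten cleavage sites listed in this record.
CLEAVAGE_P1 = [256, 271, 436, 763, 790, 808, 847, 1042, 1108, 1147]

# Termini positions as listed in the N-termini and C-termini sections.
N_TERMINI = [23, 86, 257, 272, 391, 437, 560, 764, 791, 809, 848, 1043, 1109, 1147, 1148]
C_TERMINI = [177, 256, 271, 436, 763, 790, 808, 847, 1042, 1108, 1147]

# A cut between p and p + 1 creates a C-terminus at p and an N-terminus at p + 1.
neo_c = set(CLEAVAGE_P1)
neo_n = {p + 1 for p in CLEAVAGE_P1}

# Every cleavage-derived terminus should already be listed.
print(neo_n <= set(N_TERMINI), neo_c <= set(C_TERMINI))  # True True

# Termini that no listed cleavage explains.
print(sorted(set(N_TERMINI) - neo_n))  # [23, 86, 391, 560, 1147]
print(sorted(set(C_TERMINI) - neo_c))  # [177]
```

The leftovers line up with the entries whose evidence comes from UniProtKB, TISdb, Ensembl or the direct terminal-amine labeling study rather than from a cleavage record.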
 