BLASTX 7.6.2
Query= RU05445 /QuerySize=1437
(1436 letters)
Database: UniProt/Swiss-Prot;
462,764 sequences; 163,773,382 total letters
Score E
Sequences producing significant alignments: (bits) Value
sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis t... 239 4e-062
sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. ... 233 2e-060
sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis ... 233 2e-060
sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=A... 229 4e-059
sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Br... 228 9e-059
sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis ... 228 9e-059
sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. j... 224 1e-057
sp|P82474|CPGP2_ZINOF Zingipain-2 OS=Zingiber officinale PE=1 SV=1 218 1e-055
sp|P22895|P34_SOYBN P34 probable thiol protease OS=Glycine max P... 217 2e-055
sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=A... 204 1e-051
sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=A... 204 2e-051
sp|P82473|CPGP1_ZINOF Zingipain-1 OS=Zingiber officinale PE=1 SV=1 201 2e-050
sp|P60994|ERVB_TABDI Ervatamin-B OS=Tabernaemontana divaricata P... 200 2e-050
sp|P55097|CATK_MOUSE Cathepsin K OS=Mus musculus GN=Ctsk PE=2 SV=2 172 6e-042
sp|O35186|CATK_RAT Cathepsin K OS=Rattus norvegicus GN=Ctsk PE=2... 171 1e-041
sp|Q90686|CATK_CHICK Cathepsin K OS=Gallus gallus GN=CTSK PE=2 SV=1 167 2e-040
sp|P43236|CATK_RABIT Cathepsin K OS=Oryctolagus cuniculus GN=CTS... 166 4e-040
sp|Q3ZKN1|CATK_CANFA Cathepsin K OS=Canis familiaris GN=CTSK PE=... 166 5e-040
sp|Q9GLE3|CATK_PIG Cathepsin K OS=Sus scrofa GN=CTSK PE=2 SV=1 165 7e-040
sp|Q5E968|CATK_BOVIN Cathepsin K OS=Bos taurus GN=CTSK PE=2 SV=2 164 1e-039
sp|P43235|CATK_HUMAN Cathepsin K OS=Homo sapiens GN=CTSK PE=1 SV=1 163 3e-039
sp|P61276|CATK_MACFA Cathepsin K OS=Macaca fascicularis GN=CTSK ... 163 3e-039
sp|P61277|CATK_MACMU Cathepsin K OS=Macaca mulatta GN=CTSK PE=1 ... 163 3e-039
sp|P36184|ACP1_ENTHI Cysteine proteinase ACP1 OS=Entamoeba histo... 160 2e-038
sp|P84347|MEX2_JACME Chymomexicain OS=Jacaratia mexicana PE=1 SV=1 158 1e-037
sp|Q9R013|CATF_MOUSE Cathepsin F OS=Mus musculus GN=Ctsf PE=2 SV=1 154 2e-036
sp|Q9UBX1|CATF_HUMAN Cathepsin F OS=Homo sapiens GN=CTSF PE=1 SV=1 151 1e-035
sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium dis... 146 3e-034
sp|Q9PYY5|CATV_GVXN Viral cathepsin OS=Xestia c-nigrum granulosi... 146 4e-034
sp|P83654|ERVC_TABDI Ervatamin-C OS=Tabernaemontana divaricata P... 144 2e-033
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana
GN=RD21A PE=1 SV=1
Length = 462
Score = 239 bits (609), Expect = 4e-062
Identities = 114/198 (57%), Positives = 140/198 (70%), Gaps = 5/198 (2%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTT-NEGCSGGYMDYAFEWVISNGGI 1215
GSCWAFS+ GA+EGIN + TGDLI+LSEQELVDCDT+ NEGC+GG MDYAFE++I NGGI
Sbjct: 159 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGI 218
Query: 1214 DTESDYAYTGVDGTCNVTKEETKVVTIDGYTDVEE-TETGVFNAVLQQPISVGMDGSALD 1038
DT+ DY Y GVDGTC+ ++ KVVTID Y DV +E + AV QPIS+ ++
Sbjct: 219 DTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRA 278
Query: 1037 FQLYTGGIYDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRRNT 858
FQLY GI+DG C ++DH V+ VGYG+ENG+DYWIV+NSWG SWG GY + RN
Sbjct: 279 FQLYDSGIFDGSCG---TQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNI 335
Query: 857 DLPYGVCAVNAMASYPTK 804
G C + SYP K
Sbjct: 336 ASSSGKCGIAIEPSYPIK 353
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica
GN=Os04g0650000 PE=1 SV=2
Length = 458
Score = 233 bits (594), Expect = 2e-060
Identities = 112/199 (56%), Positives = 138/199 (69%), Gaps = 5/199 (2%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTT-NEGCSGGYMDYAFEWVISNGGI 1215
GSCWAFS+ A+EGIN + TGDLISLSEQELVDCDT+ NEGC+GG MDYAF+++I+NGGI
Sbjct: 151 GSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINNGGI 210
Query: 1214 DTESDYAYTGVDGTCNVTKEETKVVTIDGYTDV-EETETGVFNAVLQQPISVGMDGSALD 1038
DTE DY Y G D C+V ++ KVVTID Y DV +ET + AV QP+SV ++
Sbjct: 211 DTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRA 270
Query: 1037 FQLYTGGIYDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRRNT 858
FQLY+ GI+ G C +DH V VGYG+ENG+DYWIV+NSWG SWG GY + RN
Sbjct: 271 FQLYSSGIFTGKCG---TALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYVRMERNI 327
Query: 857 DLPYGVCAVNAMASYPTKE 801
G C + SYP K+
Sbjct: 328 KASSGKCGIAVEPSYPLKK 346
>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana
GN=XCP2 PE=1 SV=2
Length = 356
Score = 233 bits (594), Expect = 2e-060
Identities = 108/198 (54%), Positives = 141/198 (71%), Gaps = 5/198 (2%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTT-NEGCSGGYMDYAFEWVISNGGI 1215
GSCWAFS+ A+EGIN + TG+L +LSEQEL+DCDTT N GC+GG MDYAFE+++ NGG+
Sbjct: 160 GSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGL 219
Query: 1214 DTESDYAYTGVDGTCNVTKEETKVVTIDGYTDV-EETETGVFNAVLQQPISVGMDGSALD 1038
E DY Y+ +GTC + K+E++ VTI+G+ DV E + A+ QP+SV +D S +
Sbjct: 220 RKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGRE 279
Query: 1037 FQLYTGGIYDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRRNT 858
FQ Y+GG++DG C D +DH V VGYGS G DY IVKNSWG WG +GY L+RNT
Sbjct: 280 FQFYSGGVFDGRCGVD---LDHGVAAVGYGSSKGSDYIIVKNSWGPKWGEKGYIRLKRNT 336
Query: 857 DLPYGVCAVNAMASYPTK 804
P G+C +N MAS+PTK
Sbjct: 337 GKPEGLCGINKMASFPTK 354
>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis
thaliana GN=At3g19400 PE=2 SV=1
Length = 362
Score = 229 bits (583), Expect = 4e-059
Identities = 112/203 (55%), Positives = 140/203 (68%), Gaps = 8/203 (3%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDT--TNEGCSGGYMDYAFEWVISNGG 1218
GSCWAFS+ GA+EGIN + TG+LISLSEQELVDCD N GC GG M+YAFE+++ NGG
Sbjct: 152 GSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGG 211
Query: 1217 IDTESDYAYTGVD-GTCNVTK-EETKVVTIDGYTDV-EETETGVFNAVLQQPISVGMDGS 1047
I+T+ DY Y D G CN K T+VVTIDGY DV + E + AV QP+SV ++ S
Sbjct: 212 IETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEAS 271
Query: 1046 ALDFQLYTGGIYDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLR 867
+ FQLY G+ G C +DH V++VGYGS +GEDYWI++NSWG +WG GY L+
Sbjct: 272 SQAFQLYKSGVMTGTCG---ISLDHGVVVVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQ 328
Query: 866 RNTDLPYGVCAVNAMASYPTKES 798
RN D P+G C + M SYPTK S
Sbjct: 329 RNIDDPFGKCGIAMMPSYPTKSS 351
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus
PE=2 SV=1
Length = 328
Score = 228 bits (580), Expect = 9e-059
Identities = 109/200 (54%), Positives = 138/200 (69%), Gaps = 5/200 (2%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTT-NEGCSGGYMDYAFEWVISNGGI 1215
GSCWAFS+ A+EGIN + TG+L+SLSEQELVDCD + N+GC+GG MDYAF++++ NGG+
Sbjct: 122 GSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGL 181
Query: 1214 DTESDYAYTGVDGTCNVTKEETKVVTIDGYTDV-EETETGVFNAVLQQPISVGMDGSALD 1038
+TE DY Y G +G CN + ++VVTIDGY DV + ET + AV QP+SV +D
Sbjct: 182 NTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRA 241
Query: 1037 FQLYTGGIYDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRRNT 858
FQ Y GI+ G C +DHAV+ VGYGSENG DYWIV+NSWGT WG +GY + RN
Sbjct: 242 FQHYQSGIFTGKCG---TNMDHAVVAVGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNV 298
Query: 857 DLPYGVCAVNAMASYPTKES 798
G C + ASYP K S
Sbjct: 299 ASKSGKCGIAIEASYPVKYS 318
>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana
GN=XCP1 PE=1 SV=1
Length = 355
Score = 228 bits (580), Expect = 9e-059
Identities = 107/198 (54%), Positives = 136/198 (68%), Gaps = 5/198 (2%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTT-NEGCSGGYMDYAFEWVISNGGI 1215
GSCWAFS+ A+EGIN + TG+L SLSEQEL+DCDTT N GC+GG MDYAF+++IS GG+
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGL 218
Query: 1214 DTESDYAYTGVDGTCNVTKEETKVVTIDGYTDV-EETETGVFNAVLQQPISVGMDGSALD 1038
E DY Y +G C KE+ + VTI GY DV E + + A+ QP+SV ++ S D
Sbjct: 219 HKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRD 278
Query: 1037 FQLYTGGIYDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRRNT 858
FQ Y GG+++G C D +DH V VGYGS G DY IVKNSWG WG +G+ ++RNT
Sbjct: 279 FQFYKGGVFNGKCGTD---LDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRNT 335
Query: 857 DLPYGVCAVNAMASYPTK 804
P G+C +N MASYPTK
Sbjct: 336 GKPEGLCGINKMASYPTK 353
>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica
GN=Os04g0670200 PE=1 SV=2
Length = 466
Score = 224 bits (570), Expect = 1e-057
Identities = 103/199 (51%), Positives = 136/199 (68%), Gaps = 6/199 (3%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTT--NEGCSGGYMDYAFEWVISNGG 1218
GSCWAFS+ +E IN L TG++I+LSEQELV+C T N GC+GG MD AF+++I NGG
Sbjct: 163 GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGG 222
Query: 1217 IDTESDYAYTGVDGTCNVTKEETKVVTIDGYTDV-EETETGVFNAVLQQPISVGMDGSAL 1041
IDTE DY Y VDG C++ +E KVV+IDG+ DV + E + AV QP+SV ++
Sbjct: 223 IDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGR 282
Query: 1040 DFQLYTGGIYDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRRN 861
+FQLY G++ G C +DH V+ VGYG++NG+DYWIV+NSWG WG GY + RN
Sbjct: 283 EFQLYHSGVFSGRCG---TSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERN 339
Query: 860 TDLPYGVCAVNAMASYPTK 804
++ G C + MASYPTK
Sbjct: 340 INVTTGKCGIAMMASYPTK 358
>sp|P82474|CPGP2_ZINOF Zingipain-2 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 218 bits (553), Expect = 1e-055
Identities = 104/198 (52%), Positives = 133/198 (67%), Gaps = 5/198 (2%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTTNEGCSGGYMDYAFEWVISNGGID 1212
GSCWAFS+ A+EGIN + TGDLISLSEQ+LVDC T N GC GG+M+ AF+++++NGGI+
Sbjct: 25 GSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTANHGCRGGWMNPAFQFIVNNGGIN 84
Query: 1211 TESDYAYTGVDGTCNVTKEETKVVTIDGYTDV-EETETGVFNAVLQQPISVGMDGSALDF 1035
+E Y Y G DG CN T VV+ID Y +V E + AV QP+SV MD + DF
Sbjct: 85 SEETYPYRGQDGICNST-VNAPVVSIDSYENVPSHNEQSLQKAVANQPVSVTMDAAGRDF 143
Query: 1034 QLYTGGIYDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRRNTD 855
QLY GI+ G C+ N HA+ +VGYG+EN +D+WIVKNSWG +WG GY RN +
Sbjct: 144 QLYRSGIFTGSCNISAN---HALTVVGYGTENDKDFWIVKNSWGKNWGESGYIRAERNIE 200
Query: 854 LPYGVCAVNAMASYPTKE 801
P G C + ASYP K+
Sbjct: 201 NPDGKCGITRFASYPVKK 218
>sp|P22895|P34_SOYBN P34 probable thiol protease OS=Glycine max PE=1 SV=1
Length = 379
Score = 217 bits (552), Expect = 2e-055
Identities = 110/205 (53%), Positives = 136/205 (66%), Gaps = 11/205 (5%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTTNEGCSGGYMDYAFEWVISNGGID 1212
G WAFS+TGAIE +A+ TGDL+SLSEQELVDC +EG G+ +FEWV+ +GGI
Sbjct: 157 GRGWAFSATGAIEAAHAIATGDLVSLSEQELVDCVEESEGSYNGWQYQSFEWVLEHGGIA 216
Query: 1211 TESDYAYTGVDGTCNVTKEETKVVTIDGYTDV--------EETETGVFNAVLQQPISVGM 1056
T+ DY Y +G C K + K VTIDGY + ETE +A+L+QPISV +
Sbjct: 217 TDDDYPYRAKEGRCKANKIQDK-VTIDGYETLIMSDESTESETEQAFLSAILEQPISVSI 275
Query: 1055 DGSALDFQLYTGGIYDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYF 876
D A DF LYTGGIYDG+ P I+H VL+VGYGS +G DYWI KNSWG WG +GY
Sbjct: 276 D--AKDFHLYTGGIYDGENCTSPYGINHFVLLVGYGSADGVDYWIAKNSWGFDWGEDGYI 333
Query: 875 YLRRNTDLPYGVCAVNAMASYPTKE 801
+++RNT GVC +N ASYPTKE
Sbjct: 334 WIQRNTGNLLGVCGMNYFASYPTKE 358
>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis
thaliana GN=At4g11310 PE=2 SV=1
Length = 364
Score = 204 bits (518), Expect = 1e-051
Identities = 92/199 (46%), Positives = 135/199 (67%), Gaps = 5/199 (2%)
Frame = -1
Query: 1388 SCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTTNEGCSGGYMDYAFEWVISNGGIDT 1209
SCWAFS+ GA+EG+N + TG+L++LSEQ+L++C+ N GC GG ++ A+E+++ NGG+ T
Sbjct: 160 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKLETAYEFIMKNGGLGT 219
Query: 1208 ESDYAYTGVDGTCN-VTKEETKVVTIDGYTDV-EETETGVFNAVLQQPISVGMDGSALDF 1035
++DY Y V+G C+ KE K V IDGY ++ E+ + AV QP++ +D S+ +F
Sbjct: 220 DNDYPYKAVNGVCDGRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREF 279
Query: 1034 QLYTGGIYDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRRNTD 855
QLY G++DG C ++H V++VGYG+ENG DYW+VKNS G +WG GY + RN
Sbjct: 280 QLYESGVFDGSCG---TNLNHGVVVVGYGTENGRDYWLVKNSRGITWGEAGYMKMARNIA 336
Query: 854 LPYGVCAVNAMASYPTKES 798
P G+C + ASYP K S
Sbjct: 337 NPRGLCGIAMRASYPLKNS 355
>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis
thaliana GN=At4g11320 PE=2 SV=1
Length = 371
Score = 204 bits (517), Expect = 2e-051
Identities = 93/202 (46%), Positives = 136/202 (67%), Gaps = 5/202 (2%)
Frame = -1
Query: 1397 LLGSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTTNEGCSGGYMDYAFEWVISNGG 1218
L SCWAFS+ GA+EG+N + TG+L++LSEQ+L++C+ N GC GG ++ A+E++++NGG
Sbjct: 164 LCRSCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKVETAYEFIMNNGG 223
Query: 1217 IDTESDYAYTGVDGTC-NVTKEETKVVTIDGYTDV-EETETGVFNAVLQQPISVGMDGSA 1044
+ T++DY Y ++G C KE+ K V IDGY ++ E + AV QP++ +D S+
Sbjct: 224 LGTDNDYPYKALNGVCEGRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSS 283
Query: 1043 LDFQLYTGGIYDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRR 864
+FQLY G++DG C ++H V++VGYG+ENG DYWIVKNS G +WG GY + R
Sbjct: 284 REFQLYESGVFDGTCG---TNLNHGVVVVGYGTENGRDYWIVKNSRGDTWGEAGYMKMAR 340
Query: 863 NTDLPYGVCAVNAMASYPTKES 798
N P G+C + ASYP K S
Sbjct: 341 NIANPRGLCGIAMRASYPLKNS 362
>sp|P82473|CPGP1_ZINOF Zingipain-1 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 201 bits (509), Expect = 2e-050
Identities = 100/198 (50%), Positives = 126/198 (63%), Gaps = 5/198 (2%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTTNEGCSGGYMDYAFEWVISNGGID 1212
GSCWAF + A+EGIN + TGDLISLSEQ+LVDC T N GC GG+ AF+++I+NGGI+
Sbjct: 25 GSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCSTRNHGCEGGWPYRAFQYIINNGGIN 84
Query: 1211 TESDYAYTGVDGTCNVTKEETKVVTIDGYTDV-EETETGVFNAVLQQPISVGMDGSALDF 1035
+E Y YTG +GTC+ TKE VV+ID Y +V E + AV QP+SV MD + DF
Sbjct: 85 SEEHYPYTGTNGTCD-TKENAHVVSIDSYRNVPSNDEKSLQKAVANQPVSVTMDAAGRDF 143
Query: 1034 QLYTGGIYDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRRNTD 855
QLY GI+ G C+ N H + G +EN +DYW VKNSWG +WG GY + RN
Sbjct: 144 QLYRNGIFTGSCNISAN---HYRTVGGRETENDKDYWTVKNSWGKNWGESGYIRVERNIA 200
Query: 854 LPYGVCAVNAMASYPTKE 801
G C + SYP KE
Sbjct: 201 ESSGKCGIAISPSYPIKE 218
>sp|P60994|ERVB_TABDI Ervatamin-B OS=Tabernaemontana divaricata PE=1 SV=1
Length = 215
Score = 200 bits (508), Expect = 2e-050
Identities = 92/194 (47%), Positives = 135/194 (69%), Gaps = 6/194 (3%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTTNEGCSGGYMDYAFEWVISNGGID 1212
GSCWAFS+ A+E IN + TG LISLSEQELVDCDT + GC+GG+M+ AF+++I+NGGID
Sbjct: 23 GSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDTASHGCNGGWMNNAFQYIITNGGID 82
Query: 1211 TESDYAYTGVDGTCNVTKEETKVVTIDGYTDV-EETETGVFNAVLQQPISVGMDGSALDF 1035
T+ +Y Y+ V G+C + +VV+I+G+ V E+ + +AV QP+SV ++ + F
Sbjct: 83 TQQNYPYSAVQGSCKPYR--LRVVSINGFQRVTRNNESALQSAVASQPVSVTVEAAGAPF 140
Query: 1034 QLYTGGIYDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRRNTD 855
Q Y+ GI+ G C N H V+IVGYG+++G++YWIV+NSWG +WG +GY ++ RN
Sbjct: 141 QHYSSGIFTGPCGTAQN---HGVVIVGYGTQSGKNYWIVRNSWGQNWGNQGYIWMERNVA 197
Query: 854 LPYGVCAVNAMASY 813
G+C + + SY
Sbjct: 198 SSAGLCGIAQLPSY 211
>sp|P55097|CATK_MOUSE Cathepsin K OS=Mus musculus GN=Ctsk PE=2 SV=2
Length = 329
Score = 172 bits (435), Expect = 6e-042
Identities = 92/196 (46%), Positives = 120/196 (61%), Gaps = 9/196 (4%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTTNEGCSGGYMDYAFEWVISNGGID 1212
GSCWAFSS GA+EG TG L++LS Q LVDC T N GC GGYM AF++V NGGID
Sbjct: 137 GSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVTENYGCGGGYMTTAFQYVQQNGGID 196
Query: 1211 TESDYAYTGVDGTCNVTKEETKVVTIDGYTDVE-ETETGVFNAVLQ-QPISVGMDGSALD 1038
+E Y Y G D +C + K GY ++ E + AV + PISV +D S
Sbjct: 197 SEDAYPYVGQDESC-MYNATAKAAKCRGYREIPVGNEKALKRAVARVGPISVSIDASLAS 255
Query: 1037 FQLYTGGI-YDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRRN 861
FQ Y+ G+ YD +C D + ++HAVL+VGYG++ G +WI+KNSWG SWG +GY L RN
Sbjct: 256 FQFYSRGVYYDENC--DRDNVNHAVLVVGYGTQKGSKHWIIKNSWGESWGNKGYALLARN 313
Query: 860 TDLPYGVCAVNAMASY 813
+ C + MAS+
Sbjct: 314 KN---NACGITNMASF 326
>sp|O35186|CATK_RAT Cathepsin K OS=Rattus norvegicus GN=Ctsk PE=2 SV=1
Length = 329
Score = 171 bits (433), Expect = 1e-041
Identities = 90/196 (45%), Positives = 120/196 (61%), Gaps = 9/196 (4%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTTNEGCSGGYMDYAFEWVISNGGID 1212
GSCWAFSS GA+EG TG L++LS Q LVDC + N GC GGYM AF++V NGGID
Sbjct: 137 GSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVSENYGCGGGYMTTAFQYVQQNGGID 196
Query: 1211 TESDYAYTGVDGTCNVTKEETKVVTIDGYTDVE-ETETGVFNAVLQ-QPISVGMDGSALD 1038
+E Y Y G D +C + K GY ++ E + AV + P+SV +D S
Sbjct: 197 SEDAYPYVGQDESC-MYNATAKAAKCRGYREIPVGNEKALKRAVARVGPVSVSIDASLTS 255
Query: 1037 FQLYTGGI-YDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRRN 861
FQ Y+ G+ YD +C D + ++HAVL+VGYG++ G YWI+KNSWG SWG +GY L RN
Sbjct: 256 FQFYSRGVYYDENC--DRDNVNHAVLVVGYGTQKGNKYWIIKNSWGESWGNKGYVLLARN 313
Query: 860 TDLPYGVCAVNAMASY 813
+ C + +AS+
Sbjct: 314 KN---NACGITNLASF 326
>sp|Q90686|CATK_CHICK Cathepsin K OS=Gallus gallus GN=CTSK PE=2 SV=1
Length = 334
Score = 167 bits (421), Expect = 2e-040
Identities = 91/195 (46%), Positives = 117/195 (60%), Gaps = 7/195 (3%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTTNEGCSGGYMDYAFEWVISNGGID 1212
GSCWAFSS GA+EG TG L+SLS Q LV C + N GC GGYM AFE+V N GID
Sbjct: 142 GSCWAFSSVGALEGQLKRRTGKLLSLSPQNLVYCVSNNNGCGGGYMTNAFEYVRLNRGID 201
Query: 1211 TESDYAYTGVDGTCNVTKEETKVVTIDGYTDV-EETETGVFNAVLQ-QPISVGMDGSALD 1038
+E Y Y G D +C + K GY ++ E+ E + AV + P+SVG+D S
Sbjct: 202 SEDAYPYIGQDESC-MYSPTGKAAKCRGYREIPEDNEKALKRAVARIGPVSVGIDASLPS 260
Query: 1037 FQLYTGGIYDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRRNT 858
FQ Y+ G+Y D +P I+HAVL VGYG++ G +WI+KNSWGT WG +GY L RN
Sbjct: 261 FQFYSRGVY-YDTGCNPENINHAVLAVGYGAQKGTKHWIIKNSWGTEWGNKGYVLLARNM 319
Query: 857 DLPYGVCAVNAMASY 813
C + +AS+
Sbjct: 320 K---QTCGIANLASF 331
>sp|P43236|CATK_RABIT Cathepsin K OS=Oryctolagus cuniculus GN=CTSK PE=1 SV=1
Length = 329
Score = 166 bits (419), Expect = 4e-040
Identities = 89/196 (45%), Positives = 118/196 (60%), Gaps = 9/196 (4%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTTNEGCSGGYMDYAFEWVISNGGID 1212
GSCWAFSS GA+EG TG L++LS Q LVDC + N GC GGYM AF++V N GID
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENYGCGGGYMTNAFQYVQRNRGID 196
Query: 1211 TESDYAYTGVDGTCNVTKEETKVVTIDGYTDVEE-TETGVFNAVLQ-QPISVGMDGSALD 1038
+E Y Y G D +C + K GY ++ E E + AV + P+SV +D S
Sbjct: 197 SEDAYPYVGQDESC-MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTS 255
Query: 1037 FQLYTGGI-YDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRRN 861
FQ Y+ G+ YD +CS D ++HAVL VGYG + G +WI+KNSWG SWG +GY + RN
Sbjct: 256 FQFYSKGVYYDENCSSD--NVNHAVLAVGYGIQKGNKHWIIKNSWGESWGNKGYILMARN 313
Query: 860 TDLPYGVCAVNAMASY 813
+ C + +AS+
Sbjct: 314 KN---NACGIANLASF 326
>sp|Q3ZKN1|CATK_CANFA Cathepsin K OS=Canis familiaris GN=CTSK PE=2 SV=1
Length = 330
Score = 166 bits (418), Expect = 5e-040
Identities = 88/196 (44%), Positives = 119/196 (60%), Gaps = 9/196 (4%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTTNEGCSGGYMDYAFEWVISNGGID 1212
GSCWAFSS GA+EG TG L++LS Q LVDC + N+GC GGYM AF++V N GID
Sbjct: 138 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGID 197
Query: 1211 TESDYAYTGVDGTCNVTKEETKVVTIDGYTDVEE-TETGVFNAVLQ-QPISVGMDGSALD 1038
+E Y Y G D +C + K GY ++ E E + AV + PISV +D S
Sbjct: 198 SEDAYPYVGQDESC-MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTS 256
Query: 1037 FQLYTGGI-YDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRRN 861
FQ Y+ G+ YD +C+ D ++HAVL VGYG + G +WI+KNSWG +WG +GY + RN
Sbjct: 257 FQFYSKGVYYDENCNSD--NLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARN 314
Query: 860 TDLPYGVCAVNAMASY 813
+ C + +AS+
Sbjct: 315 KN---NACGIANLASF 327
>sp|Q9GLE3|CATK_PIG Cathepsin K OS=Sus scrofa GN=CTSK PE=2 SV=1
Length = 330
Score = 165 bits (417), Expect = 7e-040
Identities = 87/196 (44%), Positives = 119/196 (60%), Gaps = 9/196 (4%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTTNEGCSGGYMDYAFEWVISNGGID 1212
GSCWAFSS GA+EG TG L++LS Q LVDC + N+GC GGYM AF++V N GID
Sbjct: 138 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGID 197
Query: 1211 TESDYAYTGVDGTCNVTKEETKVVTIDGYTDVEE-TETGVFNAVLQ-QPISVGMDGSALD 1038
+E Y Y G D C + K GY ++ E E + AV + P+SV +D S
Sbjct: 198 SEDAYPYVGQDENC-MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTS 256
Query: 1037 FQLYTGGI-YDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRRN 861
FQ Y+ G+ YD +C+ D ++HAVL VGYG + G+ +WI+KNSWG +WG +GY + RN
Sbjct: 257 FQFYSKGVYYDENCNSD--NLNHAVLAVGYGIQKGKKHWIIKNSWGENWGNKGYILMARN 314
Query: 860 TDLPYGVCAVNAMASY 813
+ C + +AS+
Sbjct: 315 KN---NACGIANLASF 327
>sp|Q5E968|CATK_BOVIN Cathepsin K OS=Bos taurus GN=CTSK PE=2 SV=2
Length = 329
Score = 164 bits (415), Expect = 1e-039
Identities = 88/196 (44%), Positives = 117/196 (59%), Gaps = 9/196 (4%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTTNEGCSGGYMDYAFEWVISNGGID 1212
GSCWAFSS GA+EG TG L++LS Q LVDC + N+GC GGYM AF++V N GID
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGID 196
Query: 1211 TESDYAYTGVDGTCNVTKEETKVVTIDGYTDVEE-TETGVFNAVLQ-QPISVGMDGSALD 1038
+E Y Y G D C + K GY ++ E E + AV + PISV +D S
Sbjct: 197 SEDAYPYVGQDENC-MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTS 255
Query: 1037 FQLYTGGI-YDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRRN 861
FQ Y G+ YD +C+ D ++HAVL VGYG + G +WI+KNSWG +WG +GY + RN
Sbjct: 256 FQFYRKGVYYDENCNSD--NLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARN 313
Query: 860 TDLPYGVCAVNAMASY 813
+ C + +AS+
Sbjct: 314 KN---NACGIANLASF 326
>sp|P43235|CATK_HUMAN Cathepsin K OS=Homo sapiens GN=CTSK PE=1 SV=1
Length = 329
Score = 163 bits (412), Expect = 3e-039
Identities = 86/196 (43%), Positives = 118/196 (60%), Gaps = 9/196 (4%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTTNEGCSGGYMDYAFEWVISNGGID 1212
GSCWAFSS GA+EG TG L++LS Q LVDC + N+GC GGYM AF++V N GID
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGID 196
Query: 1211 TESDYAYTGVDGTCNVTKEETKVVTIDGYTDVEE-TETGVFNAVLQ-QPISVGMDGSALD 1038
+E Y Y G + +C + K GY ++ E E + AV + P+SV +D S
Sbjct: 197 SEDAYPYVGQEESC-MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTS 255
Query: 1037 FQLYTGGI-YDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRRN 861
FQ Y+ G+ YD C+ D ++HAVL VGYG + G +WI+KNSWG +WG +GY + RN
Sbjct: 256 FQFYSKGVYYDESCNSD--NLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARN 313
Query: 860 TDLPYGVCAVNAMASY 813
+ C + +AS+
Sbjct: 314 KN---NACGIANLASF 326
>sp|P61276|CATK_MACFA Cathepsin K OS=Macaca fascicularis GN=CTSK PE=2 SV=1
Length = 329
Score = 163 bits (412), Expect = 3e-039
Identities = 86/196 (43%), Positives = 118/196 (60%), Gaps = 9/196 (4%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTTNEGCSGGYMDYAFEWVISNGGID 1212
GSCWAFSS GA+EG TG L++LS Q LVDC + N+GC GGYM AF++V N GID
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGID 196
Query: 1211 TESDYAYTGVDGTCNVTKEETKVVTIDGYTDVEE-TETGVFNAVLQ-QPISVGMDGSALD 1038
+E Y Y G + +C + K GY ++ E E + AV + P+SV +D S
Sbjct: 197 SEDAYPYVGQEESC-MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTS 255
Query: 1037 FQLYTGGI-YDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRRN 861
FQ Y+ G+ YD C+ D ++HAVL VGYG + G +WI+KNSWG +WG +GY + RN
Sbjct: 256 FQFYSKGVYYDESCNSD--NLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARN 313
Query: 860 TDLPYGVCAVNAMASY 813
+ C + +AS+
Sbjct: 314 KN---NACGIANLASF 326
>sp|P61277|CATK_MACMU Cathepsin K OS=Macaca mulatta GN=CTSK PE=1 SV=1
Length = 329
Score = 163 bits (412), Expect = 3e-039
Identities = 86/196 (43%), Positives = 118/196 (60%), Gaps = 9/196 (4%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTTNEGCSGGYMDYAFEWVISNGGID 1212
GSCWAFSS GA+EG TG L++LS Q LVDC + N+GC GGYM AF++V N GID
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGID 196
Query: 1211 TESDYAYTGVDGTCNVTKEETKVVTIDGYTDVEE-TETGVFNAVLQ-QPISVGMDGSALD 1038
+E Y Y G + +C + K GY ++ E E + AV + P+SV +D S
Sbjct: 197 SEDAYPYVGQEESC-MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTS 255
Query: 1037 FQLYTGGI-YDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRRN 861
FQ Y+ G+ YD C+ D ++HAVL VGYG + G +WI+KNSWG +WG +GY + RN
Sbjct: 256 FQFYSKGVYYDESCNSD--NLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARN 313
Query: 860 TDLPYGVCAVNAMASY 813
+ C + +AS+
Sbjct: 314 KN---NACGIANLASF 326
>sp|P36184|ACP1_ENTHI Cysteine proteinase ACP1 OS=Entamoeba histolytica GN=ACP1
PE=1 SV=2
Length = 308
Score = 160 bits (404), Expect = 2e-038
Identities = 82/197 (41%), Positives = 114/197 (57%), Gaps = 8/197 (4%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTTNEGCSGGYMDYAFEWVISNGGID 1212
GSCW F +T +EG D G L S SEQ+LVDCD ++ GC GG+ + +++ N G+
Sbjct: 113 GSCWTFCTTAVLEGRVNKDLGKLYSFSEQQLVDCDASDNGCEGGHPSNSLKFIQENNGLG 172
Query: 1211 TESDYAYTGVDGTCNVTKEETKVVTIDGYTDVEE-TETGVFNAVLQQ-PISVGMDGSALD 1038
ESDY Y V GTC K+ V T+ G V + +ETG+ + + P++VGMD S
Sbjct: 173 LESDYPYKAVAGTC---KKVKNVATVTGSRRVTDGSETGLQTIIAENGPVAVGMDASRPS 229
Query: 1037 FQLYTGGIYDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRRNT 858
FQLY G D ++H V VGYGS + YWI++NSWGTSWG GYF L R++
Sbjct: 230 FQLYKKGTIYSDTKCRSRMMNHCVTAVGYGSNSNGKYWIIRNSWGTSWGDAGYFLLARDS 289
Query: 857 DLPYGVCAVNAMASYPT 807
+ +C + ++YPT
Sbjct: 290 N---NMCGIGRDSNYPT 303
>sp|P84347|MEX2_JACME Chymomexicain OS=Jacaratia mexicana PE=1 SV=1
Length = 215
Score = 158 bits (398), Expect = 1e-037
Identities = 81/195 (41%), Positives = 109/195 (55%), Gaps = 8/195 (4%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTTNEGCSGGYMDYAFEWVISNGGID 1212
GSCWAFS+ +EGIN + TG LISLSEQEL+DCD + GC GGY + ++V NGG+
Sbjct: 23 GSCWAFSTVATVEGINKIRTGKLISLSEQELLDCDRRSHGCKGGYQTGSIQYVADNGGVH 82
Query: 1211 TESDYAYTGVDGTCNVTKEETKVVTIDGYTDV-EETETGVFNAVLQQPISVGMDGSALDF 1035
TE +Y Y G C +++ V I GY V E + + QP+SV + F
Sbjct: 83 TEKEYPYEKKQGKCRAKEKKGTKVQITGYKRVPANDEISLIQGIGNQPVSVLHESKGRAF 142
Query: 1034 QLYTGGIYDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRRNTD 855
QLY GGI++G C + DHAV +GYG D KNSWG +WG +GY ++R +
Sbjct: 143 QLYKGGIFNGPCG---YKNDHAVTAIGYGKAQLLD----KNSWGPNWGEKGYIKIKRASG 195
Query: 854 LPYGVCAVNAMASYP 810
G C V + +P
Sbjct: 196 KSEGTCGVYKSSYFP 210
>sp|Q9R013|CATF_MOUSE Cathepsin F OS=Mus musculus GN=Ctsf PE=2 SV=1
Length = 462
Score = 154 bits (387), Expect = 2e-036
Identities = 88/198 (44%), Positives = 118/198 (59%), Gaps = 13/198 (6%)
Frame = -1
Query: 1397 LLGSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTTNEGCSGGYMDYAFEWVISNGG 1218
+ GSCWAFS TG +EG L+ G L+SLSEQEL+DCD ++ C GG A+ + + GG
Sbjct: 269 MCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDKACLGGLPSNAYAAIKNLGG 328
Query: 1217 IDTESDYAYTGVDGTCNVTKEETKVVTIDGYTDVEETETGVFNAVLQQ-PISVGMDGSAL 1041
++TE DY Y G TCN + + KV I+ ++ E + + Q+ PISV + +A
Sbjct: 329 LETEDDYGYQGHVQTCNFSAQMAKVY-INDSVELSRNENKIAAWLAQKGPISVAI--NAF 385
Query: 1040 DFQLYTGGI---YDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYL 870
Q Y GI + CS P IDHAVL+VGYG+ + YW +KNSWG+ WG EGY+YL
Sbjct: 386 GMQFYRHGIAHPFRPLCS--PWFIDHAVLLVGYGNRSNIPYWAIKNSWGSDWGEEGYYYL 443
Query: 869 RRNTDLPYGVCAVNAMAS 816
R + G C VN MAS
Sbjct: 444 YRGS----GACGVNTMAS 457
>sp|Q9UBX1|CATF_HUMAN Cathepsin F OS=Homo sapiens GN=CTSF PE=1 SV=1
Length = 484
Score = 151 bits (381), Expect = 1e-035
Identities = 85/197 (43%), Positives = 114/197 (57%), Gaps = 11/197 (5%)
Frame = -1
Query: 1397 LLGSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTTNEGCSGGYMDYAFEWVISNGG 1218
+ GSCWAFS TG +EG L+ G L+SLSEQEL+DCD ++ C GG A+ + + GG
Sbjct: 291 MCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGG 350
Query: 1217 IDTESDYAYTGVDGTCNVTKEETKVVTIDGYTDVEETETGVFNAVLQQPISVGMDGSALD 1038
++TE DY+Y G +CN + E+ KV D + + + PISV + +A
Sbjct: 351 LETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAI--NAFG 408
Query: 1037 FQLYTGGI---YDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLR 867
Q Y GI CS P IDHAVL+VGYG+ + +W +KNSWGT WG +GY+YL
Sbjct: 409 MQFYRHGISRPLRPLCS--PWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKGYYYLH 466
Query: 866 RNTDLPYGVCAVNAMAS 816
R + G C VN MAS
Sbjct: 467 RGS----GACGVNTMAS 479
>sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium discoideum GN=cprE
PE=2 SV=2
Length = 344
Score = 146 bits (368), Expect = 3e-034
Identities = 75/154 (48%), Positives = 98/154 (63%), Gaps = 5/154 (3%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTTNEGCSGGYMDYAFEWVISNGGID 1212
G CW+FS+TG+ EG + G+L+SLSEQ L+DC T N GC GG M YAFE++I+N GID
Sbjct: 134 GGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCSTENSGCDGGLMTYAFEYIINNNGID 193
Query: 1211 TESDYAYTGVDGTCNVTKEETKVVTIDGYTDVEE-TETGVFNAVLQQPISVGMDGSALDF 1035
TES Y Y +G C K E T+ Y V +E+ + +AV P+SV +D S F
Sbjct: 194 TESSYPYKAENGKCEY-KSENSGATLSSYKTVTAGSESSLESAVNVNPVSVAIDASHQSF 252
Query: 1034 QLYTGGI-YDGDCSDDPNEIDHAVLIVGYGSENG 936
QLYT GI Y+ +CS + +DH VL VGYGS +G
Sbjct: 253 QLYTSGIYYEPECSSE--NLDHGVLAVGYGSGSG 284
>sp|Q9PYY5|CATV_GVXN Viral cathepsin OS=Xestia c-nigrum granulosis virus
GN=VCATH PE=3 SV=1
Length = 346
Score = 146 bits (367), Expect = 4e-034
Identities = 77/181 (42%), Positives = 106/181 (58%), Gaps = 9/181 (4%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTTNEGCSGGYMDYAFEWVISNGGID 1212
GSCWAFS+ IE + + + LSEQ+LVDCD N GC+GG M +AFE +I GGI
Sbjct: 155 GSCWAFSAVANIESLYHIKHNVSLDLSEQQLVDCDKVNNGCNGGLMSWAFEGIIRAGGIS 214
Query: 1211 TESDYAYTGVDGTCNVTKEETKVVTIDGYTDVEETETGVFNAVLQQ--PISVGMDGSALD 1038
E+ Y YTGVDG C K T+ V + G + VL + P+SV +D +D
Sbjct: 215 YEAPYPYTGVDGVC---KNTTRYVQLSGCYAYDLRSEKKLRQVLHEKGPVSVAID--VVD 269
Query: 1037 FQLYTGGIYDGDCSDDPNEIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYLRRNT 858
Y G+ CS D + ++H VL+VGYG EN YW +KNSWG+ WG +G+F ++R+
Sbjct: 270 LTNYKSGVAK-HCSVD-HGLNHGVLLVGYGQENDVKYWTLKNSWGSDWGEQGFFRIKRDV 327
Query: 857 D 855
+
Sbjct: 328 N 328
>sp|P83654|ERVC_TABDI Ervatamin-C OS=Tabernaemontana divaricata PE=1 SV=1
Length = 208
Score = 144 bits (362), Expect = 2e-033
Identities = 72/148 (48%), Positives = 96/148 (64%), Gaps = 7/148 (4%)
Frame = -1
Query: 1391 GSCWAFSSTGAIEGINALDTGDLISLSEQELVDCDTTNEGCSGGYMDYAFEWVISNGGID 1212
GSCWAFS+ +E IN + TG+LISLSEQELVDCD N GC GG +A++++I+NGGID
Sbjct: 23 GSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKKNHGCLGGAFVFAYQYIINNGGID 82
Query: 1211 TESDYAYTGVDGTCNVTKEETKVVTIDGYTDVEE-TETGVFNAVLQQPISVGMDGSALDF 1035
T+++Y Y V G C +KVV+IDGY V E + AV QP +V +D S+ F
Sbjct: 83 TQANYPYKAVQGPCQAA---SKVVSIDGYNGVPFCNEXALKQAVAVQPSTVAIDASSAQF 139
Query: 1034 QLYTGGIYDGDCSDDPNEIDHAVLIVGY 951
Q Y+ GI+ G C +++H V IVGY
Sbjct: 140 QQYSSGIFSGPCG---TKLNHGVTIVGY 164
Database: UniProt/Swiss-Prot
Posted date: Thu Apr 23 14:22:33 2009
Number of letters in database: 163,773,382
Number of sequences in database: 462,764
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,390,147,223
Number of Sequences: 462764
Number of Extensions: 8390147223
Number of Successful Extensions: 74969301
Number of sequences better than 0.0: 0
|