BLASTX 7.6.2
Query= RU03723 /QuerySize=1843
(1842 letters)
Database: UniProt/Swiss-Prot;
462,764 sequences; 163,773,382 total letters
Score E
Sequences producing significant alignments: (bits) Value
sp|O04057|ASPR_CUCPE Aspartic proteinase OS=Cucurbita pepo PE=2 ... 730 1e-209
sp|P42210|ASPR_HORVU Phytepsin OS=Hordeum vulgare PE=1 SV=1 724 7e-208
sp|Q42456|ASPR1_ORYSJ Aspartic proteinase oryzasin-1 OS=Oryza sa... 718 5e-206
sp|P40782|CYPR1_CYNCA Cyprosin (Fragment) OS=Cynara cardunculus ... 660 1e-188
sp|P42211|ASPRX_ORYSJ Aspartic proteinase OS=Oryza sativa subsp.... 584 9e-166
sp|Q05744|CATD_CHICK Cathepsin D OS=Gallus gallus GN=CTSD PE=1 SV=1 247 3e-064
sp|O93428|CATD_CHIHA Cathepsin D OS=Chionodraco hamatus GN=ctsd ... 245 1e-063
sp|O76856|CATD_DICDI Cathepsin D OS=Dictyostelium discoideum GN=... 245 1e-063
sp|Q03168|ASPP_AEDAE Lysosomal aspartic protease OS=Aedes aegypt... 243 4e-063
sp|Q9DEX3|CATD_CLUHA Cathepsin D OS=Clupea harengus GN=ctsd PE=1... 240 2e-062
sp|Q805F3|CATEA_XENLA Cathepsin E-A OS=Xenopus laevis GN=ctse-A ... 228 1e-058
sp|P16228|CATE_RAT Cathepsin E OS=Rattus norvegicus GN=Ctse PE=1... 215 1e-054
sp|P70269|CATE_MOUSE Cathepsin E OS=Mus musculus GN=Ctse PE=1 SV=1 215 1e-054
sp|P43159|CATE_RABIT Cathepsin E OS=Oryctolagus cuniculus GN=CTS... 213 3e-054
sp|Q800A0|CATE_RANCA Cathepsin E OS=Rana catesbeiana GN=CTSE PE=... 211 2e-053
sp|Q01294|CARP_NEUCR Vacuolar protease A OS=Neurospora crassa GN... 205 8e-052
sp|P07267|CARP_YEAST Saccharopepsin OS=Saccharomyces cerevisiae ... 200 5e-050
sp|Q9GMY4|PEPC_SORUN Gastricsin OS=Sorex unguiculatus GN=PGC PE=... 198 2e-049
sp|P85137|CARDF_CYNCA Cardosin-F (Fragments) OS=Cynara carduncul... 197 3e-049
sp|P10977|CARPV_CANAL Vacuolar aspartic protease OS=Candida albi... 195 1e-048
sp|P85138|CARDG_CYNCA Cardosin-G (Fragments) OS=Cynara carduncul... 175 9e-043
sp|P85139|CARDH_CYNCA Cardosin-H (Fragments) OS=Cynara carduncul... 174 2e-042
sp|P85136|CARDE_CYNCA Cardosin-E (Fragments) OS=Cynara carduncul... 122 9e-027
sp|P80209|CATD_BOVIN Cathepsin D OS=Bos taurus GN=CTSD PE=1 SV=2 112 1e-023
sp|P18242|CATD_MOUSE Cathepsin D OS=Mus musculus GN=Ctsd PE=1 SV=1 111 2e-023
sp|P07339|CATD_HUMAN Cathepsin D OS=Homo sapiens GN=CTSD PE=1 SV=1 111 2e-023
sp|P24268|CATD_RAT Cathepsin D OS=Rattus norvegicus GN=Ctsd PE=1... 108 2e-022
>sp|O04057|ASPR_CUCPE Aspartic proteinase OS=Cucurbita pepo PE=2 SV=1
Length = 513
Score = 730 bits (1882), Expect = 1e-209
Identities = 347/491 (70%), Positives = 405/491 (82%), Gaps = 1/491 (0%)
Frame = -1
Query: 1719 SASNDGLLRVGLKKRKFDQNNRVAANLYSKNGDAVTAVIRKYNLRGTLGDDQDIDIVSLK 1540
SASNDGLLRVGLKK K D NR+AA + SK+ + + A RKYN +G LG+ D DIV+LK
Sbjct: 24 SASNDGLLRVGLKKIKLDPENRLAARVESKDAEILKAAFRKYNPKGNLGESSDTDIVALK 83
Query: 1539 NYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSLACYLHPXXXXXXXXXXXX 1360
NY+DAQY+GEI IGTPPQKFTVIFDTGSSNLWV +C FS+AC+ H
Sbjct: 84 NYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWV-LCECLFSVACHFHARYKSSRSSSYKK 142
Query: 1359 NGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILGLGFQ 1180
NG A+I+YGTGA+SGFFS D+V VGDLVVK+Q FIEAT+EP +TFLVAKFDG+LGLGFQ
Sbjct: 143 NGTSASIRYGTGAVSGFFSYDNVKVGDLVVKEQVFIEATREPSLTFLVAKFDGLLGLGFQ 202
Query: 1179 EISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADXXXXXXXXXXXXDPDHYVGEHTYVPV 1000
EI+VGNAVPVWYNMV+QGL+KEPVFSFW NRN + DP HY G+HTYVPV
Sbjct: 203 EIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNVEEEEGGEIVFGGVDPKHYRGKHTYVPV 262
Query: 999 TQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSLLVGPTTIITELNHAIGATGIVSQ 820
TQKGYWQFDMGDVLIDG+ TGFC GGC+AIADSGTSLL GPT +IT +NHAIGA G+VSQ
Sbjct: 263 TQKGYWQFDMGDVLIDGEPTGFCDGGCSAIADSGTSLLAGPTPVITMINHAIGAKGVVSQ 322
Query: 819 ECKTVVAEYGDTIIKMILAKDQPQKICSQIGLCTFDGTRGVSVGIKSVVDEDNHKSSAGL 640
+CK VVA+YG TI+ ++L++ P+KICSQI LCTFDGTRGVS+GI+SVVDE+ KSS L
Sbjct: 323 QCKAVVAQYGQTIMDLLLSEADPKKICSQINLCTFDGTRGVSMGIESVVDENAGKSSDSL 382
Query: 639 SDAMCSACEMTVVWMQNQLKQNQTQDHILDYVNQLCDRLPSPMGESAVDCAGLSSMPNVS 460
D MCS CEMTVVWMQNQL+QNQT++ I++Y+N+LCDR+PSPMG+SAVDC LSSMP VS
Sbjct: 383 HDGMCSVCEMTVVWMQNQLRQNQTKERIINYINELCDRMPSPMGQSAVDCGQLSSMPTVS 442
Query: 459 FTIGGKQFDLAPEQYVLKVGEGEVAQCISGFTALDVPPPRGPLWILGDVFMGQFHTVFDY 280
FTIGGK FDLAPE+Y+LKVGEG VAQCISGFTA D+PPPRGPLWILGDVFMG++HTVFD+
Sbjct: 443 FTIGGKIFDLAPEEYILKVGEGPVAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTVFDF 502
Query: 279 GNERIGFAEAA 247
G R+G AEAA
Sbjct: 503 GKLRVGSAEAA 513
>sp|P42210|ASPR_HORVU Phytepsin OS=Hordeum vulgare PE=1 SV=1
Length = 508
Score = 724 bits (1867), Expect = 7e-208
Identities = 346/487 (71%), Positives = 401/487 (82%), Gaps = 6/487 (1%)
Frame = -1
Query: 1707 DGLLRVGLKKRKFDQNNRVAANLYSKNGDAVTAVIRKYNLRGTLGDDQDIDIVSLKNYMD 1528
+GL+R+ LKKR D+N+RVA L +G ++ N L +++ DIV+LKNYM+
Sbjct: 28 EGLVRIALKKRPIDRNSRVATGL---SGGEEQPLLSGAN---PLRSEEEGDIVALKNYMN 81
Query: 1527 AQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSLACYLHPXXXXXXXXXXXXNGKP 1348
AQYFGEIG+GTPPQKFTVIFDTGSSNLWVPS+KCYFS+ACYLH NGKP
Sbjct: 82 AQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACYLHSRYKAGASSTYKKNGKP 141
Query: 1347 AAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILGLGFQEISV 1168
AAIQYGTG+I+G+FSED VTVGDLVVKDQEFIEATKEPGITFLVAKFDGILGLGF+EISV
Sbjct: 142 AAIQYGTGSIAGYFSEDSVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILGLGFKEISV 201
Query: 1167 GNAVPVWYNMVKQGLLKEPVFSFWFNRNADXXXXXXXXXXXXDPDHYVGEHTYVPVTQKG 988
G AVPVWY M++QGL+ +PVFSFW NR+ D DP HYVGEHTYVPVTQKG
Sbjct: 202 GKAVPVWYKMIEQGLVSDPVFSFWLNRHVDEGEGGEIIFGGMDPKHYVGEHTYVPVTQKG 261
Query: 987 YWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSLLVGPTTIITELNHAIGATGIVSQECKT 808
YWQFDMGDVL+ G++TGFCAGGCAAIADSGTSLL GPT IITE+N IGA G+VSQECKT
Sbjct: 262 YWQFDMGDVLVGGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKT 321
Query: 807 VVAEYGDTIIKMILAKDQPQKICSQIGLCTFDGTRGVSVGIKSVVDEDNHKSSAGLSDAM 628
+V++YG I+ ++LA+ QP+KICSQ+GLCTFDGTRGVS GI+SVVD++ KS+ +D M
Sbjct: 322 IVSQYGQQILDLLLAETQPKKICSQVGLCTFDGTRGVSAGIRSVVDDEPVKSNGLRADPM 381
Query: 627 CSACEMTVVWMQNQLKQNQTQDHILDYVNQLCDRLPSPMGESAVDCAGLSSMPNVSFTIG 448
CSACEM VVWMQNQL QN+TQD ILDYVNQLC+RLPSPMGESAVDC L SMP++ FTIG
Sbjct: 382 CSACEMAVVWMQNQLAQNKTQDLILDYVNQLCNRLPSPMGESAVDCGSLGSMPDIEFTIG 441
Query: 447 GKQFDLAPEQYVLKVGEGEVAQCISGFTALDVPPPRGPLWILGDVFMGQFHTVFDYGNER 268
GK+F L PE+Y+LKVGEG AQCISGFTA+D+PPPRGPLWILGDVFMG +HTVFDYG R
Sbjct: 442 GKKFALKPEEYILKVGEGAAAQCISGFTAMDIPPPRGPLWILGDVFMGPYHTVFDYGKLR 501
Query: 267 IGFAEAA 247
IGFA+AA
Sbjct: 502 IGFAKAA 508
>sp|Q42456|ASPR1_ORYSJ Aspartic proteinase oryzasin-1 OS=Oryza sativa subsp.
japonica GN=Os05g0567100 PE=2 SV=2
Length = 509
Score = 718 bits (1851), Expect = 5e-206
Identities = 345/512 (67%), Positives = 405/512 (79%), Gaps = 9/512 (1%)
Frame = -1
Query: 1773 RSVTATXXXXXXXXXXXXSASNDGLLRVGLKKRKFDQNNRVAANLYSKNGDAVTAVIRKY 1594
RSV +++ +GL+R+ LKKR D+N+RVAA L + G R+
Sbjct: 4 RSVALVLLAAVLLQALLPASAAEGLVRIALKKRPIDENSRVAARLSGEEG------ARRL 57
Query: 1593 NLRGTL---GDDQDIDIVSLKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCY 1423
LRG G + DIV+LKNYM+AQYFGEIG+GTPPQKFTVIFDTGSSNLWVPS+KCY
Sbjct: 58 GLRGANSLGGGGGEGDIVALKNYMNAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCY 117
Query: 1422 FSLACYLHPXXXXXXXXXXXXNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEAT 1243
FS+AC+ H NGKPAAIQYGTG+I+GFFSED VTVGDLVVKDQEFIEAT
Sbjct: 118 FSIACFFHSRYKSGQSSTYQKNGKPAAIQYGTGSIAGFFSEDSVTVGDLVVKDQEFIEAT 177
Query: 1242 KEPGITFLVAKFDGILGLGFQEISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADXXXXX 1063
KEPG+TF+VAKFDGILGLGFQEISVG+AVPVWY MV+QGL+ EPVFSFWFNR++D
Sbjct: 178 KEPGLTFMVAKFDGILGLGFQEISVGDAVPVWYKMVEQGLVSEPVFSFWFNRHSDEGEGG 237
Query: 1062 XXXXXXXDPDHYVGEHTYVPVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSLLV 883
DP HY G HTYVPV+QKGYWQF+MGDVLI G+TTGFCA GC+AIADSGTSLL
Sbjct: 238 EIVFGGMDPSHYKGNHTYVPVSQKGYWQFEMGDVLIGGKTTGFCASGCSAIADSGTSLLA 297
Query: 882 GPTTIITELNHAIGATGIVSQECKTVVAEYGDTIIKMILAKDQPQKICSQIGLCTFDGTR 703
GPT IITE+N IGATG+VSQECKTVV++YG I+ ++LA+ QP KICSQ+GLCTFDG
Sbjct: 298 GPTAIITEINEKIGATGVVSQECKTVVSQYGQQILDLLLAETQPSKICSQVGLCTFDGKH 357
Query: 702 GVSVGIKSVVDEDNHKSSAGLSDAMCSACEMTVVWMQNQLKQNQTQDHILDYVNQLCDRL 523
GVS GIKSVVD++ +S+ S MC+ACEM VVWMQNQL QN+TQD IL+Y+NQLCD+L
Sbjct: 358 GVSAGIKSVVDDEAGESNGLQSGPMCNACEMAVVWMQNQLAQNKTQDLILNYINQLCDKL 417
Query: 522 PSPMGESAVDCAGLSSMPNVSFTIGGKQFDLAPEQYVLKVGEGEVAQCISGFTALDVPPP 343
PSPMGES+VDC L+SMP +SFTIGGK+F L PE+Y+LKVGEG AQCISGFTA+D+PPP
Sbjct: 418 PSPMGESSVDCGSLASMPEISFTIGGKKFALKPEEYILKVGEGAAAQCISGFTAMDIPPP 477
Query: 342 RGPLWILGDVFMGQFHTVFDYGNERIGFAEAA 247
RGPLWILGDVFMG +HTVFDYG R+GFA++A
Sbjct: 478 RGPLWILGDVFMGAYHTVFDYGKMRVGFAKSA 509
>sp|P40782|CYPR1_CYNCA Cyprosin (Fragment) OS=Cynara cardunculus GN=CYPRO1 PE=1
SV=2
Length = 473
Score = 660 bits (1701), Expect = 1e-188
Identities = 313/452 (69%), Positives = 376/452 (83%), Gaps = 3/452 (0%)
Frame = -1
Query: 1602 RKYNLRGTLGDDQDIDIVSLKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCY 1423
RKY +RG D D ++++LKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCY
Sbjct: 25 RKYGVRGNF-RDSDGELIALKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCY 83
Query: 1422 FSLACYLHPXXXXXXXXXXXXNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEAT 1243
FS+AC H NGK AAIQYGTG+ISGFFS+D V +GDL+VK+Q+FIEAT
Sbjct: 84 FSVACLFHSKYRSTDSTTYKKNGKSAAIQYGTGSISGFFSQDSVKLGDLLVKEQDFIEAT 143
Query: 1242 KEPGITFLVAKFDGILGLGFQEISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADXXXXX 1063
KEPGITFL AKFDGILGLGFQEISVG+AVPVWY M+ QGL++EPVFSFW NRNAD
Sbjct: 144 KEPGITFLAAKFDGILGLGFQEISVGDAVPVWYTMLNQGLVQEPVFSFWLNRNADEQEGG 203
Query: 1062 XXXXXXXDPDHYVGEHTYVPVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSLLV 883
DP+H+ GEHTYVPVTQKGYWQF+MGDVLI +TTGFCA GCAAIADSGTSLL
Sbjct: 204 ELVFGGVDPNHFKGEHTYVPVTQKGYWQFEMGDVLIGDKTTGFCASGCAAIADSGTSLLA 263
Query: 882 GPTTIITELNHAIGATGIVSQECKTVVAEYGDTIIKMILAKDQPQKICSQIGLCTFDGTR 703
G TTI+T++N AIGA G++SQ+CK++V +YG ++I+M+L+++QP+KICSQ+ LC+FDG+
Sbjct: 264 GTTTIVTQINQAIGAAGVMSQQCKSLVDQYGKSMIEMLLSEEQPEKICSQMKLCSFDGSH 323
Query: 702 GVSVGIKSVVDEDNHKSSAGLSDAMCSACEMTVVWMQNQLKQNQTQDHILDYVNQLCDRL 523
S+ I+SVVD+ KSS GL C C VVWMQNQ++QN+T+++I++YV++LC+RL
Sbjct: 324 DTSMIIESVVDKSKGKSS-GL-PMRCVPCARWVVWMQNQIRQNETEENIINYVDKLCERL 381
Query: 522 PSPMGESAVDCAGLSSMPNVSFTIGGKQFDLAPEQYVLKVGEGEVAQCISGFTALDVPPP 343
PSPMGESAVDC+ LSSMPN++FT+GGK F+L+PEQYVLKVGEG AQCISGFTA+DV PP
Sbjct: 382 PSPMGESAVDCSSLSSMPNIAFTVGGKTFNLSPEQYVLKVGEGATAQCISGFTAMDVAPP 441
Query: 342 RGPLWILGDVFMGQFHTVFDYGNERIGFAEAA 247
GPLWILGDVFMGQ+HTVFDYGN R+GFAEAA
Sbjct: 442 HGPLWILGDVFMGQYHTVFDYGNLRVGFAEAA 473
>sp|P42211|ASPRX_ORYSJ Aspartic proteinase OS=Oryza sativa subsp. japonica
GN=RAP PE=2 SV=2
Length = 496
Score = 584 bits (1504), Expect = 9e-166
Identities = 270/439 (61%), Positives = 339/439 (77%), Gaps = 5/439 (1%)
Frame = -1
Query: 1563 DIDIVSLKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSLACYLHPXXXX 1384
D D V L +Y++ QY+G IG+G+PPQ FTVIFDTGSSNLWVPS+KCYFS+ACYLH
Sbjct: 63 DSDPVPLVDYLNTQYYGVIGLGSPPQNFTVIFDTGSSNLWVPSAKCYFSIACYLHSRYNS 122
Query: 1383 XXXXXXXXNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLVAKFD 1204
+G+ I YG+GAISGFFS+D+V VGDLVVK+Q+FIEAT+E +TF++ KFD
Sbjct: 123 KKSSSYKADGETCKITYGSGAISGFFSKDNVLVGDLVVKNQKFIEATRETSVTFIIGKFD 182
Query: 1203 GILGLGFQEISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADXXXXXXXXXXXXDPDHYV 1024
GILGLG+ EISVG A P+W +M +Q LL + VFSFW NR+ D DP HY
Sbjct: 183 GILGLGYPEISVGKAPPIWQSMQEQELLADDVFSFWLNRDPDASSGGELVFGGMDPKHYK 242
Query: 1023 GEHTYVPVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSLLVGPTTIITELNHAI 844
G+HTYVPV++KGYWQF+MGD+LIDG +TGFCA GCAAI DSGTSLL GPT I+ ++NHAI
Sbjct: 243 GDHTYVPVSRKGYWQFNMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIVAQVNHAI 302
Query: 843 GATGIVSQECKTVVAEYGDTIIKMILAKDQPQKICSQIGLCTFDGTRGVSVGIKSVVDED 664
GA GI+S ECK VV+EYG+ I+ +++A+ PQK+CSQ+GLC FDG R VS GI+SVVD++
Sbjct: 303 GAEGIISTECKEVVSEYGEMILNLLIAQTDPQKVCSQVGLCMFDGKRSVSNGIESVVDKE 362
Query: 663 NHKSSAGLSDAMCSACEMTVVWMQNQLKQNQTQDHILDYVNQLCDRLPSPMGESAVDCAG 484
N SDAMCS CEM VVW++NQL++N+T++ IL+Y NQLC+RLPSP GES V C
Sbjct: 363 NLG-----SDAMCSVCEMAVVWIENQLRENKTKELILNYANQLCERLPSPNGESTVSCHQ 417
Query: 483 LSSMPNVSFTIGGKQFDLAPEQYVLKVGEGEVAQCISGFTALDVPPPRGPLWILGDVFMG 304
+S MPN++FTI K F L PEQY++K+ +G CISGF A D+PPPRGPLWILGDVFMG
Sbjct: 418 ISKMPNLAFTIANKTFILTPEQYIVKLEQGGQTVCISGFMAFDIPPPRGPLWILGDVFMG 477
Query: 303 QFHTVFDYGNERIGFAEAA 247
+HTVFD+G +RIGFA++A
Sbjct: 478 AYHTVFDFGKDRIGFAKSA 496
>sp|Q05744|CATD_CHICK Cathepsin D OS=Gallus gallus GN=CTSD PE=1 SV=1
Length = 398
Score = 247 bits (628), Expect = 3e-064
Identities = 133/293 (45%), Positives = 178/293 (60%), Gaps = 5/293 (1%)
Frame = -1
Query: 1701 LLRVGLKKRKFDQNNRVAANLYSKNGDAVTAVIRKYNLRGTLGDDQDIDIVSLKNYMDAQ 1522
L+R+ L KF R+ + S+ D + A+ + + D + LKNYMDAQ
Sbjct: 21 LIRIPL--TKFTSTRRMLTEVGSEIPD-MNAITQFLKFKLGFADLAEPTPEILKNYMDAQ 77
Query: 1521 YFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCY-FSLACYLHPXXXXXXXXXXXXNGKPA 1345
Y+GEIGIGTPPQKFTV+FDTGSSNLWVPS C+ +AC LH NG
Sbjct: 78 YYGEIGIGTPPQKFTVVFDTGSSNLWVPSVHCHLLDIACLLHHKYDASKSSTYVENGTEF 137
Query: 1344 AIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILGLGFQEISVG 1165
AI YGTG++SGF S+D VT+G+L +K+Q F EA K+PGITF+ AKFDGILG+ F ISV
Sbjct: 138 AIHYGTGSLSGFLSQDTVTLGNLKIKNQIFGEAVKQPGITFIAAKFDGILGMAFPRISVD 197
Query: 1164 NAVPVWYNMVKQGLLKEPVFSFWFNRNADXXXXXXXXXXXXDPDHYVGEHTYVPVTQKGY 985
P + N+++Q L+++ +FSF+ NR+ DP +Y G+ ++V VT+K Y
Sbjct: 198 KVTPFFDNVMQQKLIEKNIFSFYLNRDPTAQPGGELLLGGTDPKYYSGDFSWVNVTRKAY 257
Query: 984 WQFDMGDVLIDGQTTGFCAGGCAAIADSGTSLLVGPTTIITELNHAIGATGIV 826
WQ M V + T C GGC AI D+GTSL+ GPT + EL AIGA ++
Sbjct: 258 WQVHMDSVDVANGLT-LCKGGCEAIVDTGTSLITGPTKEVKELQTAIGAKPLI 309
>sp|O93428|CATD_CHIHA Cathepsin D OS=Chionodraco hamatus GN=ctsd PE=1 SV=2
Length = 396
Score = 245 bits (624), Expect = 1e-063
Identities = 134/313 (42%), Positives = 180/313 (57%), Gaps = 19/313 (6%)
Frame = -1
Query: 1713 SNDGLLRVGLKK-----RKFDQNNRVAANLYSKNGDAVTAVIRKYNLRGTLGDDQDIDIV 1549
+ND L+R+ LKK R+ + + A L + + KYNL + +
Sbjct: 15 TNDALVRIPLKKFRSIRRQLTDSGKRAEELLADHHSL------KYNLSFPASNAPTPE-- 66
Query: 1548 SLKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKC-YFSLACYLHPXXXXXXXX 1372
+LKNY+DAQY+GEIG+GTPPQ FTV+FDTGSSNLWVPS C +AC LH
Sbjct: 67 TLKNYLDAQYYGEIGLGTPPQPFTVVFDTGSSNLWVPSIHCSLLDIACLLHHKYNSGKSS 126
Query: 1371 XXXXNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILG 1192
NG AIQYG+G++SG+ S+D T+GDL + Q F EA K+PG+ F+ AKFDGILG
Sbjct: 127 TYVKNGTAFAIQYGSGSLSGYLSQDTCTIGDLAIDSQLFGEAIKQPGVAFIAAKFDGILG 186
Query: 1191 LGFQEISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADXXXXXXXXXXXXDPDHYVGEHT 1012
+ + ISV PV+ N++ Q +++ VFSF+ NRN D DP +Y G+
Sbjct: 187 MAYPRISVDGVAPVFDNIMSQKKVEQNVFSFYLNRNPDTEPGGELLLGGTDPKYYTGDFN 246
Query: 1011 YVPVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSLLVGPTTIITELNHAIGATG 832
YV VT++ YWQ + D + G C GGC AI DSGTSL+ GP+ + L AIGA
Sbjct: 247 YVNVTRQAYWQIRV-DSMAVGDQLSLCTGGCEAIVDSGTSLITGPSVEVKALQKAIGAFP 305
Query: 831 IVSQE----CKTV 805
++ E C TV
Sbjct: 306 LIQGEYMVNCDTV 318
Score = 109 bits (270), Expect = 1e-022
Identities = 49/91 (53%), Positives = 64/91 (70%)
Frame = -1
Query: 522 PSPMGESAVDCAGLSSMPNVSFTIGGKQFDLAPEQYVLKVGEGEVAQCISGFTALDVPPP 343
P GE V+C + S+P +SFT+GG+ + L EQY+LKV + C+SGF LD+P P
Sbjct: 305 PLIQGEYMVNCDTVPSLPVISFTVGGQVYTLTGEQYILKVTQAGKTMCLSGFMGLDIPAP 364
Query: 342 RGPLWILGDVFMGQFHTVFDYGNERIGFAEA 250
GPLWILGDVFMGQ++TVFD R+GFA+A
Sbjct: 365 AGPLWILGDVFMGQYYTVFDRDANRVGFAKA 395
>sp|O76856|CATD_DICDI Cathepsin D OS=Dictyostelium discoideum GN=ctsD PE=1 SV=1
Length = 383
Score = 245 bits (624), Expect = 1e-063
Identities = 125/239 (52%), Positives = 150/239 (62%), Gaps = 2/239 (0%)
Frame = -1
Query: 1551 VSLKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFS-LACYLHPXXXXXXX 1375
+ + ++ DAQY+G I IGTP Q F V+FDTGSSNLW+PS KC + +AC LH
Sbjct: 53 IPISDFEDAQYYGAITIGTPGQAFKVVFDTGSSNLWIPSKKCPITVVACDLHNKYNSGAS 112
Query: 1374 XXXXXNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLVAKFDGIL 1195
NG IQYG+GA+SGF S+D VTVG L VKDQ F EAT EPGI F AKFDGIL
Sbjct: 113 STYVANGTDFTIQYGSGAMSGFVSQDSVTVGSLTVKDQLFAEATAEPGIAFDFAKFDGIL 172
Query: 1194 GLGFQEISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADXXXXXXXXXXXXDPDHYVGEH 1015
GL FQ ISV + PV+YNM+ QGL+ +FSFW +R D Y G+
Sbjct: 173 GLAFQSISVNSIPPVFYNMLSQGLVSSTLFSFWLSR-TPGANGGELSFGSIDNTKYTGDI 231
Query: 1014 TYVPVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSLLVGPTTIITELNHAIGA 838
TYVP+T + YW+F M D IDGQ+ GFC C AI DSGTSL+ GP IT LN +GA
Sbjct: 232 TYVPLTNETYWEFVMDDFAIDGQSAGFCGTTCHAICDSGTSLIAGPMADITALNEKLGA 290
>sp|Q03168|ASPP_AEDAE Lysosomal aspartic protease OS=Aedes aegypti GN=AAEL006169
PE=1 SV=2
Length = 387
Score = 243 bits (619), Expect = 4e-063
Identities = 121/244 (49%), Positives = 153/244 (62%), Gaps = 3/244 (1%)
Frame = -1
Query: 1545 LKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYF-SLACYLHPXXXXXXXXX 1369
L NY+DAQY+G I IGTPPQ F V+FDTGSSNLWVPS +C F ++AC +H
Sbjct: 60 LSNYLDAQYYGAITIGTPPQSFKVVFDTGSSNLWVPSKECSFTNIACLMHNKYNAKKSST 119
Query: 1368 XXXNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILGL 1189
NG IQYG+G++SG+ S D V +G + V Q F EA EPG+ F+ AKFDGILGL
Sbjct: 120 FEKNGTAFHIQYGSGSLSGYLSTDTVGLGGVSVTKQTFAEAINEPGLVFVAAKFDGILGL 179
Query: 1188 GFQEISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADXXXXXXXXXXXXDPDHYVGEHTY 1009
G+ ISV VPV+YNM QGL+ PVFSF+ NR+ D + Y G+ TY
Sbjct: 180 GYSSISVDGVVPVFYNMFNQGLIDAPVFSFYLNRDPSAAEGGEIIFGGSDSNKYTGDFTY 239
Query: 1008 VPVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSLLVGPTTIITELNHAIGATGI 829
+ V +K YWQF M V + T FC GC AIAD+GTSL+ GP + +T +N AIG T I
Sbjct: 240 LSVDRKAYWQFKMDSVKVG--DTEFCNNGCEAIADTGTSLIAGPVSEVTAINKAIGGTPI 297
Query: 828 VSQE 817
++ E
Sbjct: 298 MNGE 301
Score = 112 bits (280), Expect = 7e-024
Identities = 49/99 (49%), Positives = 67/99 (67%)
Frame = -1
Query: 546 VNQLCDRLPSPMGESAVDCAGLSSMPNVSFTIGGKQFDLAPEQYVLKVGEGEVAQCISGF 367
+N+ P GE VDC+ + +P +SF +GGK FDL YVL+V + C+SGF
Sbjct: 288 INKAIGGTPIMNGEYMVDCSLIPKLPKISFVLGGKSFDLEGADYVLRVAQMGKTICLSGF 347
Query: 366 TALDVPPPRGPLWILGDVFMGQFHTVFDYGNERIGFAEA 250
+D+PPP GPLWILGDVF+G+++T FD GN+R+GFA A
Sbjct: 348 MGIDIPPPNGPLWILGDVFIGKYYTEFDMGNDRVGFATA 386
>sp|Q9DEX3|CATD_CLUHA Cathepsin D OS=Clupea harengus GN=ctsd PE=1 SV=1
Length = 396
Score = 240 bits (612), Expect = 2e-062
Identities = 122/253 (48%), Positives = 158/253 (62%), Gaps = 6/253 (2%)
Frame = -1
Query: 1548 SLKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFS-LACYLHPXXXXXXXX 1372
+LKNYMDAQY+GEIG+GTP Q FTV+FDTGSSNLW+PS C F+ +AC LH
Sbjct: 67 TLKNYMDAQYYGEIGLGTPVQMFTVVFDTGSSNLWLPSIHCSFTDIACLLHHKYNGAKSS 126
Query: 1371 XXXXNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILG 1192
NG AIQYG+G++SG+ S+D T+GD+VV+ Q F EA K+PG+ F+ AKFDGILG
Sbjct: 127 TYVKNGTEFAIQYGSGSLSGYLSQDSCTIGDIVVEKQLFGEAIKQPGVAFIAAKFDGILG 186
Query: 1191 LGFQEISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADXXXXXXXXXXXXDPDHYVGEHT 1012
+ + ISV PV+ M+ Q +++ VFSF+ NRN D DP +Y G+
Sbjct: 187 MAYPRISVDGVPPVFDMMMSQKKVEQNVFSFYLNRNPDTEPGGELLLGGTDPKYYTGDFN 246
Query: 1011 YVPVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSLLVGPTTIITELNHAIGATG 832
YVPVT++ YWQ M + I Q T C GC AI D+GTSL+ GP + L AIGA
Sbjct: 247 YVPVTRQAYWQIHMDGMSIGSQLT-LCKDGCEAIVDTGTSLITGPPAEVRALQKAIGAIP 305
Query: 831 IVSQE----CKTV 805
++ E CK V
Sbjct: 306 LIQGEYMIDCKKV 318
Score = 109 bits (271), Expect = 8e-023
Identities = 46/92 (50%), Positives = 65/92 (70%)
Frame = -1
Query: 525 LPSPMGESAVDCAGLSSMPNVSFTIGGKQFDLAPEQYVLKVGEGEVAQCISGFTALDVPP 346
+P GE +DC + ++P +SF +GGK + L EQYVLK +G C+SG L++PP
Sbjct: 304 IPLIQGEYMIDCKKVPTLPTISFNVGGKTYSLTGEQYVLKESQGGKTICLSGLMGLEIPP 363
Query: 345 PRGPLWILGDVFMGQFHTVFDYGNERIGFAEA 250
P GPLWILGDVF+GQ++TVFD + R+GFA++
Sbjct: 364 PAGPLWILGDVFIGQYYTVFDRESNRVGFAKS 395
>sp|Q805F3|CATEA_XENLA Cathepsin E-A OS=Xenopus laevis GN=ctse-A PE=1 SV=1
Length = 397
Score = 228 bits (580), Expect = 1e-058
Identities = 113/237 (47%), Positives = 154/237 (64%), Gaps = 2/237 (0%)
Frame = -1
Query: 1545 LKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSLACYLHPXXXXXXXXXX 1366
L NYMD +YFGEI +GTPPQ FTVIFDTGSSNLWVPS C S AC H
Sbjct: 66 LINYMDVEYFGEISVGTPPQNFTVIFDTGSSNLWVPSVYC-ISQACAQHDRFQPQLSSTY 124
Query: 1365 XXNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILGLG 1186
NG ++QYGTG++SG D VTV ++V++Q+F E+ EPG TF+ A+FDGILGLG
Sbjct: 125 ESNGNNFSLQYGTGSLSGVIGIDAVTVEGILVQNQQFGESVSEPGSTFVDAEFDGILGLG 184
Query: 1185 FQEISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADXXXXXXXXXXXXDPDHYVGEHTYV 1006
+ I+VG+ PV+ NM+ Q L++ P+FS + +RN + D + G+ +V
Sbjct: 185 YPSIAVGDCTPVFDNMIAQNLVELPMFSVYMSRNPNSAVGGELVFGGFDASRFSGQLNWV 244
Query: 1005 PVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSLLVGPTTIITELNHAIGAT 835
PVT +GYWQ + +V I+G+ FC+GGC AI D+GTSL+ GP++ I +L + IGA+
Sbjct: 245 PVTNQGYWQIQLDNVQINGEVL-FCSGGCQAIVDTGTSLITGPSSDIVQLQNIIGAS 300
>sp|P16228|CATE_RAT Cathepsin E OS=Rattus norvegicus GN=Ctse PE=1 SV=3
Length = 398
Score = 215 bits (546), Expect = 1e-054
Identities = 111/250 (44%), Positives = 150/250 (60%), Gaps = 5/250 (2%)
Frame = -1
Query: 1545 LKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSLACYLHPXXXXXXXXXX 1366
L NY+D +YFG + IG+P Q FTVIFDTGSSNLWVPS C S AC HP
Sbjct: 72 LINYLDMEYFGTVSIGSPSQNFTVIFDTGSSNLWVPSVYC-TSPACKAHPVFHPSQSSTY 130
Query: 1365 XXNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILGLG 1186
G +IQYGTG+++G D V+V L V+ Q+F E+ KEPG TF+ A+FDGILGLG
Sbjct: 131 MEVGNHFSIQYGTGSLTGIIGADQVSVEGLTVEGQQFGESVKEPGQTFVNAEFDGILGLG 190
Query: 1185 FQEISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADXXXXXXXXXXXXDPDHYVGEHTYV 1006
+ ++VG PV+ NM+ Q L+ P+FS + + + DP H+ G ++
Sbjct: 191 YPSLAVGGVTPVFDNMMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNWI 250
Query: 1005 PVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSLLVGPTTIITELNHAIGAT--- 835
PVT++GYWQ + + + G T FC+ GC AI D+GTSL+ GP I +L AIGAT
Sbjct: 251 PVTKQGYWQIALDGIQV-GDTVMFCSEGCQAIVDTGTSLITGPPKKIKQLQEAIGATPMD 309
Query: 834 GIVSQECKTV 805
G + +C T+
Sbjct: 310 GEYAVDCATL 319
>sp|P70269|CATE_MOUSE Cathepsin E OS=Mus musculus GN=Ctse PE=1 SV=1
Length = 397
Score = 215 bits (545), Expect = 1e-054
Identities = 112/250 (44%), Positives = 147/250 (58%), Gaps = 5/250 (2%)
Frame = -1
Query: 1545 LKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSLACYLHPXXXXXXXXXX 1366
L NY+D +YFG I IGTPPQ FTVIFDTGSSNLWVPS C S AC HP
Sbjct: 71 LINYLDMEYFGTISIGTPPQNFTVIFDTGSSNLWVPSVYC-TSPACKAHPVFHPSQSDTY 129
Query: 1365 XXNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILGLG 1186
G +IQYGTG+++G D V+V L V Q+F E+ KEPG TF+ A+FDGILGLG
Sbjct: 130 TEVGNHFSIQYGTGSLTGIIGADQVSVEGLTVDGQQFGESVKEPGQTFVNAEFDGILGLG 189
Query: 1185 FQEISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADXXXXXXXXXXXXDPDHYVGEHTYV 1006
+ ++ G PV+ NM+ Q L+ P+FS + + + DP H+ G ++
Sbjct: 190 YPSLAAGGVTPVFDNMMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNWI 249
Query: 1005 PVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSLLVGPTTIITELNHAIGAT--- 835
PVT++ YWQ + + + G T FC+ GC AI D+GTSL+ GP I L AIGAT
Sbjct: 250 PVTKQAYWQIALDGIQV-GDTVMFCSEGCQAIVDTGTSLITGPPDKIKHLQEAIGATPID 308
Query: 834 GIVSQECKTV 805
G + +C T+
Sbjct: 309 GEYAVDCATL 318
>sp|P43159|CATE_RABIT Cathepsin E OS=Oryctolagus cuniculus GN=CTSE PE=2 SV=1
Length = 396
Score = 213 bits (542), Expect = 3e-054
Identities = 111/250 (44%), Positives = 149/250 (59%), Gaps = 5/250 (2%)
Frame = -1
Query: 1545 LKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSLACYLHPXXXXXXXXXX 1366
L NY+D +YFG I IG+PPQ FTVIFDT SSNLWVPS C S AC +HP
Sbjct: 70 LINYLDMEYFGTISIGSPPQNFTVIFDTVSSNLWVPSVYC-TSPACQMHPQFRPSQSNTY 128
Query: 1365 XXNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILGLG 1186
G P +I YGTG+++G D V+V L V Q+F E+ KEPG TF+ A+FDGILGLG
Sbjct: 129 SEVGTPFSIAYGTGSLTGIIGADQVSVQGLTVVGQQFGESVKEPGQTFVNAEFDGILGLG 188
Query: 1185 FQEISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADXXXXXXXXXXXXDPDHYVGEHTYV 1006
+ ++ G PV+ NM+ Q L+ P+FS + + N + D H+ G +V
Sbjct: 189 YPSLAAGGVTPVFDNMMAQNLVSLPMFSVYMSSNPEGGSGSELTFGGYDSSHFSGSLNWV 248
Query: 1005 PVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSLLVGPTTIITELNHAIGAT--- 835
PVT++GYWQ + ++ + G FC GC AI D+GTSL+ GP+ I +L AIGAT
Sbjct: 249 PVTKQGYWQIALDEIQVGGSPM-FCPEGCQAIVDTGTSLITGPSDKIIQLQAAIGATPMD 307
Query: 834 GIVSQECKTV 805
G + EC+ +
Sbjct: 308 GEYAVECENL 317
>sp|Q800A0|CATE_RANCA Cathepsin E OS=Rana catesbeiana GN=CTSE PE=1 SV=1
Length = 397
Score = 211 bits (536), Expect = 2e-053
Identities = 108/250 (43%), Positives = 146/250 (58%), Gaps = 5/250 (2%)
Frame = -1
Query: 1545 LKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSLACYLHPXXXXXXXXXX 1366
L NY+D +YFG+I IGTPPQ+FTVIFDTGSSNLWVPS C S AC H
Sbjct: 66 LMNYLDVEYFGQISIGTPPQQFTVIFDTGSSNLWVPSIYC-TSQACTKHNRYRPSESTTY 124
Query: 1365 XXNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILGLG 1186
NG+ IQYGTG ++G D VTV + V+ Q F E+ EPG TF + FDGILGL
Sbjct: 125 VSNGEAFFIQYGTGNLTGILGIDQVTVQGITVQSQTFAESVSEPGSTFQDSNFDGILGLA 184
Query: 1185 FQEISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADXXXXXXXXXXXXDPDHYVGEHTYV 1006
+ ++V N +PV+ NM+ Q L++ P+F + NR+ + D + G+ +V
Sbjct: 185 YPNLAVDNCIPVFDNMIAQNLVELPLFGVYMNRDPNSADGGELVLGGFDTSRFSGQLNWV 244
Query: 1005 PVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSLLVGPTTIITELNHAIGAT--- 835
P+T +GYWQ + + + GQ FC+ GC AI D+GTSL+ GP+ I +L + IG T
Sbjct: 245 PITVQGYWQIQVDSIQVAGQVI-FCSDGCQAIVDTGTSLITGPSGDIEQLQNYIGVTNTN 303
Query: 834 GIVSQECKTV 805
G C T+
Sbjct: 304 GEYGVSCSTL 313
>sp|Q01294|CARP_NEUCR Vacuolar protease A OS=Neurospora crassa GN=pep-4 PE=3
SV=2
Length = 396
Score = 205 bits (521), Expect = 8e-052
Identities = 108/239 (45%), Positives = 142/239 (59%), Gaps = 8/239 (3%)
Frame = -1
Query: 1551 VSLKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSLACYLHPXXXXXXXX 1372
V + N+M+AQYF EI IGTPPQ F V+ DTGSSNLWVPSS+C S+ACYLH
Sbjct: 75 VPITNFMNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSSQC-GSIACYLHNKYESSESS 133
Query: 1371 XXXXNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILG 1192
NG I+YG+G++SGF S+D +T+GD+ + DQ F EAT EPG+ F +FDGILG
Sbjct: 134 TYKKNGTSFKIEYGSGSLSGFVSQDRMTIGDITINDQLFAEATSEPGLAFAFGRFDGILG 193
Query: 1191 LGFQEISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADXXXXXXXXXXXXDPDHYVGEHT 1012
LG+ I+V P +Y MV+Q L+ EPVFSF+ AD + D Y G+ T
Sbjct: 194 LGYDRIAVNGITPPFYKMVEQKLVDEPVFSFYL---ADQDGESEVVFGGVNKDRYTGKIT 250
Query: 1011 YVPVTQKGYWQFDMGDVLIDGQTTGFC-AGGCAAIADSGTSLLVGPTTIITELNHAIGA 838
+P+ +K YW+ D + G F G I D+GTSL+ P+ + LN IGA
Sbjct: 251 TIPLRRKAYWEVDFDAI---GYGKDFAELEGHGVILDTGTSLIALPSQLAEMLNAQIGA 306
>sp|P07267|CARP_YEAST Saccharopepsin OS=Saccharomyces cerevisiae GN=PEP4 PE=1
SV=1
Length = 405
Score = 200 bits (506), Expect = 5e-050
Identities = 106/253 (41%), Positives = 149/253 (58%), Gaps = 8/253 (3%)
Frame = -1
Query: 1551 VSLKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSLACYLHPXXXXXXXX 1372
V L NY++AQY+ +I +GTPPQ F VI DTGSSNLWVPS++C SLAC+LH
Sbjct: 81 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 139
Query: 1371 XXXXNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILG 1192
NG AIQYGTG++ G+ S+D +++GDL + Q+F EAT EPG+TF KFDGILG
Sbjct: 140 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 199
Query: 1191 LGFQEISVGNAVPVWYNMVKQGLLKEPVFSFWF-NRNADXXXXXXXXXXXXDPDHYVGEH 1015
LG+ ISV VP +YN ++Q LL E F+F+ + + D D + G+
Sbjct: 200 LGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 259
Query: 1014 TYVPVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSLLVGPTTIITELNHAIGA- 838
T++PV +K YW+ + + + + G A D+GTSL+ P+ + +N IGA
Sbjct: 260 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 317
Query: 837 ---TGIVSQECKT 808
TG + +C T
Sbjct: 318 KGWTGQYTLDCNT 330
>sp|Q9GMY4|PEPC_SORUN Gastricsin OS=Sorex unguiculatus GN=PGC PE=2 SV=1
Length = 389
Score = 198 bits (501), Expect = 2e-049
Identities = 100/233 (42%), Positives = 135/233 (57%), Gaps = 1/233 (0%)
Frame = -1
Query: 1536 YMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSLACYLHPXXXXXXXXXXXXN 1357
Y+DA YFGEI IGTPPQ F V+FDTGSSNLWVPS C S AC H N
Sbjct: 68 YLDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTGHARFNPSKSSTYSTN 126
Query: 1356 GKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILGLGFQE 1177
G+ ++QYG+G+++GFF D +T+ ++ V QEF + EPG F+ A+FDGI+G+ +
Sbjct: 127 GQTFSLQYGSGSLTGFFGYDTMTLQNIKVPHQEFGLSQNEPGENFVYAQFDGIMGMAYPT 186
Query: 1176 ISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADXXXXXXXXXXXXDPDHYVGEHTYVPVT 997
+++G A M++ G L PVFSF+ + D Y G+ + PVT
Sbjct: 187 LAMGGATTALQGMLQAGALDSPVFSFYLSNQQSSKDGGAVVFGGVDNSLYTGQIFWTPVT 246
Query: 996 QKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSLLVGPTTIITELNHAIGA 838
Q+ YWQ + LI GQ TG+C+ GC AI D+GTSLL P ++ L A GA
Sbjct: 247 QELYWQIGVEQFLIGGQATGWCSQGCQAIVDTGTSLLTVPQQYLSALQQATGA 299
>sp|P85137|CARDF_CYNCA Cardosin-F (Fragments) OS=Cynara cardunculus PE=1 SV=1
Length = 281
Score = 197 bits (499), Expect = 3e-049
Identities = 99/168 (58%), Positives = 118/168 (70%), Gaps = 6/168 (3%)
Frame = -1
Query: 1335 YGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILGLGFQEISVGNAV 1156
Y + S + S+D VT+GDLVVK+Q+FIEAT+E FL FDGILGL FQ IS V
Sbjct: 53 YESSGSSTYKSQDSVTIGDLVVKEQDFIEATEEADNVFLNRLFDGILGLSFQTIS----V 108
Query: 1155 PVWYNMVKQGLLKEPVFSFWFNRNADXXXXXXXXXXXXDPDHYVGEHTYVPVTQKGYWQF 976
PVWYNM+ QGL+K FSFW NRN D DP+H+ G+HTYVPVT + YWQF
Sbjct: 109 PVWYNMLNQGLVKR--FSFWLNRNVDEEEGGELVFGGLDPNHFRGDHTYVPVTYQYYWQF 166
Query: 975 DMGDVLIDGQTTGFCAGGCAAIADSGTSLLVGPTTIITELNHAIGATG 832
+GDVLI ++TGFCA GC A ADSGTSLL GPT I+T++NHAIGA G
Sbjct: 167 GIGDVLIGDKSTGFCAPGCQAFADSGTSLLSGPTAIVTQINHAIGANG 214
>sp|P10977|CARPV_CANAL Vacuolar aspartic protease OS=Candida albicans GN=APR1
PE=3 SV=3
Length = 419
Score = 195 bits (493), Expect = 1e-048
Identities = 105/243 (43%), Positives = 142/243 (58%), Gaps = 14/243 (5%)
Frame = -1
Query: 1545 LKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSLACYLHPXXXXXXXXXX 1366
L NY++AQYF EI IGTP Q F VI DTGSSNLWVPS C SLAC+LH
Sbjct: 96 LTNYLNAQYFTEIQIGTPGQPFKVILDTGSSNLWVPSQDC-TSLACFLHAKYDHDASSTY 154
Query: 1365 XXNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILGLG 1186
NG +IQYG+G++ G+ S+D +T+GDLV+ Q+F EAT EPG+ F KFDGILGL
Sbjct: 155 KVNGSEFSIQYGSGSMEGYISQDVLTIGDLVIPGQDFAEATSEPGLAFAFGKFDGILGLA 214
Query: 1185 FQEISVGNAVPVWYNMVKQGLLKEPVFSFWF-NRNADXXXXXXXXXXXXDPDHYVGEHTY 1009
+ ISV + VP YN + QGLL++P F F+ + + D D + G+ T+
Sbjct: 215 YDTISVNHIVPPIYNAINQGLLEKPQFGFYLGSTDKDENDGGLATFGGYDASLFQGKITW 274
Query: 1008 VPVTQKGYWQ-----FDMGDVLIDGQTTGFCAGGCAAIADSGTSLLVGPTTIITELNHAI 844
+P+ +K YW+ +GD + TG A D+GTSL+ P+++ +N I
Sbjct: 275 LPIRRKAYWEVSFEGIGLGDEYAELHKTG-------AAIDTGTSLITLPSSLAEIINAKI 327
Query: 843 GAT 835
GAT
Sbjct: 328 GAT 330
>sp|P85138|CARDG_CYNCA Cardosin-G (Fragments) OS=Cynara cardunculus PE=1 SV=1
Length = 266
Score = 175 bits (443), Expect = 9e-043
Identities = 88/144 (61%), Positives = 102/144 (70%), Gaps = 6/144 (4%)
Frame = -1
Query: 1269 KDQEFIEATKEPGITFLVAKFDGILGLGFQEISVGNAVPVWYNMVKQGLLKEPVFSFWFN 1090
K+Q+FIEAT+E FL FDGILGL FQ IS VPVWYNMV QGL+K FSFW N
Sbjct: 61 KEQDFIEATEEADNVFLNRLFDGILGLSFQTIS----VPVWYNMVNQGLVKR--FSFWLN 114
Query: 1089 RNADXXXXXXXXXXXXDPDHYVGEHTYVPVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAI 910
RN D DP+H+ G+HTYVPVT + YWQF +GDVLI ++TGFCA GC A
Sbjct: 115 RNVDEEEGGELVFGGLDPNHFRGDHTYVPVTYQYYWQFGIGDVLIGDKSTGFCAPGCQAF 174
Query: 909 ADSGTSLLVGPTTIITELNHAIGA 838
ADSGTSLL GPT I+T++NHAIGA
Sbjct: 175 ADSGTSLLSGPTAIVTQINHAIGA 198
>sp|P85139|CARDH_CYNCA Cardosin-H (Fragments) OS=Cynara cardunculus PE=1 SV=1
Length = 265
Score = 174 bits (441), Expect = 2e-042
Identities = 88/149 (59%), Positives = 102/149 (68%), Gaps = 6/149 (4%)
Frame = -1
Query: 1284 GDLVVKDQEFIEATKEPGITFLVAKFDGILGLGFQEISVGNAVPVWYNMVKQGLLKEPVF 1105
G K+Q+FIEAT E FL FDGILGL FQ IS VPVWYNM+ QGL+K F
Sbjct: 56 GSSTYKEQDFIEATDETDNVFLHRLFDGILGLSFQTIS----VPVWYNMLNQGLVKR--F 109
Query: 1104 SFWFNRNADXXXXXXXXXXXXDPDHYVGEHTYVPVTQKGYWQFDMGDVLIDGQTTGFCAG 925
SFW NRN D DP+H+ G+HTYVPVT + YWQF +GDVLI ++TGFCA
Sbjct: 110 SFWLNRNVDEEEGGELVFGGLDPNHFRGDHTYVPVTYQYYWQFGIGDVLIGDKSTGFCAP 169
Query: 924 GCAAIADSGTSLLVGPTTIITELNHAIGA 838
GC A ADSGTSLL GPT I+T++NHAIGA
Sbjct: 170 GCQAFADSGTSLLSGPTAIVTQINHAIGA 198
>sp|P85136|CARDE_CYNCA Cardosin-E (Fragments) OS=Cynara cardunculus PE=1 SV=1
Length = 224
Score = 122 bits (305), Expect = 9e-027
Identities = 55/90 (61%), Positives = 65/90 (72%)
Frame = -1
Query: 1107 FSFWFNRNADXXXXXXXXXXXXDPDHYVGEHTYVPVTQKGYWQFDMGDVLIDGQTTGFCA 928
FSFW NRN D DP+H+ G+HTYVPVT + YWQF +GDVLI ++TGFCA
Sbjct: 67 FSFWLNRNVDEEEGGELVFGGLDPNHFRGDHTYVPVTYQYYWQFGIGDVLIGDKSTGFCA 126
Query: 927 GGCAAIADSGTSLLVGPTTIITELNHAIGA 838
GC A ADSGTSLL GPT I+T++NHAIGA
Sbjct: 127 PGCQAFADSGTSLLSGPTAIVTQINHAIGA 156
>sp|P80209|CATD_BOVIN Cathepsin D OS=Bos taurus GN=CTSD PE=1 SV=2
Length = 390
Score = 112 bits (278), Expect = 1e-023
Identities = 48/93 (51%), Positives = 65/93 (69%)
Frame = -1
Query: 525 LPSPMGESAVDCAGLSSMPNVSFTIGGKQFDLAPEQYVLKVGEGEVAQCISGFTALDVPP 346
+P GE + C +SS+P V+ +GGK + L+PE Y LKV + E C+SGF +D+PP
Sbjct: 296 VPLIQGEYMIPCEKVSSLPEVTVKLGGKDYALSPEDYALKVSQAETTVCLSGFMGMDIPP 355
Query: 345 PRGPLWILGDVFMGQFHTVFDYGNERIGFAEAA 247
P GPLWILGDVF+G+++TVFD R+G AEAA
Sbjct: 356 PGGPLWILGDVFIGRYYTVFDRDQNRVGLAEAA 388
>sp|P18242|CATD_MOUSE Cathepsin D OS=Mus musculus GN=Ctsd PE=1 SV=1
Length = 410
Score = 111 bits (277), Expect = 2e-023
Identities = 47/92 (51%), Positives = 65/92 (70%)
Frame = -1
Query: 525 LPSPMGESAVDCAGLSSMPNVSFTIGGKQFDLAPEQYVLKVGEGEVAQCISGFTALDVPP 346
+P GE + C +SS+P V +GGK ++L P++Y+LKV +G C+SGF +D+PP
Sbjct: 316 VPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIPP 375
Query: 345 PRGPLWILGDVFMGQFHTVFDYGNERIGFAEA 250
P GPLWILGDVF+G ++TVFD N R+GFA A
Sbjct: 376 PSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 407
>sp|P07339|CATD_HUMAN Cathepsin D OS=Homo sapiens GN=CTSD PE=1 SV=1
Length = 412
Score = 111 bits (276), Expect = 2e-023
Identities = 47/93 (50%), Positives = 66/93 (70%)
Frame = -1
Query: 525 LPSPMGESAVDCAGLSSMPNVSFTIGGKQFDLAPEQYVLKVGEGEVAQCISGFTALDVPP 346
+P GE + C +S++P ++ +GGK + L+PE Y LKV + C+SGF +D+PP
Sbjct: 318 VPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIPP 377
Query: 345 PRGPLWILGDVFMGQFHTVFDYGNERIGFAEAA 247
P GPLWILGDVF+G+++TVFD N R+GFAEAA
Sbjct: 378 PSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 410
>sp|P24268|CATD_RAT Cathepsin D OS=Rattus norvegicus GN=Ctsd PE=1 SV=1
Length = 407
Score = 108 bits (268), Expect = 2e-022
Identities = 46/93 (49%), Positives = 67/93 (72%)
Frame = -1
Query: 525 LPSPMGESAVDCAGLSSMPNVSFTIGGKQFDLAPEQYVLKVGEGEVAQCISGFTALDVPP 346
+P GE + C +SS+P ++F +GG+ ++L PE+Y+LKV + C+SGF +D+PP
Sbjct: 313 VPLIQGEYMIPCEKVSSLPIITFKLGGQNYELHPEKYILKVSQAGKTICLSGFMGMDIPP 372
Query: 345 PRGPLWILGDVFMGQFHTVFDYGNERIGFAEAA 247
P GPLWILGDVF+G ++TVFD R+GFA+AA
Sbjct: 373 PSGPLWILGDVFIGCYYTVFDREYNRVGFAKAA 405
Database: UniProt/Swiss-Prot
Posted date: Thu Apr 23 14:22:33 2009
Number of letters in database: 163,773,382
Number of sequences in database: 462,764
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,073,084,167
Number of Sequences: 462764
Number of Extensions: 6073084167
Number of Successful Extensions: 56121479
Number of sequences better than 0.0: 0