BLASTX 7.6.2
Query= RU21475 /QuerySize=1074
(1073 letters)
Database: UniProt/TrEMBL;
7,695,149 sequences; 2,506,224,640 total letters
Score E
Sequences producing significant alignments: (bits) Value
tr|B9MY43|B9MY43_POPTR Predicted protein OS=Populus trichocarpa ... 431 8e-119
tr|B9T542|B9T542_RICCO O-sialoglycoprotein endopeptidase, putati... 430 1e-118
tr|O49653|O49653_ARATH Glycoprotein endopeptidase - like protein... 419 2e-115
tr|A7QM84|A7QM84_VITVI Chromosome chr5 scaffold_124, whole genom... 415 3e-114
tr|Q2PYX9|Q2PYX9_SOLTU Glycoprotein endopeptidase-like protein O... 403 2e-110
tr|A9NN15|A9NN15_PICSI Putative uncharacterized protein OS=Picea... 391 9e-107
tr|A2Y190|A2Y190_ORYSI Putative uncharacterized protein OS=Oryza... 374 6e-102
tr|Q6L4N8|Q6L4N8_ORYSJ Os05g0194600 protein OS=Oryza sativa subs... 374 6e-102
tr|B4FYG1|B4FYG1_MAIZE Putative uncharacterized protein OS=Zea m... 374 8e-102
tr|B6TBH3|B6TBH3_MAIZE O-sialoglycoprotein endopeptidase OS=Zea ... 374 8e-102
tr|A9SUY4|A9SUY4_PHYPA Predicted protein OS=Physcomitrella paten... 361 1e-097
tr|B3RVJ6|B3RVJ6_TRIAD Putative uncharacterized protein OS=Trich... 344 7e-093
tr|B6PN74|B6PN74_BRAFL Putative uncharacterized protein OS=Branc... 337 1e-090
tr|B6LCL9|B6LCL9_BRAFL Putative uncharacterized protein OS=Branc... 334 1e-089
tr|Q0V9I9|Q0V9I9_XENTR O-sialoglycoprotein endopeptidase OS=Xeno... 332 5e-089
tr|B7PKV7|B7PKV7_IXOSC O-sialoglycoprotein endopeptidase, putati... 330 1e-088
tr|C0HD86|C0HD86_SALSA Probable O-sialoglycoprotein endopeptidas... 329 2e-088
tr|B4J0U0|B4J0U0_DROGR GH15886 OS=Drosophila grimshawi GN=GH1588... 324 1e-086
tr|Q561S3|Q561S3_DANRE O-sialoglycoprotein endopeptidase OS=Dani... 322 3e-086
tr|Q5RHZ6|Q5RHZ6_DANRE Novel protein similar to vertebrate O-sia... 322 3e-086
tr|Q4SY05|Q4SY05_TETNG Chromosome undetermined SCAF12247, whole ... 321 6e-086
tr|B4LI30|B4LI30_DROVI GJ12789 OS=Drosophila virilis GN=GJ12789 ... 320 2e-085
tr|B0BMW7|B0BMW7_RAT O-sialoglycoprotein endopeptidase, isoform ... 319 2e-085
tr|B3M5K1|B3M5K1_DROAN GF10115 OS=Drosophila ananassae GN=GF1011... 319 4e-085
tr|B4L9L3|B4L9L3_DROMO GI16537 OS=Drosophila mojavensis GN=GI165... 317 9e-085
tr|Q582L2|Q582L2_9TRYP O-sialoglycoprotein endopeptidase, putati... 315 4e-084
tr|A4I6A3|A4I6A3_LEIIN O-sialoglycoprotein endopeptidase, putati... 315 5e-084
tr|Q4Q6Q4|Q4Q6Q4_LEIMA O-sialoglycoprotein endopeptidase, putati... 315 6e-084
tr|Q58EU1|Q58EU1_MOUSE O-sialoglycoprotein endopeptidase OS=Mus ... 315 6e-084
tr|B3NIB9|B3NIB9_DROER GG15998 OS=Drosophila erecta GN=GG15998 P... 314 1e-083
>tr|B9MY43|B9MY43_POPTR Predicted protein OS=Populus trichocarpa
GN=POPTRDRAFT_1111823 PE=4 SV=1
Length = 360
Score = 431 bits (1106), Expect = 8e-119
Identities = 209/232 (90%), Positives = 220/232 (94%)
Frame = -3
Query: 696 MKKQIALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLI 517
MK+ IALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQ++LPL+
Sbjct: 1 MKRMIALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQHVLPLV 60
Query: 516 KSALETAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMG 337
KSALETAK+TP EIDCLCYTKGPGMGAPLQV+A+V+RVLSQ WKKPIVAVNHCVAHIEMG
Sbjct: 61 KSALETAKITPDEIDCLCYTKGPGMGAPLQVSAVVIRVLSQLWKKPIVAVNHCVAHIEMG 120
Query: 336 RIVTGADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPG 157
RIVTGADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVL+LSNDPAPG
Sbjct: 121 RIVTGADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLQLSNDPAPG 180
Query: 156 YNIEQLAKKGEQFIDIPYAVKGMDVSFSGILSYIEATAVEKLKNRRVHPCRL 1
YNIEQLAKKGEQFID+PY VKGMDVSFSGILS+IEAT EKLKN P L
Sbjct: 181 YNIEQLAKKGEQFIDLPYVVKGMDVSFSGILSFIEATTEEKLKNNECTPADL 232
>tr|B9T542|B9T542_RICCO O-sialoglycoprotein endopeptidase, putative OS=Ricinus
communis GN=RCOM_0452810 PE=4 SV=1
Length = 346
Score = 430 bits (1104), Expect = 1e-118
Identities = 209/232 (90%), Positives = 221/232 (95%)
Frame = -3
Query: 696 MKKQIALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLI 517
MKK IALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHL+++LPL+
Sbjct: 1 MKKMIALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLEHVLPLV 60
Query: 516 KSALETAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMG 337
KSALETA+VTP +IDCLCYTKGPGMGAPLQV+AIV+RVLSQ WKKPI+AVNHCVAHIEMG
Sbjct: 61 KSALETAQVTPDDIDCLCYTKGPGMGAPLQVSAIVIRVLSQLWKKPIIAVNHCVAHIEMG 120
Query: 336 RIVTGADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPG 157
RIVTGADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVL+LSNDPAPG
Sbjct: 121 RIVTGADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLQLSNDPAPG 180
Query: 156 YNIEQLAKKGEQFIDIPYAVKGMDVSFSGILSYIEATAVEKLKNRRVHPCRL 1
YNIEQLAKKGEQFID+PY VKGMDVSFSGILS+IEATA EKLKN P L
Sbjct: 181 YNIEQLAKKGEQFIDLPYVVKGMDVSFSGILSFIEATAEEKLKNNECTPADL 232
>tr|O49653|O49653_ARATH Glycoprotein endopeptidase - like protein OS=Arabidopsis
thaliana GN=T12H17.110 PE=2 SV=1
Length = 353
Score = 419 bits (1077), Expect = 2e-115
Identities = 202/231 (87%), Positives = 217/231 (93%)
Frame = -3
Query: 693 KKQIALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIK 514
KK IA+GFEGSANKIGVG+VTLDGTIL+NPRHTYITPPG GFLPRETA HHL ++LPL+K
Sbjct: 3 KKMIAIGFEGSANKIGVGIVTLDGTILANPRHTYITPPGHGFLPRETAHHHLDHVLPLVK 62
Query: 513 SALETAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGR 334
SALET++VTP+EIDC+CYTKGPGMGAPLQV+AIVVRVLSQ WKKPIVAVNHCVAHIEMGR
Sbjct: 63 SALETSQVTPEEIDCICYTKGPGMGAPLQVSAIVVRVLSQLWKKPIVAVNHCVAHIEMGR 122
Query: 333 IVTGADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGY 154
+VTGADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVL+LSNDP+PGY
Sbjct: 123 VVTGADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLKLSNDPSPGY 182
Query: 153 NIEQLAKKGEQFIDIPYAVKGMDVSFSGILSYIEATAVEKLKNRRVHPCRL 1
NIEQLAKKGE FID+PYAVKGMDVSFSGILSYIE TA EKLKN P L
Sbjct: 183 NIEQLAKKGENFIDLPYAVKGMDVSFSGILSYIETTAEEKLKNNECTPADL 233
>tr|A7QM84|A7QM84_VITVI Chromosome chr5 scaffold_124, whole genome shotgun
sequence OS=Vitis vinifera GN=GSVIVT00001958001 PE=4 SV=1
Length = 353
Score = 415 bits (1066), Expect = 3e-114
Identities = 203/232 (87%), Positives = 216/232 (93%)
Frame = -3
Query: 696 MKKQIALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLI 517
MK IALGFEGSANKIG+GVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHL ++LPL+
Sbjct: 1 MKNLIALGFEGSANKIGIGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLNHVLPLV 60
Query: 516 KSALETAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMG 337
+SAL+ A V+P +IDCLCYTKGPGMGAPLQV+AIVVRVLSQ WKKPIVAVNHCVAHIEMG
Sbjct: 61 RSALDEAGVSPAQIDCLCYTKGPGMGAPLQVSAIVVRVLSQLWKKPIVAVNHCVAHIEMG 120
Query: 336 RIVTGADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPG 157
R+VTGA DPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVL LSNDP+PG
Sbjct: 121 RVVTGAVDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLTLSNDPSPG 180
Query: 156 YNIEQLAKKGEQFIDIPYAVKGMDVSFSGILSYIEATAVEKLKNRRVHPCRL 1
YNIEQLAKKGEQFIDIPY VKGMDVSFSG+LSYIEATAVEKL+N P L
Sbjct: 181 YNIEQLAKKGEQFIDIPYVVKGMDVSFSGLLSYIEATAVEKLQNNECTPADL 232
>tr|Q2PYX9|Q2PYX9_SOLTU Glycoprotein endopeptidase-like protein OS=Solanum
tuberosum PE=2 SV=1
Length = 346
Score = 403 bits (1034), Expect = 2e-110
Identities = 198/231 (85%), Positives = 211/231 (91%)
Frame = -3
Query: 693 KKQIALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIK 514
KK I+L FE +A KIGVGVV +DGTILSNPRHTYITPPGQGFLPRETAQHH Q+ILPL+K
Sbjct: 4 KKLISLWFESAAKKIGVGVVAIDGTILSNPRHTYITPPGQGFLPRETAQHHHQHILPLVK 63
Query: 513 SALETAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGR 334
SALETA VTP EIDC+CYTKGPGMGAPLQV+A+VVRVLSQ WKKPIV VNHCVAHIEMGR
Sbjct: 64 SALETAGVTPDEIDCICYTKGPGMGAPLQVSAVVVRVLSQLWKKPIVGVNHCVAHIEMGR 123
Query: 333 IVTGADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGY 154
IVTGA DPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVL LSNDP+PGY
Sbjct: 124 IVTGAVDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLTLSNDPSPGY 183
Query: 153 NIEQLAKKGEQFIDIPYAVKGMDVSFSGILSYIEATAVEKLKNRRVHPCRL 1
NIEQLAKKGE+FI++PY VKGMDVSFSGILS+IEATA EKLKN P L
Sbjct: 184 NIEQLAKKGEKFIELPYVVKGMDVSFSGILSFIEATAEEKLKNNECSPADL 234
>tr|A9NN15|A9NN15_PICSI Putative uncharacterized protein OS=Picea sitchensis
PE=2 SV=1
Length = 360
Score = 391 bits (1002), Expect = 9e-107
Identities = 186/228 (81%), Positives = 205/228 (89%)
Frame = -3
Query: 684 IALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIKSAL 505
IA+GFEGSANKI VG+V LDGTILSNPRHTYITPPG GFLPRETA HHLQ++LPL++SAL
Sbjct: 2 IAIGFEGSANKIAVGIVQLDGTILSNPRHTYITPPGHGFLPRETAIHHLQHVLPLVRSAL 61
Query: 504 ETAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGRIVT 325
+ A + P EIDCLCYTKGPGMGAPLQV+A+VVR+LSQ WKKPIV VNHCVAHIEMGR+VT
Sbjct: 62 KEANIQPHEIDCLCYTKGPGMGAPLQVSAVVVRMLSQLWKKPIVGVNHCVAHIEMGRVVT 121
Query: 324 GADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGYNIE 145
A DPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRF RVL++SNDP+PGYNIE
Sbjct: 122 AAHDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFGRVLKISNDPSPGYNIE 181
Query: 144 QLAKKGEQFIDIPYAVKGMDVSFSGILSYIEATAVEKLKNRRVHPCRL 1
QLAKKG QF+++PY VKGMDVSFSGILSYIEATA EKL+ P L
Sbjct: 182 QLAKKGSQFVELPYVVKGMDVSFSGILSYIEATAAEKLETNECTPADL 229
>tr|A2Y190|A2Y190_ORYSI Putative uncharacterized protein OS=Oryza sativa subsp.
indica GN=OsI_18771 PE=4 SV=1
Length = 380
Score = 374 bits (960), Expect = 6e-102
Identities = 180/228 (78%), Positives = 199/228 (87%)
Frame = -3
Query: 684 IALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIKSAL 505
+ALG E SANKIG+GVV+L G ILSNPRHTY+TPPG GFLPRETA HHL ++LPL+++AL
Sbjct: 15 LALGLESSANKIGIGVVSLSGEILSNPRHTYVTPPGHGFLPRETAHHHLAHLLPLLRAAL 74
Query: 504 ETAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGRIVT 325
A VTP ++ C+CYTKGPGMGAPLQVAA R LS W KP+V VNHCVAH+EMGR VT
Sbjct: 75 GEAGVTPADLACVCYTKGPGMGAPLQVAAAAARALSLLWGKPLVGVNHCVAHVEMGRAVT 134
Query: 324 GADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGYNIE 145
GA DPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDP+PGYNIE
Sbjct: 135 GAVDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPSPGYNIE 194
Query: 144 QLAKKGEQFIDIPYAVKGMDVSFSGILSYIEATAVEKLKNRRVHPCRL 1
QLAKKGE+FID+PY VKGMDVSFSGILS+IEATA+EKLKN P L
Sbjct: 195 QLAKKGEKFIDLPYVVKGMDVSFSGILSFIEATAIEKLKNNECTPADL 242
>tr|Q6L4N8|Q6L4N8_ORYSJ Os05g0194600 protein OS=Oryza sativa subsp. japonica
GN=P0473H02.4 PE=4 SV=1
Length = 380
Score = 374 bits (960), Expect = 6e-102
Identities = 180/228 (78%), Positives = 199/228 (87%)
Frame = -3
Query: 684 IALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIKSAL 505
+ALG E SANKIG+GVV+L G ILSNPRHTY+TPPG GFLPRETA HHL ++LPL+++AL
Sbjct: 15 LALGLESSANKIGIGVVSLSGEILSNPRHTYVTPPGHGFLPRETAHHHLAHLLPLLRAAL 74
Query: 504 ETAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGRIVT 325
A VTP ++ C+CYTKGPGMGAPLQVAA R LS W KP+V VNHCVAH+EMGR VT
Sbjct: 75 GEAGVTPADLACVCYTKGPGMGAPLQVAAAAARALSLLWGKPLVGVNHCVAHVEMGRAVT 134
Query: 324 GADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGYNIE 145
GA DPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDP+PGYNIE
Sbjct: 135 GAVDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPSPGYNIE 194
Query: 144 QLAKKGEQFIDIPYAVKGMDVSFSGILSYIEATAVEKLKNRRVHPCRL 1
QLAKKGE+FID+PY VKGMDVSFSGILS+IEATA+EKLKN P L
Sbjct: 195 QLAKKGEKFIDLPYVVKGMDVSFSGILSFIEATAIEKLKNNECTPADL 242
>tr|B4FYG1|B4FYG1_MAIZE Putative uncharacterized protein OS=Zea mays PE=2 SV=1
Length = 381
Score = 374 bits (959), Expect = 8e-102
Identities = 179/228 (78%), Positives = 199/228 (87%)
Frame = -3
Query: 684 IALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIKSAL 505
+ALG E SANKIG+GVV+L G ILSNPRHTY+TPPG GFLPRETAQHHL ++LPL+++AL
Sbjct: 16 LALGLESSANKIGIGVVSLSGDILSNPRHTYVTPPGHGFLPRETAQHHLAHLLPLLRAAL 75
Query: 504 ETAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGRIVT 325
+ V P ++ C+CYTKGPGMG PLQVAA R LS W+KP+VAVNHCVAHIEMGR VT
Sbjct: 76 AESGVAPADLACVCYTKGPGMGGPLQVAAAAARALSLLWRKPLVAVNHCVAHIEMGRAVT 135
Query: 324 GADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGYNIE 145
GA DPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDP+PGYNIE
Sbjct: 136 GAVDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPSPGYNIE 195
Query: 144 QLAKKGEQFIDIPYAVKGMDVSFSGILSYIEATAVEKLKNRRVHPCRL 1
QLAKKGE+FID+PY VKGMDVSFSGILS+IEA A+EKLKN P L
Sbjct: 196 QLAKKGEKFIDVPYVVKGMDVSFSGILSFIEAAAIEKLKNNECTPADL 243
>tr|B6TBH3|B6TBH3_MAIZE O-sialoglycoprotein endopeptidase OS=Zea mays PE=2 SV=1
Length = 381
Score = 374 bits (959), Expect = 8e-102
Identities = 179/228 (78%), Positives = 199/228 (87%)
Frame = -3
Query: 684 IALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIKSAL 505
+ALG E SANKIG+GVV+L G ILSNPRHTY+TPPG GFLPRETAQHHL ++LPL+++AL
Sbjct: 16 LALGLESSANKIGIGVVSLSGDILSNPRHTYVTPPGHGFLPRETAQHHLAHLLPLLRAAL 75
Query: 504 ETAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGRIVT 325
+ V P ++ C+CYTKGPGMG PLQVAA R LS W+KP+VAVNHCVAHIEMGR VT
Sbjct: 76 AESGVAPADLACVCYTKGPGMGGPLQVAAAAARALSLLWRKPLVAVNHCVAHIEMGRAVT 135
Query: 324 GADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGYNIE 145
GA DPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDP+PGYNIE
Sbjct: 136 GAVDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPSPGYNIE 195
Query: 144 QLAKKGEQFIDIPYAVKGMDVSFSGILSYIEATAVEKLKNRRVHPCRL 1
QLAKKGE+FID+PY VKGMDVSFSGILS+IEA A+EKLKN P L
Sbjct: 196 QLAKKGEKFIDVPYVVKGMDVSFSGILSFIEAAAIEKLKNNECTPADL 243
>tr|A9SUY4|A9SUY4_PHYPA Predicted protein OS=Physcomitrella patens subsp. patens
GN=PHYPADRAFT_215978 PE=4 SV=1
Length = 339
Score = 361 bits (924), Expect = 1e-097
Identities = 169/228 (74%), Positives = 196/228 (85%)
Frame = -3
Query: 684 IALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIKSAL 505
IALGFE SANKIGVG+V DG IL+NPRHTYITPPG GFLPR TA+HH ++L L+ +AL
Sbjct: 2 IALGFESSANKIGVGIVDADGNILANPRHTYITPPGHGFLPRHTAEHHHAHVLGLVHAAL 61
Query: 504 ETAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGRIVT 325
+ AK+TP IDCL YTKGPGMGAPLQV+AIVVR+LSQ W+KPIV VNHCV HIEMGR+VT
Sbjct: 62 KEAKLTPASIDCLTYTKGPGMGAPLQVSAIVVRILSQLWRKPIVGVNHCVGHIEMGRVVT 121
Query: 324 GADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGYNIE 145
GA DPVVLYVSGGNTQVIAYSEGRYRIFGET+DIAVGNCLDRFAR L++SNDP+PGYNIE
Sbjct: 122 GAQDPVVLYVSGGNTQVIAYSEGRYRIFGETVDIAVGNCLDRFARCLKISNDPSPGYNIE 181
Query: 144 QLAKKGEQFIDIPYAVKGMDVSFSGILSYIEATAVEKLKNRRVHPCRL 1
QLAKKG++ +++PY VKGMDVSFSG+LS++E A L + + P L
Sbjct: 182 QLAKKGQKLVELPYVVKGMDVSFSGLLSFVEELAARTLNDNEITPADL 229
>tr|B3RVJ6|B3RVJ6_TRIAD Putative uncharacterized protein OS=Trichoplax adhaerens
GN=TRIADDRAFT_23779 PE=4 SV=1
Length = 336
Score = 344 bits (882), Expect = 7e-093
Identities = 163/224 (72%), Positives = 191/224 (85%), Gaps = 1/224 (0%)
Frame = -3
Query: 681 ALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIKSALE 502
A+GFEGSANK+G+G++ DG +LSN RHTYITPPGQGF PR+TA+HH +IL +++ AL+
Sbjct: 4 AIGFEGSANKLGIGIIR-DGKVLSNVRHTYITPPGQGFQPRDTAKHHRDHILSVLRKALD 62
Query: 501 TAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGRIVTG 322
A VTP EIDC+CYTKGPGMGAPL AIV R ++Q W KPIVAVNHC+AHIEMGR+VTG
Sbjct: 63 NADVTPDEIDCVCYTKGPGMGAPLVAVAIVARTVAQLWNKPIVAVNHCIAHIEMGRLVTG 122
Query: 321 ADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGYNIEQ 142
AD+P VLYVSGGNTQVIAY RYRIFGETIDIAVGNCLDRFARVL+LSNDP+PGYNIEQ
Sbjct: 123 ADNPTVLYVSGGNTQVIAYLMNRYRIFGETIDIAVGNCLDRFARVLKLSNDPSPGYNIEQ 182
Query: 141 LAKKGEQFIDIPYAVKGMDVSFSGILSYIEATAVEKLKNRRVHP 10
+AK+G++FI++PY VKGMDVSFSGILSYIE A +KL P
Sbjct: 183 MAKRGKKFIELPYTVKGMDVSFSGILSYIEDIAQKKLDGGECTP 226
>tr|B6PN74|B6PN74_BRAFL Putative uncharacterized protein OS=Branchiostoma
floridae GN=BRAFLDRAFT_288806 PE=4 SV=1
Length = 350
Score = 337 bits (862), Expect = 1e-090
Identities = 160/223 (71%), Positives = 189/223 (84%), Gaps = 1/223 (0%)
Frame = -3
Query: 678 LGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIKSALET 499
+GFEGSANK+GVG++ DG +LSNPRHTYITPPGQGFLPR+TA+HH +IL +++ AL+
Sbjct: 19 IGFEGSANKLGVGIIR-DGEVLSNPRHTYITPPGQGFLPRDTAKHHQAHILDVLQQALDI 77
Query: 498 AKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGRIVTGA 319
AKV PQ+IDC+ YTKGPGMGAPL A+V R ++Q W KP++ VNHC+ HIEMGR VTGA
Sbjct: 78 AKVKPQDIDCVAYTKGPGMGAPLVSTAVVARTVAQLWNKPLLGVNHCIGHIEMGRRVTGA 137
Query: 318 DDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGYNIEQL 139
+PVVLYVSGGNTQVIAY RYRIFGETIDIAVGNCLDRFARVL++SNDP+PGYNIEQ+
Sbjct: 138 VNPVVLYVSGGNTQVIAYQLKRYRIFGETIDIAVGNCLDRFARVLKISNDPSPGYNIEQM 197
Query: 138 AKKGEQFIDIPYAVKGMDVSFSGILSYIEATAVEKLKNRRVHP 10
AKKGE+ ID+PY VKGMDVSFSGILSYIE A L +R+ P
Sbjct: 198 AKKGEKLIDLPYGVKGMDVSFSGILSYIEDAAQTLLDSRQATP 240
>tr|B6LCL9|B6LCL9_BRAFL Putative uncharacterized protein OS=Branchiostoma
floridae GN=BRAFLDRAFT_118952 PE=4 SV=1
Length = 350
Score = 334 bits (854), Expect = 1e-089
Identities = 158/223 (70%), Positives = 189/223 (84%), Gaps = 1/223 (0%)
Frame = -3
Query: 678 LGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIKSALET 499
+GFEGSANK+GVG++ DG +LSNPRHTYITPPGQGFLPR+TA+HH +IL +++ AL+
Sbjct: 19 IGFEGSANKLGVGIIR-DGEVLSNPRHTYITPPGQGFLPRDTAKHHQAHILDVLQQALDI 77
Query: 498 AKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGRIVTGA 319
AKV PQ+IDC+ YTKGPGMGAPL A+V R ++Q W KP++ VNHC+ HIEMGR VTGA
Sbjct: 78 AKVKPQDIDCVAYTKGPGMGAPLVSTAVVARTVAQLWNKPLLGVNHCIGHIEMGRRVTGA 137
Query: 318 DDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGYNIEQL 139
+PVVLYVSGGNTQVIAY RYRIFGETIDIAVGNCLDRFARVL++SNDP+PGYNIEQ+
Sbjct: 138 VNPVVLYVSGGNTQVIAYQLKRYRIFGETIDIAVGNCLDRFARVLKISNDPSPGYNIEQM 197
Query: 138 AKKGEQFIDIPYAVKGMDVSFSGILSYIEATAVEKLKNRRVHP 10
AKKG+Q ID+P+ VKGMDVSFSGILSYIE A L +++ P
Sbjct: 198 AKKGKQLIDLPHGVKGMDVSFSGILSYIEDAAQTLLDSKQATP 240
>tr|Q0V9I9|Q0V9I9_XENTR O-sialoglycoprotein endopeptidase OS=Xenopus tropicalis
GN=osgep PE=2 SV=1
Length = 335
Score = 332 bits (849), Expect = 5e-089
Identities = 155/225 (68%), Positives = 190/225 (84%), Gaps = 1/225 (0%)
Frame = -3
Query: 684 IALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIKSAL 505
I +GFEGSANKIGVG++ DG +LSNPR TYITPPGQGF+P +TA+HH IL +++ AL
Sbjct: 3 IVVGFEGSANKIGVGIIQ-DGKVLSNPRRTYITPPGQGFMPSDTARHHRSCILDVLQEAL 61
Query: 504 ETAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGRIVT 325
E AK+ PQ++DC+ YTKGPGMGAPL AIV R ++Q WKKP++ VNHC+ HIEMGR++T
Sbjct: 62 EEAKIKPQDVDCVAYTKGPGMGAPLLSVAIVARTVAQLWKKPLLGVNHCIGHIEMGRLIT 121
Query: 324 GADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGYNIE 145
GA++P VLYVSGGNTQVIAYSE YRIFGETIDIAVGNCLDRFARVL++SNDP+PGYNIE
Sbjct: 122 GAENPSVLYVSGGNTQVIAYSERCYRIFGETIDIAVGNCLDRFARVLKISNDPSPGYNIE 181
Query: 144 QLAKKGEQFIDIPYAVKGMDVSFSGILSYIEATAVEKLKNRRVHP 10
Q+AKKG++F+++PY VKGMDVSFSGILSYIE + + L + P
Sbjct: 182 QMAKKGKKFVELPYTVKGMDVSFSGILSYIEDMSHKMLSSGECTP 226
>tr|B7PKV7|B7PKV7_IXOSC O-sialoglycoprotein endopeptidase, putative OS=Ixodes
scapularis GN=IscW_ISCW006830 PE=4 SV=1
Length = 318
Score = 330 bits (845), Expect = 1e-088
Identities = 153/215 (71%), Positives = 186/215 (86%), Gaps = 1/215 (0%)
Frame = -3
Query: 684 IALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIKSAL 505
+A+GFEGSANK+GVG+V DG +LSNPR TYITPPG+GFLPR+TA HH ++L +++ +L
Sbjct: 3 VAIGFEGSANKLGVGIVR-DGQVLSNPRVTYITPPGEGFLPRDTAVHHRAHVLDVLEKSL 61
Query: 504 ETAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGRIVT 325
A +TP EID +CYTKGPGMGAPL A+V R ++Q W KPIV VNHC+ HIEMGR++T
Sbjct: 62 REANITPDEIDVVCYTKGPGMGAPLVSVAVVARTVAQLWNKPIVGVNHCIGHIEMGRLIT 121
Query: 324 GADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGYNIE 145
GAD+P VLYVSGGNTQVIAYSE RYRIFGETIDIAVGNCLDRFARVL+LSNDP+PGYNIE
Sbjct: 122 GADNPTVLYVSGGNTQVIAYSEKRYRIFGETIDIAVGNCLDRFARVLKLSNDPSPGYNIE 181
Query: 144 QLAKKGEQFIDIPYAVKGMDVSFSGILSYIEATAV 40
Q+AK+G++ I +PY VKGMDVSFSG+LS+IEA ++
Sbjct: 182 QMAKRGKKLIPLPYVVKGMDVSFSGLLSFIEAESL 216
>tr|C0HD86|C0HD86_SALSA Probable O-sialoglycoprotein endopeptidase OS=Salmo
salar GN=GCP PE=2 SV=1
Length = 335
Score = 329 bits (843), Expect = 2e-088
Identities = 154/219 (70%), Positives = 185/219 (84%), Gaps = 1/219 (0%)
Frame = -3
Query: 684 IALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIKSAL 505
+ +GFEGSANKIGVG+V DG +LSNPR TYITPPGQGFLP ETA+HH IL ++K AL
Sbjct: 3 VVIGFEGSANKIGVGIVR-DGEVLSNPRRTYITPPGQGFLPSETARHHRSVILTVLKEAL 61
Query: 504 ETAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGRIVT 325
E A + P ++DC+ YTKGPGMGAPL A+V R ++Q W KP++ VNHC+ HIEMGR++T
Sbjct: 62 EEAGLKPADVDCVAYTKGPGMGAPLVTVALVARTVAQLWGKPLLGVNHCIGHIEMGRLIT 121
Query: 324 GADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGYNIE 145
A++P VLYVSGGNTQVIAYSE RYRIFGETIDIAVGNCLDRFARV+++SNDP+PGYNIE
Sbjct: 122 QANNPTVLYVSGGNTQVIAYSERRYRIFGETIDIAVGNCLDRFARVIKISNDPSPGYNIE 181
Query: 144 QLAKKGEQFIDIPYAVKGMDVSFSGILSYIEATAVEKLK 28
Q+AKKG Q++++PY VKGMDVSFSGILSYIE A + LK
Sbjct: 182 QMAKKGTQYVELPYTVKGMDVSFSGILSYIEEAAGKMLK 220
>tr|B4J0U0|B4J0U0_DROGR GH15886 OS=Drosophila grimshawi GN=GH15886 PE=4 SV=1
Length = 347
Score = 324 bits (828), Expect = 1e-086
Identities = 153/225 (68%), Positives = 190/225 (84%), Gaps = 2/225 (0%)
Frame = -3
Query: 681 ALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIKSALE 502
ALG EGSANKIG+G+V DG +L+N R TYITPPG+GFLP+ETA+HH + IL L++++L+
Sbjct: 4 ALGIEGSANKIGIGIVN-DGKVLANVRRTYITPPGEGFLPKETAKHHREVILALVQASLK 62
Query: 501 TAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGRIVTG 322
A++ P ++D +CYTKGPGM PL V AIV R LS W+KP++ VNHC+ HIEMGR++TG
Sbjct: 63 EAQLQPADLDVICYTKGPGMAPPLLVGAIVARTLSLLWQKPLLGVNHCIGHIEMGRLITG 122
Query: 321 ADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGYNIEQ 142
A +P+VLYVSGGNTQVIAYS RYRIFGETIDIAVGNCLDRFAR+++LSNDP+PGYNIEQ
Sbjct: 123 AQNPIVLYVSGGNTQVIAYSNKRYRIFGETIDIAVGNCLDRFARIIKLSNDPSPGYNIEQ 182
Query: 141 LAKKGEQFIDIPYAVKGMDVSFSGILSYIEATA-VEKLKNRRVHP 10
LAK+G+Q+I +PY VKGMDVSFSGILS+IE A EK +N+R P
Sbjct: 183 LAKEGKQYIKLPYVVKGMDVSFSGILSHIEELAEPEKRRNKRKKP 227
>tr|Q561S3|Q561S3_DANRE O-sialoglycoprotein endopeptidase OS=Danio rerio
GN=si:ch211-214j24.11 PE=2 SV=1
Length = 335
Score = 322 bits (825), Expect = 3e-086
Identities = 152/225 (67%), Positives = 184/225 (81%), Gaps = 1/225 (0%)
Frame = -3
Query: 684 IALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIKSAL 505
I +GFEGSANKIG+G++ DG +LSNPR TYITPPGQGFLP ETA+HH IL +++ AL
Sbjct: 3 IVIGFEGSANKIGIGIIK-DGEVLSNPRRTYITPPGQGFLPGETAKHHRSVILTVLQEAL 61
Query: 504 ETAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGRIVT 325
+ A + +IDC+ YTKGPGMGAPL AIV R ++Q W KP++ VNHC+ HIEMGR++T
Sbjct: 62 DEAGLKAADIDCVAYTKGPGMGAPLVTVAIVARTVAQLWGKPLLGVNHCIGHIEMGRLIT 121
Query: 324 GADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGYNIE 145
A +P VLYVSGGNTQVIAYSE RYRIFGETIDIAVGNCLDRFARV+++SNDP+PGYNIE
Sbjct: 122 NAQNPTVLYVSGGNTQVIAYSERRYRIFGETIDIAVGNCLDRFARVIKISNDPSPGYNIE 181
Query: 144 QLAKKGEQFIDIPYAVKGMDVSFSGILSYIEATAVEKLKNRRVHP 10
Q+AKKG ++I++PY VKGMDVSFSGILSYIE A + L + P
Sbjct: 182 QMAKKGNKYIELPYTVKGMDVSFSGILSYIEDAAHKMLSTDQCTP 226
>tr|Q5RHZ6|Q5RHZ6_DANRE Novel protein similar to vertebrate O-sialoglycoprotein
endopeptidase (OSGEP) OS=Danio rerio GN=si:ch211-214j24.11 PE=2 SV=1
Length = 335
Score = 322 bits (825), Expect = 3e-086
Identities = 152/225 (67%), Positives = 184/225 (81%), Gaps = 1/225 (0%)
Frame = -3
Query: 684 IALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIKSAL 505
I +GFEGSANKIG+G++ DG +LSNPR TYITPPGQGFLP ETA+HH IL +++ AL
Sbjct: 3 IVIGFEGSANKIGIGIIK-DGEVLSNPRRTYITPPGQGFLPGETAKHHRSVILTVLQEAL 61
Query: 504 ETAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGRIVT 325
+ A + +IDC+ YTKGPGMGAPL AIV R ++Q W KP++ VNHC+ HIEMGR++T
Sbjct: 62 DEAGLKAADIDCVAYTKGPGMGAPLVTVAIVARTVAQLWGKPLLGVNHCIGHIEMGRLIT 121
Query: 324 GADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGYNIE 145
A +P VLYVSGGNTQVIAYSE RYRIFGETIDIAVGNCLDRFARV+++SNDP+PGYNIE
Sbjct: 122 NAQNPTVLYVSGGNTQVIAYSERRYRIFGETIDIAVGNCLDRFARVIKISNDPSPGYNIE 181
Query: 144 QLAKKGEQFIDIPYAVKGMDVSFSGILSYIEATAVEKLKNRRVHP 10
Q+AKKG ++I++PY VKGMDVSFSGILSYIE A + L + P
Sbjct: 182 QMAKKGNKYIELPYTVKGMDVSFSGILSYIEDAAHKMLSTDQCTP 226
>tr|Q4SY05|Q4SY05_TETNG Chromosome undetermined SCAF12247, whole genome shotgun
sequence OS=Tetraodon nigroviridis GN=GSTENG00010571001 PE=4 SV=1
Length = 335
Score = 321 bits (822), Expect = 6e-086
Identities = 148/220 (67%), Positives = 185/220 (84%), Gaps = 1/220 (0%)
Frame = -3
Query: 684 IALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIKSAL 505
+ +GFEGSANKIG+G++ DG +LSNPR TYITPPGQGF+P +TA+HH IL +++ AL
Sbjct: 3 VVIGFEGSANKIGIGILR-DGEVLSNPRRTYITPPGQGFMPSDTARHHRAVILTVLQEAL 61
Query: 504 ETAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGRIVT 325
+ A + P +IDC+ YTKGPGMGAPL A+V R ++Q W KP++ VNHC+ HIEMGR++T
Sbjct: 62 DQAGLKPADIDCVAYTKGPGMGAPLVTVALVARTVAQLWGKPLLGVNHCIGHIEMGRLIT 121
Query: 324 GADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGYNIE 145
A++P VLYVSGGNTQVIAYS+ RYRIFGETIDIAVGNCLDRFARV+++SNDP+PGYNIE
Sbjct: 122 QANNPTVLYVSGGNTQVIAYSQRRYRIFGETIDIAVGNCLDRFARVIKISNDPSPGYNIE 181
Query: 144 QLAKKGEQFIDIPYAVKGMDVSFSGILSYIEATAVEKLKN 25
QLAKKG QF+++PY VKGMDVSFSGILSYIE + + L +
Sbjct: 182 QLAKKGSQFVELPYTVKGMDVSFSGILSYIEDASHKMLSS 221
>tr|B4LI30|B4LI30_DROVI GJ12789 OS=Drosophila virilis GN=GJ12789 PE=4 SV=1
Length = 347
Score = 320 bits (818), Expect = 2e-085
Identities = 151/222 (68%), Positives = 187/222 (84%), Gaps = 2/222 (0%)
Frame = -3
Query: 681 ALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIKSALE 502
ALG EGSANKIGVG++ DG +L+N R TYITPPG+GFLP+ETA+HH + IL L++++L+
Sbjct: 4 ALGIEGSANKIGVGIIN-DGKVLANVRRTYITPPGEGFLPKETAKHHREAILALVQASLK 62
Query: 501 TAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGRIVTG 322
A++ P ++D +CYTKGPGM PL V AIV R LS WKKP++ VNHC+ HIEMGR++TG
Sbjct: 63 EAQLKPSDLDVICYTKGPGMAPPLLVGAIVARTLSLLWKKPLLGVNHCIGHIEMGRLITG 122
Query: 321 ADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGYNIEQ 142
A +P+VLYVSGGNTQVIAYS RYRIFGETIDIAVGNCLDRFAR+++LSNDP+PGYNIEQ
Sbjct: 123 AQNPIVLYVSGGNTQVIAYSNKRYRIFGETIDIAVGNCLDRFARIIKLSNDPSPGYNIEQ 182
Query: 141 LAKKGEQFIDIPYAVKGMDVSFSGILSYIEATAVE-KLKNRR 19
LAK+G+ +I +PY VKGMDVSFSGILS+IE A K +N+R
Sbjct: 183 LAKQGQHYIKLPYVVKGMDVSFSGILSHIEELAEPGKRRNKR 224
>tr|B0BMW7|B0BMW7_RAT O-sialoglycoprotein endopeptidase, isoform CRA_b OS=Rattus
norvegicus GN=Osgep PE=2 SV=1
Length = 335
Score = 319 bits (817), Expect = 2e-085
Identities = 151/223 (67%), Positives = 183/223 (82%), Gaps = 1/223 (0%)
Frame = -3
Query: 678 LGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIKSALET 499
LGFEGSANKIGVGVV DGT+L+NPR TY+T PG GFLP +TA+HH IL L++ AL
Sbjct: 5 LGFEGSANKIGVGVVR-DGTVLANPRRTYVTAPGTGFLPGDTARHHRAVILDLLQEALTE 63
Query: 498 AKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGRIVTGA 319
A +TP++IDC+ YTKGPGMGAPL A+V R ++Q W KP++ VNHC+ HIEMGR++TGA
Sbjct: 64 AGLTPKDIDCIAYTKGPGMGAPLASVAVVARTVAQLWNKPLLGVNHCIGHIEMGRLITGA 123
Query: 318 DDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGYNIEQL 139
+P VLYVSGGNTQVI+YSE RYRIFGETIDIAVGNCLDRFARVL++SNDP+PGYNIEQ+
Sbjct: 124 VNPTVLYVSGGNTQVISYSEHRYRIFGETIDIAVGNCLDRFARVLKISNDPSPGYNIEQM 183
Query: 138 AKKGEQFIDIPYAVKGMDVSFSGILSYIEATAVEKLKNRRVHP 10
AK+G++ +++PY VKGMDVSFSGILS+IE A L P
Sbjct: 184 AKRGKKLVELPYTVKGMDVSFSGILSFIEDAAQRMLATGECTP 226
>tr|B3M5K1|B3M5K1_DROAN GF10115 OS=Drosophila ananassae GN=GF10115 PE=4 SV=1
Length = 347
Score = 319 bits (815), Expect = 4e-085
Identities = 152/222 (68%), Positives = 185/222 (83%), Gaps = 2/222 (0%)
Frame = -3
Query: 681 ALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIKSALE 502
ALG EGSANKIG+G++ DG +L+N R TYITPPG+GFLP+ETA+HH + IL L++S+L+
Sbjct: 4 ALGIEGSANKIGIGIIK-DGEVLANVRRTYITPPGEGFLPKETAKHHREAILGLVQSSLK 62
Query: 501 TAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGRIVTG 322
AK+ P ++D +CYTKGPGM PL V AIV R LS W KP++ VNHC+ HIEMGR++TG
Sbjct: 63 EAKLQPADLDVICYTKGPGMAPPLLVGAIVARTLSLLWAKPLLGVNHCIGHIEMGRLITG 122
Query: 321 ADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGYNIEQ 142
A +P+VLYVSGGNTQVIAYS RYRIFGETIDIAVGNCLDRFAR+++LSNDP+PGYNIEQ
Sbjct: 123 AQNPIVLYVSGGNTQVIAYSNQRYRIFGETIDIAVGNCLDRFARIIKLSNDPSPGYNIEQ 182
Query: 141 LAKKGEQFIDIPYAVKGMDVSFSGILSYIEATAVE-KLKNRR 19
LAKK ++I +PY VKGMDVSFSGILSYIE A K +N+R
Sbjct: 183 LAKKSNRYIKLPYVVKGMDVSFSGILSYIEDLAEPGKRQNKR 224
>tr|B4L9L3|B4L9L3_DROMO GI16537 OS=Drosophila mojavensis GN=GI16537 PE=4 SV=1
Length = 347
Score = 317 bits (812), Expect = 9e-085
Identities = 150/222 (67%), Positives = 188/222 (84%), Gaps = 2/222 (0%)
Frame = -3
Query: 681 ALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIKSALE 502
ALG EGSANKIGVG++ +G +L+N R TYITPPG+GFLP+ETA+HH + IL L++++L+
Sbjct: 4 ALGIEGSANKIGVGIIN-NGKVLANVRRTYITPPGEGFLPKETAKHHREAILGLVQASLK 62
Query: 501 TAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGRIVTG 322
A++ P ++D +CYTKGPGM PL V AIV R LS W+KP++ VNHC+ HIEMGR++TG
Sbjct: 63 EAQLKPADLDVICYTKGPGMAPPLLVGAIVARTLSLLWQKPLLGVNHCIGHIEMGRLITG 122
Query: 321 ADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGYNIEQ 142
A +P+VLYVSGGNTQVIAYS RYRIFGETIDIAVGNCLDRFAR+++LSNDP+PGYNIEQ
Sbjct: 123 AQNPIVLYVSGGNTQVIAYSNKRYRIFGETIDIAVGNCLDRFARIIKLSNDPSPGYNIEQ 182
Query: 141 LAKKGEQFIDIPYAVKGMDVSFSGILSYIEATA-VEKLKNRR 19
LAK+G+Q+I +PY VKGMDVSFSGILS+IE A K +N+R
Sbjct: 183 LAKQGKQYIKLPYVVKGMDVSFSGILSHIEELADPSKRRNKR 224
>tr|Q582L2|Q582L2_9TRYP O-sialoglycoprotein endopeptidase, putative
OS=Trypanosoma brucei GN=Tb927.7.6470 PE=4 SV=1
Length = 372
Score = 315 bits (807), Expect = 4e-084
Identities = 146/215 (67%), Positives = 178/215 (82%)
Frame = -3
Query: 693 KKQIALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIK 514
++ +ALG EGSANKI VG+V +G +LSN R TYITPPG GF+PRETAQHH +IL L++
Sbjct: 6 QRMLALGIEGSANKIAVGIVDRNGNVLSNERETYITPPGTGFMPRETAQHHTAHILRLVQ 65
Query: 513 SALETAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGR 334
+A++ AKV +I +CYTKGPGMGAPL V V + LS W P+V VNHC+ HIEMGR
Sbjct: 66 AAMKAAKVHASDISVICYTKGPGMGAPLAVGCTVAKTLSLLWSVPLVGVNHCIGHIEMGR 125
Query: 333 IVTGADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGY 154
+VTG+++P+VLYVSGGNTQVIAY+E RYRIFGETIDIAVGNCLDR AR+L LSNDPAPGY
Sbjct: 126 VVTGSENPIVLYVSGGNTQVIAYAEHRYRIFGETIDIAVGNCLDRVARLLNLSNDPAPGY 185
Query: 153 NIEQLAKKGEQFIDIPYAVKGMDVSFSGILSYIEA 49
NIEQ AK+G FI++PY VKGMD+SFSG+LS++EA
Sbjct: 186 NIEQCAKRGRVFIELPYVVKGMDMSFSGLLSFVEA 220
>tr|A4I6A3|A4I6A3_LEIIN O-sialoglycoprotein endopeptidase, putative
(Metallo-peptidase, clan mk, family m67) OS=Leishmania infantum
GN=LinJ31.0100 PE=4 SV=1
Length = 364
Score = 315 bits (806), Expect = 5e-084
Identities = 151/215 (70%), Positives = 178/215 (82%)
Frame = -3
Query: 696 MKKQIALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLI 517
MK+ ++LG EGSANKIGVGVV GT+LSN R TYITPPG GFLPRETA HH Q++L ++
Sbjct: 1 MKRTLSLGIEGSANKIGVGVVDQSGTVLSNVRETYITPPGTGFLPRETAIHHSQHVLQVV 60
Query: 516 KSALETAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMG 337
+ A+ A VTP ID + YTKGPGMGAPL V V + LS W KP+V VNHCV HIEMG
Sbjct: 61 QRAMHDAAVTPAAIDIISYTKGPGMGAPLTVGCTVAKTLSLLWGKPLVGVNHCVGHIEMG 120
Query: 336 RIVTGADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPG 157
R+VT +++PVVLYVSGGNTQVIAY++ RYRIFGETIDIAVGNCLDR AR+L++SNDPAPG
Sbjct: 121 RVVTKSENPVVLYVSGGNTQVIAYADHRYRIFGETIDIAVGNCLDRVARLLDISNDPAPG 180
Query: 156 YNIEQLAKKGEQFIDIPYAVKGMDVSFSGILSYIE 52
YNIEQ AKKG+ +I +PY VKGMD+SF+GILSYIE
Sbjct: 181 YNIEQKAKKGKCYIRLPYTVKGMDMSFTGILSYIE 215
>tr|Q4Q6Q4|Q4Q6Q4_LEIMA O-sialoglycoprotein endopeptidase, putative
(Metallo-peptidase, clan mk, family m67) OS=Leishmania major
GN=LmjF31.0100 PE=4 SV=1
Length = 364
Score = 315 bits (805), Expect = 6e-084
Identities = 150/215 (69%), Positives = 177/215 (82%)
Frame = -3
Query: 696 MKKQIALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLI 517
MK+ ++LG EGSANKIGVGVV GT+LSN R TYITPPG GFLPRETA HH Q++L ++
Sbjct: 1 MKRTLSLGIEGSANKIGVGVVDQSGTVLSNVRETYITPPGSGFLPRETAIHHSQHVLQVV 60
Query: 516 KSALETAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMG 337
+ A+ A VTP +ID + YTKGPGMG PL V V + LS W KP+V VNHCV HIEMG
Sbjct: 61 QRAMHDAAVTPADIDIISYTKGPGMGGPLSVGCTVAKTLSLLWGKPLVGVNHCVGHIEMG 120
Query: 336 RIVTGADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPG 157
R+VT +++PVVLYVSGGNTQVIAY++ RYRIFGETIDIAVGNCLDR AR+L +SNDPAPG
Sbjct: 121 RVVTKSENPVVLYVSGGNTQVIAYADHRYRIFGETIDIAVGNCLDRVARLLNISNDPAPG 180
Query: 156 YNIEQLAKKGEQFIDIPYAVKGMDVSFSGILSYIE 52
YNIEQ AKKG+ +I +PY VKGMD+SF+GILSYIE
Sbjct: 181 YNIEQKAKKGKCYIRLPYTVKGMDMSFTGILSYIE 215
>tr|Q58EU1|Q58EU1_MOUSE O-sialoglycoprotein endopeptidase OS=Mus musculus
GN=Osgep PE=2 SV=1
Length = 335
Score = 315 bits (805), Expect = 6e-084
Identities = 149/223 (66%), Positives = 182/223 (81%), Gaps = 1/223 (0%)
Frame = -3
Query: 678 LGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIKSALET 499
LGFEGSANKIGVGVV DGT+L+NPR TY+T PG GFLP +TA+HH IL L++ AL
Sbjct: 5 LGFEGSANKIGVGVVR-DGTVLANPRRTYVTAPGTGFLPGDTARHHRAVILDLLQEALAE 63
Query: 498 AKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGRIVTGA 319
A +T ++IDC+ +TKGPGMGAPL A+V R ++Q W KP++ VNHC+ HIEMGR++TGA
Sbjct: 64 AGLTSKDIDCIAFTKGPGMGAPLASVAVVARTVAQLWNKPLLGVNHCIGHIEMGRLITGA 123
Query: 318 DDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGYNIEQL 139
+P VLYVSGGNTQVI+YSE RYRIFGETIDIAVGNCLDRFARVL++SNDP+PGYNIEQ+
Sbjct: 124 VNPTVLYVSGGNTQVISYSEHRYRIFGETIDIAVGNCLDRFARVLKISNDPSPGYNIEQM 183
Query: 138 AKKGEQFIDIPYAVKGMDVSFSGILSYIEATAVEKLKNRRVHP 10
AK+G++ +++PY VKGMDVSFSGILS+IE A L P
Sbjct: 184 AKRGKKLVELPYTVKGMDVSFSGILSFIEDAAQRMLATGECTP 226
>tr|B3NIB9|B3NIB9_DROER GG15998 OS=Drosophila erecta GN=GG15998 PE=4 SV=1
Length = 347
Score = 314 bits (803), Expect = 1e-083
Identities = 147/213 (69%), Positives = 179/213 (84%), Gaps = 1/213 (0%)
Frame = -3
Query: 681 ALGFEGSANKIGVGVVTLDGTILSNPRHTYITPPGQGFLPRETAQHHLQYILPLIKSALE 502
ALG EGSANKIG+G++ DG +L+N R TYITPPG+GFLP+ETA+HH + IL L+KS+L+
Sbjct: 4 ALGIEGSANKIGIGIIR-DGEVLANVRRTYITPPGEGFLPKETAKHHREAILGLVKSSLK 62
Query: 501 TAKVTPQEIDCLCYTKGPGMGAPLQVAAIVVRVLSQPWKKPIVAVNHCVAHIEMGRIVTG 322
A++ P ++D +CYTKGPGM PL V AIV R LS W+ P++ VNHC+ HIEMGR++TG
Sbjct: 63 EAQLEPSDLDVICYTKGPGMAPPLLVGAIVARTLSLLWEIPLLGVNHCIGHIEMGRLITG 122
Query: 321 ADDPVVLYVSGGNTQVIAYSEGRYRIFGETIDIAVGNCLDRFARVLELSNDPAPGYNIEQ 142
A +P+VLYVSGGNTQVIAYS RYRIFGETIDIAVGNCLDRFAR+++LSNDP+PGYNIEQ
Sbjct: 123 AQNPIVLYVSGGNTQVIAYSNKRYRIFGETIDIAVGNCLDRFARIIKLSNDPSPGYNIEQ 182
Query: 141 LAKKGEQFIDIPYAVKGMDVSFSGILSYIEATA 43
LAK ++I +PY VKGMDVSFSGILSYIE A
Sbjct: 183 LAKSSNRYIKLPYVVKGMDVSFSGILSYIEDLA 215
Database: UniProt/TrEMBL
Posted date: Sun May 10 14:41:50 2009
Number of letters in database: 2,506,224,640
Number of sequences in database: 7,695,149
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 344,827,939,568
Number of Sequences: 7695149
Number of Extensions: 344827939568
Number of Successful Extensions: 265040416
Number of sequences better than 0.0: 0