BLASTX 7.6.2
Query= UN14186 /QuerySize=1345
(1344 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT... 649 4e-184
gi|30682289|ref|NP_187876.2| aspartyl protease family protein [A... 621 9e-176
gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana] 617 2e-174
gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 ... 383 5e-104
gi|225462334|ref|XP_002265771.1| PREDICTED: hypothetical protein... 380 4e-103
gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vini... 376 5e-102
gi|259490398|ref|NP_001159203.1| hypothetical protein LOC1003042... 297 5e-078
gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precur... 294 4e-077
gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT... 235 1e-059
gi|238479750|ref|NP_001154610.1| aspartyl protease family protei... 235 2e-059
gi|212722026|ref|NP_001131674.1| hypothetical protein LOC1001930... 233 9e-059
gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT... 205 3e-050
gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Trit... 194 4e-047
gi|226491620|ref|NP_001149154.1| pepsin A [Zea mays] 159 2e-036
gi|224029721|gb|ACN33936.1| unknown [Zea mays] 158 2e-036
gi|242068179|ref|XP_002449366.1| hypothetical protein SORBIDRAFT... 96 2e-017
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632
[Arabidopsis lyrata subsp. lyrata]
Length = 449
Score = 649 bits (1673), Expect = 4e-184
Identities = 327/427 (76%), Positives = 363/427 (85%), Gaps = 9/427 (2%)
Frame = +1
Query: 25 KEMKRTRALLCLI-PILLTAAADSTEDTAVRLKISHRDTLFPTSSHRIEDIISEDQKRHS 201
+ +K + + LCLI +LL AADSTEDTAVRLK++HRDTL+P RIEDII DQKRHS
Sbjct: 2 QRIKTSLSCLCLITTLLLLTAADSTEDTAVRLKLAHRDTLWPNPLSRIEDIIGADQKRHS 61
Query: 202 LITRKRKTNGGGAKLPLRSGSDYGAAQYFADVKVGTPAKRFRVVVDTGSELTWVNCRFRG 381
LI+RKRK GG K+ L SG DYG AQYF +V+VGTPAK+FRVVVDTGSELTWVNCR+RG
Sbjct: 62 LISRKRKFK-GGVKMDLGSGIDYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWVNCRYRG 120
Query: 382 KGKGKEKEKKRRVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSYDYRYAD 561
+GKG K K RRVFRAEES SF+ VGC TQTCK DLMNLFSLS CPTPSTPCSYDYRYAD
Sbjct: 121 RGKG--KVKNRRVFRAEESKSFKTVGCFTQTCKVDLMNLFSLSTCPTPSTPCSYDYRYAD 178
Query: 562 GSSAQGVFAKETFTVDLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSK 741
GS+AQGVFAKET TV LTNGR ARLRGLL+GCSSSF G SFQGADGVLGLA SD+SFTS
Sbjct: 179 GSAAQGVFAKETITVGLTNGRKARLRGLLVGCSSSFSGQSFQGADGVLGLAFSDFSFTST 238
Query: 742 ATNLFGGKFSYCLVDHRSHKNVSSYLIFGSTTKPTAT-----RTTPLDLNLIPPFYAINI 906
AT+LFG K SYCLVDH S+KN+S+YLIFG ++ T+T RTTPLDL LIPPFYAINI
Sbjct: 239 ATSLFGAKLSYCLVDHLSNKNISNYLIFGYSSSSTSTKTAPGRTTPLDLTLIPPFYAINI 298
Query: 907 IGISLGDDMLDIPSQVWDATNGGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRVKP 1086
IGIS+GDDMLDIP+QVWDAT GGGTILDSGTSLTLLA+AAYKPVV+GL RYLV LKRVKP
Sbjct: 299 IGISIGDDMLDIPTQVWDATTGGGTILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKP 358
Query: 1087 EGVPIEYCFDVTSGFNESKLPQLMFHFDGGARFEPHRRSYLVDAAHGVKCLGFVSAGTPL 1266
EG+PIEYCF TSGFNESKLPQL FH GGARFEPHR+SYLVDAA GVKCLGF+SAGTP
Sbjct: 359 EGIPIEYCFSSTSGFNESKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFMSAGTPA 418
Query: 1267 LMWLGHM 1287
+G++
Sbjct: 419 TNVVGNI 425
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis
thaliana]
Length = 461
Score = 621 bits (1601), Expect = 9e-176
Identities = 319/430 (74%), Positives = 354/430 (82%), Gaps = 10/430 (2%)
Frame = +1
Query: 10 GKGKKKEMKRTRALL-CLI-PILLTAAADSTEDTAVRLKISHRDTLFPTSSHRIEDIISE 183
G K +E K + LL CLI +LL ADS +DT+VRLK++HRDTL P RIED+I
Sbjct: 14 GDKKNQEEKMQKTLLSCLITTLLLITVADSMKDTSVRLKLAHRDTLLPKPLSRIEDVIGA 73
Query: 184 DQKRHSLITRKRKTNGGGAKLPLRSGSDYGAAQYFADVKVGTPAKRFRVVVDTGSELTWV 363
DQKRHSLI+RKR + G K+ L SG DYG AQYF +++VGTPAK+FRVVVDTGSELTWV
Sbjct: 74 DQKRHSLISRKRNST-VGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWV 132
Query: 364 NCRFRGKGKGKEKEKKRRVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSY 543
NCR+R +GK RRVFRA+ES SF+ VGCLTQTCK DLMNLFSL+ CPTPSTPCSY
Sbjct: 133 NCRYRARGK-----DNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSY 187
Query: 544 DYRYADGSSAQGVFAKETFTVDLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSD 723
DYRYADGS+AQGVFAKET TV LTNGR+ARL G LIGCSSSF G SFQGADGVLGLA SD
Sbjct: 188 DYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSD 247
Query: 724 YSFTSKATNLFGGKFSYCLVDHRSHKNVSSYLIFGS--TTKPTATRTTPLDLNLIPPFYA 897
+SFTS AT+L+G KFSYCLVDH S+KNVS+YLIFGS +TK RTTPLDL IPPFYA
Sbjct: 248 FSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYA 307
Query: 898 INIIGISLGDDMLDIPSQVWDATNGGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKR 1077
IN+IGISLG DMLDIPSQVWDAT+GGGTILDSGTSLTLLADAAYK VV+GL RYLV LKR
Sbjct: 308 INVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKR 367
Query: 1078 VKPEGVPIEYCFDVTSGFNESKLPQLMFHFDGGARFEPHRRSYLVDAAHGVKCLGFVSAG 1257
VKPEGVPIEYCF TSGFN SKLPQL FH GGARFEPHR+SYLVDAA GVKCLGFVSAG
Sbjct: 368 VKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAG 427
Query: 1258 TPLLMWLGHM 1287
TP +G++
Sbjct: 428 TPATNVIGNI 437
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 617 bits (1590), Expect = 2e-174
Identities = 314/416 (75%), Positives = 347/416 (83%), Gaps = 9/416 (2%)
Frame = +1
Query: 49 LLCLI-PILLTAAADSTEDTAVRLKISHRDTLFPTSSHRIEDIISEDQKRHSLITRKRKT 225
L CLI +LL ADS +DT+VRLK++HRDTL P RIED+I DQKRHSLI+RKR +
Sbjct: 6 LSCLITTLLLITVADSMKDTSVRLKLAHRDTLLPKPLSRIEDVIGADQKRHSLISRKRNS 65
Query: 226 NGGGAKLPLRSGSDYGAAQYFADVKVGTPAKRFRVVVDTGSELTWVNCRFRGKGKGKEKE 405
G K+ L SG DYG AQYF +++VGTPAK+FRVVVDTGSELTWVNCR+R +GK
Sbjct: 66 T-VGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGK----- 119
Query: 406 KKRRVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSYDYRYADGSSAQGVF 585
RRVFRA+ES SF+ VGCLTQTCK DLMNLFSL+ CPTPSTPCSYDYRYADGS+AQGVF
Sbjct: 120 DNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVF 179
Query: 586 AKETFTVDLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNLFGGK 765
AKET TV LTNGR+ARL G LIGCSSSF G SFQGADGVLGLA SD+SFTS AT+L+G K
Sbjct: 180 AKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAK 239
Query: 766 FSYCLVDHRSHKNVSSYLIFGS--TTKPTATRTTPLDLNLIPPFYAINIIGISLGDDMLD 939
FSYCLVDH S+KNVS+YLIFGS +TK RTTPLDL IPPFYAIN+IGISLG DMLD
Sbjct: 240 FSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLD 299
Query: 940 IPSQVWDATNGGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRVKPEGVPIEYCFDV 1119
IPSQVWDAT+GGGTILDSGTSLTLLADAAYK VV+GL RYLV LKRVKPEGVPIEYCF
Sbjct: 300 IPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSF 359
Query: 1120 TSGFNESKLPQLMFHFDGGARFEPHRRSYLVDAAHGVKCLGFVSAGTPLLMWLGHM 1287
TSGFN SKLPQL FH GGARFEPHR+SYLVDAA GVKCLGFVSAGTP +G++
Sbjct: 360 TSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNI 415
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis
vinifera]
Length = 449
Score = 383 bits (982), Expect = 5e-104
Identities = 189/350 (54%), Positives = 243/350 (69%), Gaps = 10/350 (2%)
Frame = +1
Query: 241 KLPLRSGSDYGAAQYFADVKVGTPAKRFRVVVDTGSELTWVNCRFRGKGKGKEKEKKR-- 414
++P+ +DYG QYF KVGTP+++F +V DTGS+LTW++C++ + + K R
Sbjct: 69 EVPMHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRI 128
Query: 415 ---RVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSYDYRYADGSSAQGVF 585
RVF A SSSF+ + CLT CK +LM+LFSL+NCPTP TPC YDYRY+DGS+A G F
Sbjct: 129 RHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFF 188
Query: 586 AKETFTVDLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNLFGGK 765
A ET TV+L GR +L +LIGCS SF G SFQ ADGV+GL S YSF KA FGGK
Sbjct: 189 ANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGK 248
Query: 766 FSYCLVDHRSHKNVSSYLIFGSTTKPTATRT----TPLDLNLIPPFYAINIIGISLGDDM 933
FSYCLVDH SHKNVS+YL FGS+ A T L L ++ FYA+N++GIS+G M
Sbjct: 249 FSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAM 308
Query: 934 LDIPSQVWDATNGGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRVKPEGVPIEYCF 1113
L IPS+VWD GGTILDSG+SLT L + AY+PV++ L L+ ++V+ + P+EYCF
Sbjct: 309 LKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCF 368
Query: 1114 DVTSGFNESKLPQLMFHFDGGARFEPHRRSYLVDAAHGVKCLGFVSAGTP 1263
+ T GF ES +P+L+FHF GA FEP +SY++ AA GV+CLGFVS P
Sbjct: 369 NST-GFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWP 417
>gi|225462334|ref|XP_002265771.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 486
Score = 380 bits (974), Expect = 4e-103
Identities = 188/350 (53%), Positives = 242/350 (69%), Gaps = 10/350 (2%)
Frame = +1
Query: 241 KLPLRSGSDYGAAQYFADVKVGTPAKRFRVVVDTGSELTWVNCRFRGKGKGKEKEKKR-- 414
++P+ +DYG QY KVGTP+++F +V DTGS+LTW++C++ + + K R
Sbjct: 106 EVPMHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRI 165
Query: 415 ---RVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSYDYRYADGSSAQGVF 585
RVF A SSSF+ + CLT CK +LM+LFSL+NCPTP TPC YDYRY+DGS+A G F
Sbjct: 166 RHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFF 225
Query: 586 AKETFTVDLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNLFGGK 765
A ET TV+L GR +L +LIGCS SF G SFQ ADGV+GL S YSF KA FGGK
Sbjct: 226 ANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGK 285
Query: 766 FSYCLVDHRSHKNVSSYLIFGSTTKPTATRT----TPLDLNLIPPFYAINIIGISLGDDM 933
FSYCLVDH SHKNVS+YL FGS+ A T L L ++ FYA+N++GIS+G M
Sbjct: 286 FSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAM 345
Query: 934 LDIPSQVWDATNGGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRVKPEGVPIEYCF 1113
L IPS+VWD GGTILDSG+SLT L + AY+PV++ L L+ ++V+ + P+EYCF
Sbjct: 346 LKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCF 405
Query: 1114 DVTSGFNESKLPQLMFHFDGGARFEPHRRSYLVDAAHGVKCLGFVSAGTP 1263
+ T GF ES +P+L+FHF GA FEP +SY++ AA GV+CLGFVS P
Sbjct: 406 NST-GFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWP 454
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 376 bits (965), Expect = 5e-102
Identities = 187/347 (53%), Positives = 239/347 (68%), Gaps = 10/347 (2%)
Frame = +1
Query: 250 LRSGSDYGAAQYFADVKVGTPAKRFRVVVDTGSELTWVNCRFRGKGKGKEKEKKR----- 414
+ +DYG QY KVGTP+++F +V DTGS+LTW++C++ + + K R
Sbjct: 1 MHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHK 60
Query: 415 RVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSYDYRYADGSSAQGVFAKE 594
RVF A SSSF+ + CLT CK +LM+LFSL+NCPTP TPC YDYRY+DGS+A G FA E
Sbjct: 61 RVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANE 120
Query: 595 TFTVDLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNLFGGKFSY 774
T TV+L GR +L +LIGCS SF G SFQ ADGV+GL S YSF KA FGGKFSY
Sbjct: 121 TVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSY 180
Query: 775 CLVDHRSHKNVSSYLIFGSTTKPTATRT----TPLDLNLIPPFYAINIIGISLGDDMLDI 942
CLVDH SHKNVS+YL FGS+ A T L L ++ FYA+N++GIS+G ML I
Sbjct: 181 CLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKI 240
Query: 943 PSQVWDATNGGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRVKPEGVPIEYCFDVT 1122
PS+VWD GGTILDSG+SLT L + AY+PV++ L L+ ++V+ + P+EYCF+ T
Sbjct: 241 PSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNST 300
Query: 1123 SGFNESKLPQLMFHFDGGARFEPHRRSYLVDAAHGVKCLGFVSAGTP 1263
GF ES +P+L+FHF GA FEP +SY++ AA GV+CLGFVS P
Sbjct: 301 -GFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWP 346
>gi|259490398|ref|NP_001159203.1| hypothetical protein LOC100304289 [Zea mays]
Length = 378
Score = 297 bits (758), Expect = 5e-078
Identities = 162/362 (44%), Positives = 224/362 (61%), Gaps = 23/362 (6%)
Frame = +1
Query: 244 LPLRSGSDYGAAQYFADVKVGTPAKRFRVVVDTGSELTWVNCRFRGKGKGKEKEKKRRVF 423
+PL SG+ G QYF +VGTPA+ F +V DTGS+LTWV C RG + R F
Sbjct: 1 MPLSSGAYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKC--RGAAGPPASDPPAREF 58
Query: 424 RAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSYDYRYADGSSAQGVFAKETFT 603
RA ES S+ + C + TC + + FSL+NC +P++PC+YDYRY DGS+A+GV + T
Sbjct: 59 RASESRSWAPLACSSDTCTSYVP--FSLANCSSPASPCAYDYRYKDGSAARGVVGTDAAT 116
Query: 604 VDLT----------NGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNL 753
+ L+ GR A+L+G+++GC++++DG SFQ +DGVL L S+ SF S+A
Sbjct: 117 IALSGSGSEDGSGGGGRRAKLQGVVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAAR 176
Query: 754 FGGKFSYCLVDHRSHKNVSSYLIFGSTTK----PTATRTTPLDLNLIPPFYAINIIGISL 921
FGG+FSYCLVDH + +N SSYL FG + P A LD + PFYA+ + + +
Sbjct: 177 FGGRFSYCLVDHLAPRNASSYLTFGPGPEGGGAPAARTPLVLD-RRVSPFYAVAVDAVYV 235
Query: 922 GDDMLDIPSQVWDATNGGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRVKPEGVPI 1101
+ LDIP+ VWD GGG ILDSGTSLT+LA AY+ VV+ L L L RV + P
Sbjct: 236 AGEALDIPADVWDVGRGGGAILDSGTSLTVLATPAYRAVVAALGGRLAALPRVAMD--PF 293
Query: 1102 EYCFDVTSGFNESKLPQLMFHFDGGARFEPHRRSYLVDAAHGVKCLGFVSAGTPLLMWLG 1281
EYC++ T+G E +P+L F G AR EP +SY++DAA GVKC+G P + +G
Sbjct: 294 EYCYNWTAGAPE--IPKLEVSFAGSARLEPPAKSYVIDAAPGVKCIGVQEGAWPGVSVIG 351
Query: 1282 HM 1287
++
Sbjct: 352 NI 353
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative
[Ricinus communis]
Length = 489
Score = 294 bits (750), Expect = 4e-077
Identities = 156/355 (43%), Positives = 218/355 (61%), Gaps = 7/355 (1%)
Frame = +1
Query: 238 AKLPLRSGSDYGAAQYFADVKVGTP-AKRFRVVVDTGSELTWVNCRFRGKGKGKEKEKKR 414
A++P+ SG+D G +QYF +++GTP ++F +V DTGS+LTW+NC + K K
Sbjct: 104 AQIPIHSGADSGQSQYFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPG 163
Query: 415 RVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSYDYRYADGSSAQGVFAKE 594
RVFRA +SSSFR + C + CK +L + FSL+ CP P+ PC +DYRY +G A GVFA E
Sbjct: 164 RVFRANDSSSFRTIPCSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANE 223
Query: 595 TFTVDLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNLFGGKFSY 774
T TV L + + RL +LIGC+ SF+ ++ DGV+GL +S + +FG KFSY
Sbjct: 224 TVTVGLNDHKKIRLFDVLIGCTESFN-ETNGFPDGVMGLGYRKHSLALRLAEIFGNKFSY 282
Query: 775 CLVDHRSHKNVSSYLIFGS--TTKPTATRTTPLDLNLIPPFYAINIIGISLGDDMLDIPS 948
CLVDH S N ++L FG K + T L L I FY +N+ GIS+G ML I S
Sbjct: 283 CLVDHLSSSNHKNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSMLSISS 342
Query: 949 QVWDATNGGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRVKPEGVP--IEYCFDVT 1122
+W+ T GG I+DSGTSLT+LA AY VV L+ K+V P +P +CF+
Sbjct: 343 DIWNVTGVGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPELNNFCFE-D 401
Query: 1123 SGFNESKLPQLMFHFDGGARFEPHRRSYLVDAAHGVKCLGFVSAGTPLLMWLGHM 1287
GF+ + +P+L+ HF GA F+P +SY++D A G+KCLG + A P LG++
Sbjct: 402 KGFDRAAVPRLLIHFADGAIFKPPVKSYIIDVAEGIKCLGIIKADFPGSSILGNV 456
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050
[Sorghum bicolor]
Length = 466
Score = 235 bits (599), Expect = 1e-059
Identities = 130/296 (43%), Positives = 186/296 (62%), Gaps = 11/296 (3%)
Frame = +1
Query: 415 RVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSYDYRYADGSS-AQGVFAK 591
RVFR + S S+ + C + TCK D+ F+L+NC +P++PC+YDYRY +GS+ A+G+
Sbjct: 152 RVFRPKTSRSWAPIPCSSDTCKLDVP--FTLANCSSPASPCTYDYRYKEGSAGARGIVGT 209
Query: 592 ETFTVDLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNLFGGKFS 771
E+ T+ L G+VA+L+ +++GCSSS DG SF+ ADGVL L + SF ++A FGG FS
Sbjct: 210 ESATIALPGGKVAQLKDVVLGCSSSHDGQSFRSADGVLSLGNAKISFATQAAARFGGSFS 269
Query: 772 YCLVDHRSHKNVSSYLIFGSTTKP-TATRTTPLDLNLIPPFYAINIIGISLGDDMLDIPS 948
YCLVDH + +N + YL FG P T T L L+ PFY + + I + LDIP+
Sbjct: 270 YCLVDHLAPRNATGYLAFGPGQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPA 329
Query: 949 QVWDATNGGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRVKPEGVPIEYCFDVTS- 1125
+VWDA GG ILDSG +LT+LA AYK VV+ L ++L G+ +V P E+C++ T+
Sbjct: 330 EVWDA-KSGGVILDSGNTLTVLAAPAYKAVVAALSKHLDGVPKV--SFPPFEHCYNWTAR 386
Query: 1126 --GFNESKLPQLMFHFDGGARFEPHRRSYLVDAAHGVKCLGFVSAGTPLLMWLGHM 1287
G E +P+L F G AR EP +SY++D GVKC+G P L +G++
Sbjct: 387 RPGAPEI-IPKLAVQFAGSARLEPPAKSYVIDVKPGVKCIGVQEGEWPGLSVIGNI 441
>gi|238479750|ref|NP_001154610.1| aspartyl protease family protein [Arabidopsis
thaliana]
Length = 263
Score = 235 bits (597), Expect = 2e-059
Identities = 120/183 (65%), Positives = 140/183 (76%), Gaps = 8/183 (4%)
Frame = +1
Query: 10 GKGKKKEMKRTRALL-CLI-PILLTAAADSTEDTAVRLKISHRDTLFPTSSHRIEDIISE 183
G K +E K + LL CLI +LL ADS +DT+VRLK++HRDTL P RIED+I
Sbjct: 14 GDKKNQEEKMQKTLLSCLITTLLLITVADSMKDTSVRLKLAHRDTLLPKPLSRIEDVIGA 73
Query: 184 DQKRHSLITRKRKTNGGGAKLPLRSGSDYGAAQYFADVKVGTPAKRFRVVVDTGSELTWV 363
DQKRHSLI+RKR + G K+ L SG DYG AQYF +++VGTPAK+FRVVVDTGSELTWV
Sbjct: 74 DQKRHSLISRKRNST-VGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWV 132
Query: 364 NCRFRGKGKGKEKEKKRRVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSY 543
NCR+R +GK RRVFRA+ES SF+ VGCLTQTCK DLMNLFSL+ CPTPSTPCSY
Sbjct: 133 NCRYRARGK-----DNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSY 187
Query: 544 DYR 552
DYR
Sbjct: 188 DYR 190
>gi|212722026|ref|NP_001131674.1| hypothetical protein LOC100193034 [Zea mays]
Length = 441
Score = 233 bits (592), Expect = 9e-059
Identities = 127/293 (43%), Positives = 181/293 (61%), Gaps = 8/293 (2%)
Frame = +1
Query: 418 VFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSYDYRYADGSS-AQGVFAKE 594
VFR E S S+ V C + TCK D+ FSL+NC + ++PCSYDYRY +GS+ A GV +
Sbjct: 129 VFRPEASKSWAPVPCSSDTCKLDVP--FSLANCSSSASPCSYDYRYKEGSAGALGVVGTD 186
Query: 595 TFTVDLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNLFGGKFSY 774
+ T+ L G+VA+L+ +++GCSS+ DG SF+ DGVL L + SF S+A FGG FSY
Sbjct: 187 SATIALPGGKVAQLQDVVLGCSSTHDGQSFKSVDGVLSLGNAKISFASRAAARFGGSFSY 246
Query: 775 CLVDHRSHKNVSSYLIFGSTTKP-TATRTTPLDLNLIPPFYAINIIGISLGDDMLDIPSQ 951
CLVDH + +N + YL FG P T T L L+ PFY + + + + LDIP++
Sbjct: 247 CLVDHLAPRNATGYLAFGPGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAE 306
Query: 952 VWDATNGGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRVKPEGVPIEYCFDVTSGF 1131
VWD GG ILDSGT+LT+LA AYK VV+ L + L G+ +V + P E+C++ T+
Sbjct: 307 VWD-PKSGGVILDSGTTLTVLATPAYKAVVAALTKLLAGVPKV--DFPPFEHCYNWTAPR 363
Query: 1132 -NESKLPQLMFHFDGGARFEPHRRSYLVDAAHGVKCLGFVSAGTPLLMWLGHM 1287
++P+L F G AR EP +SY++D GVKC+G P + +G++
Sbjct: 364 PGAPEIPKLAVQFTGCARLEPPAKSYVIDVKPGVKCIGLQEGEWPGVSVIGNI 416
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060
[Sorghum bicolor]
Length = 466
Score = 205 bits (519), Expect = 3e-050
Identities = 102/227 (44%), Positives = 147/227 (64%), Gaps = 3/227 (1%)
Frame = +1
Query: 607 DLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNLFGGKFSYCLVD 786
D + GR A+L+G+++GC++++DG SFQ +DGVL L S+ SF S+A FGG+FSYCLVD
Sbjct: 218 DSSGGRRAKLQGVVLGCAATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVD 277
Query: 787 HRSHKNVSSYLIFGSTTKPTATRTTPLDLNLIPPFYAINIIGISLGDDMLDIPSQVWDAT 966
H + +N +SYL FG A +T L + PFYA+ + + + + LDIP+ VWD
Sbjct: 278 HLAPRNATSYLTFGPGATAPAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDIPADVWDVD 337
Query: 967 NGGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRVKPEGVPIEYCFDVTSGFNESKL 1146
GG ILDSGTSLT+LA AY+ VV+ L ++L GL RV + P EYC++ T ++
Sbjct: 338 RNGGAILDSGTSLTILATPAYRAVVTALSKHLAGLPRVTMD--PFEYCYNWTDA-GALEI 394
Query: 1147 PQLMFHFDGGARFEPHRRSYLVDAAHGVKCLGFVSAGTPLLMWLGHM 1287
P++ HF G AR EP +SY++DAA GVKC+G P + +G++
Sbjct: 395 PKMEVHFAGSARLEPPAKSYVIDAAPGVKCIGVQEGSWPGVSVIGNI 441
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 194 bits (491), Expect = 4e-047
Identities = 99/214 (46%), Positives = 132/214 (61%), Gaps = 9/214 (4%)
Frame = +1
Query: 244 LPLRSGSDYGAAQYFADVKVGTPAKRFRVVVDTGSELTWVNCRFRGKGKGKEKEKKR--- 414
+PL SG+ G QYF +VGTPA+ F +V DTGS+LTWV CR
Sbjct: 84 MPLTSGAYTGIGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPG 143
Query: 415 --RVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSYDYRYADGSSAQGVFA 588
R FR E+S ++ + C + TC L FSL+ CPTP +PC+YDYRY DGS+A+G
Sbjct: 144 PGRAFRPEDSRTWAPISCASDTCTKSLP--FSLATCPTPGSPCAYDYRYKDGSAARGTVG 201
Query: 589 KETFTVDLT--NGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNLFGG 762
E+ T+ L+ R A+L+GL++GCSSS+ G SF+ +DGVL L S SF S A + FGG
Sbjct: 202 TESATIALSGREERKAKLKGLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFGG 261
Query: 763 KFSYCLVDHRSHKNVSSYLIFGSTTKPTATRTTP 864
+FSYCLVDH S +N +SYL FG ++ R +P
Sbjct: 262 RFSYCLVDHLSPRNATSYLTFGPNPAVSSPRASP 295
>gi|226491620|ref|NP_001149154.1| pepsin A [Zea mays]
Length = 537
Score = 159 bits (400), Expect = 2e-036
Identities = 95/250 (38%), Positives = 136/250 (54%), Gaps = 15/250 (6%)
Frame = +1
Query: 397 EKEKKRRVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPS--TPCSYDYRYADGSS 570
+KE + +R +SSS+R++ C + C + + C +PS CSY + DG+
Sbjct: 182 KKEASKNWYRPAKSSSWRRIRCSQKECA-----VLPYNTCQSPSKAESCSYFQKTQDGTV 236
Query: 571 AQGVFAKETFTVDLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATN 750
G++ KE TV +++GR+A+L GL++GCS G S DGVL L D SF A
Sbjct: 237 TIGIYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVHAAK 296
Query: 751 LFGGKFSYCLVDHRSHKNVSSYLIFGSTTKPTATRTTPLDLNL---IPPFYAINIIGISL 921
FG +FS+CL+ S ++ SSYL FG T D+ + P Y + G+ +
Sbjct: 297 RFGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAKVTGVLV 356
Query: 922 GDDMLDIPSQVWDATN--GGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRV-KPEG 1092
G + LDIP +VWDA GGG ILD+ TS+T L AY PV + L+R+L L RV + EG
Sbjct: 357 GGERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVYELEG 416
Query: 1093 VPIEYCFDVT 1122
EYC+ T
Sbjct: 417 --FEYCYKWT 424
>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
Length = 534
Score = 158 bits (399), Expect = 2e-036
Identities = 95/249 (38%), Positives = 135/249 (54%), Gaps = 15/249 (6%)
Frame = +1
Query: 400 KEKKRRVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPS--TPCSYDYRYADGSSA 573
KE + +R +SSS+R++ C + C + + C +PS CSY + DG+
Sbjct: 180 KEASKNWYRPAKSSSWRRIRCSQKECA-----VLPYNTCQSPSKAESCSYFQKTQDGTVT 234
Query: 574 QGVFAKETFTVDLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNL 753
G++ KE TV +++GR+A+L GL++GCS G S DGVL L D SF A
Sbjct: 235 IGIYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVHAAKR 294
Query: 754 FGGKFSYCLVDHRSHKNVSSYLIFGSTTKPTATRTTPLDLNL---IPPFYAINIIGISLG 924
FG +FS+CL+ S ++ SSYL FG T D+ + P Y + G+ +G
Sbjct: 295 FGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAQVTGVLVG 354
Query: 925 DDMLDIPSQVWDATN--GGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRV-KPEGV 1095
+ LDIP +VWDA GGG ILD+ TS+T L AY PV + L+R+L L RV + EG
Sbjct: 355 GERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVYELEG- 413
Query: 1096 PIEYCFDVT 1122
EYC+ T
Sbjct: 414 -FEYCYKWT 421
>gi|242068179|ref|XP_002449366.1| hypothetical protein SORBIDRAFT_05g008660
[Sorghum bicolor]
Length = 193
Score = 96 bits (236), Expect = 2e-017
Identities = 49/102 (48%), Positives = 71/102 (69%), Gaps = 4/102 (3%)
Frame = +1
Query: 607 DLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNLFGGKFSYCLVD 786
D + GR A+L+G+++GC+++++G SFQ +DGVL L S+ SF S+A FGG+FSYCLVD
Sbjct: 60 DSSGGRCAKLQGIVLGCTATYNGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVD 119
Query: 787 HRSHKNVSSYLIFG-STTKPTATRTTPLDLN-LIPPFYAINI 906
H + +N +SYL FG T P A TPL L+ + PF A+ +
Sbjct: 120 HLAPRNATSYLTFGPGATAPAA--QTPLFLDRRMSPFNAVTV 159
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,687,995,622,555
Number of Sequences: 15229318
Number of Extensions: 1687995622555
Number of Successful Extensions: 433925934
Number of sequences better than 0.0: 0
|