
GenBank BLAST output of UN14186


BLASTX 7.6.2

Query= UN14186 /QuerySize=1345
        (1344 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT...    649   4e-184
gi|30682289|ref|NP_187876.2| aspartyl protease family protein [A...    621   9e-176
gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]      617   2e-174
gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 ...    383   5e-104
gi|225462334|ref|XP_002265771.1| PREDICTED: hypothetical protein...    380   4e-103
gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vini...    376   5e-102
gi|259490398|ref|NP_001159203.1| hypothetical protein LOC1003042...    297   5e-078
gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precur...    294   4e-077
gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT...    235   1e-059
gi|238479750|ref|NP_001154610.1| aspartyl protease family protei...    235   2e-059
gi|212722026|ref|NP_001131674.1| hypothetical protein LOC1001930...    233   9e-059
gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT...    205   3e-050
gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Trit...    194   4e-047
gi|226491620|ref|NP_001149154.1| pepsin A [Zea mays]                   159   2e-036
gi|224029721|gb|ACN33936.1| unknown [Zea mays]                         158   2e-036
gi|242068179|ref|XP_002449366.1| hypothetical protein SORBIDRAFT...     96   2e-017

>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632
        [Arabidopsis lyrata subsp. lyrata]

          Length = 449

 Score =  649 bits (1673), Expect = 4e-184
 Identities = 327/427 (76%), Positives = 363/427 (85%), Gaps = 9/427 (2%)
 Frame = +1

Query:   25 KEMKRTRALLCLI-PILLTAAADSTEDTAVRLKISHRDTLFPTSSHRIEDIISEDQKRHS 201
            + +K + + LCLI  +LL  AADSTEDTAVRLK++HRDTL+P    RIEDII  DQKRHS
Sbjct:    2 QRIKTSLSCLCLITTLLLLTAADSTEDTAVRLKLAHRDTLWPNPLSRIEDIIGADQKRHS 61

Query:  202 LITRKRKTNGGGAKLPLRSGSDYGAAQYFADVKVGTPAKRFRVVVDTGSELTWVNCRFRG 381
            LI+RKRK   GG K+ L SG DYG AQYF +V+VGTPAK+FRVVVDTGSELTWVNCR+RG
Sbjct:   62 LISRKRKFK-GGVKMDLGSGIDYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWVNCRYRG 120

Query:  382 KGKGKEKEKKRRVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSYDYRYAD 561
            +GKG  K K RRVFRAEES SF+ VGC TQTCK DLMNLFSLS CPTPSTPCSYDYRYAD
Sbjct:  121 RGKG--KVKNRRVFRAEESKSFKTVGCFTQTCKVDLMNLFSLSTCPTPSTPCSYDYRYAD 178

Query:  562 GSSAQGVFAKETFTVDLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSK 741
            GS+AQGVFAKET TV LTNGR ARLRGLL+GCSSSF G SFQGADGVLGLA SD+SFTS 
Sbjct:  179 GSAAQGVFAKETITVGLTNGRKARLRGLLVGCSSSFSGQSFQGADGVLGLAFSDFSFTST 238

Query:  742 ATNLFGGKFSYCLVDHRSHKNVSSYLIFGSTTKPTAT-----RTTPLDLNLIPPFYAINI 906
            AT+LFG K SYCLVDH S+KN+S+YLIFG ++  T+T     RTTPLDL LIPPFYAINI
Sbjct:  239 ATSLFGAKLSYCLVDHLSNKNISNYLIFGYSSSSTSTKTAPGRTTPLDLTLIPPFYAINI 298

Query:  907 IGISLGDDMLDIPSQVWDATNGGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRVKP 1086
            IGIS+GDDMLDIP+QVWDAT GGGTILDSGTSLTLLA+AAYKPVV+GL RYLV LKRVKP
Sbjct:  299 IGISIGDDMLDIPTQVWDATTGGGTILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKP 358

Query: 1087 EGVPIEYCFDVTSGFNESKLPQLMFHFDGGARFEPHRRSYLVDAAHGVKCLGFVSAGTPL 1266
            EG+PIEYCF  TSGFNESKLPQL FH  GGARFEPHR+SYLVDAA GVKCLGF+SAGTP 
Sbjct:  359 EGIPIEYCFSSTSGFNESKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFMSAGTPA 418

Query: 1267 LMWLGHM 1287
               +G++
Sbjct:  419 TNVVGNI 425

>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis
        thaliana]

          Length = 461

 Score =  621 bits (1601), Expect = 9e-176
 Identities = 319/430 (74%), Positives = 354/430 (82%), Gaps = 10/430 (2%)
 Frame = +1

Query:   10 GKGKKKEMKRTRALL-CLI-PILLTAAADSTEDTAVRLKISHRDTLFPTSSHRIEDIISE 183
            G  K +E K  + LL CLI  +LL   ADS +DT+VRLK++HRDTL P    RIED+I  
Sbjct:   14 GDKKNQEEKMQKTLLSCLITTLLLITVADSMKDTSVRLKLAHRDTLLPKPLSRIEDVIGA 73

Query:  184 DQKRHSLITRKRKTNGGGAKLPLRSGSDYGAAQYFADVKVGTPAKRFRVVVDTGSELTWV 363
            DQKRHSLI+RKR +   G K+ L SG DYG AQYF +++VGTPAK+FRVVVDTGSELTWV
Sbjct:   74 DQKRHSLISRKRNST-VGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWV 132

Query:  364 NCRFRGKGKGKEKEKKRRVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSY 543
            NCR+R +GK       RRVFRA+ES SF+ VGCLTQTCK DLMNLFSL+ CPTPSTPCSY
Sbjct:  133 NCRYRARGK-----DNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSY 187

Query:  544 DYRYADGSSAQGVFAKETFTVDLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSD 723
            DYRYADGS+AQGVFAKET TV LTNGR+ARL G LIGCSSSF G SFQGADGVLGLA SD
Sbjct:  188 DYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSD 247

Query:  724 YSFTSKATNLFGGKFSYCLVDHRSHKNVSSYLIFGS--TTKPTATRTTPLDLNLIPPFYA 897
            +SFTS AT+L+G KFSYCLVDH S+KNVS+YLIFGS  +TK    RTTPLDL  IPPFYA
Sbjct:  248 FSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYA 307

Query:  898 INIIGISLGDDMLDIPSQVWDATNGGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKR 1077
            IN+IGISLG DMLDIPSQVWDAT+GGGTILDSGTSLTLLADAAYK VV+GL RYLV LKR
Sbjct:  308 INVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKR 367

Query: 1078 VKPEGVPIEYCFDVTSGFNESKLPQLMFHFDGGARFEPHRRSYLVDAAHGVKCLGFVSAG 1257
            VKPEGVPIEYCF  TSGFN SKLPQL FH  GGARFEPHR+SYLVDAA GVKCLGFVSAG
Sbjct:  368 VKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAG 427

Query: 1258 TPLLMWLGHM 1287
            TP    +G++
Sbjct:  428 TPATNVIGNI 437

>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]

          Length = 439

 Score =  617 bits (1590), Expect = 2e-174
 Identities = 314/416 (75%), Positives = 347/416 (83%), Gaps = 9/416 (2%)
 Frame = +1

Query:   49 LLCLI-PILLTAAADSTEDTAVRLKISHRDTLFPTSSHRIEDIISEDQKRHSLITRKRKT 225
            L CLI  +LL   ADS +DT+VRLK++HRDTL P    RIED+I  DQKRHSLI+RKR +
Sbjct:    6 LSCLITTLLLITVADSMKDTSVRLKLAHRDTLLPKPLSRIEDVIGADQKRHSLISRKRNS 65

Query:  226 NGGGAKLPLRSGSDYGAAQYFADVKVGTPAKRFRVVVDTGSELTWVNCRFRGKGKGKEKE 405
               G K+ L SG DYG AQYF +++VGTPAK+FRVVVDTGSELTWVNCR+R +GK     
Sbjct:   66 T-VGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGK----- 119

Query:  406 KKRRVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSYDYRYADGSSAQGVF 585
              RRVFRA+ES SF+ VGCLTQTCK DLMNLFSL+ CPTPSTPCSYDYRYADGS+AQGVF
Sbjct:  120 DNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVF 179

Query:  586 AKETFTVDLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNLFGGK 765
            AKET TV LTNGR+ARL G LIGCSSSF G SFQGADGVLGLA SD+SFTS AT+L+G K
Sbjct:  180 AKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAK 239

Query:  766 FSYCLVDHRSHKNVSSYLIFGS--TTKPTATRTTPLDLNLIPPFYAINIIGISLGDDMLD 939
            FSYCLVDH S+KNVS+YLIFGS  +TK    RTTPLDL  IPPFYAIN+IGISLG DMLD
Sbjct:  240 FSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLD 299

Query:  940 IPSQVWDATNGGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRVKPEGVPIEYCFDV 1119
            IPSQVWDAT+GGGTILDSGTSLTLLADAAYK VV+GL RYLV LKRVKPEGVPIEYCF  
Sbjct:  300 IPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSF 359

Query: 1120 TSGFNESKLPQLMFHFDGGARFEPHRRSYLVDAAHGVKCLGFVSAGTPLLMWLGHM 1287
            TSGFN SKLPQL FH  GGARFEPHR+SYLVDAA GVKCLGFVSAGTP    +G++
Sbjct:  360 TSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNI 415

>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis
        vinifera]

          Length = 449

 Score =  383 bits (982), Expect = 5e-104
 Identities = 189/350 (54%), Positives = 243/350 (69%), Gaps = 10/350 (2%)
 Frame = +1

Query:  241 KLPLRSGSDYGAAQYFADVKVGTPAKRFRVVVDTGSELTWVNCRFRGKGKGKEKEKKR-- 414
            ++P+   +DYG  QYF   KVGTP+++F +V DTGS+LTW++C++  + +     K R  
Sbjct:   69 EVPMHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRI 128

Query:  415 ---RVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSYDYRYADGSSAQGVF 585
               RVF A  SSSF+ + CLT  CK +LM+LFSL+NCPTP TPC YDYRY+DGS+A G F
Sbjct:  129 RHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFF 188

Query:  586 AKETFTVDLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNLFGGK 765
            A ET TV+L  GR  +L  +LIGCS SF G SFQ ADGV+GL  S YSF  KA   FGGK
Sbjct:  189 ANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGK 248

Query:  766 FSYCLVDHRSHKNVSSYLIFGSTTKPTATRT----TPLDLNLIPPFYAINIIGISLGDDM 933
            FSYCLVDH SHKNVS+YL FGS+    A       T L L ++  FYA+N++GIS+G  M
Sbjct:  249 FSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAM 308

Query:  934 LDIPSQVWDATNGGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRVKPEGVPIEYCF 1113
            L IPS+VWD    GGTILDSG+SLT L + AY+PV++ L   L+  ++V+ +  P+EYCF
Sbjct:  309 LKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCF 368

Query: 1114 DVTSGFNESKLPQLMFHFDGGARFEPHRRSYLVDAAHGVKCLGFVSAGTP 1263
            + T GF ES +P+L+FHF  GA FEP  +SY++ AA GV+CLGFVS   P
Sbjct:  369 NST-GFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWP 417

>gi|225462334|ref|XP_002265771.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 486

 Score =  380 bits (974), Expect = 4e-103
 Identities = 188/350 (53%), Positives = 242/350 (69%), Gaps = 10/350 (2%)
 Frame = +1

Query:  241 KLPLRSGSDYGAAQYFADVKVGTPAKRFRVVVDTGSELTWVNCRFRGKGKGKEKEKKR-- 414
            ++P+   +DYG  QY    KVGTP+++F +V DTGS+LTW++C++  + +     K R  
Sbjct:  106 EVPMHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRI 165

Query:  415 ---RVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSYDYRYADGSSAQGVF 585
               RVF A  SSSF+ + CLT  CK +LM+LFSL+NCPTP TPC YDYRY+DGS+A G F
Sbjct:  166 RHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFF 225

Query:  586 AKETFTVDLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNLFGGK 765
            A ET TV+L  GR  +L  +LIGCS SF G SFQ ADGV+GL  S YSF  KA   FGGK
Sbjct:  226 ANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGK 285

Query:  766 FSYCLVDHRSHKNVSSYLIFGSTTKPTATRT----TPLDLNLIPPFYAINIIGISLGDDM 933
            FSYCLVDH SHKNVS+YL FGS+    A       T L L ++  FYA+N++GIS+G  M
Sbjct:  286 FSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAM 345

Query:  934 LDIPSQVWDATNGGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRVKPEGVPIEYCF 1113
            L IPS+VWD    GGTILDSG+SLT L + AY+PV++ L   L+  ++V+ +  P+EYCF
Sbjct:  346 LKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCF 405

Query: 1114 DVTSGFNESKLPQLMFHFDGGARFEPHRRSYLVDAAHGVKCLGFVSAGTP 1263
            + T GF ES +P+L+FHF  GA FEP  +SY++ AA GV+CLGFVS   P
Sbjct:  406 NST-GFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWP 454

>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]

          Length = 378

 Score =  376 bits (965), Expect = 5e-102
 Identities = 187/347 (53%), Positives = 239/347 (68%), Gaps = 10/347 (2%)
 Frame = +1

Query:  250 LRSGSDYGAAQYFADVKVGTPAKRFRVVVDTGSELTWVNCRFRGKGKGKEKEKKR----- 414
            +   +DYG  QY    KVGTP+++F +V DTGS+LTW++C++  + +     K R     
Sbjct:    1 MHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHK 60

Query:  415 RVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSYDYRYADGSSAQGVFAKE 594
            RVF A  SSSF+ + CLT  CK +LM+LFSL+NCPTP TPC YDYRY+DGS+A G FA E
Sbjct:   61 RVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANE 120

Query:  595 TFTVDLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNLFGGKFSY 774
            T TV+L  GR  +L  +LIGCS SF G SFQ ADGV+GL  S YSF  KA   FGGKFSY
Sbjct:  121 TVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSY 180

Query:  775 CLVDHRSHKNVSSYLIFGSTTKPTATRT----TPLDLNLIPPFYAINIIGISLGDDMLDI 942
            CLVDH SHKNVS+YL FGS+    A       T L L ++  FYA+N++GIS+G  ML I
Sbjct:  181 CLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKI 240

Query:  943 PSQVWDATNGGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRVKPEGVPIEYCFDVT 1122
            PS+VWD    GGTILDSG+SLT L + AY+PV++ L   L+  ++V+ +  P+EYCF+ T
Sbjct:  241 PSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNST 300

Query: 1123 SGFNESKLPQLMFHFDGGARFEPHRRSYLVDAAHGVKCLGFVSAGTP 1263
             GF ES +P+L+FHF  GA FEP  +SY++ AA GV+CLGFVS   P
Sbjct:  301 -GFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWP 346

>gi|259490398|ref|NP_001159203.1| hypothetical protein LOC100304289 [Zea mays]

          Length = 378

 Score =  297 bits (758), Expect = 5e-078
 Identities = 162/362 (44%), Positives = 224/362 (61%), Gaps = 23/362 (6%)
 Frame = +1

Query:  244 LPLRSGSDYGAAQYFADVKVGTPAKRFRVVVDTGSELTWVNCRFRGKGKGKEKEKKRRVF 423
            +PL SG+  G  QYF   +VGTPA+ F +V DTGS+LTWV C  RG       +   R F
Sbjct:    1 MPLSSGAYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKC--RGAAGPPASDPPAREF 58

Query:  424 RAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSYDYRYADGSSAQGVFAKETFT 603
            RA ES S+  + C + TC + +   FSL+NC +P++PC+YDYRY DGS+A+GV   +  T
Sbjct:   59 RASESRSWAPLACSSDTCTSYVP--FSLANCSSPASPCAYDYRYKDGSAARGVVGTDAAT 116

Query:  604 VDLT----------NGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNL 753
            + L+           GR A+L+G+++GC++++DG SFQ +DGVL L  S+ SF S+A   
Sbjct:  117 IALSGSGSEDGSGGGGRRAKLQGVVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAAR 176

Query:  754 FGGKFSYCLVDHRSHKNVSSYLIFGSTTK----PTATRTTPLDLNLIPPFYAINIIGISL 921
            FGG+FSYCLVDH + +N SSYL FG   +    P A     LD   + PFYA+ +  + +
Sbjct:  177 FGGRFSYCLVDHLAPRNASSYLTFGPGPEGGGAPAARTPLVLD-RRVSPFYAVAVDAVYV 235

Query:  922 GDDMLDIPSQVWDATNGGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRVKPEGVPI 1101
              + LDIP+ VWD   GGG ILDSGTSLT+LA  AY+ VV+ L   L  L RV  +  P 
Sbjct:  236 AGEALDIPADVWDVGRGGGAILDSGTSLTVLATPAYRAVVAALGGRLAALPRVAMD--PF 293

Query: 1102 EYCFDVTSGFNESKLPQLMFHFDGGARFEPHRRSYLVDAAHGVKCLGFVSAGTPLLMWLG 1281
            EYC++ T+G  E  +P+L   F G AR EP  +SY++DAA GVKC+G      P +  +G
Sbjct:  294 EYCYNWTAGAPE--IPKLEVSFAGSARLEPPAKSYVIDAAPGVKCIGVQEGAWPGVSVIG 351

Query: 1282 HM 1287
            ++
Sbjct:  352 NI 353

>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative
        [Ricinus communis]

          Length = 489

 Score =  294 bits (750), Expect = 4e-077
 Identities = 156/355 (43%), Positives = 218/355 (61%), Gaps = 7/355 (1%)
 Frame = +1

Query:  238 AKLPLRSGSDYGAAQYFADVKVGTP-AKRFRVVVDTGSELTWVNCRFRGKGKGKEKEKKR 414
            A++P+ SG+D G +QYF  +++GTP  ++F +V DTGS+LTW+NC +  K   K      
Sbjct:  104 AQIPIHSGADSGQSQYFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPG 163

Query:  415 RVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSYDYRYADGSSAQGVFAKE 594
            RVFRA +SSSFR + C +  CK +L + FSL+ CP P+ PC +DYRY +G  A GVFA E
Sbjct:  164 RVFRANDSSSFRTIPCSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANE 223

Query:  595 TFTVDLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNLFGGKFSY 774
            T TV L + +  RL  +LIGC+ SF+ ++    DGV+GL    +S   +   +FG KFSY
Sbjct:  224 TVTVGLNDHKKIRLFDVLIGCTESFN-ETNGFPDGVMGLGYRKHSLALRLAEIFGNKFSY 282

Query:  775 CLVDHRSHKNVSSYLIFGS--TTKPTATRTTPLDLNLIPPFYAINIIGISLGDDMLDIPS 948
            CLVDH S  N  ++L FG     K    + T L L  I  FY +N+ GIS+G  ML I S
Sbjct:  283 CLVDHLSSSNHKNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSMLSISS 342

Query:  949 QVWDATNGGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRVKPEGVP--IEYCFDVT 1122
             +W+ T  GG I+DSGTSLT+LA  AY  VV  L+      K+V P  +P    +CF+  
Sbjct:  343 DIWNVTGVGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPELNNFCFE-D 401

Query: 1123 SGFNESKLPQLMFHFDGGARFEPHRRSYLVDAAHGVKCLGFVSAGTPLLMWLGHM 1287
             GF+ + +P+L+ HF  GA F+P  +SY++D A G+KCLG + A  P    LG++
Sbjct:  402 KGFDRAAVPRLLIHFADGAIFKPPVKSYIIDVAEGIKCLGIIKADFPGSSILGNV 456

>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050
        [Sorghum bicolor]

          Length = 466

 Score =  235 bits (599), Expect = 1e-059
 Identities = 130/296 (43%), Positives = 186/296 (62%), Gaps = 11/296 (3%)
 Frame = +1

Query:  415 RVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSYDYRYADGSS-AQGVFAK 591
            RVFR + S S+  + C + TCK D+   F+L+NC +P++PC+YDYRY +GS+ A+G+   
Sbjct:  152 RVFRPKTSRSWAPIPCSSDTCKLDVP--FTLANCSSPASPCTYDYRYKEGSAGARGIVGT 209

Query:  592 ETFTVDLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNLFGGKFS 771
            E+ T+ L  G+VA+L+ +++GCSSS DG SF+ ADGVL L  +  SF ++A   FGG FS
Sbjct:  210 ESATIALPGGKVAQLKDVVLGCSSSHDGQSFRSADGVLSLGNAKISFATQAAARFGGSFS 269

Query:  772 YCLVDHRSHKNVSSYLIFGSTTKP-TATRTTPLDLNLIPPFYAINIIGISLGDDMLDIPS 948
            YCLVDH + +N + YL FG    P T    T L L+   PFY + +  I +    LDIP+
Sbjct:  270 YCLVDHLAPRNATGYLAFGPGQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPA 329

Query:  949 QVWDATNGGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRVKPEGVPIEYCFDVTS- 1125
            +VWDA   GG ILDSG +LT+LA  AYK VV+ L ++L G+ +V     P E+C++ T+ 
Sbjct:  330 EVWDA-KSGGVILDSGNTLTVLAAPAYKAVVAALSKHLDGVPKV--SFPPFEHCYNWTAR 386

Query: 1126 --GFNESKLPQLMFHFDGGARFEPHRRSYLVDAAHGVKCLGFVSAGTPLLMWLGHM 1287
              G  E  +P+L   F G AR EP  +SY++D   GVKC+G      P L  +G++
Sbjct:  387 RPGAPEI-IPKLAVQFAGSARLEPPAKSYVIDVKPGVKCIGVQEGEWPGLSVIGNI 441

>gi|238479750|ref|NP_001154610.1| aspartyl protease family protein [Arabidopsis
        thaliana]

          Length = 263

 Score =  235 bits (597), Expect = 2e-059
 Identities = 120/183 (65%), Positives = 140/183 (76%), Gaps = 8/183 (4%)
 Frame = +1

Query:  10 GKGKKKEMKRTRALL-CLI-PILLTAAADSTEDTAVRLKISHRDTLFPTSSHRIEDIISE 183
           G  K +E K  + LL CLI  +LL   ADS +DT+VRLK++HRDTL P    RIED+I  
Sbjct:  14 GDKKNQEEKMQKTLLSCLITTLLLITVADSMKDTSVRLKLAHRDTLLPKPLSRIEDVIGA 73

Query: 184 DQKRHSLITRKRKTNGGGAKLPLRSGSDYGAAQYFADVKVGTPAKRFRVVVDTGSELTWV 363
           DQKRHSLI+RKR +   G K+ L SG DYG AQYF +++VGTPAK+FRVVVDTGSELTWV
Sbjct:  74 DQKRHSLISRKRNST-VGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWV 132

Query: 364 NCRFRGKGKGKEKEKKRRVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSY 543
           NCR+R +GK       RRVFRA+ES SF+ VGCLTQTCK DLMNLFSL+ CPTPSTPCSY
Sbjct: 133 NCRYRARGK-----DNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSY 187

Query: 544 DYR 552
           DYR
Sbjct: 188 DYR 190

>gi|212722026|ref|NP_001131674.1| hypothetical protein LOC100193034 [Zea mays]

          Length = 441

 Score =  233 bits (592), Expect = 9e-059
 Identities = 127/293 (43%), Positives = 181/293 (61%), Gaps = 8/293 (2%)
 Frame = +1

Query:  418 VFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSYDYRYADGSS-AQGVFAKE 594
            VFR E S S+  V C + TCK D+   FSL+NC + ++PCSYDYRY +GS+ A GV   +
Sbjct:  129 VFRPEASKSWAPVPCSSDTCKLDVP--FSLANCSSSASPCSYDYRYKEGSAGALGVVGTD 186

Query:  595 TFTVDLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNLFGGKFSY 774
            + T+ L  G+VA+L+ +++GCSS+ DG SF+  DGVL L  +  SF S+A   FGG FSY
Sbjct:  187 SATIALPGGKVAQLQDVVLGCSSTHDGQSFKSVDGVLSLGNAKISFASRAAARFGGSFSY 246

Query:  775 CLVDHRSHKNVSSYLIFGSTTKP-TATRTTPLDLNLIPPFYAINIIGISLGDDMLDIPSQ 951
            CLVDH + +N + YL FG    P T    T L L+   PFY + +  + +    LDIP++
Sbjct:  247 CLVDHLAPRNATGYLAFGPGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAE 306

Query:  952 VWDATNGGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRVKPEGVPIEYCFDVTSGF 1131
            VWD    GG ILDSGT+LT+LA  AYK VV+ L + L G+ +V  +  P E+C++ T+  
Sbjct:  307 VWD-PKSGGVILDSGTTLTVLATPAYKAVVAALTKLLAGVPKV--DFPPFEHCYNWTAPR 363

Query: 1132 -NESKLPQLMFHFDGGARFEPHRRSYLVDAAHGVKCLGFVSAGTPLLMWLGHM 1287
                ++P+L   F G AR EP  +SY++D   GVKC+G      P +  +G++
Sbjct:  364 PGAPEIPKLAVQFTGCARLEPPAKSYVIDVKPGVKCIGLQEGEWPGVSVIGNI 416

>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060
        [Sorghum bicolor]

          Length = 466

 Score =  205 bits (519), Expect = 3e-050
 Identities = 102/227 (44%), Positives = 147/227 (64%), Gaps = 3/227 (1%)
 Frame = +1

Query:  607 DLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNLFGGKFSYCLVD 786
            D + GR A+L+G+++GC++++DG SFQ +DGVL L  S+ SF S+A   FGG+FSYCLVD
Sbjct:  218 DSSGGRRAKLQGVVLGCAATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVD 277

Query:  787 HRSHKNVSSYLIFGSTTKPTATRTTPLDLNLIPPFYAINIIGISLGDDMLDIPSQVWDAT 966
            H + +N +SYL FG      A +T  L    + PFYA+ +  + +  + LDIP+ VWD  
Sbjct:  278 HLAPRNATSYLTFGPGATAPAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDIPADVWDVD 337

Query:  967 NGGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRVKPEGVPIEYCFDVTSGFNESKL 1146
              GG ILDSGTSLT+LA  AY+ VV+ L ++L GL RV  +  P EYC++ T      ++
Sbjct:  338 RNGGAILDSGTSLTILATPAYRAVVTALSKHLAGLPRVTMD--PFEYCYNWTDA-GALEI 394

Query: 1147 PQLMFHFDGGARFEPHRRSYLVDAAHGVKCLGFVSAGTPLLMWLGHM 1287
            P++  HF G AR EP  +SY++DAA GVKC+G      P +  +G++
Sbjct:  395 PKMEVHFAGSARLEPPAKSYVIDAAPGVKCIGVQEGSWPGVSVIGNI 441

>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]

          Length = 477

 Score =  194 bits (491), Expect = 4e-047
 Identities = 99/214 (46%), Positives = 132/214 (61%), Gaps = 9/214 (4%)
 Frame = +1

Query: 244 LPLRSGSDYGAAQYFADVKVGTPAKRFRVVVDTGSELTWVNCRFRGKGKGKEKEKKR--- 414
           +PL SG+  G  QYF   +VGTPA+ F +V DTGS+LTWV CR                 
Sbjct:  84 MPLTSGAYTGIGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPG 143

Query: 415 --RVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPSTPCSYDYRYADGSSAQGVFA 588
             R FR E+S ++  + C + TC   L   FSL+ CPTP +PC+YDYRY DGS+A+G   
Sbjct: 144 PGRAFRPEDSRTWAPISCASDTCTKSLP--FSLATCPTPGSPCAYDYRYKDGSAARGTVG 201

Query: 589 KETFTVDLT--NGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNLFGG 762
            E+ T+ L+    R A+L+GL++GCSSS+ G SF+ +DGVL L  S  SF S A + FGG
Sbjct: 202 TESATIALSGREERKAKLKGLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFGG 261

Query: 763 KFSYCLVDHRSHKNVSSYLIFGSTTKPTATRTTP 864
           +FSYCLVDH S +N +SYL FG     ++ R +P
Sbjct: 262 RFSYCLVDHLSPRNATSYLTFGPNPAVSSPRASP 295

>gi|226491620|ref|NP_001149154.1| pepsin A [Zea mays]

          Length = 537

 Score =  159 bits (400), Expect = 2e-036
 Identities = 95/250 (38%), Positives = 136/250 (54%), Gaps = 15/250 (6%)
 Frame = +1

Query:  397 EKEKKRRVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPS--TPCSYDYRYADGSS 570
            +KE  +  +R  +SSS+R++ C  + C      +   + C +PS    CSY  +  DG+ 
Sbjct:  182 KKEASKNWYRPAKSSSWRRIRCSQKECA-----VLPYNTCQSPSKAESCSYFQKTQDGTV 236

Query:  571 AQGVFAKETFTVDLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATN 750
              G++ KE  TV +++GR+A+L GL++GCS    G S    DGVL L   D SF   A  
Sbjct:  237 TIGIYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVHAAK 296

Query:  751 LFGGKFSYCLVDHRSHKNVSSYLIFGSTTKPTATRTTPLDLNL---IPPFYAINIIGISL 921
             FG +FS+CL+   S ++ SSYL FG         T   D+     + P Y   + G+ +
Sbjct:  297 RFGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAKVTGVLV 356

Query:  922 GDDMLDIPSQVWDATN--GGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRV-KPEG 1092
            G + LDIP +VWDA    GGG ILD+ TS+T L   AY PV + L+R+L  L RV + EG
Sbjct:  357 GGERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVYELEG 416

Query: 1093 VPIEYCFDVT 1122
               EYC+  T
Sbjct:  417 --FEYCYKWT 424

>gi|224029721|gb|ACN33936.1| unknown [Zea mays]

          Length = 534

 Score =  158 bits (399), Expect = 2e-036
 Identities = 95/249 (38%), Positives = 135/249 (54%), Gaps = 15/249 (6%)
 Frame = +1

Query:  400 KEKKRRVFRAEESSSFRQVGCLTQTCKADLMNLFSLSNCPTPS--TPCSYDYRYADGSSA 573
            KE  +  +R  +SSS+R++ C  + C      +   + C +PS    CSY  +  DG+  
Sbjct:  180 KEASKNWYRPAKSSSWRRIRCSQKECA-----VLPYNTCQSPSKAESCSYFQKTQDGTVT 234

Query:  574 QGVFAKETFTVDLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNL 753
             G++ KE  TV +++GR+A+L GL++GCS    G S    DGVL L   D SF   A   
Sbjct:  235 IGIYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVHAAKR 294

Query:  754 FGGKFSYCLVDHRSHKNVSSYLIFGSTTKPTATRTTPLDLNL---IPPFYAINIIGISLG 924
            FG +FS+CL+   S ++ SSYL FG         T   D+     + P Y   + G+ +G
Sbjct:  295 FGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAQVTGVLVG 354

Query:  925 DDMLDIPSQVWDATN--GGGTILDSGTSLTLLADAAYKPVVSGLERYLVGLKRV-KPEGV 1095
             + LDIP +VWDA    GGG ILD+ TS+T L   AY PV + L+R+L  L RV + EG 
Sbjct:  355 GERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVYELEG- 413

Query: 1096 PIEYCFDVT 1122
              EYC+  T
Sbjct:  414 -FEYCYKWT 421

>gi|242068179|ref|XP_002449366.1| hypothetical protein SORBIDRAFT_05g008660
        [Sorghum bicolor]

          Length = 193

 Score =  96 bits (236), Expect = 2e-017
 Identities = 49/102 (48%), Positives = 71/102 (69%), Gaps = 4/102 (3%)
 Frame = +1

Query: 607 DLTNGRVARLRGLLIGCSSSFDGDSFQGADGVLGLALSDYSFTSKATNLFGGKFSYCLVD 786
           D + GR A+L+G+++GC+++++G SFQ +DGVL L  S+ SF S+A   FGG+FSYCLVD
Sbjct:  60 DSSGGRCAKLQGIVLGCTATYNGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVD 119

Query: 787 HRSHKNVSSYLIFG-STTKPTATRTTPLDLN-LIPPFYAINI 906
           H + +N +SYL FG   T P A   TPL L+  + PF A+ +
Sbjct: 120 HLAPRNATSYLTFGPGATAPAA--QTPLFLDRRMSPFNAVTV 159

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,687,995,622,555
Number of Sequences: 15229318
Number of Extensions: 1687995622555
Number of Successful Extensions: 433925934
Number of sequences better than 0.0: 0
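
The bit scores reported for each alignment can be related to the raw scores shown in parentheses through the gapped Lambda and K values in the footer above. The sketch below applies the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, to a few (raw, bits) pairs copied from this report. The function name `bit_score` and the list `reported` are illustrative only and are not part of the BLAST output; because Lambda and K are printed to three decimals, agreement is expected only after rounding.

```python
import math

# Gapped Karlin-Altschul parameters copied from the report footer.
LAMBDA = 0.267
K = 0.041


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# (raw score, reported bit score) pairs taken from the alignments above.
reported = [(1673, 649), (1601, 621), (982, 383), (236, 96)]

for raw, bits in reported:
    # Print the recomputed value next to the one printed in the report.
    print(f"raw={raw:5d}  reported={bits:4d}  recomputed={bit_score(raw):.1f}")
```

For these four hits the recomputed values round to the bit scores printed in the report.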