GenBank BLAST output for UN20227


BLASTX 7.6.2

Query= UN20227 /QuerySize=1338
        (1337 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|18407678|ref|NP_566867.1| putative cysteine proteinase [Arabi...    500   4e-139
gi|21593501|gb|AAM65468.1| cysteine proteinase [Arabidopsis thal...    497   2e-138
gi|297818854|ref|XP_002877310.1| hypothetical protein ARALYDRAFT...    490   2e-136
gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabi...    471   1e-130
gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A p...    469   5e-130
gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT...    453   5e-125
gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease f...    451   2e-124
gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cyste...    423   6e-116
gi|42572491|ref|NP_974341.1| putative cysteine proteinase [Arabi...    374   3e-101
gi|595986|gb|AAA79915.1| cysteine proteinase [Dianthus caryophyl...    347   4e-093
gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis ...    344   3e-092
gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon gran...    343   4e-092
gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]    341   3e-091
gi|226495425|ref|NP_001148706.1| cysteine protease 1 [Zea mays]        340   4e-091
gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidop...    340   5e-091
gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidops...    340   5e-091
gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]          333   5e-089
gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT...    317   3e-084
gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sini...    283   5e-074
gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]          280   5e-073
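
The summary table above has a regular shape: a sequence identifier, a (possibly truncated) description, a bit score, and an E-value. A minimal sketch of pulling those fields out of one summary line follows; the function name `parse_hit_line` and the regular expression are illustrative assumptions, not part of any BLAST tooling, and the pattern is only fitted to the `gi|...` lines shown in this report.

```python
import re

# Assumed layout, based only on the summary lines in this report:
#   <gi|...identifier> <description> <bit score> <E-value>
HIT_LINE = re.compile(
    r"^(gi\|\S+)\s+(.*?)\s+(\d+)\s+([0-9.]+(?:e[-+]?\d+)?)\s*$"
)

def parse_hit_line(line):
    """Return the fields of one summary line, or None if it does not match."""
    m = HIT_LINE.match(line)
    if m is None:
        return None
    seq_id, description, bits, evalue = m.groups()
    return {
        "id": seq_id,
        "description": description,  # may end in "..." where the report truncated it
        "bits": int(bits),
        "evalue": float(evalue),     # float() accepts exponents such as "e-091"
    }

if __name__ == "__main__":
    example = ("gi|226495425|ref|NP_001148706.1| cysteine protease 1 "
               "[Zea mays]        340   4e-091")
    print(parse_hit_line(example))
```

Lines that are not hit rows (the column header, blank lines) simply return `None`, so the function can be mapped over the whole table.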

>gi|18407678|ref|NP_566867.1| putative cysteine proteinase [Arabidopsis
        thaliana]

          Length = 376

 Score =  500 bits (1285), Expect = 4e-139
 Identities = 255/376 (67%), Positives = 294/376 (78%), Gaps = 5/376 (1%)
 Frame = +2

Query:   65 IRFLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRKNSNALGEKERRF 244
            I F T+AL+ LS LL+S SLG VTATE++ NE EV  MYEQWLVEN KN N LGEKERRF
Sbjct:    3 ISFRTLALLTLSVLLISISLGVVTATESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRF 62

Query:  245 NIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRYLYK 424
             IFKDNLK IE HNS P+R+YE GL +F+DLT DEF+A +L GKM + S   + +RY YK
Sbjct:   63 KIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQASYLGGKMEKKSLSDVAERYQYK 122

Query:  425 EGDVLPEEVDWREKGAVVP-VKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQELLD 601
            EGDVLP+EVDWRE+GAVVP VK QG+CG CWAFAA GAVEG+N+I TGELVSLSEQEL+D
Sbjct:  123 EGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELID 182

Query:  602 CDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVTID 781
            CDRG D  NFGC GG A  AFEFI +N GIV+D+VY YT  DTAACKAIEM TTR VTI+
Sbjct:  183 CDRGND--NFGCAGGGAVWAFEFIKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTIN 240

Query:  782 SYEDAPHNDEMSLKKAVAHQPISVMIEAENMKLYKSGVFTGPCDHWYGNHNVVVVGYGTT 961
             +E  P NDEMSLKKAVA+QPISVMI A NM  YKSGV+ G C + +G+HNV++VGYGT+
Sbjct:  241 GHEVVPVNDEMSLKKAVAYQPISVMISAANMSDYKSGVYKGACSNLWGDHNVLIVGYGTS 300

Query:  962 ERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKSNSAFGLLSLSVC 1141
                DYW+IRNSWG  WGE GY++LQRNFH  TG C VA+ PVYP+KSNS+  LLS SV 
Sbjct:  301 SDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPTGKCAVAVAPVYPIKSNSSSHLLSPSVF 360

Query: 1142 KLGVLFV--LIGWVLL 1183
            KL VLFV  LI   LL
Sbjct:  361 KLVVLFVFQLISLALL 376
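
Each alignment record carries a statistics line of the form `Identities = a/b (p%), Positives = ..., Gaps = ...`. The sketch below, with the hypothetical helper name `parse_alignment_stats`, extracts the counts and recomputes the percentages. For the record above, the printed percentages agree with integer truncation of `100 * a / b` (255/376 is about 67.8%, reported as 67%); whether the program always truncates is an assumption, not something this report states.

```python
import re

# Matches fragments such as "Identities = 255/376 (67%)".
STAT = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+)%\)")

def parse_alignment_stats(line):
    """Map each statistic name to (count, aligned_length, reported_percent)."""
    return {
        name: (int(num), int(den), int(pct))
        for name, num, den, pct in STAT.findall(line)
    }

if __name__ == "__main__":
    line = " Identities = 255/376 (67%), Positives = 294/376 (78%), Gaps = 5/376 (1%)"
    for name, (num, den, pct) in parse_alignment_stats(line).items():
        # Integer floor division mirrors truncation of the percentage.
        print(name, num, den, pct, 100 * num // den)
```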

>gi|21593501|gb|AAM65468.1| cysteine proteinase [Arabidopsis thaliana]

          Length = 376

 Score =  497 bits (1278), Expect = 2e-138
 Identities = 254/376 (67%), Positives = 293/376 (77%), Gaps = 5/376 (1%)
 Frame = +2

Query:   65 IRFLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRKNSNALGEKERRF 244
            I F T+AL+ LS LL+S SLG VTATE++ NE  V  MYEQWLVEN KN N LGEKERRF
Sbjct:    3 ISFRTLALLTLSVLLISISLGVVTATESQRNEGGVLTMYEQWLVENGKNYNGLGEKERRF 62

Query:  245 NIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRYLYK 424
             IFKDNLK IE HNS P+R+YE GL +F+DLT DEF+A +L GKM + S   + +RY YK
Sbjct:   63 KIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQASYLGGKMEKKSLSDVAERYQYK 122

Query:  425 EGDVLPEEVDWREKGAVVP-VKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQELLD 601
            EGDVLP+EVDWRE+GAVVP VK QG+CG CWAFAA GAVEG+N+I TGELVSLSEQEL+D
Sbjct:  123 EGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELID 182

Query:  602 CDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVTID 781
            CDRG D  NFGC GG A  AFEFI +N GIV+D+VY YT  DTAACKAIEM TTR VTI+
Sbjct:  183 CDRGND--NFGCAGGGAVWAFEFIKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTIN 240

Query:  782 SYEDAPHNDEMSLKKAVAHQPISVMIEAENMKLYKSGVFTGPCDHWYGNHNVVVVGYGTT 961
             +E  P NDEMSLKKAVA+QPISVMI A NM  YKSGV+ G C + +G+HNV++VGYGT+
Sbjct:  241 GHEVVPVNDEMSLKKAVAYQPISVMISAANMSDYKSGVYKGACSNLWGDHNVLIVGYGTS 300

Query:  962 ERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKSNSAFGLLSLSVC 1141
                DYW+IRNSWG  WGE GY++LQRNFH  TG C VA+ PVYP+KSNS+  LLS SV 
Sbjct:  301 SDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPTGKCAVAVAPVYPIKSNSSSHLLSPSVF 360

Query: 1142 KLGVLFV--LIGWVLL 1183
            KL VLFV  LI   LL
Sbjct:  361 KLVVLFVFQLISLALL 376

>gi|297818854|ref|XP_002877310.1| hypothetical protein ARALYDRAFT_484828
        [Arabidopsis lyrata subsp. lyrata]

          Length = 376

 Score =  490 bits (1261), Expect = 2e-136
 Identities = 248/376 (65%), Positives = 296/376 (78%), Gaps = 5/376 (1%)
 Frame = +2

Query:   65 IRFLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRKNSNALGEKERRF 244
            I F T+AL+ LS LL+S SLG VTATE+  NEAEVR +YE+WLVE+ KN N LGEKERRF
Sbjct:    3 ISFRTLALLTLSVLLISLSLGVVTATESHRNEAEVRTIYERWLVEHGKNYNGLGEKERRF 62

Query:  245 NIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRYLYK 424
             IFKDNLK IE HNS P+R+Y+ GL +F+DLT DEF+A +L GK+ + S   + +RY YK
Sbjct:   63 KIFKDNLKHIEEHNSDPNRSYDRGLNQFSDLTVDEFQASYLGGKIEKKSLSDVAERYQYK 122

Query:  425 EGDVLPEEVDWREKGAVVP-VKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQELLD 601
            EGD+LP+EVDWRE+GAVVP VK QGDCG CWAFAA GAVEG+N+I TGEL+SLSEQEL+D
Sbjct:  123 EGDILPDEVDWRERGAVVPRVKRQGDCGSCWAFAATGAVEGINQITTGELLSLSEQELID 182

Query:  602 CDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVTID 781
            CDRG+D  NFGC GG A  AFEFI +N GIVTD+ Y YT +DTAACKAIEM TTR VTI+
Sbjct:  183 CDRGKD--NFGCAGGGAVWAFEFIKENGGIVTDEDYGYTGDDTAACKAIEMKTTRVVTIN 240

Query:  782 SYEDAPHNDEMSLKKAVAHQPISVMIEAENMKLYKSGVFTGPCDHWYGNHNVVVVGYGTT 961
             +E  P NDEMSLKKAV++QPISVMI A NM  YKSGV+ GPC + +G+HNV++VGYGT+
Sbjct:  241 GHEVVPVNDEMSLKKAVSYQPISVMISAANMSDYKSGVYKGPCSNLWGDHNVLIVGYGTS 300

Query:  962 ERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKSNSAFGLLSLSVC 1141
                DYW+IRNSWG  WGE GY++LQRNF+  TG C VA+ PVYP+K+NSA  LLS SV 
Sbjct:  301 SDEGDYWLIRNSWGPGWGEGGYLRLQRNFNEPTGKCAVAVAPVYPIKTNSASNLLSPSVF 360

Query: 1142 KLGVL--FVLIGWVLL 1183
            KL +L  F LI   LL
Sbjct:  361 KLVLLCIFQLISLALL 376

>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis
        thaliana]

          Length = 362

 Score =  471 bits (1211), Expect = 1e-130
 Identities = 244/362 (67%), Positives = 278/362 (76%), Gaps = 8/362 (2%)
 Frame = +2

Query:   56 AIPIRFLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRKNSNALGEKE 235
            A PIR +  ALVILS LLLSSSLG  T TE + NE EVR MYEQWLVENRKN N LGEKE
Sbjct:    3 ATPIRVIVSALVILSVLLLSSSLGVATETEIERNETEVRLMYEQWLVENRKNYNGLGEKE 62

Query:  236 RRFNIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRY 415
            RRF IFKDNLK ++ HNSVPDRT+E+GLTRFADLT++EFRAI+LR KM RT D V  +RY
Sbjct:   63 RRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRAIYLRKKMERTKDSVKTERY 122

Query:  416 LYKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQEL 595
            LYKEGDVLP+EVDWR  GAVV VK+QG+CG CWAF+AVGAVEG+N+I TGEL+SLSEQEL
Sbjct:  123 LYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQITTGELISLSEQEL 182

Query:  596 LDCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVT 775
            +DCDRG    N GC GG    AFEFI+ N GI TD+ YPY  ND   C A +   TR VT
Sbjct:  183 VDCDRG--FVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNANDLGLCNADKNNNTRVVT 240

Query:  776 IDSYEDAPHNDEMSLKKAVAHQPISVMIEAEN--MKLYKSGVFTGPCDHWYGNHNVVVVG 949
            ID YED P +DE SLKKAVAHQP+SV IEA +   +LYKSGV TG C     +H VVVVG
Sbjct:  241 IDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKSGVMTGTCGISL-DHGVVVVG 299

Query:  950 YGTTERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKSN--SAFGL 1123
            YG+T  GEDYWIIRNSWG NWG+SGY+KLQRN  +  G CG+A+ P YP KS+  S+F L
Sbjct:  300 YGSTS-GEDYWIIRNSWGLNWGDSGYVKLQRNIDDPFGKCGIAMMPSYPTKSSFPSSFDL 358

Query: 1124 LS 1129
            LS
Sbjct:  359 LS 360

>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor
        [Arabidopsis thaliana]

          Length = 362

 Score =  469 bits (1206), Expect = 5e-130
 Identities = 243/362 (67%), Positives = 277/362 (76%), Gaps = 8/362 (2%)
 Frame = +2

Query:   56 AIPIRFLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRKNSNALGEKE 235
            A PIR +  ALVILS LLLSSSLG  T TE + NE EVR MYEQWLVENRKN N LGEKE
Sbjct:    3 ATPIRVIVSALVILSVLLLSSSLGVATETEIERNETEVRLMYEQWLVENRKNYNGLGEKE 62

Query:  236 RRFNIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRY 415
            RRF IFKDNLK ++ HNSVPDRT+E+GLTRFADLT++EFRAI+LR KM R  D V  +RY
Sbjct:   63 RRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRAIYLRKKMERNKDSVKTERY 122

Query:  416 LYKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQEL 595
            LYKEGDVLP+EVDWR  GAVV VK+QG+CG CWAF+AVGAVEG+N+I TGEL+SLSEQEL
Sbjct:  123 LYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQITTGELISLSEQEL 182

Query:  596 LDCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVT 775
            +DCDRG    N GC GG    AFEFI+ N GI TD+ YPY  ND   C A +   TR VT
Sbjct:  183 VDCDRG--FVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNANDLGLCNADKNNNTRVVT 240

Query:  776 IDSYEDAPHNDEMSLKKAVAHQPISVMIEAEN--MKLYKSGVFTGPCDHWYGNHNVVVVG 949
            ID YED P +DE SLKKAVAHQP+SV IEA +   +LYKSGV TG C     +H VVVVG
Sbjct:  241 IDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKSGVMTGTCGISL-DHGVVVVG 299

Query:  950 YGTTERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKSN--SAFGL 1123
            YG+T  GEDYWIIRNSWG NWG+SGY+KLQRN  +  G CG+A+ P YP KS+  S+F L
Sbjct:  300 YGSTS-GEDYWIIRNSWGLNWGDSGYVKLQRNIDDPFGKCGIAMMPSYPTKSSFPSSFDL 358

Query: 1124 LS 1129
            LS
Sbjct:  359 LS 360

>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457
        [Arabidopsis lyrata subsp. lyrata]

          Length = 452

 Score =  453 bits (1163), Expect = 5e-125
 Identities = 229/356 (64%), Positives = 274/356 (76%), Gaps = 8/356 (2%)
 Frame = +2

Query:   53 MAIPIRFLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRKNSNALGEK 232
            MA PI+ +T+AL+I S LL+S SLG VTA +T  NEAE RRMYEQWLVENRKN N LGEK
Sbjct:    1 MATPIKSITLALLIFSMLLISLSLGSVTAADTTRNEAEARRMYEQWLVENRKNYNGLGEK 60

Query:  233 ERRFNIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDR 412
            E RF IF DNLK IE HNSVP++T+E+GLTRFADLT+DEFRAI+LR KM RT  PV G+R
Sbjct:   61 ETRFEIFTDNLKYIEEHNSVPNQTFEVGLTRFADLTNDEFRAIYLRSKMERTRVPVKGER 120

Query:  413 YLYKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQE 592
            YLYK GD LP+++DWR KGAV PVK+QG+CG CWAF+A+GAVEG+N+IKTGEL+SLSEQE
Sbjct:  121 YLYKVGDTLPDQIDWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQIKTGELISLSEQE 180

Query:  593 LLDCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYV 772
            L+DCD    S N GC GG    AF+FI++N GI T++ YPYT  D   C + +   +R V
Sbjct:  181 LVDCD---TSYNGGCGGGLMDYAFKFIIENGGIDTEEDYPYTATDDNICNS-DKKNSRVV 236

Query:  773 TIDSYEDAPHNDEMSLKKAVAHQPISVMIEA--ENMKLYKSGVFTGPCDHWYGNHNVVVV 946
            TID YED P NDE SLKKA+A+QPISV IEA     +LYKSGVFTG C     +H VV V
Sbjct:  237 TIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYKSGVFTGTCGTSL-DHGVVAV 295

Query:  947 GYGTTERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKSNSA 1114
            GYG +E G+DYWI+RNSWG+NWGESGY KL+RN   S+G CGVA+   YP KS+ +
Sbjct:  296 GYG-SEGGQDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPTKSSGS 350

>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein
        [Arabidopsis thaliana]

          Length = 452

 Score =  451 bits (1159), Expect = 2e-124
 Identities = 228/356 (64%), Positives = 272/356 (76%), Gaps = 8/356 (2%)
 Frame = +2

Query:   53 MAIPIRFLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRKNSNALGEK 232
            MA  I+ +T+AL+I S LL+S SLG VTATET  NEAE RRMYE+WLVENRKN N LGEK
Sbjct:    1 MATSIKSITLALLIFSVLLISLSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEK 60

Query:  233 ERRFNIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDR 412
            ERRF IFKDNLK +E H+S+P+RTYE+GLTRFADLT+DEFRAI+LR KM RT  PV G++
Sbjct:   61 ERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKMERTRVPVKGEK 120

Query:  413 YLYKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQE 592
            YLYK GD LP+ +DWR KGAV PVK+QG CG CWAF+A+GAVEG+N+IKTGEL+SLSEQE
Sbjct:  121 YLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQE 180

Query:  593 LLDCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYV 772
            L+DCD    S N GC GG    AF+FI++N GI T++ YPY   D   C + +   TR V
Sbjct:  181 LVDCD---TSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCNS-DKKNTRVV 236

Query:  773 TIDSYEDAPHNDEMSLKKAVAHQPISVMIEA--ENMKLYKSGVFTGPCDHWYGNHNVVVV 946
            TID YED P NDE SLKKA+A+QPISV IEA     +LY SGVFTG C     +H VV V
Sbjct:  237 TIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTSL-DHGVVAV 295

Query:  947 GYGTTERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKSNSA 1114
            GYG +E G+DYWI+RNSWG+NWGESGY KL+RN   S+G CGVA+   YP KS+ +
Sbjct:  296 GYG-SEGGQDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPTKSSGS 350

>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease
        [Brassica oleracea]

          Length = 445

 Score =  423 bits (1085), Expect = 6e-116
 Identities = 222/350 (63%), Positives = 261/350 (74%), Gaps = 10/350 (2%)
 Frame = +2

Query:   71 FLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRKNSNALGEKERRFNI 250
            F  IALV L  LL SSSL GVTA     N  EV +M+E+WLVEN KN N LGEK++RF I
Sbjct:    2 FTAIALVTLLVLLASSSLSGVTAKADHRNPEEV-KMFERWLVENHKNYNGLGEKDKRFEI 60

Query:  251 FKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRYLYKEG 430
            F DNLK ++ HNSVP+++YELGLTRFADLT++EFRAI+LR KM RT D V  +RYL+  G
Sbjct:   61 FMDNLKFVQEHNSVPNQSYELGLTRFADLTNEEFRAIYLRSKMERTRDSVKSERYLHNVG 120

Query:  431 DVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQELLDCDR 610
            D LP+EVDWR KGAVVPVK+QG CG CWAF+A+GAVEG+N+IKTGELVSLSEQEL+DCD 
Sbjct:  121 DKLPDEVDWRAKGAVVPVKDQGSCGSCWAFSAIGAVEGINQIKTGELVSLSEQELVDCD- 179

Query:  611 GEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVTIDSYE 790
               S N GC GG    AF+FI+ N GI T++ YPYT  D   C   +   TR VTID YE
Sbjct:  180 --TSYNNGCGGGLMDYAFQFIISNGGIDTEEDYPYTATDDNICNT-DKKNTRVVTIDGYE 236

Query:  791 DAPHNDEMSLKKAVAHQPISVMIEA--ENMKLYKSGVFTGPCDHWYGNHNVVVVGYGTTE 964
            D P N E SLKKA+A+QPISV IEA     +LYKSGVFTG C     +H VV VGYGT+E
Sbjct:  237 DVPEN-ENSLKKALANQPISVAIEAGGRGFQLYKSGVFTGTCGTAL-DHGVVAVGYGTSE 294

Query:  965 RGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKSNSA 1114
             G+DYWIIRNSWG+NWGESGYIKLQRN  +S+G CGVA+   YP KS+ +
Sbjct:  295 -GQDYWIIRNSWGSNWGESGYIKLQRNIKDSSGKCGVAMMASYPTKSSGS 343

>gi|42572491|ref|NP_974341.1| putative cysteine proteinase [Arabidopsis
        thaliana]

          Length = 290

 Score =  374 bits (958), Expect = 3e-101
 Identities = 193/280 (68%), Positives = 218/280 (77%), Gaps = 4/280 (1%)
 Frame = +2

Query:  56 AIPIRFLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRKNSNALGEKE 235
           A PIR +  ALVILS LLLSSSLG  T TE + NE EVR MYEQWLVENRKN N LGEKE
Sbjct:   3 ATPIRVIVSALVILSVLLLSSSLGVATETEIERNETEVRLMYEQWLVENRKNYNGLGEKE 62

Query: 236 RRFNIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRY 415
           RRF IFKDNLK ++ HNSVPDRT+E+GLTRFADLT++EFRAI+LR KM RT D V  +RY
Sbjct:  63 RRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRAIYLRKKMERTKDSVKTERY 122

Query: 416 LYKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQEL 595
           LYKEGDVLP+EVDWR  GAVV VK+QG+CG CWAF+AVGAVEG+N+I TGEL+SLSEQEL
Sbjct: 123 LYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQITTGELISLSEQEL 182

Query: 596 LDCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVT 775
           +DCDRG    N GC GG    AFEFI+ N GI TD+ YPY  ND   C A +   TR VT
Sbjct: 183 VDCDRG--FVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNANDLGLCNADKNNNTRVVT 240

Query: 776 IDSYEDAPHNDEMSLKKAVAHQPISVMIEAEN--MKLYKS 889
           ID YED P +DE SLKKAVAHQP+SV IEA +   +LYKS
Sbjct: 241 IDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKS 280

>gi|595986|gb|AAA79915.1| cysteine proteinase [Dianthus caryophyllus]

          Length = 427

 Score =  347 bits (888), Expect = 4e-093
 Identities = 175/315 (55%), Positives = 225/315 (71%), Gaps = 12/315 (3%)
 Frame = +2

Query:  182 EQWLVENRKNSNALGEKERRFNIFKDNLKLIEAHNSVPD----RTYELGLTRFADLTDDE 349
            + WLV++RKN NALGEKE+RF IF+DNL+ I+ HN+  +      +ELGL +FADLT+DE
Sbjct:    6 QSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLNKFADLTNDE 65

Query:  350 FRAIHLRGKMVRTSDPVIGDRYLYKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAV 529
            FR I+   K    ++ V  DRY  KEGD LPE VDWR+KGAV  VK+QG CG CWAF+A+
Sbjct:   66 FRRIYFGVKRPEKAESVKSDRYAVKEGDELPESVDWRKKGAVSHVKDQGQCGSCWAFSAI 125

Query:  530 GAVEGLNKIKTGELVSLSEQELLDCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVY 709
            GAVEG+NKI TG+L++LSEQEL+DCD    S N GC GG    AF FI++N GI TDK Y
Sbjct:  126 GAVEGINKIVTGDLITLSEQELVDCD---TSYNSGCDGGLMDYAFRFIINNGGIDTDKDY 182

Query:  710 PYTENDTAACKAIEMVTTRYVTIDSYEDAPHNDEMSLKKAVAHQPISVMIEA--ENMKLY 883
            PY   D  +C +      + VTID  ED P N+E +L+KAVAHQP+ + IEA   + +LY
Sbjct:  183 PYKATD-GSCDS-NRKNAKVVTIDGLEDVPANNEKALQKAVAHQPVRLAIEAGGRDFQLY 240

Query:  884 KSGVFTGPCDHWYGNHNVVVVGYGTTERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTG 1063
            KSGVFTG C     +H VV VGYGTT+ G+DYWI+RNSWG +WGE GYI+++RN  + +G
Sbjct:  241 KSGVFTGSCGTSL-DHGVVAVGYGTTDDGKDYWIVRNSWGDDWGEDGYIRMERNTESKSG 299

Query: 1064 NCGVAIRPVYPLKSN 1108
             CG+AI P YP+K++
Sbjct:  300 KCGIAIEPSYPVKTS 314

>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp.
        lyrata]

          Length = 455

 Score =  344 bits (880), Expect = 3e-092
 Identities = 183/351 (52%), Positives = 237/351 (67%), Gaps = 13/351 (3%)
 Frame = +2

Query:   65 IRFLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRK--NSNALGEKER 238
            +  + +A  +  +++      GV+ T  + ++AEV  +YE WLV++ K  N N+L EK+R
Sbjct:    6 LAMVAVASAVDMSIISYDEKHGVSTTGGR-SDAEVMSIYEAWLVKHGKAQNQNSLVEKDR 64

Query:  239 RFNIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRYL 418
            RF IFKDNL+ I+ HN   + +Y LGLTRFADLT+DE+R+ +L  KM +  +     RY 
Sbjct:   65 RFEIFKDNLRFIDDHNK-KNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSQRYE 123

Query:  419 YKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQELL 598
             + GD LPE +DWR+KGAV  VK+QG CG CWAF+ +GAVEG+N+I TG+L++LSEQEL+
Sbjct:  124 ARVGDELPESIDWRKKGAVAEVKDQGSCGSCWAFSTIGAVEGINQIVTGDLITLSEQELV 183

Query:  599 DCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVTI 778
            DCD    S N GC GG    AFEFI+ N GI TDK YPY   D   C  I     + VTI
Sbjct:  184 DCD---TSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVD-GTCDQIRK-NAKVVTI 238

Query:  779 DSYEDAPHNDEMSLKKAVAHQPISVMIEA--ENMKLYKSGVFTGPCDHWYGNHNVVVVGY 952
            DSYED P   E SLKKAVAHQP+SV IEA     +LY SG+F G C     +H VV VGY
Sbjct:  239 DSYEDVPTYSEESLKKAVAHQPVSVAIEAGGRAFQLYDSGIFDGTCGTQL-DHGVVAVGY 297

Query:  953 GTTERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKS 1105
            G TE G+DYWI+RNSWG +WGESGY+K+ RN  +S+G CG+AI P YP+K+
Sbjct:  298 G-TENGKDYWIVRNSWGKSWGESGYLKMARNIASSSGKCGIAIEPSYPIKN 347

>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]

          Length = 462

 Score =  343 bits (879), Expect = 4e-092
 Identities = 186/357 (52%), Positives = 240/357 (67%), Gaps = 16/357 (4%)
 Frame = +2

Query:   53 MAIPIRFLTIALVILSALLLSSSLGGVTATETKGN---EAEVRRMYEQWLVENRKNSNAL 223
            MAI + F   AL + S+ L  S +       +K +   + EV  MYE WLV++ K+ NAL
Sbjct:    8 MAIALLF---ALFVASSALDMSIINYDATHASKSSWRTDDEVMAMYESWLVKHGKSYNAL 64

Query:  224 GEKERRFNIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVI 403
            GEKE+RF IFKDNL+ I+ HN+  + +Y++GL RFADLT++E+R+ +L  K       V 
Sbjct:   65 GEKEKRFQIFKDNLRFIDEHNAEENLSYKVGLNRFADLTNEEYRSTYLGAKSKPKLSKVK 124

Query:  404 GDRYLYKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLS 583
             DRY  + GD LPE VDWR KGAV P+K+QG CG CWAF+ V AVEG+N+I TGEL++LS
Sbjct:  125 SDRYAPRVGDSLPESVDWRAKGAVAPIKDQGSCGSCWAFSTVNAVEGINQIVTGELITLS 184

Query:  584 EQELLDCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTT 763
            EQEL+DCD+   S N GC GG     FEFI++N GI TDK YPY   D A C        
Sbjct:  185 EQELVDCDK---SYNEGCDGGLMDYGFEFIINNGGIDTDKDYPYLGRD-ARCDQYRK-NA 239

Query:  764 RYVTIDSYEDAPHNDEMSLKKAVAHQPISVMIE--AENMKLYKSGVFTGPCDHWYGNHNV 937
            + VTIDSYED P N+E +LKKAVA QP+SV IE      + Y SG+FTG C     +H V
Sbjct:  240 KVVTIDSYEDVPVNNEEALKKAVASQPVSVGIEGGGRAFQFYDSGIFTGKCGTAL-DHGV 298

Query:  938 VVVGYGTTERGEDYWIIRNSWGANWGESGYIKLQRNF-HNSTGNCGVAIRPVYPLKS 1105
             VVGYG TE+G+DYWI+RNSWG++WGE+GYI+++RN    S G CG+A+ P YPLK+
Sbjct:  299 NVVGYG-TEKGKDYWIVRNSWGSSWGEAGYIRMERNLAGTSVGKCGIAMEPSYPLKN 354

>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]

          Length = 462

 Score =  341 bits (872), Expect = 3e-091
 Identities = 181/351 (51%), Positives = 238/351 (67%), Gaps = 13/351 (3%)
 Frame = +2

Query:   65 IRFLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRK--NSNALGEKER 238
            +  +T++  +  +++      GV+ T  + +EAEV  +YE WLV++ K  + N+L EK+R
Sbjct:   13 LAMVTVSSAVDMSIISYDEKHGVSTTGGR-SEAEVMSIYEAWLVKHGKAQSQNSLVEKDR 71

Query:  239 RFNIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRYL 418
            RF IFKDNL+ ++ HN   + +Y LGLTRFADLT+DE+R+ +L  KM +  +     RY 
Sbjct:   72 RFEIFKDNLRFVDEHNE-KNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSLRYE 130

Query:  419 YKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQELL 598
             + GD LPE +DWR+KGAV  VK+QG CG CWAF+ +GAVEG+N+I TG+L++LSEQEL+
Sbjct:  131 ARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELV 190

Query:  599 DCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVTI 778
            DCD    S N GC GG    AFEFI+ N GI TDK YPY   D   C  I     + VTI
Sbjct:  191 DCD---TSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVD-GTCDQIRK-NAKVVTI 245

Query:  779 DSYEDAPHNDEMSLKKAVAHQPISVMIEA--ENMKLYKSGVFTGPCDHWYGNHNVVVVGY 952
            DSYED P   E SLKKAVAHQPIS+ IEA     +LY SG+F G C     +H VV VGY
Sbjct:  246 DSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQL-DHGVVAVGY 304

Query:  953 GTTERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKS 1105
            G TE G+DYWI+RNSWG +WGESGY+++ RN  +S+G CG+AI P YP+K+
Sbjct:  305 G-TENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYPIKN 354

>gi|226495425|ref|NP_001148706.1| cysteine protease 1 [Zea mays]

          Length = 463

 Score =  340 bits (871), Expect = 4e-091
 Identities = 176/352 (50%), Positives = 234/352 (66%), Gaps = 14/352 (3%)
 Frame = +2

Query:   77 TIALVILSALLLSSSLGGVTATETKGNEA--EVRRMYEQWLVENRKNSNALGEKERRFNI 250
            T A   L  LLLS +     +  + G  +  E RRMY +W+  + +  NA+GE+ERR+ +
Sbjct:    5 TTAAAALLLLLLSLAAAADMSIVSYGERSXEEARRMYAEWMAAHGRTYNAVGEEERRYQV 64

Query:  251 FKDNLKLIEAHNSVPD---RTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRYLY 421
            F+DNL+ I+AHN+  D    ++ LGL RFADLT+DE+RA +L  +     +  +G RY  
Sbjct:   65 FRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDEYRATYLGARTRPQRERKLGARYHA 124

Query:  422 KEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQELLD 601
             + + LPE VDWR KGAV  VK+QG CG CWAF+ + AVEG+N+I TG+L+SLSEQEL+D
Sbjct:  125 ADNEDLPESVDWRAKGAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVD 184

Query:  602 CDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVTID 781
            CD    S N GC GG    AFEFI++N GI T+K YPY   D   C  +     + VTID
Sbjct:  185 CD---TSYNQGCNGGLMDYAFEFIINNGGIDTEKDYPYKGTD-GRCD-VNRKNAKVVTID 239

Query:  782 SYEDAPHNDEMSLKKAVAHQPISVMIEAEN--MKLYKSGVFTGPCDHWYGNHNVVVVGYG 955
            SYED P NDE SL+KAVA+QP+SV IEA     +LY SG+FTG C     +H V  VGYG
Sbjct:  240 SYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLYSSGIFTGSCGTAL-DHGVTAVGYG 298

Query:  956 TTERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKSNS 1111
             TE G+DYWI++NSWG++WGESGY++++RN   S+G CG+A+ P YPLK  +
Sbjct:  299 -TENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGKCGIAVEPSYPLKEGA 349

>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]

          Length = 462

 Score =  340 bits (870), Expect = 5e-091
 Identities = 185/353 (52%), Positives = 240/353 (67%), Gaps = 18/353 (5%)
 Frame = +2

Query:   74 LTIALVILSALLLSSSLG-----GVTATETKGNEAEVRRMYEQWLVENRK--NSNALGEK 232
            L +A+V +S+ +  S +      GV+ T  + +EAEV  +YE WLV++ K  + N+L EK
Sbjct:   11 LFLAMVAVSSAVDMSIISYDEKHGVSTTGGR-SEAEVMSIYEAWLVKHGKAQSQNSLVEK 69

Query:  233 ERRFNIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDR 412
            +RRF IFKDNL+ ++ HN   + +Y LGLTRFADLT+DE+R+ +L  KM +  +     R
Sbjct:   70 DRRFEIFKDNLRFVDEHNE-KNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSLR 128

Query:  413 YLYKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQE 592
            Y  + GD LPE +DWR+KGAV  VK+QG CG CWAF+ +GAVEG+N+I TG+L++LSEQE
Sbjct:  129 YEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQE 188

Query:  593 LLDCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYV 772
            L+DCD    S N GC GG    AFEFI+ N GI TDK YPY   D   C  I     + V
Sbjct:  189 LVDCD---TSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVD-GTCDQIRK-NAKVV 243

Query:  773 TIDSYEDAPHNDEMSLKKAVAHQPISVMIEA--ENMKLYKSGVFTGPCDHWYGNHNVVVV 946
            TIDSYED P   E SLKKAVAHQPIS+ IEA     +LY SG+F G C     +H VV V
Sbjct:  244 TIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQL-DHGVVAV 302

Query:  947 GYGTTERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKS 1105
            GYG TE G+DYWI+RNSWG +WGESGY+++ RN  +S+G CG+AI P YP+K+
Sbjct:  303 GYG-TENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYPIKN 354

>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]

          Length = 433

 Score =  340 bits (870), Expect = 5e-091
 Identities = 185/353 (52%), Positives = 240/353 (67%), Gaps = 18/353 (5%)
 Frame = +2

Query:   74 LTIALVILSALLLSSSLG-----GVTATETKGNEAEVRRMYEQWLVENRK--NSNALGEK 232
            L +A+V +S+ +  S +      GV+ T  + +EAEV  +YE WLV++ K  + N+L EK
Sbjct:   11 LFLAMVAVSSAVDMSIISYDEKHGVSTTGGR-SEAEVMSIYEAWLVKHGKAQSQNSLVEK 69

Query:  233 ERRFNIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDR 412
            +RRF IFKDNL+ ++ HN   + +Y LGLTRFADLT+DE+R+ +L  KM +  +     R
Sbjct:   70 DRRFEIFKDNLRFVDEHNE-KNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSLR 128

Query:  413 YLYKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQE 592
            Y  + GD LPE +DWR+KGAV  VK+QG CG CWAF+ +GAVEG+N+I TG+L++LSEQE
Sbjct:  129 YEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQE 188

Query:  593 LLDCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYV 772
            L+DCD    S N GC GG    AFEFI+ N GI TDK YPY   D   C  I     + V
Sbjct:  189 LVDCD---TSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVD-GTCDQIRK-NAKVV 243

Query:  773 TIDSYEDAPHNDEMSLKKAVAHQPISVMIEA--ENMKLYKSGVFTGPCDHWYGNHNVVVV 946
            TIDSYED P   E SLKKAVAHQPIS+ IEA     +LY SG+F G C     +H VV V
Sbjct:  244 TIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQL-DHGVVAV 302

Query:  947 GYGTTERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKS 1105
            GYG TE G+DYWI+RNSWG +WGESGY+++ RN  +S+G CG+AI P YP+K+
Sbjct:  303 GYG-TENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYPIKN 354

>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]

          Length = 423

 Score =  333 bits (853), Expect = 5e-089
 Identities = 170/308 (55%), Positives = 219/308 (71%), Gaps = 10/308 (3%)
 Frame = +2

Query:  191 LVENRKNSNALGEKERRFNIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLR 370
            LV++ KN NALG KE+RF IFKDNL+ I+ HN   +++++LGL +FADL+++E++++ L 
Sbjct:   11 LVKHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKSMFLG 70

Query:  371 GKMVRTSDPVIGDRYLYKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLN 550
            G+MVR       DR+ Y  GD LP+ VDWREKGAV PVK+QG CG CWAF+ V AVEG+N
Sbjct:   71 GRMVRDRKGFESDRFKYGVGDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTVAAVEGIN 130

Query:  551 KIKTGELVSLSEQELLDCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDT 730
            +I TG+L+SLSEQEL+DCD+G    N GC GG    AFEFIV N GI T+  YPY   D 
Sbjct:  131 QIATGDLISLSEQELVDCDKG---FNQGCNGGFMDYAFEFIVKNGGIDTEDDYPYKGVDG 187

Query:  731 AACKAIEMVTTRYVTIDSYEDAPHNDEMSLKKAVAHQPISVMIEA--ENMKLYKSGVFTG 904
               +       + VTI+ +ED P NDE SLKKAVAHQP+SV IEA     +LY+SG+F G
Sbjct:  188 QCDQ--NRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGIFNG 245

Query:  905 PCDHWYGNHNVVVVGYGTTERGEDYWIIRNSWGANWGESGYIKLQRNF-HNSTGNCGVAI 1081
             C     +H VV VGYG TE G+DYWI+RNSWG NWGE+GYI+L+RN    +TG CG+A+
Sbjct:  246 LCGTDL-DHGVVAVGYG-TEDGKDYWIVRNSWGPNWGENGYIRLERNVASTNTGKCGIAM 303

Query: 1082 RPVYPLKS 1105
            +P YP K+
Sbjct:  304 QPSYPTKT 311

>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695
        [Arabidopsis lyrata subsp. lyrata]

          Length = 308

 Score =  317 bits (811), Expect = 3e-084
 Identities = 160/245 (65%), Positives = 186/245 (75%), Gaps = 8/245 (3%)
 Frame = +2

Query:  407 DRYLYKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSE 586
            DRYLYKEGD+LP+E+DWR KGAVVPVK+QG+CG CWAF+AVGAVEG+N+IKTGEL+SLS+
Sbjct:   66 DRYLYKEGDILPDEIDWRAKGAVVPVKDQGNCGSCWAFSAVGAVEGINQIKTGELISLSD 125

Query:  587 QELLDCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTR 766
            QEL+DCDRG    N GC GG    AFEFI++N GI +D+ YPYT  D   C A +   TR
Sbjct:  126 QELIDCDRG--FVNAGCEGGVMNYAFEFIINNGGIESDQDYPYTATDLGVCNADKKNNTR 183

Query:  767 YVTIDSYEDAPHNDEMSLKKAVAHQPISVMIEAEN--MKLYKSGVFTGPCDHWYGNHNVV 940
             V ID YE    NDE SLKKAVAHQP+ V IEA +   KLYKSGVFTG C   Y +H VV
Sbjct:  184 VVKIDGYEYVAQNDEKSLKKAVAHQPVGVAIEASSQAFKLYKSGVFTGTCG-IYLDHGVV 242

Query:  941 VVGYGTTERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKSN--SA 1114
            VVGYGT+  GEDYWIIRNSWG NWGE+GY+KLQRN  +S G CGVA+ P YP KS+  S+
Sbjct:  243 VVGYGTSS-GEDYWIIRNSWGLNWGENGYVKLQRNIDDSFGKCGVAMMPSYPTKSSFPSS 301

Query: 1115 FGLLS 1129
            F  LS
Sbjct:  302 FDFLS 306

>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]

          Length = 343

 Score =  283 bits (723), Expect = 5e-074
 Identities = 152/334 (45%), Positives = 206/334 (61%), Gaps = 9/334 (2%)
 Frame = +2

Query:  101 ALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRKNSNALGEKERRFNIFKDNLKLIEA 280
            ALL+   L  V  T     +A +   ++QW+ +  K  N   E E+RF IFK+N+  IE 
Sbjct:   13 ALLMCLGLWAVQVTSRTLQDASMYERHQQWMGQYAKIYNDHQEWEKRFQIFKENVNYIET 72

Query:  281 HNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRYLYKEGDVLPEEVDWR 460
             N    R Y+LG+ +F DLT++EF A   R K    S  +  + Y Y+    +P  VDWR
Sbjct:   73 SNKEGGRFYKLGVNQFVDLTNEEFIAPRNRFKGHMCSSIIRTNTYKYENVTTVPSNVDWR 132

Query:  461 EKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQELLDCD-RGEDSGNFGC 637
            +KGAV PVK+QG CG CWAF+AV A EG++++ TG+L+SLSEQEL+DCD +G D    GC
Sbjct:  133 QKGAVTPVKDQGQCGCCWAFSAVAATEGIHQLSTGKLISLSEQELVDCDTKGVDQ---GC 189

Query:  638 LGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVTIDSYEDAPHNDEMS 817
             GG   DAF+FI+ N G+ T+  YPY   D   C A E  +    TI SYED P N+E +
Sbjct:  190 EGGLMDDAFKFIIQNHGLDTEAKYPYQGVD-GTCNANE-ASINAATITSYEDVPTNNEQA 247

Query:  818 LKKAVAHQPISVMIEA--ENMKLYKSGVFTGPCDHWYGNHNVVVVGYGTTERGEDYWIIR 991
            L+KAVA+QPISV I+A   + + Y SGVFTG C     +H V  VGYG ++ G  YW+++
Sbjct:  248 LQKAVANQPISVAIDASGSDFQFYTSGVFTGSCGTEL-DHGVTAVGYGVSDDGTKYWLVK 306

Query:  992 NSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVY 1093
            NSWG +WGE GYI++QR      G CG+A++  Y
Sbjct:  307 NSWGTSWGEEGYIRMQRGVDAVEGLCGIAMQASY 340

>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]

          Length = 361

 Score =  280 bits (715), Expect = 5e-073
 Identities = 153/345 (44%), Positives = 203/345 (58%), Gaps = 16/345 (4%)
 Frame = +2

Query:   71 FLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRKNSNALGEKERRFNI 250
            F+  AL++L A           AT     EA +   +EQW+++  +      EK  RF I
Sbjct:   28 FMIAALILLGA-------WACQATSRTLPEASMFERHEQWMIQYGRVYKDEAEKSVRFQI 80

Query:  251 FKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRYLYKEG 430
            F DN+K IE  N    ++Y+L +  FAD T++EF+A     KM  +S P     + Y+  
Sbjct:   81 FMDNVKFIEEFNKDGRQSYKLAVNEFADQTNEEFQASRNGYKMAVSSRPSQTTLFRYENV 140

Query:  431 DVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQELLDCDR 610
              +P  +DWR+KGAV PVK+QG CG CWAF+ + A EG+ K+KTG+L+SLSEQEL+DCD+
Sbjct:  141 TAVPSSMDWRKKGAVTPVKDQGQCGSCWAFSTIAATEGITKLKTGKLISLSEQELVDCDK 200

Query:  611 -GEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVTIDSY 787
             GED    GC GG   D FEFIV N GI  +  YPYT  D   C + E   +R   I  Y
Sbjct:  201 TGEDQ---GCEGGYMEDGFEFIVKNKGIALEASYPYTAAD-GTCNSKE-EASRAAKISGY 255

Query:  788 EDAPHNDEMSLKKAVAHQPISVMIEAENM--KLYKSGVFTGPCDHWYGNHNVVVVGYGTT 961
            E  P N E +L KAVA+QP+SV I+A  +  + Y SGVFTG C     +H V  VGYG T
Sbjct:  256 EKVPANSETALLKAVANQPVSVSIDASGVAFQFYSSGVFTGECGTDL-DHGVTAVGYGKT 314

Query:  962 ERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYP 1096
              G  YW+++NSWGA+WG+SGYI +QR      G CG+A+   YP
Sbjct:  315 SDGTKYWLVKNSWGASWGDSGYIMMQRGVAAKGGLCGIAMDASYP 359

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,383,062,991,470
Number of Sequences: 15229318
Number of Extensions: 2383062991470
Number of Successful Extensions: 601402404
Number of sequences better than 0.0: 0
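
The bit scores and E-values in this report are related by the standard Karlin-Altschul expression E = m * n * 2^(-S'), where S' is the bit score, m the query length, and n the database size in letters. The sketch below applies it to the top hit under simplifying assumptions: the translated query length is taken as 1337 // 3 residues, and no effective-length (edge) correction is applied, so the result is only expected to land in the same order of magnitude as the reported 4e-139. The function name `approx_evalue` is illustrative.

```python
def approx_evalue(bit_score, query_len, db_letters):
    """Uncorrected Karlin-Altschul estimate: E = m * n * 2**(-S')."""
    return query_len * db_letters * 2.0 ** (-bit_score)

if __name__ == "__main__":
    query_len_aa = 1337 // 3      # assumption: one reading frame of the 1337-letter query
    db_letters = 5_219_829_378    # "total letters" from the database header
    print(approx_evalue(500, query_len_aa, db_letters))
```

The gap between this estimate and the reported value is consistent with BLAST shortening both lengths before computing the search space, which the sketch deliberately leaves out.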