BLASTX 7.6.2
Query= UN20227 /QuerySize=1338
(1337 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|18407678|ref|NP_566867.1| putative cysteine proteinase [Arabi... 500 4e-139
gi|21593501|gb|AAM65468.1| cysteine proteinase [Arabidopsis thal... 497 2e-138
gi|297818854|ref|XP_002877310.1| hypothetical protein ARALYDRAFT... 490 2e-136
gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabi... 471 1e-130
gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A p... 469 5e-130
gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT... 453 5e-125
gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease f... 451 2e-124
gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cyste... 423 6e-116
gi|42572491|ref|NP_974341.1| putative cysteine proteinase [Arabi... 374 3e-101
gi|595986|gb|AAA79915.1| cysteine proteinase [Dianthus caryophyl... 347 4e-093
gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis ... 344 3e-092
gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon gran... 343 4e-092
gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana] 341 3e-091
gi|226495425|ref|NP_001148706.1| cysteine protease 1 [Zea mays] 340 4e-091
gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidop... 340 5e-091
gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidops... 340 5e-091
gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota] 333 5e-089
gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT... 317 3e-084
gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sini... 283 5e-074
gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota] 280 5e-073
>gi|18407678|ref|NP_566867.1| putative cysteine proteinase [Arabidopsis
thaliana]
Length = 376
Score = 500 bits (1285), Expect = 4e-139
Identities = 255/376 (67%), Positives = 294/376 (78%), Gaps = 5/376 (1%)
Frame = +2
Query: 65 IRFLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRKNSNALGEKERRF 244
I F T+AL+ LS LL+S SLG VTATE++ NE EV MYEQWLVEN KN N LGEKERRF
Sbjct: 3 ISFRTLALLTLSVLLISISLGVVTATESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRF 62
Query: 245 NIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRYLYK 424
IFKDNLK IE HNS P+R+YE GL +F+DLT DEF+A +L GKM + S + +RY YK
Sbjct: 63 KIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQASYLGGKMEKKSLSDVAERYQYK 122
Query: 425 EGDVLPEEVDWREKGAVVP-VKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQELLD 601
EGDVLP+EVDWRE+GAVVP VK QG+CG CWAFAA GAVEG+N+I TGELVSLSEQEL+D
Sbjct: 123 EGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELID 182
Query: 602 CDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVTID 781
CDRG D NFGC GG A AFEFI +N GIV+D+VY YT DTAACKAIEM TTR VTI+
Sbjct: 183 CDRGND--NFGCAGGGAVWAFEFIKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTIN 240
Query: 782 SYEDAPHNDEMSLKKAVAHQPISVMIEAENMKLYKSGVFTGPCDHWYGNHNVVVVGYGTT 961
+E P NDEMSLKKAVA+QPISVMI A NM YKSGV+ G C + +G+HNV++VGYGT+
Sbjct: 241 GHEVVPVNDEMSLKKAVAYQPISVMISAANMSDYKSGVYKGACSNLWGDHNVLIVGYGTS 300
Query: 962 ERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKSNSAFGLLSLSVC 1141
DYW+IRNSWG WGE GY++LQRNFH TG C VA+ PVYP+KSNS+ LLS SV
Sbjct: 301 SDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPTGKCAVAVAPVYPIKSNSSSHLLSPSVF 360
Query: 1142 KLGVLFV--LIGWVLL 1183
KL VLFV LI LL
Sbjct: 361 KLVVLFVFQLISLALL 376
>gi|21593501|gb|AAM65468.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 497 bits (1278), Expect = 2e-138
Identities = 254/376 (67%), Positives = 293/376 (77%), Gaps = 5/376 (1%)
Frame = +2
Query: 65 IRFLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRKNSNALGEKERRF 244
I F T+AL+ LS LL+S SLG VTATE++ NE V MYEQWLVEN KN N LGEKERRF
Sbjct: 3 ISFRTLALLTLSVLLISISLGVVTATESQRNEGGVLTMYEQWLVENGKNYNGLGEKERRF 62
Query: 245 NIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRYLYK 424
IFKDNLK IE HNS P+R+YE GL +F+DLT DEF+A +L GKM + S + +RY YK
Sbjct: 63 KIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQASYLGGKMEKKSLSDVAERYQYK 122
Query: 425 EGDVLPEEVDWREKGAVVP-VKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQELLD 601
EGDVLP+EVDWRE+GAVVP VK QG+CG CWAFAA GAVEG+N+I TGELVSLSEQEL+D
Sbjct: 123 EGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELID 182
Query: 602 CDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVTID 781
CDRG D NFGC GG A AFEFI +N GIV+D+VY YT DTAACKAIEM TTR VTI+
Sbjct: 183 CDRGND--NFGCAGGGAVWAFEFIKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTIN 240
Query: 782 SYEDAPHNDEMSLKKAVAHQPISVMIEAENMKLYKSGVFTGPCDHWYGNHNVVVVGYGTT 961
+E P NDEMSLKKAVA+QPISVMI A NM YKSGV+ G C + +G+HNV++VGYGT+
Sbjct: 241 GHEVVPVNDEMSLKKAVAYQPISVMISAANMSDYKSGVYKGACSNLWGDHNVLIVGYGTS 300
Query: 962 ERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKSNSAFGLLSLSVC 1141
DYW+IRNSWG WGE GY++LQRNFH TG C VA+ PVYP+KSNS+ LLS SV
Sbjct: 301 SDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPTGKCAVAVAPVYPIKSNSSSHLLSPSVF 360
Query: 1142 KLGVLFV--LIGWVLL 1183
KL VLFV LI LL
Sbjct: 361 KLVVLFVFQLISLALL 376
>gi|297818854|ref|XP_002877310.1| hypothetical protein ARALYDRAFT_484828
[Arabidopsis lyrata subsp. lyrata]
Length = 376
Score = 490 bits (1261), Expect = 2e-136
Identities = 248/376 (65%), Positives = 296/376 (78%), Gaps = 5/376 (1%)
Frame = +2
Query: 65 IRFLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRKNSNALGEKERRF 244
I F T+AL+ LS LL+S SLG VTATE+ NEAEVR +YE+WLVE+ KN N LGEKERRF
Sbjct: 3 ISFRTLALLTLSVLLISLSLGVVTATESHRNEAEVRTIYERWLVEHGKNYNGLGEKERRF 62
Query: 245 NIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRYLYK 424
IFKDNLK IE HNS P+R+Y+ GL +F+DLT DEF+A +L GK+ + S + +RY YK
Sbjct: 63 KIFKDNLKHIEEHNSDPNRSYDRGLNQFSDLTVDEFQASYLGGKIEKKSLSDVAERYQYK 122
Query: 425 EGDVLPEEVDWREKGAVVP-VKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQELLD 601
EGD+LP+EVDWRE+GAVVP VK QGDCG CWAFAA GAVEG+N+I TGEL+SLSEQEL+D
Sbjct: 123 EGDILPDEVDWRERGAVVPRVKRQGDCGSCWAFAATGAVEGINQITTGELLSLSEQELID 182
Query: 602 CDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVTID 781
CDRG+D NFGC GG A AFEFI +N GIVTD+ Y YT +DTAACKAIEM TTR VTI+
Sbjct: 183 CDRGKD--NFGCAGGGAVWAFEFIKENGGIVTDEDYGYTGDDTAACKAIEMKTTRVVTIN 240
Query: 782 SYEDAPHNDEMSLKKAVAHQPISVMIEAENMKLYKSGVFTGPCDHWYGNHNVVVVGYGTT 961
+E P NDEMSLKKAV++QPISVMI A NM YKSGV+ GPC + +G+HNV++VGYGT+
Sbjct: 241 GHEVVPVNDEMSLKKAVSYQPISVMISAANMSDYKSGVYKGPCSNLWGDHNVLIVGYGTS 300
Query: 962 ERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKSNSAFGLLSLSVC 1141
DYW+IRNSWG WGE GY++LQRNF+ TG C VA+ PVYP+K+NSA LLS SV
Sbjct: 301 SDEGDYWLIRNSWGPGWGEGGYLRLQRNFNEPTGKCAVAVAPVYPIKTNSASNLLSPSVF 360
Query: 1142 KLGVL--FVLIGWVLL 1183
KL +L F LI LL
Sbjct: 361 KLVLLCIFQLISLALL 376
>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis
thaliana]
Length = 362
Score = 471 bits (1211), Expect = 1e-130
Identities = 244/362 (67%), Positives = 278/362 (76%), Gaps = 8/362 (2%)
Frame = +2
Query: 56 AIPIRFLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRKNSNALGEKE 235
A PIR + ALVILS LLLSSSLG T TE + NE EVR MYEQWLVENRKN N LGEKE
Sbjct: 3 ATPIRVIVSALVILSVLLLSSSLGVATETEIERNETEVRLMYEQWLVENRKNYNGLGEKE 62
Query: 236 RRFNIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRY 415
RRF IFKDNLK ++ HNSVPDRT+E+GLTRFADLT++EFRAI+LR KM RT D V +RY
Sbjct: 63 RRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRAIYLRKKMERTKDSVKTERY 122
Query: 416 LYKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQEL 595
LYKEGDVLP+EVDWR GAVV VK+QG+CG CWAF+AVGAVEG+N+I TGEL+SLSEQEL
Sbjct: 123 LYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQITTGELISLSEQEL 182
Query: 596 LDCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVT 775
+DCDRG N GC GG AFEFI+ N GI TD+ YPY ND C A + TR VT
Sbjct: 183 VDCDRG--FVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNANDLGLCNADKNNNTRVVT 240
Query: 776 IDSYEDAPHNDEMSLKKAVAHQPISVMIEAEN--MKLYKSGVFTGPCDHWYGNHNVVVVG 949
ID YED P +DE SLKKAVAHQP+SV IEA + +LYKSGV TG C +H VVVVG
Sbjct: 241 IDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKSGVMTGTCGISL-DHGVVVVG 299
Query: 950 YGTTERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKSN--SAFGL 1123
YG+T GEDYWIIRNSWG NWG+SGY+KLQRN + G CG+A+ P YP KS+ S+F L
Sbjct: 300 YGSTS-GEDYWIIRNSWGLNWGDSGYVKLQRNIDDPFGKCGIAMMPSYPTKSSFPSSFDL 358
Query: 1124 LS 1129
LS
Sbjct: 359 LS 360
>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor
[Arabidopsis thaliana]
Length = 362
Score = 469 bits (1206), Expect = 5e-130
Identities = 243/362 (67%), Positives = 277/362 (76%), Gaps = 8/362 (2%)
Frame = +2
Query: 56 AIPIRFLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRKNSNALGEKE 235
A PIR + ALVILS LLLSSSLG T TE + NE EVR MYEQWLVENRKN N LGEKE
Sbjct: 3 ATPIRVIVSALVILSVLLLSSSLGVATETEIERNETEVRLMYEQWLVENRKNYNGLGEKE 62
Query: 236 RRFNIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRY 415
RRF IFKDNLK ++ HNSVPDRT+E+GLTRFADLT++EFRAI+LR KM R D V +RY
Sbjct: 63 RRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRAIYLRKKMERNKDSVKTERY 122
Query: 416 LYKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQEL 595
LYKEGDVLP+EVDWR GAVV VK+QG+CG CWAF+AVGAVEG+N+I TGEL+SLSEQEL
Sbjct: 123 LYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQITTGELISLSEQEL 182
Query: 596 LDCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVT 775
+DCDRG N GC GG AFEFI+ N GI TD+ YPY ND C A + TR VT
Sbjct: 183 VDCDRG--FVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNANDLGLCNADKNNNTRVVT 240
Query: 776 IDSYEDAPHNDEMSLKKAVAHQPISVMIEAEN--MKLYKSGVFTGPCDHWYGNHNVVVVG 949
ID YED P +DE SLKKAVAHQP+SV IEA + +LYKSGV TG C +H VVVVG
Sbjct: 241 IDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKSGVMTGTCGISL-DHGVVVVG 299
Query: 950 YGTTERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKSN--SAFGL 1123
YG+T GEDYWIIRNSWG NWG+SGY+KLQRN + G CG+A+ P YP KS+ S+F L
Sbjct: 300 YGSTS-GEDYWIIRNSWGLNWGDSGYVKLQRNIDDPFGKCGIAMMPSYPTKSSFPSSFDL 358
Query: 1124 LS 1129
LS
Sbjct: 359 LS 360
>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457
[Arabidopsis lyrata subsp. lyrata]
Length = 452
Score = 453 bits (1163), Expect = 5e-125
Identities = 229/356 (64%), Positives = 274/356 (76%), Gaps = 8/356 (2%)
Frame = +2
Query: 53 MAIPIRFLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRKNSNALGEK 232
MA PI+ +T+AL+I S LL+S SLG VTA +T NEAE RRMYEQWLVENRKN N LGEK
Sbjct: 1 MATPIKSITLALLIFSMLLISLSLGSVTAADTTRNEAEARRMYEQWLVENRKNYNGLGEK 60
Query: 233 ERRFNIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDR 412
E RF IF DNLK IE HNSVP++T+E+GLTRFADLT+DEFRAI+LR KM RT PV G+R
Sbjct: 61 ETRFEIFTDNLKYIEEHNSVPNQTFEVGLTRFADLTNDEFRAIYLRSKMERTRVPVKGER 120
Query: 413 YLYKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQE 592
YLYK GD LP+++DWR KGAV PVK+QG+CG CWAF+A+GAVEG+N+IKTGEL+SLSEQE
Sbjct: 121 YLYKVGDTLPDQIDWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQIKTGELISLSEQE 180
Query: 593 LLDCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYV 772
L+DCD S N GC GG AF+FI++N GI T++ YPYT D C + + +R V
Sbjct: 181 LVDCD---TSYNGGCGGGLMDYAFKFIIENGGIDTEEDYPYTATDDNICNS-DKKNSRVV 236
Query: 773 TIDSYEDAPHNDEMSLKKAVAHQPISVMIEA--ENMKLYKSGVFTGPCDHWYGNHNVVVV 946
TID YED P NDE SLKKA+A+QPISV IEA +LYKSGVFTG C +H VV V
Sbjct: 237 TIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYKSGVFTGTCGTSL-DHGVVAV 295
Query: 947 GYGTTERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKSNSA 1114
GYG +E G+DYWI+RNSWG+NWGESGY KL+RN S+G CGVA+ YP KS+ +
Sbjct: 296 GYG-SEGGQDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPTKSSGS 350
>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein
[Arabidopsis thaliana]
Length = 452
Score = 451 bits (1159), Expect = 2e-124
Identities = 228/356 (64%), Positives = 272/356 (76%), Gaps = 8/356 (2%)
Frame = +2
Query: 53 MAIPIRFLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRKNSNALGEK 232
MA I+ +T+AL+I S LL+S SLG VTATET NEAE RRMYE+WLVENRKN N LGEK
Sbjct: 1 MATSIKSITLALLIFSVLLISLSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEK 60
Query: 233 ERRFNIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDR 412
ERRF IFKDNLK +E H+S+P+RTYE+GLTRFADLT+DEFRAI+LR KM RT PV G++
Sbjct: 61 ERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKMERTRVPVKGEK 120
Query: 413 YLYKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQE 592
YLYK GD LP+ +DWR KGAV PVK+QG CG CWAF+A+GAVEG+N+IKTGEL+SLSEQE
Sbjct: 121 YLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQE 180
Query: 593 LLDCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYV 772
L+DCD S N GC GG AF+FI++N GI T++ YPY D C + + TR V
Sbjct: 181 LVDCD---TSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCNS-DKKNTRVV 236
Query: 773 TIDSYEDAPHNDEMSLKKAVAHQPISVMIEA--ENMKLYKSGVFTGPCDHWYGNHNVVVV 946
TID YED P NDE SLKKA+A+QPISV IEA +LY SGVFTG C +H VV V
Sbjct: 237 TIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTSL-DHGVVAV 295
Query: 947 GYGTTERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKSNSA 1114
GYG +E G+DYWI+RNSWG+NWGESGY KL+RN S+G CGVA+ YP KS+ +
Sbjct: 296 GYG-SEGGQDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPTKSSGS 350
>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease
[Brassica oleracea]
Length = 445
Score = 423 bits (1085), Expect = 6e-116
Identities = 222/350 (63%), Positives = 261/350 (74%), Gaps = 10/350 (2%)
Frame = +2
Query: 71 FLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRKNSNALGEKERRFNI 250
F IALV L LL SSSL GVTA N EV +M+E+WLVEN KN N LGEK++RF I
Sbjct: 2 FTAIALVTLLVLLASSSLSGVTAKADHRNPEEV-KMFERWLVENHKNYNGLGEKDKRFEI 60
Query: 251 FKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRYLYKEG 430
F DNLK ++ HNSVP+++YELGLTRFADLT++EFRAI+LR KM RT D V +RYL+ G
Sbjct: 61 FMDNLKFVQEHNSVPNQSYELGLTRFADLTNEEFRAIYLRSKMERTRDSVKSERYLHNVG 120
Query: 431 DVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQELLDCDR 610
D LP+EVDWR KGAVVPVK+QG CG CWAF+A+GAVEG+N+IKTGELVSLSEQEL+DCD
Sbjct: 121 DKLPDEVDWRAKGAVVPVKDQGSCGSCWAFSAIGAVEGINQIKTGELVSLSEQELVDCD- 179
Query: 611 GEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVTIDSYE 790
S N GC GG AF+FI+ N GI T++ YPYT D C + TR VTID YE
Sbjct: 180 --TSYNNGCGGGLMDYAFQFIISNGGIDTEEDYPYTATDDNICNT-DKKNTRVVTIDGYE 236
Query: 791 DAPHNDEMSLKKAVAHQPISVMIEA--ENMKLYKSGVFTGPCDHWYGNHNVVVVGYGTTE 964
D P N E SLKKA+A+QPISV IEA +LYKSGVFTG C +H VV VGYGT+E
Sbjct: 237 DVPEN-ENSLKKALANQPISVAIEAGGRGFQLYKSGVFTGTCGTAL-DHGVVAVGYGTSE 294
Query: 965 RGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKSNSA 1114
G+DYWIIRNSWG+NWGESGYIKLQRN +S+G CGVA+ YP KS+ +
Sbjct: 295 -GQDYWIIRNSWGSNWGESGYIKLQRNIKDSSGKCGVAMMASYPTKSSGS 343
>gi|42572491|ref|NP_974341.1| putative cysteine proteinase [Arabidopsis
thaliana]
Length = 290
Score = 374 bits (958), Expect = 3e-101
Identities = 193/280 (68%), Positives = 218/280 (77%), Gaps = 4/280 (1%)
Frame = +2
Query: 56 AIPIRFLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRKNSNALGEKE 235
A PIR + ALVILS LLLSSSLG T TE + NE EVR MYEQWLVENRKN N LGEKE
Sbjct: 3 ATPIRVIVSALVILSVLLLSSSLGVATETEIERNETEVRLMYEQWLVENRKNYNGLGEKE 62
Query: 236 RRFNIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRY 415
RRF IFKDNLK ++ HNSVPDRT+E+GLTRFADLT++EFRAI+LR KM RT D V +RY
Sbjct: 63 RRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRAIYLRKKMERTKDSVKTERY 122
Query: 416 LYKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQEL 595
LYKEGDVLP+EVDWR GAVV VK+QG+CG CWAF+AVGAVEG+N+I TGEL+SLSEQEL
Sbjct: 123 LYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQITTGELISLSEQEL 182
Query: 596 LDCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVT 775
+DCDRG N GC GG AFEFI+ N GI TD+ YPY ND C A + TR VT
Sbjct: 183 VDCDRG--FVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNANDLGLCNADKNNNTRVVT 240
Query: 776 IDSYEDAPHNDEMSLKKAVAHQPISVMIEAEN--MKLYKS 889
ID YED P +DE SLKKAVAHQP+SV IEA + +LYKS
Sbjct: 241 IDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKS 280
>gi|595986|gb|AAA79915.1| cysteine proteinase [Dianthus caryophyllus]
Length = 427
Score = 347 bits (888), Expect = 4e-093
Identities = 175/315 (55%), Positives = 225/315 (71%), Gaps = 12/315 (3%)
Frame = +2
Query: 182 EQWLVENRKNSNALGEKERRFNIFKDNLKLIEAHNSVPD----RTYELGLTRFADLTDDE 349
+ WLV++RKN NALGEKE+RF IF+DNL+ I+ HN+ + +ELGL +FADLT+DE
Sbjct: 6 QSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLNKFADLTNDE 65
Query: 350 FRAIHLRGKMVRTSDPVIGDRYLYKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAV 529
FR I+ K ++ V DRY KEGD LPE VDWR+KGAV VK+QG CG CWAF+A+
Sbjct: 66 FRRIYFGVKRPEKAESVKSDRYAVKEGDELPESVDWRKKGAVSHVKDQGQCGSCWAFSAI 125
Query: 530 GAVEGLNKIKTGELVSLSEQELLDCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVY 709
GAVEG+NKI TG+L++LSEQEL+DCD S N GC GG AF FI++N GI TDK Y
Sbjct: 126 GAVEGINKIVTGDLITLSEQELVDCD---TSYNSGCDGGLMDYAFRFIINNGGIDTDKDY 182
Query: 710 PYTENDTAACKAIEMVTTRYVTIDSYEDAPHNDEMSLKKAVAHQPISVMIEA--ENMKLY 883
PY D +C + + VTID ED P N+E +L+KAVAHQP+ + IEA + +LY
Sbjct: 183 PYKATD-GSCDS-NRKNAKVVTIDGLEDVPANNEKALQKAVAHQPVRLAIEAGGRDFQLY 240
Query: 884 KSGVFTGPCDHWYGNHNVVVVGYGTTERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTG 1063
KSGVFTG C +H VV VGYGTT+ G+DYWI+RNSWG +WGE GYI+++RN + +G
Sbjct: 241 KSGVFTGSCGTSL-DHGVVAVGYGTTDDGKDYWIVRNSWGDDWGEDGYIRMERNTESKSG 299
Query: 1064 NCGVAIRPVYPLKSN 1108
CG+AI P YP+K++
Sbjct: 300 KCGIAIEPSYPVKTS 314
>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp.
lyrata]
Length = 455
Score = 344 bits (880), Expect = 3e-092
Identities = 183/351 (52%), Positives = 237/351 (67%), Gaps = 13/351 (3%)
Frame = +2
Query: 65 IRFLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRK--NSNALGEKER 238
+ + +A + +++ GV+ T + ++AEV +YE WLV++ K N N+L EK+R
Sbjct: 6 LAMVAVASAVDMSIISYDEKHGVSTTGGR-SDAEVMSIYEAWLVKHGKAQNQNSLVEKDR 64
Query: 239 RFNIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRYL 418
RF IFKDNL+ I+ HN + +Y LGLTRFADLT+DE+R+ +L KM + + RY
Sbjct: 65 RFEIFKDNLRFIDDHNK-KNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSQRYE 123
Query: 419 YKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQELL 598
+ GD LPE +DWR+KGAV VK+QG CG CWAF+ +GAVEG+N+I TG+L++LSEQEL+
Sbjct: 124 ARVGDELPESIDWRKKGAVAEVKDQGSCGSCWAFSTIGAVEGINQIVTGDLITLSEQELV 183
Query: 599 DCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVTI 778
DCD S N GC GG AFEFI+ N GI TDK YPY D C I + VTI
Sbjct: 184 DCD---TSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVD-GTCDQIRK-NAKVVTI 238
Query: 779 DSYEDAPHNDEMSLKKAVAHQPISVMIEA--ENMKLYKSGVFTGPCDHWYGNHNVVVVGY 952
DSYED P E SLKKAVAHQP+SV IEA +LY SG+F G C +H VV VGY
Sbjct: 239 DSYEDVPTYSEESLKKAVAHQPVSVAIEAGGRAFQLYDSGIFDGTCGTQL-DHGVVAVGY 297
Query: 953 GTTERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKS 1105
G TE G+DYWI+RNSWG +WGESGY+K+ RN +S+G CG+AI P YP+K+
Sbjct: 298 G-TENGKDYWIVRNSWGKSWGESGYLKMARNIASSSGKCGIAIEPSYPIKN 347
>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
Length = 462
Score = 343 bits (879), Expect = 4e-092
Identities = 186/357 (52%), Positives = 240/357 (67%), Gaps = 16/357 (4%)
Frame = +2
Query: 53 MAIPIRFLTIALVILSALLLSSSLGGVTATETKGN---EAEVRRMYEQWLVENRKNSNAL 223
MAI + F AL + S+ L S + +K + + EV MYE WLV++ K+ NAL
Sbjct: 8 MAIALLF---ALFVASSALDMSIINYDATHASKSSWRTDDEVMAMYESWLVKHGKSYNAL 64
Query: 224 GEKERRFNIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVI 403
GEKE+RF IFKDNL+ I+ HN+ + +Y++GL RFADLT++E+R+ +L K V
Sbjct: 65 GEKEKRFQIFKDNLRFIDEHNAEENLSYKVGLNRFADLTNEEYRSTYLGAKSKPKLSKVK 124
Query: 404 GDRYLYKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLS 583
DRY + GD LPE VDWR KGAV P+K+QG CG CWAF+ V AVEG+N+I TGEL++LS
Sbjct: 125 SDRYAPRVGDSLPESVDWRAKGAVAPIKDQGSCGSCWAFSTVNAVEGINQIVTGELITLS 184
Query: 584 EQELLDCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTT 763
EQEL+DCD+ S N GC GG FEFI++N GI TDK YPY D A C
Sbjct: 185 EQELVDCDK---SYNEGCDGGLMDYGFEFIINNGGIDTDKDYPYLGRD-ARCDQYRK-NA 239
Query: 764 RYVTIDSYEDAPHNDEMSLKKAVAHQPISVMIE--AENMKLYKSGVFTGPCDHWYGNHNV 937
+ VTIDSYED P N+E +LKKAVA QP+SV IE + Y SG+FTG C +H V
Sbjct: 240 KVVTIDSYEDVPVNNEEALKKAVASQPVSVGIEGGGRAFQFYDSGIFTGKCGTAL-DHGV 298
Query: 938 VVVGYGTTERGEDYWIIRNSWGANWGESGYIKLQRNF-HNSTGNCGVAIRPVYPLKS 1105
VVGYG TE+G+DYWI+RNSWG++WGE+GYI+++RN S G CG+A+ P YPLK+
Sbjct: 299 NVVGYG-TEKGKDYWIVRNSWGSSWGEAGYIRMERNLAGTSVGKCGIAMEPSYPLKN 354
>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
Length = 462
Score = 341 bits (872), Expect = 3e-091
Identities = 181/351 (51%), Positives = 238/351 (67%), Gaps = 13/351 (3%)
Frame = +2
Query: 65 IRFLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRK--NSNALGEKER 238
+ +T++ + +++ GV+ T + +EAEV +YE WLV++ K + N+L EK+R
Sbjct: 13 LAMVTVSSAVDMSIISYDEKHGVSTTGGR-SEAEVMSIYEAWLVKHGKAQSQNSLVEKDR 71
Query: 239 RFNIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRYL 418
RF IFKDNL+ ++ HN + +Y LGLTRFADLT+DE+R+ +L KM + + RY
Sbjct: 72 RFEIFKDNLRFVDEHNE-KNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSLRYE 130
Query: 419 YKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQELL 598
+ GD LPE +DWR+KGAV VK+QG CG CWAF+ +GAVEG+N+I TG+L++LSEQEL+
Sbjct: 131 ARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELV 190
Query: 599 DCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVTI 778
DCD S N GC GG AFEFI+ N GI TDK YPY D C I + VTI
Sbjct: 191 DCD---TSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVD-GTCDQIRK-NAKVVTI 245
Query: 779 DSYEDAPHNDEMSLKKAVAHQPISVMIEA--ENMKLYKSGVFTGPCDHWYGNHNVVVVGY 952
DSYED P E SLKKAVAHQPIS+ IEA +LY SG+F G C +H VV VGY
Sbjct: 246 DSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQL-DHGVVAVGY 304
Query: 953 GTTERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKS 1105
G TE G+DYWI+RNSWG +WGESGY+++ RN +S+G CG+AI P YP+K+
Sbjct: 305 G-TENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYPIKN 354
>gi|226495425|ref|NP_001148706.1| cysteine protease 1 [Zea mays]
Length = 463
Score = 340 bits (871), Expect = 4e-091
Identities = 176/352 (50%), Positives = 234/352 (66%), Gaps = 14/352 (3%)
Frame = +2
Query: 77 TIALVILSALLLSSSLGGVTATETKGNEA--EVRRMYEQWLVENRKNSNALGEKERRFNI 250
T A L LLLS + + + G + E RRMY +W+ + + NA+GE+ERR+ +
Sbjct: 5 TTAAAALLLLLLSLAAAADMSIVSYGERSXEEARRMYAEWMAAHGRTYNAVGEEERRYQV 64
Query: 251 FKDNLKLIEAHNSVPD---RTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRYLY 421
F+DNL+ I+AHN+ D ++ LGL RFADLT+DE+RA +L + + +G RY
Sbjct: 65 FRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDEYRATYLGARTRPQRERKLGARYHA 124
Query: 422 KEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQELLD 601
+ + LPE VDWR KGAV VK+QG CG CWAF+ + AVEG+N+I TG+L+SLSEQEL+D
Sbjct: 125 ADNEDLPESVDWRAKGAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVD 184
Query: 602 CDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVTID 781
CD S N GC GG AFEFI++N GI T+K YPY D C + + VTID
Sbjct: 185 CD---TSYNQGCNGGLMDYAFEFIINNGGIDTEKDYPYKGTD-GRCD-VNRKNAKVVTID 239
Query: 782 SYEDAPHNDEMSLKKAVAHQPISVMIEAEN--MKLYKSGVFTGPCDHWYGNHNVVVVGYG 955
SYED P NDE SL+KAVA+QP+SV IEA +LY SG+FTG C +H V VGYG
Sbjct: 240 SYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLYSSGIFTGSCGTAL-DHGVTAVGYG 298
Query: 956 TTERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKSNS 1111
TE G+DYWI++NSWG++WGESGY++++RN S+G CG+A+ P YPLK +
Sbjct: 299 -TENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGKCGIAVEPSYPLKEGA 349
>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
Length = 462
Score = 340 bits (870), Expect = 5e-091
Identities = 185/353 (52%), Positives = 240/353 (67%), Gaps = 18/353 (5%)
Frame = +2
Query: 74 LTIALVILSALLLSSSLG-----GVTATETKGNEAEVRRMYEQWLVENRK--NSNALGEK 232
L +A+V +S+ + S + GV+ T + +EAEV +YE WLV++ K + N+L EK
Sbjct: 11 LFLAMVAVSSAVDMSIISYDEKHGVSTTGGR-SEAEVMSIYEAWLVKHGKAQSQNSLVEK 69
Query: 233 ERRFNIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDR 412
+RRF IFKDNL+ ++ HN + +Y LGLTRFADLT+DE+R+ +L KM + + R
Sbjct: 70 DRRFEIFKDNLRFVDEHNE-KNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSLR 128
Query: 413 YLYKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQE 592
Y + GD LPE +DWR+KGAV VK+QG CG CWAF+ +GAVEG+N+I TG+L++LSEQE
Sbjct: 129 YEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQE 188
Query: 593 LLDCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYV 772
L+DCD S N GC GG AFEFI+ N GI TDK YPY D C I + V
Sbjct: 189 LVDCD---TSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVD-GTCDQIRK-NAKVV 243
Query: 773 TIDSYEDAPHNDEMSLKKAVAHQPISVMIEA--ENMKLYKSGVFTGPCDHWYGNHNVVVV 946
TIDSYED P E SLKKAVAHQPIS+ IEA +LY SG+F G C +H VV V
Sbjct: 244 TIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQL-DHGVVAV 302
Query: 947 GYGTTERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKS 1105
GYG TE G+DYWI+RNSWG +WGESGY+++ RN +S+G CG+AI P YP+K+
Sbjct: 303 GYG-TENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYPIKN 354
>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
Length = 433
Score = 340 bits (870), Expect = 5e-091
Identities = 185/353 (52%), Positives = 240/353 (67%), Gaps = 18/353 (5%)
Frame = +2
Query: 74 LTIALVILSALLLSSSLG-----GVTATETKGNEAEVRRMYEQWLVENRK--NSNALGEK 232
L +A+V +S+ + S + GV+ T + +EAEV +YE WLV++ K + N+L EK
Sbjct: 11 LFLAMVAVSSAVDMSIISYDEKHGVSTTGGR-SEAEVMSIYEAWLVKHGKAQSQNSLVEK 69
Query: 233 ERRFNIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDR 412
+RRF IFKDNL+ ++ HN + +Y LGLTRFADLT+DE+R+ +L KM + + R
Sbjct: 70 DRRFEIFKDNLRFVDEHNE-KNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSLR 128
Query: 413 YLYKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQE 592
Y + GD LPE +DWR+KGAV VK+QG CG CWAF+ +GAVEG+N+I TG+L++LSEQE
Sbjct: 129 YEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQE 188
Query: 593 LLDCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYV 772
L+DCD S N GC GG AFEFI+ N GI TDK YPY D C I + V
Sbjct: 189 LVDCD---TSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVD-GTCDQIRK-NAKVV 243
Query: 773 TIDSYEDAPHNDEMSLKKAVAHQPISVMIEA--ENMKLYKSGVFTGPCDHWYGNHNVVVV 946
TIDSYED P E SLKKAVAHQPIS+ IEA +LY SG+F G C +H VV V
Sbjct: 244 TIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQL-DHGVVAV 302
Query: 947 GYGTTERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKS 1105
GYG TE G+DYWI+RNSWG +WGESGY+++ RN +S+G CG+AI P YP+K+
Sbjct: 303 GYG-TENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYPIKN 354
>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
Length = 423
Score = 333 bits (853), Expect = 5e-089
Identities = 170/308 (55%), Positives = 219/308 (71%), Gaps = 10/308 (3%)
Frame = +2
Query: 191 LVENRKNSNALGEKERRFNIFKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLR 370
LV++ KN NALG KE+RF IFKDNL+ I+ HN +++++LGL +FADL+++E++++ L
Sbjct: 11 LVKHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKSMFLG 70
Query: 371 GKMVRTSDPVIGDRYLYKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLN 550
G+MVR DR+ Y GD LP+ VDWREKGAV PVK+QG CG CWAF+ V AVEG+N
Sbjct: 71 GRMVRDRKGFESDRFKYGVGDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTVAAVEGIN 130
Query: 551 KIKTGELVSLSEQELLDCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDT 730
+I TG+L+SLSEQEL+DCD+G N GC GG AFEFIV N GI T+ YPY D
Sbjct: 131 QIATGDLISLSEQELVDCDKG---FNQGCNGGFMDYAFEFIVKNGGIDTEDDYPYKGVDG 187
Query: 731 AACKAIEMVTTRYVTIDSYEDAPHNDEMSLKKAVAHQPISVMIEA--ENMKLYKSGVFTG 904
+ + VTI+ +ED P NDE SLKKAVAHQP+SV IEA +LY+SG+F G
Sbjct: 188 QCDQ--NRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGIFNG 245
Query: 905 PCDHWYGNHNVVVVGYGTTERGEDYWIIRNSWGANWGESGYIKLQRNF-HNSTGNCGVAI 1081
C +H VV VGYG TE G+DYWI+RNSWG NWGE+GYI+L+RN +TG CG+A+
Sbjct: 246 LCGTDL-DHGVVAVGYG-TEDGKDYWIVRNSWGPNWGENGYIRLERNVASTNTGKCGIAM 303
Query: 1082 RPVYPLKS 1105
+P YP K+
Sbjct: 304 QPSYPTKT 311
>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695
[Arabidopsis lyrata subsp. lyrata]
Length = 308
Score = 317 bits (811), Expect = 3e-084
Identities = 160/245 (65%), Positives = 186/245 (75%), Gaps = 8/245 (3%)
Frame = +2
Query: 407 DRYLYKEGDVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSE 586
DRYLYKEGD+LP+E+DWR KGAVVPVK+QG+CG CWAF+AVGAVEG+N+IKTGEL+SLS+
Sbjct: 66 DRYLYKEGDILPDEIDWRAKGAVVPVKDQGNCGSCWAFSAVGAVEGINQIKTGELISLSD 125
Query: 587 QELLDCDRGEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTR 766
QEL+DCDRG N GC GG AFEFI++N GI +D+ YPYT D C A + TR
Sbjct: 126 QELIDCDRG--FVNAGCEGGVMNYAFEFIINNGGIESDQDYPYTATDLGVCNADKKNNTR 183
Query: 767 YVTIDSYEDAPHNDEMSLKKAVAHQPISVMIEAEN--MKLYKSGVFTGPCDHWYGNHNVV 940
V ID YE NDE SLKKAVAHQP+ V IEA + KLYKSGVFTG C Y +H VV
Sbjct: 184 VVKIDGYEYVAQNDEKSLKKAVAHQPVGVAIEASSQAFKLYKSGVFTGTCG-IYLDHGVV 242
Query: 941 VVGYGTTERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYPLKSN--SA 1114
VVGYGT+ GEDYWIIRNSWG NWGE+GY+KLQRN +S G CGVA+ P YP KS+ S+
Sbjct: 243 VVGYGTSS-GEDYWIIRNSWGLNWGENGYVKLQRNIDDSFGKCGVAMMPSYPTKSSFPSS 301
Query: 1115 FGLLS 1129
F LS
Sbjct: 302 FDFLS 306
>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
Length = 343
Score = 283 bits (723), Expect = 5e-074
Identities = 152/334 (45%), Positives = 206/334 (61%), Gaps = 9/334 (2%)
Frame = +2
Query: 101 ALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRKNSNALGEKERRFNIFKDNLKLIEA 280
ALL+ L V T +A + ++QW+ + K N E E+RF IFK+N+ IE
Sbjct: 13 ALLMCLGLWAVQVTSRTLQDASMYERHQQWMGQYAKIYNDHQEWEKRFQIFKENVNYIET 72
Query: 281 HNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRYLYKEGDVLPEEVDWR 460
N R Y+LG+ +F DLT++EF A R K S + + Y Y+ +P VDWR
Sbjct: 73 SNKEGGRFYKLGVNQFVDLTNEEFIAPRNRFKGHMCSSIIRTNTYKYENVTTVPSNVDWR 132
Query: 461 EKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQELLDCD-RGEDSGNFGC 637
+KGAV PVK+QG CG CWAF+AV A EG++++ TG+L+SLSEQEL+DCD +G D GC
Sbjct: 133 QKGAVTPVKDQGQCGCCWAFSAVAATEGIHQLSTGKLISLSEQELVDCDTKGVDQ---GC 189
Query: 638 LGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVTIDSYEDAPHNDEMS 817
GG DAF+FI+ N G+ T+ YPY D C A E + TI SYED P N+E +
Sbjct: 190 EGGLMDDAFKFIIQNHGLDTEAKYPYQGVD-GTCNANE-ASINAATITSYEDVPTNNEQA 247
Query: 818 LKKAVAHQPISVMIEA--ENMKLYKSGVFTGPCDHWYGNHNVVVVGYGTTERGEDYWIIR 991
L+KAVA+QPISV I+A + + Y SGVFTG C +H V VGYG ++ G YW+++
Sbjct: 248 LQKAVANQPISVAIDASGSDFQFYTSGVFTGSCGTEL-DHGVTAVGYGVSDDGTKYWLVK 306
Query: 992 NSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVY 1093
NSWG +WGE GYI++QR G CG+A++ Y
Sbjct: 307 NSWGTSWGEEGYIRMQRGVDAVEGLCGIAMQASY 340
>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
Length = 361
Score = 280 bits (715), Expect = 5e-073
Identities = 153/345 (44%), Positives = 203/345 (58%), Gaps = 16/345 (4%)
Frame = +2
Query: 71 FLTIALVILSALLLSSSLGGVTATETKGNEAEVRRMYEQWLVENRKNSNALGEKERRFNI 250
F+ AL++L A AT EA + +EQW+++ + EK RF I
Sbjct: 28 FMIAALILLGA-------WACQATSRTLPEASMFERHEQWMIQYGRVYKDEAEKSVRFQI 80
Query: 251 FKDNLKLIEAHNSVPDRTYELGLTRFADLTDDEFRAIHLRGKMVRTSDPVIGDRYLYKEG 430
F DN+K IE N ++Y+L + FAD T++EF+A KM +S P + Y+
Sbjct: 81 FMDNVKFIEEFNKDGRQSYKLAVNEFADQTNEEFQASRNGYKMAVSSRPSQTTLFRYENV 140
Query: 431 DVLPEEVDWREKGAVVPVKNQGDCGGCWAFAAVGAVEGLNKIKTGELVSLSEQELLDCDR 610
+P +DWR+KGAV PVK+QG CG CWAF+ + A EG+ K+KTG+L+SLSEQEL+DCD+
Sbjct: 141 TAVPSSMDWRKKGAVTPVKDQGQCGSCWAFSTIAATEGITKLKTGKLISLSEQELVDCDK 200
Query: 611 -GEDSGNFGCLGGNAADAFEFIVDNDGIVTDKVYPYTENDTAACKAIEMVTTRYVTIDSY 787
GED GC GG D FEFIV N GI + YPYT D C + E +R I Y
Sbjct: 201 TGEDQ---GCEGGYMEDGFEFIVKNKGIALEASYPYTAAD-GTCNSKE-EASRAAKISGY 255
Query: 788 EDAPHNDEMSLKKAVAHQPISVMIEAENM--KLYKSGVFTGPCDHWYGNHNVVVVGYGTT 961
E P N E +L KAVA+QP+SV I+A + + Y SGVFTG C +H V VGYG T
Sbjct: 256 EKVPANSETALLKAVANQPVSVSIDASGVAFQFYSSGVFTGECGTDL-DHGVTAVGYGKT 314
Query: 962 ERGEDYWIIRNSWGANWGESGYIKLQRNFHNSTGNCGVAIRPVYP 1096
G YW+++NSWGA+WG+SGYI +QR G CG+A+ YP
Sbjct: 315 SDGTKYWLVKNSWGASWGDSGYIMMQRGVAAKGGLCGIAMDASYP 359
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,383,062,991,470
Number of Sequences: 15229318
Number of Extensions: 2383062991470
Number of Successful Extensions: 601402404
Number of sequences better than 0.0: 0
|