BLASTX 7.6.2
Query= UN41442 /QuerySize=1338
(1337 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|312281991|dbj|BAJ33861.1| unnamed protein product [Thellungie... 692 5e-197
gi|15230200|ref|NP_191261.1| strictosidine synthase family prote... 658 8e-187
gi|15230182|ref|NP_191260.1| strictosidine synthase family prote... 542 9e-152
gi|13877837|gb|AAK43996.1|AF370181_1 unknown protein [Arabidopsi... 540 2e-151
gi|297820484|ref|XP_002878125.1| strictosidine synthase family p... 540 3e-151
gi|6759491|emb|CAB69786.1| hypothetical protein [Arabidopsis tha... 532 7e-149
gi|297827769|ref|XP_002881767.1| hypothetical protein ARALYDRAFT... 530 2e-148
gi|79315403|ref|NP_001030876.1| strictosidine synthase family pr... 473 4e-131
gi|3894193|gb|AAC78542.1| putative strictosidine synthase [Arabi... 455 8e-126
gi|145360869|ref|NP_181662.3| strictosidine synthase-like 1 [Ara... 455 8e-126
gi|297820488|ref|XP_002878127.1| strictosidine synthase family p... 349 8e-094
gi|6911873|emb|CAB72173.1| putative protein [Arabidopsis thaliana] 330 3e-088
gi|30694556|ref|NP_191262.2| strictosidine synthase family prote... 330 3e-088
gi|297820490|ref|XP_002878128.1| strictosidine synthase family p... 329 9e-088
gi|110743953|dbj|BAE99809.1| hypothetical protein [Arabidopsis t... 328 2e-087
gi|255583680|ref|XP_002532594.1| strictosidine synthase, putativ... 322 8e-086
gi|225441250|ref|XP_002273764.1| PREDICTED: hypothetical protein... 322 1e-085
gi|156763850|emb|CAO99127.1| strictosidine synthase-like protein... 321 2e-085
gi|224139742|ref|XP_002323255.1| predicted protein [Populus tric... 290 3e-076
gi|224139738|ref|XP_002323253.1| predicted protein [Populus tric... 265 2e-068
>gi|312281991|dbj|BAJ33861.1| unnamed protein product [Thellungiella halophila]
Length = 370
Score = 692 bits (1784), Expect = 5e-197
Identities = 326/370 (88%), Positives = 348/370 (94%), Gaps = 1/370 (0%)
Frame = -2
Query: 1273 MPISEKIPTWAVVPATFAVFSVISYQILIAPDNLNGTKNVLSMAKTIPLPVDGPESIEWD 1094
MP+S+K+PTWA VPA AVFSVISYQ +IAPDNL GTK+VLSMAKTIPLPV GPESIEWD
Sbjct: 1 MPLSQKVPTWAAVPAVLAVFSVISYQTIIAPDNLKGTKHVLSMAKTIPLPVHGPESIEWD 60
Query: 1093 PQGGGPYAAVVDGRILKWRGDGLGWVEFAYTSPHRGNCSRHEVVPTCGRPLGLSFEKKTG 914
PQGGGPYAAVVDGRILKW+GDG+GWVEFAYTSPHRGNCSRHEVVPTCGRPLGL FEKKTG
Sbjct: 61 PQGGGPYAAVVDGRILKWQGDGIGWVEFAYTSPHRGNCSRHEVVPTCGRPLGLKFEKKTG 120
Query: 913 DLYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFR 734
DLYICDGY GVMKVGPEGGLAELVVDQ EGRKVMFANQ+DIDEEEDV YFNDSSDKYHFR
Sbjct: 121 DLYICDGYLGVMKVGPEGGLAELVVDQAEGRKVMFANQIDIDEEEDVLYFNDSSDKYHFR 180
Query: 733 EVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHRY 554
EVFYV NG+R+GRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCES+TGLVHRY
Sbjct: 181 EVFYVASNGDRTGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCESSTGLVHRY 240
Query: 553 WIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEKT 374
WIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFW+GIHCKKN GR +NN + LGK+VEKT
Sbjct: 241 WIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWLGIHCKKNPLGRFMINN-RWLGKIVEKT 299
Query: 373 VKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGSV 194
V L+LLI ++NGFKPHG+AVKISGETGEI+E+LED EGKTMQYVSEAYERDDGK+WFGSV
Sbjct: 300 VNLDLLIAVMNGFKPHGIAVKISGETGEILEVLEDIEGKTMQYVSEAYERDDGKLWFGSV 359
Query: 193 FKPAVWVLDR 164
F PAVWVLDR
Sbjct: 360 FTPAVWVLDR 369
>gi|15230200|ref|NP_191261.1| strictosidine synthase family protein [Arabidopsis
thaliana]
Length = 370
Score = 658 bits (1696), Expect = 8e-187
Identities = 311/370 (84%), Positives = 341/370 (92%), Gaps = 1/370 (0%)
Frame = -2
Query: 1273 MPISEKIPTWAVVPATFAVFSVISYQILIAPDNLNGTKNVLSMAKTIPLPVDGPESIEWD 1094
MPI++KIPTW VPA FAV SVISYQ LI P+NL G KNVL+MAKTIP+PV GPESIE+D
Sbjct: 1 MPINQKIPTWFAVPAVFAVLSVISYQTLIVPENLEGAKNVLTMAKTIPIPVAGPESIEFD 60
Query: 1093 PQGGGPYAAVVDGRILKWRGDGLGWVEFAYTSPHRGNCSRHEVVPTCGRPLGLSFEKKTG 914
P+G GPYAAVVDGRILKWRGD LGWV+FAYTSPHRGNCS+ EVVPTCGRPLGL+FEKKTG
Sbjct: 61 PKGEGPYAAVVDGRILKWRGDDLGWVDFAYTSPHRGNCSKTEVVPTCGRPLGLTFEKKTG 120
Query: 913 DLYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFR 734
DLYICDGY G+MKVGPEGGLAEL+VD+ EGRKVMFANQ DIDEEEDVFYFNDSSDKYHFR
Sbjct: 121 DLYICDGYLGLMKVGPEGGLAELIVDEAEGRKVMFANQGDIDEEEDVFYFNDSSDKYHFR 180
Query: 733 EVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHRY 554
+VF+V V+GERSGRVIRY+KKTKEAKV+MDNL CNNGLALNKDRSFLI+CES T LVHRY
Sbjct: 181 DVFFVAVSGERSGRVIRYDKKTKEAKVIMDNLVCNNGLALNKDRSFLITCESGTSLVHRY 240
Query: 553 WIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEKT 374
WIKGPKAGTRDIFAKVPGYPDNIRLT TGDFWIG+HCKKNL GRL V Y+ LGKLVEKT
Sbjct: 241 WIKGPKAGTRDIFAKVPGYPDNIRLTSTGDFWIGLHCKKNLIGRLIV-KYKWLGKLVEKT 299
Query: 373 VKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGSV 194
+KLE +I +NGFKPHGVAVKISGETGE++E+LEDKEGKTM+YVSEAYERDDGK+WFGSV
Sbjct: 300 MKLEYVIAFINGFKPHGVAVKISGETGEVLELLEDKEGKTMKYVSEAYERDDGKLWFGSV 359
Query: 193 FKPAVWVLDR 164
+ PAVWVLDR
Sbjct: 360 YWPAVWVLDR 369
>gi|15230182|ref|NP_191260.1| strictosidine synthase family protein [Arabidopsis
thaliana]
Length = 376
Score = 542 bits (1394), Expect = 9e-152
Identities = 255/370 (68%), Positives = 303/370 (81%), Gaps = 2/370 (0%)
Frame = -2
Query: 1273 MPISEKIPT-WAVVPATFAVFSVISYQILIAPDNLNGTKNVLSMAKTIPLPVDGPESIEW 1097
MPIS ++ T P AV + +I PDNL GTK+VL AKTIPLPVDGPES+E+
Sbjct: 1 MPISRRVLTPITAAPVILAVLCFFFWSSIIGPDNLKGTKHVLQDAKTIPLPVDGPESLEF 60
Query: 1096 DPQGGGPYAAVVDGRILKWRGDGLGWVEFAYTSPHRGNCSRHEVVPTCGRPLGLSFEKKT 917
DPQG GPY V DGRILKWRG+ LGWV+FAYTSPHR NCS HEVVP+CGRPLGLSFE+KT
Sbjct: 61 DPQGEGPYVGVTDGRILKWRGEELGWVDFAYTSPHRDNCSSHEVVPSCGRPLGLSFERKT 120
Query: 916 GDLYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHF 737
GDLYICDGYFGVMKVGPEGGLAELVVD+ EGRKVMFANQ DIDEEED+FYFNDSSD YHF
Sbjct: 121 GDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQGDIDEEEDIFYFNDSSDTYHF 180
Query: 736 REVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHR 557
R+VFYV+++G + GRVIRY+ K KEAKV+MD LR NGLAL+K+ SF+++CES+T + HR
Sbjct: 181 RDVFYVSLSGTKVGRVIRYDMKKKEAKVIMDKLRLPNGLALSKNGSFVVTCESSTNICHR 240
Query: 556 YWIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEK 377
W+KGPK+GT ++FA +PG PDNIR TPTGDFW+ +HCKKNLF R AV + +G+
Sbjct: 241 IWVKGPKSGTNEVFATLPGSPDNIRRTPTGDFWVALHCKKNLFTR-AVLIHTWVGRFFMN 299
Query: 376 TVKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGS 197
T+K+E +I +NG KPHG+ VK+SGETGEI+EILED EGKT++YVSEAYE DGK+W GS
Sbjct: 300 TMKMETVIHFMNGGKPHGIVVKLSGETGEILEILEDSEGKTVKYVSEAYETKDGKLWIGS 359
Query: 196 VFKPAVWVLD 167
V+ PAVWVLD
Sbjct: 360 VYWPAVWVLD 369
>gi|13877837|gb|AAK43996.1|AF370181_1 unknown protein [Arabidopsis thaliana]
Length = 376
Score = 540 bits (1390), Expect = 2e-151
Identities = 254/370 (68%), Positives = 302/370 (81%), Gaps = 2/370 (0%)
Frame = -2
Query: 1273 MPISEKIPT-WAVVPATFAVFSVISYQILIAPDNLNGTKNVLSMAKTIPLPVDGPESIEW 1097
MPIS ++ T P AV + +I PDNL GTK+VL AKTIPLPVDGPES+E+
Sbjct: 1 MPISRRVLTPITAAPVILAVLCFFFWSSIIGPDNLKGTKHVLQDAKTIPLPVDGPESLEF 60
Query: 1096 DPQGGGPYAAVVDGRILKWRGDGLGWVEFAYTSPHRGNCSRHEVVPTCGRPLGLSFEKKT 917
DPQG GPY V DGRILKWRG+ LGWV+FAYTSPHR NCS HEVVP+CGRPLGLSFE+KT
Sbjct: 61 DPQGEGPYVGVTDGRILKWRGEELGWVDFAYTSPHRDNCSSHEVVPSCGRPLGLSFERKT 120
Query: 916 GDLYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHF 737
GDLYICDGYFGVMKVGPEGGL ELVVD+ EGRKVMFANQ DIDEEED+FYFNDSSD YHF
Sbjct: 121 GDLYICDGYFGVMKVGPEGGLGELVVDEAEGRKVMFANQGDIDEEEDIFYFNDSSDTYHF 180
Query: 736 REVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHR 557
R+VFYV+++G + GRVIRY+ K KEAKV+MD LR NGLAL+K+ SF+++CES+T + HR
Sbjct: 181 RDVFYVSLSGTKVGRVIRYDMKKKEAKVIMDKLRLPNGLALSKNGSFVVTCESSTNICHR 240
Query: 556 YWIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEK 377
W+KGPK+GT ++FA +PG PDNIR TPTGDFW+ +HCKKNLF R AV + +G+
Sbjct: 241 IWVKGPKSGTNEVFATLPGSPDNIRRTPTGDFWVALHCKKNLFTR-AVLIHTWVGRFFMN 299
Query: 376 TVKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGS 197
T+K+E +I +NG KPHG+ VK+SGETGEI+EILED EGKT++YVSEAYE DGK+W GS
Sbjct: 300 TMKMETVIHFMNGGKPHGIVVKLSGETGEILEILEDSEGKTVKYVSEAYETKDGKLWIGS 359
Query: 196 VFKPAVWVLD 167
V+ PAVWVLD
Sbjct: 360 VYWPAVWVLD 369
>gi|297820484|ref|XP_002878125.1| strictosidine synthase family protein
[Arabidopsis lyrata subsp. lyrata]
Length = 371
Score = 540 bits (1389), Expect = 3e-151
Identities = 250/370 (67%), Positives = 304/370 (82%), Gaps = 2/370 (0%)
Frame = -2
Query: 1273 MPISEKIPT-WAVVPATFAVFSVISYQILIAPDNLNGTKNVLSMAKTIPLPVDGPESIEW 1097
MPIS ++ T + P AV + +I PDN+ GTK+VL AKTIPLP DGPES+E+
Sbjct: 1 MPISRRVLTPVSAAPVILAVLCFFFWSSIIGPDNIKGTKHVLQDAKTIPLPADGPESLEF 60
Query: 1096 DPQGGGPYAAVVDGRILKWRGDGLGWVEFAYTSPHRGNCSRHEVVPTCGRPLGLSFEKKT 917
DPQG GPY V DGRILKWRG+ LGWV+FAYTSPHR NCSRHEVVP+CGRPLGL+FEKKT
Sbjct: 61 DPQGEGPYVGVTDGRILKWRGEELGWVDFAYTSPHRDNCSRHEVVPSCGRPLGLTFEKKT 120
Query: 916 GDLYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHF 737
GDLYICDGYFG+MKVGP+GGLAELVVD+ EGRKVMFANQ DIDEEED+FYFNDSSD YHF
Sbjct: 121 GDLYICDGYFGLMKVGPQGGLAELVVDEAEGRKVMFANQGDIDEEEDIFYFNDSSDTYHF 180
Query: 736 REVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHR 557
REVFYV+++G + GRVIRY+ K KEAKV+MD LR NGLAL+K+ SF+++CES+T + HR
Sbjct: 181 REVFYVSLSGTKVGRVIRYDMKKKEAKVIMDKLRLPNGLALSKNGSFVVTCESSTNICHR 240
Query: 556 YWIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEK 377
W+KGPK+GT ++FA +PG PDNIR TPTGDFW+ +HCKKNLF R+A+ + +G+
Sbjct: 241 IWVKGPKSGTNEVFATLPGSPDNIRRTPTGDFWVALHCKKNLFTRVAL-IHSLVGRFFMN 299
Query: 376 TVKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGS 197
T+K+E +I +NG KPHG+ VK+SGETGEI+EILED EGKT++Y SEAYE +DGK+W GS
Sbjct: 300 TMKMETVIHFMNGGKPHGIVVKLSGETGEILEILEDSEGKTVKYASEAYETEDGKLWIGS 359
Query: 196 VFKPAVWVLD 167
V+ PAVWV D
Sbjct: 360 VYWPAVWVYD 369
>gi|6759491|emb|CAB69786.1| hypothetical protein [Arabidopsis thaliana]
Length = 352
Score = 532 bits (1369), Expect = 7e-149
Identities = 247/342 (72%), Positives = 291/342 (85%), Gaps = 1/342 (0%)
Frame = -2
Query: 1192 LIAPDNLNGTKNVLSMAKTIPLPVDGPESIEWDPQGGGPYAAVVDGRILKWRGDGLGWVE 1013
+I PDNL GTK+VL AKTIPLPVDGPES+E+DPQG GPY V DGRILKWRG+ LGWV+
Sbjct: 5 IIGPDNLKGTKHVLQDAKTIPLPVDGPESLEFDPQGEGPYVGVTDGRILKWRGEELGWVD 64
Query: 1012 FAYTSPHRGNCSRHEVVPTCGRPLGLSFEKKTGDLYICDGYFGVMKVGPEGGLAELVVDQ 833
FAYTSPHR NCS HEVVP+CGRPLGLSFE+KTGDLYICDGYFGVMKVGPEGGLAELVVD+
Sbjct: 65 FAYTSPHRDNCSSHEVVPSCGRPLGLSFERKTGDLYICDGYFGVMKVGPEGGLAELVVDE 124
Query: 832 VEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFREVFYVTVNGERSGRVIRYNKKTKEAKV 653
EGRKVMFANQ DIDEEED+FYFNDSSD YHFR+VFYV+++G + GRVIRY+ K KEAKV
Sbjct: 125 AEGRKVMFANQGDIDEEEDIFYFNDSSDTYHFRDVFYVSLSGTKVGRVIRYDMKKKEAKV 184
Query: 652 VMDNLRCNNGLALNKDRSFLISCESATGLVHRYWIKGPKAGTRDIFAKVPGYPDNIRLTP 473
+MD LR NGLAL+K+ SF+++CES+T HR W+KGPK+GT ++FA +PG PDNIR TP
Sbjct: 185 IMDKLRLPNGLALSKNGSFVVTCESSTNTCHRIWVKGPKSGTNEVFATLPGSPDNIRRTP 244
Query: 472 TGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEKTVKLELLIGLVNGFKPHGVAVKISGETG 293
TGDFW+ +HCKKNLF R AV + +G+ T+K+E +I +NG KPHG+ VK+SGETG
Sbjct: 245 TGDFWVALHCKKNLFTR-AVLIHTWVGRFFMNTMKMETVIHFMNGGKPHGIVVKLSGETG 303
Query: 292 EIVEILEDKEGKTMQYVSEAYERDDGKIWFGSVFKPAVWVLD 167
EI+EILED EGKT++YVSEAYE DGK+W GSV+ PAVWVLD
Sbjct: 304 EILEILEDSEGKTVKYVSEAYETKDGKLWIGSVYWPAVWVLD 345
>gi|297827769|ref|XP_002881767.1| hypothetical protein ARALYDRAFT_321816
[Arabidopsis lyrata subsp. lyrata]
Length = 370
Score = 530 bits (1365), Expect = 2e-148
Identities = 252/371 (67%), Positives = 301/371 (81%), Gaps = 3/371 (0%)
Frame = -2
Query: 1273 MPISEKIPTWAVVPATFAVFSVISYQILIAPDNLNGTKNVLSMAKTIPLPVDGPESIEWD 1094
MP+S K+ TWAVV A AV V +I P+++ G+KNVL+MA+TIPLPVDGPES++WD
Sbjct: 1 MPVSRKVQTWAVVVAVMAVLVVFVGPYIIGPESIEGSKNVLTMARTIPLPVDGPESLDWD 60
Query: 1093 PQGGGPYAAVVDGRILKWRGDGLGWVEFAYTSPHRGNCSRHEVVPTCGRPLGLSFEKKTG 914
P+G GPY V DGRILKW G+ LGWV+FAY+SPHR NCSRH+V P CGRPLGLSFEKK+G
Sbjct: 61 PRGEGPYVGVTDGRILKWSGEDLGWVQFAYSSPHRENCSRHKVEPACGRPLGLSFEKKSG 120
Query: 913 DLYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHF- 737
DLY CDGY G+MKVGP+GGLAE VVD+ EG+KVMFANQMDIDEEED YFNDSSD YHF
Sbjct: 121 DLYFCDGYLGIMKVGPKGGLAEKVVDEAEGQKVMFANQMDIDEEEDAIYFNDSSDTYHFG 180
Query: 736 REVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHR 557
R+VFY + GE++GR IRY+KKTKEAKV+MD L NGLAL+KD SF++SCE T LVHR
Sbjct: 181 RDVFYAFLCGEKTGRAIRYDKKTKEAKVIMDRLHFPNGLALSKDGSFVLSCEVPTQLVHR 240
Query: 556 YWIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEK 377
YW KGPKAGTRDIFAK+PGY DNIR T TGDFW+ +H KK F RL++ + +GK K
Sbjct: 241 YWAKGPKAGTRDIFAKLPGYADNIRRTETGDFWVALHSKKTPFSRLSM-IHPWVGKFFIK 299
Query: 376 TVKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGS 197
T+K+ELL+ L G KPH VAVK+SG+TGEI+EILED EGK M+++SE ER DG++WFGS
Sbjct: 300 TLKMELLLFLFEGGKPHAVAVKLSGKTGEIMEILEDSEGKNMKFISEVQER-DGRLWFGS 358
Query: 196 VFKPAVWVLDR 164
VF P+VWVLDR
Sbjct: 359 VFLPSVWVLDR 369
>gi|79315403|ref|NP_001030876.1| strictosidine synthase family protein
[Arabidopsis thaliana]
Length = 356
Score = 473 bits (1216), Expect = 4e-131
Identities = 224/262 (85%), Positives = 245/262 (93%), Gaps = 1/262 (0%)
Frame = -2
Query: 949 RPLGLSFEKKTGDLYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEEDVF 770
RPLGL+FEKKTGDLYICDGY G+MKVGPEGGLAEL+VD+ EGRKVMFANQ DIDEEEDVF
Sbjct: 95 RPLGLTFEKKTGDLYICDGYLGLMKVGPEGGLAELIVDEAEGRKVMFANQGDIDEEEDVF 154
Query: 769 YFNDSSDKYHFREVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLI 590
YFNDSSDKYHFR+VF+V V+GERSGRVIRY+KKTKEAKV+MDNL CNNGLALNKDRSFLI
Sbjct: 155 YFNDSSDKYHFRDVFFVAVSGERSGRVIRYDKKTKEAKVIMDNLVCNNGLALNKDRSFLI 214
Query: 589 SCESATGLVHRYWIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRLAVN 410
+CES T LVHRYWIKGPKAGTRDIFAKVPGYPDNIRLT TGDFWIG+HCKKNL GRL V
Sbjct: 215 TCESGTSLVHRYWIKGPKAGTRDIFAKVPGYPDNIRLTSTGDFWIGLHCKKNLIGRLIV- 273
Query: 409 NYQCLGKLVEKTVKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVSEAY 230
Y+ LGKLVEKT+KLE +I +NGFKPHGVAVKISGETGE++E+LEDKEGKTM+YVSEAY
Sbjct: 274 KYKWLGKLVEKTMKLEYVIAFINGFKPHGVAVKISGETGEVLELLEDKEGKTMKYVSEAY 333
Query: 229 ERDDGKIWFGSVFKPAVWVLDR 164
ERDDGK+WFGSV+ PAVWVLDR
Sbjct: 334 ERDDGKLWFGSVYWPAVWVLDR 355
>gi|3894193|gb|AAC78542.1| putative strictosidine synthase [Arabidopsis
thaliana]
Length = 395
Score = 455 bits (1170), Expect = 8e-126
Identities = 221/326 (67%), Positives = 258/326 (79%), Gaps = 9/326 (2%)
Frame = -2
Query: 1120 DGPESIEW------DPQGGGPYAAVVDGRILKWRGDGLGWVEFAYTSPHRGNCSRHEVVP 959
D P S W DP+G GPY V DGRILKW G+ LGW+EFAY+SPHR NCS H+V P
Sbjct: 71 DNPPSRGWTGEPGLDPRGEGPYVGVTDGRILKWSGEDLGWIEFAYSSPHRKNCSSHKVEP 130
Query: 958 TCGRPLGLSFEKKTGDLYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEE 779
CGRPLGLSFEKK+GDLY CDGY GVMKVGP+GGLAE VVD+VEG+KVMFANQMDIDEEE
Sbjct: 131 ACGRPLGLSFEKKSGDLYFCDGYLGVMKVGPKGGLAEKVVDEVEGQKVMFANQMDIDEEE 190
Query: 778 DVFYFNDSSDKYHF-REVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDR 602
D YFNDSSD YHF R+VFY + GE++GR IRY+KKTKEAKV+MD L NGLAL+ D
Sbjct: 191 DAIYFNDSSDTYHFGRDVFYAFLCGEKTGRAIRYDKKTKEAKVIMDRLHFPNGLALSIDG 250
Query: 601 SFLISCESATGLVHRYWIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGR 422
SF++SCE T LVHRYW KGP AGTRDIFAK+PGY DNIR T TGDFW+ +H KK F R
Sbjct: 251 SFVLSCEVPTQLVHRYWAKGPNAGTRDIFAKLPGYADNIRRTETGDFWVALHSKKTPFSR 310
Query: 421 LAVNNYQCLGKLVEKTVKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYV 242
L++ + +GK KT+K+ELL+ L G KPH VAVK+SG+TGEI+EILED EGK M+++
Sbjct: 311 LSM-IHPWVGKFFIKTLKMELLVFLFEGGKPHAVAVKLSGKTGEIMEILEDSEGKNMKFI 369
Query: 241 SEAYERDDGKIWFGSVFKPAVWVLDR 164
SE ER DG++WFGSVF P+VWVLDR
Sbjct: 370 SEVQER-DGRLWFGSVFLPSVWVLDR 394
>gi|145360869|ref|NP_181662.3| strictosidine synthase-like 1 [Arabidopsis
thaliana]
Length = 394
Score = 455 bits (1170), Expect = 8e-126
Identities = 219/324 (67%), Positives = 256/324 (79%), Gaps = 8/324 (2%)
Frame = -2
Query: 1120 DGPESIEW------DPQGGGPYAAVVDGRILKWRGDGLGWVEFAYTSPHRGNCSRHEVVP 959
D P S W DP+G GPY V DGRILKW G+ LGW+EFAY+SPHR NCS H+V P
Sbjct: 71 DNPPSRGWTGEPGLDPRGEGPYVGVTDGRILKWSGEDLGWIEFAYSSPHRKNCSSHKVEP 130
Query: 958 TCGRPLGLSFEKKTGDLYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEE 779
CGRPLGLSFEKK+GDLY CDGY GVMKVGP+GGLAE VVD+VEG+KVMFANQMDIDEEE
Sbjct: 131 ACGRPLGLSFEKKSGDLYFCDGYLGVMKVGPKGGLAEKVVDEVEGQKVMFANQMDIDEEE 190
Query: 778 DVFYFNDSSDKYHFREVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRS 599
D YFNDSSD YHF +VFY + GE++GR IRY+KKTKEAKV+MD L NGLAL+ D S
Sbjct: 191 DAIYFNDSSDTYHFGDVFYAFLCGEKTGRAIRYDKKTKEAKVIMDRLHFPNGLALSIDGS 250
Query: 598 FLISCESATGLVHRYWIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRL 419
F++SCE T LVHRYW KGP AGTRDIFAK+PGY DNIR T TGDFW+ +H KK F RL
Sbjct: 251 FVLSCEVPTQLVHRYWAKGPNAGTRDIFAKLPGYADNIRRTETGDFWVALHSKKTPFSRL 310
Query: 418 AVNNYQCLGKLVEKTVKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVS 239
++ + +GK KT+K+ELL+ L G KPH VAVK+SG+TGEI+EILED EGK M+++S
Sbjct: 311 SM-IHPWVGKFFIKTLKMELLVFLFEGGKPHAVAVKLSGKTGEIMEILEDSEGKNMKFIS 369
Query: 238 EAYERDDGKIWFGSVFKPAVWVLD 167
E ER DG++WFGSVF P+VWVLD
Sbjct: 370 EVQER-DGRLWFGSVFLPSVWVLD 392
>gi|297820488|ref|XP_002878127.1| strictosidine synthase family protein
[Arabidopsis lyrata subsp. lyrata]
Length = 343
Score = 349 bits (894), Expect = 8e-094
Identities = 166/196 (84%), Positives = 181/196 (92%), Gaps = 1/196 (0%)
Frame = -2
Query: 754 SDKYHFREVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCESA 575
SDKYHFR+VF+V V+GERSGRVIRY+KKTKEAKVVMDNL CNNGLALNKDRSFLI+CES
Sbjct: 147 SDKYHFRDVFFVAVSGERSGRVIRYDKKTKEAKVVMDNLVCNNGLALNKDRSFLITCESG 206
Query: 574 TGLVHRYWIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRLAVNNYQCL 395
T LVHRYWIKGPKAGTRDIFAKVPGYPDNIRLT TGDFWIGIHCKKNL GRL V Y+ L
Sbjct: 207 TSLVHRYWIKGPKAGTRDIFAKVPGYPDNIRLTSTGDFWIGIHCKKNLLGRLIV-RYKWL 265
Query: 394 GKLVEKTVKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVSEAYERDDG 215
GKLVEKT+KLE +I +NGFKP GVAVKISGETGE++E+LEDKEGKTM+YVSEAYERDDG
Sbjct: 266 GKLVEKTIKLEYVIAFINGFKPQGVAVKISGETGEVLEVLEDKEGKTMKYVSEAYERDDG 325
Query: 214 KIWFGSVFKPAVWVLD 167
K+WFGSV+ PAVWVLD
Sbjct: 326 KLWFGSVYWPAVWVLD 341
>gi|6911873|emb|CAB72173.1| putative protein [Arabidopsis thaliana]
Length = 372
Score = 330 bits (846), Expect = 3e-088
Identities = 159/344 (46%), Positives = 232/344 (67%), Gaps = 6/344 (1%)
Frame = -2
Query: 1192 LIAPDNLNGTKNVLSMAKTIPLP-VDGPESIEWDPQGGGPYAAVVDGRILKWRGDGLGWV 1016
+ AP ++G+++V AK + L GPESI +DP G GPY V DGRILKWRG+ LGW
Sbjct: 28 IFAPPEISGSRDVFPSAKVVNLTGASGPESIAFDPAGEGPYVGVSDGRILKWRGEPLGWS 87
Query: 1015 EFAYTSPHRGNCSR---HEVVPTCGRPLGLSFEKKTGDLYICDGYFGVMKVGPEGGLAEL 845
+FA+TS +R C+R E+ CGRPLGL F+KKTGDLYI D YFG++ VGP GGLA+
Sbjct: 88 DFAHTSSNRQECARPFAPELEHVCGRPLGLRFDKKTGDLYIADAYFGLLVVGPAGGLAKP 147
Query: 844 VVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFREVFYVTVNGERSGRVIRYNKKTK 665
+V + EG+ F N +DIDE+EDV YF D+S ++ R+ +N +++GR I+Y++ +K
Sbjct: 148 LVTEAEGQPFRFTNDLDIDEQEDVIYFTDTSARFQRRQFLAAVLNVDKTGRFIKYDRSSK 207
Query: 664 EAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHRYWIKGPKAGTRDIFAKVPGYPDNI 485
+A V++ L NG+AL+KDRSF++ E+ T + R W+ GP AGT +FA++PG+PDNI
Sbjct: 208 KATVLLQGLAFANGVALSKDRSFVLVVETTTCKILRLWLSGPNAGTHQVFAELPGFPDNI 267
Query: 484 RLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEKTVKLELLIGLVNGFKPHGVAVKIS 305
R G+FW+ +H KK LF +L++ ++ + + L L G PH A+K+S
Sbjct: 268 RRNSNGEFWVALHSKKGLFAKLSLTQTWFRDLVLRLPISPQRLHSLFTGGIPHATAIKLS 327
Query: 304 GETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGSVFKPAVWV 173
E+G+++E+LEDKEGKT++++SE E+ DGK+W GSV P + V
Sbjct: 328 -ESGKVLEVLEDKEGKTLRFISEVEEK-DGKLWIGSVLVPFLGV 369
>gi|30694556|ref|NP_191262.2| strictosidine synthase family protein [Arabidopsis
thaliana]
Length = 374
Score = 330 bits (846), Expect = 3e-088
Identities = 159/344 (46%), Positives = 232/344 (67%), Gaps = 6/344 (1%)
Frame = -2
Query: 1192 LIAPDNLNGTKNVLSMAKTIPLP-VDGPESIEWDPQGGGPYAAVVDGRILKWRGDGLGWV 1016
+ AP ++G+++V AK + L GPESI +DP G GPY V DGRILKWRG+ LGW
Sbjct: 30 IFAPPEISGSRDVFPSAKVVNLTGASGPESIAFDPAGEGPYVGVSDGRILKWRGEPLGWS 89
Query: 1015 EFAYTSPHRGNCSR---HEVVPTCGRPLGLSFEKKTGDLYICDGYFGVMKVGPEGGLAEL 845
+FA+TS +R C+R E+ CGRPLGL F+KKTGDLYI D YFG++ VGP GGLA+
Sbjct: 90 DFAHTSSNRQECARPFAPELEHVCGRPLGLRFDKKTGDLYIADAYFGLLVVGPAGGLAKP 149
Query: 844 VVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFREVFYVTVNGERSGRVIRYNKKTK 665
+V + EG+ F N +DIDE+EDV YF D+S ++ R+ +N +++GR I+Y++ +K
Sbjct: 150 LVTEAEGQPFRFTNDLDIDEQEDVIYFTDTSARFQRRQFLAAVLNVDKTGRFIKYDRSSK 209
Query: 664 EAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHRYWIKGPKAGTRDIFAKVPGYPDNI 485
+A V++ L NG+AL+KDRSF++ E+ T + R W+ GP AGT +FA++PG+PDNI
Sbjct: 210 KATVLLQGLAFANGVALSKDRSFVLVVETTTCKILRLWLSGPNAGTHQVFAELPGFPDNI 269
Query: 484 RLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEKTVKLELLIGLVNGFKPHGVAVKIS 305
R G+FW+ +H KK LF +L++ ++ + + L L G PH A+K+S
Sbjct: 270 RRNSNGEFWVALHSKKGLFAKLSLTQTWFRDLVLRLPISPQRLHSLFTGGIPHATAIKLS 329
Query: 304 GETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGSVFKPAVWV 173
E+G+++E+LEDKEGKT++++SE E+ DGK+W GSV P + V
Sbjct: 330 -ESGKVLEVLEDKEGKTLRFISEVEEK-DGKLWIGSVLVPFLGV 371
>gi|297820490|ref|XP_002878128.1| strictosidine synthase family protein
[Arabidopsis lyrata subsp. lyrata]
Length = 374
Score = 329 bits (842), Expect = 9e-088
Identities = 156/344 (45%), Positives = 231/344 (67%), Gaps = 6/344 (1%)
Frame = -2
Query: 1192 LIAPDNLNGTKNVLSMAKTIPLP-VDGPESIEWDPQGGGPYAAVVDGRILKWRGDGLGWV 1016
+ AP ++G+++V AK + L GPESI +DP G GPY V DGR+LKWR + LGW
Sbjct: 30 IFAPPEISGSRDVFPSAKVVTLTGASGPESIAFDPAGEGPYVGVSDGRVLKWRSESLGWS 89
Query: 1015 EFAYTSPHRGNCSR---HEVVPTCGRPLGLSFEKKTGDLYICDGYFGVMKVGPEGGLAEL 845
+FAYTS +R C R E+ CGRPLGL F+KKTGDLYI D YFG++ VGP GGLA+
Sbjct: 90 DFAYTSSNRQECVRPFAPELEHVCGRPLGLRFDKKTGDLYIADAYFGLLVVGPAGGLAKP 149
Query: 844 VVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFREVFYVTVNGERSGRVIRYNKKTK 665
+V + EG+ F N +DIDE+EDV YF D+S ++ R+ +N +++GR I+Y++ +K
Sbjct: 150 LVTEAEGQPFRFTNDLDIDEQEDVIYFTDTSARFQRRQFLAAVLNVDKTGRFIKYDRSSK 209
Query: 664 EAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHRYWIKGPKAGTRDIFAKVPGYPDNI 485
+A V++ L NG+AL+KDRSF++ E+ T + R W+ GP AGT ++FA++PG+PDNI
Sbjct: 210 KATVLLQGLAFANGVALSKDRSFVLVVETTTCKILRLWLSGPNAGTHEVFAELPGFPDNI 269
Query: 484 RLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEKTVKLELLIGLVNGFKPHGVAVKIS 305
R G+FW+ +H KK LF +L+++ ++ + + L L G +PH A+K+S
Sbjct: 270 RRNSNGEFWVALHSKKGLFAKLSLSQTWFRDLVLRLPISPQRLHSLFTGGRPHATAIKLS 329
Query: 304 GETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGSVFKPAVWV 173
E+G+++E+LED EGK ++++SE E+ DGK+W GSV P + V
Sbjct: 330 -ESGKVLEVLEDNEGKRLRFISEVEEK-DGKLWIGSVLMPFLGV 371
>gi|110743953|dbj|BAE99809.1| hypothetical protein [Arabidopsis thaliana]
Length = 374
Score = 328 bits (839), Expect = 2e-087
Identities = 158/344 (45%), Positives = 231/344 (67%), Gaps = 6/344 (1%)
Frame = -2
Query: 1192 LIAPDNLNGTKNVLSMAKTIPLP-VDGPESIEWDPQGGGPYAAVVDGRILKWRGDGLGWV 1016
+ AP ++G+++V AK + L GPESI +DP G GPY V DGRILKWRG+ LGW
Sbjct: 30 IFAPPEISGSRDVFPSAKVVNLTGASGPESIAFDPAGEGPYVGVSDGRILKWRGEPLGWS 89
Query: 1015 EFAYTSPHRGNCSR---HEVVPTCGRPLGLSFEKKTGDLYICDGYFGVMKVGPEGGLAEL 845
+FA+TS +R C+R E+ CGRPLGL F+KKTGDLYI D YFG++ VGP GGLA+
Sbjct: 90 DFAHTSSNRQECARPFAPELEHVCGRPLGLRFDKKTGDLYIADAYFGLLVVGPAGGLAKP 149
Query: 844 VVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFREVFYVTVNGERSGRVIRYNKKTK 665
+V + EG+ F N +DIDE+EDV YF D+S ++ R+ +N +++GR I+Y++ +K
Sbjct: 150 LVTEAEGQPFRFTNDLDIDEQEDVIYFTDTSARFQRRQFLAAVLNVDKTGRFIKYDRSSK 209
Query: 664 EAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHRYWIKGPKAGTRDIFAKVPGYPDNI 485
+A V++ L NG+AL+KDRSF++ E+ T + R W+ GP AGT +FA++PG+PDNI
Sbjct: 210 KATVLLQGLAFANGVALSKDRSFVLVVETTTCKILRLWLSGPNAGTHQVFAELPGFPDNI 269
Query: 484 RLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEKTVKLELLIGLVNGFKPHGVAVKIS 305
R G+FW+ +H KK LF +L++ ++ + + L L G PH A+K+S
Sbjct: 270 RRNSNGEFWVALHSKKGLFAKLSLTQTWFRDLVLRLPISPQRLHSLFTGGIPHATAIKLS 329
Query: 304 GETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGSVFKPAVWV 173
E+G+++E+L DKEGKT++++SE E+ DGK+W GSV P + V
Sbjct: 330 -ESGKVLEVLGDKEGKTLRFISEVEEK-DGKLWIGSVLVPFLGV 371
>gi|255583680|ref|XP_002532594.1| strictosidine synthase, putative [Ricinus
communis]
Length = 372
Score = 322 bits (825), Expect = 8e-086
Identities = 158/362 (43%), Positives = 233/362 (64%), Gaps = 6/362 (1%)
Frame = -2
Query: 1258 KIPTWAVVPATFAVFSVISYQILIAPDNLNGTKNVLSMAKTIPLP-VDGPESIEWDPQGG 1082
K+ A A + + + AP L + + L AK +P+ GPES+ +DP G
Sbjct: 6 KVGVAATAIVALASIIITNPNNIFAPPPLPSSNDNLHSAKIVPITGAVGPESLVFDPNGE 65
Query: 1081 GPYAAVVDGRILKWRGDGLGWVEFAYTSPHRGNCSR---HEVVPTCGRPLGLSFEKKTGD 911
GPY V DGRILKW+GD LGW +FA+T+ +R C R E+ CGRPLGL F+KKTGD
Sbjct: 66 GPYTGVADGRILKWQGDSLGWTDFAFTTSNRKECIRPFAPELEHVCGRPLGLRFDKKTGD 125
Query: 910 LYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFRE 731
LYI D Y G+ VGP GGLA VV +VEG + F N MDIDE+ DV YF D+S + R+
Sbjct: 126 LYIADAYLGLQVVGPNGGLATPVVSEVEGHPLRFTNDMDIDEQNDVIYFTDTSKIFQRRQ 185
Query: 730 VFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHRYW 551
++ +++GR+++Y+K +KE ++++ L NG+AL+KDRSF++ E++T + R+W
Sbjct: 186 FMASILHKDKTGRLLKYDKSSKEVTILLEGLSFANGVALSKDRSFVLVAETSTCQISRFW 245
Query: 550 IKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEKTV 371
+ GP AG D+FAK+PG+PDNIR G+FW+ +H K+ +LA++N L++ +
Sbjct: 246 LHGPNAGKVDVFAKLPGFPDNIRRNSKGEFWVALHAKEGFLAKLALSNSWIGKTLLKFPL 305
Query: 370 KLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGSVF 191
+ L L+ G KPH A+K+SG+ G+IV++LED +GK ++++SE E+ DGK+W GSV
Sbjct: 306 SFKQLHSLLVGGKPHATAIKLSGD-GKIVQVLEDCDGKRLRFISEVEEK-DGKLWIGSVL 363
Query: 190 KP 185
P
Sbjct: 364 MP 365
>gi|225441250|ref|XP_002273764.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 370
Score = 322 bits (823), Expect = 1e-085
Identities = 160/369 (43%), Positives = 236/369 (63%), Gaps = 7/369 (1%)
Frame = -2
Query: 1267 ISEKIPTWAVVPATFAVFSVISYQILIAPDNLNGTKNVLSMAKTIPLP-VDGPESIEWDP 1091
++ K+ A+ A ++ ++ L P ++ GT ++L ++ I + GPESI +DP
Sbjct: 1 MNTKLILTAITLAAISIILAVNSNHLFKPPSIPGTHDLLHGSEVIQVTGAFGPESIAFDP 60
Query: 1090 QGGGPYAAVVDGRILKWRGDGLGWVEFAYTSPHRGNCSR---HEVVPTCGRPLGLSFEKK 920
+G GPY V DGR+LKW GDG GW +FA T+ R C R E+ CGRPLGL F+KK
Sbjct: 61 KGEGPYTGVADGRVLKWEGDGRGWTDFAVTTSERKECVRPFAPEMEHICGRPLGLRFDKK 120
Query: 919 TGDLYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYH 740
TGDLYI D YFG+ V P GGLA +V +VEGR+++F N MDIDE EDV YF D+S +H
Sbjct: 121 TGDLYIADAYFGLQVVEPNGGLATPLVTEVEGRRLLFTNDMDIDEVEDVIYFTDTSTDFH 180
Query: 739 FREVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCESATGLVH 560
R+ ++G+ +GR+++Y+K +KE V++ L NG+A++KDRSF++ E+ TG +
Sbjct: 181 RRQFMAALLSGDNTGRLMKYDKSSKEVTVLLRGLAFANGVAMSKDRSFVLVAETTTGKII 240
Query: 559 RYWIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVE 380
RYW+KGP AG D+FA+VPGYPDN+R G+FW+ +H KK +N L++
Sbjct: 241 RYWLKGPNAGKSDVFAEVPGYPDNVRRNSKGEFWVALHAKKGPHANWITSNSWVGKTLLK 300
Query: 379 KTVKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFG 200
+ + L L+ + H A+K+S E G+++E+LED EGK+M+++SE E +GK+W G
Sbjct: 301 LPLTFKQLHKLI-VVEAHATAIKLS-EEGQVLEVLEDCEGKSMRFISEV-EEHNGKLWLG 357
Query: 199 SVFKPAVWV 173
SV P + V
Sbjct: 358 SVMMPFIGV 366
>gi|156763850|emb|CAO99127.1| strictosidine synthase-like protein [Nicotiana
tabacum]
Length = 380
Score = 321 bits (822), Expect = 2e-085
Identities = 156/343 (45%), Positives = 224/343 (65%), Gaps = 6/343 (1%)
Frame = -2
Query: 1183 PDNLNGTKNVLSMAKTIPLP-VDGPESIEWDPQGGGPYAAVVDGRILKWRGDGLGWVEFA 1007
P + G+++VLS A+ I L G ES+ +DP G GPY V DGRILKW+ WV+FA
Sbjct: 38 PAPIPGSQDVLSKAELIQLKGAFGAESVAFDPNGEGPYTGVADGRILKWQPHSQTWVDFA 97
Query: 1006 YTSPHRGNCSR---HEVVPTCGRPLGLSFEKKTGDLYICDGYFGVMKVGPEGGLAELVVD 836
TS R NCSR E+ CGRPLGL F+ KTGDLYI D YFG+ VGP GGLA +V
Sbjct: 98 VTSSQRKNCSRPSAPEMEHVCGRPLGLRFDHKTGDLYIADAYFGLHVVGPTGGLATPLVQ 157
Query: 835 QVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFREVFYVTVNGERSGRVIRYNKKTKEAK 656
EG+ ++F N +DID+++D+ YF D+S Y R+ T +G+++GR+++YNK TKE
Sbjct: 158 DFEGQPLLFTNDLDIDDDDDIIYFTDTSTIYQRRQFVAATASGDKTGRLMKYNKSTKEVT 217
Query: 655 VVMDNLRCNNGLALNKDRSFLISCESATGLVHRYWIKGPKAGTRDIFAKVPGYPDNIRLT 476
V + L NG+AL+KDRSFL+ E++ + RYW+KGP G DIFA++PG+PDN+R+
Sbjct: 218 VALGGLAFANGVALSKDRSFLLVAETSACRILRYWLKGPNVGNHDIFAELPGFPDNVRIN 277
Query: 475 PTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEKTVKLELLIGLVNGFKPHGVAVKISGET 296
G+FW+ +H K + RL ++N LGK + + + L L+ G +PH A+K+S E
Sbjct: 278 SRGEFWVALHAKASPLARLIISN-SWLGKTLLREFNFQQLHNLLVGGQPHATAIKLS-ED 335
Query: 295 GEIVEILEDKEGKTMQYVSEAYERDDGKIWFGSVFKPAVWVLD 167
G ++E+LED EGK ++++SE +E + GK+W SV ++ V D
Sbjct: 336 GRVLEVLEDVEGKILRFISEVHEEESGKLWISSVIMSSLGVYD 378
>gi|224139742|ref|XP_002323255.1| predicted protein [Populus trichocarpa]
Length = 375
Score = 290 bits (742), Expect = 3e-076
Identities = 147/343 (42%), Positives = 219/343 (63%), Gaps = 9/343 (2%)
Frame = -2
Query: 1192 LIAPDNLNGTKNVLSMAKTIPLP-VDGPESIEWDPQGGGPYAAVVDGRILKW--RGDGLG 1022
L+ P + + + L AK + + GPES+ +DP G GPY V DGR+LKW DG G
Sbjct: 28 LLGPPTIPTSNDHLHSAKILHVSGAVGPESLVFDPNGEGPYTGVADGRVLKWIAGDDGSG 87
Query: 1021 -WVEFAYTSPHRGNCSR---HEVVPTCGRPLGLSFEKKTGDLYICDGYFGVMKVGPEGGL 854
W +FA TS +R C R E+ CGRPLGL F+KKTG+LYI D Y G+ VGP GGL
Sbjct: 88 SWTDFATTSSNRNECVRPFAPEMEHVCGRPLGLRFDKKTGNLYIADAYLGLQVVGPTGGL 147
Query: 853 AELVVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFREVFYVTVNGERSGRVIRYNK 674
A VV ++EG+ + F N +DIDE+EDV YF D+S + R+ + +++GR+++Y+K
Sbjct: 148 ATPVVTELEGQPMRFTNDLDIDEQEDVIYFTDTSMVFQRRQFILSLLTKDKTGRLLKYDK 207
Query: 673 KTKEAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHRYWIKGPKAGTRDIFAKVPGYP 494
+KE V+ L NG+AL+KD +FL+ E+ T + R+W+ GP AG D+F ++PG+P
Sbjct: 208 SSKEVTVLARGLAFANGVALSKDSTFLLVAETTTCRILRFWLHGPNAGKSDVFTELPGFP 267
Query: 493 DNIRLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEKTVKLELLIGLVNGFKPHGVAV 314
DNIR G+FW+ +H KK LF ++ ++N L++ + + L L+ G K H A+
Sbjct: 268 DNIRRNSKGEFWVALHSKKGLFAKVVLSNSWIGKTLLKFPLSFKQLHSLLVGGKAHATAI 327
Query: 313 KISGETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGSVFKP 185
K+S E G+++++LED +GKT++++SE E+ DGK+W GSV P
Sbjct: 328 KLS-EEGKVLDVLEDCDGKTLRFISEVEEK-DGKLWIGSVLMP 368
>gi|224139738|ref|XP_002323253.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 265 bits (675), Expect = 2e-068
Identities = 128/316 (40%), Positives = 202/316 (63%), Gaps = 8/316 (2%)
Frame = -2
Query: 1117 GPESIEWDPQGGGPYAAVVDGRILKWRGDGLGWVEFAYTSPHRGNC----SRHEVVPTCG 950
GPES +D G GPY ++ DGRI+KW+GD W++FA TSP+R C H++ CG
Sbjct: 30 GPESFAFDSLGEGPYTSLSDGRIIKWQGDKKRWIDFAVTSPNRDGCGGPHDHHQMEHVCG 89
Query: 949 RPLGLSFEKKTGDLYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEEDVF 770
RPLG F++ GDLYI D Y G+++VGPEGGLA + +G F N +DID+
Sbjct: 90 RPLGSCFDETHGDLYIADAYMGLLRVGPEGGLATKIATHAQGIPFRFTNSLDIDQSSGAI 149
Query: 769 YFNDSSDKYHFREVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLI 590
YF DSS +Y R+ V ++G++SGR+++Y+ +K+ V++ NL NG+AL+ D SF++
Sbjct: 150 YFTDSSTQYQRRDYLSVVLSGDKSGRLMKYDTASKQVTVLLKNLTFPNGVALSTDGSFVL 209
Query: 589 SCESATGLVHRYWIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRLAVN 410
E+ + + RYWIK KAG ++FA++ G+PDNI+ +P G +W+GI+ K+ L +
Sbjct: 210 LAETTSCRILRYWIKTSKAGALEVFAQLQGFPDNIKRSPRGGYWVGINSKREKLSEL-LF 268
Query: 409 NYQCLGK-LVEKTVKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVSEA 233
+Y +GK L++ + + + ++ G+AV++S E G+IVE+ ED++G ++ +SE
Sbjct: 269 SYPWIGKVLLKLPLDITKFQTALAKYRGGGLAVRLS-ENGDIVEVFEDRDGNRLKSISEV 327
Query: 232 YERDDGKIWFGSVFKP 185
E+ DGK+W GS+ P
Sbjct: 328 MEK-DGKLWIGSIDLP 342
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,587,916,767,856
Number of Sequences: 15229318
Number of Extensions: 4587916767856
Number of Successful Extensions: 1071853064
Number of sequences better than 0.0: 0
|