Library    |     Search    |     Batch query    |     SNP    |     SSR  

GenBank blast output of UN41442


BLASTX 7.6.2

Query= UN41442 /QuerySize=1338
        (1337 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|312281991|dbj|BAJ33861.1| unnamed protein product [Thellungie...    692   5e-197
gi|15230200|ref|NP_191261.1| strictosidine synthase family prote...    658   8e-187
gi|15230182|ref|NP_191260.1| strictosidine synthase family prote...    542   9e-152
gi|13877837|gb|AAK43996.1|AF370181_1 unknown protein [Arabidopsi...    540   2e-151
gi|297820484|ref|XP_002878125.1| strictosidine synthase family p...    540   3e-151
gi|6759491|emb|CAB69786.1| hypothetical protein [Arabidopsis tha...    532   7e-149
gi|297827769|ref|XP_002881767.1| hypothetical protein ARALYDRAFT...    530   2e-148
gi|79315403|ref|NP_001030876.1| strictosidine synthase family pr...    473   4e-131
gi|3894193|gb|AAC78542.1| putative strictosidine synthase [Arabi...    455   8e-126
gi|145360869|ref|NP_181662.3| strictosidine synthase-like 1 [Ara...    455   8e-126
gi|297820488|ref|XP_002878127.1| strictosidine synthase family p...    349   8e-094
gi|6911873|emb|CAB72173.1| putative protein [Arabidopsis thaliana]     330   3e-088
gi|30694556|ref|NP_191262.2| strictosidine synthase family prote...    330   3e-088
gi|297820490|ref|XP_002878128.1| strictosidine synthase family p...    329   9e-088
gi|110743953|dbj|BAE99809.1| hypothetical protein [Arabidopsis t...    328   2e-087
gi|255583680|ref|XP_002532594.1| strictosidine synthase, putativ...    322   8e-086
gi|225441250|ref|XP_002273764.1| PREDICTED: hypothetical protein...    322   1e-085
gi|156763850|emb|CAO99127.1| strictosidine synthase-like protein...    321   2e-085
gi|224139742|ref|XP_002323255.1| predicted protein [Populus tric...    290   3e-076
gi|224139738|ref|XP_002323253.1| predicted protein [Populus tric...    265   2e-068

>gi|312281991|dbj|BAJ33861.1| unnamed protein product [Thellungiella halophila]

          Length = 370

 Score =  692 bits (1784), Expect = 5e-197
 Identities = 326/370 (88%), Positives = 348/370 (94%), Gaps = 1/370 (0%)
 Frame = -2

Query: 1273 MPISEKIPTWAVVPATFAVFSVISYQILIAPDNLNGTKNVLSMAKTIPLPVDGPESIEWD 1094
            MP+S+K+PTWA VPA  AVFSVISYQ +IAPDNL GTK+VLSMAKTIPLPV GPESIEWD
Sbjct:    1 MPLSQKVPTWAAVPAVLAVFSVISYQTIIAPDNLKGTKHVLSMAKTIPLPVHGPESIEWD 60

Query: 1093 PQGGGPYAAVVDGRILKWRGDGLGWVEFAYTSPHRGNCSRHEVVPTCGRPLGLSFEKKTG 914
            PQGGGPYAAVVDGRILKW+GDG+GWVEFAYTSPHRGNCSRHEVVPTCGRPLGL FEKKTG
Sbjct:   61 PQGGGPYAAVVDGRILKWQGDGIGWVEFAYTSPHRGNCSRHEVVPTCGRPLGLKFEKKTG 120

Query:  913 DLYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFR 734
            DLYICDGY GVMKVGPEGGLAELVVDQ EGRKVMFANQ+DIDEEEDV YFNDSSDKYHFR
Sbjct:  121 DLYICDGYLGVMKVGPEGGLAELVVDQAEGRKVMFANQIDIDEEEDVLYFNDSSDKYHFR 180

Query:  733 EVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHRY 554
            EVFYV  NG+R+GRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCES+TGLVHRY
Sbjct:  181 EVFYVASNGDRTGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCESSTGLVHRY 240

Query:  553 WIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEKT 374
            WIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFW+GIHCKKN  GR  +NN + LGK+VEKT
Sbjct:  241 WIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWLGIHCKKNPLGRFMINN-RWLGKIVEKT 299

Query:  373 VKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGSV 194
            V L+LLI ++NGFKPHG+AVKISGETGEI+E+LED EGKTMQYVSEAYERDDGK+WFGSV
Sbjct:  300 VNLDLLIAVMNGFKPHGIAVKISGETGEILEVLEDIEGKTMQYVSEAYERDDGKLWFGSV 359

Query:  193 FKPAVWVLDR 164
            F PAVWVLDR
Sbjct:  360 FTPAVWVLDR 369

>gi|15230200|ref|NP_191261.1| strictosidine synthase family protein [Arabidopsis
        thaliana]

          Length = 370

 Score =  658 bits (1696), Expect = 8e-187
 Identities = 311/370 (84%), Positives = 341/370 (92%), Gaps = 1/370 (0%)
 Frame = -2

Query: 1273 MPISEKIPTWAVVPATFAVFSVISYQILIAPDNLNGTKNVLSMAKTIPLPVDGPESIEWD 1094
            MPI++KIPTW  VPA FAV SVISYQ LI P+NL G KNVL+MAKTIP+PV GPESIE+D
Sbjct:    1 MPINQKIPTWFAVPAVFAVLSVISYQTLIVPENLEGAKNVLTMAKTIPIPVAGPESIEFD 60

Query: 1093 PQGGGPYAAVVDGRILKWRGDGLGWVEFAYTSPHRGNCSRHEVVPTCGRPLGLSFEKKTG 914
            P+G GPYAAVVDGRILKWRGD LGWV+FAYTSPHRGNCS+ EVVPTCGRPLGL+FEKKTG
Sbjct:   61 PKGEGPYAAVVDGRILKWRGDDLGWVDFAYTSPHRGNCSKTEVVPTCGRPLGLTFEKKTG 120

Query:  913 DLYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFR 734
            DLYICDGY G+MKVGPEGGLAEL+VD+ EGRKVMFANQ DIDEEEDVFYFNDSSDKYHFR
Sbjct:  121 DLYICDGYLGLMKVGPEGGLAELIVDEAEGRKVMFANQGDIDEEEDVFYFNDSSDKYHFR 180

Query:  733 EVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHRY 554
            +VF+V V+GERSGRVIRY+KKTKEAKV+MDNL CNNGLALNKDRSFLI+CES T LVHRY
Sbjct:  181 DVFFVAVSGERSGRVIRYDKKTKEAKVIMDNLVCNNGLALNKDRSFLITCESGTSLVHRY 240

Query:  553 WIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEKT 374
            WIKGPKAGTRDIFAKVPGYPDNIRLT TGDFWIG+HCKKNL GRL V  Y+ LGKLVEKT
Sbjct:  241 WIKGPKAGTRDIFAKVPGYPDNIRLTSTGDFWIGLHCKKNLIGRLIV-KYKWLGKLVEKT 299

Query:  373 VKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGSV 194
            +KLE +I  +NGFKPHGVAVKISGETGE++E+LEDKEGKTM+YVSEAYERDDGK+WFGSV
Sbjct:  300 MKLEYVIAFINGFKPHGVAVKISGETGEVLELLEDKEGKTMKYVSEAYERDDGKLWFGSV 359

Query:  193 FKPAVWVLDR 164
            + PAVWVLDR
Sbjct:  360 YWPAVWVLDR 369

>gi|15230182|ref|NP_191260.1| strictosidine synthase family protein [Arabidopsis
        thaliana]

          Length = 376

 Score =  542 bits (1394), Expect = 9e-152
 Identities = 255/370 (68%), Positives = 303/370 (81%), Gaps = 2/370 (0%)
 Frame = -2

Query: 1273 MPISEKIPT-WAVVPATFAVFSVISYQILIAPDNLNGTKNVLSMAKTIPLPVDGPESIEW 1097
            MPIS ++ T     P   AV     +  +I PDNL GTK+VL  AKTIPLPVDGPES+E+
Sbjct:    1 MPISRRVLTPITAAPVILAVLCFFFWSSIIGPDNLKGTKHVLQDAKTIPLPVDGPESLEF 60

Query: 1096 DPQGGGPYAAVVDGRILKWRGDGLGWVEFAYTSPHRGNCSRHEVVPTCGRPLGLSFEKKT 917
            DPQG GPY  V DGRILKWRG+ LGWV+FAYTSPHR NCS HEVVP+CGRPLGLSFE+KT
Sbjct:   61 DPQGEGPYVGVTDGRILKWRGEELGWVDFAYTSPHRDNCSSHEVVPSCGRPLGLSFERKT 120

Query:  916 GDLYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHF 737
            GDLYICDGYFGVMKVGPEGGLAELVVD+ EGRKVMFANQ DIDEEED+FYFNDSSD YHF
Sbjct:  121 GDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQGDIDEEEDIFYFNDSSDTYHF 180

Query:  736 REVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHR 557
            R+VFYV+++G + GRVIRY+ K KEAKV+MD LR  NGLAL+K+ SF+++CES+T + HR
Sbjct:  181 RDVFYVSLSGTKVGRVIRYDMKKKEAKVIMDKLRLPNGLALSKNGSFVVTCESSTNICHR 240

Query:  556 YWIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEK 377
             W+KGPK+GT ++FA +PG PDNIR TPTGDFW+ +HCKKNLF R AV  +  +G+    
Sbjct:  241 IWVKGPKSGTNEVFATLPGSPDNIRRTPTGDFWVALHCKKNLFTR-AVLIHTWVGRFFMN 299

Query:  376 TVKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGS 197
            T+K+E +I  +NG KPHG+ VK+SGETGEI+EILED EGKT++YVSEAYE  DGK+W GS
Sbjct:  300 TMKMETVIHFMNGGKPHGIVVKLSGETGEILEILEDSEGKTVKYVSEAYETKDGKLWIGS 359

Query:  196 VFKPAVWVLD 167
            V+ PAVWVLD
Sbjct:  360 VYWPAVWVLD 369

>gi|13877837|gb|AAK43996.1|AF370181_1 unknown protein [Arabidopsis thaliana]

          Length = 376

 Score =  540 bits (1390), Expect = 2e-151
 Identities = 254/370 (68%), Positives = 302/370 (81%), Gaps = 2/370 (0%)
 Frame = -2

Query: 1273 MPISEKIPT-WAVVPATFAVFSVISYQILIAPDNLNGTKNVLSMAKTIPLPVDGPESIEW 1097
            MPIS ++ T     P   AV     +  +I PDNL GTK+VL  AKTIPLPVDGPES+E+
Sbjct:    1 MPISRRVLTPITAAPVILAVLCFFFWSSIIGPDNLKGTKHVLQDAKTIPLPVDGPESLEF 60

Query: 1096 DPQGGGPYAAVVDGRILKWRGDGLGWVEFAYTSPHRGNCSRHEVVPTCGRPLGLSFEKKT 917
            DPQG GPY  V DGRILKWRG+ LGWV+FAYTSPHR NCS HEVVP+CGRPLGLSFE+KT
Sbjct:   61 DPQGEGPYVGVTDGRILKWRGEELGWVDFAYTSPHRDNCSSHEVVPSCGRPLGLSFERKT 120

Query:  916 GDLYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHF 737
            GDLYICDGYFGVMKVGPEGGL ELVVD+ EGRKVMFANQ DIDEEED+FYFNDSSD YHF
Sbjct:  121 GDLYICDGYFGVMKVGPEGGLGELVVDEAEGRKVMFANQGDIDEEEDIFYFNDSSDTYHF 180

Query:  736 REVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHR 557
            R+VFYV+++G + GRVIRY+ K KEAKV+MD LR  NGLAL+K+ SF+++CES+T + HR
Sbjct:  181 RDVFYVSLSGTKVGRVIRYDMKKKEAKVIMDKLRLPNGLALSKNGSFVVTCESSTNICHR 240

Query:  556 YWIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEK 377
             W+KGPK+GT ++FA +PG PDNIR TPTGDFW+ +HCKKNLF R AV  +  +G+    
Sbjct:  241 IWVKGPKSGTNEVFATLPGSPDNIRRTPTGDFWVALHCKKNLFTR-AVLIHTWVGRFFMN 299

Query:  376 TVKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGS 197
            T+K+E +I  +NG KPHG+ VK+SGETGEI+EILED EGKT++YVSEAYE  DGK+W GS
Sbjct:  300 TMKMETVIHFMNGGKPHGIVVKLSGETGEILEILEDSEGKTVKYVSEAYETKDGKLWIGS 359

Query:  196 VFKPAVWVLD 167
            V+ PAVWVLD
Sbjct:  360 VYWPAVWVLD 369

>gi|297820484|ref|XP_002878125.1| strictosidine synthase family protein
        [Arabidopsis lyrata subsp. lyrata]

          Length = 371

 Score =  540 bits (1389), Expect = 3e-151
 Identities = 250/370 (67%), Positives = 304/370 (82%), Gaps = 2/370 (0%)
 Frame = -2

Query: 1273 MPISEKIPT-WAVVPATFAVFSVISYQILIAPDNLNGTKNVLSMAKTIPLPVDGPESIEW 1097
            MPIS ++ T  +  P   AV     +  +I PDN+ GTK+VL  AKTIPLP DGPES+E+
Sbjct:    1 MPISRRVLTPVSAAPVILAVLCFFFWSSIIGPDNIKGTKHVLQDAKTIPLPADGPESLEF 60

Query: 1096 DPQGGGPYAAVVDGRILKWRGDGLGWVEFAYTSPHRGNCSRHEVVPTCGRPLGLSFEKKT 917
            DPQG GPY  V DGRILKWRG+ LGWV+FAYTSPHR NCSRHEVVP+CGRPLGL+FEKKT
Sbjct:   61 DPQGEGPYVGVTDGRILKWRGEELGWVDFAYTSPHRDNCSRHEVVPSCGRPLGLTFEKKT 120

Query:  916 GDLYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHF 737
            GDLYICDGYFG+MKVGP+GGLAELVVD+ EGRKVMFANQ DIDEEED+FYFNDSSD YHF
Sbjct:  121 GDLYICDGYFGLMKVGPQGGLAELVVDEAEGRKVMFANQGDIDEEEDIFYFNDSSDTYHF 180

Query:  736 REVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHR 557
            REVFYV+++G + GRVIRY+ K KEAKV+MD LR  NGLAL+K+ SF+++CES+T + HR
Sbjct:  181 REVFYVSLSGTKVGRVIRYDMKKKEAKVIMDKLRLPNGLALSKNGSFVVTCESSTNICHR 240

Query:  556 YWIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEK 377
             W+KGPK+GT ++FA +PG PDNIR TPTGDFW+ +HCKKNLF R+A+  +  +G+    
Sbjct:  241 IWVKGPKSGTNEVFATLPGSPDNIRRTPTGDFWVALHCKKNLFTRVAL-IHSLVGRFFMN 299

Query:  376 TVKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGS 197
            T+K+E +I  +NG KPHG+ VK+SGETGEI+EILED EGKT++Y SEAYE +DGK+W GS
Sbjct:  300 TMKMETVIHFMNGGKPHGIVVKLSGETGEILEILEDSEGKTVKYASEAYETEDGKLWIGS 359

Query:  196 VFKPAVWVLD 167
            V+ PAVWV D
Sbjct:  360 VYWPAVWVYD 369

>gi|6759491|emb|CAB69786.1| hypothetical protein [Arabidopsis thaliana]

          Length = 352

 Score =  532 bits (1369), Expect = 7e-149
 Identities = 247/342 (72%), Positives = 291/342 (85%), Gaps = 1/342 (0%)
 Frame = -2

Query: 1192 LIAPDNLNGTKNVLSMAKTIPLPVDGPESIEWDPQGGGPYAAVVDGRILKWRGDGLGWVE 1013
            +I PDNL GTK+VL  AKTIPLPVDGPES+E+DPQG GPY  V DGRILKWRG+ LGWV+
Sbjct:    5 IIGPDNLKGTKHVLQDAKTIPLPVDGPESLEFDPQGEGPYVGVTDGRILKWRGEELGWVD 64

Query: 1012 FAYTSPHRGNCSRHEVVPTCGRPLGLSFEKKTGDLYICDGYFGVMKVGPEGGLAELVVDQ 833
            FAYTSPHR NCS HEVVP+CGRPLGLSFE+KTGDLYICDGYFGVMKVGPEGGLAELVVD+
Sbjct:   65 FAYTSPHRDNCSSHEVVPSCGRPLGLSFERKTGDLYICDGYFGVMKVGPEGGLAELVVDE 124

Query:  832 VEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFREVFYVTVNGERSGRVIRYNKKTKEAKV 653
             EGRKVMFANQ DIDEEED+FYFNDSSD YHFR+VFYV+++G + GRVIRY+ K KEAKV
Sbjct:  125 AEGRKVMFANQGDIDEEEDIFYFNDSSDTYHFRDVFYVSLSGTKVGRVIRYDMKKKEAKV 184

Query:  652 VMDNLRCNNGLALNKDRSFLISCESATGLVHRYWIKGPKAGTRDIFAKVPGYPDNIRLTP 473
            +MD LR  NGLAL+K+ SF+++CES+T   HR W+KGPK+GT ++FA +PG PDNIR TP
Sbjct:  185 IMDKLRLPNGLALSKNGSFVVTCESSTNTCHRIWVKGPKSGTNEVFATLPGSPDNIRRTP 244

Query:  472 TGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEKTVKLELLIGLVNGFKPHGVAVKISGETG 293
            TGDFW+ +HCKKNLF R AV  +  +G+    T+K+E +I  +NG KPHG+ VK+SGETG
Sbjct:  245 TGDFWVALHCKKNLFTR-AVLIHTWVGRFFMNTMKMETVIHFMNGGKPHGIVVKLSGETG 303

Query:  292 EIVEILEDKEGKTMQYVSEAYERDDGKIWFGSVFKPAVWVLD 167
            EI+EILED EGKT++YVSEAYE  DGK+W GSV+ PAVWVLD
Sbjct:  304 EILEILEDSEGKTVKYVSEAYETKDGKLWIGSVYWPAVWVLD 345

>gi|297827769|ref|XP_002881767.1| hypothetical protein ARALYDRAFT_321816
        [Arabidopsis lyrata subsp. lyrata]

          Length = 370

 Score =  530 bits (1365), Expect = 2e-148
 Identities = 252/371 (67%), Positives = 301/371 (81%), Gaps = 3/371 (0%)
 Frame = -2

Query: 1273 MPISEKIPTWAVVPATFAVFSVISYQILIAPDNLNGTKNVLSMAKTIPLPVDGPESIEWD 1094
            MP+S K+ TWAVV A  AV  V     +I P+++ G+KNVL+MA+TIPLPVDGPES++WD
Sbjct:    1 MPVSRKVQTWAVVVAVMAVLVVFVGPYIIGPESIEGSKNVLTMARTIPLPVDGPESLDWD 60

Query: 1093 PQGGGPYAAVVDGRILKWRGDGLGWVEFAYTSPHRGNCSRHEVVPTCGRPLGLSFEKKTG 914
            P+G GPY  V DGRILKW G+ LGWV+FAY+SPHR NCSRH+V P CGRPLGLSFEKK+G
Sbjct:   61 PRGEGPYVGVTDGRILKWSGEDLGWVQFAYSSPHRENCSRHKVEPACGRPLGLSFEKKSG 120

Query:  913 DLYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHF- 737
            DLY CDGY G+MKVGP+GGLAE VVD+ EG+KVMFANQMDIDEEED  YFNDSSD YHF 
Sbjct:  121 DLYFCDGYLGIMKVGPKGGLAEKVVDEAEGQKVMFANQMDIDEEEDAIYFNDSSDTYHFG 180

Query:  736 REVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHR 557
            R+VFY  + GE++GR IRY+KKTKEAKV+MD L   NGLAL+KD SF++SCE  T LVHR
Sbjct:  181 RDVFYAFLCGEKTGRAIRYDKKTKEAKVIMDRLHFPNGLALSKDGSFVLSCEVPTQLVHR 240

Query:  556 YWIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEK 377
            YW KGPKAGTRDIFAK+PGY DNIR T TGDFW+ +H KK  F RL++  +  +GK   K
Sbjct:  241 YWAKGPKAGTRDIFAKLPGYADNIRRTETGDFWVALHSKKTPFSRLSM-IHPWVGKFFIK 299

Query:  376 TVKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGS 197
            T+K+ELL+ L  G KPH VAVK+SG+TGEI+EILED EGK M+++SE  ER DG++WFGS
Sbjct:  300 TLKMELLLFLFEGGKPHAVAVKLSGKTGEIMEILEDSEGKNMKFISEVQER-DGRLWFGS 358

Query:  196 VFKPAVWVLDR 164
            VF P+VWVLDR
Sbjct:  359 VFLPSVWVLDR 369

>gi|79315403|ref|NP_001030876.1| strictosidine synthase family protein
        [Arabidopsis thaliana]

          Length = 356

 Score =  473 bits (1216), Expect = 4e-131
 Identities = 224/262 (85%), Positives = 245/262 (93%), Gaps = 1/262 (0%)
 Frame = -2

Query: 949 RPLGLSFEKKTGDLYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEEDVF 770
           RPLGL+FEKKTGDLYICDGY G+MKVGPEGGLAEL+VD+ EGRKVMFANQ DIDEEEDVF
Sbjct:  95 RPLGLTFEKKTGDLYICDGYLGLMKVGPEGGLAELIVDEAEGRKVMFANQGDIDEEEDVF 154

Query: 769 YFNDSSDKYHFREVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLI 590
           YFNDSSDKYHFR+VF+V V+GERSGRVIRY+KKTKEAKV+MDNL CNNGLALNKDRSFLI
Sbjct: 155 YFNDSSDKYHFRDVFFVAVSGERSGRVIRYDKKTKEAKVIMDNLVCNNGLALNKDRSFLI 214

Query: 589 SCESATGLVHRYWIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRLAVN 410
           +CES T LVHRYWIKGPKAGTRDIFAKVPGYPDNIRLT TGDFWIG+HCKKNL GRL V 
Sbjct: 215 TCESGTSLVHRYWIKGPKAGTRDIFAKVPGYPDNIRLTSTGDFWIGLHCKKNLIGRLIV- 273

Query: 409 NYQCLGKLVEKTVKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVSEAY 230
            Y+ LGKLVEKT+KLE +I  +NGFKPHGVAVKISGETGE++E+LEDKEGKTM+YVSEAY
Sbjct: 274 KYKWLGKLVEKTMKLEYVIAFINGFKPHGVAVKISGETGEVLELLEDKEGKTMKYVSEAY 333

Query: 229 ERDDGKIWFGSVFKPAVWVLDR 164
           ERDDGK+WFGSV+ PAVWVLDR
Sbjct: 334 ERDDGKLWFGSVYWPAVWVLDR 355

>gi|3894193|gb|AAC78542.1| putative strictosidine synthase [Arabidopsis
        thaliana]

          Length = 395

 Score =  455 bits (1170), Expect = 8e-126
 Identities = 221/326 (67%), Positives = 258/326 (79%), Gaps = 9/326 (2%)
 Frame = -2

Query: 1120 DGPESIEW------DPQGGGPYAAVVDGRILKWRGDGLGWVEFAYTSPHRGNCSRHEVVP 959
            D P S  W      DP+G GPY  V DGRILKW G+ LGW+EFAY+SPHR NCS H+V P
Sbjct:   71 DNPPSRGWTGEPGLDPRGEGPYVGVTDGRILKWSGEDLGWIEFAYSSPHRKNCSSHKVEP 130

Query:  958 TCGRPLGLSFEKKTGDLYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEE 779
             CGRPLGLSFEKK+GDLY CDGY GVMKVGP+GGLAE VVD+VEG+KVMFANQMDIDEEE
Sbjct:  131 ACGRPLGLSFEKKSGDLYFCDGYLGVMKVGPKGGLAEKVVDEVEGQKVMFANQMDIDEEE 190

Query:  778 DVFYFNDSSDKYHF-REVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDR 602
            D  YFNDSSD YHF R+VFY  + GE++GR IRY+KKTKEAKV+MD L   NGLAL+ D 
Sbjct:  191 DAIYFNDSSDTYHFGRDVFYAFLCGEKTGRAIRYDKKTKEAKVIMDRLHFPNGLALSIDG 250

Query:  601 SFLISCESATGLVHRYWIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGR 422
            SF++SCE  T LVHRYW KGP AGTRDIFAK+PGY DNIR T TGDFW+ +H KK  F R
Sbjct:  251 SFVLSCEVPTQLVHRYWAKGPNAGTRDIFAKLPGYADNIRRTETGDFWVALHSKKTPFSR 310

Query:  421 LAVNNYQCLGKLVEKTVKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYV 242
            L++  +  +GK   KT+K+ELL+ L  G KPH VAVK+SG+TGEI+EILED EGK M+++
Sbjct:  311 LSM-IHPWVGKFFIKTLKMELLVFLFEGGKPHAVAVKLSGKTGEIMEILEDSEGKNMKFI 369

Query:  241 SEAYERDDGKIWFGSVFKPAVWVLDR 164
            SE  ER DG++WFGSVF P+VWVLDR
Sbjct:  370 SEVQER-DGRLWFGSVFLPSVWVLDR 394

>gi|145360869|ref|NP_181662.3| strictosidine synthase-like 1 [Arabidopsis
        thaliana]

          Length = 394

 Score =  455 bits (1170), Expect = 8e-126
 Identities = 219/324 (67%), Positives = 256/324 (79%), Gaps = 8/324 (2%)
 Frame = -2

Query: 1120 DGPESIEW------DPQGGGPYAAVVDGRILKWRGDGLGWVEFAYTSPHRGNCSRHEVVP 959
            D P S  W      DP+G GPY  V DGRILKW G+ LGW+EFAY+SPHR NCS H+V P
Sbjct:   71 DNPPSRGWTGEPGLDPRGEGPYVGVTDGRILKWSGEDLGWIEFAYSSPHRKNCSSHKVEP 130

Query:  958 TCGRPLGLSFEKKTGDLYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEE 779
             CGRPLGLSFEKK+GDLY CDGY GVMKVGP+GGLAE VVD+VEG+KVMFANQMDIDEEE
Sbjct:  131 ACGRPLGLSFEKKSGDLYFCDGYLGVMKVGPKGGLAEKVVDEVEGQKVMFANQMDIDEEE 190

Query:  778 DVFYFNDSSDKYHFREVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRS 599
            D  YFNDSSD YHF +VFY  + GE++GR IRY+KKTKEAKV+MD L   NGLAL+ D S
Sbjct:  191 DAIYFNDSSDTYHFGDVFYAFLCGEKTGRAIRYDKKTKEAKVIMDRLHFPNGLALSIDGS 250

Query:  598 FLISCESATGLVHRYWIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRL 419
            F++SCE  T LVHRYW KGP AGTRDIFAK+PGY DNIR T TGDFW+ +H KK  F RL
Sbjct:  251 FVLSCEVPTQLVHRYWAKGPNAGTRDIFAKLPGYADNIRRTETGDFWVALHSKKTPFSRL 310

Query:  418 AVNNYQCLGKLVEKTVKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVS 239
            ++  +  +GK   KT+K+ELL+ L  G KPH VAVK+SG+TGEI+EILED EGK M+++S
Sbjct:  311 SM-IHPWVGKFFIKTLKMELLVFLFEGGKPHAVAVKLSGKTGEIMEILEDSEGKNMKFIS 369

Query:  238 EAYERDDGKIWFGSVFKPAVWVLD 167
            E  ER DG++WFGSVF P+VWVLD
Sbjct:  370 EVQER-DGRLWFGSVFLPSVWVLD 392

>gi|297820488|ref|XP_002878127.1| strictosidine synthase family protein
        [Arabidopsis lyrata subsp. lyrata]

          Length = 343

 Score =  349 bits (894), Expect = 8e-094
 Identities = 166/196 (84%), Positives = 181/196 (92%), Gaps = 1/196 (0%)
 Frame = -2

Query: 754 SDKYHFREVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCESA 575
           SDKYHFR+VF+V V+GERSGRVIRY+KKTKEAKVVMDNL CNNGLALNKDRSFLI+CES 
Sbjct: 147 SDKYHFRDVFFVAVSGERSGRVIRYDKKTKEAKVVMDNLVCNNGLALNKDRSFLITCESG 206

Query: 574 TGLVHRYWIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRLAVNNYQCL 395
           T LVHRYWIKGPKAGTRDIFAKVPGYPDNIRLT TGDFWIGIHCKKNL GRL V  Y+ L
Sbjct: 207 TSLVHRYWIKGPKAGTRDIFAKVPGYPDNIRLTSTGDFWIGIHCKKNLLGRLIV-RYKWL 265

Query: 394 GKLVEKTVKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVSEAYERDDG 215
           GKLVEKT+KLE +I  +NGFKP GVAVKISGETGE++E+LEDKEGKTM+YVSEAYERDDG
Sbjct: 266 GKLVEKTIKLEYVIAFINGFKPQGVAVKISGETGEVLEVLEDKEGKTMKYVSEAYERDDG 325

Query: 214 KIWFGSVFKPAVWVLD 167
           K+WFGSV+ PAVWVLD
Sbjct: 326 KLWFGSVYWPAVWVLD 341

>gi|6911873|emb|CAB72173.1| putative protein [Arabidopsis thaliana]

          Length = 372

 Score =  330 bits (846), Expect = 3e-088
 Identities = 159/344 (46%), Positives = 232/344 (67%), Gaps = 6/344 (1%)
 Frame = -2

Query: 1192 LIAPDNLNGTKNVLSMAKTIPLP-VDGPESIEWDPQGGGPYAAVVDGRILKWRGDGLGWV 1016
            + AP  ++G+++V   AK + L    GPESI +DP G GPY  V DGRILKWRG+ LGW 
Sbjct:   28 IFAPPEISGSRDVFPSAKVVNLTGASGPESIAFDPAGEGPYVGVSDGRILKWRGEPLGWS 87

Query: 1015 EFAYTSPHRGNCSR---HEVVPTCGRPLGLSFEKKTGDLYICDGYFGVMKVGPEGGLAEL 845
            +FA+TS +R  C+R    E+   CGRPLGL F+KKTGDLYI D YFG++ VGP GGLA+ 
Sbjct:   88 DFAHTSSNRQECARPFAPELEHVCGRPLGLRFDKKTGDLYIADAYFGLLVVGPAGGLAKP 147

Query:  844 VVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFREVFYVTVNGERSGRVIRYNKKTK 665
            +V + EG+   F N +DIDE+EDV YF D+S ++  R+     +N +++GR I+Y++ +K
Sbjct:  148 LVTEAEGQPFRFTNDLDIDEQEDVIYFTDTSARFQRRQFLAAVLNVDKTGRFIKYDRSSK 207

Query:  664 EAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHRYWIKGPKAGTRDIFAKVPGYPDNI 485
            +A V++  L   NG+AL+KDRSF++  E+ T  + R W+ GP AGT  +FA++PG+PDNI
Sbjct:  208 KATVLLQGLAFANGVALSKDRSFVLVVETTTCKILRLWLSGPNAGTHQVFAELPGFPDNI 267

Query:  484 RLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEKTVKLELLIGLVNGFKPHGVAVKIS 305
            R    G+FW+ +H KK LF +L++        ++   +  + L  L  G  PH  A+K+S
Sbjct:  268 RRNSNGEFWVALHSKKGLFAKLSLTQTWFRDLVLRLPISPQRLHSLFTGGIPHATAIKLS 327

Query:  304 GETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGSVFKPAVWV 173
             E+G+++E+LEDKEGKT++++SE  E+ DGK+W GSV  P + V
Sbjct:  328 -ESGKVLEVLEDKEGKTLRFISEVEEK-DGKLWIGSVLVPFLGV 369

>gi|30694556|ref|NP_191262.2| strictosidine synthase family protein [Arabidopsis
        thaliana]

          Length = 374

 Score =  330 bits (846), Expect = 3e-088
 Identities = 159/344 (46%), Positives = 232/344 (67%), Gaps = 6/344 (1%)
 Frame = -2

Query: 1192 LIAPDNLNGTKNVLSMAKTIPLP-VDGPESIEWDPQGGGPYAAVVDGRILKWRGDGLGWV 1016
            + AP  ++G+++V   AK + L    GPESI +DP G GPY  V DGRILKWRG+ LGW 
Sbjct:   30 IFAPPEISGSRDVFPSAKVVNLTGASGPESIAFDPAGEGPYVGVSDGRILKWRGEPLGWS 89

Query: 1015 EFAYTSPHRGNCSR---HEVVPTCGRPLGLSFEKKTGDLYICDGYFGVMKVGPEGGLAEL 845
            +FA+TS +R  C+R    E+   CGRPLGL F+KKTGDLYI D YFG++ VGP GGLA+ 
Sbjct:   90 DFAHTSSNRQECARPFAPELEHVCGRPLGLRFDKKTGDLYIADAYFGLLVVGPAGGLAKP 149

Query:  844 VVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFREVFYVTVNGERSGRVIRYNKKTK 665
            +V + EG+   F N +DIDE+EDV YF D+S ++  R+     +N +++GR I+Y++ +K
Sbjct:  150 LVTEAEGQPFRFTNDLDIDEQEDVIYFTDTSARFQRRQFLAAVLNVDKTGRFIKYDRSSK 209

Query:  664 EAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHRYWIKGPKAGTRDIFAKVPGYPDNI 485
            +A V++  L   NG+AL+KDRSF++  E+ T  + R W+ GP AGT  +FA++PG+PDNI
Sbjct:  210 KATVLLQGLAFANGVALSKDRSFVLVVETTTCKILRLWLSGPNAGTHQVFAELPGFPDNI 269

Query:  484 RLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEKTVKLELLIGLVNGFKPHGVAVKIS 305
            R    G+FW+ +H KK LF +L++        ++   +  + L  L  G  PH  A+K+S
Sbjct:  270 RRNSNGEFWVALHSKKGLFAKLSLTQTWFRDLVLRLPISPQRLHSLFTGGIPHATAIKLS 329

Query:  304 GETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGSVFKPAVWV 173
             E+G+++E+LEDKEGKT++++SE  E+ DGK+W GSV  P + V
Sbjct:  330 -ESGKVLEVLEDKEGKTLRFISEVEEK-DGKLWIGSVLVPFLGV 371

>gi|297820490|ref|XP_002878128.1| strictosidine synthase family protein
        [Arabidopsis lyrata subsp. lyrata]

          Length = 374

 Score =  329 bits (842), Expect = 9e-088
 Identities = 156/344 (45%), Positives = 231/344 (67%), Gaps = 6/344 (1%)
 Frame = -2

Query: 1192 LIAPDNLNGTKNVLSMAKTIPLP-VDGPESIEWDPQGGGPYAAVVDGRILKWRGDGLGWV 1016
            + AP  ++G+++V   AK + L    GPESI +DP G GPY  V DGR+LKWR + LGW 
Sbjct:   30 IFAPPEISGSRDVFPSAKVVTLTGASGPESIAFDPAGEGPYVGVSDGRVLKWRSESLGWS 89

Query: 1015 EFAYTSPHRGNCSR---HEVVPTCGRPLGLSFEKKTGDLYICDGYFGVMKVGPEGGLAEL 845
            +FAYTS +R  C R    E+   CGRPLGL F+KKTGDLYI D YFG++ VGP GGLA+ 
Sbjct:   90 DFAYTSSNRQECVRPFAPELEHVCGRPLGLRFDKKTGDLYIADAYFGLLVVGPAGGLAKP 149

Query:  844 VVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFREVFYVTVNGERSGRVIRYNKKTK 665
            +V + EG+   F N +DIDE+EDV YF D+S ++  R+     +N +++GR I+Y++ +K
Sbjct:  150 LVTEAEGQPFRFTNDLDIDEQEDVIYFTDTSARFQRRQFLAAVLNVDKTGRFIKYDRSSK 209

Query:  664 EAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHRYWIKGPKAGTRDIFAKVPGYPDNI 485
            +A V++  L   NG+AL+KDRSF++  E+ T  + R W+ GP AGT ++FA++PG+PDNI
Sbjct:  210 KATVLLQGLAFANGVALSKDRSFVLVVETTTCKILRLWLSGPNAGTHEVFAELPGFPDNI 269

Query:  484 RLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEKTVKLELLIGLVNGFKPHGVAVKIS 305
            R    G+FW+ +H KK LF +L+++       ++   +  + L  L  G +PH  A+K+S
Sbjct:  270 RRNSNGEFWVALHSKKGLFAKLSLSQTWFRDLVLRLPISPQRLHSLFTGGRPHATAIKLS 329

Query:  304 GETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGSVFKPAVWV 173
             E+G+++E+LED EGK ++++SE  E+ DGK+W GSV  P + V
Sbjct:  330 -ESGKVLEVLEDNEGKRLRFISEVEEK-DGKLWIGSVLMPFLGV 371

>gi|110743953|dbj|BAE99809.1| hypothetical protein [Arabidopsis thaliana]

          Length = 374

 Score =  328 bits (839), Expect = 2e-087
 Identities = 158/344 (45%), Positives = 231/344 (67%), Gaps = 6/344 (1%)
 Frame = -2

Query: 1192 LIAPDNLNGTKNVLSMAKTIPLP-VDGPESIEWDPQGGGPYAAVVDGRILKWRGDGLGWV 1016
            + AP  ++G+++V   AK + L    GPESI +DP G GPY  V DGRILKWRG+ LGW 
Sbjct:   30 IFAPPEISGSRDVFPSAKVVNLTGASGPESIAFDPAGEGPYVGVSDGRILKWRGEPLGWS 89

Query: 1015 EFAYTSPHRGNCSR---HEVVPTCGRPLGLSFEKKTGDLYICDGYFGVMKVGPEGGLAEL 845
            +FA+TS +R  C+R    E+   CGRPLGL F+KKTGDLYI D YFG++ VGP GGLA+ 
Sbjct:   90 DFAHTSSNRQECARPFAPELEHVCGRPLGLRFDKKTGDLYIADAYFGLLVVGPAGGLAKP 149

Query:  844 VVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFREVFYVTVNGERSGRVIRYNKKTK 665
            +V + EG+   F N +DIDE+EDV YF D+S ++  R+     +N +++GR I+Y++ +K
Sbjct:  150 LVTEAEGQPFRFTNDLDIDEQEDVIYFTDTSARFQRRQFLAAVLNVDKTGRFIKYDRSSK 209

Query:  664 EAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHRYWIKGPKAGTRDIFAKVPGYPDNI 485
            +A V++  L   NG+AL+KDRSF++  E+ T  + R W+ GP AGT  +FA++PG+PDNI
Sbjct:  210 KATVLLQGLAFANGVALSKDRSFVLVVETTTCKILRLWLSGPNAGTHQVFAELPGFPDNI 269

Query:  484 RLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEKTVKLELLIGLVNGFKPHGVAVKIS 305
            R    G+FW+ +H KK LF +L++        ++   +  + L  L  G  PH  A+K+S
Sbjct:  270 RRNSNGEFWVALHSKKGLFAKLSLTQTWFRDLVLRLPISPQRLHSLFTGGIPHATAIKLS 329

Query:  304 GETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGSVFKPAVWV 173
             E+G+++E+L DKEGKT++++SE  E+ DGK+W GSV  P + V
Sbjct:  330 -ESGKVLEVLGDKEGKTLRFISEVEEK-DGKLWIGSVLVPFLGV 371

>gi|255583680|ref|XP_002532594.1| strictosidine synthase, putative [Ricinus
        communis]

          Length = 372

 Score =  322 bits (825), Expect = 8e-086
 Identities = 158/362 (43%), Positives = 233/362 (64%), Gaps = 6/362 (1%)
 Frame = -2

Query: 1258 KIPTWAVVPATFAVFSVISYQILIAPDNLNGTKNVLSMAKTIPLP-VDGPESIEWDPQGG 1082
            K+   A      A   + +   + AP  L  + + L  AK +P+    GPES+ +DP G 
Sbjct:    6 KVGVAATAIVALASIIITNPNNIFAPPPLPSSNDNLHSAKIVPITGAVGPESLVFDPNGE 65

Query: 1081 GPYAAVVDGRILKWRGDGLGWVEFAYTSPHRGNCSR---HEVVPTCGRPLGLSFEKKTGD 911
            GPY  V DGRILKW+GD LGW +FA+T+ +R  C R    E+   CGRPLGL F+KKTGD
Sbjct:   66 GPYTGVADGRILKWQGDSLGWTDFAFTTSNRKECIRPFAPELEHVCGRPLGLRFDKKTGD 125

Query:  910 LYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFRE 731
            LYI D Y G+  VGP GGLA  VV +VEG  + F N MDIDE+ DV YF D+S  +  R+
Sbjct:  126 LYIADAYLGLQVVGPNGGLATPVVSEVEGHPLRFTNDMDIDEQNDVIYFTDTSKIFQRRQ 185

Query:  730 VFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHRYW 551
                 ++ +++GR+++Y+K +KE  ++++ L   NG+AL+KDRSF++  E++T  + R+W
Sbjct:  186 FMASILHKDKTGRLLKYDKSSKEVTILLEGLSFANGVALSKDRSFVLVAETSTCQISRFW 245

Query:  550 IKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEKTV 371
            + GP AG  D+FAK+PG+PDNIR    G+FW+ +H K+    +LA++N      L++  +
Sbjct:  246 LHGPNAGKVDVFAKLPGFPDNIRRNSKGEFWVALHAKEGFLAKLALSNSWIGKTLLKFPL 305

Query:  370 KLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGSVF 191
              + L  L+ G KPH  A+K+SG+ G+IV++LED +GK ++++SE  E+ DGK+W GSV 
Sbjct:  306 SFKQLHSLLVGGKPHATAIKLSGD-GKIVQVLEDCDGKRLRFISEVEEK-DGKLWIGSVL 363

Query:  190 KP 185
             P
Sbjct:  364 MP 365

>gi|225441250|ref|XP_002273764.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 370

 Score =  322 bits (823), Expect = 1e-085
 Identities = 160/369 (43%), Positives = 236/369 (63%), Gaps = 7/369 (1%)
 Frame = -2

Query: 1267 ISEKIPTWAVVPATFAVFSVISYQILIAPDNLNGTKNVLSMAKTIPLP-VDGPESIEWDP 1091
            ++ K+   A+  A  ++   ++   L  P ++ GT ++L  ++ I +    GPESI +DP
Sbjct:    1 MNTKLILTAITLAAISIILAVNSNHLFKPPSIPGTHDLLHGSEVIQVTGAFGPESIAFDP 60

Query: 1090 QGGGPYAAVVDGRILKWRGDGLGWVEFAYTSPHRGNCSR---HEVVPTCGRPLGLSFEKK 920
            +G GPY  V DGR+LKW GDG GW +FA T+  R  C R    E+   CGRPLGL F+KK
Sbjct:   61 KGEGPYTGVADGRVLKWEGDGRGWTDFAVTTSERKECVRPFAPEMEHICGRPLGLRFDKK 120

Query:  919 TGDLYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYH 740
            TGDLYI D YFG+  V P GGLA  +V +VEGR+++F N MDIDE EDV YF D+S  +H
Sbjct:  121 TGDLYIADAYFGLQVVEPNGGLATPLVTEVEGRRLLFTNDMDIDEVEDVIYFTDTSTDFH 180

Query:  739 FREVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCESATGLVH 560
             R+     ++G+ +GR+++Y+K +KE  V++  L   NG+A++KDRSF++  E+ TG + 
Sbjct:  181 RRQFMAALLSGDNTGRLMKYDKSSKEVTVLLRGLAFANGVAMSKDRSFVLVAETTTGKII 240

Query:  559 RYWIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVE 380
            RYW+KGP AG  D+FA+VPGYPDN+R    G+FW+ +H KK        +N      L++
Sbjct:  241 RYWLKGPNAGKSDVFAEVPGYPDNVRRNSKGEFWVALHAKKGPHANWITSNSWVGKTLLK 300

Query:  379 KTVKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFG 200
              +  + L  L+   + H  A+K+S E G+++E+LED EGK+M+++SE  E  +GK+W G
Sbjct:  301 LPLTFKQLHKLI-VVEAHATAIKLS-EEGQVLEVLEDCEGKSMRFISEV-EEHNGKLWLG 357

Query:  199 SVFKPAVWV 173
            SV  P + V
Sbjct:  358 SVMMPFIGV 366

>gi|156763850|emb|CAO99127.1| strictosidine synthase-like protein [Nicotiana
        tabacum]

          Length = 380

 Score =  321 bits (822), Expect = 2e-085
 Identities = 156/343 (45%), Positives = 224/343 (65%), Gaps = 6/343 (1%)
 Frame = -2

Query: 1183 PDNLNGTKNVLSMAKTIPLP-VDGPESIEWDPQGGGPYAAVVDGRILKWRGDGLGWVEFA 1007
            P  + G+++VLS A+ I L    G ES+ +DP G GPY  V DGRILKW+     WV+FA
Sbjct:   38 PAPIPGSQDVLSKAELIQLKGAFGAESVAFDPNGEGPYTGVADGRILKWQPHSQTWVDFA 97

Query: 1006 YTSPHRGNCSR---HEVVPTCGRPLGLSFEKKTGDLYICDGYFGVMKVGPEGGLAELVVD 836
             TS  R NCSR    E+   CGRPLGL F+ KTGDLYI D YFG+  VGP GGLA  +V 
Sbjct:   98 VTSSQRKNCSRPSAPEMEHVCGRPLGLRFDHKTGDLYIADAYFGLHVVGPTGGLATPLVQ 157

Query:  835 QVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFREVFYVTVNGERSGRVIRYNKKTKEAK 656
              EG+ ++F N +DID+++D+ YF D+S  Y  R+    T +G+++GR+++YNK TKE  
Sbjct:  158 DFEGQPLLFTNDLDIDDDDDIIYFTDTSTIYQRRQFVAATASGDKTGRLMKYNKSTKEVT 217

Query:  655 VVMDNLRCNNGLALNKDRSFLISCESATGLVHRYWIKGPKAGTRDIFAKVPGYPDNIRLT 476
            V +  L   NG+AL+KDRSFL+  E++   + RYW+KGP  G  DIFA++PG+PDN+R+ 
Sbjct:  218 VALGGLAFANGVALSKDRSFLLVAETSACRILRYWLKGPNVGNHDIFAELPGFPDNVRIN 277

Query:  475 PTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEKTVKLELLIGLVNGFKPHGVAVKISGET 296
              G+FW+ +H K +   RL ++N   LGK + +    + L  L+ G +PH  A+K+S E 
Sbjct:  278 SRGEFWVALHAKASPLARLIISN-SWLGKTLLREFNFQQLHNLLVGGQPHATAIKLS-ED 335

Query:  295 GEIVEILEDKEGKTMQYVSEAYERDDGKIWFGSVFKPAVWVLD 167
            G ++E+LED EGK ++++SE +E + GK+W  SV   ++ V D
Sbjct:  336 GRVLEVLEDVEGKILRFISEVHEEESGKLWISSVIMSSLGVYD 378

>gi|224139742|ref|XP_002323255.1| predicted protein [Populus trichocarpa]

          Length = 375

 Score =  290 bits (742), Expect = 3e-076
 Identities = 147/343 (42%), Positives = 219/343 (63%), Gaps = 9/343 (2%)
 Frame = -2

Query: 1192 LIAPDNLNGTKNVLSMAKTIPLP-VDGPESIEWDPQGGGPYAAVVDGRILKW--RGDGLG 1022
            L+ P  +  + + L  AK + +    GPES+ +DP G GPY  V DGR+LKW    DG G
Sbjct:   28 LLGPPTIPTSNDHLHSAKILHVSGAVGPESLVFDPNGEGPYTGVADGRVLKWIAGDDGSG 87

Query: 1021 -WVEFAYTSPHRGNCSR---HEVVPTCGRPLGLSFEKKTGDLYICDGYFGVMKVGPEGGL 854
             W +FA TS +R  C R    E+   CGRPLGL F+KKTG+LYI D Y G+  VGP GGL
Sbjct:   88 SWTDFATTSSNRNECVRPFAPEMEHVCGRPLGLRFDKKTGNLYIADAYLGLQVVGPTGGL 147

Query:  853 AELVVDQVEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFREVFYVTVNGERSGRVIRYNK 674
            A  VV ++EG+ + F N +DIDE+EDV YF D+S  +  R+     +  +++GR+++Y+K
Sbjct:  148 ATPVVTELEGQPMRFTNDLDIDEQEDVIYFTDTSMVFQRRQFILSLLTKDKTGRLLKYDK 207

Query:  673 KTKEAKVVMDNLRCNNGLALNKDRSFLISCESATGLVHRYWIKGPKAGTRDIFAKVPGYP 494
             +KE  V+   L   NG+AL+KD +FL+  E+ T  + R+W+ GP AG  D+F ++PG+P
Sbjct:  208 SSKEVTVLARGLAFANGVALSKDSTFLLVAETTTCRILRFWLHGPNAGKSDVFTELPGFP 267

Query:  493 DNIRLTPTGDFWIGIHCKKNLFGRLAVNNYQCLGKLVEKTVKLELLIGLVNGFKPHGVAV 314
            DNIR    G+FW+ +H KK LF ++ ++N      L++  +  + L  L+ G K H  A+
Sbjct:  268 DNIRRNSKGEFWVALHSKKGLFAKVVLSNSWIGKTLLKFPLSFKQLHSLLVGGKAHATAI 327

Query:  313 KISGETGEIVEILEDKEGKTMQYVSEAYERDDGKIWFGSVFKP 185
            K+S E G+++++LED +GKT++++SE  E+ DGK+W GSV  P
Sbjct:  328 KLS-EEGKVLDVLEDCDGKTLRFISEVEEK-DGKLWIGSVLMP 368

>gi|224139738|ref|XP_002323253.1| predicted protein [Populus trichocarpa]

          Length = 349

 Score =  265 bits (675), Expect = 2e-068
 Identities = 128/316 (40%), Positives = 202/316 (63%), Gaps = 8/316 (2%)
 Frame = -2

Query: 1117 GPESIEWDPQGGGPYAAVVDGRILKWRGDGLGWVEFAYTSPHRGNC----SRHEVVPTCG 950
            GPES  +D  G GPY ++ DGRI+KW+GD   W++FA TSP+R  C      H++   CG
Sbjct:   30 GPESFAFDSLGEGPYTSLSDGRIIKWQGDKKRWIDFAVTSPNRDGCGGPHDHHQMEHVCG 89

Query:  949 RPLGLSFEKKTGDLYICDGYFGVMKVGPEGGLAELVVDQVEGRKVMFANQMDIDEEEDVF 770
            RPLG  F++  GDLYI D Y G+++VGPEGGLA  +    +G    F N +DID+     
Sbjct:   90 RPLGSCFDETHGDLYIADAYMGLLRVGPEGGLATKIATHAQGIPFRFTNSLDIDQSSGAI 149

Query:  769 YFNDSSDKYHFREVFYVTVNGERSGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLI 590
            YF DSS +Y  R+   V ++G++SGR+++Y+  +K+  V++ NL   NG+AL+ D SF++
Sbjct:  150 YFTDSSTQYQRRDYLSVVLSGDKSGRLMKYDTASKQVTVLLKNLTFPNGVALSTDGSFVL 209

Query:  589 SCESATGLVHRYWIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWIGIHCKKNLFGRLAVN 410
              E+ +  + RYWIK  KAG  ++FA++ G+PDNI+ +P G +W+GI+ K+     L + 
Sbjct:  210 LAETTSCRILRYWIKTSKAGALEVFAQLQGFPDNIKRSPRGGYWVGINSKREKLSEL-LF 268

Query:  409 NYQCLGK-LVEKTVKLELLIGLVNGFKPHGVAVKISGETGEIVEILEDKEGKTMQYVSEA 233
            +Y  +GK L++  + +      +  ++  G+AV++S E G+IVE+ ED++G  ++ +SE 
Sbjct:  269 SYPWIGKVLLKLPLDITKFQTALAKYRGGGLAVRLS-ENGDIVEVFEDRDGNRLKSISEV 327

Query:  232 YERDDGKIWFGSVFKP 185
             E+ DGK+W GS+  P
Sbjct:  328 MEK-DGKLWIGSIDLP 342

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,587,916,767,856
Number of Sequences: 15229318
Number of Extensions: 4587916767856
Number of Successful Extensions: 1071853064
Number of sequences better than 0.0: 0