Library    |     Search    |     Batch query    |     SNP    |     SSR  

GenBank blast output of UN21852


BLASTX 7.6.2

Query= UN21852 /QuerySize=1414
        (1413 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|15230182|ref|NP_191260.1| strictosidine synthase family prote...    675   7e-192
gi|13877837|gb|AAK43996.1|AF370181_1 unknown protein [Arabidopsi...    673   2e-191
gi|297820484|ref|XP_002878125.1| strictosidine synthase family p...    663   2e-188
gi|6759491|emb|CAB69786.1| hypothetical protein [Arabidopsis tha...    627   2e-177
gi|15230200|ref|NP_191261.1| strictosidine synthase family prote...    555   1e-155
gi|312281991|dbj|BAJ33861.1| unnamed protein product [Thellungie...    543   4e-152
gi|297827769|ref|XP_002881767.1| hypothetical protein ARALYDRAFT...    525   1e-146
gi|145360869|ref|NP_181662.3| strictosidine synthase-like 1 [Ara...    473   3e-131
gi|3894193|gb|AAC78542.1| putative strictosidine synthase [Arabi...    469   8e-130
gi|79315403|ref|NP_001030876.1| strictosidine synthase family pr...    416   5e-114
gi|6911873|emb|CAB72173.1| putative protein [Arabidopsis thaliana]     338   2e-090
gi|30694556|ref|NP_191262.2| strictosidine synthase family prote...    338   2e-090
gi|297820490|ref|XP_002878128.1| strictosidine synthase family p...    337   3e-090
gi|110743953|dbj|BAE99809.1| hypothetical protein [Arabidopsis t...    335   1e-089
gi|225441250|ref|XP_002273764.1| PREDICTED: hypothetical protein...    323   5e-086
gi|156763850|emb|CAO99127.1| strictosidine synthase-like protein...    318   2e-084
gi|224139742|ref|XP_002323255.1| predicted protein [Populus tric...    305   1e-080
gi|297820488|ref|XP_002878127.1| strictosidine synthase family p...    290   5e-076
gi|224139738|ref|XP_002323253.1| predicted protein [Populus tric...    287   4e-075
gi|225441248|ref|XP_002267323.1| PREDICTED: hypothetical protein...    276   7e-072

>gi|15230182|ref|NP_191260.1| strictosidine synthase family protein [Arabidopsis
        thaliana]

          Length = 376

 Score =  675 bits (1740), Expect = 7e-192
 Identities = 317/374 (84%), Positives = 344/374 (91%)
 Frame = +1

Query:   61 MPISRRVLTPVAAAPVILAVVCYLFWSTIIEPDRLEGTKHVLQVAKTIPLPGVGPESLEF 240
            MPISRRVLTP+ AAPVILAV+C+ FWS+II PD L+GTKHVLQ AKTIPLP  GPESLEF
Sbjct:    1 MPISRRVLTPITAAPVILAVLCFFFWSSIIGPDNLKGTKHVLQDAKTIPLPVDGPESLEF 60

Query:  241 DSQGEGPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSRNQVVPSCGRPLGLSFHRKT 420
            D QGEGPYVGVTDGRILKWRGEE GWVDFAYTSPHRDNCS ++VVPSCGRPLGLSF RKT
Sbjct:   61 DPQGEGPYVGVTDGRILKWRGEELGWVDFAYTSPHRDNCSSHEVVPSCGRPLGLSFERKT 120

Query:  421 GDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHF 600
            GDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQ DIDEEED+FYFNDSSD YHF
Sbjct:  121 GDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQGDIDEEEDIFYFNDSSDTYHF 180

Query:  601 GEVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHR 780
             +VFYVS+SG KVGRVIRYDMKKKEAKVIMDKL LPNGLALSK+GSFV+TCE  T I HR
Sbjct:  181 RDVFYVSLSGTKVGRVIRYDMKKKEAKVIMDKLRLPNGLALSKNGSFVVTCESSTNICHR 240

Query:  781 IWVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMHT 960
            IWVKGPK+GT EVFA +PG PDNIRRTPTGDFWVALHCK NLFTR  LIH+WVG+FFM+T
Sbjct:  241 IWVKGPKSGTNEVFATLPGSPDNIRRTPTGDFWVALHCKKNLFTRAVLIHTWVGRFFMNT 300

Query:  961 LKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSV 1140
            +K+ETV+H MNGGKPHGI++KLSGETGEI+E+LEDSEG T+KYVSEAYE +DGKLWIGSV
Sbjct:  301 MKMETVIHFMNGGKPHGIVVKLSGETGEILEILEDSEGKTVKYVSEAYETKDGKLWIGSV 360

Query: 1141 YWPAVWVLDKSVYE 1182
            YWPAVWVLD SVY+
Sbjct:  361 YWPAVWVLDTSVYD 374

>gi|13877837|gb|AAK43996.1|AF370181_1 unknown protein [Arabidopsis thaliana]

          Length = 376

 Score =  673 bits (1736), Expect = 2e-191
 Identities = 316/374 (84%), Positives = 343/374 (91%)
 Frame = +1

Query:   61 MPISRRVLTPVAAAPVILAVVCYLFWSTIIEPDRLEGTKHVLQVAKTIPLPGVGPESLEF 240
            MPISRRVLTP+ AAPVILAV+C+ FWS+II PD L+GTKHVLQ AKTIPLP  GPESLEF
Sbjct:    1 MPISRRVLTPITAAPVILAVLCFFFWSSIIGPDNLKGTKHVLQDAKTIPLPVDGPESLEF 60

Query:  241 DSQGEGPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSRNQVVPSCGRPLGLSFHRKT 420
            D QGEGPYVGVTDGRILKWRGEE GWVDFAYTSPHRDNCS ++VVPSCGRPLGLSF RKT
Sbjct:   61 DPQGEGPYVGVTDGRILKWRGEELGWVDFAYTSPHRDNCSSHEVVPSCGRPLGLSFERKT 120

Query:  421 GDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHF 600
            GDLYICDGYFGVMKVGPEGGL ELVVDEAEGRKVMFANQ DIDEEED+FYFNDSSD YHF
Sbjct:  121 GDLYICDGYFGVMKVGPEGGLGELVVDEAEGRKVMFANQGDIDEEEDIFYFNDSSDTYHF 180

Query:  601 GEVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHR 780
             +VFYVS+SG KVGRVIRYDMKKKEAKVIMDKL LPNGLALSK+GSFV+TCE  T I HR
Sbjct:  181 RDVFYVSLSGTKVGRVIRYDMKKKEAKVIMDKLRLPNGLALSKNGSFVVTCESSTNICHR 240

Query:  781 IWVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMHT 960
            IWVKGPK+GT EVFA +PG PDNIRRTPTGDFWVALHCK NLFTR  LIH+WVG+FFM+T
Sbjct:  241 IWVKGPKSGTNEVFATLPGSPDNIRRTPTGDFWVALHCKKNLFTRAVLIHTWVGRFFMNT 300

Query:  961 LKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSV 1140
            +K+ETV+H MNGGKPHGI++KLSGETGEI+E+LEDSEG T+KYVSEAYE +DGKLWIGSV
Sbjct:  301 MKMETVIHFMNGGKPHGIVVKLSGETGEILEILEDSEGKTVKYVSEAYETKDGKLWIGSV 360

Query: 1141 YWPAVWVLDKSVYE 1182
            YWPAVWVLD SVY+
Sbjct:  361 YWPAVWVLDTSVYD 374

>gi|297820484|ref|XP_002878125.1| strictosidine synthase family protein
        [Arabidopsis lyrata subsp. lyrata]

          Length = 371

 Score =  663 bits (1710), Expect = 2e-188
 Identities = 311/369 (84%), Positives = 340/369 (92%)
 Frame = +1

Query:   61 MPISRRVLTPVAAAPVILAVVCYLFWSTIIEPDRLEGTKHVLQVAKTIPLPGVGPESLEF 240
            MPISRRVLTPV+AAPVILAV+C+ FWS+II PD ++GTKHVLQ AKTIPLP  GPESLEF
Sbjct:    1 MPISRRVLTPVSAAPVILAVLCFFFWSSIIGPDNIKGTKHVLQDAKTIPLPADGPESLEF 60

Query:  241 DSQGEGPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSRNQVVPSCGRPLGLSFHRKT 420
            D QGEGPYVGVTDGRILKWRGEE GWVDFAYTSPHRDNCSR++VVPSCGRPLGL+F +KT
Sbjct:   61 DPQGEGPYVGVTDGRILKWRGEELGWVDFAYTSPHRDNCSRHEVVPSCGRPLGLTFEKKT 120

Query:  421 GDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHF 600
            GDLYICDGYFG+MKVGP+GGLAELVVDEAEGRKVMFANQ DIDEEED+FYFNDSSD YHF
Sbjct:  121 GDLYICDGYFGLMKVGPQGGLAELVVDEAEGRKVMFANQGDIDEEEDIFYFNDSSDTYHF 180

Query:  601 GEVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHR 780
             EVFYVS+SG KVGRVIRYDMKKKEAKVIMDKL LPNGLALSK+GSFV+TCE  T I HR
Sbjct:  181 REVFYVSLSGTKVGRVIRYDMKKKEAKVIMDKLRLPNGLALSKNGSFVVTCESSTNICHR 240

Query:  781 IWVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMHT 960
            IWVKGPK+GT EVFA +PG PDNIRRTPTGDFWVALHCK NLFTR+ LIHS VG+FFM+T
Sbjct:  241 IWVKGPKSGTNEVFATLPGSPDNIRRTPTGDFWVALHCKKNLFTRVALIHSLVGRFFMNT 300

Query:  961 LKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSV 1140
            +K+ETV+H MNGGKPHGI++KLSGETGEI+E+LEDSEG T+KY SEAYE EDGKLWIGSV
Sbjct:  301 MKMETVIHFMNGGKPHGIVVKLSGETGEILEILEDSEGKTVKYASEAYETEDGKLWIGSV 360

Query: 1141 YWPAVWVLD 1167
            YWPAVWV D
Sbjct:  361 YWPAVWVYD 369

>gi|6759491|emb|CAB69786.1| hypothetical protein [Arabidopsis thaliana]

          Length = 352

 Score =  627 bits (1616), Expect = 2e-177
 Identities = 295/348 (84%), Positives = 318/348 (91%)
 Frame = +1

Query:  139 STIIEPDRLEGTKHVLQVAKTIPLPGVGPESLEFDSQGEGPYVGVTDGRILKWRGEEHGW 318
            S II PD L+GTKHVLQ AKTIPLP  GPESLEFD QGEGPYVGVTDGRILKWRGEE GW
Sbjct:    3 SAIIGPDNLKGTKHVLQDAKTIPLPVDGPESLEFDPQGEGPYVGVTDGRILKWRGEELGW 62

Query:  319 VDFAYTSPHRDNCSRNQVVPSCGRPLGLSFHRKTGDLYICDGYFGVMKVGPEGGLAELVV 498
            VDFAYTSPHRDNCS ++VVPSCGRPLGLSF RKTGDLYICDGYFGVMKVGPEGGLAELVV
Sbjct:   63 VDFAYTSPHRDNCSSHEVVPSCGRPLGLSFERKTGDLYICDGYFGVMKVGPEGGLAELVV 122

Query:  499 DEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFGEVFYVSISGDKVGRVIRYDMKKKEA 678
            DEAEGRKVMFANQ DIDEEED+FYFNDSSD YHF +VFYVS+SG KVGRVIRYDMKKKEA
Sbjct:  123 DEAEGRKVMFANQGDIDEEEDIFYFNDSSDTYHFRDVFYVSLSGTKVGRVIRYDMKKKEA 182

Query:  679 KVIMDKLHLPNGLALSKDGSFVITCEGGTGILHRIWVKGPKAGTTEVFAKVPGPPDNIRR 858
            KVIMDKL LPNGLALSK+GSFV+TCE  T   HRIWVKGPK+GT EVFA +PG PDNIRR
Sbjct:  183 KVIMDKLRLPNGLALSKNGSFVVTCESSTNTCHRIWVKGPKSGTNEVFATLPGSPDNIRR 242

Query:  859 TPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMHTLKLETVVHLMNGGKPHGIILKLSGET 1038
            TPTGDFWVALHCK NLFTR  LIH+WVG+FFM+T+K+ETV+H MNGGKPHGI++KLSGET
Sbjct:  243 TPTGDFWVALHCKKNLFTRAVLIHTWVGRFFMNTMKMETVIHFMNGGKPHGIVVKLSGET 302

Query: 1039 GEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSVYWPAVWVLDKSVYE 1182
            GEI+E+LEDSEG T+KYVSEAYE +DGKLWIGSVYWPAVWVLD SVY+
Sbjct:  303 GEILEILEDSEGKTVKYVSEAYETKDGKLWIGSVYWPAVWVLDTSVYD 350

>gi|15230200|ref|NP_191261.1| strictosidine synthase family protein [Arabidopsis
        thaliana]

          Length = 370

 Score =  555 bits (1428), Expect = 1e-155
 Identities = 255/369 (69%), Positives = 307/369 (83%), Gaps = 1/369 (0%)
 Frame = +1

Query:   64 PISRRVLTPVAAAPVILAVVCYLFWSTIIEPDRLEGTKHVLQVAKTIPLPGVGPESLEFD 243
            PI++++ T   A P + AV+  + + T+I P+ LEG K+VL +AKTIP+P  GPES+EFD
Sbjct:    2 PINQKIPT-WFAVPAVFAVLSVISYQTLIVPENLEGAKNVLTMAKTIPIPVAGPESIEFD 60

Query:  244 SQGEGPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSRNQVVPSCGRPLGLSFHRKTG 423
             +GEGPY  V DGRILKWRG++ GWVDFAYTSPHR NCS+ +VVP+CGRPLGL+F +KTG
Sbjct:   61 PKGEGPYAAVVDGRILKWRGDDLGWVDFAYTSPHRGNCSKTEVVPTCGRPLGLTFEKKTG 120

Query:  424 DLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFG 603
            DLYICDGY G+MKVGPEGGLAEL+VDEAEGRKVMFANQ DIDEEEDVFYFNDSSDKYHF 
Sbjct:  121 DLYICDGYLGLMKVGPEGGLAELIVDEAEGRKVMFANQGDIDEEEDVFYFNDSSDKYHFR 180

Query:  604 EVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHRI 783
            +VF+V++SG++ GRVIRYD K KEAKVIMD L   NGLAL+KD SF+ITCE GT ++HR 
Sbjct:  181 DVFFVAVSGERSGRVIRYDKKTKEAKVIMDNLVCNNGLALNKDRSFLITCESGTSLVHRY 240

Query:  784 WVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMHTL 963
            W+KGPKAGT ++FAKVPG PDNIR T TGDFW+ LHCK NL  RL + + W+GK    T+
Sbjct:  241 WIKGPKAGTRDIFAKVPGYPDNIRLTSTGDFWIGLHCKKNLIGRLIVKYKWLGKLVEKTM 300

Query:  964 KLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSVY 1143
            KLE V+  +NG KPHG+ +K+SGETGE++E+LED EG TMKYVSEAYER+DGKLW GSVY
Sbjct:  301 KLEYVIAFINGFKPHGVAVKISGETGEVLELLEDKEGKTMKYVSEAYERDDGKLWFGSVY 360

Query: 1144 WPAVWVLDK 1170
            WPAVWVLD+
Sbjct:  361 WPAVWVLDR 369

>gi|312281991|dbj|BAJ33861.1| unnamed protein product [Thellungiella halophila]

          Length = 370

 Score =  543 bits (1397), Expect = 4e-152
 Identities = 254/369 (68%), Positives = 304/369 (82%), Gaps = 1/369 (0%)
 Frame = +1

Query:   64 PISRRVLTPVAAAPVILAVVCYLFWSTIIEPDRLEGTKHVLQVAKTIPLPGVGPESLEFD 243
            P+S++V T  AA P +LAV   + + TII PD L+GTKHVL +AKTIPLP  GPES+E+D
Sbjct:    2 PLSQKVPT-WAAVPAVLAVFSVISYQTIIAPDNLKGTKHVLSMAKTIPLPVHGPESIEWD 60

Query:  244 SQGEGPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSRNQVVPSCGRPLGLSFHRKTG 423
             QG GPY  V DGRILKW+G+  GWV+FAYTSPHR NCSR++VVP+CGRPLGL F +KTG
Sbjct:   61 PQGGGPYAAVVDGRILKWQGDGIGWVEFAYTSPHRGNCSRHEVVPTCGRPLGLKFEKKTG 120

Query:  424 DLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFG 603
            DLYICDGY GVMKVGPEGGLAELVVD+AEGRKVMFANQ+DIDEEEDV YFNDSSDKYHF 
Sbjct:  121 DLYICDGYLGVMKVGPEGGLAELVVDQAEGRKVMFANQIDIDEEEDVLYFNDSSDKYHFR 180

Query:  604 EVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHRI 783
            EVFYV+ +GD+ GRVIRY+ K KEAKV+MD L   NGLAL+KD SF+I+CE  TG++HR 
Sbjct:  181 EVFYVASNGDRTGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCESSTGLVHRY 240

Query:  784 WVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMHTL 963
            W+KGPKAGT ++FAKVPG PDNIR TPTGDFW+ +HCK N   R  + + W+GK    T+
Sbjct:  241 WIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWLGIHCKKNPLGRFMINNRWLGKIVEKTV 300

Query:  964 KLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSVY 1143
             L+ ++ +MNG KPHGI +K+SGETGEI+E+LED EG TM+YVSEAYER+DGKLW GSV+
Sbjct:  301 NLDLLIAVMNGFKPHGIAVKISGETGEILEVLEDIEGKTMQYVSEAYERDDGKLWFGSVF 360

Query: 1144 WPAVWVLDK 1170
             PAVWVLD+
Sbjct:  361 TPAVWVLDR 369

>gi|297827769|ref|XP_002881767.1| hypothetical protein ARALYDRAFT_321816
        [Arabidopsis lyrata subsp. lyrata]

          Length = 370

 Score =  525 bits (1350), Expect = 1e-146
 Identities = 249/371 (67%), Positives = 302/371 (81%), Gaps = 3/371 (0%)
 Frame = +1

Query:   61 MPISRRVLTPVAAAPVILAVVCYLFWSTIIEPDRLEGTKHVLQVAKTIPLPGVGPESLEF 240
            MP+SR+V T  A    ++AV+       II P+ +EG+K+VL +A+TIPLP  GPESL++
Sbjct:    1 MPVSRKVQT-WAVVVAVMAVLVVFVGPYIIGPESIEGSKNVLTMARTIPLPVDGPESLDW 59

Query:  241 DSQGEGPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSRNQVVPSCGRPLGLSFHRKT 420
            D +GEGPYVGVTDGRILKW GE+ GWV FAY+SPHR+NCSR++V P+CGRPLGLSF +K+
Sbjct:   60 DPRGEGPYVGVTDGRILKWSGEDLGWVQFAYSSPHRENCSRHKVEPACGRPLGLSFEKKS 119

Query:  421 GDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHF 600
            GDLY CDGY G+MKVGP+GGLAE VVDEAEG+KVMFANQMDIDEEED  YFNDSSD YHF
Sbjct:  120 GDLYFCDGYLGIMKVGPKGGLAEKVVDEAEGQKVMFANQMDIDEEEDAIYFNDSSDTYHF 179

Query:  601 G-EVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILH 777
            G +VFY  + G+K GR IRYD K KEAKVIMD+LH PNGLALSKDGSFV++CE  T ++H
Sbjct:  180 GRDVFYAFLCGEKTGRAIRYDKKTKEAKVIMDRLHFPNGLALSKDGSFVLSCEVPTQLVH 239

Query:  778 RIWVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMH 957
            R W KGPKAGT ++FAK+PG  DNIRRT TGDFWVALH K   F+RL +IH WVGKFF+ 
Sbjct:  240 RYWAKGPKAGTRDIFAKLPGYADNIRRTETGDFWVALHSKKTPFSRLSMIHPWVGKFFIK 299

Query:  958 TLKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGS 1137
            TLK+E ++ L  GGKPH + +KLSG+TGEI+E+LEDSEG  MK++SE  ER DG+LW GS
Sbjct:  300 TLKMELLLFLFEGGKPHAVAVKLSGKTGEIMEILEDSEGKNMKFISEVQER-DGRLWFGS 358

Query: 1138 VYWPAVWVLDK 1170
            V+ P+VWVLD+
Sbjct:  359 VFLPSVWVLDR 369

>gi|145360869|ref|NP_181662.3| strictosidine synthase-like 1 [Arabidopsis
        thaliana]

          Length = 394

 Score =  473 bits (1217), Expect = 3e-131
 Identities = 218/310 (70%), Positives = 257/310 (82%), Gaps = 1/310 (0%)
 Frame = +1

Query:  241 DSQGEGPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSRNQVVPSCGRPLGLSFHRKT 420
            D +GEGPYVGVTDGRILKW GE+ GW++FAY+SPHR NCS ++V P+CGRPLGLSF +K+
Sbjct:   85 DPRGEGPYVGVTDGRILKWSGEDLGWIEFAYSSPHRKNCSSHKVEPACGRPLGLSFEKKS 144

Query:  421 GDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHF 600
            GDLY CDGY GVMKVGP+GGLAE VVDE EG+KVMFANQMDIDEEED  YFNDSSD YHF
Sbjct:  145 GDLYFCDGYLGVMKVGPKGGLAEKVVDEVEGQKVMFANQMDIDEEEDAIYFNDSSDTYHF 204

Query:  601 GEVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHR 780
            G+VFY  + G+K GR IRYD K KEAKVIMD+LH PNGLALS DGSFV++CE  T ++HR
Sbjct:  205 GDVFYAFLCGEKTGRAIRYDKKTKEAKVIMDRLHFPNGLALSIDGSFVLSCEVPTQLVHR 264

Query:  781 IWVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMHT 960
             W KGP AGT ++FAK+PG  DNIRRT TGDFWVALH K   F+RL +IH WVGKFF+ T
Sbjct:  265 YWAKGPNAGTRDIFAKLPGYADNIRRTETGDFWVALHSKKTPFSRLSMIHPWVGKFFIKT 324

Query:  961 LKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSV 1140
            LK+E +V L  GGKPH + +KLSG+TGEI+E+LEDSEG  MK++SE  ER DG+LW GSV
Sbjct:  325 LKMELLVFLFEGGKPHAVAVKLSGKTGEIMEILEDSEGKNMKFISEVQER-DGRLWFGSV 383

Query: 1141 YWPAVWVLDK 1170
            + P+VWVLD+
Sbjct:  384 FLPSVWVLDR 393

>gi|3894193|gb|AAC78542.1| putative strictosidine synthase [Arabidopsis
        thaliana]

          Length = 395

 Score =  469 bits (1205), Expect = 8e-130
 Identities = 218/311 (70%), Positives = 257/311 (82%), Gaps = 2/311 (0%)
 Frame = +1

Query:  241 DSQGEGPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSRNQVVPSCGRPLGLSFHRKT 420
            D +GEGPYVGVTDGRILKW GE+ GW++FAY+SPHR NCS ++V P+CGRPLGLSF +K+
Sbjct:   85 DPRGEGPYVGVTDGRILKWSGEDLGWIEFAYSSPHRKNCSSHKVEPACGRPLGLSFEKKS 144

Query:  421 GDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHF 600
            GDLY CDGY GVMKVGP+GGLAE VVDE EG+KVMFANQMDIDEEED  YFNDSSD YHF
Sbjct:  145 GDLYFCDGYLGVMKVGPKGGLAEKVVDEVEGQKVMFANQMDIDEEEDAIYFNDSSDTYHF 204

Query:  601 G-EVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILH 777
            G +VFY  + G+K GR IRYD K KEAKVIMD+LH PNGLALS DGSFV++CE  T ++H
Sbjct:  205 GRDVFYAFLCGEKTGRAIRYDKKTKEAKVIMDRLHFPNGLALSIDGSFVLSCEVPTQLVH 264

Query:  778 RIWVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMH 957
            R W KGP AGT ++FAK+PG  DNIRRT TGDFWVALH K   F+RL +IH WVGKFF+ 
Sbjct:  265 RYWAKGPNAGTRDIFAKLPGYADNIRRTETGDFWVALHSKKTPFSRLSMIHPWVGKFFIK 324

Query:  958 TLKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGS 1137
            TLK+E +V L  GGKPH + +KLSG+TGEI+E+LEDSEG  MK++SE  ER DG+LW GS
Sbjct:  325 TLKMELLVFLFEGGKPHAVAVKLSGKTGEIMEILEDSEGKNMKFISEVQER-DGRLWFGS 383

Query: 1138 VYWPAVWVLDK 1170
            V+ P+VWVLD+
Sbjct:  384 VFLPSVWVLDR 394

>gi|79315403|ref|NP_001030876.1| strictosidine synthase family protein
        [Arabidopsis thaliana]

          Length = 356

 Score =  416 bits (1069), Expect = 5e-114
 Identities = 191/261 (73%), Positives = 223/261 (85%)
 Frame = +1

Query:  388 RPLGLSFHRKTGDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVF 567
            RPLGL+F +KTGDLYICDGY G+MKVGPEGGLAEL+VDEAEGRKVMFANQ DIDEEEDVF
Sbjct:   95 RPLGLTFEKKTGDLYICDGYLGLMKVGPEGGLAELIVDEAEGRKVMFANQGDIDEEEDVF 154

Query:  568 YFNDSSDKYHFGEVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVI 747
            YFNDSSDKYHF +VF+V++SG++ GRVIRYD K KEAKVIMD L   NGLAL+KD SF+I
Sbjct:  155 YFNDSSDKYHFRDVFFVAVSGERSGRVIRYDKKTKEAKVIMDNLVCNNGLALNKDRSFLI 214

Query:  748 TCEGGTGILHRIWVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLI 927
            TCE GT ++HR W+KGPKAGT ++FAKVPG PDNIR T TGDFW+ LHCK NL  RL + 
Sbjct:  215 TCESGTSLVHRYWIKGPKAGTRDIFAKVPGYPDNIRLTSTGDFWIGLHCKKNLIGRLIVK 274

Query:  928 HSWVGKFFMHTLKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYE 1107
            + W+GK    T+KLE V+  +NG KPHG+ +K+SGETGE++E+LED EG TMKYVSEAYE
Sbjct:  275 YKWLGKLVEKTMKLEYVIAFINGFKPHGVAVKISGETGEVLELLEDKEGKTMKYVSEAYE 334

Query: 1108 REDGKLWIGSVYWPAVWVLDK 1170
            R+DGKLW GSVYWPAVWVLD+
Sbjct:  335 RDDGKLWFGSVYWPAVWVLDR 355

>gi|6911873|emb|CAB72173.1| putative protein [Arabidopsis thaliana]

          Length = 372

 Score =  338 bits (865), Expect = 2e-090
 Identities = 178/368 (48%), Positives = 236/368 (64%), Gaps = 7/368 (1%)
 Frame = +1

Query:   79 VLTPVAAAPVILAVVCYLFWSTIIEPDRLEGTKHVLQVAKTIPLPGV-GPESLEFDSQGE 255
            V   V AA + + V      S I  P  + G++ V   AK + L G  GPES+ FD  GE
Sbjct:    6 VFLTVIAAVLAILVKNSQTGSGIFAPPEISGSRDVFPSAKVVNLTGASGPESIAFDPAGE 65

Query:  256 GPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSR---NQVVPSCGRPLGLSFHRKTGD 426
            GPYVGV+DGRILKWRGE  GW DFA+TS +R  C+R    ++   CGRPLGL F +KTGD
Sbjct:   66 GPYVGVSDGRILKWRGEPLGWSDFAHTSSNRQECARPFAPELEHVCGRPLGLRFDKKTGD 125

Query:  427 LYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFGE 606
            LYI D YFG++ VGP GGLA+ +V EAEG+   F N +DIDE+EDV YF D+S ++   +
Sbjct:  126 LYIADAYFGLLVVGPAGGLAKPLVTEAEGQPFRFTNDLDIDEQEDVIYFTDTSARFQRRQ 185

Query:  607 VFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHRIW 786
                 ++ DK GR I+YD   K+A V++  L   NG+ALSKD SFV+  E  T  + R+W
Sbjct:  186 FLAAVLNVDKTGRFIKYDRSSKKATVLLQGLAFANGVALSKDRSFVLVVETTTCKILRLW 245

Query:  787 VKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMH-TL 963
            + GP AGT +VFA++PG PDNIRR   G+FWVALH K  LF +L L  +W     +   +
Sbjct:  246 LSGPNAGTHQVFAELPGFPDNIRRNSNGEFWVALHSKKGLFAKLSLTQTWFRDLVLRLPI 305

Query:  964 KLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSVY 1143
              + +  L  GG PH   +KLS E+G+++E+LED EG T++++SE  E +DGKLWIGSV 
Sbjct:  306 SPQRLHSLFTGGIPHATAIKLS-ESGKVLEVLEDKEGKTLRFISEV-EEKDGKLWIGSVL 363

Query: 1144 WPAVWVLD 1167
             P + V D
Sbjct:  364 VPFLGVYD 371

>gi|30694556|ref|NP_191262.2| strictosidine synthase family protein [Arabidopsis
        thaliana]

          Length = 374

 Score =  338 bits (865), Expect = 2e-090
 Identities = 178/368 (48%), Positives = 236/368 (64%), Gaps = 7/368 (1%)
 Frame = +1

Query:   79 VLTPVAAAPVILAVVCYLFWSTIIEPDRLEGTKHVLQVAKTIPLPGV-GPESLEFDSQGE 255
            V   V AA + + V      S I  P  + G++ V   AK + L G  GPES+ FD  GE
Sbjct:    8 VFLTVIAAVLAILVKNSQTGSGIFAPPEISGSRDVFPSAKVVNLTGASGPESIAFDPAGE 67

Query:  256 GPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSR---NQVVPSCGRPLGLSFHRKTGD 426
            GPYVGV+DGRILKWRGE  GW DFA+TS +R  C+R    ++   CGRPLGL F +KTGD
Sbjct:   68 GPYVGVSDGRILKWRGEPLGWSDFAHTSSNRQECARPFAPELEHVCGRPLGLRFDKKTGD 127

Query:  427 LYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFGE 606
            LYI D YFG++ VGP GGLA+ +V EAEG+   F N +DIDE+EDV YF D+S ++   +
Sbjct:  128 LYIADAYFGLLVVGPAGGLAKPLVTEAEGQPFRFTNDLDIDEQEDVIYFTDTSARFQRRQ 187

Query:  607 VFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHRIW 786
                 ++ DK GR I+YD   K+A V++  L   NG+ALSKD SFV+  E  T  + R+W
Sbjct:  188 FLAAVLNVDKTGRFIKYDRSSKKATVLLQGLAFANGVALSKDRSFVLVVETTTCKILRLW 247

Query:  787 VKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMH-TL 963
            + GP AGT +VFA++PG PDNIRR   G+FWVALH K  LF +L L  +W     +   +
Sbjct:  248 LSGPNAGTHQVFAELPGFPDNIRRNSNGEFWVALHSKKGLFAKLSLTQTWFRDLVLRLPI 307

Query:  964 KLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSVY 1143
              + +  L  GG PH   +KLS E+G+++E+LED EG T++++SE  E +DGKLWIGSV 
Sbjct:  308 SPQRLHSLFTGGIPHATAIKLS-ESGKVLEVLEDKEGKTLRFISEV-EEKDGKLWIGSVL 365

Query: 1144 WPAVWVLD 1167
             P + V D
Sbjct:  366 VPFLGVYD 373

>gi|297820490|ref|XP_002878128.1| strictosidine synthase family protein
        [Arabidopsis lyrata subsp. lyrata]

          Length = 374

 Score =  337 bits (863), Expect = 3e-090
 Identities = 172/348 (49%), Positives = 228/348 (65%), Gaps = 7/348 (2%)
 Frame = +1

Query:  139 STIIEPDRLEGTKHVLQVAKTIPLPGV-GPESLEFDSQGEGPYVGVTDGRILKWRGEEHG 315
            S I  P  + G++ V   AK + L G  GPES+ FD  GEGPYVGV+DGR+LKWR E  G
Sbjct:   28 SGIFAPPEISGSRDVFPSAKVVTLTGASGPESIAFDPAGEGPYVGVSDGRVLKWRSESLG 87

Query:  316 WVDFAYTSPHRDNCSR---NQVVPSCGRPLGLSFHRKTGDLYICDGYFGVMKVGPEGGLA 486
            W DFAYTS +R  C R    ++   CGRPLGL F +KTGDLYI D YFG++ VGP GGLA
Sbjct:   88 WSDFAYTSSNRQECVRPFAPELEHVCGRPLGLRFDKKTGDLYIADAYFGLLVVGPAGGLA 147

Query:  487 ELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFGEVFYVSISGDKVGRVIRYDMK 666
            + +V EAEG+   F N +DIDE+EDV YF D+S ++   +     ++ DK GR I+YD  
Sbjct:  148 KPLVTEAEGQPFRFTNDLDIDEQEDVIYFTDTSARFQRRQFLAAVLNVDKTGRFIKYDRS 207

Query:  667 KKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHRIWVKGPKAGTTEVFAKVPGPPD 846
             K+A V++  L   NG+ALSKD SFV+  E  T  + R+W+ GP AGT EVFA++PG PD
Sbjct:  208 SKKATVLLQGLAFANGVALSKDRSFVLVVETTTCKILRLWLSGPNAGTHEVFAELPGFPD 267

Query:  847 NIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMH-TLKLETVVHLMNGGKPHGIILK 1023
            NIRR   G+FWVALH K  LF +L L  +W     +   +  + +  L  GG+PH   +K
Sbjct:  268 NIRRNSNGEFWVALHSKKGLFAKLSLSQTWFRDLVLRLPISPQRLHSLFTGGRPHATAIK 327

Query: 1024 LSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSVYWPAVWVLD 1167
            LS E+G+++E+LED+EG  ++++SE  E +DGKLWIGSV  P + V D
Sbjct:  328 LS-ESGKVLEVLEDNEGKRLRFISEV-EEKDGKLWIGSVLMPFLGVYD 373

>gi|110743953|dbj|BAE99809.1| hypothetical protein [Arabidopsis thaliana]

          Length = 374

 Score =  335 bits (858), Expect = 1e-089
 Identities = 177/368 (48%), Positives = 235/368 (63%), Gaps = 7/368 (1%)
 Frame = +1

Query:   79 VLTPVAAAPVILAVVCYLFWSTIIEPDRLEGTKHVLQVAKTIPLPGV-GPESLEFDSQGE 255
            V   V AA + + V      S I  P  + G++ V   AK + L G  GPES+ FD  GE
Sbjct:    8 VFLTVIAAVLAILVKNSQTGSGIFAPPEISGSRDVFPSAKVVNLTGASGPESIAFDPAGE 67

Query:  256 GPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSR---NQVVPSCGRPLGLSFHRKTGD 426
            GPYVGV+DGRILKWRGE  GW DFA+TS +R  C+R    ++   CGRPLGL F +KTGD
Sbjct:   68 GPYVGVSDGRILKWRGEPLGWSDFAHTSSNRQECARPFAPELEHVCGRPLGLRFDKKTGD 127

Query:  427 LYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFGE 606
            LYI D YFG++ VGP GGLA+ +V EAEG+   F N +DIDE+EDV YF D+S ++   +
Sbjct:  128 LYIADAYFGLLVVGPAGGLAKPLVTEAEGQPFRFTNDLDIDEQEDVIYFTDTSARFQRRQ 187

Query:  607 VFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHRIW 786
                 ++ DK GR I+YD   K+A V++  L   NG+ALSKD SFV+  E  T  + R+W
Sbjct:  188 FLAAVLNVDKTGRFIKYDRSSKKATVLLQGLAFANGVALSKDRSFVLVVETTTCKILRLW 247

Query:  787 VKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMH-TL 963
            + GP AGT +VFA++PG PDNIRR   G+FWVALH K  LF +L L  +W     +   +
Sbjct:  248 LSGPNAGTHQVFAELPGFPDNIRRNSNGEFWVALHSKKGLFAKLSLTQTWFRDLVLRLPI 307

Query:  964 KLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSVY 1143
              + +  L  GG PH   +KLS E+G+++E+L D EG T++++SE  E +DGKLWIGSV 
Sbjct:  308 SPQRLHSLFTGGIPHATAIKLS-ESGKVLEVLGDKEGKTLRFISEV-EEKDGKLWIGSVL 365

Query: 1144 WPAVWVLD 1167
             P + V D
Sbjct:  366 VPFLGVYD 373

>gi|225441250|ref|XP_002273764.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 370

 Score =  323 bits (827), Expect = 5e-086
 Identities = 169/372 (45%), Positives = 231/372 (62%), Gaps = 12/372 (3%)
 Frame = +1

Query:   70 SRRVLTPV--AAAPVILAVVCYLFWSTIIEPDRLEGTKHVLQVAKTIPLPGV-GPESLEF 240
            ++ +LT +  AA  +ILAV      + + +P  + GT  +L  ++ I + G  GPES+ F
Sbjct:    3 TKLILTAITLAAISIILAVNS----NHLFKPPSIPGTHDLLHGSEVIQVTGAFGPESIAF 58

Query:  241 DSQGEGPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSR---NQVVPSCGRPLGLSFH 411
            D +GEGPY GV DGR+LKW G+  GW DFA T+  R  C R    ++   CGRPLGL F 
Sbjct:   59 DPKGEGPYTGVADGRVLKWEGDGRGWTDFAVTTSERKECVRPFAPEMEHICGRPLGLRFD 118

Query:  412 RKTGDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDK 591
            +KTGDLYI D YFG+  V P GGLA  +V E EGR+++F N MDIDE EDV YF D+S  
Sbjct:  119 KKTGDLYIADAYFGLQVVEPNGGLATPLVTEVEGRRLLFTNDMDIDEVEDVIYFTDTSTD 178

Query:  592 YHFGEVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGI 771
            +H  +     +SGD  GR+++YD   KE  V++  L   NG+A+SKD SFV+  E  TG 
Sbjct:  179 FHRRQFMAALLSGDNTGRLMKYDKSSKEVTVLLRGLAFANGVAMSKDRSFVLVAETTTGK 238

Query:  772 LHRIWVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFF 951
            + R W+KGP AG ++VFA+VPG PDN+RR   G+FWVALH K          +SWVGK  
Sbjct:  239 IIRYWLKGPNAGKSDVFAEVPGYPDNVRRNSKGEFWVALHAKKGPHANWITSNSWVGKTL 298

Query:  952 MHTLKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWI 1131
            +        +H +   + H   +KLS E G+++E+LED EG +M+++SE  E  +GKLW+
Sbjct:  299 LKLPLTFKQLHKLIVVEAHATAIKLS-EEGQVLEVLEDCEGKSMRFISEV-EEHNGKLWL 356

Query: 1132 GSVYWPAVWVLD 1167
            GSV  P + V D
Sbjct:  357 GSVMMPFIGVYD 368

>gi|156763850|emb|CAO99127.1| strictosidine synthase-like protein [Nicotiana
        tabacum]

          Length = 380

 Score =  318 bits (814), Expect = 2e-084
 Identities = 157/345 (45%), Positives = 219/345 (63%), Gaps = 5/345 (1%)
 Frame = +1

Query:  145 IIEPDRLEGTKHVLQVAKTIPLPGV-GPESLEFDSQGEGPYVGVTDGRILKWRGEEHGWV 321
            + +P  + G++ VL  A+ I L G  G ES+ FD  GEGPY GV DGRILKW+     WV
Sbjct:   35 VFKPAPIPGSQDVLSKAELIQLKGAFGAESVAFDPNGEGPYTGVADGRILKWQPHSQTWV 94

Query:  322 DFAYTSPHRDNCSR---NQVVPSCGRPLGLSFHRKTGDLYICDGYFGVMKVGPEGGLAEL 492
            DFA TS  R NCSR    ++   CGRPLGL F  KTGDLYI D YFG+  VGP GGLA  
Sbjct:   95 DFAVTSSQRKNCSRPSAPEMEHVCGRPLGLRFDHKTGDLYIADAYFGLHVVGPTGGLATP 154

Query:  493 VVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFGEVFYVSISGDKVGRVIRYDMKKK 672
            +V + EG+ ++F N +DID+++D+ YF D+S  Y   +    + SGDK GR+++Y+   K
Sbjct:  155 LVQDFEGQPLLFTNDLDIDDDDDIIYFTDTSTIYQRRQFVAATASGDKTGRLMKYNKSTK 214

Query:  673 EAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHRIWVKGPKAGTTEVFAKVPGPPDNI 852
            E  V +  L   NG+ALSKD SF++  E     + R W+KGP  G  ++FA++PG PDN+
Sbjct:  215 EVTVALGGLAFANGVALSKDRSFLLVAETSACRILRYWLKGPNVGNHDIFAELPGFPDNV 274

Query:  853 RRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMHTLKLETVVHLMNGGKPHGIILKLSG 1032
            R    G+FWVALH K++   RL + +SW+GK  +     + + +L+ GG+PH   +KLS 
Sbjct:  275 RINSRGEFWVALHAKASPLARLIISNSWLGKTLLREFNFQQLHNLLVGGQPHATAIKLS- 333

Query: 1033 ETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSVYWPAVWVLD 1167
            E G ++E+LED EG  ++++SE +E E GKLWI SV   ++ V D
Sbjct:  334 EDGRVLEVLEDVEGKILRFISEVHEEESGKLWISSVIMSSLGVYD 378

>gi|224139742|ref|XP_002323255.1| predicted protein [Populus trichocarpa]

          Length = 375

 Score =  305 bits (781), Expect = 1e-080
 Identities = 163/363 (44%), Positives = 225/363 (61%), Gaps = 12/363 (3%)
 Frame = +1

Query:   91 VAAAPVILAVVCYLFWS--TIIEPDRLEGTKHVLQVAKTIPLPG-VGPESLEFDSQGEGP 261
            V     ++A+V  L  S   ++ P  +  +   L  AK + + G VGPESL FD  GEGP
Sbjct:    8 VVTTTTLVAIVSILLTSPTKLLGPPTIPTSNDHLHSAKILHVSGAVGPESLVFDPNGEGP 67

Query:  262 YVGVTDGRILKWRGEEHG---WVDFAYTSPHRDNCSR---NQVVPSCGRPLGLSFHRKTG 423
            Y GV DGR+LKW   + G   W DFA TS +R+ C R    ++   CGRPLGL F +KTG
Sbjct:   68 YTGVADGRVLKWIAGDDGSGSWTDFATTSSNRNECVRPFAPEMEHVCGRPLGLRFDKKTG 127

Query:  424 DLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFG 603
            +LYI D Y G+  VGP GGLA  VV E EG+ + F N +DIDE+EDV YF D+S  +   
Sbjct:  128 NLYIADAYLGLQVVGPTGGLATPVVTELEGQPMRFTNDLDIDEQEDVIYFTDTSMVFQRR 187

Query:  604 EVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHRI 783
            +     ++ DK GR+++YD   KE  V+   L   NG+ALSKD +F++  E  T  + R 
Sbjct:  188 QFILSLLTKDKTGRLLKYDKSSKEVTVLARGLAFANGVALSKDSTFLLVAETTTCRILRF 247

Query:  784 WVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFM-HT 960
            W+ GP AG ++VF ++PG PDNIRR   G+FWVALH K  LF ++ L +SW+GK  +   
Sbjct:  248 WLHGPNAGKSDVFTELPGFPDNIRRNSKGEFWVALHSKKGLFAKVVLSNSWIGKTLLKFP 307

Query:  961 LKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSV 1140
            L  + +  L+ GGK H   +KLS E G+++++LED +G T++++SE  E +DGKLWIGSV
Sbjct:  308 LSFKQLHSLLVGGKAHATAIKLS-EEGKVLDVLEDCDGKTLRFISEV-EEKDGKLWIGSV 365

Query: 1141 YWP 1149
              P
Sbjct:  366 LMP 368

>gi|297820488|ref|XP_002878127.1| strictosidine synthase family protein
        [Arabidopsis lyrata subsp. lyrata]

          Length = 343

 Score =  290 bits (741), Expect = 5e-076
 Identities = 130/195 (66%), Positives = 159/195 (81%)
 Frame = +1

Query:  583 SDKYHFGEVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGG 762
            SDKYHF +VF+V++SG++ GRVIRYD K KEAKV+MD L   NGLAL+KD SF+ITCE G
Sbjct:  147 SDKYHFRDVFFVAVSGERSGRVIRYDKKTKEAKVVMDNLVCNNGLALNKDRSFLITCESG 206

Query:  763 TGILHRIWVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVG 942
            T ++HR W+KGPKAGT ++FAKVPG PDNIR T TGDFW+ +HCK NL  RL + + W+G
Sbjct:  207 TSLVHRYWIKGPKAGTRDIFAKVPGYPDNIRLTSTGDFWIGIHCKKNLLGRLIVRYKWLG 266

Query:  943 KFFMHTLKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGK 1122
            K    T+KLE V+  +NG KP G+ +K+SGETGE++E+LED EG TMKYVSEAYER+DGK
Sbjct:  267 KLVEKTIKLEYVIAFINGFKPQGVAVKISGETGEVLEVLEDKEGKTMKYVSEAYERDDGK 326

Query: 1123 LWIGSVYWPAVWVLD 1167
            LW GSVYWPAVWVLD
Sbjct:  327 LWFGSVYWPAVWVLD 341

>gi|224139738|ref|XP_002323253.1| predicted protein [Populus trichocarpa]

          Length = 349

 Score =  287 bits (733), Expect = 4e-075
 Identities = 145/325 (44%), Positives = 202/325 (62%), Gaps = 14/325 (4%)
 Frame = +1

Query:  202 IPLPG-VGPESLEFDSQGEGPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNC----SRN 366
            IP+ G +GPES  FDS GEGPY  ++DGRI+KW+G++  W+DFA TSP+RD C      +
Sbjct:   23 IPIVGAIGPESFAFDSLGEGPYTSLSDGRIIKWQGDKKRWIDFAVTSPNRDGCGGPHDHH 82

Query:  367 QVVPSCGRPLGLSFHRKTGDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDI 546
            Q+   CGRPLG  F    GDLYI D Y G+++VGPEGGLA  +   A+G    F N +DI
Sbjct:   83 QMEHVCGRPLGSCFDETHGDLYIADAYMGLLRVGPEGGLATKIATHAQGIPFRFTNSLDI 142

Query:  547 DEEEDVFYFNDSSDKYHFGEVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALS 726
            D+     YF DSS +Y   +   V +SGDK GR+++YD   K+  V++  L  PNG+ALS
Sbjct:  143 DQSSGAIYFTDSSTQYQRRDYLSVVLSGDKSGRLMKYDTASKQVTVLLKNLTFPNGVALS 202

Query:  727 KDGSFVITCEGGTGILHRIWVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNL 906
             DGSFV+  E  +  + R W+K  KAG  EVFA++ G PDNI+R+P G +WV ++ K   
Sbjct:  203 TDGSFVLLAETTSCRILRYWIKTSKAGALEVFAQLQGFPDNIKRSPRGGYWVGINSKREK 262

Query:  907 FTRLFLIHSWVGKFF----MHTLKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEG 1074
             + L   + W+GK      +   K +T +    GG   G+ ++LS E G+I+E+ ED +G
Sbjct:  263 LSELLFSYPWIGKVLLKLPLDITKFQTALAKYRGG---GLAVRLS-ENGDIVEVFEDRDG 318

Query: 1075 TTMKYVSEAYEREDGKLWIGSVYWP 1149
              +K +SE  E+ DGKLWIGS+  P
Sbjct:  319 NRLKSISEVMEK-DGKLWIGSIDLP 342

>gi|225441248|ref|XP_002267323.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 378

 Score =  276 bits (705), Expect = 7e-072
 Identities = 153/373 (41%), Positives = 207/373 (55%), Gaps = 10/373 (2%)
 Frame = +1

Query:   61 MPISRRVLTPVAAAPVILAVVCYLFWSTIIEPDRLEGTKHVLQVAKTIPLP---GVGPES 231
            M +S ++L   AAA   L         T +     +  K   Q    IP+P    +GPES
Sbjct:    1 MAMSSKLLLAAAAAAAFLITALIAGKRTSLSSPEFDSEKFSNQKDAVIPIPTPGAIGPES 60

Query:  232 LEFDSQGEGPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNC--SRNQVVPS--CGRPLG 399
            L FDS G GPY GV+DGRI+KW   E  WVDFA TS  R+ C  SR+ V     CGRPLG
Sbjct:   61 LAFDSVGGGPYTGVSDGRIIKWEENEERWVDFATTSSKREGCRGSRDHVPLEHICGRPLG 120

Query:  400 LSFHRKTGDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFND 579
            LSF   TG+LYI D Y G++ VGP GGLA  V  EA+G    F+N +DI +     YF+D
Sbjct:  121 LSFSELTGELYIADAYMGLLVVGPNGGLASTVASEAQGTPFGFSNGVDIHQTNGAVYFSD 180

Query:  580 SSDKYHFGEVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEG 759
            SS +Y         ISGD  GR+++Y+ + K+  V++  L  PNG+ALSK+G F++  E 
Sbjct:  181 SSSRYQRRNFVAAIISGDNTGRLMKYEPESKQVTVLLRSLGFPNGVALSKNGDFILLSET 240

Query:  760 GTGILHRIWVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWV 939
                + R W++  KAGT EVF  +PG PDNI+R   G+FWV +H +       FL + W+
Sbjct:  241 SRCRILRFWLQTSKAGTVEVFTLLPGFPDNIKRNSKGEFWVGMHSRKGKLVEWFLSYPWI 300

Query:  940 GKFFMHTLKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGT-TMKYVSEAYERED 1116
            G+  +        +   +  +  G  ++LS E GE++E+ E   G   +  +SE YER D
Sbjct:  301 GRTLLKLPFPHGFLSFFSKWRKTGFAVRLS-EEGEVLEIFEPKNGNGWISSISEVYER-D 358

Query: 1117 GKLWIGSVYWPAV 1155
            G LWIGSV  P V
Sbjct:  359 GSLWIGSVTTPCV 371

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,584,745,891,651
Number of Sequences: 15229318
Number of Extensions: 2584745891651
Number of Successful Extensions: 639272726
Number of sequences better than 0.0: 0