BLASTX 7.6.2
Query= UN21852 /QuerySize=1414
(1413 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|15230182|ref|NP_191260.1| strictosidine synthase family prote... 675 7e-192
gi|13877837|gb|AAK43996.1|AF370181_1 unknown protein [Arabidopsi... 673 2e-191
gi|297820484|ref|XP_002878125.1| strictosidine synthase family p... 663 2e-188
gi|6759491|emb|CAB69786.1| hypothetical protein [Arabidopsis tha... 627 2e-177
gi|15230200|ref|NP_191261.1| strictosidine synthase family prote... 555 1e-155
gi|312281991|dbj|BAJ33861.1| unnamed protein product [Thellungie... 543 4e-152
gi|297827769|ref|XP_002881767.1| hypothetical protein ARALYDRAFT... 525 1e-146
gi|145360869|ref|NP_181662.3| strictosidine synthase-like 1 [Ara... 473 3e-131
gi|3894193|gb|AAC78542.1| putative strictosidine synthase [Arabi... 469 8e-130
gi|79315403|ref|NP_001030876.1| strictosidine synthase family pr... 416 5e-114
gi|6911873|emb|CAB72173.1| putative protein [Arabidopsis thaliana] 338 2e-090
gi|30694556|ref|NP_191262.2| strictosidine synthase family prote... 338 2e-090
gi|297820490|ref|XP_002878128.1| strictosidine synthase family p... 337 3e-090
gi|110743953|dbj|BAE99809.1| hypothetical protein [Arabidopsis t... 335 1e-089
gi|225441250|ref|XP_002273764.1| PREDICTED: hypothetical protein... 323 5e-086
gi|156763850|emb|CAO99127.1| strictosidine synthase-like protein... 318 2e-084
gi|224139742|ref|XP_002323255.1| predicted protein [Populus tric... 305 1e-080
gi|297820488|ref|XP_002878127.1| strictosidine synthase family p... 290 5e-076
gi|224139738|ref|XP_002323253.1| predicted protein [Populus tric... 287 4e-075
gi|225441248|ref|XP_002267323.1| PREDICTED: hypothetical protein... 276 7e-072
>gi|15230182|ref|NP_191260.1| strictosidine synthase family protein [Arabidopsis
thaliana]
Length = 376
Score = 675 bits (1740), Expect = 7e-192
Identities = 317/374 (84%), Positives = 344/374 (91%)
Frame = +1
Query: 61 MPISRRVLTPVAAAPVILAVVCYLFWSTIIEPDRLEGTKHVLQVAKTIPLPGVGPESLEF 240
MPISRRVLTP+ AAPVILAV+C+ FWS+II PD L+GTKHVLQ AKTIPLP GPESLEF
Sbjct: 1 MPISRRVLTPITAAPVILAVLCFFFWSSIIGPDNLKGTKHVLQDAKTIPLPVDGPESLEF 60
Query: 241 DSQGEGPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSRNQVVPSCGRPLGLSFHRKT 420
D QGEGPYVGVTDGRILKWRGEE GWVDFAYTSPHRDNCS ++VVPSCGRPLGLSF RKT
Sbjct: 61 DPQGEGPYVGVTDGRILKWRGEELGWVDFAYTSPHRDNCSSHEVVPSCGRPLGLSFERKT 120
Query: 421 GDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHF 600
GDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQ DIDEEED+FYFNDSSD YHF
Sbjct: 121 GDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQGDIDEEEDIFYFNDSSDTYHF 180
Query: 601 GEVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHR 780
+VFYVS+SG KVGRVIRYDMKKKEAKVIMDKL LPNGLALSK+GSFV+TCE T I HR
Sbjct: 181 RDVFYVSLSGTKVGRVIRYDMKKKEAKVIMDKLRLPNGLALSKNGSFVVTCESSTNICHR 240
Query: 781 IWVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMHT 960
IWVKGPK+GT EVFA +PG PDNIRRTPTGDFWVALHCK NLFTR LIH+WVG+FFM+T
Sbjct: 241 IWVKGPKSGTNEVFATLPGSPDNIRRTPTGDFWVALHCKKNLFTRAVLIHTWVGRFFMNT 300
Query: 961 LKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSV 1140
+K+ETV+H MNGGKPHGI++KLSGETGEI+E+LEDSEG T+KYVSEAYE +DGKLWIGSV
Sbjct: 301 MKMETVIHFMNGGKPHGIVVKLSGETGEILEILEDSEGKTVKYVSEAYETKDGKLWIGSV 360
Query: 1141 YWPAVWVLDKSVYE 1182
YWPAVWVLD SVY+
Sbjct: 361 YWPAVWVLDTSVYD 374
>gi|13877837|gb|AAK43996.1|AF370181_1 unknown protein [Arabidopsis thaliana]
Length = 376
Score = 673 bits (1736), Expect = 2e-191
Identities = 316/374 (84%), Positives = 343/374 (91%)
Frame = +1
Query: 61 MPISRRVLTPVAAAPVILAVVCYLFWSTIIEPDRLEGTKHVLQVAKTIPLPGVGPESLEF 240
MPISRRVLTP+ AAPVILAV+C+ FWS+II PD L+GTKHVLQ AKTIPLP GPESLEF
Sbjct: 1 MPISRRVLTPITAAPVILAVLCFFFWSSIIGPDNLKGTKHVLQDAKTIPLPVDGPESLEF 60
Query: 241 DSQGEGPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSRNQVVPSCGRPLGLSFHRKT 420
D QGEGPYVGVTDGRILKWRGEE GWVDFAYTSPHRDNCS ++VVPSCGRPLGLSF RKT
Sbjct: 61 DPQGEGPYVGVTDGRILKWRGEELGWVDFAYTSPHRDNCSSHEVVPSCGRPLGLSFERKT 120
Query: 421 GDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHF 600
GDLYICDGYFGVMKVGPEGGL ELVVDEAEGRKVMFANQ DIDEEED+FYFNDSSD YHF
Sbjct: 121 GDLYICDGYFGVMKVGPEGGLGELVVDEAEGRKVMFANQGDIDEEEDIFYFNDSSDTYHF 180
Query: 601 GEVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHR 780
+VFYVS+SG KVGRVIRYDMKKKEAKVIMDKL LPNGLALSK+GSFV+TCE T I HR
Sbjct: 181 RDVFYVSLSGTKVGRVIRYDMKKKEAKVIMDKLRLPNGLALSKNGSFVVTCESSTNICHR 240
Query: 781 IWVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMHT 960
IWVKGPK+GT EVFA +PG PDNIRRTPTGDFWVALHCK NLFTR LIH+WVG+FFM+T
Sbjct: 241 IWVKGPKSGTNEVFATLPGSPDNIRRTPTGDFWVALHCKKNLFTRAVLIHTWVGRFFMNT 300
Query: 961 LKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSV 1140
+K+ETV+H MNGGKPHGI++KLSGETGEI+E+LEDSEG T+KYVSEAYE +DGKLWIGSV
Sbjct: 301 MKMETVIHFMNGGKPHGIVVKLSGETGEILEILEDSEGKTVKYVSEAYETKDGKLWIGSV 360
Query: 1141 YWPAVWVLDKSVYE 1182
YWPAVWVLD SVY+
Sbjct: 361 YWPAVWVLDTSVYD 374
>gi|297820484|ref|XP_002878125.1| strictosidine synthase family protein
[Arabidopsis lyrata subsp. lyrata]
Length = 371
Score = 663 bits (1710), Expect = 2e-188
Identities = 311/369 (84%), Positives = 340/369 (92%)
Frame = +1
Query: 61 MPISRRVLTPVAAAPVILAVVCYLFWSTIIEPDRLEGTKHVLQVAKTIPLPGVGPESLEF 240
MPISRRVLTPV+AAPVILAV+C+ FWS+II PD ++GTKHVLQ AKTIPLP GPESLEF
Sbjct: 1 MPISRRVLTPVSAAPVILAVLCFFFWSSIIGPDNIKGTKHVLQDAKTIPLPADGPESLEF 60
Query: 241 DSQGEGPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSRNQVVPSCGRPLGLSFHRKT 420
D QGEGPYVGVTDGRILKWRGEE GWVDFAYTSPHRDNCSR++VVPSCGRPLGL+F +KT
Sbjct: 61 DPQGEGPYVGVTDGRILKWRGEELGWVDFAYTSPHRDNCSRHEVVPSCGRPLGLTFEKKT 120
Query: 421 GDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHF 600
GDLYICDGYFG+MKVGP+GGLAELVVDEAEGRKVMFANQ DIDEEED+FYFNDSSD YHF
Sbjct: 121 GDLYICDGYFGLMKVGPQGGLAELVVDEAEGRKVMFANQGDIDEEEDIFYFNDSSDTYHF 180
Query: 601 GEVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHR 780
EVFYVS+SG KVGRVIRYDMKKKEAKVIMDKL LPNGLALSK+GSFV+TCE T I HR
Sbjct: 181 REVFYVSLSGTKVGRVIRYDMKKKEAKVIMDKLRLPNGLALSKNGSFVVTCESSTNICHR 240
Query: 781 IWVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMHT 960
IWVKGPK+GT EVFA +PG PDNIRRTPTGDFWVALHCK NLFTR+ LIHS VG+FFM+T
Sbjct: 241 IWVKGPKSGTNEVFATLPGSPDNIRRTPTGDFWVALHCKKNLFTRVALIHSLVGRFFMNT 300
Query: 961 LKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSV 1140
+K+ETV+H MNGGKPHGI++KLSGETGEI+E+LEDSEG T+KY SEAYE EDGKLWIGSV
Sbjct: 301 MKMETVIHFMNGGKPHGIVVKLSGETGEILEILEDSEGKTVKYASEAYETEDGKLWIGSV 360
Query: 1141 YWPAVWVLD 1167
YWPAVWV D
Sbjct: 361 YWPAVWVYD 369
>gi|6759491|emb|CAB69786.1| hypothetical protein [Arabidopsis thaliana]
Length = 352
Score = 627 bits (1616), Expect = 2e-177
Identities = 295/348 (84%), Positives = 318/348 (91%)
Frame = +1
Query: 139 STIIEPDRLEGTKHVLQVAKTIPLPGVGPESLEFDSQGEGPYVGVTDGRILKWRGEEHGW 318
S II PD L+GTKHVLQ AKTIPLP GPESLEFD QGEGPYVGVTDGRILKWRGEE GW
Sbjct: 3 SAIIGPDNLKGTKHVLQDAKTIPLPVDGPESLEFDPQGEGPYVGVTDGRILKWRGEELGW 62
Query: 319 VDFAYTSPHRDNCSRNQVVPSCGRPLGLSFHRKTGDLYICDGYFGVMKVGPEGGLAELVV 498
VDFAYTSPHRDNCS ++VVPSCGRPLGLSF RKTGDLYICDGYFGVMKVGPEGGLAELVV
Sbjct: 63 VDFAYTSPHRDNCSSHEVVPSCGRPLGLSFERKTGDLYICDGYFGVMKVGPEGGLAELVV 122
Query: 499 DEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFGEVFYVSISGDKVGRVIRYDMKKKEA 678
DEAEGRKVMFANQ DIDEEED+FYFNDSSD YHF +VFYVS+SG KVGRVIRYDMKKKEA
Sbjct: 123 DEAEGRKVMFANQGDIDEEEDIFYFNDSSDTYHFRDVFYVSLSGTKVGRVIRYDMKKKEA 182
Query: 679 KVIMDKLHLPNGLALSKDGSFVITCEGGTGILHRIWVKGPKAGTTEVFAKVPGPPDNIRR 858
KVIMDKL LPNGLALSK+GSFV+TCE T HRIWVKGPK+GT EVFA +PG PDNIRR
Sbjct: 183 KVIMDKLRLPNGLALSKNGSFVVTCESSTNTCHRIWVKGPKSGTNEVFATLPGSPDNIRR 242
Query: 859 TPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMHTLKLETVVHLMNGGKPHGIILKLSGET 1038
TPTGDFWVALHCK NLFTR LIH+WVG+FFM+T+K+ETV+H MNGGKPHGI++KLSGET
Sbjct: 243 TPTGDFWVALHCKKNLFTRAVLIHTWVGRFFMNTMKMETVIHFMNGGKPHGIVVKLSGET 302
Query: 1039 GEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSVYWPAVWVLDKSVYE 1182
GEI+E+LEDSEG T+KYVSEAYE +DGKLWIGSVYWPAVWVLD SVY+
Sbjct: 303 GEILEILEDSEGKTVKYVSEAYETKDGKLWIGSVYWPAVWVLDTSVYD 350
>gi|15230200|ref|NP_191261.1| strictosidine synthase family protein [Arabidopsis
thaliana]
Length = 370
Score = 555 bits (1428), Expect = 1e-155
Identities = 255/369 (69%), Positives = 307/369 (83%), Gaps = 1/369 (0%)
Frame = +1
Query: 64 PISRRVLTPVAAAPVILAVVCYLFWSTIIEPDRLEGTKHVLQVAKTIPLPGVGPESLEFD 243
PI++++ T A P + AV+ + + T+I P+ LEG K+VL +AKTIP+P GPES+EFD
Sbjct: 2 PINQKIPT-WFAVPAVFAVLSVISYQTLIVPENLEGAKNVLTMAKTIPIPVAGPESIEFD 60
Query: 244 SQGEGPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSRNQVVPSCGRPLGLSFHRKTG 423
+GEGPY V DGRILKWRG++ GWVDFAYTSPHR NCS+ +VVP+CGRPLGL+F +KTG
Sbjct: 61 PKGEGPYAAVVDGRILKWRGDDLGWVDFAYTSPHRGNCSKTEVVPTCGRPLGLTFEKKTG 120
Query: 424 DLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFG 603
DLYICDGY G+MKVGPEGGLAEL+VDEAEGRKVMFANQ DIDEEEDVFYFNDSSDKYHF
Sbjct: 121 DLYICDGYLGLMKVGPEGGLAELIVDEAEGRKVMFANQGDIDEEEDVFYFNDSSDKYHFR 180
Query: 604 EVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHRI 783
+VF+V++SG++ GRVIRYD K KEAKVIMD L NGLAL+KD SF+ITCE GT ++HR
Sbjct: 181 DVFFVAVSGERSGRVIRYDKKTKEAKVIMDNLVCNNGLALNKDRSFLITCESGTSLVHRY 240
Query: 784 WVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMHTL 963
W+KGPKAGT ++FAKVPG PDNIR T TGDFW+ LHCK NL RL + + W+GK T+
Sbjct: 241 WIKGPKAGTRDIFAKVPGYPDNIRLTSTGDFWIGLHCKKNLIGRLIVKYKWLGKLVEKTM 300
Query: 964 KLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSVY 1143
KLE V+ +NG KPHG+ +K+SGETGE++E+LED EG TMKYVSEAYER+DGKLW GSVY
Sbjct: 301 KLEYVIAFINGFKPHGVAVKISGETGEVLELLEDKEGKTMKYVSEAYERDDGKLWFGSVY 360
Query: 1144 WPAVWVLDK 1170
WPAVWVLD+
Sbjct: 361 WPAVWVLDR 369
>gi|312281991|dbj|BAJ33861.1| unnamed protein product [Thellungiella halophila]
Length = 370
Score = 543 bits (1397), Expect = 4e-152
Identities = 254/369 (68%), Positives = 304/369 (82%), Gaps = 1/369 (0%)
Frame = +1
Query: 64 PISRRVLTPVAAAPVILAVVCYLFWSTIIEPDRLEGTKHVLQVAKTIPLPGVGPESLEFD 243
P+S++V T AA P +LAV + + TII PD L+GTKHVL +AKTIPLP GPES+E+D
Sbjct: 2 PLSQKVPT-WAAVPAVLAVFSVISYQTIIAPDNLKGTKHVLSMAKTIPLPVHGPESIEWD 60
Query: 244 SQGEGPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSRNQVVPSCGRPLGLSFHRKTG 423
QG GPY V DGRILKW+G+ GWV+FAYTSPHR NCSR++VVP+CGRPLGL F +KTG
Sbjct: 61 PQGGGPYAAVVDGRILKWQGDGIGWVEFAYTSPHRGNCSRHEVVPTCGRPLGLKFEKKTG 120
Query: 424 DLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFG 603
DLYICDGY GVMKVGPEGGLAELVVD+AEGRKVMFANQ+DIDEEEDV YFNDSSDKYHF
Sbjct: 121 DLYICDGYLGVMKVGPEGGLAELVVDQAEGRKVMFANQIDIDEEEDVLYFNDSSDKYHFR 180
Query: 604 EVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHRI 783
EVFYV+ +GD+ GRVIRY+ K KEAKV+MD L NGLAL+KD SF+I+CE TG++HR
Sbjct: 181 EVFYVASNGDRTGRVIRYNKKTKEAKVVMDNLRCNNGLALNKDRSFLISCESSTGLVHRY 240
Query: 784 WVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMHTL 963
W+KGPKAGT ++FAKVPG PDNIR TPTGDFW+ +HCK N R + + W+GK T+
Sbjct: 241 WIKGPKAGTRDIFAKVPGYPDNIRLTPTGDFWLGIHCKKNPLGRFMINNRWLGKIVEKTV 300
Query: 964 KLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSVY 1143
L+ ++ +MNG KPHGI +K+SGETGEI+E+LED EG TM+YVSEAYER+DGKLW GSV+
Sbjct: 301 NLDLLIAVMNGFKPHGIAVKISGETGEILEVLEDIEGKTMQYVSEAYERDDGKLWFGSVF 360
Query: 1144 WPAVWVLDK 1170
PAVWVLD+
Sbjct: 361 TPAVWVLDR 369
>gi|297827769|ref|XP_002881767.1| hypothetical protein ARALYDRAFT_321816
[Arabidopsis lyrata subsp. lyrata]
Length = 370
Score = 525 bits (1350), Expect = 1e-146
Identities = 249/371 (67%), Positives = 302/371 (81%), Gaps = 3/371 (0%)
Frame = +1
Query: 61 MPISRRVLTPVAAAPVILAVVCYLFWSTIIEPDRLEGTKHVLQVAKTIPLPGVGPESLEF 240
MP+SR+V T A ++AV+ II P+ +EG+K+VL +A+TIPLP GPESL++
Sbjct: 1 MPVSRKVQT-WAVVVAVMAVLVVFVGPYIIGPESIEGSKNVLTMARTIPLPVDGPESLDW 59
Query: 241 DSQGEGPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSRNQVVPSCGRPLGLSFHRKT 420
D +GEGPYVGVTDGRILKW GE+ GWV FAY+SPHR+NCSR++V P+CGRPLGLSF +K+
Sbjct: 60 DPRGEGPYVGVTDGRILKWSGEDLGWVQFAYSSPHRENCSRHKVEPACGRPLGLSFEKKS 119
Query: 421 GDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHF 600
GDLY CDGY G+MKVGP+GGLAE VVDEAEG+KVMFANQMDIDEEED YFNDSSD YHF
Sbjct: 120 GDLYFCDGYLGIMKVGPKGGLAEKVVDEAEGQKVMFANQMDIDEEEDAIYFNDSSDTYHF 179
Query: 601 G-EVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILH 777
G +VFY + G+K GR IRYD K KEAKVIMD+LH PNGLALSKDGSFV++CE T ++H
Sbjct: 180 GRDVFYAFLCGEKTGRAIRYDKKTKEAKVIMDRLHFPNGLALSKDGSFVLSCEVPTQLVH 239
Query: 778 RIWVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMH 957
R W KGPKAGT ++FAK+PG DNIRRT TGDFWVALH K F+RL +IH WVGKFF+
Sbjct: 240 RYWAKGPKAGTRDIFAKLPGYADNIRRTETGDFWVALHSKKTPFSRLSMIHPWVGKFFIK 299
Query: 958 TLKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGS 1137
TLK+E ++ L GGKPH + +KLSG+TGEI+E+LEDSEG MK++SE ER DG+LW GS
Sbjct: 300 TLKMELLLFLFEGGKPHAVAVKLSGKTGEIMEILEDSEGKNMKFISEVQER-DGRLWFGS 358
Query: 1138 VYWPAVWVLDK 1170
V+ P+VWVLD+
Sbjct: 359 VFLPSVWVLDR 369
>gi|145360869|ref|NP_181662.3| strictosidine synthase-like 1 [Arabidopsis
thaliana]
Length = 394
Score = 473 bits (1217), Expect = 3e-131
Identities = 218/310 (70%), Positives = 257/310 (82%), Gaps = 1/310 (0%)
Frame = +1
Query: 241 DSQGEGPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSRNQVVPSCGRPLGLSFHRKT 420
D +GEGPYVGVTDGRILKW GE+ GW++FAY+SPHR NCS ++V P+CGRPLGLSF +K+
Sbjct: 85 DPRGEGPYVGVTDGRILKWSGEDLGWIEFAYSSPHRKNCSSHKVEPACGRPLGLSFEKKS 144
Query: 421 GDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHF 600
GDLY CDGY GVMKVGP+GGLAE VVDE EG+KVMFANQMDIDEEED YFNDSSD YHF
Sbjct: 145 GDLYFCDGYLGVMKVGPKGGLAEKVVDEVEGQKVMFANQMDIDEEEDAIYFNDSSDTYHF 204
Query: 601 GEVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHR 780
G+VFY + G+K GR IRYD K KEAKVIMD+LH PNGLALS DGSFV++CE T ++HR
Sbjct: 205 GDVFYAFLCGEKTGRAIRYDKKTKEAKVIMDRLHFPNGLALSIDGSFVLSCEVPTQLVHR 264
Query: 781 IWVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMHT 960
W KGP AGT ++FAK+PG DNIRRT TGDFWVALH K F+RL +IH WVGKFF+ T
Sbjct: 265 YWAKGPNAGTRDIFAKLPGYADNIRRTETGDFWVALHSKKTPFSRLSMIHPWVGKFFIKT 324
Query: 961 LKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSV 1140
LK+E +V L GGKPH + +KLSG+TGEI+E+LEDSEG MK++SE ER DG+LW GSV
Sbjct: 325 LKMELLVFLFEGGKPHAVAVKLSGKTGEIMEILEDSEGKNMKFISEVQER-DGRLWFGSV 383
Query: 1141 YWPAVWVLDK 1170
+ P+VWVLD+
Sbjct: 384 FLPSVWVLDR 393
>gi|3894193|gb|AAC78542.1| putative strictosidine synthase [Arabidopsis
thaliana]
Length = 395
Score = 469 bits (1205), Expect = 8e-130
Identities = 218/311 (70%), Positives = 257/311 (82%), Gaps = 2/311 (0%)
Frame = +1
Query: 241 DSQGEGPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSRNQVVPSCGRPLGLSFHRKT 420
D +GEGPYVGVTDGRILKW GE+ GW++FAY+SPHR NCS ++V P+CGRPLGLSF +K+
Sbjct: 85 DPRGEGPYVGVTDGRILKWSGEDLGWIEFAYSSPHRKNCSSHKVEPACGRPLGLSFEKKS 144
Query: 421 GDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHF 600
GDLY CDGY GVMKVGP+GGLAE VVDE EG+KVMFANQMDIDEEED YFNDSSD YHF
Sbjct: 145 GDLYFCDGYLGVMKVGPKGGLAEKVVDEVEGQKVMFANQMDIDEEEDAIYFNDSSDTYHF 204
Query: 601 G-EVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILH 777
G +VFY + G+K GR IRYD K KEAKVIMD+LH PNGLALS DGSFV++CE T ++H
Sbjct: 205 GRDVFYAFLCGEKTGRAIRYDKKTKEAKVIMDRLHFPNGLALSIDGSFVLSCEVPTQLVH 264
Query: 778 RIWVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMH 957
R W KGP AGT ++FAK+PG DNIRRT TGDFWVALH K F+RL +IH WVGKFF+
Sbjct: 265 RYWAKGPNAGTRDIFAKLPGYADNIRRTETGDFWVALHSKKTPFSRLSMIHPWVGKFFIK 324
Query: 958 TLKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGS 1137
TLK+E +V L GGKPH + +KLSG+TGEI+E+LEDSEG MK++SE ER DG+LW GS
Sbjct: 325 TLKMELLVFLFEGGKPHAVAVKLSGKTGEIMEILEDSEGKNMKFISEVQER-DGRLWFGS 383
Query: 1138 VYWPAVWVLDK 1170
V+ P+VWVLD+
Sbjct: 384 VFLPSVWVLDR 394
>gi|79315403|ref|NP_001030876.1| strictosidine synthase family protein
[Arabidopsis thaliana]
Length = 356
Score = 416 bits (1069), Expect = 5e-114
Identities = 191/261 (73%), Positives = 223/261 (85%)
Frame = +1
Query: 388 RPLGLSFHRKTGDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVF 567
RPLGL+F +KTGDLYICDGY G+MKVGPEGGLAEL+VDEAEGRKVMFANQ DIDEEEDVF
Sbjct: 95 RPLGLTFEKKTGDLYICDGYLGLMKVGPEGGLAELIVDEAEGRKVMFANQGDIDEEEDVF 154
Query: 568 YFNDSSDKYHFGEVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVI 747
YFNDSSDKYHF +VF+V++SG++ GRVIRYD K KEAKVIMD L NGLAL+KD SF+I
Sbjct: 155 YFNDSSDKYHFRDVFFVAVSGERSGRVIRYDKKTKEAKVIMDNLVCNNGLALNKDRSFLI 214
Query: 748 TCEGGTGILHRIWVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLI 927
TCE GT ++HR W+KGPKAGT ++FAKVPG PDNIR T TGDFW+ LHCK NL RL +
Sbjct: 215 TCESGTSLVHRYWIKGPKAGTRDIFAKVPGYPDNIRLTSTGDFWIGLHCKKNLIGRLIVK 274
Query: 928 HSWVGKFFMHTLKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYE 1107
+ W+GK T+KLE V+ +NG KPHG+ +K+SGETGE++E+LED EG TMKYVSEAYE
Sbjct: 275 YKWLGKLVEKTMKLEYVIAFINGFKPHGVAVKISGETGEVLELLEDKEGKTMKYVSEAYE 334
Query: 1108 REDGKLWIGSVYWPAVWVLDK 1170
R+DGKLW GSVYWPAVWVLD+
Sbjct: 335 RDDGKLWFGSVYWPAVWVLDR 355
>gi|6911873|emb|CAB72173.1| putative protein [Arabidopsis thaliana]
Length = 372
Score = 338 bits (865), Expect = 2e-090
Identities = 178/368 (48%), Positives = 236/368 (64%), Gaps = 7/368 (1%)
Frame = +1
Query: 79 VLTPVAAAPVILAVVCYLFWSTIIEPDRLEGTKHVLQVAKTIPLPGV-GPESLEFDSQGE 255
V V AA + + V S I P + G++ V AK + L G GPES+ FD GE
Sbjct: 6 VFLTVIAAVLAILVKNSQTGSGIFAPPEISGSRDVFPSAKVVNLTGASGPESIAFDPAGE 65
Query: 256 GPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSR---NQVVPSCGRPLGLSFHRKTGD 426
GPYVGV+DGRILKWRGE GW DFA+TS +R C+R ++ CGRPLGL F +KTGD
Sbjct: 66 GPYVGVSDGRILKWRGEPLGWSDFAHTSSNRQECARPFAPELEHVCGRPLGLRFDKKTGD 125
Query: 427 LYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFGE 606
LYI D YFG++ VGP GGLA+ +V EAEG+ F N +DIDE+EDV YF D+S ++ +
Sbjct: 126 LYIADAYFGLLVVGPAGGLAKPLVTEAEGQPFRFTNDLDIDEQEDVIYFTDTSARFQRRQ 185
Query: 607 VFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHRIW 786
++ DK GR I+YD K+A V++ L NG+ALSKD SFV+ E T + R+W
Sbjct: 186 FLAAVLNVDKTGRFIKYDRSSKKATVLLQGLAFANGVALSKDRSFVLVVETTTCKILRLW 245
Query: 787 VKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMH-TL 963
+ GP AGT +VFA++PG PDNIRR G+FWVALH K LF +L L +W + +
Sbjct: 246 LSGPNAGTHQVFAELPGFPDNIRRNSNGEFWVALHSKKGLFAKLSLTQTWFRDLVLRLPI 305
Query: 964 KLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSVY 1143
+ + L GG PH +KLS E+G+++E+LED EG T++++SE E +DGKLWIGSV
Sbjct: 306 SPQRLHSLFTGGIPHATAIKLS-ESGKVLEVLEDKEGKTLRFISEV-EEKDGKLWIGSVL 363
Query: 1144 WPAVWVLD 1167
P + V D
Sbjct: 364 VPFLGVYD 371
>gi|30694556|ref|NP_191262.2| strictosidine synthase family protein [Arabidopsis
thaliana]
Length = 374
Score = 338 bits (865), Expect = 2e-090
Identities = 178/368 (48%), Positives = 236/368 (64%), Gaps = 7/368 (1%)
Frame = +1
Query: 79 VLTPVAAAPVILAVVCYLFWSTIIEPDRLEGTKHVLQVAKTIPLPGV-GPESLEFDSQGE 255
V V AA + + V S I P + G++ V AK + L G GPES+ FD GE
Sbjct: 8 VFLTVIAAVLAILVKNSQTGSGIFAPPEISGSRDVFPSAKVVNLTGASGPESIAFDPAGE 67
Query: 256 GPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSR---NQVVPSCGRPLGLSFHRKTGD 426
GPYVGV+DGRILKWRGE GW DFA+TS +R C+R ++ CGRPLGL F +KTGD
Sbjct: 68 GPYVGVSDGRILKWRGEPLGWSDFAHTSSNRQECARPFAPELEHVCGRPLGLRFDKKTGD 127
Query: 427 LYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFGE 606
LYI D YFG++ VGP GGLA+ +V EAEG+ F N +DIDE+EDV YF D+S ++ +
Sbjct: 128 LYIADAYFGLLVVGPAGGLAKPLVTEAEGQPFRFTNDLDIDEQEDVIYFTDTSARFQRRQ 187
Query: 607 VFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHRIW 786
++ DK GR I+YD K+A V++ L NG+ALSKD SFV+ E T + R+W
Sbjct: 188 FLAAVLNVDKTGRFIKYDRSSKKATVLLQGLAFANGVALSKDRSFVLVVETTTCKILRLW 247
Query: 787 VKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMH-TL 963
+ GP AGT +VFA++PG PDNIRR G+FWVALH K LF +L L +W + +
Sbjct: 248 LSGPNAGTHQVFAELPGFPDNIRRNSNGEFWVALHSKKGLFAKLSLTQTWFRDLVLRLPI 307
Query: 964 KLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSVY 1143
+ + L GG PH +KLS E+G+++E+LED EG T++++SE E +DGKLWIGSV
Sbjct: 308 SPQRLHSLFTGGIPHATAIKLS-ESGKVLEVLEDKEGKTLRFISEV-EEKDGKLWIGSVL 365
Query: 1144 WPAVWVLD 1167
P + V D
Sbjct: 366 VPFLGVYD 373
>gi|297820490|ref|XP_002878128.1| strictosidine synthase family protein
[Arabidopsis lyrata subsp. lyrata]
Length = 374
Score = 337 bits (863), Expect = 3e-090
Identities = 172/348 (49%), Positives = 228/348 (65%), Gaps = 7/348 (2%)
Frame = +1
Query: 139 STIIEPDRLEGTKHVLQVAKTIPLPGV-GPESLEFDSQGEGPYVGVTDGRILKWRGEEHG 315
S I P + G++ V AK + L G GPES+ FD GEGPYVGV+DGR+LKWR E G
Sbjct: 28 SGIFAPPEISGSRDVFPSAKVVTLTGASGPESIAFDPAGEGPYVGVSDGRVLKWRSESLG 87
Query: 316 WVDFAYTSPHRDNCSR---NQVVPSCGRPLGLSFHRKTGDLYICDGYFGVMKVGPEGGLA 486
W DFAYTS +R C R ++ CGRPLGL F +KTGDLYI D YFG++ VGP GGLA
Sbjct: 88 WSDFAYTSSNRQECVRPFAPELEHVCGRPLGLRFDKKTGDLYIADAYFGLLVVGPAGGLA 147
Query: 487 ELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFGEVFYVSISGDKVGRVIRYDMK 666
+ +V EAEG+ F N +DIDE+EDV YF D+S ++ + ++ DK GR I+YD
Sbjct: 148 KPLVTEAEGQPFRFTNDLDIDEQEDVIYFTDTSARFQRRQFLAAVLNVDKTGRFIKYDRS 207
Query: 667 KKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHRIWVKGPKAGTTEVFAKVPGPPD 846
K+A V++ L NG+ALSKD SFV+ E T + R+W+ GP AGT EVFA++PG PD
Sbjct: 208 SKKATVLLQGLAFANGVALSKDRSFVLVVETTTCKILRLWLSGPNAGTHEVFAELPGFPD 267
Query: 847 NIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMH-TLKLETVVHLMNGGKPHGIILK 1023
NIRR G+FWVALH K LF +L L +W + + + + L GG+PH +K
Sbjct: 268 NIRRNSNGEFWVALHSKKGLFAKLSLSQTWFRDLVLRLPISPQRLHSLFTGGRPHATAIK 327
Query: 1024 LSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSVYWPAVWVLD 1167
LS E+G+++E+LED+EG ++++SE E +DGKLWIGSV P + V D
Sbjct: 328 LS-ESGKVLEVLEDNEGKRLRFISEV-EEKDGKLWIGSVLMPFLGVYD 373
>gi|110743953|dbj|BAE99809.1| hypothetical protein [Arabidopsis thaliana]
Length = 374
Score = 335 bits (858), Expect = 1e-089
Identities = 177/368 (48%), Positives = 235/368 (63%), Gaps = 7/368 (1%)
Frame = +1
Query: 79 VLTPVAAAPVILAVVCYLFWSTIIEPDRLEGTKHVLQVAKTIPLPGV-GPESLEFDSQGE 255
V V AA + + V S I P + G++ V AK + L G GPES+ FD GE
Sbjct: 8 VFLTVIAAVLAILVKNSQTGSGIFAPPEISGSRDVFPSAKVVNLTGASGPESIAFDPAGE 67
Query: 256 GPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSR---NQVVPSCGRPLGLSFHRKTGD 426
GPYVGV+DGRILKWRGE GW DFA+TS +R C+R ++ CGRPLGL F +KTGD
Sbjct: 68 GPYVGVSDGRILKWRGEPLGWSDFAHTSSNRQECARPFAPELEHVCGRPLGLRFDKKTGD 127
Query: 427 LYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFGE 606
LYI D YFG++ VGP GGLA+ +V EAEG+ F N +DIDE+EDV YF D+S ++ +
Sbjct: 128 LYIADAYFGLLVVGPAGGLAKPLVTEAEGQPFRFTNDLDIDEQEDVIYFTDTSARFQRRQ 187
Query: 607 VFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHRIW 786
++ DK GR I+YD K+A V++ L NG+ALSKD SFV+ E T + R+W
Sbjct: 188 FLAAVLNVDKTGRFIKYDRSSKKATVLLQGLAFANGVALSKDRSFVLVVETTTCKILRLW 247
Query: 787 VKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMH-TL 963
+ GP AGT +VFA++PG PDNIRR G+FWVALH K LF +L L +W + +
Sbjct: 248 LSGPNAGTHQVFAELPGFPDNIRRNSNGEFWVALHSKKGLFAKLSLTQTWFRDLVLRLPI 307
Query: 964 KLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSVY 1143
+ + L GG PH +KLS E+G+++E+L D EG T++++SE E +DGKLWIGSV
Sbjct: 308 SPQRLHSLFTGGIPHATAIKLS-ESGKVLEVLGDKEGKTLRFISEV-EEKDGKLWIGSVL 365
Query: 1144 WPAVWVLD 1167
P + V D
Sbjct: 366 VPFLGVYD 373
>gi|225441250|ref|XP_002273764.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 370
Score = 323 bits (827), Expect = 5e-086
Identities = 169/372 (45%), Positives = 231/372 (62%), Gaps = 12/372 (3%)
Frame = +1
Query: 70 SRRVLTPV--AAAPVILAVVCYLFWSTIIEPDRLEGTKHVLQVAKTIPLPGV-GPESLEF 240
++ +LT + AA +ILAV + + +P + GT +L ++ I + G GPES+ F
Sbjct: 3 TKLILTAITLAAISIILAVNS----NHLFKPPSIPGTHDLLHGSEVIQVTGAFGPESIAF 58
Query: 241 DSQGEGPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNCSR---NQVVPSCGRPLGLSFH 411
D +GEGPY GV DGR+LKW G+ GW DFA T+ R C R ++ CGRPLGL F
Sbjct: 59 DPKGEGPYTGVADGRVLKWEGDGRGWTDFAVTTSERKECVRPFAPEMEHICGRPLGLRFD 118
Query: 412 RKTGDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDK 591
+KTGDLYI D YFG+ V P GGLA +V E EGR+++F N MDIDE EDV YF D+S
Sbjct: 119 KKTGDLYIADAYFGLQVVEPNGGLATPLVTEVEGRRLLFTNDMDIDEVEDVIYFTDTSTD 178
Query: 592 YHFGEVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGI 771
+H + +SGD GR+++YD KE V++ L NG+A+SKD SFV+ E TG
Sbjct: 179 FHRRQFMAALLSGDNTGRLMKYDKSSKEVTVLLRGLAFANGVAMSKDRSFVLVAETTTGK 238
Query: 772 LHRIWVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFF 951
+ R W+KGP AG ++VFA+VPG PDN+RR G+FWVALH K +SWVGK
Sbjct: 239 IIRYWLKGPNAGKSDVFAEVPGYPDNVRRNSKGEFWVALHAKKGPHANWITSNSWVGKTL 298
Query: 952 MHTLKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWI 1131
+ +H + + H +KLS E G+++E+LED EG +M+++SE E +GKLW+
Sbjct: 299 LKLPLTFKQLHKLIVVEAHATAIKLS-EEGQVLEVLEDCEGKSMRFISEV-EEHNGKLWL 356
Query: 1132 GSVYWPAVWVLD 1167
GSV P + V D
Sbjct: 357 GSVMMPFIGVYD 368
>gi|156763850|emb|CAO99127.1| strictosidine synthase-like protein [Nicotiana
tabacum]
Length = 380
Score = 318 bits (814), Expect = 2e-084
Identities = 157/345 (45%), Positives = 219/345 (63%), Gaps = 5/345 (1%)
Frame = +1
Query: 145 IIEPDRLEGTKHVLQVAKTIPLPGV-GPESLEFDSQGEGPYVGVTDGRILKWRGEEHGWV 321
+ +P + G++ VL A+ I L G G ES+ FD GEGPY GV DGRILKW+ WV
Sbjct: 35 VFKPAPIPGSQDVLSKAELIQLKGAFGAESVAFDPNGEGPYTGVADGRILKWQPHSQTWV 94
Query: 322 DFAYTSPHRDNCSR---NQVVPSCGRPLGLSFHRKTGDLYICDGYFGVMKVGPEGGLAEL 492
DFA TS R NCSR ++ CGRPLGL F KTGDLYI D YFG+ VGP GGLA
Sbjct: 95 DFAVTSSQRKNCSRPSAPEMEHVCGRPLGLRFDHKTGDLYIADAYFGLHVVGPTGGLATP 154
Query: 493 VVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFGEVFYVSISGDKVGRVIRYDMKKK 672
+V + EG+ ++F N +DID+++D+ YF D+S Y + + SGDK GR+++Y+ K
Sbjct: 155 LVQDFEGQPLLFTNDLDIDDDDDIIYFTDTSTIYQRRQFVAATASGDKTGRLMKYNKSTK 214
Query: 673 EAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHRIWVKGPKAGTTEVFAKVPGPPDNI 852
E V + L NG+ALSKD SF++ E + R W+KGP G ++FA++PG PDN+
Sbjct: 215 EVTVALGGLAFANGVALSKDRSFLLVAETSACRILRYWLKGPNVGNHDIFAELPGFPDNV 274
Query: 853 RRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFMHTLKLETVVHLMNGGKPHGIILKLSG 1032
R G+FWVALH K++ RL + +SW+GK + + + +L+ GG+PH +KLS
Sbjct: 275 RINSRGEFWVALHAKASPLARLIISNSWLGKTLLREFNFQQLHNLLVGGQPHATAIKLS- 333
Query: 1033 ETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSVYWPAVWVLD 1167
E G ++E+LED EG ++++SE +E E GKLWI SV ++ V D
Sbjct: 334 EDGRVLEVLEDVEGKILRFISEVHEEESGKLWISSVIMSSLGVYD 378
>gi|224139742|ref|XP_002323255.1| predicted protein [Populus trichocarpa]
Length = 375
Score = 305 bits (781), Expect = 1e-080
Identities = 163/363 (44%), Positives = 225/363 (61%), Gaps = 12/363 (3%)
Frame = +1
Query: 91 VAAAPVILAVVCYLFWS--TIIEPDRLEGTKHVLQVAKTIPLPG-VGPESLEFDSQGEGP 261
V ++A+V L S ++ P + + L AK + + G VGPESL FD GEGP
Sbjct: 8 VVTTTTLVAIVSILLTSPTKLLGPPTIPTSNDHLHSAKILHVSGAVGPESLVFDPNGEGP 67
Query: 262 YVGVTDGRILKWRGEEHG---WVDFAYTSPHRDNCSR---NQVVPSCGRPLGLSFHRKTG 423
Y GV DGR+LKW + G W DFA TS +R+ C R ++ CGRPLGL F +KTG
Sbjct: 68 YTGVADGRVLKWIAGDDGSGSWTDFATTSSNRNECVRPFAPEMEHVCGRPLGLRFDKKTG 127
Query: 424 DLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFNDSSDKYHFG 603
+LYI D Y G+ VGP GGLA VV E EG+ + F N +DIDE+EDV YF D+S +
Sbjct: 128 NLYIADAYLGLQVVGPTGGLATPVVTELEGQPMRFTNDLDIDEQEDVIYFTDTSMVFQRR 187
Query: 604 EVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGGTGILHRI 783
+ ++ DK GR+++YD KE V+ L NG+ALSKD +F++ E T + R
Sbjct: 188 QFILSLLTKDKTGRLLKYDKSSKEVTVLARGLAFANGVALSKDSTFLLVAETTTCRILRF 247
Query: 784 WVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVGKFFM-HT 960
W+ GP AG ++VF ++PG PDNIRR G+FWVALH K LF ++ L +SW+GK +
Sbjct: 248 WLHGPNAGKSDVFTELPGFPDNIRRNSKGEFWVALHSKKGLFAKVVLSNSWIGKTLLKFP 307
Query: 961 LKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGKLWIGSV 1140
L + + L+ GGK H +KLS E G+++++LED +G T++++SE E +DGKLWIGSV
Sbjct: 308 LSFKQLHSLLVGGKAHATAIKLS-EEGKVLDVLEDCDGKTLRFISEV-EEKDGKLWIGSV 365
Query: 1141 YWP 1149
P
Sbjct: 366 LMP 368
>gi|297820488|ref|XP_002878127.1| strictosidine synthase family protein
[Arabidopsis lyrata subsp. lyrata]
Length = 343
Score = 290 bits (741), Expect = 5e-076
Identities = 130/195 (66%), Positives = 159/195 (81%)
Frame = +1
Query: 583 SDKYHFGEVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEGG 762
SDKYHF +VF+V++SG++ GRVIRYD K KEAKV+MD L NGLAL+KD SF+ITCE G
Sbjct: 147 SDKYHFRDVFFVAVSGERSGRVIRYDKKTKEAKVVMDNLVCNNGLALNKDRSFLITCESG 206
Query: 763 TGILHRIWVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWVG 942
T ++HR W+KGPKAGT ++FAKVPG PDNIR T TGDFW+ +HCK NL RL + + W+G
Sbjct: 207 TSLVHRYWIKGPKAGTRDIFAKVPGYPDNIRLTSTGDFWIGIHCKKNLLGRLIVRYKWLG 266
Query: 943 KFFMHTLKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGTTMKYVSEAYEREDGK 1122
K T+KLE V+ +NG KP G+ +K+SGETGE++E+LED EG TMKYVSEAYER+DGK
Sbjct: 267 KLVEKTIKLEYVIAFINGFKPQGVAVKISGETGEVLEVLEDKEGKTMKYVSEAYERDDGK 326
Query: 1123 LWIGSVYWPAVWVLD 1167
LW GSVYWPAVWVLD
Sbjct: 327 LWFGSVYWPAVWVLD 341
>gi|224139738|ref|XP_002323253.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 287 bits (733), Expect = 4e-075
Identities = 145/325 (44%), Positives = 202/325 (62%), Gaps = 14/325 (4%)
Frame = +1
Query: 202 IPLPG-VGPESLEFDSQGEGPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNC----SRN 366
IP+ G +GPES FDS GEGPY ++DGRI+KW+G++ W+DFA TSP+RD C +
Sbjct: 23 IPIVGAIGPESFAFDSLGEGPYTSLSDGRIIKWQGDKKRWIDFAVTSPNRDGCGGPHDHH 82
Query: 367 QVVPSCGRPLGLSFHRKTGDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDI 546
Q+ CGRPLG F GDLYI D Y G+++VGPEGGLA + A+G F N +DI
Sbjct: 83 QMEHVCGRPLGSCFDETHGDLYIADAYMGLLRVGPEGGLATKIATHAQGIPFRFTNSLDI 142
Query: 547 DEEEDVFYFNDSSDKYHFGEVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALS 726
D+ YF DSS +Y + V +SGDK GR+++YD K+ V++ L PNG+ALS
Sbjct: 143 DQSSGAIYFTDSSTQYQRRDYLSVVLSGDKSGRLMKYDTASKQVTVLLKNLTFPNGVALS 202
Query: 727 KDGSFVITCEGGTGILHRIWVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNL 906
DGSFV+ E + + R W+K KAG EVFA++ G PDNI+R+P G +WV ++ K
Sbjct: 203 TDGSFVLLAETTSCRILRYWIKTSKAGALEVFAQLQGFPDNIKRSPRGGYWVGINSKREK 262
Query: 907 FTRLFLIHSWVGKFF----MHTLKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEG 1074
+ L + W+GK + K +T + GG G+ ++LS E G+I+E+ ED +G
Sbjct: 263 LSELLFSYPWIGKVLLKLPLDITKFQTALAKYRGG---GLAVRLS-ENGDIVEVFEDRDG 318
Query: 1075 TTMKYVSEAYEREDGKLWIGSVYWP 1149
+K +SE E+ DGKLWIGS+ P
Sbjct: 319 NRLKSISEVMEK-DGKLWIGSIDLP 342
>gi|225441248|ref|XP_002267323.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 378
Score = 276 bits (705), Expect = 7e-072
Identities = 153/373 (41%), Positives = 207/373 (55%), Gaps = 10/373 (2%)
Frame = +1
Query: 61 MPISRRVLTPVAAAPVILAVVCYLFWSTIIEPDRLEGTKHVLQVAKTIPLP---GVGPES 231
M +S ++L AAA L T + + K Q IP+P +GPES
Sbjct: 1 MAMSSKLLLAAAAAAAFLITALIAGKRTSLSSPEFDSEKFSNQKDAVIPIPTPGAIGPES 60
Query: 232 LEFDSQGEGPYVGVTDGRILKWRGEEHGWVDFAYTSPHRDNC--SRNQVVPS--CGRPLG 399
L FDS G GPY GV+DGRI+KW E WVDFA TS R+ C SR+ V CGRPLG
Sbjct: 61 LAFDSVGGGPYTGVSDGRIIKWEENEERWVDFATTSSKREGCRGSRDHVPLEHICGRPLG 120
Query: 400 LSFHRKTGDLYICDGYFGVMKVGPEGGLAELVVDEAEGRKVMFANQMDIDEEEDVFYFND 579
LSF TG+LYI D Y G++ VGP GGLA V EA+G F+N +DI + YF+D
Sbjct: 121 LSFSELTGELYIADAYMGLLVVGPNGGLASTVASEAQGTPFGFSNGVDIHQTNGAVYFSD 180
Query: 580 SSDKYHFGEVFYVSISGDKVGRVIRYDMKKKEAKVIMDKLHLPNGLALSKDGSFVITCEG 759
SS +Y ISGD GR+++Y+ + K+ V++ L PNG+ALSK+G F++ E
Sbjct: 181 SSSRYQRRNFVAAIISGDNTGRLMKYEPESKQVTVLLRSLGFPNGVALSKNGDFILLSET 240
Query: 760 GTGILHRIWVKGPKAGTTEVFAKVPGPPDNIRRTPTGDFWVALHCKSNLFTRLFLIHSWV 939
+ R W++ KAGT EVF +PG PDNI+R G+FWV +H + FL + W+
Sbjct: 241 SRCRILRFWLQTSKAGTVEVFTLLPGFPDNIKRNSKGEFWVGMHSRKGKLVEWFLSYPWI 300
Query: 940 GKFFMHTLKLETVVHLMNGGKPHGIILKLSGETGEIIEMLEDSEGT-TMKYVSEAYERED 1116
G+ + + + + G ++LS E GE++E+ E G + +SE YER D
Sbjct: 301 GRTLLKLPFPHGFLSFFSKWRKTGFAVRLS-EEGEVLEIFEPKNGNGWISSISEVYER-D 358
Query: 1117 GKLWIGSVYWPAV 1155
G LWIGSV P V
Sbjct: 359 GSLWIGSVTTPCV 371
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,584,745,891,651
Number of Sequences: 15229318
Number of Extensions: 2584745891651
Number of Successful Extensions: 639272726
Number of sequences better than 0.0: 0