BLASTX 7.6.2
Query= UN21103 /QuerySize=1192
(1191 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|297829784|ref|XP_002882774.1| hypothetical protein ARALYDRAFT... 397 2e-108
gi|18399568|ref|NP_566419.1| uncharacterized protein [Arabidopsi... 386 4e-105
gi|297806721|ref|XP_002871244.1| hypothetical protein ARALYDRAFT... 233 7e-059
gi|334187476|ref|NP_001190245.1| uncharacterized protein [Arabid... 228 2e-057
gi|18415369|ref|NP_568176.1| uncharacterized protein [Arabidopsi... 204 4e-050
gi|30681695|ref|NP_850785.1| uncharacterized protein [Arabidopsi... 202 1e-049
gi|334187478|ref|NP_001190246.1| uncharacterized protein [Arabid... 194 4e-047
gi|14596123|gb|AAK68789.1| Unknown protein [Arabidopsis thaliana] 173 5e-041
gi|312281469|dbj|BAJ33600.1| unnamed protein product [Thellungie... 151 3e-034
gi|255578233|ref|XP_002529984.1| hypothetical protein RCOM_12643... 91 3e-016
>gi|297829784|ref|XP_002882774.1| hypothetical protein ARALYDRAFT_897442
[Arabidopsis lyrata subsp. lyrata]
Length = 273
Score = 397 bits (1019), Expect = 2e-108
Identities = 214/279 (76%), Positives = 233/279 (83%), Gaps = 17/279 (6%)
Frame = +3
Query: 201 MDCYTGRNFEDFVVPNYQETSSETC-LDGMWGGWSMNSPEAAEKCFDYDRFN------TQ 359
MDCY GRNFE+ VVP+YQE+SSET GMWGGWSM+SPEAAEKCFDYD FN +Q
Sbjct: 1 MDCYAGRNFEELVVPSYQESSSETYPSTGMWGGWSMSSPEAAEKCFDYDGFNGGGMMYSQ 60
Query: 360 MGMRTSEDEEEEEEESKRSKAFYGASSLHGFEGIEQMDDIFLSSILEDVPGSDGDVHRAS 539
MGMRTS EEEEESKRSKAFYGASSLH FEGIEQMDDIFLSSILEDVP +GDVHRAS
Sbjct: 61 MGMRTS----EEEEESKRSKAFYGASSLHDFEGIEQMDDIFLSSILEDVPEDEGDVHRAS 116
Query: 540 STNNSVGSSSMYGGS-EVPLFHSHAMPLKEGAPFTISDLSEENMF---VDDEMSSEELVL 707
S+NNSVGSSSM+GG EVP+FH H M KE APFTISDLSEENM DE+SSEELVL
Sbjct: 117 SSNNSVGSSSMFGGGREVPMFHCHDMSFKEEAPFTISDLSEENMLDSNYGDELSSEELVL 176
Query: 708 QDLQRASEKLTDETRKCFRDTFYRLARNSQEKLDSDHTANSGEFHMQASRYDYGDNTRLM 887
QDLQRAS+KLTDETRKCFRDTFYRLAR+SQ+ DS + NS E +Q SRY+YGD RL
Sbjct: 177 QDLQRASQKLTDETRKCFRDTFYRLARSSQDNSDS-VSPNSEELLVQTSRYNYGDGNRL- 234
Query: 888 SREEEMESETNSIDRAVANLTYNKMESNISNFPLPERVQ 1004
SREEE+E+ETNSIDRAVANLT+NKMESNISNFPL ERVQ
Sbjct: 235 SREEEIETETNSIDRAVANLTFNKMESNISNFPLSERVQ 273
>gi|18399568|ref|NP_566419.1| uncharacterized protein [Arabidopsis thaliana]
Length = 269
Score = 386 bits (991), Expect = 4e-105
Identities = 212/279 (75%), Positives = 227/279 (81%), Gaps = 21/279 (7%)
Frame = +3
Query: 201 MDCYTGRNFEDFVVPNYQETSSETC-LDGMWGGWSMNSPEAAEKCFDYDRFN------TQ 359
MDCY E+ VVPNYQE+SSET GMWGGWSM+SPEAAEKCFDYD FN +Q
Sbjct: 1 MDCYA----EELVVPNYQESSSETYPSTGMWGGWSMSSPEAAEKCFDYDGFNGEGMMYSQ 56
Query: 360 MGMRTSEDEEEEEEESKRSKAFYGASSLHGFEGIEQMDDIFLSSILEDVPGSDGDVHRAS 539
M MRTS EEEEESKRSKAFYGASSLH FEGIEQMDD+FLSSILEDVP DGDVHRA+
Sbjct: 57 MSMRTS----EEEEESKRSKAFYGASSLHDFEGIEQMDDMFLSSILEDVPEDDGDVHRAT 112
Query: 540 STNNSVGSSSMYGGS-EVPLFHSHAMPLKEGAPFTISDLSEENMF---VDDEMSSEELVL 707
S+NNSVGSSSMYGG EVP+FH H M KE APFTISDLSEENM DE+SSEE VL
Sbjct: 113 SSNNSVGSSSMYGGGREVPMFHCHDMSFKEEAPFTISDLSEENMLDSNYGDELSSEEFVL 172
Query: 708 QDLQRASEKLTDETRKCFRDTFYRLARNSQEKLDSDHTANSGEFHMQASRYDYGDNTRLM 887
QDLQRAS+KLTDETRKCFRDTFYRLAR+SQ+K DS + NS E MQ SRYDYGD R
Sbjct: 173 QDLQRASQKLTDETRKCFRDTFYRLARSSQDKSDS-VSPNSEELLMQTSRYDYGDGNR-F 230
Query: 888 SREEEMESETNSIDRAVANLTYNKMESNISNFPLPERVQ 1004
SREEE+ESETNSIDRAVANLT+NKMESNISNFPL ERVQ
Sbjct: 231 SREEEIESETNSIDRAVANLTFNKMESNISNFPLSERVQ 269
>gi|297806721|ref|XP_002871244.1| hypothetical protein ARALYDRAFT_487518
[Arabidopsis lyrata subsp. lyrata]
Length = 269
Score = 233 bits (592), Expect = 7e-059
Identities = 152/278 (54%), Positives = 175/278 (62%), Gaps = 47/278 (16%)
Frame = +3
Query: 201 MDCYTGRNFEDFVVPNYQETSSETCLDGMWG-GWSMNSPEAAEKCFDYDRFNT------- 356
MD Y+ RN ED VVPNYQETS MWG GWSMNS EAAEKCFDYD +
Sbjct: 1 MDRYSRRNLEDLVVPNYQETSDSYPSPDMWGTGWSMNSSEAAEKCFDYDVIHNGFSGGLY 60
Query: 357 ---QMGMRTSEDEEEEEEESKRSKAFYGASSLHGFEGIEQMDDIFLSSILEDVPGSDGDV 527
QM M TSE EEE + K S F SLH F+ I+QMDD+FLSSILEDVPG + +
Sbjct: 61 SQMQMEMGTSEQVEEETKRLKASGCF--DRSLHDFDEIQQMDDMFLSSILEDVPGDENFL 118
Query: 528 -HRASSTNNSVGSSSMY----GGSEVPLFH------SHAMPLKEGAPFTISDLSEENMFV 674
+ S TNNS GSSS Y G EVP+FH ++E AP +L EEN+
Sbjct: 119 SFKESDTNNSSGSSSAYLDTTDGREVPMFHYNWETCQDMQLMEEDAPM---NLCEENI-- 173
Query: 675 DDEMSSEELVLQDLQRASEKLTDETRKCFRDTFYRLARNSQEKLDSDHTANSGEFHMQAS 854
+E S+EE+VLQDLQRA+E LTD+TRKCFRDTFYRLA+NSQ+K DS N EF
Sbjct: 174 -EEASAEEVVLQDLQRATEMLTDDTRKCFRDTFYRLAKNSQQKSDS----NPDEF----- 223
Query: 855 RYDYGDNTRLMSREEEMESETNSIDRAVANLTYNKMES 968
D T SRE E+ ETNSIDRAVANLT+NKMES
Sbjct: 224 ---LEDRT---SRESEL--ETNSIDRAVANLTFNKMES 253
>gi|334187476|ref|NP_001190245.1| uncharacterized protein [Arabidopsis
thaliana]
Length = 279
Score = 228 bits (580), Expect = 2e-057
Identities = 146/287 (50%), Positives = 177/287 (61%), Gaps = 33/287 (11%)
Frame = +3
Query: 201 MDCYTGRNFEDFVVPNYQETSSETCLDGMWG-GWSMNSPEAAEKCFDYDRFNT------- 356
MD Y+ RN ED VVPNYQETS MWG GWSMNS EAAEKCFDYD +
Sbjct: 1 MDRYSRRNLEDLVVPNYQETSDSYPSPDMWGTGWSMNSSEAAEKCFDYDVIHNGFSGGLY 60
Query: 357 ---QMGMRTSEDEEEEEEESKRSKAFYGASSLHGFEGIEQMDDIFLSSILEDVPGSDGDV 527
+M M TSE EEE ++ K S F SLH F+ I+ MDD+F SILEDVPG++ +
Sbjct: 61 SQMEMDMGTSEQVEEETKKLKASGCF--DRSLHDFDEIQHMDDMFF-SILEDVPGNENFL 117
Query: 528 HRASSTNNSVGSSSMY---GGSEVPLFH-----SHAMPL-KEGAPFTISDLSEENMFVDD 680
S NN+ SSS G EVPLFH MPL +E AP +L EEN +
Sbjct: 118 SFKESDNNNSSSSSYLDTTDGREVPLFHYNWETCQDMPLMEEDAPM---NLCEEN---KE 171
Query: 681 EMSSEELVLQDLQRASEKLTDETRKCFRDTFYRLARNSQEKLDSDHTANSGEFHMQASRY 860
E S+EE+VLQDLQRA+E LTD+TRKCFRDTFYRLA+NSQ+K DS NS EF +
Sbjct: 172 EASAEEVVLQDLQRATEMLTDDTRKCFRDTFYRLAKNSQQKSDS----NSDEFLEDRTSS 227
Query: 861 DYGDNTRLMSREEEMESETNSIDRAVANLTYNKMESNISNFPLPERV 1001
+ + ++ + NSIDRAVANLT+NKMESN+ N P P+R+
Sbjct: 228 NDSSPSMTFLSVGKLNLKPNSIDRAVANLTFNKMESNMRNMPPPKRL 274
>gi|18415369|ref|NP_568176.1| uncharacterized protein [Arabidopsis thaliana]
Length = 275
Score = 204 bits (517), Expect = 4e-050
Identities = 136/269 (50%), Positives = 162/269 (60%), Gaps = 43/269 (15%)
Frame = +3
Query: 201 MDCYTGRNFEDFVVPNYQETSSETCLDGMWG-GWSMNSPEAAEKCFDYDRFNT------- 356
MD Y+ RN ED VVPNYQETS MWG GWSMNS EAAEKCFDYD +
Sbjct: 1 MDRYSRRNLEDLVVPNYQETSDSYPSPDMWGTGWSMNSSEAAEKCFDYDVIHNGFSGGLY 60
Query: 357 ---QMGMRTSEDEEEEEEESKRSKAFYGASSLHGFEGIEQMDDIFLSSILEDVPGSDGDV 527
+M M TSE EEE ++ K S F SLH F+ I+ MDD+FLSSILEDVPG++ +
Sbjct: 61 SQMEMDMGTSEQVEEETKKLKASGCF--DRSLHDFDEIQHMDDMFLSSILEDVPGNENFL 118
Query: 528 HRASSTNNSVGSSSMY---GGSEVPLFH-----SHAMPL-KEGAPFTISDLSEENMFVDD 680
S NN+ SSS G EVPLFH MPL +E AP +L EEN +
Sbjct: 119 SFKESDNNNSSSSSYLDTTDGREVPLFHYNWETCQDMPLMEEDAPM---NLCEEN---KE 172
Query: 681 EMSSEELVLQDLQRASEKLTDETRKCFRDTFYRLARNSQEKLDSDHTANSGEFHMQASRY 860
E S+EE+VLQDLQRA+E LTD+TRKCFRDTFYRLA+NSQ+K DS NS EF
Sbjct: 173 EASAEEVVLQDLQRATEMLTDDTRKCFRDTFYRLAKNSQQKSDS----NSDEF------- 221
Query: 861 DYGDNTRLMSREEEMESETNSIDRAVANL 947
D T SRE E E++ N R +++
Sbjct: 222 -LEDRT---SRETEFETKLNRQSRGQSHI 246
>gi|30681695|ref|NP_850785.1| uncharacterized protein [Arabidopsis thaliana]
Length = 274
Score = 202 bits (512), Expect = 1e-049
Identities = 135/269 (50%), Positives = 161/269 (59%), Gaps = 44/269 (16%)
Frame = +3
Query: 201 MDCYTGRNFEDFVVPNYQETSSETCLDGMWG-GWSMNSPEAAEKCFDYDRFNT------- 356
MD Y+ RN ED VVPNYQETS MWG GWSMNS EAAEKCFDYD +
Sbjct: 1 MDRYSRRNLEDLVVPNYQETSDSYPSPDMWGTGWSMNSSEAAEKCFDYDVIHNGFSGGLY 60
Query: 357 ---QMGMRTSEDEEEEEEESKRSKAFYGASSLHGFEGIEQMDDIFLSSILEDVPGSDGDV 527
+M M TSE EEE ++ K S F SLH F+ I+ MDD+FLSSILEDVPG++ +
Sbjct: 61 SQMEMDMGTSEQVEEETKKLKASGCF--DRSLHDFDEIQHMDDMFLSSILEDVPGNENFL 118
Query: 528 HRASSTNNSVGSSSMY---GGSEVPLFH-----SHAMPL-KEGAPFTISDLSEENMFVDD 680
S NN+ SSS G EVPLFH MPL +E AP +L EEN +
Sbjct: 119 SFKESDNNNSSSSSYLDTTDGREVPLFHYNWETCQDMPLMEEDAPM---NLCEEN---KE 172
Query: 681 EMSSEELVLQDLQRASEKLTDETRKCFRDTFYRLARNSQEKLDSDHTANSGEFHMQASRY 860
E S+EE+VLQDLQRA+E LTD+TRKCFRDTFYRLA+NSQ+K DS NS EF
Sbjct: 173 EASAEEVVLQDLQRATEMLTDDTRKCFRDTFYRLAKNSQQKSDS----NSDEF------- 221
Query: 861 DYGDNTRLMSREEEMESETNSIDRAVANL 947
D T RE E E++ N R +++
Sbjct: 222 -LEDRT----RETEFETKLNRQSRGQSHI 245
>gi|334187478|ref|NP_001190246.1| uncharacterized protein [Arabidopsis
thaliana]
Length = 266
Score = 194 bits (491), Expect = 4e-047
Identities = 120/220 (54%), Positives = 141/220 (64%), Gaps = 28/220 (12%)
Frame = +3
Query: 201 MDCYTGRNFEDFVVPNYQETSSETCLDGMWG-GWSMNSPEAAEKCFDYDRFNT------- 356
MD Y+ RN ED VVPNYQETS MWG GWSMNS EAAEKCFDYD +
Sbjct: 1 MDRYSRRNLEDLVVPNYQETSDSYPSPDMWGTGWSMNSSEAAEKCFDYDVIHNGFSGGLY 60
Query: 357 ---QMGMRTSEDEEEEEEESKRSKAFYGASSLHGFEGIEQMDDIFLSSILEDVPGSDGDV 527
+M M TSE EEE ++ K S F SLH F+ I+ MDD+FLSSILEDVPG++ +
Sbjct: 61 SQMEMDMGTSEQVEEETKKLKASGCF--DRSLHDFDEIQHMDDMFLSSILEDVPGNENFL 118
Query: 528 HRASSTNNSVGSSSMY---GGSEVPLFH-----SHAMPL-KEGAPFTISDLSEENMFVDD 680
S NN+ SSS G EVPLFH MPL +E AP +L EEN +
Sbjct: 119 SFKESDNNNSSSSSYLDTTDGREVPLFHYNWETCQDMPLMEEDAPM---NLCEEN---KE 172
Query: 681 EMSSEELVLQDLQRASEKLTDETRKCFRDTFYRLARNSQE 800
E S+EE+VLQDLQRA+E LTD+TRKCFRDTFYRLA+NSQ+
Sbjct: 173 EASAEEVVLQDLQRATEMLTDDTRKCFRDTFYRLAKNSQQ 212
>gi|14596123|gb|AAK68789.1| Unknown protein [Arabidopsis thaliana]
Length = 246
Score = 173 bits (438), Expect = 5e-041
Identities = 119/241 (49%), Positives = 144/241 (59%), Gaps = 44/241 (18%)
Frame = +3
Query: 285 MWG-GWSMNSPEAAEKCFDYDRFNT----------QMGMRTSEDEEEEEEESKRSKAFYG 431
MWG GWSMNS EAAEKCFDYD + +M M TSE EEE ++ K S F
Sbjct: 1 MWGTGWSMNSSEAAEKCFDYDVIHNGFSGGLYSQMEMDMGTSEQVEEETKKLKASGCF-- 58
Query: 432 ASSLHGFEGIEQMDDIFLSSILEDVPGSDGDVHRASSTNNSVGSSSMY---GGSEVPLFH 602
SLH F+ I+ MDD+FLSSILEDVPG++ + S NN+ SSS G EVPLFH
Sbjct: 59 DRSLHDFDEIQHMDDMFLSSILEDVPGNENFLSFKESDNNNSSSSSYLDTTDGREVPLFH 118
Query: 603 -----SHAMPL-KEGAPFTISDLSEENMFVDDEMSSEELVLQDLQRASEKLTDETRKCFR 764
MPL +E AP +L EEN +E S+EE+VLQDLQRA+E LTD+TRKCFR
Sbjct: 119 YNWETCQDMPLMEEDAPM---NLCEEN---KEEASAEEVVLQDLQRATEMLTDDTRKCFR 172
Query: 765 DTFYRLARNSQEKLDSDHTANSGEFHMQASRYDYGDNTRLMSREEEMESETNSIDRAVAN 944
DTFYRLA+NSQ+K DS NS EF D T RE E E++ N R ++
Sbjct: 173 DTFYRLAKNSQQKSDS----NSDEF--------LEDRT----RETEFETKLNRQSRGQSH 216
Query: 945 L 947
+
Sbjct: 217 I 217
>gi|312281469|dbj|BAJ33600.1| unnamed protein product [Thellungiella halophila]
Length = 286
Score = 151 bits (380), Expect = 3e-034
Identities = 99/183 (54%), Positives = 119/183 (65%), Gaps = 21/183 (11%)
Frame = +3
Query: 339 YDRFNTQMGMRTSEDEEEEEEESKRSKA----FY-GASSLHGFEGIEQMDDIFLSSILED 503
Y + +M M E EEESKR KA FY SSLH F+GI+QMDDIFLSSILED
Sbjct: 59 YSQMEMEMEMEMG-TSGEVEEESKRLKAAGDCFYRPTSSLHDFDGIQQMDDIFLSSILED 117
Query: 504 VPGSDGDVHRASSTNNSVGSSSMY----GGSEVPLFH-----SHAMPL--KEGAPFTISD 650
VPG++G + N+S G SS Y G EVP+FH MPL + A IS+
Sbjct: 118 VPGNEGLHSFSELDNDSPGPSSAYLSNLDGIEVPMFHYDWETCQDMPLMGEHEASMKISE 177
Query: 651 LSEENMFVDDEMSSEELVLQDLQRASEKLTDETRKCFRDTFYRLARNSQEKLDSDHTANS 830
L EENM +E S+EE+VLQDLQRA+EKLTD+TRKCFRDTFYRLA+NSQ+K +S + N
Sbjct: 178 LCEENM---EEPSNEEVVLQDLQRATEKLTDDTRKCFRDTFYRLAKNSQQKSESGNN-NP 233
Query: 831 GEF 839
EF
Sbjct: 234 EEF 236
Score = 75 bits (183), Expect = 2e-011
Identities = 40/69 (57%), Positives = 44/69 (63%), Gaps = 1/69 (1%)
Frame = +3
Query: 201 MDCYTGRNFEDFVVPNYQETSSETCLDGMWGGWSMNSPEAAEKCFDYDRFNTQM-GMRTS 377
MD Y+ RN ED VVPNYQETS MWGGWSMNS +AAEKCFD+D N G S
Sbjct: 1 MDRYSRRNSEDLVVPNYQETSDSYPSPDMWGGWSMNSQKAAEKCFDFDVINNGFSGGLYS 60
Query: 378 EDEEEEEEE 404
+ E E E E
Sbjct: 61 QMEMEMEME 69
Score = 57 bits (135), Expect = 7e-006
Identities = 25/38 (65%), Positives = 33/38 (86%)
Frame = +3
Query: 888 SREEEMESETNSIDRAVANLTYNKMESNISNFPLPERV 1001
+R+++ E ETNSIDRA+ANLT+NKMESN+ N P P+RV
Sbjct: 244 NRDQKTELETNSIDRAIANLTFNKMESNMRNLPPPKRV 281
>gi|255578233|ref|XP_002529984.1| hypothetical protein RCOM_1264330 [Ricinus
communis]
Length = 365
Score = 91 bits (225), Expect = 3e-016
Identities = 65/181 (35%), Positives = 98/181 (54%), Gaps = 10/181 (5%)
Frame = +3
Query: 450 FEGIEQMDDIFLSSILEDVPGSDGDVHRASSTNNSVGSSSMYGGSEVPLFHS---HAMPL 620
FE + D + SIL D+ + D SS SVGSS L ++ H P
Sbjct: 134 FEPELENDMVHGDSILTDM---NLDTPSISSDTQSVGSSKTSSDDIRALSYTYVRHEAPH 190
Query: 621 KEGAPFTISDLSEENMFVDDEMSSEELVLQDLQRASEKLTDETRKCFRDTFYRLARNSQE 800
E + D +EN VD+E S EE VLQ+++ +LTD+TR CFRD YRLA+NS++
Sbjct: 191 VEVLVPSEHDSIKEN--VDEETSLEESVLQEMEMVMSQLTDKTRICFRDALYRLAKNSRQ 248
Query: 801 KLDSDHTANSGEFHMQASRYDYGDNTRLMSREEEMESETNSIDRAVANLTYNKMESNISN 980
+ + + +G + S++ D ++ ME ETN+IDRA+ANL +NKM+ N+ +
Sbjct: 249 NVVTKN--QNGNLQLAISQWTDQDCKIRPREKKTMELETNTIDRAIANLMFNKMDINVHD 306
Query: 981 F 983
+
Sbjct: 307 Y 307
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,498,019,856,450
Number of Sequences: 15229318
Number of Extensions: 2498019856450
Number of Successful Extensions: 628698129
Number of sequences better than 0.0: 0