
GenBank BLAST output of UN21103


BLASTX 7.6.2

Query= UN21103 /QuerySize=1192
        (1191 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|297829784|ref|XP_002882774.1| hypothetical protein ARALYDRAFT...    397   2e-108
gi|18399568|ref|NP_566419.1| uncharacterized protein [Arabidopsi...    386   4e-105
gi|297806721|ref|XP_002871244.1| hypothetical protein ARALYDRAFT...    233   7e-059
gi|334187476|ref|NP_001190245.1| uncharacterized protein [Arabid...    228   2e-057
gi|18415369|ref|NP_568176.1| uncharacterized protein [Arabidopsi...    204   4e-050
gi|30681695|ref|NP_850785.1| uncharacterized protein [Arabidopsi...    202   1e-049
gi|334187478|ref|NP_001190246.1| uncharacterized protein [Arabid...    194   4e-047
gi|14596123|gb|AAK68789.1| Unknown protein [Arabidopsis thaliana]      173   5e-041
gi|312281469|dbj|BAJ33600.1| unnamed protein product [Thellungie...    151   3e-034
gi|255578233|ref|XP_002529984.1| hypothetical protein RCOM_12643...     91   3e-016

>gi|297829784|ref|XP_002882774.1| hypothetical protein ARALYDRAFT_897442
        [Arabidopsis lyrata subsp. lyrata]

          Length = 273

 Score =  397 bits (1019), Expect = 2e-108
 Identities = 214/279 (76%), Positives = 233/279 (83%), Gaps = 17/279 (6%)
 Frame = +3

Query:  201 MDCYTGRNFEDFVVPNYQETSSETC-LDGMWGGWSMNSPEAAEKCFDYDRFN------TQ 359
            MDCY GRNFE+ VVP+YQE+SSET    GMWGGWSM+SPEAAEKCFDYD FN      +Q
Sbjct:    1 MDCYAGRNFEELVVPSYQESSSETYPSTGMWGGWSMSSPEAAEKCFDYDGFNGGGMMYSQ 60

Query:  360 MGMRTSEDEEEEEEESKRSKAFYGASSLHGFEGIEQMDDIFLSSILEDVPGSDGDVHRAS 539
            MGMRTS    EEEEESKRSKAFYGASSLH FEGIEQMDDIFLSSILEDVP  +GDVHRAS
Sbjct:   61 MGMRTS----EEEEESKRSKAFYGASSLHDFEGIEQMDDIFLSSILEDVPEDEGDVHRAS 116

Query:  540 STNNSVGSSSMYGGS-EVPLFHSHAMPLKEGAPFTISDLSEENMF---VDDEMSSEELVL 707
            S+NNSVGSSSM+GG  EVP+FH H M  KE APFTISDLSEENM      DE+SSEELVL
Sbjct:  117 SSNNSVGSSSMFGGGREVPMFHCHDMSFKEEAPFTISDLSEENMLDSNYGDELSSEELVL 176

Query:  708 QDLQRASEKLTDETRKCFRDTFYRLARNSQEKLDSDHTANSGEFHMQASRYDYGDNTRLM 887
            QDLQRAS+KLTDETRKCFRDTFYRLAR+SQ+  DS  + NS E  +Q SRY+YGD  RL 
Sbjct:  177 QDLQRASQKLTDETRKCFRDTFYRLARSSQDNSDS-VSPNSEELLVQTSRYNYGDGNRL- 234

Query:  888 SREEEMESETNSIDRAVANLTYNKMESNISNFPLPERVQ 1004
            SREEE+E+ETNSIDRAVANLT+NKMESNISNFPL ERVQ
Sbjct:  235 SREEEIETETNSIDRAVANLTFNKMESNISNFPLSERVQ 273

>gi|18399568|ref|NP_566419.1| uncharacterized protein [Arabidopsis thaliana]

          Length = 269

 Score =  386 bits (991), Expect = 4e-105
 Identities = 212/279 (75%), Positives = 227/279 (81%), Gaps = 21/279 (7%)
 Frame = +3

Query:  201 MDCYTGRNFEDFVVPNYQETSSETC-LDGMWGGWSMNSPEAAEKCFDYDRFN------TQ 359
            MDCY     E+ VVPNYQE+SSET    GMWGGWSM+SPEAAEKCFDYD FN      +Q
Sbjct:    1 MDCYA----EELVVPNYQESSSETYPSTGMWGGWSMSSPEAAEKCFDYDGFNGEGMMYSQ 56

Query:  360 MGMRTSEDEEEEEEESKRSKAFYGASSLHGFEGIEQMDDIFLSSILEDVPGSDGDVHRAS 539
            M MRTS    EEEEESKRSKAFYGASSLH FEGIEQMDD+FLSSILEDVP  DGDVHRA+
Sbjct:   57 MSMRTS----EEEEESKRSKAFYGASSLHDFEGIEQMDDMFLSSILEDVPEDDGDVHRAT 112

Query:  540 STNNSVGSSSMYGGS-EVPLFHSHAMPLKEGAPFTISDLSEENMF---VDDEMSSEELVL 707
            S+NNSVGSSSMYGG  EVP+FH H M  KE APFTISDLSEENM      DE+SSEE VL
Sbjct:  113 SSNNSVGSSSMYGGGREVPMFHCHDMSFKEEAPFTISDLSEENMLDSNYGDELSSEEFVL 172

Query:  708 QDLQRASEKLTDETRKCFRDTFYRLARNSQEKLDSDHTANSGEFHMQASRYDYGDNTRLM 887
            QDLQRAS+KLTDETRKCFRDTFYRLAR+SQ+K DS  + NS E  MQ SRYDYGD  R  
Sbjct:  173 QDLQRASQKLTDETRKCFRDTFYRLARSSQDKSDS-VSPNSEELLMQTSRYDYGDGNR-F 230

Query:  888 SREEEMESETNSIDRAVANLTYNKMESNISNFPLPERVQ 1004
            SREEE+ESETNSIDRAVANLT+NKMESNISNFPL ERVQ
Sbjct:  231 SREEEIESETNSIDRAVANLTFNKMESNISNFPLSERVQ 269

>gi|297806721|ref|XP_002871244.1| hypothetical protein ARALYDRAFT_487518
        [Arabidopsis lyrata subsp. lyrata]

          Length = 269

 Score =  233 bits (592), Expect = 7e-059
 Identities = 152/278 (54%), Positives = 175/278 (62%), Gaps = 47/278 (16%)
 Frame = +3

Query: 201 MDCYTGRNFEDFVVPNYQETSSETCLDGMWG-GWSMNSPEAAEKCFDYDRFNT------- 356
           MD Y+ RN ED VVPNYQETS       MWG GWSMNS EAAEKCFDYD  +        
Sbjct:   1 MDRYSRRNLEDLVVPNYQETSDSYPSPDMWGTGWSMNSSEAAEKCFDYDVIHNGFSGGLY 60

Query: 357 ---QMGMRTSEDEEEEEEESKRSKAFYGASSLHGFEGIEQMDDIFLSSILEDVPGSDGDV 527
              QM M TSE  EEE +  K S  F    SLH F+ I+QMDD+FLSSILEDVPG +  +
Sbjct:  61 SQMQMEMGTSEQVEEETKRLKASGCF--DRSLHDFDEIQQMDDMFLSSILEDVPGDENFL 118

Query: 528 -HRASSTNNSVGSSSMY----GGSEVPLFH------SHAMPLKEGAPFTISDLSEENMFV 674
             + S TNNS GSSS Y     G EVP+FH           ++E AP    +L EEN+  
Sbjct: 119 SFKESDTNNSSGSSSAYLDTTDGREVPMFHYNWETCQDMQLMEEDAPM---NLCEENI-- 173

Query: 675 DDEMSSEELVLQDLQRASEKLTDETRKCFRDTFYRLARNSQEKLDSDHTANSGEFHMQAS 854
            +E S+EE+VLQDLQRA+E LTD+TRKCFRDTFYRLA+NSQ+K DS    N  EF     
Sbjct: 174 -EEASAEEVVLQDLQRATEMLTDDTRKCFRDTFYRLAKNSQQKSDS----NPDEF----- 223

Query: 855 RYDYGDNTRLMSREEEMESETNSIDRAVANLTYNKMES 968
                D T   SRE E+  ETNSIDRAVANLT+NKMES
Sbjct: 224 ---LEDRT---SRESEL--ETNSIDRAVANLTFNKMES 253

>gi|334187476|ref|NP_001190245.1| uncharacterized protein [Arabidopsis
        thaliana]

          Length = 279

 Score =  228 bits (580), Expect = 2e-057
 Identities = 146/287 (50%), Positives = 177/287 (61%), Gaps = 33/287 (11%)
 Frame = +3

Query:  201 MDCYTGRNFEDFVVPNYQETSSETCLDGMWG-GWSMNSPEAAEKCFDYDRFNT------- 356
            MD Y+ RN ED VVPNYQETS       MWG GWSMNS EAAEKCFDYD  +        
Sbjct:    1 MDRYSRRNLEDLVVPNYQETSDSYPSPDMWGTGWSMNSSEAAEKCFDYDVIHNGFSGGLY 60

Query:  357 ---QMGMRTSEDEEEEEEESKRSKAFYGASSLHGFEGIEQMDDIFLSSILEDVPGSDGDV 527
               +M M TSE  EEE ++ K S  F    SLH F+ I+ MDD+F  SILEDVPG++  +
Sbjct:   61 SQMEMDMGTSEQVEEETKKLKASGCF--DRSLHDFDEIQHMDDMFF-SILEDVPGNENFL 117

Query:  528 HRASSTNNSVGSSSMY---GGSEVPLFH-----SHAMPL-KEGAPFTISDLSEENMFVDD 680
                S NN+  SSS      G EVPLFH        MPL +E AP    +L EEN    +
Sbjct:  118 SFKESDNNNSSSSSYLDTTDGREVPLFHYNWETCQDMPLMEEDAPM---NLCEEN---KE 171

Query:  681 EMSSEELVLQDLQRASEKLTDETRKCFRDTFYRLARNSQEKLDSDHTANSGEFHMQASRY 860
            E S+EE+VLQDLQRA+E LTD+TRKCFRDTFYRLA+NSQ+K DS    NS EF    +  
Sbjct:  172 EASAEEVVLQDLQRATEMLTDDTRKCFRDTFYRLAKNSQQKSDS----NSDEFLEDRTSS 227

Query:  861 DYGDNTRLMSREEEMESETNSIDRAVANLTYNKMESNISNFPLPERV 1001
            +    +       ++  + NSIDRAVANLT+NKMESN+ N P P+R+
Sbjct:  228 NDSSPSMTFLSVGKLNLKPNSIDRAVANLTFNKMESNMRNMPPPKRL 274

>gi|18415369|ref|NP_568176.1| uncharacterized protein [Arabidopsis thaliana]

          Length = 275

 Score =  204 bits (517), Expect = 4e-050
 Identities = 136/269 (50%), Positives = 162/269 (60%), Gaps = 43/269 (15%)
 Frame = +3

Query: 201 MDCYTGRNFEDFVVPNYQETSSETCLDGMWG-GWSMNSPEAAEKCFDYDRFNT------- 356
           MD Y+ RN ED VVPNYQETS       MWG GWSMNS EAAEKCFDYD  +        
Sbjct:   1 MDRYSRRNLEDLVVPNYQETSDSYPSPDMWGTGWSMNSSEAAEKCFDYDVIHNGFSGGLY 60

Query: 357 ---QMGMRTSEDEEEEEEESKRSKAFYGASSLHGFEGIEQMDDIFLSSILEDVPGSDGDV 527
              +M M TSE  EEE ++ K S  F    SLH F+ I+ MDD+FLSSILEDVPG++  +
Sbjct:  61 SQMEMDMGTSEQVEEETKKLKASGCF--DRSLHDFDEIQHMDDMFLSSILEDVPGNENFL 118

Query: 528 HRASSTNNSVGSSSMY---GGSEVPLFH-----SHAMPL-KEGAPFTISDLSEENMFVDD 680
               S NN+  SSS      G EVPLFH        MPL +E AP    +L EEN    +
Sbjct: 119 SFKESDNNNSSSSSYLDTTDGREVPLFHYNWETCQDMPLMEEDAPM---NLCEEN---KE 172

Query: 681 EMSSEELVLQDLQRASEKLTDETRKCFRDTFYRLARNSQEKLDSDHTANSGEFHMQASRY 860
           E S+EE+VLQDLQRA+E LTD+TRKCFRDTFYRLA+NSQ+K DS    NS EF       
Sbjct: 173 EASAEEVVLQDLQRATEMLTDDTRKCFRDTFYRLAKNSQQKSDS----NSDEF------- 221

Query: 861 DYGDNTRLMSREEEMESETNSIDRAVANL 947
              D T   SRE E E++ N   R  +++
Sbjct: 222 -LEDRT---SRETEFETKLNRQSRGQSHI 246

>gi|30681695|ref|NP_850785.1| uncharacterized protein [Arabidopsis thaliana]

          Length = 274

 Score =  202 bits (512), Expect = 1e-049
 Identities = 135/269 (50%), Positives = 161/269 (59%), Gaps = 44/269 (16%)
 Frame = +3

Query: 201 MDCYTGRNFEDFVVPNYQETSSETCLDGMWG-GWSMNSPEAAEKCFDYDRFNT------- 356
           MD Y+ RN ED VVPNYQETS       MWG GWSMNS EAAEKCFDYD  +        
Sbjct:   1 MDRYSRRNLEDLVVPNYQETSDSYPSPDMWGTGWSMNSSEAAEKCFDYDVIHNGFSGGLY 60

Query: 357 ---QMGMRTSEDEEEEEEESKRSKAFYGASSLHGFEGIEQMDDIFLSSILEDVPGSDGDV 527
              +M M TSE  EEE ++ K S  F    SLH F+ I+ MDD+FLSSILEDVPG++  +
Sbjct:  61 SQMEMDMGTSEQVEEETKKLKASGCF--DRSLHDFDEIQHMDDMFLSSILEDVPGNENFL 118

Query: 528 HRASSTNNSVGSSSMY---GGSEVPLFH-----SHAMPL-KEGAPFTISDLSEENMFVDD 680
               S NN+  SSS      G EVPLFH        MPL +E AP    +L EEN    +
Sbjct: 119 SFKESDNNNSSSSSYLDTTDGREVPLFHYNWETCQDMPLMEEDAPM---NLCEEN---KE 172

Query: 681 EMSSEELVLQDLQRASEKLTDETRKCFRDTFYRLARNSQEKLDSDHTANSGEFHMQASRY 860
           E S+EE+VLQDLQRA+E LTD+TRKCFRDTFYRLA+NSQ+K DS    NS EF       
Sbjct: 173 EASAEEVVLQDLQRATEMLTDDTRKCFRDTFYRLAKNSQQKSDS----NSDEF------- 221

Query: 861 DYGDNTRLMSREEEMESETNSIDRAVANL 947
              D T    RE E E++ N   R  +++
Sbjct: 222 -LEDRT----RETEFETKLNRQSRGQSHI 245

>gi|334187478|ref|NP_001190246.1| uncharacterized protein [Arabidopsis
        thaliana]

          Length = 266

 Score =  194 bits (491), Expect = 4e-047
 Identities = 120/220 (54%), Positives = 141/220 (64%), Gaps = 28/220 (12%)
 Frame = +3

Query: 201 MDCYTGRNFEDFVVPNYQETSSETCLDGMWG-GWSMNSPEAAEKCFDYDRFNT------- 356
           MD Y+ RN ED VVPNYQETS       MWG GWSMNS EAAEKCFDYD  +        
Sbjct:   1 MDRYSRRNLEDLVVPNYQETSDSYPSPDMWGTGWSMNSSEAAEKCFDYDVIHNGFSGGLY 60

Query: 357 ---QMGMRTSEDEEEEEEESKRSKAFYGASSLHGFEGIEQMDDIFLSSILEDVPGSDGDV 527
              +M M TSE  EEE ++ K S  F    SLH F+ I+ MDD+FLSSILEDVPG++  +
Sbjct:  61 SQMEMDMGTSEQVEEETKKLKASGCF--DRSLHDFDEIQHMDDMFLSSILEDVPGNENFL 118

Query: 528 HRASSTNNSVGSSSMY---GGSEVPLFH-----SHAMPL-KEGAPFTISDLSEENMFVDD 680
               S NN+  SSS      G EVPLFH        MPL +E AP    +L EEN    +
Sbjct: 119 SFKESDNNNSSSSSYLDTTDGREVPLFHYNWETCQDMPLMEEDAPM---NLCEEN---KE 172

Query: 681 EMSSEELVLQDLQRASEKLTDETRKCFRDTFYRLARNSQE 800
           E S+EE+VLQDLQRA+E LTD+TRKCFRDTFYRLA+NSQ+
Sbjct: 173 EASAEEVVLQDLQRATEMLTDDTRKCFRDTFYRLAKNSQQ 212

>gi|14596123|gb|AAK68789.1| Unknown protein [Arabidopsis thaliana]

          Length = 246

 Score =  173 bits (438), Expect = 5e-041
 Identities = 119/241 (49%), Positives = 144/241 (59%), Gaps = 44/241 (18%)
 Frame = +3

Query: 285 MWG-GWSMNSPEAAEKCFDYDRFNT----------QMGMRTSEDEEEEEEESKRSKAFYG 431
           MWG GWSMNS EAAEKCFDYD  +           +M M TSE  EEE ++ K S  F  
Sbjct:   1 MWGTGWSMNSSEAAEKCFDYDVIHNGFSGGLYSQMEMDMGTSEQVEEETKKLKASGCF-- 58

Query: 432 ASSLHGFEGIEQMDDIFLSSILEDVPGSDGDVHRASSTNNSVGSSSMY---GGSEVPLFH 602
             SLH F+ I+ MDD+FLSSILEDVPG++  +    S NN+  SSS      G EVPLFH
Sbjct:  59 DRSLHDFDEIQHMDDMFLSSILEDVPGNENFLSFKESDNNNSSSSSYLDTTDGREVPLFH 118

Query: 603 -----SHAMPL-KEGAPFTISDLSEENMFVDDEMSSEELVLQDLQRASEKLTDETRKCFR 764
                   MPL +E AP    +L EEN    +E S+EE+VLQDLQRA+E LTD+TRKCFR
Sbjct: 119 YNWETCQDMPLMEEDAPM---NLCEEN---KEEASAEEVVLQDLQRATEMLTDDTRKCFR 172

Query: 765 DTFYRLARNSQEKLDSDHTANSGEFHMQASRYDYGDNTRLMSREEEMESETNSIDRAVAN 944
           DTFYRLA+NSQ+K DS    NS EF          D T    RE E E++ N   R  ++
Sbjct: 173 DTFYRLAKNSQQKSDS----NSDEF--------LEDRT----RETEFETKLNRQSRGQSH 216

Query: 945 L 947
           +
Sbjct: 217 I 217

>gi|312281469|dbj|BAJ33600.1| unnamed protein product [Thellungiella halophila]

          Length = 286

 Score =  151 bits (380), Expect = 3e-034
 Identities = 99/183 (54%), Positives = 119/183 (65%), Gaps = 21/183 (11%)
 Frame = +3

Query: 339 YDRFNTQMGMRTSEDEEEEEEESKRSKA----FY-GASSLHGFEGIEQMDDIFLSSILED 503
           Y +   +M M       E EEESKR KA    FY   SSLH F+GI+QMDDIFLSSILED
Sbjct:  59 YSQMEMEMEMEMG-TSGEVEEESKRLKAAGDCFYRPTSSLHDFDGIQQMDDIFLSSILED 117

Query: 504 VPGSDGDVHRASSTNNSVGSSSMY----GGSEVPLFH-----SHAMPL--KEGAPFTISD 650
           VPG++G    +   N+S G SS Y     G EVP+FH        MPL  +  A   IS+
Sbjct: 118 VPGNEGLHSFSELDNDSPGPSSAYLSNLDGIEVPMFHYDWETCQDMPLMGEHEASMKISE 177

Query: 651 LSEENMFVDDEMSSEELVLQDLQRASEKLTDETRKCFRDTFYRLARNSQEKLDSDHTANS 830
           L EENM   +E S+EE+VLQDLQRA+EKLTD+TRKCFRDTFYRLA+NSQ+K +S +  N 
Sbjct: 178 LCEENM---EEPSNEEVVLQDLQRATEKLTDDTRKCFRDTFYRLAKNSQQKSESGNN-NP 233

Query: 831 GEF 839
            EF
Sbjct: 234 EEF 236


 Score =  75 bits (183), Expect = 2e-011
 Identities = 40/69 (57%), Positives = 44/69 (63%), Gaps = 1/69 (1%)
 Frame = +3

Query: 201 MDCYTGRNFEDFVVPNYQETSSETCLDGMWGGWSMNSPEAAEKCFDYDRFNTQM-GMRTS 377
           MD Y+ RN ED VVPNYQETS       MWGGWSMNS +AAEKCFD+D  N    G   S
Sbjct:   1 MDRYSRRNSEDLVVPNYQETSDSYPSPDMWGGWSMNSQKAAEKCFDFDVINNGFSGGLYS 60

Query: 378 EDEEEEEEE 404
           + E E E E
Sbjct:  61 QMEMEMEME 69


 Score =  57 bits (135), Expect = 7e-006
 Identities = 25/38 (65%), Positives = 33/38 (86%)
 Frame = +3

Query:  888 SREEEMESETNSIDRAVANLTYNKMESNISNFPLPERV 1001
            +R+++ E ETNSIDRA+ANLT+NKMESN+ N P P+RV
Sbjct:  244 NRDQKTELETNSIDRAIANLTFNKMESNMRNLPPPKRV 281

>gi|255578233|ref|XP_002529984.1| hypothetical protein RCOM_1264330 [Ricinus
        communis]

          Length = 365

 Score =  91 bits (225), Expect = 3e-016
 Identities = 65/181 (35%), Positives = 98/181 (54%), Gaps = 10/181 (5%)
 Frame = +3

Query: 450 FEGIEQMDDIFLSSILEDVPGSDGDVHRASSTNNSVGSSSMYGGSEVPLFHS---HAMPL 620
           FE   + D +   SIL D+   + D    SS   SVGSS         L ++   H  P 
Sbjct: 134 FEPELENDMVHGDSILTDM---NLDTPSISSDTQSVGSSKTSSDDIRALSYTYVRHEAPH 190

Query: 621 KEGAPFTISDLSEENMFVDDEMSSEELVLQDLQRASEKLTDETRKCFRDTFYRLARNSQE 800
            E    +  D  +EN  VD+E S EE VLQ+++    +LTD+TR CFRD  YRLA+NS++
Sbjct: 191 VEVLVPSEHDSIKEN--VDEETSLEESVLQEMEMVMSQLTDKTRICFRDALYRLAKNSRQ 248

Query: 801 KLDSDHTANSGEFHMQASRYDYGDNTRLMSREEEMESETNSIDRAVANLTYNKMESNISN 980
            + + +   +G   +  S++   D       ++ ME ETN+IDRA+ANL +NKM+ N+ +
Sbjct: 249 NVVTKN--QNGNLQLAISQWTDQDCKIRPREKKTMELETNTIDRAIANLMFNKMDINVHD 306

Query: 981 F 983
           +
Sbjct: 307 Y 307

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,498,019,856,450
Number of Sequences: 15229318
Number of Extensions: 2498019856450
Number of Successful Extensions: 628698129
Number of sequences better than 0.0: 0
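
The bit scores reported above can be related to the raw scores shown in parentheses through the standard Karlin-Altschul normalization, bit score = (Lambda * S - ln K) / ln 2. The sketch below is illustrative only: it assumes the gapped Lambda and K printed in this report were the values used for every hit, and the function name `bit_score` is hypothetical, not part of any BLAST tool.

```python
import math

# Gapped Karlin-Altschul parameters copied from the statistics block of this report.
LAMBDA = 0.267
K = 0.041


def bit_score(raw_score: float, lam: float = LAMBDA, k: float = K) -> float:
    """Normalize a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# (raw score, reported bit score) pairs taken from the first three hits above.
hits = [(1019, 397), (991, 386), (592, 233)]

for raw, reported in hits:
    # Print the recomputed value next to the value the report lists.
    print(f"raw={raw}  computed={bit_score(raw):.1f}  reported={reported}")
```

Because Lambda and K are printed to only three decimal places, the recomputed values may differ from the reported ones in the first decimal, so the comparison is best made after rounding to whole bits.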