GenBank BLAST output for UN28842


BLASTX 7.6.2

Query= UN28842 /QuerySize=910
        (909 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|2651308|gb|AAB87588.1| hypothetical protein [Arabidopsis thal...    246   4e-063
gi|79572219|ref|NP_181580.2| uncharacterized protein [Arabidopsi...    246   4e-063
gi|297823979|ref|XP_002879872.1| hypothetical protein ARALYDRAFT...    243   3e-062
gi|224139326|ref|XP_002323057.1| predicted protein [Populus tric...     94   2e-017
gi|297816948|ref|XP_002876357.1| hypothetical protein ARALYDRAFT...     70   5e-010
gi|15228874|ref|NP_191186.1| uncharacterized protein [Arabidopsi...     69   1e-009
gi|194697882|gb|ACF83025.1| unknown [Zea mays]                          64   4e-008
gi|226528531|ref|NP_001144574.1| hypothetical protein LOC1002775...     64   4e-008
gi|255559639|ref|XP_002520839.1| conserved hypothetical protein ...     61   2e-007
gi|118481019|gb|ABK92463.1| unknown [Populus trichocarpa]               59   7e-007
gi|255586107|ref|XP_002533717.1| conserved hypothetical protein ...     58   2e-006

>gi|2651308|gb|AAB87588.1| hypothetical protein [Arabidopsis thaliana]

          Length = 541

 Score =  246 bits (627), Expect = 4e-063
 Identities = 133/173 (76%), Positives = 147/173 (84%), Gaps = 12/173 (6%)
 Frame = +3

Query: 228 SSRSGVLKKLEEATQGVKQSKQDLETALNRLEIANVKQLAAENAFRGWTKHS-------P 386
           SSR G+LKKLEEAT+GVKQSKQ LE ALNR+EIANVKQLAAENAFRGWTK S       P
Sbjct: 362 SSRRGILKKLEEATEGVKQSKQALEAALNRVEIANVKQLAAENAFRGWTKDSLKGDNFTP 421

Query: 387 VSQTRRSFFRHLNKQHEPVNNLSKPVLKSNVSMRDVLRRKQVPKEDVVVSER--LEGQ-- 554
           ++ TRRSFF HLNK HEP++ L KPVLKSN+SMRDVLRRKQVPKEDVV  +R  LEGQ  
Sbjct: 422 LNHTRRSFFSHLNKHHEPLDILPKPVLKSNISMRDVLRRKQVPKEDVVAPQRQSLEGQIP 481

Query: 555 RRNVNLSQMLTELKHDIKSSWARGEKEEVHKEKRFVTQRRKFGFIHITLPMQK 713
           RRNVNLSQML ELK D+K S ARGEKEEVH+EK++VTQRRKFGFIHITLP+QK
Sbjct: 482 RRNVNLSQMLKELKQDVKFS-ARGEKEEVHEEKQYVTQRRKFGFIHITLPLQK 533


 Score =  89 bits (219), Expect = 8e-016
 Identities = 56/75 (74%), Positives = 61/75 (81%), Gaps = 8/75 (10%)
 Frame = +3

Query:  12 DETPREQVKMVA---ETCLNKQNKNCLIITAEMRLVAARKMEEAARAAEALAIAEITMLS 182
           DE  REQVKMVA   ET LN QNKN L  TAEMRLVAARKMEEAA+AAEALAIAEITML 
Sbjct: 266 DEPLREQVKMVAEADETGLNLQNKNSL-RTAEMRLVAARKMEEAAKAAEALAIAEITML- 323

Query: 183 SSSSSNGESEEDDTD 227
              SSNGES++DD++
Sbjct: 324 ---SSNGESQDDDSE 335

>gi|79572219|ref|NP_181580.2| uncharacterized protein [Arabidopsis thaliana]

          Length = 518

 Score =  246 bits (627), Expect = 4e-063
 Identities = 133/173 (76%), Positives = 147/173 (84%), Gaps = 12/173 (6%)
 Frame = +3

Query: 228 SSRSGVLKKLEEATQGVKQSKQDLETALNRLEIANVKQLAAENAFRGWTKHS-------P 386
           SSR G+LKKLEEAT+GVKQSKQ LE ALNR+EIANVKQLAAENAFRGWTK S       P
Sbjct: 339 SSRRGILKKLEEATEGVKQSKQALEAALNRVEIANVKQLAAENAFRGWTKDSLKGDNFTP 398

Query: 387 VSQTRRSFFRHLNKQHEPVNNLSKPVLKSNVSMRDVLRRKQVPKEDVVVSER--LEGQ-- 554
           ++ TRRSFF HLNK HEP++ L KPVLKSN+SMRDVLRRKQVPKEDVV  +R  LEGQ  
Sbjct: 399 LNHTRRSFFSHLNKHHEPLDILPKPVLKSNISMRDVLRRKQVPKEDVVAPQRQSLEGQIP 458

Query: 555 RRNVNLSQMLTELKHDIKSSWARGEKEEVHKEKRFVTQRRKFGFIHITLPMQK 713
           RRNVNLSQML ELK D+K S ARGEKEEVH+EK++VTQRRKFGFIHITLP+QK
Sbjct: 459 RRNVNLSQMLKELKQDVKFS-ARGEKEEVHEEKQYVTQRRKFGFIHITLPLQK 510


 Score =  89 bits (219), Expect = 8e-016
 Identities = 56/75 (74%), Positives = 61/75 (81%), Gaps = 8/75 (10%)
 Frame = +3

Query:  12 DETPREQVKMVA---ETCLNKQNKNCLIITAEMRLVAARKMEEAARAAEALAIAEITMLS 182
           DE  REQVKMVA   ET LN QNKN L  TAEMRLVAARKMEEAA+AAEALAIAEITML 
Sbjct: 243 DEPLREQVKMVAEADETGLNLQNKNSL-RTAEMRLVAARKMEEAAKAAEALAIAEITML- 300

Query: 183 SSSSSNGESEEDDTD 227
              SSNGES++DD++
Sbjct: 301 ---SSNGESQDDDSE 312

>gi|297823979|ref|XP_002879872.1| hypothetical protein ARALYDRAFT_483106
        [Arabidopsis lyrata subsp. lyrata]

          Length = 518

 Score =  243 bits (620), Expect = 3e-062
 Identities = 132/174 (75%), Positives = 144/174 (82%), Gaps = 12/174 (6%)
 Frame = +3

Query: 225 DSSRSGVLKKLEEATQGVKQSKQDLETALNRLEIANVKQLAAENAFRGWTKHS------- 383
           +SSR G+LKKLEEAT+GVKQSKQ LE ALNR EIANVKQLAAENAFRGWTK S       
Sbjct: 338 NSSRRGILKKLEEATEGVKQSKQALEAALNRAEIANVKQLAAENAFRGWTKDSSKGDNFT 397

Query: 384 PVSQTRRSFFRHLNKQHEPVNNLSKPVLKSNVSMRDVLRRKQVPKEDVVVSER--LEGQ- 554
           P+  TRRSFF HLNK HEP++NL KPVLKSNVSMRDVLRRKQVPKEDVV  +R  LEGQ 
Sbjct: 398 PLHHTRRSFFSHLNKHHEPLDNLPKPVLKSNVSMRDVLRRKQVPKEDVVAPQRQSLEGQI 457

Query: 555 -RRNVNLSQMLTELKHDIKSSWARGEKEEVHKEKRFVTQRRKFGFIHITLPMQK 713
            RRN NLSQML ELK D+K S  R EKEEVH+EK++VTQRRKFGFIHITLP+QK
Sbjct: 458 PRRNANLSQMLKELKQDVKFS-TRAEKEEVHEEKQYVTQRRKFGFIHITLPLQK 510


 Score =  90 bits (222), Expect = 4e-016
 Identities = 57/76 (75%), Positives = 61/76 (80%), Gaps = 8/76 (10%)
 Frame = +3

Query:   9 PDETPREQVKMVA---ETCLNKQNKNCLIITAEMRLVAARKMEEAARAAEALAIAEITML 179
           PDE  RE VKMVA   ET LN QNKN L  TAEMRLVAARKMEEAARAAEALAIAEITML
Sbjct: 242 PDEPLREHVKMVAETDETGLNLQNKNRL-RTAEMRLVAARKMEEAARAAEALAIAEITML 300

Query: 180 SSSSSSNGESEEDDTD 227
               SSNGES++DD++
Sbjct: 301 ----SSNGESQDDDSE 312

>gi|224139326|ref|XP_002323057.1| predicted protein [Populus trichocarpa]

          Length = 426

 Score =  94 bits (233), Expect = 2e-017
 Identities = 58/108 (53%), Positives = 73/108 (67%), Gaps = 5/108 (4%)
 Frame = +3

Query:  84 IITAEMRLVAARKMEEAARAAEALAIAEITMLSSSSSSNGESEEDDTDSSRSGVLKKLEE 263
           I TAE+RL+AARKM+EAARA EA+A+AEI  LSS  +S+ +S +         VLKK+EE
Sbjct: 198 IQTAEIRLIAARKMKEAARAVEAVALAEIKALSSHENSSAKSTQKPEGME---VLKKVEE 254

Query: 264 ATQGVKQSKQDLETALNRLEIANVKQLAAENAFRGWTKHSPVSQTRRS 407
           AT+ +K SK+ LE ALNR+E AN  +LA E A R W   S   Q RRS
Sbjct: 255 ATEEIKTSKKALEEALNRVEAANKGKLAVEEALRRW--RSEHGQKRRS 300

>gi|297816948|ref|XP_002876357.1| hypothetical protein ARALYDRAFT_486067
        [Arabidopsis lyrata subsp. lyrata]

          Length = 440

 Score =  70 bits (169), Expect = 5e-010
 Identities = 41/56 (73%), Positives = 46/56 (82%), Gaps = 4/56 (7%)
 Frame = +3

Query:  27 EQVKMVAE---TCLNKQNKNCLIITAEMRLVAARKMEEAARAAEALAIAEITMLSS 185
           EQ+KMV E   T  +KQ+K CL  TAEMRLVAARKMEEAARAAEA AIAE+T+LSS
Sbjct: 223 EQIKMVVETNDTAFHKQSKTCL-RTAEMRLVAARKMEEAARAAEAFAIAEMTILSS 277


 Score =  63 bits (152), Expect = 5e-008
 Identities = 32/59 (54%), Positives = 43/59 (72%)
 Frame = +3

Query: 207 SEEDDTDSSRSGVLKKLEEATQGVKQSKQDLETALNRLEIANVKQLAAENAFRGWTKHS 383
           ++E   + SR  +L+KLEEA + VKQSK+ LE ALNR+E+AN KQL A++AFR W   S
Sbjct: 301 NKELSANVSRIEILRKLEEANEEVKQSKKALEMALNRVEVANTKQLEAQDAFRQWNIES 359

>gi|15228874|ref|NP_191186.1| uncharacterized protein [Arabidopsis thaliana]

          Length = 446

 Score =  69 bits (166), Expect = 1e-009
 Identities = 39/78 (50%), Positives = 50/78 (64%)
 Frame = +3

Query: 207 SEEDDTDSSRSGVLKKLEEATQGVKQSKQDLETALNRLEIANVKQLAAENAFRGWTKHSP 386
           ++E  T+ SR  +L+KLEEA + VKQSKQ LE ALNR+EIA+VKQL AE AFR W   S 
Sbjct: 308 NKELSTNVSRIEILRKLEEANEEVKQSKQALEVALNRVEIASVKQLEAEEAFRQWNIESW 367

Query: 387 VSQTRRSFFRHLNKQHEP 440
             Q      R + ++  P
Sbjct: 368 KDQKAVGAKRSMKRESFP 385


 Score =  67 bits (161), Expect = 4e-009
 Identities = 39/56 (69%), Positives = 46/56 (82%), Gaps = 4/56 (7%)
 Frame = +3

Query:  27 EQVKMVAE---TCLNKQNKNCLIITAEMRLVAARKMEEAARAAEALAIAEITMLSS 185
           EQ+KMV E   T  +KQ+K C   TA+MRLVAARKMEEAARAAEALA+AE+T+LSS
Sbjct: 230 EQIKMVVETYDTAFHKQSKTC-PRTADMRLVAARKMEEAARAAEALALAEMTILSS 284

>gi|194697882|gb|ACF83025.1| unknown [Zea mays]

          Length = 318

 Score =  64 bits (153), Expect = 4e-008
 Identities = 45/114 (39%), Positives = 65/114 (57%), Gaps = 15/114 (13%)
 Frame = +3

Query:  12 DETPREQVKMVAETCLNKQNKNCLIITAEMRLVAARKMEEAARAAEALAIAEITMLSSSS 191
           D++  E + + AE     +     I TAE+R +AARKME+AARAAEALA+AEI  L SS 
Sbjct:  36 DDSKSEAMVLAAEI----EQVKASICTAEVRCIAARKMEDAARAAEALALAEIKALVSS- 90

Query: 192 SSNGESEEDDTDSSRSGVLKKLEE-------ATQGVKQSKQDLETALNRLEIAN 332
              G S E DT S   GV   +EE       A +  + S++ +E A+ +++ A+
Sbjct:  91 ---GSSFEGDTASDGGGVTLSMEEYFKLCSRALEADESSRRKVENAMLQVDAAD 141

>gi|226528531|ref|NP_001144574.1| hypothetical protein LOC100277584 [Zea mays]

          Length = 545

 Score =  64 bits (153), Expect = 4e-008
 Identities = 45/114 (39%), Positives = 65/114 (57%), Gaps = 15/114 (13%)
 Frame = +3

Query:  12 DETPREQVKMVAETCLNKQNKNCLIITAEMRLVAARKMEEAARAAEALAIAEITMLSSSS 191
           D++  E + + AE     +     I TAE+R +AARKME+AARAAEALA+AEI  L SS 
Sbjct: 263 DDSKSEAMVLAAEI----EQVKASICTAEVRCIAARKMEDAARAAEALALAEIKALVSS- 317

Query: 192 SSNGESEEDDTDSSRSGVLKKLEE-------ATQGVKQSKQDLETALNRLEIAN 332
              G S E DT S   GV   +EE       A +  + S++ +E A+ +++ A+
Sbjct: 318 ---GSSFEGDTASDGGGVTLSMEEYFKLCSRALEADESSRRKVENAMLQVDAAD 368

>gi|255559639|ref|XP_002520839.1| conserved hypothetical protein [Ricinus
        communis]

          Length = 561

 Score =  61 bits (147), Expect = 2e-007
 Identities = 46/133 (34%), Positives = 76/133 (57%), Gaps = 10/133 (7%)
 Frame = +3

Query:  42 VAETCLNKQNKNCLIITAEMRLVAARKMEEAARAAEALAIAEITMLSSSSSSNG----ES 209
           V++  L  ++    + TAE+RL+AA+KMEEAARAAEA+A+AEI  LS + SS+G    E 
Sbjct: 269 VSKAMLANEHTKTNLKTAELRLLAAKKMEEAARAAEAVALAEIKALSGNDSSSGFVLPEP 328

Query: 210 EEDDTDSSRSGVLKKLEEATQGVKQSKQDLETALNRLEIANVKQLAAENAFRGWTKHSPV 389
           E+  +  +R+ +  K ++A    K+    +E A  +   AN+ +++     R  T+   V
Sbjct: 329 EKVSSFDARTPLTPKAQKAEGLAKK----VEVARLQRREANITKMSILRKLREATEE--V 382

Query: 390 SQTRRSFFRHLNK 428
            Q+++     LNK
Sbjct: 383 KQSKQVLEEALNK 395


 Score =  60 bits (144), Expect = 4e-007
 Identities = 43/115 (37%), Positives = 61/115 (53%), Gaps = 9/115 (7%)
 Frame = +3

Query: 219 DTDSSRSGVLKKLEEATQGVKQSKQDLETALNRLEIANVKQLAAENAFRGWTKHSPVSQ- 395
           + + ++  +L+KL EAT+ VKQSKQ LE ALN++E+AN KQ AAE A R W   +     
Sbjct: 363 EANITKMSILRKLREATEEVKQSKQVLEEALNKVEMANRKQSAAEEAIRKWMPENDQEGQ 422

Query: 396 -----TRRSFFRHLNKQHEPVNNLSKPVLKSNVSMRDVLRRKQVPKEDVVVSERL 545
                T R    HL   H+P N+   P+ ++  S      RK V K  V + + L
Sbjct: 423 AAAYCTTRFSNYHL---HQPNNHQDSPLHEAKESNLVNEDRKPVLKSTVSMRDVL 474

>gi|118481019|gb|ABK92463.1| unknown [Populus trichocarpa]

          Length = 538

 Score =  59 bits (142), Expect = 7e-007
 Identities = 60/182 (32%), Positives = 99/182 (54%), Gaps = 26/182 (14%)
 Frame = +3

Query:   3 VKPDETPREQV--KMVAETCLNKQNKNCLIITAEMRLVAARKMEEAARAAEALAIAEITM 176
           +K DE  + +V   M+AE      N    I TAE+RL+AA+KMEEAARAAEA+A+AEI  
Sbjct: 236 MKMDEVQKTEVLKGMLAE------NIKTNIRTAELRLLAAKKMEEAARAAEAVALAEIKA 289

Query: 177 LSSSSSSNG----ESEEDDTDSSRSGVLKKLEEATQGVKQSKQDLETALNRLEIANVKQL 344
           LS+  SS+G    E E+  +  +RS +  K ++A +    S++ +ET     +  +  ++
Sbjct: 290 LSTDESSSGYALPEPEKVPSFEARSPLNPKDQKAEE---LSQKKVETLKLPKQEVHFTKM 346

Query: 345 AAENAFRGWTKHSPVSQTRRSFFRHLNKQHEPVNNLSKPVLKSNVSMRDVLRRKQVPKED 524
           +  N  R  T+   V  ++++    LNK  E  N       +  V++ + + RK +P++D
Sbjct: 347 SILNKLREATEE--VKLSKQALEEALNKV-EMAN-------RKQVAVEEAI-RKWMPEDD 395

Query: 525 VV 530
            V
Sbjct: 396 QV 397

>gi|255586107|ref|XP_002533717.1| conserved hypothetical protein [Ricinus
        communis]

          Length = 555

 Score =  58 bits (138), Expect = 2e-006
 Identities = 38/92 (41%), Positives = 59/92 (64%), Gaps = 8/92 (8%)
 Frame = +3

Query:  84 IITAEMRLVAARKMEEAARAAEALAIAEITMLSSSSSSNGESEEDD-----TDSSRSGVL 248
           I TAE+RLVAARKM++AA+AAEA+A+AEI  +SS  +S+G+S +       T    S + 
Sbjct: 281 IKTAEIRLVAARKMKQAAKAAEAVALAEIKAMSSHENSSGDSSKKAEGVTLTFEEYSSLT 340

Query: 249 KKLEEATQGVKQSKQDLETALNRLEIANVKQL 344
            K +EA +    SK  +  A+ +++ ANV ++
Sbjct: 341 SKAQEAEE---LSKTKVIDAMLQVDEANVSKM 369


 Score =  57 bits (136), Expect = 4e-006
 Identities = 34/80 (42%), Positives = 47/80 (58%), Gaps = 2/80 (2%)
 Frame = +3

Query: 165 EITMLSSSSSSNGESEEDDTDSSRSGVLKKLEEATQGVKQSKQDLETALNRLEIANVKQL 344
           E   LS +   +   + D+ + S+  +LKK+EEAT+ +K SK+ LE ALNR+E AN  +L
Sbjct: 345 EAEELSKTKVIDAMLQVDEANVSKMEILKKVEEATEEIKTSKKALEEALNRVEAANKGKL 404

Query: 345 AAENAFRGWTKHSPVSQTRR 404
           A E A R W   S   Q RR
Sbjct: 405 AVEEALRKW--RSEHGQKRR 422

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,432,271,374,555
Number of Sequences: 15229318
Number of Extensions: 3432271374555
Number of Successful Extensions: 818036173
Number of sequences better than 0.0: 0
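
Note on the scores: each HSP above lists a bit score followed by a raw score in parentheses, e.g. "246 bits (627)". The two are related by the standard Karlin-Altschul normalization, S' = (Lambda * S - ln K) / ln 2. The sketch below checks that relation against a few of the values in this report. It assumes that the Lambda and K printed in the "Gapped" block are the parameters the program applied; the function name `bit_score` is illustrative and not part of any BLAST tool.

```python
import math

# Statistical parameters copied from the "Gapped" block of this report.
LAMBDA = 0.267
K = 0.041


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score to a bit score.

    Standard Karlin-Altschul normalization:
        S' = (lambda * S - ln K) / ln 2
    """
    return (lam * raw_score - math.log(k)) / math.log(2)


# (raw score, bit score as printed in the report)
reported = [(627, 246), (620, 243), (233, 94), (219, 89)]

for raw, bits in reported:
    computed = bit_score(raw)
    print(f"raw={raw:4d}  computed={computed:7.2f}  reported={bits}")
```

Because Lambda and K are printed to only three decimal places, the computed values agree with the report after rounding to the nearest integer rather than exactly.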