Library    |     Search    |     Batch query    |     SNP    |     SSR  

GenBank blast output of UN72791


BLASTX 7.6.2

Query= UN72791 /QuerySize=666
        (665 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|297795649|ref|XP_002865709.1| hypothetical protein ARALYDRAFT...    217   1e-054
gi|21536969|gb|AAM61310.1| unknown [Arabidopsis thaliana]              206   3e-051
gi|18422990|ref|NP_568706.1| uncharacterized protein [Arabidopsi...    203   2e-050
gi|297829232|ref|XP_002882498.1| hypothetical protein ARALYDRAFT...    161   7e-038
gi|186509861|ref|NP_001118595.1| uncharacterized protein [Arabid...    141   8e-032
gi|255551795|ref|XP_002516943.1| conserved hypothetical protein ...     98   1e-018
gi|224107110|ref|XP_002314378.1| predicted protein [Populus tric...     96   4e-018
gi|296083358|emb|CBI22994.3| unnamed protein product [Vitis vini...     89   4e-016
gi|225431743|ref|XP_002270026.1| PREDICTED: hypothetical protein...     82   7e-014
gi|255645721|gb|ACU23354.1| unknown [Glycine max]                       72   6e-011
gi|30680042|ref|NP_187343.2| proline-rich family protein [Arabid...     64   2e-008
gi|6728994|gb|AAF26991.1|AC016827_2 unknown protein [Arabidopsis...     64   2e-008
gi|119176049|ref|XP_001240156.1| hypothetical protein CIMG_09777...     59   4e-007
gi|303318213|ref|XP_003069106.1| hypothetical protein CPC735_022...     59   4e-007
gi|320031713|gb|EFW13672.1| conserved hypothetical protein [Cocc...     59   4e-007

>gi|297795649|ref|XP_002865709.1| hypothetical protein ARALYDRAFT_917876
        [Arabidopsis lyrata subsp. lyrata]

          Length = 384

 Score =  217 bits (551), Expect = 1e-054
 Identities = 121/163 (74%), Positives = 133/163 (81%), Gaps = 15/163 (9%)
 Frame = +3

Query: 195 QQQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSPPSFRS 374
           + Q+MGDGMQC+NHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSS PSFRS
Sbjct:   5 KDQDMGDGMQCINHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSS-PSFRS 63

Query: 375 DSLATSTTTTTVSASLSLSASGARNVTANNNNNKLPFLLA--KKKALTPSSSST--NIVY 542
           DS+   +TTT  +ASLSLS SGA       NNNKLPFLLA  KKK LT SSS+T  NIVY
Sbjct:  64 DSV--GSTTTASAASLSLSVSGA------TNNNKLPFLLAKKKKKMLTASSSATTANIVY 115

Query: 543 KRSQS--TTTTAYRVSDSTPRKRSGFWSFLHLHSYKHNSSSRK 665
           KRSQS  TT T Y  SD +PRKR+GFWSFLHL+S KH+ SS+K
Sbjct: 116 KRSQSTRTTKTTYGDSDLSPRKRNGFWSFLHLYSSKHHGSSKK 158

>gi|21536969|gb|AAM61310.1| unknown [Arabidopsis thaliana]

          Length = 388

 Score =  206 bits (523), Expect = 3e-051
 Identities = 115/157 (73%), Positives = 128/157 (81%), Gaps = 14/157 (8%)
 Frame = +3

Query: 207 MGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSPPSFRSDSLA 386
           MGDGMQC+NHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHL+SSSTSSS PSFRSDS+ 
Sbjct:   1 MGDGMQCINHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLTSSSTSSS-PSFRSDSVG 59

Query: 387 TSTTTT--TVSASLSLSASGARNVTANNNNNKLPFLLA--KKKALTPSSSST---NIVYK 545
           ++TT +   +SASLSLS SGA     NNNN+KLPFLLA  KKK LT SSSS+   NIVYK
Sbjct:  60 STTTASAANLSASLSLSVSGA----TNNNNSKLPFLLAKKKKKMLTASSSSSTTANIVYK 115

Query: 546 RSQS--TTTTAYRVSDSTPRKRSGFWSFLHLHSYKHN 650
           RSQS  TT T Y  SD +PRKR+GFWSF HL+S K +
Sbjct: 116 RSQSTRTTKTTYGDSDLSPRKRNGFWSFFHLYSSKQH 152

>gi|18422990|ref|NP_568706.1| uncharacterized protein [Arabidopsis thaliana]

          Length = 396

 Score =  203 bits (515), Expect = 2e-050
 Identities = 116/170 (68%), Positives = 131/170 (77%), Gaps = 12/170 (7%)
 Frame = +3

Query: 177 MVGAKDQQQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSS 356
           MV AKD  Q+MGDGMQC+NHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHL+SSSTSSS
Sbjct:   1 MVEAKD--QDMGDGMQCINHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLTSSSTSSS 58

Query: 357 PPSFRSDSLATSTTTT--TVSASLSLSASGARNVTANNNNNKLPFLLAKKKALTPSSSST 530
            PSFRSDS+ ++TT +   +SASLSLS SGA N   NN+         KKK LT SSSS+
Sbjct:  59 -PSFRSDSVGSTTTASAANLSASLSLSVSGATN--NNNSKLPFLLAKKKKKMLTASSSSS 115

Query: 531 ---NIVYKRSQS--TTTTAYRVSDSTPRKRSGFWSFLHLHSYKHNSSSRK 665
              NIVYKRSQS  TT T Y  SD +PRKR+GFWSF HL+S K + SS+K
Sbjct: 116 TTANIVYKRSQSTRTTKTTYGDSDLSPRKRNGFWSFFHLYSSKQHGSSKK 165

>gi|297829232|ref|XP_002882498.1| hypothetical protein ARALYDRAFT_478007
        [Arabidopsis lyrata subsp. lyrata]

          Length = 372

 Score =  161 bits (407), Expect = 7e-038
 Identities = 97/168 (57%), Positives = 116/168 (69%), Gaps = 29/168 (17%)
 Frame = +3

Query: 195 QQQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPK--HLSSSSTSSSPPSF 368
           + Q+MG+GMQC+ HP+TKNPGGICA CLQEKLGKLVTSSFP+PK  HLSSSS  S  PS 
Sbjct:   5 KDQDMGEGMQCIRHPYTKNPGGICALCLQEKLGKLVTSSFPVPKPNHLSSSSPKSFTPS- 63

Query: 369 RSDSLATSTTTTTVSASLSLSASGARNVTANNNNNKLPFLLAKKK-----------ALTP 515
                    TT++++ SLS SAS  R+ T+NNN   LPFLLAKKK           + + 
Sbjct:  64 ---------TTSSLALSLS-SASNGRDSTSNNN---LPFLLAKKKKNMLAASSSSSSSSS 110

Query: 516 SSSSTNIVYKRSQSTTTTAYRVSDSTPRKRSGFWSFLHLHSYKHNSSS 659
           SSSS N++YKRS+S T  AY  S S  RKRSGFWSFLHL+S KH  S+
Sbjct: 111 SSSSANLIYKRSKS-TAAAYGESFS-QRKRSGFWSFLHLYSSKHQISN 156

>gi|186509861|ref|NP_001118595.1| uncharacterized protein [Arabidopsis
        thaliana]

          Length = 369

 Score =  141 bits (355), Expect = 8e-032
 Identities = 85/158 (53%), Positives = 103/158 (65%), Gaps = 12/158 (7%)
 Frame = +3

Query: 198 QQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPK--HLSSSSTSSSPPSFR 371
           QQ+MG+GMQC+ HP+TKNPGGICA CLQEKLGKLVTSSFP+PK  HLSSSS  S  PS  
Sbjct:   7 QQDMGEGMQCITHPYTKNPGGICALCLQEKLGKLVTSSFPVPKPNHLSSSSPKSFTPSTT 66

Query: 372 SDSLATSTTTTTVSASLSLSASGARNVTANNNNNKLPFLLAKKKALTPSSSST--NIVYK 545
           S +L+ S      SAS    ++   N+       K   L A   + + SSSS+  N++YK
Sbjct:  67 SLALSLS------SASNGRDSTNNNNLPFLLAKKKKNMLAASSSSSSSSSSSSSANLIYK 120

Query: 546 RSQSTTTTAYRVSDSTPRKRSGFWSFLHLHSYKHNSSS 659
           RS+S T  AY  S S  RKRSGFWSF HL+S KH  S+
Sbjct: 121 RSKS-TAAAYGESFS-QRKRSGFWSFFHLYSSKHQISN 156

>gi|255551795|ref|XP_002516943.1| conserved hypothetical protein [Ricinus
        communis]

          Length = 450

 Score =  98 bits (242), Expect = 1e-018
 Identities = 49/86 (56%), Positives = 63/86 (73%), Gaps = 2/86 (2%)
 Frame = +3

Query: 198 QQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSPPSFRSD 377
           +++MGDGMQC +HP+  NPGGICAFCLQEKLGKLV+SSFPLP  + +SS+SSS PSFRSD
Sbjct:  27 EEDMGDGMQCSDHPYRNNPGGICAFCLQEKLGKLVSSSFPLP--IRASSSSSSSPSFRSD 84

Query: 378 SLATSTTTTTVSASLSLSASGARNVT 455
             +       V AS  +S   A +++
Sbjct:  85 IGSGVVGVGVVGASNGVSVGPAASLS 110

>gi|224107110|ref|XP_002314378.1| predicted protein [Populus trichocarpa]

          Length = 389

 Score =  96 bits (237), Expect = 4e-018
 Identities = 47/79 (59%), Positives = 59/79 (74%), Gaps = 2/79 (2%)
 Frame = +3

Query: 198 QQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSPPSFRSD 377
           ++++GDGMQC +HP+  NPGGICAFCLQEKLGKLV+SSFPLP  +  SS+SSS PSFRS 
Sbjct:   1 EEDLGDGMQCSDHPYRNNPGGICAFCLQEKLGKLVSSSFPLP--IRGSSSSSSSPSFRSV 58

Query: 378 SLATSTTTTTVSASLSLSA 434
                ++      SLSL+A
Sbjct:  59 IGVGGSSNVGAGTSLSLAA 77

>gi|296083358|emb|CBI22994.3| unnamed protein product [Vitis vinifera]

          Length = 387

 Score =  89 bits (220), Expect = 4e-016
 Identities = 58/153 (37%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
 Frame = +3

Query: 198 QQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSPPSFRSD 377
           + ++G+GMQC +HP+  NPGGICAFCLQEKLGKL+     +   +      +S     S 
Sbjct:  48 EDDVGEGMQCSDHPYRNNPGGICAFCLQEKLGKLIGGGAGVGVGVGGGGGGAS-----ST 102

Query: 378 SLATSTTTTTVSASLSLSASGARNVTANNNNNKLPFLLAKKKALTPSSSSTNIVYKRSQS 557
           SL+   T+++ S S S       N +       L     KKK     S +  IV KRS+S
Sbjct: 103 SLSVRPTSSSSSYSASKDCHYHGNYSRRARIPFLLAQKKKKKKEVMGSDAVGIVLKRSKS 162

Query: 558 TTT--------TAYRVSDSTPRKRSGFWSFLHL 632
           TTT         +   +D +P+KR GFWSFL+L
Sbjct: 163 TTTPRRGHFLVESEDANDYSPQKR-GFWSFLYL 194

>gi|225431743|ref|XP_002270026.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 420

 Score =  82 bits (200), Expect = 7e-014
 Identities = 38/60 (63%), Positives = 48/60 (80%), Gaps = 3/60 (5%)
 Frame = +3

Query: 198 QQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSPPSFRSD 377
           + ++G+GMQC +HP+  NPGGICAFCLQEKLGKLV+SSFP   +    S+SSS PSFRS+
Sbjct:   9 EDDVGEGMQCSDHPYRNNPGGICAFCLQEKLGKLVSSSFP---NAIFPSSSSSSPSFRSE 65

>gi|255645721|gb|ACU23354.1| unknown [Glycine max]

          Length = 324

 Score =  72 bits (175), Expect = 6e-011
 Identities = 38/75 (50%), Positives = 46/75 (61%), Gaps = 4/75 (5%)
 Frame = +3

Query: 177 MVGAKDQQQEMGDGMQCVNHPF----TKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSS 344
           M G   +  E+ DGMQC+NHP       NPGGICA CLQ+KL  L++SSFP      SSS
Sbjct:   1 MEGVGARHNEISDGMQCMNHPHRNNNNNNPGGICALCLQDKLRNLLSSSFPTSSPPFSSS 60

Query: 345 TSSSPPSFRSDSLAT 389
           +SSSP    S S+ T
Sbjct:  61 SSSSPSFTSSSSVKT 75

>gi|30680042|ref|NP_187343.2| proline-rich family protein [Arabidopsis
        thaliana]

          Length = 214

 Score =  64 bits (153), Expect = 2e-008
 Identities = 33/49 (67%), Positives = 37/49 (75%), Gaps = 2/49 (4%)
 Frame = -2

Query: 382 RESDLKEGGEEEVEEEERC--LGRGKEEVTSFPSFSWRQKAQIPPGFFV 242
           R  ++ EG ++  EEEER   LG GKEEVTS PSFSWR KAQIPPGFFV
Sbjct: 166 RAREVVEGVKDLGEEEERWLGLGTGKEEVTSLPSFSWRHKAQIPPGFFV 214

>gi|6728994|gb|AAF26991.1|AC016827_2 unknown protein [Arabidopsis thaliana]

          Length = 207

 Score =  64 bits (153), Expect = 2e-008
 Identities = 33/49 (67%), Positives = 37/49 (75%), Gaps = 2/49 (4%)
 Frame = -2

Query: 382 RESDLKEGGEEEVEEEERC--LGRGKEEVTSFPSFSWRQKAQIPPGFFV 242
           R  ++ EG ++  EEEER   LG GKEEVTS PSFSWR KAQIPPGFFV
Sbjct: 159 RAREVVEGVKDLGEEEERWLGLGTGKEEVTSLPSFSWRHKAQIPPGFFV 207

>gi|119176049|ref|XP_001240156.1| hypothetical protein CIMG_09777 [Coccidioides
        immitis RS]

          Length = 1291

 Score =  59 bits (142), Expect = 4e-007
 Identities = 32/85 (37%), Positives = 44/85 (51%), Gaps = 2/85 (2%)
 Frame = +1

Query:  346 LLPPLLPSDLTL*LPPPPPPPSPPLSPSPPQEPETSQQTTTTTSFRFYLQRRRL*LPPPP 525
            ++PP+LP    L  PPPPPPP PP +PS P +P +S Q+    SF   +    + +  P 
Sbjct: 1127 VIPPVLPELQHLSNPPPPPPPPPPTAPSQPMDPNSSSQSEERNSFVSGVGTINIAIDEPA 1186

Query:  526 --RRTSFTREASQRQRPRTESLTPP 594
                T  T + SQ   P TE + PP
Sbjct: 1187 VLAPTEITHQRSQSAVPPTEYMLPP 1211

>gi|303318213|ref|XP_003069106.1| hypothetical protein CPC735_022970
        [Coccidioides posadasii C735 delta SOWgp]

          Length = 1319

 Score =  59 bits (142), Expect = 4e-007
 Identities = 32/85 (37%), Positives = 44/85 (51%), Gaps = 2/85 (2%)
 Frame = +1

Query:  346 LLPPLLPSDLTL*LPPPPPPPSPPLSPSPPQEPETSQQTTTTTSFRFYLQRRRL*LPPPP 525
            ++PP+LP    L  PPPPPPP PP +PS P +P +S Q+    SF   +    + +  P 
Sbjct: 1155 VIPPVLPELQHLSNPPPPPPPPPPAAPSQPMDPNSSSQSEERNSFVSGVGTINIAIDEPA 1214

Query:  526 --RRTSFTREASQRQRPRTESLTPP 594
                T  T + SQ   P TE + PP
Sbjct: 1215 VLAPTEITHQRSQSAVPPTEYMLPP 1239

>gi|320031713|gb|EFW13672.1| conserved hypothetical protein [Coccidioides
        posadasii str. Silveira]

          Length = 1582

 Score =  59 bits (142), Expect = 4e-007
 Identities = 32/85 (37%), Positives = 44/85 (51%), Gaps = 2/85 (2%)
 Frame = +1

Query:  346 LLPPLLPSDLTL*LPPPPPPPSPPLSPSPPQEPETSQQTTTTTSFRFYLQRRRL*LPPPP 525
            ++PP+LP    L  PPPPPPP PP +PS P +P +S Q+    SF   +    + +  P 
Sbjct: 1418 VIPPVLPELQHLSNPPPPPPPPPPAAPSQPMDPNSSSQSEERNSFVSGVGTINIAIDEPA 1477

Query:  526 --RRTSFTREASQRQRPRTESLTPP 594
                T  T + SQ   P TE + PP
Sbjct: 1478 VLAPTEITHQRSQSAVPPTEYMLPP 1502

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 670,546,551,498
Number of Sequences: 15229318
Number of Extensions: 670546551498
Number of Successful Extensions: 203192682
Number of sequences better than 0.0: 0