GenBank blast output of UN74855


BLASTX 7.6.2

Query= UN74855 /QuerySize=782
        (781 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|18422990|ref|NP_568706.1| uncharacterized protein [Arabidopsi...    245   7e-063
gi|297795649|ref|XP_002865709.1| hypothetical protein ARALYDRAFT...    240   2e-061
gi|21536969|gb|AAM61310.1| unknown [Arabidopsis thaliana]              236   3e-060
gi|297829232|ref|XP_002882498.1| hypothetical protein ARALYDRAFT...    176   3e-042
gi|186509861|ref|NP_001118595.1| uncharacterized protein [Arabid...    158   1e-036
gi|255551795|ref|XP_002516943.1| conserved hypothetical protein ...    102   9e-020
gi|224107110|ref|XP_002314378.1| predicted protein [Populus tric...    101   2e-019
gi|296083358|emb|CBI22994.3| unnamed protein product [Vitis vini...     87   2e-015
gi|225431743|ref|XP_002270026.1| PREDICTED: hypothetical protein...     84   3e-014
gi|255645721|gb|ACU23354.1| unknown [Glycine max]                       70   2e-010
gi|30680042|ref|NP_187343.2| proline-rich family protein [Arabid...     63   5e-008
gi|6728994|gb|AAF26991.1|AC016827_2 unknown protein [Arabidopsis...     63   5e-008
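
The summary table above can be read programmatically. The following is a minimal Python sketch, assuming each hit line has the layout shown here (identifier, free-text description, bit score, E-value, separated by whitespace). The names `HIT_RE` and `parse_hit` are hypothetical and are not part of any BLAST tool; a production pipeline would more likely use a dedicated parser or a tabular output format.

```python
import re

# Hypothetical pattern for one line of the "Sequences producing significant
# alignments" table: identifier, description, bit score, E-value.
HIT_RE = re.compile(
    r"^(?P<id>\S+)\s+(?P<desc>.*?)\s+(?P<bits>\d+)\s+"
    r"(?P<evalue>\d+(?:\.\d+)?(?:e[-+]?\d+)?)\s*$"
)

def parse_hit(line):
    """Return a dict for one summary line, or None if the line does not match."""
    m = HIT_RE.match(line)
    if m is None:
        return None
    return {
        "id": m.group("id"),
        "description": m.group("desc"),
        "bits": int(m.group("bits")),
        # float() accepts the zero-padded exponent used in this report (e.g. 7e-063).
        "evalue": float(m.group("evalue")),
    }

# Two lines copied from the table above.
sample = [
    "gi|18422990|ref|NP_568706.1| uncharacterized protein [Arabidopsi...    245   7e-063",
    "gi|255645721|gb|ACU23354.1| unknown [Glycine max]                       70   2e-010",
]
hits = [parse_hit(line) for line in sample]
```

Descriptions truncated with "..." in the table stay truncated in the parsed result; the full titles appear only in the alignment records that follow.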

>gi|18422990|ref|NP_568706.1| uncharacterized protein [Arabidopsis thaliana]

          Length = 396

 Score =  245 bits (624), Expect = 7e-063
 Identities = 141/192 (73%), Positives = 152/192 (79%), Gaps = 23/192 (11%)
 Frame = +3

Query: 177 MVEAKDQQQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSS 356
           MVEAKD  Q+MGDGMQC+NHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHL+SSST SS
Sbjct:   1 MVEAKD--QDMGDGMQCINHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLTSSST-SS 57

Query: 357 SPSFRSDSVATSTTTT---VSASLSLSASGARNLTANNNNNNKLPFLLA--KKKALTPSS 521
           SPSFRSDSV ++TT +   +SASLSLS SG     A NNNN+KLPFLLA  KKK LT SS
Sbjct:  58 SPSFRSDSVGSTTTASAANLSASLSLSVSG-----ATNNNNSKLPFLLAKKKKKMLTASS 112

Query: 522 SSS--SNIVYKRSQS--TTTTTYRVSDSSPRKRSGFWSFLHLHSYKHHSSSKKVGNFHDS 689
           SSS  +NIVYKRSQS  TT TTY  SD SPRKR+GFWSF HL+S K H SSKKVGNFH  
Sbjct: 113 SSSTTANIVYKRSQSTRTTKTTYGDSDLSPRKRNGFWSFFHLYSSKQHGSSKKVGNFH-- 170

Query: 690 SRPQQPTKHTET 725
               QP   TET
Sbjct: 171 ----QPISQTET 178

>gi|297795649|ref|XP_002865709.1| hypothetical protein ARALYDRAFT_917876
        [Arabidopsis lyrata subsp. lyrata]

          Length = 384

 Score =  240 bits (612), Expect = 2e-061
 Identities = 139/187 (74%), Positives = 151/187 (80%), Gaps = 17/187 (9%)
 Frame = +3

Query: 183 EAKDQQQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSSP 362
           E KD  Q+MGDGMQC+NHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSST SSSP
Sbjct:   3 EVKD--QDMGDGMQCINHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSST-SSSP 59

Query: 363 SFRSDSVATSTTTTVSASLSLSASGARNLTANNNNNNKLPFLLA--KKKALTPSSS-SSS 533
           SFRSDSV  STTT  +ASLSLS SGA        NNNKLPFLLA  KKK LT SSS +++
Sbjct:  60 SFRSDSVG-STTTASAASLSLSVSGA-------TNNNKLPFLLAKKKKKMLTASSSATTA 111

Query: 534 NIVYKRSQS--TTTTTYRVSDSSPRKRSGFWSFLHLHSYKHHSSSKKVGNFHD-SSRPQQ 704
           NIVYKRSQS  TT TTY  SD SPRKR+GFWSFLHL+S KHH SSKKVGNFH  +S+ + 
Sbjct: 112 NIVYKRSQSTRTTKTTYGDSDLSPRKRNGFWSFLHLYSSKHHGSSKKVGNFHQPTSQIEI 171

Query: 705 PTKHTET 725
            T+ TET
Sbjct: 172 KTELTET 178

>gi|21536969|gb|AAM61310.1| unknown [Arabidopsis thaliana]

          Length = 388

 Score =  236 bits (601), Expect = 3e-060
 Identities = 134/182 (73%), Positives = 144/182 (79%), Gaps = 21/182 (11%)
 Frame = +3

Query: 207 MGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSSPSFRSDSVA 386
           MGDGMQC+NHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHL+SSST SSSPSFRSDSV 
Sbjct:   1 MGDGMQCINHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLTSSST-SSSPSFRSDSVG 59

Query: 387 TSTTTT---VSASLSLSASGARNLTANNNNNNKLPFLLA--KKKALTPSSSSS--SNIVY 545
           ++TT +   +SASLSLS SG     A NNNN+KLPFLLA  KKK LT SSSSS  +NIVY
Sbjct:  60 STTTASAANLSASLSLSVSG-----ATNNNNSKLPFLLAKKKKKMLTASSSSSTTANIVY 114

Query: 546 KRSQS--TTTTTYRVSDSSPRKRSGFWSFLHLHSYKHHSSSKKVGNFHDSSRPQQPTKHT 719
           KRSQS  TT TTY  SD SPRKR+GFWSF HL+S K H SSKKVGNFH      QP   T
Sbjct: 115 KRSQSTRTTKTTYGDSDLSPRKRNGFWSFFHLYSSKQHGSSKKVGNFH------QPISQT 168

Query: 720 ET 725
           ET
Sbjct: 169 ET 170

>gi|297829232|ref|XP_002882498.1| hypothetical protein ARALYDRAFT_478007
        [Arabidopsis lyrata subsp. lyrata]

          Length = 372

 Score =  176 bits (446), Expect = 3e-042
 Identities = 110/195 (56%), Positives = 133/195 (68%), Gaps = 32/195 (16%)
 Frame = +3

Query: 183 EAKDQQQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPK--HLSSSSTSSS 356
           E KD  Q+MG+GMQC+ HP+TKNPGGICA CLQEKLGKLVTSSFP+PK  HLSSSS  S 
Sbjct:   3 ELKD--QDMGEGMQCIRHPYTKNPGGICALCLQEKLGKLVTSSFPVPKPNHLSSSSPKSF 60

Query: 357 SPSFRSDSVATSTTTTVSASLSLSASGARNLTANNNNNNKLPFLLAKKK----------A 506
           +P         STT++++ SLS SAS  R+ T+NNN    LPFLLAKKK          +
Sbjct:  61 TP---------STTSSLALSLS-SASNGRDSTSNNN----LPFLLAKKKKNMLAASSSSS 106

Query: 507 LTPSSSSSSNIVYKRSQSTTTTTYRVSDSSPRKRSGFWSFLHLHSYKHH--SSSKKVGNF 680
            + SSSSS+N++YKRS+S T   Y  S  S RKRSGFWSFLHL+S KH   +++KKV NF
Sbjct: 107 SSSSSSSSANLIYKRSKS-TAAAYGES-FSQRKRSGFWSFLHLYSSKHQISNTTKKVDNF 164

Query: 681 HDSSRPQQPTKHTET 725
             S R Q+    TET
Sbjct: 165 SHSRRNQRTESTTET 179

>gi|186509861|ref|NP_001118595.1| uncharacterized protein [Arabidopsis
        thaliana]

          Length = 369

 Score =  158 bits (398), Expect = 1e-036
 Identities = 96/186 (51%), Positives = 119/186 (63%), Gaps = 12/186 (6%)
 Frame = +3

Query: 180 VEAKDQQQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPK--HLSSSSTSS 353
           VE KD QQ+MG+GMQC+ HP+TKNPGGICA CLQEKLGKLVTSSFP+PK  HLSSSS  S
Sbjct:   2 VELKD-QQDMGEGMQCITHPYTKNPGGICALCLQEKLGKLVTSSFPVPKPNHLSSSSPKS 60

Query: 354 SSPSFRSDSVATSTTTTVSASLSLSASGARNLTANNNNNNKLPFLLAKKKALTPSSSSSS 533
            +PS  S +++ S     SAS    ++   NL        K     +   + + SSSSS+
Sbjct:  61 FTPSTTSLALSLS-----SASNGRDSTNNNNLPFLLAKKKKNMLAASSSSSSSSSSSSSA 115

Query: 534 NIVYKRSQSTTTTTYRVSDSSPRKRSGFWSFLHLHSYKHH--SSSKKVGNFHDSSRPQQP 707
           N++YKRS+S T   Y  S  S RKRSGFWSF HL+S KH   +++KKV NF    R Q+ 
Sbjct: 116 NLIYKRSKS-TAAAYGES-FSQRKRSGFWSFFHLYSSKHQISNTTKKVDNFSHLRRNQRT 173

Query: 708 TKHTET 725
              TET
Sbjct: 174 ESKTET 179

>gi|255551795|ref|XP_002516943.1| conserved hypothetical protein [Ricinus
        communis]

          Length = 450

 Score =  102 bits (252), Expect = 9e-020
 Identities = 53/103 (51%), Positives = 67/103 (65%), Gaps = 13/103 (12%)
 Frame = +3

Query: 198 QQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSSPSFRSD 377
           +++MGDGMQC +HP+  NPGGICAFCLQEKLGKLV+SSFPLP  + +SS+SSSSPSFRSD
Sbjct:  27 EEDMGDGMQCSDHPYRNNPGGICAFCLQEKLGKLVSSSFPLP--IRASSSSSSSPSFRSD 84

Query: 378 -----------SVATSTTTTVSASLSLSASGARNLTANNNNNN 473
                        +   +   +ASLSL+         N+  NN
Sbjct:  85 IGSGVVGVGVVGASNGVSVGPAASLSLAVHSTSTKGRNDGGNN 127

>gi|224107110|ref|XP_002314378.1| predicted protein [Populus trichocarpa]

          Length = 389

 Score =  101 bits (249), Expect = 2e-019
 Identities = 52/93 (55%), Positives = 65/93 (69%), Gaps = 3/93 (3%)
 Frame = +3

Query: 198 QQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSSPSFRS- 374
           ++++GDGMQC +HP+  NPGGICAFCLQEKLGKLV+SSFPLP  +  SS+SSSSPSFRS 
Sbjct:   1 EEDLGDGMQCSDHPYRNNPGGICAFCLQEKLGKLVSSSFPLP--IRGSSSSSSSPSFRSV 58

Query: 375 DSVATSTTTTVSASLSLSASGARNLTANNNNNN 473
             V  S+      SLSL+A        N+  +N
Sbjct:  59 IGVGGSSNVGAGTSLSLAARPTTTKCRNDGGSN 91

>gi|296083358|emb|CBI22994.3| unnamed protein product [Vitis vinifera]

          Length = 387

 Score =  87 bits (215), Expect = 2e-015
 Identities = 59/154 (38%), Positives = 80/154 (51%), Gaps = 15/154 (9%)
 Frame = +3

Query: 198 QQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSSPSFRSD 377
           + ++G+GMQC +HP+  NPGGICAFCLQEKLGKL+     +   +      +SS S    
Sbjct:  48 EDDVGEGMQCSDHPYRNNPGGICAFCLQEKLGKLIGGGAGVGVGVGGGGGGASSTSL--S 105

Query: 378 SVATSTTTTVSASLSLSASGARNLTANNNNNNKLPFLLAKKKALTPSSSSSSNIVYKRSQ 557
              TS++++ SAS      G  +  A             KKK      S +  IV KRS+
Sbjct: 106 VRPTSSSSSYSASKDCHYHGNYSRRA----RIPFLLAQKKKKKKEVMGSDAVGIVLKRSK 161

Query: 558 STTT--------TTYRVSDSSPRKRSGFWSFLHL 635
           STTT         +   +D SP+KR GFWSFL+L
Sbjct: 162 STTTPRRGHFLVESEDANDYSPQKR-GFWSFLYL 194

>gi|225431743|ref|XP_002270026.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 420

 Score =  84 bits (205), Expect = 3e-014
 Identities = 39/60 (65%), Positives = 49/60 (81%), Gaps = 3/60 (5%)
 Frame = +3

Query: 198 QQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSSPSFRSD 377
           + ++G+GMQC +HP+  NPGGICAFCLQEKLGKLV+SSFP   +    S+SSSSPSFRS+
Sbjct:   9 EDDVGEGMQCSDHPYRNNPGGICAFCLQEKLGKLVSSSFP---NAIFPSSSSSSPSFRSE 65

>gi|255645721|gb|ACU23354.1| unknown [Glycine max]

          Length = 324

 Score =  70 bits (171), Expect = 2e-010
 Identities = 38/66 (57%), Positives = 44/66 (66%), Gaps = 5/66 (7%)
 Frame = +3

Query: 195 QQQEMGDGMQCVNHPF----TKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSSP 362
           +  E+ DGMQC+NHP       NPGGICA CLQ+KL  L++SSFP      SSS SSSSP
Sbjct:   7 RHNEISDGMQCMNHPHRNNNNNNPGGICALCLQDKLRNLLSSSFPTSSPPFSSS-SSSSP 65

Query: 363 SFRSDS 380
           SF S S
Sbjct:  66 SFTSSS 71

>gi|30680042|ref|NP_187343.2| proline-rich family protein [Arabidopsis
        thaliana]

          Length = 214

 Score =  63 bits (151), Expect = 5e-008
 Identities = 32/46 (69%), Positives = 36/46 (78%), Gaps = 2/46 (4%)
 Frame = -1

Query: 373 DLKEGEEEEVEEEERC--LGRGKEEVTSFPSFSWRQKAQIPPGFFV 242
           ++ EG ++  EEEER   LG GKEEVTS PSFSWR KAQIPPGFFV
Sbjct: 169 EVVEGVKDLGEEEERWLGLGTGKEEVTSLPSFSWRHKAQIPPGFFV 214

>gi|6728994|gb|AAF26991.1|AC016827_2 unknown protein [Arabidopsis thaliana]

          Length = 207

 Score =  63 bits (151), Expect = 5e-008
 Identities = 32/46 (69%), Positives = 36/46 (78%), Gaps = 2/46 (4%)
 Frame = -1

Query: 373 DLKEGEEEEVEEEERC--LGRGKEEVTSFPSFSWRQKAQIPPGFFV 242
           ++ EG ++  EEEER   LG GKEEVTS PSFSWR KAQIPPGFFV
Sbjct: 162 EVVEGVKDLGEEEERWLGLGTGKEEVTSLPSFSWRHKAQIPPGFFV 207

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 798,366,029,682
Number of Sequences: 15229318
Number of Extensions: 798366029682
Number of Successful Extensions: 240520672
Number of sequences better than 0.0: 0
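
The bit scores in this report can be checked against the raw scores shown in parentheses using the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2. The Python sketch below applies it with the gapped Lambda and K values printed above; the function name `bit_score` is hypothetical, and the E-values are not recomputed because the effective search-space lengths are not listed in this output.

```python
import math

# Gapped statistical parameters as printed in the report footer.
LAMBDA = 0.267
K = 0.041

def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score S to a bit score: (lam * S - ln k) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)

# (raw score, reported bit score) pairs taken from the alignment headers above.
reported = [(624, 245), (612, 240), (601, 236), (446, 176), (151, 63)]

for raw, bits in reported:
    # Rounding the computed value should reproduce the integer shown in the report.
    print(raw, round(bit_score(raw)), bits)
```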