Library    |     Search    |     Batch query    |     SNP    |     SSR  

GenBank blast output of UN20000


BLASTX 7.6.2

Query= UN20000 /QuerySize=650
        (649 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|297844182|ref|XP_002889972.1| expressed protein [Arabidopsis ...    188   7e-046
gi|147742772|gb|ABQ50553.1| hypothetical protein [Brassica rapa]       163   2e-038
gi|8698728|gb|AAF78486.1|AC012187_6 Contains weak similarity to ...    161   7e-038
gi|18391437|ref|NP_563914.1| proline-rich family protein [Arabid...    161   7e-038
gi|255631334|gb|ACU16034.1| unknown [Glycine max]                      144   9e-033
gi|255543609|ref|XP_002512867.1| LRX1, putative [Ricinus communis]     142   4e-032
gi|224055807|ref|XP_002298663.1| predicted protein [Populus tric...    139   5e-031
gi|238478456|ref|NP_001154331.1| proline-rich family protein [Ar...    138   6e-031
gi|224129224|ref|XP_002328921.1| predicted protein [Populus tric...    138   1e-030
gi|296082290|emb|CBI21295.3| unnamed protein product [Vitis vini...    135   7e-030
gi|225451569|ref|XP_002274558.1| PREDICTED: hypothetical protein...    131   1e-028
gi|218201524|gb|EEC83951.1| hypothetical protein OsI_30050 [Oryz...    116   4e-024
gi|115477537|ref|NP_001062364.1| Os08g0536400 [Oryza sativa Japo...    111   8e-023
gi|147742781|gb|ABQ50561.1| hypothetical protein [Brassica rapa]        70   2e-010
gi|332025507|gb|EGI65670.1| hypothetical protein G5I_05770 [Acro...     67   2e-009
gi|255721947|ref|XP_002545908.1| predicted protein [Candida trop...     66   3e-009
gi|225458705|ref|XP_002283001.1| PREDICTED: hypothetical protein...     60   2e-007
gi|125552245|gb|EAY97954.1| hypothetical protein OsI_19871 [Oryz...     57   2e-006

>gi|297844182|ref|XP_002889972.1| expressed protein [Arabidopsis lyrata subsp.
        lyrata]

          Length = 132

 Score =  188 bits (476), Expect = 7e-046
 Identities = 90/133 (67%), Positives = 98/133 (73%), Gaps = 19/133 (14%)
 Frame = -2

Query: 600 MSYDYGKVPPETYPPPGYQSQYP-------PPPPVYP-PSHHHEGY-PQHPHGYPSYPPP 448
           MSYD  KVPPE+YPPPGYQS YP       PPPP YP P  HHEGY P  PHG   Y P 
Sbjct:   1 MSYD--KVPPESYPPPGYQSHYPPPGYPSAPPPPGYPSPPSHHEGYPPPQPHG--GYQPY 56

Query: 447 RPPSRPPYEGGYQEYFS-GGFP-----PPPPPPPQQYNQCHHDHHYYQDSNSGCTSFLRG 286
            PPS  PYEGGYQ YF+ GG+P     PPPPPPPQ YNQCHHDHH+YQDS+SGC SF+RG
Sbjct:  57 PPPSSRPYEGGYQGYFAGGGYPHHHHGPPPPPPPQNYNQCHHDHHHYQDSDSGCFSFVRG 116

Query: 285 CLATLCCCCLLDD 247
           CLA LCCCCLL++
Sbjct: 117 CLAALCCCCLLEE 129

>gi|147742772|gb|ABQ50553.1| hypothetical protein [Brassica rapa]

          Length = 116

 Score =  163 bits (412), Expect = 2e-038
 Identities = 77/115 (66%), Positives = 83/115 (72%), Gaps = 15/115 (13%)
 Frame = -2

Query: 588 YGKVPPETY-PPPGYQSQYP-------PPPPVYPPSHHHEGY-PQHPHGYPSYPPPRPPS 436
           Y KVPPE+Y PPPGYQS YP       PPPP YPP  HHEGY P  PHGYP YPPPR   
Sbjct:   3 YEKVPPESYPPPPGYQSHYPPPGYPSAPPPPGYPPP-HHEGYPPPQPHGYPPYPPPR--- 58

Query: 435 RPPYEGGYQEYFSGGFPPPPPPPPQQYNQCHHDHHYYQDSNSGCTSFLRGCLATL 271
             PYEGGYQ YF+G +PPPPPPPPQQ N   HDHH+YQDSNS  +SFLRGCL  +
Sbjct:  59 --PYEGGYQGYFAGNYPPPPPPPPQQCNHYQHDHHHYQDSNSDGSSFLRGCLCVV 111

>gi|8698728|gb|AAF78486.1|AC012187_6 Contains weak similarity to GATA-6
        DNA-binding protein from Homo sapiens gb|X95701. ESTs gb|N38392,
        gb|AI998367, gb|H36135, gb|Z26200 come from this gene [Arabidopsis
        thaliana]

          Length = 198

 Score =  161 bits (407), Expect = 7e-038
 Identities = 75/113 (66%), Positives = 83/113 (73%), Gaps = 13/113 (11%)
 Frame = -2

Query: 564 YPPPGYQSQYPPPPPVYP-PSHHHEGYPQHPHGYPSYPPPRPPSRPPYEGGYQEYFS-GG 391
           YPPPGY S   PPPP YP P  HHEGYP  P  Y  YP   PPS  PYEGGYQ YF+ GG
Sbjct:  89 YPPPGYPS--APPPPGYPSPPSHHEGYPP-PQPYGGYP---PPSSRPYEGGYQGYFAGGG 142

Query: 390 FP-----PPPPPPPQQYNQCHHDHHYYQDSNSGCTSFLRGCLATLCCCCLLDD 247
           +P     PPPPPPPQ Y+ CHHDHH+YQDS+SGC SF+RGCLA LCCCCLL++
Sbjct: 143 YPHQHHGPPPPPPPQNYDHCHHDHHHYQDSDSGCFSFIRGCLAALCCCCLLEE 195

>gi|18391437|ref|NP_563914.1| proline-rich family protein [Arabidopsis
        thaliana]

          Length = 129

 Score =  161 bits (407), Expect = 7e-038
 Identities = 75/113 (66%), Positives = 83/113 (73%), Gaps = 13/113 (11%)
 Frame = -2

Query: 564 YPPPGYQSQYPPPPPVYP-PSHHHEGYPQHPHGYPSYPPPRPPSRPPYEGGYQEYFS-GG 391
           YPPPGY S   PPPP YP P  HHEGYP  P  Y  YP   PPS  PYEGGYQ YF+ GG
Sbjct:  20 YPPPGYPS--APPPPGYPSPPSHHEGYPP-PQPYGGYP---PPSSRPYEGGYQGYFAGGG 73

Query: 390 FP-----PPPPPPPQQYNQCHHDHHYYQDSNSGCTSFLRGCLATLCCCCLLDD 247
           +P     PPPPPPPQ Y+ CHHDHH+YQDS+SGC SF+RGCLA LCCCCLL++
Sbjct:  74 YPHQHHGPPPPPPPQNYDHCHHDHHHYQDSDSGCFSFIRGCLAALCCCCLLEE 126

>gi|255631334|gb|ACU16034.1| unknown [Glycine max]

          Length = 118

 Score =  144 bits (363), Expect = 9e-033
 Identities = 64/117 (54%), Positives = 78/117 (66%), Gaps = 6/117 (5%)
 Frame = -2

Query: 591 DYGKVPPETYPPPGYQSQYPPPPPVYPPSHHHEGY--PQHPHGYPSYPPPRPPSRPPYEG 418
           ++ ++  E+YPPPG+ S YPPP P YP +  HEGY  P  P GY  YPPP PP  PPY+ 
Sbjct:   3 NFQRISHESYPPPGHGSPYPPPQPGYPSAPPHEGYPPPPPPPGYGGYPPPHPPQHPPYD- 61

Query: 417 GYQEYFSGGFPPPPPPPPQQYNQCHHDHHYYQDSNSGCTSFLRGCLATLCCCCLLDD 247
            YQ YF  G PPPPPPP   Y+  H DHH+  D   GC SFLRGC+A LCCCC+L++
Sbjct:  62 SYQGYFDNGRPPPPPPP--HYHYQHVDHHHLHD-EPGCFSFLRGCIAALCCCCVLEE 115

>gi|255543609|ref|XP_002512867.1| LRX1, putative [Ricinus communis]

          Length = 118

 Score =  142 bits (357), Expect = 4e-032
 Identities = 70/122 (57%), Positives = 82/122 (67%), Gaps = 17/122 (13%)
 Frame = -2

Query: 588 YGKVPPETYPPPGYQSQYPPP------PPVYPPSHHHEGYPQHPHGYPSYPPPRPPSRPP 427
           Y K P E YPPPGY   YPPP      PP  PP   +EGYP  P GYP  PPP P  R  
Sbjct:   3 YQKPPHEHYPPPGYAPSYPPPGYTPSAPP--PPPQPYEGYP--PPGYP--PPPGP--RQQ 54

Query: 426 YEGGYQEYFSGGFPPPP--PPPPQQYNQCHHDHHYYQDSNSGCTSFLRGCLATLCCCCLL 253
           YE GYQ YF+ G+PPPP  P PPQ ++Q H++HH+YQD+  GCTSF +GCLA LCCCC+L
Sbjct:  55 YE-GYQGYFAEGYPPPPPRPGPPQYHHQYHYEHHHYQDNTGGCTSFFQGCLAALCCCCVL 113

Query: 252 DD 247
           D+
Sbjct: 114 DE 115

>gi|224055807|ref|XP_002298663.1| predicted protein [Populus trichocarpa]

          Length = 125

 Score =  139 bits (348), Expect = 5e-031
 Identities = 68/128 (53%), Positives = 81/128 (63%), Gaps = 22/128 (17%)
 Frame = -2

Query: 588 YGKVPPETYPPPGYQSQYPPP--PPVYPP-------SHHHEGYP---QHPHGYPSYPPPR 445
           Y K P + YPP GY   YPPP  PP  PP       +  + GYP     P GY  YPPP 
Sbjct:   3 YQKAPHQPYPPSGYSPPYPPPGYPPTTPPYGGYPPTTPPYGGYPPPGAPPPGYSGYPPPG 62

Query: 444 PPSRPPYEGGYQEYFSGGFPPPPPPP-PQQYNQC-HHDHHYYQDSNSGCTSFLRGCLATL 271
           PP       GYQ YF+ G+PPPPPPP PQQY +C H++HH+YQD   GC+SFLRGCLA L
Sbjct:  63 PPR------GYQGYFAEGYPPPPPPPGPQQYQECYHYEHHHYQD--DGCSSFLRGCLAAL 114

Query: 270 CCCCLLDD 247
           CCCC+L++
Sbjct: 115 CCCCVLEE 122

>gi|238478456|ref|NP_001154331.1| proline-rich family protein [Arabidopsis
        thaliana]

          Length = 162

 Score =  138 bits (347), Expect = 6e-031
 Identities = 66/101 (65%), Positives = 72/101 (71%), Gaps = 13/101 (12%)
 Frame = -2

Query: 564 YPPPGYQSQYPPPPPVYP-PSHHHEGYPQHPHGYPSYPPPRPPSRPPYEGGYQEYFS-GG 391
           YPPPGY S   PPPP YP P  HHEGYP  P  Y  YP   PPS  PYEGGYQ YF+ GG
Sbjct:  20 YPPPGYPS--APPPPGYPSPPSHHEGYPP-PQPYGGYP---PPSSRPYEGGYQGYFAGGG 73

Query: 390 FP-----PPPPPPPQQYNQCHHDHHYYQDSNSGCTSFLRGC 283
           +P     PPPPPPPQ Y+ CHHDHH+YQDS+SGC SF+RGC
Sbjct:  74 YPHQHHGPPPPPPPQNYDHCHHDHHHYQDSDSGCFSFIRGC 114

>gi|224129224|ref|XP_002328921.1| predicted protein [Populus trichocarpa]

          Length = 143

 Score =  138 bits (345), Expect = 1e-030
 Identities = 68/124 (54%), Positives = 77/124 (62%), Gaps = 21/124 (16%)
 Frame = -2

Query: 564 YPPPGYQSQYPPP------PPVYPPSHHHEGY---PQHPHGYPSYPPPRPPSRPPYEG-- 418
           YPPPGY    PPP      PP  PP   HEGY   P  P GYP YPPP PP  P Y G  
Sbjct:  20 YPPPGYSPSAPPPPPHEGYPPPPPPPPPHEGYPPPPPPPPGYPGYPPPGPPP-PGYPGYP 78

Query: 417 ------GYQEYFSGGFPPPPPPPPQQYNQ-CHHDHHYYQDSNSGCTSFLRGCLATLCCCC 259
                 GYQ YF+ G+P PP PP  QY Q CH++HH YQD+  GC+SFLRGCLA LCCCC
Sbjct:  79 PPGPPRGYQGYFAEGYPTPPGPP--QYQQCCHYEHHPYQDNYGGCSSFLRGCLAALCCCC 136

Query: 258 LLDD 247
           +L++
Sbjct: 137 VLEE 140

>gi|296082290|emb|CBI21295.3| unnamed protein product [Vitis vinifera]

          Length = 136

 Score =  135 bits (338), Expect = 7e-030
 Identities = 70/120 (58%), Positives = 78/120 (65%), Gaps = 17/120 (14%)
 Frame = -2

Query: 588 YGKVPPETYPPPGYQSQYPPP------PPVYPPSHHHEGYPQHPHGYPSYPPPRPPSRPP 427
           Y KVP E YPPPGY   YPPP      PP  PP     G P  P GYP YPPP PP  PP
Sbjct:  24 YQKVPQEPYPPPGYSVPYPPPSGYPSAPPPPPP-----GCPP-PPGYPGYPPPPPPPGPP 77

Query: 426 YEGGYQEYFSGGFPPPPPPPPQQYNQCHHDHHYYQDSNSGCTSFLRGCLATLCCCCLLDD 247
           Y+ GYQ YF+ G+ PPPPPPPQQY Q  +  + YQD  SG +SFL GCLA LCCCCLL++
Sbjct:  78 YQ-GYQGYFNEGY-PPPPPPPQQYQQ--YQQYQYQD-QSGPSSFLPGCLAALCCCCLLEE 132

>gi|225451569|ref|XP_002274558.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 115

 Score =  131 bits (328), Expect = 1e-028
 Identities = 68/117 (58%), Positives = 76/117 (64%), Gaps = 17/117 (14%)
 Frame = -2

Query: 579 VPPETYPPPGYQSQYPPP------PPVYPPSHHHEGYPQHPHGYPSYPPPRPPSRPPYEG 418
           VP E YPPPGY   YPPP      PP  PP     G P  P GYP YPPP PP  PPY+ 
Sbjct:   6 VPQEPYPPPGYSVPYPPPSGYPSAPPPPPP-----GCPP-PPGYPGYPPPPPPPGPPYQ- 58

Query: 417 GYQEYFSGGFPPPPPPPPQQYNQCHHDHHYYQDSNSGCTSFLRGCLATLCCCCLLDD 247
           GYQ YF+ G+ PPPPPPPQQY Q  +  + YQD  SG +SFL GCLA LCCCCLL++
Sbjct:  59 GYQGYFNEGY-PPPPPPPQQYQQ--YQQYQYQD-QSGPSSFLPGCLAALCCCCLLEE 111

>gi|218201524|gb|EEC83951.1| hypothetical protein OsI_30050 [Oryza sativa Indica
        Group]

          Length = 120

 Score =  116 bits (288), Expect = 4e-024
 Identities = 65/126 (51%), Positives = 77/126 (61%), Gaps = 23/126 (18%)
 Frame = -2

Query: 588 YGKVPP-ETYPPPGY--QSQYPPPPP---VYPPSHHHEGYPQHPHGYPSYPPPRPPSRPP 427
           Y +VPP E YPPPGY     YP PPP   VYPP    +GYP   HG   YPPP+ P  PP
Sbjct:   3 YQRVPPDEPYPPPGYPQSGPYPYPPPSGAVYPP----QGYPS-SHGV--YPPPQGPYPPP 55

Query: 426 YE---GGYQEYFSGG---FPPPPPPPPQQYNQCHHDHHYYQDSNSGCTSFLRGCLATLCC 265
           ++    GYQ YF+ G   + PPPPPPP  Y+ C   HH+  D  SG   FL+GCLA LCC
Sbjct:  56 HQPPPPGYQGYFNQGQQPYYPPPPPPPPPYDHC---HHHCGDEGSG-AGFLKGCLAALCC 111

Query: 264 CCLLDD 247
           CCLL++
Sbjct: 112 CCLLEE 117

>gi|115477537|ref|NP_001062364.1| Os08g0536400 [Oryza sativa Japonica Group]

          Length = 117

 Score =  111 bits (277), Expect = 8e-023
 Identities = 63/123 (51%), Positives = 74/123 (60%), Gaps = 20/123 (16%)
 Frame = -2

Query: 588 YGKVPP-ETYPPPGY--QSQYPPPPP---VYPPSHHHEGYPQHPHGYPSYPPPRPPSRPP 427
           Y +VPP E YPPPGY     YP PPP   VYPP    +GYP   HG   YPPP+ P  PP
Sbjct:   3 YQRVPPDEPYPPPGYPQSGPYPYPPPSGAVYPP----QGYPS-SHGV--YPPPQGPYPPP 55

Query: 426 YE---GGYQEYFSGGFPPPPPPPPQQYNQCHHDHHYYQDSNSGCTSFLRGCLATLCCCCL 256
           ++    GYQ YF+ G  P  PPPP  Y+ C   HH+  D  SG   FL+GCLA LCCCCL
Sbjct:  56 HQPPPPGYQGYFNQGQQPYYPPPPLPYDHC---HHHCGDEGSG-AGFLKGCLAALCCCCL 111

Query: 255 LDD 247
           L++
Sbjct: 112 LEE 114

>gi|147742781|gb|ABQ50561.1| hypothetical protein [Brassica rapa]

          Length = 715

 Score =  70 bits (170), Expect = 2e-010
 Identities = 50/108 (46%), Positives = 60/108 (55%), Gaps = 12/108 (11%)
 Frame = +3

Query: 252 LTDSSNRE*RDSHGGKRCNQS*SPGNSDDHDDTDCIAAAEVEEEEESHRRSTPDIPLRKE 431
           + ++   E R SHGGKR +QS +PGN  D   + C  AA  EEEE S+RRS+ D  LRK+
Sbjct: 604 VNNNGKLERRTSHGGKRSHQSLNPGNGGDRAGSGCTVAAVEEEEEGSYRRSSLDNLLRKD 663

Query: 432 DVKAVVVVDTMDIRVGVEDILRDGATEDIPEAG-------EDTEIGNL 554
            V     VDT +      DILRD A EDIPEA         D EIG L
Sbjct: 664 AVAG--TVDTREAE--AVDILRD-AAEDIPEAAAPMDNQEADNEIGIL 706

>gi|332025507|gb|EGI65670.1| hypothetical protein G5I_05770 [Acromyrmex
        echinatior]

          Length = 481

 Score =  67 bits (162), Expect = 2e-009
 Identities = 34/78 (43%), Positives = 38/78 (48%), Gaps = 3/78 (3%)
 Frame = -2

Query: 597 SYDYGKVPPETYPPPGYQSQYPPPPPVYPPSHHHEGYPQHPHGYPSYPPPRPPSRPPYEG 418
           SY Y   PP   PPP      PPPPP  PP  +    P + +  P  PPP PP  PP   
Sbjct:  45 SYSYPPPPPPPPPPP---PPPPPPPPPPPPPGYPSTQPSYSYPPPPPPPPPPPPSPPGYP 101

Query: 417 GYQEYFSGGFPPPPPPPP 364
             Q  +S   PPPPPPPP
Sbjct: 102 STQPSYSYPPPPPPPPPP 119

>gi|255721947|ref|XP_002545908.1| predicted protein [Candida tropicalis
        MYA-3404]

          Length = 257

 Score =  66 bits (160), Expect = 3e-009
 Identities = 39/95 (41%), Positives = 48/95 (50%), Gaps = 17/95 (17%)
 Frame = -2

Query: 576 PPETYPPPGYQS-----QYPPPP-PVYPPSHHHEGYPQHPHGYPSY--PPPRPPSRPPY- 424
           PP   PPP + S      YPPPP P  PP +H E      HG+P Y   PP PP  PP+ 
Sbjct: 124 PPPPPPPPHHFSGHHHGSYPPPPLPPPPPFYHGE------HGFPGYGSAPPPPPPPPPFN 177

Query: 423 --EGGYQEYFSGGFPPPPPPPPQQYNQCHHDHHYY 325
             E G+  Y S   PPPPPPPP  +   ++  + Y
Sbjct: 178 HGERGFPGYGSAPPPPPPPPPPFHHQSGNYGGYPY 212

>gi|225458705|ref|XP_002283001.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 298

 Score =  60 bits (145), Expect = 2e-007
 Identities = 30/69 (43%), Positives = 36/69 (52%)
 Frame = -2

Query: 573 PETYPPPGYQSQYPPPPPVYPPSHHHEGYPQHPHGYPSYPPPRPPSRPPYEGGYQEYFSG 394
           P+T PP GY SQ  PPPP    S  +   PQ  +  P  PPP+PP+  PY    Q   S 
Sbjct: 213 PQTCPPSGYPSQTFPPPPQPLTSQPYPPPPQPLNSQPYPPPPQPPASHPYPSPPQAPASQ 272

Query: 393 GFPPPPPPP 367
            +PPPP  P
Sbjct: 273 AYPPPPQAP 281

>gi|125552245|gb|EAY97954.1| hypothetical protein OsI_19871 [Oryza sativa Indica
        Group]

          Length = 359

 Score =  57 bits (136), Expect = 2e-006
 Identities = 30/85 (35%), Positives = 36/85 (42%), Gaps = 5/85 (5%)
 Frame = -2

Query: 594 YDYGKVPPETYPPPGYQSQYPPPPPVYPPSHHHEGYPQHPHGYPSYPPPRPPSRPPYEGG 415
           Y   + PP   PPP +   YPPPPP  P  HHH   P  P  +    P RPP  P +   
Sbjct:  15 YYGARPPPPPPPPPHHYYTYPPPPPPAPHHHHHPPPPPPPPHHHHQHPYRPPPPPHHATS 74

Query: 414 YQEYFSGGFPPPPPPPPQQYNQCHH 340
              Y+        PPPP  Y+   H
Sbjct:  75 SSSYYY-----HHPPPPHAYHGPWH 94

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,354,205,064,082
Number of Sequences: 15229318
Number of Extensions: 2354205064082
Number of Successful Extensions: 584002404
Number of sequences better than 0.0: 0