BLASTX 7.6.2
Query= UN20000 /QuerySize=650
(649 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|297844182|ref|XP_002889972.1| expressed protein [Arabidopsis ... 188 7e-046
gi|147742772|gb|ABQ50553.1| hypothetical protein [Brassica rapa] 163 2e-038
gi|8698728|gb|AAF78486.1|AC012187_6 Contains weak similarity to ... 161 7e-038
gi|18391437|ref|NP_563914.1| proline-rich family protein [Arabid... 161 7e-038
gi|255631334|gb|ACU16034.1| unknown [Glycine max] 144 9e-033
gi|255543609|ref|XP_002512867.1| LRX1, putative [Ricinus communis] 142 4e-032
gi|224055807|ref|XP_002298663.1| predicted protein [Populus tric... 139 5e-031
gi|238478456|ref|NP_001154331.1| proline-rich family protein [Ar... 138 6e-031
gi|224129224|ref|XP_002328921.1| predicted protein [Populus tric... 138 1e-030
gi|296082290|emb|CBI21295.3| unnamed protein product [Vitis vini... 135 7e-030
gi|225451569|ref|XP_002274558.1| PREDICTED: hypothetical protein... 131 1e-028
gi|218201524|gb|EEC83951.1| hypothetical protein OsI_30050 [Oryz... 116 4e-024
gi|115477537|ref|NP_001062364.1| Os08g0536400 [Oryza sativa Japo... 111 8e-023
gi|147742781|gb|ABQ50561.1| hypothetical protein [Brassica rapa] 70 2e-010
gi|332025507|gb|EGI65670.1| hypothetical protein G5I_05770 [Acro... 67 2e-009
gi|255721947|ref|XP_002545908.1| predicted protein [Candida trop... 66 3e-009
gi|225458705|ref|XP_002283001.1| PREDICTED: hypothetical protein... 60 2e-007
gi|125552245|gb|EAY97954.1| hypothetical protein OsI_19871 [Oryz... 57 2e-006
>gi|297844182|ref|XP_002889972.1| expressed protein [Arabidopsis lyrata subsp.
lyrata]
Length = 132
Score = 188 bits (476), Expect = 7e-046
Identities = 90/133 (67%), Positives = 98/133 (73%), Gaps = 19/133 (14%)
Frame = -2
Query: 600 MSYDYGKVPPETYPPPGYQSQYP-------PPPPVYP-PSHHHEGY-PQHPHGYPSYPPP 448
MSYD KVPPE+YPPPGYQS YP PPPP YP P HHEGY P PHG Y P
Sbjct: 1 MSYD--KVPPESYPPPGYQSHYPPPGYPSAPPPPGYPSPPSHHEGYPPPQPHG--GYQPY 56
Query: 447 RPPSRPPYEGGYQEYFS-GGFP-----PPPPPPPQQYNQCHHDHHYYQDSNSGCTSFLRG 286
PPS PYEGGYQ YF+ GG+P PPPPPPPQ YNQCHHDHH+YQDS+SGC SF+RG
Sbjct: 57 PPPSSRPYEGGYQGYFAGGGYPHHHHGPPPPPPPQNYNQCHHDHHHYQDSDSGCFSFVRG 116
Query: 285 CLATLCCCCLLDD 247
CLA LCCCCLL++
Sbjct: 117 CLAALCCCCLLEE 129
>gi|147742772|gb|ABQ50553.1| hypothetical protein [Brassica rapa]
Length = 116
Score = 163 bits (412), Expect = 2e-038
Identities = 77/115 (66%), Positives = 83/115 (72%), Gaps = 15/115 (13%)
Frame = -2
Query: 588 YGKVPPETY-PPPGYQSQYP-------PPPPVYPPSHHHEGY-PQHPHGYPSYPPPRPPS 436
Y KVPPE+Y PPPGYQS YP PPPP YPP HHEGY P PHGYP YPPPR
Sbjct: 3 YEKVPPESYPPPPGYQSHYPPPGYPSAPPPPGYPPP-HHEGYPPPQPHGYPPYPPPR--- 58
Query: 435 RPPYEGGYQEYFSGGFPPPPPPPPQQYNQCHHDHHYYQDSNSGCTSFLRGCLATL 271
PYEGGYQ YF+G +PPPPPPPPQQ N HDHH+YQDSNS +SFLRGCL +
Sbjct: 59 --PYEGGYQGYFAGNYPPPPPPPPQQCNHYQHDHHHYQDSNSDGSSFLRGCLCVV 111
>gi|8698728|gb|AAF78486.1|AC012187_6 Contains weak similarity to GATA-6
DNA-binding protein from Homo sapiens gb|X95701. ESTs gb|N38392,
gb|AI998367, gb|H36135, gb|Z26200 come from this gene [Arabidopsis
thaliana]
Length = 198
Score = 161 bits (407), Expect = 7e-038
Identities = 75/113 (66%), Positives = 83/113 (73%), Gaps = 13/113 (11%)
Frame = -2
Query: 564 YPPPGYQSQYPPPPPVYP-PSHHHEGYPQHPHGYPSYPPPRPPSRPPYEGGYQEYFS-GG 391
YPPPGY S PPPP YP P HHEGYP P Y YP PPS PYEGGYQ YF+ GG
Sbjct: 89 YPPPGYPS--APPPPGYPSPPSHHEGYPP-PQPYGGYP---PPSSRPYEGGYQGYFAGGG 142
Query: 390 FP-----PPPPPPPQQYNQCHHDHHYYQDSNSGCTSFLRGCLATLCCCCLLDD 247
+P PPPPPPPQ Y+ CHHDHH+YQDS+SGC SF+RGCLA LCCCCLL++
Sbjct: 143 YPHQHHGPPPPPPPQNYDHCHHDHHHYQDSDSGCFSFIRGCLAALCCCCLLEE 195
>gi|18391437|ref|NP_563914.1| proline-rich family protein [Arabidopsis
thaliana]
Length = 129
Score = 161 bits (407), Expect = 7e-038
Identities = 75/113 (66%), Positives = 83/113 (73%), Gaps = 13/113 (11%)
Frame = -2
Query: 564 YPPPGYQSQYPPPPPVYP-PSHHHEGYPQHPHGYPSYPPPRPPSRPPYEGGYQEYFS-GG 391
YPPPGY S PPPP YP P HHEGYP P Y YP PPS PYEGGYQ YF+ GG
Sbjct: 20 YPPPGYPS--APPPPGYPSPPSHHEGYPP-PQPYGGYP---PPSSRPYEGGYQGYFAGGG 73
Query: 390 FP-----PPPPPPPQQYNQCHHDHHYYQDSNSGCTSFLRGCLATLCCCCLLDD 247
+P PPPPPPPQ Y+ CHHDHH+YQDS+SGC SF+RGCLA LCCCCLL++
Sbjct: 74 YPHQHHGPPPPPPPQNYDHCHHDHHHYQDSDSGCFSFIRGCLAALCCCCLLEE 126
>gi|255631334|gb|ACU16034.1| unknown [Glycine max]
Length = 118
Score = 144 bits (363), Expect = 9e-033
Identities = 64/117 (54%), Positives = 78/117 (66%), Gaps = 6/117 (5%)
Frame = -2
Query: 591 DYGKVPPETYPPPGYQSQYPPPPPVYPPSHHHEGY--PQHPHGYPSYPPPRPPSRPPYEG 418
++ ++ E+YPPPG+ S YPPP P YP + HEGY P P GY YPPP PP PPY+
Sbjct: 3 NFQRISHESYPPPGHGSPYPPPQPGYPSAPPHEGYPPPPPPPGYGGYPPPHPPQHPPYD- 61
Query: 417 GYQEYFSGGFPPPPPPPPQQYNQCHHDHHYYQDSNSGCTSFLRGCLATLCCCCLLDD 247
YQ YF G PPPPPPP Y+ H DHH+ D GC SFLRGC+A LCCCC+L++
Sbjct: 62 SYQGYFDNGRPPPPPPP--HYHYQHVDHHHLHD-EPGCFSFLRGCIAALCCCCVLEE 115
>gi|255543609|ref|XP_002512867.1| LRX1, putative [Ricinus communis]
Length = 118
Score = 142 bits (357), Expect = 4e-032
Identities = 70/122 (57%), Positives = 82/122 (67%), Gaps = 17/122 (13%)
Frame = -2
Query: 588 YGKVPPETYPPPGYQSQYPPP------PPVYPPSHHHEGYPQHPHGYPSYPPPRPPSRPP 427
Y K P E YPPPGY YPPP PP PP +EGYP P GYP PPP P R
Sbjct: 3 YQKPPHEHYPPPGYAPSYPPPGYTPSAPP--PPPQPYEGYP--PPGYP--PPPGP--RQQ 54
Query: 426 YEGGYQEYFSGGFPPPP--PPPPQQYNQCHHDHHYYQDSNSGCTSFLRGCLATLCCCCLL 253
YE GYQ YF+ G+PPPP P PPQ ++Q H++HH+YQD+ GCTSF +GCLA LCCCC+L
Sbjct: 55 YE-GYQGYFAEGYPPPPPRPGPPQYHHQYHYEHHHYQDNTGGCTSFFQGCLAALCCCCVL 113
Query: 252 DD 247
D+
Sbjct: 114 DE 115
>gi|224055807|ref|XP_002298663.1| predicted protein [Populus trichocarpa]
Length = 125
Score = 139 bits (348), Expect = 5e-031
Identities = 68/128 (53%), Positives = 81/128 (63%), Gaps = 22/128 (17%)
Frame = -2
Query: 588 YGKVPPETYPPPGYQSQYPPP--PPVYPP-------SHHHEGYP---QHPHGYPSYPPPR 445
Y K P + YPP GY YPPP PP PP + + GYP P GY YPPP
Sbjct: 3 YQKAPHQPYPPSGYSPPYPPPGYPPTTPPYGGYPPTTPPYGGYPPPGAPPPGYSGYPPPG 62
Query: 444 PPSRPPYEGGYQEYFSGGFPPPPPPP-PQQYNQC-HHDHHYYQDSNSGCTSFLRGCLATL 271
PP GYQ YF+ G+PPPPPPP PQQY +C H++HH+YQD GC+SFLRGCLA L
Sbjct: 63 PPR------GYQGYFAEGYPPPPPPPGPQQYQECYHYEHHHYQD--DGCSSFLRGCLAAL 114
Query: 270 CCCCLLDD 247
CCCC+L++
Sbjct: 115 CCCCVLEE 122
>gi|238478456|ref|NP_001154331.1| proline-rich family protein [Arabidopsis
thaliana]
Length = 162
Score = 138 bits (347), Expect = 6e-031
Identities = 66/101 (65%), Positives = 72/101 (71%), Gaps = 13/101 (12%)
Frame = -2
Query: 564 YPPPGYQSQYPPPPPVYP-PSHHHEGYPQHPHGYPSYPPPRPPSRPPYEGGYQEYFS-GG 391
YPPPGY S PPPP YP P HHEGYP P Y YP PPS PYEGGYQ YF+ GG
Sbjct: 20 YPPPGYPS--APPPPGYPSPPSHHEGYPP-PQPYGGYP---PPSSRPYEGGYQGYFAGGG 73
Query: 390 FP-----PPPPPPPQQYNQCHHDHHYYQDSNSGCTSFLRGC 283
+P PPPPPPPQ Y+ CHHDHH+YQDS+SGC SF+RGC
Sbjct: 74 YPHQHHGPPPPPPPQNYDHCHHDHHHYQDSDSGCFSFIRGC 114
>gi|224129224|ref|XP_002328921.1| predicted protein [Populus trichocarpa]
Length = 143
Score = 138 bits (345), Expect = 1e-030
Identities = 68/124 (54%), Positives = 77/124 (62%), Gaps = 21/124 (16%)
Frame = -2
Query: 564 YPPPGYQSQYPPP------PPVYPPSHHHEGY---PQHPHGYPSYPPPRPPSRPPYEG-- 418
YPPPGY PPP PP PP HEGY P P GYP YPPP PP P Y G
Sbjct: 20 YPPPGYSPSAPPPPPHEGYPPPPPPPPPHEGYPPPPPPPPGYPGYPPPGPPP-PGYPGYP 78
Query: 417 ------GYQEYFSGGFPPPPPPPPQQYNQ-CHHDHHYYQDSNSGCTSFLRGCLATLCCCC 259
GYQ YF+ G+P PP PP QY Q CH++HH YQD+ GC+SFLRGCLA LCCCC
Sbjct: 79 PPGPPRGYQGYFAEGYPTPPGPP--QYQQCCHYEHHPYQDNYGGCSSFLRGCLAALCCCC 136
Query: 258 LLDD 247
+L++
Sbjct: 137 VLEE 140
>gi|296082290|emb|CBI21295.3| unnamed protein product [Vitis vinifera]
Length = 136
Score = 135 bits (338), Expect = 7e-030
Identities = 70/120 (58%), Positives = 78/120 (65%), Gaps = 17/120 (14%)
Frame = -2
Query: 588 YGKVPPETYPPPGYQSQYPPP------PPVYPPSHHHEGYPQHPHGYPSYPPPRPPSRPP 427
Y KVP E YPPPGY YPPP PP PP G P P GYP YPPP PP PP
Sbjct: 24 YQKVPQEPYPPPGYSVPYPPPSGYPSAPPPPPP-----GCPP-PPGYPGYPPPPPPPGPP 77
Query: 426 YEGGYQEYFSGGFPPPPPPPPQQYNQCHHDHHYYQDSNSGCTSFLRGCLATLCCCCLLDD 247
Y+ GYQ YF+ G+ PPPPPPPQQY Q + + YQD SG +SFL GCLA LCCCCLL++
Sbjct: 78 YQ-GYQGYFNEGY-PPPPPPPQQYQQ--YQQYQYQD-QSGPSSFLPGCLAALCCCCLLEE 132
>gi|225451569|ref|XP_002274558.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 115
Score = 131 bits (328), Expect = 1e-028
Identities = 68/117 (58%), Positives = 76/117 (64%), Gaps = 17/117 (14%)
Frame = -2
Query: 579 VPPETYPPPGYQSQYPPP------PPVYPPSHHHEGYPQHPHGYPSYPPPRPPSRPPYEG 418
VP E YPPPGY YPPP PP PP G P P GYP YPPP PP PPY+
Sbjct: 6 VPQEPYPPPGYSVPYPPPSGYPSAPPPPPP-----GCPP-PPGYPGYPPPPPPPGPPYQ- 58
Query: 417 GYQEYFSGGFPPPPPPPPQQYNQCHHDHHYYQDSNSGCTSFLRGCLATLCCCCLLDD 247
GYQ YF+ G+ PPPPPPPQQY Q + + YQD SG +SFL GCLA LCCCCLL++
Sbjct: 59 GYQGYFNEGY-PPPPPPPQQYQQ--YQQYQYQD-QSGPSSFLPGCLAALCCCCLLEE 111
>gi|218201524|gb|EEC83951.1| hypothetical protein OsI_30050 [Oryza sativa Indica
Group]
Length = 120
Score = 116 bits (288), Expect = 4e-024
Identities = 65/126 (51%), Positives = 77/126 (61%), Gaps = 23/126 (18%)
Frame = -2
Query: 588 YGKVPP-ETYPPPGY--QSQYPPPPP---VYPPSHHHEGYPQHPHGYPSYPPPRPPSRPP 427
Y +VPP E YPPPGY YP PPP VYPP +GYP HG YPPP+ P PP
Sbjct: 3 YQRVPPDEPYPPPGYPQSGPYPYPPPSGAVYPP----QGYPS-SHGV--YPPPQGPYPPP 55
Query: 426 YE---GGYQEYFSGG---FPPPPPPPPQQYNQCHHDHHYYQDSNSGCTSFLRGCLATLCC 265
++ GYQ YF+ G + PPPPPPP Y+ C HH+ D SG FL+GCLA LCC
Sbjct: 56 HQPPPPGYQGYFNQGQQPYYPPPPPPPPPYDHC---HHHCGDEGSG-AGFLKGCLAALCC 111
Query: 264 CCLLDD 247
CCLL++
Sbjct: 112 CCLLEE 117
>gi|115477537|ref|NP_001062364.1| Os08g0536400 [Oryza sativa Japonica Group]
Length = 117
Score = 111 bits (277), Expect = 8e-023
Identities = 63/123 (51%), Positives = 74/123 (60%), Gaps = 20/123 (16%)
Frame = -2
Query: 588 YGKVPP-ETYPPPGY--QSQYPPPPP---VYPPSHHHEGYPQHPHGYPSYPPPRPPSRPP 427
Y +VPP E YPPPGY YP PPP VYPP +GYP HG YPPP+ P PP
Sbjct: 3 YQRVPPDEPYPPPGYPQSGPYPYPPPSGAVYPP----QGYPS-SHGV--YPPPQGPYPPP 55
Query: 426 YE---GGYQEYFSGGFPPPPPPPPQQYNQCHHDHHYYQDSNSGCTSFLRGCLATLCCCCL 256
++ GYQ YF+ G P PPPP Y+ C HH+ D SG FL+GCLA LCCCCL
Sbjct: 56 HQPPPPGYQGYFNQGQQPYYPPPPLPYDHC---HHHCGDEGSG-AGFLKGCLAALCCCCL 111
Query: 255 LDD 247
L++
Sbjct: 112 LEE 114
>gi|147742781|gb|ABQ50561.1| hypothetical protein [Brassica rapa]
Length = 715
Score = 70 bits (170), Expect = 2e-010
Identities = 50/108 (46%), Positives = 60/108 (55%), Gaps = 12/108 (11%)
Frame = +3
Query: 252 LTDSSNRE*RDSHGGKRCNQS*SPGNSDDHDDTDCIAAAEVEEEEESHRRSTPDIPLRKE 431
+ ++ E R SHGGKR +QS +PGN D + C AA EEEE S+RRS+ D LRK+
Sbjct: 604 VNNNGKLERRTSHGGKRSHQSLNPGNGGDRAGSGCTVAAVEEEEEGSYRRSSLDNLLRKD 663
Query: 432 DVKAVVVVDTMDIRVGVEDILRDGATEDIPEAG-------EDTEIGNL 554
V VDT + DILRD A EDIPEA D EIG L
Sbjct: 664 AVAG--TVDTREAE--AVDILRD-AAEDIPEAAAPMDNQEADNEIGIL 706
>gi|332025507|gb|EGI65670.1| hypothetical protein G5I_05770 [Acromyrmex
echinatior]
Length = 481
Score = 67 bits (162), Expect = 2e-009
Identities = 34/78 (43%), Positives = 38/78 (48%), Gaps = 3/78 (3%)
Frame = -2
Query: 597 SYDYGKVPPETYPPPGYQSQYPPPPPVYPPSHHHEGYPQHPHGYPSYPPPRPPSRPPYEG 418
SY Y PP PPP PPPPP PP + P + + P PPP PP PP
Sbjct: 45 SYSYPPPPPPPPPPP---PPPPPPPPPPPPPGYPSTQPSYSYPPPPPPPPPPPPSPPGYP 101
Query: 417 GYQEYFSGGFPPPPPPPP 364
Q +S PPPPPPPP
Sbjct: 102 STQPSYSYPPPPPPPPPP 119
>gi|255721947|ref|XP_002545908.1| predicted protein [Candida tropicalis
MYA-3404]
Length = 257
Score = 66 bits (160), Expect = 3e-009
Identities = 39/95 (41%), Positives = 48/95 (50%), Gaps = 17/95 (17%)
Frame = -2
Query: 576 PPETYPPPGYQS-----QYPPPP-PVYPPSHHHEGYPQHPHGYPSY--PPPRPPSRPPY- 424
PP PPP + S YPPPP P PP +H E HG+P Y PP PP PP+
Sbjct: 124 PPPPPPPPHHFSGHHHGSYPPPPLPPPPPFYHGE------HGFPGYGSAPPPPPPPPPFN 177
Query: 423 --EGGYQEYFSGGFPPPPPPPPQQYNQCHHDHHYY 325
E G+ Y S PPPPPPPP + ++ + Y
Sbjct: 178 HGERGFPGYGSAPPPPPPPPPPFHHQSGNYGGYPY 212
>gi|225458705|ref|XP_002283001.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 298
Score = 60 bits (145), Expect = 2e-007
Identities = 30/69 (43%), Positives = 36/69 (52%)
Frame = -2
Query: 573 PETYPPPGYQSQYPPPPPVYPPSHHHEGYPQHPHGYPSYPPPRPPSRPPYEGGYQEYFSG 394
P+T PP GY SQ PPPP S + PQ + P PPP+PP+ PY Q S
Sbjct: 213 PQTCPPSGYPSQTFPPPPQPLTSQPYPPPPQPLNSQPYPPPPQPPASHPYPSPPQAPASQ 272
Query: 393 GFPPPPPPP 367
+PPPP P
Sbjct: 273 AYPPPPQAP 281
>gi|125552245|gb|EAY97954.1| hypothetical protein OsI_19871 [Oryza sativa Indica
Group]
Length = 359
Score = 57 bits (136), Expect = 2e-006
Identities = 30/85 (35%), Positives = 36/85 (42%), Gaps = 5/85 (5%)
Frame = -2
Query: 594 YDYGKVPPETYPPPGYQSQYPPPPPVYPPSHHHEGYPQHPHGYPSYPPPRPPSRPPYEGG 415
Y + PP PPP + YPPPPP P HHH P P + P RPP P +
Sbjct: 15 YYGARPPPPPPPPPHHYYTYPPPPPPAPHHHHHPPPPPPPPHHHHQHPYRPPPPPHHATS 74
Query: 414 YQEYFSGGFPPPPPPPPQQYNQCHH 340
Y+ PPPP Y+ H
Sbjct: 75 SSSYYY-----HHPPPPHAYHGPWH 94
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,354,205,064,082
Number of Sequences: 15229318
Number of Extensions: 2354205064082
Number of Successful Extensions: 584002404
Number of sequences better than 0.0: 0