BLASTX 7.6.2
Query= UN74855 /QuerySize=782
(781 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|18422990|ref|NP_568706.1| uncharacterized protein [Arabidopsi... 245 7e-063
gi|297795649|ref|XP_002865709.1| hypothetical protein ARALYDRAFT... 240 2e-061
gi|21536969|gb|AAM61310.1| unknown [Arabidopsis thaliana] 236 3e-060
gi|297829232|ref|XP_002882498.1| hypothetical protein ARALYDRAFT... 176 3e-042
gi|186509861|ref|NP_001118595.1| uncharacterized protein [Arabid... 158 1e-036
gi|255551795|ref|XP_002516943.1| conserved hypothetical protein ... 102 9e-020
gi|224107110|ref|XP_002314378.1| predicted protein [Populus tric... 101 2e-019
gi|296083358|emb|CBI22994.3| unnamed protein product [Vitis vini... 87 2e-015
gi|225431743|ref|XP_002270026.1| PREDICTED: hypothetical protein... 84 3e-014
gi|255645721|gb|ACU23354.1| unknown [Glycine max] 70 2e-010
gi|30680042|ref|NP_187343.2| proline-rich family protein [Arabid... 63 5e-008
gi|6728994|gb|AAF26991.1|AC016827_2 unknown protein [Arabidopsis... 63 5e-008
>gi|18422990|ref|NP_568706.1| uncharacterized protein [Arabidopsis thaliana]
Length = 396
Score = 245 bits (624), Expect = 7e-063
Identities = 141/192 (73%), Positives = 152/192 (79%), Gaps = 23/192 (11%)
Frame = +3
Query: 177 MVEAKDQQQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSS 356
MVEAKD Q+MGDGMQC+NHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHL+SSST SS
Sbjct: 1 MVEAKD--QDMGDGMQCINHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLTSSST-SS 57
Query: 357 SPSFRSDSVATSTTTT---VSASLSLSASGARNLTANNNNNNKLPFLLA--KKKALTPSS 521
SPSFRSDSV ++TT + +SASLSLS SG A NNNN+KLPFLLA KKK LT SS
Sbjct: 58 SPSFRSDSVGSTTTASAANLSASLSLSVSG-----ATNNNNSKLPFLLAKKKKKMLTASS 112
Query: 522 SSS--SNIVYKRSQS--TTTTTYRVSDSSPRKRSGFWSFLHLHSYKHHSSSKKVGNFHDS 689
SSS +NIVYKRSQS TT TTY SD SPRKR+GFWSF HL+S K H SSKKVGNFH
Sbjct: 113 SSSTTANIVYKRSQSTRTTKTTYGDSDLSPRKRNGFWSFFHLYSSKQHGSSKKVGNFH-- 170
Query: 690 SRPQQPTKHTET 725
QP TET
Sbjct: 171 ----QPISQTET 178
>gi|297795649|ref|XP_002865709.1| hypothetical protein ARALYDRAFT_917876
[Arabidopsis lyrata subsp. lyrata]
Length = 384
Score = 240 bits (612), Expect = 2e-061
Identities = 139/187 (74%), Positives = 151/187 (80%), Gaps = 17/187 (9%)
Frame = +3
Query: 183 EAKDQQQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSSP 362
E KD Q+MGDGMQC+NHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSST SSSP
Sbjct: 3 EVKD--QDMGDGMQCINHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSST-SSSP 59
Query: 363 SFRSDSVATSTTTTVSASLSLSASGARNLTANNNNNNKLPFLLA--KKKALTPSSS-SSS 533
SFRSDSV STTT +ASLSLS SGA NNNKLPFLLA KKK LT SSS +++
Sbjct: 60 SFRSDSVG-STTTASAASLSLSVSGA-------TNNNKLPFLLAKKKKKMLTASSSATTA 111
Query: 534 NIVYKRSQS--TTTTTYRVSDSSPRKRSGFWSFLHLHSYKHHSSSKKVGNFHD-SSRPQQ 704
NIVYKRSQS TT TTY SD SPRKR+GFWSFLHL+S KHH SSKKVGNFH +S+ +
Sbjct: 112 NIVYKRSQSTRTTKTTYGDSDLSPRKRNGFWSFLHLYSSKHHGSSKKVGNFHQPTSQIEI 171
Query: 705 PTKHTET 725
T+ TET
Sbjct: 172 KTELTET 178
>gi|21536969|gb|AAM61310.1| unknown [Arabidopsis thaliana]
Length = 388
Score = 236 bits (601), Expect = 3e-060
Identities = 134/182 (73%), Positives = 144/182 (79%), Gaps = 21/182 (11%)
Frame = +3
Query: 207 MGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSSPSFRSDSVA 386
MGDGMQC+NHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHL+SSST SSSPSFRSDSV
Sbjct: 1 MGDGMQCINHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLTSSST-SSSPSFRSDSVG 59
Query: 387 TSTTTT---VSASLSLSASGARNLTANNNNNNKLPFLLA--KKKALTPSSSSS--SNIVY 545
++TT + +SASLSLS SG A NNNN+KLPFLLA KKK LT SSSSS +NIVY
Sbjct: 60 STTTASAANLSASLSLSVSG-----ATNNNNSKLPFLLAKKKKKMLTASSSSSTTANIVY 114
Query: 546 KRSQS--TTTTTYRVSDSSPRKRSGFWSFLHLHSYKHHSSSKKVGNFHDSSRPQQPTKHT 719
KRSQS TT TTY SD SPRKR+GFWSF HL+S K H SSKKVGNFH QP T
Sbjct: 115 KRSQSTRTTKTTYGDSDLSPRKRNGFWSFFHLYSSKQHGSSKKVGNFH------QPISQT 168
Query: 720 ET 725
ET
Sbjct: 169 ET 170
>gi|297829232|ref|XP_002882498.1| hypothetical protein ARALYDRAFT_478007
[Arabidopsis lyrata subsp. lyrata]
Length = 372
Score = 176 bits (446), Expect = 3e-042
Identities = 110/195 (56%), Positives = 133/195 (68%), Gaps = 32/195 (16%)
Frame = +3
Query: 183 EAKDQQQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPK--HLSSSSTSSS 356
E KD Q+MG+GMQC+ HP+TKNPGGICA CLQEKLGKLVTSSFP+PK HLSSSS S
Sbjct: 3 ELKD--QDMGEGMQCIRHPYTKNPGGICALCLQEKLGKLVTSSFPVPKPNHLSSSSPKSF 60
Query: 357 SPSFRSDSVATSTTTTVSASLSLSASGARNLTANNNNNNKLPFLLAKKK----------A 506
+P STT++++ SLS SAS R+ T+NNN LPFLLAKKK +
Sbjct: 61 TP---------STTSSLALSLS-SASNGRDSTSNNN----LPFLLAKKKKNMLAASSSSS 106
Query: 507 LTPSSSSSSNIVYKRSQSTTTTTYRVSDSSPRKRSGFWSFLHLHSYKHH--SSSKKVGNF 680
+ SSSSS+N++YKRS+S T Y S S RKRSGFWSFLHL+S KH +++KKV NF
Sbjct: 107 SSSSSSSSANLIYKRSKS-TAAAYGES-FSQRKRSGFWSFLHLYSSKHQISNTTKKVDNF 164
Query: 681 HDSSRPQQPTKHTET 725
S R Q+ TET
Sbjct: 165 SHSRRNQRTESTTET 179
>gi|186509861|ref|NP_001118595.1| uncharacterized protein [Arabidopsis
thaliana]
Length = 369
Score = 158 bits (398), Expect = 1e-036
Identities = 96/186 (51%), Positives = 119/186 (63%), Gaps = 12/186 (6%)
Frame = +3
Query: 180 VEAKDQQQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPK--HLSSSSTSS 353
VE KD QQ+MG+GMQC+ HP+TKNPGGICA CLQEKLGKLVTSSFP+PK HLSSSS S
Sbjct: 2 VELKD-QQDMGEGMQCITHPYTKNPGGICALCLQEKLGKLVTSSFPVPKPNHLSSSSPKS 60
Query: 354 SSPSFRSDSVATSTTTTVSASLSLSASGARNLTANNNNNNKLPFLLAKKKALTPSSSSSS 533
+PS S +++ S SAS ++ NL K + + + SSSSS+
Sbjct: 61 FTPSTTSLALSLS-----SASNGRDSTNNNNLPFLLAKKKKNMLAASSSSSSSSSSSSSA 115
Query: 534 NIVYKRSQSTTTTTYRVSDSSPRKRSGFWSFLHLHSYKHH--SSSKKVGNFHDSSRPQQP 707
N++YKRS+S T Y S S RKRSGFWSF HL+S KH +++KKV NF R Q+
Sbjct: 116 NLIYKRSKS-TAAAYGES-FSQRKRSGFWSFFHLYSSKHQISNTTKKVDNFSHLRRNQRT 173
Query: 708 TKHTET 725
TET
Sbjct: 174 ESKTET 179
>gi|255551795|ref|XP_002516943.1| conserved hypothetical protein [Ricinus
communis]
Length = 450
Score = 102 bits (252), Expect = 9e-020
Identities = 53/103 (51%), Positives = 67/103 (65%), Gaps = 13/103 (12%)
Frame = +3
Query: 198 QQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSSPSFRSD 377
+++MGDGMQC +HP+ NPGGICAFCLQEKLGKLV+SSFPLP + +SS+SSSSPSFRSD
Sbjct: 27 EEDMGDGMQCSDHPYRNNPGGICAFCLQEKLGKLVSSSFPLP--IRASSSSSSSPSFRSD 84
Query: 378 -----------SVATSTTTTVSASLSLSASGARNLTANNNNNN 473
+ + +ASLSL+ N+ NN
Sbjct: 85 IGSGVVGVGVVGASNGVSVGPAASLSLAVHSTSTKGRNDGGNN 127
>gi|224107110|ref|XP_002314378.1| predicted protein [Populus trichocarpa]
Length = 389
Score = 101 bits (249), Expect = 2e-019
Identities = 52/93 (55%), Positives = 65/93 (69%), Gaps = 3/93 (3%)
Frame = +3
Query: 198 QQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSSPSFRS- 374
++++GDGMQC +HP+ NPGGICAFCLQEKLGKLV+SSFPLP + SS+SSSSPSFRS
Sbjct: 1 EEDLGDGMQCSDHPYRNNPGGICAFCLQEKLGKLVSSSFPLP--IRGSSSSSSSPSFRSV 58
Query: 375 DSVATSTTTTVSASLSLSASGARNLTANNNNNN 473
V S+ SLSL+A N+ +N
Sbjct: 59 IGVGGSSNVGAGTSLSLAARPTTTKCRNDGGSN 91
>gi|296083358|emb|CBI22994.3| unnamed protein product [Vitis vinifera]
Length = 387
Score = 87 bits (215), Expect = 2e-015
Identities = 59/154 (38%), Positives = 80/154 (51%), Gaps = 15/154 (9%)
Frame = +3
Query: 198 QQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSSPSFRSD 377
+ ++G+GMQC +HP+ NPGGICAFCLQEKLGKL+ + + +SS S
Sbjct: 48 EDDVGEGMQCSDHPYRNNPGGICAFCLQEKLGKLIGGGAGVGVGVGGGGGGASSTSL--S 105
Query: 378 SVATSTTTTVSASLSLSASGARNLTANNNNNNKLPFLLAKKKALTPSSSSSSNIVYKRSQ 557
TS++++ SAS G + A KKK S + IV KRS+
Sbjct: 106 VRPTSSSSSYSASKDCHYHGNYSRRA----RIPFLLAQKKKKKKEVMGSDAVGIVLKRSK 161
Query: 558 STTT--------TTYRVSDSSPRKRSGFWSFLHL 635
STTT + +D SP+KR GFWSFL+L
Sbjct: 162 STTTPRRGHFLVESEDANDYSPQKR-GFWSFLYL 194
>gi|225431743|ref|XP_002270026.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 420
Score = 84 bits (205), Expect = 3e-014
Identities = 39/60 (65%), Positives = 49/60 (81%), Gaps = 3/60 (5%)
Frame = +3
Query: 198 QQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSSPSFRSD 377
+ ++G+GMQC +HP+ NPGGICAFCLQEKLGKLV+SSFP + S+SSSSPSFRS+
Sbjct: 9 EDDVGEGMQCSDHPYRNNPGGICAFCLQEKLGKLVSSSFP---NAIFPSSSSSSPSFRSE 65
>gi|255645721|gb|ACU23354.1| unknown [Glycine max]
Length = 324
Score = 70 bits (171), Expect = 2e-010
Identities = 38/66 (57%), Positives = 44/66 (66%), Gaps = 5/66 (7%)
Frame = +3
Query: 195 QQQEMGDGMQCVNHPF----TKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSSP 362
+ E+ DGMQC+NHP NPGGICA CLQ+KL L++SSFP SSS SSSSP
Sbjct: 7 RHNEISDGMQCMNHPHRNNNNNNPGGICALCLQDKLRNLLSSSFPTSSPPFSSS-SSSSP 65
Query: 363 SFRSDS 380
SF S S
Sbjct: 66 SFTSSS 71
>gi|30680042|ref|NP_187343.2| proline-rich family protein [Arabidopsis
thaliana]
Length = 214
Score = 63 bits (151), Expect = 5e-008
Identities = 32/46 (69%), Positives = 36/46 (78%), Gaps = 2/46 (4%)
Frame = -1
Query: 373 DLKEGEEEEVEEEERC--LGRGKEEVTSFPSFSWRQKAQIPPGFFV 242
++ EG ++ EEEER LG GKEEVTS PSFSWR KAQIPPGFFV
Sbjct: 169 EVVEGVKDLGEEEERWLGLGTGKEEVTSLPSFSWRHKAQIPPGFFV 214
>gi|6728994|gb|AAF26991.1|AC016827_2 unknown protein [Arabidopsis thaliana]
Length = 207
Score = 63 bits (151), Expect = 5e-008
Identities = 32/46 (69%), Positives = 36/46 (78%), Gaps = 2/46 (4%)
Frame = -1
Query: 373 DLKEGEEEEVEEEERC--LGRGKEEVTSFPSFSWRQKAQIPPGFFV 242
++ EG ++ EEEER LG GKEEVTS PSFSWR KAQIPPGFFV
Sbjct: 162 EVVEGVKDLGEEEERWLGLGTGKEEVTSLPSFSWRHKAQIPPGFFV 207
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 798,366,029,682
Number of Sequences: 15229318
Number of Extensions: 798366029682
Number of Successful Extensions: 240520672
Number of sequences better than 0.0: 0