BLASTX 7.6.2
Query= UN72791 /QuerySize=666
(665 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|297795649|ref|XP_002865709.1| hypothetical protein ARALYDRAFT... 217 1e-054
gi|21536969|gb|AAM61310.1| unknown [Arabidopsis thaliana] 206 3e-051
gi|18422990|ref|NP_568706.1| uncharacterized protein [Arabidopsi... 203 2e-050
gi|297829232|ref|XP_002882498.1| hypothetical protein ARALYDRAFT... 161 7e-038
gi|186509861|ref|NP_001118595.1| uncharacterized protein [Arabid... 141 8e-032
gi|255551795|ref|XP_002516943.1| conserved hypothetical protein ... 98 1e-018
gi|224107110|ref|XP_002314378.1| predicted protein [Populus tric... 96 4e-018
gi|296083358|emb|CBI22994.3| unnamed protein product [Vitis vini... 89 4e-016
gi|225431743|ref|XP_002270026.1| PREDICTED: hypothetical protein... 82 7e-014
gi|255645721|gb|ACU23354.1| unknown [Glycine max] 72 6e-011
gi|30680042|ref|NP_187343.2| proline-rich family protein [Arabid... 64 2e-008
gi|6728994|gb|AAF26991.1|AC016827_2 unknown protein [Arabidopsis... 64 2e-008
gi|119176049|ref|XP_001240156.1| hypothetical protein CIMG_09777... 59 4e-007
gi|303318213|ref|XP_003069106.1| hypothetical protein CPC735_022... 59 4e-007
gi|320031713|gb|EFW13672.1| conserved hypothetical protein [Cocc... 59 4e-007
>gi|297795649|ref|XP_002865709.1| hypothetical protein ARALYDRAFT_917876
[Arabidopsis lyrata subsp. lyrata]
Length = 384
Score = 217 bits (551), Expect = 1e-054
Identities = 121/163 (74%), Positives = 133/163 (81%), Gaps = 15/163 (9%)
Frame = +3
Query: 195 QQQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSPPSFRS 374
+ Q+MGDGMQC+NHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSS PSFRS
Sbjct: 5 KDQDMGDGMQCINHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSS-PSFRS 63
Query: 375 DSLATSTTTTTVSASLSLSASGARNVTANNNNNKLPFLLA--KKKALTPSSSST--NIVY 542
DS+ +TTT +ASLSLS SGA NNNKLPFLLA KKK LT SSS+T NIVY
Sbjct: 64 DSV--GSTTTASAASLSLSVSGA------TNNNKLPFLLAKKKKKMLTASSSATTANIVY 115
Query: 543 KRSQS--TTTTAYRVSDSTPRKRSGFWSFLHLHSYKHNSSSRK 665
KRSQS TT T Y SD +PRKR+GFWSFLHL+S KH+ SS+K
Sbjct: 116 KRSQSTRTTKTTYGDSDLSPRKRNGFWSFLHLYSSKHHGSSKK 158
>gi|21536969|gb|AAM61310.1| unknown [Arabidopsis thaliana]
Length = 388
Score = 206 bits (523), Expect = 3e-051
Identities = 115/157 (73%), Positives = 128/157 (81%), Gaps = 14/157 (8%)
Frame = +3
Query: 207 MGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSPPSFRSDSLA 386
MGDGMQC+NHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHL+SSSTSSS PSFRSDS+
Sbjct: 1 MGDGMQCINHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLTSSSTSSS-PSFRSDSVG 59
Query: 387 TSTTTT--TVSASLSLSASGARNVTANNNNNKLPFLLA--KKKALTPSSSST---NIVYK 545
++TT + +SASLSLS SGA NNNN+KLPFLLA KKK LT SSSS+ NIVYK
Sbjct: 60 STTTASAANLSASLSLSVSGA----TNNNNSKLPFLLAKKKKKMLTASSSSSTTANIVYK 115
Query: 546 RSQS--TTTTAYRVSDSTPRKRSGFWSFLHLHSYKHN 650
RSQS TT T Y SD +PRKR+GFWSF HL+S K +
Sbjct: 116 RSQSTRTTKTTYGDSDLSPRKRNGFWSFFHLYSSKQH 152
>gi|18422990|ref|NP_568706.1| uncharacterized protein [Arabidopsis thaliana]
Length = 396
Score = 203 bits (515), Expect = 2e-050
Identities = 116/170 (68%), Positives = 131/170 (77%), Gaps = 12/170 (7%)
Frame = +3
Query: 177 MVGAKDQQQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSS 356
MV AKD Q+MGDGMQC+NHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHL+SSSTSSS
Sbjct: 1 MVEAKD--QDMGDGMQCINHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLTSSSTSSS 58
Query: 357 PPSFRSDSLATSTTTT--TVSASLSLSASGARNVTANNNNNKLPFLLAKKKALTPSSSST 530
PSFRSDS+ ++TT + +SASLSLS SGA N NN+ KKK LT SSSS+
Sbjct: 59 -PSFRSDSVGSTTTASAANLSASLSLSVSGATN--NNNSKLPFLLAKKKKKMLTASSSSS 115
Query: 531 ---NIVYKRSQS--TTTTAYRVSDSTPRKRSGFWSFLHLHSYKHNSSSRK 665
NIVYKRSQS TT T Y SD +PRKR+GFWSF HL+S K + SS+K
Sbjct: 116 TTANIVYKRSQSTRTTKTTYGDSDLSPRKRNGFWSFFHLYSSKQHGSSKK 165
>gi|297829232|ref|XP_002882498.1| hypothetical protein ARALYDRAFT_478007
[Arabidopsis lyrata subsp. lyrata]
Length = 372
Score = 161 bits (407), Expect = 7e-038
Identities = 97/168 (57%), Positives = 116/168 (69%), Gaps = 29/168 (17%)
Frame = +3
Query: 195 QQQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPK--HLSSSSTSSSPPSF 368
+ Q+MG+GMQC+ HP+TKNPGGICA CLQEKLGKLVTSSFP+PK HLSSSS S PS
Sbjct: 5 KDQDMGEGMQCIRHPYTKNPGGICALCLQEKLGKLVTSSFPVPKPNHLSSSSPKSFTPS- 63
Query: 369 RSDSLATSTTTTTVSASLSLSASGARNVTANNNNNKLPFLLAKKK-----------ALTP 515
TT++++ SLS SAS R+ T+NNN LPFLLAKKK + +
Sbjct: 64 ---------TTSSLALSLS-SASNGRDSTSNNN---LPFLLAKKKKNMLAASSSSSSSSS 110
Query: 516 SSSSTNIVYKRSQSTTTTAYRVSDSTPRKRSGFWSFLHLHSYKHNSSS 659
SSSS N++YKRS+S T AY S S RKRSGFWSFLHL+S KH S+
Sbjct: 111 SSSSANLIYKRSKS-TAAAYGESFS-QRKRSGFWSFLHLYSSKHQISN 156
>gi|186509861|ref|NP_001118595.1| uncharacterized protein [Arabidopsis
thaliana]
Length = 369
Score = 141 bits (355), Expect = 8e-032
Identities = 85/158 (53%), Positives = 103/158 (65%), Gaps = 12/158 (7%)
Frame = +3
Query: 198 QQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPK--HLSSSSTSSSPPSFR 371
QQ+MG+GMQC+ HP+TKNPGGICA CLQEKLGKLVTSSFP+PK HLSSSS S PS
Sbjct: 7 QQDMGEGMQCITHPYTKNPGGICALCLQEKLGKLVTSSFPVPKPNHLSSSSPKSFTPSTT 66
Query: 372 SDSLATSTTTTTVSASLSLSASGARNVTANNNNNKLPFLLAKKKALTPSSSST--NIVYK 545
S +L+ S SAS ++ N+ K L A + + SSSS+ N++YK
Sbjct: 67 SLALSLS------SASNGRDSTNNNNLPFLLAKKKKNMLAASSSSSSSSSSSSSANLIYK 120
Query: 546 RSQSTTTTAYRVSDSTPRKRSGFWSFLHLHSYKHNSSS 659
RS+S T AY S S RKRSGFWSF HL+S KH S+
Sbjct: 121 RSKS-TAAAYGESFS-QRKRSGFWSFFHLYSSKHQISN 156
>gi|255551795|ref|XP_002516943.1| conserved hypothetical protein [Ricinus
communis]
Length = 450
Score = 98 bits (242), Expect = 1e-018
Identities = 49/86 (56%), Positives = 63/86 (73%), Gaps = 2/86 (2%)
Frame = +3
Query: 198 QQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSPPSFRSD 377
+++MGDGMQC +HP+ NPGGICAFCLQEKLGKLV+SSFPLP + +SS+SSS PSFRSD
Sbjct: 27 EEDMGDGMQCSDHPYRNNPGGICAFCLQEKLGKLVSSSFPLP--IRASSSSSSSPSFRSD 84
Query: 378 SLATSTTTTTVSASLSLSASGARNVT 455
+ V AS +S A +++
Sbjct: 85 IGSGVVGVGVVGASNGVSVGPAASLS 110
>gi|224107110|ref|XP_002314378.1| predicted protein [Populus trichocarpa]
Length = 389
Score = 96 bits (237), Expect = 4e-018
Identities = 47/79 (59%), Positives = 59/79 (74%), Gaps = 2/79 (2%)
Frame = +3
Query: 198 QQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSPPSFRSD 377
++++GDGMQC +HP+ NPGGICAFCLQEKLGKLV+SSFPLP + SS+SSS PSFRS
Sbjct: 1 EEDLGDGMQCSDHPYRNNPGGICAFCLQEKLGKLVSSSFPLP--IRGSSSSSSSPSFRSV 58
Query: 378 SLATSTTTTTVSASLSLSA 434
++ SLSL+A
Sbjct: 59 IGVGGSSNVGAGTSLSLAA 77
>gi|296083358|emb|CBI22994.3| unnamed protein product [Vitis vinifera]
Length = 387
Score = 89 bits (220), Expect = 4e-016
Identities = 58/153 (37%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Frame = +3
Query: 198 QQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSPPSFRSD 377
+ ++G+GMQC +HP+ NPGGICAFCLQEKLGKL+ + + +S S
Sbjct: 48 EDDVGEGMQCSDHPYRNNPGGICAFCLQEKLGKLIGGGAGVGVGVGGGGGGAS-----ST 102
Query: 378 SLATSTTTTTVSASLSLSASGARNVTANNNNNKLPFLLAKKKALTPSSSSTNIVYKRSQS 557
SL+ T+++ S S S N + L KKK S + IV KRS+S
Sbjct: 103 SLSVRPTSSSSSYSASKDCHYHGNYSRRARIPFLLAQKKKKKKEVMGSDAVGIVLKRSKS 162
Query: 558 TTT--------TAYRVSDSTPRKRSGFWSFLHL 632
TTT + +D +P+KR GFWSFL+L
Sbjct: 163 TTTPRRGHFLVESEDANDYSPQKR-GFWSFLYL 194
>gi|225431743|ref|XP_002270026.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 420
Score = 82 bits (200), Expect = 7e-014
Identities = 38/60 (63%), Positives = 48/60 (80%), Gaps = 3/60 (5%)
Frame = +3
Query: 198 QQEMGDGMQCVNHPFTKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSSTSSSPPSFRSD 377
+ ++G+GMQC +HP+ NPGGICAFCLQEKLGKLV+SSFP + S+SSS PSFRS+
Sbjct: 9 EDDVGEGMQCSDHPYRNNPGGICAFCLQEKLGKLVSSSFP---NAIFPSSSSSSPSFRSE 65
>gi|255645721|gb|ACU23354.1| unknown [Glycine max]
Length = 324
Score = 72 bits (175), Expect = 6e-011
Identities = 38/75 (50%), Positives = 46/75 (61%), Gaps = 4/75 (5%)
Frame = +3
Query: 177 MVGAKDQQQEMGDGMQCVNHPF----TKNPGGICAFCLQEKLGKLVTSSFPLPKHLSSSS 344
M G + E+ DGMQC+NHP NPGGICA CLQ+KL L++SSFP SSS
Sbjct: 1 MEGVGARHNEISDGMQCMNHPHRNNNNNNPGGICALCLQDKLRNLLSSSFPTSSPPFSSS 60
Query: 345 TSSSPPSFRSDSLAT 389
+SSSP S S+ T
Sbjct: 61 SSSSPSFTSSSSVKT 75
>gi|30680042|ref|NP_187343.2| proline-rich family protein [Arabidopsis
thaliana]
Length = 214
Score = 64 bits (153), Expect = 2e-008
Identities = 33/49 (67%), Positives = 37/49 (75%), Gaps = 2/49 (4%)
Frame = -2
Query: 382 RESDLKEGGEEEVEEEERC--LGRGKEEVTSFPSFSWRQKAQIPPGFFV 242
R ++ EG ++ EEEER LG GKEEVTS PSFSWR KAQIPPGFFV
Sbjct: 166 RAREVVEGVKDLGEEEERWLGLGTGKEEVTSLPSFSWRHKAQIPPGFFV 214
>gi|6728994|gb|AAF26991.1|AC016827_2 unknown protein [Arabidopsis thaliana]
Length = 207
Score = 64 bits (153), Expect = 2e-008
Identities = 33/49 (67%), Positives = 37/49 (75%), Gaps = 2/49 (4%)
Frame = -2
Query: 382 RESDLKEGGEEEVEEEERC--LGRGKEEVTSFPSFSWRQKAQIPPGFFV 242
R ++ EG ++ EEEER LG GKEEVTS PSFSWR KAQIPPGFFV
Sbjct: 159 RAREVVEGVKDLGEEEERWLGLGTGKEEVTSLPSFSWRHKAQIPPGFFV 207
>gi|119176049|ref|XP_001240156.1| hypothetical protein CIMG_09777 [Coccidioides
immitis RS]
Length = 1291
Score = 59 bits (142), Expect = 4e-007
Identities = 32/85 (37%), Positives = 44/85 (51%), Gaps = 2/85 (2%)
Frame = +1
Query: 346 LLPPLLPSDLTL*LPPPPPPPSPPLSPSPPQEPETSQQTTTTTSFRFYLQRRRL*LPPPP 525
++PP+LP L PPPPPPP PP +PS P +P +S Q+ SF + + + P
Sbjct: 1127 VIPPVLPELQHLSNPPPPPPPPPPTAPSQPMDPNSSSQSEERNSFVSGVGTINIAIDEPA 1186
Query: 526 --RRTSFTREASQRQRPRTESLTPP 594
T T + SQ P TE + PP
Sbjct: 1187 VLAPTEITHQRSQSAVPPTEYMLPP 1211
>gi|303318213|ref|XP_003069106.1| hypothetical protein CPC735_022970
[Coccidioides posadasii C735 delta SOWgp]
Length = 1319
Score = 59 bits (142), Expect = 4e-007
Identities = 32/85 (37%), Positives = 44/85 (51%), Gaps = 2/85 (2%)
Frame = +1
Query: 346 LLPPLLPSDLTL*LPPPPPPPSPPLSPSPPQEPETSQQTTTTTSFRFYLQRRRL*LPPPP 525
++PP+LP L PPPPPPP PP +PS P +P +S Q+ SF + + + P
Sbjct: 1155 VIPPVLPELQHLSNPPPPPPPPPPAAPSQPMDPNSSSQSEERNSFVSGVGTINIAIDEPA 1214
Query: 526 --RRTSFTREASQRQRPRTESLTPP 594
T T + SQ P TE + PP
Sbjct: 1215 VLAPTEITHQRSQSAVPPTEYMLPP 1239
>gi|320031713|gb|EFW13672.1| conserved hypothetical protein [Coccidioides
posadasii str. Silveira]
Length = 1582
Score = 59 bits (142), Expect = 4e-007
Identities = 32/85 (37%), Positives = 44/85 (51%), Gaps = 2/85 (2%)
Frame = +1
Query: 346 LLPPLLPSDLTL*LPPPPPPPSPPLSPSPPQEPETSQQTTTTTSFRFYLQRRRL*LPPPP 525
++PP+LP L PPPPPPP PP +PS P +P +S Q+ SF + + + P
Sbjct: 1418 VIPPVLPELQHLSNPPPPPPPPPPAAPSQPMDPNSSSQSEERNSFVSGVGTINIAIDEPA 1477
Query: 526 --RRTSFTREASQRQRPRTESLTPP 594
T T + SQ P TE + PP
Sbjct: 1478 VLAPTEITHQRSQSAVPPTEYMLPP 1502
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 670,546,551,498
Number of Sequences: 15229318
Number of Extensions: 670546551498
Number of Successful Extensions: 203192682
Number of sequences better than 0.0: 0