BLASTX 7.6.2
Query= UN45521 /QuerySize=692
(691 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|297796179|ref|XP_002865974.1| thylakoid lumenal 17.4 kDa prot... 164 2e-038
gi|30696344|ref|NP_851183.1| thylakoid lumenal protein [Arabidop... 163 4e-038
gi|30696347|ref|NP_200161.2| thylakoid lumenal protein [Arabidop... 159 3e-037
gi|334188366|ref|NP_001190531.1| thylakoid lumenal protein [Arab... 131 1e-028
gi|224120874|ref|XP_002318440.1| predicted protein [Populus tric... 104 1e-020
gi|255570589|ref|XP_002526251.1| Thylakoid lumenal 17.4 kDa prot... 102 4e-020
gi|255647148|gb|ACU24042.1| unknown [Glycine max] 100 3e-019
gi|225455324|ref|XP_002275994.1| PREDICTED: hypothetical protein... 98 8e-019
gi|212721648|ref|NP_001132583.1| hypothetical protein LOC1001940... 96 3e-018
gi|217071608|gb|ACJ84164.1| unknown [Medicago truncatula] 96 3e-018
gi|115482792|ref|NP_001064989.1| Os10g0502000 [Oryza sativa Japo... 94 2e-017
gi|242034055|ref|XP_002464422.1| hypothetical protein SORBIDRAFT... 94 2e-017
gi|116785879|gb|ABK23895.1| unknown [Picea sitchensis] 84 1e-014
>gi|297796179|ref|XP_002865974.1| thylakoid lumenal 17.4 kDa protein,
chloroplast [Arabidopsis lyrata subsp. lyrata]
Length = 236
Score = 164 bits (413), Expect = 2e-038
Identities = 86/118 (72%), Positives = 95/118 (80%), Gaps = 4/118 (3%)
Frame = +1
Query: 16 MASLPVHFSRNHFSSPNFSRKFRRSTETRSFVALVHCSA--RENDQGIKTTLFPVKELGC 189
MASLPV F+RNHFSSP FS K RR E RS V ++ + REN GIK +L P+KELG
Sbjct: 1 MASLPVQFTRNHFSSPFFSVKLRR--EPRSLVTVMFSAGENRENGDGIKKSLLPIKELGS 58
Query: 190 LACAALFAFTLTMASPVIAANQRLPPLSTDPTRCEQAFVGNTIGQANGVYDKPLDFRF 363
+ACAAL A TLTMASPVIAANQRLPPLST+P RCE+AFVGNTIGQANGVYDKPLD RF
Sbjct: 59 IACAALCACTLTMASPVIAANQRLPPLSTEPDRCEKAFVGNTIGQANGVYDKPLDLRF 116
Score = 91 bits (223), Expect = 2e-016
Identities = 43/46 (93%), Positives = 45/46 (97%)
Frame = +2
Query: 353 ISGSTFEEANLEDVVFEDTIIGYIDLQKICRNVTINEEGRLVLGCR 490
+SGSTFEEANLEDVVFEDTIIGYIDLQKICRN +INEEGRLVLGCR
Sbjct: 191 LSGSTFEEANLEDVVFEDTIIGYIDLQKICRNESINEEGRLVLGCR 236
>gi|30696344|ref|NP_851183.1| thylakoid lumenal protein [Arabidopsis thaliana]
Length = 236
Score = 163 bits (410), Expect = 4e-038
Identities = 87/119 (73%), Positives = 96/119 (80%), Gaps = 6/119 (5%)
Frame = +1
Query: 16 MASLPVHFSRNHFSSPNFSRKFRRSTETRSFVALVHCSA---RENDQGIKTTLFPVKELG 186
MASLPV F+RN SSP FS RR E RS V VHCSA REN +G+K +LFP+KELG
Sbjct: 1 MASLPVQFTRNQISSPFFSVNLRR--EPRSLVT-VHCSAGENRENGEGVKKSLFPLKELG 57
Query: 187 CLACAALFAFTLTMASPVIAANQRLPPLSTDPTRCEQAFVGNTIGQANGVYDKPLDFRF 363
+ACAAL A TLT+ASPVIAANQRLPPLST+P RCE+AFVGNTIGQANGVYDKPLD RF
Sbjct: 58 SIACAALCACTLTIASPVIAANQRLPPLSTEPDRCEKAFVGNTIGQANGVYDKPLDLRF 116
Score = 91 bits (223), Expect = 2e-016
Identities = 43/46 (93%), Positives = 45/46 (97%)
Frame = +2
Query: 353 ISGSTFEEANLEDVVFEDTIIGYIDLQKICRNVTINEEGRLVLGCR 490
+SGSTFEEANLEDVVFEDTIIGYIDLQKICRN +INEEGRLVLGCR
Sbjct: 191 LSGSTFEEANLEDVVFEDTIIGYIDLQKICRNESINEEGRLVLGCR 236
>gi|30696347|ref|NP_200161.2| thylakoid lumenal protein [Arabidopsis thaliana]
Length = 235
Score = 159 bits (402), Expect = 3e-037
Identities = 85/117 (72%), Positives = 94/117 (80%), Gaps = 5/117 (4%)
Frame = +1
Query: 19 ASLPVHFSRNHFSSPNFSRKFRRSTETRSFVALVHCSA--RENDQGIKTTLFPVKELGCL 192
ASLPV F+RN SSP FS RR E RS V VHCS REN +G+K +LFP+KELG +
Sbjct: 2 ASLPVQFTRNQISSPFFSVNLRR--EPRSLVT-VHCSGENRENGEGVKKSLFPLKELGSI 58
Query: 193 ACAALFAFTLTMASPVIAANQRLPPLSTDPTRCEQAFVGNTIGQANGVYDKPLDFRF 363
ACAAL A TLT+ASPVIAANQRLPPLST+P RCE+AFVGNTIGQANGVYDKPLD RF
Sbjct: 59 ACAALCACTLTIASPVIAANQRLPPLSTEPDRCEKAFVGNTIGQANGVYDKPLDLRF 115
Score = 91 bits (223), Expect = 2e-016
Identities = 43/46 (93%), Positives = 45/46 (97%)
Frame = +2
Query: 353 ISGSTFEEANLEDVVFEDTIIGYIDLQKICRNVTINEEGRLVLGCR 490
+SGSTFEEANLEDVVFEDTIIGYIDLQKICRN +INEEGRLVLGCR
Sbjct: 190 LSGSTFEEANLEDVVFEDTIIGYIDLQKICRNESINEEGRLVLGCR 235
>gi|334188366|ref|NP_001190531.1| thylakoid lumenal protein [Arabidopsis
thaliana]
Length = 250
Score = 131 bits (328), Expect = 1e-028
Identities = 62/77 (80%), Positives = 70/77 (90%)
Frame = +1
Query: 133 RENDQGIKTTLFPVKELGCLACAALFAFTLTMASPVIAANQRLPPLSTDPTRCEQAFVGN 312
REN +G+K +LFP+KELG +ACAAL A TLT+ASPVIAANQRLPPLST+P RCE+AFVGN
Sbjct: 54 RENGEGVKKSLFPLKELGSIACAALCACTLTIASPVIAANQRLPPLSTEPDRCEKAFVGN 113
Query: 313 TIGQANGVYDKPLDFRF 363
TIGQANGVYDKPLD RF
Sbjct: 114 TIGQANGVYDKPLDLRF 130
Score = 91 bits (223), Expect = 2e-016
Identities = 43/46 (93%), Positives = 45/46 (97%)
Frame = +2
Query: 353 ISGSTFEEANLEDVVFEDTIIGYIDLQKICRNVTINEEGRLVLGCR 490
+SGSTFEEANLEDVVFEDTIIGYIDLQKICRN +INEEGRLVLGCR
Sbjct: 205 LSGSTFEEANLEDVVFEDTIIGYIDLQKICRNESINEEGRLVLGCR 250
>gi|224120874|ref|XP_002318440.1| predicted protein [Populus trichocarpa]
Length = 240
Score = 104 bits (259), Expect = 1e-020
Identities = 57/117 (48%), Positives = 75/117 (64%), Gaps = 2/117 (1%)
Frame = +1
Query: 19 ASLPVHFSRNHFSSPNFSRKFRR--STETRSFVALVHCSARENDQGIKTTLFPVKELGCL 192
A+ + S+N F +FS RR + S + + +R+ Q ++ KE+ +
Sbjct: 4 ATFSLPLSQNKFPLYHFSSARRRFPIPDLHSPLKICCSGSRDGSQSRESLFQFKKEINYV 63
Query: 193 ACAALFAFTLTMASPVIAANQRLPPLSTDPTRCEQAFVGNTIGQANGVYDKPLDFRF 363
AC L A+ +T ASPVIAA QRLPPLST+P RCE+AFVGNTIGQANGVYDKP+D RF
Sbjct: 64 ACGILAAWAVTAASPVIAAGQRLPPLSTEPNRCEKAFVGNTIGQANGVYDKPIDLRF 120
Score = 76 bits (185), Expect = 4e-012
Identities = 34/46 (73%), Positives = 39/46 (84%)
Frame = +2
Query: 353 ISGSTFEEANLEDVVFEDTIIGYIDLQKICRNVTINEEGRLVLGCR 490
+SGSTF+EA LED +FEDTIIGYIDLQKICRN +I +GR LGCR
Sbjct: 195 LSGSTFDEAQLEDAIFEDTIIGYIDLQKICRNTSIGPDGRAELGCR 240
>gi|255570589|ref|XP_002526251.1| Thylakoid lumenal 17.4 kDa protein,
chloroplast precursor, putative [Ricinus communis]
Length = 228
Score = 102 bits (254), Expect = 4e-020
Identities = 48/65 (73%), Positives = 55/65 (84%)
Frame = +1
Query: 169 PVKELGCLACAALFAFTLTMASPVIAANQRLPPLSTDPTRCEQAFVGNTIGQANGVYDKP 348
P KEL +AC L A+ +T ASPVIAA+QRLPPLST+P RCE+AFVGNTIGQANGVYDKP
Sbjct: 44 PFKELQSVACGLLAAWAVTSASPVIAASQRLPPLSTEPNRCEKAFVGNTIGQANGVYDKP 103
Query: 349 LDFRF 363
+D RF
Sbjct: 104 IDLRF 108
Score = 76 bits (185), Expect = 4e-012
Identities = 34/46 (73%), Positives = 40/46 (86%)
Frame = +2
Query: 353 ISGSTFEEANLEDVVFEDTIIGYIDLQKICRNVTINEEGRLVLGCR 490
+SGSTF+EA L D VFEDTIIGYIDLQK+C+N +IN EGR +LGCR
Sbjct: 183 LSGSTFDEAQLADAVFEDTIIGYIDLQKLCKNTSINLEGREILGCR 228
>gi|255647148|gb|ACU24042.1| unknown [Glycine max]
Length = 239
Score = 100 bits (247), Expect = 3e-019
Identities = 60/119 (50%), Positives = 73/119 (61%), Gaps = 5/119 (4%)
Frame = +1
Query: 16 MASLPVHFSRNHFSSPNFSRKFRRSTETRSFVALVHCSARENDQGI---KTTLFPVKELG 186
MA++ + RN FS PN+S K T + S + + CS G K LF +
Sbjct: 1 MANVSIPLPRNGFSKPNYSTKRPCFTPSASTLR-ISCSGVAEFDGSHQKKGGLFNFNGIK 59
Query: 187 CLACAALFAFTLTMAS-PVIAANQRLPPLSTDPTRCEQAFVGNTIGQANGVYDKPLDFR 360
+AC L A +T A+ PV AA QRLPPLST+P RCE+AFVGNTIGQANGVYDKPLD R
Sbjct: 60 GVACGILAACAVTSAAFPVTAATQRLPPLSTEPNRCERAFVGNTIGQANGVYDKPLDLR 118
>gi|225455324|ref|XP_002275994.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 232
Score = 98 bits (243), Expect = 8e-019
Identities = 49/82 (59%), Positives = 59/82 (71%)
Frame = +1
Query: 118 VHCSARENDQGIKTTLFPVKELGCLACAALFAFTLTMASPVIAANQRLPPLSTDPTRCEQ 297
+ CSA + +K + KEL +A L +T ASPVIAA+QRLPPLST+P RCE+
Sbjct: 31 ISCSASWDSPELKASSSQFKELKNVAFGILAVCAVTAASPVIAASQRLPPLSTEPNRCER 90
Query: 298 AFVGNTIGQANGVYDKPLDFRF 363
AFVGNTIGQANGVYDKP+D RF
Sbjct: 91 AFVGNTIGQANGVYDKPIDLRF 112
>gi|212721648|ref|NP_001132583.1| hypothetical protein LOC100194054 [Zea mays]
Length = 225
Score = 96 bits (238), Expect = 3e-018
Identities = 54/83 (65%), Positives = 60/83 (72%), Gaps = 3/83 (3%)
Frame = +1
Query: 118 VHCSARENDQGIKTTLFPVKELGCLACAALFAFTLTMAS-PVIAANQRLPPLSTDPTRCE 294
V CSA D G T K G LAC L A+++ AS PVIAA+QRLPPLST+P RCE
Sbjct: 25 VACSAA--DAGGSTGPAWAKGAGRLACGVLAAWSVASASNPVIAASQRLPPLSTEPNRCE 82
Query: 295 QAFVGNTIGQANGVYDKPLDFRF 363
+AFVGNTIGQANGVYDKPLD RF
Sbjct: 83 RAFVGNTIGQANGVYDKPLDLRF 105
>gi|217071608|gb|ACJ84164.1| unknown [Medicago truncatula]
Length = 240
Score = 96 bits (238), Expect = 3e-018
Identities = 59/120 (49%), Positives = 71/120 (59%), Gaps = 6/120 (5%)
Frame = +1
Query: 16 MASLPVHFSRNHFSSPNFSRKFRRSTETRSFVALVHCS----ARENDQGIKTTLFPVKEL 183
MA+L + R S NFS K R T + + CS A + K L + ++
Sbjct: 1 MANLSIQLPRTSLSIRNFSTK-RPCFTTSALPFTITCSVVGEAELDGTENKPRLLSLNKI 59
Query: 184 GCLACAALFAFTLTMAS-PVIAANQRLPPLSTDPTRCEQAFVGNTIGQANGVYDKPLDFR 360
+AC L A+ +T AS PV AA QRLPPLSTDP RCE+AFVGNTIGQANGVYDK LD R
Sbjct: 60 KGVACGILAAYAVTSASFPVTAATQRLPPLSTDPNRCERAFVGNTIGQANGVYDKALDLR 119
Score = 76 bits (186), Expect = 3e-012
Identities = 35/46 (76%), Positives = 39/46 (84%)
Frame = +2
Query: 353 ISGSTFEEANLEDVVFEDTIIGYIDLQKICRNVTINEEGRLVLGCR 490
+SGSTF++A LE VFEDTIIGYIDLQKICRN TI +EGR LGCR
Sbjct: 195 LSGSTFDDAKLEGAVFEDTIIGYIDLQKICRNTTIGDEGRAELGCR 240
>gi|115482792|ref|NP_001064989.1| Os10g0502000 [Oryza sativa Japonica Group]
Length = 236
Score = 94 bits (232), Expect = 2e-017
Identities = 46/64 (71%), Positives = 54/64 (84%), Gaps = 1/64 (1%)
Frame = +1
Query: 175 KELGCLACAALFAFTL-TMASPVIAANQRLPPLSTDPTRCEQAFVGNTIGQANGVYDKPL 351
K +G LAC L A+ + + +SPVIAA+QRLPPLST+P RCE+AFVGNTIGQANGVYDKPL
Sbjct: 53 KAVGGLACGVLAAWAVASSSSPVIAASQRLPPLSTEPNRCERAFVGNTIGQANGVYDKPL 112
Query: 352 DFRF 363
D RF
Sbjct: 113 DLRF 116
>gi|242034055|ref|XP_002464422.1| hypothetical protein SORBIDRAFT_01g017890
[Sorghum bicolor]
Length = 221
Score = 94 bits (231), Expect = 2e-017
Identities = 47/64 (73%), Positives = 52/64 (81%), Gaps = 1/64 (1%)
Frame = +1
Query: 175 KELGCLACAALFAFTLTMAS-PVIAANQRLPPLSTDPTRCEQAFVGNTIGQANGVYDKPL 351
K G LAC L A+ + AS PVIAA+QRLPPLST+P RCE+AFVGNTIGQANGVYDKPL
Sbjct: 38 KGAGRLACGVLAAWAVASASNPVIAASQRLPPLSTEPNRCERAFVGNTIGQANGVYDKPL 97
Query: 352 DFRF 363
D RF
Sbjct: 98 DLRF 101
>gi|116785879|gb|ABK23895.1| unknown [Picea sitchensis]
Length = 239
Score = 84 bits (207), Expect = 1e-014
Identities = 43/84 (51%), Positives = 51/84 (60%), Gaps = 3/84 (3%)
Frame = +1
Query: 118 VHCSARENDQ-GIKTTLFPVKELGCLACAALFAFTLTMASPVIAANQ--RLPPLSTDPTR 288
+ CS ND I+ PV+ +AC + +T P AA RLPPLS DP R
Sbjct: 35 IRCSDGSNDNVSIQVKTSPVERFKSIACGLCAVWAVTATYPTFAATSTPRLPPLSNDPNR 94
Query: 289 CEQAFVGNTIGQANGVYDKPLDFR 360
CE+AFVGNTIGQANGVYDKP+D R
Sbjct: 95 CERAFVGNTIGQANGVYDKPIDLR 118
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,939,538,178,972
Number of Sequences: 15229318
Number of Extensions: 4939538178972
Number of Successful Extensions: 1157257337
Number of sequences better than 0.0: 0