BLASTX 7.6.2
Query= UN19730 /QuerySize=590
(589 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|90186625|gb|ABD91572.1| unknown [Brassica rapa] 190 2e-046
gi|297789402|ref|XP_002862672.1| hypothetical protein ARALYDRAFT... 176 2e-042
gi|18402859|ref|NP_566674.1| photosystem II subunit T [Arabidops... 176 3e-042
gi|1465366|emb|CAA66701.1| photosystem II [Arabidopsis thaliana] 171 9e-041
gi|312283313|dbj|BAJ34522.1| unnamed protein product [Thellungie... 142 4e-032
gi|297852864|ref|XP_002894313.1| photosystem II 5 kD protein [Ar... 141 1e-031
gi|21537121|gb|AAM61462.1| photosystem II [Arabidopsis thaliana] 137 1e-030
gi|18403499|ref|NP_564589.1| Photosystem II 5 kD protein [Arabid... 137 1e-030
gi|2129671|pir||S71280 photosystem II protein psbT - Arabidopsis... 122 3e-026
gi|224054994|ref|XP_002298398.1| predicted protein [Populus tric... 106 4e-021
gi|224106185|ref|XP_002314076.1| predicted protein [Populus tric... 105 5e-021
gi|255544866|ref|XP_002513494.1| Photosystem II 5 kDa protein, c... 94 1e-017
gi|206586409|gb|ACI15739.1| chloroplast photosystem II 5 kDa pre... 84 8e-015
gi|400198|sp|P31336.1|PST2_GOSHI RecName: Full=Photosystem II 5 ... 82 4e-014
gi|255632908|gb|ACU16808.1| unknown [Glycine max] 81 9e-014
gi|242062124|ref|XP_002452351.1| hypothetical protein SORBIDRAFT... 69 3e-010
gi|226508866|ref|NP_001143858.1| hypothetical protein LOC1002766... 63 3e-008
>gi|90186625|gb|ABD91572.1| unknown [Brassica rapa]
Length = 106
Score = 190 bits (480), Expect = 2e-046
Identities = 97/106 (91%), Positives = 101/106 (95%)
Frame = -2
Query: 519 MASMTMTATFLPAVAKLPSPTAGRRMSMVRASTSENTTSLEVKPKEEQRSTTMRRDLMFT 340
MAS+TMTATFLPAVAKLPS T+GRRMS+VRAS SENTTSLEVK KEEQ STTMRRDLMFT
Sbjct: 1 MASITMTATFLPAVAKLPSATSGRRMSVVRASKSENTTSLEVKTKEEQSSTTMRRDLMFT 60
Query: 339 AAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKICRY 202
AAA+AVCALAK AMADEEEPKRGTEAAKKKYAQVCVTMPTAKICRY
Sbjct: 61 AAAAAVCALAKAAMADEEEPKRGTEAAKKKYAQVCVTMPTAKICRY 106
>gi|297789402|ref|XP_002862672.1| hypothetical protein ARALYDRAFT_920397
[Arabidopsis lyrata subsp. lyrata]
Length = 103
Score = 176 bits (445), Expect = 2e-042
Identities = 91/106 (85%), Positives = 99/106 (93%), Gaps = 3/106 (2%)
Frame = -2
Query: 519 MASMTMTATFLPAVAKLPSPTAGRRMSMVRASTSENTTSLEVKPKEEQRSTTMRRDLMFT 340
MASMTMTATFLPA+AKLPS T GRR+S+VRASTS+NT SL+VK EQ STTMRRDLMFT
Sbjct: 1 MASMTMTATFLPAIAKLPSATGGRRLSVVRASTSDNTPSLQVK---EQCSTTMRRDLMFT 57
Query: 339 AAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKICRY 202
AAA+AVC+LAKVAMA+EEEPKRGTEAAKKKYAQVCVTMPTAKICRY
Sbjct: 58 AAAAAVCSLAKVAMAEEEEPKRGTEAAKKKYAQVCVTMPTAKICRY 103
>gi|18402859|ref|NP_566674.1| photosystem II subunit T [Arabidopsis thaliana]
Length = 103
Score = 176 bits (444), Expect = 3e-042
Identities = 91/106 (85%), Positives = 98/106 (92%), Gaps = 3/106 (2%)
Frame = -2
Query: 519 MASMTMTATFLPAVAKLPSPTAGRRMSMVRASTSENTTSLEVKPKEEQRSTTMRRDLMFT 340
MASMTMTATF PAVAK+PS T GRR+S+VRASTS+NT SLEVK EQ STTMRRDLMFT
Sbjct: 1 MASMTMTATFFPAVAKVPSATGGRRLSVVRASTSDNTPSLEVK---EQSSTTMRRDLMFT 57
Query: 339 AAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKICRY 202
AAA+AVC+LAKVAMA+EEEPKRGTEAAKKKYAQVCVTMPTAKICRY
Sbjct: 58 AAAAAVCSLAKVAMAEEEEPKRGTEAAKKKYAQVCVTMPTAKICRY 103
>gi|1465366|emb|CAA66701.1| photosystem II [Arabidopsis thaliana]
Length = 103
Score = 171 bits (431), Expect = 9e-041
Identities = 89/106 (83%), Positives = 96/106 (90%), Gaps = 3/106 (2%)
Frame = -2
Query: 519 MASMTMTATFLPAVAKLPSPTAGRRMSMVRASTSENTTSLEVKPKEEQRSTTMRRDLMFT 340
MASMTMTATF PAVAK+PS T GRR+S+VRASTS+NT SLEVK EQ STTMRRDLMFT
Sbjct: 1 MASMTMTATFFPAVAKVPSATGGRRLSVVRASTSDNTPSLEVK---EQSSTTMRRDLMFT 57
Query: 339 AAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKICRY 202
AAA+AVC+LAKVAMA+EEEPKRGTEA KKKYAQVCVTM TAKICRY
Sbjct: 58 AAAAAVCSLAKVAMAEEEEPKRGTEAGKKKYAQVCVTMRTAKICRY 103
>gi|312283313|dbj|BAJ34522.1| unnamed protein product [Thellungiella halophila]
Length = 106
Score = 142 bits (356), Expect = 4e-032
Identities = 76/108 (70%), Positives = 93/108 (86%), Gaps = 6/108 (5%)
Frame = -2
Query: 519 MASMTMTATFLPAVAKLPSPTAG---RRMSMVRASTSENTTSLEVKPKEEQRSTTMRRDL 349
MASMTMT++FLPAV+KLP+ G R +++V+ASTSENTTSLE +++++S MRRD+
Sbjct: 1 MASMTMTSSFLPAVSKLPTAITGSNRRSLTVVKASTSENTTSLE--NRKQEQSMKMRRDM 58
Query: 348 MFTAAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKICR 205
+FTAAA+AVC+LAK AMAD EEPKRGTEAAKKKYA VCVTMPTAKICR
Sbjct: 59 VFTAAAAAVCSLAKAAMAD-EEPKRGTEAAKKKYAPVCVTMPTAKICR 105
>gi|297852864|ref|XP_002894313.1| photosystem II 5 kD protein [Arabidopsis
lyrata subsp. lyrata]
Length = 106
Score = 141 bits (353), Expect = 1e-031
Identities = 76/108 (70%), Positives = 93/108 (86%), Gaps = 6/108 (5%)
Frame = -2
Query: 519 MASMTMTATFLPAVAKLPSPTAG---RRMSMVRASTSENTTSLEVKPKEEQRSTTMRRDL 349
MASMTMT++FLP V+KLP+ +G R +++V+AS SENTTSLE K++++S MRRDL
Sbjct: 1 MASMTMTSSFLPTVSKLPANISGNSRRSLTVVKASASENTTSLE--NKKQEQSMKMRRDL 58
Query: 348 MFTAAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKICR 205
+FTAAA+AVC+LAKVAMAD EEPKRGTEAAKKKYA VCVTMPTA+ICR
Sbjct: 59 VFTAAAAAVCSLAKVAMAD-EEPKRGTEAAKKKYAPVCVTMPTARICR 105
>gi|21537121|gb|AAM61462.1| photosystem II [Arabidopsis thaliana]
Length = 106
Score = 137 bits (344), Expect = 1e-030
Identities = 75/108 (69%), Positives = 91/108 (84%), Gaps = 6/108 (5%)
Frame = -2
Query: 519 MASMTMTATFLPAVAKLP---SPTAGRRMSMVRASTSENTTSLEVKPKEEQRSTTMRRDL 349
MASMTMT++FLP V+ LP S + R +++V+AS SENTTSLE KE+++S MRRDL
Sbjct: 1 MASMTMTSSFLPTVSNLPANISSNSRRSLTVVKASGSENTTSLE--NKEQEQSMKMRRDL 58
Query: 348 MFTAAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKICR 205
+FTAAA+AVC+LAKVAMAD +EPKRGTEAAKKKYA VCVTMPTA+ICR
Sbjct: 59 VFTAAAAAVCSLAKVAMAD-DEPKRGTEAAKKKYAPVCVTMPTARICR 105
>gi|18403499|ref|NP_564589.1| Photosystem II 5 kD protein [Arabidopsis
thaliana]
Length = 106
Score = 137 bits (343), Expect = 1e-030
Identities = 75/108 (69%), Positives = 91/108 (84%), Gaps = 6/108 (5%)
Frame = -2
Query: 519 MASMTMTATFLPAVAKLP---SPTAGRRMSMVRASTSENTTSLEVKPKEEQRSTTMRRDL 349
MASMTMT++FLP V+ LP S + R +++V+AS SENTTSLE K++++S MRRDL
Sbjct: 1 MASMTMTSSFLPTVSNLPANISSNSRRSLTVVKASGSENTTSLE--NKKQEQSMKMRRDL 58
Query: 348 MFTAAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKICR 205
+FTAAA+AVC+LAKVAMAD +EPKRGTEAAKKKYA VCVTMPTAKICR
Sbjct: 59 VFTAAAAAVCSLAKVAMAD-DEPKRGTEAAKKKYAPVCVTMPTAKICR 105
>gi|2129671|pir||S71280 photosystem II protein psbT - Arabidopsis thaliana
Length = 102
Score = 122 bits (306), Expect = 3e-026
Identities = 67/102 (65%), Positives = 77/102 (75%), Gaps = 4/102 (3%)
Frame = -2
Query: 507 TMTATFLPAVAKLPSPTAGRRMSMVRASTSENTTSLEVKPKEEQRSTTMRRDLMFTAAAS 328
TMTATF PAVAK+PS TA + + ++ T L + E+ MRRDLMFTAAA+
Sbjct: 5 TMTATFFPAVAKVPSATATKALRS-QSLHERQHTQLRSQGTEQHH---MRRDLMFTAAAA 60
Query: 327 AVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKICRY 202
AVC+LAKVAMA+EEEPKRGTEA KKKYAQVCVTM TAKICRY
Sbjct: 61 AVCSLAKVAMAEEEEPKRGTEAGKKKYAQVCVTMRTAKICRY 102
>gi|224054994|ref|XP_002298398.1| predicted protein [Populus trichocarpa]
Length = 105
Score = 106 bits (262), Expect = 4e-021
Identities = 55/105 (52%), Positives = 72/105 (68%), Gaps = 1/105 (0%)
Frame = -2
Query: 519 MASMTMTATFLPAVAKLPSPTAGRRMSMVRASTSENTTSLEVKPKEEQRSTTMRRDLMFT 340
MASMTMTA+FL P R ++ A S T + V+ K ++ S++ RRDLMF
Sbjct: 1 MASMTMTASFLAGSTMAKQPLTTPRRGLIVAKASRTTEGVNVEMKNKEESSSGRRDLMFA 60
Query: 339 AAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKICR 205
AAA+A ++A+VA+AD EEP+RGT AKKKYA +CVTMPTA+ICR
Sbjct: 61 AAAAAAYSIARVAIAD-EEPERGTPEAKKKYAPICVTMPTARICR 104
>gi|224106185|ref|XP_002314076.1| predicted protein [Populus trichocarpa]
Length = 105
Score = 105 bits (261), Expect = 5e-021
Identities = 60/107 (56%), Positives = 80/107 (74%), Gaps = 5/107 (4%)
Frame = -2
Query: 519 MASMTMTATFL--PAVAKLPSPTAGRRMSMVRASTSENTTSLEVKPKEEQRSTTMRRDLM 346
MAS+TMTA+FL A+AK PS T R + + +AS + ++E+K +EE S+ RRDLM
Sbjct: 1 MASITMTASFLTGSAMAKQPSTTPRRGLIVAKASRATEGVNVEMKNREE--SSGGRRDLM 58
Query: 345 FTAAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKICR 205
F AAA+A ++A+VA+AD EEP+RGT AKKKYA +CVTMPTA+ICR
Sbjct: 59 FAAAAAAAYSIARVAIAD-EEPRRGTPEAKKKYAPICVTMPTARICR 104
>gi|255544866|ref|XP_002513494.1| Photosystem II 5 kDa protein, chloroplast
precursor, putative [Ricinus communis]
Length = 106
Score = 94 bits (232), Expect = 1e-017
Identities = 54/106 (50%), Positives = 70/106 (66%), Gaps = 4/106 (3%)
Frame = -2
Query: 519 MASMTMTATFLPAVAKLPSPTAGRRMSMVRASTSENTTS--LEVKPKEEQRSTTMRRDLM 346
MASMTMTA+FL P R ++ A S+ T + V+ K ++ S++ RRDL+
Sbjct: 1 MASMTMTASFLAGSTLTRQPFTAPRRGLIVAKASKVTEGERVNVEMKNKEESSSGRRDLV 60
Query: 345 FTAAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKIC 208
F AAA+A ++AKVAMAD EPK GT AKKKYA +CV+MPTAKIC
Sbjct: 61 FAAAAAAAFSVAKVAMAD--EPKAGTLDAKKKYASICVSMPTAKIC 104
>gi|206586409|gb|ACI15739.1| chloroplast photosystem II 5 kDa precursor protein
[Picrorhiza kurrooa]
Length = 102
Score = 84 bits (207), Expect = 8e-015
Identities = 55/104 (52%), Positives = 66/104 (63%), Gaps = 8/104 (7%)
Frame = -2
Query: 507 TMTATFL-PAVAKLPSPTAGR-RMSMVRASTSENTTSLEVKPKEEQRSTTMRRDLMFTAA 334
TMT +F + A PT GR ++MVRA S+ + L V KEE +T RRDLMF A
Sbjct: 5 TMTTSFFCRSAAAKQLPTTGRGGVAMVRA--SKESEKLTVNVKEESNNT--RRDLMFAMA 60
Query: 333 ASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKICRY 202
A ++A AMAD EPKRGT AKKKYA VCVT PTA+IC+Y
Sbjct: 61 AVVASSIANFAMAD--EPKRGTVEAKKKYAAVCVTNPTARICKY 102
>gi|400198|sp|P31336.1|PST2_GOSHI RecName: Full=Photosystem II 5 kDa protein,
chloroplastic; Short=PSII-T; AltName: Full=Light-regulated unknown 11
kDa protein; Flags: Precursor
Length = 105
Score = 82 bits (201), Expect = 4e-014
Identities = 46/106 (43%), Positives = 65/106 (61%), Gaps = 5/106 (4%)
Frame = -2
Query: 516 ASMTMTATFLPA--VAKLPSPTAGRRMSMVRASTSENTTSLEVKPKEEQRSTTMRRDLMF 343
AS+TMT +FL + K RR+ + A+ S+++ + + RR++MF
Sbjct: 2 ASITMTTSFLSTTNLTKGSPRITQRRLVVANAAKGAQVESVQMSGERKTEGNNGRREMMF 61
Query: 342 TAAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKICR 205
AAA+A+C++A VA A EPKRG+ AKK YA VCVTMPTA+ICR
Sbjct: 62 AAAAAAICSVAGVATA---EPKRGSAEAKKAYAPVCVTMPTARICR 104
>gi|255632908|gb|ACU16808.1| unknown [Glycine max]
Length = 104
Score = 81 bits (198), Expect = 9e-014
Identities = 49/105 (46%), Positives = 66/105 (62%), Gaps = 6/105 (5%)
Frame = -2
Query: 513 SMTMTATFL--PAVAKLPSPTAGRRMSMVRASTSENTTSLEVKPKEEQRSTTMRRDLMFT 340
S TMTA+ L P V + RR +V A+ + + +V + + RR+LMF
Sbjct: 3 SFTMTASILGSPTVTNRSAVATQRRSLVVNAAKA--VEAEKVSYDNDMDGSNGRRNLMFA 60
Query: 339 AAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKICR 205
AAA+AVC++A +A+AD EPK GT AKKKYA +CVTMPTA+ICR
Sbjct: 61 AAAAAVCSVAGMAVAD--EPKPGTPEAKKKYAPICVTMPTARICR 103
>gi|242062124|ref|XP_002452351.1| hypothetical protein SORBIDRAFT_04g024130
[Sorghum bicolor]
Length = 152
Score = 69 bits (168), Expect = 3e-010
Identities = 50/120 (41%), Positives = 67/120 (55%), Gaps = 22/120 (18%)
Frame = -2
Query: 525 QAMASMTMTATFLPAVAKLPSPTAGRRMSMVRASTSENTTSLEVKPKEEQRSTTM----- 361
+AMAS+TM A+F A + +P+ ++VRA+ N +EE S +
Sbjct: 39 KAMASLTMMASF--AAVAVAAPSRRGSFAVVRAA---NKADHRCHQQEEPASARLAAAAA 93
Query: 360 -------RRDLMFTAAASAVCAL--AKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKIC 208
RR +M AAA+AV A+ A AMAD PK+G+ AKKKYA +CVTMPTAKIC
Sbjct: 94 AEEPAEGRRAVMLAAAAAAVAAIGGAGAAMAD---PKKGSPEAKKKYAPICVTMPTAKIC 150
>gi|226508866|ref|NP_001143858.1| hypothetical protein LOC100276652 [Zea mays]
Length = 104
Score = 63 bits (151), Expect = 3e-008
Identities = 43/105 (40%), Positives = 59/105 (56%), Gaps = 8/105 (7%)
Frame = -2
Query: 513 SMTMTATFLPAVAKLPSPTAGRRMSMVRASTSENTTSLEVKPKEEQR-STTMRRDLMFTA 337
S+TM A+F A + +P+ ++VR++ + E R + RR +M A
Sbjct: 3 SLTMMASF--AAVAVAAPSRRGTFAVVRSAKVDRCQEPATLAATEARPAADGRRAVMLAA 60
Query: 336 AASAVCAL--AKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKIC 208
AA+AV A+ A AMA PK GT AKKKYA +CVTMPTAK+C
Sbjct: 61 AAAAVAAIGGAGAAMAG---PKNGTPEAKKKYAAICVTMPTAKVC 102
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,354,205,064,082
Number of Sequences: 15229318
Number of Extensions: 2354205064082
Number of Successful Extensions: 584002404
Number of sequences better than 0.0: 0