BLASTX 7.6.2
Query= UN18808 /QuerySize=554
(553 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|90186625|gb|ABD91572.1| unknown [Brassica rapa] 187 8e-046
gi|297789402|ref|XP_002862672.1| hypothetical protein ARALYDRAFT... 174 9e-042
gi|18402859|ref|NP_566674.1| photosystem II subunit T [Arabidops... 173 1e-041
gi|1465366|emb|CAA66701.1| photosystem II [Arabidopsis thaliana] 168 4e-040
gi|312283313|dbj|BAJ34522.1| unnamed protein product [Thellungie... 140 1e-031
gi|297852864|ref|XP_002894313.1| photosystem II 5 kD protein [Ar... 138 4e-031
gi|21537121|gb|AAM61462.1| photosystem II [Arabidopsis thaliana] 135 4e-030
gi|18403499|ref|NP_564589.1| Photosystem II 5 kD protein [Arabid... 134 6e-030
gi|2129671|pir||S71280 photosystem II protein psbT - Arabidopsis... 121 7e-026
gi|224054994|ref|XP_002298398.1| predicted protein [Populus tric... 104 1e-020
gi|224106185|ref|XP_002314076.1| predicted protein [Populus tric... 104 1e-020
gi|255544866|ref|XP_002513494.1| Photosystem II 5 kDa protein, c... 92 3e-017
gi|206586409|gb|ACI15739.1| chloroplast photosystem II 5 kDa pre... 84 1e-014
gi|400198|sp|P31336.1|PST2_GOSHI RecName: Full=Photosystem II 5 ... 80 1e-013
gi|255632908|gb|ACU16808.1| unknown [Glycine max] 80 2e-013
gi|242062124|ref|XP_002452351.1| hypothetical protein SORBIDRAFT... 69 2e-010
gi|226508866|ref|NP_001143858.1| hypothetical protein LOC1002766... 64 1e-008
>gi|90186625|gb|ABD91572.1| unknown [Brassica rapa]
Length = 106
Score = 187 bits (474), Expect = 8e-046
Identities = 94/106 (88%), Positives = 101/106 (95%)
Frame = -1
Query: 478 MASMTMTATFLPAVAKLPSPTAGRRMSMVRASTSENTTTLEVKPKEEQRSTTMRRDIMFT 299
MAS+TMTATFLPAVAKLPS T+GRRMS+VRAS SENTT+LEVK KEEQ STTMRRD+MFT
Sbjct: 1 MASITMTATFLPAVAKLPSATSGRRMSVVRASKSENTTSLEVKTKEEQSSTTMRRDLMFT 60
Query: 298 AAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKVCRY 161
AAA+AVCALAK AMADEEEPKRGTEAAKKKYAQVCVTMPTAK+CRY
Sbjct: 61 AAAAAVCALAKAAMADEEEPKRGTEAAKKKYAQVCVTMPTAKICRY 106
>gi|297789402|ref|XP_002862672.1| hypothetical protein ARALYDRAFT_920397
[Arabidopsis lyrata subsp. lyrata]
Length = 103
Score = 174 bits (439), Expect = 9e-042
Identities = 88/106 (83%), Positives = 99/106 (93%), Gaps = 3/106 (2%)
Frame = -1
Query: 478 MASMTMTATFLPAVAKLPSPTAGRRMSMVRASTSENTTTLEVKPKEEQRSTTMRRDIMFT 299
MASMTMTATFLPA+AKLPS T GRR+S+VRASTS+NT +L+VK EQ STTMRRD+MFT
Sbjct: 1 MASMTMTATFLPAIAKLPSATGGRRLSVVRASTSDNTPSLQVK---EQCSTTMRRDLMFT 57
Query: 298 AAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKVCRY 161
AAA+AVC+LAKVAMA+EEEPKRGTEAAKKKYAQVCVTMPTAK+CRY
Sbjct: 58 AAAAAVCSLAKVAMAEEEEPKRGTEAAKKKYAQVCVTMPTAKICRY 103
>gi|18402859|ref|NP_566674.1| photosystem II subunit T [Arabidopsis thaliana]
Length = 103
Score = 173 bits (438), Expect = 1e-041
Identities = 88/106 (83%), Positives = 98/106 (92%), Gaps = 3/106 (2%)
Frame = -1
Query: 478 MASMTMTATFLPAVAKLPSPTAGRRMSMVRASTSENTTTLEVKPKEEQRSTTMRRDIMFT 299
MASMTMTATF PAVAK+PS T GRR+S+VRASTS+NT +LEVK EQ STTMRRD+MFT
Sbjct: 1 MASMTMTATFFPAVAKVPSATGGRRLSVVRASTSDNTPSLEVK---EQSSTTMRRDLMFT 57
Query: 298 AAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKVCRY 161
AAA+AVC+LAKVAMA+EEEPKRGTEAAKKKYAQVCVTMPTAK+CRY
Sbjct: 58 AAAAAVCSLAKVAMAEEEEPKRGTEAAKKKYAQVCVTMPTAKICRY 103
>gi|1465366|emb|CAA66701.1| photosystem II [Arabidopsis thaliana]
Length = 103
Score = 168 bits (425), Expect = 4e-040
Identities = 86/106 (81%), Positives = 96/106 (90%), Gaps = 3/106 (2%)
Frame = -1
Query: 478 MASMTMTATFLPAVAKLPSPTAGRRMSMVRASTSENTTTLEVKPKEEQRSTTMRRDIMFT 299
MASMTMTATF PAVAK+PS T GRR+S+VRASTS+NT +LEVK EQ STTMRRD+MFT
Sbjct: 1 MASMTMTATFFPAVAKVPSATGGRRLSVVRASTSDNTPSLEVK---EQSSTTMRRDLMFT 57
Query: 298 AAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKVCRY 161
AAA+AVC+LAKVAMA+EEEPKRGTEA KKKYAQVCVTM TAK+CRY
Sbjct: 58 AAAAAVCSLAKVAMAEEEEPKRGTEAGKKKYAQVCVTMRTAKICRY 103
>gi|312283313|dbj|BAJ34522.1| unnamed protein product [Thellungiella halophila]
Length = 106
Score = 140 bits (351), Expect = 1e-031
Identities = 74/108 (68%), Positives = 93/108 (86%), Gaps = 6/108 (5%)
Frame = -1
Query: 478 MASMTMTATFLPAVAKLPSPTAG---RRMSMVRASTSENTTTLEVKPKEEQRSTTMRRDI 308
MASMTMT++FLPAV+KLP+ G R +++V+ASTSENTT+LE +++++S MRRD+
Sbjct: 1 MASMTMTSSFLPAVSKLPTAITGSNRRSLTVVKASTSENTTSLE--NRKQEQSMKMRRDM 58
Query: 307 MFTAAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKVCR 164
+FTAAA+AVC+LAK AMAD EEPKRGTEAAKKKYA VCVTMPTAK+CR
Sbjct: 59 VFTAAAAAVCSLAKAAMAD-EEPKRGTEAAKKKYAPVCVTMPTAKICR 105
>gi|297852864|ref|XP_002894313.1| photosystem II 5 kD protein [Arabidopsis
lyrata subsp. lyrata]
Length = 106
Score = 138 bits (347), Expect = 4e-031
Identities = 73/108 (67%), Positives = 93/108 (86%), Gaps = 6/108 (5%)
Frame = -1
Query: 478 MASMTMTATFLPAVAKLPSPTAG---RRMSMVRASTSENTTTLEVKPKEEQRSTTMRRDI 308
MASMTMT++FLP V+KLP+ +G R +++V+AS SENTT+LE K++++S MRRD+
Sbjct: 1 MASMTMTSSFLPTVSKLPANISGNSRRSLTVVKASASENTTSLE--NKKQEQSMKMRRDL 58
Query: 307 MFTAAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKVCR 164
+FTAAA+AVC+LAKVAMAD EEPKRGTEAAKKKYA VCVTMPTA++CR
Sbjct: 59 VFTAAAAAVCSLAKVAMAD-EEPKRGTEAAKKKYAPVCVTMPTARICR 105
>gi|21537121|gb|AAM61462.1| photosystem II [Arabidopsis thaliana]
Length = 106
Score = 135 bits (338), Expect = 4e-030
Identities = 72/108 (66%), Positives = 91/108 (84%), Gaps = 6/108 (5%)
Frame = -1
Query: 478 MASMTMTATFLPAVAKLP---SPTAGRRMSMVRASTSENTTTLEVKPKEEQRSTTMRRDI 308
MASMTMT++FLP V+ LP S + R +++V+AS SENTT+LE KE+++S MRRD+
Sbjct: 1 MASMTMTSSFLPTVSNLPANISSNSRRSLTVVKASGSENTTSLE--NKEQEQSMKMRRDL 58
Query: 307 MFTAAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKVCR 164
+FTAAA+AVC+LAKVAMAD +EPKRGTEAAKKKYA VCVTMPTA++CR
Sbjct: 59 VFTAAAAAVCSLAKVAMAD-DEPKRGTEAAKKKYAPVCVTMPTARICR 105
>gi|18403499|ref|NP_564589.1| Photosystem II 5 kD protein [Arabidopsis
thaliana]
Length = 106
Score = 134 bits (337), Expect = 6e-030
Identities = 72/108 (66%), Positives = 91/108 (84%), Gaps = 6/108 (5%)
Frame = -1
Query: 478 MASMTMTATFLPAVAKLP---SPTAGRRMSMVRASTSENTTTLEVKPKEEQRSTTMRRDI 308
MASMTMT++FLP V+ LP S + R +++V+AS SENTT+LE K++++S MRRD+
Sbjct: 1 MASMTMTSSFLPTVSNLPANISSNSRRSLTVVKASGSENTTSLE--NKKQEQSMKMRRDL 58
Query: 307 MFTAAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKVCR 164
+FTAAA+AVC+LAKVAMAD +EPKRGTEAAKKKYA VCVTMPTAK+CR
Sbjct: 59 VFTAAAAAVCSLAKVAMAD-DEPKRGTEAAKKKYAPVCVTMPTAKICR 105
>gi|2129671|pir||S71280 photosystem II protein psbT - Arabidopsis thaliana
Length = 102
Score = 121 bits (302), Expect = 7e-026
Identities = 65/102 (63%), Positives = 77/102 (75%), Gaps = 4/102 (3%)
Frame = -1
Query: 466 TMTATFLPAVAKLPSPTAGRRMSMVRASTSENTTTLEVKPKEEQRSTTMRRDIMFTAAAS 287
TMTATF PAVAK+PS TA + + ++ T L + E+ MRRD+MFTAAA+
Sbjct: 5 TMTATFFPAVAKVPSATATKALRS-QSLHERQHTQLRSQGTEQHH---MRRDLMFTAAAA 60
Query: 286 AVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKVCRY 161
AVC+LAKVAMA+EEEPKRGTEA KKKYAQVCVTM TAK+CRY
Sbjct: 61 AVCSLAKVAMAEEEEPKRGTEAGKKKYAQVCVTMRTAKICRY 102
>gi|224054994|ref|XP_002298398.1| predicted protein [Populus trichocarpa]
Length = 105
Score = 104 bits (257), Expect = 1e-020
Identities = 53/105 (50%), Positives = 72/105 (68%), Gaps = 1/105 (0%)
Frame = -1
Query: 478 MASMTMTATFLPAVAKLPSPTAGRRMSMVRASTSENTTTLEVKPKEEQRSTTMRRDIMFT 299
MASMTMTA+FL P R ++ A S T + V+ K ++ S++ RRD+MF
Sbjct: 1 MASMTMTASFLAGSTMAKQPLTTPRRGLIVAKASRTTEGVNVEMKNKEESSSGRRDLMFA 60
Query: 298 AAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKVCR 164
AAA+A ++A+VA+AD EEP+RGT AKKKYA +CVTMPTA++CR
Sbjct: 61 AAAAAAYSIARVAIAD-EEPERGTPEAKKKYAPICVTMPTARICR 104
>gi|224106185|ref|XP_002314076.1| predicted protein [Populus trichocarpa]
Length = 105
Score = 104 bits (257), Expect = 1e-020
Identities = 58/107 (54%), Positives = 79/107 (73%), Gaps = 5/107 (4%)
Frame = -1
Query: 478 MASMTMTATFL--PAVAKLPSPTAGRRMSMVRASTSENTTTLEVKPKEEQRSTTMRRDIM 305
MAS+TMTA+FL A+AK PS T R + + +AS + +E+K +EE S+ RRD+M
Sbjct: 1 MASITMTASFLTGSAMAKQPSTTPRRGLIVAKASRATEGVNVEMKNREE--SSGGRRDLM 58
Query: 304 FTAAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKVCR 164
F AAA+A ++A+VA+AD EEP+RGT AKKKYA +CVTMPTA++CR
Sbjct: 59 FAAAAAAAYSIARVAIAD-EEPRRGTPEAKKKYAPICVTMPTARICR 104
>gi|255544866|ref|XP_002513494.1| Photosystem II 5 kDa protein, chloroplast
precursor, putative [Ricinus communis]
Length = 106
Score = 92 bits (228), Expect = 3e-017
Identities = 52/106 (49%), Positives = 70/106 (66%), Gaps = 4/106 (3%)
Frame = -1
Query: 478 MASMTMTATFLPAVAKLPSPTAGRRMSMVRASTSENT--TTLEVKPKEEQRSTTMRRDIM 305
MASMTMTA+FL P R ++ A S+ T + V+ K ++ S++ RRD++
Sbjct: 1 MASMTMTASFLAGSTLTRQPFTAPRRGLIVAKASKVTEGERVNVEMKNKEESSSGRRDLV 60
Query: 304 FTAAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKVC 167
F AAA+A ++AKVAMAD EPK GT AKKKYA +CV+MPTAK+C
Sbjct: 61 FAAAAAAAFSVAKVAMAD--EPKAGTLDAKKKYASICVSMPTAKIC 104
>gi|206586409|gb|ACI15739.1| chloroplast photosystem II 5 kDa precursor protein
[Picrorhiza kurrooa]
Length = 102
Score = 84 bits (205), Expect = 1e-014
Identities = 51/104 (49%), Positives = 63/104 (60%), Gaps = 8/104 (7%)
Frame = -1
Query: 466 TMTATFL-PAVAKLPSPTAGR-RMSMVRASTSENTTTLEVKPKEEQRSTTMRRDIMFTAA 293
TMT +F + A PT GR ++MVRAS T+ VK + S RRD+MF A
Sbjct: 5 TMTTSFFCRSAAAKQLPTTGRGGVAMVRASKESEKLTVNVK----EESNNTRRDLMFAMA 60
Query: 292 ASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKVCRY 161
A ++A AMAD EPKRGT AKKKYA VCVT PTA++C+Y
Sbjct: 61 AVVASSIANFAMAD--EPKRGTVEAKKKYAAVCVTNPTARICKY 102
>gi|400198|sp|P31336.1|PST2_GOSHI RecName: Full=Photosystem II 5 kDa protein,
chloroplastic; Short=PSII-T; AltName: Full=Light-regulated unknown 11
kDa protein; Flags: Precursor
Length = 105
Score = 80 bits (196), Expect = 1e-013
Identities = 44/106 (41%), Positives = 65/106 (61%), Gaps = 5/106 (4%)
Frame = -1
Query: 475 ASMTMTATFLPA--VAKLPSPTAGRRMSMVRASTSENTTTLEVKPKEEQRSTTMRRDIMF 302
AS+TMT +FL + K RR+ + A+ ++++ + + RR++MF
Sbjct: 2 ASITMTTSFLSTTNLTKGSPRITQRRLVVANAAKGAQVESVQMSGERKTEGNNGRREMMF 61
Query: 301 TAAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKVCR 164
AAA+A+C++A VA A EPKRG+ AKK YA VCVTMPTA++CR
Sbjct: 62 AAAAAAICSVAGVATA---EPKRGSAEAKKAYAPVCVTMPTARICR 104
>gi|255632908|gb|ACU16808.1| unknown [Glycine max]
Length = 104
Score = 80 bits (195), Expect = 2e-013
Identities = 47/105 (44%), Positives = 65/105 (61%), Gaps = 6/105 (5%)
Frame = -1
Query: 472 SMTMTATFL--PAVAKLPSPTAGRRMSMVRASTSENTTTLEVKPKEEQRSTTMRRDIMFT 299
S TMTA+ L P V + RR +V A+ + +V + + RR++MF
Sbjct: 3 SFTMTASILGSPTVTNRSAVATQRRSLVVNAAKAVEAE--KVSYDNDMDGSNGRRNLMFA 60
Query: 298 AAASAVCALAKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKVCR 164
AAA+AVC++A +A+AD EPK GT AKKKYA +CVTMPTA++CR
Sbjct: 61 AAAAAVCSVAGMAVAD--EPKPGTPEAKKKYAPICVTMPTARICR 103
>gi|242062124|ref|XP_002452351.1| hypothetical protein SORBIDRAFT_04g024130
[Sorghum bicolor]
Length = 152
Score = 69 bits (168), Expect = 2e-010
Identities = 49/120 (40%), Positives = 67/120 (55%), Gaps = 22/120 (18%)
Frame = -1
Query: 484 QAMASMTMTATFLPAVAKLPSPTAGRRMSMVRASTSENTTTLEVKPKEEQRSTTM----- 320
+AMAS+TM A+F A + +P+ ++VRA+ N +EE S +
Sbjct: 39 KAMASLTMMASF--AAVAVAAPSRRGSFAVVRAA---NKADHRCHQQEEPASARLAAAAA 93
Query: 319 -------RRDIMFTAAASAVCAL--AKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKVC 167
RR +M AAA+AV A+ A AMAD PK+G+ AKKKYA +CVTMPTAK+C
Sbjct: 94 AEEPAEGRRAVMLAAAAAAVAAIGGAGAAMAD---PKKGSPEAKKKYAPICVTMPTAKIC 150
>gi|226508866|ref|NP_001143858.1| hypothetical protein LOC100276652 [Zea mays]
Length = 104
Score = 64 bits (153), Expect = 1e-008
Identities = 44/105 (41%), Positives = 59/105 (56%), Gaps = 8/105 (7%)
Frame = -1
Query: 472 SMTMTATFLPAVAKLPSPTAGRRMSMVRASTSENTTTLEVKPKEEQR-STTMRRDIMFTA 296
S+TM A+F A + +P+ ++VR++ + E R + RR +M A
Sbjct: 3 SLTMMASF--AAVAVAAPSRRGTFAVVRSAKVDRCQEPATLAATEARPAADGRRAVMLAA 60
Query: 295 AASAVCAL--AKVAMADEEEPKRGTEAAKKKYAQVCVTMPTAKVC 167
AA+AV A+ A AMA PK GT AKKKYA +CVTMPTAKVC
Sbjct: 61 AAAAVAAIGGAGAAMAG---PKNGTPEAKKKYAAICVTMPTAKVC 102
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,252,929,668,096
Number of Sequences: 15229318
Number of Extensions: 2252929668096
Number of Successful Extensions: 564714463
Number of sequences better than 0.0: 0