BLASTX 7.6.2
Query= UN38073 /QuerySize=728
(727 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|15239847|ref|NP_199741.1| GATA transcription factor 16 [Arabi... 119 5e-025
gi|297795681|ref|XP_002865725.1| hypothetical protein ARALYDRAFT... 117 2e-024
gi|224110254|ref|XP_002315462.1| predicted protein [Populus tric... 80 3e-013
gi|225431869|ref|XP_002275498.1| PREDICTED: hypothetical protein... 72 7e-011
gi|147814791|emb|CAN74414.1| hypothetical protein VITISV_042395 ... 70 2e-010
gi|225450647|ref|XP_002278369.1| PREDICTED: hypothetical protein... 70 2e-010
gi|255556286|ref|XP_002519177.1| GATA transcription factor, puta... 69 8e-010
gi|18397703|ref|NP_566290.1| GATA transcription factor 15 [Arabi... 59 8e-007
gi|21536761|gb|AAM61093.1| unknown [Arabidopsis thaliana] 59 8e-007
gi|7549639|gb|AAF63824.1| hypothetical protein [Arabidopsis thal... 59 8e-007
gi|224130312|ref|XP_002328578.1| predicted protein [Populus tric... 57 2e-006
gi|297829216|ref|XP_002882490.1| hypothetical protein ARALYDRAFT... 57 2e-006
gi|255633610|gb|ACU17164.1| unknown [Glycine max] 56 5e-006
>gi|15239847|ref|NP_199741.1| GATA transcription factor 16 [Arabidopsis
thaliana]
Length = 139
Score = 119 bits (297), Expect = 5e-025
Identities = 66/90 (73%), Positives = 72/90 (80%), Gaps = 10/90 (11%)
Frame = +2
Query: 377 SLCNACGIRNRKKRRGGGEDKKQPKKPNSCGGGGGGDLKRNPKFGESLR--MMDLGMTKR 550
SLCNACGIRNRKKRRGG ED K+ KK +S GG N KFGESL+ +MDLG+ KR
Sbjct: 58 SLCNACGIRNRKKRRGGTEDNKKLKKSSSGGG--------NRKFGESLKQSLMDLGIRKR 109
Query: 551 STVEKQLRKLGEEEQAAVLLMALSYGSVYA 640
STVEKQ +KLGEEEQAAVLLMALSYGSVYA
Sbjct: 110 STVEKQRQKLGEEEQAAVLLMALSYGSVYA 139
Score = 76 bits (185), Expect = 5e-012
Identities = 40/57 (70%), Positives = 42/57 (73%), Gaps = 6/57 (10%)
Frame = +1
Query: 142 MLDHSEK----DSE-RRRGGEDVIEQNKACCND-KKTCADCGTSKTPLWRGGPAGPK 294
MLDHSEK DSE + ED+IEQN ND KKTCADCGTSKTPLWRGGP GPK
Sbjct: 1 MLDHSEKVLLVDSETMKTRAEDMIEQNNTSVNDKKKTCADCGTSKTPLWRGGPVGPK 57
>gi|297795681|ref|XP_002865725.1| hypothetical protein ARALYDRAFT_917909
[Arabidopsis lyrata subsp. lyrata]
Length = 111
Score = 117 bits (291), Expect = 2e-024
Identities = 67/90 (74%), Positives = 71/90 (78%), Gaps = 11/90 (12%)
Frame = +2
Query: 377 SLCNACGIRNRKKRRGGGEDKKQPKKPNSCGGGGGGDLKRNPKFGESL--RMMDLGMTKR 550
SLCNACGIRNRKKRR G ED K+ KK +S GG NPK GESL R+MD G+TKR
Sbjct: 31 SLCNACGIRNRKKRR-GTEDNKKLKKSSSGGG--------NPKLGESLKQRLMDFGITKR 81
Query: 551 STVEKQLRKLGEEEQAAVLLMALSYGSVYA 640
STVEKQ RKLGEEEQAAVLLMALSYGSVYA
Sbjct: 82 STVEKQRRKLGEEEQAAVLLMALSYGSVYA 111
Score = 60 bits (144), Expect = 3e-007
Identities = 25/29 (86%)
Frame = +1
Query: 208 KACCNDKKTCADCGTSKTPLWRGGPAGPK 294
K DKKTCADCGTSKTPLWRGGPAGPK
Sbjct: 2 KTRAQDKKTCADCGTSKTPLWRGGPAGPK 30
>gi|224110254|ref|XP_002315462.1| predicted protein [Populus trichocarpa]
Length = 125
Score = 80 bits (196), Expect = 3e-013
Identities = 50/98 (51%), Positives = 60/98 (61%), Gaps = 18/98 (18%)
Frame = +2
Query: 377 SLCNACGIRNRKKRR-------GGGEDKKQPKKPNSCGGGGGGDLKRNPKFGESLRMMDL 535
SLCNACGIR+RKK+R GG + K S G LK+ R++ L
Sbjct: 36 SLCNACGIRSRKKKRDILGLNKGGAAANDKRAKKGSTNNGSSDGLKQ--------RLLAL 87
Query: 536 G---MTKRSTVEKQLRKLGEEEQAAVLLMALSYGSVYA 640
G + + STVE++ RKLGEEEQAAVLLMALSYGSVYA
Sbjct: 88 GREVLVQGSTVERRRRKLGEEEQAAVLLMALSYGSVYA 125
Score = 57 bits (136), Expect = 2e-006
Identities = 23/23 (100%)
Frame = +1
Query: 226 KKTCADCGTSKTPLWRGGPAGPK 294
KKTCADCGTSKTPLWRGGPAGPK
Sbjct: 13 KKTCADCGTSKTPLWRGGPAGPK 35
>gi|225431869|ref|XP_002275498.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 153
Score = 72 bits (175), Expect = 7e-011
Identities = 44/79 (55%), Positives = 53/79 (67%), Gaps = 7/79 (8%)
Frame = +2
Query: 419 RGGGEDKKQPKKPNSCGGGGGGDLKRNPKFGESL--RMMDLG---MTKRSTVEKQLRKLG 583
+G +D+K + N GGG+ N K G+SL R+ LG + +RSTVEKQ RKLG
Sbjct: 77 KGSTDDRKAKRSSNHSHNNGGGN--GNNKLGDSLKRRLFALGREVLLQRSTVEKQRRKLG 134
Query: 584 EEEQAAVLLMALSYGSVYA 640
EEEQAAVLLMALSYG VYA
Sbjct: 135 EEEQAAVLLMALSYGYVYA 153
Score = 61 bits (147), Expect = 1e-007
Identities = 31/55 (56%), Positives = 37/55 (67%), Gaps = 4/55 (7%)
Frame = +1
Query: 142 MLDHSEKDSE---RRRGGEDVIEQNKACCND-KKTCADCGTSKTPLWRGGPAGPK 294
M+D SEK SE D + ++ N+ KKTCADCGT+KTPLWRGGPAGPK
Sbjct: 1 MVDLSEKGSESEDMNNKNPDAVSSAESQVNEPKKTCADCGTTKTPLWRGGPAGPK 55
>gi|147814791|emb|CAN74414.1| hypothetical protein VITISV_042395 [Vitis
vinifera]
Length = 125
Score = 70 bits (171), Expect = 2e-010
Identities = 47/91 (51%), Positives = 58/91 (63%), Gaps = 9/91 (9%)
Frame = +2
Query: 377 SLCNACGIRNRKKRRGGGEDKKQPKKPNSCGGGGGGDLKRNPKFGESLRMMD---LGMTK 547
SLCNACGIR RK+R K+ ++ NS G DL K +SL + + +
Sbjct: 41 SLCNACGIRYRKRRSSMVGVNKKKERMNS----GSHDLSETLK--QSLMALGNEVMMQRQ 94
Query: 548 RSTVEKQLRKLGEEEQAAVLLMALSYGSVYA 640
RS+V+KQ RKLGEEEQAAVLLMALS GSV+A
Sbjct: 95 RSSVKKQRRKLGEEEQAAVLLMALSCGSVFA 125
>gi|225450647|ref|XP_002278369.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 124
Score = 70 bits (171), Expect = 2e-010
Identities = 47/91 (51%), Positives = 58/91 (63%), Gaps = 9/91 (9%)
Frame = +2
Query: 377 SLCNACGIRNRKKRRGGGEDKKQPKKPNSCGGGGGGDLKRNPKFGESLRMMD---LGMTK 547
SLCNACGIR RK+R K+ ++ NS G DL K +SL + + +
Sbjct: 40 SLCNACGIRYRKRRSSMVGVNKKKERMNS----GSHDLSETLK--QSLMALGNEVMMQRQ 93
Query: 548 RSTVEKQLRKLGEEEQAAVLLMALSYGSVYA 640
RS+V+KQ RKLGEEEQAAVLLMALS GSV+A
Sbjct: 94 RSSVKKQRRKLGEEEQAAVLLMALSCGSVFA 124
>gi|255556286|ref|XP_002519177.1| GATA transcription factor, putative [Ricinus
communis]
Length = 149
Score = 69 bits (166), Expect = 8e-010
Identities = 41/80 (51%), Positives = 51/80 (63%), Gaps = 5/80 (6%)
Frame = +2
Query: 416 RRGGGEDKKQPKKPNSCGGGGGGDLKRNPKFGESL--RMMDLG---MTKRSTVEKQLRKL 580
R DKK K +S G + + + G+ L R++ LG + +RS+VEKQ RKL
Sbjct: 70 RASSNPDKKSRKHSSSNGSSNNHNSNNSNRLGDGLKQRLLALGREVLMQRSSVEKQRRKL 129
Query: 581 GEEEQAAVLLMALSYGSVYA 640
GEEEQAAVLLMALSYGSVYA
Sbjct: 130 GEEEQAAVLLMALSYGSVYA 149
>gi|18397703|ref|NP_566290.1| GATA transcription factor 15 [Arabidopsis
thaliana]
Length = 149
Score = 59 bits (140), Expect = 8e-007
Identities = 24/30 (80%), Positives = 27/30 (90%)
Frame = +1
Query: 205 NKACCNDKKTCADCGTSKTPLWRGGPAGPK 294
N+A N+KK+CA CGTSKTPLWRGGPAGPK
Sbjct: 33 NEAISNEKKSCAICGTSKTPLWRGGPAGPK 62
>gi|21536761|gb|AAM61093.1| unknown [Arabidopsis thaliana]
Length = 136
Score = 59 bits (140), Expect = 8e-007
Identities = 24/30 (80%), Positives = 27/30 (90%)
Frame = +1
Query: 205 NKACCNDKKTCADCGTSKTPLWRGGPAGPK 294
N+A N+KK+CA CGTSKTPLWRGGPAGPK
Sbjct: 20 NEAISNEKKSCAICGTSKTPLWRGGPAGPK 49
>gi|7549639|gb|AAF63824.1| hypothetical protein [Arabidopsis thaliana]
Length = 136
Score = 59 bits (140), Expect = 8e-007
Identities = 24/30 (80%), Positives = 27/30 (90%)
Frame = +1
Query: 205 NKACCNDKKTCADCGTSKTPLWRGGPAGPK 294
N+A N+KK+CA CGTSKTPLWRGGPAGPK
Sbjct: 20 NEAISNEKKSCAICGTSKTPLWRGGPAGPK 49
>gi|224130312|ref|XP_002328578.1| predicted protein [Populus trichocarpa]
Length = 125
Score = 57 bits (136), Expect = 2e-006
Identities = 23/23 (100%)
Frame = +1
Query: 226 KKTCADCGTSKTPLWRGGPAGPK 294
KKTCADCGTSKTPLWRGGPAGPK
Sbjct: 13 KKTCADCGTSKTPLWRGGPAGPK 35
>gi|297829216|ref|XP_002882490.1| hypothetical protein ARALYDRAFT_477989
[Arabidopsis lyrata subsp. lyrata]
Length = 137
Score = 57 bits (136), Expect = 2e-006
Identities = 23/30 (76%), Positives = 26/30 (86%)
Frame = +1
Query: 205 NKACCNDKKTCADCGTSKTPLWRGGPAGPK 294
N+ N+KK+CA CGTSKTPLWRGGPAGPK
Sbjct: 21 NEGISNEKKSCAICGTSKTPLWRGGPAGPK 50
>gi|255633610|gb|ACU17164.1| unknown [Glycine max]
Length = 130
Score = 56 bits (133), Expect = 5e-006
Identities = 22/23 (95%), Positives = 23/23 (100%)
Frame = +1
Query: 226 KKTCADCGTSKTPLWRGGPAGPK 294
KKTCADCGT+KTPLWRGGPAGPK
Sbjct: 36 KKTCADCGTTKTPLWRGGPAGPK 58
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,281,875,611,404
Number of Sequences: 15229318
Number of Extensions: 4281875611404
Number of Successful Extensions: 1013544274
Number of sequences better than 0.0: 0
|