BLASTX 7.6.2
Query= UN54737 /QuerySize=786
(785 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|15239847|ref|NP_199741.1| GATA transcription factor 16 [Arabi... 119 8e-025
gi|297795681|ref|XP_002865725.1| hypothetical protein ARALYDRAFT... 116 4e-024
gi|224110254|ref|XP_002315462.1| predicted protein [Populus tric... 79 7e-013
gi|225431869|ref|XP_002275498.1| PREDICTED: hypothetical protein... 72 1e-010
gi|147814791|emb|CAN74414.1| hypothetical protein VITISV_042395 ... 70 3e-010
gi|225450647|ref|XP_002278369.1| PREDICTED: hypothetical protein... 70 3e-010
gi|255556286|ref|XP_002519177.1| GATA transcription factor, puta... 67 3e-009
gi|18397703|ref|NP_566290.1| GATA transcription factor 15 [Arabi... 59 7e-007
gi|21536761|gb|AAM61093.1| unknown [Arabidopsis thaliana] 59 9e-007
gi|7549639|gb|AAF63824.1| hypothetical protein [Arabidopsis thal... 59 9e-007
gi|297829216|ref|XP_002882490.1| hypothetical protein ARALYDRAFT... 57 4e-006
gi|224130312|ref|XP_002328578.1| predicted protein [Populus tric... 55 8e-006
>gi|15239847|ref|NP_199741.1| GATA transcription factor 16 [Arabidopsis
thaliana]
Length = 139
Score = 119 bits (296), Expect = 8e-025
Identities = 66/91 (72%), Positives = 72/91 (79%), Gaps = 11/91 (12%)
Frame = -2
Query: 415 SLCNACGIRNRKKRRGGGEDKKQPKKPNSCGGGGGGGDLKRNPKFGESLR--MMDLGMTK 242
SLCNACGIRNRKKRRGG ED K+ KK +S GG N KFGESL+ +MDLG+ K
Sbjct: 58 SLCNACGIRNRKKRRGGTEDNKKLKKSSSGGG---------NRKFGESLKQSLMDLGIRK 108
Query: 241 RSTVEKQLRKLGEEEQAAVLLMALSYGSVYA 149
RSTVEKQ +KLGEEEQAAVLLMALSYGSVYA
Sbjct: 109 RSTVEKQRQKLGEEEQAAVLLMALSYGSVYA 139
Score = 66 bits (159), Expect = 6e-009
Identities = 32/46 (69%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Frame = -1
Query: 629 DSE-RRIGGEDVIEQNKACCND-KKTCADCGTSKTPFWRGGPAGPK 498
DSE + ED+IEQN ND KKTCADCGTSKTP WRGGP GPK
Sbjct: 12 DSETMKTRAEDMIEQNNTSVNDKKKTCADCGTSKTPLWRGGPVGPK 57
>gi|297795681|ref|XP_002865725.1| hypothetical protein ARALYDRAFT_917909
[Arabidopsis lyrata subsp. lyrata]
Length = 111
Score = 116 bits (290), Expect = 4e-024
Identities = 67/91 (73%), Positives = 71/91 (78%), Gaps = 12/91 (13%)
Frame = -2
Query: 415 SLCNACGIRNRKKRRGGGEDKKQPKKPNSCGGGGGGGDLKRNPKFGESL--RMMDLGMTK 242
SLCNACGIRNRKKRR G ED K+ KK +S GG NPK GESL R+MD G+TK
Sbjct: 31 SLCNACGIRNRKKRR-GTEDNKKLKKSSSGGG---------NPKLGESLKQRLMDFGITK 80
Query: 241 RSTVEKQLRKLGEEEQAAVLLMALSYGSVYA 149
RSTVEKQ RKLGEEEQAAVLLMALSYGSVYA
Sbjct: 81 RSTVEKQRRKLGEEEQAAVLLMALSYGSVYA 111
Score = 59 bits (140), Expect = 9e-007
Identities = 24/29 (82%)
Frame = -1
Query: 584 KACCNDKKTCADCGTSKTPFWRGGPAGPK 498
K DKKTCADCGTSKTP WRGGPAGPK
Sbjct: 2 KTRAQDKKTCADCGTSKTPLWRGGPAGPK 30
>gi|224110254|ref|XP_002315462.1| predicted protein [Populus trichocarpa]
Length = 125
Score = 79 bits (193), Expect = 7e-013
Identities = 51/100 (51%), Positives = 64/100 (64%), Gaps = 21/100 (21%)
Frame = -2
Query: 415 SLCNACGIRNRKKRR--------GGGEDKKQPKKPNSCGGGGGGGDLKRNPKFGESLRMM 260
SLCNACGIR+RKK+R G + K+ KK ++ G G LK+ R++
Sbjct: 36 SLCNACGIRSRKKKRDILGLNKGGAAANDKRAKKGSTNNGSSDG--LKQ--------RLL 85
Query: 259 DLG---MTKRSTVEKQLRKLGEEEQAAVLLMALSYGSVYA 149
LG + + STVE++ RKLGEEEQAAVLLMALSYGSVYA
Sbjct: 86 ALGREVLVQGSTVERRRRKLGEEEQAAVLLMALSYGSVYA 125
Score = 55 bits (132), Expect = 8e-006
Identities = 22/23 (95%)
Frame = -1
Query: 566 KKTCADCGTSKTPFWRGGPAGPK 498
KKTCADCGTSKTP WRGGPAGPK
Sbjct: 13 KKTCADCGTSKTPLWRGGPAGPK 35
>gi|225431869|ref|XP_002275498.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 153
Score = 72 bits (174), Expect = 1e-010
Identities = 44/80 (55%), Positives = 52/80 (65%), Gaps = 8/80 (10%)
Frame = -2
Query: 373 RGGGEDKKQPKKPNSCGGGGGGGDLKRNPKFGESL--RMMDLG---MTKRSTVEKQLRKL 209
+G +D+K + N GGG N K G+SL R+ LG + +RSTVEKQ RKL
Sbjct: 77 KGSTDDRKAKRSSNHSHNNGGGNG---NNKLGDSLKRRLFALGREVLLQRSTVEKQRRKL 133
Query: 208 GEEEQAAVLLMALSYGSVYA 149
GEEEQAAVLLMALSYG VYA
Sbjct: 134 GEEEQAAVLLMALSYGYVYA 153
>gi|147814791|emb|CAN74414.1| hypothetical protein VITISV_042395 [Vitis
vinifera]
Length = 125
Score = 70 bits (170), Expect = 3e-010
Identities = 47/92 (51%), Positives = 58/92 (63%), Gaps = 10/92 (10%)
Frame = -2
Query: 415 SLCNACGIRNRKKRRGGGEDKKQPKKPNSCGGGGGGGDLKRNPKFGESLRMMD---LGMT 245
SLCNACGIR RK+R K+ ++ NS G DL K +SL + +
Sbjct: 41 SLCNACGIRYRKRRSSMVGVNKKKERMNS-----GSHDLSETLK--QSLMALGNEVMMQR 93
Query: 244 KRSTVEKQLRKLGEEEQAAVLLMALSYGSVYA 149
+RS+V+KQ RKLGEEEQAAVLLMALS GSV+A
Sbjct: 94 QRSSVKKQRRKLGEEEQAAVLLMALSCGSVFA 125
>gi|225450647|ref|XP_002278369.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 124
Score = 70 bits (170), Expect = 3e-010
Identities = 47/92 (51%), Positives = 58/92 (63%), Gaps = 10/92 (10%)
Frame = -2
Query: 415 SLCNACGIRNRKKRRGGGEDKKQPKKPNSCGGGGGGGDLKRNPKFGESLRMMD---LGMT 245
SLCNACGIR RK+R K+ ++ NS G DL K +SL + +
Sbjct: 40 SLCNACGIRYRKRRSSMVGVNKKKERMNS-----GSHDLSETLK--QSLMALGNEVMMQR 92
Query: 244 KRSTVEKQLRKLGEEEQAAVLLMALSYGSVYA 149
+RS+V+KQ RKLGEEEQAAVLLMALS GSV+A
Sbjct: 93 QRSSVKKQRRKLGEEEQAAVLLMALSCGSVFA 124
>gi|255556286|ref|XP_002519177.1| GATA transcription factor, putative [Ricinus
communis]
Length = 149
Score = 67 bits (161), Expect = 3e-009
Identities = 39/80 (48%), Positives = 51/80 (63%), Gaps = 5/80 (6%)
Frame = -2
Query: 373 RGGGEDKKQPKKPNSCGGGGGGGDLKRNPKFGESL--RMMDLG---MTKRSTVEKQLRKL 209
R K+ +K +S G + + + G+ L R++ LG + +RS+VEKQ RKL
Sbjct: 70 RASSNPDKKSRKHSSSNGSSNNHNSNNSNRLGDGLKQRLLALGREVLMQRSSVEKQRRKL 129
Query: 208 GEEEQAAVLLMALSYGSVYA 149
GEEEQAAVLLMALSYGSVYA
Sbjct: 130 GEEEQAAVLLMALSYGSVYA 149
>gi|18397703|ref|NP_566290.1| GATA transcription factor 15 [Arabidopsis
thaliana]
Length = 149
Score = 59 bits (141), Expect = 7e-007
Identities = 27/51 (52%), Positives = 34/51 (66%), Gaps = 6/51 (11%)
Frame = -1
Query: 632 QDSERRIGGEDVIEQ------NKACCNDKKTCADCGTSKTPFWRGGPAGPK 498
+ E ++ D IE+ N+A N+KK+CA CGTSKTP WRGGPAGPK
Sbjct: 12 ESMESKLTSVDAIEEHSSSSSNEAISNEKKSCAICGTSKTPLWRGGPAGPK 62
>gi|21536761|gb|AAM61093.1| unknown [Arabidopsis thaliana]
Length = 136
Score = 59 bits (140), Expect = 9e-007
Identities = 27/48 (56%), Positives = 33/48 (68%), Gaps = 6/48 (12%)
Frame = -1
Query: 623 ERRIGGEDVIEQ------NKACCNDKKTCADCGTSKTPFWRGGPAGPK 498
E ++ D IE+ N+A N+KK+CA CGTSKTP WRGGPAGPK
Sbjct: 2 ESKLTSVDAIEEHSSSSSNEAISNEKKSCAICGTSKTPLWRGGPAGPK 49
>gi|7549639|gb|AAF63824.1| hypothetical protein [Arabidopsis thaliana]
Length = 136
Score = 59 bits (140), Expect = 9e-007
Identities = 27/48 (56%), Positives = 33/48 (68%), Gaps = 6/48 (12%)
Frame = -1
Query: 623 ERRIGGEDVIEQ------NKACCNDKKTCADCGTSKTPFWRGGPAGPK 498
E ++ D IE+ N+A N+KK+CA CGTSKTP WRGGPAGPK
Sbjct: 2 ESKLTSVDAIEEHSSSSSNEAISNEKKSCAICGTSKTPLWRGGPAGPK 49
>gi|297829216|ref|XP_002882490.1| hypothetical protein ARALYDRAFT_477989
[Arabidopsis lyrata subsp. lyrata]
Length = 137
Score = 57 bits (135), Expect = 4e-006
Identities = 26/49 (53%), Positives = 32/49 (65%), Gaps = 7/49 (14%)
Frame = -1
Query: 623 ERRIGGEDVIEQ-------NKACCNDKKTCADCGTSKTPFWRGGPAGPK 498
E ++ D IE+ N+ N+KK+CA CGTSKTP WRGGPAGPK
Sbjct: 2 ESKLTSVDAIEEHSSSSSSNEGISNEKKSCAICGTSKTPLWRGGPAGPK 50
>gi|224130312|ref|XP_002328578.1| predicted protein [Populus trichocarpa]
Length = 125
Score = 55 bits (132), Expect = 8e-006
Identities = 22/23 (95%)
Frame = -1
Query: 566 KKTCADCGTSKTPFWRGGPAGPK 498
KKTCADCGTSKTP WRGGPAGPK
Sbjct: 13 KKTCADCGTSKTPLWRGGPAGPK 35
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,737,270,142,965
Number of Sequences: 15229318
Number of Extensions: 5737270142965
Number of Successful Extensions: 1346303982
Number of sequences better than 0.0: 0
|