BLASTX 7.6.2
Query= UN47770 /QuerySize=922
(921 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|312281983|dbj|BAJ33857.1| unnamed protein product [Thellungie... 349 4e-094
gi|297802492|ref|XP_002869130.1| hypothetical protein ARALYDRAFT... 332 8e-089
gi|15236172|ref|NP_195194.1| GATA transcription factor 3 [Arabid... 330 3e-088
gi|20466648|gb|AAM20641.1| GATA transcription factor 3 [Arabidop... 328 8e-088
gi|15232355|ref|NP_191612.1| GATA transcription factor 4 [Arabid... 99 1e-018
gi|312282833|dbj|BAJ34282.1| unnamed protein product [Thellungie... 93 6e-017
gi|224059138|ref|XP_002299734.1| predicted protein [Populus tric... 91 3e-016
gi|255637027|gb|ACU18846.1| unknown [Glycine max] 82 1e-013
gi|297835478|ref|XP_002885621.1| hypothetical protein ARALYDRAFT... 80 3e-013
gi|15229571|ref|NP_189047.1| GATA transcription factor 1 [Arabid... 80 5e-013
gi|21593190|gb|AAM65139.1| GATA transcription factor 1 (AtGATA-1... 80 5e-013
gi|110743205|dbj|BAE99493.1| GATA transcription factor 1 [Arabid... 79 9e-013
gi|302398797|gb|ADL36693.1| GATA domain class transcription fact... 76 8e-012
gi|302398805|gb|ADL36697.1| GATA domain class transcription fact... 76 8e-012
gi|15239503|ref|NP_197955.1| GATA transcription factor 12 [Arabi... 73 6e-011
gi|301133588|gb|ADK63416.1| GATA type zinc finger protein [Brass... 72 1e-010
gi|15225399|ref|NP_182031.1| GATA transcription factor 2 [Arabid... 70 4e-010
gi|37572447|dbj|BAC98493.1| AG-motif binding protein-3 [Nicotian... 70 4e-010
gi|297824543|ref|XP_002880154.1| hypothetical protein ARALYDRAFT... 70 4e-010
gi|326518913|dbj|BAJ92617.1| predicted protein [Hordeum vulgare ... 70 4e-010
>gi|312281983|dbj|BAJ33857.1| unnamed protein product [Thellungiella halophila]
Length = 269
Score = 349 bits (895), Expect = 4e-094
Identities = 179/212 (84%), Positives = 190/212 (89%), Gaps = 5/212 (2%)
Frame = +2
Query: 215 MELWTEARALKASLRGEAIKHQVVVSEELSRTSSAEDFSVECFLDFSEEVQEE-EEELVS 391
ME+WTEARALKASLRGEAIKHQV++SEELSRTSSAEDFSVECFLDFSE +EE EEELVS
Sbjct: 1 MEMWTEARALKASLRGEAIKHQVLMSEELSRTSSAEDFSVECFLDFSEGQEEEPEEELVS 60
Query: 392 VSSSHEEQEQD-CCIFSSQPCIFDQLPALPVEDVEELEWVSRVVDDCSSQEVSLLLTQTH 568
VSSS EE EQ+ CIFSSQP +FDQLP+LP EDVEELEWVSRVVDDCSS EVSLL TQTH
Sbjct: 61 VSSSQEEHEQEQDCIFSSQPSVFDQLPSLPDEDVEELEWVSRVVDDCSSPEVSLLFTQTH 120
Query: 569 NTKPSFSSRIPVKPRTKRPRNSLTGDRVWPVVSTNQHAAGERQWKKKKKKQELAVVFQRR 748
TKPSF+SRIPVKPRTKR RNSLTG RVWP+VSTNQHAA ER W +KKKQE AV FQRR
Sbjct: 121 KTKPSFTSRIPVKPRTKRSRNSLTGGRVWPLVSTNQHAATER-W--RKKKQETAVAFQRR 177
Query: 749 CSHCGTNTTPQWRTGPVGPKTLCNACVVRFKS 844
CSHCGTN TPQWRTGP+GPKTLCNAC VRFKS
Sbjct: 178 CSHCGTNNTPQWRTGPLGPKTLCNACGVRFKS 209
>gi|297802492|ref|XP_002869130.1| hypothetical protein ARALYDRAFT_491187
[Arabidopsis lyrata subsp. lyrata]
Length = 268
Score = 332 bits (849), Expect = 8e-089
Identities = 179/217 (82%), Positives = 190/217 (87%), Gaps = 12/217 (5%)
Frame = +2
Query: 215 MELWTEARALKASLRGEAI----KHQVVVSEELSRTSS-AEDFSVECFLDFSEEVQEEEE 379
MELWTEARALKASLRGE+ HQ++VSE+LSRTSS +EDFSVECFLDFSE Q+EEE
Sbjct: 1 MELWTEARALKASLRGESTTSLKHHQLIVSEDLSRTSSLSEDFSVECFLDFSEG-QKEEE 59
Query: 380 ELVSVSSSHEEQEQD-CCIFSSQPCIFDQLPALPVEDVEELEWVSRVVDDCSSQEVSLLL 556
ELVSVSSS EEQEQ+ CIFSSQPCIFDQLP+LP EDVEELEWVSRVVDDCSS EVSLLL
Sbjct: 60 ELVSVSSSQEEQEQEQDCIFSSQPCIFDQLPSLPDEDVEELEWVSRVVDDCSSPEVSLLL 119
Query: 557 TQTHNTKPSFSSRIPVKPRTKRPRNSLTGDRVWPVVSTN-QHAAGERQWKKKKKKQELAV 733
TQTH TKPSF SRIPVKPRTKR RNSLTG RVWP+VSTN QHAA E + +KKKQE AV
Sbjct: 120 TQTHKTKPSF-SRIPVKPRTKRSRNSLTGGRVWPLVSTNHQHAATE---QLRKKKQETAV 175
Query: 734 VFQRRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKS 844
VFQRRCSHCGTN TPQWRTGPVGPKTLCNAC VRFKS
Sbjct: 176 VFQRRCSHCGTNNTPQWRTGPVGPKTLCNACGVRFKS 212
>gi|15236172|ref|NP_195194.1| GATA transcription factor 3 [Arabidopsis
thaliana]
Length = 269
Score = 330 bits (844), Expect = 3e-088
Identities = 174/217 (80%), Positives = 188/217 (86%), Gaps = 11/217 (5%)
Frame = +2
Query: 215 MELWTEARALKASLRGEAI----KHQVVVSEELSRTSS-AEDFSVECFLDFSEEVQEEEE 379
MELWTEARALKASLRGE+ HQV+VSE+LSRTSS EDFSVECFLDFSE +EEEE
Sbjct: 1 MELWTEARALKASLRGESTISLKHHQVIVSEDLSRTSSLPEDFSVECFLDFSEGQKEEEE 60
Query: 380 ELVSVSSSHEEQEQD-CCIFSSQPCIFDQLPALPVEDVEELEWVSRVVDDCSSQEVSLLL 556
E+VSVSSS E++EQ+ C+FSSQPCIFDQLP+LP EDVEELEWVSRVVDDCSS EVSLLL
Sbjct: 61 EVVSVSSSQEQEEQEHDCVFSSQPCIFDQLPSLPDEDVEELEWVSRVVDDCSSPEVSLLL 120
Query: 557 TQTHNTKPSFSSRIPVKPRTKRPRNSLTGDRVWPVVSTN-QHAAGERQWKKKKKKQELAV 733
TQTH TKPSF SRIPVKPRTKR RNSLTG RVWP+VSTN QHAA E + +KKKQE +
Sbjct: 121 TQTHKTKPSF-SRIPVKPRTKRSRNSLTGSRVWPLVSTNHQHAATE---QLRKKKQETVL 176
Query: 734 VFQRRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKS 844
VFQRRCSHCGTN TPQWRTGPVGPKTLCNAC VRFKS
Sbjct: 177 VFQRRCSHCGTNNTPQWRTGPVGPKTLCNACGVRFKS 213
>gi|20466648|gb|AAM20641.1| GATA transcription factor 3 [Arabidopsis thaliana]
Length = 269
Score = 328 bits (840), Expect = 8e-088
Identities = 173/217 (79%), Positives = 188/217 (86%), Gaps = 11/217 (5%)
Frame = +2
Query: 215 MELWTEARALKASLRGEAI----KHQVVVSEELSRTSS-AEDFSVECFLDFSEEVQEEEE 379
MELWTEARALKASLRGE+ HQV+VSE+LS+TSS EDFSVECFLDFSE +EEEE
Sbjct: 1 MELWTEARALKASLRGESTISLKHHQVIVSEDLSQTSSLPEDFSVECFLDFSEGQKEEEE 60
Query: 380 ELVSVSSSHEEQEQD-CCIFSSQPCIFDQLPALPVEDVEELEWVSRVVDDCSSQEVSLLL 556
E+VSVSSS E++EQ+ C+FSSQPCIFDQLP+LP EDVEELEWVSRVVDDCSS EVSLLL
Sbjct: 61 EVVSVSSSQEQEEQEHDCVFSSQPCIFDQLPSLPDEDVEELEWVSRVVDDCSSPEVSLLL 120
Query: 557 TQTHNTKPSFSSRIPVKPRTKRPRNSLTGDRVWPVVSTN-QHAAGERQWKKKKKKQELAV 733
TQTH TKPSF SRIPVKPRTKR RNSLTG RVWP+VSTN QHAA E + +KKKQE +
Sbjct: 121 TQTHKTKPSF-SRIPVKPRTKRSRNSLTGSRVWPLVSTNHQHAATE---QLRKKKQETVL 176
Query: 734 VFQRRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKS 844
VFQRRCSHCGTN TPQWRTGPVGPKTLCNAC VRFKS
Sbjct: 177 VFQRRCSHCGTNNTPQWRTGPVGPKTLCNACGVRFKS 213
>gi|15232355|ref|NP_191612.1| GATA transcription factor 4 [Arabidopsis
thaliana]
Length = 240
Score = 99 bits (244), Expect = 1e-018
Identities = 54/131 (41%), Positives = 76/131 (58%), Gaps = 5/131 (3%)
Frame = +2
Query: 473 LPVEDVEELEWVSRVVDDCSSQEVSLLLTQTHNTKPSFSSRIPVKPRTKRPRNSLTGDRV 652
+P +D LEW+SR VDD S + LT T + SF+ + P R++ P S+ G
Sbjct: 69 VPSDDAAHLEWLSRFVDDSFSDFPANPLTMTVRPEISFTGK-PRSRRSRAPAPSVAG--T 125
Query: 653 WPVVSTNQ--HAAGERQWKKKKKKQELAVVFQRRCSHCGTNTTPQWRTGPVGPKTLCNAC 826
W +S ++ H+ + + KK + + RRC+HC + TPQWRTGP+GPKTLCNAC
Sbjct: 126 WAPMSESELCHSVAKPKPKKVYNAESVTADGARRCTHCASEKTPQWRTGPLGPKTLCNAC 185
Query: 827 VVRFKSVGYVP 859
VR+KS VP
Sbjct: 186 GVRYKSGRLVP 196
>gi|312282833|dbj|BAJ34282.1| unnamed protein product [Thellungiella halophila]
Length = 247
Score = 93 bits (229), Expect = 6e-017
Identities = 53/133 (39%), Positives = 72/133 (54%), Gaps = 7/133 (5%)
Frame = +2
Query: 473 LPVEDVEELEWVSRVVDDCSSQEVSLLLTQTHNTKPSFSSRIPVKPRTKRPRNSLTGDRV 652
+P +D LEW+SR VDD S + LT T + SF+ + P R++ P + G
Sbjct: 74 VPSDDAAHLEWLSRFVDDSFSDYPANPLTMTVRPEMSFTGK-PRSRRSRAPAPPVAG--T 130
Query: 653 WPVVSTNQHAAGERQWKKKKKKQELAVVFQ----RRCSHCGTNTTPQWRTGPVGPKTLCN 820
W + ++ + K KK + + RRC+HC + TPQWRTGP+GPKTLCN
Sbjct: 131 WAPMPESELCYSVAKTKPNKKFEAEPMAADGGGARRCTHCASEKTPQWRTGPLGPKTLCN 190
Query: 821 ACVVRFKSVGYVP 859
AC VRFKS VP
Sbjct: 191 ACGVRFKSGRLVP 203
>gi|224059138|ref|XP_002299734.1| predicted protein [Populus trichocarpa]
Length = 178
Score = 91 bits (223), Expect = 3e-016
Identities = 56/132 (42%), Positives = 75/132 (56%), Gaps = 8/132 (6%)
Frame = +2
Query: 482 EDVEELEWVSRVVDDCSSQEVSLLLTQTH--NTKPSFSSRIPV--KPRTKRPRNSLT--G 643
+D+ ELEW+S V+D S + S L T H + P+F P+ K R+KR R +
Sbjct: 1 DDMAELEWLSNFVEDSFSTDQS-LQTNIHILSGNPAFQPETPLPGKARSKRSRAAPCDWS 59
Query: 644 DRVWPVVSTNQHAAGERQWKKKKKKQELAVVFQRRCSHCGTNTTPQWRTGPVGPKTLCNA 823
R+ V ST + + E+Q ++ + RRC HCG TPQWRTGP+GPKTLCNA
Sbjct: 60 TRLLHVPSTTK-MSSEKQLRESPDPNLDSNAMVRRCLHCGAEKTPQWRTGPMGPKTLCNA 118
Query: 824 CVVRFKSVGYVP 859
C VR+KS VP
Sbjct: 119 CGVRYKSGRLVP 130
>gi|255637027|gb|ACU18846.1| unknown [Glycine max]
Length = 352
Score = 82 bits (200), Expect = 1e-013
Identities = 37/61 (60%), Positives = 40/61 (65%)
Frame = +2
Query: 677 HAAGERQWKKKKKKQELAVVFQRRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKSVGYV 856
H E KK KKK + RRCSHCG TPQWRTGP+GPKTLCNAC VRFKS +
Sbjct: 244 HLCSEPNTKKMKKKPSSDTLAPRRCSHCGVQKTPQWRTGPLGPKTLCNACGVRFKSGRLL 303
Query: 857 P 859
P
Sbjct: 304 P 304
>gi|297835478|ref|XP_002885621.1| hypothetical protein ARALYDRAFT_479930
[Arabidopsis lyrata subsp. lyrata]
Length = 270
Score = 80 bits (197), Expect = 3e-013
Identities = 41/89 (46%), Positives = 50/89 (56%), Gaps = 3/89 (3%)
Frame = +2
Query: 593 RIPVKPRTKRPRNSLTGDRVWPVVSTNQHAAGERQWKKKKKKQELAVVFQRRCSHCGTNT 772
+ P K R+KR R TG R V+ T G ++ K A++ R+C HCG
Sbjct: 145 KAPAKARSKRRR---TGRRDLGVLWTGNEQVGIQKRKTPSVAAAAAMIMGRKCQHCGAEK 201
Query: 773 TPQWRTGPVGPKTLCNACVVRFKSVGYVP 859
TPQWR GP GPKTLCNAC VR+KS VP
Sbjct: 202 TPQWRAGPAGPKTLCNACGVRYKSGRLVP 230
>gi|15229571|ref|NP_189047.1| GATA transcription factor 1 [Arabidopsis
thaliana]
Length = 274
Score = 80 bits (195), Expect = 5e-013
Identities = 47/104 (45%), Positives = 55/104 (52%), Gaps = 9/104 (8%)
Frame = +2
Query: 563 THNTKPSFSS-----RIPVKPRTKRPRNSLTGDRVWPVVSTNQHAAGERQWKKKKKKQEL 727
T T P+ S + P K R+KR R TG R V+ T G Q KK
Sbjct: 133 TTTTTPTIMSCCVGFKAPAKARSKRRR---TGRRDLRVLWTGNEQGG-IQKKKTMTVAAA 188
Query: 728 AVVFQRRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKSVGYVP 859
A++ R+C HCG TPQWR GP GPKTLCNAC VR+KS VP
Sbjct: 189 ALIMGRKCQHCGAEKTPQWRAGPAGPKTLCNACGVRYKSGRLVP 232
>gi|21593190|gb|AAM65139.1| GATA transcription factor 1 (AtGATA-1) [Arabidopsis
thaliana]
Length = 268
Score = 80 bits (195), Expect = 5e-013
Identities = 47/104 (45%), Positives = 55/104 (52%), Gaps = 9/104 (8%)
Frame = +2
Query: 563 THNTKPSFSS-----RIPVKPRTKRPRNSLTGDRVWPVVSTNQHAAGERQWKKKKKKQEL 727
T T P+ S + P K R+KR R TG R V+ T G Q KK
Sbjct: 127 TTTTTPTIMSCCVGFKAPAKARSKRRR---TGRRDLRVLWTGNEQGG-IQKKKTMTVAAA 182
Query: 728 AVVFQRRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKSVGYVP 859
A++ R+C HCG TPQWR GP GPKTLCNAC VR+KS VP
Sbjct: 183 ALIMGRKCQHCGAEKTPQWRAGPAGPKTLCNACGVRYKSGRLVP 226
>gi|110743205|dbj|BAE99493.1| GATA transcription factor 1 [Arabidopsis
thaliana]
Length = 134
Score = 79 bits (193), Expect = 9e-013
Identities = 43/89 (48%), Positives = 50/89 (56%), Gaps = 4/89 (4%)
Frame = +2
Query: 593 RIPVKPRTKRPRNSLTGDRVWPVVSTNQHAAGERQWKKKKKKQELAVVFQRRCSHCGTNT 772
+ P K R+KR R TG R V+ T G Q KK A++ R+C HCG
Sbjct: 8 KAPAKARSKRRR---TGRRDLRVLWTGNEQGG-IQKKKTMTVAAAALIMGRKCQHCGAEK 63
Query: 773 TPQWRTGPVGPKTLCNACVVRFKSVGYVP 859
TPQWR GP GPKTLCNAC VR+KS VP
Sbjct: 64 TPQWRAGPAGPKTLCNACGVRYKSGRLVP 92
>gi|302398797|gb|ADL36693.1| GATA domain class transcription factor [Malus x
domestica]
Length = 323
Score = 76 bits (185), Expect = 8e-012
Identities = 37/60 (61%), Positives = 40/60 (66%), Gaps = 7/60 (11%)
Frame = +2
Query: 686 GERQWKKKKKKQ-------ELAVVFQRRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKS 844
GE KK+KKK + FQRRCSHC TPQWRTGP+GPKTLCNAC VRFKS
Sbjct: 214 GEPAAKKQKKKPAVQTGEGSIGGQFQRRCSHCQVQKTPQWRTGPLGPKTLCNACGVRFKS 273
>gi|302398805|gb|ADL36697.1| GATA domain class transcription factor [Malus x
domestica]
Length = 321
Score = 76 bits (185), Expect = 8e-012
Identities = 37/60 (61%), Positives = 40/60 (66%), Gaps = 7/60 (11%)
Frame = +2
Query: 686 GERQWKKKKKKQ-------ELAVVFQRRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKS 844
GE KK+KKK + FQRRCSHC TPQWRTGP+GPKTLCNAC VRFKS
Sbjct: 212 GEPAAKKQKKKPAVQTGEGSIGGQFQRRCSHCQVQKTPQWRTGPLGPKTLCNACGVRFKS 271
>gi|15239503|ref|NP_197955.1| GATA transcription factor 12 [Arabidopsis
thaliana]
Length = 331
Score = 73 bits (177), Expect = 6e-011
Identities = 34/62 (54%), Positives = 39/62 (62%)
Frame = +2
Query: 674 QHAAGERQWKKKKKKQELAVVFQRRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKSVGY 853
Q G + KK E +RRC HC T+ TPQWRTGP+GPKTLCNAC VR+KS
Sbjct: 196 QAVDGGHRRKKDVSSPESGGAEERRCLHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRL 255
Query: 854 VP 859
VP
Sbjct: 256 VP 257
>gi|301133588|gb|ADK63416.1| GATA type zinc finger protein [Brassica rapa]
Length = 256
Score = 72 bits (175), Expect = 1e-010
Identities = 30/39 (76%), Positives = 33/39 (84%)
Frame = +2
Query: 743 RRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKSVGYVP 859
RRC+HC T+ TPQWRTGP+GPKTLCNAC VRFKS VP
Sbjct: 170 RRCTHCATDKTPQWRTGPLGPKTLCNACGVRFKSGRLVP 208
>gi|15225399|ref|NP_182031.1| GATA transcription factor 2 [Arabidopsis
thaliana]
Length = 264
Score = 70 bits (170), Expect = 4e-010
Identities = 29/39 (74%), Positives = 32/39 (82%)
Frame = +2
Query: 743 RRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKSVGYVP 859
RRC+HC + TPQWRTGP+GPKTLCNAC VRFKS VP
Sbjct: 179 RRCTHCASEKTPQWRTGPLGPKTLCNACGVRFKSGRLVP 217
>gi|37572447|dbj|BAC98493.1| AG-motif binding protein-3 [Nicotiana tabacum]
Length = 256
Score = 70 bits (170), Expect = 4e-010
Identities = 29/39 (74%), Positives = 32/39 (82%)
Frame = +2
Query: 743 RRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKSVGYVP 859
RRC+HC + TPQWRTGP+GPKTLCNAC VRFKS VP
Sbjct: 167 RRCTHCASEKTPQWRTGPLGPKTLCNACGVRFKSGRLVP 205
>gi|297824543|ref|XP_002880154.1| hypothetical protein ARALYDRAFT_903940
[Arabidopsis lyrata subsp. lyrata]
Length = 262
Score = 70 bits (170), Expect = 4e-010
Identities = 29/39 (74%), Positives = 32/39 (82%)
Frame = +2
Query: 743 RRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKSVGYVP 859
RRC+HC + TPQWRTGP+GPKTLCNAC VRFKS VP
Sbjct: 177 RRCTHCASEKTPQWRTGPLGPKTLCNACGVRFKSGRLVP 215
>gi|326518913|dbj|BAJ92617.1| predicted protein [Hordeum vulgare subsp.
vulgare]
Length = 377
Score = 70 bits (170), Expect = 4e-010
Identities = 31/46 (67%), Positives = 34/46 (73%)
Frame = +2
Query: 722 ELAVVFQRRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKSVGYVP 859
E A RRC HC T+ TPQWRTGP+GPKTLCNAC VR+KS VP
Sbjct: 241 EAAAAEGRRCLHCETDKTPQWRTGPLGPKTLCNACGVRYKSGRLVP 286
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,129,805,850,752
Number of Sequences: 15229318
Number of Extensions: 5129805850752
Number of Successful Extensions: 1187520479
Number of sequences better than 0.0: 0
|