BLASTX 7.6.2
Query= UN72878 /QuerySize=542
(541 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|312281983|dbj|BAJ33857.1| unnamed protein product [Thellungie... 243 1e-062
gi|297802492|ref|XP_002869130.1| hypothetical protein ARALYDRAFT... 227 8e-058
gi|15236172|ref|NP_195194.1| GATA transcription factor 3 [Arabid... 226 1e-057
gi|20466648|gb|AAM20641.1| GATA transcription factor 3 [Arabidop... 226 1e-057
gi|255637027|gb|ACU18846.1| unknown [Glycine max] 134 7e-030
gi|302398797|gb|ADL36693.1| GATA domain class transcription fact... 128 4e-028
gi|297735055|emb|CBI17417.3| unnamed protein product [Vitis vini... 126 2e-027
gi|302398805|gb|ADL36697.1| GATA domain class transcription fact... 126 2e-027
gi|15229571|ref|NP_189047.1| GATA transcription factor 1 [Arabid... 124 6e-027
gi|21593190|gb|AAM65139.1| GATA transcription factor 1 (AtGATA-1... 124 6e-027
gi|110743205|dbj|BAE99493.1| GATA transcription factor 1 [Arabid... 124 6e-027
gi|255560976|ref|XP_002521500.1| conserved hypothetical protein ... 124 1e-026
gi|225427744|ref|XP_002274872.1| PREDICTED: hypothetical protein... 124 1e-026
gi|297744743|emb|CBI38005.3| unnamed protein product [Vitis vini... 124 1e-026
gi|297835478|ref|XP_002885621.1| hypothetical protein ARALYDRAFT... 123 1e-026
gi|255543845|ref|XP_002512985.1| GATA transcription factor, puta... 122 2e-026
gi|78499690|gb|ABB45844.1| hypothetical protein [Eutrema halophi... 122 4e-026
gi|224105311|ref|XP_002313763.1| predicted protein [Populus tric... 122 4e-026
gi|225431219|ref|XP_002272762.1| PREDICTED: hypothetical protein... 121 5e-026
gi|15239503|ref|NP_197955.1| GATA transcription factor 12 [Arabi... 121 6e-026
>gi|312281983|dbj|BAJ33857.1| unnamed protein product [Thellungiella halophila]
Length = 269
Score = 243 bits (618), Expect = 1e-062
Identities = 120/142 (84%), Positives = 125/142 (88%), Gaps = 3/142 (2%)
Frame = -2
Query: 540 IPVKPRTKRPWNSLTGDRVWPLVPTNQHAAGERQWKKKKKKQELAVVFQRRCSHCGTNTT 361
IPVKPRTKR NSLTG RVWPLV TNQHAA ER W +KKKQE AV FQRRCSHCGTN T
Sbjct: 130 IPVKPRTKRSRNSLTGGRVWPLVSTNQHAATER-W--RKKKQETAVAFQRRCSHCGTNNT 186
Query: 360 PQWRTGPVGPKTLCNACVVRFKSGRVCPEYRPADSPTFFNEIHSNLHRKVLELRKSKELV 181
PQWRTGP+GPKTLCNAC VRFKSGR+CPEYRPADSPTF NEIHSNLHRKVLELRKSKEL
Sbjct: 187 PQWRTGPLGPKTLCNACGVRFKSGRLCPEYRPADSPTFSNEIHSNLHRKVLELRKSKELG 246
Query: 180 KGTGEAINKSDQVKFGSQVVEK 115
+ TGEA KSDQVKFGS+VVEK
Sbjct: 247 EETGEATTKSDQVKFGSKVVEK 268
>gi|297802492|ref|XP_002869130.1| hypothetical protein ARALYDRAFT_491187
[Arabidopsis lyrata subsp. lyrata]
Length = 268
Score = 227 bits (577), Expect = 8e-058
Identities = 114/139 (82%), Positives = 120/139 (86%), Gaps = 4/139 (2%)
Frame = -2
Query: 540 IPVKPRTKRPWNSLTGDRVWPLVPTN-QHAAGERQWKKKKKKQELAVVFQRRCSHCGTNT 364
IPVKPRTKR NSLTG RVWPLV TN QHAA E + +KKKQE AVVFQRRCSHCGTN
Sbjct: 132 IPVKPRTKRSRNSLTGGRVWPLVSTNHQHAATE---QLRKKKQETAVVFQRRCSHCGTNN 188
Query: 363 TPQWRTGPVGPKTLCNACVVRFKSGRVCPEYRPADSPTFFNEIHSNLHRKVLELRKSKEL 184
TPQWRTGPVGPKTLCNAC VRFKSGR+CPEYRPADSPTF EIHSNLHRKVLELRKSKEL
Sbjct: 189 TPQWRTGPVGPKTLCNACGVRFKSGRLCPEYRPADSPTFSTEIHSNLHRKVLELRKSKEL 248
Query: 183 VKGTGEAINKSDQVKFGSQ 127
+ TGEA KS+QVKFGS+
Sbjct: 249 GEETGEASTKSNQVKFGSK 267
>gi|15236172|ref|NP_195194.1| GATA transcription factor 3 [Arabidopsis
thaliana]
Length = 269
Score = 226 bits (576), Expect = 1e-057
Identities = 113/139 (81%), Positives = 119/139 (85%), Gaps = 4/139 (2%)
Frame = -2
Query: 540 IPVKPRTKRPWNSLTGDRVWPLVPTN-QHAAGERQWKKKKKKQELAVVFQRRCSHCGTNT 364
IPVKPRTKR NSLTG RVWPLV TN QHAA E + +KKKQE +VFQRRCSHCGTN
Sbjct: 133 IPVKPRTKRSRNSLTGSRVWPLVSTNHQHAATE---QLRKKKQETVLVFQRRCSHCGTNN 189
Query: 363 TPQWRTGPVGPKTLCNACVVRFKSGRVCPEYRPADSPTFFNEIHSNLHRKVLELRKSKEL 184
TPQWRTGPVGPKTLCNAC VRFKSGR+CPEYRPADSPTF NEIHSNLHRKVLELRKSKEL
Sbjct: 190 TPQWRTGPVGPKTLCNACGVRFKSGRLCPEYRPADSPTFSNEIHSNLHRKVLELRKSKEL 249
Query: 183 VKGTGEAINKSDQVKFGSQ 127
+ TGEA KSD VKFGS+
Sbjct: 250 GEETGEASTKSDPVKFGSK 268
>gi|20466648|gb|AAM20641.1| GATA transcription factor 3 [Arabidopsis thaliana]
Length = 269
Score = 226 bits (576), Expect = 1e-057
Identities = 113/139 (81%), Positives = 119/139 (85%), Gaps = 4/139 (2%)
Frame = -2
Query: 540 IPVKPRTKRPWNSLTGDRVWPLVPTN-QHAAGERQWKKKKKKQELAVVFQRRCSHCGTNT 364
IPVKPRTKR NSLTG RVWPLV TN QHAA E + +KKKQE +VFQRRCSHCGTN
Sbjct: 133 IPVKPRTKRSRNSLTGSRVWPLVSTNHQHAATE---QLRKKKQETVLVFQRRCSHCGTNN 189
Query: 363 TPQWRTGPVGPKTLCNACVVRFKSGRVCPEYRPADSPTFFNEIHSNLHRKVLELRKSKEL 184
TPQWRTGPVGPKTLCNAC VRFKSGR+CPEYRPADSPTF NEIHSNLHRKVLELRKSKEL
Sbjct: 190 TPQWRTGPVGPKTLCNACGVRFKSGRLCPEYRPADSPTFSNEIHSNLHRKVLELRKSKEL 249
Query: 183 VKGTGEAINKSDQVKFGSQ 127
+ TGEA KSD VKFGS+
Sbjct: 250 GEETGEASTKSDPVKFGSK 268
>gi|255637027|gb|ACU18846.1| unknown [Glycine max]
Length = 352
Score = 134 bits (336), Expect = 7e-030
Identities = 62/93 (66%), Positives = 69/93 (74%)
Frame = -2
Query: 459 HAAGERQWKKKKKKQELAVVFQRRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKSGRVC 280
H E KK KKK + RRCSHCG TPQWRTGP+GPKTLCNAC VRFKSGR+
Sbjct: 244 HLCSEPNTKKMKKKPSSDTLAPRRCSHCGVQKTPQWRTGPLGPKTLCNACGVRFKSGRLL 303
Query: 279 PEYRPADSPTFFNEIHSNLHRKVLELRKSKELV 181
PEYRPA SPTF +E+HSN HRKVLE+R+ KE V
Sbjct: 304 PEYRPACSPTFSSELHSNHHRKVLEMRQKKETV 336
>gi|302398797|gb|ADL36693.1| GATA domain class transcription factor [Malus x
domestica]
Length = 323
Score = 128 bits (321), Expect = 4e-028
Identities = 62/96 (64%), Positives = 70/96 (72%), Gaps = 7/96 (7%)
Frame = -2
Query: 450 GERQWKKKKKKQ-------ELAVVFQRRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKS 292
GE KK+KKK + FQRRCSHC TPQWRTGP+GPKTLCNAC VRFKS
Sbjct: 214 GEPAAKKQKKKPAVQTGEGSIGGQFQRRCSHCQVQKTPQWRTGPLGPKTLCNACGVRFKS 273
Query: 291 GRVCPEYRPADSPTFFNEIHSNLHRKVLELRKSKEL 184
GR+ PEYRPA SPTF ++HSN HRKVLE+RK KE+
Sbjct: 274 GRLFPEYRPACSPTFSGDVHSNSHRKVLEMRKRKEV 309
>gi|297735055|emb|CBI17417.3| unnamed protein product [Vitis vinifera]
Length = 305
Score = 126 bits (316), Expect = 2e-027
Identities = 64/124 (51%), Positives = 79/124 (63%), Gaps = 7/124 (5%)
Frame = -2
Query: 537 PVKPRTKRPWNSLTGDRVW----PLVPTNQHAAGERQWKKKKKKQELAVVFQRRCSHCGT 370
P K R+KR + TG RVW P + + ++ + A RCSHCG
Sbjct: 173 PAKARSKR---ARTGGRVWSMGSPSLTESSSSSSSSSSSLDPEASGSAQPTPHRCSHCGV 229
Query: 369 NTTPQWRTGPVGPKTLCNACVVRFKSGRVCPEYRPADSPTFFNEIHSNLHRKVLELRKSK 190
TPQWRTGP+G KTLCNAC VR+KSGR+ PEYRPA SPTF +EIHSN HRKVLE+R+ K
Sbjct: 230 QKTPQWRTGPLGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSEIHSNHHRKVLEMRRKK 289
Query: 189 ELVK 178
E+ +
Sbjct: 290 EVTR 293
>gi|302398805|gb|ADL36697.1| GATA domain class transcription factor [Malus x
domestica]
Length = 321
Score = 126 bits (315), Expect = 2e-027
Identities = 61/96 (63%), Positives = 69/96 (71%), Gaps = 7/96 (7%)
Frame = -2
Query: 450 GERQWKKKKKKQ-------ELAVVFQRRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKS 292
GE KK+KKK + FQRRCSHC TPQWRTGP+GPKTLCNAC VRFKS
Sbjct: 212 GEPAAKKQKKKPAVQTGEGSIGGQFQRRCSHCQVQKTPQWRTGPLGPKTLCNACGVRFKS 271
Query: 291 GRVCPEYRPADSPTFFNEIHSNLHRKVLELRKSKEL 184
GR+ PEYRPA SPTF +HSN HRKVLE+RK K++
Sbjct: 272 GRLFPEYRPACSPTFSGAVHSNSHRKVLEMRKRKDV 307
>gi|15229571|ref|NP_189047.1| GATA transcription factor 1 [Arabidopsis
thaliana]
Length = 274
Score = 124 bits (311), Expect = 6e-027
Identities = 63/124 (50%), Positives = 78/124 (62%), Gaps = 4/124 (3%)
Frame = -2
Query: 537 PVKPRTKRPWNSLTGDRVWPLVPTNQHAAGERQWKKKKKKQELAVVFQRRCSHCGTNTTP 358
P K R+KR TG R ++ T G Q KK A++ R+C HCG TP
Sbjct: 150 PAKARSKR---RRTGRRDLRVLWTGNEQGG-IQKKKTMTVAAAALIMGRKCQHCGAEKTP 205
Query: 357 QWRTGPVGPKTLCNACVVRFKSGRVCPEYRPADSPTFFNEIHSNLHRKVLELRKSKELVK 178
QWR GP GPKTLCNAC VR+KSGR+ PEYRPA+SPTF E+HSN HRK++E+RK +
Sbjct: 206 QWRAGPAGPKTLCNACGVRYKSGRLVPEYRPANSPTFTAELHSNSHRKIVEMRKQYQSGD 265
Query: 177 GTGE 166
G G+
Sbjct: 266 GDGD 269
>gi|21593190|gb|AAM65139.1| GATA transcription factor 1 (AtGATA-1) [Arabidopsis
thaliana]
Length = 268
Score = 124 bits (311), Expect = 6e-027
Identities = 63/124 (50%), Positives = 78/124 (62%), Gaps = 4/124 (3%)
Frame = -2
Query: 537 PVKPRTKRPWNSLTGDRVWPLVPTNQHAAGERQWKKKKKKQELAVVFQRRCSHCGTNTTP 358
P K R+KR TG R ++ T G Q KK A++ R+C HCG TP
Sbjct: 144 PAKARSKR---RRTGRRDLRVLWTGNEQGG-IQKKKTMTVAAAALIMGRKCQHCGAEKTP 199
Query: 357 QWRTGPVGPKTLCNACVVRFKSGRVCPEYRPADSPTFFNEIHSNLHRKVLELRKSKELVK 178
QWR GP GPKTLCNAC VR+KSGR+ PEYRPA+SPTF E+HSN HRK++E+RK +
Sbjct: 200 QWRAGPAGPKTLCNACGVRYKSGRLVPEYRPANSPTFTAELHSNSHRKIVEMRKQYQSGD 259
Query: 177 GTGE 166
G G+
Sbjct: 260 GDGD 263
>gi|110743205|dbj|BAE99493.1| GATA transcription factor 1 [Arabidopsis
thaliana]
Length = 134
Score = 124 bits (311), Expect = 6e-027
Identities = 63/124 (50%), Positives = 78/124 (62%), Gaps = 4/124 (3%)
Frame = -2
Query: 537 PVKPRTKRPWNSLTGDRVWPLVPTNQHAAGERQWKKKKKKQELAVVFQRRCSHCGTNTTP 358
P K R+KR TG R ++ T G Q KK A++ R+C HCG TP
Sbjct: 10 PAKARSKR---RRTGRRDLRVLWTGNEQGG-IQKKKTMTVAAAALIMGRKCQHCGAEKTP 65
Query: 357 QWRTGPVGPKTLCNACVVRFKSGRVCPEYRPADSPTFFNEIHSNLHRKVLELRKSKELVK 178
QWR GP GPKTLCNAC VR+KSGR+ PEYRPA+SPTF E+HSN HRK++E+RK +
Sbjct: 66 QWRAGPAGPKTLCNACGVRYKSGRLVPEYRPANSPTFTAELHSNSHRKIVEMRKQYQSGD 125
Query: 177 GTGE 166
G G+
Sbjct: 126 GDGD 129
>gi|255560976|ref|XP_002521500.1| conserved hypothetical protein [Ricinus
communis]
Length = 398
Score = 124 bits (309), Expect = 1e-026
Identities = 55/71 (77%), Positives = 61/71 (85%)
Frame = -2
Query: 393 RRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKSGRVCPEYRPADSPTFFNEIHSNLHRK 214
RRCSHCG TPQWRTGP+G KTLCNAC VRFKSGR+ PEYRPA SPTF +E+HSN HRK
Sbjct: 313 RRCSHCGVQKTPQWRTGPLGAKTLCNACGVRFKSGRLLPEYRPACSPTFCSELHSNHHRK 372
Query: 213 VLELRKSKELV 181
VLE+RK KE+V
Sbjct: 373 VLEMRKKKEVV 383
>gi|225427744|ref|XP_002274872.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 317
Score = 124 bits (309), Expect = 1e-026
Identities = 59/83 (71%), Positives = 63/83 (75%)
Frame = -2
Query: 435 KKKKKKQELAVVFQRRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKSGRVCPEYRPADS 256
KK KK QRRCSHC TPQWRTGP+GPKTLCNAC VRFKSGR+ PEYRPA S
Sbjct: 228 KKPKKSPSADSQPQRRCSHCLVQKTPQWRTGPLGPKTLCNACGVRFKSGRLFPEYRPACS 287
Query: 255 PTFFNEIHSNLHRKVLELRKSKE 187
PTF EIHSN HRKVLE+R+ KE
Sbjct: 288 PTFSVEIHSNSHRKVLEIRRKKE 310
>gi|297744743|emb|CBI38005.3| unnamed protein product [Vitis vinifera]
Length = 352
Score = 124 bits (309), Expect = 1e-026
Identities = 59/83 (71%), Positives = 63/83 (75%)
Frame = -2
Query: 435 KKKKKKQELAVVFQRRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKSGRVCPEYRPADS 256
KK KK QRRCSHC TPQWRTGP+GPKTLCNAC VRFKSGR+ PEYRPA S
Sbjct: 263 KKPKKSPSADSQPQRRCSHCLVQKTPQWRTGPLGPKTLCNACGVRFKSGRLFPEYRPACS 322
Query: 255 PTFFNEIHSNLHRKVLELRKSKE 187
PTF EIHSN HRKVLE+R+ KE
Sbjct: 323 PTFSVEIHSNSHRKVLEIRRKKE 345
>gi|297835478|ref|XP_002885621.1| hypothetical protein ARALYDRAFT_479930
[Arabidopsis lyrata subsp. lyrata]
Length = 270
Score = 123 bits (308), Expect = 1e-026
Identities = 59/114 (51%), Positives = 74/114 (64%), Gaps = 3/114 (2%)
Frame = -2
Query: 537 PVKPRTKRPWNSLTGDRVWPLVPTNQHAAGERQWKKKKKKQELAVVFQRRCSHCGTNTTP 358
P K R+KR TG R ++ T G ++ K A++ R+C HCG TP
Sbjct: 147 PAKARSKR---RRTGRRDLGVLWTGNEQVGIQKRKTPSVAAAAAMIMGRKCQHCGAEKTP 203
Query: 357 QWRTGPVGPKTLCNACVVRFKSGRVCPEYRPADSPTFFNEIHSNLHRKVLELRK 196
QWR GP GPKTLCNAC VR+KSGR+ PEYRPA+SPTF E+HSN HRK++E+RK
Sbjct: 204 QWRAGPAGPKTLCNACGVRYKSGRLVPEYRPANSPTFTAELHSNSHRKIVEMRK 257
>gi|255543845|ref|XP_002512985.1| GATA transcription factor, putative [Ricinus
communis]
Length = 368
Score = 122 bits (306), Expect = 2e-026
Identities = 55/72 (76%), Positives = 60/72 (83%)
Frame = -2
Query: 399 FQRRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKSGRVCPEYRPADSPTFFNEIHSNLH 220
FQRRCSHC TPQWRTGP+G KTLCNAC VR+KSGR+ PEYRPA SPTF +IHSN H
Sbjct: 283 FQRRCSHCQVQKTPQWRTGPLGAKTLCNACGVRYKSGRLFPEYRPACSPTFSGDIHSNSH 342
Query: 219 RKVLELRKSKEL 184
RKVLE+RK KEL
Sbjct: 343 RKVLEIRKKKEL 354
>gi|78499690|gb|ABB45844.1| hypothetical protein [Eutrema halophilum]
Length = 332
Score = 122 bits (304), Expect = 4e-026
Identities = 57/99 (57%), Positives = 69/99 (69%), Gaps = 6/99 (6%)
Frame = -2
Query: 432 KKKKKQELAVVF------QRRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKSGRVCPEY 271
KK KK+ V+ QRRCSHCG TPQWR GP+G KTLCNAC VR+KSGR+ PEY
Sbjct: 223 KKHKKRSAESVYSGQPLQQRRCSHCGIQKTPQWRAGPMGAKTLCNACGVRYKSGRLLPEY 282
Query: 270 RPADSPTFFNEIHSNLHRKVLELRKSKELVKGTGEAINK 154
RPA SPTF +E+HSN HRKV+E+R+ KE +N+
Sbjct: 283 RPACSPTFSSELHSNHHRKVMEMRRKKEPTDDNATGLNQ 321
>gi|224105311|ref|XP_002313763.1| predicted protein [Populus trichocarpa]
Length = 329
Score = 122 bits (304), Expect = 4e-026
Identities = 56/82 (68%), Positives = 65/82 (79%), Gaps = 1/82 (1%)
Frame = -2
Query: 399 FQRRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKSGRVCPEYRPADSPTFFNEIHSNLH 220
FQRRCSHC TPQWRTGP+G KTLCNAC VR+KSGR+ PEYRPA SPTF +E+HSN H
Sbjct: 244 FQRRCSHCQVQKTPQWRTGPLGAKTLCNACGVRYKSGRLFPEYRPACSPTFSSEVHSNSH 303
Query: 219 RKVLELRKSKELVKGTGEAINK 154
RKVLE+R+ KE V G +N+
Sbjct: 304 RKVLEMRRKKE-VAGAEPRLNQ 324
>gi|225431219|ref|XP_002272762.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 338
Score = 121 bits (303), Expect = 5e-026
Identities = 59/99 (59%), Positives = 71/99 (71%), Gaps = 5/99 (5%)
Frame = -2
Query: 459 HAAGERQWKKKKKKQE-----LAVVFQRRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFK 295
H+A + KK KK+ + A RCSHCG TPQWRTGP+G KTLCNAC VR+K
Sbjct: 228 HSAVKPPAKKHKKRLDPEASGSAQPTPHRCSHCGVQKTPQWRTGPLGAKTLCNACGVRYK 287
Query: 294 SGRVCPEYRPADSPTFFNEIHSNLHRKVLELRKSKELVK 178
SGR+ PEYRPA SPTF +EIHSN HRKVLE+R+ KE+ +
Sbjct: 288 SGRLLPEYRPACSPTFSSEIHSNHHRKVLEMRRKKEVTR 326
>gi|15239503|ref|NP_197955.1| GATA transcription factor 12 [Arabidopsis
thaliana]
Length = 331
Score = 121 bits (302), Expect = 6e-026
Identities = 58/102 (56%), Positives = 69/102 (67%)
Frame = -2
Query: 462 QHAAGERQWKKKKKKQELAVVFQRRCSHCGTNTTPQWRTGPVGPKTLCNACVVRFKSGRV 283
Q G + KK E +RRC HC T+ TPQWRTGP+GPKTLCNAC VR+KSGR+
Sbjct: 196 QAVDGGHRRKKDVSSPESGGAEERRCLHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRL 255
Query: 282 CPEYRPADSPTFFNEIHSNLHRKVLELRKSKELVKGTGEAIN 157
PEYRPA SPTF HSN HRKV+ELR+ KE+ + E I+
Sbjct: 256 VPEYRPAASPTFVLAKHSNSHRKVMELRRQKEMSRAHHEFIH 297
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 670,546,551,498
Number of Sequences: 15229318
Number of Extensions: 670546551498
Number of Successful Extensions: 203192682
Number of sequences better than 0.0: 0