BLASTX 7.6.2
Query= UN25379 /QuerySize=841
(840 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|297800552|ref|XP_002868160.1| hypothetical protein ARALYDRAFT... 264 2e-068
gi|15228899|ref|NP_188312.1| GATA transcription factor 17 [Arabi... 215 9e-054
gi|240255906|ref|NP_680707.4| GATA type zinc finger transcriptio... 167 2e-039
gi|297834584|ref|XP_002885174.1| hypothetical protein ARALYDRAFT... 152 7e-035
gi|224123912|ref|XP_002330240.1| predicted protein [Populus tric... 102 1e-019
gi|225431869|ref|XP_002275498.1| PREDICTED: hypothetical protein... 101 2e-019
gi|326502532|dbj|BAJ95329.1| predicted protein [Hordeum vulgare ... 101 2e-019
gi|255633610|gb|ACU17164.1| unknown [Glycine max] 100 4e-019
gi|255556286|ref|XP_002519177.1| GATA transcription factor, puta... 94 2e-017
gi|224130312|ref|XP_002328578.1| predicted protein [Populus tric... 92 1e-016
gi|297829216|ref|XP_002882490.1| hypothetical protein ARALYDRAFT... 89 7e-016
gi|224110254|ref|XP_002315462.1| predicted protein [Populus tric... 88 1e-015
gi|115456383|ref|NP_001051792.1| Os03g0831200 [Oryza sativa Japo... 87 4e-015
gi|18397703|ref|NP_566290.1| GATA transcription factor 15 [Arabi... 86 8e-015
gi|21536761|gb|AAM61093.1| unknown [Arabidopsis thaliana] 86 8e-015
gi|7549639|gb|AAF63824.1| hypothetical protein [Arabidopsis thal... 86 8e-015
gi|15239847|ref|NP_199741.1| GATA transcription factor 16 [Arabi... 84 2e-014
gi|297795681|ref|XP_002865725.1| hypothetical protein ARALYDRAFT... 84 2e-014
gi|225437491|ref|XP_002269588.1| PREDICTED: hypothetical protein... 80 3e-013
>gi|297800552|ref|XP_002868160.1| hypothetical protein ARALYDRAFT_329901
[Arabidopsis lyrata subsp. lyrata]
Length = 176
Score = 264 bits (673), Expect = 2e-068
Identities = 136/179 (75%), Positives = 153/179 (85%), Gaps = 9/179 (5%)
Frame = +1
Query: 154 TEESKMTKLESAGDSSDVENGNCSSSGSGGGGGDTKKTCVDCGTNKTPFWRGGPAGPKSL 333
TEE+K TKLESAGDSSDV+NGNCSSSGS GGDTKKTCVDCGT++TP WRGGPAGPKSL
Sbjct: 4 TEETKTTKLESAGDSSDVDNGNCSSSGS---GGDTKKTCVDCGTSRTPLWRGGPAGPKSL 60
Query: 334 CNACGIKSRKKRQAALGIKQGDNNKMKNNKSNDSCLALDNQTVKNGKGDSGNVKNKIKTE 513
CNACGIKSRKKRQAALGI+Q D NKMKN +N+ L L+N+TVK GKG+ GNVKNKIKT+
Sbjct: 61 CNACGIKSRKKRQAALGIRQED-NKMKNKCNNN--LNLENRTVKIGKGEPGNVKNKIKTD 117
Query: 514 TED---CNDKKSVKRGSRFLDLGFKVPVMRRSVVEKKRVWMKLGEEERAAVLLMALSCG 681
E+ N+ K+VK+ RFLD GFKVP M+RS VEKKR+W KLGEEERAAVLLMALSCG
Sbjct: 118 PENFSSSNNNKNVKKVGRFLDFGFKVPAMKRSAVEKKRLWRKLGEEERAAVLLMALSCG 176
>gi|15228899|ref|NP_188312.1| GATA transcription factor 17 [Arabidopsis
thaliana]
Length = 190
Score = 215 bits (546), Expect = 9e-054
Identities = 120/189 (63%), Positives = 139/189 (73%), Gaps = 15/189 (7%)
Frame = +1
Query: 145 MSMTEESKMTKLESAGDSSDVENGNCSSSGSGGG--GGDTKKTCVDCGTNKTPFWRGGPA 318
MS E TKL+SAG+ SDV+N NCSSSGSGGG GDTK+TCVDCGT +TP WRGGPA
Sbjct: 1 MSEGSEDTKTKLDSAGELSDVDNENCSSSGSGGGSSSGDTKRTCVDCGTIRTPLWRGGPA 60
Query: 319 GPKSLCNACGIKSRKKRQAALGIKQGDNNKMKNNKSN-DSCLALDNQTVKNGK---GDSG 486
GPKSLCNACGIKSRKKRQAALG++ K KN KSN ++ L LD++ K K D G
Sbjct: 61 GPKSLCNACGIKSRKKRQAALGMR--SEEKKKNRKSNCNNDLNLDHRNAKKYKINIVDDG 118
Query: 487 NVKNKIKTETEDCNDKKSV-----KRGSRFLDLGFKVPVMRRSVVEKKRVWMKLGEEERA 651
+ I + + CN+K+S K S+FLDLGFKVPVM+RS VEKKR+W KLGEEERA
Sbjct: 119 KI--DIDDDPKICNNKRSSSSSSNKGVSKFLDLGFKVPVMKRSAVEKKRLWRKLGEEERA 176
Query: 652 AVLLMALSC 678
AVLLMALSC
Sbjct: 177 AVLLMALSC 185
>gi|240255906|ref|NP_680707.4| GATA type zinc finger transcription factor family
protein [Arabidopsis thaliana]
Length = 197
Score = 167 bits (422), Expect = 2e-039
Identities = 84/114 (73%), Positives = 98/114 (85%), Gaps = 6/114 (5%)
Frame = +1
Query: 154 TEESKMTKLESAGDSSDVENGNCSSSGSGGGGGDTKKTCVDCGTNKTPFWRGGPAGPKSL 333
TEE+K TKLESAGDSSDV+NGNCSSSGS GGDTKKTCVDCGT++TP WRGGPAGPKSL
Sbjct: 4 TEETKTTKLESAGDSSDVDNGNCSSSGS---GGDTKKTCVDCGTSRTPLWRGGPAGPKSL 60
Query: 334 CNACGIKSRKKRQAALGIKQGDNNKMKNNKSNDSCLALDNQTVKNGKGDSGNVK 495
CNACGIKSRKKRQAALGI+Q D+ K+K+ +N+ L L+++ VK GKG+ NVK
Sbjct: 61 CNACGIKSRKKRQAALGIRQ-DDIKIKSKSNNN--LGLESRNVKTGKGEPVNVK 111
Score = 113 bits (282), Expect = 4e-023
Identities = 76/176 (43%), Positives = 93/176 (52%), Gaps = 16/176 (9%)
Frame = +1
Query: 184 SAGDSSDVENGNCSSSGSG------GGGGDTKKTCVDCGTNKTPFWRGGPAGPKSLCNAC 345
S+G D + C G+ GG K C CG K+ R G +
Sbjct: 28 SSGSGGDTKK-TCVDCGTSRTPLWRGGPAGPKSLCNACGI-KSRKKRQAALGIRQ----D 81
Query: 346 GIKSRKKRQAALGIKQGDNNKMKNNKSNDSCLALDNQTVKNGKGDSGNVKNKIKTETEDC 525
IK + K LG++ + K N + VK KG+ GNVKNKIK + E+
Sbjct: 82 DIKIKSKSNNNLGLESRNVKTGKGEPVNVKIAKCEPGIVKIAKGEPGNVKNKIKRDPENS 141
Query: 526 ----NDKKSVKRGSRFLDLGFKVPVMRRSVVEKKRVWMKLGEEERAAVLLMALSCG 681
N+KK+VKR RFLD GFKVP M+RS VEKKR+W KLGEEERAAVLLMALSCG
Sbjct: 142 SSSNNNKKNVKRVGRFLDFGFKVPAMKRSAVEKKRLWRKLGEEERAAVLLMALSCG 197
>gi|297834584|ref|XP_002885174.1| hypothetical protein ARALYDRAFT_479155
[Arabidopsis lyrata subsp. lyrata]
Length = 175
Score = 152 bits (383), Expect = 7e-035
Identities = 80/133 (60%), Positives = 95/133 (71%), Gaps = 5/133 (3%)
Frame = +1
Query: 145 MSMTEESKMTKLESAGDSSDVENGNCSSSGSGGG-GGDTKKTCVDCGTNKTPFWRGGPAG 321
MS E TK++SAG+ SDV+N NCSSSGSGGG GDTK+TCVDCGT +TP WRGGPAG
Sbjct: 1 MSEGSEETKTKVDSAGELSDVDNENCSSSGSGGGSSGDTKRTCVDCGTIRTPLWRGGPAG 60
Query: 322 PKSLCNACGIKSRKKRQAALGIKQGDNNKMKNNKSNDSCLALDNQTVKNGK--GDSGNVK 495
PKSLCNACGIKSRKKRQAALG++ K KN KS+ + L LD++ KN K D
Sbjct: 61 PKSLCNACGIKSRKKRQAALGMR--SEEKKKNRKSSGNDLNLDHRNAKNDKINKDDDAKN 118
Query: 496 NKIKTETEDCNDK 534
+KI + + NDK
Sbjct: 119 DKINKDDDAKNDK 131
>gi|224123912|ref|XP_002330240.1| predicted protein [Populus trichocarpa]
Length = 161
Score = 102 bits (252), Expect = 1e-019
Identities = 61/150 (40%), Positives = 88/150 (58%), Gaps = 10/150 (6%)
Frame = +1
Query: 235 SGGGGGDTKKTCVDCGTNKTPFWRGGPAGPKSLCNACGIKSRKKRQAALGIKQGDNNKMK 414
SG G + KK C DC T KTP WRGGPAGPKSLCNACGI+ RKKR + + +++G K +
Sbjct: 15 SGSGDIEGKKACTDCKTTKTPLWRGGPAGPKSLCNACGIRYRKKR-SVMRLEKGPEKKRE 73
Query: 415 NNKSNDSCLALDNQTVKNGKGDSGNVKNKIKTETEDCNDKKSVKRGSRFLDLGFKVPVMR 594
++++ A D T+ + N + + + +S++ + LG ++ + R
Sbjct: 74 KTTTSNTTTATDISTI-----TTATTTNTAQVVSGNGLISESLRMS--LMVLGEEMMLQR 126
Query: 595 RSVVEKKRVW--MKLGEEERAAVLLMALSC 678
SVV+K+R KL EEE+AA LMALSC
Sbjct: 127 PSVVKKQRCQRKRKLREEEQAAFSLMALSC 156
>gi|225431869|ref|XP_002275498.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 153
Score = 101 bits (249), Expect = 2e-019
Identities = 50/104 (48%), Positives = 64/104 (61%), Gaps = 8/104 (7%)
Frame = +1
Query: 181 ESAGDSSDVENGNCSS-SGSGGGGGDTKKTCVDCGTNKTPFWRGGPAGPKSLCNACGIKS 357
E +S D+ N N + S + + KKTC DCGT KTP WRGGPAGPKSLCNACGI+S
Sbjct: 6 EKGSESEDMNNKNPDAVSSAESQVNEPKKTCADCGTTKTPLWRGGPAGPKSLCNACGIRS 65
Query: 358 RKKRQAALGIKQGDNNKMKNNKSNDSCLALDNQTVKNGKGDSGN 489
RKKR+A LG+ +G + K +S+ N + NG G+ N
Sbjct: 66 RKKRRAFLGLNKGSTDDRKAKRSS-------NHSHNNGGGNGNN 102
>gi|326502532|dbj|BAJ95329.1| predicted protein [Hordeum vulgare subsp.
vulgare]
Length = 181
Score = 101 bits (249), Expect = 2e-019
Identities = 64/174 (36%), Positives = 96/174 (55%), Gaps = 12/174 (6%)
Frame = +1
Query: 166 KMTKLESAGDSSDVENGNCSSSGSGGGGGDTKKTCVDCGTNKTPFWRGGPAGPKSLCNAC 345
KM +E ++ + +C++SG+G K+C DC T KTP WRGGP GPKSLCNAC
Sbjct: 14 KMKMIEVEVQAAAADPDDCTASGAG-----DPKSCADCNTTKTPLWRGGPNGPKSLCNAC 68
Query: 346 GIKSRKKRQAALGIKQGDNNKMKNNKSNDSCLALDNQTVKNGKGDSGNVKNKIKTETEDC 525
GI+ RK+R+ A+G+ K K + + +S A + + + K + +
Sbjct: 69 GIRYRKRRRVAMGLDPEAKRKPKRDDAINSAAAAAEASTQQQE-----EVTKPTDDDKAV 123
Query: 526 NDKKSVKRGSRFLDL-GF-KVPVMRRSVVEKKRVWMKLGEEERAAVLLMALSCG 681
+ K+ K + L + GF K V+++ ++R LGEEERAA+LLMALS G
Sbjct: 124 STNKTTKTHTVELHMVGFGKDAVLKQRRRMRRRKPSCLGEEERAAMLLMALSSG 177
>gi|255633610|gb|ACU17164.1| unknown [Glycine max]
Length = 130
Score = 100 bits (247), Expect = 4e-019
Identities = 49/95 (51%), Positives = 59/95 (62%), Gaps = 7/95 (7%)
Frame = +1
Query: 178 LESAGDSSDVE------NGNCSSSG-SGGGGGDTKKTCVDCGTNKTPFWRGGPAGPKSLC 336
++ G S++E N N SSG S + KKTC DCGT KTP WRGGPAGPKSLC
Sbjct: 2 VDPTGKGSEIEVEDSNSNPNAPSSGNSPSSNNEQKKTCADCGTTKTPLWRGGPAGPKSLC 61
Query: 337 NACGIKSRKKRQAALGIKQGDNNKMKNNKSNDSCL 441
NACGI+SRKK++A LGI +G N + K L
Sbjct: 62 NACGIRSRKKKRAILGINKGSNEDGRKGKRTGGAL 96
>gi|255556286|ref|XP_002519177.1| GATA transcription factor, putative [Ricinus
communis]
Length = 149
Score = 94 bits (233), Expect = 2e-017
Identities = 43/88 (48%), Positives = 55/88 (62%)
Frame = +1
Query: 226 SSGSGGGGGDTKKTCVDCGTNKTPFWRGGPAGPKSLCNACGIKSRKKRQAALGIKQGDNN 405
SS S G KK+C DCGT KTP WRGGPAGPKSLCNACGI+SRKK++ +LG+ + +N
Sbjct: 15 SSKSAEGENQQKKSCADCGTTKTPLWRGGPAGPKSLCNACGIRSRKKKRDSLGLNRASSN 74
Query: 406 KMKNNKSNDSCLALDNQTVKNGKGDSGN 489
K ++ + S N N G+
Sbjct: 75 PDKKSRKHSSSNGSSNNHNSNNSNRLGD 102
>gi|224130312|ref|XP_002328578.1| predicted protein [Populus trichocarpa]
Length = 125
Score = 92 bits (226), Expect = 1e-016
Identities = 45/85 (52%), Positives = 57/85 (67%), Gaps = 1/85 (1%)
Frame = +1
Query: 226 SSGSGGGGGDTKKTCVDCGTNKTPFWRGGPAGPKSLCNACGIKSRKKRQAALGIKQG-DN 402
SS S KKTC DCGT+KTP WRGGPAGPKSLCNACGI+SRKK++ LG+ +G N
Sbjct: 2 SSNSQETESPLKKTCADCGTSKTPLWRGGPAGPKSLCNACGIRSRKKKRDILGLNKGAAN 61
Query: 403 NKMKNNKSNDSCLALDNQTVKNGKG 477
+K SN++ + +N + G G
Sbjct: 62 DKRAKKGSNNNGSSNNNNNKQLGDG 86
>gi|297829216|ref|XP_002882490.1| hypothetical protein ARALYDRAFT_477989
[Arabidopsis lyrata subsp. lyrata]
Length = 137
Score = 89 bits (219), Expect = 7e-016
Identities = 41/83 (49%), Positives = 54/83 (65%)
Frame = +1
Query: 178 LESAGDSSDVENGNCSSSGSGGGGGDTKKTCVDCGTNKTPFWRGGPAGPKSLCNACGIKS 357
+ES S D + SSS S G + KK+C CGT+KTP WRGGPAGPKSLCNACGI++
Sbjct: 1 MESKLTSVDAIEEHSSSSSSNEGISNEKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRN 60
Query: 358 RKKRQAALGIKQGDNNKMKNNKS 426
RKKR+ + + D +N++
Sbjct: 61 RKKRRTLISNRSEDKKNKNHNRN 83
>gi|224110254|ref|XP_002315462.1| predicted protein [Populus trichocarpa]
Length = 125
Score = 88 bits (217), Expect = 1e-015
Identities = 41/77 (53%), Positives = 52/77 (67%), Gaps = 5/77 (6%)
Frame = +1
Query: 259 KKTCVDCGTNKTPFWRGGPAGPKSLCNACGIKSRKKRQAALGIKQG-----DNNKMKNNK 423
KKTC DCGT+KTP WRGGPAGPKSLCNACGI+SRKK++ LG+ +G D K +
Sbjct: 13 KKTCADCGTSKTPLWRGGPAGPKSLCNACGIRSRKKKRDILGLNKGGAAANDKRAKKGST 72
Query: 424 SNDSCLALDNQTVKNGK 474
+N S L + + G+
Sbjct: 73 NNGSSDGLKQRLLALGR 89
>gi|115456383|ref|NP_001051792.1| Os03g0831200 [Oryza sativa Japonica Group]
Length = 136
Score = 87 bits (213), Expect = 4e-015
Identities = 40/77 (51%), Positives = 47/77 (61%)
Frame = +1
Query: 193 DSSDVENGNCSSSGSGGGGGDTKKTCVDCGTNKTPFWRGGPAGPKSLCNACGIKSRKKRQ 372
DSS VE G+ S K C DC T KTP WRGGP+GPKSLCNACGI+ RKKR+
Sbjct: 2 DSSSVEKGSGSIDPDERTASGEPKACTDCHTTKTPLWRGGPSGPKSLCNACGIRYRKKRR 61
Query: 373 AALGIKQGDNNKMKNNK 423
ALG+ G+ + K
Sbjct: 62 EALGLDAGEGGAERQEK 78
>gi|18397703|ref|NP_566290.1| GATA transcription factor 15 [Arabidopsis
thaliana]
Length = 149
Score = 86 bits (210), Expect = 8e-015
Identities = 37/68 (54%), Positives = 48/68 (70%)
Frame = +1
Query: 223 SSSGSGGGGGDTKKTCVDCGTNKTPFWRGGPAGPKSLCNACGIKSRKKRQAALGIKQGDN 402
SSS S + KK+C CGT+KTP WRGGPAGPKSLCNACGI++RKKR+ + + D
Sbjct: 28 SSSSSNEAISNEKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTLISNRSEDK 87
Query: 403 NKMKNNKS 426
K +N++
Sbjct: 88 KKKSHNRN 95
>gi|21536761|gb|AAM61093.1| unknown [Arabidopsis thaliana]
Length = 136
Score = 86 bits (210), Expect = 8e-015
Identities = 37/68 (54%), Positives = 48/68 (70%)
Frame = +1
Query: 223 SSSGSGGGGGDTKKTCVDCGTNKTPFWRGGPAGPKSLCNACGIKSRKKRQAALGIKQGDN 402
SSS S + KK+C CGT+KTP WRGGPAGPKSLCNACGI++RKKR+ + + D
Sbjct: 15 SSSSSNEAISNEKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTLISNRSEDK 74
Query: 403 NKMKNNKS 426
K +N++
Sbjct: 75 KKKSHNRN 82
>gi|7549639|gb|AAF63824.1| hypothetical protein [Arabidopsis thaliana]
Length = 136
Score = 86 bits (210), Expect = 8e-015
Identities = 37/68 (54%), Positives = 48/68 (70%)
Frame = +1
Query: 223 SSSGSGGGGGDTKKTCVDCGTNKTPFWRGGPAGPKSLCNACGIKSRKKRQAALGIKQGDN 402
SSS S + KK+C CGT+KTP WRGGPAGPKSLCNACGI++RKKR+ + + D
Sbjct: 15 SSSSSNEAISNEKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTLISNRSEDK 74
Query: 403 NKMKNNKS 426
K +N++
Sbjct: 75 KKKSHNRN 82
>gi|15239847|ref|NP_199741.1| GATA transcription factor 16 [Arabidopsis
thaliana]
Length = 139
Score = 84 bits (207), Expect = 2e-014
Identities = 37/58 (63%), Positives = 43/58 (74%), Gaps = 4/58 (6%)
Frame = +1
Query: 253 DTKKTCVDCGTNKTPFWRGGPAGPKSLCNACGIKSRKKRQAALGIKQGDNNKMKNNKS 426
D KKTC DCGT+KTP WRGGP GPKSLCNACGI++RKKR+ DN K+K + S
Sbjct: 33 DKKKTCADCGTSKTPLWRGGPVGPKSLCNACGIRNRKKRRGG----TEDNKKLKKSSS 86
>gi|297795681|ref|XP_002865725.1| hypothetical protein ARALYDRAFT_917909
[Arabidopsis lyrata subsp. lyrata]
Length = 111
Score = 84 bits (206), Expect = 2e-014
Identities = 37/56 (66%), Positives = 43/56 (76%), Gaps = 5/56 (8%)
Frame = +1
Query: 259 KKTCVDCGTNKTPFWRGGPAGPKSLCNACGIKSRKKRQAALGIKQGDNNKMKNNKS 426
KKTC DCGT+KTP WRGGPAGPKSLCNACGI++RKKR+ DN K+K + S
Sbjct: 8 KKTCADCGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRGT-----EDNKKLKKSSS 58
>gi|225437491|ref|XP_002269588.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 138
Score = 80 bits (197), Expect = 3e-013
Identities = 34/59 (57%), Positives = 46/59 (77%), Gaps = 2/59 (3%)
Frame = +1
Query: 259 KKTCVDCGTNKTPFWRGGPAGPKSLCNACGIKSRKKRQAALGIK--QGDNNKMKNNKSN 429
K++C DC T +TP WRGGPAGP+SLCNACGI+ RK+R A LG+ +G+ NK K N+++
Sbjct: 26 KRSCADCHTTRTPLWRGGPAGPRSLCNACGIRYRKQRSALLGLATGRGEKNKKKINRTS 84
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,061,447,628,928
Number of Sequences: 15229318
Number of Extensions: 3061447628928
Number of Successful Extensions: 717782310
Number of sequences better than 0.0: 0