BLASTX 7.6.2
Query= UN38631 /QuerySize=811
(810 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|15239847|ref|NP_199741.1| GATA transcription factor 16 [Arabi... 175 9e-042
gi|297795681|ref|XP_002865725.1| hypothetical protein ARALYDRAFT... 162 6e-038
gi|18397703|ref|NP_566290.1| GATA transcription factor 15 [Arabi... 144 2e-032
gi|7549639|gb|AAF63824.1| hypothetical protein [Arabidopsis thal... 144 2e-032
gi|21536761|gb|AAM61093.1| unknown [Arabidopsis thaliana] 143 5e-032
gi|225431869|ref|XP_002275498.1| PREDICTED: hypothetical protein... 142 7e-032
gi|297829216|ref|XP_002882490.1| hypothetical protein ARALYDRAFT... 141 1e-031
gi|224130312|ref|XP_002328578.1| predicted protein [Populus tric... 138 1e-030
gi|255556286|ref|XP_002519177.1| GATA transcription factor, puta... 136 5e-030
gi|224110254|ref|XP_002315462.1| predicted protein [Populus tric... 132 9e-029
gi|147814791|emb|CAN74414.1| hypothetical protein VITISV_042395 ... 115 1e-023
gi|225450647|ref|XP_002278369.1| PREDICTED: hypothetical protein... 115 1e-023
gi|115456383|ref|NP_001051792.1| Os03g0831200 [Oryza sativa Japo... 104 2e-020
gi|255633610|gb|ACU17164.1| unknown [Glycine max] 87 2e-015
gi|240255906|ref|NP_680707.4| GATA type zinc finger transcriptio... 87 3e-015
gi|297800552|ref|XP_002868160.1| hypothetical protein ARALYDRAFT... 86 8e-015
gi|15228899|ref|NP_188312.1| GATA transcription factor 17 [Arabi... 84 2e-014
gi|297834584|ref|XP_002885174.1| hypothetical protein ARALYDRAFT... 84 3e-014
gi|326502532|dbj|BAJ95329.1| predicted protein [Hordeum vulgare ... 78 1e-012
gi|297598423|ref|NP_001045570.2| Os01g0976800 [Oryza sativa Japo... 77 2e-012
>gi|15239847|ref|NP_199741.1| GATA transcription factor 16 [Arabidopsis
thaliana]
Length = 139
Score = 175 bits (442), Expect = 9e-042
Identities = 98/140 (70%), Positives = 107/140 (76%), Gaps = 13/140 (9%)
Frame = -2
Query: 626 MLDHCAK----DSK-RRRGGEDVIEQNEACSND-KKTCADCGASKTPLWRGGPAGPKSLC 465
MLDH K DS+ + ED+IEQN ND KKTCADCG SKTPLWRGGP GPKSLC
Sbjct: 1 MLDHSEKVLLVDSETMKTRAEDMIEQNNTSVNDKKKTCADCGTSKTPLWRGGPVGPKSLC 60
Query: 464 NACGIRNRKKRRGGGEDKKQPKKSNSGGGGDLKRNPKFGESMKQRMMDLGMTKRSSSTVE 285
NACGIRNRKKRRGG ED K+ KKS+SGGG N KFGES+KQ +MDLG+ KR STVE
Sbjct: 61 NACGIRNRKKRRGGTEDNKKLKKSSSGGG-----NRKFGESLKQSLMDLGIRKR--STVE 113
Query: 284 KQLRKLGEEE*AAVLLMALS 225
KQ +KLGEEE AAVLLMALS
Sbjct: 114 KQRQKLGEEEQAAVLLMALS 133
>gi|297795681|ref|XP_002865725.1| hypothetical protein ARALYDRAFT_917909
[Arabidopsis lyrata subsp. lyrata]
Length = 111
Score = 162 bits (409), Expect = 6e-038
Identities = 85/108 (78%), Positives = 91/108 (84%), Gaps = 8/108 (7%)
Frame = -2
Query: 551 SNDKKTCADCGASKTPLWRGGPAGPKSLCNACGIRNRKKRRGGGEDKKQPKKSNSGGGGD 372
+ DKKTCADCG SKTPLWRGGPAGPKSLCNACGIRNRKKRR G ED K+ KKS+SGGG
Sbjct: 5 AQDKKTCADCGTSKTPLWRGGPAGPKSLCNACGIRNRKKRR-GTEDNKKLKKSSSGGG-- 61
Query: 371 LKRNPKFGESMKQRMMDLGMTKRSSSTVEKQLRKLGEEE*AAVLLMAL 228
NPK GES+KQR+MD G+TKR STVEKQ RKLGEEE AAVLLMAL
Sbjct: 62 ---NPKLGESLKQRLMDFGITKR--STVEKQRRKLGEEEQAAVLLMAL 104
>gi|18397703|ref|NP_566290.1| GATA transcription factor 15 [Arabidopsis
thaliana]
Length = 149
Score = 144 bits (361), Expect = 2e-032
Identities = 78/118 (66%), Positives = 87/118 (73%), Gaps = 7/118 (5%)
Frame = -2
Query: 563 NEACSNDKKTCADCGASKTPLWRGGPAGPKSLCNACGIRNRKKRRGGGEDKKQPKKSNSG 384
NEA SN+KK+CA CG SKTPLWRGGPAGPKSLCNACGIRNRKKRR ++ + KK S
Sbjct: 33 NEAISNEKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTLISNRSEDKKKKSH 92
Query: 383 GGGDLKRNPKFGESMKQRMMDLGM-TKRSSSTVEKQLR-KLGEEE*AAVLLMALSYGS 216
RNPKFG+S+KQR+M+LG ST E Q R KLGEEE AAVLLMALSY S
Sbjct: 93 -----NRNPKFGDSLKQRLMELGREVMMQRSTAENQRRNKLGEEEQAAVLLMALSYAS 145
>gi|7549639|gb|AAF63824.1| hypothetical protein [Arabidopsis thaliana]
Length = 136
Score = 144 bits (361), Expect = 2e-032
Identities = 78/118 (66%), Positives = 87/118 (73%), Gaps = 7/118 (5%)
Frame = -2
Query: 563 NEACSNDKKTCADCGASKTPLWRGGPAGPKSLCNACGIRNRKKRRGGGEDKKQPKKSNSG 384
NEA SN+KK+CA CG SKTPLWRGGPAGPKSLCNACGIRNRKKRR ++ + KK S
Sbjct: 20 NEAISNEKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTLISNRSEDKKKKSH 79
Query: 383 GGGDLKRNPKFGESMKQRMMDLGM-TKRSSSTVEKQLR-KLGEEE*AAVLLMALSYGS 216
RNPKFG+S+KQR+M+LG ST E Q R KLGEEE AAVLLMALSY S
Sbjct: 80 -----NRNPKFGDSLKQRLMELGREVMMQRSTAENQRRNKLGEEEQAAVLLMALSYAS 132
>gi|21536761|gb|AAM61093.1| unknown [Arabidopsis thaliana]
Length = 136
Score = 143 bits (358), Expect = 5e-032
Identities = 77/118 (65%), Positives = 87/118 (73%), Gaps = 7/118 (5%)
Frame = -2
Query: 563 NEACSNDKKTCADCGASKTPLWRGGPAGPKSLCNACGIRNRKKRRGGGEDKKQPKKSNSG 384
NEA SN+KK+CA CG SKTPLWRGGPAGPKSLCNACGIRNRKKRR ++ + KK S
Sbjct: 20 NEAISNEKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTLISNRSEDKKKKSH 79
Query: 383 GGGDLKRNPKFGESMKQRMMDLGM-TKRSSSTVEKQLR-KLGEEE*AAVLLMALSYGS 216
RNPKFG+S++QR+M+LG ST E Q R KLGEEE AAVLLMALSY S
Sbjct: 80 -----NRNPKFGDSLRQRLMELGREVMMQRSTAENQRRNKLGEEEQAAVLLMALSYAS 132
>gi|225431869|ref|XP_002275498.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 153
Score = 142 bits (357), Expect = 7e-032
Identities = 81/136 (59%), Positives = 95/136 (69%), Gaps = 14/136 (10%)
Frame = -2
Query: 578 DVIEQNEACSND-KKTCADCGASKTPLWRGGPAGPKSLCNACGIRNRKKRR-------GG 423
D + E+ N+ KKTCADCG +KTPLWRGGPAGPKSLCNACGIR+RKKRR G
Sbjct: 20 DAVSSAESQVNEPKKTCADCGTTKTPLWRGGPAGPKSLCNACGIRSRKKRRAFLGLNKGS 79
Query: 422 GEDKKQPKKSN---SGGGGDLKRNPKFGESMKQRMMDLGM-TKRSSSTVEKQLRKLGEEE 255
+D+K + SN + GGG+ N K G+S+K+R+ LG STVEKQ RKLGEEE
Sbjct: 80 TDDRKAKRSSNHSHNNGGGN--GNNKLGDSLKRRLFALGREVLLQRSTVEKQRRKLGEEE 137
Query: 254 *AAVLLMALSYGSVYA 207
AAVLLMALSYG VYA
Sbjct: 138 QAAVLLMALSYGYVYA 153
>gi|297829216|ref|XP_002882490.1| hypothetical protein ARALYDRAFT_477989
[Arabidopsis lyrata subsp. lyrata]
Length = 137
Score = 141 bits (355), Expect = 1e-031
Identities = 76/118 (64%), Positives = 87/118 (73%), Gaps = 7/118 (5%)
Frame = -2
Query: 563 NEACSNDKKTCADCGASKTPLWRGGPAGPKSLCNACGIRNRKKRRGGGEDKKQPKKSNSG 384
NE SN+KK+CA CG SKTPLWRGGPAGPKSLCNACGIRNRKKRR ++ + KK+ +
Sbjct: 21 NEGISNEKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTLISNRSEDKKNKNH 80
Query: 383 GGGDLKRNPKFGESMKQRMMDLGM-TKRSSSTVEKQLR-KLGEEE*AAVLLMALSYGS 216
RNPKFG+S+KQR+M+LG ST E Q R KLGEEE AAVLLMALSY S
Sbjct: 81 -----NRNPKFGDSLKQRLMELGREVMMQRSTAENQRRKKLGEEEQAAVLLMALSYAS 133
>gi|224130312|ref|XP_002328578.1| predicted protein [Populus trichocarpa]
Length = 125
Score = 138 bits (346), Expect = 1e-030
Identities = 73/119 (61%), Positives = 82/119 (68%), Gaps = 13/119 (10%)
Frame = -2
Query: 542 KKTCADCGASKTPLWRGGPAGPKSLCNACGIRNRKKRR-------GGGEDKKQPKKSNSG 384
KKTCADCG SKTPLWRGGPAGPKSLCNACGIR+RKK+R G DK+ K SN+
Sbjct: 13 KKTCADCGTSKTPLWRGGPAGPKSLCNACGIRSRKKKRDILGLNKGAANDKRAKKGSNNN 72
Query: 383 GGGDLKRNPKFGESMKQRMMDLGMTKRSSSTVEKQLRKLGEEE*AAVLLMALSYGSVYA 207
G + N + G+ KQR++ LG V Q RKLGEEE AAVLLMALSYGSVYA
Sbjct: 73 GSSNNNNNKQLGDGSKQRLLALG------REVLMQRRKLGEEEQAAVLLMALSYGSVYA 125
>gi|255556286|ref|XP_002519177.1| GATA transcription factor, putative [Ricinus
communis]
Length = 149
Score = 136 bits (341), Expect = 5e-030
Identities = 74/130 (56%), Positives = 88/130 (67%), Gaps = 12/130 (9%)
Frame = -2
Query: 560 EACSNDKKTCADCGASKTPLWRGGPAGPKSLCNACGIRNRKKRR--------GGGEDKKQ 405
E + KK+CADCG +KTPLWRGGPAGPKSLCNACGIR+RKK+R DKK
Sbjct: 20 EGENQQKKSCADCGTTKTPLWRGGPAGPKSLCNACGIRSRKKKRDSLGLNRASSNPDKKS 79
Query: 404 PKKSNSGGGG---DLKRNPKFGESMKQRMMDLGM-TKRSSSTVEKQLRKLGEEE*AAVLL 237
K S+S G + + + G+ +KQR++ LG S+VEKQ RKLGEEE AAVLL
Sbjct: 80 RKHSSSNGSSNNHNSNNSNRLGDGLKQRLLALGREVLMQRSSVEKQRRKLGEEEQAAVLL 139
Query: 236 MALSYGSVYA 207
MALSYGSVYA
Sbjct: 140 MALSYGSVYA 149
>gi|224110254|ref|XP_002315462.1| predicted protein [Populus trichocarpa]
Length = 125
Score = 132 bits (330), Expect = 9e-029
Identities = 73/127 (57%), Positives = 85/127 (66%), Gaps = 17/127 (13%)
Frame = -2
Query: 560 EACSNDKKTCADCGASKTPLWRGGPAGPKSLCNACGIRNRKKRR--------GGGEDKKQ 405
E S KKTCADCG SKTPLWRGGPAGPKSLCNACGIR+RKK+R G + K+
Sbjct: 7 ETESPQKKTCADCGTSKTPLWRGGPAGPKSLCNACGIRSRKKKRDILGLNKGGAAANDKR 66
Query: 404 PKKSNSGGGGDLKRNPKFGESMKQRMMDLGM-TKRSSSTVEKQLRKLGEEE*AAVLLMAL 228
KK ++ G + +KQR++ LG STVE++ RKLGEEE AAVLLMAL
Sbjct: 67 AKKGSTNNGS--------SDGLKQRLLALGREVLVQGSTVERRRRKLGEEEQAAVLLMAL 118
Query: 227 SYGSVYA 207
SYGSVYA
Sbjct: 119 SYGSVYA 125
>gi|147814791|emb|CAN74414.1| hypothetical protein VITISV_042395 [Vitis
vinifera]
Length = 125
Score = 115 bits (286), Expect = 1e-023
Identities = 64/118 (54%), Positives = 78/118 (66%), Gaps = 10/118 (8%)
Frame = -2
Query: 563 NEACSNDKKTCADCGASKTPLWRGGPAGPKSLCNACGIRNRKKRRGGGEDKKQPKKSNSG 384
+E + KK C DC +KTPLWRGGPAGPKSLCNACGIR RK+R K+ ++ NSG
Sbjct: 11 SEEMNEIKKCCTDCKTTKTPLWRGGPAGPKSLCNACGIRYRKRRSSMVGVNKKKERMNSG 70
Query: 383 GGGDLKRNPKFGESMKQRMMDLG---MTKRSSSTVEKQLRKLGEEE*AAVLLMALSYG 219
+ E++KQ +M LG M +R S+V+KQ RKLGEEE AAVLLMALS G
Sbjct: 71 -------SHDLSETLKQSLMALGNEVMMQRQRSSVKKQRRKLGEEEQAAVLLMALSCG 121
>gi|225450647|ref|XP_002278369.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 124
Score = 115 bits (286), Expect = 1e-023
Identities = 64/118 (54%), Positives = 78/118 (66%), Gaps = 10/118 (8%)
Frame = -2
Query: 563 NEACSNDKKTCADCGASKTPLWRGGPAGPKSLCNACGIRNRKKRRGGGEDKKQPKKSNSG 384
+E + KK C DC +KTPLWRGGPAGPKSLCNACGIR RK+R K+ ++ NSG
Sbjct: 10 SEEMNEIKKCCTDCKTTKTPLWRGGPAGPKSLCNACGIRYRKRRSSMVGVNKKKERMNSG 69
Query: 383 GGGDLKRNPKFGESMKQRMMDLG---MTKRSSSTVEKQLRKLGEEE*AAVLLMALSYG 219
+ E++KQ +M LG M +R S+V+KQ RKLGEEE AAVLLMALS G
Sbjct: 70 -------SHDLSETLKQSLMALGNEVMMQRQRSSVKKQRRKLGEEEQAAVLLMALSCG 120
>gi|115456383|ref|NP_001051792.1| Os03g0831200 [Oryza sativa Japonica Group]
Length = 136
Score = 104 bits (259), Expect = 2e-020
Identities = 60/141 (42%), Positives = 79/141 (56%), Gaps = 17/141 (12%)
Frame = -2
Query: 602 SKRRRGGEDVIEQNEACSNDKKTCADCGASKTPLWRGGPAGPKSLCNACGIRNRKKRR-- 429
S +G + S + K C DC +KTPLWRGGP+GPKSLCNACGIR RKKRR
Sbjct: 4 SSVEKGSGSIDPDERTASGEPKACTDCHTTKTPLWRGGPSGPKSLCNACGIRYRKKRREA 63
Query: 428 ------GGGEDKKQPKKSNSGGGGDLKRNPKFGESMKQRMMDLGM-TKRSSSTVEKQLRK 270
GG ++++ KKS G ++ +M+ RM+ G ++ R+
Sbjct: 64 LGLDAGEGGAERQEKKKSKRERGEEV--------TMELRMVGFGKEVVLKQRRRMRRRRR 115
Query: 269 LGEEE*AAVLLMALSYGSVYA 207
LGEEE AA+LLMALS G +YA
Sbjct: 116 LGEEEKAAILLMALSSGVIYA 136
>gi|255633610|gb|ACU17164.1| unknown [Glycine max]
Length = 130
Score = 87 bits (215), Expect = 2e-015
Identities = 46/98 (46%), Positives = 62/98 (63%), Gaps = 13/98 (13%)
Frame = -2
Query: 563 NEACSND--KKTCADCGASKTPLWRGGPAGPKSLCNACGIRNRKKRR-------GGGEDK 411
N SN+ KKTCADCG +KTPLWRGGPAGPKSLCNACGIR+RKK+R G ED
Sbjct: 27 NSPSSNNEQKKTCADCGTTKTPLWRGGPAGPKSLCNACGIRSRKKKRAILGINKGSNEDG 86
Query: 410 KQPKKSNSGGGGDLKRN----PKFGESMKQRMMDLGMT 309
++ K++ G ++ + K GE K ++ + ++
Sbjct: 87 RKGKRTGGALGKEVLLHRSHWKKLGEEEKAAVLLMSLS 124
>gi|240255906|ref|NP_680707.4| GATA type zinc finger transcription factor family
protein [Arabidopsis thaliana]
Length = 197
Score = 87 bits (213), Expect = 3e-015
Identities = 48/98 (48%), Positives = 59/98 (60%), Gaps = 11/98 (11%)
Frame = -2
Query: 608 KDSKRRRGGEDVIEQNEACSND------KKTCADCGASKTPLWRGGPAGPKSLCNACGIR 447
K +K G+ N CS+ KKTC DCG S+TPLWRGGPAGPKSLCNACGI+
Sbjct: 8 KTTKLESAGDSSDVDNGNCSSSGSGGDTKKTCVDCGTSRTPLWRGGPAGPKSLCNACGIK 67
Query: 446 NRKKRRGG----GEDKKQPKKSNSGGGGDLKRNPKFGE 345
+RKKR+ +D K KSN+ G + RN K G+
Sbjct: 68 SRKKRQAALGIRQDDIKIKSKSNNNLGLE-SRNVKTGK 104
>gi|297800552|ref|XP_002868160.1| hypothetical protein ARALYDRAFT_329901
[Arabidopsis lyrata subsp. lyrata]
Length = 176
Score = 86 bits (210), Expect = 8e-015
Identities = 43/84 (51%), Positives = 51/84 (60%), Gaps = 10/84 (11%)
Frame = -2
Query: 608 KDSKRRRGGEDVIEQNEACSND------KKTCADCGASKTPLWRGGPAGPKSLCNACGIR 447
K +K G+ N CS+ KKTC DCG S+TPLWRGGPAGPKSLCNACGI+
Sbjct: 8 KTTKLESAGDSSDVDNGNCSSSGSGGDTKKTCVDCGTSRTPLWRGGPAGPKSLCNACGIK 67
Query: 446 NRKKRRGG----GEDKKQPKKSNS 387
+RKKR+ ED K K N+
Sbjct: 68 SRKKRQAALGIRQEDNKMKNKCNN 91
>gi|15228899|ref|NP_188312.1| GATA transcription factor 17 [Arabidopsis
thaliana]
Length = 190
Score = 84 bits (207), Expect = 2e-014
Identities = 41/80 (51%), Positives = 54/80 (67%), Gaps = 6/80 (7%)
Frame = -2
Query: 542 KKTCADCGASKTPLWRGGPAGPKSLCNACGIRNRKKRRGG----GEDKKQPKKSNSGGGG 375
K+TC DCG +TPLWRGGPAGPKSLCNACGI++RKKR+ E+KK+ +KSN
Sbjct: 41 KRTCVDCGTIRTPLWRGGPAGPKSLCNACGIKSRKKRQAALGMRSEEKKKNRKSNC--NN 98
Query: 374 DLKRNPKFGESMKQRMMDLG 315
DL + + + K ++D G
Sbjct: 99 DLNLDHRNAKKYKINIVDDG 118
>gi|297834584|ref|XP_002885174.1| hypothetical protein ARALYDRAFT_479155
[Arabidopsis lyrata subsp. lyrata]
Length = 175
Score = 84 bits (205), Expect = 3e-014
Identities = 39/67 (58%), Positives = 47/67 (70%), Gaps = 4/67 (5%)
Frame = -2
Query: 542 KKTCADCGASKTPLWRGGPAGPKSLCNACGIRNRKKRRGG-GEDKKQPKKSNSGGGGDLK 366
K+TC DCG +TPLWRGGPAGPKSLCNACGI++RKKR+ G ++ KK+ G DL
Sbjct: 40 KRTCVDCGTIRTPLWRGGPAGPKSLCNACGIKSRKKRQAALGMRSEEKKKNRKSSGNDLN 99
Query: 365 ---RNPK 354
RN K
Sbjct: 100 LDHRNAK 106
>gi|326502532|dbj|BAJ95329.1| predicted protein [Hordeum vulgare subsp.
vulgare]
Length = 181
Score = 78 bits (191), Expect = 1e-012
Identities = 35/59 (59%), Positives = 44/59 (74%), Gaps = 4/59 (6%)
Frame = -2
Query: 551 SNDKKTCADCGASKTPLWRGGPAGPKSLCNACGIRNRKKRRGG----GEDKKQPKKSNS 387
+ D K+CADC +KTPLWRGGP GPKSLCNACGIR RK+RR E K++PK+ ++
Sbjct: 37 AGDPKSCADCNTTKTPLWRGGPNGPKSLCNACGIRYRKRRRVAMGLDPEAKRKPKRDDA 95
>gi|297598423|ref|NP_001045570.2| Os01g0976800 [Oryza sativa Japonica Group]
Length = 142
Score = 77 bits (189), Expect = 2e-012
Identities = 44/108 (40%), Positives = 60/108 (55%), Gaps = 6/108 (5%)
Frame = -2
Query: 578 DVIEQNEACSNDKKTCADCGASKTPLWRGGPAGPKSLCNACGIRNRKKRRGG-GEDKKQP 402
D ++ +E N K CADC +KTPLWRGGP GPKSLCNACGIR RK+RR G D
Sbjct: 11 DKVDPDEC--NGSKACADCHTTKTPLWRGGPGGPKSLCNACGIRYRKRRRAALGLDSSAT 68
Query: 401 KKSNSGGGGDLKRNPKFGESMKQRM-MDLGMT--KRSSSTVEKQLRKL 267
+ G K K ++ ++ + M+L + + V KQ R++
Sbjct: 69 ATATDGAEQQKKTKAKKEKAQEEEVTMELHTVGFRSKDAAVFKQRRRM 116
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,281,875,611,404
Number of Sequences: 15229318
Number of Extensions: 4281875611404
Number of Successful Extensions: 1013544274
Number of sequences better than 0.0: 0