BLASTX 7.6.2
Query= UN09672 /QuerySize=864
(863 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|297800552|ref|XP_002868160.1| hypothetical protein ARALYDRAFT... 267 2e-069
gi|15228899|ref|NP_188312.1| GATA transcription factor 17 [Arabi... 215 9e-054
gi|240255906|ref|NP_680707.4| GATA type zinc finger transcriptio... 172 7e-041
gi|297834584|ref|XP_002885174.1| hypothetical protein ARALYDRAFT... 158 1e-036
gi|326497045|dbj|BAK02107.1| predicted protein [Hordeum vulgare ... 103 4e-020
gi|255633610|gb|ACU17164.1| unknown [Glycine max] 100 3e-019
gi|225431869|ref|XP_002275498.1| PREDICTED: hypothetical protein... 99 6e-019
gi|224123912|ref|XP_002330240.1| predicted protein [Populus tric... 96 8e-018
gi|297829216|ref|XP_002882490.1| hypothetical protein ARALYDRAFT... 94 2e-017
gi|255556286|ref|XP_002519177.1| GATA transcription factor, puta... 92 9e-017
gi|18397703|ref|NP_566290.1| GATA transcription factor 15 [Arabi... 89 8e-016
gi|21536761|gb|AAM61093.1| unknown [Arabidopsis thaliana] 89 8e-016
gi|7549639|gb|AAF63824.1| hypothetical protein [Arabidopsis thal... 89 8e-016
gi|224130312|ref|XP_002328578.1| predicted protein [Populus tric... 89 1e-015
gi|224110254|ref|XP_002315462.1| predicted protein [Populus tric... 87 4e-015
gi|15239847|ref|NP_199741.1| GATA transcription factor 16 [Arabi... 86 5e-015
gi|297795681|ref|XP_002865725.1| hypothetical protein ARALYDRAFT... 85 1e-014
gi|115456383|ref|NP_001051792.1| Os03g0831200 [Oryza sativa Japo... 84 3e-014
gi|326502532|dbj|BAJ95329.1| predicted protein [Hordeum vulgare ... 83 4e-014
>gi|297800552|ref|XP_002868160.1| hypothetical protein ARALYDRAFT_329901
[Arabidopsis lyrata subsp. lyrata]
Length = 176
Score = 267 bits (681), Expect = 2e-069
Identities = 138/179 (77%), Positives = 155/179 (86%), Gaps = 10/179 (5%)
Frame = -3
Query: 702 EESKTTKLESAGDSSDVENGNCSSSGSGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSL 523
EE+KTTKLESAGDSSDV+NGNCSSSGS GGDTKKTCVDCGT++TPLWRGGPAGPKSL
Sbjct: 5 EETKTTKLESAGDSSDVDNGNCSSSGS----GGDTKKTCVDCGTSRTPLWRGGPAGPKSL 60
Query: 522 CNACGIKSRKKRQAALGIKQEDNKMKNNKSNDSCLAPDNQTVKNGKGDSGSVKNKIKKTE 343
CNACGIKSRKKRQAALGI+QEDNKMKN +N+ L +N+TVK GKG+ G+VKNKI KT+
Sbjct: 61 CNACGIKSRKKRQAALGIRQEDNKMKNKCNNN--LNLENRTVKIGKGEPGNVKNKI-KTD 117
Query: 342 SED---CNDKKSVKRGGRFLDLGFKVPVMKRSVVEKKRVWMKLGEEERAAVLLMALSCG 175
E+ N+ K+VK+ GRFLD GFKVP MKRS VEKKR+W KLGEEERAAVLLMALSCG
Sbjct: 118 PENFSSSNNNKNVKKVGRFLDFGFKVPAMKRSAVEKKRLWRKLGEEERAAVLLMALSCG 176
>gi|15228899|ref|NP_188312.1| GATA transcription factor 17 [Arabidopsis
thaliana]
Length = 190
Score = 215 bits (546), Expect = 9e-054
Identities = 120/188 (63%), Positives = 137/188 (72%), Gaps = 12/188 (6%)
Frame = -3
Query: 714 MSMTEESKTTKLESAGDSSDVENGNCSSSGSGGG-GGGDTKKTCVDCGTNKTPLWRGGPA 538
MS E TKL+SAG+ SDV+N NCSSSGSGGG GDTK+TCVDCGT +TPLWRGGPA
Sbjct: 1 MSEGSEDTKTKLDSAGELSDVDNENCSSSGSGGGSSSGDTKRTCVDCGTIRTPLWRGGPA 60
Query: 537 GPKSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSNDSCLAPDNQTVKNGKGDSGSVKNK 358
GPKSLCNACGIKSRKKRQAALG++ E+ K KN KSN C N +N K ++ +
Sbjct: 61 GPKSLCNACGIKSRKKRQAALGMRSEEKK-KNRKSN--CNNDLNLDHRNAKKYKINIVDD 117
Query: 357 IKKTESED---CNDKKSV-----KRGGRFLDLGFKVPVMKRSVVEKKRVWMKLGEEERAA 202
K +D CN+K+S K +FLDLGFKVPVMKRS VEKKR+W KLGEEERAA
Sbjct: 118 GKIDIDDDPKICNNKRSSSSSSNKGVSKFLDLGFKVPVMKRSAVEKKRLWRKLGEEERAA 177
Query: 201 VLLMALSC 178
VLLMALSC
Sbjct: 178 VLLMALSC 185
>gi|240255906|ref|NP_680707.4| GATA type zinc finger transcription factor family
protein [Arabidopsis thaliana]
Length = 197
Score = 172 bits (435), Expect = 7e-041
Identities = 89/138 (64%), Positives = 107/138 (77%), Gaps = 12/138 (8%)
Frame = -3
Query: 702 EESKTTKLESAGDSSDVENGNCSSSGSGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSL 523
EE+KTTKLESAGDSSDV+NGNCSSSGS GGDTKKTCVDCGT++TPLWRGGPAGPKSL
Sbjct: 5 EETKTTKLESAGDSSDVDNGNCSSSGS----GGDTKKTCVDCGTSRTPLWRGGPAGPKSL 60
Query: 522 CNACGIKSRKKRQAALGIKQEDNKMKNNKSNDSCLAPDNQTVKNGKGDSGSVK------N 361
CNACGIKSRKKRQAALGI+Q+D K+K+ +N+ L +++ VK GKG+ +VK
Sbjct: 61 CNACGIKSRKKRQAALGIRQDDIKIKSKSNNN--LGLESRNVKTGKGEPVNVKIAKCEPG 118
Query: 360 KIKKTESEDCNDKKSVKR 307
+K + E N K +KR
Sbjct: 119 IVKIAKGEPGNVKNKIKR 136
Score = 111 bits (276), Expect = 2e-022
Identities = 64/115 (55%), Positives = 75/115 (65%), Gaps = 4/115 (3%)
Frame = -3
Query: 507 IKSRKKRQAALGIKQEDNKM-KNNKSNDSCLAPDNQTVKNGKGDSGSVKNKIKK---TES 340
IK + K LG++ + K K N + VK KG+ G+VKNKIK+ S
Sbjct: 83 IKIKSKSNNNLGLESRNVKTGKGEPVNVKIAKCEPGIVKIAKGEPGNVKNKIKRDPENSS 142
Query: 339 EDCNDKKSVKRGGRFLDLGFKVPVMKRSVVEKKRVWMKLGEEERAAVLLMALSCG 175
N+KK+VKR GRFLD GFKVP MKRS VEKKR+W KLGEEERAAVLLMALSCG
Sbjct: 143 SSNNNKKNVKRVGRFLDFGFKVPAMKRSAVEKKRLWRKLGEEERAAVLLMALSCG 197
>gi|297834584|ref|XP_002885174.1| hypothetical protein ARALYDRAFT_479155
[Arabidopsis lyrata subsp. lyrata]
Length = 175
Score = 158 bits (399), Expect = 1e-036
Identities = 82/133 (61%), Positives = 99/133 (74%), Gaps = 4/133 (3%)
Frame = -3
Query: 714 MSMTEESKTTKLESAGDSSDVENGNCSSSGSGGGGGGDTKKTCVDCGTNKTPLWRGGPAG 535
MS E TK++SAG+ SDV+N NCSSSGSGGG GDTK+TCVDCGT +TPLWRGGPAG
Sbjct: 1 MSEGSEETKTKVDSAGELSDVDNENCSSSGSGGGSSGDTKRTCVDCGTIRTPLWRGGPAG 60
Query: 534 PKSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSNDSCLAPDNQTVKNGK--GDSGSVKN 361
PKSLCNACGIKSRKKRQAALG++ E+ K KN KS+ + L D++ KN K D + +
Sbjct: 61 PKSLCNACGIKSRKKRQAALGMRSEEKK-KNRKSSGNDLNLDHRNAKNDKINKDDDAKND 119
Query: 360 KIKKTESEDCNDK 322
KI K + + NDK
Sbjct: 120 KINK-DDDAKNDK 131
>gi|326497045|dbj|BAK02107.1| predicted protein [Hordeum vulgare subsp.
vulgare]
Length = 162
Score = 103 bits (256), Expect = 4e-020
Identities = 61/161 (37%), Positives = 86/161 (53%), Gaps = 6/161 (3%)
Frame = -3
Query: 663 SSDVENGNCSSSGSGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQ 484
SS VE G+ S G K C C T KTPLWRGGP+GP SLCNACGI+ RKKR+
Sbjct: 2 SSSVEKGSGSLDPGERPASGSQPKACTACNTTKTPLWRGGPSGPMSLCNACGIRYRKKRR 61
Query: 483 AALGIKQEDNKMKNNKSNDSCLAPDNQTVKNGKGDSGSVKNKIKKTESEDCNDKKSVKRG 304
ALG+ D K + + A G+S + ++ + + K+ +
Sbjct: 62 EALGL---DEPPKKRQPAAAASAAAAAACSEAGGESAEPDQQQQQPKKKTTTTKRGREVE 118
Query: 303 GRFLDLGFKVPVMKRSVVEKKRVWMKLGEEERAAVLLMALS 181
R + G +V + +R + ++R +LGEEE+AA+LLMALS
Sbjct: 119 LRVVGFGKEVVLKQRRRMRRRR---RLGEEEKAAILLMALS 156
>gi|255633610|gb|ACU17164.1| unknown [Glycine max]
Length = 130
Score = 100 bits (248), Expect = 3e-019
Identities = 46/83 (55%), Positives = 57/83 (68%), Gaps = 6/83 (7%)
Frame = -3
Query: 681 LESAGDSSDVE------NGNCSSSGSGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSLC 520
++ G S++E N N SSG+ + KKTC DCGT KTPLWRGGPAGPKSLC
Sbjct: 2 VDPTGKGSEIEVEDSNSNPNAPSSGNSPSSNNEQKKTCADCGTTKTPLWRGGPAGPKSLC 61
Query: 519 NACGIKSRKKRQAALGIKQEDNK 451
NACGI+SRKK++A LGI + N+
Sbjct: 62 NACGIRSRKKKRAILGINKGSNE 84
>gi|225431869|ref|XP_002275498.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 153
Score = 99 bits (246), Expect = 6e-019
Identities = 57/132 (43%), Positives = 74/132 (56%), Gaps = 7/132 (5%)
Frame = -3
Query: 678 ESAGDSSDVENGNCSSSGSGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKS 499
E +S D+ N N + S + KKTC DCGT KTPLWRGGPAGPKSLCNACGI+S
Sbjct: 6 EKGSESEDMNNKNPDAVSSAESQVNEPKKTCADCGTTKTPLWRGGPAGPKSLCNACGIRS 65
Query: 498 RKKRQAALGIKQ--EDNKMKNNKSNDSCLAPDNQTVKNGKGDSG-SVKNKIKKTESEDCN 328
RKKR+A LG+ + D++ SN S N NG G S+K ++ E
Sbjct: 66 RKKRRAFLGLNKGSTDDRKAKRSSNHS----HNNGGGNGNNKLGDSLKRRLFALGREVLL 121
Query: 327 DKKSVKRGGRFL 292
+ +V++ R L
Sbjct: 122 QRSTVEKQRRKL 133
>gi|224123912|ref|XP_002330240.1| predicted protein [Populus trichocarpa]
Length = 161
Score = 96 bits (236), Expect = 8e-018
Identities = 60/149 (40%), Positives = 77/149 (51%), Gaps = 6/149 (4%)
Frame = -3
Query: 624 SGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQEDNKMK 445
S G G + KK C DC T KTPLWRGGPAGPKSLCNACGI+ RKKR K + K +
Sbjct: 14 SSGSGDIEGKKACTDCKTTKTPLWRGGPAGPKSLCNACGIRYRKKRSVMRLEKGPEKKRE 73
Query: 444 NNKSNDSCLAPDNQTVKNGKGDSGSVKNKIKKTESEDCNDKKSVKRGGRFLDLGFKVPVM 265
++++ A D T+ + + SE V L + V+
Sbjct: 74 KTTTSNTTTATDISTITTATTTNTAQVVSGNGLISESLRMSLMVLGEEMMLQ---RPSVV 130
Query: 264 KRSVVEKKRVWMKLGEEERAAVLLMALSC 178
K+ ++KR KL EEE+AA LMALSC
Sbjct: 131 KKQRCQRKR---KLREEEQAAFSLMALSC 156
>gi|297829216|ref|XP_002882490.1| hypothetical protein ARALYDRAFT_477989
[Arabidopsis lyrata subsp. lyrata]
Length = 137
Score = 94 bits (233), Expect = 2e-017
Identities = 42/69 (60%), Positives = 51/69 (73%)
Frame = -3
Query: 636 SSSGSGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQED 457
SSS S G + KK+C CGT+KTPLWRGGPAGPKSLCNACGI++RKKR+ + + ED
Sbjct: 15 SSSSSSNEGISNEKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTLISNRSED 74
Query: 456 NKMKNNKSN 430
K KN+ N
Sbjct: 75 KKNKNHNRN 83
>gi|255556286|ref|XP_002519177.1| GATA transcription factor, putative [Ricinus
communis]
Length = 149
Score = 92 bits (227), Expect = 9e-017
Identities = 49/117 (41%), Positives = 66/117 (56%), Gaps = 6/117 (5%)
Frame = -3
Query: 630 SGSGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQ---- 463
S G KK+C DCGT KTPLWRGGPAGPKSLCNACGI+SRKK++ +LG+ +
Sbjct: 15 SSKSAEGENQQKKSCADCGTTKTPLWRGGPAGPKSLCNACGIRSRKKKRDSLGLNRASSN 74
Query: 462 EDNKMKNNKSNDSCLAPDNQTVKNGKGDSGSVKNKIKKTESEDCNDKKSVKRGGRFL 292
D K + + S++ N N GD +K ++ E + SV++ R L
Sbjct: 75 PDKKSRKHSSSNGSSNNHNSNNSNRLGD--GLKQRLLALGREVLMQRSSVEKQRRKL 129
>gi|18397703|ref|NP_566290.1| GATA transcription factor 15 [Arabidopsis
thaliana]
Length = 149
Score = 89 bits (219), Expect = 8e-016
Identities = 39/68 (57%), Positives = 49/68 (72%)
Frame = -3
Query: 633 SSGSGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQEDN 454
SS S + KK+C CGT+KTPLWRGGPAGPKSLCNACGI++RKKR+ + + ED
Sbjct: 28 SSSSSNEAISNEKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTLISNRSEDK 87
Query: 453 KMKNNKSN 430
K K++ N
Sbjct: 88 KKKSHNRN 95
>gi|21536761|gb|AAM61093.1| unknown [Arabidopsis thaliana]
Length = 136
Score = 89 bits (219), Expect = 8e-016
Identities = 39/68 (57%), Positives = 49/68 (72%)
Frame = -3
Query: 633 SSGSGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQEDN 454
SS S + KK+C CGT+KTPLWRGGPAGPKSLCNACGI++RKKR+ + + ED
Sbjct: 15 SSSSSNEAISNEKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTLISNRSEDK 74
Query: 453 KMKNNKSN 430
K K++ N
Sbjct: 75 KKKSHNRN 82
>gi|7549639|gb|AAF63824.1| hypothetical protein [Arabidopsis thaliana]
Length = 136
Score = 89 bits (219), Expect = 8e-016
Identities = 39/68 (57%), Positives = 49/68 (72%)
Frame = -3
Query: 633 SSGSGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQEDN 454
SS S + KK+C CGT+KTPLWRGGPAGPKSLCNACGI++RKKR+ + + ED
Sbjct: 15 SSSSSNEAISNEKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTLISNRSEDK 74
Query: 453 KMKNNKSN 430
K K++ N
Sbjct: 75 KKKSHNRN 82
>gi|224130312|ref|XP_002328578.1| predicted protein [Populus trichocarpa]
Length = 125
Score = 89 bits (218), Expect = 1e-015
Identities = 41/74 (55%), Positives = 54/74 (72%), Gaps = 2/74 (2%)
Frame = -3
Query: 597 KKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQ--EDNKMKNNKSNDS 424
KKTC DCGT+KTPLWRGGPAGPKSLCNACGI+SRKK++ LG+ + ++K SN++
Sbjct: 13 KKTCADCGTSKTPLWRGGPAGPKSLCNACGIRSRKKKRDILGLNKGAANDKRAKKGSNNN 72
Query: 423 CLAPDNQTVKNGKG 382
+ +N + G G
Sbjct: 73 GSSNNNNNKQLGDG 86
>gi|224110254|ref|XP_002315462.1| predicted protein [Populus trichocarpa]
Length = 125
Score = 87 bits (213), Expect = 4e-015
Identities = 38/62 (61%), Positives = 47/62 (75%), Gaps = 5/62 (8%)
Frame = -3
Query: 597 KKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQ-----EDNKMKNNKS 433
KKTC DCGT+KTPLWRGGPAGPKSLCNACGI+SRKK++ LG+ + D + K +
Sbjct: 13 KKTCADCGTSKTPLWRGGPAGPKSLCNACGIRSRKKKRDILGLNKGGAAANDKRAKKGST 72
Query: 432 ND 427
N+
Sbjct: 73 NN 74
>gi|15239847|ref|NP_199741.1| GATA transcription factor 16 [Arabidopsis
thaliana]
Length = 139
Score = 86 bits (212), Expect = 5e-015
Identities = 42/95 (44%), Positives = 56/95 (58%), Gaps = 4/95 (4%)
Frame = -3
Query: 603 DTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSNDS 424
D KKTC DCGT+KTPLWRGGP GPKSLCNACGI++RKKR+ EDNK S+
Sbjct: 33 DKKKTCADCGTSKTPLWRGGPVGPKSLCNACGIRNRKKRRGG----TEDNKKLKKSSSGG 88
Query: 423 CLAPDNQTVKNGKGDSGSVKNKIKKTESEDCNDKK 319
+++K D G K + + + +++
Sbjct: 89 GNRKFGESLKQSLMDLGIRKRSTVEKQRQKLGEEE 123
>gi|297795681|ref|XP_002865725.1| hypothetical protein ARALYDRAFT_917909
[Arabidopsis lyrata subsp. lyrata]
Length = 111
Score = 85 bits (209), Expect = 1e-014
Identities = 42/78 (53%), Positives = 50/78 (64%), Gaps = 5/78 (6%)
Frame = -3
Query: 597 KKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSNDSCL 418
KKTC DCGT+KTPLWRGGPAGPKSLCNACGI++RKKR+ EDNK S+
Sbjct: 8 KKTCADCGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRGT-----EDNKKLKKSSSGGGN 62
Query: 417 APDNQTVKNGKGDSGSVK 364
+++K D G K
Sbjct: 63 PKLGESLKQRLMDFGITK 80
>gi|115456383|ref|NP_001051792.1| Os03g0831200 [Oryza sativa Japonica Group]
Length = 136
Score = 84 bits (205), Expect = 3e-014
Identities = 40/66 (60%), Positives = 46/66 (69%), Gaps = 1/66 (1%)
Frame = -3
Query: 666 DSSDVENGNCSSSGSGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKR 487
DSS VE G+ S G+ K C DC T KTPLWRGGP+GPKSLCNACGI+ RKKR
Sbjct: 2 DSSSVEKGSGSIDPDERTASGE-PKACTDCHTTKTPLWRGGPSGPKSLCNACGIRYRKKR 60
Query: 486 QAALGI 469
+ ALG+
Sbjct: 61 REALGL 66
>gi|326502532|dbj|BAJ95329.1| predicted protein [Hordeum vulgare subsp.
vulgare]
Length = 181
Score = 83 bits (204), Expect = 4e-014
Identities = 47/126 (37%), Positives = 66/126 (52%), Gaps = 14/126 (11%)
Frame = -3
Query: 693 KTTKLESAGDSSDVENGNCSSSGSGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSLCNA 514
K +E ++ + +C++SG+G K+C DC T KTPLWRGGP GPKSLCNA
Sbjct: 14 KMKMIEVEVQAAAADPDDCTASGAG------DPKSCADCNTTKTPLWRGGPNGPKSLCNA 67
Query: 513 CGIKSRKKRQAALGIKQE-DNKMKNNKSNDSCLA-------PDNQTVKNGKGDSGSVKNK 358
CGI+ RK+R+ A+G+ E K K + + +S A + K D NK
Sbjct: 68 CGIRYRKRRRVAMGLDPEAKRKPKRDDAINSAAAAAEASTQQQEEVTKPTDDDKAVSTNK 127
Query: 357 IKKTES 340
KT +
Sbjct: 128 TTKTHT 133
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,183,829,113,609
Number of Sequences: 15229318
Number of Extensions: 1183829113609
Number of Successful Extensions: 337516149
Number of sequences better than 0.0: 0