BLASTX 7.6.2
Query= UN47756 /QuerySize=823
(822 letters)
Database: GenBank nr
15,229,318 sequences; 5,219,829,378 total letters
Sequences producing significant alignments:                      Score (bits)  E Value
gi|297800552|ref|XP_002868160.1| hypothetical protein ARALYDRAFT... 262 5e-068
gi|15228899|ref|NP_188312.1| GATA transcription factor 17 [Arabi... 214 1e-053
gi|240255906|ref|NP_680707.4| GATA type zinc finger transcriptio... 168 2e-039
gi|297834584|ref|XP_002885174.1| hypothetical protein ARALYDRAFT... 152 7e-035
gi|255633610|gb|ACU17164.1| unknown [Glycine max] 99 5e-019
gi|225431869|ref|XP_002275498.1| PREDICTED: hypothetical protein... 94 2e-017
gi|224123912|ref|XP_002330240.1| predicted protein [Populus tric... 93 5e-017
gi|255556286|ref|XP_002519177.1| GATA transcription factor, puta... 91 1e-016
gi|297829216|ref|XP_002882490.1| hypothetical protein ARALYDRAFT... 91 1e-016
gi|224130312|ref|XP_002328578.1| predicted protein [Populus tric... 89 9e-016
gi|18397703|ref|NP_566290.1| GATA transcription factor 15 [Arabi... 87 3e-015
gi|21536761|gb|AAM61093.1| unknown [Arabidopsis thaliana] 87 3e-015
gi|7549639|gb|AAF63824.1| hypothetical protein [Arabidopsis thal... 87 3e-015
gi|224110254|ref|XP_002315462.1| predicted protein [Populus tric... 87 3e-015
gi|15239847|ref|NP_199741.1| GATA transcription factor 16 [Arabi... 86 5e-015
gi|297795681|ref|XP_002865725.1| hypothetical protein ARALYDRAFT... 85 1e-014
gi|147814791|emb|CAN74414.1| hypothetical protein VITISV_042395 ... 82 7e-014
gi|225450647|ref|XP_002278369.1| PREDICTED: hypothetical protein... 82 7e-014
gi|296089747|emb|CBI39566.3| unnamed protein product [Vitis vini... 81 2e-013
>gi|297800552|ref|XP_002868160.1| hypothetical protein ARALYDRAFT_329901
[Arabidopsis lyrata subsp. lyrata]
Length = 176
Score = 262 bits (669), Expect = 5e-068
Identities = 136/179 (75%), Positives = 153/179 (85%), Gaps = 12/179 (6%)
Frame = +3
Query: 162 SKTTKLESAGDSSDVENGNCSSSGSGGGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSL 341
+KTTKLESAGDSSDV+NGNCSSSGS GGDTKKTCVDCGT++TPLWRGGPAGPKSL
Sbjct: 7 TKTTKLESAGDSSDVDNGNCSSSGS------GGDTKKTCVDCGTSRTPLWRGGPAGPKSL 60
Query: 342 CNACGIKSRKKRQAALGIKQEDNKMKNNKSNDSCLAPDNQTVKNGKGDSGSVKNKIKKTE 521
CNACGIKSRKKRQAALGI+QEDNKMKN +N+ L +N+TVK GKG+ G+VKNKI KT+
Sbjct: 61 CNACGIKSRKKRQAALGIRQEDNKMKNKCNNN--LNLENRTVKIGKGEPGNVKNKI-KTD 117
Query: 522 SED---CNDKKSVKRGGRFLDLGFKVPVMKRSVVEKKRVWMKLGEEERAAVLLMALSCG 689
E+ N+ K+VK+ GRFLD GFKVP MKRS VEKKR+W KLGEEERAAVLLMALSCG
Sbjct: 118 PENFSSSNNNKNVKKVGRFLDFGFKVPAMKRSAVEKKRLWRKLGEEERAAVLLMALSCG 176
>gi|15228899|ref|NP_188312.1| GATA transcription factor 17 [Arabidopsis
thaliana]
Length = 190
Score = 214 bits (544), Expect = 1e-053
Identities = 120/190 (63%), Positives = 139/190 (73%), Gaps = 14/190 (7%)
Frame = +3
Query: 144 MSMTEESKTTKLESAGDSSDVENGNCSSSGSGGGGGGGGDTKKTCVDCGTNKTPLWRGGP 323
MS E TKL+SAG+ SDV+N NCSSSGS GGG GDTK+TCVDCGT +TPLWRGGP
Sbjct: 1 MSEGSEDTKTKLDSAGELSDVDNENCSSSGS-GGGSSSGDTKRTCVDCGTIRTPLWRGGP 59
Query: 324 AGPKSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSN-DSCLAPDNQTVKNGK---GDSG 491
AGPKSLCNACGIKSRKKRQAALG++ E+ K KN KSN ++ L D++ K K D G
Sbjct: 60 AGPKSLCNACGIKSRKKRQAALGMRSEEKK-KNRKSNCNNDLNLDHRNAKKYKINIVDDG 118
Query: 492 SVKNKIKKTESEDCNDKKSV-----KRGGRFLDLGFKVPVMKRSVVEKKRVWMKLGEEER 656
+ + + CN+K+S K +FLDLGFKVPVMKRS VEKKR+W KLGEEER
Sbjct: 119 KID---IDDDPKICNNKRSSSSSSNKGVSKFLDLGFKVPVMKRSAVEKKRLWRKLGEEER 175
Query: 657 AAVLLMALSC 686
AAVLLMALSC
Sbjct: 176 AAVLLMALSC 185
>gi|240255906|ref|NP_680707.4| GATA type zinc finger transcription factor family
protein [Arabidopsis thaliana]
Length = 197
Score = 168 bits (423), Expect = 2e-039
Identities = 87/138 (63%), Positives = 105/138 (76%), Gaps = 14/138 (10%)
Frame = +3
Query: 162 SKTTKLESAGDSSDVENGNCSSSGSGGGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSL 341
+KTTKLESAGDSSDV+NGNCSSSGS GGDTKKTCVDCGT++TPLWRGGPAGPKSL
Sbjct: 7 TKTTKLESAGDSSDVDNGNCSSSGS------GGDTKKTCVDCGTSRTPLWRGGPAGPKSL 60
Query: 342 CNACGIKSRKKRQAALGIKQEDNKMKNNKSNDSCLAPDNQTVKNGKGDSGSVK------N 503
CNACGIKSRKKRQAALGI+Q+D K+K+ +N+ L +++ VK GKG+ +VK
Sbjct: 61 CNACGIKSRKKRQAALGIRQDDIKIKSKSNNN--LGLESRNVKTGKGEPVNVKIAKCEPG 118
Query: 504 KIKKTESEDCNDKKSVKR 557
+K + E N K +KR
Sbjct: 119 IVKIAKGEPGNVKNKIKR 136
Score = 111 bits (276), Expect = 2e-022
Identities = 64/115 (55%), Positives = 75/115 (65%), Gaps = 4/115 (3%)
Frame = +3
Query: 357 IKSRKKRQAALGIKQEDNKM-KNNKSNDSCLAPDNQTVKNGKGDSGSVKNKIKK---TES 524
IK + K LG++ + K K N + VK KG+ G+VKNKIK+ S
Sbjct: 83 IKIKSKSNNNLGLESRNVKTGKGEPVNVKIAKCEPGIVKIAKGEPGNVKNKIKRDPENSS 142
Query: 525 EDCNDKKSVKRGGRFLDLGFKVPVMKRSVVEKKRVWMKLGEEERAAVLLMALSCG 689
N+KK+VKR GRFLD GFKVP MKRS VEKKR+W KLGEEERAAVLLMALSCG
Sbjct: 143 SSNNNKKNVKRVGRFLDFGFKVPAMKRSAVEKKRLWRKLGEEERAAVLLMALSCG 197
>gi|297834584|ref|XP_002885174.1| hypothetical protein ARALYDRAFT_479155
[Arabidopsis lyrata subsp. lyrata]
Length = 175
Score = 152 bits (383), Expect = 7e-035
Identities = 82/132 (62%), Positives = 101/132 (76%), Gaps = 7/132 (5%)
Frame = +3
Query: 153 TEESKTTKLESAGDSSDVENGNCSSSGSGGGGGGGGDTKKTCVDCGTNKTPLWRGGPAGP 332
+EE+K TK++SAG+ SDV+N NCSSSGS GGG GDTK+TCVDCGT +TPLWRGGPAGP
Sbjct: 5 SEETK-TKVDSAGELSDVDNENCSSSGS--GGGSSGDTKRTCVDCGTIRTPLWRGGPAGP 61
Query: 333 KSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSNDSCLAPDNQTVKNGK--GDSGSVKNK 506
KSLCNACGIKSRKKRQAALG++ E+ K KN KS+ + L D++ KN K D + +K
Sbjct: 62 KSLCNACGIKSRKKRQAALGMRSEEKK-KNRKSSGNDLNLDHRNAKNDKINKDDDAKNDK 120
Query: 507 IKKTESEDCNDK 542
I K + + NDK
Sbjct: 121 INK-DDDAKNDK 131
>gi|255633610|gb|ACU17164.1| unknown [Glycine max]
Length = 130
Score = 99 bits (246), Expect = 5e-019
Identities = 45/83 (54%), Positives = 57/83 (68%), Gaps = 4/83 (4%)
Frame = +3
Query: 177 LESAGDSSDVE----NGNCSSSGSGGGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSLC 344
++ G S++E N N ++ SG + KKTC DCGT KTPLWRGGPAGPKSLC
Sbjct: 2 VDPTGKGSEIEVEDSNSNPNAPSSGNSPSSNNEQKKTCADCGTTKTPLWRGGPAGPKSLC 61
Query: 345 NACGIKSRKKRQAALGIKQEDNK 413
NACGI+SRKK++A LGI + N+
Sbjct: 62 NACGIRSRKKKRAILGINKGSNE 84
>gi|225431869|ref|XP_002275498.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 153
Score = 94 bits (233), Expect = 2e-017
Identities = 57/134 (42%), Positives = 74/134 (55%), Gaps = 9/134 (6%)
Frame = +3
Query: 180 ESAGDSSDVENGNCSSSGSGGGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGI 359
E +S D+ N N + S + KKTC DCGT KTPLWRGGPAGPKSLCNACGI
Sbjct: 6 EKGSESEDMNNKNPDAVSS--AESQVNEPKKTCADCGTTKTPLWRGGPAGPKSLCNACGI 63
Query: 360 KSRKKRQAALGIKQ--EDNKMKNNKSNDSCLAPDNQTVKNGKGDSG-SVKNKIKKTESED 530
+SRKKR+A LG+ + D++ SN S N NG G S+K ++ E
Sbjct: 64 RSRKKRRAFLGLNKGSTDDRKAKRSSNHS----HNNGGGNGNNKLGDSLKRRLFALGREV 119
Query: 531 CNDKKSVKRGGRFL 572
+ +V++ R L
Sbjct: 120 LLQRSTVEKQRRKL 133
>gi|224123912|ref|XP_002330240.1| predicted protein [Populus trichocarpa]
Length = 161
Score = 93 bits (229), Expect = 5e-017
Identities = 61/157 (38%), Positives = 78/157 (49%), Gaps = 6/157 (3%)
Frame = +3
Query: 213 GNCSSSGSGGGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALG 392
G SS G G + KK C DC T KTPLWRGGPAGPKSLCNACGI+ RKKR
Sbjct: 5 GTKSSREDESSGSGDIEGKKACTDCKTTKTPLWRGGPAGPKSLCNACGIRYRKKRSVMRL 64
Query: 393 IKQEDNKMKNNKSNDSCLAPDNQTVKNGKGDSGSVKNKIKKTESEDCNDKKSVKRGGRFL 572
K + K + ++++ A D T+ + + SE V L
Sbjct: 65 EKGPEKKREKTTTSNTTTATDISTITTATTTNTAQVVSGNGLISESLRMSLMVLGEEMML 124
Query: 573 DLGFKVPVMKRSVVEKKRVWMKLGEEERAAVLLMALS 683
+ V+K+ ++KR KL EEE+AA LMALS
Sbjct: 125 Q---RPSVVKKQRCQRKR---KLREEEQAAFSLMALS 155
>gi|255556286|ref|XP_002519177.1| GATA transcription factor, putative [Ricinus
communis]
Length = 149
Score = 91 bits (225), Expect = 1e-016
Identities = 48/111 (43%), Positives = 65/111 (58%), Gaps = 6/111 (5%)
Frame = +3
Query: 252 GGGDTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQ----EDNKMK 419
G KK+C DCGT KTPLWRGGPAGPKSLCNACGI+SRKK++ +LG+ + D K +
Sbjct: 21 GENQQKKSCADCGTTKTPLWRGGPAGPKSLCNACGIRSRKKKRDSLGLNRASSNPDKKSR 80
Query: 420 NNKSNDSCLAPDNQTVKNGKGDSGSVKNKIKKTESEDCNDKKSVKRGGRFL 572
+ S++ N N GD +K ++ E + SV++ R L
Sbjct: 81 KHSSSNGSSNNHNSNNSNRLGD--GLKQRLLALGREVLMQRSSVEKQRRKL 129
>gi|297829216|ref|XP_002882490.1| hypothetical protein ARALYDRAFT_477989
[Arabidopsis lyrata subsp. lyrata]
Length = 137
Score = 91 bits (225), Expect = 1e-016
Identities = 40/69 (57%), Positives = 49/69 (71%)
Frame = +3
Query: 228 SGSGGGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQED 407
S S G + KK+C CGT+KTPLWRGGPAGPKSLCNACGI++RKKR+ + + ED
Sbjct: 15 SSSSSSNEGISNEKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTLISNRSED 74
Query: 408 NKMKNNKSN 434
K KN+ N
Sbjct: 75 KKNKNHNRN 83
>gi|224130312|ref|XP_002328578.1| predicted protein [Populus trichocarpa]
Length = 125
Score = 89 bits (218), Expect = 9e-016
Identities = 41/74 (55%), Positives = 54/74 (72%), Gaps = 2/74 (2%)
Frame = +3
Query: 267 KKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQ--EDNKMKNNKSNDS 440
KKTC DCGT+KTPLWRGGPAGPKSLCNACGI+SRKK++ LG+ + ++K SN++
Sbjct: 13 KKTCADCGTSKTPLWRGGPAGPKSLCNACGIRSRKKKRDILGLNKGAANDKRAKKGSNNN 72
Query: 441 CLAPDNQTVKNGKG 482
+ +N + G G
Sbjct: 73 GSSNNNNNKQLGDG 86
>gi|18397703|ref|NP_566290.1| GATA transcription factor 15 [Arabidopsis
thaliana]
Length = 149
Score = 87 bits (213), Expect = 3e-015
Identities = 36/56 (64%), Positives = 45/56 (80%)
Frame = +3
Query: 267 KKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSN 434
KK+C CGT+KTPLWRGGPAGPKSLCNACGI++RKKR+ + + ED K K++ N
Sbjct: 40 KKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTLISNRSEDKKKKSHNRN 95
>gi|21536761|gb|AAM61093.1| unknown [Arabidopsis thaliana]
Length = 136
Score = 87 bits (213), Expect = 3e-015
Identities = 36/56 (64%), Positives = 45/56 (80%)
Frame = +3
Query: 267 KKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSN 434
KK+C CGT+KTPLWRGGPAGPKSLCNACGI++RKKR+ + + ED K K++ N
Sbjct: 27 KKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTLISNRSEDKKKKSHNRN 82
>gi|7549639|gb|AAF63824.1| hypothetical protein [Arabidopsis thaliana]
Length = 136
Score = 87 bits (213), Expect = 3e-015
Identities = 36/56 (64%), Positives = 45/56 (80%)
Frame = +3
Query: 267 KKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSN 434
KK+C CGT+KTPLWRGGPAGPKSLCNACGI++RKKR+ + + ED K K++ N
Sbjct: 27 KKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTLISNRSEDKKKKSHNRN 82
>gi|224110254|ref|XP_002315462.1| predicted protein [Populus trichocarpa]
Length = 125
Score = 87 bits (213), Expect = 3e-015
Identities = 38/62 (61%), Positives = 47/62 (75%), Gaps = 5/62 (8%)
Frame = +3
Query: 267 KKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQ-----EDNKMKNNKS 431
KKTC DCGT+KTPLWRGGPAGPKSLCNACGI+SRKK++ LG+ + D + K +
Sbjct: 13 KKTCADCGTSKTPLWRGGPAGPKSLCNACGIRSRKKKRDILGLNKGGAAANDKRAKKGST 72
Query: 432 ND 437
N+
Sbjct: 73 NN 74
>gi|15239847|ref|NP_199741.1| GATA transcription factor 16 [Arabidopsis
thaliana]
Length = 139
Score = 86 bits (212), Expect = 5e-015
Identities = 42/95 (44%), Positives = 56/95 (58%), Gaps = 4/95 (4%)
Frame = +3
Query: 261 DTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSNDS 440
D KKTC DCGT+KTPLWRGGP GPKSLCNACGI++RKKR+ EDNK S+
Sbjct: 33 DKKKTCADCGTSKTPLWRGGPVGPKSLCNACGIRNRKKRRGG----TEDNKKLKKSSSGG 88
Query: 441 CLAPDNQTVKNGKGDSGSVKNKIKKTESEDCNDKK 545
+++K D G K + + + +++
Sbjct: 89 GNRKFGESLKQSLMDLGIRKRSTVEKQRQKLGEEE 123
>gi|297795681|ref|XP_002865725.1| hypothetical protein ARALYDRAFT_917909
[Arabidopsis lyrata subsp. lyrata]
Length = 111
Score = 85 bits (209), Expect = 1e-014
Identities = 42/78 (53%), Positives = 50/78 (64%), Gaps = 5/78 (6%)
Frame = +3
Query: 267 KKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSNDSCL 446
KKTC DCGT+KTPLWRGGPAGPKSLCNACGI++RKKR+ EDNK S+
Sbjct: 8 KKTCADCGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRGT-----EDNKKLKKSSSGGGN 62
Query: 447 APDNQTVKNGKGDSGSVK 500
+++K D G K
Sbjct: 63 PKLGESLKQRLMDFGITK 80
>gi|147814791|emb|CAN74414.1| hypothetical protein VITISV_042395 [Vitis
vinifera]
Length = 125
Score = 82 bits (202), Expect = 7e-014
Identities = 43/97 (44%), Positives = 61/97 (62%), Gaps = 10/97 (10%)
Frame = +3
Query: 261 DTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSND- 437
+ KK C DC T KTPLWRGGPAGPKSLCNACGI+ RK+R + +G+ ++ +M N+ S+D
Sbjct: 16 EIKKCCTDCKTTKTPLWRGGPAGPKSLCNACGIRYRKRRSSMVGVNKKKERM-NSGSHDL 74
Query: 438 ------SCLAPDNQTVKNGKGDSGSVKNKIKKTESED 530
S +A N+ + + SVK + +K E+
Sbjct: 75 SETLKQSLMALGNEVMM--QRQRSSVKKQRRKLGEEE 109
>gi|225450647|ref|XP_002278369.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 124
Score = 82 bits (202), Expect = 7e-014
Identities = 43/97 (44%), Positives = 61/97 (62%), Gaps = 10/97 (10%)
Frame = +3
Query: 261 DTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSND- 437
+ KK C DC T KTPLWRGGPAGPKSLCNACGI+ RK+R + +G+ ++ +M N+ S+D
Sbjct: 15 EIKKCCTDCKTTKTPLWRGGPAGPKSLCNACGIRYRKRRSSMVGVNKKKERM-NSGSHDL 73
Query: 438 ------SCLAPDNQTVKNGKGDSGSVKNKIKKTESED 530
S +A N+ + + SVK + +K E+
Sbjct: 74 SETLKQSLMALGNEVMM--QRQRSSVKKQRRKLGEEE 108
>gi|296089747|emb|CBI39566.3| unnamed protein product [Vitis vinifera]
Length = 109
Score = 81 bits (198), Expect = 2e-013
Identities = 32/59 (54%), Positives = 43/59 (72%)
Frame = +3
Query: 261 DTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSND 437
+ KK C DC T KTPLWRGGPAGPKSLCNACGI+ RK+R + +G+ ++ +M + +
Sbjct: 16 EIKKCCTDCKTTKTPLWRGGPAGPKSLCNACGIRYRKRRSSMVGVNKKKERMNSETEEE 74
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,129,805,850,752
Number of Sequences: 15229318
Number of Extensions: 5129805850752
Number of Successful Extensions: 1187520479
Number of sequences better than 0.0: 0