BLASTX 7.6.2
Query= UN48012 /QuerySize=851
(850 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|297800552|ref|XP_002868160.1| hypothetical protein ARALYDRAFT... 260 2e-067
gi|15228899|ref|NP_188312.1| GATA transcription factor 17 [Arabi... 206 3e-051
gi|240255906|ref|NP_680707.4| GATA type zinc finger transcriptio... 163 4e-038
gi|297834584|ref|XP_002885174.1| hypothetical protein ARALYDRAFT... 144 2e-032
gi|225431869|ref|XP_002275498.1| PREDICTED: hypothetical protein... 99 7e-019
gi|326502532|dbj|BAJ95329.1| predicted protein [Hordeum vulgare ... 96 5e-018
gi|255633610|gb|ACU17164.1| unknown [Glycine max] 95 1e-017
gi|224123912|ref|XP_002330240.1| predicted protein [Populus tric... 94 2e-017
gi|255556286|ref|XP_002519177.1| GATA transcription factor, puta... 92 7e-017
gi|297829216|ref|XP_002882490.1| hypothetical protein ARALYDRAFT... 90 4e-016
gi|224130312|ref|XP_002328578.1| predicted protein [Populus tric... 89 1e-015
gi|18397703|ref|NP_566290.1| GATA transcription factor 15 [Arabi... 87 4e-015
gi|21536761|gb|AAM61093.1| unknown [Arabidopsis thaliana] 87 4e-015
gi|7549639|gb|AAF63824.1| hypothetical protein [Arabidopsis thal... 87 4e-015
gi|15239847|ref|NP_199741.1| GATA transcription factor 16 [Arabi... 85 1e-014
gi|297795681|ref|XP_002865725.1| hypothetical protein ARALYDRAFT... 84 2e-014
gi|224110254|ref|XP_002315462.1| predicted protein [Populus tric... 82 7e-014
gi|115456383|ref|NP_001051792.1| Os03g0831200 [Oryza sativa Japo... 81 2e-013
gi|224128400|ref|XP_002320320.1| predicted protein [Populus tric... 75 9e-012
>gi|297800552|ref|XP_002868160.1| hypothetical protein ARALYDRAFT_329901
[Arabidopsis lyrata subsp. lyrata]
Length = 176
Score = 260 bits (664), Expect = 2e-067
Identities = 134/179 (74%), Positives = 150/179 (83%), Gaps = 9/179 (5%)
Frame = +2
Query: 149 TEESKTTKLESGGDSSDIENGNCSSSGSGSGGGDTKKTCVDCGTNKTPFWRGGPAGPKSS 328
TEE+KTTKLES GDSSD++NGNCSSSGS GGDTKKTCVDCGT++TP WRGGPAGPKS
Sbjct: 4 TEETKTTKLESAGDSSDVDNGNCSSSGS---GGDTKKTCVDCGTSRTPLWRGGPAGPKSL 60
Query: 329 CNACGIKSRKKRQAALGIKQEDNNKMKNNKSNDSYLAPDNQTVKNGKGDSGNVKNKIKTE 508
CNACGIKSRKKRQAALGI+QED NKMKN +N+ L +N+TVK GKG+ GNVKNKIKT+
Sbjct: 61 CNACGIKSRKKRQAALGIRQED-NKMKNKCNNNLNL--ENRTVKIGKGEPGNVKNKIKTD 117
Query: 509 TED---CNEKKSVKRGSRFLDLGFKVPVMKRSVVEKKRVWMKLGEEGRAAVLLMALSCG 676
E+ N K+VK+ RFLD GFKVP MKRS VEKKR+W KLGEE RAAVLLMALSCG
Sbjct: 118 PENFSSSNNNKNVKKVGRFLDFGFKVPAMKRSAVEKKRLWRKLGEEERAAVLLMALSCG 176
>gi|15228899|ref|NP_188312.1| GATA transcription factor 17 [Arabidopsis
thaliana]
Length = 190
Score = 206 bits (524), Expect = 3e-051
Identities = 116/189 (61%), Positives = 135/189 (71%), Gaps = 15/189 (7%)
Frame = +2
Query: 140 MSMTEESKTTKLESGGDSSDIENGNCSSSGSGSG--GGDTKKTCVDCGTNKTPFWRGGPA 313
MS E TKL+S G+ SD++N NCSSSGSG G GDTK+TCVDCGT +TP WRGGPA
Sbjct: 1 MSEGSEDTKTKLDSAGELSDVDNENCSSSGSGGGSSSGDTKRTCVDCGTIRTPLWRGGPA 60
Query: 314 GPKSSCNACGIKSRKKRQAALGIKQEDNNKMKNNKSN-DSYLAPDNQTVKNGK---GDSG 481
GPKS CNACGIKSRKKRQAALG++ E+ K KN KSN ++ L D++ K K D G
Sbjct: 61 GPKSLCNACGIKSRKKRQAALGMRSEE--KKKNRKSNCNNDLNLDHRNAKKYKINIVDDG 118
Query: 482 NVKNKIKTETEDCNEKKSV-----KRGSRFLDLGFKVPVMKRSVVEKKRVWMKLGEEGRA 646
+ I + + CN K+S K S+FLDLGFKVPVMKRS VEKKR+W KLGEE RA
Sbjct: 119 KI--DIDDDPKICNNKRSSSSSSNKGVSKFLDLGFKVPVMKRSAVEKKRLWRKLGEEERA 176
Query: 647 AVLLMALSC 673
AVLLMALSC
Sbjct: 177 AVLLMALSC 185
>gi|240255906|ref|NP_680707.4| GATA type zinc finger transcription factor family
protein [Arabidopsis thaliana]
Length = 197
Score = 163 bits (411), Expect = 4e-038
Identities = 81/114 (71%), Positives = 96/114 (84%), Gaps = 6/114 (5%)
Frame = +2
Query: 149 TEESKTTKLESGGDSSDIENGNCSSSGSGSGGGDTKKTCVDCGTNKTPFWRGGPAGPKSS 328
TEE+KTTKLES GDSSD++NGNCSSSGS GGDTKKTCVDCGT++TP WRGGPAGPKS
Sbjct: 4 TEETKTTKLESAGDSSDVDNGNCSSSGS---GGDTKKTCVDCGTSRTPLWRGGPAGPKSL 60
Query: 329 CNACGIKSRKKRQAALGIKQEDNNKMKNNKSNDSYLAPDNQTVKNGKGDSGNVK 490
CNACGIKSRKKRQAALGI+Q+D K+K+ +N+ L +++ VK GKG+ NVK
Sbjct: 61 CNACGIKSRKKRQAALGIRQDD-IKIKSKSNNN--LGLESRNVKTGKGEPVNVK 111
Score = 109 bits (272), Expect = 5e-022
Identities = 62/115 (53%), Positives = 72/115 (62%), Gaps = 4/115 (3%)
Frame = +2
Query: 344 IKSRKKRQAALGIKQEDNNKMKNNKSNDSYLAPDNQTVKNGKGDSGNVKNKIKTETEDC- 520
IK + K LG++ + K N + VK KG+ GNVKNKIK + E+
Sbjct: 83 IKIKSKSNNNLGLESRNVKTGKGEPVNVKIAKCEPGIVKIAKGEPGNVKNKIKRDPENSS 142
Query: 521 ---NEKKSVKRGSRFLDLGFKVPVMKRSVVEKKRVWMKLGEEGRAAVLLMALSCG 676
N KK+VKR RFLD GFKVP MKRS VEKKR+W KLGEE RAAVLLMALSCG
Sbjct: 143 SSNNNKKNVKRVGRFLDFGFKVPAMKRSAVEKKRLWRKLGEEERAAVLLMALSCG 197
>gi|297834584|ref|XP_002885174.1| hypothetical protein ARALYDRAFT_479155
[Arabidopsis lyrata subsp. lyrata]
Length = 175
Score = 144 bits (362), Expect = 2e-032
Identities = 75/133 (56%), Positives = 93/133 (69%), Gaps = 5/133 (3%)
Frame = +2
Query: 140 MSMTEESKTTKLESGGDSSDIENGNCSSSGSGSG-GGDTKKTCVDCGTNKTPFWRGGPAG 316
MS E TK++S G+ SD++N NCSSSGSG G GDTK+TCVDCGT +TP WRGGPAG
Sbjct: 1 MSEGSEETKTKVDSAGELSDVDNENCSSSGSGGGSSGDTKRTCVDCGTIRTPLWRGGPAG 60
Query: 317 PKSSCNACGIKSRKKRQAALGIKQEDNNKMKNNKSNDSYLAPDNQTVKNGK--GDSGNVK 490
PKS CNACGIKSRKKRQAALG++ E+ K KN KS+ + L D++ KN K D
Sbjct: 61 PKSLCNACGIKSRKKRQAALGMRSEE--KKKNRKSSGNDLNLDHRNAKNDKINKDDDAKN 118
Query: 491 NKIKTETEDCNEK 529
+KI + + N+K
Sbjct: 119 DKINKDDDAKNDK 131
>gi|225431869|ref|XP_002275498.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 153
Score = 99 bits (245), Expect = 7e-019
Identities = 50/104 (48%), Positives = 64/104 (61%), Gaps = 8/104 (7%)
Frame = +2
Query: 176 ESGGDSSDIENGNCSS-SGSGSGGGDTKKTCVDCGTNKTPFWRGGPAGPKSSCNACGIKS 352
E G +S D+ N N + S + S + KKTC DCGT KTP WRGGPAGPKS CNACGI+S
Sbjct: 6 EKGSESEDMNNKNPDAVSSAESQVNEPKKTCADCGTTKTPLWRGGPAGPKSLCNACGIRS 65
Query: 353 RKKRQAALGIKQEDNNKMKNNKSNDSYLAPDNQTVKNGKGDSGN 484
RKKR+A LG+ + + K +S+ N + NG G+ N
Sbjct: 66 RKKRRAFLGLNKGSTDDRKAKRSS-------NHSHNNGGGNGNN 102
>gi|326502532|dbj|BAJ95329.1| predicted protein [Hordeum vulgare subsp.
vulgare]
Length = 181
Score = 96 bits (238), Expect = 5e-018
Identities = 59/156 (37%), Positives = 86/156 (55%), Gaps = 10/156 (6%)
Frame = +2
Query: 212 NCSSSGSGSGGGDTKKTCVDCGTNKTPFWRGGPAGPKSSCNACGIKSRKKRQAALGIKQE 391
+C++SG+G K+C DC T KTP WRGGP GPKS CNACGI+ RK+R+ A+G+ E
Sbjct: 31 DCTASGAGD-----PKSCADCNTTKTPLWRGGPNGPKSLCNACGIRYRKRRRVAMGLDPE 85
Query: 392 DNNKMKNNKSNDSYLAPDNQTVKNGKGDSGNVKNKIKTETEDCNEKKSVKRGSRFLDLGF 571
K K + + +S A + + + + + T + +V+ +GF
Sbjct: 86 AKRKPKRDDAINSAAAAAEASTQQQEEVTKPTDDDKAVSTNKTTKTHTVELHM----VGF 141
Query: 572 -KVPVMKRSVVEKKRVWMKLGEEGRAAVLLMALSCG 676
K V+K+ ++R LGEE RAA+LLMALS G
Sbjct: 142 GKDAVLKQRRRMRRRKPSCLGEEERAAMLLMALSSG 177
>gi|255633610|gb|ACU17164.1| unknown [Glycine max]
Length = 130
Score = 95 bits (235), Expect = 1e-017
Identities = 49/94 (52%), Positives = 57/94 (60%), Gaps = 7/94 (7%)
Frame = +2
Query: 185 GDSSDIE------NGNCSSSG-SGSGGGDTKKTCVDCGTNKTPFWRGGPAGPKSSCNACG 343
G S+IE N N SSG S S + KKTC DCGT KTP WRGGPAGPKS CNACG
Sbjct: 6 GKGSEIEVEDSNSNPNAPSSGNSPSSNNEQKKTCADCGTTKTPLWRGGPAGPKSLCNACG 65
Query: 344 IKSRKKRQAALGIKQEDNNKMKNNKSNDSYLAPD 445
I+SRKK++A LGI + N + K L +
Sbjct: 66 IRSRKKKRAILGINKGSNEDGRKGKRTGGALGKE 99
>gi|224123912|ref|XP_002330240.1| predicted protein [Populus trichocarpa]
Length = 161
Score = 94 bits (232), Expect = 2e-017
Identities = 60/163 (36%), Positives = 89/163 (54%), Gaps = 12/163 (7%)
Frame = +2
Query: 197 DIENGNCSSSGSGSGGGDT--KKTCVDCGTNKTPFWRGGPAGPKSSCNACGIKSRKKRQA 370
D++ S SG GD KK C DC T KTP WRGGPAGPKS CNACGI+ RKKR +
Sbjct: 2 DLKGTKSSREDESSGSGDIEGKKACTDCKTTKTPLWRGGPAGPKSLCNACGIRYRKKR-S 60
Query: 371 ALGIKQEDNNKMKNNKSNDSYLAPDNQTVKNGKGDSGNVKNKIKTETEDCNEKKSVKRGS 550
+ +++ K + ++++ A D T+ + N + + + +S++
Sbjct: 61 VMRLEKGPEKKREKTTTSNTTTATDISTI-----TTATTTNTAQVVSGNGLISESLRMS- 114
Query: 551 RFLDLGFKVPVMKRSVVEKKRVW--MKLGEEGRAAVLLMALSC 673
+ LG ++ + + SVV+K+R KL EE +AA LMALSC
Sbjct: 115 -LMVLGEEMMLQRPSVVKKQRCQRKRKLREEEQAAFSLMALSC 156
>gi|255556286|ref|XP_002519177.1| GATA transcription factor, putative [Ricinus
communis]
Length = 149
Score = 92 bits (228), Expect = 7e-017
Identities = 42/88 (47%), Positives = 54/88 (61%)
Frame = +2
Query: 221 SSGSGSGGGDTKKTCVDCGTNKTPFWRGGPAGPKSSCNACGIKSRKKRQAALGIKQEDNN 400
SS S G KK+C DCGT KTP WRGGPAGPKS CNACGI+SRKK++ +LG+ + +N
Sbjct: 15 SSKSAEGENQQKKSCADCGTTKTPLWRGGPAGPKSLCNACGIRSRKKKRDSLGLNRASSN 74
Query: 401 KMKNNKSNDSYLAPDNQTVKNGKGDSGN 484
K ++ + S N N G+
Sbjct: 75 PDKKSRKHSSSNGSSNNHNSNNSNRLGD 102
>gi|297829216|ref|XP_002882490.1| hypothetical protein ARALYDRAFT_477989
[Arabidopsis lyrata subsp. lyrata]
Length = 137
Score = 90 bits (221), Expect = 4e-016
Identities = 44/87 (50%), Positives = 56/87 (64%), Gaps = 1/87 (1%)
Frame = +2
Query: 173 LESGGDSSDIENGNCSSSGSGSGGGDTKKTCVDCGTNKTPFWRGGPAGPKSSCNACGIKS 352
+ES S D + SSS S G + KK+C CGT+KTP WRGGPAGPKS CNACGI++
Sbjct: 1 MESKLTSVDAIEEHSSSSSSNEGISNEKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRN 60
Query: 353 RKKRQAALGIKQEDNNKMKNNKSNDSY 433
RKKR+ + + ED K KN+ N +
Sbjct: 61 RKKRRTLISNRSED-KKNKNHNRNPKF 86
>gi|224130312|ref|XP_002328578.1| predicted protein [Populus trichocarpa]
Length = 125
Score = 89 bits (218), Expect = 1e-015
Identities = 44/85 (51%), Positives = 55/85 (64%), Gaps = 1/85 (1%)
Frame = +2
Query: 221 SSGSGSGGGDTKKTCVDCGTNKTPFWRGGPAGPKSSCNACGIKSRKKRQAALGI-KQEDN 397
SS S KKTC DCGT+KTP WRGGPAGPKS CNACGI+SRKK++ LG+ K N
Sbjct: 2 SSNSQETESPLKKTCADCGTSKTPLWRGGPAGPKSLCNACGIRSRKKKRDILGLNKGAAN 61
Query: 398 NKMKNNKSNDSYLAPDNQTVKNGKG 472
+K SN++ + +N + G G
Sbjct: 62 DKRAKKGSNNNGSSNNNNNKQLGDG 86
>gi|18397703|ref|NP_566290.1| GATA transcription factor 15 [Arabidopsis
thaliana]
Length = 149
Score = 87 bits (213), Expect = 4e-015
Identities = 37/68 (54%), Positives = 48/68 (70%)
Frame = +2
Query: 218 SSSGSGSGGGDTKKTCVDCGTNKTPFWRGGPAGPKSSCNACGIKSRKKRQAALGIKQEDN 397
SSS S + KK+C CGT+KTP WRGGPAGPKS CNACGI++RKKR+ + + ED
Sbjct: 28 SSSSSNEAISNEKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTLISNRSEDK 87
Query: 398 NKMKNNKS 421
K +N++
Sbjct: 88 KKKSHNRN 95
>gi|21536761|gb|AAM61093.1| unknown [Arabidopsis thaliana]
Length = 136
Score = 87 bits (213), Expect = 4e-015
Identities = 37/68 (54%), Positives = 48/68 (70%)
Frame = +2
Query: 218 SSSGSGSGGGDTKKTCVDCGTNKTPFWRGGPAGPKSSCNACGIKSRKKRQAALGIKQEDN 397
SSS S + KK+C CGT+KTP WRGGPAGPKS CNACGI++RKKR+ + + ED
Sbjct: 15 SSSSSNEAISNEKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTLISNRSEDK 74
Query: 398 NKMKNNKS 421
K +N++
Sbjct: 75 KKKSHNRN 82
>gi|7549639|gb|AAF63824.1| hypothetical protein [Arabidopsis thaliana]
Length = 136
Score = 87 bits (213), Expect = 4e-015
Identities = 37/68 (54%), Positives = 48/68 (70%)
Frame = +2
Query: 218 SSSGSGSGGGDTKKTCVDCGTNKTPFWRGGPAGPKSSCNACGIKSRKKRQAALGIKQEDN 397
SSS S + KK+C CGT+KTP WRGGPAGPKS CNACGI++RKKR+ + + ED
Sbjct: 15 SSSSSNEAISNEKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTLISNRSEDK 74
Query: 398 NKMKNNKS 421
K +N++
Sbjct: 75 KKKSHNRN 82
>gi|15239847|ref|NP_199741.1| GATA transcription factor 16 [Arabidopsis
thaliana]
Length = 139
Score = 85 bits (208), Expect = 1e-014
Identities = 37/58 (63%), Positives = 43/58 (74%), Gaps = 4/58 (6%)
Frame = +2
Query: 248 DTKKTCVDCGTNKTPFWRGGPAGPKSSCNACGIKSRKKRQAALGIKQEDNNKMKNNKS 421
D KKTC DCGT+KTP WRGGP GPKS CNACGI++RKKR+ EDN K+K + S
Sbjct: 33 DKKKTCADCGTSKTPLWRGGPVGPKSLCNACGIRNRKKRRGG----TEDNKKLKKSSS 86
>gi|297795681|ref|XP_002865725.1| hypothetical protein ARALYDRAFT_917909
[Arabidopsis lyrata subsp. lyrata]
Length = 111
Score = 84 bits (207), Expect = 2e-014
Identities = 37/56 (66%), Positives = 43/56 (76%), Gaps = 5/56 (8%)
Frame = +2
Query: 254 KKTCVDCGTNKTPFWRGGPAGPKSSCNACGIKSRKKRQAALGIKQEDNNKMKNNKS 421
KKTC DCGT+KTP WRGGPAGPKS CNACGI++RKKR+ EDN K+K + S
Sbjct: 8 KKTCADCGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRGT-----EDNKKLKKSSS 58
>gi|224110254|ref|XP_002315462.1| predicted protein [Populus trichocarpa]
Length = 125
Score = 82 bits (202), Expect = 7e-014
Identities = 37/64 (57%), Positives = 45/64 (70%), Gaps = 5/64 (7%)
Frame = +2
Query: 254 KKTCVDCGTNKTPFWRGGPAGPKSSCNACGIKSRKKRQAALGIKQ-----EDNNKMKNNK 418
KKTC DCGT+KTP WRGGPAGPKS CNACGI+SRKK++ LG+ + D K +
Sbjct: 13 KKTCADCGTSKTPLWRGGPAGPKSLCNACGIRSRKKKRDILGLNKGGAAANDKRAKKGST 72
Query: 419 SNDS 430
+N S
Sbjct: 73 NNGS 76
>gi|115456383|ref|NP_001051792.1| Os03g0831200 [Oryza sativa Japonica Group]
Length = 136
Score = 81 bits (199), Expect = 2e-013
Identities = 37/77 (48%), Positives = 45/77 (58%)
Frame = +2
Query: 188 DSSDIENGNCSSSGSGSGGGDTKKTCVDCGTNKTPFWRGGPAGPKSSCNACGIKSRKKRQ 367
DSS +E G+ S K C DC T KTP WRGGP+GPKS CNACGI+ RKKR+
Sbjct: 2 DSSSVEKGSGSIDPDERTASGEPKACTDCHTTKTPLWRGGPSGPKSLCNACGIRYRKKRR 61
Query: 368 AALGIKQEDNNKMKNNK 418
ALG+ + + K
Sbjct: 62 EALGLDAGEGGAERQEK 78
>gi|224128400|ref|XP_002320320.1| predicted protein [Populus trichocarpa]
Length = 121
Score = 75 bits (184), Expect = 9e-012
Identities = 33/69 (47%), Positives = 44/69 (63%)
Frame = +2
Query: 224 SGSGSGGGDTKKTCVDCGTNKTPFWRGGPAGPKSSCNACGIKSRKKRQAALGIKQEDNNK 403
S S G + K+ C+DC T +TP WRGGPAGP++ CNACGI+ RKKR+A G + +
Sbjct: 3 STQSSKGNEIKRRCMDCQTTRTPCWRGGPAGPRTLCNACGIRQRKKRRALHGSDKGGAER 62
Query: 404 MKNNKSNDS 430
KN + S
Sbjct: 63 SKNKIAKSS 71
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,159,296,754,985
Number of Sequences: 15229318
Number of Extensions: 5159296754985
Number of Successful Extensions: 1204912839
Number of sequences better than 0.0: 0
|