BLASTX 7.6.2
Query= UN42145 /QuerySize=789
(788 letters)
Database: GenBank nr
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|15228899|ref|NP_188312.1| GATA transcription factor 17 [Arabi... 249 5e-064
gi|297834584|ref|XP_002885174.1| hypothetical protein ARALYDRAFT... 210 2e-052
gi|297800552|ref|XP_002868160.1| hypothetical protein ARALYDRAFT... 205 8e-051
gi|240255906|ref|NP_680707.4| GATA type zinc finger transcriptio... 127 3e-027
gi|302398795|gb|ADL36692.1| GATA domain class transcription fact... 102 1e-019
gi|224123912|ref|XP_002330240.1| predicted protein [Populus tric... 101 1e-019
gi|225431869|ref|XP_002275498.1| PREDICTED: hypothetical protein... 93 3e-017
gi|255633610|gb|ACU17164.1| unknown [Glycine max] 93 3e-017
gi|255556286|ref|XP_002519177.1| GATA transcription factor, puta... 84 3e-014
gi|15239847|ref|NP_199741.1| GATA transcription factor 16 [Arabi... 81 1e-013
gi|18397703|ref|NP_566290.1| GATA transcription factor 15 [Arabi... 80 2e-013
gi|21536761|gb|AAM61093.1| unknown [Arabidopsis thaliana] 80 2e-013
gi|7549639|gb|AAF63824.1| hypothetical protein [Arabidopsis thal... 80 2e-013
gi|297829216|ref|XP_002882490.1| hypothetical protein ARALYDRAFT... 80 2e-013
gi|224130312|ref|XP_002328578.1| predicted protein [Populus tric... 80 3e-013
gi|224110254|ref|XP_002315462.1| predicted protein [Populus tric... 79 5e-013
gi|297795681|ref|XP_002865725.1| hypothetical protein ARALYDRAFT... 79 9e-013
gi|255542842|ref|XP_002512484.1| conserved hypothetical protein ... 77 3e-012
gi|118488832|gb|ABK96226.1| unknown [Populus trichocarpa x Popul... 77 3e-012
>gi|15228899|ref|NP_188312.1| GATA transcription factor 17 [Arabidopsis
thaliana]
Length = 190
Score = 249 bits (634), Expect = 5e-064
Identities = 134/194 (69%), Positives = 154/194 (79%), Gaps = 23/194 (11%)
Frame = +2
Query: 80 MSEGS----VKVDSAGELSDVDNENC-SSGGGGGSSSGDTKRICVDCGTLRTPLWRGGPA 244
MSEGS K+DSAGELSDVDNENC SSG GGGSSSGDTKR CVDCGT+RTPLWRGGPA
Sbjct: 1 MSEGSEDTKTKLDSAGELSDVDNENCSSSGSGGGSSSGDTKRTCVDCGTIRTPLWRGGPA 60
Query: 245 GPKSLCNACGIKSRKKRQAAALGIKPEEKKRKRRSNSSSDSDLSFDEHRDAKKKIINKGD 424
GPKSLCNACGIKSRKKRQ AALG++ EEKK+ R+SN ++D +L +HR+AKK IN D
Sbjct: 61 GPKSLCNACGIKSRKKRQ-AALGMRSEEKKKNRKSNCNNDLNL---DHRNAKKYKINIVD 116
Query: 425 DDDL--------------TSRSNSEGVSKYLDIGFKVPAMKRSAVEKKRLWKKLGEEERA 562
D + +S S+++GVSK+LD+GFKVP MKRSAVEKKRLW+KLGEEERA
Sbjct: 117 DGKIDIDDDPKICNNKRSSSSSSNKGVSKFLDLGFKVPVMKRSAVEKKRLWRKLGEEERA 176
Query: 563 AVLLMALSCGYVYS 604
AVLLMALSC VY+
Sbjct: 177 AVLLMALSCSSVYA 190
>gi|297834584|ref|XP_002885174.1| hypothetical protein ARALYDRAFT_479155
[Arabidopsis lyrata subsp. lyrata]
Length = 175
Score = 210 bits (534), Expect = 2e-052
Identities = 117/176 (66%), Positives = 132/176 (75%), Gaps = 12/176 (6%)
Frame = +2
Query: 80 MSEGS----VKVDSAGELSDVDNENCSSGGGGGSSSGDTKRICVDCGTLRTPLWRGGPAG 247
MSEGS KVDSAGELSDVDNENCSS G GG SSGDTKR CVDCGT+RTPLWRGGPAG
Sbjct: 1 MSEGSEETKTKVDSAGELSDVDNENCSSSGSGGGSSGDTKRTCVDCGTIRTPLWRGGPAG 60
Query: 248 PKSLCNACGIKSRKKRQAAALGIKPEEKKRKRRSNSSSDSDLSFDEHRDAKKKIINKGDD 427
PKSLCNACGIKSRKKRQ AALG++ EEKK+ R+ SS +DL+ D HR+AK INK DD
Sbjct: 61 PKSLCNACGIKSRKKRQ-AALGMRSEEKKKNRK---SSGNDLNLD-HRNAKNDKINK-DD 114
Query: 428 DDLTSRSNSEGVSK--YLDIGFKVPAMKRSAVEKKRLWKKLGEEERAAVLLMALSC 589
D + N + +K ++ + VEKKRLW+KLGEEERAAVLLMALSC
Sbjct: 115 DAKNDKINKDDDAKNDKINKDDDLKTCNSKTVEKKRLWRKLGEEERAAVLLMALSC 170
>gi|297800552|ref|XP_002868160.1| hypothetical protein ARALYDRAFT_329901
[Arabidopsis lyrata subsp. lyrata]
Length = 176
Score = 205 bits (520), Expect = 8e-051
Identities = 110/175 (62%), Positives = 132/175 (75%), Gaps = 15/175 (8%)
Frame = +2
Query: 92 SVKVDSAGELSDVDNENCSSGGGGGSSSGDTKRICVDCGTLRTPLWRGGPAGPKSLCNAC 271
+ K++SAG+ SDVDN NCSS G G GDTK+ CVDCGT RTPLWRGGPAGPKSLCNAC
Sbjct: 9 TTKLESAGDSSDVDNGNCSSSGSG----GDTKKTCVDCGTSRTPLWRGGPAGPKSLCNAC 64
Query: 272 GIKSRKKRQAAALGIKPEEKKRKRRSNSSSDSD-----LSFDEHRDAKKKIINKGDDDDL 436
GIKSRKKRQ AALGI+ E+ K K + N++ + + + E + K KI K D ++
Sbjct: 65 GIKSRKKRQ-AALGIRQEDNKMKNKCNNNLNLENRTVKIGKGEPGNVKNKI--KTDPENF 121
Query: 437 TSRSNSEGVSK---YLDIGFKVPAMKRSAVEKKRLWKKLGEEERAAVLLMALSCG 592
+S +N++ V K +LD GFKVPAMKRSAVEKKRLW+KLGEEERAAVLLMALSCG
Sbjct: 122 SSSNNNKNVKKVGRFLDFGFKVPAMKRSAVEKKRLWRKLGEEERAAVLLMALSCG 176
>gi|240255906|ref|NP_680707.4| GATA type zinc finger transcription factor family
protein [Arabidopsis thaliana]
Length = 197
Score = 127 bits (317), Expect = 3e-027
Identities = 62/89 (69%), Positives = 73/89 (82%), Gaps = 5/89 (5%)
Frame = +2
Query: 92 SVKVDSAGELSDVDNENCSSGGGGGSSSGDTKRICVDCGTLRTPLWRGGPAGPKSLCNAC 271
+ K++SAG+ SDVDN NCSS G G GDTK+ CVDCGT RTPLWRGGPAGPKSLCNAC
Sbjct: 9 TTKLESAGDSSDVDNGNCSSSGSG----GDTKKTCVDCGTSRTPLWRGGPAGPKSLCNAC 64
Query: 272 GIKSRKKRQAAALGIKPEEKKRKRRSNSS 358
GIKSRKKRQ AALGI+ ++ K K +SN++
Sbjct: 65 GIKSRKKRQ-AALGIRQDDIKIKSKSNNN 92
Score = 86 bits (210), Expect = 7e-015
Identities = 47/74 (63%), Positives = 57/74 (77%), Gaps = 6/74 (8%)
Frame = +2
Query: 383 EHRDAKKKIINKGDDDDLTSRSNS----EGVSKYLDIGFKVPAMKRSAVEKKRLWKKLGE 550
E + K KI K D ++ +S +N+ + V ++LD GFKVPAMKRSAVEKKRLW+KLGE
Sbjct: 126 EPGNVKNKI--KRDPENSSSSNNNKKNVKRVGRFLDFGFKVPAMKRSAVEKKRLWRKLGE 183
Query: 551 EERAAVLLMALSCG 592
EERAAVLLMALSCG
Sbjct: 184 EERAAVLLMALSCG 197
>gi|302398795|gb|ADL36692.1| GATA domain class transcription factor [Malus x
domestica]
Length = 342
Score = 102 bits (252), Expect = 1e-019
Identities = 60/159 (37%), Positives = 84/159 (52%), Gaps = 9/159 (5%)
Frame = +2
Query: 140 NCSSGGGGGSSSGDTKRICVDCGTLRTPLWRGGPAGPKSLCNACGIKSRKKRQAAALGIK 319
+CS+ SS R+C DC T +TPLWR GP GPKSLCNACGI+ RK R+A A
Sbjct: 187 SCSNNSSNNMSSLPIIRVCSDCSTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAA 246
Query: 320 PEEKKRKRRSNSSSDSDLSFDEHRDAKKKI-----INKGDDDDLTSRSNSEGVSKYLDIG 484
+ ++ S +H+D K ++ K + LTS +S G SK L
Sbjct: 247 AAAASGTTLTVAAPSMKSSKVQHKDNKSRVSSTVPFKKRPYNKLTSSPSSRGKSKKL--C 304
Query: 485 FKVPAMKRSAVEKKRLWKKLGEEERAAVLLMALSCGYVY 601
F+ P + +R++ + +E AA+LLMALSCG V+
Sbjct: 305 FEAPTAAAATTALQRVFPQ--DEREAAILLMALSCGLVH 341
>gi|224123912|ref|XP_002330240.1| predicted protein [Populus trichocarpa]
Length = 161
Score = 101 bits (251), Expect = 1e-019
Identities = 68/159 (42%), Positives = 92/159 (57%), Gaps = 11/159 (6%)
Frame = +2
Query: 146 SSGGGGGSSSGDT--KRICVDCGTLRTPLWRGGPAGPKSLCNACGIKSRKKRQAAALGIK 319
SS S SGD K+ C DC T +TPLWRGGPAGPKSLCNACGI+ RKKR L
Sbjct: 8 SSREDESSGSGDIEGKKACTDCKTTKTPLWRGGPAGPKSLCNACGIRYRKKRSVMRLEKG 67
Query: 320 PEEKKRK-RRSNSSSDSDLS-FDEHRDAKKKIINKGDDDDLTSRSNSEGVSKYLDIGFKV 493
PE+K+ K SN+++ +D+S + G + L S S + + +G ++
Sbjct: 68 PEKKREKTTTSNTTTATDISTITTATTTNTAQVVSG--NGLISESLRMSL---MVLGEEM 122
Query: 494 PAMKRSAVEKKRLW--KKLGEEERAAVLLMALSCGYVYS 604
+ S V+K+R +KL EEE+AA LMALSCG V++
Sbjct: 123 MLQRPSVVKKQRCQRKRKLREEEQAAFSLMALSCGSVFA 161
>gi|225431869|ref|XP_002275498.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 153
Score = 93 bits (230), Expect = 3e-017
Identities = 45/91 (49%), Positives = 61/91 (67%), Gaps = 4/91 (4%)
Frame = +2
Query: 104 DSAGELSDVDNENCSSGGGGGSSSGDTKRICVDCGTLRTPLWRGGPAGPKSLCNACGIKS 283
+ E D++N+N + S + K+ C DCGT +TPLWRGGPAGPKSLCNACGI+S
Sbjct: 6 EKGSESEDMNNKNPDAVSSAESQVNEPKKTCADCGTTKTPLWRGGPAGPKSLCNACGIRS 65
Query: 284 RKKRQAAALGI---KPEEKKRKRRSNSSSDS 367
RKKR+ A LG+ +++K KR SN S ++
Sbjct: 66 RKKRR-AFLGLNKGSTDDRKAKRSSNHSHNN 95
>gi|255633610|gb|ACU17164.1| unknown [Glycine max]
Length = 130
Score = 93 bits (230), Expect = 3e-017
Identities = 50/112 (44%), Positives = 70/112 (62%), Gaps = 12/112 (10%)
Frame = +2
Query: 101 VDSAGELSDVD------NENCSSGGGGGSSSGDTKRICVDCGTLRTPLWRGGPAGPKSLC 262
VD G+ S+++ N N S G SS+ + K+ C DCGT +TPLWRGGPAGPKSLC
Sbjct: 2 VDPTGKGSEIEVEDSNSNPNAPSSGNSPSSNNEQKKTCADCGTTKTPLWRGGPAGPKSLC 61
Query: 263 NACGIKSRKKRQAAALGIKP---EEKKRKRRSNSSSDSDLSFDEHRDAKKKI 409
NACGI+SRKK++ A LGI E+ ++ +R+ + ++ HR KK+
Sbjct: 62 NACGIRSRKKKR-AILGINKGSNEDGRKGKRTGGALGKEVLL--HRSHWKKL 110
>gi|255556286|ref|XP_002519177.1| GATA transcription factor, putative [Ricinus
communis]
Length = 149
Score = 84 bits (205), Expect = 3e-014
Identities = 42/83 (50%), Positives = 57/83 (68%), Gaps = 8/83 (9%)
Frame = +2
Query: 137 ENCSSGGGGGSSSGDTKRICVDCGTLRTPLWRGGPAGPKSLCNACGIKSRKKRQAAALGI 316
E+ SS G + K+ C DCGT +TPLWRGGPAGPKSLCNACGI+SRKK++ +LG+
Sbjct: 12 EDMSSKSAEGEN--QQKKSCADCGTTKTPLWRGGPAGPKSLCNACGIRSRKKKR-DSLGL 68
Query: 317 -----KPEEKKRKRRSNSSSDSD 370
P++K RK S++ S ++
Sbjct: 69 NRASSNPDKKSRKHSSSNGSSNN 91
>gi|15239847|ref|NP_199741.1| GATA transcription factor 16 [Arabidopsis
thaliana]
Length = 139
Score = 81 bits (199), Expect = 1e-013
Identities = 37/73 (50%), Positives = 47/73 (64%), Gaps = 6/73 (8%)
Frame = +2
Query: 167 SSSGDTKRICVDCGTLRTPLWRGGPAGPKSLCNACGIKSRKKRQAAALGIKPEEKKRKRR 346
+S D K+ C DCGT +TPLWRGGP GPKSLCNACGI++RKKR+ E +K +
Sbjct: 29 TSVNDKKKTCADCGTSKTPLWRGGPVGPKSLCNACGIRNRKKRRGGT------EDNKKLK 82
Query: 347 SNSSSDSDLSFDE 385
+SS + F E
Sbjct: 83 KSSSGGGNRKFGE 95
>gi|18397703|ref|NP_566290.1| GATA transcription factor 15 [Arabidopsis
thaliana]
Length = 149
Score = 80 bits (197), Expect = 2e-013
Identities = 38/85 (44%), Positives = 53/85 (62%), Gaps = 1/85 (1%)
Frame = +2
Query: 116 ELSDVDNENCSSGGGGGSSSGDTKRICVDCGTLRTPLWRGGPAGPKSLCNACGIKSRKKR 295
+L+ VD S + + K+ C CGT +TPLWRGGPAGPKSLCNACGI++RKKR
Sbjct: 17 KLTSVDAIEEHSSSSSNEAISNEKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKR 76
Query: 296 QAAALGIKPEEKKRKRRSNSSSDSD 370
+ + + E+KK+K + + D
Sbjct: 77 R-TLISNRSEDKKKKSHNRNPKFGD 100
>gi|21536761|gb|AAM61093.1| unknown [Arabidopsis thaliana]
Length = 136
Score = 80 bits (197), Expect = 2e-013
Identities = 38/85 (44%), Positives = 53/85 (62%), Gaps = 1/85 (1%)
Frame = +2
Query: 116 ELSDVDNENCSSGGGGGSSSGDTKRICVDCGTLRTPLWRGGPAGPKSLCNACGIKSRKKR 295
+L+ VD S + + K+ C CGT +TPLWRGGPAGPKSLCNACGI++RKKR
Sbjct: 4 KLTSVDAIEEHSSSSSNEAISNEKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKR 63
Query: 296 QAAALGIKPEEKKRKRRSNSSSDSD 370
+ + + E+KK+K + + D
Sbjct: 64 R-TLISNRSEDKKKKSHNRNPKFGD 87
>gi|7549639|gb|AAF63824.1| hypothetical protein [Arabidopsis thaliana]
Length = 136
Score = 80 bits (197), Expect = 2e-013
Identities = 38/85 (44%), Positives = 53/85 (62%), Gaps = 1/85 (1%)
Frame = +2
Query: 116 ELSDVDNENCSSGGGGGSSSGDTKRICVDCGTLRTPLWRGGPAGPKSLCNACGIKSRKKR 295
+L+ VD S + + K+ C CGT +TPLWRGGPAGPKSLCNACGI++RKKR
Sbjct: 4 KLTSVDAIEEHSSSSSNEAISNEKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKR 63
Query: 296 QAAALGIKPEEKKRKRRSNSSSDSD 370
+ + + E+KK+K + + D
Sbjct: 64 R-TLISNRSEDKKKKSHNRNPKFGD 87
>gi|297829216|ref|XP_002882490.1| hypothetical protein ARALYDRAFT_477989
[Arabidopsis lyrata subsp. lyrata]
Length = 137
Score = 80 bits (197), Expect = 2e-013
Identities = 40/86 (46%), Positives = 53/86 (61%), Gaps = 2/86 (2%)
Frame = +2
Query: 116 ELSDVDN-ENCSSGGGGGSSSGDTKRICVDCGTLRTPLWRGGPAGPKSLCNACGIKSRKK 292
+L+ VD E SS + K+ C CGT +TPLWRGGPAGPKSLCNACGI++RKK
Sbjct: 4 KLTSVDAIEEHSSSSSSNEGISNEKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKK 63
Query: 293 RQAAALGIKPEEKKRKRRSNSSSDSD 370
R+ + + E+KK K + + D
Sbjct: 64 RR-TLISNRSEDKKNKNHNRNPKFGD 88
>gi|224130312|ref|XP_002328578.1| predicted protein [Populus trichocarpa]
Length = 125
Score = 80 bits (196), Expect = 3e-013
Identities = 37/65 (56%), Positives = 49/65 (75%), Gaps = 4/65 (6%)
Frame = +2
Query: 185 KRICVDCGTLRTPLWRGGPAGPKSLCNACGIKSRKKRQAAALGIK---PEEKKRKRRSNS 355
K+ C DCGT +TPLWRGGPAGPKSLCNACGI+SRKK++ LG+ +K+ K+ SN+
Sbjct: 13 KKTCADCGTSKTPLWRGGPAGPKSLCNACGIRSRKKKR-DILGLNKGAANDKRAKKGSNN 71
Query: 356 SSDSD 370
+ S+
Sbjct: 72 NGSSN 76
>gi|224110254|ref|XP_002315462.1| predicted protein [Populus trichocarpa]
Length = 125
Score = 79 bits (194), Expect = 5e-013
Identities = 38/66 (57%), Positives = 46/66 (69%), Gaps = 4/66 (6%)
Frame = +2
Query: 185 KRICVDCGTLRTPLWRGGPAGPKSLCNACGIKSR-KKRQAAAL---GIKPEEKKRKRRSN 352
K+ C DCGT +TPLWRGGPAGPKSLCNACGI+SR KKR L G +K+ K+ S
Sbjct: 13 KKTCADCGTSKTPLWRGGPAGPKSLCNACGIRSRKKKRDILGLNKGGAAANDKRAKKGST 72
Query: 353 SSSDSD 370
++ SD
Sbjct: 73 NNGSSD 78
>gi|297795681|ref|XP_002865725.1| hypothetical protein ARALYDRAFT_917909
[Arabidopsis lyrata subsp. lyrata]
Length = 111
Score = 79 bits (192), Expect = 9e-013
Identities = 34/57 (59%), Positives = 44/57 (77%), Gaps = 6/57 (10%)
Frame = +2
Query: 185 KRICVDCGTLRTPLWRGGPAGPKSLCNACGIKSRKKRQAAALGIKPEEKKRKRRSNS 355
K+ C DCGT +TPLWRGGPAGPKSLCNACGI++RKKR+ E+ K+ ++S+S
Sbjct: 8 KKTCADCGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRGT------EDNKKLKKSSS 58
>gi|255542842|ref|XP_002512484.1| conserved hypothetical protein [Ricinus
communis]
Length = 151
Score = 77 bits (188), Expect = 3e-012
Identities = 37/70 (52%), Positives = 44/70 (62%)
Frame = +2
Query: 134 NENCSSGGGGGSSSGDTKRICVDCGTLRTPLWRGGPAGPKSLCNACGIKSRKKRQAAALG 313
+E SS G S D+K+ C DC T TPLWR GPAGPKSLCNACGI+ RK ++
Sbjct: 4 DERKSSCGDDDKSKNDSKKSCTDCKTTETPLWRAGPAGPKSLCNACGIRYRKTKRDILSF 63
Query: 314 IKPEEKKRKR 343
K K+RKR
Sbjct: 64 HKSPFKRRKR 73
>gi|118488832|gb|ABK96226.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 147
Score = 77 bits (187), Expect = 3e-012
Identities = 41/85 (48%), Positives = 50/85 (58%), Gaps = 11/85 (12%)
Frame = +2
Query: 116 ELSDVDNENCSSGGGGGSSSGDTKRICVDCGTLRTPLWRGGPAGPKSLCNACGIKSRKKR 295
E D+DN + S + KR C DC T RTP WRGGPAGP++LCNACGI+ RKKR
Sbjct: 11 ESEDMDNTH-------PSKCNEIKRRCTDCQTTRTPCWRGGPAGPRTLCNACGIRQRKKR 63
Query: 296 QAAALGIK---PEEKKRKRRSNSSS 361
+ A LG PE + K S+S
Sbjct: 64 R-ALLGFDKGGPERSREKMAKGSNS 87
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,631,408,714,302
Number of Sequences: 15229318
Number of Extensions: 4631408714302
Number of Successful Extensions: 1098110979
Number of sequences better than 0.0: 0