Library    |     Search    |     Batch query    |     SNP    |     SSR  

GenBank blast output of UN47756


BLASTX 7.6.2

Query= UN47756 /QuerySize=823
        (822 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|297800552|ref|XP_002868160.1| hypothetical protein ARALYDRAFT...    262   5e-068
gi|15228899|ref|NP_188312.1| GATA transcription factor 17 [Arabi...    214   1e-053
gi|240255906|ref|NP_680707.4| GATA type zinc finger transcriptio...    168   2e-039
gi|297834584|ref|XP_002885174.1| hypothetical protein ARALYDRAFT...    152   7e-035
gi|255633610|gb|ACU17164.1| unknown [Glycine max]                       99   5e-019
gi|225431869|ref|XP_002275498.1| PREDICTED: hypothetical protein...     94   2e-017
gi|224123912|ref|XP_002330240.1| predicted protein [Populus tric...     93   5e-017
gi|255556286|ref|XP_002519177.1| GATA transcription factor, puta...     91   1e-016
gi|297829216|ref|XP_002882490.1| hypothetical protein ARALYDRAFT...     91   1e-016
gi|224130312|ref|XP_002328578.1| predicted protein [Populus tric...     89   9e-016
gi|18397703|ref|NP_566290.1| GATA transcription factor 15 [Arabi...     87   3e-015
gi|21536761|gb|AAM61093.1| unknown [Arabidopsis thaliana]               87   3e-015
gi|7549639|gb|AAF63824.1| hypothetical protein [Arabidopsis thal...     87   3e-015
gi|224110254|ref|XP_002315462.1| predicted protein [Populus tric...     87   3e-015
gi|15239847|ref|NP_199741.1| GATA transcription factor 16 [Arabi...     86   5e-015
gi|297795681|ref|XP_002865725.1| hypothetical protein ARALYDRAFT...     85   1e-014
gi|147814791|emb|CAN74414.1| hypothetical protein VITISV_042395 ...     82   7e-014
gi|225450647|ref|XP_002278369.1| PREDICTED: hypothetical protein...     82   7e-014
gi|296089747|emb|CBI39566.3| unnamed protein product [Vitis vini...     81   2e-013

>gi|297800552|ref|XP_002868160.1| hypothetical protein ARALYDRAFT_329901
        [Arabidopsis lyrata subsp. lyrata]

          Length = 176

 Score =  262 bits (669), Expect = 5e-068
 Identities = 136/179 (75%), Positives = 153/179 (85%), Gaps = 12/179 (6%)
 Frame = +3

Query: 162 SKTTKLESAGDSSDVENGNCSSSGSGGGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSL 341
           +KTTKLESAGDSSDV+NGNCSSSGS      GGDTKKTCVDCGT++TPLWRGGPAGPKSL
Sbjct:   7 TKTTKLESAGDSSDVDNGNCSSSGS------GGDTKKTCVDCGTSRTPLWRGGPAGPKSL 60

Query: 342 CNACGIKSRKKRQAALGIKQEDNKMKNNKSNDSCLAPDNQTVKNGKGDSGSVKNKIKKTE 521
           CNACGIKSRKKRQAALGI+QEDNKMKN  +N+  L  +N+TVK GKG+ G+VKNKI KT+
Sbjct:  61 CNACGIKSRKKRQAALGIRQEDNKMKNKCNNN--LNLENRTVKIGKGEPGNVKNKI-KTD 117

Query: 522 SED---CNDKKSVKRGGRFLDLGFKVPVMKRSVVEKKRVWMKLGEEERAAVLLMALSCG 689
            E+    N+ K+VK+ GRFLD GFKVP MKRS VEKKR+W KLGEEERAAVLLMALSCG
Sbjct: 118 PENFSSSNNNKNVKKVGRFLDFGFKVPAMKRSAVEKKRLWRKLGEEERAAVLLMALSCG 176

>gi|15228899|ref|NP_188312.1| GATA transcription factor 17 [Arabidopsis
        thaliana]

          Length = 190

 Score =  214 bits (544), Expect = 1e-053
 Identities = 120/190 (63%), Positives = 139/190 (73%), Gaps = 14/190 (7%)
 Frame = +3

Query: 144 MSMTEESKTTKLESAGDSSDVENGNCSSSGSGGGGGGGGDTKKTCVDCGTNKTPLWRGGP 323
           MS   E   TKL+SAG+ SDV+N NCSSSGS GGG   GDTK+TCVDCGT +TPLWRGGP
Sbjct:   1 MSEGSEDTKTKLDSAGELSDVDNENCSSSGS-GGGSSSGDTKRTCVDCGTIRTPLWRGGP 59

Query: 324 AGPKSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSN-DSCLAPDNQTVKNGK---GDSG 491
           AGPKSLCNACGIKSRKKRQAALG++ E+ K KN KSN ++ L  D++  K  K    D G
Sbjct:  60 AGPKSLCNACGIKSRKKRQAALGMRSEEKK-KNRKSNCNNDLNLDHRNAKKYKINIVDDG 118

Query: 492 SVKNKIKKTESEDCNDKKSV-----KRGGRFLDLGFKVPVMKRSVVEKKRVWMKLGEEER 656
            +       + + CN+K+S      K   +FLDLGFKVPVMKRS VEKKR+W KLGEEER
Sbjct: 119 KID---IDDDPKICNNKRSSSSSSNKGVSKFLDLGFKVPVMKRSAVEKKRLWRKLGEEER 175

Query: 657 AAVLLMALSC 686
           AAVLLMALSC
Sbjct: 176 AAVLLMALSC 185

>gi|240255906|ref|NP_680707.4| GATA type zinc finger transcription factor family
        protein [Arabidopsis thaliana]

          Length = 197

 Score =  168 bits (423), Expect = 2e-039
 Identities = 87/138 (63%), Positives = 105/138 (76%), Gaps = 14/138 (10%)
 Frame = +3

Query: 162 SKTTKLESAGDSSDVENGNCSSSGSGGGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSL 341
           +KTTKLESAGDSSDV+NGNCSSSGS      GGDTKKTCVDCGT++TPLWRGGPAGPKSL
Sbjct:   7 TKTTKLESAGDSSDVDNGNCSSSGS------GGDTKKTCVDCGTSRTPLWRGGPAGPKSL 60

Query: 342 CNACGIKSRKKRQAALGIKQEDNKMKNNKSNDSCLAPDNQTVKNGKGDSGSVK------N 503
           CNACGIKSRKKRQAALGI+Q+D K+K+  +N+  L  +++ VK GKG+  +VK       
Sbjct:  61 CNACGIKSRKKRQAALGIRQDDIKIKSKSNNN--LGLESRNVKTGKGEPVNVKIAKCEPG 118

Query: 504 KIKKTESEDCNDKKSVKR 557
            +K  + E  N K  +KR
Sbjct: 119 IVKIAKGEPGNVKNKIKR 136


 Score =  111 bits (276), Expect = 2e-022
 Identities = 64/115 (55%), Positives = 75/115 (65%), Gaps = 4/115 (3%)
 Frame = +3

Query: 357 IKSRKKRQAALGIKQEDNKM-KNNKSNDSCLAPDNQTVKNGKGDSGSVKNKIKK---TES 524
           IK + K    LG++  + K  K    N      +   VK  KG+ G+VKNKIK+     S
Sbjct:  83 IKIKSKSNNNLGLESRNVKTGKGEPVNVKIAKCEPGIVKIAKGEPGNVKNKIKRDPENSS 142

Query: 525 EDCNDKKSVKRGGRFLDLGFKVPVMKRSVVEKKRVWMKLGEEERAAVLLMALSCG 689
              N+KK+VKR GRFLD GFKVP MKRS VEKKR+W KLGEEERAAVLLMALSCG
Sbjct: 143 SSNNNKKNVKRVGRFLDFGFKVPAMKRSAVEKKRLWRKLGEEERAAVLLMALSCG 197

>gi|297834584|ref|XP_002885174.1| hypothetical protein ARALYDRAFT_479155
        [Arabidopsis lyrata subsp. lyrata]

          Length = 175

 Score =  152 bits (383), Expect = 7e-035
 Identities = 82/132 (62%), Positives = 101/132 (76%), Gaps = 7/132 (5%)
 Frame = +3

Query: 153 TEESKTTKLESAGDSSDVENGNCSSSGSGGGGGGGGDTKKTCVDCGTNKTPLWRGGPAGP 332
           +EE+K TK++SAG+ SDV+N NCSSSGS  GGG  GDTK+TCVDCGT +TPLWRGGPAGP
Sbjct:   5 SEETK-TKVDSAGELSDVDNENCSSSGS--GGGSSGDTKRTCVDCGTIRTPLWRGGPAGP 61

Query: 333 KSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSNDSCLAPDNQTVKNGK--GDSGSVKNK 506
           KSLCNACGIKSRKKRQAALG++ E+ K KN KS+ + L  D++  KN K   D  +  +K
Sbjct:  62 KSLCNACGIKSRKKRQAALGMRSEEKK-KNRKSSGNDLNLDHRNAKNDKINKDDDAKNDK 120

Query: 507 IKKTESEDCNDK 542
           I K + +  NDK
Sbjct: 121 INK-DDDAKNDK 131

>gi|255633610|gb|ACU17164.1| unknown [Glycine max]

          Length = 130

 Score =  99 bits (246), Expect = 5e-019
 Identities = 45/83 (54%), Positives = 57/83 (68%), Gaps = 4/83 (4%)
 Frame = +3

Query: 177 LESAGDSSDVE----NGNCSSSGSGGGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSLC 344
           ++  G  S++E    N N ++  SG       + KKTC DCGT KTPLWRGGPAGPKSLC
Sbjct:   2 VDPTGKGSEIEVEDSNSNPNAPSSGNSPSSNNEQKKTCADCGTTKTPLWRGGPAGPKSLC 61

Query: 345 NACGIKSRKKRQAALGIKQEDNK 413
           NACGI+SRKK++A LGI +  N+
Sbjct:  62 NACGIRSRKKKRAILGINKGSNE 84

>gi|225431869|ref|XP_002275498.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 153

 Score =  94 bits (233), Expect = 2e-017
 Identities = 57/134 (42%), Positives = 74/134 (55%), Gaps = 9/134 (6%)
 Frame = +3

Query: 180 ESAGDSSDVENGNCSSSGSGGGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGI 359
           E   +S D+ N N  +  S        + KKTC DCGT KTPLWRGGPAGPKSLCNACGI
Sbjct:   6 EKGSESEDMNNKNPDAVSS--AESQVNEPKKTCADCGTTKTPLWRGGPAGPKSLCNACGI 63

Query: 360 KSRKKRQAALGIKQ--EDNKMKNNKSNDSCLAPDNQTVKNGKGDSG-SVKNKIKKTESED 530
           +SRKKR+A LG+ +   D++     SN S     N    NG    G S+K ++     E 
Sbjct:  64 RSRKKRRAFLGLNKGSTDDRKAKRSSNHS----HNNGGGNGNNKLGDSLKRRLFALGREV 119

Query: 531 CNDKKSVKRGGRFL 572
              + +V++  R L
Sbjct: 120 LLQRSTVEKQRRKL 133

>gi|224123912|ref|XP_002330240.1| predicted protein [Populus trichocarpa]

          Length = 161

 Score =  93 bits (229), Expect = 5e-017
 Identities = 61/157 (38%), Positives = 78/157 (49%), Gaps = 6/157 (3%)
 Frame = +3

Query: 213 GNCSSSGSGGGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALG 392
           G  SS      G G  + KK C DC T KTPLWRGGPAGPKSLCNACGI+ RKKR     
Sbjct:   5 GTKSSREDESSGSGDIEGKKACTDCKTTKTPLWRGGPAGPKSLCNACGIRYRKKRSVMRL 64

Query: 393 IKQEDNKMKNNKSNDSCLAPDNQTVKNGKGDSGSVKNKIKKTESEDCNDKKSVKRGGRFL 572
            K  + K +   ++++  A D  T+      + +         SE       V      L
Sbjct:  65 EKGPEKKREKTTTSNTTTATDISTITTATTTNTAQVVSGNGLISESLRMSLMVLGEEMML 124

Query: 573 DLGFKVPVMKRSVVEKKRVWMKLGEEERAAVLLMALS 683
               +  V+K+   ++KR   KL EEE+AA  LMALS
Sbjct: 125 Q---RPSVVKKQRCQRKR---KLREEEQAAFSLMALS 155

>gi|255556286|ref|XP_002519177.1| GATA transcription factor, putative [Ricinus
        communis]

          Length = 149

 Score =  91 bits (225), Expect = 1e-016
 Identities = 48/111 (43%), Positives = 65/111 (58%), Gaps = 6/111 (5%)
 Frame = +3

Query: 252 GGGDTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQ----EDNKMK 419
           G    KK+C DCGT KTPLWRGGPAGPKSLCNACGI+SRKK++ +LG+ +     D K +
Sbjct:  21 GENQQKKSCADCGTTKTPLWRGGPAGPKSLCNACGIRSRKKKRDSLGLNRASSNPDKKSR 80

Query: 420 NNKSNDSCLAPDNQTVKNGKGDSGSVKNKIKKTESEDCNDKKSVKRGGRFL 572
            + S++      N    N  GD   +K ++     E    + SV++  R L
Sbjct:  81 KHSSSNGSSNNHNSNNSNRLGD--GLKQRLLALGREVLMQRSSVEKQRRKL 129

>gi|297829216|ref|XP_002882490.1| hypothetical protein ARALYDRAFT_477989
        [Arabidopsis lyrata subsp. lyrata]

          Length = 137

 Score =  91 bits (225), Expect = 1e-016
 Identities = 40/69 (57%), Positives = 49/69 (71%)
 Frame = +3

Query: 228 SGSGGGGGGGGDTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQED 407
           S S     G  + KK+C  CGT+KTPLWRGGPAGPKSLCNACGI++RKKR+  +  + ED
Sbjct:  15 SSSSSSNEGISNEKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTLISNRSED 74

Query: 408 NKMKNNKSN 434
            K KN+  N
Sbjct:  75 KKNKNHNRN 83

>gi|224130312|ref|XP_002328578.1| predicted protein [Populus trichocarpa]

          Length = 125

 Score =  89 bits (218), Expect = 9e-016
 Identities = 41/74 (55%), Positives = 54/74 (72%), Gaps = 2/74 (2%)
 Frame = +3

Query: 267 KKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQ--EDNKMKNNKSNDS 440
           KKTC DCGT+KTPLWRGGPAGPKSLCNACGI+SRKK++  LG+ +   ++K     SN++
Sbjct:  13 KKTCADCGTSKTPLWRGGPAGPKSLCNACGIRSRKKKRDILGLNKGAANDKRAKKGSNNN 72

Query: 441 CLAPDNQTVKNGKG 482
             + +N   + G G
Sbjct:  73 GSSNNNNNKQLGDG 86

>gi|18397703|ref|NP_566290.1| GATA transcription factor 15 [Arabidopsis
        thaliana]

          Length = 149

 Score =  87 bits (213), Expect = 3e-015
 Identities = 36/56 (64%), Positives = 45/56 (80%)
 Frame = +3

Query: 267 KKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSN 434
           KK+C  CGT+KTPLWRGGPAGPKSLCNACGI++RKKR+  +  + ED K K++  N
Sbjct:  40 KKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTLISNRSEDKKKKSHNRN 95

>gi|21536761|gb|AAM61093.1| unknown [Arabidopsis thaliana]

          Length = 136

 Score =  87 bits (213), Expect = 3e-015
 Identities = 36/56 (64%), Positives = 45/56 (80%)
 Frame = +3

Query: 267 KKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSN 434
           KK+C  CGT+KTPLWRGGPAGPKSLCNACGI++RKKR+  +  + ED K K++  N
Sbjct:  27 KKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTLISNRSEDKKKKSHNRN 82

>gi|7549639|gb|AAF63824.1| hypothetical protein [Arabidopsis thaliana]

          Length = 136

 Score =  87 bits (213), Expect = 3e-015
 Identities = 36/56 (64%), Positives = 45/56 (80%)
 Frame = +3

Query: 267 KKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSN 434
           KK+C  CGT+KTPLWRGGPAGPKSLCNACGI++RKKR+  +  + ED K K++  N
Sbjct:  27 KKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRTLISNRSEDKKKKSHNRN 82

>gi|224110254|ref|XP_002315462.1| predicted protein [Populus trichocarpa]

          Length = 125

 Score =  87 bits (213), Expect = 3e-015
 Identities = 38/62 (61%), Positives = 47/62 (75%), Gaps = 5/62 (8%)
 Frame = +3

Query: 267 KKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQ-----EDNKMKNNKS 431
           KKTC DCGT+KTPLWRGGPAGPKSLCNACGI+SRKK++  LG+ +      D + K   +
Sbjct:  13 KKTCADCGTSKTPLWRGGPAGPKSLCNACGIRSRKKKRDILGLNKGGAAANDKRAKKGST 72

Query: 432 ND 437
           N+
Sbjct:  73 NN 74

>gi|15239847|ref|NP_199741.1| GATA transcription factor 16 [Arabidopsis
        thaliana]

          Length = 139

 Score =  86 bits (212), Expect = 5e-015
 Identities = 42/95 (44%), Positives = 56/95 (58%), Gaps = 4/95 (4%)
 Frame = +3

Query: 261 DTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSNDS 440
           D KKTC DCGT+KTPLWRGGP GPKSLCNACGI++RKKR+       EDNK     S+  
Sbjct:  33 DKKKTCADCGTSKTPLWRGGPVGPKSLCNACGIRNRKKRRGG----TEDNKKLKKSSSGG 88

Query: 441 CLAPDNQTVKNGKGDSGSVKNKIKKTESEDCNDKK 545
                 +++K    D G  K    + + +   +++
Sbjct:  89 GNRKFGESLKQSLMDLGIRKRSTVEKQRQKLGEEE 123

>gi|297795681|ref|XP_002865725.1| hypothetical protein ARALYDRAFT_917909
        [Arabidopsis lyrata subsp. lyrata]

          Length = 111

 Score =  85 bits (209), Expect = 1e-014
 Identities = 42/78 (53%), Positives = 50/78 (64%), Gaps = 5/78 (6%)
 Frame = +3

Query: 267 KKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSNDSCL 446
           KKTC DCGT+KTPLWRGGPAGPKSLCNACGI++RKKR+       EDNK     S+    
Sbjct:   8 KKTCADCGTSKTPLWRGGPAGPKSLCNACGIRNRKKRRGT-----EDNKKLKKSSSGGGN 62

Query: 447 APDNQTVKNGKGDSGSVK 500
               +++K    D G  K
Sbjct:  63 PKLGESLKQRLMDFGITK 80

>gi|147814791|emb|CAN74414.1| hypothetical protein VITISV_042395 [Vitis
        vinifera]

          Length = 125

 Score =  82 bits (202), Expect = 7e-014
 Identities = 43/97 (44%), Positives = 61/97 (62%), Gaps = 10/97 (10%)
 Frame = +3

Query: 261 DTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSND- 437
           + KK C DC T KTPLWRGGPAGPKSLCNACGI+ RK+R + +G+ ++  +M N+ S+D 
Sbjct:  16 EIKKCCTDCKTTKTPLWRGGPAGPKSLCNACGIRYRKRRSSMVGVNKKKERM-NSGSHDL 74

Query: 438 ------SCLAPDNQTVKNGKGDSGSVKNKIKKTESED 530
                 S +A  N+ +   +    SVK + +K   E+
Sbjct:  75 SETLKQSLMALGNEVMM--QRQRSSVKKQRRKLGEEE 109

>gi|225450647|ref|XP_002278369.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 124

 Score =  82 bits (202), Expect = 7e-014
 Identities = 43/97 (44%), Positives = 61/97 (62%), Gaps = 10/97 (10%)
 Frame = +3

Query: 261 DTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSND- 437
           + KK C DC T KTPLWRGGPAGPKSLCNACGI+ RK+R + +G+ ++  +M N+ S+D 
Sbjct:  15 EIKKCCTDCKTTKTPLWRGGPAGPKSLCNACGIRYRKRRSSMVGVNKKKERM-NSGSHDL 73

Query: 438 ------SCLAPDNQTVKNGKGDSGSVKNKIKKTESED 530
                 S +A  N+ +   +    SVK + +K   E+
Sbjct:  74 SETLKQSLMALGNEVMM--QRQRSSVKKQRRKLGEEE 108

>gi|296089747|emb|CBI39566.3| unnamed protein product [Vitis vinifera]

          Length = 109

 Score =  81 bits (198), Expect = 2e-013
 Identities = 32/59 (54%), Positives = 43/59 (72%)
 Frame = +3

Query: 261 DTKKTCVDCGTNKTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIKQEDNKMKNNKSND 437
           + KK C DC T KTPLWRGGPAGPKSLCNACGI+ RK+R + +G+ ++  +M +    +
Sbjct:  16 EIKKCCTDCKTTKTPLWRGGPAGPKSLCNACGIRYRKRRSSMVGVNKKKERMNSETEEE 74

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,129,805,850,752
Number of Sequences: 15229318
Number of Extensions: 5129805850752
Number of Successful Extensions: 1187520479
Number of sequences better than 0.0: 0