
GenBank BLAST output of UN81806


BLASTX 7.6.2

Query= UN81806 /QuerySize=827
        (826 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|2916772|emb|CAA11837.1| AT-hook protein 2 [Arabidopsis thaliana]    322   7e-086
gi|18414996|ref|NP_567546.1| AT hook motif DNA-binding family pr...    322   7e-086
gi|2894604|emb|CAA17138.1| putative protein [Arabidopsis thaliana]     299   3e-079
gi|297800288|ref|XP_002868028.1| hypothetical protein ARALYDRAFT...    292   6e-077
gi|297794575|ref|XP_002865172.1| hypothetical protein ARALYDRAFT...    168   1e-039
gi|15237481|ref|NP_199476.1| AT hook motif DNA-binding family pr...    144   1e-032
gi|255557601|ref|XP_002519830.1| DNA binding protein, putative [...    120   3e-025
gi|224126489|ref|XP_002329567.1| predicted protein [Populus tric...    119   8e-025
gi|224138096|ref|XP_002326517.1| predicted protein [Populus tric...     88   2e-015
gi|297304594|ref|XP_001094961.2| PREDICTED: hypothetical protein...     60   5e-007
gi|153791910|ref|NP_001093392.1| UDP-N-acetylglucosamine transfe...     59   6e-007
gi|221044226|dbj|BAH13790.1| unnamed protein product [Homo sapiens]     59   6e-007
gi|221044312|dbj|BAH13833.1| unnamed protein product [Homo sapiens]     59   6e-007
gi|332226151|ref|XP_003262252.1| PREDICTED: UDP-N-acetylglucosam...     59   6e-007
gi|332226159|ref|XP_003262256.1| PREDICTED: UDP-N-acetylglucosam...     59   6e-007

>gi|2916772|emb|CAA11837.1| AT-hook protein 2 [Arabidopsis thaliana]

          Length = 439

 Score =  322 bits (823), Expect = 7e-086
 Identities = 181/251 (72%), Positives = 189/251 (75%), Gaps = 37/251 (14%)
 Frame = +3

Query: 123 MDSRELHQHQQQQQQHQQQQQQ-----QLQLQPPPGFLM---GSYNRNPNAAAAAAAAAL 278
           MDSRE+H  QQQQQQ QQQQQQ     Q Q QPPPG LM    SYNRNPN    AAAA L
Sbjct:   1 MDSREIHHQQQQQQQQQQQQQQQQQHLQQQQQPPPGMLMSHHNSYNRNPN----AAAAVL 56

Query: 279 MG-PTSTSQAMHHRLPF-GALSPHQPQQHHQQQHPHHHQPQPQHQIDQKTLESLGFEGSP 452
           MG  TSTSQAMH RLPF G++SPHQPQQH       +H PQPQ QIDQKTLESLGF+GSP
Sbjct:  57 MGHNTSTSQAMHQRLPFGGSMSPHQPQQH------QYHHPQPQQQIDQKTLESLGFDGSP 110

Query: 453 SSVAAAAATTQQQQPMRFGIEQQQAKKKRGRPRKYAPD----GGGGNNIGLALAPTSP-- 614
           SSVAA      QQ  MRFGI+ QQ KKKRGRPRKYA D    GGGG+NI L LAPTSP  
Sbjct: 111 SSVAAT-----QQHSMRFGIDHQQVKKKRGRPRKYAADGGGGGGGGSNIALGLAPTSPLP 165

Query: 615 -ASNSYGGGTEGGGGGGDSGGGGGGGNANSSDPPAKRNRGRPPGSGKKQLDALGGTGGVG 791
            ASNSYGGG EGGGGG  +     G NANSSDPPAKRNRGRPPGSGKKQLDALGGTGGVG
Sbjct: 166 SASNSYGGGNEGGGGGDSA-----GANANSSDPPAKRNRGRPPGSGKKQLDALGGTGGVG 220

Query: 792 FTPHVIEVKTG 824
           FTPHVIEVKTG
Sbjct: 221 FTPHVIEVKTG 231

>gi|18414996|ref|NP_567546.1| AT hook motif DNA-binding family protein
        [Arabidopsis thaliana]

          Length = 439

 Score =  322 bits (823), Expect = 7e-086
 Identities = 181/251 (72%), Positives = 189/251 (75%), Gaps = 37/251 (14%)
 Frame = +3

Query: 123 MDSRELHQHQQQQQQHQQQQQQ-----QLQLQPPPGFLM---GSYNRNPNAAAAAAAAAL 278
           MDSRE+H  QQQQQQ QQQQQQ     Q Q QPPPG LM    SYNRNPN    AAAA L
Sbjct:   1 MDSREIHHQQQQQQQQQQQQQQQQQHLQQQQQPPPGMLMSHHNSYNRNPN----AAAAVL 56

Query: 279 MG-PTSTSQAMHHRLPF-GALSPHQPQQHHQQQHPHHHQPQPQHQIDQKTLESLGFEGSP 452
           MG  TSTSQAMH RLPF G++SPHQPQQH       +H PQPQ QIDQKTLESLGF+GSP
Sbjct:  57 MGHNTSTSQAMHQRLPFGGSMSPHQPQQH------QYHHPQPQQQIDQKTLESLGFDGSP 110

Query: 453 SSVAAAAATTQQQQPMRFGIEQQQAKKKRGRPRKYAPD----GGGGNNIGLALAPTSP-- 614
           SSVAA      QQ  MRFGI+ QQ KKKRGRPRKYA D    GGGG+NI L LAPTSP  
Sbjct: 111 SSVAAT-----QQHSMRFGIDHQQVKKKRGRPRKYAADGGGGGGGGSNIALGLAPTSPLP 165

Query: 615 -ASNSYGGGTEGGGGGGDSGGGGGGGNANSSDPPAKRNRGRPPGSGKKQLDALGGTGGVG 791
            ASNSYGGG EGGGGG  +     G NANSSDPPAKRNRGRPPGSGKKQLDALGGTGGVG
Sbjct: 166 SASNSYGGGNEGGGGGDSA-----GANANSSDPPAKRNRGRPPGSGKKQLDALGGTGGVG 220

Query: 792 FTPHVIEVKTG 824
           FTPHVIEVKTG
Sbjct: 221 FTPHVIEVKTG 231

>gi|2894604|emb|CAA17138.1| putative protein [Arabidopsis thaliana]

          Length = 455

 Score =  299 bits (765), Expect = 3e-079
 Identities = 173/251 (68%), Positives = 181/251 (72%), Gaps = 45/251 (17%)
 Frame = +3

Query: 123 MDSRELHQHQQQQQQHQQQQQQ-----QLQLQPPPGFLM---GSYNRNPNAAAAAAAAAL 278
           MDSRE+H  QQQQQQ QQQQQQ     Q Q QPPPG LM    SYNRNPN    AAAA L
Sbjct:   1 MDSREIHHQQQQQQQQQQQQQQQQQHLQQQQQPPPGMLMSHHNSYNRNPN----AAAAVL 56

Query: 279 MG-PTSTSQAMHHRLPF-GALSPHQPQQHHQQQHPHHHQPQPQHQIDQKTLESLGFEGSP 452
           MG  TSTSQAMH RLPF G++SPHQPQQH       +H PQPQ QIDQKTLESLGF+GSP
Sbjct:  57 MGHNTSTSQAMHQRLPFGGSMSPHQPQQH------QYHHPQPQQQIDQKTLESLGFDGSP 110

Query: 453 SSVAAAAATTQQQQPMRFGIEQQQAKKKRGRPRKYAPD----GGGGNNIGLALAPTSP-- 614
           SSVAA      QQ  MRFGI+ QQ KKKRGRPRKYA D    GGGG+NI L LAPTSP  
Sbjct: 111 SSVAAT-----QQHSMRFGIDHQQVKKKRGRPRKYAADGGGGGGGGSNIALGLAPTSPLP 165

Query: 615 -ASNSYGGGTEGGGGGGDSGGGGGGGNANSSDPPAKRNRGRPPGSGKKQLDALGGTGGVG 791
            ASNSYGGG EGGGGG  +     G NANSSDPPAKRNRGRPPGS        GGTGGVG
Sbjct: 166 SASNSYGGGNEGGGGGDSA-----GANANSSDPPAKRNRGRPPGS--------GGTGGVG 212

Query: 792 FTPHVIEVKTG 824
           FTPHVIEVKTG
Sbjct: 213 FTPHVIEVKTG 223

>gi|297800288|ref|XP_002868028.1| hypothetical protein ARALYDRAFT_914905
        [Arabidopsis lyrata subsp. lyrata]

          Length = 404

 Score =  292 bits (746), Expect = 6e-077
 Identities = 158/202 (78%), Positives = 164/202 (81%), Gaps = 22/202 (10%)
 Frame = +3

Query: 231 YNRNPNAAAAAAAAALMG-PTSTSQAMHHRLPFGALSPHQPQQHHQQQHPHHHQPQPQHQ 407
           YNRNPN   AAAAA LMG  TSTSQAMH RLPFG++SPHQPQQH       +H PQPQ Q
Sbjct:   9 YNRNPN---AAAAAVLMGHNTSTSQAMHQRLPFGSMSPHQPQQH------QYHHPQPQQQ 59

Query: 408 IDQKTLESLGFEGSPSSVAAAAATTQQQQPMRFGIEQQQAKKKRGRPRKYAPDGGGGNNI 587
           IDQKTLESLGF+GSPSSVAA    T QQQ MRFGI+ QQ KKKRGRPRKYA D GGG+NI
Sbjct:  60 IDQKTLESLGFDGSPSSVAA----TTQQQSMRFGIDHQQVKKKRGRPRKYAAD-GGGSNI 114

Query: 588 GLALAPTSP---ASNSYGGGTEGGGGGGDSGGGGGGGNANSSDPPAKRNRGRPPGSGKKQ 758
            L LAPTSP   ASNSYGGG EGGG GGDS    GG NANSSDPPAKRNRGRPPGSGKKQ
Sbjct: 115 ALGLAPTSPLPTASNSYGGGNEGGGTGGDS----GGANANSSDPPAKRNRGRPPGSGKKQ 170

Query: 759 LDALGGTGGVGFTPHVIEVKTG 824
           LDALGGTGGVGFTPHVIEVKTG
Sbjct: 171 LDALGGTGGVGFTPHVIEVKTG 192

>gi|297794575|ref|XP_002865172.1| hypothetical protein ARALYDRAFT_494313
        [Arabidopsis lyrata subsp. lyrata]

          Length = 391

 Score =  168 bits (424), Expect = 1e-039
 Identities = 105/192 (54%), Positives = 112/192 (58%), Gaps = 36/192 (18%)
 Frame = +3

Query: 309 HHRLPFGALSPHQPQQHHQQQHPHHH----------QPQPQHQIDQKTLESLGFEGSPSS 458
           H+R P  A +         Q   HHH          Q Q  HQ  Q   ++L   G    
Sbjct:  23 HYRNPNAAAAALMVPTSTSQSIQHHHRLPFSNQQQQQSQTFHQQQQMDQKTLESLGFGDG 82

Query: 459 VAAAAATTQQQQPMRFGIEQQ------QAKKKRGRPRKYAPDGGGGNNIGLALAPTSP-- 614
                  +   QPMRFGIE Q      Q KKKRGRPRKY PDG    +I L LAPTSP  
Sbjct:  83 -------SPSSQPMRFGIEDQNQNQQLQVKKKRGRPRKYTPDG----SIALGLAPTSPLL 131

Query: 615 --ASNSYGGGTEGGGGGGDSGGGGGGGNANSSDPPAKRNRGRPPGSGKKQLDALGGTGGV 788
             ASNSYGG   G GG GDS  GGGGGN NS+DPPAKRNRGRPPGS KKQLDALGGT GV
Sbjct: 132 SAASNSYGG---GDGGVGDS--GGGGGNGNSADPPAKRNRGRPPGSSKKQLDALGGTAGV 186

Query: 789 GFTPHVIEVKTG 824
           GFTPHVIEVKTG
Sbjct: 187 GFTPHVIEVKTG 198

>gi|15237481|ref|NP_199476.1| AT hook motif DNA-binding family protein
        [Arabidopsis thaliana]

          Length = 386

 Score =  144 bits (363), Expect = 1e-032
 Identities = 80/119 (67%), Positives = 82/119 (68%), Gaps = 22/119 (18%)
 Frame = +3

Query: 492 QPMRFGIEQQ----QAKKKRGRPRKYAPDGGGGNNIGLALAPTSP----ASNSYGGGTEG 647
           QPMRFGI+ Q    Q KKKRGRPRKY PDG    +I L LAPTSP    ASNSYG G   
Sbjct:  86 QPMRFGIDDQNQQLQVKKKRGRPRKYTPDG----SIALGLAPTSPLLSAASNSYGEG--- 138

Query: 648 GGGGGDSGGGGGGGNANSSDPPAKRNRGRPPGSGKKQLDALGGTGGVGFTPHVIEVKTG 824
                  G G  GGN NS DPP KRNRGRPPGS KKQLDALGGT GVGFTPHVIEV TG
Sbjct: 139 -------GVGDSGGNGNSVDPPVKRNRGRPPGSSKKQLDALGGTSGVGFTPHVIEVNTG 190

>gi|255557601|ref|XP_002519830.1| DNA binding protein, putative [Ricinus
        communis]

          Length = 376

 Score =  120 bits (300), Expect = 3e-025
 Identities = 68/109 (62%), Positives = 74/109 (67%), Gaps = 7/109 (6%)
 Frame = +3

Query: 498 MRFGIEQQQAKKKRGRPRKYAPDGGGGNNIGLALAPTSPASNSYGGGTEGGGGGGDSGGG 677
           MRF ++   AKKKRGRPRKY PDG    NI L L+PT P S+S           G   G 
Sbjct:  86 MRFSMD--PAKKKRGRPRKYTPDG----NIALGLSPT-PISSSATSLPPHVADSGSGVGV 138

Query: 678 GGGGNANSSDPPAKRNRGRPPGSGKKQLDALGGTGGVGFTPHVIEVKTG 824
           G G  A +SDPP+KRNRGRPPGSGKKQLDALGG GGVGFTPHVI VK G
Sbjct: 139 GIGTPAIASDPPSKRNRGRPPGSGKKQLDALGGVGGVGFTPHVITVKAG 187

>gi|224126489|ref|XP_002329567.1| predicted protein [Populus trichocarpa]

          Length = 375

 Score =  119 bits (296), Expect = 8e-025
 Identities = 68/109 (62%), Positives = 74/109 (67%), Gaps = 14/109 (12%)
 Frame = +3

Query: 498 MRFGIEQQQAKKKRGRPRKYAPDGGGGNNIGLALAPTSPASNSYGGGTEGGGGGGDSGGG 677
           MRF IE   AKKKRGRPRKY PDG    NI L L+PT   S           G  DSGGG
Sbjct:  93 MRFSIE--PAKKKRGRPRKYTPDG----NIALGLSPTPVPSGI-------SAGHADSGGG 139

Query: 678 GGGGNANSSDPPAKRNRGRPPGSGKKQLDALGGTGGVGFTPHVIEVKTG 824
           G   +A +S+ P+K+NRGRPPGSGKKQLDALGG GGVGFTPHVI VK G
Sbjct: 140 GVTHDA-ASEHPSKKNRGRPPGSGKKQLDALGGVGGVGFTPHVITVKAG 187

>gi|224138096|ref|XP_002326517.1| predicted protein [Populus trichocarpa]

          Length = 286

 Score =  88 bits (216), Expect = 2e-015
 Identities = 48/83 (57%), Positives = 54/83 (65%), Gaps = 9/83 (10%)
 Frame = +3

Query: 582 NIGLALAPTSPASNSYGGGTEGGGGGGDSGGGGGGGNAN--SSDPPAKRNRGRPPGSGKK 755
           NI L L+PT   S           G  DS GG G G     +S+ P+K++RGRPPGSGKK
Sbjct:  23 NIALGLSPTPIHSGM-------SAGQADSSGGAGSGVMPDVASEHPSKKHRGRPPGSGKK 75

Query: 756 QLDALGGTGGVGFTPHVIEVKTG 824
           QLDALGGTGGVGFTPHVI VK G
Sbjct:  76 QLDALGGTGGVGFTPHVITVKAG 98

>gi|297304594|ref|XP_001094961.2| PREDICTED: hypothetical protein LOC706590
        [Macaca mulatta]

          Length = 1147

 Score =  60 bits (143), Expect = 5e-007
 Identities = 27/50 (54%), Positives = 29/50 (58%)
 Frame = -1

Query: 715 AGGSEEFAFPPPPPPPLSPPPPPPSVPPPYELEAGDVGARARPMLLPPPP 566
           AG S     PPPPPPP  PPPPPP  PPP  L+ G+      P  LPPPP
Sbjct: 926 AGASLPPPPPPPPPPPPPPPPPPPPPPPPPPLDVGEASNLQPPPPLPPPP 975

>gi|153791910|ref|NP_001093392.1| UDP-N-acetylglucosamine transferase subunit
        ALG13 homolog isoform 1 [Homo sapiens]

          Length = 1137

 Score =  59 bits (142), Expect = 6e-007
 Identities = 24/41 (58%), Positives = 26/41 (63%)
 Frame = -1

Query: 688 PPPPPPPLSPPPPPPSVPPPYELEAGDVGARARPMLLPPPP 566
           PPPPPPP  PPPPPP  PPP  L+ G+      P  LPPPP
Sbjct: 925 PPPPPPPPPPPPPPPPPPPPPALDVGETSNLQPPPPLPPPP 965

>gi|221044226|dbj|BAH13790.1| unnamed protein product [Homo sapiens]

          Length = 1137

 Score =  59 bits (142), Expect = 6e-007
 Identities = 24/41 (58%), Positives = 26/41 (63%)
 Frame = -1

Query: 688 PPPPPPPLSPPPPPPSVPPPYELEAGDVGARARPMLLPPPP 566
           PPPPPPP  PPPPPP  PPP  L+ G+      P  LPPPP
Sbjct: 925 PPPPPPPPPPPPPPPPPPPPPALDVGETSNLQPPPPLPPPP 965

>gi|221044312|dbj|BAH13833.1| unnamed protein product [Homo sapiens]

          Length = 1059

 Score =  59 bits (142), Expect = 6e-007
 Identities = 24/41 (58%), Positives = 26/41 (63%)
 Frame = -1

Query: 688 PPPPPPPLSPPPPPPSVPPPYELEAGDVGARARPMLLPPPP 566
           PPPPPPP  PPPPPP  PPP  L+ G+      P  LPPPP
Sbjct: 847 PPPPPPPPPPPPPPPPPPPPPALDVGETSNLQPPPPLPPPP 887

>gi|332226151|ref|XP_003262252.1| PREDICTED: UDP-N-acetylglucosamine transferase
        subunit ALG13 homolog isoform 1 [Nomascus leucogenys]

          Length = 1140

 Score =  59 bits (142), Expect = 6e-007
 Identities = 24/41 (58%), Positives = 26/41 (63%)
 Frame = -1

Query: 688 PPPPPPPLSPPPPPPSVPPPYELEAGDVGARARPMLLPPPP 566
           PPPPPPP  PPPPPP  PPP  L+ G+      P  LPPPP
Sbjct: 928 PPPPPPPPPPPPPPPPPPPPPALDVGEASNLQPPPPLPPPP 968

>gi|332226159|ref|XP_003262256.1| PREDICTED: UDP-N-acetylglucosamine transferase
        subunit ALG13 homolog isoform 5 [Nomascus leucogenys]

          Length = 1062

 Score =  59 bits (142), Expect = 6e-007
 Identities = 24/41 (58%), Positives = 26/41 (63%)
 Frame = -1

Query: 688 PPPPPPPLSPPPPPPSVPPPYELEAGDVGARARPMLLPPPP 566
           PPPPPPP  PPPPPP  PPP  L+ G+      P  LPPPP
Sbjct: 850 PPPPPPPPPPPPPPPPPPPPPALDVGEASNLQPPPPLPPPP 890

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,222,279,010,256
Number of Sequences: 15229318
Number of Extensions: 1222279010256
Number of Successful Extensions: 337045799
Number of sequences better than 0.0: 0
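
Note on the scores: each alignment header gives a bit score followed by a raw score in parentheses, for example "322 bits (823)". A minimal sketch of how the two relate, assuming the standard Karlin-Altschul normalization S' = (Lambda * S - ln K) / ln 2 and the gapped Lambda and K values printed in the footer above. The function name bit_score is illustrative and not part of any BLAST tool, and because Lambda and K are rounded in the report, agreement is only expected after rounding to whole bits.

```python
import math

# Gapped Karlin-Altschul parameters as printed in the report footer
# (rounded values, so results are approximate).
LAMBDA = 0.267
K = 0.041


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score S to a normalized bit score.

    Uses S' = (lambda * S - ln K) / ln 2.
    """
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores taken from the alignment headers in this report,
# paired with the bit scores the report prints for them.
reported = {823: 322, 765: 299, 746: 292, 424: 168, 143: 60, 142: 59}

for raw, bits in reported.items():
    print(f"raw={raw:4d}  computed={bit_score(raw):7.2f}  reported={bits}")
```

Under these assumptions the computed values round to the bit scores shown in the hit list, which is a quick consistency check on the footer parameters.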