Library    |     Search    |     Batch query    |     SNP    |     SSR  

GenBank blast output of UN38328


BLASTX 7.6.2

Query= UN38328 /QuerySize=1422
        (1421 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|22330420|ref|NP_683468.1| uncharacterized protein [Arabidopsi...    401   2e-109
gi|297840019|ref|XP_002887891.1| hypothetical protein ARALYDRAFT...    388   1e-105
gi|225426358|ref|XP_002270995.1| PREDICTED: hypothetical protein...    202   2e-049
gi|224058306|ref|XP_002299479.1| predicted protein [Populus tric...    141   3e-031
gi|224072188|ref|XP_002303644.1| predicted protein [Populus tric...    132   2e-028
gi|255537773|ref|XP_002509953.1| conserved hypothetical protein ...     79   2e-012

>gi|22330420|ref|NP_683468.1| uncharacterized protein [Arabidopsis thaliana]

          Length = 351

 Score =  401 bits (1030), Expect = 2e-109
 Identities = 241/368 (65%), Positives = 277/368 (75%), Gaps = 37/368 (10%)
 Frame = -3

Query: 1260 MDANTFVLLLCIALLL--GTKVEGEAQIV---SNLTDTSV---PQIVTDSSKSS----DH 1117
            M+ N  VLLLCIAL+    TKV+GEAQ+V   SNLTDT      + VTDSS  S    DH
Sbjct:    1 MEVN-LVLLLCIALIFVADTKVDGEAQVVVSNSNLTDTRFGGGSENVTDSSSKSIITIDH 59

Query: 1116 STNTTTTN--QLGASSDGSETKPIVDDSTSQKNIGGGGGSDESESSATTSSNSSSSRKKK 943
            S N+T  +  QLG   DGS+    +  S S K+  G   SDES+     + + +SSRKK+
Sbjct:   60 SKNSTNDDDTQLG---DGSK----MIGSDSSKSDQGKIASDESDKEEEEAVSKNSSRKKQ 112

Query:  942 --EGEDCDPSYMCSDDQHLFLACLRVPGDDDDAPHLSLLIKNKAKTALLLTITAPSFVRL 769
               GE+CDPS MC DD+H F ACLRVPG  +DAPHLSLLI+NK K AL++TITAP FVRL
Sbjct:  113 GFHGEECDPSNMCIDDEHEFSACLRVPG--NDAPHLSLLIQNKGKRALIVTITAPVFVRL 170

Query:  768 ETNQVQLPESQDTKVKVSIKKGGSNDSAITLTSSNGGHCSLELKDLAGASSHETGKDSTV 589
            E ++VQL +++D KVKVSIKKGGSNDSAI L SS  G C LELKDLA A++HET  D TV
Sbjct:  171 EKDKVQLLQNEDIKVKVSIKKGGSNDSAIVLASSK-GRCRLELKDLA-AAAHETESDDTV 228

Query:  588 AVSRPSILNISSRTLIVIAMISFLVLSLVIIPVIYHVYRSKSQAKSKYQRLDNMELPVSS 409
            +VSRPSILNISSRTLIVI MISFLVLSLVIIPVI HVY++KS+  +KYQRLD MELPVS+
Sbjct:  229 SVSRPSILNISSRTLIVIIMISFLVLSLVIIPVIIHVYKNKSRGNNKYQRLD-MELPVSN 287

Query:  408 TAAAAALVSKSDQESGDDGWNNNWGDDW----GDGDEEQPNTPVLPLTPSVSSRGLAPRR 241
                 ALV+KSDQESGDDGWNNNWGDDW    G GDEEQPNTPVLPLTPS+SSRGLAPRR
Sbjct:  288 ----PALVTKSDQESGDDGWNNNWGDDWDDENGGGDEEQPNTPVLPLTPSLSSRGLAPRR 343

Query:  240 LSKEGWKD 217
            LSKEGWKD
Sbjct:  344 LSKEGWKD 351

>gi|297840019|ref|XP_002887891.1| hypothetical protein ARALYDRAFT_474911
        [Arabidopsis lyrata subsp. lyrata]

          Length = 342

 Score =  388 bits (996), Expect = 1e-105
 Identities = 238/361 (65%), Positives = 267/361 (73%), Gaps = 43/361 (11%)
 Frame = -3

Query: 1242 VLLLCIALLL--GTKVEGEAQIV----SNLTDTSV---PQIVTDSSKS--SDHSTNTTTT 1096
            VLLLCIAL+    TKV    +I     SNLTDT      + VTDSSKS   DHS N+T  
Sbjct:    6 VLLLCIALIFVADTKVYFTIEISSITNSNLTDTRFGGGSENVTDSSKSITIDHSKNSTND 65

Query: 1095 N--QLGASSDGSETKPIVDDSTSQKNIGGGGGSDESESSATTSSNSSSSRKKK--EGEDC 928
            +  QLG   DGS  K I  DS+          S ESE++    + S SSRKK+   GE+C
Sbjct:   66 DDTQLG---DGS--KMIGSDSSK---------SGESENTKEEDAMSDSSRKKEGFHGEEC 111

Query:  927 DPSYMCSDDQHLFLACLRVPGDDDDAPHLSLLIKNKAKTALLLTITAPSFVRLETNQVQL 748
            DPS MC+DDQH F ACLRVPG  +DAPHLSLLI+NK K  L++TITAP FVRLE ++VQL
Sbjct:  112 DPSNMCTDDQHEFAACLRVPG--NDAPHLSLLIQNKGKRPLIVTITAPGFVRLEKDKVQL 169

Query:  747 PESQDTKVKVSIKKGGSNDSAITLTSSNGGHCSLELKDLAGASSHETGKDSTVAVSRPSI 568
             +++DTKVKVSIKKGGSNDSAI L SS  G CSLELKDLA A  HET  D TV+VSRPSI
Sbjct:  170 LQNEDTKVKVSIKKGGSNDSAIVLASSK-GRCSLELKDLAAA--HETESDDTVSVSRPSI 226

Query:  567 LNISSRTLIVIAMISFLVLSLVIIPVIYHVYRSKSQAKSKYQRLDNMELPVSSTAAAAAL 388
            L ISSRTLIVI MISFLVLSLVIIPVI HVY++KS+  +KYQRLD MELPVS+     AL
Sbjct:  227 LYISSRTLIVIIMISFLVLSLVIIPVIIHVYKNKSRGNNKYQRLD-MELPVSN----PAL 281

Query:  387 VSKSDQESGDDGWNNNWGDDW----GDGDEEQPNTPVLPLTPSVSSRGLAPRRLSKEGWK 220
            V+KSDQESGDDGWNNNWGDDW    G GDEEQPNTPVLPLTPS+SSRGLAPRRLSKEGWK
Sbjct:  282 VTKSDQESGDDGWNNNWGDDWDDENGGGDEEQPNTPVLPLTPSLSSRGLAPRRLSKEGWK 341

Query:  219 D 217
            D
Sbjct:  342 D 342

>gi|225426358|ref|XP_002270995.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 381

 Score =  202 bits (512), Expect = 2e-049
 Identities = 127/316 (40%), Positives = 185/316 (58%), Gaps = 25/316 (7%)
 Frame = -3

Query: 1164 TSVPQIVTDSSKSSDHSTNTTTTNQLGASSDGSETKPIVDDSTSQKNIGGGGGSDESESS 985
            +S+ Q+  DS ++ +  T   + ++   +  G   K    D +  K     GG++     
Sbjct:   91 SSIKQL--DSKEADNEHTGKGSLSKELETEGGDNKKEKPGDGSKSKQASKEGGNE----- 143

Query:  984 ATTSSNSSSSRKKKEGEDCDPSYMCSDDQHLFLACLRVPGDDDDAPHLSLLIKNKAKTAL 805
                S+    ++  +GE+CDPS  C DD +  +ACLRVPG  +D+P LSLLI+NK KTAL
Sbjct:  144 GVLESSKPGKKESLQGEECDPSNQCVDDINKLVACLRVPG--NDSPDLSLLIQNKGKTAL 201

Query:  804 LLTITAPSFVRLETNQVQLPESQDTKVKVSIKKGGSNDSAITLTSSNGGHCSLELKDLAG 625
             +TI+AP FV+LE+ +++L E +D KVKVSI+ GGS D++I LT+   G CSL+ KDL  
Sbjct:  202 TVTISAPDFVKLESTKIELQEKEDKKVKVSIRNGGS-DNSIVLTAGK-GRCSLDFKDLI- 258

Query:  624 ASSHETGKDSTVAVSRPSILNISSRTLIVIAMISFLVLSLVIIPVIYHVYRSKSQAKSKY 445
            A   + G D+    +  + L  +S +L  + +++ +  +   I + +      S   SKY
Sbjct:  259 AQIAQKGTDNIPESTDGNFLTRTS-SLAFLFLVALVAAASAWICISFKRKYFPSSG-SKY 316

Query:  444 QRLDNMELPVSSTAAAAALVSKSDQESGDDGWNNNWGDDWGDGDEEQPNTPVLPLTPSVS 265
            Q+LD MELPVS      A +        +DGW+N+WGD W   DEE P TP +PLTPS+S
Sbjct:  317 QKLD-MELPVSGGGKVEADI--------NDGWDNSWGDTW--DDEEAPKTPSMPLTPSLS 365

Query:  264 SRGLAPRRLSKEGWKD 217
            +RGLA RRLSKEGWKD
Sbjct:  366 ARGLAARRLSKEGWKD 381

>gi|224058306|ref|XP_002299479.1| predicted protein [Populus trichocarpa]

          Length = 373

 Score =  141 bits (355), Expect = 3e-031
 Identities = 101/285 (35%), Positives = 149/285 (52%), Gaps = 23/285 (8%)
 Frame = -3

Query: 1125 SDHSTNTTTTNQLGASSDGSETKPIVDDSTSQKNIGGGGGSDESESSATTSSNSSSSRKK 946
            S    N       G SS+  + K    D   +K + GG   +ES+      ++   ++ +
Sbjct:   90 SGSKDNENAKEDKGNSSEEFQAKE--GDHNKKKGLSGG---EESKDFPEEKNDERDTQSR 144

Query:  945 KEG---EDCDPSYMCSDDQHLFLACLRVPGDDDDAPHLSLLIKNKAKTALLLTITAPSFV 775
            KEG   E+CDPS  C+D+++  +ACLRVPG  +++P LSLLI+NK K  L +TI+AP FV
Sbjct:  145 KEGPHVEECDPSNKCTDEENKLVACLRVPG--NESPDLSLLIQNKGKGPLNVTISAPDFV 202

Query:  774 RLETNQVQLPESQDTKVKVSIKKGGSNDSAITLTSSNGGHCSLELKDLAGASSHETGKDS 595
             LE  ++QL E  + KVKVSI  GGS ++ I LT+   G C L++KD     +H  GK+ 
Sbjct:  203 HLEKTKIQLQEKDNKKVKVSITGGGS-ENLIVLTAGK-GQCKLDIKD---TIAHYLGKEL 257

Query:  594 TVAVSRPSILNISSRTLIVIAMISFLVLSLVIIPVIYHVYRSK--SQAKSKYQRLDNMEL 421
              +     I+N  SRT   IA++SF  L ++    +   +R K  S    +YQRL+ MEL
Sbjct:  258 HKSHESADIINSMSRT-STIAVLSFAALLILASGWMCISFRRKHLSYNNPRYQRLE-MEL 315

Query:  420 PVSSTAAAAALVSKSDQESGDDGWNNNWGDDWGDGDEEQPNTPVL 286
            PVS              +  D+ W ++W D+        P TP L
Sbjct:  316 PVS----GGGKTESKTNDGWDNNWGDDWDDEEAPKTPSLPVTPSL 356


 Score =  82 bits (200), Expect = 3e-013
 Identities = 35/48 (72%), Positives = 41/48 (85%), Gaps = 2/48 (4%)
 Frame = -3

Query: 360 DDGWNNNWGDDWGDGDEEQPNTPVLPLTPSVSSRGLAPRRLSKEGWKD 217
           +DGW+NNWGDDW   DEE P TP LP+TPS+SS+GLA RRLSK+GWKD
Sbjct: 328 NDGWDNNWGDDW--DDEEAPKTPSLPVTPSLSSKGLASRRLSKDGWKD 373

>gi|224072188|ref|XP_002303644.1| predicted protein [Populus trichocarpa]

          Length = 288

 Score =  132 bits (330), Expect = 2e-028
 Identities = 90/253 (35%), Positives = 141/253 (55%), Gaps = 17/253 (6%)
 Frame = -3

Query: 1086 GASSDGSETKPIVDDSTSQKNIGGGGGSDESESSATTSSNSSSSRKKKEG---EDCDPSY 916
            G  +   E++    D + +++   G    ESE  +   ++   ++ +KEG   E+CD S 
Sbjct:   16 GKHNSSEESQAKKGDHSKKEDSSSG---VESEDLSKEKNDKGDTQSRKEGPRVEECDQSN 72

Query:  915 MCSDDQHLFLACLRVPGDDDDAPHLSLLIKNKAKTALLLTITAPSFVRLETNQVQLPESQ 736
             C+D+++  +ACLRVPG  +++P LSLLI+NK K +L +TI+AP FV LE  ++QL E +
Sbjct:   73 KCTDEENKLVACLRVPG--NESPDLSLLIQNKGKGSLSVTISAPDFVHLEKTKIQLKEKE 130

Query:  735 DTKVKVSIKKGGSNDSAITLTSSNGGHCSLELKDLAGASSHETGKDSTVAVSRPSILNIS 556
            D KVKVSI   GS ++ I L + N G C L++KD     +H  GK+   +     I+N  
Sbjct:  131 DKKVKVSITSRGS-ENLIVLRAGN-GQCKLDIKD---TIAHYFGKEFDKSHKSTDIINFM 185

Query:  555 SRTLIVIAMISFLVLSLVIIPVIYHVYRSK--SQAKSKYQRLDNMELPVSSTAAAAALVS 382
            SRT  ++ ++SF  L ++    +   +R K  S   SKYQRL+ MELPVS      +  +
Sbjct:  186 SRTSTIV-VLSFAALLILASGWMCISFRRKHPSNNTSKYQRLE-MELPVSGEGKTESETN 243

Query:  381 KSDQESGDDGWNN 343
                 S  D W++
Sbjct:  244 DGWDNSWGDDWDD 256


 Score =  77 bits (189), Expect = 5e-012
 Identities = 35/55 (63%), Positives = 44/55 (80%), Gaps = 3/55 (5%)
 Frame = -3

Query: 381 KSDQESGDDGWNNNWGDDWGDGDEEQPNTPVLPLTPSVSSRGLAPRRLSKEGWKD 217
           K++ E+ +DGW+N+WGDDW   DEE P  P LP+TPS+SS+GLA RRLSKE WKD
Sbjct: 237 KTESET-NDGWDNSWGDDW--DDEEAPKAPSLPVTPSLSSKGLASRRLSKEAWKD 288

>gi|255537773|ref|XP_002509953.1| conserved hypothetical protein [Ricinus
        communis]

          Length = 372

 Score =  79 bits (193), Expect = 2e-012
 Identities = 36/55 (65%), Positives = 44/55 (80%), Gaps = 3/55 (5%)
 Frame = -3

Query: 381 KSDQESGDDGWNNNWGDDWGDGDEEQPNTPVLPLTPSVSSRGLAPRRLSKEGWKD 217
           K++ E  +DGW++ WGDDW   DEE P TP LP+TPS+SS+GLA RRLSKEGWKD
Sbjct: 321 KAESEQ-NDGWDDKWGDDW--DDEEAPKTPSLPVTPSLSSKGLASRRLSKEGWKD 372

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,281,875,611,404
Number of Sequences: 15229318
Number of Extensions: 4281875611404
Number of Successful Extensions: 1013544274
Number of sequences better than 0.0: 0