Library    |     Search    |     Batch query    |     SNP    |     SSR  

GenBank blast output of UN04733


BLASTX 7.6.2

Query= UN04733 /QuerySize=1074
        (1073 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|297834374|ref|XP_002885069.1| hypothetical protein ARALYDRAFT...    441   1e-121
gi|18400638|ref|NP_566501.1| uncharacterized protein [Arabidopsi...    425   6e-117
gi|297847708|ref|XP_002891735.1| hypothetical protein ARALYDRAFT...    123   7e-026
gi|7769852|gb|AAF69530.1|AC008007_5 F12M16.8 [Arabidopsis thaliana]    113   7e-023
gi|15219218|ref|NP_175726.1| uncharacterized protein [Arabidopsi...    113   7e-023
gi|147776896|emb|CAN63558.1| hypothetical protein VITISV_034122 ...     58   3e-006
gi|225461562|ref|XP_002285236.1| PREDICTED: hypothetical protein...     58   3e-006
gi|302142946|emb|CBI20241.3| unnamed protein product [Vitis vini...     58   3e-006

>gi|297834374|ref|XP_002885069.1| hypothetical protein ARALYDRAFT_478943
        [Arabidopsis lyrata subsp. lyrata]

          Length = 334

 Score =  441 bits (1133), Expect = 1e-121
 Identities = 245/341 (71%), Positives = 264/341 (77%), Gaps = 25/341 (7%)
 Frame = +1

Query:   91 MDPTLNDHQELEQIEAIDDLLEDFWFFDNLLDRRSRILRYCHSDPYPLSP----PSSSTC 258
            MD T NDHQELEQ EAIDDLLEDFWFFDNLLDRRSRILRYCHSDPYP SP     SSSTC
Sbjct:    1 MDSTWNDHQELEQFEAIDDLLEDFWFFDNLLDRRSRILRYCHSDPYPFSPSSSSSSSSTC 60

Query:  259 PN---SKDGDSVSDKKLLKAPTGGDSVQLPCIVNKEGGSEPEKMNK-MRRQFSEKIRVQE 426
            P     K GDS S+ KLL+A TGG SV  PCI  KEGG EPEK+NK MRRQFSEK+RVQE
Sbjct:   61 PKPEIPKIGDSDSETKLLEASTGGGSVPPPCIEKKEGGGEPEKINKMMRRQFSEKVRVQE 120

Query:  427 RRTYLQKKEPVVREKGIRESSKKNRAGSTSSFCNNNS-----LQRTQTLPSYIGREDVGN 591
            RRTYLQKKEPVVREKGI+E S+KN+  STSS  NNNS     LQRTQTLPSYIGRE   N
Sbjct:  121 RRTYLQKKEPVVREKGIKEGSRKNKTSSTSSCSNNNSSMGGGLQRTQTLPSYIGREGDVN 180

Query:  592 EFQDQEIDDSRMGFLIREAIASSSSE--FTPTKQTHQRVQAFQDLNHRDTRDQRS---YS 756
            EFQDQEIDDSRMGFLIREAIASSSS    TPTK    ++ +     HR  R+ RS     
Sbjct:  181 EFQDQEIDDSRMGFLIREAIASSSSSSGLTPTKHNTPKISSIP--RHRPPRNSRSEEAIQ 238

Query:  757 EMVAKSQRSPRGKTLRKTLSSVDTKDLLMLKDLDITEPETNQAKDEEEQRRVPRAAVKSR 936
            E+V KSQRSP  KTLRKTLSS++TKD+ MLKDLDI      + K EEEQR VPRA  K+R
Sbjct:  239 ELVVKSQRSPNRKTLRKTLSSIETKDIQMLKDLDI----ELEKKQEEEQRSVPRATAKTR 294

Query:  937 SAAVVVGQPIPVWVPKESRRDMKAQIKFWARTVASNVRQEC 1059
            S A VVGQPIPVWVPK+SR+DMKAQIKFWARTVASNVRQEC
Sbjct:  295 STA-VVGQPIPVWVPKDSRKDMKAQIKFWARTVASNVRQEC 334

>gi|18400638|ref|NP_566501.1| uncharacterized protein [Arabidopsis thaliana]

          Length = 339

 Score =  425 bits (1092), Expect = 6e-117
 Identities = 238/345 (68%), Positives = 258/345 (74%), Gaps = 28/345 (8%)
 Frame = +1

Query:   91 MDPTLNDHQELEQIEAIDDLLEDFWFFDNLLDRRSRILRYCHSDPYPLSPPSSSTCPN-- 264
            MD T NDHQELEQ EAIDDLLEDFWFFDNLLDRRSRILRYCHSDPYP +  SSSTCP   
Sbjct:    1 MDSTSNDHQELEQFEAIDDLLEDFWFFDNLLDRRSRILRYCHSDPYPFTSSSSSTCPKPE 60

Query:  265 -SKDGDSVSDKKLLKAPTGGDSVQLPCIVNKEGGSEPEKMNK-MRRQFSEKIRVQERRTY 438
              K GDS S+ KLL+A TGGD V  PCI  KEGG EPEK+NK MRRQFSEK RVQERRTY
Sbjct:   61 LPKIGDSDSEIKLLEASTGGDFVPPPCIEKKEGGGEPEKINKVMRRQFSEKTRVQERRTY 120

Query:  439 LQKKEPVVREKGIRESS-KKNRAGSTSSFCNNN----------SLQRTQTLPSYIGREDV 585
            LQKKEPVVREKGI+E S KKNR   T   C+NN          SLQRTQTLPSY+GRED 
Sbjct:  121 LQKKEPVVREKGIKEGSRKKNR---TRISCSNNNSVQSCSMGGSLQRTQTLPSYLGREDD 177

Query:  586 GNEFQDQEIDDSRMGFLIREAIA----SSSSEFTPTKQTHQRVQAFQDLNHRDTRDQRS- 750
             NEFQDQEIDDSRMGFLIREAIA    SSSS FTPTKQ   +V       HR  R+ RS 
Sbjct:  178 VNEFQDQEIDDSRMGFLIREAIANSSSSSSSGFTPTKQNIPKVSCIP--RHRPPRNSRSE 235

Query:  751 --YSEMVAKSQRSPRGKTLRKTLSSVDTKDLLMLKDLDITEPETNQAKDEEEQRRVPRAA 924
                E+V KSQ+SP  KTLRKTLSS++TKD+ MLKD  I E E  Q +DEE+QR+VP   
Sbjct:  236 DAIQELVVKSQKSPNRKTLRKTLSSIETKDIQMLKDFHI-ETEKKQEEDEEKQRKVPCTT 294

Query:  925 VKSRSAAVVVGQPIPVWVPKESRRDMKAQIKFWARTVASNVRQEC 1059
                 +  VVGQPIPVWVPK+SR+DMKAQIKFWARTVASNVRQEC
Sbjct:  295 TGKNRSTAVVGQPIPVWVPKDSRKDMKAQIKFWARTVASNVRQEC 339

>gi|297847708|ref|XP_002891735.1| hypothetical protein ARALYDRAFT_474441
        [Arabidopsis lyrata subsp. lyrata]

          Length = 358

 Score =  123 bits (307), Expect = 7e-026
 Identities = 96/209 (45%), Positives = 127/209 (60%), Gaps = 25/209 (11%)
 Frame = +1

Query: 337 PCIVNKEGGSEPEKM-NKMRRQFSEKIRVQE----RRTYLQKKEPVVREKGIRESSKKNR 501
           P I  ++   E +KM NK+ RQFSEKIRV E       +LQKKE +VR+KGI ESS++N+
Sbjct: 143 PQIEKRDMDREAKKMINKLTRQFSEKIRVLEPTRPGEHFLQKKETIVRDKGISESSRRNK 202

Query: 502 AGSTSSFCNNN--SLQRTQTLPSYIGREDVG--NEFQDQEIDDSRMGFLIREAIASSSSE 669
            GS+SS  ++   SLQRTQT+P+ I RE+    +EF+DQE  DSRMGFLIREA+ASS + 
Sbjct: 203 IGSSSSSYSSGKISLQRTQTMPNNIRREEDNEIDEFEDQE-SDSRMGFLIREALASSHN- 260

Query: 670 FTPTKQTHQRVQAFQDLNHRDTRDQRSYSEMVAKSQRSPRGKTLRKTLSSVDT-KDLLML 846
             P    +QR +  + L   DT        MV +   SP  KTLRKTLSSV+T K++   
Sbjct: 261 -VPKVSNNQRQRPPRSLRLEDT-------VMVKQGGSSP--KTLRKTLSSVETSKEIQRH 310

Query: 847 KDLD-ITEPETNQAKDEEEQRRVPRAAVK 930
           +D D + EP    A       RVP+ + K
Sbjct: 311 RDYDQLVEPRV--ASGLATPPRVPKDSSK 337


 Score =  76 bits (185), Expect = 1e-011
 Identities = 38/77 (49%), Positives = 51/77 (66%), Gaps = 4/77 (5%)
 Frame = +1

Query: 139 IDDLLEDFWFFDNLLDRRSRILRYCHSDPYPLSPPSSSTCPNSKDGDSVSDKKLLKAPTG 318
           IDDLLED+WFF+NL+ RRSR L YCHSDPY    PSSST   ++    +    +L+A TG
Sbjct:  11 IDDLLEDYWFFENLITRRSRGLSYCHSDPY----PSSSTSTAAEKMGDLDSGNVLEASTG 66

Query: 319 GDSVQLPCIVNKEGGSE 369
              ++   I ++EGGS+
Sbjct:  67 RSLIRASSIDSREGGSQ 83

>gi|7769852|gb|AAF69530.1|AC008007_5 F12M16.8 [Arabidopsis thaliana]

          Length = 457

 Score =  113 bits (281), Expect = 7e-023
 Identities = 78/174 (44%), Positives = 107/174 (61%), Gaps = 16/174 (9%)
 Frame = +1

Query: 238 PPSSSTCPNSKDGDSVSDKKLLKAPTGGDSVQLPCIVNKEGGSEPEKM-NKMRRQFSEKI 414
           P S S     K  ++ + + L++AP+       P I  +E   E +KM NK+ RQFSEKI
Sbjct: 215 PKSGSRSAPGKIQEASTKRGLIRAPS-----LPPQIEKREMDREAKKMINKLTRQFSEKI 269

Query: 415 RVQE----RRTYLQKKEPVVREKGIRESSKKNRAGSTSSFCN-NNSLQRTQTLPSYIGRE 579
           RV E       +LQKKE + R+KGI ESS+ N+ GS+SS+ +   SLQRTQT+P+ +GRE
Sbjct: 270 RVLEPTRPGEHFLQKKETIARDKGITESSRSNKTGSSSSYSSVKISLQRTQTMPNNMGRE 329

Query: 580 DVG--NEFQDQEIDDSRMGFLIREAIASSSSEFTPTKQTHQRVQAFQDLNHRDT 735
           +    +EF+DQE  DSRMGFLIREA+A  SS + P    +QR +  + L   DT
Sbjct: 330 EDNEEDEFEDQE-SDSRMGFLIREALA--SSHYVPKVSNNQRQRPPRSLRLEDT 380


 Score =  111 bits (276), Expect = 3e-022
 Identities = 76/178 (42%), Positives = 101/178 (56%), Gaps = 18/178 (10%)
 Frame = +1

Query:  82 FLEMDPTLNDHQELEQIEA----IDDLLEDFWFFDNLLDRRSRILRYCHSDPYPLSPPSS 249
           FL     L   +E++ + +    IDDLLED+WFF+NL  RRSR LRYCHSDPYP S  S+
Sbjct:  87 FLTFFCNLLSEKEMDHVSSSELLIDDLLEDYWFFENLFTRRSRGLRYCHSDPYP-SSSST 145

Query: 250 STCPNSKDGDSVSDKKLLKAPTGGDSVQLPCIVNKEGGSEPEKMNKMRRQFSEKIRVQER 429
           ST P  K GDS    K+L+A TG   ++   I ++EGGS+     K+  +FSEKIRVQE+
Sbjct: 146 STSP-EKMGDS-DIGKVLEASTGRSLIRASSIDSREGGSQ----TKLTGRFSEKIRVQEQ 199

Query: 430 R---TYLQKKEPVVREKGIRESSKKNRAGSTSSFCNNNSLQRTQTLPSYIGREDVGNE 594
           R   + LQKKE VV  K    S  ++  G          L R  +LP  I + ++  E
Sbjct: 200 RQVGSSLQKKEHVVLPK----SGSRSAPGKIQEASTKRGLIRAPSLPPQIEKREMDRE 253

>gi|15219218|ref|NP_175726.1| uncharacterized protein [Arabidopsis thaliana]

          Length = 358

 Score =  113 bits (281), Expect = 7e-023
 Identities = 78/174 (44%), Positives = 107/174 (61%), Gaps = 16/174 (9%)
 Frame = +1

Query: 238 PPSSSTCPNSKDGDSVSDKKLLKAPTGGDSVQLPCIVNKEGGSEPEKM-NKMRRQFSEKI 414
           P S S     K  ++ + + L++AP+       P I  +E   E +KM NK+ RQFSEKI
Sbjct: 116 PKSGSRSAPGKIQEASTKRGLIRAPS-----LPPQIEKREMDREAKKMINKLTRQFSEKI 170

Query: 415 RVQE----RRTYLQKKEPVVREKGIRESSKKNRAGSTSSFCN-NNSLQRTQTLPSYIGRE 579
           RV E       +LQKKE + R+KGI ESS+ N+ GS+SS+ +   SLQRTQT+P+ +GRE
Sbjct: 171 RVLEPTRPGEHFLQKKETIARDKGITESSRSNKTGSSSSYSSVKISLQRTQTMPNNMGRE 230

Query: 580 DVG--NEFQDQEIDDSRMGFLIREAIASSSSEFTPTKQTHQRVQAFQDLNHRDT 735
           +    +EF+DQE  DSRMGFLIREA+A  SS + P    +QR +  + L   DT
Sbjct: 231 EDNEEDEFEDQE-SDSRMGFLIREALA--SSHYVPKVSNNQRQRPPRSLRLEDT 281


 Score =  110 bits (274), Expect = 5e-022
 Identities = 74/165 (44%), Positives = 96/165 (58%), Gaps = 15/165 (9%)
 Frame = +1

Query: 109 DHQELEQIEAIDDLLEDFWFFDNLLDRRSRILRYCHSDPYPLSPPSSSTCPNSKDGDSVS 288
           DH    ++  IDDLLED+WFF+NL  RRSR LRYCHSDPYP S  S+ST P  K GDS  
Sbjct:   2 DHVSSSEL-LIDDLLEDYWFFENLFTRRSRGLRYCHSDPYP-SSSSTSTSP-EKMGDS-D 57

Query: 289 DKKLLKAPTGGDSVQLPCIVNKEGGSEPEKMNKMRRQFSEKIRVQERR---TYLQKKEPV 459
             K+L+A TG   ++   I ++EGGS+     K+  +FSEKIRVQE+R   + LQKKE V
Sbjct:  58 IGKVLEASTGRSLIRASSIDSREGGSQ----TKLTGRFSEKIRVQEQRQVGSSLQKKEHV 113

Query: 460 VREKGIRESSKKNRAGSTSSFCNNNSLQRTQTLPSYIGREDVGNE 594
           V  K    S  ++  G          L R  +LP  I + ++  E
Sbjct: 114 VLPK----SGSRSAPGKIQEASTKRGLIRAPSLPPQIEKREMDRE 154

>gi|147776896|emb|CAN63558.1| hypothetical protein VITISV_034122 [Vitis
        vinifera]

          Length = 399

 Score =  58 bits (138), Expect = 3e-006
 Identities = 26/33 (78%), Positives = 27/33 (81%)
 Frame = +1

Query:  961 PIPVWVPKESRRDMKAQIKFWARTVASNVRQEC 1059
            PIP WV K S +DMKAQIKFWAR VASNV QEC
Sbjct:  367 PIPNWVGKSSAQDMKAQIKFWARAVASNVHQEC 399

>gi|225461562|ref|XP_002285236.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 399

 Score =  58 bits (138), Expect = 3e-006
 Identities = 26/33 (78%), Positives = 27/33 (81%)
 Frame = +1

Query:  961 PIPVWVPKESRRDMKAQIKFWARTVASNVRQEC 1059
            PIP WV K S +DMKAQIKFWAR VASNV QEC
Sbjct:  367 PIPNWVGKSSAQDMKAQIKFWARAVASNVHQEC 399

>gi|302142946|emb|CBI20241.3| unnamed protein product [Vitis vinifera]

          Length = 184

 Score =  58 bits (138), Expect = 3e-006
 Identities = 26/33 (78%), Positives = 27/33 (81%)
 Frame = +1

Query:  961 PIPVWVPKESRRDMKAQIKFWARTVASNVRQEC 1059
            PIP WV K S +DMKAQIKFWAR VASNV QEC
Sbjct:  152 PIPNWVGKSSAQDMKAQIKFWARAVASNVHQEC 184

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 533,275,694,958
Number of Sequences: 15229318
Number of Extensions: 533275694958
Number of Successful Extensions: 154799666
Number of sequences better than 0.0: 0