GenBank blast output of UN30156


BLASTX 7.6.2

Query= UN30156 /QuerySize=1294
        (1293 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|296090276|emb|CBI40095.3| unnamed protein product [Vitis vini...    143   8e-032
gi|297793237|ref|XP_002864503.1| hypothetical protein ARALYDRAFT...    129   1e-027
gi|30687076|ref|NP_849445.1| CCT motif family protein [Arabidops...    122   1e-025
gi|4538928|emb|CAB39664.1| putative protein [Arabidopsis thaliana]     122   1e-025
gi|18416659|ref|NP_567737.1| CCT motif family protein [Arabidops...    122   1e-025
gi|30696840|ref|NP_568852.2| chloroplast import apparatus 2 prot...    119   2e-024
gi|30696838|ref|NP_851201.1| chloroplast import apparatus 2 prot...    119   2e-024
gi|147784441|emb|CAN72725.1| hypothetical protein VITISV_015092 ...    119   2e-024
gi|225470656|ref|XP_002268213.1| PREDICTED: hypothetical protein...    119   2e-024
gi|297799402|ref|XP_002867585.1| hypothetical protein ARALYDRAFT...    112   1e-022
gi|255554380|ref|XP_002518229.1| CIL, putative [Ricinus communis]       89   1e-015
gi|224142697|ref|XP_002324691.1| predicted protein [Populus tric...     89   2e-015
gi|224087284|ref|XP_002308112.1| predicted protein [Populus tric...     84   6e-014
gi|79331135|ref|NP_001032087.1| chloroplast import apparatus 2 p...     67   8e-009

>gi|296090276|emb|CBI40095.3| unnamed protein product [Vitis vinifera]

          Length = 392

 Score =  143 bits (359), Expect = 8e-032
 Identities = 123/329 (37%), Positives = 155/329 (47%), Gaps = 45/329 (13%)
 Frame = +1

Query:  373 PLPRHQQSPNLQTQSPPGS-------QERSGNDRTRAT-TQAAALLSTAYPNIFFSPSST 528
            P   H  SP+        S       + R+   R   T  +AAALLSTAYPNIF   S+ 
Sbjct:   52 PRTSHSSSPSSTISESSNSPIAISTRKPRTPRKRPNQTYNEAAALLSTAYPNIF---STK 108

Query:  529 NHLYANKKTHHHFYGFDDDVDDDAELLLPSEPPD----FLFNPILQAYSDQKEVSSGVSV 696
            N     K T  H    D  ++D +ELL P    D     L  P+ +      E+  G   
Sbjct:  109 NLKNPCKFTKSH----DSFLEDSSELLFPFRAFDASGFLLHQPVQEKPRKSPELCDG--- 161

Query:  697 NESEVSQFEFSDEFESILVEEVEEGGVDSIMGKLEPGSGHRRGV------NHRFDHQIVK 858
                   FE   + ESIL EE+ EGG+DSIMG L   +            N  + + I  
Sbjct:  162 -------FEEDFDAESILDEEI-EGGIDSIMGNLSVDNEMSDEATNPVCFNSYYGNGIPM 213

Query:  859 GFKFAENIPLGLGLR---TALRDHNNDDDARFQTVDLDRIQTTVTVVDEVKKSKKKKKKV 1029
            G  F      G G+R    ALR  +  D  RF TVD+  I      V     ++KKKKKV
Sbjct:  214 GLGFGGKFEFGFGMRRGVRALRHVDEGDWWRFPTVDILEISPKFNKV----SAEKKKKKV 269

Query: 1030 AAAAAVVPLATVSSREETEERSGRPMLKLDYDGVLEAWSDKATPFGEESEVTGADVNARL 1209
              A  +    +       +  S   +LKL+YD VL AWSD+ +PF  E+E  G D  ARL
Sbjct:  270 EKAQELRSWESPKGNSIPKSNSSL-LLKLNYDDVLSAWSDRGSPFSRETEFPGNDTAARL 328

Query: 1210 AQIDLLGD-SGVREASVLRYKEKRQTRLF 1293
            AQIDL  +  GVREASVLRYKEKR+TRLF
Sbjct:  329 AQIDLFSECGGVREASVLRYKEKRRTRLF 357


 Score =  85 bits (209), Expect = 2e-014
 Identities = 46/71 (64%), Positives = 56/71 (78%), Gaps = 4/71 (5%)
 Frame = +3

Query: 264 SGQEMSACLSSGASAAYSFELEKIKSPPSSSTTTTRATSPSSTITESSNS---ISTRKPR 434
           S ++MS+CL SGA   Y FELE +KSP S+S  T+ ++SPSSTI+ESSNS   ISTRKPR
Sbjct:  22 SHKKMSSCL-SGAGRTYGFELEIVKSPSSTSPRTSHSSSPSSTISESSNSPIAISTRKPR 80

Query: 435 TQRKRPNQSYN 467
           T RKRPNQ+YN
Sbjct:  81 TPRKRPNQTYN 91

>gi|297793237|ref|XP_002864503.1| hypothetical protein ARALYDRAFT_495815
        [Arabidopsis lyrata subsp. lyrata]

          Length = 426

 Score =  129 bits (323), Expect = 1e-027
 Identities = 71/126 (56%), Positives = 86/126 (68%), Gaps = 6/126 (4%)
 Frame = +1

Query:  934 DARFQTVDLDRIQTTVTVVDEVKKSKKKKKKVAAAAAVVPLATVSSREETEERSGR---P 1104
            +    TV  ++      +  E    KKKKKK+ AAA  + +    S E+TEE S +   P
Sbjct:  265 ETAISTVHEEKSDGKKVISGEKSNKKKKKKKMTAAATPLLITESKSSEDTEETSPKRTGP 324

Query: 1105 MLKLDYDGVLEAWSDKATPFGEE---SEVTGADVNARLAQIDLLGDSGVREASVLRYKEK 1275
            +LKLDYDGVLEAWSDK +PF +E   SE TG DVNARLA+IDL GDSG+REASVLRYKEK
Sbjct:  325 LLKLDYDGVLEAWSDKTSPFPDEILGSEATGIDVNARLAEIDLFGDSGMREASVLRYKEK 384

Query: 1276 RQTRLF 1293
            R+TRLF
Sbjct:  385 RRTRLF 390


 Score =  102 bits (254), Expect = 1e-019
 Identities = 57/69 (82%), Positives = 62/69 (89%), Gaps = 6/69 (8%)
 Frame = +3

Query: 276 MSACLSS--GASAAYSFELEKIKSPPSSS-TTTTRATSPSSTITESSNS---ISTRKPRT 437
           MSAC+SS  G +AAYSFELEK+KSPPSSS TTTTRATSPSSTI+ESSNS   ISTRKPRT
Sbjct:   1 MSACISSGGGGAAAYSFELEKVKSPPSSSTTTTTRATSPSSTISESSNSPLAISTRKPRT 60

Query: 438 QRKRPNQSY 464
           QRKRPNQ+Y
Sbjct:  61 QRKRPNQTY 69

>gi|30687076|ref|NP_849445.1| CCT motif family protein [Arabidopsis thaliana]

          Length = 409

 Score =  122 bits (306), Expect = 1e-025
 Identities = 68/125 (54%), Positives = 84/125 (67%), Gaps = 4/125 (3%)
 Frame = +1

Query:  931 DDARFQTVDLDRIQTTVTVV-DEVKKSKKKKKKVAAAAAVVPLATVSSREETEERSGRPM 1107
            DD +   VD  +I+T VT   D+ KK KKKKKKVA AAA    + V+      E+   P+
Sbjct:  233 DDGQSNVVDSSKIKTIVTAEGDKKKKKKKKKKKVAPAAAESKSSEVTDSNPKLEQRVSPL 292

Query: 1108 LKLDYDGVLEAWSDKATPFGEE---SEVTGADVNARLAQIDLLGDSGVREASVLRYKEKR 1278
            LKLDYDGVLEAWS K +PF +E   S+  G D + RL +IDL G+SG+REASVLRYKEKR
Sbjct:  293 LKLDYDGVLEAWSGKESPFSDEILGSDADGVDFHVRLGEIDLFGESGMREASVLRYKEKR 352

Query: 1279 QTRLF 1293
            + RLF
Sbjct:  353 RNRLF 357

>gi|4538928|emb|CAB39664.1| putative protein [Arabidopsis thaliana]

          Length = 378

 Score =  122 bits (306), Expect = 1e-025
 Identities = 68/125 (54%), Positives = 84/125 (67%), Gaps = 4/125 (3%)
 Frame = +1

Query:  931 DDARFQTVDLDRIQTTVTVV-DEVKKSKKKKKKVAAAAAVVPLATVSSREETEERSGRPM 1107
            DD +   VD  +I+T VT   D+ KK KKKKKKVA AAA    + V+      E+   P+
Sbjct:  233 DDGQSNVVDSSKIKTIVTAEGDKKKKKKKKKKKVAPAAAESKSSEVTDSNPKLEQRVSPL 292

Query: 1108 LKLDYDGVLEAWSDKATPFGEE---SEVTGADVNARLAQIDLLGDSGVREASVLRYKEKR 1278
            LKLDYDGVLEAWS K +PF +E   S+  G D + RL +IDL G+SG+REASVLRYKEKR
Sbjct:  293 LKLDYDGVLEAWSGKESPFSDEILGSDADGVDFHVRLGEIDLFGESGMREASVLRYKEKR 352

Query: 1279 QTRLF 1293
            + RLF
Sbjct:  353 RNRLF 357

>gi|18416659|ref|NP_567737.1| CCT motif family protein [Arabidopsis thaliana]

          Length = 394

 Score =  122 bits (306), Expect = 1e-025
 Identities = 68/125 (54%), Positives = 84/125 (67%), Gaps = 4/125 (3%)
 Frame = +1

Query:  931 DDARFQTVDLDRIQTTVTVV-DEVKKSKKKKKKVAAAAAVVPLATVSSREETEERSGRPM 1107
            DD +   VD  +I+T VT   D+ KK KKKKKKVA AAA    + V+      E+   P+
Sbjct:  233 DDGQSNVVDSSKIKTIVTAEGDKKKKKKKKKKKVAPAAAESKSSEVTDSNPKLEQRVSPL 292

Query: 1108 LKLDYDGVLEAWSDKATPFGEE---SEVTGADVNARLAQIDLLGDSGVREASVLRYKEKR 1278
            LKLDYDGVLEAWS K +PF +E   S+  G D + RL +IDL G+SG+REASVLRYKEKR
Sbjct:  293 LKLDYDGVLEAWSGKESPFSDEILGSDADGVDFHVRLGEIDLFGESGMREASVLRYKEKR 352

Query: 1279 QTRLF 1293
            + RLF
Sbjct:  353 RNRLF 357

>gi|30696840|ref|NP_568852.2| chloroplast import apparatus 2 protein
        [Arabidopsis thaliana]

          Length = 435

 Score =  119 bits (296), Expect = 2e-024
 Identities = 68/124 (54%), Positives = 84/124 (67%), Gaps = 5/124 (4%)
 Frame = +1

Query:  934 DARFQTVDLDRIQTTVTVVDEVKKSKKKKKKVAAAAAVVPLATVSSREETEERSGR---P 1104
            +    TVD ++      V+   K +KKKKKK       + +    S E+TEE S +   P
Sbjct:  277 ETAISTVDEEKSDGKKVVISGEKSNKKKKKKKMTVTTTL-ITESKSLEDTEETSLKRTGP 335

Query: 1105 MLKLDYDGVLEAWSDKATPFGEESEVTGA-DVNARLAQIDLLGDSGVREASVLRYKEKRQ 1281
            +LKLDYDGVLEAWSDK +PF +E + + A DVNARLAQIDL GDSG+REASVLRYKEKR+
Sbjct:  336 LLKLDYDGVLEAWSDKTSPFPDEIQGSEAVDVNARLAQIDLFGDSGMREASVLRYKEKRR 395

Query: 1282 TRLF 1293
            TRLF
Sbjct:  396 TRLF 399

>gi|30696838|ref|NP_851201.1| chloroplast import apparatus 2 protein
        [Arabidopsis thaliana]

          Length = 424

 Score =  119 bits (296), Expect = 2e-024
 Identities = 68/124 (54%), Positives = 84/124 (67%), Gaps = 5/124 (4%)
 Frame = +1

Query:  934 DARFQTVDLDRIQTTVTVVDEVKKSKKKKKKVAAAAAVVPLATVSSREETEERSGR---P 1104
            +    TVD ++      V+   K +KKKKKK       + +    S E+TEE S +   P
Sbjct:  277 ETAISTVDEEKSDGKKVVISGEKSNKKKKKKKMTVTTTL-ITESKSLEDTEETSLKRTGP 335

Query: 1105 MLKLDYDGVLEAWSDKATPFGEESEVTGA-DVNARLAQIDLLGDSGVREASVLRYKEKRQ 1281
            +LKLDYDGVLEAWSDK +PF +E + + A DVNARLAQIDL GDSG+REASVLRYKEKR+
Sbjct:  336 LLKLDYDGVLEAWSDKTSPFPDEIQGSEAVDVNARLAQIDLFGDSGMREASVLRYKEKRR 395

Query: 1282 TRLF 1293
            TRLF
Sbjct:  396 TRLF 399

>gi|147784441|emb|CAN72725.1| hypothetical protein VITISV_015092 [Vitis
        vinifera]

          Length = 392

 Score =  119 bits (296), Expect = 2e-024
 Identities = 86/202 (42%), Positives = 107/202 (52%), Gaps = 16/202 (7%)
 Frame = +1

Query:  718 FEFSDEFESILVEEVEEGGVDSIMGKLEPGSGHRRGV------NHRFDHQIVKGFKFAEN 879
            FE   + ESIL EE+ EGG+DSIMG L   +            N  + + I  G  F   
Sbjct:  162 FEEDFDAESILDEEI-EGGIDSIMGNLSVDNEMSDEATNPVCFNSYYGNGIPMGLGFGGK 220

Query:  880 IPLGLGLR---TALRDHNNDDDARFQTVDLDRIQTTVTVVDEVKKSKKKKKKVAAAAAVV 1050
               G G+R    ALR  +  D  RF TVD+  I      V     ++KKKKKV  A  + 
Sbjct:  221 FEFGFGMRRGVRALRHVDEGDWWRFPTVDILEISPKFNKV----SAEKKKKKVEKAQELR 276

Query: 1051 PLATVSSREETEERSGRPMLKLDYDGVLEAWSDKATPFGEESEVTGADVNARLAQIDLLG 1230
               +       +  S   +LKL+YD VL AWSD+ +PF  E+E  G D  ARLAQIDL  
Sbjct:  277 SWESPKGNSIPKSNSSL-LLKLNYDDVLSAWSDRGSPFSRETEFPGNDTAARLAQIDLFS 335

Query: 1231 D-SGVREASVLRYKEKRQTRLF 1293
            +  GVREASVLRYKEKR+TRLF
Sbjct:  336 ECGGVREASVLRYKEKRRTRLF 357


 Score =  77 bits (187), Expect = 7e-012
 Identities = 42/66 (63%), Positives = 50/66 (75%), Gaps = 4/66 (6%)
 Frame = +3

Query: 279 SACLSSGASAAYSFELEKIKSPPSSSTTTTRATSPSSTITESSNS---ISTRKPRTQRKR 449
           S+CL SGA   Y FELE +K P S+S  T+ ++SPSSTI+ESSNS   ISTRK RT RKR
Sbjct:   2 SSCL-SGAGRTYGFELEIVKXPSSTSPRTSHSSSPSSTISESSNSPIAISTRKXRTPRKR 60

Query: 450 PNQSYN 467
           PNQ+YN
Sbjct:  61 PNQTYN 66

>gi|225470656|ref|XP_002268213.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 392

 Score =  119 bits (296), Expect = 2e-024
 Identities = 86/202 (42%), Positives = 107/202 (52%), Gaps = 16/202 (7%)
 Frame = +1

Query:  718 FEFSDEFESILVEEVEEGGVDSIMGKLEPGSGHRRGV------NHRFDHQIVKGFKFAEN 879
            FE   + ESIL EE+ EGG+DSIMG L   +            N  + + I  G  F   
Sbjct:  162 FEEDFDAESILDEEI-EGGIDSIMGNLSVDNEMSDEATNPVCFNSYYGNGIPMGLGFGGK 220

Query:  880 IPLGLGLR---TALRDHNNDDDARFQTVDLDRIQTTVTVVDEVKKSKKKKKKVAAAAAVV 1050
               G G+R    ALR  +  D  RF TVD+  I      V     ++KKKKKV  A  + 
Sbjct:  221 FEFGFGMRRGVRALRHVDEGDWWRFPTVDILEISPKFNKV----SAEKKKKKVEKAQELR 276

Query: 1051 PLATVSSREETEERSGRPMLKLDYDGVLEAWSDKATPFGEESEVTGADVNARLAQIDLLG 1230
               +       +  S   +LKL+YD VL AWSD+ +PF  E+E  G D  ARLAQIDL  
Sbjct:  277 SWESPKGNSIPKSNSSL-LLKLNYDDVLSAWSDRGSPFSRETEFPGNDTAARLAQIDLFS 335

Query: 1231 D-SGVREASVLRYKEKRQTRLF 1293
            +  GVREASVLRYKEKR+TRLF
Sbjct:  336 ECGGVREASVLRYKEKRRTRLF 357


 Score =  82 bits (200), Expect = 2e-013
 Identities = 44/66 (66%), Positives = 52/66 (78%), Gaps = 4/66 (6%)
 Frame = +3

Query: 279 SACLSSGASAAYSFELEKIKSPPSSSTTTTRATSPSSTITESSNS---ISTRKPRTQRKR 449
           S+CL SGA   Y FELE +KSP S+S  T+ ++SPSSTI+ESSNS   ISTRKPRT RKR
Sbjct:   2 SSCL-SGAGRTYGFELEIVKSPSSTSPRTSHSSSPSSTISESSNSPIAISTRKPRTPRKR 60

Query: 450 PNQSYN 467
           PNQ+YN
Sbjct:  61 PNQTYN 66

>gi|297799402|ref|XP_002867585.1| hypothetical protein ARALYDRAFT_354188
        [Arabidopsis lyrata subsp. lyrata]

          Length = 392

 Score =  112 bits (280), Expect = 1e-022
 Identities = 63/124 (50%), Positives = 80/124 (64%), Gaps = 5/124 (4%)
 Frame = +1

Query:  931 DDARFQTVDLDRIQTTVTVVDEVKKSKKKKKKVAAAAAVVPLATVSSREETEERSGRPML 1110
            DD +   +D +  +T VT  +  K+ KKKKKK   A     L    S  + E+R   P+L
Sbjct:  234 DDGQSNVLDSNNNKTIVT-AEGGKRKKKKKKKTKVAPPTAELEVPDSSPKMEQRVS-PLL 291

Query: 1111 KLDYDGVLEAWSDKATPFGEE---SEVTGADVNARLAQIDLLGDSGVREASVLRYKEKRQ 1281
            KLDYDGVLEAWS K +PF +E   S+  G D +ARL +IDL G+SG+REASVLRYKEKR+
Sbjct:  292 KLDYDGVLEAWSGKESPFSDEILGSDAAGVDFHARLGEIDLFGESGMREASVLRYKEKRR 351

Query: 1282 TRLF 1293
             RLF
Sbjct:  352 NRLF 355

>gi|255554380|ref|XP_002518229.1| CIL, putative [Ricinus communis]

          Length = 397

 Score =  89 bits (219), Expect = 1e-015
 Identities = 53/99 (53%), Positives = 63/99 (63%), Gaps = 6/99 (6%)
 Frame = +1

Query: 1000 KKSKKKKKKVAAAAAVVPLATVSSREETEERSGRPMLKLDYDGVLEAWSDKATPFGEE-- 1173
            K  +KKKKKV     +  L          + S   +LKL+YDGVL AWSDK +PF EE  
Sbjct:  276 KSDEKKKKKV---VELKNLEMAKKESSVPQSSSGLLLKLNYDGVLNAWSDKGSPFSEEIS 332

Query: 1174 SEVTGADVNARLAQIDLLGDS-GVREASVLRYKEKRQTR 1287
                G DV+ARLAQIDL  ++ GVREASVLRYKEKR+TR
Sbjct:  333 GSEGGNDVSARLAQIDLFSENGGVREASVLRYKEKRRTR 371

>gi|224142697|ref|XP_002324691.1| predicted protein [Populus trichocarpa]

          Length = 141

 Score =  89 bits (218), Expect = 2e-015
 Identities = 44/67 (65%), Positives = 54/67 (80%), Gaps = 2/67 (2%)
 Frame = +1

Query: 1099 RPMLKLDYDGVLEAWSDKATPFGEES--EVTGADVNARLAQIDLLGDSGVREASVLRYKE 1272
            R +LKL+YD VL AWSD+ +PF EE+     G DV+ARLAQIDL  ++G+REASVLRYKE
Sbjct:   29 RLILKLNYDNVLNAWSDRGSPFSEETMGSAEGTDVSARLAQIDLFSENGMREASVLRYKE 88

Query: 1273 KRQTRLF 1293
            KR+TRLF
Sbjct:   89 KRRTRLF 95

>gi|224087284|ref|XP_002308112.1| predicted protein [Populus trichocarpa]

          Length = 448

 Score =  84 bits (205), Expect = 6e-014
 Identities = 42/65 (64%), Positives = 52/65 (80%), Gaps = 2/65 (3%)
 Frame = +1

Query: 1105 MLKLDYDGVLEAWSDKATPFGEESE--VTGADVNARLAQIDLLGDSGVREASVLRYKEKR 1278
            +LKL+YD VL  WSD+ +PF +ES     G DV+ARLAQIDL  ++G+REASVLRYKEKR
Sbjct:  344 ILKLNYDHVLSEWSDRGSPFSDESMGCAEGNDVSARLAQIDLFSENGMREASVLRYKEKR 403

Query: 1279 QTRLF 1293
            +TRLF
Sbjct:  404 RTRLF 408


 Score =  74 bits (180), Expect = 5e-011
 Identities = 40/67 (59%), Positives = 48/67 (71%), Gaps = 3/67 (4%)
 Frame = +3

Query: 276 MSACLSSGASAAYSFELEKIKSPPSSSTTTTRATSPSSTITESSNS---ISTRKPRTQRK 446
           MS+   SG    Y F+LE +KS  +SST T+  +SPSSTI+ESSNS   ISTRK RT RK
Sbjct:   1 MSSPCISGGGRTYGFDLEIVKSSSTSSTRTSHTSSPSSTISESSNSPLAISTRKSRTPRK 60

Query: 447 RPNQSYN 467
           RPNQ+YN
Sbjct:  61 RPNQTYN 67

>gi|79331135|ref|NP_001032087.1| chloroplast import apparatus 2 protein
        [Arabidopsis thaliana]

          Length = 376

 Score =  67 bits (161), Expect = 8e-009
 Identities = 42/99 (42%), Positives = 56/99 (56%), Gaps = 5/99 (5%)
 Frame = +1

Query:  934 DARFQTVDLDRIQTTVTVVDEVKKSKKKKKKVAAAAAVVPLATVSSREETEERSGR---P 1104
            +    TVD ++      V+   K +KKKKKK       + +    S E+TEE S +   P
Sbjct:  277 ETAISTVDEEKSDGKKVVISGEKSNKKKKKKKMTVTTTL-ITESKSLEDTEETSLKRTGP 335

Query: 1105 MLKLDYDGVLEAWSDKATPFGEESEVTGA-DVNARLAQI 1218
            +LKLDYDGVLEAWSDK +PF +E + + A DVN  L  I
Sbjct:  336 LLKLDYDGVLEAWSDKTSPFPDEIQGSEAVDVNVCLTLI 374

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,531,826,676,690
Number of Sequences: 15229318
Number of Extensions: 3531826676690
Number of Successful Extensions: 836928167
Number of sequences better than 0.0: 0
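
Note on the Score lines: each alignment reports a bit score followed by a raw score in parentheses, e.g. "143 bits (359)". The two appear to be related by the standard Karlin-Altschul normalization, bits = (Lambda * raw - ln K) / ln 2, using the gapped Lambda and K printed in the footer above. The sketch below is only a consistency check under that assumption; the function name `bit_score` is hypothetical and is not part of any BLAST tool.

```python
import math

# Gapped Karlin-Altschul parameters copied from the report footer.
# Assumption: these are the values the program used for normalization.
LAMBDA = 0.267
K = 0.041


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score to a bit score (hypothetical helper)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# (raw score, reported bits) pairs copied from Score lines in this report.
reported = [(359, 143), (323, 129), (306, 122), (161, 67)]

for raw, bits in reported:
    computed = bit_score(raw)
    # Print the unrounded value next to the rounded one and the reported one.
    print(f"raw={raw:4d}  computed={computed:7.2f}  "
          f"rounded={round(computed):4d}  reported={bits:4d}")
```

Rounding the computed value to the nearest integer reproduces the reported bit score for each pair listed, which suggests the Score lines and the footer parameters are mutually consistent.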