
GenBank BLAST output of UN80696


BLASTX 7.6.2

Query= UN80696 /QuerySize=743
        (742 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|24417230|gb|AAN60225.1| unknown [Arabidopsis thaliana]              278   5e-073
gi|30682651|ref|NP_566456.3| uncharacterized protein [Arabidopsi...    277   1e-072
gi|297834166|ref|XP_002884965.1| hypothetical protein ARALYDRAFT...    275   6e-072
gi|297847918|ref|XP_002891840.1| hypothetical protein ARALYDRAFT...    263   3e-068
gi|255564784|ref|XP_002523386.1| conserved hypothetical protein ...    168   7e-040
gi|147774374|emb|CAN72399.1| hypothetical protein VITISV_041203 ...    160   3e-037
gi|224056921|ref|XP_002299090.1| predicted protein [Populus tric...    149   5e-034
gi|21618303|gb|AAM67353.1| unknown [Arabidopsis thaliana]              141   2e-031
gi|186491177|ref|NP_001117500.1| uncharacterized protein [Arabid...    127   1e-027
gi|255642161|gb|ACU21345.1| unknown [Glycine max]                      125   7e-027
gi|297819868|ref|XP_002877817.1| predicted protein [Arabidopsis ...    118   9e-025
gi|297844056|ref|XP_002889909.1| hypothetical protein ARALYDRAFT...    118   9e-025
gi|4204260|gb|AAD10641.1| Hypothetical protein [Arabidopsis thal...     87   2e-015

>gi|24417230|gb|AAN60225.1| unknown [Arabidopsis thaliana]

          Length = 321

 Score =  278 bits (711), Expect = 5e-073
 Identities = 165/225 (73%), Positives = 179/225 (79%), Gaps = 18/225 (8%)
 Frame = +3

Query:  81 KMRIGMLALLVTLSAI---EIGLASSPNTVPAFLWSPHLLQDGN----EGVNYQVMSGKD 239
           K++IG +ALLV LS +   EIGLA SPNTVPAFLWSPH LQ  N    E VNYQVMS KD
Sbjct:   3 KIQIGAVALLVFLSVVSLFEIGLA-SPNTVPAFLWSPH-LQSANGELDEAVNYQVMSAKD 60

Query: 240 LVDSVFTQGGWSNILCSEKKAEQPVIDVAPVFVGRELLSSDVSSKSSS---LVNTLSGLF 410
           LV SVFTQGGWSN LCSEKK EQPV DVA VF+GRELLSSDVSSK +S   LVNTL+ LF
Sbjct:  61 LVGSVFTQGGWSNFLCSEKKLEQPV-DVALVFIGRELLSSDVSSKRNSDPALVNTLNNLF 119

Query: 411 TSSNFSLAFPYIAAPEEERMESLLLSGLKQACPHNAGVSSNIVFSDSCFVEDGTIQKLSD 590
           T+SNFSLAFPYIAAPEEERME+LLLSGLK+ACP+N GV SNIVFSDSCFVEDGTIQKLSD
Sbjct: 120 TASNFSLAFPYIAAPEEERMENLLLSGLKEACPNNVGV-SNIVFSDSCFVEDGTIQKLSD 178

Query: 591 LQSFKDHLLARKETRKK-RN*LGCALFRGF*IK---GQSHSEHEA 713
           LQSFKDHLLAR+ETRK+    L      G       GQSHSE E+
Sbjct: 179 LQSFKDHLLARRETRKEGETDLVVLCSEGSESNSQAGQSHSERES 223

>gi|30682651|ref|NP_566456.3| uncharacterized protein [Arabidopsis thaliana]

          Length = 321

 Score =  277 bits (708), Expect = 1e-072
 Identities = 165/225 (73%), Positives = 178/225 (79%), Gaps = 18/225 (8%)
 Frame = +3

Query:  81 KMRIGMLALLVTLSA---IEIGLASSPNTVPAFLWSPHLLQDGN----EGVNYQVMSGKD 239
           K++IG +ALLV LS     EIGLA SPNTVPAFLWSPH LQ  N    E VNYQVMS KD
Sbjct:   3 KIQIGAVALLVFLSVASLFEIGLA-SPNTVPAFLWSPH-LQSANGELDEAVNYQVMSAKD 60

Query: 240 LVDSVFTQGGWSNILCSEKKAEQPVIDVAPVFVGRELLSSDVSSKSSS---LVNTLSGLF 410
           LV SVFTQGGWSN LCSEKK EQPV DVA VF+GRELLSSDVSSK +S   LVNTL+ LF
Sbjct:  61 LVGSVFTQGGWSNFLCSEKKLEQPV-DVALVFIGRELLSSDVSSKRNSDPALVNTLNNLF 119

Query: 411 TSSNFSLAFPYIAAPEEERMESLLLSGLKQACPHNAGVSSNIVFSDSCFVEDGTIQKLSD 590
           T+SNFSLAFPYIAAPEEERME+LLLSGLK+ACP+N GV SNIVFSDSCFVEDGTIQKLSD
Sbjct: 120 TASNFSLAFPYIAAPEEERMENLLLSGLKEACPNNVGV-SNIVFSDSCFVEDGTIQKLSD 178

Query: 591 LQSFKDHLLARKETRKK-RN*LGCALFRGF*IK---GQSHSEHEA 713
           LQSFKDHLLAR+ETRK+    L      G       GQSHSE E+
Sbjct: 179 LQSFKDHLLARRETRKEGETDLVVLCSEGSESNSQAGQSHSERES 223

>gi|297834166|ref|XP_002884965.1| hypothetical protein ARALYDRAFT_478722
        [Arabidopsis lyrata subsp. lyrata]

          Length = 321

 Score =  275 bits (702), Expect = 6e-072
 Identities = 163/230 (70%), Positives = 178/230 (77%), Gaps = 15/230 (6%)
 Frame = +3

Query:  78 MKMRIGMLALLVTL---SAIEIGLASSPNTVPAFLWSPHLLQDGN----EGVNYQVMSGK 236
           MK++I  +ALLV L   S  EIGLAS+ NTVPAFLWSPH LQ  N    E VNYQVMS K
Sbjct:   2 MKIQISAVALLVALSMASLFEIGLAST-NTVPAFLWSPH-LQSANGELDEVVNYQVMSAK 59

Query: 237 DLVDSVFTQGGWSNILCSEKKAEQPVIDVAPVFVGRELLSSDVSSK---SSSLVNTLSGL 407
           DLV SVFTQGGWSN LCSEKK EQPV DVA VF+GRELLSSDVSSK    S+LVNTLS L
Sbjct:  60 DLVGSVFTQGGWSNFLCSEKKLEQPV-DVALVFIGRELLSSDVSSKRNSDSALVNTLSNL 118

Query: 408 FTSSNFSLAFPYIAAPEEERMESLLLSGLKQACPHNAGVSSNIVFSDSCFVEDGTIQKLS 587
           FT+SNFSLAFPYIAAPEEERME+LLLSGLK+ACP+N GV SNIVFSDSCFV+DGTIQKLS
Sbjct: 119 FTASNFSLAFPYIAAPEEERMENLLLSGLKEACPNNVGV-SNIVFSDSCFVQDGTIQKLS 177

Query: 588 DLQSFKDHLLARKETRKK-RN*LGCALFRGF*IKGQSHSEHEASPSLLVL 734
           DLQSFKDHLLAR+ETRK+    L      G   K Q+   H    S+L L
Sbjct: 178 DLQSFKDHLLARRETRKEGETDLVVLCSEGSESKSQAAQSHSERESILEL 227

>gi|297847918|ref|XP_002891840.1| hypothetical protein ARALYDRAFT_314773
        [Arabidopsis lyrata subsp. lyrata]

          Length = 313

 Score =  263 bits (670), Expect = 3e-068
 Identities = 143/192 (74%), Positives = 163/192 (84%), Gaps = 10/192 (5%)
 Frame = +3

Query:  78 MKMRIGMLALLVTLSAIEIGLASSPNTVPAFLWSPHLLQ-DGNEGVNYQVMSGKDLVDSV 254
           M+MRI ++ALLV   A++ GLA SP+TVPAFLWSPHL   +G   VNYQVMS KDLVDSV
Sbjct:   2 MRMRIVLVALLV---ALDFGLA-SPSTVPAFLWSPHLQSANGEMDVNYQVMSAKDLVDSV 57

Query: 255 FTQGGWSNILCSEKKAEQPVIDVAPVFVGRELLSSDVSSKSSS---LVNTLSGLFTSSNF 425
           FTQGGWSN LCSEK  +QPV DVA VF+GRELLSSDVSS  +S   LVN L  L+T+SNF
Sbjct:  58 FTQGGWSNFLCSEKNLQQPV-DVALVFIGRELLSSDVSSNRNSDPALVNILKNLYTASNF 116

Query: 426 SLAFPYIAAPEEERMESLLLSGLKQACPHNAGVSSNIVFSDSCFVEDGTIQKLSDLQSFK 605
           SLAFPYIAAPEEERME+LLLSGLK+AC HN GV +N+VFSDSCFVEDGTIQKLS++QSFK
Sbjct: 117 SLAFPYIAAPEEERMENLLLSGLKEACAHNVGV-TNVVFSDSCFVEDGTIQKLSNVQSFK 175

Query: 606 DHLLARKETRKK 641
           DHL+ARKETRK+
Sbjct: 176 DHLVARKETRKE 187

>gi|255564784|ref|XP_002523386.1| conserved hypothetical protein [Ricinus
        communis]

          Length = 318

 Score =  168 bits (425), Expect = 7e-040
 Identities = 93/183 (50%), Positives = 128/183 (69%), Gaps = 10/183 (5%)
 Frame = +3

Query: 105 LLVTLSAIEIGLASSPNTVPAFLWSPHLLQDG--NEGVNYQVMSGKDLVDSVFTQGGWSN 278
           L+V L  +   LASS   VPAF WS H   +   +E VNYQ +S +DL  S+ ++GGWSN
Sbjct:  14 LMVALCNVPGLLASS---VPAFFWSSHQFSNNGMDEAVNYQTISSRDLARSILSEGGWSN 70

Query: 279 ILCSEKKAEQPVIDVAPVFVGRELLSSDVSSKSS---SLVNTLSGLFTSSNFSLAFPYIA 449
           ILCSEKK +QPV D+A VFVGRELLS+D+S++ S   +LV+ L  L+  SNFS+AFPYIA
Sbjct:  71 ILCSEKKLQQPV-DLALVFVGRELLSTDISTRKSADPALVSLLKVLYGRSNFSMAFPYIA 129

Query: 450 APEEERMESLLLSGLKQACPHNAGVSSNIVFSDSCFVEDGTIQKLSDLQSFKDHLLARKE 629
           A EEE ME+ L+SG  ++C  + G+ +N+ FS+SC VE    +KL+D+ +  DHL++R E
Sbjct: 130 ASEEETMENSLVSGFVESCGQDLGI-NNVAFSESCSVEGENFEKLADVHAVHDHLVSRME 188

Query: 630 TRK 638
            R+
Sbjct: 189 KRQ 191

>gi|147774374|emb|CAN72399.1| hypothetical protein VITISV_041203 [Vitis
        vinifera]

          Length = 315

 Score =  160 bits (403), Expect = 3e-037
 Identities = 91/186 (48%), Positives = 124/186 (66%), Gaps = 8/186 (4%)
 Frame = +3

Query:  90 IGMLALLVTLSAIEIGLASSPNTVPAFLWSPHLLQDGNEGVNYQVMSGKDLVDSVFTQGG 269
           + +L +L+ +S +    A  P+TVPAFLWS H   +  E VNYQ +S KDL  SV ++GG
Sbjct:   6 VSLLMVLLVVSRVPYRTA-LPSTVPAFLWSHH-QXEMKEAVNYQTLSPKDLAKSVVSEGG 63

Query: 270 WSNILCSEKKAEQPVIDVAPVFVGRELLSSDVSSK---SSSLVNTLSGLFTSSNFSLAFP 440
           WSN+LCS +K +QPV D+A VFVGREL S D+S       +LV+ L   F  SNFS+AFP
Sbjct:  64 WSNLLCSGEKDQQPV-DLALVFVGRELSSLDISGSKHADPALVDLLKVSFARSNFSMAFP 122

Query: 441 YIAAPEE-ERMESLLLSGLKQACPHNAGVSSNIVFSDSCFVEDGTIQKLSDLQSFKDHLL 617
           Y+A  EE E ME+ L+SG  + C H+ GV SN+ F +SC VE G  +KL+DL S  D+L+
Sbjct: 123 YVAVSEEKEAMENSLISGFTETCGHDLGV-SNVAFLESCSVEGGNFKKLADLHSVHDYLV 181

Query: 618 ARKETR 635
           +R++ R
Sbjct: 182 SRRKMR 187

>gi|224056921|ref|XP_002299090.1| predicted protein [Populus trichocarpa]

          Length = 316

 Score =  149 bits (375), Expect = 5e-034
 Identities = 92/188 (48%), Positives = 122/188 (64%), Gaps = 14/188 (7%)
 Frame = +3

Query:  90 IGMLALLVTLSAIEIGLASSPN--TVPAFLWSP-HLLQDGNEGVNYQVMSGKDLVDSVFT 260
           I  L ++   S +    ASSP+  TVPAFLWSP H     +E VNYQ +S KDL  SV +
Sbjct:   7 IWWLTVVAAASRLSFLHASSPSSTTVPAFLWSPHHPHHQMSEVVNYQTISSKDLARSVLS 66

Query: 261 QGGWSNILCSEKKAEQPVIDVAPVFVGRELLSSDVSSKSS---SLVNTLSGLFTSSNFSL 431
           +GGWSN+LCSEKK +Q V D+A VF+GR LLS+DVS+  +   +LVN L      SNFS+
Sbjct:  67 EGGWSNLLCSEKKVQQSV-DLALVFIGRGLLSTDVSANKNTDPALVNLL-----KSNFSM 120

Query: 432 AFPYIAAPEEERMESLLLSGLKQACPHNAGVSSNIVFSDSCFVEDGTIQKLSDLQSFKDH 611
           AF Y+AA  EE ME+ L+SG  +AC  +  + SN+ FS+SC VE    QKL++L +  D+
Sbjct: 121 AFSYVAA-SEEAMENSLVSGFAEACGQDLEI-SNVAFSESCSVEGENFQKLANLHAINDY 178

Query: 612 LLARKETR 635
           L +R E R
Sbjct: 179 LASRMEKR 186

>gi|21618303|gb|AAM67353.1| unknown [Arabidopsis thaliana]

          Length = 203

 Score =  141 bits (353), Expect = 2e-031
 Identities = 70/78 (89%), Positives = 76/78 (97%), Gaps = 1/78 (1%)
 Frame = +3

Query: 408 FTSSNFSLAFPYIAAPEEERMESLLLSGLKQACPHNAGVSSNIVFSDSCFVEDGTIQKLS 587
           FT+SNFSLAFPYIAAPEEERME+LLLSGLK+ACP+N GV SNIVFSDSCFVEDGTIQKLS
Sbjct:   1 FTASNFSLAFPYIAAPEEERMENLLLSGLKEACPNNVGV-SNIVFSDSCFVEDGTIQKLS 59

Query: 588 DLQSFKDHLLARKETRKK 641
           DLQSFKDHLLAR+ETRK+
Sbjct:  60 DLQSFKDHLLARRETRKE 77

>gi|186491177|ref|NP_001117500.1| uncharacterized protein [Arabidopsis
        thaliana]

          Length = 131

 Score =  127 bits (319), Expect = 1e-027
 Identities = 73/108 (67%), Positives = 82/108 (75%), Gaps = 6/108 (5%)
 Frame = +3

Query:  96 MLALLVTLSAIEIGLASSPNTVPAFLWSPHL-LQDGNEGVNYQVMSGKDLVDSVFTQGGW 272
           +L +L   S ++ GLA SP+TVPAFLWSPHL   +G   VNYQVMS KDLVDSVFT GGW
Sbjct:  26 LLVVLEFASLVDFGLA-SPSTVPAFLWSPHLQYANGETDVNYQVMSAKDLVDSVFTLGGW 84

Query: 273 SNILCSEKKAEQPVIDVAPVFVGRELLSSDVSSKSSS---LVNTLSGL 407
           SN LCSEKK +QPV DVA VF+GRELLSSDVSS  +S   LVNTL  L
Sbjct:  85 SNFLCSEKKLQQPV-DVALVFIGRELLSSDVSSNQNSDPVLVNTLKYL 131

>gi|255642161|gb|ACU21345.1| unknown [Glycine max]

          Length = 201

 Score =  125 bits (313), Expect = 7e-027
 Identities = 77/197 (39%), Positives = 119/197 (60%), Gaps = 13/197 (6%)
 Frame = +3

Query: 102 ALLVTLSAIEIGLASSPNTVPAFLWSPH---LLQDG-NEGVNYQVMSGKDLVDSVFTQGG 269
           A ++  + I  GL + P+TVPAFLWS H     ++G  E VNYQV+S KDL  SVF++ G
Sbjct:   8 AFILFFAFIPNGLLAVPSTVPAFLWSSHYELASENGLKESVNYQVISPKDLAKSVFSEAG 67

Query: 270 WSNILCSEKKAEQPVIDVAPVFVGRELLSSDVSSK---SSSLVNTLSGLFTSSNFSLAFP 440
           WSN LC  KK  +P +D+A +FVGREL SSD++      S+L++ L   F  SN S+AFP
Sbjct:  68 WSNFLCKGKKLHEP-LDLALLFVGRELQSSDLNMNKHADSALLDLLKISFARSNTSVAFP 126

Query: 441 YIAAPEEERMESLLLSGLKQACPHNAGVSSNIVFSDSCFVEDGTIQKLSDLQSFKDHLLA 620
           Y++  E+  +E+ L+SG  +AC  + G+  N+ F  SC ++    ++++   +    L  
Sbjct: 127 YVSTSEDVLLENSLISGFAEAC-GDMGI-GNVAFHGSCSMDGANHEEITTF-ALSSRLFD 183

Query: 621 RKETR--KKRN*LGCAL 665
           +++ R  ++ N  GC L
Sbjct: 184 QEDGRESQRENRFGCVL 200

>gi|297819868|ref|XP_002877817.1| predicted protein [Arabidopsis lyrata subsp.
        lyrata]

          Length = 189

 Score =  118 bits (295), Expect = 9e-025
 Identities = 65/85 (76%), Positives = 72/85 (84%), Gaps = 5/85 (5%)
 Frame = +3

Query: 351 LSSDVSSKSSSLVNTLSGLFTSSNFSLAFPYIAAPEEERMESLLLSGLKQACPHNAGVSS 530
           L + ++SKSS   N    LFT+SNFSLAFPYIAA EEERME+LLLSGLK+ACP+N GV S
Sbjct:  79 LHNTLTSKSSFQQN----LFTASNFSLAFPYIAASEEERMENLLLSGLKEACPNNVGV-S 133

Query: 531 NIVFSDSCFVEDGTIQKLSDLQSFK 605
           NIVFSDSCFVE GTIQKLSDLQSFK
Sbjct: 134 NIVFSDSCFVEHGTIQKLSDLQSFK 158


 Score =  92 bits (226), Expect = 9e-017
 Identities = 53/70 (75%), Positives = 56/70 (80%), Gaps = 4/70 (5%)
 Frame = +3

Query: 201 NEGVNYQVMSGKDLVDSVFTQGGWSNILCSEKKAEQPVIDVAPVFVGRELLSSDVSSK-- 374
           +E VNYQVMS KDLV SVFTQGGWSN L SEKK EQ V DV  VF+GRELLSSDVSSK  
Sbjct:   1 DEVVNYQVMSAKDLVGSVFTQGGWSNFLFSEKKLEQRV-DVVLVFIGRELLSSDVSSKRN 59

Query: 375 -SSSLVNTLS 401
             S+LVNTLS
Sbjct:  60 SDSALVNTLS 69

>gi|297844056|ref|XP_002889909.1| hypothetical protein ARALYDRAFT_334493
        [Arabidopsis lyrata subsp. lyrata]

          Length = 172

 Score =  118 bits (295), Expect = 9e-025
 Identities = 65/85 (76%), Positives = 72/85 (84%), Gaps = 5/85 (5%)
 Frame = +3

Query: 351 LSSDVSSKSSSLVNTLSGLFTSSNFSLAFPYIAAPEEERMESLLLSGLKQACPHNAGVSS 530
           L + ++SKSS   N    LFT+SNFSLAFPYIAA EEERME+LLLSGLK+ACP+N GV S
Sbjct:  71 LHNTLTSKSSFQQN----LFTASNFSLAFPYIAASEEERMENLLLSGLKEACPNNVGV-S 125

Query: 531 NIVFSDSCFVEDGTIQKLSDLQSFK 605
           NIVFSDSCFVE GTIQKLSDLQSFK
Sbjct: 126 NIVFSDSCFVEHGTIQKLSDLQSFK 150

>gi|4204260|gb|AAD10641.1| Hypothetical protein [Arabidopsis thaliana]

          Length = 76

 Score =  87 bits (214), Expect = 2e-015
 Identities = 49/72 (68%), Positives = 54/72 (75%), Gaps = 4/72 (5%)
 Frame = +3

Query: 225 MSGKDLVDSVFTQGGWSNILCSEKKAEQPVIDVAPVFVGRELLSSDVSSKSSS---LVNT 395
           MS KDLVDSVFT GGWSN LCSEKK +QPV DVA VF+GRELLSSDVSS  +S   LVNT
Sbjct:   1 MSAKDLVDSVFTLGGWSNFLCSEKKLQQPV-DVALVFIGRELLSSDVSSNQNSDPVLVNT 59

Query: 396 LSGLFTSSNFSL 431
           L  L +   + L
Sbjct:  60 LKSLNSEDPYVL 71

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,178,980,611,991
Number of Sequences: 15229318
Number of Extensions: 1178980611991
Number of Successful Extensions: 310660920
Number of sequences better than 0.0: 0
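
The bit scores in the report can be cross-checked against the raw scores shown in parentheses, using the gapped Lambda and K values listed above. The following is a minimal sketch, assuming the standard Karlin-Altschul conversion, bits = (Lambda * raw - ln K) / ln 2. The function name `bit_score` and the list `reported` are illustrative and not part of the report. Because Lambda and K are printed to only three decimal places, agreement is expected only to within about one bit.

```python
import math

# Gapped Karlin-Altschul parameters copied from the report footer.
LAMBDA = 0.267
K = 0.041


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# (raw score, reported bit score) pairs taken from the hit headers above.
reported = [(711, 278), (708, 277), (702, 275), (670, 263), (425, 168), (214, 87)]

for raw, bits in reported:
    # Print the reported value next to the recomputed one for comparison.
    print(f"raw={raw:4d}  reported={bits:4d}  recomputed={bit_score(raw):.1f}")
```

The recomputed values should land within one bit of the reported ones. A larger gap would suggest that a different Lambda/K pair or scoring system produced the hit.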