Library    |     Search    |     Batch query    |     SNP    |     SSR  

GenBank blast output of UN82542


BLASTX 7.6.2

Query= UN82542 /QuerySize=811
        (810 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|7594584|emb|CAB88077.1| hypothetical protein [Arabidopsis tha...    318   7e-085
gi|21554414|gb|AAM63519.1| glycoprotein homolog [Arabidopsis tha...    318   7e-085
gi|15235884|ref|NP_193412.1| hydroxyproline-rich glycoprotein fa...    316   3e-084
gi|8919877|emb|CAB96200.1| hypothetical protein [Capsella rubella]     314   1e-083
gi|297800428|ref|XP_002868098.1| hydroxyproline-rich glycoprotei...    307   2e-081
gi|255581309|ref|XP_002531465.1| conserved hypothetical protein ...    109   8e-022
gi|296085704|emb|CBI29503.3| unnamed protein product [Vitis vini...     95   1e-017
gi|224074191|ref|XP_002304294.1| predicted protein [Populus tric...     87   2e-015
gi|225453426|ref|XP_002272713.1| PREDICTED: hypothetical protein...     77   3e-012
gi|147843896|emb|CAN81597.1| hypothetical protein VITISV_039396 ...     75   1e-011
gi|297820898|ref|XP_002878332.1| hypothetical protein ARALYDRAFT...     65   8e-009
gi|15232309|ref|NP_191597.1| uncharacterized protein [Arabidopsi...     63   4e-008
gi|255541082|ref|XP_002511605.1| hypothetical protein RCOM_16086...     62   1e-007

>gi|7594584|emb|CAB88077.1| hypothetical protein [Arabidopsis thaliana]

          Length = 471

 Score =  318 bits (814), Expect = 7e-085
 Identities = 186/279 (66%), Positives = 207/279 (74%), Gaps = 39/279 (13%)
 Frame = +1

Query:   1 QSGDKEDQNPRKFYSRFLFKALILALLCSLVPVFLSQTPELANQTRLLELLHMIFVGIAV 180
           Q G+KEDQNPRKFYSRF+FKALIL +LC++VPVFLSQTPELANQTRLLELLH++FVGIAV
Sbjct:  10 QLGNKEDQNPRKFYSRFIFKALILTVLCAVVPVFLSQTPELANQTRLLELLHLVFVGIAV 69

Query: 181 SYGLFSRRNYDGGGGGRGSNNDHN---NNTNNPHPYVPKILEVSSSVFNVDHES---GSD 342
           SYGLFSRRNYDGGGGG  SN+DHN   ++ NN H YVPKILEV SSVFNV HES    SD
Sbjct:  70 SYGLFSRRNYDGGGGGGTSNSDHNKADHSNNNSHSYVPKILEV-SSVFNVGHESESEPSD 128

Query: 343 DSSGDHPHPHRRIQNKYQTKSTETGLSKDNESRFVDRVSSGVREKPLLLPVRSLNYSHLS 522
           DSSGD     +  +NKY  K  E       E+RFVDRVSS  REKPLLLPVRSLNYS +S
Sbjct: 129 DSSGDQ-RKFQTWKNKYHMKIPEV------ETRFVDRVSSENREKPLLLPVRSLNYSRVS 181

Query: 523 D-SGDGSGRWERVRSKRQLLKTLVDDDSTDALPSPIPWRSRSSME--------------- 654
           D SGD SGRWE+VRSKR+LLKTL DD+S D LPSPIPWRSRSS                 
Sbjct: 182 DSSGDNSGRWEKVRSKRELLKTLGDDNS-DVLPSPIPWRSRSSSSSSSSSKEVESLPSVK 240

Query: 655 ----IESQPLIKTPASLTSSSALSSSPRKSTPLPKLTSE 759
               +ESQPLIK   +LT SS+  SSPRKS P+P L SE
Sbjct: 241 NLTTVESQPLIK---NLTPSSSF-SSPRKSNPIPNLASE 275

>gi|21554414|gb|AAM63519.1| glycoprotein homolog [Arabidopsis thaliana]

          Length = 473

 Score =  318 bits (814), Expect = 7e-085
 Identities = 186/279 (66%), Positives = 207/279 (74%), Gaps = 39/279 (13%)
 Frame = +1

Query:   1 QSGDKEDQNPRKFYSRFLFKALILALLCSLVPVFLSQTPELANQTRLLELLHMIFVGIAV 180
           Q G+KEDQNPRKFYSRF+FKALIL +LC++VPVFLSQTPELANQTRLLELLH++FVGIAV
Sbjct:  12 QLGNKEDQNPRKFYSRFIFKALILTVLCAVVPVFLSQTPELANQTRLLELLHLVFVGIAV 71

Query: 181 SYGLFSRRNYDGGGGGRGSNNDHN---NNTNNPHPYVPKILEVSSSVFNVDHES---GSD 342
           SYGLFSRRNYDGGGGG  SN+DHN   ++ NN H YVPKILEV SSVFNV HES    SD
Sbjct:  72 SYGLFSRRNYDGGGGGGTSNSDHNKADHSNNNSHSYVPKILEV-SSVFNVGHESESEPSD 130

Query: 343 DSSGDHPHPHRRIQNKYQTKSTETGLSKDNESRFVDRVSSGVREKPLLLPVRSLNYSHLS 522
           DSSGD     +  +NKY  K  E       E+RFVDRVSS  REKPLLLPVRSLNYS +S
Sbjct: 131 DSSGDQ-RKFQTWKNKYHMKIPEV------ETRFVDRVSSENREKPLLLPVRSLNYSRVS 183

Query: 523 D-SGDGSGRWERVRSKRQLLKTLVDDDSTDALPSPIPWRSRSSME--------------- 654
           D SGD SGRWE+VRSKR+LLKTL DD+S D LPSPIPWRSRSS                 
Sbjct: 184 DSSGDNSGRWEKVRSKRELLKTLGDDNS-DVLPSPIPWRSRSSSSSSSSSKEVESLPSVK 242

Query: 655 ----IESQPLIKTPASLTSSSALSSSPRKSTPLPKLTSE 759
               +ESQPLIK   +LT SS+  SSPRKS P+P L SE
Sbjct: 243 NLTTVESQPLIK---NLTPSSSF-SSPRKSNPIPNLASE 277

>gi|15235884|ref|NP_193412.1| hydroxyproline-rich glycoprotein family protein
        [Arabidopsis thaliana]

          Length = 473

 Score =  316 bits (809), Expect = 3e-084
 Identities = 185/279 (66%), Positives = 206/279 (73%), Gaps = 39/279 (13%)
 Frame = +1

Query:   1 QSGDKEDQNPRKFYSRFLFKALILALLCSLVPVFLSQTPELANQTRLLELLHMIFVGIAV 180
           Q G+KEDQNPRKFYSRF+FKALIL +LC++VPVFLSQTPELANQTRLLELLH++FVGIAV
Sbjct:  12 QLGNKEDQNPRKFYSRFIFKALILTVLCAVVPVFLSQTPELANQTRLLELLHLVFVGIAV 71

Query: 181 SYGLFSRRNYDGGGGGRGSNNDHN---NNTNNPHPYVPKILEVSSSVFNVDHES---GSD 342
           SYGLFSRRNYDGGGGG  SN+DHN   ++ NN H YVPKILEV SSVFNV HES    SD
Sbjct:  72 SYGLFSRRNYDGGGGGGTSNSDHNKADHSNNNSHSYVPKILEV-SSVFNVGHESESEPSD 130

Query: 343 DSSGDHPHPHRRIQNKYQTKSTETGLSKDNESRFVDRVSSGVREKPLLLPVRSLNYSHLS 522
           DSSGD     +  +NKY  K  E       E+RFVDRVSS  REKPLLLPVRSLNYS +S
Sbjct: 131 DSSGDQ-RKFQTWKNKYHMKIPEV------ETRFVDRVSSENREKPLLLPVRSLNYSRVS 183

Query: 523 D-SGDGSGRWERVRSKRQLLKTLVDDDSTDALPSPIPWRSRSSME--------------- 654
           D SGD SGRWE+VRSKR+LLKTL DD+S D LPSPIPWRSRSS                 
Sbjct: 184 DSSGDNSGRWEKVRSKRELLKTLGDDNS-DVLPSPIPWRSRSSSSSSSSSKEVESLPSVK 242

Query: 655 ----IESQPLIKTPASLTSSSALSSSPRKSTPLPKLTSE 759
               +ESQPLIK   +LT  S+  SSPRKS P+P L SE
Sbjct: 243 NLTTVESQPLIK---NLTPPSSF-SSPRKSNPIPNLASE 277

>gi|8919877|emb|CAB96200.1| hypothetical protein [Capsella rubella]

          Length = 470

 Score =  314 bits (803), Expect = 1e-083
 Identities = 181/276 (65%), Positives = 205/276 (74%), Gaps = 36/276 (13%)
 Frame = +1

Query:   1 QSGDKEDQNPRKFYSRFLFKALILALLCSLVPVFLSQTPELANQTRLLELLHMIFVGIAV 180
           Q G KEDQNP +FYSRF+FKALIL +LC++VPVFLSQTPELANQTRLLELLH++FVGIAV
Sbjct:  12 QLGTKEDQNPTRFYSRFIFKALILTVLCAVVPVFLSQTPELANQTRLLELLHLVFVGIAV 71

Query: 181 SYGLFSRRNYDGGGGGRGSNNDHN----NNTNNPHPYVPKILEVSSSVFNVDHES---GS 339
           SYGLFSRRNYDGGG G  SN+D+N    +N NN H YVPK+LEV SSVFNVDHES    S
Sbjct:  72 SYGLFSRRNYDGGGAGGSSNSDYNKADHHNNNNSHSYVPKLLEV-SSVFNVDHESESEPS 130

Query: 340 DDSSGDHPHPHRRIQNKYQTKSTETGLSKDNESRFVDRVSSGVREKPLLLPVRSLNYSHL 519
           DDSSGDH    +  +NKY  K  E       E+RFVDRVSS +REKPLLLPVRSLNY  +
Sbjct: 131 DDSSGDH-RKFQAWRNKYHMKIPEV------ETRFVDRVSSEIREKPLLLPVRSLNYYPV 183

Query: 520 SD-SGDGSGRWERVRSKRQLLKTLVDDDSTDALPSPIPWRSRSSME-------------- 654
            D SGD SGRW++VRSKRQLLKTL DD+S D LPSPIPWRSRSS                
Sbjct: 184 PDSSGDNSGRWDKVRSKRQLLKTLGDDNS-DVLPSPIPWRSRSSSSSKEIESPPSIKNLT 242

Query: 655 -IESQPLIKTPASLTSSSALSSSPRKSTPLPKLTSE 759
            +ESQPLIK   +LT SS+  SSPRKS P+P L S+
Sbjct: 243 TVESQPLIK---NLTPSSSY-SSPRKSNPIPNLASQ 274

>gi|297800428|ref|XP_002868098.1| hydroxyproline-rich glycoprotein family
        protein [Arabidopsis lyrata subsp. lyrata]

          Length = 461

 Score =  307 bits (785), Expect = 2e-081
 Identities = 180/267 (67%), Positives = 203/267 (76%), Gaps = 27/267 (10%)
 Frame = +1

Query:   1 QSGDKEDQNPRKFYSRFLFKALILALLCSLVPVFLSQTPELANQTRLLELLHMIFVGIAV 180
           Q G+KEDQNPRKFYSRFLFKALIL LLC++VPVFLSQTPELANQTRL+ELLH++FVGIAV
Sbjct:  12 QLGNKEDQNPRKFYSRFLFKALILTLLCAVVPVFLSQTPELANQTRLIELLHLVFVGIAV 71

Query: 181 SYGLFSRRNYDGGGGGRGSNNDHN---NNTNNPHPYVPKILEVSSSVFNVDHES---GSD 342
           SYGLFSRRNYDGGGG   SN+D+N   ++ NN HPYVPKILEV SSVFNV +ES    SD
Sbjct:  72 SYGLFSRRNYDGGGGEGTSNSDNNKADHSNNNLHPYVPKILEV-SSVFNVGNESESEPSD 130

Query: 343 DSSGDHPHPHRRIQNKYQTKSTETGLSKDNESRFVDRVSSGVREKPLLLPVRSLNYSHLS 522
           DSSGD     +  +NKY  K  E       E+RFV+RVSS +REKPLLLPVRSLNYS + 
Sbjct: 131 DSSGDQ-RKFQTWKNKYHMKIPEV------ETRFVERVSSEIREKPLLLPVRSLNYSRVP 183

Query: 523 D-SGDGSGRWERVRSKRQLLKTLVDDDSTDALPSPIPWRSRSSME-------IESQPLIK 678
           D S D SGRWE+VRSKR+LLKTL DD+S D LPSPIPWRSRSS         +ESQP IK
Sbjct: 184 DSSSDNSGRWEKVRSKRELLKTLGDDNS-DVLPSPIPWRSRSSSSSVKNMATVESQPWIK 242

Query: 679 TPASLTSSSALSSSPRKSTPLPKLTSE 759
              +LT SSA   SPRKS  LP L S+
Sbjct: 243 ---NLTPSSAF-PSPRKSNLLPNLASQ 265

>gi|255581309|ref|XP_002531465.1| conserved hypothetical protein [Ricinus
        communis]

          Length = 565

 Score =  109 bits (270), Expect = 8e-022
 Identities = 72/183 (39%), Positives = 100/183 (54%), Gaps = 11/183 (6%)
 Frame = +1

Query:   1 QSGDKEDQNPRKFYSRFLFKALILALLCSLVPVFLSQTPELANQ---TRLLELLHMIFVG 171
           Q+    + NP KFYS FL+KALI+ +   ++P+F SQ PE  NQ   TR  E LH+IFVG
Sbjct:  14 QNQANNNNNPSKFYSHFLYKALIVTIFLVILPLFPSQAPEFINQTLNTRGWEFLHLIFVG 73

Query: 172 IAVSYGLFSRRNYDGGGGGRGSNNDHNNNTNNPHPYVPKILEVSSSVFNVDHESGSDDSS 351
           IAVSYGLFSRRN +        +N  N+  +N   YV + L+V SSVF+ D +S S    
Sbjct:  74 IAVSYGLFSRRNDE-----TEKDNSSNSKFDNAQSYVSRFLQV-SSVFDDDADSPSKSDV 127

Query: 352 GDHPHPHRRIQNKYQTKSTETGLSKDNESRFVDRVSSGVR--EKPLLLPVRSLNYSHLSD 525
            +           Y+ +       + + +   ++ S+G R  EKPLLLP+RSL    L  
Sbjct: 128 SNSTSVQTWNNQYYRNEPVVVVAEEQHPAFDQEQRSTGSRIGEKPLLLPIRSLKSRVLDA 187

Query: 526 SGD 534
            G+
Sbjct: 188 DGN 190

>gi|296085704|emb|CBI29503.3| unnamed protein product [Vitis vinifera]

          Length = 386

 Score =  95 bits (234), Expect = 1e-017
 Identities = 56/113 (49%), Positives = 76/113 (67%), Gaps = 13/113 (11%)
 Frame = +1

Query:  25 NPRKFYSRFLFKALILALLCSLVPVFLSQTPELANQT---RLLELLHMIFVGIAVSYGLF 195
           NP KFYS FL+KALI+ L  +++P+F SQ PE  NQT   R  ELLH++FVGIAVSYGLF
Sbjct:  13 NPSKFYSGFLYKALIVTLFLAILPLFPSQAPEFINQTVFNRSWELLHLVFVGIAVSYGLF 72

Query: 196 SRRNYDGGGGGRGSNNDHNNNTNNPHPYVPKILEVSSSVFN--VDHESGSDDS 348
           SRRN +       +  ++++  +N   YV + L+V SSVF+  V+  SGS +S
Sbjct:  73 SRRNDE-------TEKENHSKFDNAQSYVSRFLQV-SSVFDEEVESPSGSGES 117

>gi|224074191|ref|XP_002304294.1| predicted protein [Populus trichocarpa]

          Length = 580

 Score =  87 bits (215), Expect = 2e-015
 Identities = 49/110 (44%), Positives = 72/110 (65%), Gaps = 6/110 (5%)
 Frame = +1

Query:  13 KEDQNPRKFYSRFLFKALILALLCSLVPVFLSQTPELANQ---TRLLELLHMIFVGIAVS 183
           ++  NP K+Y+ FL+KALI+ +   ++P+F SQ PE  NQ   TR  E LH++FVGIAVS
Sbjct:  13 QKQANPTKYYTHFLYKALIVTVFLIILPLFPSQAPEFINQTLNTRGWEFLHLVFVGIAVS 72

Query: 184 YGLFSRRNYDGGGGGRGSNNDHNNNTNNPHPYVPKILEVSSSVFNVDHES 333
           YGLFS+RN +       +N+ + +  +N   YV + L+V SSVF+ D +S
Sbjct:  73 YGLFSKRNDE--TEKENNNSSNQSKFDNAQSYVSRFLQV-SSVFDDDVDS 119

>gi|225453426|ref|XP_002272713.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 549

 Score =  77 bits (188), Expect = 3e-012
 Identities = 47/117 (40%), Positives = 68/117 (58%), Gaps = 15/117 (12%)
 Frame = +1

Query:  13 KEDQNPRKFYSRFLFKALILALLCSLVPVFLSQTPELANQ---TRLLELLHMIFVGIAVS 183
           + ++ P K  + FL K+LI AL   ++P+F SQ PE  N    T+  ELLH++F+GIAVS
Sbjct:  11 RPNRTPSKSCTHFLCKSLIFALFLVVIPLFPSQAPEYINHTLITKFWELLHLLFIGIAVS 70

Query: 184 YGLFSRRNYDGGGGGRGSNNDHNNNTNNPHPYVPKILEVSSSVFNVDHESGSDDSSG 354
           YG+FSRRN D G        + ++  +N   Y  + L V SS+F    E G ++S G
Sbjct:  71 YGVFSRRNVDRG-------IESHSTVDNSESYASRFLHV-SSIF----EDGFENSCG 115

>gi|147843896|emb|CAN81597.1| hypothetical protein VITISV_039396 [Vitis
        vinifera]

          Length = 909

 Score =  75 bits (183), Expect = 1e-011
 Identities = 46/117 (39%), Positives = 67/117 (57%), Gaps = 15/117 (12%)
 Frame = +1

Query:  13 KEDQNPRKFYSRFLFKALILALLCSLVPVFLSQTPELANQ---TRLLELLHMIFVGIAVS 183
           + ++ P K  + FL K+LI AL   ++P+F SQ PE  N    T+  ELLH++F+GIAVS
Sbjct:  11 RPNRTPSKSCTHFLCKSLIFALFLVVIPLFPSQAPEYINHTLITKFWELLHLLFIGIAVS 70

Query: 184 YGLFSRRNYDGGGGGRGSNNDHNNNTNNPHPYVPKILEVSSSVFNVDHESGSDDSSG 354
           YG+FSRRN D G        + ++  +N   Y  +   V SS+F    E G ++S G
Sbjct:  71 YGVFSRRNVDRG-------IESHSTVDNSESYASRFXHV-SSIF----EDGFENSCG 115

>gi|297820898|ref|XP_002878332.1| hypothetical protein ARALYDRAFT_907560
        [Arabidopsis lyrata subsp. lyrata]

          Length = 741

 Score =  65 bits (158), Expect = 8e-009
 Identities = 39/100 (39%), Positives = 61/100 (61%), Gaps = 9/100 (9%)
 Frame = +1

Query:  49 FLFKALILALLCSLVPVFLSQTPELANQ---TRLLELLHMIFVGIAVSYGLFSRRNYDGG 219
           F  K+++ AL    +P+F SQ P+   +   T+  EL+H++FVGIAV+YGLFSRRN + G
Sbjct:  33 FFCKSVLFALFLFALPLFPSQAPDFVGETVLTKFWELIHLLFVGIAVAYGLFSRRNVESG 92

Query: 220 GGGRGSNNDHNNNTNNPHPYVPKILEVSSSVFNVDHESGS 339
              R +  D ++ +     YV +I +V SSVF+ + +  S
Sbjct:  93 VDLRMNRVDESSLS-----YVSRIFQV-SSVFDEEFDDNS 126

>gi|15232309|ref|NP_191597.1| uncharacterized protein [Arabidopsis thaliana]

          Length = 743

 Score =  63 bits (152), Expect = 4e-008
 Identities = 38/100 (38%), Positives = 60/100 (60%), Gaps = 9/100 (9%)
 Frame = +1

Query:  49 FLFKALILALLCSLVPVFLSQTPELANQ---TRLLELLHMIFVGIAVSYGLFSRRNYDGG 219
           F  K+++ AL    +P+F SQ P+   +   T+  EL+H++FVGIAV+YGLFSRRN +  
Sbjct:  32 FFCKSVLFALFLLALPLFPSQAPDFVGETVLTKFWELIHLLFVGIAVAYGLFSRRNVESA 91

Query: 220 GGGRGSNNDHNNNTNNPHPYVPKILEVSSSVFNVDHESGS 339
              R +  D ++ +     YV +I +V SSVF+ + +  S
Sbjct:  92 VDLRMTRVDESSLS-----YVSRIFQV-SSVFDEEFDDNS 125

>gi|255541082|ref|XP_002511605.1| hypothetical protein RCOM_1608690 [Ricinus
        communis]

          Length = 638

 Score =  62 bits (148), Expect = 1e-007
 Identities = 30/64 (46%), Positives = 41/64 (64%), Gaps = 3/64 (4%)
 Frame = +1

Query:  34 KFYSRFLFKALILALLCSLVPVFLSQTPELANQTRLL---ELLHMIFVGIAVSYGLFSRR 204
           K + R + K+L   L    +P+F SQ P   NQT L    EL+H++F+G+AVSYGLFS R
Sbjct:  19 KSFIRIICKSLFFVLFLIAIPLFPSQAPNFVNQTLLTKFWELVHLLFIGVAVSYGLFSSR 78

Query: 205 NYDG 216
           N +G
Sbjct:  79 NVEG 82

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,352,524,086,339
Number of Sequences: 15229318
Number of Extensions: 1352524086339
Number of Successful Extensions: 374284986
Number of sequences better than 0.0: 0