GenBank blast output of UN33379


BLASTX 7.6.2

Query= UN33379 /QuerySize=1346
        (1345 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|18415554|ref|NP_567615.1| dentin sialophosphoprotein-related ...    527   2e-147
gi|5262212|emb|CAB45838.1| hypothetical protein [Arabidopsis tha...    527   2e-147
gi|15220414|ref|NP_172002.1| dentin sialophosphoprotein-like pro...    525   1e-146
gi|4056417|gb|AAC97991.1| ESTs gb|H76594 and gb|H76252 come from...    525   1e-146
gi|297843308|ref|XP_002889535.1| hypothetical protein ARALYDRAFT...    501   1e-139

>gi|18415554|ref|NP_567615.1| dentin sialophosphoprotein-related protein
        [Arabidopsis thaliana]

          Length = 729

 Score =  527 bits (1356), Expect = 2e-147
 Identities = 287/424 (67%), Positives = 331/424 (78%), Gaps = 30/424 (7%)
 Frame = -3

Query: 1337 FEGKDAQRDSSFKDDESFGLFQG-KDAQRNSALKEDANFGLFEG-EDGQSNSASKEDEDF 1164
            FEGKDAQR SS KDDESFG+F+G KDAQRNS+ KED +FG+FEG ED Q NS+SKE+E+F
Sbjct:  308 FEGKDAQRTSSSKDDESFGMFEGKKDAQRNSSSKEDESFGMFEGKEDAQRNSSSKENENF 367

Query: 1163 GLFEAALSSNAGIKSFDDKVVTSSSTWDSDFQS----VSQEKSSSDPFVSSPVDLSAHMD 996
            G FE A  SNA +KSFDDK+V +SS WDSDFQS    +SQ+K   DPFVSSPVDL+AHMD
Sbjct:  368 GFFEGAPLSNADLKSFDDKIVAASSDWDSDFQSADQNLSQKKIDGDPFVSSPVDLAAHMD 427

Query:  995 TVFGSGKDLFYEKPEDSSTTYVSNAGDWLQDDLFGDVTGKTQNNDQTVH---EGQVVGGN 825
            +VFGSGKDL Y +P DSST YVS AGDWLQDDLFG+VTG+ Q ND  VH   EGQ+VGGN
Sbjct:  428 SVFGSGKDLLYAQPADSSTAYVSKAGDWLQDDLFGNVTGEAQTNDSAVHDKNEGQIVGGN 487

Query:  824 GSSSMDIDWMGDDLWQTSEKKAVEQTPTD----------DFASSVNSKTPSNLLSRTMES 675
            G+SSMDIDW+GDDLWQT+EKK++E+TPTD          DFASS NSKTP+N LS+TMES
Sbjct:  488 GNSSMDIDWIGDDLWQTNEKKSIEKTPTDVNDDDDDDWNDFASSANSKTPNNPLSQTMES 547

Query:  674 SQEEEIFDGLAHVKNDVNEQSKDEKQNTGIARVISDIGKGQEDDLFGNWDTFTSSTVLQT 495
            SQ  EIF G A  KN V EQS DEKQNT  + V+SDIGK QEDDLFG WD+FTSST+LQT
Sbjct:  548 SQ-FEIFYGHAQDKNGVKEQSVDEKQNTDTS-VMSDIGKCQEDDLFGTWDSFTSSTILQT 605

Query:  494 PVQSHTNQVYSSAELNPEVYLFGDNSHRSDLAFDC------FPESTGGQTKSEEVTAMPS 333
             +Q  T     S E NPE+ LFG+N++  DL FD       F ES+GG+T SEEV  +PS
Sbjct:  606 SLQPPTIHANPSGEKNPEMNLFGENNNNRDLDFDSISRSDFFSESSGGKTNSEEVKVIPS 665

Query:  332 GTSTLERTSDPDG-KDKTLDLV--GTTTSHKSKSDVAEELISQMHDLSFMLETKLSVPPI 162
            GTSTL+R SDPDG KD+T+DLV   TTT  KSKSDVAEEL+SQMHDLSFMLETKLSVPPI
Sbjct:  666 GTSTLDRPSDPDGSKDQTVDLVVGTTTTVPKSKSDVAEELMSQMHDLSFMLETKLSVPPI 725

Query:  161 TKAE 150
            +K E
Sbjct:  726 SKTE 729

>gi|5262212|emb|CAB45838.1| hypothetical protein [Arabidopsis thaliana]

          Length = 758

 Score =  527 bits (1356), Expect = 2e-147
 Identities = 287/424 (67%), Positives = 331/424 (78%), Gaps = 30/424 (7%)
 Frame = -3

Query: 1337 FEGKDAQRDSSFKDDESFGLFQG-KDAQRNSALKEDANFGLFEG-EDGQSNSASKEDEDF 1164
            FEGKDAQR SS KDDESFG+F+G KDAQRNS+ KED +FG+FEG ED Q NS+SKE+E+F
Sbjct:  337 FEGKDAQRTSSSKDDESFGMFEGKKDAQRNSSSKEDESFGMFEGKEDAQRNSSSKENENF 396

Query: 1163 GLFEAALSSNAGIKSFDDKVVTSSSTWDSDFQS----VSQEKSSSDPFVSSPVDLSAHMD 996
            G FE A  SNA +KSFDDK+V +SS WDSDFQS    +SQ+K   DPFVSSPVDL+AHMD
Sbjct:  397 GFFEGAPLSNADLKSFDDKIVAASSDWDSDFQSADQNLSQKKIDGDPFVSSPVDLAAHMD 456

Query:  995 TVFGSGKDLFYEKPEDSSTTYVSNAGDWLQDDLFGDVTGKTQNNDQTVH---EGQVVGGN 825
            +VFGSGKDL Y +P DSST YVS AGDWLQDDLFG+VTG+ Q ND  VH   EGQ+VGGN
Sbjct:  457 SVFGSGKDLLYAQPADSSTAYVSKAGDWLQDDLFGNVTGEAQTNDSAVHDKNEGQIVGGN 516

Query:  824 GSSSMDIDWMGDDLWQTSEKKAVEQTPTD----------DFASSVNSKTPSNLLSRTMES 675
            G+SSMDIDW+GDDLWQT+EKK++E+TPTD          DFASS NSKTP+N LS+TMES
Sbjct:  517 GNSSMDIDWIGDDLWQTNEKKSIEKTPTDVNDDDDDDWNDFASSANSKTPNNPLSQTMES 576

Query:  674 SQEEEIFDGLAHVKNDVNEQSKDEKQNTGIARVISDIGKGQEDDLFGNWDTFTSSTVLQT 495
            SQ  EIF G A  KN V EQS DEKQNT  + V+SDIGK QEDDLFG WD+FTSST+LQT
Sbjct:  577 SQ-FEIFYGHAQDKNGVKEQSVDEKQNTDTS-VMSDIGKCQEDDLFGTWDSFTSSTILQT 634

Query:  494 PVQSHTNQVYSSAELNPEVYLFGDNSHRSDLAFDC------FPESTGGQTKSEEVTAMPS 333
             +Q  T     S E NPE+ LFG+N++  DL FD       F ES+GG+T SEEV  +PS
Sbjct:  635 SLQPPTIHANPSGEKNPEMNLFGENNNNRDLDFDSISRSDFFSESSGGKTNSEEVKVIPS 694

Query:  332 GTSTLERTSDPDG-KDKTLDLV--GTTTSHKSKSDVAEELISQMHDLSFMLETKLSVPPI 162
            GTSTL+R SDPDG KD+T+DLV   TTT  KSKSDVAEEL+SQMHDLSFMLETKLSVPPI
Sbjct:  695 GTSTLDRPSDPDGSKDQTVDLVVGTTTTVPKSKSDVAEELMSQMHDLSFMLETKLSVPPI 754

Query:  161 TKAE 150
            +K E
Sbjct:  755 SKTE 758

>gi|15220414|ref|NP_172002.1| dentin sialophosphoprotein-like protein
        [Arabidopsis thaliana]

          Length = 706

 Score =  525 bits (1350), Expect = 1e-146
 Identities = 286/424 (67%), Positives = 330/424 (77%), Gaps = 30/424 (7%)
 Frame = -3

Query: 1337 FEGKDAQRDSSFKDDESFGLFQG-KDAQRNSALKEDANFGLFEG-EDGQSNSASKEDEDF 1164
            FEGKDAQR SS KDDESFG+F+G KDAQRNS+ KED +FG+FEG ED Q NS+SKE+E+F
Sbjct:  285 FEGKDAQRTSSSKDDESFGMFEGKKDAQRNSSSKEDESFGMFEGKEDAQRNSSSKENENF 344

Query: 1163 GLFEAALSSNAGIKSFDDKVVTSSSTWDSDFQS----VSQEKSSSDPFVSSPVDLSAHMD 996
            G FE A  SNA +KSFDDK+V +SS WDSDFQS    +SQ+K   DPFVSSPVDL+AHMD
Sbjct:  345 GFFEGAPLSNADLKSFDDKIVAASSDWDSDFQSADQNLSQKKIDGDPFVSSPVDLAAHMD 404

Query:  995 TVFGSGKDLFYEKPEDSSTTYVSNAGDWLQDDLFGDVTGKTQNNDQTVH---EGQVVGGN 825
            +VFGSGKDL Y +P DSST YVS AGDWLQDDLFG+VTG+ Q ND  VH   EGQ+VGGN
Sbjct:  405 SVFGSGKDLLYAQPADSSTAYVSKAGDWLQDDLFGNVTGEAQTNDSAVHDKNEGQIVGGN 464

Query:  824 GSSSMDIDWMGDDLWQTSEKKAVEQTPTD----------DFASSVNSKTPSNLLSRTMES 675
            G+SSMDIDW+GDDLWQT+EKK++E+TPTD          DFASS NSKTP+N LS+TMES
Sbjct:  465 GNSSMDIDWIGDDLWQTNEKKSIEKTPTDVNDDDDDDWNDFASSANSKTPNNPLSQTMES 524

Query:  674 SQEEEIFDGLAHVKNDVNEQSKDEKQNTGIARVISDIGKGQEDDLFGNWDTFTSSTVLQT 495
            SQ  EIF G A  KN V EQS DEKQNT  + V+SDIGK QEDDLFG WD+FTSST+LQT
Sbjct:  525 SQ-FEIFYGHAQDKNGVKEQSVDEKQNTDTS-VMSDIGKCQEDDLFGTWDSFTSSTILQT 582

Query:  494 PVQSHTNQVYSSAELNPEVYLFGDNSHRSDLAFDC------FPESTGGQTKSEEVTAMPS 333
             +Q  T     S E NPE+ LFG+N++  DL FD       F ES+GG+T SEEV  +PS
Sbjct:  583 SLQPPTIHANPSGEKNPEMNLFGENNNNRDLDFDSISRSDFFSESSGGKTNSEEVKVIPS 642

Query:  332 GTSTLERTSDPDG-KDKTLDLV--GTTTSHKSKSDVAEELISQMHDLSFMLETKLSVPPI 162
            GTSTL+R SDPDG KD+T+DLV   TTT  KS SDVAEEL+SQMHDLSFMLETKLSVPPI
Sbjct:  643 GTSTLDRPSDPDGSKDQTVDLVVGTTTTVPKSMSDVAEELMSQMHDLSFMLETKLSVPPI 702

Query:  161 TKAE 150
            +K E
Sbjct:  703 SKTE 706

>gi|4056417|gb|AAC97991.1| ESTs gb|H76594 and gb|H76252 come from this gene
        [Arabidopsis thaliana]

          Length = 747

 Score =  525 bits (1350), Expect = 1e-146
 Identities = 286/424 (67%), Positives = 330/424 (77%), Gaps = 30/424 (7%)
 Frame = -3

Query: 1337 FEGKDAQRDSSFKDDESFGLFQG-KDAQRNSALKEDANFGLFEG-EDGQSNSASKEDEDF 1164
            FEGKDAQR SS KDDESFG+F+G KDAQRNS+ KED +FG+FEG ED Q NS+SKE+E+F
Sbjct:  326 FEGKDAQRTSSSKDDESFGMFEGKKDAQRNSSSKEDESFGMFEGKEDAQRNSSSKENENF 385

Query: 1163 GLFEAALSSNAGIKSFDDKVVTSSSTWDSDFQS----VSQEKSSSDPFVSSPVDLSAHMD 996
            G FE A  SNA +KSFDDK+V +SS WDSDFQS    +SQ+K   DPFVSSPVDL+AHMD
Sbjct:  386 GFFEGAPLSNADLKSFDDKIVAASSDWDSDFQSADQNLSQKKIDGDPFVSSPVDLAAHMD 445

Query:  995 TVFGSGKDLFYEKPEDSSTTYVSNAGDWLQDDLFGDVTGKTQNNDQTVH---EGQVVGGN 825
            +VFGSGKDL Y +P DSST YVS AGDWLQDDLFG+VTG+ Q ND  VH   EGQ+VGGN
Sbjct:  446 SVFGSGKDLLYAQPADSSTAYVSKAGDWLQDDLFGNVTGEAQTNDSAVHDKNEGQIVGGN 505

Query:  824 GSSSMDIDWMGDDLWQTSEKKAVEQTPTD----------DFASSVNSKTPSNLLSRTMES 675
            G+SSMDIDW+GDDLWQT+EKK++E+TPTD          DFASS NSKTP+N LS+TMES
Sbjct:  506 GNSSMDIDWIGDDLWQTNEKKSIEKTPTDVNDDDDDDWNDFASSANSKTPNNPLSQTMES 565

Query:  674 SQEEEIFDGLAHVKNDVNEQSKDEKQNTGIARVISDIGKGQEDDLFGNWDTFTSSTVLQT 495
            SQ  EIF G A  KN V EQS DEKQNT  + V+SDIGK QEDDLFG WD+FTSST+LQT
Sbjct:  566 SQ-FEIFYGHAQDKNGVKEQSVDEKQNTDTS-VMSDIGKCQEDDLFGTWDSFTSSTILQT 623

Query:  494 PVQSHTNQVYSSAELNPEVYLFGDNSHRSDLAFDC------FPESTGGQTKSEEVTAMPS 333
             +Q  T     S E NPE+ LFG+N++  DL FD       F ES+GG+T SEEV  +PS
Sbjct:  624 SLQPPTIHANPSGEKNPEMNLFGENNNNRDLDFDSISRSDFFSESSGGKTNSEEVKVIPS 683

Query:  332 GTSTLERTSDPDG-KDKTLDLV--GTTTSHKSKSDVAEELISQMHDLSFMLETKLSVPPI 162
            GTSTL+R SDPDG KD+T+DLV   TTT  KS SDVAEEL+SQMHDLSFMLETKLSVPPI
Sbjct:  684 GTSTLDRPSDPDGSKDQTVDLVVGTTTTVPKSMSDVAEELMSQMHDLSFMLETKLSVPPI 743

Query:  161 TKAE 150
            +K E
Sbjct:  744 SKTE 747

>gi|297843308|ref|XP_002889535.1| hypothetical protein ARALYDRAFT_470500
        [Arabidopsis lyrata subsp. lyrata]

          Length = 701

 Score =  501 bits (1289), Expect = 1e-139
 Identities = 277/424 (65%), Positives = 320/424 (75%), Gaps = 53/424 (12%)
 Frame = -3

Query: 1343 GFFEGKDAQRDSSFKDDESFGLFQGKDAQRNSALKEDANFGLFEG-EDGQSNSASKEDED 1167
            G FEGKD QR+SS K+DES GLF GKDAQR S+ K+D +FG+FEG ED Q NS+SKEDE+
Sbjct:  305 GLFEGKDTQRNSSSKEDESPGLFMGKDAQRTSSSKDDESFGMFEGKEDAQRNSSSKEDEN 364

Query: 1166 FGLFEAALSSNAGIKSFDDKVVTSSSTWDSDFQSV----SQEKSSSDPFVSSPVDLSAHM 999
            FGLFE A SS A +KSFDDK+V +SS WDSDFQS     SQ+K   DPFVSSPVDL+AHM
Sbjct:  365 FGLFEGAPSSTADLKSFDDKIVATSSDWDSDFQSADHNPSQKKVGGDPFVSSPVDLAAHM 424

Query:  998 DTVFGSGKDLFYEKPEDSSTTYVSNAGDWLQDDLFGDVTGKTQNNDQTVH---EGQVVGG 828
            D+VFGSGKDL Y KP           GDWLQDDLFG+VTG+ QN+D  VH   EGQVVGG
Sbjct:  425 DSVFGSGKDLLYAKP-----------GDWLQDDLFGNVTGEAQNSDSAVHDKNEGQVVGG 473

Query:  827 NGSSSMDIDWMGDDLWQTSEKKAVEQTPTD---------DFASSVNSKTPSNLLSRTMES 675
            NGSSSMDIDW+GDDLWQT+EKK++E+TPTD         DFASS NSKTP+N LS+TMES
Sbjct:  474 NGSSSMDIDWIGDDLWQTNEKKSIEKTPTDVNDDDDDWNDFASSANSKTPNNPLSQTMES 533

Query:  674 SQEEEIFDGLAHVKNDVNEQSKDEKQNTGIARVISDIGKGQEDDLFGNWDTFTSSTVLQT 495
            SQ +E F G A VKN V EQS DEKQNT    V+SDIGKGQEDD+FG WD+FTSST+ QT
Sbjct:  534 SQ-DEFFYGQAQVKNGVKEQSVDEKQNT----VMSDIGKGQEDDIFGTWDSFTSSTIPQT 588

Query:  494 PVQSHTNQVYSSAELNPEVYLFGDNSHRSDLAFDC------FPESTGGQTKSEEVTAMPS 333
                       S E  P++ LFG+N++  DL FD       F ES+GG+T SEEV  +PS
Sbjct:  589 -----------SGEKYPKMNLFGENNNHRDLDFDSISRSDFFSESSGGKTNSEEVKVIPS 637

Query:  332 GTSTLERTSDPDG-KDKTLDLV--GTTTSHKSKSDVAEELISQMHDLSFMLETKLSVPPI 162
            GTSTL+RTSDPDG KD+T+DLV   TTT+ KSKSDVAEEL+SQMHDLSFMLETKLSVPPI
Sbjct:  638 GTSTLDRTSDPDGSKDQTVDLVVGTTTTAPKSKSDVAEELMSQMHDLSFMLETKLSVPPI 697

Query:  161 TKAE 150
            +K E
Sbjct:  698 SKTE 701


 Score =  90 bits (222), Expect = 7e-016
 Identities = 55/136 (40%), Positives = 75/136 (55%), Gaps = 9/136 (6%)
 Frame = -3

Query: 1337 FEGKDAQRDSSFKDDESFGLFQGKDAQRNSALKEDANFGLFEGEDGQSNSASKEDEDFGL 1158
            FEGK AQ+ SS K+DESFGLF+GKD QRNS+ KED + GLF G+D Q  S+SK+DE FG+
Sbjct:  287 FEGKVAQKTSSSKEDESFGLFEGKDTQRNSSSKEDESPGLFMGKDAQRTSSSKDDESFGM 346

Query: 1157 FEAALSSNAGIKSFDDKVVTSSSTWDSDFQSVSQEKSSSDPFVSSPVDLSA------HMD 996
            FE    +     S +D+   +   ++    S +  KS  D  V++  D  +      H  
Sbjct:  347 FEGKEDAQRNSSSKEDE---NFGLFEGAPSSTADLKSFDDKIVATSSDWDSDFQSADHNP 403

Query:  995 TVFGSGKDLFYEKPED 948
            +    G D F   P D
Sbjct:  404 SQKKVGGDPFVSSPVD 419


 Score =  83 bits (204), Expect = 8e-014
 Identities = 50/127 (39%), Positives = 69/127 (54%), Gaps = 5/127 (3%)
 Frame = -3

Query: 1343 GFFEGKDAQRDSSFKDDESFGLFQGKDAQRNSALKEDANFGLFEGEDGQSNSASKEDEDF 1164
            GFFE KD Q  +SFK++E+  LF+GK AQ+ S+ KED +FGLFEG+D Q NS+SKEDE  
Sbjct:  267 GFFEEKDGQ--NSFKENENLSLFEGKVAQKTSSSKEDESFGLFEGKDTQRNSSSKEDESP 324

Query: 1163 GLFEAALSSNAGIKSFDDKVVTSSSTWDSDFQSVSQEKSSSDPFVSSP---VDLSAHMDT 993
            GLF    +        D+         D+   S S+E  +   F  +P    DL +  D 
Sbjct:  325 GLFMGKDAQRTSSSKDDESFGMFEGKEDAQRNSSSKEDENFGLFEGAPSSTADLKSFDDK 384

Query:  992 VFGSGKD 972
            +  +  D
Sbjct:  385 IVATSSD 391

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,810,539,025,462
Number of Sequences: 15229318
Number of Extensions: 3810539025462
Number of Successful Extensions: 885398663
Number of sequences better than 0.0: 0
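
The bit scores in the Score lines above can be cross-checked against the raw scores (the values in parentheses) using the gapped Lambda and K from the statistics block. The sketch below assumes the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2. The function name `bit_score` and the `reported` list are illustrative and not part of the BLAST output. Because the report prints Lambda and K to three decimals, the results are approximate.

```python
import math

# Gapped statistical parameters copied from the statistics block above.
LAMBDA = 0.267
K = 0.041


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# (raw score, reported bit score) pairs copied from the Score lines above.
reported = [(1356, 527), (1350, 525), (1289, 501), (222, 90), (204, 83)]

for raw, bits in reported:
    # Print the recomputed value next to the value shown in the report.
    print(f"raw={raw:5d}  computed={bit_score(raw):7.1f}  reported={bits}")
```

Each computed value rounds to the bit score shown in the report, which suggests the Score lines and the Lambda/K block are mutually consistent.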