Library    |     Search    |     Batch query    |     SNP    |     SSR  

GenBank blast output of UN10842


BLASTX 7.6.2

Query= UN10842 /QuerySize=1434
        (1433 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|18415554|ref|NP_567615.1| dentin sialophosphoprotein-related ...    560   3e-157
gi|5262212|emb|CAB45838.1| hypothetical protein [Arabidopsis tha...    560   3e-157
gi|15220414|ref|NP_172002.1| dentin sialophosphoprotein-like pro...    555   8e-156
gi|4056417|gb|AAC97991.1| ESTs gb|H76594 and gb|H76252 come from...    555   8e-156
gi|297843308|ref|XP_002889535.1| hypothetical protein ARALYDRAFT...    507   2e-141

>gi|18415554|ref|NP_567615.1| dentin sialophosphoprotein-related protein
        [Arabidopsis thaliana]

          Length = 729

 Score =  560 bits (1441), Expect = 3e-157
 Identities = 310/455 (68%), Positives = 353/455 (77%), Gaps = 33/455 (7%)
 Frame = -1

Query: 1412 F*EDKNLSLFDGK-DSLRTSSSRKDDSFGFFEGRDAQGTSSFKEDESFGVFEG-KGAQRN 1239
            F ED+NLSLF+GK D+ RTSSS+ D+SFGFFEG+DAQ TSS K+DESFG+FEG K AQRN
Sbjct:  278 FKEDENLSLFEGKEDAQRTSSSKVDESFGFFEGKDAQRTSSSKDDESFGMFEGKKDAQRN 337

Query: 1238 SASKEDGNLGLFEGK-DGQRNSSSKEDISFGLFEGAPSSSDGFKSFDDNVAASSSDWDSD 1062
            S+SKED + G+FEGK D QRNSSSKE+ +FG FEGAP S+   KSFDD + A+SSDWDSD
Sbjct:  338 SSSKEDESFGMFEGKEDAQRNSSSKENENFGFFEGAPLSNADLKSFDDKIVAASSDWDSD 397

Query: 1061 FQS----VSHEKISGDPFVSSPVDLSVHMDSVFGSGKQ------ADSSTAYVSKAGDWPL 912
            FQS    +S +KI GDPFVSSPVDL+ HMDSVFGSGK       ADSSTAYVSKAGDW L
Sbjct:  398 FQSADQNLSQKKIDGDPFVSSPVDLAAHMDSVFGSGKDLLYAQPADSSTAYVSKAGDW-L 456

Query:  911 QDDLFGNFTGKSGNNDEAGH---EGQVVSGNGTSSMDIDWIGDDLWQTSEQKAVEKTRTD 741
            QDDLFGN TG++  ND A H   EGQ+V GNG SSMDIDWIGDDLWQT+E+K++EKT TD
Sbjct:  457 QDDLFGNVTGEAQTNDSAVHDKNEGQIVGGNGNSSMDIDWIGDDLWQTNEKKSIEKTPTD 516

Query:  740 DNDDDDDDDWNDFASSANSKTPSNLLSRTMERSQEEIFDGMSRVKNDVEEQSEYEKQNTG 561
             N DDDDDDWNDFASSANSKTP+N LS+TME SQ EIF G ++ KN V+EQS  EKQNT 
Sbjct:  517 VN-DDDDDDWNDFASSANSKTPNNPLSQTMESSQFEIFYGHAQDKNGVKEQSVDEKQNTD 575

Query:  560 TSMISDIAKVQEDDLFGNWDSFSSSAVLQTPVQPPTNHVTPSPEQNQGMNLLEESNHHRD 381
            TS++SDI K QEDDLFG WDSF+SS +LQT +QPPT H  PS E+N  MNL  E+N++RD
Sbjct:  576 TSVMSDIGKCQEDDLFGTWDSFTSSTILQTSLQPPTIHANPSGEKNPEMNLFGENNNNRD 635

Query:  380 LDF------GLFSESIGGHTNSEEVTAMPSGTSSTSERTGDPD-VQDQ-------VTTDS 243
            LDF        FSES GG TNSEEV  +PSGT ST +R  DPD  +DQ        TT  
Sbjct:  636 LDFDSISRSDFFSESSGGKTNSEEVKVIPSGT-STLDRPSDPDGSKDQTVDLVVGTTTTV 694

Query:  242 RKSKSDVAEELMSQMHDLSFMLETKLSVSPISKAE 138
             KSKSDVAEELMSQMHDLSFMLETKLSV PISK E
Sbjct:  695 PKSKSDVAEELMSQMHDLSFMLETKLSVPPISKTE 729

>gi|5262212|emb|CAB45838.1| hypothetical protein [Arabidopsis thaliana]

          Length = 758

 Score =  560 bits (1441), Expect = 3e-157
 Identities = 310/455 (68%), Positives = 353/455 (77%), Gaps = 33/455 (7%)
 Frame = -1

Query: 1412 F*EDKNLSLFDGK-DSLRTSSSRKDDSFGFFEGRDAQGTSSFKEDESFGVFEG-KGAQRN 1239
            F ED+NLSLF+GK D+ RTSSS+ D+SFGFFEG+DAQ TSS K+DESFG+FEG K AQRN
Sbjct:  307 FKEDENLSLFEGKEDAQRTSSSKVDESFGFFEGKDAQRTSSSKDDESFGMFEGKKDAQRN 366

Query: 1238 SASKEDGNLGLFEGK-DGQRNSSSKEDISFGLFEGAPSSSDGFKSFDDNVAASSSDWDSD 1062
            S+SKED + G+FEGK D QRNSSSKE+ +FG FEGAP S+   KSFDD + A+SSDWDSD
Sbjct:  367 SSSKEDESFGMFEGKEDAQRNSSSKENENFGFFEGAPLSNADLKSFDDKIVAASSDWDSD 426

Query: 1061 FQS----VSHEKISGDPFVSSPVDLSVHMDSVFGSGKQ------ADSSTAYVSKAGDWPL 912
            FQS    +S +KI GDPFVSSPVDL+ HMDSVFGSGK       ADSSTAYVSKAGDW L
Sbjct:  427 FQSADQNLSQKKIDGDPFVSSPVDLAAHMDSVFGSGKDLLYAQPADSSTAYVSKAGDW-L 485

Query:  911 QDDLFGNFTGKSGNNDEAGH---EGQVVSGNGTSSMDIDWIGDDLWQTSEQKAVEKTRTD 741
            QDDLFGN TG++  ND A H   EGQ+V GNG SSMDIDWIGDDLWQT+E+K++EKT TD
Sbjct:  486 QDDLFGNVTGEAQTNDSAVHDKNEGQIVGGNGNSSMDIDWIGDDLWQTNEKKSIEKTPTD 545

Query:  740 DNDDDDDDDWNDFASSANSKTPSNLLSRTMERSQEEIFDGMSRVKNDVEEQSEYEKQNTG 561
             N DDDDDDWNDFASSANSKTP+N LS+TME SQ EIF G ++ KN V+EQS  EKQNT 
Sbjct:  546 VN-DDDDDDWNDFASSANSKTPNNPLSQTMESSQFEIFYGHAQDKNGVKEQSVDEKQNTD 604

Query:  560 TSMISDIAKVQEDDLFGNWDSFSSSAVLQTPVQPPTNHVTPSPEQNQGMNLLEESNHHRD 381
            TS++SDI K QEDDLFG WDSF+SS +LQT +QPPT H  PS E+N  MNL  E+N++RD
Sbjct:  605 TSVMSDIGKCQEDDLFGTWDSFTSSTILQTSLQPPTIHANPSGEKNPEMNLFGENNNNRD 664

Query:  380 LDF------GLFSESIGGHTNSEEVTAMPSGTSSTSERTGDPD-VQDQ-------VTTDS 243
            LDF        FSES GG TNSEEV  +PSGT ST +R  DPD  +DQ        TT  
Sbjct:  665 LDFDSISRSDFFSESSGGKTNSEEVKVIPSGT-STLDRPSDPDGSKDQTVDLVVGTTTTV 723

Query:  242 RKSKSDVAEELMSQMHDLSFMLETKLSVSPISKAE 138
             KSKSDVAEELMSQMHDLSFMLETKLSV PISK E
Sbjct:  724 PKSKSDVAEELMSQMHDLSFMLETKLSVPPISKTE 758

>gi|15220414|ref|NP_172002.1| dentin sialophosphoprotein-like protein
        [Arabidopsis thaliana]

          Length = 706

 Score =  555 bits (1429), Expect = 8e-156
 Identities = 307/453 (67%), Positives = 351/453 (77%), Gaps = 33/453 (7%)
 Frame = -1

Query: 1406 EDKNLSLFDGK-DSLRTSSSRKDDSFGFFEGRDAQGTSSFKEDESFGVFEG-KGAQRNSA 1233
            +D+NLSLF+GK D+ RTSSS+ D+SFGFFEG+DAQ TSS K+DESFG+FEG K AQRNS+
Sbjct:  257 KDENLSLFEGKEDAQRTSSSKVDESFGFFEGKDAQRTSSSKDDESFGMFEGKKDAQRNSS 316

Query: 1232 SKEDGNLGLFEGK-DGQRNSSSKEDISFGLFEGAPSSSDGFKSFDDNVAASSSDWDSDFQ 1056
            SKED + G+FEGK D QRNSSSKE+ +FG FEGAP S+   KSFDD + A+SSDWDSDFQ
Sbjct:  317 SKEDESFGMFEGKEDAQRNSSSKENENFGFFEGAPLSNADLKSFDDKIVAASSDWDSDFQ 376

Query: 1055 S----VSHEKISGDPFVSSPVDLSVHMDSVFGSGKQ------ADSSTAYVSKAGDWPLQD 906
            S    +S +KI GDPFVSSPVDL+ HMDSVFGSGK       ADSSTAYVSKAGDW LQD
Sbjct:  377 SADQNLSQKKIDGDPFVSSPVDLAAHMDSVFGSGKDLLYAQPADSSTAYVSKAGDW-LQD 435

Query:  905 DLFGNFTGKSGNNDEAGH---EGQVVSGNGTSSMDIDWIGDDLWQTSEQKAVEKTRTDDN 735
            DLFGN TG++  ND A H   EGQ+V GNG SSMDIDWIGDDLWQT+E+K++EKT TD N
Sbjct:  436 DLFGNVTGEAQTNDSAVHDKNEGQIVGGNGNSSMDIDWIGDDLWQTNEKKSIEKTPTDVN 495

Query:  734 DDDDDDDWNDFASSANSKTPSNLLSRTMERSQEEIFDGMSRVKNDVEEQSEYEKQNTGTS 555
             DDDDDDWNDFASSANSKTP+N LS+TME SQ EIF G ++ KN V+EQS  EKQNT TS
Sbjct:  496 -DDDDDDWNDFASSANSKTPNNPLSQTMESSQFEIFYGHAQDKNGVKEQSVDEKQNTDTS 554

Query:  554 MISDIAKVQEDDLFGNWDSFSSSAVLQTPVQPPTNHVTPSPEQNQGMNLLEESNHHRDLD 375
            ++SDI K QEDDLFG WDSF+SS +LQT +QPPT H  PS E+N  MNL  E+N++RDLD
Sbjct:  555 VMSDIGKCQEDDLFGTWDSFTSSTILQTSLQPPTIHANPSGEKNPEMNLFGENNNNRDLD 614

Query:  374 F------GLFSESIGGHTNSEEVTAMPSGTSSTSERTGDPD-VQDQ-------VTTDSRK 237
            F        FSES GG TNSEEV  +PSGT ST +R  DPD  +DQ        TT   K
Sbjct:  615 FDSISRSDFFSESSGGKTNSEEVKVIPSGT-STLDRPSDPDGSKDQTVDLVVGTTTTVPK 673

Query:  236 SKSDVAEELMSQMHDLSFMLETKLSVSPISKAE 138
            S SDVAEELMSQMHDLSFMLETKLSV PISK E
Sbjct:  674 SMSDVAEELMSQMHDLSFMLETKLSVPPISKTE 706

>gi|4056417|gb|AAC97991.1| ESTs gb|H76594 and gb|H76252 come from this gene
        [Arabidopsis thaliana]

          Length = 747

 Score =  555 bits (1429), Expect = 8e-156
 Identities = 307/453 (67%), Positives = 351/453 (77%), Gaps = 33/453 (7%)
 Frame = -1

Query: 1406 EDKNLSLFDGK-DSLRTSSSRKDDSFGFFEGRDAQGTSSFKEDESFGVFEG-KGAQRNSA 1233
            +D+NLSLF+GK D+ RTSSS+ D+SFGFFEG+DAQ TSS K+DESFG+FEG K AQRNS+
Sbjct:  298 KDENLSLFEGKEDAQRTSSSKVDESFGFFEGKDAQRTSSSKDDESFGMFEGKKDAQRNSS 357

Query: 1232 SKEDGNLGLFEGK-DGQRNSSSKEDISFGLFEGAPSSSDGFKSFDDNVAASSSDWDSDFQ 1056
            SKED + G+FEGK D QRNSSSKE+ +FG FEGAP S+   KSFDD + A+SSDWDSDFQ
Sbjct:  358 SKEDESFGMFEGKEDAQRNSSSKENENFGFFEGAPLSNADLKSFDDKIVAASSDWDSDFQ 417

Query: 1055 S----VSHEKISGDPFVSSPVDLSVHMDSVFGSGKQ------ADSSTAYVSKAGDWPLQD 906
            S    +S +KI GDPFVSSPVDL+ HMDSVFGSGK       ADSSTAYVSKAGDW LQD
Sbjct:  418 SADQNLSQKKIDGDPFVSSPVDLAAHMDSVFGSGKDLLYAQPADSSTAYVSKAGDW-LQD 476

Query:  905 DLFGNFTGKSGNNDEAGH---EGQVVSGNGTSSMDIDWIGDDLWQTSEQKAVEKTRTDDN 735
            DLFGN TG++  ND A H   EGQ+V GNG SSMDIDWIGDDLWQT+E+K++EKT TD N
Sbjct:  477 DLFGNVTGEAQTNDSAVHDKNEGQIVGGNGNSSMDIDWIGDDLWQTNEKKSIEKTPTDVN 536

Query:  734 DDDDDDDWNDFASSANSKTPSNLLSRTMERSQEEIFDGMSRVKNDVEEQSEYEKQNTGTS 555
             DDDDDDWNDFASSANSKTP+N LS+TME SQ EIF G ++ KN V+EQS  EKQNT TS
Sbjct:  537 -DDDDDDWNDFASSANSKTPNNPLSQTMESSQFEIFYGHAQDKNGVKEQSVDEKQNTDTS 595

Query:  554 MISDIAKVQEDDLFGNWDSFSSSAVLQTPVQPPTNHVTPSPEQNQGMNLLEESNHHRDLD 375
            ++SDI K QEDDLFG WDSF+SS +LQT +QPPT H  PS E+N  MNL  E+N++RDLD
Sbjct:  596 VMSDIGKCQEDDLFGTWDSFTSSTILQTSLQPPTIHANPSGEKNPEMNLFGENNNNRDLD 655

Query:  374 F------GLFSESIGGHTNSEEVTAMPSGTSSTSERTGDPD-VQDQ-------VTTDSRK 237
            F        FSES GG TNSEEV  +PSGT ST +R  DPD  +DQ        TT   K
Sbjct:  656 FDSISRSDFFSESSGGKTNSEEVKVIPSGT-STLDRPSDPDGSKDQTVDLVVGTTTTVPK 714

Query:  236 SKSDVAEELMSQMHDLSFMLETKLSVSPISKAE 138
            S SDVAEELMSQMHDLSFMLETKLSV PISK E
Sbjct:  715 SMSDVAEELMSQMHDLSFMLETKLSVPPISKTE 747

>gi|297843308|ref|XP_002889535.1| hypothetical protein ARALYDRAFT_470500
        [Arabidopsis lyrata subsp. lyrata]

          Length = 701

 Score =  507 bits (1305), Expect = 2e-141
 Identities = 284/447 (63%), Positives = 332/447 (74%), Gaps = 45/447 (10%)
 Frame = -1

Query: 1412 F*EDKNLSLFDGKDSLRTSSSRKDDSFGFFEGRDAQGTSSFKEDESFGVFEGKGAQRNSA 1233
            F E++NLSLF+GK + +TSSS++D+SFG FEG+D Q  SS KEDES G+F GK AQR S+
Sbjct:  278 FKENENLSLFEGKVAQKTSSSKEDESFGLFEGKDTQRNSSSKEDESPGLFMGKDAQRTSS 337

Query: 1232 SKEDGNLGLFEGK-DGQRNSSSKEDISFGLFEGAPSSSDGFKSFDDNVAASSSDWDSDFQ 1056
            SK+D + G+FEGK D QRNSSSKED +FGLFEGAPSS+   KSFDD + A+SSDWDSDFQ
Sbjct:  338 SKDDESFGMFEGKEDAQRNSSSKEDENFGLFEGAPSSTADLKSFDDKIVATSSDWDSDFQ 397

Query: 1055 SVSH----EKISGDPFVSSPVDLSVHMDSVFGSGKQADSSTAYVSKAGDWPLQDDLFGNF 888
            S  H    +K+ GDPFVSSPVDL+ HMDSVFGSGK         +K GDW LQDDLFGN 
Sbjct:  398 SADHNPSQKKVGGDPFVSSPVDLAAHMDSVFGSGKD-----LLYAKPGDW-LQDDLFGNV 451

Query:  887 TGKSGNNDEAGH---EGQVVSGNGTSSMDIDWIGDDLWQTSEQKAVEKTRTDDNDDDDDD 717
            TG++ N+D A H   EGQVV GNG+SSMDIDWIGDDLWQT+E+K++EKT TD N  DDDD
Sbjct:  452 TGEAQNSDSAVHDKNEGQVVGGNGSSSMDIDWIGDDLWQTNEKKSIEKTPTDVN--DDDD 509

Query:  716 DWNDFASSANSKTPSNLLSRTMERSQEEIFDGMSRVKNDVEEQSEYEKQNTGTSMISDIA 537
            DWNDFASSANSKTP+N LS+TME SQ+E F G ++VKN V+EQS  EKQNT   ++SDI 
Sbjct:  510 DWNDFASSANSKTPNNPLSQTMESSQDEFFYGQAQVKNGVKEQSVDEKQNT---VMSDIG 566

Query:  536 KVQEDDLFGNWDSFSSSAVLQTPVQPPTNHVTPSPEQNQGMNLLEESNHHRDLDF----- 372
            K QEDD+FG WDSF+SS + QT           S E+   MNL  E+N+HRDLDF     
Sbjct:  567 KGQEDDIFGTWDSFTSSTIPQT-----------SGEKYPKMNLFGENNNHRDLDFDSISR 615

Query:  371 -GLFSESIGGHTNSEEVTAMPSGTSSTSERTGDPD-VQDQ-------VTTDSRKSKSDVA 219
               FSES GG TNSEEV  +PSGT ST +RT DPD  +DQ        TT + KSKSDVA
Sbjct:  616 SDFFSESSGGKTNSEEVKVIPSGT-STLDRTSDPDGSKDQTVDLVVGTTTTAPKSKSDVA 674

Query:  218 EELMSQMHDLSFMLETKLSVSPISKAE 138
            EELMSQMHDLSFMLETKLSV PISK E
Sbjct:  675 EELMSQMHDLSFMLETKLSVPPISKTE 701


 Score =  109 bits (272), Expect = 1e-021
 Identities = 87/235 (37%), Positives = 116/235 (49%), Gaps = 19/235 (8%)
 Frame = -1

Query: 1406 EDKNLSLFDGKDSLRTSSSRKDDSFGFFEGRDAQGTSSFKEDESFGVFEGKGAQRNSASK 1227
            E +NLSLF G+D+  + S  +  +FGFFE +D  G +SFKE+E+  +FEGK AQ+ S+SK
Sbjct:  242 EHENLSLFAGRDAQESVSLAEQGNFGFFEEKD--GQNSFKENENLSLFEGKVAQKTSSSK 299

Query: 1226 EDGNLGLFEGKDGQRNSSSKEDISFGLFEGAPSSSDGFKSFDDNVAASSSDWDSDFQSVS 1047
            ED + GLFEGKD QRNSSSKED S GLF G  +        D++        D+   S S
Sbjct:  300 EDESFGLFEGKDTQRNSSSKEDESPGLFMGKDAQRTSSSKDDESFGMFEGKEDAQRNSSS 359

Query: 1046 HEK-----ISGDPFVSSPVDLSVHMDSVFGSGKQADSSTAYVSKAGDWPLQDDLFGNFTG 882
             E        G P  SS  DL    D +  +    DS       A   P Q  + G+   
Sbjct:  360 KEDENFGLFEGAP--SSTADLKSFDDKIVATSSDWDSD---FQSADHNPSQKKVGGD-PF 413

Query:  881 KSGNNDEAGHEGQVVSGNGTSSMDI---DWIGDDLW--QTSEQKAVEKTRTDDND 732
             S   D A H   V  G+G   +     DW+ DDL+   T E +  +    D N+
Sbjct:  414 VSSPVDLAAHMDSVF-GSGKDLLYAKPGDWLQDDLFGNVTGEAQNSDSAVHDKNE 467


 Score =  102 bits (254), Expect = 1e-019
 Identities = 54/103 (52%), Positives = 69/103 (66%), Gaps = 2/103 (1%)
 Frame = -1

Query: 1406 EDKNLSLFDGKDSLRTSSSRKDDSFGFFEGRDAQGTSSFKEDESFGVFEGKGAQRNSASK 1227
            E  N   F+ KD    +S +++++   FEG+ AQ TSS KEDESFG+FEGK  QRNS+SK
Sbjct:  262 EQGNFGFFEEKDG--QNSFKENENLSLFEGKVAQKTSSSKEDESFGLFEGKDTQRNSSSK 319

Query: 1226 EDGNLGLFEGKDGQRNSSSKEDISFGLFEGAPSSSDGFKSFDD 1098
            ED + GLF GKD QR SSSK+D SFG+FEG   +     S +D
Sbjct:  320 EDESPGLFMGKDAQRTSSSKDDESFGMFEGKEDAQRNSSSKED 362

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,313,363,328,120
Number of Sequences: 15229318
Number of Extensions: 1313363328120
Number of Successful Extensions: 373990951
Number of sequences better than 0.0: 0