BLASTX 7.6.2
Query= UN10842 /QuerySize=1434
(1433 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|18415554|ref|NP_567615.1| dentin sialophosphoprotein-related ... 560 3e-157
gi|5262212|emb|CAB45838.1| hypothetical protein [Arabidopsis tha... 560 3e-157
gi|15220414|ref|NP_172002.1| dentin sialophosphoprotein-like pro... 555 8e-156
gi|4056417|gb|AAC97991.1| ESTs gb|H76594 and gb|H76252 come from... 555 8e-156
gi|297843308|ref|XP_002889535.1| hypothetical protein ARALYDRAFT... 507 2e-141
>gi|18415554|ref|NP_567615.1| dentin sialophosphoprotein-related protein
[Arabidopsis thaliana]
Length = 729
Score = 560 bits (1441), Expect = 3e-157
Identities = 310/455 (68%), Positives = 353/455 (77%), Gaps = 33/455 (7%)
Frame = -1
Query: 1412 F*EDKNLSLFDGK-DSLRTSSSRKDDSFGFFEGRDAQGTSSFKEDESFGVFEG-KGAQRN 1239
F ED+NLSLF+GK D+ RTSSS+ D+SFGFFEG+DAQ TSS K+DESFG+FEG K AQRN
Sbjct: 278 FKEDENLSLFEGKEDAQRTSSSKVDESFGFFEGKDAQRTSSSKDDESFGMFEGKKDAQRN 337
Query: 1238 SASKEDGNLGLFEGK-DGQRNSSSKEDISFGLFEGAPSSSDGFKSFDDNVAASSSDWDSD 1062
S+SKED + G+FEGK D QRNSSSKE+ +FG FEGAP S+ KSFDD + A+SSDWDSD
Sbjct: 338 SSSKEDESFGMFEGKEDAQRNSSSKENENFGFFEGAPLSNADLKSFDDKIVAASSDWDSD 397
Query: 1061 FQS----VSHEKISGDPFVSSPVDLSVHMDSVFGSGKQ------ADSSTAYVSKAGDWPL 912
FQS +S +KI GDPFVSSPVDL+ HMDSVFGSGK ADSSTAYVSKAGDW L
Sbjct: 398 FQSADQNLSQKKIDGDPFVSSPVDLAAHMDSVFGSGKDLLYAQPADSSTAYVSKAGDW-L 456
Query: 911 QDDLFGNFTGKSGNNDEAGH---EGQVVSGNGTSSMDIDWIGDDLWQTSEQKAVEKTRTD 741
QDDLFGN TG++ ND A H EGQ+V GNG SSMDIDWIGDDLWQT+E+K++EKT TD
Sbjct: 457 QDDLFGNVTGEAQTNDSAVHDKNEGQIVGGNGNSSMDIDWIGDDLWQTNEKKSIEKTPTD 516
Query: 740 DNDDDDDDDWNDFASSANSKTPSNLLSRTMERSQEEIFDGMSRVKNDVEEQSEYEKQNTG 561
N DDDDDDWNDFASSANSKTP+N LS+TME SQ EIF G ++ KN V+EQS EKQNT
Sbjct: 517 VN-DDDDDDWNDFASSANSKTPNNPLSQTMESSQFEIFYGHAQDKNGVKEQSVDEKQNTD 575
Query: 560 TSMISDIAKVQEDDLFGNWDSFSSSAVLQTPVQPPTNHVTPSPEQNQGMNLLEESNHHRD 381
TS++SDI K QEDDLFG WDSF+SS +LQT +QPPT H PS E+N MNL E+N++RD
Sbjct: 576 TSVMSDIGKCQEDDLFGTWDSFTSSTILQTSLQPPTIHANPSGEKNPEMNLFGENNNNRD 635
Query: 380 LDF------GLFSESIGGHTNSEEVTAMPSGTSSTSERTGDPD-VQDQ-------VTTDS 243
LDF FSES GG TNSEEV +PSGT ST +R DPD +DQ TT
Sbjct: 636 LDFDSISRSDFFSESSGGKTNSEEVKVIPSGT-STLDRPSDPDGSKDQTVDLVVGTTTTV 694
Query: 242 RKSKSDVAEELMSQMHDLSFMLETKLSVSPISKAE 138
KSKSDVAEELMSQMHDLSFMLETKLSV PISK E
Sbjct: 695 PKSKSDVAEELMSQMHDLSFMLETKLSVPPISKTE 729
>gi|5262212|emb|CAB45838.1| hypothetical protein [Arabidopsis thaliana]
Length = 758
Score = 560 bits (1441), Expect = 3e-157
Identities = 310/455 (68%), Positives = 353/455 (77%), Gaps = 33/455 (7%)
Frame = -1
Query: 1412 F*EDKNLSLFDGK-DSLRTSSSRKDDSFGFFEGRDAQGTSSFKEDESFGVFEG-KGAQRN 1239
F ED+NLSLF+GK D+ RTSSS+ D+SFGFFEG+DAQ TSS K+DESFG+FEG K AQRN
Sbjct: 307 FKEDENLSLFEGKEDAQRTSSSKVDESFGFFEGKDAQRTSSSKDDESFGMFEGKKDAQRN 366
Query: 1238 SASKEDGNLGLFEGK-DGQRNSSSKEDISFGLFEGAPSSSDGFKSFDDNVAASSSDWDSD 1062
S+SKED + G+FEGK D QRNSSSKE+ +FG FEGAP S+ KSFDD + A+SSDWDSD
Sbjct: 367 SSSKEDESFGMFEGKEDAQRNSSSKENENFGFFEGAPLSNADLKSFDDKIVAASSDWDSD 426
Query: 1061 FQS----VSHEKISGDPFVSSPVDLSVHMDSVFGSGKQ------ADSSTAYVSKAGDWPL 912
FQS +S +KI GDPFVSSPVDL+ HMDSVFGSGK ADSSTAYVSKAGDW L
Sbjct: 427 FQSADQNLSQKKIDGDPFVSSPVDLAAHMDSVFGSGKDLLYAQPADSSTAYVSKAGDW-L 485
Query: 911 QDDLFGNFTGKSGNNDEAGH---EGQVVSGNGTSSMDIDWIGDDLWQTSEQKAVEKTRTD 741
QDDLFGN TG++ ND A H EGQ+V GNG SSMDIDWIGDDLWQT+E+K++EKT TD
Sbjct: 486 QDDLFGNVTGEAQTNDSAVHDKNEGQIVGGNGNSSMDIDWIGDDLWQTNEKKSIEKTPTD 545
Query: 740 DNDDDDDDDWNDFASSANSKTPSNLLSRTMERSQEEIFDGMSRVKNDVEEQSEYEKQNTG 561
N DDDDDDWNDFASSANSKTP+N LS+TME SQ EIF G ++ KN V+EQS EKQNT
Sbjct: 546 VN-DDDDDDWNDFASSANSKTPNNPLSQTMESSQFEIFYGHAQDKNGVKEQSVDEKQNTD 604
Query: 560 TSMISDIAKVQEDDLFGNWDSFSSSAVLQTPVQPPTNHVTPSPEQNQGMNLLEESNHHRD 381
TS++SDI K QEDDLFG WDSF+SS +LQT +QPPT H PS E+N MNL E+N++RD
Sbjct: 605 TSVMSDIGKCQEDDLFGTWDSFTSSTILQTSLQPPTIHANPSGEKNPEMNLFGENNNNRD 664
Query: 380 LDF------GLFSESIGGHTNSEEVTAMPSGTSSTSERTGDPD-VQDQ-------VTTDS 243
LDF FSES GG TNSEEV +PSGT ST +R DPD +DQ TT
Sbjct: 665 LDFDSISRSDFFSESSGGKTNSEEVKVIPSGT-STLDRPSDPDGSKDQTVDLVVGTTTTV 723
Query: 242 RKSKSDVAEELMSQMHDLSFMLETKLSVSPISKAE 138
KSKSDVAEELMSQMHDLSFMLETKLSV PISK E
Sbjct: 724 PKSKSDVAEELMSQMHDLSFMLETKLSVPPISKTE 758
>gi|15220414|ref|NP_172002.1| dentin sialophosphoprotein-like protein
[Arabidopsis thaliana]
Length = 706
Score = 555 bits (1429), Expect = 8e-156
Identities = 307/453 (67%), Positives = 351/453 (77%), Gaps = 33/453 (7%)
Frame = -1
Query: 1406 EDKNLSLFDGK-DSLRTSSSRKDDSFGFFEGRDAQGTSSFKEDESFGVFEG-KGAQRNSA 1233
+D+NLSLF+GK D+ RTSSS+ D+SFGFFEG+DAQ TSS K+DESFG+FEG K AQRNS+
Sbjct: 257 KDENLSLFEGKEDAQRTSSSKVDESFGFFEGKDAQRTSSSKDDESFGMFEGKKDAQRNSS 316
Query: 1232 SKEDGNLGLFEGK-DGQRNSSSKEDISFGLFEGAPSSSDGFKSFDDNVAASSSDWDSDFQ 1056
SKED + G+FEGK D QRNSSSKE+ +FG FEGAP S+ KSFDD + A+SSDWDSDFQ
Sbjct: 317 SKEDESFGMFEGKEDAQRNSSSKENENFGFFEGAPLSNADLKSFDDKIVAASSDWDSDFQ 376
Query: 1055 S----VSHEKISGDPFVSSPVDLSVHMDSVFGSGKQ------ADSSTAYVSKAGDWPLQD 906
S +S +KI GDPFVSSPVDL+ HMDSVFGSGK ADSSTAYVSKAGDW LQD
Sbjct: 377 SADQNLSQKKIDGDPFVSSPVDLAAHMDSVFGSGKDLLYAQPADSSTAYVSKAGDW-LQD 435
Query: 905 DLFGNFTGKSGNNDEAGH---EGQVVSGNGTSSMDIDWIGDDLWQTSEQKAVEKTRTDDN 735
DLFGN TG++ ND A H EGQ+V GNG SSMDIDWIGDDLWQT+E+K++EKT TD N
Sbjct: 436 DLFGNVTGEAQTNDSAVHDKNEGQIVGGNGNSSMDIDWIGDDLWQTNEKKSIEKTPTDVN 495
Query: 734 DDDDDDDWNDFASSANSKTPSNLLSRTMERSQEEIFDGMSRVKNDVEEQSEYEKQNTGTS 555
DDDDDDWNDFASSANSKTP+N LS+TME SQ EIF G ++ KN V+EQS EKQNT TS
Sbjct: 496 -DDDDDDWNDFASSANSKTPNNPLSQTMESSQFEIFYGHAQDKNGVKEQSVDEKQNTDTS 554
Query: 554 MISDIAKVQEDDLFGNWDSFSSSAVLQTPVQPPTNHVTPSPEQNQGMNLLEESNHHRDLD 375
++SDI K QEDDLFG WDSF+SS +LQT +QPPT H PS E+N MNL E+N++RDLD
Sbjct: 555 VMSDIGKCQEDDLFGTWDSFTSSTILQTSLQPPTIHANPSGEKNPEMNLFGENNNNRDLD 614
Query: 374 F------GLFSESIGGHTNSEEVTAMPSGTSSTSERTGDPD-VQDQ-------VTTDSRK 237
F FSES GG TNSEEV +PSGT ST +R DPD +DQ TT K
Sbjct: 615 FDSISRSDFFSESSGGKTNSEEVKVIPSGT-STLDRPSDPDGSKDQTVDLVVGTTTTVPK 673
Query: 236 SKSDVAEELMSQMHDLSFMLETKLSVSPISKAE 138
S SDVAEELMSQMHDLSFMLETKLSV PISK E
Sbjct: 674 SMSDVAEELMSQMHDLSFMLETKLSVPPISKTE 706
>gi|4056417|gb|AAC97991.1| ESTs gb|H76594 and gb|H76252 come from this gene
[Arabidopsis thaliana]
Length = 747
Score = 555 bits (1429), Expect = 8e-156
Identities = 307/453 (67%), Positives = 351/453 (77%), Gaps = 33/453 (7%)
Frame = -1
Query: 1406 EDKNLSLFDGK-DSLRTSSSRKDDSFGFFEGRDAQGTSSFKEDESFGVFEG-KGAQRNSA 1233
+D+NLSLF+GK D+ RTSSS+ D+SFGFFEG+DAQ TSS K+DESFG+FEG K AQRNS+
Sbjct: 298 KDENLSLFEGKEDAQRTSSSKVDESFGFFEGKDAQRTSSSKDDESFGMFEGKKDAQRNSS 357
Query: 1232 SKEDGNLGLFEGK-DGQRNSSSKEDISFGLFEGAPSSSDGFKSFDDNVAASSSDWDSDFQ 1056
SKED + G+FEGK D QRNSSSKE+ +FG FEGAP S+ KSFDD + A+SSDWDSDFQ
Sbjct: 358 SKEDESFGMFEGKEDAQRNSSSKENENFGFFEGAPLSNADLKSFDDKIVAASSDWDSDFQ 417
Query: 1055 S----VSHEKISGDPFVSSPVDLSVHMDSVFGSGKQ------ADSSTAYVSKAGDWPLQD 906
S +S +KI GDPFVSSPVDL+ HMDSVFGSGK ADSSTAYVSKAGDW LQD
Sbjct: 418 SADQNLSQKKIDGDPFVSSPVDLAAHMDSVFGSGKDLLYAQPADSSTAYVSKAGDW-LQD 476
Query: 905 DLFGNFTGKSGNNDEAGH---EGQVVSGNGTSSMDIDWIGDDLWQTSEQKAVEKTRTDDN 735
DLFGN TG++ ND A H EGQ+V GNG SSMDIDWIGDDLWQT+E+K++EKT TD N
Sbjct: 477 DLFGNVTGEAQTNDSAVHDKNEGQIVGGNGNSSMDIDWIGDDLWQTNEKKSIEKTPTDVN 536
Query: 734 DDDDDDDWNDFASSANSKTPSNLLSRTMERSQEEIFDGMSRVKNDVEEQSEYEKQNTGTS 555
DDDDDDWNDFASSANSKTP+N LS+TME SQ EIF G ++ KN V+EQS EKQNT TS
Sbjct: 537 -DDDDDDWNDFASSANSKTPNNPLSQTMESSQFEIFYGHAQDKNGVKEQSVDEKQNTDTS 595
Query: 554 MISDIAKVQEDDLFGNWDSFSSSAVLQTPVQPPTNHVTPSPEQNQGMNLLEESNHHRDLD 375
++SDI K QEDDLFG WDSF+SS +LQT +QPPT H PS E+N MNL E+N++RDLD
Sbjct: 596 VMSDIGKCQEDDLFGTWDSFTSSTILQTSLQPPTIHANPSGEKNPEMNLFGENNNNRDLD 655
Query: 374 F------GLFSESIGGHTNSEEVTAMPSGTSSTSERTGDPD-VQDQ-------VTTDSRK 237
F FSES GG TNSEEV +PSGT ST +R DPD +DQ TT K
Sbjct: 656 FDSISRSDFFSESSGGKTNSEEVKVIPSGT-STLDRPSDPDGSKDQTVDLVVGTTTTVPK 714
Query: 236 SKSDVAEELMSQMHDLSFMLETKLSVSPISKAE 138
S SDVAEELMSQMHDLSFMLETKLSV PISK E
Sbjct: 715 SMSDVAEELMSQMHDLSFMLETKLSVPPISKTE 747
>gi|297843308|ref|XP_002889535.1| hypothetical protein ARALYDRAFT_470500
[Arabidopsis lyrata subsp. lyrata]
Length = 701
Score = 507 bits (1305), Expect = 2e-141
Identities = 284/447 (63%), Positives = 332/447 (74%), Gaps = 45/447 (10%)
Frame = -1
Query: 1412 F*EDKNLSLFDGKDSLRTSSSRKDDSFGFFEGRDAQGTSSFKEDESFGVFEGKGAQRNSA 1233
F E++NLSLF+GK + +TSSS++D+SFG FEG+D Q SS KEDES G+F GK AQR S+
Sbjct: 278 FKENENLSLFEGKVAQKTSSSKEDESFGLFEGKDTQRNSSSKEDESPGLFMGKDAQRTSS 337
Query: 1232 SKEDGNLGLFEGK-DGQRNSSSKEDISFGLFEGAPSSSDGFKSFDDNVAASSSDWDSDFQ 1056
SK+D + G+FEGK D QRNSSSKED +FGLFEGAPSS+ KSFDD + A+SSDWDSDFQ
Sbjct: 338 SKDDESFGMFEGKEDAQRNSSSKEDENFGLFEGAPSSTADLKSFDDKIVATSSDWDSDFQ 397
Query: 1055 SVSH----EKISGDPFVSSPVDLSVHMDSVFGSGKQADSSTAYVSKAGDWPLQDDLFGNF 888
S H +K+ GDPFVSSPVDL+ HMDSVFGSGK +K GDW LQDDLFGN
Sbjct: 398 SADHNPSQKKVGGDPFVSSPVDLAAHMDSVFGSGKD-----LLYAKPGDW-LQDDLFGNV 451
Query: 887 TGKSGNNDEAGH---EGQVVSGNGTSSMDIDWIGDDLWQTSEQKAVEKTRTDDNDDDDDD 717
TG++ N+D A H EGQVV GNG+SSMDIDWIGDDLWQT+E+K++EKT TD N DDDD
Sbjct: 452 TGEAQNSDSAVHDKNEGQVVGGNGSSSMDIDWIGDDLWQTNEKKSIEKTPTDVN--DDDD 509
Query: 716 DWNDFASSANSKTPSNLLSRTMERSQEEIFDGMSRVKNDVEEQSEYEKQNTGTSMISDIA 537
DWNDFASSANSKTP+N LS+TME SQ+E F G ++VKN V+EQS EKQNT ++SDI
Sbjct: 510 DWNDFASSANSKTPNNPLSQTMESSQDEFFYGQAQVKNGVKEQSVDEKQNT---VMSDIG 566
Query: 536 KVQEDDLFGNWDSFSSSAVLQTPVQPPTNHVTPSPEQNQGMNLLEESNHHRDLDF----- 372
K QEDD+FG WDSF+SS + QT S E+ MNL E+N+HRDLDF
Sbjct: 567 KGQEDDIFGTWDSFTSSTIPQT-----------SGEKYPKMNLFGENNNHRDLDFDSISR 615
Query: 371 -GLFSESIGGHTNSEEVTAMPSGTSSTSERTGDPD-VQDQ-------VTTDSRKSKSDVA 219
FSES GG TNSEEV +PSGT ST +RT DPD +DQ TT + KSKSDVA
Sbjct: 616 SDFFSESSGGKTNSEEVKVIPSGT-STLDRTSDPDGSKDQTVDLVVGTTTTAPKSKSDVA 674
Query: 218 EELMSQMHDLSFMLETKLSVSPISKAE 138
EELMSQMHDLSFMLETKLSV PISK E
Sbjct: 675 EELMSQMHDLSFMLETKLSVPPISKTE 701
Score = 109 bits (272), Expect = 1e-021
Identities = 87/235 (37%), Positives = 116/235 (49%), Gaps = 19/235 (8%)
Frame = -1
Query: 1406 EDKNLSLFDGKDSLRTSSSRKDDSFGFFEGRDAQGTSSFKEDESFGVFEGKGAQRNSASK 1227
E +NLSLF G+D+ + S + +FGFFE +D G +SFKE+E+ +FEGK AQ+ S+SK
Sbjct: 242 EHENLSLFAGRDAQESVSLAEQGNFGFFEEKD--GQNSFKENENLSLFEGKVAQKTSSSK 299
Query: 1226 EDGNLGLFEGKDGQRNSSSKEDISFGLFEGAPSSSDGFKSFDDNVAASSSDWDSDFQSVS 1047
ED + GLFEGKD QRNSSSKED S GLF G + D++ D+ S S
Sbjct: 300 EDESFGLFEGKDTQRNSSSKEDESPGLFMGKDAQRTSSSKDDESFGMFEGKEDAQRNSSS 359
Query: 1046 HEK-----ISGDPFVSSPVDLSVHMDSVFGSGKQADSSTAYVSKAGDWPLQDDLFGNFTG 882
E G P SS DL D + + DS A P Q + G+
Sbjct: 360 KEDENFGLFEGAP--SSTADLKSFDDKIVATSSDWDSD---FQSADHNPSQKKVGGD-PF 413
Query: 881 KSGNNDEAGHEGQVVSGNGTSSMDI---DWIGDDLW--QTSEQKAVEKTRTDDND 732
S D A H V G+G + DW+ DDL+ T E + + D N+
Sbjct: 414 VSSPVDLAAHMDSVF-GSGKDLLYAKPGDWLQDDLFGNVTGEAQNSDSAVHDKNE 467
Score = 102 bits (254), Expect = 1e-019
Identities = 54/103 (52%), Positives = 69/103 (66%), Gaps = 2/103 (1%)
Frame = -1
Query: 1406 EDKNLSLFDGKDSLRTSSSRKDDSFGFFEGRDAQGTSSFKEDESFGVFEGKGAQRNSASK 1227
E N F+ KD +S +++++ FEG+ AQ TSS KEDESFG+FEGK QRNS+SK
Sbjct: 262 EQGNFGFFEEKDG--QNSFKENENLSLFEGKVAQKTSSSKEDESFGLFEGKDTQRNSSSK 319
Query: 1226 EDGNLGLFEGKDGQRNSSSKEDISFGLFEGAPSSSDGFKSFDD 1098
ED + GLF GKD QR SSSK+D SFG+FEG + S +D
Sbjct: 320 EDESPGLFMGKDAQRTSSSKDDESFGMFEGKEDAQRNSSSKED 362
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,313,363,328,120
Number of Sequences: 15229318
Number of Extensions: 1313363328120
Number of Successful Extensions: 373990951
Number of sequences better than 0.0: 0
|