BLASTX 7.6.2
Query= UN33379 /QuerySize=1346
(1345 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value
gi|18415554|ref|NP_567615.1| dentin sialophosphoprotein-related ...  527  2e-147
gi|5262212|emb|CAB45838.1| hypothetical protein [Arabidopsis tha...  527  2e-147
gi|15220414|ref|NP_172002.1| dentin sialophosphoprotein-like pro...  525  1e-146
gi|4056417|gb|AAC97991.1| ESTs gb|H76594 and gb|H76252 come from...  525  1e-146
gi|297843308|ref|XP_002889535.1| hypothetical protein ARALYDRAFT...  501  1e-139
>gi|18415554|ref|NP_567615.1| dentin sialophosphoprotein-related protein
[Arabidopsis thaliana]
Length = 729
Score = 527 bits (1356), Expect = 2e-147
Identities = 287/424 (67%), Positives = 331/424 (78%), Gaps = 30/424 (7%)
Frame = -3
Query: 1337 FEGKDAQRDSSFKDDESFGLFQG-KDAQRNSALKEDANFGLFEG-EDGQSNSASKEDEDF 1164
FEGKDAQR SS KDDESFG+F+G KDAQRNS+ KED +FG+FEG ED Q NS+SKE+E+F
Sbjct: 308 FEGKDAQRTSSSKDDESFGMFEGKKDAQRNSSSKEDESFGMFEGKEDAQRNSSSKENENF 367
Query: 1163 GLFEAALSSNAGIKSFDDKVVTSSSTWDSDFQS----VSQEKSSSDPFVSSPVDLSAHMD 996
G FE A SNA +KSFDDK+V +SS WDSDFQS +SQ+K DPFVSSPVDL+AHMD
Sbjct: 368 GFFEGAPLSNADLKSFDDKIVAASSDWDSDFQSADQNLSQKKIDGDPFVSSPVDLAAHMD 427
Query: 995 TVFGSGKDLFYEKPEDSSTTYVSNAGDWLQDDLFGDVTGKTQNNDQTVH---EGQVVGGN 825
+VFGSGKDL Y +P DSST YVS AGDWLQDDLFG+VTG+ Q ND VH EGQ+VGGN
Sbjct: 428 SVFGSGKDLLYAQPADSSTAYVSKAGDWLQDDLFGNVTGEAQTNDSAVHDKNEGQIVGGN 487
Query: 824 GSSSMDIDWMGDDLWQTSEKKAVEQTPTD----------DFASSVNSKTPSNLLSRTMES 675
G+SSMDIDW+GDDLWQT+EKK++E+TPTD DFASS NSKTP+N LS+TMES
Sbjct: 488 GNSSMDIDWIGDDLWQTNEKKSIEKTPTDVNDDDDDDWNDFASSANSKTPNNPLSQTMES 547
Query: 674 SQEEEIFDGLAHVKNDVNEQSKDEKQNTGIARVISDIGKGQEDDLFGNWDTFTSSTVLQT 495
SQ EIF G A KN V EQS DEKQNT + V+SDIGK QEDDLFG WD+FTSST+LQT
Sbjct: 548 SQ-FEIFYGHAQDKNGVKEQSVDEKQNTDTS-VMSDIGKCQEDDLFGTWDSFTSSTILQT 605
Query: 494 PVQSHTNQVYSSAELNPEVYLFGDNSHRSDLAFDC------FPESTGGQTKSEEVTAMPS 333
+Q T S E NPE+ LFG+N++ DL FD F ES+GG+T SEEV +PS
Sbjct: 606 SLQPPTIHANPSGEKNPEMNLFGENNNNRDLDFDSISRSDFFSESSGGKTNSEEVKVIPS 665
Query: 332 GTSTLERTSDPDG-KDKTLDLV--GTTTSHKSKSDVAEELISQMHDLSFMLETKLSVPPI 162
GTSTL+R SDPDG KD+T+DLV TTT KSKSDVAEEL+SQMHDLSFMLETKLSVPPI
Sbjct: 666 GTSTLDRPSDPDGSKDQTVDLVVGTTTTVPKSKSDVAEELMSQMHDLSFMLETKLSVPPI 725
Query: 161 TKAE 150
+K E
Sbjct: 726 SKTE 729
>gi|5262212|emb|CAB45838.1| hypothetical protein [Arabidopsis thaliana]
Length = 758
Score = 527 bits (1356), Expect = 2e-147
Identities = 287/424 (67%), Positives = 331/424 (78%), Gaps = 30/424 (7%)
Frame = -3
Query: 1337 FEGKDAQRDSSFKDDESFGLFQG-KDAQRNSALKEDANFGLFEG-EDGQSNSASKEDEDF 1164
FEGKDAQR SS KDDESFG+F+G KDAQRNS+ KED +FG+FEG ED Q NS+SKE+E+F
Sbjct: 337 FEGKDAQRTSSSKDDESFGMFEGKKDAQRNSSSKEDESFGMFEGKEDAQRNSSSKENENF 396
Query: 1163 GLFEAALSSNAGIKSFDDKVVTSSSTWDSDFQS----VSQEKSSSDPFVSSPVDLSAHMD 996
G FE A SNA +KSFDDK+V +SS WDSDFQS +SQ+K DPFVSSPVDL+AHMD
Sbjct: 397 GFFEGAPLSNADLKSFDDKIVAASSDWDSDFQSADQNLSQKKIDGDPFVSSPVDLAAHMD 456
Query: 995 TVFGSGKDLFYEKPEDSSTTYVSNAGDWLQDDLFGDVTGKTQNNDQTVH---EGQVVGGN 825
+VFGSGKDL Y +P DSST YVS AGDWLQDDLFG+VTG+ Q ND VH EGQ+VGGN
Sbjct: 457 SVFGSGKDLLYAQPADSSTAYVSKAGDWLQDDLFGNVTGEAQTNDSAVHDKNEGQIVGGN 516
Query: 824 GSSSMDIDWMGDDLWQTSEKKAVEQTPTD----------DFASSVNSKTPSNLLSRTMES 675
G+SSMDIDW+GDDLWQT+EKK++E+TPTD DFASS NSKTP+N LS+TMES
Sbjct: 517 GNSSMDIDWIGDDLWQTNEKKSIEKTPTDVNDDDDDDWNDFASSANSKTPNNPLSQTMES 576
Query: 674 SQEEEIFDGLAHVKNDVNEQSKDEKQNTGIARVISDIGKGQEDDLFGNWDTFTSSTVLQT 495
SQ EIF G A KN V EQS DEKQNT + V+SDIGK QEDDLFG WD+FTSST+LQT
Sbjct: 577 SQ-FEIFYGHAQDKNGVKEQSVDEKQNTDTS-VMSDIGKCQEDDLFGTWDSFTSSTILQT 634
Query: 494 PVQSHTNQVYSSAELNPEVYLFGDNSHRSDLAFDC------FPESTGGQTKSEEVTAMPS 333
+Q T S E NPE+ LFG+N++ DL FD F ES+GG+T SEEV +PS
Sbjct: 635 SLQPPTIHANPSGEKNPEMNLFGENNNNRDLDFDSISRSDFFSESSGGKTNSEEVKVIPS 694
Query: 332 GTSTLERTSDPDG-KDKTLDLV--GTTTSHKSKSDVAEELISQMHDLSFMLETKLSVPPI 162
GTSTL+R SDPDG KD+T+DLV TTT KSKSDVAEEL+SQMHDLSFMLETKLSVPPI
Sbjct: 695 GTSTLDRPSDPDGSKDQTVDLVVGTTTTVPKSKSDVAEELMSQMHDLSFMLETKLSVPPI 754
Query: 161 TKAE 150
+K E
Sbjct: 755 SKTE 758
>gi|15220414|ref|NP_172002.1| dentin sialophosphoprotein-like protein
[Arabidopsis thaliana]
Length = 706
Score = 525 bits (1350), Expect = 1e-146
Identities = 286/424 (67%), Positives = 330/424 (77%), Gaps = 30/424 (7%)
Frame = -3
Query: 1337 FEGKDAQRDSSFKDDESFGLFQG-KDAQRNSALKEDANFGLFEG-EDGQSNSASKEDEDF 1164
FEGKDAQR SS KDDESFG+F+G KDAQRNS+ KED +FG+FEG ED Q NS+SKE+E+F
Sbjct: 285 FEGKDAQRTSSSKDDESFGMFEGKKDAQRNSSSKEDESFGMFEGKEDAQRNSSSKENENF 344
Query: 1163 GLFEAALSSNAGIKSFDDKVVTSSSTWDSDFQS----VSQEKSSSDPFVSSPVDLSAHMD 996
G FE A SNA +KSFDDK+V +SS WDSDFQS +SQ+K DPFVSSPVDL+AHMD
Sbjct: 345 GFFEGAPLSNADLKSFDDKIVAASSDWDSDFQSADQNLSQKKIDGDPFVSSPVDLAAHMD 404
Query: 995 TVFGSGKDLFYEKPEDSSTTYVSNAGDWLQDDLFGDVTGKTQNNDQTVH---EGQVVGGN 825
+VFGSGKDL Y +P DSST YVS AGDWLQDDLFG+VTG+ Q ND VH EGQ+VGGN
Sbjct: 405 SVFGSGKDLLYAQPADSSTAYVSKAGDWLQDDLFGNVTGEAQTNDSAVHDKNEGQIVGGN 464
Query: 824 GSSSMDIDWMGDDLWQTSEKKAVEQTPTD----------DFASSVNSKTPSNLLSRTMES 675
G+SSMDIDW+GDDLWQT+EKK++E+TPTD DFASS NSKTP+N LS+TMES
Sbjct: 465 GNSSMDIDWIGDDLWQTNEKKSIEKTPTDVNDDDDDDWNDFASSANSKTPNNPLSQTMES 524
Query: 674 SQEEEIFDGLAHVKNDVNEQSKDEKQNTGIARVISDIGKGQEDDLFGNWDTFTSSTVLQT 495
SQ EIF G A KN V EQS DEKQNT + V+SDIGK QEDDLFG WD+FTSST+LQT
Sbjct: 525 SQ-FEIFYGHAQDKNGVKEQSVDEKQNTDTS-VMSDIGKCQEDDLFGTWDSFTSSTILQT 582
Query: 494 PVQSHTNQVYSSAELNPEVYLFGDNSHRSDLAFDC------FPESTGGQTKSEEVTAMPS 333
+Q T S E NPE+ LFG+N++ DL FD F ES+GG+T SEEV +PS
Sbjct: 583 SLQPPTIHANPSGEKNPEMNLFGENNNNRDLDFDSISRSDFFSESSGGKTNSEEVKVIPS 642
Query: 332 GTSTLERTSDPDG-KDKTLDLV--GTTTSHKSKSDVAEELISQMHDLSFMLETKLSVPPI 162
GTSTL+R SDPDG KD+T+DLV TTT KS SDVAEEL+SQMHDLSFMLETKLSVPPI
Sbjct: 643 GTSTLDRPSDPDGSKDQTVDLVVGTTTTVPKSMSDVAEELMSQMHDLSFMLETKLSVPPI 702
Query: 161 TKAE 150
+K E
Sbjct: 703 SKTE 706
>gi|4056417|gb|AAC97991.1| ESTs gb|H76594 and gb|H76252 come from this gene
[Arabidopsis thaliana]
Length = 747
Score = 525 bits (1350), Expect = 1e-146
Identities = 286/424 (67%), Positives = 330/424 (77%), Gaps = 30/424 (7%)
Frame = -3
Query: 1337 FEGKDAQRDSSFKDDESFGLFQG-KDAQRNSALKEDANFGLFEG-EDGQSNSASKEDEDF 1164
FEGKDAQR SS KDDESFG+F+G KDAQRNS+ KED +FG+FEG ED Q NS+SKE+E+F
Sbjct: 326 FEGKDAQRTSSSKDDESFGMFEGKKDAQRNSSSKEDESFGMFEGKEDAQRNSSSKENENF 385
Query: 1163 GLFEAALSSNAGIKSFDDKVVTSSSTWDSDFQS----VSQEKSSSDPFVSSPVDLSAHMD 996
G FE A SNA +KSFDDK+V +SS WDSDFQS +SQ+K DPFVSSPVDL+AHMD
Sbjct: 386 GFFEGAPLSNADLKSFDDKIVAASSDWDSDFQSADQNLSQKKIDGDPFVSSPVDLAAHMD 445
Query: 995 TVFGSGKDLFYEKPEDSSTTYVSNAGDWLQDDLFGDVTGKTQNNDQTVH---EGQVVGGN 825
+VFGSGKDL Y +P DSST YVS AGDWLQDDLFG+VTG+ Q ND VH EGQ+VGGN
Sbjct: 446 SVFGSGKDLLYAQPADSSTAYVSKAGDWLQDDLFGNVTGEAQTNDSAVHDKNEGQIVGGN 505
Query: 824 GSSSMDIDWMGDDLWQTSEKKAVEQTPTD----------DFASSVNSKTPSNLLSRTMES 675
G+SSMDIDW+GDDLWQT+EKK++E+TPTD DFASS NSKTP+N LS+TMES
Sbjct: 506 GNSSMDIDWIGDDLWQTNEKKSIEKTPTDVNDDDDDDWNDFASSANSKTPNNPLSQTMES 565
Query: 674 SQEEEIFDGLAHVKNDVNEQSKDEKQNTGIARVISDIGKGQEDDLFGNWDTFTSSTVLQT 495
SQ EIF G A KN V EQS DEKQNT + V+SDIGK QEDDLFG WD+FTSST+LQT
Sbjct: 566 SQ-FEIFYGHAQDKNGVKEQSVDEKQNTDTS-VMSDIGKCQEDDLFGTWDSFTSSTILQT 623
Query: 494 PVQSHTNQVYSSAELNPEVYLFGDNSHRSDLAFDC------FPESTGGQTKSEEVTAMPS 333
+Q T S E NPE+ LFG+N++ DL FD F ES+GG+T SEEV +PS
Sbjct: 624 SLQPPTIHANPSGEKNPEMNLFGENNNNRDLDFDSISRSDFFSESSGGKTNSEEVKVIPS 683
Query: 332 GTSTLERTSDPDG-KDKTLDLV--GTTTSHKSKSDVAEELISQMHDLSFMLETKLSVPPI 162
GTSTL+R SDPDG KD+T+DLV TTT KS SDVAEEL+SQMHDLSFMLETKLSVPPI
Sbjct: 684 GTSTLDRPSDPDGSKDQTVDLVVGTTTTVPKSMSDVAEELMSQMHDLSFMLETKLSVPPI 743
Query: 161 TKAE 150
+K E
Sbjct: 744 SKTE 747
>gi|297843308|ref|XP_002889535.1| hypothetical protein ARALYDRAFT_470500
[Arabidopsis lyrata subsp. lyrata]
Length = 701
Score = 501 bits (1289), Expect = 1e-139
Identities = 277/424 (65%), Positives = 320/424 (75%), Gaps = 53/424 (12%)
Frame = -3
Query: 1343 GFFEGKDAQRDSSFKDDESFGLFQGKDAQRNSALKEDANFGLFEG-EDGQSNSASKEDED 1167
G FEGKD QR+SS K+DES GLF GKDAQR S+ K+D +FG+FEG ED Q NS+SKEDE+
Sbjct: 305 GLFEGKDTQRNSSSKEDESPGLFMGKDAQRTSSSKDDESFGMFEGKEDAQRNSSSKEDEN 364
Query: 1166 FGLFEAALSSNAGIKSFDDKVVTSSSTWDSDFQSV----SQEKSSSDPFVSSPVDLSAHM 999
FGLFE A SS A +KSFDDK+V +SS WDSDFQS SQ+K DPFVSSPVDL+AHM
Sbjct: 365 FGLFEGAPSSTADLKSFDDKIVATSSDWDSDFQSADHNPSQKKVGGDPFVSSPVDLAAHM 424
Query: 998 DTVFGSGKDLFYEKPEDSSTTYVSNAGDWLQDDLFGDVTGKTQNNDQTVH---EGQVVGG 828
D+VFGSGKDL Y KP GDWLQDDLFG+VTG+ QN+D VH EGQVVGG
Sbjct: 425 DSVFGSGKDLLYAKP-----------GDWLQDDLFGNVTGEAQNSDSAVHDKNEGQVVGG 473
Query: 827 NGSSSMDIDWMGDDLWQTSEKKAVEQTPTD---------DFASSVNSKTPSNLLSRTMES 675
NGSSSMDIDW+GDDLWQT+EKK++E+TPTD DFASS NSKTP+N LS+TMES
Sbjct: 474 NGSSSMDIDWIGDDLWQTNEKKSIEKTPTDVNDDDDDWNDFASSANSKTPNNPLSQTMES 533
Query: 674 SQEEEIFDGLAHVKNDVNEQSKDEKQNTGIARVISDIGKGQEDDLFGNWDTFTSSTVLQT 495
SQ +E F G A VKN V EQS DEKQNT V+SDIGKGQEDD+FG WD+FTSST+ QT
Sbjct: 534 SQ-DEFFYGQAQVKNGVKEQSVDEKQNT----VMSDIGKGQEDDIFGTWDSFTSSTIPQT 588
Query: 494 PVQSHTNQVYSSAELNPEVYLFGDNSHRSDLAFDC------FPESTGGQTKSEEVTAMPS 333
S E P++ LFG+N++ DL FD F ES+GG+T SEEV +PS
Sbjct: 589 -----------SGEKYPKMNLFGENNNHRDLDFDSISRSDFFSESSGGKTNSEEVKVIPS 637
Query: 332 GTSTLERTSDPDG-KDKTLDLV--GTTTSHKSKSDVAEELISQMHDLSFMLETKLSVPPI 162
GTSTL+RTSDPDG KD+T+DLV TTT+ KSKSDVAEEL+SQMHDLSFMLETKLSVPPI
Sbjct: 638 GTSTLDRTSDPDGSKDQTVDLVVGTTTTAPKSKSDVAEELMSQMHDLSFMLETKLSVPPI 697
Query: 161 TKAE 150
+K E
Sbjct: 698 SKTE 701
Score = 90 bits (222), Expect = 7e-016
Identities = 55/136 (40%), Positives = 75/136 (55%), Gaps = 9/136 (6%)
Frame = -3
Query: 1337 FEGKDAQRDSSFKDDESFGLFQGKDAQRNSALKEDANFGLFEGEDGQSNSASKEDEDFGL 1158
FEGK AQ+ SS K+DESFGLF+GKD QRNS+ KED + GLF G+D Q S+SK+DE FG+
Sbjct: 287 FEGKVAQKTSSSKEDESFGLFEGKDTQRNSSSKEDESPGLFMGKDAQRTSSSKDDESFGM 346
Query: 1157 FEAALSSNAGIKSFDDKVVTSSSTWDSDFQSVSQEKSSSDPFVSSPVDLSA------HMD 996
FE + S +D+ + ++ S + KS D V++ D + H
Sbjct: 347 FEGKEDAQRNSSSKEDE---NFGLFEGAPSSTADLKSFDDKIVATSSDWDSDFQSADHNP 403
Query: 995 TVFGSGKDLFYEKPED 948
+ G D F P D
Sbjct: 404 SQKKVGGDPFVSSPVD 419
Score = 83 bits (204), Expect = 8e-014
Identities = 50/127 (39%), Positives = 69/127 (54%), Gaps = 5/127 (3%)
Frame = -3
Query: 1343 GFFEGKDAQRDSSFKDDESFGLFQGKDAQRNSALKEDANFGLFEGEDGQSNSASKEDEDF 1164
GFFE KD Q +SFK++E+ LF+GK AQ+ S+ KED +FGLFEG+D Q NS+SKEDE
Sbjct: 267 GFFEEKDGQ--NSFKENENLSLFEGKVAQKTSSSKEDESFGLFEGKDTQRNSSSKEDESP 324
Query: 1163 GLFEAALSSNAGIKSFDDKVVTSSSTWDSDFQSVSQEKSSSDPFVSSP---VDLSAHMDT 993
GLF + D+ D+ S S+E + F +P DL + D
Sbjct: 325 GLFMGKDAQRTSSSKDDESFGMFEGKEDAQRNSSSKEDENFGLFEGAPSSTADLKSFDDK 384
Query: 992 VFGSGKD 972
+ + D
Sbjct: 385 IVATSSD 391
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,810,539,025,462
Number of Sequences: 15229318
Number of Extensions: 3810539025462
Number of Successful Extensions: 885398663
Number of sequences better than 0.0: 0