BLASTX 7.6.2
Query= RU03723 /QuerySize=1843
(1842 letters)
Database: GenBank nr;
7,387,702 sequences; 2,551,671,255 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|21616051|emb|CAC86003.1| aspartic proteinase [Theobroma cacao] 852 2e-245
gi|21616053|emb|CAC86004.1| aspartic proteinase [Theobroma cacao] 826 2e-237
gi|12231172|dbj|BAB20969.1| aspartic proteinase 1 [Nepenthes alata] 808 5e-232
gi|12231174|dbj|BAB20970.1| aspartic proteinase 2 [Nepenthes alata] 791 5e-227
gi|15186732|dbj|BAB62890.1| aspartic proteinase 1 [Glycine max] 777 7e-223
>gi|21616051|emb|CAC86003.1| aspartic proteinase [Theobroma cacao]
Length = 514
Score = 852 bits (2201), Expect = 2e-245
Identities = 399/510 (78%), Positives = 462/510 (90%)
Frame = -1
Query: 1776 LRSVTATLFLCFLLFPLVFSASNDGLLRVGLKKRKFDQNNRVAANLYSKNGDAVTAVIRK 1597
+++ T TLFLC LLFP+VFS SN+ L+R+GLKKRKFDQN R+AA+L SK +A A ++K
Sbjct: 5 VKTTTVTLFLCLLLFPIVFSISNERLVRIGLKKRKFDQNYRLAAHLDSKEREAFRASLKK 64
Query: 1596 YNLRGTLGDDQDIDIVSLKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFS 1417
Y L+G L + +DIDIV+LKNY+DAQYFGEIGIGTPPQ FTVIFDTGSSNLWVPSSKCYFS
Sbjct: 65 YRLQGNLQESEDIDIVALKNYLDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS 124
Query: 1416 LACYLHPKYKSSSSTTYSKNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKE 1237
+ACYLH +YKSS S+TY NGKPA IQYGTGAISGFFSED+V VGDLVVK+QEFIEAT+E
Sbjct: 125 IACYLHSRYKSSRSSTYKANGKPADIQYGTGAISGFFSEDNVQVGDLVVKNQEFIEATRE 184
Query: 1236 PGITFLVAKFDGILGLGFQEISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADEEVGGEI 1057
P ITFLVAKFDGILGLGFQEISVGNAVPVWYNMV QGL+KEPVFSFWFNR+ ++++GGE+
Sbjct: 185 PSITFLVAKFDGILGLGFQEISVGNAVPVWYNMVNQGLVKEPVFSFWFNRDPEDDIGGEV 244
Query: 1056 VFGGVDPDHYVGEHTYVPVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSLLVGP 877
VFGG+DP H+ G+HTYVP+T+KGYWQFDMGDVLI QTTG CAGGC+AIADSGTSL+ GP
Sbjct: 245 VFGGMDPKHFKGDHTYVPITRKGYWQFDMGDVLIGNQTTGLCAGGCSAIADSGTSLITGP 304
Query: 876 TTIITELNHAIGATGIVSQECKTVVAEYGDTIIKMILAKDQPQKICSQIGLCTFDGTRGV 697
T II ++NHAIGA+G+VSQECKTVV++YG+TII M+L+KDQP KICSQIGLCTFDGTRGV
Sbjct: 305 TAIIAQVNHAIGASGVVSQECKTVVSQYGETIIDMLLSKDQPLKICSQIGLCTFDGTRGV 364
Query: 696 SVGIKSVVDEDNHKSSAGLSDAMCSACEMTVVWMQNQLKQNQTQDHILDYVNQLCDRLPS 517
S GI+SVV E+ K++ L DAMCS CEMTV+WMQNQLKQNQTQ+ IL+Y+N+LCDRLPS
Sbjct: 365 STGIESVVHENVGKATGDLHDAMCSTCEMTVIWMQNQLKQNQTQERILEYINELCDRLPS 424
Query: 516 PMGESAVDCAGLSSMPNVSFTIGGKQFDLAPEQYVLKVGEGEVAQCISGFTALDVPPPRG 337
PMGESAVDC+ LS+MPNVSFTIGGK F+L+PEQYVLKVGEG+VAQC+SGFTALDVPPPRG
Sbjct: 425 PMGESAVDCSSLSTMPNVSFTIGGKIFELSPEQYVLKVGEGDVAQCLSGFTALDVPPPRG 484
Query: 336 PLWILGDVFMGQFHTVFDYGNERIGFAEAA 247
PLWILGDVFMGQFHTVFDYGN ++GFAEAA
Sbjct: 485 PLWILGDVFMGQFHTVFDYGNLQVGFAEAA 514
>gi|21616053|emb|CAC86004.1| aspartic proteinase [Theobroma cacao]
Length = 514
Score = 826 bits (2132), Expect = 2e-237
Identities = 389/514 (75%), Positives = 446/514 (86%)
Frame = -1
Query: 1788 MEAELRSVTATLFLCFLLFPLVFSASNDGLLRVGLKKRKFDQNNRVAANLYSKNGDAVTA 1609
M ++ V +LF+ LLF +V S SNDGL+R+GLKK K D NNR+AA L SK+G+A+ A
Sbjct: 1 MGTTIKVVVLSLFISSLLFSVVSSVSNDGLVRIGLKKMKLDPNNRLAARLDSKDGEALRA 60
Query: 1608 VIRKYNLRGTLGDDQDIDIVSLKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 1429
I+KY R LGD ++ DIV+LKNYMDAQY+GEIGIGTP QKFTVIFDTGSSNLWV S+K
Sbjct: 61 FIKKYRFRNNLGDSEETDIVALKNYMDAQYYGEIGIGTPTQKFTVIFDTGSSNLWVSSTK 120
Query: 1428 CYFSLACYLHPKYKSSSSTTYSKNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIE 1249
CYFS+ACY H KYK+S S+TY K+GKPA+IQYGTGAISGFFS DHV VGDLVVKDQEFIE
Sbjct: 121 CYFSVACYFHEKYKASDSSTYKKDGKPASIQYGTGAISGFFSYDHVQVGDLVVKDQEFIE 180
Query: 1248 ATKEPGITFLVAKFDGILGLGFQEISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADEEV 1069
ATKEPG+TF+VAKFDGILGLGF+EISVG+AVPVWYNM+KQGL+KEPVFSFW NRN DEE
Sbjct: 181 ATKEPGLTFMVAKFDGILGLGFKEISVGDAVPVWYNMIKQGLIKEPVFSFWLNRNVDEEA 240
Query: 1068 GGEIVFGGVDPDHYVGEHTYVPVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSL 889
GGEIVFGGVDP+HY G+HTYVPVTQKGYWQFDMGDVLI + TG+CAG CAAIADSGTSL
Sbjct: 241 GGEIVFGGVDPNHYKGKHTYVPVTQKGYWQFDMGDVLIADKPTGYCAGSCAAIADSGTSL 300
Query: 888 LVGPTTIITELNHAIGATGIVSQECKTVVAEYGDTIIKMILAKDQPQKICSQIGLCTFDG 709
L GP+T+IT +NHAIGATG+VSQECK VV +YG TII +++A+ QPQKICSQIGLCTF+G
Sbjct: 301 LAGPSTVITMINHAIGATGVVSQECKAVVQQYGRTIIDLLIAEAQPQKICSQIGLCTFNG 360
Query: 708 TRGVSVGIKSVVDEDNHKSSAGLSDAMCSACEMTVVWMQNQLKQNQTQDHILDYVNQLCD 529
GVS GI+SVVDE N KSS L DAMC ACEM VVWMQNQ++QNQTQD IL YVN+LCD
Sbjct: 361 AHGVSTGIESVVDESNGKSSGVLRDAMCPACEMAVVWMQNQVRQNQTQDRILSYVNELCD 420
Query: 528 RLPSPMGESAVDCAGLSSMPNVSFTIGGKQFDLAPEQYVLKVGEGEVAQCISGFTALDVP 349
R+P+PMGESAVDC LSSMP +SFTIGGK FDL PE+Y+LKVGEG AQCISGFTALD+P
Sbjct: 421 RVPNPMGESAVDCGSLSSMPTISFTIGGKVFDLTPEEYILKVGEGSEAQCISGFTALDIP 480
Query: 348 PPRGPLWILGDVFMGQFHTVFDYGNERIGFAEAA 247
PPRGPLWILGD+FMG++HTVFD+G R+GFAEAA
Sbjct: 481 PPRGPLWILGDIFMGRYHTVFDFGKLRVGFAEAA 514
>gi|12231172|dbj|BAB20969.1| aspartic proteinase 1 [Nepenthes alata]
Length = 514
Score = 808 bits (2085), Expect = 5e-232
Identities = 384/513 (74%), Positives = 443/513 (86%)
Frame = -1
Query: 1788 MEAELRSVTATLFLCFLLFPLVFSASNDGLLRVGLKKRKFDQNNRVAANLYSKNGDAVTA 1609
M + RSV A+LFL LL PLV S+SND LLRVGLKKRK DQ NR ++ K +++
Sbjct: 1 MGSTSRSVLASLFLLLLLSPLVNSSSNDRLLRVGLKKRKLDQINRFSSLYGCKGKESINP 60
Query: 1608 VIRKYNLRGTLGDDQDIDIVSLKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 1429
IRKY L LG+ D DI+SLKNYM+AQYFGEIGIGTPPQKFT+IFDTGSSNLWVPS+K
Sbjct: 61 AIRKYGLGNGLGNSDDADIISLKNYMNAQYFGEIGIGTPPQKFTLIFDTGSSNLWVPSAK 120
Query: 1428 CYFSLACYLHPKYKSSSSTTYSKNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIE 1249
CYFS+ACY H KYKSS S++Y+KNGK A I YGTGAISGFFS+DHV +GDLVV++Q+FIE
Sbjct: 121 CYFSIACYFHSKYKSSLSSSYTKNGKSAEIHYGTGAISGFFSQDHVKLGDLVVENQDFIE 180
Query: 1248 ATKEPGITFLVAKFDGILGLGFQEISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADEEV 1069
AT+EP ITF+ AKFDGILGLGFQEISVGNAVPVWYNMVKQGL+ EPVFSFW NRNA EE
Sbjct: 181 ATREPSITFVAAKFDGILGLGFQEISVGNAVPVWYNMVKQGLVNEPVFSFWLNRNATEEE 240
Query: 1068 GGEIVFGGVDPDHYVGEHTYVPVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSL 889
GGEIVFGGVDP+HY GEHT+VPVT KGYWQFDM DVL+ G+TTG+C+GGC+AIADSGTSL
Sbjct: 241 GGEIVFGGVDPNHYKGEHTFVPVTHKGYWQFDMDDVLVGGETTGYCSGGCSAIADSGTSL 300
Query: 888 LVGPTTIITELNHAIGATGIVSQECKTVVAEYGDTIIKMILAKDQPQKICSQIGLCTFDG 709
L GPTTI+ ++NHAIGA+G+VSQECK VVA+YG I+ M++++ QP+KICSQIGLCTFDG
Sbjct: 301 LAGPTTIVAQINHAIGASGVVSQECKAVVAQYGTAILDMLISETQPKKICSQIGLCTFDG 360
Query: 708 TRGVSVGIKSVVDEDNHKSSAGLSDAMCSACEMTVVWMQNQLKQNQTQDHILDYVNQLCD 529
RGVSVGIKSVVD + SS+GL DA C+ACEMTVVWMQNQLKQNQT++ IL+YVN+LC+
Sbjct: 361 KRGVSVGIKSVVDMNVDGSSSGLQDATCTACEMTVVWMQNQLKQNQTEERILNYVNELCN 420
Query: 528 RLPSPMGESAVDCAGLSSMPNVSFTIGGKQFDLAPEQYVLKVGEGEVAQCISGFTALDVP 349
RLPSPMGESAVDC+ LSSMP VSFT+GGK FDL PEQY+L+VGEG QCISGFTALDV
Sbjct: 421 RLPSPMGESAVDCSSLSSMPGVSFTVGGKVFDLLPEQYILQVGEGVATQCISGFTALDVA 480
Query: 348 PPRGPLWILGDVFMGQFHTVFDYGNERIGFAEA 250
PP GPLWILGD+FMGQ+HTVFDYGN R+GFAEA
Sbjct: 481 PPLGPLWILGDIFMGQYHTVFDYGNMRVGFAEA 513
>gi|12231174|dbj|BAB20970.1| aspartic proteinase 2 [Nepenthes alata]
Length = 514
Score = 791 bits (2042), Expect = 5e-227
Identities = 375/514 (72%), Positives = 439/514 (85%)
Frame = -1
Query: 1788 MEAELRSVTATLFLCFLLFPLVFSASNDGLLRVGLKKRKFDQNNRVAANLYSKNGDAVTA 1609
M + TLFL LL PLV S S+D LLRVGLKKRK DQ NR++++ K + +
Sbjct: 1 MGRTFETALTTLFLLLLLSPLVTSLSSDRLLRVGLKKRKLDQINRLSSHYGCKGKGSTSP 60
Query: 1608 VIRKYNLRGTLGDDQDIDIVSLKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 1429
I K+ L LG+ D DI+SLKNYMDAQYFGEIGIG+PPQKFTVIFDTGSSNLWVPS+K
Sbjct: 61 SIWKHGLGNGLGNSDDADIISLKNYMDAQYFGEIGIGSPPQKFTVIFDTGSSNLWVPSAK 120
Query: 1428 CYFSLACYLHPKYKSSSSTTYSKNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIE 1249
CYFS+ACYLHPKYKS S+TY+KNGK AAI YGTGAISGFFS+DHV +GDLVV++Q+FIE
Sbjct: 121 CYFSIACYLHPKYKSFKSSTYAKNGKSAAIHYGTGAISGFFSQDHVKMGDLVVENQDFIE 180
Query: 1248 ATKEPGITFLVAKFDGILGLGFQEISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADEEV 1069
ATKEP ITF+ AKFDGILGLGFQEISVG+AVP WYNM+ QGL+ EPVFSFW NR ++EE
Sbjct: 181 ATKEPSITFVAAKFDGILGLGFQEISVGDAVPAWYNMIDQGLVNEPVFSFWLNRKSEEEE 240
Query: 1068 GGEIVFGGVDPDHYVGEHTYVPVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSL 889
GGEIVFGGVDP+HY GEHTYVPVT+KGYWQFDM DVL+ G+TTG+C+GGC+AIADSGTSL
Sbjct: 241 GGEIVFGGVDPNHYKGEHTYVPVTRKGYWQFDMDDVLVGGETTGYCSGGCSAIADSGTSL 300
Query: 888 LVGPTTIITELNHAIGATGIVSQECKTVVAEYGDTIIKMILAKDQPQKICSQIGLCTFDG 709
L GPTTII ++NHAIGA+G+VSQECK VV++YG I+ ++A+ QPQKICSQIGLCTFDG
Sbjct: 301 LAGPTTIIVQINHAIGASGLVSQECKAVVSQYGKAILDALVAEAQPQKICSQIGLCTFDG 360
Query: 708 TRGVSVGIKSVVDEDNHKSSAGLSDAMCSACEMTVVWMQNQLKQNQTQDHILDYVNQLCD 529
RGVS+GI+SVV+++ SS GL DAMC+ACEM VVWMQNQL+QN+T++ IL+YVN+LC+
Sbjct: 361 KRGVSMGIESVVEKNPGNSSDGLQDAMCTACEMAVVWMQNQLRQNRTEEQILNYVNELCN 420
Query: 528 RLPSPMGESAVDCAGLSSMPNVSFTIGGKQFDLAPEQYVLKVGEGEVAQCISGFTALDVP 349
RLPSPMGES+VDC LSSMPNVS TIGGK FDL+PE+YVLKVGEG AQCISGF ALD+
Sbjct: 421 RLPSPMGESSVDCGSLSSMPNVSLTIGGKVFDLSPEKYVLKVGEGVAAQCISGFIALDIA 480
Query: 348 PPRGPLWILGDVFMGQFHTVFDYGNERIGFAEAA 247
PPRGPLWILGD+FMGQ+HTVFDYGN +GFAEAA
Sbjct: 481 PPRGPLWILGDIFMGQYHTVFDYGNLSVGFAEAA 514
>gi|15186732|dbj|BAB62890.1| aspartic proteinase 1 [Glycine max]
Length = 514
Score = 777 bits (2006), Expect = 7e-223
Identities = 368/514 (71%), Positives = 430/514 (83%)
Frame = -1
Query: 1788 MEAELRSVTATLFLCFLLFPLVFSASNDGLLRVGLKKRKFDQNNRVAANLYSKNGDAVTA 1609
M + ++ L + LL V+ A N GL R+GLKK K D NR+AA + SK+ D+ A
Sbjct: 1 MGNRMNAIVLCLLVSTLLVSAVYCAPNAGLRRIGLKKIKLDPKNRLAARVGSKDVDSFRA 60
Query: 1608 VIRKYNLRGTLGDDQDIDIVSLKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 1429
IR+++L+ G ++ DIV+LKNY+DAQY+GEI IGT PQKF VIFDTGSSNLWVPSSK
Sbjct: 61 SIRQFHLQNNFGGTEETDIVALKNYLDAQYYGEIAIGTSPQKFAVIFDTGSSNLWVPSSK 120
Query: 1428 CYFSLACYLHPKYKSSSSTTYSKNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIE 1249
C FS+ACY H KYKSS S+T+ KNG AAIQYGTGAISGFFS D V VG++VVK+QEFIE
Sbjct: 121 CTFSVACYFHAKYKSSKSSTFKKNGTAAAIQYGTGAISGFFSYDSVRVGEIVVKNQEFIE 180
Query: 1248 ATKEPGITFLVAKFDGILGLGFQEISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADEEV 1069
AT+EPG+TFL AKFDGILGLGFQEISVGNA PVWYNMV QGLLKEPVFSFWFNRN +EE
Sbjct: 181 ATREPGVTFLAAKFDGILGLGFQEISVGNAAPVWYNMVDQGLLKEPVFSFWFNRNPEEEE 240
Query: 1068 GGEIVFGGVDPDHYVGEHTYVPVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSL 889
GGEIVFGGVDP HY G+HTYVPVT+KGYWQFDMGDVLI G+ TG+CA GC+AIADSGTSL
Sbjct: 241 GGEIVFGGVDPAHYKGKHTYVPVTRKGYWQFDMGDVLIGGKPTGYCANGCSAIADSGTSL 300
Query: 888 LVGPTTIITELNHAIGATGIVSQECKTVVAEYGDTIIKMILAKDQPQKICSQIGLCTFDG 709
L GPTT+IT +NHAIGA+G++SQECKT+VAEYG TI+ ++LA+ QP+KICS+IGLC FDG
Sbjct: 301 LAGPTTVITMINHAIGASGVMSQECKTIVAEYGQTILDLLLAETQPKKICSRIGLCAFDG 360
Query: 708 TRGVSVGIKSVVDEDNHKSSAGLSDAMCSACEMTVVWMQNQLKQNQTQDHILDYVNQLCD 529
T GV VGIKSVVDE+ KS G A C ACEM VVWMQNQL +NQTQD IL Y+NQLCD
Sbjct: 361 THGVDVGIKSVVDENERKSLGGHHGAACPACEMAVVWMQNQLSRNQTQDQILSYINQLCD 420
Query: 528 RLPSPMGESAVDCAGLSSMPNVSFTIGGKQFDLAPEQYVLKVGEGEVAQCISGFTALDVP 349
++PSPMGESAVDC +SS+P VSFTIGG+ FDL+PE+YVLKVGEG VAQCISGFTA+D+P
Sbjct: 421 KMPSPMGESAVDCGNISSLPVVSFTIGGRTFDLSPEEYVLKVGEGPVAQCISGFTAIDIP 480
Query: 348 PPRGPLWILGDVFMGQFHTVFDYGNERIGFAEAA 247
PPRGPLWILGDVFMG++HTVFD+G R+GFA+AA
Sbjct: 481 PPRGPLWILGDVFMGRYHTVFDFGKLRVGFADAA 514
Database: GenBank nr
Posted date: Wed Dec 03 02:04:20 2008
Number of letters in database: 2,551,671,255
Number of sequences in database: 7,387,702
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 121,122,045,638
Number of Sequences: 7387702
Number of Extensions: 121122045638
Number of Successful Extensions: 65962939
Number of sequences better than 0.0: 0
|