BLASTX 7.6.2 Query= RU03723 /QuerySize=1843 (1842 letters) Database: GenBank nr; 7,387,702 sequences; 2,551,671,255 total letters Score E Sequences producing significant alignments: (bits) Value gi|21616051|emb|CAC86003.1| aspartic proteinase [Theobroma cacao] 852 2e-245 gi|21616053|emb|CAC86004.1| aspartic proteinase [Theobroma cacao] 826 2e-237 gi|12231172|dbj|BAB20969.1| aspartic proteinase 1 [Nepenthes alata] 808 5e-232 gi|12231174|dbj|BAB20970.1| aspartic proteinase 2 [Nepenthes alata] 791 5e-227 gi|15186732|dbj|BAB62890.1| aspartic proteinase 1 [Glycine max] 777 7e-223 >gi|21616051|emb|CAC86003.1| aspartic proteinase [Theobroma cacao] Length = 514 Score = 852 bits (2201), Expect = 2e-245 Identities = 399/510 (78%), Positives = 462/510 (90%) Frame = -1 Query: 1776 LRSVTATLFLCFLLFPLVFSASNDGLLRVGLKKRKFDQNNRVAANLYSKNGDAVTAVIRK 1597 +++ T TLFLC LLFP+VFS SN+ L+R+GLKKRKFDQN R+AA+L SK +A A ++K Sbjct: 5 VKTTTVTLFLCLLLFPIVFSISNERLVRIGLKKRKFDQNYRLAAHLDSKEREAFRASLKK 64 Query: 1596 YNLRGTLGDDQDIDIVSLKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFS 1417 Y L+G L + +DIDIV+LKNY+DAQYFGEIGIGTPPQ FTVIFDTGSSNLWVPSSKCYFS Sbjct: 65 YRLQGNLQESEDIDIVALKNYLDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS 124 Query: 1416 LACYLHPKYKSSSSTTYSKNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKE 1237 +ACYLH +YKSS S+TY NGKPA IQYGTGAISGFFSED+V VGDLVVK+QEFIEAT+E Sbjct: 125 IACYLHSRYKSSRSSTYKANGKPADIQYGTGAISGFFSEDNVQVGDLVVKNQEFIEATRE 184 Query: 1236 PGITFLVAKFDGILGLGFQEISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADEEVGGEI 1057 P ITFLVAKFDGILGLGFQEISVGNAVPVWYNMV QGL+KEPVFSFWFNR+ ++++GGE+ Sbjct: 185 PSITFLVAKFDGILGLGFQEISVGNAVPVWYNMVNQGLVKEPVFSFWFNRDPEDDIGGEV 244 Query: 1056 VFGGVDPDHYVGEHTYVPVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSLLVGP 877 VFGG+DP H+ G+HTYVP+T+KGYWQFDMGDVLI QTTG CAGGC+AIADSGTSL+ GP Sbjct: 245 VFGGMDPKHFKGDHTYVPITRKGYWQFDMGDVLIGNQTTGLCAGGCSAIADSGTSLITGP 304 Query: 876 TTIITELNHAIGATGIVSQECKTVVAEYGDTIIKMILAKDQPQKICSQIGLCTFDGTRGV 697 T II ++NHAIGA+G+VSQECKTVV++YG+TII M+L+KDQP KICSQIGLCTFDGTRGV Sbjct: 305 TAIIAQVNHAIGASGVVSQECKTVVSQYGETIIDMLLSKDQPLKICSQIGLCTFDGTRGV 364 Query: 696 SVGIKSVVDEDNHKSSAGLSDAMCSACEMTVVWMQNQLKQNQTQDHILDYVNQLCDRLPS 517 S GI+SVV E+ K++ L DAMCS CEMTV+WMQNQLKQNQTQ+ IL+Y+N+LCDRLPS Sbjct: 365 STGIESVVHENVGKATGDLHDAMCSTCEMTVIWMQNQLKQNQTQERILEYINELCDRLPS 424 Query: 516 PMGESAVDCAGLSSMPNVSFTIGGKQFDLAPEQYVLKVGEGEVAQCISGFTALDVPPPRG 337 PMGESAVDC+ LS+MPNVSFTIGGK F+L+PEQYVLKVGEG+VAQC+SGFTALDVPPPRG Sbjct: 425 PMGESAVDCSSLSTMPNVSFTIGGKIFELSPEQYVLKVGEGDVAQCLSGFTALDVPPPRG 484 Query: 336 PLWILGDVFMGQFHTVFDYGNERIGFAEAA 247 PLWILGDVFMGQFHTVFDYGN ++GFAEAA Sbjct: 485 PLWILGDVFMGQFHTVFDYGNLQVGFAEAA 514 >gi|21616053|emb|CAC86004.1| aspartic proteinase [Theobroma cacao] Length = 514 Score = 826 bits (2132), Expect = 2e-237 Identities = 389/514 (75%), Positives = 446/514 (86%) Frame = -1 Query: 1788 MEAELRSVTATLFLCFLLFPLVFSASNDGLLRVGLKKRKFDQNNRVAANLYSKNGDAVTA 1609 M ++ V +LF+ LLF +V S SNDGL+R+GLKK K D NNR+AA L SK+G+A+ A Sbjct: 1 MGTTIKVVVLSLFISSLLFSVVSSVSNDGLVRIGLKKMKLDPNNRLAARLDSKDGEALRA 60 Query: 1608 VIRKYNLRGTLGDDQDIDIVSLKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 1429 I+KY R LGD ++ DIV+LKNYMDAQY+GEIGIGTP QKFTVIFDTGSSNLWV S+K Sbjct: 61 FIKKYRFRNNLGDSEETDIVALKNYMDAQYYGEIGIGTPTQKFTVIFDTGSSNLWVSSTK 120 Query: 1428 CYFSLACYLHPKYKSSSSTTYSKNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIE 1249 CYFS+ACY H KYK+S S+TY K+GKPA+IQYGTGAISGFFS DHV VGDLVVKDQEFIE Sbjct: 121 CYFSVACYFHEKYKASDSSTYKKDGKPASIQYGTGAISGFFSYDHVQVGDLVVKDQEFIE 180 Query: 1248 ATKEPGITFLVAKFDGILGLGFQEISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADEEV 1069 ATKEPG+TF+VAKFDGILGLGF+EISVG+AVPVWYNM+KQGL+KEPVFSFW NRN DEE Sbjct: 181 ATKEPGLTFMVAKFDGILGLGFKEISVGDAVPVWYNMIKQGLIKEPVFSFWLNRNVDEEA 240 Query: 1068 GGEIVFGGVDPDHYVGEHTYVPVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSL 889 GGEIVFGGVDP+HY G+HTYVPVTQKGYWQFDMGDVLI + TG+CAG CAAIADSGTSL Sbjct: 241 GGEIVFGGVDPNHYKGKHTYVPVTQKGYWQFDMGDVLIADKPTGYCAGSCAAIADSGTSL 300 Query: 888 LVGPTTIITELNHAIGATGIVSQECKTVVAEYGDTIIKMILAKDQPQKICSQIGLCTFDG 709 L GP+T+IT +NHAIGATG+VSQECK VV +YG TII +++A+ QPQKICSQIGLCTF+G Sbjct: 301 LAGPSTVITMINHAIGATGVVSQECKAVVQQYGRTIIDLLIAEAQPQKICSQIGLCTFNG 360 Query: 708 TRGVSVGIKSVVDEDNHKSSAGLSDAMCSACEMTVVWMQNQLKQNQTQDHILDYVNQLCD 529 GVS GI+SVVDE N KSS L DAMC ACEM VVWMQNQ++QNQTQD IL YVN+LCD Sbjct: 361 AHGVSTGIESVVDESNGKSSGVLRDAMCPACEMAVVWMQNQVRQNQTQDRILSYVNELCD 420 Query: 528 RLPSPMGESAVDCAGLSSMPNVSFTIGGKQFDLAPEQYVLKVGEGEVAQCISGFTALDVP 349 R+P+PMGESAVDC LSSMP +SFTIGGK FDL PE+Y+LKVGEG AQCISGFTALD+P Sbjct: 421 RVPNPMGESAVDCGSLSSMPTISFTIGGKVFDLTPEEYILKVGEGSEAQCISGFTALDIP 480 Query: 348 PPRGPLWILGDVFMGQFHTVFDYGNERIGFAEAA 247 PPRGPLWILGD+FMG++HTVFD+G R+GFAEAA Sbjct: 481 PPRGPLWILGDIFMGRYHTVFDFGKLRVGFAEAA 514 >gi|12231172|dbj|BAB20969.1| aspartic proteinase 1 [Nepenthes alata] Length = 514 Score = 808 bits (2085), Expect = 5e-232 Identities = 384/513 (74%), Positives = 443/513 (86%) Frame = -1 Query: 1788 MEAELRSVTATLFLCFLLFPLVFSASNDGLLRVGLKKRKFDQNNRVAANLYSKNGDAVTA 1609 M + RSV A+LFL LL PLV S+SND LLRVGLKKRK DQ NR ++ K +++ Sbjct: 1 MGSTSRSVLASLFLLLLLSPLVNSSSNDRLLRVGLKKRKLDQINRFSSLYGCKGKESINP 60 Query: 1608 VIRKYNLRGTLGDDQDIDIVSLKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 1429 IRKY L LG+ D DI+SLKNYM+AQYFGEIGIGTPPQKFT+IFDTGSSNLWVPS+K Sbjct: 61 AIRKYGLGNGLGNSDDADIISLKNYMNAQYFGEIGIGTPPQKFTLIFDTGSSNLWVPSAK 120 Query: 1428 CYFSLACYLHPKYKSSSSTTYSKNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIE 1249 CYFS+ACY H KYKSS S++Y+KNGK A I YGTGAISGFFS+DHV +GDLVV++Q+FIE Sbjct: 121 CYFSIACYFHSKYKSSLSSSYTKNGKSAEIHYGTGAISGFFSQDHVKLGDLVVENQDFIE 180 Query: 1248 ATKEPGITFLVAKFDGILGLGFQEISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADEEV 1069 AT+EP ITF+ AKFDGILGLGFQEISVGNAVPVWYNMVKQGL+ EPVFSFW NRNA EE Sbjct: 181 ATREPSITFVAAKFDGILGLGFQEISVGNAVPVWYNMVKQGLVNEPVFSFWLNRNATEEE 240 Query: 1068 GGEIVFGGVDPDHYVGEHTYVPVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSL 889 GGEIVFGGVDP+HY GEHT+VPVT KGYWQFDM DVL+ G+TTG+C+GGC+AIADSGTSL Sbjct: 241 GGEIVFGGVDPNHYKGEHTFVPVTHKGYWQFDMDDVLVGGETTGYCSGGCSAIADSGTSL 300 Query: 888 LVGPTTIITELNHAIGATGIVSQECKTVVAEYGDTIIKMILAKDQPQKICSQIGLCTFDG 709 L GPTTI+ ++NHAIGA+G+VSQECK VVA+YG I+ M++++ QP+KICSQIGLCTFDG Sbjct: 301 LAGPTTIVAQINHAIGASGVVSQECKAVVAQYGTAILDMLISETQPKKICSQIGLCTFDG 360 Query: 708 TRGVSVGIKSVVDEDNHKSSAGLSDAMCSACEMTVVWMQNQLKQNQTQDHILDYVNQLCD 529 RGVSVGIKSVVD + SS+GL DA C+ACEMTVVWMQNQLKQNQT++ IL+YVN+LC+ Sbjct: 361 KRGVSVGIKSVVDMNVDGSSSGLQDATCTACEMTVVWMQNQLKQNQTEERILNYVNELCN 420 Query: 528 RLPSPMGESAVDCAGLSSMPNVSFTIGGKQFDLAPEQYVLKVGEGEVAQCISGFTALDVP 349 RLPSPMGESAVDC+ LSSMP VSFT+GGK FDL PEQY+L+VGEG QCISGFTALDV Sbjct: 421 RLPSPMGESAVDCSSLSSMPGVSFTVGGKVFDLLPEQYILQVGEGVATQCISGFTALDVA 480 Query: 348 PPRGPLWILGDVFMGQFHTVFDYGNERIGFAEA 250 PP GPLWILGD+FMGQ+HTVFDYGN R+GFAEA Sbjct: 481 PPLGPLWILGDIFMGQYHTVFDYGNMRVGFAEA 513 >gi|12231174|dbj|BAB20970.1| aspartic proteinase 2 [Nepenthes alata] Length = 514 Score = 791 bits (2042), Expect = 5e-227 Identities = 375/514 (72%), Positives = 439/514 (85%) Frame = -1 Query: 1788 MEAELRSVTATLFLCFLLFPLVFSASNDGLLRVGLKKRKFDQNNRVAANLYSKNGDAVTA 1609 M + TLFL LL PLV S S+D LLRVGLKKRK DQ NR++++ K + + Sbjct: 1 MGRTFETALTTLFLLLLLSPLVTSLSSDRLLRVGLKKRKLDQINRLSSHYGCKGKGSTSP 60 Query: 1608 VIRKYNLRGTLGDDQDIDIVSLKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 1429 I K+ L LG+ D DI+SLKNYMDAQYFGEIGIG+PPQKFTVIFDTGSSNLWVPS+K Sbjct: 61 SIWKHGLGNGLGNSDDADIISLKNYMDAQYFGEIGIGSPPQKFTVIFDTGSSNLWVPSAK 120 Query: 1428 CYFSLACYLHPKYKSSSSTTYSKNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIE 1249 CYFS+ACYLHPKYKS S+TY+KNGK AAI YGTGAISGFFS+DHV +GDLVV++Q+FIE Sbjct: 121 CYFSIACYLHPKYKSFKSSTYAKNGKSAAIHYGTGAISGFFSQDHVKMGDLVVENQDFIE 180 Query: 1248 ATKEPGITFLVAKFDGILGLGFQEISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADEEV 1069 ATKEP ITF+ AKFDGILGLGFQEISVG+AVP WYNM+ QGL+ EPVFSFW NR ++EE Sbjct: 181 ATKEPSITFVAAKFDGILGLGFQEISVGDAVPAWYNMIDQGLVNEPVFSFWLNRKSEEEE 240 Query: 1068 GGEIVFGGVDPDHYVGEHTYVPVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSL 889 GGEIVFGGVDP+HY GEHTYVPVT+KGYWQFDM DVL+ G+TTG+C+GGC+AIADSGTSL Sbjct: 241 GGEIVFGGVDPNHYKGEHTYVPVTRKGYWQFDMDDVLVGGETTGYCSGGCSAIADSGTSL 300 Query: 888 LVGPTTIITELNHAIGATGIVSQECKTVVAEYGDTIIKMILAKDQPQKICSQIGLCTFDG 709 L GPTTII ++NHAIGA+G+VSQECK VV++YG I+ ++A+ QPQKICSQIGLCTFDG Sbjct: 301 LAGPTTIIVQINHAIGASGLVSQECKAVVSQYGKAILDALVAEAQPQKICSQIGLCTFDG 360 Query: 708 TRGVSVGIKSVVDEDNHKSSAGLSDAMCSACEMTVVWMQNQLKQNQTQDHILDYVNQLCD 529 RGVS+GI+SVV+++ SS GL DAMC+ACEM VVWMQNQL+QN+T++ IL+YVN+LC+ Sbjct: 361 KRGVSMGIESVVEKNPGNSSDGLQDAMCTACEMAVVWMQNQLRQNRTEEQILNYVNELCN 420 Query: 528 RLPSPMGESAVDCAGLSSMPNVSFTIGGKQFDLAPEQYVLKVGEGEVAQCISGFTALDVP 349 RLPSPMGES+VDC LSSMPNVS TIGGK FDL+PE+YVLKVGEG AQCISGF ALD+ Sbjct: 421 RLPSPMGESSVDCGSLSSMPNVSLTIGGKVFDLSPEKYVLKVGEGVAAQCISGFIALDIA 480 Query: 348 PPRGPLWILGDVFMGQFHTVFDYGNERIGFAEAA 247 PPRGPLWILGD+FMGQ+HTVFDYGN +GFAEAA Sbjct: 481 PPRGPLWILGDIFMGQYHTVFDYGNLSVGFAEAA 514 >gi|15186732|dbj|BAB62890.1| aspartic proteinase 1 [Glycine max] Length = 514 Score = 777 bits (2006), Expect = 7e-223 Identities = 368/514 (71%), Positives = 430/514 (83%) Frame = -1 Query: 1788 MEAELRSVTATLFLCFLLFPLVFSASNDGLLRVGLKKRKFDQNNRVAANLYSKNGDAVTA 1609 M + ++ L + LL V+ A N GL R+GLKK K D NR+AA + SK+ D+ A Sbjct: 1 MGNRMNAIVLCLLVSTLLVSAVYCAPNAGLRRIGLKKIKLDPKNRLAARVGSKDVDSFRA 60 Query: 1608 VIRKYNLRGTLGDDQDIDIVSLKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSK 1429 IR+++L+ G ++ DIV+LKNY+DAQY+GEI IGT PQKF VIFDTGSSNLWVPSSK Sbjct: 61 SIRQFHLQNNFGGTEETDIVALKNYLDAQYYGEIAIGTSPQKFAVIFDTGSSNLWVPSSK 120 Query: 1428 CYFSLACYLHPKYKSSSSTTYSKNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIE 1249 C FS+ACY H KYKSS S+T+ KNG AAIQYGTGAISGFFS D V VG++VVK+QEFIE Sbjct: 121 CTFSVACYFHAKYKSSKSSTFKKNGTAAAIQYGTGAISGFFSYDSVRVGEIVVKNQEFIE 180 Query: 1248 ATKEPGITFLVAKFDGILGLGFQEISVGNAVPVWYNMVKQGLLKEPVFSFWFNRNADEEV 1069 AT+EPG+TFL AKFDGILGLGFQEISVGNA PVWYNMV QGLLKEPVFSFWFNRN +EE Sbjct: 181 ATREPGVTFLAAKFDGILGLGFQEISVGNAAPVWYNMVDQGLLKEPVFSFWFNRNPEEEE 240 Query: 1068 GGEIVFGGVDPDHYVGEHTYVPVTQKGYWQFDMGDVLIDGQTTGFCAGGCAAIADSGTSL 889 GGEIVFGGVDP HY G+HTYVPVT+KGYWQFDMGDVLI G+ TG+CA GC+AIADSGTSL Sbjct: 241 GGEIVFGGVDPAHYKGKHTYVPVTRKGYWQFDMGDVLIGGKPTGYCANGCSAIADSGTSL 300 Query: 888 LVGPTTIITELNHAIGATGIVSQECKTVVAEYGDTIIKMILAKDQPQKICSQIGLCTFDG 709 L GPTT+IT +NHAIGA+G++SQECKT+VAEYG TI+ ++LA+ QP+KICS+IGLC FDG Sbjct: 301 LAGPTTVITMINHAIGASGVMSQECKTIVAEYGQTILDLLLAETQPKKICSRIGLCAFDG 360 Query: 708 TRGVSVGIKSVVDEDNHKSSAGLSDAMCSACEMTVVWMQNQLKQNQTQDHILDYVNQLCD 529 T GV VGIKSVVDE+ KS G A C ACEM VVWMQNQL +NQTQD IL Y+NQLCD Sbjct: 361 THGVDVGIKSVVDENERKSLGGHHGAACPACEMAVVWMQNQLSRNQTQDQILSYINQLCD 420 Query: 528 RLPSPMGESAVDCAGLSSMPNVSFTIGGKQFDLAPEQYVLKVGEGEVAQCISGFTALDVP 349 ++PSPMGESAVDC +SS+P VSFTIGG+ FDL+PE+YVLKVGEG VAQCISGFTA+D+P Sbjct: 421 KMPSPMGESAVDCGNISSLPVVSFTIGGRTFDLSPEEYVLKVGEGPVAQCISGFTAIDIP 480 Query: 348 PPRGPLWILGDVFMGQFHTVFDYGNERIGFAEAA 247 PPRGPLWILGDVFMG++HTVFD+G R+GFA+AA Sbjct: 481 PPRGPLWILGDVFMGRYHTVFDFGKLRVGFADAA 514 Database: GenBank nr Posted date: Wed Dec 03 02:04:20 2008 Number of letters in database: 2,551,671,255 Number of sequences in database: 7,387,702 Lambda K H 0.267 0.041 0.140 Gapped Lambda K H 0.267 0.041 0.140 Matrix: blosum62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 121,122,045,638 Number of Sequences: 7387702 Number of Extensions: 121122045638 Number of Successful Extensions: 65962939 Number of sequences better than 0.0: 0 |