BLASTX 7.6.2
Query= UN00487 /QuerySize=992
(991 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|297799382|ref|XP_002867575.1| hypothetical protein ARALYDRAFT... 136 5e-030
gi|4538948|emb|CAB39684.1| putative protein [Arabidopsis thaliana] 133 6e-029
gi|110738750|dbj|BAF01299.1| hypothetical protein [Arabidopsis t... 133 6e-029
gi|334186930|ref|NP_194349.2| haloacid dehalogenase-like hydrola... 133 6e-029
gi|237830605|ref|XP_002364600.1| IQ calmodulin-binding motif dom... 60 4e-007
gi|307201083|gb|EFN81015.1| hypothetical protein EAI_12862 [Harp... 57 4e-006
gi|158290594|ref|XP_312188.4| AGAP002737-PA [Anopheles gambiae s... 57 5e-006
gi|333467991|gb|EAA07771.5| AGAP002737-PA [Anopheles gambiae str... 57 5e-006
gi|291224326|ref|XP_002732156.1| PREDICTED: hypothetical protein... 56 9e-006
>gi|297799382|ref|XP_002867575.1| hypothetical protein ARALYDRAFT_492193
[Arabidopsis lyrata subsp. lyrata]
Length = 1061
Score = 136 bits (342), Expect = 5e-030
Identities = 85/182 (46%), Positives = 118/182 (64%), Gaps = 16/182 (8%)
Frame = +2
Query: 344 EDSCQQE-AQVSCEKPE----EDSQAGAIDMTSSERKKKRRKKKKGITDTNAPRENGVD- 505
+DSC++ A C P EDSQ A +MTSS KK+RKKK+G NA +E+GVD
Sbjct: 178 QDSCEKPGAAQLCTDPNLSTWEDSQPDATNMTSS--PKKKRKKKRG---RNALKESGVDT 232
Query: 506 --TDSEVATKRDARQSSEAQGSKLKSTEEKIVESTVVNGCLKPKDDTVDQQEGADAKADD 679
T++EV K + +EA+GS+ S EE + S ++N C K K DT +Q EG D K ++
Sbjct: 233 NTTNAEVVVKYNT--ITEAEGSRSMSIEENSITSVLINSCPKSKVDTGEQLEGNDVKINE 290
Query: 680 TVSETQNPRSKKKKRRKTKTTEVCDALENTLATTMESGSVECVEISAGECKVKTKKTEEK 859
TVS+T++P++KK+K++KTKT +VCD L NTL T+M+SG VECVE + G K TE K
Sbjct: 291 TVSQTESPKAKKRKKKKTKTMDVCDQLGNTLPTSMKSGPVECVENNDGN-KETDGTTEVK 349
Query: 860 EE 865
E+
Sbjct: 350 ED 351
Score = 116 bits (288), Expect = 1e-023
Identities = 64/172 (37%), Positives = 98/172 (56%), Gaps = 7/172 (4%)
Frame = +2
Query: 326 SNIRIQEDSCQQEAQVSCEKPEEDSQAGAIDMTSSERKKKRRKKKKGITDTNAPRENGVD 505
++++I E Q E+ + K + + +D+ K G + + +
Sbjct: 284 NDVKINETVSQTESPKA--KKRKKKKTKTMDVCDQLGNTLPTSMKSGPVECVENNDGNKE 341
Query: 506 TDSEVATKRDARQ-----SSEAQGSKLKSTEEKIVESTVVNGCLKPKDDTVDQQEGADAK 670
TD K D + SEA+GS+ + EEK S V++ CLK KDDTV+QQE D K
Sbjct: 342 TDGTTEVKEDVLEVKYDIISEAEGSRSMTKEEKSAASVVISSCLKSKDDTVEQQECTDVK 401
Query: 671 ADDTVSETQNPRSKKKKRRKTKTTEVCDALENTLATTMESGSVECVEISAGE 826
++TV++TQNP++K++K+RKTKT E CD L NTL+T+ +SG VECVE + G+
Sbjct: 402 LNETVAQTQNPKAKRRKKRKTKTLEDCDPLGNTLSTSTKSGPVECVENNDGD 453
>gi|4538948|emb|CAB39684.1| putative protein [Arabidopsis thaliana]
Length = 1067
Score = 133 bits (333), Expect = 6e-029
Identities = 79/184 (42%), Positives = 117/184 (63%), Gaps = 16/184 (8%)
Frame = +2
Query: 338 IQEDSCQQE-AQVSCEKPE----EDSQAGAIDMTSSERKKKRRKKKKGITDTNAPRENGV 502
+ +DSC++ A C P +DS A +MTSS ++K+ +++ D N +E+GV
Sbjct: 187 LTQDSCEKPGAAQICTDPNLSTCKDSLPDATNMTSSSKRKRNKRR-----DRNVLKESGV 241
Query: 503 D---TDSEVATKRDARQSSEAQGSKLKSTEEKIVESTVVNGCLKPKDDTVDQQEGADAKA 673
D T++EVA + ++EA+GS+ S EE V S ++NGCLK K DTV+Q +G D +
Sbjct: 242 DIGSTNAEVAVTDNT--TTEAEGSRSMSIEENSVASVLINGCLKSKVDTVEQLDGTDVQI 299
Query: 674 DDTVSETQNPRSKKKKRRKTKTTEVCDALENTLATTMESGSVECVEISAGECKVKTKKTE 853
++TV +TQ+ ++KK+K++KTKT E CD L NTL T+ ESG VECVE + G K K TE
Sbjct: 300 NETVFQTQSTKAKKRKKKKTKTMEACDPLGNTLPTSTESGPVECVENNDGN-KEKDGNTE 358
Query: 854 EKEE 865
KE+
Sbjct: 359 VKED 362
Score = 109 bits (271), Expect = 9e-022
Identities = 58/135 (42%), Positives = 83/135 (61%), Gaps = 5/135 (3%)
Frame = +2
Query: 503 DTDSEVATKRDARQ-----SSEAQGSKLKSTEEKIVESTVVNGCLKPKDDTVDQQEGADA 667
+ D K D R+ SEA+GSK + EEK V S V++ CLK KDDTV+Q+E D
Sbjct: 352 EKDGNTEVKEDVREVKYDTISEAEGSKSTTKEEKSVASVVISSCLKSKDDTVEQKECTDV 411
Query: 668 KADDTVSETQNPRSKKKKRRKTKTTEVCDALENTLATTMESGSVECVEISAGECKVKTKK 847
+TV++TQ+P++K++K+RKT+T EVCD L NTL+T+M+SG VE VE + G +
Sbjct: 412 NISETVAQTQDPKAKRRKKRKTETIEVCDPLGNTLSTSMKSGPVERVENNDGNGGRELIS 471
Query: 848 TEEKEEENVTDRSRN 892
+ EN + N
Sbjct: 472 YSASQTENYVNGEEN 486
>gi|110738750|dbj|BAF01299.1| hypothetical protein [Arabidopsis thaliana]
Length = 524
Score = 133 bits (333), Expect = 6e-029
Identities = 79/184 (42%), Positives = 117/184 (63%), Gaps = 16/184 (8%)
Frame = +2
Query: 338 IQEDSCQQE-AQVSCEKPE----EDSQAGAIDMTSSERKKKRRKKKKGITDTNAPRENGV 502
+ +DSC++ A C P +DS A +MTSS ++K+ +++ D N +E+GV
Sbjct: 187 LTQDSCEKPGAAQICTDPNLSTCKDSLPDATNMTSSSKRKRNKRR-----DRNVLKESGV 241
Query: 503 D---TDSEVATKRDARQSSEAQGSKLKSTEEKIVESTVVNGCLKPKDDTVDQQEGADAKA 673
D T++EVA + ++EA+GS+ S EE V S ++NGCLK K DTV+Q +G D +
Sbjct: 242 DIGSTNAEVAVTDNT--TTEAEGSRSMSIEENSVASVLINGCLKSKVDTVEQLDGTDVQI 299
Query: 674 DDTVSETQNPRSKKKKRRKTKTTEVCDALENTLATTMESGSVECVEISAGECKVKTKKTE 853
++TV +TQ+ ++KK+K++KTKT E CD L NTL T+ ESG VECVE + G K K TE
Sbjct: 300 NETVFQTQSTKAKKRKKKKTKTMEACDPLGNTLPTSTESGPVECVENNDGN-KEKDGNTE 358
Query: 854 EKEE 865
KE+
Sbjct: 359 VKED 362
Score = 109 bits (271), Expect = 9e-022
Identities = 58/135 (42%), Positives = 83/135 (61%), Gaps = 5/135 (3%)
Frame = +2
Query: 503 DTDSEVATKRDARQ-----SSEAQGSKLKSTEEKIVESTVVNGCLKPKDDTVDQQEGADA 667
+ D K D R+ SEA+GSK + EEK V S V++ CLK KDDTV+Q+E D
Sbjct: 352 EKDGNTEVKEDVREVKYDTISEAEGSKSTTKEEKSVASVVISSCLKSKDDTVEQKECTDV 411
Query: 668 KADDTVSETQNPRSKKKKRRKTKTTEVCDALENTLATTMESGSVECVEISAGECKVKTKK 847
+TV++TQ+P++K++K+RKT+T EVCD L NTL+T+M+SG VE VE + G +
Sbjct: 412 NISETVAQTQDPKAKRRKKRKTETIEVCDPLGNTLSTSMKSGPVERVENNDGNGGRELIS 471
Query: 848 TEEKEEENVTDRSRN 892
+ EN + N
Sbjct: 472 YSASQTENYVNGEEN 486
>gi|334186930|ref|NP_194349.2| haloacid dehalogenase-like hydrolase
domain-containing protein [Arabidopsis thaliana]
Length = 1057
Score = 133 bits (333), Expect = 6e-029
Identities = 79/184 (42%), Positives = 117/184 (63%), Gaps = 16/184 (8%)
Frame = +2
Query: 338 IQEDSCQQE-AQVSCEKPE----EDSQAGAIDMTSSERKKKRRKKKKGITDTNAPRENGV 502
+ +DSC++ A C P +DS A +MTSS ++K+ +++ D N +E+GV
Sbjct: 187 LTQDSCEKPGAAQICTDPNLSTCKDSLPDATNMTSSSKRKRNKRR-----DRNVLKESGV 241
Query: 503 D---TDSEVATKRDARQSSEAQGSKLKSTEEKIVESTVVNGCLKPKDDTVDQQEGADAKA 673
D T++EVA + ++EA+GS+ S EE V S ++NGCLK K DTV+Q +G D +
Sbjct: 242 DIGSTNAEVAVTDNT--TTEAEGSRSMSIEENSVASVLINGCLKSKVDTVEQLDGTDVQI 299
Query: 674 DDTVSETQNPRSKKKKRRKTKTTEVCDALENTLATTMESGSVECVEISAGECKVKTKKTE 853
++TV +TQ+ ++KK+K++KTKT E CD L NTL T+ ESG VECVE + G K K TE
Sbjct: 300 NETVFQTQSTKAKKRKKKKTKTMEACDPLGNTLPTSTESGPVECVENNDGN-KEKDGNTE 358
Query: 854 EKEE 865
KE+
Sbjct: 359 VKED 362
Score = 109 bits (271), Expect = 9e-022
Identities = 58/135 (42%), Positives = 83/135 (61%), Gaps = 5/135 (3%)
Frame = +2
Query: 503 DTDSEVATKRDARQ-----SSEAQGSKLKSTEEKIVESTVVNGCLKPKDDTVDQQEGADA 667
+ D K D R+ SEA+GSK + EEK V S V++ CLK KDDTV+Q+E D
Sbjct: 352 EKDGNTEVKEDVREVKYDTISEAEGSKSTTKEEKSVASVVISSCLKSKDDTVEQKECTDV 411
Query: 668 KADDTVSETQNPRSKKKKRRKTKTTEVCDALENTLATTMESGSVECVEISAGECKVKTKK 847
+TV++TQ+P++K++K+RKT+T EVCD L NTL+T+M+SG VE VE + G +
Sbjct: 412 NISETVAQTQDPKAKRRKKRKTETIEVCDPLGNTLSTSMKSGPVERVENNDGNGGRELIS 471
Query: 848 TEEKEEENVTDRSRN 892
+ EN + N
Sbjct: 472 YSASQTENYVNGEEN 486
>gi|237830605|ref|XP_002364600.1| IQ calmodulin-binding motif domain-containing
protein [Toxoplasma gondii ME49]
Length = 2403
Score = 60 bits (145), Expect = 4e-007
Identities = 48/237 (20%), Positives = 104/237 (43%), Gaps = 4/237 (1%)
Frame = +2
Query: 206 EKKSKRRKKKKKKEEAEKTLDDVSEKNTEEDQLHLHKNPLSNIRIQEDSCQQEAQVSCEK 385
+K+ + RKKKK++EE +K ++ K EE++ K + +E+ +++ + E+
Sbjct: 1283 KKEEEERKKKKEEEERKKKKEEEERKKQEEEERKKKKEEEERKKKKEEEERKKKREEEER 1342
Query: 386 PEEDSQAGAIDMTSSERKKKRRKKKKGITDTNAPRENGVDTDSEVATKRDARQSSEAQGS 565
+++ + ERKKK+ ++++ R+ + + K + + + +
Sbjct: 1343 KKQEEEERKKKKEEEERKKKKEEEERKKKKEEEERKKKKEEEERKKKKEEEERKKQEEEE 1402
Query: 566 KLKSTEEKIVESTVVNGCLKPKDDTVDQ---QEGADAKADDTVSETQNPRSKKKKRRKTK 736
+ K EE+ + K K++ Q QE K ++ + + +KKK+ + +
Sbjct: 1403 RKKQEEEERKKKKEEEERKKKKEEEERQKRKQEERQKKKEEEERKKKEEEERKKKKEEEE 1462
Query: 737 TTEVCDALENTLATTMESGSVECVEISAGECKVKTKKTEEKEEENVTDRSRNCRYGG 907
+ + E T E E E E + + KK EE+E + + +R + GG
Sbjct: 1463 RKKKKEEEERTKKEEEERTKKEEEERKKQE-EEERKKKEEEERKKKEEEARKKQEGG 1518
>gi|307201083|gb|EFN81015.1| hypothetical protein EAI_12862 [Harpegnathos
saltator]
Length = 577
Score = 57 bits (136), Expect = 4e-006
Identities = 39/149 (26%), Positives = 70/149 (46%), Gaps = 11/149 (7%)
Frame = +2
Query: 206 EKKSKRRKKKKKKEEAEKTLDDVSEKNTEEDQLHLHKNPLSNIRIQEDSCQQEAQVSCEK 385
+KK K++KKKKKK++ +K +K ++ + K +I+ +E+ ++E + EK
Sbjct: 38 KKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKKHIKKEEEGEKEEKEEEEEK 97
Query: 386 PEEDSQAGAIDMTSSERKKKRRKKKKGITDTNAPRENGVDTDSEV----ATKRDARQSSE 553
EE+ + E +KK+RKKKK + E G + + E K + + E
Sbjct: 98 EEEEKE-------EREEEKKKRKKKKEKEEEEEEEEEGEEKEEEEEEEGEEKEEEEEKEE 150
Query: 554 AQGSKLKSTEEKIVESTVVNGCLKPKDDT 640
+ K K +E+ +ES + P T
Sbjct: 151 EKKKKKKRKKERGLESLSMGHLPLPSSST 179
>gi|158290594|ref|XP_312188.4| AGAP002737-PA [Anopheles gambiae str. PEST]
Length = 6668
Score = 57 bits (135), Expect = 5e-006
Identities = 43/222 (19%), Positives = 102/222 (45%), Gaps = 3/222 (1%)
Frame = +2
Query: 206 EKKSKRRKKKKKKEEAEKTLDDVSEKNTEEDQLHLHKNPLSNIRIQEDSCQQEAQVSCEK 385
+K + KKKK++EE +K ++ ++K E++ K + +E++ +++ + +K
Sbjct: 3808 KKAEEEAKKKKEEEEIKKKEEEEAKKKKAEEEAKKKKEEAKKKKEEEEAKKKKEEEEGKK 3867
Query: 386 PEEDSQAGAIDMTSSERKKKRRKKKKGITDTNAPRENGVDTDSEVATKRDARQSSEAQGS 565
+E+ +A +KKK ++ K + +A ++ + + + A++ + +
Sbjct: 3868 KKEEEEAKKKKAEEDAKKKKAEEEAKKKEEEDAKKKKEEEAAKKEKEEEAAKKKKVEEEA 3927
Query: 566 KLKSTEEKIVESTVVNGCLKPKDDTVDQQEGADAKADDTVSETQNPRSKKKKRRKTKTTE 745
K K EE + K K++ ++ +A+ +E + + K+++ K K E
Sbjct: 3928 KKKKEEEDAKKKQEEEAAKKKKEEEEANKKKEEAEVKKKKAEEEAKKKKEEEDAKKKQDE 3987
Query: 746 VCDALENTLATTMESGSVECVEISAGECKVKTKKTEEKEEEN 871
+A + + E + E A + K + + ++KEEEN
Sbjct: 3988 --EAAKKKMKEE-EQAKKKKEEEEAKKKKAEEEAKKKKEEEN 4026
>gi|333467991|gb|EAA07771.5| AGAP002737-PA [Anopheles gambiae str. PEST]
Length = 6685
Score = 57 bits (135), Expect = 5e-006
Identities = 43/222 (19%), Positives = 102/222 (45%), Gaps = 3/222 (1%)
Frame = +2
Query: 206 EKKSKRRKKKKKKEEAEKTLDDVSEKNTEEDQLHLHKNPLSNIRIQEDSCQQEAQVSCEK 385
+K + KKKK++EE +K ++ ++K E++ K + +E++ +++ + +K
Sbjct: 3905 KKAEEEAKKKKEEEEIKKKEEEEAKKKKAEEEAKKKKEEAKKKKEEEEAKKKKEEEEGKK 3964
Query: 386 PEEDSQAGAIDMTSSERKKKRRKKKKGITDTNAPRENGVDTDSEVATKRDARQSSEAQGS 565
+E+ +A +KKK ++ K + +A ++ + + + A++ + +
Sbjct: 3965 KKEEEEAKKKKAEEDAKKKKAEEEAKKKEEEDAKKKKEEEAAKKEKEEEAAKKKKVEEEA 4024
Query: 566 KLKSTEEKIVESTVVNGCLKPKDDTVDQQEGADAKADDTVSETQNPRSKKKKRRKTKTTE 745
K K EE + K K++ ++ +A+ +E + + K+++ K K E
Sbjct: 4025 KKKKEEEDAKKKQEEEAAKKKKEEEEANKKKEEAEVKKKKAEEEAKKKKEEEDAKKKQDE 4084
Query: 746 VCDALENTLATTMESGSVECVEISAGECKVKTKKTEEKEEEN 871
+A + + E + E A + K + + ++KEEEN
Sbjct: 4085 --EAAKKKMKEE-EQAKKKKEEEEAKKKKAEEEAKKKKEEEN 4123
>gi|291224326|ref|XP_002732156.1| PREDICTED: hypothetical protein, partial
[Saccoglossus kowalevskii]
Length = 1049
Score = 56 bits (133), Expect = 9e-006
Identities = 47/201 (23%), Positives = 88/201 (43%), Gaps = 18/201 (8%)
Frame = +2
Query: 206 EKKSKRRKKKKKKEEAEKTLDDVSEKNTEEDQLHLHKNPLSNIRIQEDSCQQEAQVSCEK 385
E K+++ K++ K + A+K + EKN ED P ++++ +A+V E
Sbjct: 79 EAKARKEKEEAKAQAAQKREEKEQEKNKTED-----SKPKPQTEVKKEDSTPDAEVKGEA 133
Query: 386 PEEDSQAGAIDMTSSERKKKRRKKKKGITDTNAPRENGVDTDSEVATKRDARQSSEAQGS 565
PE D+ A D T ++KKK++K K T G ++ +DA + + +
Sbjct: 134 PEGDADDKADDKTKKDKKKKKKKDKAEEEKTKKKPGKG-----QIKAMQDALKRMKEEEE 188
Query: 566 KLKSTEEKIVESTVVNGCLKPKDDTVDQQE---GADAKADDTVSETQNPRSKKKKRRKTK 736
+LK+ EE +++ L+ + ++Q++ K D + +K++K K
Sbjct: 189 RLKAEEEARIKAEEEAERLREEKLRLEQEKKEREKQKKKDKKARLKAEGKFLTEKQKKDK 248
Query: 737 TTEVCDALENTLATTMESGSV 799
E TLA E G +
Sbjct: 249 A-----RAEATLALLREQGLI 264
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 57,432,053,104
Number of Sequences: 15229318
Number of Extensions: 57432053104
Number of Successful Extensions: 34504562
Number of sequences better than 0.0: 0
|