BLASTX 7.6.2
Query= UN19329 /QuerySize=1157
(1156 letters)
Database: UniProt/SwissProt;
518,415 sequences; 182,829,261 total letters
Score E
Sequences producing significant alignments: (bits) Value
sp|O53553|PG54_MYCTU Uncharacterized PE-PGRS family protein PE_P... 72 5e-012
sp|Q9NL38|MA66_PINMA N66 matrix protein OS=Pinctada maxima PE=1 ... 65 8e-010
sp|P48810|RB87F_DROME Heterogeneous nuclear ribonucleoprotein 87... 65 8e-010
sp|P10495|GRP1_PHAVU Glycine-rich cell wall structural protein 1... 64 1e-009
sp|P07909|ROA1_DROME Heterogeneous nuclear ribonucleoprotein A1 ... 61 2e-008
sp|Q9UVI4|THYD_CLAFS Trihydrophobin OS=Claviceps fusiformis GN=T... 60 3e-008
sp|Q9P6U9|DED1_NEUCR ATP-dependent RNA helicase ded-1 OS=Neurosp... 59 6e-008
sp|P10496|GRP2_PHAVU Glycine-rich cell wall structural protein 1... 57 3e-007
sp|O18740|K1C9_CANFA Keratin, type I cytoskeletal 9 OS=Canis fam... 56 4e-007
sp|Q24573|DRI_DROME Protein dead ringer OS=Drosophila melanogast... 55 7e-007
sp|A3CG83|GRP1_ORYSJ Putative glycine-rich cell wall structural ... 55 1e-006
sp|Q6IFZ6|K2C1B_MOUSE Keratin, type II cytoskeletal 1b OS=Mus mu... 55 1e-006
sp|A2ZJC9|GRP1_ORYSI Putative glycine-rich cell wall structural ... 54 2e-006
sp|P08673|CSP_PLACC Circumsporozoite protein OS=Plasmodium cynom... 53 3e-006
sp|Q6EIZ0|K1C10_CANFA Keratin, type I cytoskeletal 10 OS=Canis f... 52 7e-006
>sp|O53553|PG54_MYCTU Uncharacterized PE-PGRS family protein PE_PGRS54
OS=Mycobacterium tuberculosis GN=PE_PGRS54 PE=3 SV=1
Length = 1901
Score = 72 bits (176), Expect = 5e-012
Identities = 56/185 (30%), Positives = 69/185 (37%), Gaps = 3/185 (1%)
Frame = +3
Query: 6 GYEGGNGGGGGGSSSFFSSVTRNLWGNNGGLNYNNNNGANSNSNTYMGGTASGNNALSGP 185
G GG GG GG + S+ G G G NS GN G
Sbjct: 1719 GGTGGAGGAGGAGADGDPSIDGGQGGAGGHGGQGGKGGLNSTGLASAASGDGGNGGAGGA 1778
Query: 186 FGNWGAAPGGGGGGNNGVGNENLKFGYGGNGESGFGLGTRNIGPSKAAPSSSFSSASGTN 365
GN GA GGGGG G G GGNG +G GT ++ + +G N
Sbjct: 1779 GGNGGAGGLGGGGGTGGTNGNGGLGGGGGNGGAGGAGGTPTGSGTEGTGGDGGDAGAGGN 1838
Query: 366 NTGYDGAGLAEFYGNGAVYSDPTWRSSAPETEGPGSFSYGIGGGGVGPSSD-VSARSSSP 542
G G G+G D + AP G G+ + G+GG G G +D SP
Sbjct: 1839 GGSATGVGNGGNGGDGGNGGD--GGNGAPGGFGGGAGAGGLGGSGAGGGTDGDDGNGGSP 1896
Query: 543 GYVGS 557
G GS
Sbjct: 1897 GTDGS 1901
Score = 56 bits (134), Expect = 4e-007
Identities = 54/183 (29%), Positives = 70/183 (38%), Gaps = 10/183 (5%)
Frame = +3
Query: 15 GGNGGGGGGSSSFFSSVTRNLWGNNGGLNYNNNNGANSNSNTYMGGTASGNNALSGPFGN 194
GGNGG GG S + L G +GG G ++ + G +G N G G
Sbjct: 995 GGNGGRGGDGGDGASGLGLGLSGFDGGQGGQGGAGGSAGAGGINGAGGAGGNGGDGGDGA 1054
Query: 195 WGAAPGGGGGGNNGVGNENLKFGYGGN-GESGFGLGTRNIGPSKAAPSSSFSSASGTNNT 371
GAA G G N GVG + G GN G +G GL T G AA + A G
Sbjct: 1055 TGAA---GLGDNGGVGGDGGAGGAAGNGGNAGVGL-TAKAGDGGAAGNGGNGGAGGAGGA 1110
Query: 372 G---YDG--AGLAEFYGNGAVYSDPTWRSSAPETEGPGSFSYGIGGGGVGPSSDVSARSS 536
G ++G G G G + T +A G + G GG G + V
Sbjct: 1111 GDNNFNGGQGGAGGQGGQGGLGGASTTSINANGGAGGNGGTGGKGGAGGAGTLGVGGSGG 1170
Query: 537 SPG 545
+ G
Sbjct: 1171 TGG 1173
>sp|Q9NL38|MA66_PINMA N66 matrix protein OS=Pinctada maxima PE=1 SV=1
Length = 568
Score = 65 bits (157), Expect = 8e-010
Identities = 46/136 (33%), Positives = 61/136 (44%), Gaps = 5/136 (3%)
Frame = +3
Query: 12 EGGNGGGGGGSSSFFSSVTRNLWGNNGGLNYNNNNGANSNSNTYMGGTASGNNALSGPFG 191
+ GN G G G++ + + N N G N+NNG ++N N GG + N +G G
Sbjct: 278 DNGNNGNGNGNNGYNGNNGYNGNNGNNGNGNNDNNGNDNNGNN--GGNGNNGNNGNGNNG 335
Query: 192 NWGAAPGG--GGGGNNGVGNENLKFGYGGNGESGFGLGTRNIGPSKAAPSSSFSSASGTN 365
N G G GG GNNG N N G GNG +G G N G + + + + S N
Sbjct: 336 NNGNGNNGNNGGNGNNG-NNGNSNNGNNGNGNNGNNGGNGNNGNNGNGNNENNGNGSNGN 394
Query: 366 NTGYDGAGLAEFYGNG 413
N G G GNG
Sbjct: 395 NGGNGNNGNNGDNGNG 410
>sp|P48810|RB87F_DROME Heterogeneous nuclear ribonucleoprotein 87F OS=Drosophila
melanogaster GN=Hrb87F PE=1 SV=2
Length = 385
Score = 65 bits (157), Expect = 8e-010
Identities = 50/136 (36%), Positives = 59/136 (43%), Gaps = 22/136 (16%)
Frame = +3
Query: 3 GGYEGGNGGGGGGSSSFFSSVTRNLWGNNGGLNYNNNNGANSNSNTYMGGTASGNNALSG 182
G + GG GGG GG N G +GG +NN G N N GG G N+
Sbjct: 252 GNFGGGQGGGSGG---------WNQQGGSGGGPWNNQGGGNGGWNGGGGGGYGGGNS--- 299
Query: 183 PFGNWGAAPGGGGGGNNGVGNENLKFGYGGNGESGFGLGTRNIGPSKAAPSSSFSSASGT 362
G+WG GGGGGG G GNE + YGG + N G ++ AP S G
Sbjct: 300 -NGSWG-GNGGGGGGGGGFGNE-YQQSYGGGPQR-----NSNFGNNRPAPYSQGGGGGGF 351
Query: 363 NNTGYDGAGLAEFYGN 410
N G G G F GN
Sbjct: 352 NK-GNQGGGQG-FAGN 365
>sp|P10495|GRP1_PHAVU Glycine-rich cell wall structural protein 1.0 OS=Phaseolus
vulgaris PE=2 SV=1
Length = 252
Score = 64 bits (155), Expect = 1e-009
Identities = 45/138 (32%), Positives = 51/138 (36%), Gaps = 12/138 (8%)
Frame = +3
Query: 3 GGYEGGNGGGGGGSSSFFSSVTRNLWGNNGGLNYNNNNGANSNSNTYMGGTASGNNALSG 182
GG GG G GGG +G GG GA + Y GG SG
Sbjct: 104 GGGYGGGAGKGGGEGYGGGGANGGGYGGGGGSGGGGGGGAGGAGSGYGGGEGSGAG---- 159
Query: 183 PFGNWGAAPGGGGGGNNGVGNENLKFGYGGNGESGFGLGTRNIGPSKAAPSSSFSSASGT 362
G +G A GGGGGGN G G G G G G G G + A A+G
Sbjct: 160 --GGYGGANGGGGGGNGGGG------GGGSGGAHGGGAAGGGEGAGQGAGGGYGGGAAGG 211
Query: 363 NNTGYDGAGLAEFYGNGA 416
G G G + G GA
Sbjct: 212 GGRGSGGGGGGGYGGGGA 229
>sp|P07909|ROA1_DROME Heterogeneous nuclear ribonucleoprotein A1 OS=Drosophila
melanogaster GN=Hrb98DE PE=1 SV=1
Length = 365
Score = 61 bits (146), Expect = 2e-008
Identities = 40/105 (38%), Positives = 45/105 (42%), Gaps = 16/105 (15%)
Frame = +3
Query: 3 GGYEGGNGGGGGGSSSFFSSVTRNLWGNNGGLNYNNNNGA---NSNSNTYMGGTASGNNA 173
GG GG GG GG+ GN GG NY N NG N+ N + +N
Sbjct: 208 GGGRGGPGGRAGGNR-----------GNMGGGNYGNQNGGGNWNNGGNNWGNNRGGNDNW 256
Query: 174 LSGPFGNWGAAPGGGGGGNNGVGNENLKFGYGGNGESGFGLGTRN 308
+ FG G GG GGGNN GN N GNG FG G N
Sbjct: 257 GNNSFGGGGGGGGGYGGGNNSWGNNNP--WDNGNGGGNFGGGGNN 299
>sp|Q9UVI4|THYD_CLAFS Trihydrophobin OS=Claviceps fusiformis GN=TH1 PE=1 SV=1
Length = 394
Score = 60 bits (144), Expect = 3e-008
Identities = 36/79 (45%), Positives = 40/79 (50%), Gaps = 9/79 (11%)
Frame = +3
Query: 3 GGYEGGNGGGGGGSSSFFSSVTRNLWGNNGGLNYNNNNGANSNSNTYMGGTASGNNALSG 182
GG GGNGG GG++ + N GNNGG N NN G N +N GG GNN
Sbjct: 130 GGNNGGNGGNNGGNTDYPGGNGGNNGGNNGGNNGGNNGGNNGGNN---GGNNGGNNG--- 183
Query: 183 PFGNWGAAPGG-GGGGNNG 236
GN G GG GG G NG
Sbjct: 184 --GNNGGNNGGNGGNGGNG 200
>sp|Q9P6U9|DED1_NEUCR ATP-dependent RNA helicase ded-1 OS=Neurospora crassa
GN=ded-1 PE=3 SV=1
Length = 688
Score = 59 bits (141), Expect = 6e-008
Identities = 34/95 (35%), Positives = 41/95 (43%), Gaps = 4/95 (4%)
Frame = +3
Query: 3 GGYEGGNGGGGGGSSSFFSSVTRNLWGNNGGLNYNNNNGANSNSNTYMGGTASGNNALSG 182
GG GG GGG GG + +G +GG + G S Y GG G +G
Sbjct: 593 GGGRGGRGGGRGGGRGRTQTADYRKFGGSGGGGFGGGFGGAPASGGYGGGGYGGGGPPAG 652
Query: 183 PFGNWGAAPGGGGGGNNGVGNENLKFGYGGNGESG 287
+G G A GGGGG G G GYG G +G
Sbjct: 653 GYGGGGGAGYGGGGGGGGYGGG----GYGNPGGAG 683
>sp|P10496|GRP2_PHAVU Glycine-rich cell wall structural protein 1.8 OS=Phaseolus
vulgaris PE=2 SV=1
Length = 465
Score = 57 bits (135), Expect = 3e-007
Identities = 47/174 (27%), Positives = 59/174 (33%), Gaps = 11/174 (6%)
Frame = +3
Query: 3 GGYEGGNGGGGGGSSSFFSSVTRNLWGNNG--GLNYNNNNGANSNSNTYMGGTASGNNAL 176
GG G GG GG +G G G+ Y G+ + GG + A
Sbjct: 106 GGVAYGGGGERGGYGGGQGGGAGGGYGAGGEHGIGYGGGGGSGAGG----GGGYNAGGAQ 161
Query: 177 SGPFGNWGAAPGGGGGGNNGVGNENLKFGYGGNGESGFGLGTRNIGPSKAAPSSSFSSAS 356
G +G G A GGGGGG + G G GG G+G G + G
Sbjct: 162 GGGYGTGGGAGGGGGGGGDHGGGYGGGQGAGGGAGGGYGGGGEHGGGGGGGQGGGAGGGY 221
Query: 357 GTNNTGYDGAGLAEFYGNGAVYSDPTWRSSAPETEGPGSFSYGIGGGGVGPSSD 518
G GAG + G G Y + G G G GGG G +
Sbjct: 222 GAGGEHGGGAGGGQGGGAGGGYG-----AGGEHGGGAGGGQGGGAGGGYGAGGE 270
>sp|O18740|K1C9_CANFA Keratin, type I cytoskeletal 9 OS=Canis familiaris GN=KRT9
PE=3 SV=1
Length = 786
Score = 56 bits (134), Expect = 4e-007
Identities = 44/149 (29%), Positives = 63/149 (42%), Gaps = 16/149 (10%)
Frame = +3
Query: 3 GGYEGGNGGGGGGSSSFFSSVTRNLWGNNGGLNYNNNNGANSNSN-----TYMGGTASGN 167
GG GG G GGGSS + + + G+ G +++G S +Y GG++SG
Sbjct: 609 GGSYGGGSGSGGGSSCSYGGGSSSGGGSGGSYGGGSSSGGGSGGKGGSGCSYSGGSSSGG 668
Query: 168 NALSGPFGNWGAAPGGGGGGNNGVGNENLKFGYGGNGESGFGLG----TRNIGPSKAAPS 335
+ G +G G++ G G GG G YGG SG G G + ++ S
Sbjct: 669 GS-GGSYGG-GSSSGRGSGGRGGSAG-----SYGGGSGSGGGRGGGCEEGSGSGGRSGGS 721
Query: 336 SSFSSASGTNNTGYDGAGLAEFYGNGAVY 422
S SG + G G G G+G Y
Sbjct: 722 YGGGSGSGGRSGGSYGGGSGSGGGSGGSY 750
>sp|Q24573|DRI_DROME Protein dead ringer OS=Drosophila melanogaster GN=retn PE=1
SV=2
Length = 911
Score = 55 bits (132), Expect = 7e-007
Identities = 41/118 (34%), Positives = 57/118 (48%), Gaps = 14/118 (11%)
Frame = +3
Query: 42 SSSFFSSVTRNLWGNNGGLNYNN----NNGANSNSNTYMGGTASGNNALSGPFGNWGAAP 209
+S F VT + G NG +YN N +NSN+ T GGTA GP G G+
Sbjct: 166 NSVAFGHVTSSPSGGNGS-SYNGGTTPTNSSNSNATTNGGGTA-------GPGGTGGSGG 217
Query: 210 GGGGGGNNGVGNENLKFGYGGNGESGFGLGTRNIGPSKAAPSSSFSSASGTN--NTGY 377
GGGGGG G G +F + + G R+ + A+ SS+ S AS ++ N G+
Sbjct: 218 GGGGGGGGGGGVGGHQFSFASPTAAPSGKEARHFAANSASNSSTSSEASNSSQQNNGW 275
>sp|A3CG83|GRP1_ORYSJ Putative glycine-rich cell wall structural protein 1
OS=Oryza sativa subsp. japonica GN=GRP-1 PE=4 SV=1
Length = 166
Score = 55 bits (130), Expect = 1e-006
Identities = 42/132 (31%), Positives = 48/132 (36%), Gaps = 17/132 (12%)
Frame = +3
Query: 3 GGYEGGNGGGGGGSSSFFSSVTRNLWGNNGGLNYNNNNGANSNSNTYMGGTASGNNALSG 182
GG GG GGGGGG+ S G+ G YN G + GG SG G
Sbjct: 36 GGGGGGGGGGGGGNGS----------GSGSGYGYNYGKGGGQSG----GGQGSGGGGGGG 81
Query: 183 PFGNWGAAPGGGGGGNNGVGN---ENLKFGYGGNGESGFGLGTRNIGPSKAAPSSSFSSA 353
G+ G+ G G G G GN + G GG G G G G G
Sbjct: 82 GGGSNGSGSGSGYGYGYGQGNGGAQGQGSGGGGGGGGGGGGGGSGQGSGSGYGYGYGKGG 141
Query: 354 SGTNNTGYDGAG 389
G G DG G
Sbjct: 142 GGGGGGGGDGGG 153
>sp|Q6IFZ6|K2C1B_MOUSE Keratin, type II cytoskeletal 1b OS=Mus musculus GN=Krt77
PE=1 SV=1
Length = 572
Score = 55 bits (130), Expect = 1e-006
Identities = 37/109 (33%), Positives = 47/109 (43%), Gaps = 4/109 (3%)
Frame = +3
Query: 21 NGGGGGGSSSFFSSVTRNLWGNNGG----LNYNNNNGANSNSNTYMGGTASGNNALSGPF 188
+G GGG + S +R G+ GG + + G+ S +G +ASG GP
Sbjct: 25 SGFGGGRQALVSVSQSRRYGGDYGGGFSSRSLYSLGGSKSIFGNLVGRSASGFCQSRGPG 84
Query: 189 GNWGAAPGGGGGGNNGVGNENLKFGYGGNGESGFGLGTRNIGPSKAAPS 335
G +G GGG GG G G GYGG G G G G G PS
Sbjct: 85 GGFGGGIGGGIGGGRGFGGGGFGGGYGGGGRFGGGFGGAGFGFGGFGPS 133
>sp|A2ZJC9|GRP1_ORYSI Putative glycine-rich cell wall structural protein 1
OS=Oryza sativa subsp. indica GN=GRP-1 PE=4 SV=1
Length = 165
Score = 54 bits (128), Expect = 2e-006
Identities = 34/100 (34%), Positives = 42/100 (42%)
Frame = +3
Query: 3 GGYEGGNGGGGGGSSSFFSSVTRNLWGNNGGLNYNNNNGANSNSNTYMGGTASGNNALSG 182
GG GG GGGGGG S + + +G G + G + GG GN + SG
Sbjct: 32 GGSGGGGGGGGGGGGGGNGSGSGSGYGYGYGKAGGQSGGGQGSGGGGGGGGGGGNGSGSG 91
Query: 183 PFGNWGAAPGGGGGGNNGVGNENLKFGYGGNGESGFGLGT 302
+G G GG G G G GG G SG G G+
Sbjct: 92 SGYGYGYGQGNGGAQGQGSGGGGGGGGGGGGGGSGQGSGS 131
Score = 52 bits (124), Expect = 6e-006
Identities = 34/96 (35%), Positives = 40/96 (41%), Gaps = 8/96 (8%)
Frame = +3
Query: 3 GGYEGGNGGGGGGSSSFFSSVTRNLWGNNGGLNYNNNNGANSNSNTYMGGTASGNNALSG 182
G GG GGGGGG+ S S +G G +G GG SG + SG
Sbjct: 73 GSGGGGGGGGGGGNGSGSGSGYGYGYGQGNGGAQGQGSGGGGGGGGGGGGGGSGQGSGSG 132
Query: 183 PFGNWGAAPGGGGGGNNGVGNENLKFGYGGNGESGF 290
+G GGGGGG G G GG G SG+
Sbjct: 133 YGYGYGKGGGGGGGGGGG--------GGGGGGGSGY 160
>sp|P08673|CSP_PLACC Circumsporozoite protein OS=Plasmodium cynomolgi (strain
Ceylon) PE=3 SV=1
Length = 398
Score = 53 bits (126), Expect = 3e-006
Identities = 36/129 (27%), Positives = 47/129 (36%), Gaps = 4/129 (3%)
Frame = +3
Query: 3 GGYEGGNGGGGGGSSSFFSSVTRNLWGNNGGLNYNNNNGANSNSNTYMGGTASGNNALSG 182
GG G N GG ++ + GNN NN A + G A+GNNA +G
Sbjct: 166 GGEAGNNAAGGAAGNNAAAGEA----GNNAAGGEAGNNAAAGEAGNNAAGGAAGNNAAAG 221
Query: 183 PFGNWGAAPGGGGGGNNGVGNENLKFGYGGNGESGFGLGTRNIGPSKAAPSSSFSSASGT 362
GN AA G G N G G G +G G + A + + +
Sbjct: 222 EAGNNAAAGAAGNNAAAGAAGNNAAAGEAGAGGAGRAGNNAAAGEAGAGGAGRAGNNAAA 281
Query: 363 NNTGYDGAG 389
G GAG
Sbjct: 282 GEAGAGGAG 290
>sp|Q6EIZ0|K1C10_CANFA Keratin, type I cytoskeletal 10 OS=Canis familiaris
GN=KRT10 PE=2 SV=1
Length = 568
Score = 52 bits (123), Expect = 7e-006
Identities = 36/125 (28%), Positives = 55/125 (44%), Gaps = 10/125 (8%)
Frame = +3
Query: 81 GNNGGLNYNNNNGANSNSNTYMGGTASGNNALSGPFGNWGAAPGGGG--GGNNGVGNENL 254
G++GG Y G S+ Y G + G SG G G + GGGG GG++G
Sbjct: 447 GSSGGGGYGGGRGGGSSGGGYGGSSGGGYGGSSGGGGYGGGSSGGGGHIGGHSG------ 500
Query: 255 KFGYGGNGESGFGLGTRNIGPSKAAPSSSFSSASGTNNTGYDGAGLAEFYGNGAVYSDPT 434
G+ G+ G+G G+ + G SS + G ++ G G G + G+ + S
Sbjct: 501 --GHSGSSGGGYGGGSSSGGGGYGGGSSGGGGSHGGSSGGGYGGGSSSSGGHKSSSSGSV 558
Query: 435 WRSSA 449
SS+
Sbjct: 559 GESSS 563
Database: UniProt/SwissProt
Posted date: Sat Aug 07 14:36:18 2010
Number of letters in database: 182,829,261
Number of sequences in database: 518,415
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 78,908,948,754
Number of Sequences: 518415
Number of Extensions: 78908948754
Number of Successful Extensions: 532487479
Number of sequences better than 0.0: 0
|