BLASTX 7.6.2
Query= UN04123 /QuerySize=828
(827 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|297811915|ref|XP_002873841.1| expressed protein [Arabidopsis ... 259 4e-067
gi|15238005|ref|NP_197267.1| glycine/proline-rich protein [Arabi... 258 9e-067
gi|110740633|dbj|BAE98420.1| glycine/proline-rich protein [Arabi... 258 9e-067
gi|224124002|ref|XP_002319217.1| predicted protein [Populus tric... 183 5e-044
gi|225436210|ref|XP_002268577.1| PREDICTED: hypothetical protein... 163 4e-038
gi|290795719|gb|ADD64698.1| glycine and proline rich protein 3 [... 156 5e-036
gi|255626727|gb|ACU13708.1| unknown [Glycine max] 141 1e-031
gi|255565978|ref|XP_002523977.1| Glycine-rich protein A3, putati... 140 4e-031
gi|225427603|ref|XP_002271036.1| PREDICTED: hypothetical protein... 139 5e-031
gi|255631744|gb|ACU16239.1| unknown [Glycine max] 131 1e-028
gi|195642550|gb|ACG40743.1| glycine-rich protein A3 [Zea mays] 110 4e-022
gi|226499166|ref|NP_001152003.1| glycine-rich protein A3 [Zea mays] 110 4e-022
gi|145049767|gb|ABP35530.1| glycine and proline-rich protein [Ip... 102 1e-019
gi|2494026|sp|Q28640.1|HRG_RABIT RecName: Full=Histidine-rich gl... 76 5e-012
gi|48994598|gb|AAT48006.1| rhodopsin [Abraliopsis pacificus] 68 1e-009
>gi|297811915|ref|XP_002873841.1| expressed protein [Arabidopsis lyrata subsp.
lyrata]
Length = 164
Score = 259 bits (661), Expect = 4e-067
Identities = 121/169 (71%), Positives = 129/169 (76%), Gaps = 20/169 (11%)
Frame = +1
Query: 85 MGEDQR---DKGLFHHL---AGGHYRPYGHHGYSNHGHHGYGIPYAYPAPPPP-YGYPPV 243
MG+DQ D+G FHHL AGGHYRP+ HGY +HG HGY PY YP PPPP +GYPPV
Sbjct: 1 MGKDQHNHSDRGFFHHLAGFAGGHYRPHS-HGYGHHG-HGYEAPYPYPPPPPPHHGYPPV 58
Query: 244 AYPPHGGYHPTGYPPTGYPPHGYPSHGH-------HHHGGIGAMIAGGAAMAAAAVGSHH 402
AYPPHGGY P GYPP GYP HGYPSHG+ HHHGGIGA+IAGG A AA A H
Sbjct: 59 AYPPHGGYPPAGYPPAGYPSHGYPSHGYPGPSHSGHHHGGIGAIIAGGVAAAAGAHHMSH 118
Query: 403 HGHYG-HHHGHGYGYGYHKHGKFKHGKFGKRWKHGIFGKHKGKFFKKWK 546
HGHYG HHHGHGYGYGYH HGKFKH GKRWKHG+FGKHKGKFFKKWK
Sbjct: 119 HGHYGHHHHGHGYGYGYHGHGKFKH---GKRWKHGMFGKHKGKFFKKWK 164
>gi|15238005|ref|NP_197267.1| glycine/proline-rich protein [Arabidopsis
thaliana]
Length = 173
Score = 258 bits (658), Expect = 9e-067
Identities = 123/175 (70%), Positives = 131/175 (74%), Gaps = 23/175 (13%)
Frame = +1
Query: 85 MGEDQR---DKGLFHHL---AGGHYRPYGHHGYSNHGHHGYGIPYAYPAPPPPYGYPPVA 246
MG DQ D+G FH+L AGG Y P+G HGY +HG HGYG Y YP PPPP+GYPPVA
Sbjct: 1 MGNDQHNHSDRGFFHNLAGFAGGQYPPHG-HGYGHHG-HGYGSSYPYPPPPPPHGYPPVA 58
Query: 247 YPPHGGYHPTGYPPTGYPP-----HGYPSHGH-------HHHGGIGAMIAGGAAMAAAAV 390
YPPHGGY P GYPP GYPP HGYPSHG+ HHHGGIGA+IAGG A AA A
Sbjct: 59 YPPHGGYPPAGYPPAGYPPAGYPAHGYPSHGYPRPSHSGHHHGGIGAIIAGGVAAAAGAH 118
Query: 391 G-SHHHGHYGHHHGHGYGYGYHKHGKFKHGKF--GKRWKHGIFGKHKGKFFKKWK 546
SHHHGHYGHHHGHGYGYGYH HGKFKHGKF GK KHG+FGKHKGKFFKKWK
Sbjct: 119 HMSHHHGHYGHHHGHGYGYGYHGHGKFKHGKFKHGKFGKHGMFGKHKGKFFKKWK 173
>gi|110740633|dbj|BAE98420.1| glycine/proline-rich protein [Arabidopsis
thaliana]
Length = 173
Score = 258 bits (658), Expect = 9e-067
Identities = 123/175 (70%), Positives = 131/175 (74%), Gaps = 23/175 (13%)
Frame = +1
Query: 85 MGEDQR---DKGLFHHL---AGGHYRPYGHHGYSNHGHHGYGIPYAYPAPPPPYGYPPVA 246
MG DQ D+G FH+L AGG Y P+G HGY +HG HGYG Y YP PPPP+GYPPVA
Sbjct: 1 MGNDQHNYSDRGFFHNLAGFAGGQYPPHG-HGYGHHG-HGYGSSYPYPPPPPPHGYPPVA 58
Query: 247 YPPHGGYHPTGYPPTGYPP-----HGYPSHGH-------HHHGGIGAMIAGGAAMAAAAV 390
YPPHGGY P GYPP GYPP HGYPSHG+ HHHGGIGA+IAGG A AA A
Sbjct: 59 YPPHGGYPPAGYPPAGYPPAGYPAHGYPSHGYPRPSHSGHHHGGIGAIIAGGVAAAAGAH 118
Query: 391 G-SHHHGHYGHHHGHGYGYGYHKHGKFKHGKF--GKRWKHGIFGKHKGKFFKKWK 546
SHHHGHYGHHHGHGYGYGYH HGKFKHGKF GK KHG+FGKHKGKFFKKWK
Sbjct: 119 HMSHHHGHYGHHHGHGYGYGYHGHGKFKHGKFKHGKFGKHGMFGKHKGKFFKKWK 173
>gi|224124002|ref|XP_002319217.1| predicted protein [Populus trichocarpa]
Length = 192
Score = 183 bits (462), Expect = 5e-044
Identities = 102/183 (55%), Positives = 113/183 (61%), Gaps = 31/183 (16%)
Frame = +1
Query: 91 EDQRDKGLFHHL---AGGHY---RPYGHHGYSNHGH--HGYGIPYAYPAP--PPPYGYPP 240
++ DKGLF +L AGGHY PY HGY G+ GY P YP PPP GYPP
Sbjct: 10 KESTDKGLFSNLAGYAGGHYPPSAPYPPHGYPQQGYPPAGYPPPGGYPPSGYPPPGGYPP 69
Query: 241 VAYPPHGGYHPT-GYPPTG-------YPPHGYP----SHGHHHHGGIGAMIAGGAAMAAA 384
YPP GGY P GYPP G YPP GYP SH H G+G M+AGGAA AA
Sbjct: 70 AGYPPPGGYPPPGGYPPPGGYPPPGAYPPAGYPGPSASHYSGHGPGMGTMLAGGAAAAAV 129
Query: 385 AVGSH---HHGHYGHHHG--HGYGYGYHK----HGKFKHGKFGKRWKHGIFGKHKGKFFK 537
A G+H H G +G+ HG HGYG+G K HGKFKHGKFGKRWKHG FGKHKGKFFK
Sbjct: 130 AYGAHQMSHGGSHGYGHGGYHGYGHGKFKHGYGHGKFKHGKFGKRWKHGGFGKHKGKFFK 189
Query: 538 KWK 546
+WK
Sbjct: 190 RWK 192
>gi|225436210|ref|XP_002268577.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 191
Score = 163 bits (411), Expect = 4e-038
Identities = 82/151 (54%), Positives = 93/151 (61%), Gaps = 15/151 (9%)
Frame = +1
Query: 139 YRPYGHHGYSNHGHHGYGIPYAYPAP--PPPYGYPPVAYPPHGGYHPTGYPPT-GYPPHG 309
Y P + + GY P YP PPP GYPP YPP GGY P YPP GYPP G
Sbjct: 41 YPPSAYPPPGGYPPSGYPPPGGYPPSGYPPPGGYPPAGYPPPGGYPPAPYPPPGGYPPSG 100
Query: 310 YPSHG----HHHHG-GIGAMIAGGAAMAAAAVGSHHHGHYGHHHGH----GYGYGYH--- 453
YP H HG GA++AGGAA AAA G+H H HH GH G+G+G+H
Sbjct: 101 YPGPSAPPYHSGHGSNTGALLAGGAAAAAAVYGAHQLSHGAHHLGHGGYYGHGFGHHGKF 160
Query: 454 KHGKFKHGKFGKRWKHGIFGKHKGKFFKKWK 546
KHGKFKHGKFGKRWKHG++GKHKG FFK+WK
Sbjct: 161 KHGKFKHGKFGKRWKHGMYGKHKGGFFKRWK 191
>gi|290795719|gb|ADD64698.1| glycine and proline rich protein 3 [Glycine max]
Length = 174
Score = 156 bits (393), Expect = 5e-036
Identities = 91/177 (51%), Positives = 102/177 (57%), Gaps = 38/177 (21%)
Frame = +1
Query: 94 DQRDKGLFHHLAGGHYRPYGHHGYSN--HGHHGYGIPYAYPAP----------PPPYGYP 237
D+ DKG+F LA HG + HG HGY P AYP P PP +GYP
Sbjct: 10 DESDKGIFSQLA---------HGVAGAAHGGHGYP-PGAYPPPPGAYPPHQGYPPQHGYP 59
Query: 238 PVAYPPHGGYHPTGYPPTGYPPHGYPSHGHHHHGGIGAMIAGGAAMAAAAVGSHH----- 402
P YPPH GY P GYPP GYP + + G H HGG+GAM+ GGAA AAAA G+HH
Sbjct: 60 PAGYPPHQGYPPAGYPPAGYPGSSH-APGSHGHGGMGAMLTGGAAAAAAAYGAHHVSHGS 118
Query: 403 HGHYGHH--HG----HGYGYGYHKHGKF---KHGKFGKRWKHGIFGKHKGKFFKKWK 546
HG YG + HG HG + KHGKF KHGKFGK KHG FGKH G FKKWK
Sbjct: 119 HGSYGQYAAHGAHMPHGKFKQHGKHGKFKHGKHGKFGKHGKHGKFGKHGGG-FKKWK 174
>gi|255626727|gb|ACU13708.1| unknown [Glycine max]
Length = 170
Score = 141 bits (355), Expect = 1e-031
Identities = 83/157 (52%), Positives = 93/157 (59%), Gaps = 13/157 (8%)
Frame = +1
Query: 94 DQRDKGLFHHLAGG-HYRPYGHHGYSNHGHHGYGIPYAYPAPP--PPYGYPPVAYPPHGG 264
D+ DKG+F HLA G +G HGY + P AYP PP GYPP YPPH G
Sbjct: 10 DESDKGIFSHLAHGVAGAAHGGHGYPPGAYP--PPPGAYPPQQGYPPAGYPPAGYPPHQG 67
Query: 265 YHPTGYPPTGYPPHGYPSHGHHHHGGIGAMIAGGAAMAAAAVGSHHHGHYGHHHGHGYGY 444
Y P GYPP GYP + + G H HGG+GAM+AGGAA AAAA G+HH H H Y +
Sbjct: 68 YPPAGYPPAGYPGSSH-APGSHGHGGMGAMLAGGAAAAAAAYGAHHVSHGSHGSYGQYAH 126
Query: 445 GYH-KHGKFK---HGKFGKRWKHGIFGKH--KGKFFK 537
G H HGKFK HGKF K KHG FGKH GKF K
Sbjct: 127 GAHMPHGKFKQHGHGKF-KHGKHGKFGKHGKHGKFGK 162
>gi|255565978|ref|XP_002523977.1| Glycine-rich protein A3, putative [Ricinus
communis]
Length = 177
Score = 140 bits (351), Expect = 4e-031
Identities = 75/159 (47%), Positives = 96/159 (60%), Gaps = 20/159 (12%)
Frame = +1
Query: 109 GLFHHLAGGHYRPYGHHGYSNHGHHGYGIPYAYPAP----PPPYGYPPVAYPPHGGYHPT 276
G +H + G Y P+G++ + G+ P YP+P PP YPP +YPP Y PT
Sbjct: 26 GNSYHSSPGAYPPHGYNSPQKYPPQGFP-PAGYPSPYGYSSPPSAYPP-SYPPQKPYGPT 83
Query: 277 ------GYPPTGYPPHGYPSHGHH--HHGGIGAMIAGGA-AMAAAAVGSHHHGHYGHHHG 429
GYPP YPP GYP HH H G+G M+AGGA AMAAA G+H+ YGH G
Sbjct: 84 GFPSPGGYPPVAYPPAGYPRPSHHSGHGSGMGVMLAGGATAMAAAGYGAHYMS-YGHGQG 142
Query: 430 HGYGYGYHKHGKFKHGKFGKRWKHGIFGKHKGKFFKKWK 546
HG GYG HG+ KHGK+G RWK G++ K++GK+ K+WK
Sbjct: 143 HG-GYG---HGRLKHGKYGNRWKGGMYEKYQGKYLKRWK 177
>gi|225427603|ref|XP_002271036.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 187
Score = 139 bits (350), Expect = 5e-031
Identities = 79/159 (49%), Positives = 87/159 (54%), Gaps = 37/159 (23%)
Frame = +1
Query: 130 GGHYRPYGHHGYSNHGHHGYGIPYAYPAPPPPYGYPPVAYPPHGGYHPTGYPPTGYPPHG 309
GG Y P+G GY HG GY PP G YPP GGY P GYP GYPP
Sbjct: 46 GGGYPPHGGGGYPPHGGGGY----------PPQG----GYPPQGGYPPQGYPQAGYPPGS 91
Query: 310 Y-------PSHGHHHHGGIGAMIAGGAAMAAAAVGSH---HHGHYGHHHGHGYGYGYHKH 459
Y PS H HGG+G M+AGGAA AAAA G+H H GH GH+ GHG+ G H H
Sbjct: 92 YPPAAYPGPSAPHSGHGGMGTMLAGGAAAAAAAYGAHQLGHGGHGGHNVGHGFYGGSHGH 151
Query: 460 GKF----------KHGKFGKRWKHGIFGKHKGKFFKKWK 546
GKF KHGKFGK +HG+FG G FKKWK
Sbjct: 152 GKFKHHGGKFKHGKHGKFGKHGQHGMFG---GGKFKKWK 187
>gi|255631744|gb|ACU16239.1| unknown [Glycine max]
Length = 183
Score = 131 bits (329), Expect = 1e-028
Identities = 69/131 (52%), Positives = 76/131 (58%), Gaps = 6/131 (4%)
Frame = +1
Query: 139 YRPYGHHGYSNHGHHGYGIPYAYPAPPPPYGYPPVAYPPHGGYHPTGYP-PTGYPPHGYP 315
Y P + + H GY PA PPP GYPP Y PH GYHP YP P GYPP P
Sbjct: 51 YPPPAYPPPGVYPHSGYYPSEYPPAYPPPGGYPPTTY-PHSGYHPPAYPAPHGYPP-AAP 108
Query: 316 SHGHHHHGGIGAMIAGGAAMAAAAVGSHHHGHYGHHHGHGYGYGYHKHGKFKHGKFGKRW 495
+ G+G ++AGG A AAAA G+HH H H GHG YH HGKFKHGKFGKRW
Sbjct: 109 PYPAGRGAGMGGLLAGGVAAAAAAYGAHHMAHGYHRFGHG---AYHGHGKFKHGKFGKRW 165
Query: 496 KHGIFGKHKGK 528
KHG FG K K
Sbjct: 166 KHGRFGFGKYK 176
Score = 65 bits (156), Expect = 1e-008
Identities = 29/48 (60%), Positives = 32/48 (66%), Gaps = 2/48 (4%)
Frame = +1
Query: 190 GIPYAYPAPPPPYGYPPVAYPPHGGYHPTGYPPTGYPPHG-YPSHGHH 330
G P A P PPP+GYPP YPP GGY PT YPP YPP G YP G++
Sbjct: 22 GYPSA-PPYPPPHGYPPSGYPPPGGYPPTAYPPPAYPPPGVYPHSGYY 68
>gi|195642550|gb|ACG40743.1| glycine-rich protein A3 [Zea mays]
Length = 188
Score = 110 bits (273), Expect = 4e-022
Identities = 69/134 (51%), Positives = 73/134 (54%), Gaps = 20/134 (14%)
Frame = +1
Query: 184 GYGIPYAYPAP---PPPYGYPPV----AYPPHGGYHPTGYPPTGYPP---HGYPSHGHHH 333
GY P YP P PP +G P AYPP G H YP GYP HG G H
Sbjct: 60 GYPPPGGYPQPGGYPPSHGAYPAPGAGAYPPSGYPHQPVYPQPGYPSMPGHGGMYGGGHG 119
Query: 334 HGGI---GAMIAGGAAMA-AAAVGSHHHGHYGHHHGHGYGYGYHKHGKFKHGKFGKRWKH 501
GG GAM+AGGAA A A SH HG YGH HGH G KHGKFKHGKFGK K
Sbjct: 120 AGGSAGHGAMLAGGAAAAYGAHTVSHSHGMYGHGHGH----GKFKHGKFKHGKFGKHKK- 174
Query: 502 GIFGKHKGKFFKKW 543
+FGKHK F +KW
Sbjct: 175 -MFGKHKNMFGRKW 187
>gi|226499166|ref|NP_001152003.1| glycine-rich protein A3 [Zea mays]
Length = 188
Score = 110 bits (273), Expect = 4e-022
Identities = 69/134 (51%), Positives = 73/134 (54%), Gaps = 20/134 (14%)
Frame = +1
Query: 184 GYGIPYAYPAP---PPPYGYPPV----AYPPHGGYHPTGYPPTGYPP---HGYPSHGHHH 333
GY P YP P PP +G P AYPP G H YP GYP HG G H
Sbjct: 60 GYPPPGGYPQPGGYPPSHGAYPAPGAGAYPPSGYPHQPVYPQPGYPSMPGHGGMYGGGHG 119
Query: 334 HGGI---GAMIAGGAAMA-AAAVGSHHHGHYGHHHGHGYGYGYHKHGKFKHGKFGKRWKH 501
GG GAM+AGGAA A A SH HG YGH HGH G KHGKFKHGKFGK K
Sbjct: 120 AGGSAGHGAMLAGGAAAAYGAHTVSHSHGMYGHGHGH----GKFKHGKFKHGKFGKHKK- 174
Query: 502 GIFGKHKGKFFKKW 543
+FGKHK F +KW
Sbjct: 175 -MFGKHKNMFGRKW 187
>gi|145049767|gb|ABP35530.1| glycine and proline-rich protein [Ipomoea batatas]
Length = 185
Score = 102 bits (252), Expect = 1e-019
Identities = 54/109 (49%), Positives = 61/109 (55%), Gaps = 12/109 (11%)
Frame = +1
Query: 94 DQRDKGLFHHLAGGHYRPYGHHGYSNHGHHGYGIPYAYPAPP-----PPYGYPPVAYPPH 258
D+ DKGLF HL GH + H GY G+ G P PP PP YPP YPP
Sbjct: 9 DESDKGLFSHL--GH---HAHQGYPPQGYPPQGYPPQQGYPPAGAGYPPQAYPPSGYPPQ 63
Query: 259 GGYHPTGYPPTGYPPHGYPSHGHH--HHGGIGAMIAGGAAMAAAAVGSH 399
GY P YPP GYP +PS HH H G+GAM+AGGAA AA A G+H
Sbjct: 64 QGYPPQAYPPAGYPGQPHPSASHHSGHGPGMGAMLAGGAAAAAVAYGAH 112
>gi|2494026|sp|Q28640.1|HRG_RABIT RecName: Full=Histidine-rich glycoprotein;
AltName: Full=Histidine-proline-rich glycoprotein; Short=HPRG; Flags:
Precursor
Length = 526
Score = 76 bits (186), Expect = 5e-012
Identities = 39/77 (50%), Positives = 47/77 (61%), Gaps = 12/77 (15%)
Frame = +1
Query: 145 PYGH--HGYSNHGHHGYG-IPYAYP--APP---PPYGYPPVAYPPHG----GYHPTGYPP 288
P+GH HG HGHH +G P+ +P PP PP+G PP +PPHG G+ P G PP
Sbjct: 330 PHGHHPHGPPPHGHHPHGPPPHGHPPHGPPPRHPPHGPPPHGHPPHGPPPHGHPPHGPPP 389
Query: 289 TGYPPHGYPSHGHHHHG 339
G+PPHG P HGH HG
Sbjct: 390 HGHPPHGPPPHGHPPHG 406
>gi|48994598|gb|AAT48006.1| rhodopsin [Abraliopsis pacificus]
Length = 299
Score = 68 bits (165), Expect = 1e-009
Identities = 27/39 (69%)
Frame = +1
Query: 208 PAPPPPYGYPPVAYPPHGGYHPTGYPPTGYPPHGYPSHG 324
PA PPP GYPP YPP GY P GYPP GYPP GYP G
Sbjct: 249 PAYPPPQGYPPQGYPPPQGYPPQGYPPQGYPPQGYPPQG 287
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 475,288,458,330
Number of Sequences: 15229318
Number of Extensions: 475288458330
Number of Successful Extensions: 120006622
Number of sequences better than 0.0: 0
|