BLASTX 7.6.2
Query= UN02836 /QuerySize=1435
(1434 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|18416364|ref|NP_567704.1| hydroxyproline-rich glycoprotein fa... 411 1e-112
gi|15028121|gb|AAK76684.1| unknown protein [Arabidopsis thaliana] 409 6e-112
gi|297803660|ref|XP_002869714.1| hydroxyproline-rich glycoprotei... 406 8e-111
gi|30686552|ref|NP_849436.1| hydroxyproline-rich glycoprotein fa... 363 5e-098
gi|255545984|ref|XP_002514052.1| conserved hypothetical protein ... 129 2e-027
gi|224063391|ref|XP_002301125.1| predicted protein [Populus tric... 118 3e-024
gi|46095228|gb|AAS80151.1| ACT11D09.5 [Cucumis melo] 110 7e-022
gi|255545986|ref|XP_002514053.1| conserved hypothetical protein ... 92 2e-016
gi|224081921|ref|XP_002306529.1| predicted protein [Populus tric... 79 2e-012
>gi|18416364|ref|NP_567704.1| hydroxyproline-rich glycoprotein family protein
[Arabidopsis thaliana]
Length = 319
Score = 411 bits (1056), Expect = 1e-112
Identities = 204/272 (75%), Positives = 223/272 (81%), Gaps = 18/272 (6%)
Frame = -2
Query: 1127 FYTDPMAAYSSFKRNKSPKQQYISSPSHQMSPPV-PQFPPSV-PGSMGNDYQVHPNHGGF 954
+YTDPMAAYSSFK+NK+PKQQYISSPSHQ S PV PQFPPSV PGS+ ++YQ NHGGF
Sbjct: 61 YYTDPMAAYSSFKKNKTPKQQYISSPSHQGSSPVPPQFPPSVPPGSLCSEYQAQTNHGGF 120
Query: 953 QEAHYGGDNQHTQPRGMA---PSYRGPPAPWNNNFRPPPPVNHLGPPQWVPRPYPFIQGN 783
AHY +PRGMA PS+RGPPA WNNNFR PPPVNH GPPQWVPRP+PF Q
Sbjct: 121 HAAHY-------EPRGMAHLSPSHRGPPAGWNNNFR-PPPVNHSGPPQWVPRPFPFSQEM 172
Query: 782 HDMGNNRYGGRGPRVGGYNNNPPQFPHYGRQNSNWAGNTYPNSGRGRGGGGRGMNTSFGR 603
+MGNNR+GGR G YNN PPQF +YGRQN+NW GNTYPNSGRGR GRGMNTSFGR
Sbjct: 173 PNMGNNRFGGR----GSYNNTPPQFSNYGRQNANWGGNTYPNSGRGR-SRGRGMNTSFGR 227
Query: 602 GGGRRPMEQGAERFYSNSMAEDPWKHLKPVLWKSFSDASSSNSTGQTWRPSSIAPKKPMI 423
GGRRPME GAERFYSNSMAEDPWKHLKPVLWK+ SDASSS+STGQ W P SIAPKK +
Sbjct: 228 DGGRRPMEPGAERFYSNSMAEDPWKHLKPVLWKNCSDASSSSSTGQAWLPKSIAPKKSVT 287
Query: 422 SEASHKPSNNQQSLAEYLAASLDEATCDDPSN 327
SEA+HK S+NQQSLAEYLAASLD ATCD+ SN
Sbjct: 288 SEATHKTSSNQQSLAEYLAASLDGATCDESSN 319
Score = 82 bits (200), Expect = 3e-013
Identities = 40/61 (65%), Positives = 50/61 (81%), Gaps = 1/61 (1%)
Frame = -3
Query: 1306 EDSEKRKEMLKAMRMEAAAAASQNDVSTELETSMNTSHLSNPLADASTHQQESYDKPRFD 1127
EDSEKRK+MLKAMRME AAA + +D +T ETSM+T HLSNPLA+ S HQQ+S++ RFD
Sbjct: 2 EDSEKRKQMLKAMRME-AAAQNDDDATTGTETSMSTGHLSNPLAETSNHQQDSFETQRFD 60
Query: 1126 F 1124
+
Sbjct: 61 Y 61
>gi|15028121|gb|AAK76684.1| unknown protein [Arabidopsis thaliana]
Length = 319
Score = 409 bits (1051), Expect = 6e-112
Identities = 203/272 (74%), Positives = 223/272 (81%), Gaps = 18/272 (6%)
Frame = -2
Query: 1127 FYTDPMAAYSSFKRNKSPKQQYISSPSHQMSPPV-PQFPPSV-PGSMGNDYQVHPNHGGF 954
+YTDPMAAYSSFK+NK+PKQQYISSPSHQ S PV PQFPPSV PGS+ ++YQ NHGGF
Sbjct: 61 YYTDPMAAYSSFKKNKTPKQQYISSPSHQGSSPVPPQFPPSVPPGSLCSEYQAQTNHGGF 120
Query: 953 QEAHYGGDNQHTQPRGMA---PSYRGPPAPWNNNFRPPPPVNHLGPPQWVPRPYPFIQGN 783
AHY +PRGMA PS+RGPPA WNNNFR PPPVNH GPPQWVPRP+PF Q
Sbjct: 121 HAAHY-------EPRGMAHLSPSHRGPPAGWNNNFR-PPPVNHSGPPQWVPRPFPFSQEM 172
Query: 782 HDMGNNRYGGRGPRVGGYNNNPPQFPHYGRQNSNWAGNTYPNSGRGRGGGGRGMNTSFGR 603
+MGNNR+GGR G YNN PPQF +YGRQN+NW GNT+PNSGRGR GRGMNTSFGR
Sbjct: 173 PNMGNNRFGGR----GSYNNTPPQFSNYGRQNANWGGNTHPNSGRGR-SRGRGMNTSFGR 227
Query: 602 GGGRRPMEQGAERFYSNSMAEDPWKHLKPVLWKSFSDASSSNSTGQTWRPSSIAPKKPMI 423
GGRRPME GAERFYSNSMAEDPWKHLKPVLWK+ SDASSS+STGQ W P SIAPKK +
Sbjct: 228 DGGRRPMEPGAERFYSNSMAEDPWKHLKPVLWKNCSDASSSSSTGQAWLPKSIAPKKSVT 287
Query: 422 SEASHKPSNNQQSLAEYLAASLDEATCDDPSN 327
SEA+HK S+NQQSLAEYLAASLD ATCD+ SN
Sbjct: 288 SEATHKTSSNQQSLAEYLAASLDGATCDESSN 319
Score = 82 bits (200), Expect = 3e-013
Identities = 40/61 (65%), Positives = 50/61 (81%), Gaps = 1/61 (1%)
Frame = -3
Query: 1306 EDSEKRKEMLKAMRMEAAAAASQNDVSTELETSMNTSHLSNPLADASTHQQESYDKPRFD 1127
EDSEKRK+MLKAMRME AAA + +D +T ETSM+T HLSNPLA+ S HQQ+S++ RFD
Sbjct: 2 EDSEKRKQMLKAMRME-AAAQNDDDATTGTETSMSTGHLSNPLAETSNHQQDSFETQRFD 60
Query: 1126 F 1124
+
Sbjct: 61 Y 61
>gi|297803660|ref|XP_002869714.1| hydroxyproline-rich glycoprotein family
protein [Arabidopsis lyrata subsp. lyrata]
Length = 320
Score = 406 bits (1041), Expect = 8e-111
Identities = 199/272 (73%), Positives = 222/272 (81%), Gaps = 18/272 (6%)
Frame = -2
Query: 1127 FYTDPMAAYSSFKRNKSPKQQYISSPSHQMSPPV-PQFPPSV-PGSMGNDYQVHPNHGGF 954
+YTDPM+AYSSFK+ K+PKQQYISSPSHQ S PV PQFPPSV PGS+G++YQ H NHGGF
Sbjct: 62 YYTDPMSAYSSFKKIKTPKQQYISSPSHQASSPVPPQFPPSVPPGSLGSEYQAHTNHGGF 121
Query: 953 QEAHYGGDNQHTQPRGM---APSYRGPPAPWNNNFRPPPPVNHLGPPQWVPRPYPFIQGN 783
Q AHY +PRGM +P YRG PA WNNNFR PPPVNH GPPQWVPRP+PF Q
Sbjct: 122 QAAHY-------EPRGMSHLSPPYRGSPASWNNNFR-PPPVNHPGPPQWVPRPFPFSQEI 173
Query: 782 HDMGNNRYGGRGPRVGGYNNNPPQFPHYGRQNSNWAGNTYPNSGRGRGGGGRGMNTSFGR 603
+MGNNR+G R G YNN P F +YGRQN+NW GNTYPNSGRG GG GRGMNTSFGR
Sbjct: 174 PNMGNNRFGDR----GSYNNTAPHFSNYGRQNANWVGNTYPNSGRG-GGRGRGMNTSFGR 228
Query: 602 GGGRRPMEQGAERFYSNSMAEDPWKHLKPVLWKSFSDASSSNSTGQTWRPSSIAPKKPMI 423
GGRRP E GAER+YSNSMA+DPWK+LKPV+WKS SDASSSNSTGQ W P+S APKK +
Sbjct: 229 DGGRRPTELGAERYYSNSMADDPWKYLKPVIWKSCSDASSSNSTGQAWLPNSTAPKKSVT 288
Query: 422 SEASHKPSNNQQSLAEYLAASLDEATCDDPSN 327
SEA+HKPSNNQQSLAEYLAASLDEATCD+ S+
Sbjct: 289 SEATHKPSNNQQSLAEYLAASLDEATCDESSS 320
Score = 86 bits (211), Expect = 1e-014
Identities = 41/62 (66%), Positives = 50/62 (80%)
Frame = -3
Query: 1309 MEDSEKRKEMLKAMRMEAAAAASQNDVSTELETSMNTSHLSNPLADASTHQQESYDKPRF 1130
MEDSEKRK+MLKAMRMEAAA +D +T+ ETSMNT HLSNPLA+ ST Q+S++ RF
Sbjct: 1 MEDSEKRKQMLKAMRMEAAAQNDNDDSTTDPETSMNTGHLSNPLAETSTQHQDSFETSRF 60
Query: 1129 DF 1124
D+
Sbjct: 61 DY 62
>gi|30686552|ref|NP_849436.1| hydroxyproline-rich glycoprotein family protein
[Arabidopsis thaliana]
Length = 290
Score = 363 bits (931), Expect = 5e-098
Identities = 182/254 (71%), Positives = 200/254 (78%), Gaps = 18/254 (7%)
Frame = -2
Query: 1073 KQQYISSPSHQMSPPV-PQFPPSV-PGSMGNDYQVHPNHGGFQEAHYGGDNQHTQPRGMA 900
+Q + SHQ S PV PQFPPSV PGS+ ++YQ NHGGF AHY +PRGMA
Sbjct: 50 QQDSFETQSHQGSSPVPPQFPPSVPPGSLCSEYQAQTNHGGFHAAHY-------EPRGMA 102
Query: 899 ---PSYRGPPAPWNNNFRPPPPVNHLGPPQWVPRPYPFIQGNHDMGNNRYGGRGPRVGGY 729
PS+RGPPA WNNNFR PPPVNH GPPQWVPRP+PF Q +MGNNR+GGR G Y
Sbjct: 103 HLSPSHRGPPAGWNNNFR-PPPVNHSGPPQWVPRPFPFSQEMPNMGNNRFGGR----GSY 157
Query: 728 NNNPPQFPHYGRQNSNWAGNTYPNSGRGRGGGGRGMNTSFGRGGGRRPMEQGAERFYSNS 549
NN PPQF +YGRQN+NW GNTYPNSGRGR GRGMNTSFGR GGRRPME GAERFYSNS
Sbjct: 158 NNTPPQFSNYGRQNANWGGNTYPNSGRGR-SRGRGMNTSFGRDGGRRPMEPGAERFYSNS 216
Query: 548 MAEDPWKHLKPVLWKSFSDASSSNSTGQTWRPSSIAPKKPMISEASHKPSNNQQSLAEYL 369
MAEDPWKHLKPVLWK+ SDASSS+STGQ W P SIAPKK + SEA+HK S+NQQSLAEYL
Sbjct: 217 MAEDPWKHLKPVLWKNCSDASSSSSTGQAWLPKSIAPKKSVTSEATHKTSSNQQSLAEYL 276
Query: 368 AASLDEATCDDPSN 327
AASLD ATCD+ SN
Sbjct: 277 AASLDGATCDESSN 290
Score = 75 bits (182), Expect = 3e-011
Identities = 37/55 (67%), Positives = 46/55 (83%), Gaps = 1/55 (1%)
Frame = -3
Query: 1306 EDSEKRKEMLKAMRMEAAAAASQNDVSTELETSMNTSHLSNPLADASTHQQESYD 1142
EDSEKRK+MLKAMRME AAA + +D +T ETSM+T HLSNPLA+ S HQQ+S++
Sbjct: 2 EDSEKRKQMLKAMRME-AAAQNDDDATTGTETSMSTGHLSNPLAETSNHQQDSFE 55
>gi|255545984|ref|XP_002514052.1| conserved hypothetical protein [Ricinus
communis]
Length = 412
Score = 129 bits (322), Expect = 2e-027
Identities = 114/293 (38%), Positives = 144/293 (49%), Gaps = 53/293 (18%)
Frame = -2
Query: 1127 FYTDPMAAYSSFKRNKS---PKQQYISSPSHQMSPPVPQFPPSVPGSMGND-------YQ 978
FYT+PMAA+S+ KR S P +Y PS+ + P+P F VPG GN YQ
Sbjct: 139 FYTNPMAAFSADKRIASINQPAPRYFIPPSN--NGPMPWFSSPVPGP-GNPGMTPSPVYQ 195
Query: 977 VH----PNHGGFQEAHYGGDNQHTQPR-GMAPSYRGPPAPWNNNFRPPPPVNHLGPPQWV 813
+ PN Q+ Y + PR G P ++G P WN P + P +
Sbjct: 196 MQSNYLPNQRTHQQGPYNSAVPYRSPRAGPFPMHQGTPDAWNG----PGGIAAAAPYRGR 251
Query: 812 PRPYPFIQGNHDM-----GNNRYG-GRGPRVGGYNNNPPQFPHYGRQNSNWAGNTYPNSG 651
PYP + N + YG GR P G N+ P+ H G +TY SG
Sbjct: 252 MCPYPIHESNPGFQPAGSPSFNYGQGRPPWSG--NSPSPRSVHGG-------SSTY--SG 300
Query: 650 RGRG---GGGRG-MNTSFGRGG----GRRPMEQ-GAERFYSNSMAEDPWKHLKPVLWKSF 498
RG+G G RG ++ GR G G P E G E FY SM EDPWK L+PV+WK
Sbjct: 301 RGQGQWHGSSRGQISGQSGRRGFHSRGPAPGEAFGPESFYEKSMVEDPWKQLEPVVWKML 360
Query: 497 SDASSSNSTGQTWRPSSIAPKKPMISEASHKPSNNQQSLAEYLAASLDEATCD 339
SSNS W P SI+ KKP SE+S+ SN++QSLAEYLAAS +EA D
Sbjct: 361 GVPGSSNS----WLPKSISRKKPRPSESSNN-SNSKQSLAEYLAASFNEAVKD 408
>gi|224063391|ref|XP_002301125.1| predicted protein [Populus trichocarpa]
Length = 347
Score = 118 bits (294), Expect = 3e-024
Identities = 96/269 (35%), Positives = 124/269 (46%), Gaps = 32/269 (11%)
Frame = -2
Query: 1085 NKSPKQQYISSPSHQMSPPVPQFPPSVPGSMGNDY----QVHPNHGGFQEAHYGGDNQHT 918
N S Q+ S Q +P V PS M N+Y Q+ N+ Q + G H
Sbjct: 93 NISSMPQFSSPHPGQRNPEV---TPSSAYQMQNNYSPANQMQSNYSPNQRMYPGQGPYHN 149
Query: 917 QPRGMAPS--------YRGPPAPWNNNFRPPPPVNHLGPP-QWVPRPYPFIQGNHDMGNN 765
PS +G P WN P NH P + + RPYP QGN G
Sbjct: 150 AAFYRTPSNFARPFTMNQGTPEMWNG--PGGPASNHSSTPYRGISRPYPIHQGNPGFG-- 205
Query: 764 RYGGRGPRVGGYNNNPPQFPHYGRQNSNWAGNTYPNSGRGRGGG-GRGMNTSFGRGGGRR 588
G V GY +P GR G +SG G+ GG GRG F G
Sbjct: 206 PVGSSPSPVSGYGGSPAS---SGRGQGRGQGYWDSSSGLGQSGGRGRG----FRSRGFAL 258
Query: 587 PMEQGAERFYSNSMAEDPWKHLKPVLWKSFSDASSSNS---TGQTWRPSSIAPKKPMISE 417
Q E F+ NSM EDPW+HLKPVLW+ D ++ + + +W P SI+ KKP ISE
Sbjct: 259 NETQEPECFHDNSMVEDPWQHLKPVLWRGLDDPGNNLNGPVSSNSWLPKSISVKKPRISE 318
Query: 416 ASHKPSNNQQSLAEYLAASLDEATCDDPS 330
+S+K S + Q+LAEYL+A+ EAT D P+
Sbjct: 319 SSNK-STSGQTLAEYLSAAFTEATNDAPN 346
>gi|46095228|gb|AAS80151.1| ACT11D09.5 [Cucumis melo]
Length = 568
Score = 110 bits (274), Expect = 7e-022
Identities = 90/290 (31%), Positives = 131/290 (45%), Gaps = 39/290 (13%)
Frame = -2
Query: 1127 FYTDPMAAYSSFKRNKSPKQQYISS---PSHQMSPPVPQFPPSVPG------SMGNDYQV 975
+YT+PMAA+S+ K+ + Q +S P H + PP+ PG S + +Q
Sbjct: 293 YYTNPMAAFSTSKKKGKIENQPVSDTFVPYHHNTSSTTYLPPTFPGLRNPEMSPSSTHQF 352
Query: 974 H---PNHGGFQ---EAHYGGDNQHTQPRGMAPS------YRGPPAPWNNNFRPPPPVNHL 831
H P+ F ++ GG PR A + +RGP P+ N F P P +
Sbjct: 353 HQYSPDQRTFYARGDSEAGGHGSPGMPRPYAVNQGDPHMWRGPRRPFVNQF-PTHPPREM 411
Query: 830 GPPQWVPRPYPFIQGNHDMGNNRYGGRGPRVGGYNNNPPQFPHYGRQNSNWAGNTYPNS- 654
V P N +Y P G + + P GR + GN P+
Sbjct: 412 NSSSHVSGPRGNSYTNPTQDRAKYRSSSPNPGFHGSLSP-----GRGSHGHHGNMTPSPR 466
Query: 653 -GRGRGGGGRGMNTSFGRGGGRRPMEQGAERFYSNSMAEDPWKHLKPVLWKSFSDASSSN 477
G GRG G G ++ + G E+FY+ SM EDPWK L+P +W + +S+S
Sbjct: 467 FGYGRGTGFHGRHSLLDK--------SGPEQFYNVSMLEDPWKVLQPCIWTTIDSSSNSA 518
Query: 476 STGQTWRPSSIAPKKPMISEASHKPSNNQQ-SLAEYLAASLDEATCDDPS 330
++W S KK +S++S S++QQ SLAEYLAAS EA D P+
Sbjct: 519 KPSESW-ISKFGTKKARVSDSSSGRSSSQQPSLAEYLAASFKEAIEDAPN 567
>gi|255545986|ref|XP_002514053.1| conserved hypothetical protein [Ricinus
communis]
Length = 226
Score = 92 bits (228), Expect = 2e-016
Identities = 83/235 (35%), Positives = 103/235 (43%), Gaps = 31/235 (13%)
Frame = -2
Query: 1034 PPVPQFPPSVPGSMGNDYQVHPNHGGFQ-EAHYGGDNQHTQPR-GMAPSYRGPPAPWNNN 861
P P PS M ++Y PN Q + Y + PR G+ P ++G P WN
Sbjct: 4 PGNPGMTPSPAYQMQSNYL--PNQRTHQAQGPYNSAVPYRSPRTGLFPMHQGTPDAWNG- 60
Query: 860 FRPPPPVNHLGPPQWVPRPYPFIQGNHDMGNNR-----YG-GRGPRVGGYNNNPPQFPHY 699
P + P + PYP + N R YG GR P G NN P+ H
Sbjct: 61 ---PGGIAAAAPYRGRMCPYPIYESNPGFQPARSPSFNYGQGRPPWSG--NNPCPRSVHG 115
Query: 698 G------RQNSNWAGNTYPNSGRGRGGGGRGMNTSFGRGGGRRPMEQGAERFYSNSMAED 537
G R W G+ G GRG + S G G G E F+ SM ED
Sbjct: 116 GSSTYSRRGQGQWHGSNRGQISGQSGRRGRGFH-SRGPASGE---AFGPESFHDKSMVED 171
Query: 536 PWKHLKPVLWKSFSDASSSNSTGQTWRPSSIAPKKPMISEASHKPSNNQQSLAEY 372
PWK L+PV+WK SSNS W P SI+ KKP SE S+ SN++QSLAEY
Sbjct: 172 PWKQLEPVVWKMLEVPRSSNS----WLPKSISRKKPRPSEPSNN-SNSKQSLAEY 221
>gi|224081921|ref|XP_002306529.1| predicted protein [Populus trichocarpa]
Length = 331
Score = 79 bits (193), Expect = 2e-012
Identities = 55/136 (40%), Positives = 75/136 (55%), Gaps = 14/136 (10%)
Frame = -2
Query: 746 PRVGGYNNNPPQFPHYGRQ---NSNWAGNTYPNSGRGRGGG-GRGMNTSFGRGGGRRPME 579
P G ++P YG + G+ + +SG G+ GG GRG ++ G P E
Sbjct: 202 PGFGPVGSSPSPVSGYGGSPAISQTGQGHWHSSSGFGQSGGRGRGFHSR-----GFAPNE 256
Query: 578 -QGAERFYSNSMAEDPWKHLKPVLWKSFSD-ASSSNSTG--QTWRPSSIAPKKPMISEAS 411
QG E FY NSM EDPW+HL+PVLW D ++ N G + P SI+ KK ++E+S
Sbjct: 257 AQGPECFYDNSMVEDPWQHLEPVLWSGLDDWGNNLNGPGSSNSLLPKSISMKKSSVAESS 316
Query: 410 HKPSNNQQSLAEYLAA 363
+K S + SLAEYLAA
Sbjct: 317 NK-STSGVSLAEYLAA 331
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 359,848,489,672
Number of Sequences: 15229318
Number of Extensions: 359848489672
Number of Successful Extensions: 91916522
Number of sequences better than 0.0: 0
|