BLASTX 7.6.2
Query= UN82542 /QuerySize=811
(810 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|7594584|emb|CAB88077.1| hypothetical protein [Arabidopsis tha... 318 7e-085
gi|21554414|gb|AAM63519.1| glycoprotein homolog [Arabidopsis tha... 318 7e-085
gi|15235884|ref|NP_193412.1| hydroxyproline-rich glycoprotein fa... 316 3e-084
gi|8919877|emb|CAB96200.1| hypothetical protein [Capsella rubella] 314 1e-083
gi|297800428|ref|XP_002868098.1| hydroxyproline-rich glycoprotei... 307 2e-081
gi|255581309|ref|XP_002531465.1| conserved hypothetical protein ... 109 8e-022
gi|296085704|emb|CBI29503.3| unnamed protein product [Vitis vini... 95 1e-017
gi|224074191|ref|XP_002304294.1| predicted protein [Populus tric... 87 2e-015
gi|225453426|ref|XP_002272713.1| PREDICTED: hypothetical protein... 77 3e-012
gi|147843896|emb|CAN81597.1| hypothetical protein VITISV_039396 ... 75 1e-011
gi|297820898|ref|XP_002878332.1| hypothetical protein ARALYDRAFT... 65 8e-009
gi|15232309|ref|NP_191597.1| uncharacterized protein [Arabidopsi... 63 4e-008
gi|255541082|ref|XP_002511605.1| hypothetical protein RCOM_16086... 62 1e-007
>gi|7594584|emb|CAB88077.1| hypothetical protein [Arabidopsis thaliana]
Length = 471
Score = 318 bits (814), Expect = 7e-085
Identities = 186/279 (66%), Positives = 207/279 (74%), Gaps = 39/279 (13%)
Frame = +1
Query: 1 QSGDKEDQNPRKFYSRFLFKALILALLCSLVPVFLSQTPELANQTRLLELLHMIFVGIAV 180
Q G+KEDQNPRKFYSRF+FKALIL +LC++VPVFLSQTPELANQTRLLELLH++FVGIAV
Sbjct: 10 QLGNKEDQNPRKFYSRFIFKALILTVLCAVVPVFLSQTPELANQTRLLELLHLVFVGIAV 69
Query: 181 SYGLFSRRNYDGGGGGRGSNNDHN---NNTNNPHPYVPKILEVSSSVFNVDHES---GSD 342
SYGLFSRRNYDGGGGG SN+DHN ++ NN H YVPKILEV SSVFNV HES SD
Sbjct: 70 SYGLFSRRNYDGGGGGGTSNSDHNKADHSNNNSHSYVPKILEV-SSVFNVGHESESEPSD 128
Query: 343 DSSGDHPHPHRRIQNKYQTKSTETGLSKDNESRFVDRVSSGVREKPLLLPVRSLNYSHLS 522
DSSGD + +NKY K E E+RFVDRVSS REKPLLLPVRSLNYS +S
Sbjct: 129 DSSGDQ-RKFQTWKNKYHMKIPEV------ETRFVDRVSSENREKPLLLPVRSLNYSRVS 181
Query: 523 D-SGDGSGRWERVRSKRQLLKTLVDDDSTDALPSPIPWRSRSSME--------------- 654
D SGD SGRWE+VRSKR+LLKTL DD+S D LPSPIPWRSRSS
Sbjct: 182 DSSGDNSGRWEKVRSKRELLKTLGDDNS-DVLPSPIPWRSRSSSSSSSSSKEVESLPSVK 240
Query: 655 ----IESQPLIKTPASLTSSSALSSSPRKSTPLPKLTSE 759
+ESQPLIK +LT SS+ SSPRKS P+P L SE
Sbjct: 241 NLTTVESQPLIK---NLTPSSSF-SSPRKSNPIPNLASE 275
>gi|21554414|gb|AAM63519.1| glycoprotein homolog [Arabidopsis thaliana]
Length = 473
Score = 318 bits (814), Expect = 7e-085
Identities = 186/279 (66%), Positives = 207/279 (74%), Gaps = 39/279 (13%)
Frame = +1
Query: 1 QSGDKEDQNPRKFYSRFLFKALILALLCSLVPVFLSQTPELANQTRLLELLHMIFVGIAV 180
Q G+KEDQNPRKFYSRF+FKALIL +LC++VPVFLSQTPELANQTRLLELLH++FVGIAV
Sbjct: 12 QLGNKEDQNPRKFYSRFIFKALILTVLCAVVPVFLSQTPELANQTRLLELLHLVFVGIAV 71
Query: 181 SYGLFSRRNYDGGGGGRGSNNDHN---NNTNNPHPYVPKILEVSSSVFNVDHES---GSD 342
SYGLFSRRNYDGGGGG SN+DHN ++ NN H YVPKILEV SSVFNV HES SD
Sbjct: 72 SYGLFSRRNYDGGGGGGTSNSDHNKADHSNNNSHSYVPKILEV-SSVFNVGHESESEPSD 130
Query: 343 DSSGDHPHPHRRIQNKYQTKSTETGLSKDNESRFVDRVSSGVREKPLLLPVRSLNYSHLS 522
DSSGD + +NKY K E E+RFVDRVSS REKPLLLPVRSLNYS +S
Sbjct: 131 DSSGDQ-RKFQTWKNKYHMKIPEV------ETRFVDRVSSENREKPLLLPVRSLNYSRVS 183
Query: 523 D-SGDGSGRWERVRSKRQLLKTLVDDDSTDALPSPIPWRSRSSME--------------- 654
D SGD SGRWE+VRSKR+LLKTL DD+S D LPSPIPWRSRSS
Sbjct: 184 DSSGDNSGRWEKVRSKRELLKTLGDDNS-DVLPSPIPWRSRSSSSSSSSSKEVESLPSVK 242
Query: 655 ----IESQPLIKTPASLTSSSALSSSPRKSTPLPKLTSE 759
+ESQPLIK +LT SS+ SSPRKS P+P L SE
Sbjct: 243 NLTTVESQPLIK---NLTPSSSF-SSPRKSNPIPNLASE 277
>gi|15235884|ref|NP_193412.1| hydroxyproline-rich glycoprotein family protein
[Arabidopsis thaliana]
Length = 473
Score = 316 bits (809), Expect = 3e-084
Identities = 185/279 (66%), Positives = 206/279 (73%), Gaps = 39/279 (13%)
Frame = +1
Query: 1 QSGDKEDQNPRKFYSRFLFKALILALLCSLVPVFLSQTPELANQTRLLELLHMIFVGIAV 180
Q G+KEDQNPRKFYSRF+FKALIL +LC++VPVFLSQTPELANQTRLLELLH++FVGIAV
Sbjct: 12 QLGNKEDQNPRKFYSRFIFKALILTVLCAVVPVFLSQTPELANQTRLLELLHLVFVGIAV 71
Query: 181 SYGLFSRRNYDGGGGGRGSNNDHN---NNTNNPHPYVPKILEVSSSVFNVDHES---GSD 342
SYGLFSRRNYDGGGGG SN+DHN ++ NN H YVPKILEV SSVFNV HES SD
Sbjct: 72 SYGLFSRRNYDGGGGGGTSNSDHNKADHSNNNSHSYVPKILEV-SSVFNVGHESESEPSD 130
Query: 343 DSSGDHPHPHRRIQNKYQTKSTETGLSKDNESRFVDRVSSGVREKPLLLPVRSLNYSHLS 522
DSSGD + +NKY K E E+RFVDRVSS REKPLLLPVRSLNYS +S
Sbjct: 131 DSSGDQ-RKFQTWKNKYHMKIPEV------ETRFVDRVSSENREKPLLLPVRSLNYSRVS 183
Query: 523 D-SGDGSGRWERVRSKRQLLKTLVDDDSTDALPSPIPWRSRSSME--------------- 654
D SGD SGRWE+VRSKR+LLKTL DD+S D LPSPIPWRSRSS
Sbjct: 184 DSSGDNSGRWEKVRSKRELLKTLGDDNS-DVLPSPIPWRSRSSSSSSSSSKEVESLPSVK 242
Query: 655 ----IESQPLIKTPASLTSSSALSSSPRKSTPLPKLTSE 759
+ESQPLIK +LT S+ SSPRKS P+P L SE
Sbjct: 243 NLTTVESQPLIK---NLTPPSSF-SSPRKSNPIPNLASE 277
>gi|8919877|emb|CAB96200.1| hypothetical protein [Capsella rubella]
Length = 470
Score = 314 bits (803), Expect = 1e-083
Identities = 181/276 (65%), Positives = 205/276 (74%), Gaps = 36/276 (13%)
Frame = +1
Query: 1 QSGDKEDQNPRKFYSRFLFKALILALLCSLVPVFLSQTPELANQTRLLELLHMIFVGIAV 180
Q G KEDQNP +FYSRF+FKALIL +LC++VPVFLSQTPELANQTRLLELLH++FVGIAV
Sbjct: 12 QLGTKEDQNPTRFYSRFIFKALILTVLCAVVPVFLSQTPELANQTRLLELLHLVFVGIAV 71
Query: 181 SYGLFSRRNYDGGGGGRGSNNDHN----NNTNNPHPYVPKILEVSSSVFNVDHES---GS 339
SYGLFSRRNYDGGG G SN+D+N +N NN H YVPK+LEV SSVFNVDHES S
Sbjct: 72 SYGLFSRRNYDGGGAGGSSNSDYNKADHHNNNNSHSYVPKLLEV-SSVFNVDHESESEPS 130
Query: 340 DDSSGDHPHPHRRIQNKYQTKSTETGLSKDNESRFVDRVSSGVREKPLLLPVRSLNYSHL 519
DDSSGDH + +NKY K E E+RFVDRVSS +REKPLLLPVRSLNY +
Sbjct: 131 DDSSGDH-RKFQAWRNKYHMKIPEV------ETRFVDRVSSEIREKPLLLPVRSLNYYPV 183
Query: 520 SD-SGDGSGRWERVRSKRQLLKTLVDDDSTDALPSPIPWRSRSSME-------------- 654
D SGD SGRW++VRSKRQLLKTL DD+S D LPSPIPWRSRSS
Sbjct: 184 PDSSGDNSGRWDKVRSKRQLLKTLGDDNS-DVLPSPIPWRSRSSSSSKEIESPPSIKNLT 242
Query: 655 -IESQPLIKTPASLTSSSALSSSPRKSTPLPKLTSE 759
+ESQPLIK +LT SS+ SSPRKS P+P L S+
Sbjct: 243 TVESQPLIK---NLTPSSSY-SSPRKSNPIPNLASQ 274
>gi|297800428|ref|XP_002868098.1| hydroxyproline-rich glycoprotein family
protein [Arabidopsis lyrata subsp. lyrata]
Length = 461
Score = 307 bits (785), Expect = 2e-081
Identities = 180/267 (67%), Positives = 203/267 (76%), Gaps = 27/267 (10%)
Frame = +1
Query: 1 QSGDKEDQNPRKFYSRFLFKALILALLCSLVPVFLSQTPELANQTRLLELLHMIFVGIAV 180
Q G+KEDQNPRKFYSRFLFKALIL LLC++VPVFLSQTPELANQTRL+ELLH++FVGIAV
Sbjct: 12 QLGNKEDQNPRKFYSRFLFKALILTLLCAVVPVFLSQTPELANQTRLIELLHLVFVGIAV 71
Query: 181 SYGLFSRRNYDGGGGGRGSNNDHN---NNTNNPHPYVPKILEVSSSVFNVDHES---GSD 342
SYGLFSRRNYDGGGG SN+D+N ++ NN HPYVPKILEV SSVFNV +ES SD
Sbjct: 72 SYGLFSRRNYDGGGGEGTSNSDNNKADHSNNNLHPYVPKILEV-SSVFNVGNESESEPSD 130
Query: 343 DSSGDHPHPHRRIQNKYQTKSTETGLSKDNESRFVDRVSSGVREKPLLLPVRSLNYSHLS 522
DSSGD + +NKY K E E+RFV+RVSS +REKPLLLPVRSLNYS +
Sbjct: 131 DSSGDQ-RKFQTWKNKYHMKIPEV------ETRFVERVSSEIREKPLLLPVRSLNYSRVP 183
Query: 523 D-SGDGSGRWERVRSKRQLLKTLVDDDSTDALPSPIPWRSRSSME-------IESQPLIK 678
D S D SGRWE+VRSKR+LLKTL DD+S D LPSPIPWRSRSS +ESQP IK
Sbjct: 184 DSSSDNSGRWEKVRSKRELLKTLGDDNS-DVLPSPIPWRSRSSSSSVKNMATVESQPWIK 242
Query: 679 TPASLTSSSALSSSPRKSTPLPKLTSE 759
+LT SSA SPRKS LP L S+
Sbjct: 243 ---NLTPSSAF-PSPRKSNLLPNLASQ 265
>gi|255581309|ref|XP_002531465.1| conserved hypothetical protein [Ricinus
communis]
Length = 565
Score = 109 bits (270), Expect = 8e-022
Identities = 72/183 (39%), Positives = 100/183 (54%), Gaps = 11/183 (6%)
Frame = +1
Query: 1 QSGDKEDQNPRKFYSRFLFKALILALLCSLVPVFLSQTPELANQ---TRLLELLHMIFVG 171
Q+ + NP KFYS FL+KALI+ + ++P+F SQ PE NQ TR E LH+IFVG
Sbjct: 14 QNQANNNNNPSKFYSHFLYKALIVTIFLVILPLFPSQAPEFINQTLNTRGWEFLHLIFVG 73
Query: 172 IAVSYGLFSRRNYDGGGGGRGSNNDHNNNTNNPHPYVPKILEVSSSVFNVDHESGSDDSS 351
IAVSYGLFSRRN + +N N+ +N YV + L+V SSVF+ D +S S
Sbjct: 74 IAVSYGLFSRRNDE-----TEKDNSSNSKFDNAQSYVSRFLQV-SSVFDDDADSPSKSDV 127
Query: 352 GDHPHPHRRIQNKYQTKSTETGLSKDNESRFVDRVSSGVR--EKPLLLPVRSLNYSHLSD 525
+ Y+ + + + + ++ S+G R EKPLLLP+RSL L
Sbjct: 128 SNSTSVQTWNNQYYRNEPVVVVAEEQHPAFDQEQRSTGSRIGEKPLLLPIRSLKSRVLDA 187
Query: 526 SGD 534
G+
Sbjct: 188 DGN 190
>gi|296085704|emb|CBI29503.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 95 bits (234), Expect = 1e-017
Identities = 56/113 (49%), Positives = 76/113 (67%), Gaps = 13/113 (11%)
Frame = +1
Query: 25 NPRKFYSRFLFKALILALLCSLVPVFLSQTPELANQT---RLLELLHMIFVGIAVSYGLF 195
NP KFYS FL+KALI+ L +++P+F SQ PE NQT R ELLH++FVGIAVSYGLF
Sbjct: 13 NPSKFYSGFLYKALIVTLFLAILPLFPSQAPEFINQTVFNRSWELLHLVFVGIAVSYGLF 72
Query: 196 SRRNYDGGGGGRGSNNDHNNNTNNPHPYVPKILEVSSSVFN--VDHESGSDDS 348
SRRN + + ++++ +N YV + L+V SSVF+ V+ SGS +S
Sbjct: 73 SRRNDE-------TEKENHSKFDNAQSYVSRFLQV-SSVFDEEVESPSGSGES 117
>gi|224074191|ref|XP_002304294.1| predicted protein [Populus trichocarpa]
Length = 580
Score = 87 bits (215), Expect = 2e-015
Identities = 49/110 (44%), Positives = 72/110 (65%), Gaps = 6/110 (5%)
Frame = +1
Query: 13 KEDQNPRKFYSRFLFKALILALLCSLVPVFLSQTPELANQ---TRLLELLHMIFVGIAVS 183
++ NP K+Y+ FL+KALI+ + ++P+F SQ PE NQ TR E LH++FVGIAVS
Sbjct: 13 QKQANPTKYYTHFLYKALIVTVFLIILPLFPSQAPEFINQTLNTRGWEFLHLVFVGIAVS 72
Query: 184 YGLFSRRNYDGGGGGRGSNNDHNNNTNNPHPYVPKILEVSSSVFNVDHES 333
YGLFS+RN + +N+ + + +N YV + L+V SSVF+ D +S
Sbjct: 73 YGLFSKRNDE--TEKENNNSSNQSKFDNAQSYVSRFLQV-SSVFDDDVDS 119
>gi|225453426|ref|XP_002272713.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 549
Score = 77 bits (188), Expect = 3e-012
Identities = 47/117 (40%), Positives = 68/117 (58%), Gaps = 15/117 (12%)
Frame = +1
Query: 13 KEDQNPRKFYSRFLFKALILALLCSLVPVFLSQTPELANQ---TRLLELLHMIFVGIAVS 183
+ ++ P K + FL K+LI AL ++P+F SQ PE N T+ ELLH++F+GIAVS
Sbjct: 11 RPNRTPSKSCTHFLCKSLIFALFLVVIPLFPSQAPEYINHTLITKFWELLHLLFIGIAVS 70
Query: 184 YGLFSRRNYDGGGGGRGSNNDHNNNTNNPHPYVPKILEVSSSVFNVDHESGSDDSSG 354
YG+FSRRN D G + ++ +N Y + L V SS+F E G ++S G
Sbjct: 71 YGVFSRRNVDRG-------IESHSTVDNSESYASRFLHV-SSIF----EDGFENSCG 115
>gi|147843896|emb|CAN81597.1| hypothetical protein VITISV_039396 [Vitis
vinifera]
Length = 909
Score = 75 bits (183), Expect = 1e-011
Identities = 46/117 (39%), Positives = 67/117 (57%), Gaps = 15/117 (12%)
Frame = +1
Query: 13 KEDQNPRKFYSRFLFKALILALLCSLVPVFLSQTPELANQ---TRLLELLHMIFVGIAVS 183
+ ++ P K + FL K+LI AL ++P+F SQ PE N T+ ELLH++F+GIAVS
Sbjct: 11 RPNRTPSKSCTHFLCKSLIFALFLVVIPLFPSQAPEYINHTLITKFWELLHLLFIGIAVS 70
Query: 184 YGLFSRRNYDGGGGGRGSNNDHNNNTNNPHPYVPKILEVSSSVFNVDHESGSDDSSG 354
YG+FSRRN D G + ++ +N Y + V SS+F E G ++S G
Sbjct: 71 YGVFSRRNVDRG-------IESHSTVDNSESYASRFXHV-SSIF----EDGFENSCG 115
>gi|297820898|ref|XP_002878332.1| hypothetical protein ARALYDRAFT_907560
[Arabidopsis lyrata subsp. lyrata]
Length = 741
Score = 65 bits (158), Expect = 8e-009
Identities = 39/100 (39%), Positives = 61/100 (61%), Gaps = 9/100 (9%)
Frame = +1
Query: 49 FLFKALILALLCSLVPVFLSQTPELANQ---TRLLELLHMIFVGIAVSYGLFSRRNYDGG 219
F K+++ AL +P+F SQ P+ + T+ EL+H++FVGIAV+YGLFSRRN + G
Sbjct: 33 FFCKSVLFALFLFALPLFPSQAPDFVGETVLTKFWELIHLLFVGIAVAYGLFSRRNVESG 92
Query: 220 GGGRGSNNDHNNNTNNPHPYVPKILEVSSSVFNVDHESGS 339
R + D ++ + YV +I +V SSVF+ + + S
Sbjct: 93 VDLRMNRVDESSLS-----YVSRIFQV-SSVFDEEFDDNS 126
>gi|15232309|ref|NP_191597.1| uncharacterized protein [Arabidopsis thaliana]
Length = 743
Score = 63 bits (152), Expect = 4e-008
Identities = 38/100 (38%), Positives = 60/100 (60%), Gaps = 9/100 (9%)
Frame = +1
Query: 49 FLFKALILALLCSLVPVFLSQTPELANQ---TRLLELLHMIFVGIAVSYGLFSRRNYDGG 219
F K+++ AL +P+F SQ P+ + T+ EL+H++FVGIAV+YGLFSRRN +
Sbjct: 32 FFCKSVLFALFLLALPLFPSQAPDFVGETVLTKFWELIHLLFVGIAVAYGLFSRRNVESA 91
Query: 220 GGGRGSNNDHNNNTNNPHPYVPKILEVSSSVFNVDHESGS 339
R + D ++ + YV +I +V SSVF+ + + S
Sbjct: 92 VDLRMTRVDESSLS-----YVSRIFQV-SSVFDEEFDDNS 125
>gi|255541082|ref|XP_002511605.1| hypothetical protein RCOM_1608690 [Ricinus
communis]
Length = 638
Score = 62 bits (148), Expect = 1e-007
Identities = 30/64 (46%), Positives = 41/64 (64%), Gaps = 3/64 (4%)
Frame = +1
Query: 34 KFYSRFLFKALILALLCSLVPVFLSQTPELANQTRLL---ELLHMIFVGIAVSYGLFSRR 204
K + R + K+L L +P+F SQ P NQT L EL+H++F+G+AVSYGLFS R
Sbjct: 19 KSFIRIICKSLFFVLFLIAIPLFPSQAPNFVNQTLLTKFWELVHLLFIGVAVSYGLFSSR 78
Query: 205 NYDG 216
N +G
Sbjct: 79 NVEG 82
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,352,524,086,339
Number of Sequences: 15229318
Number of Extensions: 1352524086339
Number of Successful Extensions: 374284986
Number of sequences better than 0.0: 0