BLASTX 7.6.2
Query= UN16887 /QuerySize=877
(876 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
gi|18423010|ref|NP_568708.1| hydroxyproline-rich glycoprotein fa... 303 3e-080
gi|297795679|ref|XP_002865724.1| hypothetical protein ARALYDRAFT... 297 2e-078
gi|297804592|ref|XP_002870180.1| predicted protein [Arabidopsis ... 189 6e-046
gi|15234820|ref|NP_193349.1| proline-rich family protein [Arabid... 179 4e-043
gi|224106966|ref|XP_002314326.1| predicted protein [Populus tric... 170 3e-040
gi|225431867|ref|XP_002271531.1| PREDICTED: hypothetical protein... 165 1e-038
gi|18397707|ref|NP_566291.1| hydroxyproline-rich glycoprotein fa... 157 2e-036
gi|297833450|ref|XP_002884607.1| hypothetical protein ARALYDRAFT... 157 2e-036
gi|224130440|ref|XP_002328609.1| predicted protein [Populus tric... 154 3e-035
gi|255556284|ref|XP_002519176.1| nutrient reservoir, putative [R... 153 4e-035
gi|255548545|ref|XP_002515329.1| nutrient reservoir, putative [R... 137 3e-030
gi|225678143|gb|EEH16427.1| conserved hypothetical protein [Para... 85 1e-014
gi|115386684|ref|XP_001209883.1| conserved hypothetical protein ... 80 5e-013
gi|332025507|gb|EGI65670.1| hypothetical protein G5I_05770 [Acro... 78 1e-012
gi|261190885|ref|XP_002621851.1| cytokinesis protein sepA [Ajell... 77 2e-012
>gi|18423010|ref|NP_568708.1| hydroxyproline-rich glycoprotein family protein
[Arabidopsis thaliana]
Length = 162
Score = 303 bits (774), Expect = 3e-080
Identities = 139/165 (84%), Positives = 149/165 (90%), Gaps = 4/165 (2%)
Frame = -2
Query: 755 MEANQLYGLSTLVLILLLSLTSTVTSKDEVVSCTMCSSCDNPCNPVPSYPPPPPPSRPPP 576
ME N LY LSTLV++LL+S+T TVTSKDEVVSCTMCSSCDNPC+PV S PPPP P PPP
Sbjct: 1 METNHLYTLSTLVVMLLVSVTPTVTSKDEVVSCTMCSSCDNPCSPVQSSPPPPSP--PPP 58
Query: 575 SPSTTTACPPPPSPPSSGGGSSYYYPPPSQS-GGDKYPPPYGDGGQGYYYPPPYSGNYPT 399
S + TTACPPPPSPPSSGGGSSYYYPPPSQS GG KYPPPYG GGQGYYYPPPYSGNYPT
Sbjct: 59 S-TPTTACPPPPSPPSSGGGSSYYYPPPSQSGGGSKYPPPYGGGGQGYYYPPPYSGNYPT 117
Query: 398 PPPPNPIVPYFPFYYHTPPPGSGSDRFMGSGSVMFALFAMFLCLI 264
PPPPNPIVPYFPFYYHTPPPGSGSDRFM S S++FALFA+FLCL+
Sbjct: 118 PPPPNPIVPYFPFYYHTPPPGSGSDRFMSSYSIIFALFAVFLCLV 162
>gi|297795679|ref|XP_002865724.1| hypothetical protein ARALYDRAFT_494989
[Arabidopsis lyrata subsp. lyrata]
Length = 161
Score = 297 bits (759), Expect = 2e-078
Identities = 137/164 (83%), Positives = 144/164 (87%), Gaps = 4/164 (2%)
Frame = -2
Query: 755 MEANQLYGLSTLVLILLLSLTSTVTSKDEVVSCTMCSSCDNPCNPVPSYPPPPPPSRPPP 576
ME N LY STLV+ILL+S+T TVTSKDEVVSCTMCSSCDNPC+PV S PPPP P PPP
Sbjct: 1 METNHLYTFSTLVVILLMSVTPTVTSKDEVVSCTMCSSCDNPCSPVQSSPPPPSP--PPP 58
Query: 575 SPSTTTACPPPPSPPSSGGGSSYYYPPPSQS-GGDKYPPPYGDGGQGYYYPPPYSGNYPT 399
S + TTACPPPPSPP SGGGSSYYYPPPSQS GG KYPPPYG GGQGYYYPPPYSGNYPT
Sbjct: 59 S-TPTTACPPPPSPPRSGGGSSYYYPPPSQSGGGSKYPPPYGGGGQGYYYPPPYSGNYPT 117
Query: 398 PPPPNPIVPYFPFYYHTPPPGSGSDRFMGSGSVMFALFAMFLCL 267
PPPPNPIVPYFPFYYHTPPPGSGSDRFM S SV+F FA+FLCL
Sbjct: 118 PPPPNPIVPYFPFYYHTPPPGSGSDRFMSSCSVIFTFFAVFLCL 161
>gi|297804592|ref|XP_002870180.1| predicted protein [Arabidopsis lyrata subsp.
lyrata]
Length = 167
Score = 189 bits (479), Expect = 6e-046
Identities = 98/168 (58%), Positives = 110/168 (65%), Gaps = 11/168 (6%)
Frame = -2
Query: 755 MEANQLYGLSTLVL-ILLLSLTSTVTSKDEVVSCTMCSSCDNPCNPVPSYPPPPPPSRPP 579
ME + + L L+L + T+T+TS ++ CTMC+SCDNPC P PS PPP PS PP
Sbjct: 1 METLRTFHLFLLLLFFFFFTFTTTLTSPSQIADCTMCTSCDNPCQPNPSPPPPSNPSPPP 60
Query: 578 PSPSTTTACPPPPSPPSSGGGSSYYYPPPSQSGGDKYPPPYGDGGQGYYYPPPYS-GNYP 402
P+P TTTACPPPPS S GGG YYYPPPSQSG Y PP G GYYYPPP S GNYP
Sbjct: 61 PAP-TTTACPPPPS--SGGGGPYYYYPPPSQSG--SYRPPPSSSGGGYYYPPPKSGGNYP 115
Query: 401 TPPPPNPIVPYFPFYYHTPPPG---SGSD-RFMGSGSVMFALFAMFLC 270
PPPNPIVPYFPFYY+ PPP SGSD + S V F L LC
Sbjct: 116 FTPPPNPIVPYFPFYYYNPPPQSVMSGSDAKIRFSYGVSFLLLVFSLC 163
>gi|15234820|ref|NP_193349.1| proline-rich family protein [Arabidopsis
thaliana]
Length = 164
Score = 179 bits (454), Expect = 4e-043
Identities = 88/137 (64%), Positives = 96/137 (70%), Gaps = 10/137 (7%)
Frame = -2
Query: 719 VLILLLSLTSTVTSKDEVVSCTMCSSCDNPCNPVPSYPPPPP-PSRPPPSPSTTTACPPP 543
+ + T+T+TS ++ CTMC+SCDNPC P PS PPPP PS PPPSP TTTACPPP
Sbjct: 11 LFFFFFTFTTTLTSPSQIADCTMCTSCDNPCQPNPSPPPPPSNPSPPPPSP-TTTACPPP 69
Query: 542 PSPPSSGGGSSYYYPPPSQSGGDKYPPPYGDGGQGYYYPPPYS-GNYPTPPPPNPIVPYF 366
PS SSGGG YYYPP SQSG Y PP GYYYPPP S GNYP PPPNPIVPYF
Sbjct: 70 PS--SSGGGPYYYYPPASQSG--SYRPPPSSSSGGYYYPPPKSGGNYPYTPPPNPIVPYF 125
Query: 365 PFYYHTPPPG---SGSD 324
PFYY+ PPP SGSD
Sbjct: 126 PFYYYNPPPQSVMSGSD 142
>gi|224106966|ref|XP_002314326.1| predicted protein [Populus trichocarpa]
Length = 174
Score = 170 bits (430), Expect = 3e-040
Identities = 92/174 (52%), Positives = 104/174 (59%), Gaps = 12/174 (6%)
Frame = -2
Query: 755 MEANQLYGLSTLVLILLLSLTSTVTSKDE----VVSCTMCSSCDNPCNPVPSYPP--PPP 594
ME LS L L +LL T + T ++CTMCS+C PVPS PP PPP
Sbjct: 1 METRHKLKLSLLALFMLLPSTKSSTMPKSRMLYQIACTMCSTCCG-STPVPSPPPPSPPP 59
Query: 593 PSRPPPSPSTTTACPPPPSPPSSGGGSSYYY-PPPSQSGGDKYPPPYGDGGQGYYYPPPY 417
P+ PP P+TT CPPPPSPP SGGGS YY PPPS PPP G G YYPPP
Sbjct: 60 PAASPPPPATTAICPPPPSPPPSGGGSYYYSPPPPSTYTYSSPPPPQGGVVGGTYYPPPN 119
Query: 416 SGNYPTPPPPNPIVPYFPFYYHTPPPGSGSDRF----MGSGSVMFALFAMFLCL 267
NYPTPPPPNPIVPYFPFYY++PPP S S F S SV+ + A+ LCL
Sbjct: 120 YKNYPTPPPPNPIVPYFPFYYYSPPPPSMSASFKLMASYSTSVLVGVVALVLCL 173
>gi|225431867|ref|XP_002271531.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 196
Score = 165 bits (416), Expect = 1e-038
Identities = 86/158 (54%), Positives = 105/158 (66%), Gaps = 19/158 (12%)
Frame = -2
Query: 695 TSTVTSKDEV-VSCTMCSSCDNPCNPVPSYPPPPPPSRPPPS---PSTTTACPPPPSPPS 528
T+ V +K + + CTMCS+CDNPCN VPS PPPP PS PPPS PS+++ CPPPPSPPS
Sbjct: 44 TADVAAKSKYQIECTMCSACDNPCNQVPS-PPPPNPSPPPPSPPPPSSSSNCPPPPSPPS 102
Query: 527 SGGGSSYYY--PPPSQSG--GDKYPPPY----GDGGQGYYYPPPYSGNYPTPPPPNPIVP 372
+ +YYY PPPSQ PPP G GG G +YPPPY NYP PPPPNPIVP
Sbjct: 103 N---PTYYYSPPPPSQPTYVYSSPPPPNGYNGGGGGGGAFYPPPYQ-NYPAPPPPNPIVP 158
Query: 371 YFPFYYHTPPPGSGSDRFMGSGSVMF--ALFAMFLCLI 264
YFPF+YHTPPP S ++ S S ++ AL ++ C +
Sbjct: 159 YFPFWYHTPPPTSAANSVYFSDSTVYTIALISLLFCFL 196
>gi|18397707|ref|NP_566291.1| hydroxyproline-rich glycoprotein family protein
[Arabidopsis thaliana]
Length = 147
Score = 157 bits (396), Expect = 2e-036
Identities = 74/107 (69%), Positives = 80/107 (74%), Gaps = 5/107 (4%)
Frame = -2
Query: 581 PPSPSTTTACPPPPSPPSSGGGSSYYY--PPPSQSGGDKYPPPYGDGGQGYYYPPPYSGN 408
P +P ++ PPPP P SSGGG SYYY PPPS SGG KYPPPYG G G YYPPPY GN
Sbjct: 41 PCNPVPSSYSPPPPPPSSSGGGGSYYYSPPPPSSSGGVKYPPPYGGDGYGGYYPPPYYGN 100
Query: 407 YPTPPPPNPIVPYFPFYYHTPPPG-SGSDRFMGSGSVMFALFAMFLC 270
Y TPPPPNPIVPYFPFYYHTPP G SGS R S++FALFA+ LC
Sbjct: 101 YGTPPPPNPIVPYFPFYYHTPPQGYSGSARL--QNSLLFALFAVLLC 145
Score = 104 bits (259), Expect = 2e-020
Identities = 66/125 (52%), Positives = 69/125 (55%), Gaps = 8/125 (6%)
Frame = -2
Query: 752 EANQLYGLSTLVLILLLSLTSTVTSKDEVVSCTMCSSCDNPCNPVPSYPPPPPPSRPPPS 573
EA + LS L LILL+SL VTSKDE VSCTMCSSCDNPCNPVPS PPPP PP S
Sbjct: 2 EAIRFSNLSGL-LILLMSLPLLVTSKDETVSCTMCSSCDNPCNPVPSSYSPPPP--PPSS 58
Query: 572 PSTTTACPPPPSPPSSGGGSSYYYPPPSQSGGDKYPPPYGDGGQGYYYPP----PYSGNY 405
+ P PPSS GG Y P G YPPPY G G PP PY Y
Sbjct: 59 SGGGGSYYYSPPPPSSSGGVKYPPPYGGDGYGGYYPPPY-YGNYGTPPPPNPIVPYFPFY 117
Query: 404 PTPPP 390
PP
Sbjct: 118 YHTPP 122
>gi|297833450|ref|XP_002884607.1| hypothetical protein ARALYDRAFT_477990
[Arabidopsis lyrata subsp. lyrata]
Length = 149
Score = 157 bits (396), Expect = 2e-036
Identities = 78/116 (67%), Positives = 83/116 (71%), Gaps = 13/116 (11%)
Frame = -2
Query: 593 PSRPPPSPSTTTACPPPPSPPSSGGGSSYYY--PPPSQSGGDKYPPPYGD---GGQGYYY 429
P P PS + PPP P SSGGG SYYY PPPS SGG KYPPPYG GGQGYYY
Sbjct: 41 PCNPVPS-----SYSPPPPPSSSGGGGSYYYSPPPPSSSGGAKYPPPYGGDGYGGQGYYY 95
Query: 428 PPPYSGNYPTPPPPNPIVPYFPFYYHTPPPG-SGSDRFMGSGSVMFALFAMFLCLI 264
PPPY GNY TPPPPNPIVPYFPFYYHTPP G SGS R S++FALFA+ LC +
Sbjct: 96 PPPYYGNYGTPPPPNPIVPYFPFYYHTPPQGYSGSAR--SHDSLLFALFAVLLCFV 149
>gi|224130440|ref|XP_002328609.1| predicted protein [Populus trichocarpa]
Length = 172
Score = 154 bits (387), Expect = 3e-035
Identities = 84/155 (54%), Positives = 96/155 (61%), Gaps = 13/155 (8%)
Frame = -2
Query: 755 MEANQLYGLSTLVLILLLSLT--STVTSKDEV---VSCTMCSSCDNPCNPVPSYPPPPPP 591
ME + LS L L++LLS T STV + ++CTMCS+C +PV S PPPPP
Sbjct: 1 METRHKFKLSLLALLMLLSSTKSSTVLPNSRMLYQIACTMCSTCCG-SSPVTSPPPPPP- 58
Query: 590 SRPPPSPSTTTACPPPPSPPSSGGGSSYYYPPPSQSGGD---KYPPPYGDGG-QGYYYPP 423
PPPS +TT+ CPPPPSPP+S G YY PPP PPP DG G YYPP
Sbjct: 59 --PPPSLATTSNCPPPPSPPASPGVGFYYSPPPPPPPSTYTYSSPPPPQDGVIGGTYYPP 116
Query: 422 PYSGNYPTPPPPNPIVPYFPFYYHTPPPGSGSDRF 318
P NYPTPPPPNPIVPYFPFYY+ PPP S S F
Sbjct: 117 PNYKNYPTPPPPNPIVPYFPFYYYIPPPPSTSASF 151
>gi|255556284|ref|XP_002519176.1| nutrient reservoir, putative [Ricinus
communis]
Length = 171
Score = 153 bits (385), Expect = 4e-035
Identities = 82/144 (56%), Positives = 95/144 (65%), Gaps = 17/144 (11%)
Frame = -2
Query: 731 LSTLVLILLLSLTS-TVTSKDEV---VSCTMCSSCDNPCNPVPSYPPPPPPSRPPPSPST 564
LS L +++L TS + K + ++CTMCS+C CNPVPS PPPPPPS PP P++
Sbjct: 9 LSLLAFLVVLPFTSPSPVPKSRMLYQIACTMCSTC---CNPVPS-PPPPPPS--PPPPAS 62
Query: 563 TTACPPPPSPPSSGGGSSYYY--PPPSQSGGDKYPPPYGDGGQG--YYYPPPYS-GNYPT 399
T CPPPPSPPS G SYYY PPP+ PPP G G G YYYPPP NYP
Sbjct: 63 TNNCPPPPSPPSPSG--SYYYSPPPPATYTYSSPPPPQGGNGGGSYYYYPPPADYKNYPA 120
Query: 398 PPPPNPIVPYFPFYYHTPPPGSGS 327
PPPPNPIVPYFPFYY++PPP S S
Sbjct: 121 PPPPNPIVPYFPFYYYSPPPPSMS 144
>gi|255548545|ref|XP_002515329.1| nutrient reservoir, putative [Ricinus
communis]
Length = 154
Score = 137 bits (344), Expect = 3e-030
Identities = 65/116 (56%), Positives = 73/116 (62%), Gaps = 14/116 (12%)
Frame = -2
Query: 653 MCSSCDNPCNPVPSYP------PPPPPSRPPPSPSTTTACPPPPSPPSSGGGSSYYYPPP 492
MCS+CDNPC P+PS P PPPPP PP P+ + CPPPP PSS G+ YY PPP
Sbjct: 1 MCSACDNPCQPLPSPPPPVSTCPPPPPPPSPPPPAIVSDCPPPPVTPSS--GAYYYSPPP 58
Query: 491 SQSGGDKY---PPPYGDGGQGYYYPPPYSGNYPTPPPPNPIVPYFPFYYHTPPPGS 333
Y PPP DGG +YP P G+Y PPPPNPIVPYFPFYY+ PPP S
Sbjct: 59 PAEPTYVYSSPPPPANDGG---FYPSPNYGSYQGPPPPNPIVPYFPFYYYNPPPSS 111
>gi|225678143|gb|EEH16427.1| conserved hypothetical protein [Paracoccidioides
brasiliensis Pb03]
Length = 675
Score = 85 bits (208), Expect = 1e-014
Identities = 43/98 (43%), Positives = 48/98 (48%)
Frame = -2
Query: 623 PVPSYPPPPPPSRPPPSPSTTTACPPPPSPPSSGGGSSYYYPPPSQSGGDKYPPPYGDGG 444
P S P PPPP PPP P+ +TA PPPP PP S PPP SG PPP+
Sbjct: 482 PPVSAPAPPPPPPPPPGPAPSTAVPPPPPPPPPPPSGSAPPPPPPPSGSAPPPPPHPAPS 541
Query: 443 QGYYYPPPYSGNYPTPPPPNPIVPYFPFYYHTPPPGSG 330
+ PPP S +PPPP P P PP GSG
Sbjct: 542 ASHSPPPPLSAPSSSPPPPPPPASGVPPPPPPPPAGSG 579
>gi|115386684|ref|XP_001209883.1| conserved hypothetical protein [Aspergillus
terreus NIH2624]
Length = 600
Score = 80 bits (195), Expect = 5e-013
Identities = 46/108 (42%), Positives = 53/108 (49%), Gaps = 9/108 (8%)
Frame = -2
Query: 623 PVPSYPPPPPP--SRPPPSPSTTTACPPPPSPPSSGGGSSYYYPPPSQSGGDKYPPPYGD 450
P P PPPP P S PPP P ++ PPPP PPSS G ++ PPP+ GG PPP
Sbjct: 421 PPPGPPPPPAPASSVPPPPPPPSSHIPPPPPPPSSTGPTAPPPPPPAPGGGAPPPPPPPP 480
Query: 449 G-GQGYYYPPPYSGNYPTPPPPNPIVPYFPFYYHTPPPGSGSDRFMGS 309
G G PPP +G P PPPP P P P G D M +
Sbjct: 481 GAGAPPPPPPPGAGAPPPPPPPGGAAP------PLPKPAGGRDDLMAA 522
>gi|332025507|gb|EGI65670.1| hypothetical protein G5I_05770 [Acromyrmex
echinatior]
Length = 481
Score = 78 bits (191), Expect = 1e-012
Identities = 43/96 (44%), Positives = 44/96 (45%), Gaps = 3/96 (3%)
Frame = -2
Query: 617 PSY---PPPPPPSRPPPSPSTTTACPPPPSPPSSGGGSSYYYPPPSQSGGDKYPPPYGDG 447
PSY PPPPPP PPP P PPPP PS+ SY PPP PP Y
Sbjct: 44 PSYSYPPPPPPPPPPPPPPPPPPPPPPPPGYPSTQPSYSYPPPPPPPPPPPPSPPGYPST 103
Query: 446 GQGYYYPPPYSGNYPTPPPPNPIVPYFPFYYHTPPP 339
Y YPPP P PPPP P P P PPP
Sbjct: 104 QPSYSYPPPPPPPPPPPPPPPPPPPPPPPPPPPPPP 139
>gi|261190885|ref|XP_002621851.1| cytokinesis protein sepA [Ajellomyces
dermatitidis SLH14081]
Length = 1704
Score = 77 bits (189), Expect = 2e-012
Identities = 45/102 (44%), Positives = 47/102 (46%), Gaps = 7/102 (6%)
Frame = -2
Query: 632 PCNPVPSYPPPPPPSRPPPSPSTTTACPPPPSPPSSGGGSSYYYPPPSQSGG--DKYPPP 459
P P P PPPP PPP P A PPPP PP GG PPP GG PPP
Sbjct: 947 PPPPPPGIGGPPPPPPPPPPPPGVGAPPPPPPPPPGAGGPPPPPPPPPGVGGPPPPPPPP 1006
Query: 458 YGDGGQGYYYPPPYSGNYPTPPPPNPIVPYFPFYYHTPPPGS 333
G GG PPP P PPPP P + P PPPG+
Sbjct: 1007 PGMGGPPLPPPPPPGMGGPPPPPPPPGMRGPP-----PPPGA 1043
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,992,423,490,450
Number of Sequences: 15229318
Number of Extensions: 1992423490450
Number of Successful Extensions: 533126175
Number of sequences better than 0.0: 0