BLASTX 7.6.2
Query= UN22100 /QuerySize=672
(671 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|1184277|gb|AAC37510.1| arabinogalactan protein [Brassica napus] 228 8e-058
gi|1184275|gb|AAC37509.1| arabinogalactan protein [Brassica napus] 225 5e-057
gi|145307801|gb|ABP57237.1| putative arabinogalactan protein [Br... 225 5e-057
gi|297832764|ref|XP_002884264.1| hypothetical protein ARALYDRAFT... 203 2e-050
gi|18395919|ref|NP_566146.1| arabinogalactan protein 11 [Arabido... 185 5e-045
gi|10880499|gb|AAG24279.1|AF195892_1 arabinogalactan protein [Ar... 182 5e-044
gi|4775268|emb|CAB42531.1| AGP6 protein [Arabidopsis thaliana] 132 6e-029
gi|297811557|ref|XP_002873662.1| hypothetical protein ARALYDRAFT... 132 6e-029
gi|15241392|ref|NP_196942.1| arabinogalactan protein 6 [Arabidop... 130 2e-028
gi|186510502|ref|NP_001118720.1| uncharacterized protein [Arabid... 74 2e-011
gi|255567969|ref|XP_002524962.1| copper ion binding protein, put... 72 6e-011
gi|225459205|ref|XP_002285739.1| PREDICTED: hypothetical protein... 67 2e-009
gi|147835211|emb|CAN61253.1| hypothetical protein VITISV_019773 ... 66 3e-009
gi|323449787|gb|EGB05672.1| hypothetical protein AURANDRAFT_7213... 58 1e-006
gi|326431946|gb|EGD77516.1| hypothetical protein PTSG_08614 [Sal... 56 3e-006
>gi|1184277|gb|AAC37510.1| arabinogalactan protein [Brassica napus]
Length = 136
Score = 228 bits (579), Expect = 8e-058
Identities = 123/138 (89%), Positives = 125/138 (90%), Gaps = 4/138 (2%)
Frame = +1
Query: 94 MARQFVVFGLLALVVATAFAAEAPSAAPTASPTKAPA--TKAPAAAPKSSTSASAPKASS 267
MARQFVV LLALVVATAFAAEAPSAAPTASPTKAP TKAPAAAPKSS SASAPKASS
Sbjct: 1 MARQFVVLALLALVVATAFAAEAPSAAPTASPTKAPTTQTKAPAAAPKSS-SASAPKASS 59
Query: 268 PVAEEPTAEDDYTGNSPTESVEGPTVSSPPAPTPEASADGPSSDAPTPGPEVLDGSATNV 447
PVAEEPTAEDDY SP+ES EGPTVSSPPAPTPE DGPSSDAPTPGPEVLDGSATNV
Sbjct: 60 PVAEEPTAEDDYAATSPSESAEGPTVSSPPAPTPEV-VDGPSSDAPTPGPEVLDGSATNV 118
Query: 448 KLSIAGTVAALGFFFFSL 501
KLSIAGTVAA+GFFFFSL
Sbjct: 119 KLSIAGTVAAVGFFFFSL 136
>gi|1184275|gb|AAC37509.1| arabinogalactan protein [Brassica napus]
Length = 136
Score = 225 bits (572), Expect = 5e-057
Identities = 121/138 (87%), Positives = 124/138 (89%), Gaps = 4/138 (2%)
Frame = +1
Query: 94 MARQFVVFGLLALVVATAFAAEAPSAAPTASPTKAPA--TKAPAAAPKSSTSASAPKASS 267
MARQFVV LLALVVATAFAA+APSAAPT SPTKAP TKAPAAAPKSS SASAPKASS
Sbjct: 1 MARQFVVLALLALVVATAFAADAPSAAPTTSPTKAPTTQTKAPAAAPKSS-SASAPKASS 59
Query: 268 PVAEEPTAEDDYTGNSPTESVEGPTVSSPPAPTPEASADGPSSDAPTPGPEVLDGSATNV 447
PVAEEPTAEDDY SP+ES EGPTVSSPPAPTPE DGPSSDAPTPGPEVLDGSATNV
Sbjct: 60 PVAEEPTAEDDYAATSPSESAEGPTVSSPPAPTPEV-VDGPSSDAPTPGPEVLDGSATNV 118
Query: 448 KLSIAGTVAALGFFFFSL 501
KLSIAGTVAA+GFFFFSL
Sbjct: 119 KLSIAGTVAAVGFFFFSL 136
>gi|145307801|gb|ABP57237.1| putative arabinogalactan protein [Brassica rapa
subsp. chinensis]
Length = 136
Score = 225 bits (572), Expect = 5e-057
Identities = 122/138 (88%), Positives = 124/138 (89%), Gaps = 4/138 (2%)
Frame = +1
Query: 94 MARQFVVFGLLALVVATAFAAEAPSAAPTASPTKAPA--TKAPAAAPKSSTSASAPKASS 267
MARQFVV LLALVVATAFAAEAPSAAPTASPTKAP TKAPAAAPKSS SASAPKASS
Sbjct: 1 MARQFVVLALLALVVATAFAAEAPSAAPTASPTKAPTTQTKAPAAAPKSS-SASAPKASS 59
Query: 268 PVAEEPTAEDDYTGNSPTESVEGPTVSSPPAPTPEASADGPSSDAPTPGPEVLDGSATNV 447
PVAEEPTAEDDY SP+ES GPTVSSPPAPTPE DGPSSDAPTPGPEVLDGSATNV
Sbjct: 60 PVAEEPTAEDDYAATSPSESAGGPTVSSPPAPTPEV-VDGPSSDAPTPGPEVLDGSATNV 118
Query: 448 KLSIAGTVAALGFFFFSL 501
KLSIAGTVAA+GFFFFSL
Sbjct: 119 KLSIAGTVAAVGFFFFSL 136
>gi|297832764|ref|XP_002884264.1| hypothetical protein ARALYDRAFT_477345
[Arabidopsis lyrata subsp. lyrata]
Length = 140
Score = 203 bits (516), Expect = 2e-050
Identities = 108/140 (77%), Positives = 116/140 (82%), Gaps = 5/140 (3%)
Frame = +1
Query: 85 KKKMARQFVVFGLLALVVATAFAAEAPSAAPTASPTKAPATKAPAAAPKSSTSASAPKAS 264
KKKMARQFVVF LLAL VATAFAA+APSAAPTASPTKAP TKAPAAAPKS SA+APKAS
Sbjct: 3 KKKMARQFVVFALLALAVATAFAADAPSAAPTASPTKAPTTKAPAAAPKS--SAAAPKAS 60
Query: 265 SPVAEEPTAEDDYTGNSPTESVEGPTVSSPPAPTPEA---SADGPSSDAPTPGPEVLDGS 435
SPVAEEPT+EDDY+ +P++S E PTVSSPPAPTPEA S DGPSSD PT G+
Sbjct: 61 SPVAEEPTSEDDYSAATPSDSAEAPTVSSPPAPTPEADGPSTDGPSSDGPTAAESPKSGA 120
Query: 436 ATNVKLSIAGTVAALGFFFF 495
TNVKLSIAGTVAA GFF F
Sbjct: 121 TTNVKLSIAGTVAAAGFFIF 140
>gi|18395919|ref|NP_566146.1| arabinogalactan protein 11 [Arabidopsis thaliana]
Length = 136
Score = 185 bits (469), Expect = 5e-045
Identities = 102/139 (73%), Positives = 110/139 (79%), Gaps = 6/139 (4%)
Frame = +1
Query: 94 MARQFVVFGLLALVVATAFAAEAPSAAPTASPTKAPATKAPAAAPKSSTSASAPKASSPV 273
MAR FVV LLAL V T FAA+APSAAPTASPTK+P TKAPAAAPKS SA+APKASSPV
Sbjct: 1 MARLFVVVALLALAVGTVFAADAPSAAPTASPTKSP-TKAPAAAPKS--SAAAPKASSPV 57
Query: 274 AEEPTAEDDYTGNSPTESVEGPTVSSPPAPTPEA---SADGPSSDAPTPGPEVLDGSATN 444
AEEPT EDDY+ SP++S E PTVSSPPAPTPEA S+DGPSSD P G+ TN
Sbjct: 58 AEEPTPEDDYSAASPSDSAEAPTVSSPPAPTPEADGPSSDGPSSDGPAAAESPKSGATTN 117
Query: 445 VKLSIAGTVAALGFFFFSL 501
VKLSIAGTVAA GFF FSL
Sbjct: 118 VKLSIAGTVAAAGFFIFSL 136
>gi|10880499|gb|AAG24279.1|AF195892_1 arabinogalactan protein [Arabidopsis
thaliana]
Length = 135
Score = 182 bits (460), Expect = 5e-044
Identities = 100/138 (72%), Positives = 108/138 (78%), Gaps = 6/138 (4%)
Frame = +1
Query: 97 ARQFVVFGLLALVVATAFAAEAPSAAPTASPTKAPATKAPAAAPKSSTSASAPKASSPVA 276
AR FVV LLAL V T FAA+APSAAPTASPTK+P TKAPA APKS SA+APKASSPVA
Sbjct: 1 ARLFVVVALLALAVGTVFAADAPSAAPTASPTKSP-TKAPAVAPKS--SAAAPKASSPVA 57
Query: 277 EEPTAEDDYTGNSPTESVEGPTVSSPPAPTPEA---SADGPSSDAPTPGPEVLDGSATNV 447
EEPT EDDY+ SP++S E PTVSSPPAPTPEA S+DGPSSD P G+ TNV
Sbjct: 58 EEPTPEDDYSAASPSDSAEAPTVSSPPAPTPEADGPSSDGPSSDGPAAAESPKSGATTNV 117
Query: 448 KLSIAGTVAALGFFFFSL 501
KLSIAGTVAA GFF FSL
Sbjct: 118 KLSIAGTVAAAGFFIFSL 135
>gi|4775268|emb|CAB42531.1| AGP6 protein [Arabidopsis thaliana]
Length = 150
Score = 132 bits (330), Expect = 6e-029
Identities = 73/120 (60%), Positives = 89/120 (74%), Gaps = 10/120 (8%)
Frame = +1
Query: 160 APSAAPTASP---TKAPA--TKAPAAAPKSSTSASAPKASSPVAEEPTAEDDYTGNSPTE 324
+P+AAPT +P TKAP+ TKAPAAAPKSS SAS+PKASSP AE P EDDY+ +SP++
Sbjct: 33 SPTAAPTKAPTATTKAPSAPTKAPAAAPKSS-SASSPKASSPAAEGPVPEDDYSASSPSD 91
Query: 325 SVEGPTVSSPPAPTPE--ASADGPSSDAPTPGPEVLDGSATNVKLSIAGTVAALGFFFFS 498
S E PTVSSPPAPTP+ ++ADGPS P+ G+ T K S+ GTVAA+GFFFFS
Sbjct: 92 SAEAPTVSSPPAPTPDSTSAADGPSDGPTAESPK--SGAVTTAKFSVVGTVAAVGFFFFS 149
>gi|297811557|ref|XP_002873662.1| hypothetical protein ARALYDRAFT_488278
[Arabidopsis lyrata subsp. lyrata]
Length = 150
Score = 132 bits (330), Expect = 6e-029
Identities = 73/120 (60%), Positives = 89/120 (74%), Gaps = 10/120 (8%)
Frame = +1
Query: 160 APSAAPT---ASPTKAPA--TKAPAAAPKSSTSASAPKASSPVAEEPTAEDDYTGNSPTE 324
+P+A PT A+PTKAPA TKAPAAAPKSS SAS+PKASSP AE P +DDY+ +SP+
Sbjct: 33 SPTATPTKAPAAPTKAPAAPTKAPAAAPKSS-SASSPKASSPTAEGPVPDDDYSASSPSG 91
Query: 325 SVEGPTVSSPPAPTPE--ASADGPSSDAPTPGPEVLDGSATNVKLSIAGTVAALGFFFFS 498
S E PTVSSPPAPTP+ ++ADGPS P+ G+ T KLS+ GT+AA+GFFFFS
Sbjct: 92 SAEAPTVSSPPAPTPDSTSAADGPSDGPTAESPK--SGAVTTAKLSVVGTIAAVGFFFFS 149
>gi|15241392|ref|NP_196942.1| arabinogalactan protein 6 [Arabidopsis thaliana]
Length = 150
Score = 130 bits (326), Expect = 2e-028
Identities = 72/120 (60%), Positives = 88/120 (73%), Gaps = 10/120 (8%)
Frame = +1
Query: 160 APSAAPTASP---TKAPA--TKAPAAAPKSSTSASAPKASSPVAEEPTAEDDYTGNSPTE 324
+P+AAPT +P TKAP+ TKAPAAAPKSS SAS+PKASSP AE P EDDY+ +SP++
Sbjct: 33 SPTAAPTKAPTATTKAPSAPTKAPAAAPKSS-SASSPKASSPAAEGPVPEDDYSASSPSD 91
Query: 325 SVEGPTVSSPPAPTPE--ASADGPSSDAPTPGPEVLDGSATNVKLSIAGTVAALGFFFFS 498
S E PTVSSPPAPTP+ ++ADGPS P+ G+ T K S+ GTVA +GFFFFS
Sbjct: 92 SAEAPTVSSPPAPTPDSTSAADGPSDGPTAESPK--SGAVTTAKFSVVGTVATVGFFFFS 149
>gi|186510502|ref|NP_001118720.1| uncharacterized protein [Arabidopsis
thaliana]
Length = 171
Score = 74 bits (179), Expect = 2e-011
Identities = 48/127 (37%), Positives = 69/127 (54%), Gaps = 8/127 (6%)
Frame = +1
Query: 139 ATAFAAEAPSA-APTASPTKAP--ATKAPAAAPKSSTSASAPKASS--PVAEEPTAEDDY 303
A A + ++P+A APT+ PT AP A K P S S+ +PK+SS A P +
Sbjct: 44 AAAASPKSPTASAPTSPPTAAPTMAKKNSTGTPSPSPSSPSPKSSSAKTPASSPDSSSGD 103
Query: 304 TGNSPTESVEGPTVSSPPAPTPEASADGPSSDAPTPGPEVL--DGSATNVKLSIAGTV-A 474
+ PT S + PT SSPPAPTPE S + GPE A+++ +S++G+V A
Sbjct: 104 SSEGPTSSSDAPTASSPPAPTPEMSPSSDDGTGASDGPEASAPAAGASSLVISVSGSVLA 163
Query: 475 ALGFFFF 495
A+ + FF
Sbjct: 164 AVAWLFF 170
>gi|255567969|ref|XP_002524962.1| copper ion binding protein, putative [Ricinus
communis]
Length = 238
Score = 72 bits (175), Expect = 6e-011
Identities = 47/118 (39%), Positives = 57/118 (48%), Gaps = 3/118 (2%)
Frame = +1
Query: 127 ALVVATAFAAEAPSAAPTASPTKAPATKAPAAAPKSSTSASAPKASSPVAEEPTAEDDYT 306
A A + A PS AP A+ T PAT AP AP + +AP A+ A P+ +
Sbjct: 113 AATPAKSPAKSPPSPAPVAA-TPPPATSAPETAPPAPVPVAAPTAADVPAPTPSKKKPKK 171
Query: 307 GNSPTESVEGPTVSSPPAPTPEASADGPSSDAPTPGPEVLDGSATNVKLSIAGTVAAL 480
+ T GP VSSPPAP E A GPS DA +PGP V D S S+ V L
Sbjct: 172 HSHATSPAPGPDVSSPPAPPME--APGPSLDASSPGPSVADDSGAETIRSLQKMVGGL 227
>gi|225459205|ref|XP_002285739.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 138
Score = 67 bits (162), Expect = 2e-009
Identities = 45/135 (33%), Positives = 65/135 (48%), Gaps = 13/135 (9%)
Frame = +1
Query: 94 MARQFVVFGLLALVVATAFAAEAPSAAPTASPTKAPATKAPAAAPKSSTSASAPKASSPV 273
MA VV ++ +VA + A++P+++PTASPTK+P P A P S T + + S+P
Sbjct: 1 MAYSSVVLVMMFALVAGSAFAQSPASSPTASPTKSPTASPPVATPPSPTPSPSTTPSAPA 60
Query: 274 AEEPTAEDDYTGNSPTESVEGPTVSSPPAP-----TPEASADGPSSDAPTPGPEVLDGSA 438
T SP S PT SPPAP +P + PS TP E SA
Sbjct: 61 PAPSTV-------SPPASTPSPTSGSPPAPPTSPASPPTTGASPSPSISTPPTEPPSPSA 113
Query: 439 TNVK-LSIAGTVAAL 480
+ ++ G+ AA+
Sbjct: 114 AALNTVTFTGSAAAV 128
>gi|147835211|emb|CAN61253.1| hypothetical protein VITISV_019773 [Vitis
vinifera]
Length = 138
Score = 66 bits (160), Expect = 3e-009
Identities = 40/110 (36%), Positives = 58/110 (52%), Gaps = 9/110 (8%)
Frame = +1
Query: 94 MARQFVVFGLLALVVATAFAAEAPSAAPTASPTKAPATKAPAAAPKSSTSASAPKASSPV 273
MA VV ++ +VA + A++P+++PTASPTK+P P A P S T + + S+P
Sbjct: 1 MAYSSVVLVMMFALVAGSAFAQSPASSPTASPTKSPTASPPVATPPSPTPSPSTTPSAPA 60
Query: 274 AEEPTAEDDYTGNSPTESVEGPTVSSPPAPTPEASADGPSSDAPTPGPEV 423
T SP S PT SPPAP P + A P++ A +P P +
Sbjct: 61 PAPSTV-------SPPASTPSPTSGSPPAP-PTSPASAPTTGA-SPSPSI 101
>gi|323449787|gb|EGB05672.1| hypothetical protein AURANDRAFT_72138 [Aureococcus
anophagefferens]
Length = 5032
Score = 58 bits (138), Expect = 1e-006
Identities = 39/117 (33%), Positives = 50/117 (42%), Gaps = 8/117 (6%)
Frame = +1
Query: 151 AAEAPSAAPTASPTKAPATKAPAAAPKSSTSASAPKAS------SPVAEEPTAEDDYTGN 312
A AP+AAPTA+PT P T +P AAP + T AP + +P PTA T
Sbjct: 2805 ATPAPTAAPTAAPTGTPTT-SPTAAP-TGTPTQAPTVTPGEPTLAPATPAPTAAPSVTPT 2862
Query: 313 SPTESVEGPTVSSPPAPTPEASADGPSSDAPTPGPEVLDGSATNVKLSIAGTVAALG 483
P S P P P + P+ TP P +A + S+A T A G
Sbjct: 2863 PAPSVTPTPAPSVSPTPAPTVTPGSPTLAPATPAPTAAPTAAPTLAPSVAPTAAPTG 2919
>gi|326431946|gb|EGD77516.1| hypothetical protein PTSG_08614 [Salpingoeca sp.
ATCC 50818]
Length = 516
Score = 56 bits (134), Expect = 3e-006
Identities = 29/81 (35%), Positives = 37/81 (45%)
Frame = +1
Query: 175 PTASPTKAPATKAPAAAPKSSTSASAPKASSPVAEEPTAEDDYTGNSPTESVEGPTVSSP 354
PT +P AP PAA P S + P S P A P ++ +G P S PT + P
Sbjct: 186 PTTAPPAAPPAAPPAAKPPSGPPPAGPPPSKPPAGPPPSKPAPSGPPPPPSGPPPTTAPP 245
Query: 355 PAPTPEASADGPSSDAPTPGP 417
P P P++ APTP P
Sbjct: 246 PPKAPPGPPPPPATSAPTPPP 266
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,672,520,567,077
Number of Sequences: 15229318
Number of Extensions: 2672520567077
Number of Successful Extensions: 649898909
Number of sequences better than 0.0: 0