BLASTX 7.6.2
Query= UN51201 /QuerySize=1265
(1264 letters)
Database: GenBank nr
15,229,318 sequences; 5,219,829,378 total letters
                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value
gi|15217879|ref|NP_174150.1| arabinogalactan protein 31 [Arabido... 605 6e-171
gi|297851274|ref|XP_002893518.1| hypothetical protein ARALYDRAFT... 434 1e-119
gi|110738350|dbj|BAF01102.1| putative proline-rich protein [Arab... 428 1e-117
gi|145324054|ref|NP_001077616.1| arabinogalactan protein 31 [Ara... 426 4e-117
gi|15226166|ref|NP_180935.1| arabinogalactan protein 30 [Arabido... 258 1e-066
>gi|15217879|ref|NP_174150.1| arabinogalactan protein 31 [Arabidopsis thaliana]
Length = 359
Score = 605 bits (1559), Expect = 6e-171
Identities = 277/359 (77%), Positives = 298/359 (83%), Gaps = 8/359 (2%)
Frame = -2
Query: 1263 G*SVLLSLLALWCFTSCAFTEEVNHVTQTPSSAPAPSPYHHGHHHPHPPHHHHPHPHPPA 1084
G SVL+SL+ALWCFTS FTEEVNH TQTPS APAP+PYHHGHHHPHPPHHHHPHPHP
Sbjct: 5 GKSVLVSLVALWCFTSSVFTEEVNHKTQTPSLAPAPAPYHHGHHHPHPPHHHHPHPHPHP 64
Query: 1083 KAPVKPPVSPPSKPPVKPPVYPPTKSPVKPPTKPPVKPPVSPPAKPPVKPPVYPPTKAPV 904
P K PV KPPVK PV PP K PVKPP PP K PV PP KPPVKPPV PP K PV
Sbjct: 65 HPPAKSPV----KPPVKAPVSPPAKPPVKPPVYPPTKAPVKPPTKPPVKPPVSPPAKPPV 120
Query: 903 KPPVKPPVKPPVSPPAKPPVKPPVYPPTKAPVKPPVKPPVKPPVSPPAKPPVKPPVYPPT 724
KPPV PP K PV PP KPPVKPPVYPPTKAPVKPP KPPVKPPV PP K PVKPP PP
Sbjct: 121 KPPVYPPTKAPVKPPTKPPVKPPVYPPTKAPVKPPTKPPVKPPVYPPTKAPVKPPTKPPV 180
Query: 723 KAPVKPPTKAPVKPPVSP----PAKPPVSPPAKPPVRPPVYPPKFNRSLVAVQGTVFCKS 556
K PV PP K PVKPPV P P KPPVSPP KPPV PPVYPPKFNRSLVAV+GTV+CKS
Sbjct: 181 KPPVSPPAKPPVKPPVYPPTKAPVKPPVSPPTKPPVTPPVYPPKFNRSLVAVRGTVYCKS 240
Query: 555 CQYASSDSLIGAKPVEGAVVRLLCKSKKNIVAETKTDKNGYFLLLGPKTVTNYGFRGCRV 376
C+YA+ ++L+GAKP+EGA V+L+CKSKKNI AET TDKNGYFLLL PKTVTN+GFRGCRV
Sbjct: 241 CKYAAFNTLLGAKPIEGATVKLVCKSKKNITAETTTDKNGYFLLLAPKTVTNFGFRGCRV 300
Query: 375 YLVKSKDYKCNKVSKLFGGDVGAVLKPEKRKGKSAVVINQLIYGIFNVGPFAFDPVCPK 199
YLVKSKDYKC+KVSKLFGGDVGA LKPEK+ GKS VV+N+L+YG+FNVGPFAF+P CPK
Sbjct: 301 YLVKSKDYKCSKVSKLFGGDVGAELKPEKKLGKSTVVVNKLVYGLFNVGPFAFNPSCPK 359
>gi|297851274|ref|XP_002893518.1| hypothetical protein ARALYDRAFT_473038
[Arabidopsis lyrata subsp. lyrata]
Length = 315
Score = 434 bits (1116), Expect = 1e-119
Identities = 206/286 (72%), Positives = 225/286 (78%), Gaps = 5/286 (1%)
Frame = -2
Query: 1053 PSKPPVKPPVYPPTKSPVKP-PTKPPVKPPVSPPAKPPVKPPVYPPTKAPVKPPVKPPVK 877
PS P P + P P P P PPAK PVKPPV KAPV PP KPPVK
Sbjct: 34 PSLAPAPAPYHHGHHHPHPPHHHHPHPHPHPHPPAKSPVKPPV----KAPVSPPAKPPVK 89
Query: 876 PPVSPPAKPPVKPPVYPPTKAPVKPPVKPPVKPPVSPPAKPPVKPPVYPPTKAPVKPPTK 697
PPV PP K PVKPP PP K PV PP KPPVKPPV PP K PVKPP PP K PV PPTK
Sbjct: 90 PPVYPPTKAPVKPPTKPPVKPPVSPPAKPPVKPPVYPPTKAPVKPPTKPPVKPPVYPPTK 149
Query: 696 APVKPPVSPPAKPPVSPPAKPPVRPPVYPPKFNRSLVAVQGTVFCKSCQYASSDSLIGAK 517
APVKPP PP KPPVSPPAKPPV+PPVYPPKFNRSLVAV+GTV+CKSC+YA+ ++L+GAK
Sbjct: 150 APVKPPTKPPVKPPVSPPAKPPVKPPVYPPKFNRSLVAVRGTVYCKSCKYAAFNTLLGAK 209
Query: 516 PVEGAVVRLLCKSKKNIVAETKTDKNGYFLLLGPKTVTNYGFRGCRVYLVKSKDYKCNKV 337
P+EGA V+L+CKSKKNI AET TDKNGYFLLL PKTVTN+GFRGCRVYLVKSKDYKC+KV
Sbjct: 210 PIEGATVKLVCKSKKNITAETLTDKNGYFLLLAPKTVTNFGFRGCRVYLVKSKDYKCSKV 269
Query: 336 SKLFGGDVGAVLKPEKRKGKSAVVINQLIYGIFNVGPFAFDPVCPK 199
SKLFGGDVGA LKPE+R GK VV+N+L YG+FNVGPFAF+P CPK
Sbjct: 270 SKLFGGDVGAELKPERRPGKGTVVVNKLTYGLFNVGPFAFNPTCPK 315
Score = 322 bits (824), Expect = 1e-085
Identities = 143/179 (79%), Positives = 146/179 (81%), Gaps = 4/179 (2%)
Frame = -2
Query: 1263 G*SVLLSLLALWCFTSCAFTEEVNHVTQTPSSAPAPSPYHHGHHHPHPPHHHHPHPHPPA 1084
G SVL+SL+ALWCFTS FTEEVNHVTQTPS APAP+PYHHGHHHPHPPHHHHPHPHP
Sbjct: 5 GKSVLVSLVALWCFTSSVFTEEVNHVTQTPSLAPAPAPYHHGHHHPHPPHHHHPHPHPHP 64
Query: 1083 KAPVKPPVSPPSKPPVKPPVYPPTKSPVKPPTKPPVKPPVSPPAKPPVKPPVYPPTKAPV 904
P K PV KPPVK PV PP K PVKPP PP K PV PP KPPVKPPV PP K PV
Sbjct: 65 HPPAKSPV----KPPVKAPVSPPAKPPVKPPVYPPTKAPVKPPTKPPVKPPVSPPAKPPV 120
Query: 903 KPPVKPPVKPPVSPPAKPPVKPPVYPPTKAPVKPPVKPPVKPPVSPPAKPPVKPPVYPP 727
KPPV PP K PV PP KPPVKPPVYPPTKAPVKPP KPPVKPPVSPPAKPPVKPPVYPP
Sbjct: 121 KPPVYPPTKAPVKPPTKPPVKPPVYPPTKAPVKPPTKPPVKPPVSPPAKPPVKPPVYPP 179
Score = 221 bits (561), Expect = 3e-055
Identities = 104/142 (73%), Positives = 105/142 (73%), Gaps = 1/142 (0%)
Frame = -2
Query: 1089 PAKAPVKPPVSPPSKP-PVKPPVYPPTKSPVKPPTKPPVKPPVSPPAKPPVKPPVYPPTK 913
PA AP P P P +P P K P KPPVK PVSPPAKPPVKPPVYPPTK
Sbjct: 38 PAPAPYHHGHHHPHPPHHHHPHPHPHPHPPAKSPVKPPVKAPVSPPAKPPVKPPVYPPTK 97
Query: 912 APVKPPVKPPVKPPVSPPAKPPVKPPVYPPTKAPVKPPVKPPVKPPVSPPAKPPVKPPVY 733
APVKPP KPPVKPPVSPPAKPPVKPPVYPPTKAPVKPP KPPVKPPV PP K PVKPP
Sbjct: 98 APVKPPTKPPVKPPVSPPAKPPVKPPVYPPTKAPVKPPTKPPVKPPVYPPTKAPVKPPTK 157
Query: 732 PPTKAPVKPPTKAPVKPPVSPP 667
PP K PV PP K PVKPPV PP
Sbjct: 158 PPVKPPVSPPAKPPVKPPVYPP 179
>gi|110738350|dbj|BAF01102.1| putative proline-rich protein [Arabidopsis
thaliana]
Length = 315
Score = 428 bits (1100), Expect = 1e-117
Identities = 204/286 (71%), Positives = 223/286 (77%), Gaps = 5/286 (1%)
Frame = -2
Query: 1053 PSKPPVKPPVYPPTKSPVKP-PTKPPVKPPVSPPAKPPVKPPVYPPTKAPVKPPVKPPVK 877
PS P P + P P P P PPAK PVKPPV KAPV PP KPPVK
Sbjct: 34 PSLAPAPAPYHHGHHHPHPPHHHHPHPHPHPHPPAKSPVKPPV----KAPVSPPAKPPVK 89
Query: 876 PPVSPPAKPPVKPPVYPPTKAPVKPPVKPPVKPPVSPPAKPPVKPPVYPPTKAPVKPPTK 697
PPV PP K PVKPP PP K PV PP KPPVKPPV PP K PVKPP PP K PV PPTK
Sbjct: 90 PPVYPPTKAPVKPPTKPPVKPPVSPPAKPPVKPPVYPPTKAPVKPPTKPPVKPPVYPPTK 149
Query: 696 APVKPPVSPPAKPPVSPPAKPPVRPPVYPPKFNRSLVAVQGTVFCKSCQYASSDSLIGAK 517
APV PP P KPPVSPP KPPV PPVYPPKFNRSLVAV+GTV+CKSC+YA+ ++L+GAK
Sbjct: 150 APVYPPTKAPVKPPVSPPTKPPVTPPVYPPKFNRSLVAVRGTVYCKSCKYAAFNTLLGAK 209
Query: 516 PVEGAVVRLLCKSKKNIVAETKTDKNGYFLLLGPKTVTNYGFRGCRVYLVKSKDYKCNKV 337
P+EGA V+L+CKSKKNI AET TDKNGYFLLL PKTVTN+GFRGCRVYLVKSKDYKC+KV
Sbjct: 210 PIEGATVKLVCKSKKNITAETTTDKNGYFLLLAPKTVTNFGFRGCRVYLVKSKDYKCSKV 269
Query: 336 SKLFGGDVGAVLKPEKRKGKSAVVINQLIYGIFNVGPFAFDPVCPK 199
SKLFGGDVGA LKPEK+ GKS VV+N+L+YG+FNVGPFAF+P CPK
Sbjct: 270 SKLFGGDVGAELKPEKKLGKSTVVVNKLVYGLFNVGPFAFNPSCPK 315
Score = 310 bits (793), Expect = 4e-082
Identities = 138/179 (77%), Positives = 141/179 (78%), Gaps = 4/179 (2%)
Frame = -2
Query: 1263 G*SVLLSLLALWCFTSCAFTEEVNHVTQTPSSAPAPSPYHHGHHHPHPPHHHHPHPHPPA 1084
G SVL+SL+ALWCFTS FTEEVNH TQTPS APAP+PYHHGHHHPHPPHHHHPHPHP
Sbjct: 5 GKSVLVSLVALWCFTSSVFTEEVNHKTQTPSLAPAPAPYHHGHHHPHPPHHHHPHPHPHP 64
Query: 1083 KAPVKPPVSPPSKPPVKPPVYPPTKSPVKPPTKPPVKPPVSPPAKPPVKPPVYPPTKAPV 904
P K PV KPPVK PV PP K PVKPP PP K PV PP KPPVKPPV PP K PV
Sbjct: 65 HPPAKSPV----KPPVKAPVSPPAKPPVKPPVYPPTKAPVKPPTKPPVKPPVSPPAKPPV 120
Query: 903 KPPVKPPVKPPVSPPAKPPVKPPVYPPTKAPVKPPVKPPVKPPVSPPAKPPVKPPVYPP 727
KPPV PP K PV PP KPPVKPPVYPPTKAPV PP K PVKPPVSPP KPPV PPVYPP
Sbjct: 121 KPPVYPPTKAPVKPPTKPPVKPPVYPPTKAPVYPPTKAPVKPPVSPPTKPPVTPPVYPP 179
>gi|145324054|ref|NP_001077616.1| arabinogalactan protein 31 [Arabidopsis
thaliana]
Length = 315
Score = 426 bits (1095), Expect = 4e-117
Identities = 203/286 (70%), Positives = 222/286 (77%), Gaps = 5/286 (1%)
Frame = -2
Query: 1053 PSKPPVKPPVYPPTKSPVKP-PTKPPVKPPVSPPAKPPVKPPVYPPTKAPVKPPVKPPVK 877
PS P P + P P P P PPAK PVKPPV KAPV PP KPPVK
Sbjct: 34 PSLAPAPAPYHHGHHHPHPPHHHHPHPHPHPHPPAKSPVKPPV----KAPVSPPAKPPVK 89
Query: 876 PPVSPPAKPPVKPPVYPPTKAPVKPPVKPPVKPPVSPPAKPPVKPPVYPPTKAPVKPPTK 697
PPV PP K PVKPP PP K PV PP KPPVKPPV PP K PVKPP PP K PV PPTK
Sbjct: 90 PPVYPPTKAPVKPPTKPPVKPPVSPPAKPPVKPPVYPPTKAPVKPPTKPPVKPPVYPPTK 149
Query: 696 APVKPPVSPPAKPPVSPPAKPPVRPPVYPPKFNRSLVAVQGTVFCKSCQYASSDSLIGAK 517
PV PP P KPPVSPP KPPV PPVYPPKFNRSLVAV+GTV+CKSC+YA+ ++L+GAK
Sbjct: 150 PPVYPPTKAPVKPPVSPPTKPPVTPPVYPPKFNRSLVAVRGTVYCKSCKYAAFNTLLGAK 209
Query: 516 PVEGAVVRLLCKSKKNIVAETKTDKNGYFLLLGPKTVTNYGFRGCRVYLVKSKDYKCNKV 337
P+EGA V+L+CKSKKNI AET TDKNGYFLLL PKTVTN+GFRGCRVYLVKSKDYKC+KV
Sbjct: 210 PIEGATVKLVCKSKKNITAETTTDKNGYFLLLAPKTVTNFGFRGCRVYLVKSKDYKCSKV 269
Query: 336 SKLFGGDVGAVLKPEKRKGKSAVVINQLIYGIFNVGPFAFDPVCPK 199
SKLFGGDVGA LKPEK+ GKS VV+N+L+YG+FNVGPFAF+P CPK
Sbjct: 270 SKLFGGDVGAELKPEKKLGKSTVVVNKLVYGLFNVGPFAFNPSCPK 315
Score = 308 bits (788), Expect = 1e-081
Identities = 137/179 (76%), Positives = 140/179 (78%), Gaps = 4/179 (2%)
Frame = -2
Query: 1263 G*SVLLSLLALWCFTSCAFTEEVNHVTQTPSSAPAPSPYHHGHHHPHPPHHHHPHPHPPA 1084
G SVL+SL+ALWCFTS FTEEVNH TQTPS APAP+PYHHGHHHPHPPHHHHPHPHP
Sbjct: 5 GKSVLVSLVALWCFTSSVFTEEVNHKTQTPSLAPAPAPYHHGHHHPHPPHHHHPHPHPHP 64
Query: 1083 KAPVKPPVSPPSKPPVKPPVYPPTKSPVKPPTKPPVKPPVSPPAKPPVKPPVYPPTKAPV 904
P K PV KPPVK PV PP K PVKPP PP K PV PP KPPVKPPV PP K PV
Sbjct: 65 HPPAKSPV----KPPVKAPVSPPAKPPVKPPVYPPTKAPVKPPTKPPVKPPVSPPAKPPV 120
Query: 903 KPPVKPPVKPPVSPPAKPPVKPPVYPPTKAPVKPPVKPPVKPPVSPPAKPPVKPPVYPP 727
KPPV PP K PV PP KPPVKPPVYPPTK PV PP K PVKPPVSPP KPPV PPVYPP
Sbjct: 121 KPPVYPPTKAPVKPPTKPPVKPPVYPPTKPPVYPPTKAPVKPPVSPPTKPPVTPPVYPP 179
Score = 224 bits (570), Expect = 3e-056
Identities = 110/160 (68%), Positives = 113/160 (70%), Gaps = 5/160 (3%)
Frame = -2
Query: 1089 PAKAPVKPPVSPPSKP-PVKPPVYPPTKSPVKPPTKPPVKPPVSPPAKPPVKPPVYPPTK 913
PA AP P P P +P P K P KPPVK PVSPPAKPPVKPPVYPPTK
Sbjct: 38 PAPAPYHHGHHHPHPPHHHHPHPHPHPHPPAKSPVKPPVKAPVSPPAKPPVKPPVYPPTK 97
Query: 912 APVKPPVKPPVKPPVSPPAKPPVKPPVYPPTKAPVKPPVKPPVKPPVSPPAKPPVKPPVY 733
APVKPP KPPVKPPVSPPAKPPVKPPVYPPTKAPVKPP KPPVKPPV PP KPPVY
Sbjct: 98 APVKPPTKPPVKPPVSPPAKPPVKPPVYPPTKAPVKPPTKPPVKPPV----YPPTKPPVY 153
Query: 732 PPTKAPVKPPTKAPVKPPVSPPAKPPVSPPAKPPVRPPVY 613
PPTKAPVKPP P KPPV+PP PP + VR VY
Sbjct: 154 PPTKAPVKPPVSPPTKPPVTPPVYPPKFNRSLVAVRGTVY 193
>gi|15226166|ref|NP_180935.1| arabinogalactan protein 30 [Arabidopsis thaliana]
Length = 239
Score = 258 bits (659), Expect = 1e-066
Identities = 118/198 (59%), Positives = 149/198 (75%), Gaps = 4/198 (2%)
Frame = -2
Query: 792 PPVKPPVSPPAKPPVKPPVYPPTKAPVKPPTKAPVKPPVSPPAKPPVSPPAKPPVRPPVY 613
PP+K P PPAK P+K P YPP KAP+K PT P K P+ K P PP KPPV PPVY
Sbjct: 46 PPIKLPTLPPAKAPIKLPAYPPAKAPIKLPTLPPAKAPI----KLPTLPPIKPPVLPPVY 101
Query: 612 PPKFNRSLVAVQGTVFCKSCQYASSDSLIGAKPVEGAVVRLLCKSKKNIVAETKTDKNGY 433
PPK+N++LVAV+G V+CK+C+YA +++ GAKPV+ AVVRL+CK+KKN ++ETKTDKNGY
Sbjct: 102 PPKYNKTLVAVRGVVYCKACKYAGVNNVQGAKPVKDAVVRLVCKNKKNSISETKTDKNGY 161
Query: 432 FLLLGPKTVTNYGFRGCRVYLVKSKDYKCNKVSKLFGGDVGAVLKPEKRKGKSAVVINQL 253
F+LL PKTVTNY +GCR +LVKS D KC+KVS L G G+VLKP + G S+ ++
Sbjct: 162 FMLLAPKTVTNYDIKGCRAFLVKSPDTKCSKVSSLHDGGKGSVLKPVLKPGFSSTIMRWF 221
Query: 252 IYGIFNVGPFAFDPVCPK 199
Y ++NVGPFAF+P CPK
Sbjct: 222 KYSVYNVGPFAFEPTCPK 239
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,509,288,697,968
Number of Sequences: 15229318
Number of Extensions: 5509288697968
Number of Successful Extensions: 1289383025
Number of sequences better than 0.0: 0