BLASTX 7.6.2
Query= RU02129 /QuerySize=1411
(1410 letters)
Database: TAIR9 protein;
33,410 sequences; 13,468,323 total letters
Score E
Sequences producing significant alignments: (bits) Value
TAIR9_protein||AT2G34510.1 | Symbols: | FUNCTIONS IN: molecular... 559 2e-159
TAIR9_protein||AT1G29980.2 | Symbols: | INVOLVED IN: biological... 421 7e-118
TAIR9_protein||AT1G29980.1 | Symbols: | INVOLVED IN: biological... 421 7e-118
TAIR9_protein||AT4G32460.1 | Symbols: | FUNCTIONS IN: molecular... 364 1e-100
TAIR9_protein||AT4G32460.2 | Symbols: | FUNCTIONS IN: molecular... 364 1e-100
TAIR9_protein||AT5G11420.1 | Symbols: | FUNCTIONS IN: molecular... 364 1e-100
TAIR9_protein||AT5G25460.1 | Symbols: | unknown protein | chr5:... 361 6e-100
TAIR9_protein||AT1G80240.1 | Symbols: | unknown protein | chr1:... 339 3e-093
TAIR9_protein||AT2G41810.1 | Symbols: | FUNCTIONS IN: molecular... 309 4e-084
TAIR9_protein||AT3G08030.1 | Symbols: | unknown protein | chr3:... 307 1e-083
TAIR9_protein||AT2G41800.1 | Symbols: | FUNCTIONS IN: molecular... 305 7e-083
TAIR9_protein||AT3G08030.2 | Symbols: | unknown protein | chr3:... 291 1e-078
TAIR9_protein||AT5G14150.1 | Symbols: | unknown protein | chr5:... 134 2e-031
>TAIR9_protein||AT2G34510.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN: anchored
to membrane; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 14
growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function
DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT1G29980.1); Has 177 Blast hits to 163 proteins
in 15 species: Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0;
Plants - 167; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr2:14544114-14546732 REVERSE
Length = 402
Score = 559 bits (1440), Expect = 2e-159
Identities = 265/343 (77%), Positives = 305/343 (88%), Gaps = 5/343 (1%)
Frame = +3
Query: 150 DGPLPNGDFEATPTGGFSSDAIVDGPTGIPSWKSNGTVEVVEAGQKQGGMLLIVPQGKHA 329
DG + NGDFE P+ GF DAI++ + IPSW+S+GTVE++++GQKQGGM+LIVP+G+HA
Sbjct: 38 DGLVVNGDFETPPSNGFPDDAIIEDTSEIPSWRSDGTVELIKSGQKQGGMILIVPEGRHA 97
Query: 330 VRLGNDAEVSQDVKVEKGSIYSVTFSAARTCAQLESLNVSVLP-----ASQTIDLQTLYS 494
VRLGNDAE+SQ++ VEKGSIYSVTFSAARTCAQLESLNVSV ASQTIDLQT+YS
Sbjct: 98 VRLGNDAEISQELTVEKGSIYSVTFSAARTCAQLESLNVSVASSDEPIASQTIDLQTVYS 157
Query: 495 VQGWDPYAWAFEAEEDDVKLVFRNPGMEDDPTCGPIIDDVAIKKLFTPDKPKNNAVVNGD 674
VQGWDPYAWAFEA D V+LVF+NPGMEDDPTCGPIIDD+A+KKLFTPDKPK NAV+NGD
Sbjct: 158 VQGWDPYAWAFEAVVDRVRLVFKNPGMEDDPTCGPIIDDIAVKKLFTPDKPKGNAVINGD 217
Query: 675 FEEGPWMLANVSLGVLLPTNLDEETSSLPGWNVESNRAVRYIDSYHFSVPQGKRAIELLS 854
FEEGPWM N +LGVLLPTNLDEE SSLPGW VESNRAVR+IDS HFSVP+GKRA+ELLS
Sbjct: 218 FEEGPWMFRNTTLGVLLPTNLDEEISSLPGWTVESNRAVRFIDSDHFSVPEGKRALELLS 277
Query: 855 GKEGIISQMVETKPNKPYTLTFSLGHANDKCKQPLAVMAFAGDQAQNVHYTPNSNSTFQS 1034
GKEGIISQMVETK N PY ++FSLGHA DKCK+PLAVMAFAGDQAQN HY +NS+F+
Sbjct: 278 GKEGIISQMVETKANIPYKMSFSLGHAGDKCKEPLAVMAFAGDQAQNFHYMAQANSSFER 337
Query: 1035 ANVNFTAKAERSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWF 1163
+ +NFTAKAER+RIAFYS+YYNTRTDDM+SLCGPV+DDV+VWF
Sbjct: 338 SELNFTAKAERTRIAFYSIYYNTRTDDMTSLCGPVIDDVKVWF 380
>TAIR9_protein||AT1G29980.2 | Symbols: | INVOLVED IN: biological_process
unknown; LOCATED IN: plasma membrane, anchored to membrane; EXPRESSED
IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS
InterPro DOMAIN/s: Protein of unknown function DUF642
(InterPro:IPR006946), Galactose-binding like (InterPro:IPR008979); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G34510.1); Has 174 Blast hits to 163 proteins in 15 species:
Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0; Plants - 164;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr1:10503411-10504617 REVERSE
Length = 372
Score = 421 bits (1081), Expect = 7e-118
Identities = 193/235 (82%), Positives = 218/235 (92%)
Frame = +3
Query: 459 ASQTIDLQTLYSVQGWDPYAWAFEAEEDDVKLVFRNPGMEDDPTCGPIIDDVAIKKLFTP 638
AS+ +DLQTLYSVQGWDPYAWAFEAE+D V+LVF+NPGMEDDPTCGPIIDD+AIKKLFTP
Sbjct: 114 ASRNVDLQTLYSVQGWDPYAWAFEAEDDHVRLVFKNPGMEDDPTCGPIIDDIAIKKLFTP 173
Query: 639 DKPKNNAVVNGDFEEGPWMLANVSLGVLLPTNLDEETSSLPGWNVESNRAVRYIDSYHFS 818
DKPK+NAV+NGDFE+GPWM N SLGVLLPTNLDEE SSLPGW VESNRAVR++DS HFS
Sbjct: 174 DKPKDNAVINGDFEDGPWMFRNTSLGVLLPTNLDEEISSLPGWTVESNRAVRFVDSDHFS 233
Query: 819 VPQGKRAIELLSGKEGIISQMVETKPNKPYTLTFSLGHANDKCKQPLAVMAFAGDQAQNV 998
VP+GKRA+ELLSGKEGIISQMVETK +KPY L+FSLGHA DKCK+PLA+MAFAGDQAQN
Sbjct: 234 VPKGKRAVELLSGKEGIISQMVETKADKPYILSFSLGHAGDKCKEPLAIMAFAGDQAQNF 293
Query: 999 HYTPNSNSTFQSANVNFTAKAERSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWF 1163
HY +NS+F+ A +NFTAKA+R+R+AFYSVYYNTRTDDMSSLCGPV+DDVRVWF
Sbjct: 294 HYMAQANSSFEKAGLNFTAKADRTRVAFYSVYYNTRTDDMSSLCGPVIDDVRVWF 348
Score = 162 bits (409), Expect = 6e-040
Identities = 75/110 (68%), Positives = 91/110 (82%)
Frame = +3
Query: 147 LDGPLPNGDFEATPTGGFSSDAIVDGPTGIPSWKSNGTVEVVEAGQKQGGMLLIVPQGKH 326
+ G + NGDFE +P+ GF D + DGP+ IPSWKSNGTVE++ +GQKQGGM+LIVPQG+H
Sbjct: 1 MTGLVINGDFETSPSSGFPDDGVTDGPSDIPSWKSNGTVELINSGQKQGGMILIVPQGRH 60
Query: 327 AVRLGNDAEVSQDVKVEKGSIYSVTFSAARTCAQLESLNVSVLPASQTID 476
AVRLGNDAE+SQD+ VEKG +YSVTFSAARTCAQLES+NVSV + D
Sbjct: 61 AVRLGNDAEISQDLTVEKGFVYSVTFSAARTCAQLESINVSVASVNADAD 110
>TAIR9_protein||AT1G29980.1 | Symbols: | INVOLVED IN: biological_process
unknown; LOCATED IN: plasma membrane, anchored to membrane; EXPRESSED
IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS
InterPro DOMAIN/s: Protein of unknown function DUF642
(InterPro:IPR006946), Galactose-binding like (InterPro:IPR008979); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G34510.1); Has 180 Blast hits to 163 proteins in 15 species:
Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0; Plants - 170;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr1:10503411-10505994 REVERSE
Length = 408
Score = 421 bits (1081), Expect = 7e-118
Identities = 193/235 (82%), Positives = 218/235 (92%)
Frame = +3
Query: 459 ASQTIDLQTLYSVQGWDPYAWAFEAEEDDVKLVFRNPGMEDDPTCGPIIDDVAIKKLFTP 638
AS+ +DLQTLYSVQGWDPYAWAFEAE+D V+LVF+NPGMEDDPTCGPIIDD+AIKKLFTP
Sbjct: 150 ASRNVDLQTLYSVQGWDPYAWAFEAEDDHVRLVFKNPGMEDDPTCGPIIDDIAIKKLFTP 209
Query: 639 DKPKNNAVVNGDFEEGPWMLANVSLGVLLPTNLDEETSSLPGWNVESNRAVRYIDSYHFS 818
DKPK+NAV+NGDFE+GPWM N SLGVLLPTNLDEE SSLPGW VESNRAVR++DS HFS
Sbjct: 210 DKPKDNAVINGDFEDGPWMFRNTSLGVLLPTNLDEEISSLPGWTVESNRAVRFVDSDHFS 269
Query: 819 VPQGKRAIELLSGKEGIISQMVETKPNKPYTLTFSLGHANDKCKQPLAVMAFAGDQAQNV 998
VP+GKRA+ELLSGKEGIISQMVETK +KPY L+FSLGHA DKCK+PLA+MAFAGDQAQN
Sbjct: 270 VPKGKRAVELLSGKEGIISQMVETKADKPYILSFSLGHAGDKCKEPLAIMAFAGDQAQNF 329
Query: 999 HYTPNSNSTFQSANVNFTAKAERSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVWF 1163
HY +NS+F+ A +NFTAKA+R+R+AFYSVYYNTRTDDMSSLCGPV+DDVRVWF
Sbjct: 330 HYMAQANSSFEKAGLNFTAKADRTRVAFYSVYYNTRTDDMSSLCGPVIDDVRVWF 384
Score = 164 bits (414), Expect = 1e-040
Identities = 76/109 (69%), Positives = 91/109 (83%)
Frame = +3
Query: 150 DGPLPNGDFEATPTGGFSSDAIVDGPTGIPSWKSNGTVEVVEAGQKQGGMLLIVPQGKHA 329
DG + NGDFE +P+ GF D + DGP+ IPSWKSNGTVE++ +GQKQGGM+LIVPQG+HA
Sbjct: 38 DGLVINGDFETSPSSGFPDDGVTDGPSDIPSWKSNGTVELINSGQKQGGMILIVPQGRHA 97
Query: 330 VRLGNDAEVSQDVKVEKGSIYSVTFSAARTCAQLESLNVSVLPASQTID 476
VRLGNDAE+SQD+ VEKG +YSVTFSAARTCAQLES+NVSV + D
Sbjct: 98 VRLGNDAEISQDLTVEKGFVYSVTFSAARTCAQLESINVSVASVNADAD 146
>TAIR9_protein||AT4G32460.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
plant-type cell wall; EXPRESSED IN: 24 plant structures; EXPRESSED
DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Protein of
unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT5G11420.1); Has 182 Blast hits to 158 proteins
in 12 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants
- 180; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr4:15663036-15664859 REVERSE
Length = 366
Score = 364 bits (933), Expect = 1e-100
Identities = 174/336 (51%), Positives = 230/336 (68%)
Frame = +3
Query: 150 DGPLPNGDFEATPTGGFSSDAIVDGPTGIPSWKSNGTVEVVEAGQKQGGMLLIVPQGKHA 329
DG LPNGDFE P V T IP+W+ +G VE + +G KQG M+L+VP+G A
Sbjct: 24 DGLLPNGDFELGPRHSDMKGTQVINITAIPNWELSGFVEYIPSGHKQGDMILVVPKGAFA 83
Query: 330 VRLGNDAEVSQDVKVEKGSIYSVTFSAARTCAQLESLNVSVLPASQTIDLQTLYSVQGWD 509
VRLGN+A + Q + V+KGS YS+TFSAARTCAQ E LNVSV P + +QT+YS GWD
Sbjct: 84 VRLGNEASIKQKISVKKGSYYSITFSAARTCAQDERLNVSVAPHHAVMPIQTVYSSSGWD 143
Query: 510 PYAWAFEAEEDDVKLVFRNPGMEDDPTCGPIIDDVAIKKLFTPDKPKNNAVVNGDFEEGP 689
Y+WAF+A+ D +V NPG+E+DP CGP+ID VA++ LF P N + NG FEEGP
Sbjct: 144 LYSWAFKAQSDYADIVIHNPGVEEDPACGPLIDGVAMRALFPPRPTNKNILKNGGFEEGP 203
Query: 690 WMLANVSLGVLLPTNLDEETSSLPGWNVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGI 869
W+L N+S GVL+P N ++ S LPGW VES +AV+YIDS HFSVPQG+RA+EL++GKE
Sbjct: 204 WVLPNISSGVLIPPNSIDDHSPLPGWMVESLKAVKYIDSDHFSVPQGRRAVELVAGKESA 263
Query: 870 ISQMVETKPNKPYTLTFSLGHANDKCKQPLAVMAFAGDQAQNVHYTPNSNSTFQSANVNF 1049
++Q+V T P K Y L+FS+G A++ C + V AFAG V Y F+ +++ F
Sbjct: 264 VAQVVRTIPGKTYVLSFSVGDASNACAGSMIVEAFAGKDTIKVPYESKGKGGFKRSSLRF 323
Query: 1050 TAKAERSRIAFYSVYYNTRTDDMSSLCGPVVDDVRV 1157
A + R+R+ FYS +Y R DD SSLCGPV+DDV++
Sbjct: 324 VAVSSRTRVMFYSTFYAMRNDDFSSLCGPVIDDVKL 359
>TAIR9_protein||AT4G32460.2 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
plant-type cell wall; EXPRESSED IN: 24 plant structures; EXPRESSED
DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Protein of
unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT5G11420.1); Has 182 Blast hits to 158 proteins
in 12 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants
- 180; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr4:15663036-15664859 REVERSE
Length = 366
Score = 364 bits (933), Expect = 1e-100
Identities = 174/336 (51%), Positives = 230/336 (68%)
Frame = +3
Query: 150 DGPLPNGDFEATPTGGFSSDAIVDGPTGIPSWKSNGTVEVVEAGQKQGGMLLIVPQGKHA 329
DG LPNGDFE P V T IP+W+ +G VE + +G KQG M+L+VP+G A
Sbjct: 24 DGLLPNGDFELGPRHSDMKGTQVINITAIPNWELSGFVEYIPSGHKQGDMILVVPKGAFA 83
Query: 330 VRLGNDAEVSQDVKVEKGSIYSVTFSAARTCAQLESLNVSVLPASQTIDLQTLYSVQGWD 509
VRLGN+A + Q + V+KGS YS+TFSAARTCAQ E LNVSV P + +QT+YS GWD
Sbjct: 84 VRLGNEASIKQKISVKKGSYYSITFSAARTCAQDERLNVSVAPHHAVMPIQTVYSSSGWD 143
Query: 510 PYAWAFEAEEDDVKLVFRNPGMEDDPTCGPIIDDVAIKKLFTPDKPKNNAVVNGDFEEGP 689
Y+WAF+A+ D +V NPG+E+DP CGP+ID VA++ LF P N + NG FEEGP
Sbjct: 144 LYSWAFKAQSDYADIVIHNPGVEEDPACGPLIDGVAMRALFPPRPTNKNILKNGGFEEGP 203
Query: 690 WMLANVSLGVLLPTNLDEETSSLPGWNVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGI 869
W+L N+S GVL+P N ++ S LPGW VES +AV+YIDS HFSVPQG+RA+EL++GKE
Sbjct: 204 WVLPNISSGVLIPPNSIDDHSPLPGWMVESLKAVKYIDSDHFSVPQGRRAVELVAGKESA 263
Query: 870 ISQMVETKPNKPYTLTFSLGHANDKCKQPLAVMAFAGDQAQNVHYTPNSNSTFQSANVNF 1049
++Q+V T P K Y L+FS+G A++ C + V AFAG V Y F+ +++ F
Sbjct: 264 VAQVVRTIPGKTYVLSFSVGDASNACAGSMIVEAFAGKDTIKVPYESKGKGGFKRSSLRF 323
Query: 1050 TAKAERSRIAFYSVYYNTRTDDMSSLCGPVVDDVRV 1157
A + R+R+ FYS +Y R DD SSLCGPV+DDV++
Sbjct: 324 VAVSSRTRVMFYSTFYAMRNDDFSSLCGPVIDDVKL 359
>TAIR9_protein||AT5G11420.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cell
wall, plant-type cell wall; EXPRESSED IN: 23 plant structures;
EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein
of unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT5G25460.1); Has 185 Blast hits to 157 proteins
in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants
- 185; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr5:3644655-3646991 FORWARD
Length = 367
Score = 364 bits (933), Expect = 1e-100
Identities = 175/351 (49%), Positives = 238/351 (67%)
Frame = +3
Query: 105 FMLVLGSLASQISDLDGPLPNGDFEATPTGGFSSDAIVDGPTGIPSWKSNGTVEVVEAGQ 284
F+L++ ++ S I DG LPNGDFE P V IPSW+ +G VE +++GQ
Sbjct: 10 FVLLIATITSVICFSDGMLPNGDFELGPKPSDMKGTQVINKKAIPSWELSGFVEYIKSGQ 69
Query: 285 KQGGMLLIVPQGKHAVRLGNDAEVSQDVKVEKGSIYSVTFSAARTCAQLESLNVSVLPAS 464
KQG MLL+VP GK A+RLGN+A + Q + V KG YS+TFSAARTCAQ E LN+SV P S
Sbjct: 70 KQGDMLLVVPAGKFAIRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDERLNISVAPDS 129
Query: 465 QTIDLQTLYSVQGWDPYAWAFEAEEDDVKLVFRNPGMEDDPTCGPIIDDVAIKKLFTPDK 644
I +QT+YS GWD YAWAF+AE + ++V NPG E+DP CGP+ID VAIK L+ P
Sbjct: 130 GVIPIQTVYSSSGWDLYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVAIKALYPPRP 189
Query: 645 PKNNAVVNGDFEEGPWMLANVSLGVLLPTNLDEETSSLPGWNVESNRAVRYIDSYHFSVP 824
N + NG FEEGP++L N + GVL+P ++++ S LP W VES +A++Y+D HFSVP
Sbjct: 190 TNKNILKNGGFEEGPYVLPNATTGVLVPPFIEDDHSPLPAWMVESLKAIKYVDVEHFSVP 249
Query: 825 QGKRAIELLSGKEGIISQMVETKPNKPYTLTFSLGHANDKCKQPLAVMAFAGDQAQNVHY 1004
QG+RA+EL++GKE I+Q+ T K Y L+F++G AN+ C+ + V AFAG V Y
Sbjct: 250 QGRRAVELVAGKESAIAQVARTVVGKTYVLSFAVGDANNACQGSMVVEAFAGKDTLKVPY 309
Query: 1005 TPNSNSTFQSANVNFTAKAERSRIAFYSVYYNTRTDDMSSLCGPVVDDVRV 1157
F+ A++ F A + R+R+ FYS +Y+ R+DD SSLCGPV+DDV++
Sbjct: 310 ESRGKGGFKRASLRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDVKL 360
>TAIR9_protein||AT5G25460.1 | Symbols: | unknown protein | chr5:8863430-8865394
FORWARD
Length = 370
Score = 361 bits (926), Expect = 6e-100
Identities = 179/353 (50%), Positives = 235/353 (66%)
Frame = +3
Query: 99 LPFMLVLGSLASQISDLDGPLPNGDFEATPTGGFSSDAIVDGPTGIPSWKSNGTVEVVEA 278
L F+ + S +S DG LPNGDFE P + IP+W+ G VE +++
Sbjct: 11 LLFIATAMAAKSTVSFRDGMLPNGDFELGPKPSDMKGTEILNKLAIPNWEVTGFVEYIKS 70
Query: 279 GQKQGGMLLIVPQGKHAVRLGNDAEVSQDVKVEKGSIYSVTFSAARTCAQLESLNVSVLP 458
G KQG MLL+VP GK AVRLGN+A + Q +KV KG YS+TFSAARTCAQ E LN+SV P
Sbjct: 71 GHKQGDMLLVVPAGKFAVRLGNEASIKQRLKVVKGMYYSLTFSAARTCAQDERLNISVAP 130
Query: 459 ASQTIDLQTLYSVQGWDPYAWAFEAEEDDVKLVFRNPGMEDDPTCGPIIDDVAIKKLFTP 638
S I +QT+YS GWD YAWAF+AE D ++V NPG+E+DP CGP+ID VA++ L+ P
Sbjct: 131 DSGVIPIQTVYSSSGWDLYAWAFQAESDVAEVVIHNPGVEEDPACGPLIDGVAMRSLYPP 190
Query: 639 DKPKNNAVVNGDFEEGPWMLANVSLGVLLPTNLDEETSSLPGWNVESNRAVRYIDSYHFS 818
N + NG FEEGP +L + GVL+P ++++ S LPGW VES +AV+Y+D HFS
Sbjct: 191 RPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHSPLPGWMVESLKAVKYVDVEHFS 250
Query: 819 VPQGKRAIELLSGKEGIISQMVETKPNKPYTLTFSLGHANDKCKQPLAVMAFAGDQAQNV 998
VPQG+RAIEL++GKE I+Q+V T K Y L+F++G AN+ CK + V AFAG V
Sbjct: 251 VPQGRRAIELVAGKESAIAQVVRTVIGKTYVLSFAVGDANNACKGSMVVEAFAGKDTLKV 310
Query: 999 HYTPNSNSTFQSANVNFTAKAERSRIAFYSVYYNTRTDDMSSLCGPVVDDVRV 1157
Y F+ A++ F A + RSRI FYS +Y R+DD SSLCGPV+DDV++
Sbjct: 311 PYESKGTGGFKRASIRFVAVSTRSRIMFYSTFYAMRSDDFSSLCGPVIDDVKL 363
>TAIR9_protein||AT1G80240.1 | Symbols: | unknown protein |
chr1:30171520-30172799 REVERSE
Length = 371
Score = 339 bits (868), Expect = 3e-093
Identities = 163/336 (48%), Positives = 222/336 (66%)
Frame = +3
Query: 150 DGPLPNGDFEATPTGGFSSDAIVDGPTGIPSWKSNGTVEVVEAGQKQGGMLLIVPQGKHA 329
DG LPNG+FE P ++V T +P+W G VE +++GQKQ M+L+VPQG A
Sbjct: 27 DGLLPNGNFELGPKPSQMKGSVVKERTAVPNWNIIGFVEFIKSGQKQDDMVLVVPQGSSA 86
Query: 330 VRLGNDAEVSQDVKVEKGSIYSVTFSAARTCAQLESLNVSVLPASQTIDLQTLYSVQGWD 509
VRLGN+A +SQ + V G +YS+TFSAARTCAQ E LN+SV S I +QT+Y GWD
Sbjct: 87 VRLGNEASISQKISVLPGRLYSITFSAARTCAQDERLNISVTHESGVIPIQTMYGSDGWD 146
Query: 510 PYAWAFEAEEDDVKLVFRNPGMEDDPTCGPIIDDVAIKKLFTPDKPKNNAVVNGDFEEGP 689
Y+WAF+A ++++ F NPG+E+ P CGP+ID VAIK LF P N + NG+FEEGP
Sbjct: 147 SYSWAFKAGGPEIEIRFHNPGVEEHPACGPLIDAVAIKALFPPRFSGYNLIKNGNFEEGP 206
Query: 690 WMLANVSLGVLLPTNLDEETSSLPGWNVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGI 869
++ GVL+P ++++ S LPGW +ES +AV+Y+D HF+VP+G RAIEL+ GKE
Sbjct: 207 YVFPTAKWGVLIPPFIEDDNSPLPGWMIESLKAVKYVDKAHFAVPEGHRAIELVGGKESA 266
Query: 870 ISQMVETKPNKPYTLTFSLGHANDKCKQPLAVMAFAGDQAQNVHYTPNSNSTFQSANVNF 1049
ISQ+V T NK Y LTF++G A D C+ P+ V AFAG V Y F+ + F
Sbjct: 267 ISQIVRTSLNKFYALTFNVGDARDGCEGPMIVEAFAGQGKVMVDYASKGKGGFRRGRLVF 326
Query: 1050 TAKAERSRIAFYSVYYNTRTDDMSSLCGPVVDDVRV 1157
A + R+R+ F S +Y+ ++D SLCGPV+DDVR+
Sbjct: 327 KAVSARTRVTFLSTFYHMKSDHSGSLCGPVIDDVRL 362
>TAIR9_protein||AT2G41810.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
endomembrane system; EXPRESSED IN: root; CONTAINS InterPro DOMAIN/s:
Protein of unknown function DUF642 (InterPro:IPR006946),
Galactose-binding like (InterPro:IPR008979); BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT2G41800.1); Has 161 Blast
hits to 157 proteins in 12 species: Archae - 0; Bacteria - 2; Metazoa -
0; Fungi - 0; Plants - 159; Viruses - 0; Other Eukaryotes - 0 (source:
NCBI BLink). | chr2:17439414-17441296 REVERSE
Length = 371
Score = 309 bits (790), Expect = 4e-084
Identities = 155/344 (45%), Positives = 213/344 (61%)
Frame = +3
Query: 129 ASQISDLDGPLPNGDFEATPTGGFSSDAIVDGPTGIPSWKSNGTVEVVEAGQKQGGMLLI 308
A + LDG LPNG+FE P + G +P W+ +G VE+V G + GG
Sbjct: 24 AQRTPHLDGLLPNGNFEQIPNKSNMRKRQIIGKYSLPHWEISGHVELVSGGPQPGGFYFA 83
Query: 309 VPQGKHAVRLGNDAEVSQDVKVEKGSIYSVTFSAARTCAQLESLNVSVLPASQTIDLQTL 488
VP+G HA RLGN A +SQ VKV+ G +YS+TF RTCAQ E++ +SV + + +QTL
Sbjct: 84 VPRGVHAARLGNLASISQYVKVKSGLVYSLTFGVTRTCAQDENIRISVPGQTNELPIQTL 143
Query: 489 YSVQGWDPYAWAFEAEEDDVKLVFRNPGMEDDPTCGPIIDDVAIKKLFTPDKPKNNAVVN 668
+S G D YAWAF+A D VK+ F NPG+++DPTCGPI+D VAIK++ K N V N
Sbjct: 144 FSTNGGDTYAWAFKATSDLVKVTFYNPGVQEDPTCGPIVDAVAIKEILPLRYTKGNLVKN 203
Query: 669 GDFEEGPWMLANVSLGVLLPTNLDEETSSLPGWNVESNRAVRYIDSYHFSVPQGKRAIEL 848
G FE GP + +N S G+L+P + + S LPGW VES + V+YID+ HF VP G AIEL
Sbjct: 204 GGFETGPHVFSNFSTGILIPAKIQDLISPLPGWIVESLKPVKYIDNRHFKVPSGLAAIEL 263
Query: 849 LSGKEGIISQMVETKPNKPYTLTFSLGHANDKCKQPLAVMAFAGDQAQNVHYTPNSNSTF 1028
++G+E I+Q++ T K Y L+F +G A++ C + V AFAG A V + N F
Sbjct: 264 VAGRESAIAQIIRTVSGKNYILSFVVGDAHNGCHGSMMVEAFAGISAFKVTFESNDKGAF 323
Query: 1029 QSANVNFTAKAERSRIAFYSVYYNTRTDDMSSLCGPVVDDVRVW 1160
+ F A + R+RI FYS +Y+T+ D LCGPV+D+V V+
Sbjct: 324 KVGRFAFRADSNRTRITFYSGFYHTKLHDFGHLCGPVLDNVSVF 367
>TAIR9_protein||AT3G08030.1 | Symbols: | unknown protein | chr3:2564191-2565819
FORWARD
Length = 366
Score = 307 bits (785), Expect = 1e-083
Identities = 160/360 (44%), Positives = 225/360 (62%), Gaps = 1/360 (0%)
Frame = +3
Query: 81 ATPK-WVLPFMLVLGSLASQISDLDGPLPNGDFEATPTGGFSSDAIVDGPTGIPSWKSNG 257
A PK +LP +L++ A +G L NG+FE +P ++ G +P W++ G
Sbjct: 2 AVPKAIILPILLLICGAALGAPASEGYLRNGNFEESPKKTDMKKTVLLGKNALPEWETTG 61
Query: 258 TVEVVEAGQKQGGMLLIVPQGKHAVRLGNDAEVSQDVKVEKGSIYSVTFSAARTCAQLES 437
VE + G + GGM V G HAVRLGN+A +SQ ++V+ GS+Y++TF A+RTCAQ E
Sbjct: 62 FVEYIAGGPQPGGMYFPVAHGVHAVRLGNEATISQKLEVKPGSLYALTFGASRTCAQDEV 121
Query: 438 LNVSVLPASQTIDLQTLYSVQGWDPYAWAFEAEEDDVKLVFRNPGMEDDPTCGPIIDDVA 617
L VSV S + LQTLY+ G D YAWAF A+ V + F NPG+++DP CGP++D VA
Sbjct: 122 LRVSVPSQSGDLPLQTLYNSFGGDVYAWAFVAKTSQVTVTFHNPGVQEDPACGPLLDAVA 181
Query: 618 IKKLFTPDKPKNNAVVNGDFEEGPWMLANVSLGVLLPTNLDEETSSLPGWNVESNRAVRY 797
IK+L P + N V NG FEEGP L N + GVLLP ++ TS LPGW +ES +AV++
Sbjct: 182 IKELVHPIYTRGNLVKNGGFEEGPHRLVNSTQGVLLPPKQEDLTSPLPGWIIESLKAVKF 241
Query: 798 IDSYHFSVPQGKRAIELLSGKEGIISQMVETKPNKPYTLTFSLGHANDKCKQPLAVMAFA 977
IDS +F+VP G AIEL++GKE I+Q++ T P + YTL+F +G A + C + V AFA
Sbjct: 242 IDSKYFNVPFGHAAIELVAGKESAIAQVIRTSPGQTYTLSFVVGDAKNDCHGSMMVEAFA 301
Query: 978 GDQAQNVHYTPNSNSTFQSANVNFTAKAERSRIAFYSVYYNTRTDDMSSLCGPVVDDVRV 1157
V +T ++A+ F A R+RI F+S +Y+T+ D SLCGPV+D++ V
Sbjct: 302 ARDTLKVPHTSVGGGHVKTASFKFKAVEARTRITFFSGFYHTKKTDTVSLCGPVIDEIVV 361
>TAIR9_protein||AT2G41800.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cell
wall, plant-type cell wall; EXPRESSED IN: 7 plant structures; EXPRESSED
DURING: 4 anthesis, C globular stage, petal differentiation and
expansion stage; CONTAINS InterPro DOMAIN/s: Protein of unknown
function DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT2G41810.1); Has 156 Blast hits to 155 proteins
in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants
- 156; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr2:17436671-17438005 REVERSE
Length = 371
Score = 305 bits (779), Expect = 7e-083
Identities = 149/341 (43%), Positives = 214/341 (62%)
Frame = +3
Query: 135 QISDLDGPLPNGDFEATPTGGFSSDAIVDGPTGIPSWKSNGTVEVVEAGQKQGGMLLIVP 314
++ LDG LPNG+FE TP + G +P W+ G VE+V G + GG VP
Sbjct: 26 RVPHLDGILPNGNFEITPLKSNMKGRQIIGANSLPHWEIAGHVELVSGGPQPGGFYFPVP 85
Query: 315 QGKHAVRLGNDAEVSQDVKVEKGSIYSVTFSAARTCAQLESLNVSVLPASQTIDLQTLYS 494
+G HAVRLGN +SQ+V+V+ G +YS+TF A RTCAQ E++ VSV + + LQT++S
Sbjct: 86 RGVHAVRLGNLGTISQNVRVKSGLVYSLTFGATRTCAQDENIKVSVPGQANELPLQTVFS 145
Query: 495 VQGWDPYAWAFEAEEDDVKLVFRNPGMEDDPTCGPIIDDVAIKKLFTPDKPKNNAVVNGD 674
G D YAWAF+A D VK+ F NPG+++D TCGP++D VAIK++ + N V NG
Sbjct: 146 SDGGDTYAWAFKATSDVVKVTFHNPGVQEDRTCGPLLDVVAIKEILPLRYTRGNLVKNGG 205
Query: 675 FEEGPWMLANVSLGVLLPTNLDEETSSLPGWNVESNRAVRYIDSYHFSVPQGKRAIELLS 854
FE GP + AN S G+L+P + + S LPGW VES + V+YID HF VP G+ A+EL++
Sbjct: 206 FEIGPHVFANFSTGILIPARIQDFISPLPGWIVESLKPVKYIDRRHFKVPYGQGAVELVA 265
Query: 855 GKEGIISQMVETKPNKPYTLTFSLGHANDKCKQPLAVMAFAGDQAQNVHYTPNSNSTFQS 1034
G+E I+Q++ T K Y L+F++G A + C + V AFAG + + + F++
Sbjct: 266 GRESAIAQIIRTIAGKAYMLSFAVGDAQNGCHGSMMVEAFAGREPFKLSFMSEGKGAFKT 325
Query: 1035 ANVNFTAKAERSRIAFYSVYYNTRTDDMSSLCGPVVDDVRV 1157
+ F A ++R+R+ FYS +Y+T+ D LCGPV+D V V
Sbjct: 326 GHFRFVADSDRTRLTFYSAFYHTKLHDFGHLCGPVLDSVVV 366
>TAIR9_protein||AT3G08030.2 | Symbols: | unknown protein | chr3:2564517-2565819
FORWARD
Length = 324
Score = 291 bits (743), Expect = 1e-078
Identities = 146/312 (46%), Positives = 202/312 (64%)
Frame = +3
Query: 222 GPTGIPSWKSNGTVEVVEAGQKQGGMLLIVPQGKHAVRLGNDAEVSQDVKVEKGSIYSVT 401
G +P W++ G VE + G + GGM V G HAVRLGN+A +SQ ++V+ GS+Y++T
Sbjct: 8 GKNALPEWETTGFVEYIAGGPQPGGMYFPVAHGVHAVRLGNEATISQKLEVKPGSLYALT 67
Query: 402 FSAARTCAQLESLNVSVLPASQTIDLQTLYSVQGWDPYAWAFEAEEDDVKLVFRNPGMED 581
F A+RTCAQ E L VSV S + LQTLY+ G D YAWAF A+ V + F NPG+++
Sbjct: 68 FGASRTCAQDEVLRVSVPSQSGDLPLQTLYNSFGGDVYAWAFVAKTSQVTVTFHNPGVQE 127
Query: 582 DPTCGPIIDDVAIKKLFTPDKPKNNAVVNGDFEEGPWMLANVSLGVLLPTNLDEETSSLP 761
DP CGP++D VAIK+L P + N V NG FEEGP L N + GVLLP ++ TS LP
Sbjct: 128 DPACGPLLDAVAIKELVHPIYTRGNLVKNGGFEEGPHRLVNSTQGVLLPPKQEDLTSPLP 187
Query: 762 GWNVESNRAVRYIDSYHFSVPQGKRAIELLSGKEGIISQMVETKPNKPYTLTFSLGHAND 941
GW +ES +AV++IDS +F+VP G AIEL++GKE I+Q++ T P + YTL+F +G A +
Sbjct: 188 GWIIESLKAVKFIDSKYFNVPFGHAAIELVAGKESAIAQVIRTSPGQTYTLSFVVGDAKN 247
Query: 942 KCKQPLAVMAFAGDQAQNVHYTPNSNSTFQSANVNFTAKAERSRIAFYSVYYNTRTDDMS 1121
C + V AFA V +T ++A+ F A R+RI F+S +Y+T+ D
Sbjct: 248 DCHGSMMVEAFAARDTLKVPHTSVGGGHVKTASFKFKAVEARTRITFFSGFYHTKKTDTV 307
Query: 1122 SLCGPVVDDVRV 1157
SLCGPV+D++ V
Sbjct: 308 SLCGPVIDEIVV 319
>TAIR9_protein||AT5G14150.1 | Symbols: | unknown protein | chr5:4565246-4566653
REVERSE
Length = 384
Score = 134 bits (335), Expect = 2e-031
Identities = 111/364 (30%), Positives = 167/364 (45%), Gaps = 28/364 (7%)
Frame = +3
Query: 105 FMLVLGSLASQISDLDGPLPNGDFEA----TPTGGFSSDAIVDGPTGIPSWKSNGTVEVV 272
F+L+L S + L+ P DFE+ PT +S +D + +P W GTV V
Sbjct: 9 FLLLLVSCCASSDFLENP----DFESPPLNLPTNSNASSVSLDQNSTLPGWTFQGTVLYV 64
Query: 273 EAGQKQGGMLLIVPQGKHAVRLGNDAEVSQDVKVEKGSIYSVTFSAARTCAQLESLNVSV 452
E G + + GK N +++ ++ +++ A + C L+VS
Sbjct: 65 EL-PDTGHAVQLGEDGKI-----NQTFIAKGDELNYILTFAL-IHAGQNCTSSAGLSVSG 117
Query: 453 LPASQTIDLQTLYSVQGWDPYAWAFEA--EEDDVKLVFRNPGMEDD----PTCGPIIDDV 614
++ + YS W Y+ + + + LV + ++ D TC PIID +
Sbjct: 118 PDSNAVFSYRQNYSKVSWQSYSHNLGSWGNGEPINLVLESQAIDSDSDTNSTCWPIIDTL 177
Query: 615 AIKKL-FTPDKPKNNAVVNGDFEEGPWMLANVSLGVLLPTNLDEETSSLPGWNVESNRAV 791
IK + T + N ++NG FE GP L N + GVL+ S L W+V V
Sbjct: 178 LIKTVGVTLVQDSGNLLINGGFESGPGFLPNSTDGVLIDAVPSLIQSPLRQWSVIG--TV 235
Query: 792 RYIDSYHFSVPQGKRAIELLS--GKEGIISQMVETKPNKPYTLTFSLGHANDKCKQPLAV 965
RYIDS HF VP+GK AIE+LS GI + T Y LTF+LG AND C+ V
Sbjct: 236 RYIDSEHFHVPEGKAAIEILSNTAPSGIQTATKGTSEGSRYNLTFTLGDANDACRGHFVV 295
Query: 966 MAFAGDQAQNVHYTPNSNSTFQSANVNFTAKAERSRIAFYSVYYNTRTDDMSSLCGPVVD 1145
A AG QN N + + + F A + ++I+F S Y+ + +CGPV+D
Sbjct: 296 GAQAGSVTQNFTLESNGTGSGEKFGLVFEADKDAAQISFTS--YSVTMTKENVVCGPVID 353
Query: 1146 DVRV 1157
+V V
Sbjct: 354 EVMV 357
Database: TAIR9 protein
Posted date: Wed Jul 08 15:16:08 2009
Number of letters in database: 13,468,323
Number of sequences in database: 33,410
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 747,274,820
Number of Sequences: 33410
Number of Extensions: 747274820
Number of Successful Extensions: 41161775
Number of sequences better than 0.0: 0
|