BLASTX 7.6.2
Query= UN18527 /QuerySize=1351
(1350 letters)
Database: TAIR9 protein;
33,410 sequences; 13,468,323 total letters
Score E
Sequences producing significant alignments: (bits) Value
TAIR9_protein||AT5G25460.1 | Symbols: | unknown protein | chr5:... 519 2e-147
TAIR9_protein||AT5G11420.1 | Symbols: | FUNCTIONS IN: molecular... 507 9e-144
TAIR9_protein||AT4G32460.1 | Symbols: | FUNCTIONS IN: molecular... 463 1e-130
TAIR9_protein||AT4G32460.2 | Symbols: | FUNCTIONS IN: molecular... 463 1e-130
TAIR9_protein||AT1G80240.1 | Symbols: | unknown protein | chr1:... 363 2e-100
TAIR9_protein||AT3G08030.1 | Symbols: | unknown protein | chr3:... 313 2e-085
TAIR9_protein||AT3G08030.2 | Symbols: | unknown protein | chr3:... 313 2e-085
TAIR9_protein||AT2G41800.1 | Symbols: | FUNCTIONS IN: molecular... 312 3e-085
TAIR9_protein||AT2G41810.1 | Symbols: | FUNCTIONS IN: molecular... 304 1e-082
TAIR9_protein||AT2G34510.1 | Symbols: | FUNCTIONS IN: molecular... 265 8e-071
TAIR9_protein||AT1G29980.2 | Symbols: | INVOLVED IN: biological... 239 4e-063
TAIR9_protein||AT1G29980.1 | Symbols: | INVOLVED IN: biological... 239 4e-063
TAIR9_protein||AT5G14150.1 | Symbols: | unknown protein | chr5:... 133 3e-031
>TAIR9_protein||AT5G25460.1 | Symbols: | unknown protein | chr5:8863430-8865394
FORWARD
Length = 370
Score = 519 bits (1335), Expect = 2e-147
Identities = 252/266 (94%), Positives = 263/266 (98%)
Frame = -2
Query: 995 VKGMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAESEVAEI 816
VKGMYYSLTFSAARTCAQDERLNISVAPDSGVIP+QTVYSSSGWDLYAWAFQAES+VAE+
Sbjct: 103 VKGMYYSLTFSAARTCAQDERLNISVAPDSGVIPIQTVYSSSGWDLYAWAFQAESDVAEV 162
Query: 815 VIHNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPF 636
VIHNPG EEDPACGPLIDGVAMR+LYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPF
Sbjct: 163 VIHNPGVEEDPACGPLIDGVAMRSLYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPF 222
Query: 635 IEDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTIIGKTYVL 456
IEDDH+PLPGWMVESLKAVKYVD EHFSVPQGRRAIELVAGKESAIAQV RT+IGKTYVL
Sbjct: 223 IEDDHSPLPGWMVESLKAVKYVDVEHFSVPQGRRAIELVAGKESAIAQVVRTVIGKTYVL 282
Query: 455 SFAVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMFYSTF 276
SFAVGDANNACKGSMVVEAFAG+DTLKVPYES+GTGGFKRASIRFVAVSTR+R+MFYSTF
Sbjct: 283 SFAVGDANNACKGSMVVEAFAGKDTLKVPYESKGTGGFKRASIRFVAVSTRSRIMFYSTF 342
Query: 275 YAMRSDDFSSLCGPVIDDVKLLSVRK 198
YAMRSDDFSSLCGPVIDDVKL+SVRK
Sbjct: 343 YAMRSDDFSSLCGPVIDDVKLISVRK 368
Score = 165 bits (416), Expect = 8e-041
Identities = 84/101 (83%), Positives = 90/101 (89%), Gaps = 2/101 (1%)
Frame = -1
Query: 1293 MWGVTVVSLVLLLIATANSA--VVPFRDGILPNGDFELGPKSSDMKGTEIINKMAIPSWE 1120
M GVTVVS LL IATA +A V FRDG+LPNGDFELGPK SDMKGTEI+NK+AIP+WE
Sbjct: 1 MEGVTVVSFFLLFIATAMAAKSTVSFRDGMLPNGDFELGPKPSDMKGTEILNKLAIPNWE 60
Query: 1119 VTGFVEYISSGHKQGDMLLVVPAGKFAVRLGNEASIKQRIK 997
VTGFVEYI SGHKQGDMLLVVPAGKFAVRLGNEASIKQR+K
Sbjct: 61 VTGFVEYIKSGHKQGDMLLVVPAGKFAVRLGNEASIKQRLK 101
>TAIR9_protein||AT5G11420.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cell
wall, plant-type cell wall; EXPRESSED IN: 23 plant structures;
EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein
of unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT5G25460.1); Has 185 Blast hits to 157 proteins
in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants
- 185; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr5:3644655-3646991 FORWARD
Length = 367
Score = 507 bits (1304), Expect = 9e-144
Identities = 244/265 (92%), Positives = 258/265 (97%)
Frame = -2
Query: 992 KGMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAESEVAEIV 813
KGMYYSLTFSAARTCAQDERLNISVAPDSGVIP+QTVYSSSGWDLYAWAFQAES VAEIV
Sbjct: 101 KGMYYSLTFSAARTCAQDERLNISVAPDSGVIPIQTVYSSSGWDLYAWAFQAESNVAEIV 160
Query: 812 IHNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFI 633
IHNPGEEEDPACGPLIDGVA++ALYPPRPTNKNILKNGGFEEGP VLP +TTGVL+PPFI
Sbjct: 161 IHNPGEEEDPACGPLIDGVAIKALYPPRPTNKNILKNGGFEEGPYVLPNATTGVLVPPFI 220
Query: 632 EDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTIIGKTYVLS 453
EDDH+PLP WMVESLKA+KYVD EHFSVPQGRRA+ELVAGKESAIAQVART++GKTYVLS
Sbjct: 221 EDDHSPLPAWMVESLKAIKYVDVEHFSVPQGRRAVELVAGKESAIAQVARTVVGKTYVLS 280
Query: 452 FAVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMFYSTFY 273
FAVGDANNAC+GSMVVEAFAG+DTLKVPYESRG GGFKRAS+RFVAVSTRTRVMFYSTFY
Sbjct: 281 FAVGDANNACQGSMVVEAFAGKDTLKVPYESRGKGGFKRASLRFVAVSTRTRVMFYSTFY 340
Query: 272 AMRSDDFSSLCGPVIDDVKLLSVRK 198
+MRSDDFSSLCGPVIDDVKLLS RK
Sbjct: 341 SMRSDDFSSLCGPVIDDVKLLSARK 365
Score = 144 bits (362), Expect = 2e-034
Identities = 72/96 (75%), Positives = 83/96 (86%), Gaps = 1/96 (1%)
Frame = -1
Query: 1287 GVTVVSLVLLLIATANSAVVPFRDGILPNGDFELGPKSSDMKGTEIINKMAIPSWEVTGF 1108
G ++ L +LLIAT S V+ F DG+LPNGDFELGPK SDMKGT++INK AIPSWE++GF
Sbjct: 3 GGSLSFLFVLLIATITS-VICFSDGMLPNGDFELGPKPSDMKGTQVINKKAIPSWELSGF 61
Query: 1107 VEYISSGHKQGDMLLVVPAGKFAVRLGNEASIKQRI 1000
VEYI SG KQGDMLLVVPAGKFA+RLGNEASIKQR+
Sbjct: 62 VEYIKSGQKQGDMLLVVPAGKFAIRLGNEASIKQRL 97
>TAIR9_protein||AT4G32460.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
plant-type cell wall; EXPRESSED IN: 24 plant structures; EXPRESSED
DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Protein of
unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT5G11420.1); Has 182 Blast hits to 158 proteins
in 12 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants
- 180; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr4:15663036-15664859 REVERSE
Length = 366
Score = 463 bits (1190), Expect = 1e-130
Identities = 221/270 (81%), Positives = 250/270 (92%)
Frame = -2
Query: 1007 KESKVKGMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAESE 828
K S KG YYS+TFSAARTCAQDERLN+SVAP V+P+QTVYSSSGWDLY+WAF+A+S+
Sbjct: 95 KISVKKGSYYSITFSAARTCAQDERLNVSVAPHHAVMPIQTVYSSSGWDLYSWAFKAQSD 154
Query: 827 VAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVL 648
A+IVIHNPG EEDPACGPLIDGVAMRAL+PPRPTNKNILKNGGFEEGP VLP ++GVL
Sbjct: 155 YADIVIHNPGVEEDPACGPLIDGVAMRALFPPRPTNKNILKNGGFEEGPWVLPNISSGVL 214
Query: 647 IPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTIIGK 468
IPP DDH+PLPGWMVESLKAVKY+D++HFSVPQGRRA+ELVAGKESA+AQV RTI GK
Sbjct: 215 IPPNSIDDHSPLPGWMVESLKAVKYIDSDHFSVPQGRRAVELVAGKESAVAQVVRTIPGK 274
Query: 467 TYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMF 288
TYVLSF+VGDA+NAC GSM+VEAFAG+DT+KVPYES+G GGFKR+S+RFVAVS+RTRVMF
Sbjct: 275 TYVLSFSVGDASNACAGSMIVEAFAGKDTIKVPYESKGKGGFKRSSLRFVAVSSRTRVMF 334
Query: 287 YSTFYAMRSDDFSSLCGPVIDDVKLLSVRK 198
YSTFYAMR+DDFSSLCGPVIDDVKLLS R+
Sbjct: 335 YSTFYAMRNDDFSSLCGPVIDDVKLLSARR 364
Score = 132 bits (332), Expect = 5e-031
Identities = 63/94 (67%), Positives = 77/94 (81%)
Frame = -1
Query: 1269 LVLLLIATANSAVVPFRDGILPNGDFELGPKSSDMKGTEIINKMAIPSWEVTGFVEYISS 1090
+VLLL+ + F DG+LPNGDFELGP+ SDMKGT++IN AIP+WE++GFVEYI S
Sbjct: 7 IVLLLLHSFFYVAFCFNDGLLPNGDFELGPRHSDMKGTQVINITAIPNWELSGFVEYIPS 66
Query: 1089 GHKQGDMLLVVPAGKFAVRLGNEASIKQRIKSER 988
GHKQGDM+LVVP G FAVRLGNEASIKQ+I ++
Sbjct: 67 GHKQGDMILVVPKGAFAVRLGNEASIKQKISVKK 100
>TAIR9_protein||AT4G32460.2 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
plant-type cell wall; EXPRESSED IN: 24 plant structures; EXPRESSED
DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Protein of
unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT5G11420.1); Has 182 Blast hits to 158 proteins
in 12 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants
- 180; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr4:15663036-15664859 REVERSE
Length = 366
Score = 463 bits (1190), Expect = 1e-130
Identities = 221/270 (81%), Positives = 250/270 (92%)
Frame = -2
Query: 1007 KESKVKGMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAESE 828
K S KG YYS+TFSAARTCAQDERLN+SVAP V+P+QTVYSSSGWDLY+WAF+A+S+
Sbjct: 95 KISVKKGSYYSITFSAARTCAQDERLNVSVAPHHAVMPIQTVYSSSGWDLYSWAFKAQSD 154
Query: 827 VAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVL 648
A+IVIHNPG EEDPACGPLIDGVAMRAL+PPRPTNKNILKNGGFEEGP VLP ++GVL
Sbjct: 155 YADIVIHNPGVEEDPACGPLIDGVAMRALFPPRPTNKNILKNGGFEEGPWVLPNISSGVL 214
Query: 647 IPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTIIGK 468
IPP DDH+PLPGWMVESLKAVKY+D++HFSVPQGRRA+ELVAGKESA+AQV RTI GK
Sbjct: 215 IPPNSIDDHSPLPGWMVESLKAVKYIDSDHFSVPQGRRAVELVAGKESAVAQVVRTIPGK 274
Query: 467 TYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMF 288
TYVLSF+VGDA+NAC GSM+VEAFAG+DT+KVPYES+G GGFKR+S+RFVAVS+RTRVMF
Sbjct: 275 TYVLSFSVGDASNACAGSMIVEAFAGKDTIKVPYESKGKGGFKRSSLRFVAVSSRTRVMF 334
Query: 287 YSTFYAMRSDDFSSLCGPVIDDVKLLSVRK 198
YSTFYAMR+DDFSSLCGPVIDDVKLLS R+
Sbjct: 335 YSTFYAMRNDDFSSLCGPVIDDVKLLSARR 364
Score = 132 bits (332), Expect = 5e-031
Identities = 63/94 (67%), Positives = 77/94 (81%)
Frame = -1
Query: 1269 LVLLLIATANSAVVPFRDGILPNGDFELGPKSSDMKGTEIINKMAIPSWEVTGFVEYISS 1090
+VLLL+ + F DG+LPNGDFELGP+ SDMKGT++IN AIP+WE++GFVEYI S
Sbjct: 7 IVLLLLHSFFYVAFCFNDGLLPNGDFELGPRHSDMKGTQVINITAIPNWELSGFVEYIPS 66
Query: 1089 GHKQGDMLLVVPAGKFAVRLGNEASIKQRIKSER 988
GHKQGDM+LVVP G FAVRLGNEASIKQ+I ++
Sbjct: 67 GHKQGDMILVVPKGAFAVRLGNEASIKQKISVKK 100
>TAIR9_protein||AT1G80240.1 | Symbols: | unknown protein |
chr1:30171520-30172799 REVERSE
Length = 371
Score = 363 bits (931), Expect = 2e-100
Identities = 176/272 (64%), Positives = 211/272 (77%)
Frame = -2
Query: 1013 SNKESKVKGMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAE 834
S K S + G YS+TFSAARTCAQDERLNISV +SGVIP+QT+Y S GWD Y+WAF+A
Sbjct: 96 SQKISVLPGRLYSITFSAARTCAQDERLNISVTHESGVIPIQTMYGSDGWDSYSWAFKAG 155
Query: 833 SEVAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTG 654
EI HNPG EE PACGPLID VA++AL+PPR + N++KNG FEEGP V P + G
Sbjct: 156 GPEIEIRFHNPGVEEHPACGPLIDAVAIKALFPPRFSGYNLIKNGNFEEGPYVFPTAKWG 215
Query: 653 VLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTII 474
VLIPPFIEDD++PLPGWM+ESLKAVKYVD HF+VP+G RAIELV GKESAI+Q+ RT +
Sbjct: 216 VLIPPFIEDDNSPLPGWMIESLKAVKYVDKAHFAVPEGHRAIELVGGKESAISQIVRTSL 275
Query: 473 GKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRV 294
K Y L+F VGDA + C+G M+VEAFAG+ + V Y S+G GGF+R + F AVS RTRV
Sbjct: 276 NKFYALTFNVGDARDGCEGPMIVEAFAGQGKVMVDYASKGKGGFRRGRLVFKAVSARTRV 335
Query: 293 MFYSTFYAMRSDDFSSLCGPVIDDVKLLSVRK 198
F STFY M+SD SLCGPVIDDV+L++V K
Sbjct: 336 TFLSTFYHMKSDHSGSLCGPVIDDVRLVAVGK 367
Score = 111 bits (277), Expect = 1e-024
Identities = 54/98 (55%), Positives = 69/98 (70%)
Frame = -1
Query: 1293 MWGVTVVSLVLLLIATANSAVVPFRDGILPNGDFELGPKSSDMKGTEIINKMAIPSWEVT 1114
M+ + L LL I++ P RDG+LPNG+FELGPK S MKG+ + + A+P+W +
Sbjct: 2 MYQEAALLLALLFISSNVVLSAPVRDGLLPNGNFELGPKPSQMKGSVVKERTAVPNWNII 61
Query: 1113 GFVEYISSGHKQGDMLLVVPAGKFAVRLGNEASIKQRI 1000
GFVE+I SG KQ DM+LVVP G AVRLGNEASI Q+I
Sbjct: 62 GFVEFIKSGQKQDDMVLVVPQGSSAVRLGNEASISQKI 99
>TAIR9_protein||AT3G08030.1 | Symbols: | unknown protein | chr3:2564191-2565819
FORWARD
Length = 366
Score = 313 bits (801), Expect = 2e-085
Identities = 157/265 (59%), Positives = 193/265 (72%)
Frame = -2
Query: 1013 SNKESKVKGMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAE 834
S K G Y+LTF A+RTCAQDE L +SV SG +P+QT+Y+S G D+YAWAF A+
Sbjct: 95 SQKLEVKPGSLYALTFGASRTCAQDEVLRVSVPSQSGDLPLQTLYNSFGGDVYAWAFVAK 154
Query: 833 SEVAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTG 654
+ + HNPG +EDPACGPL+D VA++ L P T N++KNGGFEEGP L ST G
Sbjct: 155 TSQVTVTFHNPGVQEDPACGPLLDAVAIKELVHPIYTRGNLVKNGGFEEGPHRLVNSTQG 214
Query: 653 VLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTII 474
VL+PP ED +PLPGW++ESLKAVK++D+++F+VP G AIELVAGKESAIAQV RT
Sbjct: 215 VLLPPKQEDLTSPLPGWIIESLKAVKFIDSKYFNVPFGHAAIELVAGKESAIAQVIRTSP 274
Query: 473 GKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRV 294
G+TY LSF VGDA N C GSM+VEAFA RDTLKVP+ S G G K AS +F AV RTR+
Sbjct: 275 GQTYTLSFVVGDAKNDCHGSMMVEAFAARDTLKVPHTSVGGGHVKTASFKFKAVEARTRI 334
Query: 293 MFYSTFYAMRSDDFSSLCGPVIDDV 219
F+S FY + D SLCGPVID++
Sbjct: 335 TFFSGFYHTKKTDTVSLCGPVIDEI 359
Score = 88 bits (216), Expect = 1e-017
Identities = 45/101 (44%), Positives = 62/101 (61%)
Frame = -1
Query: 1275 VSLVLLLIATANSAVVPFRDGILPNGDFELGPKSSDMKGTEIINKMAIPSWEVTGFVEYI 1096
+ L +LL+ + P +G L NG+FE PK +DMK T ++ K A+P WE TGFVEYI
Sbjct: 7 IILPILLLICGAALGAPASEGYLRNGNFEESPKKTDMKKTVLLGKNALPEWETTGFVEYI 66
Query: 1095 SSGHKQGDMLLVVPAGKFAVRLGNEASIKQRIKSERNVLLA 973
+ G + G M V G AVRLGNEA+I Q+++ + L A
Sbjct: 67 AGGPQPGGMYFPVAHGVHAVRLGNEATISQKLEVKPGSLYA 107
>TAIR9_protein||AT3G08030.2 | Symbols: | unknown protein | chr3:2564517-2565819
FORWARD
Length = 324
Score = 313 bits (801), Expect = 2e-085
Identities = 157/265 (59%), Positives = 193/265 (72%)
Frame = -2
Query: 1013 SNKESKVKGMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAE 834
S K G Y+LTF A+RTCAQDE L +SV SG +P+QT+Y+S G D+YAWAF A+
Sbjct: 53 SQKLEVKPGSLYALTFGASRTCAQDEVLRVSVPSQSGDLPLQTLYNSFGGDVYAWAFVAK 112
Query: 833 SEVAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTG 654
+ + HNPG +EDPACGPL+D VA++ L P T N++KNGGFEEGP L ST G
Sbjct: 113 TSQVTVTFHNPGVQEDPACGPLLDAVAIKELVHPIYTRGNLVKNGGFEEGPHRLVNSTQG 172
Query: 653 VLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTII 474
VL+PP ED +PLPGW++ESLKAVK++D+++F+VP G AIELVAGKESAIAQV RT
Sbjct: 173 VLLPPKQEDLTSPLPGWIIESLKAVKFIDSKYFNVPFGHAAIELVAGKESAIAQVIRTSP 232
Query: 473 GKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRV 294
G+TY LSF VGDA N C GSM+VEAFA RDTLKVP+ S G G K AS +F AV RTR+
Sbjct: 233 GQTYTLSFVVGDAKNDCHGSMMVEAFAARDTLKVPHTSVGGGHVKTASFKFKAVEARTRI 292
Query: 293 MFYSTFYAMRSDDFSSLCGPVIDDV 219
F+S FY + D SLCGPVID++
Sbjct: 293 TFFSGFYHTKKTDTVSLCGPVIDEI 317
>TAIR9_protein||AT2G41800.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cell
wall, plant-type cell wall; EXPRESSED IN: 7 plant structures; EXPRESSED
DURING: 4 anthesis, C globular stage, petal differentiation and
expansion stage; CONTAINS InterPro DOMAIN/s: Protein of unknown
function DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT2G41810.1); Has 156 Blast hits to 155 proteins
in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants
- 156; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr2:17436671-17438005 REVERSE
Length = 371
Score = 312 bits (799), Expect = 3e-085
Identities = 151/263 (57%), Positives = 188/263 (71%)
Frame = -2
Query: 989 GMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAESEVAEIVI 810
G+ YSLTF A RTCAQDE + +SV + +P+QTV+SS G D YAWAF+A S+V ++
Sbjct: 108 GLVYSLTFGATRTCAQDENIKVSVPGQANELPLQTVFSSDGGDTYAWAFKATSDVVKVTF 167
Query: 809 HNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIE 630
HNPG +ED CGPL+D VA++ + P R T N++KNGGFE GP V +TG+LIP I+
Sbjct: 168 HNPGVQEDRTCGPLLDVVAIKEILPLRYTRGNLVKNGGFEIGPHVFANFSTGILIPARIQ 227
Query: 629 DDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTIIGKTYVLSF 450
D +PLPGW+VESLK VKY+D HF VP G+ A+ELVAG+ESAIAQ+ RTI GK Y+LSF
Sbjct: 228 DFISPLPGWIVESLKPVKYIDRRHFKVPYGQGAVELVAGRESAIAQIIRTIAGKAYMLSF 287
Query: 449 AVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMFYSTFYA 270
AVGDA N C GSM+VEAFAGR+ K+ + S G G FK RFVA S RTR+ FYS FY
Sbjct: 288 AVGDAQNGCHGSMMVEAFAGREPFKLSFMSEGKGAFKTGHFRFVADSDRTRLTFYSAFYH 347
Query: 269 MRSDDFSSLCGPVIDDVKLLSVR 201
+ DF LCGPV+D V + R
Sbjct: 348 TKLHDFGHLCGPVLDSVVVTLAR 370
>TAIR9_protein||AT2G41810.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
endomembrane system; EXPRESSED IN: root; CONTAINS InterPro DOMAIN/s:
Protein of unknown function DUF642 (InterPro:IPR006946),
Galactose-binding like (InterPro:IPR008979); BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT2G41800.1); Has 161 Blast
hits to 157 proteins in 12 species: Archae - 0; Bacteria - 2; Metazoa -
0; Fungi - 0; Plants - 159; Viruses - 0; Other Eukaryotes - 0 (source:
NCBI BLink). | chr2:17439414-17441296 REVERSE
Length = 371
Score = 304 bits (777), Expect = 1e-082
Identities = 146/263 (55%), Positives = 187/263 (71%), Gaps = 1/263 (0%)
Frame = -2
Query: 998 KVK-GMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAESEVA 822
KVK G+ YSLTF RTCAQDE + ISV + +P+QT++S++G D YAWAF+A S++
Sbjct: 104 KVKSGLVYSLTFGVTRTCAQDENIRISVPGQTNELPIQTLFSTNGGDTYAWAFKATSDLV 163
Query: 821 EIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIP 642
++ +NPG +EDP CGP++D VA++ + P R T N++KNGGFE GP V +TG+LIP
Sbjct: 164 KVTFYNPGVQEDPTCGPIVDAVAIKEILPLRYTKGNLVKNGGFETGPHVFSNFSTGILIP 223
Query: 641 PFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTIIGKTY 462
I+D +PLPGW+VESLK VKY+D HF VP G AIELVAG+ESAIAQ+ RT+ GK Y
Sbjct: 224 AKIQDLISPLPGWIVESLKPVKYIDNRHFKVPSGLAAIELVAGRESAIAQIIRTVSGKNY 283
Query: 461 VLSFAVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMFYS 282
+LSF VGDA+N C GSM+VEAFAG KV +ES G FK F A S RTR+ FYS
Sbjct: 284 ILSFVVGDAHNGCHGSMMVEAFAGISAFKVTFESNDKGAFKVGRFAFRADSNRTRITFYS 343
Query: 281 TFYAMRSDDFSSLCGPVIDDVKL 213
FY + DF LCGPV+D+V +
Sbjct: 344 GFYHTKLHDFGHLCGPVLDNVSV 366
>TAIR9_protein||AT2G34510.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN: anchored
to membrane; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 14
growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function
DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT1G29980.1); Has 177 Blast hits to 163 proteins
in 15 species: Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0;
Plants - 167; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr2:14544114-14546732 REVERSE
Length = 402
Score = 265 bits (675), Expect = 8e-071
Identities = 128/265 (48%), Positives = 178/265 (67%), Gaps = 5/265 (1%)
Frame = -2
Query: 992 KGMYYSLTFSAARTCAQDERLNISVAPD-----SGVIPVQTVYSSSGWDLYAWAFQAESE 828
KG YS+TFSAARTCAQ E LN+SVA S I +QTVYS GWD YAWAF+A +
Sbjct: 114 KGSIYSVTFSAARTCAQLESLNVSVASSDEPIASQTIDLQTVYSVQGWDPYAWAFEAVVD 173
Query: 827 VAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVL 648
+V NPG E+DP CGP+ID +A++ L+ P N + NG FEEGP + +T GVL
Sbjct: 174 RVRLVFKNPGMEDDPTCGPIIDDIAVKKLFTPDKPKGNAVINGDFEEGPWMFRNTTLGVL 233
Query: 647 IPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTIIGK 468
+P ++++ + LPGW VES +AV+++D++HFSVP+G+RA+EL++GKE I+Q+ T
Sbjct: 234 LPTNLDEEISSLPGWTVESNRAVRFIDSDHFSVPEGKRALELLSGKEGIISQMVETKANI 293
Query: 467 TYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMF 288
Y +SF++G A + CK + V AFAG Y ++ F+R+ + F A + RTR+ F
Sbjct: 294 PYKMSFSLGHAGDKCKEPLAVMAFAGDQAQNFHYMAQANSSFERSELNFTAKAERTRIAF 353
Query: 287 YSTFYAMRSDDFSSLCGPVIDDVKL 213
YS +Y R+DD +SLCGPVIDDVK+
Sbjct: 354 YSIYYNTRTDDMTSLCGPVIDDVKV 378
Score = 83 bits (203), Expect = 4e-016
Identities = 47/112 (41%), Positives = 64/112 (57%), Gaps = 2/112 (1%)
Frame = -1
Query: 1317 LSSQCSFTMWGVTVVSLVLLLIATANSA--VVPFRDGILPNGDFELGPKSSDMKGTEIIN 1144
L S S+ + ++ L L ++A A+SA P DG++ NGDFE P + I +
Sbjct: 3 LYSNNSWRSNSILILLLGLSIVAAADSAGKTSPVEDGLVVNGDFETPPSNGFPDDAIIED 62
Query: 1143 KMAIPSWEVTGFVEYISSGHKQGDMLLVVPAGKFAVRLGNEASIKQRIKSER 988
IPSW G VE I SG KQG M+L+VP G+ AVRLGN+A I Q + E+
Sbjct: 63 TSEIPSWRSDGTVELIKSGQKQGGMILIVPEGRHAVRLGNDAEISQELTVEK 114
>TAIR9_protein||AT1G29980.2 | Symbols: | INVOLVED IN: biological_process
unknown; LOCATED IN: plasma membrane, anchored to membrane; EXPRESSED
IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS
InterPro DOMAIN/s: Protein of unknown function DUF642
(InterPro:IPR006946), Galactose-binding like (InterPro:IPR008979); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G34510.1); Has 174 Blast hits to 163 proteins in 15 species:
Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0; Plants - 164;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr1:10503411-10504617 REVERSE
Length = 372
Score = 239 bits (608), Expect = 4e-063
Identities = 107/229 (46%), Positives = 158/229 (68%)
Frame = -2
Query: 899 IPVQTVYSSSGWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTN 720
+ +QT+YS GWD YAWAF+AE + +V NPG E+DP CGP+ID +A++ L+ P
Sbjct: 118 VDLQTLYSVQGWDPYAWAFEAEDDHVRLVFKNPGMEDDPTCGPIIDDIAIKKLFTPDKPK 177
Query: 719 KNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQG 540
N + NG FE+GP + ++ GVL+P ++++ + LPGW VES +AV++VD++HFSVP+G
Sbjct: 178 DNAVINGDFEDGPWMFRNTSLGVLLPTNLDEEISSLPGWTVESNRAVRFVDSDHFSVPKG 237
Query: 539 RRAIELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYES 360
+RA+EL++GKE I+Q+ T K Y+LSF++G A + CK + + AFAG Y +
Sbjct: 238 KRAVELLSGKEGIISQMVETKADKPYILSFSLGHAGDKCKEPLAIMAFAGDQAQNFHYMA 297
Query: 359 RGTGGFKRASIRFVAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDVKL 213
+ F++A + F A + RTRV FYS +Y R+DD SSLCGPVIDDV++
Sbjct: 298 QANSSFEKAGLNFTAKADRTRVAFYSVYYNTRTDDMSSLCGPVIDDVRV 346
>TAIR9_protein||AT1G29980.1 | Symbols: | INVOLVED IN: biological_process
unknown; LOCATED IN: plasma membrane, anchored to membrane; EXPRESSED
IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS
InterPro DOMAIN/s: Protein of unknown function DUF642
(InterPro:IPR006946), Galactose-binding like (InterPro:IPR008979); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G34510.1); Has 180 Blast hits to 163 proteins in 15 species:
Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0; Plants - 170;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr1:10503411-10505994 REVERSE
Length = 408
Score = 239 bits (608), Expect = 4e-063
Identities = 107/229 (46%), Positives = 158/229 (68%)
Frame = -2
Query: 899 IPVQTVYSSSGWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTN 720
+ +QT+YS GWD YAWAF+AE + +V NPG E+DP CGP+ID +A++ L+ P
Sbjct: 154 VDLQTLYSVQGWDPYAWAFEAEDDHVRLVFKNPGMEDDPTCGPIIDDIAIKKLFTPDKPK 213
Query: 719 KNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQG 540
N + NG FE+GP + ++ GVL+P ++++ + LPGW VES +AV++VD++HFSVP+G
Sbjct: 214 DNAVINGDFEDGPWMFRNTSLGVLLPTNLDEEISSLPGWTVESNRAVRFVDSDHFSVPKG 273
Query: 539 RRAIELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYES 360
+RA+EL++GKE I+Q+ T K Y+LSF++G A + CK + + AFAG Y +
Sbjct: 274 KRAVELLSGKEGIISQMVETKADKPYILSFSLGHAGDKCKEPLAIMAFAGDQAQNFHYMA 333
Query: 359 RGTGGFKRASIRFVAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDVKL 213
+ F++A + F A + RTRV FYS +Y R+DD SSLCGPVIDDV++
Sbjct: 334 QANSSFEKAGLNFTAKADRTRVAFYSVYYNTRTDDMSSLCGPVIDDVRV 382
>TAIR9_protein||AT5G14150.1 | Symbols: | unknown protein | chr5:4565246-4566653
REVERSE
Length = 384
Score = 133 bits (334), Expect = 3e-031
Identities = 94/286 (32%), Positives = 141/286 (49%), Gaps = 16/286 (5%)
Frame = -2
Query: 1040 QSGSETKRRSNKESKVKGMYYSLTFS---AARTCAQDERLNISVAPDSGVIPVQTVYSSS 870
Q G + K +K + Y LTF+ A + C L++S + V + YS
Sbjct: 74 QLGEDGKINQTFIAKGDELNYILTFALIHAGQNCTSSAGLSVSGPDSNAVFSYRQNYSKV 133
Query: 869 GWDLYAWAFQA--ESEVAEIVIHNPGEEED----PACGPLIDGVAMRALYPPRPTNK-NI 711
W Y+ + E +V+ + + D C P+ID + ++ + + N+
Sbjct: 134 SWQSYSHNLGSWGNGEPINLVLESQAIDSDSDTNSTCWPIIDTLLIKTVGVTLVQDSGNL 193
Query: 710 LKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRA 531
L NGGFE GP LP ST GVLI +PL W V + V+Y+D+EHF VP+G+ A
Sbjct: 194 LINGGFESGPGFLPNSTDGVLIDAVPSLIQSPLRQWSV--IGTVRYIDSEHFHVPEGKAA 251
Query: 530 IELVAGKESAIAQVAR--TIIGKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYESR 357
IE+++ + Q A T G Y L+F +GDAN+AC+G VV A AG T ES
Sbjct: 252 IEILSNTAPSGIQTATKGTSEGSRYNLTFTLGDANDACRGHFVVGAQAGSVTQNFTLESN 311
Query: 356 GTGGFKRASIRFVAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDV 219
GTG ++ + F A ++ F T Y++ + +CGPVID+V
Sbjct: 312 GTGSGEKFGLVFEADKDAAQISF--TSYSVTMTKENVVCGPVIDEV 355
Database: TAIR9 protein
Posted date: Wed Jul 08 15:16:08 2009
Number of letters in database: 13,468,323
Number of sequences in database: 33,410
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,660,019,295
Number of Sequences: 33410
Number of Extensions: 9660019295
Number of Successful Extensions: 322770282
Number of sequences better than 0.0: 0
|