BLASTX 7.6.2
Query= UN15621 /QuerySize=1394
(1393 letters)
Database: TAIR9 protein;
33,410 sequences; 13,468,323 total letters
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
TAIR9_protein||AT5G25460.1 | Symbols: | unknown protein | chr5:... 683 1e-196
TAIR9_protein||AT5G11420.1 | Symbols: | FUNCTIONS IN: molecular... 648 4e-186
TAIR9_protein||AT4G32460.1 | Symbols: | FUNCTIONS IN: molecular... 589 2e-168
TAIR9_protein||AT4G32460.2 | Symbols: | FUNCTIONS IN: molecular... 589 2e-168
TAIR9_protein||AT1G80240.1 | Symbols: | unknown protein | chr1:... 472 3e-133
TAIR9_protein||AT3G08030.1 | Symbols: | unknown protein | chr3:... 394 7e-110
TAIR9_protein||AT2G41800.1 | Symbols: | FUNCTIONS IN: molecular... 389 2e-108
TAIR9_protein||AT2G41810.1 | Symbols: | FUNCTIONS IN: molecular... 377 8e-105
TAIR9_protein||AT3G08030.2 | Symbols: | unknown protein | chr3:... 371 6e-103
TAIR9_protein||AT2G34510.1 | Symbols: | FUNCTIONS IN: molecular... 340 1e-093
TAIR9_protein||AT1G29980.2 | Symbols: | INVOLVED IN: biological... 240 2e-063
TAIR9_protein||AT1G29980.1 | Symbols: | INVOLVED IN: biological... 240 2e-063
TAIR9_protein||AT5G14150.1 | Symbols: | unknown protein | chr5:... 149 6e-036
>TAIR9_protein||AT5G25460.1 | Symbols: | unknown protein | chr5:8863430-8865394
FORWARD
Length = 370
Score = 683 bits (1760), Expect = 1e-196
Identities = 337/368 (91%), Positives = 355/368 (96%), Gaps = 2/368 (0%)
Frame = +2
Query: 62 MWGVTVVSLVLLLIATANSA--VVPFRDGILPNGDFELGPKPSDLKGTEIINKMAIPNWE 235
M GVTVVS LL IATA +A V FRDG+LPNGDFELGPKPSD+KGTEI+NK+AIPNWE
Sbjct: 1 MEGVTVVSFFLLFIATAMAAKSTVSFRDGMLPNGDFELGPKPSDMKGTEILNKLAIPNWE 60
Query: 236 VTGFVEYISSGHKQGDMLLVVPAGKFAVRLGNEASIKQRIKVVKGMYYSLTFSAARTCAQ 415
VTGFVEYI SGHKQGDMLLVVPAGKFAVRLGNEASIKQR+KVVKGMYYSLTFSAARTCAQ
Sbjct: 61 VTGFVEYIKSGHKQGDMLLVVPAGKFAVRLGNEASIKQRLKVVKGMYYSLTFSAARTCAQ 120
Query: 416 DERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLID 595
DERLNISVAPDSGVIP+QTVYSSSGWDLYAWAFQAES+VAE+VIHNPG EEDPACGPLID
Sbjct: 121 DERLNISVAPDSGVIPIQTVYSSSGWDLYAWAFQAESDVAEVVIHNPGVEEDPACGPLID 180
Query: 596 GVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKA 775
GVAMR+LYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDH+PLPGWMVESLKA
Sbjct: 181 GVAMRSLYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHSPLPGWMVESLKA 240
Query: 776 VKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVE 955
VKYVD EHFSVPQGRRAIELVAGKESAIAQV RT+IGKTYVLSFAVGDANNACKGSMVVE
Sbjct: 241 VKYVDVEHFSVPQGRRAIELVAGKESAIAQVVRTVIGKTYVLSFAVGDANNACKGSMVVE 300
Query: 956 AFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDD 1135
AFAG+DTLKVPYES+GTGGFKRASIRFVAVSTR+R+MFYSTFY+MRSDDFSSLCGPVIDD
Sbjct: 301 AFAGKDTLKVPYESKGTGGFKRASIRFVAVSTRSRIMFYSTFYAMRSDDFSSLCGPVIDD 360
Query: 1136 VKLISVRQ 1159
VKLISVR+
Sbjct: 361 VKLISVRK 368
>TAIR9_protein||AT5G11420.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cell
wall, plant-type cell wall; EXPRESSED IN: 23 plant structures;
EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein
of unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT5G25460.1); Has 185 Blast hits to 157 proteins
in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants
- 185; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr5:3644655-3646991 FORWARD
Length = 367
Score = 648 bits (1669), Expect = 4e-186
Identities = 315/364 (86%), Positives = 343/364 (94%), Gaps = 1/364 (0%)
Frame = +2
Query: 68 GVTVVSLVLLLIATANSAVVPFRDGILPNGDFELGPKPSDLKGTEIINKMAIPNWEVTGF 247
G ++ L +LLIAT S V+ F DG+LPNGDFELGPKPSD+KGT++INK AIP+WE++GF
Sbjct: 3 GGSLSFLFVLLIATITS-VICFSDGMLPNGDFELGPKPSDMKGTQVINKKAIPSWELSGF 61
Query: 248 VEYISSGHKQGDMLLVVPAGKFAVRLGNEASIKQRIKVVKGMYYSLTFSAARTCAQDERL 427
VEYI SG KQGDMLLVVPAGKFA+RLGNEASIKQR+ V KGMYYSLTFSAARTCAQDERL
Sbjct: 62 VEYIKSGQKQGDMLLVVPAGKFAIRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDERL 121
Query: 428 NISVAPDSGVIPVQTVYSSSGWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLIDGVAM 607
NISVAPDSGVIP+QTVYSSSGWDLYAWAFQAES VAEIVIHNPGEEEDPACGPLIDGVA+
Sbjct: 122 NISVAPDSGVIPIQTVYSSSGWDLYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVAI 181
Query: 608 RALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVKYV 787
+ALYPPRPTNKNILKNGGFEEGP VLP +TTGVL+PPFIEDDH+PLP WMVESLKA+KYV
Sbjct: 182 KALYPPRPTNKNILKNGGFEEGPYVLPNATTGVLVPPFIEDDHSPLPAWMVESLKAIKYV 241
Query: 788 DTEHFSVPQGRRAIELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVEAFAG 967
D EHFSVPQGRRA+ELVAGKESAIAQVART++GKTYVLSFAVGDANNAC+GSMVVEAFAG
Sbjct: 242 DVEHFSVPQGRRAVELVAGKESAIAQVARTVVGKTYVLSFAVGDANNACQGSMVVEAFAG 301
Query: 968 RDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDVKLI 1147
+DTLKVPYESRG GGFKRAS+RFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDVKL+
Sbjct: 302 KDTLKVPYESRGKGGFKRASLRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDVKLL 361
Query: 1148 SVRQ 1159
S R+
Sbjct: 362 SARK 365
>TAIR9_protein||AT4G32460.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
plant-type cell wall; EXPRESSED IN: 24 plant structures; EXPRESSED
DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Protein of
unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT5G11420.1); Has 182 Blast hits to 158 proteins
in 12 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants
- 180; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr4:15663036-15664859 REVERSE
Length = 366
Score = 589 bits (1516), Expect = 2e-168
Identities = 281/358 (78%), Positives = 324/358 (90%)
Frame = +2
Query: 86 LVLLLIATANSAVVPFRDGILPNGDFELGPKPSDLKGTEIINKMAIPNWEVTGFVEYISS 265
+VLLL+ + F DG+LPNGDFELGP+ SD+KGT++IN AIPNWE++GFVEYI S
Sbjct: 7 IVLLLLHSFFYVAFCFNDGLLPNGDFELGPRHSDMKGTQVINITAIPNWELSGFVEYIPS 66
Query: 266 GHKQGDMLLVVPAGKFAVRLGNEASIKQRIKVVKGMYYSLTFSAARTCAQDERLNISVAP 445
GHKQGDM+LVVP G FAVRLGNEASIKQ+I V KG YYS+TFSAARTCAQDERLN+SVAP
Sbjct: 67 GHKQGDMILVVPKGAFAVRLGNEASIKQKISVKKGSYYSITFSAARTCAQDERLNVSVAP 126
Query: 446 DSGVIPVQTVYSSSGWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLIDGVAMRALYPP 625
V+P+QTVYSSSGWDLY+WAF+A+S+ A+IVIHNPG EEDPACGPLIDGVAMRAL+PP
Sbjct: 127 HHAVMPIQTVYSSSGWDLYSWAFKAQSDYADIVIHNPGVEEDPACGPLIDGVAMRALFPP 186
Query: 626 RPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFS 805
RPTNKNILKNGGFEEGP VLP ++GVLIPP DDH+PLPGWMVESLKAVKY+D++HFS
Sbjct: 187 RPTNKNILKNGGFEEGPWVLPNISSGVLIPPNSIDDHSPLPGWMVESLKAVKYIDSDHFS 246
Query: 806 VPQGRRAIELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKV 985
VPQGRRA+ELVAGKESA+AQV RTI GKTYVLSF+VGDA+NAC GSM+VEAFAG+DT+KV
Sbjct: 247 VPQGRRAVELVAGKESAVAQVVRTIPGKTYVLSFSVGDASNACAGSMIVEAFAGKDTIKV 306
Query: 986 PYESRGTGGFKRASIRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDVKLISVRQ 1159
PYES+G GGFKR+S+RFVAVS+RTRVMFYSTFY+MR+DDFSSLCGPVIDDVKL+S R+
Sbjct: 307 PYESKGKGGFKRSSLRFVAVSSRTRVMFYSTFYAMRNDDFSSLCGPVIDDVKLLSARR 364
>TAIR9_protein||AT4G32460.2 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
plant-type cell wall; EXPRESSED IN: 24 plant structures; EXPRESSED
DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Protein of
unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT5G11420.1); Has 182 Blast hits to 158 proteins
in 12 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants
- 180; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr4:15663036-15664859 REVERSE
Length = 366
Score = 589 bits (1516), Expect = 2e-168
Identities = 281/358 (78%), Positives = 324/358 (90%)
Frame = +2
Query: 86 LVLLLIATANSAVVPFRDGILPNGDFELGPKPSDLKGTEIINKMAIPNWEVTGFVEYISS 265
+VLLL+ + F DG+LPNGDFELGP+ SD+KGT++IN AIPNWE++GFVEYI S
Sbjct: 7 IVLLLLHSFFYVAFCFNDGLLPNGDFELGPRHSDMKGTQVINITAIPNWELSGFVEYIPS 66
Query: 266 GHKQGDMLLVVPAGKFAVRLGNEASIKQRIKVVKGMYYSLTFSAARTCAQDERLNISVAP 445
GHKQGDM+LVVP G FAVRLGNEASIKQ+I V KG YYS+TFSAARTCAQDERLN+SVAP
Sbjct: 67 GHKQGDMILVVPKGAFAVRLGNEASIKQKISVKKGSYYSITFSAARTCAQDERLNVSVAP 126
Query: 446 DSGVIPVQTVYSSSGWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLIDGVAMRALYPP 625
V+P+QTVYSSSGWDLY+WAF+A+S+ A+IVIHNPG EEDPACGPLIDGVAMRAL+PP
Sbjct: 127 HHAVMPIQTVYSSSGWDLYSWAFKAQSDYADIVIHNPGVEEDPACGPLIDGVAMRALFPP 186
Query: 626 RPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFS 805
RPTNKNILKNGGFEEGP VLP ++GVLIPP DDH+PLPGWMVESLKAVKY+D++HFS
Sbjct: 187 RPTNKNILKNGGFEEGPWVLPNISSGVLIPPNSIDDHSPLPGWMVESLKAVKYIDSDHFS 246
Query: 806 VPQGRRAIELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKV 985
VPQGRRA+ELVAGKESA+AQV RTI GKTYVLSF+VGDA+NAC GSM+VEAFAG+DT+KV
Sbjct: 247 VPQGRRAVELVAGKESAVAQVVRTIPGKTYVLSFSVGDASNACAGSMIVEAFAGKDTIKV 306
Query: 986 PYESRGTGGFKRASIRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDVKLISVRQ 1159
PYES+G GGFKR+S+RFVAVS+RTRVMFYSTFY+MR+DDFSSLCGPVIDDVKL+S R+
Sbjct: 307 PYESKGKGGFKRSSLRFVAVSSRTRVMFYSTFYAMRNDDFSSLCGPVIDDVKLLSARR 364
>TAIR9_protein||AT1G80240.1 | Symbols: | unknown protein |
chr1:30171520-30172799 REVERSE
Length = 371
Score = 472 bits (1214), Expect = 3e-133
Identities = 228/364 (62%), Positives = 278/364 (76%)
Frame = +2
Query: 62 MWGVTVVSLVLLLIATANSAVVPFRDGILPNGDFELGPKPSDLKGTEIINKMAIPNWEVT 241
M+ + L LL I++ P RDG+LPNG+FELGPKPS +KG+ + + A+PNW +
Sbjct: 2 MYQEAALLLALLFISSNVVLSAPVRDGLLPNGNFELGPKPSQMKGSVVKERTAVPNWNII 61
Query: 242 GFVEYISSGHKQGDMLLVVPAGKFAVRLGNEASIKQRIKVVKGMYYSLTFSAARTCAQDE 421
GFVE+I SG KQ DM+LVVP G AVRLGNEASI Q+I V+ G YS+TFSAARTCAQDE
Sbjct: 62 GFVEFIKSGQKQDDMVLVVPQGSSAVRLGNEASISQKISVLPGRLYSITFSAARTCAQDE 121
Query: 422 RLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLIDGV 601
RLNISV +SGVIP+QT+Y S GWD Y+WAF+A EI HNPG EE PACGPLID V
Sbjct: 122 RLNISVTHESGVIPIQTMYGSDGWDSYSWAFKAGGPEIEIRFHNPGVEEHPACGPLIDAV 181
Query: 602 AMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVK 781
A++AL+PPR + N++KNG FEEGP V P + GVLIPPFIEDD++PLPGWM+ESLKAVK
Sbjct: 182 AIKALFPPRFSGYNLIKNGNFEEGPYVFPTAKWGVLIPPFIEDDNSPLPGWMIESLKAVK 241
Query: 782 YVDTEHFSVPQGRRAIELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVEAF 961
YVD HF+VP+G RAIELV GKESAI+Q+ RT + K Y L+F VGDA + C+G M+VEAF
Sbjct: 242 YVDKAHFAVPEGHRAIELVGGKESAISQIVRTSLNKFYALTFNVGDARDGCEGPMIVEAF 301
Query: 962 AGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDVK 1141
AG+ + V Y S+G GGF+R + F AVS RTRV F STFY M+SD SLCGPVIDDV+
Sbjct: 302 AGQGKVMVDYASKGKGGFRRGRLVFKAVSARTRVTFLSTFYHMKSDHSGSLCGPVIDDVR 361
Query: 1142 LISV 1153
L++V
Sbjct: 362 LVAV 365
>TAIR9_protein||AT3G08030.1 | Symbols: | unknown protein | chr3:2564191-2565819
FORWARD
Length = 366
Score = 394 bits (1012), Expect = 7e-110
Identities = 198/353 (56%), Positives = 251/353 (71%)
Frame = +2
Query: 80 VSLVLLLIATANSAVVPFRDGILPNGDFELGPKPSDLKGTEIINKMAIPNWEVTGFVEYI 259
+ L +LL+ + P +G L NG+FE PK +D+K T ++ K A+P WE TGFVEYI
Sbjct: 7 IILPILLLICGAALGAPASEGYLRNGNFEESPKKTDMKKTVLLGKNALPEWETTGFVEYI 66
Query: 260 SSGHKQGDMLLVVPAGKFAVRLGNEASIKQRIKVVKGMYYSLTFSAARTCAQDERLNISV 439
+ G + G M V G AVRLGNEA+I Q+++V G Y+LTF A+RTCAQDE L +SV
Sbjct: 67 AGGPQPGGMYFPVAHGVHAVRLGNEATISQKLEVKPGSLYALTFGASRTCAQDEVLRVSV 126
Query: 440 APDSGVIPVQTVYSSSGWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLIDGVAMRALY 619
SG +P+QT+Y+S G D+YAWAF A++ + HNPG +EDPACGPL+D VA++ L
Sbjct: 127 PSQSGDLPLQTLYNSFGGDVYAWAFVAKTSQVTVTFHNPGVQEDPACGPLLDAVAIKELV 186
Query: 620 PPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVKYVDTEH 799
P T N++KNGGFEEGP L ST GVL+PP ED +PLPGW++ESLKAVK++D+++
Sbjct: 187 HPIYTRGNLVKNGGFEEGPHRLVNSTQGVLLPPKQEDLTSPLPGWIIESLKAVKFIDSKY 246
Query: 800 FSVPQGRRAIELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVEAFAGRDTL 979
F+VP G AIELVAGKESAIAQV RT G+TY LSF VGDA N C GSM+VEAFA RDTL
Sbjct: 247 FNVPFGHAAIELVAGKESAIAQVIRTSPGQTYTLSFVVGDAKNDCHGSMMVEAFAARDTL 306
Query: 980 KVPYESRGTGGFKRASIRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDV 1138
KVP+ S G G K AS +F AV RTR+ F+S FY + D SLCGPVID++
Sbjct: 307 KVPHTSVGGGHVKTASFKFKAVEARTRITFFSGFYHTKKTDTVSLCGPVIDEI 359
>TAIR9_protein||AT2G41800.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cell
wall, plant-type cell wall; EXPRESSED IN: 7 plant structures; EXPRESSED
DURING: 4 anthesis, C globular stage, petal differentiation and
expansion stage; CONTAINS InterPro DOMAIN/s: Protein of unknown
function DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT2G41810.1); Has 156 Blast hits to 155 proteins
in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants
- 156; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr2:17436671-17438005 REVERSE
Length = 371
Score = 389 bits (999), Expect = 2e-108
Identities = 189/344 (54%), Positives = 240/344 (69%)
Frame = +2
Query: 125 VPFRDGILPNGDFELGPKPSDLKGTEIINKMAIPNWEVTGFVEYISSGHKQGDMLLVVPA 304
VP DGILPNG+FE+ P S++KG +II ++P+WE+ G VE +S G + G VP
Sbjct: 27 VPHLDGILPNGNFEITPLKSNMKGRQIIGANSLPHWEIAGHVELVSGGPQPGGFYFPVPR 86
Query: 305 GKFAVRLGNEASIKQRIKVVKGMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSS 484
G AVRLGN +I Q ++V G+ YSLTF A RTCAQDE + +SV + +P+QTV+SS
Sbjct: 87 GVHAVRLGNLGTISQNVRVKSGLVYSLTFGATRTCAQDENIKVSVPGQANELPLQTVFSS 146
Query: 485 SGWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGF 664
G D YAWAF+A S+V ++ HNPG +ED CGPL+D VA++ + P R T N++KNGGF
Sbjct: 147 DGGDTYAWAFKATSDVVKVTFHNPGVQEDRTCGPLLDVVAIKEILPLRYTRGNLVKNGGF 206
Query: 665 EEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAG 844
E GP V +TG+LIP I+D +PLPGW+VESLK VKY+D HF VP G+ A+ELVAG
Sbjct: 207 EIGPHVFANFSTGILIPARIQDFISPLPGWIVESLKPVKYIDRRHFKVPYGQGAVELVAG 266
Query: 845 KESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRA 1024
+ESAIAQ+ RTI GK Y+LSFAVGDA N C GSM+VEAFAGR+ K+ + S G G FK
Sbjct: 267 RESAIAQIIRTIAGKAYMLSFAVGDAQNGCHGSMMVEAFAGREPFKLSFMSEGKGAFKTG 326
Query: 1025 SIRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDVKLISVR 1156
RFVA S RTR+ FYS FY + DF LCGPV+D V + R
Sbjct: 327 HFRFVADSDRTRLTFYSAFYHTKLHDFGHLCGPVLDSVVVTLAR 370
>TAIR9_protein||AT2G41810.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
endomembrane system; EXPRESSED IN: root; CONTAINS InterPro DOMAIN/s:
Protein of unknown function DUF642 (InterPro:IPR006946),
Galactose-binding like (InterPro:IPR008979); BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT2G41800.1); Has 161 Blast
hits to 157 proteins in 12 species: Archae - 0; Bacteria - 2; Metazoa -
0; Fungi - 0; Plants - 159; Viruses - 0; Other Eukaryotes - 0 (source:
NCBI BLink). | chr2:17439414-17441296 REVERSE
Length = 371
Score = 377 bits (968), Expect = 8e-105
Identities = 180/339 (53%), Positives = 235/339 (69%)
Frame = +2
Query: 128 PFRDGILPNGDFELGPKPSDLKGTEIINKMAIPNWEVTGFVEYISSGHKQGDMLLVVPAG 307
P DG+LPNG+FE P S+++ +II K ++P+WE++G VE +S G + G VP G
Sbjct: 28 PHLDGLLPNGNFEQIPNKSNMRKRQIIGKYSLPHWEISGHVELVSGGPQPGGFYFAVPRG 87
Query: 308 KFAVRLGNEASIKQRIKVVKGMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSSS 487
A RLGN ASI Q +KV G+ YSLTF RTCAQDE + ISV + +P+QT++S++
Sbjct: 88 VHAARLGNLASISQYVKVKSGLVYSLTFGVTRTCAQDENIRISVPGQTNELPIQTLFSTN 147
Query: 488 GWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFE 667
G D YAWAF+A S++ ++ +NPG +EDP CGP++D VA++ + P R T N++KNGGFE
Sbjct: 148 GGDTYAWAFKATSDLVKVTFYNPGVQEDPTCGPIVDAVAIKEILPLRYTKGNLVKNGGFE 207
Query: 668 EGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGK 847
GP V +TG+LIP I+D +PLPGW+VESLK VKY+D HF VP G AIELVAG+
Sbjct: 208 TGPHVFSNFSTGILIPAKIQDLISPLPGWIVESLKPVKYIDNRHFKVPSGLAAIELVAGR 267
Query: 848 ESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRAS 1027
ESAIAQ+ RT+ GK Y+LSF VGDA+N C GSM+VEAFAG KV +ES G FK
Sbjct: 268 ESAIAQIIRTVSGKNYILSFVVGDAHNGCHGSMMVEAFAGISAFKVTFESNDKGAFKVGR 327
Query: 1028 IRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDVKL 1144
F A S RTR+ FYS FY + DF LCGPV+D+V +
Sbjct: 328 FAFRADSNRTRITFYSGFYHTKLHDFGHLCGPVLDNVSV 366
>TAIR9_protein||AT3G08030.2 | Symbols: | unknown protein | chr3:2564517-2565819
FORWARD
Length = 324
Score = 371 bits (952), Expect = 6e-103
Identities = 185/317 (58%), Positives = 231/317 (72%)
Frame = +2
Query: 188 LKGTEIINKMAIPNWEVTGFVEYISSGHKQGDMLLVVPAGKFAVRLGNEASIKQRIKVVK 367
+K T ++ K A+P WE TGFVEYI+ G + G M V G AVRLGNEA+I Q+++V
Sbjct: 1 MKKTVLLGKNALPEWETTGFVEYIAGGPQPGGMYFPVAHGVHAVRLGNEATISQKLEVKP 60
Query: 368 GMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAESEVAEIVI 547
G Y+LTF A+RTCAQDE L +SV SG +P+QT+Y+S G D+YAWAF A++ +
Sbjct: 61 GSLYALTFGASRTCAQDEVLRVSVPSQSGDLPLQTLYNSFGGDVYAWAFVAKTSQVTVTF 120
Query: 548 HNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIE 727
HNPG +EDPACGPL+D VA++ L P T N++KNGGFEEGP L ST GVL+PP E
Sbjct: 121 HNPGVQEDPACGPLLDAVAIKELVHPIYTRGNLVKNGGFEEGPHRLVNSTQGVLLPPKQE 180
Query: 728 DDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTIIGKTYVLSF 907
D +PLPGW++ESLKAVK++D+++F+VP G AIELVAGKESAIAQV RT G+TY LSF
Sbjct: 181 DLTSPLPGWIIESLKAVKFIDSKYFNVPFGHAAIELVAGKESAIAQVIRTSPGQTYTLSF 240
Query: 908 AVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMFYSTFYS 1087
VGDA N C GSM+VEAFA RDTLKVP+ S G G K AS +F AV RTR+ F+S FY
Sbjct: 241 VVGDAKNDCHGSMMVEAFAARDTLKVPHTSVGGGHVKTASFKFKAVEARTRITFFSGFYH 300
Query: 1088 MRSDDFSSLCGPVIDDV 1138
+ D SLCGPVID++
Sbjct: 301 TKKTDTVSLCGPVIDEI 317
>TAIR9_protein||AT2G34510.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN: anchored
to membrane; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 14
growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function
DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT1G29980.1); Has 177 Blast hits to 163 proteins
in 15 species: Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0;
Plants - 167; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr2:14544114-14546732 REVERSE
Length = 402
Score = 340 bits (871), Expect = 1e-093
Identities = 174/376 (46%), Positives = 241/376 (64%), Gaps = 7/376 (1%)
Frame = +2
Query: 38 LSSQCSFTMWGVTVVSLVLLLIATANSA--VVPFRDGILPNGDFELGPKPSDLKGTEIIN 211
L S S+ + ++ L L ++A A+SA P DG++ NGDFE P I +
Sbjct: 3 LYSNNSWRSNSILILLLGLSIVAAADSAGKTSPVEDGLVVNGDFETPPSNGFPDDAIIED 62
Query: 212 KMAIPNWEVTGFVEYISSGHKQGDMLLVVPAGKFAVRLGNEASIKQRIKVVKGMYYSLTF 391
IP+W G VE I SG KQG M+L+VP G+ AVRLGN+A I Q + V KG YS+TF
Sbjct: 63 TSEIPSWRSDGTVELIKSGQKQGGMILIVPEGRHAVRLGNDAEISQELTVEKGSIYSVTF 122
Query: 392 SAARTCAQDERLNISVAPD-----SGVIPVQTVYSSSGWDLYAWAFQAESEVAEIVIHNP 556
SAARTCAQ E LN+SVA S I +QTVYS GWD YAWAF+A + +V NP
Sbjct: 123 SAARTCAQLESLNVSVASSDEPIASQTIDLQTVYSVQGWDPYAWAFEAVVDRVRLVFKNP 182
Query: 557 GEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDH 736
G E+DP CGP+ID +A++ L+ P N + NG FEEGP + +T GVL+P ++++
Sbjct: 183 GMEDDPTCGPIIDDIAVKKLFTPDKPKGNAVINGDFEEGPWMFRNTTLGVLLPTNLDEEI 242
Query: 737 TPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTIIGKTYVLSFAVG 916
+ LPGW VES +AV+++D++HFSVP+G+RA+EL++GKE I+Q+ T Y +SF++G
Sbjct: 243 SSLPGWTVESNRAVRFIDSDHFSVPEGKRALELLSGKEGIISQMVETKANIPYKMSFSLG 302
Query: 917 DANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMFYSTFYSMRS 1096
A + CK + V AFAG Y ++ F+R+ + F A + RTR+ FYS +Y+ R+
Sbjct: 303 HAGDKCKEPLAVMAFAGDQAQNFHYMAQANSSFERSELNFTAKAERTRIAFYSIYYNTRT 362
Query: 1097 DDFSSLCGPVIDDVKL 1144
DD +SLCGPVIDDVK+
Sbjct: 363 DDMTSLCGPVIDDVKV 378
>TAIR9_protein||AT1G29980.2 | Symbols: | INVOLVED IN: biological_process
unknown; LOCATED IN: plasma membrane, anchored to membrane; EXPRESSED
IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS
InterPro DOMAIN/s: Protein of unknown function DUF642
(InterPro:IPR006946), Galactose-binding like (InterPro:IPR008979); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G34510.1); Has 174 Blast hits to 163 proteins in 15 species:
Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0; Plants - 164;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr1:10503411-10504617 REVERSE
Length = 372
Score = 240 bits (611), Expect = 2e-063
Identities = 107/229 (46%), Positives = 159/229 (69%)
Frame = +2
Query: 458 IPVQTVYSSSGWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTN 637
+ +QT+YS GWD YAWAF+AE + +V NPG E+DP CGP+ID +A++ L+ P
Sbjct: 118 VDLQTLYSVQGWDPYAWAFEAEDDHVRLVFKNPGMEDDPTCGPIIDDIAIKKLFTPDKPK 177
Query: 638 KNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQG 817
N + NG FE+GP + ++ GVL+P ++++ + LPGW VES +AV++VD++HFSVP+G
Sbjct: 178 DNAVINGDFEDGPWMFRNTSLGVLLPTNLDEEISSLPGWTVESNRAVRFVDSDHFSVPKG 237
Query: 818 RRAIELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYES 997
+RA+EL++GKE I+Q+ T K Y+LSF++G A + CK + + AFAG Y +
Sbjct: 238 KRAVELLSGKEGIISQMVETKADKPYILSFSLGHAGDKCKEPLAIMAFAGDQAQNFHYMA 297
Query: 998 RGTGGFKRASIRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDVKL 1144
+ F++A + F A + RTRV FYS +Y+ R+DD SSLCGPVIDDV++
Sbjct: 298 QANSSFEKAGLNFTAKADRTRVAFYSVYYNTRTDDMSSLCGPVIDDVRV 346
Score = 103 bits (256), Expect = 3e-022
Identities = 53/101 (52%), Positives = 66/101 (65%)
Frame = +2
Query: 140 GILPNGDFELGPKPSDLKGTEIINKMAIPNWEVTGFVEYISSGHKQGDMLLVVPAGKFAV 319
G++ NGDFE P IP+W+ G VE I+SG KQG M+L+VP G+ AV
Sbjct: 3 GLVINGDFETSPSSGFPDDGVTDGPSDIPSWKSNGTVELINSGQKQGGMILIVPQGRHAV 62
Query: 320 RLGNEASIKQRIKVVKGMYYSLTFSAARTCAQDERLNISVA 442
RLGN+A I Q + V KG YS+TFSAARTCAQ E +N+SVA
Sbjct: 63 RLGNDAEISQDLTVEKGFVYSVTFSAARTCAQLESINVSVA 103
>TAIR9_protein||AT1G29980.1 | Symbols: | INVOLVED IN: biological_process
unknown; LOCATED IN: plasma membrane, anchored to membrane; EXPRESSED
IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS
InterPro DOMAIN/s: Protein of unknown function DUF642
(InterPro:IPR006946), Galactose-binding like (InterPro:IPR008979); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G34510.1); Has 180 Blast hits to 163 proteins in 15 species:
Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0; Plants - 170;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr1:10503411-10505994 REVERSE
Length = 408
Score = 240 bits (611), Expect = 2e-063
Identities = 107/229 (46%), Positives = 159/229 (69%)
Frame = +2
Query: 458 IPVQTVYSSSGWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTN 637
+ +QT+YS GWD YAWAF+AE + +V NPG E+DP CGP+ID +A++ L+ P
Sbjct: 154 VDLQTLYSVQGWDPYAWAFEAEDDHVRLVFKNPGMEDDPTCGPIIDDIAIKKLFTPDKPK 213
Query: 638 KNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQG 817
N + NG FE+GP + ++ GVL+P ++++ + LPGW VES +AV++VD++HFSVP+G
Sbjct: 214 DNAVINGDFEDGPWMFRNTSLGVLLPTNLDEEISSLPGWTVESNRAVRFVDSDHFSVPKG 273
Query: 818 RRAIELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYES 997
+RA+EL++GKE I+Q+ T K Y+LSF++G A + CK + + AFAG Y +
Sbjct: 274 KRAVELLSGKEGIISQMVETKADKPYILSFSLGHAGDKCKEPLAIMAFAGDQAQNFHYMA 333
Query: 998 RGTGGFKRASIRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDVKL 1144
+ F++A + F A + RTRV FYS +Y+ R+DD SSLCGPVIDDV++
Sbjct: 334 QANSSFEKAGLNFTAKADRTRVAFYSVYYNTRTDDMSSLCGPVIDDVRV 382
Score = 111 bits (276), Expect = 1e-024
Identities = 60/135 (44%), Positives = 85/135 (62%), Gaps = 1/135 (0%)
Frame = +2
Query: 41 SSQCSFTMWGVTVVSL-VLLLIATANSAVVPFRDGILPNGDFELGPKPSDLKGTEIINKM 217
+++C ++ + ++S+ V +L+A A+ DG++ NGDFE P
Sbjct: 5 NNRCKWSSIFLFLLSVSVAVLVAVADDKSPAVEDGLVINGDFETSPSSGFPDDGVTDGPS 64
Query: 218 AIPNWEVTGFVEYISSGHKQGDMLLVVPAGKFAVRLGNEASIKQRIKVVKGMYYSLTFSA 397
IP+W+ G VE I+SG KQG M+L+VP G+ AVRLGN+A I Q + V KG YS+TFSA
Sbjct: 65 DIPSWKSNGTVELINSGQKQGGMILIVPQGRHAVRLGNDAEISQDLTVEKGFVYSVTFSA 124
Query: 398 ARTCAQDERLNISVA 442
ARTCAQ E +N+SVA
Sbjct: 125 ARTCAQLESINVSVA 139
>TAIR9_protein||AT5G14150.1 | Symbols: | unknown protein | chr5:4565246-4566653
REVERSE
Length = 384
Score = 149 bits (374), Expect = 6e-036
Identities = 117/371 (31%), Positives = 173/371 (46%), Gaps = 41/371 (11%)
Frame = +2
Query: 80 VSLVLLLIATANSAVVPFRDGILPNGDFELG----PKPSDLKGTEIINKMAIPNWEVTGF 247
+ L+LL+ A+S L N DFE P S+ + +P W G
Sbjct: 8 IFLLLLVSCCASS-------DFLENPDFESPPLNLPTNSNASSVSLDQNSTLPGWTFQGT 60
Query: 248 VEYISSGHKQGDMLLVVPAGKFAVRLGNEASIKQRIKVVKG--MYYSLTFS---AARTCA 412
V Y+ +P AV+LG + I Q + KG + Y LTF+ A + C
Sbjct: 61 VLYVE-----------LPDTGHAVQLGEDGKINQTF-IAKGDELNYILTFALIHAGQNCT 108
Query: 413 QDERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQA--ESEVAEIVIHNPGEEED----P 574
L++S + V + YS W Y+ + E +V+ + + D
Sbjct: 109 SSAGLSVSGPDSNAVFSYRQNYSKVSWQSYSHNLGSWGNGEPINLVLESQAIDSDSDTNS 168
Query: 575 ACGPLIDGVAMRALYPPRPTNK-NILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPG 751
C P+ID + ++ + + N+L NGGFE GP LP ST GVLI +PL
Sbjct: 169 TCWPIIDTLLIKTVGVTLVQDSGNLLINGGFESGPGFLPNSTDGVLIDAVPSLIQSPLRQ 228
Query: 752 WMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVAR--TIIGKTYVLSFAVGDAN 925
W V + V+Y+D+EHF VP+G+ AIE+++ + Q A T G Y L+F +GDAN
Sbjct: 229 WSV--IGTVRYIDSEHFHVPEGKAAIEILSNTAPSGIQTATKGTSEGSRYNLTFTLGDAN 286
Query: 926 NACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMFYSTFYSMRSDDF 1105
+AC+G VV A AG T ES GTG ++ + F A ++ F T YS+
Sbjct: 287 DACRGHFVVGAQAGSVTQNFTLESNGTGSGEKFGLVFEADKDAAQISF--TSYSVTMTKE 344
Query: 1106 SSLCGPVIDDV 1138
+ +CGPVID+V
Sbjct: 345 NVVCGPVIDEV 355
Database: TAIR9 protein
Posted date: Wed Jul 08 15:16:08 2009
Number of letters in database: 13,468,323
Number of sequences in database: 33,410
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,188,306,669
Number of Sequences: 33410
Number of Extensions: 8188306669
Number of Successful Extensions: 285327727
Number of sequences better than 0.0: 0