BLASTX 7.6.2
Query= UN17103 /QuerySize=1392
(1391 letters)
Database: TAIR9 protein;
33,410 sequences; 13,468,323 total letters
Score E
Sequences producing significant alignments: (bits) Value
TAIR9_protein||AT5G11420.1 | Symbols: | FUNCTIONS IN: molecular... 703 1e-202
TAIR9_protein||AT5G25460.1 | Symbols: | unknown protein | chr5:... 652 2e-187
TAIR9_protein||AT4G32460.1 | Symbols: | FUNCTIONS IN: molecular... 594 6e-170
TAIR9_protein||AT4G32460.2 | Symbols: | FUNCTIONS IN: molecular... 594 6e-170
TAIR9_protein||AT1G80240.1 | Symbols: | unknown protein | chr1:... 478 5e-135
TAIR9_protein||AT3G08030.1 | Symbols: | unknown protein | chr3:... 399 2e-111
TAIR9_protein||AT2G41800.1 | Symbols: | FUNCTIONS IN: molecular... 392 4e-109
TAIR9_protein||AT3G08030.2 | Symbols: | unknown protein | chr3:... 381 8e-106
TAIR9_protein||AT2G41810.1 | Symbols: | FUNCTIONS IN: molecular... 376 2e-104
TAIR9_protein||AT2G34510.1 | Symbols: | FUNCTIONS IN: molecular... 340 1e-093
TAIR9_protein||AT1G29980.1 | Symbols: | INVOLVED IN: biological... 339 3e-093
TAIR9_protein||AT1G29980.2 | Symbols: | INVOLVED IN: biological... 242 4e-064
TAIR9_protein||AT5G14150.1 | Symbols: | unknown protein | chr5:... 153 4e-037
>TAIR9_protein||AT5G11420.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cell
wall, plant-type cell wall; EXPRESSED IN: 23 plant structures;
EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein
of unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT5G25460.1); Has 185 Blast hits to 157 proteins
in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants
- 185; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr5:3644655-3646991 FORWARD
Length = 367
Score = 703 bits (1812), Expect = 1e-202
Identities = 342/365 (93%), Positives = 360/365 (98%)
Frame = -3
Query: 1344 MKGGTGSFLCVLLISTVTSVVCFRDGMLPNGDFELGPKPSDMKGTQVLNKNAIPSWELTG 1165
MKGG+ SFL VLLI+T+TSV+CF DGMLPNGDFELGPKPSDMKGTQV+NK AIPSWEL+G
Sbjct: 1 MKGGSLSFLFVLLIATITSVICFSDGMLPNGDFELGPKPSDMKGTQVINKKAIPSWELSG 60
Query: 1164 FVEYIKSGQKQGDMLLVVPAGKFAIRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDER 985
FVEYIKSGQKQGDMLLVVPAGKFAIRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDER
Sbjct: 61 FVEYIKSGQKQGDMLLVVPAGKFAIRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDER 120
Query: 984 LNISVAPDSGIIPIQTVYSSSGWDLYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVA 805
LNISVAPDSG+IPIQTVYSSSGWDLYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVA
Sbjct: 121 LNISVAPDSGVIPIQTVYSSSGWDLYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVA 180
Query: 804 IKALYPPRPTNKNILKNGGFEEGPYLLPNSTTGVLIPPFIEDDHSPLPAWMVESLKAVKY 625
IKALYPPRPTNKNILKNGGFEEGPY+LPN+TTGVL+PPFIEDDHSPLPAWMVESLKA+KY
Sbjct: 181 IKALYPPRPTNKNILKNGGFEEGPYVLPNATTGVLVPPFIEDDHSPLPAWMVESLKAIKY 240
Query: 624 VDVEHFSVPQGRRAVELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACEGSMIVEAFA 445
VDVEHFSVPQGRRAVELVAGKESAIAQVART++GKTYVLSFAVGDANNAC+GSM+VEAFA
Sbjct: 241 VDVEHFSVPQGRRAVELVAGKESAIAQVARTVVGKTYVLSFAVGDANNACQGSMVVEAFA 300
Query: 444 GRDTLKVPYESKGKGGFKRASLRFVAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDVKL 265
G+DTLKVPYES+GKGGFKRASLRFVAVSTRTRVMFYSTFY+MRSDDFSSLCGPVIDDVKL
Sbjct: 301 GKDTLKVPYESRGKGGFKRASLRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDVKL 360
Query: 264 LSVRK 250
LS RK
Sbjct: 361 LSARK 365
>TAIR9_protein||AT5G25460.1 | Symbols: | unknown protein | chr5:8863430-8865394
FORWARD
Length = 370
Score = 652 bits (1680), Expect = 2e-187
Identities = 320/368 (86%), Positives = 344/368 (93%), Gaps = 3/368 (0%)
Frame = -3
Query: 1344 MKGGTGSFLCVLLIST---VTSVVCFRDGMLPNGDFELGPKPSDMKGTQVLNKNAIPSWE 1174
M+G T +L I+T S V FRDGMLPNGDFELGPKPSDMKGT++LNK AIP+WE
Sbjct: 1 MEGVTVVSFFLLFIATAMAAKSTVSFRDGMLPNGDFELGPKPSDMKGTEILNKLAIPNWE 60
Query: 1173 LTGFVEYIKSGQKQGDMLLVVPAGKFAIRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQ 994
+TGFVEYIKSG KQGDMLLVVPAGKFA+RLGNEASIKQRL V KGMYYSLTFSAARTCAQ
Sbjct: 61 VTGFVEYIKSGHKQGDMLLVVPAGKFAVRLGNEASIKQRLKVVKGMYYSLTFSAARTCAQ 120
Query: 993 DERLNISVAPDSGIIPIQTVYSSSGWDLYAWAFQAESNVAEIVIHNPGEEEDPACGPLID 814
DERLNISVAPDSG+IPIQTVYSSSGWDLYAWAFQAES+VAE+VIHNPG EEDPACGPLID
Sbjct: 121 DERLNISVAPDSGVIPIQTVYSSSGWDLYAWAFQAESDVAEVVIHNPGVEEDPACGPLID 180
Query: 813 GVAIKALYPPRPTNKNILKNGGFEEGPYLLPNSTTGVLIPPFIEDDHSPLPAWMVESLKA 634
GVA+++LYPPRPTNKNILKNGGFEEGP +LP STTGVLIPPFIEDDHSPLP WMVESLKA
Sbjct: 181 GVAMRSLYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHSPLPGWMVESLKA 240
Query: 633 VKYVDVEHFSVPQGRRAVELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACEGSMIVE 454
VKYVDVEHFSVPQGRRA+ELVAGKESAIAQV RT+IGKTYVLSFAVGDANNAC+GSM+VE
Sbjct: 241 VKYVDVEHFSVPQGRRAIELVAGKESAIAQVVRTVIGKTYVLSFAVGDANNACKGSMVVE 300
Query: 453 AFAGRDTLKVPYESKGKGGFKRASLRFVAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDD 274
AFAG+DTLKVPYESKG GGFKRAS+RFVAVSTR+R+MFYSTFYAMRSDDFSSLCGPVIDD
Sbjct: 301 AFAGKDTLKVPYESKGTGGFKRASIRFVAVSTRSRIMFYSTFYAMRSDDFSSLCGPVIDD 360
Query: 273 VKLLSVRK 250
VKL+SVRK
Sbjct: 361 VKLISVRK 368
>TAIR9_protein||AT4G32460.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
plant-type cell wall; EXPRESSED IN: 24 plant structures; EXPRESSED
DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Protein of
unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT5G11420.1); Has 182 Blast hits to 158 proteins
in 12 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants
- 180; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr4:15663036-15664859 REVERSE
Length = 366
Score = 594 bits (1530), Expect = 6e-170
Identities = 286/360 (79%), Positives = 325/360 (90%)
Frame = -3
Query: 1329 GSFLCVLLISTVTSVVCFRDGMLPNGDFELGPKPSDMKGTQVLNKNAIPSWELTGFVEYI 1150
G + +LL S CF DG+LPNGDFELGP+ SDMKGTQV+N AIP+WEL+GFVEYI
Sbjct: 5 GVIVLLLLHSFFYVAFCFNDGLLPNGDFELGPRHSDMKGTQVINITAIPNWELSGFVEYI 64
Query: 1149 KSGQKQGDMLLVVPAGKFAIRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDERLNISV 970
SG KQGDM+LVVP G FA+RLGNEASIKQ+++V KG YYS+TFSAARTCAQDERLN+SV
Sbjct: 65 PSGHKQGDMILVVPKGAFAVRLGNEASIKQKISVKKGSYYSITFSAARTCAQDERLNVSV 124
Query: 969 APDSGIIPIQTVYSSSGWDLYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVAIKALY 790
AP ++PIQTVYSSSGWDLY+WAF+A+S+ A+IVIHNPG EEDPACGPLIDGVA++AL+
Sbjct: 125 APHHAVMPIQTVYSSSGWDLYSWAFKAQSDYADIVIHNPGVEEDPACGPLIDGVAMRALF 184
Query: 789 PPRPTNKNILKNGGFEEGPYLLPNSTTGVLIPPFIEDDHSPLPAWMVESLKAVKYVDVEH 610
PPRPTNKNILKNGGFEEGP++LPN ++GVLIPP DDHSPLP WMVESLKAVKY+D +H
Sbjct: 185 PPRPTNKNILKNGGFEEGPWVLPNISSGVLIPPNSIDDHSPLPGWMVESLKAVKYIDSDH 244
Query: 609 FSVPQGRRAVELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACEGSMIVEAFAGRDTL 430
FSVPQGRRAVELVAGKESA+AQV RTI GKTYVLSF+VGDA+NAC GSMIVEAFAG+DT+
Sbjct: 245 FSVPQGRRAVELVAGKESAVAQVVRTIPGKTYVLSFSVGDASNACAGSMIVEAFAGKDTI 304
Query: 429 KVPYESKGKGGFKRASLRFVAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDVKLLSVRK 250
KVPYESKGKGGFKR+SLRFVAVS+RTRVMFYSTFYAMR+DDFSSLCGPVIDDVKLLS R+
Sbjct: 305 KVPYESKGKGGFKRSSLRFVAVSSRTRVMFYSTFYAMRNDDFSSLCGPVIDDVKLLSARR 364
>TAIR9_protein||AT4G32460.2 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
plant-type cell wall; EXPRESSED IN: 24 plant structures; EXPRESSED
DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Protein of
unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT5G11420.1); Has 182 Blast hits to 158 proteins
in 12 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants
- 180; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr4:15663036-15664859 REVERSE
Length = 366
Score = 594 bits (1530), Expect = 6e-170
Identities = 286/360 (79%), Positives = 325/360 (90%)
Frame = -3
Query: 1329 GSFLCVLLISTVTSVVCFRDGMLPNGDFELGPKPSDMKGTQVLNKNAIPSWELTGFVEYI 1150
G + +LL S CF DG+LPNGDFELGP+ SDMKGTQV+N AIP+WEL+GFVEYI
Sbjct: 5 GVIVLLLLHSFFYVAFCFNDGLLPNGDFELGPRHSDMKGTQVINITAIPNWELSGFVEYI 64
Query: 1149 KSGQKQGDMLLVVPAGKFAIRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDERLNISV 970
SG KQGDM+LVVP G FA+RLGNEASIKQ+++V KG YYS+TFSAARTCAQDERLN+SV
Sbjct: 65 PSGHKQGDMILVVPKGAFAVRLGNEASIKQKISVKKGSYYSITFSAARTCAQDERLNVSV 124
Query: 969 APDSGIIPIQTVYSSSGWDLYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVAIKALY 790
AP ++PIQTVYSSSGWDLY+WAF+A+S+ A+IVIHNPG EEDPACGPLIDGVA++AL+
Sbjct: 125 APHHAVMPIQTVYSSSGWDLYSWAFKAQSDYADIVIHNPGVEEDPACGPLIDGVAMRALF 184
Query: 789 PPRPTNKNILKNGGFEEGPYLLPNSTTGVLIPPFIEDDHSPLPAWMVESLKAVKYVDVEH 610
PPRPTNKNILKNGGFEEGP++LPN ++GVLIPP DDHSPLP WMVESLKAVKY+D +H
Sbjct: 185 PPRPTNKNILKNGGFEEGPWVLPNISSGVLIPPNSIDDHSPLPGWMVESLKAVKYIDSDH 244
Query: 609 FSVPQGRRAVELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACEGSMIVEAFAGRDTL 430
FSVPQGRRAVELVAGKESA+AQV RTI GKTYVLSF+VGDA+NAC GSMIVEAFAG+DT+
Sbjct: 245 FSVPQGRRAVELVAGKESAVAQVVRTIPGKTYVLSFSVGDASNACAGSMIVEAFAGKDTI 304
Query: 429 KVPYESKGKGGFKRASLRFVAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDVKLLSVRK 250
KVPYESKGKGGFKR+SLRFVAVS+RTRVMFYSTFYAMR+DDFSSLCGPVIDDVKLLS R+
Sbjct: 305 KVPYESKGKGGFKRSSLRFVAVSSRTRVMFYSTFYAMRNDDFSSLCGPVIDDVKLLSARR 364
>TAIR9_protein||AT1G80240.1 | Symbols: | unknown protein |
chr1:30171520-30172799 REVERSE
Length = 371
Score = 478 bits (1229), Expect = 5e-135
Identities = 235/358 (65%), Positives = 279/358 (77%), Gaps = 1/358 (0%)
Frame = -3
Query: 1320 LCVLLIST-VTSVVCFRDGMLPNGDFELGPKPSDMKGTQVLNKNAIPSWELTGFVEYIKS 1144
L +L IS+ V RDG+LPNG+FELGPKPS MKG+ V + A+P+W + GFVE+IKS
Sbjct: 10 LALLFISSNVVLSAPVRDGLLPNGNFELGPKPSQMKGSVVKERTAVPNWNIIGFVEFIKS 69
Query: 1143 GQKQGDMLLVVPAGKFAIRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDERLNISVAP 964
GQKQ DM+LVVP G A+RLGNEASI Q+++V G YS+TFSAARTCAQDERLNISV
Sbjct: 70 GQKQDDMVLVVPQGSSAVRLGNEASISQKISVLPGRLYSITFSAARTCAQDERLNISVTH 129
Query: 963 DSGIIPIQTVYSSSGWDLYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVAIKALYPP 784
+SG+IPIQT+Y S GWD Y+WAF+A EI HNPG EE PACGPLID VAIKAL+PP
Sbjct: 130 ESGVIPIQTMYGSDGWDSYSWAFKAGGPEIEIRFHNPGVEEHPACGPLIDAVAIKALFPP 189
Query: 783 RPTNKNILKNGGFEEGPYLLPNSTTGVLIPPFIEDDHSPLPAWMVESLKAVKYVDVEHFS 604
R + N++KNG FEEGPY+ P + GVLIPPFIEDD+SPLP WM+ESLKAVKYVD HF+
Sbjct: 190 RFSGYNLIKNGNFEEGPYVFPTAKWGVLIPPFIEDDNSPLPGWMIESLKAVKYVDKAHFA 249
Query: 603 VPQGRRAVELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACEGSMIVEAFAGRDTLKV 424
VP+G RA+ELV GKESAI+Q+ RT + K Y L+F VGDA + CEG MIVEAFAG+ + V
Sbjct: 250 VPEGHRAIELVGGKESAISQIVRTSLNKFYALTFNVGDARDGCEGPMIVEAFAGQGKVMV 309
Query: 423 PYESKGKGGFKRASLRFVAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDVKLLSVRK 250
Y SKGKGGF+R L F AVS RTRV F STFY M+SD SLCGPVIDDV+L++V K
Sbjct: 310 DYASKGKGGFRRGRLVFKAVSARTRVTFLSTFYHMKSDHSGSLCGPVIDDVRLVAVGK 367
>TAIR9_protein||AT3G08030.1 | Symbols: | unknown protein | chr3:2564191-2565819
FORWARD
Length = 366
Score = 399 bits (1025), Expect = 2e-111
Identities = 199/334 (59%), Positives = 243/334 (72%)
Frame = -3
Query: 1272 DGMLPNGDFELGPKPSDMKGTQVLNKNAIPSWELTGFVEYIKSGQKQGDMLLVVPAGKFA 1093
+G L NG+FE PK +DMK T +L KNA+P WE TGFVEYI G + G M V G A
Sbjct: 26 EGYLRNGNFEESPKKTDMKKTVLLGKNALPEWETTGFVEYIAGGPQPGGMYFPVAHGVHA 85
Query: 1092 IRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDERLNISVAPDSGIIPIQTVYSSSGWD 913
+RLGNEA+I Q+L V G Y+LTF A+RTCAQDE L +SV SG +P+QT+Y+S G D
Sbjct: 86 VRLGNEATISQKLEVKPGSLYALTFGASRTCAQDEVLRVSVPSQSGDLPLQTLYNSFGGD 145
Query: 912 LYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVAIKALYPPRPTNKNILKNGGFEEGP 733
+YAWAF A+++ + HNPG +EDPACGPL+D VAIK L P T N++KNGGFEEGP
Sbjct: 146 VYAWAFVAKTSQVTVTFHNPGVQEDPACGPLLDAVAIKELVHPIYTRGNLVKNGGFEEGP 205
Query: 732 YLLPNSTTGVLIPPFIEDDHSPLPAWMVESLKAVKYVDVEHFSVPQGRRAVELVAGKESA 553
+ L NST GVL+PP ED SPLP W++ESLKAVK++D ++F+VP G A+ELVAGKESA
Sbjct: 206 HRLVNSTQGVLLPPKQEDLTSPLPGWIIESLKAVKFIDSKYFNVPFGHAAIELVAGKESA 265
Query: 552 IAQVARTIIGKTYVLSFAVGDANNACEGSMIVEAFAGRDTLKVPYESKGKGGFKRASLRF 373
IAQV RT G+TY LSF VGDA N C GSM+VEAFA RDTLKVP+ S G G K AS +F
Sbjct: 266 IAQVIRTSPGQTYTLSFVVGDAKNDCHGSMMVEAFAARDTLKVPHTSVGGGHVKTASFKF 325
Query: 372 VAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDV 271
AV RTR+ F+S FY + D SLCGPVID++
Sbjct: 326 KAVEARTRITFFSGFYHTKKTDTVSLCGPVIDEI 359
>TAIR9_protein||AT2G41800.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cell
wall, plant-type cell wall; EXPRESSED IN: 7 plant structures; EXPRESSED
DURING: 4 anthesis, C globular stage, petal differentiation and
expansion stage; CONTAINS InterPro DOMAIN/s: Protein of unknown
function DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT2G41810.1); Has 156 Blast hits to 155 proteins
in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants
- 156; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr2:17436671-17438005 REVERSE
Length = 371
Score = 392 bits (1005), Expect = 4e-109
Identities = 189/340 (55%), Positives = 239/340 (70%)
Frame = -3
Query: 1272 DGMLPNGDFELGPKPSDMKGTQVLNKNAIPSWELTGFVEYIKSGQKQGDMLLVVPAGKFA 1093
DG+LPNG+FE+ P S+MKG Q++ N++P WE+ G VE + G + G VP G A
Sbjct: 31 DGILPNGNFEITPLKSNMKGRQIIGANSLPHWEIAGHVELVSGGPQPGGFYFPVPRGVHA 90
Query: 1092 IRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDERLNISVAPDSGIIPIQTVYSSSGWD 913
+RLGN +I Q + V G+ YSLTF A RTCAQDE + +SV + +P+QTV+SS G D
Sbjct: 91 VRLGNLGTISQNVRVKSGLVYSLTFGATRTCAQDENIKVSVPGQANELPLQTVFSSDGGD 150
Query: 912 LYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVAIKALYPPRPTNKNILKNGGFEEGP 733
YAWAF+A S+V ++ HNPG +ED CGPL+D VAIK + P R T N++KNGGFE GP
Sbjct: 151 TYAWAFKATSDVVKVTFHNPGVQEDRTCGPLLDVVAIKEILPLRYTRGNLVKNGGFEIGP 210
Query: 732 YLLPNSTTGVLIPPFIEDDHSPLPAWMVESLKAVKYVDVEHFSVPQGRRAVELVAGKESA 553
++ N +TG+LIP I+D SPLP W+VESLK VKY+D HF VP G+ AVELVAG+ESA
Sbjct: 211 HVFANFSTGILIPARIQDFISPLPGWIVESLKPVKYIDRRHFKVPYGQGAVELVAGRESA 270
Query: 552 IAQVARTIIGKTYVLSFAVGDANNACEGSMIVEAFAGRDTLKVPYESKGKGGFKRASLRF 373
IAQ+ RTI GK Y+LSFAVGDA N C GSM+VEAFAGR+ K+ + S+GKG FK RF
Sbjct: 271 IAQIIRTIAGKAYMLSFAVGDAQNGCHGSMMVEAFAGREPFKLSFMSEGKGAFKTGHFRF 330
Query: 372 VAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDVKLLSVR 253
VA S RTR+ FYS FY + DF LCGPV+D V + R
Sbjct: 331 VADSDRTRLTFYSAFYHTKLHDFGHLCGPVLDSVVVTLAR 370
>TAIR9_protein||AT3G08030.2 | Symbols: | unknown protein | chr3:2564517-2565819
FORWARD
Length = 324
Score = 381 bits (977), Expect = 8e-106
Identities = 190/317 (59%), Positives = 231/317 (72%)
Frame = -3
Query: 1221 MKGTQVLNKNAIPSWELTGFVEYIKSGQKQGDMLLVVPAGKFAIRLGNEASIKQRLNVTK 1042
MK T +L KNA+P WE TGFVEYI G + G M V G A+RLGNEA+I Q+L V
Sbjct: 1 MKKTVLLGKNALPEWETTGFVEYIAGGPQPGGMYFPVAHGVHAVRLGNEATISQKLEVKP 60
Query: 1041 GMYYSLTFSAARTCAQDERLNISVAPDSGIIPIQTVYSSSGWDLYAWAFQAESNVAEIVI 862
G Y+LTF A+RTCAQDE L +SV SG +P+QT+Y+S G D+YAWAF A+++ +
Sbjct: 61 GSLYALTFGASRTCAQDEVLRVSVPSQSGDLPLQTLYNSFGGDVYAWAFVAKTSQVTVTF 120
Query: 861 HNPGEEEDPACGPLIDGVAIKALYPPRPTNKNILKNGGFEEGPYLLPNSTTGVLIPPFIE 682
HNPG +EDPACGPL+D VAIK L P T N++KNGGFEEGP+ L NST GVL+PP E
Sbjct: 121 HNPGVQEDPACGPLLDAVAIKELVHPIYTRGNLVKNGGFEEGPHRLVNSTQGVLLPPKQE 180
Query: 681 DDHSPLPAWMVESLKAVKYVDVEHFSVPQGRRAVELVAGKESAIAQVARTIIGKTYVLSF 502
D SPLP W++ESLKAVK++D ++F+VP G A+ELVAGKESAIAQV RT G+TY LSF
Sbjct: 181 DLTSPLPGWIIESLKAVKFIDSKYFNVPFGHAAIELVAGKESAIAQVIRTSPGQTYTLSF 240
Query: 501 AVGDANNACEGSMIVEAFAGRDTLKVPYESKGKGGFKRASLRFVAVSTRTRVMFYSTFYA 322
VGDA N C GSM+VEAFA RDTLKVP+ S G G K AS +F AV RTR+ F+S FY
Sbjct: 241 VVGDAKNDCHGSMMVEAFAARDTLKVPHTSVGGGHVKTASFKFKAVEARTRITFFSGFYH 300
Query: 321 MRSDDFSSLCGPVIDDV 271
+ D SLCGPVID++
Sbjct: 301 TKKTDTVSLCGPVIDEI 317
>TAIR9_protein||AT2G41810.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
endomembrane system; EXPRESSED IN: root; CONTAINS InterPro DOMAIN/s:
Protein of unknown function DUF642 (InterPro:IPR006946),
Galactose-binding like (InterPro:IPR008979); BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT2G41800.1); Has 161 Blast
hits to 157 proteins in 12 species: Archae - 0; Bacteria - 2; Metazoa -
0; Fungi - 0; Plants - 159; Viruses - 0; Other Eukaryotes - 0 (source:
NCBI BLink). | chr2:17439414-17441296 REVERSE
Length = 371
Score = 376 bits (965), Expect = 2e-104
Identities = 180/336 (53%), Positives = 233/336 (69%)
Frame = -3
Query: 1272 DGMLPNGDFELGPKPSDMKGTQVLNKNAIPSWELTGFVEYIKSGQKQGDMLLVVPAGKFA 1093
DG+LPNG+FE P S+M+ Q++ K ++P WE++G VE + G + G VP G A
Sbjct: 31 DGLLPNGNFEQIPNKSNMRKRQIIGKYSLPHWEISGHVELVSGGPQPGGFYFAVPRGVHA 90
Query: 1092 IRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDERLNISVAPDSGIIPIQTVYSSSGWD 913
RLGN ASI Q + V G+ YSLTF RTCAQDE + ISV + +PIQT++S++G D
Sbjct: 91 ARLGNLASISQYVKVKSGLVYSLTFGVTRTCAQDENIRISVPGQTNELPIQTLFSTNGGD 150
Query: 912 LYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVAIKALYPPRPTNKNILKNGGFEEGP 733
YAWAF+A S++ ++ +NPG +EDP CGP++D VAIK + P R T N++KNGGFE GP
Sbjct: 151 TYAWAFKATSDLVKVTFYNPGVQEDPTCGPIVDAVAIKEILPLRYTKGNLVKNGGFETGP 210
Query: 732 YLLPNSTTGVLIPPFIEDDHSPLPAWMVESLKAVKYVDVEHFSVPQGRRAVELVAGKESA 553
++ N +TG+LIP I+D SPLP W+VESLK VKY+D HF VP G A+ELVAG+ESA
Sbjct: 211 HVFSNFSTGILIPAKIQDLISPLPGWIVESLKPVKYIDNRHFKVPSGLAAIELVAGRESA 270
Query: 552 IAQVARTIIGKTYVLSFAVGDANNACEGSMIVEAFAGRDTLKVPYESKGKGGFKRASLRF 373
IAQ+ RT+ GK Y+LSF VGDA+N C GSM+VEAFAG KV +ES KG FK F
Sbjct: 271 IAQIIRTVSGKNYILSFVVGDAHNGCHGSMMVEAFAGISAFKVTFESNDKGAFKVGRFAF 330
Query: 372 VAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDVKL 265
A S RTR+ FYS FY + DF LCGPV+D+V +
Sbjct: 331 RADSNRTRITFYSGFYHTKLHDFGHLCGPVLDNVSV 366
>TAIR9_protein||AT2G34510.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN: anchored
to membrane; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 14
growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function
DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT1G29980.1); Has 177 Blast hits to 163 proteins
in 15 species: Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0;
Plants - 167; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr2:14544114-14546732 REVERSE
Length = 402
Score = 340 bits (871), Expect = 1e-093
Identities = 174/358 (48%), Positives = 233/358 (65%), Gaps = 8/358 (2%)
Frame = -3
Query: 1272 DGMLPNGDFELGPKPSDMKGTQVLNKNAIPSWELTGFVEYIKSGQKQGDMLLVVPAGKFA 1093
DG++ NGDFE P + + + IPSW G VE IKSGQKQG M+L+VP G+ A
Sbjct: 38 DGLVVNGDFETPPSNGFPDDAIIEDTSEIPSWRSDGTVELIKSGQKQGGMILIVPEGRHA 97
Query: 1092 IRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDERLNISVAPD-----SGIIPIQTVYS 928
+RLGN+A I Q L V KG YS+TFSAARTCAQ E LN+SVA S I +QTVYS
Sbjct: 98 VRLGNDAEISQELTVEKGSIYSVTFSAARTCAQLESLNVSVASSDEPIASQTIDLQTVYS 157
Query: 927 SSGWDLYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVAIKALYPPRPTNKNILKNGG 748
GWD YAWAF+A + +V NPG E+DP CGP+ID +A+K L+ P N + NG
Sbjct: 158 VQGWDPYAWAFEAVVDRVRLVFKNPGMEDDPTCGPIIDDIAVKKLFTPDKPKGNAVINGD 217
Query: 747 FEEGPYLLPNSTTGVLIPPFIEDDHSPLPAWMVESLKAVKYVDVEHFSVPQGRRAVELVA 568
FEEGP++ N+T GVL+P ++++ S LP W VES +AV+++D +HFSVP+G+RA+EL++
Sbjct: 218 FEEGPWMFRNTTLGVLLPTNLDEEISSLPGWTVESNRAVRFIDSDHFSVPEGKRALELLS 277
Query: 567 GKESAIAQVARTIIGKTYVLSFAVGDANNACEGSMIVEAFAGRDTLKVPYESKGKGGFKR 388
GKE I+Q+ T Y +SF++G A + C+ + V AFAG Y ++ F+R
Sbjct: 278 GKEGIISQMVETKANIPYKMSFSLGHAGDKCKEPLAVMAFAGDQAQNFHYMAQANSSFER 337
Query: 387 ASLRFVAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDVKLL---SVRKA*NGPCFIL 223
+ L F A + RTR+ FYS +Y R+DD +SLCGPVIDDVK+ S R + P FIL
Sbjct: 338 SELNFTAKAERTRIAFYSIYYNTRTDDMTSLCGPVIDDVKVWFSGSSRIGFSFPLFIL 395
>TAIR9_protein||AT1G29980.1 | Symbols: | INVOLVED IN: biological_process
unknown; LOCATED IN: plasma membrane, anchored to membrane; EXPRESSED
IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS
InterPro DOMAIN/s: Protein of unknown function DUF642
(InterPro:IPR006946), Galactose-binding like (InterPro:IPR008979); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G34510.1); Has 180 Blast hits to 163 proteins in 15 species:
Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0; Plants - 170;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr1:10503411-10505994 REVERSE
Length = 408
Score = 339 bits (869), Expect = 3e-093
Identities = 173/369 (46%), Positives = 234/369 (63%), Gaps = 16/369 (4%)
Frame = -3
Query: 1323 FLCVLLISTVTSVV-------CFRDGMLPNGDFELGPKPSDMKGTQVLNKNAIPSWELTG 1165
FL +L +S V DG++ NGDFE P + IPSW+ G
Sbjct: 14 FLFLLSVSVAVLVAVADDKSPAVEDGLVINGDFETSPSSGFPDDGVTDGPSDIPSWKSNG 73
Query: 1164 FVEYIKSGQKQGDMLLVVPAGKFAIRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDER 985
VE I SGQKQG M+L+VP G+ A+RLGN+A I Q L V KG YS+TFSAARTCAQ E
Sbjct: 74 TVELINSGQKQGGMILIVPQGRHAVRLGNDAEISQDLTVEKGFVYSVTFSAARTCAQLES 133
Query: 984 LNISVAP---------DSGIIPIQTVYSSSGWDLYAWAFQAESNVAEIVIHNPGEEEDPA 832
+N+SVA S + +QT+YS GWD YAWAF+AE + +V NPG E+DP
Sbjct: 134 INVSVASVNADADDMLASRNVDLQTLYSVQGWDPYAWAFEAEDDHVRLVFKNPGMEDDPT 193
Query: 831 CGPLIDGVAIKALYPPRPTNKNILKNGGFEEGPYLLPNSTTGVLIPPFIEDDHSPLPAWM 652
CGP+ID +AIK L+ P N + NG FE+GP++ N++ GVL+P ++++ S LP W
Sbjct: 194 CGPIIDDIAIKKLFTPDKPKDNAVINGDFEDGPWMFRNTSLGVLLPTNLDEEISSLPGWT 253
Query: 651 VESLKAVKYVDVEHFSVPQGRRAVELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACE 472
VES +AV++VD +HFSVP+G+RAVEL++GKE I+Q+ T K Y+LSF++G A + C+
Sbjct: 254 VESNRAVRFVDSDHFSVPKGKRAVELLSGKEGIISQMVETKADKPYILSFSLGHAGDKCK 313
Query: 471 GSMIVEAFAGRDTLKVPYESKGKGGFKRASLRFVAVSTRTRVMFYSTFYAMRSDDFSSLC 292
+ + AFAG Y ++ F++A L F A + RTRV FYS +Y R+DD SSLC
Sbjct: 314 EPLAIMAFAGDQAQNFHYMAQANSSFEKAGLNFTAKADRTRVAFYSVYYNTRTDDMSSLC 373
Query: 291 GPVIDDVKL 265
GPVIDDV++
Sbjct: 374 GPVIDDVRV 382
>TAIR9_protein||AT1G29980.2 | Symbols: | INVOLVED IN: biological_process
unknown; LOCATED IN: plasma membrane, anchored to membrane; EXPRESSED
IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS
InterPro DOMAIN/s: Protein of unknown function DUF642
(InterPro:IPR006946), Galactose-binding like (InterPro:IPR008979); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G34510.1); Has 174 Blast hits to 163 proteins in 15 species:
Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0; Plants - 164;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr1:10503411-10504617 REVERSE
Length = 372
Score = 242 bits (617), Expect = 4e-064
Identities = 111/229 (48%), Positives = 158/229 (68%)
Frame = -3
Query: 951 IPIQTVYSSSGWDLYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVAIKALYPPRPTN 772
+ +QT+YS GWD YAWAF+AE + +V NPG E+DP CGP+ID +AIK L+ P
Sbjct: 118 VDLQTLYSVQGWDPYAWAFEAEDDHVRLVFKNPGMEDDPTCGPIIDDIAIKKLFTPDKPK 177
Query: 771 KNILKNGGFEEGPYLLPNSTTGVLIPPFIEDDHSPLPAWMVESLKAVKYVDVEHFSVPQG 592
N + NG FE+GP++ N++ GVL+P ++++ S LP W VES +AV++VD +HFSVP+G
Sbjct: 178 DNAVINGDFEDGPWMFRNTSLGVLLPTNLDEEISSLPGWTVESNRAVRFVDSDHFSVPKG 237
Query: 591 RRAVELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACEGSMIVEAFAGRDTLKVPYES 412
+RAVEL++GKE I+Q+ T K Y+LSF++G A + C+ + + AFAG Y +
Sbjct: 238 KRAVELLSGKEGIISQMVETKADKPYILSFSLGHAGDKCKEPLAIMAFAGDQAQNFHYMA 297
Query: 411 KGKGGFKRASLRFVAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDVKL 265
+ F++A L F A + RTRV FYS +Y R+DD SSLCGPVIDDV++
Sbjct: 298 QANSSFEKAGLNFTAKADRTRVAFYSVYYNTRTDDMSSLCGPVIDDVRV 346
Score = 108 bits (269), Expect = 1e-023
Identities = 55/101 (54%), Positives = 67/101 (66%)
Frame = -3
Query: 1269 GMLPNGDFELGPKPSDMKGTQVLNKNAIPSWELTGFVEYIKSGQKQGDMLLVVPAGKFAI 1090
G++ NGDFE P + IPSW+ G VE I SGQKQG M+L+VP G+ A+
Sbjct: 3 GLVINGDFETSPSSGFPDDGVTDGPSDIPSWKSNGTVELINSGQKQGGMILIVPQGRHAV 62
Query: 1089 RLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDERLNISVA 967
RLGN+A I Q L V KG YS+TFSAARTCAQ E +N+SVA
Sbjct: 63 RLGNDAEISQDLTVEKGFVYSVTFSAARTCAQLESINVSVA 103
>TAIR9_protein||AT5G14150.1 | Symbols: | unknown protein | chr5:4565246-4566653
REVERSE
Length = 384
Score = 153 bits (384), Expect = 4e-037
Identities = 115/355 (32%), Positives = 170/355 (47%), Gaps = 34/355 (9%)
Frame = -3
Query: 1281 CFRDGMLPNGDFELGP--KPSDMKGTQV-LNKNA-IPSWELTGFVEYIKSGQKQGDMLLV 1114
C L N DFE P P++ + V L++N+ +P W G V Y++
Sbjct: 17 CASSDFLENPDFESPPLNLPTNSNASSVSLDQNSTLPGWTFQGTVLYVE----------- 65
Query: 1113 VPAGKFAIRLGNEASIKQRLNVTKG--MYYSLTFS---AARTCAQDERLNISVAPDSGII 949
+P A++LG + I Q + KG + Y LTF+ A + C L++S + +
Sbjct: 66 LPDTGHAVQLGEDGKINQTF-IAKGDELNYILTFALIHAGQNCTSSAGLSVSGPDSNAVF 124
Query: 948 PIQTVYSSSGWDLYAWAFQAESN------VAEIVIHNPGEEEDPACGPLIDGVAIKALYP 787
+ YS W Y+ + N V E + + + C P+ID + IK +
Sbjct: 125 SYRQNYSKVSWQSYSHNLGSWGNGEPINLVLESQAIDSDSDTNSTCWPIIDTLLIKTVGV 184
Query: 786 PRPTNK-NILKNGGFEEGPYLLPNSTTGVLIPPFIEDDHSPLPAWMVESLKAVKYVDVEH 610
+ N+L NGGFE GP LPNST GVLI SPL W V + V+Y+D EH
Sbjct: 185 TLVQDSGNLLINGGFESGPGFLPNSTDGVLIDAVPSLIQSPLRQWSV--IGTVRYIDSEH 242
Query: 609 FSVPQGRRAVELVAGKESAIAQVAR--TIIGKTYVLSFAVGDANNACEGSMIVEAFAGRD 436
F VP+G+ A+E+++ + Q A T G Y L+F +GDAN+AC G +V A AG
Sbjct: 243 FHVPEGKAAIEILSNTAPSGIQTATKGTSEGSRYNLTFTLGDANDACRGHFVVGAQAGSV 302
Query: 435 TLKVPYESKGKGGFKRASLRFVAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDV 271
T ES G G ++ L F A ++ F T Y++ + +CGPVID+V
Sbjct: 303 TQNFTLESNGTGSGEKFGLVFEADKDAAQISF--TSYSVTMTKENVVCGPVIDEV 355
Database: TAIR9 protein
Posted date: Wed Jul 08 15:16:08 2009
Number of letters in database: 13,468,323
Number of sequences in database: 33,410
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,043,379,755
Number of Sequences: 33410
Number of Extensions: 9043379755
Number of Successful Extensions: 311883418
Number of sequences better than 0.0: 0