BLASTX 7.6.2
Query= UN26813 /QuerySize=1358
(1357 letters)
Database: TAIR9 protein;
33,410 sequences; 13,468,323 total letters
Score E
Sequences producing significant alignments: (bits) Value
TAIR9_protein||AT3G08030.1 | Symbols: | unknown protein | chr3:... 687 6e-198
TAIR9_protein||AT3G08030.2 | Symbols: | unknown protein | chr3:... 615 3e-176
TAIR9_protein||AT2G41800.1 | Symbols: | FUNCTIONS IN: molecular... 436 3e-122
TAIR9_protein||AT2G41810.1 | Symbols: | FUNCTIONS IN: molecular... 431 8e-121
TAIR9_protein||AT5G25460.1 | Symbols: | unknown protein | chr5:... 400 2e-111
TAIR9_protein||AT5G11420.1 | Symbols: | FUNCTIONS IN: molecular... 395 4e-110
TAIR9_protein||AT4G32460.1 | Symbols: | FUNCTIONS IN: molecular... 395 5e-110
TAIR9_protein||AT4G32460.2 | Symbols: | FUNCTIONS IN: molecular... 395 5e-110
TAIR9_protein||AT1G80240.1 | Symbols: | unknown protein | chr1:... 377 1e-104
TAIR9_protein||AT2G34510.1 | Symbols: | FUNCTIONS IN: molecular... 292 3e-079
TAIR9_protein||AT1G29980.2 | Symbols: | INVOLVED IN: biological... 207 2e-053
TAIR9_protein||AT1G29980.1 | Symbols: | INVOLVED IN: biological... 207 2e-053
TAIR9_protein||AT5G14150.1 | Symbols: | unknown protein | chr5:... 136 5e-032
>TAIR9_protein||AT3G08030.1 | Symbols: | unknown protein | chr3:2564191-2565819
FORWARD
Length = 366
Score = 687 bits (1771), Expect = 6e-198
Identities = 335/366 (91%), Positives = 354/366 (96%)
Frame = +1
Query: 85 MAVPKAIVLPLFLVLCGAALGASAYEGYLRNGNFEESPKKTDMKKTVLIGKTALPEWETT 264
MAVPKAI+LP+ L++CGAALGA A EGYLRNGNFEESPKKTDMKKTVL+GK ALPEWETT
Sbjct: 1 MAVPKAIILPILLLICGAALGAPASEGYLRNGNFEESPKKTDMKKTVLLGKNALPEWETT 60
Query: 265 GFVEYIAGGPQPGGMFFPVAHGVHAVRLGNEATISQKLEVKPGSLYSLTFGASRTCAQEE 444
GFVEYIAGGPQPGGM+FPVAHGVHAVRLGNEATISQKLEVKPGSLY+LTFGASRTCAQ+E
Sbjct: 61 GFVEYIAGGPQPGGMYFPVAHGVHAVRLGNEATISQKLEVKPGSLYALTFGASRTCAQDE 120
Query: 445 VLRVSVPPQTGDLPLQTLYNSFGGDVYAWAFVPKTSEVTVIFHNPGVQEDPACGPLLDAV 624
VLRVSVP Q+GDLPLQTLYNSFGGDVYAWAFV KTS+VTV FHNPGVQEDPACGPLLDAV
Sbjct: 121 VLRVSVPSQSGDLPLQTLYNSFGGDVYAWAFVAKTSQVTVTFHNPGVQEDPACGPLLDAV 180
Query: 625 AIKELVHPEYTKGNLVKNGGFEEGPHRLVNSTQGVLLPPKQEDVTSPLPGWIIESLKAVK 804
AIKELVHP YT+GNLVKNGGFEEGPHRLVNSTQGVLLPPKQED+TSPLPGWIIESLKAVK
Sbjct: 181 AIKELVHPIYTRGNLVKNGGFEEGPHRLVNSTQGVLLPPKQEDLTSPLPGWIIESLKAVK 240
Query: 805 FIDSKHFNVPFGNAAIELVAGKESAIAQVIRTSPGQTYSLSFVVGDAKNDCHGSMMVEAF 984
FIDSK+FNVPFG+AAIELVAGKESAIAQVIRTSPGQTY+LSFVVGDAKNDCHGSMMVEAF
Sbjct: 241 FIDSKYFNVPFGHAAIELVAGKESAIAQVIRTSPGQTYTLSFVVGDAKNDCHGSMMVEAF 300
Query: 985 AARDTLKVSHTSVGGGHVKTASFRFKALEARTRITFFSGFYHTKKSDIGSLCGPVIDQIV 1164
AARDTLKV HTSVGGGHVKTASF+FKA+EARTRITFFSGFYHTKK+D SLCGPVID+IV
Sbjct: 301 AARDTLKVPHTSVGGGHVKTASFKFKAVEARTRITFFSGFYHTKKTDTVSLCGPVIDEIV 360
Query: 1165 VSHVA* 1182
VSHVA*
Sbjct: 361 VSHVA* 366
>TAIR9_protein||AT3G08030.2 | Symbols: | unknown protein | chr3:2564517-2565819
FORWARD
Length = 324
Score = 615 bits (1584), Expect = 3e-176
Identities = 300/324 (92%), Positives = 315/324 (97%)
Frame = +1
Query: 211 MKKTVLIGKTALPEWETTGFVEYIAGGPQPGGMFFPVAHGVHAVRLGNEATISQKLEVKP 390
MKKTVL+GK ALPEWETTGFVEYIAGGPQPGGM+FPVAHGVHAVRLGNEATISQKLEVKP
Sbjct: 1 MKKTVLLGKNALPEWETTGFVEYIAGGPQPGGMYFPVAHGVHAVRLGNEATISQKLEVKP 60
Query: 391 GSLYSLTFGASRTCAQEEVLRVSVPPQTGDLPLQTLYNSFGGDVYAWAFVPKTSEVTVIF 570
GSLY+LTFGASRTCAQ+EVLRVSVP Q+GDLPLQTLYNSFGGDVYAWAFV KTS+VTV F
Sbjct: 61 GSLYALTFGASRTCAQDEVLRVSVPSQSGDLPLQTLYNSFGGDVYAWAFVAKTSQVTVTF 120
Query: 571 HNPGVQEDPACGPLLDAVAIKELVHPEYTKGNLVKNGGFEEGPHRLVNSTQGVLLPPKQE 750
HNPGVQEDPACGPLLDAVAIKELVHP YT+GNLVKNGGFEEGPHRLVNSTQGVLLPPKQE
Sbjct: 121 HNPGVQEDPACGPLLDAVAIKELVHPIYTRGNLVKNGGFEEGPHRLVNSTQGVLLPPKQE 180
Query: 751 DVTSPLPGWIIESLKAVKFIDSKHFNVPFGNAAIELVAGKESAIAQVIRTSPGQTYSLSF 930
D+TSPLPGWIIESLKAVKFIDSK+FNVPFG+AAIELVAGKESAIAQVIRTSPGQTY+LSF
Sbjct: 181 DLTSPLPGWIIESLKAVKFIDSKYFNVPFGHAAIELVAGKESAIAQVIRTSPGQTYTLSF 240
Query: 931 VVGDAKNDCHGSMMVEAFAARDTLKVSHTSVGGGHVKTASFRFKALEARTRITFFSGFYH 1110
VVGDAKNDCHGSMMVEAFAARDTLKV HTSVGGGHVKTASF+FKA+EARTRITFFSGFYH
Sbjct: 241 VVGDAKNDCHGSMMVEAFAARDTLKVPHTSVGGGHVKTASFKFKAVEARTRITFFSGFYH 300
Query: 1111 TKKSDIGSLCGPVIDQIVVSHVA* 1182
TKK+D SLCGPVID+IVVSHVA*
Sbjct: 301 TKKTDTVSLCGPVIDEIVVSHVA* 324
>TAIR9_protein||AT2G41800.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cell
wall, plant-type cell wall; EXPRESSED IN: 7 plant structures; EXPRESSED
DURING: 4 anthesis, C globular stage, petal differentiation and
expansion stage; CONTAINS InterPro DOMAIN/s: Protein of unknown
function DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT2G41810.1); Has 156 Blast hits to 155 proteins
in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants
- 156; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr2:17436671-17438005 REVERSE
Length = 371
Score = 436 bits (1119), Expect = 3e-122
Identities = 209/337 (62%), Positives = 253/337 (75%)
Frame = +1
Query: 160 EGYLRNGNFEESPKKTDMKKTVLIGKTALPEWETTGFVEYIAGGPQPGGMFFPVAHGVHA 339
+G L NGNFE +P K++MK +IG +LP WE G VE ++GGPQPGG +FPV GVHA
Sbjct: 31 DGILPNGNFEITPLKSNMKGRQIIGANSLPHWEIAGHVELVSGGPQPGGFYFPVPRGVHA 90
Query: 340 VRLGNEATISQKLEVKPGSLYSLTFGASRTCAQEEVLRVSVPPQTGDLPLQTLYNSFGGD 519
VRLGN TISQ + VK G +YSLTFGA+RTCAQ+E ++VSVP Q +LPLQT+++S GGD
Sbjct: 91 VRLGNLGTISQNVRVKSGLVYSLTFGATRTCAQDENIKVSVPGQANELPLQTVFSSDGGD 150
Query: 520 VYAWAFVPKTSEVTVIFHNPGVQEDPACGPLLDAVAIKELVHPEYTKGNLVKNGGFEEGP 699
YAWAF + V V FHNPGVQED CGPLLD VAIKE++ YT+GNLVKNGGFE GP
Sbjct: 151 TYAWAFKATSDVVKVTFHNPGVQEDRTCGPLLDVVAIKEILPLRYTRGNLVKNGGFEIGP 210
Query: 700 HRLVNSTQGVLLPPKQEDVTSPLPGWIIESLKAVKFIDSKHFNVPFGNAAIELVAGKESA 879
H N + G+L+P + +D SPLPGWI+ESLK VK+ID +HF VP+G A+ELVAG+ESA
Sbjct: 211 HVFANFSTGILIPARIQDFISPLPGWIVESLKPVKYIDRRHFKVPYGQGAVELVAGRESA 270
Query: 880 IAQVIRTSPGQTYSLSFVVGDAKNDCHGSMMVEAFAARDTLKVSHTSVGGGHVKTASFRF 1059
IAQ+IRT G+ Y LSF VGDA+N CHGSMMVEAFA R+ K+S S G G KT FRF
Sbjct: 271 IAQIIRTIAGKAYMLSFAVGDAQNGCHGSMMVEAFAGREPFKLSFMSEGKGAFKTGHFRF 330
Query: 1060 KALEARTRITFFSGFYHTKKSDIGSLCGPVIDQIVVS 1170
A RTR+TF+S FYHTK D G LCGPV+D +VV+
Sbjct: 331 VADSDRTRLTFYSAFYHTKLHDFGHLCGPVLDSVVVT 367
>TAIR9_protein||AT2G41810.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
endomembrane system; EXPRESSED IN: root; CONTAINS InterPro DOMAIN/s:
Protein of unknown function DUF642 (InterPro:IPR006946),
Galactose-binding like (InterPro:IPR008979); BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT2G41800.1); Has 161 Blast
hits to 157 proteins in 12 species: Archae - 0; Bacteria - 2; Metazoa -
0; Fungi - 0; Plants - 159; Viruses - 0; Other Eukaryotes - 0 (source:
NCBI BLink). | chr2:17439414-17441296 REVERSE
Length = 371
Score = 431 bits (1106), Expect = 8e-121
Identities = 208/336 (61%), Positives = 255/336 (75%)
Frame = +1
Query: 160 EGYLRNGNFEESPKKTDMKKTVLIGKTALPEWETTGFVEYIAGGPQPGGMFFPVAHGVHA 339
+G L NGNFE+ P K++M+K +IGK +LP WE +G VE ++GGPQPGG +F V GVHA
Sbjct: 31 DGLLPNGNFEQIPNKSNMRKRQIIGKYSLPHWEISGHVELVSGGPQPGGFYFAVPRGVHA 90
Query: 340 VRLGNEATISQKLEVKPGSLYSLTFGASRTCAQEEVLRVSVPPQTGDLPLQTLYNSFGGD 519
RLGN A+ISQ ++VK G +YSLTFG +RTCAQ+E +R+SVP QT +LP+QTL+++ GGD
Sbjct: 91 ARLGNLASISQYVKVKSGLVYSLTFGVTRTCAQDENIRISVPGQTNELPIQTLFSTNGGD 150
Query: 520 VYAWAFVPKTSEVTVIFHNPGVQEDPACGPLLDAVAIKELVHPEYTKGNLVKNGGFEEGP 699
YAWAF + V V F+NPGVQEDP CGP++DAVAIKE++ YTKGNLVKNGGFE GP
Sbjct: 151 TYAWAFKATSDLVKVTFYNPGVQEDPTCGPIVDAVAIKEILPLRYTKGNLVKNGGFETGP 210
Query: 700 HRLVNSTQGVLLPPKQEDVTSPLPGWIIESLKAVKFIDSKHFNVPFGNAAIELVAGKESA 879
H N + G+L+P K +D+ SPLPGWI+ESLK VK+ID++HF VP G AAIELVAG+ESA
Sbjct: 211 HVFSNFSTGILIPAKIQDLISPLPGWIVESLKPVKYIDNRHFKVPSGLAAIELVAGRESA 270
Query: 880 IAQVIRTSPGQTYSLSFVVGDAKNDCHGSMMVEAFAARDTLKVSHTSVGGGHVKTASFRF 1059
IAQ+IRT G+ Y LSFVVGDA N CHGSMMVEAFA KV+ S G K F F
Sbjct: 271 IAQIIRTVSGKNYILSFVVGDAHNGCHGSMMVEAFAGISAFKVTFESNDKGAFKVGRFAF 330
Query: 1060 KALEARTRITFFSGFYHTKKSDIGSLCGPVIDQIVV 1167
+A RTRITF+SGFYHTK D G LCGPV+D + V
Sbjct: 331 RADSNRTRITFYSGFYHTKLHDFGHLCGPVLDNVSV 366
>TAIR9_protein||AT5G25460.1 | Symbols: | unknown protein | chr5:8863430-8865394
FORWARD
Length = 370
Score = 400 bits (1026), Expect = 2e-111
Identities = 203/356 (57%), Positives = 251/356 (70%), Gaps = 4/356 (1%)
Frame = +1
Query: 106 VLPLFLVLCGAALGA----SAYEGYLRNGNFEESPKKTDMKKTVLIGKTALPEWETTGFV 273
V+ FL+ A+ A S +G L NG+FE PK +DMK T ++ K A+P WE TGFV
Sbjct: 6 VVSFFLLFIATAMAAKSTVSFRDGMLPNGDFELGPKPSDMKGTEILNKLAIPNWEVTGFV 65
Query: 274 EYIAGGPQPGGMFFPVAHGVHAVRLGNEATISQKLEVKPGSLYSLTFGASRTCAQEEVLR 453
EYI G + G M V G AVRLGNEA+I Q+L+V G YSLTF A+RTCAQ+E L
Sbjct: 66 EYIKSGHKQGDMLLVVPAGKFAVRLGNEASIKQRLKVVKGMYYSLTFSAARTCAQDERLN 125
Query: 454 VSVPPQTGDLPLQTLYNSFGGDVYAWAFVPKTSEVTVIFHNPGVQEDPACGPLLDAVAIK 633
+SV P +G +P+QT+Y+S G D+YAWAF ++ V+ HNPGV+EDPACGPL+D VA++
Sbjct: 126 ISVAPDSGVIPIQTVYSSSGWDLYAWAFQAESDVAEVVIHNPGVEEDPACGPLIDGVAMR 185
Query: 634 ELVHPEYTKGNLVKNGGFEEGPHRLVNSTQGVLLPPKQEDVTSPLPGWIIESLKAVKFID 813
L P T N++KNGGFEEGP L ST GVL+PP ED SPLPGW++ESLKAVK++D
Sbjct: 186 SLYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHSPLPGWMVESLKAVKYVD 245
Query: 814 SKHFNVPFGNAAIELVAGKESAIAQVIRTSPGQTYSLSFVVGDAKNDCHGSMMVEAFAAR 993
+HF+VP G AIELVAGKESAIAQV+RT G+TY LSF VGDA N C GSM+VEAFA +
Sbjct: 246 VEHFSVPQGRRAIELVAGKESAIAQVVRTVIGKTYVLSFAVGDANNACKGSMVVEAFAGK 305
Query: 994 DTLKVSHTSVGGGHVKTASFRFKALEARTRITFFSGFYHTKKSDIGSLCGPVIDQI 1161
DTLKV + S G G K AS RF A+ R+RI F+S FY + D SLCGPVID +
Sbjct: 306 DTLKVPYESKGTGGFKRASIRFVAVSTRSRIMFYSTFYAMRSDDFSSLCGPVIDDV 361
>TAIR9_protein||AT5G11420.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cell
wall, plant-type cell wall; EXPRESSED IN: 23 plant structures;
EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein
of unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT5G25460.1); Has 185 Blast hits to 157 proteins
in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants
- 185; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr5:3644655-3646991 FORWARD
Length = 367
Score = 395 bits (1014), Expect = 4e-110
Identities = 196/350 (56%), Positives = 247/350 (70%), Gaps = 1/350 (0%)
Frame = +1
Query: 115 LFLVLCGAALGASAY-EGYLRNGNFEESPKKTDMKKTVLIGKTALPEWETTGFVEYIAGG 291
LF++L + +G L NG+FE PK +DMK T +I K A+P WE +GFVEYI G
Sbjct: 9 LFVLLIATITSVICFSDGMLPNGDFELGPKPSDMKGTQVINKKAIPSWELSGFVEYIKSG 68
Query: 292 PQPGGMFFPVAHGVHAVRLGNEATISQKLEVKPGSLYSLTFGASRTCAQEEVLRVSVPPQ 471
+ G M V G A+RLGNEA+I Q+L V G YSLTF A+RTCAQ+E L +SV P
Sbjct: 69 QKQGDMLLVVPAGKFAIRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDERLNISVAPD 128
Query: 472 TGDLPLQTLYNSFGGDVYAWAFVPKTSEVTVIFHNPGVQEDPACGPLLDAVAIKELVHPE 651
+G +P+QT+Y+S G D+YAWAF +++ ++ HNPG +EDPACGPL+D VAIK L P
Sbjct: 129 SGVIPIQTVYSSSGWDLYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVAIKALYPPR 188
Query: 652 YTKGNLVKNGGFEEGPHRLVNSTQGVLLPPKQEDVTSPLPGWIIESLKAVKFIDSKHFNV 831
T N++KNGGFEEGP+ L N+T GVL+PP ED SPLP W++ESLKA+K++D +HF+V
Sbjct: 189 PTNKNILKNGGFEEGPYVLPNATTGVLVPPFIEDDHSPLPAWMVESLKAIKYVDVEHFSV 248
Query: 832 PFGNAAIELVAGKESAIAQVIRTSPGQTYSLSFVVGDAKNDCHGSMMVEAFAARDTLKVS 1011
P G A+ELVAGKESAIAQV RT G+TY LSF VGDA N C GSM+VEAFA +DTLKV
Sbjct: 249 PQGRRAVELVAGKESAIAQVARTVVGKTYVLSFAVGDANNACQGSMVVEAFAGKDTLKVP 308
Query: 1012 HTSVGGGHVKTASFRFKALEARTRITFFSGFYHTKKSDIGSLCGPVIDQI 1161
+ S G G K AS RF A+ RTR+ F+S FY + D SLCGPVID +
Sbjct: 309 YESRGKGGFKRASLRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDV 358
>TAIR9_protein||AT4G32460.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
plant-type cell wall; EXPRESSED IN: 24 plant structures; EXPRESSED
DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Protein of
unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT5G11420.1); Has 182 Blast hits to 158 proteins
in 12 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants
- 180; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr4:15663036-15664859 REVERSE
Length = 366
Score = 395 bits (1013), Expect = 5e-110
Identities = 192/334 (57%), Positives = 242/334 (72%)
Frame = +1
Query: 160 EGYLRNGNFEESPKKTDMKKTVLIGKTALPEWETTGFVEYIAGGPQPGGMFFPVAHGVHA 339
+G L NG+FE P+ +DMK T +I TA+P WE +GFVEYI G + G M V G A
Sbjct: 24 DGLLPNGDFELGPRHSDMKGTQVINITAIPNWELSGFVEYIPSGHKQGDMILVVPKGAFA 83
Query: 340 VRLGNEATISQKLEVKPGSLYSLTFGASRTCAQEEVLRVSVPPQTGDLPLQTLYNSFGGD 519
VRLGNEA+I QK+ VK GS YS+TF A+RTCAQ+E L VSV P +P+QT+Y+S G D
Sbjct: 84 VRLGNEASIKQKISVKKGSYYSITFSAARTCAQDERLNVSVAPHHAVMPIQTVYSSSGWD 143
Query: 520 VYAWAFVPKTSEVTVIFHNPGVQEDPACGPLLDAVAIKELVHPEYTKGNLVKNGGFEEGP 699
+Y+WAF ++ ++ HNPGV+EDPACGPL+D VA++ L P T N++KNGGFEEGP
Sbjct: 144 LYSWAFKAQSDYADIVIHNPGVEEDPACGPLIDGVAMRALFPPRPTNKNILKNGGFEEGP 203
Query: 700 HRLVNSTQGVLLPPKQEDVTSPLPGWIIESLKAVKFIDSKHFNVPFGNAAIELVAGKESA 879
L N + GVL+PP D SPLPGW++ESLKAVK+IDS HF+VP G A+ELVAGKESA
Sbjct: 204 WVLPNISSGVLIPPNSIDDHSPLPGWMVESLKAVKYIDSDHFSVPQGRRAVELVAGKESA 263
Query: 880 IAQVIRTSPGQTYSLSFVVGDAKNDCHGSMMVEAFAARDTLKVSHTSVGGGHVKTASFRF 1059
+AQV+RT PG+TY LSF VGDA N C GSM+VEAFA +DT+KV + S G G K +S RF
Sbjct: 264 VAQVVRTIPGKTYVLSFSVGDASNACAGSMIVEAFAGKDTIKVPYESKGKGGFKRSSLRF 323
Query: 1060 KALEARTRITFFSGFYHTKKSDIGSLCGPVIDQI 1161
A+ +RTR+ F+S FY + D SLCGPVID +
Sbjct: 324 VAVSSRTRVMFYSTFYAMRNDDFSSLCGPVIDDV 357
>TAIR9_protein||AT4G32460.2 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
plant-type cell wall; EXPRESSED IN: 24 plant structures; EXPRESSED
DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Protein of
unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT5G11420.1); Has 182 Blast hits to 158 proteins
in 12 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants
- 180; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr4:15663036-15664859 REVERSE
Length = 366
Score = 395 bits (1013), Expect = 5e-110
Identities = 192/334 (57%), Positives = 242/334 (72%)
Frame = +1
Query: 160 EGYLRNGNFEESPKKTDMKKTVLIGKTALPEWETTGFVEYIAGGPQPGGMFFPVAHGVHA 339
+G L NG+FE P+ +DMK T +I TA+P WE +GFVEYI G + G M V G A
Sbjct: 24 DGLLPNGDFELGPRHSDMKGTQVINITAIPNWELSGFVEYIPSGHKQGDMILVVPKGAFA 83
Query: 340 VRLGNEATISQKLEVKPGSLYSLTFGASRTCAQEEVLRVSVPPQTGDLPLQTLYNSFGGD 519
VRLGNEA+I QK+ VK GS YS+TF A+RTCAQ+E L VSV P +P+QT+Y+S G D
Sbjct: 84 VRLGNEASIKQKISVKKGSYYSITFSAARTCAQDERLNVSVAPHHAVMPIQTVYSSSGWD 143
Query: 520 VYAWAFVPKTSEVTVIFHNPGVQEDPACGPLLDAVAIKELVHPEYTKGNLVKNGGFEEGP 699
+Y+WAF ++ ++ HNPGV+EDPACGPL+D VA++ L P T N++KNGGFEEGP
Sbjct: 144 LYSWAFKAQSDYADIVIHNPGVEEDPACGPLIDGVAMRALFPPRPTNKNILKNGGFEEGP 203
Query: 700 HRLVNSTQGVLLPPKQEDVTSPLPGWIIESLKAVKFIDSKHFNVPFGNAAIELVAGKESA 879
L N + GVL+PP D SPLPGW++ESLKAVK+IDS HF+VP G A+ELVAGKESA
Sbjct: 204 WVLPNISSGVLIPPNSIDDHSPLPGWMVESLKAVKYIDSDHFSVPQGRRAVELVAGKESA 263
Query: 880 IAQVIRTSPGQTYSLSFVVGDAKNDCHGSMMVEAFAARDTLKVSHTSVGGGHVKTASFRF 1059
+AQV+RT PG+TY LSF VGDA N C GSM+VEAFA +DT+KV + S G G K +S RF
Sbjct: 264 VAQVVRTIPGKTYVLSFSVGDASNACAGSMIVEAFAGKDTIKVPYESKGKGGFKRSSLRF 323
Query: 1060 KALEARTRITFFSGFYHTKKSDIGSLCGPVIDQI 1161
A+ +RTR+ F+S FY + D SLCGPVID +
Sbjct: 324 VAVSSRTRVMFYSTFYAMRNDDFSSLCGPVIDDV 357
>TAIR9_protein||AT1G80240.1 | Symbols: | unknown protein |
chr1:30171520-30172799 REVERSE
Length = 371
Score = 377 bits (966), Expect = 1e-104
Identities = 188/354 (53%), Positives = 241/354 (68%)
Frame = +1
Query: 100 AIVLPLFLVLCGAALGASAYEGYLRNGNFEESPKKTDMKKTVLIGKTALPEWETTGFVEY 279
A++L L + L A +G L NGNFE PK + MK +V+ +TA+P W GFVE+
Sbjct: 7 ALLLALLFISSNVVLSAPVRDGLLPNGNFELGPKPSQMKGSVVKERTAVPNWNIIGFVEF 66
Query: 280 IAGGPQPGGMFFPVAHGVHAVRLGNEATISQKLEVKPGSLYSLTFGASRTCAQEEVLRVS 459
I G + M V G AVRLGNEA+ISQK+ V PG LYS+TF A+RTCAQ+E L +S
Sbjct: 67 IKSGQKQDDMVLVVPQGSSAVRLGNEASISQKISVLPGRLYSITFSAARTCAQDERLNIS 126
Query: 460 VPPQTGDLPLQTLYNSFGGDVYAWAFVPKTSEVTVIFHNPGVQEDPACGPLLDAVAIKEL 639
V ++G +P+QT+Y S G D Y+WAF E+ + FHNPGV+E PACGPL+DAVAIK L
Sbjct: 127 VTHESGVIPIQTMYGSDGWDSYSWAFKAGGPEIEIRFHNPGVEEHPACGPLIDAVAIKAL 186
Query: 640 VHPEYTKGNLVKNGGFEEGPHRLVNSTQGVLLPPKQEDVTSPLPGWIIESLKAVKFIDSK 819
P ++ NL+KNG FEEGP+ + GVL+PP ED SPLPGW+IESLKAVK++D
Sbjct: 187 FPPRFSGYNLIKNGNFEEGPYVFPTAKWGVLIPPFIEDDNSPLPGWMIESLKAVKYVDKA 246
Query: 820 HFNVPFGNAAIELVAGKESAIAQVIRTSPGQTYSLSFVVGDAKNDCHGSMMVEAFAARDT 999
HF VP G+ AIELV GKESAI+Q++RTS + Y+L+F VGDA++ C G M+VEAFA +
Sbjct: 247 HFAVPEGHRAIELVGGKESAISQIVRTSLNKFYALTFNVGDARDGCEGPMIVEAFAGQGK 306
Query: 1000 LKVSHTSVGGGHVKTASFRFKALEARTRITFFSGFYHTKKSDIGSLCGPVIDQI 1161
+ V + S G G + FKA+ ARTR+TF S FYH K GSLCGPVID +
Sbjct: 307 VMVDYASKGKGGFRRGRLVFKAVSARTRVTFLSTFYHMKSDHSGSLCGPVIDDV 360
>TAIR9_protein||AT2G34510.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN: anchored
to membrane; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 14
growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function
DUF642 (InterPro:IPR006946), Galactose-binding like
(InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT1G29980.1); Has 177 Blast hits to 163 proteins
in 15 species: Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0;
Plants - 167; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr2:14544114-14546732 REVERSE
Length = 402
Score = 292 bits (747), Expect = 3e-079
Identities = 154/362 (42%), Positives = 217/362 (59%), Gaps = 7/362 (1%)
Frame = +1
Query: 103 IVLPLFLVLCGAALGASA--YEGYLRNGNFEESPKKTDMKKTVLIGKTALPEWETTGFVE 276
++L L +V + G ++ +G + NG+FE P ++ + +P W + G VE
Sbjct: 17 LLLGLSIVAAADSAGKTSPVEDGLVVNGDFETPPSNGFPDDAIIEDTSEIPSWRSDGTVE 76
Query: 277 YIAGGPQPGGMFFPVAHGVHAVRLGNEATISQKLEVKPGSLYSLTFGASRTCAQEEVLRV 456
I G + GGM V G HAVRLGN+A ISQ+L V+ GS+YS+TF A+RTCAQ E L V
Sbjct: 77 LIKSGQKQGGMILIVPEGRHAVRLGNDAEISQELTVEKGSIYSVTFSAARTCAQLESLNV 136
Query: 457 SV-----PPQTGDLPLQTLYNSFGGDVYAWAFVPKTSEVTVIFHNPGVQEDPACGPLLDA 621
SV P + + LQT+Y+ G D YAWAF V ++F NPG+++DP CGP++D
Sbjct: 137 SVASSDEPIASQTIDLQTVYSVQGWDPYAWAFEAVVDRVRLVFKNPGMEDDPTCGPIIDD 196
Query: 622 VAIKELVHPEYTKGNLVKNGGFEEGPHRLVNSTQGVLLPPKQEDVTSPLPGWIIESLKAV 801
+A+K+L P+ KGN V NG FEEGP N+T GVLLP ++ S LPGW +ES +AV
Sbjct: 197 IAVKKLFTPDKPKGNAVINGDFEEGPWMFRNTTLGVLLPTNLDEEISSLPGWTVESNRAV 256
Query: 802 KFIDSKHFNVPFGNAAIELVAGKESAIAQVIRTSPGQTYSLSFVVGDAKNDCHGSMMVEA 981
+FIDS HF+VP G A+EL++GKE I+Q++ T Y +SF +G A + C + V A
Sbjct: 257 RFIDSDHFSVPEGKRALELLSGKEGIISQMVETKANIPYKMSFSLGHAGDKCKEPLAVMA 316
Query: 982 FAARDTLKVSHTSVGGGHVKTASFRFKALEARTRITFFSGFYHTKKSDIGSLCGPVIDQI 1161
FA + + + + F A RTRI F+S +Y+T+ D+ SLCGPVID +
Sbjct: 317 FAGDQAQNFHYMAQANSSFERSELNFTAKAERTRIAFYSIYYNTRTDDMTSLCGPVIDDV 376
Query: 1162 VV 1167
V
Sbjct: 377 KV 378
>TAIR9_protein||AT1G29980.2 | Symbols: | INVOLVED IN: biological_process
unknown; LOCATED IN: plasma membrane, anchored to membrane; EXPRESSED
IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS
InterPro DOMAIN/s: Protein of unknown function DUF642
(InterPro:IPR006946), Galactose-binding like (InterPro:IPR008979); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G34510.1); Has 174 Blast hits to 163 proteins in 15 species:
Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0; Plants - 164;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr1:10503411-10504617 REVERSE
Length = 372
Score = 207 bits (525), Expect = 2e-053
Identities = 99/230 (43%), Positives = 144/230 (62%)
Frame = +1
Query: 478 DLPLQTLYNSFGGDVYAWAFVPKTSEVTVIFHNPGVQEDPACGPLLDAVAIKELVHPEYT 657
++ LQTLY+ G D YAWAF + V ++F NPG+++DP CGP++D +AIK+L P+
Sbjct: 117 NVDLQTLYSVQGWDPYAWAFEAEDDHVRLVFKNPGMEDDPTCGPIIDDIAIKKLFTPDKP 176
Query: 658 KGNLVKNGGFEEGPHRLVNSTQGVLLPPKQEDVTSPLPGWIIESLKAVKFIDSKHFNVPF 837
K N V NG FE+GP N++ GVLLP ++ S LPGW +ES +AV+F+DS HF+VP
Sbjct: 177 KDNAVINGDFEDGPWMFRNTSLGVLLPTNLDEEISSLPGWTVESNRAVRFVDSDHFSVPK 236
Query: 838 GNAAIELVAGKESAIAQVIRTSPGQTYSLSFVVGDAKNDCHGSMMVEAFAARDTLKVSHT 1017
G A+EL++GKE I+Q++ T + Y LSF +G A + C + + AFA +
Sbjct: 237 GKRAVELLSGKEGIISQMVETKADKPYILSFSLGHAGDKCKEPLAIMAFAGDQAQNFHYM 296
Query: 1018 SVGGGHVKTASFRFKALEARTRITFFSGFYHTKKSDIGSLCGPVIDQIVV 1167
+ + A F A RTR+ F+S +Y+T+ D+ SLCGPVID + V
Sbjct: 297 AQANSSFEKAGLNFTAKADRTRVAFYSVYYNTRTDDMSSLCGPVIDDVRV 346
Score = 93 bits (229), Expect = 4e-019
Identities = 50/106 (47%), Positives = 63/106 (59%)
Frame = +1
Query: 163 GYLRNGNFEESPKKTDMKKTVLIGKTALPEWETTGFVEYIAGGPQPGGMFFPVAHGVHAV 342
G + NG+FE SP V G + +P W++ G VE I G + GGM V G HAV
Sbjct: 3 GLVINGDFETSPSSGFPDDGVTDGPSDIPSWKSNGTVELINSGQKQGGMILIVPQGRHAV 62
Query: 343 RLGNEATISQKLEVKPGSLYSLTFGASRTCAQEEVLRVSVPPQTGD 480
RLGN+A ISQ L V+ G +YS+TF A+RTCAQ E + VSV D
Sbjct: 63 RLGNDAEISQDLTVEKGFVYSVTFSAARTCAQLESINVSVASVNAD 108
>TAIR9_protein||AT1G29980.1 | Symbols: | INVOLVED IN: biological_process
unknown; LOCATED IN: plasma membrane, anchored to membrane; EXPRESSED
IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS
InterPro DOMAIN/s: Protein of unknown function DUF642
(InterPro:IPR006946), Galactose-binding like (InterPro:IPR008979); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G34510.1); Has 180 Blast hits to 163 proteins in 15 species:
Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0; Plants - 170;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr1:10503411-10505994 REVERSE
Length = 408
Score = 207 bits (525), Expect = 2e-053
Identities = 99/230 (43%), Positives = 144/230 (62%)
Frame = +1
Query: 478 DLPLQTLYNSFGGDVYAWAFVPKTSEVTVIFHNPGVQEDPACGPLLDAVAIKELVHPEYT 657
++ LQTLY+ G D YAWAF + V ++F NPG+++DP CGP++D +AIK+L P+
Sbjct: 153 NVDLQTLYSVQGWDPYAWAFEAEDDHVRLVFKNPGMEDDPTCGPIIDDIAIKKLFTPDKP 212
Query: 658 KGNLVKNGGFEEGPHRLVNSTQGVLLPPKQEDVTSPLPGWIIESLKAVKFIDSKHFNVPF 837
K N V NG FE+GP N++ GVLLP ++ S LPGW +ES +AV+F+DS HF+VP
Sbjct: 213 KDNAVINGDFEDGPWMFRNTSLGVLLPTNLDEEISSLPGWTVESNRAVRFVDSDHFSVPK 272
Query: 838 GNAAIELVAGKESAIAQVIRTSPGQTYSLSFVVGDAKNDCHGSMMVEAFAARDTLKVSHT 1017
G A+EL++GKE I+Q++ T + Y LSF +G A + C + + AFA +
Sbjct: 273 GKRAVELLSGKEGIISQMVETKADKPYILSFSLGHAGDKCKEPLAIMAFAGDQAQNFHYM 332
Query: 1018 SVGGGHVKTASFRFKALEARTRITFFSGFYHTKKSDIGSLCGPVIDQIVV 1167
+ + A F A RTR+ F+S +Y+T+ D+ SLCGPVID + V
Sbjct: 333 AQANSSFEKAGLNFTAKADRTRVAFYSVYYNTRTDDMSSLCGPVIDDVRV 382
Score = 94 bits (231), Expect = 2e-019
Identities = 50/107 (46%), Positives = 64/107 (59%)
Frame = +1
Query: 160 EGYLRNGNFEESPKKTDMKKTVLIGKTALPEWETTGFVEYIAGGPQPGGMFFPVAHGVHA 339
+G + NG+FE SP V G + +P W++ G VE I G + GGM V G HA
Sbjct: 38 DGLVINGDFETSPSSGFPDDGVTDGPSDIPSWKSNGTVELINSGQKQGGMILIVPQGRHA 97
Query: 340 VRLGNEATISQKLEVKPGSLYSLTFGASRTCAQEEVLRVSVPPQTGD 480
VRLGN+A ISQ L V+ G +YS+TF A+RTCAQ E + VSV D
Sbjct: 98 VRLGNDAEISQDLTVEKGFVYSVTFSAARTCAQLESINVSVASVNAD 144
>TAIR9_protein||AT5G14150.1 | Symbols: | unknown protein | chr5:4565246-4566653
REVERSE
Length = 384
Score = 136 bits (340), Expect = 5e-032
Identities = 113/371 (30%), Positives = 170/371 (45%), Gaps = 41/371 (11%)
Frame = +1
Query: 115 LFLVLCGAALGASAYEGYLRNGNFEES----PKKTDMKKTVLIGKTALPEWETTGFVEYI 282
+FL+L + A +L N +FE P ++ L + LP W G V Y+
Sbjct: 8 IFLLL---LVSCCASSDFLENPDFESPPLNLPTNSNASSVSLDQNSTLPGWTFQGTVLYV 64
Query: 283 AGGPQPGGMFFPVAHGVHAVRLGNEATISQKLEVKPGSL-YSLTFG---ASRTCAQEEVL 450
P G HAV+LG + I+Q K L Y LTF A + C L
Sbjct: 65 E-LPDTG----------HAVQLGEDGKINQTFIAKGDELNYILTFALIHAGQNCTSSAGL 113
Query: 451 RVSVPPQTGDLPLQTLYN-----SFGGDVYAWAFVPKTSEVTVIFHNPGVQED----PAC 603
VS P + Y+ S+ ++ +W + ++ + + D C
Sbjct: 114 SVSGPDSNAVFSYRQNYSKVSWQSYSHNLGSWG---NGEPINLVLESQAIDSDSDTNSTC 170
Query: 604 GPLLDAVAIKEL-VHPEYTKGNLVKNGGFEEGPHRLVNSTQGVLLPPKQEDVTSPLPGWI 780
P++D + IK + V GNL+ NGGFE GP L NST GVL+ + SPL W
Sbjct: 171 WPIIDTLLIKTVGVTLVQDSGNLLINGGFESGPGFLPNSTDGVLIDAVPSLIQSPLRQWS 230
Query: 781 IESLKAVKFIDSKHFNVPFGNAAIELVAGKESAIAQVIR--TSPGQTYSLSFVVGDAKND 954
+ + V++IDS+HF+VP G AAIE+++ + Q TS G Y+L+F +GDA +
Sbjct: 231 V--IGTVRYIDSEHFHVPEGKAAIEILSNTAPSGIQTATKGTSEGSRYNLTFTLGDANDA 288
Query: 955 CHGSMMVEAFAARDTLKVSHTSVGGGHVKTASFRFKALEARTRITFFSGFYHTKKSDIGS 1134
C G +V A A T + S G G + F+A + +I+F S Y +
Sbjct: 289 CRGHFVVGAQAGSVTQNFTLESNGTGSGEKFGLVFEADKDAAQISFTS--YSVTMTKENV 346
Query: 1135 LCGPVIDQIVV 1167
+CGPVID+++V
Sbjct: 347 VCGPVIDEVMV 357
Database: TAIR9 protein
Posted date: Wed Jul 08 15:16:08 2009
Number of letters in database: 13,468,323
Number of sequences in database: 33,410
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 13,834,117,621
Number of Sequences: 33410
Number of Extensions: 13834117621
Number of Successful Extensions: 450061051
Number of sequences better than 0.0: 0