Library    |     Search    |     Batch query    |     SNP    |     SSR  

TAIR blast output of UN26813


BLASTX 7.6.2

Query= UN26813 /QuerySize=1358
        (1357 letters)

Database: TAIR9 protein;
          33,410 sequences; 13,468,323 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

TAIR9_protein||AT3G08030.1 | Symbols:  | unknown protein | chr3:...    687   6e-198
TAIR9_protein||AT3G08030.2 | Symbols:  | unknown protein | chr3:...    615   3e-176
TAIR9_protein||AT2G41800.1 | Symbols:  | FUNCTIONS IN: molecular...    436   3e-122
TAIR9_protein||AT2G41810.1 | Symbols:  | FUNCTIONS IN: molecular...    431   8e-121
TAIR9_protein||AT5G25460.1 | Symbols:  | unknown protein | chr5:...    400   2e-111
TAIR9_protein||AT5G11420.1 | Symbols:  | FUNCTIONS IN: molecular...    395   4e-110
TAIR9_protein||AT4G32460.1 | Symbols:  | FUNCTIONS IN: molecular...    395   5e-110
TAIR9_protein||AT4G32460.2 | Symbols:  | FUNCTIONS IN: molecular...    395   5e-110
TAIR9_protein||AT1G80240.1 | Symbols:  | unknown protein | chr1:...    377   1e-104
TAIR9_protein||AT2G34510.1 | Symbols:  | FUNCTIONS IN: molecular...    292   3e-079
TAIR9_protein||AT1G29980.2 | Symbols:  | INVOLVED IN: biological...    207   2e-053
TAIR9_protein||AT1G29980.1 | Symbols:  | INVOLVED IN: biological...    207   2e-053
TAIR9_protein||AT5G14150.1 | Symbols:  | unknown protein | chr5:...    136   5e-032

>TAIR9_protein||AT3G08030.1 | Symbols:  | unknown protein | chr3:2564191-2565819
        FORWARD

          Length = 366

 Score =  687 bits (1771), Expect = 6e-198
 Identities = 335/366 (91%), Positives = 354/366 (96%)
 Frame = +1

Query:   85 MAVPKAIVLPLFLVLCGAALGASAYEGYLRNGNFEESPKKTDMKKTVLIGKTALPEWETT 264
            MAVPKAI+LP+ L++CGAALGA A EGYLRNGNFEESPKKTDMKKTVL+GK ALPEWETT
Sbjct:    1 MAVPKAIILPILLLICGAALGAPASEGYLRNGNFEESPKKTDMKKTVLLGKNALPEWETT 60

Query:  265 GFVEYIAGGPQPGGMFFPVAHGVHAVRLGNEATISQKLEVKPGSLYSLTFGASRTCAQEE 444
            GFVEYIAGGPQPGGM+FPVAHGVHAVRLGNEATISQKLEVKPGSLY+LTFGASRTCAQ+E
Sbjct:   61 GFVEYIAGGPQPGGMYFPVAHGVHAVRLGNEATISQKLEVKPGSLYALTFGASRTCAQDE 120

Query:  445 VLRVSVPPQTGDLPLQTLYNSFGGDVYAWAFVPKTSEVTVIFHNPGVQEDPACGPLLDAV 624
            VLRVSVP Q+GDLPLQTLYNSFGGDVYAWAFV KTS+VTV FHNPGVQEDPACGPLLDAV
Sbjct:  121 VLRVSVPSQSGDLPLQTLYNSFGGDVYAWAFVAKTSQVTVTFHNPGVQEDPACGPLLDAV 180

Query:  625 AIKELVHPEYTKGNLVKNGGFEEGPHRLVNSTQGVLLPPKQEDVTSPLPGWIIESLKAVK 804
            AIKELVHP YT+GNLVKNGGFEEGPHRLVNSTQGVLLPPKQED+TSPLPGWIIESLKAVK
Sbjct:  181 AIKELVHPIYTRGNLVKNGGFEEGPHRLVNSTQGVLLPPKQEDLTSPLPGWIIESLKAVK 240

Query:  805 FIDSKHFNVPFGNAAIELVAGKESAIAQVIRTSPGQTYSLSFVVGDAKNDCHGSMMVEAF 984
            FIDSK+FNVPFG+AAIELVAGKESAIAQVIRTSPGQTY+LSFVVGDAKNDCHGSMMVEAF
Sbjct:  241 FIDSKYFNVPFGHAAIELVAGKESAIAQVIRTSPGQTYTLSFVVGDAKNDCHGSMMVEAF 300

Query:  985 AARDTLKVSHTSVGGGHVKTASFRFKALEARTRITFFSGFYHTKKSDIGSLCGPVIDQIV 1164
            AARDTLKV HTSVGGGHVKTASF+FKA+EARTRITFFSGFYHTKK+D  SLCGPVID+IV
Sbjct:  301 AARDTLKVPHTSVGGGHVKTASFKFKAVEARTRITFFSGFYHTKKTDTVSLCGPVIDEIV 360

Query: 1165 VSHVA* 1182
            VSHVA*
Sbjct:  361 VSHVA* 366

>TAIR9_protein||AT3G08030.2 | Symbols:  | unknown protein | chr3:2564517-2565819
        FORWARD

          Length = 324

 Score =  615 bits (1584), Expect = 3e-176
 Identities = 300/324 (92%), Positives = 315/324 (97%)
 Frame = +1

Query:  211 MKKTVLIGKTALPEWETTGFVEYIAGGPQPGGMFFPVAHGVHAVRLGNEATISQKLEVKP 390
            MKKTVL+GK ALPEWETTGFVEYIAGGPQPGGM+FPVAHGVHAVRLGNEATISQKLEVKP
Sbjct:    1 MKKTVLLGKNALPEWETTGFVEYIAGGPQPGGMYFPVAHGVHAVRLGNEATISQKLEVKP 60

Query:  391 GSLYSLTFGASRTCAQEEVLRVSVPPQTGDLPLQTLYNSFGGDVYAWAFVPKTSEVTVIF 570
            GSLY+LTFGASRTCAQ+EVLRVSVP Q+GDLPLQTLYNSFGGDVYAWAFV KTS+VTV F
Sbjct:   61 GSLYALTFGASRTCAQDEVLRVSVPSQSGDLPLQTLYNSFGGDVYAWAFVAKTSQVTVTF 120

Query:  571 HNPGVQEDPACGPLLDAVAIKELVHPEYTKGNLVKNGGFEEGPHRLVNSTQGVLLPPKQE 750
            HNPGVQEDPACGPLLDAVAIKELVHP YT+GNLVKNGGFEEGPHRLVNSTQGVLLPPKQE
Sbjct:  121 HNPGVQEDPACGPLLDAVAIKELVHPIYTRGNLVKNGGFEEGPHRLVNSTQGVLLPPKQE 180

Query:  751 DVTSPLPGWIIESLKAVKFIDSKHFNVPFGNAAIELVAGKESAIAQVIRTSPGQTYSLSF 930
            D+TSPLPGWIIESLKAVKFIDSK+FNVPFG+AAIELVAGKESAIAQVIRTSPGQTY+LSF
Sbjct:  181 DLTSPLPGWIIESLKAVKFIDSKYFNVPFGHAAIELVAGKESAIAQVIRTSPGQTYTLSF 240

Query:  931 VVGDAKNDCHGSMMVEAFAARDTLKVSHTSVGGGHVKTASFRFKALEARTRITFFSGFYH 1110
            VVGDAKNDCHGSMMVEAFAARDTLKV HTSVGGGHVKTASF+FKA+EARTRITFFSGFYH
Sbjct:  241 VVGDAKNDCHGSMMVEAFAARDTLKVPHTSVGGGHVKTASFKFKAVEARTRITFFSGFYH 300

Query: 1111 TKKSDIGSLCGPVIDQIVVSHVA* 1182
            TKK+D  SLCGPVID+IVVSHVA*
Sbjct:  301 TKKTDTVSLCGPVIDEIVVSHVA* 324

>TAIR9_protein||AT2G41800.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cell
        wall, plant-type cell wall; EXPRESSED IN: 7 plant structures; EXPRESSED
        DURING: 4 anthesis, C globular stage, petal differentiation and
        expansion stage; CONTAINS InterPro DOMAIN/s: Protein of unknown
        function DUF642 (InterPro:IPR006946), Galactose-binding like
        (InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
        unknown protein (TAIR:AT2G41810.1); Has 156 Blast hits to 155 proteins
        in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants
        - 156; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr2:17436671-17438005 REVERSE

          Length = 371

 Score =  436 bits (1119), Expect = 3e-122
 Identities = 209/337 (62%), Positives = 253/337 (75%)
 Frame = +1

Query:  160 EGYLRNGNFEESPKKTDMKKTVLIGKTALPEWETTGFVEYIAGGPQPGGMFFPVAHGVHA 339
            +G L NGNFE +P K++MK   +IG  +LP WE  G VE ++GGPQPGG +FPV  GVHA
Sbjct:   31 DGILPNGNFEITPLKSNMKGRQIIGANSLPHWEIAGHVELVSGGPQPGGFYFPVPRGVHA 90

Query:  340 VRLGNEATISQKLEVKPGSLYSLTFGASRTCAQEEVLRVSVPPQTGDLPLQTLYNSFGGD 519
            VRLGN  TISQ + VK G +YSLTFGA+RTCAQ+E ++VSVP Q  +LPLQT+++S GGD
Sbjct:   91 VRLGNLGTISQNVRVKSGLVYSLTFGATRTCAQDENIKVSVPGQANELPLQTVFSSDGGD 150

Query:  520 VYAWAFVPKTSEVTVIFHNPGVQEDPACGPLLDAVAIKELVHPEYTKGNLVKNGGFEEGP 699
             YAWAF   +  V V FHNPGVQED  CGPLLD VAIKE++   YT+GNLVKNGGFE GP
Sbjct:  151 TYAWAFKATSDVVKVTFHNPGVQEDRTCGPLLDVVAIKEILPLRYTRGNLVKNGGFEIGP 210

Query:  700 HRLVNSTQGVLLPPKQEDVTSPLPGWIIESLKAVKFIDSKHFNVPFGNAAIELVAGKESA 879
            H   N + G+L+P + +D  SPLPGWI+ESLK VK+ID +HF VP+G  A+ELVAG+ESA
Sbjct:  211 HVFANFSTGILIPARIQDFISPLPGWIVESLKPVKYIDRRHFKVPYGQGAVELVAGRESA 270

Query:  880 IAQVIRTSPGQTYSLSFVVGDAKNDCHGSMMVEAFAARDTLKVSHTSVGGGHVKTASFRF 1059
            IAQ+IRT  G+ Y LSF VGDA+N CHGSMMVEAFA R+  K+S  S G G  KT  FRF
Sbjct:  271 IAQIIRTIAGKAYMLSFAVGDAQNGCHGSMMVEAFAGREPFKLSFMSEGKGAFKTGHFRF 330

Query: 1060 KALEARTRITFFSGFYHTKKSDIGSLCGPVIDQIVVS 1170
             A   RTR+TF+S FYHTK  D G LCGPV+D +VV+
Sbjct:  331 VADSDRTRLTFYSAFYHTKLHDFGHLCGPVLDSVVVT 367

>TAIR9_protein||AT2G41810.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        endomembrane system; EXPRESSED IN: root; CONTAINS InterPro DOMAIN/s:
        Protein of unknown function DUF642 (InterPro:IPR006946),
        Galactose-binding like (InterPro:IPR008979); BEST Arabidopsis thaliana
        protein match is: unknown protein (TAIR:AT2G41800.1); Has 161 Blast
        hits to 157 proteins in 12 species: Archae - 0; Bacteria - 2; Metazoa -
        0; Fungi - 0; Plants - 159; Viruses - 0; Other Eukaryotes - 0 (source:
        NCBI BLink). | chr2:17439414-17441296 REVERSE

          Length = 371

 Score =  431 bits (1106), Expect = 8e-121
 Identities = 208/336 (61%), Positives = 255/336 (75%)
 Frame = +1

Query:  160 EGYLRNGNFEESPKKTDMKKTVLIGKTALPEWETTGFVEYIAGGPQPGGMFFPVAHGVHA 339
            +G L NGNFE+ P K++M+K  +IGK +LP WE +G VE ++GGPQPGG +F V  GVHA
Sbjct:   31 DGLLPNGNFEQIPNKSNMRKRQIIGKYSLPHWEISGHVELVSGGPQPGGFYFAVPRGVHA 90

Query:  340 VRLGNEATISQKLEVKPGSLYSLTFGASRTCAQEEVLRVSVPPQTGDLPLQTLYNSFGGD 519
             RLGN A+ISQ ++VK G +YSLTFG +RTCAQ+E +R+SVP QT +LP+QTL+++ GGD
Sbjct:   91 ARLGNLASISQYVKVKSGLVYSLTFGVTRTCAQDENIRISVPGQTNELPIQTLFSTNGGD 150

Query:  520 VYAWAFVPKTSEVTVIFHNPGVQEDPACGPLLDAVAIKELVHPEYTKGNLVKNGGFEEGP 699
             YAWAF   +  V V F+NPGVQEDP CGP++DAVAIKE++   YTKGNLVKNGGFE GP
Sbjct:  151 TYAWAFKATSDLVKVTFYNPGVQEDPTCGPIVDAVAIKEILPLRYTKGNLVKNGGFETGP 210

Query:  700 HRLVNSTQGVLLPPKQEDVTSPLPGWIIESLKAVKFIDSKHFNVPFGNAAIELVAGKESA 879
            H   N + G+L+P K +D+ SPLPGWI+ESLK VK+ID++HF VP G AAIELVAG+ESA
Sbjct:  211 HVFSNFSTGILIPAKIQDLISPLPGWIVESLKPVKYIDNRHFKVPSGLAAIELVAGRESA 270

Query:  880 IAQVIRTSPGQTYSLSFVVGDAKNDCHGSMMVEAFAARDTLKVSHTSVGGGHVKTASFRF 1059
            IAQ+IRT  G+ Y LSFVVGDA N CHGSMMVEAFA     KV+  S   G  K   F F
Sbjct:  271 IAQIIRTVSGKNYILSFVVGDAHNGCHGSMMVEAFAGISAFKVTFESNDKGAFKVGRFAF 330

Query: 1060 KALEARTRITFFSGFYHTKKSDIGSLCGPVIDQIVV 1167
            +A   RTRITF+SGFYHTK  D G LCGPV+D + V
Sbjct:  331 RADSNRTRITFYSGFYHTKLHDFGHLCGPVLDNVSV 366

>TAIR9_protein||AT5G25460.1 | Symbols:  | unknown protein | chr5:8863430-8865394
        FORWARD

          Length = 370

 Score =  400 bits (1026), Expect = 2e-111
 Identities = 203/356 (57%), Positives = 251/356 (70%), Gaps = 4/356 (1%)
 Frame = +1

Query:  106 VLPLFLVLCGAALGA----SAYEGYLRNGNFEESPKKTDMKKTVLIGKTALPEWETTGFV 273
            V+  FL+    A+ A    S  +G L NG+FE  PK +DMK T ++ K A+P WE TGFV
Sbjct:    6 VVSFFLLFIATAMAAKSTVSFRDGMLPNGDFELGPKPSDMKGTEILNKLAIPNWEVTGFV 65

Query:  274 EYIAGGPQPGGMFFPVAHGVHAVRLGNEATISQKLEVKPGSLYSLTFGASRTCAQEEVLR 453
            EYI  G + G M   V  G  AVRLGNEA+I Q+L+V  G  YSLTF A+RTCAQ+E L 
Sbjct:   66 EYIKSGHKQGDMLLVVPAGKFAVRLGNEASIKQRLKVVKGMYYSLTFSAARTCAQDERLN 125

Query:  454 VSVPPQTGDLPLQTLYNSFGGDVYAWAFVPKTSEVTVIFHNPGVQEDPACGPLLDAVAIK 633
            +SV P +G +P+QT+Y+S G D+YAWAF  ++    V+ HNPGV+EDPACGPL+D VA++
Sbjct:  126 ISVAPDSGVIPIQTVYSSSGWDLYAWAFQAESDVAEVVIHNPGVEEDPACGPLIDGVAMR 185

Query:  634 ELVHPEYTKGNLVKNGGFEEGPHRLVNSTQGVLLPPKQEDVTSPLPGWIIESLKAVKFID 813
             L  P  T  N++KNGGFEEGP  L  ST GVL+PP  ED  SPLPGW++ESLKAVK++D
Sbjct:  186 SLYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHSPLPGWMVESLKAVKYVD 245

Query:  814 SKHFNVPFGNAAIELVAGKESAIAQVIRTSPGQTYSLSFVVGDAKNDCHGSMMVEAFAAR 993
             +HF+VP G  AIELVAGKESAIAQV+RT  G+TY LSF VGDA N C GSM+VEAFA +
Sbjct:  246 VEHFSVPQGRRAIELVAGKESAIAQVVRTVIGKTYVLSFAVGDANNACKGSMVVEAFAGK 305

Query:  994 DTLKVSHTSVGGGHVKTASFRFKALEARTRITFFSGFYHTKKSDIGSLCGPVIDQI 1161
            DTLKV + S G G  K AS RF A+  R+RI F+S FY  +  D  SLCGPVID +
Sbjct:  306 DTLKVPYESKGTGGFKRASIRFVAVSTRSRIMFYSTFYAMRSDDFSSLCGPVIDDV 361

>TAIR9_protein||AT5G11420.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cell
        wall, plant-type cell wall; EXPRESSED IN: 23 plant structures;
        EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein
        of unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
        (InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
        unknown protein (TAIR:AT5G25460.1); Has 185 Blast hits to 157 proteins
        in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants
        - 185; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr5:3644655-3646991 FORWARD

          Length = 367

 Score =  395 bits (1014), Expect = 4e-110
 Identities = 196/350 (56%), Positives = 247/350 (70%), Gaps = 1/350 (0%)
 Frame = +1

Query:  115 LFLVLCGAALGASAY-EGYLRNGNFEESPKKTDMKKTVLIGKTALPEWETTGFVEYIAGG 291
            LF++L         + +G L NG+FE  PK +DMK T +I K A+P WE +GFVEYI  G
Sbjct:    9 LFVLLIATITSVICFSDGMLPNGDFELGPKPSDMKGTQVINKKAIPSWELSGFVEYIKSG 68

Query:  292 PQPGGMFFPVAHGVHAVRLGNEATISQKLEVKPGSLYSLTFGASRTCAQEEVLRVSVPPQ 471
             + G M   V  G  A+RLGNEA+I Q+L V  G  YSLTF A+RTCAQ+E L +SV P 
Sbjct:   69 QKQGDMLLVVPAGKFAIRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDERLNISVAPD 128

Query:  472 TGDLPLQTLYNSFGGDVYAWAFVPKTSEVTVIFHNPGVQEDPACGPLLDAVAIKELVHPE 651
            +G +P+QT+Y+S G D+YAWAF  +++   ++ HNPG +EDPACGPL+D VAIK L  P 
Sbjct:  129 SGVIPIQTVYSSSGWDLYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVAIKALYPPR 188

Query:  652 YTKGNLVKNGGFEEGPHRLVNSTQGVLLPPKQEDVTSPLPGWIIESLKAVKFIDSKHFNV 831
             T  N++KNGGFEEGP+ L N+T GVL+PP  ED  SPLP W++ESLKA+K++D +HF+V
Sbjct:  189 PTNKNILKNGGFEEGPYVLPNATTGVLVPPFIEDDHSPLPAWMVESLKAIKYVDVEHFSV 248

Query:  832 PFGNAAIELVAGKESAIAQVIRTSPGQTYSLSFVVGDAKNDCHGSMMVEAFAARDTLKVS 1011
            P G  A+ELVAGKESAIAQV RT  G+TY LSF VGDA N C GSM+VEAFA +DTLKV 
Sbjct:  249 PQGRRAVELVAGKESAIAQVARTVVGKTYVLSFAVGDANNACQGSMVVEAFAGKDTLKVP 308

Query: 1012 HTSVGGGHVKTASFRFKALEARTRITFFSGFYHTKKSDIGSLCGPVIDQI 1161
            + S G G  K AS RF A+  RTR+ F+S FY  +  D  SLCGPVID +
Sbjct:  309 YESRGKGGFKRASLRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDV 358

>TAIR9_protein||AT4G32460.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        plant-type cell wall; EXPRESSED IN: 24 plant structures; EXPRESSED
        DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Protein of
        unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
        (InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
        unknown protein (TAIR:AT5G11420.1); Has 182 Blast hits to 158 proteins
        in 12 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants
        - 180; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr4:15663036-15664859 REVERSE

          Length = 366

 Score =  395 bits (1013), Expect = 5e-110
 Identities = 192/334 (57%), Positives = 242/334 (72%)
 Frame = +1

Query:  160 EGYLRNGNFEESPKKTDMKKTVLIGKTALPEWETTGFVEYIAGGPQPGGMFFPVAHGVHA 339
            +G L NG+FE  P+ +DMK T +I  TA+P WE +GFVEYI  G + G M   V  G  A
Sbjct:   24 DGLLPNGDFELGPRHSDMKGTQVINITAIPNWELSGFVEYIPSGHKQGDMILVVPKGAFA 83

Query:  340 VRLGNEATISQKLEVKPGSLYSLTFGASRTCAQEEVLRVSVPPQTGDLPLQTLYNSFGGD 519
            VRLGNEA+I QK+ VK GS YS+TF A+RTCAQ+E L VSV P    +P+QT+Y+S G D
Sbjct:   84 VRLGNEASIKQKISVKKGSYYSITFSAARTCAQDERLNVSVAPHHAVMPIQTVYSSSGWD 143

Query:  520 VYAWAFVPKTSEVTVIFHNPGVQEDPACGPLLDAVAIKELVHPEYTKGNLVKNGGFEEGP 699
            +Y+WAF  ++    ++ HNPGV+EDPACGPL+D VA++ L  P  T  N++KNGGFEEGP
Sbjct:  144 LYSWAFKAQSDYADIVIHNPGVEEDPACGPLIDGVAMRALFPPRPTNKNILKNGGFEEGP 203

Query:  700 HRLVNSTQGVLLPPKQEDVTSPLPGWIIESLKAVKFIDSKHFNVPFGNAAIELVAGKESA 879
              L N + GVL+PP   D  SPLPGW++ESLKAVK+IDS HF+VP G  A+ELVAGKESA
Sbjct:  204 WVLPNISSGVLIPPNSIDDHSPLPGWMVESLKAVKYIDSDHFSVPQGRRAVELVAGKESA 263

Query:  880 IAQVIRTSPGQTYSLSFVVGDAKNDCHGSMMVEAFAARDTLKVSHTSVGGGHVKTASFRF 1059
            +AQV+RT PG+TY LSF VGDA N C GSM+VEAFA +DT+KV + S G G  K +S RF
Sbjct:  264 VAQVVRTIPGKTYVLSFSVGDASNACAGSMIVEAFAGKDTIKVPYESKGKGGFKRSSLRF 323

Query: 1060 KALEARTRITFFSGFYHTKKSDIGSLCGPVIDQI 1161
             A+ +RTR+ F+S FY  +  D  SLCGPVID +
Sbjct:  324 VAVSSRTRVMFYSTFYAMRNDDFSSLCGPVIDDV 357

>TAIR9_protein||AT4G32460.2 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        plant-type cell wall; EXPRESSED IN: 24 plant structures; EXPRESSED
        DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Protein of
        unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
        (InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
        unknown protein (TAIR:AT5G11420.1); Has 182 Blast hits to 158 proteins
        in 12 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants
        - 180; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr4:15663036-15664859 REVERSE

          Length = 366

 Score =  395 bits (1013), Expect = 5e-110
 Identities = 192/334 (57%), Positives = 242/334 (72%)
 Frame = +1

Query:  160 EGYLRNGNFEESPKKTDMKKTVLIGKTALPEWETTGFVEYIAGGPQPGGMFFPVAHGVHA 339
            +G L NG+FE  P+ +DMK T +I  TA+P WE +GFVEYI  G + G M   V  G  A
Sbjct:   24 DGLLPNGDFELGPRHSDMKGTQVINITAIPNWELSGFVEYIPSGHKQGDMILVVPKGAFA 83

Query:  340 VRLGNEATISQKLEVKPGSLYSLTFGASRTCAQEEVLRVSVPPQTGDLPLQTLYNSFGGD 519
            VRLGNEA+I QK+ VK GS YS+TF A+RTCAQ+E L VSV P    +P+QT+Y+S G D
Sbjct:   84 VRLGNEASIKQKISVKKGSYYSITFSAARTCAQDERLNVSVAPHHAVMPIQTVYSSSGWD 143

Query:  520 VYAWAFVPKTSEVTVIFHNPGVQEDPACGPLLDAVAIKELVHPEYTKGNLVKNGGFEEGP 699
            +Y+WAF  ++    ++ HNPGV+EDPACGPL+D VA++ L  P  T  N++KNGGFEEGP
Sbjct:  144 LYSWAFKAQSDYADIVIHNPGVEEDPACGPLIDGVAMRALFPPRPTNKNILKNGGFEEGP 203

Query:  700 HRLVNSTQGVLLPPKQEDVTSPLPGWIIESLKAVKFIDSKHFNVPFGNAAIELVAGKESA 879
              L N + GVL+PP   D  SPLPGW++ESLKAVK+IDS HF+VP G  A+ELVAGKESA
Sbjct:  204 WVLPNISSGVLIPPNSIDDHSPLPGWMVESLKAVKYIDSDHFSVPQGRRAVELVAGKESA 263

Query:  880 IAQVIRTSPGQTYSLSFVVGDAKNDCHGSMMVEAFAARDTLKVSHTSVGGGHVKTASFRF 1059
            +AQV+RT PG+TY LSF VGDA N C GSM+VEAFA +DT+KV + S G G  K +S RF
Sbjct:  264 VAQVVRTIPGKTYVLSFSVGDASNACAGSMIVEAFAGKDTIKVPYESKGKGGFKRSSLRF 323

Query: 1060 KALEARTRITFFSGFYHTKKSDIGSLCGPVIDQI 1161
             A+ +RTR+ F+S FY  +  D  SLCGPVID +
Sbjct:  324 VAVSSRTRVMFYSTFYAMRNDDFSSLCGPVIDDV 357

>TAIR9_protein||AT1G80240.1 | Symbols:  | unknown protein |
        chr1:30171520-30172799 REVERSE

          Length = 371

 Score =  377 bits (966), Expect = 1e-104
 Identities = 188/354 (53%), Positives = 241/354 (68%)
 Frame = +1

Query:  100 AIVLPLFLVLCGAALGASAYEGYLRNGNFEESPKKTDMKKTVLIGKTALPEWETTGFVEY 279
            A++L L  +     L A   +G L NGNFE  PK + MK +V+  +TA+P W   GFVE+
Sbjct:    7 ALLLALLFISSNVVLSAPVRDGLLPNGNFELGPKPSQMKGSVVKERTAVPNWNIIGFVEF 66

Query:  280 IAGGPQPGGMFFPVAHGVHAVRLGNEATISQKLEVKPGSLYSLTFGASRTCAQEEVLRVS 459
            I  G +   M   V  G  AVRLGNEA+ISQK+ V PG LYS+TF A+RTCAQ+E L +S
Sbjct:   67 IKSGQKQDDMVLVVPQGSSAVRLGNEASISQKISVLPGRLYSITFSAARTCAQDERLNIS 126

Query:  460 VPPQTGDLPLQTLYNSFGGDVYAWAFVPKTSEVTVIFHNPGVQEDPACGPLLDAVAIKEL 639
            V  ++G +P+QT+Y S G D Y+WAF     E+ + FHNPGV+E PACGPL+DAVAIK L
Sbjct:  127 VTHESGVIPIQTMYGSDGWDSYSWAFKAGGPEIEIRFHNPGVEEHPACGPLIDAVAIKAL 186

Query:  640 VHPEYTKGNLVKNGGFEEGPHRLVNSTQGVLLPPKQEDVTSPLPGWIIESLKAVKFIDSK 819
              P ++  NL+KNG FEEGP+    +  GVL+PP  ED  SPLPGW+IESLKAVK++D  
Sbjct:  187 FPPRFSGYNLIKNGNFEEGPYVFPTAKWGVLIPPFIEDDNSPLPGWMIESLKAVKYVDKA 246

Query:  820 HFNVPFGNAAIELVAGKESAIAQVIRTSPGQTYSLSFVVGDAKNDCHGSMMVEAFAARDT 999
            HF VP G+ AIELV GKESAI+Q++RTS  + Y+L+F VGDA++ C G M+VEAFA +  
Sbjct:  247 HFAVPEGHRAIELVGGKESAISQIVRTSLNKFYALTFNVGDARDGCEGPMIVEAFAGQGK 306

Query: 1000 LKVSHTSVGGGHVKTASFRFKALEARTRITFFSGFYHTKKSDIGSLCGPVIDQI 1161
            + V + S G G  +     FKA+ ARTR+TF S FYH K    GSLCGPVID +
Sbjct:  307 VMVDYASKGKGGFRRGRLVFKAVSARTRVTFLSTFYHMKSDHSGSLCGPVIDDV 360

>TAIR9_protein||AT2G34510.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN: anchored
        to membrane; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 14
        growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function
        DUF642 (InterPro:IPR006946), Galactose-binding like
        (InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
        unknown protein (TAIR:AT1G29980.1); Has 177 Blast hits to 163 proteins
        in 15 species: Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0;
        Plants - 167; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr2:14544114-14546732 REVERSE

          Length = 402

 Score =  292 bits (747), Expect = 3e-079
 Identities = 154/362 (42%), Positives = 217/362 (59%), Gaps = 7/362 (1%)
 Frame = +1

Query:  103 IVLPLFLVLCGAALGASA--YEGYLRNGNFEESPKKTDMKKTVLIGKTALPEWETTGFVE 276
            ++L L +V    + G ++   +G + NG+FE  P        ++   + +P W + G VE
Sbjct:   17 LLLGLSIVAAADSAGKTSPVEDGLVVNGDFETPPSNGFPDDAIIEDTSEIPSWRSDGTVE 76

Query:  277 YIAGGPQPGGMFFPVAHGVHAVRLGNEATISQKLEVKPGSLYSLTFGASRTCAQEEVLRV 456
             I  G + GGM   V  G HAVRLGN+A ISQ+L V+ GS+YS+TF A+RTCAQ E L V
Sbjct:   77 LIKSGQKQGGMILIVPEGRHAVRLGNDAEISQELTVEKGSIYSVTFSAARTCAQLESLNV 136

Query:  457 SV-----PPQTGDLPLQTLYNSFGGDVYAWAFVPKTSEVTVIFHNPGVQEDPACGPLLDA 621
            SV     P  +  + LQT+Y+  G D YAWAF      V ++F NPG+++DP CGP++D 
Sbjct:  137 SVASSDEPIASQTIDLQTVYSVQGWDPYAWAFEAVVDRVRLVFKNPGMEDDPTCGPIIDD 196

Query:  622 VAIKELVHPEYTKGNLVKNGGFEEGPHRLVNSTQGVLLPPKQEDVTSPLPGWIIESLKAV 801
            +A+K+L  P+  KGN V NG FEEGP    N+T GVLLP   ++  S LPGW +ES +AV
Sbjct:  197 IAVKKLFTPDKPKGNAVINGDFEEGPWMFRNTTLGVLLPTNLDEEISSLPGWTVESNRAV 256

Query:  802 KFIDSKHFNVPFGNAAIELVAGKESAIAQVIRTSPGQTYSLSFVVGDAKNDCHGSMMVEA 981
            +FIDS HF+VP G  A+EL++GKE  I+Q++ T     Y +SF +G A + C   + V A
Sbjct:  257 RFIDSDHFSVPEGKRALELLSGKEGIISQMVETKANIPYKMSFSLGHAGDKCKEPLAVMA 316

Query:  982 FAARDTLKVSHTSVGGGHVKTASFRFKALEARTRITFFSGFYHTKKSDIGSLCGPVIDQI 1161
            FA        + +      + +   F A   RTRI F+S +Y+T+  D+ SLCGPVID +
Sbjct:  317 FAGDQAQNFHYMAQANSSFERSELNFTAKAERTRIAFYSIYYNTRTDDMTSLCGPVIDDV 376

Query: 1162 VV 1167
             V
Sbjct:  377 KV 378

>TAIR9_protein||AT1G29980.2 | Symbols:  | INVOLVED IN: biological_process
        unknown; LOCATED IN: plasma membrane, anchored to membrane; EXPRESSED
        IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS
        InterPro DOMAIN/s: Protein of unknown function DUF642
        (InterPro:IPR006946), Galactose-binding like (InterPro:IPR008979); BEST
        Arabidopsis thaliana protein match is: unknown protein
        (TAIR:AT2G34510.1); Has 174 Blast hits to 163 proteins in 15 species:
        Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0; Plants - 164;
        Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr1:10503411-10504617 REVERSE

          Length = 372

 Score =  207 bits (525), Expect = 2e-053
 Identities = 99/230 (43%), Positives = 144/230 (62%)
 Frame = +1

Query:  478 DLPLQTLYNSFGGDVYAWAFVPKTSEVTVIFHNPGVQEDPACGPLLDAVAIKELVHPEYT 657
            ++ LQTLY+  G D YAWAF  +   V ++F NPG+++DP CGP++D +AIK+L  P+  
Sbjct:  117 NVDLQTLYSVQGWDPYAWAFEAEDDHVRLVFKNPGMEDDPTCGPIIDDIAIKKLFTPDKP 176

Query:  658 KGNLVKNGGFEEGPHRLVNSTQGVLLPPKQEDVTSPLPGWIIESLKAVKFIDSKHFNVPF 837
            K N V NG FE+GP    N++ GVLLP   ++  S LPGW +ES +AV+F+DS HF+VP 
Sbjct:  177 KDNAVINGDFEDGPWMFRNTSLGVLLPTNLDEEISSLPGWTVESNRAVRFVDSDHFSVPK 236

Query:  838 GNAAIELVAGKESAIAQVIRTSPGQTYSLSFVVGDAKNDCHGSMMVEAFAARDTLKVSHT 1017
            G  A+EL++GKE  I+Q++ T   + Y LSF +G A + C   + + AFA        + 
Sbjct:  237 GKRAVELLSGKEGIISQMVETKADKPYILSFSLGHAGDKCKEPLAIMAFAGDQAQNFHYM 296

Query: 1018 SVGGGHVKTASFRFKALEARTRITFFSGFYHTKKSDIGSLCGPVIDQIVV 1167
            +      + A   F A   RTR+ F+S +Y+T+  D+ SLCGPVID + V
Sbjct:  297 AQANSSFEKAGLNFTAKADRTRVAFYSVYYNTRTDDMSSLCGPVIDDVRV 346


 Score =  93 bits (229), Expect = 4e-019
 Identities = 50/106 (47%), Positives = 63/106 (59%)
 Frame = +1

Query: 163 GYLRNGNFEESPKKTDMKKTVLIGKTALPEWETTGFVEYIAGGPQPGGMFFPVAHGVHAV 342
           G + NG+FE SP        V  G + +P W++ G VE I  G + GGM   V  G HAV
Sbjct:   3 GLVINGDFETSPSSGFPDDGVTDGPSDIPSWKSNGTVELINSGQKQGGMILIVPQGRHAV 62

Query: 343 RLGNEATISQKLEVKPGSLYSLTFGASRTCAQEEVLRVSVPPQTGD 480
           RLGN+A ISQ L V+ G +YS+TF A+RTCAQ E + VSV     D
Sbjct:  63 RLGNDAEISQDLTVEKGFVYSVTFSAARTCAQLESINVSVASVNAD 108

>TAIR9_protein||AT1G29980.1 | Symbols:  | INVOLVED IN: biological_process
        unknown; LOCATED IN: plasma membrane, anchored to membrane; EXPRESSED
        IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS
        InterPro DOMAIN/s: Protein of unknown function DUF642
        (InterPro:IPR006946), Galactose-binding like (InterPro:IPR008979); BEST
        Arabidopsis thaliana protein match is: unknown protein
        (TAIR:AT2G34510.1); Has 180 Blast hits to 163 proteins in 15 species:
        Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0; Plants - 170;
        Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr1:10503411-10505994 REVERSE

          Length = 408

 Score =  207 bits (525), Expect = 2e-053
 Identities = 99/230 (43%), Positives = 144/230 (62%)
 Frame = +1

Query:  478 DLPLQTLYNSFGGDVYAWAFVPKTSEVTVIFHNPGVQEDPACGPLLDAVAIKELVHPEYT 657
            ++ LQTLY+  G D YAWAF  +   V ++F NPG+++DP CGP++D +AIK+L  P+  
Sbjct:  153 NVDLQTLYSVQGWDPYAWAFEAEDDHVRLVFKNPGMEDDPTCGPIIDDIAIKKLFTPDKP 212

Query:  658 KGNLVKNGGFEEGPHRLVNSTQGVLLPPKQEDVTSPLPGWIIESLKAVKFIDSKHFNVPF 837
            K N V NG FE+GP    N++ GVLLP   ++  S LPGW +ES +AV+F+DS HF+VP 
Sbjct:  213 KDNAVINGDFEDGPWMFRNTSLGVLLPTNLDEEISSLPGWTVESNRAVRFVDSDHFSVPK 272

Query:  838 GNAAIELVAGKESAIAQVIRTSPGQTYSLSFVVGDAKNDCHGSMMVEAFAARDTLKVSHT 1017
            G  A+EL++GKE  I+Q++ T   + Y LSF +G A + C   + + AFA        + 
Sbjct:  273 GKRAVELLSGKEGIISQMVETKADKPYILSFSLGHAGDKCKEPLAIMAFAGDQAQNFHYM 332

Query: 1018 SVGGGHVKTASFRFKALEARTRITFFSGFYHTKKSDIGSLCGPVIDQIVV 1167
            +      + A   F A   RTR+ F+S +Y+T+  D+ SLCGPVID + V
Sbjct:  333 AQANSSFEKAGLNFTAKADRTRVAFYSVYYNTRTDDMSSLCGPVIDDVRV 382


 Score =  94 bits (231), Expect = 2e-019
 Identities = 50/107 (46%), Positives = 64/107 (59%)
 Frame = +1

Query: 160 EGYLRNGNFEESPKKTDMKKTVLIGKTALPEWETTGFVEYIAGGPQPGGMFFPVAHGVHA 339
           +G + NG+FE SP        V  G + +P W++ G VE I  G + GGM   V  G HA
Sbjct:  38 DGLVINGDFETSPSSGFPDDGVTDGPSDIPSWKSNGTVELINSGQKQGGMILIVPQGRHA 97

Query: 340 VRLGNEATISQKLEVKPGSLYSLTFGASRTCAQEEVLRVSVPPQTGD 480
           VRLGN+A ISQ L V+ G +YS+TF A+RTCAQ E + VSV     D
Sbjct:  98 VRLGNDAEISQDLTVEKGFVYSVTFSAARTCAQLESINVSVASVNAD 144

>TAIR9_protein||AT5G14150.1 | Symbols:  | unknown protein | chr5:4565246-4566653
        REVERSE

          Length = 384

 Score =  136 bits (340), Expect = 5e-032
 Identities = 113/371 (30%), Positives = 170/371 (45%), Gaps = 41/371 (11%)
 Frame = +1

Query:  115 LFLVLCGAALGASAYEGYLRNGNFEES----PKKTDMKKTVLIGKTALPEWETTGFVEYI 282
            +FL+L    +   A   +L N +FE      P  ++     L   + LP W   G V Y+
Sbjct:    8 IFLLL---LVSCCASSDFLENPDFESPPLNLPTNSNASSVSLDQNSTLPGWTFQGTVLYV 64

Query:  283 AGGPQPGGMFFPVAHGVHAVRLGNEATISQKLEVKPGSL-YSLTFG---ASRTCAQEEVL 450
               P  G          HAV+LG +  I+Q    K   L Y LTF    A + C     L
Sbjct:   65 E-LPDTG----------HAVQLGEDGKINQTFIAKGDELNYILTFALIHAGQNCTSSAGL 113

Query:  451 RVSVPPQTGDLPLQTLYN-----SFGGDVYAWAFVPKTSEVTVIFHNPGVQED----PAC 603
             VS P        +  Y+     S+  ++ +W        + ++  +  +  D      C
Sbjct:  114 SVSGPDSNAVFSYRQNYSKVSWQSYSHNLGSWG---NGEPINLVLESQAIDSDSDTNSTC 170

Query:  604 GPLLDAVAIKEL-VHPEYTKGNLVKNGGFEEGPHRLVNSTQGVLLPPKQEDVTSPLPGWI 780
             P++D + IK + V      GNL+ NGGFE GP  L NST GVL+      + SPL  W 
Sbjct:  171 WPIIDTLLIKTVGVTLVQDSGNLLINGGFESGPGFLPNSTDGVLIDAVPSLIQSPLRQWS 230

Query:  781 IESLKAVKFIDSKHFNVPFGNAAIELVAGKESAIAQVIR--TSPGQTYSLSFVVGDAKND 954
            +  +  V++IDS+HF+VP G AAIE+++    +  Q     TS G  Y+L+F +GDA + 
Sbjct:  231 V--IGTVRYIDSEHFHVPEGKAAIEILSNTAPSGIQTATKGTSEGSRYNLTFTLGDANDA 288

Query:  955 CHGSMMVEAFAARDTLKVSHTSVGGGHVKTASFRFKALEARTRITFFSGFYHTKKSDIGS 1134
            C G  +V A A   T   +  S G G  +     F+A +   +I+F S  Y    +    
Sbjct:  289 CRGHFVVGAQAGSVTQNFTLESNGTGSGEKFGLVFEADKDAAQISFTS--YSVTMTKENV 346

Query: 1135 LCGPVIDQIVV 1167
            +CGPVID+++V
Sbjct:  347 VCGPVIDEVMV 357

  Database: TAIR9 protein
    Posted date:  Wed Jul 08 15:16:08 2009
  Number of letters in database: 13,468,323
  Number of sequences in database:  33,410

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 13,834,117,621
Number of Sequences: 33410
Number of Extensions: 13834117621
Number of Successful Extensions: 450061051
Number of sequences better than 0.0: 0