TAIR blast output of UN18527


BLASTX 7.6.2

Query= UN18527 /QuerySize=1351
        (1350 letters)

Database: TAIR9 protein;
          33,410 sequences; 13,468,323 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

TAIR9_protein||AT5G25460.1 | Symbols:  | unknown protein | chr5:...    519   2e-147
TAIR9_protein||AT5G11420.1 | Symbols:  | FUNCTIONS IN: molecular...    507   9e-144
TAIR9_protein||AT4G32460.1 | Symbols:  | FUNCTIONS IN: molecular...    463   1e-130
TAIR9_protein||AT4G32460.2 | Symbols:  | FUNCTIONS IN: molecular...    463   1e-130
TAIR9_protein||AT1G80240.1 | Symbols:  | unknown protein | chr1:...    363   2e-100
TAIR9_protein||AT3G08030.1 | Symbols:  | unknown protein | chr3:...    313   2e-085
TAIR9_protein||AT3G08030.2 | Symbols:  | unknown protein | chr3:...    313   2e-085
TAIR9_protein||AT2G41800.1 | Symbols:  | FUNCTIONS IN: molecular...    312   3e-085
TAIR9_protein||AT2G41810.1 | Symbols:  | FUNCTIONS IN: molecular...    304   1e-082
TAIR9_protein||AT2G34510.1 | Symbols:  | FUNCTIONS IN: molecular...    265   8e-071
TAIR9_protein||AT1G29980.2 | Symbols:  | INVOLVED IN: biological...    239   4e-063
TAIR9_protein||AT1G29980.1 | Symbols:  | INVOLVED IN: biological...    239   4e-063
TAIR9_protein||AT5G14150.1 | Symbols:  | unknown protein | chr5:...    133   3e-031

>TAIR9_protein||AT5G25460.1 | Symbols:  | unknown protein | chr5:8863430-8865394
        FORWARD

          Length = 370

 Score =  519 bits (1335), Expect = 2e-147
 Identities = 252/266 (94%), Positives = 263/266 (98%)
 Frame = -2

Query: 995 VKGMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAESEVAEI 816
           VKGMYYSLTFSAARTCAQDERLNISVAPDSGVIP+QTVYSSSGWDLYAWAFQAES+VAE+
Sbjct: 103 VKGMYYSLTFSAARTCAQDERLNISVAPDSGVIPIQTVYSSSGWDLYAWAFQAESDVAEV 162

Query: 815 VIHNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPF 636
           VIHNPG EEDPACGPLIDGVAMR+LYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPF
Sbjct: 163 VIHNPGVEEDPACGPLIDGVAMRSLYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPF 222

Query: 635 IEDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTIIGKTYVL 456
           IEDDH+PLPGWMVESLKAVKYVD EHFSVPQGRRAIELVAGKESAIAQV RT+IGKTYVL
Sbjct: 223 IEDDHSPLPGWMVESLKAVKYVDVEHFSVPQGRRAIELVAGKESAIAQVVRTVIGKTYVL 282

Query: 455 SFAVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMFYSTF 276
           SFAVGDANNACKGSMVVEAFAG+DTLKVPYES+GTGGFKRASIRFVAVSTR+R+MFYSTF
Sbjct: 283 SFAVGDANNACKGSMVVEAFAGKDTLKVPYESKGTGGFKRASIRFVAVSTRSRIMFYSTF 342

Query: 275 YAMRSDDFSSLCGPVIDDVKLLSVRK 198
           YAMRSDDFSSLCGPVIDDVKL+SVRK
Sbjct: 343 YAMRSDDFSSLCGPVIDDVKLISVRK 368


 Score =  165 bits (416), Expect = 8e-041
 Identities = 84/101 (83%), Positives = 90/101 (89%), Gaps = 2/101 (1%)
 Frame = -1

Query: 1293 MWGVTVVSLVLLLIATANSA--VVPFRDGILPNGDFELGPKSSDMKGTEIINKMAIPSWE 1120
            M GVTVVS  LL IATA +A   V FRDG+LPNGDFELGPK SDMKGTEI+NK+AIP+WE
Sbjct:    1 MEGVTVVSFFLLFIATAMAAKSTVSFRDGMLPNGDFELGPKPSDMKGTEILNKLAIPNWE 60

Query: 1119 VTGFVEYISSGHKQGDMLLVVPAGKFAVRLGNEASIKQRIK 997
            VTGFVEYI SGHKQGDMLLVVPAGKFAVRLGNEASIKQR+K
Sbjct:   61 VTGFVEYIKSGHKQGDMLLVVPAGKFAVRLGNEASIKQRLK 101

>TAIR9_protein||AT5G11420.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cell
        wall, plant-type cell wall; EXPRESSED IN: 23 plant structures;
        EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein
        of unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
        (InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
        unknown protein (TAIR:AT5G25460.1); Has 185 Blast hits to 157 proteins
        in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants
        - 185; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr5:3644655-3646991 FORWARD

          Length = 367

 Score =  507 bits (1304), Expect = 9e-144
 Identities = 244/265 (92%), Positives = 258/265 (97%)
 Frame = -2

Query: 992 KGMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAESEVAEIV 813
           KGMYYSLTFSAARTCAQDERLNISVAPDSGVIP+QTVYSSSGWDLYAWAFQAES VAEIV
Sbjct: 101 KGMYYSLTFSAARTCAQDERLNISVAPDSGVIPIQTVYSSSGWDLYAWAFQAESNVAEIV 160

Query: 812 IHNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFI 633
           IHNPGEEEDPACGPLIDGVA++ALYPPRPTNKNILKNGGFEEGP VLP +TTGVL+PPFI
Sbjct: 161 IHNPGEEEDPACGPLIDGVAIKALYPPRPTNKNILKNGGFEEGPYVLPNATTGVLVPPFI 220

Query: 632 EDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTIIGKTYVLS 453
           EDDH+PLP WMVESLKA+KYVD EHFSVPQGRRA+ELVAGKESAIAQVART++GKTYVLS
Sbjct: 221 EDDHSPLPAWMVESLKAIKYVDVEHFSVPQGRRAVELVAGKESAIAQVARTVVGKTYVLS 280

Query: 452 FAVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMFYSTFY 273
           FAVGDANNAC+GSMVVEAFAG+DTLKVPYESRG GGFKRAS+RFVAVSTRTRVMFYSTFY
Sbjct: 281 FAVGDANNACQGSMVVEAFAGKDTLKVPYESRGKGGFKRASLRFVAVSTRTRVMFYSTFY 340

Query: 272 AMRSDDFSSLCGPVIDDVKLLSVRK 198
           +MRSDDFSSLCGPVIDDVKLLS RK
Sbjct: 341 SMRSDDFSSLCGPVIDDVKLLSARK 365


 Score =  144 bits (362), Expect = 2e-034
 Identities = 72/96 (75%), Positives = 83/96 (86%), Gaps = 1/96 (1%)
 Frame = -1

Query: 1287 GVTVVSLVLLLIATANSAVVPFRDGILPNGDFELGPKSSDMKGTEIINKMAIPSWEVTGF 1108
            G ++  L +LLIAT  S V+ F DG+LPNGDFELGPK SDMKGT++INK AIPSWE++GF
Sbjct:    3 GGSLSFLFVLLIATITS-VICFSDGMLPNGDFELGPKPSDMKGTQVINKKAIPSWELSGF 61

Query: 1107 VEYISSGHKQGDMLLVVPAGKFAVRLGNEASIKQRI 1000
            VEYI SG KQGDMLLVVPAGKFA+RLGNEASIKQR+
Sbjct:   62 VEYIKSGQKQGDMLLVVPAGKFAIRLGNEASIKQRL 97

>TAIR9_protein||AT4G32460.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        plant-type cell wall; EXPRESSED IN: 24 plant structures; EXPRESSED
        DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Protein of
        unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
        (InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
        unknown protein (TAIR:AT5G11420.1); Has 182 Blast hits to 158 proteins
        in 12 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants
        - 180; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr4:15663036-15664859 REVERSE

          Length = 366

 Score =  463 bits (1190), Expect = 1e-130
 Identities = 221/270 (81%), Positives = 250/270 (92%)
 Frame = -2

Query: 1007 KESKVKGMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAESE 828
            K S  KG YYS+TFSAARTCAQDERLN+SVAP   V+P+QTVYSSSGWDLY+WAF+A+S+
Sbjct:   95 KISVKKGSYYSITFSAARTCAQDERLNVSVAPHHAVMPIQTVYSSSGWDLYSWAFKAQSD 154

Query:  827 VAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVL 648
             A+IVIHNPG EEDPACGPLIDGVAMRAL+PPRPTNKNILKNGGFEEGP VLP  ++GVL
Sbjct:  155 YADIVIHNPGVEEDPACGPLIDGVAMRALFPPRPTNKNILKNGGFEEGPWVLPNISSGVL 214

Query:  647 IPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTIIGK 468
            IPP   DDH+PLPGWMVESLKAVKY+D++HFSVPQGRRA+ELVAGKESA+AQV RTI GK
Sbjct:  215 IPPNSIDDHSPLPGWMVESLKAVKYIDSDHFSVPQGRRAVELVAGKESAVAQVVRTIPGK 274

Query:  467 TYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMF 288
            TYVLSF+VGDA+NAC GSM+VEAFAG+DT+KVPYES+G GGFKR+S+RFVAVS+RTRVMF
Sbjct:  275 TYVLSFSVGDASNACAGSMIVEAFAGKDTIKVPYESKGKGGFKRSSLRFVAVSSRTRVMF 334

Query:  287 YSTFYAMRSDDFSSLCGPVIDDVKLLSVRK 198
            YSTFYAMR+DDFSSLCGPVIDDVKLLS R+
Sbjct:  335 YSTFYAMRNDDFSSLCGPVIDDVKLLSARR 364


 Score =  132 bits (332), Expect = 5e-031
 Identities = 63/94 (67%), Positives = 77/94 (81%)
 Frame = -1

Query: 1269 LVLLLIATANSAVVPFRDGILPNGDFELGPKSSDMKGTEIINKMAIPSWEVTGFVEYISS 1090
            +VLLL+ +       F DG+LPNGDFELGP+ SDMKGT++IN  AIP+WE++GFVEYI S
Sbjct:    7 IVLLLLHSFFYVAFCFNDGLLPNGDFELGPRHSDMKGTQVINITAIPNWELSGFVEYIPS 66

Query: 1089 GHKQGDMLLVVPAGKFAVRLGNEASIKQRIKSER 988
            GHKQGDM+LVVP G FAVRLGNEASIKQ+I  ++
Sbjct:   67 GHKQGDMILVVPKGAFAVRLGNEASIKQKISVKK 100

>TAIR9_protein||AT4G32460.2 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        plant-type cell wall; EXPRESSED IN: 24 plant structures; EXPRESSED
        DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Protein of
        unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
        (InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
        unknown protein (TAIR:AT5G11420.1); Has 182 Blast hits to 158 proteins
        in 12 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants
        - 180; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr4:15663036-15664859 REVERSE

          Length = 366

 Score =  463 bits (1190), Expect = 1e-130
 Identities = 221/270 (81%), Positives = 250/270 (92%)
 Frame = -2

Query: 1007 KESKVKGMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAESE 828
            K S  KG YYS+TFSAARTCAQDERLN+SVAP   V+P+QTVYSSSGWDLY+WAF+A+S+
Sbjct:   95 KISVKKGSYYSITFSAARTCAQDERLNVSVAPHHAVMPIQTVYSSSGWDLYSWAFKAQSD 154

Query:  827 VAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVL 648
             A+IVIHNPG EEDPACGPLIDGVAMRAL+PPRPTNKNILKNGGFEEGP VLP  ++GVL
Sbjct:  155 YADIVIHNPGVEEDPACGPLIDGVAMRALFPPRPTNKNILKNGGFEEGPWVLPNISSGVL 214

Query:  647 IPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTIIGK 468
            IPP   DDH+PLPGWMVESLKAVKY+D++HFSVPQGRRA+ELVAGKESA+AQV RTI GK
Sbjct:  215 IPPNSIDDHSPLPGWMVESLKAVKYIDSDHFSVPQGRRAVELVAGKESAVAQVVRTIPGK 274

Query:  467 TYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMF 288
            TYVLSF+VGDA+NAC GSM+VEAFAG+DT+KVPYES+G GGFKR+S+RFVAVS+RTRVMF
Sbjct:  275 TYVLSFSVGDASNACAGSMIVEAFAGKDTIKVPYESKGKGGFKRSSLRFVAVSSRTRVMF 334

Query:  287 YSTFYAMRSDDFSSLCGPVIDDVKLLSVRK 198
            YSTFYAMR+DDFSSLCGPVIDDVKLLS R+
Sbjct:  335 YSTFYAMRNDDFSSLCGPVIDDVKLLSARR 364


 Score =  132 bits (332), Expect = 5e-031
 Identities = 63/94 (67%), Positives = 77/94 (81%)
 Frame = -1

Query: 1269 LVLLLIATANSAVVPFRDGILPNGDFELGPKSSDMKGTEIINKMAIPSWEVTGFVEYISS 1090
            +VLLL+ +       F DG+LPNGDFELGP+ SDMKGT++IN  AIP+WE++GFVEYI S
Sbjct:    7 IVLLLLHSFFYVAFCFNDGLLPNGDFELGPRHSDMKGTQVINITAIPNWELSGFVEYIPS 66

Query: 1089 GHKQGDMLLVVPAGKFAVRLGNEASIKQRIKSER 988
            GHKQGDM+LVVP G FAVRLGNEASIKQ+I  ++
Sbjct:   67 GHKQGDMILVVPKGAFAVRLGNEASIKQKISVKK 100

>TAIR9_protein||AT1G80240.1 | Symbols:  | unknown protein |
        chr1:30171520-30172799 REVERSE

          Length = 371

 Score =  363 bits (931), Expect = 2e-100
 Identities = 176/272 (64%), Positives = 211/272 (77%)
 Frame = -2

Query: 1013 SNKESKVKGMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAE 834
            S K S + G  YS+TFSAARTCAQDERLNISV  +SGVIP+QT+Y S GWD Y+WAF+A 
Sbjct:   96 SQKISVLPGRLYSITFSAARTCAQDERLNISVTHESGVIPIQTMYGSDGWDSYSWAFKAG 155

Query:  833 SEVAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTG 654
                EI  HNPG EE PACGPLID VA++AL+PPR +  N++KNG FEEGP V P +  G
Sbjct:  156 GPEIEIRFHNPGVEEHPACGPLIDAVAIKALFPPRFSGYNLIKNGNFEEGPYVFPTAKWG 215

Query:  653 VLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTII 474
            VLIPPFIEDD++PLPGWM+ESLKAVKYVD  HF+VP+G RAIELV GKESAI+Q+ RT +
Sbjct:  216 VLIPPFIEDDNSPLPGWMIESLKAVKYVDKAHFAVPEGHRAIELVGGKESAISQIVRTSL 275

Query:  473 GKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRV 294
             K Y L+F VGDA + C+G M+VEAFAG+  + V Y S+G GGF+R  + F AVS RTRV
Sbjct:  276 NKFYALTFNVGDARDGCEGPMIVEAFAGQGKVMVDYASKGKGGFRRGRLVFKAVSARTRV 335

Query:  293 MFYSTFYAMRSDDFSSLCGPVIDDVKLLSVRK 198
             F STFY M+SD   SLCGPVIDDV+L++V K
Sbjct:  336 TFLSTFYHMKSDHSGSLCGPVIDDVRLVAVGK 367


 Score =  111 bits (277), Expect = 1e-024
 Identities = 54/98 (55%), Positives = 69/98 (70%)
 Frame = -1

Query: 1293 MWGVTVVSLVLLLIATANSAVVPFRDGILPNGDFELGPKSSDMKGTEIINKMAIPSWEVT 1114
            M+    + L LL I++      P RDG+LPNG+FELGPK S MKG+ +  + A+P+W + 
Sbjct:    2 MYQEAALLLALLFISSNVVLSAPVRDGLLPNGNFELGPKPSQMKGSVVKERTAVPNWNII 61

Query: 1113 GFVEYISSGHKQGDMLLVVPAGKFAVRLGNEASIKQRI 1000
            GFVE+I SG KQ DM+LVVP G  AVRLGNEASI Q+I
Sbjct:   62 GFVEFIKSGQKQDDMVLVVPQGSSAVRLGNEASISQKI 99

>TAIR9_protein||AT3G08030.1 | Symbols:  | unknown protein | chr3:2564191-2565819
        FORWARD

          Length = 366

 Score =  313 bits (801), Expect = 2e-085
 Identities = 157/265 (59%), Positives = 193/265 (72%)
 Frame = -2

Query: 1013 SNKESKVKGMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAE 834
            S K     G  Y+LTF A+RTCAQDE L +SV   SG +P+QT+Y+S G D+YAWAF A+
Sbjct:   95 SQKLEVKPGSLYALTFGASRTCAQDEVLRVSVPSQSGDLPLQTLYNSFGGDVYAWAFVAK 154

Query:  833 SEVAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTG 654
            +    +  HNPG +EDPACGPL+D VA++ L  P  T  N++KNGGFEEGP  L  ST G
Sbjct:  155 TSQVTVTFHNPGVQEDPACGPLLDAVAIKELVHPIYTRGNLVKNGGFEEGPHRLVNSTQG 214

Query:  653 VLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTII 474
            VL+PP  ED  +PLPGW++ESLKAVK++D+++F+VP G  AIELVAGKESAIAQV RT  
Sbjct:  215 VLLPPKQEDLTSPLPGWIIESLKAVKFIDSKYFNVPFGHAAIELVAGKESAIAQVIRTSP 274

Query:  473 GKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRV 294
            G+TY LSF VGDA N C GSM+VEAFA RDTLKVP+ S G G  K AS +F AV  RTR+
Sbjct:  275 GQTYTLSFVVGDAKNDCHGSMMVEAFAARDTLKVPHTSVGGGHVKTASFKFKAVEARTRI 334

Query:  293 MFYSTFYAMRSDDFSSLCGPVIDDV 219
             F+S FY  +  D  SLCGPVID++
Sbjct:  335 TFFSGFYHTKKTDTVSLCGPVIDEI 359


 Score =  88 bits (216), Expect = 1e-017
 Identities = 45/101 (44%), Positives = 62/101 (61%)
 Frame = -1

Query: 1275 VSLVLLLIATANSAVVPFRDGILPNGDFELGPKSSDMKGTEIINKMAIPSWEVTGFVEYI 1096
            + L +LL+    +   P  +G L NG+FE  PK +DMK T ++ K A+P WE TGFVEYI
Sbjct:    7 IILPILLLICGAALGAPASEGYLRNGNFEESPKKTDMKKTVLLGKNALPEWETTGFVEYI 66

Query: 1095 SSGHKQGDMLLVVPAGKFAVRLGNEASIKQRIKSERNVLLA 973
            + G + G M   V  G  AVRLGNEA+I Q+++ +   L A
Sbjct:   67 AGGPQPGGMYFPVAHGVHAVRLGNEATISQKLEVKPGSLYA 107

>TAIR9_protein||AT3G08030.2 | Symbols:  | unknown protein | chr3:2564517-2565819
        FORWARD

          Length = 324

 Score =  313 bits (801), Expect = 2e-085
 Identities = 157/265 (59%), Positives = 193/265 (72%)
 Frame = -2

Query: 1013 SNKESKVKGMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAE 834
            S K     G  Y+LTF A+RTCAQDE L +SV   SG +P+QT+Y+S G D+YAWAF A+
Sbjct:   53 SQKLEVKPGSLYALTFGASRTCAQDEVLRVSVPSQSGDLPLQTLYNSFGGDVYAWAFVAK 112

Query:  833 SEVAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTG 654
            +    +  HNPG +EDPACGPL+D VA++ L  P  T  N++KNGGFEEGP  L  ST G
Sbjct:  113 TSQVTVTFHNPGVQEDPACGPLLDAVAIKELVHPIYTRGNLVKNGGFEEGPHRLVNSTQG 172

Query:  653 VLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTII 474
            VL+PP  ED  +PLPGW++ESLKAVK++D+++F+VP G  AIELVAGKESAIAQV RT  
Sbjct:  173 VLLPPKQEDLTSPLPGWIIESLKAVKFIDSKYFNVPFGHAAIELVAGKESAIAQVIRTSP 232

Query:  473 GKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRV 294
            G+TY LSF VGDA N C GSM+VEAFA RDTLKVP+ S G G  K AS +F AV  RTR+
Sbjct:  233 GQTYTLSFVVGDAKNDCHGSMMVEAFAARDTLKVPHTSVGGGHVKTASFKFKAVEARTRI 292

Query:  293 MFYSTFYAMRSDDFSSLCGPVIDDV 219
             F+S FY  +  D  SLCGPVID++
Sbjct:  293 TFFSGFYHTKKTDTVSLCGPVIDEI 317

>TAIR9_protein||AT2G41800.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cell
        wall, plant-type cell wall; EXPRESSED IN: 7 plant structures; EXPRESSED
        DURING: 4 anthesis, C globular stage, petal differentiation and
        expansion stage; CONTAINS InterPro DOMAIN/s: Protein of unknown
        function DUF642 (InterPro:IPR006946), Galactose-binding like
        (InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
        unknown protein (TAIR:AT2G41810.1); Has 156 Blast hits to 155 proteins
        in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants
        - 156; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr2:17436671-17438005 REVERSE

          Length = 371

 Score =  312 bits (799), Expect = 3e-085
 Identities = 151/263 (57%), Positives = 188/263 (71%)
 Frame = -2

Query: 989 GMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAESEVAEIVI 810
           G+ YSLTF A RTCAQDE + +SV   +  +P+QTV+SS G D YAWAF+A S+V ++  
Sbjct: 108 GLVYSLTFGATRTCAQDENIKVSVPGQANELPLQTVFSSDGGDTYAWAFKATSDVVKVTF 167

Query: 809 HNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIE 630
           HNPG +ED  CGPL+D VA++ + P R T  N++KNGGFE GP V    +TG+LIP  I+
Sbjct: 168 HNPGVQEDRTCGPLLDVVAIKEILPLRYTRGNLVKNGGFEIGPHVFANFSTGILIPARIQ 227

Query: 629 DDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTIIGKTYVLSF 450
           D  +PLPGW+VESLK VKY+D  HF VP G+ A+ELVAG+ESAIAQ+ RTI GK Y+LSF
Sbjct: 228 DFISPLPGWIVESLKPVKYIDRRHFKVPYGQGAVELVAGRESAIAQIIRTIAGKAYMLSF 287

Query: 449 AVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMFYSTFYA 270
           AVGDA N C GSM+VEAFAGR+  K+ + S G G FK    RFVA S RTR+ FYS FY 
Sbjct: 288 AVGDAQNGCHGSMMVEAFAGREPFKLSFMSEGKGAFKTGHFRFVADSDRTRLTFYSAFYH 347

Query: 269 MRSDDFSSLCGPVIDDVKLLSVR 201
            +  DF  LCGPV+D V +   R
Sbjct: 348 TKLHDFGHLCGPVLDSVVVTLAR 370

>TAIR9_protein||AT2G41810.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        endomembrane system; EXPRESSED IN: root; CONTAINS InterPro DOMAIN/s:
        Protein of unknown function DUF642 (InterPro:IPR006946),
        Galactose-binding like (InterPro:IPR008979); BEST Arabidopsis thaliana
        protein match is: unknown protein (TAIR:AT2G41800.1); Has 161 Blast
        hits to 157 proteins in 12 species: Archae - 0; Bacteria - 2; Metazoa -
        0; Fungi - 0; Plants - 159; Viruses - 0; Other Eukaryotes - 0 (source:
        NCBI BLink). | chr2:17439414-17441296 REVERSE

          Length = 371

 Score =  304 bits (777), Expect = 1e-082
 Identities = 146/263 (55%), Positives = 187/263 (71%), Gaps = 1/263 (0%)
 Frame = -2

Query: 998 KVK-GMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAESEVA 822
           KVK G+ YSLTF   RTCAQDE + ISV   +  +P+QT++S++G D YAWAF+A S++ 
Sbjct: 104 KVKSGLVYSLTFGVTRTCAQDENIRISVPGQTNELPIQTLFSTNGGDTYAWAFKATSDLV 163

Query: 821 EIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIP 642
           ++  +NPG +EDP CGP++D VA++ + P R T  N++KNGGFE GP V    +TG+LIP
Sbjct: 164 KVTFYNPGVQEDPTCGPIVDAVAIKEILPLRYTKGNLVKNGGFETGPHVFSNFSTGILIP 223

Query: 641 PFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTIIGKTY 462
             I+D  +PLPGW+VESLK VKY+D  HF VP G  AIELVAG+ESAIAQ+ RT+ GK Y
Sbjct: 224 AKIQDLISPLPGWIVESLKPVKYIDNRHFKVPSGLAAIELVAGRESAIAQIIRTVSGKNY 283

Query: 461 VLSFAVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMFYS 282
           +LSF VGDA+N C GSM+VEAFAG    KV +ES   G FK     F A S RTR+ FYS
Sbjct: 284 ILSFVVGDAHNGCHGSMMVEAFAGISAFKVTFESNDKGAFKVGRFAFRADSNRTRITFYS 343

Query: 281 TFYAMRSDDFSSLCGPVIDDVKL 213
            FY  +  DF  LCGPV+D+V +
Sbjct: 344 GFYHTKLHDFGHLCGPVLDNVSV 366

>TAIR9_protein||AT2G34510.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN: anchored
        to membrane; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 14
        growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function
        DUF642 (InterPro:IPR006946), Galactose-binding like
        (InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
        unknown protein (TAIR:AT1G29980.1); Has 177 Blast hits to 163 proteins
        in 15 species: Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0;
        Plants - 167; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr2:14544114-14546732 REVERSE

          Length = 402

 Score =  265 bits (675), Expect = 8e-071
 Identities = 128/265 (48%), Positives = 178/265 (67%), Gaps = 5/265 (1%)
 Frame = -2

Query: 992 KGMYYSLTFSAARTCAQDERLNISVAPD-----SGVIPVQTVYSSSGWDLYAWAFQAESE 828
           KG  YS+TFSAARTCAQ E LN+SVA       S  I +QTVYS  GWD YAWAF+A  +
Sbjct: 114 KGSIYSVTFSAARTCAQLESLNVSVASSDEPIASQTIDLQTVYSVQGWDPYAWAFEAVVD 173

Query: 827 VAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVL 648
              +V  NPG E+DP CGP+ID +A++ L+ P     N + NG FEEGP +   +T GVL
Sbjct: 174 RVRLVFKNPGMEDDPTCGPIIDDIAVKKLFTPDKPKGNAVINGDFEEGPWMFRNTTLGVL 233

Query: 647 IPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTIIGK 468
           +P  ++++ + LPGW VES +AV+++D++HFSVP+G+RA+EL++GKE  I+Q+  T    
Sbjct: 234 LPTNLDEEISSLPGWTVESNRAVRFIDSDHFSVPEGKRALELLSGKEGIISQMVETKANI 293

Query: 467 TYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMF 288
            Y +SF++G A + CK  + V AFAG       Y ++    F+R+ + F A + RTR+ F
Sbjct: 294 PYKMSFSLGHAGDKCKEPLAVMAFAGDQAQNFHYMAQANSSFERSELNFTAKAERTRIAF 353

Query: 287 YSTFYAMRSDDFSSLCGPVIDDVKL 213
           YS +Y  R+DD +SLCGPVIDDVK+
Sbjct: 354 YSIYYNTRTDDMTSLCGPVIDDVKV 378


 Score =  83 bits (203), Expect = 4e-016
 Identities = 47/112 (41%), Positives = 64/112 (57%), Gaps = 2/112 (1%)
 Frame = -1

Query: 1317 LSSQCSFTMWGVTVVSLVLLLIATANSA--VVPFRDGILPNGDFELGPKSSDMKGTEIIN 1144
            L S  S+    + ++ L L ++A A+SA    P  DG++ NGDFE  P +       I +
Sbjct:    3 LYSNNSWRSNSILILLLGLSIVAAADSAGKTSPVEDGLVVNGDFETPPSNGFPDDAIIED 62

Query: 1143 KMAIPSWEVTGFVEYISSGHKQGDMLLVVPAGKFAVRLGNEASIKQRIKSER 988
               IPSW   G VE I SG KQG M+L+VP G+ AVRLGN+A I Q +  E+
Sbjct:   63 TSEIPSWRSDGTVELIKSGQKQGGMILIVPEGRHAVRLGNDAEISQELTVEK 114

>TAIR9_protein||AT1G29980.2 | Symbols:  | INVOLVED IN: biological_process
        unknown; LOCATED IN: plasma membrane, anchored to membrane; EXPRESSED
        IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS
        InterPro DOMAIN/s: Protein of unknown function DUF642
        (InterPro:IPR006946), Galactose-binding like (InterPro:IPR008979); BEST
        Arabidopsis thaliana protein match is: unknown protein
        (TAIR:AT2G34510.1); Has 174 Blast hits to 163 proteins in 15 species:
        Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0; Plants - 164;
        Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr1:10503411-10504617 REVERSE

          Length = 372

 Score =  239 bits (608), Expect = 4e-063
 Identities = 107/229 (46%), Positives = 158/229 (68%)
 Frame = -2

Query: 899 IPVQTVYSSSGWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTN 720
           + +QT+YS  GWD YAWAF+AE +   +V  NPG E+DP CGP+ID +A++ L+ P    
Sbjct: 118 VDLQTLYSVQGWDPYAWAFEAEDDHVRLVFKNPGMEDDPTCGPIIDDIAIKKLFTPDKPK 177

Query: 719 KNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQG 540
            N + NG FE+GP +   ++ GVL+P  ++++ + LPGW VES +AV++VD++HFSVP+G
Sbjct: 178 DNAVINGDFEDGPWMFRNTSLGVLLPTNLDEEISSLPGWTVESNRAVRFVDSDHFSVPKG 237

Query: 539 RRAIELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYES 360
           +RA+EL++GKE  I+Q+  T   K Y+LSF++G A + CK  + + AFAG       Y +
Sbjct: 238 KRAVELLSGKEGIISQMVETKADKPYILSFSLGHAGDKCKEPLAIMAFAGDQAQNFHYMA 297

Query: 359 RGTGGFKRASIRFVAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDVKL 213
           +    F++A + F A + RTRV FYS +Y  R+DD SSLCGPVIDDV++
Sbjct: 298 QANSSFEKAGLNFTAKADRTRVAFYSVYYNTRTDDMSSLCGPVIDDVRV 346

>TAIR9_protein||AT1G29980.1 | Symbols:  | INVOLVED IN: biological_process
        unknown; LOCATED IN: plasma membrane, anchored to membrane; EXPRESSED
        IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS
        InterPro DOMAIN/s: Protein of unknown function DUF642
        (InterPro:IPR006946), Galactose-binding like (InterPro:IPR008979); BEST
        Arabidopsis thaliana protein match is: unknown protein
        (TAIR:AT2G34510.1); Has 180 Blast hits to 163 proteins in 15 species:
        Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0; Plants - 170;
        Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr1:10503411-10505994 REVERSE

          Length = 408

 Score =  239 bits (608), Expect = 4e-063
 Identities = 107/229 (46%), Positives = 158/229 (68%)
 Frame = -2

Query: 899 IPVQTVYSSSGWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTN 720
           + +QT+YS  GWD YAWAF+AE +   +V  NPG E+DP CGP+ID +A++ L+ P    
Sbjct: 154 VDLQTLYSVQGWDPYAWAFEAEDDHVRLVFKNPGMEDDPTCGPIIDDIAIKKLFTPDKPK 213

Query: 719 KNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQG 540
            N + NG FE+GP +   ++ GVL+P  ++++ + LPGW VES +AV++VD++HFSVP+G
Sbjct: 214 DNAVINGDFEDGPWMFRNTSLGVLLPTNLDEEISSLPGWTVESNRAVRFVDSDHFSVPKG 273

Query: 539 RRAIELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYES 360
           +RA+EL++GKE  I+Q+  T   K Y+LSF++G A + CK  + + AFAG       Y +
Sbjct: 274 KRAVELLSGKEGIISQMVETKADKPYILSFSLGHAGDKCKEPLAIMAFAGDQAQNFHYMA 333

Query: 359 RGTGGFKRASIRFVAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDVKL 213
           +    F++A + F A + RTRV FYS +Y  R+DD SSLCGPVIDDV++
Sbjct: 334 QANSSFEKAGLNFTAKADRTRVAFYSVYYNTRTDDMSSLCGPVIDDVRV 382

>TAIR9_protein||AT5G14150.1 | Symbols:  | unknown protein | chr5:4565246-4566653
        REVERSE

          Length = 384

 Score =  133 bits (334), Expect = 3e-031
 Identities = 94/286 (32%), Positives = 141/286 (49%), Gaps = 16/286 (5%)
 Frame = -2

Query: 1040 QSGSETKRRSNKESKVKGMYYSLTFS---AARTCAQDERLNISVAPDSGVIPVQTVYSSS 870
            Q G + K      +K   + Y LTF+   A + C     L++S    + V   +  YS  
Sbjct:   74 QLGEDGKINQTFIAKGDELNYILTFALIHAGQNCTSSAGLSVSGPDSNAVFSYRQNYSKV 133

Query:  869 GWDLYAWAFQA--ESEVAEIVIHNPGEEED----PACGPLIDGVAMRALYPPRPTNK-NI 711
             W  Y+    +    E   +V+ +   + D      C P+ID + ++ +      +  N+
Sbjct:  134 SWQSYSHNLGSWGNGEPINLVLESQAIDSDSDTNSTCWPIIDTLLIKTVGVTLVQDSGNL 193

Query:  710 LKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRA 531
            L NGGFE GP  LP ST GVLI        +PL  W V  +  V+Y+D+EHF VP+G+ A
Sbjct:  194 LINGGFESGPGFLPNSTDGVLIDAVPSLIQSPLRQWSV--IGTVRYIDSEHFHVPEGKAA 251

Query:  530 IELVAGKESAIAQVAR--TIIGKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYESR 357
            IE+++    +  Q A   T  G  Y L+F +GDAN+AC+G  VV A AG  T     ES 
Sbjct:  252 IEILSNTAPSGIQTATKGTSEGSRYNLTFTLGDANDACRGHFVVGAQAGSVTQNFTLESN 311

Query:  356 GTGGFKRASIRFVAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDV 219
            GTG  ++  + F A     ++ F  T Y++     + +CGPVID+V
Sbjct:  312 GTGSGEKFGLVFEADKDAAQISF--TSYSVTMTKENVVCGPVIDEV 355

  Database: TAIR9 protein
    Posted date:  Wed Jul 08 15:16:08 2009
  Number of letters in database: 13,468,323
  Number of sequences in database:  33,410

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,660,019,295
Number of Sequences: 33410
Number of Extensions: 9660019295
Number of Successful Extensions: 322770282
Number of sequences better than 0.0: 0