TAIR BLAST output for UN15621


BLASTX 7.6.2

Query= UN15621 /QuerySize=1394
        (1393 letters)

Database: TAIR9 protein;
          33,410 sequences; 13,468,323 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

TAIR9_protein||AT5G25460.1 | Symbols:  | unknown protein | chr5:...    683   1e-196
TAIR9_protein||AT5G11420.1 | Symbols:  | FUNCTIONS IN: molecular...    648   4e-186
TAIR9_protein||AT4G32460.1 | Symbols:  | FUNCTIONS IN: molecular...    589   2e-168
TAIR9_protein||AT4G32460.2 | Symbols:  | FUNCTIONS IN: molecular...    589   2e-168
TAIR9_protein||AT1G80240.1 | Symbols:  | unknown protein | chr1:...    472   3e-133
TAIR9_protein||AT3G08030.1 | Symbols:  | unknown protein | chr3:...    394   7e-110
TAIR9_protein||AT2G41800.1 | Symbols:  | FUNCTIONS IN: molecular...    389   2e-108
TAIR9_protein||AT2G41810.1 | Symbols:  | FUNCTIONS IN: molecular...    377   8e-105
TAIR9_protein||AT3G08030.2 | Symbols:  | unknown protein | chr3:...    371   6e-103
TAIR9_protein||AT2G34510.1 | Symbols:  | FUNCTIONS IN: molecular...    340   1e-093
TAIR9_protein||AT1G29980.2 | Symbols:  | INVOLVED IN: biological...    240   2e-063
TAIR9_protein||AT1G29980.1 | Symbols:  | INVOLVED IN: biological...    240   2e-063
TAIR9_protein||AT5G14150.1 | Symbols:  | unknown protein | chr5:...    149   6e-036

>TAIR9_protein||AT5G25460.1 | Symbols:  | unknown protein | chr5:8863430-8865394
        FORWARD

          Length = 370

 Score =  683 bits (1760), Expect = 1e-196
 Identities = 337/368 (91%), Positives = 355/368 (96%), Gaps = 2/368 (0%)
 Frame = +2

Query:   62 MWGVTVVSLVLLLIATANSA--VVPFRDGILPNGDFELGPKPSDLKGTEIINKMAIPNWE 235
            M GVTVVS  LL IATA +A   V FRDG+LPNGDFELGPKPSD+KGTEI+NK+AIPNWE
Sbjct:    1 MEGVTVVSFFLLFIATAMAAKSTVSFRDGMLPNGDFELGPKPSDMKGTEILNKLAIPNWE 60

Query:  236 VTGFVEYISSGHKQGDMLLVVPAGKFAVRLGNEASIKQRIKVVKGMYYSLTFSAARTCAQ 415
            VTGFVEYI SGHKQGDMLLVVPAGKFAVRLGNEASIKQR+KVVKGMYYSLTFSAARTCAQ
Sbjct:   61 VTGFVEYIKSGHKQGDMLLVVPAGKFAVRLGNEASIKQRLKVVKGMYYSLTFSAARTCAQ 120

Query:  416 DERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLID 595
            DERLNISVAPDSGVIP+QTVYSSSGWDLYAWAFQAES+VAE+VIHNPG EEDPACGPLID
Sbjct:  121 DERLNISVAPDSGVIPIQTVYSSSGWDLYAWAFQAESDVAEVVIHNPGVEEDPACGPLID 180

Query:  596 GVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKA 775
            GVAMR+LYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDH+PLPGWMVESLKA
Sbjct:  181 GVAMRSLYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHSPLPGWMVESLKA 240

Query:  776 VKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVE 955
            VKYVD EHFSVPQGRRAIELVAGKESAIAQV RT+IGKTYVLSFAVGDANNACKGSMVVE
Sbjct:  241 VKYVDVEHFSVPQGRRAIELVAGKESAIAQVVRTVIGKTYVLSFAVGDANNACKGSMVVE 300

Query:  956 AFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDD 1135
            AFAG+DTLKVPYES+GTGGFKRASIRFVAVSTR+R+MFYSTFY+MRSDDFSSLCGPVIDD
Sbjct:  301 AFAGKDTLKVPYESKGTGGFKRASIRFVAVSTRSRIMFYSTFYAMRSDDFSSLCGPVIDD 360

Query: 1136 VKLISVRQ 1159
            VKLISVR+
Sbjct:  361 VKLISVRK 368

>TAIR9_protein||AT5G11420.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cell
        wall, plant-type cell wall; EXPRESSED IN: 23 plant structures;
        EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein
        of unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
        (InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
        unknown protein (TAIR:AT5G25460.1); Has 185 Blast hits to 157 proteins
        in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants
        - 185; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr5:3644655-3646991 FORWARD

          Length = 367

 Score =  648 bits (1669), Expect = 4e-186
 Identities = 315/364 (86%), Positives = 343/364 (94%), Gaps = 1/364 (0%)
 Frame = +2

Query:   68 GVTVVSLVLLLIATANSAVVPFRDGILPNGDFELGPKPSDLKGTEIINKMAIPNWEVTGF 247
            G ++  L +LLIAT  S V+ F DG+LPNGDFELGPKPSD+KGT++INK AIP+WE++GF
Sbjct:    3 GGSLSFLFVLLIATITS-VICFSDGMLPNGDFELGPKPSDMKGTQVINKKAIPSWELSGF 61

Query:  248 VEYISSGHKQGDMLLVVPAGKFAVRLGNEASIKQRIKVVKGMYYSLTFSAARTCAQDERL 427
            VEYI SG KQGDMLLVVPAGKFA+RLGNEASIKQR+ V KGMYYSLTFSAARTCAQDERL
Sbjct:   62 VEYIKSGQKQGDMLLVVPAGKFAIRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDERL 121

Query:  428 NISVAPDSGVIPVQTVYSSSGWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLIDGVAM 607
            NISVAPDSGVIP+QTVYSSSGWDLYAWAFQAES VAEIVIHNPGEEEDPACGPLIDGVA+
Sbjct:  122 NISVAPDSGVIPIQTVYSSSGWDLYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVAI 181

Query:  608 RALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVKYV 787
            +ALYPPRPTNKNILKNGGFEEGP VLP +TTGVL+PPFIEDDH+PLP WMVESLKA+KYV
Sbjct:  182 KALYPPRPTNKNILKNGGFEEGPYVLPNATTGVLVPPFIEDDHSPLPAWMVESLKAIKYV 241

Query:  788 DTEHFSVPQGRRAIELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVEAFAG 967
            D EHFSVPQGRRA+ELVAGKESAIAQVART++GKTYVLSFAVGDANNAC+GSMVVEAFAG
Sbjct:  242 DVEHFSVPQGRRAVELVAGKESAIAQVARTVVGKTYVLSFAVGDANNACQGSMVVEAFAG 301

Query:  968 RDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDVKLI 1147
            +DTLKVPYESRG GGFKRAS+RFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDVKL+
Sbjct:  302 KDTLKVPYESRGKGGFKRASLRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDVKLL 361

Query: 1148 SVRQ 1159
            S R+
Sbjct:  362 SARK 365

>TAIR9_protein||AT4G32460.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        plant-type cell wall; EXPRESSED IN: 24 plant structures; EXPRESSED
        DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Protein of
        unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
        (InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
        unknown protein (TAIR:AT5G11420.1); Has 182 Blast hits to 158 proteins
        in 12 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants
        - 180; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr4:15663036-15664859 REVERSE

          Length = 366

 Score =  589 bits (1516), Expect = 2e-168
 Identities = 281/358 (78%), Positives = 324/358 (90%)
 Frame = +2

Query:   86 LVLLLIATANSAVVPFRDGILPNGDFELGPKPSDLKGTEIINKMAIPNWEVTGFVEYISS 265
            +VLLL+ +       F DG+LPNGDFELGP+ SD+KGT++IN  AIPNWE++GFVEYI S
Sbjct:    7 IVLLLLHSFFYVAFCFNDGLLPNGDFELGPRHSDMKGTQVINITAIPNWELSGFVEYIPS 66

Query:  266 GHKQGDMLLVVPAGKFAVRLGNEASIKQRIKVVKGMYYSLTFSAARTCAQDERLNISVAP 445
            GHKQGDM+LVVP G FAVRLGNEASIKQ+I V KG YYS+TFSAARTCAQDERLN+SVAP
Sbjct:   67 GHKQGDMILVVPKGAFAVRLGNEASIKQKISVKKGSYYSITFSAARTCAQDERLNVSVAP 126

Query:  446 DSGVIPVQTVYSSSGWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLIDGVAMRALYPP 625
               V+P+QTVYSSSGWDLY+WAF+A+S+ A+IVIHNPG EEDPACGPLIDGVAMRAL+PP
Sbjct:  127 HHAVMPIQTVYSSSGWDLYSWAFKAQSDYADIVIHNPGVEEDPACGPLIDGVAMRALFPP 186

Query:  626 RPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFS 805
            RPTNKNILKNGGFEEGP VLP  ++GVLIPP   DDH+PLPGWMVESLKAVKY+D++HFS
Sbjct:  187 RPTNKNILKNGGFEEGPWVLPNISSGVLIPPNSIDDHSPLPGWMVESLKAVKYIDSDHFS 246

Query:  806 VPQGRRAIELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKV 985
            VPQGRRA+ELVAGKESA+AQV RTI GKTYVLSF+VGDA+NAC GSM+VEAFAG+DT+KV
Sbjct:  247 VPQGRRAVELVAGKESAVAQVVRTIPGKTYVLSFSVGDASNACAGSMIVEAFAGKDTIKV 306

Query:  986 PYESRGTGGFKRASIRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDVKLISVRQ 1159
            PYES+G GGFKR+S+RFVAVS+RTRVMFYSTFY+MR+DDFSSLCGPVIDDVKL+S R+
Sbjct:  307 PYESKGKGGFKRSSLRFVAVSSRTRVMFYSTFYAMRNDDFSSLCGPVIDDVKLLSARR 364

>TAIR9_protein||AT4G32460.2 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        plant-type cell wall; EXPRESSED IN: 24 plant structures; EXPRESSED
        DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Protein of
        unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
        (InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
        unknown protein (TAIR:AT5G11420.1); Has 182 Blast hits to 158 proteins
        in 12 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants
        - 180; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr4:15663036-15664859 REVERSE

          Length = 366

 Score =  589 bits (1516), Expect = 2e-168
 Identities = 281/358 (78%), Positives = 324/358 (90%)
 Frame = +2

Query:   86 LVLLLIATANSAVVPFRDGILPNGDFELGPKPSDLKGTEIINKMAIPNWEVTGFVEYISS 265
            +VLLL+ +       F DG+LPNGDFELGP+ SD+KGT++IN  AIPNWE++GFVEYI S
Sbjct:    7 IVLLLLHSFFYVAFCFNDGLLPNGDFELGPRHSDMKGTQVINITAIPNWELSGFVEYIPS 66

Query:  266 GHKQGDMLLVVPAGKFAVRLGNEASIKQRIKVVKGMYYSLTFSAARTCAQDERLNISVAP 445
            GHKQGDM+LVVP G FAVRLGNEASIKQ+I V KG YYS+TFSAARTCAQDERLN+SVAP
Sbjct:   67 GHKQGDMILVVPKGAFAVRLGNEASIKQKISVKKGSYYSITFSAARTCAQDERLNVSVAP 126

Query:  446 DSGVIPVQTVYSSSGWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLIDGVAMRALYPP 625
               V+P+QTVYSSSGWDLY+WAF+A+S+ A+IVIHNPG EEDPACGPLIDGVAMRAL+PP
Sbjct:  127 HHAVMPIQTVYSSSGWDLYSWAFKAQSDYADIVIHNPGVEEDPACGPLIDGVAMRALFPP 186

Query:  626 RPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFS 805
            RPTNKNILKNGGFEEGP VLP  ++GVLIPP   DDH+PLPGWMVESLKAVKY+D++HFS
Sbjct:  187 RPTNKNILKNGGFEEGPWVLPNISSGVLIPPNSIDDHSPLPGWMVESLKAVKYIDSDHFS 246

Query:  806 VPQGRRAIELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKV 985
            VPQGRRA+ELVAGKESA+AQV RTI GKTYVLSF+VGDA+NAC GSM+VEAFAG+DT+KV
Sbjct:  247 VPQGRRAVELVAGKESAVAQVVRTIPGKTYVLSFSVGDASNACAGSMIVEAFAGKDTIKV 306

Query:  986 PYESRGTGGFKRASIRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDVKLISVRQ 1159
            PYES+G GGFKR+S+RFVAVS+RTRVMFYSTFY+MR+DDFSSLCGPVIDDVKL+S R+
Sbjct:  307 PYESKGKGGFKRSSLRFVAVSSRTRVMFYSTFYAMRNDDFSSLCGPVIDDVKLLSARR 364

>TAIR9_protein||AT1G80240.1 | Symbols:  | unknown protein |
        chr1:30171520-30172799 REVERSE

          Length = 371

 Score =  472 bits (1214), Expect = 3e-133
 Identities = 228/364 (62%), Positives = 278/364 (76%)
 Frame = +2

Query:   62 MWGVTVVSLVLLLIATANSAVVPFRDGILPNGDFELGPKPSDLKGTEIINKMAIPNWEVT 241
            M+    + L LL I++      P RDG+LPNG+FELGPKPS +KG+ +  + A+PNW + 
Sbjct:    2 MYQEAALLLALLFISSNVVLSAPVRDGLLPNGNFELGPKPSQMKGSVVKERTAVPNWNII 61

Query:  242 GFVEYISSGHKQGDMLLVVPAGKFAVRLGNEASIKQRIKVVKGMYYSLTFSAARTCAQDE 421
            GFVE+I SG KQ DM+LVVP G  AVRLGNEASI Q+I V+ G  YS+TFSAARTCAQDE
Sbjct:   62 GFVEFIKSGQKQDDMVLVVPQGSSAVRLGNEASISQKISVLPGRLYSITFSAARTCAQDE 121

Query:  422 RLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLIDGV 601
            RLNISV  +SGVIP+QT+Y S GWD Y+WAF+A     EI  HNPG EE PACGPLID V
Sbjct:  122 RLNISVTHESGVIPIQTMYGSDGWDSYSWAFKAGGPEIEIRFHNPGVEEHPACGPLIDAV 181

Query:  602 AMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVK 781
            A++AL+PPR +  N++KNG FEEGP V P +  GVLIPPFIEDD++PLPGWM+ESLKAVK
Sbjct:  182 AIKALFPPRFSGYNLIKNGNFEEGPYVFPTAKWGVLIPPFIEDDNSPLPGWMIESLKAVK 241

Query:  782 YVDTEHFSVPQGRRAIELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVEAF 961
            YVD  HF+VP+G RAIELV GKESAI+Q+ RT + K Y L+F VGDA + C+G M+VEAF
Sbjct:  242 YVDKAHFAVPEGHRAIELVGGKESAISQIVRTSLNKFYALTFNVGDARDGCEGPMIVEAF 301

Query:  962 AGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDVK 1141
            AG+  + V Y S+G GGF+R  + F AVS RTRV F STFY M+SD   SLCGPVIDDV+
Sbjct:  302 AGQGKVMVDYASKGKGGFRRGRLVFKAVSARTRVTFLSTFYHMKSDHSGSLCGPVIDDVR 361

Query: 1142 LISV 1153
            L++V
Sbjct:  362 LVAV 365

>TAIR9_protein||AT3G08030.1 | Symbols:  | unknown protein | chr3:2564191-2565819
        FORWARD

          Length = 366

 Score =  394 bits (1012), Expect = 7e-110
 Identities = 198/353 (56%), Positives = 251/353 (71%)
 Frame = +2

Query:   80 VSLVLLLIATANSAVVPFRDGILPNGDFELGPKPSDLKGTEIINKMAIPNWEVTGFVEYI 259
            + L +LL+    +   P  +G L NG+FE  PK +D+K T ++ K A+P WE TGFVEYI
Sbjct:    7 IILPILLLICGAALGAPASEGYLRNGNFEESPKKTDMKKTVLLGKNALPEWETTGFVEYI 66

Query:  260 SSGHKQGDMLLVVPAGKFAVRLGNEASIKQRIKVVKGMYYSLTFSAARTCAQDERLNISV 439
            + G + G M   V  G  AVRLGNEA+I Q+++V  G  Y+LTF A+RTCAQDE L +SV
Sbjct:   67 AGGPQPGGMYFPVAHGVHAVRLGNEATISQKLEVKPGSLYALTFGASRTCAQDEVLRVSV 126

Query:  440 APDSGVIPVQTVYSSSGWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLIDGVAMRALY 619
               SG +P+QT+Y+S G D+YAWAF A++    +  HNPG +EDPACGPL+D VA++ L 
Sbjct:  127 PSQSGDLPLQTLYNSFGGDVYAWAFVAKTSQVTVTFHNPGVQEDPACGPLLDAVAIKELV 186

Query:  620 PPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVKYVDTEH 799
             P  T  N++KNGGFEEGP  L  ST GVL+PP  ED  +PLPGW++ESLKAVK++D+++
Sbjct:  187 HPIYTRGNLVKNGGFEEGPHRLVNSTQGVLLPPKQEDLTSPLPGWIIESLKAVKFIDSKY 246

Query:  800 FSVPQGRRAIELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVEAFAGRDTL 979
            F+VP G  AIELVAGKESAIAQV RT  G+TY LSF VGDA N C GSM+VEAFA RDTL
Sbjct:  247 FNVPFGHAAIELVAGKESAIAQVIRTSPGQTYTLSFVVGDAKNDCHGSMMVEAFAARDTL 306

Query:  980 KVPYESRGTGGFKRASIRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDV 1138
            KVP+ S G G  K AS +F AV  RTR+ F+S FY  +  D  SLCGPVID++
Sbjct:  307 KVPHTSVGGGHVKTASFKFKAVEARTRITFFSGFYHTKKTDTVSLCGPVIDEI 359

>TAIR9_protein||AT2G41800.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cell
        wall, plant-type cell wall; EXPRESSED IN: 7 plant structures; EXPRESSED
        DURING: 4 anthesis, C globular stage, petal differentiation and
        expansion stage; CONTAINS InterPro DOMAIN/s: Protein of unknown
        function DUF642 (InterPro:IPR006946), Galactose-binding like
        (InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
        unknown protein (TAIR:AT2G41810.1); Has 156 Blast hits to 155 proteins
        in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants
        - 156; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr2:17436671-17438005 REVERSE

          Length = 371

 Score =  389 bits (999), Expect = 2e-108
 Identities = 189/344 (54%), Positives = 240/344 (69%)
 Frame = +2

Query:  125 VPFRDGILPNGDFELGPKPSDLKGTEIINKMAIPNWEVTGFVEYISSGHKQGDMLLVVPA 304
            VP  DGILPNG+FE+ P  S++KG +II   ++P+WE+ G VE +S G + G     VP 
Sbjct:   27 VPHLDGILPNGNFEITPLKSNMKGRQIIGANSLPHWEIAGHVELVSGGPQPGGFYFPVPR 86

Query:  305 GKFAVRLGNEASIKQRIKVVKGMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSS 484
            G  AVRLGN  +I Q ++V  G+ YSLTF A RTCAQDE + +SV   +  +P+QTV+SS
Sbjct:   87 GVHAVRLGNLGTISQNVRVKSGLVYSLTFGATRTCAQDENIKVSVPGQANELPLQTVFSS 146

Query:  485 SGWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGF 664
             G D YAWAF+A S+V ++  HNPG +ED  CGPL+D VA++ + P R T  N++KNGGF
Sbjct:  147 DGGDTYAWAFKATSDVVKVTFHNPGVQEDRTCGPLLDVVAIKEILPLRYTRGNLVKNGGF 206

Query:  665 EEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAG 844
            E GP V    +TG+LIP  I+D  +PLPGW+VESLK VKY+D  HF VP G+ A+ELVAG
Sbjct:  207 EIGPHVFANFSTGILIPARIQDFISPLPGWIVESLKPVKYIDRRHFKVPYGQGAVELVAG 266

Query:  845 KESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRA 1024
            +ESAIAQ+ RTI GK Y+LSFAVGDA N C GSM+VEAFAGR+  K+ + S G G FK  
Sbjct:  267 RESAIAQIIRTIAGKAYMLSFAVGDAQNGCHGSMMVEAFAGREPFKLSFMSEGKGAFKTG 326

Query: 1025 SIRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDVKLISVR 1156
              RFVA S RTR+ FYS FY  +  DF  LCGPV+D V +   R
Sbjct:  327 HFRFVADSDRTRLTFYSAFYHTKLHDFGHLCGPVLDSVVVTLAR 370

>TAIR9_protein||AT2G41810.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        endomembrane system; EXPRESSED IN: root; CONTAINS InterPro DOMAIN/s:
        Protein of unknown function DUF642 (InterPro:IPR006946),
        Galactose-binding like (InterPro:IPR008979); BEST Arabidopsis thaliana
        protein match is: unknown protein (TAIR:AT2G41800.1); Has 161 Blast
        hits to 157 proteins in 12 species: Archae - 0; Bacteria - 2; Metazoa -
        0; Fungi - 0; Plants - 159; Viruses - 0; Other Eukaryotes - 0 (source:
        NCBI BLink). | chr2:17439414-17441296 REVERSE

          Length = 371

 Score =  377 bits (968), Expect = 8e-105
 Identities = 180/339 (53%), Positives = 235/339 (69%)
 Frame = +2

Query:  128 PFRDGILPNGDFELGPKPSDLKGTEIINKMAIPNWEVTGFVEYISSGHKQGDMLLVVPAG 307
            P  DG+LPNG+FE  P  S+++  +II K ++P+WE++G VE +S G + G     VP G
Sbjct:   28 PHLDGLLPNGNFEQIPNKSNMRKRQIIGKYSLPHWEISGHVELVSGGPQPGGFYFAVPRG 87

Query:  308 KFAVRLGNEASIKQRIKVVKGMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSSS 487
              A RLGN ASI Q +KV  G+ YSLTF   RTCAQDE + ISV   +  +P+QT++S++
Sbjct:   88 VHAARLGNLASISQYVKVKSGLVYSLTFGVTRTCAQDENIRISVPGQTNELPIQTLFSTN 147

Query:  488 GWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFE 667
            G D YAWAF+A S++ ++  +NPG +EDP CGP++D VA++ + P R T  N++KNGGFE
Sbjct:  148 GGDTYAWAFKATSDLVKVTFYNPGVQEDPTCGPIVDAVAIKEILPLRYTKGNLVKNGGFE 207

Query:  668 EGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGK 847
             GP V    +TG+LIP  I+D  +PLPGW+VESLK VKY+D  HF VP G  AIELVAG+
Sbjct:  208 TGPHVFSNFSTGILIPAKIQDLISPLPGWIVESLKPVKYIDNRHFKVPSGLAAIELVAGR 267

Query:  848 ESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRAS 1027
            ESAIAQ+ RT+ GK Y+LSF VGDA+N C GSM+VEAFAG    KV +ES   G FK   
Sbjct:  268 ESAIAQIIRTVSGKNYILSFVVGDAHNGCHGSMMVEAFAGISAFKVTFESNDKGAFKVGR 327

Query: 1028 IRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDVKL 1144
              F A S RTR+ FYS FY  +  DF  LCGPV+D+V +
Sbjct:  328 FAFRADSNRTRITFYSGFYHTKLHDFGHLCGPVLDNVSV 366

>TAIR9_protein||AT3G08030.2 | Symbols:  | unknown protein | chr3:2564517-2565819
        FORWARD

          Length = 324

 Score =  371 bits (952), Expect = 6e-103
 Identities = 185/317 (58%), Positives = 231/317 (72%)
 Frame = +2

Query:  188 LKGTEIINKMAIPNWEVTGFVEYISSGHKQGDMLLVVPAGKFAVRLGNEASIKQRIKVVK 367
            +K T ++ K A+P WE TGFVEYI+ G + G M   V  G  AVRLGNEA+I Q+++V  
Sbjct:    1 MKKTVLLGKNALPEWETTGFVEYIAGGPQPGGMYFPVAHGVHAVRLGNEATISQKLEVKP 60

Query:  368 GMYYSLTFSAARTCAQDERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQAESEVAEIVI 547
            G  Y+LTF A+RTCAQDE L +SV   SG +P+QT+Y+S G D+YAWAF A++    +  
Sbjct:   61 GSLYALTFGASRTCAQDEVLRVSVPSQSGDLPLQTLYNSFGGDVYAWAFVAKTSQVTVTF 120

Query:  548 HNPGEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIE 727
            HNPG +EDPACGPL+D VA++ L  P  T  N++KNGGFEEGP  L  ST GVL+PP  E
Sbjct:  121 HNPGVQEDPACGPLLDAVAIKELVHPIYTRGNLVKNGGFEEGPHRLVNSTQGVLLPPKQE 180

Query:  728 DDHTPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTIIGKTYVLSF 907
            D  +PLPGW++ESLKAVK++D+++F+VP G  AIELVAGKESAIAQV RT  G+TY LSF
Sbjct:  181 DLTSPLPGWIIESLKAVKFIDSKYFNVPFGHAAIELVAGKESAIAQVIRTSPGQTYTLSF 240

Query:  908 AVGDANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMFYSTFYS 1087
             VGDA N C GSM+VEAFA RDTLKVP+ S G G  K AS +F AV  RTR+ F+S FY 
Sbjct:  241 VVGDAKNDCHGSMMVEAFAARDTLKVPHTSVGGGHVKTASFKFKAVEARTRITFFSGFYH 300

Query: 1088 MRSDDFSSLCGPVIDDV 1138
             +  D  SLCGPVID++
Sbjct:  301 TKKTDTVSLCGPVIDEI 317

>TAIR9_protein||AT2G34510.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN: anchored
        to membrane; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 14
        growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function
        DUF642 (InterPro:IPR006946), Galactose-binding like
        (InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
        unknown protein (TAIR:AT1G29980.1); Has 177 Blast hits to 163 proteins
        in 15 species: Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0;
        Plants - 167; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr2:14544114-14546732 REVERSE

          Length = 402

 Score =  340 bits (871), Expect = 1e-093
 Identities = 174/376 (46%), Positives = 241/376 (64%), Gaps = 7/376 (1%)
 Frame = +2

Query:   38 LSSQCSFTMWGVTVVSLVLLLIATANSA--VVPFRDGILPNGDFELGPKPSDLKGTEIIN 211
            L S  S+    + ++ L L ++A A+SA    P  DG++ NGDFE  P         I +
Sbjct:    3 LYSNNSWRSNSILILLLGLSIVAAADSAGKTSPVEDGLVVNGDFETPPSNGFPDDAIIED 62

Query:  212 KMAIPNWEVTGFVEYISSGHKQGDMLLVVPAGKFAVRLGNEASIKQRIKVVKGMYYSLTF 391
               IP+W   G VE I SG KQG M+L+VP G+ AVRLGN+A I Q + V KG  YS+TF
Sbjct:   63 TSEIPSWRSDGTVELIKSGQKQGGMILIVPEGRHAVRLGNDAEISQELTVEKGSIYSVTF 122

Query:  392 SAARTCAQDERLNISVAPD-----SGVIPVQTVYSSSGWDLYAWAFQAESEVAEIVIHNP 556
            SAARTCAQ E LN+SVA       S  I +QTVYS  GWD YAWAF+A  +   +V  NP
Sbjct:  123 SAARTCAQLESLNVSVASSDEPIASQTIDLQTVYSVQGWDPYAWAFEAVVDRVRLVFKNP 182

Query:  557 GEEEDPACGPLIDGVAMRALYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDH 736
            G E+DP CGP+ID +A++ L+ P     N + NG FEEGP +   +T GVL+P  ++++ 
Sbjct:  183 GMEDDPTCGPIIDDIAVKKLFTPDKPKGNAVINGDFEEGPWMFRNTTLGVLLPTNLDEEI 242

Query:  737 TPLPGWMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVARTIIGKTYVLSFAVG 916
            + LPGW VES +AV+++D++HFSVP+G+RA+EL++GKE  I+Q+  T     Y +SF++G
Sbjct:  243 SSLPGWTVESNRAVRFIDSDHFSVPEGKRALELLSGKEGIISQMVETKANIPYKMSFSLG 302

Query:  917 DANNACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMFYSTFYSMRS 1096
             A + CK  + V AFAG       Y ++    F+R+ + F A + RTR+ FYS +Y+ R+
Sbjct:  303 HAGDKCKEPLAVMAFAGDQAQNFHYMAQANSSFERSELNFTAKAERTRIAFYSIYYNTRT 362

Query: 1097 DDFSSLCGPVIDDVKL 1144
            DD +SLCGPVIDDVK+
Sbjct:  363 DDMTSLCGPVIDDVKV 378

>TAIR9_protein||AT1G29980.2 | Symbols:  | INVOLVED IN: biological_process
        unknown; LOCATED IN: plasma membrane, anchored to membrane; EXPRESSED
        IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS
        InterPro DOMAIN/s: Protein of unknown function DUF642
        (InterPro:IPR006946), Galactose-binding like (InterPro:IPR008979); BEST
        Arabidopsis thaliana protein match is: unknown protein
        (TAIR:AT2G34510.1); Has 174 Blast hits to 163 proteins in 15 species:
        Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0; Plants - 164;
        Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr1:10503411-10504617 REVERSE

          Length = 372

 Score =  240 bits (611), Expect = 2e-063
 Identities = 107/229 (46%), Positives = 159/229 (69%)
 Frame = +2

Query:  458 IPVQTVYSSSGWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTN 637
            + +QT+YS  GWD YAWAF+AE +   +V  NPG E+DP CGP+ID +A++ L+ P    
Sbjct:  118 VDLQTLYSVQGWDPYAWAFEAEDDHVRLVFKNPGMEDDPTCGPIIDDIAIKKLFTPDKPK 177

Query:  638 KNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQG 817
             N + NG FE+GP +   ++ GVL+P  ++++ + LPGW VES +AV++VD++HFSVP+G
Sbjct:  178 DNAVINGDFEDGPWMFRNTSLGVLLPTNLDEEISSLPGWTVESNRAVRFVDSDHFSVPKG 237

Query:  818 RRAIELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYES 997
            +RA+EL++GKE  I+Q+  T   K Y+LSF++G A + CK  + + AFAG       Y +
Sbjct:  238 KRAVELLSGKEGIISQMVETKADKPYILSFSLGHAGDKCKEPLAIMAFAGDQAQNFHYMA 297

Query:  998 RGTGGFKRASIRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDVKL 1144
            +    F++A + F A + RTRV FYS +Y+ R+DD SSLCGPVIDDV++
Sbjct:  298 QANSSFEKAGLNFTAKADRTRVAFYSVYYNTRTDDMSSLCGPVIDDVRV 346


 Score =  103 bits (256), Expect = 3e-022
 Identities = 53/101 (52%), Positives = 66/101 (65%)
 Frame = +2

Query: 140 GILPNGDFELGPKPSDLKGTEIINKMAIPNWEVTGFVEYISSGHKQGDMLLVVPAGKFAV 319
           G++ NGDFE  P               IP+W+  G VE I+SG KQG M+L+VP G+ AV
Sbjct:   3 GLVINGDFETSPSSGFPDDGVTDGPSDIPSWKSNGTVELINSGQKQGGMILIVPQGRHAV 62

Query: 320 RLGNEASIKQRIKVVKGMYYSLTFSAARTCAQDERLNISVA 442
           RLGN+A I Q + V KG  YS+TFSAARTCAQ E +N+SVA
Sbjct:  63 RLGNDAEISQDLTVEKGFVYSVTFSAARTCAQLESINVSVA 103

>TAIR9_protein||AT1G29980.1 | Symbols:  | INVOLVED IN: biological_process
        unknown; LOCATED IN: plasma membrane, anchored to membrane; EXPRESSED
        IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS
        InterPro DOMAIN/s: Protein of unknown function DUF642
        (InterPro:IPR006946), Galactose-binding like (InterPro:IPR008979); BEST
        Arabidopsis thaliana protein match is: unknown protein
        (TAIR:AT2G34510.1); Has 180 Blast hits to 163 proteins in 15 species:
        Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0; Plants - 170;
        Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr1:10503411-10505994 REVERSE

          Length = 408

 Score =  240 bits (611), Expect = 2e-063
 Identities = 107/229 (46%), Positives = 159/229 (69%)
 Frame = +2

Query:  458 IPVQTVYSSSGWDLYAWAFQAESEVAEIVIHNPGEEEDPACGPLIDGVAMRALYPPRPTN 637
            + +QT+YS  GWD YAWAF+AE +   +V  NPG E+DP CGP+ID +A++ L+ P    
Sbjct:  154 VDLQTLYSVQGWDPYAWAFEAEDDHVRLVFKNPGMEDDPTCGPIIDDIAIKKLFTPDKPK 213

Query:  638 KNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPGWMVESLKAVKYVDTEHFSVPQG 817
             N + NG FE+GP +   ++ GVL+P  ++++ + LPGW VES +AV++VD++HFSVP+G
Sbjct:  214 DNAVINGDFEDGPWMFRNTSLGVLLPTNLDEEISSLPGWTVESNRAVRFVDSDHFSVPKG 273

Query:  818 RRAIELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACKGSMVVEAFAGRDTLKVPYES 997
            +RA+EL++GKE  I+Q+  T   K Y+LSF++G A + CK  + + AFAG       Y +
Sbjct:  274 KRAVELLSGKEGIISQMVETKADKPYILSFSLGHAGDKCKEPLAIMAFAGDQAQNFHYMA 333

Query:  998 RGTGGFKRASIRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDVKL 1144
            +    F++A + F A + RTRV FYS +Y+ R+DD SSLCGPVIDDV++
Sbjct:  334 QANSSFEKAGLNFTAKADRTRVAFYSVYYNTRTDDMSSLCGPVIDDVRV 382


 Score =  111 bits (276), Expect = 1e-024
 Identities = 60/135 (44%), Positives = 85/135 (62%), Gaps = 1/135 (0%)
 Frame = +2

Query:  41 SSQCSFTMWGVTVVSL-VLLLIATANSAVVPFRDGILPNGDFELGPKPSDLKGTEIINKM 217
           +++C ++   + ++S+ V +L+A A+       DG++ NGDFE  P              
Sbjct:   5 NNRCKWSSIFLFLLSVSVAVLVAVADDKSPAVEDGLVINGDFETSPSSGFPDDGVTDGPS 64

Query: 218 AIPNWEVTGFVEYISSGHKQGDMLLVVPAGKFAVRLGNEASIKQRIKVVKGMYYSLTFSA 397
            IP+W+  G VE I+SG KQG M+L+VP G+ AVRLGN+A I Q + V KG  YS+TFSA
Sbjct:  65 DIPSWKSNGTVELINSGQKQGGMILIVPQGRHAVRLGNDAEISQDLTVEKGFVYSVTFSA 124

Query: 398 ARTCAQDERLNISVA 442
           ARTCAQ E +N+SVA
Sbjct: 125 ARTCAQLESINVSVA 139

>TAIR9_protein||AT5G14150.1 | Symbols:  | unknown protein | chr5:4565246-4566653
        REVERSE

          Length = 384

 Score =  149 bits (374), Expect = 6e-036
 Identities = 117/371 (31%), Positives = 173/371 (46%), Gaps = 41/371 (11%)
 Frame = +2

Query:   80 VSLVLLLIATANSAVVPFRDGILPNGDFELG----PKPSDLKGTEIINKMAIPNWEVTGF 247
            + L+LL+   A+S         L N DFE      P  S+     +     +P W   G 
Sbjct:    8 IFLLLLVSCCASS-------DFLENPDFESPPLNLPTNSNASSVSLDQNSTLPGWTFQGT 60

Query:  248 VEYISSGHKQGDMLLVVPAGKFAVRLGNEASIKQRIKVVKG--MYYSLTFS---AARTCA 412
            V Y+            +P    AV+LG +  I Q   + KG  + Y LTF+   A + C 
Sbjct:   61 VLYVE-----------LPDTGHAVQLGEDGKINQTF-IAKGDELNYILTFALIHAGQNCT 108

Query:  413 QDERLNISVAPDSGVIPVQTVYSSSGWDLYAWAFQA--ESEVAEIVIHNPGEEED----P 574
                L++S    + V   +  YS   W  Y+    +    E   +V+ +   + D     
Sbjct:  109 SSAGLSVSGPDSNAVFSYRQNYSKVSWQSYSHNLGSWGNGEPINLVLESQAIDSDSDTNS 168

Query:  575 ACGPLIDGVAMRALYPPRPTNK-NILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHTPLPG 751
             C P+ID + ++ +      +  N+L NGGFE GP  LP ST GVLI        +PL  
Sbjct:  169 TCWPIIDTLLIKTVGVTLVQDSGNLLINGGFESGPGFLPNSTDGVLIDAVPSLIQSPLRQ 228

Query:  752 WMVESLKAVKYVDTEHFSVPQGRRAIELVAGKESAIAQVAR--TIIGKTYVLSFAVGDAN 925
            W V  +  V+Y+D+EHF VP+G+ AIE+++    +  Q A   T  G  Y L+F +GDAN
Sbjct:  229 WSV--IGTVRYIDSEHFHVPEGKAAIEILSNTAPSGIQTATKGTSEGSRYNLTFTLGDAN 286

Query:  926 NACKGSMVVEAFAGRDTLKVPYESRGTGGFKRASIRFVAVSTRTRVMFYSTFYSMRSDDF 1105
            +AC+G  VV A AG  T     ES GTG  ++  + F A     ++ F  T YS+     
Sbjct:  287 DACRGHFVVGAQAGSVTQNFTLESNGTGSGEKFGLVFEADKDAAQISF--TSYSVTMTKE 344

Query: 1106 SSLCGPVIDDV 1138
            + +CGPVID+V
Sbjct:  345 NVVCGPVIDEV 355

  Database: TAIR9 protein
    Posted date:  Wed Jul 08 15:16:08 2009
  Number of letters in database: 13,468,323
  Number of sequences in database:  33,410

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,188,306,669
Number of Sequences: 33410
Number of Extensions: 8188306669
Number of Successful Extensions: 285327727
Number of sequences better than 0.0: 0
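
Note on the scores: each alignment reports a bit score followed by a raw score in parentheses, e.g. "683 bits (1760)". The sketch below shows how the two appear to be related through the gapped Lambda and K values printed above, using the standard Karlin-Altschul conversion S' = (Lambda*S - ln K) / ln 2. It is an illustration only, not the program's own code: the function name bit_score is hypothetical, and because Lambda and K are printed to three decimals, the results are only expected to match the report after rounding.

```python
import math

# Gapped Karlin-Altschul parameters as printed in the report footer.
LAMBDA = 0.267
K = 0.041


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score S to a bit score.

    Uses the standard relation S' = (lambda * S - ln K) / ln 2.
    The function name and defaults are illustrative assumptions.
    """
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores taken from the alignments for AT5G25460.1, AT4G32460.1
# and AT5G14150.1 in the report above.
for raw in (1760, 1516, 374):
    print(raw, round(bit_score(raw)))
```

Bit scores are normalized for the scoring system, so they can be compared across searches that use a different matrix or gap penalties, whereas raw scores cannot.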