Library    |     Search    |     Batch query    |     SNP    |     SSR  

TAIR blast output of UN17103


BLASTX 7.6.2

Query= UN17103 /QuerySize=1392
        (1391 letters)

Database: TAIR9 protein;
          33,410 sequences; 13,468,323 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

TAIR9_protein||AT5G11420.1 | Symbols:  | FUNCTIONS IN: molecular...    703   1e-202
TAIR9_protein||AT5G25460.1 | Symbols:  | unknown protein | chr5:...    652   2e-187
TAIR9_protein||AT4G32460.1 | Symbols:  | FUNCTIONS IN: molecular...    594   6e-170
TAIR9_protein||AT4G32460.2 | Symbols:  | FUNCTIONS IN: molecular...    594   6e-170
TAIR9_protein||AT1G80240.1 | Symbols:  | unknown protein | chr1:...    478   5e-135
TAIR9_protein||AT3G08030.1 | Symbols:  | unknown protein | chr3:...    399   2e-111
TAIR9_protein||AT2G41800.1 | Symbols:  | FUNCTIONS IN: molecular...    392   4e-109
TAIR9_protein||AT3G08030.2 | Symbols:  | unknown protein | chr3:...    381   8e-106
TAIR9_protein||AT2G41810.1 | Symbols:  | FUNCTIONS IN: molecular...    376   2e-104
TAIR9_protein||AT2G34510.1 | Symbols:  | FUNCTIONS IN: molecular...    340   1e-093
TAIR9_protein||AT1G29980.1 | Symbols:  | INVOLVED IN: biological...    339   3e-093
TAIR9_protein||AT1G29980.2 | Symbols:  | INVOLVED IN: biological...    242   4e-064
TAIR9_protein||AT5G14150.1 | Symbols:  | unknown protein | chr5:...    153   4e-037

>TAIR9_protein||AT5G11420.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cell
        wall, plant-type cell wall; EXPRESSED IN: 23 plant structures;
        EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein
        of unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
        (InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
        unknown protein (TAIR:AT5G25460.1); Has 185 Blast hits to 157 proteins
        in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants
        - 185; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr5:3644655-3646991 FORWARD

          Length = 367

 Score =  703 bits (1812), Expect = 1e-202
 Identities = 342/365 (93%), Positives = 360/365 (98%)
 Frame = -3

Query: 1344 MKGGTGSFLCVLLISTVTSVVCFRDGMLPNGDFELGPKPSDMKGTQVLNKNAIPSWELTG 1165
            MKGG+ SFL VLLI+T+TSV+CF DGMLPNGDFELGPKPSDMKGTQV+NK AIPSWEL+G
Sbjct:    1 MKGGSLSFLFVLLIATITSVICFSDGMLPNGDFELGPKPSDMKGTQVINKKAIPSWELSG 60

Query: 1164 FVEYIKSGQKQGDMLLVVPAGKFAIRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDER 985
            FVEYIKSGQKQGDMLLVVPAGKFAIRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDER
Sbjct:   61 FVEYIKSGQKQGDMLLVVPAGKFAIRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDER 120

Query:  984 LNISVAPDSGIIPIQTVYSSSGWDLYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVA 805
            LNISVAPDSG+IPIQTVYSSSGWDLYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVA
Sbjct:  121 LNISVAPDSGVIPIQTVYSSSGWDLYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVA 180

Query:  804 IKALYPPRPTNKNILKNGGFEEGPYLLPNSTTGVLIPPFIEDDHSPLPAWMVESLKAVKY 625
            IKALYPPRPTNKNILKNGGFEEGPY+LPN+TTGVL+PPFIEDDHSPLPAWMVESLKA+KY
Sbjct:  181 IKALYPPRPTNKNILKNGGFEEGPYVLPNATTGVLVPPFIEDDHSPLPAWMVESLKAIKY 240

Query:  624 VDVEHFSVPQGRRAVELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACEGSMIVEAFA 445
            VDVEHFSVPQGRRAVELVAGKESAIAQVART++GKTYVLSFAVGDANNAC+GSM+VEAFA
Sbjct:  241 VDVEHFSVPQGRRAVELVAGKESAIAQVARTVVGKTYVLSFAVGDANNACQGSMVVEAFA 300

Query:  444 GRDTLKVPYESKGKGGFKRASLRFVAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDVKL 265
            G+DTLKVPYES+GKGGFKRASLRFVAVSTRTRVMFYSTFY+MRSDDFSSLCGPVIDDVKL
Sbjct:  301 GKDTLKVPYESRGKGGFKRASLRFVAVSTRTRVMFYSTFYSMRSDDFSSLCGPVIDDVKL 360

Query:  264 LSVRK 250
            LS RK
Sbjct:  361 LSARK 365

>TAIR9_protein||AT5G25460.1 | Symbols:  | unknown protein | chr5:8863430-8865394
        FORWARD

          Length = 370

 Score =  652 bits (1680), Expect = 2e-187
 Identities = 320/368 (86%), Positives = 344/368 (93%), Gaps = 3/368 (0%)
 Frame = -3

Query: 1344 MKGGTGSFLCVLLIST---VTSVVCFRDGMLPNGDFELGPKPSDMKGTQVLNKNAIPSWE 1174
            M+G T     +L I+T     S V FRDGMLPNGDFELGPKPSDMKGT++LNK AIP+WE
Sbjct:    1 MEGVTVVSFFLLFIATAMAAKSTVSFRDGMLPNGDFELGPKPSDMKGTEILNKLAIPNWE 60

Query: 1173 LTGFVEYIKSGQKQGDMLLVVPAGKFAIRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQ 994
            +TGFVEYIKSG KQGDMLLVVPAGKFA+RLGNEASIKQRL V KGMYYSLTFSAARTCAQ
Sbjct:   61 VTGFVEYIKSGHKQGDMLLVVPAGKFAVRLGNEASIKQRLKVVKGMYYSLTFSAARTCAQ 120

Query:  993 DERLNISVAPDSGIIPIQTVYSSSGWDLYAWAFQAESNVAEIVIHNPGEEEDPACGPLID 814
            DERLNISVAPDSG+IPIQTVYSSSGWDLYAWAFQAES+VAE+VIHNPG EEDPACGPLID
Sbjct:  121 DERLNISVAPDSGVIPIQTVYSSSGWDLYAWAFQAESDVAEVVIHNPGVEEDPACGPLID 180

Query:  813 GVAIKALYPPRPTNKNILKNGGFEEGPYLLPNSTTGVLIPPFIEDDHSPLPAWMVESLKA 634
            GVA+++LYPPRPTNKNILKNGGFEEGP +LP STTGVLIPPFIEDDHSPLP WMVESLKA
Sbjct:  181 GVAMRSLYPPRPTNKNILKNGGFEEGPLVLPGSTTGVLIPPFIEDDHSPLPGWMVESLKA 240

Query:  633 VKYVDVEHFSVPQGRRAVELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACEGSMIVE 454
            VKYVDVEHFSVPQGRRA+ELVAGKESAIAQV RT+IGKTYVLSFAVGDANNAC+GSM+VE
Sbjct:  241 VKYVDVEHFSVPQGRRAIELVAGKESAIAQVVRTVIGKTYVLSFAVGDANNACKGSMVVE 300

Query:  453 AFAGRDTLKVPYESKGKGGFKRASLRFVAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDD 274
            AFAG+DTLKVPYESKG GGFKRAS+RFVAVSTR+R+MFYSTFYAMRSDDFSSLCGPVIDD
Sbjct:  301 AFAGKDTLKVPYESKGTGGFKRASIRFVAVSTRSRIMFYSTFYAMRSDDFSSLCGPVIDD 360

Query:  273 VKLLSVRK 250
            VKL+SVRK
Sbjct:  361 VKLISVRK 368

>TAIR9_protein||AT4G32460.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        plant-type cell wall; EXPRESSED IN: 24 plant structures; EXPRESSED
        DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Protein of
        unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
        (InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
        unknown protein (TAIR:AT5G11420.1); Has 182 Blast hits to 158 proteins
        in 12 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants
        - 180; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr4:15663036-15664859 REVERSE

          Length = 366

 Score =  594 bits (1530), Expect = 6e-170
 Identities = 286/360 (79%), Positives = 325/360 (90%)
 Frame = -3

Query: 1329 GSFLCVLLISTVTSVVCFRDGMLPNGDFELGPKPSDMKGTQVLNKNAIPSWELTGFVEYI 1150
            G  + +LL S      CF DG+LPNGDFELGP+ SDMKGTQV+N  AIP+WEL+GFVEYI
Sbjct:    5 GVIVLLLLHSFFYVAFCFNDGLLPNGDFELGPRHSDMKGTQVINITAIPNWELSGFVEYI 64

Query: 1149 KSGQKQGDMLLVVPAGKFAIRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDERLNISV 970
             SG KQGDM+LVVP G FA+RLGNEASIKQ+++V KG YYS+TFSAARTCAQDERLN+SV
Sbjct:   65 PSGHKQGDMILVVPKGAFAVRLGNEASIKQKISVKKGSYYSITFSAARTCAQDERLNVSV 124

Query:  969 APDSGIIPIQTVYSSSGWDLYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVAIKALY 790
            AP   ++PIQTVYSSSGWDLY+WAF+A+S+ A+IVIHNPG EEDPACGPLIDGVA++AL+
Sbjct:  125 APHHAVMPIQTVYSSSGWDLYSWAFKAQSDYADIVIHNPGVEEDPACGPLIDGVAMRALF 184

Query:  789 PPRPTNKNILKNGGFEEGPYLLPNSTTGVLIPPFIEDDHSPLPAWMVESLKAVKYVDVEH 610
            PPRPTNKNILKNGGFEEGP++LPN ++GVLIPP   DDHSPLP WMVESLKAVKY+D +H
Sbjct:  185 PPRPTNKNILKNGGFEEGPWVLPNISSGVLIPPNSIDDHSPLPGWMVESLKAVKYIDSDH 244

Query:  609 FSVPQGRRAVELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACEGSMIVEAFAGRDTL 430
            FSVPQGRRAVELVAGKESA+AQV RTI GKTYVLSF+VGDA+NAC GSMIVEAFAG+DT+
Sbjct:  245 FSVPQGRRAVELVAGKESAVAQVVRTIPGKTYVLSFSVGDASNACAGSMIVEAFAGKDTI 304

Query:  429 KVPYESKGKGGFKRASLRFVAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDVKLLSVRK 250
            KVPYESKGKGGFKR+SLRFVAVS+RTRVMFYSTFYAMR+DDFSSLCGPVIDDVKLLS R+
Sbjct:  305 KVPYESKGKGGFKRSSLRFVAVSSRTRVMFYSTFYAMRNDDFSSLCGPVIDDVKLLSARR 364

>TAIR9_protein||AT4G32460.2 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        plant-type cell wall; EXPRESSED IN: 24 plant structures; EXPRESSED
        DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Protein of
        unknown function DUF642 (InterPro:IPR006946), Galactose-binding like
        (InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
        unknown protein (TAIR:AT5G11420.1); Has 182 Blast hits to 158 proteins
        in 12 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants
        - 180; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr4:15663036-15664859 REVERSE

          Length = 366

 Score =  594 bits (1530), Expect = 6e-170
 Identities = 286/360 (79%), Positives = 325/360 (90%)
 Frame = -3

Query: 1329 GSFLCVLLISTVTSVVCFRDGMLPNGDFELGPKPSDMKGTQVLNKNAIPSWELTGFVEYI 1150
            G  + +LL S      CF DG+LPNGDFELGP+ SDMKGTQV+N  AIP+WEL+GFVEYI
Sbjct:    5 GVIVLLLLHSFFYVAFCFNDGLLPNGDFELGPRHSDMKGTQVINITAIPNWELSGFVEYI 64

Query: 1149 KSGQKQGDMLLVVPAGKFAIRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDERLNISV 970
             SG KQGDM+LVVP G FA+RLGNEASIKQ+++V KG YYS+TFSAARTCAQDERLN+SV
Sbjct:   65 PSGHKQGDMILVVPKGAFAVRLGNEASIKQKISVKKGSYYSITFSAARTCAQDERLNVSV 124

Query:  969 APDSGIIPIQTVYSSSGWDLYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVAIKALY 790
            AP   ++PIQTVYSSSGWDLY+WAF+A+S+ A+IVIHNPG EEDPACGPLIDGVA++AL+
Sbjct:  125 APHHAVMPIQTVYSSSGWDLYSWAFKAQSDYADIVIHNPGVEEDPACGPLIDGVAMRALF 184

Query:  789 PPRPTNKNILKNGGFEEGPYLLPNSTTGVLIPPFIEDDHSPLPAWMVESLKAVKYVDVEH 610
            PPRPTNKNILKNGGFEEGP++LPN ++GVLIPP   DDHSPLP WMVESLKAVKY+D +H
Sbjct:  185 PPRPTNKNILKNGGFEEGPWVLPNISSGVLIPPNSIDDHSPLPGWMVESLKAVKYIDSDH 244

Query:  609 FSVPQGRRAVELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACEGSMIVEAFAGRDTL 430
            FSVPQGRRAVELVAGKESA+AQV RTI GKTYVLSF+VGDA+NAC GSMIVEAFAG+DT+
Sbjct:  245 FSVPQGRRAVELVAGKESAVAQVVRTIPGKTYVLSFSVGDASNACAGSMIVEAFAGKDTI 304

Query:  429 KVPYESKGKGGFKRASLRFVAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDVKLLSVRK 250
            KVPYESKGKGGFKR+SLRFVAVS+RTRVMFYSTFYAMR+DDFSSLCGPVIDDVKLLS R+
Sbjct:  305 KVPYESKGKGGFKRSSLRFVAVSSRTRVMFYSTFYAMRNDDFSSLCGPVIDDVKLLSARR 364

>TAIR9_protein||AT1G80240.1 | Symbols:  | unknown protein |
        chr1:30171520-30172799 REVERSE

          Length = 371

 Score =  478 bits (1229), Expect = 5e-135
 Identities = 235/358 (65%), Positives = 279/358 (77%), Gaps = 1/358 (0%)
 Frame = -3

Query: 1320 LCVLLIST-VTSVVCFRDGMLPNGDFELGPKPSDMKGTQVLNKNAIPSWELTGFVEYIKS 1144
            L +L IS+ V      RDG+LPNG+FELGPKPS MKG+ V  + A+P+W + GFVE+IKS
Sbjct:   10 LALLFISSNVVLSAPVRDGLLPNGNFELGPKPSQMKGSVVKERTAVPNWNIIGFVEFIKS 69

Query: 1143 GQKQGDMLLVVPAGKFAIRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDERLNISVAP 964
            GQKQ DM+LVVP G  A+RLGNEASI Q+++V  G  YS+TFSAARTCAQDERLNISV  
Sbjct:   70 GQKQDDMVLVVPQGSSAVRLGNEASISQKISVLPGRLYSITFSAARTCAQDERLNISVTH 129

Query:  963 DSGIIPIQTVYSSSGWDLYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVAIKALYPP 784
            +SG+IPIQT+Y S GWD Y+WAF+A     EI  HNPG EE PACGPLID VAIKAL+PP
Sbjct:  130 ESGVIPIQTMYGSDGWDSYSWAFKAGGPEIEIRFHNPGVEEHPACGPLIDAVAIKALFPP 189

Query:  783 RPTNKNILKNGGFEEGPYLLPNSTTGVLIPPFIEDDHSPLPAWMVESLKAVKYVDVEHFS 604
            R +  N++KNG FEEGPY+ P +  GVLIPPFIEDD+SPLP WM+ESLKAVKYVD  HF+
Sbjct:  190 RFSGYNLIKNGNFEEGPYVFPTAKWGVLIPPFIEDDNSPLPGWMIESLKAVKYVDKAHFA 249

Query:  603 VPQGRRAVELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACEGSMIVEAFAGRDTLKV 424
            VP+G RA+ELV GKESAI+Q+ RT + K Y L+F VGDA + CEG MIVEAFAG+  + V
Sbjct:  250 VPEGHRAIELVGGKESAISQIVRTSLNKFYALTFNVGDARDGCEGPMIVEAFAGQGKVMV 309

Query:  423 PYESKGKGGFKRASLRFVAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDVKLLSVRK 250
             Y SKGKGGF+R  L F AVS RTRV F STFY M+SD   SLCGPVIDDV+L++V K
Sbjct:  310 DYASKGKGGFRRGRLVFKAVSARTRVTFLSTFYHMKSDHSGSLCGPVIDDVRLVAVGK 367

>TAIR9_protein||AT3G08030.1 | Symbols:  | unknown protein | chr3:2564191-2565819
        FORWARD

          Length = 366

 Score =  399 bits (1025), Expect = 2e-111
 Identities = 199/334 (59%), Positives = 243/334 (72%)
 Frame = -3

Query: 1272 DGMLPNGDFELGPKPSDMKGTQVLNKNAIPSWELTGFVEYIKSGQKQGDMLLVVPAGKFA 1093
            +G L NG+FE  PK +DMK T +L KNA+P WE TGFVEYI  G + G M   V  G  A
Sbjct:   26 EGYLRNGNFEESPKKTDMKKTVLLGKNALPEWETTGFVEYIAGGPQPGGMYFPVAHGVHA 85

Query: 1092 IRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDERLNISVAPDSGIIPIQTVYSSSGWD 913
            +RLGNEA+I Q+L V  G  Y+LTF A+RTCAQDE L +SV   SG +P+QT+Y+S G D
Sbjct:   86 VRLGNEATISQKLEVKPGSLYALTFGASRTCAQDEVLRVSVPSQSGDLPLQTLYNSFGGD 145

Query:  912 LYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVAIKALYPPRPTNKNILKNGGFEEGP 733
            +YAWAF A+++   +  HNPG +EDPACGPL+D VAIK L  P  T  N++KNGGFEEGP
Sbjct:  146 VYAWAFVAKTSQVTVTFHNPGVQEDPACGPLLDAVAIKELVHPIYTRGNLVKNGGFEEGP 205

Query:  732 YLLPNSTTGVLIPPFIEDDHSPLPAWMVESLKAVKYVDVEHFSVPQGRRAVELVAGKESA 553
            + L NST GVL+PP  ED  SPLP W++ESLKAVK++D ++F+VP G  A+ELVAGKESA
Sbjct:  206 HRLVNSTQGVLLPPKQEDLTSPLPGWIIESLKAVKFIDSKYFNVPFGHAAIELVAGKESA 265

Query:  552 IAQVARTIIGKTYVLSFAVGDANNACEGSMIVEAFAGRDTLKVPYESKGKGGFKRASLRF 373
            IAQV RT  G+TY LSF VGDA N C GSM+VEAFA RDTLKVP+ S G G  K AS +F
Sbjct:  266 IAQVIRTSPGQTYTLSFVVGDAKNDCHGSMMVEAFAARDTLKVPHTSVGGGHVKTASFKF 325

Query:  372 VAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDV 271
             AV  RTR+ F+S FY  +  D  SLCGPVID++
Sbjct:  326 KAVEARTRITFFSGFYHTKKTDTVSLCGPVIDEI 359

>TAIR9_protein||AT2G41800.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cell
        wall, plant-type cell wall; EXPRESSED IN: 7 plant structures; EXPRESSED
        DURING: 4 anthesis, C globular stage, petal differentiation and
        expansion stage; CONTAINS InterPro DOMAIN/s: Protein of unknown
        function DUF642 (InterPro:IPR006946), Galactose-binding like
        (InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
        unknown protein (TAIR:AT2G41810.1); Has 156 Blast hits to 155 proteins
        in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants
        - 156; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr2:17436671-17438005 REVERSE

          Length = 371

 Score =  392 bits (1005), Expect = 4e-109
 Identities = 189/340 (55%), Positives = 239/340 (70%)
 Frame = -3

Query: 1272 DGMLPNGDFELGPKPSDMKGTQVLNKNAIPSWELTGFVEYIKSGQKQGDMLLVVPAGKFA 1093
            DG+LPNG+FE+ P  S+MKG Q++  N++P WE+ G VE +  G + G     VP G  A
Sbjct:   31 DGILPNGNFEITPLKSNMKGRQIIGANSLPHWEIAGHVELVSGGPQPGGFYFPVPRGVHA 90

Query: 1092 IRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDERLNISVAPDSGIIPIQTVYSSSGWD 913
            +RLGN  +I Q + V  G+ YSLTF A RTCAQDE + +SV   +  +P+QTV+SS G D
Sbjct:   91 VRLGNLGTISQNVRVKSGLVYSLTFGATRTCAQDENIKVSVPGQANELPLQTVFSSDGGD 150

Query:  912 LYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVAIKALYPPRPTNKNILKNGGFEEGP 733
             YAWAF+A S+V ++  HNPG +ED  CGPL+D VAIK + P R T  N++KNGGFE GP
Sbjct:  151 TYAWAFKATSDVVKVTFHNPGVQEDRTCGPLLDVVAIKEILPLRYTRGNLVKNGGFEIGP 210

Query:  732 YLLPNSTTGVLIPPFIEDDHSPLPAWMVESLKAVKYVDVEHFSVPQGRRAVELVAGKESA 553
            ++  N +TG+LIP  I+D  SPLP W+VESLK VKY+D  HF VP G+ AVELVAG+ESA
Sbjct:  211 HVFANFSTGILIPARIQDFISPLPGWIVESLKPVKYIDRRHFKVPYGQGAVELVAGRESA 270

Query:  552 IAQVARTIIGKTYVLSFAVGDANNACEGSMIVEAFAGRDTLKVPYESKGKGGFKRASLRF 373
            IAQ+ RTI GK Y+LSFAVGDA N C GSM+VEAFAGR+  K+ + S+GKG FK    RF
Sbjct:  271 IAQIIRTIAGKAYMLSFAVGDAQNGCHGSMMVEAFAGREPFKLSFMSEGKGAFKTGHFRF 330

Query:  372 VAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDVKLLSVR 253
            VA S RTR+ FYS FY  +  DF  LCGPV+D V +   R
Sbjct:  331 VADSDRTRLTFYSAFYHTKLHDFGHLCGPVLDSVVVTLAR 370

>TAIR9_protein||AT3G08030.2 | Symbols:  | unknown protein | chr3:2564517-2565819
        FORWARD

          Length = 324

 Score =  381 bits (977), Expect = 8e-106
 Identities = 190/317 (59%), Positives = 231/317 (72%)
 Frame = -3

Query: 1221 MKGTQVLNKNAIPSWELTGFVEYIKSGQKQGDMLLVVPAGKFAIRLGNEASIKQRLNVTK 1042
            MK T +L KNA+P WE TGFVEYI  G + G M   V  G  A+RLGNEA+I Q+L V  
Sbjct:    1 MKKTVLLGKNALPEWETTGFVEYIAGGPQPGGMYFPVAHGVHAVRLGNEATISQKLEVKP 60

Query: 1041 GMYYSLTFSAARTCAQDERLNISVAPDSGIIPIQTVYSSSGWDLYAWAFQAESNVAEIVI 862
            G  Y+LTF A+RTCAQDE L +SV   SG +P+QT+Y+S G D+YAWAF A+++   +  
Sbjct:   61 GSLYALTFGASRTCAQDEVLRVSVPSQSGDLPLQTLYNSFGGDVYAWAFVAKTSQVTVTF 120

Query:  861 HNPGEEEDPACGPLIDGVAIKALYPPRPTNKNILKNGGFEEGPYLLPNSTTGVLIPPFIE 682
            HNPG +EDPACGPL+D VAIK L  P  T  N++KNGGFEEGP+ L NST GVL+PP  E
Sbjct:  121 HNPGVQEDPACGPLLDAVAIKELVHPIYTRGNLVKNGGFEEGPHRLVNSTQGVLLPPKQE 180

Query:  681 DDHSPLPAWMVESLKAVKYVDVEHFSVPQGRRAVELVAGKESAIAQVARTIIGKTYVLSF 502
            D  SPLP W++ESLKAVK++D ++F+VP G  A+ELVAGKESAIAQV RT  G+TY LSF
Sbjct:  181 DLTSPLPGWIIESLKAVKFIDSKYFNVPFGHAAIELVAGKESAIAQVIRTSPGQTYTLSF 240

Query:  501 AVGDANNACEGSMIVEAFAGRDTLKVPYESKGKGGFKRASLRFVAVSTRTRVMFYSTFYA 322
             VGDA N C GSM+VEAFA RDTLKVP+ S G G  K AS +F AV  RTR+ F+S FY 
Sbjct:  241 VVGDAKNDCHGSMMVEAFAARDTLKVPHTSVGGGHVKTASFKFKAVEARTRITFFSGFYH 300

Query:  321 MRSDDFSSLCGPVIDDV 271
             +  D  SLCGPVID++
Sbjct:  301 TKKTDTVSLCGPVIDEI 317

>TAIR9_protein||AT2G41810.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        endomembrane system; EXPRESSED IN: root; CONTAINS InterPro DOMAIN/s:
        Protein of unknown function DUF642 (InterPro:IPR006946),
        Galactose-binding like (InterPro:IPR008979); BEST Arabidopsis thaliana
        protein match is: unknown protein (TAIR:AT2G41800.1); Has 161 Blast
        hits to 157 proteins in 12 species: Archae - 0; Bacteria - 2; Metazoa -
        0; Fungi - 0; Plants - 159; Viruses - 0; Other Eukaryotes - 0 (source:
        NCBI BLink). | chr2:17439414-17441296 REVERSE

          Length = 371

 Score =  376 bits (965), Expect = 2e-104
 Identities = 180/336 (53%), Positives = 233/336 (69%)
 Frame = -3

Query: 1272 DGMLPNGDFELGPKPSDMKGTQVLNKNAIPSWELTGFVEYIKSGQKQGDMLLVVPAGKFA 1093
            DG+LPNG+FE  P  S+M+  Q++ K ++P WE++G VE +  G + G     VP G  A
Sbjct:   31 DGLLPNGNFEQIPNKSNMRKRQIIGKYSLPHWEISGHVELVSGGPQPGGFYFAVPRGVHA 90

Query: 1092 IRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDERLNISVAPDSGIIPIQTVYSSSGWD 913
             RLGN ASI Q + V  G+ YSLTF   RTCAQDE + ISV   +  +PIQT++S++G D
Sbjct:   91 ARLGNLASISQYVKVKSGLVYSLTFGVTRTCAQDENIRISVPGQTNELPIQTLFSTNGGD 150

Query:  912 LYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVAIKALYPPRPTNKNILKNGGFEEGP 733
             YAWAF+A S++ ++  +NPG +EDP CGP++D VAIK + P R T  N++KNGGFE GP
Sbjct:  151 TYAWAFKATSDLVKVTFYNPGVQEDPTCGPIVDAVAIKEILPLRYTKGNLVKNGGFETGP 210

Query:  732 YLLPNSTTGVLIPPFIEDDHSPLPAWMVESLKAVKYVDVEHFSVPQGRRAVELVAGKESA 553
            ++  N +TG+LIP  I+D  SPLP W+VESLK VKY+D  HF VP G  A+ELVAG+ESA
Sbjct:  211 HVFSNFSTGILIPAKIQDLISPLPGWIVESLKPVKYIDNRHFKVPSGLAAIELVAGRESA 270

Query:  552 IAQVARTIIGKTYVLSFAVGDANNACEGSMIVEAFAGRDTLKVPYESKGKGGFKRASLRF 373
            IAQ+ RT+ GK Y+LSF VGDA+N C GSM+VEAFAG    KV +ES  KG FK     F
Sbjct:  271 IAQIIRTVSGKNYILSFVVGDAHNGCHGSMMVEAFAGISAFKVTFESNDKGAFKVGRFAF 330

Query:  372 VAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDVKL 265
             A S RTR+ FYS FY  +  DF  LCGPV+D+V +
Sbjct:  331 RADSNRTRITFYSGFYHTKLHDFGHLCGPVLDNVSV 366

>TAIR9_protein||AT2G34510.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN: anchored
        to membrane; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 14
        growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function
        DUF642 (InterPro:IPR006946), Galactose-binding like
        (InterPro:IPR008979); BEST Arabidopsis thaliana protein match is:
        unknown protein (TAIR:AT1G29980.1); Has 177 Blast hits to 163 proteins
        in 15 species: Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0;
        Plants - 167; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr2:14544114-14546732 REVERSE

          Length = 402

 Score =  340 bits (871), Expect = 1e-093
 Identities = 174/358 (48%), Positives = 233/358 (65%), Gaps = 8/358 (2%)
 Frame = -3

Query: 1272 DGMLPNGDFELGPKPSDMKGTQVLNKNAIPSWELTGFVEYIKSGQKQGDMLLVVPAGKFA 1093
            DG++ NGDFE  P         + + + IPSW   G VE IKSGQKQG M+L+VP G+ A
Sbjct:   38 DGLVVNGDFETPPSNGFPDDAIIEDTSEIPSWRSDGTVELIKSGQKQGGMILIVPEGRHA 97

Query: 1092 IRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDERLNISVAPD-----SGIIPIQTVYS 928
            +RLGN+A I Q L V KG  YS+TFSAARTCAQ E LN+SVA       S  I +QTVYS
Sbjct:   98 VRLGNDAEISQELTVEKGSIYSVTFSAARTCAQLESLNVSVASSDEPIASQTIDLQTVYS 157

Query:  927 SSGWDLYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVAIKALYPPRPTNKNILKNGG 748
              GWD YAWAF+A  +   +V  NPG E+DP CGP+ID +A+K L+ P     N + NG 
Sbjct:  158 VQGWDPYAWAFEAVVDRVRLVFKNPGMEDDPTCGPIIDDIAVKKLFTPDKPKGNAVINGD 217

Query:  747 FEEGPYLLPNSTTGVLIPPFIEDDHSPLPAWMVESLKAVKYVDVEHFSVPQGRRAVELVA 568
            FEEGP++  N+T GVL+P  ++++ S LP W VES +AV+++D +HFSVP+G+RA+EL++
Sbjct:  218 FEEGPWMFRNTTLGVLLPTNLDEEISSLPGWTVESNRAVRFIDSDHFSVPEGKRALELLS 277

Query:  567 GKESAIAQVARTIIGKTYVLSFAVGDANNACEGSMIVEAFAGRDTLKVPYESKGKGGFKR 388
            GKE  I+Q+  T     Y +SF++G A + C+  + V AFAG       Y ++    F+R
Sbjct:  278 GKEGIISQMVETKANIPYKMSFSLGHAGDKCKEPLAVMAFAGDQAQNFHYMAQANSSFER 337

Query:  387 ASLRFVAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDVKLL---SVRKA*NGPCFIL 223
            + L F A + RTR+ FYS +Y  R+DD +SLCGPVIDDVK+    S R   + P FIL
Sbjct:  338 SELNFTAKAERTRIAFYSIYYNTRTDDMTSLCGPVIDDVKVWFSGSSRIGFSFPLFIL 395

>TAIR9_protein||AT1G29980.1 | Symbols:  | INVOLVED IN: biological_process
        unknown; LOCATED IN: plasma membrane, anchored to membrane; EXPRESSED
        IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS
        InterPro DOMAIN/s: Protein of unknown function DUF642
        (InterPro:IPR006946), Galactose-binding like (InterPro:IPR008979); BEST
        Arabidopsis thaliana protein match is: unknown protein
        (TAIR:AT2G34510.1); Has 180 Blast hits to 163 proteins in 15 species:
        Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0; Plants - 170;
        Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr1:10503411-10505994 REVERSE

          Length = 408

 Score =  339 bits (869), Expect = 3e-093
 Identities = 173/369 (46%), Positives = 234/369 (63%), Gaps = 16/369 (4%)
 Frame = -3

Query: 1323 FLCVLLISTVTSVV-------CFRDGMLPNGDFELGPKPSDMKGTQVLNKNAIPSWELTG 1165
            FL +L +S    V           DG++ NGDFE  P             + IPSW+  G
Sbjct:   14 FLFLLSVSVAVLVAVADDKSPAVEDGLVINGDFETSPSSGFPDDGVTDGPSDIPSWKSNG 73

Query: 1164 FVEYIKSGQKQGDMLLVVPAGKFAIRLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDER 985
             VE I SGQKQG M+L+VP G+ A+RLGN+A I Q L V KG  YS+TFSAARTCAQ E 
Sbjct:   74 TVELINSGQKQGGMILIVPQGRHAVRLGNDAEISQDLTVEKGFVYSVTFSAARTCAQLES 133

Query:  984 LNISVAP---------DSGIIPIQTVYSSSGWDLYAWAFQAESNVAEIVIHNPGEEEDPA 832
            +N+SVA           S  + +QT+YS  GWD YAWAF+AE +   +V  NPG E+DP 
Sbjct:  134 INVSVASVNADADDMLASRNVDLQTLYSVQGWDPYAWAFEAEDDHVRLVFKNPGMEDDPT 193

Query:  831 CGPLIDGVAIKALYPPRPTNKNILKNGGFEEGPYLLPNSTTGVLIPPFIEDDHSPLPAWM 652
            CGP+ID +AIK L+ P     N + NG FE+GP++  N++ GVL+P  ++++ S LP W 
Sbjct:  194 CGPIIDDIAIKKLFTPDKPKDNAVINGDFEDGPWMFRNTSLGVLLPTNLDEEISSLPGWT 253

Query:  651 VESLKAVKYVDVEHFSVPQGRRAVELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACE 472
            VES +AV++VD +HFSVP+G+RAVEL++GKE  I+Q+  T   K Y+LSF++G A + C+
Sbjct:  254 VESNRAVRFVDSDHFSVPKGKRAVELLSGKEGIISQMVETKADKPYILSFSLGHAGDKCK 313

Query:  471 GSMIVEAFAGRDTLKVPYESKGKGGFKRASLRFVAVSTRTRVMFYSTFYAMRSDDFSSLC 292
              + + AFAG       Y ++    F++A L F A + RTRV FYS +Y  R+DD SSLC
Sbjct:  314 EPLAIMAFAGDQAQNFHYMAQANSSFEKAGLNFTAKADRTRVAFYSVYYNTRTDDMSSLC 373

Query:  291 GPVIDDVKL 265
            GPVIDDV++
Sbjct:  374 GPVIDDVRV 382

>TAIR9_protein||AT1G29980.2 | Symbols:  | INVOLVED IN: biological_process
        unknown; LOCATED IN: plasma membrane, anchored to membrane; EXPRESSED
        IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS
        InterPro DOMAIN/s: Protein of unknown function DUF642
        (InterPro:IPR006946), Galactose-binding like (InterPro:IPR008979); BEST
        Arabidopsis thaliana protein match is: unknown protein
        (TAIR:AT2G34510.1); Has 174 Blast hits to 163 proteins in 15 species:
        Archae - 0; Bacteria - 10; Metazoa - 0; Fungi - 0; Plants - 164;
        Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
        chr1:10503411-10504617 REVERSE

          Length = 372

 Score =  242 bits (617), Expect = 4e-064
 Identities = 111/229 (48%), Positives = 158/229 (68%)
 Frame = -3

Query: 951 IPIQTVYSSSGWDLYAWAFQAESNVAEIVIHNPGEEEDPACGPLIDGVAIKALYPPRPTN 772
           + +QT+YS  GWD YAWAF+AE +   +V  NPG E+DP CGP+ID +AIK L+ P    
Sbjct: 118 VDLQTLYSVQGWDPYAWAFEAEDDHVRLVFKNPGMEDDPTCGPIIDDIAIKKLFTPDKPK 177

Query: 771 KNILKNGGFEEGPYLLPNSTTGVLIPPFIEDDHSPLPAWMVESLKAVKYVDVEHFSVPQG 592
            N + NG FE+GP++  N++ GVL+P  ++++ S LP W VES +AV++VD +HFSVP+G
Sbjct: 178 DNAVINGDFEDGPWMFRNTSLGVLLPTNLDEEISSLPGWTVESNRAVRFVDSDHFSVPKG 237

Query: 591 RRAVELVAGKESAIAQVARTIIGKTYVLSFAVGDANNACEGSMIVEAFAGRDTLKVPYES 412
           +RAVEL++GKE  I+Q+  T   K Y+LSF++G A + C+  + + AFAG       Y +
Sbjct: 238 KRAVELLSGKEGIISQMVETKADKPYILSFSLGHAGDKCKEPLAIMAFAGDQAQNFHYMA 297

Query: 411 KGKGGFKRASLRFVAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDVKL 265
           +    F++A L F A + RTRV FYS +Y  R+DD SSLCGPVIDDV++
Sbjct: 298 QANSSFEKAGLNFTAKADRTRVAFYSVYYNTRTDDMSSLCGPVIDDVRV 346


 Score =  108 bits (269), Expect = 1e-023
 Identities = 55/101 (54%), Positives = 67/101 (66%)
 Frame = -3

Query: 1269 GMLPNGDFELGPKPSDMKGTQVLNKNAIPSWELTGFVEYIKSGQKQGDMLLVVPAGKFAI 1090
            G++ NGDFE  P             + IPSW+  G VE I SGQKQG M+L+VP G+ A+
Sbjct:    3 GLVINGDFETSPSSGFPDDGVTDGPSDIPSWKSNGTVELINSGQKQGGMILIVPQGRHAV 62

Query: 1089 RLGNEASIKQRLNVTKGMYYSLTFSAARTCAQDERLNISVA 967
            RLGN+A I Q L V KG  YS+TFSAARTCAQ E +N+SVA
Sbjct:   63 RLGNDAEISQDLTVEKGFVYSVTFSAARTCAQLESINVSVA 103

>TAIR9_protein||AT5G14150.1 | Symbols:  | unknown protein | chr5:4565246-4566653
        REVERSE

          Length = 384

 Score =  153 bits (384), Expect = 4e-037
 Identities = 115/355 (32%), Positives = 170/355 (47%), Gaps = 34/355 (9%)
 Frame = -3

Query: 1281 CFRDGMLPNGDFELGP--KPSDMKGTQV-LNKNA-IPSWELTGFVEYIKSGQKQGDMLLV 1114
            C     L N DFE  P   P++   + V L++N+ +P W   G V Y++           
Sbjct:   17 CASSDFLENPDFESPPLNLPTNSNASSVSLDQNSTLPGWTFQGTVLYVE----------- 65

Query: 1113 VPAGKFAIRLGNEASIKQRLNVTKG--MYYSLTFS---AARTCAQDERLNISVAPDSGII 949
            +P    A++LG +  I Q   + KG  + Y LTF+   A + C     L++S    + + 
Sbjct:   66 LPDTGHAVQLGEDGKINQTF-IAKGDELNYILTFALIHAGQNCTSSAGLSVSGPDSNAVF 124

Query:  948 PIQTVYSSSGWDLYAWAFQAESN------VAEIVIHNPGEEEDPACGPLIDGVAIKALYP 787
              +  YS   W  Y+    +  N      V E    +   + +  C P+ID + IK +  
Sbjct:  125 SYRQNYSKVSWQSYSHNLGSWGNGEPINLVLESQAIDSDSDTNSTCWPIIDTLLIKTVGV 184

Query:  786 PRPTNK-NILKNGGFEEGPYLLPNSTTGVLIPPFIEDDHSPLPAWMVESLKAVKYVDVEH 610
                +  N+L NGGFE GP  LPNST GVLI        SPL  W V  +  V+Y+D EH
Sbjct:  185 TLVQDSGNLLINGGFESGPGFLPNSTDGVLIDAVPSLIQSPLRQWSV--IGTVRYIDSEH 242

Query:  609 FSVPQGRRAVELVAGKESAIAQVAR--TIIGKTYVLSFAVGDANNACEGSMIVEAFAGRD 436
            F VP+G+ A+E+++    +  Q A   T  G  Y L+F +GDAN+AC G  +V A AG  
Sbjct:  243 FHVPEGKAAIEILSNTAPSGIQTATKGTSEGSRYNLTFTLGDANDACRGHFVVGAQAGSV 302

Query:  435 TLKVPYESKGKGGFKRASLRFVAVSTRTRVMFYSTFYAMRSDDFSSLCGPVIDDV 271
            T     ES G G  ++  L F A     ++ F  T Y++     + +CGPVID+V
Sbjct:  303 TQNFTLESNGTGSGEKFGLVFEADKDAAQISF--TSYSVTMTKENVVCGPVIDEV 355

  Database: TAIR9 protein
    Posted date:  Wed Jul 08 15:16:08 2009
  Number of letters in database: 13,468,323
  Number of sequences in database:  33,410

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,043,379,755
Number of Sequences: 33410
Number of Extensions: 9043379755
Number of Successful Extensions: 311883418
Number of sequences better than 0.0: 0