Library    |     Search    |     Batch query    |     SNP    |     SSR  

TAIR blast output of UN19332


BLASTX 7.6.2

Query= UN19332 /QuerySize=1703
        (1702 letters)

Database: TAIR9 protein;
          33,410 sequences; 13,468,323 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

TAIR9_protein||AT3G49310.1 | Symbols:  | LOCATED IN: endomembran...    884   5e-257
TAIR9_protein||AT4G27720.1 | Symbols:  | LOCATED IN: plasma memb...    794   4e-230
TAIR9_protein||AT1G64650.1 | Symbols:  | LOCATED IN: plasma memb...    767   8e-222
TAIR9_protein||AT1G64650.2 | Symbols:  | LOCATED IN: plasma memb...    688   5e-198
TAIR9_protein||AT2G23093.1 | Symbols:  | FUNCTIONS IN: molecular...    100   4e-021

>TAIR9_protein||AT3G49310.1 | Symbols:  | LOCATED IN: endomembrane system;
        EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages;
        CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF791
        (InterPro:IPR008509), Major facilitator superfamily, general substrate
        transporter (InterPro:IPR016196); BEST Arabidopsis thaliana protein
        match is: unknown protein (TAIR:AT4G27720.1); Has 550 Blast hits to 545
        proteins in 201 species: Archae - 5; Bacteria - 285; Metazoa - 85;
        Fungi - 33; Plants - 99; Viruses - 0; Other Eukaryotes - 43 (source:
        NCBI BLink). | chr3:18285084-18287991 REVERSE

          Length = 461

 Score =  884 bits (2282), Expect = 5e-257
 Identities = 446/461 (96%), Positives = 455/461 (98%)
 Frame = -3

Query: 1637 MEVFYYLVFGVMAAVVAALELSKTNKDRINTSSSFNSFKNNYLVVFSIMMARDWLQGPYV 1458
            MEVFYYLVFGVMAAVVAALELSKTNKDRINTSSSFNSFKNNYL+VFSIMMA DWLQGPYV
Sbjct:    1 MEVFYYLVFGVMAAVVAALELSKTNKDRINTSSSFNSFKNNYLLVFSIMMAGDWLQGPYV 60

Query: 1457 YYLYSTYGFGKGDIGQLFIAGFGSSMLFGTIVGSLADKQGRKRACVTYCIVYILSCITKH 1278
            YYLYSTYGFGKGDIGQLFIAGFGSSMLFGTIVGSLADKQGRKRACVTYCIVYILSCITKH
Sbjct:   61 YYLYSTYGFGKGDIGQLFIAGFGSSMLFGTIVGSLADKQGRKRACVTYCIVYILSCITKH 120

Query: 1277 SPQYRVLMVGRILGGIATSLLFSAFESWLIAEHNKRNFDQQWLSLTFSKAVFLGNGLVAI 1098
            SPQY+VLMVGRILGGIATSLLFSAFESWLIAEHNKRNF+QQWLSLTFSKAVFLGNGLVAI
Sbjct:  121 SPQYKVLMVGRILGGIATSLLFSAFESWLIAEHNKRNFEQQWLSLTFSKAVFLGNGLVAI 180

Query: 1097 LSGLFGNLLVDTFSFGPVAPFDAAACILAIGMAIILSTWSENYGDPSDSKDLLTQFKVAA 918
            LSGLFGNLLVDTFSFGPVAPFDAAAC LAIGMAIIL TWSEN+GDPSDSKDLLTQFKVAA
Sbjct:  181 LSGLFGNLLVDTFSFGPVAPFDAAACFLAIGMAIILGTWSENFGDPSDSKDLLTQFKVAA 240

Query:  917 IAIASDEKIALLGAIQSLFEASMYTFVFLWTPALSPNDEEIPHGFVFATFMLASMLGSSL 738
            IAIASDEKIALLGAIQSLFEASMYTFVFLWTPALSPNDEEIPHGFVFATFMLASMLGSSL
Sbjct:  241 IAIASDEKIALLGAIQSLFEASMYTFVFLWTPALSPNDEEIPHGFVFATFMLASMLGSSL 300

Query:  737 AARLMARSSLRVENYMQIVFLVSAASLLLPITTSVLVTPSKVKEEGLSLTCSIQLLGFCV 558
            AARLM+RSSLRVENYMQIVFLVSAASLLLPITTSVLVTPSKVK+EGLSLT SIQLLGFCV
Sbjct:  301 AARLMSRSSLRVENYMQIVFLVSAASLLLPITTSVLVTPSKVKDEGLSLTSSIQLLGFCV 360

Query:  557 FEACVGIFWPSIMKMRSQYIPEEARSTIMNFFRVPLNIFVCIVLYNVDAFPITIMFGMCS 378
            FE+CVGIFWPSIMKMRSQYIPEEARSTIMNFFRVPLNIFVCIVLYNVDAFPITIMFGMCS
Sbjct:  361 FESCVGIFWPSIMKMRSQYIPEEARSTIMNFFRVPLNIFVCIVLYNVDAFPITIMFGMCS 420

Query:  377 IFLFVASILQRRLMVISEKPKGEEWSPMKDRNSEVDPLTL* 255
            IFLFVASILQRRLMVISEKPK E+WSPMK+RNSEVDPLTL*
Sbjct:  421 IFLFVASILQRRLMVISEKPKAEDWSPMKERNSEVDPLTL* 461

>TAIR9_protein||AT4G27720.1 | Symbols:  | LOCATED IN: plasma membrane; EXPRESSED
        IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS
        InterPro DOMAIN/s: Protein of unknown function DUF791
        (InterPro:IPR008509), Major facilitator superfamily, general substrate
        transporter (InterPro:IPR016196); BEST Arabidopsis thaliana protein
        match is: unknown protein (TAIR:AT1G64650.1); Has 496 Blast hits to 491
        proteins in 183 species: Archae - 5; Bacteria - 234; Metazoa - 75;
        Fungi - 33; Plants - 96; Viruses - 0; Other Eukaryotes - 53 (source:
        NCBI BLink). | chr4:13831203-13833521 FORWARD

          Length = 461

 Score =  794 bits (2050), Expect = 4e-230
 Identities = 381/461 (82%), Positives = 431/461 (93%)
 Frame = -3

Query: 1637 MEVFYYLVFGVMAAVVAALELSKTNKDRINTSSSFNSFKNNYLVVFSIMMARDWLQGPYV 1458
            ME+FYYLVFGV+  VVAALELSK NKDRINTSS+FNSFKNNYL+V+S+MMA DWLQGPYV
Sbjct:    1 MEIFYYLVFGVLGLVVAALELSKNNKDRINTSSAFNSFKNNYLLVYSLMMAGDWLQGPYV 60

Query: 1457 YYLYSTYGFGKGDIGQLFIAGFGSSMLFGTIVGSLADKQGRKRACVTYCIVYILSCITKH 1278
            YYLYSTYGFGKGDIGQLFIAGFGSSMLFGTIVGSLADKQGRKRACVTYCI YILSCITKH
Sbjct:   61 YYLYSTYGFGKGDIGQLFIAGFGSSMLFGTIVGSLADKQGRKRACVTYCITYILSCITKH 120

Query: 1277 SPQYRVLMVGRILGGIATSLLFSAFESWLIAEHNKRNFDQQWLSLTFSKAVFLGNGLVAI 1098
            SPQY+VLMVGR+LGGIATSLLFS+FESWL+AEHNKR F+QQWLS+TFSKAVF GNGLVAI
Sbjct:  121 SPQYKVLMVGRVLGGIATSLLFSSFESWLVAEHNKRGFEQQWLSVTFSKAVFFGNGLVAI 180

Query: 1097 LSGLFGNLLVDTFSFGPVAPFDAAACILAIGMAIILSTWSENYGDPSDSKDLLTQFKVAA 918
            ++GLFGNLLVDTFS GPVAPFDAAAC L IGMA+ILS+W+ENYGDPS++KDLLTQF+ AA
Sbjct:  181 IAGLFGNLLVDTFSLGPVAPFDAAACFLTIGMAVILSSWTENYGDPSENKDLLTQFRGAA 240

Query:  917 IAIASDEKIALLGAIQSLFEASMYTFVFLWTPALSPNDEEIPHGFVFATFMLASMLGSSL 738
            +AIASDEKIALLGAIQSLFE SMYTFVFLWTPALSPNDEEIPHGF+FATFMLASMLGSSL
Sbjct:  241 VAIASDEKIALLGAIQSLFEGSMYTFVFLWTPALSPNDEEIPHGFIFATFMLASMLGSSL 300

Query:  737 AARLMARSSLRVENYMQIVFLVSAASLLLPITTSVLVTPSKVKEEGLSLTCSIQLLGFCV 558
            A+RL++RS+ +VE+YMQIVFLVS A+LLLPI  ++ + PSKVK  G+S +   QLLGFC+
Sbjct:  301 ASRLLSRSTPKVESYMQIVFLVSGAALLLPILMTLFIAPSKVKGGGISFSGCFQLLGFCI 360

Query:  557 FEACVGIFWPSIMKMRSQYIPEEARSTIMNFFRVPLNIFVCIVLYNVDAFPITIMFGMCS 378
            FEACVG+FWPSIMKMRSQYIPEEARSTIMNFFR+PLNIFVC+VLYNV+AFPIT+MFGMCS
Sbjct:  361 FEACVGLFWPSIMKMRSQYIPEEARSTIMNFFRIPLNIFVCVVLYNVNAFPITVMFGMCS 420

Query:  377 IFLFVASILQRRLMVISEKPKGEEWSPMKDRNSEVDPLTL* 255
            IFLFVAS+LQRRLM+I +KPK  +W+P+++RN+E DPL +*
Sbjct:  421 IFLFVASLLQRRLMMIVDKPKTNDWTPLEERNTEEDPLNI* 461

>TAIR9_protein||AT1G64650.1 | Symbols:  | LOCATED IN: plasma membrane; EXPRESSED
        IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS
        InterPro DOMAIN/s: Protein of unknown function DUF791
        (InterPro:IPR008509), Major facilitator superfamily, general substrate
        transporter (InterPro:IPR016196); BEST Arabidopsis thaliana protein
        match is: unknown protein (TAIR:AT4G27720.1); Has 565 Blast hits to 562
        proteins in 197 species: Archae - 7; Bacteria - 312; Metazoa - 76;
        Fungi - 39; Plants - 85; Viruses - 0; Other Eukaryotes - 46 (source:
        NCBI BLink). | chr1:24023805-24026336 REVERSE

          Length = 463

 Score =  767 bits (1978), Expect = 8e-222
 Identities = 371/460 (80%), Positives = 424/460 (92%), Gaps = 2/460 (0%)
 Frame = -3

Query: 1637 MEVFYYLVFGVMAAVVAALELSKTNKDRINTSSSFNSFKNNYLVVFSIMMARDWLQGPYV 1458
            ME+FY++VFG +AA+VA LELSK+NKDRINTSS+FNSFKNNYL+V+S+MMA DWLQGPYV
Sbjct:    1 MEIFYFVVFGGLAAIVAGLELSKSNKDRINTSSAFNSFKNNYLLVYSLMMAGDWLQGPYV 60

Query: 1457 YYLYSTYGFGKGDIGQLFIAGFGSSMLFGTIVGSLADKQGRKRACVTYCIVYILSCITKH 1278
            YYLYSTYGFGKG+IGQLFIAGFGSSMLFGTIVGSLADKQGRKRA +TYCI YILSCITKH
Sbjct:   61 YYLYSTYGFGKGEIGQLFIAGFGSSMLFGTIVGSLADKQGRKRASITYCITYILSCITKH 120

Query: 1277 SPQYRVLMVGRILGGIATSLLFSAFESWLIAEHNKRNFDQQWLSLTFSKAVFLGNGLVAI 1098
            SPQY+VLMVGR+LGGIATSLLFSAFESWL+AEHNKR F+QQWLS+TFSKA+FLGNGLVAI
Sbjct:  121 SPQYKVLMVGRVLGGIATSLLFSAFESWLVAEHNKRGFEQQWLSVTFSKAIFLGNGLVAI 180

Query: 1097 LSGLFGNLLVDTFSFGPVAPFDAAACILAIGMAIILSTWSENYGDPSDSKDLLTQFKVAA 918
            ++GLFGN LVD+ S GPVAPFDAAAC LAIGMA+I+S+WSENYGDPS++KDLLTQFK AA
Sbjct:  181 IAGLFGNYLVDSLSLGPVAPFDAAACFLAIGMAVIISSWSENYGDPSENKDLLTQFKNAA 240

Query:  917 IAIASDEKIALLGAIQSLFEASMYTFVFLWTPALSPNDEEIPHGFVFATFMLASMLGSSL 738
             AIASDEKIALLGAIQSLFE SMYTFVFLWTPALSPN+E+IPHGF+FATFMLASMLGSS+
Sbjct:  241 SAIASDEKIALLGAIQSLFEGSMYTFVFLWTPALSPNEEDIPHGFIFATFMLASMLGSSI 300

Query:  737 AARLMARSSLRVENYMQIVFLVSAASLLLPITTSVLVTPSKVKEEGLSLTCSIQLLGFCV 558
            A+RL+A SS +VE+YMQIVF++S+A+L+LP+ TS LV PS VK   +S +  IQL+GFC 
Sbjct:  301 ASRLLAHSSPKVESYMQIVFVISSAALMLPVVTSFLVAPSGVKGGSISFSGCIQLMGFCT 360

Query:  557 FEACVGIFWPSIMKMRSQYIPEEARSTIMNFFRVPLNIFVCIVLYNVDAFPITIMFGMCS 378
            FEACVGIFWPSIMKMRSQYIPEEARSTIMNFFR+PLNIFVC+VLYNVDAFP+T+MFGMCS
Sbjct:  361 FEACVGIFWPSIMKMRSQYIPEEARSTIMNFFRIPLNIFVCLVLYNVDAFPMTVMFGMCS 420

Query:  377 IFLFVASILQRRLMVISE--KPKGEEWSPMKDRNSEVDPL 264
            +FLFVASILQRRLM ++E  K + +EWS  K+  SE DPL
Sbjct:  421 VFLFVASILQRRLMNVAEIHKSRSQEWSAEKEMTSEADPL 460

>TAIR9_protein||AT1G64650.2 | Symbols:  | LOCATED IN: plasma membrane; EXPRESSED
        IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS
        InterPro DOMAIN/s: Protein of unknown function DUF791
        (InterPro:IPR008509), Major facilitator superfamily, general substrate
        transporter (InterPro:IPR016196); BEST Arabidopsis thaliana protein
        match is: unknown protein (TAIR:AT4G27720.1); Has 548 Blast hits to 545
        proteins in 182 species: Archae - 5; Bacteria - 298; Metazoa - 76;
        Fungi - 38; Plants - 85; Viruses - 0; Other Eukaryotes - 46 (source:
        NCBI BLink). | chr1:24023805-24026048 REVERSE

          Length = 422

 Score =  688 bits (1773), Expect = 5e-198
 Identities = 333/411 (81%), Positives = 377/411 (91%), Gaps = 2/411 (0%)
 Frame = -3

Query: 1490 MARDWLQGPYVYYLYSTYGFGKGDIGQLFIAGFGSSMLFGTIVGSLADKQGRKRACVTYC 1311
            +A DWLQGPYVYYLYSTYGFGKG+IGQLFIAGFGSSMLFGTIVGSLADKQGRKRA +TYC
Sbjct:    9 LAGDWLQGPYVYYLYSTYGFGKGEIGQLFIAGFGSSMLFGTIVGSLADKQGRKRASITYC 68

Query: 1310 IVYILSCITKHSPQYRVLMVGRILGGIATSLLFSAFESWLIAEHNKRNFDQQWLSLTFSK 1131
            I YILSCITKHSPQY+VLMVGR+LGGIATSLLFSAFESWL+AEHNKR F+QQWLS+TFSK
Sbjct:   69 ITYILSCITKHSPQYKVLMVGRVLGGIATSLLFSAFESWLVAEHNKRGFEQQWLSVTFSK 128

Query: 1130 AVFLGNGLVAILSGLFGNLLVDTFSFGPVAPFDAAACILAIGMAIILSTWSENYGDPSDS 951
            A+FLGNGLVAI++GLFGN LVD+ S GPVAPFDAAAC LAIGMA+I+S+WSENYGDPS++
Sbjct:  129 AIFLGNGLVAIIAGLFGNYLVDSLSLGPVAPFDAAACFLAIGMAVIISSWSENYGDPSEN 188

Query:  950 KDLLTQFKVAAIAIASDEKIALLGAIQSLFEASMYTFVFLWTPALSPNDEEIPHGFVFAT 771
            KDLLTQFK AA AIASDEKIALLGAIQSLFE SMYTFVFLWTPALSPN+E+IPHGF+FAT
Sbjct:  189 KDLLTQFKNAASAIASDEKIALLGAIQSLFEGSMYTFVFLWTPALSPNEEDIPHGFIFAT 248

Query:  770 FMLASMLGSSLAARLMARSSLRVENYMQIVFLVSAASLLLPITTSVLVTPSKVKEEGLSL 591
            FMLASMLGSS+A+RL+A SS +VE+YMQIVF++S+A+L+LP+ TS LV PS VK   +S 
Sbjct:  249 FMLASMLGSSIASRLLAHSSPKVESYMQIVFVISSAALMLPVVTSFLVAPSGVKGGSISF 308

Query:  590 TCSIQLLGFCVFEACVGIFWPSIMKMRSQYIPEEARSTIMNFFRVPLNIFVCIVLYNVDA 411
            +  IQL+GFC FEACVGIFWPSIMKMRSQYIPEEARSTIMNFFR+PLNIFVC+VLYNVDA
Sbjct:  309 SGCIQLMGFCTFEACVGIFWPSIMKMRSQYIPEEARSTIMNFFRIPLNIFVCLVLYNVDA 368

Query:  410 FPITIMFGMCSIFLFVASILQRRLMVISE--KPKGEEWSPMKDRNSEVDPL 264
            FP+T+MFGMCS+FLFVASILQRRLM ++E  K + +EWS  K+  SE DPL
Sbjct:  369 FPMTVMFGMCSVFLFVASILQRRLMNVAEIHKSRSQEWSAEKEMTSEADPL 419

>TAIR9_protein||AT2G23093.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        endomembrane system; CONTAINS InterPro DOMAIN/s: Protein of unknown
        function DUF791 (InterPro:IPR008509), Major facilitator superfamily,
        general substrate transporter (InterPro:IPR016196); BEST Arabidopsis
        thaliana protein match is: unknown protein (TAIR:AT3G49310.1); Has 251
        Blast hits to 248 proteins in 89 species: Archae - 0; Bacteria - 66;
        Metazoa - 68; Fungi - 18; Plants - 79; Viruses - 0; Other Eukaryotes -
        20 (source: NCBI BLink). | chr2:9832193-9834416 FORWARD

          Length = 450

 Score =  100 bits (247), Expect = 4e-021
 Identities = 69/291 (23%), Positives = 139/291 (47%), Gaps = 7/291 (2%)
 Frame = -3

Query: 1544 SSSFNSFKNNYLVVFSIMMARDWLQGPYVYYLYSTYGFGKGDIGQLFIAGFGSSMLFGTI 1365
            SSSF  F+  +L ++++    + L   Y     +TYG  K  +      G+ ++++ G +
Sbjct:   48 SSSFARFQRWFLAIYTLSSVMEGLLSVYGELELTTYGLSKESMVFYLCVGYSTALVLGPV 107

Query: 1364 VGSLADKQGRKRACVTYCIVYILSCITKHSPQYRVLMVGRILGGIATSLLFSAFESWLIA 1185
            +G ++D  G+K+ C+ YC+++++  + K            +   +A  +    FE+WL+ 
Sbjct:  108 LGVVSDLIGQKKICLLYCVLHLIVGVWKRITMSPSAWFANVFLSLAGLVYSFGFETWLVV 167

Query: 1184 EHNKRNFDQQWLSLTFSKAVFLGNGLVAILSG-LFGNLLVDTFSFGPVAPFDAAACILAI 1008
            EH K++     L+ TF    FL +   +++ G +  N LV       +A    A+ +L++
Sbjct:  168 EHEKQSQRNDSLNETFWLMTFLES--ASLIGGQVLANWLVGENVQDGIALSATASLLLSV 225

Query: 1007 GMAIILSTWSENYGDPSDSKDLLTQFKVAAIAIASDEKIALLGAIQSLFEASMYTFVFLW 828
               I +   ++        +D  T F      +  D++I  LG  Q+  + S   F  LW
Sbjct:  226 VTIICIVQTAKEPLKTLPFRDYSTAFYA---YVLGDKRIWFLGTSQACLQFSTAVFWILW 282

Query:  827 TPALSPNDEEIPHGFVFATFMLASMLGSSLAARLMA-RSSLRVENYMQIVF 678
             P +  +  E+  G ++  F+ + MLGS++   LM+ +S LR+E+ +  ++
Sbjct:  283 APTIVADGREVNLGLIYPCFLGSRMLGSTVFPWLMSGQSLLRLEDCLVYIY 333

  Database: TAIR9 protein
    Posted date:  Wed Jul 08 15:16:08 2009
  Number of letters in database: 13,468,323
  Number of sequences in database:  33,410

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,059,480,512
Number of Sequences: 33410
Number of Extensions: 10059480512
Number of Successful Extensions: 333405162
Number of sequences better than 0.0: 0