TAIR blast output of UN18872


BLASTX 7.6.2

Query= UN18872 /QuerySize=1699
        (1698 letters)

Database: TAIR9 protein;
          33,410 sequences; 13,468,323 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

TAIR9_protein||AT2G05590.1 | Symbols:  | FUNCTIONS IN: molecular...    231   2e-060
TAIR9_protein||AT2G05590.2 | Symbols:  | FUNCTIONS IN: molecular...    231   2e-060
TAIR9_protein||AT2G05620.1 | Symbols: PGR5 | PGR5 (proton gradie...    209   5e-054
TAIR9_protein||AT4G39870.1 | Symbols:  | FUNCTIONS IN: molecular...    138   1e-032

>TAIR9_protein||AT2G05590.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15
        growth stages; CONTAINS InterPro DOMAIN/s: TLDc (InterPro:IPR006571);
        BEST Arabidopsis thaliana protein match is: unknown protein
        (TAIR:AT4G39870.1); Has 522 Blast hits to 522 proteins in 113 species:
        Archae - 0; Bacteria - 0; Metazoa - 307; Fungi - 48; Plants - 62;
        Viruses - 0; Other Eukaryotes - 105 (source: NCBI BLink). |
        chr2:2067196-2068650 FORWARD

          Length = 264

 Score =  231 bits (587), Expect = 2e-060
 Identities = 109/126 (86%), Positives = 117/126 (92%)
 Frame = -3

Query: 1315 GGEEMKELTESSAFISPNLCEFLHACLPNIVRGCKWVLVYSTLKHGISLRTLLRKSAELP 1136
            G ++M+ELTESS FI+ NL EFLHA LPNIVRGCKW+L+YSTLKHGISLRTLLR+S ELP
Sbjct:  121 GVKKMRELTESSVFITANLFEFLHASLPNIVRGCKWILLYSTLKHGISLRTLLRRSGELP 180

Query: 1135 GPCLLVAGDKQGAVFGALLECPLTTTPKRKYQGTSQTFLFTTIYGQPRIFRPTGANRYYY 956
            GPCLLVAGDKQGAVFGALLECPL  TPKRKYQGTSQTFLFTTIYG+PRIFRPTGANRYY 
Sbjct:  181 GPCLLVAGDKQGAVFGALLECPLQPTPKRKYQGTSQTFLFTTIYGEPRIFRPTGANRYYL 240

Query:  955 MCMNEF 938
            MCMNEF
Sbjct:  241 MCMNEF 246


 Score =  104 bits (257), Expect = 3e-022
 Identities = 67/118 (56%), Positives = 79/118 (66%), Gaps = 16/118 (13%)
 Frame = -3

Query: 1612 MHALKDKVSQKLSNLFADSPSQSASPRSALTDSPK----GSGGKSFTSYFSF---GAGNE 1454
            MHALKDKVSQKLSNLFADSPSQSASPR + +DSPK     S GKS +SYFSF    +GNE
Sbjct:    1 MHALKDKVSQKLSNLFADSPSQSASPRYSNSDSPKARLNSSVGKSLSSYFSFVVPQSGNE 60

Query: 1453 NEDSESSSPPPPPPIRTDSYESVENCKQEEDCNKNQQQASSTMMGGGGEEMKELTESS 1280
             EDSE     PP PIRT+SYE +ENCK     + N Q  + T +  G ++  EL  S+
Sbjct:   61 -EDSELC---PPLPIRTESYECIENCK-----SANGQAKAGTFISIGEDKDCELRVSA 109

>TAIR9_protein||AT2G05590.2 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15
        growth stages; CONTAINS InterPro DOMAIN/s: TLDc (InterPro:IPR006571);
        BEST Arabidopsis thaliana protein match is: unknown protein
        (TAIR:AT4G39870.1); Has 697 Blast hits to 689 proteins in 142 species:
        Archae - 0; Bacteria - 0; Metazoa - 403; Fungi - 84; Plants - 66;
        Viruses - 0; Other Eukaryotes - 144 (source: NCBI BLink). |
        chr2:2067196-2068951 FORWARD

          Length = 304

 Score =  231 bits (587), Expect = 2e-060
 Identities = 109/126 (86%), Positives = 117/126 (92%)
 Frame = -3

Query: 1315 GGEEMKELTESSAFISPNLCEFLHACLPNIVRGCKWVLVYSTLKHGISLRTLLRKSAELP 1136
            G ++M+ELTESS FI+ NL EFLHA LPNIVRGCKW+L+YSTLKHGISLRTLLR+S ELP
Sbjct:  121 GVKKMRELTESSVFITANLFEFLHASLPNIVRGCKWILLYSTLKHGISLRTLLRRSGELP 180

Query: 1135 GPCLLVAGDKQGAVFGALLECPLTTTPKRKYQGTSQTFLFTTIYGQPRIFRPTGANRYYY 956
            GPCLLVAGDKQGAVFGALLECPL  TPKRKYQGTSQTFLFTTIYG+PRIFRPTGANRYY 
Sbjct:  181 GPCLLVAGDKQGAVFGALLECPLQPTPKRKYQGTSQTFLFTTIYGEPRIFRPTGANRYYL 240

Query:  955 MCMNEF 938
            MCMNEF
Sbjct:  241 MCMNEF 246


 Score =  117 bits (291), Expect = 3e-026
 Identities = 55/58 (94%), Positives = 56/58 (96%)
 Frame = -2

Query: 938 LAFGGGGSFALCLDEDSLKATSGPSETFGNECLASSTEFELKNVELWGFAHASQYLS* 765
           LAFGGGG+FALCLDED LKATSGPSETFGNECLASSTEFELKNVELWGFAHASQYLS*
Sbjct: 247 LAFGGGGNFALCLDEDLLKATSGPSETFGNECLASSTEFELKNVELWGFAHASQYLS* 304


 Score =  104 bits (257), Expect = 3e-022
 Identities = 67/118 (56%), Positives = 79/118 (66%), Gaps = 16/118 (13%)
 Frame = -3

Query: 1612 MHALKDKVSQKLSNLFADSPSQSASPRSALTDSPK----GSGGKSFTSYFSF---GAGNE 1454
            MHALKDKVSQKLSNLFADSPSQSASPR + +DSPK     S GKS +SYFSF    +GNE
Sbjct:    1 MHALKDKVSQKLSNLFADSPSQSASPRYSNSDSPKARLNSSVGKSLSSYFSFVVPQSGNE 60

Query: 1453 NEDSESSSPPPPPPIRTDSYESVENCKQEEDCNKNQQQASSTMMGGGGEEMKELTESS 1280
             EDSE     PP PIRT+SYE +ENCK     + N Q  + T +  G ++  EL  S+
Sbjct:   61 -EDSELC---PPLPIRTESYECIENCK-----SANGQAKAGTFISIGEDKDCELRVSA 109

>TAIR9_protein||AT2G05620.1 | Symbols: PGR5 | PGR5 (proton gradient regulation
        5); electron carrier | chr2:2081204-2081687 REVERSE

          Length = 134

 Score =  209 bits (531), Expect = 5e-054
 Identities = 110/135 (81%), Positives = 118/135 (87%), Gaps = 7/135 (5%)
 Frame = +2

Query:  29 MAAASLSA------NLGTSFYGGWGSSISGEDYHTMLAKTTAPRQHFAKLSRKPIRVQPM 190
           MAAAS+SA       +GTSFYGGWGSSISGEDY TML+KT AP Q  A++SRK IR  PM
Sbjct:   1 MAAASISAIGCNQTLIGTSFYGGWGSSISGEDYQTMLSKTVAPPQQ-ARVSRKAIRAVPM 59

Query: 191 MKNVNEGKGLFAPLVVVTRDIVGKKRFNQLRGKAIALHPQVITEFCKSIGADAKQRQGLI 370
           MKNVNEGKGLFAPLVVVTR++VGKKRFNQLRGKAIALH QVITEFCKSIGADAKQRQGLI
Sbjct:  60 MKNVNEGKGLFAPLVVVTRNLVGKKRFNQLRGKAIALHSQVITEFCKSIGADAKQRQGLI 119

Query: 371 RLAKKNGEKLGFLA* 415
           RLAKKNGE+LGFLA*
Sbjct: 120 RLAKKNGERLGFLA* 134

>TAIR9_protein||AT4G39870.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        cellular_component unknown; EXPRESSED IN: 23 plant structures;
        EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: TLDc
        (InterPro:IPR006571); BEST Arabidopsis thaliana protein match is:
        unknown protein (TAIR:AT2G05590.2); Has 2018 Blast hits to 1865
        proteins in 220 species: Archae - 0; Bacteria - 39; Metazoa - 1002;
        Fungi - 279; Plants - 129; Viruses - 34; Other Eukaryotes - 535
        (source: NCBI BLink). | chr4:18502234-18504275 FORWARD

          Length = 395

 Score =  138 bits (347), Expect = 1e-032
 Identities = 82/221 (37%), Positives = 123/221 (55%), Gaps = 16/221 (7%)
 Frame = -3

Query: 1564 ADSPSQSASPRSALTDSPKGSGGKS---------FTSYFSFGAGNENEDSESSSPPPPPP 1412
            +D+ S SA+P   + ++  G   K          F +++        ++ + +S   P  
Sbjct:  111 SDTSSSSANPTRTMKETTSGGAAKKSFLSKYKQHFRNFYQAVKFPGVKERKGNSDVIPDD 170

Query: 1411 IRTDSYESVENCKQEEDCNKNQQQASSTMMGGGGEEMKELTESSAFISPNLCEFLHACLP 1232
              T+ Y+ +E    +   N N ++  + ++      + E++E S  +S      L+  LP
Sbjct:  171 EETEYYDGLEMKPMQ---NNNVKEEVTVVVQA---IIPEISEPSLLLSEQSRRSLYTSLP 224

Query: 1231 NIVRGCKWVLVYSTLKHGISLRTLLRKSAELPGPCLLVAGDKQGAVFGALLECPLTTTPK 1052
             +V+G KW+L+YST +HGISL TL RKS   PG  LLV GD++G+VFG L+E PL  T K
Sbjct:  225 ALVQGRKWILLYSTWRHGISLSTLYRKSLLWPGLSLLVVGDRKGSVFGGLVEAPLIPTDK 284

Query: 1051 RKYQGTSQTFLFTTIYGQPRIFRPTGANRYYYMCMNEFWRL 929
             KYQGT+ TF+FT   GQP I+RPTGANR+Y +C  EF  L
Sbjct:  285 -KYQGTNSTFVFTNKSGQPTIYRPTGANRFYTLCSKEFLAL 324


 Score =  72 bits (174), Expect = 1e-012
 Identities = 32/55 (58%), Positives = 40/55 (72%)
 Frame = -2

Query: 938 LAFGGGGSFALCLDEDSLKATSGPSETFGNECLASSTEFELKNVELWGFAHASQY 774
           LA GGGG FAL LD + L  +S  SET+GN CLA S +F++K VELWGF + S+Y
Sbjct: 322 LALGGGGRFALYLDSELLSGSSAYSETYGNSCLADSQDFDVKEVELWGFVYGSKY 376

  Database: TAIR9 protein
    Posted date:  Wed Jul 08 15:16:08 2009
  Number of letters in database: 13,468,323
  Number of sequences in database:  33,410

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,794,778,008
Number of Sequences: 33410
Number of Extensions: 9794778008
Number of Successful Extensions: 327960547
Number of sequences better than 0.0: 0
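
Note on the scores: each HSP header above gives a bit score followed by a raw score in parentheses, for example "231 bits (587)". The two are related by the standard Karlin-Altschul normalization, bits = (Lambda * raw - ln K) / ln 2. The sketch below is an illustration only. It assumes the gapped Lambda and K printed in the footer are the values used for every HSP, and because the report rounds them to three decimals the computed values are approximate. The function name bit_score and the list name reported are made up for this example.

```python
import math

# Gapped Karlin-Altschul parameters copied from the report footer
# (rounded to three decimals there, so results are approximate).
LAMBDA = 0.267
K = 0.041


def bit_score(raw_score: int, lam: float = LAMBDA, k: float = K) -> float:
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# (raw score, reported bit score) pairs taken from the HSP headers above.
reported = [(587, 231), (291, 117), (257, 104), (531, 209), (347, 138), (174, 72)]

for raw, bits in reported:
    # Print the recomputed value next to the value shown in the report.
    print(f"raw={raw:4d}  computed={bit_score(raw):6.1f}  reported={bits}")
```

With these parameters the computed values should round to the bit scores shown in the report, which is a quick consistency check on a parsed copy of the output.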