TAIR blast output of UN00207


BLASTX 7.6.2

Query= UN00207 /QuerySize=1837
        (1836 letters)

Database: TAIR9 protein;
          33,410 sequences; 13,468,323 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

TAIR9_protein||AT4G39870.1 | Symbols:  | FUNCTIONS IN: molecular...    366   4e-101
TAIR9_protein||AT2G05590.2 | Symbols:  | FUNCTIONS IN: molecular...    119   6e-027
TAIR9_protein||AT2G05590.1 | Symbols:  | FUNCTIONS IN: molecular...     91   3e-018

>TAIR9_protein||AT4G39870.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        cellular_component unknown; EXPRESSED IN: 23 plant structures;
        EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: TLDc
        (InterPro:IPR006571); BEST Arabidopsis thaliana protein match is:
        unknown protein (TAIR:AT2G05590.2); Has 2018 Blast hits to 1865
        proteins in 220 species: Archae - 0; Bacteria - 39; Metazoa - 1002;
        Fungi - 279; Plants - 129; Viruses - 34; Other Eukaryotes - 535
        (source: NCBI BLink). | chr4:18502234-18504275 FORWARD

          Length = 395

 Score =  366 bits (938), Expect = 4e-101
 Identities = 208/295 (70%), Positives = 232/295 (78%), Gaps = 29/295 (9%)
 Frame = +1

Query:  331 KKKSLRSKAVHFVSDLTTGLLNPISDKPSSSPPP---DDEGDESKRDQLEESITE--KDL 495
            K KS RSKAVHFV+DLT GLLNPISDKPSS+ PP    DE DESKR+QLE +  E  KDL
Sbjct:    3 KHKSFRSKAVHFVTDLTAGLLNPISDKPSSAHPPPPLPDEEDESKRNQLESTTAEQPKDL 62

Query:  496 VDEPDTSSFSAFLGSLLSSDPKNKQTNQ-----------EEEEEEAESSDTSSSSSSSSG 642
            VDEPDTSSFSAFLGSLLSSDPK+K+ +Q           EEE+ EAE+SDTSSSS++ + 
Sbjct:   63 VDEPDTSSFSAFLGSLLSSDPKDKRKDQDPEDEEDEEEDEEEDSEAETSDTSSSSANPTR 122

Query:  643 TMKGTTTTVSGGAKKSLLSKYKQHFKNFYHAVKFSS-KDRKANFPEVVNKTDDDKEETKG 819
            TMK TT+   G AKKS LSKYKQHF+NFY AVKF   K+RK N   +      D EET+ 
Sbjct:  123 TMKETTS--GGAAKKSFLSKYKQHFRNFYQAVKFPGVKERKGNSDVI-----PDDEETE- 174

Query:  820 YDDGLEMKQLQKN--KEEAAKIGQQAIVIPEISEPSLLLTDQSRRSLYSSLPALVQGRKW 993
            Y DGLEMK +Q N  KEE   + Q   +IPEISEPSLLL++QSRRSLY+SLPALVQGRKW
Sbjct:  175 YYDGLEMKPMQNNNVKEEVTVVVQ--AIIPEISEPSLLLSEQSRRSLYTSLPALVQGRKW 232

Query:  994 ILLYNTWRHGISLSTLYRKSLLWPGLSLLVVGDRKGSVFGGLVEAPLIPTDKKYQ 1158
            ILLY+TWRHGISLSTLYRKSLLWPGLSLLVVGDRKGSVFGGLVEAPLIPTDKKYQ
Sbjct:  233 ILLYSTWRHGISLSTLYRKSLLWPGLSLLVVGDRKGSVFGGLVEAPLIPTDKKYQ 287


 Score =  211 bits (536), Expect = 1e-054
 Identities = 99/107 (92%), Positives = 105/107 (98%)
 Frame = +1

Query: 1285 QGTNSTFVFTDKSGQPTIYRPTGANRFYTLCSKDFLALGGGGRFALYLDSELLSGSSAYS 1464
            QGTNSTFVFT+KSGQPTIYRPTGANRFYTLCSK+FLALGGGGRFALYLDSELLSGSSAYS
Sbjct:  287 QGTNSTFVFTNKSGQPTIYRPTGANRFYTLCSKEFLALGGGGRFALYLDSELLSGSSAYS 346

Query: 1465 ETYGNACLANTQDFDVKEVELWGFVYGSKYDEILALSKTTEPGVCRW 1605
            ETYGN+CLA++QDFDVKEVELWGFVYGSKYDEILA SKT EPG+CRW
Sbjct:  347 ETYGNSCLADSQDFDVKEVELWGFVYGSKYDEILAHSKTMEPGLCRW 393

>TAIR9_protein||AT2G05590.2 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15
        growth stages; CONTAINS InterPro DOMAIN/s: TLDc (InterPro:IPR006571);
        BEST Arabidopsis thaliana protein match is: unknown protein
        (TAIR:AT4G39870.1); Has 697 Blast hits to 689 proteins in 142 species:
        Archae - 0; Bacteria - 0; Metazoa - 403; Fungi - 84; Plants - 66;
        Viruses - 0; Other Eukaryotes - 144 (source: NCBI BLink). |
        chr2:2067196-2068951 FORWARD

          Length = 304

 Score =  119 bits (298), Expect = 6e-027
 Identities = 52/90 (57%), Positives = 69/90 (76%)
 Frame = +1

Query: 1285 QGTNSTFVFTDKSGQPTIYRPTGANRFYTLCSKDFLALGGGGRFALYLDSELLSGSSAYS 1464
            QGT+ TF+FT   G+P I+RPTGANR+Y +C  +FLA GGGG FAL LD +LL  +S  S
Sbjct:  212 QGTSQTFLFTTIYGEPRIFRPTGANRYYLMCMNEFLAFGGGGNFALCLDEDLLKATSGPS 271

Query: 1465 ETYGNACLANTQDFDVKEVELWGFVYGSKY 1554
            ET+GN CLA++ +F++K VELWGF + S+Y
Sbjct:  272 ETFGNECLASSTEFELKNVELWGFAHASQY 301


 Score =  91 bits (223), Expect = 3e-018
 Identities = 52/115 (45%), Positives = 71/115 (61%), Gaps = 1/115 (0%)
 Frame = +1

Query:  817 GYDDGLEMKQLQKNKEEAAKIGQQAIVIPEISEPSLLLTDQSRRSLYSSLPALVQGRKWI 996
            G D   E++   K +E           + E++E S+ +T      L++SLP +V+G KWI
Sbjct:   98 GEDKDCELRVSAKVEESGNDYFDGVKKMRELTESSVFITANLFEFLHASLPNIVRGCKWI 157

Query:  997 LLYNTWRHGISLSTLYRKSLLWPGLSLLVVGDRKGSVFGGLVEAPLIPTDK-KYQ 1158
            LLY+T +HGISL TL R+S   PG  LLV GD++G+VFG L+E PL PT K KYQ
Sbjct:  158 LLYSTLKHGISLRTLLRRSGELPGPCLLVAGDKQGAVFGALLECPLQPTPKRKYQ 212

>TAIR9_protein||AT2G05590.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15
        growth stages; CONTAINS InterPro DOMAIN/s: TLDc (InterPro:IPR006571);
        BEST Arabidopsis thaliana protein match is: unknown protein
        (TAIR:AT4G39870.1); Has 522 Blast hits to 522 proteins in 113 species:
        Archae - 0; Bacteria - 0; Metazoa - 307; Fungi - 48; Plants - 62;
        Viruses - 0; Other Eukaryotes - 105 (source: NCBI BLink). |
        chr2:2067196-2068650 FORWARD

          Length = 264

 Score =  91 bits (223), Expect = 3e-018
 Identities = 52/115 (45%), Positives = 71/115 (61%), Gaps = 1/115 (0%)
 Frame = +1

Query:  817 GYDDGLEMKQLQKNKEEAAKIGQQAIVIPEISEPSLLLTDQSRRSLYSSLPALVQGRKWI 996
            G D   E++   K +E           + E++E S+ +T      L++SLP +V+G KWI
Sbjct:   98 GEDKDCELRVSAKVEESGNDYFDGVKKMRELTESSVFITANLFEFLHASLPNIVRGCKWI 157

Query:  997 LLYNTWRHGISLSTLYRKSLLWPGLSLLVVGDRKGSVFGGLVEAPLIPTDK-KYQ 1158
            LLY+T +HGISL TL R+S   PG  LLV GD++G+VFG L+E PL PT K KYQ
Sbjct:  158 LLYSTLKHGISLRTLLRRSGELPGPCLLVAGDKQGAVFGALLECPLQPTPKRKYQ 212


 Score =  74 bits (180), Expect = 3e-013
 Identities = 32/52 (61%), Positives = 40/52 (76%)
 Frame = +1

Query: 1285 QGTNSTFVFTDKSGQPTIYRPTGANRFYTLCSKDFLALGGGGRFALYLDSEL 1440
            QGT+ TF+FT   G+P I+RPTGANR+Y +C  +FLA GGGG FAL LD +L
Sbjct:  212 QGTSQTFLFTTIYGEPRIFRPTGANRYYLMCMNEFLAFGGGGNFALCLDEDL 263

  Database: TAIR9 protein
    Posted date:  Wed Jul 08 15:16:08 2009
  Number of letters in database: 13,468,323
  Number of sequences in database:  33,410

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 550,567,218
Number of Sequences: 33410
Number of Extensions: 550567218
Number of Successful Extensions: 31396183
Number of sequences better than 0.0: 0
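
Note on the scores: each "Score =" line gives a bit score followed by a raw score in parentheses. The two are related by the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2. The sketch below is not part of the BLAST output. It assumes the gapped Lambda and K printed above are the ones used for every alignment, and the function name bit_score is hypothetical. Under those assumptions, rounding the result reproduces the bit scores reported here (366, 211, 119, 91, 74).

```python
import math

# Values copied from the "Gapped Lambda K H" block of this report.
# Assumption: the same parameters apply to every alignment listed.
LAMBDA = 0.267
K = 0.041


def bit_score(raw_score: int, lam: float = LAMBDA, k: float = K) -> float:
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores are the parenthesised numbers on the "Score =" lines.
for raw in (938, 536, 298, 223, 180):
    print(raw, round(bit_score(raw)))
```

Because Lambda and K are printed to only three decimal places, the unrounded values are approximate; only the rounded bit scores should be compared with the report.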