Library    |     Search    |     Batch query    |     SNP    |     SSR  

TAIR blast output of UN68822


BLASTX 7.6.2

Query= UN68822 /QuerySize=723
        (722 letters)

Database: TAIR9 protein;
          33,410 sequences; 13,468,323 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

TAIR9_protein||AT4G35190.1 | Symbols:  | unknown protein | chr4:...    237   5e-063
TAIR9_protein||AT5G06300.1 | Symbols:  | carboxy-lyase | chr5:19...    208   4e-054
TAIR9_protein||AT2G37210.1 | Symbols:  | Encodes a protein of un...    208   5e-054
TAIR9_protein||AT3G53450.1 | Symbols:  | unknown protein | chr3:...    205   3e-053
TAIR9_protein||AT2G28305.1 | Symbols:  | unknown protein | chr2:...    204   5e-053
TAIR9_protein||AT2G35990.1 | Symbols:  | FUNCTIONS IN: molecular...    193   2e-049
TAIR9_protein||AT5G11950.1 | Symbols:  | protein homodimerizatio...    179   2e-045
TAIR9_protein||AT5G11950.2 | Symbols:  | protein homodimerizatio...    179   2e-045
TAIR9_protein||AT2G35990.2 | Symbols:  | FUNCTIONS IN: molecular...    115   4e-026
TAIR9_protein||AT2G35990.3 | Symbols:  | FUNCTIONS IN: molecular...    115   4e-026
TAIR9_protein||AT5G03270.1 | Symbols:  | unknown protein | chr5:...    102   3e-022
TAIR9_protein||AT5G26140.1 | Symbols:  | lysine decarboxylase fa...     64   1e-010

>TAIR9_protein||AT4G35190.1 | Symbols:  | unknown protein |
        chr4:16746724-16748090 FORWARD

          Length = 229

 Score =  237 bits (604), Expect = 5e-063
 Identities = 117/139 (84%), Positives = 124/139 (89%)
 Frame = +1

Query: 133 MEIVKSRFKRVCVFCGSSSGNKDCYRDAAIDLAQELVERKLNLVYGGGSIGLMGLVSQAV 312
           MEIVKSRFKRVCVFCGSSSG ++CY DAA DLAQELV R+LNLVYGGGSIGLMGLVSQAV
Sbjct:   1 MEIVKSRFKRVCVFCGSSSGKRECYSDAATDLAQELVTRRLNLVYGGGSIGLMGLVSQAV 60

Query: 313 HEAGGHVLGVIPRTLMDKEITGETFGEVRAVADMHQRKAEMASHSDCFIALPGGYGILEE 492
           HEAGGHVLG+IPRTLMDKEITGET+GEV AVADMH+RKAEMA HSDCFIALPGGYG LEE
Sbjct:  61 HEAGGHVLGIIPRTLMDKEITGETYGEVIAVADMHERKAEMARHSDCFIALPGGYGTLEE 120

Query: 493 LLEVITWAHLCLCPPPKSL 549
           LLEVI WA L +   P  L
Sbjct: 121 LLEVIAWAQLGIHDKPVGL 139


 Score =  84 bits (207), Expect = 6e-017
 Identities = 45/70 (64%), Positives = 51/70 (72%), Gaps = 3/70 (4%)
 Frame = +3

Query: 489 GVIGSNNMGTSLSLPPPKELVQKLEAYEPVSDGVIAKSKWEVEKNV---QQQQQAVLCSN 659
           G I  +     +S P  KELVQKLEAY+PV+DGVIAKS+WEVEK V   QQQQQ V CSN
Sbjct: 160 GFIKPSQRHIFVSAPNAKELVQKLEAYKPVNDGVIAKSRWEVEKKVQQPQQQQQVVFCSN 219

Query: 660 TNMHTEIAL* 689
           T+M TEIAL*
Sbjct: 220 TSMQTEIAL* 229

>TAIR9_protein||AT5G06300.1 | Symbols:  | carboxy-lyase | chr5:1922042-1925278
        REVERSE

          Length = 218

 Score =  208 bits (528), Expect = 4e-054
 Identities = 103/139 (74%), Positives = 115/139 (82%)
 Frame = +1

Query: 133 MEIVKSRFKRVCVFCGSSSGNKDCYRDAAIDLAQELVERKLNLVYGGGSIGLMGLVSQAV 312
           ME  KSRFKR+CVFCGSSSG K  Y++AAI L  ELVER+++LVYGGGS+GLMGLVSQAV
Sbjct:   1 MEETKSRFKRICVFCGSSSGKKPSYQEAAIQLGNELVERRIDLVYGGGSVGLMGLVSQAV 60

Query: 313 HEAGGHVLGVIPRTLMDKEITGETFGEVRAVADMHQRKAEMASHSDCFIALPGGYGILEE 492
           H  G HVLGVIP+TLM +EITGET GEV+AVADMHQRKAEMA  +D FIALPGGYG LEE
Sbjct:  61 HHGGRHVLGVIPKTLMPREITGETIGEVKAVADMHQRKAEMARQADAFIALPGGYGTLEE 120

Query: 493 LLEVITWAHLCLCPPPKSL 549
           LLEVITWA L +   P  L
Sbjct: 121 LLEVITWAQLGIHRKPVGL 139

>TAIR9_protein||AT2G37210.1 | Symbols:  | Encodes a protein of unknown function.
        It has been crystallized and shown to be structurally almost identical
        to the protein encoded by At5g11950. | chr2:15624253-15626834 REVERSE

          Length = 216

 Score =  208 bits (527), Expect = 5e-054
 Identities = 99/135 (73%), Positives = 113/135 (83%)
 Frame = +1

Query: 145 KSRFKRVCVFCGSSSGNKDCYRDAAIDLAQELVERKLNLVYGGGSIGLMGLVSQAVHEAG 324
           KS+F+R+CVFCGSS G K  Y+DAA+DL  ELV R ++LVYGGGSIGLMGLVSQAVH+ G
Sbjct:  10 KSKFRRICVFCGSSQGKKSSYQDAAVDLGNELVSRNIDLVYGGGSIGLMGLVSQAVHDGG 69

Query: 325 GHVLGVIPRTLMDKEITGETFGEVRAVADMHQRKAEMASHSDCFIALPGGYGILEELLEV 504
            HV+G+IP+TLM +E+TGET GEVRAVADMHQRKAEMA HSD FIALPGGYG LEELLEV
Sbjct:  70 RHVIGIIPKTLMPRELTGETVGEVRAVADMHQRKAEMAKHSDAFIALPGGYGTLEELLEV 129

Query: 505 ITWAHLCLCPPPKSL 549
           ITWA L +   P  L
Sbjct: 130 ITWAQLGIHDKPVGL 144

>TAIR9_protein||AT3G53450.1 | Symbols:  | unknown protein |
        chr3:19812977-19815430 REVERSE

          Length = 216

 Score =  205 bits (520), Expect = 3e-053
 Identities = 100/135 (74%), Positives = 112/135 (82%)
 Frame = +1

Query: 145 KSRFKRVCVFCGSSSGNKDCYRDAAIDLAQELVERKLNLVYGGGSIGLMGLVSQAVHEAG 324
           KS+F R+CVFCGSS G K  Y+DAA+DL  ELV R ++LVYGGGSIGLMGLVSQAVH+ G
Sbjct:  10 KSKFGRICVFCGSSQGKKSSYQDAAVDLGNELVLRNIDLVYGGGSIGLMGLVSQAVHDGG 69

Query: 325 GHVLGVIPRTLMDKEITGETFGEVRAVADMHQRKAEMASHSDCFIALPGGYGILEELLEV 504
            HV+GVIP+TLM +E+TGET GEVRAVADMHQRKAEMA HSD FIALPGGYG LEELLEV
Sbjct:  70 RHVIGVIPKTLMPRELTGETVGEVRAVADMHQRKAEMARHSDAFIALPGGYGTLEELLEV 129

Query: 505 ITWAHLCLCPPPKSL 549
           ITWA L +   P  L
Sbjct: 130 ITWAQLGIHDKPVGL 144

>TAIR9_protein||AT2G28305.1 | Symbols:  | unknown protein |
        chr2:12081186-12084307 FORWARD

          Length = 214

 Score =  204 bits (518), Expect = 5e-053
 Identities = 99/136 (72%), Positives = 114/136 (83%)
 Frame = +1

Query: 142 VKSRFKRVCVFCGSSSGNKDCYRDAAIDLAQELVERKLNLVYGGGSIGLMGLVSQAVHEA 321
           ++S+FKR+CVFCGSS+GNK  Y+DAAI+L  ELV R ++LVYGGGSIGLMGL+SQAV   
Sbjct:   3 IESKFKRICVFCGSSAGNKVSYKDAAIELGTELVSRNIDLVYGGGSIGLMGLISQAVFNG 62

Query: 322 GGHVLGVIPRTLMDKEITGETFGEVRAVADMHQRKAEMASHSDCFIALPGGYGILEELLE 501
           G HV+GVIP+TLM +EITGET GEV+AVADMHQRKAEMA HSD FIALPGGYG LEELLE
Sbjct:  63 GRHVIGVIPKTLMPREITGETVGEVKAVADMHQRKAEMAKHSDAFIALPGGYGTLEELLE 122

Query: 502 VITWAHLCLCPPPKSL 549
           VITWA L +   P  L
Sbjct: 123 VITWAQLGIHDKPVGL 138

>TAIR9_protein||AT2G35990.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        cellular_component unknown; EXPRESSED IN: 15 plant structures;
        EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s:
        Conserved hypothetical protein CHP00730 (InterPro:IPR005269); BEST
        Arabidopsis thaliana protein match is: carboxy-lyase
        (TAIR:AT5G06300.1); Has 3002 Blast hits to 3001 proteins in 744
        species: Archae - 8; Bacteria - 1757; Metazoa - 10; Fungi - 79; Plants
        - 185; Viruses - 0; Other Eukaryotes - 963 (source: NCBI BLink). |
        chr2:15114070-15116647 FORWARD

          Length = 214

 Score =  193 bits (488), Expect = 2e-049
 Identities = 92/139 (66%), Positives = 110/139 (79%)
 Frame = +1

Query: 133 MEIVKSRFKRVCVFCGSSSGNKDCYRDAAIDLAQELVERKLNLVYGGGSIGLMGLVSQAV 312
           ME  KSRF+R+CVFCGSSSGNK  Y DAA+ LA +LVER ++LVYGGGS+GLMGL+SQAV
Sbjct:   1 MEETKSRFRRICVFCGSSSGNKTTYHDAALQLAHQLVERNIDLVYGGGSVGLMGLISQAV 60

Query: 313 HEAGGHVLGVIPRTLMDKEITGETFGEVRAVADMHQRKAEMASHSDCFIALPGGYGILEE 492
           H+ G HVLG+IP++L  +EITGE+ GEV  V+ MHQRKAEM   +D FIALPGGYG  EE
Sbjct:  61 HDGGRHVLGIIPKSLAPREITGESIGEVITVSTMHQRKAEMGRQADAFIALPGGYGTFEE 120

Query: 493 LLEVITWAHLCLCPPPKSL 549
           LLEVITW+ L +   P  L
Sbjct: 121 LLEVITWSQLGIHTKPVGL 139

>TAIR9_protein||AT5G11950.1 | Symbols:  | protein homodimerization |
        chr5:3855072-3856815 FORWARD

          Length = 217

 Score =  179 bits (452), Expect = 2e-045
 Identities = 81/126 (64%), Positives = 107/126 (84%)
 Frame = +1

Query: 145 KSRFKRVCVFCGSSSGNKDCYRDAAIDLAQELVERKLNLVYGGGSIGLMGLVSQAVHEAG 324
           +SRF+++CVFCGS SG+++ + DAAI+L  ELV+RK++LVYGGGS+GLMGL+S+ V+E G
Sbjct:   6 RSRFRKICVFCGSHSGHREVFSDAAIELGNELVKRKIDLVYGGGSVGLMGLISRRVYEGG 65

Query: 325 GHVLGVIPRTLMDKEITGETFGEVRAVADMHQRKAEMASHSDCFIALPGGYGILEELLEV 504
            HVLG+IP+ LM  EI+GET G+VR VADMH+RKA MA  ++ FIALPGGYG +EELLE+
Sbjct:  66 LHVLGIIPKALMPIEISGETVGDVRVVADMHERKAAMAQEAEAFIALPGGYGTMEELLEM 125

Query: 505 ITWAHL 522
           ITW+ L
Sbjct: 126 ITWSQL 131

>TAIR9_protein||AT5G11950.2 | Symbols:  | protein homodimerization |
        chr5:3855072-3856815 FORWARD

          Length = 217

 Score =  179 bits (452), Expect = 2e-045
 Identities = 81/126 (64%), Positives = 107/126 (84%)
 Frame = +1

Query: 145 KSRFKRVCVFCGSSSGNKDCYRDAAIDLAQELVERKLNLVYGGGSIGLMGLVSQAVHEAG 324
           +SRF+++CVFCGS SG+++ + DAAI+L  ELV+RK++LVYGGGS+GLMGL+S+ V+E G
Sbjct:   6 RSRFRKICVFCGSHSGHREVFSDAAIELGNELVKRKIDLVYGGGSVGLMGLISRRVYEGG 65

Query: 325 GHVLGVIPRTLMDKEITGETFGEVRAVADMHQRKAEMASHSDCFIALPGGYGILEELLEV 504
            HVLG+IP+ LM  EI+GET G+VR VADMH+RKA MA  ++ FIALPGGYG +EELLE+
Sbjct:  66 LHVLGIIPKALMPIEISGETVGDVRVVADMHERKAAMAQEAEAFIALPGGYGTMEELLEM 125

Query: 505 ITWAHL 522
           ITW+ L
Sbjct: 126 ITWSQL 131

>TAIR9_protein||AT2G35990.2 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        cellular_component unknown; EXPRESSED IN: 15 plant structures;
        EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s:
        Conserved hypothetical protein CHP00730 (InterPro:IPR005269); BEST
        Arabidopsis thaliana protein match is: carboxy-lyase
        (TAIR:AT5G06300.1); Has 2269 Blast hits to 2268 proteins in 637
        species: Archae - 6; Bacteria - 1228; Metazoa - 8; Fungi - 85; Plants -
        183; Viruses - 0; Other Eukaryotes - 759 (source: NCBI BLink). |
        chr2:15114733-15116647 FORWARD

          Length = 162

 Score =  115 bits (286), Expect = 4e-026
 Identities = 55/87 (63%), Positives = 66/87 (75%)
 Frame = +1

Query: 289 MGLVSQAVHEAGGHVLGVIPRTLMDKEITGETFGEVRAVADMHQRKAEMASHSDCFIALP 468
           MGL+SQAVH+ G HVLG+IP++L  +EITGE+ GEV  V+ MHQRKAEM   +D FIALP
Sbjct:   1 MGLISQAVHDGGRHVLGIIPKSLAPREITGESIGEVITVSTMHQRKAEMGRQADAFIALP 60

Query: 469 GGYGILEELLEVITWAHLCLCPPPKSL 549
           GGYG  EELLEVITW+ L +   P  L
Sbjct:  61 GGYGTFEELLEVITWSQLGIHTKPVGL 87

>TAIR9_protein||AT2G35990.3 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        cellular_component unknown; EXPRESSED IN: 15 plant structures;
        EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s:
        Conserved hypothetical protein CHP00730 (InterPro:IPR005269); BEST
        Arabidopsis thaliana protein match is: carboxy-lyase
        (TAIR:AT5G06300.1). | chr2:15114733-15116647 FORWARD

          Length = 162

 Score =  115 bits (286), Expect = 4e-026
 Identities = 55/87 (63%), Positives = 66/87 (75%)
 Frame = +1

Query: 289 MGLVSQAVHEAGGHVLGVIPRTLMDKEITGETFGEVRAVADMHQRKAEMASHSDCFIALP 468
           MGL+SQAVH+ G HVLG+IP++L  +EITGE+ GEV  V+ MHQRKAEM   +D FIALP
Sbjct:   1 MGLISQAVHDGGRHVLGIIPKSLAPREITGESIGEVITVSTMHQRKAEMGRQADAFIALP 60

Query: 469 GGYGILEELLEVITWAHLCLCPPPKSL 549
           GGYG  EELLEVITW+ L +   P  L
Sbjct:  61 GGYGTFEELLEVITWSQLGIHTKPVGL 87

>TAIR9_protein||AT5G03270.1 | Symbols:  | unknown protein | chr5:781870-783997
        FORWARD

          Length = 230

 Score =  102 bits (253), Expect = 3e-022
 Identities = 48/61 (78%), Positives = 55/61 (90%)
 Frame = +1

Query: 148 SRFKRVCVFCGSSSGNKDCYRDAAIDLAQELVERKLNLVYGGGSIGLMGLVSQAVHEAGG 327
           SRFK +CVFCGSS+GNK  Y+DAAIDLA+ELV RK++LVYGGGSIGLMGLVSQAVH+ G 
Sbjct:  16 SRFKSICVFCGSSNGNKASYQDAAIDLAKELVMRKIDLVYGGGSIGLMGLVSQAVHDGGR 75

Query: 328 H 330
           H
Sbjct:  76 H 76


 Score =  89 bits (218), Expect = 3e-018
 Identities = 43/63 (68%), Positives = 48/63 (76%)
 Frame = +1

Query: 361 DKEITGETFGEVRAVADMHQRKAEMASHSDCFIALPGGYGILEELLEVITWAHLCLCPPP 540
           + ++TGET GEV+ VADMHQRKA MA HSD FI LPGGYG LEELLEVITWA L +   P
Sbjct:  98 NSKLTGETVGEVKEVADMHQRKAVMAKHSDAFITLPGGYGTLEELLEVITWAQLGIHDKP 157

Query: 541 KSL 549
             L
Sbjct: 158 VGL 160

>TAIR9_protein||AT5G26140.1 | Symbols:  | lysine decarboxylase family protein |
        chr5:9130796-9131636 FORWARD

          Length = 144

 Score =  64 bits (153), Expect = 1e-010
 Identities = 33/54 (61%), Positives = 40/54 (74%), Gaps = 1/54 (1%)
 Frame = +1

Query: 364 KEITGETFGEVRAVADMHQRKAEMASHSDCFIALPG-GYGILEELLEVITWAHL 522
           + I+GET GEVR V+DMH+RKA MA  +  FIAL G  Y  +EELLE+ITWA L
Sbjct:   4 EHISGETVGEVRIVSDMHERKATMAQEAGAFIALLGERYETMEELLEMITWAQL 57

  Database: TAIR9 protein
    Posted date:  Wed Jul 08 15:16:08 2009
  Number of letters in database: 13,468,323
  Number of sequences in database:  33,410

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 30,470,012,363
Number of Sequences: 33410
Number of Extensions: 30470012363
Number of Successful Extensions: 1027624851
Number of sequences better than 0.0: 0