BLASTX 7.6.2
Query= UN68822 /QuerySize=723
(722 letters)
Database: TAIR9 protein;
33,410 sequences; 13,468,323 total letters
Score E
Sequences producing significant alignments: (bits) Value
TAIR9_protein||AT4G35190.1 | Symbols: | unknown protein | chr4:... 237 5e-063
TAIR9_protein||AT5G06300.1 | Symbols: | carboxy-lyase | chr5:19... 208 4e-054
TAIR9_protein||AT2G37210.1 | Symbols: | Encodes a protein of un... 208 5e-054
TAIR9_protein||AT3G53450.1 | Symbols: | unknown protein | chr3:... 205 3e-053
TAIR9_protein||AT2G28305.1 | Symbols: | unknown protein | chr2:... 204 5e-053
TAIR9_protein||AT2G35990.1 | Symbols: | FUNCTIONS IN: molecular... 193 2e-049
TAIR9_protein||AT5G11950.1 | Symbols: | protein homodimerizatio... 179 2e-045
TAIR9_protein||AT5G11950.2 | Symbols: | protein homodimerizatio... 179 2e-045
TAIR9_protein||AT2G35990.2 | Symbols: | FUNCTIONS IN: molecular... 115 4e-026
TAIR9_protein||AT2G35990.3 | Symbols: | FUNCTIONS IN: molecular... 115 4e-026
TAIR9_protein||AT5G03270.1 | Symbols: | unknown protein | chr5:... 102 3e-022
TAIR9_protein||AT5G26140.1 | Symbols: | lysine decarboxylase fa... 64 1e-010
>TAIR9_protein||AT4G35190.1 | Symbols: | unknown protein |
chr4:16746724-16748090 FORWARD
Length = 229
Score = 237 bits (604), Expect = 5e-063
Identities = 117/139 (84%), Positives = 124/139 (89%)
Frame = +1
Query: 133 MEIVKSRFKRVCVFCGSSSGNKDCYRDAAIDLAQELVERKLNLVYGGGSIGLMGLVSQAV 312
MEIVKSRFKRVCVFCGSSSG ++CY DAA DLAQELV R+LNLVYGGGSIGLMGLVSQAV
Sbjct: 1 MEIVKSRFKRVCVFCGSSSGKRECYSDAATDLAQELVTRRLNLVYGGGSIGLMGLVSQAV 60
Query: 313 HEAGGHVLGVIPRTLMDKEITGETFGEVRAVADMHQRKAEMASHSDCFIALPGGYGILEE 492
HEAGGHVLG+IPRTLMDKEITGET+GEV AVADMH+RKAEMA HSDCFIALPGGYG LEE
Sbjct: 61 HEAGGHVLGIIPRTLMDKEITGETYGEVIAVADMHERKAEMARHSDCFIALPGGYGTLEE 120
Query: 493 LLEVITWAHLCLCPPPKSL 549
LLEVI WA L + P L
Sbjct: 121 LLEVIAWAQLGIHDKPVGL 139
Score = 84 bits (207), Expect = 6e-017
Identities = 45/70 (64%), Positives = 51/70 (72%), Gaps = 3/70 (4%)
Frame = +3
Query: 489 GVIGSNNMGTSLSLPPPKELVQKLEAYEPVSDGVIAKSKWEVEKNV---QQQQQAVLCSN 659
G I + +S P KELVQKLEAY+PV+DGVIAKS+WEVEK V QQQQQ V CSN
Sbjct: 160 GFIKPSQRHIFVSAPNAKELVQKLEAYKPVNDGVIAKSRWEVEKKVQQPQQQQQVVFCSN 219
Query: 660 TNMHTEIAL* 689
T+M TEIAL*
Sbjct: 220 TSMQTEIAL* 229
>TAIR9_protein||AT5G06300.1 | Symbols: | carboxy-lyase | chr5:1922042-1925278
REVERSE
Length = 218
Score = 208 bits (528), Expect = 4e-054
Identities = 103/139 (74%), Positives = 115/139 (82%)
Frame = +1
Query: 133 MEIVKSRFKRVCVFCGSSSGNKDCYRDAAIDLAQELVERKLNLVYGGGSIGLMGLVSQAV 312
ME KSRFKR+CVFCGSSSG K Y++AAI L ELVER+++LVYGGGS+GLMGLVSQAV
Sbjct: 1 MEETKSRFKRICVFCGSSSGKKPSYQEAAIQLGNELVERRIDLVYGGGSVGLMGLVSQAV 60
Query: 313 HEAGGHVLGVIPRTLMDKEITGETFGEVRAVADMHQRKAEMASHSDCFIALPGGYGILEE 492
H G HVLGVIP+TLM +EITGET GEV+AVADMHQRKAEMA +D FIALPGGYG LEE
Sbjct: 61 HHGGRHVLGVIPKTLMPREITGETIGEVKAVADMHQRKAEMARQADAFIALPGGYGTLEE 120
Query: 493 LLEVITWAHLCLCPPPKSL 549
LLEVITWA L + P L
Sbjct: 121 LLEVITWAQLGIHRKPVGL 139
>TAIR9_protein||AT2G37210.1 | Symbols: | Encodes a protein of unknown function.
It has been crystallized and shown to be structurally almost identical
to the protein encoded by At5g11950. | chr2:15624253-15626834 REVERSE
Length = 216
Score = 208 bits (527), Expect = 5e-054
Identities = 99/135 (73%), Positives = 113/135 (83%)
Frame = +1
Query: 145 KSRFKRVCVFCGSSSGNKDCYRDAAIDLAQELVERKLNLVYGGGSIGLMGLVSQAVHEAG 324
KS+F+R+CVFCGSS G K Y+DAA+DL ELV R ++LVYGGGSIGLMGLVSQAVH+ G
Sbjct: 10 KSKFRRICVFCGSSQGKKSSYQDAAVDLGNELVSRNIDLVYGGGSIGLMGLVSQAVHDGG 69
Query: 325 GHVLGVIPRTLMDKEITGETFGEVRAVADMHQRKAEMASHSDCFIALPGGYGILEELLEV 504
HV+G+IP+TLM +E+TGET GEVRAVADMHQRKAEMA HSD FIALPGGYG LEELLEV
Sbjct: 70 RHVIGIIPKTLMPRELTGETVGEVRAVADMHQRKAEMAKHSDAFIALPGGYGTLEELLEV 129
Query: 505 ITWAHLCLCPPPKSL 549
ITWA L + P L
Sbjct: 130 ITWAQLGIHDKPVGL 144
>TAIR9_protein||AT3G53450.1 | Symbols: | unknown protein |
chr3:19812977-19815430 REVERSE
Length = 216
Score = 205 bits (520), Expect = 3e-053
Identities = 100/135 (74%), Positives = 112/135 (82%)
Frame = +1
Query: 145 KSRFKRVCVFCGSSSGNKDCYRDAAIDLAQELVERKLNLVYGGGSIGLMGLVSQAVHEAG 324
KS+F R+CVFCGSS G K Y+DAA+DL ELV R ++LVYGGGSIGLMGLVSQAVH+ G
Sbjct: 10 KSKFGRICVFCGSSQGKKSSYQDAAVDLGNELVLRNIDLVYGGGSIGLMGLVSQAVHDGG 69
Query: 325 GHVLGVIPRTLMDKEITGETFGEVRAVADMHQRKAEMASHSDCFIALPGGYGILEELLEV 504
HV+GVIP+TLM +E+TGET GEVRAVADMHQRKAEMA HSD FIALPGGYG LEELLEV
Sbjct: 70 RHVIGVIPKTLMPRELTGETVGEVRAVADMHQRKAEMARHSDAFIALPGGYGTLEELLEV 129
Query: 505 ITWAHLCLCPPPKSL 549
ITWA L + P L
Sbjct: 130 ITWAQLGIHDKPVGL 144
>TAIR9_protein||AT2G28305.1 | Symbols: | unknown protein |
chr2:12081186-12084307 FORWARD
Length = 214
Score = 204 bits (518), Expect = 5e-053
Identities = 99/136 (72%), Positives = 114/136 (83%)
Frame = +1
Query: 142 VKSRFKRVCVFCGSSSGNKDCYRDAAIDLAQELVERKLNLVYGGGSIGLMGLVSQAVHEA 321
++S+FKR+CVFCGSS+GNK Y+DAAI+L ELV R ++LVYGGGSIGLMGL+SQAV
Sbjct: 3 IESKFKRICVFCGSSAGNKVSYKDAAIELGTELVSRNIDLVYGGGSIGLMGLISQAVFNG 62
Query: 322 GGHVLGVIPRTLMDKEITGETFGEVRAVADMHQRKAEMASHSDCFIALPGGYGILEELLE 501
G HV+GVIP+TLM +EITGET GEV+AVADMHQRKAEMA HSD FIALPGGYG LEELLE
Sbjct: 63 GRHVIGVIPKTLMPREITGETVGEVKAVADMHQRKAEMAKHSDAFIALPGGYGTLEELLE 122
Query: 502 VITWAHLCLCPPPKSL 549
VITWA L + P L
Sbjct: 123 VITWAQLGIHDKPVGL 138
>TAIR9_protein||AT2G35990.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 15 plant structures;
EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s:
Conserved hypothetical protein CHP00730 (InterPro:IPR005269); BEST
Arabidopsis thaliana protein match is: carboxy-lyase
(TAIR:AT5G06300.1); Has 3002 Blast hits to 3001 proteins in 744
species: Archae - 8; Bacteria - 1757; Metazoa - 10; Fungi - 79; Plants
- 185; Viruses - 0; Other Eukaryotes - 963 (source: NCBI BLink). |
chr2:15114070-15116647 FORWARD
Length = 214
Score = 193 bits (488), Expect = 2e-049
Identities = 92/139 (66%), Positives = 110/139 (79%)
Frame = +1
Query: 133 MEIVKSRFKRVCVFCGSSSGNKDCYRDAAIDLAQELVERKLNLVYGGGSIGLMGLVSQAV 312
ME KSRF+R+CVFCGSSSGNK Y DAA+ LA +LVER ++LVYGGGS+GLMGL+SQAV
Sbjct: 1 MEETKSRFRRICVFCGSSSGNKTTYHDAALQLAHQLVERNIDLVYGGGSVGLMGLISQAV 60
Query: 313 HEAGGHVLGVIPRTLMDKEITGETFGEVRAVADMHQRKAEMASHSDCFIALPGGYGILEE 492
H+ G HVLG+IP++L +EITGE+ GEV V+ MHQRKAEM +D FIALPGGYG EE
Sbjct: 61 HDGGRHVLGIIPKSLAPREITGESIGEVITVSTMHQRKAEMGRQADAFIALPGGYGTFEE 120
Query: 493 LLEVITWAHLCLCPPPKSL 549
LLEVITW+ L + P L
Sbjct: 121 LLEVITWSQLGIHTKPVGL 139
>TAIR9_protein||AT5G11950.1 | Symbols: | protein homodimerization |
chr5:3855072-3856815 FORWARD
Length = 217
Score = 179 bits (452), Expect = 2e-045
Identities = 81/126 (64%), Positives = 107/126 (84%)
Frame = +1
Query: 145 KSRFKRVCVFCGSSSGNKDCYRDAAIDLAQELVERKLNLVYGGGSIGLMGLVSQAVHEAG 324
+SRF+++CVFCGS SG+++ + DAAI+L ELV+RK++LVYGGGS+GLMGL+S+ V+E G
Sbjct: 6 RSRFRKICVFCGSHSGHREVFSDAAIELGNELVKRKIDLVYGGGSVGLMGLISRRVYEGG 65
Query: 325 GHVLGVIPRTLMDKEITGETFGEVRAVADMHQRKAEMASHSDCFIALPGGYGILEELLEV 504
HVLG+IP+ LM EI+GET G+VR VADMH+RKA MA ++ FIALPGGYG +EELLE+
Sbjct: 66 LHVLGIIPKALMPIEISGETVGDVRVVADMHERKAAMAQEAEAFIALPGGYGTMEELLEM 125
Query: 505 ITWAHL 522
ITW+ L
Sbjct: 126 ITWSQL 131
>TAIR9_protein||AT5G11950.2 | Symbols: | protein homodimerization |
chr5:3855072-3856815 FORWARD
Length = 217
Score = 179 bits (452), Expect = 2e-045
Identities = 81/126 (64%), Positives = 107/126 (84%)
Frame = +1
Query: 145 KSRFKRVCVFCGSSSGNKDCYRDAAIDLAQELVERKLNLVYGGGSIGLMGLVSQAVHEAG 324
+SRF+++CVFCGS SG+++ + DAAI+L ELV+RK++LVYGGGS+GLMGL+S+ V+E G
Sbjct: 6 RSRFRKICVFCGSHSGHREVFSDAAIELGNELVKRKIDLVYGGGSVGLMGLISRRVYEGG 65
Query: 325 GHVLGVIPRTLMDKEITGETFGEVRAVADMHQRKAEMASHSDCFIALPGGYGILEELLEV 504
HVLG+IP+ LM EI+GET G+VR VADMH+RKA MA ++ FIALPGGYG +EELLE+
Sbjct: 66 LHVLGIIPKALMPIEISGETVGDVRVVADMHERKAAMAQEAEAFIALPGGYGTMEELLEM 125
Query: 505 ITWAHL 522
ITW+ L
Sbjct: 126 ITWSQL 131
>TAIR9_protein||AT2G35990.2 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 15 plant structures;
EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s:
Conserved hypothetical protein CHP00730 (InterPro:IPR005269); BEST
Arabidopsis thaliana protein match is: carboxy-lyase
(TAIR:AT5G06300.1); Has 2269 Blast hits to 2268 proteins in 637
species: Archae - 6; Bacteria - 1228; Metazoa - 8; Fungi - 85; Plants -
183; Viruses - 0; Other Eukaryotes - 759 (source: NCBI BLink). |
chr2:15114733-15116647 FORWARD
Length = 162
Score = 115 bits (286), Expect = 4e-026
Identities = 55/87 (63%), Positives = 66/87 (75%)
Frame = +1
Query: 289 MGLVSQAVHEAGGHVLGVIPRTLMDKEITGETFGEVRAVADMHQRKAEMASHSDCFIALP 468
MGL+SQAVH+ G HVLG+IP++L +EITGE+ GEV V+ MHQRKAEM +D FIALP
Sbjct: 1 MGLISQAVHDGGRHVLGIIPKSLAPREITGESIGEVITVSTMHQRKAEMGRQADAFIALP 60
Query: 469 GGYGILEELLEVITWAHLCLCPPPKSL 549
GGYG EELLEVITW+ L + P L
Sbjct: 61 GGYGTFEELLEVITWSQLGIHTKPVGL 87
>TAIR9_protein||AT2G35990.3 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 15 plant structures;
EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s:
Conserved hypothetical protein CHP00730 (InterPro:IPR005269); BEST
Arabidopsis thaliana protein match is: carboxy-lyase
(TAIR:AT5G06300.1). | chr2:15114733-15116647 FORWARD
Length = 162
Score = 115 bits (286), Expect = 4e-026
Identities = 55/87 (63%), Positives = 66/87 (75%)
Frame = +1
Query: 289 MGLVSQAVHEAGGHVLGVIPRTLMDKEITGETFGEVRAVADMHQRKAEMASHSDCFIALP 468
MGL+SQAVH+ G HVLG+IP++L +EITGE+ GEV V+ MHQRKAEM +D FIALP
Sbjct: 1 MGLISQAVHDGGRHVLGIIPKSLAPREITGESIGEVITVSTMHQRKAEMGRQADAFIALP 60
Query: 469 GGYGILEELLEVITWAHLCLCPPPKSL 549
GGYG EELLEVITW+ L + P L
Sbjct: 61 GGYGTFEELLEVITWSQLGIHTKPVGL 87
>TAIR9_protein||AT5G03270.1 | Symbols: | unknown protein | chr5:781870-783997
FORWARD
Length = 230
Score = 102 bits (253), Expect = 3e-022
Identities = 48/61 (78%), Positives = 55/61 (90%)
Frame = +1
Query: 148 SRFKRVCVFCGSSSGNKDCYRDAAIDLAQELVERKLNLVYGGGSIGLMGLVSQAVHEAGG 327
SRFK +CVFCGSS+GNK Y+DAAIDLA+ELV RK++LVYGGGSIGLMGLVSQAVH+ G
Sbjct: 16 SRFKSICVFCGSSNGNKASYQDAAIDLAKELVMRKIDLVYGGGSIGLMGLVSQAVHDGGR 75
Query: 328 H 330
H
Sbjct: 76 H 76
Score = 89 bits (218), Expect = 3e-018
Identities = 43/63 (68%), Positives = 48/63 (76%)
Frame = +1
Query: 361 DKEITGETFGEVRAVADMHQRKAEMASHSDCFIALPGGYGILEELLEVITWAHLCLCPPP 540
+ ++TGET GEV+ VADMHQRKA MA HSD FI LPGGYG LEELLEVITWA L + P
Sbjct: 98 NSKLTGETVGEVKEVADMHQRKAVMAKHSDAFITLPGGYGTLEELLEVITWAQLGIHDKP 157
Query: 541 KSL 549
L
Sbjct: 158 VGL 160
>TAIR9_protein||AT5G26140.1 | Symbols: | lysine decarboxylase family protein |
chr5:9130796-9131636 FORWARD
Length = 144
Score = 64 bits (153), Expect = 1e-010
Identities = 33/54 (61%), Positives = 40/54 (74%), Gaps = 1/54 (1%)
Frame = +1
Query: 364 KEITGETFGEVRAVADMHQRKAEMASHSDCFIALPG-GYGILEELLEVITWAHL 522
+ I+GET GEVR V+DMH+RKA MA + FIAL G Y +EELLE+ITWA L
Sbjct: 4 EHISGETVGEVRIVSDMHERKATMAQEAGAFIALLGERYETMEELLEMITWAQL 57
Database: TAIR9 protein
Posted date: Wed Jul 08 15:16:08 2009
Number of letters in database: 13,468,323
Number of sequences in database: 33,410
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 30,470,012,363
Number of Sequences: 33410
Number of Extensions: 30470012363
Number of Successful Extensions: 1027624851
Number of sequences better than 0.0: 0
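
The bit scores reported above appear to follow from the raw scores in parentheses through the standard Karlin-Altschul conversion, S' = (Lambda * S - ln K) / ln 2, using the gapped Lambda and K values listed in this footer. The sketch below is only a consistency check under that assumption; the function name `bit_score` and the list `reported` are illustrative and not part of the BLAST output.

```python
import math

# Gapped Karlin-Altschul parameters copied from the report footer.
LAMBDA = 0.267
K = 0.041


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# (raw score, reported bits) pairs taken from the "Score =" lines of the report.
reported = [(604, 237), (528, 208), (207, 84), (153, 64)]

for raw, bits in reported:
    # Print the reported bit score next to the recomputed value for comparison.
    print(raw, bits, round(bit_score(raw), 2))
```

Rounding the recomputed value to the nearest integer should reproduce the bit score printed on each "Score =" line; a mismatch would suggest that different Lambda or K values were in effect for that alignment.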