BLASTX 7.6.2
Query= UN15077 /QuerySize=912
(911 letters)
Database: TAIR9 protein;
33,410 sequences; 13,468,323 total letters
Score E
Sequences producing significant alignments: (bits) Value
TAIR9_protein||AT4G33700.1 | Symbols: | CBS domain-containing p... 325 2e-089
TAIR9_protein||AT2G14520.1 | Symbols: | CBS domain-containing p... 301 4e-082
TAIR9_protein||AT5G52790.1 | Symbols: | CBS domain-containing p... 230 1e-060
TAIR9_protein||AT1G47330.1 | Symbols: | FUNCTIONS IN: molecular... 224 7e-059
TAIR9_protein||AT1G03270.1 | Symbols: | FUNCTIONS IN: molecular... 199 3e-051
TAIR9_protein||AT4G14240.1 | Symbols: | FUNCTIONS IN: molecular... 198 5e-051
TAIR9_protein||AT4G14230.1 | Symbols: | CBS domain-containing p... 187 9e-048
TAIR9_protein||AT4G14240.2 | Symbols: | FUNCTIONS IN: molecular... 182 4e-046
>TAIR9_protein||AT4G33700.1 | Symbols: | CBS domain-containing protein |
chr4:16176547-16179188 REVERSE
Length = 425
Score = 325 bits (833), Expect = 2e-089
Identities = 165/178 (92%), Positives = 169/178 (94%)
Frame = +2
Query: 302 MAAELECCEANFFIHIAVIAFLVLFAGLMSGLTLGLMSLSLVDLEVLAKSGTPQHRQYAE 481
MA E CC NFFIHIAVI FLVLFAGLMSGLTLGLMSLSLVDLEVLAKSGTP+HR+YA
Sbjct: 1 MAVEYVCCSPNFFIHIAVIVFLVLFAGLMSGLTLGLMSLSLVDLEVLAKSGTPEHRKYAA 60
Query: 482 KILPVVKNQHLLLVTLLVCNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQSIC 661
KILPVVKNQHLLLVTLL+CNAAAMETLPIFLD LVTAWGAILISVTLILLFGEIIPQSIC
Sbjct: 61 KILPVVKNQHLLLVTLLICNAAAMETLPIFLDGLVTAWGAILISVTLILLFGEIIPQSIC 120
Query: 662 SRYGLAIGATVASFVRVLVFVCLPVAWPISKLLDFLLGHRRAALFRRAELKTLVDFHG 835
SRYGLAIGATVA FVRVLVF+CLPVAWPISKLLDFLLGHRRAALFRRAELKTLVDFHG
Sbjct: 121 SRYGLAIGATVAPFVRVLVFICLPVAWPISKLLDFLLGHRRAALFRRAELKTLVDFHG 178
>TAIR9_protein||AT2G14520.1 | Symbols: | CBS domain-containing protein |
chr2:6182362-6184648 REVERSE
Length = 424
Score = 301 bits (770), Expect = 4e-082
Identities = 153/178 (85%), Positives = 163/178 (91%)
Frame = +2
Query: 302 MAAELECCEANFFIHIAVIAFLVLFAGLMSGLTLGLMSLSLVDLEVLAKSGTPQHRQYAE 481
MA E ECC +FFIHIAVI LVLFAGLMSGLTLGLMS+SLVDLEVLAKSGTP+ R +A
Sbjct: 1 MAVEYECCGTSFFIHIAVIVLLVLFAGLMSGLTLGLMSMSLVDLEVLAKSGTPRDRIHAA 60
Query: 482 KILPVVKNQHLLLVTLLVCNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQSIC 661
KILPVVKNQHLLL TLL+CNAAAME LPIFLD+LVTAWGAILISVTLILLFGEIIPQS+C
Sbjct: 61 KILPVVKNQHLLLCTLLICNAAAMEALPIFLDALVTAWGAILISVTLILLFGEIIPQSVC 120
Query: 662 SRYGLAIGATVASFVRVLVFVCLPVAWPISKLLDFLLGHRRAALFRRAELKTLVDFHG 835
SR+GLAIGATVA FVRVLV++CLPVAWPISKLLDFLLGH R ALFRRAELKTLVD HG
Sbjct: 121 SRHGLAIGATVAPFVRVLVWICLPVAWPISKLLDFLLGHGRVALFRRAELKTLVDLHG 178
>TAIR9_protein||AT5G52790.1 | Symbols: | CBS domain-containing protein-related
| chr5:21391740-21394327 REVERSE
Length = 501
Score = 230 bits (585), Expect = 1e-060
Identities = 115/179 (64%), Positives = 145/179 (81%)
Frame = +2
Query: 299 LMAAELECCEANFFIHIAVIAFLVLFAGLMSGLTLGLMSLSLVDLEVLAKSGTPQHRQYA 478
+ A ++ CCE F++++ V LV+FAGLMSGLTLGLMSLS+V+LEV+ K+G P R+ A
Sbjct: 1 MAANDVPCCETMFWVYLLVCVALVVFAGLMSGLTLGLMSLSIVELEVMIKAGEPHDRKNA 60
Query: 479 EKILPVVKNQHLLLVTLLVCNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQSI 658
EKILP+VKNQHLLL TLL+ NA AME LPIF+DSL+ AWGAILISVTLIL FGEIIPQ++
Sbjct: 61 EKILPLVKNQHLLLCTLLIGNALAMEALPIFVDSLLPAWGAILISVTLILAFGEIIPQAV 120
Query: 659 CSRYGLAIGATVASFVRVLVFVCLPVAWPISKLLDFLLGHRRAALFRRAELKTLVDFHG 835
CSRYGL+IGA ++ VR+++ V P+++PISKLLD LLG R + L RAELK+LV HG
Sbjct: 121 CSRYGLSIGAKLSFLVRLIIIVFFPLSYPISKLLDLLLGKRHSTLLGRAELKSLVYMHG 179
>TAIR9_protein||AT1G47330.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma
membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function
DUF21 (InterPro:IPR002550), Cystathionine beta-synthase, core
(InterPro:IPR000644); BEST Arabidopsis thaliana protein match is: CBS
domain-containing protein (TAIR:AT2G14520.1); Has 6970 Blast hits to
6717 proteins in 1361 species: Archae - 64; Bacteria - 4382; Metazoa -
390; Fungi - 183; Plants - 125; Viruses - 0; Other Eukaryotes - 1826
(source: NCBI BLink). | chr1:17351149-17353739 FORWARD
Length = 528
Score = 224 bits (570), Expect = 7e-059
Identities = 111/178 (62%), Positives = 140/178 (78%)
Frame = +2
Query: 302 MAAELECCEANFFIHIAVIAFLVLFAGLMSGLTLGLMSLSLVDLEVLAKSGTPQHRQYAE 481
M++++ CC F +++ +I LV FAGLM+GLTLGLMSL LVDLEVL KSG PQ R A
Sbjct: 1 MSSDIPCCGTTFSLYVVIIIALVAFAGLMAGLTLGLMSLGLVDLEVLIKSGRPQDRINAG 60
Query: 482 KILPVVKNQHLLLVTLLVCNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQSIC 661
KI PVVKNQHLLL TLL+ N+ AME LPIFLD +V W AIL+SVTLIL+FGEI+PQ++C
Sbjct: 61 KIFPVVKNQHLLLCTLLIGNSMAMEALPIFLDKIVPPWLAILLSVTLILVFGEIMPQAVC 120
Query: 662 SRYGLAIGATVASFVRVLVFVCLPVAWPISKLLDFLLGHRRAALFRRAELKTLVDFHG 835
+RYGL +GA +A FVRVL+ + P+++PISK+LD++LG L RRAELKT V+FHG
Sbjct: 121 TRYGLKVGAIMAPFVRVLLVLFFPISYPISKVLDWMLGKGHGVLLRRAELKTFVNFHG 178
>TAIR9_protein||AT1G03270.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
endomembrane system; EXPRESSED IN: 21 plant structures; EXPRESSED
DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: Protein of
unknown function DUF21 (InterPro:IPR002550), Cystathionine
beta-synthase, core (InterPro:IPR000644); BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT4G14240.1); Has 6515 Blast
hits to 6357 proteins in 1347 species: Archae - 64; Bacteria - 4189;
Metazoa - 291; Fungi - 188; Plants - 123; Viruses - 0; Other Eukaryotes
- 1660 (source: NCBI BLink). | chr1:799191-802436 FORWARD
Length = 500
Score = 199 bits (504), Expect = 3e-051
Identities = 100/166 (60%), Positives = 127/166 (76%), Gaps = 1/166 (0%)
Frame = +2
Query: 335 FFIHIAVIAFLVLFAGLMSGLTLGLMSLSLVDLEVLAKSGTPQHRQYAEKILPVVKNQHL 514
+F+ + V FLVLFAG+MSGLTLGLMSL LV+LE+L +SG+ ++ A ILPVVK QH
Sbjct: 33 WFVVVGVACFLVLFAGIMSGLTLGLMSLGLVELEILQQSGSSAEKKQAAAILPVVKKQHQ 92
Query: 515 LLVTLLVCNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQSICSRYGLAIGATV 694
LLVTLL+CNAAAME LPI LD + + A+L+SVT +L FGEIIPQ+ICSRYGLA+GA
Sbjct: 93 LLVTLLLCNAAAMEALPICLDKIFHPFVAVLLSVTFVLAFGEIIPQAICSRYGLAVGANF 152
Query: 695 ASFVRVLVFVCLPVAWPISKLLDFLLGHRRAALFRRAELKTLVDFH 832
VR+L+ +C P+A+PI K+LD ++GH LFRRA+LK LV H
Sbjct: 153 LWLVRILMIICYPIAYPIGKVLDAVIGH-NDTLFRRAQLKALVSIH 197
>TAIR9_protein||AT4G14240.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 23 plant structures;
EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein
of unknown function DUF21 (InterPro:IPR002550), Cystathionine
beta-synthase, core (InterPro:IPR000644); BEST Arabidopsis thaliana
protein match is: CBS domain-containing protein-related
(TAIR:AT4G14230.1); Has 6770 Blast hits to 6657 proteins in 1347
species: Archae - 62; Bacteria - 4461; Metazoa - 254; Fungi - 179;
Plants - 121; Viruses - 0; Other Eukaryotes - 1693 (source: NCBI
BLink). | chr4:8204712-8207273 REVERSE
Length = 495
Score = 198 bits (502), Expect = 5e-051
Identities = 98/163 (60%), Positives = 125/163 (76%), Gaps = 1/163 (0%)
Frame = +2
Query: 344 HIAVIAFLVLFAGLMSGLTLGLMSLSLVDLEVLAKSGTPQHRQYAEKILPVVKNQHLLLV 523
+ + FLVLFAG+MSGLTLGLMSL LV+LE+L +SGTP ++ A I PVV+ QH LLV
Sbjct: 38 YAGISCFLVLFAGIMSGLTLGLMSLGLVELEILQRSGTPNEKKQAAAIFPVVQKQHQLLV 97
Query: 524 TLLVCNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQSICSRYGLAIGATVASF 703
TLL+CNA AME LPI+LD L + AI++SVT +L FGE+IPQ+IC+RYGLA+GA
Sbjct: 98 TLLLCNAMAMEGLPIYLDKLFNEYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWL 157
Query: 704 VRVLVFVCLPVAWPISKLLDFLLGHRRAALFRRAELKTLVDFH 832
VR+L+ +C P+A+PI K+LD +LGH ALFRRA+LK LV H
Sbjct: 158 VRILMTLCYPIAFPIGKILDLVLGH-NDALFRRAQLKALVSIH 199
>TAIR9_protein||AT4G14230.1 | Symbols: | CBS domain-containing protein-related
| chr4:8200850-8203130 REVERSE
Length = 496
Score = 187 bits (474), Expect = 9e-048
Identities = 92/164 (56%), Positives = 126/164 (76%), Gaps = 1/164 (0%)
Frame = +2
Query: 344 HIAVIAFLVLFAGLMSGLTLGLMSLSLVDLEVLAKSGTPQHRQYAEKILPVVKNQHLLLV 523
+ + FLVLFAG+MSGLTLGLMSL LV+LE+L +SGTP+ ++ + I PVV+ QH LLV
Sbjct: 37 YAGISCFLVLFAGIMSGLTLGLMSLGLVELEILQRSGTPKEKKQSAAIFPVVQKQHQLLV 96
Query: 524 TLLVCNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQSICSRYGLAIGATVASF 703
TLL+ NA AME LPI+LD + + AI++SVT +L GE+IPQ+IC+RYGLA+GA +
Sbjct: 97 TLLLFNALAMEGLPIYLDKIFNEYVAIILSVTFVLFVGEVIPQAICTRYGLAVGANLVWL 156
Query: 704 VRVLVFVCLPVAWPISKLLDFLLGHRRAALFRRAELKTLVDFHG 835
VR+L+ + P+++PI+K+LD++LGH LFRRA+LK LV HG
Sbjct: 157 VRILMVLSYPISFPIAKMLDWVLGH-NDPLFRRAQLKALVSIHG 199
>TAIR9_protein||AT4G14240.2 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 23 plant structures;
EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein
of unknown function DUF21 (InterPro:IPR002550), Cystathionine
beta-synthase, core (InterPro:IPR000644); BEST Arabidopsis thaliana
protein match is: CBS domain-containing protein-related
(TAIR:AT4G14230.1); Has 6735 Blast hits to 6622 proteins in 1349
species: Archae - 62; Bacteria - 4446; Metazoa - 254; Fungi - 179;
Plants - 121; Viruses - 0; Other Eukaryotes - 1673 (source: NCBI
BLink). | chr4:8204712-8207273 REVERSE
Length = 486
Score = 182 bits (460), Expect = 4e-046
Identities = 94/163 (57%), Positives = 119/163 (73%), Gaps = 10/163 (6%)
Frame = +2
Query: 344 HIAVIAFLVLFAGLMSGLTLGLMSLSLVDLEVLAKSGTPQHRQYAEKILPVVKNQHLLLV 523
+ + FLVLFAG+MSGLTLGLMSL LV+LE+L +S I PVV+ QH LLV
Sbjct: 38 YAGISCFLVLFAGIMSGLTLGLMSLGLVELEILQRSAA---------IFPVVQKQHQLLV 88
Query: 524 TLLVCNAAAMETLPIFLDSLVTAWGAILISVTLILLFGEIIPQSICSRYGLAIGATVASF 703
TLL+CNA AME LPI+LD L + AI++SVT +L FGE+IPQ+IC+RYGLA+GA
Sbjct: 89 TLLLCNAMAMEGLPIYLDKLFNEYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVWL 148
Query: 704 VRVLVFVCLPVAWPISKLLDFLLGHRRAALFRRAELKTLVDFH 832
VR+L+ +C P+A+PI K+LD +LGH ALFRRA+LK LV H
Sbjct: 149 VRILMTLCYPIAFPIGKILDLVLGH-NDALFRRAQLKALVSIH 190
Database: TAIR9 protein
Posted date: Wed Jul 08 15:16:08 2009
Number of letters in database: 13,468,323
Number of sequences in database: 33,410
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,828,865,732
Number of Sequences: 33410
Number of Extensions: 7828865732
Number of Successful Extensions: 264530517
Number of sequences better than 0.0: 0
|