BLASTX 7.6.2
Query= UN21982 /QuerySize=1328
(1327 letters)
Database: TAIR9 protein;
33,410 sequences; 13,468,323 total letters
Score E
Sequences producing significant alignments: (bits) Value
TAIR9_protein||AT1G27000.1 | Symbols: | bZIP family transcripti... 476 2e-134
TAIR9_protein||AT2G02730.1 | Symbols: | FUNCTIONS IN: molecular... 269 3e-072
TAIR9_protein||AT2G02730.2 | Symbols: | FUNCTIONS IN: molecular... 269 3e-072
TAIR9_protein||AT1G04960.1 | Symbols: | FUNCTIONS IN: molecular... 200 3e-051
TAIR9_protein||AT1G04960.2 | Symbols: | FUNCTIONS IN: molecular... 181 1e-045
TAIR9_protein||AT1G24265.1 | Symbols: | unknown protein | chr1:... 95 1e-019
TAIR9_protein||AT1G24265.2 | Symbols: | unknown protein | chr1:... 95 1e-019
TAIR9_protein||AT1G24267.1 | Symbols: | unknown protein | chr1:... 94 1e-019
TAIR9_protein||AT1G24267.2 | Symbols: | unknown protein | chr1:... 90 3e-018
TAIR9_protein||AT1G44674.1 | Symbols: | FUNCTIONS IN: molecular... 75 1e-013
>TAIR9_protein||AT1G27000.1 | Symbols: | bZIP family transcription factor |
chr1:9374068-9376422 FORWARD
Length = 305
Score = 476 bits (1224), Expect = 2e-134
Identities = 249/306 (81%), Positives = 275/306 (89%), Gaps = 3/306 (0%)
Frame = +1
Query: 142 MAMQTGIGLSRIFLLAGAGYTGTIMMKNGKLSDILGELQSLVKGMERSGD--EGDSDVSD 315
MAMQ G+GLSRIFLLAGAGYTGTIMMKNGKLSD+LGELQSLVKGME+SG+ EGDSDVSD
Sbjct: 1 MAMQAGVGLSRIFLLAGAGYTGTIMMKNGKLSDLLGELQSLVKGMEKSGEGSEGDSDVSD 60
Query: 316 AIAAQVRRLAMEVRQLASARQITVMNGVSGANLQALAVPAAALGALGYGYMWWKGLSFTD 495
AIAAQVRRLAME+RQLAS + ITVMNGVSGANLQALAVPAAALGALGYGYMWWKGLSFTD
Sbjct: 61 AIAAQVRRLAMEIRQLASQQHITVMNGVSGANLQALAVPAAALGALGYGYMWWKGLSFTD 120
Query: 496 LMYVTKANMATAVANLTKNLEQVSATLAAAKRHLTQKIQNMDDKVEHQIDLSKEIKNQVT 675
LMYVTKANMA AVANLTKNLEQVS TLAAAKRHLTQ+IQN+DDKVE QIDLSKEI +QV
Sbjct: 121 LMYVTKANMAAAVANLTKNLEQVSETLAAAKRHLTQRIQNLDDKVEKQIDLSKEINSQVI 180
Query: 676 LARGDINSLESELQSLNDLISGLDGKLDTLEYKQDVTNVCMLHLYNYFGGKSTKLPDMEQ 855
AR +I+SLE +L+SL++LI+GLDGKLDTLEYKQDVTNV ML+LYNYFGGKSTKLP+MEQ
Sbjct: 181 SARENISSLEMDLESLHNLITGLDGKLDTLEYKQDVTNVFMLNLYNYFGGKSTKLPEMEQ 240
Query: 856 LQLPVNQKARNLLGDAGTKGLKNFAEQLLISNDTEGGATTVRRIGIAKANDKSRPVLSRV 1035
LQLPVNQ+ARNLL D TKGLKN AE+L SN T+ TTV++I ++K N KSRP+LSR
Sbjct: 241 LQLPVNQRARNLLADVETKGLKNLAEELFKSNGTQ-VTTTVKQISLSKVNVKSRPLLSRA 299
Query: 1036 TSAGC* 1053
SA C*
Sbjct: 300 ASAKC* 305
>TAIR9_protein||AT2G02730.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
endomembrane system; EXPRESSED IN: 19 plant structures; EXPRESSED
DURING: 9 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown
function DUF1664 (InterPro:IPR012458); BEST Arabidopsis thaliana
protein match is: bZIP family transcription factor (TAIR:AT1G27000.1);
Has 86 Blast hits to 86 proteins in 10 species: Archae - 0; Bacteria -
0; Metazoa - 0; Fungi - 0; Plants - 86; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr2:765280-767336 REVERSE
Length = 277
Score = 269 bits (687), Expect = 3e-072
Identities = 134/244 (54%), Positives = 190/244 (77%), Gaps = 4/244 (1%)
Frame = +1
Query: 142 MAMQTGIGLSRIFLLAGAGYTGTIMMKNGKLSDILGELQSLVKGMERSGDEGDSDVSDAI 321
MAMQ+GIGLS+I +LAGAGYT TI++KNGK++DILGELQ+LVK E+SGD D D SDA+
Sbjct: 1 MAMQSGIGLSKILILAGAGYTSTILVKNGKMADILGELQALVKRFEKSGDHVDDD-SDAM 59
Query: 322 AAQVRRLAMEVRQLASARQITVMNGVSGANLQALAVPAAALGALGYGYMWWKGLSFTDLM 501
Q++RLAMEVRQLAS+RQITVMNG GA+ VPAA LGALGYGYMW+KG+SF+D+M
Sbjct: 60 TTQMQRLAMEVRQLASSRQITVMNGAQGADFTPFIVPAATLGALGYGYMWFKGISFSDIM 119
Query: 502 YVTKANMATAVANLTKNLEQVSATLAAAKRHLTQKIQNMDDKVEHQIDLSKEIKNQVTLA 681
VTK NM AV+NLTK+L+ VS + AK+HL+Q++Q +DDK++ Q DL K +++ V LA
Sbjct: 120 CVTKRNMENAVSNLTKHLDTVSEAILNAKKHLSQRLQKVDDKLDLQKDLLKGVQDNVGLA 179
Query: 682 RGDINSLESELQSLNDLISGLDGKLDTLEYKQDVTNVCMLHLYNYFGGKSTKLPDM---E 852
D+ ++ + +++ + G+ GKLD++EYKQ++ N+ +++L + GG++ K+PD+ E
Sbjct: 180 LEDLANIGDDFDAMHSIFGGMGGKLDSIEYKQNIANMGLIYLCDSLGGENHKMPDILMQE 239
Query: 853 QLQL 864
+L+L
Sbjct: 240 KLRL 243
>TAIR9_protein||AT2G02730.2 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
endomembrane system; EXPRESSED IN: 19 plant structures; EXPRESSED
DURING: 9 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown
function DUF1664 (InterPro:IPR012458); BEST Arabidopsis thaliana
protein match is: bZIP family transcription factor (TAIR:AT1G27000.1);
Has 86 Blast hits to 86 proteins in 10 species: Archae - 0; Bacteria -
0; Metazoa - 0; Fungi - 0; Plants - 86; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr2:765280-767336 REVERSE
Length = 277
Score = 269 bits (687), Expect = 3e-072
Identities = 134/244 (54%), Positives = 190/244 (77%), Gaps = 4/244 (1%)
Frame = +1
Query: 142 MAMQTGIGLSRIFLLAGAGYTGTIMMKNGKLSDILGELQSLVKGMERSGDEGDSDVSDAI 321
MAMQ+GIGLS+I +LAGAGYT TI++KNGK++DILGELQ+LVK E+SGD D D SDA+
Sbjct: 1 MAMQSGIGLSKILILAGAGYTSTILVKNGKMADILGELQALVKRFEKSGDHVDDD-SDAM 59
Query: 322 AAQVRRLAMEVRQLASARQITVMNGVSGANLQALAVPAAALGALGYGYMWWKGLSFTDLM 501
Q++RLAMEVRQLAS+RQITVMNG GA+ VPAA LGALGYGYMW+KG+SF+D+M
Sbjct: 60 TTQMQRLAMEVRQLASSRQITVMNGAQGADFTPFIVPAATLGALGYGYMWFKGISFSDIM 119
Query: 502 YVTKANMATAVANLTKNLEQVSATLAAAKRHLTQKIQNMDDKVEHQIDLSKEIKNQVTLA 681
VTK NM AV+NLTK+L+ VS + AK+HL+Q++Q +DDK++ Q DL K +++ V LA
Sbjct: 120 CVTKRNMENAVSNLTKHLDTVSEAILNAKKHLSQRLQKVDDKLDLQKDLLKGVQDNVGLA 179
Query: 682 RGDINSLESELQSLNDLISGLDGKLDTLEYKQDVTNVCMLHLYNYFGGKSTKLPDM---E 852
D+ ++ + +++ + G+ GKLD++EYKQ++ N+ +++L + GG++ K+PD+ E
Sbjct: 180 LEDLANIGDDFDAMHSIFGGMGGKLDSIEYKQNIANMGLIYLCDSLGGENHKMPDILMQE 239
Query: 853 QLQL 864
+L+L
Sbjct: 240 KLRL 243
>TAIR9_protein||AT1G04960.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein of
unknown function DUF1664 (InterPro:IPR012458); BEST Arabidopsis
thaliana protein match is: bZIP family transcription factor
(TAIR:AT1G27000.1); Has 95 Blast hits to 94 proteins in 14 species:
Archae - 0; Bacteria - 4; Metazoa - 0; Fungi - 0; Plants - 89; Viruses
- 0; Other Eukaryotes - 2 (source: NCBI BLink). | chr1:1408021-1410673
REVERSE
Length = 318
Score = 200 bits (506), Expect = 3e-051
Identities = 106/250 (42%), Positives = 166/250 (66%), Gaps = 4/250 (1%)
Frame = +1
Query: 142 MAMQTGIGLSRIFLLAGAGYTGTIMMKNGKLSDILGELQSLVKGMERSGDEGDSDVSDAI 321
MAMQ G+ S++ +L GAG +G+I++++G+LSD++ +LQ L+ G + +
Sbjct: 1 MAMQAGVQTSKVLILLGAGVSGSIVLRHGRLSDLIAQLQDLLNGAQGVESTPFKYDGALL 60
Query: 322 AAQVRRLAMEVRQLASARQITVMNGVSGAN-LQALAVPAAALGALGYGYMWWKGLSFTDL 498
AAQ+R+LA E+++L +T+ NG S ++ + VPAAA+GA+GY YMWWKG SF+D
Sbjct: 61 AAQIRQLANEIKELTMTNPVTIFNGDSNSSGYASYLVPAAAVGAMGYCYMWWKGWSFSDA 120
Query: 499 MYVTKANMATAVANLTKNLEQVSATLAAAKRHLTQKIQNMDDKVEHQIDLSKEIKNQVTL 678
M+VTK NMA AVA+++K L+ +S TLA+ ++HL+QK+ +D KVE Q + SK I + VT
Sbjct: 121 MFVTKKNMADAVASVSKQLDDLSDTLASTRKHLSQKLATLDWKVEEQNETSKMILSDVTE 180
Query: 679 ARGDINSLESELQSLNDLISGLDGKLDTLEYKQDVTNVCMLHLYNYFGGK---STKLPDM 849
R I+ + + + LN++ISG++GK+++LE KQDVT + HL G K STK+
Sbjct: 181 MRSSISQIGFDFKQLNEMISGIEGKIESLESKQDVTLSGLWHLCQVAGVKDSTSTKVFQD 240
Query: 850 EQLQLPVNQK 879
+LP++ K
Sbjct: 241 VGERLPIDGK 250
>TAIR9_protein||AT1G04960.2 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein of
unknown function DUF1664 (InterPro:IPR012458); BEST Arabidopsis
thaliana protein match is: bZIP family transcription factor
(TAIR:AT1G27000.1); Has 95 Blast hits to 94 proteins in 14 species:
Archae - 0; Bacteria - 4; Metazoa - 0; Fungi - 0; Plants - 89; Viruses
- 0; Other Eukaryotes - 2 (source: NCBI BLink). | chr1:1408021-1410424
REVERSE
Length = 335
Score = 181 bits (457), Expect = 1e-045
Identities = 97/232 (41%), Positives = 153/232 (65%), Gaps = 4/232 (1%)
Frame = +1
Query: 196 GYTGTIMMKNGKLSDILGELQSLVKGMERSGDEGDSDVSDAIAAQVRRLAMEVRQLASAR 375
G +G+I++++G+LSD++ +LQ L+ G + +AAQ+R+LA E+++L
Sbjct: 36 GVSGSIVLRHGRLSDLIAQLQDLLNGAQGVESTPFKYDGALLAAQIRQLANEIKELTMTN 95
Query: 376 QITVMNGVSGAN-LQALAVPAAALGALGYGYMWWKGLSFTDLMYVTKANMATAVANLTKN 552
+T+ NG S ++ + VPAAA+GA+GY YMWWKG SF+D M+VTK NMA AVA+++K
Sbjct: 96 PVTIFNGDSNSSGYASYLVPAAAVGAMGYCYMWWKGWSFSDAMFVTKKNMADAVASVSKQ 155
Query: 553 LEQVSATLAAAKRHLTQKIQNMDDKVEHQIDLSKEIKNQVTLARGDINSLESELQSLNDL 732
L+ +S TLA+ ++HL+QK+ +D KVE Q + SK I + VT R I+ + + + LN++
Sbjct: 156 LDDLSDTLASTRKHLSQKLATLDWKVEEQNETSKMILSDVTEMRSSISQIGFDFKQLNEM 215
Query: 733 ISGLDGKLDTLEYKQDVTNVCMLHLYNYFGGK---STKLPDMEQLQLPVNQK 879
ISG++GK+++LE KQDVT + HL G K STK+ +LP++ K
Sbjct: 216 ISGIEGKIESLESKQDVTLSGLWHLCQVAGVKDSTSTKVFQDVGERLPIDGK 267
>TAIR9_protein||AT1G24265.1 | Symbols: | unknown protein | chr1:8600613-8603630
FORWARD
Length = 349
Score = 95 bits (234), Expect = 1e-019
Identities = 58/215 (26%), Positives = 107/215 (49%), Gaps = 12/215 (5%)
Frame = +1
Query: 166 LSRIFLLAGAGYTGTIMMKNGKLSDILGELQSLVKGMER--SGDEGDSDVS----DAIAA 327
L ++ +L GAG G+ K G L D+ + K + R DE S D + A
Sbjct: 5 LGKLTILIGAGLIGSAFSKEGGLPDVSNLVSGAFKMVFRQLKQDEPSKSASKPHDDVLVA 64
Query: 328 QVRRLAMEVRQLASARQITVM--NGVSGANLQALAVPAAALGALGYGYMWWKGLSFTDLM 501
QV L E++ L S R IT++ +G G N + + +G +GYGY+WWKG D M
Sbjct: 65 QVNSLRHEIQLLGSNRPITIVSPSGSGGRNYGLIII----VGVIGYGYVWWKGWKLPDFM 120
Query: 502 YVTKANMATAVANLTKNLEQVSATLAAAKRHLTQKIQNMDDKVEHQIDLSKEIKNQVTLA 681
+ T+ +++ A N+ ++ ++L KR L+ +I M +++ ++ +E +V
Sbjct: 121 FATRRSLSDACNNVGNQIDGFYSSLKGTKRELSSEIDMMGRRLDANTEVIQETIQEVAKL 180
Query: 682 RGDINSLESELQSLNDLISGLDGKLDTLEYKQDVT 786
+ + ++ +++++ D L K+ +E QD+T
Sbjct: 181 QDGTSFIKDDVKAVFDAFENLASKVCRIEGNQDIT 215
>TAIR9_protein||AT1G24265.2 | Symbols: | unknown protein | chr1:8600613-8603630
FORWARD
Length = 349
Score = 95 bits (234), Expect = 1e-019
Identities = 58/215 (26%), Positives = 107/215 (49%), Gaps = 12/215 (5%)
Frame = +1
Query: 166 LSRIFLLAGAGYTGTIMMKNGKLSDILGELQSLVKGMER--SGDEGDSDVS----DAIAA 327
L ++ +L GAG G+ K G L D+ + K + R DE S D + A
Sbjct: 5 LGKLTILIGAGLIGSAFSKEGGLPDVSNLVSGAFKMVFRQLKQDEPSKSASKPHDDVLVA 64
Query: 328 QVRRLAMEVRQLASARQITVM--NGVSGANLQALAVPAAALGALGYGYMWWKGLSFTDLM 501
QV L E++ L S R IT++ +G G N + + +G +GYGY+WWKG D M
Sbjct: 65 QVNSLRHEIQLLGSNRPITIVSPSGSGGRNYGLIII----VGVIGYGYVWWKGWKLPDFM 120
Query: 502 YVTKANMATAVANLTKNLEQVSATLAAAKRHLTQKIQNMDDKVEHQIDLSKEIKNQVTLA 681
+ T+ +++ A N+ ++ ++L KR L+ +I M +++ ++ +E +V
Sbjct: 121 FATRRSLSDACNNVGNQIDGFYSSLKGTKRELSSEIDMMGRRLDANTEVIQETIQEVAKL 180
Query: 682 RGDINSLESELQSLNDLISGLDGKLDTLEYKQDVT 786
+ + ++ +++++ D L K+ +E QD+T
Sbjct: 181 QDGTSFIKDDVKAVFDAFENLASKVCRIEGNQDIT 215
>TAIR9_protein||AT1G24267.1 | Symbols: | unknown protein | chr1:8604451-8607241
REVERSE
Length = 344
Score = 94 bits (233), Expect = 1e-019
Identities = 54/217 (24%), Positives = 106/217 (48%), Gaps = 12/217 (5%)
Frame = +1
Query: 160 IGLSRIFLLAGAGYTGTIMMKNGKLSDILGELQSLVK------GMERSGDEGDSDVSDAI 321
I L ++ +L GAG G+++ K G L D+ + +K E +D +
Sbjct: 3 IPLGKLTILIGAGLVGSVLAKEGSLPDVSSFVSGALKMVFRQLKQEEPAKSASKPRNDTL 62
Query: 322 AAQVRRLAMEVRQLASARQITVMN--GVSGANLQALAVPAAALGALGYGYMWWKGLSFTD 495
AQV L E+ L+S R IT++ G G + + +G +GYGY+WWKG D
Sbjct: 63 MAQVNSLRHELSLLSSNRPITIVTTAGSGGKKYGYIII----IGVIGYGYVWWKGWKLPD 118
Query: 496 LMYVTKANMATAVANLTKNLEQVSATLAAAKRHLTQKIQNMDDKVEHQIDLSKEIKNQVT 675
LM+ T+ +++ A ++ ++ +L+ K+ L+ KI M ++ ++ ++ +V
Sbjct: 119 LMFATRRSLSDACNSVGSQIDGFYTSLSGTKKELSSKIDGMGRSLDANTEIIQDTGREVM 178
Query: 676 LARGDINSLESELQSLNDLISGLDGKLDTLEYKQDVT 786
+ +++ +++ + D + L K+ +E QD+T
Sbjct: 179 ELQRGTENIKDDVKFVFDAVENLASKVYRIEGNQDIT 215
>TAIR9_protein||AT1G24267.2 | Symbols: | unknown protein | chr1:8604451-8607241
REVERSE
Length = 345
Score = 90 bits (221), Expect = 3e-018
Identities = 55/218 (25%), Positives = 106/218 (48%), Gaps = 13/218 (5%)
Frame = +1
Query: 160 IGLSRIFLLAGAGYTGTIMMKNGKLSDILGELQSLVK------GMERSGDEGDSDVSDAI 321
I L ++ +L GAG G+++ K G L D+ + +K E +D +
Sbjct: 3 IPLGKLTILIGAGLVGSVLAKEGSLPDVSSFVSGALKMVFRQLKQEEPAKSASKPRNDTL 62
Query: 322 AAQVRRLAMEVRQLASARQITVMN--GVSGANLQALAVPAAALGALGYGYMWWKGLSFTD 495
AQV L E+ L+S R IT++ G G + + +G +GYGY+WWKG D
Sbjct: 63 MAQVNSLRHELSLLSSNRPITIVTTAGSGGKKYGYIII----IGVIGYGYVWWKGWKLPD 118
Query: 496 LMYVTKANMATAVANLTKNLEQVSATLAAAKRHLTQKIQNMDDKVEHQIDLSKEIKNQVT 675
LM+ T+ +++ A ++ ++ +L+ K+ L+ KI M ++ ++ ++ +V
Sbjct: 119 LMFATRRSLSDACNSVGSQIDGFYTSLSGTKKELSSKIDGMGRSLDANTEIIQDTGREVM 178
Query: 676 LARGDINSLESELQSLNDLISGLDGKL-DTLEYKQDVT 786
+ +++ +++ + D + L KL +E QD+T
Sbjct: 179 ELQRGTENIKDDVKFVFDAVENLVRKLIYRIEGNQDIT 216
>TAIR9_protein||AT1G44674.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; CONTAINS InterPro
DOMAIN/s: Protein of unknown function DUF1664 (InterPro:IPR012458);
BEST Arabidopsis thaliana protein match is: bZIP family transcription
factor (TAIR:AT1G27000.1); Has 33 Blast hits to 33 proteins in 6
species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 33;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr1:16881891-16882133 FORWARD
Length = 52
Score = 75 bits (182), Expect = 1e-013
Identities = 38/51 (74%), Positives = 44/51 (86%)
Frame = +1
Query: 520 MATAVANLTKNLEQVSATLAAAKRHLTQKIQNMDDKVEHQIDLSKEIKNQV 672
MAT VANLTKNLEQVS LAAA RHL QKIQ++DDKV+ QIDL++EI +QV
Sbjct: 1 MATVVANLTKNLEQVSKILAAANRHLMQKIQSVDDKVKKQIDLNEEINSQV 51
Database: TAIR9 protein
Posted date: Wed Jul 08 15:16:08 2009
Number of letters in database: 13,468,323
Number of sequences in database: 33,410
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,391,343,013
Number of Sequences: 33410
Number of Extensions: 11391343013
Number of Successful Extensions: 370310059
Number of sequences better than 0.0: 0