BLASTX 7.6.2
Query= UN03256 /QuerySize=1411
(1410 letters)
Database: TAIR9 protein;
33,410 sequences; 13,468,323 total letters
Score E
Sequences producing significant alignments: (bits) Value
TAIR9_protein||AT2G22720.2 | Symbols: | FUNCTIONS IN: molecular... 355 6e-098
TAIR9_protein||AT2G22720.3 | Symbols: | FUNCTIONS IN: molecular... 355 6e-098
TAIR9_protein||AT2G22720.1 | Symbols: | FUNCTIONS IN: molecular... 310 2e-084
TAIR9_protein||AT4G37860.1 | Symbols: | FUNCTIONS IN: molecular... 70 3e-012
>TAIR9_protein||AT2G22720.2 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 24
plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro
DOMAIN/s: Chromatin SPT2 (InterPro:IPR013256); BEST Arabidopsis
thaliana protein match is: unknown protein (TAIR:AT4G37860.1); Has
34422 Blast hits to 21553 proteins in 958 species: Archae - 9; Bacteria
- 2953; Metazoa - 16492; Fungi - 6405; Plants - 1061; Viruses - 320;
Other Eukaryotes - 7182 (source: NCBI BLink). | chr2:9657344-9660532
FORWARD
Length = 673
Score = 355 bits (909), Expect = 6e-098
Identities = 224/357 (62%), Positives = 247/357 (69%), Gaps = 67/357 (18%)
Frame = -1
Query: 1344 EARSAQSSARPKQSPAINGRVAQGPPREDKRP--ANGHSRPASSGSQMNHSRPTAS---- 1183
EARSAQ S+RPKQS INGR A P RE+KRP ANGHSRP+SSGSQMNHSRP++S
Sbjct: 286 EARSAQLSSRPKQSSGINGRTAHSPHREEKRPVSANGHSRPSSSGSQMNHSRPSSSGSKM 345
Query: 1182 ------SGSSQIQNSRPTMTTSSGSQMQSRSI--SGRPASSGS----SQMQKSRPASSGS 1039
+ SQ+ NSRP SSGSQMQSR++ SGRPASSGS S+ Q SRPAS+GS
Sbjct: 346 NHSRPATSGSQMPNSRP---ASSGSQMQSRAVSGSGRPASSGSQMQNSRPQNSRPASAGS 402
Query: 1038 SQMQHSRPVSSGNQM-----QQRASSSGSQRPGSSTNRQAPMRPP--GSTMNSHSANRNG 880
Q RP SSG+Q QR +SSGSQRPGSSTNRQAPMRPP GSTMN SANRNG
Sbjct: 403 QMQQ--RPASSGSQRPASSGSQRPASSGSQRPGSSTNRQAPMRPPGSGSTMNGQSANRNG 460
Query: 879 Q-----PSSRSAPAKVPVDHRRQMSSSNGVGPGRSGSTATRPLPSKPSLERKPSMSAGKS 715
Q S RSAPAKVPVDHR+QMSSSNGVGPGRS + A RPLPSK SLERKPS+SAGKS
Sbjct: 461 QLNSRSDSRRSAPAKVPVDHRKQMSSSNGVGPGRSATNA-RPLPSKSSLERKPSISAGKS 519
Query: 714 SLQSAQRPPSSSRPMSSDPRQRLGEQRKV--NPTTSRMIPKQPVPTSRHQV*NDFLTSLS 541
SLQS QR PSSSRPMSSDPRQR+ EQRKV + T RMIPKQ PTS+H
Sbjct: 520 SLQSPQR-PSSSRPMSSDPRQRVVEQRKVSRDMATPRMIPKQSAPTSKH----------- 567
Query: 540 LNY*RRH*FVK*TLPQMMTKPAAKRPPQRDIHDDRRPLKKKKKPAIMSEDAKALSMI 370
QMM+KPA KRPP RDI +RR L KKKKPA SED +A M+
Sbjct: 568 ---------------QMMSKPALKRPPSRDIDHERR-LLKKKKPA-RSEDQEAFDML 607
Score = 284 bits (725), Expect = 1e-076
Identities = 179/278 (64%), Positives = 199/278 (71%), Gaps = 34/278 (12%)
Frame = -1
Query: 1338 RSAQSSARPKQSPAINGRVAQGPPREDKRPAN-GHSRPASSGSQMNHSRPTASSGSSQIQ 1162
R+A S R ++ P A G R + HSRP+SSGS+MNHSRP S SQ+
Sbjct: 305 RTAHSPHREEKRPV----SANGHSRPSSSGSQMNHSRPSSSGSKMNHSRPATS--GSQMP 358
Query: 1161 NSRPTMTTSSGSQMQSRSI--SGRPASSGS----SQMQKSRPASSGSSQMQHSRPVSSGN 1000
NSRP SSGSQMQSR++ SGRPASSGS S+ Q SRPAS+GS Q RP SSG+
Sbjct: 359 NSRP---ASSGSQMQSRAVSGSGRPASSGSQMQNSRPQNSRPASAGSQMQQ--RPASSGS 413
Query: 999 QM-----QQRASSSGSQRPGSSTNRQAPMRPP--GSTMNSHSANRNGQ-----PSSRSAP 856
Q QR +SSGSQRPGSSTNRQAPMRPP GSTMN SANRNGQ S RSAP
Sbjct: 414 QRPASSGSQRPASSGSQRPGSSTNRQAPMRPPGSGSTMNGQSANRNGQLNSRSDSRRSAP 473
Query: 855 AKVPVDHRRQMSSSNGVGPGRSGSTATRPLPSKPSLERKPSMSAGKSSLQSAQRPPSSSR 676
AKVPVDHR+QMSSSNGVGPGRS + A RPLPSK SLERKPS+SAGKSSLQS QR PSSSR
Sbjct: 474 AKVPVDHRKQMSSSNGVGPGRSATNA-RPLPSKSSLERKPSISAGKSSLQSPQR-PSSSR 531
Query: 675 PMSSDPRQRLGEQRKV--NPTTSRMIPKQPVPTSRHQV 568
PMSSDPRQR+ EQRKV + T RMIPKQ PTS+HQ+
Sbjct: 532 PMSSDPRQRVVEQRKVSRDMATPRMIPKQSAPTSKHQM 569
Score = 87 bits (213), Expect = 3e-017
Identities = 81/242 (33%), Positives = 113/242 (46%), Gaps = 36/242 (14%)
Frame = -1
Query: 1269 PREDKRPANGHSRPASSGS---------QMNHSRPTASSGSSQIQNSRPTMTTSSGSQMQ 1117
P + R A SRP S RP +++G +SRP +SSGSQM
Sbjct: 283 PNSEARSAQLSSRPKQSSGINGRTAHSPHREEKRPVSANG-----HSRP---SSSGSQMN 334
Query: 1116 SRSISGRPASSGSSQMQKSRPASSGSSQMQHSRPVSSGNQMQQRASSSGSQRPGSSTNRQ 937
RP+SSG S+M SRPA+SG SQM +SRP SSG+QMQ RA SGS RP SS ++
Sbjct: 335 ----HSRPSSSG-SKMNHSRPATSG-SQMPNSRPASSGSQMQSRA-VSGSGRPASSGSQM 387
Query: 936 APMRPPGSTMNSHSANRNGQPSSRSAPAKVPVDHRRQMSSSNGVGPGRSGSTATRPLPSK 757
RP S S + +P+S + +R SS G R GS+ R P +
Sbjct: 388 QNSRPQNSRPASAGSQMQQRPASSGSQRPASSGSQRPASS----GSQRPGSSTNRQAPMR 443
Query: 756 P--SLERKPSMSAGKSSLQSAQRPPSSSRP--MSSDPRQRLGEQRKVNP----TTSRMIP 601
P S SA ++ +++ S P + D R+++ V P T +R +P
Sbjct: 444 PPGSGSTMNGQSANRNGQLNSRSDSRRSAPAKVPVDHRKQMSSSNGVGPGRSATNARPLP 503
Query: 600 KQ 595
+
Sbjct: 504 SK 505
Score = 81 bits (199), Expect = 1e-015
Identities = 40/62 (64%), Positives = 48/62 (77%)
Frame = -3
Query: 370 RQMFNTNRYAGRDDDDRNMEANFDDIMKEERRSARIAREEDEKEAQLIAAEEERERLRKI 191
RQ+ R++ DDDD NMEA F+DI KEERRSARIAREEDE+E +L+ EE RERL+K
Sbjct: 608 RQLLPPKRFSRYDDDDINMEAGFEDIQKEERRSARIAREEDERELKLLEEEERRERLKKN 667
Query: 190 RK 185
RK
Sbjct: 668 RK 669
>TAIR9_protein||AT2G22720.3 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 24
plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro
DOMAIN/s: Chromatin SPT2 (InterPro:IPR013256); BEST Arabidopsis
thaliana protein match is: unknown protein (TAIR:AT4G37860.1); Has
34349 Blast hits to 21327 proteins in 952 species: Archae - 9; Bacteria
- 3053; Metazoa - 16365; Fungi - 6337; Plants - 1046; Viruses - 325;
Other Eukaryotes - 7214 (source: NCBI BLink). | chr2:9658182-9660532
FORWARD
Length = 570
Score = 355 bits (909), Expect = 6e-098
Identities = 224/357 (62%), Positives = 247/357 (69%), Gaps = 67/357 (18%)
Frame = -1
Query: 1344 EARSAQSSARPKQSPAINGRVAQGPPREDKRP--ANGHSRPASSGSQMNHSRPTAS---- 1183
EARSAQ S+RPKQS INGR A P RE+KRP ANGHSRP+SSGSQMNHSRP++S
Sbjct: 183 EARSAQLSSRPKQSSGINGRTAHSPHREEKRPVSANGHSRPSSSGSQMNHSRPSSSGSKM 242
Query: 1182 ------SGSSQIQNSRPTMTTSSGSQMQSRSI--SGRPASSGS----SQMQKSRPASSGS 1039
+ SQ+ NSRP SSGSQMQSR++ SGRPASSGS S+ Q SRPAS+GS
Sbjct: 243 NHSRPATSGSQMPNSRP---ASSGSQMQSRAVSGSGRPASSGSQMQNSRPQNSRPASAGS 299
Query: 1038 SQMQHSRPVSSGNQM-----QQRASSSGSQRPGSSTNRQAPMRPP--GSTMNSHSANRNG 880
Q RP SSG+Q QR +SSGSQRPGSSTNRQAPMRPP GSTMN SANRNG
Sbjct: 300 QMQQ--RPASSGSQRPASSGSQRPASSGSQRPGSSTNRQAPMRPPGSGSTMNGQSANRNG 357
Query: 879 Q-----PSSRSAPAKVPVDHRRQMSSSNGVGPGRSGSTATRPLPSKPSLERKPSMSAGKS 715
Q S RSAPAKVPVDHR+QMSSSNGVGPGRS + A RPLPSK SLERKPS+SAGKS
Sbjct: 358 QLNSRSDSRRSAPAKVPVDHRKQMSSSNGVGPGRSATNA-RPLPSKSSLERKPSISAGKS 416
Query: 714 SLQSAQRPPSSSRPMSSDPRQRLGEQRKV--NPTTSRMIPKQPVPTSRHQV*NDFLTSLS 541
SLQS QR PSSSRPMSSDPRQR+ EQRKV + T RMIPKQ PTS+H
Sbjct: 417 SLQSPQR-PSSSRPMSSDPRQRVVEQRKVSRDMATPRMIPKQSAPTSKH----------- 464
Query: 540 LNY*RRH*FVK*TLPQMMTKPAAKRPPQRDIHDDRRPLKKKKKPAIMSEDAKALSMI 370
QMM+KPA KRPP RDI +RR L KKKKPA SED +A M+
Sbjct: 465 ---------------QMMSKPALKRPPSRDIDHERR-LLKKKKPA-RSEDQEAFDML 504
Score = 284 bits (725), Expect = 1e-076
Identities = 179/278 (64%), Positives = 199/278 (71%), Gaps = 34/278 (12%)
Frame = -1
Query: 1338 RSAQSSARPKQSPAINGRVAQGPPREDKRPAN-GHSRPASSGSQMNHSRPTASSGSSQIQ 1162
R+A S R ++ P A G R + HSRP+SSGS+MNHSRP S SQ+
Sbjct: 202 RTAHSPHREEKRPV----SANGHSRPSSSGSQMNHSRPSSSGSKMNHSRPATS--GSQMP 255
Query: 1161 NSRPTMTTSSGSQMQSRSI--SGRPASSGS----SQMQKSRPASSGSSQMQHSRPVSSGN 1000
NSRP SSGSQMQSR++ SGRPASSGS S+ Q SRPAS+GS Q RP SSG+
Sbjct: 256 NSRP---ASSGSQMQSRAVSGSGRPASSGSQMQNSRPQNSRPASAGSQMQQ--RPASSGS 310
Query: 999 QM-----QQRASSSGSQRPGSSTNRQAPMRPP--GSTMNSHSANRNGQ-----PSSRSAP 856
Q QR +SSGSQRPGSSTNRQAPMRPP GSTMN SANRNGQ S RSAP
Sbjct: 311 QRPASSGSQRPASSGSQRPGSSTNRQAPMRPPGSGSTMNGQSANRNGQLNSRSDSRRSAP 370
Query: 855 AKVPVDHRRQMSSSNGVGPGRSGSTATRPLPSKPSLERKPSMSAGKSSLQSAQRPPSSSR 676
AKVPVDHR+QMSSSNGVGPGRS + A RPLPSK SLERKPS+SAGKSSLQS QR PSSSR
Sbjct: 371 AKVPVDHRKQMSSSNGVGPGRSATNA-RPLPSKSSLERKPSISAGKSSLQSPQR-PSSSR 428
Query: 675 PMSSDPRQRLGEQRKV--NPTTSRMIPKQPVPTSRHQV 568
PMSSDPRQR+ EQRKV + T RMIPKQ PTS+HQ+
Sbjct: 429 PMSSDPRQRVVEQRKVSRDMATPRMIPKQSAPTSKHQM 466
Score = 87 bits (213), Expect = 3e-017
Identities = 81/242 (33%), Positives = 113/242 (46%), Gaps = 36/242 (14%)
Frame = -1
Query: 1269 PREDKRPANGHSRPASSGS---------QMNHSRPTASSGSSQIQNSRPTMTTSSGSQMQ 1117
P + R A SRP S RP +++G +SRP +SSGSQM
Sbjct: 180 PNSEARSAQLSSRPKQSSGINGRTAHSPHREEKRPVSANG-----HSRP---SSSGSQMN 231
Query: 1116 SRSISGRPASSGSSQMQKSRPASSGSSQMQHSRPVSSGNQMQQRASSSGSQRPGSSTNRQ 937
RP+SSG S+M SRPA+SG SQM +SRP SSG+QMQ RA SGS RP SS ++
Sbjct: 232 ----HSRPSSSG-SKMNHSRPATSG-SQMPNSRPASSGSQMQSRA-VSGSGRPASSGSQM 284
Query: 936 APMRPPGSTMNSHSANRNGQPSSRSAPAKVPVDHRRQMSSSNGVGPGRSGSTATRPLPSK 757
RP S S + +P+S + +R SS G R GS+ R P +
Sbjct: 285 QNSRPQNSRPASAGSQMQQRPASSGSQRPASSGSQRPASS----GSQRPGSSTNRQAPMR 340
Query: 756 P--SLERKPSMSAGKSSLQSAQRPPSSSRP--MSSDPRQRLGEQRKVNP----TTSRMIP 601
P S SA ++ +++ S P + D R+++ V P T +R +P
Sbjct: 341 PPGSGSTMNGQSANRNGQLNSRSDSRRSAPAKVPVDHRKQMSSSNGVGPGRSATNARPLP 400
Query: 600 KQ 595
+
Sbjct: 401 SK 402
Score = 81 bits (199), Expect = 1e-015
Identities = 40/62 (64%), Positives = 48/62 (77%)
Frame = -3
Query: 370 RQMFNTNRYAGRDDDDRNMEANFDDIMKEERRSARIAREEDEKEAQLIAAEEERERLRKI 191
RQ+ R++ DDDD NMEA F+DI KEERRSARIAREEDE+E +L+ EE RERL+K
Sbjct: 505 RQLLPPKRFSRYDDDDINMEAGFEDIQKEERRSARIAREEDERELKLLEEEERRERLKKN 564
Query: 190 RK 185
RK
Sbjct: 565 RK 566
>TAIR9_protein||AT2G22720.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
chloroplast; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15
growth stages; CONTAINS InterPro DOMAIN/s: Chromatin SPT2
(InterPro:IPR013256); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT4G37860.1); Has 18810 Blast hits to 9739
proteins in 609 species: Archae - 9; Bacteria - 1424; Metazoa - 8282;
Fungi - 2696; Plants - 486; Viruses - 222; Other Eukaryotes - 5691
(source: NCBI BLink). | chr2:9659235-9660532 FORWARD
Length = 341
Score = 310 bits (793), Expect = 2e-084
Identities = 197/310 (63%), Positives = 216/310 (69%), Gaps = 57/310 (18%)
Frame = -1
Query: 1239 HSRPASSGSQMNHSRPTASSGSSQIQNSRPTMTTSSGSQMQSRSI--SGRPASSGS---- 1078
HSRP+SSGS+MNHSRP S SQ+ NSRP SSGSQMQSR++ SGRPASSGS
Sbjct: 3 HSRPSSSGSKMNHSRPATS--GSQMPNSRP---ASSGSQMQSRAVSGSGRPASSGSQMQN 57
Query: 1077 SQMQKSRPASSGSSQMQHSRPVSSGNQM-----QQRASSSGSQRPGSSTNRQAPMRPP-- 919
S+ Q SRPAS+GS Q RP SSG+Q QR +SSGSQRPGSSTNRQAPMRPP
Sbjct: 58 SRPQNSRPASAGSQMQQ--RPASSGSQRPASSGSQRPASSGSQRPGSSTNRQAPMRPPGS 115
Query: 918 GSTMNSHSANRNGQ-----PSSRSAPAKVPVDHRRQMSSSNGVGPGRSGSTATRPLPSKP 754
GSTMN SANRNGQ S RSAPAKVPVDHR+QMSSSNGVGPGRS + A RPLPSK
Sbjct: 116 GSTMNGQSANRNGQLNSRSDSRRSAPAKVPVDHRKQMSSSNGVGPGRSATNA-RPLPSKS 174
Query: 753 SLERKPSMSAGKSSLQSAQRPPSSSRPMSSDPRQRLGEQRKV--NPTTSRMIPKQPVPTS 580
SLERKPS+SAGKSSLQS QR PSSSRPMSSDPRQR+ EQRKV + T RMIPKQ PTS
Sbjct: 175 SLERKPSISAGKSSLQSPQR-PSSSRPMSSDPRQRVVEQRKVSRDMATPRMIPKQSAPTS 233
Query: 579 RHQV*NDFLTSLSLNY*RRH*FVK*TLPQMMTKPAAKRPPQRDIHDDRRPLKKKKKPAIM 400
+H QMM+KPA KRPP RDI +RR L KKKKPA
Sbjct: 234 KH--------------------------QMMSKPALKRPPSRDIDHERR-LLKKKKPA-R 265
Query: 399 SEDAKALSMI 370
SED +A M+
Sbjct: 266 SEDQEAFDML 275
Score = 261 bits (665), Expect = 1e-069
Identities = 165/237 (69%), Positives = 180/237 (75%), Gaps = 27/237 (11%)
Frame = -1
Query: 1239 HSRPASSGSQMNHSRPTASSGSSQIQN---SRPTMTTSSGSQMQ-SRSISGRPASSGSSQ 1072
HSRPA+SGSQM +SRP ASSG SQ+Q+ S SSGSQMQ SR + RPAS+GS
Sbjct: 15 HSRPATSGSQMPNSRP-ASSG-SQMQSRAVSGSGRPASSGSQMQNSRPQNSRPASAGSQM 72
Query: 1071 MQKSRPASSGSSQMQHSRPVSSGNQMQQRASSSGSQRPGSSTNRQAPMRPP--GSTMNSH 898
Q RPASSGS RP SSG+ QR +SSGSQRPGSSTNRQAPMRPP GSTMN
Sbjct: 73 QQ--RPASSGS-----QRPASSGS---QRPASSGSQRPGSSTNRQAPMRPPGSGSTMNGQ 122
Query: 897 SANRNGQ-----PSSRSAPAKVPVDHRRQMSSSNGVGPGRSGSTATRPLPSKPSLERKPS 733
SANRNGQ S RSAPAKVPVDHR+QMSSSNGVGPGRS + A RPLPSK SLERKPS
Sbjct: 123 SANRNGQLNSRSDSRRSAPAKVPVDHRKQMSSSNGVGPGRSATNA-RPLPSKSSLERKPS 181
Query: 732 MSAGKSSLQSAQRPPSSSRPMSSDPRQRLGEQRKV--NPTTSRMIPKQPVPTSRHQV 568
+SAGKSSLQS QR PSSSRPMSSDPRQR+ EQRKV + T RMIPKQ PTS+HQ+
Sbjct: 182 ISAGKSSLQSPQR-PSSSRPMSSDPRQRVVEQRKVSRDMATPRMIPKQSAPTSKHQM 237
Score = 81 bits (199), Expect = 1e-015
Identities = 40/62 (64%), Positives = 48/62 (77%)
Frame = -3
Query: 370 RQMFNTNRYAGRDDDDRNMEANFDDIMKEERRSARIAREEDEKEAQLIAAEEERERLRKI 191
RQ+ R++ DDDD NMEA F+DI KEERRSARIAREEDE+E +L+ EE RERL+K
Sbjct: 276 RQLLPPKRFSRYDDDDINMEAGFEDIQKEERRSARIAREEDERELKLLEEEERRERLKKN 335
Query: 190 RK 185
RK
Sbjct: 336 RK 337
>TAIR9_protein||AT4G37860.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; CONTAINS InterPro DOMAIN/s: Chromatin SPT2
(InterPro:IPR013256); BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT2G22720.3); Has 543 Blast hits to 488 proteins
in 97 species: Archae - 0; Bacteria - 20; Metazoa - 186; Fungi - 63;
Plants - 52; Viruses - 1; Other Eukaryotes - 221 (source: NCBI BLink).
| chr4:17800589-17801907 REVERSE
Length = 355
Score = 70 bits (170), Expect = 3e-012
Identities = 35/66 (53%), Positives = 52/66 (78%), Gaps = 5/66 (7%)
Frame = -3
Query: 370 RQMFNTNRYAGRDD--DDRNMEANFDDIMKEERRSARIAREEDEKEAQLIAAEEERERLR 197
R+M T+R+AGRD+ DDR MEANFDDIM+EE+RS R+A++ED ++ +L+ EE ER+R
Sbjct: 267 RKMCKTDRFAGRDEDYDDRCMEANFDDIMREEKRSERLAKKEDAEQLRLV---EEEERVR 323
Query: 196 KIRKNR 179
+ +K +
Sbjct: 324 RQKKQK 329
Database: TAIR9 protein
Posted date: Wed Jul 08 15:16:08 2009
Number of letters in database: 13,468,323
Number of sequences in database: 33,410
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,340,731,947
Number of Sequences: 33410
Number of Extensions: 2340731947
Number of Successful Extensions: 84968550
Number of sequences better than 0.0: 0
|