BLASTX 7.6.2
Query= UN03159 /QuerySize=1460
(1459 letters)
Database: TAIR9 protein;
33,410 sequences; 13,468,323 total letters
                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

TAIR9_protein||AT4G37110.1 | Symbols: | protein binding / zinc ...     453  2e-127
TAIR9_protein||AT2G23530.1 | Symbols: | FUNCTIONS IN: molecular...     371  8e-103
TAIR9_protein||AT1G67780.1 | Symbols: | FUNCTIONS IN: molecular...     106  5e-023
TAIR9_protein||AT1G67270.1 | Symbols: | FUNCTIONS IN: molecular...     101  2e-021
TAIR9_protein||AT5G38690.1 | Symbols: | FUNCTIONS IN: molecular...      90  3e-018
>TAIR9_protein||AT4G37110.1 | Symbols: | protein binding / zinc ion binding |
chr4:17484343-17486197 REVERSE
Length = 418
Score = 453 bits (1164), Expect = 2e-127
Identities = 256/431 (59%), Positives = 302/431 (70%), Gaps = 72/431 (16%)
Frame = -3
Query: 1442 MPAMSTRANSTVPVVTNPPP---DPTLNVSIYEKCRQERIKENLQRMQNLGIVDLSLKLK 1272
MP + TRA ++P VTNP P NVSIYEKCR++RIKENLQRM+NLGI+DLSLKLK
Sbjct: 1 MPVVRTRAKCSIP-VTNPNPTVGGGNSNVSIYEKCREDRIKENLQRMKNLGIMDLSLKLK 59
Query: 1271 SGIRPAKRRYTKAADSNPDLRSSARNTVPPPPLQLSVS-------------TRRSSRLKN 1131
S IRPAKRRY +++NP +S P+QLSVS +RRSSRLKN
Sbjct: 60 SEIRPAKRRYGN-SNANPGRETS--------PIQLSVSSRRSSRLKQEPPVSRRSSRLKN 110
Query: 1130 VSPVTYFEEAEKKKGKASK-EVVLWIGEGGERPEIYTDEHEKLLGNTERTWELFVDGYAP 954
+PV+Y EE E KKGK SK E+VLW+GE G RPEIYT+EHEKLLGNTERTWELFVDG
Sbjct: 111 ATPVSYAEEPELKKGKVSKEEIVLWVGE-GVRPEIYTEEHEKLLGNTERTWELFVDGCDK 169
Query: 953 DGKRIYDPVRGKTCHQCRQKTLGYRTQCSKCHPSVTGQFCGDCLYMRYGEHVLETLENPE 774
+GKRIYDPVRGK CHQCRQKTLGY TQCS+C+ SV GQFCGDCLYMRYGEHVLE LENP+
Sbjct: 170 NGKRIYDPVRGKCCHQCRQKTLGYHTQCSQCNHSVRGQFCGDCLYMRYGEHVLEALENPD 229
Query: 773 WVCPGCRGICNCSLCRKRKGWLPTGTAYKKVCKLGYKSVAHYLIQNNKQSETDEDDDETD 594
W+CP CR ICNCS CR +KGWLPTG AY+K+ KLGYKSVAHYLIQ N+QSET EDD+
Sbjct: 230 WICPVCRDICNCSFCRTKKGWLPTGAAYRKIHKLGYKSVAHYLIQTNQQSETSEDDETDA 289
Query: 593 DTPSQASAKRSLSFKEAKVSSEDDDDDLNLLQITDGLAVQETHID---DDNEGTDKNQDS 423
SQA SE D +LL ITDG +Q+ ID DD++G++KN DS
Sbjct: 290 PADSQA--------------SEGD----HLLLITDG--IQDNQIDENLDDDDGSNKNPDS 329
Query: 422 ARKSLSFLSAEDNQTS----------VNEMDPATPLLMIDVKPY---------SGDKENG 300
ARKSL FLS+ +NQTS V E+DPATP++ ID++ S +KEN
Sbjct: 330 ARKSLCFLSSGNNQTSVTDGDLKPLDVKEVDPATPVI-IDLEAQCSETERMANSANKENR 388
Query: 299 KQRA-REMSVE 270
+ R+ R+MS E
Sbjct: 389 ETRSKRKMSAE 399
Score = 57 bits (135), Expect = 3e-008
Identities = 37/78 (47%), Positives = 52/78 (66%), Gaps = 7/78 (8%)
Frame = -3
Query: 548 EAKVSSEDDDDDLNLLQITDGLAVQETHID---DDNEGTDKNQDSARKSLSFLSAEDNQT 378
+A S+ + D +LL ITDG +Q+ ID DD++G++KN DSARKSL FLS+ +NQT
Sbjct: 288 DAPADSQASEGD-HLLLITDG--IQDNQIDENLDDDDGSNKNPDSARKSLCFLSSGNNQT 344
Query: 377 SVNEMDPATPLLMIDVKP 324
SV + D PL + +V P
Sbjct: 345 SVTDGD-LKPLDVKEVDP 361
>TAIR9_protein||AT2G23530.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 10 plant structures;
EXPRESSED DURING: 6 growth stages; CONTAINS InterPro DOMAIN/s: Cell
division cycle-associated protein (InterPro:IPR018866); BEST
Arabidopsis thaliana protein match is: protein binding / zinc ion
binding (TAIR:AT4G37110.1); Has 305 Blast hits to 301 proteins in 74
species: Archae - 0; Bacteria - 2; Metazoa - 117; Fungi - 39; Plants -
120; Viruses - 0; Other Eukaryotes - 27 (source: NCBI BLink). |
chr2:10020652-10022804 REVERSE
Length = 553
Score = 371 bits (951), Expect = 8e-103
Identities = 211/386 (54%), Positives = 259/386 (67%), Gaps = 56/386 (14%)
Frame = -3
Query: 1442 MPAMSTRANSTVPVVTNPPPD---PTLNVSIYEKCRQERIKENLQRMQNLGIVDLSLKLK 1272
M M T A +VP +NP P+ T VS+YE+CR+ERIKENLQRM NLG+++LS KLK
Sbjct: 1 MLTMRTEAQDSVP-KSNPNPELIKETPKVSLYEQCREERIKENLQRMNNLGLLNLSRKLK 59
Query: 1271 SGIRPAKRRYTKAADSNPDLRSSARNTVPPPPLQLSVSTRRSSRLKNVSPVTYFEEAEKK 1092
RP KR Y R+S +N P PPLQ S RRSSRL+N +PV Y + +K
Sbjct: 60 PKTRPVKRSYGN--------RNSVQN--PTPPLQPS---RRSSRLENTTPVIYTDGINEK 106
Query: 1091 KGKASKEVVLWIGEGGERPEIYTDEHEKLLGNTERTWELFVDGYAPDGKRIYDPVRGKTC 912
KASK + IGE G R EIYT+EHEKLLGNTER+W FVDGY +GKRIYDP GKTC
Sbjct: 107 GKKASKRESVVIGE-GIRAEIYTEEHEKLLGNTERSWTCFVDGYDKNGKRIYDPFNGKTC 165
Query: 911 HQCRQKTLGYRTQCSKCHPSVTGQFCGDCLYMRYGEHVLETLENPEWVCPGCRGICNCSL 732
HQCRQKT+G+RTQCS+C+ V GQFCGDCL+MRYGEHVLE LENP+W+CP CRGICNCSL
Sbjct: 166 HQCRQKTMGHRTQCSECN-LVQGQFCGDCLFMRYGEHVLEALENPDWICPACRGICNCSL 224
Query: 731 CRKRKGWLPTGTAYKKVCKLGYKSVAHYLIQNNKQSETDEDDDETDDTPSQASAKRSLSF 552
CR KGW+PTG Y+++ LGYKSVAHYLIQ K++ TD D TPSQASAKRSLSF
Sbjct: 225 CRNNKGWVPTGPIYRRIAALGYKSVAHYLIQ-TKRAPTD------DTTPSQASAKRSLSF 277
Query: 551 KEAKVSSEDDDDDLNLLQITDGLAVQE-THIDDDNEG----------------------- 444
+E K++ D+D+ +L+ D L +E + ++D G
Sbjct: 278 QE-KIAG---DEDVPMLENDDSLQKEEGENTNEDQNGDLPEEVQKVQNMECQSGGSLKKE 333
Query: 443 TDKNQDSARKSLSFL--SAEDNQTSV 372
D+ +SAR+SLSFL S ED+QTS+
Sbjct: 334 EDETPNSARRSLSFLLPSVEDDQTSL 359
>TAIR9_protein||AT1G67780.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: shoot apex, flower, root,
seed; EXPRESSED DURING: petal differentiation and expansion stage, E
expanded cotyledon stage; CONTAINS InterPro DOMAIN/s: DDT superfamily
(InterPro:IPR018501), DDT subgroup (InterPro:IPR018500), Cell division
cycle-associated protein (InterPro:IPR018866); BEST Arabidopsis
thaliana protein match is: unknown protein (TAIR:AT1G67270.1); Has 280
Blast hits to 277 proteins in 72 species: Archae - 0; Bacteria - 0;
Metazoa - 114; Fungi - 47; Plants - 92; Viruses - 0; Other Eukaryotes -
27 (source: NCBI BLink). | chr1:25412816-25415530 FORWARD
Length = 516
Score = 106 bits (263), Expect = 5e-023
Identities = 51/104 (49%), Positives = 60/104 (57%), Gaps = 2/104 (1%)
Frame = -3
Query: 950 GKRIYDPVRGKTCHQCRQKTLGYRTQCS--KCHPSVTGQFCGDCLYMRYGEHVLETLENP 777
G RIYD GKTCHQCRQKT+ + C K T FC CL RYGE+ E +
Sbjct: 23 GGRIYDSSNGKTCHQCRQKTMDFVASCKAMKKDKQCTINFCHKCLINRYGENAEEVAKLD 82
Query: 776 EWVCPGCRGICNCSLCRKRKGWLPTGTAYKKVCKLGYKSVAHYL 645
+W+CP CRGICNCS CRK++G PTG K G SV+ L
Sbjct: 83 DWICPQCRGICNCSFCRKKRGLNPTGILAHKAKASGLASVSMLL 126
>TAIR9_protein||AT1G67270.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; CONTAINS InterPro DOMAIN/s: DDT superfamily
(InterPro:IPR018501), DDT subgroup (InterPro:IPR018500), Cell division
cycle-associated protein (InterPro:IPR018866); BEST Arabidopsis
thaliana protein match is: unknown protein (TAIR:AT1G67780.1); Has 297
Blast hits to 296 proteins in 63 species: Archae - 0; Bacteria - 2;
Metazoa - 128; Fungi - 30; Plants - 98; Viruses - 0; Other Eukaryotes -
39 (source: NCBI BLink). | chr1:25183177-25185793 REVERSE
Length = 542
Score = 101 bits (250), Expect = 2e-021
Identities = 48/100 (48%), Positives = 59/100 (59%), Gaps = 2/100 (2%)
Frame = -3
Query: 950 GKRIYDPVRGKTCHQCRQKTLGYRTQCSKCHPS--VTGQFCGDCLYMRYGEHVLETLENP 777
GKRIYD GK+CHQCRQKTL + C +FC CL +RYGE+ E +
Sbjct: 15 GKRIYDSQNGKSCHQCRQKTLDFAAPCKAMRRKKLCPIKFCYKCLSIRYGENAEEVAKLD 74
Query: 776 EWVCPGCRGICNCSLCRKRKGWLPTGTAYKKVCKLGYKSV 657
+W+CP CRGIC CS+CRK +G PTG + GY SV
Sbjct: 75 DWICPLCRGICICSVCRKAQGLEPTGILAHEAKARGYSSV 114
>TAIR9_protein||AT5G38690.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 21 plant structures;
EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: DDT
superfamily (InterPro:IPR018501), DDT subgroup (InterPro:IPR018500),
Cell division cycle-associated protein (InterPro:IPR018866); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT1G67780.1); Has 492 Blast hits to 468 proteins in 97 species:
Archae - 9; Bacteria - 41; Metazoa - 179; Fungi - 19; Plants - 99;
Viruses - 2; Other Eukaryotes - 143 (source: NCBI BLink). |
chr5:15479424-15482992 REVERSE
Length = 573
Score = 90 bits (222), Expect = 3e-018
Identities = 53/180 (29%), Positives = 77/180 (42%), Gaps = 6/180 (3%)
Frame = -3
Query: 944 RIYDPVRGKTCHQCRQKTLGYRTQC--SKCHPSVTGQFCGDCLYMRYGEHVLETLENPEW 771
RI D GKTCHQCRQK C K + + C C+ RYGE+ E +W
Sbjct: 22 RIEDSTNGKTCHQCRQKRTDLVGSCVTKKKDKTCPIKLCTKCILNRYGENAQEVALKKDW 81
Query: 770 VCPGCRGICNCSLCRKRKGWLPTGTAYKKVCKLGYKSVAHYLIQNNKQ----SETDEDDD 603
+CP CRG CNCS C K++G PTG K G+ SV+ L + ++ + +
Sbjct: 82 ICPKCRGNCNCSYCMKKRGQKPTGILVHTAKKTGFSSVSELLKTSGSDKYFYTKKVKPEG 141
Query: 602 ETDDTPSQASAKRSLSFKEAKVSSEDDDDDLNLLQITDGLAVQETHIDDDNEGTDKNQDS 423
+P + + S+ K + L I +G + + + N K DS
Sbjct: 142 VVVVSPLKLDQENSIEQKHVSIKKSRKTKREELKDINNGCSNENVVVKKSNPKKIKLSDS 201
Database: TAIR9 protein
Posted date: Wed Jul 08 15:16:08 2009
Number of letters in database: 13,468,323
Number of sequences in database: 33,410
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,246,016,146
Number of Sequences: 33410
Number of Extensions: 2246016146
Number of Successful Extensions: 79715282
Number of sequences better than 0.0: 0