BLASTX 7.6.2
Query= UN18992 /QuerySize=1218
(1217 letters)
Database: TAIR9 protein;
33,410 sequences; 13,468,323 total letters
Score E
Sequences producing significant alignments: (bits) Value
TAIR9_protein||AT5G55920.1 | Symbols: OLI2 | nucleolar protein, ... 557 5e-159
TAIR9_protein||AT4G26600.1 | Symbols: | nucleolar protein, puta... 507 8e-144
TAIR9_protein||AT3G13180.1 | Symbols: | NOL1/NOP2/sun family pr... 106 4e-023
TAIR9_protein||AT5G26180.1 | Symbols: | NOL1/NOP2/sun family pr... 76 4e-014
TAIR9_protein||AT5G26180.2 | Symbols: | NOL1/NOP2/sun family pr... 76 4e-014
TAIR9_protein||AT4G17590.1 | Symbols: | FUNCTIONS IN: molecular... 72 6e-013
TAIR9_protein||AT4G17590.2 | Symbols: | FUNCTIONS IN: molecular... 72 6e-013
TAIR9_protein||AT1G06560.1 | Symbols: | NOL1/NOP2/sun family pr... 58 1e-008
TAIR9_protein||AT3G28770.1 | Symbols: | unknown protein | chr3:... 50 2e-006
TAIR9_protein||AT5G60530.1 | Symbols: | late embryogenesis abun... 49 8e-006
>TAIR9_protein||AT5G55920.1 | Symbols: OLI2 | nucleolar protein, putative |
chr5:22645742-22649383 REVERSE
Length = 683
Score = 557 bits (1435), Expect = 5e-159
Identities = 287/351 (81%), Positives = 321/351 (91%), Gaps = 10/351 (2%)
Frame = +1
Query: 4 PEYLAGYYMLQGASSFLPVMAPAPRENERIVDVAAAPGGKTTYIAALMKNTGLIFANEMK 183
PEYLAGYYMLQGASSFLPVMA APRENERIVDVAAAPGGKTTYIAALMKNTGLI+ANEMK
Sbjct: 334 PEYLAGYYMLQGASSFLPVMALAPRENERIVDVAAAPGGKTTYIAALMKNTGLIYANEMK 393
Query: 184 VPRLKSLTANLHRMGVTNTVVCNYDGRELPKVLGEKSVDRVLLDAPCSGTGVISKDESVK 363
VPRLKSLTANLHRMGVTNT+VCNYDGRELPKVLG+ +VDRVLLDAPCSGTG+ISKDESVK
Sbjct: 394 VPRLKSLTANLHRMGVTNTIVCNYDGRELPKVLGQNTVDRVLLDAPCSGTGIISKDESVK 453
Query: 364 TSKSLEDIKRFAHLQKQLLLAAIDMVDATSKTGGYIVYSTCSLMVAENEAVIDYALKKRN 543
+K++++IK+FAHLQKQLLLAAIDMVDA SKTGGYIVYSTCS+MV ENEAVIDYALKKR+
Sbjct: 454 ITKTMDEIKKFAHLQKQLLLAAIDMVDANSKTGGYIVYSTCSIMVTENEAVIDYALKKRD 513
Query: 544 VQLVKTGLDFGQDGYSKFREHRFHPSLKQTKRFYPHVHNMDGFFVAKLKKMSNMKQTSED 723
V+LV GLDFG+ G+++FREHRF PSL +T+RFYPHVHNMDGFFVAKLKKMSN+KQ+SE+
Sbjct: 514 VKLVTCGLDFGRKGFTRFREHRFQPSLDKTRRFYPHVHNMDGFFVAKLKKMSNVKQSSEE 573
Query: 724 -DDEAVETVEQADVSSDDDDDEAEAMEEMEKVSVPSKQPKETKE--NKERLAKSKE-KKG 891
DD+AVETVEQA+VSS DDDDEAEA+EE EK SVP +QPKE KE NKE+LAKSKE K+G
Sbjct: 574 GDDDAVETVEQAEVSS-DDDDEAEAIEETEKPSVPVRQPKERKEKKNKEKLAKSKEDKRG 632
Query: 892 KKDAKSKSKNVE-----RKPKKKRSDWKKEIAQAREEKRRAMREKSKEKQ* 1029
KKD KSKS+NVE RK KKKR +WK EIAQAREEKR AMREK+KE++*
Sbjct: 633 KKDKKSKSENVEEPSKPRKQKKKRREWKNEIAQAREEKRIAMREKAKEEK* 683
>TAIR9_protein||AT4G26600.1 | Symbols: | nucleolar protein, putative |
chr4:13419629-13423418 FORWARD
Length = 672
Score = 507 bits (1304), Expect = 8e-144
Identities = 267/350 (76%), Positives = 301/350 (86%), Gaps = 12/350 (3%)
Frame = +1
Query: 4 PEYLAGYYMLQGASSFLPVMAPAPRENERIVDVAAAPGGKTTYIAALMKNTGLIFANEMK 183
PEYLAG+YMLQ ASSFLPVMA APRE ER+VD+AAAPGGKTTY+AALMKNTG+I+ANEMK
Sbjct: 317 PEYLAGFYMLQSASSFLPVMALAPREKERVVDMAAAPGGKTTYVAALMKNTGIIYANEMK 376
Query: 184 VPRLKSLTANLHRMGVTNTVVCNYDGRELPKVLGEKSVDRVLLDAPCSGTGVISKDESVK 363
VPRLKSL+ANLHRMGVTNT+VCNYDGREL KVLG+ SVDRVLLDAPCSGTGVISKDESVK
Sbjct: 377 VPRLKSLSANLHRMGVTNTIVCNYDGRELTKVLGQSSVDRVLLDAPCSGTGVISKDESVK 436
Query: 364 TSKSLEDIKRFAHLQKQLLLAAIDMVDATSKTGGYIVYSTCSLMVAENEAVIDYALKKRN 543
TSKS +DIK+FAHLQKQL+L AID+VDA SKTGGYIVYSTCS+M+ ENEAVIDYALK R+
Sbjct: 437 TSKSADDIKKFAHLQKQLILGAIDLVDANSKTGGYIVYSTCSVMIPENEAVIDYALKNRD 496
Query: 544 VQLVKTGLDFGQDGYSKFREHRFHPSLKQTKRFYPHVHNMDGFFVAKLKKMSNMKQTSED 723
V+LV GLDFG+ G+S FREHRFHPSL++T+RFYPHVHNMDGFFVAKLKKMSN Q S +
Sbjct: 497 VKLVPCGLDFGRPGFSSFREHRFHPSLEKTRRFYPHVHNMDGFFVAKLKKMSNAMQPSGN 556
Query: 724 DDEAVETVEQADVSSDDDDDE-AEAMEEMEKVSVPSKQPK---ETKE--NKERLAKSKE- 882
D+ AV T+EQA VSS DDDDE AEA+EE+EK V S QPK TKE NK + +SKE
Sbjct: 557 DEPAV-TMEQAQVSSSDDDDEKAEAIEELEKPPVASGQPKRESNTKEDTNKRKNPRSKEI 615
Query: 883 KKGK--KDAKSKSKNVE--RKPKKKRSDWKKEIAQAREEKRRAMREKSKE 1020
KGK K+ K++S NVE RK KKKRS WK EIAQAREEKR+ MRE +KE
Sbjct: 616 HKGKRNKNTKTESGNVEEPRKQKKKRSQWKNEIAQAREEKRKTMRENAKE 665
>TAIR9_protein||AT3G13180.1 | Symbols: | NOL1/NOP2/sun family protein /
antitermination NusB domain-containing protein | chr3:4236326-4239966
REVERSE
Length = 524
Score = 106 bits (263), Expect = 4e-023
Identities = 68/171 (39%), Positives = 101/171 (59%), Gaps = 10/171 (5%)
Frame = +1
Query: 19 GYYMLQGASSFLPVMAPAPRENERIVDVAAAPGGKTTYIAALMKNTGLIFANEMKVPRLK 198
G +Q S+ L V P+ ERI+D AAPGGKT ++A+ +K G+I+A ++ RL+
Sbjct: 310 GICSVQDESAGLIVSVVKPQPGERIMDACAAPGGKTLFMASCLKGQGMIYAMDVNEGRLR 369
Query: 199 SL--TANLHRM-GVTNTVVCNYDGRELPKVLGEKSVDRVLLDAPCSGTGVISKDESVKTS 369
L TA H++ G+ T+ + D R + E D+VLLDAPCSG GV+SK ++ +
Sbjct: 370 ILGETAKSHQVDGLITTI--HSDLRVFAET-NEVQYDKVLLDAPCSGLGVLSKRADLRWN 426
Query: 370 KSLEDIKRFAHLQKQLLLAAIDMVDATSKTGGYIVYSTCSLMVAENEAVID 522
+ LED+ LQ +LL +A +V K GG +VYSTCS+ ENE ++
Sbjct: 427 RKLEDMLELTKLQDELLDSASKLV----KHGGVLVYSTCSIDPEENEGRVE 473
>TAIR9_protein||AT5G26180.1 | Symbols: | NOL1/NOP2/sun family protein |
chr5:9149253-9152595 FORWARD
Length = 568
Score = 76 bits (185), Expect = 4e-014
Identities = 56/177 (31%), Positives = 83/177 (46%), Gaps = 7/177 (3%)
Frame = +1
Query: 19 GYYMLQGASSFLPVMAPAPRENERIVDVAAAPGGKTTYIAALMKNTGLIFANEMKVPRLK 198
G LQG +S + A P+ ++D +APG KT ++AALM+ G I A E+ R+K
Sbjct: 284 GRIFLQGKASSMVAAALQPQAGWEVLDACSAPGNKTIHLAALMEGQGKIIACELNEERVK 343
Query: 199 SLTANLHRMGVTNTVVCNYDGREL-PKVLGEKSVDRVLLDAPCSGTGVISKDESVKTSKS 375
L + G +N VC+ D L PK + +LLD CSG+G I+ D S
Sbjct: 344 RLEHTIKLSGASNIEVCHGDFLGLNPKDPSFAKIRAILLDPSCSGSGTIT-DRLDHLLPS 402
Query: 376 LEDIKRFAHLQKQLLLAAIDMVDATSKTGGY-----IVYSTCSLMVAENEAVIDYAL 531
+ + +L A+ A + + +VYSTCS+ ENE V+ L
Sbjct: 403 HSEDNNMNYDSMRLHKLAVFQKKALAHALSFPKVERVVYSTCSIYQIENEDVVSSVL 459
>TAIR9_protein||AT5G26180.2 | Symbols: | NOL1/NOP2/sun family protein |
chr5:9149253-9152595 FORWARD
Length = 568
Score = 76 bits (185), Expect = 4e-014
Identities = 56/177 (31%), Positives = 83/177 (46%), Gaps = 7/177 (3%)
Frame = +1
Query: 19 GYYMLQGASSFLPVMAPAPRENERIVDVAAAPGGKTTYIAALMKNTGLIFANEMKVPRLK 198
G LQG +S + A P+ ++D +APG KT ++AALM+ G I A E+ R+K
Sbjct: 284 GRIFLQGKASSMVAAALQPQAGWEVLDACSAPGNKTIHLAALMEGQGKIIACELNEERVK 343
Query: 199 SLTANLHRMGVTNTVVCNYDGREL-PKVLGEKSVDRVLLDAPCSGTGVISKDESVKTSKS 375
L + G +N VC+ D L PK + +LLD CSG+G I+ D S
Sbjct: 344 RLEHTIKLSGASNIEVCHGDFLGLNPKDPSFAKIRAILLDPSCSGSGTIT-DRLDHLLPS 402
Query: 376 LEDIKRFAHLQKQLLLAAIDMVDATSKTGGY-----IVYSTCSLMVAENEAVIDYAL 531
+ + +L A+ A + + +VYSTCS+ ENE V+ L
Sbjct: 403 HSEDNNMNYDSMRLHKLAVFQKKALAHALSFPKVERVVYSTCSIYQIENEDVVSSVL 459
>TAIR9_protein||AT4G17590.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana protein match is:
nucleolar protein, putative (TAIR:AT4G26600.1); Has 62 Blast hits to 62
proteins in 19 species: Archae - 0; Bacteria - 0; Metazoa - 32; Fungi -
2; Plants - 24; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink).
| chr4:9800843-9802591 REVERSE
Length = 202
Score = 72 bits (175), Expect = 6e-013
Identities = 45/87 (51%), Positives = 58/87 (66%), Gaps = 2/87 (2%)
Frame = +1
Query: 157 GLIFANEMKVPRLKSLTANLHRMGVTNTVVCNYD-GRELPKVLGEKSVDRVLLDAPCSGT 333
G+IFAN L SL ANLHRMG+TNTVV NY+ +L +V S D VL++AP + T
Sbjct: 55 GIIFANASTEHLLGSLYANLHRMGITNTVVSNYNINTKLSRVFHINSKDMVLVNAPSTRT 114
Query: 334 GVISKDESVKTSKSLE-DIKRFAHLQK 411
G+IS+ S+K S + E DI+RF LQK
Sbjct: 115 GLISEFGSIKMSINEEADIQRFGVLQK 141
>TAIR9_protein||AT4G17590.2 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana protein match is:
nucleolar protein, putative (TAIR:AT4G26600.1). | chr4:9800843-9802591
REVERSE
Length = 188
Score = 72 bits (175), Expect = 6e-013
Identities = 45/87 (51%), Positives = 58/87 (66%), Gaps = 2/87 (2%)
Frame = +1
Query: 157 GLIFANEMKVPRLKSLTANLHRMGVTNTVVCNYD-GRELPKVLGEKSVDRVLLDAPCSGT 333
G+IFAN L SL ANLHRMG+TNTVV NY+ +L +V S D VL++AP + T
Sbjct: 55 GIIFANASTEHLLGSLYANLHRMGITNTVVSNYNINTKLSRVFHINSKDMVLVNAPSTRT 114
Query: 334 GVISKDESVKTSKSLE-DIKRFAHLQK 411
G+IS+ S+K S + E DI+RF LQK
Sbjct: 115 GLISEFGSIKMSINEEADIQRFGVLQK 141
>TAIR9_protein||AT1G06560.1 | Symbols: | NOL1/NOP2/sun family protein |
chr1:2007660-2011824 FORWARD
Length = 600
Score = 58 bits (138), Expect = 1e-008
Identities = 32/82 (39%), Positives = 50/82 (60%), Gaps = 6/82 (7%)
Frame = +1
Query: 292 SVDRVLLDAPCSGTGVISKDESVKTSKSLEDIKRFAHLQKQLLLAAIDMVDATSKTGGYI 471
S DRVLLDAPCS G+ + +++ ++ Q+++L A+ +V + GG +
Sbjct: 458 SFDRVLLDAPCSALGL--RPRLFAGLETVVSLRNHGWYQRKMLDQAVQLV----RVGGIL 511
Query: 472 VYSTCSLMVAENEAVIDYALKK 537
VYSTC++ +ENEAV+ YAL K
Sbjct: 512 VYSTCTINPSENEAVVRYALDK 533
Score = 55 bits (130), Expect = 1e-007
Identities = 39/101 (38%), Positives = 53/101 (52%), Gaps = 9/101 (8%)
Frame = +1
Query: 13 LAGYYMLQGASSFLPVMAPAPRENERIVDVAAAPGGKTTYIAALMKNTGLIFA---NEMK 183
L G LQ S + A P++ ERI+D+ AAPGGKTT IA LM + G I A + K
Sbjct: 274 LEGEIFLQNLPSIIVAHALDPQKGERILDMCAAPGGKTTAIAILMNDEGEIVAADRSHNK 333
Query: 184 VPRLKSLTANLHRMGVTNTVVCNYDGRE---LPKVLGEKSV 297
V +++L+A MG T C D + LP L E ++
Sbjct: 334 VLVVQNLSA---EMGFTCITTCKLDALKSVCLPTTLNESTI 371
>TAIR9_protein||AT3G28770.1 | Symbols: | unknown protein |
chr3:10796716-10803237 FORWARD
Length = 2082
Score = 50 bits (119), Expect = 2e-006
Identities = 45/176 (25%), Positives = 86/176 (48%), Gaps = 12/176 (6%)
Frame = +1
Query: 517 IDYALKKRNVQLVKTGLDFGQDGYSKFREHRFHPSLKQ------TKRFYPHVHNMDGFFV 678
+D ++K + + VK D ++G + + + S KQ K+ NM
Sbjct: 902 MDIDVQKGSGESVKYKKDEKKEGNKEENKDTINTSSKQKGKDKKKKKKESKNSNMKKKEE 961
Query: 679 AKLKKMSNMKQTSEDDDEAVETVEQADVSSDDDDDEAEAMEEMEKVSVPSKQPKETKENK 858
K + ++N + ED+ + E + + ++ D++ E+ E SK +E KE +
Sbjct: 962 DKKEYVNNELKKQEDNKKETTKSENSKLKEENKDNK----EKKESEDSASKN-REKKEYE 1016
Query: 859 ERLAKSKEKKGKKDAKSKSKNVERKPKKKRSDWKKEIAQAREEKRRAMREKSKEKQ 1026
E+ +K+KE+ K+ KS+ K E K ++R KKE ++R+ K + E++KEK+
Sbjct: 1017 EKKSKTKEEAKKEKKKSQDKKREEKDSEERKS-KKEKEESRDLKAKKKEEETKEKK 1071
>TAIR9_protein||AT5G60530.1 | Symbols: | late embryogenesis abundant
protein-related / LEA protein-related | chr5:24334197-24335685 REVERSE
Length = 440
Score = 49 bits (114), Expect = 8e-006
Identities = 24/83 (28%), Positives = 47/83 (56%)
Frame = +1
Query: 775 DDDEAEAMEEMEKVSVPSKQPKETKENKERLAKSKEKKGKKDAKSKSKNVERKPKKKRSD 954
D+ ++ +K + K K+ KE+ K KE+K KKD + K K + K +K++ D
Sbjct: 49 DNGKSNGNGPKDKEQEKKDKEKAAKDKKEKEKKDKEEKEKKDKERKEKEKKDKLEKEKKD 108
Query: 955 WKKEIAQAREEKRRAMREKSKEK 1023
+++ + +E++R+A +K KE+
Sbjct: 109 KERKEKERKEKERKAKEKKDKEE 131
Database: TAIR9 protein
Posted date: Wed Jul 08 15:16:08 2009
Number of letters in database: 13,468,323
Number of sequences in database: 33,410
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,885,483,150
Number of Sequences: 33410
Number of Extensions: 9885483150
Number of Successful Extensions: 333267254
Number of sequences better than 0.0: 0
|