GenBank BLAST output of UN16453


BLASTX 7.6.2

Query= UN16453 /QuerySize=1177
        (1176 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|297820414|ref|XP_002878090.1| hypothetical protein ARALYDRAFT...    441   2e-121
gi|30694465|ref|NP_191210.2| RNA recognition motif-containing pr...    435   7e-120
gi|7594529|emb|CAB88054.1| putative protein [Arabidopsis thaliana]     431   1e-118
gi|297734519|emb|CBI15766.3| unnamed protein product [Vitis vini...    349   7e-094
gi|225456507|ref|XP_002284683.1| PREDICTED: hypothetical protein...    347   3e-093
gi|147860851|emb|CAN83161.1| hypothetical protein VITISV_022556 ...    338   1e-090
gi|255570173|ref|XP_002526047.1| Activator of basal transcriptio...    297   2e-078
gi|224055607|ref|XP_002298563.1| predicted protein [Populus tric...    272   1e-070
gi|255647374|gb|ACU24153.1| unknown [Glycine max]                      262   1e-067
gi|255642287|gb|ACU21408.1| unknown [Glycine max]                      256   6e-066
gi|168056141|ref|XP_001780080.1| predicted protein [Physcomitrel...    236   8e-060
gi|307102552|gb|EFN50823.1| hypothetical protein CHLNCDRAFT_1189...    204   4e-050
gi|302828430|ref|XP_002945782.1| hypothetical protein VOLCADRAFT...    200   7e-049
gi|145351056|ref|XP_001419903.1| predicted protein [Ostreococcus...    191   2e-046
gi|159476922|ref|XP_001696560.1| hypothetical protein CHLREDRAFT...    189   9e-046
gi|255083408|ref|XP_002504690.1| predicted protein [Micromonas s...    184   3e-044
gi|291238741|ref|XP_002739285.1| PREDICTED: activator of basal t...    183   8e-044
gi|320168928|gb|EFW45827.1| pre-rRNA-processing protein esf-2 [C...    176   8e-042
gi|301118671|ref|XP_002907063.1| pre-rRNA-processing protein ESF...    174   4e-041
gi|213409696|ref|XP_002175618.1| U3 snoRNP-associated protein Es...    173   5e-041

>gi|297820414|ref|XP_002878090.1| hypothetical protein ARALYDRAFT_486092
        [Arabidopsis lyrata subsp. lyrata]

          Length = 256

 Score =  441 bits (1132), Expect = 2e-121
 Identities = 222/255 (87%), Positives = 239/255 (93%), Gaps = 3/255 (1%)
 Frame = +3

Query: 234 EESNELANGVSKEEEETKETVKKSQKADRKKKKLKEKLLKEATKADNRGVCYLSRIPPHM 413
           EES++L NG+S  EEE+KET KKSQKADRKKKKLKEKLLKEA+KADNRGVCYLSRIPPHM
Sbjct:   4 EESHDLTNGIS--EEESKET-KKSQKADRKKKKLKEKLLKEASKADNRGVCYLSRIPPHM 60

Query: 414 DHVRLRQILSQFGEIDRIYLAPEDPEAQVHRKKAGGFRGQLFSEGWVEFAKKTVAKRVAD 593
           DHVRLR IL+QFGE+ RIYLAPED EAQVHRK+AGGFRGQ FSEGWVEFAKK+VAKRVAD
Sbjct:  61 DHVRLRHILAQFGELGRIYLAPEDSEAQVHRKRAGGFRGQRFSEGWVEFAKKSVAKRVAD 120

Query: 594 MLNGEQIGGKKKSAIYYDIWNIKYLTKFKWDDLTDEIAYKSAIREQKLNMVMSAAKREKD 773
           MLNGEQIGGKKKS++YYDIWNIKYLTKFKWDDLT+EIAYKSAIREQKLNMV+SAAKREKD
Sbjct: 121 MLNGEQIGGKKKSSVYYDIWNIKYLTKFKWDDLTEEIAYKSAIREQKLNMVLSAAKREKD 180

Query: 774 FYLSKVEKSRAMTEIDERMKKKRKIQEESGSNAEPAQVFPPRVVRQFRQKIAIKNEVFQS 953
           FYLSK+EKSRAMTEID RMKKKRKIQEESGSNAE A VFPPRV+R FRQK +I+NE  QS
Sbjct: 181 FYLSKIEKSRAMTEIDARMKKKRKIQEESGSNAEAAPVFPPRVIRHFRQKKSIENETSQS 240

Query: 954 KPGLSTDVLASVFGG 998
           KPGLSTD LASVFGG
Sbjct: 241 KPGLSTDFLASVFGG 255

>gi|30694465|ref|NP_191210.2| RNA recognition motif-containing protein
        [Arabidopsis thaliana]

          Length = 257

 Score =  435 bits (1118), Expect = 7e-120
 Identities = 219/256 (85%), Positives = 240/256 (93%), Gaps = 2/256 (0%)
 Frame = +3

Query: 231 SEESNELANGVSKEEEETKETVKKSQKADRKKKKLKEKLLKEATKADNRGVCYLSRIPPH 410
           SEES+EL +G+S EE+E+KET+ KSQKADRKKKKLKEKLLKEA+KADNRGVCYLSRIPPH
Sbjct:   3 SEESHELTDGIS-EEKESKETM-KSQKADRKKKKLKEKLLKEASKADNRGVCYLSRIPPH 60

Query: 411 MDHVRLRQILSQFGEIDRIYLAPEDPEAQVHRKKAGGFRGQLFSEGWVEFAKKTVAKRVA 590
           MDHVRLR IL+Q+GE+ RIYLAPED EAQVHRK+AGGFRGQ FSEGWVEFAKK+VAKRVA
Sbjct:  61 MDHVRLRHILAQYGELGRIYLAPEDSEAQVHRKRAGGFRGQRFSEGWVEFAKKSVAKRVA 120

Query: 591 DMLNGEQIGGKKKSAIYYDIWNIKYLTKFKWDDLTDEIAYKSAIREQKLNMVMSAAKREK 770
           DMLNGEQIGGKKKS++YYDIWNIKYLTKFKWDDLT+EIAYKSAIREQKLNMV+SAAKREK
Sbjct: 121 DMLNGEQIGGKKKSSVYYDIWNIKYLTKFKWDDLTEEIAYKSAIREQKLNMVLSAAKREK 180

Query: 771 DFYLSKVEKSRAMTEIDERMKKKRKIQEESGSNAEPAQVFPPRVVRQFRQKIAIKNEVFQ 950
           DFYLSK+EKSRAMTEID RM+KKRKIQEESGSNAE A VFPPR + QFRQK +I+NE  Q
Sbjct: 181 DFYLSKIEKSRAMTEIDARMEKKRKIQEESGSNAEAAPVFPPRAIFQFRQKKSIENETSQ 240

Query: 951 SKPGLSTDVLASVFGG 998
           SKPGLSTD LASVFGG
Sbjct: 241 SKPGLSTDFLASVFGG 256

>gi|7594529|emb|CAB88054.1| putative protein [Arabidopsis thaliana]

          Length = 266

 Score =  431 bits (1108), Expect = 1e-118
 Identities = 221/267 (82%), Positives = 242/267 (90%), Gaps = 11/267 (4%)
 Frame = +3

Query: 225 MQSEESNELANGVSKEEEETKETVKKSQKADRKKKKLKEKLLKEATKADNRGVCYLSRIP 404
           MQSEES+EL +G+S EE+E+KET+ KSQKADRKKKKLKEKLLKEA+KADNRGVCYLSRIP
Sbjct:   1 MQSEESHELTDGIS-EEKESKETM-KSQKADRKKKKLKEKLLKEASKADNRGVCYLSRIP 58

Query: 405 PHMDHVRLRQILSQFGEIDRIYLAPE---------DPEAQVHRKKAGGFRGQLFSEGWVE 557
           PHMDHVRLR IL+Q+GE+ RIYLAPE         D EAQVHRK+AGGFRGQ FSEGWVE
Sbjct:  59 PHMDHVRLRHILAQYGELGRIYLAPEADTFVYCYSDSEAQVHRKRAGGFRGQRFSEGWVE 118

Query: 558 FAKKTVAKRVADMLNGEQIGGKKKSAIYYDIWNIKYLTKFKWDDLTDEIAYKSAIREQKL 737
           FAKK+VAKRVADMLNGEQIGGKKKS++YYDIWNIKYLTKFKWDDLT+EIAYKSAIREQKL
Sbjct: 119 FAKKSVAKRVADMLNGEQIGGKKKSSVYYDIWNIKYLTKFKWDDLTEEIAYKSAIREQKL 178

Query: 738 NMVMSAAKREKDFYLSKVEKSRAMTEIDERMKKKRKIQEESGSNAEPAQVFPPRVVRQFR 917
           NMV+SAAKREKDFYLSK+EKSRAMTEID RM+KKRKIQEESGSNAE A VFPPR + QFR
Sbjct: 179 NMVLSAAKREKDFYLSKIEKSRAMTEIDARMEKKRKIQEESGSNAEAAPVFPPRAIFQFR 238

Query: 918 QKIAIKNEVFQSKPGLSTDVLASVFGG 998
           QK +I+NE  QSKPGLSTD LASVFGG
Sbjct: 239 QKKSIENETSQSKPGLSTDFLASVFGG 265

>gi|297734519|emb|CBI15766.3| unnamed protein product [Vitis vinifera]

          Length = 344

 Score =  349 bits (894), Expect = 7e-094
 Identities = 171/257 (66%), Positives = 212/257 (82%), Gaps = 4/257 (1%)
 Frame = +3

Query: 231 SEESNELANGVSKEEEETKETVKKSQKADRKKKKLKEKLLKEATKADNRGVCYLSRIPPH 410
           +EE  E+      EE E++      +K   +K KLK++LLKEA+KAD RGVCYLSRIPPH
Sbjct:  89 AEEEQEIKRTSHSEEGESE---GNDEKGRIRKNKLKKRLLKEASKADKRGVCYLSRIPPH 145

Query: 411 MDHVRLRQILSQFGEIDRIYLAPEDPEAQVHRKKAGGFRGQLFSEGWVEFAKKTVAKRVA 590
           MDHV+LR ILSQ+GEI RIYLAPEDP  QVHRK+AGGFRGQ+FSEGWVEF KKTVAKRVA
Sbjct: 146 MDHVKLRHILSQYGEIQRIYLAPEDPATQVHRKRAGGFRGQVFSEGWVEFTKKTVAKRVA 205

Query: 591 DMLNGEQIGGKKKSAIYYDIWNIKYLTKFKWDDLTDEIAYKSAIREQKLNMVMSAAKREK 770
            MLNGEQIGG+K+S+ YYD+WNIKYL+KFKWDDLT+EIAYK+AIREQKL + +SAAKRE+
Sbjct: 206 KMLNGEQIGGRKRSSFYYDLWNIKYLSKFKWDDLTEEIAYKNAIREQKLALELSAAKRER 265

Query: 771 DFYLSKVEKSRAMTEIDERMKKKRKIQEESGSNAE-PAQVFPPRVVRQFRQKIAIKNEVF 947
           DFYLSKV+KSRA++ I+ER+KKK+K+Q+++G+N E PA    P+V+RQF QK  + ++  
Sbjct: 266 DFYLSKVDKSRALSSIEERLKKKQKVQQDAGTNTEAPANDQGPKVIRQFPQKPPLADKAA 325

Query: 948 QSKPGLSTDVLASVFGG 998
           +SKP LS D+LA VFGG
Sbjct: 326 ESKPRLSKDILAGVFGG 342

>gi|225456507|ref|XP_002284683.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 257

 Score =  347 bits (888), Expect = 3e-093
 Identities = 170/255 (66%), Positives = 210/255 (82%), Gaps = 4/255 (1%)
 Frame = +3

Query: 237 ESNELANGVSKEEEETKETVKKSQKADRKKKKLKEKLLKEATKADNRGVCYLSRIPPHMD 416
           E  E+      EE E++      +K   +K KLK++LLKEA+KAD RGVCYLSRIPPHMD
Sbjct:   4 EEQEIKRTSHSEEGESE---GNDEKGRIRKNKLKKRLLKEASKADKRGVCYLSRIPPHMD 60

Query: 417 HVRLRQILSQFGEIDRIYLAPEDPEAQVHRKKAGGFRGQLFSEGWVEFAKKTVAKRVADM 596
           HV+LR ILSQ+GEI RIYLAPEDP  QVHRK+AGGFRGQ+FSEGWVEF KKTVAKRVA M
Sbjct:  61 HVKLRHILSQYGEIQRIYLAPEDPATQVHRKRAGGFRGQVFSEGWVEFTKKTVAKRVAKM 120

Query: 597 LNGEQIGGKKKSAIYYDIWNIKYLTKFKWDDLTDEIAYKSAIREQKLNMVMSAAKREKDF 776
           LNGEQIGG+K+S+ YYD+WNIKYL+KFKWDDLT+EIAYK+AIREQKL + +SAAKRE+DF
Sbjct: 121 LNGEQIGGRKRSSFYYDLWNIKYLSKFKWDDLTEEIAYKNAIREQKLALELSAAKRERDF 180

Query: 777 YLSKVEKSRAMTEIDERMKKKRKIQEESGSNAE-PAQVFPPRVVRQFRQKIAIKNEVFQS 953
           YLSKV+KSRA++ I+ER+KKK+K+Q+++G+N E PA    P+V+RQF QK  + ++  +S
Sbjct: 181 YLSKVDKSRALSSIEERLKKKQKVQQDAGTNTEAPANDQGPKVIRQFPQKPPLADKAAES 240

Query: 954 KPGLSTDVLASVFGG 998
           KP LS D+LA VFGG
Sbjct: 241 KPRLSKDILAGVFGG 255

>gi|147860851|emb|CAN83161.1| hypothetical protein VITISV_022556 [Vitis
        vinifera]

          Length = 486

 Score =  338 bits (866), Expect = 1e-090
 Identities = 166/250 (66%), Positives = 206/250 (82%), Gaps = 4/250 (1%)
 Frame = +3

Query: 237 ESNELANGVSKEEEETKETVKKSQKADRKKKKLKEKLLKEATKADNRGVCYLSRIPPHMD 416
           E  E+      EE E++      +K   +K KLK++LLKEA+KAD RGVCYLSRIPPHMD
Sbjct:   4 EEQEIKRTSHSEEGESE---GNDEKGRIRKNKLKKRLLKEASKADKRGVCYLSRIPPHMD 60

Query: 417 HVRLRQILSQFGEIDRIYLAPEDPEAQVHRKKAGGFRGQLFSEGWVEFAKKTVAKRVADM 596
           HV+LR ILSQ+GEI RIYLAPEDP  QVHRK+AGGFRGQ+FSEGWVEF KKTVAKRVA M
Sbjct:  61 HVKLRHILSQYGEIQRIYLAPEDPATQVHRKRAGGFRGQVFSEGWVEFTKKTVAKRVAKM 120

Query: 597 LNGEQIGGKKKSAIYYDIWNIKYLTKFKWDDLTDEIAYKSAIREQKLNMVMSAAKREKDF 776
           LNGEQIGG+K+S+ YYD+WNIKYL+KFKWDDLT+EIAYK+AIREQKL + +SAAKRE+DF
Sbjct: 121 LNGEQIGGRKRSSFYYDLWNIKYLSKFKWDDLTEEIAYKNAIREQKLALELSAAKRERDF 180

Query: 777 YLSKVEKSRAMTEIDERMKKKRKIQEESGSNAE-PAQVFPPRVVRQFRQKIAIKNEVFQS 953
           YLSKV+KSRA++ I+ER+KKK+K+Q+++G+N E PA    P+V+RQF QK  + ++  +S
Sbjct: 181 YLSKVDKSRALSSIEERLKKKQKVQQDAGTNTEAPANDQGPKVIRQFPQKPPLADKAAES 240

Query: 954 KPGLSTDVLA 983
           KP LS D+LA
Sbjct: 241 KPRLSKDILA 250

>gi|255570173|ref|XP_002526047.1| Activator of basal transcription, putative
        [Ricinus communis]

          Length = 406

 Score =  297 bits (760), Expect = 2e-078
 Identities = 153/256 (59%), Positives = 194/256 (75%), Gaps = 9/256 (3%)
 Frame = +3

Query: 237 ESNELANGVSKEEEETKETVKKSQKADRKKKKLKEKLLKEATKADNRGVCYLSRIPPHMD 416
           E  + A+G++  EEE +     ++  DR KKK K++LLKEA +AD RGVCYLSRIPPHMD
Sbjct: 153 EEKQEASGINLVEEENQTLT--NEMVDRLKKKKKKRLLKEAAQADRRGVCYLSRIPPHMD 210

Query: 417 HVRLRQILSQFGEIDRIYLAPE----DPEAQVHRKKAGGFRGQLFSEGWVEFAKKTVAKR 584
           HV+LR IL ++GEI RIYLAPE      + +V R+KA G     FSEGWVEF  K++AKR
Sbjct: 211 HVKLRHILCRYGEIQRIYLAPEVNKHRVQYRVQRRKADGLEDLGFSEGWVEFTNKSIAKR 270

Query: 585 VADMLNGEQIGGKKKSAIYYDIWNIKYLTKFKWDDLTDEIAYKSAIREQKLNMVMSAAKR 764
           VA+MLNGEQ+GG+K+S  YYD+WNIKYL+KFKWDDLT+EIAYKSAIREQKL + +SAAKR
Sbjct: 271 VANMLNGEQMGGRKRSQFYYDLWNIKYLSKFKWDDLTEEIAYKSAIREQKLALELSAAKR 330

Query: 765 EKDFYLSKVEKSRAMTEIDERMKKKRKIQEESGSNAEPAQVFPPRVVRQFRQKIAIKNEV 944
           E+DFYLSKVEKSRA++ I+ER+KKK+K+Q E+G       V  P+V+RQF Q   I +  
Sbjct: 331 ERDFYLSKVEKSRALSSIEERLKKKQKVQLETGGE---FSVSIPKVIRQFAQTKPIADRA 387

Query: 945 FQSKPGLSTDVLASVF 992
            +++P LS DVLA VF
Sbjct: 388 EENRPRLSKDVLAGVF 403

>gi|224055607|ref|XP_002298563.1| predicted protein [Populus trichocarpa]

          Length = 282

 Score =  272 bits (694), Expect = 1e-070
 Identities = 153/260 (58%), Positives = 186/260 (71%), Gaps = 16/260 (6%)
 Frame = +3

Query: 231 SEESNELANGVSKEEEETKETVKKSQKADRKKKKLKEKLLKEATKADNRGVCYLSRIPPH 410
           +EE  E A  + +  EE KE  +     + KKKKLK    KEA KA  RGVCY+SR+PP 
Sbjct:  11 TEEKGERAR-LERMSEEHKEQEQVLFLEEGKKKKLK----KEAEKAQKRGVCYISRVPPG 65

Query: 411 MDHVRLRQILSQFGEIDRIYLAPEDPEA-------QVHRKKAGGFRGQLFSEGWVEFAKK 569
           MDHV+LRQ+LSQ+GEI RIYLAP++  +          RK+ GG + Q +SEGWVEFA K
Sbjct:  66 MDHVKLRQLLSQYGEIQRIYLAPQNSSSIDKVNDNNKSRKRGGGAKAQAYSEGWVEFASK 125

Query: 570 TVAKRVADMLNGEQIGGKKKSAIYYDIWNIKYLTKFKWDDLTDEIAYKSAIREQKLNMVM 749
           + AKRVA++LNGEQIGGKK+S  YYD WNIKYL+KFKWD+LTDEIAYK AIREQKL + +
Sbjct: 126 SNAKRVANLLNGEQIGGKKRSQFYYDHWNIKYLSKFKWDNLTDEIAYKKAIREQKLALEI 185

Query: 750 SAAKREKDFYLSKVEKSRAMTEIDERMKKKRKIQEESGSNAEPAQVFPPRVVRQFRQKIA 929
           SAAKRE+DFYL KV++SRA++ I+ERMKK  K+Q+ESG     A   PP V RQF QK  
Sbjct: 186 SAAKRERDFYLKKVDQSRALSSIEERMKK--KVQQESGGELSVAPQKPP-VCRQFPQKKP 242

Query: 930 IKNEVFQSKPGLSTDVLASV 989
           I  E  +SKP LS DVLA V
Sbjct: 243 IA-ERERSKPQLSKDVLAGV 261

>gi|255647374|gb|ACU24153.1| unknown [Glycine max]

          Length = 247

 Score =  262 bits (667), Expect = 1e-067
 Identities = 129/217 (59%), Positives = 168/217 (77%), Gaps = 3/217 (1%)
 Frame = +3

Query: 252 ANGVSKEEEETKETVKKSQKADRKKKKLKEKLLKEATKAD---NRGVCYLSRIPPHMDHV 422
           A+   KE+ E + T     + + K KK ++K  K+A+  D    RGVCY+SRIPPHMDHV
Sbjct:   3 ASDSEKEDAEIEATETAPDEENAKLKKKEKKKKKKASSKDAEKKRGVCYMSRIPPHMDHV 62

Query: 423 RLRQILSQFGEIDRIYLAPEDPEAQVHRKKAGGFRGQLFSEGWVEFAKKTVAKRVADMLN 602
           +LR ILSQFG+I RI+LAP+D   QV  K++ G R Q +SEGWVEF  K+VAKRVA+MLN
Sbjct:  63 KLRHILSQFGDIQRIFLAPQDSSVQVSAKRSRGSRDQAYSEGWVEFGNKSVAKRVANMLN 122

Query: 603 GEQIGGKKKSAIYYDIWNIKYLTKFKWDDLTDEIAYKSAIREQKLNMVMSAAKREKDFYL 782
           GEQIGGKK+S+ YYD+WNIKYL+KFKWDDLT+E+A K AIREQKL + +SAAKRE+DFYL
Sbjct: 123 GEQIGGKKRSSFYYDLWNIKYLSKFKWDDLTEELALKKAIREQKLAVELSAAKRERDFYL 182

Query: 783 SKVEKSRAMTEIDERMKKKRKIQEESGSNAEPAQVFP 893
           SKV++SRA++ I+ R+KKK+KIQ++SG   +  + FP
Sbjct: 183 SKVDQSRALSAIEGRLKKKQKIQQDSGLVEKVIRHFP 219

>gi|255642287|gb|ACU21408.1| unknown [Glycine max]

          Length = 251

 Score =  256 bits (653), Expect = 6e-066
 Identities = 130/220 (59%), Positives = 172/220 (78%), Gaps = 9/220 (4%)
 Frame = +3

Query: 255 NGVSKEEEETKETV------KKSQKADRKKKKLKEKL-LKEATKADNRGVCYLSRIPPHM 413
           N    E+E+T+  V      +++ K  +K+KK K+K+ LK+A K    GVCY+SRIPPHM
Sbjct:   2 NASDSEKEDTEIEVIAITPNEENSKLMKKEKKKKKKVSLKDAEK--KGGVCYMSRIPPHM 59

Query: 414 DHVRLRQILSQFGEIDRIYLAPEDPEAQVHRKKAGGFRGQLFSEGWVEFAKKTVAKRVAD 593
           DHV+LR ILSQFG+I RI+LAP+D   QV  K++ G R Q +SEGWVEF  K+VAKRVA+
Sbjct:  60 DHVKLRHILSQFGDIQRIFLAPQDSSVQVPAKRSRGSRDQAYSEGWVEFGNKSVAKRVAN 119

Query: 594 MLNGEQIGGKKKSAIYYDIWNIKYLTKFKWDDLTDEIAYKSAIREQKLNMVMSAAKREKD 773
           MLNGEQI GKK+S+ YYD+WNIKYL+KFKWDDLT+E+A K AIRE+KL + +SAAKRE+D
Sbjct: 120 MLNGEQIRGKKRSSFYYDLWNIKYLSKFKWDDLTEELALKKAIRERKLAVELSAAKRERD 179

Query: 774 FYLSKVEKSRAMTEIDERMKKKRKIQEESGSNAEPAQVFP 893
           FYLSKV++SRA++ I+ER+KKK+KIQ++SG   +  + FP
Sbjct: 180 FYLSKVDQSRALSAIEERLKKKQKIQKDSGPVEKVIRHFP 219

>gi|168056141|ref|XP_001780080.1| predicted protein [Physcomitrella patens
        subsp. patens]

          Length = 288

 Score =  236 bits (600), Expect = 8e-060
 Identities = 127/269 (47%), Positives = 177/269 (65%), Gaps = 10/269 (3%)
 Frame = +3

Query: 222 KMQSEESNELANGVSKEEEETKETVKKSQKA-DRKKKKLKEKLLKEATKA--DNRGVCYL 392
           K Q+    +  N     +E+ K+       A    +KK K K L+ A  +    RGV YL
Sbjct:  16 KKQARSRKQRENRAEALQEDGKKNEGLGDDAVAAGEKKRKRKALQNARSSVPGKRGVIYL 75

Query: 393 SRIPPHMDHVRLRQILSQFGEIDRIYLAPEDPEAQVHRKKAGGFRGQLFSEGWVEFAKKT 572
           SRIPPHM  ++LR +L  +GE+ RIYLAPEDP A++ RK+AGG  G+ F+EGWVEFA K+
Sbjct:  76 SRIPPHMKPLKLRHLLEPYGEVLRIYLAPEDPAARLRRKRAGGNSGKNFTEGWVEFAHKS 135

Query: 573 VAKRVADMLNGEQIGGKKKSAIYYDIWNIKYLTKFKWDDLTDEIAYKSAIREQKLNMVMS 752
            AKR+A MLNGE +   K+SA +YD+WN+KYL KFKWD+LT+EIAYK+A+REQ+L   +S
Sbjct: 136 DAKRIAVMLNGEPMEASKRSAYHYDLWNMKYLKKFKWDNLTEEIAYKNAVREQRLAAEIS 195

Query: 753 AAKREKDFYLSKVEKSRAMTEIDERMKKKRKIQE-------ESGSNAEPAQVFPPRVVRQ 911
           AAK+E+DFYLSKV++SRA+  ++ER K K+ +++       E  +  + +    PR+VR 
Sbjct: 196 AAKKERDFYLSKVDQSRAIASMEERKKNKKDLEQSQKPDGVEKPAIEKKSASEKPRLVRN 255

Query: 912 FRQKIAIKNEVFQSKPGLSTDVLASVFGG 998
           F QK  + + +  +   LS DVLA VFGG
Sbjct: 256 FNQKKPVADGMTSNVATLSRDVLAGVFGG 284

>gi|307102552|gb|EFN50823.1| hypothetical protein CHLNCDRAFT_11893 [Chlorella
        variabilis]

          Length = 167

 Score =  204 bits (517), Expect = 4e-050
 Identities = 98/167 (58%), Positives = 128/167 (76%)
 Frame = +3

Query: 336 KEKLLKEATKADNRGVCYLSRIPPHMDHVRLRQILSQFGEIDRIYLAPEDPEAQVHRKKA 515
           K+KL K   K D RG+ Y+SRIPPH+   +LRQ+L Q GEI R+YLAPEDP  +  RK+ 
Sbjct:   1 KKKLQKLKEKYDRRGIVYISRIPPHLKPQKLRQMLEQHGEIGRLYLAPEDPGLRRKRKQQ 60

Query: 516 GGFRGQLFSEGWVEFAKKTVAKRVADMLNGEQIGGKKKSAIYYDIWNIKYLTKFKWDDLT 695
           GG  G+ F+EGWVEF  K  AK VA MLNG+Q+GGK++SA YYD+W +KYL KFKWD LT
Sbjct:  61 GGNSGKNFTEGWVEFEDKQKAKEVAAMLNGQQMGGKRRSAYYYDLWCMKYLPKFKWDHLT 120

Query: 696 DEIAYKSAIREQKLNMVMSAAKREKDFYLSKVEKSRAMTEIDERMKK 836
           +EI Y+ A+REQ+L   +SAAKRE+DFYLS+V+K++A+T + ER +K
Sbjct: 121 EEINYQKAVREQRLAAEVSAAKRERDFYLSRVDKAKAVTAMMERKRK 167

>gi|302828430|ref|XP_002945782.1| hypothetical protein VOLCADRAFT_115705 [Volvox
        carteri f. nagariensis]

          Length = 429

 Score =  200 bits (506), Expect = 7e-049
 Identities = 107/221 (48%), Positives = 143/221 (64%), Gaps = 9/221 (4%)
 Frame = +3

Query: 222 KMQSEESNE-------LANGVSKEEEETKETVKKSQKADRKKKKLKEKLLKEATKADNRG 380
           K Q E+ +E        A G   ++ + K       K  +   K K + +KEA  +D  G
Sbjct: 134 KFQDEDEDEDHTHNSGKAGGSESDDGDPKPDGDPKPKQRKVLSKRKLEQVKEA--SDCHG 191

Query: 381 VCYLSRIPPHMDHVRLRQILSQFGEIDRIYLAPEDPEAQVHRKKAGGFRGQLFSEGWVEF 560
           + Y+SRIPPHM   +LRQ+L  FG+I R+Y APEDP  +  RKK GG  G+ F+EGWVEF
Sbjct: 192 IVYISRIPPHMKPHKLRQLLEPFGDIGRVYCAPEDPAMRRMRKKKGGNSGKNFTEGWVEF 251

Query: 561 AKKTVAKRVADMLNGEQIGGKKKSAIYYDIWNIKYLTKFKWDDLTDEIAYKSAIREQKLN 740
             K  AKR A  LNG+ IGG K+SA +YD+W IKYL KFKWD LT+EI Y+ A+REQKL 
Sbjct: 252 EDKRRAKRAALALNGQPIGGSKRSAYHYDLWTIKYLPKFKWDMLTEEINYQRAVREQKLV 311

Query: 741 MVMSAAKREKDFYLSKVEKSRAMTEIDERMKKKRKIQEESG 863
             +SAAKRE+DFYLS+V+K++A+  I+ER  + ++ Q  SG
Sbjct: 312 AEISAAKRERDFYLSRVDKAKAIDAIEERRLRGKQTQPGSG 352

>gi|145351056|ref|XP_001419903.1| predicted protein [Ostreococcus lucimarinus
        CCE9901]

          Length = 175

 Score =  191 bits (485), Expect = 2e-046
 Identities = 93/176 (52%), Positives = 128/176 (72%), Gaps = 3/176 (1%)
 Frame = +3

Query: 321 KKKKLKEKLLKEATKADN--RGVCYLSRIPPHMDHVRLRQILSQFGEIDRIYLAPEDPEA 494
           KK  L  K L E  +AD+  RGV +L  IPP M   +LRQ+LS +GE DR+YLA EDP  
Sbjct:   1 KKPTLDAKAL-ERVRADHARRGVVFLGTIPPFMKPTKLRQLLSVYGETDRMYLAAEDPAT 59

Query: 495 QVHRKKAGGFRGQLFSEGWVEFAKKTVAKRVADMLNGEQIGGKKKSAIYYDIWNIKYLTK 674
           +  RKK GG  G+ + EGWVEF  K  AKR A+ML+G ++GGK++SA YYD+WNIKYL K
Sbjct:  60 RAKRKKFGGNTGKKYVEGWVEFRNKKDAKRAAEMLHGREVGGKRRSAHYYDLWNIKYLPK 119

Query: 675 FKWDDLTDEIAYKSAIREQKLNMVMSAAKREKDFYLSKVEKSRAMTEIDERMKKKR 842
           FKWD+LT+E+ Y+ A+RE+KL + +S AK+E+DFYL+K+++ +A+  + ER  K+R
Sbjct: 120 FKWDNLTEEMEYQKALREKKLQLELSVAKKERDFYLAKLDQGKALEAMKERRAKRR 175

>gi|159476922|ref|XP_001696560.1| hypothetical protein CHLREDRAFT_112316
        [Chlamydomonas reinhardtii]

          Length = 155

 Score =  189 bits (479), Expect = 9e-046
 Identities = 91/155 (58%), Positives = 116/155 (74%)
 Frame = +3

Query: 375 RGVCYLSRIPPHMDHVRLRQILSQFGEIDRIYLAPEDPEAQVHRKKAGGFRGQLFSEGWV 554
           RG+ Y+SRIPPHM   +LRQ+L   G + R+Y APEDP A+  RK+ GG  G+ F+EGWV
Sbjct:   1 RGIVYISRIPPHMKPHKLRQLLQPHGALGRVYCAPEDPAARRLRKQKGGNSGKNFTEGWV 60

Query: 555 EFAKKTVAKRVADMLNGEQIGGKKKSAIYYDIWNIKYLTKFKWDDLTDEIAYKSAIREQK 734
           EF  K  AKR A  LNG+ +GG K+SA YYD+WNIKYL KFKWD LT+EI Y+ A+REQ+
Sbjct:  61 EFEDKRRAKRAALALNGQAMGGSKRSAYYYDLWNIKYLPKFKWDTLTEEINYQRAVREQR 120

Query: 735 LNMVMSAAKREKDFYLSKVEKSRAMTEIDERMKKK 839
           L   +SAAKRE+DFYLS+V+K++A+  I ER  KK
Sbjct: 121 LVAEISAAKRERDFYLSRVDKAKAIDAIKERRAKK 155

>gi|255083408|ref|XP_002504690.1| predicted protein [Micromonas sp. RCC299]

          Length = 169

 Score =  184 bits (466), Expect = 3e-044
 Identities = 90/170 (52%), Positives = 124/170 (72%), Gaps = 3/170 (1%)
 Frame = +3

Query: 324 KKKLKEKLLKEATKAD--NRGVCYLSRIPPHMDHVRLRQILSQFGEIDRIYLAPEDPEAQ 497
           KK L EK LK+A +AD   RGV YL  IPP M   +LRQ+L+ +G++DR+YL PEDPE +
Sbjct:   1 KKVLSEKALKKA-RADQAKRGVVYLGSIPPFMKPQKLRQLLTPYGDLDRMYLMPEDPEIR 59

Query: 498 VHRKKAGGFRGQLFSEGWVEFAKKTVAKRVADMLNGEQIGGKKKSAIYYDIWNIKYLTKF 677
             RKK  G  G+ F EGWVEF  K  AK  A MLNG Q+GG+++ A + D+WN+KYL KF
Sbjct:  60 ARRKKFKGNTGKNFVEGWVEFRDKKKAKACAAMLNGTQVGGRRRGAHFSDLWNMKYLPKF 119

Query: 678 KWDDLTDEIAYKSAIREQKLNMVMSAAKREKDFYLSKVEKSRAMTEIDER 827
           KWD+LT+EI Y+ A+REQ++ + +S AK+E+DFYL KVE+++ + ++ ER
Sbjct: 120 KWDNLTEEIEYQKALREQRMQLELSVAKKERDFYLQKVEQAKQIEKMRER 169

>gi|291238741|ref|XP_002739285.1| PREDICTED: activator of basal transcription
        1-like [Saccoglossus kowalevskii]

          Length = 280

 Score =  183 bits (462), Expect = 8e-044
 Identities = 98/198 (49%), Positives = 132/198 (66%), Gaps = 7/198 (3%)
 Frame = +3

Query: 270 EEEETKETVKKSQKADRKKKKLKEKLLKEATKA-DNR--GVCYLSRIPPHMDHVRLRQIL 440
           E  E KE        D KK++LK    K+  KA DNR  G+ YLSRIPP+M   RL+ +L
Sbjct:  47 ESAEEKEEDDDRDGDDLKKERLKR--TKDVKKAKDNRIAGIVYLSRIPPYMKPKRLKLML 104

Query: 441 SQFGEIDRIYLAPEDPEAQVHRKKAGGFRGQLFSEGWVEFAKKTVAKRVADMLNGEQIGG 620
           SQFGEI R+YL PE  + +  RKK GG   + F EGWVEF  K++AK+VA  LN  QIGG
Sbjct: 105 SQFGEIGRVYLQPESKQKRKERKKRGGNSSKSFEEGWVEFKDKSIAKQVAWSLNNTQIGG 164

Query: 621 KKKSAIYYDIWNIKYLTKFKWDDLTDEIAYKSAIREQKLNMVMSAAKREKDFYLSKVEKS 800
           KK++  +YDIWNIKYL +FKW  L ++IAY+ A+ +Q++   +S AKRE +FYL  V+K 
Sbjct: 165 KKRNRYHYDIWNIKYLHRFKWTQLNEKIAYERAVHDQRMRTEISQAKRETNFYLKNVQKK 224

Query: 801 RAMTEIDERMKKKRKIQE 854
           + + E+++  KKKRK +E
Sbjct: 225 KMLDEMEK--KKKRKGEE 240

>gi|320168928|gb|EFW45827.1| pre-rRNA-processing protein esf-2 [Capsaspora
        owczarzaki ATCC 30864]

          Length = 403

 Score =  176 bits (445), Expect = 8e-042
 Identities = 92/222 (41%), Positives = 138/222 (62%), Gaps = 3/222 (1%)
 Frame = +3

Query: 228 QSEESNELANGVSKEEEETKETVKKSQKADRKKKKLKEKLLKEA--TKADNRGVCYLSRI 401
           Q ++ N+       + +  +E  + S K   KK+ + E    E+   K    GV YLSR+
Sbjct: 119 QEDDDNDHDQDGDGDGDGEEELAEVSGKKHDKKRPVIEATSVESFNEKVAKSGVLYLSRV 178

Query: 402 PPHMDHVRLRQILSQFGEIDRIYLAPEDPEAQVHRKKAGGFRGQLFSEGWVEFAKKTVAK 581
           PPHM   ++R +L  +G I R++L PEDPEA+  R K GG + + FSEGW+EFA K VAK
Sbjct: 179 PPHMKPDKIRHLLQIYGAIGRVFLQPEDPEARKRRIKKGGNKKKNFSEGWIEFADKGVAK 238

Query: 582 RVADMLNGEQIGGKKKSAIYYDIWNIKYLTKFKWDDLTDEIAYKSAIREQKLNMVMSAAK 761
            VA  LNG  IGGKK S  + D+W +KYL KFKW  LT++IAY++A+R+Q+L   ++ +K
Sbjct: 239 SVAASLNGTMIGGKKSSFYHDDLWLLKYLPKFKWHHLTEQIAYENAVRDQRLRTELAQSK 298

Query: 762 REKDFYLSKVEKSRAMTEIDERMKKKRKIQEESGSNAEPAQV 887
           RE +F++  V+KS+ + E++ R K  +K Q + G +  P+ V
Sbjct: 299 RENNFFMENVDKSKKIREVESR-KFAKKRQADDGEDDAPSAV 339

>gi|301118671|ref|XP_002907063.1| pre-rRNA-processing protein ESF2 [Phytophthora
        infestans T30-4]

          Length = 243

 Score =  174 bits (439), Expect = 4e-041
 Identities = 87/196 (44%), Positives = 130/196 (66%), Gaps = 7/196 (3%)
 Frame = +3

Query: 336 KEKLLKEATKADNRGVCYLSRIPPHMDHVRLRQILSQFGEIDRIYLAPEDPEAQVHRKKA 515
           +EK+ +   KA+ RGV Y++R+PP M   +LR +L ++GE++RIYL PED      R  +
Sbjct:  23 REKMEQFKEKAERRGVVYIARVPPFMKPEKLRHLLGKYGELNRIYLVPEDKALHKKRVSS 82

Query: 516 GGFRGQLFSEGWVEFAKKTVAKRVADMLNGEQIGGKKKSAIYYDIWNIKYLTKFKWDDLT 695
           GG R Q ++EGW+EF  K VAKRVA MLN  QIGG+K+   + D+WN+KYL  FKWD LT
Sbjct:  83 GGNRRQKYTEGWIEFENKKVAKRVAKMLNTTQIGGRKRDYYHDDMWNLKYLKGFKWDHLT 142

Query: 696 DEIAYKSAIREQKLNMVMSAAKREKDFYLSKVEKSRAMTEIDERMKKKRKIQEESGSNAE 875
           ++IAY++ IR+QKL M ++ AK+E + YL +VE+S+      E+M+ ++  +++ G  A+
Sbjct: 143 EKIAYENRIRDQKLRMEIAQAKKENEAYLERVEQSKQF----EKMEARKADKKQDGHAAK 198

Query: 876 PAQVFPPRVVRQFRQK 923
            A      + R F QK
Sbjct: 199 DAM---QHIRRTFHQK 211

>gi|213409696|ref|XP_002175618.1| U3 snoRNP-associated protein Esf2
        [Schizosaccharomyces japonicus yFS275]

          Length = 328

 Score =  173 bits (438), Expect = 5e-041
 Identities = 97/258 (37%), Positives = 152/258 (58%), Gaps = 5/258 (1%)
 Frame = +3

Query: 219 GKMQSEESNELANGVSKEEEETKETVKKSQKADRKKKKLKEKLLKEATKADNR-GVCYLS 395
           GK + E ++EL +    ++E  +   +KS K  +K  K+  + +++A KA  R GV YLS
Sbjct:  70 GKREGESNSELGS----DDEGQEVGSRKSSKKSKKIAKISPEEVEKARKAIKRSGVVYLS 125

Query: 396 RIPPHMDHVRLRQILSQFGEIDRIYLAPEDPEAQVHRKKAGGFRGQLFSEGWVEFAKKTV 575
           RIPP+M   +LR +LS +G+I RIYLAPE  +    R K GG +  L+ EGWVEF  K V
Sbjct: 126 RIPPYMSPQKLRHLLSAYGKIGRIYLAPESAKKHAARVKNGGNKRTLYEEGWVEFESKRV 185

Query: 576 AKRVADMLNGEQIGGKKKSAIYYDIWNIKYLTKFKWDDLTDEIAYKSAIREQKLNMVMSA 755
           AK VA MLN + IGGKK +  + DIWNIKYL KFKW  LT++IA ++A R  +L + +  
Sbjct: 186 AKTVAAMLNTQTIGGKKSNWYHDDIWNIKYLPKFKWHHLTEQIAAENAARATRLKLEIQQ 245

Query: 756 AKREKDFYLSKVEKSRAMTEIDERMKKKRKIQEESGSNAEPAQVFPPRVVRQFRQKIAIK 935
             +E   Y+  VE+++ +  + ++ K++ + + ES    EPA     + +R+F  + +  
Sbjct: 246 GNKELKEYVRNVERAKMIENMQKKRKERSETEGESAPPVEPANNDNSKQLRRFFHQKSAS 305

Query: 936 NEVFQSKPGLSTDVLASV 989
           +     K G  ++ + +V
Sbjct: 306 SHRILPKSGQDSEKVQNV 323

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,904,678,803,954
Number of Sequences: 15229318
Number of Extensions: 1904678803954
Number of Successful Extensions: 522425188
Number of sequences better than 0.0: 0
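
The bit scores in this report can be reproduced from the raw scores (the numbers in parentheses on each "Score =" line) and the gapped Lambda and K values in the statistics block above. The sketch below assumes the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2; the function name bit_score and the small table of values are illustrative and are not part of the BLAST output. Because Lambda and K are printed to only three decimals, the results are rounded before being compared with the reported scores.

```python
import math

# Gapped Karlin-Altschul parameters copied from the statistics block of this report.
LAMBDA = 0.267
K = 0.041


def bit_score(raw_score: int, lam: float = LAMBDA, k: float = K) -> float:
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw score -> bit score as printed in the report for four of the hits
# (XP_002878090.1, NP_191210.2, CAB88054.1, CBI15766.3).
reported = {1132: 441, 1118: 435, 1108: 431, 894: 349}

for raw, bits in reported.items():
    # Print the raw score, the recomputed bit score, and the reported bit score.
    print(raw, round(bit_score(raw)), bits)
```

The same conversion applies to the remaining hits, since every alignment in the report was scored with the same matrix and gap penalties.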