Library    |     Search    |     Batch query    |     SNP    |     SSR  

GenBank blast output of UN55122


BLASTX 7.6.2

Query= UN55122 /QuerySize=683
        (682 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|30693135|ref|NP_190477.2| small subunit ribosomal protein S9 ...    161   1e-037
gi|51971519|dbj|BAD44424.1| hypothetical protein [Arabidopsis th...    158   6e-037
gi|297819524|ref|XP_002877645.1| ribosomal protein S9 family pro...    158   8e-037
gi|6522557|emb|CAB62001.1| hypothetical protein [Arabidopsis tha...    156   4e-036
gi|255545214|ref|XP_002513668.1| ribosomal protein S9, putative ...     85   9e-015
gi|225464944|ref|XP_002274146.1| PREDICTED: hypothetical protein...     74   1e-011
gi|242051146|ref|XP_002463317.1| hypothetical protein SORBIDRAFT...     64   2e-008
gi|218200180|gb|EEC82607.1| hypothetical protein OsI_27181 [Oryz...     62   5e-008
gi|33354201|dbj|BAC81159.1| putative 30S ribosomal protein S9 [O...     59   5e-007
gi|33354200|dbj|BAC81158.1| putative 30S ribosomal protein S9 [O...     59   5e-007
gi|125601374|gb|EAZ40950.1| hypothetical protein OsJ_25432 [Oryz...     59   5e-007

>gi|30693135|ref|NP_190477.2| small subunit ribosomal protein S9 [Arabidopsis
        thaliana]

          Length = 430

 Score =  161 bits (406), Expect = 1e-037
 Identities = 104/187 (55%), Positives = 122/187 (65%), Gaps = 13/187 (6%)
 Frame = +2

Query: 143 STNSSGGGGGGGGGGNGSGNDTPWSLSGVNDGKSDPFSSNDSWGCG-----DGKWAAKEE 307
           S  SSG  G G  GG G   + P   +   + +   F +    G G     + +    EE
Sbjct:  75 SFGSSGVAGSGLPGGEGKWPEEPKRWNIKEEEEKVVFDTGGEVGQGIETSRERRGNEWEE 134

Query: 308 PKRWNMKEEGGDEKGVFGGSEGGEVANCGF-GDVKSSGWDVSSKPWDLKDEDK-VVFDTS 481
            KRW+MKE  G+EK VFGG  G EV   G  G+VKS+ WDV SKPW+LK+E++ VVFDT 
Sbjct: 135 TKRWDMKE--GEEKVVFGGG-GDEVDGFGIRGEVKSNEWDV-SKPWNLKEEEEGVVFDTG 190

Query: 482 GEMMPVGFDNSSLVNEEEERAKKEVFEKEEKELTEVIKGPDRAFGDLIAKSGITDEMLDS 661
           GE +P  F+N SL   EEER KKE+ EKEEKEL EVIKGPDRAFGDLIAKSGITDEMLDS
Sbjct: 191 GE-VPFSFEN-SLEMAEEERVKKELIEKEEKELLEVIKGPDRAFGDLIAKSGITDEMLDS 248

Query: 662 LIAFKDF 682
           LIA KDF
Sbjct: 249 LIALKDF 255


 Score =  156 bits (392), Expect = 4e-036
 Identities = 103/195 (52%), Positives = 121/195 (62%), Gaps = 28/195 (14%)
 Frame = +2

Query:  41 MLSRLIQRSSNLRLATLVSSKSNSQIFSSFIRPLSTNSSGGGGGGGGGGNGSGNDTPWSL 220
           MLSRL  R SNLR  TLVSSKSNSQIFSSFIRPLSTNSSGGGG G G G  + ND PWS 
Sbjct:   1 MLSRLFLRHSNLRFVTLVSSKSNSQIFSSFIRPLSTNSSGGGGNGDGNGR-NRNDVPWSF 59

Query: 221 SGVNDGKSDPFSSNDSWGC----------GDGKWAAKEEPKRWNMKEEGGDEKGVFGGSE 370
           +GVND KS PFSS+DS+G           G+GKW   EEPKRWN+KEE  +EK VF    
Sbjct:  60 TGVNDDKSGPFSSDDSFGSSGVAGSGLPGGEGKW--PEEPKRWNIKEE--EEKVVF--DT 113

Query: 371 GGEVANCGFG-----DVKSSGWDVSSKPWDLKD-EDKVVFDTSGEMMPVGFDNSSLVNEE 532
           GGEV   G G     + + + W+  +K WD+K+ E+KVVF   G+ +  GF     V   
Sbjct: 114 GGEV---GQGIETSRERRGNEWE-ETKRWDMKEGEEKVVFGGGGDEVD-GFGIRGEVKSN 168

Query: 533 EERAKKEVFEKEEKE 577
           E    K    KEE+E
Sbjct: 169 EWDVSKPWNLKEEEE 183

>gi|51971519|dbj|BAD44424.1| hypothetical protein [Arabidopsis thaliana]

          Length = 430

 Score =  158 bits (399), Expect = 6e-037
 Identities = 103/187 (55%), Positives = 121/187 (64%), Gaps = 13/187 (6%)
 Frame = +2

Query: 143 STNSSGGGGGGGGGGNGSGNDTPWSLSGVNDGKSDPFSSNDSWGCG-----DGKWAAKEE 307
           S  SSG  G G  GG G   + P   +   + +   F +    G G     + +    EE
Sbjct:  75 SFGSSGVAGSGLPGGEGKWPEEPKRWNIKEEEEKVVFDTGGEVGQGIETSRERRGNEWEE 134

Query: 308 PKRWNMKEEGGDEKGVFGGSEGGEVANCGF-GDVKSSGWDVSSKPWDLKDEDK-VVFDTS 481
            KRW+MKE  G+EK VFGG  G EV   G  G+ KS+ WDV SKPW+LK+E++ VVFDT 
Sbjct: 135 TKRWDMKE--GEEKVVFGGG-GDEVDGFGIRGEGKSNEWDV-SKPWNLKEEEEGVVFDTG 190

Query: 482 GEMMPVGFDNSSLVNEEEERAKKEVFEKEEKELTEVIKGPDRAFGDLIAKSGITDEMLDS 661
           GE +P  F+N SL   EEER KKE+ EKEEKEL EVIKGPDRAFGDLIAKSGITDEMLDS
Sbjct: 191 GE-VPFSFEN-SLEMAEEERVKKELIEKEEKELLEVIKGPDRAFGDLIAKSGITDEMLDS 248

Query: 662 LIAFKDF 682
           LIA KDF
Sbjct: 249 LIALKDF 255


 Score =  152 bits (382), Expect = 6e-035
 Identities = 94/165 (56%), Positives = 110/165 (66%), Gaps = 27/165 (16%)
 Frame = +2

Query:  41 MLSRLIQRSSNLRLATLVSSKSNSQIFSSFIRPLSTNSSGGGGGGGGGGNGSGNDTPWSL 220
           MLSRL  R SNLR  TLVSSKSNSQIFSSFIRPLSTNSSGGGG G G G  + ND PWS 
Sbjct:   1 MLSRLFLRHSNLRFVTLVSSKSNSQIFSSFIRPLSTNSSGGGGNGDGNGR-NRNDVPWSF 59

Query: 221 SGVNDGKSDPFSSNDSWGC----------GDGKWAAKEEPKRWNMKEEGGDEKGVFGGSE 370
           +GVND KS PFSS+DS+G           G+GKW   EEPKRWN+KEE  +EK VF    
Sbjct:  60 TGVNDDKSGPFSSDDSFGSSGVAGSGLPGGEGKW--PEEPKRWNIKEE--EEKVVF--DT 113

Query: 371 GGEVANCGFG-----DVKSSGWDVSSKPWDLKD-EDKVVFDTSGE 487
           GGEV   G G     + + + W+  +K WD+K+ E+KVVF   G+
Sbjct: 114 GGEV---GQGIETSRERRGNEWE-ETKRWDMKEGEEKVVFGGGGD 154

>gi|297819524|ref|XP_002877645.1| ribosomal protein S9 family protein
        [Arabidopsis lyrata subsp. lyrata]

          Length = 430

 Score =  158 bits (398), Expect = 8e-037
 Identities = 98/190 (51%), Positives = 117/190 (61%), Gaps = 18/190 (9%)
 Frame = +2

Query:  41 MLSRLIQRSSNLRLATLVSSKSNSQIFSSFIRPLSTNSSGGGGGGGGGGNGSGNDTPWSL 220
           MLSRL  R SNLR  TLV+SKSNSQIFSSFIRPLSTNS+GGGG G G G  + ND PWS 
Sbjct:   1 MLSRLFLRPSNLRYVTLVTSKSNSQIFSSFIRPLSTNSTGGGGNGDGNGR-NRNDVPWSF 59

Query: 221 SGVNDGKSDPFSSNDSWGC----------GDGKWAAKEEPKRWNMKEEGGDEKGVFG-GS 367
           +GVND KS PFSS+DSWG           GDGKW   EEPKRWNMKEE  ++K VF  G 
Sbjct:  60 TGVNDDKSGPFSSDDSWGSSGVEGSGSVGGDGKW--PEEPKRWNMKEE--EDKVVFDTGG 115

Query: 368 EGGEVANCGFGDVKSSGWDVSSKPWDLKDEDKVVFDTSGEMMPVGFDNSSLVNEEEERAK 547
           E G+    G  + + + W+  +K WD+K  ++ V   +GE +  GF     V   E    
Sbjct: 116 EVGQGIETG-RERRGNEWE-ETKRWDMKKGEEEVVFGAGEDVVDGFGIRGEVKSNEWDVS 173

Query: 548 KEVFEKEEKE 577
           K    KEE+E
Sbjct: 174 KPWNLKEEEE 183


 Score =  154 bits (389), Expect = 9e-036
 Identities = 103/192 (53%), Positives = 121/192 (63%), Gaps = 23/192 (11%)
 Frame = +2

Query: 143 STNSSGGGGGGGGGGNGSGNDTP--WSLSGVNDGKSDPFSSNDSWGCGDGK------WAA 298
           S  SSG  G G  GG+G   + P  W++    D            G   G+      W  
Sbjct:  75 SWGSSGVEGSGSVGGDGKWPEEPKRWNMKEEEDKVVFDTGGEVGQGIETGRERRGNEW-- 132

Query: 299 KEEPKRWNMKEEGGDEKGVFGGSEGGEVANCGF---GDVKSSGWDVSSKPWDLKDEDK-V 466
            EE KRW+MK+  G+E+ VFG    GE    GF   G+VKS+ WDV SKPW+LK+E++ V
Sbjct: 133 -EETKRWDMKK--GEEEVVFG---AGEDVVDGFGIRGEVKSNEWDV-SKPWNLKEEEEGV 185

Query: 467 VFDTSGEMMPVGFDNSSLVNEEEERAKKEVFEKEEKELTEVIKGPDRAFGDLIAKSGITD 646
           VFDT GE +P  F+N SL   EEER KKE+ EKEEKEL EVIKGPDRAFGDLIAKSGITD
Sbjct: 186 VFDTGGE-VPFSFEN-SLEMTEEERVKKELIEKEEKELLEVIKGPDRAFGDLIAKSGITD 243

Query: 647 EMLDSLIAFKDF 682
           EMLDSLIA KDF
Sbjct: 244 EMLDSLIALKDF 255

>gi|6522557|emb|CAB62001.1| hypothetical protein [Arabidopsis thaliana]

          Length = 237

 Score =  156 bits (392), Expect = 4e-036
 Identities = 103/195 (52%), Positives = 121/195 (62%), Gaps = 28/195 (14%)
 Frame = +2

Query:  41 MLSRLIQRSSNLRLATLVSSKSNSQIFSSFIRPLSTNSSGGGGGGGGGGNGSGNDTPWSL 220
           MLSRL  R SNLR  TLVSSKSNSQIFSSFIRPLSTNSSGGGG G G G  + ND PWS 
Sbjct:   1 MLSRLFLRHSNLRFVTLVSSKSNSQIFSSFIRPLSTNSSGGGGNGDGNGR-NRNDVPWSF 59

Query: 221 SGVNDGKSDPFSSNDSWGC----------GDGKWAAKEEPKRWNMKEEGGDEKGVFGGSE 370
           +GVND KS PFSS+DS+G           G+GKW   EEPKRWN+KEE  +EK VF    
Sbjct:  60 TGVNDDKSGPFSSDDSFGSSGVAGSGLPGGEGKW--PEEPKRWNIKEE--EEKVVF--DT 113

Query: 371 GGEVANCGFG-----DVKSSGWDVSSKPWDLKD-EDKVVFDTSGEMMPVGFDNSSLVNEE 532
           GGEV   G G     + + + W+  +K WD+K+ E+KVVF   G+ +  GF     V   
Sbjct: 114 GGEV---GQGIETSRERRGNEWE-ETKRWDMKEGEEKVVFGGGGDEVD-GFGIRGEVKSN 168

Query: 533 EERAKKEVFEKEEKE 577
           E    K    KEE+E
Sbjct: 169 EWDVSKPWNLKEEEE 183

>gi|255545214|ref|XP_002513668.1| ribosomal protein S9, putative [Ricinus
        communis]

          Length = 398

 Score =  85 bits (208), Expect = 9e-015
 Identities = 58/132 (43%), Positives = 75/132 (56%), Gaps = 15/132 (11%)
 Frame = +2

Query: 317 WNMKEEGG--DEKGVFGG----SEGGEVANCGFGDVKSSGW---DVSSKPWDLKDEDKVV 469
           W  +EEGG  D K +F G    S GGE  N      +   W   D   +   + D +++ 
Sbjct:  97 WLEEEEGGSNDSKDIFQGIEKESAGGEDNNEWLQSEEYKMWNLDDAEDQKDHVFDIEEIN 156

Query: 470 FDTSGEMMPVGFDNSSLVNEEEERAKKE-VFEKEEKELTEVIKGPDRAFGDLIAKSGITD 646
            DTS E     F   S  N E+E+ +++ + E EEKELT V+KG +RAFGDLIA SGITD
Sbjct: 157 ADTSSE-----FTTESSANVEKEKTEEQKMLEAEEKELTAVLKGSNRAFGDLIAASGITD 211

Query: 647 EMLDSLIAFKDF 682
            MLDSLIA +DF
Sbjct: 212 AMLDSLIALRDF 223

>gi|225464944|ref|XP_002274146.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 357

 Score =  74 bits (181), Expect = 1e-011
 Identities = 48/131 (36%), Positives = 64/131 (48%), Gaps = 16/131 (12%)
 Frame = +2

Query: 317 WNMKEEGGDEKGVFGGSEGGEVANC----GFGDVKSSGWDVSSKPWDLKDEDKVVFDTSG 484
           W + +E         G E G +A      G    +   W +   PW  ++E +      G
Sbjct:  57 WKISQEYDGNGDSIVGQEAGTLAGLTDEGGAAPAEDESW-LKDSPWTFEEERE-----KG 110

Query: 485 EMMPVGFDNSSLVNEE------EERAKKEVFEKEEKELTEVIKGPDRAFGDLIAKSGITD 646
            ++  G++    V         E    KE+ EKEE  L+ V+KGP+RAFGDLIA SGITD
Sbjct: 111 NVLDFGWELQEEVRAVGGETTIEGDGSKELLEKEESALSAVLKGPNRAFGDLIAASGITD 170

Query: 647 EMLDSLIAFKD 679
            MLDSLIA KD
Sbjct: 171 AMLDSLIALKD 181

>gi|242051146|ref|XP_002463317.1| hypothetical protein SORBIDRAFT_02g041720
        [Sorghum bicolor]

          Length = 412

 Score =  64 bits (154), Expect = 2e-008
 Identities = 30/60 (50%), Positives = 42/60 (70%)
 Frame = +2

Query: 500 GFDNSSLVNEEEERAKKEVFEKEEKELTEVIKGPDRAFGDLIAKSGITDEMLDSLIAFKD 679
           G D+     + ++  K++  +  EKEL E++KGP+RAFGDLIA SGIT+ M+DSLI  KD
Sbjct: 177 GLDDLDAGEDPDDELKRQQNKAREKELMEILKGPNRAFGDLIAASGITEGMIDSLILLKD 236

>gi|218200180|gb|EEC82607.1| hypothetical protein OsI_27181 [Oryza sativa Indica
        Group]

          Length = 427

 Score =  62 bits (150), Expect = 5e-008
 Identities = 30/53 (56%), Positives = 39/53 (73%)
 Frame = +2

Query: 521 VNEEEERAKKEVFEKEEKELTEVIKGPDRAFGDLIAKSGITDEMLDSLIAFKD 679
           V+EEEE  K++     E+EL E +KGP+RAFGDLI  SGIT++M+ SLI  KD
Sbjct: 184 VDEEEEERKRQERRAREQELMETLKGPNRAFGDLIEASGITEDMIASLILLKD 236

>gi|33354201|dbj|BAC81159.1| putative 30S ribosomal protein S9 [Oryza sativa
        Japonica Group]

          Length = 415

 Score =  59 bits (141), Expect = 5e-007
 Identities = 29/52 (55%), Positives = 37/52 (71%)
 Frame = +2

Query: 524 NEEEERAKKEVFEKEEKELTEVIKGPDRAFGDLIAKSGITDEMLDSLIAFKD 679
           +EEEE  K+      E+EL E +KGP+RAFGDLI  SGIT++M+ SLI  KD
Sbjct: 188 DEEEEERKRLERRAREQELMETLKGPNRAFGDLIEASGITEDMIASLILLKD 239

>gi|33354200|dbj|BAC81158.1| putative 30S ribosomal protein S9 [Oryza sativa
        Japonica Group]

          Length = 341

 Score =  59 bits (141), Expect = 5e-007
 Identities = 29/52 (55%), Positives = 37/52 (71%)
 Frame = +2

Query: 524 NEEEERAKKEVFEKEEKELTEVIKGPDRAFGDLIAKSGITDEMLDSLIAFKD 679
           +EEEE  K+      E+EL E +KGP+RAFGDLI  SGIT++M+ SLI  KD
Sbjct: 114 DEEEEERKRLERRAREQELMETLKGPNRAFGDLIEASGITEDMIASLILLKD 165

>gi|125601374|gb|EAZ40950.1| hypothetical protein OsJ_25432 [Oryza sativa
        Japonica Group]

          Length = 433

 Score =  59 bits (141), Expect = 5e-007
 Identities = 29/52 (55%), Positives = 37/52 (71%)
 Frame = +2

Query: 524 NEEEERAKKEVFEKEEKELTEVIKGPDRAFGDLIAKSGITDEMLDSLIAFKD 679
           +EEEE  K+      E+EL E +KGP+RAFGDLI  SGIT++M+ SLI  KD
Sbjct: 188 DEEEEERKRLERRAREQELMETLKGPNRAFGDLIEASGITEDMIASLILLKD 239

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,751,882,897,479
Number of Sequences: 15229318
Number of Extensions: 5751882897479
Number of Successful Extensions: 1355112730
Number of sequences better than 0.0: 0