Library    |     Search    |     Batch query    |     SNP    |     SSR  

GenBank blast output of UN82987


BLASTX 7.6.2

Query= UN82987 /QuerySize=882
        (881 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|297826591|ref|XP_002881178.1| hypothetical protein ARALYDRAFT...    270   3e-070
gi|4589953|gb|AAD26471.1| hypothetical protein [Arabidopsis thal...    259   4e-067
gi|42569516|ref|NP_180706.2| uncharacterized protein [Arabidopsi...    259   4e-067
gi|297843386|ref|XP_002889574.1| hypothetical protein ARALYDRAFT...    170   4e-040
gi|79316945|ref|NP_001030977.1| uncharacterized protein [Arabido...     74   3e-011

>gi|297826591|ref|XP_002881178.1| hypothetical protein ARALYDRAFT_482077
        [Arabidopsis lyrata subsp. lyrata]

          Length = 472

 Score =  270 bits (689), Expect = 3e-070
 Identities = 168/308 (54%), Positives = 205/308 (66%), Gaps = 42/308 (13%)
 Frame = +2

Query:  35 SFAGGRDGSKGLMKRITSTFSIKKNQNTTTNDHPKPVFPRSRSTGASYESMRLRQGKKSL 214
           S  GG+DGS+G +KR+TSTFSI+K +NTT++  PK + PRS+STGA+YESMRL QGKK+L
Sbjct:   3 STGGGKDGSRGFVKRVTSTFSIRKKKNTTSD--PKLLLPRSKSTGANYESMRLPQGKKAL 60

Query: 215 PDVTVK-KKGTKSACVSPQRRREKIDESRKQ------NEDMDSIWLTSD--SSSSLFGER 367
           PD T K  K TKSA VSPQ RREKIDES KQ       +D DSIWL+SD  S +SL  ER
Sbjct:  61 PDATTKDTKRTKSAGVSPQPRREKIDESGKQFMKMRCFDDNDSIWLSSDCASPTSLLEER 120

Query: 368 KVSVSFHFSLDESIVSWLSNAAK-----NQEDTKENHHHHHHHQKSSKSAKSSSENIQKD 532
           ++SVSFHFS+DE IVSWLS+ A      NQE T+      +H Q SSK+AK S ENI+KD
Sbjct: 121 RLSVSFHFSVDEKIVSWLSSVANSSLSLNQESTRST--KENHQQTSSKNAKCSLENIRKD 178

Query: 533 GKSAKTS-----------SSRLPENNNKTCEETSSFNRCVSPELSSQSHHEEKKVTFSL- 676
           GK   ++           SSRLPE+NNK C +        S  L S    EEKKV+FS+ 
Sbjct: 179 GKFCNSAGKARGTGSAKPSSRLPESNNKPCPQKPCEKSSSSNRLVSP---EEKKVSFSVE 235

Query: 677 ESDVSPSPVISTPGPSTPPITILASALEKAATEIGGSKRRNVVEPLFWPLEQKFDWTTDD 856
           E++ SPSPV ST            S+L+K+A EI  SK +  VEPLFWP EQKFDWT +D
Sbjct: 236 ETEKSPSPVNST--------ATATSSLKKSA-EIKDSKSKIAVEPLFWPFEQKFDWTPED 286

Query: 857 IMKHFSMS 880
           I+KHFSMS
Sbjct: 287 ILKHFSMS 294

>gi|4589953|gb|AAD26471.1| hypothetical protein [Arabidopsis thaliana]

          Length = 561

 Score =  259 bits (661), Expect = 4e-067
 Identities = 154/297 (51%), Positives = 193/297 (64%), Gaps = 20/297 (6%)
 Frame = +2

Query:  35 SFAGGRDGSKGLMKRITSTFSIKKNQNTTTNDHPKPVFPRSRSTGASYESMRLRQGKKSL 214
           S  GG+DGSKG +KR+TSTFSI+K +NTT++  PK + PRS+STGA+YESMRL QGKK+L
Sbjct:   3 STGGGKDGSKGFVKRVTSTFSIRKKKNTTSD--PKLLLPRSKSTGANYESMRLPQGKKAL 60

Query: 215 PDVTVKK--KGTKSACVSPQRRREKIDESRKQ------NEDMDSIWLTSD--SSSSLFGE 364
           PDV   K  K TKSA VSPQ RREKIDES KQ       +D DSIWL+SD  S +SL  E
Sbjct:  61 PDVVTTKDTKRTKSAGVSPQPRREKIDESGKQFMKVRCFDDSDSIWLSSDCASPTSLLEE 120

Query: 365 RKVSVSFHFSLDESIVSWLSNAAK-----NQEDTKENHHHHHHHQKSSKSAKSSSENIQK 529
           R++SVSFHFS+DE IVSWLS+ A      NQE T  N    +HHQKSSK+ K+S EN++K
Sbjct: 121 RRLSVSFHFSVDEKIVSWLSSVANSSLSLNQESTSSN--KENHHQKSSKNTKTSLENVRK 178

Query: 530 DGKSAKTSSSRLPENNNKTCEETSSFNRCVSPELSSQSHHEEKKVTFSLESDVSPSPVIS 709
           DGK   +S+ +     +       S N+    +   +S    + VT   E  VS S   +
Sbjct: 179 DGKVCNSSAGKARGTGSAKPSLPESNNKTCPQKQCEESSISNRFVTLE-EKKVSFSVAKT 237

Query: 710 TPGPSTPPITILASALEKAATEIGGSKRRNVVEPLFWPLEQKFDWTTDDIMKHFSMS 880
              PS    T  A++  K + EIG +K + VVEPLFWP EQKFDWT +DI+KHFSMS
Sbjct: 238 EKSPSPDNSTATATSSLKKSAEIGVTKSKIVVEPLFWPFEQKFDWTPEDILKHFSMS 294

>gi|42569516|ref|NP_180706.2| uncharacterized protein [Arabidopsis thaliana]

          Length = 472

 Score =  259 bits (661), Expect = 4e-067
 Identities = 154/297 (51%), Positives = 193/297 (64%), Gaps = 20/297 (6%)
 Frame = +2

Query:  35 SFAGGRDGSKGLMKRITSTFSIKKNQNTTTNDHPKPVFPRSRSTGASYESMRLRQGKKSL 214
           S  GG+DGSKG +KR+TSTFSI+K +NTT++  PK + PRS+STGA+YESMRL QGKK+L
Sbjct:   3 STGGGKDGSKGFVKRVTSTFSIRKKKNTTSD--PKLLLPRSKSTGANYESMRLPQGKKAL 60

Query: 215 PDVTVKK--KGTKSACVSPQRRREKIDESRKQ------NEDMDSIWLTSD--SSSSLFGE 364
           PDV   K  K TKSA VSPQ RREKIDES KQ       +D DSIWL+SD  S +SL  E
Sbjct:  61 PDVVTTKDTKRTKSAGVSPQPRREKIDESGKQFMKVRCFDDSDSIWLSSDCASPTSLLEE 120

Query: 365 RKVSVSFHFSLDESIVSWLSNAAK-----NQEDTKENHHHHHHHQKSSKSAKSSSENIQK 529
           R++SVSFHFS+DE IVSWLS+ A      NQE T  N    +HHQKSSK+ K+S EN++K
Sbjct: 121 RRLSVSFHFSVDEKIVSWLSSVANSSLSLNQESTSSN--KENHHQKSSKNTKTSLENVRK 178

Query: 530 DGKSAKTSSSRLPENNNKTCEETSSFNRCVSPELSSQSHHEEKKVTFSLESDVSPSPVIS 709
           DGK   +S+ +     +       S N+    +   +S    + VT   E  VS S   +
Sbjct: 179 DGKVCNSSAGKARGTGSAKPSLPESNNKTCPQKQCEESSISNRFVTLE-EKKVSFSVAKT 237

Query: 710 TPGPSTPPITILASALEKAATEIGGSKRRNVVEPLFWPLEQKFDWTTDDIMKHFSMS 880
              PS    T  A++  K + EIG +K + VVEPLFWP EQKFDWT +DI+KHFSMS
Sbjct: 238 EKSPSPDNSTATATSSLKKSAEIGVTKSKIVVEPLFWPFEQKFDWTPEDILKHFSMS 294

>gi|297843386|ref|XP_002889574.1| hypothetical protein ARALYDRAFT_470603
        [Arabidopsis lyrata subsp. lyrata]

          Length = 460

 Score =  170 bits (429), Expect = 4e-040
 Identities = 124/306 (40%), Positives = 175/306 (57%), Gaps = 43/306 (14%)
 Frame = +2

Query:  32 MSFAGGRD-GSKGLMKRITSTFSIKKNQNTTTNDHPKPVFPRSRSTGA-SYESMRLRQGK 205
           M+F   +D GSKG +KR+ S+FS++K +N T+   PK + PRS+STG+ ++ESMRL   K
Sbjct:   1 MAFTNEKDAGSKGFVKRVASSFSMRKKKNATS--EPK-LLPRSKSTGSTNFESMRLPATK 57

Query: 206 KSLPDVTVKKKGTKSACVS-PQRRREKIDE----------SRKQNEDMDSIWLTSD--SS 346
           K + DVT K +   S  V+ PQ RREKID+            +  +D DSIWL+SD  S 
Sbjct:  58 K-ISDVTNKTRIKPSGGVTPPQLRREKIDDRGGGINNKFVKWRSFDDSDSIWLSSDCASP 116

Query: 347 SSLFGERKVSVSFHFSLDESIVSWLSNAAKNQEDTKENHHHHHHHQKSSKSAKSSSENIQ 526
           +SL  ER++SVSF FS+DES+VSWLSN AK       NH      +   +  +++ ENIQ
Sbjct: 117 TSLLEERRLSVSFRFSVDESVVSWLSNLAK--ASLSLNHQEVSSIKDRPRIPRNTKENIQ 174

Query: 527 -KDGKSAKTSSSRLPENNNKTCEETSSFNRCVSPELSSQSHHEEKKVTFSLESDVSPSPV 703
            KD  S+  + + +  +   +  +  +F++    EL S +H              SPS +
Sbjct: 175 KKDSFSSAPNLTVIDSSTQSSQGKKVNFSQSSGIELESGNH--------------SPSLI 220

Query: 704 ISTPGPSTPPITILAS-----ALEKAATEIGGSK--RRNVVEPLFWPLEQKFDWTTDDIM 862
           IS+  PS P      S     +L++ + EI  SK    NV EPLFWP EQ+FDWT +DI+
Sbjct: 221 ISSDVPSDPNNHTATSLVRKISLDEKSAEIVDSKSSSSNVDEPLFWPYEQRFDWTPEDIL 280

Query: 863 KHFSMS 880
           KHFSMS
Sbjct: 281 KHFSMS 286

>gi|79316945|ref|NP_001030977.1| uncharacterized protein [Arabidopsis thaliana]

          Length = 457

 Score =  74 bits (179), Expect = 3e-011
 Identities = 46/134 (34%), Positives = 76/134 (56%), Gaps = 8/134 (5%)
 Frame = +2

Query: 308 EDMDSIWLTSD--SSSSLFGERKVSVSFHFSLDESIVSWLSNAAKNQEDTKENHHH---H 472
           +D DSIWL+SD  S +SL  ER++SVSF FS+DES+VSWLSN AK       NH      
Sbjct:  99 DDSDSIWLSSDCASPTSLLEERRLSVSFRFSVDESVVSWLSNLAKT--SLSLNHQEVSSI 156

Query: 473 HHHQKSSKSAKSSSENIQKDGKSAKTSSSRLPENNNKTCE-ETSSFNRCVSPELSSQSHH 649
               +  ++ K ++ENIQK   S    +  + +++ ++ + +  SF++    +L S +H 
Sbjct: 157 KDRPRIPRNTKENAENIQKKDSSRSVPNLTVVDSSTQSSQGKKVSFSKSSGTQLESGNHA 216

Query: 650 EEKKVTFSLESDVS 691
               ++  + SD++
Sbjct: 217 SSLIISSDVPSDLN 230


 Score =  57 bits (137), Expect = 3e-006
 Identities = 27/44 (61%), Positives = 33/44 (75%), Gaps = 2/44 (4%)
 Frame = +2

Query: 755 LEKAATEIGGSKR--RNVVEPLFWPLEQKFDWTTDDIMKHFSMS 880
           L++ + EI  SK    NV EPLFWP EQ+FDWT +DI+KHFSMS
Sbjct: 243 LDEKSAEIVDSKSSGSNVDEPLFWPYEQRFDWTPEDILKHFSMS 286

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,352,524,086,339
Number of Sequences: 15229318
Number of Extensions: 1352524086339
Number of Successful Extensions: 374284986
Number of sequences better than 0.0: 0