Library    |     Search    |     Batch query    |     SNP    |     SSR  

GenBank blast output of UN78955


BLASTX 7.6.2

Query= UN78955 /QuerySize=775
        (774 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|297798844|ref|XP_002867306.1| predicted protein [Arabidopsis ...    211   1e-052
gi|2827522|emb|CAA16530.1| hypothetical protein [Arabidopsis tha...    195   1e-047
gi|30688985|ref|NP_194855.2| sequence-specific DNA binding trans...    189   4e-046
gi|308447248|ref|XP_003087374.1| hypothetical protein CRE_16592 ...     62   6e-008
gi|154419696|ref|XP_001582864.1| hypothetical protein [Trichomon...     56   6e-006

>gi|297798844|ref|XP_002867306.1| predicted protein [Arabidopsis lyrata subsp.
        lyrata]

          Length = 297

 Score =  211 bits (535), Expect = 1e-052
 Identities = 135/233 (57%), Positives = 166/233 (71%), Gaps = 26/233 (11%)
 Frame = -3

Query: 772 NEIAAIESDCPNALSSFQKWTMISDNCNAL----DVDNELFQAIDAVVMIQENRDGSEPD 605
           N+I   ES       S+  W++ SD    L    ++D ELF+AI AVVMIQ+ + G+E  
Sbjct:  79 NQIKQWESQYRGTGRSY--WSLSSDKRKLLNLPGNIDIELFEAISAVVMIQDEKAGTE-- 134

Query: 604 SDSDPDAREDFDVVDVTAEL---GSKRSRERTMVVMKKENPPQKRKTEE---ETQRKNNQ 443
           SDSDP+A+   DVVD+TAEL   GSKRSR+RT+V+  KENPPQK K EE      + N +
Sbjct: 135 SDSDPEAQ---DVVDITAELAFVGSKRSRQRTIVM--KENPPQKTKKEEPQISRVQVNTR 189

Query: 442 EQ--RAKATHQKKTLEEKKKKKPVVEISTDEEEEEENTMSIEEEVKALEAKLGEKADMIH 269
           E+   AKATHQKKT+EE   K+P+ EISTDEEEEEE TM+IEEEV+ +EAKL  K D+IH
Sbjct: 190 EKPITAKATHQKKTMEE---KRPMEEISTDEEEEEE-TMNIEEEVEVMEAKLSYKIDLIH 245

Query: 268 AIVGRNLAKGSETGDDDVGIEDKLKFVRQQGDELIACLSEIANTLDRFREVAQ 110
           AIVGRNLAK +ET  D +  +DKLKFVRQQGDELI CLSEI +TL+R REV Q
Sbjct: 246 AIVGRNLAKDNET-RDGINTDDKLKFVRQQGDELIGCLSEIVSTLNRLREVPQ 297


 Score =  63 bits (152), Expect = 4e-008
 Identities = 29/38 (76%), Positives = 32/38 (84%)
 Frame = -3

Query: 772 NEIAAIESDCPNALSSFQKWTMISDNCNALDVDNELFQ 659
           NEIAA+E+DC NALSSFQKWTMI +NCNALDV   L Q
Sbjct:  29 NEIAAVEADCSNALSSFQKWTMILENCNALDVRRNLNQ 66

>gi|2827522|emb|CAA16530.1| hypothetical protein [Arabidopsis thaliana]

          Length = 291

 Score =  195 bits (493), Expect = 1e-047
 Identities = 121/226 (53%), Positives = 155/226 (68%), Gaps = 19/226 (8%)
 Frame = -3

Query: 772 NEIAAIESDCPNALSSFQKWTMISDNCNAL----DVDNELFQAIDAVVMIQENRDGSEPD 605
           N+I   ES       S+  W++ SD    L    D+D ELF+AI+AVVMIQ+ + G+E  
Sbjct:  79 NQIKKWESQYRGTGRSY--WSLSSDKRKLLNLPGDIDIELFEAINAVVMIQDEKAGTE-- 134

Query: 604 SDSDPDAREDFDVVDVTAELGSKRSRERTMVVMKKENPPQKRKTEEETQRKNNQEQRAKA 425
           SDSDP+A+   DVVD++AELGSKRSR+RTMV+  KE   ++ +T         +    KA
Sbjct: 135 SDSDPEAQ---DVVDLSAELGSKRSRQRTMVM--KETKKEEPRTSRVQVNTREKPITTKA 189

Query: 424 THQKKTLEEKKKKKPVVEISTDEEEEEENTMSIEEEVKALEAKLGEKADMIHAIVGRNLA 245
           THQ KT+ E   KKPV ++STDEEE+E  TM+IEE+V+ +EAKL  K D+IHAIVGRNLA
Sbjct: 190 THQNKTMGE---KKPVEDMSTDEEEDE--TMNIEEDVEVMEAKLSYKIDLIHAIVGRNLA 244

Query: 244 KGSETGDDDVGIEDKLKFVRQQGDELIACLSEIANTLDRFREVAQE 107
           K +ET  D V ++DKLK VRQQGDELI CLSEI +TL+R  EV QE
Sbjct: 245 KDNET-KDGVSMDDKLKSVRQQGDELIGCLSEIVSTLNRLHEVPQE 289


 Score =  65 bits (157), Expect = 1e-008
 Identities = 29/38 (76%), Positives = 33/38 (86%)
 Frame = -3

Query: 772 NEIAAIESDCPNALSSFQKWTMISDNCNALDVDNELFQ 659
           NEIAA+E+DC NALSSFQKWTMI++NCNALDV   L Q
Sbjct:  29 NEIAAVEADCSNALSSFQKWTMITENCNALDVSRNLNQ 66

>gi|30688985|ref|NP_194855.2| sequence-specific DNA binding transcription factor
        [Arabidopsis thaliana]

          Length = 294

 Score =  189 bits (479), Expect = 4e-046
 Identities = 121/229 (52%), Positives = 155/229 (67%), Gaps = 22/229 (9%)
 Frame = -3

Query: 772 NEIAAIESDCPNALSSFQKWTMISDNCNAL----DVDNELFQAIDAVVMIQENRDGSEPD 605
           N+I   ES       S+  W++ SD    L    D+D ELF+AI+AVVMIQ+ + G+E  
Sbjct:  79 NQIKKWESQYRGTGRSY--WSLSSDKRKLLNLPGDIDIELFEAINAVVMIQDEKAGTE-- 134

Query: 604 SDSDPDAREDFDVVDVTAEL---GSKRSRERTMVVMKKENPPQKRKTEEETQRKNNQEQR 434
           SDSDP+A+   DVVD++AEL   GSKRSR+RTMV+  KE   ++ +T         +   
Sbjct: 135 SDSDPEAQ---DVVDLSAELAFVGSKRSRQRTMVM--KETKKEEPRTSRVQVNTREKPIT 189

Query: 433 AKATHQKKTLEEKKKKKPVVEISTDEEEEEENTMSIEEEVKALEAKLGEKADMIHAIVGR 254
            KATHQ KT+ E   KKPV ++STDEEE+E  TM+IEE+V+ +EAKL  K D+IHAIVGR
Sbjct: 190 TKATHQNKTMGE---KKPVEDMSTDEEEDE--TMNIEEDVEVMEAKLSYKIDLIHAIVGR 244

Query: 253 NLAKGSETGDDDVGIEDKLKFVRQQGDELIACLSEIANTLDRFREVAQE 107
           NLAK +ET  D V ++DKLK VRQQGDELI CLSEI +TL+R  EV QE
Sbjct: 245 NLAKDNET-KDGVSMDDKLKSVRQQGDELIGCLSEIVSTLNRLHEVPQE 292


 Score =  65 bits (157), Expect = 1e-008
 Identities = 29/38 (76%), Positives = 33/38 (86%)
 Frame = -3

Query: 772 NEIAAIESDCPNALSSFQKWTMISDNCNALDVDNELFQ 659
           NEIAA+E+DC NALSSFQKWTMI++NCNALDV   L Q
Sbjct:  29 NEIAAVEADCSNALSSFQKWTMITENCNALDVSRNLNQ 66

>gi|308447248|ref|XP_003087374.1| hypothetical protein CRE_16592 [Caenorhabditis
        remanei]

          Length = 267

 Score =  62 bits (150), Expect = 6e-008
 Identities = 37/115 (32%), Positives = 58/115 (50%), Gaps = 3/115 (2%)
 Frame = -3

Query: 631 ENRD---GSEPDSDSDPDAREDFDVVDVTAELGSKRSRERTMVVMKKENPPQKRKTEEET 461
           ENR    G  P  D+   A+E+ +      E   K+  E      KK+   +K+K EEE 
Sbjct:  39 ENRSSVAGPPPPDDAAAKAKEEEEKKKKEEEEAKKKKEEEEAEEKKKKEEEEKKKKEEEE 98

Query: 460 QRKNNQEQRAKATHQKKTLEEKKKKKPVVEISTDEEEEEENTMSIEEEVKALEAK 296
           ++K  +E++ +   +KK  E++KK+K   E    +EEEEE     EEE K  E++
Sbjct:  99 EKKKEEEKKKEDEEKKKKEEDEKKEKEEAEKKKKQEEEEEEKKKKEEETKKNESQ 153

>gi|154419696|ref|XP_001582864.1| hypothetical protein [Trichomonas vaginalis
        G3]

          Length = 1433

 Score =  56 bits (133), Expect = 6e-006
 Identities = 39/140 (27%), Positives = 69/140 (49%), Gaps = 6/140 (4%)
 Frame = -3

Query:  703 SDNCNALDVDNELFQAI----DAVVMIQENRDGSEPDSDSDPDAREDFDVVDVTAELGSK 536
            S N  A  VD E+ ++I    + +   +E  +  + +     + +   +      EL  K
Sbjct: 1191 SSNTFANLVDEEMQESIKQQQEEMRKAKELEEKQKREQQEQEEMKRKAEEEKRRQELEEK 1250

Query:  535 RSRERTMVVMKKENPPQKRKTEEETQRKNNQEQRAKATHQKKTLEEKKKKKPVVEISTDE 356
            + +E  +   +KE   +K+K EEE ++K  +E++ K   +KK  EE++KKK  +E    E
Sbjct: 1251 KKKE--LEQKQKEEEEKKKKEEEEKKKKEEEEKKKKEEEEKKKKEEEEKKKKELEQKKKE 1308

Query:  355 EEEEENTMSIEEEVKALEAK 296
            EEE +    IE++ K  E K
Sbjct: 1309 EEENKKKQEIEQKKKQDEDK 1328

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,093,974,624,356
Number of Sequences: 15229318
Number of Extensions: 1093974624356
Number of Successful Extensions: 299540451
Number of sequences better than 0.0: 0