Library    |     Search    |     Batch query    |     SNP    |     SSR  

GenBank blast output of UN72668


BLASTX 7.6.2

Query= UN72668 /QuerySize=604
        (603 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|15238880|ref|NP_200204.1| uncharacterized protein [Arabidopsi...     85   5e-015
gi|297796243|ref|XP_002866006.1| hypothetical protein ARALYDRAFT...     80   2e-013
gi|71896709|ref|NP_001026146.1| hypothetical protein LOC420557 [...     61   1e-007
gi|326921753|ref|XP_003207120.1| PREDICTED: protein FAM133-like ...     60   2e-007
gi|156547343|ref|XP_001602537.1| PREDICTED: hypothetical protein...     59   3e-007
gi|281203540|gb|EFA77740.1| RNA polymerase I subunit [Polysphond...     59   4e-007
gi|154416118|ref|XP_001581082.1| hypothetical protein [Trichomon...     56   3e-006
gi|193589560|ref|XP_001945968.1| PREDICTED: hypothetical protein...     55   4e-006
gi|300120494|emb|CBK20048.2| unnamed protein product [Blastocyst...     55   4e-006
gi|118387474|ref|XP_001026844.1| hypothetical protein TTHERM_010...     55   6e-006
gi|339468824|gb|EGP83924.1| hypothetical protein MYCGRDRAFT_8787...     55   8e-006
gi|224044901|ref|XP_002194637.1| PREDICTED: hypothetical protein...     54   1e-005

>gi|15238880|ref|NP_200204.1| uncharacterized protein [Arabidopsis thaliana]

          Length = 529

 Score =  85 bits (209), Expect = 5e-015
 Identities = 49/92 (53%), Positives = 69/92 (75%), Gaps = 10/92 (10%)
 Frame = +1

Query: 145 AIGEEEEDKDEEKVIRRIKDSSDESESDSSFHSSSSSSSEEDYRREKQRWSKLSKKKKRK 324
           ++ +++  +++ K IRRIKD S+ S SDSS +    SSSE+DYRR+K+R SKLSKK   +
Sbjct:  20 SVKKKKSKRNKSKKIRRIKDESESSGSDSSLY----SSSEDDYRRKKKRRSKLSKK---R 72

Query: 325 SRKRYSSSDSEGEDDDIRV---KKRSKRKDKH 411
           SRKRYSSS+S+ + DD R+   KKRSKRKD++
Sbjct:  73 SRKRYSSSESDDDSDDDRLLKKKKRSKRKDEN 104

>gi|297796243|ref|XP_002866006.1| hypothetical protein ARALYDRAFT_918499
        [Arabidopsis lyrata subsp. lyrata]

          Length = 528

 Score =  80 bits (195), Expect = 2e-013
 Identities = 48/91 (52%), Positives = 68/91 (74%), Gaps = 13/91 (14%)
 Frame = +1

Query: 154 EEEEDKDEEKVIRRIKDSSDESESDSSFHSSSSSSSEEDYRREKQRWSKLSKKKKRKSRK 333
           +++  +++ K IRRIK+ S+ S SDSS +    SSSE+DYRR+K+R SKLSKK   +SRK
Sbjct:  27 KKKSKRNKSKKIRRIKE-SESSGSDSSLY----SSSEDDYRRKKKRRSKLSKK---RSRK 78

Query: 334 RYSSSDSEGEDDD-----IRVKKRSKRKDKH 411
           RYSSS+S+ +DDD     ++ KKRSKRKD++
Sbjct:  79 RYSSSESDDDDDDDDSRLLKKKKRSKRKDEY 109

>gi|71896709|ref|NP_001026146.1| hypothetical protein LOC420557 [Gallus gallus]

          Length = 250

 Score =  61 bits (146), Expect = 1e-007
 Identities = 40/88 (45%), Positives = 47/88 (53%), Gaps = 1/88 (1%)
 Frame = +1

Query: 151 GEEEEDKDEEKVIRRIKDSSDESESDSSFHSSSSSSSEEDYRREKQRWSKLSKKKK-RKS 327
           G E   K +EK  +  K SS  S S SS  SS SSSS  D   E ++  K  KKKK R S
Sbjct:  83 GNESSSKKKEKKKKEKKKSSRLSSSSSSSSSSDSSSSASDSEDEDKKQGKKKKKKKHRSS 142

Query: 328 RKRYSSSDSEGEDDDIRVKKRSKRKDKH 411
           RK  SSS SE E D     K+ K K++H
Sbjct: 143 RKSSSSSASESESDSKDSTKKKKSKEEH 170

>gi|326921753|ref|XP_003207120.1| PREDICTED: protein FAM133-like [Meleagris
        gallopavo]

          Length = 240

 Score =  60 bits (144), Expect = 2e-007
 Identities = 40/88 (45%), Positives = 46/88 (52%), Gaps = 1/88 (1%)
 Frame = +1

Query: 151 GEEEEDKDEEKVIRRIKDSSDESESDSSFHSSSSSSSEEDYRREKQRWSKLSKKKK-RKS 327
           G E   K +EK  +  K SS  S S SS  SS SSSS  D   E ++  K  KKKK R S
Sbjct:  73 GNESSSKKKEKKKKEKKKSSRLSSSSSSSSSSDSSSSASDSEDEDKKQGKKKKKKKHRSS 132

Query: 328 RKRYSSSDSEGEDDDIRVKKRSKRKDKH 411
           RK  SSS SE E D     K+ K K+ H
Sbjct: 133 RKSSSSSASESESDSKDSTKKKKSKEDH 160

>gi|156547343|ref|XP_001602537.1| PREDICTED: hypothetical protein LOC100118605
        [Nasonia vitripennis]

          Length = 169

 Score =  59 bits (142), Expect = 3e-007
 Identities = 31/85 (36%), Positives = 53/85 (62%), Gaps = 2/85 (2%)
 Frame = +1

Query: 154 EEEEDKDEEKVIRRIKDSSD--ESESDSSFHSSSSSSSEEDYRREKQRWSKLSKKKKRKS 327
           +++E K + K   + K+SSD  +SESD+   S S S S+ED++++ ++  K SK KKRK 
Sbjct:  84 KKKEKKKKRKKKHKKKESSDSSDSESDNDSSSDSESDSDEDHKKKHKKKKKKSKSKKRKK 143

Query: 328 RKRYSSSDSEGEDDDIRVKKRSKRK 402
            +++SSSDS+  + D       K++
Sbjct: 144 HRKHSSSDSDDSNSDSDNHSHKKKR 168

>gi|281203540|gb|EFA77740.1| RNA polymerase I subunit [Polysphondylium pallidum
        PN500]

          Length = 359

 Score =  59 bits (141), Expect = 4e-007
 Identities = 44/131 (33%), Positives = 69/131 (52%), Gaps = 14/131 (10%)
 Frame = +1

Query: 124 DELYYAVAIGEEE------EDKDEEKVIRRIKDSSDESESDSSFHSSSSSSSEEDYRREK 285
           DE     A+  EE      E KD +K  +   DS  ESES+SS  S S S S+E  +++K
Sbjct: 214 DEEEETTAVSVEEPSSTKAEKKDSKKEKKVKSDSESESESESSESSESESESDEKSKKQK 273

Query: 286 QRWSKLSKKKKRKSRKRYS----SSDSEGEDDDIRVKKRSKRKDKHS*PRRRDAGETRVQ 453
           +  +   ++ K+KS+K  S    SS+SE   DD + KK+ +   K    R+RD    R++
Sbjct: 274 KNSNNKKEESKKKSKKEESSESESSESESSSDD-KKKKKKQTTPKKKTKRKRD---QRMK 329

Query: 454 VPLVVMRIVRV 486
           V + V  +V++
Sbjct: 330 VAMKVAAVVKM 340

>gi|154416118|ref|XP_001581082.1| hypothetical protein [Trichomonas vaginalis
        G3]

          Length = 795

 Score =  56 bits (133), Expect = 3e-006
 Identities = 37/90 (41%), Positives = 48/90 (53%), Gaps = 6/90 (6%)
 Frame = +1

Query: 154 EEEEDKDEEKVIRRIKDSSDESESDSSFHSSSSSSSEEDYRREKQRWSKLSKKKKRKSRK 333
           +E  D + EK  +   DSSD+ +SDSS +SSSS SSE+D   EK+  S   +KK   S  
Sbjct: 628 DESNDDESEKESKEDSDSSDKEKSDSSSNSSSSDSSEDD---EKKSDSSSQEKKSNSSDN 684

Query: 334 RYSSSDSEGE---DDDIRVKKRSKRKDKHS 414
              SSDS  E   + D   KK    +DK S
Sbjct: 685 EEKSSDSSDEKKSESDDEDKKSDSSEDKKS 714

>gi|193589560|ref|XP_001945968.1| PREDICTED: hypothetical protein LOC100162495
        [Acyrthosiphon pisum]

          Length = 1607

 Score =  55 bits (132), Expect = 4e-006
 Identities = 33/90 (36%), Positives = 49/90 (54%), Gaps = 5/90 (5%)
 Frame = +1

Query:  154 EEEEDKDEEKVIRRIKDSSDESESDSSFHSSSSSSSEEDYRREKQRWSKLSKKKKRKSRK 333
            +E  D ++E+ I+++    +  +S SS  SSSSSS     R +K++ +K  KKK  K +K
Sbjct: 1497 DERFDNNDEEEIKKLPKDKESDDSVSSSSSSSSSSDSSPVRSKKRKKTKKRKKKHVKKKK 1556

Query:  334 RYSSSDSE-----GEDDDIRVKKRSKRKDK 408
              SSSDS+      E DD    K  K+K K
Sbjct: 1557 LASSSDSDDSDSSSESDDANSSKHKKKKKK 1586

>gi|300120494|emb|CBK20048.2| unnamed protein product [Blastocystis hominis]

          Length = 545

 Score =  55 bits (132), Expect = 4e-006
 Identities = 31/87 (35%), Positives = 44/87 (50%)
 Frame = +1

Query: 154 EEEEDKDEEKVIRRIKDSSDESESDSSFHSSSSSSSEEDYRREKQRWSKLSKKKKRKSRK 333
           EEEE+K EE+     K+SS +S   S    SS S SE +   EK+  S         S  
Sbjct: 259 EEEEEKKEEEKKEEEKESSSDSSDSSDSSDSSDSDSESEEEEEKKEESSSDSSDSSDSSD 318

Query: 334 RYSSSDSEGEDDDIRVKKRSKRKDKHS 414
              SSDSE E+++   KK  K++++ S
Sbjct: 319 SSDSSDSESEEEEEEEKKEEKKEEEES 345

>gi|118387474|ref|XP_001026844.1| hypothetical protein TTHERM_01071450
        [Tetrahymena thermophila]

          Length = 1368

 Score =  55 bits (131), Expect = 6e-006
 Identities = 30/84 (35%), Positives = 53/84 (63%), Gaps = 5/84 (5%)
 Frame = +1

Query:  157 EEEDKDEEKVIRRIKDSSDESESDSSFHSSSSSSSEEDYRREKQRWSKLSKKKKRKSRKR 336
            ++E+K ++K     K SS  S S S   SSS  SS+++  ++K++  K SKKKK+ ++KR
Sbjct: 1095 QDENKKKQK-----KQSSSSSSSRSQSSSSSGDSSDDEKDKKKKKSKKDSKKKKKNTKKR 1149

Query:  337 YSSSDSEGEDDDIRVKKRSKRKDK 408
               SDSE +D+ ++++K +  + K
Sbjct: 1150 RHDSDSEPDDEIMKLEKEALHEKK 1173

>gi|339468824|gb|EGP83924.1| hypothetical protein MYCGRDRAFT_87870
        [Mycosphaerella graminicola IPO323]

          Length = 912

 Score =  55 bits (130), Expect = 8e-006
 Identities = 31/80 (38%), Positives = 46/80 (57%)
 Frame = +1

Query: 202 DSSDESESDSSFHSSSSSSSEEDYRREKQRWSKLSKKKKRKSRKRYSSSDSEGEDDDIRV 381
           DSSD S S+SS   S  SSS E+  ++K+  SK +KK K K RK+    DS   +DD   
Sbjct:  81 DSSDSSSSESSEDESDESSSSEEEVKKKKSKSKTAKKSKDKKRKKSKKPDSSESEDDSGD 140

Query: 382 KKRSKRKDKHS*PRRRDAGE 441
           +  S  +D+ S  +++ AG+
Sbjct: 141 ESDSSSEDEASKKKQKKAGK 160

>gi|224044901|ref|XP_002194637.1| PREDICTED: hypothetical protein LOC100190354
        [Taeniopygia guttata]

          Length = 239

 Score =  54 bits (129), Expect = 1e-005
 Identities = 39/85 (45%), Positives = 46/85 (54%), Gaps = 3/85 (3%)
 Frame = +1

Query: 151 GEEEEDKDEEKVIRRIKDSSDESESDSSFHSSSSSSSEEDYRRE-KQRWSKLSKKKKRKS 327
           G E   K +EK  +  K S+  S S SS  SS SSSS  D   E K++  K  KKK R S
Sbjct:  73 GNESSSKKKEKKKKEKKKSNRLSSSSSSSSSSDSSSSSSDSEDEDKKQGKKRRKKKYRSS 132

Query: 328 RKRYSSSDSEGEDD--DIRVKKRSK 396
           RK  +S+ SE E D  D   KKRSK
Sbjct: 133 RKSSASTTSESESDSKDSTKKKRSK 157

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 670,546,551,498
Number of Sequences: 15229318
Number of Extensions: 670546551498
Number of Successful Extensions: 203192682
Number of sequences better than 0.0: 0