GenBank blast output of UN52801


BLASTX 7.6.2

Query= UN52801 /QuerySize=629
        (628 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|9663025|emb|CAC01084.1| DIP2 protein [Arabidopsis thaliana]         119   3e-025
gi|30693141|ref|NP_198588.2| THO complex subunit 4 [Arabidopsis ...    119   3e-025
gi|145334661|ref|NP_001078676.1| THO complex subunit 4 [Arabidop...    119   3e-025
gi|297805348|ref|XP_002870558.1| hypothetical protein ARALYDRAFT...    114   9e-024
gi|255582255|ref|XP_002531919.1| RNA and export factor binding p...     87   1e-015
gi|9757981|dbj|BAB08317.1| unnamed protein product [Arabidopsis ...     76   3e-012
gi|115476498|ref|NP_001061845.1| Os08g0427900 [Oryza sativa Japo...     74   1e-011
gi|226531320|ref|NP_001146561.1| hypothetical protein LOC1002801...     70   2e-010
gi|12323574|gb|AAG51767.1|AC066691_7 RNA and export factor bindi...     69   3e-010
gi|18408471|ref|NP_564871.1| RNA recognition motif-containing pr...     69   3e-010
gi|242081501|ref|XP_002445519.1| hypothetical protein SORBIDRAFT...     68   7e-010
gi|242095380|ref|XP_002438180.1| hypothetical protein SORBIDRAFT...     67   2e-009
gi|115467430|ref|NP_001057314.1| Os06g0256200 [Oryza sativa Japo...     65   5e-009
gi|297841223|ref|XP_002888493.1| hypothetical protein ARALYDRAFT...     64   2e-008
gi|9663023|emb|CAC01083.1| DIP1 protein [Arabidopsis thaliana]          63   2e-008

>gi|9663025|emb|CAC01084.1| DIP2 protein [Arabidopsis thaliana]

          Length = 288

 Score =  119 bits (298), Expect = 3e-025
 Identities = 63/90 (70%), Positives = 69/90 (76%), Gaps = 4/90 (4%)
 Frame = +3

Query: 210 RGTSRRGGGGRRDHGPNGFLGGGGGRGNGPARRGPLAVNSRTSSFTINKPVRRTRSVPWQ 389
           RG + R GG     G  G   GGGGRG GPARRGPLAVN+R SSFTINKPVRR RS+PWQ
Sbjct:  15 RGKTARSGGRGISRG-RGRGRGGGGRGAGPARRGPLAVNARPSSFTINKPVRRVRSLPWQ 73

Query: 390 TGLFEDGLRAA---GVDVETRLHITNLDSG 470
           +GLFEDGLRAA   GV+V TRLH+TNLD G
Sbjct:  74 SGLFEDGLRAAGASGVEVGTRLHVTNLDQG 103


 Score =  69 bits (166), Expect = 6e-010
 Identities = 33/40 (82%)
 Frame = +3

Query: 468 GGRGRGGGGGRGNGKKPVEKSAADLDKDRESYHADAMNTS 587
           G   RG GGGRGNG KPVEKSAADL KD ESYHADAMNTS
Sbjct: 249 GNGDRGRGGGRGNGNKPVEKSAADLAKDLESYHADAMNTS 288

>gi|30693141|ref|NP_198588.2| THO complex subunit 4 [Arabidopsis thaliana]

          Length = 288

 Score =  119 bits (298), Expect = 3e-025
 Identities = 63/90 (70%), Positives = 69/90 (76%), Gaps = 4/90 (4%)
 Frame = +3

Query: 210 RGTSRRGGGGRRDHGPNGFLGGGGGRGNGPARRGPLAVNSRTSSFTINKPVRRTRSVPWQ 389
           RG + R GG     G  G   GGGGRG GPARRGPLAVN+R SSFTINKPVRR RS+PWQ
Sbjct:  15 RGKTARSGGRGISRG-RGRGRGGGGRGAGPARRGPLAVNARPSSFTINKPVRRVRSLPWQ 73

Query: 390 TGLFEDGLRAA---GVDVETRLHITNLDSG 470
           +GLFEDGLRAA   GV+V TRLH+TNLD G
Sbjct:  74 SGLFEDGLRAAGASGVEVGTRLHVTNLDQG 103


 Score =  76 bits (186), Expect = 3e-012
 Identities = 36/40 (90%)
 Frame = +3

Query: 468 GGRGRGGGGGRGNGKKPVEKSAADLDKDRESYHADAMNTS 587
           G  GRG GGGRGNGKKPVEKSAADLDKD ESYHADAMNTS
Sbjct: 249 GNGGRGRGGGRGNGKKPVEKSAADLDKDLESYHADAMNTS 288

>gi|145334661|ref|NP_001078676.1| THO complex subunit 4 [Arabidopsis thaliana]

          Length = 280

 Score =  119 bits (298), Expect = 3e-025
 Identities = 63/90 (70%), Positives = 69/90 (76%), Gaps = 4/90 (4%)
 Frame = +3

Query: 210 RGTSRRGGGGRRDHGPNGFLGGGGGRGNGPARRGPLAVNSRTSSFTINKPVRRTRSVPWQ 389
           RG + R GG     G  G   GGGGRG GPARRGPLAVN+R SSFTINKPVRR RS+PWQ
Sbjct:  15 RGKTARSGGRGISRG-RGRGRGGGGRGAGPARRGPLAVNARPSSFTINKPVRRVRSLPWQ 73

Query: 390 TGLFEDGLRAA---GVDVETRLHITNLDSG 470
           +GLFEDGLRAA   GV+V TRLH+TNLD G
Sbjct:  74 SGLFEDGLRAAGASGVEVGTRLHVTNLDQG 103


 Score =  76 bits (186), Expect = 3e-012
 Identities = 36/40 (90%)
 Frame = +3

Query: 468 GGRGRGGGGGRGNGKKPVEKSAADLDKDRESYHADAMNTS 587
           G  GRG GGGRGNGKKPVEKSAADLDKD ESYHADAMNTS
Sbjct: 241 GNGGRGRGGGRGNGKKPVEKSAADLDKDLESYHADAMNTS 280

>gi|297805348|ref|XP_002870558.1| hypothetical protein ARALYDRAFT_493750
        [Arabidopsis lyrata subsp. lyrata]

          Length = 279

 Score =  114 bits (285), Expect = 9e-024
 Identities = 59/90 (65%), Positives = 66/90 (73%), Gaps = 3/90 (3%)
 Frame = +3

Query: 210 RGTSRRGGGGRRDHGPNGFLGGGGGRGNGPARRGPLAVNSRTSSFTINKPVRRTRSVPWQ 389
           RG + R GG        G   GGG RG GPARRGPLAVN+R SS +INKPVRR RS+PWQ
Sbjct:  15 RGKTARSGGRGNSRPGRGRGRGGGRRGAGPARRGPLAVNARPSSLSINKPVRRVRSLPWQ 74

Query: 390 TGLFEDGLRAA---GVDVETRLHITNLDSG 470
           +GLFEDGLRAA   GV+V TRLH+TNLD G
Sbjct:  75 SGLFEDGLRAAGVSGVEVGTRLHVTNLDQG 104


 Score =  79 bits (192), Expect = 5e-013
 Identities = 37/40 (92%)
 Frame = +3

Query: 468 GGRGRGGGGGRGNGKKPVEKSAADLDKDRESYHADAMNTS 587
           GG GRG GGGRGNGKKPVEKSAADLDKD ESYHADAMNTS
Sbjct: 240 GGGGRGRGGGRGNGKKPVEKSAADLDKDLESYHADAMNTS 279

>gi|255582255|ref|XP_002531919.1| RNA and export factor binding protein,
        putative [Ricinus communis]

          Length = 268

 Score =  87 bits (215), Expect = 1e-015
 Identities = 50/107 (46%), Positives = 67/107 (62%), Gaps = 15/107 (14%)
 Frame = +3

Query: 165 LDESIKRAKAAKSGGRGTSRRGGGGRRDHGPNGFLGG--GGGRGNGPARRGPLAVNSRTS 338
           LD+ IK+ +    GGRG +RRG G           GG   GGR  G  R+GPL+VN+R S
Sbjct:   9 LDDIIKKNRERGRGGRGRARRGRG----------RGGSFSGGRMTGAGRKGPLSVNARPS 58

Query: 339 SFTINKPVRRTRSVPWQTGLFEDGLRAA---GVDVETRLHITNLDSG 470
            F+I KP RR R++PWQ  L ED +RAA   GV+V T+L+++NL+ G
Sbjct:  59 QFSIAKPNRRIRNLPWQHDLLEDSIRAAGITGVEVGTKLYVSNLEYG 105

>gi|9757981|dbj|BAB08317.1| unnamed protein product [Arabidopsis thaliana]

          Length = 330

 Score =  76 bits (186), Expect = 3e-012
 Identities = 36/40 (90%)
 Frame = +3

Query: 468 GGRGRGGGGGRGNGKKPVEKSAADLDKDRESYHADAMNTS 587
           G  GRG GGGRGNGKKPVEKSAADLDKD ESYHADAMNTS
Sbjct: 291 GNGGRGRGGGRGNGKKPVEKSAADLDKDLESYHADAMNTS 330


 Score =  65 bits (158), Expect = 5e-009
 Identities = 33/48 (68%), Positives = 37/48 (77%), Gaps = 3/48 (6%)
 Frame = +3

Query: 336 SSFTINKPVRRTRSVPWQTGLFEDGLRAA---GVDVETRLHITNLDSG 470
           S   I  PVRR RS+PWQ+GLFEDGLRAA   GV+V TRLH+TNLD G
Sbjct:  98 SFIIIALPVRRVRSLPWQSGLFEDGLRAAGASGVEVGTRLHVTNLDQG 145

>gi|115476498|ref|NP_001061845.1| Os08g0427900 [Oryza sativa Japonica Group]

          Length = 286

 Score =  74 bits (180), Expect = 1e-011
 Identities = 44/96 (45%), Positives = 56/96 (58%), Gaps = 11/96 (11%)
 Frame = +3

Query: 210 RGTSRRGGGGRRDHGPNGFLGGGGGRG-------NGPARRGP--LAVNSRTSSFTINKPV 362
           RG  R  GGGR   G  G  GGGGGRG       +G   RGP  L VNSR S+ TI K  
Sbjct:   7 RGGDRVSGGGRVQGGGGG--GGGGGRGGYVLRGRSGMPPRGPLGLGVNSRPSARTIAKSF 64

Query: 363 RRTRSVPWQTGLFEDGLRAAGVDVETRLHITNLDSG 470
            RT+ + W+  LF D + A+G++  T+L+I+NLD G
Sbjct:  65 SRTKDMTWRPDLFSDSMAASGIETGTKLYISNLDYG 100

>gi|226531320|ref|NP_001146561.1| hypothetical protein LOC100280157 [Zea mays]

          Length = 269

 Score =  70 bits (170), Expect = 2e-010
 Identities = 41/90 (45%), Positives = 55/90 (61%), Gaps = 12/90 (13%)
 Frame = +3

Query: 222 RRGGGGRRDHGPNGFLGGGGGRG-------NGPARRGPLAVNSRTSSFTINKPVRRTRSV 380
           RRGG    D G +G + G GGRG       +G A RGPL VNSR S+ TI K   RT+ +
Sbjct:   6 RRGG----DRG-SGRIQGSGGRGGHVLRGRSGLAPRGPLGVNSRPSARTIAKSFSRTKDM 60

Query: 381 PWQTGLFEDGLRAAGVDVETRLHITNLDSG 470
            W+  LF D + A+G++  T+L+I+NLD G
Sbjct:  61 TWRPDLFSDSMAASGIETGTKLYISNLDYG 90

>gi|12323574|gb|AAG51767.1|AC066691_7 RNA and export factor binding protein,
        putative; 38196-36208 [Arabidopsis thaliana]

          Length = 282

 Score =  69 bits (168), Expect = 3e-010
 Identities = 35/45 (77%), Positives = 36/45 (80%), Gaps = 6/45 (13%)
 Frame = +3

Query: 471 GRGRGGGGGRGN------GKKPVEKSAADLDKDRESYHADAMNTS 587
           GRGRG GGGRGN      GKKPVEKSAADLDKD ESYHA+AMN S
Sbjct: 238 GRGRGNGGGRGNKSGGRGGKKPVEKSAADLDKDLESYHAEAMNIS 282

>gi|18408471|ref|NP_564871.1| RNA recognition motif-containing protein
        [Arabidopsis thaliana]

          Length = 295

 Score =  69 bits (168), Expect = 3e-010
 Identities = 35/45 (77%), Positives = 36/45 (80%), Gaps = 6/45 (13%)
 Frame = +3

Query: 471 GRGRGGGGGRGN------GKKPVEKSAADLDKDRESYHADAMNTS 587
           GRGRG GGGRGN      GKKPVEKSAADLDKD ESYHA+AMN S
Sbjct: 251 GRGRGNGGGRGNKSGGRGGKKPVEKSAADLDKDLESYHAEAMNIS 295

>gi|242081501|ref|XP_002445519.1| hypothetical protein SORBIDRAFT_07g020860
        [Sorghum bicolor]

          Length = 272

 Score =  68 bits (165), Expect = 7e-010
 Identities = 40/90 (44%), Positives = 54/90 (60%), Gaps = 12/90 (13%)
 Frame = +3

Query: 222 RRGGGGRRDHGPNGFLGGGGGRG-------NGPARRGPLAVNSRTSSFTINKPVRRTRSV 380
           RRGG    D G +G + G GGRG       +G   RGPL VNSR S+ TI K   RT+ +
Sbjct:   6 RRGG----DRG-SGRIQGSGGRGGHVLRGRSGLPPRGPLGVNSRPSARTIAKSFSRTKDM 60

Query: 381 PWQTGLFEDGLRAAGVDVETRLHITNLDSG 470
            W+  LF D + A+G++  T+L+I+NLD G
Sbjct:  61 TWRPDLFSDSMAASGIETGTKLYISNLDYG 90

>gi|242095380|ref|XP_002438180.1| hypothetical protein SORBIDRAFT_10g009240
        [Sorghum bicolor]

          Length = 307

 Score =  67 bits (161), Expect = 2e-009
 Identities = 45/118 (38%), Positives = 62/118 (52%), Gaps = 8/118 (6%)
 Frame = +3

Query: 141 MSGGLDMTLDESIK-RAKAAKSGGRGTSRRGGGGRRDHGPNGFLGGG----GGRGNGPAR 305
           M+  LD+ LD+ IK R    +  GRG   RG G  +      + G G     GRG G   
Sbjct:   1 MATSLDVPLDDLIKSRNGRGRGRGRGQGGRGRGDGQRLARGSWRGRGTGTFRGRGLGVPS 60

Query: 306 RGPLAVNSRTSSFTINKPVRRTRSVPWQTGLFEDGLRAA---GVDVETRLHITNLDSG 470
           R PL VN+R+SSF I K   + +   W+  LFED + AA   G++  T+L+I+NL  G
Sbjct:  61 RRPLGVNTRSSSFAIAKSFNKAKDFVWRHDLFEDSMVAAGLSGIESGTKLYISNLHYG 118

>gi|115467430|ref|NP_001057314.1| Os06g0256200 [Oryza sativa Japonica Group]

          Length = 294

 Score =  65 bits (158), Expect = 5e-009
 Identities = 40/97 (41%), Positives = 56/97 (57%), Gaps = 8/97 (8%)
 Frame = +3

Query: 195 AKSGGRGTSRRGGGGRRDHGPNGFLGGG--GGRGNGPARRGPLAVNSRTSSFTINKPVRR 368
           ++ GGRG   RG G R  +G     G G   GRG G   R PL V++R+SS+ I K   +
Sbjct:  27 SQGGGRG---RGDGQRFSYGSGRGRGAGTFRGRGVGVPSRRPLGVSTRSSSYAIAKSFNK 83

Query: 369 TRSVPWQTGLFEDGLRAAGVDV---ETRLHITNLDSG 470
           T+ + W+  LFED + AAG+ V    T+L+I+NL  G
Sbjct:  84 TKDIVWRQDLFEDSMVAAGLSVTESSTKLYISNLHYG 120

>gi|297841223|ref|XP_002888493.1| hypothetical protein ARALYDRAFT_475731
        [Arabidopsis lyrata subsp. lyrata]

          Length = 293

 Score =  64 bits (153), Expect = 2e-008
 Identities = 33/44 (75%), Positives = 34/44 (77%), Gaps = 5/44 (11%)
 Frame = +3

Query: 471 GRGRGGGGGRGN-----GKKPVEKSAADLDKDRESYHADAMNTS 587
           GRGRG  GGRGN     GKK VEKSAADLDKD ESYHA+AMN S
Sbjct: 250 GRGRGNTGGRGNKSGRGGKKAVEKSAADLDKDLESYHAEAMNIS 293

>gi|9663023|emb|CAC01083.1| DIP1 protein [Arabidopsis thaliana]

          Length = 295

 Score =  63 bits (152), Expect = 2e-008
 Identities = 34/44 (77%), Positives = 35/44 (79%), Gaps = 6/44 (13%)
 Frame = +3

Query: 471 GRGRGGG-----GGRGNGKKPVEKSAADLDKDRESYHADAMNTS 587
           GRG GGG     GGRG GKKPVEKSAADLDKD ESYHA+AMN S
Sbjct: 253 GRGNGGGIGNKSGGRG-GKKPVEKSAADLDKDLESYHAEAMNIS 295

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,680,125,884,373
Number of Sequences: 15229318
Number of Extensions: 5680125884373
Number of Successful Extensions: 1311504089
Number of sequences better than 0.0: 0
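
The bit scores listed in the alignments can be checked against the raw scores shown in parentheses, using the Lambda and K values from the footer. The sketch below assumes the standard Karlin-Altschul normalization, bits = (Lambda * raw - ln K) / ln 2. That formula is not stated in the report itself, and the function name `bit_score` is a hypothetical helper introduced only for illustration.

```python
import math

# Statistical parameters copied from the report footer (gapped values).
LAMBDA = 0.267
K = 0.041


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score to a normalized bit score.

    Assumes the Karlin-Altschul normalization; the report does not
    state which formula it used.
    """
    return (lam * raw_score - math.log(k)) / math.log(2)


# (reported bits, raw score) pairs taken from the alignments above.
reported = [(119, 298), (114, 285), (87, 215), (76, 186), (69, 166)]

for bits, raw in reported:
    print(f"raw {raw:>3} -> {bit_score(raw):6.2f} bits (report shows {bits})")
```

Under that assumption, each computed value rounds to the bit score printed in the report. This suggests the footer parameters are the ones used for scoring.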