BLASTX 7.6.2
Query= UN52801 /QuerySize=629
(628 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|9663025|emb|CAC01084.1| DIP2 protein [Arabidopsis thaliana] 119 3e-025
gi|30693141|ref|NP_198588.2| THO complex subunit 4 [Arabidopsis ... 119 3e-025
gi|145334661|ref|NP_001078676.1| THO complex subunit 4 [Arabidop... 119 3e-025
gi|297805348|ref|XP_002870558.1| hypothetical protein ARALYDRAFT... 114 9e-024
gi|255582255|ref|XP_002531919.1| RNA and export factor binding p... 87 1e-015
gi|9757981|dbj|BAB08317.1| unnamed protein product [Arabidopsis ... 76 3e-012
gi|115476498|ref|NP_001061845.1| Os08g0427900 [Oryza sativa Japo... 74 1e-011
gi|226531320|ref|NP_001146561.1| hypothetical protein LOC1002801... 70 2e-010
gi|12323574|gb|AAG51767.1|AC066691_7 RNA and export factor bindi... 69 3e-010
gi|18408471|ref|NP_564871.1| RNA recognition motif-containing pr... 69 3e-010
gi|242081501|ref|XP_002445519.1| hypothetical protein SORBIDRAFT... 68 7e-010
gi|242095380|ref|XP_002438180.1| hypothetical protein SORBIDRAFT... 67 2e-009
gi|115467430|ref|NP_001057314.1| Os06g0256200 [Oryza sativa Japo... 65 5e-009
gi|297841223|ref|XP_002888493.1| hypothetical protein ARALYDRAFT... 64 2e-008
gi|9663023|emb|CAC01083.1| DIP1 protein [Arabidopsis thaliana] 63 2e-008
>gi|9663025|emb|CAC01084.1| DIP2 protein [Arabidopsis thaliana]
Length = 288
Score = 119 bits (298), Expect = 3e-025
Identities = 63/90 (70%), Positives = 69/90 (76%), Gaps = 4/90 (4%)
Frame = +3
Query: 210 RGTSRRGGGGRRDHGPNGFLGGGGGRGNGPARRGPLAVNSRTSSFTINKPVRRTRSVPWQ 389
RG + R GG G G GGGGRG GPARRGPLAVN+R SSFTINKPVRR RS+PWQ
Sbjct: 15 RGKTARSGGRGISRG-RGRGRGGGGRGAGPARRGPLAVNARPSSFTINKPVRRVRSLPWQ 73
Query: 390 TGLFEDGLRAA---GVDVETRLHITNLDSG 470
+GLFEDGLRAA GV+V TRLH+TNLD G
Sbjct: 74 SGLFEDGLRAAGASGVEVGTRLHVTNLDQG 103
Score = 69 bits (166), Expect = 6e-010
Identities = 33/40 (82%)
Frame = +3
Query: 468 GGRGRGGGGGRGNGKKPVEKSAADLDKDRESYHADAMNTS 587
G RG GGGRGNG KPVEKSAADL KD ESYHADAMNTS
Sbjct: 249 GNGDRGRGGGRGNGNKPVEKSAADLAKDLESYHADAMNTS 288
>gi|30693141|ref|NP_198588.2| THO complex subunit 4 [Arabidopsis thaliana]
Length = 288
Score = 119 bits (298), Expect = 3e-025
Identities = 63/90 (70%), Positives = 69/90 (76%), Gaps = 4/90 (4%)
Frame = +3
Query: 210 RGTSRRGGGGRRDHGPNGFLGGGGGRGNGPARRGPLAVNSRTSSFTINKPVRRTRSVPWQ 389
RG + R GG G G GGGGRG GPARRGPLAVN+R SSFTINKPVRR RS+PWQ
Sbjct: 15 RGKTARSGGRGISRG-RGRGRGGGGRGAGPARRGPLAVNARPSSFTINKPVRRVRSLPWQ 73
Query: 390 TGLFEDGLRAA---GVDVETRLHITNLDSG 470
+GLFEDGLRAA GV+V TRLH+TNLD G
Sbjct: 74 SGLFEDGLRAAGASGVEVGTRLHVTNLDQG 103
Score = 76 bits (186), Expect = 3e-012
Identities = 36/40 (90%)
Frame = +3
Query: 468 GGRGRGGGGGRGNGKKPVEKSAADLDKDRESYHADAMNTS 587
G GRG GGGRGNGKKPVEKSAADLDKD ESYHADAMNTS
Sbjct: 249 GNGGRGRGGGRGNGKKPVEKSAADLDKDLESYHADAMNTS 288
>gi|145334661|ref|NP_001078676.1| THO complex subunit 4 [Arabidopsis thaliana]
Length = 280
Score = 119 bits (298), Expect = 3e-025
Identities = 63/90 (70%), Positives = 69/90 (76%), Gaps = 4/90 (4%)
Frame = +3
Query: 210 RGTSRRGGGGRRDHGPNGFLGGGGGRGNGPARRGPLAVNSRTSSFTINKPVRRTRSVPWQ 389
RG + R GG G G GGGGRG GPARRGPLAVN+R SSFTINKPVRR RS+PWQ
Sbjct: 15 RGKTARSGGRGISRG-RGRGRGGGGRGAGPARRGPLAVNARPSSFTINKPVRRVRSLPWQ 73
Query: 390 TGLFEDGLRAA---GVDVETRLHITNLDSG 470
+GLFEDGLRAA GV+V TRLH+TNLD G
Sbjct: 74 SGLFEDGLRAAGASGVEVGTRLHVTNLDQG 103
Score = 76 bits (186), Expect = 3e-012
Identities = 36/40 (90%)
Frame = +3
Query: 468 GGRGRGGGGGRGNGKKPVEKSAADLDKDRESYHADAMNTS 587
G GRG GGGRGNGKKPVEKSAADLDKD ESYHADAMNTS
Sbjct: 241 GNGGRGRGGGRGNGKKPVEKSAADLDKDLESYHADAMNTS 280
>gi|297805348|ref|XP_002870558.1| hypothetical protein ARALYDRAFT_493750
[Arabidopsis lyrata subsp. lyrata]
Length = 279
Score = 114 bits (285), Expect = 9e-024
Identities = 59/90 (65%), Positives = 66/90 (73%), Gaps = 3/90 (3%)
Frame = +3
Query: 210 RGTSRRGGGGRRDHGPNGFLGGGGGRGNGPARRGPLAVNSRTSSFTINKPVRRTRSVPWQ 389
RG + R GG G GGG RG GPARRGPLAVN+R SS +INKPVRR RS+PWQ
Sbjct: 15 RGKTARSGGRGNSRPGRGRGRGGGRRGAGPARRGPLAVNARPSSLSINKPVRRVRSLPWQ 74
Query: 390 TGLFEDGLRAA---GVDVETRLHITNLDSG 470
+GLFEDGLRAA GV+V TRLH+TNLD G
Sbjct: 75 SGLFEDGLRAAGVSGVEVGTRLHVTNLDQG 104
Score = 79 bits (192), Expect = 5e-013
Identities = 37/40 (92%)
Frame = +3
Query: 468 GGRGRGGGGGRGNGKKPVEKSAADLDKDRESYHADAMNTS 587
GG GRG GGGRGNGKKPVEKSAADLDKD ESYHADAMNTS
Sbjct: 240 GGGGRGRGGGRGNGKKPVEKSAADLDKDLESYHADAMNTS 279
>gi|255582255|ref|XP_002531919.1| RNA and export factor binding protein,
putative [Ricinus communis]
Length = 268
Score = 87 bits (215), Expect = 1e-015
Identities = 50/107 (46%), Positives = 67/107 (62%), Gaps = 15/107 (14%)
Frame = +3
Query: 165 LDESIKRAKAAKSGGRGTSRRGGGGRRDHGPNGFLGG--GGGRGNGPARRGPLAVNSRTS 338
LD+ IK+ + GGRG +RRG G GG GGR G R+GPL+VN+R S
Sbjct: 9 LDDIIKKNRERGRGGRGRARRGRG----------RGGSFSGGRMTGAGRKGPLSVNARPS 58
Query: 339 SFTINKPVRRTRSVPWQTGLFEDGLRAA---GVDVETRLHITNLDSG 470
F+I KP RR R++PWQ L ED +RAA GV+V T+L+++NL+ G
Sbjct: 59 QFSIAKPNRRIRNLPWQHDLLEDSIRAAGITGVEVGTKLYVSNLEYG 105
>gi|9757981|dbj|BAB08317.1| unnamed protein product [Arabidopsis thaliana]
Length = 330
Score = 76 bits (186), Expect = 3e-012
Identities = 36/40 (90%)
Frame = +3
Query: 468 GGRGRGGGGGRGNGKKPVEKSAADLDKDRESYHADAMNTS 587
G GRG GGGRGNGKKPVEKSAADLDKD ESYHADAMNTS
Sbjct: 291 GNGGRGRGGGRGNGKKPVEKSAADLDKDLESYHADAMNTS 330
Score = 65 bits (158), Expect = 5e-009
Identities = 33/48 (68%), Positives = 37/48 (77%), Gaps = 3/48 (6%)
Frame = +3
Query: 336 SSFTINKPVRRTRSVPWQTGLFEDGLRAA---GVDVETRLHITNLDSG 470
S I PVRR RS+PWQ+GLFEDGLRAA GV+V TRLH+TNLD G
Sbjct: 98 SFIIIALPVRRVRSLPWQSGLFEDGLRAAGASGVEVGTRLHVTNLDQG 145
>gi|115476498|ref|NP_001061845.1| Os08g0427900 [Oryza sativa Japonica Group]
Length = 286
Score = 74 bits (180), Expect = 1e-011
Identities = 44/96 (45%), Positives = 56/96 (58%), Gaps = 11/96 (11%)
Frame = +3
Query: 210 RGTSRRGGGGRRDHGPNGFLGGGGGRG-------NGPARRGP--LAVNSRTSSFTINKPV 362
RG R GGGR G G GGGGGRG +G RGP L VNSR S+ TI K
Sbjct: 7 RGGDRVSGGGRVQGGGGG--GGGGGRGGYVLRGRSGMPPRGPLGLGVNSRPSARTIAKSF 64
Query: 363 RRTRSVPWQTGLFEDGLRAAGVDVETRLHITNLDSG 470
RT+ + W+ LF D + A+G++ T+L+I+NLD G
Sbjct: 65 SRTKDMTWRPDLFSDSMAASGIETGTKLYISNLDYG 100
>gi|226531320|ref|NP_001146561.1| hypothetical protein LOC100280157 [Zea mays]
Length = 269
Score = 70 bits (170), Expect = 2e-010
Identities = 41/90 (45%), Positives = 55/90 (61%), Gaps = 12/90 (13%)
Frame = +3
Query: 222 RRGGGGRRDHGPNGFLGGGGGRG-------NGPARRGPLAVNSRTSSFTINKPVRRTRSV 380
RRGG D G +G + G GGRG +G A RGPL VNSR S+ TI K RT+ +
Sbjct: 6 RRGG----DRG-SGRIQGSGGRGGHVLRGRSGLAPRGPLGVNSRPSARTIAKSFSRTKDM 60
Query: 381 PWQTGLFEDGLRAAGVDVETRLHITNLDSG 470
W+ LF D + A+G++ T+L+I+NLD G
Sbjct: 61 TWRPDLFSDSMAASGIETGTKLYISNLDYG 90
>gi|12323574|gb|AAG51767.1|AC066691_7 RNA and export factor binding protein,
putative; 38196-36208 [Arabidopsis thaliana]
Length = 282
Score = 69 bits (168), Expect = 3e-010
Identities = 35/45 (77%), Positives = 36/45 (80%), Gaps = 6/45 (13%)
Frame = +3
Query: 471 GRGRGGGGGRGN------GKKPVEKSAADLDKDRESYHADAMNTS 587
GRGRG GGGRGN GKKPVEKSAADLDKD ESYHA+AMN S
Sbjct: 238 GRGRGNGGGRGNKSGGRGGKKPVEKSAADLDKDLESYHAEAMNIS 282
>gi|18408471|ref|NP_564871.1| RNA recognition motif-containing protein
[Arabidopsis thaliana]
Length = 295
Score = 69 bits (168), Expect = 3e-010
Identities = 35/45 (77%), Positives = 36/45 (80%), Gaps = 6/45 (13%)
Frame = +3
Query: 471 GRGRGGGGGRGN------GKKPVEKSAADLDKDRESYHADAMNTS 587
GRGRG GGGRGN GKKPVEKSAADLDKD ESYHA+AMN S
Sbjct: 251 GRGRGNGGGRGNKSGGRGGKKPVEKSAADLDKDLESYHAEAMNIS 295
>gi|242081501|ref|XP_002445519.1| hypothetical protein SORBIDRAFT_07g020860
[Sorghum bicolor]
Length = 272
Score = 68 bits (165), Expect = 7e-010
Identities = 40/90 (44%), Positives = 54/90 (60%), Gaps = 12/90 (13%)
Frame = +3
Query: 222 RRGGGGRRDHGPNGFLGGGGGRG-------NGPARRGPLAVNSRTSSFTINKPVRRTRSV 380
RRGG D G +G + G GGRG +G RGPL VNSR S+ TI K RT+ +
Sbjct: 6 RRGG----DRG-SGRIQGSGGRGGHVLRGRSGLPPRGPLGVNSRPSARTIAKSFSRTKDM 60
Query: 381 PWQTGLFEDGLRAAGVDVETRLHITNLDSG 470
W+ LF D + A+G++ T+L+I+NLD G
Sbjct: 61 TWRPDLFSDSMAASGIETGTKLYISNLDYG 90
>gi|242095380|ref|XP_002438180.1| hypothetical protein SORBIDRAFT_10g009240
[Sorghum bicolor]
Length = 307
Score = 67 bits (161), Expect = 2e-009
Identities = 45/118 (38%), Positives = 62/118 (52%), Gaps = 8/118 (6%)
Frame = +3
Query: 141 MSGGLDMTLDESIK-RAKAAKSGGRGTSRRGGGGRRDHGPNGFLGGG----GGRGNGPAR 305
M+ LD+ LD+ IK R + GRG RG G + + G G GRG G
Sbjct: 1 MATSLDVPLDDLIKSRNGRGRGRGRGQGGRGRGDGQRLARGSWRGRGTGTFRGRGLGVPS 60
Query: 306 RGPLAVNSRTSSFTINKPVRRTRSVPWQTGLFEDGLRAA---GVDVETRLHITNLDSG 470
R PL VN+R+SSF I K + + W+ LFED + AA G++ T+L+I+NL G
Sbjct: 61 RRPLGVNTRSSSFAIAKSFNKAKDFVWRHDLFEDSMVAAGLSGIESGTKLYISNLHYG 118
>gi|115467430|ref|NP_001057314.1| Os06g0256200 [Oryza sativa Japonica Group]
Length = 294
Score = 65 bits (158), Expect = 5e-009
Identities = 40/97 (41%), Positives = 56/97 (57%), Gaps = 8/97 (8%)
Frame = +3
Query: 195 AKSGGRGTSRRGGGGRRDHGPNGFLGGG--GGRGNGPARRGPLAVNSRTSSFTINKPVRR 368
++ GGRG RG G R +G G G GRG G R PL V++R+SS+ I K +
Sbjct: 27 SQGGGRG---RGDGQRFSYGSGRGRGAGTFRGRGVGVPSRRPLGVSTRSSSYAIAKSFNK 83
Query: 369 TRSVPWQTGLFEDGLRAAGVDV---ETRLHITNLDSG 470
T+ + W+ LFED + AAG+ V T+L+I+NL G
Sbjct: 84 TKDIVWRQDLFEDSMVAAGLSVTESSTKLYISNLHYG 120
>gi|297841223|ref|XP_002888493.1| hypothetical protein ARALYDRAFT_475731
[Arabidopsis lyrata subsp. lyrata]
Length = 293
Score = 64 bits (153), Expect = 2e-008
Identities = 33/44 (75%), Positives = 34/44 (77%), Gaps = 5/44 (11%)
Frame = +3
Query: 471 GRGRGGGGGRGN-----GKKPVEKSAADLDKDRESYHADAMNTS 587
GRGRG GGRGN GKK VEKSAADLDKD ESYHA+AMN S
Sbjct: 250 GRGRGNTGGRGNKSGRGGKKAVEKSAADLDKDLESYHAEAMNIS 293
>gi|9663023|emb|CAC01083.1| DIP1 protein [Arabidopsis thaliana]
Length = 295
Score = 63 bits (152), Expect = 2e-008
Identities = 34/44 (77%), Positives = 35/44 (79%), Gaps = 6/44 (13%)
Frame = +3
Query: 471 GRGRGGG-----GGRGNGKKPVEKSAADLDKDRESYHADAMNTS 587
GRG GGG GGRG GKKPVEKSAADLDKD ESYHA+AMN S
Sbjct: 253 GRGNGGGIGNKSGGRG-GKKPVEKSAADLDKDLESYHAEAMNIS 295
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,680,125,884,373
Number of Sequences: 15229318
Number of Extensions: 5680125884373
Number of Successful Extensions: 1311504089
Number of sequences better than 0.0: 0
|