BLASTX 7.6.2
Query= UN02650 /QuerySize=1241
(1240 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|297835172|ref|XP_002885468.1| SET domain-containing protein [... 206 6e-051
gi|297835174|ref|XP_002885469.1| hypothetical protein ARALYDRAFT... 206 6e-051
gi|11994649|dbj|BAB02844.1| unnamed protein product [Arabidopsis... 204 3e-050
gi|42565094|ref|NP_188819.2| histone-lysine N-methyltransferase ... 204 3e-050
gi|224132628|ref|XP_002327842.1| SET domain protein [Populus tri... 150 7e-034
gi|242077278|ref|XP_002448575.1| hypothetical protein SORBIDRAFT... 136 1e-029
gi|255549357|ref|XP_002515732.1| protein with unknown function [... 101 3e-019
gi|225447338|ref|XP_002274324.1| PREDICTED: hypothetical protein... 99 1e-018
gi|297739311|emb|CBI28962.3| unnamed protein product [Vitis vini... 97 4e-018
gi|26451714|dbj|BAC42952.1| unknown protein [Arabidopsis thaliana] 87 4e-015
gi|15231425|ref|NP_187378.1| SMAD/FHA domain-containing protein ... 87 4e-015
gi|297825551|ref|XP_002880658.1| predicted protein [Arabidopsis ... 87 4e-015
gi|297829272|ref|XP_002882518.1| hypothetical protein ARALYDRAFT... 87 4e-015
gi|115477441|ref|NP_001062316.1| Os08g0528900 [Oryza sativa Japo... 87 7e-015
gi|15231433|ref|NP_187382.1| SMAD/FHA domain-containing protein ... 86 9e-015
gi|42407966|dbj|BAD09104.1| putative transcriptional activator [... 85 2e-014
gi|326493928|dbj|BAJ85426.1| predicted protein [Hordeum vulgare ... 85 3e-014
gi|224065403|ref|XP_002301800.1| predicted protein [Populus tric... 83 1e-013
gi|326493358|dbj|BAJ85140.1| predicted protein [Hordeum vulgare ... 83 1e-013
gi|15705932|gb|AAL05884.1|AF411856_1 transcriptional activator F... 82 2e-013
>gi|297835172|ref|XP_002885468.1| SET domain-containing protein [Arabidopsis
lyrata subsp. lyrata]
Length = 471
Score = 206 bits (524), Expect = 6e-051
Identities = 106/169 (62%), Positives = 128/169 (75%), Gaps = 17/169 (10%)
Frame = -2
Query: 645 KVDCLVCSFCFRFIGSIEKQIGRKLYFKNLGLSGCCGGGGVSSESEGDECVKYS----EC 478
KVDCLVCSFCFRF+GSIEKQIGRKLYFKNLG+SGCC G SSES DECVKY+ +C
Sbjct: 78 KVDCLVCSFCFRFVGSIEKQIGRKLYFKNLGVSGCCDGD--SSESGEDECVKYNGNEEQC 135
Query: 477 DGAASSSSNPHSIPQGIVSSLMNGEMALPHTDKFPLPTPLSCPGGCQEAFYCSGSCAEAD 298
G SSS+ +++P+G+VSSLMNGEMALP+TD FPLP+PLSCPGGCQEAFYCS SCAEAD
Sbjct: 136 GG---SSSSHNTLPEGVVSSLMNGEMALPYTDMFPLPSPLSCPGGCQEAFYCSESCAEAD 192
Query: 297 WESSHSLLSALVRGRNQYPERLFESL--------LYILMRQMISFSWLQ 175
WESSHSLL + + E L E + +++L + I+F+ L+
Sbjct: 193 WESSHSLLCTGEKSESNSREALGEFIKHANDTNDIFLLAAKAIAFTILR 241
>gi|297835174|ref|XP_002885469.1| hypothetical protein ARALYDRAFT_898635
[Arabidopsis lyrata subsp. lyrata]
Length = 340
Score = 206 bits (524), Expect = 6e-051
Identities = 106/169 (62%), Positives = 128/169 (75%), Gaps = 17/169 (10%)
Frame = -2
Query: 645 KVDCLVCSFCFRFIGSIEKQIGRKLYFKNLGLSGCCGGGGVSSESEGDECVKYS----EC 478
KVDCLVCSFCFRF+GSIEKQIGRKLYFKNLG+SGCC G SSES DECVKY+ +C
Sbjct: 78 KVDCLVCSFCFRFVGSIEKQIGRKLYFKNLGVSGCCDGD--SSESGEDECVKYNGNEEQC 135
Query: 477 DGAASSSSNPHSIPQGIVSSLMNGEMALPHTDKFPLPTPLSCPGGCQEAFYCSGSCAEAD 298
G SSS+ +++P+G+VSSLMNGEMALP+TD FPLP+PLSCPGGCQEAFYCS SCAEAD
Sbjct: 136 GG---SSSSHNTLPEGVVSSLMNGEMALPYTDMFPLPSPLSCPGGCQEAFYCSESCAEAD 192
Query: 297 WESSHSLLSALVRGRNQYPERLFESL--------LYILMRQMISFSWLQ 175
WESSHSLL + + E L E + +++L + I+F+ L+
Sbjct: 193 WESSHSLLCTGEKSESNSREALGEFIKHANDTNDIFLLAAKAIAFTILR 241
>gi|11994649|dbj|BAB02844.1| unnamed protein product [Arabidopsis thaliana]
Length = 565
Score = 204 bits (518), Expect = 3e-050
Identities = 106/169 (62%), Positives = 125/169 (73%), Gaps = 19/169 (11%)
Frame = -2
Query: 645 KVDCLVCSFCFRFIGSIEKQIGRKLYFKNLGLSGCCGGGGVSSESEGDECVKYS----EC 478
KVDCLVCSFCFRFIGSIEKQIGRKLYFKNLG+SGCC SE DECVKY+ +C
Sbjct: 170 KVDCLVCSFCFRFIGSIEKQIGRKLYFKNLGVSGCCD----DDSSEEDECVKYNGNEEQC 225
Query: 477 DGAASSSSNPHSIPQGIVSSLMNGEMALPHTDKFPLPTPLSCPGGCQEAFYCSGSCAEAD 298
G SSS+ +++P+G+VSSLMNGEMALPHTDKFPLP+PLSCPGGCQEAFYCS SCA AD
Sbjct: 226 GG---SSSSHNTLPEGVVSSLMNGEMALPHTDKFPLPSPLSCPGGCQEAFYCSESCAAAD 282
Query: 297 WESSHSLLSALVRGRNQYPERLFESL--------LYILMRQMISFSWLQ 175
WESSHSLL R + E L E + +++L + I+F+ L+
Sbjct: 283 WESSHSLLCTGERSESISREALGEFIKHANDTNDIFLLAAKAIAFTILR 331
>gi|42565094|ref|NP_188819.2| histone-lysine N-methyltransferase ATXR2
[Arabidopsis thaliana]
Length = 473
Score = 204 bits (518), Expect = 3e-050
Identities = 106/169 (62%), Positives = 125/169 (73%), Gaps = 19/169 (11%)
Frame = -2
Query: 645 KVDCLVCSFCFRFIGSIEKQIGRKLYFKNLGLSGCCGGGGVSSESEGDECVKYS----EC 478
KVDCLVCSFCFRFIGSIEKQIGRKLYFKNLG+SGCC SE DECVKY+ +C
Sbjct: 82 KVDCLVCSFCFRFIGSIEKQIGRKLYFKNLGVSGCCD----DDSSEEDECVKYNGNEEQC 137
Query: 477 DGAASSSSNPHSIPQGIVSSLMNGEMALPHTDKFPLPTPLSCPGGCQEAFYCSGSCAEAD 298
G SSS+ +++P+G+VSSLMNGEMALPHTDKFPLP+PLSCPGGCQEAFYCS SCA AD
Sbjct: 138 GG---SSSSHNTLPEGVVSSLMNGEMALPHTDKFPLPSPLSCPGGCQEAFYCSESCAAAD 194
Query: 297 WESSHSLLSALVRGRNQYPERLFESL--------LYILMRQMISFSWLQ 175
WESSHSLL R + E L E + +++L + I+F+ L+
Sbjct: 195 WESSHSLLCTGERSESISREALGEFIKHANDTNDIFLLAAKAIAFTILR 243
>gi|224132628|ref|XP_002327842.1| SET domain protein [Populus trichocarpa]
Length = 398
Score = 150 bits (377), Expect = 7e-034
Identities = 81/184 (44%), Positives = 114/184 (61%), Gaps = 24/184 (13%)
Frame = -2
Query: 645 KVDCLVCSFCFRFIGSIEKQIGRKLYFKNLGLSGCCGGGGVSSESEGDECVKYSECDGAA 466
K+DCLVC +CF+FI S+E QIGRKLY ++LG+ C G + EC ++
Sbjct: 24 KLDCLVCGYCFQFIESVEYQIGRKLYLQSLGVPSCNG-------CDEGEC--------SS 68
Query: 465 SSSSNPHSIPQGIVSSLMNGEMALPHTDKFPLPTPLSCPGGCQEAFYCSGSCAEADWESS 286
SSS N +P+G++ +LMNGE+ LP++DKFPLP+ + CPGGCQEA+YCS SCA+ DWESS
Sbjct: 69 SSSYNKACLPEGVIEALMNGELVLPYSDKFPLPSTVPCPGGCQEAYYCSKSCAQTDWESS 128
Query: 285 HSLLSALVRGRNQYPERLFESL--------LYILMRQMISFSWLQRYIYIISVVMSASRL 130
HSLL R + E L + + +++L + ISF+ L RY + + S L
Sbjct: 129 HSLLCTGERSESLSIEALSKFIQHATETNDIFLLAAKTISFTIL-RYRKLKAANADRSEL 187
Query: 129 SWIL 118
S +L
Sbjct: 188 SLLL 191
>gi|242077278|ref|XP_002448575.1| hypothetical protein SORBIDRAFT_06g029440
[Sorghum bicolor]
Length = 472
Score = 136 bits (340), Expect = 1e-029
Identities = 65/126 (51%), Positives = 89/126 (70%), Gaps = 5/126 (3%)
Frame = -2
Query: 645 KVDCLVCSFCFRFIGSIEKQIGRKLYFKNLGLSGCCGGGGVSSESEGDECVKYSECDGAA 466
K+DC+VCS+CFRFIGSIE QIGR+LY ++LG S G G + + C GA+
Sbjct: 86 KIDCVVCSYCFRFIGSIEFQIGRRLYLQSLGGS---VDGSTERHCHGSDAGPSTGCSGAS 142
Query: 465 SSSSNPHSIPQGIVSSLMNGEMALPHTDKFPLPTPLSCPGGCQEAFYCSGSCAEADWESS 286
S +SN ++PQ ++ SLM+G M+LP TD+F LP+ ++CPGGC+ YCS SCA++DW+S
Sbjct: 143 SGNSN--AVPQEVLMSLMDGNMSLPLTDQFCLPSVVACPGGCEGELYCSQSCADSDWDSY 200
Query: 285 HSLLSA 268
HSLL A
Sbjct: 201 HSLLCA 206
>gi|255549357|ref|XP_002515732.1| protein with unknown function [Ricinus
communis]
Length = 394
Score = 101 bits (251), Expect = 3e-019
Identities = 49/87 (56%), Positives = 63/87 (72%), Gaps = 6/87 (6%)
Frame = -2
Query: 522 SSESEGDECVK----YSECDGAASSSSNPHSIPQGIVSSLMNGEMALPHTDKFPLPTPLS 355
SS E D +K +C +SSS+ ++P+G+V SLM+GE+ LPH+ KFPLP+P+S
Sbjct: 60 SSNEENDYYMKDGNDLKKC--TSSSSTEKVTLPKGVVESLMSGELPLPHSKKFPLPSPIS 117
Query: 354 CPGGCQEAFYCSGSCAEADWESSHSLL 274
C GGC EA+YCS CAEADWESSHSLL
Sbjct: 118 CSGGCGEAYYCSKLCAEADWESSHSLL 144
>gi|225447338|ref|XP_002274324.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 495
Score = 99 bits (245), Expect = 1e-018
Identities = 48/89 (53%), Positives = 62/89 (69%), Gaps = 7/89 (7%)
Frame = -2
Query: 525 VSSESEGDECV-----KYSECDGAASSSSNPHSIPQGIVSSLMNGEMALPHTDKFPLPTP 361
V S + D C + EC A+SSS + +P+G+V SLMNGE+ALP+ +FPLP+
Sbjct: 134 VDSSEDEDNCYMEDHDELGEC--ASSSSKDKVPLPKGVVESLMNGELALPYPKEFPLPSA 191
Query: 360 LSCPGGCQEAFYCSGSCAEADWESSHSLL 274
++C GGC EA+YCS CAEADWESSHSLL
Sbjct: 192 IACSGGCGEAYYCSKLCAEADWESSHSLL 220
>gi|297739311|emb|CBI28962.3| unnamed protein product [Vitis vinifera]
Length = 464
Score = 97 bits (241), Expect = 4e-018
Identities = 48/87 (55%), Positives = 63/87 (72%), Gaps = 7/87 (8%)
Frame = -2
Query: 534 GGGVSSESEGDECVKYSECDGAASSSSNPHSIPQGIVSSLMNGEMALPHTDKFPLPTPLS 355
G GVS+ + + EC A+SSS + +P+G+V SLMNGE+ALP+ +FPLP+ ++
Sbjct: 110 GLGVSTNHD-----ELGEC--ASSSSKDKVPLPKGVVESLMNGELALPYPKEFPLPSAIA 162
Query: 354 CPGGCQEAFYCSGSCAEADWESSHSLL 274
C GGC EA+YCS CAEADWESSHSLL
Sbjct: 163 CSGGCGEAYYCSKLCAEADWESSHSLL 189
>gi|26451714|dbj|BAC42952.1| unknown protein [Arabidopsis thaliana]
Length = 320
Score = 87 bits (215), Expect = 4e-015
Identities = 44/51 (86%), Positives = 48/51 (94%), Gaps = 1/51 (1%)
Frame = +2
Query: 1052 ASAIGGGCSDVVAGFAKLQGEDFEYYMQSYSIMLGRNSKKSTVDVDLSSLG 1204
A+A+GGG SDV GFAKLQGEDFEYYMQSYSI+LGRNSKK+TVDVDLSSLG
Sbjct: 2 ATAVGGG-SDVEVGFAKLQGEDFEYYMQSYSIILGRNSKKATVDVDLSSLG 51
>gi|15231425|ref|NP_187378.1| SMAD/FHA domain-containing protein [Arabidopsis
thaliana]
Length = 320
Score = 87 bits (215), Expect = 4e-015
Identities = 44/51 (86%), Positives = 48/51 (94%), Gaps = 1/51 (1%)
Frame = +2
Query: 1052 ASAIGGGCSDVVAGFAKLQGEDFEYYMQSYSIMLGRNSKKSTVDVDLSSLG 1204
A+A+GGG SDV GFAKLQGEDFEYYMQSYSI+LGRNSKK+TVDVDLSSLG
Sbjct: 2 ATAVGGG-SDVEVGFAKLQGEDFEYYMQSYSIILGRNSKKATVDVDLSSLG 51
>gi|297825551|ref|XP_002880658.1| predicted protein [Arabidopsis lyrata subsp.
lyrata]
Length = 221
Score = 87 bits (215), Expect = 4e-015
Identities = 44/51 (86%), Positives = 48/51 (94%), Gaps = 1/51 (1%)
Frame = +2
Query: 1052 ASAIGGGCSDVVAGFAKLQGEDFEYYMQSYSIMLGRNSKKSTVDVDLSSLG 1204
A+A+GGG SDV GFAKLQGEDFEYYMQSYSI+LGRNSKK+TVDVDLSSLG
Sbjct: 2 ATAVGGG-SDVEVGFAKLQGEDFEYYMQSYSIILGRNSKKATVDVDLSSLG 51
>gi|297829272|ref|XP_002882518.1| hypothetical protein ARALYDRAFT_478044
[Arabidopsis lyrata subsp. lyrata]
Length = 318
Score = 87 bits (215), Expect = 4e-015
Identities = 44/51 (86%), Positives = 48/51 (94%), Gaps = 1/51 (1%)
Frame = +2
Query: 1052 ASAIGGGCSDVVAGFAKLQGEDFEYYMQSYSIMLGRNSKKSTVDVDLSSLG 1204
A+A+GGG SDV GFAKLQGEDFEYYMQSYSI+LGRNSKK+TVDVDLSSLG
Sbjct: 2 ATAVGGG-SDVEVGFAKLQGEDFEYYMQSYSIILGRNSKKATVDVDLSSLG 51
>gi|115477441|ref|NP_001062316.1| Os08g0528900 [Oryza sativa Japonica Group]
Length = 343
Score = 87 bits (213), Expect = 7e-015
Identities = 43/53 (81%), Positives = 46/53 (86%)
Frame = +2
Query: 1046 AMASAIGGGCSDVVAGFAKLQGEDFEYYMQSYSIMLGRNSKKSTVDVDLSSLG 1204
AM + GG +V AGFAKLQGEDFEYYMQ+YSIMLGRNSKKSTVDVDLSSLG
Sbjct: 8 AMPAGSSGGDGEVEAGFAKLQGEDFEYYMQTYSIMLGRNSKKSTVDVDLSSLG 60
>gi|15231433|ref|NP_187382.1| SMAD/FHA domain-containing protein [Arabidopsis
thaliana]
Length = 251
Score = 86 bits (212), Expect = 9e-015
Identities = 44/51 (86%), Positives = 47/51 (92%), Gaps = 1/51 (1%)
Frame = +2
Query: 1052 ASAIGGGCSDVVAGFAKLQGEDFEYYMQSYSIMLGRNSKKSTVDVDLSSLG 1204
A+A+G G SDV GFAKLQGEDFEYYMQSYSI+LGRNSKKSTVDVDLSSLG
Sbjct: 2 ATAVGSG-SDVEVGFAKLQGEDFEYYMQSYSIILGRNSKKSTVDVDLSSLG 51
>gi|42407966|dbj|BAD09104.1| putative transcriptional activator [Oryza sativa
Japonica Group]
Length = 335
Score = 85 bits (209), Expect = 2e-014
Identities = 42/52 (80%), Positives = 45/52 (86%)
Frame = +2
Query: 1049 MASAIGGGCSDVVAGFAKLQGEDFEYYMQSYSIMLGRNSKKSTVDVDLSSLG 1204
M + GG +V AGFAKLQGEDFEYYMQ+YSIMLGRNSKKSTVDVDLSSLG
Sbjct: 1 MPAGSSGGDGEVEAGFAKLQGEDFEYYMQTYSIMLGRNSKKSTVDVDLSSLG 52
>gi|326493928|dbj|BAJ85426.1| predicted protein [Hordeum vulgare subsp.
vulgare]
Length = 344
Score = 85 bits (208), Expect = 3e-014
Identities = 42/53 (79%), Positives = 46/53 (86%)
Frame = +2
Query: 1046 AMASAIGGGCSDVVAGFAKLQGEDFEYYMQSYSIMLGRNSKKSTVDVDLSSLG 1204
A+ + GG +V AGFAKLQGEDFEYYMQ+YSIMLGRNSKKSTVDVDLSSLG
Sbjct: 12 AIPAGPSGGDGEVEAGFAKLQGEDFEYYMQTYSIMLGRNSKKSTVDVDLSSLG 64
>gi|224065403|ref|XP_002301800.1| predicted protein [Populus trichocarpa]
Length = 317
Score = 83 bits (203), Expect = 1e-013
Identities = 41/48 (85%), Positives = 44/48 (91%)
Frame = +2
Query: 1061 IGGGCSDVVAGFAKLQGEDFEYYMQSYSIMLGRNSKKSTVDVDLSSLG 1204
+G SDV AGFAKLQGEDFEYYMQ+YSI+LGRNSKKSTVDVDLSSLG
Sbjct: 1 MGATGSDVEAGFAKLQGEDFEYYMQTYSIILGRNSKKSTVDVDLSSLG 48
>gi|326493358|dbj|BAJ85140.1| predicted protein [Hordeum vulgare subsp.
vulgare]
Length = 239
Score = 83 bits (203), Expect = 1e-013
Identities = 41/53 (77%), Positives = 45/53 (84%)
Frame = +2
Query: 1046 AMASAIGGGCSDVVAGFAKLQGEDFEYYMQSYSIMLGRNSKKSTVDVDLSSLG 1204
A+ + GG +V AGFAKLQGEDFEYYMQ+YSIMLGRNSKKS VDVDLSSLG
Sbjct: 12 AIPAGPSGGDGEVEAGFAKLQGEDFEYYMQTYSIMLGRNSKKSAVDVDLSSLG 64
>gi|15705932|gb|AAL05884.1|AF411856_1 transcriptional activator FHA1 [Nicotiana
tabacum]
Length = 209
Score = 82 bits (201), Expect = 2e-013
Identities = 40/43 (93%), Positives = 42/43 (97%)
Frame = +2
Query: 1076 SDVVAGFAKLQGEDFEYYMQSYSIMLGRNSKKSTVDVDLSSLG 1204
SDV AGFAKLQGEDFEYYMQ+YSI+LGRNSKKSTVDVDLSSLG
Sbjct: 7 SDVEAGFAKLQGEDFEYYMQTYSIILGRNSKKSTVDVDLSSLG 49
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 359,848,489,672
Number of Sequences: 15229318
Number of Extensions: 359848489672
Number of Successful Extensions: 91916522
Number of sequences better than 0.0: 0