BLASTX 7.6.2
Query= UN00272 /QuerySize=938
(937 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|297851836|ref|XP_002893799.1| hypothetical protein ARALYDRAFT... 263 3e-068
gi|30692973|ref|NP_174663.2| uncharacterized protein [Arabidopsi... 255 1e-065
gi|10086475|gb|AAG12535.1|AC015446_16 Unknown protein [Arabidops... 230 3e-058
gi|10092447|gb|AAG12850.1|AC079286_7 unknown protein; 15226-1472... 215 8e-054
gi|225460592|ref|XP_002263072.1| PREDICTED: hypothetical protein... 168 2e-039
gi|255632340|gb|ACU16528.1| unknown [Glycine max] 152 9e-035
gi|222632758|gb|EEE64890.1| hypothetical protein OsJ_19749 [Oryz... 84 2e-014
gi|255566690|ref|XP_002524329.1| conserved hypothetical protein ... 78 2e-012
gi|18395162|ref|NP_564181.1| uncharacterized protein [Arabidopsi... 67 3e-009
gi|297845306|ref|XP_002890534.1| hypothetical protein ARALYDRAFT... 67 5e-009
gi|326518054|dbj|BAK07279.1| predicted protein [Hordeum vulgare ... 64 2e-008
gi|212722818|ref|NP_001132730.1| hypothetical protein LOC1001942... 61 2e-007
gi|242053439|ref|XP_002455865.1| hypothetical protein SORBIDRAFT... 61 2e-007
gi|218188561|gb|EEC70988.1| hypothetical protein OsI_02642 [Oryz... 60 6e-007
gi|224085173|ref|XP_002307512.1| predicted protein [Populus tric... 59 1e-006
gi|212723456|ref|NP_001131259.1| hypothetical protein LOC1001925... 58 2e-006
gi|215768969|dbj|BAH01198.1| unnamed protein product [Oryza sati... 58 2e-006
>gi|297851836|ref|XP_002893799.1| hypothetical protein ARALYDRAFT_313917
[Arabidopsis lyrata subsp. lyrata]
Length = 187
Score = 263 bits (672), Expect = 3e-068
Identities = 144/199 (72%), Positives = 160/199 (80%), Gaps = 17/199 (8%)
Frame = -2
Query: 768 MLFAAEGGGFFSSSASGYTNGLALLLLGHKDEPPKKPVRV--SPWRHYHLVVEETDTKLQ 595
MLFAAEGGGFFSSSASGY+NGLALLLLG K+E +KP++V S W HYHLVVEE+DT +
Sbjct: 1 MLFAAEGGGFFSSSASGYSNGLALLLLGQKNE--QKPIKVSSSQWNHYHLVVEESDTGFR 58
Query: 594 LDSSKKWLSRACNSLTCFGRKSDRPEENPSQ---PQDEAPPPESVEYECEVVSNRFALKS 424
LDSSK WLS AC SL CFGRKS++ E+PS +DEA P SVEY CE V+NRFALKS
Sbjct: 59 LDSSKNWLSCACTSLICFGRKSEK-LESPSDIRGKKDEAVAP-SVEYNCE-VTNRFALKS 115
Query: 423 SLKKRSFSDAVLADEDDNDDDDVGRDNGVLNHADRRKVQWPDTCGIEIAEVREFEPSEVD 244
SLKKRSFSD V+ DDDV RD GV++H DRRKVQWPDTCGIEIAEVREFEPSEVD
Sbjct: 116 SLKKRSFSDVVIG------DDDVSRD-GVVDHTDRRKVQWPDTCGIEIAEVREFEPSEVD 168
Query: 243 ELDDELHHGNRKSCMCTIM 187
E DDE HHG+ KSCMCTIM
Sbjct: 169 ESDDEFHHGSGKSCMCTIM 187
>gi|30692973|ref|NP_174663.2| uncharacterized protein [Arabidopsis thaliana]
Length = 182
Score = 255 bits (649), Expect = 1e-065
Identities = 137/195 (70%), Positives = 153/195 (78%), Gaps = 14/195 (7%)
Frame = -2
Query: 768 MLFAAEGGGFFSSSASGYTNGLALLLLGHKDEPPKKPVRV-SPWRHYHLVVEETDTKLQL 592
MLFAAEGGGFFSSSASGY+NGLALLLLG K E +KP++V S W HYHLV+E++DT +L
Sbjct: 1 MLFAAEGGGFFSSSASGYSNGLALLLLGQKTE--QKPIKVSSQWNHYHLVLEDSDTGFRL 58
Query: 591 DSSKKWLSRACNSLTCFGRKSDRPEENPSQPQDEAPPPESVEYECEVVSNRFALKSSLKK 412
DSSK WLS AC SL CFGRKS+R E +DEAP E CE V+NRFALKSSLKK
Sbjct: 59 DSSKNWLSSACTSLICFGRKSERLESEGK--KDEAPSVEDYN-NCE-VTNRFALKSSLKK 114
Query: 411 RSFSDAVLADEDDNDDDDVGRDNGVLNHADRRKVQWPDTCGIEIAEVREFEPSEVDELDD 232
RSFSD V+ DDDV RD GV++H DRRKVQWPDTCGIEIAEVREFEPSEVDE +D
Sbjct: 115 RSFSDVVIG------DDDVSRD-GVVDHIDRRKVQWPDTCGIEIAEVREFEPSEVDESED 167
Query: 231 ELHHGNRKSCMCTIM 187
E HHG+ KSCMCTIM
Sbjct: 168 EFHHGSGKSCMCTIM 182
>gi|10086475|gb|AAG12535.1|AC015446_16 Unknown protein [Arabidopsis thaliana]
Length = 231
Score = 230 bits (585), Expect = 3e-058
Identities = 137/227 (60%), Positives = 158/227 (69%), Gaps = 17/227 (7%)
Frame = -2
Query: 927 NKKENRLSLSLTLAGLFSG--HLFSVAQIRQVNRIGGGCTQKNTGLGCVNEHYPQMLFAA 754
N + SLSL+L+ L S FS ++ + I CT K L C NE +MLFAA
Sbjct: 12 NPTKAHQSLSLSLSPLLSALPPAFSPVRLSDPSGIWSLCT-KRLLLECANETILEMLFAA 70
Query: 753 EGGGFFSSSASGYTNGLALLLLGHKDEPPKKPVRV-SPWRHYHLVVEETDTKLQLDSSKK 577
EGGGFFSSSASGY+NGLALLLLG K E +KP++V S W HYHLV+E++DT +LDSSK
Sbjct: 71 EGGGFFSSSASGYSNGLALLLLGQKTE--QKPIKVSSQWNHYHLVLEDSDTGFRLDSSKN 128
Query: 576 WLSRACNSLTCFGRKSDRPEENPSQPQDEAPPPESVEYECEVVSNRFALKSSLKKRSFSD 397
WLS AC SL CFGRKS+R E +DEAP E CE V+NRFALKSSLKKRSFSD
Sbjct: 129 WLSSACTSLICFGRKSERLESEGK--KDEAPSVEDYN-NCE-VTNRFALKSSLKKRSFSD 184
Query: 396 AVLADEDDNDDDDVGRDNGVLNHADRRKVQWPDTCGIEIAEVREFEP 256
V+ DDDV RD GV++H DRRKVQWPDTCGIEIAEVREFEP
Sbjct: 185 VVIG------DDDVSRD-GVVDHIDRRKVQWPDTCGIEIAEVREFEP 224
>gi|10092447|gb|AAG12850.1|AC079286_7 unknown protein; 15226-14726 [Arabidopsis
thaliana]
Length = 166
Score = 215 bits (547), Expect = 8e-054
Identities = 119/172 (69%), Positives = 133/172 (77%), Gaps = 14/172 (8%)
Frame = -2
Query: 768 MLFAAEGGGFFSSSASGYTNGLALLLLGHKDEPPKKPVRV-SPWRHYHLVVEETDTKLQL 592
MLFAAEGGGFFSSSASGY+NGLALLLLG K E +KP++V S W HYHLV+E++DT +L
Sbjct: 1 MLFAAEGGGFFSSSASGYSNGLALLLLGQKTE--QKPIKVSSQWNHYHLVLEDSDTGFRL 58
Query: 591 DSSKKWLSRACNSLTCFGRKSDRPEENPSQPQDEAPPPESVEYECEVVSNRFALKSSLKK 412
DSSK WLS AC SL CFGRKS+R E +DEAP E CE V+NRFALKSSLKK
Sbjct: 59 DSSKNWLSSACTSLICFGRKSERLESEGK--KDEAPSVEDYN-NCE-VTNRFALKSSLKK 114
Query: 411 RSFSDAVLADEDDNDDDDVGRDNGVLNHADRRKVQWPDTCGIEIAEVREFEP 256
RSFSD V+ DDDV RD GV++H DRRKVQWPDTCGIEIAEVREFEP
Sbjct: 115 RSFSDVVIG------DDDVSRD-GVVDHIDRRKVQWPDTCGIEIAEVREFEP 159
>gi|225460592|ref|XP_002263072.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 205
Score = 168 bits (423), Expect = 2e-039
Identities = 99/209 (47%), Positives = 123/209 (58%), Gaps = 19/209 (9%)
Frame = -2
Query: 768 MLFAAEGGGFFSSSASGYTNGLALLLLGHKDEPPKKPVRVSPWRHYHLVVEETDTKLQLD 589
ML A EGGGFFSSSASGYT GL LLLLG K+E +KP+RVSPW Y LV +E+DT LQL
Sbjct: 1 MLLAVEGGGFFSSSASGYTKGLTLLLLGQKNE--EKPMRVSPWNQYQLVDQESDTDLQLA 58
Query: 588 SSKKWLSRACNSLTCFGRKSDRPE------ENPSQPQDEAPPP--------ESVEYECEV 451
S K LSR C S CFGR S E P Q QD P P + ++ +
Sbjct: 59 SGKNRLSRGCASFVCFGRASAGLEVPSPLKVGPVQQQDGLPGPPISDKGKDHTTDHGDDN 118
Query: 450 VSNRFALKSSLKKRSFSDAVLADEDDNDDDDVGRD-NGVLNHADRRKVQWPDTCGIEIAE 274
LKSSLKK +++ DN+ + +G + + +RRKVQW D CG E+ E
Sbjct: 119 NERDVPLKSSLKKP--FNSIPVSGGDNECEPLGETCSDIPGCTERRKVQWTDACGRELVE 176
Query: 273 VREFEPSEVDELDDELHHGNRKSCMCTIM 187
++EFEPSEV E DDE +G+ +SC C IM
Sbjct: 177 IKEFEPSEVGESDDEFDNGSERSCSCAIM 205
>gi|255632340|gb|ACU16528.1| unknown [Glycine max]
Length = 209
Score = 152 bits (383), Expect = 9e-035
Identities = 88/205 (42%), Positives = 117/205 (57%), Gaps = 21/205 (10%)
Frame = -2
Query: 759 AAEGGGFFSSSASGYTNGLALLLLGHKDEPPKKPVRVSPWRHYHLVVEETDTKLQLDSSK 580
A EGGG FS+SASGYT GL+LLLLG ++E KP+RV+PW Y LV +E+D +LQL S+K
Sbjct: 4 AVEGGGLFSASASGYTKGLSLLLLGQRNE--DKPMRVAPWNQYQLVDQESDPELQLASTK 61
Query: 579 KWLSRACNSLTCFGRKS------DRPEENPSQPQDEAP--------PPESVEYECEVVSN 442
LSR C S CFGR S P+ P+Q D +P S + E +
Sbjct: 62 NRLSRGCASFVCFGRTSAGLDTPSPPKVGPAQQHDVSPGTLVSNKGKDPSAHVDDESDNR 121
Query: 441 RFALKSSLKKRSFSDAVLADEDDNDDDDVGRDNGVL---NHADRRKVQWPDTCGIEIAEV 271
+ LKSS+KK S + + + + G+ G+ +R+KVQW D CG E+ E+
Sbjct: 122 KVTLKSSIKKPQISKPIPVEAANEHEASGGQ--GICTPGGQPERKKVQWTDNCGSELVEI 179
Query: 270 REFEPSEVDELDDELHHGNRKSCMC 196
REFEPSEVD DDE GN + +C
Sbjct: 180 REFEPSEVDGSDDEFDSGNDRLVLC 204
>gi|222632758|gb|EEE64890.1| hypothetical protein OsJ_19749 [Oryza sativa
Japonica Group]
Length = 174
Score = 84 bits (207), Expect = 2e-014
Identities = 49/98 (50%), Positives = 59/98 (60%), Gaps = 12/98 (12%)
Frame = -2
Query: 819 CTQKNTGLGCVNEHYPQMLFAAEGGGFFSSSASGYTNGLALLLLGHKDEPPKKPVRVSPW 640
C Q+++ ++EH ML A E GGFFSSSASGY NGLALLLLGHK E +KPV+V+PW
Sbjct: 57 CLQESSCSRRLDEHLRWMLLAVEVGGFFSSSASGYRNGLALLLLGHKGE--EKPVKVTPW 114
Query: 639 RHYHLV----VEETDTKLQLDSSKKWLSRACNSLTCFG 538
HY LV E + + S K C S CFG
Sbjct: 115 NHYRLVGGGEAEPASEENNVPSGK------CASFICFG 146
>gi|255566690|ref|XP_002524329.1| conserved hypothetical protein [Ricinus
communis]
Length = 206
Score = 78 bits (190), Expect = 2e-012
Identities = 44/97 (45%), Positives = 58/97 (59%), Gaps = 2/97 (2%)
Frame = -2
Query: 474 SVEYECEVVSNRFALKSSLKKRSFSDAVLADEDDNDDDDVG-RDNGVLNHADRRKVQWPD 298
+ E E + R LKSSLKK + S V E+ N + +G + + + HA+RRKVQW D
Sbjct: 111 TTELEGDNNVRRVMLKSSLKKPTNSIPVPV-ENANQHNTLGEKGSNIPGHAERRKVQWTD 169
Query: 297 TCGIEIAEVREFEPSEVDELDDELHHGNRKSCMCTIM 187
CG E+AE+REFEPSE DDE + +SC C IM
Sbjct: 170 VCGSELAEIREFEPSETAGSDDEFDNAAERSCSCVIM 206
>gi|18395162|ref|NP_564181.1| uncharacterized protein [Arabidopsis thaliana]
Length = 216
Score = 67 bits (163), Expect = 3e-009
Identities = 36/86 (41%), Positives = 54/86 (62%), Gaps = 2/86 (2%)
Frame = -2
Query: 441 RFALKSSLKKRSFSDAVLADEDDNDDDDVGRDNGVL-NHADRRKVQWPDTCGIEIAEVRE 265
+ +L+SSLK+ S +++ + ED + + + D L RRKVQWPD CG E+ +VRE
Sbjct: 132 KLSLRSSLKRPSVAES-RSLEDIKEYETLSVDGSDLTGDMARRKVQWPDACGSELTQVRE 190
Query: 264 FEPSEVDELDDELHHGNRKSCMCTIM 187
FEPSE+ D+E G +++C C IM
Sbjct: 191 FEPSEMGLSDEEWEVGRQRTCSCVIM 216
>gi|297845306|ref|XP_002890534.1| hypothetical protein ARALYDRAFT_889791
[Arabidopsis lyrata subsp. lyrata]
Length = 214
Score = 67 bits (161), Expect = 5e-009
Identities = 32/85 (37%), Positives = 51/85 (60%)
Frame = -2
Query: 441 RFALKSSLKKRSFSDAVLADEDDNDDDDVGRDNGVLNHADRRKVQWPDTCGIEIAEVREF 262
+ +L+SSLK+ S +++ ++ + + + RRKVQWPD CG E+ +VREF
Sbjct: 130 KLSLRSSLKRPSVAESRSLEDIKEYETLTVDGSDLTGDMARRKVQWPDACGSELTQVREF 189
Query: 261 EPSEVDELDDELHHGNRKSCMCTIM 187
EPSE+ D+E G +++C C IM
Sbjct: 190 EPSEMGLSDEEWEVGRQRTCSCVIM 214
>gi|326518054|dbj|BAK07279.1| predicted protein [Hordeum vulgare subsp.
vulgare]
Length = 208
Score = 64 bits (155), Expect = 2e-008
Identities = 32/81 (39%), Positives = 47/81 (58%)
Frame = -2
Query: 432 LKSSLKKRSFSDAVLADEDDNDDDDVGRDNGVLNHADRRKVQWPDTCGIEIAEVREFEPS 253
LKS+ K+ S ++LA E + + + + + +RRKVQW DTCG E+ E+REFE S
Sbjct: 127 LKSNSKRDSSEHSILASEGEEPRESLEEVQTLKSGMERRKVQWTDTCGKELFEIREFEAS 186
Query: 252 EVDELDDELHHGNRKSCMCTI 190
+ DDEL + + C C I
Sbjct: 187 DGSLSDDELENEGFRKCECVI 207
>gi|212722818|ref|NP_001132730.1| hypothetical protein LOC100194216 [Zea mays]
Length = 208
Score = 61 bits (147), Expect = 2e-007
Identities = 30/84 (35%), Positives = 45/84 (53%)
Frame = -2
Query: 441 RFALKSSLKKRSFSDAVLADEDDNDDDDVGRDNGVLNHADRRKVQWPDTCGIEIAEVREF 262
R LKS+ K+ S ++ E + + V + + +RRKVQW DTCG E+ E+REF
Sbjct: 124 RGCLKSNSKRDSLEHRIVVSEGEEPRESVEEVQTLRSSIERRKVQWTDTCGKELFEIREF 183
Query: 261 EPSEVDELDDELHHGNRKSCMCTI 190
E S+ DD+ + + C C I
Sbjct: 184 ETSDEGLSDDDAENDGFRKCECVI 207
>gi|242053439|ref|XP_002455865.1| hypothetical protein SORBIDRAFT_03g026500
[Sorghum bicolor]
Length = 211
Score = 61 bits (147), Expect = 2e-007
Identities = 30/84 (35%), Positives = 45/84 (53%)
Frame = -2
Query: 441 RFALKSSLKKRSFSDAVLADEDDNDDDDVGRDNGVLNHADRRKVQWPDTCGIEIAEVREF 262
R LKS+ K+ S ++ E + + V + + +RRKVQW DTCG E+ E+REF
Sbjct: 127 RGCLKSNSKRDSLEHRIVVSEGEEPRESVEEVQTLRSSVERRKVQWTDTCGKELFEIREF 186
Query: 261 EPSEVDELDDELHHGNRKSCMCTI 190
E S+ DD+ + + C C I
Sbjct: 187 ETSDEGLSDDDAENEGFRKCECVI 210
>gi|218188561|gb|EEC70988.1| hypothetical protein OsI_02642 [Oryza sativa Indica
Group]
Length = 204
Score = 60 bits (143), Expect = 6e-007
Identities = 27/81 (33%), Positives = 45/81 (55%)
Frame = -2
Query: 432 LKSSLKKRSFSDAVLADEDDNDDDDVGRDNGVLNHADRRKVQWPDTCGIEIAEVREFEPS 253
LKS+ ++ S ++ E + + + + + +RRKVQW DTCG E+ E+REFE S
Sbjct: 123 LKSNSRRDSLEHCIVVSEGEEPRESLEEVQTLKSGMERRKVQWTDTCGTELFEIREFEAS 182
Query: 252 EVDELDDELHHGNRKSCMCTI 190
+ DD++ + + C C I
Sbjct: 183 DEGLSDDDMENEGFRKCECVI 203
>gi|224085173|ref|XP_002307512.1| predicted protein [Populus trichocarpa]
Length = 201
Score = 59 bits (140), Expect = 1e-006
Identities = 29/79 (36%), Positives = 47/79 (59%)
Frame = -2
Query: 474 SVEYECEVVSNRFALKSSLKKRSFSDAVLADEDDNDDDDVGRDNGVLNHADRRKVQWPDT 295
+ E E + + + L+SSLKK S S V ++ + + + + + H +RRKVQW D
Sbjct: 111 TTELEGDNNAIKVTLRSSLKKTSKSIPVPVEDANQSEPLNDKGSDIPGHTERRKVQWTDV 170
Query: 294 CGIEIAEVREFEPSEVDEL 238
CG E+AE+REFEP + ++
Sbjct: 171 CGSELAEIREFEPRSLKKI 189
>gi|212723456|ref|NP_001131259.1| hypothetical protein LOC100192572 [Zea mays]
Length = 208
Score = 58 bits (139), Expect = 2e-006
Identities = 28/84 (33%), Positives = 45/84 (53%)
Frame = -2
Query: 441 RFALKSSLKKRSFSDAVLADEDDNDDDDVGRDNGVLNHADRRKVQWPDTCGIEIAEVREF 262
R LKS+ K+ S ++ E + + + + + +RRKVQW DTCG ++ E+REF
Sbjct: 124 RGCLKSNSKRDSLEHRIVVSEGEEPRESLEEVQTLRSSMERRKVQWTDTCGKDLFEIREF 183
Query: 261 EPSEVDELDDELHHGNRKSCMCTI 190
E S+ DD+ + + C C I
Sbjct: 184 ETSDESLSDDDPENEGFRKCECVI 207
>gi|215768969|dbj|BAH01198.1| unnamed protein product [Oryza sativa Japonica
Group]
Length = 204
Score = 58 bits (138), Expect = 2e-006
Identities = 27/81 (33%), Positives = 44/81 (54%)
Frame = -2
Query: 432 LKSSLKKRSFSDAVLADEDDNDDDDVGRDNGVLNHADRRKVQWPDTCGIEIAEVREFEPS 253
LKS+ ++ S ++ E + + + + + +RRKVQW DTCG E+ E+REFE S
Sbjct: 123 LKSNSRRDSLEHCIVVSEGEEPRESLEEVQTLKSGMERRKVQWTDTCGKELFEIREFEAS 182
Query: 252 EVDELDDELHHGNRKSCMCTI 190
+ DD+ + + C C I
Sbjct: 183 DEGLSDDDTENEGFRKCECVI 203
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 43,161,226,205
Number of Sequences: 15229318
Number of Extensions: 43161226205
Number of Successful Extensions: 25760657
Number of sequences better than 0.0: 0
|