BLASTX 7.6.2
Query= UN40348 /QuerySize=1254
(1253 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|297839681|ref|XP_002887722.1| hypothetical protein ARALYDRAFT... 502 5e-140
gi|15218227|ref|NP_177935.1| uncharacterized protein [Arabidopsi... 500 3e-139
gi|62321237|dbj|BAD94415.1| hypothetical protein [Arabidopsis th... 356 6e-096
gi|297845240|ref|XP_002890501.1| hypothetical protein ARALYDRAFT... 322 1e-085
gi|15219847|ref|NP_173642.1| uncharacterized protein [Arabidopsi... 315 2e-083
gi|9454527|gb|AAF87850.1|AC073942_4 Contains a weak similarity t... 297 4e-078
gi|224063213|ref|XP_002301044.1| predicted protein [Populus tric... 183 7e-044
gi|224084680|ref|XP_002307386.1| predicted protein [Populus tric... 181 4e-043
gi|225459336|ref|XP_002284185.1| PREDICTED: hypothetical protein... 174 4e-041
gi|224127292|ref|XP_002320038.1| predicted protein [Populus tric... 122 2e-025
>gi|297839681|ref|XP_002887722.1| hypothetical protein ARALYDRAFT_476978
[Arabidopsis lyrata subsp. lyrata]
Length = 344
Score = 502 bits (1292), Expect = 5e-140
Identities = 268/350 (76%), Positives = 281/350 (80%), Gaps = 39/350 (11%)
Frame = +1
Query: 64 MIKGRDGNRGSSSSGYSADLLVCFPSRTHLAFAPKPICSPSRPS-ISTNRRPHHRRQLSK 240
MIKG +GNRGSSSSGYSADLLVCFPSR HLA PKPICSPSRPS STNRRP HRRQLSK
Sbjct: 1 MIKGNNGNRGSSSSGYSADLLVCFPSRAHLALTPKPICSPSRPSDSSTNRRPQHRRQLSK 60
Query: 241 LSNGGGGHGSPALWAKQASSKNMGNDETAEPTSPKVTCAGQIKVRPRKCGGKGKNWRSVM 420
LS GGGGHGSP LWAKQASSKNMG DE AEPTSPKVTCAGQIKVRP KCGG+GKNW+SVM
Sbjct: 61 LSGGGGGHGSPVLWAKQASSKNMGGDEIAEPTSPKVTCAGQIKVRPSKCGGRGKNWQSVM 120
Query: 421 EEIERIHSSNKSQSKFLGLKKDVMGFLACLRNIKFDIRCFGDFRHATDVTSDDEEEEEDD 600
EEIERIH N+SQSKF GLKKDVMGFL CLRNIKFD RCFGDFRHA DVTSDD+EEE+DD
Sbjct: 121 EEIERIH-DNRSQSKFFGLKKDVMGFLTCLRNIKFDFRCFGDFRHA-DVTSDDDEEEDDD 178
Query: 601 ------------EEEQKTVFSKWFMVLQEEEQSTNDDNKNNKKC--------------VV 702
EE KTVFSKWFMVLQ EEQS DD+KNN KC V
Sbjct: 179 DDDEEEEGEREEEENSKTVFSKWFMVLQ-EEQSNQDDDKNNNKCDEKRDLEDTETEPAVP 237
Query: 703 DENALLLMRCRSAPSKSWLEERMQVKTEHGNRE------EEEETEDQEMSVNKKKNKKDL 864
NALLLMRCRSAP+KSWLEERM+VKTE NRE EE+ETEDQE S+ K KKDL
Sbjct: 238 PPNALLLMRCRSAPAKSWLEERMKVKTEQENREEQKEEKEEKETEDQETSM--KTKKKDL 295
Query: 865 GSLMEEENMELVLMRYDTDYYRLSSDIAKETWVVGGINQDPLSRSRSWKS 1014
SLMEEE MELVLMRYDT++YRLSSDIAKETWVVGGI QDPLSRSRSWKS
Sbjct: 296 RSLMEEEKMELVLMRYDTEFYRLSSDIAKETWVVGGI-QDPLSRSRSWKS 344
>gi|15218227|ref|NP_177935.1| uncharacterized protein [Arabidopsis thaliana]
Length = 342
Score = 500 bits (1285), Expect = 3e-139
Identities = 266/348 (76%), Positives = 284/348 (81%), Gaps = 37/348 (10%)
Frame = +1
Query: 64 MIKGRDGNRGSSSSGYSADLLVCFPSRTHLAFAPKPICSPSRPS-ISTNRRPHHRRQLSK 240
MIKG +GNRGSSSSGYSADLLVCFPSRTHLA PKPICSPSRPS STNRRPHHRRQLSK
Sbjct: 1 MIKGNNGNRGSSSSGYSADLLVCFPSRTHLALTPKPICSPSRPSDSSTNRRPHHRRQLSK 60
Query: 241 LS-NGGGGHGSPALWAKQASSKNMGNDETAEPTSPKVTCAGQIKVRPRKCGGKGKNWRSV 417
LS GGGGHGSP LWAKQASSKNMG DE AEPTSPKVTCAGQIKVRP KCGG+GKNW+SV
Sbjct: 61 LSGGGGGGHGSPVLWAKQASSKNMGGDEIAEPTSPKVTCAGQIKVRPSKCGGRGKNWQSV 120
Query: 418 MEEIERIHSSNKSQSKFLGLKKDVMGFLACLRNIKFDIRCFGDFRHATDVTSDDEEEEED 597
MEEIERIH N+SQSKF GLKKDVMGFL CLRNIKFD RCFGDFRHA DVTSDD+EEE+D
Sbjct: 121 MEEIERIH-DNRSQSKFFGLKKDVMGFLTCLRNIKFDFRCFGDFRHA-DVTSDDDEEEDD 178
Query: 598 DEEEQ------------KTVFSKWFMVLQEEEQSTNDDNKNNKKC--------------V 699
D++E+ KTVFSKWFMVLQ EEQ+ DD+KNN KC V
Sbjct: 179 DDDEEEEVVEGEEEENSKTVFSKWFMVLQ-EEQNNKDDDKNNNKCDEKRDLEDTETEPAV 237
Query: 700 VDENALLLMRCRSAPSKSWLEERMQVKTEHGNRE---EEEETEDQEMSVNKKKNKKDLGS 870
NALLLMRCRSAP+KSWLEERM+VKTE RE EE+ETEDQE S+ K KKDL S
Sbjct: 238 PPPNALLLMRCRSAPAKSWLEERMKVKTEQEKREEQKEEKETEDQETSM--KTKKKDLRS 295
Query: 871 LMEEENMELVLMRYDTDYYRLSSDIAKETWVVGGINQDPLSRSRSWKS 1014
LMEEE MELVLMRYDT++YRLSSDIAKETWVVGGI QDPLSRSRSWK+
Sbjct: 296 LMEEEKMELVLMRYDTEFYRLSSDIAKETWVVGGI-QDPLSRSRSWKN 342
>gi|62321237|dbj|BAD94415.1| hypothetical protein [Arabidopsis thaliana]
Length = 259
Score = 356 bits (912), Expect = 6e-096
Identities = 192/265 (72%), Positives = 209/265 (78%), Gaps = 35/265 (13%)
Frame = +1
Query: 307 MGNDETAEPTSPKVTCAGQIKVRPRKCGGKGKNWRSVMEEIERIHSSNKSQSKFLGLKKD 486
MG DE AEPTSPKVTCAGQIKVRP KCGG+GKNW+SVMEEIERIH N+SQSKF GLKKD
Sbjct: 1 MGGDEIAEPTSPKVTCAGQIKVRPSKCGGRGKNWQSVMEEIERIH-DNRSQSKFFGLKKD 59
Query: 487 VMGFLACLRNIKFDIRCFGDFRHATDVTSDDEEEEEDDEEEQ------------KTVFSK 630
VMGFL CLRNIKFD RCFGDFRHA DVTSDD+EEE+DD++E+ KTVFSK
Sbjct: 60 VMGFLTCLRNIKFDFRCFGDFRHA-DVTSDDDEEEDDDDDEEEEVVEGEEEENSKTVFSK 118
Query: 631 WFMVLQEEEQSTNDDNKNNKKC--------------VVDENALLLMRCRSAPSKSWLEER 768
WFMVLQ EEQ+ DD+KNN KC V NALLLMRCRSAP+KSWLEER
Sbjct: 119 WFMVLQ-EEQNNKDDDKNNNKCDEKRDLEDTETEPAVPPPNALLLMRCRSAPAKSWLEER 177
Query: 769 MQVKTEHGNRE---EEEETEDQEMSVNKKKNKKDLGSLMEEENMELVLMRYDTDYYRLSS 939
M+VKTE RE EE+ETEDQE S+ K KKDL SLMEEE MELVLMRYDT++YRLSS
Sbjct: 178 MKVKTEQEKREEQKEEKETEDQETSM--KTKKKDLRSLMEEEKMELVLMRYDTEFYRLSS 235
Query: 940 DIAKETWVVGGINQDPLSRSRSWKS 1014
DIAKETWVVGGI QDPLSRSRSWK+
Sbjct: 236 DIAKETWVVGGI-QDPLSRSRSWKN 259
>gi|297845240|ref|XP_002890501.1| hypothetical protein ARALYDRAFT_472459
[Arabidopsis lyrata subsp. lyrata]
Length = 318
Score = 322 bits (823), Expect = 1e-085
Identities = 182/327 (55%), Positives = 217/327 (66%), Gaps = 35/327 (10%)
Frame = +1
Query: 91 GSSSSGYSADLLVCFPSRTHLAFAPKPICSPSRPSISTNRRPHHRRQLSKLSNGGGGHGS 270
G SSGYSADL+VCFPSRTHL+ K I SPS PHHRR +SKLS GGG
Sbjct: 6 GGKSSGYSADLMVCFPSRTHLSLPSKSISSPSHSFNRRQNAPHHRRSISKLSGSGGG--- 62
Query: 271 PALWAKQASSKNMGNDETAEPTSPKVTCAGQIKVRPRKCGGKGKNWRSVMEEIERIHSSN 450
S+ G + EPTSPKVTCAGQIKVR K G KNW+S+M EIE+IH S
Sbjct: 63 ------VRQSRGGGREVVEEPTSPKVTCAGQIKVRSSKRDGGSKNWQSLMAEIEKIHRS- 115
Query: 451 KSQSKFLGLKKDVMGFLACLRNIKFDIRCFGDFRHATDVTSDDEEEEEDDEEEQK----- 615
KS+SKF G+K+DVMGFL CLR+ FD RCFG F ++ DDEE+EE++EEE++
Sbjct: 116 KSESKFFGIKRDVMGFLTCLRD--FDFRCFGAFPPVDIISDDDEEDEEEEEEEEEDEDES 173
Query: 616 --TVFSKWFMVLQEEEQSTNDDNKNNKK--------CVVDENALLLMRCRSAPSKSWLEE 765
TVFSKW MVL E++ N++ N K+ V NALLLMRCRSAP K+WLEE
Sbjct: 174 SGTVFSKWLMVLHEKQ--NNEECVNEKENAFSDVETAVPPPNALLLMRCRSAPVKNWLEE 231
Query: 766 RMQVKTEHGNR----EEEEETEDQEMSVNKKKNKKDLGSLMEEE-NMELVLMRYDTDYYR 930
+ + E NR EEEE E++E + +NKKDL SLMEEE M LV+M YDT+YY+
Sbjct: 232 KKEETEEGENRVKQSGEEEEEEEEEEEKERVRNKKDLRSLMEEEKKMNLVVMNYDTNYYK 291
Query: 931 LSSDIAKETWVVGGINQDPLSRSRSWK 1011
LS+DIAKETWVVGGI QDPL RSRSWK
Sbjct: 292 LSTDIAKETWVVGGI-QDPLFRSRSWK 317
>gi|15219847|ref|NP_173642.1| uncharacterized protein [Arabidopsis thaliana]
Length = 314
Score = 315 bits (805), Expect = 2e-083
Identities = 178/323 (55%), Positives = 216/323 (66%), Gaps = 31/323 (9%)
Frame = +1
Query: 91 GSSSSGYSADLLVCFPSRTHLAFAPKPICSPSRPSISTNRRPHHRRQLSKLSNGGGGHGS 270
G SSGYSADL+VCFPSR HL+ K I SPS PHHRR +SKLS+ GGG
Sbjct: 6 GGKSSGYSADLMVCFPSRAHLSLPSKSISSPSSSFNRRQNAPHHRRSISKLSSSGGG--- 62
Query: 271 PALWAKQASSKNMGNDETAEPTSPKVTCAGQIKVRPRKCGGKGKNWRSVMEEIERIHSSN 450
++ G + EPTSPKVTCAGQIKVR K G GKNW+S+M EIE+IH S
Sbjct: 63 ------VRQNRGGGREVVEEPTSPKVTCAGQIKVRSSKRDGGGKNWQSLMAEIEKIHRS- 115
Query: 451 KSQSKFLGLKKDVMGFLACLRNIKFDIRCFGDFRHATDVTSDDEEEEEDDEEEQK----- 615
KS+SKF G+K+DVMGFL CLR+ FD RCFG F D+ SDDEEE+E++EEE +
Sbjct: 116 KSESKFFGIKRDVMGFLTCLRD--FDFRCFGAF-PPVDIISDDEEEDEEEEEEDEEEDED 172
Query: 616 ----TVFSKWFMVLQEEEQSTN-DDNKNN-----KKCVVDENALLLMRCRSAPSKSWLEE 765
TVFSKW MVL E++ + D K N + V NALLLMRCRSAP K+W EE
Sbjct: 173 ESSGTVFSKWLMVLHEKQNNEECVDGKENVFSDVETAVPPPNALLLMRCRSAPVKNWSEE 232
Query: 766 RMQVKTEHGNREEEEETEDQEMSVNKKKNKKDLGSLMEEE-NMELVLMRYDTDYYRLSSD 942
+ + +TE G+ ++ E++E ++ NKKDL SLMEEE M LV+M YDT+YY+LS+D
Sbjct: 233 KKE-ETEEGDNRVKQSGEEEEEEKDRVGNKKDLRSLMEEEKKMNLVVMNYDTNYYKLSND 291
Query: 943 IAKETWVVGGINQDPLSRSRSWK 1011
IAKETWVVGGI QDPL RSRSWK
Sbjct: 292 IAKETWVVGGI-QDPLFRSRSWK 313
>gi|9454527|gb|AAF87850.1|AC073942_4 Contains a weak similarity to ELG protein
from Homo sapiens gi|7799418 [Arabidopsis thaliana]
Length = 330
Score = 297 bits (758), Expect = 4e-078
Identities = 168/316 (53%), Positives = 206/316 (65%), Gaps = 30/316 (9%)
Frame = +1
Query: 91 GSSSSGYSADLLVCFPSRTHLAFAPKPICSPSRPSISTNRRPHHRRQLSKLSNGGGGHGS 270
G SSGYSADL+VCFPSR HL+ K I SPS PHHRR +SKLS+ GGG
Sbjct: 6 GGKSSGYSADLMVCFPSRAHLSLPSKSISSPSSSFNRRQNAPHHRRSISKLSSSGGG--- 62
Query: 271 PALWAKQASSKNMGNDETAEPTSPKVTCAGQIKVRPRKCGGKGKNWRSVMEEIERIHSSN 450
++ G + EPTSPKVTCAGQIKVR K G GKNW+S+M EIE+IH S
Sbjct: 63 ------VRQNRGGGREVVEEPTSPKVTCAGQIKVRSSKRDGGGKNWQSLMAEIEKIHRS- 115
Query: 451 KSQSKFLGLKKDVMGFLACLRNIKFDIRCFGDFRHATDVTSDDEEEEEDDEEEQK----- 615
KS+SKF G+K+DVMGFL CLR+ FD RCFG F D+ SDDEEE+E++EEE +
Sbjct: 116 KSESKFFGIKRDVMGFLTCLRD--FDFRCFGAF-PPVDIISDDEEEDEEEEEEDEEEDED 172
Query: 616 ----TVFSKWFMVLQEEEQSTN-DDNKNN-----KKCVVDENALLLMRCRSAPSKSWLEE 765
TVFSKW MVL E++ + D K N + V NALLLMRCRSAP K+W EE
Sbjct: 173 ESSGTVFSKWLMVLHEKQNNEECVDGKENVFSDVETAVPPPNALLLMRCRSAPVKNWSEE 232
Query: 766 RMQVKTEHGNREEEEETEDQEMSVNKKKNKKDLGSLMEEE-NMELVLMRYDTDYYRLSSD 942
+ + +TE G+ ++ E++E ++ NKKDL SLMEEE M LV+M YDT+YY+LS+D
Sbjct: 233 KKE-ETEEGDNRVKQSGEEEEEEKDRVGNKKDLRSLMEEEKKMNLVVMNYDTNYYKLSND 291
Query: 943 IAKETWVVGGINQDPL 990
IAKETWVVG D L
Sbjct: 292 IAKETWVVGESTDDAL 307
>gi|224063213|ref|XP_002301044.1| predicted protein [Populus trichocarpa]
Length = 339
Score = 183 bits (463), Expect = 7e-044
Identities = 107/196 (54%), Positives = 122/196 (62%), Gaps = 24/196 (12%)
Frame = +1
Query: 67 IKGRDGNRGSSSSGYSADLLVCFPSRTHLAFAPKPICSPSR---PSISTNRRPHHRRQ-- 231
+KGR+ R SADLLVCFPSR HL PKPICSP+R PS R HHR+Q
Sbjct: 1 MKGRESRRAP-----SADLLVCFPSRAHLTLMPKPICSPARPLEPSKPHQNRHHHRQQRP 55
Query: 232 --LSKLS-NGGGGHGSPALWAKQASSKNMGNDETAEPTSPKVTCAGQIKVRPRKCGGKGK 402
L K S GGG SP LW K + E +EPTSPKVTCAGQIKVR + K
Sbjct: 56 HHLKKSSPRGGGSRASPLLWTKTRQM----DSELSEPTSPKVTCAGQIKVRHK--ASSCK 109
Query: 403 NWRSVMEEIERIHSSNKSQSK-----FLGLKKDVMGFLACLRNIKFDIRCFGDFRHATDV 567
NW+SVMEEIERIH S KS K LG KKD+M FL CLRNI+FD RCFG F +D+
Sbjct: 110 NWQSVMEEIERIHISKKSTKKSTWLDSLGFKKDIMQFLTCLRNIRFDFRCFGSFPAQSDI 169
Query: 568 TSDDEEEEEDDEEEQK 615
TS+DEEE E+ E Q+
Sbjct: 170 TSNDEEEYEEYGEYQE 185
>gi|224084680|ref|XP_002307386.1| predicted protein [Populus trichocarpa]
Length = 341
Score = 181 bits (457), Expect = 4e-043
Identities = 106/196 (54%), Positives = 129/196 (65%), Gaps = 24/196 (12%)
Frame = +1
Query: 67 IKGRDGNRGSSSSGYSADLLVCFPSRTHLAFAPKPICSPSRPSIST---NRRPHHRRQ-- 231
+KGR+ R SADLLVCFPSR HL PKPICSP+RPS ++ R HH++Q
Sbjct: 1 MKGREIRRAP-----SADLLVCFPSRAHLTLMPKPICSPARPSETSKPRQNRHHHQQQRH 55
Query: 232 --LSKLSNGGGG-HGSPALWAKQASSKNMGNDETAEPTSPKVTCAGQIKVRPRKCGGKGK 402
L K S G G SP LWAK +K MG+ E +EPTSPKVTCAGQIKVR ++ K
Sbjct: 56 HHLKKSSTRGVGIRASPLLWAK---AKQMGS-EVSEPTSPKVTCAGQIKVRHKE--SSCK 109
Query: 403 NWRSVMEEIERIHSSNKSQSK-----FLGLKKDVMGFLACLRNIKFDIRCFGDFRHATDV 567
NW+SVMEEIE+IH+S K K LG KKD+M FL CLRNI+FD RCFG F +D+
Sbjct: 110 NWQSVMEEIEKIHNSRKHTKKSTWIDSLGFKKDIMHFLTCLRNIRFDFRCFGSFPAHSDI 169
Query: 568 TSDDEEEEEDDEEEQK 615
TSDD+E +E+ E Q+
Sbjct: 170 TSDDDEVDEEYEGYQE 185
>gi|225459336|ref|XP_002284185.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 332
Score = 174 bits (439), Expect = 4e-041
Identities = 97/181 (53%), Positives = 116/181 (64%), Gaps = 20/181 (11%)
Frame = +1
Query: 112 SADLLVCFPSRTHLAFAPKPICSPSRPSISTNRRPH--------HRRQLSKLSNGGGGHG 267
SADLLVCFPSR HL PKPICSP+RPS + R H H L K S GG
Sbjct: 11 SADLLVCFPSRAHLTLMPKPICSPARPSEPSKRHQHQHHQHHHNHPHHLKKSSTRNGGQA 70
Query: 268 SPALWAKQASSKNMGNDETAEPTSPKVTCAGQIKVRPRKCGGKGKNWRSVMEEIERIHSS 447
SP LWAK +K MG E +EPTSPKVTCAGQIKVR + KNW+SVMEEIERIH++
Sbjct: 71 SPLLWAK---TKPMGT-EISEPTSPKVTCAGQIKVRHKT--SSCKNWQSVMEEIERIHNN 124
Query: 448 NKSQSK-----FLGLKKDVMGFLACLRNIKFDIRCFGDFRHATDVTSDDEEEEEDDEEEQ 612
K + + LG KK++M FL CLR+I+FD RCFG F + + TSDDEEE+ D +
Sbjct: 125 KKHKKRPTWVEALGFKKEIMQFLTCLRSIRFDFRCFGSFPESGN-TSDDEEEDGDQYQHN 183
Query: 613 K 615
+
Sbjct: 184 Q 184
>gi|224127292|ref|XP_002320038.1| predicted protein [Populus trichocarpa]
Length = 259
Score = 122 bits (304), Expect = 2e-025
Identities = 64/106 (60%), Positives = 75/106 (70%), Gaps = 7/106 (6%)
Frame = +1
Query: 313 NDETAEPTSPKVTCAGQIKVRPRKCGGKGKNWRSVMEEIERIHSSNKSQSK-----FLGL 477
+ E +EPTSPKVTCAGQIKVR + KNW+SVMEEIERIH S KS K LG
Sbjct: 2 DSELSEPTSPKVTCAGQIKVRHK--ASSCKNWQSVMEEIERIHISRKSTKKSTWLDSLGF 59
Query: 478 KKDVMGFLACLRNIKFDIRCFGDFRHATDVTSDDEEEEEDDEEEQK 615
KKD+M FL CLRNI+FD RCFG F +D+TS+DEEE E+ E Q+
Sbjct: 60 KKDIMQFLTCLRNIRFDFRCFGSFPAQSDITSNDEEEYEEYGEYQE 105
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,486,047,059,872
Number of Sequences: 15229318
Number of Extensions: 4486047059872
Number of Successful Extensions: 1052332685
Number of sequences better than 0.0: 0