BLASTX 7.6.2
Query= UN33625 /QuerySize=1287
(1286 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|297789040|ref|XP_002862532.1| hypothetical protein ARALYDRAFT... 233 6e-059
gi|15225907|ref|NP_182114.1| Phosphatidylinositol N-acetyglucosa... 223 6e-056
gi|297828315|ref|XP_002882040.1| hypothetical protein ARALYDRAFT... 176 9e-042
gi|15233092|ref|NP_191697.1| Phosphatidylinositol N-acetyglucosa... 150 7e-034
gi|110738951|dbj|BAF01396.1| hypothetical protein [Arabidopsis t... 150 7e-034
gi|297852330|ref|XP_002894046.1| hypothetical protein ARALYDRAFT... 134 3e-029
gi|297820990|ref|XP_002878378.1| hypothetical protein ARALYDRAFT... 126 1e-026
gi|297814285|ref|XP_002875026.1| hypothetical protein ARALYDRAFT... 74 6e-011
gi|255541600|ref|XP_002511864.1| conserved hypothetical protein ... 59 2e-006
gi|224067898|ref|XP_002302588.1| predicted protein [Populus tric... 58 4e-006
>gi|297789040|ref|XP_002862532.1| hypothetical protein ARALYDRAFT_497410
[Arabidopsis lyrata subsp. lyrata]
Length = 613
Score = 233 bits (593), Expect = 6e-059
Identities = 144/252 (57%), Positives = 169/252 (67%), Gaps = 37/252 (14%)
Frame = +1
Query: 253 KSVKKLIEEEIDEKTHQRCDTTECRGKTKARNEKRRSRTCSKAS---------DDEDHAE 405
+SVKKLIE+EIDEKT Q+C+ ARN KRRSRTC K S DD+DHAE
Sbjct: 67 QSVKKLIEDEIDEKTKQKCE---------ARNRKRRSRTCIKTSEDINVLIVGDDDDHAE 117
Query: 406 ---NQCPRTSQIDADSINDDSEEKFSELIRRLIDQKESEVESCKNLVDDDD---DSKEES 567
+QCPR SQ + D +NDDSEEKFSELI+RLI QKESEV SCK ++D DSKEES
Sbjct: 118 KAGDQCPRISQNEVDLVNDDSEEKFSELIKRLIAQKESEVGSCKKNLEDAFQVLDSKEES 177
Query: 568 FLNIGTSPNKETSSPSE--SRAQTIVVLKPEPNCLNIGSSPGR----NKAKNERFGSRFI 729
FLNIGT ++++ +E QTIV+LKPEPN L++GSSPG NK KN RF SRFI
Sbjct: 178 FLNIGTPISRDSQRINELTQCRQTIVILKPEPNSLDVGSSPGTPSTDNKTKNGRFSSRFI 237
Query: 730 LSRIRRRLTPA--KNPCNAQHESDQDPDALSSTMSQNSCLEDGEILPDNSSKSEANEE-- 897
LSRIRRRL A KNPCNAQ +SD DPDALSS MSQN CL GE + N K ++ E
Sbjct: 238 LSRIRRRLKFAVGKNPCNAQQDSDPDPDALSSNMSQNCCL--GEAIETNPGKGVSDGETL 295
Query: 898 -DTNQGREDSKK 930
D RE +K+
Sbjct: 296 PDIASKREANKE 307
Score = 102 bits (252), Expect = 2e-019
Identities = 53/75 (70%), Positives = 60/75 (80%), Gaps = 1/75 (1%)
Frame = +1
Query: 841 LEDGEILPDNSSKSEANEEDTNQGREDSKKSMCGIYIAAKKHLSEMLAEGDMNVDLPDKE 1020
+ DGE LPD +SK EAN+EDT EDSKK+MCGIYIAAKKHLSEMLAE D + D PDKE
Sbjct: 289 VSDGETLPDIASKREANKEDTIHESEDSKKNMCGIYIAAKKHLSEMLAERD-DADSPDKE 347
Query: 1021 VPRLLGKILALPQFS 1065
VPR+LGKIL+ P S
Sbjct: 348 VPRILGKILSPPDSS 362
>gi|15225907|ref|NP_182114.1| Phosphatidylinositol
N-acetyglucosaminlytransferase subunit P-like protein [Arabidopsis
thaliana]
Length = 720
Score = 223 bits (567), Expect = 6e-056
Identities = 143/249 (57%), Positives = 168/249 (67%), Gaps = 37/249 (14%)
Frame = +1
Query: 253 KSVKKLIEEEIDEKTHQRCDTTECRGKTKARNEKRRSRTCSKAS----------DDEDHA 402
+SVKKLIE EIDEKT Q+C+ ARN KRRSRTCSK S DD+DHA
Sbjct: 78 QSVKKLIEAEIDEKTTQKCE---------ARNRKRRSRTCSKISEDINVLIAGDDDDDHA 128
Query: 403 E---NQCPRTSQIDADSINDDSEEKFSELIRRLIDQKESEVESC-KNLVDDDD--DSKEE 564
E N+CP S + D +NDDSEEKFSELI+RLI QKESEVESC KNLVD DSKEE
Sbjct: 129 EKSDNECPIVSHNEVDMVNDDSEEKFSELIKRLIAQKESEVESCKKNLVDAFQVLDSKEE 188
Query: 565 SFLNIGTSPNKETSSPSESRAQTIVVLKPEPNCLNIGSSPGR----NKAKNERFGSRFIL 732
S LNIGT + ++ E+ QTIV+LKPEPN L++GSSPG NKAKNE+F SRF L
Sbjct: 189 S-LNIGTPTSGDSQRIKET--QTIVILKPEPNTLDVGSSPGTPSTDNKAKNEKFSSRFSL 245
Query: 733 SRIRRRLTPA--KNPCNAQHESDQDP--DALSSTMSQNSCLEDGEILPDNSSKSEANEED 900
SRIRRRL A KNPCNAQH+SD DP DALSS+MSQN CL + EI + S E +
Sbjct: 246 SRIRRRLKFAVGKNPCNAQHDSDPDPDADALSSSMSQNCCLGE-EIETNPGSDGEILPDI 304
Query: 901 TNQGREDSK 927
++G + +
Sbjct: 305 ASKGEANKE 313
Score = 174 bits (441), Expect = 3e-041
Identities = 91/122 (74%), Positives = 104/122 (85%), Gaps = 5/122 (4%)
Frame = +1
Query: 847 DGEILPDNSSKSEANEEDT-NQGREDSKKSMCGIYIAAKKHLSEMLAEGDMNVDLPDKEV 1023
DGEILPD +SK EAN+EDT ++ +DSKKSMCGIYIAAKKHLSEMLAEGD++ DLPDKEV
Sbjct: 297 DGEILPDIASKGEANKEDTFHESEKDSKKSMCGIYIAAKKHLSEMLAEGDIDADLPDKEV 356
Query: 1024 PRLLGKILALPQFSTPDYTPRMTLAHDFVGHQITEKPNIQQCSSED-YSETLGLDSNKHE 1200
PR+LGKILALP+F TP+ +PR+TLA D HQI EKPNIQQCSS+D Y E L LDSN HE
Sbjct: 357 PRILGKILALPEFFTPENSPRVTLALD---HQIIEKPNIQQCSSKDYYYEPLRLDSNNHE 413
Query: 1201 ET 1206
ET
Sbjct: 414 ET 415
>gi|297828315|ref|XP_002882040.1| hypothetical protein ARALYDRAFT_483730
[Arabidopsis lyrata subsp. lyrata]
Length = 702
Score = 176 bits (445), Expect = 9e-042
Identities = 109/238 (45%), Positives = 144/238 (60%), Gaps = 20/238 (8%)
Frame = +1
Query: 253 KSVKKLIEEEIDEKTHQRCDTTECRGKTKARNEKRRSRTCSKASDDEDHAE---NQCPRT 423
+SVKKLIE+EIDEKT Q+C+ + +++ ++ DD+DHAE +QCPR
Sbjct: 78 QSVKKLIEDEIDEKTKQKCEARNRKRRSRTCSKISEDINVLIVGDDDDHAEKADDQCPRI 137
Query: 424 SQIDADSINDDSEEKFSELIRRLIDQKESEVESCKNLVDDDDDSKEESFLNIGTSPNKET 603
SQ + D + DDSEEKFSELI+R + ++ ++D ++S + S + +
Sbjct: 138 SQNEVDLVPDDSEEKFSELIKRC----KKNLDDAFQVLDSKEESFLN--IGTPISRDSQR 191
Query: 604 SSPSESRAQTIVVLKPEPNCLNIGSSPGR----NKAKNERFGSRFILSRIRRRLTPA--K 765
+ TIV+LKPEPN L++GSSPG NK KN RF SRFILSRIRRRL A K
Sbjct: 192 INELTQCRHTIVILKPEPNSLDVGSSPGTPSTDNKTKNGRFSSRFILSRIRRRLKFAVGK 251
Query: 766 NPCNAQHESDQDPDALSSTMSQNSCLEDGEILPDNSSKSEANEE---DTNQGREDSKK 930
NPCNAQH+SD DPDALSS MSQN CL GE + N K ++ E D RE +K+
Sbjct: 252 NPCNAQHDSDPDPDALSSNMSQNCCL--GEAIETNPGKGVSDGETLPDIASKREANKE 307
Score = 146 bits (367), Expect = 1e-032
Identities = 75/108 (69%), Positives = 87/108 (80%), Gaps = 4/108 (3%)
Frame = +1
Query: 841 LEDGEILPDNSSKSEANEEDTNQGREDSKKSMCGIYIAAKKHLSEMLAEGDMNVDLPDKE 1020
+ DGE LPD +SK EAN+EDT EDSKK+MCGIYIAAKKHLSEMLAEGD + D PDKE
Sbjct: 289 VSDGETLPDIASKREANKEDTIHESEDSKKNMCGIYIAAKKHLSEMLAEGD-DADSPDKE 347
Query: 1021 VPRLLGKILALPQFSTPDYTPRMTLAHDFVGHQITEKPNIQQCSSEDY 1164
VPR+LGKIL+ P+FSTPD +PR+ LA D HQ+ +KP IQQCSSE Y
Sbjct: 348 VPRILGKILSPPEFSTPDNSPRVNLALD---HQLIDKPKIQQCSSEGY 392
>gi|15233092|ref|NP_191697.1| Phosphatidylinositol
N-acetyglucosaminlytransferase subunit P-like protein [Arabidopsis
thaliana]
Length = 718
Score = 150 bits (377), Expect = 7e-034
Identities = 93/215 (43%), Positives = 132/215 (61%), Gaps = 14/215 (6%)
Frame = +1
Query: 625 AQTIVVLKPEPNCLNIGSSPG----RNKAKNERFGSRFILSRIRRRLTPA--KNPCNA-- 780
A+TIVVLKP PN L++ SS G NK+K R SRF++ ++RRL A K C+
Sbjct: 190 AKTIVVLKPGPNTLDVDSSTGLHSTANKSKTGRTFSRFLIGLVKRRLQSAVGKKSCDVSV 249
Query: 781 --QHESDQDPDALSSTMSQNSCLEDGEILPDNSSKSEANEEDTNQGREDSKKSMCGIYIA 954
+ ++ + + S + + D E + +E +E+T EDSKK M G+YIA
Sbjct: 250 DKRSQNCSTQEEIQSKSEEKHDVSDKEEPFCDERTTEDGKEETIYSSEDSKKIMSGLYIA 309
Query: 955 AKKHLSEMLAEGDMNVDLPDKEVPRLLGKILALPQFSTPDYTPRMTLAHDFVG--HQITE 1128
AKKHLSEMLA GD++V+LPDKEVPR+LGKIL+LP+F +P +PR+ AHD V Q TE
Sbjct: 310 AKKHLSEMLANGDIDVNLPDKEVPRILGKILSLPEFCSPADSPRLIPAHDLVSTLSQTTE 369
Query: 1129 KPNIQQC--SSEDYSETLGLDSNKHEETTSTSDMS 1227
+P I Q +S ++ + DS+K ++T T D+S
Sbjct: 370 QPEILQTPETSSATNDLIDEDSDKDDDTLFTIDVS 404
>gi|110738951|dbj|BAF01396.1| hypothetical protein [Arabidopsis thaliana]
Length = 715
Score = 150 bits (377), Expect = 7e-034
Identities = 93/215 (43%), Positives = 132/215 (61%), Gaps = 14/215 (6%)
Frame = +1
Query: 625 AQTIVVLKPEPNCLNIGSSPG----RNKAKNERFGSRFILSRIRRRLTPA--KNPCNA-- 780
A+TIVVLKP PN L++ SS G NK+K R SRF++ ++RRL A K C+
Sbjct: 187 AKTIVVLKPGPNTLDVDSSTGLHSTANKSKTGRTFSRFLIGLVKRRLQSAVGKKSCDVSV 246
Query: 781 --QHESDQDPDALSSTMSQNSCLEDGEILPDNSSKSEANEEDTNQGREDSKKSMCGIYIA 954
+ ++ + + S + + D E + +E +E+T EDSKK M G+YIA
Sbjct: 247 DKRSQNCSTQEEIQSKSEEKHDVSDKEEPFCDERTTEDGKEETIYSSEDSKKIMSGLYIA 306
Query: 955 AKKHLSEMLAEGDMNVDLPDKEVPRLLGKILALPQFSTPDYTPRMTLAHDFVG--HQITE 1128
AKKHLSEMLA GD++V+LPDKEVPR+LGKIL+LP+F +P +PR+ AHD V Q TE
Sbjct: 307 AKKHLSEMLANGDIDVNLPDKEVPRILGKILSLPEFCSPADSPRLIPAHDLVSTLSQTTE 366
Query: 1129 KPNIQQC--SSEDYSETLGLDSNKHEETTSTSDMS 1227
+P I Q +S ++ + DS+K ++T T D+S
Sbjct: 367 QPEILQTPETSSATNDLIDEDSDKDDDTLFTIDVS 401
>gi|297852330|ref|XP_002894046.1| hypothetical protein ARALYDRAFT_891519
[Arabidopsis lyrata subsp. lyrata]
Length = 416
Score = 134 bits (337), Expect = 3e-029
Identities = 90/211 (42%), Positives = 123/211 (58%), Gaps = 23/211 (10%)
Frame = +1
Query: 625 AQTIVVLKPEPNCLNIGSSPGRNKAKNERFGSRFILSRIRRRLTPA--KNPC----NAQH 786
A+TIVVLKP N L++ SS G + S I+RRL A K C N +
Sbjct: 156 AKTIVVLKPGSNTLDVDSSSG-------------VHSTIKRRLQSAVGKKSCDVSVNKRS 202
Query: 787 ESDQDPDALSSTMSQNSCLEDGEILPDNSSKSEANEEDTNQGREDSKKSMCGIYIAAKKH 966
++ + + S + + D E N SE +E+T EDSKK M G+YIAAKKH
Sbjct: 203 QNCYMQEEIQSKSEEKHDVSDKEEPFCNERTSEDGKEETIYSSEDSKKIMSGLYIAAKKH 262
Query: 967 LSEMLAEGDMNVDLPDKEVPRLLGKILALPQFSTPDYTPRMTLAHDFVG--HQITEKPNI 1140
LSEMLA+GD++V+LPDKEVPR+LGKIL L +F +P +PR+ AH+ V Q TEKP I
Sbjct: 263 LSEMLAKGDIDVNLPDKEVPRILGKILYLHEFCSPADSPRLIPAHNLVSTLSQTTEKPEI 322
Query: 1141 QQC--SSEDYSETLGLDSNKHEETTSTSDMS 1227
Q +S ++ + DS+K ++T ST D+S
Sbjct: 323 LQTPETSSATNDLIDEDSDKEDDTLSTIDVS 353
>gi|297820990|ref|XP_002878378.1| hypothetical protein ARALYDRAFT_486612
[Arabidopsis lyrata subsp. lyrata]
Length = 714
Score = 126 bits (314), Expect = 1e-026
Identities = 80/188 (42%), Positives = 111/188 (59%), Gaps = 10/188 (5%)
Frame = +1
Query: 691 NKAKNERFGSRFILSRIRRRLTPA--KNPCNAQHESDQDPDALSSTMSQNS--CLEDGEI 858
NK+K + SRF++ I+ RL A K C+ + + + S + D E
Sbjct: 215 NKSKTGKKISRFLIGLIKGRLQSAVRKKSCDVPADKMSQNCCVQEEIQSKSEKHVSDKEE 274
Query: 859 LPDNSSKSEANEEDTNQGREDSKKSMCGIYIAAKKHLSEMLAEGDMNVDLPDKEVPRLLG 1038
N ++ ++E+T EDSKK M G+YIAAKKHLSEMLA GD++V+LPDKEVPR+LG
Sbjct: 275 PVCNERTTQDDKEETIYSSEDSKKIMSGLYIAAKKHLSEMLANGDIDVNLPDKEVPRILG 334
Query: 1039 KILALPQFSTPDYTPRMTLAHDFVGH-----QITEKPNIQQCSSEDYSETLGLDSNKHEE 1203
KIL+LP+F +P +P + AHD V + Q TEKP I QCSS ++ DS+K +
Sbjct: 335 KILSLPEFCSPADSPILIPAHDLVCNLSPLSQPTEKPEILQCSSAT-NDLTDEDSDKDDN 393
Query: 1204 TTSTSDMS 1227
T T D+S
Sbjct: 394 TLFTIDVS 401
>gi|297814285|ref|XP_002875026.1| hypothetical protein ARALYDRAFT_490525
[Arabidopsis lyrata subsp. lyrata]
Length = 814
Score = 74 bits (179), Expect = 6e-011
Identities = 65/224 (29%), Positives = 103/224 (45%), Gaps = 33/224 (14%)
Frame = +1
Query: 460 EEKFSELIR--RLIDQKESEVESCKNLVDDDD---DSKEESFLNIGTSPNKETSSPSESR 624
EE F +L++ ++ +E ES + D K SF +P +E +
Sbjct: 234 EELFLKLLQDPEILVPREKGAESLSLFESEQSSLADKKWSSFFRRKDAPQEECEA----- 288
Query: 625 AQTIVVLKPEP---NCLNIGSSPGR--------NKAKNERFGSRFILSRIRRRLTPAKNP 771
+ I +LKP + +IG+S G NK +NER S F LS I+R+L K+
Sbjct: 289 SDRIFILKPRSASFSSPDIGNSRGSSPDSHLMGNKLQNERNSSHFFLSEIKRKL---KHA 345
Query: 772 CNAQHESDQDPDALSSTMSQNSCLEDGEILPDNSSKSEANEEDTNQGREDSKKSMCGIYI 951
+ + + T ++ + P S K NE +D K+ + IYI
Sbjct: 346 IKKEQPAGGFGEGFPKT--KDHFFLERMARPSTSQKKSHNE-------DDKKQRVSNIYI 396
Query: 952 AAKKHLSEMLAEGDMNVDLPDKEVPRLLGKILALPQFSTPDYTP 1083
AKKHLSEML GD++ ++V R LG+IL+ P++ +P +P
Sbjct: 397 EAKKHLSEMLNNGDLDSKSTSRQVQRSLGRILSFPEYLSPLNSP 440
>gi|255541600|ref|XP_002511864.1| conserved hypothetical protein [Ricinus
communis]
Length = 999
Score = 59 bits (140), Expect = 2e-006
Identities = 41/128 (32%), Positives = 63/128 (49%), Gaps = 8/128 (6%)
Frame = +1
Query: 862 PDNSSKSEANEE----DTNQGRE---DSKKSMCGIYIAAKKHLSEMLAEGDMNVDLPDKE 1020
P K E ++ +T RE +SK+ + IY+ AKKHLSEM+ G+ D ++
Sbjct: 497 PSGVKKGEKTDKSKVCETGVERETGNNSKQRLTNIYVEAKKHLSEMVTSGNGEGDFSSRQ 556
Query: 1021 VPRLLGKILALPQFS-TPDYTPRMTLAHDFVGHQITEKPNIQQCSSEDYSETLGLDSNKH 1197
VPR LG+IL+LP+++ +P +P FV Q+ N + E+ LG +
Sbjct: 557 VPRTLGRILSLPEYNCSPFGSPGRDWGQSFVTAQMRFSANDKFQKQENNVSHLGRMTLNS 616
Query: 1198 EETTSTSD 1221
E SD
Sbjct: 617 ESELCASD 624
>gi|224067898|ref|XP_002302588.1| predicted protein [Populus trichocarpa]
Length = 778
Score = 58 bits (138), Expect = 4e-006
Identities = 38/104 (36%), Positives = 55/104 (52%), Gaps = 1/104 (0%)
Frame = +1
Query: 925 KKSMCGIYIAAKKHLSEMLAEGDMNVDLPDKEVPRLLGKILALPQFS-TPDYTPRMTLAH 1101
K IYI AKKHLSEML+ G +VD ++VP+ LG+IL+LP++S +P +P
Sbjct: 302 KHRASNIYIEAKKHLSEMLSTGQGDVDFSSEQVPKTLGRILSLPEYSLSPTGSPGKDWEQ 361
Query: 1102 DFVGHQITEKPNIQQCSSEDYSETLGLDSNKHEETTSTSDMSGD 1233
F+ Q+ N + E LG + E +S S+ S D
Sbjct: 362 GFLTAQMRFSANDKFQKHEANVSHLGRIALNSEPQSSVSNDSTD 405
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,824,494,299,012
Number of Sequences: 15229318
Number of Extensions: 3824494299012
Number of Successful Extensions: 893933318
Number of sequences better than 0.0: 0
|