BLASTX 7.6.2
Query= UN79013 /QuerySize=700
(699 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
                                                                   Score   E
Sequences producing significant alignments:                        (bits)  Value
gi|297793949|ref|XP_002864859.1| VHS domain-containing protein [...  202   5e-050
gi|312283255|dbj|BAJ34493.1| unnamed protein product [Thellungie...  198   6e-049
gi|15242856|ref|NP_201169.1| ENTH/VHS/GAT family protein [Arabid...  195   9e-048
gi|224122768|ref|XP_002330472.1| predicted protein [Populus tric...  147   2e-033
gi|255544385|ref|XP_002513254.1| Hepatocyte growth factor-regula...  130   3e-028
gi|147787190|emb|CAN66834.1| hypothetical protein VITISV_030892 ...  117   2e-024
gi|296082660|emb|CBI21665.3| unnamed protein product [Vitis vini...  108   8e-022
gi|115474421|ref|NP_001060807.1| Os08g0109000 [Oryza sativa Japo...   86   4e-015
gi|125559891|gb|EAZ05339.1| hypothetical protein OsI_27544 [Oryz...   86   4e-015
gi|224145705|ref|XP_002325737.1| predicted protein [Populus tric...   82   5e-014
gi|226508122|ref|NP_001149290.1| LOC100282912 [Zea mays]              75   6e-012
>gi|297793949|ref|XP_002864859.1| VHS domain-containing protein [Arabidopsis
lyrata subsp. lyrata]
Length = 447
Score = 202 bits (512), Expect = 5e-050
Identities = 113/147 (76%), Positives = 119/147 (80%), Gaps = 17/147 (11%)
Frame = -1
Query: 693 SSSIS------IEDDDEEEEPEQLFRRLRKWKARAMPEDEEEASPPPPPQVLLGSAIHNE 532
SSSIS +E++DEEEEPEQLFRRLRK KARA PEDEEE P PPQ L GSAIHNE
Sbjct: 310 SSSISNRAHLKLEEEDEEEEPEQLFRRLRKGKARARPEDEEE---PSPPQGLPGSAIHNE 366
Query: 531 RLNRPLIRPLPSEESPRSSDGHSQSSSSSSSPVVIPPPPAKHVERQKFFKEKKVDGGASG 352
RLNRPLIRPLPSEE+ R D HSQ S PVVIPPPPAKHVER+KFFKEKKVD GASG
Sbjct: 367 RLNRPLIRPLPSEEASRGGDSHSQ-----SPPVVIPPPPAKHVEREKFFKEKKVD-GASG 420
Query: 351 LPGHMRGLSSHS--GSSSRSGSVDFSE 277
LPGHMRGLS HS GSSSRSGSVDFS+
Sbjct: 421 LPGHMRGLSLHSRDGSSSRSGSVDFSD 447
Score = 197 bits (499), Expect = 2e-048
Identities = 106/135 (78%), Positives = 112/135 (82%), Gaps = 11/135 (8%)
Frame = -1
Query: 684 ISIEDDDEEEEPEQLFRRLRKWKARAMPEDEEEASPPPPPQVLLGSAIHNERLNRPLIRP 505
+ +E++DEEEEPEQLFRRLRK KARA PEDEEE P PPQ L GSAIHNERLNRPLIRP
Sbjct: 319 LKLEEEDEEEEPEQLFRRLRKGKARARPEDEEE---PSPPQGLPGSAIHNERLNRPLIRP 375
Query: 504 LPSEESPRSSDGHSQSSSSSSSPVVIPPPPAKHVERQKFFKEKKVDGGASGLPGHMRGLS 325
LPSEE+ R D HSQ S PVVIPPPPAKHVER+KFFKEKKVD GASGLPGHMRGLS
Sbjct: 376 LPSEEASRGGDSHSQ-----SPPVVIPPPPAKHVEREKFFKEKKVD-GASGLPGHMRGLS 429
Query: 324 SHS--GSSSRSGSVD 286
HS GSSSRSGSVD
Sbjct: 430 LHSRDGSSSRSGSVD 444
>gi|312283255|dbj|BAJ34493.1| unnamed protein product [Thellungiella halophila]
Length = 448
Score = 198 bits (503), Expect = 6e-049
Identities = 112/149 (75%), Positives = 118/149 (79%), Gaps = 17/149 (11%)
Frame = -1
Query: 699 KPSSSIS------IEDDDEEEEPEQLFRRLRKWKARAMPEDEEEASPPPPPQVLLGSAIH 538
K SSSIS +E++DEEEEPEQLFRRLRK KARA PEDEEE+S PPQ L GS IH
Sbjct: 309 KSSSSISNSTHLKLEEEDEEEEPEQLFRRLRKGKARARPEDEEESS---PPQGLPGSLIH 365
Query: 537 NERLNRPLIRPLPSEESPRSSDGHSQSSSSSSSPVVIPPPPAKHVERQKFFKEKKVDGGA 358
NERLNRPLIRPLPSEE R D HSQ S PVVIPPPPAKHVER+KFFKEK VD GA
Sbjct: 366 NERLNRPLIRPLPSEERSRGGDSHSQ-----SPPVVIPPPPAKHVEREKFFKEKNVD-GA 419
Query: 357 SGLPGHMRGLSSHS--GSSSRSGSVDFSE 277
SGLPGHMRGLS HS GSSSRSGSVDFS+
Sbjct: 420 SGLPGHMRGLSLHSRDGSSSRSGSVDFSD 448
Score = 193 bits (490), Expect = 2e-047
Identities = 105/138 (76%), Positives = 112/138 (81%), Gaps = 11/138 (7%)
Frame = -1
Query: 693 SSSISIEDDDEEEEPEQLFRRLRKWKARAMPEDEEEASPPPPPQVLLGSAIHNERLNRPL 514
S+ + +E++DEEEEPEQLFRRLRK KARA PEDEEE+S PPQ L GS IHNERLNRPL
Sbjct: 317 STHLKLEEEDEEEEPEQLFRRLRKGKARARPEDEEESS---PPQGLPGSLIHNERLNRPL 373
Query: 513 IRPLPSEESPRSSDGHSQSSSSSSSPVVIPPPPAKHVERQKFFKEKKVDGGASGLPGHMR 334
IRPLPSEE R D HSQ S PVVIPPPPAKHVER+KFFKEK VD GASGLPGHMR
Sbjct: 374 IRPLPSEERSRGGDSHSQ-----SPPVVIPPPPAKHVEREKFFKEKNVD-GASGLPGHMR 427
Query: 333 GLSSHS--GSSSRSGSVD 286
GLS HS GSSSRSGSVD
Sbjct: 428 GLSLHSRDGSSSRSGSVD 445
>gi|15242856|ref|NP_201169.1| ENTH/VHS/GAT family protein [Arabidopsis
thaliana]
Length = 447
Score = 195 bits (493), Expect = 9e-048
Identities = 105/138 (76%), Positives = 112/138 (81%), Gaps = 11/138 (7%)
Frame = -1
Query: 684 ISIEDDDEEEEPEQLFRRLRKWKARAMPEDEEEASPPPPPQVLLGSAIHNERLNRPLIRP 505
+ +E++DEEEEPEQLFRRLRK KARA PEDEEE P PPQ L GSAIHNERLNRPLIRP
Sbjct: 319 LKLEEEDEEEEPEQLFRRLRKGKARARPEDEEE---PSPPQGLPGSAIHNERLNRPLIRP 375
Query: 504 LPSEESPRSSDGHSQSSSSSSSPVVIPPPPAKHVERQKFFKEKKVDGGASGLPGHMRGLS 325
LPSEE+ R D HSQ S PVVIPPPPAKHVER+KFFKE K D GA GLPGHMRGLS
Sbjct: 376 LPSEEASRGGDSHSQ-----SPPVVIPPPPAKHVEREKFFKENKGD-GALGLPGHMRGLS 429
Query: 324 SHS--GSSSRSGSVDFSE 277
HS GSSSRSGSVDFS+
Sbjct: 430 LHSRDGSSSRSGSVDFSD 447
>gi|224122768|ref|XP_002330472.1| predicted protein [Populus trichocarpa]
Length = 418
Score = 147 bits (370), Expect = 2e-033
Identities = 82/135 (60%), Positives = 96/135 (71%), Gaps = 13/135 (9%)
Frame = -1
Query: 675 EDDDEEEEPEQLFRRLRKWKARAMPEDEEEASPPPPPQVLLGSAIHNERLNRPLIRPLPS 496
E+ +EEEEPEQLFRRLRK KA A PEDE S PP ++GS I ERLNRPLIRPLPS
Sbjct: 295 EESEEEEEPEQLFRRLRKGKACARPEDEGN-SEERPPLGIIGSTIPGERLNRPLIRPLPS 353
Query: 495 EESPRSSDGHSQSSSSSSSPVVIPPPPAKHVERQKFFKEKKVDGGASGLPGHMRGLS--S 322
E+ Q S+ +PVVIPPPPAKH+ER+KFF+E K DG S + HMRGLS
Sbjct: 354 EQ--------PQEPSAHPAPVVIPPPPAKHIEREKFFQETKADG--SDVDSHMRGLSLHC 403
Query: 321 HSGSSSRSGSVDFSE 277
H+ SSSR+GS+DFSE
Sbjct: 404 HNASSSRAGSIDFSE 418
>gi|255544385|ref|XP_002513254.1| Hepatocyte growth factor-regulated tyrosine
kinase substrate, putative [Ricinus communis]
Length = 415
Score = 130 bits (325), Expect = 3e-028
Identities = 74/125 (59%), Positives = 89/125 (71%), Gaps = 13/125 (10%)
Frame = -1
Query: 672 DDDEEEEPEQLFRRLRKWKARAMPEDEEEASPPPPPQVLLGSAIHNERLNRPLIRPLPSE 493
++DEEEE EQLFRRLRK KA A PEDE P +LGS+I +RLNRPLIRPL S+
Sbjct: 293 EEDEEEEAEQLFRRLRKGKACAKPEDEGNLE-ERIPMGMLGSSIFGDRLNRPLIRPLHSD 351
Query: 492 ESPRSSDGHSQSSSSSSSPVVIPPPPAKHVERQKFFKEKKVDGGASGLPGHMRGLSSHS- 316
+ S S+++SPV IPPPPAKH+ER++FF+EKKVDG S + GHMRGLS HS
Sbjct: 352 Q--------SHEPSANTSPVAIPPPPAKHIERERFFQEKKVDG--SAVSGHMRGLSLHSR 401
Query: 315 -GSSS 304
GSSS
Sbjct: 402 NGSSS 406
Score = 87 bits (214), Expect = 2e-015
Identities = 46/72 (63%), Positives = 56/72 (77%), Gaps = 5/72 (6%)
Frame = -1
Query: 486 PRSSDGHSQSSSSSSSPVVIPPPPAKHVERQKFFKEKKVDGGASGLPGHMRGLSSHS--G 313
P SD S S+++SPV IPPPPAKH+ER++FF+EKKVDG S + GHMRGLS HS G
Sbjct: 347 PLHSD-QSHEPSANTSPVAIPPPPAKHIERERFFQEKKVDG--SAVSGHMRGLSLHSRNG 403
Query: 312 SSSRSGSVDFSE 277
SSS SGS+DFS+
Sbjct: 404 SSSHSGSLDFSD 415
>gi|147787190|emb|CAN66834.1| hypothetical protein VITISV_030892 [Vitis
vinifera]
Length = 298
Score = 117 bits (291), Expect = 2e-024
Identities = 71/134 (52%), Positives = 83/134 (61%), Gaps = 12/134 (8%)
Frame = -1
Query: 672 DDDEEEEPEQLFRRLRKWKARAMPEDEEEASPPPPPQVLLGSAIHNERLNRPLIRPLPSE 493
+ +EEEE EQLFRRLRK KA A+PEDEE PP +L I E LNRPLIRP+ E
Sbjct: 175 EGEEEEEAEQLFRRLRKGKACALPEDEERPVDRPPFGLL--GTIPGEMLNRPLIRPVSLE 232
Query: 492 ESPRSSDGHSQSSSSSSSPVVIPPPPAKHVERQKFFKEKKVDGGASGLPGHMRGLSSHS- 316
S H S IPPPP+KHVER+K+F+E K DG S + GHMR LS HS
Sbjct: 233 PS------HESRPLPSPLASAIPPPPSKHVEREKYFQENKGDG--SAVAGHMRSLSLHSR 284
Query: 315 -GSSSRSGSVDFSE 277
SSS SGS+D S+
Sbjct: 285 NASSSHSGSIDSSD 298
>gi|296082660|emb|CBI21665.3| unnamed protein product [Vitis vinifera]
Length = 400
Score = 108 bits (269), Expect = 8e-022
Identities = 64/123 (52%), Positives = 76/123 (61%), Gaps = 10/123 (8%)
Frame = -1
Query: 672 DDDEEEEPEQLFRRLRKWKARAMPEDEEEASPPPPPQVLLGSAIHNERLNRPLIRPLPSE 493
+ +EEEE EQLFRRLRK KA A+PEDEE PP +L I E LNRPLIRP+ E
Sbjct: 277 EGEEEEEAEQLFRRLRKGKACALPEDEERPVDRPPFGLL--GTIPGEMLNRPLIRPVSLE 334
Query: 492 ESPRSSDGHSQSSSSSSSPVVIPPPPAKHVERQKFFKEKKVDGGASGLPGHMRGLSSHSG 313
S H S IPPPP+KHVER+K+F+E K DG S + GHMR LS HS
Sbjct: 335 PS------HESRPLPSPLASAIPPPPSKHVEREKYFQENKGDG--SAVAGHMRSLSLHSR 386
Query: 312 SSS 304
++S
Sbjct: 387 NAS 389
>gi|115474421|ref|NP_001060807.1| Os08g0109000 [Oryza sativa Japonica Group]
Length = 401
Score = 86 bits (211), Expect = 4e-015
Identities = 59/145 (40%), Positives = 87/145 (60%), Gaps = 16/145 (11%)
Frame = -1
Query: 699 KPSSSI-SIEDDDEEEEPEQLFRRLRKWKARAMPEDEEEASPPPPPQVLLGSAIHNERLN 523
+P++++ S ++EEE+ E L+RRLRK K A+ ED + S P +I +++
Sbjct: 269 QPTTTVASTLKEEEEEDAESLYRRLRKGK--ALSEDYTDDSIPS------FRSIPEDKMR 320
Query: 522 RPLIRPLPSEESPRSSDGHSQSSSSSSSP-VVIPPPPAKHVERQKFFKEKKVDGGASGLP 346
RPL PS + + +S + P V+IPPPPAKH ER++FF+EK +D + L
Sbjct: 321 RPLTIE-PSNTDKKLGALNIRSPYPEARPDVLIPPPPAKHAERERFFREKSMD---ANLL 376
Query: 345 GHMRGLSSHS--GSSSRSGSVDFSE 277
GH+RGLS HS GSSS SGS D+ +
Sbjct: 377 GHLRGLSLHSRDGSSSCSGSTDYGD 401
>gi|125559891|gb|EAZ05339.1| hypothetical protein OsI_27544 [Oryza sativa Indica
Group]
Length = 401
Score = 86 bits (211), Expect = 4e-015
Identities = 59/145 (40%), Positives = 87/145 (60%), Gaps = 16/145 (11%)
Frame = -1
Query: 699 KPSSSI-SIEDDDEEEEPEQLFRRLRKWKARAMPEDEEEASPPPPPQVLLGSAIHNERLN 523
+P++++ S ++EEE+ E L+RRLRK K A+ ED + S P +I +++
Sbjct: 269 QPTTTVASTLKEEEEEDAESLYRRLRKGK--ALSEDYTDDSIPS------FRSIPEDKMR 320
Query: 522 RPLIRPLPSEESPRSSDGHSQSSSSSSSP-VVIPPPPAKHVERQKFFKEKKVDGGASGLP 346
RPL PS + + +S + P V+IPPPPAKH ER++FF+EK +D + L
Sbjct: 321 RPLTIE-PSNTDKKLGALNIRSPYPEARPDVLIPPPPAKHAERERFFREKSMD---ANLL 376
Query: 345 GHMRGLSSHS--GSSSRSGSVDFSE 277
GH+RGLS HS GSSS SGS D+ +
Sbjct: 377 GHLRGLSLHSRDGSSSCSGSTDYGD 401
>gi|224145705|ref|XP_002325737.1| predicted protein [Populus trichocarpa]
Length = 415
Score = 82 bits (202), Expect = 5e-014
Identities = 42/64 (65%), Positives = 51/64 (79%), Gaps = 4/64 (6%)
Frame = -1
Query: 462 QSSSSSSSPVVIPPPPAKHVERQKFFKEKKVDGGASGLPGHMRGLSSHS--GSSSRSGSV 289
Q +++ +PVVIPPPPAKH+ERQKFF+EKK DG S + GHMRGLS HS SSS SGS+
Sbjct: 354 QDPNANCAPVVIPPPPAKHMERQKFFQEKKADG--SAVSGHMRGLSLHSRNASSSCSGSI 411
Query: 288 DFSE 277
DFS+
Sbjct: 412 DFSD 415
Score = 76 bits (186), Expect = 3e-012
Identities = 42/62 (67%), Positives = 46/62 (74%), Gaps = 1/62 (1%)
Frame = -1
Query: 675 EDDDEEEEPEQLFRRLRKWKARAMPEDEEEASPPPPPQVLLGSAIHNERLNRPLIRPLPS 496
E+ +EEEEPEQLFRRLRK KA A PEDE S P LLGS I +RLNRPLIRPLPS
Sbjct: 292 EESEEEEEPEQLFRRLRKGKACARPEDEGN-SEERLPLGLLGSTIPGDRLNRPLIRPLPS 350
Query: 495 EE 490
E+
Sbjct: 351 EQ 352
>gi|226508122|ref|NP_001149290.1| LOC100282912 [Zea mays]
Length = 405
Score = 75 bits (184), Expect = 6e-012
Identities = 37/54 (68%), Positives = 44/54 (81%), Gaps = 3/54 (5%)
Frame = -1
Query: 432 VIPPPPAKHVERQKFFKEKKVDGGASGLPGHMRGLSSHS--GSSSRSGSVDFSE 277
+IPPPPAKH ER++FF+EK +DG AS LPGHMRGLS HS GSSS SGS D+ +
Sbjct: 353 LIPPPPAKHAERERFFREKSIDGVAS-LPGHMRGLSQHSRDGSSSCSGSTDYGD 405
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,093,974,624,356
Number of Sequences: 15229318
Number of Extensions: 1093974624356
Number of Successful Extensions: 299540451
Number of sequences better than 0.0: 0