BLASTX 7.6.2
Query= UN19991 /QuerySize=655
(654 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|3193330|gb|AAC19312.1| contains similarity to Medicago sativa... 197 1e-048
gi|22328170|ref|NP_680546.1| bifunctional inhibitor/lipid-transf... 197 1e-048
gi|297810165|ref|XP_002872966.1| hypothetical protein ARALYDRAFT... 188 7e-046
gi|118486948|gb|ABK95307.1| unknown [Populus trichocarpa] 158 8e-037
gi|255541204|ref|XP_002511666.1| 14 kDa proline-rich protein DC2... 154 8e-036
gi|255628581|gb|ACU14635.1| unknown [Glycine max] 150 2e-034
gi|296089175|emb|CBI38878.3| unnamed protein product [Vitis vini... 149 5e-034
gi|225453963|ref|XP_002280193.1| PREDICTED: hypothetical protein... 146 2e-033
gi|225453961|ref|XP_002280170.1| PREDICTED: hypothetical protein... 138 8e-031
gi|118488240|gb|ABK95939.1| unknown [Populus trichocarpa] 122 4e-026
gi|255630522|gb|ACU15619.1| unknown [Glycine max] 119 3e-025
gi|281398220|gb|ADA67933.1| putative 14 kDa proline-rich protein... 119 5e-025
gi|226491280|ref|NP_001152585.1| LOC100286225 [Zea mays] 118 9e-025
gi|508304|gb|AAA32650.1| bimodular protein [Medicago sativa] 117 2e-024
gi|146141284|gb|ABQ01426.1| bimodular protein [Medicago sativa s... 117 2e-024
gi|162319716|gb|ABX84384.1| protease inhibitor-like protein [Tri... 113 3e-023
gi|156454136|gb|ABU63756.1| root specific protein [Triticum aest... 108 7e-022
gi|255631195|gb|ACU15963.1| unknown [Glycine max] 107 2e-021
gi|193848565|gb|ACF22750.1| proline-rich protein [Brachypodium d... 106 3e-021
gi|162319714|gb|ABX84383.1| protease inhibitor-like protein [Tri... 104 2e-020
>gi|3193330|gb|AAC19312.1| contains similarity to Medicago sativa corC
(GB:L22305) [Arabidopsis thaliana]
Length = 399
Score = 197 bits (499), Expect = 1e-048
Identities = 99/129 (76%), Positives = 109/129 (84%), Gaps = 7/129 (5%)
Frame = +2
Query: 158 MAISKAFPLLLVLLLVLNLTFSFCH-----AVKQC-PPPTKQSSMKCPRDTVKFGVCGSW 319
M ISKA LL+LLL LN+TF F H VK C PPP KQ++ KCPRDT+KFGVCGSW
Sbjct: 272 MGISKALRSLLILLL-LNITFFFGHVTPGATVKPCPPPPAKQATTKCPRDTLKFGVCGSW 330
Query: 320 LGLVHEVIGTPPSQECCSLVKGLADLEAALCLCTALKTSLLGVAPVKLPVALTLLLNSCG 499
LGLV EVIGTPPSQECCSL+KGLAD EAA+CLCTALKTS+LGVAPVK+PVALTLLLNSCG
Sbjct: 331 LGLVSEVIGTPPSQECCSLIKGLADFEAAVCLCTALKTSILGVAPVKIPVALTLLLNSCG 390
Query: 500 KTLPQGFVC 526
K +PQGFVC
Sbjct: 391 KNVPQGFVC 399
>gi|22328170|ref|NP_680546.1| bifunctional inhibitor/lipid-transfer protein/seed
storage 2S albumin-like protein [Arabidopsis thaliana]
Length = 128
Score = 197 bits (499), Expect = 1e-048
Identities = 99/129 (76%), Positives = 109/129 (84%), Gaps = 7/129 (5%)
Frame = +2
Query: 158 MAISKAFPLLLVLLLVLNLTFSFCH-----AVKQC-PPPTKQSSMKCPRDTVKFGVCGSW 319
M ISKA LL+LLL LN+TF F H VK C PPP KQ++ KCPRDT+KFGVCGSW
Sbjct: 1 MGISKALRSLLILLL-LNITFFFGHVTPGATVKPCPPPPAKQATTKCPRDTLKFGVCGSW 59
Query: 320 LGLVHEVIGTPPSQECCSLVKGLADLEAALCLCTALKTSLLGVAPVKLPVALTLLLNSCG 499
LGLV EVIGTPPSQECCSL+KGLAD EAA+CLCTALKTS+LGVAPVK+PVALTLLLNSCG
Sbjct: 60 LGLVSEVIGTPPSQECCSLIKGLADFEAAVCLCTALKTSILGVAPVKIPVALTLLLNSCG 119
Query: 500 KTLPQGFVC 526
K +PQGFVC
Sbjct: 120 KNVPQGFVC 128
>gi|297810165|ref|XP_002872966.1| hypothetical protein ARALYDRAFT_327754
[Arabidopsis lyrata subsp. lyrata]
Length = 380
Score = 188 bits (476), Expect = 7e-046
Identities = 95/128 (74%), Positives = 105/128 (82%), Gaps = 10/128 (7%)
Frame = +2
Query: 158 MAISKAFPLLLVLLLVLNLTFSFCH-----AVKQCPPPTKQSSMKCPRDTVKFGVCGSWL 322
M ISKA LL+LLL LN+TF F H VK CPP S KCPRDT+KFGVCGSWL
Sbjct: 258 MGISKALRSLLILLL-LNMTFIFGHVIPGATVKPCPP----SPTKCPRDTLKFGVCGSWL 312
Query: 323 GLVHEVIGTPPSQECCSLVKGLADLEAALCLCTALKTSLLGVAPVKLPVALTLLLNSCGK 502
GLV EVIGTPPSQECCSL+KGLAD EAA+CLCTALKTS+LG+APVK+PVAL+LLLNSCGK
Sbjct: 313 GLVREVIGTPPSQECCSLIKGLADFEAAVCLCTALKTSILGIAPVKIPVALSLLLNSCGK 372
Query: 503 TLPQGFVC 526
+PQGFVC
Sbjct: 373 NVPQGFVC 380
>gi|118486948|gb|ABK95307.1| unknown [Populus trichocarpa]
Length = 131
Score = 158 bits (398), Expect = 8e-037
Identities = 71/117 (60%), Positives = 93/117 (79%), Gaps = 3/117 (2%)
Frame = +2
Query: 185 LLVLLLVLNLTFSFCHAVKQCP---PPTKQSSMKCPRDTVKFGVCGSWLGLVHEVIGTPP 355
+ VLL V+ T H V CP PP+ + KCP+DT+KFGVCG+WLGLVHE +GTPP
Sbjct: 14 IFVLLNVIFFTCVSSHNVPACPPKAPPSPKKPAKCPKDTLKFGVCGNWLGLVHEALGTPP 73
Query: 356 SQECCSLVKGLADLEAALCLCTALKTSLLGVAPVKLPVALTLLLNSCGKTLPQGFVC 526
S+ECC+L+KGLADLEAALCLCTA+K ++LGV +K+PVA++LLL++CGK +P+GF C
Sbjct: 74 SEECCTLIKGLADLEAALCLCTAIKANVLGVVKLKVPVAVSLLLSACGKKVPEGFKC 130
>gi|255541204|ref|XP_002511666.1| 14 kDa proline-rich protein DC2.15 precursor,
putative [Ricinus communis]
Length = 128
Score = 154 bits (389), Expect = 8e-036
Identities = 74/123 (60%), Positives = 96/123 (78%), Gaps = 4/123 (3%)
Frame = +2
Query: 167 SKAFPLLLVLLLVLNLTFSFCHAVKQCP---PPTKQSSMKCPRDTVKFGVCGSWLGLVHE 337
SKA LLL+L L+L T H V CP P + +CP+DT+ FGVCGSWLGLVHE
Sbjct: 6 SKAATLLLLLSLIL-FTCVSSHKVPVCPPKVPSVPEKPARCPKDTLTFGVCGSWLGLVHE 64
Query: 338 VIGTPPSQECCSLVKGLADLEAALCLCTALKTSLLGVAPVKLPVALTLLLNSCGKTLPQG 517
VIGT PS+ECC+L+KG+ADLEAALCLCTA+K+++LGV V++PVA++LLL++CG+ +PQG
Sbjct: 65 VIGTKPSKECCTLIKGVADLEAALCLCTAIKSNVLGVVKVEVPVAISLLLSACGREVPQG 124
Query: 518 FVC 526
F C
Sbjct: 125 FKC 127
>gi|255628581|gb|ACU14635.1| unknown [Glycine max]
Length = 128
Score = 150 bits (378), Expect = 2e-034
Identities = 72/120 (60%), Positives = 88/120 (73%), Gaps = 6/120 (5%)
Frame = +2
Query: 185 LLVLLLVLNLTFSFCHAVKQ--CPP----PTKQSSMKCPRDTVKFGVCGSWLGLVHEVIG 346
L+ ++LN F C A CPP P S KCP+DT+KFGVCGSWLGLV EVIG
Sbjct: 8 LICSFILLNFLFFSCFAADNLPCPPKSTIPPSSSPQKCPKDTLKFGVCGSWLGLVKEVIG 67
Query: 347 TPPSQECCSLVKGLADLEAALCLCTALKTSLLGVAPVKLPVALTLLLNSCGKTLPQGFVC 526
T PS+ECC L+KGLADLEAALCLCTA+K ++LG VK+ VA++LL+N+CGK +P GFVC
Sbjct: 68 TKPSEECCILLKGLADLEAALCLCTAIKANVLGAVKVKVHVAVSLLVNACGKKVPSGFVC 127
>gi|296089175|emb|CBI38878.3| unnamed protein product [Vitis vinifera]
Length = 127
Score = 149 bits (374), Expect = 5e-034
Identities = 68/116 (58%), Positives = 87/116 (75%), Gaps = 5/116 (4%)
Frame = +2
Query: 194 LLLVLNLTFSFCHAVKQCP-----PPTKQSSMKCPRDTVKFGVCGSWLGLVHEVIGTPPS 358
LL++LN+ F C + P PP S KCP+DT+KFG C +WLGLV EV+GTPPS
Sbjct: 11 LLILLNIFFFSCVSCNGVPCPPSTPPAPTKSAKCPKDTLKFGACANWLGLVGEVVGTPPS 70
Query: 359 QECCSLVKGLADLEAALCLCTALKTSLLGVAPVKLPVALTLLLNSCGKTLPQGFVC 526
+CC+LV GLADLEAALC CTA+K ++LG V++PVALTLL+N+CGK +P+GFVC
Sbjct: 71 SKCCALVAGLADLEAALCFCTAIKANVLGAIKVEVPVALTLLVNACGKKVPEGFVC 126
>gi|225453963|ref|XP_002280193.1| PREDICTED: hypothetical protein isoform 2
[Vitis vinifera]
Length = 123
Score = 146 bits (368), Expect = 2e-033
Identities = 66/112 (58%), Positives = 87/112 (77%), Gaps = 1/112 (0%)
Frame = +2
Query: 194 LLLVLNLTFSFCHAVKQCP-PPTKQSSMKCPRDTVKFGVCGSWLGLVHEVIGTPPSQECC 370
LL++LN+ F C + P PP+ + CP+DT+KFG C +WLGLV EV+GTPPS +CC
Sbjct: 11 LLILLNIFFFSCVSCNGVPCPPSTPPAPTCPKDTLKFGACANWLGLVGEVVGTPPSSKCC 70
Query: 371 SLVKGLADLEAALCLCTALKTSLLGVAPVKLPVALTLLLNSCGKTLPQGFVC 526
+LV GLADLEAALC CTA+K ++LG V++PVALTLL+N+CGK +P+GFVC
Sbjct: 71 ALVAGLADLEAALCFCTAIKANVLGAIKVEVPVALTLLVNACGKKVPEGFVC 122
>gi|225453961|ref|XP_002280170.1| PREDICTED: hypothetical protein isoform 1
[Vitis vinifera]
Length = 135
Score = 138 bits (346), Expect = 8e-031
Identities = 59/91 (64%), Positives = 76/91 (83%)
Frame = +2
Query: 254 PTKQSSMKCPRDTVKFGVCGSWLGLVHEVIGTPPSQECCSLVKGLADLEAALCLCTALKT 433
P K+ + CP+DT+KFG C +WLGLV EV+GTPPS +CC+LV GLADLEAALC CTA+K
Sbjct: 44 PPKKPAPTCPKDTLKFGACANWLGLVGEVVGTPPSSKCCALVAGLADLEAALCFCTAIKA 103
Query: 434 SLLGVAPVKLPVALTLLLNSCGKTLPQGFVC 526
++LG V++PVALTLL+N+CGK +P+GFVC
Sbjct: 104 NVLGAIKVEVPVALTLLVNACGKKVPEGFVC 134
>gi|118488240|gb|ABK95939.1| unknown [Populus trichocarpa]
Length = 121
Score = 122 bits (306), Expect = 4e-026
Identities = 58/116 (50%), Positives = 82/116 (70%), Gaps = 3/116 (2%)
Frame = +2
Query: 185 LLVLLLVLNLTF--SFCHAVKQCPPPTKQSSMKCPRDTVKFGVCGSWLGLVHEVIGTPPS 358
L +L+L+L F +F A C P T + CPRDT+K G C LGLV+ V+G+PP
Sbjct: 6 LTATILILSLLFFSTFSSACGPCQPKTPPTEPTCPRDTLKLGACADILGLVNVVVGSPPY 65
Query: 359 QECCSLVKGLADLEAALCLCTALKTSLLGVAPVKLPVALTLLLNSCGKTLPQGFVC 526
+CC L++GLADLE ALCLCTA+K S+LG+ + +PVAL++L+++CGK++P GF C
Sbjct: 66 SKCCPLLEGLADLEVALCLCTAIKASVLGI-NLNVPVALSVLVSACGKSIPPGFKC 120
>gi|255630522|gb|ACU15619.1| unknown [Glycine max]
Length = 131
Score = 119 bits (298), Expect = 3e-025
Identities = 54/93 (58%), Positives = 70/93 (75%), Gaps = 1/93 (1%)
Frame = +2
Query: 248 PPPTKQSSMKCPRDTVKFGVCGSWLGLVHEVIGTPPSQECCSLVKGLADLEAALCLCTAL 427
P P + CP+DTVKFGVC LGL++ +G PP CCSL++GLADLEAA+CLCTAL
Sbjct: 39 PKPPSPKQVSCPKDTVKFGVCADVLGLINVQLGKPPKTPCCSLIQGLADLEAAVCLCTAL 98
Query: 428 KTSLLGVAPVKLPVALTLLLNSCGKTLPQGFVC 526
K ++LG+ + +PV L+LLLN CGK +P+GFVC
Sbjct: 99 KANVLGI-NLNVPVNLSLLLNYCGKGVPKGFVC 130
>gi|281398220|gb|ADA67933.1| putative 14 kDa proline-rich protein DC2.15
[Wolffia arrhiza]
Length = 135
Score = 119 bits (296), Expect = 5e-025
Identities = 53/94 (56%), Positives = 72/94 (76%), Gaps = 1/94 (1%)
Frame = +2
Query: 248 PPPTKQSSMKCPRDTVKFGVCGSWL-GLVHEVIGTPPSQECCSLVKGLADLEAALCLCTA 424
P PT + + KCP DT+K GVC + L GL++ +GTPP CC+L+KGLADLEAALCLCT
Sbjct: 41 PKPTPKPTGKCPVDTLKLGVCANLLNGLINIQLGTPPKTPCCNLIKGLADLEAALCLCTV 100
Query: 425 LKTSLLGVAPVKLPVALTLLLNSCGKTLPQGFVC 526
LK ++LG+ + LP+ L+LL+N CGK++P GF+C
Sbjct: 101 LKANVLGLISLNLPINLSLLVNYCGKSVPTGFIC 134
>gi|226491280|ref|NP_001152585.1| LOC100286225 [Zea mays]
Length = 125
Score = 118 bits (294), Expect = 9e-025
Identities = 60/122 (49%), Positives = 79/122 (64%), Gaps = 5/122 (4%)
Frame = +2
Query: 167 SKAFPLL----LVLLLVLNLTFSFCHAVKQCPPPTKQSSMKCPRDTVKFGVCGSWLGLVH 334
SKAF L LV+L V N C P PT S +CPRD +K GVC + LGL+
Sbjct: 3 SKAFALFLAVNLVVLGVANACTPNCSGPSTTPTPTPSSFGRCPRDALKLGVCANVLGLIK 62
Query: 335 EVIGTPPSQECCSLVKGLADLEAALCLCTALKTSLLGVAPVKLPVALTLLLNSCGKTLPQ 514
+G PP++ CC L++GL DLEAA CLCTA+K ++LG+ + LPV L+L+LN CG+T+P
Sbjct: 63 AKVGAPPAEPCCPLLEGLVDLEAAACLCTAIKGNILGI-NLNLPVDLSLILNYCGRTVPX 121
Query: 515 GF 520
GF
Sbjct: 122 GF 123
>gi|508304|gb|AAA32650.1| bimodular protein [Medicago sativa]
Length = 166
Score = 117 bits (291), Expect = 2e-024
Identities = 49/92 (53%), Positives = 73/92 (79%), Gaps = 1/92 (1%)
Frame = +2
Query: 251 PPTKQSSMKCPRDTVKFGVCGSWLGLVHEVIGTPPSQECCSLVKGLADLEAALCLCTALK 430
PPT +S KCP DT+K GVC LGLV+ ++G+P S +CC+L++GLADL+AA+CLCTA+K
Sbjct: 75 PPTPSTSQKCPTDTLKLGVCADVLGLVNVIVGSPASSKCCTLIQGLADLDAAVCLCTAIK 134
Query: 431 TSLLGVAPVKLPVALTLLLNSCGKTLPQGFVC 526
++LG+ + +P+ L+LLL++C K++P GF C
Sbjct: 135 ANILGI-NLNVPITLSLLLSACEKSIPNGFQC 165
>gi|146141284|gb|ABQ01426.1| bimodular protein [Medicago sativa subsp. falcata]
Length = 166
Score = 117 bits (291), Expect = 2e-024
Identities = 49/92 (53%), Positives = 73/92 (79%), Gaps = 1/92 (1%)
Frame = +2
Query: 251 PPTKQSSMKCPRDTVKFGVCGSWLGLVHEVIGTPPSQECCSLVKGLADLEAALCLCTALK 430
PPT +S KCP DT+K GVC LGLV+ ++G+P S +CC+L++GLADL+AA+CLCTA+K
Sbjct: 75 PPTPSTSQKCPTDTLKLGVCADVLGLVNVIVGSPASSKCCTLIQGLADLDAAVCLCTAIK 134
Query: 431 TSLLGVAPVKLPVALTLLLNSCGKTLPQGFVC 526
++LG+ + +P+ L+LLL++C K++P GF C
Sbjct: 135 ANILGI-NLNVPITLSLLLSACEKSIPNGFQC 165
>gi|162319716|gb|ABX84384.1| protease inhibitor-like protein [Triticum
aestivum]
Length = 126
Score = 113 bits (281), Expect = 3e-023
Identities = 57/126 (45%), Positives = 75/126 (59%), Gaps = 3/126 (2%)
Frame = +2
Query: 158 MAISKAFPLLLVLLLVLNLTFSFCHA---VKQCPPPTKQSSMKCPRDTVKFGVCGSWLGL 328
MA + L L + LV+ S C P PT + +CPRD VK G+C + L L
Sbjct: 1 MAGKASIALFLAVNLVVFSVASACGGNCPTPSTPTPTPAAFGRCPRDAVKIGLCVNALNL 60
Query: 329 VHEVIGTPPSQECCSLVKGLADLEAALCLCTALKTSLLGVAPVKLPVALTLLLNSCGKTL 508
V +G PP+ CC LVKGL DLEAALCLCT LK ++L + + LP+ L+++LN CGK +
Sbjct: 61 VKAELGAPPTLPCCPLVKGLVDLEAALCLCTVLKANVLNIVKLNLPIDLSVILNDCGKKV 120
Query: 509 PQGFVC 526
P GF C
Sbjct: 121 PTGFQC 126
>gi|156454136|gb|ABU63756.1| root specific protein [Triticum aestivum]
Length = 126
Score = 108 bits (269), Expect = 7e-022
Identities = 48/93 (51%), Positives = 63/93 (67%)
Frame = +2
Query: 248 PPPTKQSSMKCPRDTVKFGVCGSWLGLVHEVIGTPPSQECCSLVKGLADLEAALCLCTAL 427
P PT + +CPRD VK G+C + L LV +G PP+ CC LVKGL DLEAALCLCT L
Sbjct: 34 PTPTPAAFGRCPRDAVKIGLCVNALNLVKAELGAPPTLPCCPLVKGLVDLEAALCLCTVL 93
Query: 428 KTSLLGVAPVKLPVALTLLLNSCGKTLPQGFVC 526
K ++L + + LP+ L+++ N CGK +P GF C
Sbjct: 94 KANVLNIVKLNLPIDLSVIPNDCGKKVPTGFQC 126
>gi|255631195|gb|ACU15963.1| unknown [Glycine max]
Length = 184
Score = 107 bits (266), Expect = 2e-021
Identities = 47/88 (53%), Positives = 68/88 (77%), Gaps = 5/88 (5%)
Frame = +2
Query: 245 CPPPTKQS----SMKCPRDTVKFGVCGSWLGLVHEVIGTPPSQECCSLVKGLADLEAALC 412
CPPP+ S CP+DT+K G C LGLV+ ++GTPPS +CC+L+KGLADLEAALC
Sbjct: 79 CPPPSPPSPPSNKASCPKDTLKLGACADLLGLVNIIVGTPPSSQCCALIKGLADLEAALC 138
Query: 413 LCTALKTSLLGVAPVKLPVALTLLLNSC 496
LCTA+K+++LG+ + +PV L+++L++C
Sbjct: 139 LCTAIKSNVLGI-NLNVPVTLSVILSAC 165
>gi|193848565|gb|ACF22750.1| proline-rich protein [Brachypodium distachyon]
Length = 142
Score = 106 bits (264), Expect = 3e-021
Identities = 53/124 (42%), Positives = 76/124 (61%), Gaps = 4/124 (3%)
Frame = +2
Query: 167 SKAFPLLLVLLLVLNLTFSFCH----AVKQCPPPTKQSSMKCPRDTVKFGVCGSWLGLVH 334
S+ L L+ L+ L+L + A CP P ++ CP +T+K GVC + L L+
Sbjct: 17 SRKLALFLLALMNLSLLLGAVNAGGCAGPHCPTPATSTTGVCPINTLKLGVCANVLNLLK 76
Query: 335 EVIGTPPSQECCSLVKGLADLEAALCLCTALKTSLLGVAPVKLPVALTLLLNSCGKTLPQ 514
IG P S++CC L+ GLADL+AA+C+C+A++ +LGV + +PV L LLLN C KT P
Sbjct: 77 LKIGVPASEQCCPLLTGLADLDAAVCVCSAIRAKVLGVVNLNVPVDLVLLLNYCRKTCPP 136
Query: 515 GFVC 526
GF C
Sbjct: 137 GFTC 140
>gi|162319714|gb|ABX84383.1| protease inhibitor-like protein [Triticum
aestivum]
Length = 131
Score = 104 bits (257), Expect = 2e-020
Identities = 45/93 (48%), Positives = 61/93 (65%)
Frame = +2
Query: 248 PPPTKQSSMKCPRDTVKFGVCGSWLGLVHEVIGTPPSQECCSLVKGLADLEAALCLCTAL 427
P PT S +CPRD +K G C + L LV +G P + CC L+ GL DLEAALCLCT +
Sbjct: 39 PTPTPASLRRCPRDALKVGACVNALNLVKAQVGRPTALPCCPLLDGLVDLEAALCLCTVI 98
Query: 428 KTSLLGVAPVKLPVALTLLLNSCGKTLPQGFVC 526
K ++L + + LP+ L+++LN CGK P GF+C
Sbjct: 99 KANVLNIVQLNLPINLSVILNHCGKKAPTGFMC 131
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,354,205,064,082
Number of Sequences: 15229318
Number of Extensions: 2354205064082
Number of Successful Extensions: 584002404
Number of sequences better than 0.0: 0
|