BLASTX 7.6.2
Query= UN30156 /QuerySize=1294
(1293 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|296090276|emb|CBI40095.3| unnamed protein product [Vitis vini... 143 8e-032
gi|297793237|ref|XP_002864503.1| hypothetical protein ARALYDRAFT... 129 1e-027
gi|30687076|ref|NP_849445.1| CCT motif family protein [Arabidops... 122 1e-025
gi|4538928|emb|CAB39664.1| putative protein [Arabidopsis thaliana] 122 1e-025
gi|18416659|ref|NP_567737.1| CCT motif family protein [Arabidops... 122 1e-025
gi|30696840|ref|NP_568852.2| chloroplast import apparatus 2 prot... 119 2e-024
gi|30696838|ref|NP_851201.1| chloroplast import apparatus 2 prot... 119 2e-024
gi|147784441|emb|CAN72725.1| hypothetical protein VITISV_015092 ... 119 2e-024
gi|225470656|ref|XP_002268213.1| PREDICTED: hypothetical protein... 119 2e-024
gi|297799402|ref|XP_002867585.1| hypothetical protein ARALYDRAFT... 112 1e-022
gi|255554380|ref|XP_002518229.1| CIL, putative [Ricinus communis] 89 1e-015
gi|224142697|ref|XP_002324691.1| predicted protein [Populus tric... 89 2e-015
gi|224087284|ref|XP_002308112.1| predicted protein [Populus tric... 84 6e-014
gi|79331135|ref|NP_001032087.1| chloroplast import apparatus 2 p... 67 8e-009
>gi|296090276|emb|CBI40095.3| unnamed protein product [Vitis vinifera]
Length = 392
Score = 143 bits (359), Expect = 8e-032
Identities = 123/329 (37%), Positives = 155/329 (47%), Gaps = 45/329 (13%)
Frame = +1
Query: 373 PLPRHQQSPNLQTQSPPGS-------QERSGNDRTRAT-TQAAALLSTAYPNIFFSPSST 528
P H SP+ S + R+ R T +AAALLSTAYPNIF S+
Sbjct: 52 PRTSHSSSPSSTISESSNSPIAISTRKPRTPRKRPNQTYNEAAALLSTAYPNIF---STK 108
Query: 529 NHLYANKKTHHHFYGFDDDVDDDAELLLPSEPPD----FLFNPILQAYSDQKEVSSGVSV 696
N K T H D ++D +ELL P D L P+ + E+ G
Sbjct: 109 NLKNPCKFTKSH----DSFLEDSSELLFPFRAFDASGFLLHQPVQEKPRKSPELCDG--- 161
Query: 697 NESEVSQFEFSDEFESILVEEVEEGGVDSIMGKLEPGSGHRRGV------NHRFDHQIVK 858
FE + ESIL EE+ EGG+DSIMG L + N + + I
Sbjct: 162 -------FEEDFDAESILDEEI-EGGIDSIMGNLSVDNEMSDEATNPVCFNSYYGNGIPM 213
Query: 859 GFKFAENIPLGLGLR---TALRDHNNDDDARFQTVDLDRIQTTVTVVDEVKKSKKKKKKV 1029
G F G G+R ALR + D RF TVD+ I V ++KKKKKV
Sbjct: 214 GLGFGGKFEFGFGMRRGVRALRHVDEGDWWRFPTVDILEISPKFNKV----SAEKKKKKV 269
Query: 1030 AAAAAVVPLATVSSREETEERSGRPMLKLDYDGVLEAWSDKATPFGEESEVTGADVNARL 1209
A + + + S +LKL+YD VL AWSD+ +PF E+E G D ARL
Sbjct: 270 EKAQELRSWESPKGNSIPKSNSSL-LLKLNYDDVLSAWSDRGSPFSRETEFPGNDTAARL 328
Query: 1210 AQIDLLGD-SGVREASVLRYKEKRQTRLF 1293
AQIDL + GVREASVLRYKEKR+TRLF
Sbjct: 329 AQIDLFSECGGVREASVLRYKEKRRTRLF 357
Score = 85 bits (209), Expect = 2e-014
Identities = 46/71 (64%), Positives = 56/71 (78%), Gaps = 4/71 (5%)
Frame = +3
Query: 264 SGQEMSACLSSGASAAYSFELEKIKSPPSSSTTTTRATSPSSTITESSNS---ISTRKPR 434
S ++MS+CL SGA Y FELE +KSP S+S T+ ++SPSSTI+ESSNS ISTRKPR
Sbjct: 22 SHKKMSSCL-SGAGRTYGFELEIVKSPSSTSPRTSHSSSPSSTISESSNSPIAISTRKPR 80
Query: 435 TQRKRPNQSYN 467
T RKRPNQ+YN
Sbjct: 81 TPRKRPNQTYN 91
>gi|297793237|ref|XP_002864503.1| hypothetical protein ARALYDRAFT_495815
[Arabidopsis lyrata subsp. lyrata]
Length = 426
Score = 129 bits (323), Expect = 1e-027
Identities = 71/126 (56%), Positives = 86/126 (68%), Gaps = 6/126 (4%)
Frame = +1
Query: 934 DARFQTVDLDRIQTTVTVVDEVKKSKKKKKKVAAAAAVVPLATVSSREETEERSGR---P 1104
+ TV ++ + E KKKKKK+ AAA + + S E+TEE S + P
Sbjct: 265 ETAISTVHEEKSDGKKVISGEKSNKKKKKKKMTAAATPLLITESKSSEDTEETSPKRTGP 324
Query: 1105 MLKLDYDGVLEAWSDKATPFGEE---SEVTGADVNARLAQIDLLGDSGVREASVLRYKEK 1275
+LKLDYDGVLEAWSDK +PF +E SE TG DVNARLA+IDL GDSG+REASVLRYKEK
Sbjct: 325 LLKLDYDGVLEAWSDKTSPFPDEILGSEATGIDVNARLAEIDLFGDSGMREASVLRYKEK 384
Query: 1276 RQTRLF 1293
R+TRLF
Sbjct: 385 RRTRLF 390
Score = 102 bits (254), Expect = 1e-019
Identities = 57/69 (82%), Positives = 62/69 (89%), Gaps = 6/69 (8%)
Frame = +3
Query: 276 MSACLSS--GASAAYSFELEKIKSPPSSS-TTTTRATSPSSTITESSNS---ISTRKPRT 437
MSAC+SS G +AAYSFELEK+KSPPSSS TTTTRATSPSSTI+ESSNS ISTRKPRT
Sbjct: 1 MSACISSGGGGAAAYSFELEKVKSPPSSSTTTTTRATSPSSTISESSNSPLAISTRKPRT 60
Query: 438 QRKRPNQSY 464
QRKRPNQ+Y
Sbjct: 61 QRKRPNQTY 69
>gi|30687076|ref|NP_849445.1| CCT motif family protein [Arabidopsis thaliana]
Length = 409
Score = 122 bits (306), Expect = 1e-025
Identities = 68/125 (54%), Positives = 84/125 (67%), Gaps = 4/125 (3%)
Frame = +1
Query: 931 DDARFQTVDLDRIQTTVTVV-DEVKKSKKKKKKVAAAAAVVPLATVSSREETEERSGRPM 1107
DD + VD +I+T VT D+ KK KKKKKKVA AAA + V+ E+ P+
Sbjct: 233 DDGQSNVVDSSKIKTIVTAEGDKKKKKKKKKKKVAPAAAESKSSEVTDSNPKLEQRVSPL 292
Query: 1108 LKLDYDGVLEAWSDKATPFGEE---SEVTGADVNARLAQIDLLGDSGVREASVLRYKEKR 1278
LKLDYDGVLEAWS K +PF +E S+ G D + RL +IDL G+SG+REASVLRYKEKR
Sbjct: 293 LKLDYDGVLEAWSGKESPFSDEILGSDADGVDFHVRLGEIDLFGESGMREASVLRYKEKR 352
Query: 1279 QTRLF 1293
+ RLF
Sbjct: 353 RNRLF 357
>gi|4538928|emb|CAB39664.1| putative protein [Arabidopsis thaliana]
Length = 378
Score = 122 bits (306), Expect = 1e-025
Identities = 68/125 (54%), Positives = 84/125 (67%), Gaps = 4/125 (3%)
Frame = +1
Query: 931 DDARFQTVDLDRIQTTVTVV-DEVKKSKKKKKKVAAAAAVVPLATVSSREETEERSGRPM 1107
DD + VD +I+T VT D+ KK KKKKKKVA AAA + V+ E+ P+
Sbjct: 233 DDGQSNVVDSSKIKTIVTAEGDKKKKKKKKKKKVAPAAAESKSSEVTDSNPKLEQRVSPL 292
Query: 1108 LKLDYDGVLEAWSDKATPFGEE---SEVTGADVNARLAQIDLLGDSGVREASVLRYKEKR 1278
LKLDYDGVLEAWS K +PF +E S+ G D + RL +IDL G+SG+REASVLRYKEKR
Sbjct: 293 LKLDYDGVLEAWSGKESPFSDEILGSDADGVDFHVRLGEIDLFGESGMREASVLRYKEKR 352
Query: 1279 QTRLF 1293
+ RLF
Sbjct: 353 RNRLF 357
>gi|18416659|ref|NP_567737.1| CCT motif family protein [Arabidopsis thaliana]
Length = 394
Score = 122 bits (306), Expect = 1e-025
Identities = 68/125 (54%), Positives = 84/125 (67%), Gaps = 4/125 (3%)
Frame = +1
Query: 931 DDARFQTVDLDRIQTTVTVV-DEVKKSKKKKKKVAAAAAVVPLATVSSREETEERSGRPM 1107
DD + VD +I+T VT D+ KK KKKKKKVA AAA + V+ E+ P+
Sbjct: 233 DDGQSNVVDSSKIKTIVTAEGDKKKKKKKKKKKVAPAAAESKSSEVTDSNPKLEQRVSPL 292
Query: 1108 LKLDYDGVLEAWSDKATPFGEE---SEVTGADVNARLAQIDLLGDSGVREASVLRYKEKR 1278
LKLDYDGVLEAWS K +PF +E S+ G D + RL +IDL G+SG+REASVLRYKEKR
Sbjct: 293 LKLDYDGVLEAWSGKESPFSDEILGSDADGVDFHVRLGEIDLFGESGMREASVLRYKEKR 352
Query: 1279 QTRLF 1293
+ RLF
Sbjct: 353 RNRLF 357
>gi|30696840|ref|NP_568852.2| chloroplast import apparatus 2 protein
[Arabidopsis thaliana]
Length = 435
Score = 119 bits (296), Expect = 2e-024
Identities = 68/124 (54%), Positives = 84/124 (67%), Gaps = 5/124 (4%)
Frame = +1
Query: 934 DARFQTVDLDRIQTTVTVVDEVKKSKKKKKKVAAAAAVVPLATVSSREETEERSGR---P 1104
+ TVD ++ V+ K +KKKKKK + + S E+TEE S + P
Sbjct: 277 ETAISTVDEEKSDGKKVVISGEKSNKKKKKKKMTVTTTL-ITESKSLEDTEETSLKRTGP 335
Query: 1105 MLKLDYDGVLEAWSDKATPFGEESEVTGA-DVNARLAQIDLLGDSGVREASVLRYKEKRQ 1281
+LKLDYDGVLEAWSDK +PF +E + + A DVNARLAQIDL GDSG+REASVLRYKEKR+
Sbjct: 336 LLKLDYDGVLEAWSDKTSPFPDEIQGSEAVDVNARLAQIDLFGDSGMREASVLRYKEKRR 395
Query: 1282 TRLF 1293
TRLF
Sbjct: 396 TRLF 399
>gi|30696838|ref|NP_851201.1| chloroplast import apparatus 2 protein
[Arabidopsis thaliana]
Length = 424
Score = 119 bits (296), Expect = 2e-024
Identities = 68/124 (54%), Positives = 84/124 (67%), Gaps = 5/124 (4%)
Frame = +1
Query: 934 DARFQTVDLDRIQTTVTVVDEVKKSKKKKKKVAAAAAVVPLATVSSREETEERSGR---P 1104
+ TVD ++ V+ K +KKKKKK + + S E+TEE S + P
Sbjct: 277 ETAISTVDEEKSDGKKVVISGEKSNKKKKKKKMTVTTTL-ITESKSLEDTEETSLKRTGP 335
Query: 1105 MLKLDYDGVLEAWSDKATPFGEESEVTGA-DVNARLAQIDLLGDSGVREASVLRYKEKRQ 1281
+LKLDYDGVLEAWSDK +PF +E + + A DVNARLAQIDL GDSG+REASVLRYKEKR+
Sbjct: 336 LLKLDYDGVLEAWSDKTSPFPDEIQGSEAVDVNARLAQIDLFGDSGMREASVLRYKEKRR 395
Query: 1282 TRLF 1293
TRLF
Sbjct: 396 TRLF 399
>gi|147784441|emb|CAN72725.1| hypothetical protein VITISV_015092 [Vitis
vinifera]
Length = 392
Score = 119 bits (296), Expect = 2e-024
Identities = 86/202 (42%), Positives = 107/202 (52%), Gaps = 16/202 (7%)
Frame = +1
Query: 718 FEFSDEFESILVEEVEEGGVDSIMGKLEPGSGHRRGV------NHRFDHQIVKGFKFAEN 879
FE + ESIL EE+ EGG+DSIMG L + N + + I G F
Sbjct: 162 FEEDFDAESILDEEI-EGGIDSIMGNLSVDNEMSDEATNPVCFNSYYGNGIPMGLGFGGK 220
Query: 880 IPLGLGLR---TALRDHNNDDDARFQTVDLDRIQTTVTVVDEVKKSKKKKKKVAAAAAVV 1050
G G+R ALR + D RF TVD+ I V ++KKKKKV A +
Sbjct: 221 FEFGFGMRRGVRALRHVDEGDWWRFPTVDILEISPKFNKV----SAEKKKKKVEKAQELR 276
Query: 1051 PLATVSSREETEERSGRPMLKLDYDGVLEAWSDKATPFGEESEVTGADVNARLAQIDLLG 1230
+ + S +LKL+YD VL AWSD+ +PF E+E G D ARLAQIDL
Sbjct: 277 SWESPKGNSIPKSNSSL-LLKLNYDDVLSAWSDRGSPFSRETEFPGNDTAARLAQIDLFS 335
Query: 1231 D-SGVREASVLRYKEKRQTRLF 1293
+ GVREASVLRYKEKR+TRLF
Sbjct: 336 ECGGVREASVLRYKEKRRTRLF 357
Score = 77 bits (187), Expect = 7e-012
Identities = 42/66 (63%), Positives = 50/66 (75%), Gaps = 4/66 (6%)
Frame = +3
Query: 279 SACLSSGASAAYSFELEKIKSPPSSSTTTTRATSPSSTITESSNS---ISTRKPRTQRKR 449
S+CL SGA Y FELE +K P S+S T+ ++SPSSTI+ESSNS ISTRK RT RKR
Sbjct: 2 SSCL-SGAGRTYGFELEIVKXPSSTSPRTSHSSSPSSTISESSNSPIAISTRKXRTPRKR 60
Query: 450 PNQSYN 467
PNQ+YN
Sbjct: 61 PNQTYN 66
>gi|225470656|ref|XP_002268213.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 392
Score = 119 bits (296), Expect = 2e-024
Identities = 86/202 (42%), Positives = 107/202 (52%), Gaps = 16/202 (7%)
Frame = +1
Query: 718 FEFSDEFESILVEEVEEGGVDSIMGKLEPGSGHRRGV------NHRFDHQIVKGFKFAEN 879
FE + ESIL EE+ EGG+DSIMG L + N + + I G F
Sbjct: 162 FEEDFDAESILDEEI-EGGIDSIMGNLSVDNEMSDEATNPVCFNSYYGNGIPMGLGFGGK 220
Query: 880 IPLGLGLR---TALRDHNNDDDARFQTVDLDRIQTTVTVVDEVKKSKKKKKKVAAAAAVV 1050
G G+R ALR + D RF TVD+ I V ++KKKKKV A +
Sbjct: 221 FEFGFGMRRGVRALRHVDEGDWWRFPTVDILEISPKFNKV----SAEKKKKKVEKAQELR 276
Query: 1051 PLATVSSREETEERSGRPMLKLDYDGVLEAWSDKATPFGEESEVTGADVNARLAQIDLLG 1230
+ + S +LKL+YD VL AWSD+ +PF E+E G D ARLAQIDL
Sbjct: 277 SWESPKGNSIPKSNSSL-LLKLNYDDVLSAWSDRGSPFSRETEFPGNDTAARLAQIDLFS 335
Query: 1231 D-SGVREASVLRYKEKRQTRLF 1293
+ GVREASVLRYKEKR+TRLF
Sbjct: 336 ECGGVREASVLRYKEKRRTRLF 357
Score = 82 bits (200), Expect = 2e-013
Identities = 44/66 (66%), Positives = 52/66 (78%), Gaps = 4/66 (6%)
Frame = +3
Query: 279 SACLSSGASAAYSFELEKIKSPPSSSTTTTRATSPSSTITESSNS---ISTRKPRTQRKR 449
S+CL SGA Y FELE +KSP S+S T+ ++SPSSTI+ESSNS ISTRKPRT RKR
Sbjct: 2 SSCL-SGAGRTYGFELEIVKSPSSTSPRTSHSSSPSSTISESSNSPIAISTRKPRTPRKR 60
Query: 450 PNQSYN 467
PNQ+YN
Sbjct: 61 PNQTYN 66
>gi|297799402|ref|XP_002867585.1| hypothetical protein ARALYDRAFT_354188
[Arabidopsis lyrata subsp. lyrata]
Length = 392
Score = 112 bits (280), Expect = 1e-022
Identities = 63/124 (50%), Positives = 80/124 (64%), Gaps = 5/124 (4%)
Frame = +1
Query: 931 DDARFQTVDLDRIQTTVTVVDEVKKSKKKKKKVAAAAAVVPLATVSSREETEERSGRPML 1110
DD + +D + +T VT + K+ KKKKKK A L S + E+R P+L
Sbjct: 234 DDGQSNVLDSNNNKTIVT-AEGGKRKKKKKKKTKVAPPTAELEVPDSSPKMEQRVS-PLL 291
Query: 1111 KLDYDGVLEAWSDKATPFGEE---SEVTGADVNARLAQIDLLGDSGVREASVLRYKEKRQ 1281
KLDYDGVLEAWS K +PF +E S+ G D +ARL +IDL G+SG+REASVLRYKEKR+
Sbjct: 292 KLDYDGVLEAWSGKESPFSDEILGSDAAGVDFHARLGEIDLFGESGMREASVLRYKEKRR 351
Query: 1282 TRLF 1293
RLF
Sbjct: 352 NRLF 355
>gi|255554380|ref|XP_002518229.1| CIL, putative [Ricinus communis]
Length = 397
Score = 89 bits (219), Expect = 1e-015
Identities = 53/99 (53%), Positives = 63/99 (63%), Gaps = 6/99 (6%)
Frame = +1
Query: 1000 KKSKKKKKKVAAAAAVVPLATVSSREETEERSGRPMLKLDYDGVLEAWSDKATPFGEE-- 1173
K +KKKKKV + L + S +LKL+YDGVL AWSDK +PF EE
Sbjct: 276 KSDEKKKKKV---VELKNLEMAKKESSVPQSSSGLLLKLNYDGVLNAWSDKGSPFSEEIS 332
Query: 1174 SEVTGADVNARLAQIDLLGDS-GVREASVLRYKEKRQTR 1287
G DV+ARLAQIDL ++ GVREASVLRYKEKR+TR
Sbjct: 333 GSEGGNDVSARLAQIDLFSENGGVREASVLRYKEKRRTR 371
>gi|224142697|ref|XP_002324691.1| predicted protein [Populus trichocarpa]
Length = 141
Score = 89 bits (218), Expect = 2e-015
Identities = 44/67 (65%), Positives = 54/67 (80%), Gaps = 2/67 (2%)
Frame = +1
Query: 1099 RPMLKLDYDGVLEAWSDKATPFGEES--EVTGADVNARLAQIDLLGDSGVREASVLRYKE 1272
R +LKL+YD VL AWSD+ +PF EE+ G DV+ARLAQIDL ++G+REASVLRYKE
Sbjct: 29 RLILKLNYDNVLNAWSDRGSPFSEETMGSAEGTDVSARLAQIDLFSENGMREASVLRYKE 88
Query: 1273 KRQTRLF 1293
KR+TRLF
Sbjct: 89 KRRTRLF 95
>gi|224087284|ref|XP_002308112.1| predicted protein [Populus trichocarpa]
Length = 448
Score = 84 bits (205), Expect = 6e-014
Identities = 42/65 (64%), Positives = 52/65 (80%), Gaps = 2/65 (3%)
Frame = +1
Query: 1105 MLKLDYDGVLEAWSDKATPFGEESE--VTGADVNARLAQIDLLGDSGVREASVLRYKEKR 1278
+LKL+YD VL WSD+ +PF +ES G DV+ARLAQIDL ++G+REASVLRYKEKR
Sbjct: 344 ILKLNYDHVLSEWSDRGSPFSDESMGCAEGNDVSARLAQIDLFSENGMREASVLRYKEKR 403
Query: 1279 QTRLF 1293
+TRLF
Sbjct: 404 RTRLF 408
Score = 74 bits (180), Expect = 5e-011
Identities = 40/67 (59%), Positives = 48/67 (71%), Gaps = 3/67 (4%)
Frame = +3
Query: 276 MSACLSSGASAAYSFELEKIKSPPSSSTTTTRATSPSSTITESSNS---ISTRKPRTQRK 446
MS+ SG Y F+LE +KS +SST T+ +SPSSTI+ESSNS ISTRK RT RK
Sbjct: 1 MSSPCISGGGRTYGFDLEIVKSSSTSSTRTSHTSSPSSTISESSNSPLAISTRKSRTPRK 60
Query: 447 RPNQSYN 467
RPNQ+YN
Sbjct: 61 RPNQTYN 67
>gi|79331135|ref|NP_001032087.1| chloroplast import apparatus 2 protein
[Arabidopsis thaliana]
Length = 376
Score = 67 bits (161), Expect = 8e-009
Identities = 42/99 (42%), Positives = 56/99 (56%), Gaps = 5/99 (5%)
Frame = +1
Query: 934 DARFQTVDLDRIQTTVTVVDEVKKSKKKKKKVAAAAAVVPLATVSSREETEERSGR---P 1104
+ TVD ++ V+ K +KKKKKK + + S E+TEE S + P
Sbjct: 277 ETAISTVDEEKSDGKKVVISGEKSNKKKKKKKMTVTTTL-ITESKSLEDTEETSLKRTGP 335
Query: 1105 MLKLDYDGVLEAWSDKATPFGEESEVTGA-DVNARLAQI 1218
+LKLDYDGVLEAWSDK +PF +E + + A DVN L I
Sbjct: 336 LLKLDYDGVLEAWSDKTSPFPDEIQGSEAVDVNVCLTLI 374
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,531,826,676,690
Number of Sequences: 15229318
Number of Extensions: 3531826676690
Number of Successful Extensions: 836928167
Number of sequences better than 0.0: 0
|