BLASTX 7.6.2
Query= UN70603 /QuerySize=740
(739 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Sequences producing significant alignments: Score (bits) E Value
gi|297848992|ref|XP_002892377.1| predicted protein [Arabidopsis ... 289 4e-076
gi|18390719|ref|NP_563778.1| CCT motif family protein [Arabidops... 288 8e-076
gi|8954041|gb|AAF82215.1|AC067971_23 Contains similarity to a pu... 198 9e-049
gi|224055401|ref|XP_002298500.1| predicted protein [Populus tric... 137 2e-030
gi|217071490|gb|ACJ84105.1| unknown [Medicago truncatula] 131 2e-028
gi|297745837|emb|CBI15893.3| unnamed protein product [Vitis vini... 116 4e-024
gi|225434470|ref|XP_002274679.1| PREDICTED: hypothetical protein... 89 7e-016
gi|224087284|ref|XP_002308112.1| predicted protein [Populus tric... 84 1e-014
gi|255558136|ref|XP_002520096.1| hypothetical protein RCOM_17098... 84 2e-014
gi|224142697|ref|XP_002324691.1| predicted protein [Populus tric... 82 9e-014
gi|30687076|ref|NP_849445.1| CCT motif family protein [Arabidops... 80 2e-013
gi|4538928|emb|CAB39664.1| putative protein [Arabidopsis thaliana] 80 2e-013
gi|147784441|emb|CAN72725.1| hypothetical protein VITISV_015092 ... 80 2e-013
gi|18416659|ref|NP_567737.1| CCT motif family protein [Arabidops... 79 6e-013
gi|30696838|ref|NP_851201.1| chloroplast import apparatus 2 prot... 76 4e-012
gi|30696840|ref|NP_568852.2| chloroplast import apparatus 2 prot... 73 3e-011
gi|297793237|ref|XP_002864503.1| hypothetical protein ARALYDRAFT... 73 4e-011
gi|255572020|ref|XP_002526951.1| hypothetical protein RCOM_05312... 70 3e-010
gi|7573462|emb|CAB87776.1| putative protein [Arabidopsis thaliana] 70 4e-010
gi|225452242|ref|XP_002268929.1| PREDICTED: hypothetical protein... 69 6e-010
>gi|297848992|ref|XP_002892377.1| predicted protein [Arabidopsis lyrata subsp.
lyrata]
Length = 194
Score = 289 bits (738), Expect = 4e-076
Identities = 144/191 (75%), Positives = 163/191 (85%), Gaps = 8/191 (4%)
Frame = +2
Query: 50 QRLPKREEQEEEVADHQRFSIFESINAFSKEQHNHSMDDFDRIFDISIDSLGCSHELNWD 229
QRLPK+EE+E ADH S FESIN S++QHNHS+DDFD IFDI+ID+L CSHEL WD
Sbjct: 4 QRLPKKEEEE---ADHLLSSTFESINGLSRDQHNHSIDDFDSIFDITIDNLSCSHELTWD 60
Query: 230 FWKEEDDEEDVKEEEKSLSTDQEGSSSGFWDNMSTDYEDRELALKLNLNHQEVIDAWSDR 409
FW EED++EDV EEEK LSTDQEGSS GFW+N TDYED++L LKLNLNHQEVIDAWSD
Sbjct: 61 FW-EEDEDEDVGEEEKRLSTDQEGSSFGFWENKPTDYEDKDLGLKLNLNHQEVIDAWSDH 119
Query: 410 RQQPLWTDTSMLMAPANAVYSGEVPVMDEERNTRREASVLRYKEKRQSRLFSKKIRYQVR 589
R +PLWTD + + AN++Y GEVPV++EERN RREASVLRYKEKRQSRLFSKKIRYQVR
Sbjct: 120 R-KPLWTDNTTV---ANSLYKGEVPVIEEERNMRREASVLRYKEKRQSRLFSKKIRYQVR 175
Query: 590 KLNADKRPRFK 622
KLNADKRPRFK
Sbjct: 176 KLNADKRPRFK 186
>gi|18390719|ref|NP_563778.1| CCT motif family protein [Arabidopsis thaliana]
Length = 195
Score = 288 bits (735), Expect = 8e-076
Identities = 145/192 (75%), Positives = 164/192 (85%), Gaps = 7/192 (3%)
Frame = +2
Query: 47 TQRLPKREEQEEEVADHQRFSIFESINAFSKEQHNHSMDDFDRIFDISIDSLGCSHELNW 226
TQRLPK+EE+EEE DH S FESIN S++QHNHS+DDF+ IFDI+ID+L CS+EL W
Sbjct: 3 TQRLPKKEEEEEE--DHLLSSTFESINGHSRDQHNHSIDDFESIFDITIDNLSCSNELTW 60
Query: 227 DFWKEEDDEEDVKEEEKSLSTDQEGSSSGFWDNMSTDYEDRELALKLNLNHQEVIDAWSD 406
DFW EED++EDV EEEK STDQEGSS GFW+N TDYED++L LKLNLNHQEVIDAWSD
Sbjct: 61 DFW-EEDEDEDVGEEEKRSSTDQEGSSFGFWENKPTDYEDKDLGLKLNLNHQEVIDAWSD 119
Query: 407 RRQQPLWTDTSMLMAPANAVYSGEVPVMDEERNTRREASVLRYKEKRQSRLFSKKIRYQV 586
Q+PLWTDTS L N+VY GEVPV++E+RN RREASVLRYKEKRQSRLFSKKIRYQV
Sbjct: 120 -HQKPLWTDTSTL---DNSVYRGEVPVIEEKRNMRREASVLRYKEKRQSRLFSKKIRYQV 175
Query: 587 RKLNADKRPRFK 622
RKLNADKRPRFK
Sbjct: 176 RKLNADKRPRFK 187
>gi|8954041|gb|AAF82215.1|AC067971_23 Contains similarity to a putative protein
F18O22.160 gi|7573462 from Arabidopsis thaliana BAC F18O22 gb|AL163817
Length = 175
Score = 198 bits (502), Expect = 9e-049
Identities = 99/141 (70%), Positives = 115/141 (81%), Gaps = 7/141 (4%)
Frame = +2
Query: 47 TQRLPKREEQEEEVADHQRFSIFESINAFSKEQHNHSMDDFDRIFDISIDSLGCSHELNW 226
TQRLPK+EE+EEE DH S FESIN S++QHNHS+DDF+ IFDI+ID+L CS+EL W
Sbjct: 3 TQRLPKKEEEEEE--DHLLSSTFESINGHSRDQHNHSIDDFESIFDITIDNLSCSNELTW 60
Query: 227 DFWKEEDDEEDVKEEEKSLSTDQEGSSSGFWDNMSTDYEDRELALKLNLNHQEVIDAWSD 406
DFW EED++EDV EEEK STDQEGSS GFW+N TDYED++L LKLNLNHQEVIDAWSD
Sbjct: 61 DFW-EEDEDEDVGEEEKRSSTDQEGSSFGFWENKPTDYEDKDLGLKLNLNHQEVIDAWSD 119
Query: 407 RRQQPLWTDTSMLMAPANAVY 469
Q+PLWTDTS L N+VY
Sbjct: 120 -HQKPLWTDTSTL---DNSVY 136
>gi|224055401|ref|XP_002298500.1| predicted protein [Populus trichocarpa]
Length = 207
Score = 137 bits (343), Expect = 2e-030
Identities = 90/195 (46%), Positives = 116/195 (59%), Gaps = 12/195 (6%)
Frame = +2
Query: 59 PKREEQEEEVADHQRFSIFESINAFSKEQHNHSMDDFDRIFDISIDSLGCSH----ELNW 226
PK+EEQE + S + + + E +D+ + IF I D + S +L W
Sbjct: 10 PKKEEQEVPDTVVPQLSNIKEESYGNGE--IDVVDELEGIFGIEHDEMLSSDNMYGQLGW 67
Query: 227 DFWKEEDDEEDVKEEEKSLSTDQEGSSSGF---WDNMSTDYEDRELALKLNLNHQEVIDA 397
DF E+ EE + + SS F + S D +++ ++L LNLN+QEV++A
Sbjct: 68 DFMDLEEYPAAEDEEGSEMFKFGDSRSSFFEFEESHYSIDGDEKGVSLNLNLNYQEVLEA 127
Query: 398 WSDRRQQPLWTDTSMLMAPANAVYSGEVPVMDEERNTRREASVLRYKEKRQSRLFSKKIR 577
WSDR PL D L +N Y GEVPVM+E+R TRREASVLRYKEKRQ+RLFSKKIR
Sbjct: 128 WSDR--GPLLADDHSLSTASNGHYMGEVPVMEEDR-TRREASVLRYKEKRQTRLFSKKIR 184
Query: 578 YQVRKLNADKRPRFK 622
YQVRKLNADKRPR K
Sbjct: 185 YQVRKLNADKRPRLK 199
>gi|217071490|gb|ACJ84105.1| unknown [Medicago truncatula]
Length = 217
Score = 131 bits (327), Expect = 2e-028
Identities = 79/149 (53%), Positives = 103/149 (69%), Gaps = 19/149 (12%)
Frame = +2
Query: 209 SHELNWDF--WK----EEDDEEDVKE---EEKSLSTDQEGSSSGFWDNMSTDYEDRELAL 361
S+EL+WD W+ +E +E++ + EEK + +E + GFW+ D +++ LAL
Sbjct: 70 SNELHWDIMEWEGFSFDEGKDENISKCNYEEKKI-IKRENYNDGFWE---VD-DEKSLAL 124
Query: 362 KLNLNHQEVIDAWSDRRQQPLWTDTSML--MAPANAVYSGEVPVMDEERNTRREASVLRY 535
LNLN+QEV+DAWS+R LW + L + N Y GEVP+++EER RREASVLRY
Sbjct: 125 NLNLNYQEVLDAWSNRGS--LWANDCSLSFSSSNNGSYMGEVPILEEER-ARREASVLRY 181
Query: 536 KEKRQSRLFSKKIRYQVRKLNADKRPRFK 622
KEKRQ+RLFSKKIRYQVRKLNADKRPR K
Sbjct: 182 KEKRQNRLFSKKIRYQVRKLNADKRPRIK 210
>gi|297745837|emb|CBI15893.3| unnamed protein product [Vitis vinifera]
Length = 101
Score = 116 bits (289), Expect = 4e-024
Identities = 63/86 (73%), Positives = 70/86 (81%), Gaps = 4/86 (4%)
Frame = +2
Query: 365 LNLNHQEVIDAWSDRRQQPLWTDTSMLMAPANAVYSGEVPVMDEERNTRREASVLRYKEK 544
LNLN+QEV++AWSDR LW D L + N Y GEVPVM+EE+ TRREASVLRYKEK
Sbjct: 6 LNLNYQEVLEAWSDRGS--LWADDCSL-SFKNTSYMGEVPVMEEEK-TRREASVLRYKEK 61
Query: 545 RQSRLFSKKIRYQVRKLNADKRPRFK 622
RQ+RLFSKKIRYQVRKLNADKRPR K
Sbjct: 62 RQTRLFSKKIRYQVRKLNADKRPRLK 87
>gi|225434470|ref|XP_002274679.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 64
Score = 89 bits (218), Expect = 7e-016
Identities = 45/50 (90%), Positives = 48/50 (96%), Gaps = 1/50 (2%)
Frame = +2
Query: 473 GEVPVMDEERNTRREASVLRYKEKRQSRLFSKKIRYQVRKLNADKRPRFK 622
GEVPVM+EE+ TRREASVLRYKEKRQ+RLFSKKIRYQVRKLNADKRPR K
Sbjct: 2 GEVPVMEEEK-TRREASVLRYKEKRQTRLFSKKIRYQVRKLNADKRPRLK 50
>gi|224087284|ref|XP_002308112.1| predicted protein [Populus trichocarpa]
Length = 448
Score = 84 bits (207), Expect = 1e-014
Identities = 48/88 (54%), Positives = 58/88 (65%), Gaps = 2/88 (2%)
Frame = +2
Query: 359 LKLNLNHQEVIDAWSDRRQQPLWTDTSMLMAPANAVYSGEVPVMDEERNTRREASVLRYK 538
L L LN+ V+ WSDR ++D SM A N V + + N REASVLRYK
Sbjct: 343 LILKLNYDHVLSEWSDRGSP--FSDESMGCAEGNDVSARLAQIDLFSENGMREASVLRYK 400
Query: 539 EKRQSRLFSKKIRYQVRKLNADKRPRFK 622
EKR++RLFSKKIRYQVRK+NAD+RPR K
Sbjct: 401 EKRRTRLFSKKIRYQVRKVNADQRPRMK 428
>gi|255558136|ref|XP_002520096.1| hypothetical protein RCOM_1709850 [Ricinus
communis]
Length = 58
Score = 84 bits (205), Expect = 2e-014
Identities = 44/50 (88%), Positives = 46/50 (92%), Gaps = 1/50 (2%)
Frame = +2
Query: 473 GEVPVMDEERNTRREASVLRYKEKRQSRLFSKKIRYQVRKLNADKRPRFK 622
GEV VM+EER RREASVLRYKEKRQ+RLFSKKIRYQVRKLNADKRPR K
Sbjct: 2 GEVLVMEEER-IRREASVLRYKEKRQTRLFSKKIRYQVRKLNADKRPRLK 50
>gi|224142697|ref|XP_002324691.1| predicted protein [Populus trichocarpa]
Length = 141
Score = 82 bits (200), Expect = 9e-014
Identities = 46/88 (52%), Positives = 59/88 (67%), Gaps = 2/88 (2%)
Frame = +2
Query: 359 LKLNLNHQEVIDAWSDRRQQPLWTDTSMLMAPANAVYSGEVPVMDEERNTRREASVLRYK 538
L L LN+ V++AWSDR +++ +M A V + + N REASVLRYK
Sbjct: 30 LILKLNYDNVLNAWSDRGSP--FSEETMGSAEGTDVSARLAQIDLFSENGMREASVLRYK 87
Query: 539 EKRQSRLFSKKIRYQVRKLNADKRPRFK 622
EKR++RLFSKKIRYQVRK+NAD+RPR K
Sbjct: 88 EKRRTRLFSKKIRYQVRKVNADQRPRMK 115
>gi|30687076|ref|NP_849445.1| CCT motif family protein [Arabidopsis thaliana]
Length = 409
Score = 80 bits (197), Expect = 2e-013
Identities = 56/133 (42%), Positives = 80/133 (60%), Gaps = 10/133 (7%)
Frame = +2
Query: 239 EEDDEEDVKEEEKSLSTDQEGSSSGFWDNMSTDYEDRELALKLNLNHQEVIDAWSDRRQQ 418
E D ++ K+++K ++ S S + + E R L L L++ V++AWS ++
Sbjct: 252 EGDKKKKKKKKKKKVAPAAAESKSSEVTDSNPKLEQRVSPL-LKLDYDGVLEAWSG-KES 309
Query: 419 PLWTDTSMLMAPANA----VYSGEVPVMDEERNTRREASVLRYKEKRQSRLFSKKIRYQV 586
P +L + A+ V GE+ + E + REASVLRYKEKR++RLFSKKIRYQV
Sbjct: 310 PF--SDEILGSDADGVDFHVRLGEIDLFGE--SGMREASVLRYKEKRRNRLFSKKIRYQV 365
Query: 587 RKLNADKRPRFKV 625
RKLNAD+RPR KV
Sbjct: 366 RKLNADQRPRMKV 378
>gi|4538928|emb|CAB39664.1| putative protein [Arabidopsis thaliana]
Length = 378
Score = 80 bits (197), Expect = 2e-013
Identities = 56/133 (42%), Positives = 80/133 (60%), Gaps = 10/133 (7%)
Frame = +2
Query: 239 EEDDEEDVKEEEKSLSTDQEGSSSGFWDNMSTDYEDRELALKLNLNHQEVIDAWSDRRQQ 418
E D ++ K+++K ++ S S + + E R L L L++ V++AWS ++
Sbjct: 252 EGDKKKKKKKKKKKVAPAAAESKSSEVTDSNPKLEQRVSPL-LKLDYDGVLEAWSG-KES 309
Query: 419 PLWTDTSMLMAPANA----VYSGEVPVMDEERNTRREASVLRYKEKRQSRLFSKKIRYQV 586
P +L + A+ V GE+ + E + REASVLRYKEKR++RLFSKKIRYQV
Sbjct: 310 PF--SDEILGSDADGVDFHVRLGEIDLFGE--SGMREASVLRYKEKRRNRLFSKKIRYQV 365
Query: 587 RKLNADKRPRFKV 625
RKLNAD+RPR KV
Sbjct: 366 RKLNADQRPRMKV 378
>gi|147784441|emb|CAN72725.1| hypothetical protein VITISV_015092 [Vitis
vinifera]
Length = 392
Score = 80 bits (197), Expect = 2e-013
Identities = 50/100 (50%), Positives = 63/100 (63%), Gaps = 7/100 (7%)
Frame = +2
Query: 356 ALKLNLNHQEVIDAWSDRRQQPLWTDTSMLMAPAN--AVYSGEVPVMDEERNTRREASVL 529
+L L LN+ +V+ AWSD R P +T P N A ++ + E REASVL
Sbjct: 292 SLLLKLNYDDVLSAWSD-RGSPFSRETEF---PGNDTAARLAQIDLFSECGGV-REASVL 346
Query: 530 RYKEKRQSRLFSKKIRYQVRKLNADKRPRFKVSLSLFPSS 649
RYKEKR++RLFSKKIRYQVRK+NAD+RPR K P+S
Sbjct: 347 RYKEKRRTRLFSKKIRYQVRKVNADRRPRMKGRFVRRPNS 386
>gi|18416659|ref|NP_567737.1| CCT motif family protein [Arabidopsis thaliana]
Length = 394
Score = 79 bits (193), Expect = 6e-013
Identities = 55/132 (41%), Positives = 79/132 (59%), Gaps = 10/132 (7%)
Frame = +2
Query: 239 EEDDEEDVKEEEKSLSTDQEGSSSGFWDNMSTDYEDRELALKLNLNHQEVIDAWSDRRQQ 418
E D ++ K+++K ++ S S + + E R L L L++ V++AWS ++
Sbjct: 252 EGDKKKKKKKKKKKVAPAAAESKSSEVTDSNPKLEQRVSPL-LKLDYDGVLEAWSG-KES 309
Query: 419 PLWTDTSMLMAPANA----VYSGEVPVMDEERNTRREASVLRYKEKRQSRLFSKKIRYQV 586
P +L + A+ V GE+ + E + REASVLRYKEKR++RLFSKKIRYQV
Sbjct: 310 PF--SDEILGSDADGVDFHVRLGEIDLFGE--SGMREASVLRYKEKRRNRLFSKKIRYQV 365
Query: 587 RKLNADKRPRFK 622
RKLNAD+RPR K
Sbjct: 366 RKLNADQRPRMK 377
>gi|30696838|ref|NP_851201.1| chloroplast import apparatus 2 protein
[Arabidopsis thaliana]
Length = 424
Score = 76 bits (186), Expect = 4e-012
Identities = 51/129 (39%), Positives = 73/129 (56%), Gaps = 5/129 (3%)
Frame = +2
Query: 242 EDDEEDVKEEEKSLSTDQEGSSSGFWDNMSTDYEDRELALKLNLNHQEVIDAWSDRRQQP 421
E + K+++ +++T S D T + LK L++ V++AWSD + P
Sbjct: 298 EKSNKKKKKKKMTVTTTLITESKSLEDTEETSLKRTGPLLK--LDYDGVLEAWSD-KTSP 354
Query: 422 LWTDTSMLMAPANAVYSGEVPVMDEERNTRREASVLRYKEKRQSRLFSKKIRYQVRKLNA 601
+ A ++ + + + REASVLRYKEKR++RLFSKKIRYQVRKLNA
Sbjct: 355 FPDEIQGSEAVDVNARLAQIDLFGD--SGMREASVLRYKEKRRTRLFSKKIRYQVRKLNA 412
Query: 602 DKRPRFKVS 628
D+RPR KVS
Sbjct: 413 DQRPRMKVS 421
>gi|30696840|ref|NP_568852.2| chloroplast import apparatus 2 protein
[Arabidopsis thaliana]
Length = 435
Score = 73 bits (178), Expect = 3e-011
Identities = 49/127 (38%), Positives = 71/127 (55%), Gaps = 5/127 (3%)
Frame = +2
Query: 242 EDDEEDVKEEEKSLSTDQEGSSSGFWDNMSTDYEDRELALKLNLNHQEVIDAWSDRRQQP 421
E + K+++ +++T S D T + LK L++ V++AWSD + P
Sbjct: 298 EKSNKKKKKKKMTVTTTLITESKSLEDTEETSLKRTGPLLK--LDYDGVLEAWSD-KTSP 354
Query: 422 LWTDTSMLMAPANAVYSGEVPVMDEERNTRREASVLRYKEKRQSRLFSKKIRYQVRKLNA 601
+ A ++ + + + REASVLRYKEKR++RLFSKKIRYQVRKLNA
Sbjct: 355 FPDEIQGSEAVDVNARLAQIDLFGD--SGMREASVLRYKEKRRTRLFSKKIRYQVRKLNA 412
Query: 602 DKRPRFK 622
D+RPR K
Sbjct: 413 DQRPRMK 419
>gi|297793237|ref|XP_002864503.1| hypothetical protein ARALYDRAFT_495815
[Arabidopsis lyrata subsp. lyrata]
Length = 426
Score = 73 bits (177), Expect = 4e-011
Identities = 43/90 (47%), Positives = 60/90 (66%), Gaps = 9/90 (10%)
Frame = +2
Query: 365 LNLNHQEVIDAWSDRR----QQPLWTDTSMLMAPANAVYSGEVPVMDEERNTRREASVLR 532
L L++ V++AWSD+ + L ++ + + A E+ + + + REASVLR
Sbjct: 326 LKLDYDGVLEAWSDKTSPFPDEILGSEATGIDVNARL---AEIDLFGD--SGMREASVLR 380
Query: 533 YKEKRQSRLFSKKIRYQVRKLNADKRPRFK 622
YKEKR++RLFSKKIRYQVRKLNAD+RPR K
Sbjct: 381 YKEKRRTRLFSKKIRYQVRKLNADQRPRMK 410
>gi|255572020|ref|XP_002526951.1| hypothetical protein RCOM_0531220 [Ricinus
communis]
Length = 276
Score = 70 bits (170), Expect = 3e-010
Identities = 34/44 (77%), Positives = 40/44 (90%)
Frame = +2
Query: 491 DEERNTRREASVLRYKEKRQSRLFSKKIRYQVRKLNADKRPRFK 622
+E + +REASVLRYKEKRQSRLFSK+IRY+VRKLNA+KRPR K
Sbjct: 228 EEWKLGQREASVLRYKEKRQSRLFSKRIRYEVRKLNAEKRPRLK 271
>gi|7573462|emb|CAB87776.1| putative protein [Arabidopsis thaliana]
Length = 347
Score = 70 bits (169), Expect = 4e-010
Identities = 33/41 (80%), Positives = 39/41 (95%)
Frame = +2
Query: 512 REASVLRYKEKRQSRLFSKKIRYQVRKLNADKRPRFKVSLS 634
REAS+LRYKEKRQ+RLFSK+IRYQVRKLNA+KRPR K+ L+
Sbjct: 260 REASLLRYKEKRQNRLFSKRIRYQVRKLNAEKRPRVKLRLN 300
>gi|225452242|ref|XP_002268929.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 310
Score = 69 bits (167), Expect = 6e-010
Identities = 34/49 (69%), Positives = 41/49 (83%)
Frame = +2
Query: 476 EVPVMDEERNTRREASVLRYKEKRQSRLFSKKIRYQVRKLNADKRPRFK 622
EV + + +R+ASVLRYKEKRQSRLFSK+IRY+VRKLNA+KRPR K
Sbjct: 255 EVGGGEGSKKGQRQASVLRYKEKRQSRLFSKRIRYEVRKLNAEKRPRMK 303
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 555,700,182,581
Number of Sequences: 15229318
Number of Extensions: 555700182581
Number of Successful Extensions: 174766259
Number of sequences better than 0.0: 0