BLASTX 7.6.2
Query= UN66920 /QuerySize=734
(733 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gi|297821008|ref|XP_002878387.1| pentatricopeptide repeat-contai...   307   1e-081
gi|21537126|gb|AAM61467.1| unknown [Arabidopsis thaliana]             298   6e-079
gi|15233137|ref|NP_191711.1| pentatricopeptide repeat-containing...   296   2e-078
gi|22327132|ref|NP_680234.1| pentatricopeptide repeat-containing...   295   7e-078
gi|15241779|ref|NP_198189.1| pentatricopeptide repeat-containing...   295   7e-078
gi|225454300|ref|XP_002275491.1| PREDICTED: hypothetical protein...   178   9e-043
gi|224130398|ref|XP_002320827.1| predicted protein [Populus tric...   159   4e-037
gi|255541716|ref|XP_002511922.1| pentatricopeptide repeat-contai...   152   7e-035
>gi|297821008|ref|XP_002878387.1| pentatricopeptide repeat-containing protein
[Arabidopsis lyrata subsp. lyrata]
Length = 766
Score = 307 bits (785), Expect = 1e-081
Identities = 171/247 (69%), Positives = 195/247 (78%), Gaps = 9/247 (3%)
Frame = -1
Query: 733 ETLHSLLSSLSLCISLALSKITRRLGSYSLAISFLHYIQSLN---NHREESLSLVFQSVV 563
E+L SLL S S L S+ITRRLGSYSLAISF Y+ S + REESLSL QSV+
Sbjct: 64 ESLSSLLVSSSSHSPLVFSQITRRLGSYSLAISFFEYLDSKSQSLKRREESLSLALQSVI 123
Query: 562 EFAGGSEPDSKDKLLSLYATAKEKNIPLTPVAAKLLIRWFSRMGMVNQSVHVYEGLDSSV 383
EFA GSEPDS+DKLL LY AKEKNIPLT VA KLLIRWF RMGM NQSV VYE LDS++
Sbjct: 124 EFA-GSEPDSRDKLLRLYEIAKEKNIPLTVVATKLLIRWFGRMGMANQSVLVYERLDSNM 182
Query: 382 KNTTQVRNVLIDVLFRNERVDDAFKVLDEMLCCEEGSVVFRPNRITAEIVFHEVWRGRLL 203
KN +QVRNV+IDVL RN VDDAFKVLDEML E VF PNRITA+IV HEVW+GRLL
Sbjct: 183 KN-SQVRNVVIDVLLRNGLVDDAFKVLDEMLQKES---VFPPNRITADIVLHEVWKGRLL 238
Query: 202 KEDEVVGLISRFGSHGVAPNCVWLTRFITSLCRDARRTNIAWEVLSGLMKNGALLQAPSF 23
E++++GLISRF SHGV+PN VWLTRFI+SLC++A RTN AW++LS LMKN A L+AP F
Sbjct: 239 TEEKIIGLISRFSSHGVSPNSVWLTRFISSLCKNA-RTNAAWDILSDLMKNKAPLEAPPF 297
Query: 22 NALLTAL 2
NALL+ L
Sbjct: 298 NALLSCL 304
>gi|21537126|gb|AAM61467.1| unknown [Arabidopsis thaliana]
Length = 766
Score = 298 bits (762), Expect = 6e-079
Identities = 167/248 (67%), Positives = 193/248 (77%), Gaps = 11/248 (4%)
Frame = -1
Query: 733 ETLHSLLSSLSLCISLALSKITRRLGSYSLAISFLHYI----QSLNNHREESLSLVFQSV 566
E+L +L+ S S L S+ITRRLGSYSLAISF Y+ QSL REESLSL QSV
Sbjct: 64 ESLSALVVSSSSASPLVFSQITRRLGSYSLAISFFEYLDAKSQSL-KRREESLSLALQSV 122
Query: 565 VEFAGGSEPDSKDKLLSLYATAKEKNIPLTPVAAKLLIRWFSRMGMVNQSVHVYEGLDSS 386
+EFA GSEPD +DKLL LY AKEKNIPLT VA KLLIRWF RMGMVNQSV VYE LDS+
Sbjct: 123 IEFA-GSEPDPRDKLLRLYEIAKEKNIPLTVVATKLLIRWFGRMGMVNQSVLVYERLDSN 181
Query: 385 VKNTTQVRNVLIDVLFRNERVDDAFKVLDEMLCCEEGSVVFRPNRITAEIVFHEVWRGRL 206
+KN +QVRNV++DVL RN VDDAFKVLDEML E VF PNRITA+IV HEVW+GRL
Sbjct: 182 MKN-SQVRNVVVDVLLRNGLVDDAFKVLDEMLQKES---VFPPNRITADIVLHEVWKGRL 237
Query: 205 LKEDEVVGLISRFGSHGVAPNCVWLTRFITSLCRDARRTNIAWEVLSGLMKNGALLQAPS 26
L E++++ LISRF SHGV+PN VWLTRFI+SLC++A R N AW++LS LMKN L+AP
Sbjct: 238 LTEEKIIALISRFSSHGVSPNSVWLTRFISSLCKNA-RANAAWDILSDLMKNKTPLEAPP 296
Query: 25 FNALLTAL 2
FNALL+ L
Sbjct: 297 FNALLSCL 304
>gi|15233137|ref|NP_191711.1| pentatricopeptide repeat-containing protein
[Arabidopsis thaliana]
Length = 766
Score = 296 bits (757), Expect = 2e-078
Identities = 166/248 (66%), Positives = 192/248 (77%), Gaps = 11/248 (4%)
Frame = -1
Query: 733 ETLHSLLSSLSLCISLALSKITRRLGSYSLAISFLHYI----QSLNNHREESLSLVFQSV 566
E+L +L+ S S L S+ITRRLGSYSLAISF Y+ QSL REESLSL QSV
Sbjct: 64 ESLSALVVSSSSASPLVFSQITRRLGSYSLAISFFEYLDAKSQSL-KRREESLSLALQSV 122
Query: 565 VEFAGGSEPDSKDKLLSLYATAKEKNIPLTPVAAKLLIRWFSRMGMVNQSVHVYEGLDSS 386
+EFA GSEPD +DKLL LY AKEKNIPLT VA LLIRWF RMGMVNQSV VYE LDS+
Sbjct: 123 IEFA-GSEPDPRDKLLRLYEIAKEKNIPLTVVATNLLIRWFGRMGMVNQSVLVYERLDSN 181
Query: 385 VKNTTQVRNVLIDVLFRNERVDDAFKVLDEMLCCEEGSVVFRPNRITAEIVFHEVWRGRL 206
+KN +QVRNV++DVL RN VDDAFKVLDEML E VF PNRITA+IV HEVW+GRL
Sbjct: 182 MKN-SQVRNVVVDVLLRNGLVDDAFKVLDEMLQKES---VFPPNRITADIVLHEVWKGRL 237
Query: 205 LKEDEVVGLISRFGSHGVAPNCVWLTRFITSLCRDARRTNIAWEVLSGLMKNGALLQAPS 26
L E++++ LISRF SHGV+PN VWLTRFI+SLC++A R N AW++LS LMKN L+AP
Sbjct: 238 LTEEKIIALISRFSSHGVSPNSVWLTRFISSLCKNA-RANAAWDILSDLMKNKTPLEAPP 296
Query: 25 FNALLTAL 2
FNALL+ L
Sbjct: 297 FNALLSCL 304
>gi|22327132|ref|NP_680234.1| pentatricopeptide repeat-containing protein
[Arabidopsis thaliana]
Length = 766
Score = 295 bits (753), Expect = 7e-078
Identities = 166/248 (66%), Positives = 192/248 (77%), Gaps = 11/248 (4%)
Frame = -1
Query: 733 ETLHSLLSSLSLCISLALSKITRRLGSYSLAISFLHYI----QSLNNHREESLSLVFQSV 566
E+L +L+ S S L S+ITRRLGSYSLAISF Y+ QSL REESLSL QSV
Sbjct: 64 ESLSALVVSSSSASPLVFSQITRRLGSYSLAISFFEYLDAKSQSL-KRREESLSLALQSV 122
Query: 565 VEFAGGSEPDSKDKLLSLYATAKEKNIPLTPVAAKLLIRWFSRMGMVNQSVHVYEGLDSS 386
+EFA GSEPD +DKLL LY AKEKNIPLT VA KLLIRWF RMGMVNQSV VYE LDS+
Sbjct: 123 IEFA-GSEPDPRDKLLRLYEIAKEKNIPLTIVATKLLIRWFGRMGMVNQSVLVYERLDSN 181
Query: 385 VKNTTQVRNVLIDVLFRNERVDDAFKVLDEMLCCEEGSVVFRPNRITAEIVFHEVWRGRL 206
+KN +QVRNV++DVL RN VDDAFKVLDEML E VF PNRITA+IV HEVW+ RL
Sbjct: 182 MKN-SQVRNVVVDVLLRNGLVDDAFKVLDEMLQKES---VFPPNRITADIVLHEVWKERL 237
Query: 205 LKEDEVVGLISRFGSHGVAPNCVWLTRFITSLCRDARRTNIAWEVLSGLMKNGALLQAPS 26
L E++++ LISRF SHGV+PN VWLTRFI+SLC++A R N AW++LS LMKN L+AP
Sbjct: 238 LTEEKIIALISRFSSHGVSPNSVWLTRFISSLCKNA-RANTAWDILSDLMKNKTPLEAPP 296
Query: 25 FNALLTAL 2
FNALL+ L
Sbjct: 297 FNALLSCL 304
>gi|15241779|ref|NP_198189.1| pentatricopeptide repeat-containing protein
[Arabidopsis thaliana]
Length = 727
Score = 295 bits (753), Expect = 7e-078
Identities = 166/248 (66%), Positives = 192/248 (77%), Gaps = 11/248 (4%)
Frame = -1
Query: 733 ETLHSLLSSLSLCISLALSKITRRLGSYSLAISFLHYI----QSLNNHREESLSLVFQSV 566
E+L +L+ S S L S+ITRRLGSYSLAISF Y+ QSL REESLSL QSV
Sbjct: 64 ESLSALVVSSSSASPLVFSQITRRLGSYSLAISFFEYLDAKSQSL-KRREESLSLALQSV 122
Query: 565 VEFAGGSEPDSKDKLLSLYATAKEKNIPLTPVAAKLLIRWFSRMGMVNQSVHVYEGLDSS 386
+EFA GSEPD +DKLL LY AKEKNIPLT VA KLLIRWF RMGMVNQSV VYE LDS+
Sbjct: 123 IEFA-GSEPDPRDKLLRLYEIAKEKNIPLTIVATKLLIRWFGRMGMVNQSVLVYERLDSN 181
Query: 385 VKNTTQVRNVLIDVLFRNERVDDAFKVLDEMLCCEEGSVVFRPNRITAEIVFHEVWRGRL 206
+KN +QVRNV++DVL RN VDDAFKVLDEML E VF PNRITA+IV HEVW+ RL
Sbjct: 182 MKN-SQVRNVVVDVLLRNGLVDDAFKVLDEMLQKES---VFPPNRITADIVLHEVWKERL 237
Query: 205 LKEDEVVGLISRFGSHGVAPNCVWLTRFITSLCRDARRTNIAWEVLSGLMKNGALLQAPS 26
L E++++ LISRF SHGV+PN VWLTRFI+SLC++A R N AW++LS LMKN L+AP
Sbjct: 238 LTEEKIIALISRFSSHGVSPNSVWLTRFISSLCKNA-RANTAWDILSDLMKNKTPLEAPP 296
Query: 25 FNALLTAL 2
FNALL+ L
Sbjct: 297 FNALLSCL 304
>gi|225454300|ref|XP_002275491.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 765
Score = 178 bits (450), Expect = 9e-043
Identities = 111/233 (47%), Positives = 150/233 (64%), Gaps = 13/233 (5%)
Frame = -1
Query: 682 LSKITRRLGSYSLAISFLHYIQSLNNHREES--LSLVFQSVVEFAGGSEPDSKDKLLSLY 509
L +ITR LGS + A+ F +++Q+ N+ ++S LS ++V E A EP+S +KLL L+
Sbjct: 90 LLQITRLLGSTAKALKFFNWVQA-NSPCQDSPLLSFTLEAVFEHA-SREPNSHNKLLDLF 147
Query: 508 ATAKEKNIPLTPVAAKLLIRWFSRMGMVNQSVHVYEGLDSSVKNTTQVRNVLIDVLFRNE 329
T+K IPL+ AA LLIR F R MV++S VY L S + T +RN+LIDVLFR
Sbjct: 148 KTSKSHKIPLSVNAATLLIRCFGRAQMVDESFLVYNELCPS-RRLTHIRNILIDVLFRKG 206
Query: 328 RVDDAFKVLDEMLCCEEGSVVFRPNRITAEIVFHEVWR----GRLLKEDEVVGLISRFGS 161
RVDDA +LDEML + F PN T IVF + + GR + E+E+VGL+S+F
Sbjct: 207 RVDDALHLLDEML---QPKAEFPPNSNTGHIVFSALSKRDKVGRAVDEEEIVGLVSKFAE 263
Query: 160 HGVAPNCVWLTRFITSLCRDARRTNIAWEVLSGLMKNGALLQAPSFNALLTAL 2
H V PN +WLT+ I+ LCR RT+ AW+VL GLMK G +++A S NALLTAL
Sbjct: 264 HEVFPNSIWLTQLISRLCRSG-RTDRAWDVLHGLMKLGGVMEAASCNALLTAL 315
>gi|224130398|ref|XP_002320827.1| predicted protein [Populus trichocarpa]
Length = 775
Score = 159 bits (401), Expect = 4e-037
Identities = 99/230 (43%), Positives = 143/230 (62%), Gaps = 11/230 (4%)
Frame = -1
Query: 676 KITRRLGSYSLAISFLHYIQSLNNHREES---LSLVFQSVVEFAGGSEPDSKDKLLSLYA 506
+ITRRL S S A+ FL+Y+Q+ + ++ LS FQ++ E A EPDS L LY
Sbjct: 88 QITRRLPSSSQALKFLNYLQNNSPSSPDTQSLLSYTFQAIFELA-FCEPDSNANLSRLYK 146
Query: 505 TAKEKNIPLTPVAAKLLIRWFSRMGMVNQSVHVYEGLDSSVKNTTQVRNVLIDVLFRNER 326
T+KE NIPLT AA L+R R +V +S+ ++ LD SVKN T +RNV + +L R+ R
Sbjct: 147 TSKELNIPLTVNAASFLLRASGRSELVEESLILFNDLDPSVKN-TYLRNVWLSILLRSGR 205
Query: 325 VDDAFKVLDEMLCCEEGSVVFRPNRITAEIVFHEVWR----GRLLKEDEVVGLISRFGSH 158
V DA KV+DEM + S RPN T +I+F + + LL EDE+V L+ +FG H
Sbjct: 206 VKDALKVIDEMFESNDDSNC-RPNDATGDILFSFLLKRERNEELLSEDEIVNLVLKFGEH 264
Query: 157 GVAPNCVWLTRFITSLCRDARRTNIAWEVLSGLMKNGALLQAPSFNALLT 8
GV + W+ R IT LCR+ R+TN W++ + ++K GA+L++ + N+LLT
Sbjct: 265 GVLISSFWMGRLITRLCRN-RKTNRGWDLFTEMIKLGAVLESAACNSLLT 313
>gi|255541716|ref|XP_002511922.1| pentatricopeptide repeat-containing protein,
putative [Ricinus communis]
Length = 346
Score = 152 bits (382), Expect = 7e-035
Identities = 99/232 (42%), Positives = 143/232 (61%), Gaps = 12/232 (5%)
Frame = -1
Query: 682 LSKITRRLGSYSLAISFLHYIQ-SLNNHREESLSLVFQSVVEFAGGSEPDSKDKLLSLYA 506
L +I RRL S S A+ FL Y+Q + + LS FQ++ E A E DS+ L LY
Sbjct: 95 LFQIARRLPSSSQALKFLKYLQNNFPTSNTQHLSSTFQAIFELA-SRENDSRTNLYELYK 153
Query: 505 TAKEKNIPLTPVAAKLLIRWFSRMGMVNQSVHVYEGLDSSVKNTTQVRNVLIDVLFRNER 326
+KE NIPLT +A LL+R+F R+G+V +S ++ L+ +KN T VRN+++D+L R+ R
Sbjct: 154 VSKEWNIPLTINSATLLLRFFGRIGLVEKSFILFNELEHCIKN-THVRNLMVDLLLRDGR 212
Query: 325 VDDAFKVLDEMLCCEEGSVV-FRPNRITAEIVFHEVW---RGRLLKEDEVVGLISRFGSH 158
VDDAFKVLDEML + GS RP+ +T +I+F W R +L+ +E+V ++ + G
Sbjct: 213 VDDAFKVLDEML--QPGSEFDLRPDDVTGDIIF--TWLMKREKLVSPEEIVEVVLKLGKF 268
Query: 157 GVAPNCVWLTRFITSLCRDARRTNIAWEVLSGLMKNGALLQAPSFNALLTAL 2
V PN + LT+ I LCR T+ A+++L LM GA ++A NALLT L
Sbjct: 269 DVFPNSIRLTQTIGQLCRTG-NTSKAYDLLIELMTLGAAIKAAPCNALLTGL 319
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 283,601,666,890
Number of Sequences: 15229318
Number of Extensions: 283601666890
Number of Successful Extensions: 91682752
Number of sequences better than 0.0: 0