GenBank blast output of UN66920


BLASTX 7.6.2

Query= UN66920 /QuerySize=734
        (733 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|297821008|ref|XP_002878387.1| pentatricopeptide repeat-contai...    307   1e-081
gi|21537126|gb|AAM61467.1| unknown [Arabidopsis thaliana]              298   6e-079
gi|15233137|ref|NP_191711.1| pentatricopeptide repeat-containing...    296   2e-078
gi|22327132|ref|NP_680234.1| pentatricopeptide repeat-containing...    295   7e-078
gi|15241779|ref|NP_198189.1| pentatricopeptide repeat-containing...    295   7e-078
gi|225454300|ref|XP_002275491.1| PREDICTED: hypothetical protein...    178   9e-043
gi|224130398|ref|XP_002320827.1| predicted protein [Populus tric...    159   4e-037
gi|255541716|ref|XP_002511922.1| pentatricopeptide repeat-contai...    152   7e-035

>gi|297821008|ref|XP_002878387.1| pentatricopeptide repeat-containing protein
        [Arabidopsis lyrata subsp. lyrata]

          Length = 766

 Score =  307 bits (785), Expect = 1e-081
 Identities = 171/247 (69%), Positives = 195/247 (78%), Gaps = 9/247 (3%)
 Frame = -1

Query: 733 ETLHSLLSSLSLCISLALSKITRRLGSYSLAISFLHYIQSLN---NHREESLSLVFQSVV 563
           E+L SLL S S    L  S+ITRRLGSYSLAISF  Y+ S +     REESLSL  QSV+
Sbjct:  64 ESLSSLLVSSSSHSPLVFSQITRRLGSYSLAISFFEYLDSKSQSLKRREESLSLALQSVI 123

Query: 562 EFAGGSEPDSKDKLLSLYATAKEKNIPLTPVAAKLLIRWFSRMGMVNQSVHVYEGLDSSV 383
           EFA GSEPDS+DKLL LY  AKEKNIPLT VA KLLIRWF RMGM NQSV VYE LDS++
Sbjct: 124 EFA-GSEPDSRDKLLRLYEIAKEKNIPLTVVATKLLIRWFGRMGMANQSVLVYERLDSNM 182

Query: 382 KNTTQVRNVLIDVLFRNERVDDAFKVLDEMLCCEEGSVVFRPNRITAEIVFHEVWRGRLL 203
           KN +QVRNV+IDVL RN  VDDAFKVLDEML  E    VF PNRITA+IV HEVW+GRLL
Sbjct: 183 KN-SQVRNVVIDVLLRNGLVDDAFKVLDEMLQKES---VFPPNRITADIVLHEVWKGRLL 238

Query: 202 KEDEVVGLISRFGSHGVAPNCVWLTRFITSLCRDARRTNIAWEVLSGLMKNGALLQAPSF 23
            E++++GLISRF SHGV+PN VWLTRFI+SLC++A RTN AW++LS LMKN A L+AP F
Sbjct: 239 TEEKIIGLISRFSSHGVSPNSVWLTRFISSLCKNA-RTNAAWDILSDLMKNKAPLEAPPF 297

Query:  22 NALLTAL 2
           NALL+ L
Sbjct: 298 NALLSCL 304

>gi|21537126|gb|AAM61467.1| unknown [Arabidopsis thaliana]

          Length = 766

 Score =  298 bits (762), Expect = 6e-079
 Identities = 167/248 (67%), Positives = 193/248 (77%), Gaps = 11/248 (4%)
 Frame = -1

Query: 733 ETLHSLLSSLSLCISLALSKITRRLGSYSLAISFLHYI----QSLNNHREESLSLVFQSV 566
           E+L +L+ S S    L  S+ITRRLGSYSLAISF  Y+    QSL   REESLSL  QSV
Sbjct:  64 ESLSALVVSSSSASPLVFSQITRRLGSYSLAISFFEYLDAKSQSL-KRREESLSLALQSV 122

Query: 565 VEFAGGSEPDSKDKLLSLYATAKEKNIPLTPVAAKLLIRWFSRMGMVNQSVHVYEGLDSS 386
           +EFA GSEPD +DKLL LY  AKEKNIPLT VA KLLIRWF RMGMVNQSV VYE LDS+
Sbjct: 123 IEFA-GSEPDPRDKLLRLYEIAKEKNIPLTVVATKLLIRWFGRMGMVNQSVLVYERLDSN 181

Query: 385 VKNTTQVRNVLIDVLFRNERVDDAFKVLDEMLCCEEGSVVFRPNRITAEIVFHEVWRGRL 206
           +KN +QVRNV++DVL RN  VDDAFKVLDEML  E    VF PNRITA+IV HEVW+GRL
Sbjct: 182 MKN-SQVRNVVVDVLLRNGLVDDAFKVLDEMLQKES---VFPPNRITADIVLHEVWKGRL 237

Query: 205 LKEDEVVGLISRFGSHGVAPNCVWLTRFITSLCRDARRTNIAWEVLSGLMKNGALLQAPS 26
           L E++++ LISRF SHGV+PN VWLTRFI+SLC++A R N AW++LS LMKN   L+AP 
Sbjct: 238 LTEEKIIALISRFSSHGVSPNSVWLTRFISSLCKNA-RANAAWDILSDLMKNKTPLEAPP 296

Query:  25 FNALLTAL 2
           FNALL+ L
Sbjct: 297 FNALLSCL 304

>gi|15233137|ref|NP_191711.1| pentatricopeptide repeat-containing protein
        [Arabidopsis thaliana]

          Length = 766

 Score =  296 bits (757), Expect = 2e-078
 Identities = 166/248 (66%), Positives = 192/248 (77%), Gaps = 11/248 (4%)
 Frame = -1

Query: 733 ETLHSLLSSLSLCISLALSKITRRLGSYSLAISFLHYI----QSLNNHREESLSLVFQSV 566
           E+L +L+ S S    L  S+ITRRLGSYSLAISF  Y+    QSL   REESLSL  QSV
Sbjct:  64 ESLSALVVSSSSASPLVFSQITRRLGSYSLAISFFEYLDAKSQSL-KRREESLSLALQSV 122

Query: 565 VEFAGGSEPDSKDKLLSLYATAKEKNIPLTPVAAKLLIRWFSRMGMVNQSVHVYEGLDSS 386
           +EFA GSEPD +DKLL LY  AKEKNIPLT VA  LLIRWF RMGMVNQSV VYE LDS+
Sbjct: 123 IEFA-GSEPDPRDKLLRLYEIAKEKNIPLTVVATNLLIRWFGRMGMVNQSVLVYERLDSN 181

Query: 385 VKNTTQVRNVLIDVLFRNERVDDAFKVLDEMLCCEEGSVVFRPNRITAEIVFHEVWRGRL 206
           +KN +QVRNV++DVL RN  VDDAFKVLDEML  E    VF PNRITA+IV HEVW+GRL
Sbjct: 182 MKN-SQVRNVVVDVLLRNGLVDDAFKVLDEMLQKES---VFPPNRITADIVLHEVWKGRL 237

Query: 205 LKEDEVVGLISRFGSHGVAPNCVWLTRFITSLCRDARRTNIAWEVLSGLMKNGALLQAPS 26
           L E++++ LISRF SHGV+PN VWLTRFI+SLC++A R N AW++LS LMKN   L+AP 
Sbjct: 238 LTEEKIIALISRFSSHGVSPNSVWLTRFISSLCKNA-RANAAWDILSDLMKNKTPLEAPP 296

Query:  25 FNALLTAL 2
           FNALL+ L
Sbjct: 297 FNALLSCL 304

>gi|22327132|ref|NP_680234.1| pentatricopeptide repeat-containing protein
        [Arabidopsis thaliana]

          Length = 766

 Score =  295 bits (753), Expect = 7e-078
 Identities = 166/248 (66%), Positives = 192/248 (77%), Gaps = 11/248 (4%)
 Frame = -1

Query: 733 ETLHSLLSSLSLCISLALSKITRRLGSYSLAISFLHYI----QSLNNHREESLSLVFQSV 566
           E+L +L+ S S    L  S+ITRRLGSYSLAISF  Y+    QSL   REESLSL  QSV
Sbjct:  64 ESLSALVVSSSSASPLVFSQITRRLGSYSLAISFFEYLDAKSQSL-KRREESLSLALQSV 122

Query: 565 VEFAGGSEPDSKDKLLSLYATAKEKNIPLTPVAAKLLIRWFSRMGMVNQSVHVYEGLDSS 386
           +EFA GSEPD +DKLL LY  AKEKNIPLT VA KLLIRWF RMGMVNQSV VYE LDS+
Sbjct: 123 IEFA-GSEPDPRDKLLRLYEIAKEKNIPLTIVATKLLIRWFGRMGMVNQSVLVYERLDSN 181

Query: 385 VKNTTQVRNVLIDVLFRNERVDDAFKVLDEMLCCEEGSVVFRPNRITAEIVFHEVWRGRL 206
           +KN +QVRNV++DVL RN  VDDAFKVLDEML  E    VF PNRITA+IV HEVW+ RL
Sbjct: 182 MKN-SQVRNVVVDVLLRNGLVDDAFKVLDEMLQKES---VFPPNRITADIVLHEVWKERL 237

Query: 205 LKEDEVVGLISRFGSHGVAPNCVWLTRFITSLCRDARRTNIAWEVLSGLMKNGALLQAPS 26
           L E++++ LISRF SHGV+PN VWLTRFI+SLC++A R N AW++LS LMKN   L+AP 
Sbjct: 238 LTEEKIIALISRFSSHGVSPNSVWLTRFISSLCKNA-RANTAWDILSDLMKNKTPLEAPP 296

Query:  25 FNALLTAL 2
           FNALL+ L
Sbjct: 297 FNALLSCL 304

>gi|15241779|ref|NP_198189.1| pentatricopeptide repeat-containing protein
        [Arabidopsis thaliana]

          Length = 727

 Score =  295 bits (753), Expect = 7e-078
 Identities = 166/248 (66%), Positives = 192/248 (77%), Gaps = 11/248 (4%)
 Frame = -1

Query: 733 ETLHSLLSSLSLCISLALSKITRRLGSYSLAISFLHYI----QSLNNHREESLSLVFQSV 566
           E+L +L+ S S    L  S+ITRRLGSYSLAISF  Y+    QSL   REESLSL  QSV
Sbjct:  64 ESLSALVVSSSSASPLVFSQITRRLGSYSLAISFFEYLDAKSQSL-KRREESLSLALQSV 122

Query: 565 VEFAGGSEPDSKDKLLSLYATAKEKNIPLTPVAAKLLIRWFSRMGMVNQSVHVYEGLDSS 386
           +EFA GSEPD +DKLL LY  AKEKNIPLT VA KLLIRWF RMGMVNQSV VYE LDS+
Sbjct: 123 IEFA-GSEPDPRDKLLRLYEIAKEKNIPLTIVATKLLIRWFGRMGMVNQSVLVYERLDSN 181

Query: 385 VKNTTQVRNVLIDVLFRNERVDDAFKVLDEMLCCEEGSVVFRPNRITAEIVFHEVWRGRL 206
           +KN +QVRNV++DVL RN  VDDAFKVLDEML  E    VF PNRITA+IV HEVW+ RL
Sbjct: 182 MKN-SQVRNVVVDVLLRNGLVDDAFKVLDEMLQKES---VFPPNRITADIVLHEVWKERL 237

Query: 205 LKEDEVVGLISRFGSHGVAPNCVWLTRFITSLCRDARRTNIAWEVLSGLMKNGALLQAPS 26
           L E++++ LISRF SHGV+PN VWLTRFI+SLC++A R N AW++LS LMKN   L+AP 
Sbjct: 238 LTEEKIIALISRFSSHGVSPNSVWLTRFISSLCKNA-RANTAWDILSDLMKNKTPLEAPP 296

Query:  25 FNALLTAL 2
           FNALL+ L
Sbjct: 297 FNALLSCL 304

>gi|225454300|ref|XP_002275491.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 765

 Score =  178 bits (450), Expect = 9e-043
 Identities = 111/233 (47%), Positives = 150/233 (64%), Gaps = 13/233 (5%)
 Frame = -1

Query: 682 LSKITRRLGSYSLAISFLHYIQSLNNHREES--LSLVFQSVVEFAGGSEPDSKDKLLSLY 509
           L +ITR LGS + A+ F +++Q+ N+  ++S  LS   ++V E A   EP+S +KLL L+
Sbjct:  90 LLQITRLLGSTAKALKFFNWVQA-NSPCQDSPLLSFTLEAVFEHA-SREPNSHNKLLDLF 147

Query: 508 ATAKEKNIPLTPVAAKLLIRWFSRMGMVNQSVHVYEGLDSSVKNTTQVRNVLIDVLFRNE 329
            T+K   IPL+  AA LLIR F R  MV++S  VY  L  S +  T +RN+LIDVLFR  
Sbjct: 148 KTSKSHKIPLSVNAATLLIRCFGRAQMVDESFLVYNELCPS-RRLTHIRNILIDVLFRKG 206

Query: 328 RVDDAFKVLDEMLCCEEGSVVFRPNRITAEIVFHEVWR----GRLLKEDEVVGLISRFGS 161
           RVDDA  +LDEML   +    F PN  T  IVF  + +    GR + E+E+VGL+S+F  
Sbjct: 207 RVDDALHLLDEML---QPKAEFPPNSNTGHIVFSALSKRDKVGRAVDEEEIVGLVSKFAE 263

Query: 160 HGVAPNCVWLTRFITSLCRDARRTNIAWEVLSGLMKNGALLQAPSFNALLTAL 2
           H V PN +WLT+ I+ LCR   RT+ AW+VL GLMK G +++A S NALLTAL
Sbjct: 264 HEVFPNSIWLTQLISRLCRSG-RTDRAWDVLHGLMKLGGVMEAASCNALLTAL 315

>gi|224130398|ref|XP_002320827.1| predicted protein [Populus trichocarpa]

          Length = 775

 Score =  159 bits (401), Expect = 4e-037
 Identities = 99/230 (43%), Positives = 143/230 (62%), Gaps = 11/230 (4%)
 Frame = -1

Query: 676 KITRRLGSYSLAISFLHYIQSLNNHREES---LSLVFQSVVEFAGGSEPDSKDKLLSLYA 506
           +ITRRL S S A+ FL+Y+Q+ +    ++   LS  FQ++ E A   EPDS   L  LY 
Sbjct:  88 QITRRLPSSSQALKFLNYLQNNSPSSPDTQSLLSYTFQAIFELA-FCEPDSNANLSRLYK 146

Query: 505 TAKEKNIPLTPVAAKLLIRWFSRMGMVNQSVHVYEGLDSSVKNTTQVRNVLIDVLFRNER 326
           T+KE NIPLT  AA  L+R   R  +V +S+ ++  LD SVKN T +RNV + +L R+ R
Sbjct: 147 TSKELNIPLTVNAASFLLRASGRSELVEESLILFNDLDPSVKN-TYLRNVWLSILLRSGR 205

Query: 325 VDDAFKVLDEMLCCEEGSVVFRPNRITAEIVFHEVWR----GRLLKEDEVVGLISRFGSH 158
           V DA KV+DEM    + S   RPN  T +I+F  + +      LL EDE+V L+ +FG H
Sbjct: 206 VKDALKVIDEMFESNDDSNC-RPNDATGDILFSFLLKRERNEELLSEDEIVNLVLKFGEH 264

Query: 157 GVAPNCVWLTRFITSLCRDARRTNIAWEVLSGLMKNGALLQAPSFNALLT 8
           GV  +  W+ R IT LCR+ R+TN  W++ + ++K GA+L++ + N+LLT
Sbjct: 265 GVLISSFWMGRLITRLCRN-RKTNRGWDLFTEMIKLGAVLESAACNSLLT 313

>gi|255541716|ref|XP_002511922.1| pentatricopeptide repeat-containing protein,
        putative [Ricinus communis]

          Length = 346

 Score =  152 bits (382), Expect = 7e-035
 Identities = 99/232 (42%), Positives = 143/232 (61%), Gaps = 12/232 (5%)
 Frame = -1

Query: 682 LSKITRRLGSYSLAISFLHYIQ-SLNNHREESLSLVFQSVVEFAGGSEPDSKDKLLSLYA 506
           L +I RRL S S A+ FL Y+Q +      + LS  FQ++ E A   E DS+  L  LY 
Sbjct:  95 LFQIARRLPSSSQALKFLKYLQNNFPTSNTQHLSSTFQAIFELA-SRENDSRTNLYELYK 153

Query: 505 TAKEKNIPLTPVAAKLLIRWFSRMGMVNQSVHVYEGLDSSVKNTTQVRNVLIDVLFRNER 326
            +KE NIPLT  +A LL+R+F R+G+V +S  ++  L+  +KN T VRN+++D+L R+ R
Sbjct: 154 VSKEWNIPLTINSATLLLRFFGRIGLVEKSFILFNELEHCIKN-THVRNLMVDLLLRDGR 212

Query: 325 VDDAFKVLDEMLCCEEGSVV-FRPNRITAEIVFHEVW---RGRLLKEDEVVGLISRFGSH 158
           VDDAFKVLDEML  + GS    RP+ +T +I+F   W   R +L+  +E+V ++ + G  
Sbjct: 213 VDDAFKVLDEML--QPGSEFDLRPDDVTGDIIF--TWLMKREKLVSPEEIVEVVLKLGKF 268

Query: 157 GVAPNCVWLTRFITSLCRDARRTNIAWEVLSGLMKNGALLQAPSFNALLTAL 2
            V PN + LT+ I  LCR    T+ A+++L  LM  GA ++A   NALLT L
Sbjct: 269 DVFPNSIRLTQTIGQLCRTG-NTSKAYDLLIELMTLGAAIKAAPCNALLTGL 319

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 283,601,666,890
Number of Sequences: 15229318
Number of Extensions: 283601666890
Number of Successful Extensions: 91682752
Number of sequences better than 0.0: 0
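
Note on the scores: each alignment above reports a bit score followed by a raw score in parentheses, for example "307 bits (785)". The two appear to be linked by the standard Karlin-Altschul normalization, bits = (Lambda * raw - ln K) / ln 2, using the gapped Lambda and K printed in the statistics block. The sketch below is only an illustration of that relationship, not part of the BLAST output; the names bit_score, LAMBDA and K are chosen here for the example, and the raw scores are copied from the alignment headers above.

```python
import math

# Gapped statistical parameters as printed in the report footer.
LAMBDA = 0.267
K = 0.041

def bit_score(raw_score: float, lam: float = LAMBDA, k: float = K) -> float:
    """Convert a raw alignment score to a bit score (Karlin-Altschul normalization)."""
    return (lam * raw_score - math.log(k)) / math.log(2)

# Raw scores taken from the "Score = ... bits (raw)" lines of the report.
raw_scores = [785, 762, 757, 753, 450, 401, 382]

for raw in raw_scores:
    # Rounded values can be compared with the bit scores listed in the report.
    print(f"raw {raw:4d} -> {bit_score(raw):7.2f} bits (rounded {round(bit_score(raw))})")
```

Rounding the computed values gives 307, 298, 296, 295, 178, 159 and 152, which agrees with the bit scores in the summary table. Because Lambda and K are printed to only three decimals, the unrounded values differ slightly from whatever the program computed internally.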