Library    |     Search    |     Batch query    |     SNP    |     SSR  

TAIR blast output of UN50845


BLASTX 7.6.2

Query= UN50845 /QuerySize=971
        (970 letters)

Database: TAIR9 protein;
          33,410 sequences; 13,468,323 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

TAIR9_protein||AT1G63830.2 | Symbols:  | proline-rich family pro...    449   2e-126
TAIR9_protein||AT1G63830.1 | Symbols:  | proline-rich family pro...    449   2e-126
TAIR9_protein||AT5G41390.1 | Symbols:  | FUNCTIONS IN: molecular...    415   2e-116
TAIR9_protein||AT4G23470.1 | Symbols:  | hydroxyproline-rich gly...    394   4e-110
TAIR9_protein||AT4G23470.3 | Symbols:  | hydroxyproline-rich gly...    385   3e-107
TAIR9_protein||AT4G23470.2 | Symbols:  | hydroxyproline-rich gly...    322   3e-088
TAIR9_protein||AT5G41390.2 | Symbols:  | FUNCTIONS IN: molecular...    317   1e-086
TAIR9_protein||AT5G17650.1 | Symbols:  | glycine/proline-rich pr...     72   5e-013
TAIR9_protein||AT3G49845.1 | Symbols:  | FUNCTIONS IN: molecular...     63   2e-010
TAIR9_protein||AT5G67600.1 | Symbols:  | unknown protein | chr5:...     58   7e-009
TAIR9_protein||AT4G19200.1 | Symbols:  | proline-rich family pro...     57   2e-008
TAIR9_protein||AT5G45350.1 | Symbols:  | proline-rich family pro...     56   3e-008
TAIR9_protein||AT5G45350.2 | Symbols:  | proline-rich family pro...     56   3e-008

>TAIR9_protein||AT1G63830.2 | Symbols:  | proline-rich family protein |
        chr1:23685408-23687098 FORWARD

          Length = 233

 Score =  449 bits (1153), Expect = 2e-126
 Identities = 207/226 (91%), Positives = 216/226 (95%), Gaps = 2/226 (0%)
 Frame = +3

Query:  78 DKAEKMKLRQDYRNLWHSDLMGTVTADTPYCCLSCVCGPCVSYLLRRRALYNDMSRYTCC 257
           DK +KMKLRQDYRNLWHSDLMGTVTADTPYCC+SC+CGPCVSY+LRRRALYNDMSRYTCC
Sbjct:   6 DKLDKMKLRQDYRNLWHSDLMGTVTADTPYCCISCLCGPCVSYMLRRRALYNDMSRYTCC 65

Query: 258 AGYMPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNCIIGFMFC 437
           AGYMPCSGRCGESKCP+LCLATEVFLCFGNSVASTRFLLQDEFNIQTT+CDNCIIGFMFC
Sbjct:  66 AGYMPCSGRCGESKCPQLCLATEVFLCFGNSVASTRFLLQDEFNIQTTQCDNCIIGFMFC 125

Query: 438 LSQVACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGPQ 617
           LSQVACIFSIVAC+VGS+ELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFG Q
Sbjct: 126 LSQVACIFSIVACIVGSDELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGSQ 185

Query: 618 PMGVPPAQQMSRFDQPA-PPVGYPPASYPPAQGYPPAPYPPAQGYP 752
           PMGVPPAQQMSRFDQP  PPVGYP +  PPAQGYPPA YPP  GYP
Sbjct: 186 PMGVPPAQQMSRFDQPVPPPVGYPQSYPPPAQGYPPASYPP-PGYP 230


 Score =  60 bits (145), Expect = 1e-009
 Identities = 24/31 (77%), Positives = 25/31 (80%)
 Frame = +3

Query: 699 PPAQGYPPAPYPPAQGYPPASYPPPGYPQH* 791
           PP  GYP +  PPAQGYPPASYPPPGYPQH*
Sbjct: 203 PPPVGYPQSYPPPAQGYPPASYPPPGYPQH* 233

>TAIR9_protein||AT1G63830.1 | Symbols:  | proline-rich family protein |
        chr1:23685408-23687098 FORWARD

          Length = 233

 Score =  449 bits (1153), Expect = 2e-126
 Identities = 207/226 (91%), Positives = 216/226 (95%), Gaps = 2/226 (0%)
 Frame = +3

Query:  78 DKAEKMKLRQDYRNLWHSDLMGTVTADTPYCCLSCVCGPCVSYLLRRRALYNDMSRYTCC 257
           DK +KMKLRQDYRNLWHSDLMGTVTADTPYCC+SC+CGPCVSY+LRRRALYNDMSRYTCC
Sbjct:   6 DKLDKMKLRQDYRNLWHSDLMGTVTADTPYCCISCLCGPCVSYMLRRRALYNDMSRYTCC 65

Query: 258 AGYMPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNCIIGFMFC 437
           AGYMPCSGRCGESKCP+LCLATEVFLCFGNSVASTRFLLQDEFNIQTT+CDNCIIGFMFC
Sbjct:  66 AGYMPCSGRCGESKCPQLCLATEVFLCFGNSVASTRFLLQDEFNIQTTQCDNCIIGFMFC 125

Query: 438 LSQVACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGPQ 617
           LSQVACIFSIVAC+VGS+ELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFG Q
Sbjct: 126 LSQVACIFSIVACIVGSDELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGSQ 185

Query: 618 PMGVPPAQQMSRFDQPA-PPVGYPPASYPPAQGYPPAPYPPAQGYP 752
           PMGVPPAQQMSRFDQP  PPVGYP +  PPAQGYPPA YPP  GYP
Sbjct: 186 PMGVPPAQQMSRFDQPVPPPVGYPQSYPPPAQGYPPASYPP-PGYP 230


 Score =  60 bits (145), Expect = 1e-009
 Identities = 24/31 (77%), Positives = 25/31 (80%)
 Frame = +3

Query: 699 PPAQGYPPAPYPPAQGYPPASYPPPGYPQH* 791
           PP  GYP +  PPAQGYPPASYPPPGYPQH*
Sbjct: 203 PPPVGYPQSYPPPAQGYPPASYPPPGYPQH* 233

>TAIR9_protein||AT5G41390.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; CONTAINS InterPro
        DOMAIN/s: Protein of unknown function Cys-rich (InterPro:IPR006461);
        BEST Arabidopsis thaliana protein match is: proline-rich family protein
        (TAIR:AT1G63830.2); Has 12019 Blast hits to 5816 proteins in 448
        species: Archae - 6; Bacteria - 1151; Metazoa - 4980; Fungi - 1500;
        Plants - 2605; Viruses - 338; Other Eukaryotes - 1439 (source: NCBI
        BLink). | chr5:16565576-16567253 FORWARD

          Length = 265

 Score =  415 bits (1066), Expect = 2e-116
 Identities = 190/236 (80%), Positives = 207/236 (87%), Gaps = 6/236 (2%)
 Frame = +3

Query:  78 DKAEKMKLRQDYRNLWHSDLMGTVTADTPYCCLSCVCGPCVSYLLRRRALYNDMSRYTCC 257
           DK +KM+LRQ YRNLWHSDLMGTV+ADTPYC  SC+CGPCVSYLLR+RALYNDMSRYTCC
Sbjct:   6 DKLDKMQLRQSYRNLWHSDLMGTVSADTPYCFFSCLCGPCVSYLLRKRALYNDMSRYTCC 65

Query: 258 AGYMPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNCIIGFMFC 437
            GYMPCSG+CGESKCP+ CLATEV LCFGNSVASTRF+LQDEFNI TTKCDNCIIGFMFC
Sbjct:  66 GGYMPCSGKCGESKCPQFCLATEVCLCFGNSVASTRFMLQDEFNIHTTKCDNCIIGFMFC 125

Query: 438 LSQVACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGPQ 617
           L+Q+ACIFS+VAC+VGS+ELSEASQ+LSC ADMVYCTVCACMQTQHK+EMDKRDG+  PQ
Sbjct: 126 LNQIACIFSLVACIVGSDELSEASQLLSCLADMVYCTVCACMQTQHKIEMDKRDGLISPQ 185

Query: 618 PMGVPPAQQMSRFDQPAPPVGYPPASYPPAQGYPPAPYP-PAQGYPPA-SYPPPGY 779
           PM VPPAQQMSR DQP PP     A YPPA GYP   YP P  GYPPA  YPPPG+
Sbjct: 186 PMSVPPAQQMSRIDQPVPPY----AGYPPATGYPQHYYPQPGHGYPPAPGYPPPGH 237

>TAIR9_protein||AT4G23470.1 | Symbols:  | hydroxyproline-rich glycoprotein
        family protein | chr4:12249289-12251079 FORWARD

          Length = 256

 Score =  394 bits (1012), Expect = 4e-110
 Identities = 178/234 (76%), Positives = 199/234 (85%), Gaps = 7/234 (2%)
 Frame = +3

Query:  87 EKMKLRQDYRNLWHSDLMGTVTADTPYCCLSCVCGPCVSYLLRRRALYNDMSRYTCCAGY 266
           EKM+LR+++RN+WH+DL  ++  DTPYCC +  C PC SYLLR+RALY+DMSRY CCAGY
Sbjct:   7 EKMELRKNFRNVWHTDLTHSIQNDTPYCCFALWCAPCASYLLRKRALYDDMSRYVCCAGY 66

Query: 267 MPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNCIIGFMFCLSQ 446
           MPCSGRCGE+KCP+LCLATEVF CF NSVASTRFLLQDEF IQTTKCDNCIIGFM CLSQ
Sbjct:  67 MPCSGRCGEAKCPQLCLATEVFCCFANSVASTRFLLQDEFQIQTTKCDNCIIGFMVCLSQ 126

Query: 447 VACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGPQPMG 626
           VACIFSIVAC+VG +ELSEASQIL+CC+DMVYCTVCACMQTQHK+EMDKRDG FGPQPM 
Sbjct: 127 VACIFSIVACIVGMDELSEASQILTCCSDMVYCTVCACMQTQHKMEMDKRDGKFGPQPMA 186

Query: 627 VPPAQQMSRFDQPAPPVGYPPASYPPAQGYPPAPYPPAQGYPPASYPPPGYPQH 788
           VPPAQQMSRFDQ  PP       YPP QGYPP+ YP    +PP  YPP GYPQ+
Sbjct: 187 VPPAQQMSRFDQATPPA----VGYPPQQGYPPSGYPQ---HPPQGYPPSGYPQN 233

>TAIR9_protein||AT4G23470.3 | Symbols:  | hydroxyproline-rich glycoprotein
        family protein | chr4:12249289-12251079 FORWARD

          Length = 234

 Score =  385 bits (987), Expect = 3e-107
 Identities = 174/228 (76%), Positives = 194/228 (85%), Gaps = 7/228 (3%)
 Frame = +3

Query:  87 EKMKLRQDYRNLWHSDLMGTVTADTPYCCLSCVCGPCVSYLLRRRALYNDMSRYTCCAGY 266
           EKM+LR+++RN+WH+DL  ++  DTPYCC +  C PC SYLLR+RALY+DMSRY CCAGY
Sbjct:   7 EKMELRKNFRNVWHTDLTHSIQNDTPYCCFALWCAPCASYLLRKRALYDDMSRYVCCAGY 66

Query: 267 MPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNCIIGFMFCLSQ 446
           MPCSGRCGE+KCP+LCLATEVF CF NSVASTRFLLQDEF IQTTKCDNCIIGFM CLSQ
Sbjct:  67 MPCSGRCGEAKCPQLCLATEVFCCFANSVASTRFLLQDEFQIQTTKCDNCIIGFMVCLSQ 126

Query: 447 VACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGPQPMG 626
           VACIFSIVAC+VG +ELSEASQIL+CC+DMVYCTVCACMQTQHK+EMDKRDG FGPQPM 
Sbjct: 127 VACIFSIVACIVGMDELSEASQILTCCSDMVYCTVCACMQTQHKMEMDKRDGKFGPQPMA 186

Query: 627 VPPAQQMSRFDQPAPPVGYPPASYPPAQGYPPAPYPPAQGYPPASYPP 770
           VPPAQQMSRFDQ  PP       YPP QGYPP+ Y     YPP +YPP
Sbjct: 187 VPPAQQMSRFDQATPPA----VGYPPQQGYPPSAY---SQYPPGAYPP 227

>TAIR9_protein||AT4G23470.2 | Symbols:  | hydroxyproline-rich glycoprotein
        family protein | chr4:12249867-12251079 FORWARD

          Length = 200

 Score =  322 bits (824), Expect = 3e-088
 Identities = 148/184 (80%), Positives = 158/184 (85%), Gaps = 7/184 (3%)
 Frame = +3

Query: 237 MSRYTCCAGYMPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNC 416
           MSRY CCAGYMPCSGRCGE+KCP+LCLATEVF CF NSVASTRFLLQDEF IQTTKCDNC
Sbjct:   1 MSRYVCCAGYMPCSGRCGEAKCPQLCLATEVFCCFANSVASTRFLLQDEFQIQTTKCDNC 60

Query: 417 IIGFMFCLSQVACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKR 596
           IIGFM CLSQVACIFSIVAC+VG +ELSEASQIL+CC+DMVYCTVCACMQTQHK+EMDKR
Sbjct:  61 IIGFMVCLSQVACIFSIVACIVGMDELSEASQILTCCSDMVYCTVCACMQTQHKMEMDKR 120

Query: 597 DGVFGPQPMGVPPAQQMSRFDQPAPPVGYPPASYPPAQGYPPAPYPPAQGYPPASYPPPG 776
           DG FGPQPM VPPAQQMSRFDQ  PP       YPP QGYPP+ YP    +PP  YPP G
Sbjct: 121 DGKFGPQPMAVPPAQQMSRFDQATPPA----VGYPPQQGYPPSGYPQ---HPPQGYPPSG 173

Query: 777 YPQH 788
           YPQ+
Sbjct: 174 YPQN 177

>TAIR9_protein||AT5G41390.2 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; CONTAINS InterPro
        DOMAIN/s: Protein of unknown function Cys-rich (InterPro:IPR006461);
        BEST Arabidopsis thaliana protein match is: proline-rich family protein
        (TAIR:AT1G63830.2); Has 11998 Blast hits to 5796 proteins in 448
        species: Archae - 6; Bacteria - 1151; Metazoa - 4966; Fungi - 1500;
        Plants - 2602; Viruses - 338; Other Eukaryotes - 1435 (source: NCBI
        BLink). | chr5:16566179-16567253 FORWARD

          Length = 207

 Score =  317 bits (810), Expect = 1e-086
 Identities = 146/183 (79%), Positives = 158/183 (86%), Gaps = 6/183 (3%)
 Frame = +3

Query: 237 MSRYTCCAGYMPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNC 416
           MSRYTCC GYMPCSG+CGESKCP+ CLATEV LCFGNSVASTRF+LQDEFNI TTKCDNC
Sbjct:   1 MSRYTCCGGYMPCSGKCGESKCPQFCLATEVCLCFGNSVASTRFMLQDEFNIHTTKCDNC 60

Query: 417 IIGFMFCLSQVACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKR 596
           IIGFMFCL+Q+ACIFS+VAC+VGS+ELSEASQ+LSC ADMVYCTVCACMQTQHK+EMDKR
Sbjct:  61 IIGFMFCLNQIACIFSLVACIVGSDELSEASQLLSCLADMVYCTVCACMQTQHKIEMDKR 120

Query: 597 DGVFGPQPMGVPPAQQMSRFDQPAPPVGYPPASYPPAQGYPPAPYP-PAQGYPPA-SYPP 770
           DG+  PQPM VPPAQQMSR DQP PP     A YPPA GYP   YP P  GYPPA  YPP
Sbjct: 121 DGLISPQPMSVPPAQQMSRIDQPVPPY----AGYPPATGYPQHYYPQPGHGYPPAPGYPP 176

Query: 771 PGY 779
           PG+
Sbjct: 177 PGH 179

>TAIR9_protein||AT5G17650.1 | Symbols:  | glycine/proline-rich protein |
        chr5:5817000-5817763 REVERSE

          Length = 174

 Score =  72 bits (175), Expect = 5e-013
 Identities = 30/45 (66%), Positives = 32/45 (71%), Gaps = 1/45 (2%)
 Frame = +3

Query: 654 FDQPAPPVGYPPASYPPAQGYPPAPYPPAQGYPPASYPPPGYPQH 788
           +  P PP GYPP +YPP  GYPPA YPPA GYPPA YP  GYP H
Sbjct:  45 YPPPPPPHGYPPVAYPPHGGYPPAGYPPA-GYPPAGYPAHGYPSH 88

>TAIR9_protein||AT3G49845.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        cellular_component unknown; EXPRESSED IN: root; CONTAINS InterPro
        DOMAIN/s: XYPPX repeat (InterPro:IPR006031); Has 14038 Blast hits to
        7746 proteins in 541 species: Archae - 8; Bacteria - 1083; Metazoa -
        5225; Fungi - 1793; Plants - 3748; Viruses - 424; Other Eukaryotes -
        1757 (source: NCBI BLink). | chr3:18487339-18487965 FORWARD

          Length = 125

 Score =  63 bits (152), Expect = 2e-010
 Identities = 32/60 (53%), Positives = 33/60 (55%), Gaps = 15/60 (25%)
 Frame = +3

Query: 612 PQPMGVPPAQQMSRFDQPAPPVGYPPASYPPAQGYPPAPYPPAQGYPPASYPPP--GYPQ 785
           P P G PP +            GYPPA YPP  GYPP  YP A GYPPA YPPP  GY Q
Sbjct:  13 PPPQGYPPKE------------GYPPAGYPPPAGYPPPQYPQA-GYPPAGYPPPQQGYGQ 59


 Score =  61 bits (147), Expect = 8e-010
 Identities = 26/46 (56%), Positives = 27/46 (58%)
 Frame = +3

Query: 645 MSRFDQPAPPVGYPPASYPPAQGYPPAPYPPAQGYPPASYPPPGYP 782
           MS  D   P    PP  YPP +GYPPA YPP  GYPP  YP  GYP
Sbjct:   1 MSYQDPQHPVSAPPPQGYPPKEGYPPAGYPPPAGYPPPQYPQAGYP 46

>TAIR9_protein||AT5G67600.1 | Symbols:  | unknown protein |
        chr5:26959754-26960226 REVERSE

          Length = 83

 Score =  58 bits (139), Expect = 7e-009
 Identities = 29/43 (67%), Gaps = 5/43 (11%)
 Frame = +3

Query: 669 PPVGYPPA-SYPPAQGYPPAPYPP---AQGYPPASYPPPGYPQ 785
           PP GYPP   YPPA GYPPA YPP   AQGYP   YPPP Y Q
Sbjct:  13 PPQGYPPKDGYPPA-GYPPAGYPPPGYAQGYPAQGYPPPQYSQ 54

>TAIR9_protein||AT4G19200.1 | Symbols:  | proline-rich family protein |
        chr4:10499277-10500390 FORWARD

          Length = 180

 Score =  57 bits (136), Expect = 2e-008
 Identities = 28/43 (65%), Positives = 29/43 (67%), Gaps = 4/43 (9%)
 Frame = +3

Query: 663 PAPPVGYPPASYPPAQGYPPA-PYPPAQGYPPASYP--PPGYP 782
           P    GYPP  YPP QGYPPA  YPPA GYPP +YP  P GYP
Sbjct:  26 PPAQGGYPPQGYPPQQGYPPAGGYPPA-GYPPGAYPAAPGGYP 67

>TAIR9_protein||AT5G45350.1 | Symbols:  | proline-rich family protein |
        chr5:18382100-18382854 REVERSE

          Length = 178

 Score =  56 bits (133), Expect = 3e-008
 Identities = 30/47 (63%), Positives = 32/47 (68%), Gaps = 10/47 (21%)
 Frame = +3

Query: 669 PPVGY-PPASYPPA----QGYPPAP--YPPAQGYPPASYPPP--GYP 782
           PP GY PP +YPPA    QGYPP P  YPPA GYPP +YPP   GYP
Sbjct:  17 PPAGYPPPGAYPPAGYPQQGYPPPPGAYPPA-GYPPGAYPPAPGGYP 62

>TAIR9_protein||AT5G45350.2 | Symbols:  | proline-rich family protein |
        chr5:18382100-18382854 REVERSE

          Length = 178

 Score =  56 bits (133), Expect = 3e-008
 Identities = 30/47 (63%), Positives = 32/47 (68%), Gaps = 10/47 (21%)
 Frame = +3

Query: 669 PPVGY-PPASYPPA----QGYPPAP--YPPAQGYPPASYPPP--GYP 782
           PP GY PP +YPPA    QGYPP P  YPPA GYPP +YPP   GYP
Sbjct:  17 PPAGYPPPGAYPPAGYPQQGYPPPPGAYPPA-GYPPGAYPPAPGGYP 62

  Database: TAIR9 protein
    Posted date:  Wed Jul 08 15:16:08 2009
  Number of letters in database: 13,468,323
  Number of sequences in database:  33,410

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 24,087,338,472
Number of Sequences: 33410
Number of Extensions: 24087338472
Number of Successful Extensions: 761288629
Number of sequences better than 0.0: 0