TAIR blast output of UN09652


BLASTX 7.6.2

Query= UN09652 /QuerySize=1032
        (1031 letters)

Database: TAIR9 protein;
          33,410 sequences; 13,468,323 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

TAIR9_protein||AT1G63830.2 | Symbols:  | proline-rich family pro...    462   2e-130
TAIR9_protein||AT1G63830.1 | Symbols:  | proline-rich family pro...    462   2e-130
TAIR9_protein||AT5G41390.1 | Symbols:  | FUNCTIONS IN: molecular...    401   4e-112
TAIR9_protein||AT4G23470.1 | Symbols:  | hydroxyproline-rich gly...    386   2e-107
TAIR9_protein||AT4G23470.3 | Symbols:  | hydroxyproline-rich gly...    382   2e-106
TAIR9_protein||AT4G23470.2 | Symbols:  | hydroxyproline-rich gly...    314   1e-085
TAIR9_protein||AT5G41390.2 | Symbols:  | FUNCTIONS IN: molecular...    303   2e-082
TAIR9_protein||AT5G17650.1 | Symbols:  | glycine/proline-rich pr...     55   5e-008
TAIR9_protein||AT3G49845.1 | Symbols:  | FUNCTIONS IN: molecular...     53   3e-007
TAIR9_protein||AT5G67600.1 | Symbols:  | unknown protein | chr5:...     48   8e-006

>TAIR9_protein||AT1G63830.2 | Symbols:  | proline-rich family protein |
        chr1:23685408-23687098 FORWARD

          Length = 233

 Score =  462 bits (1187), Expect = 2e-130
 Identities = 210/228 (92%), Positives = 219/228 (96%), Gaps = 1/228 (0%)
 Frame = +3

Query: 117 DKAEKMKLRQDYRNLWHSDLMGTVTADTPYCCLSCVCGPCVSYLLRRRALYNDMSRYTCC 296
           DK +KMKLRQDYRNLWHSDLMGTVTADTPYCC+SC+CGPCVSY+LRRRALYNDMSRYTCC
Sbjct:   6 DKLDKMKLRQDYRNLWHSDLMGTVTADTPYCCISCLCGPCVSYMLRRRALYNDMSRYTCC 65

Query: 297 AGYMPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNCIIGFMFC 476
           AGYMPCSGRCGESKCP+LCLATEVFLCFGNSVASTRFLLQDEFNIQTT+CDNCIIGFMFC
Sbjct:  66 AGYMPCSGRCGESKCPQLCLATEVFLCFGNSVASTRFLLQDEFNIQTTQCDNCIIGFMFC 125

Query: 477 LSQVACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGPQ 656
           LSQVACIFSIVAC+VGS+ELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFG Q
Sbjct: 126 LSQVACIFSIVACIVGSDELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGSQ 185

Query: 657 PMGVPPAQQMSRFDQPA-PPVGYPPASYPPAQGYPPAPYPPPGYPQH* 797
           PMGVPPAQQMSRFDQP  PPVGYP +  PPAQGYPPA YPPPGYPQH*
Sbjct: 186 PMGVPPAQQMSRFDQPVPPPVGYPQSYPPPAQGYPPASYPPPGYPQH* 233

>TAIR9_protein||AT1G63830.1 | Symbols:  | proline-rich family protein |
        chr1:23685408-23687098 FORWARD

          Length = 233

 Score =  462 bits (1187), Expect = 2e-130
 Identities = 210/228 (92%), Positives = 219/228 (96%), Gaps = 1/228 (0%)
 Frame = +3

Query: 117 DKAEKMKLRQDYRNLWHSDLMGTVTADTPYCCLSCVCGPCVSYLLRRRALYNDMSRYTCC 296
           DK +KMKLRQDYRNLWHSDLMGTVTADTPYCC+SC+CGPCVSY+LRRRALYNDMSRYTCC
Sbjct:   6 DKLDKMKLRQDYRNLWHSDLMGTVTADTPYCCISCLCGPCVSYMLRRRALYNDMSRYTCC 65

Query: 297 AGYMPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNCIIGFMFC 476
           AGYMPCSGRCGESKCP+LCLATEVFLCFGNSVASTRFLLQDEFNIQTT+CDNCIIGFMFC
Sbjct:  66 AGYMPCSGRCGESKCPQLCLATEVFLCFGNSVASTRFLLQDEFNIQTTQCDNCIIGFMFC 125

Query: 477 LSQVACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGPQ 656
           LSQVACIFSIVAC+VGS+ELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFG Q
Sbjct: 126 LSQVACIFSIVACIVGSDELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGSQ 185

Query: 657 PMGVPPAQQMSRFDQPA-PPVGYPPASYPPAQGYPPAPYPPPGYPQH* 797
           PMGVPPAQQMSRFDQP  PPVGYP +  PPAQGYPPA YPPPGYPQH*
Sbjct: 186 PMGVPPAQQMSRFDQPVPPPVGYPQSYPPPAQGYPPASYPPPGYPQH* 233

>TAIR9_protein||AT5G41390.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; CONTAINS InterPro
        DOMAIN/s: Protein of unknown function Cys-rich (InterPro:IPR006461);
        BEST Arabidopsis thaliana protein match is: proline-rich family protein
        (TAIR:AT1G63830.2); Has 12019 Blast hits to 5816 proteins in 448
        species: Archae - 6; Bacteria - 1151; Metazoa - 4980; Fungi - 1500;
        Plants - 2605; Viruses - 338; Other Eukaryotes - 1439 (source: NCBI
        BLink). | chr5:16565576-16567253 FORWARD

          Length = 265

 Score =  401 bits (1030), Expect = 4e-112
 Identities = 181/223 (81%), Positives = 198/223 (88%), Gaps = 4/223 (1%)
 Frame = +3

Query: 117 DKAEKMKLRQDYRNLWHSDLMGTVTADTPYCCLSCVCGPCVSYLLRRRALYNDMSRYTCC 296
           DK +KM+LRQ YRNLWHSDLMGTV+ADTPYC  SC+CGPCVSYLLR+RALYNDMSRYTCC
Sbjct:   6 DKLDKMQLRQSYRNLWHSDLMGTVSADTPYCFFSCLCGPCVSYLLRKRALYNDMSRYTCC 65

Query: 297 AGYMPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNCIIGFMFC 476
            GYMPCSG+CGESKCP+ CLATEV LCFGNSVASTRF+LQDEFNI TTKCDNCIIGFMFC
Sbjct:  66 GGYMPCSGKCGESKCPQFCLATEVCLCFGNSVASTRFMLQDEFNIHTTKCDNCIIGFMFC 125

Query: 477 LSQVACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGPQ 656
           L+Q+ACIFS+VAC+VGS+ELSEASQ+LSC ADMVYCTVCACMQTQHK+EMDKRDG+  PQ
Sbjct: 126 LNQIACIFSLVACIVGSDELSEASQLLSCLADMVYCTVCACMQTQHKIEMDKRDGLISPQ 185

Query: 657 PMGVPPAQQMSRFDQPAPPVGYPPASYPPAQGYPPAPYPPPGY 785
           PM VPPAQQMSR DQP PP     A YPPA GYP   YP PG+
Sbjct: 186 PMSVPPAQQMSRIDQPVPPY----AGYPPATGYPQHYYPQPGH 224

>TAIR9_protein||AT4G23470.1 | Symbols:  | hydroxyproline-rich glycoprotein
        family protein | chr4:12249289-12251079 FORWARD

          Length = 256

 Score =  386 bits (990), Expect = 2e-107
 Identities = 176/227 (77%), Positives = 195/227 (85%), Gaps = 4/227 (1%)
 Frame = +3

Query: 126 EKMKLRQDYRNLWHSDLMGTVTADTPYCCLSCVCGPCVSYLLRRRALYNDMSRYTCCAGY 305
           EKM+LR+++RN+WH+DL  ++  DTPYCC +  C PC SYLLR+RALY+DMSRY CCAGY
Sbjct:   7 EKMELRKNFRNVWHTDLTHSIQNDTPYCCFALWCAPCASYLLRKRALYDDMSRYVCCAGY 66

Query: 306 MPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNCIIGFMFCLSQ 485
           MPCSGRCGE+KCP+LCLATEVF CF NSVASTRFLLQDEF IQTTKCDNCIIGFM CLSQ
Sbjct:  67 MPCSGRCGEAKCPQLCLATEVFCCFANSVASTRFLLQDEFQIQTTKCDNCIIGFMVCLSQ 126

Query: 486 VACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGPQPMG 665
           VACIFSIVAC+VG +ELSEASQIL+CC+DMVYCTVCACMQTQHK+EMDKRDG FGPQPM 
Sbjct: 127 VACIFSIVACIVGMDELSEASQILTCCSDMVYCTVCACMQTQHKMEMDKRDGKFGPQPMA 186

Query: 666 VPPAQQMSRFDQPAPP-VGYPPASYPPAQGY---PPAPYPPPGYPQH 794
           VPPAQQMSRFDQ  PP VGYPP    P  GY   PP  YPP GYPQ+
Sbjct: 187 VPPAQQMSRFDQATPPAVGYPPQQGYPPSGYPQHPPQGYPPSGYPQN 233

>TAIR9_protein||AT4G23470.3 | Symbols:  | hydroxyproline-rich glycoprotein
        family protein | chr4:12249289-12251079 FORWARD

          Length = 234

 Score =  382 bits (980), Expect = 2e-106
 Identities = 176/227 (77%), Positives = 196/227 (86%), Gaps = 5/227 (2%)
 Frame = +3

Query: 126 EKMKLRQDYRNLWHSDLMGTVTADTPYCCLSCVCGPCVSYLLRRRALYNDMSRYTCCAGY 305
           EKM+LR+++RN+WH+DL  ++  DTPYCC +  C PC SYLLR+RALY+DMSRY CCAGY
Sbjct:   7 EKMELRKNFRNVWHTDLTHSIQNDTPYCCFALWCAPCASYLLRKRALYDDMSRYVCCAGY 66

Query: 306 MPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNCIIGFMFCLSQ 485
           MPCSGRCGE+KCP+LCLATEVF CF NSVASTRFLLQDEF IQTTKCDNCIIGFM CLSQ
Sbjct:  67 MPCSGRCGEAKCPQLCLATEVFCCFANSVASTRFLLQDEFQIQTTKCDNCIIGFMVCLSQ 126

Query: 486 VACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGPQPMG 665
           VACIFSIVAC+VG +ELSEASQIL+CC+DMVYCTVCACMQTQHK+EMDKRDG FGPQPM 
Sbjct: 127 VACIFSIVACIVGMDELSEASQILTCCSDMVYCTVCACMQTQHKMEMDKRDGKFGPQPMA 186

Query: 666 VPPAQQMSRFDQPAPP-VGYPP-ASYPPA--QGYPPAPY-PPPGYPQ 791
           VPPAQQMSRFDQ  PP VGYPP   YPP+    YPP  Y PPP YP+
Sbjct: 187 VPPAQQMSRFDQATPPAVGYPPQQGYPPSAYSQYPPGAYPPPPAYPK 233

>TAIR9_protein||AT4G23470.2 | Symbols:  | hydroxyproline-rich glycoprotein
        family protein | chr4:12249867-12251079 FORWARD

          Length = 200

 Score =  314 bits (802), Expect = 1e-085
 Identities = 146/177 (82%), Positives = 154/177 (87%), Gaps = 4/177 (2%)
 Frame = +3

Query: 276 MSRYTCCAGYMPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNC 455
           MSRY CCAGYMPCSGRCGE+KCP+LCLATEVF CF NSVASTRFLLQDEF IQTTKCDNC
Sbjct:   1 MSRYVCCAGYMPCSGRCGEAKCPQLCLATEVFCCFANSVASTRFLLQDEFQIQTTKCDNC 60

Query: 456 IIGFMFCLSQVACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKR 635
           IIGFM CLSQVACIFSIVAC+VG +ELSEASQIL+CC+DMVYCTVCACMQTQHK+EMDKR
Sbjct:  61 IIGFMVCLSQVACIFSIVACIVGMDELSEASQILTCCSDMVYCTVCACMQTQHKMEMDKR 120

Query: 636 DGVFGPQPMGVPPAQQMSRFDQPAPP-VGYPPASYPPAQGY---PPAPYPPPGYPQH 794
           DG FGPQPM VPPAQQMSRFDQ  PP VGYPP    P  GY   PP  YPP GYPQ+
Sbjct: 121 DGKFGPQPMAVPPAQQMSRFDQATPPAVGYPPQQGYPPSGYPQHPPQGYPPSGYPQN 177

>TAIR9_protein||AT5G41390.2 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; CONTAINS InterPro
        DOMAIN/s: Protein of unknown function Cys-rich (InterPro:IPR006461);
        BEST Arabidopsis thaliana protein match is: proline-rich family protein
        (TAIR:AT1G63830.2); Has 11998 Blast hits to 5796 proteins in 448
        species: Archae - 6; Bacteria - 1151; Metazoa - 4966; Fungi - 1500;
        Plants - 2602; Viruses - 338; Other Eukaryotes - 1435 (source: NCBI
        BLink). | chr5:16566179-16567253 FORWARD

          Length = 207

 Score =  303 bits (774), Expect = 2e-082
 Identities = 137/170 (80%), Positives = 149/170 (87%), Gaps = 4/170 (2%)
 Frame = +3

Query: 276 MSRYTCCAGYMPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNC 455
           MSRYTCC GYMPCSG+CGESKCP+ CLATEV LCFGNSVASTRF+LQDEFNI TTKCDNC
Sbjct:   1 MSRYTCCGGYMPCSGKCGESKCPQFCLATEVCLCFGNSVASTRFMLQDEFNIHTTKCDNC 60

Query: 456 IIGFMFCLSQVACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKR 635
           IIGFMFCL+Q+ACIFS+VAC+VGS+ELSEASQ+LSC ADMVYCTVCACMQTQHK+EMDKR
Sbjct:  61 IIGFMFCLNQIACIFSLVACIVGSDELSEASQLLSCLADMVYCTVCACMQTQHKIEMDKR 120

Query: 636 DGVFGPQPMGVPPAQQMSRFDQPAPPVGYPPASYPPAQGYPPAPYPPPGY 785
           DG+  PQPM VPPAQQMSR DQP PP     A YPPA GYP   YP PG+
Sbjct: 121 DGLISPQPMSVPPAQQMSRIDQPVPPY----AGYPPATGYPQHYYPQPGH 166

>TAIR9_protein||AT5G17650.1 | Symbols:  | glycine/proline-rich protein |
        chr5:5817000-5817763 REVERSE

          Length = 174

 Score =  55 bits (132), Expect = 5e-008
 Identities = 21/32 (65%), Positives = 23/32 (71%)
 Frame = +3

Query: 693 FDQPAPPVGYPPASYPPAQGYPPAPYPPPGYP 788
           +  P PP GYPP +YPP  GYPPA YPP GYP
Sbjct:  45 YPPPPPPHGYPPVAYPPHGGYPPAGYPPAGYP 76

>TAIR9_protein||AT3G49845.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        cellular_component unknown; EXPRESSED IN: root; CONTAINS InterPro
        DOMAIN/s: XYPPX repeat (InterPro:IPR006031); Has 14038 Blast hits to
        7746 proteins in 541 species: Archae - 8; Bacteria - 1083; Metazoa -
        5225; Fungi - 1793; Plants - 3748; Viruses - 424; Other Eukaryotes -
        1757 (source: NCBI BLink). | chr3:18487339-18487965 FORWARD

          Length = 125

 Score =  53 bits (125), Expect = 3e-007
 Identities = 28/52 (53%), Positives = 29/52 (55%), Gaps = 4/52 (7%)
 Frame = +3

Query: 642 VFGPQPMGVPPAQQMSRFDQPAPPVGYPPASYPPAQGYPPAPYPPP--GYPQ 791
           V  P P G PP +       P PP GYPP  YP A GYPPA YPPP  GY Q
Sbjct:  10 VSAPPPQGYPPKEGYPPAGYP-PPAGYPPPQYPQA-GYPPAGYPPPQQGYGQ 59


 Score =  52 bits (123), Expect = 5e-007
 Identities = 26/52 (50%), Gaps = 13/52 (25%)
 Frame = +3

Query: 651 PQPMGVPPAQQMSRFDQPAPPVGYPPASYPP-----AQGYPPAPYPPPGYPQ 791
           P P G PP Q         P  GYPPA YPP      QGYP   YPPP YPQ
Sbjct:  30 PPPAGYPPPQY--------PQAGYPPAGYPPPQQGYGQGYPAQGYPPPQYPQ 73

>TAIR9_protein||AT5G67600.1 | Symbols:  | unknown protein |
        chr5:26959754-26960226 REVERSE

          Length = 83

 Score =  48 bits (113), Expect = 8e-006
 Identities = 22/29 (75%), Gaps = 2/29 (6%)
 Frame = +3

Query: 708 PPVGYPPA-SYPPAQGYPPAPYPPPGYPQ 791
           PP GYPP   YPPA GYPPA YPPPGY Q
Sbjct:  13 PPQGYPPKDGYPPA-GYPPAGYPPPGYAQ 40

  Database: TAIR9 protein
    Posted date:  Wed Jul 08 15:16:08 2009
  Number of letters in database: 13,468,323
  Number of sequences in database:  33,410

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,346,187,961
Number of Sequences: 33410
Number of Extensions: 5346187961
Number of Successful Extensions: 195116757
Number of sequences better than 0.0: 0