BLASTX 7.6.2
Query= UN09652 /QuerySize=1032
(1031 letters)
Database: TAIR9 protein;
33,410 sequences; 13,468,323 total letters
                                                                   Score   E
Sequences producing significant alignments:                        (bits)  Value
TAIR9_protein||AT1G63830.2 | Symbols: | proline-rich family pro...   462   2e-130
TAIR9_protein||AT1G63830.1 | Symbols: | proline-rich family pro...   462   2e-130
TAIR9_protein||AT5G41390.1 | Symbols: | FUNCTIONS IN: molecular...   401   4e-112
TAIR9_protein||AT4G23470.1 | Symbols: | hydroxyproline-rich gly...   386   2e-107
TAIR9_protein||AT4G23470.3 | Symbols: | hydroxyproline-rich gly...   382   2e-106
TAIR9_protein||AT4G23470.2 | Symbols: | hydroxyproline-rich gly...   314   1e-085
TAIR9_protein||AT5G41390.2 | Symbols: | FUNCTIONS IN: molecular...   303   2e-082
TAIR9_protein||AT5G17650.1 | Symbols: | glycine/proline-rich pr...    55   5e-008
TAIR9_protein||AT3G49845.1 | Symbols: | FUNCTIONS IN: molecular...    53   3e-007
TAIR9_protein||AT5G67600.1 | Symbols: | unknown protein | chr5:...    48   8e-006
>TAIR9_protein||AT1G63830.2 | Symbols: | proline-rich family protein |
chr1:23685408-23687098 FORWARD
Length = 233
Score = 462 bits (1187), Expect = 2e-130
Identities = 210/228 (92%), Positives = 219/228 (96%), Gaps = 1/228 (0%)
Frame = +3
Query: 117 DKAEKMKLRQDYRNLWHSDLMGTVTADTPYCCLSCVCGPCVSYLLRRRALYNDMSRYTCC 296
DK +KMKLRQDYRNLWHSDLMGTVTADTPYCC+SC+CGPCVSY+LRRRALYNDMSRYTCC
Sbjct: 6 DKLDKMKLRQDYRNLWHSDLMGTVTADTPYCCISCLCGPCVSYMLRRRALYNDMSRYTCC 65
Query: 297 AGYMPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNCIIGFMFC 476
AGYMPCSGRCGESKCP+LCLATEVFLCFGNSVASTRFLLQDEFNIQTT+CDNCIIGFMFC
Sbjct: 66 AGYMPCSGRCGESKCPQLCLATEVFLCFGNSVASTRFLLQDEFNIQTTQCDNCIIGFMFC 125
Query: 477 LSQVACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGPQ 656
LSQVACIFSIVAC+VGS+ELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFG Q
Sbjct: 126 LSQVACIFSIVACIVGSDELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGSQ 185
Query: 657 PMGVPPAQQMSRFDQPA-PPVGYPPASYPPAQGYPPAPYPPPGYPQH* 797
PMGVPPAQQMSRFDQP PPVGYP + PPAQGYPPA YPPPGYPQH*
Sbjct: 186 PMGVPPAQQMSRFDQPVPPPVGYPQSYPPPAQGYPPASYPPPGYPQH* 233
>TAIR9_protein||AT1G63830.1 | Symbols: | proline-rich family protein |
chr1:23685408-23687098 FORWARD
Length = 233
Score = 462 bits (1187), Expect = 2e-130
Identities = 210/228 (92%), Positives = 219/228 (96%), Gaps = 1/228 (0%)
Frame = +3
Query: 117 DKAEKMKLRQDYRNLWHSDLMGTVTADTPYCCLSCVCGPCVSYLLRRRALYNDMSRYTCC 296
DK +KMKLRQDYRNLWHSDLMGTVTADTPYCC+SC+CGPCVSY+LRRRALYNDMSRYTCC
Sbjct: 6 DKLDKMKLRQDYRNLWHSDLMGTVTADTPYCCISCLCGPCVSYMLRRRALYNDMSRYTCC 65
Query: 297 AGYMPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNCIIGFMFC 476
AGYMPCSGRCGESKCP+LCLATEVFLCFGNSVASTRFLLQDEFNIQTT+CDNCIIGFMFC
Sbjct: 66 AGYMPCSGRCGESKCPQLCLATEVFLCFGNSVASTRFLLQDEFNIQTTQCDNCIIGFMFC 125
Query: 477 LSQVACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGPQ 656
LSQVACIFSIVAC+VGS+ELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFG Q
Sbjct: 126 LSQVACIFSIVACIVGSDELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGSQ 185
Query: 657 PMGVPPAQQMSRFDQPA-PPVGYPPASYPPAQGYPPAPYPPPGYPQH* 797
PMGVPPAQQMSRFDQP PPVGYP + PPAQGYPPA YPPPGYPQH*
Sbjct: 186 PMGVPPAQQMSRFDQPVPPPVGYPQSYPPPAQGYPPASYPPPGYPQH* 233
>TAIR9_protein||AT5G41390.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; CONTAINS InterPro
DOMAIN/s: Protein of unknown function Cys-rich (InterPro:IPR006461);
BEST Arabidopsis thaliana protein match is: proline-rich family protein
(TAIR:AT1G63830.2); Has 12019 Blast hits to 5816 proteins in 448
species: Archae - 6; Bacteria - 1151; Metazoa - 4980; Fungi - 1500;
Plants - 2605; Viruses - 338; Other Eukaryotes - 1439 (source: NCBI
BLink). | chr5:16565576-16567253 FORWARD
Length = 265
Score = 401 bits (1030), Expect = 4e-112
Identities = 181/223 (81%), Positives = 198/223 (88%), Gaps = 4/223 (1%)
Frame = +3
Query: 117 DKAEKMKLRQDYRNLWHSDLMGTVTADTPYCCLSCVCGPCVSYLLRRRALYNDMSRYTCC 296
DK +KM+LRQ YRNLWHSDLMGTV+ADTPYC SC+CGPCVSYLLR+RALYNDMSRYTCC
Sbjct: 6 DKLDKMQLRQSYRNLWHSDLMGTVSADTPYCFFSCLCGPCVSYLLRKRALYNDMSRYTCC 65
Query: 297 AGYMPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNCIIGFMFC 476
GYMPCSG+CGESKCP+ CLATEV LCFGNSVASTRF+LQDEFNI TTKCDNCIIGFMFC
Sbjct: 66 GGYMPCSGKCGESKCPQFCLATEVCLCFGNSVASTRFMLQDEFNIHTTKCDNCIIGFMFC 125
Query: 477 LSQVACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGPQ 656
L+Q+ACIFS+VAC+VGS+ELSEASQ+LSC ADMVYCTVCACMQTQHK+EMDKRDG+ PQ
Sbjct: 126 LNQIACIFSLVACIVGSDELSEASQLLSCLADMVYCTVCACMQTQHKIEMDKRDGLISPQ 185
Query: 657 PMGVPPAQQMSRFDQPAPPVGYPPASYPPAQGYPPAPYPPPGY 785
PM VPPAQQMSR DQP PP A YPPA GYP YP PG+
Sbjct: 186 PMSVPPAQQMSRIDQPVPPY----AGYPPATGYPQHYYPQPGH 224
>TAIR9_protein||AT4G23470.1 | Symbols: | hydroxyproline-rich glycoprotein
family protein | chr4:12249289-12251079 FORWARD
Length = 256
Score = 386 bits (990), Expect = 2e-107
Identities = 176/227 (77%), Positives = 195/227 (85%), Gaps = 4/227 (1%)
Frame = +3
Query: 126 EKMKLRQDYRNLWHSDLMGTVTADTPYCCLSCVCGPCVSYLLRRRALYNDMSRYTCCAGY 305
EKM+LR+++RN+WH+DL ++ DTPYCC + C PC SYLLR+RALY+DMSRY CCAGY
Sbjct: 7 EKMELRKNFRNVWHTDLTHSIQNDTPYCCFALWCAPCASYLLRKRALYDDMSRYVCCAGY 66
Query: 306 MPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNCIIGFMFCLSQ 485
MPCSGRCGE+KCP+LCLATEVF CF NSVASTRFLLQDEF IQTTKCDNCIIGFM CLSQ
Sbjct: 67 MPCSGRCGEAKCPQLCLATEVFCCFANSVASTRFLLQDEFQIQTTKCDNCIIGFMVCLSQ 126
Query: 486 VACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGPQPMG 665
VACIFSIVAC+VG +ELSEASQIL+CC+DMVYCTVCACMQTQHK+EMDKRDG FGPQPM
Sbjct: 127 VACIFSIVACIVGMDELSEASQILTCCSDMVYCTVCACMQTQHKMEMDKRDGKFGPQPMA 186
Query: 666 VPPAQQMSRFDQPAPP-VGYPPASYPPAQGY---PPAPYPPPGYPQH 794
VPPAQQMSRFDQ PP VGYPP P GY PP YPP GYPQ+
Sbjct: 187 VPPAQQMSRFDQATPPAVGYPPQQGYPPSGYPQHPPQGYPPSGYPQN 233
>TAIR9_protein||AT4G23470.3 | Symbols: | hydroxyproline-rich glycoprotein
family protein | chr4:12249289-12251079 FORWARD
Length = 234
Score = 382 bits (980), Expect = 2e-106
Identities = 176/227 (77%), Positives = 196/227 (86%), Gaps = 5/227 (2%)
Frame = +3
Query: 126 EKMKLRQDYRNLWHSDLMGTVTADTPYCCLSCVCGPCVSYLLRRRALYNDMSRYTCCAGY 305
EKM+LR+++RN+WH+DL ++ DTPYCC + C PC SYLLR+RALY+DMSRY CCAGY
Sbjct: 7 EKMELRKNFRNVWHTDLTHSIQNDTPYCCFALWCAPCASYLLRKRALYDDMSRYVCCAGY 66
Query: 306 MPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNCIIGFMFCLSQ 485
MPCSGRCGE+KCP+LCLATEVF CF NSVASTRFLLQDEF IQTTKCDNCIIGFM CLSQ
Sbjct: 67 MPCSGRCGEAKCPQLCLATEVFCCFANSVASTRFLLQDEFQIQTTKCDNCIIGFMVCLSQ 126
Query: 486 VACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGPQPMG 665
VACIFSIVAC+VG +ELSEASQIL+CC+DMVYCTVCACMQTQHK+EMDKRDG FGPQPM
Sbjct: 127 VACIFSIVACIVGMDELSEASQILTCCSDMVYCTVCACMQTQHKMEMDKRDGKFGPQPMA 186
Query: 666 VPPAQQMSRFDQPAPP-VGYPP-ASYPPA--QGYPPAPY-PPPGYPQ 791
VPPAQQMSRFDQ PP VGYPP YPP+ YPP Y PPP YP+
Sbjct: 187 VPPAQQMSRFDQATPPAVGYPPQQGYPPSAYSQYPPGAYPPPPAYPK 233
>TAIR9_protein||AT4G23470.2 | Symbols: | hydroxyproline-rich glycoprotein
family protein | chr4:12249867-12251079 FORWARD
Length = 200
Score = 314 bits (802), Expect = 1e-085
Identities = 146/177 (82%), Positives = 154/177 (87%), Gaps = 4/177 (2%)
Frame = +3
Query: 276 MSRYTCCAGYMPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNC 455
MSRY CCAGYMPCSGRCGE+KCP+LCLATEVF CF NSVASTRFLLQDEF IQTTKCDNC
Sbjct: 1 MSRYVCCAGYMPCSGRCGEAKCPQLCLATEVFCCFANSVASTRFLLQDEFQIQTTKCDNC 60
Query: 456 IIGFMFCLSQVACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKR 635
IIGFM CLSQVACIFSIVAC+VG +ELSEASQIL+CC+DMVYCTVCACMQTQHK+EMDKR
Sbjct: 61 IIGFMVCLSQVACIFSIVACIVGMDELSEASQILTCCSDMVYCTVCACMQTQHKMEMDKR 120
Query: 636 DGVFGPQPMGVPPAQQMSRFDQPAPP-VGYPPASYPPAQGY---PPAPYPPPGYPQH 794
DG FGPQPM VPPAQQMSRFDQ PP VGYPP P GY PP YPP GYPQ+
Sbjct: 121 DGKFGPQPMAVPPAQQMSRFDQATPPAVGYPPQQGYPPSGYPQHPPQGYPPSGYPQN 177
>TAIR9_protein||AT5G41390.2 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; CONTAINS InterPro
DOMAIN/s: Protein of unknown function Cys-rich (InterPro:IPR006461);
BEST Arabidopsis thaliana protein match is: proline-rich family protein
(TAIR:AT1G63830.2); Has 11998 Blast hits to 5796 proteins in 448
species: Archae - 6; Bacteria - 1151; Metazoa - 4966; Fungi - 1500;
Plants - 2602; Viruses - 338; Other Eukaryotes - 1435 (source: NCBI
BLink). | chr5:16566179-16567253 FORWARD
Length = 207
Score = 303 bits (774), Expect = 2e-082
Identities = 137/170 (80%), Positives = 149/170 (87%), Gaps = 4/170 (2%)
Frame = +3
Query: 276 MSRYTCCAGYMPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNC 455
MSRYTCC GYMPCSG+CGESKCP+ CLATEV LCFGNSVASTRF+LQDEFNI TTKCDNC
Sbjct: 1 MSRYTCCGGYMPCSGKCGESKCPQFCLATEVCLCFGNSVASTRFMLQDEFNIHTTKCDNC 60
Query: 456 IIGFMFCLSQVACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKR 635
IIGFMFCL+Q+ACIFS+VAC+VGS+ELSEASQ+LSC ADMVYCTVCACMQTQHK+EMDKR
Sbjct: 61 IIGFMFCLNQIACIFSLVACIVGSDELSEASQLLSCLADMVYCTVCACMQTQHKIEMDKR 120
Query: 636 DGVFGPQPMGVPPAQQMSRFDQPAPPVGYPPASYPPAQGYPPAPYPPPGY 785
DG+ PQPM VPPAQQMSR DQP PP A YPPA GYP YP PG+
Sbjct: 121 DGLISPQPMSVPPAQQMSRIDQPVPPY----AGYPPATGYPQHYYPQPGH 166
>TAIR9_protein||AT5G17650.1 | Symbols: | glycine/proline-rich protein |
chr5:5817000-5817763 REVERSE
Length = 174
Score = 55 bits (132), Expect = 5e-008
Identities = 21/32 (65%), Positives = 23/32 (71%)
Frame = +3
Query: 693 FDQPAPPVGYPPASYPPAQGYPPAPYPPPGYP 788
+ P PP GYPP +YPP GYPPA YPP GYP
Sbjct: 45 YPPPPPPHGYPPVAYPPHGGYPPAGYPPAGYP 76
>TAIR9_protein||AT3G49845.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: root; CONTAINS InterPro
DOMAIN/s: XYPPX repeat (InterPro:IPR006031); Has 14038 Blast hits to
7746 proteins in 541 species: Archae - 8; Bacteria - 1083; Metazoa -
5225; Fungi - 1793; Plants - 3748; Viruses - 424; Other Eukaryotes -
1757 (source: NCBI BLink). | chr3:18487339-18487965 FORWARD
Length = 125
Score = 53 bits (125), Expect = 3e-007
Identities = 28/52 (53%), Positives = 29/52 (55%), Gaps = 4/52 (7%)
Frame = +3
Query: 642 VFGPQPMGVPPAQQMSRFDQPAPPVGYPPASYPPAQGYPPAPYPPP--GYPQ 791
V P P G PP + P PP GYPP YP A GYPPA YPPP GY Q
Sbjct: 10 VSAPPPQGYPPKEGYPPAGYP-PPAGYPPPQYPQA-GYPPAGYPPPQQGYGQ 59
Score = 52 bits (123), Expect = 5e-007
Identities = 26/52 (50%), Gaps = 13/52 (25%)
Frame = +3
Query: 651 PQPMGVPPAQQMSRFDQPAPPVGYPPASYPP-----AQGYPPAPYPPPGYPQ 791
P P G PP Q P GYPPA YPP QGYP YPPP YPQ
Sbjct: 30 PPPAGYPPPQY--------PQAGYPPAGYPPPQQGYGQGYPAQGYPPPQYPQ 73
>TAIR9_protein||AT5G67600.1 | Symbols: | unknown protein |
chr5:26959754-26960226 REVERSE
Length = 83
Score = 48 bits (113), Expect = 8e-006
Identities = 22/29 (75%), Gaps = 2/29 (6%)
Frame = +3
Query: 708 PPVGYPPA-SYPPAQGYPPAPYPPPGYPQ 791
PP GYPP YPPA GYPPA YPPPGY Q
Sbjct: 13 PPQGYPPKDGYPPA-GYPPAGYPPPGYAQ 40
Database: TAIR9 protein
Posted date: Wed Jul 08 15:16:08 2009
Number of letters in database: 13,468,323
Number of sequences in database: 33,410
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,346,187,961
Number of Sequences: 33410
Number of Extensions: 5346187961
Number of Successful Extensions: 195116757
Number of sequences better than 0.0: 0