BLASTX 7.6.2
Query= UN50845 /QuerySize=971
(970 letters)
Database: TAIR9 protein;
33,410 sequences; 13,468,323 total letters
Score E
Sequences producing significant alignments: (bits) Value
TAIR9_protein||AT1G63830.2 | Symbols: | proline-rich family pro... 449 2e-126
TAIR9_protein||AT1G63830.1 | Symbols: | proline-rich family pro... 449 2e-126
TAIR9_protein||AT5G41390.1 | Symbols: | FUNCTIONS IN: molecular... 415 2e-116
TAIR9_protein||AT4G23470.1 | Symbols: | hydroxyproline-rich gly... 394 4e-110
TAIR9_protein||AT4G23470.3 | Symbols: | hydroxyproline-rich gly... 385 3e-107
TAIR9_protein||AT4G23470.2 | Symbols: | hydroxyproline-rich gly... 322 3e-088
TAIR9_protein||AT5G41390.2 | Symbols: | FUNCTIONS IN: molecular... 317 1e-086
TAIR9_protein||AT5G17650.1 | Symbols: | glycine/proline-rich pr... 72 5e-013
TAIR9_protein||AT3G49845.1 | Symbols: | FUNCTIONS IN: molecular... 63 2e-010
TAIR9_protein||AT5G67600.1 | Symbols: | unknown protein | chr5:... 58 7e-009
TAIR9_protein||AT4G19200.1 | Symbols: | proline-rich family pro... 57 2e-008
TAIR9_protein||AT5G45350.1 | Symbols: | proline-rich family pro... 56 3e-008
TAIR9_protein||AT5G45350.2 | Symbols: | proline-rich family pro... 56 3e-008
>TAIR9_protein||AT1G63830.2 | Symbols: | proline-rich family protein |
chr1:23685408-23687098 FORWARD
Length = 233
Score = 449 bits (1153), Expect = 2e-126
Identities = 207/226 (91%), Positives = 216/226 (95%), Gaps = 2/226 (0%)
Frame = +3
Query: 78 DKAEKMKLRQDYRNLWHSDLMGTVTADTPYCCLSCVCGPCVSYLLRRRALYNDMSRYTCC 257
DK +KMKLRQDYRNLWHSDLMGTVTADTPYCC+SC+CGPCVSY+LRRRALYNDMSRYTCC
Sbjct: 6 DKLDKMKLRQDYRNLWHSDLMGTVTADTPYCCISCLCGPCVSYMLRRRALYNDMSRYTCC 65
Query: 258 AGYMPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNCIIGFMFC 437
AGYMPCSGRCGESKCP+LCLATEVFLCFGNSVASTRFLLQDEFNIQTT+CDNCIIGFMFC
Sbjct: 66 AGYMPCSGRCGESKCPQLCLATEVFLCFGNSVASTRFLLQDEFNIQTTQCDNCIIGFMFC 125
Query: 438 LSQVACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGPQ 617
LSQVACIFSIVAC+VGS+ELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFG Q
Sbjct: 126 LSQVACIFSIVACIVGSDELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGSQ 185
Query: 618 PMGVPPAQQMSRFDQPA-PPVGYPPASYPPAQGYPPAPYPPAQGYP 752
PMGVPPAQQMSRFDQP PPVGYP + PPAQGYPPA YPP GYP
Sbjct: 186 PMGVPPAQQMSRFDQPVPPPVGYPQSYPPPAQGYPPASYPP-PGYP 230
Score = 60 bits (145), Expect = 1e-009
Identities = 24/31 (77%), Positives = 25/31 (80%)
Frame = +3
Query: 699 PPAQGYPPAPYPPAQGYPPASYPPPGYPQH* 791
PP GYP + PPAQGYPPASYPPPGYPQH*
Sbjct: 203 PPPVGYPQSYPPPAQGYPPASYPPPGYPQH* 233
>TAIR9_protein||AT1G63830.1 | Symbols: | proline-rich family protein |
chr1:23685408-23687098 FORWARD
Length = 233
Score = 449 bits (1153), Expect = 2e-126
Identities = 207/226 (91%), Positives = 216/226 (95%), Gaps = 2/226 (0%)
Frame = +3
Query: 78 DKAEKMKLRQDYRNLWHSDLMGTVTADTPYCCLSCVCGPCVSYLLRRRALYNDMSRYTCC 257
DK +KMKLRQDYRNLWHSDLMGTVTADTPYCC+SC+CGPCVSY+LRRRALYNDMSRYTCC
Sbjct: 6 DKLDKMKLRQDYRNLWHSDLMGTVTADTPYCCISCLCGPCVSYMLRRRALYNDMSRYTCC 65
Query: 258 AGYMPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNCIIGFMFC 437
AGYMPCSGRCGESKCP+LCLATEVFLCFGNSVASTRFLLQDEFNIQTT+CDNCIIGFMFC
Sbjct: 66 AGYMPCSGRCGESKCPQLCLATEVFLCFGNSVASTRFLLQDEFNIQTTQCDNCIIGFMFC 125
Query: 438 LSQVACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGPQ 617
LSQVACIFSIVAC+VGS+ELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFG Q
Sbjct: 126 LSQVACIFSIVACIVGSDELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGSQ 185
Query: 618 PMGVPPAQQMSRFDQPA-PPVGYPPASYPPAQGYPPAPYPPAQGYP 752
PMGVPPAQQMSRFDQP PPVGYP + PPAQGYPPA YPP GYP
Sbjct: 186 PMGVPPAQQMSRFDQPVPPPVGYPQSYPPPAQGYPPASYPP-PGYP 230
Score = 60 bits (145), Expect = 1e-009
Identities = 24/31 (77%), Positives = 25/31 (80%)
Frame = +3
Query: 699 PPAQGYPPAPYPPAQGYPPASYPPPGYPQH* 791
PP GYP + PPAQGYPPASYPPPGYPQH*
Sbjct: 203 PPPVGYPQSYPPPAQGYPPASYPPPGYPQH* 233
>TAIR9_protein||AT5G41390.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; CONTAINS InterPro
DOMAIN/s: Protein of unknown function Cys-rich (InterPro:IPR006461);
BEST Arabidopsis thaliana protein match is: proline-rich family protein
(TAIR:AT1G63830.2); Has 12019 Blast hits to 5816 proteins in 448
species: Archae - 6; Bacteria - 1151; Metazoa - 4980; Fungi - 1500;
Plants - 2605; Viruses - 338; Other Eukaryotes - 1439 (source: NCBI
BLink). | chr5:16565576-16567253 FORWARD
Length = 265
Score = 415 bits (1066), Expect = 2e-116
Identities = 190/236 (80%), Positives = 207/236 (87%), Gaps = 6/236 (2%)
Frame = +3
Query: 78 DKAEKMKLRQDYRNLWHSDLMGTVTADTPYCCLSCVCGPCVSYLLRRRALYNDMSRYTCC 257
DK +KM+LRQ YRNLWHSDLMGTV+ADTPYC SC+CGPCVSYLLR+RALYNDMSRYTCC
Sbjct: 6 DKLDKMQLRQSYRNLWHSDLMGTVSADTPYCFFSCLCGPCVSYLLRKRALYNDMSRYTCC 65
Query: 258 AGYMPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNCIIGFMFC 437
GYMPCSG+CGESKCP+ CLATEV LCFGNSVASTRF+LQDEFNI TTKCDNCIIGFMFC
Sbjct: 66 GGYMPCSGKCGESKCPQFCLATEVCLCFGNSVASTRFMLQDEFNIHTTKCDNCIIGFMFC 125
Query: 438 LSQVACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGPQ 617
L+Q+ACIFS+VAC+VGS+ELSEASQ+LSC ADMVYCTVCACMQTQHK+EMDKRDG+ PQ
Sbjct: 126 LNQIACIFSLVACIVGSDELSEASQLLSCLADMVYCTVCACMQTQHKIEMDKRDGLISPQ 185
Query: 618 PMGVPPAQQMSRFDQPAPPVGYPPASYPPAQGYPPAPYP-PAQGYPPA-SYPPPGY 779
PM VPPAQQMSR DQP PP A YPPA GYP YP P GYPPA YPPPG+
Sbjct: 186 PMSVPPAQQMSRIDQPVPPY----AGYPPATGYPQHYYPQPGHGYPPAPGYPPPGH 237
>TAIR9_protein||AT4G23470.1 | Symbols: | hydroxyproline-rich glycoprotein
family protein | chr4:12249289-12251079 FORWARD
Length = 256
Score = 394 bits (1012), Expect = 4e-110
Identities = 178/234 (76%), Positives = 199/234 (85%), Gaps = 7/234 (2%)
Frame = +3
Query: 87 EKMKLRQDYRNLWHSDLMGTVTADTPYCCLSCVCGPCVSYLLRRRALYNDMSRYTCCAGY 266
EKM+LR+++RN+WH+DL ++ DTPYCC + C PC SYLLR+RALY+DMSRY CCAGY
Sbjct: 7 EKMELRKNFRNVWHTDLTHSIQNDTPYCCFALWCAPCASYLLRKRALYDDMSRYVCCAGY 66
Query: 267 MPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNCIIGFMFCLSQ 446
MPCSGRCGE+KCP+LCLATEVF CF NSVASTRFLLQDEF IQTTKCDNCIIGFM CLSQ
Sbjct: 67 MPCSGRCGEAKCPQLCLATEVFCCFANSVASTRFLLQDEFQIQTTKCDNCIIGFMVCLSQ 126
Query: 447 VACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGPQPMG 626
VACIFSIVAC+VG +ELSEASQIL+CC+DMVYCTVCACMQTQHK+EMDKRDG FGPQPM
Sbjct: 127 VACIFSIVACIVGMDELSEASQILTCCSDMVYCTVCACMQTQHKMEMDKRDGKFGPQPMA 186
Query: 627 VPPAQQMSRFDQPAPPVGYPPASYPPAQGYPPAPYPPAQGYPPASYPPPGYPQH 788
VPPAQQMSRFDQ PP YPP QGYPP+ YP +PP YPP GYPQ+
Sbjct: 187 VPPAQQMSRFDQATPPA----VGYPPQQGYPPSGYPQ---HPPQGYPPSGYPQN 233
>TAIR9_protein||AT4G23470.3 | Symbols: | hydroxyproline-rich glycoprotein
family protein | chr4:12249289-12251079 FORWARD
Length = 234
Score = 385 bits (987), Expect = 3e-107
Identities = 174/228 (76%), Positives = 194/228 (85%), Gaps = 7/228 (3%)
Frame = +3
Query: 87 EKMKLRQDYRNLWHSDLMGTVTADTPYCCLSCVCGPCVSYLLRRRALYNDMSRYTCCAGY 266
EKM+LR+++RN+WH+DL ++ DTPYCC + C PC SYLLR+RALY+DMSRY CCAGY
Sbjct: 7 EKMELRKNFRNVWHTDLTHSIQNDTPYCCFALWCAPCASYLLRKRALYDDMSRYVCCAGY 66
Query: 267 MPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNCIIGFMFCLSQ 446
MPCSGRCGE+KCP+LCLATEVF CF NSVASTRFLLQDEF IQTTKCDNCIIGFM CLSQ
Sbjct: 67 MPCSGRCGEAKCPQLCLATEVFCCFANSVASTRFLLQDEFQIQTTKCDNCIIGFMVCLSQ 126
Query: 447 VACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGPQPMG 626
VACIFSIVAC+VG +ELSEASQIL+CC+DMVYCTVCACMQTQHK+EMDKRDG FGPQPM
Sbjct: 127 VACIFSIVACIVGMDELSEASQILTCCSDMVYCTVCACMQTQHKMEMDKRDGKFGPQPMA 186
Query: 627 VPPAQQMSRFDQPAPPVGYPPASYPPAQGYPPAPYPPAQGYPPASYPP 770
VPPAQQMSRFDQ PP YPP QGYPP+ Y YPP +YPP
Sbjct: 187 VPPAQQMSRFDQATPPA----VGYPPQQGYPPSAY---SQYPPGAYPP 227
>TAIR9_protein||AT4G23470.2 | Symbols: | hydroxyproline-rich glycoprotein
family protein | chr4:12249867-12251079 FORWARD
Length = 200
Score = 322 bits (824), Expect = 3e-088
Identities = 148/184 (80%), Positives = 158/184 (85%), Gaps = 7/184 (3%)
Frame = +3
Query: 237 MSRYTCCAGYMPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNC 416
MSRY CCAGYMPCSGRCGE+KCP+LCLATEVF CF NSVASTRFLLQDEF IQTTKCDNC
Sbjct: 1 MSRYVCCAGYMPCSGRCGEAKCPQLCLATEVFCCFANSVASTRFLLQDEFQIQTTKCDNC 60
Query: 417 IIGFMFCLSQVACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKR 596
IIGFM CLSQVACIFSIVAC+VG +ELSEASQIL+CC+DMVYCTVCACMQTQHK+EMDKR
Sbjct: 61 IIGFMVCLSQVACIFSIVACIVGMDELSEASQILTCCSDMVYCTVCACMQTQHKMEMDKR 120
Query: 597 DGVFGPQPMGVPPAQQMSRFDQPAPPVGYPPASYPPAQGYPPAPYPPAQGYPPASYPPPG 776
DG FGPQPM VPPAQQMSRFDQ PP YPP QGYPP+ YP +PP YPP G
Sbjct: 121 DGKFGPQPMAVPPAQQMSRFDQATPPA----VGYPPQQGYPPSGYPQ---HPPQGYPPSG 173
Query: 777 YPQH 788
YPQ+
Sbjct: 174 YPQN 177
>TAIR9_protein||AT5G41390.2 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; CONTAINS InterPro
DOMAIN/s: Protein of unknown function Cys-rich (InterPro:IPR006461);
BEST Arabidopsis thaliana protein match is: proline-rich family protein
(TAIR:AT1G63830.2); Has 11998 Blast hits to 5796 proteins in 448
species: Archae - 6; Bacteria - 1151; Metazoa - 4966; Fungi - 1500;
Plants - 2602; Viruses - 338; Other Eukaryotes - 1435 (source: NCBI
BLink). | chr5:16566179-16567253 FORWARD
Length = 207
Score = 317 bits (810), Expect = 1e-086
Identities = 146/183 (79%), Positives = 158/183 (86%), Gaps = 6/183 (3%)
Frame = +3
Query: 237 MSRYTCCAGYMPCSGRCGESKCPELCLATEVFLCFGNSVASTRFLLQDEFNIQTTKCDNC 416
MSRYTCC GYMPCSG+CGESKCP+ CLATEV LCFGNSVASTRF+LQDEFNI TTKCDNC
Sbjct: 1 MSRYTCCGGYMPCSGKCGESKCPQFCLATEVCLCFGNSVASTRFMLQDEFNIHTTKCDNC 60
Query: 417 IIGFMFCLSQVACIFSIVACLVGSEELSEASQILSCCADMVYCTVCACMQTQHKLEMDKR 596
IIGFMFCL+Q+ACIFS+VAC+VGS+ELSEASQ+LSC ADMVYCTVCACMQTQHK+EMDKR
Sbjct: 61 IIGFMFCLNQIACIFSLVACIVGSDELSEASQLLSCLADMVYCTVCACMQTQHKIEMDKR 120
Query: 597 DGVFGPQPMGVPPAQQMSRFDQPAPPVGYPPASYPPAQGYPPAPYP-PAQGYPPA-SYPP 770
DG+ PQPM VPPAQQMSR DQP PP A YPPA GYP YP P GYPPA YPP
Sbjct: 121 DGLISPQPMSVPPAQQMSRIDQPVPPY----AGYPPATGYPQHYYPQPGHGYPPAPGYPP 176
Query: 771 PGY 779
PG+
Sbjct: 177 PGH 179
>TAIR9_protein||AT5G17650.1 | Symbols: | glycine/proline-rich protein |
chr5:5817000-5817763 REVERSE
Length = 174
Score = 72 bits (175), Expect = 5e-013
Identities = 30/45 (66%), Positives = 32/45 (71%), Gaps = 1/45 (2%)
Frame = +3
Query: 654 FDQPAPPVGYPPASYPPAQGYPPAPYPPAQGYPPASYPPPGYPQH 788
+ P PP GYPP +YPP GYPPA YPPA GYPPA YP GYP H
Sbjct: 45 YPPPPPPHGYPPVAYPPHGGYPPAGYPPA-GYPPAGYPAHGYPSH 88
>TAIR9_protein||AT3G49845.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: root; CONTAINS InterPro
DOMAIN/s: XYPPX repeat (InterPro:IPR006031); Has 14038 Blast hits to
7746 proteins in 541 species: Archae - 8; Bacteria - 1083; Metazoa -
5225; Fungi - 1793; Plants - 3748; Viruses - 424; Other Eukaryotes -
1757 (source: NCBI BLink). | chr3:18487339-18487965 FORWARD
Length = 125
Score = 63 bits (152), Expect = 2e-010
Identities = 32/60 (53%), Positives = 33/60 (55%), Gaps = 15/60 (25%)
Frame = +3
Query: 612 PQPMGVPPAQQMSRFDQPAPPVGYPPASYPPAQGYPPAPYPPAQGYPPASYPPP--GYPQ 785
P P G PP + GYPPA YPP GYPP YP A GYPPA YPPP GY Q
Sbjct: 13 PPPQGYPPKE------------GYPPAGYPPPAGYPPPQYPQA-GYPPAGYPPPQQGYGQ 59
Score = 61 bits (147), Expect = 8e-010
Identities = 26/46 (56%), Positives = 27/46 (58%)
Frame = +3
Query: 645 MSRFDQPAPPVGYPPASYPPAQGYPPAPYPPAQGYPPASYPPPGYP 782
MS D P PP YPP +GYPPA YPP GYPP YP GYP
Sbjct: 1 MSYQDPQHPVSAPPPQGYPPKEGYPPAGYPPPAGYPPPQYPQAGYP 46
>TAIR9_protein||AT5G67600.1 | Symbols: | unknown protein |
chr5:26959754-26960226 REVERSE
Length = 83
Score = 58 bits (139), Expect = 7e-009
Identities = 29/43 (67%), Gaps = 5/43 (11%)
Frame = +3
Query: 669 PPVGYPPA-SYPPAQGYPPAPYPP---AQGYPPASYPPPGYPQ 785
PP GYPP YPPA GYPPA YPP AQGYP YPPP Y Q
Sbjct: 13 PPQGYPPKDGYPPA-GYPPAGYPPPGYAQGYPAQGYPPPQYSQ 54
>TAIR9_protein||AT4G19200.1 | Symbols: | proline-rich family protein |
chr4:10499277-10500390 FORWARD
Length = 180
Score = 57 bits (136), Expect = 2e-008
Identities = 28/43 (65%), Positives = 29/43 (67%), Gaps = 4/43 (9%)
Frame = +3
Query: 663 PAPPVGYPPASYPPAQGYPPA-PYPPAQGYPPASYP--PPGYP 782
P GYPP YPP QGYPPA YPPA GYPP +YP P GYP
Sbjct: 26 PPAQGGYPPQGYPPQQGYPPAGGYPPA-GYPPGAYPAAPGGYP 67
>TAIR9_protein||AT5G45350.1 | Symbols: | proline-rich family protein |
chr5:18382100-18382854 REVERSE
Length = 178
Score = 56 bits (133), Expect = 3e-008
Identities = 30/47 (63%), Positives = 32/47 (68%), Gaps = 10/47 (21%)
Frame = +3
Query: 669 PPVGY-PPASYPPA----QGYPPAP--YPPAQGYPPASYPPP--GYP 782
PP GY PP +YPPA QGYPP P YPPA GYPP +YPP GYP
Sbjct: 17 PPAGYPPPGAYPPAGYPQQGYPPPPGAYPPA-GYPPGAYPPAPGGYP 62
>TAIR9_protein||AT5G45350.2 | Symbols: | proline-rich family protein |
chr5:18382100-18382854 REVERSE
Length = 178
Score = 56 bits (133), Expect = 3e-008
Identities = 30/47 (63%), Positives = 32/47 (68%), Gaps = 10/47 (21%)
Frame = +3
Query: 669 PPVGY-PPASYPPA----QGYPPAP--YPPAQGYPPASYPPP--GYP 782
PP GY PP +YPPA QGYPP P YPPA GYPP +YPP GYP
Sbjct: 17 PPAGYPPPGAYPPAGYPQQGYPPPPGAYPPA-GYPPGAYPPAPGGYP 62
Database: TAIR9 protein
Posted date: Wed Jul 08 15:16:08 2009
Number of letters in database: 13,468,323
Number of sequences in database: 33,410
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 24,087,338,472
Number of Sequences: 33410
Number of Extensions: 24087338472
Number of Successful Extensions: 761288629
Number of sequences better than 0.0: 0