BLASTX 7.6.2
Query= UN09598 /QuerySize=1290
(1289 letters)
Database: TAIR9 protein;
33,410 sequences; 13,468,323 total letters
Score E
Sequences producing significant alignments: (bits) Value
TAIR9_protein||AT4G23470.1 | Symbols: | hydroxyproline-rich gly... 481 6e-136
TAIR9_protein||AT4G23470.3 | Symbols: | hydroxyproline-rich gly... 437 1e-122
TAIR9_protein||AT5G41390.1 | Symbols: | FUNCTIONS IN: molecular... 395 4e-110
TAIR9_protein||AT1G63830.2 | Symbols: | proline-rich family pro... 381 9e-106
TAIR9_protein||AT1G63830.1 | Symbols: | proline-rich family pro... 381 9e-106
TAIR9_protein||AT4G23470.2 | Symbols: | hydroxyproline-rich gly... 374 1e-103
TAIR9_protein||AT5G41390.2 | Symbols: | FUNCTIONS IN: molecular... 321 8e-088
TAIR9_protein||AT3G49845.1 | Symbols: | FUNCTIONS IN: molecular... 80 3e-015
TAIR9_protein||AT5G45350.1 | Symbols: | proline-rich family pro... 76 5e-014
TAIR9_protein||AT5G45350.2 | Symbols: | proline-rich family pro... 76 5e-014
TAIR9_protein||AT4G34150.1 | Symbols: | C2 domain-containing pr... 67 2e-011
TAIR9_protein||AT5G59170.1 | Symbols: | proline-rich family pro... 56 4e-008
>TAIR9_protein||AT4G23470.1 | Symbols: | hydroxyproline-rich glycoprotein
family protein | chr4:12249289-12251079 FORWARD
Length = 256
Score = 481 bits (1236), Expect = 6e-136
Identities = 220/255 (86%), Positives = 232/255 (90%), Gaps = 6/255 (2%)
Frame = +2
Query: 218 PKFDTEKMQERQNFRNVWHTDLTHSIQGDTPYCCFALWCAPCASYLLRKRALYNDMSRYT 397
PK D EKM+ R+NFRNVWHTDLTHSIQ DTPYCCFALWCAPCASYLLRKRALY+DMSRY
Sbjct: 2 PKQDMEKMELRKNFRNVWHTDLTHSIQNDTPYCCFALWCAPCASYLLRKRALYDDMSRYV 61
Query: 398 CCAGYMPCSGRCGETKCPQLCLATEVFCCFGTSVASTRFLLQDEFQIQTTQCDNCIIGFM 577
CCAGYMPCSGRCGE KCPQLCLATEVFCCF SVASTRFLLQDEFQIQTT+CDNCIIGFM
Sbjct: 62 CCAGYMPCSGRCGEAKCPQLCLATEVFCCFANSVASTRFLLQDEFQIQTTKCDNCIIGFM 121
Query: 578 VCLSQVACIFSIVACIVGIDELSEASQLLSCLADMVYCTVCACMQTQHKVEMDKRDGKFG 757
VCLSQVACIFSIVACIVG+DELSEASQ+L+C +DMVYCTVCACMQTQHK+EMDKRDGKFG
Sbjct: 122 VCLSQVACIFSIVACIVGMDELSEASQILTCCSDMVYCTVCACMQTQHKMEMDKRDGKFG 181
Query: 758 PQPMAVPPPQVMSRIDQATPPAIGYPP-QGYPPSGYPQHPPQGYPPSGYPQHPPQG---- 922
PQPMAVPP Q MSR DQATPPA+GYPP QGYPPSGYPQHPPQGYPPSGYPQ+PP
Sbjct: 182 PQPMAVPPAQQMSRFDQATPPAVGYPPQQGYPPSGYPQHPPQGYPPSGYPQNPPPSAYSQ 241
Query: 923 YPPSGYPQNPPAYPQ 967
YPP YP PPAYP+
Sbjct: 242 YPPGAYPP-PPAYPK 255
Score = 99 bits (246), Expect = 4e-021
Identities = 50/73 (68%), Positives = 52/73 (71%), Gaps = 7/73 (9%)
Frame = +2
Query: 809 ATPPA--IGYPPQGYPPS-GYPQHPPQGYPPSGYPQHPPQGYPPSGYPQNPP--AYPQYP 973
A PPA + Q PP+ GYP P QGYPPSGYPQHPPQGYPPSGYPQNPP AY QYP
Sbjct: 186 AVPPAQQMSRFDQATPPAVGYP--PQQGYPPSGYPQHPPQGYPPSGYPQNPPPSAYSQYP 243
Query: 974 PGPAYPPQAYPK* 1012
PG PP AYPK*
Sbjct: 244 PGAYPPPPAYPK* 256
>TAIR9_protein||AT4G23470.3 | Symbols: | hydroxyproline-rich glycoprotein
family protein | chr4:12249289-12251079 FORWARD
Length = 234
Score = 437 bits (1122), Expect = 1e-122
Identities = 200/232 (86%), Positives = 212/232 (91%), Gaps = 2/232 (0%)
Frame = +2
Query: 218 PKFDTEKMQERQNFRNVWHTDLTHSIQGDTPYCCFALWCAPCASYLLRKRALYNDMSRYT 397
PK D EKM+ R+NFRNVWHTDLTHSIQ DTPYCCFALWCAPCASYLLRKRALY+DMSRY
Sbjct: 2 PKQDMEKMELRKNFRNVWHTDLTHSIQNDTPYCCFALWCAPCASYLLRKRALYDDMSRYV 61
Query: 398 CCAGYMPCSGRCGETKCPQLCLATEVFCCFGTSVASTRFLLQDEFQIQTTQCDNCIIGFM 577
CCAGYMPCSGRCGE KCPQLCLATEVFCCF SVASTRFLLQDEFQIQTT+CDNCIIGFM
Sbjct: 62 CCAGYMPCSGRCGEAKCPQLCLATEVFCCFANSVASTRFLLQDEFQIQTTKCDNCIIGFM 121
Query: 578 VCLSQVACIFSIVACIVGIDELSEASQLLSCLADMVYCTVCACMQTQHKVEMDKRDGKFG 757
VCLSQVACIFSIVACIVG+DELSEASQ+L+C +DMVYCTVCACMQTQHK+EMDKRDGKFG
Sbjct: 122 VCLSQVACIFSIVACIVGMDELSEASQILTCCSDMVYCTVCACMQTQHKMEMDKRDGKFG 181
Query: 758 PQPMAVPPPQVMSRIDQATPPAIGYPP-QGYPPSGYPQHPPQGY-PPSGYPQ 907
PQPMAVPP Q MSR DQATPPA+GYPP QGYPPS Y Q+PP Y PP YP+
Sbjct: 182 PQPMAVPPAQQMSRFDQATPPAVGYPPQQGYPPSAYSQYPPGAYPPPPAYPK 233
>TAIR9_protein||AT5G41390.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; CONTAINS InterPro
DOMAIN/s: Protein of unknown function Cys-rich (InterPro:IPR006461);
BEST Arabidopsis thaliana protein match is: proline-rich family protein
(TAIR:AT1G63830.2); Has 12019 Blast hits to 5816 proteins in 448
species: Archae - 6; Bacteria - 1151; Metazoa - 4980; Fungi - 1500;
Plants - 2605; Viruses - 338; Other Eukaryotes - 1439 (source: NCBI
BLink). | chr5:16565576-16567253 FORWARD
Length = 265
Score = 395 bits (1014), Expect = 4e-110
Identities = 188/257 (73%), Positives = 207/257 (80%), Gaps = 7/257 (2%)
Frame = +2
Query: 233 EKMQERQNFRNVWHTDLTHSIQGDTPYCCFALWCAPCASYLLRKRALYNDMSRYTCCAGY 412
+KMQ RQ++RN+WH+DL ++ DTPYC F+ C PC SYLLRKRALYNDMSRYTCC GY
Sbjct: 9 DKMQLRQSYRNLWHSDLMGTVSADTPYCFFSCLCGPCVSYLLRKRALYNDMSRYTCCGGY 68
Query: 413 MPCSGRCGETKCPQLCLATEVFCCFGTSVASTRFLLQDEFQIQTTQCDNCIIGFMVCLSQ 592
MPCSG+CGE+KCPQ CLATEV CFG SVASTRF+LQDEF I TT+CDNCIIGFM CL+Q
Sbjct: 69 MPCSGKCGESKCPQFCLATEVCLCFGNSVASTRFMLQDEFNIHTTKCDNCIIGFMFCLNQ 128
Query: 593 VACIFSIVACIVGIDELSEASQLLSCLADMVYCTVCACMQTQHKVEMDKRDGKFGPQPMA 772
+ACIFS+VACIVG DELSEASQLLSCLADMVYCTVCACMQTQHK+EMDKRDG PQPM+
Sbjct: 129 IACIFSLVACIVGSDELSEASQLLSCLADMVYCTVCACMQTQHKIEMDKRDGLISPQPMS 188
Query: 773 VPPPQVMSRIDQATPPAIGYPP-QGYPPSGYPQHPPQGYPPS-GYPQHPPQGYPPS-GYP 943
VPP Q MSRIDQ PP GYPP GYP YPQ P GYPP+ GYP P GYPP+ GYP
Sbjct: 189 VPPAQQMSRIDQPVPPYAGYPPATGYPQHYYPQ-PGHGYPPAPGYPP-PGHGYPPAPGYP 246
Query: 944 QNPPAYPQ-YPPGPAYP 991
P YP YPP P YP
Sbjct: 247 P-APGYPSGYPPAPGYP 262
>TAIR9_protein||AT1G63830.2 | Symbols: | proline-rich family protein |
chr1:23685408-23687098 FORWARD
Length = 233
Score = 381 bits (976), Expect = 9e-106
Identities = 174/225 (77%), Positives = 190/225 (84%), Gaps = 2/225 (0%)
Frame = +2
Query: 233 EKMQERQNFRNVWHTDLTHSIQGDTPYCCFALWCAPCASYLLRKRALYNDMSRYTCCAGY 412
+KM+ RQ++RN+WH+DL ++ DTPYCC + C PC SY+LR+RALYNDMSRYTCCAGY
Sbjct: 9 DKMKLRQDYRNLWHSDLMGTVTADTPYCCISCLCGPCVSYMLRRRALYNDMSRYTCCAGY 68
Query: 413 MPCSGRCGETKCPQLCLATEVFCCFGTSVASTRFLLQDEFQIQTTQCDNCIIGFMVCLSQ 592
MPCSGRCGE+KCPQLCLATEVF CFG SVASTRFLLQDEF IQTTQCDNCIIGFM CLSQ
Sbjct: 69 MPCSGRCGESKCPQLCLATEVFLCFGNSVASTRFLLQDEFNIQTTQCDNCIIGFMFCLSQ 128
Query: 593 VACIFSIVACIVGIDELSEASQLLSCLADMVYCTVCACMQTQHKVEMDKRDGKFGPQPMA 772
VACIFSIVACIVG DELSEASQ+LSC ADMVYCTVCACMQTQHK+EMDKRDG FG QPM
Sbjct: 129 VACIFSIVACIVGSDELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGSQPMG 188
Query: 773 VPPPQVMSRIDQATPPAIGYPPQGYPPSGYPQHPPQGYPPSGYPQ 907
VPP Q MSR DQ PP +GY PQ YPP +PP YPP GYPQ
Sbjct: 189 VPPAQQMSRFDQPVPPPVGY-PQSYPPPA-QGYPPASYPPPGYPQ 231
>TAIR9_protein||AT1G63830.1 | Symbols: | proline-rich family protein |
chr1:23685408-23687098 FORWARD
Length = 233
Score = 381 bits (976), Expect = 9e-106
Identities = 174/225 (77%), Positives = 190/225 (84%), Gaps = 2/225 (0%)
Frame = +2
Query: 233 EKMQERQNFRNVWHTDLTHSIQGDTPYCCFALWCAPCASYLLRKRALYNDMSRYTCCAGY 412
+KM+ RQ++RN+WH+DL ++ DTPYCC + C PC SY+LR+RALYNDMSRYTCCAGY
Sbjct: 9 DKMKLRQDYRNLWHSDLMGTVTADTPYCCISCLCGPCVSYMLRRRALYNDMSRYTCCAGY 68
Query: 413 MPCSGRCGETKCPQLCLATEVFCCFGTSVASTRFLLQDEFQIQTTQCDNCIIGFMVCLSQ 592
MPCSGRCGE+KCPQLCLATEVF CFG SVASTRFLLQDEF IQTTQCDNCIIGFM CLSQ
Sbjct: 69 MPCSGRCGESKCPQLCLATEVFLCFGNSVASTRFLLQDEFNIQTTQCDNCIIGFMFCLSQ 128
Query: 593 VACIFSIVACIVGIDELSEASQLLSCLADMVYCTVCACMQTQHKVEMDKRDGKFGPQPMA 772
VACIFSIVACIVG DELSEASQ+LSC ADMVYCTVCACMQTQHK+EMDKRDG FG QPM
Sbjct: 129 VACIFSIVACIVGSDELSEASQILSCCADMVYCTVCACMQTQHKLEMDKRDGVFGSQPMG 188
Query: 773 VPPPQVMSRIDQATPPAIGYPPQGYPPSGYPQHPPQGYPPSGYPQ 907
VPP Q MSR DQ PP +GY PQ YPP +PP YPP GYPQ
Sbjct: 189 VPPAQQMSRFDQPVPPPVGY-PQSYPPPA-QGYPPASYPPPGYPQ 231
>TAIR9_protein||AT4G23470.2 | Symbols: | hydroxyproline-rich glycoprotein
family protein | chr4:12249867-12251079 FORWARD
Length = 200
Score = 374 bits (958), Expect = 1e-103
Identities = 172/200 (86%), Positives = 181/200 (90%), Gaps = 6/200 (3%)
Frame = +2
Query: 383 MSRYTCCAGYMPCSGRCGETKCPQLCLATEVFCCFGTSVASTRFLLQDEFQIQTTQCDNC 562
MSRY CCAGYMPCSGRCGE KCPQLCLATEVFCCF SVASTRFLLQDEFQIQTT+CDNC
Sbjct: 1 MSRYVCCAGYMPCSGRCGEAKCPQLCLATEVFCCFANSVASTRFLLQDEFQIQTTKCDNC 60
Query: 563 IIGFMVCLSQVACIFSIVACIVGIDELSEASQLLSCLADMVYCTVCACMQTQHKVEMDKR 742
IIGFMVCLSQVACIFSIVACIVG+DELSEASQ+L+C +DMVYCTVCACMQTQHK+EMDKR
Sbjct: 61 IIGFMVCLSQVACIFSIVACIVGMDELSEASQILTCCSDMVYCTVCACMQTQHKMEMDKR 120
Query: 743 DGKFGPQPMAVPPPQVMSRIDQATPPAIGYPP-QGYPPSGYPQHPPQGYPPSGYPQHPPQ 919
DGKFGPQPMAVPP Q MSR DQATPPA+GYPP QGYPPSGYPQHPPQGYPPSGYPQ+PP
Sbjct: 121 DGKFGPQPMAVPPAQQMSRFDQATPPAVGYPPQQGYPPSGYPQHPPQGYPPSGYPQNPPP 180
Query: 920 G----YPPSGYPQNPPAYPQ 967
YPP YP PPAYP+
Sbjct: 181 SAYSQYPPGAYPP-PPAYPK 199
Score = 99 bits (246), Expect = 4e-021
Identities = 50/73 (68%), Positives = 52/73 (71%), Gaps = 7/73 (9%)
Frame = +2
Query: 809 ATPPA--IGYPPQGYPPS-GYPQHPPQGYPPSGYPQHPPQGYPPSGYPQNPP--AYPQYP 973
A PPA + Q PP+ GYP P QGYPPSGYPQHPPQGYPPSGYPQNPP AY QYP
Sbjct: 130 AVPPAQQMSRFDQATPPAVGYP--PQQGYPPSGYPQHPPQGYPPSGYPQNPPPSAYSQYP 187
Query: 974 PGPAYPPQAYPK* 1012
PG PP AYPK*
Sbjct: 188 PGAYPPPPAYPK* 200
>TAIR9_protein||AT5G41390.2 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; CONTAINS InterPro
DOMAIN/s: Protein of unknown function Cys-rich (InterPro:IPR006461);
BEST Arabidopsis thaliana protein match is: proline-rich family protein
(TAIR:AT1G63830.2); Has 11998 Blast hits to 5796 proteins in 448
species: Archae - 6; Bacteria - 1151; Metazoa - 4966; Fungi - 1500;
Plants - 2602; Viruses - 338; Other Eukaryotes - 1435 (source: NCBI
BLink). | chr5:16566179-16567253 FORWARD
Length = 207
Score = 321 bits (821), Expect = 8e-088
Identities = 156/207 (75%), Positives = 167/207 (80%), Gaps = 7/207 (3%)
Frame = +2
Query: 383 MSRYTCCAGYMPCSGRCGETKCPQLCLATEVFCCFGTSVASTRFLLQDEFQIQTTQCDNC 562
MSRYTCC GYMPCSG+CGE+KCPQ CLATEV CFG SVASTRF+LQDEF I TT+CDNC
Sbjct: 1 MSRYTCCGGYMPCSGKCGESKCPQFCLATEVCLCFGNSVASTRFMLQDEFNIHTTKCDNC 60
Query: 563 IIGFMVCLSQVACIFSIVACIVGIDELSEASQLLSCLADMVYCTVCACMQTQHKVEMDKR 742
IIGFM CL+Q+ACIFS+VACIVG DELSEASQLLSCLADMVYCTVCACMQTQHK+EMDKR
Sbjct: 61 IIGFMFCLNQIACIFSLVACIVGSDELSEASQLLSCLADMVYCTVCACMQTQHKIEMDKR 120
Query: 743 DGKFGPQPMAVPPPQVMSRIDQATPPAIGYPP-QGYPPSGYPQHPPQGYPPS-GYPQHPP 916
DG PQPM+VPP Q MSRIDQ PP GYPP GYP YPQ P GYPP+ GYP P
Sbjct: 121 DGLISPQPMSVPPAQQMSRIDQPVPPYAGYPPATGYPQHYYPQ-PGHGYPPAPGYPP-PG 178
Query: 917 QGYPPS-GYPQNPPAYPQ-YPPGPAYP 991
GYPP+ GYP P YP YPP P YP
Sbjct: 179 HGYPPAPGYPP-APGYPSGYPPAPGYP 204
>TAIR9_protein||AT3G49845.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: root; CONTAINS InterPro
DOMAIN/s: XYPPX repeat (InterPro:IPR006031); Has 14038 Blast hits to
7746 proteins in 541 species: Archae - 8; Bacteria - 1083; Metazoa -
5225; Fungi - 1793; Plants - 3748; Viruses - 424; Other Eukaryotes -
1757 (source: NCBI BLink). | chr3:18487339-18487965 FORWARD
Length = 125
Score = 80 bits (195), Expect = 3e-015
Identities = 37/68 (54%), Positives = 43/68 (63%), Gaps = 6/68 (8%)
Frame = +2
Query: 809 ATPPAIGYPP-QGYPPSGYPQHPPQGYPPSGYPQHPPQGYPPSGYPQNPPAYPQYPPGPA 985
+ PP GYPP +GYPP+GYP PP GYPP PQ+P GYPP+GYP Y Q P
Sbjct: 11 SAPPPQGYPPKEGYPPAGYP--PPAGYPP---PQYPQAGYPPAGYPPPQQGYGQGYPAQG 65
Query: 986 YPPQAYPK 1009
YPP YP+
Sbjct: 66 YPPPQYPQ 73
>TAIR9_protein||AT5G45350.1 | Symbols: | proline-rich family protein |
chr5:18382100-18382854 REVERSE
Length = 178
Score = 76 bits (185), Expect = 5e-014
Identities = 48/97 (49%), Positives = 50/97 (51%), Gaps = 16/97 (16%)
Frame = +2
Query: 734 DKRDGKFGPQPMAVPPPQVMSRIDQATPPAIGYPPQGYPPSGYPQHPPQGYPPSGYPQHP 913
DK G G P PPP A PPA GYP QGYPP +PP GYPP YP
Sbjct: 8 DKDKGFHGYPPAGYPPP-------GAYPPA-GYPQQGYPPPP-GAYPPAGYPPGAYPP-A 57
Query: 914 PQGYPPS-GYPQNPPA--YPQYPPGP---AYPPQAYP 1006
P GYPP+ GY PPA Y YPP P YPP YP
Sbjct: 58 PGGYPPAPGYGGYPPAPGYGGYPPAPGHGGYPPAGYP 94
>TAIR9_protein||AT5G45350.2 | Symbols: | proline-rich family protein |
chr5:18382100-18382854 REVERSE
Length = 178
Score = 76 bits (185), Expect = 5e-014
Identities = 48/97 (49%), Positives = 50/97 (51%), Gaps = 16/97 (16%)
Frame = +2
Query: 734 DKRDGKFGPQPMAVPPPQVMSRIDQATPPAIGYPPQGYPPSGYPQHPPQGYPPSGYPQHP 913
DK G G P PPP A PPA GYP QGYPP +PP GYPP YP
Sbjct: 8 DKDKGFHGYPPAGYPPP-------GAYPPA-GYPQQGYPPPP-GAYPPAGYPPGAYPP-A 57
Query: 914 PQGYPPS-GYPQNPPA--YPQYPPGP---AYPPQAYP 1006
P GYPP+ GY PPA Y YPP P YPP YP
Sbjct: 58 PGGYPPAPGYGGYPPAPGYGGYPPAPGHGGYPPAGYP 94
>TAIR9_protein||AT4G34150.1 | Symbols: | C2 domain-containing protein |
chr4:16355035-16356955 FORWARD
Length = 248
Score = 67 bits (162), Expect = 2e-011
Identities = 46/95 (48%), Positives = 50/95 (52%), Gaps = 13/95 (13%)
Frame = +2
Query: 752 FGPQPMAVPPPQV--MSRIDQATPPAIGYPPQGYPPSGYP---QHP-PQGYPP-SGYPQH 910
+G P A P V S A+P + P G PS YP Q+P P GYPP SGYP
Sbjct: 136 YGSAPSAPYAPHVPQYSAPPSASPYSTAPPYSG--PSLYPQVQQYPQPSGYPPASGYPPQ 193
Query: 911 PPQGYPP---SGYPQNPPAYPQYPPGPAYPPQAYP 1006
P YPP SGYP P AYP PP AYPPQ YP
Sbjct: 194 -PSAYPPPSTSGYPPIPSAYPPPPPSSAYPPQPYP 227
>TAIR9_protein||AT5G59170.1 | Symbols: | proline-rich family protein |
chr5:23882045-23882911 FORWARD
Length = 289
Score = 56 bits (134), Expect = 4e-008
Identities = 33/89 (37%), Positives = 41/89 (46%), Gaps = 4/89 (4%)
Frame = +2
Query: 749 KFGPQPMAVPPPQVMSRIDQATPPAIGYPPQ-GYPPSGYPQHPPQGYPPSGYPQHPPQGY 925
K+ P PP + + PP YPPQ YPP PP+ YPP PP+ Y
Sbjct: 139 KYPPPEQYPPPIKKYPPPEHYPPPIKKYPPQEQYPPPIKKYPPPEKYPPPIKKYPPPEQY 198
Query: 926 PPSGYPQNPPAYPQYPPGPAYPP--QAYP 1006
PP + PP +YPP YPP + YP
Sbjct: 199 PPP-IKKYPPPIKKYPPPEEYPPPIKTYP 226
Database: TAIR9 protein
Posted date: Wed Jul 08 15:16:08 2009
Number of letters in database: 13,468,323
Number of sequences in database: 33,410
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,346,187,961
Number of Sequences: 33410
Number of Extensions: 5346187961
Number of Successful Extensions: 195116757
Number of sequences better than 0.0: 0
|