BLASTX 7.6.2
Query= UN15847 /QuerySize=1476
(1475 letters)
Database: TAIR9 protein;
33,410 sequences; 13,468,323 total letters
Score E
Sequences producing significant alignments: (bits) Value
TAIR9_protein||AT2G33490.1 | Symbols: | hydroxyproline-rich gly... 562 3e-160
TAIR9_protein||AT3G26910.1 | Symbols: | hydroxyproline-rich gly... 114 2e-025
TAIR9_protein||AT3G26910.2 | Symbols: | hydroxyproline-rich gly... 114 2e-025
TAIR9_protein||AT5G41100.1 | Symbols: | FUNCTIONS IN: molecular... 84 2e-016
TAIR9_protein||AT5G41100.2 | Symbols: | FUNCTIONS IN: molecular... 84 2e-016
TAIR9_protein||AT1G44191.1 | Symbols: | Encodes a ECA1 gametoge... 52 9e-007
TAIR9_protein||AT2G28440.1 | Symbols: | proline-rich family pro... 49 6e-006
>TAIR9_protein||AT2G33490.1 | Symbols: | hydroxyproline-rich glycoprotein
family protein | chr2:14183552-14187666 FORWARD
Length = 624
Score = 562 bits (1446), Expect = 3e-160
Identities = 289/376 (76%), Positives = 319/376 (84%), Gaps = 6/376 (1%)
Frame = +3
Query: 111 KNNDEDDSEVNDDGELSFEYRVNDKDQNADSSPSASSQLGQSDITFPLVAGVKTGQENEE 290
+NN+ D SEV+DDGELSFEYRVNDKDQ+ADSS SS+LG SDITFP + G T QENEE
Sbjct: 253 ENNENDGSEVHDDGELSFEYRVNDKDQDADSSAGGSSELGNSDITFPQIGGPYTAQENEE 312
Query: 291 ANYGRSRSYRRDVRIESQSAPLFAENRTTPPSEKLLRMRSSVTRKFSTYALPTPVESAKS 470
NY +S S+RRDVR SQSAPLF ENRTTPPSEKLLRMRS++TRKF+TYALPTPVE+ +S
Sbjct: 313 GNYRKSHSFRRDVRAVSQSAPLFPENRTTPPSEKLLRMRSTLTRKFNTYALPTPVETTRS 372
Query: 471 PSSVTSLGNNKSMASSNPAKPIAKNIWYSSPLETRGPAKVSSRPMTCLKEQVLRESNKNT 650
PSS TS G +K++ SSNP K I K IWYSSPLETRGPAKVSSR M LKEQVLRESNKNT
Sbjct: 373 PSSTTSPG-HKNVGSSNPTKAITKQIWYSSPLETRGPAKVSSRSMVALKEQVLRESNKNT 431
Query: 651 SSRLPRPSTDGLLYSRIGSLKRRSFSGPITSKPLPNKPLSSTPRLYSGPIPRSPVSKLPK 830
SRLP P DGLL+SR+G+LKRRSFSGP+TSKPLPNKPLS+T LYSGPIPR+PVSKLPK
Sbjct: 432 -SRLPPPLADGLLFSRLGTLKRRSFSGPLTSKPLPNKPLSTTSHLYSGPIPRNPVSKLPK 490
Query: 831 VSTSSPTASPTFVSTPKISELHELPRPPPPPPSSSAKSSRAFGYSAPLVSKSQLLSKPII 1010
VS SSPTASPTFVSTPKISELHELPR PPP SS KSSR GYSAPLVS+SQLLSKP+I
Sbjct: 491 VS-SSPTASPTFVSTPKISELHELPR---PPPRSSTKSSRELGYSAPLVSRSQLLSKPLI 546
Query: 1011 STPPSPLPIPPAITRSFSIPTGNLRGTELDMSKMSLGANKLSTASPPLTPMSLVHPPPSA 1190
+ SPLPIPPAITRSFSIPT NLR ++LDMSK SLG KL T SPPLTPMSL+HPPP A
Sbjct: 547 TNSASPLPIPPAITRSFSIPTSNLRASDLDMSKTSLGTKKLGTPSPPLTPMSLIHPPPQA 606
Query: 1191 ITEHAEHLAMSKQERR 1238
+ E A+HL MSKQERR
Sbjct: 607 LPERADHLMMSKQERR 622
Score = 69 bits (167), Expect = 7e-012
Identities = 31/35 (88%), Positives = 34/35 (97%)
Frame = +2
Query: 2 FKKALNSLEEVEPHVKTVTESQHIDYHFSGLEDDD 106
FKKAL+SLEEV+PHV+ VTESQHIDYHFSGLEDDD
Sbjct: 213 FKKALSSLEEVDPHVQMVTESQHIDYHFSGLEDDD 247
>TAIR9_protein||AT3G26910.1 | Symbols: | hydroxyproline-rich glycoprotein
family protein | chr3:9915338-9918511 REVERSE
Length = 609
Score = 114 bits (284), Expect = 2e-025
Identities = 81/194 (41%), Positives = 108/194 (55%), Gaps = 23/194 (11%)
Frame = +3
Query: 657 RLPRPSTDGLLYSRIGSLKRRSFSGPITSKPLPNKPLSSTPRLYSG---PIPRSPVSKLP 827
RLPRPST + + + R +FSGP+ +P KP++ YSG P+P PV +
Sbjct: 410 RLPRPSTTDTHHHQQQAAGRHAFSGPL--RPSSTKPITMADS-YSGAFCPLPTPPVLQSH 466
Query: 828 KVSTS----SPTASPTFVSTPKISELHELPRPPP--PPPSSSAKSSRAFGYSAPLVSKSQ 989
S+S SPTASP S+P+++ELHELPRPP PP AKS G+SAPL + +Q
Sbjct: 467 PHSSSSPRVSPTASPPPASSPRLNELHELPRPPGHFAPPPRRAKSPGLVGHSAPLTAWNQ 526
Query: 990 LLSKPIISTP------PSPLPIPP-AITRSFSIPTGNLRGTELDMSKMSLGANKLSTASP 1148
S ++ P SPLP+PP + RS+SIP+ N R +S+ + ASP
Sbjct: 527 ERSTVTVAVPSATNIVASPLPVPPLVVPRSYSIPSRNQR----VVSQRLVERRDDIVASP 582
Query: 1149 PLTPMSLVHPPPSA 1190
PLTPMSL P P A
Sbjct: 583 PLTPMSLSRPLPQA 596
Score = 59 bits (142), Expect = 5e-009
Identities = 47/154 (30%), Positives = 77/154 (50%), Gaps = 13/154 (8%)
Frame = +3
Query: 96 KMTMKKNNDEDDSEVNDDGELSFEYRVNDKDQNADS-SPSASSQLGQSDITFPLVAGVKT 272
+M +++D+D +N +GELSF+YR N++ A S S ++++ +D++FP + +
Sbjct: 246 EMEASEDDDDDGRYMNREGELSFDYRTNEQKVEASSLSTPWATKMDDTDLSFPRPSTTRP 305
Query: 273 GQENEEANYGRSRSYRRDVRIESQSAPLFAENRTTPPSEKLLRMRSSVTRKFSTYALPTP 452
N + S RD + S SAPLF E + SE+L + S F+ Y LPTP
Sbjct: 306 AAVNADHREEYPVS-TRDKYLSSHSAPLFPEKK-PDVSERLRQANPS----FNAYVLPTP 359
Query: 453 VESAKSPSSVTSLGNNKSMAS------SNPAKPI 536
+S S +L + S S+P +PI
Sbjct: 360 NDSRYSKPVSQALNPRPTNHSAGNIWHSSPLEPI 393
>TAIR9_protein||AT3G26910.2 | Symbols: | hydroxyproline-rich glycoprotein
family protein | chr3:9915304-9918511 REVERSE
Length = 615
Score = 114 bits (284), Expect = 2e-025
Identities = 81/194 (41%), Positives = 108/194 (55%), Gaps = 23/194 (11%)
Frame = +3
Query: 657 RLPRPSTDGLLYSRIGSLKRRSFSGPITSKPLPNKPLSSTPRLYSG---PIPRSPVSKLP 827
RLPRPST + + + R +FSGP+ +P KP++ YSG P+P PV +
Sbjct: 410 RLPRPSTTDTHHHQQQAAGRHAFSGPL--RPSSTKPITMADS-YSGAFCPLPTPPVLQSH 466
Query: 828 KVSTS----SPTASPTFVSTPKISELHELPRPPP--PPPSSSAKSSRAFGYSAPLVSKSQ 989
S+S SPTASP S+P+++ELHELPRPP PP AKS G+SAPL + +Q
Sbjct: 467 PHSSSSPRVSPTASPPPASSPRLNELHELPRPPGHFAPPPRRAKSPGLVGHSAPLTAWNQ 526
Query: 990 LLSKPIISTP------PSPLPIPP-AITRSFSIPTGNLRGTELDMSKMSLGANKLSTASP 1148
S ++ P SPLP+PP + RS+SIP+ N R +S+ + ASP
Sbjct: 527 ERSTVTVAVPSATNIVASPLPVPPLVVPRSYSIPSRNQR----VVSQRLVERRDDIVASP 582
Query: 1149 PLTPMSLVHPPPSA 1190
PLTPMSL P P A
Sbjct: 583 PLTPMSLSRPLPQA 596
Score = 59 bits (142), Expect = 5e-009
Identities = 47/154 (30%), Positives = 77/154 (50%), Gaps = 13/154 (8%)
Frame = +3
Query: 96 KMTMKKNNDEDDSEVNDDGELSFEYRVNDKDQNADS-SPSASSQLGQSDITFPLVAGVKT 272
+M +++D+D +N +GELSF+YR N++ A S S ++++ +D++FP + +
Sbjct: 246 EMEASEDDDDDGRYMNREGELSFDYRTNEQKVEASSLSTPWATKMDDTDLSFPRPSTTRP 305
Query: 273 GQENEEANYGRSRSYRRDVRIESQSAPLFAENRTTPPSEKLLRMRSSVTRKFSTYALPTP 452
N + S RD + S SAPLF E + SE+L + S F+ Y LPTP
Sbjct: 306 AAVNADHREEYPVS-TRDKYLSSHSAPLFPEKK-PDVSERLRQANPS----FNAYVLPTP 359
Query: 453 VESAKSPSSVTSLGNNKSMAS------SNPAKPI 536
+S S +L + S S+P +PI
Sbjct: 360 NDSRYSKPVSQALNPRPTNHSAGNIWHSSPLEPI 393
>TAIR9_protein||AT5G41100.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma
membrane; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match is:
hydroxyproline-rich glycoprotein family protein (TAIR:AT3G26910.2); Has
1264 Blast hits to 964 proteins in 165 species: Archae - 2; Bacteria -
75; Metazoa - 445; Fungi - 228; Plants - 134; Viruses - 35; Other
Eukaryotes - 345 (source: NCBI BLink). | chr5:16447429-16450610
FORWARD
Length = 587
Score = 84 bits (206), Expect = 2e-016
Identities = 60/146 (41%), Positives = 82/146 (56%), Gaps = 15/146 (10%)
Frame = +3
Query: 762 PLSSTPRLYSGPIPRSPVSKLPKVSTSSPTASPTFVSTPKISELHELPRPPPP-PPSSSA 938
PL + P+ S P++ SPTASP S+P+I+ELHELPRPP P +
Sbjct: 418 PLKPSSTRLPVPVAVQAQSSSPRI---SPTASPPLASSPRINELHELPRPPGQFAPPRRS 474
Query: 939 KSSRAFGYSAPLVSKSQLLSKPIIST--PPSPLPIPP-AITRSFSIPTGNLRGTELDMSK 1109
KS G+SAPL + +Q S ++ST SPLP+PP + RS+SIP+ N R M++
Sbjct: 475 KSPGLVGHSAPLTAWNQERSNVVVSTNIVASPLPVPPLVVPRSYSIPSRNQRA----MAQ 530
Query: 1110 MSL-GANKLSTASP---PLTPMSLVH 1175
L N+ ASP PLTP SL++
Sbjct: 531 QPLPERNQNRVASPPPLPLTPASLMN 556
Score = 76 bits (186), Expect = 4e-014
Identities = 50/152 (32%), Positives = 78/152 (51%), Gaps = 12/152 (7%)
Frame = +3
Query: 96 KMTMKKNNDEDDSEVNDDGELSFEYRVNDKDQNADSSPSASSQLGQSDITF--PLVAGVK 269
+M ++ND+DD VN DGELSF+Y +++ S+P S ++ +D++F P AG
Sbjct: 248 EMDCSEDNDDDDRLVNRDGELSFDYITSEQRVEVISTPHGSMKMDDTDLSFQRPSPAGSA 307
Query: 270 TGQENEEANYGRSRSYRRDVRIESQSAPLFAENRTTPPSEKLLRMRSSVTRKFSTYALPT 449
T + + S RD R S SAPLF + + + +M S + Y LPT
Sbjct: 308 TVNADPREEHSVS---NRDRRTSSHSAPLFPDKKADLADRSMRQMTPSA----NAYILPT 360
Query: 450 PVESAKSP---SSVTSLGNNKSMASSNPAKPI 536
PV+S SP VT ++ ++ S+P +PI
Sbjct: 361 PVDSKSSPIFTKPVTQTNHSANLWHSSPLEPI 392
>TAIR9_protein||AT5G41100.2 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma
membrane; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match is:
hydroxyproline-rich glycoprotein family protein (TAIR:AT3G26910.2); Has
1257 Blast hits to 957 proteins in 161 species: Archae - 2; Bacteria -
67; Metazoa - 448; Fungi - 227; Plants - 135; Viruses - 33; Other
Eukaryotes - 345 (source: NCBI BLink). | chr5:16447429-16450686
FORWARD
Length = 583
Score = 84 bits (206), Expect = 2e-016
Identities = 60/146 (41%), Positives = 82/146 (56%), Gaps = 15/146 (10%)
Frame = +3
Query: 762 PLSSTPRLYSGPIPRSPVSKLPKVSTSSPTASPTFVSTPKISELHELPRPPPP-PPSSSA 938
PL + P+ S P++ SPTASP S+P+I+ELHELPRPP P +
Sbjct: 418 PLKPSSTRLPVPVAVQAQSSSPRI---SPTASPPLASSPRINELHELPRPPGQFAPPRRS 474
Query: 939 KSSRAFGYSAPLVSKSQLLSKPIIST--PPSPLPIPP-AITRSFSIPTGNLRGTELDMSK 1109
KS G+SAPL + +Q S ++ST SPLP+PP + RS+SIP+ N R M++
Sbjct: 475 KSPGLVGHSAPLTAWNQERSNVVVSTNIVASPLPVPPLVVPRSYSIPSRNQRA----MAQ 530
Query: 1110 MSL-GANKLSTASP---PLTPMSLVH 1175
L N+ ASP PLTP SL++
Sbjct: 531 QPLPERNQNRVASPPPLPLTPASLMN 556
Score = 76 bits (186), Expect = 4e-014
Identities = 50/152 (32%), Positives = 78/152 (51%), Gaps = 12/152 (7%)
Frame = +3
Query: 96 KMTMKKNNDEDDSEVNDDGELSFEYRVNDKDQNADSSPSASSQLGQSDITF--PLVAGVK 269
+M ++ND+DD VN DGELSF+Y +++ S+P S ++ +D++F P AG
Sbjct: 248 EMDCSEDNDDDDRLVNRDGELSFDYITSEQRVEVISTPHGSMKMDDTDLSFQRPSPAGSA 307
Query: 270 TGQENEEANYGRSRSYRRDVRIESQSAPLFAENRTTPPSEKLLRMRSSVTRKFSTYALPT 449
T + + S RD R S SAPLF + + + +M S + Y LPT
Sbjct: 308 TVNADPREEHSVS---NRDRRTSSHSAPLFPDKKADLADRSMRQMTPSA----NAYILPT 360
Query: 450 PVESAKSP---SSVTSLGNNKSMASSNPAKPI 536
PV+S SP VT ++ ++ S+P +PI
Sbjct: 361 PVDSKSSPIFTKPVTQTNHSANLWHSSPLEPI 392
>TAIR9_protein||AT1G44191.1 | Symbols: | Encodes a ECA1 gametogenesis related
family protein | chr1:16813654-16814733 REVERSE
Length = 360
Score = 52 bits (123), Expect = 9e-007
Identities = 51/174 (29%), Positives = 61/174 (35%), Gaps = 7/174 (4%)
Frame = +3
Query: 663 PRPSTDGLLYSRIGSLKRRSFSGPITSK-PLPNKPLSSTPRLYSGPIPRSPVSKLPKVST 839
P+PS+ + + + S P K P P KP S P P P P PK ST
Sbjct: 120 PKPSSPPPIPKKSPPPPKPSSPPPTPKKSPPPPKPSSPPPSPKKSPPPPKPSPSPPKPST 179
Query: 840 SSPTASPTFVSTPKISELHELPRPPPPPPSSSAKSSRAFGYSAPLVSKSQLLSKPIISTP 1019
PT + S PK S P+ PPPP S + P KS KP +
Sbjct: 180 PPPTPKKSPPSPPKPSSPPPSPKKSPPPPKPSPSPPKP-STPPPTPKKSPPPPKP---SQ 235
Query: 1020 PSPLPIPPAITRSFSIPTGNLRGTELDMSKMSLGANKLSTASPPLTPMSLVHPP 1181
P P P PP R S PT T + PP TP P
Sbjct: 236 PPPKPSPP--RRKPSPPTPKPSTTPPSPKPSPPRPTPKKSPPPPTTPSPSYQDP 287
>TAIR9_protein||AT2G28440.1 | Symbols: | proline-rich family protein |
chr2:12161226-12162032 FORWARD
Length = 269
Score = 49 bits (116), Expect = 6e-006
Identities = 43/130 (33%), Positives = 63/130 (48%), Gaps = 10/130 (7%)
Frame = +3
Query: 738 TSKPLPNKPL--SSTPRLYS-GPIPRSPVSKLPKVSTSSPTAS--PTFVSTPKISELHEL 902
+S P + PL SS+P + S P SP + P +SSP A+ + S+PK L +
Sbjct: 84 SSSPEVDSPLAPSSSPEVDSPQPPSSSPEADSPLPPSSSPEANSPQSPASSPKPESLADS 143
Query: 903 PRPPPPPPSSSAKSSRAFGYSAPLVSKSQLLS----KPIISTPPSPLPIPPAITRSFSIP 1070
P PPPPPP + SS ++ AP+ + S S +P PSP P P + + I
Sbjct: 144 PSPPPPPPQPESPSSPSYPEPAPVPAPSDDDSDDDPEPETEYFPSPAP-SPELGMAQDIK 202
Query: 1071 TGNLRGTELD 1100
+ G EL+
Sbjct: 203 ASDAAGEELN 212
Database: TAIR9 protein
Posted date: Wed Jul 08 15:16:08 2009
Number of letters in database: 13,468,323
Number of sequences in database: 33,410
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,278,069,989
Number of Sequences: 33410
Number of Extensions: 8278069989
Number of Successful Extensions: 290448565
Number of sequences better than 0.0: 0
|