BLASTX 7.6.2
Query= UN25116 /QuerySize=1035
(1034 letters)
Database: TAIR9 protein;
33,410 sequences; 13,468,323 total letters
Score E
Sequences producing significant alignments: (bits) Value
TAIR9_protein||AT4G37900.1 | Symbols: | glycine-rich protein | ... 394 8e-110
TAIR9_protein||AT2G22660.2 | Symbols: | FUNCTIONS IN: molecular... 238 4e-063
TAIR9_protein||AT2G22660.1 | Symbols: | FUNCTIONS IN: molecular... 80 1e-015
TAIR9_protein||AT1G54215.1 | Symbols: | proline-rich family pro... 65 5e-011
TAIR9_protein||AT5G23150.1 | Symbols: HUA2 | HUA2 (ENHANCER OF A... 65 6e-011
TAIR9_protein||AT3G51290.1 | Symbols: | proline-rich family pro... 55 6e-008
TAIR9_protein||AT2G36120.1 | Symbols: DOT1 | DOT1 (DEFECTIVELY O... 55 8e-008
TAIR9_protein||AT4G13340.1 | Symbols: | leucine-rich repeat fam... 55 8e-008
TAIR9_protein||AT3G20470.1 | Symbols: GRP-5, ATGRP-5, GRP5, ATGR... 54 1e-007
TAIR9_protein||AT4G36230.1 | Symbols: | unknown protein | chr4:... 54 2e-007
TAIR9_protein||AT5G46730.1 | Symbols: | glycine-rich protein | ... 52 5e-007
TAIR9_protein||AT5G46730.2 | Symbols: | glycine-rich protein | ... 51 9e-007
TAIR9_protein||AT4G01985.1 | Symbols: | unknown protein | chr4:... 50 3e-006
TAIR9_protein||AT4G38770.1 | Symbols: PRP4, ATPRP4 | PRP4 (PROLI... 49 6e-006
>TAIR9_protein||AT4G37900.1 | Symbols: | glycine-rich protein |
chr4:17821737-17824445 REVERSE
Length = 788
Score = 394 bits (1010), Expect = 8e-110
Identities = 211/302 (69%), Positives = 228/302 (75%), Gaps = 40/302 (13%)
Frame = +2
Query: 20 FTRVVGETETELISLHMRNHNNA------RQVIGVKESGETLVLATYDGCVWSLLEAKWS 181
FTRVV ETETE+I+L MRN N+A RQVIGVKE GET VLA YDG WSLL++KWS
Sbjct: 486 FTRVVDETETEVINLQMRNSNDAAPKGDRRQVIGVKECGETYVLAEYDGTFWSLLDSKWS 545
Query: 182 LKQT--SSLDGPVFEIFGVRMVKVYYGRKLEYETKRCAKLRSEQDFMTAVEFSKQYPYGK 355
LKQT + DGP+FE+ G RMVKVY GRKLEYE K C+KLRSEQDFMTAVEFSKQ+PYGK
Sbjct: 546 LKQTCNPATDGPLFELSGTRMVKVYSGRKLEYEPKHCSKLRSEQDFMTAVEFSKQHPYGK 605
Query: 356 AVGLLDLKFGSFEANERWLVLPGLVSAFILNDLLKKEGFCGAAKDIVKGNGIT------- 514
AVGLLDLKFGS EANE+WLVLPG+VS+FIL+DLLKKEGF AAKD VK NGIT
Sbjct: 606 AVGLLDLKFGSIEANEKWLVLPGMVSSFILSDLLKKEGFSAAAKDTVKANGITEESTEID 665
Query: 515 ----VKLEEETMMHVN-----------INGGARCLSKELNSGNMVEEEGGHCGGCGGCGG 649
KLEEETMM V+ INGGARC SKEL SGNM+EEEGGHCGGCGGCGG
Sbjct: 666 VLSQEKLEEETMMDVDTTTPVAVAAEKINGGARCFSKEL-SGNMIEEEGGHCGGCGGCGG 724
Query: 650 CGGGSGCGGGGGRCGNMMTNENVTTGGGSCTGGSTGCGGCGGAGGGCGNMMTNE---NVP 820
CGGG GC GGGGRCG M E GGGSCTGGSTGCG C GGGCGNMM N N P
Sbjct: 725 CGGGGGC-GGGGRCGGMTKIEG--CGGGSCTGGSTGCGNC---GGGCGNMMKNNANGNAP 778
Query: 821 SL 826
S+
Sbjct: 779 SV 780
>TAIR9_protein||AT2G22660.2 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma
membrane; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13
growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function
DUF1399 (InterPro:IPR009836); BEST Arabidopsis thaliana protein match
is: glycine-rich protein (TAIR:AT4G37900.1); Has 14843 Blast hits to
4550 proteins in 469 species: Archae - 40; Bacteria - 5555; Metazoa -
4422; Fungi - 897; Plants - 1977; Viruses - 191; Other Eukaryotes -
1761 (source: NCBI BLink). | chr2:9627737-9630840 FORWARD
Length = 820
Score = 238 bits (607), Expect = 4e-063
Identities = 147/308 (47%), Positives = 185/308 (60%), Gaps = 49/308 (15%)
Frame = +2
Query: 23 TRVVGETETELISLHMRN-------HNNARQVIGVKESGETLVLATYDGCVWSLLEAKWS 181
T +V ET+TE+I+L +RN ++ RQV+GV +SGET VLA Y G WSLL++KWS
Sbjct: 498 THIVDETQTEVITLQIRNSADGGILKDDQRQVMGVTDSGETRVLAVYTGSFWSLLDSKWS 557
Query: 182 LKQ--TSSLDGPVFEIFGVRMVKVYYGRKLEYETKRCAKLRSEQDFMTAVEFSKQYPYGK 355
LKQ S+ D P+FEI G R+VK++ GRKL+YE K CA LRS+ DFMT VEFSKQ+PYGK
Sbjct: 558 LKQINASTADNPLFEILGPRVVKIFSGRKLDYEPKHCANLRSDLDFMTLVEFSKQHPYGK 617
Query: 356 AVGLLDLKFGSFEANERWLVLPGLVSAFILNDLLKK---EGFCGAAKDIVKGNGITVKLE 526
VGL+D++FGS EA E WL+LPG+VSAFIL+ +LKK EGF KDI K KL
Sbjct: 618 TVGLVDMRFGSIEAKENWLLLPGIVSAFILHTVLKKGGSEGFNVTTKDI-KEESKQTKLV 676
Query: 527 EETMMHVNINGGARCLSKELNSGNMVEEEGGHC-GGCGG-CG---------GCG------ 655
T +VN N + E + ++G C GGC G CG GCG
Sbjct: 677 AATENNVNANS----TNVETQTAITAPKKGSGCGGGCSGECGNMVKAANASGCGSSCSGE 732
Query: 656 ---------GGSGCGGG-GGRCGNMMTNENVTTG--GGSCTGG-STGC-GGC-GGAGGGC 790
SGCG G G CGNM+ N + G G C ++GC GGC GG GGGC
Sbjct: 733 CGDMVKSAANASGCGSGCSGECGNMVKAANASGGGYGARCKAAKASGCGGGCGGGCGGGC 792
Query: 791 GNMMTNEN 814
G+M+ + N
Sbjct: 793 GDMVKSVN 800
>TAIR9_protein||AT2G22660.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma
membrane; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13
growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function
DUF1399 (InterPro:IPR009836); BEST Arabidopsis thaliana protein match
is: glycine-rich protein (TAIR:AT4G37900.1); Has 257 Blast hits to 239
proteins in 58 species: Archae - 0; Bacteria - 21; Metazoa - 12; Fungi
- 149; Plants - 61; Viruses - 0; Other Eukaryotes - 14 (source: NCBI
BLink). | chr2:9627737-9629957 FORWARD
Length = 587
Score = 80 bits (197), Expect = 1e-015
Identities = 43/82 (52%), Positives = 57/82 (69%), Gaps = 9/82 (10%)
Frame = +2
Query: 23 TRVVGETETELISLHMRN-------HNNARQVIGVKESGETLVLATYDGCVWSLLEAKWS 181
T +V ET+TE+I+L +RN ++ RQV+GV +SGET VLA Y G WSLL++KWS
Sbjct: 498 THIVDETQTEVITLQIRNSADGGILKDDQRQVMGVTDSGETRVLAVYTGSFWSLLDSKWS 557
Query: 182 LKQ--TSSLDGPVFEIFGVRMV 241
LKQ S+ D P+FEI G R+V
Sbjct: 558 LKQINASTADNPLFEILGPRVV 579
>TAIR9_protein||AT1G54215.1 | Symbols: | proline-rich family protein |
chr1:20243118-20243627 FORWARD
Length = 170
Score = 65 bits (158), Expect = 5e-011
Identities = 31/67 (46%), Positives = 34/67 (50%), Gaps = 10/67 (14%)
Frame = -3
Query: 792 PHPPPAPPHPPHPVEPPVQEPPPVVTFSLVIILPHLPPP----------PPHPLPPPHPP 643
P PPP PP PP P PP PPP V S+ +P PPP PP P PPP
Sbjct: 42 PPPPPPPPPPPPPPPPPPPPPPPAVNMSVETGIPPPPPPVTDMIKPLSSPPPPQPPPRSQ 101
Query: 642 HPPQPPQ 622
PP+PPQ
Sbjct: 102 PPPKPPQ 108
>TAIR9_protein||AT5G23150.1 | Symbols: HUA2 | HUA2 (ENHANCER OF AG-4 2);
transcription factor | chr5:7786173-7792080 FORWARD
Length = 1393
Score = 65 bits (157), Expect = 6e-011
Identities = 35/79 (44%), Positives = 39/79 (49%), Gaps = 5/79 (6%)
Frame = -3
Query: 795 LPHPPPAPPHPPHPVEPPVQEPPPVVTFSLVIILPHLPPPPPHPLPPPHPPHPPQPPQ*P 616
LP PP+PP PP P PP PPP + P LPPPP P PPP P P PP P
Sbjct: 1072 LPPLPPSPP-PPSPPLPPSSLPPP----PPAALFPPLPPPPSQPPPPPLSPPPSPPPPPP 1126
Query: 615 PSSSTMFPLLSSLDKHLAP 559
P S ++ LS H P
Sbjct: 1127 PPSQSLTTQLSIASHHQIP 1145
>TAIR9_protein||AT3G51290.1 | Symbols: | proline-rich family protein |
chr3:19039980-19042437 FORWARD
Length = 603
Score = 55 bits (131), Expect = 6e-008
Identities = 30/69 (43%), Positives = 33/69 (47%), Gaps = 8/69 (11%)
Frame = -3
Query: 792 PHPPPAPPHPPHPVEPPVQEPPPVVTFSLVIILPHLPPPPPHPLPPPHPP------HPPQ 631
P P P PP PP P PP+ T++ LPPPPP P PPP P P
Sbjct: 69 PSPSPPPPPPPRPPPPPLSPGSETTTWTTTTTSSVLPPPPPPPPPPPPPSSTWDFWDPFI 128
Query: 630 PPQ*PPSSS 604
PP PPSSS
Sbjct: 129 PP--PPSSS 135
>TAIR9_protein||AT2G36120.1 | Symbols: DOT1 | DOT1 (DEFECTIVELY ORGANIZED
TRIBUTARIES 1) | chr2:15165468-15166235 FORWARD
Length = 256
Score = 55 bits (130), Expect = 8e-008
Identities = 30/61 (49%), Positives = 32/61 (52%), Gaps = 1/61 (1%)
Frame = +2
Query: 614 GGHCGGCGGCGGCG-GGSGCGGGGGRCGNMMTNENVTTGGGSCTGGSTGCGGCGGAGGGC 790
GG GG GG GG G GG G GGGG G + GGG GG+ G GG GG G G
Sbjct: 118 GGEAGGHGGGGGGGAGGGGGGGGGAHGGGYGGGQGAGAGGGYGGGGAGGHGGGGGGGNGG 177
Query: 791 G 793
G
Sbjct: 178 G 178
Score = 50 bits (118), Expect = 2e-006
Identities = 30/58 (51%), Positives = 31/58 (53%), Gaps = 3/58 (5%)
Frame = +2
Query: 614 GGHCGGCGGCGGCGGGSGCGGGGGRCGNMMTNENVTTGGGSCTGGSTGCGGCGGAGGG 787
GG GG GG GG GGG G G GGG G + E GGG GG G G GGAG G
Sbjct: 156 GGGYGG-GGAGGHGGGGGGGNGGGGGGG--SGEGGAHGGGYGAGGGAGEGYGGGAGAG 210
>TAIR9_protein||AT4G13340.1 | Symbols: | leucine-rich repeat family protein /
extensin family protein | chr4:7758610-7760892 FORWARD
Length = 761
Score = 55 bits (130), Expect = 8e-008
Identities = 28/62 (45%), Positives = 30/62 (48%), Gaps = 6/62 (9%)
Frame = -3
Query: 792 PHPPPAPPHPPHPVEPPVQEPPPVVTFSLVIILPHLPPPPPHPL--PPPHPP----HPPQ 631
PH PP P PP P+ P + PPP S P PPPP P PPP PP PP
Sbjct: 572 PHSPPPPHSPPPPIYPYLSPPPPPTPVSSPPPTPVYSPPPPPPCIEPPPPPPCIEYSPPP 631
Query: 630 PP 625
PP
Sbjct: 632 PP 633
>TAIR9_protein||AT3G20470.1 | Symbols: GRP-5, ATGRP-5, GRP5, ATGRP5 | GRP5
(GLYCINE-RICH PROTEIN 5); structural constituent of cell wall |
chr3:7140625-7141149 REVERSE
Length = 175
Score = 54 bits (129), Expect = 1e-007
Identities = 30/62 (48%), Positives = 34/62 (54%), Gaps = 2/62 (3%)
Frame = +2
Query: 614 GGHCGGCGGCGGCGGGSGCGGGGGRCGNMMTNENVTTGGGS--CTGGSTGCGGCGGAGGG 787
GG GG GG GG GGG+G G GGG G + + GGG+ GG G G GG GGG
Sbjct: 69 GGLGGGAGGGGGLGGGAGGGAGGGFGGGAGSGGGLGGGGGAGGGFGGGAGGGSGGGFGGG 128
Query: 788 CG 793
G
Sbjct: 129 AG 130
Score = 53 bits (125), Expect = 3e-007
Identities = 28/56 (50%)
Frame = +2
Query: 626 GGCGGCGGCGGGSGCGGGGGRCGNMMTNENVTTGGGSCTGGSTGCGGCGGAGGGCG 793
GG GG GG GGGSG GGGGG G GGG G G GG G G G G
Sbjct: 45 GGLGGGGGIGGGSGLGGGGGFGGGGGLGGGAGGGGGLGGGAGGGAGGGFGGGAGSG 100
>TAIR9_protein||AT4G36230.1 | Symbols: | unknown protein |
chr4:17145586-17146251 FORWARD
Length = 222
Score = 54 bits (127), Expect = 2e-007
Identities = 31/61 (50%), Positives = 35/61 (57%), Gaps = 3/61 (4%)
Frame = +2
Query: 614 GGHCGGCGGCGGCGGGSGCGGGGGRCGNMMTNENVTTGGGSCTGGSTGCGGCGGAGGGCG 793
GG GG GG GG GGG G GGGGG GN ++ + +GG G GG GG GGG G
Sbjct: 89 GGGGGGGGGGGGGGGGQGSGGGGGE-GNGGNGKDNSHKRNKSSGG--GGGGGGGGGGGSG 145
Query: 794 N 796
N
Sbjct: 146 N 146
>TAIR9_protein||AT5G46730.1 | Symbols: | glycine-rich protein |
chr5:18964030-18964902 FORWARD
Length = 291
Score = 52 bits (123), Expect = 5e-007
Identities = 30/60 (50%), Positives = 31/60 (51%), Gaps = 3/60 (5%)
Frame = +2
Query: 614 GGHCGGCGGCGGCGGGSGCGGGGGRCGNMMTNENVTTGGGSCTGGSTGCGGCGGAGGGCG 793
GG GG GG GG GGG G GGGG G +GGG GG G G GG GGG G
Sbjct: 200 GGSHGGAGGYGG-GGGGGSGGGGAYGGGGAHGGGYGSGGGE--GGGYGGGAAGGYGGGGG 256
Score = 51 bits (120), Expect = 1e-006
Identities = 31/69 (44%), Gaps = 2/69 (2%)
Frame = +2
Query: 593 GNMVEEEG--GHCGGCGGCGGCGGGSGCGGGGGRCGNMMTNENVTTGGGSCTGGSTGCGG 766
GN E G G G GG G GGG G GGGGG G G G GG GG
Sbjct: 146 GNGAGEGGGAGASGYGGGAYGGGGGHGGGGGGGSAGGAHGGSGYGGGEGGGAGGGGSHGG 205
Query: 767 CGGAGGGCG 793
GG GGG G
Sbjct: 206 AGGYGGGGG 214
>TAIR9_protein||AT5G46730.2 | Symbols: | glycine-rich protein |
chr5:18964030-18964902 FORWARD
Length = 247
Score = 51 bits (121), Expect = 9e-007
Identities = 32/69 (46%), Positives = 33/69 (47%), Gaps = 4/69 (5%)
Frame = +2
Query: 593 GNMVEEEG--GHCGGCGGCGGCGGGSGCGGGGGRCGNMMTNENVTTGGGSCTGGSTGCGG 766
GN E G G G GG G GGG G GGGGG G +GGG GG G G
Sbjct: 146 GNGAGEGGGAGASGYGGGAYGGGGGHGGGGGGGSAGGAHGGSGYGSGGGE--GGGYGGGA 203
Query: 767 CGGAGGGCG 793
GG GGG G
Sbjct: 204 AGGYGGGGG 212
>TAIR9_protein||AT4G01985.1 | Symbols: | unknown protein | chr4:866387-868126
REVERSE
Length = 580
Score = 50 bits (117), Expect = 3e-006
Identities = 30/65 (46%), Positives = 32/65 (49%), Gaps = 5/65 (7%)
Frame = +2
Query: 614 GGHCGGC--GGCGGCGGGSGCGGG---GGRCGNMMTNENVTTGGGSCTGGSTGCGGCGGA 778
GG GG G GG GGGS GGG GG G + GG+ G S G GG GGA
Sbjct: 359 GGAVGGAVGGAVGGGGGGSVGGGGRGSGGASGGASGGASGGASGGASGGASGGVGGAGGA 418
Query: 779 GGGCG 793
GG G
Sbjct: 419 GGSVG 423
>TAIR9_protein||AT4G38770.1 | Symbols: PRP4, ATPRP4 | PRP4 (PROLINE-RICH PROTEIN
4) | chr4:18097009-18098448 REVERSE
Length = 449
Score = 49 bits (114), Expect = 6e-006
Identities = 23/61 (37%), Positives = 31/61 (50%)
Frame = -3
Query: 804 VIILPHPPPAPPHPPHPVEPPVQEPPPVVTFSLVIILPHLPPPPPHPLPPPHPPHPPQPP 625
++ P PPP P + P V P PPPV + +++ P PP P PP P PP PP
Sbjct: 371 IVKKPCPPPVPIYKPPVVIPKKPCPPPVPVYKPPVVVIPKKPCPPLPQLPPLPKFPPLPP 430
Query: 624 Q 622
+
Sbjct: 431 K 431
Database: TAIR9 protein
Posted date: Wed Jul 08 15:16:08 2009
Number of letters in database: 13,468,323
Number of sequences in database: 33,410
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,904,828,436
Number of Sequences: 33410
Number of Extensions: 12904828436
Number of Successful Extensions: 408053267
Number of sequences better than 0.0: 0
|