Library    |     Search    |     Batch query    |     SNP    |     SSR  

TAIR blast output of UN15847


BLASTX 7.6.2

Query= UN15847 /QuerySize=1476
        (1475 letters)

Database: TAIR9 protein;
          33,410 sequences; 13,468,323 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

TAIR9_protein||AT2G33490.1 | Symbols:  | hydroxyproline-rich gly...    562   3e-160
TAIR9_protein||AT3G26910.1 | Symbols:  | hydroxyproline-rich gly...    114   2e-025
TAIR9_protein||AT3G26910.2 | Symbols:  | hydroxyproline-rich gly...    114   2e-025
TAIR9_protein||AT5G41100.1 | Symbols:  | FUNCTIONS IN: molecular...     84   2e-016
TAIR9_protein||AT5G41100.2 | Symbols:  | FUNCTIONS IN: molecular...     84   2e-016
TAIR9_protein||AT1G44191.1 | Symbols:  | Encodes a ECA1 gametoge...     52   9e-007
TAIR9_protein||AT2G28440.1 | Symbols:  | proline-rich family pro...     49   6e-006

>TAIR9_protein||AT2G33490.1 | Symbols:  | hydroxyproline-rich glycoprotein
        family protein | chr2:14183552-14187666 FORWARD

          Length = 624

 Score =  562 bits (1446), Expect = 3e-160
 Identities = 289/376 (76%), Positives = 319/376 (84%), Gaps = 6/376 (1%)
 Frame = +3

Query:  111 KNNDEDDSEVNDDGELSFEYRVNDKDQNADSSPSASSQLGQSDITFPLVAGVKTGQENEE 290
            +NN+ D SEV+DDGELSFEYRVNDKDQ+ADSS   SS+LG SDITFP + G  T QENEE
Sbjct:  253 ENNENDGSEVHDDGELSFEYRVNDKDQDADSSAGGSSELGNSDITFPQIGGPYTAQENEE 312

Query:  291 ANYGRSRSYRRDVRIESQSAPLFAENRTTPPSEKLLRMRSSVTRKFSTYALPTPVESAKS 470
             NY +S S+RRDVR  SQSAPLF ENRTTPPSEKLLRMRS++TRKF+TYALPTPVE+ +S
Sbjct:  313 GNYRKSHSFRRDVRAVSQSAPLFPENRTTPPSEKLLRMRSTLTRKFNTYALPTPVETTRS 372

Query:  471 PSSVTSLGNNKSMASSNPAKPIAKNIWYSSPLETRGPAKVSSRPMTCLKEQVLRESNKNT 650
            PSS TS G +K++ SSNP K I K IWYSSPLETRGPAKVSSR M  LKEQVLRESNKNT
Sbjct:  373 PSSTTSPG-HKNVGSSNPTKAITKQIWYSSPLETRGPAKVSSRSMVALKEQVLRESNKNT 431

Query:  651 SSRLPRPSTDGLLYSRIGSLKRRSFSGPITSKPLPNKPLSSTPRLYSGPIPRSPVSKLPK 830
             SRLP P  DGLL+SR+G+LKRRSFSGP+TSKPLPNKPLS+T  LYSGPIPR+PVSKLPK
Sbjct:  432 -SRLPPPLADGLLFSRLGTLKRRSFSGPLTSKPLPNKPLSTTSHLYSGPIPRNPVSKLPK 490

Query:  831 VSTSSPTASPTFVSTPKISELHELPRPPPPPPSSSAKSSRAFGYSAPLVSKSQLLSKPII 1010
            VS SSPTASPTFVSTPKISELHELPR   PPP SS KSSR  GYSAPLVS+SQLLSKP+I
Sbjct:  491 VS-SSPTASPTFVSTPKISELHELPR---PPPRSSTKSSRELGYSAPLVSRSQLLSKPLI 546

Query: 1011 STPPSPLPIPPAITRSFSIPTGNLRGTELDMSKMSLGANKLSTASPPLTPMSLVHPPPSA 1190
            +   SPLPIPPAITRSFSIPT NLR ++LDMSK SLG  KL T SPPLTPMSL+HPPP A
Sbjct:  547 TNSASPLPIPPAITRSFSIPTSNLRASDLDMSKTSLGTKKLGTPSPPLTPMSLIHPPPQA 606

Query: 1191 ITEHAEHLAMSKQERR 1238
            + E A+HL MSKQERR
Sbjct:  607 LPERADHLMMSKQERR 622


 Score =  69 bits (167), Expect = 7e-012
 Identities = 31/35 (88%), Positives = 34/35 (97%)
 Frame = +2

Query:   2 FKKALNSLEEVEPHVKTVTESQHIDYHFSGLEDDD 106
           FKKAL+SLEEV+PHV+ VTESQHIDYHFSGLEDDD
Sbjct: 213 FKKALSSLEEVDPHVQMVTESQHIDYHFSGLEDDD 247

>TAIR9_protein||AT3G26910.1 | Symbols:  | hydroxyproline-rich glycoprotein
        family protein | chr3:9915338-9918511 REVERSE

          Length = 609

 Score =  114 bits (284), Expect = 2e-025
 Identities = 81/194 (41%), Positives = 108/194 (55%), Gaps = 23/194 (11%)
 Frame = +3

Query:  657 RLPRPSTDGLLYSRIGSLKRRSFSGPITSKPLPNKPLSSTPRLYSG---PIPRSPVSKLP 827
            RLPRPST    + +  +  R +FSGP+  +P   KP++     YSG   P+P  PV +  
Sbjct:  410 RLPRPSTTDTHHHQQQAAGRHAFSGPL--RPSSTKPITMADS-YSGAFCPLPTPPVLQSH 466

Query:  828 KVSTS----SPTASPTFVSTPKISELHELPRPPP--PPPSSSAKSSRAFGYSAPLVSKSQ 989
              S+S    SPTASP   S+P+++ELHELPRPP    PP   AKS    G+SAPL + +Q
Sbjct:  467 PHSSSSPRVSPTASPPPASSPRLNELHELPRPPGHFAPPPRRAKSPGLVGHSAPLTAWNQ 526

Query:  990 LLSKPIISTP------PSPLPIPP-AITRSFSIPTGNLRGTELDMSKMSLGANKLSTASP 1148
              S   ++ P       SPLP+PP  + RS+SIP+ N R     +S+  +       ASP
Sbjct:  527 ERSTVTVAVPSATNIVASPLPVPPLVVPRSYSIPSRNQR----VVSQRLVERRDDIVASP 582

Query: 1149 PLTPMSLVHPPPSA 1190
            PLTPMSL  P P A
Sbjct:  583 PLTPMSLSRPLPQA 596


 Score =  59 bits (142), Expect = 5e-009
 Identities = 47/154 (30%), Positives = 77/154 (50%), Gaps = 13/154 (8%)
 Frame = +3

Query:  96 KMTMKKNNDEDDSEVNDDGELSFEYRVNDKDQNADS-SPSASSQLGQSDITFPLVAGVKT 272
           +M   +++D+D   +N +GELSF+YR N++   A S S   ++++  +D++FP  +  + 
Sbjct: 246 EMEASEDDDDDGRYMNREGELSFDYRTNEQKVEASSLSTPWATKMDDTDLSFPRPSTTRP 305

Query: 273 GQENEEANYGRSRSYRRDVRIESQSAPLFAENRTTPPSEKLLRMRSSVTRKFSTYALPTP 452
              N +       S  RD  + S SAPLF E +    SE+L +   S    F+ Y LPTP
Sbjct: 306 AAVNADHREEYPVS-TRDKYLSSHSAPLFPEKK-PDVSERLRQANPS----FNAYVLPTP 359

Query: 453 VESAKSPSSVTSLGNNKSMAS------SNPAKPI 536
            +S  S     +L    +  S      S+P +PI
Sbjct: 360 NDSRYSKPVSQALNPRPTNHSAGNIWHSSPLEPI 393

>TAIR9_protein||AT3G26910.2 | Symbols:  | hydroxyproline-rich glycoprotein
        family protein | chr3:9915304-9918511 REVERSE

          Length = 615

 Score =  114 bits (284), Expect = 2e-025
 Identities = 81/194 (41%), Positives = 108/194 (55%), Gaps = 23/194 (11%)
 Frame = +3

Query:  657 RLPRPSTDGLLYSRIGSLKRRSFSGPITSKPLPNKPLSSTPRLYSG---PIPRSPVSKLP 827
            RLPRPST    + +  +  R +FSGP+  +P   KP++     YSG   P+P  PV +  
Sbjct:  410 RLPRPSTTDTHHHQQQAAGRHAFSGPL--RPSSTKPITMADS-YSGAFCPLPTPPVLQSH 466

Query:  828 KVSTS----SPTASPTFVSTPKISELHELPRPPP--PPPSSSAKSSRAFGYSAPLVSKSQ 989
              S+S    SPTASP   S+P+++ELHELPRPP    PP   AKS    G+SAPL + +Q
Sbjct:  467 PHSSSSPRVSPTASPPPASSPRLNELHELPRPPGHFAPPPRRAKSPGLVGHSAPLTAWNQ 526

Query:  990 LLSKPIISTP------PSPLPIPP-AITRSFSIPTGNLRGTELDMSKMSLGANKLSTASP 1148
              S   ++ P       SPLP+PP  + RS+SIP+ N R     +S+  +       ASP
Sbjct:  527 ERSTVTVAVPSATNIVASPLPVPPLVVPRSYSIPSRNQR----VVSQRLVERRDDIVASP 582

Query: 1149 PLTPMSLVHPPPSA 1190
            PLTPMSL  P P A
Sbjct:  583 PLTPMSLSRPLPQA 596


 Score =  59 bits (142), Expect = 5e-009
 Identities = 47/154 (30%), Positives = 77/154 (50%), Gaps = 13/154 (8%)
 Frame = +3

Query:  96 KMTMKKNNDEDDSEVNDDGELSFEYRVNDKDQNADS-SPSASSQLGQSDITFPLVAGVKT 272
           +M   +++D+D   +N +GELSF+YR N++   A S S   ++++  +D++FP  +  + 
Sbjct: 246 EMEASEDDDDDGRYMNREGELSFDYRTNEQKVEASSLSTPWATKMDDTDLSFPRPSTTRP 305

Query: 273 GQENEEANYGRSRSYRRDVRIESQSAPLFAENRTTPPSEKLLRMRSSVTRKFSTYALPTP 452
              N +       S  RD  + S SAPLF E +    SE+L +   S    F+ Y LPTP
Sbjct: 306 AAVNADHREEYPVS-TRDKYLSSHSAPLFPEKK-PDVSERLRQANPS----FNAYVLPTP 359

Query: 453 VESAKSPSSVTSLGNNKSMAS------SNPAKPI 536
            +S  S     +L    +  S      S+P +PI
Sbjct: 360 NDSRYSKPVSQALNPRPTNHSAGNIWHSSPLEPI 393

>TAIR9_protein||AT5G41100.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma
        membrane; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13
        growth stages; BEST Arabidopsis thaliana protein match is:
        hydroxyproline-rich glycoprotein family protein (TAIR:AT3G26910.2); Has
        1264 Blast hits to 964 proteins in 165 species: Archae - 2; Bacteria -
        75; Metazoa - 445; Fungi - 228; Plants - 134; Viruses - 35; Other
        Eukaryotes - 345 (source: NCBI BLink). | chr5:16447429-16450610
        FORWARD

          Length = 587

 Score =  84 bits (206), Expect = 2e-016
 Identities = 60/146 (41%), Positives = 82/146 (56%), Gaps = 15/146 (10%)
 Frame = +3

Query:  762 PLSSTPRLYSGPIPRSPVSKLPKVSTSSPTASPTFVSTPKISELHELPRPPPP-PPSSSA 938
            PL  +      P+     S  P++   SPTASP   S+P+I+ELHELPRPP    P   +
Sbjct:  418 PLKPSSTRLPVPVAVQAQSSSPRI---SPTASPPLASSPRINELHELPRPPGQFAPPRRS 474

Query:  939 KSSRAFGYSAPLVSKSQLLSKPIIST--PPSPLPIPP-AITRSFSIPTGNLRGTELDMSK 1109
            KS    G+SAPL + +Q  S  ++ST    SPLP+PP  + RS+SIP+ N R     M++
Sbjct:  475 KSPGLVGHSAPLTAWNQERSNVVVSTNIVASPLPVPPLVVPRSYSIPSRNQRA----MAQ 530

Query: 1110 MSL-GANKLSTASP---PLTPMSLVH 1175
              L   N+   ASP   PLTP SL++
Sbjct:  531 QPLPERNQNRVASPPPLPLTPASLMN 556


 Score =  76 bits (186), Expect = 4e-014
 Identities = 50/152 (32%), Positives = 78/152 (51%), Gaps = 12/152 (7%)
 Frame = +3

Query:  96 KMTMKKNNDEDDSEVNDDGELSFEYRVNDKDQNADSSPSASSQLGQSDITF--PLVAGVK 269
           +M   ++ND+DD  VN DGELSF+Y  +++     S+P  S ++  +D++F  P  AG  
Sbjct: 248 EMDCSEDNDDDDRLVNRDGELSFDYITSEQRVEVISTPHGSMKMDDTDLSFQRPSPAGSA 307

Query: 270 TGQENEEANYGRSRSYRRDVRIESQSAPLFAENRTTPPSEKLLRMRSSVTRKFSTYALPT 449
           T   +    +  S    RD R  S SAPLF + +       + +M  S     + Y LPT
Sbjct: 308 TVNADPREEHSVS---NRDRRTSSHSAPLFPDKKADLADRSMRQMTPSA----NAYILPT 360

Query: 450 PVESAKSP---SSVTSLGNNKSMASSNPAKPI 536
           PV+S  SP     VT   ++ ++  S+P +PI
Sbjct: 361 PVDSKSSPIFTKPVTQTNHSANLWHSSPLEPI 392

>TAIR9_protein||AT5G41100.2 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma
        membrane; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13
        growth stages; BEST Arabidopsis thaliana protein match is:
        hydroxyproline-rich glycoprotein family protein (TAIR:AT3G26910.2); Has
        1257 Blast hits to 957 proteins in 161 species: Archae - 2; Bacteria -
        67; Metazoa - 448; Fungi - 227; Plants - 135; Viruses - 33; Other
        Eukaryotes - 345 (source: NCBI BLink). | chr5:16447429-16450686
        FORWARD

          Length = 583

 Score =  84 bits (206), Expect = 2e-016
 Identities = 60/146 (41%), Positives = 82/146 (56%), Gaps = 15/146 (10%)
 Frame = +3

Query:  762 PLSSTPRLYSGPIPRSPVSKLPKVSTSSPTASPTFVSTPKISELHELPRPPPP-PPSSSA 938
            PL  +      P+     S  P++   SPTASP   S+P+I+ELHELPRPP    P   +
Sbjct:  418 PLKPSSTRLPVPVAVQAQSSSPRI---SPTASPPLASSPRINELHELPRPPGQFAPPRRS 474

Query:  939 KSSRAFGYSAPLVSKSQLLSKPIIST--PPSPLPIPP-AITRSFSIPTGNLRGTELDMSK 1109
            KS    G+SAPL + +Q  S  ++ST    SPLP+PP  + RS+SIP+ N R     M++
Sbjct:  475 KSPGLVGHSAPLTAWNQERSNVVVSTNIVASPLPVPPLVVPRSYSIPSRNQRA----MAQ 530

Query: 1110 MSL-GANKLSTASP---PLTPMSLVH 1175
              L   N+   ASP   PLTP SL++
Sbjct:  531 QPLPERNQNRVASPPPLPLTPASLMN 556


 Score =  76 bits (186), Expect = 4e-014
 Identities = 50/152 (32%), Positives = 78/152 (51%), Gaps = 12/152 (7%)
 Frame = +3

Query:  96 KMTMKKNNDEDDSEVNDDGELSFEYRVNDKDQNADSSPSASSQLGQSDITF--PLVAGVK 269
           +M   ++ND+DD  VN DGELSF+Y  +++     S+P  S ++  +D++F  P  AG  
Sbjct: 248 EMDCSEDNDDDDRLVNRDGELSFDYITSEQRVEVISTPHGSMKMDDTDLSFQRPSPAGSA 307

Query: 270 TGQENEEANYGRSRSYRRDVRIESQSAPLFAENRTTPPSEKLLRMRSSVTRKFSTYALPT 449
           T   +    +  S    RD R  S SAPLF + +       + +M  S     + Y LPT
Sbjct: 308 TVNADPREEHSVS---NRDRRTSSHSAPLFPDKKADLADRSMRQMTPSA----NAYILPT 360

Query: 450 PVESAKSP---SSVTSLGNNKSMASSNPAKPI 536
           PV+S  SP     VT   ++ ++  S+P +PI
Sbjct: 361 PVDSKSSPIFTKPVTQTNHSANLWHSSPLEPI 392

>TAIR9_protein||AT1G44191.1 | Symbols:  | Encodes a ECA1 gametogenesis related
        family protein | chr1:16813654-16814733 REVERSE

          Length = 360

 Score =  52 bits (123), Expect = 9e-007
 Identities = 51/174 (29%), Positives = 61/174 (35%), Gaps = 7/174 (4%)
 Frame = +3

Query:  663 PRPSTDGLLYSRIGSLKRRSFSGPITSK-PLPNKPLSSTPRLYSGPIPRSPVSKLPKVST 839
            P+PS+   +  +     + S   P   K P P KP S  P     P P  P    PK ST
Sbjct:  120 PKPSSPPPIPKKSPPPPKPSSPPPTPKKSPPPPKPSSPPPSPKKSPPPPKPSPSPPKPST 179

Query:  840 SSPTASPTFVSTPKISELHELPRPPPPPPSSSAKSSRAFGYSAPLVSKSQLLSKPIISTP 1019
              PT   +  S PK S     P+  PPPP  S    +      P   KS    KP   + 
Sbjct:  180 PPPTPKKSPPSPPKPSSPPPSPKKSPPPPKPSPSPPKP-STPPPTPKKSPPPPKP---SQ 235

Query: 1020 PSPLPIPPAITRSFSIPTGNLRGTELDMSKMSLGANKLSTASPPLTPMSLVHPP 1181
            P P P PP   R  S PT     T               +  PP TP      P
Sbjct:  236 PPPKPSPP--RRKPSPPTPKPSTTPPSPKPSPPRPTPKKSPPPPTTPSPSYQDP 287

>TAIR9_protein||AT2G28440.1 | Symbols:  | proline-rich family protein |
        chr2:12161226-12162032 FORWARD

          Length = 269

 Score =  49 bits (116), Expect = 6e-006
 Identities = 43/130 (33%), Positives = 63/130 (48%), Gaps = 10/130 (7%)
 Frame = +3

Query:  738 TSKPLPNKPL--SSTPRLYS-GPIPRSPVSKLPKVSTSSPTAS--PTFVSTPKISELHEL 902
            +S P  + PL  SS+P + S  P   SP +  P   +SSP A+   +  S+PK   L + 
Sbjct:   84 SSSPEVDSPLAPSSSPEVDSPQPPSSSPEADSPLPPSSSPEANSPQSPASSPKPESLADS 143

Query:  903 PRPPPPPPSSSAKSSRAFGYSAPLVSKSQLLS----KPIISTPPSPLPIPPAITRSFSIP 1070
            P PPPPPP   + SS ++   AP+ + S   S    +P     PSP P  P +  +  I 
Sbjct:  144 PSPPPPPPQPESPSSPSYPEPAPVPAPSDDDSDDDPEPETEYFPSPAP-SPELGMAQDIK 202

Query: 1071 TGNLRGTELD 1100
              +  G EL+
Sbjct:  203 ASDAAGEELN 212

  Database: TAIR9 protein
    Posted date:  Wed Jul 08 15:16:08 2009
  Number of letters in database: 13,468,323
  Number of sequences in database:  33,410

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,278,069,989
Number of Sequences: 33410
Number of Extensions: 8278069989
Number of Successful Extensions: 290448565
Number of sequences better than 0.0: 0