Library    |     Search    |     Batch query    |     SNP    |     SSR  

TAIR blast output of UN08231


BLASTX 7.6.2

Query= UN08231 /QuerySize=1456
        (1455 letters)

Database: TAIR9 protein;
          33,410 sequences; 13,468,323 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

TAIR9_protein||AT1G63720.1 | Symbols:  | EXPRESSED IN: 21 plant ...    485   5e-137
TAIR9_protein||AT5G52430.1 | Symbols:  | hydroxyproline-rich gly...    205   1e-052
TAIR9_protein||AT4G25620.1 | Symbols:  | hydroxyproline-rich gly...    115   1e-025
TAIR9_protein||AT1G76660.1 | Symbols:  | FUNCTIONS IN: molecular...     99   8e-021

>TAIR9_protein||AT1G63720.1 | Symbols:  | EXPRESSED IN: 21 plant structures;
        EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein
        match is: hydroxyproline-rich glycoprotein family protein
        (TAIR:AT5G52430.1); Has 406 Blast hits to 313 proteins in 78 species:
        Archae - 0; Bacteria - 2; Metazoa - 123; Fungi - 81; Plants - 111;
        Viruses - 14; Other Eukaryotes - 75 (source: NCBI BLink). |
        chr1:23636122-23637348 REVERSE

          Length = 359

 Score =  485 bits (1246), Expect = 5e-137
 Identities = 272/381 (71%), Positives = 296/381 (77%), Gaps = 39/381 (10%)
 Frame = -1

Query: 1335 MRGGASGNNVLETINAAATAFASSDDRVHHQPSPIHKKRRWWNRFI---CFRPSTQRKKR 1165
            MR GA+GNNV +TINAAA+A ASSDDR+ HQ SPIHKKR+WWNR+    CF  S QR KR
Sbjct:    1 MRSGANGNNVFDTINAAASAIASSDDRL-HQSSPIHKKRKWWNRWSLLKCFGSSRQR-KR 58

Query: 1164 IGKAALAPEPVPTDS----TSNSGYRSVMTALPFIAPPSSPASFFQSEPPSATQSPVGIL 997
            IG + L PEPV   S    TSNSGYRSV+T LPFIAPPSSPASFFQSEPPSATQSPVGIL
Sbjct:   59 IGNSVLVPEPVSMSSSNSTTSNSGYRSVITTLPFIAPPSSPASFFQSEPPSATQSPVGIL 118

Query:  996 SFSPLPSNNHNNNNNSEERPSIFAIGPYAHEPQLVSPPVFSTYTTEPSSAPVTPPLDE-S 820
            SFSPLP NN         RPSIFAIGPYAHE QLVSPPVFSTYTTEPSSAP+TPPLD+ S
Sbjct:  119 SFSPLPCNN---------RPSIFAIGPYAHETQLVSPPVFSTYTTEPSSAPITPPLDDSS 169

Query:  819 FYLTTTTPSSPEVPFAQLFNS---SSNYGVRSPV-SNYEFQFYQLPPGSPLAQLISPSSV 652
             YLTTTTPSSPEVPFAQLFNS   + +YG + P+ S+YEFQFYQLPPGSPL QLISPS  
Sbjct:  170 IYLTTTTPSSPEVPFAQLFNSNHQTGSYGYKFPMSSSYEFQFYQLPPGSPLGQLISPS-- 227

Query:  651 MSGSGTTSPFPDG----VAHFQVSDPPKLLSPGKLRCSKTVTAPKEHMKIARPNKPVSFD 484
              GSG TSPFPDG      HFQVSDPPKLLSP     +  VT P +  KI RP+KPVSFD
Sbjct:  228 -PGSGPTSPFPDGETSLFPHFQVSDPPKLLSPK----TAGVTTPCKEQKIVRPHKPVSFD 282

Query:  483 LDADHFIRCVDQKLRTTFPEASTSSLQEAAQHSSLGGPSKEFDFGTTDGKHLTVDDEHRD 304
            LDADH IRCVDQKLRTTFPEAS+   QE+  HSSLG  +KEF+FG TD KHLTVD+    
Sbjct:  283 LDADHVIRCVDQKLRTTFPEASSD--QESMNHSSLGS-NKEFNFG-TDEKHLTVDEHRSA 338

Query:  303 STKNSSDWS-FPVMQSGTLS* 244
            S KNS+DWS FPVMQSGTLS*
Sbjct:  339 SPKNSNDWSFFPVMQSGTLS* 359

>TAIR9_protein||AT5G52430.1 | Symbols:  | hydroxyproline-rich glycoprotein
        family protein | chr5:21283093-21285045 REVERSE

          Length = 439

 Score =  205 bits (519), Expect = 1e-052
 Identities = 126/265 (47%), Positives = 163/265 (61%), Gaps = 31/265 (11%)
 Frame = -1

Query: 1314 NNVLETINAAATAFASSDDRVHHQPSPIHKKR--RWWNRFICFRPSTQRKKRIGKAALAP 1141
            NN +ET+NAAATA  +++ RV  QPS   K R  + W+ + CF  + +  KRIG A L P
Sbjct:    6 NNSVETVNAAATAIVTAESRV--QPSSSQKGRWGKCWSLYSCF-GTQKNNKRIGNAVLVP 62

Query: 1140 EP----VPTDSTSNSGYRSVMTALPFIAPPSSPASFFQSEPPSATQSPVGILSFSPLPSN 973
            EP    VP  +  NS   S    LPFIAPPSSPASF QS+P S + SPVG LS +     
Sbjct:   63 EPVTSGVPVVTVQNSA-TSTTVVLPFIAPPSSPASFLQSDPSSVSHSPVGPLSLT----- 116

Query:  972 NHNNNNNSEERPSIFAIGPYAHEPQLVSPPVFSTYTTEPSSAPVTPPLDESFYLTTTTPS 793
              +N  + +E  S+F +GPYA+E Q V+PPVFS + TEPS+AP TPP + S ++  TTPS
Sbjct:  117 --SNTFSPKEPQSVFTVGPYANETQPVTPPVFSAFITEPSTAPYTPPPESSVHI--TTPS 172

Query:  792 SPEVPFAQLFNSS---------SNYGVRSPVSNYEFQFYQLPPGSP-LAQLISPSSVMSG 643
            SPEVPFAQL  SS         S    +   S+YEF+  Q+ PGSP    LISP SV+S 
Sbjct:  173 SPEVPFAQLLTSSLELTRRDSTSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISN 232

Query:  642 SGTTSPFP--DGVAHFQVSDPPKLL 574
            SGT+SP+P    +  F++ +PPK L
Sbjct:  233 SGTSSPYPGKSPMVEFRIGEPPKFL 257


 Score =  94 bits (231), Expect = 3e-019
 Identities = 53/112 (47%), Positives = 69/112 (61%), Gaps = 3/112 (2%)
 Frame = -1

Query: 1038 SEPPSATQSPVGILSFSPL-PSNNHNNNNNSEERPSIFAIGPYAHEPQLVSPPVFSTYTT 862
            S P S  QS    +S SP+ P +  +N  + +E  S+F +GPYA+E Q V+PPVFS + T
Sbjct:   92 SSPASFLQSDPSSVSHSPVGPLSLTSNTFSPKEPQSVFTVGPYANETQPVTPPVFSAFIT 151

Query:  861 EPSSAPVTPPLDESFYLTTTTPSSPEVPFAQLFNSSSNYGVRSPVSNYEFQF 706
            EPS+AP TPP + S ++  TTPSSPEVPFAQL  SS     R   S    +F
Sbjct:  152 EPSTAPYTPPPESSVHI--TTPSSPEVPFAQLLTSSLELTRRDSTSGMNQKF 201

>TAIR9_protein||AT4G25620.1 | Symbols:  | hydroxyproline-rich glycoprotein
        family protein | chr4:13067447-13069296 REVERSE

          Length = 450

 Score =  115 bits (286), Expect = 1e-025
 Identities = 65/135 (48%), Positives = 81/135 (60%), Gaps = 24/135 (17%)
 Frame = -1

Query: 945 ERPSIFAIGPYAHEPQLVSPPVFSTYTTEPSSAPVTPPLDESFYLTTTTPSSPEVPFAQL 766
           E PS F IGPYAHE Q V+PPVFS +TTEPS+AP TPP +        +PSSPEVPFAQL
Sbjct: 121 EPPSAFTIGPYAHETQPVTPPVFSAFTTEPSTAPFTPPPE--------SPSSPEVPFAQL 172

Query: 765 F---------NSSSNYGVRSPVSNYEFQFYQLPPGSPLAQLISPSSVMSGSGTTSPFPD- 616
                     NS      +   ++YEF+  Q+ PGSP   LISP     GSGT+SP+P  
Sbjct: 173 LTSSLERARRNSGGGMNQKFSAAHYEFKSCQVYPGSPGGNLISP-----GSGTSSPYPGK 227

Query: 615 -GVAHFQVSDPPKLL 574
             +  F++ +PPK L
Sbjct: 228 CSIIEFRIGEPPKFL 242


 Score =  90 bits (221), Expect = 4e-018
 Identities = 50/107 (46%), Positives = 66/107 (61%), Gaps = 8/107 (7%)
 Frame = -1

Query: 1314 NNVLETINAAATAFASSDDRVHHQPSPIHKKR-RWWNRFICFRPSTQRKKRIGKAALAPE 1138
            N+ ++T+NAAA+A  S++ R   QPS + KKR  WW+ + CF  S +  KRIG A L PE
Sbjct:    6 NSSVDTVNAAASAIVSAESRT--QPSSVQKKRGSWWSLYWCF-GSKKNNKRIGHAVLVPE 62

Query: 1137 PVPTDS----TSNSGYRSVMTALPFIAPPSSPASFFQSEPPSATQSP 1009
            P  + +      NS   S    +PFIAPPSSPASF  S PPSA+ +P
Sbjct:   63 PAASGAAVAPVQNSSSNSTSIFMPFIAPPSSPASFLPSGPPSASHTP 109

>TAIR9_protein||AT1G76660.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma
        membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
        growth stages; BEST Arabidopsis thaliana protein match is:
        hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1); Has
        280 Blast hits to 160 proteins in 42 species: Archae - 0; Bacteria - 4;
        Metazoa - 55; Fungi - 17; Plants - 64; Viruses - 4; Other Eukaryotes -
        136 (source: NCBI BLink). | chr1:28769157-28771036 REVERSE

          Length = 432

 Score =  99 bits (244), Expect = 8e-021
 Identities = 62/139 (44%), Positives = 77/139 (55%), Gaps = 4/139 (2%)
 Frame = -1

Query: 1038 SEPPSATQSPVGILSFSPLPSNNHNNNNNSEERPSIFAIGPYAHEPQLVSPPVFSTYTTE 859
            S P S T S +   + SP    +   N+      S++A GPYAHE QLVSPPVFST+TTE
Sbjct:   73 SSPASFTNSALPSTTQSPNCYLSLAANSPGGPSSSMYATGPYAHETQLVSPPVFSTFTTE 132

Query:  858 PSSAPVTPPLDESFYLTTTTPSSPEVPFAQLFNSSSNYGVRSPVSNYEFQ-FYQLPPGSP 682
            PS+AP TPP +       T PSSP+VP+A+   SS +          + Q  Y L PGSP
Sbjct:  133 PSTAPFTPPPE---LARLTAPSSPDVPYARFLTSSMDLKNSGKGHYNDLQATYSLYPGSP 189

Query:  681 LAQLISPSSVMSGSGTTSP 625
             + L SP S  SG G  SP
Sbjct:  190 ASALRSPISRASGDGLLSP 208

  Database: TAIR9 protein
    Posted date:  Wed Jul 08 15:16:08 2009
  Number of letters in database: 13,468,323
  Number of sequences in database:  33,410

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,760,199,660
Number of Sequences: 33410
Number of Extensions: 4760199660
Number of Successful Extensions: 173999632
Number of sequences better than 0.0: 0