Library    |     Search    |     Batch query    |     SNP    |     SSR  

TAIR blast output of UN20045


BLASTX 7.6.2

Query= UN20045 /QuerySize=1559
        (1558 letters)

Database: TAIR9 protein;
          33,410 sequences; 13,468,323 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

TAIR9_protein||AT2G40060.1 | Symbols:  | protein binding / struc...    350   2e-096
TAIR9_protein||AT2G40070.1 | Symbols:  | FUNCTIONS IN: molecular...    253   3e-067
TAIR9_protein||AT2G40070.2 | Symbols:  | FUNCTIONS IN: molecular...    253   3e-067
TAIR9_protein||AT2G20760.1 | Symbols:  | protein binding / struc...    219   6e-057
TAIR9_protein||AT3G51890.1 | Symbols:  | protein binding / struc...    217   2e-056
TAIR9_protein||AT3G09000.1 | Symbols:  | proline-rich family pro...    131   2e-030
TAIR9_protein||AT5G01280.1 | Symbols:  | FUNCTIONS IN: molecular...     90   4e-018

>TAIR9_protein||AT2G40060.1 | Symbols:  | protein binding / structural molecule
        | chr2:16726564-16728001 FORWARD

          Length = 259

 Score =  350 bits (896), Expect = 2e-096
 Identities = 179/225 (79%), Positives = 196/225 (87%), Gaps = 13/225 (5%)
 Frame = +1

Query: 163 SAQVDDSINDDVFAAPSSDYGGYSNGD---------DGPILPPPSEMESDEGAALREWRR 315
           S QV+DS+ DDVFAAPSSDYG YSNGD         DGPILPPPSEMESDEG ALREWRR
Sbjct:  38 SLQVEDSV-DDVFAAPSSDYGAYSNGDGIFGSNGDHDGPILPPPSEMESDEGFALREWRR 96

Query: 316 QNAIQLEEKEKREKELRNQIIEEADQFKEEFHKKRELACENNKAANREKQKLYVETQEKF 495
           QNAIQLEEKEKREKEL  QIIEEADQ+KEEFHKK E+ CENNKAANREK+KLY+E QEKF
Sbjct:  97 QNAIQLEEKEKREKELLKQIIEEADQYKEEFHKKIEVTCENNKAANREKEKLYLENQEKF 156

Query: 496 YAEASKNYWKAIAELVPKEVPIIEKTRRGKKKEEDPKKPTTISVIQGPKPGKPTDLSRMR 675
           YAE+SKNYWKAIAELVPKEVP IEK RRGKK+++DPKKP T+SVIQGPKPGKPTDL+RMR
Sbjct: 157 YAESSKNYWKAIAELVPKEVPTIEK-RRGKKEQQDPKKP-TVSVIQGPKPGKPTDLTRMR 214

Query: 676 QILLKLKQNPPAHLKLTPQPPPSE-AAPPENVPQTKPAEAVAAS* 807
           QIL+KLK NPP+HLKLT QPP  E AAPP+NVP+TKP EAV A+*
Sbjct: 215 QILVKLKHNPPSHLKLTSQPPSEEAAAPPKNVPETKPTEAVTAA* 259

>TAIR9_protein||AT2G40070.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        cellular_component unknown; EXPRESSED IN: 17 plant structures;
        EXPRESSED DURING: 7 growth stages; BEST Arabidopsis thaliana protein
        match is: proline-rich family protein (TAIR:AT3G09000.1); Has 94255
        Blast hits to 49644 proteins in 1573 species: Archae - 225; Bacteria -
        11215; Metazoa - 37735; Fungi - 21320; Plants - 3339; Viruses - 2662;
        Other Eukaryotes - 17759 (source: NCBI BLink). | chr2:16728378-16731160
        REVERSE

          Length = 608

 Score =  253 bits (645), Expect = 3e-067
 Identities = 134/144 (93%), Positives = 136/144 (94%), Gaps = 3/144 (2%)
 Frame = -2

Query: 1557 MVERVINMRKLAPPRSDEKGSPHGNLSAKSSSPDSAGFGRNLSKKSPDMAIRHMDIRRTI 1378
            MVERVINMRKLAPPRSD+KGSPHGNLSAKSSSPDSAGFGR LSKKS DMAIRHMDIRRTI
Sbjct:  467 MVERVINMRKLAPPRSDDKGSPHGNLSAKSSSPDSAGFGRTLSKKSLDMAIRHMDIRRTI 526

Query: 1377 PGNLRPLMTNIPASSMYSVRSGHTRGRPMNVSD-SPLATSSNASSEISVYNNNGMCLEAA 1201
            PGNLRPLMTNIPASSMYSVRSGHTRGRPMNVSD SPLATSSNASSEISV NNNG+CLE A
Sbjct:  527 PGNLRPLMTNIPASSMYSVRSGHTRGRPMNVSDSSPLATSSNASSEISVCNNNGICLE-A 585

Query: 1200 SEKEDDAGSGRGCRSPASSLQGR* 1129
            SEKEDDAGS RGCRSPA SLQGR*
Sbjct:  586 SEKEDDAGSERGCRSPA-SLQGR* 608

>TAIR9_protein||AT2G40070.2 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        cellular_component unknown; EXPRESSED IN: 17 plant structures;
        EXPRESSED DURING: 7 growth stages; BEST Arabidopsis thaliana protein
        match is: proline-rich family protein (TAIR:AT3G09000.1); Has 92805
        Blast hits to 48882 proteins in 1559 species: Archae - 225; Bacteria -
        11081; Metazoa - 37135; Fungi - 20962; Plants - 3300; Viruses - 2664;
        Other Eukaryotes - 17438 (source: NCBI BLink). | chr2:16728378-16731040
        REVERSE

          Length = 568

 Score =  253 bits (645), Expect = 3e-067
 Identities = 134/144 (93%), Positives = 136/144 (94%), Gaps = 3/144 (2%)
 Frame = -2

Query: 1557 MVERVINMRKLAPPRSDEKGSPHGNLSAKSSSPDSAGFGRNLSKKSPDMAIRHMDIRRTI 1378
            MVERVINMRKLAPPRSD+KGSPHGNLSAKSSSPDSAGFGR LSKKS DMAIRHMDIRRTI
Sbjct:  427 MVERVINMRKLAPPRSDDKGSPHGNLSAKSSSPDSAGFGRTLSKKSLDMAIRHMDIRRTI 486

Query: 1377 PGNLRPLMTNIPASSMYSVRSGHTRGRPMNVSD-SPLATSSNASSEISVYNNNGMCLEAA 1201
            PGNLRPLMTNIPASSMYSVRSGHTRGRPMNVSD SPLATSSNASSEISV NNNG+CLE A
Sbjct:  487 PGNLRPLMTNIPASSMYSVRSGHTRGRPMNVSDSSPLATSSNASSEISVCNNNGICLE-A 545

Query: 1200 SEKEDDAGSGRGCRSPASSLQGR* 1129
            SEKEDDAGS RGCRSPA SLQGR*
Sbjct:  546 SEKEDDAGSERGCRSPA-SLQGR* 568

>TAIR9_protein||AT2G20760.1 | Symbols:  | protein binding / structural molecule
        | chr2:8943279-8945108 REVERSE

          Length = 339

 Score =  219 bits (556), Expect = 6e-057
 Identities = 121/225 (53%), Positives = 150/225 (66%), Gaps = 17/225 (7%)
 Frame = +1

Query: 130 GDDASESVPQVSAQVDDSINDDVFAAPSSDYGGYSNGD-----DGPILPPPSEMESDEGA 294
           G  AS      S+  + S+ND      ++  GG S GD     DGPILP P+EM  +EG 
Sbjct:  57 GFGASSPNHDFSSPFESSVND------ANGNGGGSGGDAIFASDGPILPDPNEMR-EEGF 109

Query: 295 ALREWRRQNAIQLEEKEKREKELRNQIIEEADQFKEEFHKKRELACENNKAANREKQKLY 474
             REWRR N I LEEKEK+EKE+RNQII EA+ FK+ F++KR+   E NK  NREK+KLY
Sbjct: 110 QRREWRRLNTIHLEEKEKKEKEMRNQIITEAEDFKKAFYEKRDKTIETNKTDNREKEKLY 169

Query: 475 VETQEKFYAEASKNYWKAIAELVPKEVPIIEKTRRGKKKEEDPKKPTTISVIQGPKPGKP 654
              QEKF+ E  K+YWKAIAEL+P+EVP IEK +RGKK   DP K  +++VIQGPKPGKP
Sbjct: 170 WANQEKFHKEVDKHYWKAIAELIPREVPNIEK-KRGKK---DPDKKPSVNVIQGPKPGKP 225

Query: 655 TDLSRMRQILLKLKQNPPAHLKLTPQPPPSEAAPPENVPQTKPAE 789
           TDL RMRQI LKLK NPP H+ + P PP  +A   ++    K A+
Sbjct: 226 TDLGRMRQIFLKLKTNPPPHM-MPPPPPAKDAKDGKDAKDGKDAK 269

>TAIR9_protein||AT3G51890.1 | Symbols:  | protein binding / structural molecule
        | chr3:19249686-19250890 REVERSE

          Length = 259

 Score =  217 bits (552), Expect = 2e-056
 Identities = 111/174 (63%), Positives = 131/174 (75%), Gaps = 8/174 (4%)
 Frame = +1

Query: 253 ILPPPSEMESDEGAALREWRRQNAIQLEEKEKREKELRNQIIEEADQFKEEFHKKRELAC 432
           ILPPPS ME +EG ALREWRR NA++LEEKEK EKE+  QI+E A+Q+K EF+ KR +  
Sbjct:  85 ILPPPSAMEKEEGFALREWRRLNALRLEEKEKEEKEMVQQILEAAEQYKAEFYSKRNVTI 144

Query: 433 ENNKAANREKQKLYVETQEKFYAEASKNYWKAIAELVPKEVPIIEKTRRGKKKEEDPKKP 612
           ENNK  NREK+K ++E QEKFYAEA KN WKAIAEL+P+EVP+IE   RG K     KK 
Sbjct: 145 ENNKKLNREKEKFFLENQEKFYAEADKNNWKAIAELIPREVPVIE--NRGNK-----KKT 197

Query: 613 TTISVIQGPKPGKPTDLSRMRQILLKLKQNPPAHLKLTPQPPPSEAAPPENVPQ 774
            TI+VIQGPKPGKPTDLSRMRQ+L KLK NPP H+K    P PS A P  +V +
Sbjct: 198 ATITVIQGPKPGKPTDLSRMRQVLTKLKHNPPTHMK-PKLPSPSGADPNVSVSE 250

>TAIR9_protein||AT3G09000.1 | Symbols:  | proline-rich family protein |
        chr3:2746014-2748326 FORWARD

          Length = 542

 Score =  131 bits (328), Expect = 2e-030
 Identities = 80/132 (60%), Positives = 93/132 (70%), Gaps = 10/132 (7%)
 Frame = -2

Query: 1557 MVERVINMRKLAPPRSDEKGSPHGNLSAKSSSP-DSAGFGRNLSKKSPDMAIRHMDIRRT 1381
            MVERV+NMRKL PPR  E G   G  S KSSS  +S G+GRNLSK S DMAIRHMDIRR 
Sbjct:  403 MVERVVNMRKLGPPRLTENG---GRGSGKSSSAFNSLGYGRNLSKSSIDMAIRHMDIRRG 459

Query: 1380 IPGNLRPLMTNIPASSMYSVRSGHTRGRPMNVSDSPLATSSN-ASSEISVYNNNGMCLEA 1204
            + GNLRPL+T +PASSMYSVRS     RP +VS SP+ATSS  +SS+ SV N N +CL+ 
Sbjct:  460 MTGNLRPLVTKVPASSMYSVRS-----RPGSVSSSPVATSSTVSSSDPSVDNINILCLDG 514

Query: 1203 ASEKEDDAGSGR 1168
               + DD  S R
Sbjct:  515 NEAENDDLLSER 526

>TAIR9_protein||AT5G01280.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        cellular_component unknown; EXPRESSED IN: sperm cell, male gametophyte;
        EXPRESSED DURING: L mature pollen stage, M germinated pollen stage;
        BEST Arabidopsis thaliana protein match is: proline-rich family protein
        (TAIR:AT3G09000.1); Has 40332 Blast hits to 19265 proteins in 905
        species: Archae - 97; Bacteria - 3595; Metazoa - 17370; Fungi - 8276;
        Plants - 837; Viruses - 1209; Other Eukaryotes - 8948 (source: NCBI
        BLink). | chr5:114185-116237 REVERSE

          Length = 461

 Score =  90 bits (221), Expect = 4e-018
 Identities = 62/128 (48%), Positives = 79/128 (61%), Gaps = 22/128 (17%)
 Frame = -2

Query: 1554 VERVINMRKLAPPRSDEKGSPH-----GNLSAKSSSPDSA--GFGRNLSKKSPDMAIRHM 1396
            VE+V+NMRKLA PR  E GS       G+ SA  SS  S   GFGRNLSK S DMA+RHM
Sbjct:  339 VEKVVNMRKLATPRLTESGSRRLGGGGGDSSAGKSSSGSGGFGFGRNLSKSSIDMALRHM 398

Query: 1395 DIRR-TIPGNLRPLMTNIPASSMYSVRSGHTRGRPMNVSDSPLATSSNASSEISVYNNNG 1219
            D+R+ ++ GN R  +T  PA+S+YSVRS   R RP+         SS+ SSE SV   N 
Sbjct:  399 DVRKGSMAGNFRHSVTKAPATSVYSVRS--CRNRPV---------SSSRSSESSV---NI 444

Query: 1218 MCLEAASE 1195
            +CL+ + +
Sbjct:  445 LCLDGSDD 452

  Database: TAIR9 protein
    Posted date:  Wed Jul 08 15:16:08 2009
  Number of letters in database: 13,468,323
  Number of sequences in database:  33,410

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,459,653,557
Number of Sequences: 33410
Number of Extensions: 10459653557
Number of Successful Extensions: 353860551
Number of sequences better than 0.0: 0