Library    |     Search    |     Batch query    |     SNP    |     SSR  

TAIR blast output of UN83098


BLASTX 7.6.2

Query= UN83098 /QuerySize=863
        (862 letters)

Database: TAIR9 protein;
          33,410 sequences; 13,468,323 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

TAIR9_protein||AT1G76240.1 | Symbols:  | unknown protein | chr1:...    448   3e-126
TAIR9_protein||AT2G17080.1 | Symbols:  | unknown protein | chr2:...     69   3e-012
TAIR9_protein||AT2G40070.1 | Symbols:  | FUNCTIONS IN: molecular...     61   9e-010
TAIR9_protein||AT2G40070.2 | Symbols:  | FUNCTIONS IN: molecular...     61   9e-010
TAIR9_protein||AT1G68725.1 | Symbols: AGP19, ATAGP19 | AGP19 (AR...     54   1e-007
TAIR9_protein||AT4G35800.1 | Symbols: NRPB1, RPB1, RNA_POL_II_LS...     48   6e-006
TAIR9_protein||AT2G45000.1 | Symbols: EMB2766 | EMB2766 (EMBRYO ...     48   8e-006

>TAIR9_protein||AT1G76240.1 | Symbols:  | unknown protein |
        chr1:28602949-28603875 REVERSE

          Length = 309

 Score =  448 bits (1150), Expect = 3e-126
 Identities = 228/281 (81%), Positives = 253/281 (90%), Gaps = 4/281 (1%)
 Frame = +3

Query:  30 MVGVFRRSLSFPNKPTVRPPPPSKPRVSHHTRSISLPCRSHPLISHINHEISQIKSWSSL 209
           MVGVFRRSLSFPNKP  R  P SKPRVSHHTRSISLPCRSHPLISH+NHEISQ+KSW S 
Sbjct:   1 MVGVFRRSLSFPNKPCGRSSPSSKPRVSHHTRSISLPCRSHPLISHVNHEISQLKSWFSF 60

Query: 210 ----DRRTTAWITDGLSLLRDVQETLSDILHLPQSQESLRNRPVFFENLLEDLLRFVDAY 377
                 RTT+WITDGLSLL+DVQETL+DIL LPQSQESLRNRPVFFENLLEDLLRFVDAY
Sbjct:  61 AGETHSRTTSWITDGLSLLKDVQETLADILQLPQSQESLRNRPVFFENLLEDLLRFVDAY 120

Query: 378 GIFRTSLLSLREHQSAAQVALRRKDDVKISSYVNSRRALARDVAKLTSAVREPKTKYNRC 557
           GIFRTS+L LREHQSAAQVALR+KDD KI+SY+ SRR+LARD+AKLTS++REPKTK+  C
Sbjct: 121 GIFRTSILCLREHQSAAQVALRKKDDEKIASYLKSRRSLARDIAKLTSSIREPKTKHQHC 180

Query: 558 HVDVLNGSYVEAELASVIGDVIEVTVLVSVALFNGVYLSLRSSKTTAFVGFLKRSEKRDK 737
           HVD +NG+Y +AELASVIGDVIEVTVLVSVALFNGVYLSLR++KTT F+GFLKRSEK++K
Sbjct: 181 HVDNVNGTYGDAELASVIGDVIEVTVLVSVALFNGVYLSLRATKTTPFIGFLKRSEKKEK 240

Query: 738 NGEGIEELKQVEEKSLVGLSKKKNEEVKILTQKMMEFENSI 860
             EGI ELKQVEEKSL+GLSKKKNEEVK L ++MME ENSI
Sbjct: 241 LDEGIVELKQVEEKSLIGLSKKKNEEVKSLMKRMMELENSI 281

>TAIR9_protein||AT2G17080.1 | Symbols:  | unknown protein | chr2:7433326-7434117
        REVERSE

          Length = 264

 Score =  69 bits (167), Expect = 3e-012
 Identities = 41/153 (26%), Positives = 81/153 (52%), Gaps = 5/153 (3%)
 Frame = +3

Query: 108 VSHHTRSISLPCRSHPLISHINHEISQIKSWSSLDRRTTAWITDGLSLLRDVQETLSDIL 287
           VS H RS S P RSHP  +H++ ++++++S       +++ I   L  L+++ E+L  ++
Sbjct:   3 VSFHVRSNSFPSRSHPQAAHVDEQLARLRSSEQASSSSSSSICQRLDNLQELHESLDKLI 62

Query: 288 HLPQSQESL--RNRPVFFENLLEDLLRFVDAYGIFRTSLLSLREHQSAAQVALRRKD--- 452
             P +Q++L   +     E LL+  LR +D   I + +L  ++E     Q  LRRK    
Sbjct:  63 SRPVTQQALSQEHNKKAVEQLLDGSLRILDLCNISKDALSEMKEGLMEIQSILRRKRGDL 122

Query: 453 DVKISSYVNSRRALARDVAKLTSAVREPKTKYN 551
             ++  Y+ SR++L +   K+  +++  + + N
Sbjct: 123 SEEVKKYLTSRKSLKKSFQKVQKSLKVTQAEDN 155

>TAIR9_protein||AT2G40070.1 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        cellular_component unknown; EXPRESSED IN: 17 plant structures;
        EXPRESSED DURING: 7 growth stages; BEST Arabidopsis thaliana protein
        match is: proline-rich family protein (TAIR:AT3G09000.1); Has 94255
        Blast hits to 49644 proteins in 1573 species: Archae - 225; Bacteria -
        11215; Metazoa - 37735; Fungi - 21320; Plants - 3339; Viruses - 2662;
        Other Eukaryotes - 17759 (source: NCBI BLink). | chr2:16728378-16731160
        REVERSE

          Length = 608

 Score =  61 bits (146), Expect = 9e-010
 Identities = 66/206 (32%), Positives = 96/206 (46%), Gaps = 13/206 (6%)
 Frame = +1

Query:  49 DHFLSRTNPPSVHHLHQSHAS---LTTQDPSASHAGHTL*SPTSTTRSPRSNPGPP---- 207
           +H  SR    S      S AS    ++  P +  A  T  S T T  S  S P  P    
Sbjct: 153 NHLTSRQQTSSPGLSSSSGASRRPSSSGGPGSRPATPTGRSSTLTANSKSSRPSTPTSRA 212

Query: 208 SIAAPPRGSPTVSASS-ETSKKPSP-TYSTSLSRRSLSATA--PSSSRTSSKTSSASSTP 375
           ++++  R S T S S+   + KP+P + STSLS   L+ TA  P++S   S  S   STP
Sbjct: 213 TVSSATRPSLTNSRSTVSATTKPTPMSRSTSLSSSRLTPTASKPTTSTARSAGSVTRSTP 272

Query: 376 TASSAPRSSPSASTSPPLRSLSGEKTTSRSPPT*TPAALSRETSRS*RRPYASRRRSTTA 555
            +++   + PS ST+P  RS +   +T  S PT  P+     +S   RRP AS   +TT 
Sbjct: 273 -STTTKSAGPSRSTTPLSRS-TARSSTPTSRPTLPPSKTISRSSTPTRRPIASASAATTT 330

Query: 556 ATWTF*TGRTSRRSWRRSSATSSRSP 633
           A  T    + S  +  +   T S++P
Sbjct: 331 ANPTISQIKPSSPAPAKPMPTPSKNP 356

>TAIR9_protein||AT2G40070.2 | Symbols:  | FUNCTIONS IN: molecular_function
        unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
        cellular_component unknown; EXPRESSED IN: 17 plant structures;
        EXPRESSED DURING: 7 growth stages; BEST Arabidopsis thaliana protein
        match is: proline-rich family protein (TAIR:AT3G09000.1); Has 92805
        Blast hits to 48882 proteins in 1559 species: Archae - 225; Bacteria -
        11081; Metazoa - 37135; Fungi - 20962; Plants - 3300; Viruses - 2664;
        Other Eukaryotes - 17438 (source: NCBI BLink). | chr2:16728378-16731040
        REVERSE

          Length = 568

 Score =  61 bits (146), Expect = 9e-010
 Identities = 66/206 (32%), Positives = 96/206 (46%), Gaps = 13/206 (6%)
 Frame = +1

Query:  49 DHFLSRTNPPSVHHLHQSHAS---LTTQDPSASHAGHTL*SPTSTTRSPRSNPGPP---- 207
           +H  SR    S      S AS    ++  P +  A  T  S T T  S  S P  P    
Sbjct: 113 NHLTSRQQTSSPGLSSSSGASRRPSSSGGPGSRPATPTGRSSTLTANSKSSRPSTPTSRA 172

Query: 208 SIAAPPRGSPTVSASS-ETSKKPSP-TYSTSLSRRSLSATA--PSSSRTSSKTSSASSTP 375
           ++++  R S T S S+   + KP+P + STSLS   L+ TA  P++S   S  S   STP
Sbjct: 173 TVSSATRPSLTNSRSTVSATTKPTPMSRSTSLSSSRLTPTASKPTTSTARSAGSVTRSTP 232

Query: 376 TASSAPRSSPSASTSPPLRSLSGEKTTSRSPPT*TPAALSRETSRS*RRPYASRRRSTTA 555
            +++   + PS ST+P  RS +   +T  S PT  P+     +S   RRP AS   +TT 
Sbjct: 233 -STTTKSAGPSRSTTPLSRS-TARSSTPTSRPTLPPSKTISRSSTPTRRPIASASAATTT 290

Query: 556 ATWTF*TGRTSRRSWRRSSATSSRSP 633
           A  T    + S  +  +   T S++P
Sbjct: 291 ANPTISQIKPSSPAPAKPMPTPSKNP 316

>TAIR9_protein||AT1G68725.1 | Symbols: AGP19, ATAGP19 | AGP19
        (ARABINOGALACTAN-PROTEIN 19) | chr1:25809298-25810130 FORWARD

          Length = 249

 Score =  54 bits (128), Expect = 1e-007
 Identities = 35/124 (28%), Positives = 52/124 (41%)
 Frame = +1

Query: 100 SHASLTTQDPSASHAGHTL*SPTSTTRSPRSNPGPPSIAAPPRGSPTVSASSETSKKPSP 279
           S  S+  Q P+AS    T  +P  TT +P +   PP     P  S     +S  +  P+ 
Sbjct:  18 SSFSVNAQGPAASPVTSTTTAPPPTTAAPPTTAAPPPTTTTPPVSAAQPPASPVTPPPAV 77

Query: 280 TYSTSLSRRSLSATAPSSSRTSSKTSSASSTPTASSAPRSSPSASTSPPLRSLSGEKTTS 459
           T ++  + +     +P++       S  +S PT S  P S P A TSPP    S     +
Sbjct:  78 TPTSPPAPKVAPVISPATPPPQPPQSPPASAPTVSPPPVSPPPAPTSPPPTPASPPPAPA 137

Query: 460 RSPP 471
             PP
Sbjct: 138 SPPP 141

>TAIR9_protein||AT4G35800.1 | Symbols: NRPB1, RPB1, RNA_POL_II_LSRNA_POL_II_LS,
        RNA_POL_II_LS | NRPB1 (RNA POLYMERASE II LARGE SUBUNIT); DNA binding /
        DNA-directed RNA polymerase | chr4:16961115-16967892 REVERSE

          Length = 1841

 Score =  48 bits (113), Expect = 6e-006
 Identities = 44/124 (35%), Positives = 61/124 (49%), Gaps = 4/124 (3%)
 Frame = +1

Query:  109 SLTTQDPSASHAGHTL*SPTSTTRSPRSNPGPPSIA-APPRGSPTVSASSETSKKPSPTY 285
            S T+   S S  G++  SP  +  SP  +P  PS +   P  SPT  + S TS   SPT 
Sbjct: 1574 SPTSPTYSPSSPGYSPTSPAYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTS 1633

Query:  286 -STSLSRRSLSATAPSSSRTSSKTS--SASSTPTASSAPRSSPSASTSPPLRSLSGEKTT 456
             S S +  + S T+P+ S TS   S  S S +PT+ S   +SPS S + P  S +    +
Sbjct: 1634 PSYSPTSPAYSPTSPAYSPTSPAYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYS 1693

Query:  457 SRSP 468
              SP
Sbjct: 1694 PTSP 1697


 Score =  48 bits (112), Expect = 8e-006
 Identities = 46/125 (36%), Positives = 63/125 (50%), Gaps = 6/125 (4%)
 Frame = +1

Query:  109 SLTTQDPSASHAGHTL*SPTSTTRSPRSNPGPPSIA-APPRGSPTVSASSETSKKPSPTY 285
            S T+   S +  G++  SPT +  SP  +P  P+ +   P  SPT  + S TS   SPT 
Sbjct: 1560 SPTSPGYSPTSPGYSPTSPTYSPSSPGYSPTSPAYSPTSPSYSPTSPSYSPTSPSYSPTS 1619

Query:  286 -STSLSRRSLSATAPSSSRTS---SKTSSASSTPTASSAPRSSPSASTSPPLRSLSGEKT 453
             S S +  S S T+PS S TS   S TS A S PT+ +   +SPS S + P  S +    
Sbjct: 1620 PSYSPTSPSYSPTSPSYSPTSPAYSPTSPAYS-PTSPAYSPTSPSYSPTSPSYSPTSPSY 1678

Query:  454 TSRSP 468
            +  SP
Sbjct: 1679 SPTSP 1683

>TAIR9_protein||AT2G45000.1 | Symbols: EMB2766 | EMB2766 (EMBRYO DEFECTIVE
        2766); structural constituent of nuclear pore | chr2:18564156-18567632
        FORWARD

          Length = 740

 Score =  48 bits (112), Expect = 8e-006
 Identities = 41/134 (30%), Positives = 61/134 (45%), Gaps = 1/134 (0%)
 Frame = +1

Query:  16 SPPKTWLEFSGDHFLSRTNPPSVHHLHQSHASLTTQDPSASHAGHTL*SPTSTTRSPRSN 195
           S P  +   S     S  +P  V   + S  S T+   ++  +  T  S   +T S  ++
Sbjct: 288 STPSLFASSSSGATTSSPSPFGVSTFNSSSTSNTSNASASPFSASTGFSFLKSTASSTTS 347

Query: 196 PGPPSIAAPPRGSPTVSASSETSKKPSPTYSTSLSRRSLSATAPSSSRTSSKTSSASSTP 375
              PS A P   S + S S  TS       ST  S    S+T+ +    ++ T+++SSTP
Sbjct: 348 STTPS-APPQTASSSSSFSFGTSANSGFNLSTGSSAAPASSTSGAVFSIATTTTTSSSTP 406

Query: 376 TASSAPRSSPSAST 417
            A+SAP SS  AST
Sbjct: 407 AATSAPASSAPAST 420

  Database: TAIR9 protein
    Posted date:  Wed Jul 08 15:16:08 2009
  Number of letters in database: 13,468,323
  Number of sequences in database:  33,410

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 33,252,145,367
Number of Sequences: 33410
Number of Extensions: 33252145367
Number of Successful Extensions: 1129160757
Number of sequences better than 0.0: 0