Library    |     Search    |     Batch query    |     SNP    |     SSR  

GenBank blast output of UN63473


BLASTX 7.6.2

Query= UN63473 /QuerySize=687
        (686 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|297829232|ref|XP_002882498.1| hypothetical protein ARALYDRAFT...    273   1e-071
gi|186509861|ref|NP_001118595.1| uncharacterized protein [Arabid...    273   2e-071
gi|21536969|gb|AAM61310.1| unknown [Arabidopsis thaliana]              149   5e-034
gi|18422990|ref|NP_568706.1| uncharacterized protein [Arabidopsi...    148   9e-034
gi|30680042|ref|NP_187343.2| proline-rich family protein [Arabid...    140   2e-031
gi|6728994|gb|AAF26991.1|AC016827_2 unknown protein [Arabidopsis...    127   2e-027
gi|255551795|ref|XP_002516943.1| conserved hypothetical protein ...     98   8e-019
gi|297795649|ref|XP_002865709.1| hypothetical protein ARALYDRAFT...     91   1e-016
gi|226501094|ref|NP_001146008.1| hypothetical protein LOC1002795...     80   3e-013
gi|225431743|ref|XP_002270026.1| PREDICTED: hypothetical protein...     69   5e-010
gi|296083358|emb|CBI22994.3| unnamed protein product [Vitis vini...     69   5e-010

>gi|297829232|ref|XP_002882498.1| hypothetical protein ARALYDRAFT_478007
        [Arabidopsis lyrata subsp. lyrata]

          Length = 372

 Score =  273 bits (698), Expect = 1e-071
 Identities = 150/219 (68%), Positives = 177/219 (80%), Gaps = 13/219 (5%)
 Frame = +1

Query:  46 SASSSSSSSASLVYKRSKSMAASYGESFSQRKRSGFWSFLHLYSSKQHQVASTTKKASNV 225
           S+SSSSSSSA+L+YKRSKS AA+YGESFSQRKRSGFWSFLHLYSSK HQ+++TTKK  N 
Sbjct: 106 SSSSSSSSSANLIYKRSKSTAAAYGESFSQRKRSGFWSFLHLYSSK-HQISNTTKKVDNF 164

Query: 226 SHSSRPGNKIDNGKHQTTETTNQVV--GRGIDVIVKEEDKRSSGNKVVAETPTN-SVGSG 396
           SHS R     +     TTET+++ V  G GIDVIV+EED+ S  NKVV+ETPTN  +G G
Sbjct: 165 SHSRR-----NQRTESTTETSSKRVGGGGGIDVIVEEEDE-SPPNKVVSETPTNGGIGGG 218

Query: 397 GGSSFGRKVLRSRSVGYGNRSFSS---ERISNGIGDCALRRIESQREYSTKVNCNGGDEA 567
           GGSSFGRKVLRSRSVG G+RSFS    ERISNG GDCALRRIESQRE +  ++  GG EA
Sbjct: 219 GGSSFGRKVLRSRSVGCGSRSFSGDFFERISNGFGDCALRRIESQREATKVISNGGGGEA 278

Query: 568 GDAMSETVKCGGIFGGFTIMTPSASTSTSASSTVDHHHH 684
            +AM+E VKCGGIFGGF IMTPS+++S++ SSTVDHHH+
Sbjct: 279 ANAMNEMVKCGGIFGGFMIMTPSSTSSSTTSSTVDHHHN 317

>gi|186509861|ref|NP_001118595.1| uncharacterized protein [Arabidopsis
        thaliana]

          Length = 369

 Score =  273 bits (697), Expect = 2e-071
 Identities = 149/218 (68%), Positives = 172/218 (78%), Gaps = 13/218 (5%)
 Frame = +1

Query:  46 SASSSSSSSASLVYKRSKSMAASYGESFSQRKRSGFWSFLHLYSSKQHQVASTTKKASNV 225
           S+SSSSSSSA+L+YKRSKS AA+YGESFSQRKRSGFWSF HLYSSK HQ+++TTKK  N 
Sbjct: 106 SSSSSSSSSANLIYKRSKSTAAAYGESFSQRKRSGFWSFFHLYSSK-HQISNTTKKVDNF 164

Query: 226 SHSSRPGNKIDNGKHQTTETTNQVV--GRGIDVIVKEEDKRSSGNKVVAETPTNSVGSGG 399
           SH  R     +      TET++  V  G GIDVIV+EED+  S NKVV+ETPTN +G GG
Sbjct: 165 SHLRR-----NQRTESKTETSSMRVGGGGGIDVIVEEEDE--SPNKVVSETPTNGIGGGG 217

Query: 400 GSSFGRKVLRSRSVGYGNRSFSS---ERISNGIGDCALRRIESQREYSTKVNCNGGDEAG 570
           GSSFGRKVLRSRSVG G+RSFS    ERISNG GDCALRRIESQRE +  ++  GG EA 
Sbjct: 218 GSSFGRKVLRSRSVGCGSRSFSGDFFERISNGFGDCALRRIESQREATKVISNGGGGEAA 277

Query: 571 DAMSETVKCGGIFGGFTIMTPSASTSTSASSTVDHHHH 684
           DAMSE VKCGGIFGGF IMT S++TS++ SSTVDHHH+
Sbjct: 278 DAMSEMVKCGGIFGGFMIMTSSSTTSSTTSSTVDHHHN 315

>gi|21536969|gb|AAM61310.1| unknown [Arabidopsis thaliana]

          Length = 388

 Score =  149 bits (374), Expect = 5e-034
 Identities = 98/230 (42%), Positives = 136/230 (59%), Gaps = 43/230 (18%)
 Frame = +1

Query:  46 SASSSSSSSASLVYKRSKS---MAASYGES-FSQRKRSGFWSFLHLYSSKQ--------- 186
           +ASSSSS++A++VYKRS+S      +YG+S  S RKR+GFWSF HLYSSKQ         
Sbjct: 101 TASSSSSTTANIVYKRSQSTRTTKTTYGDSDLSPRKRNGFWSFFHLYSSKQHGSSKKVGN 160

Query: 187 -HQVASTTKKASNVSHSSRPGNKIDNGKHQTTETTNQVVG--------RGIDVIVKEEDK 339
            HQ  S T+  + ++ ++  G+   +    ++  + +VVG         GIDVIV+E+  
Sbjct: 161 FHQPISQTETKTELAETTTVGS--SSSSSASSSMSKRVVGGGGSSSNRNGIDVIVEED-- 216

Query: 340 RSSGNKVVAETPTNSVGSGGGSSFGRKVLRSRSVGYGNRSFSS---ERISNGIGDCALRR 510
              G+  +  TP+            RKV RSRSVG G+RSFS    ERI+NG GDC LRR
Sbjct: 217 ---GSPNIEVTPSE-----------RKVSRSRSVGCGSRSFSGDFFERITNGFGDCTLRR 262

Query: 511 IESQREYSTKVNCNGGDEAGDAMSETVKCGGIFGGFTIMTPSASTSTSAS 660
           +ESQRE +          + + + E V+CGGIFGGF IMT S+S+S+S+S
Sbjct: 263 VESQREGNNNKGNKVSSNSSNGVREMVRCGGIFGGFMIMTSSSSSSSSSS 312

>gi|18422990|ref|NP_568706.1| uncharacterized protein [Arabidopsis thaliana]

          Length = 396

 Score =  148 bits (372), Expect = 9e-034
 Identities = 98/230 (42%), Positives = 135/230 (58%), Gaps = 43/230 (18%)
 Frame = +1

Query:  46 SASSSSSSSASLVYKRSKS---MAASYGES-FSQRKRSGFWSFLHLYSSKQ--------- 186
           +ASSSSS++A++VYKRS+S      +YG+S  S RKR+GFWSF HLYSSKQ         
Sbjct: 109 TASSSSSTTANIVYKRSQSTRTTKTTYGDSDLSPRKRNGFWSFFHLYSSKQHGSSKKVGN 168

Query: 187 -HQVASTTKKASNVSHSSRPGNKIDNGKHQTTETTNQVVG--------RGIDVIVKEEDK 339
            HQ  S T+  + ++ ++  G+   +    ++  + +VVG         GIDVIV+E+  
Sbjct: 169 FHQPISQTETKTELAETTTVGS--SSSSSASSSMSKRVVGGGGSSSNRNGIDVIVEED-- 224

Query: 340 RSSGNKVVAETPTNSVGSGGGSSFGRKVLRSRSVGYGNRSFSS---ERISNGIGDCALRR 510
              G+  +  TP+            RKV RSRSVG G+RSFS    ERI+NG GDC LRR
Sbjct: 225 ---GSPNIEVTPSE-----------RKVSRSRSVGCGSRSFSGDFFERITNGFGDCTLRR 270

Query: 511 IESQREYSTKVNCNGGDEAGDAMSETVKCGGIFGGFTIMTPSASTSTSAS 660
           +ESQRE +            + + E V+CGGIFGGF IMT S+S+S+S+S
Sbjct: 271 VESQREGNNNKGNKVSSNPSNGVREMVRCGGIFGGFMIMTSSSSSSSSSS 320

>gi|30680042|ref|NP_187343.2| proline-rich family protein [Arabidopsis
        thaliana]

          Length = 214

 Score =  140 bits (351), Expect = 2e-031
 Identities = 78/112 (69%), Positives = 85/112 (75%), Gaps = 5/112 (4%)
 Frame = -1

Query: 626 IIVNPPKIPPHFTVSLIASPASSPPLQLTFVEYSLCDSILLKAQSPIPLEILSL---EKL 456
           +I+NPPK+PPH T+SLIAS AS PP  L  +  SLCDSILLKAQSP PLEILS    EKL
Sbjct:   1 MIINPPKMPPHLTISLIASAASPPPPLLMTLVASLCDSILLKAQSPNPLEILSKKSPEKL 60

Query: 455 LFPYPTDLDLNTFLPKEDPPPLPTLFVGVSATTLLPELLLSSSLTITSMPLP 300
           L P PTDLDLNTFLP ++PPP P  FVGVS TTLL   L SSS TITSMP P
Sbjct:  61 LLPQPTDLDLNTFLPNDEPPPPPIPFVGVSETTLLG--LSSSSSTITSMPPP 110

>gi|6728994|gb|AAF26991.1|AC016827_2 unknown protein [Arabidopsis thaliana]

          Length = 207

 Score =  127 bits (318), Expect = 2e-027
 Identities = 73/105 (69%), Positives = 78/105 (74%), Gaps = 5/105 (4%)
 Frame = -1

Query: 605 IPPHFTVSLIASPASSPPLQLTFVEYSLCDSILLKAQSPIPLEILSL---EKLLFPYPTD 435
           +PPH T+SLIAS AS PP  L  +  SLCDSILLKAQSP PLEILS    EKLL P PTD
Sbjct:   1 MPPHLTISLIASAASPPPPLLMTLVASLCDSILLKAQSPNPLEILSKKSPEKLLLPQPTD 60

Query: 434 LDLNTFLPKEDPPPLPTLFVGVSATTLLPELLLSSSLTITSMPLP 300
           LDLNTFLP ++PPP P  FVGVS TTLL   L SSS TITSMP P
Sbjct:  61 LDLNTFLPNDEPPPPPIPFVGVSETTLLG--LSSSSSTITSMPPP 103

>gi|255551795|ref|XP_002516943.1| conserved hypothetical protein [Ricinus
        communis]

          Length = 450

 Score =  98 bits (243), Expect = 8e-019
 Identities = 61/120 (50%), Positives = 73/120 (60%), Gaps = 11/120 (9%)
 Frame = +1

Query: 337 KRSSGNKVVAETPTNSVGSGGGSSFGRKVLRSRSVGYGNRSFSS---ERISNGIGDCALR 507
           K+S    V  +   NS  +   SSF RKV RSRSVG G+RSFS    ERIS G GDC LR
Sbjct: 261 KKSDIVAVEDDDSPNSQATASASSFERKVSRSRSVGCGSRSFSGDFFERISTGFGDCTLR 320

Query: 508 RIESQREYSTKVNCNGGDEAGDAMSETVKCGGIFGGFTIMTPSASTSTSA---SSTVDHH 678
           R+ESQRE   K     G  A   M E VKCGGIFGGF I + S+S+S+S+   SS+ + H
Sbjct: 321 RVESQREGKPK-----GPGAASHMKERVKCGGIFGGFMITSSSSSSSSSSYWVSSSAEEH 375

>gi|297795649|ref|XP_002865709.1| hypothetical protein ARALYDRAFT_917876
        [Arabidopsis lyrata subsp. lyrata]

          Length = 384

 Score =  91 bits (224), Expect = 1e-016
 Identities = 48/85 (56%), Positives = 59/85 (69%), Gaps = 3/85 (3%)
 Frame = +1

Query: 415 RKVLRSRSVGYGNRSFSS---ERISNGIGDCALRRIESQREYSTKVNCNGGDEAGDAMSE 585
           RKV RSRSVG G+RSFS    ERI+NG GDC LRR+ESQRE +            + + E
Sbjct: 227 RKVSRSRSVGCGSRSFSGDFFERITNGFGDCTLRRVESQREGNNNKGNKVSSNPSNGVRE 286

Query: 586 TVKCGGIFGGFTIMTPSASTSTSAS 660
            V+CGGIFGGF IMT S+S+S+S+S
Sbjct: 287 MVRCGGIFGGFMIMTSSSSSSSSSS 311


 Score =  60 bits (145), Expect = 2e-007
 Identities = 33/61 (54%), Positives = 45/61 (73%), Gaps = 7/61 (11%)
 Frame = +1

Query:  52 SSSSSSSASLVYKRSKS---MAASYGES-FSQRKRSGFWSFLHLYSSKQHQVASTTKKAS 219
           +SSS+++A++VYKRS+S      +YG+S  S RKR+GFWSFLHLYSSK H    ++KK  
Sbjct: 104 ASSSATTANIVYKRSQSTRTTKTTYGDSDLSPRKRNGFWSFLHLYSSKHH---GSSKKVG 160

Query: 220 N 222
           N
Sbjct: 161 N 161

>gi|226501094|ref|NP_001146008.1| hypothetical protein LOC100279539 [Zea mays]

          Length = 366

 Score =  80 bits (195), Expect = 3e-013
 Identities = 54/135 (40%), Positives = 68/135 (50%), Gaps = 16/135 (11%)
 Frame = +1

Query: 259 NGKHQTTETTNQVVGRGI----DVIVKEEDKRSSGNKVVA----ETPTNSVGSGGGSSFG 414
           +G H+   ++    G G      V V      S G ++ A    E+P         SSFG
Sbjct: 164 SGSHKGGASSASAAGGGAARRNSVSVASASSASLGGRLEAIVEPESPGRRSEGSSSSSFG 223

Query: 415 RKVLRSRSVGYGNRSFSS---ERISNGIGDCALRRIESQREYSTKV---NCNGGDEAGDA 576
           RKV RSRSVG G+RSFS    ER+S G GDCALRR+ES RE   K    +  GG+E    
Sbjct: 224 RKVARSRSVGCGSRSFSGDFLERLSTGFGDCALRRVESHREPKPKAALGHLGGGEEHEQD 283

Query: 577 MSE--TVKCGGIFGG 615
             +   +KC G FGG
Sbjct: 284 QDQHHRIKCAGFFGG 298

>gi|225431743|ref|XP_002270026.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 420

 Score =  69 bits (167), Expect = 5e-010
 Identities = 39/62 (62%), Positives = 44/62 (70%), Gaps = 4/62 (6%)
 Frame = +1

Query: 364 AETPTNSVGSGGGSSFGRKVLRSRSVGYGNRSFSS---ERISNGIGDCALRRIESQREYS 534
           +E+P NS  +   SSFGRKV RSRSVG G+RSFS    ERIS G GDC LRR+ESQRE  
Sbjct: 237 SESP-NSHATASSSSFGRKVSRSRSVGCGSRSFSGDFFERISTGFGDCTLRRVESQREGK 295

Query: 535 TK 540
            K
Sbjct: 296 PK 297

>gi|296083358|emb|CBI22994.3| unnamed protein product [Vitis vinifera]

          Length = 387

 Score =  69 bits (167), Expect = 5e-010
 Identities = 39/62 (62%), Positives = 44/62 (70%), Gaps = 4/62 (6%)
 Frame = +1

Query: 364 AETPTNSVGSGGGSSFGRKVLRSRSVGYGNRSFSS---ERISNGIGDCALRRIESQREYS 534
           +E+P NS  +   SSFGRKV RSRSVG G+RSFS    ERIS G GDC LRR+ESQRE  
Sbjct: 253 SESP-NSHATASSSSFGRKVSRSRSVGCGSRSFSGDFFERISTGFGDCTLRRVESQREGK 311

Query: 535 TK 540
            K
Sbjct: 312 PK 313

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 112,428,570,657
Number of Sequences: 15229318
Number of Extensions: 112428570657
Number of Successful Extensions: 28740549
Number of sequences better than 0.0: 0