Library    |     Search    |     Batch query    |     SNP    |     SSR  

GenBank blast output of UN82997


BLASTX 7.6.2

Query= UN82997 /QuerySize=885
        (884 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|297829232|ref|XP_002882498.1| hypothetical protein ARALYDRAFT...    283   3e-074
gi|186509861|ref|NP_001118595.1| uncharacterized protein [Arabid...    283   4e-074
gi|30680042|ref|NP_187343.2| proline-rich family protein [Arabid...    142   8e-032
gi|21536969|gb|AAM61310.1| unknown [Arabidopsis thaliana]              141   2e-031
gi|18422990|ref|NP_568706.1| uncharacterized protein [Arabidopsi...    140   4e-031
gi|6728994|gb|AAF26991.1|AC016827_2 unknown protein [Arabidopsis...    129   7e-028
gi|297795649|ref|XP_002865709.1| hypothetical protein ARALYDRAFT...    104   2e-020
gi|255551795|ref|XP_002516943.1| conserved hypothetical protein ...     82   1e-013
gi|224107110|ref|XP_002314378.1| predicted protein [Populus tric...     80   4e-013
gi|225431743|ref|XP_002270026.1| PREDICTED: hypothetical protein...     77   2e-012
gi|255645721|gb|ACU23354.1| unknown [Glycine max]                       71   2e-010
gi|296083358|emb|CBI22994.3| unnamed protein product [Vitis vini...     67   3e-009

>gi|297829232|ref|XP_002882498.1| hypothetical protein ARALYDRAFT_478007
        [Arabidopsis lyrata subsp. lyrata]

          Length = 372

 Score =  283 bits (723), Expect = 3e-074
 Identities = 155/218 (71%), Positives = 178/218 (81%), Gaps = 13/218 (5%)
 Frame = +2

Query: 248 SSSSSSSSSANLVYKRSKSMAASYGESFSQRKRSGFWSFLHLYSSKQHQVASTTKKANNV 427
           SSSSSSSSSANL+YKRSKS AA+YGESFSQRKRSGFWSFLHLYSSK HQ+++TTKK +N 
Sbjct: 106 SSSSSSSSSANLIYKRSKSTAAAYGESFSQRKRSGFWSFLHLYSSK-HQISNTTKKVDNF 164

Query: 428 SHSSRPRNKIDNGKHQTTETSNQVV--GGGIDVIVKEEDERSSSDKVVAATPTN-SVGSG 598
           SHS R     +     TTETS++ V  GGGIDVIV+EEDE S  +KVV+ TPTN  +G G
Sbjct: 165 SHSRR-----NQRTESTTETSSKRVGGGGGIDVIVEEEDE-SPPNKVVSETPTNGGIGGG 218

Query: 599 GGSSFGRKVLRSRSVGCGNRSFSSD---RISNGFGDCALRRIESQREYSTKVSCNSGDEA 769
           GGSSFGRKVLRSRSVGCG+RSFS D   RISNGFGDCALRRIESQRE +  +S   G EA
Sbjct: 219 GGSSFGRKVLRSRSVGCGSRSFSGDFFERISNGFGDCALRRIESQREATKVISNGGGGEA 278

Query: 770 GDAMSETVKCGGIFGGFMIMTPSASTSTSASSTVVHHH 883
            +AM+E VKCGGIFGGFMIMTPS+++S++ SSTV HHH
Sbjct: 279 ANAMNEMVKCGGIFGGFMIMTPSSTSSSTTSSTVDHHH 316


 Score =  113 bits (281), Expect = 5e-023
 Identities = 59/123 (47%), Positives = 80/123 (65%), Gaps = 5/123 (4%)
 Frame = +2

Query:  89 IMNLKDQDMGEGMQCITHPYTKNPGGICALCLQDKLGKLVTSSFPLSKPNHVPSSSSSSS 268
           ++ LKDQDMGEGMQCI HPYTKNPGGICALCLQ+KLGKLVTSSFP+ KPNH+ SSS  S 
Sbjct:   1 MVELKDQDMGEGMQCIRHPYTKNPGGICALCLQEKLGKLVTSSFPVPKPNHLSSSSPKSF 60

Query: 269 SSANLVYKRSKSMAASYGESFSQRKRSGFWSFLHLYSSKQHQVASTTKKANNVSHSSRPR 448
           + +      + S+A S   + + R  +   +   L + K+  + + +  +++ S SS   
Sbjct:  61 TPST-----TSSLALSLSSASNGRDSTSNNNLPFLLAKKKKNMLAASSSSSSSSSSSSSA 115

Query: 449 NKI 457
           N I
Sbjct: 116 NLI 118

>gi|186509861|ref|NP_001118595.1| uncharacterized protein [Arabidopsis
        thaliana]

          Length = 369

 Score =  283 bits (722), Expect = 4e-074
 Identities = 154/217 (70%), Positives = 173/217 (79%), Gaps = 13/217 (5%)
 Frame = +2

Query: 248 SSSSSSSSSANLVYKRSKSMAASYGESFSQRKRSGFWSFLHLYSSKQHQVASTTKKANNV 427
           SSSSSSSSSANL+YKRSKS AA+YGESFSQRKRSGFWSF HLYSSK HQ+++TTKK +N 
Sbjct: 106 SSSSSSSSSANLIYKRSKSTAAAYGESFSQRKRSGFWSFFHLYSSK-HQISNTTKKVDNF 164

Query: 428 SHSSRPRNKIDNGKHQTTETSNQVV--GGGIDVIVKEEDERSSSDKVVAATPTNSVGSGG 601
           SH  R     +      TETS+  V  GGGIDVIV+EEDE  S +KVV+ TPTN +G GG
Sbjct: 165 SHLRR-----NQRTESKTETSSMRVGGGGGIDVIVEEEDE--SPNKVVSETPTNGIGGGG 217

Query: 602 GSSFGRKVLRSRSVGCGNRSFSSD---RISNGFGDCALRRIESQREYSTKVSCNSGDEAG 772
           GSSFGRKVLRSRSVGCG+RSFS D   RISNGFGDCALRRIESQRE +  +S   G EA 
Sbjct: 218 GSSFGRKVLRSRSVGCGSRSFSGDFFERISNGFGDCALRRIESQREATKVISNGGGGEAA 277

Query: 773 DAMSETVKCGGIFGGFMIMTPSASTSTSASSTVVHHH 883
           DAMSE VKCGGIFGGFMIMT S++TS++ SSTV HHH
Sbjct: 278 DAMSEMVKCGGIFGGFMIMTSSSTTSSTTSSTVDHHH 314


 Score =  110 bits (273), Expect = 4e-022
 Identities = 60/124 (48%), Positives = 81/124 (65%), Gaps = 7/124 (5%)
 Frame = +2

Query:  89 IMNLKD-QDMGEGMQCITHPYTKNPGGICALCLQDKLGKLVTSSFPLSKPNHVPSSSSSS 265
           ++ LKD QDMGEGMQCITHPYTKNPGGICALCLQ+KLGKLVTSSFP+ KPNH+ SSS  S
Sbjct:   1 MVELKDQQDMGEGMQCITHPYTKNPGGICALCLQEKLGKLVTSSFPVPKPNHLSSSSPKS 60

Query: 266 SSSANLVYKRSKSMAASYGESFSQRKRSGFWSFLHLYSSKQHQVASTTKKANNVSHSSRP 445
            + +      + S+A S   + + R  +   +   L + K+  + + +  +++ S SS  
Sbjct:  61 FTPS------TTSLALSLSSASNGRDSTNNNNLPFLLAKKKKNMLAASSSSSSSSSSSSS 114

Query: 446 RNKI 457
            N I
Sbjct: 115 ANLI 118

>gi|30680042|ref|NP_187343.2| proline-rich family protein [Arabidopsis
        thaliana]

          Length = 214

 Score =  142 bits (357), Expect = 8e-032
 Identities = 79/112 (70%), Positives = 86/112 (76%), Gaps = 5/112 (4%)
 Frame = -3

Query: 828 IIINPPKIPPHFTVSLIASPASSPLLQLTFVEYSLCDSILLKAQSPNPLEILSL---EKL 658
           +IINPPK+PPH T+SLIAS AS P   L  +  SLCDSILLKAQSPNPLEILS    EKL
Sbjct:   1 MIINPPKMPPHLTISLIASAASPPPPLLMTLVASLCDSILLKAQSPNPLEILSKKSPEKL 60

Query: 657 LFPHPTDLDLNTFLPKEDPPPLPTLFVGVAATTLSLELLSSSSLTITSMPPP 502
           L P PTDLDLNTFLP ++PPP P  FVGV+ TTL    LSSSS TITSMPPP
Sbjct:  61 LLPQPTDLDLNTFLPNDEPPPPPIPFVGVSETTLL--GLSSSSSTITSMPPP 110


 Score =  64 bits (155), Expect = 2e-008
 Identities = 29/36 (80%)
 Frame = -3

Query: 255 EEEGTWLGLERGKEEVTSLPSLSWRHNAHIPPGFFV 148
           EEE  WLGL  GKEEVTSLPS SWRH A IPPGFFV
Sbjct: 179 EEEERWLGLGTGKEEVTSLPSFSWRHKAQIPPGFFV 214

>gi|21536969|gb|AAM61310.1| unknown [Arabidopsis thaliana]

          Length = 388

 Score =  141 bits (353), Expect = 2e-031
 Identities = 96/252 (38%), Positives = 135/252 (53%), Gaps = 34/252 (13%)
 Frame = +2

Query: 149 TKNPGGICALCLQDKLGKLVTSSFPLSKPNHVPSSSSSSSSSANLVYKRSKSMAASYGES 328
           T N        L  K  K++T+S          SSSS++++      + +++   +YG+S
Sbjct:  81 TNNNNSKLPFLLAKKKKKMLTAS----------SSSSTTANIVYKRSQSTRTTKTTYGDS 130

Query: 329 -FSQRKRSGFWSFLHLYSSKQHQVASTTKKANNVSHSSRPRNKID----------NGKHQ 475
             S RKR+GFWSF HLYSSKQH   S+ K  N     S+   K +          +    
Sbjct: 131 DLSPRKRNGFWSFFHLYSSKQH--GSSKKVGNFHQPISQTETKTELAETTTVGSSSSSSA 188

Query: 476 TTETSNQVVGGGIDVIVKEEDERSSSDKVVAATPTNSVGSGGGSSFGRKVLRSRSVGCGN 655
           ++  S +VVGGG          R+  D +V    + ++     +   RKV RSRSVGCG+
Sbjct: 189 SSSMSKRVVGGG-----GSSSNRNGIDVIVEEDGSPNIEV---TPSERKVSRSRSVGCGS 240

Query: 656 RSFSSD---RISNGFGDCALRRIESQREYSTKVSCNSGDEAGDAMSETVKCGGIFGGFMI 826
           RSFS D   RI+NGFGDC LRR+ESQRE +          + + + E V+CGGIFGGFMI
Sbjct: 241 RSFSGDFFERITNGFGDCTLRRVESQREGNNNKGNKVSSNSSNGVREMVRCGGIFGGFMI 300

Query: 827 MTPSASTSTSAS 862
           MT S+S+S+S+S
Sbjct: 301 MTSSSSSSSSSS 312


 Score =  92 bits (227), Expect = 9e-017
 Identities = 60/133 (45%), Positives = 76/133 (57%), Gaps = 7/133 (5%)
 Frame = +2

Query: 113 MGEGMQCITHPYTKNPGGICALCLQDKLGKLVTSSFPLSKPNHVPSSSSSSSSS--ANLV 286
           MG+GMQCI HP+TKNPGGICA CLQ+KLGKLVTSSFPL  P H+ SSS+SSS S  ++ V
Sbjct:   1 MGDGMQCINHPFTKNPGGICAFCLQEKLGKLVTSSFPL--PKHLTSSSTSSSPSFRSDSV 58

Query: 287 YKRSKSMAASYGESFSQRKRSGFWSFLHLYSSKQHQVASTTKKANNVSHSSRPRNKIDNG 466
              + + AA+   S S    S   +  +  S     +A   KK    S SS     I   
Sbjct:  59 GSTTTASAANLSASLS---LSVSGATNNNNSKLPFLLAKKKKKMLTASSSSSTTANIVYK 115

Query: 467 KHQTTETSNQVVG 505
           + Q+T T+    G
Sbjct: 116 RSQSTRTTKTTYG 128

>gi|18422990|ref|NP_568706.1| uncharacterized protein [Arabidopsis thaliana]

          Length = 396

 Score =  140 bits (351), Expect = 4e-031
 Identities = 96/252 (38%), Positives = 134/252 (53%), Gaps = 34/252 (13%)
 Frame = +2

Query: 149 TKNPGGICALCLQDKLGKLVTSSFPLSKPNHVPSSSSSSSSSANLVYKRSKSMAASYGES 328
           T N        L  K  K++T+S          SSSS++++      + +++   +YG+S
Sbjct:  89 TNNNNSKLPFLLAKKKKKMLTAS----------SSSSTTANIVYKRSQSTRTTKTTYGDS 138

Query: 329 -FSQRKRSGFWSFLHLYSSKQHQVASTTKKANNVSHSSRPRNKID----------NGKHQ 475
             S RKR+GFWSF HLYSSKQH   S+ K  N     S+   K +          +    
Sbjct: 139 DLSPRKRNGFWSFFHLYSSKQH--GSSKKVGNFHQPISQTETKTELAETTTVGSSSSSSA 196

Query: 476 TTETSNQVVGGGIDVIVKEEDERSSSDKVVAATPTNSVGSGGGSSFGRKVLRSRSVGCGN 655
           ++  S +VVGGG          R+  D +V    + ++     +   RKV RSRSVGCG+
Sbjct: 197 SSSMSKRVVGGG-----GSSSNRNGIDVIVEEDGSPNIEV---TPSERKVSRSRSVGCGS 248

Query: 656 RSFSSD---RISNGFGDCALRRIESQREYSTKVSCNSGDEAGDAMSETVKCGGIFGGFMI 826
           RSFS D   RI+NGFGDC LRR+ESQRE +            + + E V+CGGIFGGFMI
Sbjct: 249 RSFSGDFFERITNGFGDCTLRRVESQREGNNNKGNKVSSNPSNGVREMVRCGGIFGGFMI 308

Query: 827 MTPSASTSTSAS 862
           MT S+S+S+S+S
Sbjct: 309 MTSSSSSSSSSS 320


 Score =  101 bits (250), Expect = 2e-019
 Identities = 64/141 (45%), Positives = 82/141 (58%), Gaps = 7/141 (4%)
 Frame = +2

Query:  89 IMNLKDQDMGEGMQCITHPYTKNPGGICALCLQDKLGKLVTSSFPLSKPNHVPSSSSSSS 268
           ++  KDQDMG+GMQCI HP+TKNPGGICA CLQ+KLGKLVTSSFPL  P H+ SSS+SSS
Sbjct:   1 MVEAKDQDMGDGMQCINHPFTKNPGGICAFCLQEKLGKLVTSSFPL--PKHLTSSSTSSS 58

Query: 269 SS--ANLVYKRSKSMAASYGESFSQRKRSGFWSFLHLYSSKQHQVASTTKKANNVSHSSR 442
            S  ++ V   + + AA+   S S    S   +  +  S     +A   KK    S SS 
Sbjct:  59 PSFRSDSVGSTTTASAANLSASLS---LSVSGATNNNNSKLPFLLAKKKKKMLTASSSSS 115

Query: 443 PRNKIDNGKHQTTETSNQVVG 505
               I   + Q+T T+    G
Sbjct: 116 TTANIVYKRSQSTRTTKTTYG 136

>gi|6728994|gb|AAF26991.1|AC016827_2 unknown protein [Arabidopsis thaliana]

          Length = 207

 Score =  129 bits (323), Expect = 7e-028
 Identities = 73/105 (69%), Positives = 79/105 (75%), Gaps = 5/105 (4%)
 Frame = -3

Query: 807 IPPHFTVSLIASPASSPLLQLTFVEYSLCDSILLKAQSPNPLEILSL---EKLLFPHPTD 637
           +PPH T+SLIAS AS P   L  +  SLCDSILLKAQSPNPLEILS    EKLL P PTD
Sbjct:   1 MPPHLTISLIASAASPPPPLLMTLVASLCDSILLKAQSPNPLEILSKKSPEKLLLPQPTD 60

Query: 636 LDLNTFLPKEDPPPLPTLFVGVAATTLSLELLSSSSLTITSMPPP 502
           LDLNTFLP ++PPP P  FVGV+ TTL    LSSSS TITSMPPP
Sbjct:  61 LDLNTFLPNDEPPPPPIPFVGVSETTLL--GLSSSSSTITSMPPP 103


 Score =  64 bits (155), Expect = 2e-008
 Identities = 29/36 (80%)
 Frame = -3

Query: 255 EEEGTWLGLERGKEEVTSLPSLSWRHNAHIPPGFFV 148
           EEE  WLGL  GKEEVTSLPS SWRH A IPPGFFV
Sbjct: 172 EEEERWLGLGTGKEEVTSLPSFSWRHKAQIPPGFFV 207

>gi|297795649|ref|XP_002865709.1| hypothetical protein ARALYDRAFT_917876
        [Arabidopsis lyrata subsp. lyrata]

          Length = 384

 Score =  104 bits (259), Expect = 2e-020
 Identities = 58/115 (50%), Positives = 75/115 (65%), Gaps = 4/115 (3%)
 Frame = +2

Query:  89 IMNLKDQDMGEGMQCITHPYTKNPGGICALCLQDKLGKLVTSSFPLSKPNHVPSSSSSSS 268
           ++ +KDQDMG+GMQCI HP+TKNPGGICA CLQ+KLGKLVTSSFPL  P H+ SSS+SSS
Sbjct:   1 MVEVKDQDMGDGMQCINHPFTKNPGGICAFCLQEKLGKLVTSSFPL--PKHLSSSSTSSS 58

Query: 269 SS--ANLVYKRSKSMAASYGESFSQRKRSGFWSFLHLYSSKQHQVASTTKKANNV 427
            S  ++ V   + + AAS   S S    +    FL     K+   AS++    N+
Sbjct:  59 PSFRSDSVGSTTTASAASLSLSVSGATNNNKLPFLLAKKKKKMLTASSSATTANI 113


 Score =  101 bits (249), Expect = 3e-019
 Identities = 51/85 (60%), Positives = 62/85 (72%), Gaps = 3/85 (3%)
 Frame = +2

Query: 617 RKVLRSRSVGCGNRSFSSD---RISNGFGDCALRRIESQREYSTKVSCNSGDEAGDAMSE 787
           RKV RSRSVGCG+RSFS D   RI+NGFGDC LRR+ESQRE +            + + E
Sbjct: 227 RKVSRSRSVGCGSRSFSGDFFERITNGFGDCTLRRVESQREGNNNKGNKVSSNPSNGVRE 286

Query: 788 TVKCGGIFGGFMIMTPSASTSTSAS 862
            V+CGGIFGGFMIMT S+S+S+S+S
Sbjct: 287 MVRCGGIFGGFMIMTSSSSSSSSSS 311

>gi|255551795|ref|XP_002516943.1| conserved hypothetical protein [Ricinus
        communis]

          Length = 450

 Score =  82 bits (200), Expect = 1e-013
 Identities = 39/60 (65%), Positives = 46/60 (76%), Gaps = 2/60 (3%)
 Frame = +2

Query:  92 MNLKDQDMGEGMQCITHPYTKNPGGICALCLQDKLGKLVTSSFPLSKPNHVPSSSSSSSS 271
           + + ++DMG+GMQC  HPY  NPGGICA CLQ+KLGKLV+SSFPL  P    SSSSSS S
Sbjct:  23 VGIGEEDMGDGMQCSDHPYRNNPGGICAFCLQEKLGKLVSSSFPL--PIRASSSSSSSPS 80

>gi|224107110|ref|XP_002314378.1| predicted protein [Populus trichocarpa]

          Length = 389

 Score =  80 bits (196), Expect = 4e-013
 Identities = 50/118 (42%), Positives = 62/118 (52%), Gaps = 3/118 (2%)
 Frame = +2

Query: 104 DQDMGEGMQCITHPYTKNPGGICALCLQDKLGKLVTSSFPLSKPNHVPSSSSSSSSSANL 283
           ++D+G+GMQC  HPY  NPGGICA CLQ+KLGKLV+SSFPL       SSSSSSS S   
Sbjct:   1 EEDLGDGMQCSDHPYRNNPGGICAFCLQEKLGKLVSSSFPLPIRG---SSSSSSSPSFRS 57

Query: 284 VYKRSKSMAASYGESFSQRKRSGFWSFLHLYSSKQHQVASTTKKANNVSHSSRPRNKI 457
           V     S     G S S   R       +   S  H     T++A      ++ + KI
Sbjct:  58 VIGVGGSSNVGAGTSLSLAARPTTTKCRNDGGSNSHYQEYYTRRARIPFLLAKKKKKI 115

>gi|225431743|ref|XP_002270026.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 420

 Score =  77 bits (189), Expect = 2e-012
 Identities = 38/56 (67%), Positives = 43/56 (76%), Gaps = 3/56 (5%)
 Frame = +2

Query: 104 DQDMGEGMQCITHPYTKNPGGICALCLQDKLGKLVTSSFPLSKPNHVPSSSSSSSS 271
           + D+GEGMQC  HPY  NPGGICA CLQ+KLGKLV+SSFP +     PSSSSSS S
Sbjct:   9 EDDVGEGMQCSDHPYRNNPGGICAFCLQEKLGKLVSSSFPNA---IFPSSSSSSPS 61

>gi|255645721|gb|ACU23354.1| unknown [Glycine max]

          Length = 324

 Score =  71 bits (172), Expect = 2e-010
 Identities = 39/87 (44%), Positives = 52/87 (59%), Gaps = 10/87 (11%)
 Frame = +2

Query: 101 KDQDMGEGMQCITHPY----TKNPGGICALCLQDKLGKLVTSSFPLSKPNHVPSSSSSSS 268
           +  ++ +GMQC+ HP+      NPGGICALCLQDKL  L++SSFP S P   P SSSSSS
Sbjct:   7 RHNEISDGMQCMNHPHRNNNNNNPGGICALCLQDKLRNLLSSSFPTSSP---PFSSSSSS 63

Query: 269 SSANLVYKRSKSMAASYGESFSQRKRS 349
           S +   +  S S+   +   +    RS
Sbjct:  64 SPS---FTSSSSVKTDHDHDYDHYTRS 87

>gi|296083358|emb|CBI22994.3| unnamed protein product [Vitis vinifera]

          Length = 387

 Score =  67 bits (163), Expect = 3e-009
 Identities = 43/118 (36%), Positives = 62/118 (52%), Gaps = 7/118 (5%)
 Frame = +2

Query:  23 ERERAQRFWFCLFLLPCVNLFVIMNLK---DQDMGEGMQCITHPYTKNPGGICALCLQDK 193
           +R   + F+ CL    C   +V+  ++   + D+GEGMQC  HPY  NPGGICA CLQ+K
Sbjct:  18 QRSFFELFFVCLRGEACEVGWVMEGVRGGGEDDVGEGMQCSDHPYRNNPGGICAFCLQEK 77

Query: 194 LGKLVTSSFPLSKPNHVPSSSSSSSSSANLVYKRSKSMAAS----YGESFSQRKRSGF 355
           LGKL+     +          +SS+S +      S S +AS    Y  ++S+R R  F
Sbjct:  78 LGKLIGGGAGVGVGVGGGGGGASSTSLSVRPTSSSSSYSASKDCHYHGNYSRRARIPF 135

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,352,524,086,339
Number of Sequences: 15229318
Number of Extensions: 1352524086339
Number of Successful Extensions: 374284986
Number of sequences better than 0.0: 0