Library    |     Search    |     Batch query    |     SNP    |     SSR  

GenBank blast output of UN74902


BLASTX 7.6.2

Query= UN74902 /QuerySize=456
        (455 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|13449316|ref|NP_085498.1| hypothetical protein ArthMp028 [Ara...    123   6e-027
gi|4539406|emb|CAB40039.1| putative retrotransposon [Arabidopsis...    107   5e-022
gi|4895171|gb|AAD32759.1| putative retroelement pol polyprotein ...    103   7e-021
gi|4388818|gb|AAD19773.1| putative retroelement pol polyprotein ...    103   7e-021
gi|4567266|gb|AAD23679.1| putative retroelement pol polyprotein ...    101   3e-020
gi|6996255|emb|CAB75481.1| copia-like polyprotein [Arabidopsis t...    101   4e-020
gi|4982475|gb|AAD36943.1|AF069441_3 putative polyprotein [Arabid...    100   7e-020
gi|4567277|gb|AAD23690.1| putative retroelement pol polyprotein ...     94   5e-018
gi|6623973|gb|AAF19226.1|AC007505_2 Highly similar to Ta1-3 poly...     89   1e-016
gi|9294244|dbj|BAB02146.1| copia retroelement pol polyprotein-li...     87   6e-016
gi|1345510|emb|CAA37918.1| unnamed protein product [Arabidopsis ...     85   2e-015
gi|99719|pir||S23319 hypothetical protein 2 - Arabidopsis thalia...     83   7e-015
gi|16534|emb|CAA31653.1| polyprotein [Arabidopsis thaliana]             82   1e-014
gi|3402755|emb|CAA20201.1| putative transposable element [Arabid...     80   5e-014
gi|147836446|emb|CAN62092.1| hypothetical protein VITISV_022473 ...     80   5e-014
gi|13699782|gb|AAK38381.1|AC079028_5 polyprotein, putative [Arab...     80   6e-014
gi|109391001|emb|CAJ09951.2| putative gag-pol polyprotein [Citru...     80   6e-014
gi|20198271|gb|AAM15487.1| hypothetical protein [Arabidopsis tha...     79   1e-013
gi|38344889|emb|CAD41912.2| OSJNBa0033G05.13 [Oryza sativa Japon...     76   1e-012
gi|116317760|emb|CAH65740.1| OSIGBa0127D24.3 [Oryza sativa Indic...     76   1e-012

>gi|13449316|ref|NP_085498.1| hypothetical protein ArthMp028 [Arabidopsis
        thaliana]

          Length = 145

 Score =  123 bits (308), Expect = 6e-027
 Identities = 57/92 (61%), Positives = 70/92 (76%)
 Frame = -2

Query: 388 RVVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKLLVKKGYIDG 209
           R +L+G ++DSLY LQG V   E+ + E  KDET LWHSRL HMSQ+G++LLVKKG++D 
Sbjct:  36 RTILKGNRHDSLYILQGSVETGESNLAETAKDETRLWHSRLAHMSQRGMELLVKKGFLDS 95

Query: 208 KKVSSLYFCEDSIYGKAHRVSFGV*KHTTKLP 113
            KVSSL FCED IYGK HRV+F   +HTTK P
Sbjct:  96 SKVSSLKFCEDCIYGKTHRVNFSTGQHTTKNP 127

>gi|4539406|emb|CAB40039.1| putative retrotransposon [Arabidopsis thaliana]

          Length = 1230

 Score =  107 bits (266), Expect = 5e-022
 Identities = 52/101 (51%), Positives = 68/101 (67%), Gaps = 2/101 (1%)
 Frame = -2

Query: 388 RVVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKLLVKKGYIDG 209
           R +L G +++ LY LQGK   + ++  E + D+T LWH RLGH+SQK + +LVKKGY+DG
Sbjct: 389 RTLLIGSRHEKLYLLQGKPEVSHSMTVERRNDDTVLWHRRLGHISQKNMDILVKKGYLDG 448

Query: 208 KKVSSLYFCEDSIYGKAHRVSFGV*KHTT--KLPFLQEKKW 92
           KKVS L  CED IYGKA R+SF V  H T  KL ++    W
Sbjct: 449 KKVSKLELCEDCIYGKARRLSFVVATHNTEDKLNYVHSDLW 489

>gi|4895171|gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsis
        thaliana]

          Length = 1356

 Score =  103 bits (256), Expect = 7e-021
 Identities = 51/101 (50%), Positives = 69/101 (68%), Gaps = 2/101 (1%)
 Frame = -2

Query: 388 RVVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKLLVKKGYIDG 209
           +V+L G++YD+LY L  K V +E+L    + D+T LWH RL HMSQK +++LV+KG++D 
Sbjct: 405 QVLLTGRRYDTLYLLNWKPVASESLAVVKRADDTVLWHQRLCHMSQKNMEILVRKGFLDK 464

Query: 208 KKVSSLYFCEDSIYGKAHRVSFGV*KHTT--KLPFLQEKKW 92
           KKVSSL  CED IYGKA R SF +  H T  KL ++    W
Sbjct: 465 KKVSSLDVCEDCIYGKAKRKSFSLAHHDTKEKLEYIHSDLW 505

>gi|4388818|gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsis
        thaliana]

          Length = 1335

 Score =  103 bits (256), Expect = 7e-021
 Identities = 49/99 (49%), Positives = 68/99 (68%), Gaps = 2/99 (2%)
 Frame = -2

Query: 382 VLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKLLVKKGYIDGKK 203
           +L+G++ D+LY L G   E E+  + + KDET+LWHSRLGHMSQKG+++LVKKG +  + 
Sbjct: 383 ILKGQKRDTLYILDGVTEEGESHSSAEVKDETALWHSRLGHMSQKGMEILVKKGCLRREV 442

Query: 202 VSSLYFCEDSIYGKAHRVSFGV*KHTT--KLPFLQEKKW 92
           +  L FCED +YGK HRVSF   +H T  KL ++    W
Sbjct: 443 IKELEFCEDCVYGKQHRVSFAPAQHVTKEKLAYVHSDLW 481

>gi|4567266|gb|AAD23679.1| putative retroelement pol polyprotein [Arabidopsis
        thaliana]

          Length = 838

 Score =  101 bits (250), Expect = 3e-020
 Identities = 47/99 (47%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
 Frame = -2

Query: 382 VLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKLLVKKGYIDGKK 203
           +L+GK+  +LY LQG VV   A      KDE+ +WHSRL HMSQ+ + +L+KKG +  +K
Sbjct: 403 LLKGKKVGTLYLLQGVVVTGNANAVTSSKDESKIWHSRLCHMSQRNIDVLIKKGCLQAEK 462

Query: 202 VSSLYFCEDSIYGKAHRVSFGV*KHTT--KLPFLQEKKW 92
           ++ L FCED +YGK HRV FG  KH T  KL ++    W
Sbjct: 463 INGLEFCEDCVYGKTHRVGFGSAKHVTREKLEYIHSDLW 501

>gi|6996255|emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana]

          Length = 1363

 Score =  101 bits (249), Expect = 4e-020
 Identities = 48/101 (47%), Positives = 66/101 (65%), Gaps = 2/101 (1%)
 Frame = -2

Query: 388 RVVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKLLVKKGYIDG 209
           +V+L G++YD+LY L GK    E+L      D+T LWH RL HMSQK + LL+KKG++D 
Sbjct: 412 QVLLEGRRYDTLYILHGKPATDESLAVARANDDTVLWHRRLCHMSQKNMSLLIKKGFLDK 471

Query: 208 KKVSSLYFCEDSIYGKAHRVSFGV*KHTT--KLPFLQEKKW 92
           KKVS L  CED IYG+A ++ F + +H T  KL ++    W
Sbjct: 472 KKVSMLDTCEDCIYGRAKKIGFNLAQHDTKKKLEYVHSDLW 512

>gi|4982475|gb|AAD36943.1|AF069441_3 putative polyprotein [Arabidopsis
        thaliana]

          Length = 778

 Score =  100 bits (247), Expect = 7e-020
 Identities = 51/105 (48%), Positives = 66/105 (62%), Gaps = 5/105 (4%)
 Frame = -2

Query: 388 RVVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKLLVKKGYIDG 209
           +V+L G++YD+LY L GK    E+L      D+  LWH RL HMSQK + LLVKKG++D 
Sbjct: 295 QVLLEGRRYDTLYILHGKPATDESLAVAKANDDIVLWHRRLCHMSQKNMSLLVKKGFLDK 354

Query: 208 KKVSSLYFCEDSIYGKAHRVSFGV*KHTTKLPFLQEKKWHQHWSL 74
           KKVS L  CED IYGKA ++ F   +H TK     EK  + H+ L
Sbjct: 355 KKVSMLDTCEDCIYGKAKKIGFNFAQHDTK-----EKLEYVHYDL 394

>gi|4567277|gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsis
        thaliana]

          Length = 1333

 Score =  94 bits (231), Expect = 5e-018
 Identities = 48/100 (48%), Positives = 65/100 (65%), Gaps = 2/100 (2%)
 Frame = -2

Query: 385 VVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKLLVKKGYIDGK 206
           V+L  ++  +LY LQ + V  E+L    ++D+T LWH RLGHMSQK + LL+KKG +D K
Sbjct: 384 VLLTVRRCYTLYLLQWRPVTEESLSVVKRQDDTILWHRRLGHMSQKNMDLLLKKGLLDKK 443

Query: 205 KVSSLYFCEDSIYGKAHRVSFGV*KHTT--KLPFLQEKKW 92
           KVS L  CED IYGKA R+ F + +H T  KL ++    W
Sbjct: 444 KVSKLETCEDCIYGKAKRIGFNLAQHDTREKLEYVHSDLW 483

>gi|6623973|gb|AAF19226.1|AC007505_2 Highly similar to Ta1-3 polyprotein
        [Arabidopsis thaliana]

          Length = 1356

 Score =  89 bits (219), Expect = 1e-016
 Identities = 46/110 (41%), Positives = 67/110 (60%), Gaps = 3/110 (2%)
 Frame = -2

Query: 412 GRKQWYQ-RRVVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKL 236
           G+ ++++  +  LRG   + LY L G  V +E    E  K +T+LWHSRLGHMS   LK+
Sbjct: 395 GKVRYFKNNKTALRGSLSNGLYVLDGSTVMSELCNAETDKVKTALWHSRLGHMSMNNLKV 454

Query: 235 LVKKGYIDGKKVSSLYFCEDSIYGKAHRVSFGV*KHTTK--LPFLQEKKW 92
           L  KG ID K+++ L FCE  + GK+ +VSF V KHT++  L ++    W
Sbjct: 455 LAGKGLIDRKEINELEFCEHCVMGKSKKVSFNVGKHTSEDALSYVHADLW 504

>gi|9294244|dbj|BAB02146.1| copia retroelement pol polyprotein-like [Arabidopsis
        thaliana]

          Length = 526

 Score =  87 bits (213), Expect = 6e-016
 Identities = 41/88 (46%), Positives = 55/88 (62%)
 Frame = -2

Query: 382 VLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKLLVKKGYIDGKK 203
           ++ GK  D  Y+L+G VV  E  +     D T LWHSRLGHMS K + +LVK+GY+ GK+
Sbjct: 374 IISGKYQDGRYYLEGNVVNGEFAVARPDVDMTRLWHSRLGHMSLKNMNVLVKEGYLLGKE 433

Query: 202 VSSLYFCEDSIYGKAHRVSFGV*KHTTK 119
           V+ L  CE  +  K+ + SF   KHTTK
Sbjct: 434 VTKLELCESCVLRKSDKQSFPTAKHTTK 461

>gi|1345510|emb|CAA37918.1| unnamed protein product [Arabidopsis thaliana]

          Length = 560

 Score =  85 bits (208), Expect = 2e-015
 Identities = 41/110 (37%), Positives = 66/110 (60%), Gaps = 3/110 (2%)
 Frame = -2

Query: 412 GRKQWYQ-RRVVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKL 236
           G+ ++++ ++  LRG+  + LY L G  V +E  + E  K +T LWHSRLGHM    +K+
Sbjct: 393 GKVRYFKNQKTALRGEIVNGLYILDGNTVLSETCVAEGSKGKTELWHSRLGHMGLNNMKV 452

Query: 235 LVKKGYIDGKKVSSLYFCEDSIYGKAHRVSFGV*KHTTK--LPFLQEKKW 92
           L  KG +  +++  L FCE+ + GKA +VSF V KH ++  L ++    W
Sbjct: 453 LAGKGLVSKEEIRELDFCENCVMGKAKKVSFNVGKHNSEYVLSYVHADLW 502

>gi|99719|pir||S23319 hypothetical protein 2 - Arabidopsis thaliana
        retrotransposon Ta1-2 (strain Landsberg) (fragment)

          Length = 1084

 Score =  83 bits (204), Expect = 7e-015
 Identities = 39/110 (35%), Positives = 66/110 (60%), Gaps = 3/110 (2%)
 Frame = -2

Query: 412 GRKQWYQ-RRVVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKL 236
           G+ ++++ ++  LRG+  + LY L G  + +E  + E  K +T LWHSRLGHM    +K+
Sbjct: 305 GKVRYFKNQKTALRGEIVNGLYILDGNTILSETCVAEGSKGKTELWHSRLGHMGLNNMKV 364

Query: 235 LVKKGYIDGKKVSSLYFCEDSIYGKAHRVSFGV*KHTTK--LPFLQEKKW 92
           L  KG +  +++  L FCE+ + GKA +VSF + KH ++  L ++    W
Sbjct: 365 LAGKGLVSKEEIRELDFCENCVMGKAKKVSFNMGKHNSEYVLSYVHADLW 414

>gi|16534|emb|CAA31653.1| polyprotein [Arabidopsis thaliana]

          Length = 1291

 Score =  82 bits (202), Expect = 1e-014
 Identities = 38/99 (38%), Positives = 62/99 (62%), Gaps = 1/99 (1%)
 Frame = -2

Query: 412 GRKQWYQ-RRVVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKL 236
           G+ ++++ ++  LRG+  + LY L G  V +E  + E  K +T LWHSRLGH+    +K+
Sbjct: 405 GKVRYFKNQKTALRGELVNGLYILDGNTVLSETCVAEGSKGKTELWHSRLGHIGLNNMKV 464

Query: 235 LVKKGYIDGKKVSSLYFCEDSIYGKAHRVSFGV*KHTTK 119
           L  KG +  +++  L FCE+ + GKA +VSF V KH ++
Sbjct: 465 LAGKGLVSKEEIRVLDFCENCVMGKAKKVSFNVGKHNSE 503

>gi|3402755|emb|CAA20201.1| putative transposable element [Arabidopsis
        thaliana]

          Length = 1308

 Score =  80 bits (197), Expect = 5e-014
 Identities = 41/116 (35%), Positives = 64/116 (55%), Gaps = 3/116 (2%)
 Frame = -2

Query: 412 GRKQWY-QRRVVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKL 236
           G+ ++Y + +  L G   + LY L G  V  E    E   ++T LWH RLGHMS   +K+
Sbjct: 396 GKVRFYKENKTALCGNLVNGLYVLDGHTVVNENCNVEGSNEKTELWHCRLGHMSLNNMKI 455

Query: 235 LVKKGYIDGKKVSSLYFCEDSIYGKAHRVSFGV*KHTTK--LPFLQEKKWHQHWSL 74
           L +KG ++ K +  L FCE+ + GK+ ++SF V KH T   L ++    W + + L
Sbjct: 456 LAEKGLLEKKDIKELSFCENCVMGKSKKLSFNVGKHITDEVLGYIHADLWGKQYFL 511

>gi|147836446|emb|CAN62092.1| hypothetical protein VITISV_022473 [Vitis
        vinifera]

          Length = 318

 Score =  80 bits (197), Expect = 5e-014
 Identities = 41/91 (45%), Positives = 54/91 (59%), Gaps = 2/91 (2%)
 Frame = -2

Query: 385 VVLRGKQYDSLYFLQGKVV--ETEALITEDKKDETSLWHSRLGHMSQKGLKLLVKKGYID 212
           VV++G + + LY LQG  +       I+E   + T LWH RLGHMS KGL +L K+G +D
Sbjct: 124 VVMKGNKINGLYTLQGGTIIGAVVVSISESIIETTRLWHMRLGHMSDKGLTILSKRGLLD 183

Query: 211 GKKVSSLYFCEDSIYGKAHRVSFGV*KHTTK 119
           G+K   L FCE  ++GK  RV F    H TK
Sbjct: 184 GQKTGELDFCEHCVFGKQCRVKFSAGVHRTK 214

>gi|13699782|gb|AAK38381.1|AC079028_5 polyprotein, putative [Arabidopsis
        thaliana]

          Length = 855

 Score =  80 bits (196), Expect = 6e-014
 Identities = 43/110 (39%), Positives = 62/110 (56%), Gaps = 3/110 (2%)
 Frame = -2

Query: 412 GRKQWYQ-RRVVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKL 236
           G+ ++Y+  +  LRG     LY L G  V  E+ I E  K+ T+LWHSRLGHM    +K+
Sbjct:  78 GQVRYYKNNKTALRGSLSGGLYVLDGNTVIAESCIAERSKELTTLWHSRLGHMGGNNMKI 137

Query: 235 LVKKGYIDGKKVSSLYFCEDSIYGKAHRVSFGV*KHTTK--LPFLQEKKW 92
           L  KG I   + +SL F E  + GKA +VSF + KH ++  L ++    W
Sbjct: 138 LAGKGLIKPSEATSLEFYEHCVMGKAKKVSFNIGKHNSEEILSYVHADLW 187

>gi|109391001|emb|CAJ09951.2| putative gag-pol polyprotein [Citrus sinensis]

          Length = 1334

 Score =  80 bits (196), Expect = 6e-014
 Identities = 39/90 (43%), Positives = 56/90 (62%), Gaps = 1/90 (1%)
 Frame = -2

Query: 385 VVLRGKQYDSLYFLQGKVVET-EALITEDKKDETSLWHSRLGHMSQKGLKLLVKKGYIDG 209
           +V++G   + LY LQG  V   E +    ++D T LWH RLGHMS KGL+ L K+G + G
Sbjct: 390 IVMKGVNENGLYVLQGSSVPVQEGVSAVSEEDRTKLWHLRLGHMSIKGLQELSKQGLLGG 449

Query: 208 KKVSSLYFCEDSIYGKAHRVSFGV*KHTTK 119
            ++  L FCE+ I+GK+HR  F   +H +K
Sbjct: 450 DRIQQLEFCENCIFGKSHRSKFNKGEHMSK 479

>gi|20198271|gb|AAM15487.1| hypothetical protein [Arabidopsis thaliana]

          Length = 122

 Score =  79 bits (194), Expect = 1e-013
 Identities = 39/75 (52%), Positives = 50/75 (66%), Gaps = 3/75 (4%)
 Frame = -2

Query: 307 EDKKDETSLWHSRLGHMSQKGLKLLVKKGYIDGKKVSSLYFCEDSIYGKAHRVSFGV*KH 128
           E + D T LWH RLGHMSQK + LLVK+G++D KKVS+    ED IYG+A RVSF + +H
Sbjct:  39 EKRVDNTVLWHQRLGHMSQKNMDLLVKRGFLDRKKVSTFSIFEDCIYGRAKRVSFDLAQH 98

Query: 127 TT---KLPFLQEKKW 92
            T   KL ++    W
Sbjct:  99 DTKEEKLDYVHSDLW 113

>gi|38344889|emb|CAD41912.2| OSJNBa0033G05.13 [Oryza sativa Japonica Group]

          Length = 1181

 Score =  76 bits (185), Expect = 1e-012
 Identities = 40/94 (42%), Positives = 52/94 (55%), Gaps = 5/94 (5%)
 Frame = -2

Query: 358 SLYFLQGKVVETEALITED---KKDETSLWHSRLGHMSQKGLKLLVKKGYIDGKKVSSLY 188
           +LY LQG  +        D     D T+LWH RLGHMS+ GL  L K+G +DG+ +S L 
Sbjct: 407 NLYHLQGTTILGNVATVSDSLSNSDATNLWHMRLGHMSEIGLAELSKRGLLDGQSISKLK 466

Query: 187 FCEDSIYGKAHRVSFGV*KHTTK--LPFLQEKKW 92
           FCE  I+GK  RV F    HTT+  L ++    W
Sbjct: 467 FCEHCIFGKHKRVKFNTSTHTTEGILDYVHSDLW 500

>gi|116317760|emb|CAH65740.1| OSIGBa0127D24.3 [Oryza sativa Indica Group]

          Length = 1009

 Score =  76 bits (185), Expect = 1e-012
 Identities = 39/94 (41%), Positives = 52/94 (55%), Gaps = 5/94 (5%)
 Frame = -2

Query: 358 SLYFLQGKVVETEALITED---KKDETSLWHSRLGHMSQKGLKLLVKKGYIDGKKVSSLY 188
           +LY L+G  +        D     D T+LWH RLGHMS+ GL  L K+G +DG+ +  L 
Sbjct: 228 NLYHLRGTPILGNVAAVSDSLSNSDATNLWHMRLGHMSEIGLAELSKRGMLDGQSIGKLI 287

Query: 187 FCEDSIYGKAHRVSFGV*KHTTK--LPFLQEKKW 92
           FCE  I+GK HRV F    HTT+  L ++    W
Sbjct: 288 FCEHCIFGKHHRVKFNTSTHTTEGILDYVHSDLW 321

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 798,366,029,682
Number of Sequences: 15229318
Number of Extensions: 798366029682
Number of Successful Extensions: 240520672
Number of sequences better than 0.0: 0