BLASTX 7.6.2
Query= UN74902 /QuerySize=456
(455 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value
gi|13449316|ref|NP_085498.1| hypothetical protein ArthMp028 [Ara...    123   6e-027
gi|4539406|emb|CAB40039.1| putative retrotransposon [Arabidopsis...    107   5e-022
gi|4895171|gb|AAD32759.1| putative retroelement pol polyprotein ...    103   7e-021
gi|4388818|gb|AAD19773.1| putative retroelement pol polyprotein ...    103   7e-021
gi|4567266|gb|AAD23679.1| putative retroelement pol polyprotein ...    101   3e-020
gi|6996255|emb|CAB75481.1| copia-like polyprotein [Arabidopsis t...    101   4e-020
gi|4982475|gb|AAD36943.1|AF069441_3 putative polyprotein [Arabid...    100   7e-020
gi|4567277|gb|AAD23690.1| putative retroelement pol polyprotein ...     94   5e-018
gi|6623973|gb|AAF19226.1|AC007505_2 Highly similar to Ta1-3 poly...     89   1e-016
gi|9294244|dbj|BAB02146.1| copia retroelement pol polyprotein-li...     87   6e-016
gi|1345510|emb|CAA37918.1| unnamed protein product [Arabidopsis ...     85   2e-015
gi|99719|pir||S23319 hypothetical protein 2 - Arabidopsis thalia...     83   7e-015
gi|16534|emb|CAA31653.1| polyprotein [Arabidopsis thaliana]             82   1e-014
gi|3402755|emb|CAA20201.1| putative transposable element [Arabid...     80   5e-014
gi|147836446|emb|CAN62092.1| hypothetical protein VITISV_022473 ...     80   5e-014
gi|13699782|gb|AAK38381.1|AC079028_5 polyprotein, putative [Arab...     80   6e-014
gi|109391001|emb|CAJ09951.2| putative gag-pol polyprotein [Citru...     80   6e-014
gi|20198271|gb|AAM15487.1| hypothetical protein [Arabidopsis tha...     79   1e-013
gi|38344889|emb|CAD41912.2| OSJNBa0033G05.13 [Oryza sativa Japon...     76   1e-012
gi|116317760|emb|CAH65740.1| OSIGBa0127D24.3 [Oryza sativa Indic...     76   1e-012
>gi|13449316|ref|NP_085498.1| hypothetical protein ArthMp028 [Arabidopsis
thaliana]
Length = 145
Score = 123 bits (308), Expect = 6e-027
Identities = 57/92 (61%), Positives = 70/92 (76%)
Frame = -2
Query: 388 RVVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKLLVKKGYIDG 209
R +L+G ++DSLY LQG V E+ + E KDET LWHSRL HMSQ+G++LLVKKG++D
Sbjct: 36 RTILKGNRHDSLYILQGSVETGESNLAETAKDETRLWHSRLAHMSQRGMELLVKKGFLDS 95
Query: 208 KKVSSLYFCEDSIYGKAHRVSFGV*KHTTKLP 113
KVSSL FCED IYGK HRV+F +HTTK P
Sbjct: 96 SKVSSLKFCEDCIYGKTHRVNFSTGQHTTKNP 127
>gi|4539406|emb|CAB40039.1| putative retrotransposon [Arabidopsis thaliana]
Length = 1230
Score = 107 bits (266), Expect = 5e-022
Identities = 52/101 (51%), Positives = 68/101 (67%), Gaps = 2/101 (1%)
Frame = -2
Query: 388 RVVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKLLVKKGYIDG 209
R +L G +++ LY LQGK + ++ E + D+T LWH RLGH+SQK + +LVKKGY+DG
Sbjct: 389 RTLLIGSRHEKLYLLQGKPEVSHSMTVERRNDDTVLWHRRLGHISQKNMDILVKKGYLDG 448
Query: 208 KKVSSLYFCEDSIYGKAHRVSFGV*KHTT--KLPFLQEKKW 92
KKVS L CED IYGKA R+SF V H T KL ++ W
Sbjct: 449 KKVSKLELCEDCIYGKARRLSFVVATHNTEDKLNYVHSDLW 489
>gi|4895171|gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsis
thaliana]
Length = 1356
Score = 103 bits (256), Expect = 7e-021
Identities = 51/101 (50%), Positives = 69/101 (68%), Gaps = 2/101 (1%)
Frame = -2
Query: 388 RVVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKLLVKKGYIDG 209
+V+L G++YD+LY L K V +E+L + D+T LWH RL HMSQK +++LV+KG++D
Sbjct: 405 QVLLTGRRYDTLYLLNWKPVASESLAVVKRADDTVLWHQRLCHMSQKNMEILVRKGFLDK 464
Query: 208 KKVSSLYFCEDSIYGKAHRVSFGV*KHTT--KLPFLQEKKW 92
KKVSSL CED IYGKA R SF + H T KL ++ W
Sbjct: 465 KKVSSLDVCEDCIYGKAKRKSFSLAHHDTKEKLEYIHSDLW 505
>gi|4388818|gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsis
thaliana]
Length = 1335
Score = 103 bits (256), Expect = 7e-021
Identities = 49/99 (49%), Positives = 68/99 (68%), Gaps = 2/99 (2%)
Frame = -2
Query: 382 VLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKLLVKKGYIDGKK 203
+L+G++ D+LY L G E E+ + + KDET+LWHSRLGHMSQKG+++LVKKG + +
Sbjct: 383 ILKGQKRDTLYILDGVTEEGESHSSAEVKDETALWHSRLGHMSQKGMEILVKKGCLRREV 442
Query: 202 VSSLYFCEDSIYGKAHRVSFGV*KHTT--KLPFLQEKKW 92
+ L FCED +YGK HRVSF +H T KL ++ W
Sbjct: 443 IKELEFCEDCVYGKQHRVSFAPAQHVTKEKLAYVHSDLW 481
>gi|4567266|gb|AAD23679.1| putative retroelement pol polyprotein [Arabidopsis
thaliana]
Length = 838
Score = 101 bits (250), Expect = 3e-020
Identities = 47/99 (47%), Positives = 64/99 (64%), Gaps = 2/99 (2%)
Frame = -2
Query: 382 VLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKLLVKKGYIDGKK 203
+L+GK+ +LY LQG VV A KDE+ +WHSRL HMSQ+ + +L+KKG + +K
Sbjct: 403 LLKGKKVGTLYLLQGVVVTGNANAVTSSKDESKIWHSRLCHMSQRNIDVLIKKGCLQAEK 462
Query: 202 VSSLYFCEDSIYGKAHRVSFGV*KHTT--KLPFLQEKKW 92
++ L FCED +YGK HRV FG KH T KL ++ W
Sbjct: 463 INGLEFCEDCVYGKTHRVGFGSAKHVTREKLEYIHSDLW 501
>gi|6996255|emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana]
Length = 1363
Score = 101 bits (249), Expect = 4e-020
Identities = 48/101 (47%), Positives = 66/101 (65%), Gaps = 2/101 (1%)
Frame = -2
Query: 388 RVVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKLLVKKGYIDG 209
+V+L G++YD+LY L GK E+L D+T LWH RL HMSQK + LL+KKG++D
Sbjct: 412 QVLLEGRRYDTLYILHGKPATDESLAVARANDDTVLWHRRLCHMSQKNMSLLIKKGFLDK 471
Query: 208 KKVSSLYFCEDSIYGKAHRVSFGV*KHTT--KLPFLQEKKW 92
KKVS L CED IYG+A ++ F + +H T KL ++ W
Sbjct: 472 KKVSMLDTCEDCIYGRAKKIGFNLAQHDTKKKLEYVHSDLW 512
>gi|4982475|gb|AAD36943.1|AF069441_3 putative polyprotein [Arabidopsis
thaliana]
Length = 778
Score = 100 bits (247), Expect = 7e-020
Identities = 51/105 (48%), Positives = 66/105 (62%), Gaps = 5/105 (4%)
Frame = -2
Query: 388 RVVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKLLVKKGYIDG 209
+V+L G++YD+LY L GK E+L D+ LWH RL HMSQK + LLVKKG++D
Sbjct: 295 QVLLEGRRYDTLYILHGKPATDESLAVAKANDDIVLWHRRLCHMSQKNMSLLVKKGFLDK 354
Query: 208 KKVSSLYFCEDSIYGKAHRVSFGV*KHTTKLPFLQEKKWHQHWSL 74
KKVS L CED IYGKA ++ F +H TK EK + H+ L
Sbjct: 355 KKVSMLDTCEDCIYGKAKKIGFNFAQHDTK-----EKLEYVHYDL 394
>gi|4567277|gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsis
thaliana]
Length = 1333
Score = 94 bits (231), Expect = 5e-018
Identities = 48/100 (48%), Positives = 65/100 (65%), Gaps = 2/100 (2%)
Frame = -2
Query: 385 VVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKLLVKKGYIDGK 206
V+L ++ +LY LQ + V E+L ++D+T LWH RLGHMSQK + LL+KKG +D K
Sbjct: 384 VLLTVRRCYTLYLLQWRPVTEESLSVVKRQDDTILWHRRLGHMSQKNMDLLLKKGLLDKK 443
Query: 205 KVSSLYFCEDSIYGKAHRVSFGV*KHTT--KLPFLQEKKW 92
KVS L CED IYGKA R+ F + +H T KL ++ W
Sbjct: 444 KVSKLETCEDCIYGKAKRIGFNLAQHDTREKLEYVHSDLW 483
>gi|6623973|gb|AAF19226.1|AC007505_2 Highly similar to Ta1-3 polyprotein
[Arabidopsis thaliana]
Length = 1356
Score = 89 bits (219), Expect = 1e-016
Identities = 46/110 (41%), Positives = 67/110 (60%), Gaps = 3/110 (2%)
Frame = -2
Query: 412 GRKQWYQ-RRVVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKL 236
G+ ++++ + LRG + LY L G V +E E K +T+LWHSRLGHMS LK+
Sbjct: 395 GKVRYFKNNKTALRGSLSNGLYVLDGSTVMSELCNAETDKVKTALWHSRLGHMSMNNLKV 454
Query: 235 LVKKGYIDGKKVSSLYFCEDSIYGKAHRVSFGV*KHTTK--LPFLQEKKW 92
L KG ID K+++ L FCE + GK+ +VSF V KHT++ L ++ W
Sbjct: 455 LAGKGLIDRKEINELEFCEHCVMGKSKKVSFNVGKHTSEDALSYVHADLW 504
>gi|9294244|dbj|BAB02146.1| copia retroelement pol polyprotein-like [Arabidopsis
thaliana]
Length = 526
Score = 87 bits (213), Expect = 6e-016
Identities = 41/88 (46%), Positives = 55/88 (62%)
Frame = -2
Query: 382 VLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKLLVKKGYIDGKK 203
++ GK D Y+L+G VV E + D T LWHSRLGHMS K + +LVK+GY+ GK+
Sbjct: 374 IISGKYQDGRYYLEGNVVNGEFAVARPDVDMTRLWHSRLGHMSLKNMNVLVKEGYLLGKE 433
Query: 202 VSSLYFCEDSIYGKAHRVSFGV*KHTTK 119
V+ L CE + K+ + SF KHTTK
Sbjct: 434 VTKLELCESCVLRKSDKQSFPTAKHTTK 461
>gi|1345510|emb|CAA37918.1| unnamed protein product [Arabidopsis thaliana]
Length = 560
Score = 85 bits (208), Expect = 2e-015
Identities = 41/110 (37%), Positives = 66/110 (60%), Gaps = 3/110 (2%)
Frame = -2
Query: 412 GRKQWYQ-RRVVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKL 236
G+ ++++ ++ LRG+ + LY L G V +E + E K +T LWHSRLGHM +K+
Sbjct: 393 GKVRYFKNQKTALRGEIVNGLYILDGNTVLSETCVAEGSKGKTELWHSRLGHMGLNNMKV 452
Query: 235 LVKKGYIDGKKVSSLYFCEDSIYGKAHRVSFGV*KHTTK--LPFLQEKKW 92
L KG + +++ L FCE+ + GKA +VSF V KH ++ L ++ W
Sbjct: 453 LAGKGLVSKEEIRELDFCENCVMGKAKKVSFNVGKHNSEYVLSYVHADLW 502
>gi|99719|pir||S23319 hypothetical protein 2 - Arabidopsis thaliana
retrotransposon Ta1-2 (strain Landsberg) (fragment)
Length = 1084
Score = 83 bits (204), Expect = 7e-015
Identities = 39/110 (35%), Positives = 66/110 (60%), Gaps = 3/110 (2%)
Frame = -2
Query: 412 GRKQWYQ-RRVVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKL 236
G+ ++++ ++ LRG+ + LY L G + +E + E K +T LWHSRLGHM +K+
Sbjct: 305 GKVRYFKNQKTALRGEIVNGLYILDGNTILSETCVAEGSKGKTELWHSRLGHMGLNNMKV 364
Query: 235 LVKKGYIDGKKVSSLYFCEDSIYGKAHRVSFGV*KHTTK--LPFLQEKKW 92
L KG + +++ L FCE+ + GKA +VSF + KH ++ L ++ W
Sbjct: 365 LAGKGLVSKEEIRELDFCENCVMGKAKKVSFNMGKHNSEYVLSYVHADLW 414
>gi|16534|emb|CAA31653.1| polyprotein [Arabidopsis thaliana]
Length = 1291
Score = 82 bits (202), Expect = 1e-014
Identities = 38/99 (38%), Positives = 62/99 (62%), Gaps = 1/99 (1%)
Frame = -2
Query: 412 GRKQWYQ-RRVVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKL 236
G+ ++++ ++ LRG+ + LY L G V +E + E K +T LWHSRLGH+ +K+
Sbjct: 405 GKVRYFKNQKTALRGELVNGLYILDGNTVLSETCVAEGSKGKTELWHSRLGHIGLNNMKV 464
Query: 235 LVKKGYIDGKKVSSLYFCEDSIYGKAHRVSFGV*KHTTK 119
L KG + +++ L FCE+ + GKA +VSF V KH ++
Sbjct: 465 LAGKGLVSKEEIRVLDFCENCVMGKAKKVSFNVGKHNSE 503
>gi|3402755|emb|CAA20201.1| putative transposable element [Arabidopsis
thaliana]
Length = 1308
Score = 80 bits (197), Expect = 5e-014
Identities = 41/116 (35%), Positives = 64/116 (55%), Gaps = 3/116 (2%)
Frame = -2
Query: 412 GRKQWY-QRRVVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKL 236
G+ ++Y + + L G + LY L G V E E ++T LWH RLGHMS +K+
Sbjct: 396 GKVRFYKENKTALCGNLVNGLYVLDGHTVVNENCNVEGSNEKTELWHCRLGHMSLNNMKI 455
Query: 235 LVKKGYIDGKKVSSLYFCEDSIYGKAHRVSFGV*KHTTK--LPFLQEKKWHQHWSL 74
L +KG ++ K + L FCE+ + GK+ ++SF V KH T L ++ W + + L
Sbjct: 456 LAEKGLLEKKDIKELSFCENCVMGKSKKLSFNVGKHITDEVLGYIHADLWGKQYFL 511
>gi|147836446|emb|CAN62092.1| hypothetical protein VITISV_022473 [Vitis
vinifera]
Length = 318
Score = 80 bits (197), Expect = 5e-014
Identities = 41/91 (45%), Positives = 54/91 (59%), Gaps = 2/91 (2%)
Frame = -2
Query: 385 VVLRGKQYDSLYFLQGKVV--ETEALITEDKKDETSLWHSRLGHMSQKGLKLLVKKGYID 212
VV++G + + LY LQG + I+E + T LWH RLGHMS KGL +L K+G +D
Sbjct: 124 VVMKGNKINGLYTLQGGTIIGAVVVSISESIIETTRLWHMRLGHMSDKGLTILSKRGLLD 183
Query: 211 GKKVSSLYFCEDSIYGKAHRVSFGV*KHTTK 119
G+K L FCE ++GK RV F H TK
Sbjct: 184 GQKTGELDFCEHCVFGKQCRVKFSAGVHRTK 214
>gi|13699782|gb|AAK38381.1|AC079028_5 polyprotein, putative [Arabidopsis
thaliana]
Length = 855
Score = 80 bits (196), Expect = 6e-014
Identities = 43/110 (39%), Positives = 62/110 (56%), Gaps = 3/110 (2%)
Frame = -2
Query: 412 GRKQWYQ-RRVVLRGKQYDSLYFLQGKVVETEALITEDKKDETSLWHSRLGHMSQKGLKL 236
G+ ++Y+ + LRG LY L G V E+ I E K+ T+LWHSRLGHM +K+
Sbjct: 78 GQVRYYKNNKTALRGSLSGGLYVLDGNTVIAESCIAERSKELTTLWHSRLGHMGGNNMKI 137
Query: 235 LVKKGYIDGKKVSSLYFCEDSIYGKAHRVSFGV*KHTTK--LPFLQEKKW 92
L KG I + +SL F E + GKA +VSF + KH ++ L ++ W
Sbjct: 138 LAGKGLIKPSEATSLEFYEHCVMGKAKKVSFNIGKHNSEEILSYVHADLW 187
>gi|109391001|emb|CAJ09951.2| putative gag-pol polyprotein [Citrus sinensis]
Length = 1334
Score = 80 bits (196), Expect = 6e-014
Identities = 39/90 (43%), Positives = 56/90 (62%), Gaps = 1/90 (1%)
Frame = -2
Query: 385 VVLRGKQYDSLYFLQGKVVET-EALITEDKKDETSLWHSRLGHMSQKGLKLLVKKGYIDG 209
+V++G + LY LQG V E + ++D T LWH RLGHMS KGL+ L K+G + G
Sbjct: 390 IVMKGVNENGLYVLQGSSVPVQEGVSAVSEEDRTKLWHLRLGHMSIKGLQELSKQGLLGG 449
Query: 208 KKVSSLYFCEDSIYGKAHRVSFGV*KHTTK 119
++ L FCE+ I+GK+HR F +H +K
Sbjct: 450 DRIQQLEFCENCIFGKSHRSKFNKGEHMSK 479
>gi|20198271|gb|AAM15487.1| hypothetical protein [Arabidopsis thaliana]
Length = 122
Score = 79 bits (194), Expect = 1e-013
Identities = 39/75 (52%), Positives = 50/75 (66%), Gaps = 3/75 (4%)
Frame = -2
Query: 307 EDKKDETSLWHSRLGHMSQKGLKLLVKKGYIDGKKVSSLYFCEDSIYGKAHRVSFGV*KH 128
E + D T LWH RLGHMSQK + LLVK+G++D KKVS+ ED IYG+A RVSF + +H
Sbjct: 39 EKRVDNTVLWHQRLGHMSQKNMDLLVKRGFLDRKKVSTFSIFEDCIYGRAKRVSFDLAQH 98
Query: 127 TT---KLPFLQEKKW 92
T KL ++ W
Sbjct: 99 DTKEEKLDYVHSDLW 113
>gi|38344889|emb|CAD41912.2| OSJNBa0033G05.13 [Oryza sativa Japonica Group]
Length = 1181
Score = 76 bits (185), Expect = 1e-012
Identities = 40/94 (42%), Positives = 52/94 (55%), Gaps = 5/94 (5%)
Frame = -2
Query: 358 SLYFLQGKVVETEALITED---KKDETSLWHSRLGHMSQKGLKLLVKKGYIDGKKVSSLY 188
+LY LQG + D D T+LWH RLGHMS+ GL L K+G +DG+ +S L
Sbjct: 407 NLYHLQGTTILGNVATVSDSLSNSDATNLWHMRLGHMSEIGLAELSKRGLLDGQSISKLK 466
Query: 187 FCEDSIYGKAHRVSFGV*KHTTK--LPFLQEKKW 92
FCE I+GK RV F HTT+ L ++ W
Sbjct: 467 FCEHCIFGKHKRVKFNTSTHTTEGILDYVHSDLW 500
>gi|116317760|emb|CAH65740.1| OSIGBa0127D24.3 [Oryza sativa Indica Group]
Length = 1009
Score = 76 bits (185), Expect = 1e-012
Identities = 39/94 (41%), Positives = 52/94 (55%), Gaps = 5/94 (5%)
Frame = -2
Query: 358 SLYFLQGKVVETEALITED---KKDETSLWHSRLGHMSQKGLKLLVKKGYIDGKKVSSLY 188
+LY L+G + D D T+LWH RLGHMS+ GL L K+G +DG+ + L
Sbjct: 228 NLYHLRGTPILGNVAAVSDSLSNSDATNLWHMRLGHMSEIGLAELSKRGMLDGQSIGKLI 287
Query: 187 FCEDSIYGKAHRVSFGV*KHTTK--LPFLQEKKW 92
FCE I+GK HRV F HTT+ L ++ W
Sbjct: 288 FCEHCIFGKHHRVKFNTSTHTTEGILDYVHSDLW 321
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 798,366,029,682
Number of Sequences: 15229318
Number of Extensions: 798366029682
Number of Successful Extensions: 240520672
Number of sequences better than 0.0: 0