BLASTX 7.6.2
Query= UN08948 /QuerySize=1324
(1323 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|297850016|ref|XP_002892889.1| methyladenine glycosylase famil... 389 9e-106
gi|15218379|ref|NP_173049.1| DNA glycosylase-like protein [Arabi... 385 1e-104
gi|297839853|ref|XP_002887808.1| hypothetical protein ARALYDRAFT... 374 3e-101
gi|15220860|ref|NP_178200.1| methyladenine glycosylase-like prot... 373 4e-101
gi|224142383|ref|XP_002324538.1| predicted protein [Populus tric... 307 5e-081
gi|224091765|ref|XP_002309346.1| predicted protein [Populus tric... 305 1e-080
gi|255576987|ref|XP_002529378.1| DNA-3-methyladenine glycosylase... 301 3e-079
gi|147823377|emb|CAN66338.1| hypothetical protein VITISV_026086 ... 287 5e-075
gi|225443176|ref|XP_002264482.1| PREDICTED: hypothetical protein... 285 1e-074
gi|125556288|gb|EAZ01894.1| hypothetical protein OsI_23919 [Oryz... 280 4e-073
gi|225445871|ref|XP_002276173.1| PREDICTED: hypothetical protein... 279 8e-073
gi|51534979|dbj|BAD38103.1| methyladenine glycosylase protein-li... 278 2e-072
gi|222635995|gb|EEE66127.1| hypothetical protein OsJ_22173 [Oryz... 278 2e-072
gi|242073624|ref|XP_002446748.1| hypothetical protein SORBIDRAFT... 277 3e-072
gi|242096536|ref|XP_002438758.1| hypothetical protein SORBIDRAFT... 274 3e-071
gi|226529264|ref|NP_001141717.1| hypothetical protein LOC1002738... 273 7e-071
gi|15242914|ref|NP_200605.1| DNA-3-methyladenine glycosylase I [... 270 4e-070
gi|215697314|dbj|BAG91308.1| unnamed protein product [Oryza sati... 267 3e-069
gi|297793315|ref|XP_002864542.1| methyladenine glycosylase famil... 267 4e-069
gi|90265145|emb|CAC09513.2| H0711G06.19 [Oryza sativa Indica Group] 266 7e-069
>gi|297850016|ref|XP_002892889.1| methyladenine glycosylase family protein
[Arabidopsis lyrata subsp. lyrata]
Length = 354
Score = 389 bits (997), Expect = 9e-106
Identities = 209/289 (72%), Positives = 230/289 (79%), Gaps = 18/289 (6%)
Frame = -1
Query: 1011 SSMTASYSSDASSSCESSHLSVASSSSRKKAV-VRRSGSVSSVGTTSAVVVARRK-QVDE 838
SS+ +S S+S +S+ S ASSS + V S S V S V + RK + +
Sbjct: 72 SSLCSSILRKNSASMTASYSSDASSSCESSPLSVASSSSCKKVMRRSGSVSSTRKLSIGK 131
Query: 837 KEDKAAAAAGGGVSNGDCFADGRRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLC 658
+EDK A GDCFADGRRRCAWITPK+DPCYVAFHDEEWGVPV +DKKLFELLC
Sbjct: 132 EEDKVA---------GDCFADGRRRCAWITPKADPCYVAFHDEEWGVPVDDDKKLFELLC 182
Query: 657 LSGALSELSWTDILSRRQLLREVFMDFDPVAVSKLNEKTV----TSAISLLSEVKLRSIL 490
LSGAL+ELSWTDILSRRQLLREVFMDFDPVAVS++N+K + T+AISLLSEVK+RSIL
Sbjct: 183 LSGALAELSWTDILSRRQLLREVFMDFDPVAVSEMNDKKLTAPGTAAISLLSEVKIRSIL 242
Query: 489 DNARQVRKTIAEYGSFKKYMWNFVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRS 310
DN+R VRK IAE GSFKKYMWNFV+NKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRS
Sbjct: 243 DNSRHVRKIIAECGSFKKYMWNFVNNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRS 302
Query: 309 VSPTVIYSFMQAAGLTNDHLIGCFRHQDCCVVAETTTTKA---KKTERE 172
VSPTVIYSFMQAAGLTNDHLIGCFR QDCCV AETTTT KK ERE
Sbjct: 303 VSPTVIYSFMQAAGLTNDHLIGCFRFQDCCVDAETTTTTTKAKKKNERE 351
>gi|15218379|ref|NP_173049.1| DNA glycosylase-like protein [Arabidopsis
thaliana]
Length = 352
Score = 385 bits (987), Expect = 1e-104
Identities = 214/355 (60%), Positives = 250/355 (70%), Gaps = 27/355 (7%)
Frame = -1
Query: 1182 PPGMKLENSKKMTTTNTIESKDEKTKKKP-----DPPASPTTTL-----KQCSSLCSSLL 1033
PP + NS + + + K ++KP + P T + K +
Sbjct: 4 PPRFRSVNSDEREFRSVLGPTGNKLQRKPPGMKLEKPMMEKTIIDSKDEKAKKPTTPASP 63
Query: 1032 RRRNSGASSMTASYSSDASSSCESSHLSVASSS--SRKKAVVRRSGSVSSVGTTSAVVVA 859
R SS+ +S S+S +S+ S ASSS S +V S V + +V
Sbjct: 64 RTTLKQCSSLCSSILRKNSASMTASYSSDASSSCESSPLSVASSSSCKKVVRRSGSVSST 123
Query: 858 RRKQVDEKEDKAAAAAGGGVSNGDCFADGRRRCAWITPKSDPCYVAFHDEEWGVPVHEDK 679
R+ V ++E+K +GDCFADGR+RCAWITPK+DPCYVAFHDEEWGVPVH+DK
Sbjct: 124 RKLSVGKEEEKV---------SGDCFADGRKRCAWITPKADPCYVAFHDEEWGVPVHDDK 174
Query: 678 KLFELLCLSGALSELSWTDILSRRQLLREVFMDFDPVAVSKLNEKTV----TSAISLLSE 511
KLFELLCLSGAL+ELSWTDILSRR +LREVFMDFDPVAV++LN+K + T+AISLLSE
Sbjct: 175 KLFELLCLSGALAELSWTDILSRRHILREVFMDFDPVAVAELNDKKLTAPGTAAISLLSE 234
Query: 510 VKLRSILDNARQVRKTIAEYGSFKKYMWNFVSNKPTQSQFRYQRQVPVKTSKAEFISKDL 331
VK+RSILDN+R VRK IAE GS KKYMWNFV+NKPTQSQFRYQRQVPVKTSKAEFISKDL
Sbjct: 235 VKIRSILDNSRHVRKIIAECGSLKKYMWNFVNNKPTQSQFRYQRQVPVKTSKAEFISKDL 294
Query: 330 VRRGFRSVSPTVIYSFMQAAGLTNDHLIGCFRHQDCCVVAETTTT--KAKKTERE 172
VRRGFRSVSPTVIYSFMQAAGLTNDHLIGCFR+QDCCV AETTTT KK ERE
Sbjct: 295 VRRGFRSVSPTVIYSFMQAAGLTNDHLIGCFRYQDCCVDAETTTTTKAKKKNERE 349
>gi|297839853|ref|XP_002887808.1| hypothetical protein ARALYDRAFT_477160
[Arabidopsis lyrata subsp. lyrata]
Length = 323
Score = 374 bits (958), Expect = 3e-101
Identities = 198/295 (67%), Positives = 227/295 (76%), Gaps = 24/295 (8%)
Frame = -1
Query: 1077 TTTLKQCSSLCSSLLRRRNSGASSMTASYSSDASSSCESSHLSVASSSSRKKAVVRRSGS 898
T + QCS L S+LRR SMTASYSSDASSSCESS LSVAS+SS K+A +RRSGS
Sbjct: 47 TKNMPQCSPLSPSILRR---NGISMTASYSSDASSSCESSPLSVASTSSGKRA-LRRSGS 102
Query: 897 VSSVGTTSAVVVARRKQVDEKEDKAAAAAGGGVSNGDCFADGRRRCAWITPKSDPCYVAF 718
+SS + RR +E+++KA+ DCF+DGR+RCAWITPKS CY+AF
Sbjct: 103 LSSSSS------LRRNLTEERDEKAS----------DCFSDGRKRCAWITPKSGQCYIAF 146
Query: 717 HDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQLLREVFMDFDPVAVSKLNEKTV 538
HD EWGVPVH+DK+LFELL LSGAL+ELSW DILS+RQL REVFMDFDP+A+S+L K +
Sbjct: 147 HDTEWGVPVHDDKRLFELLSLSGALAELSWKDILSKRQLFREVFMDFDPIAISELTNKKI 206
Query: 537 TSA----ISLLSEVKLRSILDNARQVRKTIAEYGSFKKYMWNFVSNKPTQSQFRYQRQVP 370
TS+ +LLSE KLRSIL+NA QV K I E+GSF KY+WNFV+ KPTQSQFRY RQVP
Sbjct: 207 TSSEIATTTLLSEQKLRSILENANQVCKLIVEFGSFDKYIWNFVNQKPTQSQFRYPRQVP 266
Query: 369 VKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHLIGCFRHQDCCVVAET 205
VKTSKAE ISKDLVRRGFRSVSPTVIYSFMQ AGLTNDHL CFRH DC ET
Sbjct: 267 VKTSKAELISKDLVRRGFRSVSPTVIYSFMQTAGLTNDHLTCCFRHHDCMTKDET 321
>gi|15220860|ref|NP_178200.1| methyladenine glycosylase-like protein
[Arabidopsis thaliana]
Length = 327
Score = 373 bits (957), Expect = 4e-101
Identities = 198/295 (67%), Positives = 225/295 (76%), Gaps = 24/295 (8%)
Frame = -1
Query: 1077 TTTLKQCSSLCSSLLRRRNSGASSMTASYSSDASSSCESSHLSVASSSSRKKAVVRRSGS 898
T + QCS L +LRR SMTASYSSDASSSCESS LS+ S+SS K+ V+RRSGS
Sbjct: 51 TEKMPQCSPLSPPILRR---NGISMTASYSSDASSSCESSPLSMTSTSSGKR-VLRRSGS 106
Query: 897 VSSVGTTSAVVVARRKQVDEKEDKAAAAAGGGVSNGDCFADGRRRCAWITPKSDPCYVAF 718
VSS + RR +E+++KA+ DCF DGR+RCAWITPKSD CY+AF
Sbjct: 107 VSSSSS------LRRNLTEERDEKAS----------DCFCDGRKRCAWITPKSDQCYIAF 150
Query: 717 HDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQLLREVFMDFDPVAVSKLNEKTV 538
HDEEWGVPVH+DK+LFELL LSGAL+ELSW DILS+RQL REVFMDFDP+A+S+L K +
Sbjct: 151 HDEEWGVPVHDDKRLFELLSLSGALAELSWKDILSKRQLFREVFMDFDPIAISELTNKKI 210
Query: 537 TS----AISLLSEVKLRSILDNARQVRKTIAEYGSFKKYMWNFVSNKPTQSQFRYQRQVP 370
TS A +LLSE KLRSIL+NA QV K I +GSF KY+WNFV+ KPTQSQFRY RQVP
Sbjct: 211 TSPEIAATTLLSEQKLRSILENANQVCKIIGAFGSFDKYIWNFVNQKPTQSQFRYPRQVP 270
Query: 369 VKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHLIGCFRHQDCCVVAET 205
VKTSKAE ISKDLVRRGFRSVSPTVIYSFMQ AGLTNDHL CFRH DC ET
Sbjct: 271 VKTSKAELISKDLVRRGFRSVSPTVIYSFMQTAGLTNDHLTCCFRHHDCMTKDET 325
>gi|224142383|ref|XP_002324538.1| predicted protein [Populus trichocarpa]
Length = 380
Score = 307 bits (784), Expect = 5e-081
Identities = 178/365 (48%), Positives = 224/365 (61%), Gaps = 27/365 (7%)
Frame = -1
Query: 1245 VNSDERDFRSVLGPTGNKQRKPPGMKLENSK--KMTTTNTIESKDEKTKKKPDPPASPTT 1072
+N + + RSVLGPTGN + P + SK + + E K + KK PA T
Sbjct: 10 MNVADSEARSVLGPTGNNKAGPLSARKPVSKQSRKVEKSPEEVKLGEEKKTLTVPAVGTL 69
Query: 1071 TLKQCSSLCSSLLRRRN---SGASSMTASYSSDASSSCESSHLSVASSSSRKKAVVRRSG 901
+ K S SS+LRR S+ AS SSDAS + S +A R
Sbjct: 70 SPKSHSLNISSVLRRHELLLHSNLSLNASCSSDAS-----------TDSFHSRASTGRLT 118
Query: 900 SVSSVGTTSAVVVARRKQVDEKEDKAAAAAGGGVSNGDCFADGRRRCAWITPKSDPCYVA 721
+S GT RRKQ + + G + ++ CAW+TP +DPCY
Sbjct: 119 RSNSAGT-------RRKQYVLRPRSFVSEGGLESPPSPDDSQSKKSCAWVTPNTDPCYAT 171
Query: 720 FHDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQLLREVFMDFDPVAVSKLNEKT 541
FHDEEWGVP+H+D+KLFELL LSGAL+EL+W ILS+R + REVF DFDP+AVSK NEK
Sbjct: 172 FHDEEWGVPIHDDRKLFELLVLSGALAELTWPAILSKRHIFREVFADFDPIAVSKFNEKK 231
Query: 540 V----TSAISLLSEVKLRSILDNARQVRKTIAEYGSFKKYMWNFVSNKPTQSQFRYQRQV 373
+ ++A SLLSE+KLR+I++NARQ+ K I E+GSF KY+W+FV+ KP S+FRY RQV
Sbjct: 232 ILAPGSTATSLLSELKLRAIVENARQISKVIDEFGSFDKYIWSFVNYKPIVSRFRYPRQV 291
Query: 372 PVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHLIGCFRHQDCCVVAETTTTK 193
PVKT KA+ ISKDLVRRGFRSV PTVIYSFMQ AG+TNDHLI CFR Q+C AE
Sbjct: 292 PVKTPKADAISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLISCFRFQECLDAAEGKVEN 351
Query: 192 AKKTE 178
K+E
Sbjct: 352 GIKSE 356
>gi|224091765|ref|XP_002309346.1| predicted protein [Populus trichocarpa]
Length = 381
Score = 305 bits (781), Expect = 1e-080
Identities = 179/371 (48%), Positives = 225/371 (60%), Gaps = 36/371 (9%)
Frame = -1
Query: 1248 SVNSDERDFRSVLGPTGNKQ-------RKPPGMKLENSKKMTTTNTIESKDEKTKKKPDP 1090
S+N + + R VLGPTGN + RKP +L K + E+K + KK
Sbjct: 9 SMNVADSEARPVLGPTGNTKAGPLTSARKPASKQLRKDGK----SPEEAKLGEEKKVLTV 64
Query: 1089 PASPTTTLKQCSSLCSSLLRRRNS---GASSMTASYSSDASSSCESSHLSVASSSSRKKA 919
P + K S SS+LRR S+ AS SSDAS + S +A
Sbjct: 65 PTVGNLSPKSLSGNFSSVLRRHEQLLHSNLSLNASCSSDAS-----------TDSFHSRA 113
Query: 918 VVRRSGSVSSVGTTSAVVVARRKQVDEKEDKAAAAAGGGVSNGDCFADGRRRCAWITPKS 739
R ++VGT RRKQ K + G + ++ CAW+TP +
Sbjct: 114 STGRLIRSNNVGT-------RRKQYVSKPRSVVSDGGLESLPSSDGSQSKKSCAWVTPNT 166
Query: 738 DPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQLLREVFMDFDPVAVS 559
DPCY AFHDEEWG+PVH+D+KLFELL LSGAL+EL+W ILS+R + REVF DFDP+AVS
Sbjct: 167 DPCYTAFHDEEWGLPVHDDRKLFELLVLSGALAELTWPAILSKRHMFREVFADFDPIAVS 226
Query: 558 KLNEKTV----TSAISLLSEVKLRSILDNARQVRKTIAEYGSFKKYMWNFVSNKPTQSQF 391
K NEK + ++A SLLSE+KLR+I++NARQ+ K I E+GSF KY+W+FV+ KP S+F
Sbjct: 227 KFNEKKIIAPGSTAASLLSELKLRAIIENARQISKVIDEFGSFDKYIWSFVNYKPIVSRF 286
Query: 390 RYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHLIGCFRHQDCCVVA 211
RY RQVP KT KA+ ISKDLVRRGFRSV PTVIYSFMQ AG+TNDHLI CFR Q+C A
Sbjct: 287 RYPRQVPAKTPKADAISKDLVRRGFRSVGPTVIYSFMQVAGVTNDHLISCFRFQECIDAA 346
Query: 210 ETTTTKAKKTE 178
E K+E
Sbjct: 347 EGKEENGIKSE 357
>gi|255576987|ref|XP_002529378.1| DNA-3-methyladenine glycosylase, putative
[Ricinus communis]
Length = 380
Score = 301 bits (769), Expect = 3e-079
Identities = 175/365 (47%), Positives = 218/365 (59%), Gaps = 27/365 (7%)
Frame = -1
Query: 1245 VNSDERDFRSVLGPTGNKQRKPPGMKLENSKKMTTTNTIES--KDEKTKKKPDPPASPTT 1072
+N + + R VLGPTGN + K SK++ T K + KK P +
Sbjct: 10 MNVADSETRPVLGPTGNNKAGSLSAKKPASKQLRKVETSPEAVKLGQEKKLVTVPTASAL 69
Query: 1071 TLKQCSSLCSSLLRRRNS---GASSMTASYSSDASSSCESSHLSVASSSSRKKAVVRRSG 901
+ K S S+LRR S+ AS SSDAS + S +A R
Sbjct: 70 SPKSHSVSVPSVLRRHEQLLHSNLSLNASCSSDAS-----------TDSFHSRASTGRLT 118
Query: 900 SVSSVGTTSAVVVARRKQVDEKEDKAAAAAGGGVSNGDCFADGRRRCAWITPKSDPCYVA 721
+S+GT RRKQ K + G + ++ CAW+TP +DPCY A
Sbjct: 119 RSNSLGT-------RRKQYALKPRSVVSDGGLESPPPSDGSQAKKSCAWVTPNADPCYTA 171
Query: 720 FHDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQLLREVFMDFDPVAVSKLNEKT 541
FHDEEWG+PVH+DKKLFELL LSGAL+EL+W ILS+R + REVF +FDPV VSK NEK
Sbjct: 172 FHDEEWGIPVHDDKKLFELLVLSGALAELTWPAILSKRHIFREVFANFDPVVVSKFNEKK 231
Query: 540 V----TSAISLLSEVKLRSILDNARQVRKTIAEYGSFKKYMWNFVSNKPTQSQFRYQRQV 373
+ ++A SLLSE+KLR+I++NARQ+ K E GSF KY+W+FV+ KP S+FRY RQV
Sbjct: 232 IIAPGSTASSLLSEIKLRAIIENARQISKVTDELGSFDKYIWSFVNYKPIVSRFRYPRQV 291
Query: 372 PVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHLIGCFRHQDCCVVAETTTTK 193
PVKT KA+ ISKDLVRRGFRSV PTV+YSFMQ AGLTNDHLI CFR Q+C AE
Sbjct: 292 PVKTPKADVISKDLVRRGFRSVGPTVVYSFMQVAGLTNDHLISCFRFQECINAAEGKEEN 351
Query: 192 AKKTE 178
K E
Sbjct: 352 GVKVE 356
>gi|147823377|emb|CAN66338.1| hypothetical protein VITISV_026086 [Vitis
vinifera]
Length = 431
Score = 287 bits (732), Expect = 5e-075
Identities = 141/255 (55%), Positives = 180/255 (70%), Gaps = 22/255 (8%)
Frame = -1
Query: 972 SCESSHLSVASSSSRKKAVVRRSGSVSSVGTTSAVVVARRKQVDEKEDKAAAAAG-GGVS 796
SC SS S + +SSR+ + RRK K DK G V+
Sbjct: 161 SCSSSESSSSRASSRRSS-----------------TPIRRKHFSPKADKXEKTGGRPSVA 203
Query: 795 NGDCFADGRRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSWTDIL 616
+ +C +RRCAW+TP +DPCY AFHDEEWGVPVH+DK+ FELL LSGAL+EL+W IL
Sbjct: 204 SDNCALQAKRRCAWVTPNTDPCYAAFHDEEWGVPVHDDKRHFELLVLSGALAELTWPAIL 263
Query: 615 SRRQLLREVFMDFDPVAVSKLNEKTVTS----AISLLSEVKLRSILDNARQVRKTIAEYG 448
+R + REVF++FDP+AVSKLNEK + + A SL+S++KLRS+++NARQ+ K I E+G
Sbjct: 264 QKRHIFREVFLEFDPIAVSKLNEKKIVTPGSPATSLVSDLKLRSVIENARQICKIIGEFG 323
Query: 447 SFKKYMWNFVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAG 268
SF +Y+W FV++KP +FRY RQVPVKT+KA+ ISKDLVRRGFRSV PTVIY+FMQ AG
Sbjct: 324 SFDQYIWGFVNHKPMVGRFRYPRQVPVKTAKADVISKDLVRRGFRSVGPTVIYAFMQVAG 383
Query: 267 LTNDHLIGCFRHQDC 223
+TNDHL CFR Q+C
Sbjct: 384 ITNDHLTSCFRFQEC 398
>gi|225443176|ref|XP_002264482.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 360
Score = 285 bits (729), Expect = 1e-074
Identities = 141/255 (55%), Positives = 179/255 (70%), Gaps = 22/255 (8%)
Frame = -1
Query: 972 SCESSHLSVASSSSRKKAVVRRSGSVSSVGTTSAVVVARRKQVDEKEDKAAAAAG-GGVS 796
SC SS S + +SSR+ + RRK K DK G V+
Sbjct: 90 SCSSSESSSSRASSRRSS-----------------TPIRRKHFSPKADKVEKTGGRPSVA 132
Query: 795 NGDCFADGRRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSWTDIL 616
+ +C +RRCAW+TP +DPCY AFHDEEWGVPVH+DK+ FELL LSGAL+EL+W IL
Sbjct: 133 SDNCALQAKRRCAWVTPNTDPCYAAFHDEEWGVPVHDDKRHFELLVLSGALAELTWPAIL 192
Query: 615 SRRQLLREVFMDFDPVAVSKLNEKTVTS----AISLLSEVKLRSILDNARQVRKTIAEYG 448
+R + REVF++FDP+AVSKLNEK + + A SL+S++KLRS+++NARQ+ K I E+G
Sbjct: 193 QKRHIFREVFLEFDPIAVSKLNEKKIVTPGSPATSLVSDLKLRSVIENARQICKIIGEFG 252
Query: 447 SFKKYMWNFVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAG 268
SF +Y+W FV++KP +FRY RQVPVKT+KA+ ISKDLVRRGFRSV PTVIY FMQ AG
Sbjct: 253 SFDQYIWGFVNHKPMVGRFRYPRQVPVKTAKADVISKDLVRRGFRSVGPTVIYVFMQVAG 312
Query: 267 LTNDHLIGCFRHQDC 223
+TNDHL CFR Q+C
Sbjct: 313 ITNDHLTSCFRFQEC 327
>gi|125556288|gb|EAZ01894.1| hypothetical protein OsI_23919 [Oryza sativa Indica
Group]
Length = 426
Score = 280 bits (716), Expect = 4e-073
Identities = 147/257 (57%), Positives = 184/257 (71%), Gaps = 10/257 (3%)
Frame = -1
Query: 978 SSSCESSHLSVASSSSRKKAVVRRSGSVSSVGTTSAVVVARRKQVDEKEDKA-AAAAGGG 802
++SC SS SV S R + R S S V A V RR + K A AA
Sbjct: 120 NASC-SSDASVESLRGRDSSGGRLERSWSRV----APAVPRRGKTPVKAAAAEKVAADAE 174
Query: 801 VSNGDCFADGRRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSWTD 622
V G+RRCAW+TP SDPCYV FHDEEWGVPVH+D++LFELL LSGAL+EL+W +
Sbjct: 175 VVAPATPEAGKRRCAWVTPTSDPCYVIFHDEEWGVPVHDDRRLFELLVLSGALAELTWPE 234
Query: 621 ILSRRQLLREVFMDFDPVAVSKLNEKTVTS----AISLLSEVKLRSILDNARQVRKTIAE 454
IL RRQL RE+F+DFDPVA+SK+NEK + + A SLLSE KLR++++NARQ+ K + E
Sbjct: 235 ILKRRQLFREIFVDFDPVAISKINEKKLVAPGSVANSLLSEQKLRAVVENARQILKIVDE 294
Query: 453 YGSFKKYMWNFVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQA 274
+GSF +Y W F+++KP S+FRY RQVPVK+ KA+ ISKD+VRRGFR V PT+IYSFMQA
Sbjct: 295 FGSFDRYCWGFLNHKPIVSKFRYPRQVPVKSPKADMISKDMVRRGFRGVGPTIIYSFMQA 354
Query: 273 AGLTNDHLIGCFRHQDC 223
AGLTNDHL+ CFR ++C
Sbjct: 355 AGLTNDHLVSCFRFKEC 371
>gi|225445871|ref|XP_002276173.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 375
Score = 279 bits (713), Expect = 8e-073
Identities = 146/255 (57%), Positives = 181/255 (70%), Gaps = 6/255 (2%)
Frame = -1
Query: 960 SHLSVASSSSRKKAVVRRSGSVSSVGTTSAVVVARRKQVDEKEDKAAAAAGGGVSNGDCF 781
S+LS+ +S S + S+ T + ARR+ K K + G S D
Sbjct: 88 SNLSLNASCSSDASTDSFHSRASTGRITRSSSTARRRSYASK-PKVIVSDGVSESPPDGL 146
Query: 780 ADGRRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQL 601
+RRCAW+TP +D Y+AFHDEEWGVPVH+DKKLFELL LSGAL+EL+W ILS+R +
Sbjct: 147 -KAKRRCAWVTPNTDLSYIAFHDEEWGVPVHDDKKLFELLVLSGALAELTWPTILSKRHI 205
Query: 600 LREVFMDFDPVAVSKLNEKTVTS----AISLLSEVKLRSILDNARQVRKTIAEYGSFKKY 433
REVF DFDP+AV+KLNEK + + A SL+SE+KLR I++NARQ+ K I E+GSF +Y
Sbjct: 206 FREVFADFDPIAVAKLNEKKLMAPGSIASSLISELKLRGIIENARQMSKVIDEFGSFDEY 265
Query: 432 MWNFVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDH 253
+W+FV++KP S+FRY R VPVKT KA+ ISKDLVRRGFRSV PTVIYSFMQ AG+TNDH
Sbjct: 266 IWSFVNHKPIVSRFRYPRHVPVKTPKADVISKDLVRRGFRSVGPTVIYSFMQVAGITNDH 325
Query: 252 LIGCFRHQDCCVVAE 208
LI CFR QDC AE
Sbjct: 326 LISCFRFQDCVTAAE 340
>gi|51534979|dbj|BAD38103.1| methyladenine glycosylase protein-like [Oryza
sativa Japonica Group]
Length = 433
Score = 278 bits (710), Expect = 2e-072
Identities = 126/188 (67%), Positives = 159/188 (84%), Gaps = 4/188 (2%)
Frame = -1
Query: 774 GRRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQLLR 595
G+RRCAW+TP SDPCYV FHDEEWGVPVH+D++LFELL LSGAL+EL+W +IL RRQL R
Sbjct: 191 GKRRCAWVTPTSDPCYVIFHDEEWGVPVHDDRRLFELLVLSGALAELTWPEILKRRQLFR 250
Query: 594 EVFMDFDPVAVSKLNEKTVTS----AISLLSEVKLRSILDNARQVRKTIAEYGSFKKYMW 427
E+F+DFDPVA+SK+NEK + + A SLLSE KLR++++NARQ+ K + E+GSF +Y W
Sbjct: 251 EIFVDFDPVAISKINEKKLVAPGSVANSLLSEQKLRAVVENARQILKIVDEFGSFDRYCW 310
Query: 426 NFVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHLI 247
F+++KP S+FRY RQVPVK+ KA+ ISKD+VRRGFR V PT+IYSFMQAAGLTNDHL+
Sbjct: 311 GFLNHKPIVSKFRYPRQVPVKSPKADMISKDMVRRGFRGVGPTIIYSFMQAAGLTNDHLV 370
Query: 246 GCFRHQDC 223
CFR ++C
Sbjct: 371 SCFRFKEC 378
>gi|222635995|gb|EEE66127.1| hypothetical protein OsJ_22173 [Oryza sativa
Japonica Group]
Length = 410
Score = 278 bits (710), Expect = 2e-072
Identities = 126/188 (67%), Positives = 159/188 (84%), Gaps = 4/188 (2%)
Frame = -1
Query: 774 GRRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQLLR 595
G+RRCAW+TP SDPCYV FHDEEWGVPVH+D++LFELL LSGAL+EL+W +IL RRQL R
Sbjct: 168 GKRRCAWVTPTSDPCYVIFHDEEWGVPVHDDRRLFELLVLSGALAELTWPEILKRRQLFR 227
Query: 594 EVFMDFDPVAVSKLNEKTVTS----AISLLSEVKLRSILDNARQVRKTIAEYGSFKKYMW 427
E+F+DFDPVA+SK+NEK + + A SLLSE KLR++++NARQ+ K + E+GSF +Y W
Sbjct: 228 EIFVDFDPVAISKINEKKLVAPGSVANSLLSEQKLRAVVENARQILKIVDEFGSFDRYCW 287
Query: 426 NFVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHLI 247
F+++KP S+FRY RQVPVK+ KA+ ISKD+VRRGFR V PT+IYSFMQAAGLTNDHL+
Sbjct: 288 GFLNHKPIVSKFRYPRQVPVKSPKADMISKDMVRRGFRGVGPTIIYSFMQAAGLTNDHLV 347
Query: 246 GCFRHQDC 223
CFR ++C
Sbjct: 348 SCFRFKEC 355
>gi|242073624|ref|XP_002446748.1| hypothetical protein SORBIDRAFT_06g021680
[Sorghum bicolor]
Length = 389
Score = 277 bits (708), Expect = 3e-072
Identities = 148/273 (54%), Positives = 184/273 (67%), Gaps = 20/273 (7%)
Frame = -1
Query: 1008 SMTASYSSDASSSCESSHLSVASSSSRKKAVVRRSGSVSSVGTTSAVVVARRKQVDEKED 829
S+ AS SSDAS+ S S R R+ ++S Q D K
Sbjct: 88 SLDASCSSDASTDSFCSRAS-TGRIGRPAFGARKKKTLS--------------QTDYKPV 132
Query: 828 KAAAAAGGGVSNGDCFADGRRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSG 649
GG S D A +RRCAW+T +DPCY AFHDEEWGVPVH+DKKLFELL LSG
Sbjct: 133 SMLEREGGLASQIDA-AGVKRRCAWVTANTDPCYAAFHDEEWGVPVHDDKKLFELLVLSG 191
Query: 648 ALSELSWTDILSRRQLLREVFMDFDPVAVSKLNEKTV----TSAISLLSEVKLRSILDNA 481
AL+EL+W IL++R + REVFMDFDPV VSKL+EK + + + SLLSE KLR +++NA
Sbjct: 192 ALAELTWPAILNKRDIFREVFMDFDPVLVSKLSEKKIIAPGSPSSSLLSEQKLRGVIENA 251
Query: 480 RQVRKTIAEYGSFKKYMWNFVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSP 301
RQ+ K I E+GSF KY W+FV++KP S+FRY RQVPVKTSKA+ ISKDLVRRGFRSV P
Sbjct: 252 RQILKIIEEFGSFDKYCWSFVNHKPILSRFRYSRQVPVKTSKADAISKDLVRRGFRSVGP 311
Query: 300 TVIYSFMQAAGLTNDHLIGCFRHQDCCVVAETT 202
TV+Y+FMQ +G+TNDHLI C+R +C ++ T
Sbjct: 312 TVVYTFMQVSGMTNDHLISCYRFAECAASSDGT 344
>gi|242096536|ref|XP_002438758.1| hypothetical protein SORBIDRAFT_10g025650
[Sorghum bicolor]
Length = 412
Score = 274 bits (700), Expect = 3e-071
Identities = 126/198 (63%), Positives = 159/198 (80%), Gaps = 4/198 (2%)
Frame = -1
Query: 777 DGRRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQLL 598
+G+RRCAW TP +DPCYV FHDEEWGVPVH D++LFELL LSGAL+EL+W +IL RRQL
Sbjct: 174 EGKRRCAWATPTTDPCYVTFHDEEWGVPVHNDRRLFELLVLSGALAELTWPEILKRRQLF 233
Query: 597 REVFMDFDPVAVSKLNEKTV----TSAISLLSEVKLRSILDNARQVRKTIAEYGSFKKYM 430
RE+FM+FDP A+SK+NEK + ++A SLLSE KLR +L+NARQ+ K + E+GSF +Y
Sbjct: 234 REIFMEFDPAAISKINEKKLVAPGSTAHSLLSEQKLRVVLENARQILKIVDEFGSFDRYC 293
Query: 429 WNFVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHL 250
W F+++KP S+FRY RQVPVK+ KA+ ISKD++RRGFR V PTVIYSFMQAAGLTNDHL
Sbjct: 294 WGFLNHKPIVSKFRYPRQVPVKSPKADIISKDMMRRGFRGVGPTVIYSFMQAAGLTNDHL 353
Query: 249 IGCFRHQDCCVVAETTTT 196
+ CFR + C + T+
Sbjct: 354 VSCFRFEQCNAIPTLCTS 371
>gi|226529264|ref|NP_001141717.1| hypothetical protein LOC100273848 [Zea mays]
Length = 391
Score = 273 bits (696), Expect = 7e-071
Identities = 145/281 (51%), Positives = 186/281 (66%), Gaps = 20/281 (7%)
Frame = -1
Query: 1029 RRNSGASS-----MTASYSSDASSSCESSHLSVASSSSRKKAVVRRSGSVSSVGTTSAVV 865
RRN S ++AS SSDAS+ + + V + SV +
Sbjct: 102 RRNDAPLSHPGLPLSASCSSDASAESVRARRAFTGK-------VEKGRSVPTAQPKQGKA 154
Query: 864 VARRKQVDEKEDKAAAAAGGGVSNGDCFADGRRRCAWITPKSDPCYVAFHDEEWGVPVHE 685
V K V+ K + A V+ G+RRCAW+TP +DPCYV FHDEEWGVPVH
Sbjct: 155 VG--KAVESKPIRVEVVAPMTVTPE--AVQGKRRCAWVTPTTDPCYVTFHDEEWGVPVHN 210
Query: 684 DKKLFELLCLSGALSELSWTDILSRRQLLREVFMDFDPVAVSKLNEKTVTS----AISLL 517
D++LFELL LSGAL+EL+W +IL +RQL RE+FM+FDP AVS++NEK + + A SLL
Sbjct: 211 DRRLFELLVLSGALAELTWPEILKKRQLFREIFMEFDPAAVSEINEKKLVAPGCVAHSLL 270
Query: 516 SEVKLRSILDNARQVRKTIAEYGSFKKYMWNFVSNKPTQSQFRYQRQVPVKTSKAEFISK 337
SE KLR++L+NARQ+ K E+GSF +Y W F+++KP S+FRY RQVPVK+ KA+ ISK
Sbjct: 271 SEQKLRAVLENARQILKIADEFGSFDRYCWGFLNHKPIVSKFRYPRQVPVKSPKADIISK 330
Query: 336 DLVRRGFRSVSPTVIYSFMQAAGLTNDHLIGCFRHQDCCVV 214
D++RRGFR V PTVIYSFMQAAGLTNDHL+ CFR + C V
Sbjct: 331 DMMRRGFRGVGPTVIYSFMQAAGLTNDHLVSCFRFEHCSAV 371
>gi|15242914|ref|NP_200605.1| DNA-3-methyladenine glycosylase I [Arabidopsis
thaliana]
Length = 347
Score = 270 bits (690), Expect = 4e-070
Identities = 125/190 (65%), Positives = 153/190 (80%), Gaps = 4/190 (2%)
Frame = -1
Query: 780 ADGRRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQL 601
++ ++RC W+TP SDPCY+ FHDEEWGVPVH+DK+LFELL LSGAL+E +W ILS+RQ
Sbjct: 150 SETKKRCTWVTPNSDPCYIVFHDEEWGVPVHDDKRLFELLVLSGALAEHTWPTILSKRQA 209
Query: 600 LREVFMDFDPVAVSKLNEKTV----TSAISLLSEVKLRSILDNARQVRKTIAEYGSFKKY 433
REVF DFDP A+ K+NEK + + A +LLS++KLR++++NARQ+ K I EYGSF KY
Sbjct: 210 FREVFADFDPNAIVKINEKKIIGPGSPASTLLSDLKLRAVIENARQILKVIEEYGSFDKY 269
Query: 432 MWNFVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDH 253
+W+FV NK S+FRYQRQVP KT KAE ISKDLVRRGFRSV PTV+YSFMQAAG+TNDH
Sbjct: 270 IWSFVKNKAIVSKFRYQRQVPAKTPKAEVISKDLVRRGFRSVGPTVVYSFMQAAGITNDH 329
Query: 252 LIGCFRHQDC 223
L CFR C
Sbjct: 330 LTSCFRFHHC 339
>gi|215697314|dbj|BAG91308.1| unnamed protein product [Oryza sativa Japonica
Group]
Length = 383
Score = 267 bits (682), Expect = 3e-069
Identities = 123/196 (62%), Positives = 157/196 (80%), Gaps = 4/196 (2%)
Frame = -1
Query: 771 RRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQLLRE 592
+RRC+W+T ++PCY AFHDEEWGVPVH+DK LFELL LSGAL+EL+W IL++R + RE
Sbjct: 149 KRRCSWVTANTEPCYAAFHDEEWGVPVHDDKVLFELLVLSGALAELTWPTILNKRPIFRE 208
Query: 591 VFMDFDPVAVSKLNEKTV----TSAISLLSEVKLRSILDNARQVRKTIAEYGSFKKYMWN 424
VFMDFDPV VSKL+EK + + + +LLSE KLR +++NARQ+ K + E+G+F KY W+
Sbjct: 209 VFMDFDPVLVSKLSEKKIIAPGSPSSTLLSEQKLRGVIENARQILKIVEEFGTFDKYCWS 268
Query: 423 FVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHLIG 244
FV+NKP S+FRY RQVPVKTSKA+ ISKDLVRRGFRSV PTV+Y+FMQ +G+TNDHLI
Sbjct: 269 FVNNKPILSRFRYPRQVPVKTSKADAISKDLVRRGFRSVGPTVVYTFMQVSGMTNDHLIS 328
Query: 243 CFRHQDCCVVAETTTT 196
C+R +C A + T
Sbjct: 329 CYRFAECAAAATGSNT 344
>gi|297793315|ref|XP_002864542.1| methyladenine glycosylase family protein
[Arabidopsis lyrata subsp. lyrata]
Length = 349
Score = 267 bits (681), Expect = 4e-069
Identities = 126/199 (63%), Positives = 155/199 (77%), Gaps = 4/199 (2%)
Frame = -1
Query: 807 GGVSNGDCFADGRRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSW 628
G + + ++ ++RCAW+T SDPCY+ FHDEEWGVPVH+DK+LFELL LSGAL+E +W
Sbjct: 143 GALDSPPSGSETKKRCAWVTSNSDPCYIVFHDEEWGVPVHDDKRLFELLVLSGALAEHTW 202
Query: 627 TDILSRRQLLREVFMDFDPVAVSKLNEKTV----TSAISLLSEVKLRSILDNARQVRKTI 460
ILS+RQ REVF DFDP A+ K+NEK + + A +LLS++KLR +++NARQ+ K I
Sbjct: 203 PMILSKRQTFREVFADFDPNAIVKINEKKLIGPGSPASTLLSDLKLRGVIENARQILKVI 262
Query: 459 AEYGSFKKYMWNFVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFM 280
EYGSF KY+W+FV NK S+FRYQRQVP KT KAE ISKDLVRRGFRSV PTV+YSFM
Sbjct: 263 EEYGSFDKYIWSFVKNKAIVSKFRYQRQVPAKTPKAEVISKDLVRRGFRSVGPTVVYSFM 322
Query: 279 QAAGLTNDHLIGCFRHQDC 223
QAAG+TNDHL CFR C
Sbjct: 323 QAAGVTNDHLTSCFRFHHC 341
>gi|90265145|emb|CAC09513.2| H0711G06.19 [Oryza sativa Indica Group]
Length = 383
Score = 266 bits (679), Expect = 7e-069
Identities = 122/196 (62%), Positives = 157/196 (80%), Gaps = 4/196 (2%)
Frame = -1
Query: 771 RRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQLLRE 592
+RRC+W+T ++PCY AFHDEEWGVPVH+DK LFELL LSGAL+EL+W IL++R + RE
Sbjct: 149 KRRCSWVTANTEPCYAAFHDEEWGVPVHDDKVLFELLVLSGALAELTWPTILNKRPIFRE 208
Query: 591 VFMDFDPVAVSKLNEKTV----TSAISLLSEVKLRSILDNARQVRKTIAEYGSFKKYMWN 424
VFMDFDP+ VSKL+EK + + + +LLSE KLR +++NARQ+ K + E+G+F KY W+
Sbjct: 209 VFMDFDPLLVSKLSEKKIIAPGSPSSTLLSEQKLRGVIENARQILKIVEEFGTFDKYCWS 268
Query: 423 FVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHLIG 244
FV+NKP S+FRY RQVPVKTSKA+ ISKDLVRRGFRSV PTV+Y+FMQ +G+TNDHLI
Sbjct: 269 FVNNKPILSRFRYPRQVPVKTSKADAISKDLVRRGFRSVGPTVVYTFMQVSGMTNDHLIS 328
Query: 243 CFRHQDCCVVAETTTT 196
C+R +C A + T
Sbjct: 329 CYRFAECAAAATGSNT 344
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,053,028,657,214
Number of Sequences: 15229318
Number of Extensions: 1053028657214
Number of Successful Extensions: 300909508
Number of sequences better than 0.0: 0
|