GenBank BLAST output of UN08948


BLASTX 7.6.2

Query= UN08948 /QuerySize=1324
        (1323 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|297850016|ref|XP_002892889.1| methyladenine glycosylase famil...    389   9e-106
gi|15218379|ref|NP_173049.1| DNA glycosylase-like protein [Arabi...    385   1e-104
gi|297839853|ref|XP_002887808.1| hypothetical protein ARALYDRAFT...    374   3e-101
gi|15220860|ref|NP_178200.1| methyladenine glycosylase-like prot...    373   4e-101
gi|224142383|ref|XP_002324538.1| predicted protein [Populus tric...    307   5e-081
gi|224091765|ref|XP_002309346.1| predicted protein [Populus tric...    305   1e-080
gi|255576987|ref|XP_002529378.1| DNA-3-methyladenine glycosylase...    301   3e-079
gi|147823377|emb|CAN66338.1| hypothetical protein VITISV_026086 ...    287   5e-075
gi|225443176|ref|XP_002264482.1| PREDICTED: hypothetical protein...    285   1e-074
gi|125556288|gb|EAZ01894.1| hypothetical protein OsI_23919 [Oryz...    280   4e-073
gi|225445871|ref|XP_002276173.1| PREDICTED: hypothetical protein...    279   8e-073
gi|51534979|dbj|BAD38103.1| methyladenine glycosylase protein-li...    278   2e-072
gi|222635995|gb|EEE66127.1| hypothetical protein OsJ_22173 [Oryz...    278   2e-072
gi|242073624|ref|XP_002446748.1| hypothetical protein SORBIDRAFT...    277   3e-072
gi|242096536|ref|XP_002438758.1| hypothetical protein SORBIDRAFT...    274   3e-071
gi|226529264|ref|NP_001141717.1| hypothetical protein LOC1002738...    273   7e-071
gi|15242914|ref|NP_200605.1| DNA-3-methyladenine glycosylase I [...    270   4e-070
gi|215697314|dbj|BAG91308.1| unnamed protein product [Oryza sati...    267   3e-069
gi|297793315|ref|XP_002864542.1| methyladenine glycosylase famil...    267   4e-069
gi|90265145|emb|CAC09513.2| H0711G06.19 [Oryza sativa Indica Group]    266   7e-069

>gi|297850016|ref|XP_002892889.1| methyladenine glycosylase family protein
        [Arabidopsis lyrata subsp. lyrata]

          Length = 354

 Score =  389 bits (997), Expect = 9e-106
 Identities = 209/289 (72%), Positives = 230/289 (79%), Gaps = 18/289 (6%)
 Frame = -1

Query: 1011 SSMTASYSSDASSSCESSHLSVASSSSRKKAV-VRRSGSVSSVGTTSAVVVARRK-QVDE 838
            SS+ +S     S+S  +S+ S ASSS     + V  S S   V   S  V + RK  + +
Sbjct:   72 SSLCSSILRKNSASMTASYSSDASSSCESSPLSVASSSSCKKVMRRSGSVSSTRKLSIGK 131

Query:  837 KEDKAAAAAGGGVSNGDCFADGRRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLC 658
            +EDK A         GDCFADGRRRCAWITPK+DPCYVAFHDEEWGVPV +DKKLFELLC
Sbjct:  132 EEDKVA---------GDCFADGRRRCAWITPKADPCYVAFHDEEWGVPVDDDKKLFELLC 182

Query:  657 LSGALSELSWTDILSRRQLLREVFMDFDPVAVSKLNEKTV----TSAISLLSEVKLRSIL 490
            LSGAL+ELSWTDILSRRQLLREVFMDFDPVAVS++N+K +    T+AISLLSEVK+RSIL
Sbjct:  183 LSGALAELSWTDILSRRQLLREVFMDFDPVAVSEMNDKKLTAPGTAAISLLSEVKIRSIL 242

Query:  489 DNARQVRKTIAEYGSFKKYMWNFVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRS 310
            DN+R VRK IAE GSFKKYMWNFV+NKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRS
Sbjct:  243 DNSRHVRKIIAECGSFKKYMWNFVNNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRS 302

Query:  309 VSPTVIYSFMQAAGLTNDHLIGCFRHQDCCVVAETTTTKA---KKTERE 172
            VSPTVIYSFMQAAGLTNDHLIGCFR QDCCV AETTTT     KK ERE
Sbjct:  303 VSPTVIYSFMQAAGLTNDHLIGCFRFQDCCVDAETTTTTTKAKKKNERE 351

>gi|15218379|ref|NP_173049.1| DNA glycosylase-like protein [Arabidopsis
        thaliana]

          Length = 352

 Score =  385 bits (987), Expect = 1e-104
 Identities = 214/355 (60%), Positives = 250/355 (70%), Gaps = 27/355 (7%)
 Frame = -1

Query: 1182 PPGMKLENSKKMTTTNTIESKDEKTKKKP-----DPPASPTTTL-----KQCSSLCSSLL 1033
            PP  +  NS +    + +     K ++KP     + P    T +     K       +  
Sbjct:    4 PPRFRSVNSDEREFRSVLGPTGNKLQRKPPGMKLEKPMMEKTIIDSKDEKAKKPTTPASP 63

Query: 1032 RRRNSGASSMTASYSSDASSSCESSHLSVASSS--SRKKAVVRRSGSVSSVGTTSAVVVA 859
            R      SS+ +S     S+S  +S+ S ASSS  S   +V   S     V  + +V   
Sbjct:   64 RTTLKQCSSLCSSILRKNSASMTASYSSDASSSCESSPLSVASSSSCKKVVRRSGSVSST 123

Query:  858 RRKQVDEKEDKAAAAAGGGVSNGDCFADGRRRCAWITPKSDPCYVAFHDEEWGVPVHEDK 679
            R+  V ++E+K          +GDCFADGR+RCAWITPK+DPCYVAFHDEEWGVPVH+DK
Sbjct:  124 RKLSVGKEEEKV---------SGDCFADGRKRCAWITPKADPCYVAFHDEEWGVPVHDDK 174

Query:  678 KLFELLCLSGALSELSWTDILSRRQLLREVFMDFDPVAVSKLNEKTV----TSAISLLSE 511
            KLFELLCLSGAL+ELSWTDILSRR +LREVFMDFDPVAV++LN+K +    T+AISLLSE
Sbjct:  175 KLFELLCLSGALAELSWTDILSRRHILREVFMDFDPVAVAELNDKKLTAPGTAAISLLSE 234

Query:  510 VKLRSILDNARQVRKTIAEYGSFKKYMWNFVSNKPTQSQFRYQRQVPVKTSKAEFISKDL 331
            VK+RSILDN+R VRK IAE GS KKYMWNFV+NKPTQSQFRYQRQVPVKTSKAEFISKDL
Sbjct:  235 VKIRSILDNSRHVRKIIAECGSLKKYMWNFVNNKPTQSQFRYQRQVPVKTSKAEFISKDL 294

Query:  330 VRRGFRSVSPTVIYSFMQAAGLTNDHLIGCFRHQDCCVVAETTTT--KAKKTERE 172
            VRRGFRSVSPTVIYSFMQAAGLTNDHLIGCFR+QDCCV AETTTT    KK ERE
Sbjct:  295 VRRGFRSVSPTVIYSFMQAAGLTNDHLIGCFRYQDCCVDAETTTTTKAKKKNERE 349

>gi|297839853|ref|XP_002887808.1| hypothetical protein ARALYDRAFT_477160
        [Arabidopsis lyrata subsp. lyrata]

          Length = 323

 Score =  374 bits (958), Expect = 3e-101
 Identities = 198/295 (67%), Positives = 227/295 (76%), Gaps = 24/295 (8%)
 Frame = -1

Query: 1077 TTTLKQCSSLCSSLLRRRNSGASSMTASYSSDASSSCESSHLSVASSSSRKKAVVRRSGS 898
            T  + QCS L  S+LRR      SMTASYSSDASSSCESS LSVAS+SS K+A +RRSGS
Sbjct:   47 TKNMPQCSPLSPSILRR---NGISMTASYSSDASSSCESSPLSVASTSSGKRA-LRRSGS 102

Query:  897 VSSVGTTSAVVVARRKQVDEKEDKAAAAAGGGVSNGDCFADGRRRCAWITPKSDPCYVAF 718
            +SS  +       RR   +E+++KA+          DCF+DGR+RCAWITPKS  CY+AF
Sbjct:  103 LSSSSS------LRRNLTEERDEKAS----------DCFSDGRKRCAWITPKSGQCYIAF 146

Query:  717 HDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQLLREVFMDFDPVAVSKLNEKTV 538
            HD EWGVPVH+DK+LFELL LSGAL+ELSW DILS+RQL REVFMDFDP+A+S+L  K +
Sbjct:  147 HDTEWGVPVHDDKRLFELLSLSGALAELSWKDILSKRQLFREVFMDFDPIAISELTNKKI 206

Query:  537 TSA----ISLLSEVKLRSILDNARQVRKTIAEYGSFKKYMWNFVSNKPTQSQFRYQRQVP 370
            TS+     +LLSE KLRSIL+NA QV K I E+GSF KY+WNFV+ KPTQSQFRY RQVP
Sbjct:  207 TSSEIATTTLLSEQKLRSILENANQVCKLIVEFGSFDKYIWNFVNQKPTQSQFRYPRQVP 266

Query:  369 VKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHLIGCFRHQDCCVVAET 205
            VKTSKAE ISKDLVRRGFRSVSPTVIYSFMQ AGLTNDHL  CFRH DC    ET
Sbjct:  267 VKTSKAELISKDLVRRGFRSVSPTVIYSFMQTAGLTNDHLTCCFRHHDCMTKDET 321

>gi|15220860|ref|NP_178200.1| methyladenine glycosylase-like protein
        [Arabidopsis thaliana]

          Length = 327

 Score =  373 bits (957), Expect = 4e-101
 Identities = 198/295 (67%), Positives = 225/295 (76%), Gaps = 24/295 (8%)
 Frame = -1

Query: 1077 TTTLKQCSSLCSSLLRRRNSGASSMTASYSSDASSSCESSHLSVASSSSRKKAVVRRSGS 898
            T  + QCS L   +LRR      SMTASYSSDASSSCESS LS+ S+SS K+ V+RRSGS
Sbjct:   51 TEKMPQCSPLSPPILRR---NGISMTASYSSDASSSCESSPLSMTSTSSGKR-VLRRSGS 106

Query:  897 VSSVGTTSAVVVARRKQVDEKEDKAAAAAGGGVSNGDCFADGRRRCAWITPKSDPCYVAF 718
            VSS  +       RR   +E+++KA+          DCF DGR+RCAWITPKSD CY+AF
Sbjct:  107 VSSSSS------LRRNLTEERDEKAS----------DCFCDGRKRCAWITPKSDQCYIAF 150

Query:  717 HDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQLLREVFMDFDPVAVSKLNEKTV 538
            HDEEWGVPVH+DK+LFELL LSGAL+ELSW DILS+RQL REVFMDFDP+A+S+L  K +
Sbjct:  151 HDEEWGVPVHDDKRLFELLSLSGALAELSWKDILSKRQLFREVFMDFDPIAISELTNKKI 210

Query:  537 TS----AISLLSEVKLRSILDNARQVRKTIAEYGSFKKYMWNFVSNKPTQSQFRYQRQVP 370
            TS    A +LLSE KLRSIL+NA QV K I  +GSF KY+WNFV+ KPTQSQFRY RQVP
Sbjct:  211 TSPEIAATTLLSEQKLRSILENANQVCKIIGAFGSFDKYIWNFVNQKPTQSQFRYPRQVP 270

Query:  369 VKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHLIGCFRHQDCCVVAET 205
            VKTSKAE ISKDLVRRGFRSVSPTVIYSFMQ AGLTNDHL  CFRH DC    ET
Sbjct:  271 VKTSKAELISKDLVRRGFRSVSPTVIYSFMQTAGLTNDHLTCCFRHHDCMTKDET 325

>gi|224142383|ref|XP_002324538.1| predicted protein [Populus trichocarpa]

          Length = 380

 Score =  307 bits (784), Expect = 5e-081
 Identities = 178/365 (48%), Positives = 224/365 (61%), Gaps = 27/365 (7%)
 Frame = -1

Query: 1245 VNSDERDFRSVLGPTGNKQRKPPGMKLENSK--KMTTTNTIESKDEKTKKKPDPPASPTT 1072
            +N  + + RSVLGPTGN +  P   +   SK  +    +  E K  + KK    PA  T 
Sbjct:   10 MNVADSEARSVLGPTGNNKAGPLSARKPVSKQSRKVEKSPEEVKLGEEKKTLTVPAVGTL 69

Query: 1071 TLKQCSSLCSSLLRRRN---SGASSMTASYSSDASSSCESSHLSVASSSSRKKAVVRRSG 901
            + K  S   SS+LRR         S+ AS SSDAS           + S   +A   R  
Sbjct:   70 SPKSHSLNISSVLRRHELLLHSNLSLNASCSSDAS-----------TDSFHSRASTGRLT 118

Query:  900 SVSSVGTTSAVVVARRKQVDEKEDKAAAAAGGGVSNGDCFADGRRRCAWITPKSDPCYVA 721
              +S GT       RRKQ   +     +  G         +  ++ CAW+TP +DPCY  
Sbjct:  119 RSNSAGT-------RRKQYVLRPRSFVSEGGLESPPSPDDSQSKKSCAWVTPNTDPCYAT 171

Query:  720 FHDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQLLREVFMDFDPVAVSKLNEKT 541
            FHDEEWGVP+H+D+KLFELL LSGAL+EL+W  ILS+R + REVF DFDP+AVSK NEK 
Sbjct:  172 FHDEEWGVPIHDDRKLFELLVLSGALAELTWPAILSKRHIFREVFADFDPIAVSKFNEKK 231

Query:  540 V----TSAISLLSEVKLRSILDNARQVRKTIAEYGSFKKYMWNFVSNKPTQSQFRYQRQV 373
            +    ++A SLLSE+KLR+I++NARQ+ K I E+GSF KY+W+FV+ KP  S+FRY RQV
Sbjct:  232 ILAPGSTATSLLSELKLRAIVENARQISKVIDEFGSFDKYIWSFVNYKPIVSRFRYPRQV 291

Query:  372 PVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHLIGCFRHQDCCVVAETTTTK 193
            PVKT KA+ ISKDLVRRGFRSV PTVIYSFMQ AG+TNDHLI CFR Q+C   AE     
Sbjct:  292 PVKTPKADAISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLISCFRFQECLDAAEGKVEN 351

Query:  192 AKKTE 178
              K+E
Sbjct:  352 GIKSE 356

>gi|224091765|ref|XP_002309346.1| predicted protein [Populus trichocarpa]

          Length = 381

 Score =  305 bits (781), Expect = 1e-080
 Identities = 179/371 (48%), Positives = 225/371 (60%), Gaps = 36/371 (9%)
 Frame = -1

Query: 1248 SVNSDERDFRSVLGPTGNKQ-------RKPPGMKLENSKKMTTTNTIESKDEKTKKKPDP 1090
            S+N  + + R VLGPTGN +       RKP   +L    K    +  E+K  + KK    
Sbjct:    9 SMNVADSEARPVLGPTGNTKAGPLTSARKPASKQLRKDGK----SPEEAKLGEEKKVLTV 64

Query: 1089 PASPTTTLKQCSSLCSSLLRRRNS---GASSMTASYSSDASSSCESSHLSVASSSSRKKA 919
            P     + K  S   SS+LRR         S+ AS SSDAS           + S   +A
Sbjct:   65 PTVGNLSPKSLSGNFSSVLRRHEQLLHSNLSLNASCSSDAS-----------TDSFHSRA 113

Query:  918 VVRRSGSVSSVGTTSAVVVARRKQVDEKEDKAAAAAGGGVSNGDCFADGRRRCAWITPKS 739
               R    ++VGT       RRKQ   K     +  G         +  ++ CAW+TP +
Sbjct:  114 STGRLIRSNNVGT-------RRKQYVSKPRSVVSDGGLESLPSSDGSQSKKSCAWVTPNT 166

Query:  738 DPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQLLREVFMDFDPVAVS 559
            DPCY AFHDEEWG+PVH+D+KLFELL LSGAL+EL+W  ILS+R + REVF DFDP+AVS
Sbjct:  167 DPCYTAFHDEEWGLPVHDDRKLFELLVLSGALAELTWPAILSKRHMFREVFADFDPIAVS 226

Query:  558 KLNEKTV----TSAISLLSEVKLRSILDNARQVRKTIAEYGSFKKYMWNFVSNKPTQSQF 391
            K NEK +    ++A SLLSE+KLR+I++NARQ+ K I E+GSF KY+W+FV+ KP  S+F
Sbjct:  227 KFNEKKIIAPGSTAASLLSELKLRAIIENARQISKVIDEFGSFDKYIWSFVNYKPIVSRF 286

Query:  390 RYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHLIGCFRHQDCCVVA 211
            RY RQVP KT KA+ ISKDLVRRGFRSV PTVIYSFMQ AG+TNDHLI CFR Q+C   A
Sbjct:  287 RYPRQVPAKTPKADAISKDLVRRGFRSVGPTVIYSFMQVAGVTNDHLISCFRFQECIDAA 346

Query:  210 ETTTTKAKKTE 178
            E       K+E
Sbjct:  347 EGKEENGIKSE 357

>gi|255576987|ref|XP_002529378.1| DNA-3-methyladenine glycosylase, putative
        [Ricinus communis]

          Length = 380

 Score =  301 bits (769), Expect = 3e-079
 Identities = 175/365 (47%), Positives = 218/365 (59%), Gaps = 27/365 (7%)
 Frame = -1

Query: 1245 VNSDERDFRSVLGPTGNKQRKPPGMKLENSKKMTTTNTIES--KDEKTKKKPDPPASPTT 1072
            +N  + + R VLGPTGN +      K   SK++    T     K  + KK    P +   
Sbjct:   10 MNVADSETRPVLGPTGNNKAGSLSAKKPASKQLRKVETSPEAVKLGQEKKLVTVPTASAL 69

Query: 1071 TLKQCSSLCSSLLRRRNS---GASSMTASYSSDASSSCESSHLSVASSSSRKKAVVRRSG 901
            + K  S    S+LRR         S+ AS SSDAS           + S   +A   R  
Sbjct:   70 SPKSHSVSVPSVLRRHEQLLHSNLSLNASCSSDAS-----------TDSFHSRASTGRLT 118

Query:  900 SVSSVGTTSAVVVARRKQVDEKEDKAAAAAGGGVSNGDCFADGRRRCAWITPKSDPCYVA 721
              +S+GT       RRKQ   K     +  G         +  ++ CAW+TP +DPCY A
Sbjct:  119 RSNSLGT-------RRKQYALKPRSVVSDGGLESPPPSDGSQAKKSCAWVTPNADPCYTA 171

Query:  720 FHDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQLLREVFMDFDPVAVSKLNEKT 541
            FHDEEWG+PVH+DKKLFELL LSGAL+EL+W  ILS+R + REVF +FDPV VSK NEK 
Sbjct:  172 FHDEEWGIPVHDDKKLFELLVLSGALAELTWPAILSKRHIFREVFANFDPVVVSKFNEKK 231

Query:  540 V----TSAISLLSEVKLRSILDNARQVRKTIAEYGSFKKYMWNFVSNKPTQSQFRYQRQV 373
            +    ++A SLLSE+KLR+I++NARQ+ K   E GSF KY+W+FV+ KP  S+FRY RQV
Sbjct:  232 IIAPGSTASSLLSEIKLRAIIENARQISKVTDELGSFDKYIWSFVNYKPIVSRFRYPRQV 291

Query:  372 PVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHLIGCFRHQDCCVVAETTTTK 193
            PVKT KA+ ISKDLVRRGFRSV PTV+YSFMQ AGLTNDHLI CFR Q+C   AE     
Sbjct:  292 PVKTPKADVISKDLVRRGFRSVGPTVVYSFMQVAGLTNDHLISCFRFQECINAAEGKEEN 351

Query:  192 AKKTE 178
              K E
Sbjct:  352 GVKVE 356

>gi|147823377|emb|CAN66338.1| hypothetical protein VITISV_026086 [Vitis
        vinifera]

          Length = 431

 Score =  287 bits (732), Expect = 5e-075
 Identities = 141/255 (55%), Positives = 180/255 (70%), Gaps = 22/255 (8%)
 Frame = -1

Query: 972 SCESSHLSVASSSSRKKAVVRRSGSVSSVGTTSAVVVARRKQVDEKEDKAAAAAG-GGVS 796
           SC SS  S + +SSR+ +                    RRK    K DK     G   V+
Sbjct: 161 SCSSSESSSSRASSRRSS-----------------TPIRRKHFSPKADKXEKTGGRPSVA 203

Query: 795 NGDCFADGRRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSWTDIL 616
           + +C    +RRCAW+TP +DPCY AFHDEEWGVPVH+DK+ FELL LSGAL+EL+W  IL
Sbjct: 204 SDNCALQAKRRCAWVTPNTDPCYAAFHDEEWGVPVHDDKRHFELLVLSGALAELTWPAIL 263

Query: 615 SRRQLLREVFMDFDPVAVSKLNEKTVTS----AISLLSEVKLRSILDNARQVRKTIAEYG 448
            +R + REVF++FDP+AVSKLNEK + +    A SL+S++KLRS+++NARQ+ K I E+G
Sbjct: 264 QKRHIFREVFLEFDPIAVSKLNEKKIVTPGSPATSLVSDLKLRSVIENARQICKIIGEFG 323

Query: 447 SFKKYMWNFVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAG 268
           SF +Y+W FV++KP   +FRY RQVPVKT+KA+ ISKDLVRRGFRSV PTVIY+FMQ AG
Sbjct: 324 SFDQYIWGFVNHKPMVGRFRYPRQVPVKTAKADVISKDLVRRGFRSVGPTVIYAFMQVAG 383

Query: 267 LTNDHLIGCFRHQDC 223
           +TNDHL  CFR Q+C
Sbjct: 384 ITNDHLTSCFRFQEC 398

>gi|225443176|ref|XP_002264482.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 360

 Score =  285 bits (729), Expect = 1e-074
 Identities = 141/255 (55%), Positives = 179/255 (70%), Gaps = 22/255 (8%)
 Frame = -1

Query: 972 SCESSHLSVASSSSRKKAVVRRSGSVSSVGTTSAVVVARRKQVDEKEDKAAAAAG-GGVS 796
           SC SS  S + +SSR+ +                    RRK    K DK     G   V+
Sbjct:  90 SCSSSESSSSRASSRRSS-----------------TPIRRKHFSPKADKVEKTGGRPSVA 132

Query: 795 NGDCFADGRRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSWTDIL 616
           + +C    +RRCAW+TP +DPCY AFHDEEWGVPVH+DK+ FELL LSGAL+EL+W  IL
Sbjct: 133 SDNCALQAKRRCAWVTPNTDPCYAAFHDEEWGVPVHDDKRHFELLVLSGALAELTWPAIL 192

Query: 615 SRRQLLREVFMDFDPVAVSKLNEKTVTS----AISLLSEVKLRSILDNARQVRKTIAEYG 448
            +R + REVF++FDP+AVSKLNEK + +    A SL+S++KLRS+++NARQ+ K I E+G
Sbjct: 193 QKRHIFREVFLEFDPIAVSKLNEKKIVTPGSPATSLVSDLKLRSVIENARQICKIIGEFG 252

Query: 447 SFKKYMWNFVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAG 268
           SF +Y+W FV++KP   +FRY RQVPVKT+KA+ ISKDLVRRGFRSV PTVIY FMQ AG
Sbjct: 253 SFDQYIWGFVNHKPMVGRFRYPRQVPVKTAKADVISKDLVRRGFRSVGPTVIYVFMQVAG 312

Query: 267 LTNDHLIGCFRHQDC 223
           +TNDHL  CFR Q+C
Sbjct: 313 ITNDHLTSCFRFQEC 327

>gi|125556288|gb|EAZ01894.1| hypothetical protein OsI_23919 [Oryza sativa Indica
        Group]

          Length = 426

 Score =  280 bits (716), Expect = 4e-073
 Identities = 147/257 (57%), Positives = 184/257 (71%), Gaps = 10/257 (3%)
 Frame = -1

Query: 978 SSSCESSHLSVASSSSRKKAVVRRSGSVSSVGTTSAVVVARRKQVDEKEDKA-AAAAGGG 802
           ++SC SS  SV S   R  +  R   S S V    A  V RR +   K   A   AA   
Sbjct: 120 NASC-SSDASVESLRGRDSSGGRLERSWSRV----APAVPRRGKTPVKAAAAEKVAADAE 174

Query: 801 VSNGDCFADGRRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSWTD 622
           V        G+RRCAW+TP SDPCYV FHDEEWGVPVH+D++LFELL LSGAL+EL+W +
Sbjct: 175 VVAPATPEAGKRRCAWVTPTSDPCYVIFHDEEWGVPVHDDRRLFELLVLSGALAELTWPE 234

Query: 621 ILSRRQLLREVFMDFDPVAVSKLNEKTVTS----AISLLSEVKLRSILDNARQVRKTIAE 454
           IL RRQL RE+F+DFDPVA+SK+NEK + +    A SLLSE KLR++++NARQ+ K + E
Sbjct: 235 ILKRRQLFREIFVDFDPVAISKINEKKLVAPGSVANSLLSEQKLRAVVENARQILKIVDE 294

Query: 453 YGSFKKYMWNFVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQA 274
           +GSF +Y W F+++KP  S+FRY RQVPVK+ KA+ ISKD+VRRGFR V PT+IYSFMQA
Sbjct: 295 FGSFDRYCWGFLNHKPIVSKFRYPRQVPVKSPKADMISKDMVRRGFRGVGPTIIYSFMQA 354

Query: 273 AGLTNDHLIGCFRHQDC 223
           AGLTNDHL+ CFR ++C
Sbjct: 355 AGLTNDHLVSCFRFKEC 371

>gi|225445871|ref|XP_002276173.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 375

 Score =  279 bits (713), Expect = 8e-073
 Identities = 146/255 (57%), Positives = 181/255 (70%), Gaps = 6/255 (2%)
 Frame = -1

Query: 960 SHLSVASSSSRKKAVVRRSGSVSSVGTTSAVVVARRKQVDEKEDKAAAAAGGGVSNGDCF 781
           S+LS+ +S S   +        S+   T +   ARR+    K  K   + G   S  D  
Sbjct:  88 SNLSLNASCSSDASTDSFHSRASTGRITRSSSTARRRSYASK-PKVIVSDGVSESPPDGL 146

Query: 780 ADGRRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQL 601
              +RRCAW+TP +D  Y+AFHDEEWGVPVH+DKKLFELL LSGAL+EL+W  ILS+R +
Sbjct: 147 -KAKRRCAWVTPNTDLSYIAFHDEEWGVPVHDDKKLFELLVLSGALAELTWPTILSKRHI 205

Query: 600 LREVFMDFDPVAVSKLNEKTVTS----AISLLSEVKLRSILDNARQVRKTIAEYGSFKKY 433
            REVF DFDP+AV+KLNEK + +    A SL+SE+KLR I++NARQ+ K I E+GSF +Y
Sbjct: 206 FREVFADFDPIAVAKLNEKKLMAPGSIASSLISELKLRGIIENARQMSKVIDEFGSFDEY 265

Query: 432 MWNFVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDH 253
           +W+FV++KP  S+FRY R VPVKT KA+ ISKDLVRRGFRSV PTVIYSFMQ AG+TNDH
Sbjct: 266 IWSFVNHKPIVSRFRYPRHVPVKTPKADVISKDLVRRGFRSVGPTVIYSFMQVAGITNDH 325

Query: 252 LIGCFRHQDCCVVAE 208
           LI CFR QDC   AE
Sbjct: 326 LISCFRFQDCVTAAE 340

>gi|51534979|dbj|BAD38103.1| methyladenine glycosylase protein-like [Oryza
        sativa Japonica Group]

          Length = 433

 Score =  278 bits (710), Expect = 2e-072
 Identities = 126/188 (67%), Positives = 159/188 (84%), Gaps = 4/188 (2%)
 Frame = -1

Query: 774 GRRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQLLR 595
           G+RRCAW+TP SDPCYV FHDEEWGVPVH+D++LFELL LSGAL+EL+W +IL RRQL R
Sbjct: 191 GKRRCAWVTPTSDPCYVIFHDEEWGVPVHDDRRLFELLVLSGALAELTWPEILKRRQLFR 250

Query: 594 EVFMDFDPVAVSKLNEKTVTS----AISLLSEVKLRSILDNARQVRKTIAEYGSFKKYMW 427
           E+F+DFDPVA+SK+NEK + +    A SLLSE KLR++++NARQ+ K + E+GSF +Y W
Sbjct: 251 EIFVDFDPVAISKINEKKLVAPGSVANSLLSEQKLRAVVENARQILKIVDEFGSFDRYCW 310

Query: 426 NFVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHLI 247
            F+++KP  S+FRY RQVPVK+ KA+ ISKD+VRRGFR V PT+IYSFMQAAGLTNDHL+
Sbjct: 311 GFLNHKPIVSKFRYPRQVPVKSPKADMISKDMVRRGFRGVGPTIIYSFMQAAGLTNDHLV 370

Query: 246 GCFRHQDC 223
            CFR ++C
Sbjct: 371 SCFRFKEC 378

>gi|222635995|gb|EEE66127.1| hypothetical protein OsJ_22173 [Oryza sativa
        Japonica Group]

          Length = 410

 Score =  278 bits (710), Expect = 2e-072
 Identities = 126/188 (67%), Positives = 159/188 (84%), Gaps = 4/188 (2%)
 Frame = -1

Query: 774 GRRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQLLR 595
           G+RRCAW+TP SDPCYV FHDEEWGVPVH+D++LFELL LSGAL+EL+W +IL RRQL R
Sbjct: 168 GKRRCAWVTPTSDPCYVIFHDEEWGVPVHDDRRLFELLVLSGALAELTWPEILKRRQLFR 227

Query: 594 EVFMDFDPVAVSKLNEKTVTS----AISLLSEVKLRSILDNARQVRKTIAEYGSFKKYMW 427
           E+F+DFDPVA+SK+NEK + +    A SLLSE KLR++++NARQ+ K + E+GSF +Y W
Sbjct: 228 EIFVDFDPVAISKINEKKLVAPGSVANSLLSEQKLRAVVENARQILKIVDEFGSFDRYCW 287

Query: 426 NFVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHLI 247
            F+++KP  S+FRY RQVPVK+ KA+ ISKD+VRRGFR V PT+IYSFMQAAGLTNDHL+
Sbjct: 288 GFLNHKPIVSKFRYPRQVPVKSPKADMISKDMVRRGFRGVGPTIIYSFMQAAGLTNDHLV 347

Query: 246 GCFRHQDC 223
            CFR ++C
Sbjct: 348 SCFRFKEC 355

>gi|242073624|ref|XP_002446748.1| hypothetical protein SORBIDRAFT_06g021680
        [Sorghum bicolor]

          Length = 389

 Score =  277 bits (708), Expect = 3e-072
 Identities = 148/273 (54%), Positives = 184/273 (67%), Gaps = 20/273 (7%)
 Frame = -1

Query: 1008 SMTASYSSDASSSCESSHLSVASSSSRKKAVVRRSGSVSSVGTTSAVVVARRKQVDEKED 829
            S+ AS SSDAS+    S  S      R     R+  ++S              Q D K  
Sbjct:   88 SLDASCSSDASTDSFCSRAS-TGRIGRPAFGARKKKTLS--------------QTDYKPV 132

Query:  828 KAAAAAGGGVSNGDCFADGRRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSG 649
                  GG  S  D  A  +RRCAW+T  +DPCY AFHDEEWGVPVH+DKKLFELL LSG
Sbjct:  133 SMLEREGGLASQIDA-AGVKRRCAWVTANTDPCYAAFHDEEWGVPVHDDKKLFELLVLSG 191

Query:  648 ALSELSWTDILSRRQLLREVFMDFDPVAVSKLNEKTV----TSAISLLSEVKLRSILDNA 481
            AL+EL+W  IL++R + REVFMDFDPV VSKL+EK +    + + SLLSE KLR +++NA
Sbjct:  192 ALAELTWPAILNKRDIFREVFMDFDPVLVSKLSEKKIIAPGSPSSSLLSEQKLRGVIENA 251

Query:  480 RQVRKTIAEYGSFKKYMWNFVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSP 301
            RQ+ K I E+GSF KY W+FV++KP  S+FRY RQVPVKTSKA+ ISKDLVRRGFRSV P
Sbjct:  252 RQILKIIEEFGSFDKYCWSFVNHKPILSRFRYSRQVPVKTSKADAISKDLVRRGFRSVGP 311

Query:  300 TVIYSFMQAAGLTNDHLIGCFRHQDCCVVAETT 202
            TV+Y+FMQ +G+TNDHLI C+R  +C   ++ T
Sbjct:  312 TVVYTFMQVSGMTNDHLISCYRFAECAASSDGT 344

>gi|242096536|ref|XP_002438758.1| hypothetical protein SORBIDRAFT_10g025650
        [Sorghum bicolor]

          Length = 412

 Score =  274 bits (700), Expect = 3e-071
 Identities = 126/198 (63%), Positives = 159/198 (80%), Gaps = 4/198 (2%)
 Frame = -1

Query: 777 DGRRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQLL 598
           +G+RRCAW TP +DPCYV FHDEEWGVPVH D++LFELL LSGAL+EL+W +IL RRQL 
Sbjct: 174 EGKRRCAWATPTTDPCYVTFHDEEWGVPVHNDRRLFELLVLSGALAELTWPEILKRRQLF 233

Query: 597 REVFMDFDPVAVSKLNEKTV----TSAISLLSEVKLRSILDNARQVRKTIAEYGSFKKYM 430
           RE+FM+FDP A+SK+NEK +    ++A SLLSE KLR +L+NARQ+ K + E+GSF +Y 
Sbjct: 234 REIFMEFDPAAISKINEKKLVAPGSTAHSLLSEQKLRVVLENARQILKIVDEFGSFDRYC 293

Query: 429 WNFVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHL 250
           W F+++KP  S+FRY RQVPVK+ KA+ ISKD++RRGFR V PTVIYSFMQAAGLTNDHL
Sbjct: 294 WGFLNHKPIVSKFRYPRQVPVKSPKADIISKDMMRRGFRGVGPTVIYSFMQAAGLTNDHL 353

Query: 249 IGCFRHQDCCVVAETTTT 196
           + CFR + C  +    T+
Sbjct: 354 VSCFRFEQCNAIPTLCTS 371

>gi|226529264|ref|NP_001141717.1| hypothetical protein LOC100273848 [Zea mays]

          Length = 391

 Score =  273 bits (696), Expect = 7e-071
 Identities = 145/281 (51%), Positives = 186/281 (66%), Gaps = 20/281 (7%)
 Frame = -1

Query: 1029 RRNSGASS-----MTASYSSDASSSCESSHLSVASSSSRKKAVVRRSGSVSSVGTTSAVV 865
            RRN    S     ++AS SSDAS+    +  +           V +  SV +        
Sbjct:  102 RRNDAPLSHPGLPLSASCSSDASAESVRARRAFTGK-------VEKGRSVPTAQPKQGKA 154

Query:  864 VARRKQVDEKEDKAAAAAGGGVSNGDCFADGRRRCAWITPKSDPCYVAFHDEEWGVPVHE 685
            V   K V+ K  +    A   V+       G+RRCAW+TP +DPCYV FHDEEWGVPVH 
Sbjct:  155 VG--KAVESKPIRVEVVAPMTVTPE--AVQGKRRCAWVTPTTDPCYVTFHDEEWGVPVHN 210

Query:  684 DKKLFELLCLSGALSELSWTDILSRRQLLREVFMDFDPVAVSKLNEKTVTS----AISLL 517
            D++LFELL LSGAL+EL+W +IL +RQL RE+FM+FDP AVS++NEK + +    A SLL
Sbjct:  211 DRRLFELLVLSGALAELTWPEILKKRQLFREIFMEFDPAAVSEINEKKLVAPGCVAHSLL 270

Query:  516 SEVKLRSILDNARQVRKTIAEYGSFKKYMWNFVSNKPTQSQFRYQRQVPVKTSKAEFISK 337
            SE KLR++L+NARQ+ K   E+GSF +Y W F+++KP  S+FRY RQVPVK+ KA+ ISK
Sbjct:  271 SEQKLRAVLENARQILKIADEFGSFDRYCWGFLNHKPIVSKFRYPRQVPVKSPKADIISK 330

Query:  336 DLVRRGFRSVSPTVIYSFMQAAGLTNDHLIGCFRHQDCCVV 214
            D++RRGFR V PTVIYSFMQAAGLTNDHL+ CFR + C  V
Sbjct:  331 DMMRRGFRGVGPTVIYSFMQAAGLTNDHLVSCFRFEHCSAV 371

>gi|15242914|ref|NP_200605.1| DNA-3-methyladenine glycosylase I [Arabidopsis
        thaliana]

          Length = 347

 Score =  270 bits (690), Expect = 4e-070
 Identities = 125/190 (65%), Positives = 153/190 (80%), Gaps = 4/190 (2%)
 Frame = -1

Query: 780 ADGRRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQL 601
           ++ ++RC W+TP SDPCY+ FHDEEWGVPVH+DK+LFELL LSGAL+E +W  ILS+RQ 
Sbjct: 150 SETKKRCTWVTPNSDPCYIVFHDEEWGVPVHDDKRLFELLVLSGALAEHTWPTILSKRQA 209

Query: 600 LREVFMDFDPVAVSKLNEKTV----TSAISLLSEVKLRSILDNARQVRKTIAEYGSFKKY 433
            REVF DFDP A+ K+NEK +    + A +LLS++KLR++++NARQ+ K I EYGSF KY
Sbjct: 210 FREVFADFDPNAIVKINEKKIIGPGSPASTLLSDLKLRAVIENARQILKVIEEYGSFDKY 269

Query: 432 MWNFVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDH 253
           +W+FV NK   S+FRYQRQVP KT KAE ISKDLVRRGFRSV PTV+YSFMQAAG+TNDH
Sbjct: 270 IWSFVKNKAIVSKFRYQRQVPAKTPKAEVISKDLVRRGFRSVGPTVVYSFMQAAGITNDH 329

Query: 252 LIGCFRHQDC 223
           L  CFR   C
Sbjct: 330 LTSCFRFHHC 339

>gi|215697314|dbj|BAG91308.1| unnamed protein product [Oryza sativa Japonica
        Group]

          Length = 383

 Score =  267 bits (682), Expect = 3e-069
 Identities = 123/196 (62%), Positives = 157/196 (80%), Gaps = 4/196 (2%)
 Frame = -1

Query: 771 RRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQLLRE 592
           +RRC+W+T  ++PCY AFHDEEWGVPVH+DK LFELL LSGAL+EL+W  IL++R + RE
Sbjct: 149 KRRCSWVTANTEPCYAAFHDEEWGVPVHDDKVLFELLVLSGALAELTWPTILNKRPIFRE 208

Query: 591 VFMDFDPVAVSKLNEKTV----TSAISLLSEVKLRSILDNARQVRKTIAEYGSFKKYMWN 424
           VFMDFDPV VSKL+EK +    + + +LLSE KLR +++NARQ+ K + E+G+F KY W+
Sbjct: 209 VFMDFDPVLVSKLSEKKIIAPGSPSSTLLSEQKLRGVIENARQILKIVEEFGTFDKYCWS 268

Query: 423 FVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHLIG 244
           FV+NKP  S+FRY RQVPVKTSKA+ ISKDLVRRGFRSV PTV+Y+FMQ +G+TNDHLI 
Sbjct: 269 FVNNKPILSRFRYPRQVPVKTSKADAISKDLVRRGFRSVGPTVVYTFMQVSGMTNDHLIS 328

Query: 243 CFRHQDCCVVAETTTT 196
           C+R  +C   A  + T
Sbjct: 329 CYRFAECAAAATGSNT 344

>gi|297793315|ref|XP_002864542.1| methyladenine glycosylase family protein
        [Arabidopsis lyrata subsp. lyrata]

          Length = 349

 Score =  267 bits (681), Expect = 4e-069
 Identities = 126/199 (63%), Positives = 155/199 (77%), Gaps = 4/199 (2%)
 Frame = -1

Query: 807 GGVSNGDCFADGRRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSW 628
           G + +    ++ ++RCAW+T  SDPCY+ FHDEEWGVPVH+DK+LFELL LSGAL+E +W
Sbjct: 143 GALDSPPSGSETKKRCAWVTSNSDPCYIVFHDEEWGVPVHDDKRLFELLVLSGALAEHTW 202

Query: 627 TDILSRRQLLREVFMDFDPVAVSKLNEKTV----TSAISLLSEVKLRSILDNARQVRKTI 460
             ILS+RQ  REVF DFDP A+ K+NEK +    + A +LLS++KLR +++NARQ+ K I
Sbjct: 203 PMILSKRQTFREVFADFDPNAIVKINEKKLIGPGSPASTLLSDLKLRGVIENARQILKVI 262

Query: 459 AEYGSFKKYMWNFVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFM 280
            EYGSF KY+W+FV NK   S+FRYQRQVP KT KAE ISKDLVRRGFRSV PTV+YSFM
Sbjct: 263 EEYGSFDKYIWSFVKNKAIVSKFRYQRQVPAKTPKAEVISKDLVRRGFRSVGPTVVYSFM 322

Query: 279 QAAGLTNDHLIGCFRHQDC 223
           QAAG+TNDHL  CFR   C
Sbjct: 323 QAAGVTNDHLTSCFRFHHC 341

>gi|90265145|emb|CAC09513.2| H0711G06.19 [Oryza sativa Indica Group]

          Length = 383

 Score =  266 bits (679), Expect = 7e-069
 Identities = 122/196 (62%), Positives = 157/196 (80%), Gaps = 4/196 (2%)
 Frame = -1

Query: 771 RRRCAWITPKSDPCYVAFHDEEWGVPVHEDKKLFELLCLSGALSELSWTDILSRRQLLRE 592
           +RRC+W+T  ++PCY AFHDEEWGVPVH+DK LFELL LSGAL+EL+W  IL++R + RE
Sbjct: 149 KRRCSWVTANTEPCYAAFHDEEWGVPVHDDKVLFELLVLSGALAELTWPTILNKRPIFRE 208

Query: 591 VFMDFDPVAVSKLNEKTV----TSAISLLSEVKLRSILDNARQVRKTIAEYGSFKKYMWN 424
           VFMDFDP+ VSKL+EK +    + + +LLSE KLR +++NARQ+ K + E+G+F KY W+
Sbjct: 209 VFMDFDPLLVSKLSEKKIIAPGSPSSTLLSEQKLRGVIENARQILKIVEEFGTFDKYCWS 268

Query: 423 FVSNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHLIG 244
           FV+NKP  S+FRY RQVPVKTSKA+ ISKDLVRRGFRSV PTV+Y+FMQ +G+TNDHLI 
Sbjct: 269 FVNNKPILSRFRYPRQVPVKTSKADAISKDLVRRGFRSVGPTVVYTFMQVSGMTNDHLIS 328

Query: 243 CFRHQDCCVVAETTTT 196
           C+R  +C   A  + T
Sbjct: 329 CYRFAECAAAATGSNT 344

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,053,028,657,214
Number of Sequences: 15229318
Number of Extensions: 1053028657214
Number of Successful Extensions: 300909508
Number of sequences better than 0.0: 0