BLASTX 7.6.2
Query= UN41159 /QuerySize=1994
(1993 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|166221587|sp|Q9LPD2.2|GENL1_ARATH RecName: Full=Flap endonucl... 919 3e-265
gi|334182222|ref|NP_171691.2| 5'-3' exonuclease-like protein [Ar... 919 3e-265
gi|334182224|ref|NP_001184887.1| 5'-3' exonuclease-like protein ... 913 2e-263
gi|255541446|ref|XP_002511787.1| DNA binding protein, putative [... 655 9e-186
gi|8570440|gb|AAF76467.1|AC020622_1 Contains similarity to excis... 632 1e-178
gi|124360235|gb|ABN08248.1| Helix-hairpin-helix motif, class 2 [... 597 4e-168
gi|297842934|ref|XP_002889348.1| hypothetical protein ARALYDRAFT... 519 8e-145
gi|110430659|gb|ABG73449.1| DNA repair protein [Oryza brachyantha] 512 1e-142
gi|75288736|sp|Q64MA3.1|GENL1_ORYSJ RecName: Full=Flap endonucle... 502 1e-139
gi|125564417|gb|EAZ09797.1| hypothetical protein OsI_32084 [Oryz... 499 8e-139
gi|124360865|gb|ABN08837.1| Helix-hairpin-helix motif, class 2 [... 492 2e-136
gi|293334819|ref|NP_001169528.1| hypothetical protein LOC1003834... 382 2e-103
gi|225453885|ref|XP_002273159.1| PREDICTED: hypothetical protein... 374 5e-101
gi|63098616|gb|AAY32559.1| single strand DNA repair-like protein... 354 3e-095
gi|242045348|ref|XP_002460545.1| hypothetical protein SORBIDRAFT... 292 2e-076
>gi|166221587|sp|Q9LPD2.2|GENL1_ARATH RecName: Full=Flap endonuclease GEN-like
1
Length = 599
Score = 919 bits (2375), Expect = 3e-265
Identities = 470/614 (76%), Positives = 516/614 (84%), Gaps = 30/614 (4%)
Frame = +2
Query: 86 MGVGGKFWDLVRPYGRNEGSDYLREKRVAVDLSFWIIQHETAVKGFALKPHLRLTFFRTI 265
MGVGG FWDL+RPY + +G D+LR KRVAVDLSFWI+QHETAVKGF LKPHLRLTFFRTI
Sbjct: 1 MGVGGNFWDLLRPYAQQQGFDFLRNKRVAVDLSFWIVQHETAVKGFVLKPHLRLTFFRTI 60
Query: 266 NLFSKFGAYPVFVVDGTPSPLKSHTRISRFYRSSGIDTSSLQ--EGVVSVERNKQFCEWV 439
NLFSKFGAYPVFVVDGTPSPLKS RISRF+RSSGIDT +L + VSVERNK F EWV
Sbjct: 61 NLFSKFGAYPVFVVDGTPSPLKSQARISRFFRSSGIDTCNLPVIKDGVSVERNKLFSEWV 120
Query: 440 TECVELLKLLGIPVLKANGEAEALCAQLNSHGFVDACITPDSDAFLFGASSVIKSIKPNS 619
ECVELL+LLGIPVLKANGEAEALCAQLNS GFVDACITPDSDAFLFGA VIK IKPNS
Sbjct: 121 RECVELLELLGIPVLKANGEAEALCAQLNSQGFVDACITPDSDAFLFGAMCVIKDIKPNS 180
Query: 620 TEPFECYHMSDIEAGLGLKRRHLIAISLLVGNDFDSGGVSGIGLDKALRIVRAFSEDDIL 799
EPFECYHMS IE+GLGLKR+HLIAISLLVGND+DSGGV GIG+DKALRIVR FSED +L
Sbjct: 181 REPFECYHMSHIESGLGLKRKHLIAISLLVGNDYDSGGVLGIGVDKALRIVREFSEDQVL 240
Query: 800 QRLEDIGKGFKPAVSGGTKSVDVDDDDDGVESSSQMKRRLPHCSRCGHPGSKRSHFKSSC 979
+RL+DIG G +PAV GG KS DDG E S+MK+R PHCSRCGH GSKR+HFKSSC
Sbjct: 241 ERLQDIGNGLQPAVPGGIKS-----GDDGEEFRSEMKKRSPHCSRCGHLGSKRTHFKSSC 295
Query: 980 EHCSSDSGCIKKPLEFTCECSFCTKDRELKEQKKTENWWIKVCDRIALGPDFPNRKIIQL 1159
EHC DSGCIKKPL F CECSFC+KDR+L+EQKKT +WWIKVCD+IAL P+FPNRKII+L
Sbjct: 296 EHCGCDSGCIKKPLGFRCECSFCSKDRDLREQKKTNDWWIKVCDKIALAPEFPNRKIIEL 355
Query: 1160 YLSDS-FTEDGSSMSWGTPDTEMLVDCLVFNLHWDPCYVRKMLLPMLSTIYLREKARSSN 1336
YLSD T DGSSMSWGTPDT MLVD +VF LHWDP YVRKMLLPMLSTIYLREKAR +N
Sbjct: 356 YLSDGLMTGDGSSMSWGTPDTGMLVDLMVFKLHWDPSYVRKMLLPMLSTIYLREKAR-NN 414
Query: 1337 TGNPLLCDQYEFHSIKCMKTRYGHKSFVIRWRKPRSTSGLT-----PEKPIVVWEDEEEV 1501
TG LLCDQYEFHSIKC+KTRYGH+SFVIRWRKP+STSG + PE+ IVV E+EEE
Sbjct: 415 TGYALLCDQYEFHSIKCIKTRYGHQSFVIRWRKPKSTSGYSHSHNEPEESIVVLEEEEES 474
Query: 1502 VEEEECVGLLDGLNEPQVQDDNGECFLLTDECVGLVQSAFPEETEHFLKEKKLRESKKKN 1681
V+ LDGLNEPQVQ+DNG+CFLLTDEC+GLVQSAFP+ETEHFL EKKLRESKKKN
Sbjct: 475 VDP------LDGLNEPQVQNDNGDCFLLTDECIGLVQSAFPDETEHFLHEKKLRESKKKN 528
Query: 1682 VCEGAA----SSSVGAQRSITDFYRSTKAAATPAQSVDTGGSSIAFASVEKKRQATSGSF 1849
V E ++++G QRSITDFYRS K AA QS++TGGSS AS EKKRQATS S
Sbjct: 529 VSEEETATPRATTMGVQRSITDFYRSAKKAAA-GQSIETGGSS--KASAEKKRQATSTSS 585
Query: 1850 S---KSVRRRLLFG 1882
S KSVRRRLLFG
Sbjct: 586 SNLTKSVRRRLLFG 599
>gi|334182222|ref|NP_171691.2| 5'-3' exonuclease-like protein [Arabidopsis
thaliana]
Length = 599
Score = 919 bits (2375), Expect = 3e-265
Identities = 470/614 (76%), Positives = 516/614 (84%), Gaps = 30/614 (4%)
Frame = +2
Query: 86 MGVGGKFWDLVRPYGRNEGSDYLREKRVAVDLSFWIIQHETAVKGFALKPHLRLTFFRTI 265
MGVGG FWDL+RPY + +G D+LR KRVAVDLSFWI+QHETAVKGF LKPHLRLTFFRTI
Sbjct: 1 MGVGGNFWDLLRPYAQQQGFDFLRNKRVAVDLSFWIVQHETAVKGFVLKPHLRLTFFRTI 60
Query: 266 NLFSKFGAYPVFVVDGTPSPLKSHTRISRFYRSSGIDTSSLQ--EGVVSVERNKQFCEWV 439
NLFSKFGAYPVFVVDGTPSPLKS RISRF+RSSGIDT +L + VSVERNK F EWV
Sbjct: 61 NLFSKFGAYPVFVVDGTPSPLKSQARISRFFRSSGIDTCNLPVIKDGVSVERNKLFSEWV 120
Query: 440 TECVELLKLLGIPVLKANGEAEALCAQLNSHGFVDACITPDSDAFLFGASSVIKSIKPNS 619
ECVELL+LLGIPVLKANGEAEALCAQLNS GFVDACITPDSDAFLFGA VIK IKPNS
Sbjct: 121 RECVELLELLGIPVLKANGEAEALCAQLNSQGFVDACITPDSDAFLFGAMCVIKDIKPNS 180
Query: 620 TEPFECYHMSDIEAGLGLKRRHLIAISLLVGNDFDSGGVSGIGLDKALRIVRAFSEDDIL 799
EPFECYHMS IE+GLGLKR+HLIAISLLVGND+DSGGV GIG+DKALRIVR FSED +L
Sbjct: 181 REPFECYHMSHIESGLGLKRKHLIAISLLVGNDYDSGGVLGIGVDKALRIVREFSEDQVL 240
Query: 800 QRLEDIGKGFKPAVSGGTKSVDVDDDDDGVESSSQMKRRLPHCSRCGHPGSKRSHFKSSC 979
+RL+DIG G +PAV GG KS DDG E S+MK+R PHCSRCGH GSKR+HFKSSC
Sbjct: 241 ERLQDIGNGLQPAVPGGIKS-----GDDGEEFRSEMKKRSPHCSRCGHLGSKRTHFKSSC 295
Query: 980 EHCSSDSGCIKKPLEFTCECSFCTKDRELKEQKKTENWWIKVCDRIALGPDFPNRKIIQL 1159
EHC DSGCIKKPL F CECSFC+KDR+L+EQKKT +WWIKVCD+IAL P+FPNRKII+L
Sbjct: 296 EHCGCDSGCIKKPLGFRCECSFCSKDRDLREQKKTNDWWIKVCDKIALAPEFPNRKIIEL 355
Query: 1160 YLSDS-FTEDGSSMSWGTPDTEMLVDCLVFNLHWDPCYVRKMLLPMLSTIYLREKARSSN 1336
YLSD T DGSSMSWGTPDT MLVD +VF LHWDP YVRKMLLPMLSTIYLREKAR +N
Sbjct: 356 YLSDGLMTGDGSSMSWGTPDTGMLVDLMVFKLHWDPSYVRKMLLPMLSTIYLREKAR-NN 414
Query: 1337 TGNPLLCDQYEFHSIKCMKTRYGHKSFVIRWRKPRSTSGLT-----PEKPIVVWEDEEEV 1501
TG LLCDQYEFHSIKC+KTRYGH+SFVIRWRKP+STSG + PE+ IVV E+EEE
Sbjct: 415 TGYALLCDQYEFHSIKCIKTRYGHQSFVIRWRKPKSTSGYSHSHSEPEESIVVLEEEEES 474
Query: 1502 VEEEECVGLLDGLNEPQVQDDNGECFLLTDECVGLVQSAFPEETEHFLKEKKLRESKKKN 1681
V+ LDGLNEPQVQ+DNG+CFLLTDEC+GLVQSAFP+ETEHFL EKKLRESKKKN
Sbjct: 475 VDP------LDGLNEPQVQNDNGDCFLLTDECIGLVQSAFPDETEHFLHEKKLRESKKKN 528
Query: 1682 VCEGAA----SSSVGAQRSITDFYRSTKAAATPAQSVDTGGSSIAFASVEKKRQATSGSF 1849
V E ++++G QRSITDFYRS K AA QS++TGGSS AS EKKRQATS S
Sbjct: 529 VSEEETATPRATTMGVQRSITDFYRSAKKAAA-GQSIETGGSS--KASAEKKRQATSTSS 585
Query: 1850 S---KSVRRRLLFG 1882
S KSVRRRLLFG
Sbjct: 586 SNLTKSVRRRLLFG 599
>gi|334182224|ref|NP_001184887.1| 5'-3' exonuclease-like protein [Arabidopsis
thaliana]
Length = 598
Score = 913 bits (2359), Expect = 2e-263
Identities = 469/614 (76%), Positives = 515/614 (83%), Gaps = 31/614 (5%)
Frame = +2
Query: 86 MGVGGKFWDLVRPYGRNEGSDYLREKRVAVDLSFWIIQHETAVKGFALKPHLRLTFFRTI 265
MGVGG FWDL+RPY + +G D+LR KRVAVDLSFWI+QHETAVKGF LKPHLRLTFFRTI
Sbjct: 1 MGVGGNFWDLLRPYAQQQGFDFLRNKRVAVDLSFWIVQHETAVKGFVLKPHLRLTFFRTI 60
Query: 266 NLFSKFGAYPVFVVDGTPSPLKSHTRISRFYRSSGIDTSSLQ--EGVVSVERNKQFCEWV 439
NLFSKFGAYPVFVVDGTPSPLKS RISRF+RSSGIDT +L + VSVERNK F EWV
Sbjct: 61 NLFSKFGAYPVFVVDGTPSPLKSQARISRFFRSSGIDTCNLPVIKDGVSVERNKLFSEWV 120
Query: 440 TECVELLKLLGIPVLKANGEAEALCAQLNSHGFVDACITPDSDAFLFGASSVIKSIKPNS 619
EC ELL+LLGIPVLKANGEAEALCAQLNS GFVDACITPDSDAFLFGA VIK IKPNS
Sbjct: 121 REC-ELLELLGIPVLKANGEAEALCAQLNSQGFVDACITPDSDAFLFGAMCVIKDIKPNS 179
Query: 620 TEPFECYHMSDIEAGLGLKRRHLIAISLLVGNDFDSGGVSGIGLDKALRIVRAFSEDDIL 799
EPFECYHMS IE+GLGLKR+HLIAISLLVGND+DSGGV GIG+DKALRIVR FSED +L
Sbjct: 180 REPFECYHMSHIESGLGLKRKHLIAISLLVGNDYDSGGVLGIGVDKALRIVREFSEDQVL 239
Query: 800 QRLEDIGKGFKPAVSGGTKSVDVDDDDDGVESSSQMKRRLPHCSRCGHPGSKRSHFKSSC 979
+RL+DIG G +PAV GG KS DDG E S+MK+R PHCSRCGH GSKR+HFKSSC
Sbjct: 240 ERLQDIGNGLQPAVPGGIKS-----GDDGEEFRSEMKKRSPHCSRCGHLGSKRTHFKSSC 294
Query: 980 EHCSSDSGCIKKPLEFTCECSFCTKDRELKEQKKTENWWIKVCDRIALGPDFPNRKIIQL 1159
EHC DSGCIKKPL F CECSFC+KDR+L+EQKKT +WWIKVCD+IAL P+FPNRKII+L
Sbjct: 295 EHCGCDSGCIKKPLGFRCECSFCSKDRDLREQKKTNDWWIKVCDKIALAPEFPNRKIIEL 354
Query: 1160 YLSDS-FTEDGSSMSWGTPDTEMLVDCLVFNLHWDPCYVRKMLLPMLSTIYLREKARSSN 1336
YLSD T DGSSMSWGTPDT MLVD +VF LHWDP YVRKMLLPMLSTIYLREKAR +N
Sbjct: 355 YLSDGLMTGDGSSMSWGTPDTGMLVDLMVFKLHWDPSYVRKMLLPMLSTIYLREKAR-NN 413
Query: 1337 TGNPLLCDQYEFHSIKCMKTRYGHKSFVIRWRKPRSTSGLT-----PEKPIVVWEDEEEV 1501
TG LLCDQYEFHSIKC+KTRYGH+SFVIRWRKP+STSG + PE+ IVV E+EEE
Sbjct: 414 TGYALLCDQYEFHSIKCIKTRYGHQSFVIRWRKPKSTSGYSHSHSEPEESIVVLEEEEES 473
Query: 1502 VEEEECVGLLDGLNEPQVQDDNGECFLLTDECVGLVQSAFPEETEHFLKEKKLRESKKKN 1681
V+ LDGLNEPQVQ+DNG+CFLLTDEC+GLVQSAFP+ETEHFL EKKLRESKKKN
Sbjct: 474 VDP------LDGLNEPQVQNDNGDCFLLTDECIGLVQSAFPDETEHFLHEKKLRESKKKN 527
Query: 1682 VCEGAA----SSSVGAQRSITDFYRSTKAAATPAQSVDTGGSSIAFASVEKKRQATSGSF 1849
V E ++++G QRSITDFYRS K AA QS++TGGSS AS EKKRQATS S
Sbjct: 528 VSEEETATPRATTMGVQRSITDFYRSAKKAAA-GQSIETGGSS--KASAEKKRQATSTSS 584
Query: 1850 S---KSVRRRLLFG 1882
S KSVRRRLLFG
Sbjct: 585 SNLTKSVRRRLLFG 598
>gi|255541446|ref|XP_002511787.1| DNA binding protein, putative [Ricinus
communis]
Length = 609
Score = 655 bits (1689), Expect = 9e-186
Identities = 356/615 (57%), Positives = 445/615 (72%), Gaps = 24/615 (3%)
Frame = +2
Query: 86 MGVGGKFWDLVRPYGRNEGSDYLREKRVAVDLSFWIIQHETAVKGFALKPHLRLTFFRTI 265
MGVGGKFWD+++PY R+EG D+LREKRVA+DLS+WI+QHETA+K +A KPHLRLTFFRTI
Sbjct: 1 MGVGGKFWDILKPYTRHEGPDFLREKRVAIDLSYWIVQHETAIKSYARKPHLRLTFFRTI 60
Query: 266 NLFSKFGAYPVFVVDGTPSPLKSHTRISRFYRSSGIDTSSL---QEGVVSVERNKQFCEW 436
NLFSKFGA+PVFVVDGTPSPLKS RISRF+RSSGID+S L +EG VSVERN F +
Sbjct: 61 NLFSKFGAFPVFVVDGTPSPLKSRARISRFFRSSGIDSSVLPTPEEG-VSVERNGAFLKC 119
Query: 437 VTECVELLKLLGIPVLKANGEAEALCAQLNSHGFVDACITPDSDAFLFGASSVIKSIKPN 616
V ECVELL+L G+PVLKANGEAEALCAQLNS G VDACIT DSDAFLFGA VIKSIKPN
Sbjct: 120 VKECVELLELFGMPVLKANGEAEALCAQLNSQGLVDACITADSDAFLFGAKCVIKSIKPN 179
Query: 617 STEPFECYHMSDIEAGLGLKRRHLIAISLLVGNDFDSGGVSGIGLDKALRIVRAFSEDDI 796
S EPFECY MSDIE+GL LKR+HLIAI+LLVGND D GV GIG+D ALR V+ F ED+I
Sbjct: 180 SKEPFECYQMSDIESGLALKRKHLIAIALLVGNDHDLNGVQGIGVDTALRFVQTFHEDEI 239
Query: 797 LQRLEDIGKGFKPAVSGGTKSVDVDDDDDGVESSSQMKRRLPHCSRCGHPGSKRSHFKSS 976
L L +IGKG G ++ V+ D D E+S +K ++ HCS CGHPGSKR+HFKSS
Sbjct: 240 LNCLREIGKGNTNIFLGVSRVVE-DLMIDPHENS--LKSKISHCSFCGHPGSKRAHFKSS 296
Query: 977 CEHC--SSDSGCIKKPLEFTCECSFCTKDRELKEQKKTENWWIKVCDRIALGPDFPNRKI 1150
CE+C S+ GC KK F C C C KDR+ KE++K ENW IKVCD++ + P+FPN I
Sbjct: 297 CEYCGNSNGEGCTKKSGAFRCNCGSCNKDRKAKEEQKRENWQIKVCDKMFMEPNFPNDDI 356
Query: 1151 IQLYLSDS---FTEDGSS-MSWGTPDTEMLVDCLVFNLHWDPCYVRKMLLPMLSTIYLRE 1318
I++YL ++ FTED + +SWG+P+T+MLVD L F+ W P Y+R+ +LP+LSTIYLR+
Sbjct: 357 IEMYLCNNHAEFTEDDDTCLSWGSPNTDMLVDFLAFHKLWHPSYIRQRILPVLSTIYLRD 416
Query: 1319 KARSSNTGNPLLCDQYEFHSIKCMKTRYGHKSFVIRWRKPRSTSGLTPEKPIVVWED--E 1492
A + LL QYEF SI+ +K RYGH+S+VI+W+K +T IV D +
Sbjct: 417 MA--AKPEKALLYGQYEFDSIQRIKVRYGHESYVIKWKKAANTISSNICINIVEELDKHQ 474
Query: 1493 EEVVEEEECVGLLDGLNEPQVQDDNGECFLLTDECVGLVQSAFPEETEHFLKEKKLRESK 1672
E++V+ +E + L+ N P+ D+G FLLTDE + LVQ+AFP+ + FLKEK+ +ESK
Sbjct: 475 EDIVKTDESIDQLEEYNVPKSYVDDGCWFLLTDENMDLVQNAFPDAVDKFLKEKEQKESK 534
Query: 1673 KKNVCEGAASSSV---GAQRSITDFYRSTK---AAATPAQSVDTGGSSIAFASVEKKRQA 1834
++ S SV G Q +IT+FYRSTK AA+ + D + S E KR+
Sbjct: 535 RRLSSSTEKSESVKSKGVQLNITEFYRSTKVQFAASGGEEQADCSENQDDVISTE-KRKI 593
Query: 1835 TSGSFSKSVRRRLLF 1879
+S + KSVRRRLLF
Sbjct: 594 SSSNLPKSVRRRLLF 608
>gi|8570440|gb|AAF76467.1|AC020622_1 Contains similarity to excision repair
protein ERCC5 from Homo sapiens gi|1082359 and contains XPG N-terminal
PF|00752 and XPG I-region PF|00867 domains [Arabidopsis thaliana]
Length = 497
Score = 632 bits (1628), Expect = 1e-178
Identities = 311/399 (77%), Positives = 342/399 (85%), Gaps = 9/399 (2%)
Frame = +2
Query: 86 MGVGGKFWDLVRPYGRNEGSDYLREKRVAVDLSFWIIQHETAVKGFALKPHLRLTFFRTI 265
MGVGG FWDL+RPY + +G D+LR KRVAVDLSFWI+QHETAVKGF LKPHLRLTFFRTI
Sbjct: 1 MGVGGNFWDLLRPYAQQQGFDFLRNKRVAVDLSFWIVQHETAVKGFVLKPHLRLTFFRTI 60
Query: 266 NLFSKFGAYPVFVVDGTPSPLKSHTRISRFYRSSGIDTSSLQ--EGVVSVERNKQFCEWV 439
NLFSKFGAYPVFVVDGTPSPLKS RISRF+RSSGIDT +L + VSVERNK F EWV
Sbjct: 61 NLFSKFGAYPVFVVDGTPSPLKSQARISRFFRSSGIDTCNLPVIKDGVSVERNKLFSEWV 120
Query: 440 TECVELLKLLGIPVLKANGEAEALCAQLNSHGFVDACITPDSDAFLFGASSVIKSIKPNS 619
ECVELL+LLGIPVLKANGEAEALCAQLNS GFVDACITPDSDAFLFGA VIK IKPNS
Sbjct: 121 RECVELLELLGIPVLKANGEAEALCAQLNSQGFVDACITPDSDAFLFGAMCVIKDIKPNS 180
Query: 620 TEPFECYHMSDIEAGLGLKRRHLIAISLLVGNDFDSGGVSGIGLDKALRIVRAFSEDDIL 799
EPFECYHMS IE+GLGLKR+HLIAISLLVGND+DSGGV GIG+DKALRIVR FSED +L
Sbjct: 181 REPFECYHMSHIESGLGLKRKHLIAISLLVGNDYDSGGVLGIGVDKALRIVREFSEDQVL 240
Query: 800 QRLEDIGKGFKPAVSGGTKSVDVDDDDDGVESSSQMKRRLPHCSRCGHPGSKRSHFKSSC 979
+RL+DIG G +PAV GG KS DDG E S+MK+R PHCSRCGH GSKR+HFKSSC
Sbjct: 241 ERLQDIGNGLQPAVPGGIKS-----GDDGEEFRSEMKKRSPHCSRCGHLGSKRTHFKSSC 295
Query: 980 EHCSSDSGCIKKPLEFTCECSFCTKDRELKEQKKTENWWIKVCDRIALGPDFPNRKIIQL 1159
EHC DSGCIKKPL F CECSFC+KDR+L+EQKKT +WWIKVCD+IAL P+FPNRKII+L
Sbjct: 296 EHCGCDSGCIKKPLGFRCECSFCSKDRDLREQKKTNDWWIKVCDKIALAPEFPNRKIIEL 355
Query: 1160 YLSDS-FTEDGSSMSWGTPDTEMLVDCLVFNLHWDPCYV 1273
YLSD T DGSSMSWGTPDT MLVD +V N + D C++
Sbjct: 356 YLSDGLMTGDGSSMSWGTPDTGMLVDLMVQNDNGD-CFL 393
>gi|124360235|gb|ABN08248.1| Helix-hairpin-helix motif, class 2 [Medicago
truncatula]
Length = 612
Score = 597 bits (1537), Expect = 4e-168
Identities = 308/542 (56%), Positives = 390/542 (71%), Gaps = 21/542 (3%)
Frame = +2
Query: 86 MGVGGKFWDLVRPYGRNEGSDYLREKRVAVDLSFWIIQHETAVKGFALKPHLRLTFFRTI 265
MGVGG FW+L++PY RNEG D+LR KRVA+DLSFWI+QH A+K KPHLRLTFFRTI
Sbjct: 1 MGVGGNFWELLKPYSRNEGFDFLRNKRVAIDLSFWIVQHNNAIKTHVKKPHLRLTFFRTI 60
Query: 266 NLFSKFGAYPVFVVDGTPSPLKSHTRISRFYRSSGIDTSSL---QEGVVSVERNKQFCEW 436
NLFSKFGA+PVFVVDGTPSPLKS RI+RF+RSSGI+++SL +EG VS RN F
Sbjct: 61 NLFSKFGAFPVFVVDGTPSPLKSQARIARFFRSSGIESTSLPVAEEG-VSAGRNSTFSRC 119
Query: 437 VTECVELLKLLGIPVLKANGEAEALCAQLNSHGFVDACITPDSDAFLFGASSVIKSIKPN 616
V ECVEL KLLGIPVLKA GEAEALCAQLNS G VDACITPDSDAFLFGA +IKS PN
Sbjct: 120 VQECVELAKLLGIPVLKAKGEAEALCAQLNSEGHVDACITPDSDAFLFGAKCIIKSFSPN 179
Query: 617 STEPFECYHMSDIEAGLGLKRRHLIAISLLVGNDFDSGGVSGIGLDKALRIVRAFSEDDI 796
S EPFECY+MSDIEAGLGLKR+HLIAISLLVGND D GV GIG+D ALR V+AF EDDI
Sbjct: 180 SKEPFECYNMSDIEAGLGLKRKHLIAISLLVGNDHDLSGVQGIGIDSALRFVQAFGEDDI 239
Query: 797 LQRLEDIGKGFKPAVSGGTKSVDVDDDDDGVESSSQMKRRLPHCSRCGHPGSKRSHFKSS 976
L RL +IGKG V K+ + D D ++ Q HCS CGHPG+KR H K S
Sbjct: 240 LNRLHEIGKGNAFQVPIDIKAEENMDIDGNSPNTKQ-----THCSFCGHPGNKRDHMKFS 294
Query: 977 CEHCSSD--SGCIKKPLEFTCECSFCTKDRELKEQKKTENWWIKVCDRIALGPDFPNRKI 1150
CE C +D GC+KKP F C+C+ C +R+ KEQKK ENW K+CD+IA P+FP +I
Sbjct: 295 CEFCVADDNEGCLKKPEGFKCDCNSCCMNRKHKEQKKMENWHTKICDKIAKEPNFPKDEI 354
Query: 1151 IQLYLSDS----FTEDGSSMSWGTPDTEMLVDCLVFNLHWDPCYVRKMLLPMLSTIYLRE 1318
I +YL + DG +SW P+ ++LVD L F+ +WDP Y+R+++ PM+STI+LRE
Sbjct: 355 IDMYLCNDNGYFSANDGPQISWERPNMDLLVDFLNFHQNWDPSYIRRIMFPMMSTIFLRE 414
Query: 1319 KARSSNTGNPLLCDQYEFHSIKCMKTRYGHKSFVIRWRKPRSTSGLTPEKPIVVWEDEEE 1498
A + + LL Q+EF S+K +KTRYG++ +V++W+ R+ + + P +E+
Sbjct: 415 MATTPT--DSLLFGQFEFASLKRVKTRYGYQFYVVKWK--RAMGNIASKTPANKSGMQED 470
Query: 1499 VVE--EEECVGLLDGLNEPQVQDDNGECFLLTDECVGLVQSAFPEETEHFLKEKKLRESK 1672
V+E +E V LLD + PQ+ +++G FLLTDE + LV +A+PEE + F +E++L++ K
Sbjct: 471 VIELDVDETVDLLDDCDFPQICEEDGCSFLLTDENMDLVGAAYPEEVKRFRQEQELKDVK 530
Query: 1673 KK 1678
+K
Sbjct: 531 RK 532
>gi|297842934|ref|XP_002889348.1| hypothetical protein ARALYDRAFT_311256
[Arabidopsis lyrata subsp. lyrata]
Length = 590
Score = 519 bits (1336), Expect = 8e-145
Identities = 256/316 (81%), Positives = 274/316 (86%), Gaps = 7/316 (2%)
Frame = +2
Query: 86 MGVGGKFWDLVRPYGRNEGSDYLREKRVAVDLSFWIIQHETAVKGFALKPHLRLTFFRTI 265
MGVGG FWDL+RPY + G DYLR KRVAVDLSFWI+QHETAVKGF LKPHLRLTFFRTI
Sbjct: 1 MGVGGNFWDLLRPYAQQRGFDYLRNKRVAVDLSFWIVQHETAVKGFVLKPHLRLTFFRTI 60
Query: 266 NLFSKFGAYPVFVVDGTPSPLKSHTRISRFYRSSGIDTSSLQ--EGVVSVERNKQFCEWV 439
NLFSKFGAYPVFVVDGTPSPLKS RISRF+RSSGIDT +L + VSVERNK FCEWV
Sbjct: 61 NLFSKFGAYPVFVVDGTPSPLKSQARISRFFRSSGIDTCNLPVIKDGVSVERNKLFCEWV 120
Query: 440 TECVELLKLLGIPVLKANGEAEALCAQLNSHGFVDACITPDSDAFLFGASSVIKSIKPNS 619
ECVELL+LL IPVLKANGEAEALCAQLNS G+VDACITPDSDAFLFGA VIK IKPNS
Sbjct: 121 KECVELLELLSIPVLKANGEAEALCAQLNSEGYVDACITPDSDAFLFGAKCVIKDIKPNS 180
Query: 620 TEPFECYHMSDIEAGLGLKRRHLIAISLLVGNDFDSGGVSGIGLDKALRIVRAFSEDDIL 799
EPFECYHMSDIE+GLGLKR+HLIAISLLVGND+DSGGV GIG+DKALRIVR FSED+IL
Sbjct: 181 REPFECYHMSDIESGLGLKRKHLIAISLLVGNDYDSGGVLGIGVDKALRIVREFSEDEIL 240
Query: 800 QRLEDIGKGFKPAVSGGTKSVDVDDDDDGVESSSQMKRRLPHCSRCGHPGSKRSHFKSSC 979
+RL+DIGKG KP V GG KSV DDG E S+MK+R PHCSRCGH GSKR+HFKSSC
Sbjct: 241 ERLQDIGKGLKPTVPGGIKSV-----DDGEEFRSEMKKRSPHCSRCGHLGSKRTHFKSSC 295
Query: 980 EHCSSDSGCIKKPLEF 1027
EHC DSGCIKKPL F
Sbjct: 296 EHCGCDSGCIKKPLGF 311
>gi|110430659|gb|ABG73449.1| DNA repair protein [Oryza brachyantha]
Length = 629
Score = 512 bits (1317), Expect = 1e-142
Identities = 253/465 (54%), Positives = 331/465 (71%), Gaps = 21/465 (4%)
Frame = +2
Query: 86 MGVGGKFWDLVRPYGRNEGSDYLREKRVAVDLSFWIIQHETAVKG---FALKPHLRLTFF 256
MGVGG FWDL++PY R+EG+ YLR++RVAVDLSFW++ H TA++ A PHLR FF
Sbjct: 1 MGVGGSFWDLLKPYARHEGAGYLRDRRVAVDLSFWVVSHSTAIRARSPHARVPHLRTLFF 60
Query: 257 RTINLFSKFGAYPVFVVDGTPSPLKSHTRISRFYRSSGIDTSSL--QEGVVSVE-----R 415
RT++LFSK GAYPVFVVDG PSPLKS R +RF+R SG+D ++L EG + + R
Sbjct: 61 RTLSLFSKMGAYPVFVVDGEPSPLKSQARAARFFRGSGMDLATLPSTEGEANADSPVQPR 120
Query: 416 NKQFCEWVTECVELLKLLGIPVLKANGEAEALCAQLNSHGFVDACITPDSDAFLFGASSV 595
N +F +V ECVELL+ LG+PVL+A GE EALCAQLN+ G VDACIT DSDAFLFGA +V
Sbjct: 121 NAKFTRYVKECVELLEYLGMPVLRAKGEGEALCAQLNNEGHVDACITSDSDAFLFGAKTV 180
Query: 596 IKSIKPNSTEPFECYHMSDIEAGLGLKRRHLIAISLLVGNDFDSGGVSGIGLDKALRIVR 775
IK ++ N EPFECY+M+DIE+GLGLKR+ ++A++LLVG+D D GV G G + ALR V+
Sbjct: 181 IKVLRSNCKEPFECYNMTDIESGLGLKRKQMVAMALLVGSDHDLHGVPGFGPETALRFVQ 240
Query: 776 AFSEDDILQRLEDIGKGFKPAVSGGTKSVDVDDDDDGVESSSQMKRRLPHCSRCGHPGSK 955
F ED +L +L +IGKG P + G V + DD S++ R+PHCS CGHPG+K
Sbjct: 241 LFDEDTVLDKLYEIGKGVYPFIEG----VTAPNIDDLPSPSTKSLPRVPHCSHCGHPGNK 296
Query: 956 RSHFKSSCEHCSSDS--GCIKKPLEFTCECSFCTKDRELKEQKKTENWWIKVCDRIALGP 1129
++H KS C C DS C++KP F CEC C K R+LKE+++ ENW IKVC RIA
Sbjct: 297 KNHIKSGCNFCLVDSLENCVEKPTGFICECPSCDKARDLKERRRNENWQIKVCKRIAAET 356
Query: 1130 DFPNRKIIQLYLSDSFTEDGS---SMSWGTPDTEMLVDCLVFNLHWDPCYVRKMLLPMLS 1300
+FPN +II+LYLS + +D + S+ W PD E+LVD L F +W+P Y+R+ +LPMLS
Sbjct: 357 NFPNEEIIKLYLSGNNLDDENGVLSLKWNKPDVEVLVDFLSFKQNWEPAYIRQRMLPMLS 416
Query: 1301 TIYLREKARSSNTGNPLLCDQYEFHSIKCMKTRYGHKSFVIRWRK 1435
TIYLRE A S+ L DQYEFHSI+ +K RYGH ++++W++
Sbjct: 417 TIYLREMA--SSPSKSFLYDQYEFHSIQRIKIRYGHPYYLVKWKR 459
>gi|75288736|sp|Q64MA3.1|GENL1_ORYSJ RecName: Full=Flap endonuclease GEN-like 1;
Short=OsGEN-L; Short=Protein OsGEN-like; AltName: Full=OsRAD
Length = 629
Score = 502 bits (1292), Expect = 1e-139
Identities = 256/503 (50%), Positives = 343/503 (68%), Gaps = 30/503 (5%)
Frame = +2
Query: 86 MGVGGKFWDLVRPYGRNEGSDYLREKRVAVDLSFWIIQHETAVKG---FALKPHLRLTFF 256
MGVGG FWDL++PY R+EG+ YLR +RVAVDLSFW++ H A++ A PHLR FF
Sbjct: 1 MGVGGSFWDLLKPYARHEGAGYLRGRRVAVDLSFWVVSHSAAIRARSPHARLPHLRTLFF 60
Query: 257 RTINLFSKFGAYPVFVVDGTPSPLKSHTRISRFYRSSGIDTSSL--QEGVVSVE-----R 415
RT++LFSK GA+PVFVVDG PSPLKS R +RF+R SG+D ++L E S + R
Sbjct: 61 RTLSLFSKMGAFPVFVVDGQPSPLKSQVRAARFFRGSGMDLAALPSTEAEASADALVQPR 120
Query: 416 NKQFCEWVTECVELLKLLGIPVLKANGEAEALCAQLNSHGFVDACITPDSDAFLFGASSV 595
N +F +V +CVELL+ LG+PVL+A GE EALCAQLN+ G VDACIT DSDAFLFGA +V
Sbjct: 121 NAKFTRYVEDCVELLEYLGMPVLRAKGEGEALCAQLNNQGHVDACITSDSDAFLFGAKTV 180
Query: 596 IKSIKPNSTEPFECYHMSDIEAGLGLKRRHLIAISLLVGNDFDSGGVSGIGLDKALRIVR 775
IK ++ N EPFECY+M+DIE+GLGLKR+ ++A++LLVG+D D GV G G + ALR V+
Sbjct: 181 IKVLRSNCKEPFECYNMADIESGLGLKRKQMVAMALLVGSDHDLHGVPGFGPETALRFVQ 240
Query: 776 AFSEDDILQRLEDIGKGFKPAVSGGTKSVDVDDDDDGVESSSQMKRRLPHCSRCGHPGSK 955
F ED++L +L +IGKG P + ++ DD + S + R PHCS CGHPG+K
Sbjct: 241 LFDEDNVLAKLYEIGKGVYPFIGVSAPNI---DDLPSPSTKSLPRARSPHCSHCGHPGNK 297
Query: 956 RSHFKSSCEHCSSDS--GCIKKPLEFTCECSFCTKDRELKEQKKTENWWIKVCDRIALGP 1129
++H K C C DS C++KP F CEC C K R+LK Q++ ENW IKVC RIA
Sbjct: 298 KNHIKDGCNFCLVDSLENCVEKPAGFICECPSCDKARDLKVQRRNENWQIKVCKRIAAET 357
Query: 1130 DFPNRKIIQLYLSDSFTEDGSS---MSWGTPDTEMLVDCLVFNLHWDPCYVRKMLLPMLS 1300
+FPN +II LYL+D ++ + ++W PD E+LVD L F +W+P Y+R+ +LPMLS
Sbjct: 358 NFPNEEIINLYLNDDNLDNENGVPLLTWNKPDMEILVDFLSFKQNWEPAYIRQRMLPMLS 417
Query: 1301 TIYLREKARSSNTGNPLLCDQYEFHSIKCMKTRYGHKSFVIRWRKPRST--SGLTPEK-- 1468
TIYLRE A SS + + LL DQY+FHSI+ +K RYGH ++++W++ + S P K
Sbjct: 418 TIYLREMA-SSQSKSFLLYDQYKFHSIQRIKIRYGHPYYLVKWKRVTRSMISNDPPSKQT 476
Query: 1469 -------PIVVWEDEEEVVEEEE 1516
+ V + ++EVV+EEE
Sbjct: 477 ELEGKNDKVEVLDGDDEVVDEEE 499
>gi|125564417|gb|EAZ09797.1| hypothetical protein OsI_32084 [Oryza sativa Indica
Group]
Length = 630
Score = 499 bits (1284), Expect = 8e-139
Identities = 249/465 (53%), Positives = 327/465 (70%), Gaps = 18/465 (3%)
Frame = +2
Query: 86 MGVGGKFWDLVRPYGRNEGSDYLREKRVAVDLSFWIIQHETAVKG---FALKPHLRLTFF 256
MGVGG FWDL++PY R+EG+ YLR +RVAVDLSFW+I H A++ A PHLR FF
Sbjct: 1 MGVGGSFWDLLKPYARHEGAGYLRGRRVAVDLSFWVISHSAAIRARSPHARLPHLRTLFF 60
Query: 257 RTINLFSKFGAYPVFVVDGTPSPLKSHTRISRFYRSSGIDTSSL--QEGVVSVE-----R 415
RT++LFSK GA+PVFVVDG PSPLKS R +RF+R SG+D ++L E S + R
Sbjct: 61 RTLSLFSKMGAFPVFVVDGQPSPLKSQVRAARFFRGSGMDLAALPSTEAEASADAPVQPR 120
Query: 416 NKQFCEWVTECVELLKLLGIPVLKANGEAEALCAQLNSHGFVDACITPDSDAFLFGASSV 595
N +F +V +CVELL+ LG+PVL+A GE EALCAQLN+ G VDACIT DSDAFLFGA +V
Sbjct: 121 NAKFTRYVEDCVELLEYLGMPVLRAKGEGEALCAQLNNQGHVDACITSDSDAFLFGAKTV 180
Query: 596 IKSIKPNSTEPFECYHMSDIEAGLGLKRRHLIAISLLVGNDFDSGGVSGIGLDKALRIVR 775
IK ++ N EPFECY+M+DIE+GLGLKR+ ++A++LLVG+D D GV G G + ALR V+
Sbjct: 181 IKVLRSNCKEPFECYNMADIESGLGLKRKQMVAMALLVGSDHDLHGVPGFGPETALRFVQ 240
Query: 776 AFSEDDILQRLEDIGKGFKPAVSGGTKSVDVDDDDDGVESSSQMKRRLPHCSRCGHPGSK 955
F ED++L +L +IGKG P + G S DD + S + R PHCS CGHPG+K
Sbjct: 241 LFDEDNVLAKLYEIGKGVYPFIEG--VSAPNIDDLPSPSTKSLPRARSPHCSHCGHPGNK 298
Query: 956 RSHFKSSCEHCSSDS--GCIKKPLEFTCECSFCTKDRELKEQKKTENWWIKVCDRIALGP 1129
++H K C C DS C++KP F CEC C K R++K Q++ ENW IKVC RIA
Sbjct: 299 KNHIKDGCNFCLVDSLENCVEKPAGFICECPSCDKARDMKVQRRNENWQIKVCKRIAAET 358
Query: 1130 DFPNRKIIQLYLSDSFTEDGSS---MSWGTPDTEMLVDCLVFNLHWDPCYVRKMLLPMLS 1300
+FPN +II LYLSD ++ + ++W PD E+LVD L F +W+P Y+R+ +LPMLS
Sbjct: 359 NFPNEEIINLYLSDDNLDNENGVPLLTWNKPDMEILVDFLSFKQNWEPAYIRQRMLPMLS 418
Query: 1301 TIYLREKARSSNTGNPLLCDQYEFHSIKCMKTRYGHKSFVIRWRK 1435
TIYLRE A SS + + LL DQY+FHSI+ +K RYGH ++++W++
Sbjct: 419 TIYLREMA-SSQSKSFLLYDQYKFHSIQRIKIRYGHPYYLVKWKR 462
>gi|124360865|gb|ABN08837.1| Helix-hairpin-helix motif, class 2 [Medicago
truncatula]
Length = 547
Score = 492 bits (1264), Expect = 2e-136
Identities = 260/477 (54%), Positives = 335/477 (70%), Gaps = 21/477 (4%)
Frame = +2
Query: 281 FGAYPVFVVDGTPSPLKSHTRISRFYRSSGIDTSSL---QEGVVSVERNKQFCEWVTECV 451
FGA+PVFVVDGTPSPLKS RI+RF+RSSGI+++SL +EG VS RN F V ECV
Sbjct: 1 FGAFPVFVVDGTPSPLKSQARIARFFRSSGIESTSLPVAEEG-VSAGRNSTFSRCVQECV 59
Query: 452 ELLKLLGIPVLKANGEAEALCAQLNSHGFVDACITPDSDAFLFGASSVIKSIKPNSTEPF 631
EL KLLGIPVLKA GEAEALCAQLNS G VDACITPDSDAFLFGA +IKS PNS EPF
Sbjct: 60 ELAKLLGIPVLKAKGEAEALCAQLNSEGHVDACITPDSDAFLFGAKCIIKSFSPNSKEPF 119
Query: 632 ECYHMSDIEAGLGLKRRHLIAISLLVGNDFDSGGVSGIGLDKALRIVRAFSEDDILQRLE 811
ECY+MSDIEAGLGLKR+HLIAISLLVGND D GV GIG+D ALR V+AF EDDIL RL
Sbjct: 120 ECYNMSDIEAGLGLKRKHLIAISLLVGNDHDLSGVQGIGIDSALRFVQAFGEDDILNRLH 179
Query: 812 DIGKGFKPAVSGGTKSVDVDDDDDGVESSSQMKRRLPHCSRCGHPGSKRSHFKSSCEHCS 991
+IGKG V K+ + D D ++ Q HCS CGHPG+KR H K SCE C
Sbjct: 180 EIGKGNAFQVPIDIKAEENMDIDGNSPNTKQ-----THCSFCGHPGNKRDHMKFSCEFCV 234
Query: 992 SD--SGCIKKPLEFTCECSFCTKDRELKEQKKTENWWIKVCDRIALGPDFPNRKIIQLYL 1165
+D GC+KKP F C+C+ C +R+ KEQKK ENW K+CD+IA P+FP +II +YL
Sbjct: 235 ADDNEGCLKKPEGFKCDCNSCCMNRKHKEQKKMENWHTKICDKIAKEPNFPKDEIIDMYL 294
Query: 1166 SDS----FTEDGSSMSWGTPDTEMLVDCLVFNLHWDPCYVRKMLLPMLSTIYLREKARSS 1333
+ DG +SW P+ ++LVD L F+ +WDP Y+R+++ PM+STI+LRE A +
Sbjct: 295 CNDNGYFSANDGPQISWERPNMDLLVDFLNFHQNWDPSYIRRIMFPMMSTIFLREMATTP 354
Query: 1334 NTGNPLLCDQYEFHSIKCMKTRYGHKSFVIRWRKPRSTSGLTPEKPIVVWEDEEEVVE-- 1507
+ LL Q+EF S+K +KTRYG++ +V++W+ R+ + + P +E+V+E
Sbjct: 355 T--DSLLFGQFEFASLKRVKTRYGYQFYVVKWK--RAMGNIASKTPANKSGMQEDVIELD 410
Query: 1508 EEECVGLLDGLNEPQVQDDNGECFLLTDECVGLVQSAFPEETEHFLKEKKLRESKKK 1678
+E V LLD + PQ+ +++G FLLTDE + LV +A+PEE + F +E++L++ K+K
Sbjct: 411 VDETVDLLDDCDFPQICEEDGCSFLLTDENMDLVGAAYPEEVKRFRQEQELKDVKRK 467
>gi|293334819|ref|NP_001169528.1| hypothetical protein LOC100383402 [Zea mays]
Length = 638
Score = 382 bits (979), Expect = 2e-103
Identities = 194/371 (52%), Positives = 252/371 (67%), Gaps = 12/371 (3%)
Frame = +2
Query: 413 RNKQFCEWVTECVELLKLLGIPVLKANGEAEALCAQLNSHGFVDACITPDSDAFLFGASS 592
RN F V ECVELL+ LG+PVL+A GEAEALCAQLN+ G V ACIT DSDAFLFGA +
Sbjct: 121 RNAAFTRCVEECVELLEYLGMPVLRAKGEAEALCAQLNNEGHVGACITADSDAFLFGAKT 180
Query: 593 VIKSIKPNSTEPFECYHMSDIEAGLGLKRRHLIAISLLVGNDFDSGGVSGIGLDKALRIV 772
V+K ++ N EPFECYH++DIE+GLGLKR+ L+A++LL+G+D D GV G GL+ ALR V
Sbjct: 181 VVKVLRSNCKEPFECYHIADIESGLGLKRKQLVAMALLIGSDHDLHGVPGFGLETALRFV 240
Query: 773 RAFSEDDILQRLEDIGKGFKPAVSGGTKSVDVDDDDDGVESSSQMKRRLPHCSRCGHPGS 952
+ F ED+IL +L +IGKG P + G D DD SS + + PHCS CGHPGS
Sbjct: 241 QLFDEDEILDKLHEIGKGVYPFLKG----FDNPHIDDLPSSSKKSPIKSPHCSHCGHPGS 296
Query: 953 KRSHFKSSCEHCSSDS--GCIKKPLEFTCECSFCTKDRELKEQKKTENWWIKVCDRIALG 1126
K++H K C +C DS C+++P F CEC C + R+L EQ++ ENW IKVC RIA
Sbjct: 297 KKNHIKDGCNYCLVDSLENCVERPAGFKCECPSCDEARDLNEQRRHENWQIKVCKRIAAE 356
Query: 1127 PDFPNRKIIQLYLSDS--FTEDG-SSMSWGTPDTEMLVDCLVFNLHWDPCYVRKMLLPML 1297
+FPN +II+LYLSD+ E G +SW PD E LVD L + +W+P Y+R+ +LPML
Sbjct: 357 TNFPNEEIIKLYLSDNNLVEEKGVPLLSWSKPDVEALVDLLSYKQNWEPSYIRQRMLPML 416
Query: 1298 STIYLREKARSSNTGNPLLCDQYEFHSIKCMKTRYGHKSFVIRWRKPR--STSGLTPEKP 1471
STIYLRE A SS+T P LCDQYEF SI+ K R+GH ++++W++ S + +KP
Sbjct: 417 STIYLREVASSSSTPLP-LCDQYEFDSIERTKIRHGHPYYLVKWKRATRGMNSNMPSKKP 475
Query: 1472 IVVWEDEEEVV 1504
+ E EVV
Sbjct: 476 VTEGETSSEVV 486
>gi|225453885|ref|XP_002273159.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 667
Score = 374 bits (958), Expect = 5e-101
Identities = 196/308 (63%), Positives = 225/308 (73%), Gaps = 8/308 (2%)
Frame = +2
Query: 86 MGVGGKFWDLVRPYGRNEGSDYLREKRVAVDLSFWIIQHETAVKGFALKPHLRLTFFRTI 265
MGVGG FW+L++PY R EG DY+R KRVAVDLSFWI+Q ETA K PHLRLTFFRTI
Sbjct: 1 MGVGGSFWELLKPYARPEGFDYIRNKRVAVDLSFWIVQQETATKANVRNPHLRLTFFRTI 60
Query: 266 NLFSKFGAYPVFVVDGTPSPLKSHTRISRFYRSSGIDTSSL---QEGVVSVERNKQFCEW 436
NLFSKFGA+PVFVVDGTPSPLKS RI+RF+R SGID S L +EG VSVERN +F
Sbjct: 61 NLFSKFGAFPVFVVDGTPSPLKSQARIARFFRGSGIDLSGLPVVEEG-VSVERNAEFSRR 119
Query: 437 VTECVELLKLLGIPVLKANGEAEALCAQLNSHGFVDACITPDSDAFLFGASSVIKSIKPN 616
V ECVELL+LLGIPVLKA EAEALCAQLNS G VDACIT DSDAFLFGA VIK ++PN
Sbjct: 120 VQECVELLELLGIPVLKAREEAEALCAQLNSEGHVDACITADSDAFLFGAKCVIKCLRPN 179
Query: 617 STEPFECYHMSDIEAGLGLKRRHLIAISLLVGNDFDSGGVSGIGLDKALRIVRAFSEDDI 796
EP ECYHMSDIE+GLGLKR+HLIAISLLVGND+D GV GIGLD A+R V+ FSED+I
Sbjct: 180 CKEPLECYHMSDIESGLGLKRKHLIAISLLVGNDYDLNGVQGIGLDTAVRFVQGFSEDEI 239
Query: 797 LQRLEDIGKGFKPAVSGGTKSVDVDDDDDGVESSSQMKRRLPHCSRCGHPGSKRSHFKSS 976
L RL++ G G G KS+ DD + ++PHCS G +
Sbjct: 240 LNRLQEKGNG-ATVFDGAVKSM---DDSIPCLDEKSPRPKVPHCSTFPRKGCLEKPEGFA 295
Query: 977 CEHCSSDS 1000
C+ +SD+
Sbjct: 296 CDCSTSDA 303
>gi|63098616|gb|AAY32559.1| single strand DNA repair-like protein [Triticum
monococcum]
Length = 646
Score = 354 bits (908), Expect = 3e-095
Identities = 182/354 (51%), Positives = 241/354 (68%), Gaps = 11/354 (3%)
Frame = +2
Query: 413 RNKQFCEWVTECVELLKLLGIPVLKANGEAEALCAQLNSHGFVDACITPDSDAFLFGASS 592
RN F V +CVELLK LG+PVL A GEAEALCAQLN+ G VDACIT DSDAFLFGA +
Sbjct: 122 RNAIFTRCVKDCVELLKNLGMPVLWAKGEAEALCAQLNNEGEVDACITSDSDAFLFGAKT 181
Query: 593 VIKSIKPNSTEPFECYHMSDIEAGLGLKRRHLIAISLLVGNDFDSGGVSGIGLDKALRIV 772
VIK ++ N EPFECY++ DIE+G+GLKR+ ++A++LL+G+D D GV G G++ ALR V
Sbjct: 182 VIKVMRSNCKEPFECYNIVDIESGIGLKRKQMVAMALLIGSDHDLHGVPGFGVETALRFV 241
Query: 773 RAFSEDDILQRLEDIGKGFKPAVSGGTKSVDVDDDDDGVESSSQMKRRLPHCSRCGHPGS 952
R F ED IL +L +IGKG P + G K+ D + S R PHCS CGHPGS
Sbjct: 242 RLFDEDQILDKLHEIGKGIYPFLEGFDKA--HVGDLPSPSTKSPPVARSPHCSHCGHPGS 299
Query: 953 KRSHFKSSCEHCSSDS--GCIKKPLEFTCECSFCTKDRELKEQKKTENWWIKVCDRIALG 1126
K++H K+ C +C DS C++KP F CEC C K R+LK Q++ ENW IKVC R+A
Sbjct: 300 KKNHSKTGCNYCLVDSLEFCMEKPAGFICECPSCEKARDLKAQRRHENWQIKVCKRLAAE 359
Query: 1127 PDFPNRKIIQLYLSDSFTEDGSS----MSWGTPDTEMLVDCLVFNLHWDPCYVRKMLLPM 1294
+FPN +II+LYL D ++ S + W P + LVD L + +W+P YVR+ +LPM
Sbjct: 360 TNFPNEEIIRLYLCDDNLDNKESGDRKLEWTEPKVDDLVDLLTYMQNWEPSYVRQHMLPM 419
Query: 1295 LSTIYLREKARSSNTGNPLLCDQYEFHSIKCMKTRYGHKSFVIRWRKPRSTSGL 1456
LSTIYLR A SS + LLCDQYEFHSI+ +K ++G+ ++++W+ R+T G+
Sbjct: 420 LSTIYLRRMA-SSPCKSLLLCDQYEFHSIQRIKIKHGYPYYLVKWK--RATGGI 470
>gi|242045348|ref|XP_002460545.1| hypothetical protein SORBIDRAFT_02g030290
[Sorghum bicolor]
Length = 590
Score = 292 bits (747), Expect = 2e-076
Identities = 146/282 (51%), Positives = 189/282 (67%), Gaps = 5/282 (1%)
Frame = +2
Query: 374 DTSSLQEGVVSVERNKQFCEWVTECVELLKLLGIPVLKANGEAEALCAQLNSHGFVDACI 553
+T S RN F V ECVELL+ LG+PVL+A GEAEALCAQLN+ G VDACI
Sbjct: 108 ETESSAAAAPVKRRNAAFTRCVEECVELLEYLGMPVLRAKGEAEALCAQLNNEGHVDACI 167
Query: 554 TPDSDAFLFGASSVIKSIKPNSTEPFECYHMSDIEAGLGLKRRHLIAISLLVGNDFDSGG 733
T DSDAFLFGA +V+K + N EPFECYH++DIE+GLGLKR+ ++A++LL+G+D D G
Sbjct: 168 TADSDAFLFGAKTVVKVFRSNCKEPFECYHIADIESGLGLKRKQMVAMALLIGSDHDLHG 227
Query: 734 VSGIGLDKALRIVRAFSEDDILQRLEDIGKGFKPAVSGGTKSVDVDDDDDGVESSSQMKR 913
V G GL+ ALR V+ F ED+IL +L +IG+G P + G + DD S+
Sbjct: 228 VPGFGLETALRFVQLFDEDEILDKLHEIGRGVYPFLEGFD---NAHIDDLPSSSTKSPVA 284
Query: 914 RLPHCSRCGHPGSKRSHFKSSCEHCSSDS--GCIKKPLEFTCECSFCTKDRELKEQKKTE 1087
+ PHCS CGHPGSK++H K C +C DS C++KP F CEC C + R+LKEQ++ E
Sbjct: 285 KSPHCSHCGHPGSKKNHSKDGCNYCLVDSLENCVEKPAGFKCECPSCDEARDLKEQRRHE 344
Query: 1088 NWWIKVCDRIALGPDFPNRKIIQLYLSDSFTEDGSSMSWGTP 1213
NW IKVC RIA +FPN +II+LYLSD+ + + S TP
Sbjct: 345 NWQIKVCKRIAAETNFPNEEIIKLYLSDNNLVEEVASSPSTP 386
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,587,916,767,856
Number of Sequences: 15229318
Number of Extensions: 4587916767856
Number of Successful Extensions: 1071853064
Number of sequences better than 0.0: 0
|