BLASTX 7.6.2
Query= UN34482 /QuerySize=1331
(1330 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|62321784|dbj|BAD95408.1| hypothetical protein [Arabidopsis th... 325 1e-086
gi|9755374|gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana] 309 7e-082
gi|3461840|gb|AAC33226.1| putative non-LTR retroelement reverse ... 308 2e-081
gi|12321684|gb|AAG50886.1|AC025294_24 hypothetical protein [Arab... 302 1e-079
gi|3738337|gb|AAC63678.1| putative non-LTR retroelement reverse ... 301 2e-079
gi|4539462|emb|CAB39942.1| putative protein [Arabidopsis thaliana] 295 1e-077
gi|12321503|gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsi... 218 2e-054
gi|21952510|gb|AAM82604.1|AF525305_2 putative AP endonuclease/re... 216 1e-053
gi|8843742|dbj|BAA97290.1| non-LTR retroelement reverse transcri... 210 4e-052
gi|110739557|dbj|BAF01687.1| hypothetical protein [Arabidopsis t... 209 1e-051
gi|110740597|dbj|BAE98403.1| putative non-LTR reverse transcript... 190 5e-046
gi|297795303|ref|XP_002865536.1| hypothetical protein ARALYDRAFT... 188 2e-045
gi|9279655|dbj|BAB01155.1| unnamed protein product [Arabidopsis ... 185 2e-044
gi|9757833|dbj|BAB08270.1| non-LTR retroelement reverse transcri... 159 1e-036
gi|5281029|emb|CAB45965.1| putative reverse transcriptase [Arabi... 159 1e-036
gi|158828216|gb|ABW81094.1| RT6non-ltr [Cleome spinosa] 159 1e-036
gi|9758853|dbj|BAB09379.1| non-LTR retroelement reverse transcri... 152 2e-034
gi|3047086|gb|AAC13599.1| similar to reverse transcriptase (Pfam... 142 2e-031
gi|8778669|gb|AAF79677.1|AC022314_18 F9C16.26 [Arabidopsis thali... 137 6e-030
gi|297819234|ref|XP_002877500.1| predicted protein [Arabidopsis ... 132 1e-028
>gi|62321784|dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana]
Length = 478
Score = 325 bits (832), Expect = 1e-086
Identities = 164/381 (43%), Positives = 230/381 (60%), Gaps = 16/381 (4%)
Frame = -2
Query: 1236 LWSGSPTQTHKAKAAWEDLCCPKDEGGLGIRKLHDSSKVFALSLTWRIFSSSASLWVSWI 1057
LWSG T KAK AW D+C PKDEGGLGIR L +++KV L L WR+ SS+ SLWV W+
Sbjct: 103 LWSGPELNTKKAKVAWSDVCTPKDEGGLGIRSLKEANKVSLLKLIWRMLSST-SLWVQWL 161
Query: 1056 QQYLLRQNSFWDVREDWK-GSWIWRKLLKLRSVAYQFIRFEVNDGHIAFFWVDDWLQVGK 880
+ YLLR+ SFW + + GSW+W+K+LK R++A F++ ++++G FW D+W ++G+
Sbjct: 162 RLYLLRKGSFWSISGNTTLGSWMWKKILKHRALASGFVKHDIHNGSNTSFWFDNWSKIGR 221
Query: 879 LLDITGAVGTYYLGVPRNARVSDAVTQAHWSIRGHR-NRHFHDLYARIRN--ERVPHD-- 715
L+D+TG G +G+ +A V++AV HR RH HD RI + V H
Sbjct: 222 LIDVTGHRGCIDMGITLHASVAEAVV-------NHRPRRHRHDTLLRIEDVIAEVRHQGL 274
Query: 714 EYGSDLVLWKYSEDNYKPHFSSSRTYDQIRLRRSRVGWSKSVWFSQEVPRYSFIVWLAVK 535
G D V WK + D +KP F++ T+ R + +V W K VWFS P+YS + W+A+K
Sbjct: 275 TSGEDTVRWKGNGDIFKPCFNTKETWAATREPKLKVNWYKGVWFSHATPKYSVLAWIAIK 334
Query: 534 NRLSTGDRMRAW--GIQQSCVMCGEPDETRDHIFFACPYTFTVWDTLAGRLSGGRSDPDW 361
NRL+TGDRM +W G SCV+C ETRDH+FF CPY+ VW TL +L W
Sbjct: 335 NRLTTGDRMLSWNAGADSSCVLCHHLVETRDHLFFTCPYSAEVWSTLTRKLLSQHFTNRW 394
Query: 360 DTTLQFITNNDLESIDKILIKMVFQTCIYYMWKERNERRHQQGFRTVDQAVRVIDKAIQN 181
+ L+ +TN L L + FQ ++ +WKERN RRH + + Q VR +DK ++N
Sbjct: 395 EAILKLLTNKSLGHEVPFLTRYTFQLTLHSLWKERNGRRHGEVPQAAAQMVRFLDKQVRN 454
Query: 180 RISSLQYKADHKLAGLMRRWF 118
RISS+Q + D + G M WF
Sbjct: 455 RISSIQSQEDRRYNGCMTCWF 475
>gi|9755374|gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]
Length = 872
Score = 309 bits (791), Expect = 7e-082
Identities = 151/377 (40%), Positives = 216/377 (57%), Gaps = 8/377 (2%)
Frame = -2
Query: 1236 LWSGSPTQTHKAKAAWEDLCCPKDEGGLGIRKLHDSSKVFALSLTWRIFSSSASLWVSWI 1057
LWSGS +HKAK +W+ +C PK EGGLG+R L +++ V L L WRI S+S SLW W+
Sbjct: 494 LWSGSEMSSHKAKISWDIVCKPKAEGGLGLRNLKEANDVSCLKLVWRIISNSNSLWTKWV 553
Query: 1056 QQYLLRQNSFWDVREDWK-GSWIWRKLLKLRSVAYQFIRFEVNDGHIAFFWVDDWLQVGK 880
+YL+R+ S W +++ GSWIWRK+LK+R VA F R EV +G A FW D W G+
Sbjct: 554 AEYLIRKKSIWSLKQSTSMGSWIWRKILKIRDVAKSFSRVEVGNGESASFWYDHWSAHGR 613
Query: 879 LLDITGAVGTYYLGVPRNARVSDAVTQAHWSIRGHRNRHFHDLYARIRNERVPHDEYGSD 700
L+D G GT LG+PR A V+DA T+ S R HR +++ + +R+ H + D
Sbjct: 614 LIDTVGDKGTIDLGIPREASVADAWTRR--SRRRHRTSLLNEIEEMMAYQRIHHSD-AED 670
Query: 699 LVLWKYSEDNYKPHFSSSRTYDQIRLRRSRVGWSKSVWFSQEVPRYSFIVWLAVKNRLST 520
VLW+ D +KPHFS+ T+ I+ S V W K VWF P+Y+ WLA+ NRL T
Sbjct: 671 TVLWRGKNDVFKPHFSTRDTWHLIKATSSTVSWHKGVWFRHATPKYALCTWLAIHNRLPT 730
Query: 519 GDRMRAW----GIQQSCVMCGEPDETRDHIFFACPYTFTVWDTLAGRLSGGRSDPDWDTT 352
GDRM W + +CV+C +T +H+FF+C Y TVW LA + R W
Sbjct: 731 GDRMLKWNSSGSVSGNCVLCTNNSKTLEHLFFSCSYASTVWAALAKGIWKTRYSTRWSHL 790
Query: 351 LQFITNNDLESIDKILIKMVFQTCIYYMWKERNERRHQQGFRTVDQAVRVIDKAIQNRIS 172
L I+ + + ++ L + +FQ IY++W+ERN RRH T + IDK +N+I+
Sbjct: 791 LTHISTHFQDRVEGFLTRYIFQATIYHVWRERNGRRHDAAPNTPATVIGWIDKQTRNQIT 850
Query: 171 SLQYKADHKLAGLMRRW 121
++ D + + W
Sbjct: 851 IIRQSGDRRYDKAFQAW 867
>gi|3461840|gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase
[Arabidopsis thaliana]
Length = 1529
Score = 308 bits (787), Expect = 2e-081
Identities = 147/378 (38%), Positives = 220/378 (58%), Gaps = 5/378 (1%)
Frame = -2
Query: 1236 LWSGSPTQTHKAKAAWEDLCCPKDEGGLGIRKLHDSSKVFALSLTWRIFSSSASLWVSWI 1057
LWSG KAK AW +C PK EGGLGI+ L +++KV L L WR+ S+ SLWV+WI
Sbjct: 1144 LWSGPVLNPKKAKIAWSSICQPKKEGGLGIKSLAEANKVSCLKLIWRLLSTQPSLWVTWI 1203
Query: 1056 QQYLLRQNSFWDVRE-DWKGSWIWRKLLKLRSVAYQFIRFEVNDGHIAFFWVDDWLQVGK 880
+++R+ +FW E GSW+W+KLLK R +A + EV +G FW D W +G+
Sbjct: 1204 WTFIIRKGTFWSANERSSLGSWMWKKLLKYRELAKSMHKVEVRNGSSTSFWYDHWSHLGR 1263
Query: 879 LLDITGAVGTYYLGVPRNARVSDAVTQAHWSIRGHRNRHFHDLYARIRNERVPHDEYGSD 700
LLDITG LG+P + + V + H R HR ++ + A I+ + E G D
Sbjct: 1264 LLDITGTRRVIDLGIPLETNL-ETVLRTH-QHRQHRAAIYNRINAEIQRLQQQEREAGPD 1321
Query: 699 LVLWKYSEDNYKPHFSSSRTYDQIRLRRSRVGWSKSVWFSQEVPRYSFIVWLAVKNRLST 520
+ LW+ ++++ F + T++ +R + + W K VWF P+YSF++WL V+NRLST
Sbjct: 1322 ISLWRSLKNDFNKRFITKVTWNNVRTHQPQQNWYKGVWFPYSTPKYSFLLWLTVQNRLST 1381
Query: 519 GDRMRAWGIQQ--SCVMCGEPDETRDHIFFACPYTFTVWDTLAGRLSGGRSDPDWDTTLQ 346
GDR++AW Q +C +C +ETRDH+FF+C YT VW+ L RL DW+
Sbjct: 1382 GDRIKAWNSGQLVTCTLCNNAEETRDHLFFSCQYTSYVWEALTQRLLSTNYSRDWNRLFT 1441
Query: 345 FITNNDLESIDKILIKMVFQTCIYYMWKERNERRHQQGFRTVDQAVRVIDKAIQNRISSL 166
+ ++L L + VFQ IY++W+ERN RRH + ++ +++IDK ++NRISS+
Sbjct: 1442 LLCTSNLPRDHLFLFRYVFQASIYHIWRERNARRHGEISSPTNRLIKLIDKTVRNRISSI 1501
Query: 165 QYKADHKLAGLMRRWFEV 112
+ DH M+ WF +
Sbjct: 1502 RDTGDHNYNDCMQLWFSM 1519
>gi|12321684|gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis
thaliana]
Length = 629
Score = 302 bits (771), Expect = 1e-079
Identities = 147/379 (38%), Positives = 213/379 (56%), Gaps = 11/379 (2%)
Frame = -2
Query: 1236 LWSGSPTQTHKAKAAWEDLCCPKDEGGLGIRKLHDSSKVFALSLTWRIFSSSASLWVSWI 1057
LWSG KAK +W+D+C PK EGGLG+R L +++ V L L WR+ S+ SLWV W
Sbjct: 252 LWSGPELHRRKAKVSWDDICKPKQEGGLGLRSLTEANVVSVLKLIWRVTSNDDSLWVKWS 311
Query: 1056 QQYLLRQNSFWDVREDWK-GSWIWRKLLKLRSVAYQFIRFEVNDGHIAFFWVDDWLQVGK 880
+ LL+Q SFW + + GSW+W+K+LK R A F R EVN+G FW D+W +G
Sbjct: 312 KMNLLKQESFWSLTPNSSLGSWMWKKMLKYRETAKPFSRVEVNNGARTSFWFDNWSGMGH 371
Query: 879 LLDITGAVGTYYLGVPRNARVSDAVTQAHWS---IRGHRNRHFHDLYARIRNERVPHDEY 709
L+D+TG G LG+ RN V++A WS R HR +D+ A + + +
Sbjct: 372 LMDVTGQRGQIDLGISRNKTVAEA-----WSNRRRRKHRTEQLNDIEAALNQKYQTRNLL 426
Query: 708 GSDLVLWKYSEDNYKPHFSSSRTYDQIRLRRSRVGWSKSVWFSQEVPRYSFIVWLAVKNR 529
D LW+ D +K FS+ T++Q+R + + V W K VWFS P+Y F WLA++NR
Sbjct: 427 REDATLWRGKGDVFKTSFSTKDTWNQVRKKSNEVAWYKGVWFSHSTPKYQFCTWLALRNR 486
Query: 528 LSTGDRMRAW--GIQQSCVMCGEPDETRDHIFFACPYTFTVWDTLAGRLSGGRSDPDWDT 355
LSTG RM+ W G C C ETRDH+FF+C Y +W +A + R DW T
Sbjct: 487 LSTGYRMQLWNNGSDVKCTFCSTSIETRDHLFFSCSYASAIWTAIAKNVLQHRFSTDWQT 546
Query: 354 TLQFITNNDLESIDKILIKMVFQTCIYYMWKERNERRHQQGFRTVDQAVRVIDKAIQNRI 175
+ +I+ + I L + +FQ ++ +WKERN+RRH + RT + +DK I+N++
Sbjct: 547 IVNYISETQTDRIRSFLSRYIFQLTVHTVWKERNDRRHGEEPRTSANLISWMDKQIRNQL 606
Query: 174 SSLQYKADHKLAGLMRRWF 118
S + D + ++ WF
Sbjct: 607 SIIISTGDRRYENGLQVWF 625
>gi|3738337|gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase
[Arabidopsis thaliana]
Length = 1216
Score = 301 bits (770), Expect = 2e-079
Identities = 146/372 (39%), Positives = 220/372 (59%), Gaps = 13/372 (3%)
Frame = -2
Query: 1236 LWSGSPTQTHKAKAAWEDLCCPKDEGGLGIRKLHDSSKVFALSLTWRIFSSSASLWVSWI 1057
LWSG KAK +W+++C PK EGGLG++ L +++KV +L L WR+ S SLWV W
Sbjct: 568 LWSGPELNPKKAKVSWDEICKPKKEGGLGLQSLREANKVSSLKLIWRLLSCQDSLWVKWT 627
Query: 1056 QQYLLRQNSFWDV-REDWKGSWIWRKLLKLRSVAYQFIRFEVNDGHIAFFWVDDWLQVGK 880
+ LL++ SFW + GSWIWR+LLK R VA F + EVN+G FW D+W + G
Sbjct: 628 RMNLLKKESFWSIGTHSTLGSWIWRRLLKHREVAKSFCKIEVNNGVNTSFWFDNWSEKGP 687
Query: 879 LLDITGAVGTYYLGVPRNARVSDAVTQAHWSIRGHRNRHFHDL---YARIRNERVPHDEY 709
L+++TGA G +G+ R+ +++A WS R R RH ++ + I ++ H
Sbjct: 688 LINLTGARGAIDMGISRHMTLAEA-----WS-RRRRKRHRVEILNEFEEILLQKYQHRNI 741
Query: 708 G-SDLVLWKYSEDNYKPHFSSSRTYDQIRLRRSRVGWSKSVWFSQEVPRYSFIVWLAVKN 532
D +LW+ ED +K FS+ T++ IR ++ W K VWF+ P++SF WLA++N
Sbjct: 742 ELEDAILWRGKEDVFKARFSTKDTWNHIRTSSNQRAWHKGVWFAHATPKFSFCAWLAIRN 801
Query: 531 RLSTGDRMRAW--GIQQSCVMCGEPDETRDHIFFACPYTFTVWDTLAGRLSGGRSDPDWD 358
RLSTGDRM W G +CV C P ETRDH+FF C Y+ +W ++A + R W
Sbjct: 802 RLSTGDRMMTWNNGTPTTCVFCSSPMETRDHLFFQCCYSSEIWTSIAKNVYKDRFSTKWS 861
Query: 357 TTLQFITNNDLESIDKILIKMVFQTCIYYMWKERNERRHQQGFRTVDQAVRVIDKAIQNR 178
+ +I+++ + I L + FQ I+ +W+ERN RRH + R+ +R IDK I+N+
Sbjct: 862 AVVNYISDSQPDRIQSFLSRYTFQVSIHSIWRERNSRRHGEKSRSASNLIRQIDKTIRNQ 921
Query: 177 ISSLQYKADHKL 142
+S+++ K D +L
Sbjct: 922 LSTIKKKGDLRL 933
>gi|4539462|emb|CAB39942.1| putative protein [Arabidopsis thaliana]
Length = 473
Score = 295 bits (755), Expect = 1e-077
Identities = 145/381 (38%), Positives = 218/381 (57%), Gaps = 12/381 (3%)
Frame = -2
Query: 1239 HLWSGSPTQTHKAKAAWEDLCCPKDEGGLGIRKLHDSSKVFALSLTWRIFSSSASLWVSW 1060
+LWSG T KAK W +C PK+EGGLG+R L +++ V L L WRI S + SLWV W
Sbjct: 95 YLWSGGELNTSKAKITWAFVCKPKEEGGLGLRSLKEANDVCCLKLIWRIISHADSLWVKW 154
Query: 1059 IQQYLLRQNSFWDVREDWK-GSWIWRKLLKLRSVAYQFIRFEVNDGHIAFFWVDDWLQVG 883
IQ LL++ SFW VRE+ GSW+WRK+LK R +A + E+N+G FW DDW +G
Sbjct: 155 IQSSLLKKVSFWAVRENTSLGSWMWRKILKFRDIARTLCKVEINNGARTSFWYDDWSDLG 214
Query: 882 KLLDITGAVGTYYLGVPRNARVSDAVTQAHWSIRGHRNRHFHDLYARIRNERV---PHDE 712
+L+D G G LG+ ++A V +A W R R RH + R+ +
Sbjct: 215 RLIDSAGDRGAIDLGINKHATVVEA-----WGNR-RRRRHRTNFLNRVEERLILSWNSRN 268
Query: 711 YGSDLVLWKYSEDNYKPHFSSSRTYDQIRLRRSRVGWSKSVWFSQEVPRYSFIVWLAVKN 532
D LWK E+ ++ FS+ T++ IR ++V W K VWF+Q +P+++F +WLAV N
Sbjct: 269 QAEDRALWKGKENRFRSIFSTKDTWNHIRTVSNKVAWYKGVWFAQAIPKHAFCMWLAVHN 328
Query: 531 RLSTGDRMRAW--GIQQSCVMCGEPDETRDHIFFACPYTFTVWDTLAGRLSGGRSDPDWD 358
RLSTGDRM W G+ +C++C + E+RDH+FF+CP+ +W+ LA + DW
Sbjct: 329 RLSTGDRMTLWNMGVDATCILCNKALESRDHLFFSCPFATEIWEPLAKTIYNTCFYTDWQ 388
Query: 357 TTLQFITNNDLESIDKILIKMVFQTCIYYMWKERNERRHQQGFRTVDQAVRVIDKAIQNR 178
T + ++ N + I L + + Q IY +W+ERNER+H + + + IDK I+N
Sbjct: 389 TIINNVSRNWPDRIAGFLARCILQVTIYTLWRERNERKHGASPNSSSRLISWIDKHIRNH 448
Query: 177 ISSLQYKADHKLAGLMRRWFE 115
+ +++ D + + W +
Sbjct: 449 LMAIKQSGDRRFDRGFQVWLQ 469
>gi|12321503|gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
Length = 1213
Score = 218 bits (555), Expect = 2e-054
Identities = 118/360 (32%), Positives = 194/360 (53%), Gaps = 6/360 (1%)
Frame = -2
Query: 1236 LWSGSPTQTHKAKAAWEDLCCPKDEGGLGIRKLHDSSKVFALSLTWRIFSSSASLWVSWI 1057
LWSG+ Q K +W LC PK EGGLG+R+L + +K ++ L WR+F + SLW W
Sbjct: 840 LWSGNIEQAKGIKVSWAALCLPKSEGGLGLRRLLEWNKTLSMRLIWRLFVAKDSLWADWQ 899
Query: 1056 QQYLLRQNSFWDVREDWKGSWIWRKLLKLRSVAYQFIRFEVNDGHIAFFWVDDWLQVGKL 877
+ L + SFW V SW W++LL LR +A+QF+ +V +G A +W D+W +G L
Sbjct: 900 HLHHLSRGSFWAVEGGQSDSWTWKRLLSLRPLAHQFLVCKVGNGLKADYWYDNWTSLGPL 959
Query: 876 LDITGAVGTYYLGVPRNARVSDAVTQAHWSIRGHRNRHFHDLYARIRNERVPHDEYGSDL 697
I G +G L VP A+V+ A ++ W + R+ ++ + VP D+
Sbjct: 960 FRIIGDIGPSSLRVPLLAKVASAFSEDGWRLPVSRSAPAKGIHDHLCTVPVPSTAQ-EDV 1018
Query: 696 VLWKYSEDNYK-PHFSSSRTYDQIRLRRSRVGWSKSVWFSQEVPRYSFIVWLAVKNRLST 520
+++S + + FS+++T++ IR + + W+ S+WF VP+Y+F +W++ NRL T
Sbjct: 1019 DRYEWSVNGFLCQGFSAAKTWEAIRPKATVKSWASSIWFKGAVPKYAFNMWVSHLNRLLT 1078
Query: 519 GDRMRAWGIQQS--CVMCGEPDETRDHIFFACPYTFTVWDTLAGRL-SGGRSDPDWDTTL 349
R+ +WG QS CV+C E+RDH+ C ++ VW + R+ R W L
Sbjct: 1079 RQRLASWGHIQSDACVLCSFASESRDHLLLICEFSAQVWRLVFRRICPRQRLFSSWSELL 1138
Query: 348 QFITNNDLESIDKILIKMVFQTCIYYMWKERNERRHQQGFRTVDQAVRVIDKAIQNRISS 169
++ + E+ +L K+V Q +Y +W++RN H +++D+ I+N ISS
Sbjct: 1139 SWVRQSSPEA-PPLLRKIVSQVVVYNLWRQRNNLLHNSLRLAPAVIFKLVDREIRNIISS 1197
>gi|21952510|gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse
transcriptase [Brassica napus]
Length = 1214
Score = 216 bits (548), Expect = 1e-053
Identities = 116/377 (30%), Positives = 188/377 (49%), Gaps = 8/377 (2%)
Frame = -2
Query: 1236 LWSGSPTQTHKAKAAWEDLCCPKDEGGLGIRKLHDSSKVFALSLTWRIFSSSASLWVSWI 1057
LW T+ K +W++ C PK EGGLG+R +K L L W +F+ SLWV+W
Sbjct: 839 LWGNDITRRGDIKVSWQNSCLPKAEGGLGLRNFWTWNKTLNLRLIWMLFARRDSLWVAWN 898
Query: 1056 QQYLLRQNSFWDVREDWKGSWIWRKLLKLRSVAYQFIRFEVNDGHIAFFWVDDWLQVGKL 877
LR +FW+ SWIW+ +L LR +A +F+R V +G + +W D W +G L
Sbjct: 899 HANRLRHVNFWNAEAASHHSWIWKAILGLRPLAKRFLRGAVGNGQLLSYWYDHWSNLGPL 958
Query: 876 LDITGAVGTYYLGVPRNARVSDAVTQAHWSIRGHRNRH--FHDLYARIRNERVPHDEYGS 703
++ GA G G+ +A V++A + W + R R+ +L + + N P + G
Sbjct: 959 IEAIGASGPQLTGIHESAVVTEASSSTGWILPSARTRNASLANLRSTLLNSPAPSGDRGE 1018
Query: 702 DLVLWKYSEDNYKPHFSSSRTYDQIRLRRSRVGWSKSVWFSQEVPRYSFIVWLAVKNRLS 523
D W Y E + FSS T++ +R R + W+ +VW+ +P+Y+F W+A NRL
Sbjct: 1019 DTYTW-YIEGSSSTSFSSKLTWECLRQRDTTKLWAAAVWYKGCIPKYAFNFWVAHLNRLP 1077
Query: 522 TGDRMRAWGIQQS--CVMCGEPDETRDHIFFACPYTFTVWDTLAGRLSGGRSDPDWDTTL 349
R W + C +C ETRDH+F C +W + R + +W +
Sbjct: 1078 VRARTTHWSTNRPSLCCVCQRETETRDHLFIHCTLGSLIWQQVLARFGRSQMFREWKDII 1137
Query: 348 QFITNNDLESIDKILIKMVFQTCIYYMWKERNERRHQQGFRTVDQAVRVIDKAIQNRISS 169
+++ +N S L K+ QT I+++WKERN R H + + ID++I++ I +
Sbjct: 1138 EWMLSNQ-GSFSGTLKKLAVQTAIFHIWKERNSRLHSAMSASHTAIFKQIDRSIRDSILA 1196
Query: 168 LQYKADHKLAGLMRRWF 118
+ + K L+ +WF
Sbjct: 1197 RITRRNFK--DLLSQWF 1211
>gi|8843742|dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like
[Arabidopsis thaliana]
Length = 1072
Score = 210 bits (534), Expect = 4e-052
Identities = 118/363 (32%), Positives = 184/363 (50%), Gaps = 5/363 (1%)
Frame = -2
Query: 1236 LWSGSPTQTHKAKAAWEDLCCPKDEGGLGIRKLHDSSKVFALSLTWRIFSSSASLWVSWI 1057
LW+GS +K +W D C PK EGGLG R + +K L L W +F SLW W
Sbjct: 700 LWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQ 759
Query: 1056 QQYLLRQNSFWDVREDWKGSWIWRKLLKLRSVAYQFIRFEVNDGHIAFFWVDDWLQVGKL 877
+ + L SFW V W W+ LL LR +A +FI+ +V +G FW D W +G L
Sbjct: 760 RHHRLGHASFWQVNALQTDPWTWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGPL 819
Query: 876 LDITGAVGTYYLGVPRNARVSDAVTQAHWSIRGHRNRHFHDLYARIRNERVPHDEYGSDL 697
+ G VG+ L +P +A+V+DA+ + W + R+ + + + + P SD
Sbjct: 820 IKYLGDVGSRPLRIPFSAKVADAIDGSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDS 879
Query: 696 VLWKYSEDNYKPHFSSSRTYDQIRLRRSRVGWSKSVWFSQEVPRYSFIVWLAVKNRLSTG 517
W + + + FS+++T++ +R RR W+KSVWF VP+++F W A NRL T
Sbjct: 880 YSWCVDDVDCQ-GFSAAKTWEVLRPRRPVKRWAKSVWFKGAVPKHAFNFWTAQLNRLPTR 938
Query: 516 DRMRAWGIQQS--CVMCGEPDETRDHIFFACPYTFTVWDTLAGRL-SGGRSDPDWDTTLQ 346
R+ +WG+ S C +C ETRDH+ C ++ VW + RL R W L
Sbjct: 939 QRLVSWGLVSSAECCLCSFDTETRDHLLLLCDFSSQVWRMVFLRLCPRQRLLCTWAELLS 998
Query: 345 FITNNDLESIDKILIKMVFQTCIYYMWKERNERRHQQGFRTVDQAVRVIDKAIQNRISSL 166
+ T + +L K+V Q +Y +W++RN H + R++D+ ++N I S
Sbjct: 999 W-TRQSTAAAPSLLRKVVAQLVVYNLWRQRNLVLHSSLRVSCSVVFRLVDRELRNVILSR 1057
Query: 165 QYK 157
++K
Sbjct: 1058 RHK 1060
>gi|110739557|dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
Length = 1072
Score = 209 bits (531), Expect = 1e-051
Identities = 117/363 (32%), Positives = 184/363 (50%), Gaps = 5/363 (1%)
Frame = -2
Query: 1236 LWSGSPTQTHKAKAAWEDLCCPKDEGGLGIRKLHDSSKVFALSLTWRIFSSSASLWVSWI 1057
LW+GS +K +W D C PK EGGLG R + +K L L W +F SLW W
Sbjct: 700 LWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQ 759
Query: 1056 QQYLLRQNSFWDVREDWKGSWIWRKLLKLRSVAYQFIRFEVNDGHIAFFWVDDWLQVGKL 877
+ + L SFW V W W+ LL LR +A +FI+ +V +G FW D W +G L
Sbjct: 760 RHHRLGHASFWQVNALQTDPWTWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGPL 819
Query: 876 LDITGAVGTYYLGVPRNARVSDAVTQAHWSIRGHRNRHFHDLYARIRNERVPHDEYGSDL 697
+ G VG+ L +P +A+V+DA+ + W + R+ + + + + P SD
Sbjct: 820 IKYLGDVGSRPLRIPFSAKVADAIDGSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDS 879
Query: 696 VLWKYSEDNYKPHFSSSRTYDQIRLRRSRVGWSKSVWFSQEVPRYSFIVWLAVKNRLSTG 517
W + + + FS+++T++ +R RR W++SVWF VP+++F W A NRL T
Sbjct: 880 YSWCVDDVDCQ-GFSAAKTWEVLRPRRPVKRWARSVWFKGAVPKHAFNFWTAQLNRLPTR 938
Query: 516 DRMRAWGIQQS--CVMCGEPDETRDHIFFACPYTFTVWDTLAGRL-SGGRSDPDWDTTLQ 346
R+ +WG+ S C +C ETRDH+ C ++ VW + RL R W L
Sbjct: 939 QRLVSWGLVSSAECCLCSFDTETRDHLLLLCDFSSQVWRMVFLRLCPRQRLLCTWAELLS 998
Query: 345 FITNNDLESIDKILIKMVFQTCIYYMWKERNERRHQQGFRTVDQAVRVIDKAIQNRISSL 166
+ T + +L K+V Q +Y +W++RN H + R++D+ ++N I S
Sbjct: 999 W-TRQSTAAAPSLLRKVVAQLVVYNLWRQRNLVLHSSLRVSCSVVFRLVDRELRNVILSR 1057
Query: 165 QYK 157
++K
Sbjct: 1058 RHK 1060
>gi|110740597|dbj|BAE98403.1| putative non-LTR reverse transcriptase
[Arabidopsis thaliana]
Length = 278
Score = 190 bits (482), Expect = 5e-046
Identities = 104/280 (37%), Positives = 148/280 (52%), Gaps = 11/280 (3%)
Frame = -2
Query: 939 EVNDGHIAFFWVDDWLQVGKLLDITGAVGTYYLGVPRNARVSDAVTQAHWSIRGHRNRHF 760
EV G FW D W +G+L G+ GT LG+ A V++ V H R RH
Sbjct: 2 EVRSGTTTSFWHDHWSPLGRLHQHLGSRGTIDLGIATQATVAE-VLDTH-----RRKRHR 55
Query: 759 HDLYARIRN--ERVPH-DEYGSDLVLWKYSEDNYKPHFSSSRTYDQIRLRRSRVGWSKSV 589
D +I N E V SD LWK ED++K FSS +T+ QIR + W + V
Sbjct: 56 ADFLNQIENHIELVRQARSQESDRSLWKQKEDSFKGSFSSPKTWQQIRTISNECEWYRGV 115
Query: 588 WFSQEVPRYSFIVWLAVKNRLSTGDRMRAWG--IQQSCVMCGEPDETRDHIFFACPYTFT 415
WF P+YSF+ WLA NRL+TGDR+ W + +CV C E ETRDH+FF+CPY+
Sbjct: 116 WFPSSTPKYSFVTWLAFHNRLATGDRLYKWNSEARATCVFCDEELETRDHLFFSCPYSSQ 175
Query: 414 VWDTLAGRLSGGRSDPDWDTTLQFITNNDLESIDKILIKMVFQTCIYYMWKERNERRHQQ 235
+W LA L GR+ W + ++ + ++ FQ I+ +W+ERN RRH +
Sbjct: 176 IWIALAKGLLNGRNVSSWSLITPHLLDSSQPYLHVFTLRYTFQALIHSLWRERNGRRHGE 235
Query: 234 GFRTVDQAVRVIDKAIQNRISSLQYKADHKLAGLMRRWFE 115
+ ++IDK I+NR S+LQ + +L G ++ WF+
Sbjct: 236 PAIPASKLTKLIDKNIRNRFSTLQKMGNKRLQGGLQYWFQ 275
>gi|297795303|ref|XP_002865536.1| hypothetical protein ARALYDRAFT_917542
[Arabidopsis lyrata subsp. lyrata]
Length = 227
Score = 188 bits (477), Expect = 2e-045
Identities = 84/190 (44%), Positives = 124/190 (65%)
Frame = -2
Query: 831 RNARVSDAVTQAHWSIRGHRNRHFHDLYARIRNERVPHDEYGSDLVLWKYSEDNYKPHFS 652
R++ V++AV W +R R++ ++ +++ P G D LW+Y+ D+Y F+
Sbjct: 5 RHSTVANAVQGTQWRVRRCRSQTLRNVVTKLQEIAPPQRAKGPDKPLWRYTLDDYDSSFT 64
Query: 651 SSRTYDQIRLRRSRVGWSKSVWFSQEVPRYSFIVWLAVKNRLSTGDRMRAWGIQQSCVMC 472
S T++ +R + +V W SVWF Q VPRYSFIVWLAVK++LSTG RMRAWG++Q CV C
Sbjct: 65 SRHTWNLLRKAKHKVLWHNSVWFPQRVPRYSFIVWLAVKDQLSTGTRMRAWGVEQPCVFC 124
Query: 471 GEPDETRDHIFFACPYTFTVWDTLAGRLSGGRSDPDWDTTLQFITNNDLESIDKILIKMV 292
E DE+RDH+FFACP+T+++W L RL + +PDW TL + + L D+ L+ M+
Sbjct: 125 RERDESRDHLFFACPFTYSIWSELTSRLLRRKLNPDWSRTLLSLRSPQLTKRDQNLLCMI 184
Query: 291 FQTCIYYMWK 262
FQT +Y +WK
Sbjct: 185 FQTTLYMVWK 194
>gi|9279655|dbj|BAB01155.1| unnamed protein product [Arabidopsis thaliana]
Length = 310
Score = 185 bits (468), Expect = 2e-044
Identities = 93/252 (36%), Positives = 141/252 (55%), Gaps = 4/252 (1%)
Frame = -2
Query: 912 FWVDDWLQVGKLLDITGAVGTYYLGVPRNARVSDAVTQAHWSIRGHRNRHFHDLYARIRN 733
FW D W +G +++ G G LG+P+ A V + + A R HR + + I
Sbjct: 39 FWFDSWSSLGCIIEKLGERGYIDLGIPKTATVGEVM--AMQRRRHHRTGLLNQIEEEITK 96
Query: 732 ERVPHDEYGSDLVLWKYSEDNYKPHFSSSRTYDQIRLRRSRVGWSKSVWFSQEVPRYSFI 553
+R+ + D+ LWK +D+Y+ F +S T+ QIR + + K VWFS P+YSFI
Sbjct: 97 QRMCNIAGERDIALWKGEKDSYRNKFVTSETWRQIRNAKPEMEGYKGVWFSHSTPKYSFI 156
Query: 552 VWLAVKNRLSTGDRMRAWG--IQQSCVMCGEPDETRDHIFFACPYTFTVWDTLAGRLSGG 379
WL KNR++TGDRM W + SC +C EP ETRDH+FF C Y+ VW+ +A +
Sbjct: 157 TWLVSKNRMATGDRMVLWNQHVNTSCSLCDEPMETRDHLFFVCTYSRKVWEDIAKPILQH 216
Query: 378 RSDPDWDTTLQFITNNDLESIDKILIKMVFQTCIYYMWKERNERRHQQGFRTVDQAVRVI 199
R DW L ++ D + +IK VFQ I+ +W ERN RRH + V + V++I
Sbjct: 217 RFSLDWKDILNYVCERDSDKTRNFIIKHVFQNTIHSVWGERNARRHGEQPSPVGKLVKMI 276
Query: 198 DKAIQNRISSLQ 163
DK ++N++S+++
Sbjct: 277 DKNMRNKLSTIR 288
>gi|9757833|dbj|BAB08270.1| non-LTR retroelement reverse transcriptase-like
protein [Arabidopsis thaliana]
Length = 489
Score = 159 bits (401), Expect = 1e-036
Identities = 78/160 (48%), Positives = 102/160 (63%), Gaps = 7/160 (4%)
Frame = -2
Query: 1239 HLWSGSPTQTHKAKAAWEDLCCPKDEGGLGIRKLHDSSKVFALSLTWRIFSSSASLWVSW 1060
+LWSG T KAK AW D+C PKDEGGLG+R L +++ V L L WRI S + SLWV W
Sbjct: 228 YLWSGGDLNTSKAKIAWTDVCKPKDEGGLGLRSLKEANDVSCLKLIWRIISHADSLWVKW 287
Query: 1059 IQQYLLRQNSFWDVREDWK-GSWIWRKLLKLRSVAYQFIRFEVNDGHIAFFWVDDWLQVG 883
I LL+Q SFW VRE+ GSW+W+K+LK R A Q + EVN+G FFW D+W +G
Sbjct: 288 IHATLLKQVSFWAVRENTSLGSWMWKKVLKFRDAAIQLCKAEVNNGAHTFFWYDNWSDMG 347
Query: 882 KLLDITGAVGTYYLGVPRNARVSDAVTQAHWSIRGHRNRH 763
+L+DI G G +G+ ++A ++DA W R R RH
Sbjct: 348 RLIDIAGDRGVIDMGIKKHATMADA-----WGNR-RRRRH 381
>gi|5281029|emb|CAB45965.1| putative reverse transcriptase [Arabidopsis
thaliana]
Length = 662
Score = 159 bits (401), Expect = 1e-036
Identities = 77/199 (38%), Positives = 116/199 (58%), Gaps = 9/199 (4%)
Frame = -2
Query: 1236 LWSGSPTQTHKAKAAWEDLCCPKDEGGLGIRKLHDSSKVFALSLTWRIFSSSASLWVSWI 1057
LWSG T+KAK AWE +C PK EGGLG++ + +++ V L L WRI S SLWV WI
Sbjct: 365 LWSGPELSTNKAKIAWETVCRPKREGGLGLQSIKEANDVCCLKLIWRIVSQGDSLWVQWI 424
Query: 1056 QQYLLRQNSFWDVREDWKGSWIWRKLLKLRSVAYQFIRFEVNDGHIAFFWVDDWLQVGKL 877
+ YLL++N+FW R +GSW+W+KLLK R A F + ++ +G A FW DDW G+L
Sbjct: 425 RTYLLKRNTFWSFRSASQGSWMWKKLLKYRDTAKAFSKVDIRNGETASFWYDDWSSKGRL 484
Query: 876 LDITGAVGTYYLGVPRNARVSDAVTQAHWSIRGHRNRHFHDLYARI---RNERVPHDEYG 706
+D+ G G + +G+ + +++A + R HR + + + + RV +
Sbjct: 485 IDVLGERGQFDMGISKFKTLAEAWDRRR--SRYHRAETLNTIEQELLLAKQNRVAVE--- 539
Query: 705 SDLVLWKYSEDNYKPHFSS 649
D+ LWK D ++P FS+
Sbjct: 540 -DVFLWKGKNDTFRPQFSA 557
>gi|158828216|gb|ABW81094.1| RT6non-ltr [Cleome spinosa]
Length = 459
Score = 159 bits (401), Expect = 1e-036
Identities = 73/186 (39%), Positives = 114/186 (61%)
Frame = -2
Query: 1230 SGSPTQTHKAKAAWEDLCCPKDEGGLGIRKLHDSSKVFALSLTWRIFSSSASLWVSWIQQ 1051
S S T T A+ AW +C P+ EGGL +KL + +KVF L + WR+F ++SLWV+W+++
Sbjct: 188 SWSQTSTGTARVAWNIICRPRKEGGLNSKKLEELNKVFRLKMVWRVFKYASSLWVAWLKR 247
Query: 1050 YLLRQNSFWDVREDWKGSWIWRKLLKLRSVAYQFIRFEVNDGHIAFFWVDDWLQVGKLLD 871
+L++ SFWD + SW RKLL ++ +A +FIR + +G++A FW D+W +G LLD
Sbjct: 248 NVLKRGSFWDTVPTARHSWNVRKLLNMKDLAGKFIRCRIGNGNMASFWFDNWTDLGPLLD 307
Query: 870 ITGAVGTYYLGVPRNARVSDAVTQAHWSIRGHRNRHFHDLYARIRNERVPHDEYGSDLVL 691
G+ G L +P + +VSDAV+ W + G R++ DL+ R+ P G D+
Sbjct: 308 YIGSDGPRLLRIPLSGKVSDAVSGQSWLLPGARSQRIQDLHIRLLTLPTPSQAAGEDIYE 367
Query: 690 WKYSED 673
WK +E+
Sbjct: 368 WKSAEN 373
>gi|9758853|dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like
protein [Arabidopsis thaliana]
Length = 1223
Score = 152 bits (382), Expect = 2e-034
Identities = 76/164 (46%), Positives = 104/164 (63%), Gaps = 7/164 (4%)
Frame = -2
Query: 1236 LWSGSPTQTHKAKAAWEDLCCPKDEGGLGIRKLHDSSKVFALSLTWRIFSSSASLWVSWI 1057
LWSG+ ++KAK +W +C PKDEGGLG+R L +++ V L L W+I S S SLWV W+
Sbjct: 847 LWSGTEMNSNKAKISWHMVCKPKDEGGLGLRSLKEANDVCCLKLVWKIVSHSNSLWVKWV 906
Query: 1056 QQYLLRQNSFWDVRED-WKGSWIWRKLLKLRSVAYQFIRFEVNDGHIAFFWVDDWLQVGK 880
Q+LLR SFW+V++ +GSWIW+KLLK R VA + EV +G FW D+W +G+
Sbjct: 907 DQHLLRNASFWEVKQTVSQGSWIWKKLLKYREVAKTLSKVEVGNGKQTSFWYDNWSDLGQ 966
Query: 879 LLDITGAVGTYYLGVPRNARVSDAVTQAHWSIRGHRNRHFHDLY 748
LL+ TG G LG+ R V +A W+ R R RH +D+Y
Sbjct: 967 LLERTGDRGLIDLGISRRMTVEEA-----WTNRRQR-RHRNDVY 1004
>gi|3047086|gb|AAC13599.1| similar to reverse transcriptase (Pfam:
transcript_fact.hmm, score: 72.31) [Arabidopsis thaliana]
Length = 928
Score = 142 bits (356), Expect = 2e-031
Identities = 64/164 (39%), Positives = 96/164 (58%), Gaps = 2/164 (1%)
Frame = -2
Query: 603 WSKSVWFSQEVPRYSFIVWLAVKNRLSTGDRMRAWGIQQS--CVMCGEPDETRDHIFFAC 430
W K VWF+ E P++SF VWLA+ N+LSTG RM+ W +Q S CV+C ETRDH+FF+C
Sbjct: 761 WHKGVWFAHETPKHSFCVWLAIWNKLSTGQRMQHWNLQSSVGCVLCNNNLETRDHLFFSC 820
Query: 429 PYTFTVWDTLAGRLSGGRSDPDWDTTLQFITNNDLESIDKILIKMVFQTCIYYMWKERNE 250
YT +W+ LA L DW T + +++ + + L + V Q +Y +W+ERN
Sbjct: 821 AYTSGIWEALAKNLLQRSYTTDWQTIISYVSGQCHDRVSCFLARSVLQASVYTIWRERNG 880
Query: 249 RRHQQGFRTVDQAVRVIDKAIQNRISSLQYKADHKLAGLMRRWF 118
RRH + + ++ IDK I+N +S + K D + ++ WF
Sbjct: 881 RRHGETPNPAARLIQWIDKHIRNMLSVIHQKGDKRYDKGLQMWF 924
>gi|8778669|gb|AAF79677.1|AC022314_18 F9C16.26 [Arabidopsis thaliana]
Length = 1902
Score = 137 bits (343), Expect = 6e-030
Identities = 66/155 (42%), Positives = 95/155 (61%)
Frame = -2
Query: 1227 GSPTQTHKAKAAWEDLCCPKDEGGLGIRKLHDSSKVFALSLTWRIFSSSASLWVSWIQQY 1048
G+P AK +WE +C K GGLG+R L +KV AL L W +F++++SLWVSW++
Sbjct: 1535 GAPNSARGAKLSWEIVCSSKVCGGLGLRDLVAWNKVLALKLIWMLFTAASSLWVSWVRVN 1594
Query: 1047 LLRQNSFWDVREDWKGSWIWRKLLKLRSVAYQFIRFEVNDGHIAFFWVDDWLQVGKLLDI 868
L+R +FW + + GSWI R+L KLR++A FI EV G A FW+D+W G L+D+
Sbjct: 1595 LIRNKNFWYLNPSFSGSWILRRLCKLRTLARPFIVCEVGSGVTANFWLDNWTSHGPLIDL 1654
Query: 867 TGAVGTYYLGVPRNARVSDAVTQAHWSIRGHRNRH 763
T G G+PR++ V DA+ W I R+R+
Sbjct: 1655 TVPTGPQITGLPRDSTVRDALRGNDWWISASRSRN 1689
>gi|297819234|ref|XP_002877500.1| predicted protein [Arabidopsis lyrata subsp.
lyrata]
Length = 136
Score = 132 bits (332), Expect = 1e-028
Identities = 61/133 (45%), Positives = 81/133 (60%)
Frame = -2
Query: 510 MRAWGIQQSCVMCGEPDETRDHIFFACPYTFTVWDTLAGRLSGGRSDPDWDTTLQFITNN 331
MRAWG+ Q CV CGEP E+RDH+FFACPYTFT+W L L + PDW TL + +
Sbjct: 1 MRAWGVTQPCVFCGEPTESRDHLFFACPYTFTIWFELTSPLLRHKLTPDWSQTLLSLRST 60
Query: 330 DLESIDKILIKMVFQTCIYYMWKERNERRHQQGFRTVDQAVRVIDKAIQNRISSLQYKAD 151
L+ DK L ++ FQ +Y +W+ERN R H Q + ++ IDKAI+ RISSL+ +
Sbjct: 61 QLKLHDKTLARLAFQASVYLLWRERNGRIHNQCSNSSTTMLKTIDKAIKERISSLKSRKG 120
Query: 150 HKLAGLMRRWFEV 112
L RW E+
Sbjct: 121 SNFEELQVRWTEL 133
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,934,268,127,257
Number of Sequences: 15229318
Number of Extensions: 3934268127257
Number of Successful Extensions: 921802745
Number of sequences better than 0.0: 0
|