BLASTX 7.6.2
Query= UN16378 /QuerySize=2105
(2104 letters)
Database: TAIR9 protein;
33,410 sequences; 13,468,323 total letters
Score E
Sequences producing significant alignments: (bits) Value
TAIR9_protein||AT3G24590.1 | Symbols: PLSP1 | PLSP1 (plastidic t... 465 7e-131
TAIR9_protein||AT3G24600.1 | Symbols: | INVOLVED IN: antigen pr... 459 5e-129
TAIR9_protein||AT1G06870.1 | Symbols: | signal peptidase, putat... 240 4e-063
TAIR9_protein||AT2G30440.1 | Symbols: | chloroplast thylakoidal... 239 6e-063
TAIR9_protein||AT1G45688.1 | Symbols: | unknown protein | chr1:... 132 1e-030
TAIR9_protein||AT4G35170.1 | Symbols: | FUNCTIONS IN: molecular... 123 5e-028
TAIR9_protein||AT5G42860.1 | Symbols: | unknown protein | chr5:... 117 3e-026
TAIR9_protein||AT2G41990.1 | Symbols: | unknown protein | chr2:... 116 7e-026
TAIR9_protein||AT3G08980.1 | Symbols: | signal peptidase I fami... 50 4e-006
>TAIR9_protein||AT3G24590.1 | Symbols: PLSP1 | PLSP1 (plastidic type I signal
peptidase 1); peptidase | chr3:8970694-8972020 FORWARD
Length = 292
Score = 465 bits (1195), Expect = 7e-131
Identities = 243/299 (81%), Positives = 255/299 (85%), Gaps = 13/299 (4%)
Frame = -1
Query: 2032 ISLHFPTPSLSLLTSHSNSNSRFFKNSNNNPIPRLNFTNQSQSVPLPPLTFK-ATHSNRR 1856
ISLHF TP L+ L S+SNSRF KN N N I FT +SQ + L F T+ NRR
Sbjct: 5 ISLHFSTPPLAFL--KSDSNSRFLKNPNPNFI---QFTPKSQLLFPQRLNFNTGTNLNRR 59
Query: 1855 NLGCHGLKDSSETAKSAPPLDSGGGNGEGGDGGDNGDEPEGSEVEVEKNRLFPEWLDFTS 1676
L C+G+KDSSET KSAP LDSG G GGD GD+ +G EVE EKNRLFPEWLDFTS
Sbjct: 60 TLSCYGIKDSSETTKSAPSLDSGDGG-----GGDGGDDDKG-EVE-EKNRLFPEWLDFTS 112
Query: 1675 DDAKTVFLAITVSLAFRYFIAEPRYIPSLSMYPTFDVGDRLVAEKVSYYFRKPCANDIVI 1496
DDA+TVF+AI VSLAFRYFIAEPRYIPSLSMYPTFDVGDRLVAEKVSYYFRKPCANDIVI
Sbjct: 113 DDAQTVFVAIAVSLAFRYFIAEPRYIPSLSMYPTFDVGDRLVAEKVSYYFRKPCANDIVI 172
Query: 1495 FKSPPVLQEVGYTDADVFIKRIVAKEGDLVEVHNGKLMVNGVPRNESFILEPPGYEMTPV 1316
FKSPPVLQEVGYTDADVFIKRIVAKEGDLVEVHNGKLMVNGV RNE FILEPPGYEMTP+
Sbjct: 173 FKSPPVLQEVGYTDADVFIKRIVAKEGDLVEVHNGKLMVNGVARNEKFILEPPGYEMTPI 232
Query: 1315 RVPENSVFVMGDNRNNSYDSHVWGPLPLKNIIGRSVFRYWPPNRVSGTVLEGGCAVDIQ 1139
RVPENSVFVMGDNRNNSYDSHVWGPLPLKNIIGRSVFRYWPPNRVSGTVLEGGCAVD Q
Sbjct: 233 RVPENSVFVMGDNRNNSYDSHVWGPLPLKNIIGRSVFRYWPPNRVSGTVLEGGCAVDKQ 291
>TAIR9_protein||AT3G24600.1 | Symbols: | INVOLVED IN: antigen processing and
presentation; LOCATED IN: MHC class I protein complex, membrane;
CONTAINS InterPro DOMAIN/s: MHC class I, alpha chain, C-terminal
(InterPro:IPR010579), Harpin-induced 1 (InterPro:IPR010847); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G35170.1); Has 181 Blast hits to 96 proteins in 11 species:
Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 181; Viruses
- 0; Other Eukaryotes - 0 (source: NCBI BLink). | chr3:8972195-8974867
REVERSE
Length = 507
Score = 459 bits (1179), Expect = 5e-129
Identities = 226/261 (86%), Positives = 245/261 (93%), Gaps = 1/261 (0%)
Frame = +3
Query: 165 IQVQTPNYTILTESRLSSSSRTSNGTSGVGFRWKGSSRRRDMYWLERHYTIDEEEVYEDN 344
+ V TPNYTIL+ESRLSSSSRTSNGTSG+GFRWKGSSRR +MYW E+ YTI+E+EVY+DN
Sbjct: 248 VPVHTPNYTILSESRLSSSSRTSNGTSGMGFRWKGSSRRSNMYWPEKPYTINEDEVYDDN 307
Query: 345 RGLSVGQCRAVMVILGIVVVFSVFCSVLWGASHPFSPIVSVKSFSLHSFYYGEGIDRTGV 524
RGLSVGQCRAV+VILG VVVFSVFCSVLWGASHPFSPIVSVKS +HSFYYGEGIDRTGV
Sbjct: 308 RGLSVGQCRAVLVILGTVVVFSVFCSVLWGASHPFSPIVSVKSVDIHSFYYGEGIDRTGV 367
Query: 525 ATKILSFNSSVKVTIDSPAPYFGIHVSSSSTFNLTFSTLTLATGQLKSYYQPRKSKHTAI 704
ATKILSFNSSVKVTIDSPAPYFGIHV SSSTF LTFS LTLATGQLKSYYQPRKSKH +I
Sbjct: 368 ATKILSFNSSVKVTIDSPAPYFGIHV-SSSTFKLTFSALTLATGQLKSYYQPRKSKHISI 426
Query: 705 VKLIGSEVPLYGAGPNLAASDKKGRVPVKLEFEIRSQGNLLGKLVKSKHLNHISCSFYIS 884
VKL G+EVPLYGAGP+LAASDKKG+VPVKLEFEIRS+GNLLGKLVKSKH NH+SCSF+IS
Sbjct: 427 VKLTGAEVPLYGAGPHLAASDKKGKVPVKLEFEIRSRGNLLGKLVKSKHENHVSCSFFIS 486
Query: 885 SSKTSKPLEFTHKTCKHITK* 947
SSKTSKP+EFTHKTCK +TK*
Sbjct: 487 SSKTSKPIEFTHKTCKLVTK* 507
Score = 204 bits (518), Expect = 2e-052
Identities = 121/259 (46%), Positives = 159/259 (61%), Gaps = 24/259 (9%)
Frame = +3
Query: 39 SESERTSLDLSISSPKQ-AYYVESPSSVSQVYDGDKSSSAASLIQVQTPNYTILTESRLS 215
S+S+ TSLDL SSPK+ YYV+SPS D DKSSS A TP + S S
Sbjct: 7 SDSDVTSLDL--SSPKRPTYYVQSPSR-----DSDKSSSVALTTHQTTPTE---SPSHPS 56
Query: 216 SSSRTSNGTSGVGFRWKGSSRRRDMYWLERHYTIDEEE--------VYEDNRGLSVGQCR 371
+SR SNG G GFRWKG + W + D+EE +YEDNRG+S+ CR
Sbjct: 57 IASRVSNGGGG-GFRWKGRRKYHGGIW----WPADKEEGGDGRYEDLYEDNRGVSIVTCR 111
Query: 372 AVMVILGIVVVFSVFCSVLWGASHPFSPIVSVKSFSLHSFYYGEGIDRTGVATKILSFNS 551
++ ++ + +F + CSVL+GAS PIV +K ++ SFYYGEG D TGV TKI++
Sbjct: 112 LILGVVATLSIFFLLCSVLFGASQSSPPIVYIKGVNVRSFYYGEGSDNTGVPTKIMNVKC 171
Query: 552 SVKVTIDSPAPYFGIHVSSSSTFNLTFSTLTLATGQLKSYYQPRKSKHTAIVKLIGSEVP 731
SV +T +P+ FGIHVSS++ + TLA +LKSY+QP++S HT+ + LIGS+VP
Sbjct: 172 SVVITTHNPSTLFGIHVSSTAVSLIYSRQFTLANARLKSYHQPKQSNHTSRINLIGSKVP 231
Query: 732 LYGAGPNLAASDKKGRVPV 788
LYGAG L ASD G VPV
Sbjct: 232 LYGAGAELVASDNSGGVPV 250
>TAIR9_protein||AT1G06870.1 | Symbols: | signal peptidase, putative |
chr1:2108832-2110642 FORWARD
Length = 368
Score = 240 bits (610), Expect = 4e-063
Identities = 116/174 (66%), Positives = 139/174 (79%)
Frame = -1
Query: 1690 LDFTSDDAKTVFLAITVSLAFRYFIAEPRYIPSLSMYPTFDVGDRLVAEKVSYYFRKPCA 1511
L+ S+DAK F A+TVSL FR +AEP+ IPS SM PT DVGDR++AEKVSY+FRKP
Sbjct: 180 LNICSEDAKAAFTAVTVSLLFRSALAEPKSIPSTSMLPTLDVGDRVIAEKVSYFFRKPEV 239
Query: 1510 NDIVIFKSPPVLQEVGYTDADVFIKRIVAKEGDLVEVHNGKLMVNGVPRNESFILEPPGY 1331
+DIVIFK+PP+L E GY+ ADVFIKRIVA EGD VEV +GKL+VN + E F+LEP Y
Sbjct: 240 SDIVIFKAPPILVEHGYSCADVFIKRIVASEGDWVEVCDGKLLVNDTVQAEDFVLEPIDY 299
Query: 1330 EMTPVRVPENSVFVMGDNRNNSYDSHVWGPLPLKNIIGRSVFRYWPPNRVSGTV 1169
EM P+ VPE VFV+GDNRN S+DSH WGPLP+KNIIGRSVFRYWPP++VS +
Sbjct: 300 EMEPMFVPEGYVFVLGDNRNKSFDSHNWGPLPIKNIIGRSVFRYWPPSKVSDII 353
>TAIR9_protein||AT2G30440.1 | Symbols: | chloroplast thylakoidal processing
peptidase | chr2:12973244-12975027 FORWARD
Length = 341
Score = 239 bits (609), Expect = 6e-063
Identities = 122/222 (54%), Positives = 153/222 (68%), Gaps = 9/222 (4%)
Frame = -1
Query: 1825 SETAKSAPPLDSGGGNGEGGDGGDNGDEPEGSEVEVEKNRLFPEWLDFTSDDAKTVFLAI 1646
S+ K+ P +D G D D+ + G V K L S+DAK F A+
Sbjct: 111 SKWIKNPPVIDDVDKGGTVCDDDDDKESRNGGSGWVNK------LLSVCSEDAKAAFTAV 164
Query: 1645 TVSLAFRYFIAEPRYIPSLSMYPTFDVGDRLVAEKVSYYFRKPCANDIVIFKSPPVL--- 1475
TVS+ FR +AEP+ IPS SMYPT D GDR++AEKVSY+FRKP +DIVIFK+PP+L
Sbjct: 165 TVSILFRSALAEPKSIPSTSMYPTLDKGDRVMAEKVSYFFRKPEVSDIVIFKAPPILLEY 224
Query: 1474 QEVGYTDADVFIKRIVAKEGDLVEVHNGKLMVNGVPRNESFILEPPGYEMTPVRVPENSV 1295
E GY+ DVFIKRIVA EGD VEV +GKL VN + + E F+LEP YEM P+ VP+ V
Sbjct: 225 PEYGYSSNDVFIKRIVASEGDWVEVRDGKLFVNDIVQEEDFVLEPMSYEMEPMFVPKGYV 284
Query: 1294 FVMGDNRNNSYDSHVWGPLPLKNIIGRSVFRYWPPNRVSGTV 1169
FV+GDNRN S+DSH WGPLP++NI+GRSVFRYWPP++VS T+
Sbjct: 285 FVLGDNRNKSFDSHNWGPLPIENIVGRSVFRYWPPSKVSDTI 326
>TAIR9_protein||AT1G45688.1 | Symbols: | unknown protein |
chr1:17191502-17192870 FORWARD
Length = 343
Score = 132 bits (331), Expect = 1e-030
Identities = 106/337 (31%), Positives = 170/337 (50%), Gaps = 48/337 (14%)
Frame = +3
Query: 45 SERTSLDLS--ISSPKQ-AYYVESPSSVSQVYDGDKSSSAASLIQVQTP-------NYTI 194
SE TSL S SP++ YYV+SPS S +DG+K++++ V +P + ++
Sbjct: 7 SEVTSLAASSPARSPRRPVYYVQSPSRDS--HDGEKTATSFHSTPVLSPMGSPPHSHSSM 64
Query: 195 LTESRLSSSSRTSNGTSGVGFRW----KGSSRR---RDMYWLERHYTIDEEEVYE--DNR 347
SR SSSSR S G+ G R GS R+ + W E I+EE + + D
Sbjct: 65 GRHSRESSSSRFS-GSLKPGSRKVNPNDGSKRKGHGGEKQWKE-CAVIEEEGLLDDGDRD 122
Query: 348 GLSVGQCRAVMVILGIVVVFSVFCSVLWGASHPFSPIVSVKSFSLHSFYYGEGIDRTGVA 527
G +C + I+G ++F F +L+GA+ P P ++VKS + + G D GV
Sbjct: 123 GGVPRRCYVLAFIVGFFILFGFFSLILYGAAKPMKPKITVKSITFETLKIQAGQDAGGVG 182
Query: 528 TKILSFNSSVKVTIDSPAPYFGIHVSSSSTFNLTFSTLTLATGQLKSYYQPRKSKHTAIV 707
T +++ N+++++ + +FG+HV +S+ +L+FS + + +G +K +YQ RKS+ T +V
Sbjct: 183 TDMITMNATLRMLYRNTGTFFGVHV-TSTPIDLSFSQIKIGSGSVKKFYQGRKSERTVLV 241
Query: 708 KLIGSEVPLYGAGPNL----------AASDKKGR-------------VPVKLEFEIRSQG 818
+IG ++PLYG+G L KKG VP+ L F +RS+
Sbjct: 242 HVIGEKIPLYGSGSTLLPPAPPAPLPKPKKKKGAPVPIPDPPAPPAPVPMTLSFVVRSRA 301
Query: 819 NLLGKLVKSKHLNHISCSFYISSSKTSKPLEFTHKTC 929
+LGKLV+ K I C +K + T K C
Sbjct: 302 YVLGKLVQPKFYKKIECDINFEHKNLNKHIVIT-KNC 337
>TAIR9_protein||AT4G35170.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: stem,
carpel; EXPRESSED DURING: petal differentiation and expansion stage;
CONTAINS InterPro DOMAIN/s: Harpin-induced 1 (InterPro:IPR010847); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G41990.1); Has 82 Blast hits to 81 proteins in 9 species:
Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 82; Viruses
- 0; Other Eukaryotes - 0 (source: NCBI BLink). |
chr4:16736839-16738186 FORWARD
Length = 300
Score = 123 bits (308), Expect = 5e-028
Identities = 69/212 (32%), Positives = 118/212 (55%), Gaps = 8/212 (3%)
Frame = +3
Query: 321 EEEVYEDNRGLSVGQCRAV----MVILGIVVVFSVFCSVLWGASHPFSPIVSVKSFSLHS 488
E+E Y++ G + R ++ +V+ F++FC +LWG S F+PI ++K L +
Sbjct: 89 EDEDYDEMDGPDEKRRRITRFYSCLLFTLVLAFTLFCLILWGVSKSFAPIATLKEMVLEN 148
Query: 489 FYYGEGIDRTGVATKILSFNSSVKVTIDSPAPYFGIHVSSSSTFNLTFSTLTLATGQLKS 668
G D++GV T +L+ NS+V++ +PA +F +HV +S+ L++S L LA+GQ+
Sbjct: 149 LNVQSGNDQSGVLTDMLTLNSTVRILYRNPATFFTVHV-TSAPLQLSYSQLILASGQMGE 207
Query: 669 YYQPRKSKHTAIVKLIGSEVPLYGAGPNL---AASDKKGRVPVKLEFEIRSQGNLLGKLV 839
+ Q RKS+ K+ G ++PLYG P L A + +P+ L F +R++ +LG+LV
Sbjct: 208 FSQRRKSERIIETKVFGDQIPLYGGVPALFGQRAEPDQVVLPLNLTFTLRARAYVLGRLV 267
Query: 840 KSKHLNHISCSFYISSSKTSKPLEFTHKTCKH 935
K+ ++I CS K K L+ + H
Sbjct: 268 KTTFHSNIKCSITFYGDKLGKTLDLSKSCSDH 299
>TAIR9_protein||AT5G42860.1 | Symbols: | unknown protein |
chr5:17183339-17184857 REVERSE
Length = 321
Score = 117 bits (293), Expect = 3e-026
Identities = 76/252 (30%), Positives = 135/252 (53%), Gaps = 17/252 (6%)
Frame = +3
Query: 45 SERTSLDLSISSP-----KQAYYVESPSSVSQVYDGDKSSSAASLIQVQTPNYTILTESR 209
SE TS LS SSP + AY+V+SPS S +DG+K++++ TP T S
Sbjct: 7 SEVTS--LSASSPTRSPRRPAYFVQSPSRDS--HDGEKTATSFH----STPVLTSPMGSP 58
Query: 210 LSSSSRTSNGTSGVGFRWKGSSRRRDMYWLERHYTIDEEEVYEDNRGLSVGQCRAVMVIL 389
S S +S + G + KG + + +E +D+ + ++ +C + I+
Sbjct: 59 PHSHSSSSRFSKINGSKRKGHAGEKQFAMIEEEGLLDDGDREQE---ALPRRCYVLAFIV 115
Query: 390 GIVVVFSVFCSVLWGASHPFSPIVSVKSFSLHSFYYGEGIDRTGVATKILSFNSSVKVTI 569
G ++F+ F +L+ A+ P P +SVKS + G D G+ T +++ N+++++
Sbjct: 116 GFSLLFAFFSLILYAAAKPQKPKISVKSITFEQLKVQAGQDAGGIGTDMITMNATLRMLY 175
Query: 570 DSPAPYFGIHVSSSSTFNLTFSTLTLATGQLKSYYQPRKSKHTAIVKLIGSEVPLYGAGP 749
+ +FG+HV +SS +L+FS +T+ +G +K +YQ RKS+ T +V ++G ++PLYG+G
Sbjct: 176 RNTGTFFGVHV-TSSPIDLSFSQITIGSGSIKKFYQSRKSQRTVVVNVLGDKIPLYGSGS 234
Query: 750 NLAASDKKGRVP 785
L +P
Sbjct: 235 TLVPPPPPAPIP 246
>TAIR9_protein||AT2G41990.1 | Symbols: | unknown protein |
chr2:17527396-17528527 FORWARD
Length = 298
Score = 116 bits (289), Expect = 7e-026
Identities = 88/300 (29%), Positives = 152/300 (50%), Gaps = 21/300 (7%)
Frame = +3
Query: 33 ATSESERTSLDLSISSPKQA-----YYVESPSSVSQVYDGDKSS--SAASLIQVQT-PNY 188
A ++SE TS+D + SP ++ YYV+SPS+ +D +K S S SL+ T P+Y
Sbjct: 3 AKTDSEATSIDAAALSPPRSAIRPLYYVQSPSN----HDVEKMSFGSGCSLMGSPTHPHY 58
Query: 189 TILTESRLSSSSRTSNGTSGVGFRWKGSSRRRDMYWLERHYTIDEEEVYEDNRGLSVGQC 368
+ S S TS + +K S R R Y + D + + R + +
Sbjct: 59 YHCSPIHHSRESSTSRFSDRALLSYK-SIRERRRYINDGDDKTDGGDDDDPFRNVRL--- 114
Query: 369 RAVMVILGIVVVFSVFCSVLWGASHPFSPIVSVKSFSLHSFYYGEGIDRTGVATKILSFN 548
V ++L ++ +F+VF +LWGAS + P V+VK + G D +GV T +LS N
Sbjct: 115 -YVWLLLSVIFLFTVFSLILWGASKSYPPKVTVKGMLVRDLNLQAGNDLSGVPTDMLSLN 173
Query: 549 SSVKVTIDSPAPYFGIHVSSSSTFNLTFSTLTLATGQLKSYYQPRKSKHTAIVKLIGSEV 728
S+V++ +P+ +F +HV++S L +S L L++G++ + R + + + G ++
Sbjct: 174 STVRIYYRNPSTFFAVHVTASPLL-LHYSNLLLSSGEMNKFTVGRNGETNVVTVVQGHQI 232
Query: 729 PLYGAGPNLAASDKKGRVPVKLEFEIRSQGNLLGKLVKSKHLNHISCSFYISSSKTSKPL 908
PLYG ++ +P+ L + S+ +LG+LV SK I CSF + ++ K +
Sbjct: 233 PLYG---GVSFHLDTLSLPLNLTIVLHSKAYILGRLVTSKFYTRIICSFTLDANHLPKSI 289
>TAIR9_protein||AT3G08980.1 | Symbols: | signal peptidase I family protein |
chr3:2741279-2742375 FORWARD
Length = 155
Score = 50 bits (119), Expect = 4e-006
Identities = 23/47 (48%), Positives = 30/47 (63%)
Frame = -1
Query: 1318 VRVPENSVFVMGDNRNNSYDSHVWGPLPLKNIIGRSVFRYWPPNRVS 1178
+RVPE +V GDN+ +S DS +GP+PL I GR WPP R+S
Sbjct: 104 IRVPEGHCWVEGDNKTSSLDSRSFGPIPLGLIQGRVTRVMWPPQRIS 150
Database: TAIR9 protein
Posted date: Wed Jul 08 15:16:08 2009
Number of letters in database: 13,468,323
Number of sequences in database: 33,410
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,599,552,957
Number of Sequences: 33410
Number of Extensions: 8599552957
Number of Successful Extensions: 306342874
Number of sequences better than 0.0: 0
|