BLASTX 7.6.2
Query= UN83098 /QuerySize=863
(862 letters)
Database: TAIR9 protein;
33,410 sequences; 13,468,323 total letters
Score E
Sequences producing significant alignments: (bits) Value
TAIR9_protein||AT1G76240.1 | Symbols: | unknown protein | chr1:... 448 3e-126
TAIR9_protein||AT2G17080.1 | Symbols: | unknown protein | chr2:... 69 3e-012
TAIR9_protein||AT2G40070.1 | Symbols: | FUNCTIONS IN: molecular... 61 9e-010
TAIR9_protein||AT2G40070.2 | Symbols: | FUNCTIONS IN: molecular... 61 9e-010
TAIR9_protein||AT1G68725.1 | Symbols: AGP19, ATAGP19 | AGP19 (AR... 54 1e-007
TAIR9_protein||AT4G35800.1 | Symbols: NRPB1, RPB1, RNA_POL_II_LS... 48 6e-006
TAIR9_protein||AT2G45000.1 | Symbols: EMB2766 | EMB2766 (EMBRYO ... 48 8e-006
>TAIR9_protein||AT1G76240.1 | Symbols: | unknown protein |
chr1:28602949-28603875 REVERSE
Length = 309
Score = 448 bits (1150), Expect = 3e-126
Identities = 228/281 (81%), Positives = 253/281 (90%), Gaps = 4/281 (1%)
Frame = +3
Query: 30 MVGVFRRSLSFPNKPTVRPPPPSKPRVSHHTRSISLPCRSHPLISHINHEISQIKSWSSL 209
MVGVFRRSLSFPNKP R P SKPRVSHHTRSISLPCRSHPLISH+NHEISQ+KSW S
Sbjct: 1 MVGVFRRSLSFPNKPCGRSSPSSKPRVSHHTRSISLPCRSHPLISHVNHEISQLKSWFSF 60
Query: 210 ----DRRTTAWITDGLSLLRDVQETLSDILHLPQSQESLRNRPVFFENLLEDLLRFVDAY 377
RTT+WITDGLSLL+DVQETL+DIL LPQSQESLRNRPVFFENLLEDLLRFVDAY
Sbjct: 61 AGETHSRTTSWITDGLSLLKDVQETLADILQLPQSQESLRNRPVFFENLLEDLLRFVDAY 120
Query: 378 GIFRTSLLSLREHQSAAQVALRRKDDVKISSYVNSRRALARDVAKLTSAVREPKTKYNRC 557
GIFRTS+L LREHQSAAQVALR+KDD KI+SY+ SRR+LARD+AKLTS++REPKTK+ C
Sbjct: 121 GIFRTSILCLREHQSAAQVALRKKDDEKIASYLKSRRSLARDIAKLTSSIREPKTKHQHC 180
Query: 558 HVDVLNGSYVEAELASVIGDVIEVTVLVSVALFNGVYLSLRSSKTTAFVGFLKRSEKRDK 737
HVD +NG+Y +AELASVIGDVIEVTVLVSVALFNGVYLSLR++KTT F+GFLKRSEK++K
Sbjct: 181 HVDNVNGTYGDAELASVIGDVIEVTVLVSVALFNGVYLSLRATKTTPFIGFLKRSEKKEK 240
Query: 738 NGEGIEELKQVEEKSLVGLSKKKNEEVKILTQKMMEFENSI 860
EGI ELKQVEEKSL+GLSKKKNEEVK L ++MME ENSI
Sbjct: 241 LDEGIVELKQVEEKSLIGLSKKKNEEVKSLMKRMMELENSI 281
>TAIR9_protein||AT2G17080.1 | Symbols: | unknown protein | chr2:7433326-7434117
REVERSE
Length = 264
Score = 69 bits (167), Expect = 3e-012
Identities = 41/153 (26%), Positives = 81/153 (52%), Gaps = 5/153 (3%)
Frame = +3
Query: 108 VSHHTRSISLPCRSHPLISHINHEISQIKSWSSLDRRTTAWITDGLSLLRDVQETLSDIL 287
VS H RS S P RSHP +H++ ++++++S +++ I L L+++ E+L ++
Sbjct: 3 VSFHVRSNSFPSRSHPQAAHVDEQLARLRSSEQASSSSSSSICQRLDNLQELHESLDKLI 62
Query: 288 HLPQSQESL--RNRPVFFENLLEDLLRFVDAYGIFRTSLLSLREHQSAAQVALRRKD--- 452
P +Q++L + E LL+ LR +D I + +L ++E Q LRRK
Sbjct: 63 SRPVTQQALSQEHNKKAVEQLLDGSLRILDLCNISKDALSEMKEGLMEIQSILRRKRGDL 122
Query: 453 DVKISSYVNSRRALARDVAKLTSAVREPKTKYN 551
++ Y+ SR++L + K+ +++ + + N
Sbjct: 123 SEEVKKYLTSRKSLKKSFQKVQKSLKVTQAEDN 155
>TAIR9_protein||AT2G40070.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 17 plant structures;
EXPRESSED DURING: 7 growth stages; BEST Arabidopsis thaliana protein
match is: proline-rich family protein (TAIR:AT3G09000.1); Has 94255
Blast hits to 49644 proteins in 1573 species: Archae - 225; Bacteria -
11215; Metazoa - 37735; Fungi - 21320; Plants - 3339; Viruses - 2662;
Other Eukaryotes - 17759 (source: NCBI BLink). | chr2:16728378-16731160
REVERSE
Length = 608
Score = 61 bits (146), Expect = 9e-010
Identities = 66/206 (32%), Positives = 96/206 (46%), Gaps = 13/206 (6%)
Frame = +1
Query: 49 DHFLSRTNPPSVHHLHQSHAS---LTTQDPSASHAGHTL*SPTSTTRSPRSNPGPP---- 207
+H SR S S AS ++ P + A T S T T S S P P
Sbjct: 153 NHLTSRQQTSSPGLSSSSGASRRPSSSGGPGSRPATPTGRSSTLTANSKSSRPSTPTSRA 212
Query: 208 SIAAPPRGSPTVSASS-ETSKKPSP-TYSTSLSRRSLSATA--PSSSRTSSKTSSASSTP 375
++++ R S T S S+ + KP+P + STSLS L+ TA P++S S S STP
Sbjct: 213 TVSSATRPSLTNSRSTVSATTKPTPMSRSTSLSSSRLTPTASKPTTSTARSAGSVTRSTP 272
Query: 376 TASSAPRSSPSASTSPPLRSLSGEKTTSRSPPT*TPAALSRETSRS*RRPYASRRRSTTA 555
+++ + PS ST+P RS + +T S PT P+ +S RRP AS +TT
Sbjct: 273 -STTTKSAGPSRSTTPLSRS-TARSSTPTSRPTLPPSKTISRSSTPTRRPIASASAATTT 330
Query: 556 ATWTF*TGRTSRRSWRRSSATSSRSP 633
A T + S + + T S++P
Sbjct: 331 ANPTISQIKPSSPAPAKPMPTPSKNP 356
>TAIR9_protein||AT2G40070.2 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 17 plant structures;
EXPRESSED DURING: 7 growth stages; BEST Arabidopsis thaliana protein
match is: proline-rich family protein (TAIR:AT3G09000.1); Has 92805
Blast hits to 48882 proteins in 1559 species: Archae - 225; Bacteria -
11081; Metazoa - 37135; Fungi - 20962; Plants - 3300; Viruses - 2664;
Other Eukaryotes - 17438 (source: NCBI BLink). | chr2:16728378-16731040
REVERSE
Length = 568
Score = 61 bits (146), Expect = 9e-010
Identities = 66/206 (32%), Positives = 96/206 (46%), Gaps = 13/206 (6%)
Frame = +1
Query: 49 DHFLSRTNPPSVHHLHQSHAS---LTTQDPSASHAGHTL*SPTSTTRSPRSNPGPP---- 207
+H SR S S AS ++ P + A T S T T S S P P
Sbjct: 113 NHLTSRQQTSSPGLSSSSGASRRPSSSGGPGSRPATPTGRSSTLTANSKSSRPSTPTSRA 172
Query: 208 SIAAPPRGSPTVSASS-ETSKKPSP-TYSTSLSRRSLSATA--PSSSRTSSKTSSASSTP 375
++++ R S T S S+ + KP+P + STSLS L+ TA P++S S S STP
Sbjct: 173 TVSSATRPSLTNSRSTVSATTKPTPMSRSTSLSSSRLTPTASKPTTSTARSAGSVTRSTP 232
Query: 376 TASSAPRSSPSASTSPPLRSLSGEKTTSRSPPT*TPAALSRETSRS*RRPYASRRRSTTA 555
+++ + PS ST+P RS + +T S PT P+ +S RRP AS +TT
Sbjct: 233 -STTTKSAGPSRSTTPLSRS-TARSSTPTSRPTLPPSKTISRSSTPTRRPIASASAATTT 290
Query: 556 ATWTF*TGRTSRRSWRRSSATSSRSP 633
A T + S + + T S++P
Sbjct: 291 ANPTISQIKPSSPAPAKPMPTPSKNP 316
>TAIR9_protein||AT1G68725.1 | Symbols: AGP19, ATAGP19 | AGP19
(ARABINOGALACTAN-PROTEIN 19) | chr1:25809298-25810130 FORWARD
Length = 249
Score = 54 bits (128), Expect = 1e-007
Identities = 35/124 (28%), Positives = 52/124 (41%)
Frame = +1
Query: 100 SHASLTTQDPSASHAGHTL*SPTSTTRSPRSNPGPPSIAAPPRGSPTVSASSETSKKPSP 279
S S+ Q P+AS T +P TT +P + PP P S +S + P+
Sbjct: 18 SSFSVNAQGPAASPVTSTTTAPPPTTAAPPTTAAPPPTTTTPPVSAAQPPASPVTPPPAV 77
Query: 280 TYSTSLSRRSLSATAPSSSRTSSKTSSASSTPTASSAPRSSPSASTSPPLRSLSGEKTTS 459
T ++ + + +P++ S +S PT S P S P A TSPP S +
Sbjct: 78 TPTSPPAPKVAPVISPATPPPQPPQSPPASAPTVSPPPVSPPPAPTSPPPTPASPPPAPA 137
Query: 460 RSPP 471
PP
Sbjct: 138 SPPP 141
>TAIR9_protein||AT4G35800.1 | Symbols: NRPB1, RPB1, RNA_POL_II_LSRNA_POL_II_LS,
RNA_POL_II_LS | NRPB1 (RNA POLYMERASE II LARGE SUBUNIT); DNA binding /
DNA-directed RNA polymerase | chr4:16961115-16967892 REVERSE
Length = 1841
Score = 48 bits (113), Expect = 6e-006
Identities = 44/124 (35%), Positives = 61/124 (49%), Gaps = 4/124 (3%)
Frame = +1
Query: 109 SLTTQDPSASHAGHTL*SPTSTTRSPRSNPGPPSIA-APPRGSPTVSASSETSKKPSPTY 285
S T+ S S G++ SP + SP +P PS + P SPT + S TS SPT
Sbjct: 1574 SPTSPTYSPSSPGYSPTSPAYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTS 1633
Query: 286 -STSLSRRSLSATAPSSSRTSSKTS--SASSTPTASSAPRSSPSASTSPPLRSLSGEKTT 456
S S + + S T+P+ S TS S S S +PT+ S +SPS S + P S + +
Sbjct: 1634 PSYSPTSPAYSPTSPAYSPTSPAYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYS 1693
Query: 457 SRSP 468
SP
Sbjct: 1694 PTSP 1697
Score = 48 bits (112), Expect = 8e-006
Identities = 46/125 (36%), Positives = 63/125 (50%), Gaps = 6/125 (4%)
Frame = +1
Query: 109 SLTTQDPSASHAGHTL*SPTSTTRSPRSNPGPPSIA-APPRGSPTVSASSETSKKPSPTY 285
S T+ S + G++ SPT + SP +P P+ + P SPT + S TS SPT
Sbjct: 1560 SPTSPGYSPTSPGYSPTSPTYSPSSPGYSPTSPAYSPTSPSYSPTSPSYSPTSPSYSPTS 1619
Query: 286 -STSLSRRSLSATAPSSSRTS---SKTSSASSTPTASSAPRSSPSASTSPPLRSLSGEKT 453
S S + S S T+PS S TS S TS A S PT+ + +SPS S + P S +
Sbjct: 1620 PSYSPTSPSYSPTSPSYSPTSPAYSPTSPAYS-PTSPAYSPTSPSYSPTSPSYSPTSPSY 1678
Query: 454 TSRSP 468
+ SP
Sbjct: 1679 SPTSP 1683
>TAIR9_protein||AT2G45000.1 | Symbols: EMB2766 | EMB2766 (EMBRYO DEFECTIVE
2766); structural constituent of nuclear pore | chr2:18564156-18567632
FORWARD
Length = 740
Score = 48 bits (112), Expect = 8e-006
Identities = 41/134 (30%), Positives = 61/134 (45%), Gaps = 1/134 (0%)
Frame = +1
Query: 16 SPPKTWLEFSGDHFLSRTNPPSVHHLHQSHASLTTQDPSASHAGHTL*SPTSTTRSPRSN 195
S P + S S +P V + S S T+ ++ + T S +T S ++
Sbjct: 288 STPSLFASSSSGATTSSPSPFGVSTFNSSSTSNTSNASASPFSASTGFSFLKSTASSTTS 347
Query: 196 PGPPSIAAPPRGSPTVSASSETSKKPSPTYSTSLSRRSLSATAPSSSRTSSKTSSASSTP 375
PS A P S + S S TS ST S S+T+ + ++ T+++SSTP
Sbjct: 348 STTPS-APPQTASSSSSFSFGTSANSGFNLSTGSSAAPASSTSGAVFSIATTTTTSSSTP 406
Query: 376 TASSAPRSSPSAST 417
A+SAP SS AST
Sbjct: 407 AATSAPASSAPAST 420
Database: TAIR9 protein
Posted date: Wed Jul 08 15:16:08 2009
Number of letters in database: 13,468,323
Number of sequences in database: 33,410
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 33,252,145,367
Number of Sequences: 33410
Number of Extensions: 33252145367
Number of Successful Extensions: 1129160757
Number of sequences better than 0.0: 0
|