BLASTX 7.6.2
Query= UN20626 /QuerySize=1944
(1943 letters)
Database: TAIR9 protein;
33,410 sequences; 13,468,323 total letters
Score E
Sequences producing significant alignments: (bits) Value
TAIR9_protein||AT3G60380.1 | Symbols: | FUNCTIONS IN: molecular... 266 4e-071
TAIR9_protein||AT4G16790.1 | Symbols: | hydroxyproline-rich gly... 74 3e-013
>TAIR9_protein||AT3G60380.1 | Symbols: | FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 24 plant structures;
EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein
match is: hydroxyproline-rich glycoprotein family protein
(TAIR:AT4G16790.1); Has 5716 Blast hits to 3635 proteins in 329
species: Archae - 2; Bacteria - 249; Metazoa - 2628; Fungi - 569;
Plants - 209; Viruses - 24; Other Eukaryotes - 2035 (source: NCBI
BLink). | chr3:22316913-22319144 REVERSE
Length = 744
Score = 266 bits (679), Expect = 4e-071
Identities = 151/219 (68%), Positives = 167/219 (76%), Gaps = 17/219 (7%)
Frame = +1
Query: 16 PAAATPPSVSVSHTGGGFFSNKSVLLGMFLLALPLFPSQAPDFVGETVLTKLWELIHLLF 195
P PP GGG F KSVL +FLLALPLFPSQAPDFVGETVLTK WELIHLLF
Sbjct: 14 PNVVVPPQPRYKSIGGGGFFCKSVLFALFLLALPLFPSQAPDFVGETVLTKFWELIHLLF 73
Query: 196 VGIAVAYGLFSRRNVESNVELRRSGVDDESSLSYVSRILQVSSVFDEEEELENPCSFVD- 372
VGIAVAYGLFSRRNVES V+LR + V DESSLSYVSRI QVSSVFDEE + +N C FVD
Sbjct: 74 VGIAVAYGLFSRRNVESAVDLRMTRV-DESSLSYVSRIFQVSSVFDEEFD-DNSCEFVDV 131
Query: 373 -----VSARASAVVKEMNKSDSFAVD----ESLSEYGEINQVQAWNSQYFQGRSKVVVAR 525
VSARAS V KS+SF V+ E SE+GE N+V+AWNSQYFQG+SKVVVAR
Sbjct: 132 RSDESVSARASVV----GKSESFVVESGELEESSEFGETNEVRAWNSQYFQGKSKVVVAR 187
Query: 526 PAYGLDGHHVVHQSLGLPVRSLRSGLRDDAAVEDKEKAE 642
PAYGLDG HVVHQ LGLP+R LRS LRD+AA++DK A+
Sbjct: 188 PAYGLDG-HVVHQPLGLPIRRLRSSLRDNAALQDKSFAD 225
Score = 242 bits (616), Expect = 8e-064
Identities = 164/310 (52%), Positives = 191/310 (61%), Gaps = 63/310 (20%)
Frame = +1
Query: 1057 TRRSYPPD-------GSDHSTTSRRRYLQQKSDSHLFE----KGLESDDHNKMSVKKVRS 1203
+RRSYPP+ G+D STT RR+ LQQKS+ HL E KG+E+ DHN + VKK RS
Sbjct: 439 SRRSYPPESISSPVGGADDSTT-RRQDLQQKSNCHLLEENIRKGVEA-DHNNLRVKKGRS 496
Query: 1204 HESLE----------------------FQP------RRAMRSSRGGSDTLVVKDNKKNSA 1299
H+SLE FQP RRAMRSSRGG DTL KD
Sbjct: 497 HDSLELTAEDSAKDEKVSESFPALDVVFQPTNAKASRRAMRSSRGGRDTLPEKDVVTRKL 556
Query: 1300 DD---DADDSEDYDLPGEENKEVISN---SLQSWRGSSKVSSRGKSVRTIRSDRN----- 1446
+D D+D S DLPG++ + + +SWR SS VS RGKSVRTIRSDR+
Sbjct: 557 EDDTVDSDKSRKKDLPGKDKEMKLDGPRIEPRSWRASSNVSLRGKSVRTIRSDRHGKDVK 616
Query: 1447 -HGDSS---ADDRGESHGRTKPRRQWQQELAIVLHQEKQ-----HSEPEDDAEETETEAE 1599
GDSS A+ + ES GRTK RR Q+EL+IVLHQEK SEPE+ A E +
Sbjct: 617 TDGDSSEDRAEAKVESRGRTKSRRPRQEELSIVLHQEKSSETRAKSEPEEVAMEEPQAEQ 676
Query: 1600 QPQETLEEEEEVAAWESQSNVSNEHYEVDRKADEFIAKFREQIRLQKLHSGEQQRRGGGT 1779
QP+ T EEEEE AAWESQSN S++H EVDRKA EFIAKFREQIRLQKL SGEQ RGGGT
Sbjct: 677 QPEVTFEEEEE-AAWESQSNASHDHNEVDRKAGEFIAKFREQIRLQKLISGEQP-RGGGT 734
Query: 1780 GVIRNSHFR* 1809
G+ RNS FR*
Sbjct: 735 GIFRNSQFR* 744
Score = 137 bits (343), Expect = 4e-032
Identities = 89/162 (54%), Positives = 103/162 (63%), Gaps = 33/162 (20%)
Frame = +1
Query: 643 EESLAAPSS--PWHSSPEMMGMGDD------ETQFTTRRSSVSSSSSSSSSSQTSFESND 798
+E LAAP+S PW + PEMMG+GD+ S+SS S+ SSSSQTS+ S +
Sbjct: 245 DEVLAAPASPVPWQARPEMMGIGDNYPSNFQPISVDETLKSISSRSTGSSSSQTSYASQN 304
Query: 799 QKNRYSPSRSVSEESLNYNVE----EKSLQGSSRSSSPSLQPSPSLSPP----------- 933
Q NR+SPSRSVS ESLN NVE EKS Q SSRSSSPSL PSPSLSP
Sbjct: 305 Q-NRFSPSRSVSAESLNSNVEELVKEKSRQSSSRSSSPSLPPSPSLSPSPPSPELVPNDT 363
Query: 934 ---SPELVTDDSHRRVLHSRHYSDGSLL------GCEDKLDG 1032
SPELVTDD+ RR HSRHYSDGSLL G E++L+G
Sbjct: 364 RRRSPELVTDDTPRRASHSRHYSDGSLLEEDVRRGFENELEG 405
>TAIR9_protein||AT4G16790.1 | Symbols: | hydroxyproline-rich glycoprotein
family protein | chr4:9451747-9453168 REVERSE
Length = 474
Score = 74 bits (180), Expect = 3e-013
Identities = 43/102 (42%), Positives = 65/102 (63%), Gaps = 10/102 (9%)
Frame = +1
Query: 70 FSNKSVLLGMFLLALPLFPSQAPDFVGETVLTKLWELIHLLFVGIAVAYGLFSRRNVE-- 243
F K+++L + +P+F SQ P+ + T+L EL+HL+FVGIAV+YGLFSRRN +
Sbjct: 28 FIFKALILTVLCAVVPVFLSQTPELANQ---TRLLELLHLVFVGIAVSYGLFSRRNYDGG 84
Query: 244 -----SNVELRRSGVDDESSLSYVSRILQVSSVFDEEEELEN 354
SN + ++ + +S SYV +IL+VSSVF+ E E+
Sbjct: 85 GGGGTSNSDHNKADHSNNNSHSYVPKILEVSSVFNVGHESES 126
Database: TAIR9 protein
Posted date: Wed Jul 08 15:16:08 2009
Number of letters in database: 13,468,323
Number of sequences in database: 33,410
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,768,941,887
Number of Sequences: 33410
Number of Extensions: 10768941887
Number of Successful Extensions: 359282909
Number of sequences better than 0.0: 0
|