BLASTX 7.6.2
Query= UN42809 /QuerySize=674
(673 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|340520432|gb|EGR50668.1| N-terminal WSC domain-containing pro... 68 9e-010
gi|2506877|sp|Q02817.2|MUC2_HUMAN RecName: Full=Mucin-2; Short=M... 68 1e-009
gi|186396|gb|AAA59163.1| mucin [Homo sapiens] 68 1e-009
gi|285310533|emb|CBJ25057.1| C. elegans protein C30H6.1c, partia... 67 1e-009
gi|285310534|emb|CBJ25058.1| C. elegans protein C30H6.1d, partia... 67 1e-009
gi|341889279|gb|EGT45214.1| hypothetical protein CAEBREN_03876 [... 67 1e-009
gi|308503659|ref|XP_003114013.1| hypothetical protein CRE_27453 ... 67 3e-009
gi|301613160|ref|XP_002936083.1| PREDICTED: mucin-5AC-like [Xeno... 66 4e-009
gi|188864|gb|AAA59875.1| mucin [Homo sapiens] 65 6e-009
gi|116284392|ref|NP_002448.2| mucin-2 precursor [Homo sapiens] 65 6e-009
gi|326429118|gb|EGD74688.1| hypothetical protein PTSG_12387 [Sal... 65 6e-009
gi|308482738|ref|XP_003103572.1| CRE-CLEC-202 protein [Caenorhab... 65 7e-009
gi|326431726|gb|EGD77296.1| Notch2 [Salpingoeca sp. ATCC 50818] 65 7e-009
gi|312376432|gb|EFR23515.1| hypothetical protein AND_12733 [Anop... 65 1e-008
gi|326427356|gb|EGD72926.1| hypothetical protein PTSG_12198 [Sal... 64 1e-008
gi|326433338|gb|EGD78908.1| hypothetical protein PTSG_01883 [Sal... 63 3e-008
gi|195379124|ref|XP_002048331.1| GJ11408 [Drosophila virilis] 63 4e-008
gi|242333231|emb|CAZ65473.1| C. elegans protein C30H6.1a, partia... 63 4e-008
gi|313219696|emb|CBY30616.1| unnamed protein product [Oikopleura... 63 4e-008
>gi|340520432|gb|EGR50668.1| N-terminal WSC domain-containing protein
[Trichoderma reesei QM6a]
Length = 947
Score = 68 bits (165), Expect = 9e-010
Identities = 40/120 (33%), Positives = 60/120 (50%)
Frame = +1
Query: 271 TTPS*APPTNAPTTAETSTSQLPPPTKTLSNITTATRLLPTTTTTTTLRAGTPGSLAPTV 450
TT + +P T TT ST+ PT T ++ TT T TTTTTTT + T + T
Sbjct: 453 TTTTTSPTTTTTTTTSLSTTTTTSPTTTTTSTTTTTTSPTTTTTTTTTTSPTTTTTTTTT 512
Query: 451 SDPSAQNQQHPQTSISTAPTTTPGNSTTRTSTTGISTRASTPRSPRAMASISRTTRGCSS 630
+ P+ P T+ +T+PTTT +TT + TT +T +T + + + TT +S
Sbjct: 513 TSPTTTTTTSPTTTTTTSPTTTTTTTTTTSPTTTTTTSPTTVTTTTTASPSTVTTTTTAS 572
>gi|2506877|sp|Q02817.2|MUC2_HUMAN RecName: Full=Mucin-2; Short=MUC-2; AltName:
Full=Intestinal mucin-2; Flags: Precursor
Length = 5179
Score = 68 bits (164), Expect = 1e-009
Identities = 44/117 (37%), Positives = 53/117 (45%), Gaps = 2/117 (1%)
Frame = +1
Query: 265 P*TTPS*APPTNAPTTAETSTSQLPPPTKTLSNITTATRLLPTTTTTTTLRAGTPGSLAP 444
P TTPS PP TT +T+ PP T T + T T P TTTTT L TP
Sbjct: 1402 PTTTPS--PPPTTTTTLPPTTTPSPPTTTTTTPPPTTTPSPPITTTTTPLPTTTPSPPIS 1459
Query: 445 TVSDPSAQNQQHPQTSISTAPTTTPGNSTTRTSTTGISTRASTPRSPRAMASISRTT 615
T + P P T+ + PTTTP TT T+T +T S P + S TT
Sbjct: 1460 TTTTPPPTTTPSPPTTTPSPPTTTPSPPTTTTTTPPPTTTPSPPMTTPITPPASTTT 1516
>gi|186396|gb|AAA59163.1| mucin [Homo sapiens]
Length = 1270
Score = 68 bits (164), Expect = 1e-009
Identities = 44/117 (37%), Positives = 53/117 (45%), Gaps = 2/117 (1%)
Frame = +1
Query: 265 P*TTPS*APPTNAPTTAETSTSQLPPPTKTLSNITTATRLLPTTTTTTTLRAGTPGSLAP 444
P TTPS PP TT +T+ PP T T + T T P TTTTT L TP
Sbjct: 777 PTTTPS--PPPTTTTTLPPTTTPSPPTTTTTTPPPTTTPSPPITTTTTPLPTTTPSPPIS 834
Query: 445 TVSDPSAQNQQHPQTSISTAPTTTPGNSTTRTSTTGISTRASTPRSPRAMASISRTT 615
T + P P T+ + PTTTP TT T+T +T S P + S TT
Sbjct: 835 TTTTPPPTTTPSPPTTTPSPPTTTPSPPTTTTTTPPPTTTPSPPMTTPITPPASTTT 891
>gi|285310533|emb|CBJ25057.1| C. elegans protein C30H6.1c, partially confirmed
by transcript evidence [Caenorhabditis elegans]
Length = 667
Score = 67 bits (163), Expect = 1e-009
Identities = 39/111 (35%), Positives = 53/111 (47%), Gaps = 4/111 (3%)
Frame = +1
Query: 295 TNAPTTAETSTSQLPP----PTKTLSNITTATRLLPTTTTTTTLRAGTPGSLAPTVSDPS 462
T + TT T+T+ P PT T + TT T TTTTT T TP + T + P+
Sbjct: 259 TTSTTTTSTTTTTTPTTTTMPTTTTTTPTTTTTTPTTTTTTPTTTTTTPTTTTETTTTPT 318
Query: 463 AQNQQHPQTSISTAPTTTPGNSTTRTSTTGISTRASTPRSPRAMASISRTT 615
+ P T+ T TTTP +TT T+TT +T T +P + TT
Sbjct: 319 TETTTTPTTTTPTTTTTTPTTTTTETTTTPTTTTTETTTTPTTTTTTPTTT 369
>gi|285310534|emb|CBJ25058.1| C. elegans protein C30H6.1d, partially confirmed
by transcript evidence [Caenorhabditis elegans]
Length = 653
Score = 67 bits (163), Expect = 1e-009
Identities = 39/111 (35%), Positives = 53/111 (47%), Gaps = 4/111 (3%)
Frame = +1
Query: 295 TNAPTTAETSTSQLPP----PTKTLSNITTATRLLPTTTTTTTLRAGTPGSLAPTVSDPS 462
T + TT T+T+ P PT T + TT T TTTTT T TP + T + P+
Sbjct: 259 TTSTTTTSTTTTTTPTTTTMPTTTTTTPTTTTTTPTTTTTTPTTTTTTPTTTTETTTTPT 318
Query: 463 AQNQQHPQTSISTAPTTTPGNSTTRTSTTGISTRASTPRSPRAMASISRTT 615
+ P T+ T TTTP +TT T+TT +T T +P + TT
Sbjct: 319 TETTTTPTTTTPTTTTTTPTTTTTETTTTPTTTTTETTTTPTTTTTTPTTT 369
>gi|341889279|gb|EGT45214.1| hypothetical protein CAEBREN_03876 [Caenorhabditis
brenneri]
Length = 372
Score = 67 bits (163), Expect = 1e-009
Identities = 41/120 (34%), Positives = 57/120 (47%)
Frame = +1
Query: 256 TIFP*TTPS*APPTNAPTTAETSTSQLPPPTKTLSNITTATRLLPTTTTTTTLRAGTPGS 435
T P TT S + T T + ++T+ T T + TT T PTTTT+TT T
Sbjct: 41 TSSPITTTSTSSSTTTTTASTSTTTTESTSTTTTTEPTTTTITTPTTTTSTTSTTTTTTP 100
Query: 436 LAPTVSDPSAQNQQHPQTSISTAPTTTPGNSTTRTSTTGISTRASTPRSPRAMASISRTT 615
T + P+ P T+ +T PTTT STT T+TT +T +TP + + TT
Sbjct: 101 TTTTTTTPTTTTTTTPTTTTTTTPTTTTTTSTTTTTTTPTTTTTTTPTTTTTTTPTTTTT 160
>gi|308503659|ref|XP_003114013.1| hypothetical protein CRE_27453 [Caenorhabditis
remanei]
Length = 325
Score = 67 bits (161), Expect = 3e-009
Identities = 45/124 (36%), Positives = 60/124 (48%), Gaps = 12/124 (9%)
Frame = +1
Query: 271 TTPS*APPTNAPTTAETSTSQLPPPTKTLSNITT-------ATRLLPTTTTTTTLRAGTP 429
TTP+ T PTT+ T+T+ P T T + TT T PTTTTTTT TP
Sbjct: 70 TTPT-TTTTTTPTTSTTTTTTTPTTTTTTTTTTTPTTTTTPTTTTTPTTTTTTTTTTTTP 128
Query: 430 GSLAPTVSDPSAQNQQHPQTSISTAP--TTTPGNSTTRTSTTGISTRASTPRSPRAMASI 603
T + P+ P T+ +T P TTTP +TT T+TT + +T SP +
Sbjct: 129 --TTTTTTTPTTTTTTTPTTTTTTTPTTTTTPTTTTTPTTTTTTTPTTTTTTSPTTTTTT 186
Query: 604 SRTT 615
+ TT
Sbjct: 187 TTTT 190
>gi|301613160|ref|XP_002936083.1| PREDICTED: mucin-5AC-like [Xenopus (Silurana)
tropicalis]
Length = 3816
Score = 66 bits (159), Expect = 4e-009
Identities = 42/128 (32%), Positives = 62/128 (48%), Gaps = 10/128 (7%)
Frame = +1
Query: 253 RTIFP*TTPS*APPTNAPTTAETSTSQLPPPTKTLSNITTATRLLPTTTTTTTLRAGTP- 429
+T P TT + P T TT T+T+ P T+T + TTAT + TTTTTT TP
Sbjct: 1588 QTTTPTTTETTTPTTTTETTTPTTTTTTPTTTETTTQTTTATPITTITTTTTTTETTTPT 1647
Query: 430 -------GSLAPTVSDPSAQNQQHPQTSISTAPTTTPGNSTTRTSTTGISTRASTPRSPR 588
+ PT ++ + T+ +T PTTT +TT T+TT +T+ +TP +
Sbjct: 1648 TTTETTTQTTTPTTTETTTPTTTTETTTQTTTPTTT--ETTTPTTTTETTTQTTTPTTTE 1705
Query: 589 AMASISRT 612
+ T
Sbjct: 1706 TTTPTTTT 1713
>gi|188864|gb|AAA59875.1| mucin [Homo sapiens]
Length = 573
Score = 65 bits (158), Expect = 6e-009
Identities = 43/117 (36%), Positives = 53/117 (45%), Gaps = 2/117 (1%)
Frame = +1
Query: 265 P*TTPS*APPTNAPTTAETSTSQLPPPTKTLSNITTATRLLPTTTTTTTLRAGTPGSLAP 444
P TTPS PP + TT +T+ PP T T + T T P TTTTT TP
Sbjct: 60 PTTTPS--PPPTSTTTLPPTTTPSPPTTTTTTPPPTTTPSPPITTTTTPPPTTTPSPPIS 117
Query: 445 TVSDPSAQNQQHPQTSISTAPTTTPGNSTTRTSTTGISTRASTPRSPRAMASISRTT 615
T + P P T+ + PTTTP TT T+T +T S P + S TT
Sbjct: 118 TTTTPPPTTTPSPPTTTPSPPTTTPSPPTTTTTTPPPTTTPSPPTTTPITPPASTTT 174
>gi|116284392|ref|NP_002448.2| mucin-2 precursor [Homo sapiens]
Length = 5179
Score = 65 bits (158), Expect = 6e-009
Identities = 43/117 (36%), Positives = 53/117 (45%), Gaps = 2/117 (1%)
Frame = +1
Query: 265 P*TTPS*APPTNAPTTAETSTSQLPPPTKTLSNITTATRLLPTTTTTTTLRAGTPGSLAP 444
P TTPS PP + TT +T+ PP T T + T T P TTTTT TP
Sbjct: 1402 PTTTPS--PPPTSTTTLPPTTTPSPPTTTTTTPPPTTTPSPPITTTTTPPPTTTPSPPIS 1459
Query: 445 TVSDPSAQNQQHPQTSISTAPTTTPGNSTTRTSTTGISTRASTPRSPRAMASISRTT 615
T + P P T+ + PTTTP TT T+T +T S P + S TT
Sbjct: 1460 TTTTPPPTTTPSPPTTTPSPPTTTPSPPTTTTTTPPPTTTPSPPTTTPITPPASTTT 1516
>gi|326429118|gb|EGD74688.1| hypothetical protein PTSG_12387 [Salpingoeca sp.
ATCC 50818]
Length = 860
Score = 65 bits (158), Expect = 6e-009
Identities = 39/115 (33%), Positives = 55/115 (47%)
Frame = +1
Query: 271 TTPS*APPTNAPTTAETSTSQLPPPTKTLSNITTATRLLPTTTTTTTLRAGTPGSLAPTV 450
+T S PPT + TT T+T+ T T + TT T TTTTTTT T + T
Sbjct: 159 STASTTPPTTSSTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT 218
Query: 451 SDPSAQNQQHPQTSISTAPTTTPGNSTTRTSTTGISTRASTPRSPRAMASISRTT 615
+ + T+ +T TTT STT TSTT +T +T + ++ + TT
Sbjct: 219 TTTTTTTTTTTTTTTTTTTTTTTTTSTTTTSTTTTTTTTTTTTTTTTTSTTTTTT 273
>gi|308482738|ref|XP_003103572.1| CRE-CLEC-202 protein [Caenorhabditis remanei]
Length = 696
Score = 65 bits (157), Expect = 7e-009
Identities = 39/115 (33%), Positives = 53/115 (46%)
Frame = +1
Query: 271 TTPS*APPTNAPTTAETSTSQLPPPTKTLSNITTATRLLPTTTTTTTLRAGTPGSLAPTV 450
TT + T PTT TST+ P T T TT T TT TTTT T + PT
Sbjct: 387 TTTTPTTTTTTPTTTTTSTTTTPTTTTTTPTTTTTTPTTTTTETTTTTTTPTTTTTTPTT 446
Query: 451 SDPSAQNQQHPQTSISTAPTTTPGNSTTRTSTTGISTRASTPRSPRAMASISRTT 615
+ + T+ +T PTTT + TT T+T +T +T +P + + TT
Sbjct: 447 TTETTTATTTTPTTTTTTPTTTTTSPTTTTTTPTTTTVTTTTTTPSTTPTTTTTT 501
>gi|326431726|gb|EGD77296.1| Notch2 [Salpingoeca sp. ATCC 50818]
Length = 5122
Score = 65 bits (157), Expect = 7e-009
Identities = 42/120 (35%), Positives = 58/120 (48%)
Frame = +1
Query: 271 TTPS*APPTNAPTTAETSTSQLPPPTKTLSNITTATRLLPTTTTTTTLRAGTPGSLAPTV 450
TT S T+ TT+ T+TS T T S TT+T TTTT+TT + T S T
Sbjct: 2688 TTTSTTTTTSTTTTSTTTTSTTTTSTTTTSTTTTSTTTTSTTTTSTTTTSTTTTSTTTTS 2747
Query: 451 SDPSAQNQQHPQTSISTAPTTTPGNSTTRTSTTGISTRASTPRSPRAMASISRTTRGCSS 630
+ ++ TS ST T+T STT T+TT +T ST + S + T+ +S
Sbjct: 2748 TTTTSTTLTTTTTSTSTTSTSTTSTSTTSTTTTSTTTTTSTTTTSTTTTSTTTTSTTTTS 2807
Score = 63 bits (152), Expect = 3e-008
Identities = 41/120 (34%), Positives = 57/120 (47%)
Frame = +1
Query: 271 TTPS*APPTNAPTTAETSTSQLPPPTKTLSNITTATRLLPTTTTTTTLRAGTPGSLAPTV 450
TT S + T+ T+TS T T S TT+T TTTT+TT + T S T
Sbjct: 2678 TTTSTTTTSTTTTSTTTTTSTTTTSTTTTSTTTTSTTTTSTTTTSTTTTSTTTTSTTTTS 2737
Query: 451 SDPSAQNQQHPQTSISTAPTTTPGNSTTRTSTTGISTRASTPRSPRAMASISRTTRGCSS 630
+ ++ T+ +T TTT STT TSTT ST ++T S S + T+ +S
Sbjct: 2738 TTTTSTTTTSTTTTSTTLTTTTTSTSTTSTSTTSTSTTSTTTTSTTTTTSTTTTSTTTTS 2797
>gi|312376432|gb|EFR23515.1| hypothetical protein AND_12733 [Anopheles
darlingi]
Length = 484
Score = 65 bits (156), Expect = 1e-008
Identities = 41/120 (34%), Positives = 55/120 (45%), Gaps = 2/120 (1%)
Frame = +1
Query: 256 TIFP*TTPS*APPTNAPTTAETSTSQLPPPTKTLSNITTATRLLPTTTTTTTLRAGTPGS 435
T P TT + PT TT T+T+ P T T TT T TTT TTT T +
Sbjct: 312 TTTPTTTTT--TPTTTTTTPSTTTTTTPTTTTTTPTTTTTTPTTTTTTPTTTTTTPTTTT 369
Query: 436 LAPTVSDPSAQNQQHPQTSISTAPTTTPGNSTTRTSTTGISTRASTPRSPRAMASISRTT 615
PT + + T+ +T PTTT + T T+TT +T +TP + + S TT
Sbjct: 370 TTPTTTTTTPTTTTTTPTTTTTIPTTTTTTTPTTTTTTPTTTTTTTPTTTTTTPTTSTTT 429
>gi|326427356|gb|EGD72926.1| hypothetical protein PTSG_12198 [Salpingoeca sp.
ATCC 50818]
Length = 997
Score = 64 bits (155), Expect = 1e-008
Identities = 38/115 (33%), Positives = 54/115 (46%)
Frame = +1
Query: 271 TTPS*APPTNAPTTAETSTSQLPPPTKTLSNITTATRLLPTTTTTTTLRAGTPGSLAPTV 450
TT S PPT A TT T+T+ T T + +T T +TTTTTT T + T
Sbjct: 422 TTTSTVPPTTATTTTTTTTTTTTTTTTTTTTTSTTTTTTTSTTTTTTTTTTTTSTTTTTT 481
Query: 451 SDPSAQNQQHPQTSISTAPTTTPGNSTTRTSTTGISTRASTPRSPRAMASISRTT 615
+ + T+ +T TTT +TT T+TT +T +T + + S TT
Sbjct: 482 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTSTTT 536
>gi|326433338|gb|EGD78908.1| hypothetical protein PTSG_01883 [Salpingoeca sp.
ATCC 50818]
Length = 585
Score = 63 bits (152), Expect = 3e-008
Identities = 38/115 (33%), Positives = 55/115 (47%)
Frame = +1
Query: 271 TTPS*APPTNAPTTAETSTSQLPPPTKTLSNITTATRLLPTTTTTTTLRAGTPGSLAPTV 450
TT + ++PTT T+T+ T T S TTATR TTTTTTT T + T
Sbjct: 260 TTTTTTTTASSPTTTTTTTASSTTTTATRSTTTTATRSTTTTTTTTTTTTTTTTTTTSTT 319
Query: 451 SDPSAQNQQHPQTSISTAPTTTPGNSTTRTSTTGISTRASTPRSPRAMASISRTT 615
+ + TS +T TTT STT ++TT +T ++ + + + TT
Sbjct: 320 TSTTTTTTTTSTTSTTTTTTTTTTTSTTTSTTTSTTTTTTSTTTTTTSTTTTTTT 374
>gi|195379124|ref|XP_002048331.1| GJ11408 [Drosophila virilis]
Length = 497
Score = 63 bits (151), Expect = 4e-008
Identities = 39/120 (32%), Positives = 56/120 (46%)
Frame = +1
Query: 271 TTPS*APPTNAPTTAETSTSQLPPPTKTLSNITTATRLLPTTTTTTTLRAGTPGSLAPTV 450
TT + PT TT ++T+ T T + TT T TTTTTTT + T + T
Sbjct: 173 TTTTTTKPTTTTTTTCSTTTTTTTTTTTTTTTTTTTTTTKTTTTTTTTCSTTTTTTTTTT 232
Query: 451 SDPSAQNQQHPQTSISTAPTTTPGNSTTRTSTTGISTRASTPRSPRAMASISRTTRGCSS 630
+ + T+ +T TTT +TT T+TT +T ST S + + TT CS+
Sbjct: 233 TTTTTTTTTTCSTTTTTTTTTTTTTTTTTTTTTTTTTTCSTTTSTTTTTTTTTTTTTCST 292
>gi|242333231|emb|CAZ65473.1| C. elegans protein C30H6.1a, partially confirmed
by transcript evidence [Caenorhabditis elegans]
Length = 657
Score = 63 bits (151), Expect = 4e-008
Identities = 39/115 (33%), Positives = 54/115 (46%), Gaps = 1/115 (0%)
Frame = +1
Query: 271 TTPS*APPTNAPTTAETSTSQLPPPTKTLSNITTATRLLPTTTTTTTLRAGTPGSLAPTV 450
TTP+ T TT T+T+ P T T TT T TTT TTT A T + PT
Sbjct: 354 TTPT-TTTTETTTTTPTTTTTTPTTTTTTPTTTTTTPTTTTTTPTTTTTAPTTTTTTPTT 412
Query: 451 SDPSAQNQQHPQTSISTAPTTTPGNSTTRTSTTGISTRASTPRSPRAMASISRTT 615
+ + T+ +T PTTT +TT T+TT T +TP + + + + T
Sbjct: 413 TTTTPTTTTTVPTTTTTTPTTTTTTTTTPTTTTTTPTTTTTPATTTSETTTTTPT 467
>gi|313219696|emb|CBY30616.1| unnamed protein product [Oikopleura dioica]
Length = 1016
Score = 63 bits (151), Expect = 4e-008
Identities = 43/133 (32%), Positives = 57/133 (42%)
Frame = +1
Query: 217 GSSAIN*ESSRRRTIFP*TTPS*APPTNAPTTAETSTSQLPPPTKTLSNITTATRLLPTT 396
GS+ +S T TT S T TT T+T+ T T + TT T TT
Sbjct: 615 GSTTTTSTTSTTTTTTTTTTTSTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT 674
Query: 397 TTTTTLRAGTPGSLAPTVSDPSAQNQQHPQTSISTAPTTTPGNSTTRTSTTGISTRASTP 576
TTTTT T + T S S T+ ST TTT +TT T+TT +T +T
Sbjct: 675 TTTTTTTTTTSTTTTTTTSTTSTTTTTTTTTTTSTTTTTTTTTTTTTTTTTTTTTTTTTT 734
Query: 577 RSPRAMASISRTT 615
+ + + TT
Sbjct: 735 TTTTTTTTTTSTT 747
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,749,908,712,651
Number of Sequences: 15229318
Number of Extensions: 4749908712651
Number of Successful Extensions: 1126793398
Number of sequences better than 0.0: 0
|