BLASTX 7.6.2
Query= UN57169 /QuerySize=708
(707 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
Score E
Sequences producing significant alignments: (bits) Value
gi|15226106|ref|NP_180899.1| RNA recognition motif-containing pr... 376 2e-102
gi|297823133|ref|XP_002879449.1| hypothetical protein ARALYDRAFT... 375 4e-102
gi|222424729|dbj|BAH20318.1| AT2G33410 [Arabidopsis thaliana] 206 3e-051
gi|125574985|gb|EAZ16269.1| hypothetical protein OsJ_31729 [Oryz... 125 9e-027
gi|31432402|gb|AAP54039.1| transposon protein, putative, CACTA, ... 124 1e-026
gi|115487986|ref|NP_001066480.1| Os12g0242100 [Oryza sativa Japo... 124 1e-026
gi|242034309|ref|XP_002464549.1| hypothetical protein SORBIDRAFT... 123 3e-026
gi|125536228|gb|EAY82716.1| hypothetical protein OsI_37929 [Oryz... 122 4e-026
gi|195117472|ref|XP_002003271.1| GI17825 [Drosophila mojavensis] 121 1e-025
gi|255575788|ref|XP_002528793.1| Glycine-rich cell wall structur... 114 2e-023
gi|195029221|ref|XP_001987473.1| GH21940 [Drosophila grimshawi] 112 4e-023
gi|17821|emb|CAA78762.1| glycine-rich_protein_(aa1-291) [Brassic... 112 8e-023
gi|45935055|gb|AAS79562.1| At5g46730 [Arabidopsis thaliana] 111 2e-022
gi|294996438|ref|ZP_06802129.1| hypothetical protein Mtub2_18524... 109 4e-022
gi|255583740|ref|XP_002532623.1| Glycine-rich cell wall structur... 106 3e-021
gi|2961347|emb|CAA18105.1| glycine-rich protein [Arabidopsis tha... 106 4e-021
gi|170733890|ref|YP_001765837.1| hypothetical protein Bcenmc03_2... 104 1e-020
gi|71022041|ref|XP_761251.1| hypothetical protein UM05104.1 [Ust... 101 1e-019
gi|225427197|ref|XP_002278025.1| PREDICTED: hypothetical protein... 97 1e-018
>gi|15226106|ref|NP_180899.1| RNA recognition motif-containing protein
[Arabidopsis thaliana]
Length = 404
Score = 376 bits (964), Expect = 2e-102
Identities = 191/242 (78%), Positives = 197/242 (81%), Gaps = 29/242 (11%)
Frame = +2
Query: 2 KTFHDLNGKQVEVKRALPKDANPGVAGGGGGGRGGGGGFPGYG-------EGRVDSSRYM 160
KTFHDLNGKQVEVKRALPKDANPG+A GGG G GG GGFPGYG EGRVDS+RYM
Sbjct: 170 KTFHDLNGKQVEVKRALPKDANPGIASGGGRGSGGAGGFPGYGGSGGSGYEGRVDSNRYM 229
Query: 161 PPQNAGGSGYPP-----YGAGYGYGSNGVGYGGFGGYGNPSGAPYGNPGVTVPGAGFGSG 325
PQN GSGYPP YG GYGYGSNGVGYGGFGGYGNP+GAPYGNP +VPGAGFGSG
Sbjct: 230 QPQNT-GSGYPPYGGSGYGTGYGYGSNGVGYGGFGGYGNPAGAPYGNP--SVPGAGFGSG 286
Query: 326 PRSSWGGQAPSGYGNVGYGNAAAPSAPWGGS-GPGSAVMGQGGASAGYGGQGYGYGGNDS 502
PRSSWG QAPSGYGNVGYGNA APWGGS GPGSAVMGQ GASAGYG QGYGYGGNDS
Sbjct: 287 PRSSWGAQAPSGYGNVGYGNA----APWGGSGGPGSAVMGQAGASAGYGSQGYGYGGNDS 342
Query: 503 SYGTPSGYGAVGGRPNSL-----GGGYADGLD---GYGNHQG-NGQAGYGGGYGSGNQAL 655
SYGTPS YGAVGGR ++ GGGYAD LD GYGNHQG NGQAGYGGGYGSG QA
Sbjct: 343 SYGTPSAYGAVGGRSGNMPNNHGGGGYADALDGSGGYGNHQGNNGQAGYGGGYGSGRQAQ 402
Query: 656 QQ 661
QQ
Sbjct: 403 QQ 404
>gi|297823133|ref|XP_002879449.1| hypothetical protein ARALYDRAFT_482284
[Arabidopsis lyrata subsp. lyrata]
Length = 403
Score = 375 bits (962), Expect = 4e-102
Identities = 193/241 (80%), Positives = 195/241 (80%), Gaps = 28/241 (11%)
Frame = +2
Query: 2 KTFHDLNGKQVEVKRALPKDANPGVAGGGGGGRGGGGGFPGYG-------EGRVDSSRYM 160
KTFHDLNGKQVEVKRALPKDANPGVA GGG G GG GGFP YG EGRVDS+RYM
Sbjct: 170 KTFHDLNGKQVEVKRALPKDANPGVASGGGRGSGGAGGFPVYGGSGGSGYEGRVDSNRYM 229
Query: 161 PPQNAGGSGYPPYGA-----GYGYGSNGVGYGGFGGYGNPSGAPYGNPGVTVPGAGFGSG 325
PQN GSGYPPYGA GYGYGSNGVGYGGFGGYGNP+GAPYGNPG VPGAGFGSG
Sbjct: 230 QPQNT-GSGYPPYGASGYGTGYGYGSNGVGYGGFGGYGNPAGAPYGNPG--VPGAGFGSG 286
Query: 326 PRSSWGGQAPSGYGNVGYGNAAAPSAPWGGSGPGSAVMGQGGASAGYGGQGYGYGGNDSS 505
PRSSWG QAPSGYGNVGYGNA APWGGS PGSAVMGQ GAS GYG QGYGYGGNDSS
Sbjct: 287 PRSSWGAQAPSGYGNVGYGNA----APWGGSAPGSAVMGQAGASGGYGSQGYGYGGNDSS 342
Query: 506 YGTPSGYGAVGGR----PNSL-GGGYADGLD---GYGNHQG-NGQAGYGGGYGSGNQALQ 658
YGTPS YGAVGGR PNS GGGYAD D GYGNHQG NGQAGYGGGYGSG QA Q
Sbjct: 343 YGTPSAYGAVGGRSGNMPNSHGGGGYADASDVSGGYGNHQGNNGQAGYGGGYGSGRQAQQ 402
Query: 659 Q 661
Q
Sbjct: 403 Q 403
>gi|222424729|dbj|BAH20318.1| AT2G33410 [Arabidopsis thaliana]
Length = 148
Score = 206 bits (523), Expect = 3e-051
Identities = 105/140 (75%), Positives = 110/140 (78%), Gaps = 14/140 (10%)
Frame = +2
Query: 245 FGGYGNPSGAPYGNPGVTVPGAGFGSGPRSSWGGQAPSGYGNVGYGNAAAPSAPWGGS-G 421
FGGYGNP+GAPYGNP +VPGAGFGSGPRSSWG QAP GYGNVGYGNA APWGGS G
Sbjct: 6 FGGYGNPAGAPYGNP--SVPGAGFGSGPRSSWGAQAPPGYGNVGYGNA----APWGGSGG 59
Query: 422 PGSAVMGQGGASAGYGGQGYGYGGNDSSYGTPSGYGAVGGRPNSL-----GGGYADGLDG 586
PGSAVMGQ GASAGYG QGYGYGGNDSSYGTPS YGAVGGR ++ GGGYAD LDG
Sbjct: 60 PGSAVMGQAGASAGYGSQGYGYGGNDSSYGTPSAYGAVGGRSGNMPNNHGGGGYADALDG 119
Query: 587 YGNHQGNGQAGYG-GGYGSG 643
G + GN Q G GYG G
Sbjct: 120 SGGY-GNHQGNNGQAGYGGG 138
>gi|125574985|gb|EAZ16269.1| hypothetical protein OsJ_31729 [Oryza sativa
Japonica Group]
Length = 222
Score = 125 bits (312), Expect = 9e-027
Identities = 86/191 (45%), Positives = 95/191 (49%), Gaps = 18/191 (9%)
Frame = +2
Query: 71 GVAGGGGGGRGGGGGFPGYGEGRVDSSRYMPPQNAGGSGYPPYGAGYGYGSNGVGYGGFG 250
G GGGGGG GGGGG+ G G G S Y AG GY G G G G G G G
Sbjct: 36 GGGGGGGGGSGGGGGYGGSGYG--SGSGYGEGGGAGAGGYGHGGGGGGGGGEGGGSGSGY 93
Query: 251 GYGNPSGAPYGNPGVTVPGAGFGSGPRSSWGGQAPSGYGN-VGYGNAAAPSAPWGGSGPG 427
G G SG+ YG+ G G G G GG A SGYG+ GYG+ GSG G
Sbjct: 94 GSGQGSGSGYGSGAFGAGGYGSGGG---GGGGGAGSGYGSGEGYGSGY-------GSGAG 143
Query: 428 SAVMGQGG-ASAGYGGQGYGYGGNDSSYGTPSGYGAVGGRPNSLGGGYADGLDGYGNHQG 604
A G GG G GGQG GY G+ S YG+ SGYG GG + GGGY G G G
Sbjct: 144 GASGGGGGHGGGGGGGQGGGY-GSGSGYGSGSGYGQGGG---AYGGGYGSGGGGGGGGGQ 199
Query: 605 NGQAGYGGGYG 637
G +GYG G G
Sbjct: 200 GGGSGYGSGSG 210
>gi|31432402|gb|AAP54039.1| transposon protein, putative, CACTA, En/Spm
sub-class, expressed [Oryza sativa Japonica Group]
Length = 222
Score = 124 bits (311), Expect = 1e-026
Identities = 86/191 (45%), Positives = 95/191 (49%), Gaps = 18/191 (9%)
Frame = +2
Query: 71 GVAGGGGGGRGGGGGFPGYGEGRVDSSRYMPPQNAGGSGYPPYGAGYGYGSNGVGYGGFG 250
G GGGGGG GGGGG+ G G G S Y AG GY G G G G G G G
Sbjct: 36 GGGGGGGGGSGGGGGYGGSGYG--SGSGYGEGGGAGAGGYGHGGGGGGGGGEGGGSGSGY 93
Query: 251 GYGNPSGAPYGNPGVTVPGAGFGSGPRSSWGGQAPSGYGN-VGYGNAAAPSAPWGGSGPG 427
G G SG+ YG+ G G G G GG A SGYG+ GYG+ GSG G
Sbjct: 94 GSGQGSGSGYGSGAFGAGGYGSGGG---GGGGGAGSGYGSGEGYGSGY-------GSGAG 143
Query: 428 SAVMGQGG-ASAGYGGQGYGYGGNDSSYGTPSGYGAVGGRPNSLGGGYADGLDGYGNHQG 604
A G GG G GGQG GY G+ S YG+ GYG GG + GGGYA G G G
Sbjct: 144 GASGGGGGHGGGGGGGQGGGY-GSGSGYGSGRGYGQGGG---AYGGGYASGGGGGGGGGQ 199
Query: 605 NGQAGYGGGYG 637
G +GYG G G
Sbjct: 200 GGGSGYGSGSG 210
>gi|115487986|ref|NP_001066480.1| Os12g0242100 [Oryza sativa Japonica Group]
Length = 255
Score = 124 bits (310), Expect = 1e-026
Identities = 93/200 (46%), Positives = 99/200 (49%), Gaps = 36/200 (18%)
Frame = +2
Query: 71 GVAGGGGGGRGGGGGFPGYGEGRVDSSRYMPPQNAGGSGYPPYGAGYGYGSNGVGYG--- 241
G GGGGGG GGGGG GYGEG Y A G GY G G G G G G G
Sbjct: 37 GEGGGGGGGEGGGGG-SGYGEG------YGQGGGASGGGYGQGGGGGGGGGQGGGSGSGY 89
Query: 242 --GFGGYGNPSGAPYGNPGVTVPGAGFGSGPRSSWGGQAPSGYGNVGYGNAAAPSAPWGG 415
G+G G SG YG G G G G G S +G SGYG+ GYG G
Sbjct: 90 GSGYGQGGGASGGGYGKGGGGGGGGGQGGGAGSGYG----SGYGS-GYGQGR------GA 138
Query: 416 SGPGSAVMGQGGASAGYGGQGYGYGGNDSSYGT--PSGYGAVGGRPNSLGGGYAD-GLDG 586
SG G GQGG G GGQG GGN S YG+ SGYG GG + GGGY G G
Sbjct: 139 SGGG---YGQGGGGGGGGGQG---GGNGSGYGSGYGSGYGQGGG---ASGGGYGQGGGGG 189
Query: 587 YGNHQGNGQ-AGYGGGYGSG 643
G QG G +GYG GYGSG
Sbjct: 190 GGGGQGGGNGSGYGSGYGSG 209
>gi|242034309|ref|XP_002464549.1| hypothetical protein SORBIDRAFT_01g020430
[Sorghum bicolor]
Length = 222
Score = 123 bits (308), Expect = 3e-026
Identities = 87/201 (43%), Positives = 99/201 (49%), Gaps = 18/201 (8%)
Frame = +2
Query: 71 GVAGGGGGGRGGGGGFPGY-GEGRVDSSRYMPPQNAGGSGYPPYGAGYGYGSNGVGYGGF 247
G GGGGGG GGGG GY G G S Y GGSG GAG GYG G G GG
Sbjct: 30 GPGGGGGGGGSGGGGGGGYGGSGYGSGSGY---GEGGGSG----GAGGGYGHGGGGGGGE 82
Query: 248 G------GYGNPSGAPYGNPGVTVPGAGFGSGPRSSWGGQAPSGYGNVGYGNAAAPSAPW 409
G GYG+ G+ YG G G G G G SGYG+ G G + +
Sbjct: 83 GGGAGGSGYGSGQGSGYGAGSGGAGGYGSGGGGGGGGGQGGGSGYGH-GGGEGYGSGSGY 141
Query: 410 GGSGPGSAVMGQGGASAGYGGQGYGYG-GNDSSYGTPSGYGAVGGRPNSLGGGYADGLDG 586
GG G G G GGQG GYG G+ S YG+ G GA GG S GGG G G
Sbjct: 142 GGGAASGGGGGGGHGGGGGGGQG-GYGSGSGSGYGSGEGGGAHGGGYGSGGGGGGGGGQG 200
Query: 587 YGNHQGNGQ-AGYGGGYGSGN 646
G+ G+G +GYGGGYG+G+
Sbjct: 201 GGSGYGSGSGSGYGGGYGNGH 221
>gi|125536228|gb|EAY82716.1| hypothetical protein OsI_37929 [Oryza sativa Indica
Group]
Length = 256
Score = 122 bits (306), Expect = 4e-026
Identities = 91/198 (45%), Positives = 98/198 (49%), Gaps = 31/198 (15%)
Frame = +2
Query: 71 GVAGGGGGGRGGGGGF---PGYGEGRVDSSRYMPPQNAGGSGYPPYGAGYGYGSNGVGYG 241
G GGGGGG GGGGG GYGEG Y G GY G G G G G G G
Sbjct: 37 GEGGGGGGGDGGGGGSGYGSGYGEG------YGQGGGTSGGGYGQGGGGGGGGGQGGGSG 90
Query: 242 -GFG-GYGNPSGAPYGNPGVTVPGAGFGSGPRSSWGGQAPSGYGNVGYGNAAAPSAPWGG 415
G+G GYG GA G G G G G GG+ SGYG+ GYG+ G
Sbjct: 91 SGYGSGYGQGGGASRGG-----YGKGGGGGGGGGQGGRGGSGYGS-GYGSGYGQGG--GA 142
Query: 416 SGPGSAVMGQGGASAGYGGQGYGYGGNDSSYGT--PSGYGAVGGRPNSLGGGYADGLDGY 589
SG G GQGG G G QG GGN S YG+ SGYG GG + GGGY G G
Sbjct: 143 SGGG---YGQGGGGGGGGAQG---GGNGSGYGSGYGSGYGQGGG---ASGGGYGQGGGGG 193
Query: 590 GNHQGNGQAGYGGGYGSG 643
G GNG +GYG GYGSG
Sbjct: 194 GQGGGNG-SGYGSGYGSG 210
>gi|195117472|ref|XP_002003271.1| GI17825 [Drosophila mojavensis]
Length = 552
Score = 121 bits (302), Expect = 1e-025
Identities = 80/197 (40%), Positives = 99/197 (50%), Gaps = 10/197 (5%)
Frame = +2
Query: 77 AGGGGGGRGGGGGFPGYGEGRVDSSRYMPPQNAG--GSGYPPYGAGYGYGSNGVGYGGFG 250
AGGG G GGGGG+ G G G SS +AG G G P +GAG G G G G+GG G
Sbjct: 88 AGGGPFGGGGGGGYGGAGAGGGASSSAGSSTSAGGHGGGGPGFGAGGGGGGLGGGHGGSG 147
Query: 251 GYGNPSGAPYGNPGVTVPGAGFGSGPRSSWGGQAPSGYGNVGYG-----NAAAPSAPWGG 415
G+G S G + G+GFGSG GG G+G+ GYG +A + GG
Sbjct: 148 GFGGGSAGGGHGGGGGLGGSGFGSGGHGGGGGLGGGGFGSGGYGGGGFAGGSAGGSAGGG 207
Query: 416 SGPGSAVMGQGGASAGYGGQGYGYGGNDSSYGTPSGYGAVGGRPNSLGGGYADGLDGYGN 595
G + G G S G+GG G G GG G G G +GG GG + G G G+
Sbjct: 208 HAGGGGLGGSGFGSGGHGGGG-GLGGGGFGSGGHGGSGGLGGGGFGSGGQGSGGFAG-GS 265
Query: 596 HQGNGQAGYGG-GYGSG 643
G+ AG+GG G+GSG
Sbjct: 266 ASGSAGAGHGGAGFGSG 282
Score = 108 bits (269), Expect = 8e-022
Identities = 75/199 (37%), Positives = 86/199 (43%), Gaps = 15/199 (7%)
Frame = +2
Query: 62 ANPGVAGGGGGGRGGGGGFPGYGEGRVDSSRYMPPQNAGGSGYPPYGAGYGYGSNGVGYG 241
A G GGG GGGG G G G GG G +G+G GS G+G G
Sbjct: 196 AGGSAGGSAGGGHAGGGGLGGSGFGSGGHG------GGGGLGGGGFGSGGHGGSGGLGGG 249
Query: 242 GFGGYGNPS-----GAPYGNPGVTVPGAGFGSGPRSSWGGQAPSGYGNVGYGNAAAPSAP 406
GFG G S G+ G+ G GAGFGSG + GG G +G G S
Sbjct: 250 GFGSGGQGSGGFAGGSASGSAGAGHGGAGFGSGGHGAGGGLG----GGLGGGLGGGVSLG 305
Query: 407 WGGSGPGSAVMGQGGASAGYGGQGYGYGGNDSSYGTPSGYGAVGGRPNSLGGGYADGLDG 586
GG GSA G GG + G GG G+G G S G +G GA GG GG G G
Sbjct: 306 GGGYAGGSAGSGIGGGAHGGGGPGFGGGIGGGSAGASAGSGASGGAIGGGNGGIGGGHGG 365
Query: 587 YGNHQGNGQAGYGGGYGSG 643
G G G+GG G G
Sbjct: 366 IGGGHGGFGGGHGGHGGGG 384
>gi|255575788|ref|XP_002528793.1| Glycine-rich cell wall structural protein 1.8
precursor, putative [Ricinus communis]
Length = 379
Score = 114 bits (284), Expect = 2e-023
Identities = 81/214 (37%), Positives = 93/214 (43%), Gaps = 28/214 (13%)
Frame = +2
Query: 71 GVAGGGGGGRGGGGGFP----------GYGEGRVDSSRYMPPQNAGGSGYPPYGAGYGY- 217
G+ GGGGGG GGGGG G G G + Y GG G G G
Sbjct: 83 GIGGGGGGGSGGGGGAAHAGGASGAGYGAGSGEGGGAGYGGAAGIGGGGGKGGGGGAASA 142
Query: 218 -GSNGVGYGGFGGYGNPSGAPYGNPGVTVPGAGFGSGPRSSWGGQAPSGYGNVGYGNAAA 394
G+ G GYG GG G GA YG G G+G G G G + G G GYG+
Sbjct: 143 GGAGGAGYGAGGGEG--GGAGYGGAGGIGGGSGGGGGKGGGGGAASAGGAGGAGYGSGGG 200
Query: 395 P--------SAPWGGSGPGSAVMGQGGASA--GYGGQGYGYGGNDSSYGTPSGYGAVGGR 544
+ GGSG G G GGA++ G GG GYG GG + G GG
Sbjct: 201 EGGGAGYGGAGIAGGSGGGGGKGGGGGAASAGGAGGAGYGSGGGEGGGAGGHTSGGGGGS 260
Query: 545 PNSLGGGYA-DGLDGYGNHQGNGQAGYGGGYGSG 643
G GYA G GYG +G AG GGGYG+G
Sbjct: 261 GGGGGAGYAGSGAGGYGGGEG---AGSGGGYGAG 291
>gi|195029221|ref|XP_001987473.1| GH21940 [Drosophila grimshawi]
Length = 697
Score = 112 bits (280), Expect = 4e-023
Identities = 82/204 (40%), Positives = 92/204 (45%), Gaps = 26/204 (12%)
Frame = +2
Query: 62 ANPGVAGGGGGGRG--GGGGFPGYGEGRVDSSRYMPPQNAGGSGYPPYGAGYGYGSNGVG 235
A G GGG GG G GG G GYG G + R AGG+G GYG G+ G G
Sbjct: 345 AGAGGYGGGAGGAGGAGGAGAGGYGGGAGGAGR---GGGAGGAG------GYGGGAGGAG 395
Query: 236 YGGF----GGYGNPSGAPYGNPGVTVPGAGFGSGPRSSWGGQAPSGYGNVGYGNAAAPSA 403
GG+ GG G GA G G GAG G G GG GYG G AA A
Sbjct: 396 AGGYGGGAGGAGGAGGAGAGGYGGGAGGAGRGGGA----GGAGAGGYGGGAGGAGAAGGA 451
Query: 404 PWGGSGPGSAVMGQGGASAGYGGQGYGYGGNDSSYGTPSGYGAVGGRPNSLG--GGYADG 577
GG G G+ G+GG + G G GYG G + G G+G GR G GGY G
Sbjct: 452 GAGGYGGGAGGAGRGGGAGGAGAGGYGGGAGGAGAG---GFGGGAGRGGGAGGAGGYGGG 508
Query: 578 LDGYGNHQGN--GQAGYGGGYGSG 643
G G G G G+GGG G G
Sbjct: 509 AGGPGGGGGGGAGAGGFGGGAGRG 532
>gi|17821|emb|CAA78762.1| glycine-rich_protein_(aa1-291) [Brassica napus]
Length = 291
Score = 112 bits (278), Expect = 8e-023
Identities = 77/198 (38%), Positives = 87/198 (43%), Gaps = 18/198 (9%)
Frame = +2
Query: 71 GVAGGGGGGRGGGGGFPG-YGEGRVDSSRYMPPQNAGGSGYPPYGAGYGYGS---NGVGY 238
GV GGGGG GGG G+ G G G + GG G P G+GYG GS G GY
Sbjct: 55 GVGVGGGGGEGGGAGYGGAEGIGGGGGGGHGGGAGGGGGGGPGGGSGYGGGSGEGGGAGY 114
Query: 239 GGFGGYGNPSGAPYGNPGVTVPGAGFGSGPRSSWGGQAPSGYGNVGYGNAAAPSAPWGGS 418
GG G G+ G G G G G G +GG +G G GYG A GG
Sbjct: 115 GGGGAGGHGGGGGSGGGG----GGGAGGAHGGGYGGGEGAGAGG-GYGGGGAGGHGGGGG 169
Query: 419 GPGSAVMGQGGASAGYGGQGYGYGGNDSSYGTPSGYGAVGGRPNSLGGGYADGLDGYGNH 598
G G GG G G G GYGG + + G GYG GG GGG G G G
Sbjct: 170 G------GNGGGGGGGGAHGGGYGGGEGA-GAGGGYG--GGGAGGHGGGGGGGKGGGGGG 220
Query: 599 QGNGQAGYGGGYGSGNQA 652
+GGGYG+G A
Sbjct: 221 GSGAGGAHGGGYGAGGGA 238
>gi|45935055|gb|AAS79562.1| At5g46730 [Arabidopsis thaliana]
Length = 270
Score = 111 bits (275), Expect = 2e-022
Identities = 83/201 (41%), Positives = 94/201 (46%), Gaps = 19/201 (9%)
Frame = +2
Query: 71 GVAGGGGGGRGGGGGFPGYGEGRVDSSRYMPPQ---NAGGSGYPPYGAGY----GYGSNG 229
GV+ GG GG GGG G GEG Y + + GGSG+ G G GY S G
Sbjct: 47 GVSSGGYGGESGGGYGGGSGEGA--GGGYGGAEGYASGGGSGHGGGGGGAASSGGYAS-G 103
Query: 230 VGYGGFGGYGNPSGAPYGNPGVTVPGAGFGSGPRSSWGGQAPSGYGN-VGYGNAAAPSAP 406
G GG GGYG +G G G G+G G G GG+ SGYGN G G A S
Sbjct: 104 AGEGGGGGYGGAAGGHAGGGG---GGSGGGGGSAYGAGGEHASGYGNGAGEGGGAGASGY 160
Query: 407 WGGS--GPGSAVMGQGGASAGYGGQGYGYGGNDSSYGTPSGYGAVGGRPNSLGGGYADGL 580
GG+ G G G GG SAG G GYGG + G SG G G + GGGY G
Sbjct: 161 GGGAYGGGGGHGGGGGGGSAGGAHGGSGYGGGE---GGGSGGGGAYGGGGAHGGGYGSGG 217
Query: 581 DGYGNHQGNGQAGYGGGYGSG 643
G + G GYGGG G G
Sbjct: 218 GEGGGYGGGAAGGYGGGGGGG 238
>gi|294996438|ref|ZP_06802129.1| hypothetical protein Mtub2_18524 [Mycobacterium
tuberculosis 210]
Length = 311
Score = 109 bits (272), Expect = 4e-022
Identities = 75/193 (38%), Positives = 80/193 (41%), Gaps = 14/193 (7%)
Frame = +2
Query: 71 GVAGGGGGGRGGGGGFPGYGEGRVDSSRYMPPQNAGGSGYPPYGAGYGYGSNGVGYGGFG 250
G GGGGGGRGGGGG G G G + GG G G G G G G G GG G
Sbjct: 57 GGGGGGGGGRGGGGGGGGGGGGGGGGGGKREEEGGGGGGEGGRGGGGGGGRGGGGEGGGG 116
Query: 251 GYGNPSGAPYGNPGVTVPGAGFGSGPRSSWGGQAPSGYGNVGYGNAAAPSAPWGGSGPGS 430
G+G G G G G G G G GG G G G G GG G G
Sbjct: 117 GWGGGGG---GGGGGGRGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 173
Query: 431 AVMGQGGASAGYGGQGYGYGGNDSSYGTPSGYGAV------GGRPNSLGGGYADGLDGYG 592
G GG G GG G+G GG G G+G V GG GGG+ G G G
Sbjct: 174 GGGGGGGGGGGGGGGGWGGGGGGGGGGGGWGWGVVLCWFLGGG-----GGGWGGGGGGGG 228
Query: 593 NHQGNGQAGYGGG 631
G G G GGG
Sbjct: 229 GGGGGGGGGGGGG 241
>gi|255583740|ref|XP_002532623.1| Glycine-rich cell wall structural protein 1
precursor, putative [Ricinus communis]
Length = 221
Score = 106 bits (264), Expect = 3e-021
Identities = 75/191 (39%), Positives = 88/191 (46%), Gaps = 17/191 (8%)
Frame = +2
Query: 71 GVAGGGGGGRGGGGGFPGYGEGRVDSSRYMPPQNAGGSGYPPYGAGYGYGSNGVGYGGFG 250
G GGGGG GGGGG G G S Y G G YG GYG G+G GG G
Sbjct: 36 GGGGGGGGQGGGGGGGSALGSGSGYGSGY------GSGGGEGYGGAGGYG--GLGGGGGG 87
Query: 251 GYGNPSGAPYGNPGVTVPGAGFGSGPRSSWGGQAPSGYGNVGYGNAAAPSAPWGGSGPGS 430
G G+ G G+ + G+G+GSG S +G + G G G G GG G G+
Sbjct: 88 GGGSGGGGGGGSASGSGSGSGYGSGSGSGYGSGSGGGKGGGGGGGGGKGGGGGGGGGVGN 147
Query: 431 AVMGQG-GASAGYG-GQGYGYGGNDSSYGTPSGYGAVGGRPNSLGGGYADGLDGYGNHQG 604
G G G +GYG G G GYG G G G GG GGG G GYG+ G
Sbjct: 148 ---GNGSGYGSGYGSGSGSGYGSGGGKGGGGGGGGGGGGGGGGGGGGSGSG-SGYGSGYG 203
Query: 605 NGQ---AGYGG 628
+G +GYGG
Sbjct: 204 SGSGYGSGYGG 214
>gi|2961347|emb|CAA18105.1| glycine-rich protein [Arabidopsis thaliana]
Length = 396
Score = 106 bits (263), Expect = 4e-021
Identities = 77/199 (38%), Positives = 92/199 (46%), Gaps = 11/199 (5%)
Frame = +2
Query: 62 ANPGVAGGGGGGRGGGGGFPGYGEGRVDSSRYMPPQNA----GGSGYPPYGAGYGYGSNG 229
A GV G GG GGGGG G G G S + A GG+ G G G G G
Sbjct: 166 AGAGVGGSSGGAGGGGGGGGGEGGGANGGSGHGSGAGAGAGVGGAAGGVGGGGGGGGGEG 225
Query: 230 VGYGGFGGYGNPSGAPYGNPGVTVPGAGFGSGPRSSWGGQAPSGYGNVGYGNAAAPSAPW 409
G G G+G+ SGA G G G G G G S G + GY G+G+ +
Sbjct: 226 GGANGGSGHGSGSGAGGGVSG-AAGGGGGGGGGGGSGGSKVGGGY---GHGSGFGGGVGF 281
Query: 410 GGSGPGSAVMGQGGASAGYGGQGYGYGGNDSSYGTPSGYGAVGGRPNSLGGGYADGLDGY 589
G SG G G GG G GG G GY G+ S YG+ G G+ G GGG + G +G
Sbjct: 282 GNSGGGGG-GGGGGGGGGGGGNGSGY-GSGSGYGSGMGKGSGSGGGGGGGGGGSGGGNGS 339
Query: 590 GNHQGNGQAGYGGGYGSGN 646
G+ G G G GGG G+GN
Sbjct: 340 GSGSGEGY-GMGGGAGTGN 357
>gi|170733890|ref|YP_001765837.1| hypothetical protein Bcenmc03_2554
[Burkholderia cenocepacia MC0-3]
Length = 505
Score = 104 bits (259), Expect = 1e-020
Identities = 75/196 (38%), Positives = 84/196 (42%), Gaps = 11/196 (5%)
Frame = +2
Query: 71 GVAGGGGGGRGGGGGFPGYGEGRVDSSRYMPPQNAGGSGYPPYGAGYGYGSNGVGYGGFG 250
G GG G G GG G G+G G + P N GG G G G G G G G GG G
Sbjct: 295 GSGGGHGDGGGGHGNGGGHGNGGGGNGNGGGPGNGGGGGGGGGGGGNGNGGGGGGGGGGG 354
Query: 251 GYGNPSGAPYGNPGVTVPGAGFGSGPRSSWGGQAPSGYGNVGYGNAAAPSAPWGGSGPGS 430
G G GA GN G G G G G GG G G G GN GG G G
Sbjct: 355 GGGGGGGAG-GNGG---NGGGGGGGGGGGGGGGGGGGGGGGGGGNGGNGGGGGGGHGNGG 410
Query: 431 AVMGQGGASAGYGGQGYGYGGNDSSYGTPSGYGAVGGRPNSLGGGYADGLDGYGNHQG-- 604
G GG + G G G GN + G +G G GG + GG +G G GN G
Sbjct: 411 GGHGNGGGNG--NGNGSGGAGNGGANGVGNGRGN-GGNSGNAGGSNGNGGGGAGNGGGSG 467
Query: 605 --NGQAGYGGGYGSGN 646
NG G+G G G+GN
Sbjct: 468 GANGTGGHGNGGGNGN 483
>gi|71022041|ref|XP_761251.1| hypothetical protein UM05104.1 [Ustilago maydis
521]
Length = 838
Score = 101 bits (250), Expect = 1e-019
Identities = 77/195 (39%), Positives = 85/195 (43%), Gaps = 8/195 (4%)
Frame = +2
Query: 71 GVAGGGGGGRGGGGGFPGYGEGRVDSSRYMPPQNAGGS---GYPPYGAGYGYGSNGVGYG 241
G G G G G G G G G G S P + GS G G +G GSNG G G
Sbjct: 91 GSGSGSGSGFGSGSG-SGSGSGSGSGSGTKSPGSGSGSHDGGSNGGGGSHGGGSNGGGNG 149
Query: 242 GFGGYGNPSGAPYGNPGVTVPGAGFGSGPRSSWGGQAPSGYGNVGYGNAAAPSAPWGGSG 421
G G GN G GN G G G G+G + G +G GN G GN GG
Sbjct: 150 GGNGGGNGGGNGGGNGGGNGGGNGGGNGGGNGGGNGGGNGGGN-GGGNGGGNGGGNGGGN 208
Query: 422 PGSAVMGQGGASAGYGGQGYGYGGNDSSYGTPSGYGAVGGRPNSLGGGYADGLDGYGNHQ 601
G G GG + G G G G GGN G +G G GG GGG G +G GN
Sbjct: 209 GGGNGGGNGGGNGGGNGGGNG-GGNGGGNGGGNGGGNGGGNGGGNGGGNGGG-NGGGNGG 266
Query: 602 GNGQAGYGGGYGSGN 646
GNG G GGG G GN
Sbjct: 267 GNG-GGNGGGNGGGN 280
>gi|225427197|ref|XP_002278025.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 393
Score = 97 bits (241), Expect = 1e-018
Identities = 63/160 (39%), Positives = 72/160 (45%), Gaps = 7/160 (4%)
Frame = +2
Query: 71 GVAGGGGGGRGGGGGFPGYGEGRVDSSRYMPPQNAGGSGYPPYGAGYGYGS---NGVGYG 241
G GGG GG GGGG GYG G Y + AGG G G G GYG+ +G GYG
Sbjct: 235 GEHGGGYGGGSGGGGGVGYGAGGEHGGGYGTGEGAGGGGGGGSGGGTGYGAGGEHGGGYG 294
Query: 242 GFGGYGNPSGAPYGNPGVTVPGAGFGSGPRSSWGGQAPSGYGNVGYGNAAAPSAPWGGSG 421
GG G +G G G G GSG + +GG+ GYG GN +G G
Sbjct: 295 SGGGSGGGTGYGAGGEHGGGYGGGGGSGGGTGYGGEHGGGYGG---GNGGGGGVGYGAGG 351
Query: 422 PGSAVMGQGGASAGYGGQGYGYGGNDSSYGTPSGYGAVGG 541
G+GG S G G G G GG YG G G GG
Sbjct: 352 EHGGGYGRGGGSGGGAGAGSG-GGGGGGYGGGYGGGVHGG 390
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,878,430,556,656
Number of Sequences: 15229318
Number of Extensions: 5878430556656
Number of Successful Extensions: 1392584967
Number of sequences better than 0.0: 0