Library    |     Search    |     Batch query    |     SNP    |     SSR  

GenBank blast output of UN02836


BLASTX 7.6.2

Query= UN02836 /QuerySize=1435
        (1434 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|18416364|ref|NP_567704.1| hydroxyproline-rich glycoprotein fa...    411   1e-112
gi|15028121|gb|AAK76684.1| unknown protein [Arabidopsis thaliana]      409   6e-112
gi|297803660|ref|XP_002869714.1| hydroxyproline-rich glycoprotei...    406   8e-111
gi|30686552|ref|NP_849436.1| hydroxyproline-rich glycoprotein fa...    363   5e-098
gi|255545984|ref|XP_002514052.1| conserved hypothetical protein ...    129   2e-027
gi|224063391|ref|XP_002301125.1| predicted protein [Populus tric...    118   3e-024
gi|46095228|gb|AAS80151.1| ACT11D09.5 [Cucumis melo]                   110   7e-022
gi|255545986|ref|XP_002514053.1| conserved hypothetical protein ...     92   2e-016
gi|224081921|ref|XP_002306529.1| predicted protein [Populus tric...     79   2e-012

>gi|18416364|ref|NP_567704.1| hydroxyproline-rich glycoprotein family protein
        [Arabidopsis thaliana]

          Length = 319

 Score =  411 bits (1056), Expect = 1e-112
 Identities = 204/272 (75%), Positives = 223/272 (81%), Gaps = 18/272 (6%)
 Frame = -2

Query: 1127 FYTDPMAAYSSFKRNKSPKQQYISSPSHQMSPPV-PQFPPSV-PGSMGNDYQVHPNHGGF 954
            +YTDPMAAYSSFK+NK+PKQQYISSPSHQ S PV PQFPPSV PGS+ ++YQ   NHGGF
Sbjct:   61 YYTDPMAAYSSFKKNKTPKQQYISSPSHQGSSPVPPQFPPSVPPGSLCSEYQAQTNHGGF 120

Query:  953 QEAHYGGDNQHTQPRGMA---PSYRGPPAPWNNNFRPPPPVNHLGPPQWVPRPYPFIQGN 783
              AHY       +PRGMA   PS+RGPPA WNNNFR PPPVNH GPPQWVPRP+PF Q  
Sbjct:  121 HAAHY-------EPRGMAHLSPSHRGPPAGWNNNFR-PPPVNHSGPPQWVPRPFPFSQEM 172

Query:  782 HDMGNNRYGGRGPRVGGYNNNPPQFPHYGRQNSNWAGNTYPNSGRGRGGGGRGMNTSFGR 603
             +MGNNR+GGR    G YNN PPQF +YGRQN+NW GNTYPNSGRGR   GRGMNTSFGR
Sbjct:  173 PNMGNNRFGGR----GSYNNTPPQFSNYGRQNANWGGNTYPNSGRGR-SRGRGMNTSFGR 227

Query:  602 GGGRRPMEQGAERFYSNSMAEDPWKHLKPVLWKSFSDASSSNSTGQTWRPSSIAPKKPMI 423
             GGRRPME GAERFYSNSMAEDPWKHLKPVLWK+ SDASSS+STGQ W P SIAPKK + 
Sbjct:  228 DGGRRPMEPGAERFYSNSMAEDPWKHLKPVLWKNCSDASSSSSTGQAWLPKSIAPKKSVT 287

Query:  422 SEASHKPSNNQQSLAEYLAASLDEATCDDPSN 327
            SEA+HK S+NQQSLAEYLAASLD ATCD+ SN
Sbjct:  288 SEATHKTSSNQQSLAEYLAASLDGATCDESSN 319


 Score =  82 bits (200), Expect = 3e-013
 Identities = 40/61 (65%), Positives = 50/61 (81%), Gaps = 1/61 (1%)
 Frame = -3

Query: 1306 EDSEKRKEMLKAMRMEAAAAASQNDVSTELETSMNTSHLSNPLADASTHQQESYDKPRFD 1127
            EDSEKRK+MLKAMRME AAA + +D +T  ETSM+T HLSNPLA+ S HQQ+S++  RFD
Sbjct:    2 EDSEKRKQMLKAMRME-AAAQNDDDATTGTETSMSTGHLSNPLAETSNHQQDSFETQRFD 60

Query: 1126 F 1124
            +
Sbjct:   61 Y 61

>gi|15028121|gb|AAK76684.1| unknown protein [Arabidopsis thaliana]

          Length = 319

 Score =  409 bits (1051), Expect = 6e-112
 Identities = 203/272 (74%), Positives = 223/272 (81%), Gaps = 18/272 (6%)
 Frame = -2

Query: 1127 FYTDPMAAYSSFKRNKSPKQQYISSPSHQMSPPV-PQFPPSV-PGSMGNDYQVHPNHGGF 954
            +YTDPMAAYSSFK+NK+PKQQYISSPSHQ S PV PQFPPSV PGS+ ++YQ   NHGGF
Sbjct:   61 YYTDPMAAYSSFKKNKTPKQQYISSPSHQGSSPVPPQFPPSVPPGSLCSEYQAQTNHGGF 120

Query:  953 QEAHYGGDNQHTQPRGMA---PSYRGPPAPWNNNFRPPPPVNHLGPPQWVPRPYPFIQGN 783
              AHY       +PRGMA   PS+RGPPA WNNNFR PPPVNH GPPQWVPRP+PF Q  
Sbjct:  121 HAAHY-------EPRGMAHLSPSHRGPPAGWNNNFR-PPPVNHSGPPQWVPRPFPFSQEM 172

Query:  782 HDMGNNRYGGRGPRVGGYNNNPPQFPHYGRQNSNWAGNTYPNSGRGRGGGGRGMNTSFGR 603
             +MGNNR+GGR    G YNN PPQF +YGRQN+NW GNT+PNSGRGR   GRGMNTSFGR
Sbjct:  173 PNMGNNRFGGR----GSYNNTPPQFSNYGRQNANWGGNTHPNSGRGR-SRGRGMNTSFGR 227

Query:  602 GGGRRPMEQGAERFYSNSMAEDPWKHLKPVLWKSFSDASSSNSTGQTWRPSSIAPKKPMI 423
             GGRRPME GAERFYSNSMAEDPWKHLKPVLWK+ SDASSS+STGQ W P SIAPKK + 
Sbjct:  228 DGGRRPMEPGAERFYSNSMAEDPWKHLKPVLWKNCSDASSSSSTGQAWLPKSIAPKKSVT 287

Query:  422 SEASHKPSNNQQSLAEYLAASLDEATCDDPSN 327
            SEA+HK S+NQQSLAEYLAASLD ATCD+ SN
Sbjct:  288 SEATHKTSSNQQSLAEYLAASLDGATCDESSN 319


 Score =  82 bits (200), Expect = 3e-013
 Identities = 40/61 (65%), Positives = 50/61 (81%), Gaps = 1/61 (1%)
 Frame = -3

Query: 1306 EDSEKRKEMLKAMRMEAAAAASQNDVSTELETSMNTSHLSNPLADASTHQQESYDKPRFD 1127
            EDSEKRK+MLKAMRME AAA + +D +T  ETSM+T HLSNPLA+ S HQQ+S++  RFD
Sbjct:    2 EDSEKRKQMLKAMRME-AAAQNDDDATTGTETSMSTGHLSNPLAETSNHQQDSFETQRFD 60

Query: 1126 F 1124
            +
Sbjct:   61 Y 61

>gi|297803660|ref|XP_002869714.1| hydroxyproline-rich glycoprotein family
        protein [Arabidopsis lyrata subsp. lyrata]

          Length = 320

 Score =  406 bits (1041), Expect = 8e-111
 Identities = 199/272 (73%), Positives = 222/272 (81%), Gaps = 18/272 (6%)
 Frame = -2

Query: 1127 FYTDPMAAYSSFKRNKSPKQQYISSPSHQMSPPV-PQFPPSV-PGSMGNDYQVHPNHGGF 954
            +YTDPM+AYSSFK+ K+PKQQYISSPSHQ S PV PQFPPSV PGS+G++YQ H NHGGF
Sbjct:   62 YYTDPMSAYSSFKKIKTPKQQYISSPSHQASSPVPPQFPPSVPPGSLGSEYQAHTNHGGF 121

Query:  953 QEAHYGGDNQHTQPRGM---APSYRGPPAPWNNNFRPPPPVNHLGPPQWVPRPYPFIQGN 783
            Q AHY       +PRGM   +P YRG PA WNNNFR PPPVNH GPPQWVPRP+PF Q  
Sbjct:  122 QAAHY-------EPRGMSHLSPPYRGSPASWNNNFR-PPPVNHPGPPQWVPRPFPFSQEI 173

Query:  782 HDMGNNRYGGRGPRVGGYNNNPPQFPHYGRQNSNWAGNTYPNSGRGRGGGGRGMNTSFGR 603
             +MGNNR+G R    G YNN  P F +YGRQN+NW GNTYPNSGRG GG GRGMNTSFGR
Sbjct:  174 PNMGNNRFGDR----GSYNNTAPHFSNYGRQNANWVGNTYPNSGRG-GGRGRGMNTSFGR 228

Query:  602 GGGRRPMEQGAERFYSNSMAEDPWKHLKPVLWKSFSDASSSNSTGQTWRPSSIAPKKPMI 423
             GGRRP E GAER+YSNSMA+DPWK+LKPV+WKS SDASSSNSTGQ W P+S APKK + 
Sbjct:  229 DGGRRPTELGAERYYSNSMADDPWKYLKPVIWKSCSDASSSNSTGQAWLPNSTAPKKSVT 288

Query:  422 SEASHKPSNNQQSLAEYLAASLDEATCDDPSN 327
            SEA+HKPSNNQQSLAEYLAASLDEATCD+ S+
Sbjct:  289 SEATHKPSNNQQSLAEYLAASLDEATCDESSS 320


 Score =  86 bits (211), Expect = 1e-014
 Identities = 41/62 (66%), Positives = 50/62 (80%)
 Frame = -3

Query: 1309 MEDSEKRKEMLKAMRMEAAAAASQNDVSTELETSMNTSHLSNPLADASTHQQESYDKPRF 1130
            MEDSEKRK+MLKAMRMEAAA    +D +T+ ETSMNT HLSNPLA+ ST  Q+S++  RF
Sbjct:    1 MEDSEKRKQMLKAMRMEAAAQNDNDDSTTDPETSMNTGHLSNPLAETSTQHQDSFETSRF 60

Query: 1129 DF 1124
            D+
Sbjct:   61 DY 62

>gi|30686552|ref|NP_849436.1| hydroxyproline-rich glycoprotein family protein
        [Arabidopsis thaliana]

          Length = 290

 Score =  363 bits (931), Expect = 5e-098
 Identities = 182/254 (71%), Positives = 200/254 (78%), Gaps = 18/254 (7%)
 Frame = -2

Query: 1073 KQQYISSPSHQMSPPV-PQFPPSV-PGSMGNDYQVHPNHGGFQEAHYGGDNQHTQPRGMA 900
            +Q    + SHQ S PV PQFPPSV PGS+ ++YQ   NHGGF  AHY       +PRGMA
Sbjct:   50 QQDSFETQSHQGSSPVPPQFPPSVPPGSLCSEYQAQTNHGGFHAAHY-------EPRGMA 102

Query:  899 ---PSYRGPPAPWNNNFRPPPPVNHLGPPQWVPRPYPFIQGNHDMGNNRYGGRGPRVGGY 729
               PS+RGPPA WNNNFR PPPVNH GPPQWVPRP+PF Q   +MGNNR+GGR    G Y
Sbjct:  103 HLSPSHRGPPAGWNNNFR-PPPVNHSGPPQWVPRPFPFSQEMPNMGNNRFGGR----GSY 157

Query:  728 NNNPPQFPHYGRQNSNWAGNTYPNSGRGRGGGGRGMNTSFGRGGGRRPMEQGAERFYSNS 549
            NN PPQF +YGRQN+NW GNTYPNSGRGR   GRGMNTSFGR GGRRPME GAERFYSNS
Sbjct:  158 NNTPPQFSNYGRQNANWGGNTYPNSGRGR-SRGRGMNTSFGRDGGRRPMEPGAERFYSNS 216

Query:  548 MAEDPWKHLKPVLWKSFSDASSSNSTGQTWRPSSIAPKKPMISEASHKPSNNQQSLAEYL 369
            MAEDPWKHLKPVLWK+ SDASSS+STGQ W P SIAPKK + SEA+HK S+NQQSLAEYL
Sbjct:  217 MAEDPWKHLKPVLWKNCSDASSSSSTGQAWLPKSIAPKKSVTSEATHKTSSNQQSLAEYL 276

Query:  368 AASLDEATCDDPSN 327
            AASLD ATCD+ SN
Sbjct:  277 AASLDGATCDESSN 290


 Score =  75 bits (182), Expect = 3e-011
 Identities = 37/55 (67%), Positives = 46/55 (83%), Gaps = 1/55 (1%)
 Frame = -3

Query: 1306 EDSEKRKEMLKAMRMEAAAAASQNDVSTELETSMNTSHLSNPLADASTHQQESYD 1142
            EDSEKRK+MLKAMRME AAA + +D +T  ETSM+T HLSNPLA+ S HQQ+S++
Sbjct:    2 EDSEKRKQMLKAMRME-AAAQNDDDATTGTETSMSTGHLSNPLAETSNHQQDSFE 55

>gi|255545984|ref|XP_002514052.1| conserved hypothetical protein [Ricinus
        communis]

          Length = 412

 Score =  129 bits (322), Expect = 2e-027
 Identities = 114/293 (38%), Positives = 144/293 (49%), Gaps = 53/293 (18%)
 Frame = -2

Query: 1127 FYTDPMAAYSSFKRNKS---PKQQYISSPSHQMSPPVPQFPPSVPGSMGND-------YQ 978
            FYT+PMAA+S+ KR  S   P  +Y   PS+  + P+P F   VPG  GN        YQ
Sbjct:  139 FYTNPMAAFSADKRIASINQPAPRYFIPPSN--NGPMPWFSSPVPGP-GNPGMTPSPVYQ 195

Query:  977 VH----PNHGGFQEAHYGGDNQHTQPR-GMAPSYRGPPAPWNNNFRPPPPVNHLGPPQWV 813
            +     PN    Q+  Y     +  PR G  P ++G P  WN     P  +    P +  
Sbjct:  196 MQSNYLPNQRTHQQGPYNSAVPYRSPRAGPFPMHQGTPDAWNG----PGGIAAAAPYRGR 251

Query:  812 PRPYPFIQGNHDM-----GNNRYG-GRGPRVGGYNNNPPQFPHYGRQNSNWAGNTYPNSG 651
              PYP  + N         +  YG GR P  G  N+  P+  H G        +TY  SG
Sbjct:  252 MCPYPIHESNPGFQPAGSPSFNYGQGRPPWSG--NSPSPRSVHGG-------SSTY--SG 300

Query:  650 RGRG---GGGRG-MNTSFGRGG----GRRPMEQ-GAERFYSNSMAEDPWKHLKPVLWKSF 498
            RG+G   G  RG ++   GR G    G  P E  G E FY  SM EDPWK L+PV+WK  
Sbjct:  301 RGQGQWHGSSRGQISGQSGRRGFHSRGPAPGEAFGPESFYEKSMVEDPWKQLEPVVWKML 360

Query:  497 SDASSSNSTGQTWRPSSIAPKKPMISEASHKPSNNQQSLAEYLAASLDEATCD 339
                SSNS    W P SI+ KKP  SE+S+  SN++QSLAEYLAAS +EA  D
Sbjct:  361 GVPGSSNS----WLPKSISRKKPRPSESSNN-SNSKQSLAEYLAASFNEAVKD 408

>gi|224063391|ref|XP_002301125.1| predicted protein [Populus trichocarpa]

          Length = 347

 Score =  118 bits (294), Expect = 3e-024
 Identities = 96/269 (35%), Positives = 124/269 (46%), Gaps = 32/269 (11%)
 Frame = -2

Query: 1085 NKSPKQQYISSPSHQMSPPVPQFPPSVPGSMGNDY----QVHPNHGGFQEAHYGGDNQHT 918
            N S   Q+ S    Q +P V    PS    M N+Y    Q+  N+   Q  + G    H 
Sbjct:   93 NISSMPQFSSPHPGQRNPEV---TPSSAYQMQNNYSPANQMQSNYSPNQRMYPGQGPYHN 149

Query:  917 QPRGMAPS--------YRGPPAPWNNNFRPPPPVNHLGPP-QWVPRPYPFIQGNHDMGNN 765
                  PS         +G P  WN      P  NH   P + + RPYP  QGN   G  
Sbjct:  150 AAFYRTPSNFARPFTMNQGTPEMWNG--PGGPASNHSSTPYRGISRPYPIHQGNPGFG-- 205

Query:  764 RYGGRGPRVGGYNNNPPQFPHYGRQNSNWAGNTYPNSGRGRGGG-GRGMNTSFGRGGGRR 588
              G     V GY  +P      GR      G    +SG G+ GG GRG    F   G   
Sbjct:  206 PVGSSPSPVSGYGGSPAS---SGRGQGRGQGYWDSSSGLGQSGGRGRG----FRSRGFAL 258

Query:  587 PMEQGAERFYSNSMAEDPWKHLKPVLWKSFSDASSSNS---TGQTWRPSSIAPKKPMISE 417
               Q  E F+ NSM EDPW+HLKPVLW+   D  ++ +   +  +W P SI+ KKP ISE
Sbjct:  259 NETQEPECFHDNSMVEDPWQHLKPVLWRGLDDPGNNLNGPVSSNSWLPKSISVKKPRISE 318

Query:  416 ASHKPSNNQQSLAEYLAASLDEATCDDPS 330
            +S+K S + Q+LAEYL+A+  EAT D P+
Sbjct:  319 SSNK-STSGQTLAEYLSAAFTEATNDAPN 346

>gi|46095228|gb|AAS80151.1| ACT11D09.5 [Cucumis melo]

          Length = 568

 Score =  110 bits (274), Expect = 7e-022
 Identities = 90/290 (31%), Positives = 131/290 (45%), Gaps = 39/290 (13%)
 Frame = -2

Query: 1127 FYTDPMAAYSSFKRNKSPKQQYISS---PSHQMSPPVPQFPPSVPG------SMGNDYQV 975
            +YT+PMAA+S+ K+    + Q +S    P H  +      PP+ PG      S  + +Q 
Sbjct:  293 YYTNPMAAFSTSKKKGKIENQPVSDTFVPYHHNTSSTTYLPPTFPGLRNPEMSPSSTHQF 352

Query:  974 H---PNHGGFQ---EAHYGGDNQHTQPRGMAPS------YRGPPAPWNNNFRPPPPVNHL 831
            H   P+   F    ++  GG      PR  A +      +RGP  P+ N F P  P   +
Sbjct:  353 HQYSPDQRTFYARGDSEAGGHGSPGMPRPYAVNQGDPHMWRGPRRPFVNQF-PTHPPREM 411

Query:  830 GPPQWVPRPYPFIQGNHDMGNNRYGGRGPRVGGYNNNPPQFPHYGRQNSNWAGNTYPNS- 654
                 V  P      N      +Y    P  G + +  P     GR +    GN  P+  
Sbjct:  412 NSSSHVSGPRGNSYTNPTQDRAKYRSSSPNPGFHGSLSP-----GRGSHGHHGNMTPSPR 466

Query:  653 -GRGRGGGGRGMNTSFGRGGGRRPMEQGAERFYSNSMAEDPWKHLKPVLWKSFSDASSSN 477
             G GRG G  G ++   +         G E+FY+ SM EDPWK L+P +W +   +S+S 
Sbjct:  467 FGYGRGTGFHGRHSLLDK--------SGPEQFYNVSMLEDPWKVLQPCIWTTIDSSSNSA 518

Query:  476 STGQTWRPSSIAPKKPMISEASHKPSNNQQ-SLAEYLAASLDEATCDDPS 330
               ++W  S    KK  +S++S   S++QQ SLAEYLAAS  EA  D P+
Sbjct:  519 KPSESW-ISKFGTKKARVSDSSSGRSSSQQPSLAEYLAASFKEAIEDAPN 567

>gi|255545986|ref|XP_002514053.1| conserved hypothetical protein [Ricinus
        communis]

          Length = 226

 Score =  92 bits (228), Expect = 2e-016
 Identities = 83/235 (35%), Positives = 103/235 (43%), Gaps = 31/235 (13%)
 Frame = -2

Query: 1034 PPVPQFPPSVPGSMGNDYQVHPNHGGFQ-EAHYGGDNQHTQPR-GMAPSYRGPPAPWNNN 861
            P  P   PS    M ++Y   PN    Q +  Y     +  PR G+ P ++G P  WN  
Sbjct:    4 PGNPGMTPSPAYQMQSNYL--PNQRTHQAQGPYNSAVPYRSPRTGLFPMHQGTPDAWNG- 60

Query:  860 FRPPPPVNHLGPPQWVPRPYPFIQGNHDMGNNR-----YG-GRGPRVGGYNNNPPQFPHY 699
               P  +    P +    PYP  + N      R     YG GR P  G  NN  P+  H 
Sbjct:   61 ---PGGIAAAAPYRGRMCPYPIYESNPGFQPARSPSFNYGQGRPPWSG--NNPCPRSVHG 115

Query:  698 G------RQNSNWAGNTYPNSGRGRGGGGRGMNTSFGRGGGRRPMEQGAERFYSNSMAED 537
            G      R    W G+         G  GRG + S G   G      G E F+  SM ED
Sbjct:  116 GSSTYSRRGQGQWHGSNRGQISGQSGRRGRGFH-SRGPASGE---AFGPESFHDKSMVED 171

Query:  536 PWKHLKPVLWKSFSDASSSNSTGQTWRPSSIAPKKPMISEASHKPSNNQQSLAEY 372
            PWK L+PV+WK      SSNS    W P SI+ KKP  SE S+  SN++QSLAEY
Sbjct:  172 PWKQLEPVVWKMLEVPRSSNS----WLPKSISRKKPRPSEPSNN-SNSKQSLAEY 221

>gi|224081921|ref|XP_002306529.1| predicted protein [Populus trichocarpa]

          Length = 331

 Score =  79 bits (193), Expect = 2e-012
 Identities = 55/136 (40%), Positives = 75/136 (55%), Gaps = 14/136 (10%)
 Frame = -2

Query: 746 PRVGGYNNNPPQFPHYGRQ---NSNWAGNTYPNSGRGRGGG-GRGMNTSFGRGGGRRPME 579
           P  G   ++P     YG     +    G+ + +SG G+ GG GRG ++      G  P E
Sbjct: 202 PGFGPVGSSPSPVSGYGGSPAISQTGQGHWHSSSGFGQSGGRGRGFHSR-----GFAPNE 256

Query: 578 -QGAERFYSNSMAEDPWKHLKPVLWKSFSD-ASSSNSTG--QTWRPSSIAPKKPMISEAS 411
            QG E FY NSM EDPW+HL+PVLW    D  ++ N  G   +  P SI+ KK  ++E+S
Sbjct: 257 AQGPECFYDNSMVEDPWQHLEPVLWSGLDDWGNNLNGPGSSNSLLPKSISMKKSSVAESS 316

Query: 410 HKPSNNQQSLAEYLAA 363
           +K S +  SLAEYLAA
Sbjct: 317 NK-STSGVSLAEYLAA 331

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 359,848,489,672
Number of Sequences: 15229318
Number of Extensions: 359848489672
Number of Successful Extensions: 91916522
Number of sequences better than 0.0: 0