Library    |     Search    |     Batch query    |     SNP    |     SSR  

GenBank blast output of UN16887


BLASTX 7.6.2

Query= UN16887 /QuerySize=877
        (876 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|18423010|ref|NP_568708.1| hydroxyproline-rich glycoprotein fa...    303   3e-080
gi|297795679|ref|XP_002865724.1| hypothetical protein ARALYDRAFT...    297   2e-078
gi|297804592|ref|XP_002870180.1| predicted protein [Arabidopsis ...    189   6e-046
gi|15234820|ref|NP_193349.1| proline-rich family protein [Arabid...    179   4e-043
gi|224106966|ref|XP_002314326.1| predicted protein [Populus tric...    170   3e-040
gi|225431867|ref|XP_002271531.1| PREDICTED: hypothetical protein...    165   1e-038
gi|18397707|ref|NP_566291.1| hydroxyproline-rich glycoprotein fa...    157   2e-036
gi|297833450|ref|XP_002884607.1| hypothetical protein ARALYDRAFT...    157   2e-036
gi|224130440|ref|XP_002328609.1| predicted protein [Populus tric...    154   3e-035
gi|255556284|ref|XP_002519176.1| nutrient reservoir, putative [R...    153   4e-035
gi|255548545|ref|XP_002515329.1| nutrient reservoir, putative [R...    137   3e-030
gi|225678143|gb|EEH16427.1| conserved hypothetical protein [Para...     85   1e-014
gi|115386684|ref|XP_001209883.1| conserved hypothetical protein ...     80   5e-013
gi|332025507|gb|EGI65670.1| hypothetical protein G5I_05770 [Acro...     78   1e-012
gi|261190885|ref|XP_002621851.1| cytokinesis protein sepA [Ajell...     77   2e-012

>gi|18423010|ref|NP_568708.1| hydroxyproline-rich glycoprotein family protein
        [Arabidopsis thaliana]

          Length = 162

 Score =  303 bits (774), Expect = 3e-080
 Identities = 139/165 (84%), Positives = 149/165 (90%), Gaps = 4/165 (2%)
 Frame = -2

Query: 755 MEANQLYGLSTLVLILLLSLTSTVTSKDEVVSCTMCSSCDNPCNPVPSYPPPPPPSRPPP 576
           ME N LY LSTLV++LL+S+T TVTSKDEVVSCTMCSSCDNPC+PV S PPPP P  PPP
Sbjct:   1 METNHLYTLSTLVVMLLVSVTPTVTSKDEVVSCTMCSSCDNPCSPVQSSPPPPSP--PPP 58

Query: 575 SPSTTTACPPPPSPPSSGGGSSYYYPPPSQS-GGDKYPPPYGDGGQGYYYPPPYSGNYPT 399
           S + TTACPPPPSPPSSGGGSSYYYPPPSQS GG KYPPPYG GGQGYYYPPPYSGNYPT
Sbjct:  59 S-TPTTACPPPPSPPSSGGGSSYYYPPPSQSGGGSKYPPPYGGGGQGYYYPPPYSGNYPT 117

Query: 398 PPPPNPIVPYFPFYYHTPPPGSGSDRFMGSGSVMFALFAMFLCLI 264
           PPPPNPIVPYFPFYYHTPPPGSGSDRFM S S++FALFA+FLCL+
Sbjct: 118 PPPPNPIVPYFPFYYHTPPPGSGSDRFMSSYSIIFALFAVFLCLV 162

>gi|297795679|ref|XP_002865724.1| hypothetical protein ARALYDRAFT_494989
        [Arabidopsis lyrata subsp. lyrata]

          Length = 161

 Score =  297 bits (759), Expect = 2e-078
 Identities = 137/164 (83%), Positives = 144/164 (87%), Gaps = 4/164 (2%)
 Frame = -2

Query: 755 MEANQLYGLSTLVLILLLSLTSTVTSKDEVVSCTMCSSCDNPCNPVPSYPPPPPPSRPPP 576
           ME N LY  STLV+ILL+S+T TVTSKDEVVSCTMCSSCDNPC+PV S PPPP P  PPP
Sbjct:   1 METNHLYTFSTLVVILLMSVTPTVTSKDEVVSCTMCSSCDNPCSPVQSSPPPPSP--PPP 58

Query: 575 SPSTTTACPPPPSPPSSGGGSSYYYPPPSQS-GGDKYPPPYGDGGQGYYYPPPYSGNYPT 399
           S + TTACPPPPSPP SGGGSSYYYPPPSQS GG KYPPPYG GGQGYYYPPPYSGNYPT
Sbjct:  59 S-TPTTACPPPPSPPRSGGGSSYYYPPPSQSGGGSKYPPPYGGGGQGYYYPPPYSGNYPT 117

Query: 398 PPPPNPIVPYFPFYYHTPPPGSGSDRFMGSGSVMFALFAMFLCL 267
           PPPPNPIVPYFPFYYHTPPPGSGSDRFM S SV+F  FA+FLCL
Sbjct: 118 PPPPNPIVPYFPFYYHTPPPGSGSDRFMSSCSVIFTFFAVFLCL 161

>gi|297804592|ref|XP_002870180.1| predicted protein [Arabidopsis lyrata subsp.
        lyrata]

          Length = 167

 Score =  189 bits (479), Expect = 6e-046
 Identities = 98/168 (58%), Positives = 110/168 (65%), Gaps = 11/168 (6%)
 Frame = -2

Query: 755 MEANQLYGLSTLVL-ILLLSLTSTVTSKDEVVSCTMCSSCDNPCNPVPSYPPPPPPSRPP 579
           ME  + + L  L+L     + T+T+TS  ++  CTMC+SCDNPC P PS PPP  PS PP
Sbjct:   1 METLRTFHLFLLLLFFFFFTFTTTLTSPSQIADCTMCTSCDNPCQPNPSPPPPSNPSPPP 60

Query: 578 PSPSTTTACPPPPSPPSSGGGSSYYYPPPSQSGGDKYPPPYGDGGQGYYYPPPYS-GNYP 402
           P+P TTTACPPPPS  S GGG  YYYPPPSQSG   Y PP    G GYYYPPP S GNYP
Sbjct:  61 PAP-TTTACPPPPS--SGGGGPYYYYPPPSQSG--SYRPPPSSSGGGYYYPPPKSGGNYP 115

Query: 401 TPPPPNPIVPYFPFYYHTPPPG---SGSD-RFMGSGSVMFALFAMFLC 270
             PPPNPIVPYFPFYY+ PPP    SGSD +   S  V F L    LC
Sbjct: 116 FTPPPNPIVPYFPFYYYNPPPQSVMSGSDAKIRFSYGVSFLLLVFSLC 163

>gi|15234820|ref|NP_193349.1| proline-rich family protein [Arabidopsis
        thaliana]

          Length = 164

 Score =  179 bits (454), Expect = 4e-043
 Identities = 88/137 (64%), Positives = 96/137 (70%), Gaps = 10/137 (7%)
 Frame = -2

Query: 719 VLILLLSLTSTVTSKDEVVSCTMCSSCDNPCNPVPSYPPPPP-PSRPPPSPSTTTACPPP 543
           +     + T+T+TS  ++  CTMC+SCDNPC P PS PPPP  PS PPPSP TTTACPPP
Sbjct:  11 LFFFFFTFTTTLTSPSQIADCTMCTSCDNPCQPNPSPPPPPSNPSPPPPSP-TTTACPPP 69

Query: 542 PSPPSSGGGSSYYYPPPSQSGGDKYPPPYGDGGQGYYYPPPYS-GNYPTPPPPNPIVPYF 366
           PS  SSGGG  YYYPP SQSG   Y PP      GYYYPPP S GNYP  PPPNPIVPYF
Sbjct:  70 PS--SSGGGPYYYYPPASQSG--SYRPPPSSSSGGYYYPPPKSGGNYPYTPPPNPIVPYF 125

Query: 365 PFYYHTPPPG---SGSD 324
           PFYY+ PPP    SGSD
Sbjct: 126 PFYYYNPPPQSVMSGSD 142

>gi|224106966|ref|XP_002314326.1| predicted protein [Populus trichocarpa]

          Length = 174

 Score =  170 bits (430), Expect = 3e-040
 Identities = 92/174 (52%), Positives = 104/174 (59%), Gaps = 12/174 (6%)
 Frame = -2

Query: 755 MEANQLYGLSTLVLILLLSLTSTVTSKDE----VVSCTMCSSCDNPCNPVPSYPP--PPP 594
           ME      LS L L +LL  T + T         ++CTMCS+C     PVPS PP  PPP
Sbjct:   1 METRHKLKLSLLALFMLLPSTKSSTMPKSRMLYQIACTMCSTCCG-STPVPSPPPPSPPP 59

Query: 593 PSRPPPSPSTTTACPPPPSPPSSGGGSSYYY-PPPSQSGGDKYPPPYGDGGQGYYYPPPY 417
           P+  PP P+TT  CPPPPSPP SGGGS YY  PPPS       PPP G    G YYPPP 
Sbjct:  60 PAASPPPPATTAICPPPPSPPPSGGGSYYYSPPPPSTYTYSSPPPPQGGVVGGTYYPPPN 119

Query: 416 SGNYPTPPPPNPIVPYFPFYYHTPPPGSGSDRF----MGSGSVMFALFAMFLCL 267
             NYPTPPPPNPIVPYFPFYY++PPP S S  F      S SV+  + A+ LCL
Sbjct: 120 YKNYPTPPPPNPIVPYFPFYYYSPPPPSMSASFKLMASYSTSVLVGVVALVLCL 173

>gi|225431867|ref|XP_002271531.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 196

 Score =  165 bits (416), Expect = 1e-038
 Identities = 86/158 (54%), Positives = 105/158 (66%), Gaps = 19/158 (12%)
 Frame = -2

Query: 695 TSTVTSKDEV-VSCTMCSSCDNPCNPVPSYPPPPPPSRPPPS---PSTTTACPPPPSPPS 528
           T+ V +K +  + CTMCS+CDNPCN VPS PPPP PS PPPS   PS+++ CPPPPSPPS
Sbjct:  44 TADVAAKSKYQIECTMCSACDNPCNQVPS-PPPPNPSPPPPSPPPPSSSSNCPPPPSPPS 102

Query: 527 SGGGSSYYY--PPPSQSG--GDKYPPPY----GDGGQGYYYPPPYSGNYPTPPPPNPIVP 372
           +    +YYY  PPPSQ        PPP     G GG G +YPPPY  NYP PPPPNPIVP
Sbjct: 103 N---PTYYYSPPPPSQPTYVYSSPPPPNGYNGGGGGGGAFYPPPYQ-NYPAPPPPNPIVP 158

Query: 371 YFPFYYHTPPPGSGSDRFMGSGSVMF--ALFAMFLCLI 264
           YFPF+YHTPPP S ++    S S ++  AL ++  C +
Sbjct: 159 YFPFWYHTPPPTSAANSVYFSDSTVYTIALISLLFCFL 196

>gi|18397707|ref|NP_566291.1| hydroxyproline-rich glycoprotein family protein
        [Arabidopsis thaliana]

          Length = 147

 Score =  157 bits (396), Expect = 2e-036
 Identities = 74/107 (69%), Positives = 80/107 (74%), Gaps = 5/107 (4%)
 Frame = -2

Query: 581 PPSPSTTTACPPPPSPPSSGGGSSYYY--PPPSQSGGDKYPPPYGDGGQGYYYPPPYSGN 408
           P +P  ++  PPPP P SSGGG SYYY  PPPS SGG KYPPPYG  G G YYPPPY GN
Sbjct:  41 PCNPVPSSYSPPPPPPSSSGGGGSYYYSPPPPSSSGGVKYPPPYGGDGYGGYYPPPYYGN 100

Query: 407 YPTPPPPNPIVPYFPFYYHTPPPG-SGSDRFMGSGSVMFALFAMFLC 270
           Y TPPPPNPIVPYFPFYYHTPP G SGS R     S++FALFA+ LC
Sbjct: 101 YGTPPPPNPIVPYFPFYYHTPPQGYSGSARL--QNSLLFALFAVLLC 145


 Score =  104 bits (259), Expect = 2e-020
 Identities = 66/125 (52%), Positives = 69/125 (55%), Gaps = 8/125 (6%)
 Frame = -2

Query: 752 EANQLYGLSTLVLILLLSLTSTVTSKDEVVSCTMCSSCDNPCNPVPSYPPPPPPSRPPPS 573
           EA +   LS L LILL+SL   VTSKDE VSCTMCSSCDNPCNPVPS   PPPP  PP S
Sbjct:   2 EAIRFSNLSGL-LILLMSLPLLVTSKDETVSCTMCSSCDNPCNPVPSSYSPPPP--PPSS 58

Query: 572 PSTTTACPPPPSPPSSGGGSSYYYPPPSQSGGDKYPPPYGDGGQGYYYPP----PYSGNY 405
                +    P PPSS GG  Y  P      G  YPPPY  G  G   PP    PY   Y
Sbjct:  59 SGGGGSYYYSPPPPSSSGGVKYPPPYGGDGYGGYYPPPY-YGNYGTPPPPNPIVPYFPFY 117

Query: 404 PTPPP 390
              PP
Sbjct: 118 YHTPP 122

>gi|297833450|ref|XP_002884607.1| hypothetical protein ARALYDRAFT_477990
        [Arabidopsis lyrata subsp. lyrata]

          Length = 149

 Score =  157 bits (396), Expect = 2e-036
 Identities = 78/116 (67%), Positives = 83/116 (71%), Gaps = 13/116 (11%)
 Frame = -2

Query: 593 PSRPPPSPSTTTACPPPPSPPSSGGGSSYYY--PPPSQSGGDKYPPPYGD---GGQGYYY 429
           P  P PS     +  PPP P SSGGG SYYY  PPPS SGG KYPPPYG    GGQGYYY
Sbjct:  41 PCNPVPS-----SYSPPPPPSSSGGGGSYYYSPPPPSSSGGAKYPPPYGGDGYGGQGYYY 95

Query: 428 PPPYSGNYPTPPPPNPIVPYFPFYYHTPPPG-SGSDRFMGSGSVMFALFAMFLCLI 264
           PPPY GNY TPPPPNPIVPYFPFYYHTPP G SGS R     S++FALFA+ LC +
Sbjct:  96 PPPYYGNYGTPPPPNPIVPYFPFYYHTPPQGYSGSAR--SHDSLLFALFAVLLCFV 149

>gi|224130440|ref|XP_002328609.1| predicted protein [Populus trichocarpa]

          Length = 172

 Score =  154 bits (387), Expect = 3e-035
 Identities = 84/155 (54%), Positives = 96/155 (61%), Gaps = 13/155 (8%)
 Frame = -2

Query: 755 MEANQLYGLSTLVLILLLSLT--STVTSKDEV---VSCTMCSSCDNPCNPVPSYPPPPPP 591
           ME    + LS L L++LLS T  STV     +   ++CTMCS+C    +PV S PPPPP 
Sbjct:   1 METRHKFKLSLLALLMLLSSTKSSTVLPNSRMLYQIACTMCSTCCG-SSPVTSPPPPPP- 58

Query: 590 SRPPPSPSTTTACPPPPSPPSSGGGSSYYYPPPSQSGGD---KYPPPYGDGG-QGYYYPP 423
             PPPS +TT+ CPPPPSPP+S G   YY PPP           PPP  DG   G YYPP
Sbjct:  59 --PPPSLATTSNCPPPPSPPASPGVGFYYSPPPPPPPSTYTYSSPPPPQDGVIGGTYYPP 116

Query: 422 PYSGNYPTPPPPNPIVPYFPFYYHTPPPGSGSDRF 318
           P   NYPTPPPPNPIVPYFPFYY+ PPP S S  F
Sbjct: 117 PNYKNYPTPPPPNPIVPYFPFYYYIPPPPSTSASF 151

>gi|255556284|ref|XP_002519176.1| nutrient reservoir, putative [Ricinus
        communis]

          Length = 171

 Score =  153 bits (385), Expect = 4e-035
 Identities = 82/144 (56%), Positives = 95/144 (65%), Gaps = 17/144 (11%)
 Frame = -2

Query: 731 LSTLVLILLLSLTS-TVTSKDEV---VSCTMCSSCDNPCNPVPSYPPPPPPSRPPPSPST 564
           LS L  +++L  TS +   K  +   ++CTMCS+C   CNPVPS PPPPPPS  PP P++
Sbjct:   9 LSLLAFLVVLPFTSPSPVPKSRMLYQIACTMCSTC---CNPVPS-PPPPPPS--PPPPAS 62

Query: 563 TTACPPPPSPPSSGGGSSYYY--PPPSQSGGDKYPPPYGDGGQG--YYYPPPYS-GNYPT 399
           T  CPPPPSPPS  G  SYYY  PPP+       PPP G  G G  YYYPPP    NYP 
Sbjct:  63 TNNCPPPPSPPSPSG--SYYYSPPPPATYTYSSPPPPQGGNGGGSYYYYPPPADYKNYPA 120

Query: 398 PPPPNPIVPYFPFYYHTPPPGSGS 327
           PPPPNPIVPYFPFYY++PPP S S
Sbjct: 121 PPPPNPIVPYFPFYYYSPPPPSMS 144

>gi|255548545|ref|XP_002515329.1| nutrient reservoir, putative [Ricinus
        communis]

          Length = 154

 Score =  137 bits (344), Expect = 3e-030
 Identities = 65/116 (56%), Positives = 73/116 (62%), Gaps = 14/116 (12%)
 Frame = -2

Query: 653 MCSSCDNPCNPVPSYP------PPPPPSRPPPSPSTTTACPPPPSPPSSGGGSSYYYPPP 492
           MCS+CDNPC P+PS P      PPPPP   PP P+  + CPPPP  PSS  G+ YY PPP
Sbjct:   1 MCSACDNPCQPLPSPPPPVSTCPPPPPPPSPPPPAIVSDCPPPPVTPSS--GAYYYSPPP 58

Query: 491 SQSGGDKY---PPPYGDGGQGYYYPPPYSGNYPTPPPPNPIVPYFPFYYHTPPPGS 333
                  Y   PPP  DGG   +YP P  G+Y  PPPPNPIVPYFPFYY+ PPP S
Sbjct:  59 PAEPTYVYSSPPPPANDGG---FYPSPNYGSYQGPPPPNPIVPYFPFYYYNPPPSS 111

>gi|225678143|gb|EEH16427.1| conserved hypothetical protein [Paracoccidioides
        brasiliensis Pb03]

          Length = 675

 Score =  85 bits (208), Expect = 1e-014
 Identities = 43/98 (43%), Positives = 48/98 (48%)
 Frame = -2

Query: 623 PVPSYPPPPPPSRPPPSPSTTTACPPPPSPPSSGGGSSYYYPPPSQSGGDKYPPPYGDGG 444
           P  S P PPPP  PPP P+ +TA PPPP PP      S   PPP  SG    PPP+    
Sbjct: 482 PPVSAPAPPPPPPPPPGPAPSTAVPPPPPPPPPPPSGSAPPPPPPPSGSAPPPPPHPAPS 541

Query: 443 QGYYYPPPYSGNYPTPPPPNPIVPYFPFYYHTPPPGSG 330
             +  PPP S    +PPPP P     P     PP GSG
Sbjct: 542 ASHSPPPPLSAPSSSPPPPPPPASGVPPPPPPPPAGSG 579

>gi|115386684|ref|XP_001209883.1| conserved hypothetical protein [Aspergillus
        terreus NIH2624]

          Length = 600

 Score =  80 bits (195), Expect = 5e-013
 Identities = 46/108 (42%), Positives = 53/108 (49%), Gaps = 9/108 (8%)
 Frame = -2

Query: 623 PVPSYPPPPPP--SRPPPSPSTTTACPPPPSPPSSGGGSSYYYPPPSQSGGDKYPPPYGD 450
           P P  PPPP P  S PPP P  ++  PPPP PPSS G ++   PPP+  GG   PPP   
Sbjct: 421 PPPGPPPPPAPASSVPPPPPPPSSHIPPPPPPPSSTGPTAPPPPPPAPGGGAPPPPPPPP 480

Query: 449 G-GQGYYYPPPYSGNYPTPPPPNPIVPYFPFYYHTPPPGSGSDRFMGS 309
           G G     PPP +G  P PPPP    P        P P  G D  M +
Sbjct: 481 GAGAPPPPPPPGAGAPPPPPPPGGAAP------PLPKPAGGRDDLMAA 522

>gi|332025507|gb|EGI65670.1| hypothetical protein G5I_05770 [Acromyrmex
        echinatior]

          Length = 481

 Score =  78 bits (191), Expect = 1e-012
 Identities = 43/96 (44%), Positives = 44/96 (45%), Gaps = 3/96 (3%)
 Frame = -2

Query: 617 PSY---PPPPPPSRPPPSPSTTTACPPPPSPPSSGGGSSYYYPPPSQSGGDKYPPPYGDG 447
           PSY   PPPPPP  PPP P      PPPP  PS+    SY  PPP        PP Y   
Sbjct:  44 PSYSYPPPPPPPPPPPPPPPPPPPPPPPPGYPSTQPSYSYPPPPPPPPPPPPSPPGYPST 103

Query: 446 GQGYYYPPPYSGNYPTPPPPNPIVPYFPFYYHTPPP 339
              Y YPPP     P PPPP P  P  P     PPP
Sbjct: 104 QPSYSYPPPPPPPPPPPPPPPPPPPPPPPPPPPPPP 139

>gi|261190885|ref|XP_002621851.1| cytokinesis protein sepA [Ajellomyces
        dermatitidis SLH14081]

          Length = 1704

 Score =  77 bits (189), Expect = 2e-012
 Identities = 45/102 (44%), Positives = 47/102 (46%), Gaps = 7/102 (6%)
 Frame = -2

Query:  632 PCNPVPSYPPPPPPSRPPPSPSTTTACPPPPSPPSSGGGSSYYYPPPSQSGG--DKYPPP 459
            P  P P    PPPP  PPP P    A PPPP PP   GG     PPP   GG     PPP
Sbjct:  947 PPPPPPGIGGPPPPPPPPPPPPGVGAPPPPPPPPPGAGGPPPPPPPPPGVGGPPPPPPPP 1006

Query:  458 YGDGGQGYYYPPPYSGNYPTPPPPNPIVPYFPFYYHTPPPGS 333
             G GG     PPP     P PPPP P +   P     PPPG+
Sbjct: 1007 PGMGGPPLPPPPPPGMGGPPPPPPPPGMRGPP-----PPPGA 1043

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,992,423,490,450
Number of Sequences: 15229318
Number of Extensions: 1992423490450
Number of Successful Extensions: 533126175
Number of sequences better than 0.0: 0