GenBank blast output of UN30806


BLASTX 7.6.2

Query= UN30806 /QuerySize=1200
        (1199 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|22326944|ref|NP_680186.1| hydroxyproline-rich glycoprotein fa...    440   2e-121
gi|297812311|ref|XP_002874039.1| hypothetical protein ARALYDRAFT...    431   1e-118
gi|224088643|ref|XP_002308507.1| predicted protein [Populus tric...    206   7e-051
gi|222625089|gb|EEE59221.1| hypothetical protein OsJ_11187 [Oryz...     91   3e-016
gi|242035531|ref|XP_002465160.1| hypothetical protein SORBIDRAFT...     91   3e-016
gi|125544240|gb|EAY90379.1| hypothetical protein OsI_11957 [Oryz...     91   3e-016
gi|218193008|gb|EEC75435.1| hypothetical protein OsI_11963 [Oryz...     91   3e-016
gi|296081744|emb|CBI20749.3| unnamed protein product [Vitis vini...     91   3e-016
gi|225429742|ref|XP_002280401.1| PREDICTED: hypothetical protein...     87   6e-015
gi|255550239|ref|XP_002516170.1| hypothetical protein RCOM_07081...     85   2e-014
gi|242046288|ref|XP_002461015.1| hypothetical protein SORBIDRAFT...     68   2e-009
gi|254577823|ref|XP_002494898.1| ZYRO0A12386p [Zygosaccharomyces...     56   9e-006

>gi|22326944|ref|NP_680186.1| hydroxyproline-rich glycoprotein family protein
        [Arabidopsis thaliana]

          Length = 302

 Score =  440 bits (1131), Expect = 2e-121
 Identities = 236/310 (76%), Positives = 256/310 (82%), Gaps = 20/310 (6%)
 Frame = +2

Query:  158 MVRSHKQQPRVSTTYIRSLVKQQLASSTTMTTTTTTADSSKTPTQTQTQTQTHKKQVRRR 337
            MVRS+KQ+PRVS+TYIRSLVKQQLA STTMTTTTTT  +  +    +TQTQTHKKQVRRR
Sbjct:    1 MVRSNKQEPRVSSTYIRSLVKQQLAYSTTMTTTTTTTTNDGS-GGGKTQTQTHKKQVRRR 59

Query:  338 LHTTRPYQERLLNMAEARREIVTALKQHRASMRQAARVPPPPPPHIP-----FSAPPPPP 502
            LHT+RPYQERLLNMAEARREIVTALKQHRASMRQA R+PPP PP  P     FS PPPPP
Sbjct:   60 LHTSRPYQERLLNMAEARREIVTALKQHRASMRQATRIPPPQPPPPPQPLNLFSPPPPPP 119

Query:  503 PPDPFSWSNPHLNFLLPNQPLGLNLN---FDDFIQTSSSSSSSSSSSSSSSSSSSSSSML 673
            PPDPFSW+NP LNFLLPNQPLGLNLN   F+DFIQTSS++SSSSSSS+SSSSSS     +
Sbjct:  120 PPDPFSWTNPSLNFLLPNQPLGLNLNFQDFNDFIQTSSTTSSSSSSSTSSSSSS-----I 174

Query:  674 PATNPHIYSSPSPPLPTF-AAQSDSVPHQP-LMEAENNVATSAWWSELMMKTVEPDVIKT 847
              TNPHIYSSPSPP PTF  A SDS P  P     ENNV TSAWWSELM+KTVEP++   
Sbjct:  175 FPTNPHIYSSPSPP-PTFTTATSDSAPQLPSSSNGENNVVTSAWWSELMLKTVEPEIKPE 233

Query:  848 DEEVAVAEDDVFPKFNDVMEFPPWLNPTDEELFHHPYNLT-HY-SSPHNPPLTCMEIGEI 1021
             EEV V EDDVFPKF+DVMEFP WLN T+EELF HPYNLT HY SSPHNPPL+CMEIGEI
Sbjct:  234 TEEVIVVEDDVFPKFSDVMEFPSWLNQTEEELF-HPYNLTDHYSSSPHNPPLSCMEIGEI 292

Query: 1022 EGMDGDDWLA 1051
            EGMDGDDWLA
Sbjct:  293 EGMDGDDWLA 302

>gi|297812311|ref|XP_002874039.1| hypothetical protein ARALYDRAFT_489044
        [Arabidopsis lyrata subsp. lyrata]

          Length = 292

 Score =  431 bits (1107), Expect = 1e-118
 Identities = 231/308 (75%), Positives = 248/308 (80%), Gaps = 26/308 (8%)
 Frame = +2

Query:  158 MVRSHKQQPRVSTTYIRSLVKQQLASSTTMTTTTTTADSSKTPTQTQTQTQTHKKQVRRR 337
            MVR +KQ+PRVS TYIRSLVKQQL SSTTMTTTTT         +TQTQTQTHKKQVRRR
Sbjct:    1 MVRPNKQEPRVSATYIRSLVKQQLTSSTTMTTTTTDGSGG---GKTQTQTQTHKKQVRRR 57

Query:  338 LHTTRPYQERLLNMAEARREIVTALKQHRASMRQAARVPPPPPPHIPFSAPPPPP----P 505
            LHT+RPYQERLLNMAEARREIVTALKQHRASMRQA R+PPP PP      PPPPP    P
Sbjct:   58 LHTSRPYQERLLNMAEARREIVTALKQHRASMRQATRIPPPQPP------PPPPPQPLNP 111

Query:  506 PDPFSWSNPHLNFLLPNQPLGLNLN---FDDFIQTSSSSSSSSSSSSSSSSSSSSSSMLP 676
            PDPFSW+NP LNFLLPNQPLGLNLN   F+DFIQTSS++SSSSSSS+SSSSSS     + 
Sbjct:  112 PDPFSWTNPSLNFLLPNQPLGLNLNFQDFNDFIQTSSTTSSSSSSSTSSSSSS-----IF 166

Query:  677 ATNPHIYSSPSPPLPTF-AAQSDSVPHQP-LMEAENNVATSAWWSELMMKTVEPDVIKTD 850
             TNPHIYSSPSPP PTF  A SDS P  P     ENNV TSAWWSELMMKTVEP++    
Sbjct:  167 PTNPHIYSSPSPP-PTFTTANSDSAPQPPSSSNGENNVITSAWWSELMMKTVEPEIKPET 225

Query:  851 EEVAVAEDDVFPKFNDVMEFPPWLNPTDEELFHHPYNLT-HYSSPHNPPLTCMEIGEIEG 1027
            EEVA  EDDVFPK +DVMEFP WLN T+EELF HPYNLT +YSSPHNPPL+CMEIGEIEG
Sbjct:  226 EEVAAVEDDVFPKLSDVMEFPSWLNQTEEELF-HPYNLTDNYSSPHNPPLSCMEIGEIEG 284

Query: 1028 MDGDDWLA 1051
            MDGDDW A
Sbjct:  285 MDGDDWFA 292

>gi|224088643|ref|XP_002308507.1| predicted protein [Populus trichocarpa]

          Length = 320

 Score =  206 bits (523), Expect = 7e-051
 Identities = 149/333 (44%), Positives = 184/333 (55%), Gaps = 54/333 (16%)
 Frame = +2

Query:  161 VRSHKQQPRVSTTYIRSLVKQQLASSTTMTTTTT----TADS---SKTPTQTQTQ-TQTH 316
            +R  K++P +S  YIRSLVK QL SS T          +ADS   SK     Q Q  Q H
Sbjct:    6 LRKLKEEPHLSGAYIRSLVK-QLTSSRTKDPMNPKGHGSADSDGLSKNQKSQQPQEPQPH 64

Query:  317 KKQVRRRLHTTRPYQERLLNMAEARREIVTALKQHRASMRQAARVPPPPP---------- 466
            KKQVRRRLHT+RPYQERLLNMAEARREIVTALK++        R+ P             
Sbjct:   65 KKQVRRRLHTSRPYQERLLNMAEARREIVTALKRN-------PRIYPSNSTDFSNYLDNF 117

Query:  467 PHIPFSAPPPPPPPDPFSWSN---------PHLNFLLPNQPLGLNLNFDDF--IQTS--- 604
             + PF+ PPP PPP PFSW +          ++NF LPNQ LGLNLNF DF  I T+   
Sbjct:  118 SYKPFTPPPPCPPPYPFSWPSSSILDPTTAENINFPLPNQTLGLNLNFHDFNNIDTTLYY 177

Query:  605 SSSSSSSSSSSSSSSSSSSSSMLPATN--PHIYSSPSPPLPTFAAQSDSVPHQPLMEAEN 778
            SS +  S  SSSS SSSS  S   AT   P + ++     P    ++DS   Q  ME  +
Sbjct:  178 SSDNPPSVYSSSSPSSSSFPSPFIATEEIPSVSNTCEGMPPAAFDETDSYGEQHQMEWND 237

Query:  779 --NVATSAWWSELMMKTVEPDVIKTDEEVAVAEDDVFPKFNDVMEFPPWLNPTDEELFHH 952
              N+ TSAWW + M  T        D EV   EDD    F  VMEFP WLN  D++ F+ 
Sbjct:  238 TMNLVTSAWWFKFMKTT------GLDPEVKSTEDDGCHPFEQVMEFPAWLNANDQQHFND 291

Query:  953 PYNLTHYSSPHNPPLTCMEIGEIEGMDGDDWLA 1051
             ++  ++   H+  L CM+IGEIEG+DG +WLA
Sbjct:  292 HFSQDYF---HDAALPCMDIGEIEGIDG-EWLA 320

>gi|222625089|gb|EEE59221.1| hypothetical protein OsJ_11187 [Oryza sativa
        Japonica Group]

          Length = 471

 Score =  91 bits (225), Expect = 3e-016
 Identities = 46/54 (85%), Positives = 47/54 (87%)
 Frame = +2

Query: 281 TPTQTQTQTQTHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKQHRASMRQA 442
           TP   Q Q Q HKKQVRRRLHT+RPYQERLLNMAEARREIVTALK HRASMRQA
Sbjct:  88 TPPPPQPQPQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKIHRASMRQA 141

>gi|242035531|ref|XP_002465160.1| hypothetical protein SORBIDRAFT_01g033040
        [Sorghum bicolor]

          Length = 447

 Score =  91 bits (225), Expect = 3e-016
 Identities = 46/57 (80%), Positives = 49/57 (85%)
 Frame = +2

Query: 272 SSKTPTQTQTQTQTHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKQHRASMRQA 442
           + +T T  Q Q Q HKKQVRRRLHT+RPYQERLLNMAEARREIVTALK HRASMRQA
Sbjct:  80 AQQTATPQQQQQQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKIHRASMRQA 136

>gi|125544240|gb|EAY90379.1| hypothetical protein OsI_11957 [Oryza sativa Indica
        Group]

          Length = 494

 Score =  91 bits (224), Expect = 3e-016
 Identities = 59/115 (51%), Positives = 72/115 (62%), Gaps = 10/115 (8%)
 Frame = +2

Query: 116 NDTREET*PSNIS-IMVRSHKQQPRVSTTYIRSLVKQQLASSTTMTTTTTTADSSKTPTQ 292
           N  +++  P ++S   +RS  +Q   S++  RS        +TTM T+         P Q
Sbjct:  29 NKQQQQQEPPHLSGAYIRSLVKQLSSSSSTARS----NKDHTTTMGTSKPHGCCHPQPDQ 84

Query: 293 TQTQT-----QTHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKQHRASMRQA 442
            + QT     Q HKKQVRRRLHT+RPYQERLLNMAEARREIVTALK HRASMRQA
Sbjct:  85 QEPQTTPPPPQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKIHRASMRQA 139

>gi|218193008|gb|EEC75435.1| hypothetical protein OsI_11963 [Oryza sativa Indica
        Group]

          Length = 261

 Score =  91 bits (224), Expect = 3e-016
 Identities = 59/115 (51%), Positives = 72/115 (62%), Gaps = 10/115 (8%)
 Frame = +2

Query: 116 NDTREET*PSNIS-IMVRSHKQQPRVSTTYIRSLVKQQLASSTTMTTTTTTADSSKTPTQ 292
           N  +++  P ++S   +RS  +Q   S++  RS        +TTM T+         P Q
Sbjct:  29 NKQQQQQEPPHLSGAYIRSLVKQLSSSSSTARS----NKDHTTTMGTSKPHGCCHPQPDQ 84

Query: 293 TQTQT-----QTHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKQHRASMRQA 442
            + QT     Q HKKQVRRRLHT+RPYQERLLNMAEARREIVTALK HRASMRQA
Sbjct:  85 QEPQTTPPPPQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKIHRASMRQA 139

>gi|296081744|emb|CBI20749.3| unnamed protein product [Vitis vinifera]

          Length = 290

 Score =  91 bits (224), Expect = 3e-016
 Identities = 50/80 (62%), Positives = 57/80 (71%), Gaps = 3/80 (3%)
 Frame = +2

Query: 173 KQQPRVSTTYIRSLVKQQLASSTTMTTTTTTADSSKTPTQTQTQTQTHKKQVRRRLHTTR 352
           K++P +S  YIRSLVKQ  +S T        +D+    TQ   Q Q HKKQVRRRLHT+R
Sbjct:  10 KEEPHLSGAYIRSLVKQLTSSRTKDPMNPKDSDTQAHQTQ---QPQQHKKQVRRRLHTSR 66

Query: 353 PYQERLLNMAEARREIVTAL 412
           PYQERLLNMAEARREIVTAL
Sbjct:  67 PYQERLLNMAEARREIVTAL 86

>gi|225429742|ref|XP_002280401.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 395

 Score =  87 bits (213), Expect = 6e-015
 Identities = 49/76 (64%), Positives = 54/76 (71%), Gaps = 2/76 (2%)
 Frame = +2

Query: 290 QTQTQTQTHKKQVRRRLHTTRPYQERLLNMAEARREIVTALKQHRASMRQA-ARVPPPPP 466
           QTQ Q Q HKKQVRRRLHT+RPYQERLLNMAEARREIVTALK HRA+M+QA  +      
Sbjct:  82 QTQ-QPQQHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKFHRAAMKQANEQQQQQQQ 140

Query: 467 PHIPFSAPPPPPPPDP 514
                 +PP   PP P
Sbjct: 141 QQQQQQSPPLQSPPQP 156

>gi|255550239|ref|XP_002516170.1| hypothetical protein RCOM_0708150 [Ricinus
        communis]

          Length = 425

 Score =  85 bits (209), Expect = 2e-014
 Identities = 46/79 (58%), Positives = 51/79 (64%)
 Frame = +2

Query: 206 RSLVKQQLASSTTMTTTTTTADSSKTPTQTQTQTQTHKKQVRRRLHTTRPYQERLLNMAE 385
           RS V     S   M         ++   Q Q   Q H+KQVRRRLHT+RPYQERLLNMAE
Sbjct:  54 RSCVDDDSFSGQNMAKFGEGVSENQQTQQPQQPQQQHRKQVRRRLHTSRPYQERLLNMAE 113

Query: 386 ARREIVTALKQHRASMRQA 442
           ARREIV ALK HRASM+QA
Sbjct: 114 ARREIVAALKFHRASMKQA 132


 Score =  70 bits (169), Expect = 8e-010
 Identities = 47/109 (43%), Positives = 60/109 (55%), Gaps = 14/109 (12%)
 Frame = +2

Query: 443 ARVPPPP---PPHIPFSAPPPPPPPDPFSWSNPHLNFLLPNQPLGLNLNFDDFIQTSSS- 610
           +  PPPP   PP  PF  P PP  P   S  N +LNF LPNQ LGLNLNF DF    +S 
Sbjct: 200 SHAPPPPSASPPPYPFCWPTPPVLP---STINENLNFPLPNQTLGLNLNFQDFNDLDTSL 256

Query: 611 --SSSSSSSSSSSSSSSSSSSMLPATNPHIYSSPSPPLPTFAAQSDSVP 751
             +S++ SS  SSSS SS SS  P+     +S  +  +P+ A   + +P
Sbjct: 257 YHNSNNPSSVYSSSSPSSFSSPSPS-----FSIATEDVPSVAKSQEGMP 300

>gi|242046288|ref|XP_002461015.1| hypothetical protein SORBIDRAFT_02g039220
        [Sorghum bicolor]

          Length = 432

 Score =  68 bits (165), Expect = 2e-009
 Identities = 33/38 (86%), Positives = 35/38 (92%)
 Frame = +2

Query: 317 KKQVRRRLHTTRPYQERLLNMAEARREIVTALKQHRAS 430
           K+Q RRR HT+RPYQERLLNMAEARREIVTALK HRAS
Sbjct:  37 KRQARRRTHTSRPYQERLLNMAEARREIVTALKIHRAS 74

>gi|254577823|ref|XP_002494898.1| ZYRO0A12386p [Zygosaccharomyces rouxii]

          Length = 743

 Score =  56 bits (134), Expect = 9e-006
 Identities = 35/98 (35%), Positives = 45/98 (45%), Gaps = 6/98 (6%)
 Frame = +2

Query: 455 PPPPPHIP------FSAPPPPPPPDPFSWSNPHLNFLLPNQPLGLNLNFDDFIQTSSSSS 616
           PPPPP  P       SAPPPPP P P + S+  L+   P  P+   L F   I    S  
Sbjct: 194 PPPPPAAPAPPAPSMSAPPPPPTPPPVAASSSSLSSPPPAPPMPGGLPFLGEINARRSDR 253

Query: 617 SSSSSSSSSSSSSSSSSMLPATNPHIYSSPSPPLPTFA 730
            ++   SS   SS+ +   P   P    S +PPLP+ A
Sbjct: 254 GATEGLSSGDGSSARAPSAPPAPPAPPPSTAPPLPSAA 291

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,630,913,215,204
Number of Sequences: 15229318
Number of Extensions: 3630913215204
Number of Successful Extensions: 856065349
Number of sequences better than 0.0: 0