GenBank BLAST output of UN22100


BLASTX 7.6.2

Query= UN22100 /QuerySize=672
        (671 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|1184277|gb|AAC37510.1| arabinogalactan protein [Brassica napus]     228   8e-058
gi|1184275|gb|AAC37509.1| arabinogalactan protein [Brassica napus]     225   5e-057
gi|145307801|gb|ABP57237.1| putative arabinogalactan protein [Br...    225   5e-057
gi|297832764|ref|XP_002884264.1| hypothetical protein ARALYDRAFT...    203   2e-050
gi|18395919|ref|NP_566146.1| arabinogalactan protein 11 [Arabido...    185   5e-045
gi|10880499|gb|AAG24279.1|AF195892_1 arabinogalactan protein [Ar...    182   5e-044
gi|4775268|emb|CAB42531.1| AGP6 protein [Arabidopsis thaliana]         132   6e-029
gi|297811557|ref|XP_002873662.1| hypothetical protein ARALYDRAFT...    132   6e-029
gi|15241392|ref|NP_196942.1| arabinogalactan protein 6 [Arabidop...    130   2e-028
gi|186510502|ref|NP_001118720.1| uncharacterized protein [Arabid...     74   2e-011
gi|255567969|ref|XP_002524962.1| copper ion binding protein, put...     72   6e-011
gi|225459205|ref|XP_002285739.1| PREDICTED: hypothetical protein...     67   2e-009
gi|147835211|emb|CAN61253.1| hypothetical protein VITISV_019773 ...     66   3e-009
gi|323449787|gb|EGB05672.1| hypothetical protein AURANDRAFT_7213...     58   1e-006
gi|326431946|gb|EGD77516.1| hypothetical protein PTSG_08614 [Sal...     56   3e-006

>gi|1184277|gb|AAC37510.1| arabinogalactan protein [Brassica napus]

          Length = 136

 Score =  228 bits (579), Expect = 8e-058
 Identities = 123/138 (89%), Positives = 125/138 (90%), Gaps = 4/138 (2%)
 Frame = +1

Query:  94 MARQFVVFGLLALVVATAFAAEAPSAAPTASPTKAPA--TKAPAAAPKSSTSASAPKASS 267
           MARQFVV  LLALVVATAFAAEAPSAAPTASPTKAP   TKAPAAAPKSS SASAPKASS
Sbjct:   1 MARQFVVLALLALVVATAFAAEAPSAAPTASPTKAPTTQTKAPAAAPKSS-SASAPKASS 59

Query: 268 PVAEEPTAEDDYTGNSPTESVEGPTVSSPPAPTPEASADGPSSDAPTPGPEVLDGSATNV 447
           PVAEEPTAEDDY   SP+ES EGPTVSSPPAPTPE   DGPSSDAPTPGPEVLDGSATNV
Sbjct:  60 PVAEEPTAEDDYAATSPSESAEGPTVSSPPAPTPEV-VDGPSSDAPTPGPEVLDGSATNV 118

Query: 448 KLSIAGTVAALGFFFFSL 501
           KLSIAGTVAA+GFFFFSL
Sbjct: 119 KLSIAGTVAAVGFFFFSL 136

>gi|1184275|gb|AAC37509.1| arabinogalactan protein [Brassica napus]

          Length = 136

 Score =  225 bits (572), Expect = 5e-057
 Identities = 121/138 (87%), Positives = 124/138 (89%), Gaps = 4/138 (2%)
 Frame = +1

Query:  94 MARQFVVFGLLALVVATAFAAEAPSAAPTASPTKAPA--TKAPAAAPKSSTSASAPKASS 267
           MARQFVV  LLALVVATAFAA+APSAAPT SPTKAP   TKAPAAAPKSS SASAPKASS
Sbjct:   1 MARQFVVLALLALVVATAFAADAPSAAPTTSPTKAPTTQTKAPAAAPKSS-SASAPKASS 59

Query: 268 PVAEEPTAEDDYTGNSPTESVEGPTVSSPPAPTPEASADGPSSDAPTPGPEVLDGSATNV 447
           PVAEEPTAEDDY   SP+ES EGPTVSSPPAPTPE   DGPSSDAPTPGPEVLDGSATNV
Sbjct:  60 PVAEEPTAEDDYAATSPSESAEGPTVSSPPAPTPEV-VDGPSSDAPTPGPEVLDGSATNV 118

Query: 448 KLSIAGTVAALGFFFFSL 501
           KLSIAGTVAA+GFFFFSL
Sbjct: 119 KLSIAGTVAAVGFFFFSL 136

>gi|145307801|gb|ABP57237.1| putative arabinogalactan protein [Brassica rapa
        subsp. chinensis]

          Length = 136

 Score =  225 bits (572), Expect = 5e-057
 Identities = 122/138 (88%), Positives = 124/138 (89%), Gaps = 4/138 (2%)
 Frame = +1

Query:  94 MARQFVVFGLLALVVATAFAAEAPSAAPTASPTKAPA--TKAPAAAPKSSTSASAPKASS 267
           MARQFVV  LLALVVATAFAAEAPSAAPTASPTKAP   TKAPAAAPKSS SASAPKASS
Sbjct:   1 MARQFVVLALLALVVATAFAAEAPSAAPTASPTKAPTTQTKAPAAAPKSS-SASAPKASS 59

Query: 268 PVAEEPTAEDDYTGNSPTESVEGPTVSSPPAPTPEASADGPSSDAPTPGPEVLDGSATNV 447
           PVAEEPTAEDDY   SP+ES  GPTVSSPPAPTPE   DGPSSDAPTPGPEVLDGSATNV
Sbjct:  60 PVAEEPTAEDDYAATSPSESAGGPTVSSPPAPTPEV-VDGPSSDAPTPGPEVLDGSATNV 118

Query: 448 KLSIAGTVAALGFFFFSL 501
           KLSIAGTVAA+GFFFFSL
Sbjct: 119 KLSIAGTVAAVGFFFFSL 136

>gi|297832764|ref|XP_002884264.1| hypothetical protein ARALYDRAFT_477345
        [Arabidopsis lyrata subsp. lyrata]

          Length = 140

 Score =  203 bits (516), Expect = 2e-050
 Identities = 108/140 (77%), Positives = 116/140 (82%), Gaps = 5/140 (3%)
 Frame = +1

Query:  85 KKKMARQFVVFGLLALVVATAFAAEAPSAAPTASPTKAPATKAPAAAPKSSTSASAPKAS 264
           KKKMARQFVVF LLAL VATAFAA+APSAAPTASPTKAP TKAPAAAPKS  SA+APKAS
Sbjct:   3 KKKMARQFVVFALLALAVATAFAADAPSAAPTASPTKAPTTKAPAAAPKS--SAAAPKAS 60

Query: 265 SPVAEEPTAEDDYTGNSPTESVEGPTVSSPPAPTPEA---SADGPSSDAPTPGPEVLDGS 435
           SPVAEEPT+EDDY+  +P++S E PTVSSPPAPTPEA   S DGPSSD PT       G+
Sbjct:  61 SPVAEEPTSEDDYSAATPSDSAEAPTVSSPPAPTPEADGPSTDGPSSDGPTAAESPKSGA 120

Query: 436 ATNVKLSIAGTVAALGFFFF 495
            TNVKLSIAGTVAA GFF F
Sbjct: 121 TTNVKLSIAGTVAAAGFFIF 140

>gi|18395919|ref|NP_566146.1| arabinogalactan protein 11 [Arabidopsis thaliana]

          Length = 136

 Score =  185 bits (469), Expect = 5e-045
 Identities = 102/139 (73%), Positives = 110/139 (79%), Gaps = 6/139 (4%)
 Frame = +1

Query:  94 MARQFVVFGLLALVVATAFAAEAPSAAPTASPTKAPATKAPAAAPKSSTSASAPKASSPV 273
           MAR FVV  LLAL V T FAA+APSAAPTASPTK+P TKAPAAAPKS  SA+APKASSPV
Sbjct:   1 MARLFVVVALLALAVGTVFAADAPSAAPTASPTKSP-TKAPAAAPKS--SAAAPKASSPV 57

Query: 274 AEEPTAEDDYTGNSPTESVEGPTVSSPPAPTPEA---SADGPSSDAPTPGPEVLDGSATN 444
           AEEPT EDDY+  SP++S E PTVSSPPAPTPEA   S+DGPSSD P        G+ TN
Sbjct:  58 AEEPTPEDDYSAASPSDSAEAPTVSSPPAPTPEADGPSSDGPSSDGPAAAESPKSGATTN 117

Query: 445 VKLSIAGTVAALGFFFFSL 501
           VKLSIAGTVAA GFF FSL
Sbjct: 118 VKLSIAGTVAAAGFFIFSL 136

>gi|10880499|gb|AAG24279.1|AF195892_1 arabinogalactan protein [Arabidopsis
        thaliana]

          Length = 135

 Score =  182 bits (460), Expect = 5e-044
 Identities = 100/138 (72%), Positives = 108/138 (78%), Gaps = 6/138 (4%)
 Frame = +1

Query:  97 ARQFVVFGLLALVVATAFAAEAPSAAPTASPTKAPATKAPAAAPKSSTSASAPKASSPVA 276
           AR FVV  LLAL V T FAA+APSAAPTASPTK+P TKAPA APKS  SA+APKASSPVA
Sbjct:   1 ARLFVVVALLALAVGTVFAADAPSAAPTASPTKSP-TKAPAVAPKS--SAAAPKASSPVA 57

Query: 277 EEPTAEDDYTGNSPTESVEGPTVSSPPAPTPEA---SADGPSSDAPTPGPEVLDGSATNV 447
           EEPT EDDY+  SP++S E PTVSSPPAPTPEA   S+DGPSSD P        G+ TNV
Sbjct:  58 EEPTPEDDYSAASPSDSAEAPTVSSPPAPTPEADGPSSDGPSSDGPAAAESPKSGATTNV 117

Query: 448 KLSIAGTVAALGFFFFSL 501
           KLSIAGTVAA GFF FSL
Sbjct: 118 KLSIAGTVAAAGFFIFSL 135

>gi|4775268|emb|CAB42531.1| AGP6 protein [Arabidopsis thaliana]

          Length = 150

 Score =  132 bits (330), Expect = 6e-029
 Identities = 73/120 (60%), Positives = 89/120 (74%), Gaps = 10/120 (8%)
 Frame = +1

Query: 160 APSAAPTASP---TKAPA--TKAPAAAPKSSTSASAPKASSPVAEEPTAEDDYTGNSPTE 324
           +P+AAPT +P   TKAP+  TKAPAAAPKSS SAS+PKASSP AE P  EDDY+ +SP++
Sbjct:  33 SPTAAPTKAPTATTKAPSAPTKAPAAAPKSS-SASSPKASSPAAEGPVPEDDYSASSPSD 91

Query: 325 SVEGPTVSSPPAPTPE--ASADGPSSDAPTPGPEVLDGSATNVKLSIAGTVAALGFFFFS 498
           S E PTVSSPPAPTP+  ++ADGPS       P+   G+ T  K S+ GTVAA+GFFFFS
Sbjct:  92 SAEAPTVSSPPAPTPDSTSAADGPSDGPTAESPK--SGAVTTAKFSVVGTVAAVGFFFFS 149

>gi|297811557|ref|XP_002873662.1| hypothetical protein ARALYDRAFT_488278
        [Arabidopsis lyrata subsp. lyrata]

          Length = 150

 Score =  132 bits (330), Expect = 6e-029
 Identities = 73/120 (60%), Positives = 89/120 (74%), Gaps = 10/120 (8%)
 Frame = +1

Query: 160 APSAAPT---ASPTKAPA--TKAPAAAPKSSTSASAPKASSPVAEEPTAEDDYTGNSPTE 324
           +P+A PT   A+PTKAPA  TKAPAAAPKSS SAS+PKASSP AE P  +DDY+ +SP+ 
Sbjct:  33 SPTATPTKAPAAPTKAPAAPTKAPAAAPKSS-SASSPKASSPTAEGPVPDDDYSASSPSG 91

Query: 325 SVEGPTVSSPPAPTPE--ASADGPSSDAPTPGPEVLDGSATNVKLSIAGTVAALGFFFFS 498
           S E PTVSSPPAPTP+  ++ADGPS       P+   G+ T  KLS+ GT+AA+GFFFFS
Sbjct:  92 SAEAPTVSSPPAPTPDSTSAADGPSDGPTAESPK--SGAVTTAKLSVVGTIAAVGFFFFS 149

>gi|15241392|ref|NP_196942.1| arabinogalactan protein 6 [Arabidopsis thaliana]

          Length = 150

 Score =  130 bits (326), Expect = 2e-028
 Identities = 72/120 (60%), Positives = 88/120 (73%), Gaps = 10/120 (8%)
 Frame = +1

Query: 160 APSAAPTASP---TKAPA--TKAPAAAPKSSTSASAPKASSPVAEEPTAEDDYTGNSPTE 324
           +P+AAPT +P   TKAP+  TKAPAAAPKSS SAS+PKASSP AE P  EDDY+ +SP++
Sbjct:  33 SPTAAPTKAPTATTKAPSAPTKAPAAAPKSS-SASSPKASSPAAEGPVPEDDYSASSPSD 91

Query: 325 SVEGPTVSSPPAPTPE--ASADGPSSDAPTPGPEVLDGSATNVKLSIAGTVAALGFFFFS 498
           S E PTVSSPPAPTP+  ++ADGPS       P+   G+ T  K S+ GTVA +GFFFFS
Sbjct:  92 SAEAPTVSSPPAPTPDSTSAADGPSDGPTAESPK--SGAVTTAKFSVVGTVATVGFFFFS 149

>gi|186510502|ref|NP_001118720.1| uncharacterized protein [Arabidopsis
        thaliana]

          Length = 171

 Score =  74 bits (179), Expect = 2e-011
 Identities = 48/127 (37%), Positives = 69/127 (54%), Gaps = 8/127 (6%)
 Frame = +1

Query: 139 ATAFAAEAPSA-APTASPTKAP--ATKAPAAAPKSSTSASAPKASS--PVAEEPTAEDDY 303
           A A + ++P+A APT+ PT AP  A K     P  S S+ +PK+SS    A  P +    
Sbjct:  44 AAAASPKSPTASAPTSPPTAAPTMAKKNSTGTPSPSPSSPSPKSSSAKTPASSPDSSSGD 103

Query: 304 TGNSPTESVEGPTVSSPPAPTPEASADGPSSDAPTPGPEVL--DGSATNVKLSIAGTV-A 474
           +   PT S + PT SSPPAPTPE S         + GPE       A+++ +S++G+V A
Sbjct: 104 SSEGPTSSSDAPTASSPPAPTPEMSPSSDDGTGASDGPEASAPAAGASSLVISVSGSVLA 163

Query: 475 ALGFFFF 495
           A+ + FF
Sbjct: 164 AVAWLFF 170

>gi|255567969|ref|XP_002524962.1| copper ion binding protein, putative [Ricinus
        communis]

          Length = 238

 Score =  72 bits (175), Expect = 6e-011
 Identities = 47/118 (39%), Positives = 57/118 (48%), Gaps = 3/118 (2%)
 Frame = +1

Query: 127 ALVVATAFAAEAPSAAPTASPTKAPATKAPAAAPKSSTSASAPKASSPVAEEPTAEDDYT 306
           A   A + A   PS AP A+ T  PAT AP  AP +    +AP A+   A  P+ +    
Sbjct: 113 AATPAKSPAKSPPSPAPVAA-TPPPATSAPETAPPAPVPVAAPTAADVPAPTPSKKKPKK 171

Query: 307 GNSPTESVEGPTVSSPPAPTPEASADGPSSDAPTPGPEVLDGSATNVKLSIAGTVAAL 480
            +  T    GP VSSPPAP  E  A GPS DA +PGP V D S      S+   V  L
Sbjct: 172 HSHATSPAPGPDVSSPPAPPME--APGPSLDASSPGPSVADDSGAETIRSLQKMVGGL 227

>gi|225459205|ref|XP_002285739.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 138

 Score =  67 bits (162), Expect = 2e-009
 Identities = 45/135 (33%), Positives = 65/135 (48%), Gaps = 13/135 (9%)
 Frame = +1

Query:  94 MARQFVVFGLLALVVATAFAAEAPSAAPTASPTKAPATKAPAAAPKSSTSASAPKASSPV 273
           MA   VV  ++  +VA +  A++P+++PTASPTK+P    P A P S T + +   S+P 
Sbjct:   1 MAYSSVVLVMMFALVAGSAFAQSPASSPTASPTKSPTASPPVATPPSPTPSPSTTPSAPA 60

Query: 274 AEEPTAEDDYTGNSPTESVEGPTVSSPPAP-----TPEASADGPSSDAPTPGPEVLDGSA 438
               T        SP  S   PT  SPPAP     +P  +   PS    TP  E    SA
Sbjct:  61 PAPSTV-------SPPASTPSPTSGSPPAPPTSPASPPTTGASPSPSISTPPTEPPSPSA 113

Query: 439 TNVK-LSIAGTVAAL 480
             +  ++  G+ AA+
Sbjct: 114 AALNTVTFTGSAAAV 128

>gi|147835211|emb|CAN61253.1| hypothetical protein VITISV_019773 [Vitis
        vinifera]

          Length = 138

 Score =  66 bits (160), Expect = 3e-009
 Identities = 40/110 (36%), Positives = 58/110 (52%), Gaps = 9/110 (8%)
 Frame = +1

Query:  94 MARQFVVFGLLALVVATAFAAEAPSAAPTASPTKAPATKAPAAAPKSSTSASAPKASSPV 273
           MA   VV  ++  +VA +  A++P+++PTASPTK+P    P A P S T + +   S+P 
Sbjct:   1 MAYSSVVLVMMFALVAGSAFAQSPASSPTASPTKSPTASPPVATPPSPTPSPSTTPSAPA 60

Query: 274 AEEPTAEDDYTGNSPTESVEGPTVSSPPAPTPEASADGPSSDAPTPGPEV 423
               T        SP  S   PT  SPPAP P + A  P++ A +P P +
Sbjct:  61 PAPSTV-------SPPASTPSPTSGSPPAP-PTSPASAPTTGA-SPSPSI 101

>gi|323449787|gb|EGB05672.1| hypothetical protein AURANDRAFT_72138 [Aureococcus
        anophagefferens]

          Length = 5032

 Score =  58 bits (138), Expect = 1e-006
 Identities = 39/117 (33%), Positives = 50/117 (42%), Gaps = 8/117 (6%)
 Frame = +1

Query:  151 AAEAPSAAPTASPTKAPATKAPAAAPKSSTSASAPKAS------SPVAEEPTAEDDYTGN 312
            A  AP+AAPTA+PT  P T +P AAP + T   AP  +      +P    PTA    T  
Sbjct: 2805 ATPAPTAAPTAAPTGTPTT-SPTAAP-TGTPTQAPTVTPGEPTLAPATPAPTAAPSVTPT 2862

Query:  313 SPTESVEGPTVSSPPAPTPEASADGPSSDAPTPGPEVLDGSATNVKLSIAGTVAALG 483
                    P  S  P P P  +   P+    TP P     +A  +  S+A T A  G
Sbjct: 2863 PAPSVTPTPAPSVSPTPAPTVTPGSPTLAPATPAPTAAPTAAPTLAPSVAPTAAPTG 2919

>gi|326431946|gb|EGD77516.1| hypothetical protein PTSG_08614 [Salpingoeca sp.
        ATCC 50818]

          Length = 516

 Score =  56 bits (134), Expect = 3e-006
 Identities = 29/81 (35%), Positives = 37/81 (45%)
 Frame = +1

Query: 175 PTASPTKAPATKAPAAAPKSSTSASAPKASSPVAEEPTAEDDYTGNSPTESVEGPTVSSP 354
           PT +P  AP    PAA P S    + P  S P A  P ++   +G  P  S   PT + P
Sbjct: 186 PTTAPPAAPPAAPPAAKPPSGPPPAGPPPSKPPAGPPPSKPAPSGPPPPPSGPPPTTAPP 245

Query: 355 PAPTPEASADGPSSDAPTPGP 417
           P   P      P++ APTP P
Sbjct: 246 PPKAPPGPPPPPATSAPTPPP 266

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,672,520,567,077
Number of Sequences: 15229318
Number of Extensions: 2672520567077
Number of Successful Extensions: 649898909
Number of sequences better than 0.0: 0