GenBank blast output of UN29844


BLASTX 7.6.2

Query= UN29844 /QuerySize=1207
        (1206 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|297804114|ref|XP_002869941.1| hypothetical protein ARALYDRAFT...    362   6e-098
gi|30685055|ref|NP_849413.1| uncharacterized protein [Arabidopsi...    355   1e-095
gi|30685058|ref|NP_567597.2| uncharacterized protein [Arabidopsi...    355   1e-095
gi|21553974|gb|AAM63055.1| unknown [Arabidopsis thaliana]              139   1e-030
gi|3046695|emb|CAA18257.1| putative protein [Arabidopsis thaliana]     138   2e-030
gi|255583914|ref|XP_002532705.1| conserved hypothetical protein ...    108   2e-021
gi|224072416|ref|XP_002303724.1| predicted protein [Populus tric...     87   6e-015
gi|224057890|ref|XP_002299375.1| predicted protein [Populus tric...     87   6e-015
gi|115399896|ref|XP_001215537.1| conserved hypothetical protein ...     59   1e-006

>gi|297804114|ref|XP_002869941.1| hypothetical protein ARALYDRAFT_914630
        [Arabidopsis lyrata subsp. lyrata]

          Length = 351

 Score =  362 bits (929), Expect = 6e-098
 Identities = 209/306 (68%), Positives = 229/306 (74%), Gaps = 31/306 (10%)
 Frame = +3

Query:  342 MMRYQRVSPDVLPLTNGAKKPYLRPSPSRSTHEDTT----ITPNSIAGKGFNGGSFT--- 500
            MMRYQRVSPD LPLTNG KKPYLRPSPSR+T+EDTT    IT  SIAG+GFNGGS T   
Sbjct:    1 MMRYQRVSPDCLPLTNGGKKPYLRPSPSRATNEDTTTTTVITTTSIAGRGFNGGSCTTTT 60

Query:  501 ---SLDG--GGVRIRSS---QQTDPTPTTKRGGDVLLQWGQRKRSRVSRAEIRSTTTTAA 656
               SLDG   G R RS+   QQ DP+P ++RGGDVLLQWGQRKRSR SRAEIRSTTTT  
Sbjct:   61 NTSSLDGVPKGFRFRSTQQQQQQDPSP-SRRGGDVLLQWGQRKRSRASRAEIRSTTTT-T 118

Query:  657 AADDSSSSSGHGKIQSNKLLRRSVNPSMPPPPPPHPVSSNRSSNHRNGFV-GSKEIFLSR 833
             ADDSSSSSG GKIQS+KL RRS+NPSMPPPPP  P+ S RS+N RNGFV G +  F SR
Sbjct:  119 TADDSSSSSGQGKIQSSKLQRRSMNPSMPPPPPAPPIFSGRSTNPRNGFVIGKESFFPSR 178

Query:  834 NQEDRSANGSPSRNTNNGRTVSRSGGSKRSPPSPDQIEKISSVRDHHHQNQRQNGLDHNH 1013
            N EDRSANGSPSRN  NGR +SRSGGSKRSPPSPDQIEK SSVRDH     RQNG DH+H
Sbjct:  179 NLEDRSANGSPSRNNINGRMISRSGGSKRSPPSPDQIEKRSSVRDH-----RQNGFDHHH 233

Query: 1014 N-QQHQRVSRSESMAHAHPELETNNSGEREKATHVEVTEWPRSTLPCLGKRRRR-FLGYE 1187
            + QQHQRV+RSES A  HPE+E N  GEREKAT     EWPR  +    K +   FL  +
Sbjct:  234 HQQQHQRVNRSESTAQGHPEVEIN--GEREKATQ----EWPRIYIALSRKEKEEDFLVMK 287

Query: 1188 STKLPH 1205
             TKLPH
Sbjct:  288 GTKLPH 293

>gi|30685055|ref|NP_849413.1| uncharacterized protein [Arabidopsis thaliana]

          Length = 339

 Score =  355 bits (910), Expect = 1e-095
 Identities = 206/307 (67%), Positives = 223/307 (72%), Gaps = 32/307 (10%)
 Frame = +3

Query:  342 MMRYQRVSPDVLPLTNGAKKPYLRPSPSRSTHEDTT----ITPNSIAGKGFNGGSFT--- 500
            MMRYQRVSPD LPLTNG KKPYLRPSPSR+T+EDTT    IT  SIAG+GFNGGS T   
Sbjct:    1 MMRYQRVSPDCLPLTNGGKKPYLRPSPSRATNEDTTTTTVITTTSIAGRGFNGGSCTTTT 60

Query:  501 ----SLDG--GGVRIRSS----QQTDPTPTTKRGGDVLLQWGQRKRSRVSRAEIRSTTTT 650
                SLDG   G R RS+    QQ DP+P ++RGGDVLLQWGQRKRSR SRAEIRSTTT 
Sbjct:   61 NNTSSLDGVPKGFRFRSTQQQQQQQDPSP-SRRGGDVLLQWGQRKRSRASRAEIRSTTTI 119

Query:  651 AAAADDSSSSSGHGKIQSNKLLRRSVNPSMPPPPPPHPVSSNRSSNHRNGFV-GSKEIFL 827
               ADDSSSSSG GKIQSNK  RRS+NPSMPPPPP  P+ S RS+N RNGFV G +  F 
Sbjct:  120 TTTADDSSSSSGQGKIQSNKPQRRSMNPSMPPPPPAPPIFSGRSTNPRNGFVIGKESFFP 179

Query:  828 SRNQEDRSANGSPSRNTNNGRTVSRSGGSKRSPPSPDQIEKISSVRDHHHQNQRQNGLDH 1007
            SRN EDRSANGSPSRN  NGR +SRSGGSKRSPPSPDQIEK SSVRD     QRQNG DH
Sbjct:  180 SRNLEDRSANGSPSRNNINGRMISRSGGSKRSPPSPDQIEKRSSVRD-----QRQNGFDH 234

Query: 1008 NHNQQHQRVSRSESMAHAHPELETNNSGEREKATHVEVTEWPRSTLPCLGKRRRR-FLGY 1184
               QQHQRV+RSES A  H E+E N  GEREKAT     EWPR  +    K +   FL  
Sbjct:  235 -QQQQHQRVNRSESTAQGHQEVEIN--GEREKATQ----EWPRIYIALSRKEKEEDFLVM 287

Query: 1185 ESTKLPH 1205
            + TKLPH
Sbjct:  288 KGTKLPH 294

>gi|30685058|ref|NP_567597.2| uncharacterized protein [Arabidopsis thaliana]

          Length = 352

 Score =  355 bits (910), Expect = 1e-095
 Identities = 206/307 (67%), Positives = 223/307 (72%), Gaps = 32/307 (10%)
 Frame = +3

Query:  342 MMRYQRVSPDVLPLTNGAKKPYLRPSPSRSTHEDTT----ITPNSIAGKGFNGGSFT--- 500
            MMRYQRVSPD LPLTNG KKPYLRPSPSR+T+EDTT    IT  SIAG+GFNGGS T   
Sbjct:    1 MMRYQRVSPDCLPLTNGGKKPYLRPSPSRATNEDTTTTTVITTTSIAGRGFNGGSCTTTT 60

Query:  501 ----SLDG--GGVRIRSS----QQTDPTPTTKRGGDVLLQWGQRKRSRVSRAEIRSTTTT 650
                SLDG   G R RS+    QQ DP+P ++RGGDVLLQWGQRKRSR SRAEIRSTTT 
Sbjct:   61 NNTSSLDGVPKGFRFRSTQQQQQQQDPSP-SRRGGDVLLQWGQRKRSRASRAEIRSTTTI 119

Query:  651 AAAADDSSSSSGHGKIQSNKLLRRSVNPSMPPPPPPHPVSSNRSSNHRNGFV-GSKEIFL 827
               ADDSSSSSG GKIQSNK  RRS+NPSMPPPPP  P+ S RS+N RNGFV G +  F 
Sbjct:  120 TTTADDSSSSSGQGKIQSNKPQRRSMNPSMPPPPPAPPIFSGRSTNPRNGFVIGKESFFP 179

Query:  828 SRNQEDRSANGSPSRNTNNGRTVSRSGGSKRSPPSPDQIEKISSVRDHHHQNQRQNGLDH 1007
            SRN EDRSANGSPSRN  NGR +SRSGGSKRSPPSPDQIEK SSVRD     QRQNG DH
Sbjct:  180 SRNLEDRSANGSPSRNNINGRMISRSGGSKRSPPSPDQIEKRSSVRD-----QRQNGFDH 234

Query: 1008 NHNQQHQRVSRSESMAHAHPELETNNSGEREKATHVEVTEWPRSTLPCLGKRRRR-FLGY 1184
               QQHQRV+RSES A  H E+E N  GEREKAT     EWPR  +    K +   FL  
Sbjct:  235 -QQQQHQRVNRSESTAQGHQEVEIN--GEREKATQ----EWPRIYIALSRKEKEEDFLVM 287

Query: 1185 ESTKLPH 1205
            + TKLPH
Sbjct:  288 KGTKLPH 294

>gi|21553974|gb|AAM63055.1| unknown [Arabidopsis thaliana]

          Length = 174

 Score =  139 bits (349), Expect = 1e-030
 Identities = 81/128 (63%), Positives = 88/128 (68%), Gaps = 13/128 (10%)
 Frame = +3

Query:  825 LSRNQEDRSANGSPSRNTNNGRTVSRSGGSKRSPPSPDQIEKISSVRDHHHQNQRQNGLD 1004
            + RN EDRSANGSPSRN  NGR +SRSGGSKRSPPSPDQIEK SSVRD     QRQNG D
Sbjct:    1 MHRNLEDRSANGSPSRNNINGRMISRSGGSKRSPPSPDQIEKRSSVRD-----QRQNGFD 55

Query: 1005 HNHNQQHQRVSRSESMAHAHPELETNNSGEREKATHVEVTEWPRSTLPCLGKRRRR-FLG 1181
            H   QQHQRV+RSES A  H E+E N  GEREKAT     EWPR  +    K +   FL 
Sbjct:   56 H-QQQQHQRVNRSESTAQGHQEVEIN--GEREKATQ----EWPRIYIALSRKEKEEDFLV 108

Query: 1182 YESTKLPH 1205
             + TKLPH
Sbjct:  109 MKGTKLPH 116

>gi|3046695|emb|CAA18257.1| putative protein [Arabidopsis thaliana]

          Length = 287

 Score =  138 bits (346), Expect = 2e-030
 Identities = 80/129 (62%), Positives = 89/129 (68%), Gaps = 13/129 (10%)
 Frame = +3

Query:  822 FLSRNQEDRSANGSPSRNTNNGRTVSRSGGSKRSPPSPDQIEKISSVRDHHHQNQRQNGL 1001
            +++ N EDRSANGSPSRN  NGR +SRSGGSKRSPPSPDQIEK SSVRD     QRQNG 
Sbjct:  113 YVNGNLEDRSANGSPSRNNINGRMISRSGGSKRSPPSPDQIEKRSSVRD-----QRQNGF 167

Query: 1002 DHNHNQQHQRVSRSESMAHAHPELETNNSGEREKATHVEVTEWPRSTLPCLGKRRRR-FL 1178
            DH   QQHQRV+RSES A  H E+E N  GEREKAT     EWPR  +    K +   FL
Sbjct:  168 DH-QQQQHQRVNRSESTAQGHQEVEIN--GEREKATQ----EWPRIYIALSRKEKEEDFL 220

Query: 1179 GYESTKLPH 1205
              + TKLPH
Sbjct:  221 VMKGTKLPH 229

>gi|255583914|ref|XP_002532705.1| conserved hypothetical protein [Ricinus
        communis]

          Length = 367

 Score =  108 bits (269), Expect = 2e-021
 Identities = 79/193 (40%), Positives = 95/193 (49%), Gaps = 26/193 (13%)
 Frame = +3

Query: 429 STHEDTTITPNSIAGKGFNGGSFTSLDGG-------GVRIRSSQQTDPTPTTKRGGDVLL 587
           +T   TT T  S   KGF   S +S                S     P P+     D+ L
Sbjct:  48 TTSSTTTSTSTSFESKGFTFKSSSSRSQDHHHHHHQNASSPSHSDASPNPSPSPNKDLFL 107

Query: 588 QWGQRKRSRVSRAEIRSTTTTAAAADDSSSSSGHGKIQSNKLLRRS----VNPSMPPPPP 755
           QWGQ+KR+RVSR+EIR       A  D SSSS   K   NKL RR+      PSMPPPPP
Sbjct: 108 QWGQKKRARVSRSEIR-------ALADESSSSAQAKQPINKLPRRADSKFSTPSMPPPPP 160

Query: 756 PHPVSSNRSSNHRNGFVGSK-----EIFLSRNQEDRSANG--SPSRNT-NNGRTVSRSGG 911
           P P S   S+N+ N    SK          R  E RS  G  SPSRN+  + R VSRS  
Sbjct: 161 PPPPSQQHSANNNNSSSISKGRNFRSSLPHRILEKRSGAGNVSPSRNSGGSSRVVSRSTA 220

Query: 912 SKRSPPSPDQIEK 950
            KRSPP+P++I+K
Sbjct: 221 GKRSPPTPEKIDK 233

>gi|224072416|ref|XP_002303724.1| predicted protein [Populus trichocarpa]

          Length = 252

 Score =  87 bits (213), Expect = 6e-015
 Identities = 64/155 (41%), Positives = 79/155 (50%), Gaps = 24/155 (15%)
 Frame = +3

Query: 351 YQRVSPDVLPLTNGAKKPYLRPSPSRSTHEDTTITPNSIAGKGF---------NGGSFTS 503
           YQRVSPD +PL+NG K        S     ++T T     G  F         +  S TS
Sbjct:  71 YQRVSPDCVPLSNGKKPNGAENGRSIPNGFNSTSTNFDTKGLRFRSPSRNQDHHNNSTTS 130

Query: 504 LDGGGVRIRSSQQTD--PTPTTKRG--GDVLLQWGQRKRSRVSRAEIRSTTTTAAAADDS 671
                     +Q+ D  P P+  RG  GDVLLQWGQ+KR+RVSR+EIR       A  D 
Sbjct: 131 SPHSENNHNRTQRHDSSPGPSPSRGGNGDVLLQWGQKKRARVSRSEIR-------ALADE 183

Query: 672 SSSSGHGKIQSNKLLRRSVN----PSMPPPPPPHP 764
           SSSSG  +   N++ RR  N    P+MPPPPPP P
Sbjct: 184 SSSSGQARQPINRVPRRVDNKFSPPTMPPPPPPPP 218

>gi|224057890|ref|XP_002299375.1| predicted protein [Populus trichocarpa]

          Length = 195

 Score =  87 bits (213), Expect = 6e-015
 Identities = 66/181 (36%), Positives = 86/181 (47%), Gaps = 51/181 (28%)
 Frame = +3

Query: 351 YQRVSPDVLPLTNGAKKP--------------------------YLRPSPSRSTHEDTTI 452
           YQRVSPD +PL+NG KKP                          +  PS ++  H ++T 
Sbjct:  10 YQRVSPDCVPLSNG-KKPNGVENGRSIPNGFSSTSTNFETKAFRFRSPSRNQDHHNNSTT 68

Query: 453 TPNSIAGKGFNGGSFTSLDGGGVRIRSSQQTDPTPTTKRGGDVLLQWGQRKRSRVSRAEI 632
           +P        N  + T         R      P+P+    GDVLLQWGQ+KR+RVSR+EI
Sbjct:  69 SP----PHSDNSHNHTQ--------RHGTSPSPSPSRVGNGDVLLQWGQKKRARVSRSEI 116

Query: 633 RSTTTTAAAADDSSSSSGHGKIQSNKLLRR---SVNPSMPPPPPPHPVSSNR--SSNHRN 797
           R       A  D SSSSG  +   NK+ RR    ++PS  PPPPP P S  +  S+N R 
Sbjct: 117 R-------AFPDESSSSGQARQPINKIPRRVDNKLSPSSMPPPPPPPSSQQQSTSTNTRG 169

Query: 798 G 800
           G
Sbjct: 170 G 170

>gi|115399896|ref|XP_001215537.1| conserved hypothetical protein [Aspergillus
        terreus NIH2624]

          Length = 1064

 Score =  59 bits (141), Expect = 1e-006
 Identities = 44/190 (23%), Positives = 85/190 (44%), Gaps = 3/190 (1%)
 Frame = +3

Query:  636 STTTTAAAADDSSSSSGHGKIQSNKLLRRSVNPSMPPPPPPHPVSSNRSSNHRNGFVGSK 815
            ST T AA A ++SSSS   K+++   +  + NP+ P   PP PVS N+ S  +       
Sbjct:  133 STLTPAATATNTSSSSTDSKLEAEPSVENNHNPAKPDSTPPVPVSENKKSQCKKHPGKES 192

Query:  816 EIFLSRNQEDRSANGSPSRNTNNGRTVSRSGGSKRSPPSPDQIEKISSVRDHHHQNQRQN 995
            +    + +++  A+   + ++++  + S S  S +     +     S V   HH   R+ 
Sbjct:  193 DPSARKKKKNCKASSIATPSSDSSSSSSTSDSSDQDESDDEASSSASEVERKHH---RKR 249

Query:  996 GLDHNHNQQHQRVSRSESMAHAHPELETNNSGEREKATHVEVTEWPRSTLPCLGKRRRRF 1175
                    +H R  +S S   +  ELE++   + ++++  E T      +  L K ++  
Sbjct:  250 SKAKKKASKHSRKKKSSSRYQSDSELESDPDADEDESSMDEKTLKKLIQMLKLRKAKKNR 309

Query: 1176 LGYESTKLPH 1205
               +ST+ P+
Sbjct:  310 SKEDSTEDPY 319

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,531,826,676,690
Number of Sequences: 15229318
Number of Extensions: 3531826676690
Number of Successful Extensions: 836928167
Number of sequences better than 0.0: 0
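
The bit scores in the alignment headers can be reproduced from the raw scores shown in parentheses, using the gapped Lambda and K values reported above. The following is a minimal sketch, assuming the standard Karlin-Altschul normalization, bits = (Lambda * S - ln K) / ln 2. The function name `bit_score` and the `hits` list are illustrative only and are not part of any BLAST tool.

```python
import math

# Gapped statistical parameters copied from the report footer.
LAMBDA = 0.267
K = 0.041

def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score to a bit score.

    Assumes the standard Karlin-Altschul normalization:
    bits = (lambda * S - ln K) / ln 2
    """
    return (lam * raw_score - math.log(k)) / math.log(2)

# (raw score, reported bit score) pairs taken from the alignment headers above.
hits = [(929, 362), (910, 355), (349, 139), (346, 138),
        (269, 108), (213, 87), (141, 59)]

for raw, reported in hits:
    computed = bit_score(raw)
    print(f"raw={raw:4d}  computed={computed:7.2f}  reported={reported}")
```

Rounded to the nearest integer, each computed value agrees with the bit score printed in the report, which suggests the listed Lambda and K are the parameters used for scoring.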