GenBank BLAST output of UN14606


BLASTX 7.6.2

Query= UN14606 /QuerySize=766
        (765 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|297845248|ref|XP_002890505.1| hypothetical protein ARALYDRAFT...    252   4e-065
gi|30687784|ref|NP_173644.2| uncharacterized protein [Arabidopsi...    250   2e-064
gi|297839689|ref|XP_002887726.1| hypothetical protein ARALYDRAFT...    113   3e-023
gi|30699311|ref|NP_177941.2| uncharacterized protein [Arabidopsi...     94   2e-017
gi|12324248|gb|AAG52095.1|AC012680_6 unknown protein; 48924-4970...     94   2e-017
gi|225459312|ref|XP_002284166.1| PREDICTED: hypothetical protein...     89   8e-016
gi|224084674|ref|XP_002307383.1| predicted protein [Populus tric...     86   5e-015
gi|255545710|ref|XP_002513915.1| conserved hypothetical protein ...     85   9e-015
gi|224063225|ref|XP_002301050.1| predicted protein [Populus tric...     83   3e-014
gi|255628921|gb|ACU14805.1| unknown [Glycine max]                       79   6e-013
gi|4325367|gb|AAD17363.1| contains similarity to Nicotiana tabac...     64   2e-008
gi|42566349|ref|NP_192630.2| uncharacterized protein [Arabidopsi...     64   2e-008
gi|297813245|ref|XP_002874506.1| hypothetical protein ARALYDRAFT...     64   3e-008
gi|212722528|ref|NP_001131648.1| hypothetical protein LOC1001930...     62   8e-008
gi|226491005|ref|NP_001144049.1| hypothetical protein LOC1002768...     59   7e-007
gi|242036645|ref|XP_002465717.1| hypothetical protein SORBIDRAFT...     58   1e-006
gi|226499716|ref|NP_001144712.1| hypothetical protein LOC1002777...     56   4e-006

>gi|297845248|ref|XP_002890505.1| hypothetical protein ARALYDRAFT_335470
        [Arabidopsis lyrata subsp. lyrata]

          Length = 202

 Score =  252 bits (643), Expect = 4e-065
 Identities = 136/186 (73%), Positives = 153/186 (82%), Gaps = 8/186 (4%)
 Frame = -1

Query: 711 MFSLHDGTICPKKRDQMKVYGESFHGSFKRIKQEDQTQAKLEK-STTL----SKSENAKL 547
           MFSL DGT+C KKRD MKV+GESFHGSFKR KQEDQTQ KLEK STTL    SKSE+A L
Sbjct:   1 MFSLRDGTLCSKKRDHMKVFGESFHGSFKRSKQEDQTQTKLEKNSTTLFFQRSKSESAML 60

Query: 546 LKPEVQLYLDTKVHSETSEDHRTSLDLELNLCSSSS-SVVKRIMKKEESSKGKTLIM-SP 373
           +KP+VQL+L+ K  SET EDHRT LDL LNL SSSS ++ K IM+K+E SKG +LIM +P
Sbjct:  61 IKPDVQLHLEAKTLSETFEDHRTDLDLNLNLSSSSSFNMKKTIMEKDECSKGVSLIMTTP 120

Query: 372 SKKRKSGDKDVVSRSPSWLAFECENDTDEQKKQEMVTTVCMKCHMLVMLCKSTLVCPNCK 193
           SKK +SGD   +SRSPSWLAFE ++D D QKKQEMVTTVCMKCHMLVMLCKSTLVCPNCK
Sbjct: 121 SKKVRSGDIG-LSRSPSWLAFEGDDDDDSQKKQEMVTTVCMKCHMLVMLCKSTLVCPNCK 179

Query: 192 FTHPDD 175
           F H DD
Sbjct: 180 FMHHDD 185

>gi|30687784|ref|NP_173644.2| uncharacterized protein [Arabidopsis thaliana]

          Length = 200

 Score =  250 bits (638), Expect = 2e-064
 Identities = 131/185 (70%), Positives = 150/185 (81%), Gaps = 8/185 (4%)
 Frame = -1

Query: 711 MFSLHDGTICPKKRDQMKVYGESFHGSFKRIKQEDQTQAKLEKSTTL-----SKSENAKL 547
           MFS HDGT+C KKRD MKV+GESFHGSFKR KQ DQ Q K EK+TT      SKS++A L
Sbjct:   1 MFSPHDGTLCSKKRDHMKVFGESFHGSFKRSKQGDQKQTKFEKNTTTLFFQRSKSDSAML 60

Query: 546 LKPEVQLYLDTKVHSETSEDHRTSLDLELNLCSSSSSVVKR-IMKKEESSKGKTLIMSPS 370
           +KP+VQL+L+ K  SET EDHRT LDL LNL SSSSS VK+ IM+K+E SKG T+I+SPS
Sbjct:  61 IKPDVQLHLEAKTQSETFEDHRTDLDLNLNLYSSSSSSVKKTIMEKDECSKGGTVIISPS 120

Query: 369 KKRKSGDKDVVSRSPSWLAFECENDTDEQKKQEMVTTVCMKCHMLVMLCKSTLVCPNCKF 190
           KK +SGD   +SRSPSWLAFE ++D D QKKQEM+TTVCMKCHMLVMLCKSTLVCPNCKF
Sbjct: 121 KKVRSGDIG-LSRSPSWLAFEGDDD-DNQKKQEMITTVCMKCHMLVMLCKSTLVCPNCKF 178

Query: 189 THPDD 175
            H DD
Sbjct: 179 MHHDD 183

>gi|297839689|ref|XP_002887726.1| hypothetical protein ARALYDRAFT_895709
        [Arabidopsis lyrata subsp. lyrata]

          Length = 216

 Score =  113 bits (282), Expect = 3e-023
 Identities = 65/113 (57%), Positives = 71/113 (62%), Gaps = 3/113 (2%)
 Frame = -1

Query: 483 RTSLDLELNLC-SSSSSVVKRIMKKEESSKGKTLIMSPSKKRKSGDKDVVSRSPSWLAFE 307
           + SLDLELNL  S S S    I K E SS     + S  K+  +  K  +SRSPSWLAFE
Sbjct:  95 KMSLDLELNLSPSGSPSRTATIKKDEYSSNHNETVTSKGKELTNPSKKRISRSPSWLAFE 154

Query: 306 CENDTD-EQKKQEMVTTVCMKCHMLVMLCKSTLVCPNCKFTHPDDDHRPLSLF 151
              D D + K QEMVTTVCMKCHMLVMLC ST VCPNCKF HP  DH    LF
Sbjct: 155 GGGDDDVDHKGQEMVTTVCMKCHMLVMLCTSTPVCPNCKFMHP-HDHSSTKLF 206

>gi|30699311|ref|NP_177941.2| uncharacterized protein [Arabidopsis thaliana]

          Length = 221

 Score =  94 bits (232), Expect = 2e-017
 Identities = 55/86 (63%), Positives = 59/86 (68%), Gaps = 5/86 (5%)
 Frame = -1

Query: 405 SSKGKTLIMSPSKKRKSGDKDVVSRSPSWLAFECENDTD-EQKKQEMVTTVCMKCHMLVM 229
           SSK K L  + SKK   G    +SRSPSWLAFE  +D D + K QEMVTTVCMKCHMLVM
Sbjct: 130 SSKIKVL-TNTSKKSIIGTG--LSRSPSWLAFEGGDDNDVDHKGQEMVTTVCMKCHMLVM 186

Query: 228 LCKSTLVCPNCKFTHPDDDHRPLSLF 151
           LC ST VCPNCKF HP  DH    LF
Sbjct: 187 LCTSTPVCPNCKFMHP-HDHSSTKLF 211

>gi|12324248|gb|AAG52095.1|AC012680_6 unknown protein; 48924-49705 [Arabidopsis
        thaliana]

          Length = 198

 Score =  94 bits (232), Expect = 2e-017
 Identities = 55/86 (63%), Positives = 59/86 (68%), Gaps = 5/86 (5%)
 Frame = -1

Query: 405 SSKGKTLIMSPSKKRKSGDKDVVSRSPSWLAFECENDTD-EQKKQEMVTTVCMKCHMLVM 229
           SSK K L  + SKK   G    +SRSPSWLAFE  +D D + K QEMVTTVCMKCHMLVM
Sbjct: 107 SSKIKVL-TNTSKKSIIGTG--LSRSPSWLAFEGGDDNDVDHKGQEMVTTVCMKCHMLVM 163

Query: 228 LCKSTLVCPNCKFTHPDDDHRPLSLF 151
           LC ST VCPNCKF HP  DH    LF
Sbjct: 164 LCTSTPVCPNCKFMHP-HDHSSTKLF 188

>gi|225459312|ref|XP_002284166.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 211

 Score =  89 bits (218), Expect = 8e-016
 Identities = 58/136 (42%), Positives = 72/136 (52%), Gaps = 28/136 (20%)
 Frame = -1

Query: 525 YLDTKVHSETSEDHRT----------SLDLELNL-CSSSSSVVKRIMKKEESSKGKTLIM 379
           + +T+ H  TS D R           SL+LELNL C S    ++R   + + + GK    
Sbjct:  78 FYNTRTHKRTSRDPRASPEPPSPGPMSLELELNLPCDS----LRR--NRPDDNMGKWNSG 131

Query: 378 SPSKKRKSG----DKDVVSRSPSWLAFECENDTDEQKKQEMVTTVCMKCHMLVMLCKSTL 211
           SPS   K      +   V+ SPSWLAFE +N       QEMV  VC +CHMLVMLCKS+ 
Sbjct: 132 SPSHNSKESLQKKNPGGVACSPSWLAFEGDN-------QEMVAAVCKRCHMLVMLCKSSP 184

Query: 210 VCPNCKFTHPDDDHRP 163
            CPNCKF HP D   P
Sbjct: 185 TCPNCKFMHPPDQTPP 200

>gi|224084674|ref|XP_002307383.1| predicted protein [Populus trichocarpa]

          Length = 211

 Score =  86 bits (211), Expect = 5e-015
 Identities = 55/138 (39%), Positives = 70/138 (50%), Gaps = 25/138 (18%)
 Frame = -1

Query: 525 YLDTKVHSETSE-----------DHRTSLDLELNLCSSSSSVVKRIMK---KEESSKGKT 388
           + +T+ H  TS            DH  SLDLELNL     S  KR       +++S G  
Sbjct:  78 FYNTRTHKRTSRDPRKTPEPPSPDHHMSLDLELNL-PYDQSQRKRFANDHITKQNSGGSI 136

Query: 387 LIMSPSKKRKSGDKDV---VSRSPSWLAFECENDTDEQKKQEMVTTVCMKCHMLVMLCKS 217
                  K  S DK+    ++R PSWLA        E+ ++EMV TVC +CHMLVMLC+S
Sbjct: 137 RGFGDLFKDSSRDKESSGGLTRRPSWLA-------SERDQEEMVATVCTRCHMLVMLCRS 189

Query: 216 TLVCPNCKFTHPDDDHRP 163
           +  CPNCKF HP D   P
Sbjct: 190 SPACPNCKFMHPPDQSSP 207

>gi|255545710|ref|XP_002513915.1| conserved hypothetical protein [Ricinus
        communis]

          Length = 216

 Score =  85 bits (209), Expect = 9e-015
 Identities = 65/205 (31%), Positives = 91/205 (44%), Gaps = 27/205 (13%)
 Frame = -1

Query: 729 NRVEGLMFSLHDGTICPKKRDQMKVYGESFHGSFKRIKQED----QTQAKLEKSTTLSKS 562
           NR E ++  + D +   +K D+ + YG +F    K   Q      + + +LE    L   
Sbjct:  12 NRRETIISEV-DNSSKKRKWDEPQTYG-TFEKRSKPPNQNTKPIFEIELQLETPLPLEWQ 69

Query: 561 ENAKLLKPEVQLYLDTKVHSETSEDHR----------TSLDLELNL--CSSSSSVVKRIM 418
           +   +   E+  Y +T+    TS D R           SLDLEL+L  C S         
Sbjct:  70 QCLDIQSGEIHFY-NTRTKKRTSRDPRRSPEPPSPGHMSLDLELHLQPCESQRKNNANDH 128

Query: 417 KKEESSKGKTLIMSPSKKRKSGDKDVVSRSPSWLAFECENDTDEQKKQEMVTTVCMKCHM 238
               S +G   +   S K     +  + R PSW+         E  ++EMV TVC +CHM
Sbjct: 129 SLASSIQGFGDLFMDSSKEDKSSEGTIKRCPSWIV--------EGDQEEMVATVCTRCHM 180

Query: 237 LVMLCKSTLVCPNCKFTHPDDDHRP 163
           LVMLCKS+  CPNCKF HP D   P
Sbjct: 181 LVMLCKSSPACPNCKFMHPPDQSPP 205

>gi|224063225|ref|XP_002301050.1| predicted protein [Populus trichocarpa]

          Length = 211

 Score =  83 bits (204), Expect = 3e-014
 Identities = 54/138 (39%), Positives = 71/138 (51%), Gaps = 25/138 (18%)
 Frame = -1

Query: 525 YLDTKVHSETSE-----------DHRTSLDLELNL---CSSSSSVVKRIMKKEE---SSK 397
           + +T+ H  TS            DH  SL+LELNL    S   S     + K+    S +
Sbjct:  77 FYNTRTHKRTSRDPRGSPEPPSPDHDMSLELELNLPYDQSQRKSYTHDHITKQNPGGSIR 136

Query: 396 GKTLIMSPSKKRKSGDKDVVSRSPSWLAFECENDTDEQKKQEMVTTVCMKCHMLVMLCKS 217
           G   +   S  R +G    ++R PSWLAF       E+ +QEM+ TVC KCHMLVMLC+S
Sbjct: 137 GFGDLFKES-SRDNGSSGGLTRRPSWLAF-------EKDQQEMLATVCTKCHMLVMLCRS 188

Query: 216 TLVCPNCKFTHPDDDHRP 163
           +  CPNCKF HP +   P
Sbjct: 189 SPTCPNCKFLHPLEQSPP 206

>gi|255628921|gb|ACU14805.1| unknown [Glycine max]

          Length = 215

 Score =  79 bits (193), Expect = 6e-013
 Identities = 48/119 (40%), Positives = 61/119 (51%), Gaps = 10/119 (8%)
 Frame = -1

Query: 513 KVHSETSEDHRTSLDLELNLCSSSSSVVKR----IMKKEESSKGKTLIMSPSKKRKSGDK 346
           K   E    H  SL+L LNL   S    +      M +++SS      +S  +     D 
Sbjct:  90 KSSEEPPSSHHMSLNLGLNLTCESPRKKEEGYGYEMNEKKSSGSSPGGLSEREDLCKKDS 149

Query: 345 DVVSRSPSWLAFECENDTDEQKKQEMVTTVCMKCHMLVMLCKSTLVCPNCKFTHPDDDH 169
           D    SPSWL+      + E   +EMV TVCM+CHMLVMLCKS+  CPNCKF HP D +
Sbjct: 150 DGKILSPSWLS------SSEDDYKEMVATVCMRCHMLVMLCKSSPSCPNCKFMHPPDQN 202

>gi|4325367|gb|AAD17363.1| contains similarity to Nicotiana tabacum B-type
        cyclin (GB:D50737) [Arabidopsis thaliana]

          Length = 188

 Score =  64 bits (155), Expect = 2e-008
 Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 12/78 (15%)
 Frame = -1

Query: 369 KKRKSGDKDVV-----SRSPSWLAFECENDTDE-------QKKQEMVTTVCMKCHMLVML 226
           +K+K  +K  V      +S S +AF+ ++D  +       + ++EMV  VCMKCHMLVML
Sbjct:  96 RKKKMMEKGSVLGLETKKSMSRVAFDLDDDCCDRGDGVGGRSEEEMVARVCMKCHMLVML 155

Query: 225 CKSTLVCPNCKFTHPDDD 172
           CK++  CPNCKF H  +D
Sbjct: 156 CKASPACPNCKFMHSPED 173

>gi|42566349|ref|NP_192630.2| uncharacterized protein [Arabidopsis thaliana]

          Length = 212

 Score =  64 bits (155), Expect = 2e-008
 Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 12/78 (15%)
 Frame = -1

Query: 369 KKRKSGDKDVV-----SRSPSWLAFECENDTDE-------QKKQEMVTTVCMKCHMLVML 226
           +K+K  +K  V      +S S +AF+ ++D  +       + ++EMV  VCMKCHMLVML
Sbjct: 120 RKKKMMEKGSVLGLETKKSMSRVAFDLDDDCCDRGDGVGGRSEEEMVARVCMKCHMLVML 179

Query: 225 CKSTLVCPNCKFTHPDDD 172
           CK++  CPNCKF H  +D
Sbjct: 180 CKASPACPNCKFMHSPED 197

>gi|297813245|ref|XP_002874506.1| hypothetical protein ARALYDRAFT_911063
        [Arabidopsis lyrata subsp. lyrata]

          Length = 208

 Score =  64 bits (153), Expect = 3e-008
 Identities = 29/60 (48%), Positives = 40/60 (66%), Gaps = 6/60 (10%)
 Frame = -1

Query: 333 RSPSWLAFECENDTDEQ------KKQEMVTTVCMKCHMLVMLCKSTLVCPNCKFTHPDDD 172
           +S S +AF+ + D  ++       ++EMV  VCMKCHMLVMLCK++  CPNCKF H  +D
Sbjct: 134 KSMSRVAFDLDEDCCDRGGVGGGNEEEMVARVCMKCHMLVMLCKASPACPNCKFMHSPED 193

>gi|212722528|ref|NP_001131648.1| hypothetical protein LOC100193008 [Zea mays]

          Length = 232

 Score =  62 bits (149), Expect = 8e-008
 Identities = 24/49 (48%), Positives = 32/49 (65%)
 Frame = -1

Query: 309 ECENDTDEQKKQEMVTTVCMKCHMLVMLCKSTLVCPNCKFTHPDDDHRP 163
           E E+D      +EMV  VC++CHMLVM+C+++  CPNCKF HP     P
Sbjct: 168 EAEDDDSSSSSREMVAGVCVRCHMLVMMCRASPACPNCKFLHPPSRAAP 216

>gi|226491005|ref|NP_001144049.1| hypothetical protein LOC100276873 [Zea mays]

          Length = 237

 Score =  59 bits (141), Expect = 7e-007
 Identities = 25/52 (48%), Positives = 33/52 (63%), Gaps = 3/52 (5%)
 Frame = -1

Query: 309 ECENDTD---EQKKQEMVTTVCMKCHMLVMLCKSTLVCPNCKFTHPDDDHRP 163
           E E+D D       +EMV  VC++CHMLVM+C+++  CPNCKF HP     P
Sbjct: 170 EAEDDDDSDSSSSSREMVAGVCVRCHMLVMMCRASPACPNCKFLHPPSRATP 221

>gi|242036645|ref|XP_002465717.1| hypothetical protein SORBIDRAFT_01g044450
        [Sorghum bicolor]

          Length = 244

 Score =  58 bits (139), Expect = 1e-006
 Identities = 23/47 (48%), Positives = 30/47 (63%)
 Frame = -1

Query: 303 ENDTDEQKKQEMVTTVCMKCHMLVMLCKSTLVCPNCKFTHPDDDHRP 163
           E   D    +EMV  VC++CHMLVM+C+++  CPNCKF HP     P
Sbjct: 170 EEAEDSSGSREMVAGVCVRCHMLVMMCRASPACPNCKFLHPPSRAAP 216

>gi|226499716|ref|NP_001144712.1| hypothetical protein LOC100277752 [Zea mays]

          Length = 239

 Score =  56 bits (134), Expect = 4e-006
 Identities = 21/38 (55%), Positives = 28/38 (73%)
 Frame = -1

Query: 297 DTDEQKKQEMVTTVCMKCHMLVMLCKSTLVCPNCKFTH 184
           D+     +EMV  VCM+CHMLVM+C+++  CPNCKF H
Sbjct: 172 DSSSGSSREMVAGVCMRCHMLVMMCRASPACPNCKFLH 209

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,774,473,030,928
Number of Sequences: 15229318
Number of Extensions: 1774473030928
Number of Successful Extensions: 444405353
Number of sequences better than 0.0: 0
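
Note on the scores: each alignment above reports a bit score followed by a raw score in parentheses, for example "252 bits (643)". The sketch below shows how the two appear to be related, assuming the standard Karlin-Altschul normalization S' = (Lambda * S - ln K) / ln 2 and the gapped Lambda and K values listed in this report. The function name bit_score is hypothetical and is not part of any BLAST tool.

```python
import math

# Gapped statistical parameters as printed in the report footer.
LAMBDA = 0.267
K = 0.041


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score to a bit score.

    Assumes the standard Karlin-Altschul normalization:
        S' = (lambda * S - ln K) / ln 2
    """
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores taken from the parenthesized values in the alignments above.
for raw in (643, 638, 282, 232, 134):
    print(raw, round(bit_score(raw)))
```

Under that assumption, the rounded results for raw scores 643, 638, 282, 232 and 134 come out as 252, 250, 113, 94 and 56 bits, which agrees with the values reported for the corresponding hits.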