GenBank blast output of UN22578


BLASTX 7.6.2

Query= UN22578 /QuerySize=1017
        (1016 letters)

Database: GenBank nr;
          15,229,318 sequences; 5,219,829,378 total letters
                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

gi|297839689|ref|XP_002887726.1| hypothetical protein ARALYDRAFT...    312   1e-082
gi|30699311|ref|NP_177941.2| uncharacterized protein [Arabidopsi...    305   1e-080
gi|12324248|gb|AAG52095.1|AC012680_6 unknown protein; 48924-4970...    191   1e-046
gi|297845248|ref|XP_002890505.1| hypothetical protein ARALYDRAFT...    130   5e-028
gi|30687784|ref|NP_173644.2| uncharacterized protein [Arabidopsi...    122   1e-025
gi|255628921|gb|ACU14805.1| unknown [Glycine max]                       91   3e-016
gi|226491005|ref|NP_001144049.1| hypothetical protein LOC1002768...     77   3e-012
gi|4325367|gb|AAD17363.1| contains similarity to Nicotiana tabac...     76   7e-012
gi|42566349|ref|NP_192630.2| uncharacterized protein [Arabidopsi...     76   7e-012
gi|224084674|ref|XP_002307383.1| predicted protein [Populus tric...     75   2e-011
gi|297813245|ref|XP_002874506.1| hypothetical protein ARALYDRAFT...     75   2e-011
gi|255545710|ref|XP_002513915.1| conserved hypothetical protein ...     74   3e-011
gi|224063225|ref|XP_002301050.1| predicted protein [Populus tric...     71   2e-010
gi|225459312|ref|XP_002284166.1| PREDICTED: hypothetical protein...     70   4e-010
gi|212722528|ref|NP_001131648.1| hypothetical protein LOC1001930...     68   2e-009
gi|242036645|ref|XP_002465717.1| hypothetical protein SORBIDRAFT...     60   4e-007
gi|226499716|ref|NP_001144712.1| hypothetical protein LOC1002777...     59   9e-007
gi|115451263|ref|NP_001049232.1| Os03g0191200 [Oryza sativa Japo...     58   2e-006

>gi|297839689|ref|XP_002887726.1| hypothetical protein ARALYDRAFT_895709
        [Arabidopsis lyrata subsp. lyrata]

          Length = 216

 Score =  312 bits (797), Expect = 1e-082
 Identities = 167/231 (72%), Positives = 180/231 (77%), Gaps = 17/231 (7%)
 Frame = +3

Query:  96 MVSRQEETLFSRKRDFTAYGEEFHNLFKKTKQEDQSHGTLQRAMLNEGSNSESMRSVTFD 275
           MVSRQE TL+SRKRDFT YGEEFHN FKK KQEDQS  TL     NE  NSESMRS+TFD
Sbjct:   1 MVSRQEGTLYSRKRDFTVYGEEFHNSFKKIKQEDQSQSTL----FNERPNSESMRSITFD 56

Query: 276 FELNLHTPLPSGWQKSIETKSYPRTSVDHRTRHPNDPLLVERPKMSLDLELNLSPSYSPT 455
           FEL+LHTPLPS WQ    TK Y RTS DHR  +  DP++V +PKMSLDLELNLSPS SP+
Sbjct:  57 FELHLHTPLPSDWQ----TKGYSRTSDDHRA-YTKDPVIVGQPKMSLDLELNLSPSGSPS 111

Query: 456 KAAT--EIEESSNDNENVSSSKGINLTSPRMKTTTVTGLKRSLSWLAFEGGDDDDVDCKE 629
           + AT  + E SSN NE V +SKG  LT+P  K      + RS SWLAFEGG DDDVD K 
Sbjct: 112 RTATIKKDEYSSNHNETV-TSKGKELTNPSKKR-----ISRSPSWLAFEGGGDDDVDHKG 165

Query: 630 QEMVTKVCMKCHMLVMLCTSTPVCPNCKFMHPHDHSSTKLFKPSNLLRLLC 782
           QEMVT VCMKCHMLVMLCTSTPVCPNCKFMHPHDHSSTKLFKPSNLLRLLC
Sbjct: 166 QEMVTTVCMKCHMLVMLCTSTPVCPNCKFMHPHDHSSTKLFKPSNLLRLLC 216

>gi|30699311|ref|NP_177941.2| uncharacterized protein [Arabidopsis thaliana]

          Length = 221

 Score =  305 bits (779), Expect = 1e-080
 Identities = 161/226 (71%), Positives = 176/226 (77%), Gaps = 10/226 (4%)
 Frame = +3

Query:  96 MVSRQEETLFSRKRDFTAYGEEFHNLFKKTKQEDQSHGTLQRAMLNEGSNSESMRSVTFD 275
           MVSRQE TL+SRKRDFT  GEEFHN FKK+KQEDQS  TL     NE   SESMRS+TFD
Sbjct:   1 MVSRQEGTLYSRKRDFTVCGEEFHNSFKKSKQEDQSQSTL----FNERPKSESMRSITFD 56

Query: 276 FELNLHTPLPSGWQKSIETKSYPRTSVDHRTRHPNDPLLVERPKMSLDLELNLSPSYSPT 455
           FEL+LHTPLPS WQ    TK Y RTS +HR  +P DP++  +PKMSLDLELNLSPS SP+
Sbjct:  57 FELHLHTPLPSDWQ----TKGYSRTSDEHRA-YPKDPVIFGQPKMSLDLELNLSPSGSPS 111

Query: 456 KAAT-EIEESSNDNENVSSSKGINLTSPRMKTTTVTGLKRSLSWLAFEGGDDDDVDCKEQ 632
           +  T + E SS  NE VSSSK   LT+   K+   TGL RS SWLAFEGGDD+DVD K Q
Sbjct: 112 RTTTKKYESSSIHNETVSSSKIKVLTNTSKKSIIGTGLSRSPSWLAFEGGDDNDVDHKGQ 171

Query: 633 EMVTKVCMKCHMLVMLCTSTPVCPNCKFMHPHDHSSTKLFKPSNLL 770
           EMVT VCMKCHMLVMLCTSTPVCPNCKFMHPHDHSSTKLFKPSNLL
Sbjct: 172 EMVTTVCMKCHMLVMLCTSTPVCPNCKFMHPHDHSSTKLFKPSNLL 217


 Score =  260 bits (662), Expect = 4e-067
 Identities = 135/204 (66%), Positives = 153/204 (75%), Gaps = 6/204 (2%)
 Frame = +3

Query: 174 FKKTKQEDQSHGTLQRAMLNEGSNSESMRSVTFDFELNLHTPLPSGWQKSIETKSYPRTS 353
           F  + ++ +     Q  + NE   SESMRS+TFDFEL+LHTPLPS WQ    TK Y RTS
Sbjct:  23 FHNSFKKSKQEDQSQSTLFNERPKSESMRSITFDFELHLHTPLPSDWQ----TKGYSRTS 78

Query: 354 VDHRTRHPNDPLLVERPKMSLDLELNLSPSYSPTKAAT-EIEESSNDNENVSSSKGINLT 530
            +HR  +P DP++  +PKMSLDLELNLSPS SP++  T + E SS  NE VSSSK   LT
Sbjct:  79 DEHRA-YPKDPVIFGQPKMSLDLELNLSPSGSPSRTTTKKYESSSIHNETVSSSKIKVLT 137

Query: 531 SPRMKTTTVTGLKRSLSWLAFEGGDDDDVDCKEQEMVTKVCMKCHMLVMLCTSTPVCPNC 710
           +   K+   TGL RS SWLAFEGGDD+DVD K QEMVT VCMKCHMLVMLCTSTPVCPNC
Sbjct: 138 NTSKKSIIGTGLSRSPSWLAFEGGDDNDVDHKGQEMVTTVCMKCHMLVMLCTSTPVCPNC 197

Query: 711 KFMHPHDHSSTKLFKPSNLLRLLC 782
           KFMHPHDHSSTKLFKPSNLLRLLC
Sbjct: 198 KFMHPHDHSSTKLFKPSNLLRLLC 221

>gi|12324248|gb|AAG52095.1|AC012680_6 unknown protein; 48924-49705 [Arabidopsis
        thaliana]

          Length = 198

 Score =  191 bits (485), Expect = 1e-046
 Identities = 101/139 (72%), Positives = 110/139 (79%), Gaps = 2/139 (1%)
 Frame = +3

Query: 372 HPNDPLLVE-RPKMSLDLELNLSPSYSPTKAAT-EIEESSNDNENVSSSKGINLTSPRMK 545
           H + PL  + + KMSLDLELNLSPS SP++  T + E SS  NE VSSSK   LT+   K
Sbjct:  60 HLHTPLPSDWQTKMSLDLELNLSPSGSPSRTTTKKYESSSIHNETVSSSKIKVLTNTSKK 119

Query: 546 TTTVTGLKRSLSWLAFEGGDDDDVDCKEQEMVTKVCMKCHMLVMLCTSTPVCPNCKFMHP 725
           +   TGL RS SWLAFEGGDD+DVD K QEMVT VCMKCHMLVMLCTSTPVCPNCKFMHP
Sbjct: 120 SIIGTGLSRSPSWLAFEGGDDNDVDHKGQEMVTTVCMKCHMLVMLCTSTPVCPNCKFMHP 179

Query: 726 HDHSSTKLFKPSNLLRLLC 782
           HDHSSTKLFKPSNLLRLLC
Sbjct: 180 HDHSSTKLFKPSNLLRLLC 198


 Score =  109 bits (272), Expect = 7e-022
 Identities = 55/74 (74%), Positives = 59/74 (79%), Gaps = 4/74 (5%)
 Frame = +3

Query:  96 MVSRQEETLFSRKRDFTAYGEEFHNLFKKTKQEDQSHGTLQRAMLNEGSNSESMRSVTFD 275
           MVSRQE TL+SRKRDFT  GEEFHN FKK+KQEDQS  TL     NE   SESMRS+TFD
Sbjct:   1 MVSRQEGTLYSRKRDFTVCGEEFHNSFKKSKQEDQSQSTL----FNERPKSESMRSITFD 56

Query: 276 FELNLHTPLPSGWQ 317
           FEL+LHTPLPS WQ
Sbjct:  57 FELHLHTPLPSDWQ 70

>gi|297845248|ref|XP_002890505.1| hypothetical protein ARALYDRAFT_335470
        [Arabidopsis lyrata subsp. lyrata]

          Length = 202

 Score =  130 bits (325), Expect = 5e-028
 Identities = 76/132 (57%), Positives = 85/132 (64%), Gaps = 11/132 (8%)
 Frame = +3

Query: 396 ERPKMSLDLELNLSPSYSPTKAATEIEESSNDNENVSSSKGINL--TSPRMKTTT-VTGL 566
           E  +  LDL LNLS S S     T +E+          SKG++L  T+P  K  +   GL
Sbjct:  79 EDHRTDLDLNLNLSSSSSFNMKKTIMEKD-------ECSKGVSLIMTTPSKKVRSGDIGL 131

Query: 567 KRSLSWLAFEGGDDDDVDCKEQEMVTKVCMKCHMLVMLCTSTPVCPNCKFMHPHDHSSTK 746
            RS SWLAFEG DDDD   K+QEMVT VCMKCHMLVMLC ST VCPNCKFMH  DHSSTK
Sbjct: 132 SRSPSWLAFEGDDDDDSQ-KKQEMVTTVCMKCHMLVMLCKSTLVCPNCKFMHHDDHSSTK 190

Query: 747 LFKPSNLLRLLC 782
            FK  NL +LLC
Sbjct: 191 QFKTLNLFKLLC 202

>gi|30687784|ref|NP_173644.2| uncharacterized protein [Arabidopsis thaliana]

          Length = 200

 Score =  122 bits (305), Expect = 1e-025
 Identities = 70/130 (53%), Positives = 81/130 (62%), Gaps = 9/130 (6%)
 Frame = +3

Query: 396 ERPKMSLDLELNLSPSYSPTKAATEIEESSNDNENVSSSKGINLTSPRMKTTT-VTGLKR 572
           E  +  LDL LNL  S S +   T +E+         S  G  + SP  K  +   GL R
Sbjct:  79 EDHRTDLDLNLNLYSSSSSSVKKTIMEKDE------CSKGGTVIISPSKKVRSGDIGLSR 132

Query: 573 SLSWLAFEGGDDDDVDCKEQEMVTKVCMKCHMLVMLCTSTPVCPNCKFMHPHDHSSTKLF 752
           S SWLAFEG DDD+   K+QEM+T VCMKCHMLVMLC ST VCPNCKFMH  DHSSTK F
Sbjct: 133 SPSWLAFEGDDDDNQ--KKQEMITTVCMKCHMLVMLCKSTLVCPNCKFMHHDDHSSTKQF 190

Query: 753 KPSNLLRLLC 782
           +  NL +LLC
Sbjct: 191 QTLNLFKLLC 200

>gi|255628921|gb|ACU14805.1| unknown [Glycine max]

          Length = 215

 Score =  91 bits (223), Expect = 3e-016
 Identities = 73/206 (35%), Positives = 95/206 (46%), Gaps = 24/206 (11%)
 Frame = +3

Query: 159 EFHNLFKKTKQEDQSHGTLQRAMLNEGSNSESM--RSVTFDFELNLHTPLPSG-WQKSIE 329
           EF    KK K E+          L EG   + +  R   FD EL+  TP  S  W++ + 
Sbjct:  20 EFEASSKKRKWEEP---------LTEGFFKDQIEKRKSVFDIELHPETPFSSDKWRQYLT 70

Query: 330 TKSYPRTSVDHRTRHPNDPLLVERP----KMSLDLELNLSPSYSPTKAATEIEESSNDNE 497
            +S      + RT   N     E P     MSL+L LNL+   SP K         N+ +
Sbjct:  71 IQSGQIQLCNTRTTTENPKKSSEEPPSSHHMSLNLGLNLT-CESPRKKEEGYGYEMNEKK 129

Query: 498 NVSSSKGINLTSPRMKTTTVTGLKRSLSWLAFEGGDDDDVDCKEQEMVTKVCMKCHMLVM 677
           +  SS G       +      G   S SWL+     +DD     +EMV  VCM+CHMLVM
Sbjct: 130 SSGSSPGGLSEREDLCKKDSDGKILSPSWLS---SSEDDY----KEMVATVCMRCHMLVM 182

Query: 678 LCTSTPVCPNCKFMHPHDHSSTKLFK 755
           LC S+P CPNCKFMHP D + +K  K
Sbjct: 183 LCKSSPSCPNCKFMHPPDQNPSKFLK 208

>gi|226491005|ref|NP_001144049.1| hypothetical protein LOC100276873 [Zea mays]

          Length = 237

 Score =  77 bits (189), Expect = 3e-012
 Identities = 61/194 (31%), Positives = 83/194 (42%), Gaps = 25/194 (12%)
 Frame = +3

Query: 237 GSNSESMRSVTFDFELNLH-TPLPSGWQKSIETKS----YPRTSVDHRTRHPNDPLLVER 401
           G + +    V    ELN    PLP  WQ+ ++ KS    Y  T    RT    DP   + 
Sbjct:  54 GGDEDDRGEVVDGIELNFDAAPLPPEWQRCLDIKSGQIHYYNTRTQKRT--CKDPRGHDG 111

Query: 402 PKMSLDLELNLSPSYSPTKAATEIEESSNDNENVSSSKGINLT-SPRM------KTTTVT 560
                       P Y   +   E E+S+N          +NLT  PR       KTT   
Sbjct: 112 -----------QPDYRAEEGQEEEEDSANYCAPPGLDLELNLTFEPRRVLAAHGKTTKRA 160

Query: 561 GLKRSLSWLAFEGGDDDDVDCKEQEMVTKVCMKCHMLVMLCTSTPVCPNCKFMHPHDHSS 740
             + +      E  DD D     +EMV  VC++CHMLVM+C ++P CPNCKF+HP   ++
Sbjct: 161 KQQPAEEEEEAEDDDDSDSSSSSREMVAGVCVRCHMLVMMCRASPACPNCKFLHPPSRAT 220

Query: 741 TKLFKPSNLLRLLC 782
                    L+LLC
Sbjct: 221 PPPPPLKLGLQLLC 234

>gi|4325367|gb|AAD17363.1| contains similarity to Nicotiana tabacum B-type
        cyclin (GB:D50737) [Arabidopsis thaliana]

          Length = 188

 Score =  76 bits (186), Expect = 7e-012
 Identities = 40/71 (56%), Positives = 50/71 (70%), Gaps = 7/71 (9%)
 Frame = +3

Query: 567 KRSLSWLAFEGGDD-----DDVDCK-EQEMVTKVCMKCHMLVMLCTSTPVCPNCKFMH-P 725
           K+S+S +AF+  DD     D V  + E+EMV +VCMKCHMLVMLC ++P CPNCKFMH P
Sbjct: 112 KKSMSRVAFDLDDDCCDRGDGVGGRSEEEMVARVCMKCHMLVMLCKASPACPNCKFMHSP 171

Query: 726 HDHSSTKLFKP 758
            D S + LF P
Sbjct: 172 EDTSLSLLFTP 182

>gi|42566349|ref|NP_192630.2| uncharacterized protein [Arabidopsis thaliana]

          Length = 212

 Score =  76 bits (186), Expect = 7e-012
 Identities = 40/71 (56%), Positives = 50/71 (70%), Gaps = 7/71 (9%)
 Frame = +3

Query: 567 KRSLSWLAFEGGDD-----DDVDCK-EQEMVTKVCMKCHMLVMLCTSTPVCPNCKFMH-P 725
           K+S+S +AF+  DD     D V  + E+EMV +VCMKCHMLVMLC ++P CPNCKFMH P
Sbjct: 136 KKSMSRVAFDLDDDCCDRGDGVGGRSEEEMVARVCMKCHMLVMLCKASPACPNCKFMHSP 195

Query: 726 HDHSSTKLFKP 758
            D S + LF P
Sbjct: 196 EDTSLSLLFTP 206

>gi|224084674|ref|XP_002307383.1| predicted protein [Populus trichocarpa]

          Length = 211

 Score =  75 bits (183), Expect = 2e-011
 Identities = 31/43 (72%), Positives = 35/43 (81%)
 Frame = +3

Query: 627 EQEMVTKVCMKCHMLVMLCTSTPVCPNCKFMHPHDHSSTKLFK 755
           ++EMV  VC +CHMLVMLC S+P CPNCKFMHP D SS KLFK
Sbjct: 169 QEEMVATVCTRCHMLVMLCRSSPACPNCKFMHPPDQSSPKLFK 211

>gi|297813245|ref|XP_002874506.1| hypothetical protein ARALYDRAFT_911063
        [Arabidopsis lyrata subsp. lyrata]

          Length = 208

 Score =  75 bits (183), Expect = 2e-011
 Identities = 43/89 (48%), Positives = 54/89 (60%), Gaps = 16/89 (17%)
 Frame = +3

Query: 540 MKTTTVTGL--KRSLSWLAFEGGDDDDVDC---------KEQEMVTKVCMKCHMLVMLCT 686
           M+   V GL  ++S+S +AF    D D DC          E+EMV +VCMKCHMLVMLC 
Sbjct: 122 MEKDLVLGLETRKSMSRVAF----DLDEDCCDRGGVGGGNEEEMVARVCMKCHMLVMLCK 177

Query: 687 STPVCPNCKFMH-PHDHSSTKLFKPSNLL 770
           ++P CPNCKFMH P D S + +F P   L
Sbjct: 178 ASPACPNCKFMHSPEDTSLSLIFTPKPTL 206

>gi|255545710|ref|XP_002513915.1| conserved hypothetical protein [Ricinus
        communis]

          Length = 216

 Score =  74 bits (180), Expect = 3e-011
 Identities = 31/47 (65%), Positives = 36/47 (76%)
 Frame = +3

Query: 615 VDCKEQEMVTKVCMKCHMLVMLCTSTPVCPNCKFMHPHDHSSTKLFK 755
           V+  ++EMV  VC +CHMLVMLC S+P CPNCKFMHP D S  KLFK
Sbjct: 163 VEGDQEEMVATVCTRCHMLVMLCKSSPACPNCKFMHPPDQSPPKLFK 209

>gi|224063225|ref|XP_002301050.1| predicted protein [Populus trichocarpa]

          Length = 211

 Score =  71 bits (173), Expect = 2e-010
 Identities = 29/43 (67%), Positives = 34/43 (79%)
 Frame = +3

Query: 627 EQEMVTKVCMKCHMLVMLCTSTPVCPNCKFMHPHDHSSTKLFK 755
           +QEM+  VC KCHMLVMLC S+P CPNCKF+HP + S  KLFK
Sbjct: 168 QQEMLATVCTKCHMLVMLCRSSPTCPNCKFLHPLEQSPPKLFK 210

>gi|225459312|ref|XP_002284166.1| PREDICTED: hypothetical protein [Vitis
        vinifera]

          Length = 211

 Score =  70 bits (171), Expect = 4e-010
 Identities = 29/42 (69%), Positives = 32/42 (76%)
 Frame = +3

Query: 630 QEMVTKVCMKCHMLVMLCTSTPVCPNCKFMHPHDHSSTKLFK 755
           QEMV  VC +CHMLVMLC S+P CPNCKFMHP D +   LFK
Sbjct: 163 QEMVAAVCKRCHMLVMLCKSSPTCPNCKFMHPPDQTPPNLFK 204

>gi|212722528|ref|NP_001131648.1| hypothetical protein LOC100193008 [Zea mays]

          Length = 232

 Score =  68 bits (164), Expect = 2e-009
 Identities = 29/63 (46%), Positives = 40/63 (63%)
 Frame = +3

Query: 594 EGGDDDDVDCKEQEMVTKVCMKCHMLVMLCTSTPVCPNCKFMHPHDHSSTKLFKPSNLLR 773
           E  +DDD     +EMV  VC++CHMLVM+C ++P CPNCKF+HP   ++         L+
Sbjct: 167 EEAEDDDSSSSSREMVAGVCVRCHMLVMMCRASPACPNCKFLHPPSRAAPPPPPLKLGLQ 226

Query: 774 LLC 782
           LLC
Sbjct: 227 LLC 229

>gi|242036645|ref|XP_002465717.1| hypothetical protein SORBIDRAFT_01g044450
        [Sorghum bicolor]

          Length = 244

 Score =  60 bits (145), Expect = 4e-007
 Identities = 26/53 (49%), Positives = 35/53 (66%), Gaps = 3/53 (5%)
 Frame = +3

Query: 567 KRSLSWLAFEGGDDDDVDCKEQEMVTKVCMKCHMLVMLCTSTPVCPNCKFMHP 725
           +R L   A E  +D       +EMV  VC++CHMLVM+C ++P CPNCKF+HP
Sbjct: 161 RRRLQLAAEEEAEDSS---GSREMVAGVCVRCHMLVMMCRASPACPNCKFLHP 210

>gi|226499716|ref|NP_001144712.1| hypothetical protein LOC100277752 [Zea mays]

          Length = 239

 Score =  59 bits (142), Expect = 9e-007
 Identities = 22/39 (56%), Positives = 29/39 (74%)
 Frame = +3

Query: 606 DDDVDCKEQEMVTKVCMKCHMLVMLCTSTPVCPNCKFMH 722
           +D      +EMV  VCM+CHMLVM+C ++P CPNCKF+H
Sbjct: 171 EDSSSGSSREMVAGVCMRCHMLVMMCRASPACPNCKFLH 209

>gi|115451263|ref|NP_001049232.1| Os03g0191200 [Oryza sativa Japonica Group]

          Length = 244

 Score =  58 bits (138), Expect = 2e-006
 Identities = 22/38 (57%), Positives = 28/38 (73%)
 Frame = +3

Query: 630 QEMVTKVCMKCHMLVMLCTSTPVCPNCKFMHPHDHSST 743
           +EMV  VC +CHMLVM+C   P CPNCKF+HP  + S+
Sbjct: 185 REMVAAVCARCHMLVMMCREWPACPNCKFVHPTANQSS 222

  Database: GenBank nr
    Posted date:  Thu Sep 08 23:06:31 2011
  Number of letters in database: 5,219,829,378
  Number of sequences in database:  15,229,318

Lambda     K     H
   0.267   0.041    0.140
Gapped
Lambda     K     H
   0.267   0.041    0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,672,520,567,077
Number of Sequences: 15229318
Number of Extensions: 2672520567077
Number of Successful Extensions: 649898909
Number of sequences better than 0.0: 0