BLASTX 7.6.2
Query= UN16599 /QuerySize=534
(533 letters)
Database: GenBank nr;
15,229,318 sequences; 5,219,829,378 total letters
                                                                 Score     E
Sequences producing significant alignments:                      (bits)  Value
gi|297828307|ref|XP_002882036.1| DNA-binding family protein [Ara... 179 2e-043
gi|15225902|ref|NP_182109.1| AT hook motif DNA-binding family pr... 178 4e-043
gi|297820982|ref|XP_002878374.1| hypothetical protein ARALYDRAFT... 129 2e-028
gi|255636132|gb|ACU18409.1| unknown [Glycine max] 111 8e-023
gi|133907524|gb|ABO42262.1| AT-hook DNA-binding protein [Gossypi... 106 2e-021
gi|255541558|ref|XP_002511843.1| DNA binding protein, putative [... 106 2e-021
gi|255645533|gb|ACU23261.1| unknown [Glycine max] 103 2e-020
gi|224067876|ref|XP_002302577.1| predicted protein [Populus tric... 101 6e-020
gi|224130232|ref|XP_002320785.1| predicted protein [Populus tric... 101 6e-020
gi|225454180|ref|XP_002272142.1| PREDICTED: hypothetical protein... 100 1e-019
gi|255636324|gb|ACU18501.1| unknown [Glycine max] 100 1e-019
gi|6850898|emb|CAB71061.1| putative DNA-binding protein [Arabido... 99 3e-019
gi|30695388|ref|NP_191690.2| AT hook motif DNA-binding family pr... 99 3e-019
gi|225426407|ref|XP_002273061.1| PREDICTED: hypothetical protein... 84 1e-014
gi|297742528|emb|CBI34677.3| unnamed protein product [Vitis vini... 84 1e-014
gi|297809519|ref|XP_002872643.1| DNA-binding family protein [Ara... 67 1e-009
gi|255575345|ref|XP_002528575.1| DNA binding protein, putative [... 67 1e-009
gi|255644758|gb|ACU22881.1| unknown [Glycine max] 67 1e-009
gi|242095694|ref|XP_002438337.1| hypothetical protein SORBIDRAFT... 65 5e-009
gi|294461667|gb|ADE76393.1| unknown [Picea sitchensis] 65 5e-009
>gi|297828307|ref|XP_002882036.1| DNA-binding family protein [Arabidopsis lyrata
subsp. lyrata]
Length = 340
Score = 179 bits (453), Expect = 2e-043
Identities = 89/108 (82%), Positives = 95/108 (87%), Gaps = 4/108 (3%)
Frame = +1
Query: 4 TGNLSVSLASPDGRVIGGAIGGPLIAASPVQVIVGSFIWAAPKIKNKKREEEGSENVQDT 183
TGNLSVSLASPDGRVIGGAIGGPLIAASPVQVI+GSFIWAAPKIK+KKREEE SE VQDT
Sbjct: 233 TGNLSVSLASPDGRVIGGAIGGPLIAASPVQVIIGSFIWAAPKIKSKKREEEASEVVQDT 292
Query: 184 NDHH----HHQALDPVPQHTQGQNLIWSTGSRQMDMRHAHADIDLMRG 315
+DHH ++ + PVPQ QNLIWSTGSRQMDMRHAHADIDLMRG
Sbjct: 293 DDHHVLDNNNNTISPVPQQQPSQNLIWSTGSRQMDMRHAHADIDLMRG 340
>gi|15225902|ref|NP_182109.1| AT hook motif DNA-binding family protein
[Arabidopsis thaliana]
Length = 348
Score = 178 bits (450), Expect = 4e-043
Identities = 89/108 (82%), Positives = 95/108 (87%), Gaps = 4/108 (3%)
Frame = +1
Query: 4 TGNLSVSLASPDGRVIGGAIGGPLIAASPVQVIVGSFIWAAPKIKNKKREEEGSENVQDT 183
TGNLSVSLASPDGRVIGGAIGGPLIAASPVQVIVGSFIWAAPKIK+KKREEE SE VQ+T
Sbjct: 241 TGNLSVSLASPDGRVIGGAIGGPLIAASPVQVIVGSFIWAAPKIKSKKREEEASEVVQET 300
Query: 184 NDHH----HHQALDPVPQHTQGQNLIWSTGSRQMDMRHAHADIDLMRG 315
+DHH ++ + PVPQ QNLIWSTGSRQMDMRHAHADIDLMRG
Sbjct: 301 DDHHVLDNNNNTISPVPQQQPNQNLIWSTGSRQMDMRHAHADIDLMRG 348
>gi|297820982|ref|XP_002878374.1| hypothetical protein ARALYDRAFT_324562
[Arabidopsis lyrata subsp. lyrata]
Length = 346
Score = 129 bits (324), Expect = 2e-028
Identities = 77/114 (67%), Positives = 82/114 (71%), Gaps = 13/114 (11%)
Frame = +1
Query: 4 TGNLSVSLASPDGRVIGGAIGGPLIAASPVQVIVGSFIWAAPKIKNKKREEEGSENVQDT 183
TG+L+VSLAS DGRVIGG IGGPLIAAS VQVIVGSFIWA PK K KKREE SE+VQDT
Sbjct: 236 TGSLAVSLASSDGRVIGGGIGGPLIAASQVQVIVGSFIWAIPKGKIKKREET-SEDVQDT 294
Query: 184 ----NDHHHHQALDPVPQHTQGQNL------IWSTGSRQMDMRHAHADIDLMRG 315
N+ + PVPQ Q QNL IWSTGSR MDM H H DIDLMRG
Sbjct: 295 AALDNNDNTAATSPPVPQ--QSQNLVQTPVGIWSTGSRSMDMHHPHMDIDLMRG 346
>gi|255636132|gb|ACU18409.1| unknown [Glycine max]
Length = 341
Score = 111 bits (275), Expect = 8e-023
Identities = 63/115 (54%), Positives = 77/115 (66%), Gaps = 15/115 (13%)
Frame = +1
Query: 4 TGNLSVSLASPDGRVIGGAIGGPLIAASPVQVIVGSFIWAAPKIKNKKREEEGSENVQDT 183
TG LSVSLASPDGRVIGG +GG LIA+SPVQV+VGSF+W K KNKK+E V
Sbjct: 231 TGGLSVSLASPDGRVIGGGVGGVLIASSPVQVVVGSFLWGGSKTKNKKKESSEGAEVAVE 290
Query: 184 NDH---HHHQALDPVPQHTQGQNL--------IWSTGSRQMDMRHAHADIDLMRG 315
+DH H+ +L+ + +Q QNL WST SR +DMR++H DIDLMRG
Sbjct: 291 SDHQGVHNPVSLNSI---SQNQNLPPTPPSLSPWST-SRPLDMRNSHVDIDLMRG 341
>gi|133907524|gb|ABO42262.1| AT-hook DNA-binding protein [Gossypium hirsutum]
Length = 340
Score = 106 bits (263), Expect = 2e-021
Identities = 63/112 (56%), Positives = 74/112 (66%), Gaps = 13/112 (11%)
Frame = +1
Query: 4 TGNLSVSLASPDGRVIGGAIGGPLIAASPVQVIVGSFIWAAPKIKNKKREEEGSENVQDT 183
TG LSVSLASPDGR IGG +GG LIAASPVQVIVGSFIW K KNKK G E ++D+
Sbjct: 234 TGGLSVSLASPDGRAIGGGVGGMLIAASPVQVIVGSFIWGGSKAKNKK---GGQEGIKDS 290
Query: 184 NDHHHHQALDPVPQHTQGQNL-------IWSTGSRQMDMR-HAHADIDLMRG 315
+D + P P + QN+ +W GSR MDMR ++H DIDLMRG
Sbjct: 291 DDQMVDNLVAP-PGISPSQNMTPSAPAGVW-PGSRSMDMRNNSHVDIDLMRG 340
>gi|255541558|ref|XP_002511843.1| DNA binding protein, putative [Ricinus
communis]
Length = 340
Score = 106 bits (263), Expect = 2e-021
Identities = 63/113 (55%), Positives = 74/113 (65%), Gaps = 15/113 (13%)
Frame = +1
Query: 4 TGNLSVSLASPDGRVIGGAIGGPLIAASPVQVIVGSFIWAAPKIKNKKREEEGSENVQDT 183
TG LSVSLASPDGRVIGG +GG LIAASPVQVIVGSF+W K KNKK EG E +D+
Sbjct: 234 TGGLSVSLASPDGRVIGGGVGGMLIAASPVQVIVGSFLWGGSKAKNKK--GEGPEGARDS 291
Query: 184 NDHHHHQALDPVPQHT--QGQNL-------IWSTGSRQMDMRHAHADIDLMRG 315
+ H +PV + QNL +W GS+ +DMR+ H DIDLMRG
Sbjct: 292 D---HQTVENPVTPSSVPPSQNLTPTSSIGLW-PGSQSLDMRNTHVDIDLMRG 340
>gi|255645533|gb|ACU23261.1| unknown [Glycine max]
Length = 340
Score = 103 bits (255), Expect = 2e-020
Identities = 59/110 (53%), Positives = 72/110 (65%), Gaps = 6/110 (5%)
Frame = +1
Query: 4 TGNLSVSLASPDGRVIGGAIGGPLIAASPVQVIVGSFIWAAPKIKNKKRE-EEGSENVQD 180
TG LSVSLASPDGRV+GG +GG LIAASPVQVI+GSF W A K K KK+E EG+E +
Sbjct: 231 TGGLSVSLASPDGRVVGGGVGGVLIAASPVQVILGSFSWDASKTKIKKKEGSEGAEVALE 290
Query: 181 TNDHHHH-----QALDPVPQHTQGQNLIWSTGSRQMDMRHAHADIDLMRG 315
T+ H ++ P T +L SR +DMR++H DIDLMRG
Sbjct: 291 TDHQTVHNPVAVNSISPNQNLTPTSSLSPWPASRSLDMRNSHIDIDLMRG 340
>gi|224067876|ref|XP_002302577.1| predicted protein [Populus trichocarpa]
Length = 328
Score = 101 bits (250), Expect = 6e-020
Identities = 60/113 (53%), Positives = 72/113 (63%), Gaps = 15/113 (13%)
Frame = +1
Query: 4 TGNLSVSLASPDGRVIGGAIGGPLIAASPVQVIVGSFIWAAPKIKNKKREEEGSENVQDT 183
TG LSVSLASPDG VIGG +GG LIAASPVQVI GSF+W K KNKK EG+E +D+
Sbjct: 222 TGGLSVSLASPDGCVIGGGVGGVLIAASPVQVIAGSFLWGGSKTKNKK--VEGAEVARDS 279
Query: 184 NDHHHHQALDPVPQHTQGQNL---------IWSTGSRQMDMRHAHADIDLMRG 315
+ H +PV + +L +W GSR +DMR+ H DIDLMRG
Sbjct: 280 D---HQTVENPVTPTSVQPSLNLTPTSSMGVW-PGSRSVDMRNTHVDIDLMRG 328
>gi|224130232|ref|XP_002320785.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 101 bits (250), Expect = 6e-020
Identities = 62/113 (54%), Positives = 73/113 (64%), Gaps = 14/113 (12%)
Frame = +1
Query: 4 TGNLSVSLASPDGRVIGGAIGGPLIAASPVQVIVGSFIWAAPKIKNKKREEEGSENVQDT 183
+G LSVSLASPDGRVIGG +GG LIAASPVQVIVGSF+W K K ++ EG E +D+
Sbjct: 229 SGGLSVSLASPDGRVIGGGVGGVLIAASPVQVIVGSFLWGGGS-KTKNKKVEGPEGARDS 287
Query: 184 NDHHHHQALDPV-PQHTQ-GQNL-------IWSTGSRQMDMRHAHADIDLMRG 315
+ H +PV P Q QNL +W GSR +DMR H DIDLMRG
Sbjct: 288 D---HQTVENPVTPTSVQPSQNLTPTSSMGVW-PGSRPVDMRSTHVDIDLMRG 336
>gi|225454180|ref|XP_002272142.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 345
Score = 100 bits (248), Expect = 1e-019
Identities = 61/114 (53%), Positives = 69/114 (60%), Gaps = 16/114 (14%)
Frame = +1
Query: 4 TGNLSVSLASPDGRVIGGAIGGPLIAASPVQVIVGSFIWAAPKIKNKKREEEGSENVQDT 183
TG LSVSLASPDGRVIGG +GG L AASPVQVIVGSFIW K KNK E+V+
Sbjct: 238 TGGLSVSLASPDGRVIGGGVGGMLTAASPVQVIVGSFIWGNSKTKNKM-----GESVEGA 292
Query: 184 NDHHHHQALDPVPQHT---QGQNL-------IWSTGSRQMDMRHAHADIDLMRG 315
D P+ T QNL +W GSRQ+DMR++ DIDLMRG
Sbjct: 293 GDSERQTVDHPITTPTTVPASQNLTPASSMGVW-PGSRQLDMRNSPVDIDLMRG 345
>gi|255636324|gb|ACU18501.1| unknown [Glycine max]
Length = 191
Score = 100 bits (248), Expect = 1e-019
Identities = 58/107 (54%), Positives = 70/107 (65%), Gaps = 6/107 (5%)
Frame = +1
Query: 13 LSVSLASPDGRVIGGAIGGPLIAASPVQVIVGSFIWAAPKIKNKKRE-EEGSENVQDTND 189
LSVSLASPDGRVIGG +GG LIAASPVQVI+GSF W A K K KK+E EG+E +T+
Sbjct: 85 LSVSLASPDGRVIGGGVGGVLIAASPVQVILGSFSWGASKTKIKKKEGSEGAEVAMETDH 144
Query: 190 HHHH-----QALDPVPQHTQGQNLIWSTGSRQMDMRHAHADIDLMRG 315
H ++ P T +L SR +DMR++H DIDLMRG
Sbjct: 145 QTVHNPVAVNSISPNQNLTPTSSLSPWPASRPLDMRNSHIDIDLMRG 191
>gi|6850898|emb|CAB71061.1| putative DNA-binding protein [Arabidopsis thaliana]
Length = 348
Score = 99 bits (244), Expect = 3e-019
Identities = 58/87 (66%), Positives = 65/87 (74%), Gaps = 8/87 (9%)
Frame = +1
Query: 4 TGNLSVSLASPDGRVIGGAIGGPLIAASPVQVIVGSFIWAAPKIKNKKREEEGSENVQDT 183
TG+L+VSLASPDGRVIGG IGGPLIAAS VQVIVGSFIWA PK K KKREE SE+VQDT
Sbjct: 237 TGSLAVSLASPDGRVIGGGIGGPLIAASQVQVIVGSFIWAIPKGKIKKREET-SEDVQDT 295
Query: 184 -----NDHHHHQALDPVPQHTQGQNLI 249
N+ + PVPQ Q QN++
Sbjct: 296 DALENNNDNTAATSPPVPQ--QSQNIV 320
>gi|30695388|ref|NP_191690.2| AT hook motif DNA-binding family protein
[Arabidopsis thaliana]
Length = 354
Score = 99 bits (244), Expect = 3e-019
Identities = 58/87 (66%), Positives = 65/87 (74%), Gaps = 8/87 (9%)
Frame = +1
Query: 4 TGNLSVSLASPDGRVIGGAIGGPLIAASPVQVIVGSFIWAAPKIKNKKREEEGSENVQDT 183
TG+L+VSLASPDGRVIGG IGGPLIAAS VQVIVGSFIWA PK K KKREE SE+VQDT
Sbjct: 243 TGSLAVSLASPDGRVIGGGIGGPLIAASQVQVIVGSFIWAIPKGKIKKREET-SEDVQDT 301
Query: 184 -----NDHHHHQALDPVPQHTQGQNLI 249
N+ + PVPQ Q QN++
Sbjct: 302 DALENNNDNTAATSPPVPQ--QSQNIV 326
>gi|225426407|ref|XP_002273061.1| PREDICTED: hypothetical protein [Vitis
vinifera]
Length = 346
Score = 84 bits (205), Expect = 1e-014
Identities = 49/106 (46%), Positives = 65/106 (61%), Gaps = 4/106 (3%)
Frame = +1
Query: 7 GNLSVSLASPDGRVIGGAIGGPLIAASPVQVIVGSFIWAAPKIKNKKREE-EGSENVQDT 183
G +SVSL SPDG VIGG +GG LIAASPVQV+ SF++ K KNK +E +G +N
Sbjct: 242 GGISVSLCSPDGHVIGGGVGGMLIAASPVQVVACSFVYGGSKTKNKNGDEPKGDQNSGLQ 301
Query: 184 NDHHHHQALDPVPQHTQGQNL--IWSTGSRQMDMRHAHADIDLMRG 315
+ P+ QH + +W + SRQ+D+R+ H DIDL RG
Sbjct: 302 PSESAAPSSVPLGQHFAPISAMGMWPS-SRQVDLRNPHTDIDLTRG 346
>gi|297742528|emb|CBI34677.3| unnamed protein product [Vitis vinifera]
Length = 309
Score = 84 bits (205), Expect = 1e-014
Identities = 49/106 (46%), Positives = 65/106 (61%), Gaps = 4/106 (3%)
Frame = +1
Query: 7 GNLSVSLASPDGRVIGGAIGGPLIAASPVQVIVGSFIWAAPKIKNKKREE-EGSENVQDT 183
G +SVSL SPDG VIGG +GG LIAASPVQV+ SF++ K KNK +E +G +N
Sbjct: 205 GGISVSLCSPDGHVIGGGVGGMLIAASPVQVVACSFVYGGSKTKNKNGDEPKGDQNSGLQ 264
Query: 184 NDHHHHQALDPVPQHTQGQNL--IWSTGSRQMDMRHAHADIDLMRG 315
+ P+ QH + +W + SRQ+D+R+ H DIDL RG
Sbjct: 265 PSESAAPSSVPLGQHFAPISAMGMWPS-SRQVDLRNPHTDIDLTRG 309
>gi|297809519|ref|XP_002872643.1| DNA-binding family protein [Arabidopsis lyrata
subsp. lyrata]
Length = 353
Score = 67 bits (162), Expect = 1e-009
Identities = 43/105 (40%), Positives = 61/105 (58%), Gaps = 5/105 (4%)
Frame = +1
Query: 4 TGNLSVSLASPDGRVIGGAIGGPLIAASPVQVIVGSFIWAAPKIKNKKREEEGSENVQD- 180
TG +SVSLASPDGRV+GG +GG L+AASPVQV+VGSF+ + K+++ + + +
Sbjct: 247 TGGMSVSLASPDGRVVGGGLGGLLVAASPVQVVVGSFLAGTDQQDQKQKKNKHDFMLSNP 306
Query: 181 TNDHHHHQALDPVPQHTQGQ---NLIWSTGSRQMDMRHAHADIDL 306
T A D H+ N W T S D R+ H+DI++
Sbjct: 307 TAAIPISSAADHRTIHSVSSLPVNNTWQT-SLASDPRNKHSDINV 350
>gi|255575345|ref|XP_002528575.1| DNA binding protein, putative [Ricinus
communis]
Length = 408
Score = 67 bits (161), Expect = 1e-009
Identities = 32/54 (59%), Positives = 40/54 (74%)
Frame = +1
Query: 4 TGNLSVSLASPDGRVIGGAIGGPLIAASPVQVIVGSFIWAAPKIKNKKREEEGS 165
TG LSVSLASPDGRVIGG I G L+AASP+Q+++GSF+ K+ KK E +
Sbjct: 260 TGGLSVSLASPDGRVIGGGIAGLLLAASPIQIVMGSFMPNGYKVHKKKHHRENT 313
>gi|255644758|gb|ACU22881.1| unknown [Glycine max]
Length = 346
Score = 67 bits (161), Expect = 1e-009
Identities = 35/63 (55%), Positives = 44/63 (69%), Gaps = 3/63 (4%)
Frame = +1
Query: 4 TGNLSVSLASPDGRVIGGAIGGPLIAASPVQVIVGSFIWAA---PKIKNKKREEEGSENV 174
TG +SVSLASPDGRV+GG + G L+AASPVQV+VGSF+ ++ KIK K + G V
Sbjct: 213 TGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSFLPSSQQEQKIKKSKSSDYGVATV 272
Query: 175 QDT 183
T
Sbjct: 273 TPT 275
>gi|242095694|ref|XP_002438337.1| hypothetical protein SORBIDRAFT_10g012730
[Sorghum bicolor]
Length = 361
Score = 65 bits (156), Expect = 5e-009
Identities = 29/38 (76%), Positives = 34/38 (89%)
Frame = +1
Query: 4 TGNLSVSLASPDGRVIGGAIGGPLIAASPVQVIVGSFI 117
TG LSVSLA PDGRV+GGA+ GPL AASPVQV++GSF+
Sbjct: 261 TGGLSVSLAGPDGRVLGGAVAGPLTAASPVQVVIGSFL 298
>gi|294461667|gb|ADE76393.1| unknown [Picea sitchensis]
Length = 302
Score = 65 bits (156), Expect = 5e-009
Identities = 34/54 (62%), Positives = 39/54 (72%)
Frame = +1
Query: 4 TGNLSVSLASPDGRVIGGAIGGPLIAASPVQVIVGSFIWAAPKIKNKKREEEGS 165
TG LSVSLASPDGRV+GG + G L+AASPVQV+VGSFI K K + E S
Sbjct: 165 TGGLSVSLASPDGRVVGGGVAGMLMAASPVQVVVGSFISNGQKDPPKPAKPEPS 218
Database: GenBank nr
Posted date: Thu Sep 08 23:06:31 2011
Number of letters in database: 5,219,829,378
Number of sequences in database: 15,229,318
Lambda K H
0.267 0.041 0.140
Gapped
Lambda K H
0.267 0.041 0.140
Matrix: blosum62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,992,423,490,450
Number of Sequences: 15229318
Number of Extensions: 1992423490450
Number of Successful Extensions: 533126175
Number of sequences better than 0.0: 0