Godoy M, Franco-Zorrilla JM, Pérez-Pérez J, Oliveros JC, Lorenzo O & Solano R.

Here we report on the development of a protein-binding microarray (PBM11) containing all possible double-stranded 11-mers for the determination of DNA-binding specificities of TFs. The large number of sequences in the PBM11 allows accurate and high-throughput quantification of TF-binding sites, outperforming previous methods. We applied this tool to determine binding site specificities of two Arabidopsis TFs, MYC2 and ERF1, rendering the G-box and the GCC-box, respectively, as their highest-affinity binding sites.
In addition, we identified variants of the G-box recognized by MYC2 with high and medium affinity, whereas ERF1 only recognized GCC variants with low affinity, indicating that ERF1 binding to DNA has stricter base requirements than MYC2. Analysis of transcriptomic data revealed that high- and medium-affinity binding sites have biological significance, probably representing relevant cis-acting elements in vivo. Comparison of promoter sequences with putative orthologs from closely related species demonstrated a high degree of conservation of all the identified DNA elements. The combination of PBM11, transcriptomic data and phylogenomic footprinting provides a straightforward method for the prediction of biologically active cis-elements, and thus for identification of in vivo DNA targets of TFs.