DUX4 induces a transcriptome more characteristic of a less-differentiated cell state and inhibits myogenesis

ABSTRACT Skeletal muscle wasting in facioscapulohumeral muscular dystrophy (FSHD) results in substantial morbidity. On a disease-permissive chromosome 4qA haplotype, genomic and/or epigenetic changes at the D4Z4 macrosatellite repeat allow transcription of the DUX4 retrogene. Analysing transgenic mice carrying a human D4Z4 genomic locus from an FSHD-affected individual showed that DUX4 was transiently induced in myoblasts during skeletal muscle regeneration. Centromeric to the D4Z4 repeats is an inverted D4Z4 unit encoding DUX4c. Expression of DUX4, DUX4c and DUX4 constructs, including constitutively active, dominant-negative and truncated versions, revealed that DUX4 activates target genes to inhibit proliferation and differentiation of satellite cells, but that it also downregulates target genes to suppress myogenic differentiation. These transcriptional changes elicited by DUX4 in mouse have significant overlap with genes regulated by DUX4 in man. Comparison of DUX4 and DUX4c transcriptional perturbations revealed that DUX4 regulates genes involved in cell proliferation, whereas DUX4c regulates genes engaged in angiogenesis and muscle development, with both DUX4 and DUX4c modifying genes involved in urogenital development. Transcriptomic analysis showed that DUX4 operates through both target gene activation and repression to orchestrate a transcriptome characteristic of a less-differentiated cell state. Summary: DUX4 underlies pathogenesis in facioscapulohumeral muscular dystrophy. DUX4 acts mainly as a transcriptional activator that inhibits myogenesis by orchestrating a gene expression profile representative of a more stem-cell-like state.

Satellite cells are responsible for maintenance and repair of skeletal muscle (Relaix and Zammit, 2012), and muscle dystrophy implies a failure of this normal homeostatic and repair function (Morgan and Zammit, 2010). Consistent with this premise, myoblasts from FSHD-affected individuals are more susceptible to oxidative stress and show deregulation of MYOD (also known as MYOD1) (Winokur et al., 2003a,b), and differentiate into myotubes with abnormal morphology (Barro et al., 2008).
D4Z4 tandem repeats and the DUX4 ORF are evolutionarily conserved in placental mammals (Clapp et al., 2007; Giussani et al., 2012). Identification of DUX proteins in germline cells (Geng et al., 2012) suggests a role during development, but little is known of endogenous DUX4 function. Two important DUX4 isoforms are derived from the D4Z4 ORF: DUX4-fl (full-length), which is expressed in germline and stem cells, and the alternatively spliced DUX4-s (short) isoform, which is expressed in some somatic cells at low levels.
Mice transgenic for a D4Z4 repeat array from an FSHD individual recapitulate epigenetic phenomena consistent with a contracted FSHD locus. DUX4 is expressed in germline cells, and the protein can be detected in myoblasts and muscle, but there is no overt skeletal muscle pathology (Krom et al., 2013). Ectopic DUX4 expression results in impaired myogenesis (Dandapat et al., 2014) and gross muscle damage through p53-dependent apoptosis in other mouse models (Wallace et al., 2010).
How incomplete repression of DUX4 in somatic cells causes muscular dystrophy is enigmatic. DUX4 inhibits muscle differentiation and induces myoblast death (Bosnakovski et al., 2008a; Kowaljow et al., 2007). DUX4 also causes myoblasts to differentiate to produce myotubes with a morphology similar to the dysmorphic myotubes from FSHD individuals (Vanderplanck et al., 2011). However, a systematic comparison between DUX4, DUX4c and DUX4-s is lacking.
An incomplete and reversed D4Z4 unit is located 40 kb centromeric to the D4Z4 repeat array. This encodes DUX4c, which lacks the N-terminus and diverges from DUX4-fl in the C-terminal region but is otherwise homologous to DUX4-fl. DUX4c is detectable in FSHD muscle biopsies and FSHD-derived proliferating myoblasts, and increases in myotubes (Ansseau et al., 2009).
Here, we show that DUX4 is transiently elevated in myoblasts during muscle regeneration. To model FSHD, we used retroviral-mediated delivery of DUX4, in parallel with truncated, constitutively active and dominant-negative DUX4 versions, as well as with DUX4c. DUX4 activates transcriptional targets to suppress proliferation in satellite cells but can both activate and inhibit transcriptional targets to prevent myogenic differentiation. Transcriptomic analysis showed that DUX4 acts as a strong transcriptional activator but can also inhibit transcriptional targets. DUX4c increases transcription of some genes that are induced by DUX4 but also represses a significant proportion. In general, DUX4 orchestrates a transcriptome more characteristic of a less-differentiated cell state.

DUX4 is transiently expressed during skeletal muscle regeneration
Two transgenic mouse models for FSHD have been previously generated: control D4Z4-12.5 mice contain a human genomic region encompassing 12.5 D4Z4 units, whereas FSHD1 D4Z4-2.5 mice are transgenic for a contracted human repeat with 2.5 D4Z4 units obtained from an FSHD-affected individual. D4Z4-2.5 transgenic mice reveal low and variable levels of DUX4 in skeletal muscles (Krom et al., 2013).
Human DUX4, murine Duxbl, Myod and Myog (myogenin) expression was measured using real-time quantitative PCR (RT-qPCR) on RNA extracted from the other half of the regenerating gastrocnemius muscles (Fig. 1B; Fig. S1). Myog levels increased during the early phase of muscle regeneration in both D4Z4-2.5 and D4Z4-12.5 mice as expected. As shown previously (Wu et al., 2014), substantial Duxbl levels were detectable in mouse skeletal muscle, with levels enhanced during regeneration (Fig. S1A,B). DUX4 levels were negligible but increased in gastrocnemius at days 4 and 5 post-cardiotoxin injection of D4Z4-2.5 mice, compared to those in undamaged control muscles, before returning to pre-injury levels at days 6-10 (Fig. 1B). RT-qPCR analysis was also performed on RNA from further D4Z4-2.5 gastrocnemius muscles that had regenerated for 4 or 5 days. D4Z4-2.5 muscle at day 4 of regeneration showed a significant increase in DUX4 levels (n=3 mice), and the increase approached significance at day 5 (Fig. 1C), with control genes Pax7, Myod and Myog generally higher than in undamaged muscle, as expected. However, DUX4 transcripts could not generally be detected in either undamaged or regenerating muscle from control D4Z4-12.5 mice (Fig. 1D,E). Thus, in FSHD1 D4Z4-2.5 mice, DUX4 expression increases transiently during early muscle regeneration in vivo.

DUX4 is expressed in myoblasts during skeletal muscle regeneration
To determine if DUX4 is expressed in myoblasts during muscle regeneration, fluorescence-activated cell sorting (FACS) was performed in a pilot experiment at day 4 post cardiotoxin injection, the time point with the highest levels of DUX4 transcripts (Fig. 1C). DUX4 could be detected in RNA pooled from eight regenerating muscles and the FACS-isolated CD31−CD45−SCA1−α7-integrin+ population, which was identified as a myoblast population because these cells also expressed Pax7, Myod and Myog (Fig. S1C).
The λ42/L42 construct (van Deutekom et al., 1993) used to generate the D4Z4-2.5 transgenic mice was also transfected into wild-type murine satellite cells, and rare DUX4-protein-containing satellite cells could be identified (Fig. 1L). Thus, the native human contracted D4Z4 repeat containing 5′ and 3′ regions can be regulated in murine satellite cells to produce DUX4 protein in vivo and in vitro.

The mechanism of action of DUX4
DUX4 that is transcribed from the potential upstream Met-Lys-Gly (MKG) start site, or from the originally identified Met-Ala-Leu (MAL) start site, encodes a protein that inhibits myogenic differentiation and induces cell death (Snider et al., 2009). DUX4c is identical to DUX4 (MAL start) in the N-terminus and across the double homeodomain but has an alternative 32-amino-acid C-terminus. DUX4c and DUX4 proteins lacking the C-terminus inhibit differentiation but do not induce overt cell death (Ansseau et al., 2009; Bosnakovski et al., 2008a). Interestingly, the DUX4 C-terminal peptide alone inhibits muscle differentiation (Snider et al., 2009).
We used retroviral expression vectors encoding DUX4, DUX4c or a truncated DUX4 variant termed tMALDUX4 that initiates at the MAL start site and is intact across the two homeodomains but terminates at the Met-Gln-Gly (MQG) site, so lacks the C-terminal 75 amino acids of DUX4 or the 32 amino acids of DUX4c (Snider et al., 2009). We also used tMALDUX4 fused to a VP16 transactivation domain to generate the constitutively active tMALDUX4-VP16 construct, or the Engrailed repressor domain to create the dominant-negative tMALDUX4-ERD construct (Banerji et al., 2015a) (Fig. 2A).
To assess transcriptional activation of our DUX4 constructs, we used three DUX4 reporter constructs incorporating the ZSCAN4, RFPL4b or KHDC1L promoters driving a luciferase reporter gene (Ferreboeuf et al., 2014). DUX4 constructs and DUX4 reporters were co-transfected into murine C2C12 myoblasts, together with an RSV-β-galactosidase construct for normalisation of transfection efficiency. DUX4 and tMALDUX4-VP16 robustly activated all three DUX4 reporters compared to transfection with control plasmid, whereas tMALDUX4, DUX4c or tMALDUX4-ERD did not (Fig. 2B). tMALDUX4-VP16 activated the ZSCAN4 reporter more than DUX4, whereas RFPL4b and KHDC1L reporters were activated to similar extents by both constructs.
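The normalisation step in this reporter assay can be illustrated with a minimal sketch. It assumes that each well yields one raw luciferase reading and one β-galactosidase reading, and that reporter activity is expressed as the luciferase/β-galactosidase ratio relative to the control plasmid. The function names and all readings below are hypothetical and are not taken from the study.

```python
from statistics import mean


def normalised_activity(luciferase, beta_gal):
    """Luciferase signal corrected for transfection efficiency (assumed ratio)."""
    if beta_gal <= 0:
        raise ValueError("beta-galactosidase reading must be positive")
    return luciferase / beta_gal


def fold_activation(sample_wells, control_wells):
    """Mean normalised activity of a construct relative to the control plasmid."""
    sample = mean(normalised_activity(luc, gal) for luc, gal in sample_wells)
    control = mean(normalised_activity(luc, gal) for luc, gal in control_wells)
    return sample / control


# Hypothetical (luciferase, beta-galactosidase) readings, one tuple per well.
control_wells = [(100.0, 1.0), (220.0, 2.0)]
dux4_wells = [(3000.0, 1.5), (2100.0, 1.0)]
fold = fold_activation(dux4_wells, control_wells)
print(f"Fold activation over control: {fold:.1f}")
```

Dividing by the β-galactosidase signal before averaging means that a well with poor transfection does not drag down the apparent reporter activity of its construct.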

DUX4 alters cell morphology and causes apoptosis through transcriptional activation of target genes
Proteins encoded by each DUX4 construct could be identified in C2C12 myoblast nuclei using the 9A12 monoclonal antibody (Dixit et al., 2007). The viral vector has an IRES-eGFP module to mark transduced cells (Fig. 2C). C2C12 myoblasts that were transduced with DUX4 displayed a specific morphological phenotype, extending long cytoplasmic projections (Fig. 2C), as previously observed in the iC2C12-DUX4 immortalised cell line (Bosnakovski et al., 2008b). Expression of tMALDUX4-VP16 also caused long cytoplasmic projections, but tMALDUX4-ERD, tMALDUX4 or DUX4c did not perturb morphology, indicating that the projections are a result of transcriptional activation of target genes.
We next assayed apoptosis in plated satellite-cell-derived primary myoblasts by measuring caspase 3 and caspase 7 activity over the 48-h period after transduction with retroviruses encoding the DUX4 constructs. Caspase 3 and caspase 7 activity generally increased over time, as expected (Dee et al., 2002). However, further increased caspase activity was measured at 36 and 38 h post transduction in myoblasts expressing DUX4, and in those expressing tMALDUX4-VP16 at 38 h (Fig. 2D).

DUX4 maintains Pax7 expression through transcriptional activation of target genes
We first investigated the effects of the DUX4 constructs on early myogenesis. At 24 h after isolation, extensor digitorum longus (EDL) satellite cells that were associated with their myofibres were transduced with either retroviruses encoding DUX4, tMALDUX4, tMALDUX4-VP16, tMALDUX4-ERD, DUX4c or control retrovirus, and were cultured for 48 h before immunostaining (Fig. 3). For illustration, only co-immunostaining for eGFP and Pax7 ( Quiescent satellite cells express Pax7. Upon activation and differentiation of satellite cells, Pax7 expression decreases, with the few cells retaining Pax7 thought to be those that repopulate the stem cell pool (Zammit et al., 2004). A higher proportion of satellite cells expressing DUX4 and tMALDUX4-VP16 retained Pax7 (Fig. 3A,B). This suggests that DUX4 inhibits myogenic progression in satellite cells and causes retention of proteins that are normally associated with a more naïve stem cell and less-differentiated phenotype.

DUX4, DUX4c and tMALDUX4 inhibit entry into myogenic differentiation
Myod expression increases in proliferating satellite cells and drives the early stages of myogenic differentiation (Zammit et al., 2004). Expression of DUX4 constructs (except tMALDUX4-ERD) significantly reduced the proportion of satellite cells that contained MyoD (Fig. 3C,D).

The reduction in satellite-cell proliferation is due to DUX4 transcriptional activity
To examine the effects of DUX4 during proliferation, we used expanded primary myoblast cultures, which were transduced with the DUX4 retroviral constructs and DUX4c, and pulsed with EdU. There was a reduced proportion of satellite-cell-derived myoblasts containing EdU after transduction with DUX4, tMALDUX4-VP16 or DUX4c constructs, compared to transduction with control retrovirus (Fig. 4A). The proliferation rate was unaltered in myoblasts expressing tMALDUX4 or tMALDUX4-ERD (Fig. 4A). The nuclear pattern of the signal after co-labelling with antibodies against phosphorylated histones H1 and H3 can be used to identify stages of the cell cycle (Hendzel et al., 1997; Lu et al., 1994). DUX4 expression significantly reduced the proportion of satellite cells that were in all phases of the cell cycle and increased the proportion that were in G0 (Fig. 4B).

Activation or inhibition of DUX4 target genes suppresses myotube formation
We next examined the effects of DUX4 constructs on later phases of differentiation. Satellite-cell-derived myoblasts were cultured at high density to mitigate the anti-proliferative effects of some constructs, transduced with retroviruses encoding DUX4c or DUX4 constructs and switched to low-serum conditions to promote fusion. Co-immunostaining for eGFP and myosin heavy chain (MyHC) revealed that myoblasts that had been infected with control retrovirus readily formed large multinucleated myotubes (fusion index of ≥2 nuclei/myotube), which appear yellow-orange in merged images (Fig. 4C,D). Expression of any of the DUX4 constructs reduced myoblast fusion, resulting in numerous unfused eGFP-positive (green) myoblasts. MyHC-positive but eGFP-negative red myotubes, principally composed of non-transduced myoblasts, could also be identified (Fig. 4D). However, two categories of severity were identified: tMALDUX4 or DUX4c had less-profound effects on fusion than DUX4, tMALDUX4-VP16 or tMALDUX4-ERD, with cells in the latter category unable even to differentiate into unfused myocytes expressing MyHC (Fig. 4C,D). Thus, both transcriptional activation and suppression of DUX4 target genes reduce and/or prevent myoblast fusion, whereas loss of the C-terminus of DUX4 in tMALDUX4 and DUX4c lessens these inhibitory effects. The effects on satellite cell function of the four DUX4 constructs and DUX4c are summarised in Fig. 4E.

DUX4 is predominately an activator of transcription
Previous transcriptional profiling of gene expression changes induced by DUX4 constructs (GEO accession number GSE77100; http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE77100) has revealed that DUX4 in murine satellite cells recapitulates a transcriptional signature that has been identified in human FSHD muscle biopsies (Banerji et al., 2015a).

Concordance between DUX4-driven gene expression changes in mouse and humans
We also performed differential expression analyses using an empirical Bayes approach employing a P<0.05 significance threshold (Smyth, 2004) in comparing gene expression in the presence of DUX4, tMALDUX4, tMALDUX4-VP16, tMALDUX4-ERD and DUX4c independently to that under control retrovirus (Fig. 6A). A gene was considered upregulated by DUX4 if it was upregulated by both DUX4 and tMALDUX4-VP16 but downregulated by tMALDUX4-ERD, compared to control (Fig. 6A). A gene was considered downregulated by DUX4 if it was downregulated by both DUX4 and tMALDUX4-VP16 but upregulated by tMALDUX4-ERD, compared to control (Fig. 6A). Together, this generated a sample-specific biomarker for DUX4 activity by comparing expression of 291 DUX4-upregulated target genes to 344 DUX4-downregulated target genes in each sample (Table S1). DUX4-upregulated target genes should be at higher levels than downregulated target genes in samples expressing DUX4; thus, the difference between upregulated and downregulated target gene distribution is a biomarker for DUX4 expression.
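The selection rule and the resulting single-sample score can be sketched as follows. This is an illustration under stated assumptions, not the analysis code used here: each gene is assumed to carry a pre-computed call per construct (+1 for significantly up versus control, -1 for significantly down, 0 otherwise), and the "difference" between the two target-gene distributions is summarised as a difference in mean expression, which is one simple choice among several. Gene names and expression values are hypothetical.

```python
from statistics import mean


def dux4_target_sets(calls):
    """Split genes into DUX4-upregulated and DUX4-downregulated target sets.

    calls maps gene -> {"DUX4": s, "VP16": s, "ERD": s}, where s is +1, -1 or 0.
    """
    up, down = set(), set()
    for gene, c in calls.items():
        if c["DUX4"] == 1 and c["VP16"] == 1 and c["ERD"] == -1:
            up.add(gene)
        elif c["DUX4"] == -1 and c["VP16"] == -1 and c["ERD"] == 1:
            down.add(gene)
    return up, down


def dux4_score(expression, up, down):
    """Assumed summary: mean of up targets minus mean of down targets."""
    return mean(expression[g] for g in up) - mean(expression[g] for g in down)


# Hypothetical significance calls for four genes.
calls = {
    "geneA": {"DUX4": 1, "VP16": 1, "ERD": -1},
    "geneB": {"DUX4": 1, "VP16": 1, "ERD": 0},   # ERD not opposite: excluded
    "geneC": {"DUX4": -1, "VP16": -1, "ERD": 1},
    "geneD": {"DUX4": 1, "VP16": 1, "ERD": -1},
}
up, down = dux4_target_sets(calls)

# Hypothetical log-expression values for two samples.
sample_dux4 = {"geneA": 9.0, "geneB": 5.0, "geneC": 4.0, "geneD": 8.0}
sample_ctrl = {"geneA": 5.0, "geneB": 5.0, "geneC": 7.0, "geneD": 6.0}
score_dux4 = dux4_score(sample_dux4, up, down)
score_ctrl = dux4_score(sample_ctrl, up, down)
```

Requiring the opposite call under tMALDUX4-ERD is what restricts each set to genes whose direction of change tracks DUX4 transcriptional activity, rather than genes that merely respond to retroviral transduction.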
This DUX4 biomarker shows significant concordance between the genes changed by DUX4 in our microarray using primary mouse satellite cells and those identified as changed in C2C12 myoblasts (Bosnakovski et al., 2008b; Sharma et al., 2013) (Fig. 6B,C). Importantly, this DUX4 biomarker also shows significant concordance with changes elicited by DUX4 in human cells (Geng et al., 2012), and can be used to distinguish human myoblasts expressing DUX4 from those expressing either DUX4-s or control (Fig. 6D).
Fig. 2. (B) Transcriptional activity was assessed in C2C12 myoblasts by co-transfection of DUX4 constructs and DUX4c with three DUX4-responsive promoters driving luciferase reporter genes (pZSCAN4-luc, pKHDC1L-luc or RFPL4b-luc), together with β-galactosidase for transfection normalisation. Only DUX4 and tMALDUX4-VP16 strongly activated DUX4 reporters. Boxes represent the interquartile range (central 50% of data) with the median indicated by a line, and whiskers indicate the extremes of the distribution. (C) Retroviral (RV)-mediated expression of DUX4 constructs and DUX4c in C2C12 myoblasts that had been co-immunostained for eGFP (green) to identify transduced cells, actin (red) and DUX4 (white, inset panel) with DAPI (blue). DUX4- and tMALDUX4-VP16-transduced myoblasts had altered morphology, with long projections (arrows). Scale bars: 20 µm. (D) Apoptosis was assayed in plated satellite-cell-derived primary myoblasts by measuring caspase 3 and caspase 7 activity over 48 h post transduction with retroviruses encoding DUX4 constructs and DUX4c. Data are mean±s.e.m. from three experiments (B) or four mice (D), where an asterisk denotes significant difference (P<0.05) from GFP control using a Student's t-test.
Using the microarray analysis of human myoblasts that expressed DUX4 (Geng et al., 2012), we also determined the DUX4 transcription signature in humans comprising 123 upregulated and 253 downregulated genes (Table S2). This human DUX4 signature clearly separated our DUX4- and tMALDUX4-VP16-expressing mouse myoblasts from those expressing DUX4c and tMALDUX4 (Fig. 6E), and also those expressing tMALDUX4-ERD from DUX4c- and tMALDUX4-expressing myoblasts (Fig. 6E). Thus, genes controlled by DUX4 in mouse overlap with those regulated by DUX4 in humans.

DUX4 increases transcriptomic measures of stem cells
Signalling entropy is a combined single-sample measure of intracellular signalling promiscuity and intercellular heterogeneity, derived from integration of gene expression data with a protein interaction network. Signalling entropy is a powerful measure of cell differentiation potential, valid across multiple lineages and in pathology, and we have shown previously that it outperforms other popular methodologies (Banerji et al., 2013, 2015b). The assumption is that stem cells have many options with respect to fate, and so the diversity of genes expressed is high, giving stem cells a high signalling entropy. In contrast, differentiated cells have a more limited and defined gene expression profile in order to perform their functions, so have low signalling entropy. Thus, signalling entropy drops progressively as cells progress from stem cells to differentiated cells, and so indicates the position of a cell population on this spectrum (Banerji et al., 2013).
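The idea can be made concrete with a small sketch. It assumes the entropy-rate formulation, in which signalling is modelled as a random walk on the protein interaction network, the probability of stepping from protein i to a neighbour j is proportional to the expression of j, and the entropy of these step distributions is averaged over the stationary distribution of the walk. The network is assumed to be undirected with no isolated nodes, and expression values are assumed to be strictly positive. The toy networks and values are hypothetical, and this is not the implementation used for the analyses here.

```python
import math


def signalling_entropy_rate(adjacency, expression):
    """Entropy rate of an expression-weighted random walk on a network.

    adjacency: symmetric 0/1 matrix (list of lists), no isolated nodes.
    expression: strictly positive expression value for each node.
    """
    n = len(expression)
    # Expression mass reachable from each node: (A x)_i.
    reach = [
        sum(adjacency[i][j] * expression[j] for j in range(n)) for i in range(n)
    ]
    # For a symmetric network the stationary weight is x_i * (A x)_i.
    weights = [expression[i] * reach[i] for i in range(n)]
    total = sum(weights)
    rate = 0.0
    for i in range(n):
        local = 0.0  # Shannon entropy of the step distribution out of node i
        for j in range(n):
            if adjacency[i][j]:
                p = expression[j] / reach[i]
                local -= p * math.log(p)
        rate += (weights[i] / total) * local
    return rate


# Toy three-protein network in which every protein interacts with the others.
triangle = [[0, 1, 1], [1, 0, 1], [1, 1, 0]]
promiscuous = signalling_entropy_rate(triangle, [1.0, 1.0, 1.0])
committed = signalling_entropy_rate(triangle, [10.0, 1.0, 1.0])
```

In this toy case, uniform expression leaves every signalling route equally likely and gives the highest entropy rate, whereas concentrating expression on one protein channels the walk and lowers it, mirroring the stem-cell versus differentiated-cell contrast described above.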
Computing signalling entropy for each DUX4 construct revealed that gene expression profiles induced by DUX4 and tMALDUX4-VP16 displayed significantly higher signalling entropy than those induced by control (P<0.005) and DUX4c (P<0.0006), suggesting that DUX4 results in a transcriptomic profile that is more like that of a stem cell or of a less-differentiated cell (Fig. 6F). In contrast, tMALDUX4-ERD displayed a significantly lower signalling entropy than control (P<0.04), suggesting that repression of DUX4 target genes causes a more differentiated expression regime. tMALDUX4- and DUX4c-expressing cells had similar signalling entropy to that of control cells (Fig. 6F), suggesting that they do not significantly alter global transcriptomic measures of differentiation potential, despite their effects on key markers of differentiation at the protein level (Figs 3 and 4).

Fig. 4. DUX4 reduces myogenic fusion by both transcriptional activation and suppression of target genes. Expanded satellite-cell-derived myoblasts were transduced to express DUX4, tMALDUX4, tMALDUX4-VP16, tMALDUX4-ERD, DUX4c or control retrovirus (RV). (A) At 24 h post-transduction, myoblasts were pulsed with EdU for 2 h, fixed and immunostained for eGFP with EdU detection. DUX4, tMALDUX4-VP16 or DUX4c expression reduced the proportion of eGFP+ myoblasts containing EdU. (B) The pattern of phosphorylated histones H1 and H3 immunosignal can be used to identify stages in the cell cycle (Hendzel et al., 1997; Lu et al., 1994) and revealed that DUX4 suppressed cell cycle progression. (C,D) Transduced myoblasts were switched to differentiation medium for 48 h, and co-immunostained for eGFP (green) and MyHC (red) with DAPI counterstain (blue). (C) DUX4 constructs and DUX4c significantly reduced the fusion index (≥2 nuclei). (D) DUX4 constructs reduced the number and size of myotubes, with many unfused eGFP+ and MyHC− myoblasts. Data are mean±s.e.m. from three mice, where an asterisk denotes significant difference (P<0.05) from transduction with control RV using a paired Student's t-test. Scale bars: 50 µm. (E) Summary of effects of DUX4 constructs and DUX4c on satellite cells.

DUX4 regulates genes associated with apoptosis and reduced cell proliferation
DUX4 principally activates transcription of target genes, whereas DUX4c and tMALDUX4 activate some of these DUX4 target genes but repress others. We compared pathways that are regulated by DUX4 and DUX4c using sequential gene-set filtering and information from the four DUX4 construct and DUX4c microarrays (Fig. 6A).
In addition to the target gene set that acts as a DUX4 biomarker, we also generated two DUX4c target gene sets: one in which genes were considered to be upregulated by DUX4c if they were upregulated by both tMALDUX4 and DUX4c, and one in which genes were considered to be downregulated by DUX4c if they were downregulated by both tMALDUX4 and DUX4c (Fig. 6A). Gene set enrichment analysis was used to evaluate whether genes that were commonly and differentially regulated by DUX4 and DUX4c (Table S3) were significantly associated with particular functional classes. After correcting for multiple testing, there was no enrichment for gene sets that were downregulated by both DUX4 and DUX4c or that were upregulated by DUX4 but not DUX4c (Tables S4 and S5).
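The grouping of genes into commonly and differentially regulated classes amounts to set algebra on the DUX4 and DUX4c target sets, as the following minimal sketch shows. It assumes per-construct calls of +1 (up versus control), -1 (down) or 0, and it reads "regulated by DUX4 but not DUX4c" as membership of the DUX4 set and absence from the corresponding DUX4c set. Gene identifiers are hypothetical placeholders.

```python
def dux4c_target_sets(calls):
    """Genes changed in the same direction by both tMALDUX4 and DUX4c."""
    up = {g for g, c in calls.items() if c["tMALDUX4"] == 1 and c["DUX4c"] == 1}
    down = {g for g, c in calls.items() if c["tMALDUX4"] == -1 and c["DUX4c"] == -1}
    return up, down


def compare_target_sets(dux4_up, dux4_down, dux4c_up, dux4c_down):
    """Group genes by how DUX4 and DUX4c regulate them."""
    return {
        "up_both": dux4_up & dux4c_up,
        "down_both": dux4_down & dux4c_down,
        "up_dux4_only": dux4_up - dux4c_up,
        "down_dux4_only": dux4_down - dux4c_down,
        "up_dux4c_only": dux4c_up - dux4_up,
        "down_dux4c_only": dux4c_down - dux4_down,
    }


# Hypothetical calls and DUX4 target sets.
calls = {
    "g1": {"tMALDUX4": 1, "DUX4c": 1},
    "g2": {"tMALDUX4": -1, "DUX4c": -1},
    "g3": {"tMALDUX4": 1, "DUX4c": 0},   # not shared by both: excluded
    "g4": {"tMALDUX4": -1, "DUX4c": -1},
}
dux4c_up, dux4c_down = dux4c_target_sets(calls)
groups = compare_target_sets({"g1", "g5"}, {"g2", "g6"}, dux4c_up, dux4c_down)
```

Each of the resulting groups can then be passed separately to an enrichment analysis.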
Crucially, genes downregulated by DUX4 but not DUX4c were significantly enriched for those regulating cell proliferation and apoptosis, for example those encoding TGFβ1 and Notch1 (Fig. 7A; Table S6). Genes upregulated by both DUX4 and DUX4c were significantly enriched for urogenital development and gland development, for example Gata3, Esr1, Bcl2 and Wwtr1 (Fig. 7B; Table S7). Genes that were upregulated by DUX4c but not DUX4 were strongly associated with angiogenesis and blood vessel morphogenesis, for example Hey1 (Fig. 7C; Table S8). Conversely, genes downregulated by DUX4c but not DUX4 were associated with developmental processes and muscle development, for example Hoxa1, Fzd2, Tnnc2, Myh7 and myoglobin (Mb) (Fig. 7D; Table S9).

Fig. 5. DUX4 acts by both activating and suppressing target genes. (A) Flow chart describing the filtering of probes to identify genes whose expression was modified by DUX4 constructs and DUX4c. (B) Global transcriptomic analysis of microarray assays of cells expressing DUX4 constructs (compared to control) demonstrates correlations between differential expression t-values. Positive correlations were detected between DUX4 and tMALDUX4-VP16 gene sets and between DUX4 and DUX4c gene sets. Lack of anti-correlation between DUX4 and tMALDUX4-ERD gene sets indicates that DUX4 also suppresses transcription of some target genes.
Fig. 6. DUX4 induces signatures of a stem-cell-like and less-differentiated state. (A) Transcripts that were upregulated (red) by DUX4 and tMALDUX4VP16 (tDUX4VP16) but downregulated (green) by tMALDUX4ERD (tDUX4ERD) were considered as positively correlated (upregulated) with DUX4 activity. Conversely, transcripts that were downregulated (green) by DUX4 and tMALDUX4VP16 but upregulated (red) by tMALDUX4ERD were considered as negatively correlated (downregulated) with DUX4 activity. Transcripts upregulated (red) by tMALDUX4 and DUX4c were considered as positively correlated (upregulated) with DUX4c activity. Conversely, transcripts downregulated (green) by tMALDUX4 and DUX4c were considered as negatively correlated (downregulated) with DUX4c activity. (B-D) We constructed a single-sample DUX4 expression score from our study in mouse to examine overlap with DUX4 target genes identified by other studies. (B,C) Our mouse DUX4 expression score distinguishes murine C2C12 myoblasts expressing DUX4 from controls in two independent published microarray studies (Bosnakovski et al., 2008b; Sharma et al., 2013). (D) Our mouse DUX4 expression score also distinguishes DUX4-expressing human immortalised myoblasts from those expressing DUX4-s or eGFP control (Geng et al., 2012). (E) A human DUX4 signature derived from human myoblasts expressing DUX4 (Geng et al., 2012) distinguishes mouse myoblasts expressing tMALDUX4 and DUX4c both from those expressing DUX4 or tMALDUX4-VP16, and also from those expressing tMALDUX4-ERD. (F) Signalling entropy is elevated in the transcriptional profiles induced by DUX4 and tMALDUX4-VP16 but is reduced by tMALDUX4-ERD expression, supporting the hypothesis that DUX4 inhibits myogenic differentiation. Boxes represent the interquartile range (central 50% of data) with the median indicated by a line, and whiskers indicate the extremes of the distribution. P-values were calculated using Student's t-test. RV, retrovirus.

DISCUSSION
DUX4 plays a key role in FSHD1 and FSHD2 pathology because of its de-repression in skeletal muscles (Tawil et al., 2014). Epigenetic regulation of the D4Z4 repeat in transgenic D4Z4-2.5 mice is generally similar to that in man, with variable low levels of DUX4 in skeletal muscle, but the transgenic model has no overt muscle pathology (Krom et al., 2013). Here, we show that DUX4 expression increases during muscle regeneration, being expressed by myoblasts, although overall, DUX4 levels remained low. Our observations are consistent with those made in primary FSHD myoblasts, where both DUX4 and its transcriptional activity can be detected in proliferating and differentiating human myoblasts (Dixit et al., 2007; Jones et al., 2012; Kowaljow et al., 2007; Rickard et al., 2015; Snider et al., 2010). Mice have an impressive regeneration capacity, and so low DUX4 levels or expression restricted to a few myoblasts might explain the lack of an overt muscle phenotype. The DUX4 locus is predisposed to being expressed and is activated by, amongst other things, myogenic transcriptional regulators. Recently, two myogenic enhancers have been identified (Himeda et al., 2014), one of which, the DUX4 myogenic enhancer 1 (DME1), is included in the D4Z4-2.5 transgene.
DUX4 splice variants emanate from the D4Z4 repeat array (Snider et al., 2009), and inappropriate temporal expression or increased proportions of the transcript encoding DUX4-fl are probably pathogenic in FSHD muscle. DUX4-fl and splice variants inhibit myoblast differentiation (Bosnakovski et al., 2008b;Snider et al., 2009), and DUX4-fl is also apoptotic (Bosnakovski et al., 2008b;Mitsuhashi et al., 2013;Wallace et al., 2010). DUX4-fl contains double homeobox DNA-binding domains and an evolutionarily conserved peptide sequence at the C-terminus (Clapp et al., 2007) that acts as a strong transcriptional activation domain (Kawamura-Saito et al., 2006). To better understand the mode of action of DUX4 and DUX4c on myogenesis, we used our panel of four DUX4 constructs, including constitutively active, dominant-negative and truncated versions of DUX4.
Pax7 is expressed in activated satellite cells, but levels decrease during differentiation, with Pax7 and myogenin expression being mutually exclusive (Zammit et al., 2004). DUX4 and tMALDUX4-VP16 resulted in maintenance of Pax7 expression, as did DUX4c, whereas transcriptional repression of target genes by the tMALDUX4-ERD construct did not alter Pax7 levels. DUX4, tMALDUX4 and DUX4c also reduced Myod expression. Because tMALDUX4-VP16 but not tMALDUX4-ERD reduced Myod levels, it is likely that DUX4 activates genes involved in Myod repression rather than directly repressing Myod transcription itself, providing insight into MYOD-dependent pathway suppression in FSHD (Celegato et al., 2006; Winokur et al., 2003b).
Interestingly, the C-terminal peptide of DUX4 inhibits myogenin expression in the absence of the DNA-binding homeodomains (Snider et al., 2009). tMALDUX4-VP16 does not contain this C-terminal peptide and did not alter myogenin gene expression, showing that DUX4 is not solely acting by transcriptionally activating target genes, consistent with observations that tMALDUX4-ERD also suppresses myogenin. Myf5 mRNA is upregulated by DUX4 in immortalised myoblasts and satellite cells (Banerji et al., 2015a;Bosnakovski et al., 2008b) and this could represent a compensatory mechanism. However, DUX4 inhibits both Myod and myogenin gene expression in mouse satellite cells to produce a differentiation defect that cannot be overcome by upregulation of Myf5.
All DUX4 constructs and DUX4c inhibited myoblast fusion into multinucleated myotubes, but DUX4c and tMALDUX4 had relatively mild effects. Myoblasts were re-plated at high density before assessing fusion, to mitigate the effects on proliferation. However, tMALDUX4-ERD did not affect proliferation yet still blocked fusion, indicating that transcriptional activation of DUX4 target genes inhibits proliferation, but both activation and suppression of target genes can suppress differentiation.
Thus, DUX4 expression results in maintenance of a stem-cell-like and less-differentiated state, with concomitant suppression of proliferation and inhibition of differentiation. This striking differentiation defect might explain the lack of muscle phenotype in our D4Z4-2.5 mice because rare DUX4-expressing myoblasts might be inhibited from fusing into myofibres.
To better understand DUX4, we further analysed our microarray of satellite-cell-derived myoblasts expressing DUX4, tMALDUX4-VP16, tMALDUX4-ERD, tMALDUX4 or DUX4c constructs (Banerji et al., 2015a). Pairwise comparison of the transcriptional changes caused by each construct compared to control allowed us to determine the predominant mode of action of DUX4. Transcriptional changes elicited by DUX4 or tMALDUX4-VP16 were strongly positively correlated, indicating that DUX4 activates many transcriptional targets. Interestingly, although DUX4 and tMALDUX4-VP16 had very similar transcriptome signatures, they were not identical, indicating that DUX4 is not operating solely as a transcriptional activator. Indeed, although the expression profile of tMALDUX4-VP16 target genes was anti-correlated to that of tMALDUX4-ERD, DUX4 was not, indicating that DUX4 also suppresses some transcriptional target genes. The target gene sets of tMALDUX4 and DUX4c were positively correlated, but were also positively correlated with DUX4, indicating that they have many target genes in common. This again suggests additional mechanisms by which DUX4 alters transcriptional regulation that are distinct from the activity of its C-terminal transactivation domain.
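The pairwise comparison can be illustrated with a short sketch that correlates per-gene differential-expression t-values (each construct versus control) between two constructs. Pearson correlation is assumed here for illustration, and the t-values are hypothetical; they are chosen only to show the expected pattern of a strong positive correlation between DUX4 and tMALDUX4-VP16 and an anti-correlation between tMALDUX4-VP16 and tMALDUX4-ERD.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical t-values for the same four genes under three constructs.
t_dux4 = [5.0, 3.0, -2.0, 0.5]
t_vp16 = [6.0, 2.5, -3.0, 1.0]
t_erd = [-6.0, -2.5, 3.0, -1.0]

r_dux4_vp16 = pearson(t_dux4, t_vp16)
r_vp16_erd = pearson(t_vp16, t_erd)
```

If DUX4 acted purely as an activator, its t-values would be expected to anti-correlate with those of tMALDUX4-ERD as strongly as those of tMALDUX4-VP16 do; a weaker or absent anti-correlation is the observation that points to additional repressive activity.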
Signalling entropy is a strong correlate of differentiation potential in healthy tissue (Banerji et al., 2013) and is a powerful prognostic factor in cancerous tissue, where it is associated with anaplasia (Banerji et al., 2015b). tMALDUX4 or DUX4c induced similar signalling entropies to control, whereas tMALDUX4-ERD decreased signalling entropy, indicating induction of differentiation. In contrast, signalling entropy was raised by DUX4 or tMALDUX4-VP16, implying that DUX4 activates transcriptional target genes that are expressed in stem cell populations, consistent with retention of Pax7 expression in satellite cells.
Although there are Dux-like genes in mouse, there is debate about how useful mouse studies are for identifying genes regulated by DUX4. However, there has only been limited assessment of the concordance in mouse and man between DUX4-mediated transcriptional changes. There was a 27% overlap of transcripts that are differentially expressed by DUX4 in mouse C2C12 myoblasts compared to human RD rhabdomyosarcoma cells expressing DUX4, despite effects associated with comparing mouse myoblasts with human cancer cells (Sharma et al., 2013). We have also demonstrated previously a 23% overlap in DUX4 targets between mouse and man using the transgenic D4Z4-2.5 mouse model (Krom et al., 2013). However, the significance of this overlap in DUX4-perturbed genes was not statistically assessed in these studies.
Objectively assessing mouse as an FSHD model is requisite because many mouse models have been developed for FSHD (Lek et al., 2015). Reliable transcriptomic profiling of DUX4 overexpression requires matched cell types between mouse and man, and statistical assessment of target overlap. We developed a DUX4 signature of genes using our microarray in primary mouse satellite cells. Our mouse DUX4 signature could distinguish mouse C2 myoblasts expressing DUX4 from control cells, described in two independent studies (Bosnakovski et al., 2008b; Sharma et al., 2013). Importantly, this overlap extended to a human microarray study (Geng et al., 2012), where genes identified as being perturbed by DUX4 in our murine myoblasts could also be used to distinguish human myoblasts overexpressing DUX4 from those expressing DUX4-s or eGFP controls. We also derived a human DUX4 signature, which separated our DUX4- and tMALDUX4-VP16-expressing mouse myoblasts from those expressing DUX4c and tMALDUX4, and also tMALDUX4-ERD-expressing cells from those expressing DUX4c and tMALDUX4. Thus, there is a statistically significant overlap in DUX4 transcriptional dysregulation across mouse and man. Furthermore, DUX4 in mouse primary myoblasts perturbs expression of genes that are modified in multiple human FSHD muscle biopsies (Banerji et al., 2015a).
Using transcriptome data from mouse satellite cells expressing DUX4 or tMALDUX4-VP16, we isolated genes that are likely to be transcriptionally activated by DUX4. Identifying those genes that exhibited inverse expression patterns in satellite cells expressing tMALDUX4-ERD increases the confidence that they are regulated by DUX4. However, DUX4c or DUX4 splice variants also perturb myoblast function (Ansseau et al., 2009; Bosnakovski et al., 2008b; Snider et al., 2009). Using the four DUX4 constructs and DUX4c, we filtered gene expression profiles to provide sets of genes that are perturbed by DUX4 and/or DUX4c. As expected, those genes regulated by DUX4 but not DUX4c were enriched for genes involved in apoptosis and proliferation, consistent with observations that DUX4, but not DUX4c, is pro-apoptotic in myoblasts. DUX4c-enriched genes were involved in vascular development, which is relevant given the association between Coats-like retinopathy and FSHD (Fitzsimons, 2011). DUX4c-perturbed genes are also involved in muscle development, supporting an active role for DUX4c in FSHD muscle pathology (Ansseau et al., 2009). Both DUX4 and DUX4c regulate genes expressed during urogenital and gland morphogenesis, consistent with DUX4 expression in testes and indicating that overlapping DUX4 and DUX4c transcriptional targets could guide development of urogenital organs. Finally, genes downregulated by DUX4c but not DUX4 were associated with muscle development and axonal guidance. Both DUX4 and DUX4c inhibit myoblast fusion, whereas DUX4 overexpression in embryonic stem cells promotes differentiation towards the neuronal lineage (Dandapat et al., 2014), indicating that DUX4c is associated with neuronal and myogenic development in a manner that is independent of DUX4. These transcriptome signatures add to our understanding of how DUX4 and DUX4c induce pathology in FSHD.
Examining multiple DUX4 constructs also allows for the identification of target genes that could be overlooked when examining DUX4 alone due to its effects on proliferation and apoptosis.
Overall, our study suggests that induction of a more stem-cell-like and less-differentiated state in myoblasts expressing DUX4 inhibits proliferation and myogenesis. Identifying pathways perturbed by DUX4 contributes to the search for viable therapeutic targets to alleviate the consequences of mis-expression of DUX4 in FSHD.

Muscle injury
Procedures were carried out under the Animals (Scientific Procedures) Act 1986, as approved by King's College London Ethical Review Process committee or approved by the local animal experimental committee of Leiden University Medical Center and by the Commission Biotechnology in Animals of the Dutch Ministry of Agriculture. Four-month-old hemizygous D4Z4-2.5 and D4Z4-12.5 mice were used (Krom et al., 2013). Muscle injury was induced by intra-muscular injection of 10 μM cardiotoxin in 50 μl PBS into the gastrocnemius of anaesthetised mice. Contra-lateral muscles were injected with 50 μl saline. Muscles were isolated at days 3, 4, 5, 6 and 10 post-cardiotoxin, snap-frozen in 2-methylbutane (Sigma-Aldrich, Dorset, UK) cooled in liquid nitrogen, cryosectioned and stained with haematoxylin and eosin (H&E). D4Z4-2.5 (stock #027991) and D4Z4-12.5 (stock #028012) transgenic mice are available from the Jackson Laboratory.
Images were acquired on a Zeiss Axiovert 200 M microscope using a Zeiss AxioCamHRm and AxioVision version 4.4 (Zeiss) or a Zeiss Axioplan 2 with a Hamamatsu ORCA-ER camera with Openlab 3.1.7.

EdU incorporation
Myoblasts were plated at 5×10³ in 8-well Matrigel-coated chamber slides and maintained in high-serum medium for 24 h before transduction. After a further 24 h, cells were pulsed with EdU for 2 h (Thermo Fisher Scientific) and immunostained for eGFP before EdU detection with Alexa-Fluor-594 (Thermo Fisher Scientific).

Apoptosis assay
Transduced myoblasts were plated (5×10³/well) into 96-well plates for fluorescence assays (Greiner Bio-One) in three technical replicates to investigate apoptosis using the Caspase-Glo 3/7 Assay (Promega) on a Glomax-Multi+ microplate reader (Promega). Luminescence from the Caspase-Glo assay in each well was normalised to GFP fluorescence measured using the Glomax-Multi+ reader.

Statistical analysis
Myofibres and satellite-cell-derived myoblasts were obtained from at least three mice. Data from immortalised myoblast lines were from at least three experiments. Data are mean±s.e.m. with significance assessed by Student's t-test, unless otherwise stated.
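As an illustration of the summary statistics named above, the following is a minimal sketch of computing mean±s.e.m. and a Student's t statistic. It assumes an unpaired, two-sample, equal-variance test, which the text does not specify; the function names and the replicate values are hypothetical and are not data from this study.

```python
import math
from statistics import mean, stdev


def mean_sem(values):
    """Return (mean, standard error of the mean) for a list of replicates."""
    return mean(values), stdev(values) / math.sqrt(len(values))


def student_t(a, b):
    """Unpaired Student's t statistic with pooled variance.

    Returns (t, degrees of freedom). Equal variance is an assumption here.
    """
    na, nb = len(a), len(b)
    pooled_var = ((na - 1) * stdev(a) ** 2 + (nb - 1) * stdev(b) ** 2) / (na + nb - 2)
    t = (mean(a) - mean(b)) / math.sqrt(pooled_var * (1 / na + 1 / nb))
    return t, na + nb - 2


# Hypothetical replicate measurements (three mice per group).
control = [10.0, 12.0, 11.0]
treated = [4.0, 5.0, 6.0]

m, sem = mean_sem(control)
t_stat, dof = student_t(control, treated)
print(f"control: {m:.2f} ± {sem:.2f}; t = {t_stat:.3f} on {dof} d.f.")
```

The P-value would then be read from the t distribution with the returned degrees of freedom, for example using a statistics package.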
Differential expression analysis was performed using an empirical Bayes approach (Smyth, 2004) to identify transcripts perturbed by each DUX4 construct. The t-statistics for transcripts were then correlated between constructs to ascertain similarities in expression landscapes. The t-values described in reference to differential expression are the test statistics of a standard statistical assessment of differential expression using the Linear Models for Microarrays (limma) package in R (Smyth, 2004). Transcripts were filtered using all constructs to obtain two lists representing genes whose expression was modified by either DUX4 or DUX4c. P<0.05 was used to identify genes that were differentially expressed by each DUX4 construct compared to control. Genes were then classed as DUX4 upregulated if they were upregulated by both DUX4 and tMALDUX4-VP16 and downregulated by tMALDUX4-ERD. Genes were classed as DUX4 downregulated if they were downregulated by both DUX4 and tMALDUX4-VP16 and upregulated by tMALDUX4-ERD. Similarly, a gene was considered to be up- or downregulated by DUX4c if it was up- or downregulated by both DUX4c and tMALDUX4.
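The filtering rules above can be sketched as follows. This is an illustrative reading of the criteria, not the analysis code used in the study: it assumes that each gene has a t-statistic and P-value per construct (as would be exported from limma), and the function names, data layout and example values are hypothetical.

```python
ALPHA = 0.05  # significance threshold stated in the text


def direction(t_value, p_value, alpha=ALPHA):
    """Return +1 (significantly up), -1 (significantly down) or 0 (not significant)."""
    if p_value >= alpha:
        return 0
    return 1 if t_value > 0 else -1


def classify_gene(stats):
    """Classify one gene from a dict mapping construct name to (t, P).

    Returns a list of labels, because a gene can be regulated by both
    DUX4 and DUX4c.
    """
    d = {construct: direction(t, p) for construct, (t, p) in stats.items()}
    labels = []
    # DUX4: concordant in DUX4 and tMALDUX4-VP16, inverse in tMALDUX4-ERD.
    if d["DUX4"] == 1 and d["tMALDUX4-VP16"] == 1 and d["tMALDUX4-ERD"] == -1:
        labels.append("DUX4 upregulated")
    if d["DUX4"] == -1 and d["tMALDUX4-VP16"] == -1 and d["tMALDUX4-ERD"] == 1:
        labels.append("DUX4 downregulated")
    # DUX4c: concordant in DUX4c and tMALDUX4.
    if d["DUX4c"] == 1 and d["tMALDUX4"] == 1:
        labels.append("DUX4c upregulated")
    if d["DUX4c"] == -1 and d["tMALDUX4"] == -1:
        labels.append("DUX4c downregulated")
    return labels


# Hypothetical (t, P) values for a single gene.
example = {
    "DUX4": (5.2, 0.001),
    "tMALDUX4-VP16": (6.1, 0.0005),
    "tMALDUX4-ERD": (-3.4, 0.01),
    "tMALDUX4": (0.4, 0.70),
    "DUX4c": (0.9, 0.40),
}
print(classify_gene(example))
```

Requiring the inverse response to tMALDUX4-ERD makes the DUX4 lists deliberately conservative: a gene that responds to DUX4 and tMALDUX4-VP16 but not to tMALDUX4-ERD is left unclassified.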
GSEA was performed using a Fisher's Exact test, using the DAVID functional annotation tool (Huang et al., 2009a,b). Gene sets which displayed Benjamini-Hochberg adjusted P<0.05 were considered enriched.
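To make the enrichment criterion concrete, the sketch below computes a one-sided Fisher's exact (hypergeometric) P-value for over-representation and applies the Benjamini-Hochberg adjustment. It is a plain textbook version written for illustration; it does not reproduce DAVID's own scoring or background handling, and the function names and example counts are hypothetical.

```python
from math import comb


def fisher_enrichment_p(overlap, list_size, set_size, background):
    """One-sided Fisher's exact P-value for over-representation.

    Probability of seeing at least `overlap` gene-set members in a list of
    `list_size` genes drawn from `background` genes, of which `set_size`
    belong to the gene set.
    """
    upper = min(list_size, set_size)
    favourable = sum(
        comb(set_size, x) * comb(background - set_size, list_size - x)
        for x in range(overlap, upper + 1)
    )
    return favourable / comb(background, list_size)


def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted P-values in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical counts: (overlap, list size, gene-set size, background size).
tests = [(12, 200, 150, 15000), (3, 200, 300, 15000), (8, 200, 100, 15000)]
raw = [fisher_enrichment_p(*t) for t in tests]
adj = benjamini_hochberg(raw)
enriched = [t for t, q in zip(tests, adj) if q < 0.05]
print(adj, enriched)
```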

Signalling entropy
Signalling entropy was computed using a mass action principle approximation (Banerji et al., 2013). Each sample was integrated with a protein interaction network (PIN) to create a sample-specific stochastic matrix, $P=(p_{ij})$. The PIN was constructed from previous work (Banerji et al., 2015b) through orthology relations. The $i$th row of $P$ defines a probability distribution describing rates of reaction of protein $i$ with each of its neighbours. Distributions were constructed by appealing to a mass action principle, namely that the rate of a reaction is proportional to the product of the active masses. Assuming log-normalised gene expression is a proxy for protein concentration, we compute:

$$p_{ij} = \frac{E_j}{\sum_{k \in N(i)} E_k} \quad \text{for } j \in N(i), \qquad p_{ij} = 0 \quad \text{otherwise},$$

where $E_j$ is the log-normalised expression of gene $j$ in the given sample and $N(i)$ denotes the set of direct interaction partners of gene $i$ in the PIN. From this definition, $\sum_{j \in N(i)} p_{ij} = 1$ for all $i$, i.e. $P$ is row stochastic and the $i$th row corresponds to the weighted interaction distribution of protein $i$ in the sample. Not all proteins in the PIN have a corresponding microarray probe; consequently, the PIN used is the maximally connected component after removal of missing proteins. For each protein $i$, we define the local entropy of its interaction distribution, $S_i$, quantifying promiscuity in its signalling within the sample:

$$S_i = -\sum_{j \in N(i)} p_{ij} \log p_{ij}.$$

Signalling entropy is a global measure of signalling promiscuity and is computed from the stochastic matrix $P$ as the entropy rate (SR) of the stochastic process described by $P$:

$$SR = \sum_i \pi_i S_i,$$

where $\pi = (\pi_i)$ denotes the stationary distribution of the stochastic matrix, satisfying:

$$\pi P = \pi, \quad \text{i.e.} \quad \sum_i \pi_i p_{ij} = \pi_j, \qquad \sum_i \pi_i = 1.$$

Thus $\pi$ is the non-degenerate left eigenvector of $P$ corresponding to eigenvalue 1. By the Perron-Frobenius theorem, existence of $\pi$ requires that the matrix $P$ be irreducible; as the PIN considered is connected and non-bipartite, this is guaranteed. R-scripts for signalling entropy can be found at www.sourceforge.net/projects/signalentropy.
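The calculation can be illustrated on a toy network. The sketch below is not the published R implementation: it assumes the mass action form in which the transition probability from protein i to neighbour j is the expression of j divided by the summed expression of all neighbours of i, it finds the stationary distribution by simple power iteration, and the three-protein network, expression values and function names are hypothetical.

```python
import math


def stochastic_matrix(expr, neighbours):
    """Row-stochastic matrix as nested dicts: P[i][j] = E_j / sum of E_k over N(i)."""
    P = {}
    for i, nbrs in neighbours.items():
        total = sum(expr[k] for k in nbrs)
        P[i] = {j: expr[j] / total for j in nbrs}
    return P


def local_entropy(row):
    """Shannon entropy of one protein's interaction distribution."""
    return -sum(p * math.log(p) for p in row.values() if p > 0)


def stationary_distribution(P, max_iter=100000, tol=1e-13):
    """Power iteration for pi satisfying pi P = pi.

    Convergence relies on the network being connected and non-bipartite.
    """
    nodes = list(P)
    pi = {i: 1.0 / len(nodes) for i in nodes}
    for _ in range(max_iter):
        new = {j: 0.0 for j in nodes}
        for i in nodes:
            for j, p in P[i].items():
                new[j] += pi[i] * p
        if max(abs(new[i] - pi[i]) for i in nodes) < tol:
            return new
        pi = new
    return pi


def signalling_entropy(expr, neighbours):
    """Entropy rate: stationary-weighted sum of local entropies."""
    P = stochastic_matrix(expr, neighbours)
    pi = stationary_distribution(P)
    return sum(pi[i] * local_entropy(P[i]) for i in P)


# Hypothetical three-protein network (a triangle: connected, non-bipartite).
network = {"a": ["b", "c"], "b": ["a", "c"], "c": ["a", "b"]}
expression = {"a": 1.0, "b": 2.0, "c": 3.0}
print(signalling_entropy(expression, network))
```

When all three proteins have equal expression, every row of the matrix is uniform over two neighbours and the entropy rate equals log 2, the maximum for this network; unequal expression skews the interaction distributions and lowers it.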