CTCF and its protein partners: divide and rule?

CTCF is a ubiquitous transcription factor that is involved in numerous, seemingly unrelated functions. These functions include, but are not limited to, positive or negative regulation of transcription, enhancer-blocking activities at developmentally regulated gene clusters and at imprinted loci, and X-chromosome inactivation. Here, we review recent data acquired with state-of-the-art technologies that illuminate possible mechanisms behind the diversity of CTCF functions. CTCF interacts with numerous protein partners, including cohesin, nucleophosmin, PARP1, Yy1 and RNA polymerase II. We propose that CTCF interacts with one or two different partners according to the biological context, applying the Roman principle of governance, `divide and rule' (divide et impera).


Introduction
Linking intra-and interchromosomal sites As has been discussed above, CTCF appears to be able to link discrete domains on the same or different chromosomes. An important series of studies has used various modifications of the chromosome conformation capture (3C) technique (Dekker et al., 2002) to identify chromatin regions that contact each other physically in the nucleus. Ling and co-workers (Ling et al., 2006) studied the transcriptional control of imprinted genes (only one of the two alleles of such genes is expressed in a given cell, and the expression is determined by the parental origin of the allele, i.e. the mother and the father alleles are differentially expressed). They found that the so-called imprinting control region (ICR) that borders and governs the expression of the maternal allele of the H19 gene (which encodes an RNA molecule of unknown function) specifically interacts with the paternal allele of an intergenic region between two other imprinted genes, Wsb1 (WD repeat and SOCS- (C) Schematic of CTCF primary structure, showing its three domains, as well as those protein partners whose interactions with CTCF have been mapped to the individual domains. CHD8, chromodomain helicase DNA-binding protein 8; CIITA, MHC class II transactivator; CP190, centrosomal protein 190; H2A.Z, variant Z of histone H2A; LS, large subunit; HDAC, histone deacetylase; PARP1, poly(ADP-ribose) polymerase 1; SIN3A, SIN3 homolog A, transcription regulator (yeast); RFX, regulatory factor X; RNAP II, RNA polymerase II; Suz12, suppressor of zeste 12 homolog (Drosophila), Taf1/Set, SET translocation (myeloid leukemia-associated); Topo II, DNA topoisomerase II; YB1, Y-box binding protein 1; Yy1, yin and yang 1.
box-containing protein 1) and Nf1 (neurofibromin). Notably, the interacting regions are located on two different mouse chromosomes: the H19 ICR on chromosome 7, and Wsb1 and Nf1 on chromosome 11. Importantly, the interaction is dependent on the presence of CTCF and intact CTCF-binding sites. A further study identified 114 unique sequences from all chromosomes that interact with the same H19 ICR region, with some preference for interchromosomal interactions (Zhao et al., 2006). Imprinted loci are highly represented among the interacting DNA regions, and the pattern of interactions changes during differentiation ex vivo (in embryonic stem cells) and in vivo (when comparing the embryoid body with the neonatal liver). Notably, the physical proximity of sites depends on intact CTCF target sites, implicating CTCF in mediating these interactions [for further examples and discussion, see Zlatanova and Caiafa (Zlatanova and Caiafa, 2009)]. Simonis and colleagues (Simonis et al., 2006) studied the β-globin gene locus in its transcriptionally active (fetal liver) and inactive (fetal brain) state. When active, the locus preferentially interacts with other transcribed loci, whereas the inactive locus prefers to partner with transcriptionally silent regions. This study did not directly address CTCF involvement in bringing genomic loci together; however, such an involvement is to be expected because CTCF is known to have a role in enhancer blocking in the β-globin gene clusters through a mechanism that involves loop formation, i.e. bringing distant DNA regions together (Bell et al., 1999;Farrell et al., 2002;Splinter et al., 2006).

CTCF-interacting proteins -a different partner for each occasion?
The number of proteins recognized to interact with CTCF under specific circumstances is growing steadily and will, undoubtedly, continue to grow. In general, CTCF partners can be divided into several functional groups (Fig. 1B). The group of DNA-binding proteins [transcription factors (activators and/or repressors depending on the context) and cofactors] includes, but is probably not limited to, Y-box-binding protein 1 (YB1) (Chernukhin et al., 2000), Yin and yang 1 (Yy1) (Fig. 2), Kaiso (Defossez et al., 2005), and regulatory factor X (RFX) and MHC class II transactivator (CIITA) (Majumder et al., 2006;Majumder et al., 2008). The second category of partners includes chromatin proteins (both structural proteins and enzymes). Table 1 provides a summary of the most important characteristics of each specific partner, and the main findings concerning the functional significance of the partnership, for the first two groups of interactors. A third group includes important multifunctional proteins, such as poly[ADP-ribose] polymerase 1 (PARP1), nucleophosmin and Topo II. Finally, there are other identified partners that do not belong to any of these groups and will be separately considered as 'miscellaneous'. In the following subsections, we describe the interactions of CTCF with several partner proteins, and show how these give rise to distinct functions of CTCF. Please note, however, that these cases represent only a few examples of the CTCF-protein interactions that occur at specific genomic loci; in addition, the issue of whether CTCF actually recruits the partner protein in question to the site has not been addressed in most of the examples described.

Yy1 is a CTCF partner with a role in X-chromosome inactivation
Yy1 is a ubiquitous four-zinc-finger transcription factor that has been implicated in biological processes such as embryogenesis, differentiation, cell proliferation and tumorigenesis (Gordon et al., 2006). Homozygous Yy1 mouse mutants die early in development, whereas heterozygous animals are characterized by severe growth retardation and neurological defects (Gordon et al., 2006). It has been hypothesized that overexpression and/or activation of Yy1 are linked to loss of control of cell proliferation, although the molecular mechanisms remain elusive. Among the numerous potential mechanisms are effects on p53 expression and/or activity (Gordon et al., 2006) and stimulation of PARP1 activity (Griesenbeck et al., 1999). PARP1 stimulation might be of special interest, because PARP1 has been identified as a CTCF interaction partner  and poly(ADP-ribosyl)ated forms of CTCF have been implicated in the control of transcription of imprinted genes and ribosomal DNA Torrano et al., 2006;Caiafa and Zlatanova, 2009) (see below). Vertebrate Yy1 has also been implicated in polycomb group (PcG)-mediated functions because it can repress transcription in Drosophila and functionally compensates for loss of its Drosophila homologue, PHO (Atchison et al., 2003;Wilkinson et al., 2006). Yy1 recruits the PcG complex to DNA, resulting in methylation of histone H3K27 (Wilkinson et al., 2006); the introduction of methyl groups onto Lys27 in the tail of histone H3 is thought be a mechanism through which PcG proteins repress expression of genes involved in embryonic development.
Yy1 has been recently identified as a CTCF cofactor that has a role in X-chromosome inactivation. Although the mechanism still remains unclear, it is worth noting that another CTCF partner, histone variant H2A.Z, has been also implicated in the inactivation process (Donohoe et al., 2007) (Fig. 2). In mammals, gene-dosage compensation between females (XX) and males (XY) occurs through a random inactivation of one of the two female X chromosomes. The inactivation process is complex and occurs through at least three genetically separable stages: (1) 'counting' of the X-chromosometo-autosome ratio to ensure the inactivation of only one of the two X chromosomes; (2) 'choice' of the chromosome to be inactivated; and (3) the actual inactivation process, which is initiated by coating the designated inactive chromosome with the non-coding Xist RNA (Avner and Heard, 2001;Clerc and Avner, 2006;Erwin and Lee, 2008). CTCF has been implicated in the initial pairing of the two X chromosomes through their X-inactivation centers (Avner and Heard,   Xu et al., 2007), and in the inactivation process itself (Pugacheva et al., 2005). CTCF is also involved in the function of boundary (insulator) elements that separate inactivated genes from rare 'escapee' genes that remain transcriptionally active in the context of the inactive X chromosome (Filippova et al., 2005). The interactions of Yy1 and CTCF are described in more detail in Table 1. Next, we describe the role of CTCF in X-chromosome inactivation in more detail. The physical map of the region that specifies the sequences of the three non-coding RNAs involved in the inactivation process is presented in Fig. 2. On the future active X chromosome, Xite (X-inactivation intergenic transcription element) prolongs the antisense transcription of Tsix [X (inactive)-specific transcript, antisense], which in turn blocks transcription of Xist [X (inactive)specific transcript] (Fig. 2); both CTCF and Yy1 transactivate Tsix. On the future inactive X chromosome, repression of Xite downregulates Tsix transcription, which in turn induces Xist transcription to initiate the inactivation process. In mouse cells, the Xist-Tsix region is characterized by the presence of ~40 potential CTCF-binding sites, which are frequently paired with binding sites for Yy1 (Donohoe et al., 2007) (Fig. 2). CTCF directly interacts with Yy1, as shown in co-immunoprecipitation experiments; the highaffinity interaction between the two proteins involves mainly the N-terminus of CTCF (Donohoe et al., 2007). Finally, transient cotransfection experiments indicate that CTCF and Yy1 together confer higher transactivation on Tsix than either protein alone (Donohoe et al., 2007). The physical and functional interaction of CTCF with Yy1 during X-chromosome inactivation provides a clear example of how a specific function of CTCF is mediated by a specific protein partner.

Cohesin partners CTCF in gene regulation
The cohesin complex has a central role in holding the two sister chromatids in close contact from the time of DNA replication in S phase to the time of their separation at the onset of mitotic anaphase (reviewed by Hirano and Hirano, 2006;Hirano, 2006). Cohesin function is essential for genome stability and repair; several human developmental disorders, such as Cornelia de Lange syndrome and Robert's syndrome, are associated with mutations in cohesin components or the machinery that loads cohesin on chromatids.
The cohesin complex comprises four subunits; Smc1 and Smc3 are members of the structural maintenance of chromosomes (SMC) protein family, whereas Scc1 and Scc3 (subunit of the cohesin complex 1 and 3) are thought to participate in the formation of a ring structure around the two chromatids ( Fig. 3; and see below). Two other non-SMC proteins, Scc2 and Scc4, are required in mammals to load the cohesin complex onto DNA. SMC proteins are large polypeptides of very unusual three-dimensional organization, in which two long α-helices fold back on themselves in an antiparallel orientation to form a rigid coiled-coil domain that has a hinge domain at one end and an ATP-binding 'head' domain at the other (Fig. 3A). Two SMC monomers dimerize at their hinge region to produce long V-shaped molecules. These dimers can form several alternative structures -rings, filaments and rosettes -through intra-and intermolecular interactions. The cohesins are proposed to form ring structures around the two sister chromatids (Haering et al., 2002).
Recently, a cohesion-independent function of cohesins has been recognized in yeast, Drosophila and mammals: they have been detected in post-mitotic cells that lack chromatid cohesion and have been implicated in gene regulation (for reviews, see Göndör and Ohlsson, 2008;Peric-Hupkes and van Steensel, 2008;Uhlmann, 2008;Gause et al., 2008). Four recent papers have reported a strong functional connection between cohesins and CTCF (Table 1). First, cohesin proteins and CTCF colocalize both at specific loci (Stedman et al., 2008;Rubio et al., 2008), including the Myc insulator element (MINE) (Gombert et al., 2003) (Fig. 3B) and genome-wide (Parelho et al., 2008;Rubio et al., 2008;Wendt et al., 2008). Second, CTCF recruits cohesin to specific sites, including the DM1 locus, which has a CTG repeat that is expanded in individuals with myotonic dystrophy (Fig. 3C) (Rubio et al., 2008;Cho et al., 2005). Third, in transient transfection experiments, the activity of insulator elements depends on the presence of cohesin proteins (Parelho et al., 2008;Wendt et al., 2008) (such effects have yet to be demonstrated on endogenous sites).
These studies, exciting as they are, raise a plethora of important questions. For example, what are the molecular interactions that are responsible for the colocalization of CTCF and cohesin? Despite the fact that ~70% of all sites identified as CTCF-and cohesinbinding sites bind to both proteins (Parelho et al., 2008), it is clear that there are sites occupied exclusively by CTCF or cohesin (Rubio et al., 2008). Moreover, downregulation of CTCF does not interfere with mitosis (Parelho et al., 2008;Wendt et al., 2008), suggesting that the cohesion function of cohesin is independent of CTCF. A second question is whether the structure of cohesin is different at CTCF-dependent and CTCF-independent binding sites. Fluorescence recovery after photobleaching (FRAP) experiments suggest that this might be the case; they indicate the existence of two pools of cohesin at interphase (an immobile fraction that is irreversibly bound to chromatin and a dynamic fraction) (Gerlich et al., 2006). The existence of the two distinct cohesin pools is consistent with available biochemical data (Hirano and Hirano, 2006), which suggest the existence of two forms of chromatin-bound cohesin: the ring form that embraces two DNA helices tightly and steadily without interacting directly with DNA, and a less tightly bound form that interacts with DNA in a more conventional manner. The second structure might require other DNA-binding proteins, such as CTCF. We propose that the ring structure is involved in cohesion, whereas the conventional structure participates in gene regulation. Whether long-range chromosomal interactions (loops) are involved in gene regulation through CTCF and cohesin also remains to be directly addressed.
Thus, the interactions between CTCF and cohesin provide another important example of how different CTCF partners may underlie distinct CTCF functions. The cohesin complex should clearly be considered as an interaction partner that mediates the involvement of CTCF in gene regulation.

PARP1 partners CTCF in DNA methylation
Poly(ADP-ribose) polymerases (PARPs) are enzymes that catalyze the formation of poly(ADP-ribose) chains (PARs) on chromatin proteins, including themselves (D'Amours et al., 1999;Schreiber et al., 2006;Kraus, 2008). PARPs use the coenzyme NAD + as a source of ADP-ribose moieties to synthesize protein-bound polymers of variable size (ranging from 2 to more than 200 units) and structural complexity (linear or branched); these polymers introduce negative charges onto the acceptor proteins, thus affecting their interactions with DNA and/or other proteins. The intracellular levels of PARs are under tight control; this involves dynamic formation of polymers by members of the PARP family Heteromodification and automodification are the two processes through which PARPs introduce covalently bound ADP-ribose polymers onto other proteins or onto themselves, respectively. Automodification of PARPs is generally activated by nicks on DNA.
PAR polymers on PARP1, which are attached at up to 28 sites in the automodification domain, are usually very long (up to 200 ADPribose units) and heavily branched (Juarez-Salinas et al., 1982). In addition, PARs (both protein-free and covalently linked to proteins) are capable of strong non-covalent binding (Malanga and Althaus, 2005) to specific proteins, the activity of which is then modulated by the bound polymers.   (Gombert et al., 2003). CTCF is constitutively bound at MINE and at the Myc promoter, and binding is independent of the transcriptional status of the gene. The Myc gene and its insulator are embedded in a large (~160 kb) domain that is flanked by matrix-attachment regions (MARs) and is devoid of other expressed genes; together, they constitute a euchromatic region embedded within a heterochromatic environment [this might be representative of a more general pattern that was recently recognized on mammalian chromosome arms (Regha et al., 2007) of active chromatin interspersed with repressive chromatin]. The CTCF-binding sites at MINE and the Myc promoter also bind to cohesin (Rubio et al., 2008;Stedman et al., 2008). Binding of the chromatin remodeler CHD8 to this region (see bracket) suggests that the chromatin structure in the region is actively altered (Ishihara et al., 2006). (C) The DM1 locus (which contains DMPK, the gene encoding myotonic dystrophy protein kinase), showing the position of the CTG repeat in the 3Ј UTR of DMPK that is expanded in individuals with myotonic dystrophy. The repeat is flanked by two CTCF-binding sites that are occupied by CTCF. In healthy individuals, the repeat is organized in a single positioned nucleosome (a nucleosome in which the histone octamer occupies a specific sequence). This strict positioning of the single nucleosome over the CTG repeat places the CTCF sites in the DNA-linker regions upstream and downstream of the nucleosome. The chromatin structure of the positioned nucleosome is highly heterochromatic [histone H3 is dimethylated at lysine 9 (H3K9me2)], but the rest of the region is characterized by the presence of 'active' histone modifications [histone H3 is methylated at lysine 4 (H3K4me)]. CTCF restricts the length of the antisense transcript, which limits heterochromatin formation to only the positioned nucleosome. In individuals with myotonic dystrophy, expansion of the CTG repeats is associated with loss of CTCF binding and conversion of the entire region to heterochromatin. According to Rubio et al. (Rubio et al., 2008), the CTCF-binding sites on the human DM1 locus (integrated in mouse cells) are simultaneously bound by CTCF and cohesin, and binding of cohesin directly depends on the presence of CTCF. Schematic based on Filippova et al. (Filippova et al., 2001) and Cho et al. (Cho et al., 2005). HP1γ, heterochromatin protein 1γ.
A PARP has been identified among the partners of CTCF in a proteomic search carried out on purified CTCF complexes . Yu and colleagues  demonstrated that CTCF undergoes covalent poly(ADP-ribosyl)ation in the N-terminal domain. These authors found that the control of gene imprinting by CTCF is lost upon inhibition of PARP activity, and therefore suggested that PARylated CTCF is directly involved in the control of imprinting. PARylated CTCF has also been implicated in the control of ribosomal gene expression (Torrano et al., 2006;Caiafa and Zlatanova, 2009). Importantly, it has been recently shown that transient ectopic overexpression of CTCF induces PAR accumulation, PARP1 expression and PARylation of CTCF (Guastafierro et al., 2008). In vitro data from this paper have shown that CTCF can activate automodification of PARP1, even in the absence of nicked DNA; this finding is of great interest, because so far a burst of PARylation of PARP1 has generally been found only following introduction of DNA strand breaks. The persistence of high PAR levels over time affects the DNA methylation machinery: DNA-methyltransferase activity is inhibited, with the consequence that the genome becomes diffusely hypomethylated (Caiafa et al., 2008). Thus, the data of Guastafierro and co-workers (Guastafierro et al., 2008) provide, for the first time, evidence that CTCF is involved in the crosstalk between PARylation and DNA methylation, through its activation of PARP1 (which, in turn, leads to inhibition of DNA methylation) (Reale et al., 2005).

Nucleophosmin is a CTCF partner at insulator sites
Nucleophosmin is an abundant nuclear-matrix phosphoprotein, a large fraction of which is localized to the peripheral region of the nucleolus. It has been implicated in embryonic development and maintenance of genomic stability, mainly through its role in centrosome duplication (Grisendi et al., 2005). At the molecular level, nucleophosmin mediates diverse functions, including rDNA transcription, pre-ribosomal RNA processing, mRNA polyadenylation, and the stress response. It also participates in transport functions, chaperoning ribosomal subunits and/or histones from the cytoplasm to the nucleus and nucleoli. A recent study of the role of nucleophosmin in transcriptional regulation of rDNA has indicated that nucleophosmin is associated with the gene locus, maintaining an open chromatin conformation over the active copies of the rRNA genes by removing histones from the promoter (Murano et al., 2008).
Nucleophosmin was identified as a CTCF partner in a proteomic search , and was the only protein in the soluble CTCF complex that was present in stoichiometric amounts. ChIP analysis of the two known insulator sites that flank the chicken β-globin gene locus confirmed the presence of CTCF at these sites. Remarkably, nucleophosmin was also present at both sites ( Fig. 4) . In human cell lines carrying multiple integrated copies of the chicken HS4 insulator (one of the insulators upstream of the β-globin gene locus), the insulator sites were preferentially localized to the nuclear periphery. As in the case of the endogenous insulator sites at the β-globin gene locus (see above), CTCF colocalized with nucleophosmin at these integrated insulator sites; importantly, the peripheral nucleolar localization of insulator sites was dependent on the integrity of CTCF-binding sites. Thus, it was suggested that insulators are recruited to the periphery of the nucleolus through the strong interaction of CTCF with nucleophosmin . It should be noted that these data concern only the relatively small portion of CTCF that is located in the nucleolus; a large fraction of CTCF is not bound to the nucleolus, and might not be associated with nucleophosmin .
Finally, a recent study focused on chromosome translocations involving the immunoglobulin heavy chain (IgH) gene locus in certain cancer cells (Liu et al., 2008). Interestingly, CTCF and nucleophosmin colocalized at the 3Ј regulatory elements of the IgH Nucleophosmin and CTCF associate with both 5Ј and 3Ј insulator elements in the chicken β-globin gene cluster in vivo. The schematic at the bottom depicts the developmentally regulated β-globin gene cluster and its locus control region (LCR), which encompasses DNase-I-hypersensitive sites 1-3 (HS1-HS3) and the β A /ε enhancer (Gaszner and Felsenfeld, 2006). The domain is flanked by a region of highly compacted chromatin at the 5Ј end and a cluster of genes encoding olfactory receptors at the 3Ј end. Two further DNase-I-hypersensitive sites, HS4 and 3ЈHS, possess enhancer-blocking and insulator activities. 3ЈHS prevents the β A /ε enhancer from activating the olfactory-receptor genes, and HS4 acts as both an insulator, to prevent spreading of heterochromatin into the gene cluster, and an enhancer-blocker, to prevent the enhancer located 5Ј of the condensed chromatin region from activating the globin genes (enhancer-blocking insulators are effective only when situated between a promoter and an enhancer). Nucleophosmin binds to both HS4 and 3ЈHS, as shown in the schematic and demonstrated by the ChIP data presented at the top of the figure [modified from ]. gene locus only in cells carrying the chromosome translocation; moreover, the cells could be growth arrested by nucleophosmin short hairpin RNA. The exact molecular mechanism behind these observations awaits further research. The studies described here provide evidence that the insulator function of CTCF is mediated through its specific tethering to subnuclear sites through its interactions with nucleophosmin. Thus, the insulator function of CTCF -similar to its functions in X-chromosome inactivation, gene regulation and DNA methylation -might require its interaction with a partner protein specific to that function.
Is RNA polymerase II a CTCF partner in transcriptional regulation?
The function of CTCF in transcriptional regulation is not well understood. However, a recent report has identified direct interactions between CTCF and the large subunit of Pol II (Chernukhin et al., 2007); we will discuss this paper in detail, as it contains data of potential relevance to the role of CTCF in transcriptional regulation.
In vitro, CTCF interacts equally well with the hypophosphorylated and the hyperphosphorylated forms of Pol II, which are known to be involved in transcription initiation and elongation, respectively (Chernukhin et al., 2007). In vivo, however, CTCF exhibits a significant preference for interaction with the hypophosphorylated Pol II form. This interaction is mediated by the C-terminal domain of CTCF (Fig. 1C), which contains the sites for phosphorylation of CTCF (Klenova et al., 2001;El-Kady and Klenova, 2005). Preliminary data (Chernukhin et al., 2007) indicate that in-vitro-phosphorylated CTCF has a lower affinity for Pol II, suggesting that the CTCF-Pol-II interaction might be subject to regulation by CTCF phosphorylation.
In an attempt to gain insight into the functional significance of the reported CTCF-Pol-II interaction, serial ChIP analysis (using anti-CTCF antibodies, followed by anti-Pol II antibodies, as bait) was used to interrogate the in vivo presence of the CTCF-Pol-II complex on the β-globin insulator (see above) (Chernukhin et al., 2007). Interestingly, CTCF colocalizes with Pol II at the insulator only in proliferating chicken erythroblasts that do not express the globin genes. In differentiated cells that transcribe two of the four globin genes in the cluster, the association of both proteins with the insulator is lost. The mechanisms behind these events remain to be determined. Further experiments in human choriocarcinoma cells transfected with wild-type or mutated H19 ICR (see above) demonstrated that the binding of Pol II to the ICR requires functional CTCF target sites. Finally, a single CTCFbinding site fused to a promoterless luciferase reporter gene conferred transcriptional activity on the gene in stably integrated constructs. This observation suggested that CTCF is a functional equivalent of TATA-box-binding protein (TBP), and thereby allows accurate transcription initiation at some promoters. This is certainly an interesting notion that deserves to be directly addressed in further experiments.
ChIP-on-chip experiments using a previously constructed library of CTCF-binding sites from mouse fetal liver  were used to identify sites that are co-occupied by CTCF and Pol II in proliferating and resting NIH 3T3 cells (Chernukhin et al., 2007). Only about 10% of the CTCF sites represented on the microarray interacted with Pol II. Of note, 15 out of the 26 sequences that bound to both CTCF and Pol II were not present in the mouse genome database, which contains almost exclusively euchromatic sequences. Thus, CTCF-Pol-II binding probably also occurs at heterochromatic sequences. Finally, the protein complex was also identified in intergenic regions that are 1.5-15 kb from the nearest gene. Chernukhin and colleagues (Chernukhin et al., 2007) suggest that the CTCF-Pol-II complexes at these sites remain intact until a signal for the release of Pol II is received; the released Pol II then initiates transcription of the neighboring genes from cryptic promoters.
An earlier study that is relevant to Pol-II-and CTCF-mediated insulator function showed that the presence of the chicken insulator HS4 on chromatinized episomes (ectopic, unintegrated DNA constructs that acquire characteristics of chromatin organization in the host cell) in human cells leads to accumulation of Pol II at the enhancer in the β-globin gene locus control region (Zhao and Dean, 2004). This suggested that, as part of its insulator function, CTCF blocks the transfer of Pol II from the enhancer to the promoter. Whether and how these observations relate to the more recent data (Chernukhin et al., 2007) remains to be seen.
More recently, a possible link between CTCF binding and Pol II occupancy was revealed in a genome-wide study (Barski et al., 2007), in which a tantalizing high-resolution profiling of histone methylation patterns in the human genome was undertaken. In addition to mapping 20 histone lysine and arginine methylations, the authors addressed the genome-wide localization patterns of Pol II, histone H2A.Z (see Table 1) and CTCF. Out of the ~20,000 CTCF-binding sites, more than 6000 were in transcribed regions. Unfortunately, the CTCF sites that lie close to Pol II sites were excluded from further analysis to avoid complications in the interpretation of the methylation data, which was the main objective of that study.
The picture that emerges from the study by Chernukhin and colleagues (Chernukhin et al., 2007) is extremely complex; the authors suggest several possible functions of the CTCF-Pol-II complex that are context dependent. It is clear that numerous new questions (concerning the mechanism of a possible TBP-like function for CTCF, the presence and distribution of CTCF-Pol-II complexes at different genomic regions, etc.) arise from this study, and that significant experimental effort will be required to address them.
The CTCF-partner network Above, we have presented and discussed evidence that connects CTCF with individual protein partners, particularly Yy1, cohesin, PARP1, nucleophosmin and Pol II. We have pointed out that the interactions of CTCF with each protein partner occur in a specific biological context. However, it has not escaped our attention that some of the partners are known to interact with each other, thus creating a rather complex network (Fig. 5). For example, nucleophosmin is a recognized partner of PARP1 (Meder et al., 2005), and PARP1 interacts with Yy1 (Oei and Shi, 2001a;Oei and Shi, 2001b). In addition, Yy1 directly interacts with another recognized CTCF partner, YB1 (Chernukhin et al., 2000;Li et al., 1997). It is clear that more research is needed to identify the possible protein interactions in the CTCF network, and to understand the biological contexts in which they work.

Concluding remarks
The data discussed in this Commentary show that CTCF possesses extreme flexibility, not only in terms of the diversity of its binding sites but also with respect to its numerous binding partners. It seems that CTCF performs its numerous functions by using different binding partners in different biological contexts. Two points deserve special mention. First, even with one and the same partner, CTCF is obviously performing a multiplicity of (sometimes seemingly antagonistic) functions. The CTCF-Pol-II interaction might provide a good example of such functional diversity, because such complexes might perform different functions depending on whether they are located in euchromatin or heterochromatic regions. Second, the various partners seem to interact with each other directly or indirectly, which is likely to contribute to the fine-tuning of CTCF function (Fig. 5). There is no doubt that new CTCF protein partners will be identified in the future; they will probably endow CTCF with distinct functions in distinct biological contexts, as the ones that are already recognized appear to do. Will we ever be able to understand this complexity? Is 'divide and rule' the key to success in nature, as well as in society? Fig. 5. Summary of the CTCF protein interaction network from the interactions discussed in this Commentary. The arrows connecting individual partners show recognized interactions. Thus, for example, both Yy1 and CTCF upregulate PARP1 activity; PARP1 and nucleophosmin interact directly, which might contribute to the inhibitory effect of CTCF on ribosomal gene transcription.