Abstract

Epigenetic regulation has been thoroughly investigated in recent years and has emerged as an important aspect of chronic lymphocytic leukemia (CLL) biology. Characteristic aberrant features such as methylation patterns and global DNA hypomethylation were the early findings of the research during the last decades. The investigation in this field led to the identification of a large number of genes where methylation features correlated with important clinical and laboratory parameters. Gene-specific analyses investigated methylation in the gene body enhancer regions as well as promoter regions. The findings included genes and proteins involved in key pathways that play central roles in the pathophysiology of the disease. Τhe application of these findings beyond the theoretical understanding can not only lead to the creation of prognostic and predictive models and scores but also to the design of novel therapeutic agents. The following is a review focusing on the present knowledge about single gene/gene promoter methylation or mRNA expression in CLL cases as well as records of older data that have been published in past papers.

1. Introduction

Chronic lymphocytic leukemia (CLL) is characterized by the accumulation of clonal B cells in peripheral blood, bone marrow, and secondary lymphoid organs. It is the most common leukemia in adults of Western countries and accounts for approximately 2–35 percent of all leukemias in the United States [1]. CLL is more common in men with a male to female ratio ranging from 1.2 : 1 to 1.8 : 1 [1, 2] and is considered to be a disease of the elderly as the median age at diagnosis is 70 years and an incidence increasing rapidly with increasing age [3]. Despite characterized by a common morphology and immunophenotype pattern, CLL shows a variable clinical behavior spectrum. At one end, some CLL patients have a very indolent disease course and may not require treatment for many years, while at the other end, some patients present a very aggressive disease early from diagnosis and require prompt treatment with lower survival rates despite therapy [4].

In terms of genetic pathobiology, at least one of the four common chromosomal abnormalities (deletion 13q14, trisomy 12, deletion 11q22-23, and deletion 17p12) can be detected by interphase fluorescence in situ hybridization in most patients [5]. Most CLL tumors have myriads of somatic gene mutations. Of these, tumor protein 53 (TP53), ataxia telangiectasia mutated (ATM), neurogenic locus, notch homolog protein (NOTCH), and subunit 1 of the splicing factor 3b protein complex (SF3B1) genes are the most frequently mutated at diagnosis [6, 7]. Several other gene mutations seen in CLL tumors are involved in important cellular signaling pathways. NOTCH signaling, B-cell receptor (BCR) signaling, Toll-like receptor, mitogen-activated protein kinase/extracellular signal-regulated kinase (MAPK-Erk) pathway, nuclear factor kappa-light-chain enhancer of activated B cells (NF-kB) signaling, chromatin modifiers signaling, cell cycle signaling, DNA damage signaling, and RNA splicing are some of the many pathways investigated [610]. Moreover, it is well documented that the interaction between CLL tumor cells and antigens is regulated by the somatic hypermutation load of the immunoglobulin heavy chain variable region genes [11].

In the complex domain of the tumor microenvironment, CLL cells seem to influence the kind and arrangement of the cells surrounding them, and they are highly dependent on signals from these cells for their own survival and proliferation, encouraging a leukemia-supportive and immunosuppressive microenvironment [11].

Methylation of DNA is an enzyme-mediated modification of DNA structure without interfering in the specific sequence of the base pairs for the gene encoded. Although DNA methylation is important in normal biologic processes, aberrant patterns of methylation are observed in several malignancies. Specifically, two patterns have been mainly described, namely, large sites of global hypomethylation along the genome and localized areas of hypermethylation at CpG islands, within the gene promoter regions. Ιn more and more studies in neoplastic diseases in recent years, there has been an emergence of evidence that methylation of the promoter regions of hundreds of genes, including tumor suppressor genes, results in the failure to express their normal purpose. In other cases, DNA methylation may represent an early step in the pathway by which normal tissue cells undergo tumorigenesis. Some other possible mechanisms for mutational induction secondary to DNA methylation include failure to produce DNA repair proteins that normally protect from mutations and predisposition to increased oxidative DNA damage, resulting in increased mutation potential [12].

Abnormal DNA methylation has been documented in many solid neoplasms, including breast cancer, lung, prostate, colorectal cancer, and melanoma. As described above, hypermethylation of promoter regions inhibits the expression of tumor suppressor genes while hypomethylation activates an oncogene expression. Global hypomethylation is also of crucial role, leading to genomic and chromosomal instability. For instance, BRCA1/2 genes are two of the most studied genes in breast cancer. The hypermethylation of their promoter regions is reported to result in their inactivation and consequent increased risk of breast cancer [13].

As regards CLL, a global DNA methylation pattern was reported to be relatively stable during the disease course and similar both in resting and proliferative cell compartments, implying that aberrant methylation may present as an early leukemogenic event with early indications of specific gene epigenetic changes in CLL samples being described over 30 years ago [14]. Global DNA hypomethylation was reported few years later [15].

Except CLL, abnormal function of basic methylation‐related enzymes is widely reported in several hematological malignancies, such as in myelodysplastic syndromes, myeloproliferative neoplasms, acute myeloid leukemia, T-cell acute lymphoblastic leukemia, and diffuse large B‐cell lymphoma. The final DNA methylation patterns are suggested to be frequently lineage specific and accompanied by specific secondary mutations [16].

The research in this field led to the study and discovery of a large number of genes whose methylation plays an important role in their regulation, while certain correlations with clinical and laboratory parameters were also identified. The streptothricin acetyltransferase, alpha (SAT-α) gene, whose methylation levels were found to be an independent marker of poor treatment-free survival was an early example [17].

With the appearance and application of newer technics such as whole-genome bisulfite sequencing and DNA methylation arrays, global DNA hypomethylation was confirmed and it was reported that gene body and enhancer regions rather than promoter regions were primarily involved [18, 19] whereas hypermethylation was described to occur mainly in transcribed genomic regions [20] and regions affecting other regulatory mechanisms such as spliceosome [21]. After the CLL genome was discovered to be globally hypomethylated, research for aberrantly methylated oncogene targets revealed that hypomethylation of the B cell lymphoma 2 (BCL2) gene, an important antiapoptotic gene, correlated with higher protein expression in CLL [22]. In the following studies, multiple drug-resistance protein 1 (MDR1) and T-cell leukemia/lymphoma 1 (TCL1) genes were reported to be both hypomethylated and upregulated in CLL [23, 24]. Moreover, the nuclear factor of activated T-cell 1 (NFATc1) gene hypomethylation was also identified and was further shown to be associated with increased mRNA and protein expression, suggesting hypomethylation as a mechanism of constitutive activation of NFATc1 expression in CLL. Through subsequent findings, however, it was discovered that the activation of oncogenes through DNA hypomethylation was a rather infrequent lesion in CLL [25].

To date, several studies have reported gene promoter hypermethylation in CLL patients using both global and single gene approaches. The affected genes include tumor suppressors, transcription factors, genes involved in survival and proliferation, genes with prognostic impact, and microRNA genes. There are several methods that have been used to study gene methylation in chronic lymphocytic leukemia. Each approach has its own advantages and limitations, and the choice of the method depends on the specific research question being addressed. Bisulfite sequencing is considered the gold standard method for analyzing DNA methylation at single-nucleotide resolution. Bisulfite treatment of DNA converts unmethylated cytosine to uracil, while leaving methylated cytosine unchanged. After polymerisation chain reaction (PCR) amplification and sequencing, the pattern of cytosine-to-thymine conversion can be used to determine the methylation status of individual CpG sites. Methylation-specific PCR is a PCR-based method that uses primers specific for either methylated or unmethylated DNA to amplify a region of interest. By comparing the intensity of PCR products amplified using methylated-specific versus unmethylated-specific primers, the methylation status of a given region can be determined. Infinium methylation arrays are microarray-based platforms that can interrogate the DNA methylation status of thousands of CpG sites across the genome. Infinium arrays rely on bisulfite conversion of DNA, followed by hybridization with probes that distinguish between methylated and unmethylated DNA at each CpG site. Methylation-sensitive restriction enzyme digestion relies on the use of restriction enzymes that are sensitive to DNA methylation. By digesting genomic DNA with these enzymes and comparing the resulting fragment patterns with and without prior treatment with a DNA methyltransferase, the methylation status of specific regions can be inferred.

2. The CLL Methylome from a Gene-Specific Perspective

Early in 2004, using genome-wide screening for aberrant promoter methylation in CLL samples, 193 sequences were identified as novel targets for aberrant methylation in CLL. Among them, 173 were homologous to specific genes such as glutamate metabotropic receptor 7 (GRM7), cell division protein kinase 6 (CDK6), T-box transcription factor 3 (TBX3), paired box 5 (PAX5), and protein tyrosine phosphatase nonreceptor type 1 (PTPN1) genes [19]. In another study, where DNA methylation was compared within immunoglobulin heavy-chain gene variable region (IGVH) mutated/unmutated subgroups of CLL, it was reported that in IGVH-unmutated cells, tumor suppressor genes such as von Hippel–Lindau (VHL), Abelson interactor 3 (ABI3), and immunoglobulin superfamily member 4 (IGSF4) were found hypermethylated while genes associated with cell proliferation and migration such as adenosine A3 receptor (ADORA3) and perforin 1 (PRF1) were hypomethylated [26].

Regarding microRNAs, a genome-wide profiling in CLL patients led to the detection of 128 microRNAs carrying aberrantly methylated promoters. Hypermethylated loci included the promoter regions of miR-9-2, miR-124-2, miR-129-2, miR-551b, and miR-708, while miR-21, miR-29a/b-1, miR-34a, miR-155, miR-574, and miR-1204 gene promoters were found hypomethylated [27].

When DNA methylation was studied in paired diagnostic and follow-up samples from IGVH mutated and unmutated CLL patients, genes with prognostic significance including chronic lymphocytic leukemia upregulated 1 (CLLU1), lipoprotein lipase (LPL), zeta chain of T-cell receptor-associated protein kinase 70 (ZAP70) and NOTCH1, the epigenetic regulators histone deacetylases 9 and 4 (HDAC9/4), and DNA methyltransferase 3, beta (DNMT3B), were reported to be aberrantly methylated [14]. Another study used the methylation levels of five specific CpGs to track the cellular origin of CLL, where they seemed to act as enhancers. The varying methylation levels of these biomarkers did not translate into gene expression changes, suggesting that they may not have a functional impact but represented a stable molecular mark [28].

In the context of normal B-cell maturation, CLL DNA methylation was reported to be highly enriched in enhancer and promoter regions, especially in regions of transcriptional elongation and in genes involved in B cell- and lymphocyte-related processes and pathways. Regions targeted for hypomethylation during B cell maturation showed highly significant enrichment for the following six transcription factor families: activator protein 1 (AP-1), early B-cell factor (EBF), runt-related transcription factor (RUNX), octamer-binding transcription factor (OCT), interferon regulatory factor (IRF), and NF-kB [20].

In future, it will be crucial to further develop more efficient and accurate prognostic tools that incorporate clinical, cytogenetic, and molecular data. Therapeutic strategies and agent design are constantly reformed in order to pursue discoveries about the biology of the disease, the pathways involved, and the mechanisms of drug resistance.

To capture the latest discoveries, this review focuses on the present knowledge about single gene/gene promoter methylation in CLL biology while also recording older data that have been published in past papers. Through a review of 126 articles indexed in PubMed, we ended up recording data for 133 genes. Then, we classified them in the following tables according to their associated biological pathways and the related data on CLL studies. It is reasonable for several genes to be associated with more than one pathway. Α characteristic attempt to capture the correlations between most genes and pathways is presented in Figure 1.

In the following tables are presented the genes we identified in our search, with information on their function, as well as the findings of studies regarding CLL. In each table, we introduced in detail, data on the study of methylation or mRNA expression of single genes in CLL patients that showed interesting correlations with clinical and laboratory data.

2.1. Chromosome Maintenance and the Cell Cycle Process
2.1.1. SE Translocation and MYN Domain Containing 3 (SMYD3) and Human Telomerase Reverse Transcriptase (hTERT) Genes (Table 1)

SMYD3 is a chromatin modifier that is involved in the development and progression of several malignancies. Methylation levels in specific SMYD3 gene promoter CpG sites were reported to independently predict time to treatment [29].

Hypermethylation of the hTERT gene in CLL was one of the first to be described. TERT acts as a subunit of telomerase, preventing chromosomal degradation after DNA replication. It was reported that hTERT promoter hypermethylation led to decreased telomerase activity and was associated with superior overall survival [30].

2.2. DNA Repair Mechanisms and Purine Biosynthesis
2.2.1. Ribonucleotide Reductase Subunits 1 and 2 (RRM1 and RRM2) Genes (Table 2)

Ribonucleotide reductase is required for DNA replication and repair and consists of RRM1 and RRM2 proteins. In CLL patients, RRM1 mRNA expression was higher in patients without anemia, absence of lymphadenopathy, and 17p gene deletion. Moreover, abnormal lactate dehydrogenase (LDH) levels and higher Rai stage were associated with lower RRM1 mRNA levels. Higher expression of RRM2 mRNA was detected in patients without lymphadenopathy, Rai stage 0, and trisomy 12. The methylated status of RRM1 promoter significantly correlated with lymphadenopathy presence. The methylated status of RRM1 promoter correlated also with about 4 times lower levels of RRM1 mRNA and with about 10 times lower levels of RRM2 mRNA [39].

2.3. Gene Transcription
2.3.1. Cytotoxic T-Lymphocyte-Associated Protein 4 (CTLA-4) Gene (Table 3)

CTLA-4 protein is involved in gene transcription and acts as an immune checkpoint to regulate T-cell function. Mutations in its gene have been associated with insulin-dependent diabetes mellitus, Grave’s disease, Hashimoto thyroiditis, celiac disease, systemic lupus erythematosus, and other autoimmune diseases. In CLL cases, CTLA4 gene was hypomethylated in the first exon region and body region. Compared to healthy controls, CTLA4 gene had a 128-fold higher expression in CLL samples [37, 59].

2.4. RNA Polymerase I Promoter Opening
2.4.1. V-Maf Musculoaponeurotic Fibrosarcoma Oncogene Homolog B (MAFB) Gene (Table 4)

MAFB gene has been associated with survival parameters in CLL cohorts where differentially hypermethylated regions correlated with inferior post-treatment survival. The protein encoded by MAFB gene is a basic leucine zipper (bZIP) transcription factor with an important role in the regulation of hematopoiesis and nervous system development [66].

2.5. Programmed Cell Death/Cell Apoptosis and p53 Signaling
2.5.1. Integrin Subunit Alpha 4 (ITGA4) Gene (Table 5)

ITGA4 is capable of cell adhesion molecules and fibronectin binding and is involved in apoptotic and integrin pathways. It is reported to be deregulated in CLL with adverse clinical features, suggested as a negative prognostic factor related to a more aggressive course and shorter time to treatment. Protein expression is regulated at the mRNA level and in a methylation-regulated manner. Hypermethylation at specific ITGA4 CpG sites was a common phenomenon in the del13q14+ samples. Moreover, it was demonstrated that the methylation status of the ITGA4 gene at CpG site 1 may have a prognostic role [70].

2.6. Transforming Growth Factor-β (TGF-β) Pathway
2.6.1. Doublecortin-Like Kinase 2 (DCLK2) and Tumor Necrosis Factor Receptor Superfamily Member 1B (TNFRSF1B) Genes (Table 6)

Retrotransposons (also referred as class I transposable elements or transposons through RNA intermediates) are a type of genetic component that copy and paste themselves into different genomic locations (transposons) by retroconverting RNA into DNA via the reverse transcription process using RNA transposition intermediate. In solid neoplasms, universal hypomethylation of these elements has been described. In CLL, locus-specific hypomethylation was detected with differential expression of proximal genes, including DCLK2 and TNFRSF1B genes. Moreover, higher levels of DCLK2 and TNFRSF1B expression were associated with inferior survival. DCLK2 protein is characterized by transferase and protein tyrosine kinase activity, whereas TNFRSF1B protein by ubiquitin protein ligase binding and tumor necrosis factor (TNF) receptor activity. Both proteins are involved in TGF-β and TNF pathways [88].

2.7. Wingless-Related Integration Site (WNT) and Hedgehog Pathways
2.7.1. WNT Family Member 5A (WNT5A) Gene (Table 7)

WNT5A gene encodes a receptor tyrosine kinase-like orphan receptor (ROR1) ligand with DNA-binding transcription factor activity and is involved in the regulation of p21 activated kinase 2 (PAK-2), WNT signaling, and developmental pathways during embryogenesis. It is known that its expression differs in CLL patients with worse prognosis in the IGVH-mutated subgroup. Methylation levels of all CpG sites in the WNT5A gene promoter were lower in the group of the intermediate genome methylation profile. In the memory-like and intermediate genome methylation profile groups, promoter methylation and subsequent undetectable WNT5A expression correlated with longer treatment-free survival [95].

2.8. NFAT and T-Cell Receptor (TCR) Signaling
2.8.1. NFATC1 Gene (Table 8)

NFATC1 is part of the NFAT transcription complex. It is primarily involved in gene transcription during the immune response and acts as a downstream regulator of the BCR signaling pathway. As with many other proteins, alternative splicing leads to multiple transcription variants, which in turn may induce the expression of different cytokine genes. Moreover, NFATC1 protein is a central target for immunosuppressive agents. Regarding CLL, when the genome-wide DNA methylation study was performed, NFATC1 gene was reported to be hypomethylated and upregulated. Moreover, NFATC1 gene promoter DNA hypomethylation correlated inversely with RNA levels and was associated with Binet disease staging and thymidine kinase levels, suggesting a potential central role of NFATC1 in CLL pathobiology [105].

2.9. NF-kB Pathway
2.9.1. Leucine Zipper Was Downregulated in Cancer 1 (LDOC1) Gene (Table 9)

LDOC1 gene encodes a leucine zipper protein and is suggested to regulate the NF-κB pathway through the plasma membrane ATPase or TNF-mediated pathway of apoptosis. Its expression has been studied in oral squamous cell carcinoma and pancreatic cancer cells. Regarding CLL, unmethylated status was found in IGVH-unmutated cases [26, 111, 112].

2.10. MAPK/Erk Pathway
2.10.1. Angiopoietin 2 (ANGPT2) Gene (Table 10)

ANGPT2 is a glycoprotein involved in angiogenesis. Conflicting data regarding the involvement of Erkand phosphoinositide 3-kinase/protein kinase B (PI3K/Akt) pathways in the regulation of ANGPT2 suggest a possible cell-type specific control for this gene in solid tumors. As regards CLL, higher levels of ANGPT2 methylation in the IGHV-mutated population showed a possible epigenetic regulation of this gene. Furthermore, it was reported that ANGPT2 expression is highly dependent on the DNA methylation status, where a lower degree of methylation was associated with a particularly poor prognosis in CLL. Importantly, the percentages of methylation showed negative correlations with ANGPT2 mRNA expression, suggesting that methylation of the ANGPT2 promoter leads to this gene silencing. CLL cases with highly methylated gene status had a more favorable prognosis. Considering that normal B cells showed high levels of ANGPT2 methylation, the apparently aberrant low ANGPT2 methylation levels found in aggressive CLL patients indicate that these latter cases lose normal epigenetic control [14, 119121].

2.11. NOTCH Signaling
2.11.1. CREBBP (cAMP Response Element-Binding Protein Binding Protein) Gene (Table 11)

In a small Korean cohort, genome-wide methylation profiling identified the CREBBP gene with no known relevance to CLL to be differentially methylated among other genes previously known to be affected in CLL. CREBBP gene encodes chromatin-modifying enzymes and has been described in diffuse large B cell lymphoma, acute lymphoblastic leukemia, and lung cancer [130].

2.12. JAK-STAT Pathway
2.12.1. SOCS-1 (Suppressor of Cytokine Signaling 1) Gene (Table 12)

SOCS-1 is an inhibitor of cytokine signal transduction. Its gene expression can be positively affected by interleukins 2 and 3, erythropoietin, granulocyte macrophage colony-stimulating factor, and interferon-gamma. Gene polymorphisms of SOCS1 and its expression have been studied in several malignancies including diffuse large B-cell lymphoma and acute lymphoblastic leukemia, where, particularly, the expression level of SOCS1 was lower compared to the control group. In CLL, the SOCS1 gene body was hypermethylated in no case [135].

2.13. Phosphatidylinositol-3 Kinase/Protein Kinase B (PI3K/Akt) Signaling
2.13.1. T-Cell Leukemia/Lymphoma 1 Oncogene (TCL1A) Gene (Table 13)

Abnormal expression of TCL1A gene in mouse B-cells led to a leukemia phenotype similar to aggressive human CLL. It is demonstrated that TCL1A physically interacts with DNA methylthansferases 3A and 3B. TCL1A is suggested to act as a coactivator of Akt, activator protein 1 (AP1), and NF-κB pathways with a potential involvement in CLL cells resistance to apoptotic mechanisms. TCL1A expression seems to be strongly associated with the expression levels of ataxia-telangiesctasia mutated (ATM) gene in malignant and nonmalignant B cells. TCL1A gene promoter was hypomethylated in CLL cells and significantly correlated with TCL1A transcription enhancement [139].

2.14. Rho GTPases/Rhodopsin-Like Receptors
2.14.1. Endothelin-1 (ET-1) Gene (Table 14)

This gene is responsible for the creation of a peptide that belongs to the endothelin/sarafotoxin family. This peptide acts as a potent vasoconstrictor and its receptors have been well studied as therapeutic targets in the treatment of pulmonary arterial hypertension. Regarding ET-1 gene expression, alternative splicing leads to several transcript variants and abnormal expression is thought to promote tumorigenic activity, with ET-1 expression being under the constant control of the NF-kB signaling pathway. In CLL, ET-1 is reported to be involved in survival, drug resistance, and growth signaling of leukemic cells. Moreover, it is reported that basal expression levels of ET-1 are affected when high methylation in the region of ET-1 gene first intron is detected [141].

2.15. Class I Major Histocompatibility Complex (MHC)-Mediated Antigen Processing
2.15.1. Ubiquitin Conjugating Enzyme E2 R2 (UBE2R2) Gene (Table 15)

UBE2R2 is involved in Class I MHC-mediated pathways as well as in the metabolism of certain proteins. To develop a tool useful in stratifying CLL patients on a specific methylation signature basis and focusing on time to treatment, it was demonstrated that UBE2R gene methylation levels independently predicted time to treatment [29].

2.16. Estrogen Receptor (ESR)-Mediated Signaling
2.16.1. Estrogen Receptor 1 (ESR1) Gene (Table 16)

ESR1 gene encodes a ligand-dependent transcription factor. ESR1 acts through direct binding to specific estrogen response elements and is associated with other transcription factors. ESR-mediated signaling has been studied in several solid tumors and hematological malignancies. When studied in CLL samples, ESR1 gene was amplified in 15% of the samples examined with a copy number loss frequency ranging between 1 and 10%. The ESR1 gene promoter region was methylated in one out of ten CLL samples controlled, and in normal B cell, the gene promoter was completely unmethylated [51].

2.17. Angiogenesis/Erythropoiesis
2.17.1. Thrombospondin-1 (THBS1) Gene (Table 17)

THBS1 acts as an adhesion-specific glycoprotein. THBS1 is able to bind fibrinogen, fibronectin, laminin, and type V collagen and is involved in platelet aggregation, angiogenesis, and tumorigenesis. In CLL studies, THBS1 gene promoter methylation was detected in 50% of the samples, while in normal samples, the gene promoters were unmethylated [51].

2.18. Slit Glycoprotein/Roundabout Receptor (SLIT/ROBO) Signaling and Nervous System Development
2.18.1. Protein Tyrosine Phosphatase Type O (PTPRO) Gene (Table 18)

The methylation levels of the PTPRO gene have been studied in the pathobiology of several neoplasms including hepatocellular cancer and various lung tumors, with indications of potential tumor suppressor features. Regarding CLL, PTPRO gene promoter was also methylated and gene expression was suppressed compared to normal B cells, with the overall expression of PTPRO being lower in CLL lymphocytes than in normal samples [51].

2.19. Drug Uptake and Sensitivity Mechanisms
2.19.1. Solute Carrier Family 22 Member 18 (SLCO3A1) Gene (Table 19)

Methylation of the SLCO3A1 gene has been associated with survival features in CLL cohorts where specific differentially hypermethylated regions were linked to inferior post-treatment survival. SLCO3A1 gene encodes an anion transporter that is suggested to be involved in transport of inorganic cations/anions and amino acids/oligopeptides, transport of vitamins, nucleosides, and related molecules, as well as drug uptake mechanisms [66].

2.20. Various Other Genes Investigated
2.20.1. GATA-Binding Protein 5 (GATA5) Gene (Table 20)

GATA5 contains two GATA-type zinc fingers and is related to hepatocyte nuclear factor-1alpha. Promoter methylation of the GATA5 gene has been involved in gastric cancer biology. When studied in CLL, GATA 5 gene hypermethylation was detected in 35.1% of the cases. Another study reported that genomic regions which became hypomethylated prior to specific CLL treatment initiation but also after disease relapse were enriched for binding sites of several transcription factors related to CLL pathogenesis, including GATA5. Compared to normal B-cells, these hypomethylated regions were also enriched for the GATA 5 transcription factor [51, 167].

2.20.2. SH3 and Multiple Ankyrin Repeat Domain 1 (SHANK1) Genes (Table 20)

SHANK1 gene encodes a protein acting as a scaffold molecule required for the development and function of neuronal synapses. In CLL samples, a specific CpG region of the SHANK1 gene body (hg19) was reported to be hypermethylated when compared to control samples. Moreover, without yet being strongly supported, it was reported that methylation in the same CpG was detectable in blood samples collected years before CLL diagnosis [170].

2.20.3. RalA-Binding Protein-Associated Eps Domain Containing 1 (REPS1), Interleukin 1B (IL1B), and ATPase Phospholipid Transporting 9B (ATP9B) Genes (Tables 21 and 22)

Specific CpG sites at which the methylation levels independently predicted time to treatment were detected and some of them located in the gene bodies of REPS1, IL1B, and ATP9B. REPS1 protein has calcium ion and SH3 domain-binding activity with involvement in vesicle transport and endocytosis. IL1B protein has interleukin-1 receptor-binding activity and is involved in MIF-mediated glucocorticoid regulation. ATP9B protein is capable of nucleotide binding and ATPase-coupled monoatomic cation transmembrane transporting and is involved in ion channel transport and cardiac conduction mechanisms [29].

2.21. Noncoding RNAs
2.21.1. Colorectal Neoplasia Differentially Expressed (CRNDE) Gene (Table 23)

The expression of the CRNDE gene is increased in proliferating tissues and is involved in the expression of genes associated with metabolism and in neoplasms such as colorectal adenomas and adenocarcinomas. Its related transcription is inversely regulated by insulin and insulin-like growth factors. When studied in CLL cell lines, CRNDE was downregulated and the methylation level of CRNDE promoter was higher than in normal B cells. After exposure to demethylating agents, an increase of CRNDE expression levels was reported [183].

3. Conclusion

Some studies have focused on the potential clinical impact of DNA methylation in CLL phenotypes. Based on the findings abovementioned, time to treatment, an essential feature of CLL management, was reported to be predicted by the methylation status of SMYD3 gene promoter and UBE2R gene methylation levels [29]. Methylation levels in specific CpG sites located in the gene bodies of REPS1, IL1B, and ATP9B genes also correlated with the time to treatment [29].

Regarding overall survival, hTERT gene promoter hypermethylation was associated with superior overall survival [30], and specific hypermethylated MAFB gene loci correlated with inferior post-treatment survival [66]. Moreover, locus-specific hypomethylation of retrotransposons proximal to DCLK2 and TNFRSF1B was related with higher levels of DCLK2 and TNFRSF1B expression and subsequent inferior survival [88]. When CLL patients were classified upon specific IGVH mutation patterns, promoter methylation of the WNT5A gene showed longer treatment-free survival in a subgroup of the patients [95]. Lower levels of DNA methylation of the ANGPT2 gene were related to a particularly poor prognosis in CLL patients [14, 121].

Findings that could be useful in staging and correlating methylation with pre-existing prognostic factors in CLL include RRM1/2 expression, the promoter methylation status that correlated with lymphadenopathy, 17p gene deletion, Rai stage, and trisomy 12 [39], and the documentation of hypermethylation at specific ITGA4 CpG sites in del13q14+ samples [70]. NFATC1 gene promoter hypomethylation was associated with Binet disease staging [105].

Drug sensitivity and mechanisms of resistance development are another important area of study in CLL. Hypermethylation of specific SLCO3A1 gene loci, a gene which among others is involved in drug uptake mechanisms, has also been associated with inferior post-treatment survival [66]. Another gene, ET-1, is related with drug-resistance potential and its expression is affected upon high methylation levels in its first intron region in patients with CLL [141].

Α last finding that could theoretically have a clinical application in CLL cases is about the SHANK1 gene. A specific CpG locus of its body (hg19) was hypermethylated when compared to control samples. Also, it was even weakly supported that methylation in the same CpG was detectable in blood samples collected years before CLL diagnosis [170]. It is a finding that, if confirmed, is very interesting and could be used in panels for early diagnosis of the disease, certainly in combination with other markers in the context of a population screening program or the surveillance of high-risk patients. Certainly, such methods of investigation have their peculiarities and must be applied after appropriate research and discussion in order to be necessary or beneficial for the patients.

Our findings suggested that certain methylation patterns have been associated with disease progression and survival features and may be used as prognostic markers. In addition, some studies have investigated the use of hypomethylating agents as a part of a therapeutic strategy in CLL.

Regarding findings from previous studies that focused on studying large numbers of genes, data are generally limited but quite intriguing. Kanduri et al. applied high-resolution methylation microarrays (27 578 CpG sites) to CLL samples, which were classified in IGHV-mutated (favorable) and IGHV-unmutated/IGHV3-21 (poor-prognostic) subsets. Results demonstrated significant differences in methylation patterns between these subgroups. In IGHV-unmutated cases, they reported methylation of known or potent tumor suppressor genes (for example, VHL, ABI3, and IGSF4) as well as unmethylated genes involved in cell growth and tumor progression (ADORA3 and PRF1). In contrast, these latter genes were silenced by methylation in IGHV-mutated patients. Moreover, they reported the reinducing of four methylated tumor suppressor genes (including VHL and ABI3) in IGHV-unmutated samples using the methyl inhibitor 5-aza-2′-deoxycytidine [26]. Pei et al. reported a genome-wide DNA methylation analysis with 1,764 gene promoters being identified as differentially methylated in at least one sample when compared with normal B cell samples. Aberrant hypermethylation was discovered in all HOX gene clusters and a significant number of WNT signaling pathway genes. The NFATc1p2 promoter and first intron hypomethylated status correlated with the upregulation of both NFAT protein expressions [25]. Kulis et al. performed a wide analysis of the DNA methylome in normal B cells and CLL samples. They identified widespread hypomethylation targeting mainly the gene body and enhancer regions, suggesting that DNA methylation may be functionally relevant beyond promoter regions. Moreover, they reported that distinct patterns in DNA methylation were recorded between different CLL subtypes [18]. Cahill et al. used high-resolution 450 K arrays to analyze samples from IGHV-mutated or untreated and IGHV-unmutated or treated patients. They identified 2239 differentially methylated CpG regions between IGHV-mutated and -unmutated patients, where the majority of the regions were placed outside annotated CpG islands. Known CLL prognostic genes (i.e., LPL, ZAP70, and NOTCH1), epigenetic regulators (HDAC9/4 and DNMT3B), B-cell signaling, and numerous TGF-β and NF-κB/TNF pathway genes were differentially methylated between the subgroups [14]. Barrow et al. reported 490 differentially methylated regions after exposure to therapy. Among them, 31 were CLL related. Seventeen genes were classified as differentially expressed, following treatment in an independent cohort. Methylation of the HOXA4, MAFB, and SLCO3A1 differentially methylated regions associated with post-treatment patient survival and HOXA4 displayed the strongest association. Reinducing of HOXA4 expression in cell lines and primary CLL cells increased apoptosis following treatment with fludarabine, ibrutinib, and idelalisib [66]. Lastly, Zhang et al. identified 34,797 differentially methylated positions related to CLL. Most of them were hypomethylated and located in gene body sites. They combined these positions with existing DNA methylation and RNA sequencing data and identified regions associated with 1,130 genes whose expression was significantly different in CLL samples [37].

The limitations of this study include possible evidence selection bias because data from statistically significant studies are more likely to be published than those that are not statistically significant. Moreover, in some limited reports, it was not stated whether the methylation study included the gene body, the promoter region, or some other related loci. Lastly, some papers did not reported correlations with patients’ clinical data such as sampling time, staging, or prognostic and treatment features.

To summarize, there are still many mechanisms that need to be investigated in order to define the extent of global aberrant DNA methylation in different prognostic groups, the fundamental role of DNA methylation in sites other than CpG islands, and the interaction of DNA methylation with other regulatory processes in the pathogenesis of CLL. Moreover, with the prospect that there is and will continue to be increasing data on gene methylation, it should be noted that a large number of findings will concern passenger DNA methylation events that need to be identified accordingly.

In conclusion, at some points, the research raises even more questions than answers. Further research is still needed to fully understand the complex interplay between DNA methylation and other epigenetic and genetic alterations in CLL and to develop more effective targeted therapies and prognostic stratification tools for this disease.

Data Availability

The datasets used to support the findings of this study are available from the corresponding author on reasonable request.

Disclosure

Citations are included in the reference list.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Authors’ Contributions

SC and N-AV had the idea for the article. SC performed the literature search and wrote the first draft of the manuscript. N-AV critically revised the work. All the authors have discussed the results, drafted further work, and approved the version to be published. All the authors have agreed to be responsible for the accuracy and integrity of any part of the manuscript.