Abstract
Metformin is the first-line treatment for type 2 diabetes (T2D) in youth but with limited sustained glycemic response. To identify common variants associated with metformin response, we used a genome-wide approach in 506 youth from the Treatment Options for Type 2 Diabetes in Adolescents and Youth (TODAY) study and examined the relationship between T2D partitioned polygenic scores (pPS), glycemic traits, and metformin response in these youth. Several variants met a suggestive threshold (), though none including published adult variants reached genome-wide significance. We pursued replication of top nine variants in three cohorts, and rs76195229 in ATRNL1 was associated with worse metformin response in the Metformin Genetics Consortium (n = 7,812), though statistically not being significant after Bonferroni correction (). A higher β-cell pPS was associated with a lower insulinogenic index () and C-peptide () at baseline and higher pPS related to two insulin resistance processes were associated with increased C-peptide at baseline (). Although pPS were not associated with changes in glycemic traits or metformin response, our results indicate a trend in the association of the β-cell pPS with reduced β-cell function over time. Our data show initial evidence for genetic variation associated with metformin response in youth with T2D.
1. Introduction
The incidence of type 2 diabetes (T2D) in youth is increasing in the United States and worldwide [1, 2]. Youth with T2D have an aggressive disease course with early onset and severe burden of complications [3]. Metformin is currently the foundation of treatment of T2D and remains one of the few FDA-approved options in addition to insulin and glucagon-like peptide receptor agonists for the management of T2D in youth. However, despite a good initial response [4, 5], over time, youth with T2D have poorer responses to metformin than those observed in adults. For example, despite initial good responses, 52% of the youth participants in the Treatment Options for Type 2 Diabetes in Adolescents and Youth (TODAY) study failed to have a sustained glycemic response to metformin therapy [6], whereas only 12% of adults with T2D in the ADOPT study failed after the same duration of metformin treatment [7]. Understanding reasons for variations in response to metformin is needed to characterize individuals into likely responders and nonresponders and to shed further light on the mechanism(s) of action underlying metformin response in youth.
Using the genome-wide complex trait analysis method, the heritability of metformin response is estimated to explain a substantial proportion (21–34%) of the variation in metformin response depending on how glycemic response is measured [8]. Indeed, genome-wide association studies (GWAS) have revealed loci associated with metformin response in adults with established T2D as well as in adults at high risk for T2D [9–12]. However, the genetic determinants of metformin response in youth remain unexplored. Our objective was to evaluate the genetic determinants of metformin failure in youth through a genome-wide approach by searching for novel variants and examining the effect of known genetic variants associated with metformin response in adults. A secondary objective was to evaluate the biological mechanisms underlying metformin response using partitioned polygenic scores (pPS) derived from genetic clustering of T2D loci.
2. Methods
2.1. Description of Participants
This study was undertaken by the Progress in Diabetes Genetics in Youth (ProDiGY) consortium [13], a collaboration of the TODAY [6], SEARCH for Diabetes in Youth [14], and T2D-GENES [15] study groups. We examined genetic determinants of metformin response in 506 youth with T2D from the TODAY study, after excluding participants with monogenic diabetes (n = 22) [16, 17]. The design and results of the TODAY study have been previously described [6], with the primary outcome being loss of glycemic control, defined as a HbA1c ≥ 8% for 6 months, or sustained metabolic decompensation requiring insulin. Of note, the American Indian Tribal Nations that partnered with the TODAY study elected not to participate in the genomics collection [18].
2.2. Genotyping, Imputation, and Quality Control
Samples were genotyped on the Infinium array by the Genomics Platform at the Broad Institute. Genotypes were called using the autocall algorithm with quality control steps run in PLINK2 and R-3.4. Imputation was performed using the TOPMed Imputation Server against the TOPMed r2 panel as the reference, with the imputation threshold (R2) set at 0.5, yielding 24,813,350 autosomal single nucleotide polymorphisms (SNPs) for analysis.
2.3. Construction of Partitioned Polygenic Scores (pPS)
The methods to construct the pPS have been previously described [19]. Briefly, a soft-clustering approach was used on 94 genetic variants associated with T2D risk and 47 diabetes-related traits to create five pPS, namely, two clusters representing reduced β-cell function, differing from each other by high versus low proinsulin levels and three other clusters that displayed features of insulin resistance, namely, (1) obesity-mediated, (2) “lipodystrophy-like” fat distribution, and (3) disrupted liver and lipid metabolism. In TODAY, individual pPS was constructed for each participant by multiplying the number of risk alleles present per SNP by the cluster weight reported for that SNP and then summing the results over the SNPs.
2.4. Replication Analyses
An evaluation of top findings () was conducted within three adult cohorts: the Metformin Genetics Consortium (MetGen) [10], the Diabetes Prevention Program (DPP) [20], and the Study to Understand the Genetics of the Acute Response to Metformin and Glipizide in Humans (SUGAR-MGH) [21]. These cohorts have independently performed GWAS for metformin response [10, 22, 23]. A total of 10 lookups were done, and each SNP was evaluated for association with metformin response based on the outcomes and covariates used in the respective GWAS (Supplementary Table 2). Binomial tests were performed to compare the effects of the top SNPs associated with metformin response in ProDiGY with data from MetGen, DPP, and SUGAR-MGH.
2.5. Statistical Analysis
Genome-wide analysis of time to metformin treatment failure was run under a Cox proportional hazards model in gwasurvivr (an R package) using an additive genetic model, adjusting for age, sex, top three principal components (PCs), and treatment arms (metformin alone, metformin + rosiglitazone, and metformin + lifestyle), similar to the prior analyses in the TODAY study [6]. For the pPS analyses, general linear models were used to test association with glycemic traits and change in traits over 6 months. The association between pPS and treatment failure as defined by TODAY was tested using a Cox proportional hazards model. The cluster analyses were adjusted for age, sex, first three PCs, and treatment arms and were run in R-4.0.
3. Results
3.1. Baseline Demographics
The demographics of the 506 TODAY participants at baseline are summarized in Table 1. The mean age was 14 ± 2 years, 65% were female, and the mean BMI Z-score was 2.23 ± 0.5. The majority of participants were youth of color with 20% identifying as non-Hispanic White, 37% as non-Hispanic Black, and 35% as Hispanic. Mean HbA1c at the end of the run-in period and prerandomization was 6.0 ± 0.7%. The quantile-quantile plot is shown in Figure 1 and λGC was 1.09, filtering for a minor allele frequency of 5%.

3.2. Genome-Wide Association Testing
Several genetic variants (n = 10) met a suggestive significance threshold of , though none reached genome-wide significance (Manhattan plot is shown in Figure 2). Top findings are shown in Supplementary Table 1.

3.3. Replication Analyses
Given the modest sample size, top findings were examined across the cohorts of MetGen (n = 7,812), DPP (n = 1,763), and SUGAR-MGH (n = 807) where metformin response has been defined in adults (results in Supplementary Table 3). rs76195229 in an intron of ATRNL1 was significantly associated with worse metformin response (β = 0.336 ± 0.125, ) in MetGen where the outcome was glycemic response, as measured by baseline minus minimum on-treatment HbA1c within 18 months after metformin initiation. However, when accounting for the nine variants that were evaluated, the findings were no longer statistically significant (). Our top variants were not significant in the DPP or SUGAR-MGH cohorts. Binomial tests to compare the top variants in ProDiGY with the replication cohorts showed that 70% (), 90% (), and 60% () of the SNPs had the same direction of effect in the MetGen, SUGAR-MGH, and DPP cohorts, respectively. We also performed lookups of published variants associated with metformin response in adults as well as variants associated with metformin transporters (Supplementary Table 4) and did not find any associations at .
3.4. Genetic Cluster Analyses
For quality control, we examined the association of the pPS for each of the five T2D genetic clusters with select metabolic traits and the results were in the expected direction and similar to findings in adults [19, 24] (Supplementary Table 5). The associations between pPS and quantitative glycemic traits at baseline are shown in Table 2. A higher β-cell cluster score was significantly associated with a lower insulinogenic index and C-peptide. For the clusters representing features of insulin resistance, higher lipodystrophy and liver/liver pPS were associated with increased C-peptide levels. The association between pPS and change in glycemic traits from baseline to 6 months were not significant, but there was a trend in the association of the β-cell cluster, worsening C-peptide index over time (Supplementary Table 6). The associations between pPS and metformin response using the Cox proportional hazards model were not significant (Supplementary Table 6).
4. Discussion
To our knowledge, this is the first large-scale evaluation of the genetics of metformin response in youth with T2D. Though we did not identify any genome-wide significant findings, we were able to identify several associations that met a suggestive threshold. As participants were subject to a run-in period and needed to maintain HbA1c of <8% on metformin monotherapy for randomization, it is possible that the run-in period excluded those with the poorest response to metformin and removed some variation within the sample, thus reducing power. We also validated pPS derived from genetic clustering of T2D loci in our youth-onset T2D population, based on associations with glycemic and metabolic traits that were consistent with associations observed in adults.
Although our study represents the largest existing genetic dataset for youth with T2D, our sample size was modest. We therefore chose to evaluate our top findings in adult cohorts with well-defined metformin response. We show a trend towards significance for association between rs76195229 and metformin response in adults from MetGen, the largest meta-analysis evaluating glycemic response to metformin in adults with T2D [10]. rs76195229 is an intronic variant in the ATRNL1 (attractin like 1) gene on chromosome 10 and is predicted to be associated with carbohydrate binding. According to the UniProt Knowledgebase, ATRNL1 may influence melanocortin signaling in pathways that regulate energy homeostasis. In MetGen, individuals who were homozygous for this variant had a 0.34% higher HbA1c on metformin compared to those with the wild-type allele. In addition to rs76195229, several of our top results have associations with glycemic and metabolic traits. As an example, rs10040292 in the AFAP1L1 intron is associated with waist-hip-ratio in the GIANT-UK Biobank GWAS meta-analyses () and with insulin sensitivity in GENESIS GWAS (). A complete list of associated traits for our top findings is listed in Supplementary Table 1. These findings merit exploration in other pediatric cohorts. We could not confirm the reported genetic associations influencing glycemic response to metformin that have been found in adults, either because our sample size was not large enough to detect these associations based on the reported effect sizes or because genetic variation may influence metformin response differently in youth compared to adults with T2D. Another factor to consider is adherence to metformin which has been shown to be worse in younger populations compared to adults [18]. Our data here are from the original TODAY clinical trial where there was frequent contact with participants and where medication adherence was greater than 70% across all treatment arms and not found to be a factor associated with metformin treatment failure [25].
In the cluster analyses, a greater cluster score for β-cell function was associated with lower baseline β-cell function and a trend towards reduced β-cell function over time. This is similar to results observed in adults at risk for diabetes in the Diabetes Prevention Program, where a high β-cell pPS was associated with an increased risk of diabetes and worsening in insulin secretion despite interventions with intensive lifestyle and metformin [26]. In the future, analyses of process-specific genetic clusters, particularly when combined with clinical phenotyping, could offer additional insight on the mechanisms of disease and drug response.
While studies in adults with T2D have shown variants associated with metformin response, there are virtually no such studies in youth to date. Youth in the TODAY study who were subsequently found to have HNF4A Maturity Onset Diabetes of the Young (MODY) were more likely to experience glycemic failure on metformin, a finding that was not surprising given their expected preferential response to sulfonylureas [17]. A study of 124 children with obesity randomized to either metformin or placebo for weight loss over a 6-month period conducted post hoc genotyping for 225 candidate SNPs previously associated with obesity or metformin pharmacogenetics. The authors did not identify any statistically significant associations of the chosen variants with weight change on metformin, but there was a trend towards significance for 28 common variants including novel variants in ADYC3 and BDNF which were associated with worse response and improved response, respectively [27].
Strengths of our study include the detailed phenotyping and longitudinal characterization of metformin response in the TODAY study. Additionally, our cohort was multiethnic and truly representative of youth-onset T2D with the majority of participants being youth of color. Lastly, in ProDiGY, we have established the largest known genetic dataset for youth-onset T2D that can be meta-analyzed with future studies, as the burden of youth-onset T2D continues to increase [2, 28]. We attempted to counter the modest sample size for genetic analyses with validation in three independent cohorts and through lookups of all variants associated with metformin response in adults. Additional limitations include the different definitions of metformin response in the replication cohorts, the white European predominance of the MetGen dataset, and the exclusion of metformin failures during run-in in TODAY.
In conclusion, we have generated a resource that may help prioritize genetic determinants of metformin response in youth with T2D from the TODAY study. As the burden of T2D in youth continues to increase, pediatric clinical studies should prioritize collection of genetic data so that future studies are sufficiently powered to detect significant associations.
Data Availability
The dataset analyzed in the current study is available at dbGap (dbGaP Study Accession: phs001511.v1.p1, https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs001511.v1.p1).
Conflicts of Interest
The authors declare that they no conflicts of interest.
Authors’ Contributions
S.S. assisted in generation of the analysis plan, interpretation of data, wrote the first draft of the manuscript, and edited and approved the manuscript. L.C. conducted the analysis and edited and approved the manuscript. M.U., J.T., M.M.K., M.W.H., S.A., P.Z., R.G-K., K.N., K.K., N.W., T.P., and J.C.F. assisted in forming the analyses plan, interpretation of data, and edited and approved the manuscript. J.H.L, J.A.P., V.K., L.B., J.M.M., A.D., E.P., S.W.Y., and K.G. participated in the replication analyses and edited and approved the manuscript. S.S. is the guarantor of this work and, as such, had full access to all the data in the study and takes responsibility for the integrity of the data and the accuracy of the results.
Acknowledgments
S.S. is supported by NIH grant K23 DK120932. J.H.L. is supported by NIH grant K23 DK131345. J.C.F. is supported by NIH grant K24 HL157960. This work was completed with funding from NIDDK and the NIH Office of the Director (OD) through grants U01-DK61212, U01-DK61230, U01-DK61239, U01-DK61242, and U01-DK61254; from the National Center for Research Resources General Clinical Research Centers Program grant numbers M01-RR00036 (Washington University School of Medicine), M01-RR00043-45 (Children’s Hospital Los Angeles), M01-RR00069 (University of Colorado Denver), M01-RR00084 (Children’s Hospital of Pittsburgh), M01-RR01066 (Massachusetts General Hospital), M01-RR00125 (Yale University), and M01-RR14467 (University of Oklahoma Health Sciences Center); and from the NCRR Clinical and Translational Science Awards grant numbers UL1-RR024134 (Children’s Hospital of Philadelphia), UL1-RR024139 (Yale University), UL1-RR024153 (Children’s Hospital of Pittsburgh), UL1-RR024989 (Case Western Reserve University), UL1-RR024992 (Washington University in St Louis), UL1-RR025758 (Massachusetts General Hospital), UL1-RR025780 (University of Colorado Denver), and NIDDK DRC P30-DK116073 (University of Colorado Denver). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Supplementary Materials
Supplementary Table 1: nominally significant genome-wide significant findings with a value <1 × 10−6. Supplementary Table 2: primary outcome and models used in replication analyses. Supplementary Table 3: results of replication analyses. Supplementary Table 4: lookup of variants relevant to metformin response in adults and metformin transport. Supplementary Table 5: quality control tests using partitioned polygenic scores (pPS) on metabolic traits. Supplementary Table 6: results of pPS analysis. (Supplementary Materials)