A comprehensive genomic survey encompassing the genetic blueprints of over 900,000 individuals has illuminated a striking phenomenon: certain segments within our DNA exhibit a pronounced tendency toward increased instability as the passage of time progresses. These particular genomic territories are characterized by the repetitive nature of exceptionally brief nucleotide sequences, which, according to the findings of this extensive study, tend to elongate progressively throughout an individual’s lifespan. Furthermore, the research team uncovered compelling evidence that common, inherited variations in our genetic makeup play a significant role in modulating the pace at which this expansion occurs, capable of accelerating or decelerating the process by as much as a factor of four. In certain instances, the phenomenon of expanded DNA repeats has been demonstrably linked to the manifestation of severe health complications, including but not limited to renal failure and hepatic dysfunction.
The implications of these findings are profound, particularly in the context of inherited diseases, as expanded DNA repeats are recognized as the underlying cause for more than sixty distinct genetic disorders. These debilitating conditions arise when the normally consistent patterns of repeating genetic sequences extend beyond their physiological limits, thereby disrupting the intricate and essential functions of cellular machinery. Prominent examples of such disorders include Huntington’s disease, a neurodegenerative condition; myotonic dystrophy, which affects muscle function; and specific forms of amyotrophic lateral sclerosis (ALS), a progressive motor neuron disease.
While the scientific community has long acknowledged that the majority of individuals carry DNA repeats that undergo gradual expansion throughout their lives, a thorough examination of the prevalence of this instability and the specific genetic regulators governing it, particularly utilizing the vast datasets now available from large biobanks, had remained elusive. This groundbreaking research conclusively demonstrates that repeat expansion is a far more pervasive characteristic of the human genome than was previously understood. Moreover, it has pinpointed dozens of genes that actively participate in the regulation of this complex biological process, thereby opening novel avenues for the development of therapeutic interventions aimed at mitigating the progression of associated diseases.
The methodology employed by the research consortium, a collaborative effort involving esteemed institutions such as UCLA, the Broad Institute, and Harvard Medical School, involved a meticulous analysis of whole-genome sequencing data. This dataset was drawn from an impressive cohort of 490,416 participants enrolled in the UK Biobank and an additional 414,830 individuals from the All of Us Research Program. To effectively process and interpret this immense volume of genetic information, the scientists developed sophisticated new computational algorithms. These innovative tools are specifically designed to accurately measure both the length of DNA repeats and their inherent instability, even when working with standard sequencing data.
Leveraging these advanced analytical instruments, the research team systematically investigated 356,131 distinct locations within the human genome known for their variable repeat sequences. Their objective was to meticulously track the alterations in repeat lengths over time, specifically within blood cells, and to identify inherited genetic variants that exerted an influence on the rate of this expansion. Concurrently, the researchers conducted extensive searches for correlations between the observed patterns of repeat expansion and thousands of different disease outcomes, thereby aiming to uncover previously unrecognized associations with human pathologies.
A cornerstone of the study’s revelations is the consistent observation that common DNA repeats within blood cells undergo a discernible expansion as individuals age. The investigation successfully identified 29 specific regions of the genome where inherited genetic variations were found to significantly modulate the rates of repeat expansion. The impact of these variations was substantial, with observed differences in expansion rates reaching up to a fourfold disparity between individuals exhibiting the highest and lowest genetic risk profiles for accelerated repeat elongation.
Perhaps one of the more surprising discoveries from the study was the finding that the same genes involved in DNA repair did not exert a uniform effect across all repeat regions. Specifically, genetic variants that contributed to the stabilization of certain types of DNA repeats were found, counterintuitively, to increase the instability of other, distinct repeat sequences. Further to this, the researchers identified a newly recognized disorder linked to repeat expansion involving the GLS gene. Expansions within this particular gene, which are present in approximately 0.03% of the population, were associated with a remarkable 14-fold heightened risk of developing severe kidney disease and a threefold increased risk of developing liver diseases.
The implications of these findings extend significantly into the realm of future research and clinical applications. The study’s results strongly suggest that the measurement of DNA repeat expansion in blood samples could potentially serve as a valuable biomarker. This biomarker could be instrumental in evaluating the efficacy of novel therapeutic agents designed to curb the growth of repeat sequences in diseases such as Huntington’s. The computational methodologies developed as part of this research are now poised to be applied to other large biobank datasets, facilitating the identification of additional unstable DNA repeats and the associated risks of developing various illnesses.
The researchers emphasize that further detailed mechanistic studies are imperative to fully elucidate the reasons behind the observed phenomenon where identical genetic modifiers can produce opposing effects on different repeat sequences. These future investigations will likely focus on the intricate mechanisms of DNA repair processes, examining how these processes differ across various cell types and within diverse genetic contexts. The recent discovery of a link between GLS repeat expansion and kidney and liver diseases also points towards the possibility that other, as yet unrecognized, repeat expansion disorders may be silently present within existing genetic data repositories.
Dr. Margaux L. A. Hujoel, the lead author of the study and an assistant professor in the Departments of Human Genetics and Computational Medicine at the David Geffen School of Medicine at UCLA, offered a professional perspective on the findings. "We observed that the majority of human genomes harbor repeat elements that undergo expansion as we age," she stated. "The substantial genetic control evident in this expansion process, with some individuals’ repeats expanding at rates four times faster than others, presents significant opportunities for therapeutic intervention. These naturally occurring genetic modifiers provide us with crucial insights into the molecular pathways that could be targeted to effectively slow down repeat expansion in disease states."
The collaborative nature of this research is reflected in the extensive list of contributing scientists and their affiliations, including Margaux L. A. Hujoel (UCLA and Brigham and Women’s Hospital/Harvard Medical School), Robert E. Handsaker (Broad Institute and Harvard Medical School), David Tang (Brigham and Women’s Hospital/Harvard Medical School), Nolan Kamitaki (Brigham and Women’s Hospital/Harvard Medical School), Ronen E. Mukamel (Brigham and Women’s Hospital/Harvard Medical School), Simone Rubinacci (Brigham and Women’s Hospital/Harvard Medical School and Institute for Molecular Medicine Finland), Pier Francesco Palamara (University of Oxford), Steven A. McCarroll (Broad Institute and Harvard Medical School), and Po-Ru Loh (Brigham and Women’s Hospital/Harvard Medical School and Broad Institute). Financial support for this extensive endeavor was provided by several prestigious funding bodies, including US NIH fellowship F32 HL160061, US NIH grant R01 HG006855, US NIH training grants T32 HG002295 and F31 DE034283, US NIH grant K25 HL150334, a Swiss National Science Foundation Postdoc. Mobility fellowship, ERC Starting Grant no. 850869, and US NIH grants R56 HG012698, R01 HG013110, UM1 DA058230, and a Burroughs Wellcome Fund Career Award. The All of Us Research Program itself receives support from the NIH. The authors have formally declared no competing interests in relation to this research.
