Developing "Aging Clocks" Based on Novel Allelic Polymorphisms
Abstract
The aim of this study is to create an "aging clock" that accounts for novel allelic polymorphisms in full-genome sequencing results to determine biological age.
The .vcf files of 21 samples from a study investigating the role of individual genomic variants of brain morphogenesis factors in the development of cognitive impairment were used to examine the data structure. Full genome sequencing (GWAS) results in .vcf format will be used as raw data from our own study. Correlation analysis and machine learning methods will be applied to analyze the data. The following algorithm for data processing is proposed: preprocessing of raw data, comparison of found polymorphisms with known polymorphisms, establishment of correlation between found polymorphisms and known age-associated diseases, assessment of statistical significance of found new polymorphisms with age-associated parameters (using trained models), model selection, and "clock" validation.
In the process of algorithm development, the structure of .vcf files was studied in order to automate the search for new allelic polymorphisms in genes with known involvement in the development of age-associated diseases. Analysis of fullexome sequencing results from 21 samples identified several genomic variants (rs6265, rs4760, rs4758443, etc.) associated with impaired brain development and the occurrence of cognitive impairment and age-associated diseases according to the literature. Preliminary results of the analysis demonstrate that the proposed bioinformatics data processing method can potentially be used to estimate the biological age of the study sample; however, the small sample size does not allow us to establish the accuracy of the proposed analysis method. For this reason, an analysis of 5000 additional samples is planned to optimize the proposed "aging clock" algorithms and evaluate their accuracy.
Analysis of a larger sample of biological samples from patients with ageassociated diseases will make it possible to identify new allelic polymorphisms, which will further improve the accuracy of the "aging clock".
About the Authors
I. V. MironenkoRussian Federation
Moscow
A. L. Primak
Russian Federation
Moscow
V. N. Karagyaur
Russian Federation
Moscow
M. S. Arbatsky
Russian Federation
Moscow
References
1. Johnson AA, English BW, Shokhirev MN, Sinclair DA, Cuellar TL. Human age reversal: Fact or fiction? Aging Cell. 2022 Aug;21(8):e13664. doi: 10.1111/acel.13664. Epub 2022 Jul 2. PMID: 35778957; PMCID: PMC9381899.
2. Buckley, M.T., Sun, E.D., George, B.M. et al. Cell-typespecific aging clocks to quantify aging and rejuvenation in neurogenic regions of the brain. Nat Aging 3, 121–137 (2023). https://doi.org/10.1038/s43587-022-00335-4
3. Higgins-Chen AT, Thrush KL, Wang Y, Minteer CJ, Kuo PL, Wang M, Niimi P, Sturm G, Lin J, Moore AZ, Bandinelli S, Vinkers CH, Vermetten E, Rutten BPF, Geuze E, OkhuijsenPfeifer C, van der Horst MZ, Schreiter S, Gutwinski S, Luykx JJ, Picard M, Ferrucci L, Crimmins EM, Boks MP, Hägg S, Hu-Seliger TT, Levine ME. A computational solution for bolstering reliability of epigenetic clocks: Implications for clinical trials and longitudinal tracking. Nat Aging. 2022 Jul;2(7):644-661. doi: 10.1038/s43587-022-00248-2. Epub 2022 Jul 15. PMID: 36277076; PMCID: PMC9586209.
4. Reed E, Nunez S, Kulp D, Qian J, Reilly MP, Foulkes AS. A guide to genome-wide association analysis and postanalytic interrogation. Stat Med. 2015 Dec 10;34(28):3769- 92. doi: 10.1002/sim.6605. Epub 2015 Sep 6. PMID: 26343929; PMCID: PMC5019244.
5. Chang M, He L, Cai L. An Overview of Genome-Wide Association Studies. Methods Mol Biol. 2018;1754:97-108. doi: 10.1007/978-1-4939-7717-8_6. PMID: 29536439
Review
For citations:
Mironenko I.V., Primak A.L., Karagyaur V.N., Arbatsky M.S. Developing "Aging Clocks" Based on Novel Allelic Polymorphisms. Problems of Geroscience. 2023;(4):237-239. (In Russ.)