From chaos, order: On the nature and measurement of biological aging

title: From chaos, order: On the nature and measurement of biological aging

content_type: article

publication: Nintil

published: 2025-09-27T00:00:00+00:00

source_url: https://nintil.com/measurement-of-aging/

word_count: 16884

What is aging and how to measure it is an everpresent question in the field of aging research. Given the complexity of biology many give up on the task, proclaiming that "we" (either the field or humanity) don't understand aging. I don't. To me, what aging is is clear enough, and we can understand it as a fractal and emergent phenomenon within a system: there's aging of DNA, aging of cells, aging of organs, aging of organisms. To understand aging and measure it we have to be reasonably acquainted with the systems we are studying, to come up with mental models and validate them against published evidence. The understanding required does not have to be complete and I of course do not claim to understand all of biology, but after having spent a long time thinking about the topic, what follows below is what seems most coherent with that evidence. In classic Nintil fashion, I aim to reach as close as possible conclusions that summarize and harmonize all previously published work by anyone who has ever had thoughts on the matter. This doesn't mean I have read everything, at some point one has to declare it sufficient and assume that what has not been read will still cohere with the model here presented. I have in the past written some blogposts about aging that are useful to read before this one: Epigenetic clocks: A review Telomeres: everything you always wanted to know Aging is already solved in vitro. What comes next? Beyond lifespan as a metric in aging research: why reprogramming is promising. Making Cells Young What is aging? The present post is an update of my views on the topic and any discrepancy between this and my previous posts should be resolved in favor of the current one. Below I present different lines of evidence under the different sections. I tried to linearize it somewhat but ultimately the nature of knowledge is that of a web, not of a linear flow so I haven't tried to coerce too much it into what it is not. I recommend reading this to the end and taking it as a whole instead of being fixated by any one specific section. Some sections may seen like they simplify a lot, but if I had to stop to fully explain everything, it would lose focus. Before going into it it is useful for me to say roughly what this is about, and what this is not about: Here I cover extensively methylation clocks results and whether they are a good proxy for aging, but I don't do this because I think they are in any way special; rather I do it because they are thematically relevant. They are used by consumers and in research. I discuss what I mean by 'aging' as it applies to various systems, including inanimate objects like a car or a glass of boiling water I reach slightly beyond methylation into proteomics to speculate about the results that might come into the future and what those might enable. I consider the case of iPSC reprogramming and what that might say about aging and the possibility of its reversal I do not consider in any depth the idea that aging is programmed (ie that we evolved purposefully to age in a group selection way) as I consider that to have been sufficiently argued against by others elsewhere. While at the same time agreeing that while aging is a universal entropic phenomenon, its translation into phenotypes (how cells and their programs react to the damage) is encoded in the genome of each species and in this sense, it is programmed. That is, the 'aging phenotype' of an individual is a function of chronological time, the environment, and its genome. I do not go in depth into the specifics of human aging (e.g. cancer in humans or cardiovascular disease). I focus mostly on cellular aging. I do not claim to have invented much here de novo, and my thoughts on the subject owe a lot to people like Gladyshev and Hayflick (Aging theory in general), Ocampo (Heterochromatin loss), Gorbunova , Seluanov , and Sinclair (DNA damage and its connection to aging), Kenyon (Genetics of aging), Levin (morphostasis) and Alon (Systems biology), and many others. At the end I have more concrete summary that hopefully is more actionable for specific applications Aging by the numbers For recreational reasons, I did an epigenetic aging test. I sent a small sample of blood to TrueDiagnostic, a company that runs epigenetic clocks for consumers, and got back some results. I got the following results: It's interesting that just from a blood sample one can get my actual chronological age within just 1 year, impressive isn't it! What is going on in the charts above? One says my "telomere age is 17" and another that my "OMICage is 31.82", that my "rate of aging is 0.72" or that my "Gait speed epigenetic biomarker is higher than 95% of people", what does that mean? Is my biological age supposed to be 17? 31.82? Neither? All? Epigenetic clocks are simple (linear, most often) models trained on the methylation profile of a training set (very often cells collected from blood) to predict some outcome of interest. This outcome could be: Chronological age (Perhaps the most common one used, like the OG Horvath or Hannum clocks) Mortality risk (As in GrimAge or GrimAge2) Some composite endpoint of multifunctional health (Like PhenoAge) Here we take a number of parameters like grip strength or VO2max and then we make them into an index (this can all be done independent of chronological age). Then, we can train models that predict grip strength or VO2max from methylation and in turn reconstruct the composite index from a blood sample. This is a better approximation to the question "Relative to a reference population, how functional am I overall?" than using chronological age as predicted variable. This is not a new idea: Whether someone is aging fast or slow relative to some established biomarker can be easily measured and aggregtead and ways to do this have been around for a while: Klemera-Doubal ( 2006 ) as a general way to do this sort of thing and Pace of Aging ( 2015 ) are two examples; the latter is what was used to train the DunedinPACE clock. For Pace of Aging it's things like cholesterol, CRP (inflammation), Hb1ac (diabetes), the waist to hip ratio, forced expiration, BMI, etc. But one could do others: There's something called intrinsic capacity from the WHO that sounds to me like a proxy for "aging": it is a framework comprising a set of faculties (locomotion, cognition, vitality (eg muscle strength), sensory, and psychological (wellbeing)). Intrinsic capacity predicts mortality and declines with age as one'd expect. One could also build an epigenetic clock that predicts that too but no one has yet. Rate of aging (Like DunedinPoAM or DunedinPACE ). This one is particularly interesting and perhaps unique among the clocks beacuse of the way it was built, out of a multi-decade cohort and periodic blood sampling of the same individuals. Here they try to predict not how old you are now but how fast you are getting older; to construct it they also built these composite endpoints as in PhenoAge. In theory this means that if I test 10 years from now, my DunedinPACE score might be exactly the same but my biological age (by some clock) will be only 7.2 years older instead of 10 Their datasets allows this clock to work around an issue the original clocks had: that training on longitudinal datasets could introduce a composition bias, as only the healthiest individuals would be represented at the later timepoints, as the sickest ones would die. Molecular markers like telomere length, inflammatory cytokines (IL-6), or cholesterol etc can all be proxied, in theory, via methylation-based models. Per some clockmakers, this is better than using the raw values of the marker because they can be noisier in the short term (IL-6 in particular, I'm looking at you). This is similar to how HbA1c is a more estable measure of diabetes risk than just measuring blood glucose, which tends to vary a lot. These markers, to some extent, can be influenced by cell type composition . Someone with a greater proportion of naive CD8 T-cells (that have divided less) will present with longer telomeres if one measures telomeres from blood, so this may be a correlate of "lower lifetime exposure to infectious disease" as opposed to "slower aging", but it could also be a correlate of a more efficient immune system, and that might bona fide be a contributor to slower aging. GrimAge2 uses as intermediate markers things like B2M (inflammation-related) or HbA1c (diabetes) as explicit intermediates which skews this clock towards diseases 1 as opposed to cellular aging. [1]. What is cellular aging and what is disease? Good question, more on that later Here, have a map Before continuing, the diagram below will be helpful for you to understand how I'm thinking about my core model of aging. This is written mostly with cellular aging in mind but it broadly generalizes: at its core sits DNA (nuclear and mitochondrial) from which everything else flows, and the code that helps cells interpret DNA (the chromatin state) which is comprised of modifications of DNA (like methylation) and their supporting structure (histone marks). These together condition what tools a cell has available (if a gene is broken the cell can't use it) to do its job and how effectively, efficiently, and quickly they can de deployed to react to its environment. Empirically, it seems to be the case that it is chromatin disorganization that by far dominates in the aging process and hence iPSC reprogramming is able to reverse most of it which is quite convenient. Telomere shortening happens naturally when cells divide but it is also a function of our genetics (we could have been born with active telomerase). Through life, the cell constantly reacts to damage, and the response to this damage creates an epigenetic memory that eventually locks the cells in a state of constant inflammation (Patrick et al. 2024 ), which shifts resources away from the job of the cell (fibroblasts make less collagen, hepatocytes metabolize slower, etc). So indeed if cells were able to repair their DNA damage faster than it gets produced, and were able to keep their telomeres from eroding, they would not meaningfully age. They would still get somatic mutations and that may eventually, absent cell division and natural selection of fit enough cells, end up in death. Induced pluripotent stem cells, which don't seem to age epigenetically in culture, do inded drastically upregulate all DNA damage repair pathways (Liedtke et al. 2015 ). This is existence proof that within our genomes are they are, without any editing, there is enough repair capacity to stave off damage for a long time. One potential wrinkle to this is the case of cells that don't divide; these accumulate age-related misfolded proteins and aggregates that some argue are not digestable even by young cells. Not much research has been done on actually testing it as most research is done on dividing cells. The one paper I am aware of that tested reprogramming in non-dividing cells did see a lowering of one such pigment, lipofuscin (Ivanova et al. 2024 ). The jury is still out on whether epigenetic rejuvenation can comprehensively reverse this too absent cell division but I am optimistic. Causality in biology In biology, everything may potentially cause everything else. Feedback loops abound and disturbing one node enough will eventually disturb the rest. Impair autophagy and you probably get more DNA damage, induce DNA damage and you probably impair autophagy. As a result, interventions targeting one aspect of the biology of aging can potentially ameliorate impairments in the rest, which can lead one to the wrong conclusion that they are all the same and there are no privileged hallmarks. "Impaired proteostasis" is more caused by "epigenetic alterations" than it is the other way around, for example. To counter this idea and to defend a more hierarchical model with things resembling root causes, I want to point at the notion of turnover rates: most things in our body turn over. Proteins are produced and degraded, and the same goes for mRNA and most (but not all) cells. Even fibrosis turns over! That's right: fibrosis exists as a balance of generation of collagen and its degradation by macrophages, with a steady state equilibrium of "there's fibrosis" as opposed to the one where "there's no scar" (Alon, 2023 ). The epigenome turns over as well: methylation marks are constantly being removed and added back. Methylation gets a lot of airtime because it is easier to measure than what argueably is the most important part of the epigenome: histone marks. These also turn over: they are deposited and removed. This may sound puzzling at first: if the cell is able to just put back marks "where it should" (ie to where young cells have them) why isn't it doing it? And the answer is that they do, but not perfectly, hence a constant ratchet towards aging. Then there's the genome. The genome gets mutations and chromosomes get messed up to some extent with aging. But those mutations just a tiny fraction of a gigantic set of information. DNA gets copied with relatively high fidelity but it doesn't turn over much. We should think of the genome not as a code but rather as a vast workshop with miriads of redundant tools the cell uses to do what it does. The epigenome is more aptly thought of as that code: the epigenome is the instruction set that's running on the workshop, determining whether the shop will be making neuron stuff or liver stuff, or aged stuff or young stuff. There is then a hierarchy of turnover rates: DNA: ~almost no turnover informationally, though DNA gets copied (cell division happens every ~20hr) mRNA: 10 hr (Yang et al., 2003 ) to 16.4h (Wang et al. 2022 ) Proteins: From 30 min ( Ornithine decarboxylase ) to crystallins in the lenses (almost no turnover) as well as histones Histones (of particular interest as they support epigenetic modification) bound to chromatin can last months (Mathieson et al. 2018 ; Shmueli et al. 2021 ) Mitochondria: Days to months ( this and Poovathingal et al. 2021 ) though these also come with their own DNA (mtDNA) so in a way they are like DNA Cells: Neutrophils (6-8 hours) to neurons (for life) My claim: The slower the turnover rate of something is, the more it matters for aging . Turnover rate is like gravity in a way: Gravity extends infinitely in all directions, everything pulls on everything, but some objects pull more than others. Slow turnover rate of a biological entity is like its mass. Think of a simple case: If you alter the proteome of a cell a little bit, briefly (you add 1 misfolded protein, other than a prion), what happens next? The cell will go back to equilibrium, degrading the old proteins and making new fresh ones. But if you were to wave that magic wand and set the entire epigenome to an aged state, then for a few hours or days you'd have an aged epigenome and some youthful proteins, but eventually given turnover rates, the aged epigenome would stay and the proteome would then be aged. The heritability of longevity is higher than what we thought in the past (up to 50% per Shenhar et al. 2025 vs 10-30% I reported back then ), and as I discuss in my Longevity FAQ it has been possible to make other species like worms live longer by 10x through manipulating their genome. Given that we already know a lot about the genetics of aging, one could make humans really long lived and slow aging by editing a handful of genes like FOXO3 or CIRBP . Genetic engineering is the easiest way to make something live longer: longevity is encoded in the genome, but for those already born and until we solve bodywide gene editing, rejuvenation of the epigenome is the second best thing we can hope. Environmental vs intrinsic aging This fact, that given a normal environment for a species, its lifespan is determined mostly by genetics (compare say a mouse vs a bowhead whale) is phrased by some (like Cynthia Kenyon) as aging being programmed. This is a different use of the term from those that argue that "there's an aging program that purposefully drives the organism towards aging". The phrase "normal environment for a species" is doing some work there. For many years cardiovascular disease has been a key component of human aging. But suppose that statins, PCSK9 inhibitors and other such interventions manage to make CVD a rare disease. Humans now live longer. Suppose that we also manage to have perfect cancer treatements and we also cure cancer. Now humans live 10 years longer. In that case we would be leaving a central part of what aging is untouched: the thing that human and worm and mouse aging have in common. That part is cellular aging. Whereas in multicellular organisms new ways of aging arise, for any one single cell, regardless of whether it is yeast or a mammalian cell or a worm cell, they all age in a similar way. Roughly speaking, the main thing that goes on in cellular aging is: DNA damage happens (In the nucleus and mitochondria) That leads to DNA mutations (which can later on become cancer) and epigenetic/chromatin alterations Cells have to decide whether to spend energy maintaining their chromatin or fixing their DNA damage. Indeed some of the proteins that help upkeep chromatin are also involved in repair (like sirtuins). This loss of chromatin stability means that regions that should be open are closed and those that should be close are open. The cell becomes less functional and resilient over time. Eventually an insult to the cell is stronger than its declining resiliency and dies. Besides DNA damage, telomere shortening is the other variable I'd look at as a driver of aging, though we know, as will be discussed later, that one can fix telomere shortening and one still ages similarly. Later in the post I will argue that all that has ever been done to extend the lifespan of organisms, from caloric restriction to reprogramming ultimately boils down to mostly fixing DNA damage or reducing its production. Clocks and their weights: DunedinPACE I found DunedinPACE quite intriguing beacuse it predicts a rate of aging, not an absolute number, and it is popular enough that there are people out there taking the test and publishing their own results eg here . This below is from the DunedinPACE paper showing the correlation of the marker with age for their training set (The Dunedin cohort). There is some correlation but not that as strong as clocks that predict chronological age. This makes sense because DunedinPACE was built to predict how healthy someone is and that gets worse exponentially with chronological age, and so the rate of decline has to be greater later in life. Now if you take this and apply it to two totally different datasets (The Normative Aging Study and the Framingham Offspring Cohort) one can ask the question: does a higher score in DunedinPACE predict higher mortality and morbidity? And the answer is yes it does: Framingham analysis included n = 2471 members of the Offspring cohort (54% women) with average age of 66 years (SD = 9) at DNA methylation measurement during 2005–2008. Over follow-up through 2018, 23% died, 13% were newly diagnosed with cardiovascular disease (CVD), and 6% had a first stroke or transient ischemic attack (TIA). Disability follow-up was conducted through 2015 based on participant reports of limitations to activities of daily living (ADLs) on the Nagi, Katz, and RosowBrelsau scales. Participants with faster DunedinPACE at baseline were at increased risk for CVD, stroke/TIA, and mortality (CVD HR = 1.39 [1.26–1.54]; stroke/TIA HR = 1.37 [1.19–1.58]; mortality HR = 1.65 95% CI [1.51–1.79], Figure 4B). They were also more likely to develop disability (Nagi ADL IRR = 1.40 [1.19–1.65]; Katz ADL IRR = 1.33 [1.16–1.53]; Rosow-Breslau ADL IRR = 1.39 [1.24–1.56]). The average starting age in Normative aging was 77 years old and in Framingham, 66 years old. The effect sizes seem quite different and may have been affected by selection bias: In Normative Aging if you have already survived to be 77 years old you are probably healthier to start with so the clock struggles to tease out among a population of already healthy people whereas in Framingham the effect is clearer (and the sample size bigger). In the supplement, they look at predictive power for mortality or quality of life (disability) after controlling for age . Controlling for smoking is also useful because smoking is an easy way to predict mortality that has little to do with age and we wouldn't want clocks just overindexing on this. This is quite important: there is no value if a clock is just predicting chronological age because we already have access to that. Unsurprisingly, DunedinPACE (and GrimAge, a clock built to predict mortality) do much better than clocks that try to predict just age: One might object to this that while these clocks work on a normal population, they might fail to detect out of distribution interventions. That is: If we were to subject a human (or a mouse) to caloric restriction or rapamycin or an intervention that turns on their FOXO3 (the one gene most associated with human longevity), would this show up in the clocks? Or, for that matter, if we just give people statins (which should lower LDL cholesterol), will that be reflected in the signal? Mechanistically, one can investigate the methylation sites (CpGs) that the clock is picking up and see what these clocks might be picking up. This is rarely done! I pulled the CpGs from their GitHub and slapped the Illumina annotations on top, then did some ChatGPT-assisted plotting. First, ouf of 173 CpGs in the clock, only a handful are quite large, followed by a long tail (Pareto principle at work!) probe Gene Name weight Interpretation/Association cg06570125 — 0.617 ? cg02650017 PHOSPHO1 -0.594 Diabetes, CVD risk cg06500161 ABCG1 0.575 Related to statin use in part (ie being on a statin makes you older in the clock!) cg01554316 GALNT2 0.515 Lipids, metabolism cg26470501 BCL3 -0.462 NK-kB related (inflammation) cg17460386 FAIM3 0.440 B-cell immune aging cg17501210 RPS6KA2 -0.395 MAP/ERK signaling cg17018786 DISP2 0.367 cg18181703 SOCS3 -0.359 Inflammation cg15192750 — 0.354 cg05304729 MNDA 0.305 Inflammation cg13274938 RARA 0.303 Inflammation /immune cg10919522 C14orf43 -0.300 Smoking ? cg27165794 PNMA1 -0.300 Cancer ? cg14702960 — -0.295 cg16924010 — 0.267 cg17439800 — 0.267 Obesity ? cg00574958 CPT1A -0.265 Metabolic syndrome cg01055871 EHD2 -0.232 cg09349128 — -0.227 But are these causal? You can already see that one of the top driving CpGs is known to be driven by statin use. People on statins are plausibly less healthy but the statins are not what is making them sick. So were someone to go off a statin, their DunedinPACE score would show rejuvenation but their mortality risk would increase. This is why it is important to look at how these clocks are constructed and investigate whether the CpGs have reasons to be causal or not. It is also useful, I think, to present stratified mortality curves according to other biomarkers just to illustrate that mortality prediction is not anything magical that only clocks can do. LDL cholesterol [ source ] Glycated hemoglobin (Hb1Ac) [ source ] Walking speed [ source ] Physical strength [ source ; in this plot maximum quadriceps torque is depicted] Can you lift you way to immortality? The search for causal aging markers These examples are illustrative of the way the causality of disease and aging work: aging causes disease, and disease causes mortality and disability. As a result, disease is causally closer to what we observe (people dying and suffering) so if one tries to make predictors of those one will pick up stronger drivers or correlates of that than of the underlying driving factor (aging). And because in the natural population the decline of everything is correlated to an extent, one can use one (grip or leg strength) to predict another (mortality) despite the fact there is a very small direct causal link between having really strong forearms and dying; rather strenght acts there as a proxy for overall health. This is not the case for causal biomarkers like LDL cholesterol which cause the relevant disease (such that modifying it modifies the disease) and we can measure directly. Hence we can measure the downstream consequences of aging very easily and we can predict outcomes of interest reasonably well, but that can all be done without touching the measurement of cellular aging at all. We don't even need these clocks to do that! Suppose you could either know your DunedinPACE score or know your Hb1ac (diabetes risk), LDL/ApoB (CVD risk), your lean mass %, and the usual blood counts and metabolic panels? What would give you a better picture of your health and mortality risk. To me, it is uncontroversially clear that classic validated biomarkers are not only superior predictors but they are also directly actionable compared to epigenetic clocks . If your LDL is high you can take statins. If your DunedinPACE score is high what are you supposed to do? To my knowledge, no analysis has found that methylation clocks are superior to composites of traditional blood based biomarkers to predict mortality and disability. If you are aware of any such analysis, let me know! But if we have these blood-based biomarkers that work so well, what's missing? What's missing is models that estimate aging , instead of those two other downstream things. Sure, but who cares about aging you might ask? We care about its downstream consequences right? If an aging predictor cannot predict mortality better than LDL cholesterol, what would be their use? Good question! One that doesn't get asked often enough. Moreover, is aging even a thing you can measure with a single number? Right now we don't measure "health", we measure diabetes risk and CVD risk separately. One answer could be "for research purposes". If we can slow down the rate of aging permanently in a mouse, then we can know that very quickly whereas waiting for a cohort of mice to die and/or get frail can be expensive and time-consuming. Another use could be for clinical trials, if one shows age slowdown in mice and wants to take that to the clinic, one could use such a biomarker to have an early readout of whether the treatment is working or not and if its not, discontinue and save money on the trial. In the appendix to this essay I discuss proteomics clocks, which I argue are superior to methylation clocks, and superior (and deeper) to the simple baselines of LDL+HbA1c. They do something methylation clocks struggle with: sampling different tissues with specificity; methylation clocks tell us the methylation status of a given cell type (blood cells), but not say how the brain is aging. But there are plasma-based biomarkers of CNS disorders like Alzheimer's: the organs leak information about their state into the blood, and that can be read out with proteomics. So I argue that in the future, paired proteomics clocks (for intantaneous organ-specific function) with methylation clocks (aging of blood cells) will give us a reasonable picture of how a living being is aging much better than what we currently have right now. But now, back to the methylation clocks! We said the issue is that if we train them on endpoints like mortality, we bias them towards disease, not aging. What can we do about that? Doing (aging prediction) by non-doing (aging prediction) There are a couple of options to work around the limitations of the data we have with methylation clocks Correct the mortality/disease signal by broadening or standardizing the population: Try to include other species in the dataset so that it picks up what is common across them and not just in humans (diabetes, etc) Develop clocks for populations with standardized environments and genetics (like lab mice or cells in culture!) to remove the effects of specific diseases from biasing what the clock picks up Leverage EWAS to try to pick up CpGs that are associated with specific diseases With a theory of aging at hand that points to drivers of aging: Hand-pick CpG sites that are posited to be drivers of aging as opposed to letting a model optimize the weights (One man's taste is another's bias). For the first approach there we can look at the CpGs of this other clock (PanTissue, PanMammalian). But this is not great: It doesn't uniquely pick out the genuine slower rate of aging of dwarf mice (thought they did claim this in the preprint and then took it back!), which by all metrics age slower, nor does it pick out the substantial epigenetic rejuvenation of iPSCs. So perhaps adding all species is bad and adds noise. What about clocks just trained on human data, and then applied in vitro? That is more promising: these clocks show that the cells age in culture in a linear fashion (just like humans in the wild), just 60-65x faster. The Pan-tissue clock also seems to work in patients with certain progerias (accelerated aging) And it also performs well for Ercc1- (a DNA repair gene defect) mice (Where genetic and environmental variablity is controlled), at least for some tissues even though this clock was trained on humans not mice. Retrotransposon clocks Another approach to make better clocks is to handpick CpGs. A good candidate for this are CpGs that suppress retrotransposons . Retrotransposons cause DNA damage when they insert themselves in DNA and they get derepressed with aging. One can build a clock that looks at CpGs that are in Retrotransposons exclusively: Intriguingly the authors note Notably, we observed ourRetroelement-Age V2 clock overlapped with 9 CpGs (cg06672696,cg07286682, cg08822136, cg16936289, cg16810279, cg22277154, cg13261390, cg22277154, and cg24251135) used in AdaptAge,CausAge, and DamAge causality-enriched epigenetic clocks recently developed using Mendelian randomization (Ying et al., 2024). This observation suggest some of the signal from our Retroelement-Age V2 may include sites that contribute and/or protect against aging. This works for reprogramming: It also predict aging in vitro: Alas, this clock hasn't been applied to other datasets in eg slow aging mice or mice subjected to caloric restriction. A similar approach , however, has, and that shows the expected effects: For context, these Snell dwarf are a well known case of slow aging: It also seems to match with the idea that bigger dogs age faster (bigger dogs have their retrotransposons derepressed faster): I really like the retrotransposon clocks! They do seem to be neatly tied to DNA damage. With these, I think we have some grounds to believe, given what we know about retrotransposons, that if one were to see a slowdown in this clock, one would have thus slowed down aging. But we could not be maximally confident: If someone developed an epigenetic editor to methylate all these retrotransposons, that could drop the Retrotransposon age to zero, and yet that wouldn't necessarily reset aging as it is not the only source of DNA damage. Using this clock would be valid especially if one's not specifically targeting retrotransposons, but even then I would suggest using a panel of approaches and not just this one. Aging clocks with mouse-only data Other clocks , developed specifically for mice are also able to pick this up, doing it better (greater separation) than what the Retrotransposon-only clock does: And later this one as well, also from the Gladyshev lab. Aging clocks through EWAS More recently, we got this paper from the Gladyshev lab that I really liked trying to build "causal clocks". The idea here is to carefully tease out what CpGs might be associated with pro-aging vs those that are differentially altered with chronological aging as a response to aging. The fist one they call DamAge clock, the second, AdaptAge. DamAge passses the reprogramming test (age drops with reprogramming): It picks out progeroid damage, exposure to cigarette smoke, and exposure to UV light (ie skin samples exposed to the sun vs non exposed) Here the other clocks fail to pick this up, in fact some say UV exposure slow down aging! But no, what they are probably picking up is mechanisms that counteract aging in those cells. DamAge correctly assigns a score similar or greater to these exposed cells. In vitro, the effects are more stark, exposing cells to cigarette smoke shows up cleanly only in this clock: Intriguing. So what is this clock picking up? The weights are here . Doing the same as before, here's some ChatGPT interpretation. First, most of the weights of the model are negative: this means the CpGs picked up are to be interpreted as "the more methylated the gene is (the less expressed), the lower the damage" These are the top 20 CpGs CpGmarker CoefficientTraining UCSC_RefGene_Name Interpretation cg07850154 -33.64986406 RNF180 Also here , involved in proteostasis. The entire gene seems to get less methylated over time. cg02254885 -31.56205228 EFCAB3 cg05463027 -29.7405477 KIF13A cg08529529 -27.93679343 ALOX5AP Also here cg08526814 -24.67192062 DNTTIP2 cg07495704 -23.24721626 cg02867102 -22.53153968 Also in GrimAge, smoking cg04229059 -19.17402516 SLC38A7 cg22652782 -18.1396963 MYBPC2 cg01082242 -18.03942941 HIF1AN Inhibitor of HIF1a (hypoxia inducible factor). cg08593364 -17.42228852 SLC35F2 cg08081725 -16.50151564 NDE1 cg23282585 -16.24094824 HIST1H3G Histone H3.1 (replication-dependent). More cell replication? Picking up cancer? cg09754948 -15.80284482 ATXN2L cg26635214 -15.1806246 HSD3B7 3beta-hydryxosteroid dehydrogenase type 7: bile acid synthesis. cg15063695 -14.61865262 GK2 cg03950166 -14.36951799 FAM108C1 cg01557754 -14.19144026 FGF11 cg02339392 -13.60087486 ZNF187 Associated with CKD cg03520471 -13.57242019 GABRR3 Then here's pooled by gene This looks much better than DunedinPACE: the CpGs and genes picked up are not obviously tied to diabetes and metabolic syndrome! But it's still hard to interpret. Aging clocks with in vitro data: Polycomb Repressive Complex sites One last way I'll discuss of constructing more robust models to track cellular aging is training them in vitro where organismal-level disorders are not relevant. This, as it turns out also has its problems. The EpiTOC/pcgtAge clock (Yang et al. 2016 ) was constructed by handpicking as opposed to running a regression which makes it more robust to allegations that it's picking just a correlation. What they do is pick PRC2 -related promoters that get hypermethylated with age in blood samples and that start with no methylation across a range of fetal tissues, then the score for a given sample is simply the average methylation across those CpGs. This can be used to predict human chronological age, and it also correlates with the number of cell divisions estimated for stem cells in different tissues, which matches also with other clocks that pick up tissues that turn over faster as faster aging. Note that in this picture below each point is a different tissue from a person of a different age. A later model, EpiTOC2 (Teschendorff 2020 ) further expands this methodology, using fewer CpGs. The original Horvath clock has 353 CpGs that are jointly fitted to estimate the chronological age of a sample. This is not great because a) Plausibly some people might have aged faster or slower but we are telling the model that two samples have an age of exactly their chronological age b) Different tissues plausibly have aged faster and slower but we again tell the model to assume that they all have the same age EpiTOC2 instead models each CpG individually, they say that the methylation at a site equals is a function of someone's age, how often stem cells divide per tissue , and how likely is that a cell division will lead to an increase in methylation. Doing this they fit one model per CpG (3 values to fit), then pick the modal value of the stem cell division rates estimated (should be the same across CpGs because they are all replicated jointly) and fit the model again, this time with just 2 values (de novo methylation probability and the ground state methylation). Though this all is done in blood, their model is posited as universal per cell division so then this model fitted in blood can be applied to other tissues. So given say a colon sample, one can measure the actual methylation and knowing the age of the person one can calculate the number of stem cell divisions for a tissue. They find their estimate of this parameters correlates reasonably well with literature estimates of how often stem cells divide per tissue Moqri et al. ( 2024 ) did something similar in concept (picking PRC2-associated sites and averaging their methylation) but without modeling, they just find those PRC2 sites, pick 1000 with the most PRC2 binding and average the methylation of that. This, they explicitly note, is what explains ELOVL2 (discussed later): It is one of those PRC2 associated sites! Exciting, isn't it! But: Is this causal? If we were able to revert this clock while leaving everything else unchanged, would we revert aging? I don't think so! I looked into why is it that cells keep adding methylation at these sites and the reason seems to be that initially those genes are silenced during development when cells are comitted to a fate, by depositing H3K27me3 (a histone mark) on it. But during cell division, this gets diluted and to compensate, the cell methylates those sites to continue to repress them (Yang et al. 2023 ). This means that these PRC2 methylation sites are non causal! They are an adaptive response. The implication of this is that if we were to remove them without adding back H3K27me3 we would make the cells younger "by the clock" but make them less functional. Conversely, this means it is also possible to make the cells genuinely younger while seeing no change in an epigenetic clock that heavily indexes on these sites. It also means that the sometimes reported age acceleration in cancer might just litearlly be a proxy for their higher rate of cell divisions (Rozenblit et al. 2022 ). Everything is connected: epigenetic clocks are not unique, but they are convenient The state of a cell is the aggregate of everything that's happening in the cell: how many mRNAs of each type there are, concentrations of proteins, what chromatin is closed, etc. But it's all connected: One influences the rest. As a result, if one measures deep enough a cell, one can reconstruct the rest of the state from that subset . From the genome and chromatin one can predict gene expression, and from gene expression one can predict protein abundance and chromatin accessibility 3 . [3]. This requires the entire transcriptome; the correlation between mRNA of gene X and abundance of protein X is very noisy as this is influenced by post-translational regulation among others. This effects can be addressed by sampling the whole transcriptome which will contain e.g. those genes participating in post-translational regulation . If one believes this then there's the question of "how deep to sample" and "what sorts of things can be sampled". The genome is not that useful in general for aging because cells can mostly work around the damage that the genome is subject to, though one can predict aging, albeit noisily from somatic mutation accumulation (Alexandrov et al. 2015 ; Koch et al. 2025 ), one would not be able to predict say cell type just from the genome whereas this is possible from the other, more dynamic -omes. Single cell RNA seq is something people have tried to make clocks out of (Buckley et al. 2022 ) but generally those approaches rely on pseudobulking cells to reduce the noise of mRNA expression. Bulk RNA clocks work much better. But still they seem to be noisier than methylation clocks or, for that matter, bulk ATAC clocks. Chromatin is more stable than mRNAs and so a better thing to measure to get a consistent readout of state unbiased by transient effects. The full proteome, as of today, is more expensive to measure, and we can't quite fully measure all of it, but if we were able to, it would probably also work fine to build aging clocks out of. That makes methylation (or chromatin, even better) as the most reasonable way to make and use aging clocks. Epigenetic clocks and organismal aging When someone says that doing something changed their epigenetic age what that really means is the same as when someone typically makes a claim about their telomeres having lenghtened. It usually means the epigenetic age of the cells in the blood . That epigenetic age can change for various reasons: Change in cell type composition (which now some clocks try to correct for) Death of older cells "Rejuvenation by depletion (as in senolytics)" Replenishment from younger cells; this would have to come from rejuvenated hematopoietic stem cells in the case of blood. Change in state towards less inflammation. As in most populations older people have more inflammation, a reduction in inflammation would register as reduced aging. This also explains why what the clocks measure seem to increase and then decrease during disease (Poganik et al. 2023 ) Additionally the readout of a clock can be biased by the source population in the training set. If you take say all Americans and build an aging clock on that and then a person that's fairly healthy does it, it is likely that they will show up a bit younger in the clock and that might be because of less inflammation. That said, if someone has a substantial (say >10 years) reduction in their own personal score, I would take that as evidence of robust rejuvenation for cells in their blood . This has been achieved, to my knowledge, only through bone marrow transplantation, where an old person gets bone marrow from a younger donor; in that case the old person ends up with a real legit rejuvenated blood system (Soraas et al. 2019 ) 4 [4]. At Retro Biosciences, where I work, we're trying to do this with iPSC-derived HSC . These people have a younger epigenetic age and do live longer (DeZern et al. 2021 ), but they do not live nearly as long as the drastic reduction in epigenetic age would imply. That is, you can have "20 year old blood" in a "80 years body" and clocks would totally fail to register that the rest of the body is not quite 20. Do neurons age? It seems one of the most stressful things a cell can do is divide: tissues with more cell division age faster. This then leads us to the question: what about tissues where the cells don't divide much like the brain? Do those age very very slowly? If one takes DNA damage as the prime mover in the aging process, one can look at kinds of DNA damage in DNA mutations: we have more access to that kind of data than we have for DNA methylation because people have been extracting tissue samples from all sorts of tissues for cancer research. This data has been analyzed for patterns in mutations and some emerged that seemed to correlate fairly well with chronological aging (Alexandrov et al. 2015 ). The two biggest "clocklike" signatures are SBS1 and SBS5 (Spisak et al. 2024 ), which together represent half of the mutations one will accrue with aging. SBS1 seems to nicely track cell division rates as it is flat in neurons and much accelerated in colon. What is this SBS1 ? It's a signature of mutations from C>T at CpG sites caused by spontaneous deamination. That is, if we have a CpG site, it can either become demethylated, so the CG with the methyl in the C becomes TG. This seems like a good candidate for the "driven by cell division" component of aging, specifically that involving demethylation. Importantly, because it's driven by a mutation, that CpG site is gone and can't be remethylated again! SBS5 might be a proxy for a a cell-replication independent process that aggregates various kinds of damage (Spisak et al. 2025) . This rate is also tissue-dependent, so the rate of damage generation is also probably cell-type dependent. The germline as well as muscle seem to age quite slowly. The former is no surprise, the latter is and I wonder why it might be. We can see here that if we look at absolute magnitude, neurons should also be aging (~225 mutations total vs 300 for lung). Turns out a lot of these mutations are cell-replication independent! Given what we have seen thus far, if we applied a clock to a region of the brain like the cerebellum that's mostly neurons and see if it ages slower: If a good amount of what the clocks pick up is cell division (through PRC2) as a proxy for chronological age then they may underestimate their real aging. At a high level, the cerebellum becomes less functional and shrinks in size (Arleo et al. 2024 ). Epigenetically, the cerebellum does seem to age slower in the usual clocks: It is possible to construct cerebellum-only aging clocks (Wang et al. 2023 ) and if one applies those clocks to other tissues then one observes that it is the other tissues that age faster. Here's the original Horvath clock (Pan-tissue clock) on cerebellums: And Horvath 2018 (Skin&Blood clock) is even flatter, perhaps because skin and blood are tissues where there's higher turnover so this clock is picking up more of that cell-division driven aging signal. When looking at it, they indeed find that compared to other brain tissues, the cerebellum is less methylated at CpG islands, which tend to be PRC2 binding sites, the changes that they do see are smaller and there are fewer that are significant. One model we could try to explore for neurons is that by and large they don't age and that all variation in cognitive abilities can be explained through neuron loss instead of reduced function: just like with naked mole rats (Ruby et al. 2018 ), which don't experience an increase in their mortality rate over time, they still eventually die. As we largely lack neuronal stem cells, once a neuron is lost, it is not replaced, which would then explain the decline in cognitive abilities. But in healthy human beings one doesn't see much cell loss in the hippocampus and enthorrhinal cortex in healthy patients, both areas greatly affected by Alzheimer's in their sample, with rates of neron loss of up to 50% (Price et al., 2001 ). Neurons become senescent over time (Hudson et al. 2024 ) and senescent cells are overall less functional so despite the clocks saying neurons age slower (or not at all), given that cognitive faculties do decline with age, we should take this as evidence that we shouldn't trust the clocks here. Better is worse: desiderata for a gold standard aging predictor If you train a model to predict mortality in humans, it is likely the model will learn to pick up correlates of things like diabetes or heart disease. This is indeed what happens with newer generation aging clocks, which I discussed earlier. This would explain why on some of these clocks things like an infection or losing weight reverts epigenetic aging. An aging predictor ("clock") should take in some data (could be the epigenome, metabolome, etc) about an organism and return a biological aging score. For us to think that the model is measuring cellular aging, the follow would have to be true: Chronologically older individuals, on average, present as biologically older by the clock The clock ticks in vitro as well, though it may tick at a different speed iPSC reprogramming largely resets the clock Gold standard age-slowing interventions like caloric restriction or certain knockouts (growth hormone receptor mice) slown down the ticking of the clock DNA damage should accelerate the clock while the insult is being applied (eg. radiation, chemotherapy) and progerias should have the same effect The score is somewhat predictive of mortality risk and health conditions above and beyond chronological age Ideally, but not necessarily, able to do the above across species and tissues The model should be able to be trained from in vitro data to predict in vivo data and vice versa The model should be examined for causal plausibility; ie it should be known how it works. This need not be causal; eg later I discuss entropy-based clocks where each specific site does not matter, and instead what matter is the higher level noise being measured. It is acceptable if such a model ends up being a worse predictor of disease and mortality than models directly trained to do this, for reasons already argued. Somatic mutation accumulation correlates with aging Earlier I said that DNA damage is what drives a good chunk of aging. Worse DNA damage repair also leads to more somatic mutations and their rate of accumulation (single base-pair mutations per genome) correlates with lifespan . This accumulation seems to be linear in a given species. If one takes those rates for various species and plots their lifespan, one gets a nice curve: Their equation for lifespan is L(years)=3206/mutations; taking the baseline rate for a given species: Lifespan Fold Change=1/Mutations Fold Change . So for a mouse for example if we were to halve DNA mutations we would get a lifespan of ~8 years, roughly a doubling. The same would be true for humans (half mutations=double the lifespan). This is for crypts in colons which should be among the tissues that mutates the most (and hence ages faster (and gets more cancer)): This paper pairs well with Crofts et al. (2023) , a very similar one that does the same thing but with methylation where the result is essentially the same, though they apply it to different tissues (blood and skin) and use a different definition of lifespan (maximum recorded lifespan for an organism). Similar ideas are contained in Bertucci-Richter & Parrott ( 2023 ) or Horvath et al. ( 2024 ) who also ties this in particular to specific rates of methylation in bivalent (PRC2) promoters (promoter with both repressive marks like H3K27me3) and activating marks (like H3K4me3). Bowhead whales are quite based entities; as you would by now expect, their cells are remarkably resilient to radiation through better DNA repair (Firsanov et al. 2024 ). They also run colder which would help explain why it's easier to repair things (less entropy). Their cells divide slower than human's: they take 2x the time to divide and they live about twice as we do. Intriguing isn't it! Does this work in reverse? Do mouse cells take ~40x less time to replicate? Not really! So it's not just just slower cell replication. Can't we just use somatic mutations (in particular the SBS1 and SBS5 signatures) as a clock ? One could try: If you sequence someone's blood cells say, subject them to some intervention for a year and then come back and estimate those signatures, if their DNA damage repair is better, their signature will have ticked up less than controls and from there one could infer slower biological aging. But that wouldn't work for measuring age reversal, as iPSCs retain their mutations but have the rest of the aging phenotype wiped clean. And are dwarf mice or mice on caloric restriction experiencing less mutations? They indeed are. Are humans undergoing chemotherapy (which also leads to accelerated aging ) accumulating more somatic mutations and have a faster ticking clock? Yes to both. Are cells from longer lived species more resilient to DNA damage? You bet! (Hall et al. 1984 ) Somatic mutations and methylation drift are linked because one (the former) causes the latter (Koch et al. 2025 ). Accordingly, their rates of change vary similarly, and are connected to some extent to cell division, hence DNAme age and most of the somatic mutations occurs during development and then stabilize to a stable rate of ccumulation thereafter (Spisak et al. 2024 .) Axolotls! Not every living being seems to age similarly from an epigenetic perspective: axolotls don't age epigenetically after development (Haluza et al. 2024 ): Here, we conduct DNA methylation profiling of axolotl tissues at CpGs associated with ageing across mammalian and amphibian species. We develop axolotl epigenetic clocks at both panand single tissue levels and uncover that axolotls exhibit conserved epigenetic ageing traits during early life but not thereafter, deviating from the established notion of organismal ageing.

From chaos, order: On the nature and measurement of biological aging

Brief

Why it matters

Key details