Why Your Biological Age Score Might Be Wrong
A new GeroScience analysis argues the machine-learning clocks behind today's biological-age tests optimize for math, not biology — and miss inflammation in the process.
The vial arrives in a tidy box, the instructions promise a number, and the number promises a verdict: you are younger than your birthday, or older, or — most unsettling — somewhere your body has been hiding from you. Consumer biological-age tests have become a fixture of the longevity conversation, sold as a mirror more honest than the calendar. But a 2025 analysis in GeroScience argues that the mirror is warped in a specific, technical way most buyers never hear about: the algorithms behind these scores were built to look smooth on a graph, not to faithfully describe what aging is doing inside a cell.
The paper, by Mei and colleagues, is not a tabloid takedown. It is a careful methodological critique of the machine-learning logic that underpins the most widely used DNA methylation "age clocks" — the kind of model that powers many of the at-home tests now marketed to health-curious consumers. Their central claim is uncomfortable: a clock can be excellent at predicting chronological age and still be a poor instrument for understanding biological aging. The reason is that the math rewards features that move in a clean line from birth toward death, even when the underlying biology does nothing of the sort.
Aging, the authors remind us, is not linear. It has different rhythms in childhood, midlife, and the years after menopause, with abrupt inflection points rather than a tidy slope. When an algorithm is trained to compress that messy reality into a single rising line, it tends to keep the features that cooperate and quietly discard the ones that don't — even if the discarded signals are the biologically interesting ones. The output looks impressive. The interpretability suffers.
The incoherence problem
One of the paper's sharpest observations concerns what the authors call incoherence. In the DNA methylation patterns these clocks read, some sites gain methylation as we age and as disease accumulates; others lose it in the same direction. A biologically faithful model would treat those two groups as opposites. Yet the authors show that major conventional clocks assign positive weights to both kinds of sites, effectively adding signals that should be subtracting from one another. The model still produces a number. It just isn't the number you think it is.
The consequences are not abstract. The same analysis finds that popular clocks are skewed toward leukocyte fractions — heavily influenced by the mix of white blood cells in a sample rather than by aging itself. A score can shift because your immune cell composition shifted that morning, not because anything meaningful changed about your biological trajectory. For a reader weighing whether last quarter's number really reflects last quarter's habits, that is a sobering caveat.
DNA methylation clocks read chemical tags on the genome — but how those readings are combined matters as much as the readings themselves.
What the clocks miss
Perhaps the most striking finding, especially for women navigating the post-menopausal decade, concerns inflammaging — the chronic, low-grade inflammation now considered a hallmark of biological aging and a driver of cardiovascular, metabolic, and cognitive decline. The authors argue that conventional age clocks struggle to detect inflammaging, precisely because the mathematical scaffolding that makes them look accurate also flattens the biological signal that inflammation leaves behind.
When the authors rebuild the models to remove the incoherence — forcing the math to respect the direction biology is actually moving — the picture changes. The corrected models are less skewed toward neutrophils and better at detecting inflammaging. That is not a marketing flourish; it is a methodological proof of concept that the standard approach is leaving important information on the table.
A clock can be excellent at predicting chronological age and still be a poor instrument for understanding biological aging.
How to read your own number
None of this means biological-age testing is meaningless, and the GeroScience authors do not argue that it is. They are clear-eyed about the appeal of a single, legible score, and they sketch a path forward through non-linear machine-learning approaches that may better honor the shape of aging. The point is more measured: today's commercial tests rest on a methodological tradition whose limitations are now being articulated in the peer-reviewed literature, and the informed consumer should know that.
For readers who have already taken a test, the practical reframing is this. A biological-age score is best understood as one signal among many — interesting, not authoritative, and likely sensitive to factors (like the immune cell mix in a given blood draw) that have little to do with how you have been living. A score that drops after a quarter of better sleep and walking is encouraging; a score that jumps after a stressful month is not a diagnosis. Anyone weighing what a result means for their actual health decisions should talk it through with a clinician who can place it alongside the rest of the picture — blood pressure, lipids, inflammatory markers, function, history.
The longevity field is moving quickly, and the candor of this paper is itself a good sign. The same researchers building the next generation of clocks are publicly auditing the last one. That is how a young science matures — not by retreating from the promise of measuring aging, but by being honest about which measurements have earned our trust, and which are still a work in progress.
- The critique is methodological, not commercial. A 2025 GeroScience analysis argues popular DNA methylation clocks optimize for mathematical linearity at the cost of biological meaning.
- Incoherence is the core issue. Major clocks assign positive weights to methylation sites that move in opposite biological directions, muddying what the score represents.
- The numbers can drift for the wrong reasons. Conventional clocks are skewed toward leukocyte fractions, meaning immune cell composition can shift your result.
- Inflammaging is being missed. Standard models struggle to detect chronic low-grade inflammation, one of the most consequential features of aging.
- Treat a score as one signal, not a verdict. Discuss results with a clinician who can interpret them alongside conventional health markers.
- The field is correcting itself. Non-linear and coherence-corrected models are being explored as a more biologically faithful next step.
The behaviors that move biological-age scores — sleep, movement, stress, nutrition — are also the ones with the strongest independent evidence for healthy aging.
Sources
- Misalignment of age clocks. — GeroScience