Your Blood Panel Knows Your Age: Random Forests Read Biological Age From Standard Labs
A new study trained a machine-learning model on routine blood, urine and saliva tests from 11,554 people — and the labs already in your annual physical may carry more aging signal than the boutique clocks selling it back to you.
The longevity market has spent the better part of a decade convincing the appearance-obsessed that knowing your biological age requires a special kit, a saliva swab in the mail, and a methylation readout from a lab you've never heard of. A new analysis in the Journal of Clinical Laboratory Analysis quietly complicates that pitch. Working with screening data from 11,554 people aged 0 to 95, researchers trained a random forest model on 71 ordinary items — 60 blood tests, 8 urine tests and 2 saliva tests — and recovered a serviceable estimate of chronological age. The implication, for anyone treating their face and physique as a long compounding investment, is striking: the panel your physician already orders may carry more aging signal than the boutique clocks dominating the looksmaxing feed.
- Big sample, ordinary inputs. A random forest trained on 11,554 people's routine screening data estimated age with R² ≈ 0.70.
- You don't need 71 markers. Cutting the panel to 15 well-chosen items barely moved accuracy (R² ≈ 0.69).
- Floors exist. Below ~800 training samples or ~7 inputs, accuracy collapsed (R² under 0.6).
- Menopause leaves a fingerprint. Postmenopausal women tended to read as biologically older than premenopausal women on the same panel.
- It's a research tool, not a verdict. The authors frame 'blood age' as promising for studying aging — not a diagnosis you should act on alone.
What the model actually did
Random forests are the workhorse of unsexy, reliable machine learning: an ensemble of decision trees, each trained on a slice of the data, voting together. The team applied one to screening tests collected between February 2020 and August 2023, then asked it to predict chronological age from the labs alone. With 80% of the dataset used for training and all 71 items including gender in play, the model hit an R² of 0.7010 — meaning it explained roughly 70% of the variance in age across a population spanning infancy to the mid-nineties.
That's not clock-stopping precision on any single individual. It is, however, a respectable showing for inputs that weren't designed for the job. The labs in question — lipid panels, liver enzymes, kidney markers, complete blood counts, urinalysis, salivary measures — were never engineered as an aging clock. They were engineered to flag disease. The fact that they encode this much chronological information as a side effect is the quiet headline.
Routine screening — blood, urine, saliva — wasn't designed as an aging clock. It encodes more age signal than most realize.
Why this matters for the optimization crowd
If you've been pricing epigenetic tests against your skincare budget, the practical takeaway is this: a meaningful slice of what those clocks measure may already be sitting in your patient portal. The study found that pruning the input list from 71 items down to 15 — removing the variables the model leaned on least — only dropped R² from 0.7010 to 0.6937. In other words, a tight, well-chosen subset of common labs carried nearly the entire predictive load. The exotic markers weren't doing much extra work.
That isn't an argument against epigenetic clocks, which measure something genuinely different at the molecular level. It is an argument against treating standard bloodwork as too pedestrian to bother with. For readers tracking glow-up metrics over years, the cheaper, repeatable signal is probably the one you'll actually look at.
The labs were engineered to flag disease. The fact that they encode this much chronological information is the quiet headline.
Where the model breaks
The same analysis mapped the floors. When the training set dropped below roughly 800 people, or when the input list shrank below about 7 items, R² fell under 0.6 — territory where the model's estimate becomes shaky enough that individual readings shouldn't be taken seriously. This is a useful caution against the small-sample, narrow-panel 'age scores' that pop up in consumer apps. Statistical aging models need both breadth of population and breadth of input to behave.
The authors also flagged a biologically intuitive pattern: postmenopausal women tended to be estimated as older than premenopausal women on the same labs. That's consistent with the metabolic and inflammatory shifts of the menopausal transition leaving a measurable trace in routine chemistry — and a reminder that any 'biological age' number is shaped by hormonal context, not just lifestyle.
Menopausal status shifted the model's estimates — a reminder that hormonal context, not just lifestyle, shapes any biological-age readout.
How to think about your own panel
The temptation, reading a study like this, is to immediately ask which 15 markers matter most and start optimizing them. Resist that. The paper didn't publish a consumer-ready shortlist or a take-home formula, and even if it had, a population-trained random forest doesn't translate into personal targets without clinical context. The labs that move the model are statistical features, not necessarily levers you can or should push.
The honest framing for the appearance-and-longevity reader is more modest: routine screening is doing more than catching disease. It's quietly accumulating a record of how your physiology is aging, and the math to read that record exists. If you're already getting annual labs, save them. Trend them over years. Bring them to a clinician who treats them as a longitudinal story rather than a one-shot pass/fail. That's the version of biological-age tracking the evidence currently supports.
The deeper story here isn't that random forests beat epigenetic clocks. It's that the infrastructure for tracking biological aging at population scale may already be deployed — in every primary-care office, every annual physical, every patient portal that lets you download a PDF of your labs. The looksmaxing instinct to buy the newest, shiniest test is understandable. The unsexier move — keep your labs, trend them, and let better models catch up to the data you already own — is probably the one that ages well.
Sources
- Age Estimation From Blood Test Results Using a Random Forest Model. — Journal of clinical laboratory analysis