longevity researchVO2 max and mortality: the Mandsager study, deconstructed
A close read of the Mandsager 2018 JAMA study showing a 5-fold mortality gap between low and elite cardiorespiratory fitness — what it means for longevity training.
In October 2018, Kyle Mandsager and colleagues at the Cleveland Clinic published an analysis in JAMA Network Open that should have changed the longevity conversation more than it did. They followed 122,007 patients who had undergone graded exercise testing between 1991 and 2014, for a median 8.4 years. The hazard ratio for all-cause mortality between the lowest fitness category and the elite fitness category was 5.04.
To put that number in perspective: the hazard ratio between current smokers and non-smokers in the same dataset was roughly 1.41. The mortality gap between low and elite cardiorespiratory fitness was larger than the gap between smoking and not smoking, between having end-stage kidney disease and not having it, and between being diabetic and not being diabetic. That is what 122,007 graded exercise tests, run with a hard mortality endpoint, produced.
This article is a close read of that study — what it showed, what it did not show, and what it actually implies for training prescription.
What Mandsager 2018 actually measured
The study was retrospective but used a high-quality data source: every adult patient referred for a symptom-limited treadmill test at the Cleveland Clinic over 23 years. VO2 max was estimated from peak treadmill performance using the standard Bruce or modified Bruce protocol, expressed in metabolic equivalents (METs). Patients were stratified into five fitness categories by age- and sex-adjusted percentile:
- Low: bottom 25 percent
- Below average: 25-49 percent
- Above average: 50-74 percent
- High: 75-97.6 percent
- Elite: top 2.3 percent
The primary endpoint was all-cause mortality. The analysis adjusted for age, sex, body mass index, smoking, diabetes, hypertension, hyperlipidemia, end-stage renal disease, cardiovascular disease, statin use, beta-blocker use, and aspirin use. After adjustment, cardiorespiratory fitness remained the strongest single predictor of death across the cohort.
The continuous nature of the dose-response was the second important finding. Each 1 MET increase in fitness was associated with roughly 12 percent lower all-cause mortality. There was no plateau at the high end. Elite-level fitness carried the lowest mortality risk, and the curve did not turn upward — directly contradicting earlier "reverse J-curve" hypotheses suggesting extreme exercise might increase mortality. See the VO2 max longevity predictor breakdown for the broader mechanistic context.
How the numbers compare to other risk factors
The most striking comparison in the Mandsager paper is the side-by-side hazard ratios. Adjusted hazard ratios for all-cause mortality were:
- Low vs. elite fitness: 5.04
- End-stage renal disease vs. not: 2.97
- Current smoker vs. never: 1.41
- Diabetes vs. not: 1.40
- Coronary artery disease vs. not: 1.29
- Hypertension vs. not: 1.16
Read that list twice. The mortality risk associated with low cardiorespiratory fitness exceeded every other clinical variable measured, including end-stage kidney disease. This is why VO2 max has become the dominant single marker in the longevity panel discussion. The longevity biomarkers panel places it at the top of the priority list for exactly this reason.
The mortality difference between low fitness and elite fitness is larger than the difference between smokers and non-smokers. That is not a metaphor. It is what the data shows.
Causation vs. correlation
The honest read of an observational dataset, however large, requires confronting the selection question. People who can complete a symptom-limited treadmill test are healthier than the general population. People with elite fitness are different in ways the model may not fully capture. Reverse causation is plausible: subclinical disease may both reduce exercise capacity and raise mortality, producing an association without fitness being causally protective.
Several lines of evidence push back on a pure-selection interpretation:
The dose-response is continuous and steep across the fitness range, not just at the extremes. Moving from below-average to above-average fitness — a change achievable in 6-12 months of structured training in most adults — was associated with a hazard ratio improvement of roughly 1.41. That is not consistent with selection alone.
Randomized trials of structured aerobic training in middle-aged and older adults show measurable VO2 max improvements of 10-20 percent over 6-12 months and corresponding improvements in surrogate markers (insulin sensitivity, endothelial function, blood pressure, HDL-C). The biological pathway from training to fitness to risk reduction is well characterized.
Twin and Mendelian randomization studies suggest a meaningful causal component to physical activity effects on cardiovascular and metabolic outcomes, beyond what selection alone would explain.
The cautious read is that VO2 max captures a combination of intrinsic biological reserve and modifiable training adaptation. Both contribute. The training-driven component appears to be substantial and recoverable for most adults.
How much improvement is realistic
Stephen Kodama's 2009 JAMA meta-analysis of 33 prospective studies established that each 1 MET improvement in cardiorespiratory fitness was associated with 13 percent lower all-cause mortality and 15 percent lower cardiovascular mortality. The Mandsager dataset extended that finding to a larger and more diverse population with consistent results.
For most untrained or moderately trained adults, a structured 8-12 week aerobic program produces 10-15 percent VO2 max improvement, which corresponds to roughly 1-2 METs depending on starting fitness. That is enough to shift one fitness quintile, which in the Mandsager data corresponds to a hazard ratio reduction in the range of 0.6-0.8.
The most evidence-backed single protocol for VO2 max improvement is Norwegian 4x4 interval training. Jan Helgerud's 2007 randomized trial compared four training methods and found that 4 rounds of 4 minutes at 90-95 percent HRmax with 3-minute active recovery, performed 3 times weekly, produced significantly larger VO2 max gains than longer continuous training at the same total work volume. The protocol generalizes across endurance disciplines. See the Norwegian 4x4 protocol guide for the prescription detail and the VO2 max estimator tool for a sub-maximal baseline.
What the study did not show
Worth being clear about the limits.
Mandsager is observational. The hazard ratios are associations, not proven causal effects. The training trials that establish causation use surrogate endpoints, not mortality.
The "elite" group was small — 2.3 percent of the cohort — and a fraction of that group may carry genetic protection that also drives fitness, which would inflate the apparent benefit.
The cohort was a clinical referral population, not a general population sample. People referred for treadmill testing typically had a clinical indication, which may shift the underlying risk distribution.
The study does not tell us the optimal training prescription. It tells us where to aim, not how to get there.
None of these caveats are large enough to dismiss the finding. The signal is too consistent across studies and the effect size too large. But they should temper any claim that raising VO2 max by 5 ml/kg/min "will" reduce mortality by a specific percentage. The honest framing is: it almost certainly helps, the magnitude is probably large, and the cost-benefit ratio relative to any other intervention is hard to beat.
Protocol
- Measure baseline. Graded exercise test in a clinical lab if accessible. Otherwise a validated sub-maximal estimate via the Cooper test or Rockport walk. Wearable estimates carry 10-15 percent error.
- Identify your age-adjusted percentile. Norms tables are widely available. Aim to reach at least the 75th percentile for your age and sex.
- Run Norwegian 4x4 twice weekly. 4 rounds of 4 minutes at 85-95 percent HRmax, 3 minutes active recovery between, plus a 10-minute warmup and 5-minute cooldown.
- Add 2-3 hours of Zone 2 weekly. 60-70 percent HRmax, conversational pace. Zone 2 builds mitochondrial density and aerobic base without the interference cost of additional high-intensity work.
- Retest every 6-12 months. VO2 max changes slowly. Monthly retesting captures noise, not signal.
- Track concurrent fitness markers. Resting heart rate, HRV trends, and recovery between hard sessions all improve alongside VO2 max in a productive program. See the recovery stack protocol.
- Sustain it. VO2 max gains regress in 4-8 weeks of detraining. The mortality benefit is contingent on maintained fitness, not on a one-time peak.
The Mandsager study did not prove that raising VO2 max will extend any individual's life by any specific number of years. It strongly suggested that few other modifiable factors compete with cardiorespiratory fitness as a target for mortality risk reduction. Eight years later, no large study has overturned that conclusion.