Longevity risk represents a structural source of uncertainty for modern pension systems and longevity-linked financial arrangements. Under persistent mortality improvements, period life expectancy evaluated at retirement age systematically understates the effective lifetime of real cohorts, generating a wedge between period and cohort measures. The magnitude and dynamics of this Life Expectancy Gap (LEG) depend critically on how future mortality rates are projected, since cohort life expectancy aggregates mortality along the diagonal of the Lexis surface and is therefore inherently forward-looking.
This paper investigates how model uncertainty and forecast combination design affect the measurement of cohort-based longevity indicators and the implied intergenerational transfers embedded in pension rules. Rather than proposing a new structural mortality specification, we focus on the aggregation layer within the class of generalized age–period–cohort (GAPC) mortality models. We implement a horizon-specific stacked regression ensemble (SRE), where combination weights are estimated via blocked time-series cross-validation and allowed to vary across forecasting horizons. This approach explicitly targets multi-step predictive risk and accounts for the instability of model performance across horizons.
To interpret the sources of predictive gains, we introduce a cooperative game–theoretic attribution based on the Shapley value. For each forecasting horizon, Shapley values provide an axiomatically grounded decomposition of out-of-sample risk reduction relative to an equal-weight benchmark, linking ensemble weights to the marginal informational contribution of each mortality specification.
Using Italian mortality data from the Human Mortality Database for ages 60–100 over 1960–2022, we construct extended mortality surfaces to compute period and cohort life expectancy at retirement age and the associated longevity gap. Results show that while a positive cohort–period wedge is robust across aggregation schemes, its magnitude and long-run dynamics differ materially between horizon-adaptive stacking and static forecast averaging, implying non-trivial consequences for the measurement of intergenerational transfers and actuarial fairness in pension systems.
