At ISPOR International 2023, there was a strong presence of technical, statistics-based sessions, including ones relating to methods for network meta-analysis (NMA).1 NMA is a common and widely accepted statistical method used to compare the efficacy and safety of multiple interventions that may not have been directly compared in a randomised trial.2 Standard NMAs assume that trials assessing time-to-event outcomes like progression-free survival (PFS) meet the proportional hazards (PH) assumption, which states that the hazard ratio (the ratio of the hazard rate in the treated versus the control group) is constant over time.3 If trials in a network do not meet this assumption, then the relative treatment effect estimates obtained from the NMA might be biased, and these extrapolated survival estimates might not be accurate. NMA findings are often incorporated into model-based cost-effectiveness analyses (CEAs) for health technology assessment (HTA), where relative treatment effect estimates can be key drivers of quality-adjusted life year (QALY) benefit and, thus, cost-effectiveness. Therefore, minimising any potential biases that could affect these estimates is of the upmost importance from an HTA perspective. Currently, however, there is no clear consensus on the most appropriate way to address a scenario where a trial in a network does not meet the PH assumption.4
Technical Session 128 focused on alternative methodologies for performing NMA in the presence of non-PH (when the PH assumption is violated) for at least one trial in the network.4 The session gave an overview of the alternative methods (summarised in Table 1), along with a detailed, step-by-step model selection guide (Figure 1).4
Table 1. Alternative NMA methods in the presence of non-PH
NMA Method | Overview |
1-step multivariate NMA |
|
2-step multivariate NMA based on traditional survival distributions or fractional polynomials |
|
NMA with RCS for baseline hazard |
|
Restricted mean survival NMA |
|
Abbreviations: HR: hazard ratio; IPD: individual patient data; NMA: network meta-analysis; PH: proportional hazards; RCS: restricted cubic splines; RMST: restricted mean survival time.
There are strengths and limitations to all four of these methods, and the most appropriate method to use depends on the available data; one can refer to Figure 1 for the step-by-step model selection guide and Table 2 for an overview of the criteria used to assess the most appropriate model for each method.4 All methods except the 1-step multivariate NMA are able to fit parametric survival distributions for each arm of each trial as a single step, a key strength of these methods. Goodness of fit statistics, like the deviance information criterion (DIC), can be calculated to assess the fit of the NMA model to the data for only the 1-step multivariate NMA and the NMA with RCS for baseline hazard method, another strength for these approaches.4 A key limitation of the restricted mean survival NMA is that it is not designed to extrapolate relative treatments or facilitate discounting for an economic evaluation, so it is not suggested to use this method for CEA. Lastly, an important overarching limitation is that there is still substantial uncertainty associated with the model selection process, and formal evaluation of this process is still needed.4
Figure 1. Model selection process adapted for NMA
Source: Adapted from Technical Session 128: Alternative Network Meta-Analysis Methods in the Presence of Non-Proportional Hazards. ISPOR International Congress, Boston, Massachusetts, 2023.
Abbreviations: AIC: Akaike information criterion; BIC: Bayesian information criterion; DIC: deviance information criterion; HR: hazard ratio; NMA: network meta-analysis; OS: overall survival; PFS: progression-free survival; PH: proportional hazards; RCT: randomised controlled trial.
Table 2. Overview of model selection criteria for each NMA method
Model selection process | 1-step multivariate NMA | 2-step multivariate NMA | NMA with RCS for baseline hazard | Restricted mean survival NMA |
Assessment of development of the hazard over time using trial-level diagnostics | ☑ | ☑ | ☑ | ☑ |
Selection of arm-level or trial-level models using AIC and visual inspection | ☒ Not applicable since study-level estimates and indirect comparisons are performed simultaneously |
☑ | ☑ | ☑ |
Assessment of NMA model goodness of fit using DIC | ☑ | ☒ DICs across models cannot be compared as data differ |
☑ | ☒ DICs across models cannot be compared as data differ |
Green checkmark indicates selection criterion was applicable and red ’x’ indicates the selection criterion was not applicable.
Abbreviations: AIC: Akaike information criterion; DIC: deviance information criterion; NMA: network meta-analysis; RCS: restricted cubic splines; RMST: restricted mean survival time.
Overall, these alternative NMA methods offer promising solutions in the presence of non-PH. Although multiple alternative NMA methods have been developed over the past few years, these approaches have yet to be taken up widely among researchers and statisticians.4 Hopefully, over time, these methods are tested more rigorously and eventually put into practice more often, as effectively addressing non-PH in NMA will help to more accurately estimate long-term survival outcomes.4
References
If you would like any further information on the themes presented above, please do not hesitate to contact Zarena Jafry, Statistician (LinkedIn), and Naomi van Hest, UK Head of Health Economics and Statistics (LinkedIn). Zarena Jafry and Naomi van Hest are employees at Costello Medical. The views/opinions expressed are their own and do not necessarily reflect those of Costello Medical’s clients/affiliated partners.