We registered this review in PROSPERO: CRD42020210645.There were no changes to the protocol.
This systematic review follows the reporting recommendations of the updated Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement[15].
Eligibility criteria
Participants
We only included studies on older type 2 diabetes patients. We operationalized the age criterion as follows:
• ≥80% of the total study population aged ≥65 years.
• subgroup analysis reports on participants aged ≥65 years.
Intervention
The intervention group must be treated with any type of DPP4 inhibitor. Any dose or regimen were eligible. Trials on DPP4 inhibitors not approved in the European Union before 2020 were excluded.
As comparator we accepted any active control, including standard care and no treatment or placebo. In studies of additional DPP4 inhibitor treatment in combined regimens (e.g., metformin), the non-DPP4-treatment must be the same in all groups, so that the groups only differ regarding DPP4 inhibitors.
Outcomes
We prioritized all-cause mortality, overall adverse events, and hypoglycaemia as primary outcomes (critical outcomes in GRADE). Secondary outcomes were hospitalization, discontinuation due to adverse events, falls, fractures, delirium, and pancreatitis (important outcomes).
We anticipated that the effectiveness of DPP4 inhibitors regarding glycaemic control is constant across different age subgroups and consequently would not have a shifting effect on the benefit-risk ratio[13, 16, 17]. Moreover, glycaemic control is a surrogate endpoint. Although it may be important in older patients, a greater reduction in morbidity and mortality may rather result from control of other cardiovascular risk factors than from tight glycaemic control alone. Thus, a benefit for patients cannot be directly assumed when glycaemic targets are met[10].
Types of studies
Only RCTs or subgroup analyses of RCTs on the relevant age group were eligible.
Information sources
First, we screened the title/abstracts of the references of all systematic reviews included in a systematic review previously performed by the research group of one member of our review team[13].
Second, we updated the electronic literature searches of the previous systematic review. For this purpose, we searched MEDLINE, MEDLINE in Process, and Embase (all via Embase) for studies published from 1st December 2015 onwards. We last run the search on 11 December 2020.
In addition, we searched the references lists of all included RCTs and systematic reviews on the same topic.
Search strategy
The search strategy was prepared by an experienced information specialist in collaboration with the clinical experts. The full search is presented in supplement I. The searches were limited to English and German. In addition, we limited the search to articles and reviews and excluded case reports, in vitro studies, and animal experiments. The search strategy included a search filter for RCTs, a search filter for older patients and a generic search filter (modified and with additional specific terms) for adverse events[18-20]. The search strategy was reviewed by a second person using the PRESS checklist and validated by checking whether clearly eligible RCTs already known would have been identified[21].
Selection process
Two reviewers independently screened the titles and abstracts of all records identified by the literature search. Next, full-text articles of potentially relevant reports were retrieved and assessed for compliance with the eligibility criteria by two reviewers independently. Disagreements between reviewers were resolved by discussion until consensus.
Multiple reports of the same RCT were merged, so that each trial is the unit of analysis. Title/Abstract screening of the update search was performed in Rayyan[22].
Data collection process
Descriptive data were extracted by one reviewer and checked by a second reviewer for accuracy. Relevant outcome data were identified by two reviewers independently by marking the section in the relevant report. Subsequently, one reviewer extracted the data and a second reviewer checked the correctness. All disagreements were resolved in discussions until consensus.
Data items
Supplement II lists all items for which we extracted data.
We extracted data on outcomes for the last available follow-up, i.e., the longest observation period.
Study risk of bias assessment
We assessed the risk of bias with the revised Cochrane risk-of-bias tool for RCTs (RoB 2 tool)[23]. The RoB 2 tool provides a framework for assessing the risk of bias in five distinct domains on one particular outcome, i.e. for each outcome separately.
In the first domain we considered if the subgroup consideration raised a problem in the randomization process (e.g., unbalanced confounders) in addition.
One of the three levels of risk of bias was assigned to each domain:
• Low risk of bias
• Some concerns
• High risk of bias
Effect measures
All considered outcomes were dichotomous. We extracted raw data on events and number of participants for each group and calculated relative risks for all outcomes.
Synthesis methods
Statistical synthesis method
We pooled data only if RCTs were sufficiently clinically and methodologically homogenous and the p-value of the statistical test for heterogeneity was >0.05. To describe statistical heterogeneity, we calculated prediction intervals and I-square.
We pooled adverse event data separately for each comparator (placebo, no treatment, active control, standard care).
We performed an inverse variance random effects meta-analysis using the Hartung-Knapp method and the Paule-Mandel heterogeneity variance estimator[24, 25]. For outcomes for which only sparse data were available (event rate <5%, zero event studies, less than four RCTs in meta-analysis), we additionally pooled the results using beta-binomial regression models for sensitivity analysis[26, 27].
We used the R-Package Meta for the meta-analysis and SAS 9.4 for estimating the beta-binomial models[28]. In case of heterogeneity, we synthesized results across RCTs presenting range of effects of the point estimate of the relative risk ratio.
Sensitivity analyses
We performed a sensitivity analysis according to risk of bias. More precisely, we excluded RCTs at high risk of bias.
Reporting bias assessment
We planned to assess publication bias by visual inspection of funnel plots for asymmetry, if at least 10 trials for each outcome would had been available.
Bias in selection of the reported results within one trial is a domain of the RoB 2 tool (see above). For RoB 2 assessment, we compared the list of outcomes reported in the protocols or methods section with the outcomes reported in the published paper.
Certainty of evidence assessment
We rated the certainty of the body of evidence using the GRADE approach (Grading of Recommendations Assessment, Development and Evaluation). In the GRADE system, evidence from RCTs starts as “high-certainty” and the following criteria are applied for downgrading the certainty of evidence by one or two levels[29]:
• Risk of bias
• Imprecision
• Inconsistency
• Indirectness
• Publication bias
The rating of these criteria leads to four levels of the certainty of evidence for each of the prioritized outcomes[30]:
• High-certainty evidence: the review authors have a lot of confidence that the true effect is similar to the estimated effect.
• Moderate-certainty evidence: the review authors believe that the true effect is probably close to the estimated effect.
• Low-certainty evidence: the review authors believe that the true effect might be markedly different from the estimated effect.
• Very low-certainty evidence: the review authors believe that the true effect is probably markedly different from the estimated effect.
One reviewer judged the certainty of the evidence and a second reviewer verified the assessment. Disagreements were resolved by discussion until consensus.
The certainty of evidence and results are presented in the 'Summary of Findings' (SoF) tables[31]. The SoF tables were prepared using GRADEpro GDT[32]. For estimating the absolute effect, we used absolute risks of the comparator group of included RCTs.
To report the findings in consideration of the certainty of evidence, we used the standardized informative statements suggested by the GRADE working group[33].
The certainty of evidence is expressed with the following statements:
• High-certainty: reduces/increases in outcome
• Moderate-certainty: likely/probably reduces/increases in outcome
• Low-certainty: may reduce/increase in outcome
• Very low-certainty: the evidence is uncertain in outcome