Priority domains, aims, and testable hypotheses for implementation research: Protocol for a scoping review and evidence map

doi:10.21203/rs.3.rs-17280/v2

Download PDF

Protocol

Priority domains, aims, and testable hypotheses for implementation research: Protocol for a scoping review and evidence map

https://doi.org/10.21203/rs.3.rs-17280/v2

This work is licensed under a CC BY 4.0 License

Journal Publication

published 03 Dec, 2020

Read the published version in Systematic Reviews →

You are reading this older preprint version

Read the latest preprint version →

Background: The challenge of implementing evidence-based innovations within practice settings is a significant public health issue the field of implementation research (IR) is focused on addressing. Significant amounts of funding, time, and effort have been invested in IR to date, yet there remains significant room for advancement, especially regarding IR’s development of scientific theories as defined by the National Academy of Sciences (i.e., a comprehensive explanation of the relationship between variables that is supported by a vast body of evidence). Research priority setting (i.e., promoting consensus about areas where research effort will have wide benefits to society) is a key approach to helping accelerate research advancements. Thus, building upon existing IR, general principles of data reduction, and a general framework for moderated mediation, this article identifies four priority domains, three priority aims, and four testable hypotheses for IR, which we organize in the priority aims and testable hypotheses (PATH) diagram..

Methods: The objective of this scoping review is to map the extent to which IR has examined the identified PATH priorities to date. Because Implementation Science is the leading journal for publishing IR and receives over 800 submissions annually, our sample will include IR (specifically, original research articles and short reports) published in Implementation Science between its inception in 2006 and December 2019. The protocol for the current scoping review and evidence map has been developed in accordance with the approach developed by Arksey & O’Malley and advanced by Levac, Colquhoun, and O’Brien. Because scoping reviews seek to provide an overview of the identified evidence base rather than synthesize findings from across studies, we plan to use our data-charting form to provide a descriptive overview of implementation research to-date and summarize the research via one or more summary tables. We will use the PATH diagram to organize a map of the evidence to date.

Discussion: This scoping review and evidence map is intended to help accelerate IR focused on suggested priority aims and testable hypotheses, which in turn will accelerate IR’s development of National Academy of Sciences-defined scientific theories and, subsequently, improvements in public health.

Systematic review registration: Open Science Framework: https://osf.io/3vhuj/

Other Public Policy

Implementation science

implementation research

priority setting

The persistence of unacceptably low rates of translating research findings into practice has led to increasing attention to implementation research (IR) as a means to significantly accelerate improvements in public health.^1,2P Over a decade ago, Eccles and Mittman (2006) defined IR as “the scientific study of methods to promote the systematic uptake of research findings and other evidence-based practices into routine practice, and, hence, to improve the quality and effectiveness of health services and care.”³ Similarly, the National Institutes of Health has consistently defined IR as “the scientific study of the use of strategies to adopt and integrate evidence-based health interventions into clinical and community settings in order to improve patient outcomes and benefit population health.”^4,5 Considering the significant amounts of funding, time, and effort invested in IR, it would be ideal if the field of IR had developed one or more scientific theories as defined by National Academy of Sciences (i.e., a comprehensive explanation of the relationship between variables that is supported by a vast body of evidence).⁶ Although the field of IR has developed various theories, models, and frameworks to support IR, these theories, models, and frameworks have some limitations.^7,8 First, they tend to be narrow in scope, focusing on one area of implementation research (e.g., evaluation, implementation determinants, implementation processes), instead of comprehensive explanations of phenomena. Second, although there are many models and frameworks, there are few theories (comprehensive explanations of relationships between variables) and, to our knowledge, no IR theories are supported by vast bodies of evidence the way prominent theories in other fields are (e.g., Theory of Planned Behavior, which has been widely applied across fields to predict human social behavior and has a vast body of evidence, including meta-analyses, assessing the predictive validity of its theoretical propositions).^9,10 Given the limitations of IR theories, efforts to accelerate the development of theories that meet National Academy of Sciences standards are warranted.

Research priority setting (i.e., promoting consensus about areas where research effort will have wide benefits to society) is one approach to accelerating research advancements.¹¹ The Priority Aims and Testable Hypotheses for IR (PATH4IR) Project seeks to accelerate IR on several of IR’s priority domains, aims, and testable hypotheses via estimating the extent to which IR to date has examined these priority areas. Helping accelerate IR on these priorities should accelerate IR’s development of National Academy of Sciences-defined scientific theories, which in turn will help accelerate improvements in public health. Below we identify, describe, and justify four priority domains, three priority aims, and four priority testable hypotheses for IR, which are the focus of the PATH4IR Project and its scoping review.

Four priority domains for implementation research

A plethora of IR theories, models, and frameworks have identified numerous IR domains.^7,12 Table 1 lists the domains of three IR models/frameworks that have guided much IR to date. Building on these IR models/frameworks,^13-15 other IR,¹⁶ principles of data reduction,¹⁷ and a general framework for moderated mediation,¹⁸ the PATH4IR Project identified four priority domains for IR. Each priority domain is defined below and in Table 2.

Table 1. Domains included in several existing implementation research models/frameworks

Implementation Research Model/Framework	List of Domains
Proctor et al. (2009) – A Conceptual Model of Implementation Research¹⁵	Intervention Strategies; Implementation Strategies; Outcomes
Damschroder et al (2009) – The Consolidated Framework for Implementation Research¹⁴	Intervention Characteristics; Outer Setting; Inner Setting; Characteristics of the Individuals Involved; Process of Implementation
Aarons et al. (2011) – Conceptual Model of Evidence-Based Practice Implementation in Public Service Sectors¹³	Outer Context; Inner Context; Innovation Characteristics & Intervention Developers; Innovation/System Fit; Innovation/Organization Fit; Interconnections

Table 2. The priority domains for implementation research

Priority Domain (acronym)	Brief Description	Justification
Implementation Strategies (IS)	Strategies used to put into practice a program of known dimensions (e.g., EBP)	^15,19
Evidence-Based Measures of Implementation (EBMI)	A measure shown to be predictive of improvement in one or more key HHROs (e.g., client outcomes)	²¹
Health and Health-Related Outcomes (HHRO)	End-points regarding evidence-based process of care, client/patient outcomes, or population outcomes	^{15, 22}
Context-Related Moderators/Mediators (CRMM)	Measures of the outer setting/context or inner setting/context that are hypothesized to moderate and/or mediate relationships between the other domains (i.e., IS, HHRO, EBMI)	^{13, 14}

Note: IS = Implementation Strategies; EBMI = Evidence-Based Measure of Implementation; HHRO = Health and Health-Related Outcomes; CRRM = Context-Related Moderators/Mediators; EBP = Evidence-Based Practice;

Implementation strategies. Implementation strategies are defined as the strategies used to put into practice a program of known dimensions (e.g., an evidence-based practice [EBP]).^15,19 Given how IR has been defined and that implementation strategies are the quintessential independent variable in IR,^3-5 we consider the implementation strategy (IS) domain a priority for IR.

Evidence-based measures of implementation. If implementation strategies are the quintessential independent variable of IR, implementation outcomes have become the quintessential dependent variable. However, consistent with the important distinction demonstrated between a practice and an EBP,²⁰ an important distinction has been demonstrated between an implementation outcome and an evidence-based measure of implementation (EBMI).²¹ An implementation outcome is defined as “the effects of deliberate and purposive actions to implement new treatments, practices, and services,”¹⁶ whereas an EBMI is defined as “an implementation outcome measure that is predictive of improvements in key client outcomes” (i.e., health and health-related outcomes [HHROs], such as client functioning, health-related quality of life, or morbidity/mortality).²¹ This means that while all EBMIs are implementation outcomes, not all implementation outcomes are EBMIs. IR has historically prioritized implementation outcomes, but as noted by Proctor and colleagues (2009), implementation outcomes should not be treated as dependent variables until we have advanced them as consistent, valid, and efficient measures of implementation.¹⁶ Otherwise, we rely on the assumption that implementation outcomes are predictive of HHROs, without empirically demonstrating this to be true. To our knowledge, the PATH4IR Project is the first to explicitly identify EBMIs as a priority domain for IR.

Health & health-related outcomes. Health outcomes (e.g., client/patient functioning) and health-related outcomes (e.g., health-related quality of life, quality adjusted life years) are the outcomes that IR seeks to ultimately improve. Despite this, not all outcome-focused IR models/frameworks explicitly include the HHRO domain.^13,14 Instead, many focus on implementation outcomes, leaving out HHROs entirely. We identify HHROs as a priority domain for IR for two reasons. First, as noted above, until EBMIs are established, measuring only implementation outcomes relies on the assumption that implementation outcomes are predictive of HHROs. Second, as noted by Foy et al. (2015), “If studies evaluating the effects of implementation interventions are to be of relevance to policy and practice, they should have end-points related to evidence-based processes of care.”²²

Context-related moderators/mediators. Moderation occurs when the effect of an independent measure on a dependent variable depends on the level of another measure and mediation occurs when the effect of an independent variable on a dependent measure is transmitted through a third variable.²³ Given that existing IR models/frameworks have highlighted the importance of context^13,14,24 and that Edwards and Lambert’s (2007) general framework for moderated mediation¹⁸ guided identification of the priority domains for this project, context-related moderators/ mediators (CRMMs) was identified as a priority domain for IR. Conceptualizing context as potential moderators/mediators (instead of just discrete factors that “influence” implementation) moves the field of IR towards National Academy of Sciences-consistent theory as it starts to clarify relationships between constructs.

Three priority aims for implementation research

There are numerous aims (i.e., research questions) that IR could address, and there is value in establishing consensus regarding the types of aims IR should prioritize. Relative to IR’s domains, IR’s aims have received less explicit attention. The work of Curran et al. (2012)²⁵ is one exception. Specifically, for their type 3 effectiveness-implementation research categorization, Curran et al. recommended that the primary aim of this research category was to “determine utility of an implementation intervention/strategy” and the secondary aim was to “assess clinical outcomes” ). associated with implementation trial.”²⁵ Curran et al. also recommended implementation outcomes (e.g., adoption, fidelity) as dependent measures for the primary aim, with client outcomes (e.g., patient symptoms patient functioning) as dependent measures for the secondary aim.²⁵ However, priority aims have not generally been explicitly addressed by most other IR models/frameworks.^13-15 Given that developing or contributing to generalizable knowledge is central to how research is defined,²⁶ it is important that IR prioritize aims that seek to develop or contribute to generalizable knowledge for its priority relationships. Thus, building from the four priority domains described above, we identified the following three priority aims for IR: (1) the IS to HHRO relationship (i.e., IS à HHRO), (2) the IS to EBMI relationship (i.e., IS à EBMI), and (3) the EBMI to HHRO (i.e., EBMI à HHRO). Consistent with the mediational analysis literature,^27-30 we have termed IR focused on the IS à HHRO relationship as Path C IR (the red triangle of Figure 1), IR focused on the IS à EBMI relationship as Path A IR (the blue triangle of Figure 1), and IR focused on the EBMI à HHRO relationship as Path B IR (the green triangle of Figure 1). Each priority aim is defined below and in Table 3.

Table 3. The priority aims for implementation research

Priority Aim

Type

Advance generalizable knowledge regarding the

IS à HHRO relationship

Path C

implementation research

Advance generalizable knowledge regarding the

IS à EBMI relationship

Path A

implementation research

Advance generalizable knowledge regarding the

EBMI à HHRO relationship

Path B

implementation research

Note: IS = Implementation Strategies; HHRO = Health and Health-Related Outcomes;
EBMI = Evidence-Based Measures of Implementation.

Advance generalizable knowledge regarding the IS à HHRO relationship. Advancing generalizable knowledge about the relationship between an IS and a HHRO is termed Path C IR. Given IR’s emphasis on strategies to increase the uptake of EBPs to improve patient and population health^3-5 and the importance of measuring outcomes that have relevance to policy and practice,²² Path C IR was identified as a priority aim for IR. An example of Path C IR is a 29-site cluster randomized implementation experiment Garner et al. (2012) conducted between 2008 and 2012 that focused on testing the impact of a pay-for-performance implementation strategy to improve the implementation and effectiveness of the Adolescent Community Reinforcement Approach (A-CRA), which is an EBP for adolescents with substance use disorders.³¹ The dependent variable of interest was for the a primary HHRO, which was adolescent substance use recovery status at 6-month follow-up.

Advance generalizable knowledge regarding the IS à EBMI relationship. Advancing generalizable knowledge about the relationship between an IS and an EBMI is termed Path A IR. Given that an EBMI is a measure of EBP implementation found to be predictive of a key client outcomes²¹ Path A IR was identified as a priority aim for IR. Relative to IR that has tested the impact of an IS on implementation outcomes that do not have evidence of being a meaningful predictor of key client outcomes, IR testing the impact of an IS on EBMIs appears be limited. Having established an EBMI for A-CRA as part of an effectiveness study,^32,33 Garner et al. (2012)³¹ also provide an example of Path A IR. Indeed, examining the impact of pay-for-performance on an EBMI called Target A-CRA (i.e., 10+ of the core the A-CRA components delivered within no less than seven sessions), which prior research found to be significantly associated with greater reductions in adolescents’ days of abstinence at follow-up,³² Garner et al. (2012) found that relative to adolescents in the implementation-as-usual condition, adolescents in the pay-for-performance condition had a significantly higher likelihood of receiving Target A-CRA.³¹

Advance generalizable knowledge regarding the EBMI à HHRO relationship. Advancing generalizable knowledge about the relationship between an EBMI and HHRO is termed Path B IR. Research by Nosek et al. (2015),³⁴ which increased concern regarding the reproducibility of psychological science, underscores why Path B IR is a priority. That is, it is important that significant relationships (e.g., EBMI à HHRO) supported as part of effectiveness research be examined for replicability within IR. As part of their IR experiment, Garner et al. (2012)³¹ provide an example of Path B IR by replicating a significant association between Target A-CRA (i.e., the previously established evidence-based measure of implementation) and adolescent abstinence from substance use at follow-up (i.e., the HHRO).³¹

Four priority testable hypotheses for implementation research

While the possible testable hypotheses for IR are numerous, there is value in establishing consensus regarding the types of testable hypotheses IR should prioritize. Toward helping generate National Academy of Sciences-defined scientific IR, prioritizing one or more of the four testable hypotheses shown in Figure 2 is warranted. More specifically, there is a need to prioritize IR testable hypotheses regarding the extent to which an IS has demonstrated one or more of the following, relative to an appropriate active-control implementation strategy: superior effectiveness (upper left quadrant [ULQ]) and/or cost-effectiveness (upper right quadrant [URQ]), non-inferior effectiveness (lower left quadrant [LLQ]) and/or cost-effectiveness (lower right quadrant [LRQ]). Each priority testable hypothesis is described below and in Table 4.

Table 4. The priority testable hypotheses for implementation research

Priority Testable Hypothesis	Type
Cost-effectiveness hypotheses from a superiority trial	URQ hypotheses
Effectiveness hypotheses from a superiority trial	ULQ hypotheses
Effectiveness hypotheses from a non-inferiority trial	LLQ hypotheses
Cost-effectiveness hypotheses from A non-inferiority trial	LRQ hypotheses

Note: URQ = Upper Right Quadrant; ULQ = Upper Left Quadrant; LLQ = Lower Left Quadrant;
LRQ = Lower Right Quadrant.

Effectiveness hypotheses from a superiority trial. Testing the extent to which an experimental IS has superior effectiveness, relative to an active-control IS, is termed IR testing an upper left quadrant (ULQ) hypothesis. In contrast to research on clinical treatments, where an active-control condition may not exist or be appropriate, IR should include the most appropriate active-control IS possible. One of the most appropriate active-control condition IS may be the IS used as part of an EBPs effectiveness research. To date, the “large and growing evidence base relating to the effectiveness of implementation strategies” noted by Foy et al.²² has tested ULQ hypotheses and supports that this testable hypothesis is and should remain a priority for IR. Indeed, given that tests of ULQ hypotheses may continue to be the most common type of IR hypotheses, it may not be much longer before results of ULQ hypothesis tests are analyzed as part of a meta-analysis.

Cost-effectiveness hypotheses from a superiority trial. Testing the cost-effectiveness of an IS that has been shown to have superior effectiveness, relative to an active-control IS, is termed IR testing an upper right quadrant (URQ) hypothesis. It is considered a priority testable hypothesis for IR as knowing the effectiveness of an intervention/strategy is not sufficient for many potential users, especially decision makers who need to know whether the benefits from the intervention/strategy are commensurate with its costs (i.e., whether it delivers value),^35-38 Further, noting that economic evaluation of implementation strategies “has been neglected,” Foy et al. encouraged IR with an economic evaluation component.²² Building upon Garner et al. (2012),³¹ which found pay-for-performance to be an effective IS for improving the implementation and effectiveness of A-CRA in a superiority trial, Garner et al. (2018)³⁹ provide an example of IR testing an URQ hypothesis. Supporting the cost-effectiveness of a pay-for-performance IS, Garner et al (2018)³⁹ found that although the pay-for-performance strategy led to 5% higher average total costs compared to the implementation-as-usual control condition, this average cost increase of 5% resulted in a 325% increase in the average number of patients who received Target A-CRA (i.e., the EBMI).³⁹

Effectiveness hypotheses from non-inferiority trial. Testing the extent to which an experimental IS has non-inferior effectiveness, relative to an active-control IS, is termed IR testing a lower left quadrant (LLQ) hypothesis. Similar to how Schumi and Wittes (2011)⁴⁰ explain non-inferiority, testing a non-inferiority hypothesis seeks to provide evidence that the IS being tested is “not unacceptably worse” than the IS being used as a control. This is a priority for IR given strategies used to study a clinical intervention’s effectiveness may not be possible in practice settings (e.g., too intensive). We are not aware of IR that has tested LLQ hypotheses. However, a close example is a non-randomized observational IR study by Stirman et al. (2017)⁴¹ that compared two strategies for providing post-workshop consultation in an evidence-based cognitive therapy. As detailed by Stirman et al., results of their study did not support the hypothesis of the group consultation and feedback condition being non-inferior to the gold-standard individual feedback condition.⁴¹

Cost-effectiveness hypotheses from non-inferiority trial. Testing the cost-effectiveness of an IS shown to have non-inferior effectiveness, relative to an active-control IS, is termed IR testing a lower right quadrant (LLQ) hypothesis. Again, given decision makers desire to know the extent to which benefits from an IS are commensurate with its costs,³⁵ LLQ hypotheses were identified as a priority for IR. Although not from the field of IR, an example of testing cost-effectiveness hypotheses from a non-inferiority trial is provided by Bansback et al. (2018),⁴² which extended research by Oviedo-Jockes et al. (2016)⁴³ to support the non-inferiority of injectable hydromorphone hydrochloride (i.e., a narcotic pain reliever) relative to injectable diacetylmorphine hydrochloride (i.e., pharmaceutical heroin).

Objectives

The primary objective of the PATH4IR Project’s scoping review is to advance understanding regarding the extent to which IR to date has examined the four priority domains, three priority aims, and four priority testable hypotheses described above. We hypothesize that IR addressing these priorities will be limited (i.e., represent significant gaps in the extant IR literature). Thus, a secondary objective of this review is to help advance understanding regarding what domains, aims, and testable hypotheses IR has focused on to date.

The scoping review approach developed by Arksey & O’Malley (2005)⁴⁴ and advanced by Levac, Colquhoun, and O’Brien (2010)⁴⁵ guided this scoping review protocol and is therefore organized around five stages: (1) identifying the research question, (2) identifying relevant studies, (3) selecting studies, (4) charting the data, and (5) collating, summarizing, and reporting results. Each stage is described below.

Stage 1: Identifying the research questions

The primary research questions our research team will answer with this scoping review is: To what extent have the four priority domains, three primary aims, and four priority testable hypotheses described above been addressed by IR to date? Via an iterative process, our research team also identified the following secondary research questions: (1) which other domains have been studied by IR to date, (2) which other aims have been studied by IR to date, and (3) which other hypotheses have been examined by IR to date.

Stage 2: Identifying relevant studies

Implementation Science is the leading journal for publishing IR and receives over 800 submissions annually.⁴⁶ As such, this review will focus on IR published in Implementation Science since its inception in 2006. To identify relevant studies, we will search PubMed using the search strategy below and cross-reference the results with a list of publications on the journal’s website:

Search "Implementation science IS"[Journal]

Filters: Publication date to 12/31/2019

“Research articles” and “short reports” published in Implementation Science since its inception through 2019 are eligible. Articles labeled by the journal as “systematic review,” “methodology,” debate,” or “conference proceedings” are not eligible as this review aims to map original IR. “Protocols” were also excluded given that intended analyses do not always align with published results. Research articles and short reports will be excluded if the review team agrees that the paper’s primary objective is more aligned with an excluded article type.

Stage 3: Study selection

Reference information and full texts for all articles published in Implementation Science in 2019 or earlier will be imported into an EndNote database. The articles will be sorted by a reviewer by type to identify all articles labeled by the journal as research articles or short reports. In the subsequent stages, if a reviewer encounters an article deemed ineligible (i.e., labeled by the journal as a research article or short report but is not considered primary IR), the reviewer will raise it with review team so that consensus around an inclusion decision can be reached.

Stage 4: Charting the data

Table 5 provides a list of variables to be included in the project’s data-charting form, which was developed based on discussions by the review team regarding what information should be recorded for each eligible article and a pilot test of the form with five articles. First author, title, publication year, and article type are included as article identifiers. We will extract whether and which IS, EBMI, HHRO, or CRMM was studied, which relationships between these domains were studied (i.e., Path C, A, or B), and whether URQ, ULQ, LLQ, or LRQ hypotheses were tested when studying these relationships to answer our primary question of the extent to which the priority domains, aims, and testable hypotheses have been assessed in IR to date. As a secondary question, we will seek to understand what other domains, aims, and testable hypotheses have been examined by IR to date. For example, we will extract whether studies consider implementation outcomes that are not evidence-based or contextual factors not as moderators or mediators to understand which other domains have been examined and the extent to which they have been examined. We anticipate identifying IR that focused on implementation outcomes rather than EBMIs and therefore will record whether the IS à implementation outcome relationship was assessed. Our form also will include a space to capture other aims and testable hypotheses that IR has examined to date.

Table 5. Data elements.

Variable	Format	Description
Article identifiers
First author	Free text	Last name of the article’s first author
Title	Free text	Title of the article
Publication year	Numerical	Year in which the article was published
Article type	Categorical	Whether the article is labeled as a research article or short report by the journal
Primary question: To what extent have the PATH4IR Project’s priority domains, aims, and testable hypotheses been studied in IR to date?
IS	Dichotomous	Whether the study develops or assesses an IS
	Categorical	If yes, whether the implementation strategies of interest are evaluative and iterative, provide interactive assistance, adapt and tailor to context, develop stakeholder interrelationships, train and educate stakeholders, support clinicians, engage consumers, utilize financial strategies, or change infrastructure
	Free text	If yes, lists the IS of interest
HHRO	Dichotomous	Whether the study assesses an HHRO
HHRO	Free text	If yes, lists the HHRO of interest
EBMI	Dichotomous	Whether the study assesses an EBMI
EBMI	Free text	If yes, lists the EBMI of interest
CRMM	Dichotomous	Whether the study assesses a contextual factor as a moderator or mediator in some relationship
CRMM	Categorical	If yes, whether the contextual factors of interest are related to intervention characteristics (e.g., complexity), outer setting (e.g., external policies and incentives), inner setting (e.g., leadership engagement), individual characteristics (e.g., staff perceptions about the intervention), or the implementation process (e.g., extent of planning ahead of implementation)
Path C	Dichotomous	Whether the study assessed the IS à HHRO relationship
Path A	Dichotomous	Whether the study assessed the IS à EBMI relationship
Path B	Dichotomous	Whether the study assessed the EBMI à HHRO relationship
URQ hypothesis	Dichotomous	Whether the study tested a URQ hypothesis
ULQ hypothesis	Dichotomous	Whether the study tested a ULQ hypothesis
LLQ hypothesis	Dichotomous	Whether the study tested an LLQ hypothesis
LRQ hypothesis	Dichotomous	Whether the study tested an LRQ hypothesis
Secondary question: Which other domains have been studied in IR to date?
Implementation outcome	Dichotomous	Whether the study assesses an implementation outcome that is not yet evidence-based
	Categorical	If yes, whether the implementation outcomes of interest are related to acceptability, adoption, appropriateness, feasibility, fidelity, implementation cost, penetration, or sustainability
	Free text	If yes, lists the contextual factors of interest
Context generally	Dichotomous	Whether the study considers the implementation context without assessing it as a moderator or mediator in some relationship
	Categorical	If yes, whether the contextual factors of interest are related to intervention characteristics (e.g., complexity), outer setting (e.g., external policies and incentives), inner setting (e.g., leadership engagement), individual characteristics (e.g., staff perceptions about the intervention), or the implementation process (e.g., extent of planning ahead of implementation)
	Free text	If yes, lists the contextual factors of interest
Other domain	Free text	Lists domains other than IS, HHRO, EBMI, implementation outcomes, CRMM, or context generally that are studied
Secondary question: Which other aims have been studied in IR to date?
Path A-ish	Dichotomous	Whether the study assessed the IS à implementation outcome relationship
Other aim	Free text	Lists relationships other than Path C, Path A, Path A-ish, and Path B that are studied
Secondary question: Which other hypotheses have been tested in IR to date?
Other hypothesis	Free text	Lists testable hypotheses other than URQ, ULQ, LLQ, LRQ that are studied

Note: URQ = Upper Right Quadrant; ULQ = Upper Left Quadrant; LLQ = Lower Left Quadrant;
LRQ = Lower Right Quadrant.

To ensure validity of the form, data will be extracted by a primary reviewer and confirmed by a secondary reviewer for approximately one-third of the included articles. Any conflicts will be discussed until consensus is reached. Clarifications and additional revisions to the data-charting form based on the types of conflicts that arise will be considered. Once the form is finalized at this stage, data from the remaining articles will be extracted by a single reviewer.

Stage 5: Collating, summarizing, and reporting the results

A PRISMA flow diagram will be used to report results of the scoping review. Additionally, we will present a descriptive overview (including tabular and/or graphical summaries) of the eligible full texts. Because scoping reviews seek to provide an overview of the identified evidence base rather than synthesize findings from across studies, we plan to use our data-charting form to provide a descriptive overview of IR to-date and summarize the research via one or more summary tables (e.g., for each priority aim). Additionally, we will use the PATH diagram (see Figure 3), which integrates the four priority domains, three priority aims, and four priority testable hypotheses, to develop a map of the evidence.

Despite significant amounts of funding, time, and effort, the field of IR has yet to develop scientific theories as defined by the National Academy of Sciences (i.e., a comprehensive explanation of some aspect of nature that is supported by a vast body of evidence). The findings from this project are intended to help accelerate IR focused on one or more of the identified IR priority aims and testable hypotheses, which in turn will accelerate IR’s development of National Academy of Sciences-defined scientific theories and, subsequently, improvements in public health. Our review is limited to English-language articles published in the journal Implementation Science, which is a limitation given that IR can be submitted and published elsewhere and in other languages. However, limiting our review to primary research published in Implementation Science provides an efficient starting place given the research has already been screened and deemed to be relevant to IR. Results of this scoping review will be disseminated via presentations at professional conferences (e.g., Annual Conference on the Science of Dissemination and Implementation in Health, Society on Implementation Research Collaboration), publication in a peer-reviewed journal (e.g., Implementation Science, Implementation Research and Practice, Implementation Science Communications).

A-CRA: adolescent community reinforcement approach; CRMM: context-related moderators/ mediators; EBMI: evidence-based measures of implementation; EBP: evidence-based practice; HHRO: health and health-related outcomes; IR: implementation research; IS: implementation strategy; LLQ: lower left quadrant; LRQ: lower right quadrant; PATH: priority aims and testable hypotheses; ULQ: upper left quadrant; URQ: upper right quadrant;

Ethics approval. Not applicable.

Consent for publication. Not applicable.

Availability of data and material. Not applicable.

Competing interests. None.

Funding. This work was supported by the National Institute on Alcohol Abuse and Alcoholism (R01-AA017625), the National Institute on Drug Abuse (R01-DA038146; R01-DA044051), and RTI International (IRD-0271900.079).

Authors' contributions. BRG conceived of the project and its primary research questions. BRG, MAK, and SVP contributed to the development of the search strategy, eligibility criteria, and data charting form. BRG led the writing of the manuscript. SVP and MAK contributed meaningfully to the drafting and editing of the manuscript. All authors reviewed and approved the final manuscript.

Acknowledgments. The content is solely the responsibility of the author and does not necessarily represent the official views of the National Institutes of Health or RTI International.

Glasgow RE, Vinson C, Chambers D, Khoury MJ, Kaplan RM, Hunter C. National Institutes of Health approaches to dissemination and implementation science: current and future directions. Am J Public Health. 2012;102(7):1274-1281. doi: 10.2105/AJPH.2012.300755
Neta G, Glasgow RE, Carpenter CR, et al. A framework for enhancing the value of research for dissemination and implementation. Am J Public Health. 2015;105(1):49-57. doi: 10.2105/AJPH.2014.302206
Eccles MP, Mittman BS. Welcome to implementation science. Implement Sci. 2006;1. doi: Artn 1, 10.1186/1748-5908-1-1
Department of Health and Human Services. PAR-13-055. https://grants.nih.gov/grants/guide/pa-files/PAR-13-055.html. n.d.
Department of Health and Human Services. PAR-16-238. https://grants.nih.gov/grants/guide/pa-files/PAR-16-238.html. n.d.
National Academy of Sciences (U.S.), Institute of Medicine (U.S.). Science, evolution, and creationism. Washington, D.C.: National Academies Press; 2008.
Nilsen P. Making sense of implementation theories, models and frameworks. Implement Sci. 2015;10:53. doi: 10.1186/s13012-015-0242-0
Damschroder LJ. Clarity out of chaos: use of theory in implementation research. Psychiatry Res. 2020;283:112461.
Ajzen I. The theory of planned behavior. Organ Behav Hum Decis Process. 1991;50(2):179-211.
McEachan RRC, Conner M, Taylor NJ, Lawton RJ. Prospective prediction of health-related behaviours with the theory of planned behaviour: a meta-analysis. Health Psychol Rev. 2011;5(2):97-144.
Chalmers I, Bracken MB, Djulbegovic B, et al. How to increase value and reduce waste when research priorities are set. The Lancet. 2014;383(9912):156-165.
Tabak RG, Khoong EC, Chambers DA, Brownson RC. Bridging research and practice: models for dissemination and implementation research. Am J Prev Med. 2012;43(3):337-350. doi: 10.1016/j.amepre.2012.05.024
Aarons GA, Hurlburt M, Horwitz SM. Advancing a conceptual model of evidence-based practice implementation in public service sectors. Adm Policy Ment Health. 2011;38(1):4–23. doi: 10.1007/s10488-010-0327-7
Damschroder LJ, Aron DC, Keith RE, Kirsh SR, Alexander JA, Lowery JC. Fostering implementation of health services research findings into practice: a consolidated framework for advancing implementation science. Implement Sci. 2009;4:50. doi: 10.1186/1748-5908-4-50
Proctor EK, Landsverk J, Aarons G, Chambers D, Glisson C, Mittman B. Implementation research in mental health services: an emerging science with conceptual, methodological, and training challenges. Adm Policy Ment Health. 2009;36(1):24–34. doi: 10.1007/s10488-008-0197-4
Proctor E, Silmere H, Raghavan R, et al. Outcomes for implementation research: conceptual distinctions, measurement challenges, and research agenda. Adm Policy Ment Health. 2011;38(2):65-76. doi: 10.1007/s10488-010-0319-7
Ehrenberg ASC. A primer in data reduction: an introductory statistics textbook. Chichester West Sussex ; New York: Wiley; 1982.
Edwards JR, Lambert LS. Methods for integrating moderation and mediation: a general analytical framework using moderated path analysis. Psychol Methods. 2007;12(1):1-22. doi: 10.1037/1082-989X.12.1.1
Fixsen DL, Naoom SF, Blasé KA, Friedman RM, Wallace F. Implementation research: A synthesis of the literature. Tampa, FL: National Implementation Research Network; 2005.
Spring B. Evidence-based practice in clinical psychology: what it is, why it matters; what you need to know. J Clin Psychol. 2007;63(7):611-631. doi: 10.1002/jclp.20373
Garner BR, Hunter SB, Funk RR, Griffin BA, Godley SH. Toward evidence-based measures of implementation: examining the relationship between implementation outcomes and client outcomes. J Subst Abuse Treat. 2016;67:15–21. doi: 10.1016/j.jsat.2016.04.006
Foy R, Sales A, Wensing M, et al. Implementation science: a reappraisal of our journal mission and scope. Implement Sci. 2015;10:51. doi: 10.1186/s13012-015-0240-2
Fairchild AJ, MacKinnon DP. A general model for testing mediation and moderation effects. Prev Sci. 2009;10(2):87-99. doi: 10.1007/s11121-008-0109-6
Greenhalgh T, Robert G, Macfarlane F, Bate P, Kyriakidou O. Diffusion of innovations in service organizations: systematic review and recommendations. Milbank Q. 2004;82(4):581-629. doi: 10.1111/j.0887-378X.2004.00325.x
Curran GM, Bauer M, Mittman B, Pyne JM, Stetler C. Effectiveness-implementation hybrid designs: combining elements of clinical effectiveness and implementation research to enhance public health impact. Med Care. 2012;50(3):217–226. doi: 10.1097/MLR.0b013e3182408812
United States Congress. 45 CFR 46. Department of Health and Human Services; 2009.
Baron RM, Kenny DA. The moderator–mediator variable distinction in social psychological research: Conceptual, strategic, and statistical considerations. J Pers Soc Psychol. 1986;51(6):1173-1182. doi: 10.1037/0022-3514.51.6.1173
MacKinnon DP, Lockwood CM, Hoffman JM, West SG, Sheets V. A comparison of methods to test mediation and other intervening variable effects. Psychol Methods. 2002;7(1):83-104. doi: 10.1037/1082-989x.7.1.83
MacKinnon DP, Fairchild AJ, Fritz MS. Mediation analysis. Annu Rev Psychol. 2007;58:593-614. doi: 10.1146/annurev.psych.58.110405.085542
MacKinnon DP. Introduction to statistical mediation analysis. New York: Lawrence Erlbaum Associates; 2008.
Garner BR, Godley SH, Dennis ML, Hunter B, Bair C, Godley MD. Using pay for performance to improve treatment implementation for adolescent substance use disorders: results from a cluster randomized trial. Arch Pediatr Adolesc Med. 2012;166:938–944. doi: 10.1007/archpediatrics.2012.802
Garner BR, Godley SH, Dennis ML, Godley MD, Shepard DS. The Reinforcing Therapist Performance (RTP) experiment: study protocol for a cluster randomized trial. Implement Sci. 2010;5:5. doi: 10.1186/1748-5908-5-5
Garner BR, Godley SH, Funk RR, Dennis ML, Smith JE, Godley MD. Exposure to adolescent community reinforcement approach treatment procedures as a mediator of the relationship between adolescent substance abuse treatment retention and outcome. J Subst Abuse Treat. 2009;36(3):252-264. doi: 10.1016/j.jsat.2008.06.007
Open Science Collaboration. PSYCHOLOGY. Estimating the reproducibility of psychological science. Science. 2015;349(6251):aac4716. doi: 10.1126/science.aac4716
Garber AM, Sox HC. The role of costs in comparative effectiveness research. Health Aff (Millwood). 2010;29(10):1805-1811. doi: 10.1377/hlthaff.2010.0647
Reeves P, Edmunds K, Searles A, Wiggers J. Economic evaluations of public health implementation-interventions: a systematic review and guideline for practice. Public Health. 2019;169:101-113. doi: 10.1016/j.puhe.2019.01.012
Roberts SLE, Healey A, Sevdalis N. Use of health economic evaluation in the implementation and improvement science fields—a systematic literature review. Implement Sci. 2019;14(1):72.
Eisman AB, Kilbourne AM, Dopp AR, Saldana L, Eisenberg D. Economic evaluation in implementation science: Making the business case for implementation strategies. Psychiatry Res. 2020;283:112433. doi: 10.1016/j.psychres.2019.06.008
Garner BR, Lwin AK, Strickler GK, Hunter BD, Shepard DS. Pay-for-performance as a cost-effective implementation strategy: results from a cluster randomized trial. Implement Sci. 2018;13(1):92. doi: 10.1186/s13012-018-0774-1
Schumi J, Wittes JT. Through the looking glass: understanding non-inferiority. Trials. 2011;12. doi: Artn 106, 10.1186/1745-6215-12-106
Stirman SW, Pontoski K, Creed T, et al. A non-randomized comparison of strategies for consultation in a community-academic training program to implement an evidence-based psychotherapy. Adm Policy Ment Health. 2017;44(1):55-66. doi: 10.1007/s10488-015-0700-7
Bansback N, Guh D, Oviedo-Joekes E, et al. Cost-effectiveness of hydromorphone for severe opioid use disorder: findings from the SALOME randomized clinical trial. Addiction. 2018;113(7):1264-1273. doi: 10.1111/add.14171
Oviedo-Joekes E, Guh D, Brissette S, et al. Hydromorphone compared with diacetylmorphine for long-term opioid dependence: a randomized clinical trial. JAMA Psychiatry. 2016;73(5):447-455. doi: 10.1001/jamapsychiatry.2016.0109
Arksey H, O'Malley L. Scoping studies: towards a methodological framework. Int J Soc Res Methodol. 2005;8(1):19-32. doi: 10.1080/1364557032000119616
Levac D, Colquhoun H, O'Brien KK. Scoping studies: advancing the methodology. Implement Sci. 2010;5:69. doi: 10.1186/1748-5908-5-69
Sales AE, Wilson PM, Wensing M, et al. Implementation Science and Implementation Science Communications: our aims, scope, and reporting expectations. Implement Sci. 2019;14(1):77. doi: 10.1186/s13012-019-0922-2