FROM:
PLoS One 2025 (Nov 17); 20 (11): e0334151 ~ FULL TEXT
Martha Funabashi • Lindsay M Gorrell • Katherine A Pohlman Andrea Bergna • Nicola R Heneghan • AdEMMo Group
Division of Research and Innovation,
Canadian Memorial Chiropractic College,
Toronto, Canada.
Spinal and peripheral joint manipulation (MAN) and mobilization (MOB) are widely used for managing musculoskeletal conditions. Although adverse events (AE) have been reported following these interventions, there is no universally accepted definition and classification system. This study aimed to establish an inter-professional and international standardized definition and severity classification for AE following MAN and MOB. This sequential mixed-methods study included an electronic Delphi process (e-Delphi) followed by focus groups. Inter-professional and international expert stakeholders participated in 3 e-Delphi rounds: Round 1 included open-ended questions on participants' working AE definition and severity classification; Round 2, level of agreement with statements generated from Round 1 and a previous scoping review; and Round 3, level of agreement with statements achieving consensus in Round 2. Focus groups explored e-Delphi findings.
Consensus was reached for severity categories (i.e., mild, moderate, severe and catastrophic) and on 2 domains to differentiate these categories (i.e., symptom intensity and impact on patient).
Consensus was not reached for a standardized AE definition following MAN and MOB. Focus group discussions centered on "unfavourable", "unexpected" and "undesired" terms and differences between "serious" and "catastrophic" severity classification categories.
Findings contribute to advancing patient safety and AE knowledge across professions and informing further safety research and practice.
From the FULL TEXT Article:
Introduction
Spinal and peripheral joint manipulation and mobilization are interventions commonly used by many healthcare professionals (such as physiotherapists, chiropractors, osteopaths, etc.) to manage musculoskeletal conditions, including spinal pain. [1, 2] Manipulation techniques (MAN) involve the manual application of a dynamic high-velocity, low-amplitude force; whereas mobilization techniques (MOB) involve the application of a cyclic low-velocity and variable amplitude manual force. [3]
There is increasing evidence supporting the effectiveness of these interventions to reduce pain and improve function in patients with musculoskeletal conditions with MAN and MOB both recommended in numerous clinical practice guidelines. [4–8] Despite this, research focused on the safety of MAN and MOB remains in its infancy and limited in both scope and depth, making it challenging to fully assess the risk-benefit profile of these interventions.
In all healthcare professions, patient safety is a priority, with emphasis on minimizing preventable and/or unexpected adverse events (AE) during or following any clinical intervention, including MAN and MOB. [9, 10] However, within the field of MAN and MOB, efforts to reduce AE have been hindered by a number of factors, including: i) heterogeneity in AE definitions; their nuanced wording and classification; ii) a lack of reporting systems that accurately and consistently collect safety information; iii) long periods of time between patient visits; and iv) a lack of coordination of care among multiple providers. [9, 11–14] Consequently, addressing these factors and collecting relevant AE data in a standardized way is fundamental to improve patient safety. [15]
This is challenging considering the spectrum of AE reported in the literature following MAN and MOB, ranging from frequent and expected minor AE (e.g., mild discomfort and increased muscle soreness after treatment) to rare and serious AE (e.g., cervical artery dissection and subsequent stroke, or death). [12, 16–18] Furthermore, a recent scoping review of the literature, encompassing several healthcare professions that use MAN and MOB, reported that a wide variety of terms and definitions are used to describe AE and, similarly, several systems are used to classify AEs. [19] Authors concluded there was no one standardized definition or classification system for reporting AE that was commonly used. This lack of inter-professional uniformity globally not only hinders the possibility of accurately quantifying AE incidence rates, but also impedes advancement in patient safety initiatives and strategies.
Additionally, the heterogeneity in AE definitions also precludes the accurate reporting of AEs and hinders effective communication among healthcare professionals, which can negatively impact patient outcomes. Therefore, to address this need, this study aimed to establish an inter-professional and international standardized definition and severity classification for AE following spinal and peripheral joint manipulation and mobilization, within an adult population with musculoskeletal conditions.
Methods
Figure 1
|
Design
A sequential explanatory mixed-methods design consisting of:
1) an electronic Delphi process (e-Delphi process); and
2) virtual focus groups was used.
The e-Delphi (January to July 2022) consisted of three rounds of questionnaires collected electronically using the [Research Electronic Data Capture system (REDCap®) platform. [20, 21] To help further understand and explain the results of the e-Delphi process, 3 focus groups were conducted in December 2022 (Figure 1).
A detailed rationale and justification for this design was previously described. [22] Briefly, the e-Delphi method was chosen as it addresses barriers from other consensus approaches, such as nominal group technique. Specifically, by overcoming differences in geographical location, time zones, and participants schedules, this method allowed for the participation of experts globally and without limits to specific participant groups.
All parts of this study were in accordance with the ethical standards, guidelines and regulations of the responsible committee, with the Helsinki Declaration and approved by the Canadian Memorial Chiropractic College Research Ethics Board (CMCC REB# 2103B01). All participants signed an electronic informed consent form prior to participation. The AdEMMo group (full name of members in acknowledgment section), comprised of 14 inter-professional and international members with expertise in patient safety, research methodology, and MAN/MOB, acted as a steering committee.
Given their knowledge and expertise in the area, they were divided into two sub-committees that provided their opinions by either: i) participating in each e-Delphi round as a respondent (CB, DC, ES, LC, MOB, MH, PaD, SV, SL); or ii) reviewing results at each round, providing feedback on questionnaire development, and approving questionnaires to be sent to participants (AG, PeD, RE, SM, SS). [22]
The AdEMMo group included a patient representative who was a member in the sub-committee reviewing rounds results and questionnaires ensuring patients’ perspectives were taken into consideration. The e-Delphi is reported in accordance with the “Guidance on Conducting and Reporting Delphi Studies (CREDES)” [S1 Table]. [23] Similarly, the qualitative portions of this study (round 1 of the e-Delphi process and focus groups) are reported in accordance with the “Standards for Reporting Qualitative Research (SRQR)” [S2 Table]. [24]
Procedure
e-Delphi process.
The e-Delphi process consisted of three rounds of questionnaires (Fig 1). A detailed description of each round was previously provided. [22] Briefly, round 1 questionnaire consisted of open-ended questions of participants’ current understanding of definition and severity classification of AEs following joint MAN and MOB. Round 2 included statements developed based on round 1 responses and findings from a previous scoping review of the literature mapping the definition and classification of AEs previous studies investigating joint MAN and MOB used. [19] Round 3 included feedback from round 2 using descriptive statistics to promote participant reflection and the questionnaire consisted of statements from round 2 that reached consensus for further agreement.
Definitions identified in the scoping review of the literature were only presented to participants in round 2 to avoid anchoring bias, encourage independent, discipline-specific contributions and ensure that the resulting consensus was not influenced by the literature or the researchers’ preconceptions. [23] Questionnaires were reviewed and piloted by (expert methodologist) members of the AdEMMo group and edited based on their feedback [S1 File]. All participants were invited to respond to all rounds to provide them with the opportunity to continue their involvement even if they were unable to complete previous rounds. Each round remained active for 6 weeks with email reminders sent in weeks 1, 3 and 5. [22]
Focus groups.
A topic guide was developed based on e-Delphi round 3 results with questions and prompts designed to encourage participant discussion [S3 Table]. A pilot focus group was conducted with colleagues from the investigative team that were not involved with the study to simulate the discussions as well as test and further refine the topic guide. Following the pilot focus group, modifications to the topic guide were made, including re-wording and re-ordering of questions.
Three virtual focus groups were conducted using an online platform (Zoom Video Communications, Inc., CA, USA). Conducting three focus groups would allow for thematic saturation and cross-group comparison, while accounting for practical constraints related to group size and participant availability across time zones. A summary of the e-Delphi results and discussion topics were circulated to participants a priori in pdf format [S2 File]. Each focus group was scheduled for 120 minutes and was digitally recorded. The facilitator (MF) encouraged all participants to actively contribute to the discussion to ensure that everyone had the opportunity to voice their opinions and thoughts on all topics. The moderator (NH) kept track of time to ensure all topics were discussed and monitored the chat box for written comments. In addition to notes taken by both the facilitator and moderator, each focus group was transcribed verbatim. Participants in each session had the opportunity to review their transcripts and add further comments prior to data analysis.
Participant eligibility and sample
e-Delphi process.
Table 1
|
Inclusion and exclusion criteria and participant recruitment were previously detailed. [22] Briefly, participants were adults (? 18 years old) with a high level of knowledge within the area of patient safety and AE related to MAN and MOB (Table 1). They were identified and recruited through social media, emails to professional networks, and authors of studies included in the previous scoping review. [19] Care was taken to ensure balanced representation from most continents, expert groups and professions. [22] Recruitment occurred over an 8-week period (September 8 2021 to November 3 2021), with all interested individuals completing an expression of interest form, including demographic and expertise information. [22] Individuals who met the eligibility criteria were invited to participate.
Focus groups.
All participants who responded to round 3 of the e-Delphi were invited to participate in a focus group following the conclusion of the e-Delphi process. Purposive sampling based on participant demographic characteristics, profession, geographical location, expert group and availability was used to ensure comparable exploration of findings from the e-Delphi process. This ensured diverse representation while still having enough participants in each focus group (minimum of 4 participants). All focus group participants signed a separate informed consent form and provided verbal consent for their focus group to be audio and video recorded.
Protocol
The protocol for the e-Delphi process was registered at Open Science Framework (osf.io/ex3ha) in September 2021, and published a priori. [22]
Due to the large heterogeneity of terms, definitions and classifications derived from round 1 responses, specific protocol deviations were deemed appropriate for the round 2 questionnaire:
Limited feedback from round 1 responses was provided in round 2 for time efficiency (i.e., feedback was restricted to a) number of respondents and studies (from scoping review of the literature) that contributed to the development of questions included in round 2 and b) specification that terms included in this round’s questions derived from participant responses from round 1);
Questions regarding participants’ preferences of terms were included based on terms that participants appeared to use interchangeably in round 1 responses; and
Open text boxes were provided at the end of questionnaire sections (rather than after each statement) to allow participants to elaborate only when necessary (e.g., to clarify reasoning, suggest refinements, or raise concerns), thereby reducing cognitive burden.
Additional protocol deviations included:
NH acted as the third reviewer for all qualitative analyses;
Definitions from the International Classification of Functioning, Disability and Health (ICF) [25] and Serious Adverse Events (SAE) [26, 27] were incorporated into round 2 and 3 questionnaires;
Inferential statistics were not used to calculate stability in rounds 2 and 3 due to heterogeneity of responses; and
Focus groups were added to further understand and explain the results of the e-Delphi process.
Data analysis
e-Delphi process
Qualitative data from round 1 open-ended questions were analysed using a theoretical thematic analysis, using Braun and Clarke’s six-phase framework. [28] Initial coding was structured around domains pre-identified from the scoping review of the literature [19] and served as a deductive framework. Within each domain, data were examined inductively for any new themes. [28, 29] Data were analysed independently by two researchers (MF/LG) using Microsoft Excel (Microsoft Corporation, USA) and any disagreements were resolved by discussion with a third reviewer (NH). The resulting codes were synthesized into statements for inclusion in Round 2 questionnaire. Wording used by participants was combined to generate statements that best represented similar themes across participants. [30] Statements generated from the scoping review [19] not identified from round 1 responses were included during round 2 questionnaire development. All researchers involved in round 1 open text responses are female and have a doctorate degree.
Both complete and incomplete responses were included in the analyses. Descriptive statistics, calculated using R (R Foundation for Statistical Computing, Vienna, Austria), were used to evaluate consensus in rounds 2 and 3 responses. Median, interquartile range and percentage agreement were calculated for each statement. Consensus for round 2 was pre-defined as 60% with an interquartile range (IQR) of ?1.5 and for round 3, 70% with an IQR with of ?1.0 [22, 30–32], and represent the proportion of participants who rated a given statement as either “agree” or “strongly agree” on a 5-point Likert scale. Agreement of rounds 2 and 3 responses were assessed using Kendall’s coefficient of concordance to assess inter-rater agreement. [22] Round 3 responses were further sub-grouped by profession and expert group to evaluate sub-group consensus. Statements not achieving consensus in round 2 were not included in round 3. Open text comments provided by participants in rounds 2 and 3 were analyzed qualitatively using thematic analysis and were used to inform the focus group topic guide. Statements achieving consensus after round 3 were used to inform the final AE definition and severity classification reported in this manuscript.
Focus groups
Data were analyzed once all focus groups had been completed using an inductive theoretical thematic analysis. The facilitator (MF) coded focus groups transcripts and identified key themes in consultation with a second reviewer (NH). Preliminary results were then presented to all named authors for further analysis and to inform interpretation. All researchers involved in focus groups data analysis are female, have a doctorate degree, and had previous experience with analysis of qualitative data.
Results
Participants
A total of 233 individuals expressed interest in participating in this study, with 215 meeting the eligibility criteria and being invited to participate in the e-Delphi process. Round 1 had 183 respondents (85% response rate), with 168 complete and 15 partially completed questionnaires. Subsequently, 212 participants were invited to round 2 with 163 (77%) responding (152 complete; 11 partial completion). Round 3 questionnaire was sent to 208 participants, of whom 153 (73%) responded (152 complete; 1 partial completion). Seven participants withdrew from the study (n?=?5 due to availability of time; n?=?2 due to methodological preferences/concerns).
At the end of round 3, a total of 74 participants (48%) expressed interest in focus group participation with a total of 16 participating.
Table 2
|
Demographic characteristics of participants in each e-Delphi round and the focus groups are presented in Table 2. While all e-Delphi rounds had respondent representatives from all expert groups and most continents of the world, focus groups had representatives from most expert groups and continents.
1) e-Delphi process
Round 1.
Adverse event definition:
Common descriptors participants used to define AE following MAN and MOB included “unfavourable”, “unexpected”, “undesired”, and “unintended”. Domains commonly used to define AE were temporality (e.g., “during” or “after” treatment), causality (e.g., “result of the treatment” or “not necessarily caused by the treatment”) and association (e.g., “associated with the treatment”).
Adverse event severity classification:
Common categories participants used to classify the severity of AE following MAN and MOB included “benign”, “mild”, “minor”, “moderate”, “major”, “severe”, “serious”, “catastrophic” and “fatal”. Domains commonly used to describe differences between severity classification categories included symptom intensity (e.g., “low intensity” or “NRS score of 4-6”), duration (e.g., “lasting minutes to days” or “medium to long term”), reversibility (e.g., “reversible” or “permanent damage”), level of disability (e.g., “impairs function” or “incapacity”), impact on patient (e.g., “not interfering with normal activities” or “prohibits normal everyday life, work, and quality of life”) and additional care required (e.g., “requires medical attention” or “requiring medical interventions which could include surgery and hospitalization”).
Round 2.
Adverse event definition:
Table 3
|
Ten terms reached consensus as most preferred by participants to define AE (i.e., “unfavourable”, “unanticipated”, “unexpected”, “undesired”, “unwanted”, “unintended”, “outcome”, “response”, “exacerbation”, and “following”). Similarly, 6 terms reached consensus as least preferred by participants (i.e., “deleterious”, “unusual”, “uncalled for”, “untoward”, “reaction”, and “deterioration”) and were discarded. Additionally, there was consensus to keep 5 statements and remove 3 statements (Table 3).
Adverse event classification:
Table 4
Table 5
|
There was consensus that the terms “mild” and “minor” are similar (86%), with “mild” being the preferred term (79%). There was also consensus that the terms “major”, “severe” and “serious” are similar (72%), with “severe” being the preferred term (82%) and “catastrophic” having greater severity than “major”, “severe” and “serious” (93%). Additionally, there was consensus that symptom intensity (60%) and impact on patient (70%) were the most preferred domains to differentiate between severity categories. There was also consensus that symptom duration (48%), symptom reversibility (48%) and additional care required (49%) were the least preferred domains to differentiate between severity categories (Table 4). The domains that reached consensus to differentiate between AE severity categories are presented in Table 5. The descriptors suggested by most participants for each domain are also presented in Table 5 although consensus on the descriptors was not reached.
Round 3.
Adverse event definition:
Table 6
|
None of the proposed statements related to the definition of AE following MAN and MOB reached consensus agreement (Table 6). Sub-grouped responses by profession and expert group also did not reach consensus [S4 Table].
Adverse event classification
There was consensus that the magnitude of symptom intensity and impact on patient would allow for differentiation between severity categories (Table 6).
A comprehensive table providing a summary of the severity classification description that reached consensus is provided in S5 Table.
2) Focus groups
Focus group discussions centered on and explored items that did not reach consensus during the e-Delphi process. For the definition of AE following MAN and MOB, participants shared opinions and thoughts related to the terms “unfavourable”, “unexpected” and “undesired”, and what AE meant to them. Participants discussed the possibility of expanding the AE definition beyond MAN and MOB to include physical examination of the patient and other conservative interventions.
With respect to severity categories, most participants stated that the term “catastrophic” implied greater severity than the term “serious”. Participants were presented with the SAE definition from the wider healthcare literature: “any untoward medical occurrence that results in death, is life-threatening requires inpatient hospitalization or causes prolongation of existing hospitalization results in persistent or significant disability/incapacity, may have caused a congenital anomaly/birth defect, or requires intervention to prevent permanent impairment or damage”. They were also presented with the definition from “catastrophic AE” from the e-Delphi process: “significant intensity, ranging between 8-10 on an 11-point numeric scale; life-threatening and could result in death; it totally disrupts a patient’s activities, participation, and quality of life”.
When presented with both definitions, most participants stated that the definitions should be reversed. Participants also shared concerns regarding the:
i) use of a generic numeric scale to classify AE as similar scales are commonly used by individuals to describe pain (AE are not only restricted to pain);
ii) overlapping numeric scores related to symptom severity;
iii) multiple criteria that must be met to determine severity category (i.e., symptom severity score and impact on patient); and
iv) importance of including symptom duration to differentiate between severity categories (e.g., transient, permanent, etc.).
A representation of the main themes discussed during the focus groups is presented in S1 Figure
Discussion
This study aimed to establish an inter-professional and international standardized definition and severity classification for AE following spinal and peripheral joint manipulation and mobilization for adults with musculoskeletal conditions and involved a large and diverse sample of participants representative of most expert stakeholder groups globally. Although consensus on a standardized definition for AE following MAN and MOB was not achieved, a standardized severity classification was established. Findings from focus groups emphasized the diversity of opinions contributing to the complexity of this topic.
Adverse event definition
The current study builds on Carnes and colleagues’ (2010) Delphi process [33] with a larger, more diverse and representative sample in terms of professions that use MAN and MOB, geographical location and expert stakeholder groups. Interestingly, both studies did not reach consensus on a definition for AE. Collectively, these studies along with the previous scoping review reporting a vast array of AE definitions used throughout the literature [19] highlight the historical diversity and heterogeneity in defining AE following MAN and MOB. Indeed, the complexity of standardizing AE vocabulary has been described with some AE definitions evolving over time and being revised with knowledge advancements. [34]
The challenge of establishing a standardized AE definition is not unique to MAN/MOB. Even main health-related resources have different definitions for AE. Specifically, the Common Terminology Criteria for Adverse Events (CTCAE) defines AE as an “unfavorable and unintended sign (including an abnormal laboratory finding), symptom, or disease temporally associated with the use of a medical treatment or procedure that may or may not be considered related to the medical treatment or procedure”. [35] Cochrane defines AE as an “unfavourable or harmful outcome that occurs during, or after, the use of a drug or other intervention, but is not necessarily caused by it”. [36]
The World Health Organization (WHO) has more than 10 different definitions for AE in their conceptual framework for the international classification for patient safety document, including “an undesired patient outcome that may or may not be the result of an error” and “an injury that was caused by medical management and that results in measurable disability”. [37]
It is noteworthy that most definitions include terms identified in our e-Delphi process (i.e., “unfavourable”, “unintended”, “undesired”); however, there were divergent opinions related to these terms in rounds 2 and 3, which were re-emphasized in the focus group discussions. While this could reflect individual preferences and different sources of AE knowledge and information, this lack of standardization is concerning as it can significantly influence the dialogue between healthcare professions and negatively impact patients’ safety and quality of care.
Focus group findings highlighted additional challenges related to language and heterogeneity in safety cultures depending on geographical location. This suggests that the diversity of opinions regarding the AE definition is influenced by local culture and standards. This is consistent with previous literature reporting on the influence of culture on health [38,39], as well as the WHO’s Global Patient Safety Action Plan Guiding Principles highlighting the importance of considering local context, culture, traditions, infrastructure and healthcare systems when implementing strategies to increase patient safety. [10]
Focus group discussions revealed that some participants felt challenged with defining AE separately from a severity classification. This could be due to the close interconnection between defining AE based on their severity classification observed in previous studies. [33, 40–42]
Focus group discussions inferred that consensus would not be reached even if additional e-Delphi rounds were conducted. Discussions also provided participants with the opportunity to change their personal AE definition based other participants’ perspectives. This suggests that additional opportunities for discussions related to AE might assist with developing individual understandings and conceptualization. This, in turn, has the potential to contribute to establishing a standardized inter-professional and international definition for AE in the future and highlights the need for training institutions to implement these established definitions throughout their curriculum.
Adverse event severity classification
The severity classification for AE following MAN and MOB and the domains used to differentiate categories that reached consensus in the current study are not aligned with those obtained in previous Delphi studies. [33, 43] Specifically, one previous severity classification included “minor”, “moderate” and “major” categories with duration, severity and descriptor as domains. [33] Another severity classification included “no adverse event”, “minor” and “major” severity categories with duration as the domain. [43] While the differences could be explained by the participation of different respondents, the vast heterogeneity in AE classifications described in the literature likely also contributed. Given the wider professional, expert stakeholder group and global representation included in this study, the standardized severity classification for AE following MAN and MOB obtained from this e-Delphi constitutes a robust foundation for supporting a more standardized classification of AEs. This would facilitate AE reporting and dialogue that promote inter-professional learning significantly contributing to advancing patient safety.
Round 1 responses demonstrated the need to quantify the impact of AE on patients’ functionality, quality of life and well-being as domains to differentiate severity categories. As such, this study used ICF’s definitions and modified qualifier scale. [25] While this may facilitate the use of such a scale for MAN/MOB providers, additional studies are needed to further assess its applicability in the context of ICF standards.
Round 2 responses showed that participants deemed the terms “major”, “severe” and “serious” as similar. When the established SAE definition was incorporated in the round 3 questionnaire, it was not considered similar to “catastrophic”. Focus group discussions revealed that participants thought the term “catastrophic” implied greater severity than the term “serious” and some suggestions to address this issue were voiced, including having SAE as a separate definition altogether, as well as merging “severe” and “serious” categories or “catastrophic” and SAE. Given the importance of standardizing terminology across healthcare professions, further investigation into how SAE fit into the proposed severity classification is warranted.
Despite the domains used to differentiate between severity categories reaching consensus during the e-Delphi process, focus groups revealed potential concerns including:
a) the 11-point generic numeric scale, b) the overlapping scores, and c) the importance of symptom duration (excluded in round 2).
a) The description of symptom intensity/severity using both words (e.g., “low”, “mild”, “moderate”, “high”, “significant”) and a generic 11-point numeric rating scale (NRS) have been observed not only in previous studies [33, 41, 44–47], but also during round 1 of this e-Delphi. The NRS is well-known to be a valid and reliable measurement of pain intensity and have been widely used in clinical practice and research in many healthcare areas. [48–50] Consequently, the assumption that the NRS included in this study’s severity classification was associated to pain intensity generated confusion in some participants. While the most common use of NRS is indeed for measuring pain intensity, especially in musculoskeletal conditions, it has also been validated for measuring other symptoms, such as perceived effort [51], spasticity [52], and itchiness. [53] In accordance with previous studies [44, 45, 54, 55] and based on responses from round 1, the proposed NRS included in the severity classification of the current study was intended as a measure of overall symptom intensity, rather than pain specifically.
b) Open text comments throughout the e-Delphi process included the importance of considering patients’ perspectives. The subjectivity of symptom severity and the different impact of similar symptoms on different patients’ functionality and well-being has been described. [56] The overlapping numeric scores in our proposed severity classification could provide the flexibility to account for this subjectivity (S5 Table).
c) Given that symptom duration was incorporated as a domain to differentiate between severity categories in previous studies [33, 40, 43, 44, 46, 57], it was interesting that this domain did not reach consensus in round 2 of the current study. While this could be associated with different severity classifications described in the literature that do not use symptom duration as a domain [45, 58–60], focus group participants shared how important they believed it was to consider symptom duration when differentiating between severity categories. This highlights, once again, the unique individual opinions and perspectives that exist related to this topic.
While revisions should continue to be made as the here-established severity classification is validated, this study introduces a severity classification for AE following MAN and MOB that reached consensus among over 150 participants representing most continents, professions that use MAN and MOB and expert stakeholder groups.
Strengths and limitations
Key strengths of this study included: i) adherence to an a priori published protocol [22]; ii) an inter-professional and international expert investigative team with patient representation; iii) the use of a sequential mixed-methods design that enabled further exploration of items that did not achieve consensus; iv) representation of all expert stakeholder groups and from most continents; and v) a high response rate and low attrition across all e-Delphi rounds. Limitations included: i) under representation from some expert groups (e.g., law, patients), manual therapy professions (e.g., naturopaths) and continents (e.g., Asia and Africa); ii) limited representation in the focus groups (n?=?16) due to time, availability and organizational considerations; iii) although consensus was reached on a severity classification for AE following MAN and MOB, its validity and reliability remain unknown.
Implications for practice and research
Findings contribute to advancing the knowledge related to AE following MAN and MOB internationally and across all professions that use these interventions. Even though a standardized adverse event (AE) definition was not reached, this study establishes a standardized severity classification for AEs following MAN and MOB internationally and across all professions that use these interventions. Additional opportunities for inter-professional and international discussions will contribute to advancing and evolving individuals’ perspectives and opinions, potentially towards a standardized AE definition in the future. Combined with the inter-professional and international standardized AE severity classification established in this study, such discussions will significantly contribute to enhancing precision in practice significantly enhancing patient safety and quality of care in clinical practice. Although future studies are needed to further assess the validity, reliability and applicability of the here-established severity classification and the incorporation of SAE definition, the classification developed in this study can serve as a preliminary reference to support more consistent inter-professional and international communication regarding AEs among all stakeholders.
Conclusion
A standardized severity classification including mild, moderate, severe and catastrophic categories was established with detailed descriptors for each severity category. Consensus on a standardized definition for AE following MAN and MOB was not reached. Findings contribute to advancing patient safety and AE knowledge inter-professionally and internationally and inform further safety research to enhance patient safety in clinical practice.
Supplementary Material
S1 Table
Recommendations for the Conducting and REporting of DElphi Studies (CREDES)
(DOCX)
S2 Table
Standards for Reporting Qualitative Research (SRQR)
(DOCX)
S3 Table
Focus Group Topic Guide.
(DOCX)
S4 Table
Round 3 sub-group analysis.
(DOCX)
S5 Table
Comprehensive summary of established classifications that reached consensus in this study.
(DOCX)
S1 File
E-Delphi Questionnaires.
(PDF)
S2 File
Summary of e-Delphi results and discussion topics.
(DOCX)
S1 Fig
Representation of the main themes discussed during the focus groups.
(DOCX)
Acknowledgments
The authors would like to truly thank all the participants who provided their time and expertise during the e-Delphi and focus groups.
AdEMMo Group members: Anita Gross (McMaster University), Christopher Burrell (Macquarie University), Dawn Carnes (University College of Osteopathy), Eva Skillgate (Karolinska Institutet), Lisa Carlesso (McMaster University), Maeve O’Beirne (University of Calgary), Michael Hill (University of Calgary), Paul Dougherty (VA Medical Center), Peter Doucet [patient representative], Roger Engel (Macquarie University), Silvano Mior (Canadian Memorial Chiropractic College), Stacie Salsbury (Palmer College of Chiropractic), Steven Vogel (University College of Osteopathy), Stina Lilje (Karolinska Institutet).
This study was presented at the 17th WFC biennial congress in Gold Coast, Australia 11th-14th October 2023 and was awarded the Scott Haldeman Award for Outstanding Research.
Funding:
The author(s) received no specific funding for this work.
Competing interests:
The authors have declared that no competing interests exist.
References:
Hurwitz EL.
Epidemiology: Spinal Manipulation Utilization
J Electromyogr Kinesiol. 2012 (Oct); 22 (5): 648–654
Beliveau PJH, Wong JJ, Sutton DA, Simon NB, Bussières AE, Mior SA, et al.
The Chiropractic Profession: A Scoping Review of Utilization Rates,
Reasons for Seeking Care, Patient Profiles, and Care Provided
Chiropractic & Manual Therapies 2017 (Nov 22); 25: 35
Rushton K, Beeton R, Jordaan JL, Lenerdene L, Lorrie M, Jan Pool A.
Educational Standards In Orthopaedic Manipulative Therapy
International Federation of Oorthopaedic Manipulative
Physical Therapists (IFOMPT) 2016
Coulter ID, Crawford C, Hurwitz EL, Vernon H, Khorsan R, et al.
Manipulation and Mobilization for Treating Chronic
Low Back Pain: A Systematic Review and Meta-analysis
Spine J. 2018 (May); 18 (5): 866–879
Clar C, Tsertsvadze A, Court R, Hundt GL, Clarke A, Sutcliffe P.
Clinical effectiveness of manual therapy for the management of musculoskeletal
and non-musculoskeletal conditions: systematic review
and update of UK evidence report.
Chiropr Man Therap. 2014;22(1):1–34. pmid:24679336
Paige NM, Miake-Lye IM, Booth MS, Beroes JM, Mardian AS, et al.
Association of Spinal Manipulative Therapy With Clinical
Benefit and Harm for Acute Low Back Pain:
Systematic Review and Meta-analysis
JAMA. 2017 (Apr 11); 317 (14): 1451–1460
Blanpied PR, Gross AR, Elliott JM, Devaney LL, Clewley D, et al.
Clinical practice guidelines linked to the international classification
of functioning, disability and health from the orthopaedic section
of the American physical therapy association.
J Orthop Sports Phys Ther. 2017;47(7):A1–83. pmid:28666405
Corp N, Mansell G, Stynes S, Wynne-Jones G, Morsø L, Hill JC, et al.
Evidence-based treatment recommendations for neck and low back pain
across Europe: A systematic review of guidelines.
Eur J Pain. 2021;25(2):275–95. pmid:33064878
Kohn L, Corrigan J, Donaldson M.
To Err Is Human: Building a Safer Health System
Washington DC: National Academy Press; (Nov 1999)
WHO.
Towards Eliminating Avoidable Harm in Health Care
Patient Safety Action Plan 2021-2030.
World Health Organization (3 August 2021)
Landrigan CP, Parry GJ, Bones CB, Hackbarth AD.
Temporal trends in rates of patient harm resulting from medical care.
N Engl J Med. 2010;363(22):2124–34.
pmid:21105794
Swait G, Finch R.
What Are the Risks of Manual Treatment of the Spine?
A Scoping Review for Clinicians
Chiropractic & Manual Therapies 2017 (Dec 7); 25: 37
Vohra S, Kawchuk GN, Boon H, Caulfield T, Pohlman KA.
SafetyNET: An interdisciplinary research program to support
a safety culture for spinal manipulation therapy.
Europ J Integr Med. 2014;6(4):473–7.
Nsubuga P, White ME, Thacker SB, Anderson MA, et al.
Public Health Surveillance: A Tool for Targeting and Monitoring Interventions.
In: Jamison DT, Breman JG, Measham AR, et al, editors.
Disease Control Priorities in Developing Countries.
2nd edition. Disease Control Priorities in
Developing Countries; 2006;22. Available from:
http://www.ncbi.nlm.nih.gov/books/NBK11770/pdf/ch53.pdf
Kuriakose R, Aggarwal A, Sohi RK, Goel R, Rashmi NC.
Patient safety in primary and outpatient health care.
J Family Med Prim Care. 2020;9(1):7–11. pmid:32110556
Funabashi M, Pohlman KA, Goldsworthy R, et al.
Beliefs, perceptions and practices of chiropractors and patients about
mitigation strategies for benign adverse events
after spinal manipulation therapy.
Chiropr Man Therap. 2020;28(1):46.
pmid:32895053
Carnes D, Mars TS, Mullinger B, Froud R, Underwood M.
Adverse events and manual therapy: A systematic review.
Manual Therapy. 2010;15(4):355–63.
Hebert JJ, Stomski NJ, French SD, Rubinstein SM.
Serious Adverse Events and Spinal Manipulative Therapy of
the Low Back Region: A Systematic Review of Cases
J Manipulative Physiol Ther 2015 (Nov); 38 (9): 677–691
Funabashi M, Gorrell LM, Pohlman KA, Bergna A.
Definition and classification for adverse events following spinal
and peripheral joint manipulation and mobilization:
A scoping review.
PLoS ONE. 2022;17(7):e0270671.
Harris PA, Taylor R, Thielke R, Payne J, Gonzalez N.
Research electronic data capture (REDCap)--a metadata-driven
methodology and workflow process for providing
translational research informatics support.
J Biomed Inform. 2009;42(2):377–81.
pmid:18929686
Harris PA, Taylor R, Minor BL, Elliott V, et al.
The REDCap consortium: Building an international
community of software platform partners.
J Biomed Inform. 2019;95:103208. pmid:31078660
Funabashi M, Pohlman KA, Gorrell LM, Salsbury SA.
Expert Consensus on a Standardised Definition and Severity Classification for Adverse Events Associated with Spinal
and Peripheral Joint Manipulation and Mobilisation:
Protocol for an International E-Delphi Study
BMJ Open 2021 (Nov 11); 11 (11): e050219
Jünger S, Payne SA, Brine J, Radbruch L, Brearley SG.
Guidance on Conducting and REporting DElphi Studies (CREDES) in
palliative care: Recommendations based on a
methodological systematic review.
Palliat Med. 2017;31(8):684–706. pmid:28190381
O’Brien BC, Harris IB, Beckman TJ, Reed DA, Cook DA.
Standards for reporting qualitative research:
a synthesis of recommendations.
Acad Med. 2014;89(9):1245–51. pmid:24979285
World Health Organization.
International Classification of Functioning, Disability and Health.
ICF. 2001. Available:
https://apps.who.int/iris/bitstream/handle/
10665/42407/9241545429.pdf
ICH-GCP, (E6).
International Conference on Harmonization Good Clinical Practice.
In: Glossary art. 1.2. p. Available from:
http://ichgcp.net/1-glossary
(FDA) F and DA,
Coomissioner O of the, (CDER) C for DE and R, (CBER) C for BE and R, (CDRH)
C for D and RH, (GCPP) GCPP. Adverse Event Reporting to
IRBs — Improving Human Subject Protection Guidance
for Clinical Investigators, Sponsors, and IRBs. 2009.
Available from: https://www.fda.gov/media/72267/download
Braun V, Clarke V.
Using thematic analysis in psychology.
Qual Res Psychol. 2006;3(2):77–101.
Braun V, Clarke V.
Successful Qualitative Research: A Practical Guide for Beginners.
SAGE Publications; 2013. Available from:
https://books.google.com/books?id=EV_Q06CUsXsC&pgis=1
Hasson F, Keeney S, McKenna H.
Research guidelines for the Delphi survey technique.
J Adv Nurs. 2000;32(4):1008–15.
Diamond IR, Grant RC, Feldman BM, Pencharz PB, et al.
Defining consensus: a systematic review recommends
methodologic criteria for reporting of Delphi studies.
J Clin Epidemiol. 2014;67(4):401–9. pmid:24581294
von der Gracht HA.
Consensus measurement in Delphi studies.
Technol Forecast Soc Change. 2012;79(8):1525–36.
Carnes D, Mullinger B, Underwood M.
Defining adverse events in manual therapies:
A modified delphi consensus study.
Int J Osteopath Med. 2010;13:94–98.
https://dx.doi.org/10.1016/j.math.2009.02.003
He Y, Sarntivijai S, Lin Y, Xiang Z, Guo A, Zhang S, et al.
OAE: The Ontology of Adverse Events.
J Biomed Semantics. 2014;5:29.
pmid:25093068
Services UD of H and H, Health NI of,
Institute NC. Common Terminology Criteria for Adverse Events
(CTCAE) v5.0. 2017. p. Published on Novermber 27.
https://doi.org/10.1039/DT9930001947
Preyer G, Golder S, Junqueira D, Vohra S, Loke Y.
Chapter 19: Adverse effects.
In: Higgins J, Thmoas J, Chandler J, Cumpston M, Li T, Page M, et al.,
editors. Cochrane Handbook for Systematic Reviews of Interventions.
Cochrane; 2022. Available from:
www.training.cochrane.org/handbook
World Health Organization.
Conceptual Framework for the International Classification for Patient Safety.
Final Technical Report. 2009.
Dutta MJ.
Communicating About Culture and Health:
Theorizing Culture-Centered and Cultural Sensitivity Approaches.
Commun Theory. 2007;17(3):304–28.
Iwelunmor J, Newsome V, Airhihenbuwa CO.
Framing the impact of culture on health: a systematic review of the PEN-3
cultural model and its application in public health research and interventions.
Ethn Health. 2014;19(1):20–46. pmid:24266638
Carlesso LC, Macdermid JC, Santaguida PL, Thabane L.
Determining adverse events in patients with neck pain receiving
orthopaedic manual physiotherapy: a pilot and feasibility study.
Physiother Can. 2013;65(3):255–65. pmid:24403696
Carlesso LC, Cairney J, Dolovich L, Hoogenes J.
Defining adverse events in manual therapy:
an exploratory qualitative analysis of the patient perspective.
Man Ther. 2011;16(5):440–6.
pmid:21377403
Gross AR, Kay TM, Kennedy C, Gasner D, Hurley L, Yardley K, et al.
Clinical practice guideline on the use of manipulation or mobilization
in the treatment of adults with mechanical neck disorders.
Man Ther. 2002;7(4):193–205.
pmid:12419654
Kranenburg HA, Lakke SE, Schmitt MA, Van der Schans CP.
Adverse events following cervical manipulative therapy:
consensus on classification among Dutch medical
specialists, manual therapists, and patients.
J Man Manip Ther. 2017;25(5):279–87.
pmid:29449770
Paanalahti K, Holm LW, Nordin M, Asker M, Lyander J, Skillgate E.
Adverse events after manual therapy among patients seeking care
for neck and/or back pain: a randomized controlled trial.
BMC Musculoskelet Disord. 2014;15(1).
Walker BF, Hebert JJ, Stomski NJ, Clarke BR, et al.
Outcomes of Usual Chiropractic.
The OUCH Randomized Controlled Trial of Adverse Events
Spine (Phila Pa 1976). 2013 (Sep 15); 38 (20): 1723–1729
Dagenais S, Brady O, Haldeman S.
Shared decision making through informed consent in
chiropractic management of low back pain.
J Manipulative Physiol Ther. 2012;35(3):216–26. pmid:22405500
Leboeuf-Yde C, Hennius B, Rudberg E, Leufvenmark P, Thunman M.
Side effects of chiropractic treatment: a prospective study.
J Manipulative Physiol Ther. 1997;20(8):511–5.
pmid:9345679
Williamson A, Hoggart B.
Pain: a review of three commonly used pain rating scales.
J Clin Nurs. 2005;14(7):798–804. pmid:16000093
Thong ISK, Jensen MP, Miró J, Tan G.
The validity of pain intensity measures.
Scand J Pain. 2018;18.
Ferreira-Valente MA, Pais-Ribeiro JL, Jensen MP.
Validity of four pain intensity rating scales.
Pain. 2011;152(10):2399–404.
pmid:21856077
Lampropoulou S, Nowicky AV.
Evaluation of the numeric rating scale for perception of
effort during isometric elbow flexion exercise.
Eur J Appl Physiol. 2012;112(3):1167–75.
pmid:21769733
Farrar JT, Troxel AB, Stott C, Duncombe P, Jensen MP.
Validity, reliability, and clinical importance of change in a 0-10
numeric rating scale measure of spasticity: a post hoc analysis
of a randomized, double-blind, placebo-controlled trial.
Clin Ther. 2008;30(5):974–85. pmid:18555944
Kimel M, Zeidler C, Kwon P, Revicki D, Ständer S.
Validation of Psychometric Properties of the Itch Numeric Rating Scale
for Pruritus Associated With Prurigo Nodularis:
A Secondary Analysis of a Randomized Clinical Trial.
JAMA Dermatol. 2020;156(12):1354–8. pmid:32936233
Eriksen K, Rochester RP, Hurwitz EL.
Symptomatic Reactions, Clinical Outcomes and Patient Satisfaction
Associated with Upper Cervical Chiropractic Care:
A Prospective, Multicenter, Cohort Study
BMC Musculoskelet Disord. 2011 (Oct 5); 12: 219
Rubinstein SM, Leboeuf-Yde C, Knol DL, de Koekkoek TE.
The Benefits Outweigh the Risks for Patients Undergoing
Chiropractic Care for Neck Pain: A Prospective,
Multicenter, Cohort Study
J Manipulative Physiol Ther 2007 (Jul); 30 (6): 408–418
Cleeland CS.
Symptom burden: multiple symptoms and their impact
as patient-reported outcomes.
J Natl Cancer Inst Monogr. 2007;(37):16–21. pmid:17951226
Thoomes-de Graaf M, Thoomes EJ, Carlesso L, Kerry R, Rushton A.
Adverse effects as a consequence of being the subject of orthopaedic
manual therapy training, a worldwide retrospective survey.
Musculoskelet Sci Pract. 2017;29:20–7. pmid:28284053
Pohlman KA, O’Beirne M, Thiel H, Cassidy JDavid, Mior S, Hurwitz EL, et al.
Development and validation of providers’ and patients’ measurement
instruments to evaluate adverse events after spinal manipulation therapy
Europ J Integr Med. 2014;6(4):451–66.
Tabell V, Tarkka IM, Holm LW, Skillgate E.
Do adverse events after manual therapy for back and/or neck pain
have an impact on the chance to recover? A cohort study.
Chiropr Man Therap. 2019;27:27. pmid:31205681
Skelly AC, Chou R, Dettori JR, Turner JA, Friedly JL, et al.
Noninvasive Nonpharmacological Treatment for Chronic Pain:
A Systematic Review Update
Rockville (MD): Agency for Healthcare Research and Quality (US); 2020.
Return to ADVERSE EVENTS
Since 11-18-2025
|