The search results are summarized in Fig. 1. The literature search returned a total of 2,345 studies. Following removal based on duplicates, a review of title and abstracts (n = 1654) was performed, and 246 full texts were screened. After checking against our inclusion and exclusion criteria, we included in total 41 publications in the final review. A list of excluded studies and reasons is included in APPENDIX 3.
Except for resistance training (CCA 4%), in all exercise types with more than one review included, there was a high or very high overlap of the included studies (walking: CCA 38%).
Summary results
The narrative analyses of included SRs/MAs showed large effects when comparing the exercise interventions with minimal or no intervention and are summarized in table 2. For most exercise types, there were no differences when different exercise types were compared with each other. Mostly small or non-significant effects on pain and disability were found in favour of the various exercise types compared with other control interventions, such as usual care. We found low to moderate evidence that any exercise type is effective for reducing pain and disability compared to no or minimal intervention, but that no exercise type is more effective than another.
a) Aerobic exercise
Aerobic exercises aim to improve the efficiency and capacity of the cardiorespiratory system (70). Our search resulted in one meta-analysis on the effects of aerobic exercise which covered a literature search up to March 2016 and included six studies with 333 subjects (36). The methodological quality varied across the studies. The review reached 13 points on AMSTAR-2 and was rated as having high quality (Table 1). Aerobic exercise was compared to resistance training, or combined aerobic and resistance training versus exercise advice, maintained normal activity, or waiting list (APPENDIX 4).
The results showed that aerobic exercise reduced pain, although neither aerobic nor resistance training proved to be superior to the other (Table 3a). No significant differences were reported for disability.
The GRADE analysis showed moderate quality evidence that aerobic exercise is as effective for the reduction of pain and disability compared to resistance training (Table 4). We downgraded due to possible publication bias, since only one review was identified.
b) Aquatic exercise
Aquatic exercises are any exercises performed in water, such as running, active range of motion, or strengthening (50). The literature search identified only one SR about the effectiveness of aquatic exercises, compared to land-based or no exercises (50). The review included eight RCTs with a total of 311 participants (Table 3b). The review was published in 2018, conducted its search in November 2016, and included a meta-analysis. The included RCTs were of moderate to high quality. The review reached 9 out of 16 points on AMSTAR-2, indicating moderate quality (Table 1). The reasoning of the study design was not reported, no list of excluded studies was provided, no reporting of funding, and RoB was not considered in the MA.
The MA found a statistically significant, but clinically questionable, reduction in pain and disability in patients treated with aquatic therapy compared to patients treated with land-based therapy (Table 3b). No information about the time point of outcome reporting was provided (APPENDIX 4).
The GRADE analysis showed that there is low quality evidence that aerobic exercise is superior in reduction of pain and disability compared to land-based exercise (Table 4). The evidence was downgraded due to study limitations and possible publication bias, since only one review was identified.
c) Motor control exercises
Motor control exercises aim to restore the neuromuscular control of the muscles stabilizing the spine, and are graded from low load exercises into activation during functional exercises and activities (71). We included 11 reviews (18, 35, 40, 42, 45, 46, 48, 55-57, 65) on motor control exercises (MCE), including one review on movement control exercises (45). All but one (57) had conducted a meta-analysis. The publication year ranged from 2008 to 2020 and the last updated search was October 2018 (48). The reviews included between four to 34 low to high quality RCT’s and included between 209 to 2514 participants. There was a high overlap of the included studies (CCA 14%). AMSTAR-2 ranged from 4 to 16 points out of 16 (Table 1). Only two of the reviews were rated with an overall high quality (18, 35). Several of the reviews reported “nearly yes” on the item “presenting a protocol”, and most of the reviews reported a “No” on items “presenting a list of the excluded studies” and “presenting the source of funding and conflicts of interest”.
The reviews reported outcomes of pain and disability in the short, intermediate and long-term (Table 3c). Control interventions were general exercises (GE), spinal manual therapy (MT), multimodal treatment (MMT), or information/minimal intervention/usual care (MI) (APPENDIX 4). The narrative synthesis on pain in the included reviews showed a small but non-significant effect for motor control exercises over general exercises in some of the reviews mainly in the short and intermediate term (18, 35, 40, 42, 55, 56, 65). Compared to manual therapy, none of the nine reviews presented any results on differences to motor control exercises for pain (18, 35, 40, 42, 45, 46, 55, 56, 72). Four reviews, however, reported significant and clinically relevant results showing that motor control exercises were more effective in short, intermediate and long-term compared to minimal intervention (18, 40, 46, 55).
The narrative synthesis on disability in the included reviews showed a small but nonsignificant effect for motor control exercises over general exercises (18, 35, 40, 42, 46, 55, 65), while Niederer et al. (2020) presented results of no difference at any time points (48). Motor control exercises showed small and non-clinically relevant results compared to manual therapy in two reviews (40, 46). Compared to minimal intervention motor control exercises showed significant and clinically significant differences in the short (18, 40, 55), intermediate (18, 40, 46, 55), and long-term (18, 40).
The GRADE analyses showed that there is a moderate level of evidence on the effect of motor control exercises on pain compared to minimal intervention, and a low level of evidence that there is an effect on disability (Table 4). Downgrading was based on the inconsistency of the results, and for disability also on imprecision due to significant heterogeneity.
d) Resistance training
Resistance training includes exercises to improve the strength, power, endurance and size of skeletal muscles (70). The interventions included resistance training, back muscle training and medical training therapy. We included three SR’s on the effect of resistance training (66, 67, 69), but none conducted a meta-analysis. Publication year ranged from 2001 to 2012 and the last updated search was performed in April 2010. In one review, only two RCTs were included (67), while the other two reviews included 12 and seven RCTs, respectively, with a small overlap (CCA 4%) (66, 69). Two reviews reported moderate to high quality of the included RCTs (66, 67), while one review did not report on study quality (69). AMSTAR-2 scores were very low and ranged from 1.5 to 5, indicating critically low, and low-quality reviews (Table 1). Resistance training was compared with passive treatments, fitness training, no treatment placebo, or cognitive interventions (APPENDIX 4).
All included reviews reported on decreased pain scores in the included studies compared to passive or no intervention (66, 67), but it was unclear if the effect was clinically relevant or what time-period was used. One review found no difference compared to a cognitive behavioural intervention, and the effect disappeared at the long-term follow-up (67), while another review reported no difference in pain scores when compared to fitness training (66) (Table 3d). Resistance training was found effective for the reduction of disability in all reviews compared to passive or no intervention in one review (66), but it was unclear what comparison groups were used in the other two (67, 69) (Table 2d).
The GRADE analyses showed that there was a very low level of evidence that resistance training has positive effects on pain and disability, but not compared to fitness training and cognitive behavioural intervention (Table 4). The level of evidence was downgraded due to low study quality, inconsistency, imprecision, and an increased risk of publication bias.
e) Pilates
Pilates exercises follow the traditional Pilates principles such as centring, concentration, control, precision, flow and breathing (52).The literature search resulted in nine systematic reviews (19, 32, 39, 44, 47, 52, 61, 62, 64), of which five had performed an MA on the effect of Pilates (19, 32, 44, 47, 62). The reviews included between four to 14 RCTs, published between 2005 and 2016, and included between 134 (62) to 708 participants (39). There was a very high overlap of the included studies (CCA 32%). Publication years ranged from 2011 (44, 64) to 2018 (39); the last updated search was April 2016 (39). The studies included in the reviews were graded with different types of quality scores and were rated as being of moderate quality, with some exceptions to both low and high quality. The study quality ranged from low, 7 points (64) to 16 points (19) out of 16 points of AMSTER-2 (Table 1). Most of the reviews did not present the source of funding or conflict of interest (Item 10). All studies with MAs significant large heterogeneity. The intervention dosage varied greatly between the studies, and due to poor reporting, it was impossible to summarize a typical exercise duration, frequency, or intensity. The control interventions also varied greatly between the different studies, and contained treatments such as other exercise types, McKenzie, massage, back school programs, or information/minimal intervention/usual care (APPENDIX 4).
The narrative synthesis on the outcome pain showed significant (but mostly small) effects for Pilates over no or minimal intervention in eight reviews (19, 32, 39, 44, 47, 52, 61, 64). In all included reviews there were no differences compared to other types of exercises, except for one review that found a superiority for Pilates exercises compared to physical activity (52). Similar results were found in the narrative synthesis on disability. Six of the included reviews reported on non-significant effects for Pilates over minimal intervention (19, 39, 47, 52, 61, 64), and the majority of the reviews pointed out that Pilates exercises were as effective compared to other types of exercises, mainly with short-time effects (Table 3e). Most outcomes were of short or intermediate term and most often the effect was lower than the recommended minimal clinical important difference (MCID) for pain and disability measures.
The GRADE analyses showed a moderate level of evidence on the short-term effects of Pilates compared to minimal intervention and no effect if compared with other types of exercise concerning pain (Table 4). For disability, the level of evidence was low for this comparison. For both pain and disability, the evidence was downgraded due to the low and moderate quality of most of the reviews. Moreover, an additional down-grading for disability was added, since the results were conflicting concerning the conclusion on the effectiveness of Pilates over minimal interventions.
f) Sling exercises
Sling exercises use slings and elastic bands to offset body weight and progress the exercises without pain (73). We found two systematic reviews on the effect of sling exercises (37, 60) including one meta-analysis (37). In total, 16 studies were included, but there was a very high overlap between the two reviews (CCA 23%). The last updated search in the systematic review was August 2013. The methodological quality of the included studies according to PEDro assessments was moderate and low. The AMSTAR-2 ratings for these two studies was 8 (60) and 14 points (37), indicating low and high quality studies, respectively (Table 1). The interventions in both reviews were primarily sling exercise based; however, the sling exercises were also combined with e.g. passive modalities and with other kind of exercises, back school, contemporary treatment, and drugs. The control groups received other forms of exercise, passive modalities, manipulation, contemporary treatment, and drugs (APPENDIX 4).
The narrative analyses of the reviews showed that sling exercises are no more effective in reducing pain or improving disability compared with other types of exercise (Table 3f). However, in comparison to passive modalities or the combination of physical agents and drug therapy, sling exercises were more effective in decreasing pain and improving disability, but only sling exercise vs thermomagnetic therapy showed clinically relevant differences between the groups in favour of sling exercise. In addition, sling exercises were found to be not more effective compared to traditional Chinese medical therapies (37). Sling exercises in addition to acupuncture therapy were as effective as acupuncture therapy alone for reduction of pain and improvement of disability.
The GRADE analyses showed that there is moderate and low level of evidence for short-term and long-term effects on pain and disability, respectively, for sling exercises over thermomagnetic therapy; however, the evidence is only based on two RCTs (Table 4). We downgraded due to lack of precision and, for disability, also for inconsistency of the results.
g) Traditional Chinese Exercises (Tai chi/Qigong)
Tai chi and Qigong, two common types of traditional Chinese mind-body techniques, also referred to as Traditional Chinese Exercises (TCE), include low to moderate intensity exercises coordinated with slow body movement and focus on physical- mental connection (38). Two reviews were identified evaluating the efficacy of Tai Chi and Qigong, and both performed a meta-analysis. Included were 10-11 RCT’s published between 2008 and 2019, and the total sample size ranged between 886 and 959 participants. There was a high overlap of the studies investigating Traditional Chinese Exercises (CCA 13%). The studies included in the reviews were rated as having fair to good quality. The risk of bias was assessed to 11.5 and 13 out of 16 AMSTAR-2 points, indicating moderate to high quality (Table 1). Both reviews compared the efficacy of either Tai Chi or various types of Qigong (Wuqinxi, Baduanjin, Liuzijue) to either no treatment, active treatment (strength exercise, backwards walking, or other physiotherapy), or usual care, with or without the experimental component (APPENDIX 4).
The narrative synthesis on pain showed small to moderate effects for Traditional Chinese Exercises over no treatment, active treatment, or usual care only. Subgroup analyses revealed a larger effect when Tai Chi was compared to no treatment, than to active control interventions or to routine care (without an added Tai Chi component) (Table 3g). Only short-term effects seem to have been evaluated, but exact follow-up time was not reported. The synthesis on disability showed a variability in effect, from small to large effect for Traditional Chinese Exercises over no treatment, active treatment or usual care only (Table 3g). In both systematic reviews, the effects differed depending on outcome measure used (38, 49).
The GRADE analyses for pain showed a moderate level of evidence concerning pain and a low level of evidence for disability on the short-term effects of Traditional Chinese Exercises compared to no intervention (passive control), various active treatments or usual care in chronic LBP patients concerning pain (Table 4). The evidence was downgraded for imprecision due to heterogeneity (pain, disability) and additionally due to large confidence intervals of the effects (disability).
h) Walking
Walking interventions use outdoor walking (with or without supervision), treadmill walking, and/or Nordic walking as therapeutic programs in chronic LBP (51). The literature search identified three systematic reviews (34, 43, 51) of the effectiveness of walking interventions, two of which performed a meta-analysis (34, 51). The reviews included five to nine studies with 329 to 869 participants with a very high overlap of the studies (CAA 37.5%). The reviews were published between 2016 and 2019, with the last updated search up in October 2017 (34). The quality of the included studies was low (43) to high (34). Two reviews achieved 10.5 out of 16 AMSTAR-2 points, indicating moderate quality (43, 51), and one achieved 15 out of 16 AMSTAR-2 points, indicating high quality (34). Two reviews (43, 51) did not report excluded studies, the source of funding, and did not investigate the impact of study quality on summary estimates (Table 1). All reviews compared the effectiveness of walking interventions (overland and/or treadmill, and/or Nordic walking) to non-pharmacological interventions (e.g. other types of exercise, physical therapy, education), and two additionally compared walking and exercise to exercise alone (34, 51) (APPENDIX 4).
Both the meta-analyses (34, 51) for either pain or disability and the SR (43) for disability found no significant differences between walking and the comparison groups that received other interventions (Table 3-H). Neither did adding walking to the comparison groups induce a statistical improvement.
The GRADE analysis showed that there is a low quality of evidence that walking is as effective as other non-pharmacological interventions for pain and disability improvement in chronic LBP patients, and adding walking to exercise does not increase effectiveness (Table 4). The evidence was downgraded due to study limitations and for imprecision due to large confidence intervals of the effect, and a large overlap of the reviews.
i) Yoga
Yoga exercises follow the traditional Yoga principles with a physical component (41). We included nine systematic reviews on Yoga (20, 33, 41, 53, 54, 58, 59, 63, 68). Five out of nine conducted a meta-analysis (20, 33, 41, 53, 59). The reviews included between four to 14 RCTs and 403 to 1444 participants, with a very high overlap (CCA 28%). The publication year ranged from 2011 (63) to 2019 (53), and the last updated search was in 2018 (53). The included studies were graded with varied quality. The study quality ranged from 3 (68) to 16 points (20) on AMSTAR-2 (Table 1). Only two reviews were rated as having high quality (20, 33). Only one study presented a list of the excluded studies (20). The Yoga interventions were highly heterogenous, not only in terms of which kind of Yoga was used, but also in the length, frequency, and intensity of the sessions. Some interventions were combined with other physical therapy modalities, with book readings or usual treatments. There were no clear manuals or protocols that described the Yoga interventions. The control interventions were treatment such as physical therapy, waitlist control, stabilizing exercise and physical therapy, conventional exercise therapy, usual care, educational control group, and self-directed medical care (APPENDIX 4).
The narrative synthesis on both pain intensity and disability in the included reviews showed a short-term effect for Yoga, especially compared to no or minimal intervention, but also compared to general exercises. Three MAs showed medium, and medium-to-large effects, indicating that the effects of Yoga may be of clinical importance (33, 41, 59). However, the long-term effects did not seem to demonstrate better effects than usual care (Table 3i).
The Grade analyses showed a low level of evidence for a short-term effect in pain and disability for Yoga over general exercises; however, the long-term effects did not seem to demonstrate better effects than usual care or compared to usual care or compared to other types of exercises (Table 4). We downgraded due to large heterogeneity between the studies and inconsistent results. Although the risk of bias was high in most of the reviews, two reviews had low risk of bias (16 points); hence we decided not to downgrade due to study limitations.