Objective To conduct a systematic review and meta-analyses to assess the effect of manual therapy interventions for healthy but unsettled, distressed and excessively crying infants and to provide information to help clinicians and parents inform decisions about care.
Methods We reviewed published peer-reviewed primary research articles in the last 26 years from nine databases (Medline Ovid, Embase, Web of Science, Physiotherapy Evidence Database, Osteopathic Medicine Digital Repository , Cochrane (all databases), Index of Chiropractic Literature, Open Access Theses and Dissertations and Cumulative Index to Nursing and Allied Health Literature). Our inclusion criteria were: manual therapy (by regulated or registered professionals) of unsettled, distressed and excessively crying infants who were otherwise healthy and treated in a primary care setting. Outcomes of interest were: crying, feeding, sleep, parent–child relations, parent experience/satisfaction and parent-reported global change.
Results Nineteen studies were selected for full review: seven randomised controlled trials, seven case series, three cohort studies, one service evaluation study and one qualitative study.
We found moderate strength evidence for the effectiveness of manual therapy on: reduction in crying time (favourable: −1.27 hours per day (95% CI −2.19 to –0.36)), sleep (inconclusive), parent–child relations (inconclusive) and global improvement (no effect). The risk of reported adverse events was low: seven non-serious events per 1000 infants exposed to manual therapy (n=1308) and 110 per 1000 in those not exposed.
Conclusions Some small benefits were found, but whether these are meaningful to parents remains unclear as does the mechanisms of action. Manual therapy appears relatively safe.
PROSPERO registration number CRD42016037353.
- Manual Therapy
- Excessive Crying
This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/
Statistics from Altmetric.com
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.
Strengths and limitations of this study
Meaningful outcomes for parents with distressed, unsettled and excessively crying infants were investigated to help inform their decisions about seeking manual therapy care for their infants.
Compiling evidence for distressed, unsettled and excessively crying infants based on multiple ‘clinical diagnoses’ using varied definitions is difficult.
The mechanism of action of complex interventions was not explained by the pragmatic research investigations used in this review.
Low to moderate quality studies limited the certainty of conclusions, suggesting they are liable to change with further research.
Unsettled infant behaviour and colic are terms used to describe a range of behaviours in infants aged up to 12 months that include prolonged episodes of crying, difficulties with sleeping and/or feeding.1 Reports suggest a prevalence of approximately 20%,2 and the incidence is equal between sexes.3 The problems are found more commonly in first-borns and infants who have siblings who also had this condition.4–6 High levels of multiple health service use have been found in the postpartum period, including visits to emergency departments.1 4 A cost burden analysis found that the annual cost to the UK National Health Service of infant crying and sleeping problems in the first 12 weeks of life was £65 million.5 There are associations between unsettled infant behaviour and high maternal depression scores,6 and the natural crying peak at 6 weeks coincides with the peak age for severe infant injury or death as a result of child abuse.7
Many aetiological factors for unsettled infant behaviour have been explored including diet, feeding and digestive issues,8–11 musculoskeletal strains and disorders,12 13 developmental progress14–17 and parenting.18–22 Despite extensive research, causative factors and effective treatment remain elusive.
Medicalising these symptoms is controversial as they are seen as self-limiting with infants normally settling after 12 weeks. However, coping with these infants during this period can be very difficult.
Manual therapists offer a mix of health screening, education, advice, psychological support and touch therapy for these infants. Manual treatment is based on the premise that infants may have musculoskeletal strains or limitations affecting comfort, feeding and gut motility causing distress. A previous Cochrane review (2012) of manual therapy and colic meta-analysed data from six randomised controlled trials (RCTs) and found small positive (statistically significant) changes in crying time outcomes overall. However, a sensitivity analysis of data from only RCT studies where parents were blinded to treatment did not show beneficial effects.23 Other analyses showed a small beneficial effect for sleep but not for ‘recovery’. The studies included in this review were generally small and methodologically prone to bias, so definitive conclusions could not be drawn and effects were downgraded accordingly.23
There are some concerns around the safety of manual techniques in the treatment of infants, but published data of cases of serious adverse events are rare.24 No reviews to our knowledge have explored qualitative research and non-specific effects such as parental confidence and satisfaction. In this review, we aimed to update the Cochrane review23 of RCTs for crying time and investigate non-RCT studies and outcomes that are important to parents, rather than biomedical markers alone that might be of more interest to primary researchers exploring aetiology as our selected population was infants that were considered healthy.
Types of studies
We included the following types of peer-reviewed studies in our search: RCTs, prospective cohort studies, observational studies, case–control studies, case series, questionnaire surveys and qualitative studies. We excluded single-case studies and non-peer reviewed literature (editorials, letters, master’s and undergraduate theses). Systematic reviews were identified to inform our research and for citation tracking. There were no language restrictions in our search criteria.
Types of participants
Participants were aged between 0 months and 12 months (infants) when they received manual therapy treatment. They were healthy, thriving and not receiving other medical interventions. Their presenting symptoms were excessive crying, distress and unsettledness; they might also be described as having colic, constipation, breastfeeding/feeding difficulties and/or gastro-oesophageal reflux/discomfort.
‘Colic’ was determined using the Wessel ‘rule of three’25 or Rome III26 criteria. The latter considers infants to have colic if they were thriving and healthy but had paroxysms of irritability, fussing or crying lasting for a total or more than 3 hours a day and occurring on more than 3 days a week for more than 1 week.26
We excluded studies that included infants requiring treatment for conditions that needed specialist or hospital-based clinical care for conditions such as: respiratory disorders, developmental disorders (learning and motor), cystic fibrosis, cerebral palsy, otitis media, neuralgia, congenital torticolis or musculoskeletal trauma. We also excluded studies about plagiocephaly or brachycephaly.
We included studies where the manual therapy intervention was delivered in primary care by statutorily registered or regulated professional(s). This included osteopaths, chiropractors, physiotherapists and any other discipline using manual contact as the primary therapeutic component. The intervention or therapy had to involve physical and/or manual contact with the patient for therapeutic intent, administered without the use of mechanical, automated, electronic, computer or pharmacological aids/products/procedures. We excluded mixed or multidisciplinary interventions where the response to the manual therapy elements would have been unclear/undeterminable. Studies where the professional trained a non-professional to deliver the therapy or where parents delivered the therapy were excluded also.
Types of outcome measures
Outcomes of interest were unsettled behaviours, experience/satisfaction and global change scores. Unsettled behaviours included, for example, excessive crying, lack of sleep, displays of distress or discomfort (back arching and drawing up of legs) and difficulty feeding. Adverse events data were also collected.
Selection of articles
Nine electronic databases were searched from January 1990 to January 2017 in the last 26 years: Medline Ovid, Embase, Web of Science (WOS), Physiotherapy Evidence Database, Osteopathic Medicine Digital Repository, Cochrane (all databases), Index of Chiropractic Literature, Open Access Theses and Dissertations and Cumulative Index to Nursing and Allied Health Literature. We selected this timeframe because our scoping work revealed that most papers prior to January 1990 were theory-driven position papers on the manual therapy care of infants and for pragmatic reasons in terms of access to full-text original articles.
The main search string (modified for the different engines) is included in the electronic online supplementary appendices. It included the key terms: musculoskeletal, manipulation, manual and physical therapy, physiotherapy, osteopathy and chiropratic with infant baby and new borns. We updated the search to the end of January 2017 using Medline Ovid and search alerts from Embase, Cochrane and WOS. We also located articles through peer networks. Four reviewers (the authors in two teams of two) reviewed the titles and abstracts, then the full texts independently. Where there was disagreement between the reviewers, a third reviewer from the other team arbitrated the final decision to select reject. Review articles retrieved in the search were citation tracked to identify additional studies. Covidence software was used to organise and classify the articles.27 See figure 1 for a flow chart of the search process.
Supplementary file 1
Quality appraisal of included studies
Two reviewers independently rated the quality of each included study (either CM/JE or DC/AP). We used the appropriate quality appraisal tools for each type of study design.28–30 An overall quality score for each study was assigned by summing the number of quality criteria that were present. For RCTs: six risk of bias criteria were assessed28 (5–6 quality criteria evaluated as present indicated low risk of bias=high quality, 3–4=moderate quality and 1–2=low quality). For cohorts: 11 quality criteria were assessed29 (8–11 quality criteria evaluated as present=high quality, 4–7=moderate quality, 0–3=low quality). For case series: nine quality criteria were assessed30 (if 7–9 quality criteria were present=high quality, if 3–6=moderate quality and if 0–3=low quality). For qualitative studies: 10 criteria were assessed29 (if 8–10 quality criteria were present=high quality, 4–7=moderate quality and 0–3=low quality). All low quality cohort and case series studies were regarded as severely methodologically flawed and were not included in the final analyses.
Data extraction and synthesis
One reviewer extracted the data and another checked the data extractions (all authors).
We aimed to meta-analyse data for RCTs and matched or paired cohort studies. For RCTs, we planned to extract final value scores for each group and convert them to standardised mean differences and weighted mean differences for comparison using a random effects model due to the expected differences in treatment protocols and effects between studies. Where there was a majority of either change or final value scores, we planned sensitivity analysis to check ‘consistency’/meaning of the meta-analyses. We planned to extract risk ratios (RR) for comparison of adverse events between treatment and control groups. I2 was used to calculate heterogeneity. RevMan software (V.5.3) was used to conduct the meta-analyses.
For non-RCT studies, analyses proposed were descriptive and narrative, but change scores and RRs were extracted where possible. If there were a sufficient number of qualitative studies, we proposed to organise and synthesise findings from the qualitative data, by identifying emergent themes and subthemes.
Strength of evidence
We rated the strength of evidence across studies for each outcome, as either high, moderate or low, taking note of the quality and overall direction of results (inconclusive, favourable or unfavourable).31 Strength of evidence was considered as follows:
Consistent results from at least two high-quality RCTs, or other well-designed studies, conducted in representative populations where the conclusion is unlikely to be strongly affected by future studies.
Available evidence from at least one higher quality RCT or two or more lower quality RCTs but constrained by: number, size, quality, inconsistency in findings and limited generalisability to clinical practice. The conclusions are likely to be affected by future studies.
Evidence was insufficient with limitations in data provision, number, power, quality, inconsistency in results and findings not generalisable to clinical practice. All studies that were rated as low quality were treated as inconclusive regardless of author findings.
Two reviewers rated the quality and strength of evidence, and a consensus vote was used in cases of disagreement.
A total of 11 423 studies were retrieved. After duplicate removal, 8844 studies remained. There were 8638 references excluded by title and abstract predominantly because the population was not appropriate; for example, the children were too old and/or treatment settings were not primary care. We acquired full text for 206 references and 19 of these fulfilled our inclusion criteria. Reasons for exclusion are listed in figure 1.
There were 19 primary studies included: seven RCTs,32–38 seven case series,39–45 three cohort studies,46–48 one service evaluation survey49 and one qualitative study.50 One other primary study was excluded due to translation difficulties of technical terms in Chinese medicine.51 All studies were published between January 1990 and January 2017. Countries represented across the studies were the UK,32–34 41–43 46 47 49 USA,35 40 48 Canada,38 Australia,39 44 50 Norway36 and Denmark.37 45 The following conditions were represented in the studies: colic (11 studies),32–34 36 37 39 40 43 45–47 gastro-oesophageal reflux (2 studies),35 40; breastfeeding difficulties (5 studies)38 42 44 48 49 and infant signs of distress (described as headache) (1 study).41 With the exception of four studies, all used chiropractic intervention. The other four studies used massage therapy35 and osteopathic intervention.33 38 49 Eight studies used control groups.32–36 38 46 47 The controls varied across studies, from no physical treatment33 34 36 46 47 to a sham treatment35 38 or drug.37 See table 1 for characteristics of included studies.
In the few cases where there was uncertainty with selection choice, these were all resolved after discussion with a third reviewer.
The methodological quality of the studies varied (table 2). Five studies were rated as high quality: four RCTs (low risk of bias)32 34 35 38 and a qualitative study.50 Seven were of moderate quality.33 36 39 42 43 45 49 The remaining seven were rated as low quality due to severe methodological flaws (eg, small samples, the treating clinician observed and reported outcomes)37 39 41 44 46–48 (table 2). The non-RCT studies rated as low quality were excluded from further analyses.
Table 3 shows the results from studies reporting similar outcomes. Six studies reported outcomes related to improvement in feeding,38 42 44 48–50 seven reported a reduction in crying time,32–34 36 37 45 46 five reported global improvement in symptoms,32 34 36 39 40 four reported sleep outcomes32 33 38 46 and three reported outcomes about parent–child relations.33 35 46 The remaining outcomes were from one study only.
A meta-analysis was only possible for the RCTs with outcomes measuring reduction in crying time and for adverse events.
Meta-analyses for global improvement in symptoms, parent–child relations, sleeping time and feeding were not possible because: several studies did not have a ‘no-treatment’ control group,32 39 40 42 44 48–50 did not present data at their primary endpoints,34 36 did not collect enough data or the data and outcomes were too heterogeneous.
Reduction in crying time
Seven studies reported data on crying time.32–34 36 37 45 46 There were sufficient data from four studies in the form of final value scores for the outcome of reduced crying time that could be meta-analysed for comparison of treatment effects. This replicated a previous meta-analysis.23 Our replicated meta-analysis (figure 2) gave a slightly different but still significant outcome for reduced crying time of −1.27 (95% CI −2.19 to −0.36) hours per day (figure 2). The difference is due to apportioned weighting given by the different versions of RevMan. One study37 used dimethicone as a comparison; the other studies’ controls were no treatment or placebo. We classified dimethicone as a placebo control (see figure 2). Parents were blinded to their child’s treatment in only two of the studies included in the meta-analysis.34 36
We were able to extract dichotomous data for adverse events and calculate RRs for meta-analysis (figure 3). Of the eight studies that reported presence or absence of adverse events,33 34 37–39 42 43 45 three studies reported there were no adverse events,38 42 45 two reported adverse events after manual therapy39 43 and three reported adverse events (worsening symptoms) in the control group.33 34 37
Using data from all the studies reporting adverse events, there were 1308 infants exposed to manual therapy and nine non-serious adverse events recorded, giving an incidence rate of seven non-serious events per 1000 infants. Conversely, there were 11 non-serious adverse events in the infants not exposed to manual therapy (n=97), giving an incidence rate of around 110 per 1000 infants.
Figure 3 shows the meta-analysis for the RCTs, which was possible for four studies.33 34 37 38 There was an overall RR of 0.12 (95% CI 0.12 to 0.66); that is, those who had manual therapy had an 88% reduced risk of having an adverse event compared with those who did not have manual therapy.
In this systematic review, we searched for both RCT and non-RCT evidence. We found seven RCTs and 12 non-RCTs investigating the effects of manual therapy on healthy but unsettled, distressed and excessively crying infants treated in primary care.
Using Brontfort et al’s approach to overall evidence rating we found: moderate strength evidence for a small positive effective of manual therapy on reduction in crying time, inconclusive evidence for sleep and parent–child relations and no effects for global improvement (table 3).
Previous systematic reviews from 2012 and 201423 52 concluded there was favourable but inconclusive and weak evidence for manual therapy for infantile colic. Since 2014, two new RCTs have been published: one pilot study RCT (n=18)35 and one high-quality RCT (n=97)38 but neither presented new data on crying time for the meta-analysis. These two new RCTs blinded the parents to treatment, but they reported outcomes on feeding and global improvement and parent–child relations, respectively. This meant we were unable to update the meta-analyses conducted by Dobson et al.23
We considered all methodological study types narratively and looked at: direction of effect, quality of the study and results presented (table 3). However, because the low quality studies were so methodologically flawed, we did not include their results in the final analyses (this indicates a need for more scientific rigour in this field of research). We were still able to review the effects of manual therapy on multiple outcomes in 12 of our 19 selected studies. With the exception of reduced crying time, the findings were inconclusive, and the absence of effect shown for global improvements might suggest that the reduction in crying time of just over 1 hour was not sufficient enough to be meaningful for parents.
We anticipated that there would be more measurement of outcomes related to parent satisfaction and confidence or parent–child relations, but only five studies reported these outcomes.33 35 46 49 50 This paucity of information about the reciprocity of parent–infant psychosocial development indicates a gap in the literature considering the importance of the parent–infant dyad in positive bonding53 and the relationship between parent mood and psychosocial development of infants.54–57
Results in context with other research
The Cochrane review by Dobson et al 23 included two studies that we excluded because they were not peer reviewed: one a master’s thesis58 and one from conference proceedings.59 We repeated the Dobson et al sensitivity meta-analysis for peer-reviewed studies only, using their imputed SD for one study.36 The data extracted were the same, but the meta-analysis results were slightly different due the different versions of RevMan assigning different weights (we used RevMan V.5.3, while Dobson et al used RevMaN V.5.1). Both showed a significant reduction in the weighted mean difference of just over 1 hour in daily crying time (−1.01 hours (95% CI −1.78 to –0.24)23 vs −1.27 hours (95% CI −2.19 to –0.36)). As mentioned above, whether this reduction of around 1 hour of daily crying is meaningful to parents remains to be answered.
The I2 statistic in our meta-analysis and Dobson et al’s23 were 69% and 55%, respectively, indicating heterogeneity between the studies analysed. This was not unexpected due to the potential variation in treatments (and hence effects), loose diagnostic criteria and the power of the samples for the RCTs. Therefore, the results have to be considered with caution and are likely to change with further research. The meta-analysis helps illustrate and indicate that future research in this field requires well-powered studies, flexible but protocolised treatment and parental blinding.
Dobson et al 23 conducted a sensitivity meta-analysis to explore parent blinding to their infant’s treatment (Miller et al 34 and Olafsdottir et al 36), and interestingly, their results showed that there was no difference in crying time between groups with blinding.
Our searches also revealed 19 references to other systematic reviews of manual therapy paediatric care for conditions that were not the focus of our review, for example, otitis media, asthma, cerebral palsy and motor development. Our review draws similar conclusions to these other reviews; that is, more high-quality RCTs are needed, but methodological problems with research in this field might preclude researchers taking on this challenge. The gold standard to test effectiveness is the RCT, but double-blinding is not possible (one cannot blind the treating therapist) and some parents are reluctant to blinding and being separated from their child. Other issues particular to allied, complementary and alternative therapies include: loose definitions and diagnostic criteria, describing and/or protocolising interventions that are bespoke and determining the active elements of these multicomponent interventions. These problems are further compounded by the self-limiting nature of many childhood conditions.
These methodological issues may help explain the equivocal findings, small numbers recruited and low-quality assessments presented in systematic reviews.
Data about non-specific effects of treatment, such as the impact of care on parental confidence, and clinician reassurance were not found, possibly because these are difficult to assess as direct, indirect or independent of the study intervention. In one study we reviewed,36 all infants and parents received the same support, advice and non-manual therapy care. They found no difference in outcomes between the group who had manual therapy in addition, and both groups improved over time. The authors of this study suggested that the counselling, support and natural progression of the condition played a more powerful role than the manual therapy.
It remains unclear what the active components of a manual therapy consultation are, but we suggest that it would be valuable to understand why parents seek manual therapy care, despite the presence of other healthcare providers.
The safety data we extracted regarding adverse events indicated that manual therapy is a relatively low risk intervention, reflecting similar findings in other studies.24 The definitions of adverse events recorded in the studies reviewed ranged from ‘worsening symptoms’ to seeking other forms of care: a comprehensive prospective cohort study specifically focused on adverse events in children is necessary to draw better conclusions.
Strengths and limitations
This was a comprehensive and rigorously conducted review that included studies in all languages, including a growing number of articles published from China (titles and abstracts were in English for indexing). There was one Chinese paper that was selected for full paper review. We translated this article, but we were unable to fully interpret and understand the treatment given and the outcomes that related to Chinese Traditional Medicine energy points.51 In other words, the therapeutic paradigm presented was beyond our knowledge from a Western medicine perspective.
Inclusion criteria were specific to our population of interest, that is, thriving infants who were inexplicably unsettled, distressed and excessively crying who were treated in primary care. This symptom-based approach to selection permitted the inclusion of studies relating to various diagnoses, for example, breastfeeding, gastric and behavioural problems. However, this latitude could also be interpreted as a weakness, since definitions of unsettledness, distress and excessive crying and otherwise healthy were not always clear. Perhaps a more stringent, universally accepted definition of ‘colic’ is required. We may have failed to include some studies due to the authors’ descriptions of their populations.
Outcomes for parental satisfaction and confidence were under-researched, and we did not find much data about these. Collecting parent outcomes may provide more informative data about the active components of care.
A well-powered RCT with parental blinding, blinded assessment of reported outcomes, testing both non-specific and manual therapy effects of manual therapist care is needed to supplement research in this area.
We found moderate favourable evidence for the reduction in crying time in infants receiving manual therapy care (around 1 hour per day), but this may change with further research evidence. We still do not know if this result is meaningful to parents or if the reduction is due to the manual therapy component of care or other aspects of care. For other outcomes, the strength of evidence was low and inconclusive.
Contributors DC conceptualised and designed the study, contributed to the data selection, extraction and analysis, drafted the initial manuscript, reviewed and revised the manuscript and approved the final manuscript submitted. CM managed the data, contributed to the data selection and extraction and did the meta-analyses, reviewed and revised drafts of the manuscript and approved the final manuscript submitted. AP and JE contributed to the data selection and extraction, reviewed and revised drafts of the manuscript and approved the final manuscript submitted. All authors approved the final manuscript as submitted and agree to be accountable for all aspects of the work.
Funding This research received no specific grant from any funding agency in the public, commercial or not-for-profit sectors.
Competing interests CM had financial support from the National Council for Osteopathic Research from crowd-funded donations.
Patient consent Not required.
Provenance and peer review Not commissioned; externally peer reviewed.
Data sharing statement Full datasets, analyses and all full searches are available on request from the corresponding author at email@example.com. No individual patient level data was used in this study.