

Diagnosis and management of polycystic ovary syndrome in the UK (2004–2014): a retrospective cohort study
Tao Ding (1), Gianluca Baio (1), Paul J Hardiman (2), Irene Petersen (3), Cormac Sammon (3)

  1. Department of Statistical Science, University College London, London, UK
  2. Institute for Women's Health, University College London Medical School, London, UK
  3. Department of Primary Care and Population Health, University College London, London, UK

Correspondence to Tao Ding; tao.ding.11{at}


Objective To estimate the incidence and prevalence of polycystic ovary syndrome (PCOS) in UK primary care and investigate prescribing patterns before and after a PCOS diagnosis.

Design Retrospective cohort study.

Setting UK primary care (2004–2014).

Participants Women aged 15–45 years.

Primary and secondary outcome measures The incidence and prevalence of diagnosed PCOS and probable PCOS (ie, those without a confirmed diagnosis but with at least 2 PCOS features recorded within 3 years). Among women with diagnosed or probable PCOS, the prevalence of prescribing of drugs typically used to treat PCOS was calculated prior to and in the 24 months after the diagnosis of PCOS.

Results We identified 7233 women with PCOS diagnoses and 7057 women with records suggestive of probable PCOS, corresponding to incidence rates of 0.93 and 0.91 per 1000 person-years at risk (PYAR) and an overall rate of 1.84 per 1000 PYAR. Women aged 20–24 years and women living in deprived areas had the highest incidence of PCOS. The prevalence of PCOS in 2014 was ∼2%. The proportion of women with a prescription in the 24 months after their PCOS index date varied by drug type: 10.2% metformin, 15.2% combined oral contraceptives, 18.8% acne-related treatments, 1.93% clomiphene, 1.0% spironolactone, 0.28% cyproterone and 3.11% eflornithine. Acne-related treatments were more commonly used to treat probable (28.3%) than diagnosed (12.3%) cases, while metformin was prescribed much more commonly in diagnosed cases.

Conclusions Compared with rates estimated in community samples, the incidence and prevalence of women presenting in primary care with PCOS diagnoses and features are low, indicating that PCOS is an under-recognised condition. Although considerable variation is observed in treatments prescribed to women with PCOS, the treatments initiated following a confirmed diagnosis generally reflect the long-term prognostic concerns raised in PCOS consensuses.


Strengths and limitations of this study

  • This study is the first to investigate the incidence and prevalence of polycystic ovary syndrome (PCOS) in the primary care setting in the UK. The longitudinal nature of the database allowed us to examine trends over a long time period, which has not been captured by previous epidemiological studies.

  • Underdiagnosis was the main concern, as clinicians record only the data considered relevant at the time of a consultation; we attempted to address this by including women with two or more features of PCOS as potential cases.

  • Our study investigated prescribing patterns for PCOS in UK primary care, which have not been well explored in previous studies. These findings reflect the current management of PCOS in clinical practice and provide important indications for general practitioners.


Introduction

Polycystic ovary syndrome (PCOS) is associated with a wide range of reproductive, cardiometabolic and dermatological abnormalities. One of the most prominent symptoms in patients with PCOS is oligomenorrhea. Consequently, women with PCOS are highly likely to be infertile and potentially develop endometrial hyperplasia due to continuing secretion of oestrogen without ovulation.1 ,2 Furthermore, emerging evidence has suggested that ∼50–70% of patients with PCOS have insulin resistance regardless of their body weight or body mass index.3 As a result, women with PCOS are at an elevated risk of developing various common metabolic disorders compared with the general population.4 In addition, many patients with PCOS are observed to have an elevated androgen level, which leads to hirsutism, alopecia and acne.1

While the epidemiology of PCOS in the community has been well studied,5–8 the proportion of women who present in routine clinical practice with PCOS features and the extent to which these women are subsequently diagnosed are less clear. Similarly, while a range of treatments have been suggested for the management of PCOS,9 there is very little information regarding which of these drugs are actually prescribed in routine clinical practice. Such ‘real-world evidence’ can help identify priority areas for research, training and health promotion efforts. The current study sought to provide such evidence by investigating the recording of PCOS features and diagnoses in UK general practice between 2004 and 2014 and the subsequent prescribing of pharmacological treatments.


Methods

Data source

The Health Improvement Network (THIN) is one of the largest primary care data sources in the UK, including data from over 500 general practices, covering ∼6.2% of the total population in the UK. Available data include patient demographics, medical history, test results, drug prescriptions and social deprivation as measured by quintiles of the Townsend score.10 Symptoms and diagnoses are recorded using a hierarchical clinical coding system (Read codes),11 with additional information recorded as unstructured text. The information stored as unstructured text was not available in this study. Notably, as the data are collected in routine clinical practice, only information deemed clinically relevant is entered in a patient's record.

In our study, data were included from each practice that met minimum quality criteria, such as acceptable computer usage and acceptable mortality reporting.12–14 Acceptable computer usage is the time point from which a practice is considered to use its computer system adequately, ie, at least one medical record, one additional health data record (such as body mass index or laboratory test results) and two therapy records are computerised annually. Acceptable mortality reporting is the time point from which the observed death rate for a practice reaches the number of deaths predicted from national statistics given the practice's demographics.

Study population

Women aged 15–45 years, who were permanently registered for at least 1 year, were included in the study population. Women with conditions that can cause similar symptoms to PCOS were identified and excluded. These conditions include prolactinoma, Cushing's syndrome, Nelson's syndrome, adrenal-related disorders (ie, adrenal tumours, adrenal hyperplasia) and pituitary disorders.

Case definition

PCOS cases were identified using two methods. First, Read codes for ‘polycystic ovary syndrome’ (C165.00), ‘Stein-Leventhal syndrome’ (C164.12) and ‘endoscopic drilling of ovary’ (7E25300) were used to identify those women who had been clinically diagnosed as PCOS cases (diagnosed cases). Women with two or more Read codes indicative of PCOS features (menstrual/ovarian dysfunction, clinical and biochemical hyperandrogenism, polycystic ovaries) recorded in a 3-year period were then selected and we considered these as probable cases. These women were considered as those who were likely to meet at least one of the three major definitions of PCOS15–18 but who may not have been clinically diagnosed as having the condition. The index date for probable cases was considered to be the date the second PCOS feature was recorded. A full list of the codes used to define cases is provided in online supplementary table SI.
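The probable-case rule described above can be illustrated with a minimal sketch. It assumes a hypothetical list of dated feature records for one woman, with each Read code already mapped to a feature category. The function name, the category labels, the 3 × 365-day window and the reading of 'two or more features' as two distinct feature categories are illustrative assumptions, not the authors' implementation (the original analysis was performed in STATA).

```python
from datetime import date

# Assumed length of the 3-year window, in days (illustrative choice).
WINDOW_DAYS = 3 * 365


def probable_index_date(records):
    """Return the index date for a probable case, or None.

    records: list of (date, feature_category) tuples for one woman.
    A woman counts as a probable case if two distinct feature categories
    are recorded within WINDOW_DAYS of each other; the index date is the
    date on which the second feature was recorded.
    """
    ordered = sorted(records)
    for i, (first_date, first_feature) in enumerate(ordered):
        for later_date, later_feature in ordered[i + 1:]:
            if (later_date - first_date).days > WINDOW_DAYS:
                break  # later records are even further away
            if later_feature != first_feature:
                return later_date
    return None


# Hypothetical example: two different features recorded 17 months apart.
example = [
    (date(2005, 1, 10), "menstrual dysfunction"),
    (date(2006, 6, 1), "hyperandrogenism"),
]
print(probable_index_date(example))
```

Women with a diagnostic code would be handled separately as diagnosed cases; this sketch covers only the feature-based rule.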

Covariates and prescription indicators

We extracted data on each woman's year of birth, ethnicity and the deprivation level of the area in which she lived.10 Data on prescriptions of interest were also included: combined oral contraceptives (COCs), progestin oral contraceptives (POCs), intrauterine devices, clomiphene, metformin, spironolactone, gonadotrophins, cyproterone, flutamide, eflornithine, weight control/loss drugs, lipid regulators and acne-related drugs. Information on prescribing of these drugs before and in the 24 months after each PCOS case index date was extracted.

Statistical analysis

For incidence estimation, the rate was computed as the total number of new PCOS cases recorded between 2004 and 2014 divided by the total number of person-years of follow-up. Person-time for the denominator was estimated by summing each woman's follow-up from the latest of (1) her 15th birthday, (2) 1 year after registration, (3) the date at which her practice met minimum quality criteria and (4) 1 January 2004, to the earliest of (1) her first incident diagnosis, (2) her date of death, (3) the date she left the practice, (4) the date data were last collected from her practice and (5) 31 December 2014. All incidence rates were reported per 1000 person-years at risk (PYAR).
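The person-time calculation can be sketched as follows. This is a minimal illustration under stated assumptions: the function and argument names are hypothetical, a year is taken as 365.25 days, and a 29 February date is shifted to 28 February when years are added. It is not the authors' STATA code.

```python
from datetime import date

STUDY_START = date(2004, 1, 1)
STUDY_END = date(2014, 12, 31)


def add_years(d, years):
    """Shift a date by whole years (29 Feb falls back to 28 Feb)."""
    try:
        return d.replace(year=d.year + years)
    except ValueError:
        return d.replace(year=d.year + years, day=28)


def person_years(birth, registered, practice_ok, exit_dates):
    """Person-years at risk contributed by one woman.

    Follow-up starts at the latest of: 15th birthday, 1 year after
    registration, the date the practice met quality criteria, and the
    study start. It ends at the earliest of the study end and any
    non-missing date in exit_dates (first diagnosis, death, leaving the
    practice, last data collection).
    """
    start = max(add_years(birth, 15), add_years(registered, 1),
                practice_ok, STUDY_START)
    end = min([STUDY_END] + [d for d in exit_dates if d is not None])
    return max((end - start).days, 0) / 365.25


def incidence_per_1000(cases, pyar):
    """Incidence rate per 1000 person-years at risk."""
    return 1000 * cases / pyar


# Hypothetical woman: registered June 2003, left the practice June 2006.
py = person_years(
    birth=date(1980, 5, 1),
    registered=date(2003, 6, 1),
    practice_ok=date(2002, 1, 1),
    exit_dates=[None, None, date(2006, 6, 1), date(2014, 6, 30)],
)
print(round(py, 2))
```

Summing `person_years` over all women gives the denominator, and the number of incident cases divided by that total gives the rate.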

Hierarchical (patients were considered to be nested in each practice) multivariate Poisson regression models were used to estimate incidence rate ratios and 95% CIs comparing the incidence of first PCOS diagnoses across 5-year age bands, Townsend score quintiles and calendar period (ie, 2004–2007, 2008–2011 and 2012–2014).

The period prevalence of the diagnosis of PCOS was evaluated for the calendar year 2014. The denominator for the prevalence calculation consisted of any women with at least 1 year of postregistration follow-up, of which at least 6 months must have occurred in 2014. The prevalence of PCOS was also estimated within 5-year age bands. Secondary analysis was carried out to assess the sensitivity of the prevalence estimate to the length of the postregistration period (ie, 1 year, 2 years) and the minimum period registered within 2014 (ie, 3, 6 and 9 months).
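A minimal sketch of the denominator rule for the 2014 period prevalence is given below. It assumes that follow-up runs from the registration date to a hypothetical exit date, that '1 year' means 365 days and that '6 months' means 183 days; these thresholds and all names are illustrative assumptions. The thresholds are exposed as parameters so that the sensitivity analyses (for example, 2 years of follow-up, or 3 or 9 months within 2014) can be mimicked.

```python
from datetime import date

YEAR_START = date(2014, 1, 1)
YEAR_END = date(2014, 12, 31)


def eligible_for_2014(registered, exit_date,
                      min_followup_days=365, min_days_in_2014=183):
    """True if a woman belongs in the 2014 prevalence denominator."""
    total_days = (exit_date - registered).days
    overlap_start = max(registered, YEAR_START)
    overlap_end = min(exit_date, YEAR_END)
    days_in_2014 = max((overlap_end - overlap_start).days, 0)
    return (total_days >= min_followup_days
            and days_in_2014 >= min_days_in_2014)


def period_prevalence(women, **thresholds):
    """Prevalence (%) of PCOS among eligible women, or None if nobody is eligible.

    women: list of dicts with keys 'registered', 'exit' and 'pcos'.
    """
    eligible = [w for w in women
                if eligible_for_2014(w["registered"], w["exit"], **thresholds)]
    if not eligible:
        return None
    cases = sum(1 for w in eligible if w["pcos"])
    return 100 * cases / len(eligible)
```

For example, `period_prevalence(women, min_days_in_2014=91)` would relax the within-year requirement to roughly 3 months.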

Among the women with a diagnosis of PCOS, we calculated the number and proportion with a prescription for one of the drugs of interest at any point prior to their PCOS index date. Among the women without prescription for each drug of interest prior to the index date, we then calculated the proportion of women with a prescription for that drug within 2 years after the PCOS index date. We used cumulative incidence plots to describe how the proportion initiating different drugs increased over the 2 years following the PCOS index date for the time period 2004–2012.
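The two-step calculation for a single drug can be sketched as below. The input layout and the function name are hypothetical, the 2-year window is taken as 730 days, and the sketch computes a simple proportion that ignores censoring, so it only approximates what a cumulative incidence plot would show at 24 months.

```python
from datetime import date

FOLLOW_UP_DAYS = 2 * 365  # assumed length of the 2-year window


def initiation_proportion(women):
    """Proportion of previously untreated women who start a drug.

    women: list of (index_date, prescription_dates) tuples for one drug.
    Women with any prescription before the index date are excluded;
    the remainder count as initiators if they have a prescription on
    the index date or within FOLLOW_UP_DAYS after it.
    Returns None if no woman is at risk.
    """
    at_risk = 0
    initiators = 0
    for index_date, rx_dates in women:
        if any(d < index_date for d in rx_dates):
            continue  # prior user: not at risk of initiating
        at_risk += 1
        if any(0 <= (d - index_date).days <= FOLLOW_UP_DAYS
               for d in rx_dates):
            initiators += 1
    return initiators / at_risk if at_risk else None


# Hypothetical cohort of three women for one drug.
cohort = [
    (date(2008, 3, 1), [date(2007, 1, 15)]),  # prior user, excluded
    (date(2008, 3, 1), [date(2008, 3, 31)]),  # initiates after 30 days
    (date(2008, 3, 1), []),                   # never prescribed
]
print(initiation_proportion(cohort))
```

Running the same function once per drug, and separately for diagnosed and probable cases, would reproduce the structure of the stratified comparison.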

All analyses were performed using STATA V.13.0 and were carried out for all PCOS cases and stratified by case definition (diagnosed PCOS vs probable PCOS).


Results

In total, 7233 diagnosed and 7057 probable PCOS cases were identified among 2 087 107 female individuals aged 15–45 years old between 2004 and 2014. Table 1 describes the number of PCOS features identified in each group. The incidence rate of diagnosed PCOS cases was 0.93 per 1000 PYAR (95% CI 0.91 to 0.96), whereas the rate for probable cases was 0.91 per 1000 PYAR (95% CI 0.89 to 0.93). This equated to an overall combined incidence rate of 1.84 per 1000 PYAR (95% CI 1.81 to 1.87).

Table 1

Number and proportion of diagnosed and probable cases with major PCOS features

The overall incidence of PCOS increased from 1.67 (95% CI 1.58 to 1.77) per 1000 PYAR in 2004 to 2.00 (95% CI 1.89 to 2.10) per 1000 PYAR in 2010, after which the rate remained relatively constant at ∼2 per 1000 PYAR (figure 1). The incidence was highest in the 20–24 year age group (3.59 per 1000 PYAR, 95% CI 3.47 to 3.70), whereas the 40–44 year age group had the lowest incidence (0.62 per 1000 PYAR, 95% CI 0.58 to 0.66). The age-specific trend of PCOS diagnoses was similar for diagnosed and probable cases. After adjusting for the effects of year and social deprivation, significant differences in the incidence of PCOS between age groups remained (table 2). In terms of social deprivation, the incidence of PCOS for individuals who were least deprived was 1.59 (95% CI 1.53 to 1.65) per 1000 PYAR, whereas among the most deprived, a rate of 2.23 (95% CI 2.15 to 2.32) per 1000 PYAR was estimated. This difference in rates remained statistically significant after adjusting for the effects of other covariates (ie, age and year) and after stratifying by case definition (table 2).

Table 2

Recorded rate of PCOS diagnoses by social and demographical characteristics

Figure 1

Time trends in PCOS diagnosis recorded (for diagnosed, probable and total cases). PCOS, polycystic ovary syndrome.

The overall prevalence of PCOS in 2014 was ∼2.27% (95% CI 2.23% to 2.31%), with a prevalence of 1.34% and 0.93% in diagnosed and probable cases, respectively. The age-specific prevalence peaked in the 30–34 year age group, and decreased for older age groups. Prevalence estimates were not sensitive to varying the postregistration period and the time period registered within 2014, remaining consistently ∼2%.

The proportion of women using one of the PCOS-related drugs before or after their index date varied widely across drug groups (see table 3). At the time of their PCOS index date, over 40% of women had previously been prescribed COCs, ∼30% had been prescribed acne-related drugs, >18% had been prescribed POCs and ∼18% had been prescribed at least one of the other drugs (table 3). Acne-related drugs, COCs and metformin were the most commonly used drugs in the 24 months after a PCOS record (table 3). Plots describing the cumulative incidence of women with a prescription for each drug type over the 24 months following their index date are provided in online supplementary figure SI. The plots show that while all drugs show an initial surge in prescribing on or just after the PCOS index date, this is greater for some drugs (eg, metformin, acne-related drugs) than for others (COCs and POCs).

Table 3

Number and percentage of women with PCOS prescribed relevant drugs prior to and following the diagnosis of PCOS

Prescription results stratified according to whether PCOS cases were diagnosed or probable are provided in online supplementary table SII. These results indicate that acne-related treatments and POCs were more commonly used to treat probable than diagnosed cases, while COCs, metformin, clomiphene, cyproterone, eflornithine and weight loss drugs were prescribed more commonly in diagnosed than probable cases. Cumulative incidence plots stratified according to whether a case was diagnosed or probable illustrate the differences listed in online supplementary table SII and further to this show that these differences are typically established on or immediately after the index date (figure 2).

Figure 2

Plots describing the cumulative incidence of women with a prescription for each drug type over the 24 months following their index date stratified according to whether the case was diagnosed (dashed line) or probable (solid line). Results shown for the eight most commonly prescribed drugs. COC, combined oral contraceptive; POC, progestin oral contraceptive.



Discussion

We present data on >14 000 potential PCOS cases among women aged 15–45 years in primary care across the UK between 2004 and 2014. Of these women, 50.6% (7233/14 290) had a PCOS diagnosis recorded, while 49.4% (7057/14 290) did not, corresponding to incidence rates of 0.93 per 1000 PYAR (95% CI 0.91 to 0.96) and 0.91 per 1000 PYAR (95% CI 0.89 to 0.93), respectively. The prevalence of PCOS in 2014 was ∼2.27%. There was considerable variation in the type of drug prescribed on the day of, or in the 24 months after, a PCOS diagnosis, and prescribing differed between diagnosed and probable cases.

Strengths and limitations

To the best of our knowledge, this study is the first to investigate the diagnosis and management of PCOS in the primary care setting in the UK. As THIN contains over 10 million patient records, our study is robust in terms of sample size. The longitudinal nature of the database also allowed us to examine trends over a 10-year study period, which has not been captured by previous epidemiological studies where individuals were often sampled at a single time point.

As our data were collected in routine clinical practice, our results reflect the true burden of PCOS on the healthcare system. However, this also means that only data considered relevant at the time of a consultation are recorded by clinicians. Consequently, it is unsurprising that only 8% of diagnosed cases had two or more PCOS features recorded: while the initial feature prompting referral is likely to be noted by the general practitioner (GP), once a PCOS diagnosis has been made by a specialist, a GP is unlikely to record anything other than the confirmed diagnosis in the coded record. The routine nature of data collection also meant that underdiagnosis was a concern in the current study and underestimation of PCOS rates was anticipated. We attempted to address under-reporting by allowing women with two or more features of PCOS (interfeature period within 3 years) to count as a PCOS case. However, the inclusion of probable cases may introduce case misclassification, as some probable cases may not be true PCOS cases. For example, while we considered women with a raised testosterone level to have hyperandrogenism, there are concerns surrounding the accuracy of testosterone testing.19 Conversely, it is also possible that many probable cases are true cases that simply do not have a diagnosis recorded in their medical records.

Incomplete patient history is a concern as women with prevalent PCOS diagnoses at the time of registration with a practice may not be identified, resulting in the underestimation of prevalence rates and the overestimation of incidence rates. The lack of information on ethnicity is also an issue, as the trends in incidence observed over age, deprivation and calendar year categories may be influenced by unobserved differences in ethnicity distributions across these covariates.

As we lack information on the indication for prescriptions, we cannot be certain that prescriptions issued after, or even on, the date of a PCOS record were prescribed for the treatment of PCOS. For example, ∼30% of the PCOS cases prescribed metformin had a prior diagnosis of type 2 diabetes, the approved indication for this drug. However, by excluding those ever prescribed metformin prior to their PCOS diagnosis from our calculations, we can be relatively confident that prescriptions for metformin issued on the date of a PCOS diagnosis are likely to be at least partly for the treatment of PCOS. Our confidence that the drug was prescribed for PCOS then decreases with increasing time after the PCOS diagnosis such that we expect the proportion of diagnosed cases prescribed metformin for PCOS to lie somewhere between the 8% with a prescription on the date of diagnosis and the 20% with a prescription on or in the 24 months after the date of diagnosis.

While we lacked information on prescribing outside primary care, the responsibility for prescribing treatments which have been initiated by specialists is likely to be transferred to an individual's GP a number of months after diagnosis. The use of a 24-month window to assess prescribing therefore allowed us to capture initiation of drugs prescribed in secondary care. This is reflected in the cumulative incidence curves shown in online supplementary figure SI where the drugs that are commonly initiated outside primary care (eg, spironolactone and clomiphene) are initiated further after the index date than the drugs typically initiated in primary care (eg, COCs and acne drugs). However, prescribing rates may be underestimated if care is not transferred to the GP within 24 months (eg, for budgetary reasons).

The proportion of women with PCOS who had been prescribed COCs before their PCOS diagnosis is similar to that reported by other studies.20 ,21 The proportion of women with PCOS who initiated metformin after their diagnosis in our study (over 10%) is comparable to that of a Danish study where 11.8% of the women with PCOS were identified as having received metformin.21 These comparisons support the validity of our prescribing data.


The prevalence of recorded PCOS in UK primary care in 2014 is comparable to that obtained from studies using databases in the USA (0.56–2.22%).22–24 However, the rates are significantly lower than those from epidemiological surveys in Europe, where systematic screening was often provided to identify cases from selected populations.25–28 This gap highlights the importance of improving awareness of PCOS among the public and GPs.

The incidence of PCOS increased slightly over the study period; however, no significant changes in yearly rates were observed. This might reflect the increasing awareness of the syndrome after the establishment of the Rotterdam and Androgen Excess Society criteria during the study period. However, it could also be due to the improvement of the database, that is, the completeness of medical recordings has improved over time.

It should be noted that the Townsend score represents the deprivation level of the area in which a woman lives. Women who lived in more deprived areas had a higher incidence of PCOS than those living in less deprived areas. A possible explanation is that obesity (a factor strongly associated with PCOS) is more prevalent among women living in more deprived areas. Alternatively, these women may consult their GP more frequently than those in less deprived areas for other morbidities (eg, type 2 diabetes), and therefore have more opportunity for PCOS to be diagnosed and recorded.

The fact that the incidence of probable PCOS cases was as high as the incidence of diagnosed cases indicates that there is a large group of women who present in primary care exhibiting two features of PCOS within a 3-year period but who do not have a subsequent PCOS diagnosis. While for some of these probable cases a PCOS diagnosis may not be relevant, it is likely that a considerable proportion of the women may meet the diagnostic criteria for PCOS and should therefore be referred for further assessment. Failure to refer such women may mean that they are not offered the lifestyle advice or medications that could reduce their risk of long-term PCOS-related complications.

Variation was observed in the treatments prescribed to diagnosed and probable PCOS cases; in particular, a greater proportion of diagnosed women received metformin prescriptions, while a greater proportion of the probable cases received treatment for the PCOS feature they presented with. This suggests that the diagnosed and probable cases are indeed receiving different care for their condition, with some probable cases not receiving potentially effective treatments such as metformin. The wide variation in prescribing patterns may also be due to the varied clinical presentation of PCOS, which differs not only between individuals but also by age. For example, young women consulting their GPs are more likely to ask for drugs to regulate their menses or to treat acne, whereas older women may initiate antidiabetic drugs to prevent rapid conversion to diabetes.

Metformin and oral contraceptives were the two drugs most commonly initiated in women with diagnosed PCOS, possibly reflecting the major concerns of long-term metabolic risks of this syndrome stated by the three PCOS consensuses. However, it is notable that even among the diagnosed PCOS cases, there is some variation in treatments prescribed following a diagnosis. This suggests that there may be a lack of consensus on the ideal treatment for the condition. This is supported by a recent survey of European endocrinologists which found variation in the treatments most commonly prescribed for PCOS.29 Further research into the comparative efficacy and effectiveness of the various PCOS treatment options may therefore be warranted.


Conclusions

The incidence of women presenting in primary care with PCOS diagnoses and features is low compared with rates estimated in community samples by most epidemiological surveys. Among the women who present, only 50% were observed to have a PCOS diagnosis recorded. Further work is therefore needed to inform women and healthcare professionals about the condition, to avoid any worsening of the disease or rapid conversion into other metabolic disorders, particularly considering the relatively low cost of diagnosis and high cost of care for the associated diseases suggested by Azziz et al.30 There is much potential for these treatments to prove cost-effective alternatives, which should be carefully considered by public healthcare providers, such as the National Health Service in the UK.

Although there is much variation in the treatments prescribed following a PCOS diagnosis, the widespread prescribing of oral contraceptives and metformin generally reflects the prognostic concerns raised in PCOS consensuses, aiming to reduce the future metabolic risks of patients with PCOS or patients who are undergoing treatment for PCOS and may already have developed metabolic disorders. Further work is needed to identify the most effective treatment for the condition.




  • Contributors TD, CS and IP were involved in the design of the study. TD and CS performed the analysis, and GB, IP and PJH helped with the interpretation. TD prepared the manuscript, and CS, GB, IP and PJH revised it as needed. All authors have read and approved the final manuscript.

  • Funding This research received no specific grant from any funding agency in the public, commercial or not-for-profit sectors.

  • Competing interests None declared.

  • Ethics approval THIN has overall ethical approval from the South East Multicentre Research Ethics Committee (reference number: 07/H1102/103).

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Data sharing statement No additional data are available.
