Assessing the quality of primary healthcare in seven Chinese provinces with unannounced standardised patients: protocol of a cross-sectional survey

Dong Roman Xu; Mengyao Hu; Wenjun He; Jing Liao; Yiyuan Cai; Sean Sylvia; Kara Hanson; Yaolong Chen; Jay Pan; Zhongliang Zhou; Nan Zhang; Chengxiang Tang; Xiaohui Wang; Scott Rozelle; Hua He; Hong Wang; Gary Chan; Edmundo Roberto Melipillán; Wei Zhou; Wenjie Gong

doi:10.1136/bmjopen-2018-023997

Article Text

PDF

XML

Health services research

Protocol

Assessing the quality of primary healthcare in seven Chinese provinces with unannounced standardised patients: protocol of a cross-sectional survey

http://orcid.org/0000-0001-7438-632XDong Roman Xu1,
Mengyao Hu2,
Wenjun He3,
Jing Liao1,3,
Yiyuan Cai1,3,
Sean Sylvia4,
Kara Hanson5,
Yaolong Chen6,
Jay Pan7,
Zhongliang Zhou8,
Nan Zhang9,
Chengxiang Tang10,
Xiaohui Wang11,
Scott Rozelle12,
Hua He13,
Hong Wang14,
Gary Chan15,
Edmundo Roberto Melipillán2,
Wei Zhou16,
http://orcid.org/0000-0002-7943-4041Wenjie Gong17

¹ Sun Yat-sen Global Health Institute (SGHI), School of Public Health and Institute of State Governance, Sun Yat-sen University, Guangzhou, China
² Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor, Michigan, USA
³ Department of Biostatistics and Epidemiology, School of Public Health, Sun Yat-sen University, Guangzhou, China
⁴ Department of Health Policy and Management, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
⁵ Department of Global Health and Development, Faculty of Public Health and Policy, London School of Hygiene and Tropical Medicine, London, UK
⁶ Evidence Based Medicine Center, School of Basic Medical Sciences, Lanzhou University, Lanzhou, China
⁷ West China School of Public Health, Sichuan University, Chengdu, Sichuan, China
⁸ School of Public Policy and Administration, Xi’an Jiaotong University, Xi’an, China
⁹ Department of Health Management, School of Health Management, Inner Mongolia Medical University, Hohhot, China
¹⁰ School of Public Administration, Guangzhou University, Guangzhou, China
¹¹ Department of Social Medicine and Health Management, School of Public Health, Lanzhou University, Lanzhou, Gansu, China
¹² Freeman Spogli Institute for International Studies, Stanford University, Stanford, California, USA
¹³ Department of Epidemiology, School of Public Health and Tropical Medicine, Tulane University, New Orleans, USA
¹⁴ Health Economics, Financing and Systems, Bill & Melinda Gates Foundation, Seattle, USA
¹⁵ Department of Biostatistics, University of Washington, Seattle, Washington, USA
¹⁶ Hospital Administration Institute, Xiangya Hospital, Central South University, Changsha, China
¹⁷ Xiangya School of Public Health, Central South University, Changsha, China

Correspondence to Professor Wenjie Gong; gongwenjie{at}csu.edu.cn

Abstract

Introduction Primary healthcare (PHC) serves as the cornerstone for the attainment of universal health coverage (UHC). Efforts to promote UHC should focus on the expansion of access and on healthcare quality. However, robust quality evidence has remained scarce in China. Common quality assessment methods such as chart abstraction, patient rating and clinical vignette use indirect information that may not represent real practice. This study will send standardised patients (SP or healthy person trained to consistently simulate the medical history, physical symptoms and emotional characteristics of a real patient) unannounced to PHC providers to collect quality information and represent real practice.

Methods and analysis 1981 SP–clinician visits will be made to a random sample of PHC providers across seven provinces in China. SP cases will be developed for 10 tracer conditions in PHC. Each case will include a standard script for the SP to use and a quality checklist that the SP will complete after the clinical visit to indicate diagnostic and treatment activities performed by the clinician. Patient-centredness will be assessed according to the Patient Perception of Patient-Centeredness Rating Scale by the SP. SP cases and the checklist will be developed through a standard protocol and assessed for content, face and criterion validity, and test–retest and inter-rater reliability before its full use. Various descriptive analyses will be performed for the survey results, such as a tabulation of quality scores across geographies and provider types.

Ethics and dissemination This study has been reviewed and approved by the Institutional Review Board of the School of Public Health of Sun Yat-sen University (#SYSU 2017-011). Results will be actively disseminated through print and social media, and SP tools will be made available for other researchers.

standardized patients
unannounced standardized patients
quality of primary health care
patient-centered care

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.

https://doi.org/10.1136/bmjopen-2018-023997

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Strengths and limitations of this study

We will assess the quality of care with a random sample of primary healthcare providers in seven provinces in China.
We will use unannounced standardised patients (USPs), the ‘gold standard’ of quality assessment.
Both technical quality and patient-centredness will be assessed.
USPs are not suitable for certain health conditions.
The seven provinces are not randomly selected, although we intend for them to represent different health development conditions (using life expectancy as the proxy) in China’s provinces.

Background

In 2015, all 191 member states of the United Nations adopted the sustainable development goals, aiming to achieve universal health coverage (UHC)—access to high-quality healthcare services without incurring financial hardship—by 2030.1 As previous literature emphasised, efforts to promote UHC should focus on the expansion of access and on healthcare quality.2 Healthcare quality is variously defined by the WHO as the ‘responsiveness’ of the healthcare system to meet desired health outcomes,3 as the instrumental goals on structure, process and outcome in the Donabedian framework,4 and as the six comprehensive aims (effectiveness, efficiency, equity, patient-centredness, safety and timeliness) put forth by the Institute of Medicine (IOM).5 In this study, we adopt the IOM definition of quality.

Primary healthcare (PHC) serves as the cornerstone for the attainment of UHC.6 China’s latest round of healthcare reform since 2009 has invested heavily in strengthening PHC. There have been some efforts to assess the quality of PHC in China: patients were interviewed with a Primary Care Assessment Tool questionnaire in Guangdong, Shanghai and Hong Kong7–9; comprehensiveness of the service provision was used as a proxy for quality through clinician interviewing10; and PHC clinicians’ adherence to clinical guidelines was assessed with a self-report questionnaire.11 However, assessment of the quality of PHC has largely remained scant in China, and the assessment tools are indirect and prone to bias.12 A number of studies have found the quality of PHC to be low in other low-income and middle-income countries (LMICs),6 13–18 where robust evidence remains scarce.19 Commonly used methods of measuring technical quality of care include chart abstraction, patient rating of care and using a clinical vignette to test clinician knowledge. Those methods use indirect information that may not represent real practice. This study instead will use unannounced standardised patients (USPs) to measure the quality of real practice. The standardised patient (SP) is a healthy person (or occasionally a real patient) trained to consistently simulate the medical history, physical symptoms and emotional characteristics of a real patient. The SP, particularly when their visit is unannounced, has several reported advantages: (1) reliability in measurement and cross-provider comparison because the same patient is presented to all providers, (2) elimination of the Hawthorne effect (ie, that the study itself may change doctors’ behaviour) due to the nature of disguised and unannounced visit by SPs,20–22 and (3) reduced recall bias.23 24

Despite these advantages, the application of SP in China has been concentrated mainly in the area of medical education.25 An ongoing systematic review identified only four papers on using the SP for quality assessment in China14 26–28 and 44 in other LMICs. Those projects, often based on a small convenience sample, tended to target a limited number of conditions (approximately 70% on family planning services, childhood infectious diseases, sexually transmitted infections and respiratory tract infections). In this study, we intend to assess the quality of PHC with a probability sample of PHC visits in seven Chinese provinces, using USPs for 10 commonly seen conditions in the PHC setting. The project has involved 20 universities across 19 provinces in China, as well as researchers from Nepal, USA and UK in a USP network (https://www.researchgate.net/project/Unannounced-Standardized-Patient-USP-and-Virtual-Patient-VP-to-Measure-Quality-of-Primary-Care). The USP resources will be pooled and shared widely within the network first and then with the general public. This study is the first of a series of studies to be based on quality data collected using USPs. The primary purpose of this study is to collect and present descriptive data on the quality of China’s PHC. We are developing separate protocols for the various hypothesis-driven studies, which will be available elsewhere and from our network website.29

Methods

Survey design

The purpose of the sample design is to create a representative sample of China’s PHC providers so that healthcare quality can be assessed based on USP visits to those providers.

Survey population/frame

We considered creating a nationally representative probability sample, but at this stage we have selected seven provinces to ‘represent’ China due to feasibility considerations. These provinces represent five levels of average life expectancies across China’s provinces (figure 1), which are similar to those of five countries with low-income to high-income levels.30 We intend to create a probability sample that represents PHC in these seven provinces. For the survey population, we intend to include (1) licensed physicians and licensed assistant physicians at community/township health centres/stations and urban health stations; (2) certified village doctors (a terminology in China that refers to village clinicians who have village-level practice privilege even without a medical licence) and village sanitarians (referring to uncertified village doctors who are supposed to work under the supervision of the village doctor) at village clinics; and (3) clinicians with a licence notation for general practice, internal medicine, obstetrics/gynaecology and paediatrics at the level I and level II hospitals and the maternal and childcare centres. We exclude level 3 hospitals, which provide more specialised care, and specialty hospitals. Clinicians meeting those criteria will constitute the ‘sampling frame’.

Figure 1

Seven selected sample provinces on the map of China with referencing countries of equivalent life expectancy in brackets. The figure is adapted from the paper by Liao et al 29. Permission to use has been obtained.

Sampling procedures

The sample will be selected using a multistage, clustered sample design covering all eligible clinicians in the seven provinces (figure 2). In the first stage, stratification will be based on the provinces. Due to the high number of visits in the seven capital cities, we will sample each capital city. Each province is thus divided into two strata consisting of the provincial capital city and other prefecture-level municipalities, leading to 14 strata in total. We will use proportionate allocation (in terms of the number of eligible clinicians) of the sample size for each stratum. For each stratum, five rural townships or urban subdistricts (the primary sampling unit [PSU]) will be selected using probability proportional to size (PPS). In the second stage, for each PSU, PHC facilities as previously defined (secondary sampling unit [SSU]) will be selected using PPS systematic sampling. Neighbouring village clinics will be grouped as an SSU. The number of SSUs for each stratum will vary depending on the size of the stratum—for example, more SSUs will be selected in strata with more PHC clinicians. In the final stage, a fixed number of USP visits will be made to each selected facility or the group of facilities in the case of village clinics. The exact number of visits will be determined once we obtain and examine our sampling frame.

Figure 2

Sampling procedure. PSU, primary sampling unit; SSU, secondary sampling unit; USP, unannounced standardised patient.

Sample size calculation

The sample size was calculated for the primary purpose of the standard descriptive survey analysis of this survey. The sample size (power) calculation for other related hypotheses of related studies will be described in separate study protocols. The primary statistic of interest in this survey is a latent variable measuring clinicians’ quality, constructed using the two-parameter logistic item response theory (IRT) model.31 32 The model was based on a list of quality checklist items measuring whether doctors asked recommended questions and whether they performed recommended exams (see the Scoring methods section below). Survey sample size was calculated based on the desired level of relative precision (coefficient of variation, CV), an estimate for the population element variance for the variable of interest ( ) from previous study and design effect ( ). In this study, our desired level of relative precision (CV) is 0.08. was estimated to be 4.54, based on Sylvia et al’s14 27 work on the USP-assessed quality of PHC in three Chinese provinces. Design effect is the variance inflation due to cluster sampling. This figure was calculated based on intraclass correlation (ICC) (describing the level of homogeneity of the units in a cluster) and cluster sample size: , where δ is the ICC and n is the average size of the cluster. The ICC of 0.0486 was also estimated from Sylvia et al’s work. Our estimated average cluster size is 27 clinician–SP encounters per PSU. Accordingly, we calculated the total required sample size to be 1981 clinician–SP encounters. The steps taken to calculate the sample size can be found in online supplementary appendix 1.

Supplemental material

[bmjopen-2018-023997-supp1.pdf]

USP case development and implementation

The development process of a USP case is based on our extensive literature review20 33 as well as our own USP experiences in Shaanxi Province, China.14 27 We are concurrently developing smartphone-based virtual standardised patients (VPs) (details described elsewhere). The two projects will share almost identical case scenarios and quality criteria.

Case selection

Our purpose is to select 10 health problems as tracer conditions for PHC in China. Ideally our selected cases should (1) be highly prevalent in PHC settings; (2) carry challenging features in different aspects of PHC (eg, some cases focus on curative care, while others on prevention, disease management, culturally sensitive care34 or misuse of low-value tests35–37); (3) not involve invasive and painful procedures; and (4) not require physical signs that cannot be simulated (eg, jaundice can be simulated with make-up, but heart murmurs cannot).23 We created a list of the top 30 conditions commonly seen in PHC in China, combining the results of two national surveys on PHC.12 A panel of physicians and public health and health system researchers then applied the principles above and selected a dozen of PHC problems for USP development (table 1). Ten final conditions will be selected from this list.

View this table:

Table 1

Selected candidate conditions

Development team

We have created an overall development team and 10 case-specific development teams. Each team includes case-specific specialists, general practitioners, and public health and health system researchers (online supplementary appendix 2). A third overall panel consisting of primary care providers at the village, township and community levels will review all cases for contextual appropriateness in primary care settings. In developing the case, we will follow several principles: (1) limiting case scenarios to those that require definitive clinician action on the first visit to minimise potential ‘first-visit bias’,38 (2) focusing on the presentation of symptoms for which evidence is well established for diagnosis and management, and (3) deriving some content of the cases from the actual case history of relevant patient files in real practice.23

Case description

The case description describes the relevant clinical roles and psychosocial biographies of the SP.39 We used a structured description of the cases as follows:

Social and demographic profile: (1) socioeconomic information: name, gender, age, ethnicity, education, occupation, family structure (eg, married and have two children but live alone), dress style (eg, dressed in jeans, work boots and a well-worn but neat sweater), health insurance or other social programme participation; (2) personality that may influence interaction with the clinician (eg, non-proactive and introverted); and (3) lifestyle relevant to health (eg, smoked one pack of cigarette since age 18, like fried pork but also eat much fruit, exercise regularly, watch television a lot during spare time, play mah-jong with friends and visit children every week).
Medical history: (1) disease information: severity of the condition (eg, mild or severe depression), duration of the condition (the first onset? previously diagnosed/existing [how long?]), comorbidity (any other physical and/or psychological problems?); (2) reason for seeking care for this specific visit (eg, was feeling down for 2 months but depression worsened last week); and (3) treatment/management already or currently received (eg, a ‘patient’ with diabetes took metoprolol for hypertension but does not monitor his glucose/watch his diet/weight).
Physical examination: symptoms the SP will (and will not) portray (eg, reduced appetite, but not showing agitation), and medical signs the SP has or does not have (eg, heart murmur).
Laboratory and imaging: laboratory and imaging that a clinician may prescribe for the SP. The laboratory and imaging results of the SP may be generated from those of real typical patients.
Diagnosis: the correct diagnosis that the clinician should make based on the information presented by the SP.
Treatment and management: the decision of the clinician on what medications, procedures, advice or referral will be given at the end of the consultation.

Script

Corresponding to the six components of the aforementioned case description, we will develop a detailed script for the SPs to use in their PHC visit with the clinician. The script ideally should cover all possible questions a clinician may ask, as well as the SP’s answers during the clinical interaction. Panels of clinicians will be consulted to collect relevant questions that will guide the development of the script. The script will continue to add new questions asked by the clinicians on the SP–clinician interaction. The script will have five sections: (1) an opening: spontaneous information given to the clinician at the start (eg, Doctor, I have had a headache for 2 days), (2) the information given only on request, (3) the information for the SP to volunteer even if not asked, (4) the language to insist on a diagnosis if not given and (5) an ending.14 20 40

Quality checklist

The checklist consists of explicit quality criteria for gathering data on patient history, physical examination, laboratory/imaging, diagnosis and treatment.14 33 Based on our comprehensive review of 14 articles on literature and evidence-based clinical guideline development methodology,41 we have established a guiding principle and standard protocol for checklist development. Our process will (1) be evidence-based and augmented by expert opinion,42 (2) follow a systematic procedure to gather, evaluate and select evidence and criteria, (3) select criteria related to clinician actions that the SP can easily evaluate,43 and (4) keep the number of checklist items under 30 to include high-priority criteria only so that the SP can reliably recall clinician behaviour.43–45 The details of our checklist development protocol will be described in a separate paper, and key messages are summarised in online supplementary appendix 2.

Selecting and training SPs

We will advertise on social media to recruit SPs. The candidate must be in stable health without confounding symptoms; should match the real patients in age, sex and physical features; are willing to allow the examinations appropriate to their condition; and have the intellectual maturity to present the behaviour of the actual patient and complete the checklist.23 46 47 We may consider recruiting real patients with stable conditions to portray the cases not subject to simulation.23 The training of the SP will aim at portraying the signs, symptoms and presentations, completing the checklist, and minimising detection by the provider.20 The week-long training will have three stages: classroom instruction, a dress rehearsal and two field tests.23 47 48 Each case will have three SPs who will be trained according to a standardised training manual that will be developed to guide the training and appraisal of the SPs.

Fielding and implementing SPs

A disguise plan will be developed for each case to minimise physician detection of the SP status (eg, convincing excuse for seeking care where they do not usually reside). In the pilot (instrument validation) phase, consent will be sought for audio recording (see below); in these cases, fieldwork will start only 3–4 weeks after consent is obtained. We will provide each SP with a calamity letter, explaining the project in case of their identity being exposed.

After the facilities are selected, and the number of visits per facility is determined, each of the planned visits will be given a unique identifier (eg, facility A-1, facility A-2, facility B-1), which will then be randomly ordered to form a random sequence numbered from 1 to 1981 consecutively. One of the ten SP cases will be randomly assigned to each number on this random sequence. The seven SPs per case will be dispatched to the seven provinces concurrently, one SP per province. If multiple clinicians are available in that facility at the time of a particular SP visit (PHC visits in China do not require appointments), the field coordinator will randomly select a clinician by drawing lots onsite. Each SP is expected to make a total of approximately 30 visits. We plan to complete those SP visits over a 3-month time span.

In a separate but related study, a week after the visit of the SP, the same clinician will perform the same consultation but with a standardised virtual patient on a smartphone.29 We will use this opportunity to administer a detection questionnaire to the clinician, asking whether they suspect they had any visit from an SP over the past week. The detected cases will be treated as missing data in the data analysis.

Variables

Outcome variables

We will collect a variety of quality of care information and other related explanatory variables. The IOM quality framework (effective, safe, patient-centred, timely, efficient and equitable) will be used for quality evaluation (table 2). Effectiveness (avoiding underuse and misuse) and safety (avoiding harm), traditional technical goals of quality of care, will be evaluated through the yes/no checklist discussed above (online supplementary appendix 2). Patient-centredness (respectful of and responsive to individual preferences) will be assessed by the Patient Perception of Patient-Centeredness (PPPC) Rating Scale.49–51 Using a 4-point Likert scale, the PPPC Rating Scale evaluates three dimensions of patient-centredness: exploring the disease and illness experiences, understanding the whole person and finding common ground.49 Prior studies have demonstrated the validity of SPs rating clinician communications.52 53 A separate study will be conducted to test the validity of the PPPC Rating Scale. Timeliness will be assessed by analysing opening hours, waiting time and consultation time.5 Efficiency (avoiding waste) will be measured by costs of care of the SP–clinician encounter. Equity of care (no variance in quality because of personal characteristics) will be assessed through a separate but related study in a randomised cross-over trial.

View this table:

Table 2

Variables

Scoring methods

Technical quality of care will be reflected by a continuous score ranging from 0 to 1. We will evaluate further whether to classify checklist items in four categories (essential, important, indicated and non-contributory) with corresponding numeric weights (3, 2, 1 and 0).54 Two scoring methods will be used: (1) the simple scoring method will use the formula of items performed divided by the total number of items on the checklist for the process scores, whereas (2) the complex method will use an algorithm based on the IRT.31 Using the IRT model approach, we can obtain a latent performance score for each doctor, which has been corrected for measurement error. An ordinal variable will be used for diagnosis and management plans (table 2), while patient-centredness will follow the scoring methods of the PPPC Rating Scale (possible range of score from 1 to 4).51

Other variables

We will collect additional information on the predictors, confounders and effect modifiers to the outcomes in the planned hypothesis testing of the related studies to this survey. The information will include qualifications of the clinician and facility information (environment, amenity, size, location, ownership type and so forth).

Analytical methods

USP validation

USP validation will be based on a convenience sample of clinicians not included in our final survey sample in the project training and pilot phase. Those SP–clinician interactions in the pilot will be audio-recorded and transcribed. Validity is the extent to which an instrument measures what it is supposed to measure. We will assess content, face and criterion validity of the cases. Content validity will be assessed by an expert panel who will use a 4-point Likert scale to evaluate the appropriateness of the written content of the cases that will include the scenario, scripts and checklists. For the checklist, they will be instructed to check the appropriateness against the published clinical guidelines. The face validity of the SP assessment depends on (1) the SP remaining undetected (detection ratio reported to be 5%–10%55), and (2) authentically and consistently portraying the clinical features of the case. We will send the participating clinician in the pilot a ‘detection form’ to report their degrees of suspicion of any SP visit.46 The authenticity of the SP presentation will be evaluated by checking the transcribed recording to discover whether a key piece of information was divulged by the SP when appropriately prompted, not divulged when prompted or volunteered when not prompted. Criterion validity will be assessed through the agreement of the SP-completed checklist against that completed by a clinician based on the transcript of the visit (ie, the clinician rating as the ‘gold standard’).56–59 Checklist items which depend on visual observation will be excluded. Reliability examines the level of consistency of the repeated measurements. The inter-rater reliability of two SPs on the same condition and context will be assessed with two SPs completing the checklist for the same recorded transcript. Test–retest reliability will be analysed by the concordance of assessment results of the same SP to score his or her own recorded encounter a month later.57 The agreement will be analysed with Lin’s concordance correlation coefficient (r_c).60 r_c indicates how closely pairs of observation fell on a 45° line (the perfect concordance line) through the origin in addition to their correlation.60–62 Bland-Altman plot will be used to visualise the concordance.63 64 Table 3 summarises our methods of validation.

View this table:

Table 3

Methods of validation for the USP cases

Survey analysis

We will focus on descriptive analysis to present the quality of PHC in the seven provinces. Hypothesis-driven analyses will be described in separate study protocols. For descriptive analysis, we will first present clinician and facility profiles in tables for all seven provinces and by each province. The clinician profile will include sociodemographic information (age, gender and ethnicity), professional qualification (general and medical education, licensure, and professional ranks) and service information (volume of visits and number of support personnel). The facility profile will include information on operation and management (years in operation, ownership types, accreditation, level of hospitals, affiliation with medical universities, revenue, health insurance contracting, payment methods), clinical services (annual number of inpatient and outpatient visits, number of clinical departments), personnel (number of physicians, nurses and attrition ratio) and equipment. Second, we will tabulate the results of overall quality and subdomains across administrative regions and provider types. Third, we will map out the locations of the facilities along with their quality scores with geospatial analytical tools. Finally, a t-test/Wilcoxon test or χ² test will be employed to compare quality differences between public versus private providers, primary care clinics/centres versus hospital outpatient services, care in rural versus urban areas, and across different conditions, clinician educational levels and payment mechanisms.

Related studies

This study protocol mainly deals with the descriptive analysis and presentation of the data to be collected by the USPs. Using the USP survey data, we have planned several related studies that will be covered by separate study protocols with details on the background, theoretical framework and analytical methods. To summarise those related studies, we will assess (1) the effect of ownership types of the PHC providers (ie, private vs public) on the quality of PHC (study protocol under revision), (2) the know-do gap between the assessment results by a smartphone-based VPs and USP (protocol already published),29 (3) the effect of using smartphone-based virtual patient in improving clinician performance, (4) the effect of types of insurance carried by a patient on quality of care, (5) the impact of gatekeeping by primary care providers on quality of tuberculosis care—a mathematical modelling study, and (6) clinician skills in handling low-value or harmful patient-requested services, particularly antibiotics and some processed traditional Chinese medicine.

Ethics and dissemination

USP studies do not necessarily require consent if they meet certain conditions.65 66 Our waiver has been granted for the following reasons: (1) our study serves important public good, while requiring informed consent may lead to considerable selection bias and greater risk for the detection of the SP; (2) this study does not intend to entrap or reveal identities of any institution or individual, and all analyses will be conducted at the broader health system level (after data cleaning, all individual identifiers will be destroyed); and (3) no audiovisuals will be recorded during the SP–clinician encounter (however, in the pilot stage, we will seek informed consent from participating clinicians as we will use a disguised recording for the validation purposes). The study results will be widely distributed in the form of scientific papers and policy briefs. The data generated from this project and the USP cases and accompanying user manuals will be made available to other researchers on request after we complete our primary analysis.

Patient and public involvement

We selected the conditions for the USP partly based on results from surveys on common conditions in the context of PHC as reported by patients. The USP cases will also be reviewed by a panel that includes patients. The results of the studies will be widely distributed in scientific reports as well as social media to benefit policymakers, clinicians and patients.

Discussion

In this study, we will develop, validate and implement methods of assessing the quality of PHC using USPs. Compared with existing studies using USPs,33 this proposed study has several distinctive features. First, we will establish a large probability random sample so that representative estimates of PHC quality can be achieved in the chosen seven provinces in China. Second, unlike previous studies,14 27 we include village clinics, township health centres and community health centres, and also county hospitals and other level I and level II hospitals, in the study. The latter were not officially designated as PHC facilities in China but provided a substantial amount of PHCs. Third, 10 SP cases will be developed through a standardised process using the same template and methodology and will represent common conditions in PHC, while past studies often used two to three conditions.33 Fourth, an evidence-based systematic method will guide checklist development. In a review, only 12 out of 29 SP articles reported the procedures of checklist development and many checklists were developed by expert consensus only.54 Fifth, in addition to using the checklist to evaluate technical quality of care as performed in most other USP studies, we will assess patient-centredness with a global rating scale. Sixth, we have planned a series of related studies to address the quality of PHC in a concerted effort. Most noteworthy, we are developing 10 identical conditions as smartphone-based virtual patients to assess the competency of PHC providers. Seventh, we used the same case for all levels of providers, from village doctors to township health centres, to county hospitals, but quality checklists for process, diagnosis and treatment will be tailored to fit the expected roles and responsibilities of the different providers. Finally, we have secured the understanding and cooperation of the provincial health authorities.

We note two particular issues. In high-income settings, logistical arrangements for the SP are complex. A significant challenge is to introduce the SP into medical practice.23 47 48 However, in China and many other LMICs, enrolment with a clinician is not required, and a walk-in visit to clinicians without an appointment is commonplace. However, village doctors usually know their patients well. For these areas, the SPs in other studies pretended to be tourists or friends visiting the families in the village. We will try other pretences, such as a temporary poverty-relief worker who has just arrived in a nearby village. Those poverty-relief workers are common in remote rural areas in China. For the second issue, assessing quality with USP was reported to incur high cost in developed countries (estimated to be US$350–400 per visit).53 67 We expect the cost in China to be considerably lower due to the lower labour cost. We will collect detailed cost information to inform the future application of the USP.

The study has several potential limitations. Most important, even though the assessment of SP is considered the gold standard for measuring clinician performance, and in this study we have further expanded the use of SPs to evaluate other elements of quality in the IOM framework such as patient-centredness, timeliness and efficiency, we recognise that those quality of care elements are still largely clinician-related, and other important quality aspects such as the quality of laboratory testing cannot be assessed by our SPs. In addition, the USP method has several technical challenges. If healthy people are used to simulate the patient, it is difficult to achieve complete alignment of patient presentation of signs and symptoms (for instance, it is difficult to fake a sore throat). There are also challenges to obtaining fake laboratory test results that may be necessary for the diagnosis. Some clinical roles that require the SP to go through invasive investigation may also pose a problem. We will experiment with a real patient in stable conditions to resolve some of those challenges. Next, our judgement of the clinical quality through the first and only visit with the SP may lead to ‘first-visit bias’.38 The quality of care provided by a clinician who spreads his or her diagnosis and management over several visits may be underestimated. We try to minimise this bias by designing cases that require a definitive decision on the first visit. Last, even though we intend to select 10 tracer conditions in the context of PHC, we still need to be cautious in generalising the findings to the overall quality of PHC.

In conclusion, this proposed study may produce a set of validated tools for the assessment of the quality of PHC using USP and apply it to obtain valuable quality of care information on PHC in China.

References

1.↵
A/RES/70/1 R. Transforming our world: the 2030 agenda for sustainable development. 2018. http://www.un.org/ga/search/view_doc.asp?symbol=A/RES/70/1&Lang=E (accessed 17 Feb 2018).
2.↵
2. Hanefeld J ,
3. Powell-Jackson T ,
4. Balabanova D
. Understanding and measuring quality of care: dealing with complexity. Bull World Health Organ 2017;95:368–74.doi:10.2471/BLT.16.179309
OpenUrl
3.↵
2. Murray CJ ,
3. Frenk J
. A WHO framework for health system performance assessment: Evidence and Information for Policy: World Health Organization, 1999.
4.↵
2. Donabedian A
. The quality of care. How can it be assessed? 1988. Arch Pathol Lab Med 1997;121:1145.
OpenUrl PubMed Web of Science
5.↵
2. Pongsupap Y ,
3. Van Lerberghe W
. Choosing between public and private or between hospital and primary care: responsiveness, patient-centredness and prescribing patterns in outpatient consultations in Bangkok. Trop Med Int Health 2006;11:81–9.doi:10.1111/j.1365-3156.2005.01532.x
OpenUrl CrossRef PubMed Web of Science
6.↵
2. Bitton A ,
3. Ratcliffe HL ,
4. Veillard JH , et al
. Primary health care as a foundation for strengthening health systems in low- and middle-income countries. J Gen Intern Med 2017;32:566–71.doi:10.1007/s11606-016-3898-5
OpenUrl
7.↵
2. Wei X ,
3. Li H ,
4. Yang N , et al
. Changes in the perceived quality of primary care in Shanghai and Shenzhen, China: a difference-in-difference analysis. Bull World Health Organ 2015;93:407–16.doi:10.2471/BLT.14.139527
OpenUrl CrossRef PubMed
8.↵
2. Zou Y ,
3. Zhang X ,
4. Hao Y , et al
. General practitioners versus other physicians in the quality of primary care: a cross-sectional study in Guangdong Province, China. BMC Fam Pract 2015;16:134.doi:10.1186/s12875-015-0349-z
OpenUrl
9.↵
2. Feng S ,
3. Shi L ,
4. Zeng J , et al
. Comparison of primary care experiences in village clinics with different ownership models in Guangdong Province, China. PLoS One 2017;12:e0169241.doi:10.1371/journal.pone.0169241
10.↵
2. Wong WCW ,
3. Jiang S ,
4. Ong JJ , et al
. Bridging the gaps between patients and primary care in china: a nationwide representative survey. Ann Fam Med 2017;15:237–45.doi:10.1370/afm.2034
OpenUrl Abstract/FREE Full Text
11.↵
2. Zeng L ,
3. Li Y ,
4. Zhang L , et al
. Guideline use behaviours and needs of primary care practitioners in China: a cross-sectional survey. BMJ Open 2017;7:e015379.doi:10.1136/bmjopen-2016-015379
12.↵
2. Li X ,
3. Lu J ,
4. Hu S , et al
. The primary health-care system in China. The Lancet 2017;390:2584–94.doi:10.1016/S0140-6736(17)33109-4
OpenUrl
13.↵
2. Das J ,
3. Hammer J
. Quality of primary care in low-income countries: facts and economics. Annu Rev Econom 2014;6:525–53.doi:10.1146/annurev-economics-080213-041350
OpenUrl CrossRef
14.↵
2. Sylvia S ,
3. Shi Y ,
4. Xue H , et al
. Survey using incognito standardized patients shows poor quality care in China’s rural clinics. Health Policy Plan 2015;30:322–33.doi:10.1093/heapol/czu014
OpenUrl CrossRef PubMed
15.↵
2. Berendes S ,
3. Heywood P ,
4. Oliver S , et al
. Quality of private and public ambulatory health care in low and middle income countries: systematic review of comparative studies. PLoS Med 2011;8:e1000433.doi:10.1371/journal.pmed.1000433
OpenUrl CrossRef PubMed
16.↵
2. Das J ,
3. Holla A ,
4. Das V , et al
. In urban and rural india, a standardized patient study showed low levels of provider training and huge quality gaps. Health Aff 2012;31:2774–84.doi:10.1377/hlthaff.2011.1356
OpenUrl Abstract/FREE Full Text
17.↵
2. Das J ,
3. Gertler PJ
. Variations in practice quality in five low-income countries: a conceptual overview. Health Aff 2007;26:w296–309.doi:10.1377/hlthaff.26.3.w296
OpenUrl Abstract/FREE Full Text
18.↵
2. Das J ,
3. Hammer J ,
4. Leonard K
. The quality of medical advice in low-income countries. J Econ Perspect 2008;22:93–114.doi:10.1257/jep.22.2.93
OpenUrl PubMed Web of Science
19.↵
2. Coarasa J ,
3. Das J ,
4. Gummerson E , et al
. A systematic tale of two differing reviews: evaluating the evidence on public and private sector quality of primary care in low and middle income countries. Global Health 2017;13:24.doi:10.1186/s12992-017-0246-4
OpenUrl
20.↵
2. Glassman PA ,
3. Luck J ,
4. O’Gara EM , et al
. Using standardized patients to measure quality: evidence from the literature and a prospective study. Jt Comm J Qual Improv 2000;26:644–53.doi:10.1016/S1070-3241(00)26055-0
OpenUrl PubMed
21.↵
2. Leonard K ,
3. Masatu MC
. Outpatient process quality evaluation and the Hawthorne Effect. Soc Sci Med 2006;63:2330–40.doi:10.1016/j.socscimed.2006.06.003
OpenUrl CrossRef PubMed Web of Science
22.↵
2. McCambridge J ,
3. Witton J ,
4. Elbourne DR
. Systematic review of the Hawthorne effect: new concepts are needed to study research participation effects. J Clin Epidemiol 2014;67:267–77.doi:10.1016/j.jclinepi.2013.08.015
OpenUrl CrossRef PubMed
23.↵
2. Woodward CA ,
3. McConvey GA ,
4. Neufeld V , et al
. Measurement of physician performance by standardized patients. Refining techniques for undetected entry in physicians’ offices. Med Care 1985;23:1019–27.
OpenUrl CrossRef PubMed Web of Science
24.↵
2. Das J ,
3. Hammer J
. Money for nothing: the dire straits of medical practice in Delhi, India. J Dev Econ 2007;83:1–36.doi:10.1016/j.jdeveco.2006.05.004
OpenUrl CrossRef Web of Science
25.↵
2. Yu-jie Z ,
3. Min W ,
4. Qin L
. Analyze the development of standardized patient teaching in China by literature review in recent 10 years. Chin J Nurs 2009;44:259–61.
OpenUrl
26.↵
2. Currie J ,
3. Lin W ,
4. Zhang W
. Patient knowledge and antibiotic abuse: Evidence from an audit study in China. J Health Econ 2011;30:933–49.doi:10.1016/j.jhealeco.2011.05.009
OpenUrl CrossRef PubMed Web of Science
27.↵
2. Sylvia S ,
3. Xue H ,
4. Zhou C , et al
. Tuberculosis detection and the challenges of integrated care in rural China: A cross-sectional standardized patient study. PLoS Med 2017;14:e1002405.doi:10.1371/journal.pmed.1002405
OpenUrl CrossRef PubMed
28.↵
2. Li L ,
3. Lin C ,
4. Guan J
. Using standardized patients to evaluate hospital-based intervention outcomes. Int J Epidemiol 2014;43:897–903.doi:10.1093/ije/dyt249
OpenUrl CrossRef PubMed
29.↵
2. Liao J ,
3. Chen Y ,
4. Cai Y , et al
. Using smartphone-based virtual patients to assess the quality of primary healthcare in rural China: protocol for a prospective multicentre study. BMJ Open 2018;8:e020943.doi:10.1136/bmjopen-2017-020943
30.↵
2. Zhou M ,
3. Wang H ,
4. Zhu J , et al
. Cause-specific mortality for 240 causes in China during 1990–2013: a systematic subnational analysis for the Global Burden of Disease Study 2013. The Lancet 2016;387:251–72.doi:10.1016/S0140-6736(15)00551-6
OpenUrl
31.↵
2. Das J ,
3. Hammer J
. Which doctor? Combining vignettes and item response to measure clinical competence. J Dev Econ 2005;78:348–83.doi:10.1016/j.jdeveco.2004.11.004
OpenUrl CrossRef Web of Science
32.↵
2. Hambleton RK ,
3. Swaminathan H ,
4. Rogers HJ
. Fundamentals of item response theory. Sage, 1991.
33.↵
2. Rethans JJ ,
3. Gorter S ,
4. Bokken L , et al
. Unannounced standardised patients in real practice: a systematic literature review. Med Educ 2007;41:537–49.doi:10.1111/j.1365-2929.2006.02689.x
OpenUrl CrossRef PubMed Web of Science
34.↵
2. Kutob RM ,
3. Bormanis J ,
4. Crago M , et al
. Assessing culturally competent diabetes care with unannounced standardized patients. Fam Med 2013;45:400–8.
OpenUrl PubMed
35.↵
2. Fenton JJ ,
3. Kravitz RL ,
4. Jerant A , et al
. Promoting patient-centered counseling to reduce use of low-value diagnostic tests: a randomized clinical trial. JAMA Intern Med 2016;176:191–7.doi:10.1001/jamainternmed.2015.6840
OpenUrl
36.↵
2. May L ,
3. Franks P ,
4. Jerant A , et al
. Watchful Waiting Strategy May Reduce Low-Value Diagnostic Testing. J Am Board Fam Med 2016;29:710–7.doi:10.3122/jabfm.2016.06.160056
OpenUrl Abstract/FREE Full Text
37.↵
2. Zabar S ,
3. Hanley K ,
4. Lee H , et al
. Ordering of labs and tests: variation and correlates of value-based care in an unannounced standardized patient visit. J Gen Intern Med 2016;32:S318.
OpenUrl
38.↵
2. Tamblyn RM ,
3. Abrahamowicz M ,
4. Berkson L , et al
. First-visit bias in the measurement of clinical competence with standardized patients. Acad Med 1992;67:S22–4.doi:10.1097/00001888-199210000-00027
OpenUrl PubMed Web of Science
39.↵
2. Shepherd HL ,
3. Barratt A ,
4. Trevena LJ , et al
. Three questions that patients can ask to improve the quality of information physicians give about treatment options: a cross-over trial. Patient Educ Couns 2011;84:379–85.doi:10.1016/j.pec.2011.07.022
OpenUrl CrossRef PubMed
40.↵
2. Peabody JW ,
3. Luck J ,
4. Jain S , et al
. Assessing the accuracy of administrative data in health information systems. Med Care 2004;42:1066–72.doi:10.1097/00005650-200411000-00005
OpenUrl CrossRef PubMed Web of Science
41.↵
Organization WH. WHO handbook for guideline development: World Health Organization, 2014.
42.↵
2. Campbell SM ,
3. Braspenning J ,
4. Hutchinson A , et al
. Research methods used in developing and applying quality indicators in primary care. Qual Saf Health Care 2002;11:358–64.doi:10.1136/qhc.11.4.358
OpenUrl Abstract/FREE Full Text
43.↵
2. De Champlain AF ,
3. Margolis MJ ,
4. King A , et al
. Standardized patients’ accuracy in recording examinees’ behaviors using checklists. Acad Med 1997;72:S85–7.doi:10.1097/00001888-199710001-00029
OpenUrl PubMed Web of Science
44.↵
2. Vu NV ,
3. Steward DE ,
4. Marcy M
. An assessment of the consistency and accuracy of standardized patients’ simulations. J Med Educ 1987;62:1000–2.
OpenUrl PubMed Web of Science
45.↵
2. Vu NV ,
3. Marcy MM ,
4. Colliver JA , et al
. Standardized (simulated) patients’ accuracy in recording clinical performance check-list items. Med Educ 1992;26:99–104.
OpenUrl PubMed Web of Science
46.↵
2. Maiburg BH ,
3. Rethans JJ ,
4. van Erk IM , et al
. Fielding incognito standardised patients as ‘known’ patients in a controlled trial in general practice. Med Educ 2004;38:1229–35.doi:10.1111/j.1365-2929.2004.02015.x
OpenUrl PubMed
47.↵
2. Gorter SL ,
3. Rethans JJ ,
4. Scherpbier AJ , et al
. How to introduce incognito standardized patients into outpatient clinics of specialists in rheumatology. Med Teach 2001;23:138–44.doi:10.1080/014215931048
OpenUrl CrossRef PubMed Web of Science
48.↵
2. Siminoff LA ,
3. Rogers HL ,
4. Waller AC , et al
. The advantages and challenges of unannounced standardized patient methodology to assess healthcare communication. Patient Educ Couns 2011;82:318–24.doi:10.1016/j.pec.2011.01.021
OpenUrl CrossRef PubMed
49.↵
2. Oates J ,
3. Weston WW ,
4. Jordan J
. The impact of patient-centered care on outcomes. Fam Pract 2000;49:796–804.
OpenUrl Web of Science
50.↵
2. Hudon C ,
3. Fortin M ,
4. Haggerty JL , et al
. Measuring patients’ perceptions of patient-centered care: a systematic review of tools for family medicine. Ann Fam Med 2011;9:155–64.doi:10.1370/afm.1226
OpenUrl Abstract/FREE Full Text
51.↵
2. Brown J ,
3. Stewart M ,
4. Tessier S
. Assessing communication between patients and doctors: a manual for scoring patient-centred communication. London: Thames Valley Family Practice Research Unit, 1995.
52.↵
2. Ozuah PO ,
3. Reznik M
. Can standardised patients reliably assess communication skills in asthma cases? Med Educ 2007;41:1104–5.doi:10.1111/j.1365-2923.2007.02885.x
OpenUrl PubMed
53.↵
2. Zabar S ,
3. Ark T ,
4. Gillespie C , et al
. Can unannounced standardized patients assess professionalism and communication skills in the emergency department? Acad Emerg Med 2009;16:915–8.doi:10.1111/j.1553-2712.2009.00510.x
OpenUrl PubMed
54.↵
2. Gorter S ,
3. Rethans JJ ,
4. Scherpbier A , et al
. Developing case-specific checklists for standardized-patient-based assessments in internal medicine: a review of the literature. Acad Med 2000;75:1130–7.doi:10.1097/00001888-200011000-00022
OpenUrl CrossRef PubMed Web of Science
55.↵
2. Franz CE ,
3. Epstein R ,
4. Miller KN , et al
. Caught in the act? Prevalence, predictors, and consequences of physician detection of unannounced standardized patients. Health Serv Res 2006;41:2290–302.doi:10.1111/j.1475-6773.2006.00560.x
OpenUrl CrossRef PubMed Web of Science
56.↵
2. Swartz MH ,
3. Colliver JA ,
4. Bardes CL , et al
. Validating the standardized-patient assessment administered to medical students in the New York City Consortium. Acad Med 1997;72:619–26.doi:10.1097/00001888-199707000-00014
OpenUrl PubMed Web of Science
57.↵
2. Rethans JJ ,
3. Drop R ,
4. Sturmans F , et al
. A method for introducing standardized (simulated) patients into general practice consultations. Br J Gen Pract 1991;41:94–6.
OpenUrl Abstract/FREE Full Text
58.↵
2. Luck J ,
3. Peabody JW
. Using standardised patients to measure physicians’ practice: validation study using audio recordings. BMJ 2002;325:679.doi:10.1136/bmj.325.7366.679
OpenUrl Abstract/FREE Full Text
59.↵
2. Shirazi M ,
3. Sadeghi M ,
4. Emami A , et al
. Training and validation of standardized patients for unannounced assessment of physicians’ management of depression. Acad Psychiatry 2011;35:382–7.doi:10.1176/appi.ap.35.6.382
OpenUrl PubMed
60.↵
2. Lin LI
. A concordance correlation coefficient to evaluate reproducibility. Biometrics 1989;45:255–68.
OpenUrl CrossRef PubMed Web of Science
61.↵
2. Steichen TJ ,
3. Cox NJ
. A note on the concordance correlation coefficient. Stata J 2002;2:183–9.doi:10.1177/1536867X0200200206
OpenUrl
62.↵
2. Lawrence I ,
3. Lin K
. Assay validation using the concordance correlation coefficient. Biometrics 1992:599–604.
63.↵
2. Kwiecien R ,
3. Kopp-Schneider A ,
4. Blettner M
. Concordance analysis: part 16 of a series on evaluation of scientific publications. Dtsch Arztebl Int 2011;108:515.doi:10.3238/arztebl.2011.0515
OpenUrl PubMed
64.↵
2. Bland JM ,
3. Altman DG
. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1986;1:307–10.
OpenUrl CrossRef PubMed Web of Science
65.↵
2. Rhodes K
. Taking the mystery out of “mystery shopper” studies. N Engl J Med 2011;365:484–6.doi:10.1056/NEJMp1107779
OpenUrl CrossRef PubMed
66.↵
2. Rhodes KV ,
3. Miller FG
. Simulated patient studies: an ethical analysis. Milbank Q 2012;90:706–24.doi:10.1111/j.1468-0009.2012.00680.x
OpenUrl CrossRef PubMed Web of Science
67.↵
2. Weiner SJ ,
3. Schwartz A
. Directly observed care: can unannounced standardized patients address a gap in performance measurement? J Gen Intern Med 2014;29:1183–7.doi:10.1007/s11606-014-2860-7
OpenUrl CrossRef PubMed

Footnotes

Patient consent for publication Not required.
Contributors DRX conceived the project concept and developed the first protocol draft, along with WG. DRX, MH and WH developed the sampling design, and MH, WH and ERM wrote the section on samples and performed the sample size calculation. SS provided original data of the previous studies for the sample size estimation and calculated some summary statistics. JL and Y-LC worked on the SP case templates. Y-LC and XW developed the guideline for the development of the quality checklist. KH reviewed the content and edited the manuscript. HH and GC reviewed the statistical plan. SR, JP, HW, ZZ, CT, NZ and WZ reviewed and commented on the design and methods. All coauthors participated in the revision and approved this manuscript.
Funding This project is funded through the following competitive grants: China Medical Board (grant number: CMB GNL 16-260), National Natural Science Foundation of China (grant number: 81773446) and Young Scientists Fund of the National Natural Science Foundation of China (grant number: 81402690).
Competing interests None declared.
Ethics approval This study has received ethical approval from the institutional review board (IRB) of the Sun Yat-sen University School of Public Health with a waiver of informed consent from each participating clinician.
Provenance and peer review Not commissioned; externally peer reviewed.

[1] 1.↵
A/RES/70/1 R. Transforming our world: the 2030 agenda for sustainable development. 2018. http://www.un.org/ga/search/view_doc.asp?symbol=A/RES/70/1&Lang=E (accessed 17 Feb 2018).

[2] 2.↵

Hanefeld J ,
Powell-Jackson T ,
Balabanova D
. Understanding and measuring quality of care: dealing with complexity. Bull World Health Organ 2017;95:368–74.doi:10.2471/BLT.16.179309
OpenUrl

[4] Hanefeld J ,

[5] Powell-Jackson T ,

[6] Balabanova D

[7] 3.↵

Murray CJ ,
Frenk J
. A WHO framework for health system performance assessment: Evidence and Information for Policy: World Health Organization, 1999.

[9] Murray CJ ,

[10] Frenk J

[11] 4.↵

Donabedian A
. The quality of care. How can it be assessed? 1988. Arch Pathol Lab Med 1997;121:1145.
OpenUrl PubMed Web of Science

[13] Donabedian A

[14] 5.↵

Pongsupap Y ,
Van Lerberghe W
. Choosing between public and private or between hospital and primary care: responsiveness, patient-centredness and prescribing patterns in outpatient consultations in Bangkok. Trop Med Int Health 2006;11:81–9.doi:10.1111/j.1365-3156.2005.01532.x
OpenUrl CrossRef PubMed Web of Science

[16] Pongsupap Y ,

[17] Van Lerberghe W

[18] 6.↵

Bitton A ,
Ratcliffe HL ,
Veillard JH , et al
. Primary health care as a foundation for strengthening health systems in low- and middle-income countries. J Gen Intern Med 2017;32:566–71.doi:10.1007/s11606-016-3898-5
OpenUrl

[20] Bitton A ,

[21] Ratcliffe HL ,

[22] Veillard JH , et al

[23] 7.↵

Wei X ,
Li H ,
Yang N , et al
. Changes in the perceived quality of primary care in Shanghai and Shenzhen, China: a difference-in-difference analysis. Bull World Health Organ 2015;93:407–16.doi:10.2471/BLT.14.139527
OpenUrl CrossRef PubMed

[25] Wei X ,

[26] Li H ,

[27] Yang N , et al

[28] 8.↵

Zou Y ,
Zhang X ,
Hao Y , et al
. General practitioners versus other physicians in the quality of primary care: a cross-sectional study in Guangdong Province, China. BMC Fam Pract 2015;16:134.doi:10.1186/s12875-015-0349-z
OpenUrl

[30] Zou Y ,

[31] Zhang X ,

[32] Hao Y , et al

[33] 9.↵

Feng S ,
Shi L ,
Zeng J , et al
. Comparison of primary care experiences in village clinics with different ownership models in Guangdong Province, China. PLoS One 2017;12:e0169241.doi:10.1371/journal.pone.0169241

[35] Feng S ,

[36] Shi L ,

[37] Zeng J , et al

[38] 10.↵

Wong WCW ,
Jiang S ,
Ong JJ , et al
. Bridging the gaps between patients and primary care in china: a nationwide representative survey. Ann Fam Med 2017;15:237–45.doi:10.1370/afm.2034
OpenUrl Abstract/FREE Full Text

[40] Wong WCW ,

[41] Jiang S ,

[42] Ong JJ , et al

[43] 11.↵

Zeng L ,
Li Y ,
Zhang L , et al
. Guideline use behaviours and needs of primary care practitioners in China: a cross-sectional survey. BMJ Open 2017;7:e015379.doi:10.1136/bmjopen-2016-015379

[45] Zeng L ,

[46] Li Y ,

[47] Zhang L , et al

[48] 12.↵

Li X ,
Lu J ,
Hu S , et al
. The primary health-care system in China. The Lancet 2017;390:2584–94.doi:10.1016/S0140-6736(17)33109-4
OpenUrl

[50] Li X ,

[51] Lu J ,

[52] Hu S , et al

[53] 13.↵

Das J ,
Hammer J
. Quality of primary care in low-income countries: facts and economics. Annu Rev Econom 2014;6:525–53.doi:10.1146/annurev-economics-080213-041350
OpenUrl CrossRef

[55] Das J ,

[56] Hammer J

[57] 14.↵

Sylvia S ,
Shi Y ,
Xue H , et al
. Survey using incognito standardized patients shows poor quality care in China’s rural clinics. Health Policy Plan 2015;30:322–33.doi:10.1093/heapol/czu014
OpenUrl CrossRef PubMed

[59] Sylvia S ,

[60] Shi Y ,

[61] Xue H , et al

[62] 15.↵

Berendes S ,
Heywood P ,
Oliver S , et al
. Quality of private and public ambulatory health care in low and middle income countries: systematic review of comparative studies. PLoS Med 2011;8:e1000433.doi:10.1371/journal.pmed.1000433
OpenUrl CrossRef PubMed

[64] Berendes S ,

[65] Heywood P ,

[66] Oliver S , et al

[67] 16.↵

Das J ,
Holla A ,
Das V , et al
. In urban and rural india, a standardized patient study showed low levels of provider training and huge quality gaps. Health Aff 2012;31:2774–84.doi:10.1377/hlthaff.2011.1356
OpenUrl Abstract/FREE Full Text

[69] Das J ,

[70] Holla A ,

[71] Das V , et al

[72] 17.↵

Das J ,
Gertler PJ
. Variations in practice quality in five low-income countries: a conceptual overview. Health Aff 2007;26:w296–309.doi:10.1377/hlthaff.26.3.w296
OpenUrl Abstract/FREE Full Text

[74] Das J ,

[75] Gertler PJ

[76] 18.↵

Das J ,
Hammer J ,
Leonard K
. The quality of medical advice in low-income countries. J Econ Perspect 2008;22:93–114.doi:10.1257/jep.22.2.93
OpenUrl PubMed Web of Science

[78] Das J ,

[79] Hammer J ,

[80] Leonard K

[81] 19.↵

Coarasa J ,
Das J ,
Gummerson E , et al
. A systematic tale of two differing reviews: evaluating the evidence on public and private sector quality of primary care in low and middle income countries. Global Health 2017;13:24.doi:10.1186/s12992-017-0246-4
OpenUrl

[83] Coarasa J ,

[84] Das J ,

[85] Gummerson E , et al

[86] 20.↵

Glassman PA ,
Luck J ,
O’Gara EM , et al
. Using standardized patients to measure quality: evidence from the literature and a prospective study. Jt Comm J Qual Improv 2000;26:644–53.doi:10.1016/S1070-3241(00)26055-0
OpenUrl PubMed

[88] Glassman PA ,

[89] Luck J ,

[90] O’Gara EM , et al

[91] 21.↵

Leonard K ,
Masatu MC
. Outpatient process quality evaluation and the Hawthorne Effect. Soc Sci Med 2006;63:2330–40.doi:10.1016/j.socscimed.2006.06.003
OpenUrl CrossRef PubMed Web of Science

[93] Leonard K ,

[94] Masatu MC

[95] 22.↵

McCambridge J ,
Witton J ,
Elbourne DR
. Systematic review of the Hawthorne effect: new concepts are needed to study research participation effects. J Clin Epidemiol 2014;67:267–77.doi:10.1016/j.jclinepi.2013.08.015
OpenUrl CrossRef PubMed

[97] McCambridge J ,

[98] Witton J ,

[99] Elbourne DR

[100] 23.↵

Woodward CA ,
McConvey GA ,
Neufeld V , et al
. Measurement of physician performance by standardized patients. Refining techniques for undetected entry in physicians’ offices. Med Care 1985;23:1019–27.
OpenUrl CrossRef PubMed Web of Science

[102] Woodward CA ,

[103] McConvey GA ,

[104] Neufeld V , et al

[105] 24.↵

Das J ,
Hammer J
. Money for nothing: the dire straits of medical practice in Delhi, India. J Dev Econ 2007;83:1–36.doi:10.1016/j.jdeveco.2006.05.004
OpenUrl CrossRef Web of Science

[107] Das J ,

[108] Hammer J

[109] 25.↵

Yu-jie Z ,
Min W ,
Qin L
. Analyze the development of standardized patient teaching in China by literature review in recent 10 years. Chin J Nurs 2009;44:259–61.
OpenUrl

[111] Yu-jie Z ,

[112] Min W ,

[113] Qin L

[114] 26.↵

Currie J ,
Lin W ,
Zhang W
. Patient knowledge and antibiotic abuse: Evidence from an audit study in China. J Health Econ 2011;30:933–49.doi:10.1016/j.jhealeco.2011.05.009
OpenUrl CrossRef PubMed Web of Science

[116] Currie J ,

[117] Lin W ,

[118] Zhang W

[119] 27.↵

Sylvia S ,
Xue H ,
Zhou C , et al
. Tuberculosis detection and the challenges of integrated care in rural China: A cross-sectional standardized patient study. PLoS Med 2017;14:e1002405.doi:10.1371/journal.pmed.1002405
OpenUrl CrossRef PubMed

[121] Sylvia S ,

[122] Xue H ,

[123] Zhou C , et al

[124] 28.↵

Li L ,
Lin C ,
Guan J
. Using standardized patients to evaluate hospital-based intervention outcomes. Int J Epidemiol 2014;43:897–903.doi:10.1093/ije/dyt249
OpenUrl CrossRef PubMed

[126] Li L ,

[127] Lin C ,

[128] Guan J

[129] 29.↵

Liao J ,
Chen Y ,
Cai Y , et al
. Using smartphone-based virtual patients to assess the quality of primary healthcare in rural China: protocol for a prospective multicentre study. BMJ Open 2018;8:e020943.doi:10.1136/bmjopen-2017-020943

[131] Liao J ,

[132] Chen Y ,

[133] Cai Y , et al

[134] 30.↵

Zhou M ,
Wang H ,
Zhu J , et al
. Cause-specific mortality for 240 causes in China during 1990–2013: a systematic subnational analysis for the Global Burden of Disease Study 2013. The Lancet 2016;387:251–72.doi:10.1016/S0140-6736(15)00551-6
OpenUrl

[136] Zhou M ,

[137] Wang H ,

[138] Zhu J , et al

[139] 31.↵

Das J ,
Hammer J
. Which doctor? Combining vignettes and item response to measure clinical competence. J Dev Econ 2005;78:348–83.doi:10.1016/j.jdeveco.2004.11.004
OpenUrl CrossRef Web of Science

[141] Das J ,

[142] Hammer J

[143] 32.↵

Hambleton RK ,
Swaminathan H ,
Rogers HJ
. Fundamentals of item response theory. Sage, 1991.

[145] Hambleton RK ,

[146] Swaminathan H ,

[147] Rogers HJ

[148] 33.↵

Rethans JJ ,
Gorter S ,
Bokken L , et al
. Unannounced standardised patients in real practice: a systematic literature review. Med Educ 2007;41:537–49.doi:10.1111/j.1365-2929.2006.02689.x
OpenUrl CrossRef PubMed Web of Science

[150] Rethans JJ ,

[151] Gorter S ,

[152] Bokken L , et al

[153] 34.↵

Kutob RM ,
Bormanis J ,
Crago M , et al
. Assessing culturally competent diabetes care with unannounced standardized patients. Fam Med 2013;45:400–8.
OpenUrl PubMed

[155] Kutob RM ,

[156] Bormanis J ,

[157] Crago M , et al

[158] 35.↵

Fenton JJ ,
Kravitz RL ,
Jerant A , et al
. Promoting patient-centered counseling to reduce use of low-value diagnostic tests: a randomized clinical trial. JAMA Intern Med 2016;176:191–7.doi:10.1001/jamainternmed.2015.6840
OpenUrl

[160] Fenton JJ ,

[161] Kravitz RL ,

[162] Jerant A , et al

[163] 36.↵

May L ,
Franks P ,
Jerant A , et al
. Watchful Waiting Strategy May Reduce Low-Value Diagnostic Testing. J Am Board Fam Med 2016;29:710–7.doi:10.3122/jabfm.2016.06.160056
OpenUrl Abstract/FREE Full Text

[165] May L ,

[166] Franks P ,

[167] Jerant A , et al

[168] 37.↵

Zabar S ,
Hanley K ,
Lee H , et al
. Ordering of labs and tests: variation and correlates of value-based care in an unannounced standardized patient visit. J Gen Intern Med 2016;32:S318.
OpenUrl

[170] Zabar S ,

[171] Hanley K ,

[172] Lee H , et al

[173] 38.↵

Tamblyn RM ,
Abrahamowicz M ,
Berkson L , et al
. First-visit bias in the measurement of clinical competence with standardized patients. Acad Med 1992;67:S22–4.doi:10.1097/00001888-199210000-00027
OpenUrl PubMed Web of Science

[175] Tamblyn RM ,

[176] Abrahamowicz M ,

[177] Berkson L , et al

[178] 39.↵

Shepherd HL ,
Barratt A ,
Trevena LJ , et al
. Three questions that patients can ask to improve the quality of information physicians give about treatment options: a cross-over trial. Patient Educ Couns 2011;84:379–85.doi:10.1016/j.pec.2011.07.022
OpenUrl CrossRef PubMed

[180] Shepherd HL ,

[181] Barratt A ,

[182] Trevena LJ , et al

[183] 40.↵

Peabody JW ,
Luck J ,
Jain S , et al
. Assessing the accuracy of administrative data in health information systems. Med Care 2004;42:1066–72.doi:10.1097/00005650-200411000-00005
OpenUrl CrossRef PubMed Web of Science

[185] Peabody JW ,

[186] Luck J ,

[187] Jain S , et al

[188] 41.↵
Organization WH. WHO handbook for guideline development: World Health Organization, 2014.

[189] 42.↵

Campbell SM ,
Braspenning J ,
Hutchinson A , et al
. Research methods used in developing and applying quality indicators in primary care. Qual Saf Health Care 2002;11:358–64.doi:10.1136/qhc.11.4.358
OpenUrl Abstract/FREE Full Text

[191] Campbell SM ,

[192] Braspenning J ,

[193] Hutchinson A , et al

[194] 43.↵

De Champlain AF ,
Margolis MJ ,
King A , et al
. Standardized patients’ accuracy in recording examinees’ behaviors using checklists. Acad Med 1997;72:S85–7.doi:10.1097/00001888-199710001-00029
OpenUrl PubMed Web of Science

[196] De Champlain AF ,

[197] Margolis MJ ,

[198] King A , et al

[199] 44.↵

Vu NV ,
Steward DE ,
Marcy M
. An assessment of the consistency and accuracy of standardized patients’ simulations. J Med Educ 1987;62:1000–2.
OpenUrl PubMed Web of Science

[201] Vu NV ,

[202] Steward DE ,

[203] Marcy M

[204] 45.↵

Vu NV ,
Marcy MM ,
Colliver JA , et al
. Standardized (simulated) patients’ accuracy in recording clinical performance check-list items. Med Educ 1992;26:99–104.
OpenUrl PubMed Web of Science

[206] Vu NV ,

[207] Marcy MM ,

[208] Colliver JA , et al

[209] 46.↵

Maiburg BH ,
Rethans JJ ,
van Erk IM , et al
. Fielding incognito standardised patients as ‘known’ patients in a controlled trial in general practice. Med Educ 2004;38:1229–35.doi:10.1111/j.1365-2929.2004.02015.x
OpenUrl PubMed

[211] Maiburg BH ,

[212] Rethans JJ ,

[213] van Erk IM , et al

[214] 47.↵

Gorter SL ,
Rethans JJ ,
Scherpbier AJ , et al
. How to introduce incognito standardized patients into outpatient clinics of specialists in rheumatology. Med Teach 2001;23:138–44.doi:10.1080/014215931048
OpenUrl CrossRef PubMed Web of Science

[216] Gorter SL ,

[217] Rethans JJ ,

[218] Scherpbier AJ , et al

[219] 48.↵

Siminoff LA ,
Rogers HL ,
Waller AC , et al
. The advantages and challenges of unannounced standardized patient methodology to assess healthcare communication. Patient Educ Couns 2011;82:318–24.doi:10.1016/j.pec.2011.01.021
OpenUrl CrossRef PubMed

[221] Siminoff LA ,

[222] Rogers HL ,

[223] Waller AC , et al

[224] 49.↵

Oates J ,
Weston WW ,
Jordan J
. The impact of patient-centered care on outcomes. Fam Pract 2000;49:796–804.
OpenUrl Web of Science

[226] Oates J ,

[227] Weston WW ,

[228] Jordan J

[229] 50.↵

Hudon C ,
Fortin M ,
Haggerty JL , et al
. Measuring patients’ perceptions of patient-centered care: a systematic review of tools for family medicine. Ann Fam Med 2011;9:155–64.doi:10.1370/afm.1226
OpenUrl Abstract/FREE Full Text

[231] Hudon C ,

[232] Fortin M ,

[233] Haggerty JL , et al

[234] 51.↵

Brown J ,
Stewart M ,
Tessier S
. Assessing communication between patients and doctors: a manual for scoring patient-centred communication. London: Thames Valley Family Practice Research Unit, 1995.

[236] Brown J ,

[237] Stewart M ,

[238] Tessier S

[239] 52.↵

Ozuah PO ,
Reznik M
. Can standardised patients reliably assess communication skills in asthma cases? Med Educ 2007;41:1104–5.doi:10.1111/j.1365-2923.2007.02885.x
OpenUrl PubMed

[241] Ozuah PO ,

[242] Reznik M

[243] 53.↵

Zabar S ,
Ark T ,
Gillespie C , et al
. Can unannounced standardized patients assess professionalism and communication skills in the emergency department? Acad Emerg Med 2009;16:915–8.doi:10.1111/j.1553-2712.2009.00510.x
OpenUrl PubMed

[245] Zabar S ,

[246] Ark T ,

[247] Gillespie C , et al

[248] 54.↵

Gorter S ,
Rethans JJ ,
Scherpbier A , et al
. Developing case-specific checklists for standardized-patient-based assessments in internal medicine: a review of the literature. Acad Med 2000;75:1130–7.doi:10.1097/00001888-200011000-00022
OpenUrl CrossRef PubMed Web of Science

[250] Gorter S ,

[251] Rethans JJ ,

[252] Scherpbier A , et al

[253] 55.↵

Franz CE ,
Epstein R ,
Miller KN , et al
. Caught in the act? Prevalence, predictors, and consequences of physician detection of unannounced standardized patients. Health Serv Res 2006;41:2290–302.doi:10.1111/j.1475-6773.2006.00560.x
OpenUrl CrossRef PubMed Web of Science

[255] Franz CE ,

[256] Epstein R ,

[257] Miller KN , et al

[258] 56.↵

Swartz MH ,
Colliver JA ,
Bardes CL , et al
. Validating the standardized-patient assessment administered to medical students in the New York City Consortium. Acad Med 1997;72:619–26.doi:10.1097/00001888-199707000-00014
OpenUrl PubMed Web of Science

[260] Swartz MH ,

[261] Colliver JA ,

[262] Bardes CL , et al

[263] 57.↵

Rethans JJ ,
Drop R ,
Sturmans F , et al
. A method for introducing standardized (simulated) patients into general practice consultations. Br J Gen Pract 1991;41:94–6.
OpenUrl Abstract/FREE Full Text

[265] Rethans JJ ,

[266] Drop R ,

[267] Sturmans F , et al

[268] 58.↵

Luck J ,
Peabody JW
. Using standardised patients to measure physicians’ practice: validation study using audio recordings. BMJ 2002;325:679.doi:10.1136/bmj.325.7366.679
OpenUrl Abstract/FREE Full Text

[270] Luck J ,

[271] Peabody JW

[272] 59.↵

Shirazi M ,
Sadeghi M ,
Emami A , et al
. Training and validation of standardized patients for unannounced assessment of physicians’ management of depression. Acad Psychiatry 2011;35:382–7.doi:10.1176/appi.ap.35.6.382
OpenUrl PubMed

[274] Shirazi M ,

[275] Sadeghi M ,

[276] Emami A , et al

[277] 60.↵

Lin LI
. A concordance correlation coefficient to evaluate reproducibility. Biometrics 1989;45:255–68.
OpenUrl CrossRef PubMed Web of Science

[279] Lin LI

[280] 61.↵

Steichen TJ ,
Cox NJ
. A note on the concordance correlation coefficient. Stata J 2002;2:183–9.doi:10.1177/1536867X0200200206
OpenUrl

[282] Steichen TJ ,

[283] Cox NJ

[284] 62.↵

Lawrence I ,
Lin K
. Assay validation using the concordance correlation coefficient. Biometrics 1992:599–604.

[286] Lawrence I ,

[287] Lin K

[288] 63.↵

Kwiecien R ,
Kopp-Schneider A ,
Blettner M
. Concordance analysis: part 16 of a series on evaluation of scientific publications. Dtsch Arztebl Int 2011;108:515.doi:10.3238/arztebl.2011.0515
OpenUrl PubMed

[290] Kwiecien R ,

[291] Kopp-Schneider A ,

[292] Blettner M

[293] 64.↵

Bland JM ,
Altman DG
. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1986;1:307–10.
OpenUrl CrossRef PubMed Web of Science

[295] Bland JM ,

[296] Altman DG

[297] 65.↵

Rhodes K
. Taking the mystery out of “mystery shopper” studies. N Engl J Med 2011;365:484–6.doi:10.1056/NEJMp1107779
OpenUrl CrossRef PubMed

[299] Rhodes K

[300] 66.↵

Rhodes KV ,
Miller FG
. Simulated patient studies: an ethical analysis. Milbank Q 2012;90:706–24.doi:10.1111/j.1468-0009.2012.00680.x
OpenUrl CrossRef PubMed Web of Science

[302] Rhodes KV ,

[303] Miller FG

[304] 67.↵

Weiner SJ ,
Schwartz A
. Directly observed care: can unannounced standardized patients address a gap in performance measurement? J Gen Intern Med 2014;29:1183–7.doi:10.1007/s11606-014-2860-7
OpenUrl CrossRef PubMed

[306] Weiner SJ ,

[307] Schwartz A

Log in using your username and password

Main menu

Log in using your username and password

You are here

Abstract

Statistics from Altmetric.com

Request Permissions

Strengths and limitations of this study

Background

Methods

Survey design

Survey population/frame

Sampling procedures

Sample size calculation

Supplemental material

USP case development and implementation

Case selection

Development team

Case description

Script

Quality checklist

Selecting and training SPs

Fielding and implementing SPs

Variables

Outcome variables

Scoring methods

Other variables

Analytical methods

USP validation

Survey analysis

Related studies

Ethics and dissemination

Patient and public involvement

Discussion

References

Footnotes

Read the full text or download the PDF:

Log in using your username and password