Article Text


Dynamics and Determinants of HPV Infection: The Michigan HPV and Oropharyngeal Cancer (M-HOC) Study
  1. Marisa C Eisenberg1,
  2. Lora P Campredon1,
  3. Andrew F Brouwer1,
  4. Heather M Walline2,
  5. Brittany M Marinelli2,
  6. Yan Kwan Lau1,
  7. Trey B Thomas2,
  8. Rachel L Delinger1,
  9. Taylor S Sullivan1,
  10. Monica L Yost1,
  11. Christine M Goudsmit2,
  12. Thomas E Carey2,
  13. Rafael Meza1
  1. 1 Department of Epidemiology, University of Michigan, Ann Arbor, Ann Arbor, Michigan, USA
  2. 2 Department of Otolaryngology, University of Michigan, Ann Arbor, Ann Arbor, Michigan, USA
  1. Correspondence to Dr Marisa C Eisenberg; marisae{at} and Dr Rafael Meza; rmeza{at}


Introduction Human papillomavirus (HPV) is the primary cause of cervical and other anogenital cancers and is also associated with head and neck cancers. Incidence of HPV-related oropharyngeal squamous cell cancers (OPSCCs) is increasing, and HPV-related OPSCCs have surpassed cervical cancer as the most common HPV-related cancer in the USA. Given the multisite nature of HPV, there is strong interest in collecting data from both genital and oral sites, as well as associated data on social and sexual behaviours. The overarching goal of this study is to evaluate patterns of oral HPV infection incidence, clearance and persistence and their relationship to sexual behaviour history.

Methods and analysis Participants are recruited from two populations: college students at a large public university and general population from the surrounding area. At the first study visit, participants complete a detailed sexual history, health and behaviour questionnaire. Follow-up visits occur every 3–4 months over 3 years, when participants complete an abbreviated questionnaire. All participants provide a saliva sample at each visit, and eligible participants may provide a cervicovaginal self-swab. Genetic material isolated from specimens is tested for 15 high-risk and 3 low-risk HPV types. Statistical analyses will examine outcome variables including HPV prevalence, incidence, persistence and clearance. Logistic regression models will be used to estimate odds ratios and 95% confidence intervals for associations between the outcomes of interest and demographic/behavioural variables collected in the questionnaires. The longitudinal HPV infection data and detailed sexual history data collected in the questionnaires will allow us to develop individual-based network models of HPV transmission and will be used to parameterise multiscale models of HPV-related OPSC carcinogenesis.

Ethics and dissemination This study has been approved by the University of Michigan Institutional Review Board. All participants are consented in person by trained study staff. Study results will be disseminated through peer-reviewed publications.

  • human papillomavirus
  • oropharyngeal cancer
  • longitudinal study
  • sexual behavior
  • oral rinse

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See:

Statistics from

Strengths and limitations of this study

  • Longitudinal design with 3–4 study visits per participant per year allows for analysis of incidence, persistence and clearance of oral human papillomavirus (HPV).

  • Collection of both oral and cervicovaginal samples allows for comparison of infections across sites.

  • Low prevalence of high-risk oral HPV may limit our ability to connect HPV infection dynamics to demographic, sexual and behavioural variables.

  • Participant-dependent saliva production and swishing effectiveness may result in variable sample quality, but this concern is mitigated by PCR sensitivity.

  • Our HPV test separately identifies 15 high-risk genotypes but only 3 low-risk genotypes and thus may not provide a complete picture of the patterns of mucosal HPV types.


Infectious disease agents account for approximately 20% of cancers worldwide.1 The human papillomavirus (HPV) is the primary cause of cervical and most anogenital cancers, as well as an increasing fraction of head and neck (HN) cancers.2 3 Indeed, incidence of HPV-related oropharyngeal squamous cell cancers (OPSCCs) is increasing,4–7 and HPV-related OPSCCs have surpassed cervical cancer as the most common HPV-related cancer in the USA.3 At the University of Michigan (UM) Cancer Center, over 80% of patients with OPSCC have high-risk HPV detected in their cancers.8 Although many HPV infections are benign, high-risk HPV infections can disrupt several key cancer regulation pathways, including those involving tumour suppressors p53 and pRb.9 10

The underlying reasons behind the significant rise in HPV-related OPSCC rates in the USA are unclear. From a prevention perspective, addressing HPV-related OPSCC begins with an understanding of the epidemiology of the virus, both in the oral cavity and more broadly. Estimates from the National Health and Nutrition Examination Survey (NHANES) suggest that oral HPV prevalence in the USA is about 11%–12% in men and 3%–4% in women.11–14 However, while oral HPV infections may clear relatively quickly (the HPV in Men (HIM) trial estimated a mean clearance time of approximately 7 months15), persistent infections may lead to cancer after several decades. (Although there are no precise estimates of time to cancer for OPSCCs, cervical lesions are estimated to progress to cancer in 10–30 years16).

While studies are increasingly treating HPV as a multisite disease, the mechanisms underlying the relationship between oral, genital and anal HPV infections are not understood, although autoinoculation is increasingly implicated.17 18 The HPV vaccine further complicates the epidemiological picture: HPV vaccination is expected to have an impact on the rates of oral HPV incidence, at least for those genotypes included in the vaccine.12 19 Countries with more aggressive vaccination campaigns, such as Australia, have already seen a reduction in HPV-related genital warts.20 21 It remains to be seen whether HPV vaccination will ultimately reduce the incidence and mortality of HPV-related HN cancers.

Many open questions remain regarding oral HPV transmission, epidemiology, infection and persistence, the mechanisms of HPV-related HN carcinogenesis, and the connection between the ongoing oral HPV epidemic and the rising OPSCC incidence. The knowledge gaps surrounding the mechanisms and epidemiology of HPV in the oral cavity hamper our ability to develop more effective measures to prevent oral HPV infections and, in turn, reduce the burden of HN cancers. Prevalence studies can only draw associations between the burden and probable risk factors. Thus, longitudinal studies with frequent testing and assessment of recent sexual behaviour are needed to understand HPV transmission, persistence and clearance and to associate these events with certain behaviours. Here, we describe the methods, study design and initial study population characteristics of a longitudinal study of oral HPV incidence, clearance and related sexual behaviours; this study and its cervicovaginal HPV substudy are the epidemiological arms of the Michigan HPV and Oropharyngeal Cancer (MHOC) Study. Our research will contribute to a better understanding of the complex factors affecting patterns of HPV infection. This study will inform predictive and mechanistic models and develop effective prevention strategies for OPSCC and other HPV-related cancers.

Methods and analysis


The overarching goal of this study is to evaluate patterns of oral HPV infection incidence and clearance and their relationship to sexual history and sexual behaviours. Participants are recruited to the study from two populations: college students at UM (a large public university) and the general population of the surrounding area. As a secondary goal, we will also compare HPV infection patterns in the oral and cervicovaginal sites as part of a substudy examining cervicovaginal HPV infection.

Subject enrolment and consent

Here, we describe the process for subject enrolment and consent (diagrammed in figure 1).

Figure 1

Flow diagram for the study detailing recruitment methods, visit structure (baseline and follow-ups) and the cervicovaginal substudy. H&N, head and neck.

Identification of study subjects

Participants are continuously recruited by one of three methods. (1) Advertisement: we place recruitment announcements in UM health clinics and area community sites (eg, libraries, coffee shops, bus stops), as well as on UM clinical research volunteer’s website ( In this method, individuals who are potentially interested in participating in the study contact the study coordinator to determine eligibility. (2) Screening clinics: we approach participants of the UM health system free throat cancer screening clinic to invite eligible participants to join the study. (3) Student residence halls: once per year, we set up information tables in or near UM residence halls and invite eligible students to join the student.

Eligibility criteria. Volunteers are considered for enrolment if they

  • Are age 18 or older.

  • Do not have a history of HN cancer.

  • Are willing to return for a follow-up every 3–4 months over 3 years.

  • Are able and willing to provide written informed consent.

To participate in the optional cervicovaginal substudy, participants must additionally

  • Have a vagina.

  • Not be pregnant.

  • Not be menstruating at the time of the study visit.


We document informed consent for all willing and eligible participants before enrolment in the study. Separate, additional consent is obtained prior to enrolment in the cervicovaginal substudy. All study subjects are given the opportunity to consent to storage of residual specimens collected in this study for future research purposes. Specimens collected from subjects who do not consent to specimen banking will be destroyed after completion of the analyses of this study.

Benefit to participants

Since we are not using a clinical test, our Institutional Review Board approval does not allow returning individual HPV test results to participants, but population-level results will be disseminated through newsletters after peer-reviewed publication. Staff are knowledgeable and available to discuss vaccination and screening with interested participants. Because of the nature of the sexual and behavioural questionnaire, we also have pamphlets for the UM Sexual Assault Prevention and Awareness Center available to participants.

Enrolment and statistical power

Phase I of this study recruited between April 2015 and December 2017, enrolling 395 participants. At this sample size and level of significance 0.05, we will detect the difference at baseline between two equally sized populations with HPV prevalence 10% and 20% with 80% power. Assuming each participant completes 10 visits, we will detect the difference between 10% and 13% ever HPV positive with more than 80% power. At the time of submission, 321 participants had completed at least one follow-up visit, and 1693 baseline and follow-up visits had been completed. Follow-up visits are ongoing. Pending funding, phase II is anticipated to recruit a similar number of participants (potentially with additional study locations as well).

Data collection

We administer surveys and collect biological specimens as described below.

Social and sexual behaviour survey

All study participants complete a comprehensive questionnaire at the baseline visit and an abbreviated questionnaire at all subsequent visits. The questionnaire consists of questions related to demographic information; behaviour, including alcohol, cigarette, cigar, tobacco, and other drug use; and sexual practices, sexual history, sexual healthcare (including HPV vaccination status), general health and history of sexually transmitted infections (STIs). The baseline questionnaire collects a complete sexual history, with subsequent follow-up visits providing more recent information and updates. The questions regarding sexual behaviours and history are designed to provide detailed information about the timing of different sexual partnerships and sexual practices in each sexual partnership and form a longitudinal, egocentric sexual network data set. Table 1 details the categories and numbers of questions given in both the baseline and follow-up surveys. The questionnaire is administered electronically via Qualtrics in a quiet, private room. Participants are asked to enter their unique study identification number at the beginning of the questionnaire. Participants are free to skip or otherwise not answer any question.

Table 1

Categories and numbers of questions asked in Michigan HPV and Oropharyngeal Cancer (MHOC) study baseline and follow-up questionnaire.

Oral specimen collection

Two millilitres of saliva and oral rinse specimen are collected at each visit (ie, baseline and every 3–4 months for 3 years) using Oragene RE-100 kits.22 Participants may use a sweetener or candy to stimulate saliva production if needed.

In the initial phase of the study, we tested several alternative saliva collection methods: Scope-brand mouthwash (administered as in NHANES23), and two commercially available kits developed for detection of DNA and RNA in saliva, the OM-505 (suitable for DNA, stable for weeks at room temperature) and RE-100 (designed to protect RNA, stable for 60 days at room temperature).22 24 We were able to detect DNA from saliva in all three methods but were only able to reliably recover RNA from the RE-100 kit sample. We selected the RE-100 kits for use in the study so that we could isolate both DNA and RNA from a single sample.

Cervicovaginal swab collection

Participants fulfilling the eligibility criteria are given the option at each study visit to provide a self-collected cervicovaginal swab sample using a HerSwab25 in addition to the oral specimen. The HerSwab is a vaginal swab designed to sample near the cervix. Participants self-administer the swab by inserting the tip of the HerSwab into the vagina; turning the crank so that the brush is fully extended; removing the swab; turning the crank the opposite direction to retract the brush back into the tip; and placing the HerSwab into a provided resealable plastic bag. The participants are provided with the printed, graphical instructions designed by the manufacturer and collect their sample in a restroom. Staff facilitating the study are available to review the procedure and answer any participant questions.

Cervicovaginal swab acceptability survey

All study participants who provide a cervicovaginal sample complete a questionnaire after each cervicovaginal self-collection that asks about comfort, ease of use and subjective assessment of the self-collection procedure.26 The questionnaire is administered electronically via Qualtrics.

Follow-up visit schedule

Participants are followed for up to 3 years after their baseline visit, for a total of up to 12 study visits. The study prioritises longitudinal follow-up over regular follow-up but strives to achieve both. Therefore, if participants miss a follow-up visit, they are not excluded and instead continue the study when next available. This protocol facilitates research in the student population, as students are often gone for extended periods of time on scheduled breaks.

Sample processing and analysis

Sample processing and DNA extraction differs for oral and cervicovaginal samples, but DNA analysis is the same for both kinds of samples.

Oral sample processing

DNA isolation from saliva samples is performed by first splitting the sample in half, half for DNA and half for RNA. The saliva is heated to 50C for 60 min, and 1/25th volume of prepIT-L2P (DNA Genotek) is added. Sample is incubated for 10 min at 4C and spun for 20 min at 4100 revolutions per minute. After that the supernatant is added to equal volumes of 95% ethanol to precipitate the DNA. The DNA is pelleted, washed with 200 µL of 70% ethanol and dried. The DNA pellet is rehydrated with Tris–EDTA buffer and quantified with picogreen (Invitrogen P11496) or the QuBit (Invitrogen Q32850). RNA is isolated from samples that have tested positive for HPV DNA. RNA isolation from saliva samples is performed using the Oragene RNA isolation method according to the manufacturer’s instruction which includes QIAgen’s RNeasy Micro Kit (74004). In short, the samples are heated to 50C for 60 min, 95C for 15 min, and 20 µL neutralisation solution is added. Samples then are incubated on ice for 10 min followed by the RNeasy Kit protocol. The RNA is then converted to cDNA for further testing. The cDNA synthesis is preformed using Invitrogen’s Superscript III protocol and reagents (18080-044).

Cervicovaginal sample processing

After collection, the HerSwab is soaked for 1 min in 20 mL of ThinPrep solution and subsequently swished vigorously for 1 min. The 20 mL solution is then split evenly into two 15 mL nuclease-free conical tubes—one for RNA processing and one for DNA—and the tube for RNA is centrifuged. The pellet for RNA use is resuspended in RNAlater for isolation and later study. DNA and RNA processing is the same as above. For RNA isolation, the cells in RNAlater are pelleted by centrifugation and RNA is isolated using the RNeasy as described above. Qiagen reagents are used for DNA isolation. The cells in the ThinPrep sample are pelleted, resuspended in cell lysis buffer, incubated with Proteinase k, treated with protein precipitation buffer, vortexed vigorously, pelleted and placed on ice for 10 min. The supernatant is transferred to isopropanol in glycogen solution mixed, and the DNA pellet is washed twice with cold ethanol, rehydrated in hydration solution and dissolved at 65C, and DNA is quantified with picogreen or QuBit.

DNA analysis

DNA and cDNA samples are assayed using a previously described, highly sensitive method.8 Briefly, multiplex competitive PCR amplification of the heterogeneous E6 region of 15 discrete high-risk (16, 18, 31, 33, 35, 39, 45, 51, 52, 56, 58, 59, 66, 68, 73) and 3 low-risk (6, 11, 90) HPV types is performed, followed by probe-specific single-base extension. Extension products are loaded onto a matrix silicon chip array and separated by size using matrix-assisted laser desorption/ionisation time-of-flight mass spectroscopy, allowing detection of any HPV types present in the sample.

Statistical analysis

Outcomes of interest include, but are not limited to, HPV prevalence (detection of HPV, or detection of a specific HPV genotype), incidence (detection of HPV in a previously uninfected person, or detection of a specific HPV genotype in a person who previously tested negative for that genotype), persistence (detection of HPV at subsequent study visits, or detection of specific genotypes at subsequent study visits), and clearance (non-detection of HPV in a previously infected person, or non-detection of a specific HPV genotype in a person previously infected by that genotype). Other patterns of HPV detection, such as patterns of intermittent detection of the same genotype or detection of an HPV genotype at the oral site after previous detection at the genital site (or vice versa), will be considered. We will adjust for the time between visits as appropriate. Presence of RNA in addition to DNA will be used to distinguish between active and latent infections.

Covariates of interest include, but are not limited to, sex, age, student status, HIV status, sexual orientation, HPV vaccine status, circumcision status, number of vaginal, oral, and anal sexual partners, timing and nature of recent sexual activity, use of sexual protection and substance use. Demographic and sexual behaviour differences among subpopulations will be considered.

Logistic regression models will be used to estimate odds ratios and 95% confidence intervals for associations between the outcomes of interest and the demographic/behavioural variables. Variables that are significant in univariate analysis or are considered to be relevant based on a priori knowledge will be evaluated in multiple logistic regression models. Cox regression models will be used to investigate the effect of covariates on clearance. Analyses will be performed in SAS or R statistical software.

Modelling analysis

The longitudinal HPV infection data and detailed sexual partner and sexual history data collected in the questionnaires will allow us to develop individual-based network models of HPV transmission. While the data from the study are necessarily egocentric (ie, we cannot link partners reported by participants to any specific individual in or out of the study to form a full sexual network), they can nonetheless be used to directly parameterise simulation models of transmission. The models we develop can match the timing of partner switching, numbers of partners and type of sexual contact, as well as a range of behavioural and demographic covariates as well as HPV infections and clearance times. We will draw from the measured distributions and correlation patterns of these variables in our population to generate simulated populations with sexual behaviour patterns similar to those measured in our study (eg, using approaches based on configuration model methods27 to connect our simulated sexual network based on partner history data). This will allow us to generalise our statistical model findings to different populations and to examine counterfactual scenarios of alternative HPV vaccine coverage levels. Similar models have been developed for other studies, both for STIs and other infectious diseases.28–33 This study is designed to specifically inform such modelling analyses.

Patient and public involvement

Patients and the public were not involved in the development of this study. Study results will be disseminated to the public through peer-reviewed publications as described in the ethics and dissemination section. Additionally, the newsletter described will be made accessible to the public through the study website.

Ethics and dissemination

This study was deemed to not require additional safety monitoring beyond the monitoring of the institutional review board. All study staff interacting with participants and participant data complete the University of Michigan Program for Education and Evaluation in Responsible Research and Scholarship (PEERRS) prior to beginning the project.31 Study results will be disseminated through a series of peer-reviewed publications. Study participants will receive a study results newsletter written for a general audience.


The knowledge gained from this study will contribute to the field of HPV research and will be disseminated via peer-reviewed publications and presentations at national and international conferences. This study will provide improved estimates of HPV clearance rates in the oral cavity, due to the short follow-up times, and will also allow us to examine how HPV clearance is affected by a range of factors, including coinfection with multiple HPV types. More broadly, this study will improve our understanding of the relationships between behaviour patterns and HPV, helping to elucidate how sexual behaviours, numbers of partners and other behaviours (eg, alcohol and substance use) relate to HPV infection patterns. Additionally, this study will provide detailed data on HPV transmission in the college student population—a key population who are also seeing changing vaccination patterns as HPV vaccine coverage improves. This will allow us to further examine the impact of HPV vaccination on the incidence and prevalence of a range of vaccine types (both those covered by the vaccine and others). The study questionnaire and longitudinal design also are also useful in developing sexual network models of transmission, which will allow us to examine alternative vaccination strategies and generalise to other populations.

Limitations and strengths

Because oral HPV prevalence in the USA is relatively low (prevalence of high-risk oral HPV among adults 18–69 was 4.0% in 2011–2014: 6.8% in men and 1.2% in women14), there is a risk that the number of HPV-positive samples in the study will be low, limiting our abilities to draw inference between HPV infection and the demographic and behavioural variables of the participant. The quality of our saliva and oral rinse specimens may depend on the saliva production and swishing efficacy of each participant, although this is mitigated by the sensitivity of the PCR analysis. Finally, we only test for 18 genotypes, which, although we cover all high-risk types, may not give as complete a picture of patterns of mucosal HPV infection.

Instead of identifying risk factors through associations in cross-sectional prevalence analyses, we will use a longitudinal approach with frequent oral and cervicovaginal HPV testing and updated detailed sexual behaviour questionnaires. This will allow for understanding prevalence of HPV and incidence, persistence and clearance. Some studies have also adopted a longitudinal approach (eg,34–38), but many have had long times between follow-up or have focused only on sex or only on genital infection sites. In the MHOC study, we use short follow-up times, with 3–4 visits per participant per year. We focus primarily on oral infection in both men and women but also collect cervicovaginal samples to assess multisite infections.


We would like to thank Professors Carol Bradford, Paul Hoff, Norm Hogikyan, Trachette Jackson, James Koopman, Kelly Malloy, John McHugh, Erin McKean, Mark Prince, Jamie Riddell, Laura Rozek, Andrew Shuman, Matthew Spector, Jeffrey Stanley, Jeremy Taylor, Francis Worden, and Gregory Wolf, for their helpful comments, advice, and support. We also thank our collaborators, study and laboratory staff, and students: Miranda West, Thankum Nair, Lisa Petersen, Sheila Terrones, Chad Brenner, Emily Bellile, Justin Bell, Tori Melandez, and Eliyas Asfaw, Christian Alvarez, Kristin Bevilacqua, Pian Pian Cao, Alanna Clark, Jesse Contreras, Rachel Delinger, Greg Foakes, Anna Gottschlich, Manila Hada, Christina Hanson, Nadine Jawad, Bala Naveen Kakaraparthi, Alexandra Kalabat, Jung Woo Lee, Jodi McCloskey, Ivan Montoya, Anna Morris, Chinmay Pandit, Derrick Sund, Kayla Stanek, Peter Tortora, Taylor Vandenberg, and Lucy Yang. We would like to thank all of our staff and our study participants, without whom this study would not have been possible.


  1. 1.
  2. 2.
  3. 3.
  4. 4.
  5. 5.
  6. 6.
  7. 7.
  8. 8.
  9. 9.
  10. 10.
  11. 11.
  12. 12.
  13. 13.
  14. 14.
  15. 15.
  16. 16.
  17. 17.
  18. 18.
  19. 19.
  20. 20.
  21. 21.
  22. 22.
  23. 23.
  24. 24.
  25. 25.
  26. 26.
  27. 27.
  28. 28.
  29. 29.
  30. 30.
  31. 31.
  32. 32.
  33. 33.
  34. 34.
  35. 35.
  36. 36.
  37. 37.
  38. 38.
View Abstract


  • Contributors The study was conceptualised and funding was obtained by MCE, TEC and RM. Methodology and original protocols were developed by MCE, LPC, AFB, HMW, BMM, YKL, TBT, RLD, CMG, TEC and RM. Project administration is managed by LPC and YKL. MCE, LPC, AFB, HMW, BMM, YKL, CMG, TEC and RM are responsible for the supervision of study implementation, staff and students. The original draft was prepared by MCE with the assistance of LPC, TSS and MLY. Review and editing were completed by MCE, LPC, AFB, HMW, BMM, YKL, TEC and RM.

  • Funding This work was supported by the National Institutes for Health Grant U01CA182915, the University of Michigan MCubed Program, a pilot grant from the University of Michigan Comprehensive Cancer Center’s Cancer Epidemiology & Prevention Program (supported by the National Cancer Institute of the National Institutes of Health grant P30CA046592), and the Michigan Institute for Clinical and Health Research (NIH Grant UL1TR000433).

  • Competing interests None declared.

  • Patient consent Not required.

  • Ethics approval The Michigan HPV and Oropharyngeal Cancer study was approved by the University of Michigan Institutional Review Board on 26 August 2014 (IRB # HUM00090326).

  • Provenance and peer review Not commissioned; externally peer reviewed.

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.