Article Text

This article has a correction. Please see:


Determinants of regular smoking onset in South Africa using duration analysis
  1. Nicole Vellios,
  2. Corné van Walbeek
  1. School of Economics, University of Cape Town, Cape Town, Western Cape, South Africa
  1. Correspondence to Dr Nicole Vellios; nicolevellios{at}


Objectives South Africa has achieved significant success with its tobacco control policy. Between 1994 and 2012, the real price of cigarettes increased by 229%, while regular smoking prevalence decreased from about 31% to 18.2%.

Methods Cigarette prices and socioeconomic variables are used to examine the determinants of regular smoking onset. We apply duration analysis techniques to the National Income Dynamics Study, a nationally representative survey of South Africa.

Results We find that an increase in cigarette prices significantly reduces regular smoking initiation among males, but not among females. Regular smoking among parents is positively correlated with smoking initiation among children. Children with more educated parents are less likely to initiate regular smoking than those with less educated parents. Africans initiate later and at lower rates than other race groups.

Conclusions As the tobacco epidemic is shifting towards low-income and middle-income countries, there is an increasing urgency to perform studies in these countries to influence policy. Higher cigarette excise taxes, which lead to higher retail prices, reduce smoking prevalence by encouraging smokers to quit and by discouraging young people from starting smoking.

Statistics from

Strengths and limitations of this study

  • This study uses a nationally representative data set.

  • We use duration analysis to investigate the effectiveness of prices in discouraging regular smoking initiation in a low-income and middle-income country, which has relevance for other low-income and middle-income countries considering increasing excise taxes.

  • We create a pseudo-panel to track individuals over time.

  • Some variables are time invariant (gender, population group), while other variables are time variant (price of cigarettes).


A substantial body of literature has conclusively shown that there is an inverse relationship between tobacco prices and tobacco consumption.1 By increasing the excise tax, governments are typically able to increase the retail price of tobacco products, which induces current smokers to quit, reduces the consumption of continuing smokers and reduces the initiation of tobacco products by potential smokers. This paper considers determinants of regular smoking initiation, and in particular the impact of the price of cigarettes on regular smoking initiation in South Africa.

The International Agency for Research on Cancer (IARC) found that there is sufficient evidenceii to conclude that increases in tobacco taxes that increase prices reduce the initiation and uptake of tobacco use among young people.1 Guindon2 subsequently conducted an extensive literature review on the impact of tobacco prices on smoking initiation. He identified 27 studies and found that the effect of price on the smoking initiation decision differs across these studies and argues that data, measurement and methodological issues inherent in much of the literature affect the results.2 Guindon2 concludes that the evidence that taxes or tobacco prices impact smoking initiation is not strong. Two papers, which are not covered in Guindon's review, find that cigarette taxes and prices affect smoking initiation decisions in the USA and in Canada.3 ,4

The existing literature is dominated by studies performed in high-income countries, while only several studies consider the determinants of smoking initiation in low-income and middle-income countries. Two of these are based on Vietnamese data and found that tobacco prices have a statistically significant and fairly substantial effect on smoking initiation.5 ,6 The third study looks at six low-middle and eight upper-middle-income countries and finds that higher prices reduce smoking initiation in low-income and middle-income countries and promote cessation in upper-income and middle-income countries.7 The fourth study looks at 48 countries of which 40 are developing countries and finds that price increases effectively reduce initiation in early youth.8 The fifth study analyses life-course smoking behaviour in China and finds that cigarette prices have a small and insignificant effect on smoking initiation.9

It is now possible to fill this gap in our knowledge with respect to smoking initiation in low-income and middle-income countries, thanks to the rapid growth in the number of large household surveys in those countries over the past decade.

We chose data from South Africa since it is at the forefront of middle-income countries in using excise tax increases as a tobacco control measure. A democratically elected government came to power in May 1994 and made primary and preventative healthcare a strategic priority. As part of the focus, the government has consistently increased the excise tax on tobacco products since 1994 and passed fairly stringent tobacco control legislation in 1999. The explicit rationale for raising the excise tax was to reduce tobacco use and advance public health, although a welcome side effect was that it raised excise tax revenues as well.10

Between 1994 and 2012, the real excise tax increased by 407% and the real price of cigarettes increased by 229% (see figure 1). Over this period, aggregate legal consumption of cigarettes fell by 38%, per capita consumption fell by 52% and regular smoking prevalence decreased from about 31% to 18.2%.10 ,11 The relatively large increases in cigarette prices over nearly 20 years allow one to investigate the relationship between cigarette prices and regular smoking initiation.

Figure 1

Aggregate cigarette consumption and price of cigarettes, 1970–2013.


We use data from the first three waves of the National Income Dynamics Study (NIDS), which is the first nationally representative household panel study in South Africa.12–14 A stratified, two-stage cluster sample design was used in sampling the households to be included in the base wave.15 Although the data are longitudinal, we did not use the longitudinal characteristics of the data, primarily because the change in the real price between waves was modest (only 4% per year between 2008 and 2012). Instead, we combined data from all three waves to increase the sample size. The same individuals across waves are entered only once in the study sample.

NIDS has five smoking-related questions, of which three are relevant for this study. The questions ‘Do you smoke cigarettes?’ and ‘Did you ever smoke cigarettes regularly?’ were used to identify ever-smokers. Current smokers and former smokers were asked ‘How old were you when you first smoked cigarettes regularly?’. Since ‘smoking’ and ‘regular smoking’ are not specifically defined in the NIDS questionnaire (eg, every day, at least 3 days a week, at least every week), it is open to different interpretations by respondents, resulting in some measurement error. This information, together with information on the year of birth, was used to determine the year in which the person started regular smoking. The descriptive statistics of the sample (unweighted data) are shown in table 1.

Table 1

Descriptive statistics of study sample

As is typical of panel studies, there were sometimes substantial differences in respondents' answers about their smoking behaviour between waves. A person was coded as an ever-smoker if he or she indicated in at least one wave that he or she was a current or former smoker. Given the stigma attached to smoking in some communities, respondents might lie about their smoking behaviour, especially if a family member is present during the interview. Where a respondent gave inconsistent answers about age of regular smoking initiation across waves, the median age was used.

Data on average cigarette prices are derived from Statistics South Africa, previously known as Central Statistical Services (CSS). The price data for 1970–1989 were taken from the CSS's Report on Prices, while 1990–2012 price data were obtained directly from Statistics South Africa. The data used in the analysis refer to the average price of a pack of 20 cigarettes in the popular price range (which comprises about 70% of the market). For 2002–2005 and 2008–2012, provincial averages were used. These were the only years for which provincial data were available. In all other years, national averages were used. Between 1970 and the early 2000s, cigarettes sold in a narrow price range and the average price reflected the prices of most cigarette brands fairly accurately. The entry of low-price cigarettes into the market since the early 2000s has increased the degree of price variation.


The study uses duration analysis estimation, which focuses not only on the probability of the event taking place, but specifically on the time to the event.16 A pseudo-panel was constructed using the NIDS data. We use discrete-time hazard models and a logit specification. As a functional form for the baseline hazard function, we use a dummy specification for time at risk and a quartic polynomial specification (both measured in years). The temporal dummy specification allows for a flexible approach to account for duration dependency and does not impose a specific functional form.

Since regular smoking prevalence in South Africa varies significantly by gender (males 39%, females 10% in 2010), separate models were estimated for males and females, following four other papers.17–20 To reduce the impact of recall error, the sample is limited to all people who were aged ≤ 48 in 2008. By limiting the age cohort to exclude older generations, we reduce the differential mortality bias, since smoking-related mortality differences among a younger age cohort are small.21 We include individuals aged 48 and younger in 2008, but only follow individuals until age 30, since very few people start smoking after age 30. An individual drops from the sample once he/she starts smoking. Although recall error is a significant risk, the literature indicates that historical smoking prevalence estimates reported retrospectively are valid.22 In line with other studies, we assume that a person is at risk of regular smoking from age 10.19 ,23–25 We account for regional variation in regular smoking initiation using an urban/rural variable. Average cigarette prices are included in the equations as levels (not logs). SEs are clustered at the household level.

Parents' education is used as a proxy for household income at the time when the respondent was a child or teenager. Given the well-established relationship between education and income, parents' education is likely to be a better indicator of household income at the time the person was susceptible to starting smoking than current income levels. The variable is coded as either parent's highest level of education.ii

In order to account for the fact that there may be unobserved determinants of smoking behaviour, we considered including unemployment and per capita gross domestic product (GDP) in the regression equation.26 However, given measurement errors over time in unemployment and per capita GDP, and the fact that per capita GDP is quite a distorted measure of average income in South Africa, we include a proxy for confidence in the economy, calculated as gross domestic capital formation as a percentage of GDP.

We tested the robustness of the results by means of a discrete-time split population model, which relaxes the assumption that all individuals will eventually start regular smoking. This approach has been followed in a number of other studies.5 ,18 ,19 In the initial analysis, we tested the impact of tobacco control policy changes in 1995 (warning labels on tobacco packaging and advertising material, and ban on smoking on public transport) and 2001 (tobacco advertising and sponsorship ban, prohibition of smoking in all public and workplaces and prohibition of distributing free cigarettes and the sale of single cigarettes), but the impact was found to be insignificant. The models were estimated using Stata V.12.1. Split population models were estimated using the spsurv command.iii


The results of the determinants of regular smoking initiation for males and females are presented in tables 2 and 3, respectively. Models 1–3 exclude the impact of parental regular smoking. Models 1 and 3 account for duration dependency by including temporal dummies. Model 3 is the spsurv estimation. Models 2, 4 and 5 account for duration dependency using a fourth-order polynomial.

Table 2

Determinants of regular smoking initiation for males (ORs)

Table 3

Determinants of regular smoking initiation for females (ORs)

The samples in models 4 and 5 are much smaller since we do not have information on parental regular smoking for all respondents. Since we only have parental regular smoking information at the time of the interview, the variable is time invariant. Model 4 is included as a reference to model 5 so that the effects of parental regular smoking can be analysed in comparison to the same sample of individuals.

The estimated ORs for models 1 and 2 and the HR for model 3 are broadly similar, at least within each gender group. Males' regular smoking initiation is significantly affected by the price of cigarettes. These price effects are likely to be upper limits because we do not control for other tobacco control interventions of 1995 and 2001. The OR on the price variable of 0.984 (model 1) implies that a male who is subject to a 1 rand (about US$0.10 in 2014) increase in the price has a 1.6% ((1–0.984)×100) lower probability of initiating regular smoking than a male who is not subject to the price increase. For model 2, a one rand increase reduces the probability of regular smoking initiation among males by 1.7%. These results are scalable; a 10 rand (about US$1.00) increase in the price of cigarettes reduces the probability of starting regular smoking by between 14.8% (=1-e−0.016×10) and 15.6% (=1-e−0.017×10). Whereas these results refer to the percentage change in regular smoking initiation in response to an absolute change in the price, the price elasticity of initiation refers to the percentage change in regular smoking initiation in response to a percentage change in the price. This elasticity suggests that for every 10% increase in price, the probability of initiation among males decreases by 2.1% at the mean (table 2, model 1). On the other hand, females' regular smoking initiation for the sample as a whole is not significantly impacted by the price of cigarettes.

Living in an urban area increases the probability of initiating regular smoking for males (OR=1.232, 95% CI 1.127 to 1.345) and females (OR=1.500, 95% CI 1.244 to 1.809) (model 1).

We tested the robustness of the results using low-price/discounted brands (which comprise 10–15% of the market) and found that the estimates were similar to those of the popular price category brands.

In terms of socioeconomic status, there is a negative socioeconomic gradient for regular smoking initiation, but for males only. Compared to the base (where the most educated parent has primary education or less), the probability of initiating regular smoking decreases, although not significantly so, if the most educated parent has completed secondary school (OR=0.960, 95% CI 0.820 to 1.123), and becomes significantly lower if the most educated parent has some tertiary education (OR=0.774, 95% CI 0.638 to 0.939).

A total of 82.4% of males and 82.0% of females in the sample indicate that they are literate. Being literate significantly reduces the probability of initiating regular smoking, for both males and females (OR=0.615, 95% CI 0.557 to 0.680 for males and OR=0.6184, 95% CI 0.511 to 0.739 for females).

Approximately 5.5% of respondents indicated that their mother died before they were aged 15 years. For males, having lost one's mother before age 15 is positively, but insignificantly, associated with regular smoking initiation (table 2, model 1: OR=1.091, 95% CI 0.906 to 1.313). For females, having lost one's mother by age 15 is positively and significantly associated with regular smoking initiation (table 3, model 1: OR=1.353, 95% CI 0.999 to 1.839).

An increase in confidence in the economy is associated with a decrease in the probability of smoking initiation for males (table 2, model 1: OR=0.980, 95% CI 0.970 to 0.990) and females (table 3, model 1: OR=0.995, 95% CI 0.979 to 1.010), but this result is only significant for males.

The NIDS questionnaire did not ask respondents about the smoking behaviour of their parents. We thus used a subset of the data where we were able to link the parents' regular smoking-related responses to the children's responses (1732 males and 1986 females, compared to the full sample of 7771 males and 9920 females). For both males and females (model 5 of tables 2 and 3), having at least one parent who smokes is correlated with the probability of initiating regular smoking (OR=1.943, 95% CI 1.587 to 2.378 for males and OR=2.064, 95% CI 1.371 to 3.107 for females).

In each of the estimated models above (except the split population model), we included race and time-at-risk interaction terms in order to determine the regular smoking initiation hazard rates for each of the race groups. Based on models 1 and 2, we derived the probability of initiating regular smoking for Africans, mixed-race ancestry, and whites and Asians (the latter two groups combined). In all cases, we assume a typical person is literate, lives in an urban area, has at least one parent who has completed secondary school and whose mother did not die before the person was aged 15 years. The price of cigarettes is assumed to be at the 2008 level (ie, R21.41 per pack in constant December 2010 prices). The results for males and females are shown in figures 2 and 3, respectively.

The irregularity of the hazard functions based on the temporal dummy specification (shown as the thin lines of the spiked curves in figures 2 and 3) reflects the heaping effect and recall error. For example, there are substantial spikes at ages 16 and 18. The smoothed hazard functions based on the fourth-order polynomial specification do not suffer from the heaping effect distortions. Based on the smoothed estimates, regular smoking initiation is highest among mixed-race males, peaking at age 17.2 years with a hazard rate of 0.114.iv Thus, ∼11.4% of mixed-race males who have not initiated regular smoking and who have the characteristics described in the previous paragraph are expected to initiate regular smoking past their 17th birthday. Among whites and Asians, the regular smoking initiation hazard peaks at 17.0 years, at a hazard rate of 0.088. Regular smoking initiation is substantially lower among Africans and peaks substantially later (18.5 years), at a hazard rate of 0.054.

Figure 2

Regular smoking initiation hazard rates for males, by race.

Figure 3

Regular smoking initiation hazard rates for females, by race.

Female regular smoking initiation in South Africa is lower than male regular smoking initiation. There are large differences in female regular smoking initiation between race groups. Among mixed-race females, regular smoking initiation peaks at 16.8 years, with a hazard rate of 0.092. Regular smoking initiation among white and Asian females peaks at 18.3 years, with a hazard rate of 0.062. Very few African females initiate regular smoking, indicated by the fact that the hazard peaks at 18.1 years at 0.002.


The current study contributes to the growing literature on regular smoking onset in low-income and middle-income countries. Since the tobacco epidemic is shifting towards low-income and middle-income countries, there is an increasing urgency to perform studies in these countries. Higher cigarette excise taxes, which lead to higher retail prices, reduce regular smoking prevalence by encouraging smokers to quit and by discouraging young people from starting regular smoking.

We found that the increase in cigarette prices from R7.24 in 1994 to R20.93 in 2012 decreased regular smoking initiation among males by between 14.0% and 20.8%, depending on the specification of the model.

For the full female sample, we do not find a significant relationship between the price of cigarettes and regular smoking initiation, a result in line with others.17 ,18 ,27 For a subsample of females for which we have information on parental regular smoking behaviour, an increase in cigarette prices reduces regular smoking initiation, although the effect is only significant at the 10% level.

We estimate the price elasticity of initiation is between −0.21 and −0.47 for males and −0.07 to −0.28 for females, depending on the specification. The price elasticity is significant for males only, a similar finding to Cawley et al.27 In general, our price elasticities of initiation are lower compared to other studies. Cawley et al27 find a price elasticity of initiation of −1.2 for males. Laxminarayan and Deolalikar6 consider smoking initiation among Vietnamese males and find a price elasticity of initiation of −1.18. Zhang et al28 find an elasticity of initiation with respect to cigarette price is −3.36 for males and females in Canada.6

While an increase in the price of cigarettes has a direct impact on regular smoking initiation among males, it also has an indirect impact on both males and females. The results indicate that regular smoking initiation is correlated with regular parental smoking. Regular smoking prevalence in South Africa has decreased sharply over the past 20 years, primarily because of an increase in the price of cigarettes.10

Furthermore, we find a negative gradient between parents' education and regular smoking initiation. Children of parents with limited education are more likely to start regular smoking than children of parents with more education. Education levels in South Africa are gradually improving over time, and should the trends found in this paper continue, this should gradually lead to reduced regular smoking initiation in the next generation. Closely associated with this is the finding that literate people are less likely to initiate regular smoking than illiterate people. As education levels improve, illiteracy recedes, with positive long-term tobacco control consequences.

While the relationship between parents' education and smoking behaviour on young people's regular smoking initiation (especially males) has positive tobacco control consequences in the long term, these operate too slowly to have practical application as a tobacco control mechanism. A more urgent approach is required. In this context, it is encouraging to see that the large increases in the price of cigarettes in South Africa over the past two decades have resulted in higher quit rates and have significantly decreased regular smoking initiation among males, who comprise by far the largest group of smokers (and potential smokers) in South Africa.

We note several limitations. First, there is likely to be recall error since individuals often do not remember the exact age they started regular smoking. We attempted to reduce the impact of recall error by limiting the sample to all individuals who were younger than 48 years in 2008 and accounting for heaping by interacting price with the years where heaping is most visible (ages 16 and 18). Second, we do not account for price variation across brands. Before 2000, this would not have impacted the results significantly since there was little variation in prices across brands. However, since 2000, there has been an increase in price variation across brands. For the years 2002–2005 and 2007–2012, we match cigarette prices by province to current province where the respondent resides at the time of interview. Since province at the time of interview and province at the time of making the decision to start smoking may not be the same, there is likely to be some measurement error. Third, we do not account for an HIV-related mortality bias. In a country where the prevalence of the AIDS epidemic is high, it is possible that a mortality bias might affect the sample if the propensity of HIV infection is correlated with smoking through risk preferences. If the distribution of risk preferences has changed over time, then this will affect the price elasticity of the price estimates. Fourth, it is possible that the effect of parental smoking may be attenuated by a gene-grounded propensity to act impulsively. Fifth, the price coefficients may be biased if prices are endogenous. Producers may set their prices as a function of consumer tastes and consumers' expected price responses.

South Africa should continue to increase the price of cigarettes through excise taxes. However, if consumers are able to buy cheaper illicit cigarettes, the impact of price increases is likely to be reduced. Governments should therefore implement measures to reduce the illicit trade of cigarettes as outlined by the WHO's Framework Convention on Tobacco Control's Protocol to Reduce Illicit Trade.29


The authors are very grateful to Emmanuel Guindon from the Centre for Health Economics and Policy Analysis at McMaster University for his assistance with the methodology.


View Abstract


  • Contributors CvW conceived of the study. NV conducted the literature review. CvW and NV contributed to data analysis and writing of the paper.

  • Funding This work was supported by the Bill & Melinda Gates Foundation through the American Cancer Society (IRMA 17078) and the African Capacity Building Foundation (IRMA 20177).

  • Competing interests None declared.

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Data sharing statement No additional data are available.

  • i ‘Sufficient evidence’ is defined as follows: ‘An association has been observed between the intervention under consideration and a given effect in studies in which chance, bias and confounding can be ruled out with reasonable confidence. The association is highly likely to be causal’ (IARC, 2011: 355).

  • iii Spsurv, developed by Stephen Jenkins, uses a cloglog specification which reports HRs (not ORs). The response curve is asymmetric, unlike the logit specification.

  • iv The decimals were derived using a more precise gradation of length of time at risk than the integer values shown in table 2.

Request permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Linked Articles