您好,欢迎来到小侦探旅游网。
搜索
您的当前位置:首页05_05Health-related non-response in the BHPS and ECHP using inverse probability weighted estimators

05_05Health-related non-response in the BHPS and ECHP using inverse probability weighted estimators

来源:小侦探旅游网


HEDG Working Paper 05/05

Health-related non-response in the BHPS and ECHP: using inverse probability weighted

estimators in nonlinear models

Andrew M Jones Xander Koolman Nigel Rice

June 2005

ISSN 1751-1976

york.ac.uk/res/herc/hedgwp Health-related non-response in the BHPS and ECHP: using inverse probability

weighted estimators in nonlinear models

Andrew M Jones1, Xander Koolman2 & Nigel Rice3

Abstract

This paper considers health-related non-response in the first eleven waves of the British Household Panel Survey (BHPS) and the full eight waves of the European Community Household Panel (ECHP) and explores its consequences for dynamic models of the association between socioeconomic status and self-assessed health (SAH). We describe the pattern of health-related non-response revealed by the BHPS and ECHP data. We both test and correct for non-response in empirical models of the impact of socioeconomic status on self-assessed health. Descriptive evidence shows that there is health-related non-response in the data, with those in very poor initial health more likely to drop out, and variable addition tests provide evidence of non-response bias in the panel data models of SAH. Nevertheless a comparison of estimates - based on the balanced sample, the unbalanced sample and corrected for non-response using inverse probability weights (IPW) – shows that, on the whole, there are not substantive differences in the average partial effects (APE) of the variables of interest. The main differences are between unweighted and one form of IPW-weighted estimates for the APE of income and education in those countries that have fewer than eight waves of data. Similar findings have been reported concerning the limited influence of non-response bias in models of various labour market outcomes; we discuss possible explanations for our results.

Keywords: Self-assessed health, attrition, non-response, non-participation, selection on observables, inverse probability weighting, BHPS, ECHP

* Data from the British Household Panel Survey (BHPS) were supplied by the UK Data Archive. Neither the original collectors of the data nor the Archive bear any responsibility for the analysis or interpretations presented here. The ECHP UDB – version of December 2003 was supplied by Eurostat. This paper derives from the project “The dynamics of income, health and inequality over the lifecycle” (known as the ECuity III Project), which is funded in part by the European Community’s Quality of Life and Management of Living Resources programme (contract QLK6-CT-2002-02297). Nigel Rice is in part supported by the UK Department of Health programme of research at the Centre for Health Economics. We are grateful to Maarten Lindeboom, John Wildman and participants at the RSS/ESRC Research Methods Programme conference on Statistical methods for attrition and non-response in social surveys, London May 2004, the ECuity III Project workshop, Helsinki June 2004, and the York Seminars in Health Econometrics (YSHE) for their comments on earlier versions of this paper.

1Corresponding author: Department of Economics and Related Studies, University of York, Heslington, York, YO10 5DD, United Kingdom: Tel: +44-1904-433790; Fax: +44-1904-433759; Email: amj1@york.ac.uk2Institute of Health Policy and Management, Erasmus University Medical Center, The Netherlands 3Centre for Health Economics, University of York, United Kingdom

1

1. Introduction

The objective of this paper is to explore the existence of health-related non-response in panel data and its consequences for modelling the association between socioeconomic status (SES) and self-assessed health (SAH). Using panel data, such as the British Household Panel Survey (BHPS) or European Community Household Panel (ECHP), to analyse longitudinal models of health creates a risk that the results will be contaminated by bias associated with longitudinal non-response. There are drop-outs from the panels at each wave and some of these may be related directly to health: due to deaths, serious illness and people moving into institutional care. In addition, other sources of non-response may be indirectly related to health, for example divorce may increase the risk of non-response and also be associated with poorer health than average. The long-term survivors who remain in the panel are likely to be healthier on average compared to the sample at wave 1. The health of survivors will tend to be higher than the population as a whole and their rate of decline in health will tend to be lower. Also, the socioeconomic status of the survivors may not be representative of the original population who were sampled at wave 1. Failing to account for non-response may result in misleading estimates of the relationship between health and socioeconomic characteristics. To address this issue we describe the pattern of health-related non-response revealed by the BHPS and ECHP data and we test and correct for non-response in empirical models of self-assessed health (SAH).

There are many recent studies that have used the BHPS, ECHP and other similar panels to estimate models involving measures of health and that have used regression analyses based on balanced or unbalanced panels which may be prone to problems of non-response. Examples include: Benzeval and Judge (2001) who analyse health and SES with the BHPS; Meer et al, (2003) who analyse health and SES with the US PSID; Buckley et al (2004) who analyse SAH and SES with the Canadian SLID; Contoyannis et al (2004a) who analyse health limitations in the BHPS; Wildman (2003) who analyses the relationship between mental health and SES in the BHPS; and Riphahn (1999) who analyses retirement and health with the GSOEP.

The paper adopts a broad definition of longitudinal non-response, that encompasses any observations that “drop-out” from the original wave 1 sample over the subsequent T waves. To borrow the taxonomy of reasons for non-participation used by Nicoletti and Peracchi (2005), non-response can arise due to:

1. Demographic events such as death.

2. Movement out of scope of the survey such as institutionalization or emigration. 3. Refusal to respond at subsequent waves.

2

4. Absence of the person at the address. 5. Other types of non-contact.

To these points, we would add item non-response for any of the variables used in the model of health, which eliminates these observations from the sample. The notion of attrition, commonly used in the survey methods literature, is usually restricted to points 3, 4 and 5. However our concern is with any longitudinal non-response that leads to missing observations in the panel data regression analysis. In fact it is points 1 and 2 – death and incapacity – that are likely to be particularly relevant as sources of health-related non-response. The original sample consists of those who provide a full interview and usable information on SAH at the first wave of the panels. Non-response encompasses all of those who fail to provide usable observations for the model of SAH at subsequent waves.

Our aim is to estimate models that focus on the relationship between health (SAH) and socioeconomic status (SES). We take a representative sample of individuals at wave 1 and follow them for the 11 or 8 years of the BHPS and ECHP panels. The sample of interest is those n original individuals observed over a full T-year period (T=11 for BHPS and T=8 for ECHP). A fully observed sample from this population would consist of nT observations. Due to non-response we only observe ∑Ti observations. The reasons for having incomplete observations include

i=1n

attrition (as conventionally defined in the survey methods literature) as well as individuals becoming ineligible, due to incapacity or death. This creates a problem of incidental truncation: we are interested in the association between SAH and SES for our n individuals over the full T waves. However the more frail individuals are more likely to die or drop-out before the end of the observation period, and their levels of SAH and SES are unobservable. This means that the remaining observed sample of survivors may contain less frail individuals – this is the source of potential bias in the relationship between SAH and SES across our sample of n individuals.

We apply variable addition tests for attrition bias (Verbeek and Nijman, 1992) and inverse probability weighting to adjust for non-response in estimation of pooled models (Robins et al. 1995; Fitzgerald et al., 1998; Moffitt et al., 1999; Wooldridge, 2002a). Descriptive evidence shows that there is health-related non-response in the data, with those in poor initial health more likely to drop out, and variable addition tests provide evidence of non-response bias in panel data models of SAH. Nevertheless a comparison of estimates - with and without correcting for non-response using inverse probability weights - does not show substantive differences in the average partial effects of the variables of interest. So, while health-related non-response exists, it does not appear to distort the magnitudes of the estimated effects of socioeconomic status. Similar findings have been

3

reported concerning the limited influence of non-response bias in models of various labour market outcomes; we discuss possible explanations for our results.

The structure of the paper is as follows. Section 2 introduces the BHPS and ECHP datasets. Section 3 presents a descriptive analysis of health-related non-response in both surveys. In Section 4 we introduce the empirical models for self-assessed health and describe the estimation strategy. Section 5 reports and discusses the results for the models of socioeconomic status and self-assessed health and a conclusion is provided in Section 6.

2. The Data

2.1 BHPS

The sample

We first exploit the panel data available in the first eleven waves (1991-2001) of the British Household Panel Survey (BHPS). The BHPS is a longitudinal survey of private households in Great Britain that provides rich information on socio-demographic and health variables. It was designed as an annual survey of each adult (16+) member of a nationally representative sample of more that 5,000 households, with a total of approximately 10,000 individual interviews. The first wave of the survey was conducted between 1st September 1990 and 30th April 1991. The initial selection of households for inclusion in the survey was performed using a two-stage clustered systematic sampling procedure designed to give each address an approximately equal probability of selection (Taylor et al., 1998). The same individuals are re-interviewed in successive waves and, if they split off from their original households are also re-interviewed along with all adult members of their new households. In this analysis we use both balanced samples of respondents, for whom information on all the required variables is reported at each wave, and unbalanced samples, that exploit all available observations for wave 1 respondents. Both samples do not include new entrants to the BHPS; they only track all of those who were observed at wave 1. In this sense, the analysis treats the sample as a cohort consisting of all those present at wave 1. To be included in the analysis individuals must be original sample members (OSMs) who were aged 16 or over and who provided a valid response for the health measure at wave 1. Our broad definition of non-response encompasses all individuals who are missing at subsequent waves.

4

Measures of health

The principal health outcome is self-assessed health (SAH), defined by a response to: ‘Please think back over the last 12 months about how your health has been. Compared to people of your own age, would you say that your health has on the whole been excellent/good/fair/poor/very poor?’ SAH should therefore be interpreted as indicating a perceived health status relative to the individual’s concept of the ‘norm’ for their age group. SAH has been used widely in previous studies of the relationship between health and socioeconomic status (e.g., Ettner, 1996; Deaton and Paxson, 1998; Smith, 1999; Benzeval et al., 2000; Salas, 2002; Adams et al., 2003; Frijters et al., 2003; Contoyannis et al., 2004b) and of the relationship between health and lifestyles (e.g., Kenkel, 1995; Contoyannis and Jones, 2004). SAH is a simple subjective measure of health that provides an ordinal ranking of perceived health status. However it has been shown to be a powerful predictor of subsequent mortality (see e.g., Idler and Kasl, 1995; Idler and Benyamini, 1997) and its predictive power does not appear to vary across socioeconomic groups (see e.g., Burström and Fredlund, 2001). Socioeconomic inequalities in SAH have been a focus of research (see e.g., van Doorslaer et al., 1997; van Ourti, 2003; van Doorslaer and Koolman, 2004) and have been shown to predict inequalities in mortality (see e.g., van Doorslaer and Gerdtham, 2003). Categorical measures of SAH have been shown to be good predictors of subsequent use of medical care (see e.g., van Doorslaer et al., 2000; van Doorslaer et al., 2004).

Unfortunately there was a change in the wording of the SAH question at wave 9 of the BHPS. For waves 1-8 and 10-11, the SAH variable represents “health status over the last 12 months”. However, the SF-36 questionnaire was included in wave 9. In this questionnaire, the SAH variable for wave 9 represents “general state of health”, using the question: “In general, would you say your health is: excellent, very good, good, fair, poor?”. Note that the question is not framed in terms of a comparison with people of one’s own age and the response categories differ from the other waves. Item non-response is greater for SAH at wave 9 than for the other waves and these factors would complicate the analysis of non-response rates. Hernandez et al. (2004) have explored the sensitivity of ordered probit models of SAH to this change in the wording, but for simplicity we exclude wave 9 from the analysis.

Other indicators of morbidity are used to describe health-related non-response and as predictors of non-response. The BHPS variable HLLT measures self-reported functional limitations and is based on the question “does your health in any way limit your daily activities compared to most people of your age?” Respondents are left to define their own concepts of health and their daily activities. In contrast, for the variable measuring specified health problems (HLPRB), respondents are presented with a prompt card and asked, “do you have any of the health problems or disabilities listed on this

5

card?” The list is made up of problems with arms, legs, hands, etc; sight; hearing; skin conditions/allergies; chest/breathing; heart/blood pressure; stomach/digestion; diabetes; anxiety/depression; alcohol/drug related; epilepsy; migraine and other (cancer and stroke were added as separate categories in wave 11 but are not included here). Also respondents are asked to report whether they are registered as a disabled person (HLDSBL).

Socioeconomic status

Two dimensions of socioeconomic status are included in our models of SAH: income and education. Income is measured as equivalised and RPI-deflated annual household income (INCOME). This variable is transformed to natural logarithms to allow for concavity of the relationship between health and income (e.g., Ettner, 1996; Frijters et al., 2003; van Doorslaer and Koolman, 2004; Contoyannis et al., 2004a,b). Education is measured by the highest educational qualification attained by the end of the sample period in descending order of attainment (DEGREE, HND/A, O/CSE). NO-QUAL (no academic qualifications) is the reference category for the educational variable. In addition to income and education, variables are included to reflect individuals’ demographic characteristics and stage of life: age, ethnic group, marital status and family composition. Marital status distinguishes between WIDOW, SINGLE (never married) and DIVORCED/SEPARATED, with married or living as a couple as the reference category. Similarly, we include an indicator of ethnic origin (NON-WHITE), the number of individuals living in the household including the respondent (HHSIZE), and the numbers of children living in the household at different ages (NCH04, NCH511, NCH1218). Age is included as a fourth-order polynomial, (AGE, AGE2 = AGE2/100, AGE3 = AGE3/10000, AGE4 = AGE4/1000000). A vector of wave dummies is included to account for aggregate health shocks, time-varying reporting changes, and any effects of age which are not captured by the polynomial.

2.2 ECHP

The sample

The detailed analysis of the BHPS is complemented by a second source of data: the full eight waves, 1994-2001, of the European Community Household Panel User Database (ECHP-UDB) designed and coordinated by Eurostat, the European Statistical Office. This puts the UK data in the context of a broader analysis of patterns of health-related non-response across European countries. The ECHP is a standardised multi-purpose annual longitudinal survey carried out at the level of the European Union (Peracchi, 2002). The survey is based on a standardised questionnaire that involves annual interviewing of a representative panel of households and individuals of 16 years and

6

older in each of the participating EU member states. It covers a wide range of topics including demographics, income, social transfers, health, housing, education and employment. We use data for the following fourteen member states of the EU for the full number of waves available for each: Austria (waves 2-8), Belgium (1-8), Denmark (1-8), Finland (3-8), France (1-8), Germany (1-3), Greece (1-8), Ireland (1-8), Italy (1-8), Luxembourg (1-3), Netherlands (1-8), Portugal (1-8), Spain (1-8) and the United Kingdom (1-3). Sweden did not take part in the ECHP although the living conditions panel is included with the UDB. The ECHP-UDB also includes comparable versions of the BHPS and German Socioeconomic Panel (GSOEP) and descriptive evidence is provided for these.

Measures of health

Self-assessed general health status (SAH) is measured as either very good, good, fair, poor or very poor. Unlike the BHPS, respondents are not asked to compare themselves with others of the same age. In France a six-category scale was used but this is recoded to the five-category scale in the ECHP-UDB. Responses are also available for the question “Do you have any chronic physical or mental health problem, illness or disability? (yes/no)” and if so “Are you hampered in your daily activities by this physical or mental health problem, illness or disability? (no; yes, to some extent; yes, severely)”. We use two dummy variables to indicate either some limitation or severe limitation.

Socioeconomic status

The ECHP income measure is disposable household income per equivalent adult, using the modified OECD equivalence scale (giving a weight of 1.0 to the first adult, 0.5 to the second and each subsequent person aged 14 and over, and 0.3 to each child aged under 4 in the household). Total household income includes all net monetary income received by the household members during the reference year. Education is measure by the highest level of general or higher education completed, i.e. third level education (ISCED 5-7), second stage of secondary level education (ISCED 3-4) or less than second stage of secondary education (ISCED 0-2)). Marital status distinguishes between married/living in consensual union, separated/divorced, widowed and unmarried. Activity status includes employed, self-employed, student, unemployed, retired, doing housework and ‘other economically inactive’. Region of residence uses the EU’s NUTS 1 level (Nomenclature of Statistical Territorial Units) except for countries where such information was withheld for confidentiality reasons (The Netherlands, Germany) or because the country is too small (Denmark, Luxembourg).

7

3. Descriptive analysis of non-response rates

3.1 BHPS

Table 1 shows how the sample size and composition evolves across the waves of the BHPS for respondents who provided information on SAH. The table, which gives figures men and women separately, shows the number of observations that are available at each wave and the corresponding number of drop-outs and re-joiners between waves. These are expressed as wave-on-wave survival and drop-out rates. The survival rate is the percentage of original sample members remaining at wave t. The drop-out rate is the percentage of the number of drop-outs between waves t-1 and t to the number of observations at t-1. The raw drop-out rate excludes re-joiners, while the net drop-out rate includes them. Drop-out rates are highest between waves 1 and 2, with the rate tending to decline over time. The table also disaggregates the raw drop-out rates according to individuals’ SAH at wave t-1. This shows that drop-out rates are inversely related to past health and, in particular, non-response is highest among those who were in very poor health prior to dropping-out. This pattern of health-related non-response persists throughout the panel and is stronger for men than women.

Table 2 shows that the overall drop-out rate across all 11 waves of the panel varies with socioeconomic characteristics measured at wave 1. The average rate of drop-out over 11 waves is 39%. As expected, non-response increases with individuals’ age at the start of the panel, ranging from 36% for those aged under 30 to 73% for those aged over 70. Some of this age-related non-response is likely to be associated with health, through deaths, serious illness and moves to institutional care. Non-response is greater among those with lower income and with less formal education: the poorest quintile have an overall drop-out rate of 58%, compared to 32% among the richest quintile; those with no qualifications have an overall drop-out rate of 48% compared to 26% among those with a degree. The table also shows that health-related non-response interacts with individuals’ socioeconomic characteristics (some caution is required as some of the cell sizes are very small). So, for example, drop-out rates are very high among elderly individuals (aged >70) who start the survey in poor (87%) or very poor health (95%).

Tables 1 and 2 provide a description of simple bivariate relationships between drop-out rates and socioeconomic characteristics. To extend this to a multivariate analysis Table 3 presents probit models for response/non-response at each wave of the panel, from wave 2 to wave 11, using the full sample of men who are observed at wave 1 (the results for women are similar and are available from the authors on request). The dependent variables for these models equal 1 if the individual

8

responds at the wave in question and 0 otherwise and are always defined relative to the full sample at wave 1 (where a response is defined as providing a usable observation for the ordered probit regression models). The probability of response is modelled as a function of the wave 1 values of all of the regressors that are included in our empirical model of SAH, along with additional wave 1 variables for region (NORTH-WEST, NORTH-EAST, SOUTH-EAST, SOUTH-WEST, MIDLANDS, SCOTLAND, WALES), activity status (SELF-EMPLOYED; UNEMPLOYED; RETIRED; family care and maternity leave, FAMILY-CARE; government training, students and other, EMP-OTHER) and occupational group (UNCLASSIFIED; MANUAL-TECHNICAL; skilled non-manual, SKILL-NON-MAN; skilled manual and armed forces, SKILLED-MANUAL; PART-SKILLED; UNSKILLED, LONG-TERM-SICK) and other indicators of morbidity (HLPRB, HLDSBL, HLLT). These additional observable variables form the basis of the inverse probability weighting approach to correcting for non-response, which is described in more detail below.

The table shows the partial effects of the regressors on the probability of response at each wave, along with an indication of which of these are statistically significant at the 5% level and 1% levels. The partial effects are computed as marginal effects for continuous regressors and average effects for discrete regressors, evaluated at the sample means of the other regressors in the model. These results reveal statistically significant associations between non-response and levels of educational attainment for both men and women. Those with DEGREE, HND/A and O/CSE qualifications are more likely to remain in the sample and the magnitude of this effect increases over the waves. On average, a man with a degree has a 0.07 higher probability of responding at wave 2, relative to one without academic qualifications. By wave 11 they have a 0.169 higher probability of responding. For women the corresponding figures are 0.084 and 0.202. Non-whites are less likely to remain in the sample, and this effect increases in magnitude as time progresses. By wave 11 the probability of responding among non-white men is 0.141 lower and among women it is 0.175 lower. There is no clear evidence of statistically significant income-related non-response.

The pattern of health-related non-response shows that, for both men and women, very poor initial health (SAHVPOOR) is associated with lower response rates, as is functional limitations (HLLT). These associations grow in magnitude and attain statistical significance as the panel lengthens. Disability (HLDSBL) does not show a clear-cut pattern in the multivariate analysis and health problems (HLPRB) shows that those who report health problems are more likely to respond at all waves. This may be because the variable HLPRB encompasses some relatively minor ailments – such that the majority of the sample report having at least one of them – and, after controlling for

9

other measures of health this variable may be capturing other forms of non-response such as geographic mobility among healthy young people and the ease of making contact with interviewees.

3.2 ECHP

Table 4 reports the overall drop-out rates across the available waves for each of the countries that participated in the ECHP, along with comparable samples from the BHPS and German Socioeconomic Panel (GSOEP) that are included in the ECHP-UDB. The evidence reinforces earlier studies (Peracchi, 2002; Behr et al., 2002; Behr, 2004). In particular the UK and Ireland stand out as having above average rates of drop-out, with 45% drop-out after only three waves in the UK and 69% after eight waves in Ireland. The high non-response in the UK is largely attributable to the decision by the national data unit (NDU) to follow only households with complete sets of personal interviews, rather than adopting the standard ECHP following rules. For the other countries that participated for the full eight waves overall drop-out rates are broadly similar, ranging from 40% in Italy to 49% in Spain. Germany and Luxembourg only participated for waves 1-3 and have low drop-out rates of 13% and 12%. The drop-out rates over the comparable period of 29% for the BHPS and 33% for the GSOEP are lower than the ECHP, reflecting the fact that these samples were established prior to 1994. As in the BHPS there is evidence of health-related non-response in the ECHP. When the samples are split by initial levels of self-assessed health a consistent pattern emerges across all countries, with higher rates of non-response among those in poor or very poor initial health. The gradient is not always monotonic, in some countries the lowest drop-out rates are for those with good or, in the case of Luxembourg, fair health.

To provide a sense of how drop-out rates vary by socioeconomic characteristics, Table 5 shows the overall drop-out rates across the available waves and across the countries split by socioeconomic characteristics at the first wave. The table shows the drop-out rates across all available waves for the upper and lower categories of income and education. However the results should be treated with caution as the number of observations in some of the cells are quite small. Patterns of overall non-response by income and education are different across countries: with a positive income gradient in Greece, Ireland, Italy, Portugal and Spain and a positive education gradient in Belgium, Denmark, Finland, France Germany, Netherlands and the UK. Generally the pattern of health-related drop-outs are similar across income and education groups, taking account of the small cell sizes in some cases for very poor health..

10

4. Models and estimation methods

4.1 The ordered probit model

To model the association between the current level of self-assessed health (SAH) and socioeconomic status (SES) we use pooled ordered probit specifications of a dynamic model (see e.g., Contoyannis et al., 2004b). The latent variable specification of the model can be written as:

(1)

h*it = β’xit + εit (i=1,…,N; t=2,…Ti)

where i denotes individuals and t denotes the waves of the panel; h*it is a latent variable that underlies reported SAH; xit is a set of regressors, that includes dummy variables for each category of SAH in the previous year (to capture dynamics), along with observed socioeconomic variables; and εit is a time and individual-specific error term, which is assumed to be normally distributed. The pseudo-ML estimator of the pooled ordered probit model is consistent even if the error terms are not serially independent and does not require that the regressors are strictly exogenous, so it can accommodate pre-determined variables (see e.g., Wooldridge, 2002b). This makes the estimator more robust in comparison to a random effects specification. A robust estimator of the covariance matrix is used to allow for clustering within individuals. As we do not have a natural scale for the latent variable the variance of the error term (ε) is restricted to equal one.

In our data the latent outcome h*it is not observed; instead, we observe an indicator of the category in which the latent indicator falls (hit). The observation mechanism can be expressed as, (2)

where µ0 = -∞, µj ≤ µj+1, µm = ∞. Given the assumption that the error term is normally distributed, the probability of observing the particular category of SAH reported by individual i at time t (hit), conditional on the regressors is, (3)

where Φ(.) is the standard normal distribution function. This formulation makes it clear that it is not possible to separately identify an intercept in the linear index (β0 ) and the cutpoints (µ), the

Pitj = P(hit = j) = Φ(µj - β’xit ) - Φ(µj-1 - β’xit)

hit = j if µj-1 < h*it ≤ µj , j = 1,......,m

11

model only identifies (µj -β0). To deal with this we have adopted a conventional normalisation by setting β0=0 (an alternative is to set µ1=0).

We do not impose an explicit error components specification in (1) but, to understand the nature of the non-response problem, it will often be helpful to think in terms of time invariant unobservable heterogeneity (an “individual effect”) and idiosyncratic random shocks that vary over time (“health shocks”). Non-response associated with individual effects implies that there are certain “types” of individual who are prone to drop out of the panel and whose health is permanently different from those who stay in. This kind of non-response can therefore be detected by comparing the outcomes that are observed prior to drop-out. Non-response associated with idiosyncratic shocks is more problematic. A transient health shock would be reflected in h*it , and hence in hit, but not necessarily in past health. The fundamental identification problem arises if the transient shock leads to the individual dropping-out of the panel, as hit is unobservable for those who have dropped-out.

4.2 Non-response bias Testing

The descriptive analysis has shown evidence of systematic patterns of non-response by socioeconomic characteristics and previous levels of health, but it remains to be seen whether this will lead to non-response bias in our empirical models of SAH. To provide an initial test for non-response bias we use the simple variable addition test proposed by Verbeek and Nijman (1992, p.688). The test variable we use is a count of the number of waves that are observed for the individual (NUMBER OF WAVES). This is added to our pooled ordered probit model and estimated with the unbalanced sample. The t-ratio on the added variables provides a test for non-response bias. The intuition behind the test is that, if non-response is random, indicators of an individual’s pattern of survey responses (R) should not be associated with the outcome of interest (h) after controlling for the observed covariates (x): in other words, it tests a conditional independence condition E(h|x,R)=E(h|x). Additional evidence can be provided by Hausman-type tests that compare estimates from the balanced - for whom we have complete information at all waves - and unbalanced - for whom we have incomplete information for some individuals - samples. In the absence of non-response bias these estimates should be comparable, but non-response bias may affect the unbalanced and balanced samples differently leading to a contrast between the estimates. It should be noted that the variable addition tests and Hausman-type tests may have low power; they rely on the sample of observed outcomes for hit and will not capture

12

non-response associated with idiosyncratic shocks that are not reflected in observed past health (Nicoletti, 2002). Estimation

To allow for non-response we adopt an inverse probability weighted (IPW) estimator and apply it to the pooled ordered probit model (Robins et al., 1995; Fitzgerald et al., 1998; Moffitt et al., 1999; Wooldridge, 2002a, 2002b). This approach is grounded in the notion of missing at random or ignorable non-response (Rubin, 1976; Little and Rubin, 1987). Using R as an indicator of response (R=1 if observed, 0 otherwise) and h and x as the outcome and covariates of interest: missing completely at random (MCAR) is defined by P(R=1|h,x)=P(R=1) and missing at random (MAR) is defined by P(R=1|h,x)=P(R=1|x). The latter implies that, after conditioning on observed covariates, the probability of non-response does not vary systematically with the outcome of interest. By Bayes rule, the MAR condition can be inverted to give P(h|x,R=1)=P(h|x), which provides a rationale for the Verbeek and Nijman (1992) approach to testing.

Fitzgerald et al. (1998) extend the notion of ignorable non-response by introducing the concepts of selection on observables and selection on unobservables. This requires an additional set of observables, z, that are available in the data but not included in the regression model for h. Selection on observables is defined by Fitzgerald et al. by the conditional independence condition P(R=1|h,x,z)=P(R=1|x,z). Selection on unobservables occurs if this conditional independence assumption does not hold. Selection on unobservables, also termed informative, non-random or non-ignorable non-response, is familiar in the econometrics literature where the dominant approach to non-response follows the sample selection model (Heckman, 1976; Hausman and Wise, 1979). This approach relies on the z being “instruments” that are good predictors of non-response and that satisfy the exclusion restriction P(h|x,z)=P(h|x). This is quite different from the selection on observables approach that seeks z’s which are endogenous to h. The statistics literature has related methods for non-ignorable non-response, some of which use the EM algorithm for data imputation (see e.g., Diggle and Kenward, 1994; Fitzmaurice et al., 1996; Molenberghs et al., 1997). Also it is worth mentioning that linear fixed effects panel estimators are consistent, in the presence of selection on unobservables, so long as the non-ignorable non-response is due to time invariant unobservables (see e.g., Verbeek and Nijman, 1992).

The validity of the selection on observables approach hinges on whether the conditional independence assumption holds and non-response can be treated as ignorable, once z is controlled for. If the condition does hold, consistent estimates can be obtained by weighting the observed data

13

by the inverse of the probability of response, conditional on the observed covariates (Robins et al., 1995). This gives more weight to individuals who have a high probability of non-response, as they are under-represented in the observed sample.

Fitzgerald et al. (1998) make it clear that this approach will be applicable when interest centres on a structural model for P(h|x) and that the z’s are deliberately excluded from the model, even though they are endogenous to the outcome of interest. They suggest lagged dependent variables as an obvious candidate for z. Rotnitzky and Robins (1997) offer a similar interpretation when they describe possible candidates for z being intermediate variables in the causal pathway from x to h. This property implies that it would not be sensible to use solely “field variables” such as changes in interviewer as candidates for the additional observables (see e.g., Behr et al., 2002). These kinds of variables may be good predictors of non-response but are unlikely to be associated with SAH. Horowitz and Manski (1998) show that if the observables (z) are statistically independent of h, conditional on (x, R=1), then the weighted estimates reduce to the unweighted ones. This would explain why no difference between weighed and unweighted estimates may be reported in empirical analyses that use inappropriate variables for z.

In our application we are interested in the distribution of self-assessed health conditional on socioeconomic status, rather than the distribution conditional on socioeconomic status and on other indicators of morbidity. We use past morbidity among our z variables. Of course, this approach will break-down if an individual suffers an unobserved health shock, that occurs after their previous interview, that leads them to drop out of the survey and that is not captured by conditioning on lagged measures of morbidity. In this case non-response would remain non-ignorable even after conditioning on z. It is possible to test the validity of the selection on observables approach. The first step is to test whether the z’s do predict non-response; this is done by testing their significance in the probit models for non-response at each wave of the panel (as in Table 3). The second is to do Hausman-type tests to compare the coefficients from the weighted and unweighted estimates. In addition the ordered probit models are compared in terms of the magnitudes of estimated average partial effects.

Implementation of the Fitzgerald et al. (1998) form of the ignorability condition implies that x is observable when R=0. In the case of the kind of unit non-response we are dealing with in the BHPS and ECHP, non-response means that there is missing data for the current period covariates (x) as well as self-assessed health (h). So we implement a stronger form of conditional independence P(R=1|h,x,z)=P(R=1|z) as proposed by Wooldridge (2002a). To compute the IPW estimator we estimate (probit) equations for response (Rit=1) versus non-response (Rit=0) at each wave,

14

t=2,…,T, conditional on a set of characteristics (zi1) that are measured for all individuals at the first wave (as in Table 3). As described above, this relies on selection on observables and implies that non-response can be treated as ignorable non-response, conditional on zi1 (Fitzgerald et al., 1998; Wooldridge, 2002b, p.588). Selection on observables requires that zi1 contains variables that predict non-response and that are correlated with the outcome of interest (SAH) but which are deliberately excluded from the model for health.

In practice zi1 includes the initial values of all of the regressors in the health equation. Also it includes initial values of SAH and of the other indicators of morbidity: for the BHPS, whether the individual reports a specific health problem (HLPRB), whether they report that health limits their daily activities (HLLT) and whether they report a disability (HLDSBL); for the ECHP, whether the individual was mildly or severely hampered in their daily activities. In addition, zi1 includes initial values of the respondent’s activity status, occupational socioeconomic group and region. The probits for response/non-response are estimated at each wave of the panel, from wave 2 to wave 11 in the case of the BHPS and waves 2 to 8 for the ECHP, using the full sample of individuals ˆit, are who are observed at wave 1. The inverse of the fitted probabilities from these models, 1/p

then used to weight observations in the IPW-ML estimation of the pooled ordered probit model using:

(4) LogL=∑∑

i

tn

T

ˆit)LogLit (Rit/p

Wooldridge (2002a) shows that, under the ignorability assumption:

(5) P(Rit =1|hit, xit, zi1 ) = P(Rit =1|zi1 ) , t=2,…,T

the IPW-ML estimator is n-consistent and asymptotically normal. Wooldridge (2002a) also ˆit rather than the true pitand ignoring the implied adjustment to shows that using the estimated p

the estimated standard errors leads to “conservative inference” so that the standard errors are larger than they would be with an adjustment for the use of fitted rather than true probabilities (see also Robins et al., 1995).

The IPW-ML estimator can be adapted to allow the elements of z to be up-dated and change across time, for example adding z variables measured at t-1 to predict response at t. This should improve the power of the probit models to predict non-response and hence make the ignorability

15

assumption more plausible. In this case the probit model for non-response at wave t is estimated relative to the sample that is observed at wave t-1. This relies on non-response being an absorbing state and is therefore confined to “monotone attrition” where respondents never re-enter the panel. Also, because estimation at each wave is based on the selected sample observed at the previous wave, the construction of inverse probability weights has to be adapted. The predicted probability ˆit=πweights are constructed cumulatively using pˆit denote the fitted ˆi2×πˆi2...×πˆit, where the πselection probabilities from each wave. In this version of the estimator the ignorability condition has to be extended to include future values of h and x (see Wooldridge, 2002b, p. 589). Once again Wooldridge shows that omitting a correction to the asymptotic variance estimator leads to conservative inference.

We have not pursued a selection on unobservables approach in this paper. This stems from the lack of credible exclusion restrictions that would define variables that predict health-related non-response but are not associated with SAH. Also, the use of fixed effects estimators is not possible for probit and ordered probit models, due to the incidental parameters problem (although we have experimented with models that use Mundlak (1978) type specifications to deal with correlated effects and this had little impact on our findings concerning non-response (see also Contoyannis et al., 2004b). With the public use versions of the BHPS and ECHP-UDB we do not have any scope for using methods based on refreshment samples (see e.g., Dolton, 2004). The IPW approach is attractive as it is easy to apply in the context of nonlinear models, such as the ordered probit model, and only requires a re-weighting of the data. In contrast to the published longitudinal weights that are supplied with the BHPS and ECHP, our IPW weights are model-specific and specifically designed for the outcome of interest (SAH) and the associated problem of health-related non-response. Past values of SAH, along with other indicators of morbidity, provide promising candidates for the z-variables; although the validity of the approach depends on the credibility of the ignorability assumption. For comparison, we present estimates based on the published BHPS and ECHP weights alongside estimates based on our own weights.

16

5. Estimation Results

The results for the various model specifications outlined above are reported in this section. For the detailed analysis of the BHPS, models for men and women are presented separately throughout. For the more parsimonious analysis of the ECHP the samples are pooled.

5.1 Tests for non-response

Table 6 presents the Verbeek and Nijman (1992) variable addition tests for non-response bias in the pooled ordered probit model for SAH in the BHPS and ECHP. This is based on adding the NUMBER OF WAVES to the model. The first set of results are for the benchmark pooled ordered probits with covariates x. The second includes the additional observables (z) that are used to compute the inverse probability weights. The latter can be regarded as a test for the ignorability assumption behind the IPW estimator. With the exception of Austria and Luxembourg, all of the test statistics show evidence of non-response bias. Adding these test variables to the model is not intended to “correct” the estimates for non-response, but it is informative to compare the estimates with the baseline model that does not include the test variables. It is striking that, for key variables such as income and education, the differences between the estimated coefficients are small (these results are available on request).

5.2 Estimates of ordered probits for SAH

Table 7presents the coefficient estimates for the dynamic pooled ordered probits for SAH estimated with the BHPS data for men (the results forwomen are available from the authors on request). The models were estimated on the balanced, the unbalanced sample and the available observations for the sample of drop-outs. The estimates for the pooled ordered probit models allow for clustering within individuals in the errors by using a robust estimator of the covariance matrix. In addition we estimated the model using the published longitudinal weights supplied with the BHPS, along with our own inverse probability weights (IPW) to adjust for non-response. Both variants of the IPW-ML estimator are presented: IPW-1 uses the full sample and wave 1 regressors to predict non-response, with IPW-2 the sample is restricted by excluding observations that exhibit non-monotone attrition and previous period regressors are used. The unbalanced sample is selected so that the same observations are used for the unweighted and all of the unweighted estimators, to allow a direct comparison of the estimators.

17

The LR tests reject the poolability of the unweighted models for the balanced and drop-outs samples when they are compared to the combined results for the unbalanced panel. The coefficients on lagged SAH show a clear gradient in the magnitude of the coefficients, running from good to very poor SAH (excellent is the omitted category), across all of the models. The results in Table 7 show differences in the sign and size of the coefficients on the age variables and on WIDOW between the three samples and between the weighted and unweighted estimates for the unbalanced sample, reflecting age-related non-response. The sign and size of the coefficients on the education variables are similar across all samples and comparing the weighted and unweighted estimates. The coefficients for ln(INCOME) are also similar across all of the specifications, but with a larger coefficient for the balanced sample than the sample of drop-outs and with the unbalanced sample bracketed between them. Pairwise comparisions of the contrasts between weighted and unweighted estimates of the coefficients on ln(INCOME) and the education variables shows that the differences are small in magnitude, relative to the size of the coefficients, with the largest differences when the IPW-2 weights are used. Pairwise Hausman-type t-tests suggest that the differences between coefficients from the unweighted, BHPS-weighted and IPW-1 weighted estimates are not significantly different. But they do reject the null that there is no statistically significant difference between the unweighted estimates and the IPW-2 estimates. Although this may be due, in part, to the smaller sample size when the sample is restricted to monotone attrition.

Average partial effects

The scaling of the ordered probit coefficients is arbitrary. To provide an indication of the magnitude of the associations between SAH and the regressors we present average partial effects (APEs). For continuous regressors, such as income, these are obtained by taking the derivative of the ordered probit probabilities with respect to the variable in question. For discrete regressors, such as the educational qualifications, they are obtained by taking differences. In general, average partial effects are averaged over the population distribution of heterogeneity and computed using the population averaged parameters (see e.g., Wooldridge, 2002b). In the pooled ordered probit models the total error variance is normalised to 1 and the estimated βs are population averaged parameters by default, so the APEs are given by the standard formula for partial effects.

In the ordered probit model it is possible to compute APEs for each of the five categories of self-assessed health. For parsimony, Table 8 summarises the APEs of lagged SAH, income and educational attainment on the probability of reporting excellent health in the BHPS data. In this case the sign of the APE has a clear qualitative interpretation, with a positive sign implying a positive association with health and vice versa. A partial effect is computed for each observation in

18

the sample, evaluated at the observed values of the regressors. The table presents the sample mean of the partial effects – the APE – along with the sample standard deviation, in parentheses, to give a sense of the variability of the partial effects across observations. These are presented for all versions of the model. Comparing the balanced sample, drop-outs sample and unbalanced samples gives very similar results, suggesting that non-response does not lead to differences in the estimated APEs. This is reinforced by the fact that the estimates with and without weights are very similar in magnitude. The largest differences are between the unweighted and the IPW-2 estimates in the sample of women for educational qualifications.

Table 9 summarises the APEs on the probability of reporting very good health, the highest category in the ECHP, for the lagged SAH variables, estimated with the ECHP-UDB data. While Table 10 presents the APEs for ln(INCOME) and education. The tables compare the estimates for the unweighted ordered probit and the weighted (ECHP published weights and IPW-2 weights) ordered probit estimated on the unbalanced sample. The estimates for lagged SAH, in Table 9, are very stable across all three estimators in all of the countries. Table 10 shows that, in all of the countries, there is a positive association between both income and education and SAH in the unweighted estimates and those based on the published . The average partial effects of income are lowest in Portugal, Italy and Spain and highest in Denmark, Ireland and Luxembourg. The average partial effect of completing tertiary education is lowest in Portugal and France and highest in Ireland and Denmark. As with the BHPS, the quantitative differences between the unweighted and weighted estimates of the average partial effects are small for most countries when the published ECHP weights are used. However there is more difference when the IPW-2 weights are used in place of the published weights and in some cases the partial effects change sign. This occurs in the countries where less than eight waves are available (Austria, Germany, Luxembourg, UK). In all of these cases the underlying coefficients for income and education are not statistically significant.

6. Discussion

This analysis shows that there is clear evidence of health-related non-response in both the BHPS and ECHP. In general, individuals in poor initial health are more likely to drop out, although for younger groups non-response is associated with good health. Furthermore, variable addition tests provide evidence of non-response bias in the models of SAH. Nevertheless a comparison of estimates based on the balanced samples, the unbalanced samples and corrected for non-response using inverse probability weights shows that, in many cases, substantive differences in the magnitudes of the average partial effects of lagged health, income and education are small. The

19

largest differences in the BHPS results are for the comparison of the weighted and IPW-2 weighted estimates of the APEs for education among women. For the ECHP the estimates of dynamics are unaffected by weighting but the IPW-2 estimates for income and education are substantially different than the unweighted estimates in the countries where less than eight waves are available (Austria, Germany, Luxembourg, UK). So, while health-related non-response clearly exists, on the whole it does not appear to distort the magnitudes of the estimated dynamics of SAH and the relationship between socioeconomic status and self-assessed health. Similar findings have been reported concerning the limited influence of non-response bias in models of income dynamics and various labour market outcomes (see e.g., Hausman and Wise, 1979; Becketti et al., 1988; Lillard and Panis, 1998; Zabel, 1998; Ziliak and Kniesner, 1998; Jimenez-Martin and Peracchi, 2002; Behr, 2004) and on measures of social exclusion such as poverty rates and income inequality indices (Watson, 2003; Rendtel et al., 2004). To understand our findings, recall that the descriptive analysis for the BHPS shows little evidence of income-related non-response. There is evidence of strong education-related and health-related non-response, but the latter is concentrated among those in poor initial health who are relatively few in number. There is no clear interaction between health-related non-response and levels of income or education. The finding that non-response has a limited impact on the estimates of health dynamics and, to a lesser extent, estimates of the association between socioeconomic status, measured by income and education, and self-assessed health holds for the BHPS and for many of the countries within the ECHP.

20

References

Adams, P., Hurd, M.D, McFadden, D., Merrill, A. and Ribeiro, T. (2003) Healthy, wealthy and wise? Tests for direct causal paths between health and socioeconomic status. Journal of Econometrics, 112, 3-56.

Becketti, S., Gould, W., Lillard, L. and Welch, F. (1988) The Panel Study of Income Dynamics after fourteen years: an evaluation. Journal of Labor Economics, 6, 472-492.

Behr, A., Bellgardt, E. and Rendtel, U. (2002) A cross-national comparison of panel attrition in the ECHP and its impact on the estimation of wage equations. Mimeo.

Behr, A. (2004) Comparing estimation strategies in the presence of panel attrition. Empirical results based on the ECHP. Mimeo, Wilhelms-Universitäüt, Münster.

Benzeval, M. and Judge, K. (2001) Income and health: the time dimension. Social Science and Medicine, 52, 1371-1390.

Benzeval, M., Taylor, J. and Judge, K. (2000) Evidence on the relationship between low income and poor health: Is the Government doing enough? Fiscal Studies, 21, 375-399.

Buckley, N.J., Denton, F.T., Robb, L. and Spencer, B.G. (2004) The transition from good to poor health: an econometric study of the older population. Journal of Health Economics, 23, 1013-1034.

Burström, B. and Fredlund, P. (2001) Self rated health: is it as good a predictor of subsequent mortality among adults in lower as well as in higher social classes? Journal of Epidemiology and Community Health, 55, 836-840

Contoyannis, P. and Jones, A. (2004) Socioeconomic status, health and lifestyle. Journal of Health Economics, 23, 965-995.

Contoyannis, P., A.M. Jones, and N. Rice (2004a) Simulation-based inference in dynamic panel probit models: an application to health. Empirical Economics, 29, 49-77.

Contoyannis, P., Jones, A.M. and Rice, N. (2004b) The dynamics of health in the British Household Panel Survey. Journal of Applied Econometrics, 19, 473-503.

Deaton, A..S. and Paxson, C.H. (1998) Ageing and inequality in income and health. American Economic Review, Papers and Proceedings 88, 248-253.

Diggle, P. and Kenward, M.G. (1994) Informative drop-out in longitudinal data analysis. Applied Statistics, 43, 49-93.

Dolton, P. (2004) Reducing attrition bias using targeted refreshment sampling and matching. University of Newcastle upon Tyne., mimeo.

Ettner, S. (1996) New evidence on the relationship between income and health. Journal of Health Economics, 15, 67-85

Fitzgerald, J., Gottshalk, P. and Moffitt, R. (1998) An analysis of sample attrition in panel data. The Michigan Panel Study on Income Dynamics. Journal of Human Resources, 33, 251-299.

21

Fitzmaurice, G.M., Heath, A.F and Clifford, P. (1996) Logistic regression models for binary panel data with attrition. Journal of the Royal Statistical Society Series A(Statistics in Society), 159, 249-263.

Frijters, P., Haisken-DeNew, J.P. and Shields, M.A. (2003) Estimating the causal effect of income on health: evidence from post reunification East Germany, Centre for Economic Policy Discussion Paper No. 465, Australian National University.

Hausman, J. and Wise, D. (1979) Attrition bias in experimental and panel data: the Gary Income Maintenance Experiment. Econometrica, 47, 455-474.

Heckman, J.J. (1976) The common structure of statistical models of truncation, sample selection and limited dependent variables and a simple estimator for such models. The Annals of Economic and Social Measurement, 5, 475-492.

Hernández-Quevedo, C., Jones, A.M. and Rice, N., (2004) Reporting bias and heterogeneity in self-assessed health. Evidence from the British Household Panel Survey, ECuity III Project Working Paper #19.

Horowitz, J.L. and Manski, C.F. (1998) Censoring of outcomes and regressors due to survey nonresponse: identification and estimation using weights and imputations. Journal of Econometrics, 84, 37-58.

Idler, E. L. and Benyamini, Y. (1997). Self-rated health and mortality: a review of twenty-seven community studies. Journal of Health and Social Behavior, 38, 21-37.

Idler, E.L. and Kasl, S.V. (1995) Self-ratings of health: do they also predict change in functional ability? Journal of Gerontology, 50B, S344-S353.

Jimenez-Martin, S. and Peracchi, F. (2002) Sample attrition and labor force dynamics: evidence from the Spanish labor force survey. Spanish Economic Review, 4, 79-102.

Kenkel, D. (1995). Should you eat breakfast? Estimates form health production functions. Health Economics, 4, 15-29.

Lillard, L.A. and Panis, C.W.A. (1998) Panel attrition from the Panel Study of Income Dynamics: household income, marital status and mortality. Journal of Human Resources, 33, 437-457.

Little, J.A. and Rubin, D.B. (1987) Statistical analysis with missing data. John Wiley and Sons, New York.

Meer, J., Miller, D.L. and Rosen, H.S. (2003) Exploring the health-wealth nexus. Journal of Health Economics, 22, 713-730.

Moffitt, R., Fitzgerald, J. and Gottschalk, P. (1999) Sample attrition in panel data: the role of selection observables. Annales d’Economie et de Statistique, 55-56, 129-152.

Molenberghs, G., Kenward, M.G. and Lesaffre, E. (1997) The analysis of longitudinal ordinal data with non-random drop-out. Biometrika, 84, 33-44.

Mundlak, Y. (1978) On the pooling of time series and cross-section data. Econometrica, 46, 69-85.

Nicoletti, C. (2002) Non-response in dynamic panel data models. Working papers of the Institute for Social and Economic Research, paper 2002-31, Colchester: University of Essex.

22

Nicoletti, C. and Peracchi, F. (2005) A cross-country comparison of survey nonparticipation in the ECHP. Journal of the Royal Statistical Society Series A forthcoming..

Peracchi, F. (2002) The European Community Household Panel: a review. Empirical Economics, 27, 63-90.

Rendtel, U., Behr, A., Bellgardt, E. and Sisto, J. (2004) Does panel attrition disturb comparative analysis with the European Community Household Panel (ECHP)? Mimeo.

Riphahn, R. (1999) Income and employment effects of health shocks. A test case for the German welfare state. Journal of Population Economics, 12, 363-389.

Robins, J., Rotnitzky, A. and Zhao, L.P. (1995) Analysis of semiparametric regression models for repeated outcomes in the presence of missing data. Journal of the American Statistical Association, 90, 106-121.

Rotnitzky, A. and Robins, J. (1994) Analysis of semi-parametric regression models with non-ignorable non-response. Statistics in Medicine, 16, 81-102.

Rubin, D.B. (1976) Inference and missing data. Biometrika, 63, 581-592.

Salas, C. (2002) On the empirical association between poor health and low socioeconomic status at old age. Health Economics, 11, 207-220.

Smith, J.P. (1999) Healthy bodies and thick wallets: the dual relationship between health and economic status. Journal of Economic Perspectives 13, 145-166.

Taylor, M., Brice, J., Buck, N. and Prentice, E. (1998) British Household Panel Survey User Manual Volume A: Introduction, Technical Report and Appendices. University of Essex, Colchester.

van Doorslaer, E. and Gerdtham, U-G. (2003) Does inequality in self-assessed health predict inequality in survival by income? Evidence from Swedish data. Social Science and Medicine, 57, 1621-1629.

van Doorslaer, E. and Koolman, X. (2004) Explaining the differences in income-related health inequalities across European countries. Health Economics, 13, 609-628.

van Doorslaer, E., Jones, A.M. and Koolman, X. (2004). Explaining income-related inequalities in doctor utilisation in Europe: a decomposition approach. Health Economics, 13, 629-647.

van Doorslaer, E., et al. (2000) Equity in the delivery of health care in Europe and the US. Journal of Health Economics, 19, 553-583.

van Doorslaer, E, Wagstaff, A., Bleichrodt, H. et al, (1997) Income-related inequalities in health: some international comparisons, Journal of Health Economics, 16, 93-112.

van Ourti, T. (2003). Socio-economic inequality in ill-health amongst the elderly. Should one use current income or permanent income? Journal of Health Economics, 22, 187-217.

Verbeek, M. and Nijman, T.E. (1992) Testing for selectivity bias in panel data models. International Economic Review, 33, 681-703.

Verbeek, M. (2000) A guide to modern econometrics. Wiley, Chichester.

23

Watson, D. (2003) Sample attrition between waves 1 and 5 in the European Community Household Panel. European Sociological Review, 19, 361-378.

Wildman, J. (2003) Income inequalities in mental health in Great Britain: analysing the causes of health inequality over time. Journal of Health Economics, 22, 295-312.

Wooldridge, J. (2002a) Inverse probability weighted M-estimators for sample stratification, attrition and stratification. Portuguese Economic Journal, 1, 117-139.

Wooldridge, J. (2002b) Econometric Analysis of cross section and panel data. MIT Press.

Zabel, J.E. (1998) An analysis of attrition in the Panel Study of Income Dynamics and the Survey of Income and Program Participation with an application to a model of labor market behavior. Journal of Human Resources, 33, 479-506.

Ziliak, J.P. and Kniesner, T.J. (1998) The importance of sample attrition in life cycle labor supply estimation. Journal of Human Resources, 33, 507-530.

24

Table 1: SAH sample size, drop outs, re-joiners, survival rate (%) and drop-out rates (%) by wave and previous period health status, BHPS

Men Wave 1 2 3 4 5 6 7 8†

10‡11 Women 1 2 3 4 5 6 7 8†

10‡11

No. Ind. 4832 4180 3878 3675 3464 3408 3280 3137 2899 2820 5424 4777 4532 4303 4106 4016 3882 3775 3522 3403

Drop outs 652 428 285 283 156 159 175

- 137 647 373 305 268 179 166 151

- 183

Re-joiners 0 126 82 72 100 31 32 -

58

0 128 76 71 89 32 44 -

64

Survival rate 86.5 80.3 76.1 73.8 70.5 67.9 64.9

60.0 58.4 88.1 83.6 79.3 75.7 74.0 71.6 69.6

64.9 62.7

Raw Drop-out rate 13.5 10.2 7.4 7.7 4.5 4.7 5.3

- 4.7

11.9 7.8 6.7 6.2 4.4 4.1 3.9

- 5.2

EX GOOD at t-1 at t-1

Drop-out Drop-out Net rate rate Drop-out

Rate*

13.5 12.2 13.5

9.5 8.9 7.2

7.4 6.7 5.2

7.4 5.4 5.7

3.1 3.6 1.6

4.5 3.3 3.8

4.4 4.1 4.4

- - -

4.5 4.5 2.7 11.9 5.1 5.1 4.6 2.2 3.3 2.8

- 3.4

10.8 7.2 6.7 7.1 2.6 3.0 2.7

- 4.3

11.8 7.0 5.8 5.2 3.4 3.3 3.3

- 3.7

FAIR at t-1 Drop-out rate 14.2 11.5 7.3 9.6 4.8 4.6 6.4

- 4.8 12.1 8.2 6.3 6.6 5.2 4.9 4.5

- 6.8

POOR at t-1 Drop-out rate 14.6 14.6 8.5 9.7 12.2 9.7 7.0

- 5.2 13.2 11.3 11.6 8.3 9.3 8.3 5.2

- 7.4

VPOOR at t-1 Drop-out rate 26.9 24.0 14.5 23.0 25.4 11.5 22.9

- 10.4 21.4 16.4 14.0 12.0 14.3 7.1 12.6

- 13.5

Notes:

Drop-outs – respondents at wave t-1, non-respondents at wave t. Re-joiners – non-respondents at wave t-1, respondents at wave t.

* Raw drop-out rates exclude re-joiners; Net drop-out rates include re-joiners.

† At wave 9 the self-assessed health question was changed to one based on the SF-36 questionnaire. SF-36 questionnaire response rates appear lower than those for hlstat and therefore are not used as a basis for calculating drop-out rates.

‡ Drop-out rates conditional on previous wave reporting of self-assessed health are not possible due to the change in the self-assessed health question at wave 9.

25

Table 2: SAH-related drop-out rates (%) over 11 waves by gender, age, income, educational and marital status,

BHPS

ALL EX GOOD FAIR POOR at t1 at t1 at t1 at t1

ALL DATA 39 36 37 44 48 GENDER: MEN 42 38 39 49 53 WOMEN 37 33 36 40 45 AGE GROUP: <30 36 40 34 37 22 31-50 32 30 31 35 32 51-70 39 33 37 41 53 >70 73 60 69 77 87 INCOME QUINTILE: 1 58 55 57 59 58 2 41 36 39 44 47 3 37 35 36 41 45 4 36 36 34 39 44 5 32 32 32 29 38 EDUCATION: DEGREE 26 26 27 29 24 HND/A LEVEL 30 31 28 32 28 O LEVEL / CSE 34 36 32 36 32 NO QUALIFICATIONS 48 42 46 52 55 MARITAL STATUS: WIDOW 62 47 60 64 81 SINGLE 42 45 40 44 44 DIVORCED/SEPARATED 41 38 37 45 47 MARRIED/COUPLE 35 32 34 40 41

VPOOR at t1 64 67 63 22 47 64 95 74 62 62 45 71 0 67 49 65 83 42 60 64

26

Table 3: Probit models for response/non-response by wave; Men, BHPS

WAVE N 4543 2 3 4 5 6 7 8 10 11 Ln(INCOME) .021 .023* .025 .026* .027 .034** .011 .020 .016 WIDOW .009 -.034 -.030 -.010 -.022 -.032 -.044 .010 -.042 SINGLE -.046* -.018 -.053* -.040 -.065** -.074** -.070** -.048* -.051*

-.075 DIV/SEP .002 -.014 -.020 -.007 -.034 -.031 -.061 -.005

NON-WHITE -.102** -.161** -.158** -.160** -.152** -.151** -.160** -.152** -.141**

.169** DEGREE .143** .070** .154** .132** .169** .161** .163** .157**

.134** HND/A .092** .057** .100** .083** .115** .090** .121** .093**

.075** O/CSE .058** .027* .064** .042* .073** .055** .066** .054**

-.001 HHSIZE -.011 -.019** -.017* -.009 -.0001 -.031** -.019* .002

.034* NCH04 .057** .068** .044* .064** .035 .089** .027 .072**

-.005 NCH511 .011 .032** .014 .018 -.003 .036* -.013 .012

.0008 NCH1218 .043 .041** .042* .043** .017 .069** .012 .051**

-.013 AGE -.043 -.027 -.030 -.047 -.031 -.041 -.029 -.042

.052 AGE2 .130 .102 .122 .166 .120 .144 .117 .147

-.041 AGE3 -.178 -.146 -.166 -.229 -.156 -.193 -.146 -.188

-.051 AGE4 .074 .071 .068 .106 .055 .085 .043 .076

-.010 NORTH-WEST -.013 .003 .002 -.009 .016 -.021 -.006 -.008

.030 NORTH-EAST .030 .029 .027 .002 .029 .011 .024 .024

.034 SOUTH-EAST .033 .003 .027 .015 .025 .018 .042 .035

.068 SOUTH-WEST .072* .062** .062 .074** .072* .049 .065 .060*

.026 MIDLANDS .040 .009 .035 .025 .035 .024 .026 .038

-.010 -.086* SCOTLAND -.042 -.028 -.046 -.018 -.077* -.017 -.082 .021 .009 WALES .020 .027 -.002 .014 -.022 .028 -.006 .001 -.027 SELF-EMPLOYED .005 .006 -.002 -.0001 .002 .018 -.036 -.092 -.096 UNEMPLOYED -.063 -.017 -.110 -.035 -.100 -.047 -.090 -.008 -.034 RETIRED .018 .006 -.044 .025 .004 -.029 -.014 -.032 -.029 FAMILY-CARE -.030 .005 -.064 -.026 -.048 -.0004 -.019 .018 .014 EMP-OTHER .032 .019 -.010 .024 .042 .036 .013 .055 .066 UNCLASSIFIED .049 .052 .083 .039 .072 .036 .060 .017 .033 MANUAL-TECHNICAL .039 .042 .033 .046 .033 .018 .044 .039 .077 SKILL-NON-MAN .045 .041 .053 .053 .065 .058 .091* .013 .041 SKILLED-MANUAL .016 .030 .023 .024 .036 .018 .052 .023 .034 PART-SKILLED .038 .040 =.039 .038 .045 .057 .046 -.0008 .022 UNSKILLED .014 .058 .017 .073 .043 .068 -.007 .010 .009 SAHEX -.001 .006 .003 .019 .003 .010 .013 -.044* SAHFAIR -.060** -.051* .003 -.007 -.060** -.066** -.053* -.027*

.010 -.076 SAHPOOR -.011 .015 -.022 .022 -.032 .020 -.072 -.059 -.129 SAHVPOOR -.112 -.032 -.120 -.033 -.137* -.043 -.151* .054** .037* HLPRB .049** .002 .045** .026 .045** .049** .040 -.027 -.023 HLDSBL -.011 -.014 .021 -.002 .002 -.008 -.024 -.087** -.088** -.095** -.097** -.062* -.071* HLLT -.025 -.044 -.049

Log-likelihood -1660.9 -2293.5 -2472.3 -2671.8 -2730.1 -2677.6 -2846.2 -2897.1 -2927.8 1. * denotes p≤0.05, ** denotes p≤0.01.

2. All regressors represent wave 1 responses, there were no men in the long-term sick category in our

sample at wave 1.

3. Results are presented as partial effects on the probability of responding at wave t, evaluated at the

sample means of the regressors.

27

Table 4: Summary of SAH-related drop-out rates (%) over all available waves by country, ECHP-UDB

ALL 41 48 49 47 44 13 33 41 69 40 12 44 30 49 45 29

VGOOD 41 44 44 43 42 15 34 39 69 37 13 42 29 47 43 27

GOOD 38 46 47 45 42 11 30 41 68 39 12 43 26 48 44 26

FAIR 40 54 58 50 44 13 31 40 68 40 10 48 29 49 48 32

POOR 50 61 66 55 53 23 37 46 73 49 15 57 35 53 52 35

VPOOR 65 73 75 65 61 36 50 59 78 59 37 63 51 65 56 53

Country (waves)

AUSTRIA (2-8) BELGIUM (1-8) DENMARK (1-8) FINLAND (3-8) FRANCE (1-8) GERMANY (1-3)

GERMANY (GSOEP 1-8) GREECE (1-8) IRELAND (1-8) ITALY (1-8)

LUXEMBOURG (1-3) NETHERLANDS (1-8) PORTUGAL (1-8) SPAIN (1-8) UK (1-3)

UK (BHPS 1-8)

28

Table 5: Summary of SAH-related drop-out rates(%) over all available waves by income quintile amd educational

status by country, ECHP-UDB

ALL VGOOD

AUSTRIA (2-8) INCOME QUINTILE 1 INCOME QUINTILE 5 PRIMARY EDUCATION TERTIARY EDUCATION BELGIUM (1-8) INCOME QUINTILE 1 INCOME QUINTILE 5 PRIMARY EDUCATION TERTIARY EDUCATION DENMARK (1-8) INCOME QUINTILE 1 INCOME QUINTILE 5 PRIMARY EDUCATION TERTIARY EDUCATION FINLAND (3-8) INCOME QUINTILE 1 INCOME QUINTILE 5 PRIMARY EDUCATION TERTIARY EDUCATION FRANCE (1-8) INCOME QUINTILE 1 INCOME QUINTILE 5 PRIMARY EDUCATION TERTIARY EDUCATION GERMANY (1-3) INCOME QUINTILE 1 INCOME QUINTILE 5 PRIMARY EDUCATION TERTIARY EDUCATION GREECE (1-8) INCOME QUINTILE 1 INCOME QUINTILE 5 PRIMARY EDUCATION TERTIARY EDUCATION

GOOD FAIR POOR VPOOR

43 43 45 42 57 43 41 53 60 37 37 60 49 44 41 53 49 40 37 46 18 12 11 15 35 48 49 36

34464729201415164937344345434049543535575441394541454841

41424338

54404050

59383657

45424152

46383542

1510912

31485234

40384440

60514657

65484863

52464453

48433946

20111015

35535737

51373353

66664768

72505571

58574559

59524555

24191424

41595645

60 67 50 64

74 71 71 76

83 83 69 78

65 67 100 68

63 67 65 61

32 38 35 36

57 67 100 57

29

IRELAND (1-8) INCOME QUINTILE 1 INCOME QUINTILE 5 PRIMARY EDUCATION TERTIARY EDUCATION ITALY (1-8)

INCOME QUINTILE 1 INCOME QUINTILE 5 PRIMARY EDUCATION TERTIARY EDUCATION LUXEMBOURG (1-3) INCOME QUINTILE 1 INCOME QUINTILE 5 PRIMARY EDUCATION TERTIARY EDUCATION NETHERLANDS (1-8) INCOME QUINTILE 1 INCOME QUINTILE 5 PRIMARY EDUCATION TERTIARY EDUCATION PORTUGAL (1-8) INCOME QUINTILE 1 INCOME QUINTILE 5 PRIMARY EDUCATION TERTIARY EDUCATION SPAIN (1-8)

INCOME QUINTILE 1 INCOME QUINTILE 5 PRIMARY EDUCATION TERTIARY EDUCATION UK (1-3)

INCOME QUINTILE 1 INCOME QUINTILE 5 PRIMARY EDUCATION TERTIARY EDUCATION

66 71 70 66 37 45 40 40 19 13 12 12 47 44 40 53 31 37 40 28 48 49 51 49 49 42 38 48

4641384543515145244735253346393665706965

66716964

33454038

2811715 46393548

23363823

48485047

494238474342395117171711

66797368

37423940

1510119

51525055

29354428

49465249

50443950

71578972

44554848

7143313

63533863

38415835

50616053

53564055

72 100 . 80

57 65 100 58

42 50 0 37

68 67 60 65

54 56 100 50

62 68 92 65

51 78 62 55

30

Table 6: Verbeek and Nijman tests for non-response, BHPS and ECHP-UDB

Country (waves)

BHPS (IPW-1) Men: Women:

BHPS (IPW-2) Men: Women:

ECHP (IPW-2)

AUSTRIA (2-8) BELGIUM (1-8) DENMARK (1-8) FINLAND (3-8) FRANCE (1-8) GERMANY (1-3) GREECE (1-8) IRELAND (1-8) ITALY (1-8)

LUXEMBOURG (1-3) NETHERLANDS (1-8) PORTUGAL (1-8) SPAIN (1-8) UK (1-3)

Model based on regressors x (t ratio)

6.42 3.53 4.76 2.03 0.39 5.45 4.02 5.28 6.03 3.15 3.69 6.81 6.54 -0.64 3.22 7.25 6.07 2.04

Model based on regressors x and z

(t ratio) 6.27 3.24

4.95 2.10

0.27 4.16 3.15 4.74 4.95 2.79 2.80 6.27 5.82 -0.44 2.03 5.89 4.88 1.94

31

Table 7: Dynamic ordered probit models for SAH – Men, BHPS (1) Balanced sample

NT = 18,616 -.981 (.029) -1.867 (.039) -2.757 (.062) -3.356 (.126) .146 (.016) .004 (.055) -.079 (.033) .046 (.048) -.174 (.048) .180 (.035) .144 (.028) .110 (.029) .030 (.013) .001 (.025) -.009 (.020) .026 (.023) -.013 (.045) .028 (.144) -.035 (.193) .020 (.093)

-2.789 (.531) -1.728 (.531) -.561 (.529) 1.117 (.529)

-18568.9

(2) Drop-outs sample

NT = 6,593 -.939 (.050) -1.794 (.064) -2.597 (.081) -3.309 (.124) .103 (.023) .048 (.070) .004 (.049) -.063 (.063) -.161 (.072) .166 (.053) .156 (.045) .145 (.039) -.018 (.018) .019 (.043) .046 (.034) .035 (.041) -.024 (.054) .100 (.170) -.180 (.224) .102 (.104)

-2.956 (.634) -2.040 (.633) -.952 (.633) .641 (.633)

-7139.7

(3) Unbalanced sample

NT = 25,209-.970 (.025) -1.845 (.033) -2.723 (.049) -3.383 (.089) .130 (.013) .015 (.044) -.067 (.027) -.003 (.039) -.165 (.040) .177 (.029) .153 (.024) .121 (.023) .013 (.010) .007 (.022) .008 (.017) .030 (.020) -.030 (.034) .081 (.107) -.103 (.142) .046 (.067)

-3.009 (.400) -2.019 (.400) -.883 (.400) .770 (.399)

-25782.0

146.8 (0.000)

(4) BHPS

longitudinal weights

NT = 25,209 -.954 (.026) -1.820 (.035) -2.701 (.051) -3.385 (.086) .125 (.015) -.012 (.044) -.066 (.028) -.004 (.042) -.181 (.041) .182 (.030) .163 (.025) .124 (.024) .004 (.011) .012 (.024) .008 (.018) .038 (.021) -.022 (.037) .054 (.117) -.069 (.156) .031 (.074)

-2.980 (.435) -1.984 (.434) -.849 (.434) .789 (.433)

-26,058.2

-0.668 0.651 1.429 0.438

(5) Inverse probability weights IPW-1

NT = 25,209 -.966 (.025) -1.841 (.034) -2.725 (.050) -3.414 (.087) .129 (.015) .011 (.047) -.075 (.028) -.009 (.041) -.160 (.041) .180 (.030) .159 (.024) .120 (.024) .015 (.011) .013 (.022) .0005 (.017) .022 (.020) -.041 (.039) .120 (.125) -.159 (.168) .074 (.081)

-3.136 (.459) -2.148 (.458) -1.004 (.457) .643 (.457)

-26041.9

-0.134 0.391 1.034 -0.146

(6) Inverse probability weights IPW-2

NT=21,630 -.953 (.030) -1.808 (.041) -2.677 (.065) -3.305 (.114) .149 (.016) .012 (.057) -.083 (.033) .014 (.048) -.118 (.057) .170 (.034) .130 (.027) .103 (.027) .019 (.013) .004 (.025) .008 (.022) .041 (.025) -.014 (.042) .025 (.134) -.024 (.179) .007 (.086)

-2.664 (.495) -1.667 (.491) -.548 (.490) 1.092 (.490)

-22450.2

2.037 -0.394 -1.859 -1.273

SAHGOOD(t-1) SAHFAIR(t-1) SAHPOOR(t-1) SAHVPOOR(t-1) Ln(INCOME) WIDOW SINGLE DIV/SEP NON-WHITE DEGREE HND/A O/CSE HHSIZE NCH04 NCH511 NCH1218 AGE AGE2 AGE3 AGE4

Cut 1 Cut 2 Cut 3 Cut 4

Log Likelihood

LR test for pooling

Hausman test: Ln(INCOME) DEGREE HND/A O/CSE

1. Standard errors are reported in parentheses. 2. Cut 1-4 are estimated cut points or thresholds. 3. Coefficients for the year dummies are not reported.

4. Descriptive summary of BHPS longitudinal weights for above sample: Mean = 1.056, SD = 0.351,

Min = 0.190, Max = 2.5.

5. Descriptive summary of IPW – 1 with health variables weights for sample above: Mean = 1.440,

SD = 0.388, Min = 1.01, Max = 15.73.

6. Descriptive summary of IPW – 2 with health variables weights for sample above: Mean = 1.908, SD

= 1.173, Min = 1.01, Max = 33.89

7. The LR test for pooling compares the unrestricted estimates (balanced+drop-outs samples) with the

restricted estimates (unbalanced sample). The statistic is chi-squared.

8. The Hausman test reports the t-test for pairwise comparisons of the contrast between the weighted

estimates of the coefficients (models 4-6) with those from the unweighted estimate (model 3). 1.

32

Table 8: Average partial effects (APE) on the probability of reporting excellent SAH, BHPS.

a) Men

(5) (6) (4) (2) (3) (1) Inverse Inverse Drop-outs Unbalanced BHPS Balanced

probability longitudinal probability sample sample sample

weights weights weights

IPW-1 IPW-2

SAHGOOD(t-1) -.277 (.141) -.234 (.144) -.266 (.143) -.262 (.140) -.265 (.142) -.264 (.139) SAHFAIR(t-1) -.337 (.178) -.297 (.178) -.327 (.179) -.325 (.176) -.326 (.178) -.326 (.176) SAHPOOR(t-1) -.295 (.205) -.260 (.197) -.286 (.203) -.286 (.201) -.286 (.203) -.289 (.202) SAHVPOOR(t-1) -.276 (.208) -.238 (.200) -.267 (.207) -.267 (.205) -.266 (.206) -.270 (.206) Ln(INCOME) .038 (.018) .024 (.014) .033 (.016) .032 (.016) .033 (.016) .038 (.019) DEGREE .049 (.021) .040 (.022) .046 (.022) .048 (.022) .047 (.022) .045 (.021) HND/A .038 (.017) .037 (.021) .040 (.019) .042 (.020) .041 (.020) .034 (.016) O/CSE .029 (.013) .034 (.019) .031 (.015) .032 (.015) .031 (.015) .027 (.013)

b) Women

(5) (6) (4) (2) (3) (1) Inverse Inverse Drop-outs Unbalanced BHPS Balanced

probability longitudinal probability sample sample sample

weights weights weights

IPW-1 IPW-2

-.237 (.141) -.240 (.146) -.240 (.146) -.239 (.146) -.197 (.133) SAHGOOD(t-1) -.252 (.148) -.283 (.168) -.287 (.173) -.284 (.173) -.286 (.172) -.250 (.157) -.296 (.177) SAHFAIR(t-1)

-.242 (.185) -.240 (.187) -.237 (.187) -.239 (.187) -.216 (.173) -.246 (.191) SAHPOOR(t-1)

-.220 (.186) -.217 (.188) -.215 (.187) -.216 (.187) -.195 (.174) SAHVPOOR(t-1) -223 (.191)

.017 (.009) .021 (.012) .022 (.013) .023 (.013) .010 (.006) .028 (.015) Ln(INCOME)

.071 (.036) .061 (.032) .062 (.032) .057 (.030) .059 (.034) .057 (.029) DEGREE

.053 (.028) .040 (.022) .039 (.021) .040 (.021) .055 (.033) .033 (.018) HND/A

.049 (.026) .040 (.022) .039 (.022) .037 (.021) .027 (.017) .040 (.021) O/CSE

1. The partial effects are computed for each individual using their observed values of the regressors. The table presents the sample mean of the partial effects – the APE – along with the sample standard deviations in parentheses.

33

Table 9: Average partial effects on the probability of reporting very good SAH, ECHP-UDB

Country (waves)

AUSTRIA (w2-8, N*T 26368) VERY POOR HEALTH POOR HEALTH FAIR HEALTH GOOD HEALTH

BELGIUM (w1-8, N*T 31699) VERY POOR HEALTH POOR HEALTH FAIR HEALTH GOOD HEALTH

DENMARK (w1-8, N*T 26848) VERY POOR HEALTH POOR HEALTH FAIR HEALTH GOOD HEALTH

FINLAND (w3-8, N*T 34439) VERY POOR HEALTH POOR HEALTH FAIR HEALTH GOOD HEALTH

FRANCE (w1-8, N*T 66988) VERY POOR HEALTH POOR HEALTH FAIR HEALTH GOOD HEALTH

GERMANY (w1-3, N*T 16403) VERY POOR HEALTH POOR HEALTH FAIR HEALTH GOOD HEALTH

GREECE (w1-8, N*T 63826) VERY POOR HEALTH POOR HEALTH FAIR HEALTH GOOD HEALTH

IRELAND (w1-8, N*T 37699) VERY POOR HEALTH POOR HEALTH FAIR HEALTH GOOD HEALTH

ITALY (w1-8, N*T 96509) VERY POOR HEALTH POOR HEALTH FAIR HEALTH GOOD HEALTH

LUXEMBOURG (w1-3, N*T 3503) VERY POOR HEALTH POOR HEALTH FAIR HEALTH GOOD HEALTH

NETHERLANDS (w1-8, N*T 55673) VERY POOR HEALTH POOR HEALTH FAIR HEALTH GOOD HEALTH

PORTUGAL (w1-8, N*T 69236) VERY POOR HEALTH POOR HEALTH FAIR HEALTH GOOD HEALTH

SPAIN (w1-8, N*T 85111)

Unbalanced sample

-0.172 (0.190) -0.187 (0.189) -0.270 (0.171) -0.208 (0.155)

-0.209 (0.206) -0.225 (0.206) -0.301 (0.192) -0.268 (0.164)

-0.442 (0.264) -0.453 (0.248) -0.466 (0.176) -0.255 (0.117)

-0.309 (0.270) -0.325 (0.259) -0.364 (0.210) -0.260 (0.154)

-0.117 (0.116) -0.116 (0.116) -0.183 (0.111) -0.124 (0.099)

-0.127 (0.142) -0.137 (0.143) -0.188 (0.137) -0.179 (0.125)

-0.483 (0.312) -0.471 (0.264) -0.417 (0.176) -0.225 (0.112)

-0.444 (0.251) -0.447 (0.239) -0.456 (0.155) -0.251 (0.100)

-0.173 (0.180) -0.190 (0.176) -0.236 (0.153) -0.173 (0.128)

-0.214 (0.198) -0.226 (0.194) -0.282 (0.160) -0.191 (0.126)

-0.177 (0.164) -0.189 (0.166) -0.255 (0.157) -0.232 (0.135)

-0.030 (0.054) -0.042 (0.059) -0.065 (0.075) -0.076 (0.091)

ECHP published

weights

-0.171 (0.190) -0.185 (0.190) -0.269 (0.173) -0.213 (0.158)

-0.212 (0.207) -0.228 (0.208) -0.304 (0.193) -0.270 (0.165)

-0.440 (0.262) -0.450 (0.245) -0.460 (0.173) -0.249 (0.113)

-0.308 (0.267) -0.324 (0.256) -0.364 (0.207) -0.255 (0.151)

-0.116 (0.116) -0.115 (0.115) -0.181 (0.110) -0.124 (0.099)

-0.128 (0.143) -0.139 (0.144) -0.191 (0.139) -0.180 (0.126)

-0.477 (0.306) -0.463 (0.257) -0.407 (0.170) -0.219 (0.108)

-0.446 (0.251) -0.448 (0.237) -0.454 (0.152) -0.251 (0.098)

-0.174 (0.178) -0.191 (0.174) -0.238 (0.151) -0.171 (0.126)

-0.217 (0.201) -0.229 (0.197) -0.286 (0.162) -0.195 (0.128)

-0.178 (0.166) -0.190 (0.168) -0.257 (0.159) -0.235 (0.137)

-0.033 (0.058) -0.047 (0.065) -0.074 (0.083) -0.086 (0.102)

Inverse probability

weights IPW-2

-0.170 (0.183) -0.184 (0.183) -0.266 (0.166) -0.202 (0.149)

-0.227 (0.209) -0.244 (0.210) -0.323 (0.193) -0.272 (0.162)

-0.455 (0.269) -0.462 (0.248) -0.460 (0.172) -0.257 (0.113)

-0.351 (0.282) -0.366 (0.266) -0.396 (0.202) -0.254 (0.142)

-0.111 (0.108) -0.110 (0.107) -0.168 (0.100) -0.107 (0.085)

-0.124 (0.134) -0.131 (0.132) -0.165 (0.121) -0.121 (0.090)

-0.505 (0.336) -0.503 (0.292) -0.483 (0.213) -0.292 (0.154)

-0.452 (0.253) -0.455 (0.240) -0.457 (0.152) -0.252 (0.098)

-0.183 (0.200) -0.203 (0.197) -0.264 (0.177) -0.205 (0.154)

-0.243 (0.212) -0.252 (0.204) -0.272 (0.152) -0.183 (0.109)

-0.182 (0.159) -0.193 (0.159) -0.256 (0.145) -0.205 (0.119)

-0.034 (0.073) -0.056 (0.085) -0.088 (0.107) -0.112 (0.134)

34

VERY POOR HEALTH POOR HEALTH FAIR HEALTH GOOD HEALTH UK (w1-3, N*T 12587) VERY POOR HEALTH POOR HEALTH FAIR HEALTH GOOD HEALTH -0.173 (0.148) -0.184 (0.137) -0.183 (0.110) -0.086 (0.059)

-0.317 (0.239) -0.337 (0.233) -0.397 (0.187) -0.257 (0.146) -0.174 (0.147) -0.184 (0.136) -0.183 (0.110) -0.083 (0.057)

-0.316 (0.239) -0.335 (0.233) -0.396 (0.188) -0.257 (0.147) -0.177 (0.148) -0.187 (0.135) -0.189 (0.109) -0.071 (0.048)

-0.321 (0.224) -0.334 (0.213) -0.368 (0.161) -0.225 (0.118)

1. The partial effects are computed for each individual using their observed values of the regressors. The table presents the sample mean of the partial effects – the APE – along with the sample standard deviations in parentheses.

35

Table 10: Average partial effects on the probability of reporting very good SAH, ECHP-UDB

Country (waves)

AUSTRIA (w2-8, N*T 26368) Ln(INCOME)

SECONDARY (ISCED3-4) TERTIARY (ISCED5-7)

BELGIUM (w1-8, N*T 31699) Ln(INCOME)

SECONDARY (ISCED3-4) TERTIARY (ISCED5-7)

DENMARK (w1-8, N*T 26848) Ln(INCOME)

SECONDARY (ISCED3-4) TERTIARY (ISCED5-7)

FINLAND (w3-8, N*T 34439) Ln(INCOME)

SECONDARY (ISCED3-4) TERTIARY (ISCED5-7) FRANCE (w1-8, N*T 66988) Ln(INCOME)

SECONDARY (ISCED3-4) TERTIARY (ISCED5-7)

GERMANY (w1-3, N*T 16403) Ln(INCOME)

SECONDARY (ISCED3-4) TERTIARY (ISCED5-7) GREECE (w1-8, N*T 63826) Ln(INCOME)

SECONDARY (ISCED3-4) TERTIARY (ISCED5-7)

IRELAND (w1-8, N*T 37699) Ln(INCOME)

SECONDARY (ISCED3-4) TERTIARY (ISCED5-7) ITALY (w1-8, N*T 96509) Ln(INCOME)

SECONDARY (ISCED3-4) TERTIARY (ISCED5-7)

LUXEMBOURG (w1-3, N*T 3503) Ln(INCOME)

SECONDARY (ISCED3-4) TERTIARY (ISCED5-7)

NETHERLANDS (w1-8, N*T 55673) Ln(INCOME)

SECONDARY (ISCED3-4) TERTIARY (ISCED5-7)

PORTUGAL (w1-8, N*T 69236) Ln(INCOME)

SECONDARY (ISCED3-4) TERTIARY (ISCED5-7) SPAIN (w1-8, N*T 85111) Ln(INCOME)

SECONDARY (ISCED3-4) TERTIARY (ISCED5-7) UK (w1-3, N*T 12587) Ln(INCOME)

SECONDARY (ISCED3-4) TERTIARY (ISCED5-7)

Unbalanced sample

0.010 (0.008) 0.012 (0.009) 0.044 (0.033)

0.015 (0.010) 0.014 (0.009) 0.039 (0.024)

0.033 (0.013) 0.036 (0.014) 0.067 (0.026)

0.030 (0.017) 0.029 (0.017) 0.053 (0.029)

0.011 (0.008) 0.014 (0.011) 0.018 (0.014)

0.011 (0.008) 0.004 (0.004) 0.026 (0.020)

0.019 (0.009) 0.045 (0.021) 0.053 (0.024)

0.040 (0.014) 0.049 (0.016) 0.062 (0.020)

0.007 (0.005) 0.019 (0.014) 0.034 (0.024)

0.044 (0.029) 0.044 (0.028) 0.038 (0.024)

0.022 (0.014) 0.012 (0.007) 0.027 (0.017)

0.006 (0.008) 0.011 (0.015) 0.013 (0.018)

0.013 (0.009) 0.037 (0.024) 0.040 (0.026)

0.029 (0.015) 0.028 (0.014) 0.052 (0.026)

ECHP published

weights

0.014 (0.011) 0.013 (0.010) 0.045 (0.034)

0.012 (0.008) 0.016 (0.010) 0.042 (0.026)

0.030 (0.012) 0.036 (0.014) 0.069 (0.026)

0.029 (0.017) 0.036 (0.021) 0.062 (0.033)

0.010 (0.008) 0.013 (0.011) 0.018 (0.014)

0.006 (0.005) 0.002 (0.002) 0.021 (0.016)

0.020 (0.009) 0.046 (0.020) 0.051 (0.023)

0.043 (0.015) 0.053 (0.017) 0.066 (0.021)

0.005 (0.004) 0.020 (0.014) 0.031 (0.022)

0.045 (0.029) 0.054 (0.034) 0.044 (0.027)

0.018 (0.012) 0.016 (0.010) 0.032 (0.019)

0.006 (0.009) 0.010 (0.014) 0.012 (0.016)

0.015 (0.010) 0.038 (0.025) 0.038 (0.025)

0.027 (0.014) 0.029 (0.015) 0.056 (0.027)

-0.002 (0.001) 0.017 (0.013) 0.045 (0.033)

0.012 (0.007) 0.007 (0.004) 0.048 (0.027)

0.039 (0.015) 0.020 (0.008) 0.049 (0.018)

0.022 (0.011) 0.031 (0.016) 0.088 (0.042)

0.009 (0.007) 0.019 (0.015) 0.017 (0.014)

-0.017 (0.014) -0.007 (0.006) 0.044 (0.034)

0.014 (0.007) 0.049 (0.023) 0.054 (0.027)

0.031 (0.010) 0.064 (0.020) 0.061 (0.019)

-0.001 (0.000) 0.036 (0.026) 0.041 (0.028)

0.083 (0.046) -0.066 (0.039) -0.043 (0.025)

0.025 (0.016) 0.025 (0.015) 0.019 (0.011)

0.003 (0.005) 0.006 (0.009) 0.007 (0.011)

0.005 (0.004) 0.049 (0.031) 0.028 (0.018)

0.054 (0.024) -0.011 (0.005) 0.022 (0.010) Inverse probability weights IPW-2

1. The partial effects are computed for each individual using their observed values of the regressors. The table presents the sample mean of the partial effects – the APE – along with the sample standard deviations in parentheses.

36

因篇幅问题不能全部显示,请点此查看更多更全内容

Copyright © 2019- xiaozhentang.com 版权所有

违法及侵权请联系:TEL:199 1889 7713 E-MAIL:2724546146@qq.com

本站由北京市万商天勤律师事务所王兴未律师提供法律服务