Skip to main content
  • Research article
  • Open access
  • Published:

A comparison of the test-negative and the traditional case-control study designs for estimation of influenza vaccine effectiveness under nonrandom vaccination

Abstract

Background

As annual influenza vaccination is recommended for all U.S. persons aged 6 months or older, it is unethical to conduct randomized clinical trials to estimate influenza vaccine effectiveness (VE). Observational studies are being increasingly used to estimate VE. We developed a probability model for comparing the bias and the precision of VE estimates from two case-control designs: the traditional case-control (TCC) design and the test-negative (TN) design. In both study designs, acute respiratory illness (ARI) patients seeking medical care testing positive for influenza infection are considered cases. In the TN design, ARI patients seeking medical care who test negative serve as controls, while in the TCC design, controls are randomly selected individuals from the community who did not contract an ARI.

Methods

Our model assigns each study participant a covariate corresponding to the person’s health status. The probabilities of vaccination and of contracting influenza and non-influenza ARI depend on health status. Hence, our model allows non-random vaccination and confounding. In addition, the probability of seeking care for ARI may depend on vaccination and health status. We consider two outcomes of interest: symptomatic influenza (SI) and medically-attended influenza (MAI).

Results

If vaccination does not affect the probability of non-influenza ARI, then VE estimates from TN studies usually have smaller bias than estimates from TCC studies. We also found that if vaccinated influenza ARI patients are less likely to seek medical care than unvaccinated patients because the vaccine reduces symptoms’ severity, then estimates of VE from both types of studies may be severely biased when the outcome of interest is SI. The bias is not present when the outcome of interest is MAI.

Conclusions

The TN design produces valid estimates of VE if (a) vaccination does not affect the probabilities of non-influenza ARI and of seeking care against influenza ARI, and (b) the confounding effects resulting from non-random vaccination are similar for influenza and non-influenza ARI. Since the bias of VE estimates depends on the outcome against which the vaccine is supposed to protect, it is important to specify the outcome of interest when evaluating the bias.

Peer Review reports

Background

Influenza vaccine effectiveness (VE) has to be re-estimated in every season because predominant influenza virus types, subtypes and phenotypes change from one season to the next, necessitating a new vaccine targeting different strains in most seasons. As annual influenza vaccination is now widely recommended, randomized clinical trials for estimating VE are no longer ethical in many populations, and observational studies based on patients seeking medical care for acute respiratory illnesses (ARI) are the most efficient, and hence most widely used option. However, observational studies for estimating VE are prone to multiple sources of bias.

In this paper we present a new probability model for comparing the bias and precision of VE estimates from two popular case-control study designs under nonrandom vaccination, i.e., vaccination probabilities may depend on a covariate. In both study designs, ARI patients seeking medical care who test positive for influenza infection are considered cases. In the test-negative (TN) design, ARI patients seeking medical care who test negative for influenza infection serve as controls, while in the traditional case-control (TCC) design, controls are randomly selected individuals who did not contract an ARI, usually from the same community from which the cases came. The TN design was introduced in 2007 [1], and most of the influenza VE case-control studies conducted since then have used this study design. However, TCC studies are still being used occasionally [24]. TCC studies are usually costlier and more resource intensive due to the need to recruit controls through a separate mechanism.

Estimates of VE from case-control studies may be subject to the following sources of bias:

(a) Probabilities of non-influenza ARI may depend on vaccination status: In TN studies, individuals with non-influenza ARI serve as controls. Therefore, TN studies may produce biased estimates of VE unless vaccinees and non-vaccinees are equally likely to develop non-influenza ARI. The validity of this assumption has not yet been confirmed. De Serres et al. [5] used data from randomized clinical trials to argue that this assumption is usually satisfied. However, a randomized influenza vaccine trial [6] found that vaccinees had a significantly increased risk of virologically-confirmed non-influenza infection (that may lead to ARI) as compared to those who received the placebo.

(b) Probabilities of influenza and non-influenza ARIs may depend on confounders: Covariates such as health status, age, exposure, education and socio-economic status may be associated with both the likelihood of being vaccinated and the likelihood of developing influenza and non-influenza ARIs.

(c) Vaccination may affect probability of seeking medical care in influenza patients: Several studies [79] suggest that vaccinated individuals who contract influenza may have milder symptoms than unvaccinated influenza patients, and therefore may be less likely to seek medical care. This effect of vaccination is not expected to change health-care-seeking behavior of non-influenza ARI patients.

(d) Probabilities of seeking medical care against ARIs may depend on confounders: Since only ARI patients who seek medical care may be included in a TNC study, and only influenza patients who seek care may be included as cases in TCC studies, covariates that are associated with both the likelihood of being vaccinated and the likelihood of seeking care against ARI may contribute to the bias of influenza VE estimates.

(e) Misclassification bias: Diagnostic tests for influenza viruses are not 100% sensitive and specific. Vaccination status may also be misclassified.

In this work we consider the first four sources of bias. To focus on these sources, we ignore misclassification biases which are known to result in negative bias (i.e., bias toward lower estimation of VE) and are common to all studies that rely on results of diagnostic tests.

The goal of this article is to evaluate and compare the bias and precision of estimates of VE resulting from TN and TCC studies. As we will see, the bias of VE estimates may depend on the outcome of interest, i.e., the outcome against which the vaccine is expected to protect. We consider two outcomes of interest, symptomatic influenza (SI) and medically-attended influenza (MAI). In both the TN and TCC study designs, only influenza patients seeking medical care are considered cases. Therefore, one expects these study designs to produce estimates of VE against MAI. However, the media usually reports VE estimates from these case-control studies as ‘vaccine effectiveness against influenza’, without including the ‘medically-attended’ clause. As a result, the public may interpret these estimates as the effectiveness of the vaccine against any influenza illness, i.e., VE against SI. One of the objectives of this work is to highlight the importance of (a) clearly specifying the outcome against which the vaccine is supposed to protect, and (b) understanding that the bias of a VE estimate may be different for the two outcomes of interest.

We will (a) evaluate the bias of each of the VE estimates for each of the outcomes by comparing the expected value of the estimate with the true VE, and (b) evaluate the standard errors of the VE estimates. To perform these evaluations and comparisons, we developed a detailed stepwise probability model of the process involved in collecting data in these studies and deriving VE estimates. The model includes a covariate, health status, that may be associated with both the likelihood of being vaccinated and the propensity of seeking medical care against ARI. This allows us to assess the effects of nonrandom vaccination on the bias of VE estimates.

Methods

We first describe the real-life process involved in conducting the two types of case-control studies and obtaining the estimates of VE. We then describe the model we developed to mimic this process.

The study population

The source population for both types of case-control studies consists of all individuals receiving most of their medical care at a single clinic or at a specific network of clinics. Since influenza VE varies by age, we can assume that the model pertains to a subpopulation corresponding to a single age group.

The study designs

When a member of the study population develops an ARI, s/he may decide to report to a clinic for treatment. At the clinic, the health care provider may ask the person to be tested for influenza viruses. If the person agrees then a swab is taken and sent to a laboratory for testing. In both study designs, a person who tests positive is eligible to be considered a case. In a TN study, an individual who tests negative is eligible to be considered a control. In a TCC study, controls are randomly selected members of the study population who have not developed ARI prior to their inclusion in the study. Usually, one or more controls are selected right after a case is identified. In both study designs, the vaccination status of every case or control is determined from manual or electronic records, or from oral histories.

Outcome of interest and true VE

In this work we evaluate estimates of VE when the outcome of interest is either SI or MAI. SI is sometimes called ‘influenza illness’ or ‘influenza ARI’. Surveillance for SI is needed in the entire study population, and for persons ill with compatible illnesses, samples of influenza are taken for verification. A person is considered a true case of SI if s/he has an ARI and is infected by an influenza virus. For MAI, a true case is defined as a person who is influenza-infected, develops an ARI, and seeks medical care. In both cases, the true VE is defined as one minus the ratio of the probability of the outcome of interest in vaccinees and non-vaccinees.

Estimation of VE and bias of VE estimates

In this work we focus on identifying the main sources of bias and their effects on the performance of the VE estimates. Some of these biases can be adjusted for in the analysis, but this is beyond the scope of the current work. In case-control studies, VE is usually estimated as one minus the odds ratio (OR) of being vaccinated in cases vs. controls. The bias of the estimate is defined as the difference between the expectation of the estimated VE and the true VE.

The model

The model we developed for comparing the estimates from the two study designs follows the scheme described above with a few simplifications. We assume that (a) when a person seeks medical care for ARI then her/his probability of being tested for influenza viruses does not depend on vaccination status, health status, or on the actual cause of ARI (influenza/non-influenza); (b) given a person’s symptoms and influenza infection status, the sensitivity and specificity of the test do not depend on the tested person’s vaccination or health status; (c) a person’s vaccination status is determined without error; and (d) controls in a TCC study are selected at random from all asymptomatic individuals in the study population (See “The study population” section).

Our model includes a covariate, health status, and we assume that a person’s probabilities of being vaccinated, developing an ARI, and seeking medical care against ARI may be associated with her/his health status. In this way, the model generates possible confounding effects linking vaccination status with the probabilities of being included in the study and of becoming a case or a control.

The model consists of five steps, where the value of a single variable is determined at each step. The probability distribution of this variable may depend on the values of the variables from the previous steps. Below we define the five steps, the associated variables, and the probabilities determining each variable’s distribution.

Step 1: Health status

A person can be classified as “healthy” or “frail”. Define a binary variable X, where X=1 for a “healthy” person and X=0 for a “frail” person. Denote π=P(X=1).

Step 2: Vaccination

A person may be vaccinated against influenza. Define a binary variable V, where V=1 for a vaccinated person. The probability of being vaccinated may depend on health status; therefore, denote α x =P(V=1|X=x), x=0,1.

Step 3: Influenza infection and ARI

During the influenza season, a person may become infected with an influenza virus and develop an ARI. This outcome is referred to as “influenza ARI” (FARI), where “F” stands for flu. A person may also develop an ARI not resulting from influenza infection. This outcome is referred to as “non-influenza ARI” (NFARI). We therefore define an outcome variable Y with 3 categories as follows: Y=0 for no ARI, Y=1 for NFARI, and Y=2 for FARI. The distribution of Y depends on the person’s vaccination status, V, and health status, X. We denote β vx =P(Y=1|V=v,X=x), v=0,1, x=0,1 and γ vx =P(Y=2|V=v,X=x) for v=0,1, x=0,1 with β vx +γ vx ≤1 for all v,x. Here we assume the “leaky vaccine” model, in which the vaccine provides a reduction in the probability of influenza transmission to the vaccinated person, rather than complete immunity [10]. Under this model, a vaccinee has a lower probability of becoming infected than a non-vaccinee, but is not rendered completely immune from influenza infection.

Step 4: Seeking medical care for ARI

A person with ARI may seek medical care and, in this case, be tested for influenza viruses. We define a binary variable M with M=1 for a person seeking medical care for her/his ARI. The probability of this event depends on Y (only individuals with ARI seek medical care), and it may be different for FARI and NFARI patients. In addition, the conditional distribution of M given Y may depend on X and V. We therefore define δ yvx =P(M=1|Y=y,V=v,X=x), where y=1,2, v=0,1 and x=0,1.

In order to reduce the number of parameters, we make two simplifying assumptions regarding the probabilities of seeking medical care: (1) the effect of health status on probability of seeking medical care does not depend on vaccination status or type of ARI; (2) the effect of vaccination status on probability of seeking medical care does not depend on health status (but it may depend on type of ARI).

Define a “standard person” as a person with X = 0 and V = 0. For a “standard person”, we define δ SN , δ SF as follows:

  • δ SN =P(M=1|Y=1,V=0,X=0)=δ 100

  • δ SF =P(M=1|Y=2,V=0,X=0)=δ 200

In addition, we define two multipliers:

  • λ = multiplier for x = 1; λ does not depend on V and Y.

  • Ψ F = multiplier for v=1 only when y=2; Ψ F does not depend on X.

λ is the ratio of the probabilities of seeking medical care comparing a healthy and a frail person. Ψ F is the ratio of the probabilities of seeking care comparing a vaccinated and unvaccinated influenza ARI patient.

Then, { δ yvx } can be written in terms of δ SN , δ SF and the multipliers λ, Ψ F as follows:

  • δ 100=δ SN , δ 101=δ SN λ, δ 110=δ SN , δ 111=δ SN λ.

  • δ 200=δ SF , δ 201=δ SF λ, δ 210=δ SF Ψ F , δ 211=δ SF λΨ F .

Note: The multiplier Ψ F reflects the effect of severity of ARI in an influenza infected person. We assume that vaccination may reduce severity of symptoms, hence a vaccinated influenza patient may be less likely to seek care than an unvaccinated patient.

Step 5: Testing for influenza infection.

Although only individuals who seek medical care for an ARI are tested for influenza infection, it will be convenient to define a binary variable T as the (possibly unobserved) test result for any person with an ARI, regardless of whether or not s/he is actually tested. Define T=1 (T=0) if a person would test positive (negative) for influenza if tested. Because of assumption (b) above, the probability of testing positive given the person’s influenza infection status does not depend on X, V, or M. Denote τ y =P(T=1|Y=y) for y=1,2. Note that τ 1 is one minus the test’s specificity and τ 2 is the test’s sensitivity. In this study, we assume the test has 100% sensitivity and 100% specificity, i.e. P(T=1|Y=1)=τ 1=0 and P(T=1|Y=2)=τ 2=1.

Figure 1 shows the directed acyclic graph (DAG) of the model. Recent papers by Sullivan et al. [11] and Lipsitch et al. [12] discuss the use of DAGs to explore sources of bias of VE estimates from TN studies. A summary of the variables and parameters in our model is given in Table 1.

Fig. 1
figure 1

DAG of the model

Table 1 List of parameters and other notation

True VE in our model

When we evaluate the true VE, we assume that vaccination is done at random, i.e. for true VE we assume that vaccination status does not depend on health status X (α 0=α 1=α).

The true VE against SI is:

$${{\begin{aligned} {VET}_{SI} = 1 - {RRT}_{SI} \quad \text{where} \quad {RRT}_{SI} = \frac{P(Y=2|V=1)}{P(Y=2|V=0)}. \end{aligned}}} $$

The true VE against MAI is:

$$\begin{aligned} {VET}_{MAI} &= 1 - {RRT}_{MAI} \quad \text{where} \quad \\ {RRT}_{MAI} &= \frac{P(Y=2,M=1|V=1)}{P(Y=2,M=1|V=0)}. \end{aligned} $$

Using the parameters defined above, V E T SI and V E T MAI can be written as:

$$\begin{array}{*{20}l} &{} {VET}_{SI} = 1 - {RRT}_{SI} = 1 - {\frac{\gamma_{10}(1-\pi)+\gamma_{11}\pi}{{\gamma_{00}}(1-\pi)+\gamma_{01}\pi}}\\ &{} {VET}_{MAI} = 1 - {RRT}_{MAI} = 1 - {\frac{\Psi_{F}\left[\gamma_{10}(1-\pi)+\lambda\gamma_{11}\pi\right]}{\gamma_{00}(1-\pi)+\lambda\gamma_{01}\pi}}. \end{array} $$

The proofs of these results can be found in Appendix 1.

Estimates of VE in our model

In both the TN and TCC study designs, VE is estimated as one minus the odds ratio (OR) in the C×V table cross-classifying the individuals included in the study, where C is a binary indicator of case/control status with C=1 for a case. For convenience, the TN and TCC studies will be represented by the letters A and B, respectively. In a TN study, the case/control variable is denoted C A , where (C A =1)=(M=1,T=1) and (C A =0)=(M=1,T=0). Then the estimate of VE is: V E A =1−O R A , where

$${{\begin{aligned} {OR}_{A} = {\frac{P(C_{A}=1,V=1|M=1)P(C_{A}=0,V=0|M=1)}{P(C_{A}=1,V=0|M=1)P(C_{A}=0,V=1|M=1)}}. \end{aligned}}} $$

Note that all the probabilities condition on M=1 as only individuals who seek medical care for ARI are included in the TN study.

In a TCC study, the case/control variable is denoted C B . Cases are defined in the same way as in the TN study, i.e., (C B =1)=(M=1,T=1)=(C A =1). Controls are individuals included in a random sample drawn from all the asymptomatic individuals in the study population. In other words, (C B =0) is a random subset of (Y=0). In addition, we define a binary variable B indicating whether or not a person is included in the TCC study, i.e., (B=1)=(C B =1orC B =0). The VE estimate is based on the OR in the C B ×V table when all the probabilities condition on B=1: V E B =1−O R B , where

$${} {OR}_{B} = {\frac{P(C_{B}=1,V=1|B=1)P(C_{B}=0,V=0|B=1)}{P(C_{B}=1,V=0|B=1)P(C_{B}=0,V=1|B=1)}}. $$

Note that in a real-life study, the odds ratios are estimated from the relative frequencies of the corresponding events, rather than from their (unknown) probabilities. Therefore, the model-based estimates of VE defined above are actually the expected values of the observed estimates. For convenience we will continue to refer to them as “the VE estimates”.

Using the parameters defined above, V E A and V E B can be written as follows:

$${VE}_{A} = 1 - {\frac{\Psi_{F}\left[\gamma_{10}\alpha_{0}(1-\pi) + \lambda\gamma_{11}\alpha_{1}\pi\right]\left[\beta_{00}(1-\alpha_{0})(1-\pi) + \lambda\beta_{01}(1-\alpha_{1})\pi\right]}{\left[\gamma_{00}(1-\alpha_{0})(1-\pi) + \lambda\gamma_{01}(1-\alpha_{1})\pi\right]\left[\beta_{10}\alpha_{0}(1-\pi) + \lambda\beta_{11}\alpha_{1}\pi\right]}}, $$
$${VE}_{B} = 1 - {\frac{\Psi_{F}\left[\gamma_{10}\alpha_{0}(1-\pi) + \lambda\gamma_{11}\alpha_{1}\pi\right]\left[(1-\gamma_{00}-\beta_{00})(1-\alpha_{0})(1-\pi)+(1-\gamma_{01}-\beta_{01})(1-\alpha_{1})\pi\right]}{\left[\gamma_{00}(1-\alpha_{0})(1-\pi) + \lambda\gamma_{01}(1-\alpha_{1})\pi\right]\left[(1-\gamma_{10}-\beta_{10})\alpha_{0}(1-\pi)+(1-\gamma_{11}-\beta_{11})\alpha_{1}\pi\right]}}. $$

The proofs can be found in Appendix 2.

Bias and standard errors of estimates

The bias of an estimate of VE is the difference between the expected value of the estimate and the true VE. As the true VE depends on the outcome of interest (SI or MAI), the bias of each estimated VE will be evaluated separately for each of the two outcomes.

In Appendix 3 we use approximations based on the “Delta method” for the standard errors (SEs) of odds ratios [13] to derive expressions for the SEs of both VE estimates in terms of the parameters and the corresponding sample size(s). For evaluating the SEs we consider the observed odds ratios, where the probabilities are replaced by the corresponding observed relative frequencies.

The values of bias reported in the text and tables represent absolute numbers. For example, if the true VE is 60% (i.e., 0.6) and the range of bias (-0.40, -0.20). This means that the estimated VE varies from 0.20 (underestimating the true VE = 0.6 by 0.40) to 0.80 (overestimating the true VE by 0.20).

Probability ratios

Next, we define a few probability ratios comparing vaccinees and non-vaccinees or healthy and frail individuals. These ratios will be helpful in the presentation of the results (see Table 1 for a full list of the notations used in this paper).

  • \(\rho _{\beta } = {\frac {\beta _{1x}}{\beta _{0x}}}\), the ratio of the probabilities of NFARI comparing a vaccinated and an unvaccinated person of the same health status.

  • \(\eta _{\beta } = {\frac {\beta _{v1}}{\beta _{v0}}}\), the ratio of the probabilities of NFARI comparing a healthy and a frail person of the same vaccination status.

  • \(\rho _{\gamma } = {\frac {\gamma _{1x}}{\gamma _{0x}}}\), the ratio of the probabilities of FARI comparing a vaccinated and an unvaccinated person of the same health status.

  • \(\eta _{\gamma } = {\frac {\gamma _{v1}}{\gamma _{v0}}}\), the ratio of the probabilities of FARI comparing a healthy and a frail person of the same vaccination status.

The parameters λ and Ψ F defined earlier are also probability ratios:

  • \(\lambda = {\frac {\delta _{yv1}}{\delta _{yv0}}}\) The ratio of the probabilities of seeking medical care comparing a healthy and a frail person of the same vaccination status. We assume that this ratio is the same for FARI and NFARI patients.

  • \(\Psi _{F} = {\frac {\delta _{21x}}{\delta _{20x}}}\) The ratio of the probabilities of seeking medical care comparing a vaccinated and an unvaccinated FARI patient of the same health status.

Table 2 presents the main sources of bias that can be identified from our model. The absence of bias A is essential for the validity of the TN design, since the VE estimate from this design is based on comparing the odds of being vaccinated in FARI patients (cases) and NFARI patients (controls). This bias may be a result of virus interference [6] (if vaccinees are more likely than non-vaccinees to contract NFARI, then the estimated VE will be falsely high). Biases B1 and B2 represent the effects of health status on the probabilities of NFARI and FARI, respectively. These effects, which are sometimes called the ‘healthy vaccinee effect’, represent the confounding resulting from association of health status with the probability of exposure (vaccination) and the outcome. Bias BS is a special case of B1∩B2. It results when health status affects both the probabilities of FARI and NFARI but the risk ratios comparing a healthy and a frail person are the same for the both types of ARIs. Bias C represents the effect of vaccination status on the probability of seeking care in patients with SI. This effect may be due to less severe symptoms in vaccinated persons compared to unvaccinated ones. As stated earlier, we assume perfect sensitivity and specificity of the influenza test (τ 1=0, τ 2=1), as it is well-known that misclassifications result in negatively-biased estimates of effectiveness.

Table 2 Sources of bias

Results

Sources of bias

We first state conditions for the unbiasedness of the VE estimate based on the TN design. The proofs of these results can be found in Appendix 4.

  • Under random vaccination (α 0=α 1), the estimate of VE when the outcome of interest is SI is unbiased if biases A and C are absent. When the outcome of interest is MAI, the estimate of VE is unbiased if bias A is absent.

  • Under non-random vaccination (α 0α 1), the estimate of VE when the outcome of interest is SI is unbiased if biases A, B1, B2, and C are absent. When the outcome of interest is MAI, the estimate of VE is unbiased if biases A, B1, and B2 are absent.

It is interesting to note that the absence of any source of bias, the OR-based VE estimate from a TN study is unbiased even if the ‘rare disease’ assumption is not satisfied, while the OR-based estimate from a TCC study is biased. To show this, let’s use the following simplified notation: α = probability of being vaccinated, β = probability of NFARI, γ 0 and γ 1 = probabilities of FARI in unvaccinated and vaccinated, respectively, and δ = probability of seeking care. Then the true VE is 1−ρ, where ρ = γ 1/ γ 0 is the risk ratio. In a TN study, the probabilities of vaccinated and unvaccinated cases are αδγ 1, and (1−α)δγ 0, respectively. The corresponding probabilities of controls are αδβ, and (1−α)δβ, respectively. Then the OR in the table of case-control status by vaccination status equals to ρ, i.e. the true risk ratio, implying that the estimated VE is unbiased. In a TCC studies, the probabilities of cases are the same as in the TN study, while the probabilities of vaccinated and unvaccinated controls (individuals without ARI) are ϕα(1−γ 1β) and ϕ(1−α)(1−γ 0β), respectively, where ϕ is the sampling fraction of controls. Hence, the OR in the TCC study is [ρ(1−γ 0β)]/(1−ργ 0β). This OR is less than ρ (the true RR) if ρ> 0, hence the estimated VE exceeds the true VE in a TCC study as long as the true VE is positive.

Next we explore the magnitude of the effects of various sources of bias and their combinations. We consider three scenarios for vaccination probabilities (see Table 3). In Table 4 we present the range and the maximum absolute value of the bias of VE estimates resulting from TN and TCC studies under the three vaccination scenarios and various combinations of sources of bias. For these results we used the following baseline values of some of the parameters: π=0.7, β 00=0.2, γ 00=0.1, δ SN =0.2, δ SF =0.3, ρ γ =0.4. π is the probability of being ‘healthy’; β 00 and γ 00 are the probabilities of NFARI and FARI, respectively, for an unvaccinated ‘frail’ person; δ SN and δ FN are the probabilities of seeking medical care for NFARI and FARI, respectively, for an unvaccinated ‘frail’ person; ρ γ is the risk ratio comparing the probability of FARI for a vaccinated and an unvaccinated person - thus, the true VE against SI is 1 - 0.4 = 0.6 (60%). The values of β’s, γ’s are based on data from various randomized placobo-controlled trials (see [14], Table A1), and the values of δ are based on data from several observational studies. In all the tables, figures and examples, values of VE are presented as fractions, rather than percentages.

Table 3 Three scenarios for vaccination probabilities
Table 4 Estimate of VE against symptomatic influenza and medically-attended influenza: range of bias and maximum absolute value of bias under various combinations of sources of bias

In the calculations for Tables 4 and 5, when a source of bias was present we used a reasonable range for the corresponding probability ratio. When bias A was present, ρ β was allowed to vary from 0.5 to 2.0. For biases B1, B2, and BS, we allowed η β and η γ to vary between 0.5 and 1.0, since one would not expect frail persons to have lower probabilities of ARI, compared to healthy persons. For bias C, the ratio Ψ F could vary between 0.5 to 1.0, since one would expect vaccination to reduce the probability that a person with SI will seek medical care compared to a person with ARI resulting from a different pathogen. For bias D, we let λ vary between 0.5 to 2.0.

Table 5 Minimum, mean and maximum standard errors of VE estimates under various combinations of source of bias

For each combination of two or more sources of bias, we calculated the minimum, mean, and maximum of the bias and the absolute values of the bias by allowing the probability ratios that are not fixed to vary independently in the ranges specified above. For example, when biases A, B1, and B2 are absent, we used ρ β =η β =η γ =1,0.5≤Ψ F ≤1,0.5≤λ≤2.

Summary of results

The impact of sources of bias

Our model allows us to evaluate the impact of the sources of bias listed in Table 2. Each source of bias is a result of a possible effect of vaccination or health status on the probability of FARI or NFARI or seeking care. Below we summarize our results for each of the sources of bias. We also use numerical examples to illustrate the magnitude and direction associated with each source of bias. Unless otherwise specified, the true VEs against SI and MAI are 0.6 (60%). In each of these examples we assume that only one source of bias is present.

  1. (1)

    Vaccination affects the probability of NFARI (bias A)

    • This bias does not depend on vaccination scenario nor on the outcome of interest (SI or MAI).

    • Estimates of VE from TN studies may suffer from severe bias.

    • This effect also affects the bias of VE estimates from TCC studies, though to a lesser extent.

    • Example: As the ratio of the probability of NFARI comparing vaccinated and unvaccinated persons varies from 0.5 to 2.0, VE estimates from TN studies range from 0.2 to 0.8, respectively, while VE estimates from TCC studies range from 0.67 to 0.50, respectively (Fig. 2).

      Fig. 2
      figure 2

      True and estimated VEs as a function of R1 = P(NFARI if vaccinated)/P(NFARI if unvaccinated) when only bias A is present

  1. (2)

    Health status affects the probabilities of FARI and NFARI (biases B1, B2 - the ‘ healthy vaccinee effect ’)

    • The bias does not depend on the outcome of interest (SI or MAI).

    • Under non-random vaccination, these effects may result in substantial bias of VE estimates from TN or TCC studies. However, this bias is usually less severe compared to the biases resulting from sources A, C and D.

    • If the effect of health status on the probability of ARI is the same for FARI and NFARI, i.e., bias BS is present, then the TN-based estimates of VE are unbiased.

    • Example: Suppose that the probabilities of vaccination are 0.8 and 0.4 for healthy and frail persons, respectively. Consider three cases regarding the risk ratios P(ARI in a healthy person) / P(ARI in a frail person): (a) When these risk ratios are 0.5 for NFARI and 0.8 for FARI, then the estimated VEs from TN and TCC studies are 0.51 and 0.67, respectively. (b) When the risk ratios are 0.8 for NFARI and 0.5 for FARI, then estimated VEs from TN and TCC studies are 0.67 and 0.72, respectively. (c) When the risk ratios for NFARI and FARI are equal and their common value ranges from 0.5 to 1.0, then estimated VEs from TN studies are always unbiased (i.e., they equal 0.6), while estimates from TCC studies range from 0.63 to 0.73. In Figs. 3 and 4, we set the risk ratio for NFARI to 0.75 and let the risk ratio for FARI vary between 0.5 to 1.0.

      Fig. 3
      figure 3

      True and estimated VE’s when only biases B1 and B2 are present. We set the risk ratio P(NFARI if healthy)/P(NFARI if frail) =0.75 and let the risk ratio R2 = P(FARI if healthy)/P(FARI if frail) vary between 0.5 to 1.0. The probabilities of vaccination are 0.4 and 0.8 for healthy and frail persons, respectively

      Fig. 4
      figure 4

      True and estimated VEs when only biases B1 and B2 are present. We set the risk ratio P(NFARI if healthy)/P(NFARI if frail) =0.75 and let the risk ratio R2 = P(FARI if healthy)/P(FARI if frail) vary between 0.5 to 1.0. The probabilities of vaccination are 0.8 and 0.4 for healthy and frail persons, respectively

  1. (3)

    Vaccination affects the probability of seeking medical care for FARI, but it does not affect the probability of seeking care for NFARI (bias C)

    • When this effect is present then the true VEs against SI and MAI may be different as the vaccine directly affects (reduces) the probability of seeking care in influenza cases, but not in controls. Thus the estimates’ bias may depend on the outcome of interest.

    • If all other sources of bias are absent, the bias of VE estimates does not depend on the vaccination scenario.

    • Estimates of VE from TN or TCC studies may be severely biased when the outcome of interest is SI.

    • When the outcome of interest is MAI, estimates of VE from TN studies are unbiased, while the bias of estimates from TCC studies is usually small and is not affected by the magnitude of the effect underlying this source of bias.

    • Example: Let the ratio R = P(seeking medical care against FARI if vaccinated) / P(seeking medical care against FARI if unvaccinated) vary from 0.5 to 1.0. Then the true VE against SI remains fixed at 0.6, while the true VE against MAI varies with R from 0.8 to 0.6. The estimated VEs from TN studies equal the true VE against MAI for all values of R, while the estimated VEs from TCC studies vary from 0.82 to 0.63 (see Fig. 5). For example, when R=0.5 then the true VE against MAI is 0.80, and the VE estimates from TN and TCC studies are 0.80 and 0.82, respectively. This translates into severe bias when the outcome of interest is SI but small bias when the outcome of interest is MAI.

      Fig. 5
      figure 5

      True and estimated VEs when only bias C is present as function of R3 = P(seeking medical care against FARI if vaccinated)/P(seeking medical care against FARI if unvaccinated)

  1. (4)

    Health status affects the probabilities of seeking care against FARI and NFARI (bias D)

    • The bias of VE estimates does not depend on the outcome of interest (SI or MAI).

    • In the absence of other sources of bias, VE estimates from TN studies are unbiased regardless of the vaccination scenario.

    • Under non-random vaccination, this effect may result in substantial bias in VE estimates from TCC studies.

    • Example: We assume that the probabilities of seeking care do not depend on vaccination status. As the ratio of the probabilities of seeking care comparing healthy and frail individuals varies from 0.5 to 2.0, VE estimates from TN studies remain fixed at 0.6 (i.e., they are unbiased) under both random and non-random vaccination. When the probabilities of vaccination are 0.8 and 0.4 for healthy and frail persons, respectively, the VE estimates from TCC studies vary from 0.72 to 0.53 (Figs. 6 and 7).

      Fig. 6
      figure 6

      True and estimated VEs when only bias D is present as function of R4 = P(seeking medical care if healthy)/P(seeking medical care if frail). Probabilities of vaccination are 0.8 and 0.4 for healthy and frail persons, respectively

      Fig. 7
      figure 7

      True and estimated VEs when only bias D is present as function of R4 = P(seeking medical care if healthy)/P(seeking medical care if frail). Probabilities of vaccination are 0.4 and 0.8 for healthy and frail persons, respectively

In addition, we found that in some cases the true VEs against SI and MAI are different. Hence, the bias of VE estimates may depend on the outcome against which the vaccine is supposed to protect. For example, if the only sources of bias are BS, C, and D then, the VE estimate from TN studies is unbiased when considering effectiveness against MAI. The same estimate may overestimate the true VE against SI by 0.20 (i.e. 20%).

Comparison of the bias of VE estimates from TN and TCC studies:

  • If one is concerned that vaccination may affect the probability of non-influenza ARI, then one should prefer the TCC study design. However, TCC-based VE estimates may still be biased in this case. For example, when the ratio of the probability of NFARI comparing a vaccinated and an unvaccinated person is 0.5, then the bias of VE estimate from TN study is -0.4 while the bias of VE estimate from TCC study is 0.07.

  • Under non-random vaccination, effects of health status on probabilities of influenza and non-influenza ARI (the ‘healthy vaccinee effect’) may bias VE estimates from both study designs. In general, TN-based estimates perform slightly better than TCC-based estimates when this effect is believed to be the main source of bias. If the effect of health status is similar for FARI and NFARI, then the TN design produces less biased estimates compared to the TCC design. For example, suppose the probabilities of vaccination are 0.4 and 0.8 for healthy and frail persons, respectively. When the risk ratios for NFARI and FARI are both 0.75, then the VE estimate from TN study is unbiased, while the bias of VE estimate from TCC study is 0.07.

  • If one assumes that vaccination does not affect the probability of non-influenza ARI but one is concerned that vaccinated influenza patients are less likely to seek care than unvaccinated patients (because of reduced symptoms severity), then VE estimates may suffer from severe bias in both study designs when the outcome of interest is SI. In this case, the bias of TN-based estimates may be somewhat smaller than that of TCC-based estimates. This source of bias does not affect VE estimates when the outcome of interest is MAI! For example, suppose that the ratio comparing vaccinated and unvaccinated FARI cases w.r.t. the probability of seeking medical care is 0.5. When the outcome of interest is SI, then the bias of a VE estimate from TN study is 0.2 and the bias of a VE estimate from TCC study is 0.22. When the outcome of interest is MAI, then the VE estimate from a TN study is unbiased, while the bias of a VE estimate from a TCC study is 0.02.

  • Under non-random vaccination, the TN study design is preferable to the TCC design if one is concerned about bias resulting from possible effect of a person’s health status on her/his probability of seeking care against ARI. For example, suppose that the probabilities of vaccination are 0.8 and 0.4 for healthy and frail persons, respectively. When the ratio of the probabilities of seeking medical care comparing healthy and frail persons is 0.5, then the VE estimate from TN study is unbiased while the bias of VE estimate is 0.12.

Precision of VE estimates

Table 5 presents the standard errors of VE estimates from TN and TCC studies. From this table we conclude that:

  • Non-random vaccination may reduce precision of VE estimates.

  • If the probability of NFARI is associated with vaccination status, then VE estimates from TN studies are somewhat less precise compared to VE estimates from TCC studies, although the differences in precision were small.

  • If the probability of NFARI is not associated with vaccination status, then the precision of VE estimates from TN and TCC studies are similar.

Discussion and conclusions

We developed a new model for the evaluation of the bias and precision of influenza estimates from case-control studies. The new model is more comprehensive than previously suggested models [5, 1418] for the following reasons:

  • It allows assessment of the impact of non-random vaccination.

  • It incorporates a confounder (health status) which links vaccination status with the probabilities of ARI and of seeking medical care for these ARIs.

  • By including parameters corresponding to the probabilities of seeking medical care, the model allows us to examine the effect of association of these probabilities with vaccination and health status on the bias of VE estimates.

  • The model allows evaluating and comparing the precision of VE estimates.

Some of the sources of bias discussed here have been identified and addressed in earlier publications, but, to our best knowledge, none of the previous papers present a comprehensive discussion of all the possible sources of bias that may arise under a given model. In addition, the current model attributes the associations between the various factors involved in estimation of VE (vaccination, contracting influenza and non-influenza ARIs and seeking medical care) to an underlying covariate. Previously published models, including an earlier version of our model [14], included parameters representing these associations but these associations were not based on a common underlying factor. Therefore, we believe that the current results and conclusions may differ from those derived from less structured models.

Our calculations confirm earlier findings [15] that when the probability of non-inluenza ARI depends on vaccination status, VE estimates from test-negative studies may be severely biased. However, even when this probability is not affected by vaccination, VE estimates from the two types of case-control studies considered in this work may suffer from substantial bias. In addition to the well-known ‘healthy vaccinee effect’ (probabilities of vaccination and of ARI depend on health status), bias of VE estimates may result from heterogeneities in health-care-seeking behaviors. Specifically, if vaccination reduces the probability that an influenza patient seeks medical care (because her/his symptoms are less severe than those of an unvaccinated influenza patient) then VE estimates from TN or TCC studies may grossly overestimate the true VE against SI. On the other hand, when the outcome of interest is MAI then the biases resulting from vaccine-related reduction in symptoms’ severity are very small. Recent papers [79] found evidence of vaccine-associated reduction in influenza patient’s symptoms severity. The effects of health-care-seeking behaviors on VE estimates from studies in which only ARI patients who seek medical care may become cases need to be further investigated.

The results of this work lead to the following conclusions:

  • In general, estimates of influenza VE from case-control studies where only ARI patients seeking medical care are tested for influenza infection may suffer from severe bias, i.e. an absolute bias of 20% or more, especially when the outcome of interest is SI.

  • The bias of VE estimates may depend on the outcome against which the vaccine is supposed to protect. When the outcome of interest is MAI, seeking medical care is a component of the outcome. In other words, the true VE against MAI reflects the vaccine effect on seeking medical care and on contracting influenza. This explains why true VE against MAI may differ from true VE against SI. When bias C is present, the vaccine directly affects (reduces) the probability of seeking care in influenza cases, but not in controls. As a result, VE against MAI is lower than VE against SI.

  • Influenza VE estimates from TN studies are usually presented as ‘VE against medically-attended influenza’. However, the media and lay persons may interpret these VE estimates simply as the protective effectiveness of vaccination against contracting influenza illness, i.e. VE against SI. Health authorities and the public should be made aware of this distinction.

  • When the outcome of interest is SI, the TN design provides valid estimates (i.e., no or small bias) if the following assumptions are satisfied: (a) vaccination does not affect the probability of non-influenza ARI, (b) effects of confounding variables on the probabilities of influenza and non-influenza ARI are similar, and (c) vaccination does not affect the probabilities of seeking medical care for influenza ARI due to reduced severity of symptoms. When the outcome of interest is MAI, then only assumptions (a) and (b) are necessary for obtaining a valid VE estimate from a TN study.

  • Estimates of VE from TCC studies have small bias when the outcome of interest is SI if assumptions (a) and (c) are satisfied, assumption (b) is replaced by the stronger assumption (b*) of no presence of confounding, and the additional assumption (d) that the probabilities of seeking medical care for ARI are not affected by potential confounders is satisfied. When the outcome of interest is MAI, then TCC-based estimates of VE have small bias under assumptions (a), (b*), and (d).

  • It is important to collect more data on health-care-seeking behaviors of ARI patients and to study the effects of vaccination and potential confounders on these behaviors.

In summary, the test negative design produces less biased VE estimates, compared to the traditional case-control design provided that vaccination does not modify the probability of non-influenza ARI. However, this very popular study design may still produce biased estimates of influenza VE, especially when the outcome of interest is symptomatic influenza. One can expect monitored cohort studies, where every study participant reporting an ARI is tested for influenza infection, to provide less biased estimates of VE against SI. In a future publication we plan to compare the bias of these cohort studies, which are much more expensive, with that of TN studies.

Our study has a few limitations:

  • In order to focus on bias associated with the study designs, we ignored bias resulting from misclassification of infection and vaccination status.

  • Our model does not account for the dynamics of outbreaks of influenza and other ARI-causing infections.

  • We only consider unadjusted VE estimates as we tried to focus on sources of bias rather than on how one can reduce bias using standard or novel statistical techniques [19].

In the future we plan to improve the model by incorporating dynamics of the related processes. We also plan to use stochastic simulations to assess bias and precision of influenza VE estimates for other study designs (e.g. cohort studies) and to propose new study designs resulting in less biased VE estimates.

Appendix 1

True VE‘s in our model

The true VE against SI is:

$${} \begin{aligned} {VET}_{SI} = 1 - {RRT}_{SI} \quad \text{where} \quad {RRT}_{SI} = {{P(Y=2|V=1)}\over {P(Y=2|V=0)}}. \end{aligned} $$

Since

$${{\begin{aligned} & P(Y=y|V=v) = \sum_{x} P(Y=y|V=v,X=x)P(X=x|V=v)\\ & P(V=v) = \sum_{x} P(V=v|X=x)P(X=x)\\ & P(X=x|V=v) = {P(V=v|X=x)P(X=x)\over P(V=v)}\\ & \quad\;\; = {P(V=v|X=x)P(X=x)\over {\sum_{x} P(V=v|X=x)P(X=x)}}\\ \end{aligned}}} $$

we have

$${{\begin{aligned} P(X=0|V=0) &={P(V=0|X=0)P(X=0)\over {\sum_{x} P(V=0|X=x)P(X=x)}}\\ &= {{(1-\alpha_{0})(1-\pi)}\over{(1-\alpha_{0})(1-\pi)+(1-\alpha_{1})\pi}} = 1-\pi,\\ P(X=0|V=1) &={P(V=1|X=0)P(X=0)\over {\sum_{x} P(V=1|X=x)P(X=x)}} \\ &= {{\alpha_{0}(1-\pi)}\over{\alpha_{0}(1-\pi)+\alpha_{1}\pi}}=1-\pi,\\ P(X=1|V=0) &={P(V=0|X=1)P(X=1)\over {\sum_{x} P(V=0|X=x)P(X=x)}} \\&= {{(1-\alpha_{1})\pi}\over{(1-\alpha_{0})(1-\pi)+(1-\alpha_{1})\pi}}=\pi,\\ P(X=1|V=1) &={P(V=1|X=1)P(X=1)\over {\sum_{x} P(V=1|X=x)P(X=x)}} \\&= {{\alpha_{1}\pi}\over{\alpha_{0}(1-\pi)+\alpha_{1}\pi}}=\pi.\\ \end{aligned}}} $$

Since, for true VE, we have: α 0=α 1. We can get,

$${{\begin{aligned} P(Y&=2|V=1) = \sum_{x} P(Y=2|V=1,X=x)P(X=x|V=1)\\ &= P(Y=2|V=1,X=0)P(X=0|V=1)\\&\quad+P(Y=2|V=1,X=1)P(X=1|V=1)\\ &= \gamma_{10}(1-\pi)+ \gamma_{11}\pi \end{aligned}}} $$
$${{\begin{aligned} P(Y&=2|V=0) = \sum_{x} P(Y=2|V=0,X=x)P(X=x|V=0)\\ &= P(Y=2|V=0,X=0)P(X=0|V=0)\\&\quad+P(Y=2|V=0,X=1)P(X=1|V=0)\\ &= \gamma_{00}(1-\pi)+\gamma_{01}\pi \end{aligned}}} $$

So that,

$$\begin{array}{*{20}l} {RRT}_{SI} &= {{P(Y=2|V=1)}\over {P(Y=2|V=0)}} = {{\gamma_{10}(1-\pi)+\gamma_{11}\pi}\over{\gamma_{00}}(1-\pi)+\gamma_{01}\pi} \end{array} $$

Therefore,

$$\begin{array}{*{20}l}{} {VET}_{SI} = 1 - {RRT}_{SI} &= 1 - {{\gamma_{10}(1-\pi)+\gamma_{11}\pi}\over{\gamma_{00}}(1-\pi)+\gamma_{01}\pi} \quad \quad \text{Q.E.D.} \end{array} $$

The true VE against MAI is:

$$\begin{aligned} &{VET}_{MAI} = 1 - {RRT}_{MAI} \quad \text{where} \quad\\ &{RRT}_{MAI} = {{P(Y=2,M=1|V=1)}\over {P(Y=2,M=1|V=0)}}. \end{aligned} $$

Since

$${{\begin{aligned} &P(Y=2,M=1|V=v,X=x) = P(M=1|Y=2,\\& \quad\! V=v,X=x)*P(Y=2|V=v,X=x)\\ &P(Y=2,M=1|V=v) \\& \qquad\!\! = \sum_{x} P(Y=2,M=1|V=v,X=x)P(X=x|V=v) \end{aligned}}} $$

we can get,

$${{\begin{aligned} P(Y&=2,M=1|V=1) \\&= \sum_{x} P(Y=2,M=1|V=1,X=x)P(X=x|V=1)\\ &=\delta_{210}\gamma_{10}(1-\pi)+\delta_{211}\gamma_{11}\pi\\ P(Y&=2,M=1|V=0) \\&= \sum_{x} P(Y=2,M=1|V=0,X=x)P(X=x|V=0)\\ &=\delta_{200}\gamma_{00}(1-\pi)+\delta_{201}\gamma_{01}\pi\\ \end{aligned}}} $$

Therefore,

$$\begin{array}{*{20}l} {RRT}_{MAI}&= {{P(Y=2,M=1|V=1)}\over {P(Y=2,M=1|V=0)}}\\ &={{\delta_{210}\gamma_{10}(1-\pi)+\delta_{211}\gamma_{11}\pi}\over{\delta_{200}\gamma_{00}(1-\pi)+\delta_{201}\gamma_{01}\pi}}\\ &={{\delta_{SF}\Psi_{F}\gamma_{10}(1-\pi)+\delta_{SF}\Psi_{F}\lambda\gamma_{11}\pi}\over{\delta_{SF}\gamma_{00}(1-\pi)+\delta_{SF}\lambda\gamma_{01}\pi}}\\ &= {{\Psi_{F}[\gamma_{10}(1-\pi)+\lambda\gamma_{11}\pi]}\over{\gamma_{00}(1-\pi)+\lambda\gamma_{01}\pi}}\\ \end{array} $$

Hence,

$$\begin{array}{*{20}l} {VET}_{MAI} &= 1 - {RRT}_{MAI} \\ &= 1 - {{\Psi_{F}\left[\gamma_{10}(1-\pi)+\lambda\gamma_{11}\pi\right]}\over{\gamma_{00}(1-\pi)+\lambda\gamma_{01}\pi}}. \quad \quad \text{Q.E.D.} \end{array} $$

Appendix 2

Model-based estimates of VE

The model-based estimate from TN study is:

$${{\begin{aligned} {VE}_{A} &= 1 - {OR}_{A},\quad \text{where}\quad\\ {OR}_{A} &= {{P(C_{A}=1,V=1|M=1)P(C_{A}=0,V=0|M=1)}\over{P(C_{A}=1,V=0|M=1)P(C_{A}=0,V=1|M=1)}}. \end{aligned}}} $$

O R A can be written as:

$${} {OR}_{A} = {{P(M=1,T=1,V=1)P(M=1,T=0,V=0)}\over{P(M=1,T=1,V=0)P(M=1,T=0,V=1)}} $$
$${{\begin{aligned} P(M&=1,T=1,V=v) = \sum_{x} P(M=1,T=1,\\&\quad V=v|X=x)P(X=x)\\ &=\sum_{x} P(T=1|M=1,V=v,X=x)\\&\quad P(M=1,V=v|X=x)P(X=x)\\ &=\sum_{x} P(Y=2|M=1,V=v,X=x)\\&\quad P(M=1|V=v,X=x)P(V=v|X=x)P(X=x) \\ &= \sum_{x} P(M=1|Y=2,V=v,X=x)\\&\quad P(Y=2|V=v,X=x)P(V=v|X=x)P(X=x) \end{aligned}}} $$
$${{\begin{aligned} P(M&=1,T=0,V=v) = \sum_{x} P(M=1,T=0,\\&\quad V=v|X=x)P(X=x)\\ &=\sum_{x} P(T=0|M=1,V=v,X=x)\\&\quad P(M=1,V=v|X=x)P(X=x)\\ &=\sum_{x} P(Y=1|M=1,V=v,X=x)\\&\quad P(M=1|V=v,X=x)P(V=v|X=x)P(X=x) \\ &= \sum_{x} P(M=1|Y=1,V=v,X=x)\\&\quad P(Y=1|V=v,X=x)P(V=v|X=x)P(X=x) \end{aligned}}} $$

So that,

$$\begin{array}{*{20}l} P(M=1,T=1,V=0) &= \delta_{200}\gamma_{00}(1-\alpha_{0})(1-\pi) + \delta_{201}\gamma_{01}(1-\alpha_{1})\pi\\ &= \delta_{SF}\left[\gamma_{00}(1-\alpha_{0})(1-\pi) + \lambda\gamma_{01}(1-\alpha_{1})\pi\right]\\ P(M=1,T=1,V=1) &= \delta_{210}\gamma_{10}\alpha_{0}(1-\pi) + \delta_{211}\gamma_{11}\alpha_{1}\pi\\ &= \delta_{SF}\Psi_{F}[\gamma_{10}\alpha_{0}(1-\pi) + \lambda\gamma_{11}\alpha_{1}\pi]\\ P(M=1,T=0,V=0) &= \delta_{100}\beta_{00}(1-\alpha_{0})(1-\pi) + \delta_{101}\beta_{01}(1-\alpha_{1})\pi\\ &= \delta_{SN}[\beta_{00}(1-\alpha_{0})(1-\pi) + \lambda\beta_{01}(1-\alpha_{1})\pi]\\ P(M=1,T=0,V=1) &= \delta_{110}\beta_{10}\alpha_{0}(1-\pi) + \delta_{111}\beta_{11}\alpha_{1}\pi\\ &= \delta_{SN}[\beta_{10}\alpha_{0}(1-\pi) + \lambda\beta_{11}\alpha_{1}\pi]\\ \end{array} $$

Therefore,

$${VE}_{A} = 1 - {{\Psi_{F}\left[\gamma_{10}\alpha_{0}(1-\pi) + \lambda\gamma_{11}\alpha_{1}\pi\right]\left[\beta_{00}(1-\alpha_{0})(1-\pi) + \lambda\beta_{01}(1-\alpha_{1})\pi\right]}\over{\left[\gamma_{00}(1-\alpha_{0})(1-\pi) + \lambda\gamma_{01}(1-\alpha_{1})\pi\right]\left[\beta_{10}\alpha_{0}(1-\pi) + \lambda\beta_{11}\alpha_{1}\pi\right]}}. \quad \quad \text{Q.E.D.} $$

The model-based estimates from TCC study is:

$${} {{\begin{aligned} {VE}_{B} &= 1 - {OR}_{B},\quad \text{where}\quad \\ {OR}_{B} &= {{P(C_{B}=1,V=1|B=1)P(C_{B}=0,V=0|B=1)}\over{P(C_{B}=1,V=0|B=1)P(C_{B}=0,V=1|B=1)}}. \end{aligned}}} $$

O R B can be written as:

$${OR}_{B} = {{P(M=1,T=1,V=1)P(Y=0,V=0)}\over{P(M=1,T=1,V=0)P(Y=0,V=1)}}. $$

Since,

$${} {{\begin{aligned} P(Y=0,V=v) &= \sum_{x} P(Y=0,V=v|X=x)P(X=x) \\ &= \sum_{x} P(Y=0|V=v,X=x)\\&\qquad\;\;\; P(V=v|X=x)P(X=x) \end{aligned}}} $$

and P(Y=0|V=v,X=x)=1−P(Y=1|V=v,X=x)−P(Y=2|V=v,X=x)=1−γ vx β vx , so we have:

$$\begin{array}{*{20}l} P(Y=0,V=0) & = (1-\gamma_{00}-\beta_{00})(1-\alpha_{0})(1-\pi)\\&\quad +(1-\gamma_{01}-\beta_{01})(1-\alpha_{1})\pi\\ P(Y=0,V=1) & = (1-\gamma_{10}-\beta_{10})\alpha_{0}(1-\pi)\\&\quad +(1-\gamma_{11}-\beta_{11})\alpha_{1}\pi \end{array} $$

Therefore,

$${VE}_{B} = 1 - {{\Psi_{F}\left[\gamma_{10}\alpha_{0}(1-\pi) + \lambda\gamma_{11}\alpha_{1}\pi\right]\left[(1-\gamma_{00}-\beta_{00})(1-\alpha_{0})(1-\pi)+(1-\gamma_{01}-\beta_{01})(1-\alpha_{1})\pi\right]}\over{\left[\gamma_{00}(1-\alpha_{0})(1-\pi) + \lambda\gamma_{01}(1-\alpha_{1})\pi\right]\left[(1-\gamma_{10}-\beta_{10})\alpha_{0}(1-\pi)+(1-\gamma_{11}-\beta_{11})\alpha_{1}\pi\right]}}.\quad \quad \text{Q.E.D.} $$

Appendix 3

Standard errors of the VE estimates

For TN study, the approximate standard error of V E A is:

$$\begin{array}{*{20}l} SE({VE}_{A}) &= SE({OR}_{A}) \approx {OR}_{A}*SE(log({OR}_{A})) \\ &\approx {{p^{A}_{11}\left(1-p^{A}_{01}\right)}\over{p^{A}_{01}\left(1-p^{A}_{11}\right)}}\sqrt{{1\over{N^{A}}} \left[{1\over{P^{A}_{V1}P^{A}_{11}}}+{1\over{\left(1-P^{A}_{V1}\right)P^{A}_{01}}}+{1\over{P^{A}_{V1}}\left(1-P^{A}_{11}\right)}+{1\over{\left(1-P^{A}_{V1}\right)\left(1-P^{A}_{01}\right)}}\right]} \end{array} $$

where N A is the number of persons who were tested for influenza (M=1), i.e., the total sample size for the TN study. The probabilities (\(p^{A}_{V1}\), \(p^{A}_{01}\), \(p^{A}_{11}\)) can be written in terms of the parameters defined earlier.

Base on what we got earlier, we know

$$\begin{aligned} P(M&=1,V=1) = \sum_{t} P(M=1,V=1,T=t) \\ &= \delta_{110}\beta_{10}\alpha_{0}(1-\pi) + \delta_{111}\beta_{11}\alpha_{1}\pi + \delta_{210}\gamma_{10}\alpha_{0}(1-\pi) + \delta_{211}\gamma_{11}\alpha_{1}\pi\\ &=\alpha_{0}(1-\pi)\left[\delta_{SN}\beta_{10}+\delta_{SF}\Psi_{F}\gamma_{10}\right] + \alpha_{1}\pi\lambda \left[\delta_{SN}\beta_{11}+\delta_{SF}\Psi_{F}\gamma_{11}\right] \end{aligned} $$
$$\begin{aligned} P(M&=1,V=0) = \sum_{t} P(M=1,V=0,T=t) \\ &= \delta_{100}\beta_{00}(1-\alpha_{0})(1-\pi) + \delta_{101}\beta_{01}(1-\alpha_{1})\pi + \delta_{200}\gamma_{00}(1-\alpha_{0})(1-\pi) + \delta_{201}\gamma_{01}(1-\alpha_{1})\pi\\ &= (1-\alpha_{0})(1-\pi)\left[\delta_{SN}\beta_{00}+\delta_{SF}\gamma_{00}\right]+(1-\alpha_{1})\pi\lambda\left[\delta_{SN}\beta_{01}+\delta_{SF}\gamma_{01}\right] \end{aligned} $$

Thus,

$$\begin{array}{*{20}l} P(M&=1) = \sum_{v} P(M=1, V=v)\\ &= (1-\alpha_{0})(1-\pi)\left[\delta_{SN}\beta_{00}+\delta_{SF}\gamma_{00}\right]+(1-\alpha_{1})\pi\lambda \left[\delta_{SN}\beta_{01}+\delta_{SF}\gamma_{01}\right]\\ &\quad + \alpha_{0}(1-\pi) \left[\delta_{SN}\beta_{10}+\delta_{SF}\Psi_{F}\gamma_{10}\right] + \alpha_{1}\pi\lambda \left[\delta_{SN}\beta_{11}+\delta_{SF}\Psi_{F}\gamma_{11}\right] \\ &= (1-\pi) \left[(1-\alpha_{0})(\delta_{SN}\beta_{00}+\delta_{SF}\gamma_{00})+\alpha_{0}(\delta_{SN}\beta_{10}+\delta_{SF}\Psi_{F}\gamma_{10})\right] \\ & \quad + \pi\lambda \left[((1-\alpha_{1}))(\delta_{SN}\beta_{01}+\delta_{SF}\gamma_{01}) + \alpha_{1}(\delta_{SN}\beta_{11}+\delta_{SF}\Psi_{F}\gamma_{11})\right] \end{array} $$

Therefore, we have

$${\begin{aligned} P^{A}_{V1} &= P(V=1|M=1) = {{P(V=1,M=1)}\over{P(M=1)}}\\ &= {{\alpha_{0}(1-\pi)\left[\delta_{SN}\beta_{10}+\delta_{SF}\Psi_{F}\gamma_{10}\right] + \alpha_{1}\pi\lambda\left[\delta_{SN}\beta_{11}+\delta_{SF}\Psi_{F}\gamma_{11}\right]}\over{\tiny{(1-\pi)\left[(1-\alpha_{0})(\delta_{SN}\beta_{00}+\delta_{SF}\gamma_{00})\!+ \alpha_{0}(\delta_{SN}\beta_{10}+\delta_{SF}\Psi_{F}\gamma_{10})\right] + \pi\lambda \left[((1-\alpha_{1}))(\delta_{SN}\beta_{01}+\delta_{SF}\gamma_{01}) + \alpha_{1}(\delta_{SN}\beta_{11}+\delta_{SF}\Psi_{F}\gamma_{11})\right]}}} \end{aligned}} $$
$$\begin{array}{*{20}l} p^{A}_{01} &= {{P(M=1,T=1|V=0)}\over{P(M=1|V=0)}} = {{P(M=1,T=1,V=0)}\over{P(M=1,V=0)}} \\ & = {{\delta_{200}\gamma_{00}(1-\alpha_{0})(1-\pi) + \delta_{201}\gamma_{01}(1-\alpha_{1})\pi}\over{\delta_{100}\beta_{00}(1-\alpha_{0})(1-\pi) + \delta_{101}\beta_{01}(1-\alpha_{1})\pi + \delta_{200}\gamma_{00}(1-\alpha_{0})(1-\pi) + \delta_{201}\gamma_{01}(1-\alpha_{1})\pi}}\\ &={{\delta_{SF} \left[\gamma_{00}(1-\alpha_{0})(1-\pi) + \lambda\gamma_{01}(1-\alpha_{1})\pi\right]}\over{(1-\alpha_{0})(1-\pi) \left[\delta_{SN}\beta_{00}+\delta_{SF}\gamma_{00}\right]+(1-\alpha_{1})\pi\lambda \left[\delta_{SN}\beta_{01}+\delta_{SF}\gamma_{01}\right]}} \end{array} $$
$$\begin{array}{*{20}l} P^{A}_{11} &= {{P(M=1,T=1|V=1)}\over{P(M=1|V=1)}} = {{P(M=1,T=1,V=1)}\over{P(M=1,V=1)}}\\ &= {{\delta_{210}\gamma_{10}\alpha_{0}(1-\pi) + \delta_{211}\gamma_{11}\alpha_{1}\pi}\over{\delta_{110}\beta_{10}\alpha_{0}(1-\pi) + \delta_{111}\beta_{11}\alpha_{1}\pi + \delta_{210}\gamma_{10}\alpha_{0}(1-\pi) + \delta_{211}\gamma_{11}\alpha_{1}\pi}} \\ &= {{\delta_{SF}\Psi_{F} \left[\gamma_{10}\alpha_{0}(1-\pi) + \lambda\gamma_{11}\alpha_{1}\pi\right]}\over{\alpha_{0}(1-\pi)\left[\delta_{SN}\beta_{10}+\delta_{SF}\Psi_{F}\gamma_{10}\right] + \alpha_{1}\pi\lambda \left[\delta_{SN}\beta_{11}+\delta_{SF}\Psi_{F}\gamma_{11}\right]}} \end{array} $$

In the TCC study, the approximate standard error of V E B is:

$$\begin{array}{*{20}l} SE({VE}_{B}) &= SE({OR}_{B}) \approx {OR}_{B}*SE(log({OR}_{B})) \\ &\approx {{p^{B}_{11}\left(1-p^{B}_{10}\right)}\over{p^{B}_{10}\left(1-p^{B}_{11}\right)}} \sqrt{{1\over{N^{B}_{C1}p^{B}_{11}}}+{1\over{N^{B}_{C1}\left(1-p^{B}_{11}\right)}}+{1\over{N^{B}_{C0}}p^{B}_{10}}+{1\over{N^{B}_{C0}\left(1-p^{B}_{10}\right)}}} \end{array} $$

where \(N^{b}_{C1}\) is the number of cases and \(N^{b}_{C0}\) is the number of controls. The probabilities \(\left (p^{B}_{10},p^{B}_{11}\right)\) can be written in terms of the parameters defined earlier:

$${\begin{aligned} p^{B}_{10} &= P(V=1|C_{B}=0, B=1) = {{P(Y=0,V=1)}\over{P(Y=0)}} = {{P(Y=0,V=1)}\over{\sum_{v} P(Y=0,V=v)}}\\ & = {{(1-\gamma_{10}-\beta_{10})\alpha_{0}(1-\pi)+(1-\gamma_{11}-\beta_{11})\alpha_{1}\pi}\over{(1-\pi)[(1-\gamma_{10}-\beta_{10})\alpha_{0} +(1-\gamma_{00}-\beta_{00})(1-\alpha_{0})]+\pi[(1-\gamma_{11}-\beta_{11})\alpha_{1}+(1-\gamma_{01}-\beta_{01})(1-\alpha_{1})]}} \end{aligned}} $$
$${\begin{aligned} p^{B}_{11} &= P(V=1|C_{B}=1, B=1) = {{P(M=1,T=1,V=1)}\over{P(M=1,T=1)}} = {{P(M=1,T=1,V=1)}\over{\sum_{v}P(M=1,T=1,V=v)}}\\ &= {{\delta_{210}\gamma_{10}\alpha_{0}(1-\pi)+\delta_{211}\gamma_{11}\alpha_{1}\pi}\over{\delta_{200}\gamma_{00}(1-\alpha_{0})(1-\pi) + \delta_{201}\gamma_{01}(1-\alpha_{1})\pi + \delta_{210}\gamma_{10}\alpha_{0}(1-\pi) + \delta_{211}\gamma_{11}\alpha_{1}\pi}} \\ &= {{\Psi_{F} \left[\gamma_{10}\alpha_{0}(1-\pi) + \lambda\gamma_{11}\alpha_{1}\pi\right]}\over{\left[\gamma_{00}(1-\alpha_{0})(1-\pi) + \lambda\gamma_{01}(1-\alpha_{1})\pi\right]+\Psi_{F}\left[\gamma_{10}\alpha_{0}(1-\pi) + \lambda\gamma_{11}\alpha_{1}\pi\right]}} \end{aligned}} $$

Appendix 4

Unbiasness under random and non-random vaccination

Unbiasness under random vaccination

If the vaccination is done at random, then α 0=α 1. The VE estimates can be written as:

$$\begin{array}{*{20}l} {VE}_{A} = 1 - {{\Psi_{F}\left[\gamma_{10}(1-\pi) + \lambda\gamma_{11}\pi\right] \left[\beta_{00}(1-\pi) + \lambda\beta_{01}\pi\right]}\over{\left[\gamma_{00}(1-\pi) + \lambda\gamma_{01}\pi\right]\left[\beta_{10}(1-\pi) + \lambda\beta_{11}\pi\right]}} \end{array} $$
$${VE}_{B} = 1 - {{\Psi_{F}[\gamma_{10}(1-\pi) + \lambda\gamma_{11}\pi][(1-\gamma_{00}-\beta_{00})(1-\pi)+(1-\gamma_{01}-\beta_{01})\pi]}\over{[\gamma_{00}(1-\pi) + \lambda\gamma_{01}\pi][(1-\gamma_{10}-\beta_{10})(1-\pi)+(1-\gamma_{11}-\beta_{11})\pi]}} $$

(1) If ρ β =Ψ F = 1, and one of the following conditions is satisfied, then V E A =V E T SI .

  1. (a)

    λ=1;

  2. (b)

    η γ =1.

Proof

Since \(\rho _{\beta _{x}} = \Psi _{F}\) = 1, then \({{\beta _{10}}\over {\beta _{00}}} = {{\beta _{11}}\over {\beta _{01}}} = \Psi _{F}\) = 1. We have:

$$\beta_{10} = \beta_{00} \quad \text{and}\quad \beta_{11} = \beta_{01}. $$

So,

$$\begin{array}{*{20}l} {VE}_{A} &= 1 - {{\Psi_{F}[\gamma_{10}(1-\pi) + \lambda\gamma_{11}\pi][\beta_{00}(1-\pi) + \lambda\beta_{01}\pi]}\over{[\gamma_{00}(1-\pi) + \lambda\gamma_{01}\pi][\beta_{00}(1-\pi) + \lambda\beta_{01}\pi]}}\\ &= 1 - {{\gamma_{10}(1-\pi) + \lambda\gamma_{11}\pi}\over{\gamma_{00}(1-\pi) + \lambda\gamma_{01}\pi}} \end{array} $$

If (a) λ=1 is satisfied, so

$${VE}_{A} = 1 - {{\gamma_{10}(1-\pi) + \gamma_{11}\pi}\over{\gamma_{00}(1-\pi) + \gamma_{01}\pi}} = {VET}_{SI}. $$

If (b) η γ =1 is satisfied, then γ 01=γ 00 and γ 11=γ 10. Thus,

$$\begin{array}{*{20}l} {VET}_{SI} = 1 - {{\gamma_{10}(1-\pi) + \gamma_{11}\pi}\over{\gamma_{00}(1-\pi) + \gamma_{01}\pi}} = 1 - {{\gamma_{11}}\over{\gamma_{01}}}. \end{array} $$

Hence,

$$\begin{array}{*{20}l} {VE}_{A} &= 1 - {{\gamma_{10}(1-\pi) + \lambda\gamma_{11}\pi}\over{\gamma_{00}(1-\pi) + \lambda\gamma_{01}\pi}} = 1 - {{\gamma_{11}(1-\pi) + \lambda\gamma_{11}\pi}\over{\gamma_{01}(1-\pi) + \lambda\gamma_{01}\pi}} = 1 - {{\gamma_{11}[(1-\pi) + \lambda\pi]}\over{\gamma_{01}[(1-\pi) + \lambda\pi]}} \\ &= 1 - {{\gamma_{11}}\over{\gamma_{01}}} = {VET}_{SI} \quad \quad \text{Q.E.D.} \end{array} $$

(2) If ρ β =1, then V E A =V E T MAI

Proof

Since ρ β =1, then \({{\beta _{10}}\over {\beta _{00}}} = {{\beta _{11}}\over {\beta _{01}}} = 1\).

So,

$$\begin{array}{*{20}l} {VE}_{A} &= 1 - {{\Psi_{F}\left[\gamma_{10}(1-\pi) + \lambda\gamma_{11}\pi\right]\left[\beta_{00}(1-\pi) + \lambda\beta_{01}\pi\right]}\over{\left[\gamma_{00}(1-\pi) + \lambda\gamma_{01}\pi\right]\left[\beta_{00}(1-\pi) + \lambda\beta_{01}\pi\right]}}\\ &= 1 - {{\Psi_{F}\left[\gamma_{10}(1-\pi) + \lambda\gamma_{11}\pi\right]}\over{\gamma_{00}(1-\pi) + \lambda\gamma_{01}\pi}}\\ &= {VET}_{MAI} \quad \quad \text{Q.E.D.} \end{array} $$

(3) If λ=1 and 1−γ 1x β 1x =Ψ F (1−γ 0x β 0x ), where x=0,1, then V E B =V E T SI

Proof

Since 1−γ 1x β 1x =Ψ F (1−γ 0x β 0x ), where x=0,1, and λ=1, then

$$\begin{array}{*{20}l} {VE}_{B} &= 1 - {{\Psi_{F}\left[\gamma_{10}(1-\pi) + \lambda\gamma_{11}\pi\right]\left[(1-\gamma_{00}-\beta_{00})(1-\pi)+(1-\gamma_{01}-\beta_{01})\pi\right]}\over{\left[\gamma_{00}(1-\pi) + \lambda\gamma_{01}\pi\right]\left[\Psi_{F}(1-\gamma_{00}-\beta_{00})(1-\pi)+\Psi_{F}(1-\gamma_{01}-\beta_{01})\pi\right]}} \\ &= 1 - {{\gamma_{10}(1-\pi) + \lambda\gamma_{11}\pi}\over{\gamma_{00}(1-\pi) + \lambda\gamma_{01}\pi}}\\ &= 1 - {{\gamma_{10}(1-\pi) + \gamma_{11}\pi}\over{\gamma_{00}(1-\pi) + \gamma_{01}\pi}}\\ &= {VET}_{SI} \quad \quad \text{Q.E.D.} \end{array} $$

(4) If γ 1x +β 1x =γ 0x +β 0x , where x=0,1, then V E B =V E T MAI

Proof

Since γ 1x +β 1x =γ 0x +β 0x , x=0,1, then

$$ 1 - \gamma_{1x} - \beta_{1x} = 1 - \gamma_{0x} - \beta_{0x}, \quad \text{where }x = 0, 1 $$

So:

$$\begin{array}{*{20}l} {VE}_{B} &= 1 - {{\Psi_{F}\left[\gamma_{10}(1-\pi) + \lambda\gamma_{11}\pi\right]\left[(1-\gamma_{00}-\beta_{00})(1-\pi)+(1-\gamma_{01}-\beta_{01})\pi\right]}\over{\left[\gamma_{00}(1-\pi) + \lambda\gamma_{01}\pi\right]\left[(1-\gamma_{00}-\beta_{00})(1-\pi)+(1-\gamma_{01}-\beta_{01})\pi\right]}} \\ &= 1 - {{\Psi_{F}\left[\gamma_{10}(1-\pi) + \lambda\gamma_{11}\pi\right]}\over{\gamma_{00}(1-\pi) + \lambda\gamma_{01}\pi}}\\ &= {VET}_{MAI} \quad \quad \text{Q.E.D.} \end{array} $$

Unbiasness under non-random vaccination

If the vaccination is not done at random, then α 0α 1.

(5) If ρ β =η β =η γ =Ψ F =1, then V E A =V E T SI .

Proof

Since ρ β =η β =η γ =1, then β 00=β 10=β 01=β 11=Δ β, γ 01=γ 00 and γ 11=γ 10. Thus,

$$ {VET}_{SI} = 1 - {{\gamma_{10}(1-\pi) + \gamma_{11}\pi}\over{\gamma_{00}(1-\pi) + \gamma_{01}\pi}} = 1 - {{\gamma_{11}}\over{\gamma_{01}}}. $$

So,

$$\begin{array}{*{20}l} {VE}_{A} &= 1 - {{\Psi_{F}[\gamma_{10}\alpha_{0}(1-\pi) + \lambda\gamma_{11}\alpha_{1}\pi][\beta_{00}(1-\alpha_{0})(1-\pi) + \lambda\beta_{01}(1-\alpha_{1})\pi]}\over{[\gamma_{00}(1-\alpha_{0})(1-\pi) + \lambda\gamma_{01}(1-\alpha_{1})\pi][\beta_{10}\alpha_{0}(1-\pi) + \lambda\beta_{11}\alpha_{1}\pi]}} \\ &= 1 - {{[\gamma_{11}\alpha_{0}(1-\pi) + \lambda\gamma_{11}\alpha_{1}\pi][(1-\alpha_{0})(1-\pi) + \lambda(1-\alpha_{1})\pi]}\over{[\gamma_{01}(1-\alpha_{0})(1-\pi) + \lambda\gamma_{01}(1-\alpha_{1})\pi][\alpha_{0}(1-\pi) + \lambda\alpha_{1}\pi]}}\\ &= 1 - {{\gamma_{11}[\alpha_{0}(1-\pi) + \lambda\alpha_{1}\pi][(1-\alpha_{0})(1-\pi) + \lambda(1-\alpha_{1})\pi]}\over{\gamma_{01}[(1-\alpha_{0})(1-\pi) + \lambda(1-\alpha_{1})\pi][\alpha_{0}(1-\pi) + \lambda\alpha_{1}\pi]}} \\ &= 1 - {{\gamma_{11}}\over{\gamma_{01}}} = {VET}_{SI} \quad \quad \text{Q.E.D.} \end{array} $$

(6) If ρ β =1 and η β =η γ , then V E A =V E T MAI . □

Proof

Since η β =η γ , so \({{\beta _{01}}\over {\beta _{00}}} = {{\beta _{11}}\over {\beta _{10}}} = {{\gamma _{01}}\over {\gamma _{00}}} = {{\gamma _{11}}\over {\gamma _{10}}}\). Then we have: \({\gamma _{11}\over {\beta _{11}}} = {\gamma _{10}\over {\beta _{10}}} \stackrel {\Delta }{=} a, {\gamma _{00}\over {\beta _{00}}} = {\gamma _{01}\over {\beta _{01}}} \stackrel {\Delta }{=} b\), and \({{\beta _{10}}\over {\beta _{00}}} = {{\beta _{11}}\over {\beta _{01}}} = 1\). Then:

$$\begin{array}{*{20}l} {VE}_{A} &= 1 - {{\Psi_{F}\left[\gamma_{10}\alpha_{0}(1-\pi) + \lambda\gamma_{11}\alpha_{1}\pi\right]\left[\beta_{00}(1-\alpha_{0})(1-\pi) + \lambda\beta_{01}(1-\alpha_{1})\pi\right]}\over{\left[\gamma_{00}(1-\alpha_{0})(1-\pi) + \lambda\gamma_{01}(1-\alpha_{1})\pi\right]\left[\beta_{10}\alpha_{0}(1-\pi) + \lambda\beta_{11}\alpha_{1}\pi\right]}}\\ &= 1 - {{\Psi_{F}\left[a\beta_{10}\alpha_{0}(1-\pi) + a\lambda\beta_{11}\alpha_{1}\pi\right]\left[\beta_{00}(1-\alpha_{0})(1-\pi) + \lambda\beta_{01}(1-\alpha_{1})\pi\right]}\over{\left[b\beta_{00}(1-\alpha_{0})(1-\pi) + b\lambda\beta_{01}(1-\alpha_{1})\pi\right]\left[\beta_{10}\alpha_{0}(1-\pi) + \lambda\beta_{11}\alpha_{1}\pi\right]}} \\ &= 1 - \Psi_{F} \cdot {a\over b} \end{array} $$

and,

$$\begin{array}{*{20}l} {VET}_{MAI} &= 1 - {{\Psi_{F} \left[\gamma_{10}(1-\pi) + \lambda\gamma_{11}\pi\right]}\over{\gamma_{00}(1-\pi) + \lambda\gamma_{01}\pi}} = 1 - {{\Psi_{F}\left[a\beta_{10}(1-\pi) + a\lambda\beta_{11}\pi\right]}\over{b\beta_{00}(1-\pi) + b\lambda\beta_{01}\pi}}\\ &= 1 - \Psi_{F} \cdot {a\over b} \end{array} $$

So, V E A =V E T MAI . □

Abbreviations

ARI:

Acute respiratory illness

FARI:

Influenza ARI

MAI:

Medically-attended influenza

NFARI:

Non-influenza ARI

OR:

Odds ratio

SE:

Standard error

SI:

Symptomatic influenza

TCC:

Traditional case-control

TN:

Test-negative

VE:

Vaccine effectiveness

References

  1. Skowronski DM, Masaro C, Kwindt TL, Mak A, Petric M, Li Y, Sebastian R, Chong M, Tam T, De Serres G. Estimating vaccine effectiveness against laboratory-confirmed influenza using a sentinel physician network: results from the 2005-2006 season of dual a and b vaccine mismatch in canada. Vaccine. 2007; 25(15):2842–51. doi:10.1016/j.vaccine.2006.10.002.

    Article  CAS  PubMed  Google Scholar 

  2. Belongia EA, Kieke BA, Donahue JG, Greenlee RT, Balish A, Foust A, Lindstrom S, Shay DK, Marshfield Influenza Study Group. Effectiveness of inactivated influenza vaccines varied substantially with antigenic match from the 2004-2005 season to the 2006-2007 season. J Infect Dis. 2009; 199(2):159–67. doi:10.1086/595861.

    Article  PubMed  Google Scholar 

  3. Choi WS, Noh JY, Seo YB, Baek JH, Lee J, Song JY, Park DW, Lee JS, Cheong HJ, Kim WJ. Case-control study of the effectiveness of the 2010-2011 seasonal influenza vaccine for prevention of laboratory-confirmed influenza virus infection in the korean adult population. Clin Vaccine Immunol. 2013; 20(6):877–1. doi:10.1128/CVI.00009-13.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  4. Havers F, Sokolow L, Shay DK, Farley MM, Monroe M, Meek J, Daily Kirley P, Bennett NM, Morin C, Aragon D, Thomas A, Schaffner W, Zansky SM, Baumbach J, Ferdinands J, Fry AM. Case-control study of vaccine effectiveness in preventing laboratory-confirmed influenza hospitalizations in older adults, united states, 2010-2011. Clin Infect Dis. 2016; 63(10):1304–11. doi:10.1093/cid/ciw512.

    Article  PubMed  Google Scholar 

  5. De Serres G, Skowronski DM, Wu XW, Ambrose CS. The test-negative design: validity, accuracy and precision of vaccine efficacy estimates compared to the gold standard of randomised placebo-controlled clinical trials. Euro Surveill. 2013; 18(37). https://0-doi-org.brum.beds.ac.uk/10.2807/1560-7917.ES2013.18.37.20585.

  6. Cowling BJ, Fang VJ, Nishiura H, Chan KH, Ng S, Ip DKM, Chiu SS, Leung GM, Peiris JSM. Increased risk of noninfluenza respiratory virus infections associated with receipt of inactivated influenza vaccine. Clin Infect Dis. 2012; 54(12):1778–83. doi:10.1093/cid/cis307.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Castilla J, Godoy P, Dominguez A, Martinez-Baz I, Astray J, Martin V, Delgado-Rodriguez M, Baricot M, Soldevila N, Maria Mayoral J, Maria Quintana J, Carlos Galan J, Castro A, Gonzalez-Candelas F, Garin O, Saez M, Tamames S, Pumarola T, Influenza CCC. Influenza vaccine effectiveness in preventing outpatient, inpatient, and severe cases of laboratory-confirmed influenza. Clin Infect Dis. 2013; 57(2):167–75. doi:10.1093/cid/cit194.

    Article  CAS  PubMed  Google Scholar 

  8. VanWormer JJ, Sundaram ME, Meece JK, Belongia EA. A cross-sectional analysis of symptom severity in adults with influenza and other acute respiratory illness in the outpatient setting. BMC Infect Dis. 2014; 14:231. doi:10.1186/1471-2334-14-231.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Deiss RG, Arnold JC, Chen WJ, Echols S, Fairchok MP, Schofield C, Danaher PJ, McDonough E, Ridoré M, Mor D, Burgess TH, Millar EV. Vaccine-associated reduction in symptom severity among patients with influenza a/h3n2 disease. Vaccine. 2015; 33(51):7160–7. doi:10.1016/j.vaccine.2015.11.004.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Haber M, Longini Jr IM, Halloran ME. Measures of the effects of vaccination in a randomly mixing population. Int J Epidemiol. 1991; 20(1):300–10.

    Article  CAS  PubMed  Google Scholar 

  11. Sullivan SG, Tchetgen Tchetgen EJ, Cowling BJ. Theoretical basis of the test-negative study design for assessment of influenza vaccine effectiveness. Am J Epidemiol. 2016; 184(5):345–53. doi:10.1093/aje/kww064.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Lipsitch M, Jha A, Simonsen L. Observational studies and the difficult quest for causality: lessons from vaccine effectiveness and impact studies. Int J Epidemiol. 2016. doi:10.1093/ije/dyw124.

  13. Agresti A. Categorical Data Analysis, 3rd ed. Wiley series in probability and statistics, vol. 792. Hoboken, NJ: Wiley; 2013.

    Google Scholar 

  14. Haber M, An Q, Foppa IM, Shay DK, Ferdinands JM, Orenstein WA. A probability model for evaluating the bias and precision of influenza vaccine effectiveness estimates from case-control studies. Epidemiol Infect. 2015; 143(7):1417–26. doi:10.1017/S0950268814002179.

    Article  CAS  PubMed  Google Scholar 

  15. Orenstein EW, De Serres G, Haber MJ, Shay DK, Bridges CB, Gargiullo P, Orenstein WA. Methodologic issues regarding the use of three observational study designs to assess influenza vaccine effectiveness. Int J Epidemiol. 2007; 36(3):623–31. doi:10.1093/ije/dym021.

    Article  PubMed  Google Scholar 

  16. Ferdinands JM, Shay DK. Magnitude of potential biases in a simulated case-control study of the effectiveness of influenza vaccination. Clin Infect Dis. 2012; 54(1):25–32. doi:10.1093/cid/cir750.

    Article  CAS  PubMed  Google Scholar 

  17. Foppa IM, Haber M, Ferdinands JM, Shay DK. The case test-negative design for studies of the effectiveness of influenza vaccine. Vaccine. 2013; 31(30):3104–9. doi:10.1016/j.vaccine.2013.04.026.

    Article  PubMed  Google Scholar 

  18. Jackson ML, Nelson JC. The test-negative design for estimating influenza vaccine effectiveness. Vaccine. 2013; 31(17):2165–8. doi:10.1016/j.vaccine.2013.02.053.

    Article  PubMed  Google Scholar 

  19. Talbot HK, Nian H, Chen Q, Zhu Y, Edwards KM, Griffin MR. Evaluating the case-positive, control test-negative study design for influenza vaccine effectiveness for the frailty bias. Vaccine. 2016; 34(15):1806–9. doi:10.1016/j.vaccine.2016.02.037.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgement

The authors wish to thank Dr. Ivo Foppa and two reviewers for helpful comment.

Funding

This research was supported by the National Institute of Allergies and Infectious Diseases of the National Institutes of Health (NIH) under Award R01AI110474, and by IPA 1110376-05 with the Centers for Disease Controls and Prevention (CDC). The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH or the CDC.

Availability of data and materials

Not applicable. No data or other materials are used in this work.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Michael Haber.

Ethics declarations

Ethics approval and consent to participate

Not applicable. No data on human subjects are used in this work.

Consent for publication

All the authors have read and approved the manuscript and agreed to be included in the publication.

Competing interests

The authors declares that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional information

Authors’ contribution

MS, QA and MH developed the model and wrote the manuscript. WO and KA made significant revisions. All authors have read the manuscript and agreed to its content.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Shi, M., An, Q., Ainslie, K. et al. A comparison of the test-negative and the traditional case-control study designs for estimation of influenza vaccine effectiveness under nonrandom vaccination. BMC Infect Dis 17, 757 (2017). https://0-doi-org.brum.beds.ac.uk/10.1186/s12879-017-2838-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://0-doi-org.brum.beds.ac.uk/10.1186/s12879-017-2838-2

Keywords