Monday, August 24, 2020

 

Heterogeneity Lowers Herd Immunity, but Overshoot is still there

In a recent article, Apoorva Mandavalli [1] asks if we are closer to herd immunity than previously believed.  She discussed three papers that discuss lower herd immunity thresholds (HIT) due to heterogeneity in the population. All three papers use SEIR models, but with varying approaches. The papers by Lourenco et al [2] and by Gomes et al [3] are less easy to use, but the paper by Britton et al [4] is more understandable. Mandavalli [1] also discussed these papers with other scientists to get an idea of whether a new scientific consensus is emerging.

The reason why this question is pressing is that we all want to know: when will it be over? Conventional HIT values are given in the 60-70% range. These authors argue that it could be lower. Indeed Britton’s value of 43% (higher than the other two) may have informed the Swedish ‘hands off’ attitude of ‘voluntary compliance’ in the last few months, along with the attitude that everybody is going to get infected sooner or later, no matter what we do. In this post, as before, I want to calculate the cutoff percentage Xco at which the epidemic actually stops [5], which is higher than the HIT, which is merely the point at which the number of infectious cases peaks, and herd immunity effects start to kick in. These papers have been widely discussed by several authors [6- 14], and various points that they have raised will be discussed later.

Lourenco et al [2] state that the standard formula for the herd immunity threshold (HIT):

HIT = 1 – (1/R0)

is modified if a fraction r of the population is resistant to infection, becoming:

HIT = (1 - r){ 1 – (1/R0)f} where:

f = 1/[1 – (c1/c2){r/(1 - r)}(1 - d)]

where c1 and c2 are the contact rates for the two sub-population groups, and the interaction matrix (of the two sub-populations) d. The parameter d lies between 0 and 1.

This equation for HIT suggests that a wide variation in HIT will occur depending upon: (i) the proportion that is resistant to infection (ii) the R0 within the non-resistant group, and (iii) the degree of mixing between the two groups.

a)      When d = 1, there is no mixing between the two sub-populations, it is assortative within the groups;

b)      Random (or proportionate) mixing occurs when d = r; HIT reduces to the expression:

HIT = 1 – (1/R0)  r.

c)       when d = 0, there is maximal mixing between the two subgroups.

 

What is not clear is: what is the difference between (b) and (c), between random and maximal mixing?

The proportion r of the sub-population 1 being resistant to infection means R0,1 = 0. The group specific R0,I  = bi ci/sI is the R0 of the ith group, or the fundamental transmission potential of the virus within a homogeneous population consisting of members of that group. The rates of loss of infection and immunity are given by respectively by s and g.

a)      In the no-mixing (fully assortative) case (when d = 1), f =1, and the herd immunity threshold reduces to: HIT = (1 - r) (1 – 1/R0). That is,   for this case, HIT declines in proportion to the size of the resistant group.

However, this is not very clear: if there is no mixing between groups 1 and 2, why should HIT for group 2 be independent of r, and given by the standard expression for HIT? Especially since the standard HIT does not involve the size of the population?

 

b)      But for random mixing (c1 = c2 Þ d = r):  z* = 1 – (1/R0) - r. That is, the virus will not spread if r ³ 1 – (1/R0). This is the standard case, because there is no difference between sub-groups 1 and 2 (z is the sum of the numbers of infected and recovered persons).

 

If c1 ¹ c2, random mixing occurs when d = r c1/[r c1 + (1 - r c2)]

 

c)       Maximal mixing case (d = 0): the authors do not pursue this case. Also, it is not clear: what is the difference between random mixing and maximum mixing?

 

The authors conclude:

The drop in HIT is proportional to the fraction of the population resistant only when that fraction is effectively segregated from the general population; however, when mixing is random, the drop in HIT is more precipitous.

 

In addition, they also add that their results are similar to those of Gomes et al [3] but that their values are lower than those obtained by Britton [4].

I have a couple of doubts about the paper by Lourenco…Gupta [2] which I have mentioned above. However, since I have not derived the equation used by that group, it may not be fair to comment.

Gomes [3] argues that “individual variation in susceptibility or exposure (connectivity)

accelerates the acquisition of immunity in populations due to selection by the force of infection.”

The paper of Gomes et al [3], using a SEIR model, assumes variable susceptibility and exposure, and both susceptibility and exposure (connectivity) are given by gamma distributions. The gamma distribution is fitted to experimental data for 11 countries, but one fitting parameter is the coefficient of variation, CV, defined as the ratio of the standard deviation to the mean. (The infectiousness of exposed individuals is assumed as half of that of exposed individuals; the incubation period is 4 days and the period of infectiousness is assumed as 4 days).  Gomes et al use values of CV ranging from 0 to 3, and plots the herd immunity threshold HIT vs variable CV. For CV = 3, they get values as low as 10%. We will return to Gomes later. The results of Gomes will be discussed later.

Let’s now look at Britton et al [4].

 

Traditionally:

Table 1:

 

R0

h0 = 1 - (1/R0)

xco

2

0.5

0.8

2.5

0.6

0.89

3

0.667

0.94

 

H0 is the herd immunity threshold, xco is the population cutoff fraction at which infection stops (Bastin [5])

R0 = - [ln(1-x)]/x

Britton takes into account age structure & variable social activity to obtain a value haa. Specifically, the population is divided into 6 different age groups (0-5, 6-12, 13-19,20-39, 40-59 and 60+ years) and three different activity levels (normal, doubled and half).

Table 2:

R0

haa = 1 - (1/Re)

Re

2

0.346

1.53

2.5

0.43

1.75

3

0.491

1.96

 

The equation haa = 1 - (1/Re) is an assumption to create an effective R-number Re for the situation of age structure & variable social activity. It may not be valid (but it turns out OK!).

Fitting the data of Re vs R0 yields a linear plot:

Re = 0.672 + 0.43R0

We can use this to extrapolate to values of R0 other than those calculated by Britton.

Re = ln(s¥)/[ 1 – s¥]

Varying R0 one can generate a plot of s¥ vs R0 accounting for age structure & variable transmissibility:

Table 3:

R0

haa = 1 - (1/Re)

Re

Xco

1.5

0.241

1.32

0.43

2

0.346

1.53

0.60

2.5

0.43

1.75

0.71

3

0.491

1.96

0.78

3.5

0.541

2.18

0.84

4

0.582

2.39

0.88

 

We are also assuming that the equation above for calculating the population cutoff Xco remains valid when we plug into it the effective R-value, Re.

The extrapolation of Re vs R0 to values other than those calculated by Britton (R0 = 2, 2.5 &3) is relatively more reasonable, since the plot seems to be linear. But it is an assumption, to be sure.

 

Xco takes into account overshoot along with age and variable activity. This is the fraction at which the epidemic stops.


Fig.1: age & activity [4]

The lower curve is the herd immunity threshold as calculated by Britton, for the case of age structure and activity structure. The upper curve is the population fraction at which the epidemic dies out completely and no more infections occur.

Britton has two more columns, in which he accounts for only age structure and only for variable activity. Using similar logic for the social activity – which Britton says is more important:

Re = 0.603 + 0.50R0

Table 4:

 

R0

haa = 1 - (1/Re)

Re

Xco

1.5

0.261

1.35

0.46

2

0.376

1.60

0.64

2.5

0.460

1.85

0.75

3

0.524

2.10

0.82

3.5

0.575

2.35

0.87

4

0.616

2.60

0.90



This plot looks almost the same as the previous one – but it is displaced slightly upwards.

 

For the case of age-structure only:

Re = 0.21 + 0.82R0

Table 5:

 

R0

haa = 1 - (1/Re)

Re

Xco

1.5

0.305

1.44

0.54

2

0.459

1.85

0.75

2.5

0.558

2.26

0.85

3

0.625

2.67

1.00

3.5

0.675

3.08

1.00

4

0.713

3.49

1.00

 

 

 

Fig.3: age structure only [4]

 

This plot, for age structure only, is clearly pushed upwards – especially for R0 of 3 and above – where it hits 100% of the population.

Of course, as R0 increases, the percentage asymptotically increases towards 100% and does not actually reach a hundred – the above results are due to numerical inaccuracies.

Britton says that the activity structure is dominant and the age structure does not play so much of a role (roughly activity is 3X as important as age in his calculations).

The data given by Lourenco et al [2] can be handled similarly. However, the values of HIT – as pointed out earlier – are much lower:

This table (1st 2 columns from Table 1 in Ref.2) is for assortative mixing (i.e. no mixing of groups 1 & 2):

Table 6:

 

R0

heff = 1 - (1/Re)

Re

Xco

1.5

0.16

1.19

0.30

2

0.25

1.33

0.45

2.5

0.30

1.43

0.53

3

0.33

1.49

0.57

 

Similarly, for proportionate mixing:

Table 7:

 

R0

heff = 1 - (1/Re)

Re

Xco

1.5

0

1

0

2

0

1

0

2.5

0.10

1.11

0.19

3

0.16

1.19

0.30

 


Fig.4: assortative and proportionate mixing [2]

The values for heff and Xco are shown in Fig.4 for both assortative and proportionate mixing.

 

Gomes et al [2] considered variable susceptibility and variable connectivity (separately) – but also mentioned the ‘final size of the uncontrolled epidemic’ in their Fig.3. This is listed in the 3rd column 9in the two Tables below) and compares pretty well with the value of Xco (in the 5th column in the two Tables below) as calculated from Re (obtained as above from h).

Gomes et al [2] has only considered R0 = 3 but with variable susceptibility CV (as mentioned above):

Table 8:

 

CV

heff = 1 - (1/Re)

(from Gomes)

Final % of epidemic (from Gomes)

Re

Xco

0

0.65

0.94

2.86

0.93

1

0.43

0.67

1.75

0.71

2

0.2

0.33

1.25

0.37

3

0.1

0.17

1.11

0.19

4

0.07

0.11

1.08

0.14

 

This is shown graphically:


Fig.5: variable susceptibility [3]

 

Gomes et al [3] has only considered R0 = 3 but with variable connectivity CV (as mentioned above):

Table 9:

 

CV

heff = 1 - (1/Re)

(from Gomes)

Final % of epidemic (from Gomes)

Re

Xco

0

0.65

0.94

2.86

0.93

1

0.31

0.54

1.45

0.55

2

0.12

0.23

1.14

0.23

3

0.07

0.125

1.07

0.12

4

0.04

0.08

1.04

0.07

 

In this case, the values of Xf and Xco agree somewhat better:


Fig.6: variable connectivity [3]

Neither Lourenco [2] nor Britton [4] mentions overshoot, but Gomes [3] clearly does by tabulating Xf (the final composition of the population after the epidemic is over)

Making some assumptions, we have calculated the value Xco at which transmission of the virus completely stops. These assumptions may not be correct – but the conclusions are plausible. And they do match pretty well with the calculations of Gomes [3] – although the matching is better for the variable connectivity case (Fig.5) than it is for the variable susceptibility case (Fig.4).

Discussions:

At this point, it is worth examining what various authors [6-14] have concluded, based on the three papers and their understanding. Since there is considerable overlap, only some points will be taken up.

Jha [6] discusses Sweden’s approach of voluntary compliance and the statistics that show that Sweden did better than the UK – but significantly worse than its Nordic neighbours, and attributes the high death rates to a flawed approach to taking care of the elderly in care homes and foreign migrant labour in crowded urban areas. He points out that 85% of Sweden’s population lives in cities and that the elderly in Sweden’s care homes were already frail (28% of men and 19% of women would die within 6 months of entering the facility, in normal times) and where half of all deaths occurred. Anders Tegnell expected 25% of the population to contract the virus based on Britton’s model [4], while in crowded Stockholm it was a little more than 20%. However, Johann Giesecke still argues that, despite setbacks, the Swedish strategy as a whole is not disqualified. Indeed, he said,” I expect when we count the number of deaths in each country in one year from now, the figures will be similar, regardless of the measures taken.” Giesecke has heavily criticized Ferguson’s paper as fundamentally flawed by debatable assumptions, which nevertheless, provoked ‘a huge over-reaction’ all over the world, but especially in the U.K. and the U.S. Needless to say, Giesecke does not mention the cutoff Xco and sticks to the HIT.

Vineeta Bal [7] discusses the results of serological surveys in India: particularly Delhi (23% sero-positivity), Mumbai (40%), Berhampur (31%). She also points out that it was 51.5% in Pune, ranging from 65% in the most crowded districts to 31% in the least crowded districts. Similar results were obtained in Mumbai (16-57%), with the highest sero-positivity (57%) in the slums of Dharavi. However, Vineeta Bal emphasizes that these tests do not mean that the residents of Dharavi are immune, because the serological test used only detects antibodies, not ‘neutralizing antibodies’, which confer immunity (and are more difficult to detect. It will take 6 weeks to get results, according to Arunab Ghose of IISER). So, while the number of cases in Dharavi has come down, it is premature to conclude that it is because of herd immunity.

The town of Bergamo, the epicenter of the covid-19 outbreak in Italy, recorded 57% of the population had developed antibodies [8].


Fig.7: HIT & Xco vs R0 from [9]

The above figure is from Kleczkowski [9], and it plots both HIT and Xco as a function of R0. Kleczowski is rather balanced, merely pointing out that the concept of herd immunity is ‘not without controversy’ and would lead to a large number of excess deaths. Kleczkowski accepts that diversity (heterogeneity) in the population will lower the HIT from the 60-70% value for a homogeneous population to  a value as low as 10%, and mentions the above 3 papers.

Hartnett [10] also points out that in some cases the threshold could even be higher, e.g. in a nursing or care home. But, ‘on a larger scale’ any heterogeneity in the population (a variable R0) will lower the threshold, since the virus will first pick off the more susceptible but the epidemic slows down when it starts coming up against  less susceptible hosts. Tom Britton feels now that 43% is too high, and that additional sources of heterogeneity, not considered in their published model, may lower the threshold even more. Gabriela Gomes believes that Madrid may be approaching the 20% threshold that their group has calculated. Many other experts, according to Hartnett, believe that these studies are not completely reliable and are cautious about endorsing them, because behavior of people is often random and difficult to model. Kate Langwig (a co-author of Gabriela Gomes) feels that estimating heterogeneity is indeed difficult but it is important to do it: “We have been sloppy in thinking about herd immunity”. Jeffrey Shaman of Columbia objects that 20% is not consistent with other respiratory viruses: if it isn’t 20% for flu, why should it be for covid-19?

Hamblin [11] has discussed Gomes’s paper and introduces chaos into it, arguing that Gomes works on chaos, even though she does not use the word anywhere in her paper. Nevertheless, Hamblin says that dynamic systems can be unpredictable and small changes in susceptibility can lead to large consequences for the outcome of the epidemic - which may be a factor in the low predictability of pandemics. Britton does not think 20% is likely, and favours a higher number. Marc Lipsitch (of Harvard and author of ‘Rules of Contagion’) initially quoted 40-70% but, in later discussions with Hamblin, lowered his estimate to 20-60%, but with an increasing degree of skepticism as the threshold approaches 20%. Shweta Bansal also argues that under certain conditions (nursing homes) the threshold could exceed 70%.

Harry Stevens [12] quotes Yale epidemiologist Marcus Russi to say (about the U.S.) that: “There’s just way too little seroprevalence in all of these states to come anywhere close to achieving herd immunity.” But the higher range of sero-prevalence estimates in the U.S. is 25% - which is much lower than traditional values – but in the same ball-park as the estimates of Gupta [2] and Gomes [3] – but lower than that of Britton [4]. Stevens [12] does not mention the three papers being discussed here.


Fig.8: Herd immunity threshold and Overshoot [13]

Bergstrom and Dean [13] wrote in the NYT about ‘overshoot’ beyond the HIT, and emphasize that when we reach the HIT: “That’s not when things stop — it’s only when they start to slow down.” They also do not discuss the three papers, sticking to the original values of ‘nearly two-thirds’ of the population. According to Bergstrom and Dean:  “A runaway train doesn’t stop the instant the track begins to slope uphill, and a rapidly spreading virus doesn’t stop right when herd immunity is attained.”

Regalado [14] quotes a tweet by Florian Klemmer: “It seems there is the ‘herd immunity is already reached’ team and the ‘we are all going to die’ team. The good thing is, there is a third, ‘let’s get the data and let’s look at what this all means’ team out there.” Regalado also quotes Marc Lipsitch to say that the disease itself, when it causes herd immunity, does so more ‘efficiently’ than giving out the vaccine at random. According to Youyang Gu, roughly 10% of the U.S. population has now been infected. But, estimates vary widely: 10-80% of the population might have to be infected, depending on how well the virus spreads, but also on social factors like how much people ordinarily mix with one another (to achieve herd immunity).

Apoorva Mandavalli [1] points out that in some clinics in the U.S. as many as 80% of people who were tested had antibodies to the virus, with teenage boys having the highest prevalence. In Queens the prevalence was as high as 68%, while it was as low as 13% in Brooklyn. Most experts that Mandavalli discussed the papers with were not willing to accept herd immunity thresholds as low as 10-20%. Biostatistician Natalie Dean asked: where is the evidence that the detected antibodies are actually immuno-protective? Carl Bergstrom argued that, while such low herd immunity thresholds, were mathematically possible, these models are, at the moment, ‘guesses’. Other experts pointed out that all models are flawed because they over-simplify reality and do not accurately represent real world conditions. Jeffrey Shaman considered Gomes’s model as a ‘possible solution’ but questioned the wide range of parameters for different countries in the study. Most researchers are wary of concluding that the hardest-hit neighborhoods of Brooklyn, or even the blighted areas of Mumbai, have reached herd immunity or will be spared future outbreaks.

Franks and Roclov [15] point out that the percentage of antibodies observed in the Diamond Princess was about 20%, and similar numbers have been obtained in Stockholm, New York and London – suggesting that there is something in the 20% idea. However, the prevalence was as high as 54% in the Hartsville Correction Center. The authors discuss the idea of ‘immunological dark matter’, first mentioned by Friston [16] (50% of any population is not susceptible to infection because of cross-immunity from other infections and geographic isolation), but point out that the fact that values much higher than 20% mean that the T-cell innate immunity hypothesis remains to be proved. Finally, if a 20% threshold does exist, it applies to only some communities, depending on interactions between many genetic, immunological, behavioral and environmental factors, as well as the prevalence of pre-existing diseases.

Resnick [17] discusses the possibility that immunity may have a limited shelf life, maybe as little as 3 months. Under such conditions, the ‘let it rip’ approach to going for herd immunity in a society by uncontrolled (or lightly controlled) infection, may not make much sense. Resnick also discusses the various tests for viruses, antibodies, T-cells, B-cells etc. Resnick [18] also quotes Harvard epidemiologist Stephen Kissler [19] who advocates ‘stop-and-go’ social distancing and predicts that it might take till 2022 to build up enough immunity in the population, if it is done in a cautious manner.

Overall, the consensus seems to be that heterogeneity does lower the herd immunity threshold, but whether it goes as far down as 10% is not clear. Britton’s paper is the only one to be peer-reviewed, and in the paper the authors state that the calculated values are ‘indicative’, not set in stone. The point I want to emphasize in this post is that even if heterogeneity reduces the herd immunity threshold to values as low as 10-20% - which most experts believe to be unlikely – the fact of overshoot cannot be ignored. Sunetra Gupta [2] is well aware of overshoot, but only Gabriela Gomes [3] actually gives the values in her paper (as given above in Tables 8 & 9). The cutoff values Xco are significantly higher. It is these values that should concern us – when the epidemic is over, not when it peaks. The values quoted above for serological studies in London, New York, Stockholm, Mumbai etc refer to points in time where the epidemic is definitely past its peak, if not exactly ended, so the appropriate number is the cutoff Xco, not the herd immunity threshold. 

 

References:

1.       Apoorva Mandavalli https://www.nytimes.com/2020/08/17/health/coronavirus-herd-immunity.html

2.       2. Jose Lourenco… Sunetra Gupta  medRxiv 16th July 2020 https://doi.org/10.1101/2020.07.15.20154294

3.       M.G.M.Gomes et al  medRxiv 2nd May 2020

 https://doi.org/10.1101/2020.04.27.20081893

4.       Tom Britton et al Science 14th July 2020  Science 10.1126/science.abc6810 (2020).  

5.      5. G.Bastin: “Lectures on mathematical modeling of biological systems,” 22nd Aug.2018 (GBIO 2060)

6.       6. Ramanath Jha, “Sweden’s ‘Soft’ COVID-19 Strategy: An Appraisal,” ORF Occasional Paper No. 258, July 2020, Observer Research Foundation

 

7.       Vineeta Bal and Satyajeet Rath https://indianexpress.com/article/explained/sero-surveys-during-covid-19-why-do-they-matter-what-do-they-say-6558977/

 

 

8.       Niclas Rolander https://theprint.in/world/sweden-proves-surprisingly-slow-in-achieving-herd-immunity/443329/

 

9.       Adam Kleczkowski https://scroll.in/article/968357/why-herd-immunity-may-not-be-the-perfect-solution-to-coronavirus

 

     10 Kevin Hartnett https://www.quantamagazine.org/the-tricky-math-of-covid-19-herd-immunity-20200630/

11.   James Hamblin The Atlantic 13th July 2020 https://www.theatlantic.com/newsletters/archive/2020/07/coronavirus-herd-immunity/614116/ 

12.   Harry Stevens https://www.washingtonpost.com/graphics/2020/health/coronavirus-herd-immunity-simulation-vaccine

13.   Carl T.Bergstrom and Natalie Dean https://www.nytimes.com/2020/05/01/opinion/sunday/coronavirus-herd-immunity.html

14.   Antonio Regalado MIT https://www.technologyreview.com/2020/08/11/1006366/immunity-slowing-down-coronavirus-parts-us/

.   15. Paul Franks and Joacim Rocklov https://theconversation.com/coronavirus-could-it-be-burning-out-after-20-of-a-population-is-infected-141584

16.   Karl J.Friston et al, “Tracking and tracing in the U.K., a dynamic causal modeling study,” Technical Report 2005.07994

17.   Brian Resnick 28th April 2020 https://www.vox.com/science-and-health/2020/4/28/21237922/antibody-test-covid-19-immunity

 

18.   Brian Resnick 15th May 2020 https://www.vox.com/science-and-health/2020/5/15/21256282/immunity-duration-covid-19-how-long

 

19.   Stephen M.Kissler et al, “Projecting the transmission dynamics of Sars-Cov-2 through the post-pandemic period” Science 368 (2020) 860-868  (22nd May 2020)