eLife (ISSN 2050-084X)

eLife Sciences Publications, Ltd

Article 56915, DOI: 10.7554/eLife.56915

Short Report

Subject areas: Evolutionary Biology; Microbiology and Infectious Disease

A large effective population size for established within-host influenza virus infection

Casper K Lumby (1; http://orcid.org/0000-0001-8329-9228), Lei Zhao (1), Judith Breuer (2, 3), Christopher JR Illingworth (1, 4, 5; https://orcid.org/0000-0002-0030-2784; cjri2@cam.ac.uk)

1. Department of Genetics, University of Cambridge, Cambridge, United Kingdom

2. Great Ormond Street Hospital, London, United Kingdom

3. Division of Infection and Immunity, University College London, London, United Kingdom

4. Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, United Kingdom

5. Department of Computer Science, Institute of Biotechnology, University of Helsinki, Helsinki, Finland

Senior Editor: Aleksandra M Walczak, École Normale Supérieure, France

Reviewing Editor: Armita Nourmohammad, University of Washington, United States

Published 10 August 2020, eLife 2020, e56915

Received 13 March 2020; accepted 30 July 2020

Copyright 2020, Lumby et al

http://creativecommons.org/licenses/by/4.0/

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Strains of the influenza virus form coherent global populations, yet exist at the level of single infections in individual hosts. The relationship between these scales is a critical topic for understanding viral evolution. Here we investigate the within-host relationship between selection and the stochastic effects of genetic drift, estimating an effective population size, N_e, for influenza infection. Examining whole-genome sequence data describing a chronic case of influenza B in a severely immunocompromised child, we infer an N_e of 2.5 × 10⁷ (95% confidence range 1.0 × 10⁷ to 9.0 × 10⁷), suggesting that genetic drift is of minimal importance during an established influenza infection. Our result, supported by data from influenza A infection, suggests that positive selection during within-host infection is primarily limited by the typically short period of infection. Atypically long infections may have a disproportionate influence upon global patterns of viral evolution.

Keywords: within-host evolution; genetic drift; effective population size; selection

Research organism: Virus

Funding: Wellcome (http://dx.doi.org/10.13039/100004440), grant 101239/Z/13/Z, Christopher JR Illingworth

Funding: Wellcome (http://dx.doi.org/10.13039/100004440), grant 101239/Z/13/A, Christopher JR Illingworth

Funding: Wellcome (http://dx.doi.org/10.13039/100004440), grant 105365/Z/14/Z, Casper K Lumby

Funding: Isaac Newton Trust (http://dx.doi.org/10.13039/501100004815), Christopher JR Illingworth

Funding: Helsingin Yliopisto (http://dx.doi.org/10.13039/100007797), Christopher JR Illingworth

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Author impact statement

Once an influenza infection is established, selection acts efficiently in favouring fitter viral genotypes, its effects being limited only by the short length of a typical infection.

Introduction

The evolution of the influenza virus may be considered across a broad range of scales. On a global level, populations exhibit coherent behaviour (Buonagurio et al., 1986; Fitch et al., 1997; Bedford et al., 2015), evolving rapidly under collective host immune pressure (Ferguson et al., 2003; Grenfell et al., 2004). On another level, these global populations are nothing more than very large numbers of individual host infections, separated by transmission events.

Despite the clear role for selection in global influenza populations, recent studies of within-host infection have suggested that positive selection does not strongly influence evolution at this smaller scale (Debbink et al., 2017; McCrone et al., 2018; Han et al., 2019). Contrasting explanations have been given for this, with suggestions either that selection at the within-host level is intrinsically inefficient, being dominated by stochastic processes (McCrone et al., 2018), or that while selection is efficient, a mismatch in timing between the peak viral titre and the host adaptive immune response prevents selection from taking effect (Han et al., 2019).

To resolve this issue, we evaluated the relative importance of selection and genetic drift during a case of influenza infection. The balance between these factors is determined by the effective size of the population, denoted N_e. If N_e is high, selection will outweigh genetic drift, even where differences in viral fitness are small (Rouzine et al., 2001). By contrast, if N_e is low, less fit viruses are more likely to outcompete their fitter compatriots.

Estimating N_e is a difficult task, with a long history of method development in this area (Wright, 1938; Wang et al., 2016; Khatri and Burt, 2019). A simple measure of N_e may be calculated by matching the genetic change in allele frequencies in a population with the changes occurring in an idealised population evolving under genetic drift (Kimura and Crow, 1963). However, such estimates are vulnerable to distortion, for example being reduced by the effect of positive selection in a population. Where the global influenza A/H3N2 population is driven by repeated selective sweeps (Fitch et al., 1991; Rambaut et al., 2008; Strelkowa and Lässig, 2012) a neutral estimation method suggests a value for N_e not much greater than 100 (Bedford et al., 2010). While methods for jointly estimating N_e and selection exist, they are limited in considering only a few loci in linkage disequilibrium (Bollback et al., 2008; Feder et al., 2014; Foll et al., 2014; Terhorst et al., 2015; Rousseau et al., 2017). Non-trivial population structure can affect estimates (Laporte and Charlesworth, 2002); a growing body of evidence supports the existence of structure in within-host influenza infection (Lakdawala et al., 2015; Sobel Leonard et al., 2017a; Richard et al., 2018; Hamada et al., 2012). While careful experimental techniques can reduce sequencing error (McCrone and Lauring, 2016), noise from sequencing and unrepresentative sample collection combine (Illingworth et al., 2017), potentially confounding estimates of N_e in viral populations (Lumby et al., 2018). If N_e is high, any signal of drift can be obscured by noise.

We here estimate a mean effective population size for an established within-host influenza B infection using data collected from a severely immunocompromised host. While the viral load of the infection was not unusual for a hospitalised childhood infection (Wishaupt et al., 2017), an absence of cell-mediated immunity led to the persistence of the infection for several months (Lumby et al., 2020). Given extensive sequence data collected during infection, the reduced role of positive selection, combined with novel methods to account for noise and population structure, enabled an improved inference of N_e. The large effective size we infer suggests that selection acts in an efficient manner during an established influenza infection. Even in more typical cases, the influence of positive selection is likely to be limited only by the duration of infection.

Results and discussion

Viral samples were collected at 41 time points spanning 8 months during the course of an influenza B infection in a severely immunocompromised host (Figure 1A). Clinical details of the case, and the use of viral sequence data in evaluating the effectiveness of clinical intervention, have been described elsewhere (Lumby et al., 2020). After unsuccessful treatment with oseltamivir, zanamivir and nitazoxanide, a bone marrow transplant and favipiravir combination therapy led to the apparent clearance of infection. Apart from a single exception, biweekly samples tested negative for influenza across a period of close to two months. A subsequent resurgence of zanamivir-resistant infection was cleared by favipiravir and zanamivir in combination.

Figure 1. Population structure of the influenza infection.

(A) CT values from viral samples collected over time indicate the viral load of the infection; a higher number corresponds to a lower viral load. Drug information, above, shows the times during which oseltamivir (green), zanamivir (yellow), nitazoxanide (blue) and favipiravir (purple) were prescribed. Black dots show samples from which viral sequence data were collected; gray dots show samples from which viral sequence data were not collected. The green box shows the window of time over which samples were analysed, preceding the use of favipiravir in January. The mean viral load (dashed horizontal line, red) was close to the mean reported for a set of samples from hospitalised children with influenza (dashed horizontal blue line) (Wishaupt et al., 2017). A black arrow shows the date of a bone marrow transplant (BMT). (B) A phylogeny of whole-genome viral consensus sequences identified two distinct clades in the viral population. Clade B featured three samples, distributed across the period of infection, with the remaining samples contained in Clade A. (C) Sub-consensus structure of the viral population inferred via a haplotype reconstruction algorithm using data from the neuraminidase segment. The same division of sequences into two clades is visible, with samples being comprised of distinct viral genotypes. The area of each circle is proportional to the inferred frequency of the corresponding haplotype in the viral population. Haplotypes reaching a frequency of at least 10% in at least one time point are shown. Multiple drugs were administered to the patient through time, with a favipiravir/zanamivir combination first causing a temporary reduction of the population to undetectable levels, then finally clearing the infection. Haplotypes spanned the loci 96, 170, 177, 402, 403, 483, 571, 653, 968, 973, 1011, 1079, 1170, and 1240 in the NA segment. 
(D) Evolutionary relationship between the haplotypes; clade B is distinct from and evolves away from those sequences comprising the initial infection. Numbers refer to the distinct haplotypes identified within the population.

Figure 1—source data 1. Viral load and details of treatment with inferred haplotype frequencies for the neuraminidase viral segment.

(A) CT values and dates of treatment. (C) Reconstructed haplotype frequencies for the neuraminidase viral segment.

Figure 1—source data 2. Data for the phylogenetic tree in Figure 1B.

Figure 1—figure supplement 1. Complete phylogeny of whole-genome viral consensus sequences, coloured by clade.

Figure 1—figure supplement 2. Haplotype reconstruction for data describing the haemagglutinin segment of the virus.

(A) Sub-consensus structure of the viral population inferred via a haplotype reconstruction algorithm using data from the haemagglutinin segment. A division of sequences into two clades is visible, with samples including largely distinct viral genotypes. The area of each circle is proportional to the amount of virus in each clade. Haplotypes reaching a frequency of at least 10% in at least one time point are shown. Haplotypes spanned the loci 258, 261, 364, 451, 521, 541, 635, and 641 in the HA segment. (B) Evolutionary relationship between the haplotypes; clade B is distinct from and evolves away from those sequences comprising the initial infection. Numbers refer to the distinct haplotypes identified within the population.

Figure 1—figure supplement 2—source data 1. Reconstructed haplotype frequencies for the haemagglutinin viral segment.

Phylogenetic analysis of whole-genome viral consensus sequences showed the existence of non-trivial population structure, with at least two distinct clades emerging over time (Figure 1B, Figure 1—figure supplement 1); we term these clades A and B. Having diverged, the two clades persisted across several months of infection. Haplotype reconstruction showed that samples from clade B comprised viral haplotypes distinct from those in clade A; similar patterns were observed in different viral segments (Figure 1—figure supplement 2). The October 4th sample is intermediate between the initial and final samples collected (Figure 1D). We suggest that, from a common evolutionary origin, clade B slowly evolved away from the initial consensus, while viruses in clade A stayed close in sequence space to this consensus. The cladal structure suggests the existence of spatially distinct viral populations in the host, with samples stochastically representing one population or the other.

To estimate the effective population size, we analysed genome-wide sequence data from samples in clade A collected before first use of favipiravir. A method of linear regression was used to quantify the rate of viral evolution, measuring the genetic distance between samples as a function of increasing time between dates of sample collection. We inferred a rate equivalent to 0.051 substitutions per day (97.5% confidence interval 0.034 to 0.068) (Figure 2A), equivalent to 7.94 substitutions genome-wide across 157 days of evolution. The vertical intercept of this line provides an estimate of the contribution of noise to the measured distance between samples, potentially arising from sequencing error or undiagnosed population structure. The identified value of close to 40 substitutions is equivalent to a between-sample allele frequency difference of approximately +/- 0.3% per locus. While considerable noise affects each sample, the dataset as a whole provides a clear signal of evolutionary change.

Figure 2. Measuring rates of evolution in the viral population.

(A) Computed rate of evolution for viruses in clade A up to the time of the first use of favipiravir. The distance between two sequences is calculated as the total absolute difference in four-allele frequencies measured across the genome. The calculated rate per generation is based upon a generation time for influenza of 10 hours (Nobusawa and Sato, 2006). (B) Distribution of evolutionary distances in influenza populations simulated using a Wright-Fisher model compared to the distance per generation calculated in the regression fit. A solid blue line shows the mean, with shading indicating an approximate 97.5% confidence interval around the mean. Statistics were calculated from sets of 400 simulations conducted at each value of N_e. The dashed black line shows the rate of evolution of the real population; gray shading shows a 97.5% confidence interval for this statistic. (C) Calculated rate of evolution for viruses in clade B. For the purposes of calculating a rate of evolution the first sample collected from the patient was included as part of clade B. (D) Estimation of N_e for clade B. The results of simulations shown here are identical to those in part B of the figure.

Figure 2—source data 1. Between sample differences and simulated rates of evolution for clades A and B of the viral population.

(A) Sequence distances D calculated for pairwise samples from the population used in the calculation for clade A. Data points are ordered by the difference in sample times, measured in whole days between samples. (B) Distance statistics calculated from simulated data for different effective population sizes N_e. (C) Sequence distances D calculated for pairwise samples from the population used in the calculation for clade B. (D) Distance statistics calculated from simulated data for different effective population sizes N_e.

Figure 2—figure supplement 1. Amino acids present at codon 117 of the neuraminidase segment of the virus after the first administration of zanamivir.

The consensus glutamate residue (blue) was sometimes replaced by glycine (green), valine (yellow), and alanine (red). Glycine and alanine are associated with zanamivir resistance in influenza B.

Figure 2—figure supplement 1—source data 1. Amino acid frequencies at position 117 in the neuraminidase viral segment.

Figure 2—figure supplement 2. Rates of evolutionary change at non-synonymous and synonymous sites.

(A) Comparison of rates of synonymous and non-synonymous evolution for viruses in clade A up to the time of the administration of favipiravir. The distance between two samples is calculated as the mean absolute difference in allele frequency, as averaged over synonymous and non-synonymous positions in the genome. (B) Comparison of rates of synonymous and non-synonymous evolution for viruses in clade B. The rate of evolution in both clades was slower at non-synonymous sites than at synonymous sites, suggesting a general pattern of purifying selection at non-synonymous sites. Change in the population was not as a whole driven by positive selection.

Figure 2—figure supplement 2—source data 1. Synonymous and non-synonymous sequence distances calculated per nucleotide across the whole viral genome for different pairs of samples.

Data are given according to the interval in time, measured in whole days between samples.

Figure 2—figure supplement 3. Estimates of the effective population size for data from a study of long-term influenza A/H3N2 infection in four patients.

Patients are denoted with the letters assigned them in the original study (Xue et al., 2017). Rates of evolution within each patient were calculated by linear regression, conducted on a plot of evolutionary versus temporal distance between samples. The inferred regression line is shown in red for each dataset. For Patient W samples collected at two time points appear as outliers in the distance plot; distances involving these samples, shown in yellow, were excluded from the calculation. Accompanying plots show distances inferred via simulation compared to the inferred rates. A solid blue line shows the mean, with shading indicating an approximate 97.5% confidence interval around the mean. Statistics were calculated from sets of 400 simulations conducted at each value of N_e.

Figure 2—figure supplement 3—source data 1. Sequence distances D calculated for the Xue et al. dataset.

Equivalent distances for a single generation generated from simulated data at different effective population sizes are also provided. The calculated equivalent distance per generation from the sequence data is also provided.

Figure 2—figure supplement 4. Minority allele frequencies from distinct time points used for the Wright-Fisher simulation applied to the influenza B sequence data.

Allele frequencies from across the genome are sorted and shown on a log scale.

Figure 2—figure supplement 4—source data 1. Sorted allele frequencies collected genome-wide for samples used in the simulation of data.

Only non-zero frequencies are reported. These allele frequencies were processed using a statistical method for removing false positive variant calls as a precursor step within the Wright-Fisher simulation.

Figure 2—figure supplement 5. Frequencies of minority variant alleles identified in the HCV01 dataset used to evaluate the accuracy of variant calling in our sequencing pipeline.

Samples in this dataset were split following RNA extraction with replicate sets of RNA being processed and sequenced independently. Variants at higher frequencies were identified at more consistent frequencies than variants at lower frequencies.

Figure 2—figure supplement 5—source data 1. Replicate allele frequencies from the HCV01 dataset, described in a previous publication, and used in this study to estimate a frequency-dependent positive predictive value for variant calling using the sequencing method applied to the influenza B data.

Figure 2—figure supplement 6. Regions of frequency space used to define observations and non-observations of allele frequencies.

V indicates the identification of a variant, while X indicates the non-identification of a variant. Combinations of V and X indicate observations made in two replicate samples.

Figure 2—figure supplement 7. Positive predictive value for minority variants under our sequencing pipeline, calculated at different frequency ranges.

While high frequency variants were very reliably identified, the reliability of identifying variants was significantly impaired at lower frequencies.

Figure 2—figure supplement 7—source data 1. Frequency-dependent positive predictive values for variant calling.

A simulation-based analysis, measuring the extent of evolution in idealised Wright-Fisher populations (Kimura and Crow, 1963), inferred an effective population size of 2.5 × 10⁷ (95% confidence range 1.0 × 10⁷ to 9.0 × 10⁷) for viruses in clade A before the use of favipiravir (Figure 2B). This value is substantially larger than estimates made recently for within-host HIV infection (Pennings et al., 2014; Rouzine et al., 2014), and suggests that even weak selection could easily overcome genetic drift. Data from clade B gave a lower estimated value of 2 × 10⁶ (95% confidence range 4 × 10⁵ to 2 × 10⁸), perhaps reflecting the less frequent observation of samples in that clade (Figure 2C,D), and the bottleneck induced by favipiravir, which was spanned by the data used in this calculation.

Our value of N_e is representative of the population after the initial establishment of infection; the initial expansion of the viral population was not represented in our data. Population structure during the infection might have lowered the value we obtain (Whitlock and Barton, 1997). The partial onset of zanamivir resistant alleles (Jackson et al., 2005), sporadically observed at intermediate frequency in clade A after the administration of the drug (Figure 2—figure supplement 1), is suggestive of sampling a random mixture of viruses from resistant and susceptible subpopulations.

Our method equates change in a population with genetic drift (Kimura and Crow, 1963), neglecting the role of selection. As such, the influence of positive selection might have led us to underestimate N_e. While viral evolution was generally not driven by selection (Figure 2—figure supplement 2), positive selection (e.g. for zanamivir resistance) would increase the rate of viral evolution, lowering our inferred value. Selection may have influenced the division between clades, perhaps through the adaptation of the virus to specific local environments. Purifying selection may also have influenced the population in ways not accounted for by our method. Yet our result is clear. Once an infection is established, selection will dominate the stochastic effects of drift upon within-host evolution.

The dataset we considered is particularly suited to our calculation. The long period of infection combined with frequent sampling allowed for the characterisation of a slow rate of evolution amidst population structure and noise in the data. Further, the absence of strong selection reduced the error in our inference approach, which assumed an idealised neutral population. To provide further validation we repeated our approach on data describing long-term influenza A/H3N2 infection in four immunocompromised adults (Xue et al., 2017). The estimates for N_e we obtained, of between 3 × 10⁵ and 1 × 10⁶ (Figure 2—figure supplement 3), while high, were smaller than for our influenza B case, potentially being reduced by an increased influence of selection.

We believe that our study provides a first realistic estimate of within-host effective population size for severe influenza infection in humans. The viral load in the influenza B case was high, representative of hospitalised cases of childhood influenza infection. However, the magnitude of our inferred effective size, of order 10⁷, suggests that selection will predominate over drift even in more typical cases. Mean CT values for influenza in non-hospitalised children have been reported as around 10 units lower than those for hospitalised cases (Wishaupt et al., 2017); an order of magnitude calculation suggests an N_e, upon the establishment of infection, of approximately 10⁴ in such cases. Such a value again reflects an established population, not accounting for the initial population bottleneck. It has the implication that the evolution of a measurable variant (i.e. at a frequency of 1% or above) will be dominated by selection of a magnitude of 1% or greater per generation (Rouzine et al., 2001).
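The order-of-magnitude calculation mentioned above can be illustrated with a short sketch. It rests on two assumptions that are not stated in the text: that one CT unit corresponds to roughly a two-fold change in viral load (ideal PCR efficiency), and that N_e scales in proportion to viral load. The function name and its parameters are hypothetical, introduced only for illustration.

```python
def scaled_effective_size(ne_reference, delta_ct, fold_per_cycle=2.0):
    """Scale a reference N_e by the fold-change in viral load implied by a CT gap.

    Assumptions (not taken from the paper): each CT unit corresponds to a
    `fold_per_cycle`-fold change in viral load, and N_e is proportional to
    viral load.
    """
    return ne_reference / fold_per_cycle ** delta_ct


# Reference N_e of order 1e7, with a 10-unit CT gap between the two groups.
ne_typical = scaled_effective_size(1e7, 10)
print(f"{ne_typical:.1e}")
```

Under these assumptions a 10-unit gap corresponds to a factor of 2¹⁰ ≈ 10³, which takes an N_e of order 10⁷ to roughly 10⁴, in line with the value quoted above.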

Our result supports the idea that a tight transmission bottleneck (McCrone et al., 2018; Valesano, 2020; Ghafari et al., 2020) followed by a short period of infection is sufficient to explain the observed lack of within-host variation in typical cases of influenza (Debbink et al., 2017; McCrone et al., 2018); the stochastic effects of genetic drift do not limit the impact of positive selection. Variants arising through de novo mutation would require strong selection to reach a substantial frequency during infection (Zhao et al., 2019), particularly if the onset of selection is delayed (Miao et al., 2010; Illingworth et al., 2014; Morris, 2020). We suggest that, while not being confounded by drift, selection does not usually have time to fix novel variants in the population, exceptions including the emergence of antiviral resistance and some cases of longer infection (Xue et al., 2017; Gubareva et al., 1998; Snydman, 2006; Centers for Disease Control and Prevention (CDC), 2009; Imai et al., 2020; Rogers et al., 2015).

Our result highlights the potential importance of longer infections in the adaptation of global influenza populations, particularly where some adaptive immune response remains. A newly emergent variant under strong positive selection increases faster than linearly in frequency (Haldane, 1924). Given a large N_e, implying efficient selection, additional days of infection will have a disproportionate influence upon the potential transmission of adaptive variants. This does not imply that longer infections are the sole driving force behind global viral adaptation; selective effects affecting viral transmissibility (Lumby et al., 2018) would provide an alternative explanation. However, our work suggests that longer-term infections may be an important area of study in the quest to better understand global influenza virus evolution.

Materials and methods

Summary

In a single-locus haploid system, the expected (root-mean-square) change per generation in the frequency q of a variant allele caused by genetic drift is given by the formula (Charlesworth, 2009)

$$\mathrm{E}(\Delta q) = \sqrt{\frac{q(1-q)}{N_e}} \tag{1}$$

This fact has been exploited to evaluate the size of transmission bottlenecks in influenza infection, comparing statistics of genome sequence data collected before and after a transmission event (Poon et al., 2016; Sobel Leonard et al., 2017b). Such a calculation may be affected by noise in the sampling or sequencing of a population, particularly where the extent of noise outweighs the genuine change in a population (Lumby et al., 2018). Here we suggest that, given multiple samples from a population, an alternative approach is possible; we use this to derive a more robust estimate of N_e. By means of evolutionary simulations we estimate N_e for cases of within-host influenza infection.
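As a numerical illustration of how drift scales with population size, the sketch below draws one generation of binomial sampling in a haploid Wright-Fisher population and compares the variance of the frequency change with the standard result q(1 − q)/N_e. The function name, seed, and parameter values are illustrative assumptions rather than anything used in the study.

```python
import numpy as np


def drift_variance(q, n_e):
    """Variance of the one-generation frequency change under haploid Wright-Fisher drift."""
    return q * (1.0 - q) / n_e


rng = np.random.default_rng(1)   # fixed seed, chosen arbitrarily
q, n_e = 0.2, 1000               # illustrative values only

# One generation of drift: binomial resampling of n_e genomes, repeated many times.
next_freqs = rng.binomial(n_e, q, size=200_000) / n_e
simulated_var = np.var(next_freqs - q)

print(simulated_var, drift_variance(q, n_e))
print("typical change:", np.sqrt(drift_variance(q, n_e)))
```

The typical change falls as the square root of N_e, which is why drift becomes hard to distinguish from measurement noise when N_e is large.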

Sequence data and bioinformatics

Sequence data describing the evolution of the infection were generated as part of a previous study (Lumby et al., 2020). Data, edited to remove human genome sequence data, have been deposited in the Sequence Read Archive with BioProject ID PRJNA601176. The HCV data used in validating the sequencing pipeline (see below) were previously deposited in the Sequence Read Archive with BioProject ID PRJNA380188. Processed files describing raw variant frequencies for both datasets are available, along with code used in this project, at https://github.com/cjri/FluBData (copy archived at https://github.com/elifesciences-publications/FluBData; Illingworth, 2020a).

Short-read data were aligned first to a broad set of influenza sequences. Sequences from this set to which the highest number of reads aligned were identified and used to carry out a second short-read alignment. The SAMFIRE software package was then used to filter the short-read data with a PHRED score cutoff of 30, to identify consensus sequences, and to calculate the number of each nucleotide found at each position in the genome. SAMFIRE is available from https://github.com/cjri/samfire (Illingworth, 2020b).

Calculation of evolutionary distances

Variant frequencies at different time points during infection were used to calculate a rate of change in the population over time. We define q(t) as a 4 × L element vector describing the frequencies of each of the nucleotides A, C, G, and T at each locus in the viral genome at time t. We next define a distance between vectors q. Considering a single locus in the genome, we calculate the change in allele frequencies over time via a generalisation of the Hamming distance

$$d\left(q_i(t_1), q_i(t_2)\right) = \frac{1}{2} \sum_{a \in \{A,C,G,T\}} \left| q_i^a(t_1) - q_i^a(t_2) \right| \tag{2}$$

where the term inside the sum indicates the absolute difference between the frequencies of allele a at locus i at the two time points. The statistic d_i is equal to one in the case of a substitution, for example where only A nucleotides are observed in one sample and only G nucleotides in another. However, in contrast to the Hamming distance it further captures smaller changes in allele frequencies, lesser changes producing values between zero and one, such that a change of a variant frequency from 45% to 55% at a two-allele locus would equate to a distance of 0.1, representing half of the sum of the absolute changes in each of the two frequencies. The total distance between the two vectors may now be calculated as

$$D\left(q(t_1), q(t_2)\right) = \sum_i d\left(q_i(t_1), q_i(t_2)\right) \tag{3}$$

where the sum over i is conducted over all loci in the viral genome.
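The distance defined in Equations 2 and 3 can be sketched in a few lines of Python. This is a minimal illustration, assuming that frequencies are stored as an L × 4 array with columns ordered A, C, G, T; the function names are hypothetical and this is not the code used in the study.

```python
import numpy as np


def locus_distance(q1, q2):
    """Equation 2: half the summed absolute frequency difference at one locus."""
    q1 = np.asarray(q1, dtype=float)
    q2 = np.asarray(q2, dtype=float)
    return 0.5 * np.abs(q1 - q2).sum()


def total_distance(Q1, Q2):
    """Equation 3: sum of per-locus distances over L x 4 frequency arrays."""
    Q1 = np.asarray(Q1, dtype=float)
    Q2 = np.asarray(Q2, dtype=float)
    return 0.5 * np.abs(Q1 - Q2).sum()


# A full substitution (all A in one sample, all G in the other) gives a distance of one.
substitution = locus_distance([1, 0, 0, 0], [0, 0, 1, 0])

# A shift from 45% to 55% at a two-allele locus gives a distance of 0.1.
small_shift = locus_distance([0.45, 0.55, 0, 0], [0.55, 0.45, 0, 0])

# Two loci combined: the total is the sum of the per-locus values.
combined = total_distance(
    [[1, 0, 0, 0], [0.45, 0.55, 0, 0]],
    [[0, 0, 1, 0], [0.55, 0.45, 0, 0]],
)
print(substitution, small_shift, combined)
```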

Sequence distances for non-synonymous and synonymous mutations were calculated in a similar manner, with the exception that distances were calculated over individual nucleotides rather than in a per-locus manner. We calculated

$$D_{NS}\left(q(t_1), q(t_2)\right) = \frac{1}{2\left|A_{N,i}\right|} \sum_{(a,i) \in A_{N,i}} \left| q_i^a(t_1) - q_i^a(t_2) \right| \tag{4}$$

and

$$D_{S}\left(q(t_1), q(t_2)\right) = \frac{1}{2\left|A_{S,i}\right|} \sum_{(a,i) \in A_{S,i}} \left| q_i^a(t_1) - q_i^a(t_2) \right| \tag{5}$$

where A_N,i and A_S,i are the sets of nucleotides a and positions i in the genome which respectively induce non-synonymous and synonymous changes in the consensus sequence. Synonymous and non-synonymous variants were identified with respect to influenza B protein sequences; a nucleotide substitution was defined as being non-synonymous if it induced a change in the coded protein in at least one viral protein sequence. By contrast to our primary distance measurement, values for synonymous and non-synonymous sites were calculated as mean distances per nucleotide, reflecting the differing numbers of each type of potential substitution in the viral genome.

Estimation of effective population size

We converted our measurements of sequence distance into an estimate of N_e by means of a simplified evolutionary model, assuming that all of the change in the population results from genetic drift. We first note the effect of error in measurements of the population upon our distance metric.

We suppose that at the time t, we make the observation:

$$\hat{q}(t) = q(t) + e(t) \tag{6}$$

where e is the error in measuring the population. Our definition of ‘error’ here is a broad one; we include both the potential for viral material in a single swab to not fully capture the entire viral diversity within the host and the potential for the sequencing pipeline to distort the composition of the material in the swab (Illingworth et al., 2017). In our distance calculation, we now have:

$$D\left(\hat{q}(t_1), \hat{q}(t_2)\right) = \frac{1}{2} \sum_i \sum_{a \in \{A,C,G,T\}} \left| \left(q_i^a(t_1) - q_i^a(t_2)\right) + \left(e_i^a(t_1) - e_i^a(t_2)\right) \right| \tag{7}$$

where the terms e_i are locus-specific errors in the measurement of allele frequencies; we write this equation in the form:

$$D\left(\hat{q}(t_1), \hat{q}(t_2)\right) = D\left(q(t_1), q(t_2)\right) + E\left(q(t_1), q(t_2)\right) \tag{8}$$

where E is the deviation incurred from the true distance.

Here, given only two error-prone samples from a system, separation of the real population distance and the error term is impossible. However, given multiple samples, an approximate separation can be made. We here use linear regression to fit a model to the observed distances, fitting the model:

$$D\left(\hat{q}(t_i), \hat{q}(t_j)\right) \approx k \left| t_j - t_i \right| + E \tag{9}$$

for constant values k, approximating the rate of evolutionary change in the population per unit time, and E, approximating the mean amount of error in a measurement; here the term in vertical bars is the absolute difference in time between samples i and j. This approach makes two approximations, which we believe to be either reasonable or possible to account for. Firstly, the model assumes that a linear model is appropriate to describe the change in the population over time; within our drift framework this is correct if the effective population size N_e is constant, and if the distribution of allele frequencies does not change over time. In our data, the consensus population declines approximately eight-fold (Lumby et al., 2020), then undergoes a bottleneck due to the influence of favipiravir; we infer a representative mean value of N_e, selecting for clade A only samples collected before the bottleneck. Secondly, our model assumes that the deviation from truth in our distance metric does not change in a manner that is systematically associated with the time between samples. Regarding the sequencing process we believe this to be correct in so far as a consistent sequencing pipeline was used throughout. Regarding within-host population structure we note in our data a divergence over time between samples from clade A and clade B, but split these samples to obtain distinct estimates of N_e for each clade. We note that large deviations from our model assumptions can be qualitatively identified by a poor fit between a simple regression model and the data.

Linear regression was performed using the Mathematica 11 software package, using the same package to calculate a 97.5% confidence interval for the calculated gradient, k.
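The regression of Equation 9 can be sketched as follows. This is a minimal illustration in Python with NumPy, not the Mathematica code used for the analysis; the function names `distance` and `fit_rate`, and the input layout (one array of allele frequencies per sample, of shape loci × 4), are assumptions made for the example.

```python
import itertools
import numpy as np

def distance(q1, q2):
    """Half the summed absolute allele-frequency difference over loci and A, C, G, T."""
    return 0.5 * np.abs(np.asarray(q1) - np.asarray(q2)).sum()

def fit_rate(times, samples):
    """Least-squares fit of D ~ k * |t_j - t_i| + E over all pairs of samples.

    times   : sampling times (for example in days), one per sample
    samples : allele-frequency arrays, one per sample (hypothetical layout)
    Returns the gradient k and the intercept E.
    """
    gaps, dists = [], []
    for i, j in itertools.combinations(range(len(times)), 2):
        gaps.append(abs(times[j] - times[i]))
        dists.append(distance(samples[i], samples[j]))
    k, E = np.polyfit(gaps, dists, 1)
    return k, E
```

The sketch returns point estimates only; the confidence interval for k would need to be computed separately.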

Wright-Fisher simulation

We next approximated the behaviour of our system using a Wright-Fisher model, re-writing the first component of Equation 9 as

$$D(q(t_1),q(t_2)) \approx \Delta D(N_e, q(t_1))\,|t_2 - t_1| \tag{10}$$

Here ΔD is a stochastic function describing the change in the population, measured according to the metric D, that arises from a single generation of genetic drift in a population with effective size N_e and initial allele frequencies q(t₁). Regarding these allele frequencies we note that the distribution of minor allele frequencies across the genome was reasonably constant between samples for which a good read depth was achieved (Figure 2—figure supplement 4; read depths for these data have previously been reported Lumby et al., 2020). To account for variance in these statistics we used different samples to initiate our simulations, reporting error bars across choices of q(t₁).

Our Wright-Fisher model simulated the evolution of the viral population for a single generation. Rates of evolution calculated from the sequence data were rates of change per day whereas a Wright-Fisher simulation gives an estimated rate of evolution per generation. We therefore scaled the former to match the experimentally ascertained estimate of 10 hr per generation for influenza B (Nobusawa and Sato, 2006).
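As a worked illustration of this scaling, assuming a 24 hr day and the 10 hr generation time quoted above, a rate per day converts to a rate per generation as shown below. The symbols k_day and k_gen are introduced here for the example only.

```latex
k_{\mathrm{gen}} = k_{\mathrm{day}} \times \frac{10\ \mathrm{hr}}{24\ \mathrm{hr}} \approx 0.417\, k_{\mathrm{day}}
```

Equivalently, under these assumptions one day corresponds to 2.4 generations.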

To conduct a simulation we constructed a population of N viruses. Each simulated virus had a genome comprised of eight segments, each identical in length to the corresponding segment of the influenza B virus sampled from the patient. Observations from the clinical viral population were used to specify the genetic composition of the viral population at the beginning of the simulation. A simulated population of viral genomes was established. For each viral segment, a clinical sample was chosen at random. Nucleotide frequencies at each locus in the clinical sample (modified as described below) were used to generate a multinomial sample of viruses from the simulated population, assigning alleles to viruses in the simulated population according to the random sample. This step was repeated for each locus in the segment, with no intrinsic association between alleles at different loci. The sample collected on 30th November 2017 was excluded as a starting point from this analysis due to its low read depth; all other samples had a mean read depth in excess of 2000-fold coverage.

Simulation of the population was conducted at the genome-wide level. We simulated a single generation of the evolution of our population under genetic drift, generating a random sample of N whole viral genomes from the population. Intra-segment recombination was assumed to be negligible (Boni et al., 2008), while reassortment between segments was neglected in line with evidence from cases of human infection (Sobel Leonard et al., 2017a). We collected allele frequency data from the initial and final populations, using these to calculate the distance in sequence space through which the population had evolved according to the modified Hamming distance described above.

For each population size tested, our simulation was run 400 times, using the data to produce a 97.5% confidence interval for the extent of evolutionary change at a given effective population size. For each of these 400 replicate simulations, an independent random set of samples was chosen to initiate each of the eight simulated viral segments. The extent of evolution of the real population was compared to the results from our simulated populations, giving an inference of the effective size of the viral population.
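The core of the simulation described above can be sketched in a few lines. This is a simplified illustration under stated assumptions rather than the code used in the study: it takes a single array of allele frequencies (loci × 4) instead of choosing a clinical sample per segment, it omits the false-positive and frequency-threshold amendments, and the function names are hypothetical.

```python
import numpy as np

def init_population(freqs, n, rng):
    """Build n genomes; alleles at each locus follow a multinomial draw from freqs."""
    freqs = np.asarray(freqs, dtype=float)
    pop = np.empty((n, freqs.shape[0]), dtype=np.int8)
    for locus, p in enumerate(freqs):
        counts = rng.multinomial(n, p)
        alleles = np.repeat(np.arange(4, dtype=np.int8), counts)
        # Shuffle so that alleles at different loci are assigned independently.
        pop[:, locus] = rng.permutation(alleles)
    return pop

def allele_freqs(pop):
    """Frequencies of alleles 0..3 at each locus, as an array of shape (loci, 4)."""
    return np.stack([(pop == a).mean(axis=0) for a in range(4)], axis=1)

def simulate_distance(freqs, n, rng):
    """Distance moved in one generation of drift: resample n whole genomes."""
    pop = init_population(freqs, n, rng)
    offspring = pop[rng.integers(0, n, size=n)]
    return 0.5 * np.abs(allele_freqs(offspring) - allele_freqs(pop)).sum()
```

Repeating `simulate_distance` many times at each candidate population size gives a distribution of single-generation distances that can be compared with the rate estimated from the sequence data.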

Amendments were made to the above approach.

Accounting for false-positive variants in sequencing: Estimating a false positive rate

The evolutionary distance ΔD(N,q(t₁)) calculated by our method is dependent upon the vector of allele frequencies q. Given a greater number of polymorphic alleles in a system, the evolutionary distance, calculated as the sum of allele frequency changes, will also increase. While the experimental pipeline we used has been shown to perform well in capturing within-host viral diversity (STOP-HCV Consortium et al., 2016), the possibility remains that sequencing could contribute additional diversity to the initial populations used in our simulation. We therefore made an estimate of the extent to which our sequencing process led to the false identification of variants. To achieve this, we used data from a previous study describing the repeat sequencing of hepatitis C virus (HCV) samples from a host (Illingworth et al., 2017); data in this previous study were collected using the same sequencing pipeline as that used to collect the data considered here and therefore provide a generic measure of the level of false positive variation. The data we analysed, coded as HCV01 in the original study, comprised four clinical HCV samples, each of which was split following nucleic acid extraction. Some replicate samples were processed using a DNase depletion method before all samples went through cDNA synthesis, library preparation and sequencing. DNase depletion led to samples with lower read depth; we here compared sequence data collected from the non-depleted replicates of each sample. Variant frequencies within this dataset, where variation was observed in more than one sample, are shown in Figure 2—figure supplement 5.

Considering the real viral sample, we note that at any given genetic locus, a minority variant either exists or does not exist according to some well-defined criterion. (For the moment the way in which variation is defined is not important; methods for defining variation, which include the use of a frequency threshold, are discussed later.) We denote the possible states of a locus as P and N, according to whether the locus is positive or negative for variation. We suppose that the probability that a random locus in the genome has a minority variant is given by P_P, leading to the equivalent statistic P_N = 1 − P_P.

Sequencing of a specific position in the genome results in the observation or non-observation of a variant. In our data we have sets of two replicate observations of each position in the genome, giving for each minority variant the possible outcomes VV, VX, XV, and XX, where V corresponds to the observation of a variant, and X corresponds to the non-observation of a variant. These observations contain errors; we denote the true positive, false positive, true negative and false negative rates of the variant identification process by P_V|P, P_V|N, P_X|N, and P_X|P respectively. In this notation, V|P indicates the observation of a variant conditional on the variant being a true positive.

The underlying purpose of our calculation is to remove falsely detected variation from the population. We begin by assuming that the false negative rate of detecting variants is equal to zero. That is, where we do not see a variant in the sequence data, we assume that a variant is never actually present. This is a conservative step in so far as we never add unobserved variation to the population. Our assumption gives the result that the false negative rate, P_X|P = 0. In so far that a variant is never unobserved it follows that the true positive rate P_V|P = 1.

Thus the outcome probabilities may be expressed in terms of the underlying probability of a position having a variant, P_P, and the false positive rate P_V|N.
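One form these probabilities could take is sketched below. It is derived here from the stated assumptions (P_V|P = 1 and P_X|P = 0), together with the additional assumption that false positives arise independently in the two replicates; it should be read as an illustration of the model rather than as a quotation of it.

```latex
\begin{aligned}
P_{VV} &= P_P + (1 - P_P)\,P_{V|N}^{2} \\
P_{VX} = P_{XV} &= (1 - P_P)\,P_{V|N}\,(1 - P_{V|N}) \\
P_{XX} &= (1 - P_P)\,(1 - P_{V|N})^{2}
\end{aligned}
```

Under these assumptions the four probabilities sum to one, and a true variant always contributes to the VV outcome.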

We next processed our sequence replicate data, considering only sites that were sequenced to a read depth of at least 2000-fold coverage. For each locus in a dataset, we calculated the observed frequency of each of the nucleotides A, C, G, and T, generating pairs which described these frequencies in each of our two replicate datasets. Removing pairs in which an allele has a frequency of more than 0.5 in either of the two datasets, we obtained a list of minority variants from each locus, generally comprising three allele frequency pairs per locus. If it is correct that two of the three minority alleles have very low frequencies, the frequencies are close to being statistically independent; the existence of a very few alleles of one minority type does not greatly affect the probability of another variant allele being observed in another read. We note that, of the more than 73 thousand sites sequenced, only 56, fewer than 0.1%, had more than one minority variant at a frequency greater than 1%. We proceeded on the assumption that each pair of minority frequencies was statistically independent of the others.

From the repeated observations of sites, we may count the number of observations of each of the four outcomes; given a total of N pairs we denote these as N_VV, N_VX, N_XV, and N_XX. Under our model of independent pairs we constructed the multinomial log likelihood of the underlying variant and false positive rates:

$$L(P_P, P_{V|N}) = \log\left[\binom{N}{N_{VV}\;N_{VX}\;N_{XV}\;N_{XX}} P_{VV}^{N_{VV}}\, P_{VX}^{N_{VX}}\, P_{XV}^{N_{XV}}\, P_{XX}^{N_{XX}}\right] \tag{14}$$

where the terms P_ab are constructed from P_P and P_V|N according to the equations above.

Given a set of paired observations, we calculated the maximum likelihood values of P_P and P_V|N. From these statistics we are able to calculate the positive predictive value of sequencing, namely the proportion of observed variants that are true positives. This is achieved by dividing the probability that a true positive was detected (equal to the probability P_P that a variant is present, as P_V|P = 1) by the probability that a variant was detected:

$$PPV = \frac{P_P}{P_P + (1 - P_P)\,P_{V|N}} \tag{15}$$
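A minimal sketch of this estimation follows. It assumes that false positives are independent between the two replicates, so that P_VV = P_P + (1 − P_P)P_V|N², P_VX = P_XV = (1 − P_P)P_V|N(1 − P_V|N) and P_XX = (1 − P_P)(1 − P_V|N)²; this form is a reconstruction from the stated assumptions, and the function names are hypothetical. Under that model the likelihood maximum has a closed form whenever the resulting estimates lie between zero and one, which the sketch assumes rather than checks.

```python
import math

def outcome_probs(p_p, fpr):
    """Probabilities of VV, VX, XV, XX given P_P and the false positive rate P_V|N."""
    p_n = 1.0 - p_p
    mixed = p_n * fpr * (1.0 - fpr)
    return (p_p + p_n * fpr ** 2, mixed, mixed, p_n * (1.0 - fpr) ** 2)

def log_likelihood(counts, p_p, fpr):
    """Multinomial log likelihood, omitting the constant multinomial coefficient."""
    probs = outcome_probs(p_p, fpr)
    return sum(n * math.log(p) for n, p in zip(counts, probs) if n > 0)

def fit(counts):
    """Closed-form maximiser for counts (N_VV, N_VX, N_XV, N_XX).

    Assumes the estimates fall in the interior of the parameter range.
    """
    n_vv, n_vx, n_xv, n_xx = counts
    total = sum(counts)
    discordant = (n_vx + n_xv) / total
    absent = n_xx / total
    fpr = discordant / (discordant + 2.0 * absent)
    p_p = 1.0 - absent / (1.0 - fpr) ** 2
    return p_p, fpr

def ppv(p_p, fpr):
    """Positive predictive value: the fraction of observed variants that are real."""
    return p_p / (p_p + (1.0 - p_p) * fpr)
```

A numerical optimiser applied to `log_likelihood` would be an alternative where the closed form falls outside the valid range.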

Frequency dependence of false-positive variant calling

Within our data, our expectation was that minority variants at higher allele frequencies would be more likely to be observed as variants in both replicate samples. We note that, where a frequency cutoff is applied to identify variants, care is required in the above protocol. For example, if a hard threshold was applied, in which variants were called at 1% frequency, a variant that was detected at frequencies of 1.01% and 0.99% would be regarded as having been observed in one case, and not observed in the other, although it likely represents a consistent observation.

In order to assess the frequency dependence of our true positive rate, we defined minimum and maximum variant frequency thresholds q^min and q^max, and denoted the replicate observations of a minority variant frequency as q^A and q^B in the two samples. We further defined the frequency q^cut according to the formula:

$$q^{cut} = \min\left\{q^{min},\ \max\left\{\frac{q^{min}}{2},\ 0.001\right\}\right\} \tag{16}$$

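We then defined regions of frequency space as follows:

$$\begin{aligned}
VV &:\ q^A \ge q^{cut};\ q^A < q^{max};\ q^B \ge q^{cut};\ q^B < q^{max};\ q^A + q^B \ge \tfrac{3q^{min}}{2};\ q^A + q^B < \tfrac{3q^{max}}{2} \\
VX &:\ q^{min} \le q^A < q^{max};\ q^B < q^{cut} \\
XV &:\ q^A < q^{cut};\ q^{min} \le q^B < q^{max} \\
XX &:\ q^A < q^{cut};\ q^B < q^{cut};\ q^A + q^B < \tfrac{3q^{min}}{2}
\end{aligned} \tag{17}$$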

These inequalities are illustrated in Figure 2—figure supplement 6.

In the above, q^cut functions to slightly harshen the criteria for detecting variants at low frequencies. If a variant is observed in one sample at frequency greater than q^min, then if q^min is greater than 0.2%, the frequency in the second sample had to be at least half q^min to be counted. If q^min was between 0.1% and 0.2%, the frequency in the second sample had to be at least 0.1%, while if q^min was less than 0.1%, the frequency in the second sample had to be at least q^min.
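The categorisation can be illustrated with a short sketch. Frequencies are expressed as fractions (1% = 0.01). The lower bound on q^A + q^B in the VV region is taken here to be 3q^min/2, mirroring the XX region; that choice, and the function names, are assumptions of this example.

```python
def q_cut(q_min):
    """Cutoff for the second replicate, following Equation 16."""
    return min(q_min, max(q_min / 2, 0.001))

def classify(q_a, q_b, q_min, q_max):
    """Assign a pair of replicate frequencies to VV, VX, XV or XX.

    Returns None when the pair falls outside all four regions.
    """
    cut = q_cut(q_min)
    total = q_a + q_b
    if (cut <= q_a < q_max and cut <= q_b < q_max
            and 1.5 * q_min <= total < 1.5 * q_max):
        return "VV"
    if q_min <= q_a < q_max and q_b < cut:
        return "VX"
    if q_a < cut and q_min <= q_b < q_max:
        return "XV"
    if q_a < cut and q_b < cut and total < 1.5 * q_min:
        return "XX"
    return None
```

With q^min = 1% and q^max = 2%, a variant seen at 1.01% and 0.99% is classed as VV under this scheme, whereas a hard 1% threshold would class it as observed once.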

For different ranges of frequency values, q^min and q^max, the proportion of observed variants that were true positives was calculated according to the maximum likelihood method above, using these categorisations. Results are shown in Figure 2—figure supplement 7. In the process of setting up the initial state of our Wright-Fisher simulation variants observed in the sequence data were considered in turn, drawing a Bernoulli random variable for each variant. Variants were included in the initial simulated population with probability equal to the proportion of observed variants that were estimated to be true positives.
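The inclusion step can be sketched as below. The callable `ppv_for_freq`, which maps a variant frequency to the estimated proportion of true positives for its frequency range, is a hypothetical stand-in for the values shown in the figure supplement.

```python
import numpy as np

def filter_variants(variant_freqs, ppv_for_freq, rng):
    """Keep each observed variant with probability equal to its estimated PPV."""
    kept = []
    for q in variant_freqs:
        # One Bernoulli draw per variant.
        if rng.random() < ppv_for_freq(q):
            kept.append(q)
    return kept
```

A call such as `filter_variants(freqs, lambda q: 0.9, np.random.default_rng())` would retain roughly nine in ten variants on average.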

Accounting for mutation-selection balance

To account for our neglect of mutation, a frequency cutoff was applied to our simulation data. Under a pure process of genetic drift, low-frequency variants in our population are likely to die out, reaching a frequency of zero. In a real population, this would not occur, variants being sustained at low frequencies by a balance of mutation and purifying selection (Haldane, 1937; Haigh, 1978). To correct for this we post-processed the initial and final frequency values from our simulations before calculating our distance, imposing a minimum minority allele frequency of 0.1%. All changes in allele frequency below this threshold were ignored, such that, for example, if a variant changed from 0.5% to 0%, this was processed after the fact so that the variant changed from 0.5% to 0.1%. The choice of threshold here is conservative, leading to a conservatively low estimate of N_e.
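This post-processing step amounts to flooring minority allele frequencies before the distance is calculated. A minimal sketch, with a hypothetical function name and frequencies given as fractions:

```python
import numpy as np

def floored_change(q_initial, q_final, floor=0.001):
    """Absolute change in minority allele frequency after imposing a minimum frequency."""
    q0 = np.maximum(np.asarray(q_initial, dtype=float), floor)
    q1 = np.maximum(np.asarray(q_final, dtype=float), floor)
    return np.abs(q1 - q0)
```

A variant falling from 0.5% to 0% is therefore counted as a change of 0.4%, while a variant that stays below 0.1% throughout contributes nothing.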

Confidence intervals

Confidence intervals for the effective population size were calculated as the overlap of 97.5% confidence intervals for the evolutionary rates in the observed data, calculated from the regression for the real data, and estimated from the simulated statistics. The overlap of these values gives an approximate 95% confidence interval for N_e.
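The overlap calculation can be sketched as follows, assuming the simulated intervals are stored per tested population size. The function name and the data layout are illustrative only.

```python
def overlapping_sizes(observed, simulated):
    """Population sizes whose simulated rate interval overlaps the observed one.

    observed  : (low, high) interval for the rate per generation from the regression
    simulated : dict mapping each tested N to a (low, high) interval from simulation
    """
    lo, hi = observed
    return sorted(n for n, (s_lo, s_hi) in simulated.items()
                  if s_lo <= hi and lo <= s_hi)
```

The smallest and largest values returned bound the approximate interval for N_e on the grid of sizes tested.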

Variations in methodology

A number of choices were made in our estimation of an effective population size. The effects of each of these choices were explored through further calculation and simulation. Results are shown in Supplementary file 1.

Approximations in the Wright-Fisher model

In the calculation to set up an initial viral population, the assignment of minority alleles to sequences becomes slow at large population sizes. Our code simulated viral genomes; a variant allele was included into the population by choosing an appropriate proportion of genomes to which the variant was assigned. For greater computational efficiency we used a pseudo-random approach for choosing genomes. Given a population size N, we generated a set P of prime numbers that were each larger than N. Given some desired allele frequency q we wish to choose qN genomes to which to assign the variant. We therefore calculated the set of numbers:

$$a^k \pmod{p} \tag{18}$$

where p is a prime number sampled at random from the set P, and a is a randomly chosen primitive root of p. Given this choice of a and p, the values a^k (where k is an integer between one and p-1) form a pseudorandom permutation of the numbers from one to p-1. We constructed a set of qN genomes by choosing genomes indexed in turn by the elements of this set, beginning from k = 1, and discarding values greater than N.
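The selection scheme can be illustrated with the sketch below. It takes the prime p and the primitive root a as arguments rather than sampling them at random, and the function names are hypothetical; it is intended to show the idea rather than to reproduce the implementation used.

```python
def prime_factors(n):
    """Distinct prime factors of n, by trial division."""
    factors, d = set(), 2
    while d * d <= n:
        while n % d == 0:
            factors.add(d)
            n //= d
        d += 1
    if n > 1:
        factors.add(n)
    return factors

def is_primitive_root(a, p):
    """True if a generates every value 1..p-1 modulo the prime p."""
    return all(pow(a, (p - 1) // f, p) != 1 for f in prime_factors(p - 1))

def select_genomes(n, count, p, a):
    """Return `count` distinct genome indices in 1..n, visiting a^k mod p for k = 1, 2, ..."""
    if p <= n or not is_primitive_root(a, p):
        raise ValueError("need a prime p > n and a primitive root a of p")
    chosen, value = [], 1
    for _ in range(p - 1):
        if len(chosen) == count:
            break
        value = (value * a) % p
        if value <= n:  # discard values greater than the population size
            chosen.append(value)
    return chosen
```

For example, with p = 11 and a = 2 the sequence begins 2, 4, 8, 5, so a population of seven genomes with three carriers of the variant would use genomes 2, 4 and 5.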

To achieve calculations for population sizes larger than 10⁷ we implemented a statistical averaging method. We generated a single population of size 10⁶, then generated 200 outcomes of a single generation of the same size, recording allele frequencies in each case. In order to simulate a value of N of size r × 10⁶ we compared the frequencies of the initial population to the mean frequencies of a random set of r outcomes. This is equivalent to simulating transmission from a population of size r × 10⁶ in which the initial population contains r copies of each of 10⁶ genotypes.
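The reasoning behind the averaging can be checked at the level of a single allele. The sum of r independent binomial draws of size M is itself a binomial draw of size rM, so the mean of r outcome frequencies has the sampling variance expected of a population of size rM. The sketch below draws fresh outcomes each time, whereas the method described above reuses a fixed pool of 200; the function name is hypothetical.

```python
import numpy as np

def averaged_frequency(p, r, rng, base_size=10**6):
    """Mean allele frequency over r single-generation outcomes of size base_size."""
    outcomes = rng.binomial(base_size, p, size=r) / base_size
    return outcomes.mean()
```

The variance of the returned value is p(1 − p)/(r × base_size), matching one generation of drift in a population r times larger.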

Phylogenetic analysis

Consensus sequences of data were analysed using the BEAST2 software package (Bouckaert et al., 2014). Consensus sequences from each viral segment were concatenated then aligned using MUSCLE (Edgar, 2004) before performing a phylogenetic analysis on the whole genome sequence alignment. The B/Venezuela/02/2016 sequence was used to root the alignment, the haemagglutinin segment of this virus having been identified as being very close to those from the patient. Trees were generated using the HKY substitution model (Hasegawa et al., 1985). A Monte Carlo process was run for 10 million iterations, generating a consensus tree with TreeAnnotator using the first 10% of trees as burn-in. Figures were made using the FigTree package (http://tree.bio.ed.ac.uk/software/figtree/).

Haplotype reconstruction

Haplotype reconstruction was performed using multi-locus polymorphism data generated by the SAMFIRE software package (Illingworth, 2016). Variant loci in the genome were identified as those at which a change in the consensus nucleotide was observed between the initial and the final consensus. The short-read data were then processed, converting reads into strings of alleles observed at these loci; a single paired-end read may describe alleles at none, one, or multiple loci. Next, these strings were combined using a combinatorial algorithm to construct a list of single-segment haplotypes, sufficient to explain all of the observed data; no frequencies were inferred at this point. Finally, a Dirichlet-multinomial model was used to infer the maximum likelihood frequencies of each haplotype given the data from each time point (Illingworth, 2015). Formally, we divided reads into sets, according to the loci at which they described alleles. A multi-locus variant consists of an observation of some specific alleles at the loci in question. By way of notation, we denote by n_i^a the number of reads in set i which describe the multi-locus variant a, and denote the total number of reads in the set as N_i. Given a set of haplotypes with frequencies given by the elements of the vector q, we write as q_i^a the summed frequencies of haplotypes that match each multi-locus variant a in set i. For example, the haplotypes ATA and ATG would both match the multi-locus variant AT- describing alleles at only the first two loci. We now express a likelihood for the haplotype frequencies:

$$\mathcal{L}(q) = \sum_i \log\left[\frac{\Gamma(N_i + 1)}{\prod_a \Gamma(n_i^a + 1)}\ \frac{\Gamma\left(\sum_a C q_i^a\right)}{\Gamma\left(\sum_a \left(n_i^a + C q_i^a\right)\right)}\ \prod_a \frac{\Gamma(n_i^a + C q_i^a)}{\Gamma(C q_i^a)}\right] \tag{19}$$

Here the parameter C describes the extent of noise in the sequence data, a lower value indicating a lower confidence in the sequence data. Haplotype reconstruction was performed by finding the maximum likelihood value of the vector of haplotype frequencies q. A value of C = 200 was chosen for the calculation, representing a conservative estimate given the prior performance of the sequencing pipeline used in this study (Illingworth et al., 2017). In contrast to previous calculations in which an evolutionary model was fitted to data (Illingworth, 2015), haplotype frequencies for each time point and for each viral segment were in this case inferred independently, with no underlying evolutionary model.
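The likelihood of Equation 19 can be evaluated with log-gamma functions, as in the sketch below. This is an illustration rather than the SAMFIRE implementation; the function names and the input layout (a list of read counts and a matching list of summed haplotype frequencies for each read set) are assumptions, and every q_i^a is assumed to be strictly positive.

```python
from math import lgamma

def dm_log_likelihood(counts, probs, C):
    """Dirichlet-multinomial log likelihood for one read set.

    counts : read counts n_i^a for each multi-locus variant a
    probs  : summed haplotype frequencies q_i^a matching each variant
    C      : precision parameter; lower values imply noisier data
    """
    total = sum(counts)
    alpha = [C * q for q in probs]
    ll = lgamma(total + 1) - sum(lgamma(n + 1) for n in counts)
    ll += lgamma(sum(alpha)) - lgamma(total + sum(alpha))
    ll += sum(lgamma(n + a) - lgamma(a) for n, a in zip(counts, alpha))
    return ll

def total_log_likelihood(read_sets, C):
    """Sum the log likelihood over read sets, each given as (counts, probs)."""
    return sum(dm_log_likelihood(c, p, C) for c, p in read_sets)
```

Maximising `total_log_likelihood` over the haplotype frequency vector, with C = 200 as in the text, would then give the inferred frequencies for a time point.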

Data describing influenza A/H3N2 infection

Our analysis of data describing long-term influenza A/H3N2 infection was performed on data from a previous study (Xue et al., 2017). As our method does not require an exceptional quality of sequencing data to calculate a rate of evolution, more samples were included in our analysis than were examined in the original study. Using the codes established in the previous study, we used samples from patient W from days 0, 7, 14, 21, 28, 56, 62, 69 and 76; from patient X from days 0, 7, 14, 21, 28, 42, and 72; from patient Y from days 0, 7, 14, 21, 28, 35, 48, 56, and 70; and from patient Z from days 14, 15, 20, 25, 41, 48, 55, 62, and 69. An identical procedure to that used to estimate N_e from the influenza B data was applied, calculating a rate of evolution per day from sequence data, scaling this to a rate per generation (in this case a seven hour generation time was modelled [Nobusawa and Sato, 2006]), and then running simulations to estimate N_e. We note that the estimates of false positive rate generated for the influenza B data were applied equally in this case, as equivalent data with which to re-estimate these values were not available. Examining the data from patient W, our distance measurements suggested potential population structure involving the samples collected on days 62 and 69; these samples were excluded from our regression analysis.

Additional information

Competing interests

No competing interests declared

Author contributions

Data curation, Software, Formal analysis, Validation, Investigation, Methodology, Writing - review and editing

Formal analysis, Investigation, Methodology, Writing - review and editing

Resources, Project administration, Writing - review and editing

Conceptualization, Resources, Data curation, Software, Formal analysis, Supervision, Funding acquisition, Validation, Investigation, Visualization, Methodology, Writing - original draft, Project administration, Writing - review and editing

Additional files

Supplementary file 1. Inferred effective population sizes for data from clade A generated under different modelling assumptions.

Transparent reporting form

Data availability

All sequence data is taken from previous publications, and is available from the Sequence Read Archive. Where this is sensible, raw data underlying figures has been made available in files which accompany this document.

The following previously published datasets were used:

Xue KS, Bloom JD (2017) Longitudinal deep sequencing of human influenza A (H3N2) from immunocompromised patients. NCBI BioProject. PRJNA364676

Lumby CK, Zhao L, Oporto M, Best T, Tutill H, Shah D, Veys P, Williams R, Worth A, Illingworth CRJ, Breuer J (2020) Favipiravir and zanamivir clear influenza B infection in an immunocompromised child. NCBI BioProject. PRJNA601176

References

Bedford, Cobey, Beerli, Pascual (2010) Global migration dynamics underlie evolution and persistence of human influenza A (H3N2). PLOS Pathogens 6:e1000918. doi:10.1371/journal.ppat.1000918. PMID: 20523898

Bedford, Riley, Barr, Broor, Chadha, Cox, Daniels, Gunasekaran, Hurt, Kelso, Klimov, Lewis, McCauley, Odagiri, Potdar, Rambaut, Shu, Skepner, Smith, Suchard, Tashiro, Wang, Lemey, Russell (2015) Global circulation patterns of seasonal influenza viruses vary with antigenic drift. Nature 523:217-220. doi:10.1038/nature14460. PMID: 26053121

Bollback, York, Nielsen (2008) Estimation of 2Nes from temporal allele frequency data. Genetics 179:497-502. doi:10.1534/genetics.107.085019. PMID: 18493066

Boni, Zhou, Taubenberger, Holmes (2008) Homologous recombination is very rare or absent in human influenza A virus. Journal of Virology 82:4807-4811. doi:10.1128/JVI.02683-07. PMID: 18353939

Bouckaert, Heled, Kühnert, Vaughan, Wu C-H, Xie, Suchard, Rambaut, Drummond (2014) BEAST 2: a software platform for Bayesian evolutionary analysis. PLOS Computational Biology 10:e1003537. doi:10.1371/journal.pcbi.1003537

Buonagurio, Nakada, Parvin, Krystal, Palese, Fitch (1986) Evolution of human influenza A viruses over 50 years: rapid, uniform rate of change in NS gene. Science 232:980-982. doi:10.1126/science.2939560

Centers for Disease Control and Prevention (CDC) (2009) Oseltamivir-resistant novel influenza A (H1N1) virus infection in two immunosuppressed patients - Seattle, Washington, 2009. MMWR. Morbidity and Mortality Weekly Report 58:893-896. PMID: 19696719

Charlesworth (2009) Fundamental concepts in genetics: effective population size and patterns of molecular evolution and variation. Nature Reviews. Genetics 10:195-205. doi:10.1038/nrg2526. PMID: 19204717

Debbink, McCrone, Petrie, Truscon, Johnson, Mantlo, Monto, Lauring (2017) Vaccination has minimal impact on the intrahost diversity of H3N2 influenza viruses. PLOS Pathogens 13:e1006194. doi:10.1371/journal.ppat.1006194

Edgar (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research 32:1792-1797. doi:10.1093/nar/gkh340. PMID: 15034147

Feder, Kryazhimskiy, Plotkin (2014) Identifying signatures of selection in genetic time series. Genetics 196:509-522. doi:10.1534/genetics.113.158220. PMID: 24318534

Ferguson, Galvani, Bush (2003) Ecological and immunological determinants of influenza evolution. Nature 422:428-433. doi:10.1038/nature01509. PMID: 12660783

Fitch, Leiter, Palese (1991) Positive Darwinian evolution in human influenza A viruses. PNAS 88:4270-4274. doi:10.1073/pnas.88.10.4270. PMID: 1840695

Fitch, Bush, Bender, Cox (1997) Long term trends in the evolution of H(3) HA1 human influenza type A. PNAS 94:7712-7718. doi:10.1073/pnas.94.15.7712. PMID: 9223253

Foll, Poh, Renzette, Ferrer-Admetlla, Bank, Shim, Malaspinas, Ewing, Liu, Wegmann, Caffrey, Zeldovich, Bolon, Wang, Kowalik, Schiffer, Finberg, Jensen (2014) Influenza virus drug resistance: a time-sampled population genetics perspective. PLOS Genetics 10:e1004185. doi:10.1371/journal.pgen.1004185. PMID: 24586206

Ghafari, Lumby, Weissman, Illingworth CJR (2020) Inferring transmission bottleneck size from viral sequence data using a novel haplotype reconstruction method. Journal of Virology 94. doi:10.1128/JVI.00014-20

Grenfell, Pybus, Gog, Wood, Daly, Mumford, Holmes (2004) Unifying the epidemiological and evolutionary dynamics of pathogens. Science 303:327-332. doi:10.1126/science.1090727. PMID: 14726583

Gubareva, Matrosovich, Brenner, Bethell, Webster (1998) Evidence for zanamivir resistance in an immunocompromised child infected with influenza B virus. The Journal of Infectious Diseases 178:1257-1262. doi:10.1086/314440

Haigh (1978) The accumulation of deleterious genes in a population--Muller's Ratchet. Theoretical Population Biology 14:251-267. doi:10.1016/0040-5809(78)90027-8. PMID: 746491

Haldane JBS (1924) A mathematical theory of natural and artificial selection. Transactions of the Cambridge Philosophical Society 23:19-41. doi:10.1017/S0305004100015176

Haldane JBS (1937) The effect of variation of fitness. The American Naturalist 71:337-349. doi:10.1086/280722

Hamada, Imamura, Hara, Kashiwagi, Imamura, Nakazono, Chijiwa, Watanabe (2012) Intrahost emergent dynamics of oseltamivir-resistant virus of pandemic influenza A (H1N1) 2009 in a fatally immunocompromised patient. Journal of Infection and Chemotherapy 18:865-871. doi:10.1007/s10156-012-0429-0. PMID: 22661221

Han, Maurer-Stroh, Russell (2019) Individual immune selection pressure has limited impact on seasonal influenza virus evolution. Nature Ecology & Evolution 3:302-311. doi:10.1038/s41559-018-0741-x. PMID: 30510176

Hasegawa, Kishino, Yano (1985) Dating of the human-ape splitting by a molecular clock of mitochondrial DNA. Journal of Molecular Evolution 22:160-174. doi:10.1007/BF02101694. PMID: 3934395

Illingworth, Fischer, Mustonen (2014) Identifying selection in the within-host evolution of influenza using viral sequence data. PLOS Computational Biology 10:e1003755. doi:10.1371/journal.pcbi.1003755. PMID: 25080215

Illingworth (2015) Fitness inference from Short-Read data: within-host evolution of a reassortant H5N1 influenza virus. Molecular Biology and Evolution 32:3012-3026. doi:10.1093/molbev/msv171. PMID: 26243288

Illingworth (2016) SAMFIRE: multi-locus variant calling for time-resolved sequence data. Bioinformatics 32:2208-2209. doi:10.1093/bioinformatics/btw205. PMID: 27153641

Illingworth CJR, Roy, Beale, Tutill, Williams, Breuer (2017) On the effective depth of viral sequence data. Virus Evolution 3:vex030. doi:10.1093/ve/vex030. PMID: 29250429

Illingworth CJR (2020a) FluBData. GitHub. 6510fb7. https://github.com/cjri/FluBData

Illingworth CJR (2020b) SAMFIRE. GitHub. 1527ed0. https://github.com/cjri/samfire

Imai, Yamashita, Sakai-Tagawa, Iwatsuki-Horimoto, Kiso, Murakami, Yasuhara, Takada, Ito, Nakajima, Takahashi, Lopes TJS, Dutta, Khan, Kriti, van Bakel, Tokita, Hagiwara, Izumida, Kuroki, Nishino, Wada, Koga, Adachi, Jubishi, Hasegawa, Kawaoka (2020) Influenza A variants with reduced susceptibility to baloxavir isolated from Japanese patients are fit and transmit through respiratory droplets. Nature Microbiology 5:27-33. doi:10.1038/s41564-019-0609-0. PMID: 31768027

Jackson, Barclay, Zürcher (2005) Characterization of recombinant influenza B viruses with key neuraminidase inhibitor resistance mutations. Journal of Antimicrobial Chemotherapy 55:162-169. doi:10.1093/jac/dkh528. PMID: 15665027

Khatri, Burt (2019) Robust estimation of recent effective population size from number of independent origins in soft sweeps. Molecular Biology and Evolution 36:2040-2052. doi:10.1093/molbev/msz081. PMID: 30968124

Kimura, Crow (1963) The measurement of effective population number. Evolution 17:279-288.

Lakdawala, Jayaraman, Halpin, Lamirande, Shih, Stockwell, Lin, Simenauer, Hanson, Vogel, Paskel, Minai, Moore, Orandle, Das, Wentworth, Sasisekharan, Subbarao (2015) The soft palate is an important site of adaptation for transmissible influenza viruses. Nature 526:122-125. doi:10.1038/nature15379. PMID: 26416728

Laporte, Charlesworth (2002) Effective population size and population subdivision in demographically structured populations. Genetics 162:501-519.

Lumby, Nene, Illingworth CJR (2018) A novel framework for inferring parameters of transmission from viral sequence data. PLOS Genetics 14:e1007718. doi:10.1371/journal.pgen.1007718. PMID: 30325921

Lumby, Zhao, Oporto, Best, Tutill, Shah, Veys, Williams, Worth, Illingworth CJR, Breuer (2020) Favipiravir and zanamivir cleared infection with influenza B in a severely immunocompromised child. Clinical Infectious Diseases 9:ciaa023. doi:10.1093/cid/ciaa023

McCrone, Woods, Martin, Malosh, Monto, Lauring (2018) Stochastic processes constrain the within and between host evolution of influenza virus. eLife 7:e35962. doi:10.7554/eLife.35962. PMID: 29683424

McCrone, Lauring (2016) Measurements of intrahost viral diversity are extremely sensitive to systematic errors in variant calling. Journal of Virology 90:6884-6895. doi:10.1128/JVI.00667-16. PMID: 27194763

Miao, Hollenbaugh, Zand, Holden-Wiltse, Mosmann, Perelson, Topham (2010) Quantifying the early immune response and adaptive immune response kinetics in mice infected with influenza A virus. Journal of Virology 84:6687-6698. doi:10.1128/JVI.00266-10. PMID: 20410284

Morris (2020) Asynchrony between virus diversity and antibody selection limits influenza virus evolution. bioRxiv. doi:10.1101/2020.04.27.064915

Nobusawa, Sato (2006) Comparison of the mutation rates of human influenza A and B viruses. Journal of Virology 80:3675-3678. doi:10.1128/JVI.80.7.3675-3678.2006

Pennings, Kryazhimskiy, Wakeley (2014) Loss and recovery of genetic diversity in adapting populations of HIV. PLOS Genetics 10:e1004000. doi:10.1371/journal.pgen.1004000. PMID: 24465214

Poon, Song, Rosenfeld, Lin, Rogers, Zhou, Sebra, Halpin, Guan, Twaddle, DePasse, Stockwell, Wentworth, Holmes, Greenbaum, Peiris, Cowling, Ghedin (2016) Quantifying influenza virus diversity and transmission in humans. Nature Genetics 48:195-200. doi:10.1038/ng.3479. PMID: 26727660

Rambaut, Pybus, Nelson, Viboud, Taubenberger, Holmes (2008) The genomic and epidemiological dynamics of human influenza A virus. Nature 453:615-619. doi:10.1038/nature06945. PMID: 18418375

Richard, Herfst, Tao, Jacobs, Lowen (2018) Influenza A virus reassortment is limited by anatomical compartmentalization following coinfection via distinct routes. Journal of Virology 92:e02063-17. doi:10.1128/JVI.02063-17. PMID: 29212934

Rogers, Song, Sebra, Greenbaum, Hamelin M-E, Fitch, Twaddle, Cui, Holmes, Boivin, Ghedin (2015) Intrahost dynamics of antiviral resistance in influenza A virus reflect complex patterns of segment linkage, reassortment, and natural selection. mBio 6:e02464-14. doi:10.1128/mBio.02464-14

Rousseau, Moury, Mailleret, Senoussi, Palloix, Simon, Valière, Grognard, Fabre (2017) Estimating virus effective population size and selection without neutral markers. PLOS Pathogens 13:e1006702. doi:10.1371/journal.ppat.1006702. PMID: 29155894

Rouzine, Rodrigo, Coffin (2001) Transition between stochastic evolution and deterministic evolution in the presence of selection: general theory and application to virology. Microbiology and Molecular Biology Reviews 65:151-185. doi:10.1128/MMBR.65.1.151-185.2001. PMID: 11238990

Rouzine, Coffin, Weinberger (2014) Fifteen years later: hard and soft selection sweeps confirm a large population number for HIV in vivo. PLOS Genetics 10:e1004179. doi:10.1371/journal.pgen.1004179. PMID: 24586204

Snydman (2006) Oseltamivir resistance during treatment of influenza A (H5N1) Infection. Yearbook of Medicine 2006:70-71. doi:10.1016/S0084-3873(08)70358-4

Sobel Leonard

McClain

Smith

Wentworth

Halpin

Lin

Ransier

Stockwell

Das

Gilbert

Lambkin-Williams

Ginsburg

Woods

Koelle

Illingworth

2017a

The effective rate of influenza reassortment is limited during human infection

PLOS Pathogens13

e1006203

10.1371/journal.ppat.1006203

28170438

Sobel Leonard

Weissman

Greenbaum

Ghedin

Koelle

2017b

Transmission bottleneck size estimation from pathogen Deep-Sequencing data, with an application to human influenza A virus

Journal of Virology 91

e00171-17

10.1128/JVI.00171-17

28468874

STOP-HCV Consortium

Thomson

Badhan

Christiansen

Adamson

Ansari

Bibby

Breuer

Brown

Bowden

Bryant

Bonsall

Da Silva Filipe

Hinds

Hudson

Klenerman

Lythgow

Mbisa

McLauchlan

Myers

Piazza

Roy

Trebes

Sreenu

Witteveldt

Barnes

Simmonds

2016

Comparison of Next-Generation sequencing technologies for comprehensive assessment of Full-Length hepatitis C viral genomes

Journal of Clinical Microbiology 54:2470-2484

10.1128/JCM.00330-16

27385709

Strelkowa

Lässig

2012

Clonal interference in the evolution of influenza

Genetics 192:671-682

10.1534/genetics.112.143396

22851649

Terhorst

Schlötterer

Song

2015

Multi-locus analysis of genomic time series data from experimental evolution

PLOS Genetics 11

e1005069

10.1371/journal.pgen.1005069

25849855

Valesano

2020

Influenza B viruses exhibit lower Within-Host diversity than influenza A viruses in human hosts

Journal of Virology 94

791038

10.1101/791038

Wang

Santiago

Caballero

2016

Prediction and estimation of effective population size

Heredity 117:193-206

10.1038/hdy.2016.43

27353047

Whitlock

Barton

1997

The effective size of a subdivided population

Genetics 146:427-441

9136031

Wishaupt

Ploeg

Smeets

Groot

Versteegh

Hartwig

2017

Pitfalls in interpretation of CT-values of RT-PCR in children with acute respiratory tract infections

Journal of Clinical Virology 90:1-6

10.1016/j.jcv.2017.02.010

28259567

Wright

1938

Size of population and breeding structure in relation to evolution

Science 87:430-431

Xue

Stevens-Ayers

Campbell

Englund

Pergam

Boeckh

Bloom

2017

Parallel evolution of influenza across multiple spatiotemporal scales

eLife 6

e26875

10.7554/eLife.26875

28653624

Zhao

Abbasi

Illingworth

CJR

2019

Mutational load causes stochastic evolutionary outcomes in acute RNA viral infection

Virus Evolution 5

vez008

10.1093/ve/vez008

31024738

10.7554/eLife.56915.sa1

Decision letter

Nourmohammad

Armita

Reviewing Editor, University of Washington, United States

Bazykin

Georgii A

Reviewer, Institute for Information Transmission Problems (Kharkevich Institute), Russian Federation

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Acceptance summary:

The manuscript assesses the intra-host effective population size of influenza based on longitudinal deep sequencing data from a chronic influenza B infection. Using principled modeling and statistical approaches, the authors show that the short length of a typical influenza infection is the key limiting factor upon selection at the within-host level. The topic is important, as it sheds light on the interplay between the two scales of selection, within-host and between-host, in shaping the evolution of influenza virus.

Decision letter after peer review:

Thank you for submitting your article "A large effective population size for within-host influenza virus infection" for consideration by eLife. Your article has been reviewed by three peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Aleksandra Walczak as the Senior Editor. The following individual involved in review of your submission has agreed to reveal their identity: Georgii A. Bazykin (Reviewer #2).

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

We would like to draw your attention to changes in our revision policy that we have made in response to COVID-19 (https://elifesciences.org/articles/57162). Specifically, we are asking editors to accept without delay manuscripts, like yours, that they judge can stand as eLife papers without additional data, even if they feel that they would make the manuscript stronger. Thus the revisions requested below only address clarity and presentation.

Summary:

The manuscript presents a study on within-host population genetics of influenza virus and in particular, inference of effective population size during chronic infection in immunocompromised patients. The topic is important as it explores the interplay between the two scales of selection: within-host and between-host selection that shape the evolution of influenza. Based on the analysis of sequence polymorphism, authors infer a relatively large effective population size ~10⁷ during chronic infection, in contrast to previously inferred values of ~10² or less during transmission and in acute infections. All of the reviewers agree that the findings in this manuscript are interesting and a large effective population would have significant implications for efficacy of selection during within-host evolution of influenza. However, there are still some concerns regarding methodology, interpretation and presentation of the results which we would like to see addressed.

Essential revisions:

1) Comparison between chronic and acute infections:

The authors analyzed data from chronic influenza infections and concluded that the effective population size of the virus is high, including during acute infections. For instance, the authors argue that "the observed lack of within-host variation in typical cases of influenza can be explained by the short period of infection; the stochastic effects of genetic drift do not limit the impact of positive selection". It is however not evident that the authors' estimates of effective population size from chronic infections apply to acute infections given the exponential increase and decrease of viral load that dominate the course of acute infections. In fact, it's not clear that effective population size is even a very useful concept in this case.

Also, McCrone et al., 2018, and Xue and Bloom, 2020, have both shown that within-host variation in acute infections is dominated by non-synonymous mutations, and Xue and Bloom, 2020, also document stop-codon mutations within acute infections that are rarely found at appreciable frequencies in chronic infections. These observations suggest that selection is inefficient within hosts in acute infections, contrary to the authors' claims.

Moreover, McCrone et al. see radical changes in variant frequencies over the course of a few days (Figure 2E in that work), but lineages in chronic infections (this work) persist for many months. If the authors think that N_e is comparable between acute and chronic infections, how do they explain the lack of diversity observed in acute infections? One way to explain this is to maintain a high N_e but with a strong transmission bottleneck to impose stochasticity. But as pointed out above, "N_e" is really not a well-defined quantity in this case. Alternatively, could the difference imply a lower census size in acute infections, and if so, is this consistent with differences in viral load? This issue is important in view of the proposed relevance of high N_e for long-term influenza evolution (e.g., last phrase of the Abstract and the last phrase of the Introduction).

Overall, the authors should acknowledge the differences between acute and chronic infections, and discuss their estimates in light of the previous observations. Moreover, it may also be helpful to revise the title to indicate that the manuscript focuses on chronic infections.

2) High N_e is inferred from small drift and a small rate of "substitutions" (which under the authors' terminology also account for minor changes in allele frequencies). In other words, the authors are inferring a large N_e based on the longer-term coexistence of multiple lineages within a host. Therefore, it would be important that the manuscript also discusses alternative explanations that could lead to such patterns of polymorphism. Importantly, as N_e in the manuscript is inferred from a Wright-Fisher (WF) model, violations in the underlying assumptions of the model can bias the results. For example, one can imagine that demographic effects like population structure could be responsible for long-term coexistence and survival of lineages, e.g., if each of the samples represents a mixture of persistent subpopulations? The authors seem to suggest this by analyzing clades A and B separately, Results and Discussion, second paragraph. Alternatively, could balancing selection in the host be responsible for maintaining this polymorphism (seems unlikely, but still a formal possibility)? A discussion and/or analysis of such alternative scenarios would be useful in assessing the robustness of the manuscript's findings.

3) Robustness of the analysis and proposed statistics:

a) It would be useful to have a clearer sense of the sensitivity of N_e to the cutoffs used. While a lot of care has gone into the choice, some diagrams showing the sensitivity of N_e to cutoff choice would better demonstrate the degree to which it is a function of low frequency variants in a straightforward way.

b) To estimate how N_e affects changes in allele frequencies, the authors simulate a single generation of Wright-Fisher evolution using initial allele frequencies from a randomly selected sample from the infection. As the equation in the subsection “Summary” indicates, populations with high-frequency alleles will experience larger changes in allele frequency at a given effective population size, so the initial distribution of allele frequencies from this randomly chosen sample can have a major effect on the expected change in allele frequencies. The authors show in Figure 2—figure supplement 1 that mutations can reach frequencies of 20-30% in neuraminidase, and in the influenza A patients analyzed in Figure 2—figure supplement 3, many mutations reach these and even higher frequencies, particularly at later points in the infection. The authors should run their Wright-Fisher simulations with different initial allele frequencies to evaluate how this choice of allele frequencies may affect estimates of effective population size.

c) The authors design statistic D to assess their estimation of N_e. This statistic is a sum of changes in variant frequencies across sites (subsection “Calculation of evolutionary rates”), which is then compared between data and Wright-Fisher simulations for different N_e values. The authors seem to suggest that D should be more robust to noise (subsection “Summary”), without providing any evidence. In particular, the authors should clearly state how the assumptions they made about recombination structure in WF simulation could impact the statistics D and the interpretation of the inferred N_e. From the manuscript it is not clear whether WF simulations are done at the site-wise, segment-wise, or genome-wise level, which would impact the correlation between changes in variant frequencies. For example, simulations done with high (free) recombination would expect a lower variance D compared to the case with strong linkage (data), for the same N_e. These points should be better clarified.

4) In Figure 1A, it is clear (and the authors also mention) that the patient's viral load drops to undetectable levels for over a month of the infection, and viral load also varies substantially while the patient is continually infected. Effective population size and census population size are not always directly related, but the authors should discuss how changing population sizes affect their estimate of effective population size and whether a single effective population size is adequate to represent the infection.

5) The authors calculate sequence distance between every pair of sequenced timepoints to reduce the influence of noise from sequencing error, but as a result, the points in Figure 2A are non-independent and may contribute to a tighter confidence interval around the evolutionary rate than is realistic. In particular, changes in variant frequencies that take place during the middle of the infection will be overcounted in these pairs and will disproportionately influence the overall estimate of evolutionary rates. When the authors estimate the evolutionary distance between consecutive timepoints and divide by the number of days between them, how well does the estimate correspond to the estimates in Figure 2? What is the variance in these estimates?

6) The regression performed in Figure 2A, C, and analogous figures may be especially influenced by the few points at the right end of the distribution, which represent evolutionary distances between points spaced further apart in time. How robust is the estimate of evolutionary rate to removal of these points, or by calculation of evolutionary rate as suggested in comment 4?

7) The authors chose to infer effective population size using variants and haplotypes on the neuraminidase and hemagglutinin segments. This is an odd choice since these regions tend to experience the strongest selection, which can strongly influence the estimates of effective population size. Selection can act on linked haplotypes across the genome in some cases, but have the authors tested to see if these results hold for other gene segments as well?

8) Why are the effective population size estimates for the clade B samples calculated separately from the clade A samples? It's not evident from the SAMFIRE inference of haplotypes that clades A and B constitute separate subpopulations; it seems that they could be distinct genotypes in a well-mixed population as well, as might result from a coinfection.

9) The authors assume the generation time of 10 hours per generation for influenza B. However, if generations are longer in immunocompromised individuals, the analysis would lead to an overestimation of N_e. Given that the main result in this manuscript is that N_e is high, this possibility should at least be discussed.

10.7554/eLife.56915.sa2

Author response

Essential revisions:

1) Comparison between chronic and acute infections:

We acknowledge that it is important to relate our result, derived from an unusual case of infection, to more regular cases of influenza in humans. We first note that what is meant in our case by the effective population size is that statistic as it relates to an established influenza infection; our data do not describe the initial founding and growth of the viral infection.

Our primary point of reference to regular influenza infection comes via measurements of CT score relating to viral infection. Our reference on this suggests about 10 fewer units of CT, or close to a 1000-fold numerical drop in census population size, in non-hospitalised, as opposed to hospitalised childhood cases. We now include the very rough calculation that this would suggest an N_e of around 10⁴ for such cases, cautioning that this is for an established infection, after the initial period of expansion from the transmission bottleneck.
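The arithmetic behind this rough figure can be sketched as follows. The sketch assumes idealised PCR in which template quantity doubles with each cycle, and assumes that N_e scales in proportion to census size; both are simplifications made here for illustration, and the function name is hypothetical.

```python
def fold_change_from_ct(delta_ct, efficiency=2.0):
    """Fold change in template quantity implied by a difference in CT.

    Assumption: the template multiplies by `efficiency` every cycle
    (2.0 corresponds to perfect doubling).
    """
    return efficiency ** delta_ct

# A difference of about 10 CT units corresponds to roughly a 1000-fold change.
fold = fold_change_from_ct(10)

# Headline estimate for the chronic infection, taken from the abstract.
ne_chronic = 2.5e7

# Assumption: N_e falls in proportion to census size.
ne_scaled = ne_chronic / fold
print(f"fold change: {fold:.0f}, scaled N_e: {ne_scaled:.2e}")
```

Under these assumptions the scaled value is of order 10⁴, matching the very rough figure quoted above.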

We believe that our consideration of an established population matches that of other studies of data from within-host influenza infection; in order for data to be collected from such infections, the viral population must be of some minimal census size. Noting that the threshold frequency at which the effect of selection outweighs that of drift is 1/(N_e s), where s is the selection coefficient per generation, we believe that within the window for which data can be collected, selection of 1% or greater per generation will dominate drift at an allele frequency of 1% or more. In this sense genetic drift does not limit positive selection.
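To make the threshold concrete, the short sketch below evaluates 1/(N_e s) for the rough N_e of 10⁴ mentioned above and a selection coefficient of 1% per generation. The function name is hypothetical and the calculation is only a worked instance of the stated formula.

```python
def drift_selection_threshold(n_e, s):
    """Allele frequency above which selection outweighs drift: 1 / (N_e * s)."""
    return 1.0 / (n_e * s)

# Rough N_e for an established, non-hospitalised infection and s = 1% per generation.
threshold = drift_selection_threshold(1e4, 0.01)
print(f"threshold frequency: {threshold:.2%}")

# With the chronic-infection estimate the threshold is far lower still.
threshold_chronic = drift_selection_threshold(2.5e7, 0.01)
```

With N_e of 10⁴ and s of 0.01 the threshold is an allele frequency of 1%, which is the figure given in the text.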

On previous findings, we do not completely recognise the statement that within-host variation in acute infections is dominated by non-synonymous mutations. If a simple count of variants is made, the majority will likely be non-synonymous; however, this reflects the fact that the large majority of possible variants are non-synonymous for at least one viral protein. McCrone et al. state that their data, ‘suggest significant purifying selection within hosts’ while Xue and Bloom state that ‘synonymous mutations accumulate about twice as quickly as nonsynonymous mutations within hosts’. Synonymous mutations are relatively more common than nonsynonymous mutations at low frequencies, consistent with purifying selection.

We are not fully convinced that stop mutations are lethal in the traditional sense due to the nature of the influenza virus; the genome encapsulated within a virus (i.e. encoded in the RNA within a set of viral proteins) is not necessarily the same genome that was translated to produce the proteins. As such the observation of stop mutations at very low frequencies is not entirely inconsistent with efficient purifying selection. We are not aware of a great deal of work looking at stop mutations in chronic influenza infection, however the presence of purifying selection would again explain such a lack.

While we greatly admire the work of McCrone et al., we are not convinced that the within-host changes in allele frequency that they observe are caused purely by genetic drift. Selection, population structure, and rare sequencing error could all contribute to the changes observed, and the data described in that paper, with two samples collected from each individual, do not allow for discrimination between drift and these other factors. In our case, where we have multiple samples from a host, we observe both large differences between individual samples (in common with McCrone) and an underlying pattern that suggests a large within-host population size. Previous data describing within-host evolution do not contradict our result.

We have revised the title to, ‘A large effective population size for established influenza infection’. This recognises that our inference neglects the initial phase of viral growth, which may arise from a single particle. While we cannot with our method directly evaluate N_e for acute infections, we believe that an argument based on CT values carries some weight when applied to these cases.

For additional clarity we note that our rate is equivalent to a number of substitutions per day.

Rather than relying on the coexistence of multiple lineages, we infer N_e from the rate of change within the viral population (primarily within clade A). Multiple lineages are not required in the sense that there could be a fully well-mixed population and we could still infer N_e using our method. Explicitly, we derive a rate of change in the viral population, measured across multiple samples, and identify a Wright-Fisher population which under genetic drift matches this rate of change.
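The idea of matching a Wright-Fisher population to an observed rate of change can be sketched in a few lines. This is a deliberately simplified illustration and not the genome-wide simulation used in the study: it treats sites as independent, uses a Gaussian approximation to binomial sampling, and all input values and function names are hypothetical.

```python
import math
import random

def expected_drift_distance(freqs, n_e, replicates=2000, seed=0):
    """Mean summed |change in allele frequency| over one generation of drift.

    Assumptions: sites are independent, and binomial sampling is replaced by
    its Gaussian approximation with variance q * (1 - q) / n_e.
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(replicates):
        for q in freqs:
            sd = math.sqrt(q * (1.0 - q) / n_e)
            total += abs(rng.gauss(0.0, sd))
    return total / replicates

def match_ne(freqs, observed, candidates):
    """Candidate n_e whose simulated drift distance is closest to `observed`."""
    return min(candidates,
               key=lambda n: abs(expected_drift_distance(freqs, n) - observed))

# Hypothetical starting allele frequencies and candidate population sizes.
freqs = [0.02, 0.05, 0.10, 0.30]
candidates = [10 ** k for k in range(3, 9)]

# Stand-in for an observed per-generation rate: generated here from a known n_e.
target = expected_drift_distance(freqs, 10 ** 6, seed=1)
best = match_ne(freqs, target, candidates)
print(best)
```

Because the expected distance shrinks as the inverse square root of the population size, a slower observed rate of change maps onto a larger matched N_e.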

We have added a few words to clarify the cladal structure of the population. We believe that the infection is founded by a single viral population (as opposed to co-infection) and that subsequently there is a branching event, so that clades A and B become spatially separated in the host and evolve independently of one another. Our guess is that the less-frequently observed clade B includes a smaller number of viruses, and so evolves faster under genetic drift.

We note two possible deviations from our Wright-Fisher model. Firstly, population structure going beyond the simple cladal structure we observe would lead to a reduction in the value of N_e; we cite Whitlock and Barton on this point. Such population structure would alter the value we derive i.e. it will decrease N_e relative to a well-mixed population, leading to an increase in the rate of change of the population that our model will detect. Regarding population structure, we note that a non-well-mixed population could lead to non-representative sampling of the population and thereby increased distances between individual samples; this effect is included in our ‘error’ terminology in the method.

Secondly, we note the potential for selection to shape the population, noting the emergence of zanamivir resistance. Such selection would not be accounted for in our model i.e. to the extent that it is present it would increase the rate of change of the population that we will detect, but will attribute to a lower N_e. In this sense the presence of positive selection would lead us to underestimate N_e. Purifying selection is difficult to model; within the Wright-Fisher framework all selection is identical in leading to changes in allele frequencies with time. This has the consequence that as N_e becomes high the change in the population does not tend to zero. We note that there will be effects other than genetic drift affecting the population, and stick to our definition that our effective population size is the size at which an idealised population evolving under drift matches the behaviour of our data.

3) Robustness of the analysis and proposed statistics:

We have made two significant cutoffs in our method. The first is to remove what we believe to be false positive variant calls in the sequence data, while the second is to impose a hard cut of 0.1% allele frequency when making our calculation of distance. To evaluate these we have rerun our calculations in a way that removes each of these in turn; we find that the inferred N_e is not greatly changed by either of these. We have added Supplementary file 1 which contains inferences for calculations run with parameters other than the default parameters.

The reviewers are correct that the allele frequencies used to initiate the Wright-Fisher model may affect the inferred effective population size. When we calculated replicate simulated populations we accounted for this; in each of the replicate simulations a random sample from the population was chosen to provide the allele frequencies for each segment of the simulated population. The uncertainty bars in our calculations therefore incorporate the uncertainty intrinsic to the initial choice of allele frequencies.

We have now incorporated further explanation into the Materials and methods, describing in a more formal manner how our statistic works and why it is more robust than a simple distance metric based upon pairs of samples from a population. We have clarified that our WF simulations were done at the genome-wise level. Based on prior evidence from human infection, simulations assumed an absence of intra-segment recombination or of reassortment between segments; this is now more clearly stated.

The calculation we make for clade A was performed over the samples in this clade used up to the point at which favipiravir was first used; this is shown by the green box in Figure 1A. Our belief is that CT score is somewhat noisy, sometimes providing a better measurement of the amount of viral material on a swab than of the census population size. Previous modelling of these data suggests a smooth, roughly 8-fold decline in viral load during this period (Lumby et al., 2020). We believe that the pre-favipiravir set of samples is the most appropriate one from which to derive a headline figure for effective population size, the subsequent clinical intervention being an unusual event.

We have added a note that our estimate for clade B spanned the interval in time with the bottleneck; this may be a reason for its lower value. We also note that our estimate is of a mean effective population size.

We have provided further explanation of our method. Our basic rationale is that individual samples from the population are considerably affected by error, such that the error in each sample is larger than the true evolutionary distances undergone by the population. The raw distances between samples in consecutive timepoints are shown in the left-most points in Figure 2A; the mean of these values is 42.6 (standard deviation 8.5), with essentially no correlation between these values and the number of days which separate the samples (p-value 0.97 from a correlation test performed in Mathematica 11). If we persist with this calculation we obtain a mean change in the population of just over 9 nucleotides per day, greater than the total inferred change of close to 8 nucleotides for clade A across five months of evolution. We believe that noise in the individual samples greatly outweighs the genuine signal of evolutionary change in the population in such a way that the simple comparison of pairwise samples does not produce an accurate result.

Assuming a constant underlying effective population size (or failing that, calculating some kind of mean), our regression allows us to infer a rate of evolution even in the presence of considerable noise. We acknowledge that changes in the population during the middle of infection are over-represented but do not have a solution to this; the use of multiple samples is intrinsic to our approach.
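The contrast between the regression and the naive consecutive-sample calculation can be illustrated with synthetic data. In the sketch below every number is hypothetical, the noise on each pairwise distance is drawn independently (a simplification, since real pairs share samples), and the helper names are our own.

```python
import itertools
import random

def ols_fit(xs, ys):
    """Ordinary least-squares slope and intercept of ys on xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

# Hypothetical sampling days, evolutionary rate (per day) and noise baseline.
rng = random.Random(0)
days = [0, 10, 25, 40, 60, 90, 120, 150]
true_rate = 0.05
noise_floor = 40.0

# Pairwise distance = constant error term + true change + random scatter.
distance = {}
for t1, t2 in itertools.combinations(days, 2):
    distance[(t1, t2)] = noise_floor + true_rate * (t2 - t1) + rng.gauss(0.0, 1.0)

# Regression over all pairs: the slope estimates the rate, the intercept the error.
gaps = [t2 - t1 for (t1, t2) in distance]
slope, intercept = ols_fit(gaps, list(distance.values()))

# Naive estimate: distance between consecutive samples divided by days elapsed.
consecutive = list(zip(days, days[1:]))
naive_rate = sum(distance[p] / (p[1] - p[0]) for p in consecutive) / len(consecutive)
print(slope, intercept, naive_rate)
```

In this toy setting the regression slope stays close to the underlying rate because the constant error is absorbed by the intercept, whereas the naive estimate divides that error by short time gaps and is inflated many-fold.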

We have checked the slope of the regression by removing points from the beginning and the end of the infection. In Figure 2A, removing between zero and four time points from each end of the data in any combination gives 25 regression coefficients; all of these fall within the 97.5% confidence interval that we report for our original calculation. [Note: removing a time point removes all distances associated with that point, and so removes multiple points from the figure.] In Figure 2C there are data from only four time points; here removing either the first or the last leads to regression coefficients within the original confidence interval. We believe that the use of consecutive samples to assess effective population size gives a highly misleading result due to noise and confounding factors in the data greatly exaggerating the real rate of change of the population. We therefore omit this result from the main text, though we note that performing this calculation for clade A gives an effective population size of approximately 800.
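The trimming check described above can be sketched as a loop over the 5 × 5 combinations of points removed from each end. The time points and the noiseless distance function below are hypothetical stand-ins for the real data, and the helper names are illustrative only.

```python
import itertools

def ols_slope(xs, ys):
    """Ordinary least-squares slope of ys on xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx

def trimmed_slopes(days, distance, max_trim=4):
    """Regression slope after removing 0..max_trim time points from each end.

    Removing a time point removes every pairwise distance that involves it.
    """
    slopes = {}
    for head, tail in itertools.product(range(max_trim + 1), repeat=2):
        kept = days[head:len(days) - tail]
        pairs = list(itertools.combinations(kept, 2))
        gaps = [b - a for a, b in pairs]
        dists = [distance(a, b) for a, b in pairs]
        slopes[(head, tail)] = ols_slope(gaps, dists)
    return slopes

# Hypothetical sampling days and a noiseless toy distance, for illustration only.
days = list(range(0, 140, 10))

def toy_distance(a, b):
    return 40.0 + 0.05 * (b - a)

slopes = trimmed_slopes(days, toy_distance)
print(len(slopes))
```

With real data each of the 25 slopes would then be compared against the confidence interval of the untrimmed fit.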

This is a misunderstanding of our approach. We illustrate the cladal structure of the population using haplotype reconstructions calculated using sequence data for neuraminidase and haemagglutinin. These segments were chosen as they had slightly higher levels of genetic diversity, giving the clearest illustrations of a pattern that was visible across all of the viral segments. However, the calculation of effective population size was performed genome-wide, using data from all viral segments. We have amended the text to better highlight the illustrative nature of the haplotype reconstructions we present.

We believe that it is unlikely that these clades arise from a well-mixed population. The samples we have are deep-sequenced, generally to in excess of 2000x coverage. However, considerable differences are observed between these samples; in a well-mixed population, the samples might exhibit a pattern of evolution, but evolution would follow a continuous pattern of change rather than identifiably (in the haplotype reconstruction) being from one subpopulation or another. Our ‘clade B’ samples describe the 18th, 40th, and 41st samples from the host. The 18th sample is evolutionarily intermediate between everything in ‘clade A’ and the final two samples. This leads us to the belief that the two clades begin as a single transmitted population (i.e. not from a co-infection), that clades A and B are very largely spatially separate, and that clade B evolves away from clade A over time, potentially as a result of genetic drift. We have added further detail to the text describing the observed relationship between samples.

We are not aware of a biological reason why the generation time for influenza would be different for immunocompromised individuals, but acknowledge that this parameter might contain some uncertainty. We have explored the effect of changes in the generation time in Supplementary file 1.