Next Article in Journal
Population Variability Generated during Rescue Process and Passaging of Recombinant Mumps Viruses
Next Article in Special Issue
Differential Transcriptomic Response of Rainbow Trout to Infection with Two Strains of IPNV
Previous Article in Journal
A Novel High-Throughput Nanopore-Sequencing-Based Strategy for Rapid and Automated S-Protein Typing of SARS-CoV-2 Variants
Previous Article in Special Issue
No Evidence of the Vertical Transmission of Non-Virulent Infectious Salmon Anaemia Virus (ISAV-HPR0) in Farmed Atlantic Salmon
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genomic Epidemiology of Salmonid Alphavirus in Norwegian Aquaculture Reveals Recent Subtype-2 Transmission Dynamics and Novel Subtype-3 Lineages

by
Daniel J. Macqueen
1,*,
Oliver Eve
1,†,
Manu Kumar Gundappa
1,†,
Rose Ruiz Daniels
1,†,
Michael D. Gallagher
2,
Svein Alexandersen
3 and
Marius Karlsen
3
1
The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Edinburgh EH25 9RG, UK
2
Bionano Genomics, 9540 Towne Centre Dr #100, San Diego, CA 92121, USA
3
Pharmaq AS, 0275 Oslo, Norway
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Submission received: 27 November 2021 / Revised: 16 December 2021 / Accepted: 17 December 2021 / Published: 20 December 2021
(This article belongs to the Special Issue Viruses Affecting Salmonids)

Abstract

:
Viral disease poses a major barrier to sustainable aquaculture, with outbreaks causing large economic losses and growing concerns for fish welfare. Genomic epidemiology can support disease control by providing rapid inferences on viral evolution and disease transmission. In this study, genomic epidemiology was used to investigate salmonid alphavirus (SAV), the causative agent of pancreas disease (PD) in Atlantic salmon. Our aim was to reconstruct SAV subtype-2 (SAV2) diversity and transmission dynamics in recent Norwegian aquaculture, including the origin of SAV2 in regions where this subtype is not tolerated under current legislation. Using nanopore sequencing, we captured ~90% of the SAV2 genome for n = 68 field isolates from 10 aquaculture production regions sampled between 2018 and 2020. Using time-calibrated phylogenetics, we infer that, following its introduction to Norway around 2010, SAV2 split into two clades (SAV2a and 2b) around 2013. While co-present at the same sites near the boundary of Møre og Romsdal and Trøndelag, SAV2a and 2b were generally detected in non-overlapping locations at more Southern and Northern latitudes, respectively. We provide evidence for recent SAV2 transmission over large distances, revealing a strong connection between Møre og Romsdal and SAV2 detected in 2019/20 in Rogaland. We also demonstrate separate introductions of SAV2a and 2b outside the SAV2 zone in Sognefjorden (Vestland), connected to samples from Møre og Romsdal and Trøndelag, respectively, and a likely 100 km Northward transmission of SAV2b within Trøndelag. Finally, we recovered genomes of SAV2a and SAV3 co-infecting single fish in Rogaland, involving novel SAV3 lineages that diverged from previously characterized strains >25 years ago. Overall, this study demonstrates useful applications of genomic epidemiology for tracking viral disease spread in aquaculture.

1. Introduction

Modern genomic tools have revolutionized how viral pathogens are studied and monitored in human and animal populations [1]. Viruses accumulate mutations rapidly and sequencing can be used to trace how genetic changes are distributed among infected individuals; when sampled through time and space, such data can be applied to make epidemiological inferences on disease transmission. Fuelled by recent advances in second and third-generation sequencing, including portable devices such as the MinION platform (Oxford Nanopore Technologies, ONT), genomic sequencing and surveillance programs have gained prominence as tools for monitoring human viral diseases [2,3]. Such efforts are best illustrated by global-scale initiatives to sequence hundreds of thousands of SARS-CoV-2 genomes in human populations, which have been instrumental in characterizing genetic diversity and COVID-19 transmission routes [4,5].
The global aquaculture sector is expanding every year to meet the growing nutritional demands of the rapidly growing human population [6]. It is well recognized that disease outbreaks are one of the main barriers to the sustainable expansion of aquaculture [7]. Viral pathogens pose a particular challenge due to limited effective treatment options, coupled with the fact that outbreaks are highly damaging financially and in terms of animal welfare. A range of approaches are available to prevent and control viral disease in aquaculture [8], including selective breeding [9] and vaccination for finfish [10]. While it is recognized that genomic epidemiology can deliver much to support disease control efforts in aquaculture [11] and other sectors [12], large-scale genomic sequencing to monitor aquaculture pathogens is not a widespread practice, even for high-value species. Nonetheless, several studies highlight the value of genomic epidemiology for linking viral genetic diversity to disease outbreaks and identifying viral transmission scenarios [13,14,15,16,17].
Atlantic salmon (Salmo salar) is globally the highest value aquaculture species [9,18], with the Norwegian sector producing 1.3 million tonnes across almost 1000 sites in 2019, valued at 68 billion NOK (5.6 billion GBP) [19]. Among several viral pathogens infecting farmed salmonids [20], SAV was first recognized as the agent of Atlantic salmon PD in the 1990s [21,22,23,24] and shortly after sleeping disease (SD) in rainbow trout (Oncorhynchus mykiss) [25]. PD and SD cause related pathologies characterized by damage to several organs, including pancreatic necrosis and striated muscle inflammation, associated with slow growth, mortality, and in surviving fish, poor flesh quality [26]. SAV is a single-stranded positive-sense RNA virus with a genome size of ~12,000 bp, belonging to the genus Alphavirus within Togiviridae [27]. It comprises seven defined phylogenetic groups (hereafter: subtypes; SAV1-SAV7) [28,29]. Among these, SAV1-6 infect Atlantic salmon, and were introduced independently to aquaculture from wild reservoirs, followed by subsequent spread within the industry [30]. SAV is endemic in European aquaculture, with the various subtypes showing partly overlapping distribution in different countries [27]. SAV has also been detected in wild reservoir species, including flatfishes and ballan wrasse [15,29,31].
PD has been present in Norway since the 1980s and is thought to have been exclusively caused by SAV3 until SAV2 was detected during an outbreak in 2010 [32,33]. SAV2 was likely initially transmitted to mid-Norway from Scotland, where SAV2 is a common PD-causing subtype [34]. This may have occurred via industry well boat activity, though transmission from wild reservoir species cannot yet be ruled out [34]. SAV2 and SAV3 are defined as separate endemics in Norway, tolerated in largely non-overlapping regions within a wider endemic PD zone, stretching from the southern end of Norway to Flatanger in Trøndelag [35] (Figure 1). Detection of PD outside the endemic zone triggers immediate disease control actions [35]. Within the endemic zone, SAV3 is predominantly detected in Hordaland and Rogaland, while SAV2 dominates in Møre og Romsdal and Trøndelag [36]. With the aim of restricting the spread of SAV2 and SAV3, the endemic PD zone is currently split at Hustadvika (Møre og Romsdal), with SAV2 and SAV3 tolerated above and below this boundary, respectively [35]. SAV2 has recently been detected outside the endemic PD zone and within the SAV3 endemic zone [36], but the origin of these outbreaks is unestablished.
The primary aim of this study was to reveal SAV2 genetic diversity and PD transmission dynamics in recent Norwegian aquaculture using a genomic epidemiology approach. We used nanopore sequencing to generate SAV2 genomic sequences from infected fish sampled between 2018 and 2020 from 27 sampling sites spanning 10 major production areas (Figure 1). Our previous work has shown that nanopore sequencing of SAV and other fish viruses offers a rapid and cost-effective approach with extremely high consensus accuracy, which can be used to identify complex infections involving different viral strains [14,37]. Using extensive new nanopore data alongside existing SAV genome sequence data, we reconstruct the phylogenetic diversity of SAV2 circulating in recent Norwegian aquaculture, revealing plausible geographical origins for recent SAV2 outbreaks outside the SAV2 endemic zone, and identify novel ancestral lineages of SAV3 in fish co-infected with both SAV2 and SAV3.

2. Material and Methods

2.1. Overview of SAV Nanopore Sequencing Approach

We adapted our published strategy [14,37] to generate SAV genomic sequence data used in this study. As summarized in Figure 2, this involves PCR amplification of the SAV genome in six products, using primers designed to cross-amplify all known SAV subtypes (primer sequences in Table 1), followed by multiplex nanopore sequencing on the ONT MinION platform (Oxford Nanopore Technologies, Oxford, UK), with samples identified by a unique barcode. The six primer pairs generated overlapping PCR products in all cases except the 3’ and 5’ respective boundaries of amplicon 4 and 5 (Figure 2), as we were unable to generate amplicons bridging ORF1 (encoding the non-structural proteins) and ORF2 (encoding the structural proteins) despite extensive trials using multiple primer pairs.

2.2. Samples, Reverse Transcription and PCR

Samples were sent by Pharmaq AS to the Roslin Institute on dry ice in July 2020. All samples were total RNA extracted from heart tissue from individual fish infected with SAV according to a commercial qPCR diagnostic test. In advance of shipping, RNA extractions were done by Pharmaq Analytiq or Patogen using standard commercial approaches applied for routine diagnostics. Full metadata for samples are provided in Table S1, including the isolate name, production region, sampling date, estimated heart viral load (qPCR Cq value), ONT barcode (Section 2.4), and sequence mapping statistics (Section 2.4).
Total RNA samples were transferred into a 96 well PCR plate, inclusive of 94 samples and 2 negative controls (nuclease-free water) (Figure 2). We quantified the concentration of each sample using a Qubit 3 fluorimeter with an RNA high sensitivity assay kit (Thermo Fisher Scientific, Waltham, MA, USA) and confirmed total RNA integrity via agarose gel electrophoresis. First strand cDNA synthesis was performed in a plate-set up with standardized total RNA input across samples (0.4 µg) using Protoscript II Reverse Transcriptase (New England Biolabs, Hitchin, UK) in 20 µL reactions following the manufacturer’s protocol, including a mix of random hexamer (1.25 µM concentration) and anchored-dT (dT23VN) primers (1.5 µM concentration) and 400 U enzyme, with the following conditions: 25 °C for 5 min, 42 °C for 1 h, and 80 °C for 5 min. The 1:1 first strand cDNA was diluted 20× with nuclease-free water to make a working stock for PCR.
PCRs were done in six separate plates (one per amplicon; Figure 2) using Phusion Hot Start II High-Fidelity PCR Master Mix (Thermo Fisher Scientific) according to the manufacturer’s protocol. Each 20 µL reaction contained 10 µL of 2× Master Mix, 0.25 µM each of forward and reverse primer (Table 1), and 2 µL of working stock cDNA. The PCR cycling conditions were: 1 cycle of 98 °C for 1 min; 35 cycles of 98 °C for 15 s, 56 °C for 1 min, 72 °C for 2 min 15 s; and 7 min at 72 °C. Prior to performing plate PCRs, a pool of all 94 cDNA samples was generated and used to confirm a PCR product of the expected size was generated using agarose gel electrophoresis for each primer set, with the above stated PCR conditions. PCR reactions for 12 (out of the 94) samples were run for each amplicon on a Tapestation 4200 using D5000 ScreenTapes (Agilent Technologies, Santa Clara, CA, USA) to confirm amplification in the expected size range. The concentration of the same 12 PCR samples for each target amplicon was quantified on a Qubit 3 fluorimeter using a high sensitivity DNA kit (Thermo Fisher Scientific). The average yield of DNA in the 12 samples was used to approximately normalize the input of DNA across amplicons when generating a final pool containing all six amplicons per sample. This aimed to avoid over-dominant sequencing of regions in the SAV genome that amplified more efficiently during PCR. For samples (including both negative controls), where there was insufficient DNA for an amplicon to meet the targeted normalized level, the whole PCR reaction was included in the pool. Finally, the pooled PCR reactions for the 94 samples plus two negative controls were individually cleaned using AMPure XP beads (Beckman Coulter, High Wycombe, UK) with a 0.6 bead-to-PCR reaction ratio (selecting for amplicons 1–3 kb pair in size). The cleaned samples were quantified with a Qubit 3 fluorimeter using a high sensitivity DNA kit (Thermo Fisher Scientific). Equimolar pooling was performed across all samples to obtain 1.5 µg of DNA in total (~16 ng per pooled sample), which was the basis for ONT sequencing (next section).

2.3. ONT Library Preparation, MinION Sequencing, Basecalling, and Trimming

0.2 µg of the pooled amplicons covering all 94 samples was the input to the ONT library preparation, using the 1D Ligation Sequencing Kit (SQK-LSK109) and the Native Barcoding Expansion 96 kit (EXP-NBD196). The NEBNext Ultra II End-Repair/dA-tailing Module (New England Biolabs, E7546) was first used to repair and dA-tail the ends of the DNA. Next, ONT’s native barcodes 1–96 were blunt-end ligated onto each sample using Blunt/TA Ligase Master Mix (New England Biolabs). 5 µL of each barcoded sample was pooled and taken forward to AMPure XP (Beckman Coulter) bead purification in a 1:2.5 ratio of beads to library. The beads were washed twice using 700 µL of short fragment buffer (SFB-ONT) before a final wash in 100 µL of freshly-prepared 80% ethanol. The barcoded, pooled library was eluted in 35 µL of nuclease-free water and 1 µL was quantified using a Qubit3 fluorometer using a high sensitivity DNA kit. 0.125 µg of DNA was carried forward for adapter ligation. AMII sequencing adapters (ONT) were ligated to the sequencing library using the NEBNext Quick Ligation Module (New England Biolabs, E6056) before purification in a 1:2.5 ratio of AMPure beads. The beads were washed twice with 125 µL short fragment buffer and then eluted in 15 µL of ONT elution buffer. 1 µL of the adapter-ligated library was quantified with a Qubit 3 fluorimeter using a high sensitivity DNA kit, before 100 ng of the pooled library was sequenced using flowcell chemistry R9.4.1 on a MinION Mk1B for 24 h without real-time basecalling.
Basecalling of raw fast5 files was carried out on a Unix GPU system with Guppy v4.2.2 using the high-accuracy basecalling model. All reads with PHRED score <7 were removed using NanoFilt [38] before demultiplexing was done using Guppy v4.2.2. ONT barcodes and adapters were trimmed using Porechop v0.2.4 [39]. The resulting fastq files were taken forward to mapping (Section 2.4).

2.4. Mapping, Consensus Sequence Generation, and Filtering

The ONT reads for each barcoded sample were mapped independently to full-length references genome sequences (NCBI accessions provided in parentheses) for the seven SAV subtypes: SAV1 (JX163854), SAV2 (MH708652 and AJ316246), SAV3 (AY604238), SAV4 (MH708651), SAV5 (MH708653), (SAV6: MH238448) and SAV7 (MT882199). Mapping was done with Minimap2 [40] using the parameter -ax map-ont, with other parameters set to default. SAMtools [41] was used to retrieve sequencing coverage and mean depth per SAV genome. Initially, the sorted bam files were parsed to retrieve genome-wide mapping coverage and mean depth values for all eight SAV genomes, revealing high-coverage mapping of most samples to SAV2 (MH708652) and for a few samples to SAV3 (AY604238) (see Section 3.1, Results). Using the same strategy, we retrieved per amplicon coverage and mapping depth for SAV2 (MH708652) and SAV3 (AY604238) across all samples (mapping depth shown in Table S1). Consensus genomic sequences were then generated separately for SAV2 and SAV3 per sample barcode using SAMtools and BCFtools [42]. Pileup information for each base pair across both the SAV2 and SAV3 genomes was generated using the SAMtools mpileup option. Resulting VCF files were converted to fastq sequences using the BCFtools vcfutils.pl vcf2fq script, setting the minimum sequencing depth to assign a base at 50×. Finally, we used amplicon mapping depth for SAV2 and SAV3 in negative control samples as an additional quality filter (Table S1) to account for background PCR amplification. Specifically, we accepted twice the mapping depth of the negative control showing the highest mapping depth as the minimum sequencing depth per amplicon. We masked amplicons (i.e., converting consensus nucleotides to ‘NNN’) that did not pass this filter using BEDtools [43].
Plots for mean depth and coverage across genomes and amplicons were generated using the ggplot2 v3.3.2 R package [44]. The scripts used for mapping and consensus sequences generation are provided in the Supplementary Information (Supplementary Note S1). Consensus sequences for SAV2 or SAV3 across the 94 samples are provided in Table S1. 68 samples where consensus SAV sequences were recovered for no less than 5 out of the 6 SAV amplicons were taken forward to phylogenetic analyses (next section).

2.5. Phylogenetic and Phylogeographic Reconstructions

Phylogenetic analyses were performed on two separate alignments. The first included SAV2 genomes, with the aim to reconstruct the genetic diversity and spatial-temporal transmission dynamics of SAV2. The second included SAV2 and SAV3 genomes, with the aim to elucidate the phylogenetic position of novel SAV3 genomes discovered during the study. For the SAV2 alignment, 68 SAV2 sequences generated by ONT sequencing (Table S1) were aligned along with 13 additional SAV2 genomes (Table S2) (four generated in this study by Sanger sequencing; Section 2.6) using Mafft v.7 [45,46] with default settings. For the SAV2 and SAV3 alignment, the same SAV2 sequences were aligned along with 42 SAV3 genomes, including 3 SAV3 genomes generated here by ONT sequencing (Table S1), and 39 further SAV3 genomes (Table S3), using the same approach. Sequence alignments for the SAV2 (81 sequences, 11,684 sites) and SAV2 + SAV3 analyses (123 sequences, 11,684 sites) are provided as Supplementary Datasets S1 and S2.
For the SAV2 analysis, we performed time-calibrated Bayesian phylogenetic reconstruction in BEAST v2.6.4 [47]. The best-fitting nucleotide substitution model was established during the run within the embedded bModelTest package [48]. The analysis applied tip-date time calibration (for samples generated outside this study where only sampling year was available, we took the middle day of the year as representative), an uncorrelated relaxed clock model [49], a tree prior assuming constant viral population size, and a phylogeographic model to reconstruct the most likely geographic location for each node [50]. A Markov chain Monte Carlo chain (MCMC) of 200 million generations was used, sampling every 20,000 generations. MCMC chain convergence was confirmed in Tracer v1.7.2, where effective sample sizes were >200 for all parameters [51]. A maximum clade credibility tree was generated in TreeAnnotator v2.6.4 after removing the first 1000 sampled trees as burn-in (leaving 9000 trees sampled after convergence). Using the same SAV2 alignment, we performed a maximum likelihood phylogenetic reconstruction using the IQ-TREE webserver [52] with automated nucleotide substitution model selection [53] and an ultrafast bootstrap algorithm [54] to generate branch support values. For the SAV2+SAV3 analyses, we performed a maximum likelihood phylogenetic reconstruction using the same IQ-TREE approach. The best-fitting IQ-TREE models were TIM2+F+I and TIM2+F+G4 for the SAV2 only and SAV2 + SAV3 datasets, respectively.
Google maps (https://www.google.com/maps/, accessed on 20 August 2021) was used to visualize the locations of sampling sites and regions, using latitude and longitude coordinates provided by Pharmaq AS (Oslo, Norway). All producer-specific information has been anonymized in this study, with information connecting viral sequences in space and time-restricted to farming regions.

2.6. Sanger Sequencing of Additional SAV2 Genomes

Four additional SAV2 genomes (details in Table S2) were sequenced by traditional Sanger sequencing. Viral RNA was first purified using a QIAamp Viral RNA Mini Kit using carrier RNA on the QIACube. Purified RNA was then reverse transcribed into cDNA using Superscript III First-strand synthesis supermix (Invitrogen, Renfrewshire, UK). The near full-length genome (Table S2) was amplified in 19 overlapping amplicons by reverse transcriptase PCR using Hot Start Taq DNA Polymerase (Qiagen, Manchester, UK). Successful amplicons were verified by gel electrophoresis and submitted to LGC (Wesel, Germany) for Sanger sequencing. The sequenced amplicons were built into genome sequences manually based on overlapping regions.

3. Results

3.1. Nanopore Sequencing of SAV from Norwegian Field Samples

Six PCR amplicons spanning the SAV genome were sequenced in SAV-infected heart tissue from 94 Atlantic salmon individuals (Figure 1 and Figure 2) on the ONT MinION platform. After 24 h of sequencing, 3,175,118 reads were generated and subsequently basecalled, of which 2,554,754 passed filtering (q-score >7) and showed N50 of 1897 bp. We demultiplexed 94.6% of the passed reads, allowing data for all samples to be recovered by a unique barcode.
Commercial qPCR diagnostic tests done prior to sample selection indicated that most PD infections would represent SAV2. However, our sequencing and bioinformatic strategy captures all SAV subtypes, and can identify co-infections where multiple subtypes are present in samples [14]. Mapping against all seven SAV subtypes revealed that the SAV sequences captured were indeed dominated by SAV2 (Figure 3A,B; Table S4). To demonstrate the sensitivity of our approach, we mapped the data to two different SAV2 genomes, one from an Atlantic salmon sampled in 2007 (accession: MH708652) [37] and the other from a rainbow trout sampled in 1997 (accession: AJ316246). While these sequences share 97.9% nucleotide identity, almost all mapping was to MH708652 (Figure 3A), which is expected as Norwegian SAV2 genomes are more closely related to this sequence than AJ316246 [34]. Mean mapping depth to SAV2 was 1010.8× across the 94 samples (range: 12.5× to 4392.3×), while mean mapping coverage was 96.3% (range: 74.5% to 99.5%), with 86 samples showing >90% coverage (Table S4). Mapping to other SAV subtypes was negligible except for three samples that showed high mapping depth and coverage to SAV3 (Figure 3A,B).
We interrogated finer mapping statistics for SAV2 and SAV3, separating out each amplicon per sample (Table S1). In addition to using a global mapping depth of >50 as a cut-off for consensus base identification, we used the negative controls to set a further cut-off for acceptance of consensus sequences per amplicon (see Methods, Section 2.4). We recovered at least 5 SAV2 amplicons for 68 out of the 94 samples, with 6 amplicons successfully captured in 33 of these samples (Figure 3C). The mean number of bases recovered across these 68 samples, which were taken forward for further analysis, was 10,599 bp (SD: 799 bp) (Table S1), equivalent to ~90% of the SAV2 genome. Amplicons 1, 2, 3, 4, 5, and 6 were recovered in 85, 90, 90, 64, 38, and 85 samples, respectively, with respective mean mapping depths of 1754×, 2655×, 1672×, 675×, 390×, and 236× (Table S1). Thus, ORF1, encoding the non-structural proteins (Figure 2), amplified more efficiently in this study.
For the 3 samples (FR182796282, FR182796280, FR182796275; all from Rogaland; Table S1) showing high mapping depth/coverage to SAV3 (Figure 3A), all six amplicons were successfully recovered, with a mean of 11,076 bp (SD: 975 bp) captured, equivalent to 94% of the SAV3 genome. Significant mapping was also observed against SAV2 (accession MH708652) for these 3 samples. For FR182796280 all 6 SAV2 amplicons were recovered, while for FR182796282 and FR182796275, 2 and 4 SAV2 amplicons were recovered, respectively. This result is consistent with co-infection of SAV2 and SAV3 in single fish heart samples and demonstrates the value of our strategy to capture both genomes in such situations.

3.2. SAV2 Genetic Diversity in Recent Norwegian Aquaculture

The 68 novel SAV2 sequences, along with 13 additional SAV2 genome sequences (Table S2), were used in a time-calibrated Bayesian phylogenetic analysis (Figure 4; Figure S1). This analysis included 4 new SAV2 genomes generated by Sanger sequencing (Section 2.5; Table S2), which were sampled in Norway between 2011 and 2017, providing further resolution to divergence time estimates. For comparison, we performed a non-time-calibrated maximum likelihood analysis, providing an alternative visualization of the data (Figure S2).
Consistent with a recent study [34], all Norwegian SAV2 sequences were monophyletic and descended from Scottish SAV2 sequences (Figure 4A; Figure S2). According to the Bayesian tree (Figure 4A), which employed a relaxed uncorrelated molecular clock model [49] assuming constant viral population size, the most recent common ancestor (MRCA) of Norwegian SAV2 existed during 2010 (95% HPD: 2009–2011). This estimate is consistent with the first time that SAV2 was detected in 2010 [33]. SAV2 then split off into two major lineages around 2013 (95% HPD: 2011–2016), hereafter named SAV2a and 2b (Figure 4A). The MRCA of the sampled SAV2a and 2b lineages existed in 2016 (respective 95% HPDs: 2013–2018 and 2013–2017) (Figure 4A).
Sampling locations for SAV2a and 2b are shown in Figure 4B. SAV2b was more frequently detected (48/68 samples) and extended more northerly than SAV2a, explaining all SAV2 detection in Northern Trøndelag and as southerly as Outer Sognefjord in Vestland. The northern and southern limit of SAV2a detection was at sites in Trøndelag (Hitra area) and Rogaland, respectively. SAV2a was also detected in middle Sognefjord in Vestland. We found that SAV2a and 2b were co-present in fish sampled from the same sites in Hitra (Trøndelag) and Smøla (Møre og Romsdal), either on the exact same date or within a very close sampling timeframe of a few days apart (Figure S1; also, compare Figure 5 and Figure 6).

3.3. Phylogeographical Reconstructions of SAV2

Our analysis incorporated a phylogeography model [50], allowing the most likely ancestral locations of nodes throughout the SAV2 tree to be reconstructed (Figure 5 and Figure 6). Most samples taken from geographically proximal regions clustered closely together in the tree, with exceptions indicating viral spread between different regions. As a caveat of our approach, it should be noted that any inferences made on viral transmission are based on the sampled SAV2 sequences, and it is possible that unsampled viruses exist from alternative geographical regions, which could alter our interpretations and provide alternative transmission scenarios.
Figure 5. (A) Time-calibrated Bayesian phylogeny (Figure 4), focussing on the SAV2a clade, including results of the phylogeographic model. Tip names are colored according to the region of sample origin. PP values for key nodes are shown in black font. PP values for the ancestral location of samples are shown at relevant nodes, using font colors matching the given key for different aquaculture regions. (B) Inferred scenario of SAV2a spread from the Hustad/Molde area to Rogaland. (C) Inferred scenario of SAV2a spread from the Smøla area to Middle Sognefjord.
Figure 5. (A) Time-calibrated Bayesian phylogeny (Figure 4), focussing on the SAV2a clade, including results of the phylogeographic model. Tip names are colored according to the region of sample origin. PP values for key nodes are shown in black font. PP values for the ancestral location of samples are shown at relevant nodes, using font colors matching the given key for different aquaculture regions. (B) Inferred scenario of SAV2a spread from the Hustad/Molde area to Rogaland. (C) Inferred scenario of SAV2a spread from the Smøla area to Middle Sognefjord.
Viruses 13 02549 g005
Figure 6. (A) Time-calibrated Bayesian phylogeny (Figure 4), focussing on the SAV2b clade, including results of the phylogeographic model. Tip names are colored according to the region of sample origin. PP values for key nodes are shown in black font. PP values for the ancestral location of samples are shown at relevant nodes, using font colors matching the given key for different aquaculture regions. (B) Inferred scenario of SAV2a spread from the Fosen area to Northern Trøndelag. (C) Inferred scenario of SAV2a spread from the Hitra area to Outer Sognefjord.
Figure 6. (A) Time-calibrated Bayesian phylogeny (Figure 4), focussing on the SAV2b clade, including results of the phylogeographic model. Tip names are colored according to the region of sample origin. PP values for key nodes are shown in black font. PP values for the ancestral location of samples are shown at relevant nodes, using font colors matching the given key for different aquaculture regions. (B) Inferred scenario of SAV2a spread from the Fosen area to Northern Trøndelag. (C) Inferred scenario of SAV2a spread from the Hitra area to Outer Sognefjord.
Viruses 13 02549 g006

3.3.1. Phylogeographical Reconstructions of SAV2a

Within the SAV2a clade, there is a strong connection between the Hustad/Molde region (Møre og Romsdal) and SAV2a detected in Rogaland in 2019/20 (Figure 5A,B). Specifically, all SAV2a sequences from Rogaland and the Hustad/Molde region comprise a sub-clade with maximal statistical support (posterior probability [PP]: 1.0). Within this clade, Rogaland sequences are monophyletic (PP: 1.0) and descended from samples in Hustad/Molde, which is strongly inferred as the ancestral location for the Rogaland + Hustad/Molde clade (Figure 5A). An SAV2a sequence from Hustad/Molde that branches as the immediate ancestor of Rogaland sequences (PP: 0.5) was sampled in late Oct 2019, 25 days before the earliest SAV2a samples taken in Rogaland (Figure 5A).
Within the SAV2a clade, our analysis infers an introduction of this strain into Middle Sognefjord (Figure 5B). Specifically, an SAV2a sequence from Middle Sognefjord formed a sister group to a clade of Smøla (Møre og Romsdal) sequences (PP: 0.51) (Figure 5A). However, statistical support for Smøla being the source of SAV2a in Middle Sognefjord was weak according to the phylogeography model (PP: 0.33). As the clade containing Smøla and Middle Sognefjord sequences otherwise only contains sequences from Hustad/Molde (Møre og Romsdal) and Hitra (Trøndelag) (Figure 5A), it seems likely that Møre og Romsdal or Trøndelag was the source of virus detected in Middle Sognefjord.

3.3.2. Phylogeographical Reconstructions of SAV2b

Within the SAV2b clade, a genome sampled in Northern Trøndelag in 2017 is the most basal branch (PP: 1.0) (Figure 6A). Two SAV2b subclades are supported (PP: 1.0), the first dominated by sequences from the Fosen area and Northern Trøndelag (Figure 6A). Within this clade, Northern Trøndelag sequences are monophyletic (PP: 1.0) and descended from sequences sampled from the Fosen area of Trøndelag, which is strongly recovered as the ancestral location of SAV2b introduced to Northern Trøndelag according to the phylogeography model (PP: 0.98) (Figure 6A,B). There is also evidence at the base of this subclade for transmission of SAV2b across different sites located in southern Trøndelag and Møre og Romsdal (Figure 6A,B).
The second SAV2b subclade is dominated by sequences from Trøndelag (Hitra, Fosen, and Frøya) and Møre og Romsdal (Kristiansund/Frei and Smøla) (Figure 6A). However, sitting at the base of this subclade, a sequence from Outer Sognefjord (Vestland) forms a sister relationship with a monophyletic group of Hitra sequences (PP: 0.93). Hitra was not strongly supported as the origin of SAV2b in Outer Sognefjord according to the phylogeography model (PP: 0.41). However, considering that the sister clade to the Hitra and Outer Sognefjord sequences solely includes SAV2b sequences from Trøndelag and Møre og Romsdal, it is likely that one of these counties was the source of SAV2b introduction to Outer Sognefjord in 2019, occurring independently from a separate introduction of SAV2a to Middle Sognefjord in a similar timeframe (Figure 5A,C).

3.4. Novel SAV3 Strains Detected in Fish Co-Infected with SAV2 and SAV3

A maximum likelihood phylogenetic tree was generated, including the three SAV3 genome sequences captured as co-infections with SAV2a in individual fish heart samples from Rogaland (Section 3.1). This was done in a framework covering all published SAV3 and SAV2 genomes, in addition to 72 SAV2 sequences generated in this study (Figure 7). The mid-point rooted tree recovered monophyletic SAV2 and SAV3 clades and the same overall branching of SAV2 sequences as the Bayesian tree (Figure 1; Figure S1). Within the SAV3 clade, the three novel genomes branched at the base of all previously published strains, which were sampled between 2002 and 2019. Furthermore, while FR182796282 and FR182796275 comprised a monophyletic group sister to all previously characterized SAV3 sequences, FR182796280 was, in turn, the sister sequence to all other SAV3 sequences, including FR182796282 and FR182796275 (Figure 7). A previous study revealed that the common ancestor of SAV3 genomes (excluding the sequences generated in this study) existed in 1994 [14]. Consequently, recently circulating SAV3 in Rogaland includes strains that shared a common ancestor with previously characterized SAV3 strains, no less than 27 years ago.

4. Discussion

PD remains a highly problematic and persistent viral disease in Atlantic salmon aquaculture, diagnosed at 158 different farming locations in Norway during routine PCR surveillance in 2020 [36]. Using genomic epidemiology approaches to study SAV as the causative agent, we have reconstructed scenarios involving multiple independent PD transmission events across different production regions. Our results strongly implicate farms in Trøndelag and Møre og Romsdal as a common source for SAV2a and 2b transmission outside the current zone of SAV2 toleration. This includes independent introductions of SAV2a and 2b into Sognefjord within Vestland, and of SAV2a into Rogaland, in 2019. These events led to the establishment of SAV2 control zones aiming to eradicate the virus outside the toleration zone [36]. In each case, the inferred spread of SAV2 occurred in the opposite direction to the prevailing Norwegian current, and in the case of the proposed transmission of SAV2a to Rogaland (Figure 5A,B), in an inferred short timeframe involving a huge physical distance. Both factors are inconsistent with passive diffusion scenarios and instead imply repeated horizontal spread via anthropogenic routes, likely through well boat activity as a known risk factor for PD transmission [55,56]. In contrast, reconstructed scenarios for Northward spread of SAV2b outside the SAV2 toleration zone—namely the inferred single introduction of SAV2b from Fosen to sites in Northern Trøndelag in 2019 (Figure 6A,B), is consistent with passive diffusion facilitated by ocean currents, but cannot exclude active transmission via a well-boat or other vector.
SAV3 genomes sequenced in this study were a component of SAV2a-SAV3 subtype co-infections from Rogaland (Figure 7) within the SAV3 toleration zone (Figure 1). The short temporal period separating the introduction of SAV2a to Rogaland from Møre og Romsdal in late 2019 (Figure 5A) to the identification of SAV2a-SAV3 co-infections in Rogaland in early 2020 indicates that few barriers exist to co-infections arising. This is consistent with a recent study that documented a PD outbreak involving both SAV2 and SAV3, where around 10% of the infections were SAV2-SAV3 co-infections according to PCR [57]. Another study identified SAV co-infections involving other subtype combinations [14]. It remains unknown if co-infections have significance in terms of pathological outcomes or PD vaccine efficacy, but future studies should consider such issues, while additional efforts may be needed to ensure co-infections are captured in routine PD surveillance. Our discovery of novel strains of SAV3 ancestral to all previously characterized genomes (Figure 7) is notable, as it implies considerable uncharacterized genetic diversity for this subtype, arguing for increased genome sequencing efforts in the SAV3 toleration zone.
Viral whole-genome sequencing has become a routine and cost-effective scientific approach that has gained much prominence during the human COVID-19 pandemic. Considering the extensive commercial importance of salmon aquaculture, balanced against the scale of financial, welfare, and reputational costs caused by viral diseases, we argue that efforts are required to develop and sustain routine viral genomic surveillance programs as part of the ‘toolbox’ to limit viral disease outbreaks [8]. Data such as that reported here, ideally expanded in spatial and temporal resolution, can be used by producers and regulating authorities to support a range of actions aimed at disease control. Moreover, the sequencing approach utilized here and in related studies, e.g., [13,14,16,17], can be readily transferred to other salmonid viruses. Beyond the identification of candidate disease transmission routes, salmonid viruses warrant routine genomic surveillance to capture variants of concern. This is emphatically illustrated by the recent discovery of novel infectious pancreatic necrosis virus (IPNv) strains that have may have evolved to evade long-standing anthropogenic control measures [17,58], including an IPN resistance QTL that has been extremely successful in limiting IPN outbreaks [9].
Finally, the conclusions made in this study were made possible through the cooperation of multiple Atlantic salmon producers sharing PD-infected samples for sequencing. Such a broad ‘buy-in’ from within the aquaculture industry will be essential to achieve and maximize the industry-wide benefits of viral genomic surveillance and epidemiology in the future, ensuring appropriate representation and spatial-temporal resolution of infected samples, essential for powerful reconstructions of viral transmission and evolution in support of disease control actions.

Supplementary Materials

The following are available online at https://0-www-mdpi-com.brum.beds.ac.uk/article/10.3390/v13122549/s1, Figure S1: Bayesian time-calibrated phylogenetic tree used in Figure 4; Figure S2. Mid-point rooted maximum likelihood phylogenetic tree summarizing the evolution of SAV2 in Norwegian aquaculture; Supplementary Note 1: Scripts used in the study; Table S1: Full details of all samples used in study; Table S2: Details of additional sequences used for the phylogenetic analysis of SAV2 genomes; Table S3: Details of additional sequences used for the phylogenetic analysis of SAV2 and SAV3 genomes; Table S4: Detailed mapping statistics summarized in Figure 3A,B in the main text; Supplementary Dataset 1: SAV2 alignment; Supplementary Dataset 2: SAV2+3 alignment.

Author Contributions

Designed study: D.J.M., S.A., and M.K.; collated samples in Norway: S.A. and M.K.; sorted samples for processing: D.J.M.; designed PCR strategy and primers: M.D.G. and D.J.M.; tested PCR conditions: D.J.M.; performed cDNA synthesis and PCRs: D.J.M. and R.R.D.; performed bead cleanups and prepped PCR samples for sequencing: R.R.D.; performed nanopore sequencing and basecalling: O.E.; performed mapping and consensus sequence generation: M.K.G.; performed phylogenetic analyses: D.J.M. and M.D.G.; produced figures and tables: D.J.M. and M.K.G.; drafted manuscript: D.J.M.; interpreted data and contributed to writing, leading to the final manuscript version: all authors. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Biotechnology and Biological Sciences Research Council (grant nos. BBS/E/D/20002173 and BB/M010996/1), in addition to funding contributed by the following companies (in alphabetical order): Emilsen Fisk AS, Grieg Seafood ASA, Måsøval AS, Midt-Norsk Havbruk AS, Osland Havbruk AS, Pharmaq AS, Salmar ASA, and Sinkaberg Hansen AS.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

Seventy-five novel SAV genomes generated in this study and used in the presented phylogenetic analysis were deposited in the National Center for Biotechnology Information (NCBI) database with accession numbers OL754508–OL754582. The same sequences, along with an additional 25 partial SAV2 genomic sequences generated in the study but not used in phylogenetic analysis, are provided in Table S1.

Acknowledgments

We are grateful to Bjørn Gillund (Sinkaberg Hansen) for initiating the project in liaison with S.A. (Pharmaq AS). We also thank the following companies (in alphabetical order) for providing samples used in the study: Alsaker Fjordbruk, Emilsen Fisk, Grieg Seafood Rogaland, Måsøval, Midt Norsk Havbruk, Mowi, Osland Havbruk, Salmar, and Salmonor. Elin Petterson (formerly Pharmaq AS) generated the 4 Sanger sequenced SAV2 genomes reported in this work. We thank Diego Perojil Morata (Macqueen lab, Edinburgh) and Irja Roija Sunde (Pharmaq Analytiq AS) for helping with sample processing.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Gardy, J.L.; Loman, N.J. Towards a genomics-informed, real-time, global pathogen surveillance system. Nat. Rev. Genet. 2018, 19, 9–20. [Google Scholar] [CrossRef]
  2. Gire, S.K.; Goba, A.; Andersen, K.G.; Sealfon, R.S.; Park, D.J.; Kanneh, L.; Jalloh, S.; Momoh, M.; Fullah, M.; Dudas, G.; et al. Genomic surveillance elucidates Ebola virus origin and transmission during the 2014 outbreak. Science 2014, 345, 1369–1372. [Google Scholar] [CrossRef] [Green Version]
  3. Faria, N.R.; Quick, J.; Claro, I.M.; Thézé, J.; de Jesus, J.G.; Giovanetti, M.; Kraemer, M.U.G.; Hill, S.C.; Black, A.; da Costa, A.C.; et al. Establishment and cryptic transmission of Zika virus in Brazil and the Americas. Nature 2017, 546, 406–410. [Google Scholar] [CrossRef]
  4. Lo, S.W.; Jamrozy, D. Genomics and epidemiological surveillance. Nat. Rev. Microbiol. 2020, 18, 478. [Google Scholar] [CrossRef]
  5. COVID-19 Genomics UK (COG-UK). An integrated national scale SARS-CoV-2 genomic surveillance network. Lancet Microbe 2020, 1, e99–e100. [Google Scholar] [CrossRef]
  6. Béné, C.; Barange, M.; Subasinghe, R.; Pinstrup-Andersen, P.; Merino, G.; Hemre, G.-I.; Williams, M. Feeding 9 billion by 2050—Putting fish back on the menu. Food Secur. 2015, 7, 261–274. [Google Scholar] [CrossRef] [Green Version]
  7. Stentiford, G.D.; Sritunyalucksana, K.; Flegel, T.W.; Williams, B.A.; Withyachumnarnkul, B.; Itsathitphaisarn, O.; Bass, D. New paradigms to help solve the global aquaculture disease crisis. PLoS Pathog. 2017, 13, e1006160. [Google Scholar] [CrossRef] [PubMed]
  8. Mugimba, K.K.; Byarugaba, D.K.; Mutoloki, S.; Evensen, Ø.; Munang’andu, H.M. Challenges and solutions to viral diseases of finfish in marine aquaculture. Pathogens 2021, 10, 673. [Google Scholar] [CrossRef] [PubMed]
  9. Houston, R.D.; Bean, T.P.; Macqueen, D.J.; Gundappa, M.K.; Jin, Y.H.; Jenkins, T.L.; Selly, S.L.C.; Martin, S.A.M.; Stevens, J.R.; Santos, E.M.; et al. Harnessing genomics to fast-track genetic improvement in aquaculture. Nat. Rev. Genet. 2020, 21, 389–409. [Google Scholar] [CrossRef] [PubMed]
  10. Miccoli, A.; Manni, M.; Picchietti, S.; Scapigliati, G. State-of-the-art vaccine research for aquaculture use: The case of three economically relevant fish species. Vaccines 2021, 9, 140. [Google Scholar] [CrossRef]
  11. Bayliss, S.C.; Verner-Jeffreys, D.W.; Bartie, K.L.; Aanensen, D.M.; Sheppard, S.K.; Adams, A.; Feil, E.J. The promise of whole genome pathogen sequencing for the molecular epidemiology of emerging aquaculture pathogens. Front. Microbiol. 2017, 8, 121. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Kao, R.R.; Haydon, D.T.; Lycett, S.J.; Murcia, P.R. Supersize me: How whole-genome sequencing and big data are transforming epidemiology. Trends Microbiol. 2014, 22, 282–291. [Google Scholar] [CrossRef]
  13. Ulrich, K.; Wehner, S.; Bekaert, M.; Di Paola, N.; Dilcher, M.; Muir, K.F.; Taggart, J.B.; Matejusova, I.; Weidmann, M. Molecular epidemiological study on Infectious Pancreatic Necrosis Virus isolates from aquafarms in Scotland over three decades. J. Gen. Virol. 2018, 99, 1567–1581. [Google Scholar] [CrossRef] [PubMed]
  14. Gallagher, M.D.; Karlsen, M.; Petterson, E.; Haugland, Ø.; Matejusova, I.; Macqueen, D.J. Genome sequencing of SAV3 reveals repeated seeding events of viral strains in Norwegian aquaculture. Front. Microbiol. 2020, 11, 740. [Google Scholar] [CrossRef]
  15. Gallagher, M.D.; Matejusova, I.; Ruane, N.M.; Macqueen, D.J. Genome-wide target enriched viral sequencing reveals extensive ‘hidden’ salmonid alphavirus diversity in farmed and wild fish populations. Aquaculture 2020, 522, 735117. [Google Scholar] [CrossRef]
  16. Mordecai, G.J.; Miller, K.M.; Bass, A.L.; Bateman, A.W.; Teffer, A.K.; Caleta, J.M.; Di Cicco, E.; Schulze, A.D.; Kaukinen, K.H.; Li, S.; et al. Aquaculture mediates global transmission of a viral pathogen to wild salmon. Sci. Adv. 2021, 7, eabe2592. [Google Scholar] [CrossRef]
  17. Benkaroun, J.; Muir, K.F.; Allshire, R.; Tamer, C.; Weidmann, M. Isolation of a new infectious pancreatic necrosis virus (IPNV) variant from a fish farm in Scotland. Viruses 2021, 13, 385. [Google Scholar] [CrossRef]
  18. Houston, R.D.; Macqueen, D.J. Atlantic salmon (Salmo salar L.) genetics in the 21st century: Taking leaps forward in aquaculture and biological understanding. Anim. Genet. 2019, 50, 3–14. [Google Scholar] [CrossRef] [Green Version]
  19. Statistics Norway, Aquaculture. Available online: https://www.ssb.no/en/fiskeoppdrett/ (accessed on 5 November 2021).
  20. Crane, M.; Hyatt, A. Viruses of fish: An overview of significant pathogens. Viruses 2011, 3, 2025–2046. [Google Scholar] [CrossRef]
  21. Nelson, R.T.; McLoughlin, M.F.; Rowley, H.M.; Platten, M.A.; McCormick, J.I. Isolation of a toga-like virus from farmed Atlantic salmon Salmo salar with pancreas disease. Dis. Aquat. Org. 1995, 22, 25–32. [Google Scholar] [CrossRef] [Green Version]
  22. Rowley, H.M.; Doherty, C.E.; McLoughlin, M.F.; Welsh, M.D. Isolation of salmon pancreas disease virus (SPDV) from farmed Atlantic salmon, Salmo salar L., in Scotland. J. Fish Dis. 1990, 21, 469–471. [Google Scholar] [CrossRef]
  23. Christie, K.E.; Fyrand, K.; Holtet, L.; Rowley, H.M. Isolation of pancreas disease virus from farmed Atlantic salmon, Salmo salar L. in Norway. J. Fish Dis. 1998, 21, 391–394. [Google Scholar] [CrossRef]
  24. Weston, J.H.; Welsh, M.D.; McLoughlin, M.F.; Todd, D. Salmon pancreas disease virus, an alphavirus infecting farmed Atlantic salmon, Salmo salar L. Virology 1999, 256, 188–195. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Weston, J.H.; Villoing, S.; Bremont, M.; Castric, J.; Pfeffer, M.; Jewhurst, V.; McLoughlin, M.; Rødseth, O.; Christie, K.E.; Koumans, J.; et al. Comparison of 2 aquatic alphaviruses, salmon pancreas disease virus and sleeping disease virus, by using genome sequence analysis, monoclonal reactivity, and cross-infection. J. Virol. 2002, 76, 6155–6163. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. McLoughlin, M.F.; Graham, D.A. Alphavirus infections in salmonids—A review. J. Fish Dis. 2007, 30, 511–531. [Google Scholar] [CrossRef]
  27. Deperasińska, I.; Schulz, P.; Siwicki, A.K. Salmonid Alphavirus (SAV). J. Vet. Res. 2018, 62, 1–6. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  28. Fringuelli, E.; Rowley, H.M.; Wilson, J.C.; Hunter, R.; Rodger, H.; Graham, D.A. Phylogenetic analyses and molecular epidemiology of European salmonid alphaviruses (SAV) based on partial E2 and nsP3 gene nucleotide sequences. J. Fish Dis. 2008, 31, 811–823. [Google Scholar] [CrossRef] [PubMed]
  29. Tighe, A.J.; Gallagher, M.D.; Carlsson, J.; Matejusova, I.; Swords, F.; Macqueen, D.J.; Ruane, N.M. Nanopore whole genome sequencing and partitioned phylogenetic analysis supports a new salmonid alphavirus genotype (SAV7). Dis. Aquat. Org. 2020, 142, 203–211. [Google Scholar] [CrossRef]
  30. Karlsen, M.; Gjerset, B.; Hansen, T.; Rambaut, A. Multiple introductions of salmonid alphavirus from a wild reservoir have caused independent and self-sustainable epizootics in aquaculture. J. Gen. Virol. 2014, 95, 52–59. [Google Scholar] [CrossRef]
  31. Bruno, D.W.; Noguera, P.A.; Black, J.; Murray, W.; Macqueen, D.J.; Matejusova, I. Identification of a wild reservoir of salmonid alphavirus in common dab Limanda limanda, with emphasis on virus culture and sequencing. Aquacult. Environ. Interact. 2014, 5, 89–98. [Google Scholar] [CrossRef] [Green Version]
  32. Hodneland, K.; Bratland, A.; Christie, K.E.; Endresen, C.; Nylund, A. New subtype of salmonid alphavirus (SAV), Togaviridae, from Atlantic salmon Salmo salar and rainbow trout Oncorhynchus mykiss in Norway. Dis. Aquat. Org. 2005, 66, 113–120. [Google Scholar] [CrossRef] [PubMed]
  33. Hjortaas, M.J.; Skjelstad, H.R.; Taksdal, T.; Olsen, A.B.; Johansen, R.; Bang-Jensen, B.; Orpetveit, I.; Sindre, H. The first detections of subtype 2-related salmonid alphavirus (SAV2) in Atlantic salmon, Salmo salar L., in Norway. J. Fish Dis. 2013, 36, 71–74. [Google Scholar] [CrossRef] [PubMed]
  34. Hjortaas, M.J.; Fringuelli, E.; Monjane, A.L.; Mikalsen, A.B.; Jonassen, C.M.; Savage, P.; Sindre, H. Emergence of Salmonid Alphavirus genotype 2 in Norway-molecular characterization of viral strains circulating in Norway and Scotland. Viruses 2021, 13, 1556. [Google Scholar] [CrossRef] [PubMed]
  35. Stien, L.; Tørud, B.; Gismervik, S.; Lien, M.; Medaas, C.; Osmundsen, T.; Kristiansen, T.; Størkersen, K. Governing the welfare of Norwegian farmed salmon: Three conflict cases. Mar. Policy 2020, 117. [Google Scholar] [CrossRef]
  36. Sommerset, I.; Bang Jensen, B.; Bornø, B.; Haukaas, A.; Brun, E. (Eds.) The Health Situation in Norwegian Aquaculture 2020; Norwegian Veterinary Institute: Oslo, Norway, 2021. [Google Scholar]
  37. Gallagher, M.D.; Matejusova, I.; Nguyen, L.; Ruane, N.M.; Falk, K.; Macqueen, D.J. Nanopore sequencing for rapid diagnostics of salmonid RNA viruses. Sci. Rep. 2018, 8, 16307. [Google Scholar] [CrossRef] [Green Version]
  38. De Coster, W.; D’Hert, S.; Schultz, D.T.; Cruts, M.; Van Broeckhoven, C. NanoPack: Visualizing and processing long-read sequencing data. Bioinformatics 2018, 34, 2666–2669. [Google Scholar] [CrossRef]
  39. Wick, R.R.; Judd, L.M.; Gorrie, C.L.; Holt, K.E. Completing bacterial genome assemblies with multiplex MinION sequencing. Microb. Genom. 2017, 3, e000132. [Google Scholar] [CrossRef]
  40. Li, H. Minimap2: Pairwise alignment for nucleotide sequences. Bioinformatics 2018, 34, 3094–3100. [Google Scholar] [CrossRef]
  41. Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Durbin, R.; 1000 Genome Project Data Processing Subgroup. The sequence alignment/map format and SAMtools. Bioinformatics 2009, 25, 2078–2079. [Google Scholar] [CrossRef] [Green Version]
  42. Danecek, P.; Bonfield, J.K.; Liddle, J.; Marshall, J.; Ohan, V.; Pollard, M.O.; Whitwham, A.; Keane, T.; McCarthy, S.A.; Davies, R.M.; et al. Twelve years of SAMtools and BCFtools. Gigascience 2021, 10, giab008. [Google Scholar] [CrossRef]
  43. Quinlan, A.R. BEDTools: The Swiss-army tool for genome feature analysis. Curr. Protoc. Bioinform. 2014, 47, 11–12. [Google Scholar] [CrossRef]
  44. Wickham, H. ggplot2: Elegant Graphics for Data Analysis; Springer: New York, NY, USA, 2016; ISBN 978-3-319-24277-4. Available online: https://ggplot2.tidyverse.org (accessed on 5 November 2021).
  45. Katoh, K.; Standley, D.M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 2013, 30, 772–780. [Google Scholar] [CrossRef] [Green Version]
  46. Katoh, K.; Rozewicki, J.; Yamada, K.D. MAFFT online service: Multiple sequence alignment, interactive sequence choice and visualization. Brief. Bioinform. 2019, 20, 1160–1166. [Google Scholar] [CrossRef] [Green Version]
  47. Bouckaert, R.; Heled, J.; Kühnert, D.; Vaughan, T.; Wu, C.H.; Xie, D.; Suchard, M.A.; Rambaut, A.; Drummond, A.J. BEAST 2: A software platform for bayesian evolutionary analysis. PLoS Comput. Biol. 2014, 10, e1003537. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  48. Bouckaert, R.R.; Drummond, A.J. bModelTest: Bayesian phylogenetic site model averaging and model comparison. BMC Evol. Biol. 2017, 17, 42. [Google Scholar] [CrossRef] [Green Version]
  49. Drummond, A.J.; Ho, S.Y.W.; Phillips, M.J.; Rambaut, A. Relaxed phylogenetics and dating with confidence. PLoS Biol. 2006, 4, e88. [Google Scholar] [CrossRef] [PubMed]
  50. Lemey, P.; Rambaut, A.; Drummond, A.J.; Suchard, M.A. Bayesian phylogeography finds its roots. PLoS Comput. Biol. 2009, 5, e1000520. [Google Scholar] [CrossRef] [Green Version]
  51. Rambaut, A.; Drummond, A.J.; Xie, D.; Baele, G.; Suchard, M.A. Posterior summarization in Bayesian phylogenetics using Tracer 1.7. Syst. Biol. 2018, 67, 901–904. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  52. Trifinopoulos, J.; Nguyen, L.T.; von Haeseler, A.; Minh, B.Q. W-IQ-TREE: A fast online phylogenetic tool for maximum likelihood analysis. Nucleic Acids Res. 2016, 44, W232–W235. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  53. Kalyaanamoorthy, S.; Minh, B.Q.; Wong, T.K.F.; Von Haeseler, A.; Jermiin, L.S. ModelFinder: Fast model selection for accurate phylogenetic estimates. Nat. Methods 2017, 14, 587–589. [Google Scholar] [CrossRef] [Green Version]
  54. Hoang, D.T.; Chernomor, O.; von Haeseler, A.; Minh, B.Q.; Vinh, L.S. UFBoot2: Improving the Ultrafast Bootstrap Approximation. Mol. Biol. Evol. 2018, 35, 518–522. [Google Scholar] [CrossRef] [PubMed]
  55. Høgåsen, H.R.; Lyngstad, T.M.; Jansen, M.D.; Nilsen, A. Risk of Disease Transfer with Wellboats in Norway—Technical Report; Veterinærinstituttets rapportserie Veterinærinstituttet: Oslo, Norway, 2015. [Google Scholar]
  56. Jansen, M.D.; Bang Jensen, B.; McLoughlin, M.F.; Rodger, H.D.; Taksdal, T.; Sindre, H.; Graham, D.A.; Lillehaug, A. The epidemiology of pancreas disease in salmonid aquaculture: A summary of the current state of knowledge. J. Fish Dis. 2017, 40, 141–155. [Google Scholar] [CrossRef]
  57. Røsaeg, M.V.; Thorarinsson, R.; Aunsmo, A. Effect of vaccines against pancreas disease in farmed Atlantic salmon. J. Fish Dis. 2021. [Google Scholar] [CrossRef] [PubMed]
  58. Hillestad, B.; Johannessen, S.; Melingen, G.O.; Moghadam, H.K. Identification of a new infectious pancreatic necrosis virus (IPNV) variant in atlantic salmon (Salmo salar L.) that can cause high mortality even in genetically resistant fish. Front. Genet. 2021, 12. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Sampling locations mapped against the current definition of SAV endemic zones.
Figure 1. Sampling locations mapped against the current definition of SAV endemic zones.
Viruses 13 02549 g001
Figure 2. Visualization of steps used in this study generate nanopore sequencing data spanning the SAV genome. See Methods (Section 2.1, Section 2.2 and Section 2.3) for full details.
Figure 2. Visualization of steps used in this study generate nanopore sequencing data spanning the SAV genome. See Methods (Section 2.1, Section 2.2 and Section 2.3) for full details.
Viruses 13 02549 g002
Figure 3. Summary statistics for SAV nanopore sequencing data. (A) Boxplot depicting mapping depth across the reference genomes for the seven SAV subtypes (n = 94 samples). Mapping depth represents the number of times the base pairs in the SAV genome are covered by the mapped nanopore reads. (B) Frequency histogram depicting the number of samples showing genome-wide coverage against the same reference genomes (bin-widths of 5%). Coverage represents the proportion of base pairs in the reference genome covered by mapped nanopore reads. (C). Histogram showing the number of successful amplicons generated (out of 6) as a proportion of n = 94 samples.
Figure 3. Summary statistics for SAV nanopore sequencing data. (A) Boxplot depicting mapping depth across the reference genomes for the seven SAV subtypes (n = 94 samples). Mapping depth represents the number of times the base pairs in the SAV genome are covered by the mapped nanopore reads. (B) Frequency histogram depicting the number of samples showing genome-wide coverage against the same reference genomes (bin-widths of 5%). Coverage represents the proportion of base pairs in the reference genome covered by mapped nanopore reads. (C). Histogram showing the number of successful amplicons generated (out of 6) as a proportion of n = 94 samples.
Viruses 13 02549 g003
Figure 4. (A) Time-calibrated Bayesian phylogeny summarizing the evolution of SAV2 in Norwegian aquaculture. The data represents 81 SAV2 sequences, inclusive of 68 generated by Nanopore sequencing in this study with an average length of 10,600 bp (Table S1) and 14 additional SAV2 genome sequences (Table S2). The tree was generated in BEAST2 using a relaxed molecular clock model calibrated with the tip sampling dates, a coalescent prior assuming constant population size and phylogeographic model to reconstruct the most likely geographic location for each node (data from phylogeographic model summarized in Figure 5 and Figure 6). Information is summarized at key nodes, including inferred median node age, with the associated 95% highest posterior density (HPD) around node ages given in parentheses. Posterior probability (PP) values for key nodes are shown (maximum PP = 1.0). A complete representation of the tree, inclusive of 95% HPDs on node ages and with tip sample identifiers, is provided in Figure S1. The sequence alignment is provided in Supplementary Dataset S1. (B) Geographical location of SAV2a and 2b sequences inferred by phylogenetics according to the sampling site.
Figure 4. (A) Time-calibrated Bayesian phylogeny summarizing the evolution of SAV2 in Norwegian aquaculture. The data represents 81 SAV2 sequences, inclusive of 68 generated by Nanopore sequencing in this study with an average length of 10,600 bp (Table S1) and 14 additional SAV2 genome sequences (Table S2). The tree was generated in BEAST2 using a relaxed molecular clock model calibrated with the tip sampling dates, a coalescent prior assuming constant population size and phylogeographic model to reconstruct the most likely geographic location for each node (data from phylogeographic model summarized in Figure 5 and Figure 6). Information is summarized at key nodes, including inferred median node age, with the associated 95% highest posterior density (HPD) around node ages given in parentheses. Posterior probability (PP) values for key nodes are shown (maximum PP = 1.0). A complete representation of the tree, inclusive of 95% HPDs on node ages and with tip sample identifiers, is provided in Figure S1. The sequence alignment is provided in Supplementary Dataset S1. (B) Geographical location of SAV2a and 2b sequences inferred by phylogenetics according to the sampling site.
Viruses 13 02549 g004
Figure 7. Maximum likelihood phylogeny reconstructing the evolution of SAV2 and SAV3 in Norwegian aquaculture, characterizing the phylogenetic position of the novel SAV3 genomes sequenced in this study (Table S1). The other included sequences represent 39 SAV3 genomes available on NCBI (Table S3) and the 81 SAV2 sequences used elsewhere in this study (Tables S1 and S2, i.e., Figure 3, Figure 4 and Figure 5). The tree was generated using IQTREE [52] with the best-fitting nucleotide model (TIM2+F+G4). An ultrafast bootstrapping algorithm [53] was used to generate node support values (maximum value: 100). The sequence alignment is provided in the Supplementary Dataset S2.
Figure 7. Maximum likelihood phylogeny reconstructing the evolution of SAV2 and SAV3 in Norwegian aquaculture, characterizing the phylogenetic position of the novel SAV3 genomes sequenced in this study (Table S1). The other included sequences represent 39 SAV3 genomes available on NCBI (Table S3) and the 81 SAV2 sequences used elsewhere in this study (Tables S1 and S2, i.e., Figure 3, Figure 4 and Figure 5). The tree was generated using IQTREE [52] with the best-fitting nucleotide model (TIM2+F+G4). An ultrafast bootstrapping algorithm [53] was used to generate node support values (maximum value: 100). The sequence alignment is provided in the Supplementary Dataset S2.
Viruses 13 02549 g007
Table 1. Primer sequences used to amplify SAV genomic regions that were nanopore sequenced.
Table 1. Primer sequences used to amplify SAV genomic regions that were nanopore sequenced.
AmpliconForward Primer (5’–3’)Reverse Primer (5’–3’)Product Length 1Genomic Location 1
1AGACTGCGTTTCCAGGGTTCCCCGTAGATGCCAATCGTGT2162 bp89–2251 bp
2GAATACGTTTACGAATTGTCCTCCACCGAGACGGACTTGAAATACC1967 bp 2083–4049 bp
3GACCTGGTGTTTTGTGACGCTCCCGTGTTAGCCCTCTAGG1895 bp3838–5732 bp
4GCAGCGTCCACRGCCATAGTCATCAGGCGTTTTACAGGGTC2015 bp5531–7545 bp
5ATGTTTCCCATGCAATTCACCAACGGTGCGGCTTGCCCTGGGTGAT1816 bp7818–9633 bp
6AGAGAACGCAGCAAGGGCGGCACTTCTTCACCACGCA2431 bp9318–11,748 bp
1 Product length and location defined against the genome sequence of SAV2 isolate SCO07-4619 [37], NCBI accession number: MH708652 (length: 11,780 bp). The relative length and position of each amplicon against the same SAV2 genome is shown in Figure 2.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Macqueen, D.J.; Eve, O.; Gundappa, M.K.; Daniels, R.R.; Gallagher, M.D.; Alexandersen, S.; Karlsen, M. Genomic Epidemiology of Salmonid Alphavirus in Norwegian Aquaculture Reveals Recent Subtype-2 Transmission Dynamics and Novel Subtype-3 Lineages. Viruses 2021, 13, 2549. https://0-doi-org.brum.beds.ac.uk/10.3390/v13122549

AMA Style

Macqueen DJ, Eve O, Gundappa MK, Daniels RR, Gallagher MD, Alexandersen S, Karlsen M. Genomic Epidemiology of Salmonid Alphavirus in Norwegian Aquaculture Reveals Recent Subtype-2 Transmission Dynamics and Novel Subtype-3 Lineages. Viruses. 2021; 13(12):2549. https://0-doi-org.brum.beds.ac.uk/10.3390/v13122549

Chicago/Turabian Style

Macqueen, Daniel J., Oliver Eve, Manu Kumar Gundappa, Rose Ruiz Daniels, Michael D. Gallagher, Svein Alexandersen, and Marius Karlsen. 2021. "Genomic Epidemiology of Salmonid Alphavirus in Norwegian Aquaculture Reveals Recent Subtype-2 Transmission Dynamics and Novel Subtype-3 Lineages" Viruses 13, no. 12: 2549. https://0-doi-org.brum.beds.ac.uk/10.3390/v13122549

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop