Next Article in Journal
Defense-Related Gene Expression Following an Orthotospovirus Infection Is Influenced by Host Resistance in Arachis hypogaea
Previous Article in Journal
Oral Immunization with Lactobacillus casei Expressing the Porcine Circovirus Type 2 Cap and LTB Induces Mucosal and Systemic Antibody Responses in Mice
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Illuminating the Plant Rhabdovirus Landscape through Metatranscriptomics Data

by
Nicolás Bejerman
1,2,*,
Ralf G. Dietzgen
3 and
Humberto Debat
1,2
1
Instituto de Patología Vegetal, Centro de Investigaciones Agropecuarias, Instituto Nacional de Tecnología Agropecuaria (IPAVE—CIAP—INTA), Camino 60 Cuadras Km 5.5, Córdoba X5020ICA, Argentina
2
Consejo Nacional de Investigaciones Científicas y Técnicas, Unidad de Fitopatología y Modelización Agrícola, Camino 60 Cuadras Km 5.5, Córdoba X5020ICA, Argentina
3
Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, Brisbane, QLD 4072, Australia
*
Author to whom correspondence should be addressed.
Submission received: 17 May 2021 / Revised: 25 June 2021 / Accepted: 26 June 2021 / Published: 5 July 2021
(This article belongs to the Section Viruses of Plants, Fungi and Protozoa)

Abstract

:
Rhabdoviruses infect a large number of plant species and cause significant crop diseases. They have a negative-sense, single-stranded unsegmented or bisegmented RNA genome. The number of plant-associated rhabdovirid sequences has grown in the last few years in concert with the extensive use of high-throughput sequencing platforms. Here, we report the discovery of 27 novel rhabdovirus genomes associated with 25 different host plant species and one insect, which were hidden in public databases. These viral sequences were identified through homology searches in more than 3000 plant and insect transcriptomes from the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA) using known plant rhabdovirus sequences as the query. The identification, assembly and curation of raw SRA reads resulted in sixteen viral genome sequences with full-length coding regions and ten partial genomes. Highlights of the obtained sequences include viruses with unique and novel genome organizations among known plant rhabdoviruses. Phylogenetic analysis showed that thirteen of the novel viruses were related to cytorhabdoviruses, one to alphanucleorhabdoviruses, five to betanucleorhabdoviruses, one to dichorhaviruses and seven to varicosaviruses. These findings resulted in the most complete phylogeny of plant rhabdoviruses to date and shed new light on the phylogenetic relationships and evolutionary landscape of this group of plant viruses. Furthermore, this study provided additional evidence for the complexity and diversity of plant rhabdovirus genomes and demonstrated that analyzing SRA public data provides an invaluable tool to accelerate virus discovery, gain evolutionary insights and refine virus taxonomy.

1. Introduction

The costs for high-throughput sequencing (HTS) have been significantly reduced each year due to advances in sequencing technologies; therefore, the number of genome and transcriptome sequencing projects has been steadily increasing, resulting in a massive number of nucleotides deposited in the Sequence Read Archive (SRA) of the National Center for Biotechnology Information (NCBI). Over 16,000 petabases (1015 bases) have been deposited in the SRA, with over 6000 petabases available as open-access data [1]. Thus, this large amount of data has provided significant challenges for data storage, bioinformatic analysis and management. This impressive and potentially useful amount of data concomitantly raised two issues: (I) high logistical costs of data management and (II) large amounts of neglected and unused data awaiting secondary analysis and repurposing. In the specific case of large plant sequencing project datasets, virome studies are scarce.
Abundant novel viruses, many of them not known to induce any apparent symptoms in their host or without a known host, have been identified from diverse environments using metagenomic approaches. This has highlighted our limited knowledge about the richness of a continuously expanding plant virosphere, which appears highly diverse in every potential host assessed so far [2,3,4,5]. Furthermore, the great number of viruses recently discovered by HTS, a miniscule portion of the virosphere, allowed a first glimpse of the path to a comprehensive megataxonomy of the virus world [6].
The scientific interest of the submitters of transcriptome datasets is often limited to a narrow objective within their specific field of study, which leaves a large amount of potentially valuable data not analyzed [7]. In such transcriptome datasets, viral sequences may be hidden in plain sight; thus, their analysis has become a valuable tool for the discovery of novel viral sequences [8,9,10,11,12,13,14,15,16]. In a recent consensus statement report, Simmonds and colleagues [17] contended that viruses that are known only from metagenomic data can, should and have been incorporated into the official classification scheme overseen by the International Committee on Taxonomy of Viruses (ICTV). Consequently, the analysis of public sequence databases constitutes a valuable resource for the discovery of novel plant viruses, which allows the reliable identification and characterization of new viruses in hosts with no previous record of virus infections [8]. This approach to virus discovery is inexpensive, as it does not require the acquisition of samples and subsequent sequencing but on secondary analyses of publicly available data to address novel research questions and objectives. At the same time, it is more wide-ranging and comprehensive than any other current approach due to the millions of datasets from a large variety of potential host species available from the NCBI-SRA [12].
Plant rhabdoviruses have negative-sense, single-stranded RNA genomes and are taxonomically classified in six genera: Cytorhabdovirus, Alphanucleorhabdovirus, Betanucleorhabdovirus and Gammanucleorhabdovirus for viruses that have an unsegmented genome and Dichorhavirus and Varicosavirus for viruses that have a bisegmented genome and infect both monocot and dicot plants [18]. These six genera were recently assigned to the subfamily Betarhabdovirinae within the family Rhabdoviridae [19]. Viruses classified in five of these genera are transmitted persistently by arthropods in which they also replicate [18,20], whereas varicosaviruses are transmitted by soil-borne chytrid fungi [18]. Cyto- and nucleorhabdovirus genomes have six conserved canonical genes encoding in the order 3′-nucleocapsid protein (N)-phosphoprotein (P)–putative movement protein (P3)-matrix protein (M)-glycoprotein (G)–large polymerase (L)-5′; the L gene of dichorhaviruses is located on RNA2 [21]. Up to three accessory genes with unknown functions have been identified among cyto- and nucleorhabdovirus genomes, leading to diverse genome organizations [21,22]. Conserved gene junction sequences separate each gene, and the overall coding region is flanked by 3′ leader and 5′ trailer sequences that feature partially complementary ends that may form a panhandle structure during replication [20]. Varicosavirus RNA 1 has one to two genes, with one of those encoding the RNA-dependent RNA polymerase L, while RNA 2 has three to five genes, with the first open reading frame (ORF) encoding a coat protein [20,21]. The 3′- and 5′-terminal sequences of the two varicosavirus genome segments are similar but do not exhibit inverse complementarities [21].
In this study, we queried the publicly available plant transcriptome datasets in the transcriptome shotgun assembly (TSA) database hosted at NCBI and identified 27 novel plant rhabdoviruses from 25 plant and one insect species, showing structural, functional and evolutionary cues to be classified in the family Rhabdoviridae; subfamily Betarhabdovirinae and genera Cytorhabdovirus, Alphanucleorhabdovirus, Betanucleorhabdovirus, Dichorhavirus and Varicosavirus.

2. Materials and Methods

2.1. Identification of Plant Rhabdovirus Sequences from Public Plant Transcriptome Datasets

The detection of plant rhabdovirus sequences was done as described by Longdon and colleagues [13]. Briefly, the amino acid sequences corresponding to the nucleocapsid and polymerase proteins of several known cyto- and nucleorhabdoviruses were used as query in tBlastn searches with the parameters word size = 6, expected threshold = 10 and scoring matrix = BLOSUM62 against the Viridiplantae (taxid:33090) and Hemiptera (taxid:7524) TSA databases. The obtained hits were explored by eye and based on the percentage identity, query coverage and E-value (>1 × 105), shortlisted as likely corresponding to novel virus transcripts, which were then further analyzed. Given the redundant nature of many retrieved hits, a step of contig clustering was implemented using the CD-hit suite with the standard parameters available at http://weizhongli-lab.org/cdhit_suite/cgi-bin/index.cgi?cmd=cd-hit (accessed on 10 March 2021). In addition, the raw sequence data corresponding to several SRA experiments associated with different NCBI Bioprojects (Table 1) were retrieved for further analyse

2.2. Sequence Assembly and Identification

The nucleotide (nt) raw sequence reads from each analyzed SRA experiment linked to the TSA projects returning rhabdovirus-like hits were downloaded and preprocessed by trimming and filtering with the Trimmomatic tool, as implemented in http://www.usadellab.org/cms/?page=trimmomatic (accessed on 15 March 2021), and the resulting reads were assembled de novo with Trinity v2.6.6 using the standard parameters. The transcripts obtained from de novo transcriptome assembly were subjected to bulk local BLASTX searches (E-value < 1 × 105) against a Refseq virus protein database available at ftp://ftp.ncbi.nlm.nih.gov/refseq/release/viral/viral.1.protein.faa.gz (accessed on 3 March 2021). The resulting viral sequence hits of each bioproject were visually explored. Tentative virus contigs were extended by iterative mapping of each SRA library’s raw reads. This strategy employed BLAST/nhmmer to extract a subset of reads related to the query contig, and these retrieved reads were used to extend the contig, and then, the process was repeated iteratively using as the query the extended sequence. The extended and polished transcripts, now presenting overlapping regions, were reassembled using the Geneious v8.1.9 (Biomatters Ltd., Auckland, New Zealand) alignment tool.

2.3. Bioinformatics Tools and Analyses

2.3.1. Sequence Analyses

ORFs were predicted with ORFfinder (https://0-www-ncbi-nlm-nih-gov.brum.beds.ac.uk/orffinder/) (accessed on 19 March 2021), functional domains, and the architecture of the translated gene products was determined using InterPro (https://www.ebi.ac.uk/interpro/search/sequence-search) (accessed on 19 March 2021) and the NCBI Conserved domain database v3.16 (https://0-www-ncbi-nlm-nih-gov.brum.beds.ac.uk/Structure/cdd/wrpsb.cgi) (accessed on 19 March 2021). Further, HHPred and HHBlits as implemented in https://toolkit.tuebingen.mpg.de/#/tools/ (accessed on 19 March 2021) were used to complement the annotation of divergent predicted proteins by hidden Markov models. Importin-α-dependent nuclear localization signals were predicted using cNLS Mapper available at http://nls-mapper.iab.keio.ac.jp/ (accessed on 23 March 2021), nuclear export signals were predicted using NetNES 1.1 available at www.cbs.dtu.dk/services/NetNES/ (accessed on 19 March 2021) and transmembrane domains were predicted using the TMHMM version 2.0 tool (http://www.cbs.dtu.dk/services/TMHMM/) (accessed on 23 March 2021).

2.3.2. Pairwise Sequence Identity

Percentage amino acid (aa) sequence identities of the predicted ORFs of each plant-associated rhabdovirid identified in this study based on the available plant-associated rhabdovirus genome sequences were calculated using https://www.ebi.ac.uk/Tools/psa/emboss_needle/ (accessed on 23 March 2021).

2.3.3. Phylogenetic Analysis

Phylogenetic analysis based on the predicted nucleocapsid proteins of all plant rhabdovirids, listed in Table S1, was done using MAFFT 7 https://mafft.cbrc.jp/alignment/software (accessed on 29 March 2021), with multiple aa sequence alignments using FFT-NS-i as the best-fit model. The aligned aa sequences were used as the input in MegaX software [23] to generate phylogenetic trees by the maximum-likelihood method (best-fit model = WAG + G + F). Local support values were computed using bootstraps with 1000 replicates.
Table 1. Summary of rhabdovirus contigs identified from plant transcriptome data available in the National Center for Biotechnology Information (NCBI) database.
Table 1. Summary of rhabdovirus contigs identified from plant transcriptome data available in the National Center for Biotechnology Information (NCBI) database.
GenusPlant/Insect Host; Monocot/DicotVirus NameAbbreviationBioproject IDData Citation Segment NumberLength (nt)Accession NumberProtein IDLength (aa)
AlphanucleorhabdovirusBlue agave (Agave tequilana); monocotAgave tequilana virus 1ATV1PRJNA193469[24]113,166BK014297N
P
P3
M
G
L
452
335
287
257
597
1937
BetanucleorhabdovirusCommon milkweed (Asclepias syriaca); dicotAsclepias syriaca virus 2AscSyV2PRJNA210776[25]112,940BK014299N
P
P3
M
G
L
459
337
343
250
643
2023
Giant dodder (Cuscuta reflexa); dicotCuscuta reflexa virus 1CusReV1PRJNA290291 [26]18869 *BK014340; BK014341; BK014342; BK014343; BK014344; BK014345N
P
P3
M
G
L
457
336
325
269 *
550 *
740 *
Small water-pepper
(Persicaria minor); dicot
Persicaria minor virus 1PerMiV1PRJNA208436[27]14793 *BK014346; BK014347; BK014348; BK014349; BK014350; BK014351; BK014352; BK014353; BK014354 N
P
P3
M
G
L
372 *
115 *
172 *
227 *
260 *
430 *
Cubanoregano
(Plectranthus aromaticus); dicot
Plectranthus aromaticus virus 1PleArV1PRJNA491230Ab Rahim, M.H.;
University Malaysia Pahang, Malaysia; unpublished
112,994BK014300N
P
P3
M
G
L
450
332
321
286
582
2085
Red rhododendron
(Rhododendron delavayi); dicot
Rhododendron delavayi virus 1RhoDeV1PRJNA358123[28]113,719BK014301N
P
P3
M
G
L
464
338
326
279
636
2016
CytorhabdovirusChinese onion
(Allium chinense); monocot
Allium chinense virus 1AChV1PRJNA310810[29]17981 *BK014319; BK014320; BK014321; BK014322; BK014323; BK014324N
P
P′
P3
M
G
P6
L
457
316
128
231
165
542 *
-
531*
Flamingo lily
(Anthurium amnicola); monocot
Anthurium amnícola virus 1AntAmV1PRJNA288827[30]112,480BK014302N
P
P3
M
G
L
409
314
207
176
565
2047
A. syriacaAsclepias syriaca virus 1AscSyV1PRJNA210776[25]113,392BK014298N
P
P′
P3
M
G
P6
L
445
304
87
334
176
551
86
2101
Silverleaf whitefly (Bemisia tabaci)Bemisia tabaci -associated virus 1BeTaV1PRJNA237273[31]113,025BK014303N
P
P3
P4
M
G
L
447
326
187
48
206
579
2102
Yam
(Dioscorea composita)
Dioscorea composita virus 1DiCoV1PRJNA253902[32]19959 *BK014355; BK014356; BK014357; BK014358N
P
P3
M
G
P6
L
441
N/A
N/A
201
586
72
1546 *
Beach silvertop
(Glehnia littoralis); dicot
Glehnia littoralis virus 1GlLV1PRJNA248158[33]112,193BK014304N
P
P′
P3
M
G
P6
L
412
324
84
201
167
551
66
2072
Marsh fragrant orchid (Gymnadenia densiflora); monocotGymnadenia densiflora virus 1GymDenV1PRJNA504609[34]19887BK014305N
P
M
L
444
310
189
2068
Bird’s foot trefoil (Lotus corniculatus); dicotLotus corniculatus virus 1LotCorV1PRJNA77207[35]112,599 *BK014306N
P
P′
P3
M
G
P6
L
479
311
98
362
179
558
56
1985 *
European white waterlily (Nymphaea alba); dicotNymphaea alba virus 1NymAV1PRJNA472003Unlu E.S., and Yildiz, G.G; Abant Izzet Baysal University, Turkey; unpublished112,886BK014307N
P
P′
P3
M
G
P6
L
421
308
85
228
195
560
69
2076
Crowfoot geranium (Pelargonium radula), dicotPelargonium radula virus 1PelRaV1PRJNA491235Ab Rahim, M.H.;
University Malaysia Pahang, Malaysia; unpublished
111,130 *BK014325; BK014326; BK014327; BK014328N
P
P′
P3
M
G
P6
L
483
309
69
354
177
578 *
54
1391*
CytorhabdovirusSeepweed (Suaeda salsa); dicotSuaeda salsa virus 1SuSV1PRJNA395283[36]16156 *BK014359; BK014360; BK014361; BK014362; BK014363; BK014364; BK014365; BK014366N
P
P′
P3
M
G
P6
L
332 *
294 *
66 *
218
165
70 *
-
711*
African marigold (Tagetes erecta); dicotTagetes erecta virus 1TaEV1PRJNA431782[37]111,707BK014308N
P
P3
M
L
520
505
200
187
2079
Ajwain (Trachyspermum ammi); dicotTrachyspermum ammi virus 1TrAV1PRJNA359623[38]110,920BK014309N
P
P3
M
L
455
343
237
200
2069
DichorhavirusHidden violet
(Viola verecunda); dicot
Viola verecunda virus 1VVeV1PRJNA345302[39]1




2
5304 *




5212 *
BK014329;

BK014330;

BK014331
BK014332;
BK014333
N
P
P3
M
G
L
494
N/A
323
N/A
661
1668 *
VaricosavirusMouse garlic (Allium angulosum); monocotAllium angulosum virus 1AAnV1PRJNA542932[40]1
2
6679
4560
BK059208
BK059209
L
N
2
3
2048
481
454
129
Bok choy (Brassica rapa subsp. chinensis); dicotBrassica rapa virus 1BrRV1PRJNA396268[41]1
2
6397
4068
BK014310
BK014311
L
N
2
3
2019
435
411
180
Perennial ryegrass (Lolium perenne); monocotLolium perenne virus 1LoPV1PRJNA222646[42]1
2
6302
4167
BK014312
BK014313
L
N
2
3
2029
533
379
161
Asian cow-wheat
(Melampyrum roseum); dicot
Melampyrum roseum virus 1MelRoV1PRJDB5395[43]1
2
6365
4635
BK014314
BK014315
L
N
2
3
4
2003
443
350
304
193
Downy phlox (Phlox pilosa); dicotPhlox pilosa virus 1PhPiV1PRJNA360978[44]1



2
4896 *



3201 *
BK014334; BK014335; BK014336; BK014337
BK014338;

BK014339
L



N
2
3
4
1589 *



378
81 *
289
110*
Limber pine (Pinus flexilis); gymnospermPinus flexilis virus 1PiFleV1PRJNA315892[45]111,740BK014316N
2
3
4
L
405
447
318
219
2049
Spinach (Spinacia oleracea); dicotspinach virus 1SpV1PRJDB3392[46]1
2
6151 *
3750 *
BK014317
BK014318
L
N
2
3
2010 *
439
450
136*
* Partial sequence; N/A: not available.

3. Results

3.1. Summary of Discovered Rhabdovirid Sequences

The complete coding regions of 17 novel rhabdoviruses were identified; in addition, partial genomic sequences for 10 novel viruses were assembled. These viruses were associated with 25 plant host species and one insect species (Table 1). The bioinformatic and source data of each of the 27 viral sequences, as well as the GenBank accession number and proposed classification, are listed in Table 1; the summary of the assembly statistics of each virus of the plant rhabdovirus sequences identified from the transcriptome data available in the NCBI database are presented in Table S2. Based on phylogenetic relatedness, genome organization and sequence identity, the novel viruses were tentative assigned to the established plant rhabdovirus genera Alphanucleorhabdovirus, Betanucleorhabdovirus, Cytorhabdovirus, Dichoravirus and Varicosavirus. Most of the tentative plant hosts of the novel viruses are herbaceous dicots (16/25), seven are herbaceous monocots, one a woody dicot and one a gymnosperm (Table 1). The characteristics of deduced proteins encoded by each rhabdovirid sequence were determined by predictive algorithms and are shown in Table S3. The genomic architecture and evolutionary placement of the 27 discovered viruses are described below, grouped by affinity to members of the diverse genera within the Betarhabdovirinae.

3.2. Alphanucleorhabdovirus

The complete coding region of a novel putative alphanucleorhabdovirus, tentatively named Agave tequilana virus 1 (ATV1), with the genome organization 3′-N-P–P3-M-G-L-5′ (Figure 1A) was assembled from blue agave transcriptome data (Table 1). A nuclear localization signal (NLS) was predicted in every ATV1-encoded protein (Table S3). According to the NLS scores, the N protein is predicted to be located exclusively in the nucleus, whereas the P and L proteins have a partial nuclear localization, while the P3, M and G proteins are localized to both the nucleus and the cytoplasm. Leucine-rich nuclear export signals (NES) were predicted in the N, P and L proteins (Table S3). A transmembrane domain motif was detected in the C-terminus of the G protein, and a signal peptide was predicted in its N-terminus (Table S3). The consensus gene junction sequence 3′-AUUCUUUUUGGGUUG-5′ of the ATV1 genome is similar to that of the alphanucleorhabdoviruses maize mosaic virus (MMV), maize Iranian mosaic virus (MIMV), Morogoro maize-associated virus (MMaV) and taro vein chlorosis virus (TaVCV) (Table 2).
Pairwise aa sequence identities between the ATV1-encoded proteins and those from other alphanucleorhabdoviruses showed low sequence identities of 10.9–35.0% (Table 3). The nucleotide sequence identity for the complete genome sequence of ATV1 and other alphanucleorhabdoviruses ranged from 47–49.3 % (Table 3).
The phylogenetic analysis based on the N protein aa sequences showed that ATV1 clustered with the monocot-infecting alphanucleorhabdoviruses MMV, MIMV, MMaV and TaVCV (Figure 2), which were also the most similar in pairwise sequence identity values, shared equivalent genome organizations and had similar conserved gene junction sequences (Table 2).

3.3. Betanucleorhabdoviruses

The complete coding regions of three putative betanucleorhabdoviruses, tentatively named Asclepias syriaca virus 2 (AscSyV2), Plectranthus aromaticus virus 1 (PleArV1) and Rhododendron delavayi virus 1 (RhoDeV1), as well as the partial genomes of Cuscuta reflexa virus 1 (CusReV1) and Persicaria minor virus 1 (PerMiV1), were assembled in this study (Table 1). The genome organization of these five viruses is 3′-N-P–P3-M-G-L-5′ (Figure 1A). A NLS was predicted in every protein encoded by AscSyV2, CusReV1, PleArV1 and RhoDeV1 (Table S3). According to the NLS scores, the CusReV1 and PleArV1 N proteins, RhoDeV1 P protein, CusReV1 M protein and PleArV1 and RhoDeV1 L proteins are expected to be exclusively nuclear, whereas the RhoDeV1 M protein is predicted to have a partial nuclear localization. The AscSyV2 and RhoDeV1 N proteins; AscSyV2, CusReV1 and PleArV1 P proteins; AscSyV2, CusReV1, PleArV1 and RhoDeV1 P3 proteins; AscSyV2 and PleArV1 M proteins; AscSyV2, CusReV1, PleArV1 and RhoDeV1 G proteins and AscSyV2 and CusReV1 L proteins are predicted to localize to both the nucleus and the cytoplasm. Leucine-rich NES were predicted in the AscSyV2, CusReV1 and PleArV1 N proteins; the CusReV1 and PleArV1 P proteins; the CusReV1 and RhoDeV1 P3 proteins; the PleArV1 M protein and the PleArV1 and RhoDeV1 L proteins (Table S3). A transmembrane domain was detected in the C-terminal sequence of the AscSyV2, CusReV1, PleArV1 and RhoDeV1 G proteins. A signal peptide was detected in the G protein N-terminus of AscSyV2, CusReV1 and RhoDeV1 (Table S3).
The deduced AscSyV2 M protein, PleArV1 G protein and RhoDeV1 L protein are the smallest such proteins reported so far among betanucleorhabdoviruses (Table S4).
Interestingly, the N proteins of AscSyV2, PleArV1 and RhoDeV1 are basic, whereas the CusReV1 N protein is acidic. Moreover, the CusReV1 and PleArV1 M proteins are basic, while the AscSyV2 and RhoDeV1 M proteins are acidic (Table S3).
The consensus gene junction sequence 3′-AUUCUUUUU GG UUG-5′ of all five novel viral genomes is identical and the same as in the genome of every betanucleorhabdovirus described thus far: alfalfa-associated nucleorhabdovirus (AaNV), apple rootstock virus A (ApRVA), Bacopa monnieri virus 2 (BmV2), bird’s-foot trefoil associated virus 1 (BFTaV1), black currant-associated rhabdovirus (BCaRV), cardamom vein clearing virus 1 (CdVCV1), datura yellow vein virus (DYVV), green Sichuan pepper nucleorhabdovirus (GSPNuV), sonchus yellow net virus (SYNV) and sowthistle yellow vein virus (SYVV) (Table 2).
Pairwise aa sequence identity values between each encoded protein of the five novel viruses and those from other betanucleorhabdoviruses vary significantly (Table 4). The highest nucleotide sequence identity for the genome sequence between AscSyV2, PleArV1 and RhoDeV1 and other betanucleorhabdoviruses ranged from 51.4 to 62.1% (Table 4).
The phylogenetic analysis based on the deduced N protein aa sequences showed that CusReV1, RhoDeV1, PerMiV1 and PleArV1 clustered with the betanucleorhabdoviruses BFTaV1, BCaRV, BmV2, CdVCV1, DYVV, GSPNuV, SYNV and SYVV (Figure 2), which are also the most similar in pairwise sequence identity values of their cognate proteins; furthermore, all these viruses have a similar genome organization. Interestingly, AscSyV2 clustered closely with the betanucleorhabdoviruses AaNV and ApRVA and had similar pairwise sequence identity values for their cognate proteins. However, the genome organization of AaNV and ApRVA was different in that it included an accessory gene between the M and G genes.

3.4. Cytorhabdoviruses

The complete coding region of eight novel cytorhabdoviruses, tentatively named Anthurium amnicola virus 1 (AntAmV1), Asclepias syriaca virus 1 (AscSyV1), Bemisia tabaci-associated virus 1 (BeTaV1), Glehnia littoralis virus 1 (GlLV1), Gymnadenia densiflora virus 1 (GymDenV1), Nymphaea alba virus 1 (NymAV1), Tagetes erecta virus 1 (TaEV1) and Trachyspermum ammi virus 1 (TrAV1), as well as the partial genomes of five other cytorhabdoviruses, named Allium chinense virus 1 (AChV1), Dioscorea composite virus 1 (DiCoV1), Lotus corniculatus virus 1 (LotCorV1), Pelargonium radula virus 1 (PelRaV1) and Suaeda salsa virus 1 (SuSV1), were assembled in this study (Table 1). The genome organization of these thirteen novel cytorhabdoviruses is highly diverse. AntAmnV1 and likely DiCoV1 lack additional accessory genes and have the genome organization 3′-N-P–P3-M-G-L-5′ (Figure 1B). AscSyV1, GlLV1, LotCorV1, NymAV1, PelRaV1 and, likely, AChV1 and SuSV1 have an accessory ORF between the G and L genes, with the genome organization 3′-N-P–P3-M-G-P6-L-5′ (Figure 1B), while BeTaV1 have an accessory ORF between the P3 and M genes, with the genome organization 3′-N-P–P3-P4-M-G-L-5′ (Figure 1B). The genome organization of GmyDenV1, TaEV1 and TrAV1 differs from that of all other known plant rhabdoviruses: the GymDenV1 genome appears to only have four genes in the order 3′-N-P–M-L-5′ (Figure 1B), while TaEV1 and TrAV1 genomes have five genes in the order 3′-N-P–P3-M-L-5′ but do not encode a glycoprotein (Figure 1B). An overlapping ORF within the P-encoding ORF, named P′, is present in the AChV1, AscSyV1, GilV1, LotCorV1, NymAV1, PelRaV1 and SuSV1 genomes (Figure 1B and Table 1). The predicted TaEV1 N protein is the largest described so far among the cytorhabdoviruses, whereas the AntAmV1 L protein is the smallest reported so far among the cytorhabdoviruses (Table S3).
Interestingly, the predicted N protein of almost every cytorhabdovirus identified in our study is acidic, but the PelRaV1 N protein is neutral, and the BeTaV1 N protein is basic. The P protein of almost every cytorhabdovirus identified in our study is acidic, but the TrAV1 P protein is basic. Moreover, the M protein of almost every cytorhabdovirus identified in our study is basic, except GymDenV1 and TrAV1 M proteins, which are acidic. Furthermore, the predicted G proteins of AscSyV1, DiCoV1, GlLV1, NymAV1 and PelRaV1 are acidic, except the LotCorV1 G protein is neutral, and the AntAmV1 and BeTaV1 G proteins are basic (Table S3).
A putative transmembrane domain was identified in the G protein C-terminal sequence of AChV1, AntAmV1, AscSyV1, BeTaV1, DiCoV1, GlLV1, LotCorV1, NymAV1 and PelRaV1; two additional transmembrane domains were identified in the G protein N-terminus of AntAmV1 and BeTaV1 and another one in the N-terminus of PelRaV1 G protein. A transmembrane domain was also predicted in every P6 protein of these viruses (Table S3). Moreover, one transmembrane domain was predicted in the SuSV1 P′ protein and two in the AChV1, GlLV1, LotCorV1 and PelRaV1 P′ proteins, while three were predicted in the AscSyV1 and NymAV1 P′ proteins (Table S3). A signal peptide sequence was identified in the G protein N-terminus of AChV1, AscSyV1, GlLV1, LotCorV1, NymAV1 and PelRaV1 (Table S3). Interestingly, a leucine-rich NES was predicted in the AChV1, AntAmV1, AscSyV1, BeTaV1, DiCoV1, GlLV1, GymDenV1, LotCorV1, PelRaV1, SuSV1, TaEV1 and TrAV1 N proteins; the BeTaV1, GymDenV1, LotCorV1, NymAV1, PelRaV1, SuSV1 and TrAV1 P proteins; the AChV1, AntAmV1, AscSyV1, DiCoV1, GymDenV1, LotCorV1, NymAV1, PelRaV1, SuSV1 and TrAV1 M proteins and the PleArV1 and RhoDeV1 L proteins (Table S3).
The consensus gene junction sequences of the novel cytorhabdoviruses identified in our study are diverse (Table 2) but similar to those previously reported for phylogenetically related cytorhabdoviruses. Interestingly, the intergenic sequence of the GymDenV1 and TrAV1 gene junctions starts with an adenine instead of a guanine residue like in every other plant rhabdovirus (Table 2).
Pairwise aa sequence identity values between each encoded protein of the thirteen novel viruses and those from known cytorhabdoviruses vary significantly (Table 5). The highest nucleotide sequence identity for the genome sequence between AntAmV1, AscSyV1, BeTaV1, GlLV1, GymDenV1, NymAV1, TaEV1 and TrAV1 and the other cytorhabdoviruses ranged between 50.3 and 65.6% (Table 5).
The phylogenetic analysis based on the N protein aa sequence showed that the thirteen novel viruses grouped with known cytorhabdoviruses GlLV1 and NymAV1 clustered with Kenyan potato cytorhabdovirus (KePCyV), Trifolium pratense virus A (TpVA), strawberry virus 1 (StrV1) and tomato yellow mottle-associated virus (TYMaV), with whom they share a similar genomic organization. AChV1 and SuSV1 formed a clade with alfalfa dwarf virus (ADV), chrysanthemum yellow dwarf-associated virus (ChYDaV), raspberry vein chlorosis virus (RVCV) and strawberry crinkle virus (SCV). AscSyV1, LotCorV1 and PelRaV1 clustered with Actinidia cytorhabdovirus (AcCV), Bacopa monnieri virus 1 (BmV1) and Wuhan insect virus 4 (WhIV4), and all these viruses have a similar genomic organization. BeTaV1 clustered with cucurbit cytorhabdovirus 1 (CuCV1), and these viruses share a similar genomic organization. AntAmV1 clustered with paper mulberry mosaic-associated virus (PMuMaV), with which it shares a similar genomic organization. TaEV1 clustered with the cluster formed by AntAmV1 and PMuMaV, but these viruses have a dissimilar genomic organization. DiCoV1 clustered with Colocasia bobone disease-associated virus (CBDaV), but these viruses have a dissimilar genomic organization, while GymDenV1 and TrAV1 formed a monophyletic sister clade with the aphid-transmitted cytorhabdoviruses (Figure 2).

3.5. Dichorha-Like Viruses

The near-complete RNA 1 and RNA 2 sequences of a dichorha-like virus tentatively named Viola verecunda virus 1 (VVeV1) were identified in this study. We assembled three nonoverlapping fragments corresponding to RNA 1 containing the complete N, P3 and G genes and the near-complete RNA 2, which encodes the L protein (Table 1). Thus, the likely genome organization of VVeV1 is 3′-N-P-P3-M-G-5′ for RNA 1 and 3′-L-5′ for RNA 2 (Figure 1D). A NLS was predicted in every deduced VVe1 protein (Table S3). According to the NLS scores, the N and L proteins should be exclusively localized to the nucleus, whereas the P3 and G proteins are predicted to localize to both the nucleus and the cytoplasm. Leucine-rich NESs were predicted in the N and L protein sequences (Table S3). A transmembrane domain was identified in the C-terminus of the G protein (Table S3). We were not able to identify a consensus gene junction sequence in the genome fragments of VVeV1 (Table 2). Pairwise aa sequence identities between the VVeV1-encoded proteins and those from known dichorhaviruses showed low sequence identities of 20% or less in the P3 and G proteins and 20.5–23.1% in the N protein (Table 6). The phylogenetic analysis based on the N protein aa sequences showed that VVeV1 clusters with the dichorhaviruses but is placed by itself in a sister clade (Figure 2).

3.6. Varicose-Like Viruses

The complete coding sequences of five varicosa-like viruses, tentatively named Allium angulosum virus 1 (AAnV1), Brassica rapa virus 1 (BrRV1), Lolium perenne virus 1 (LoPV1), Melampyrum roseum virus 1 (MelRoV1) and Pinus flexilis virus 1 (PiFleV1), as well as the partial genomes of two others, named Phlox pilosa virus 1 (PhPiV1) and spinach virus 1 (SpV1), were assembled in this study (Table 1). The genomes of AAnV1, BrRV1, LoPV1, MelRoV1, PhPiV1 and SpV1 consist of two RNA segments. RNA 1 of AAnV1, BrRV1, LoPV1, MelRoV1 and SpV1 encode only the L protein, whereas RNA 1 of PhPiV1 likely carries an additional gene 3′ to the L gene (Figure 1C). RNA 2 of AAnV1, BrV1, LoPV1 and SpV1 has three genes in the order 3′-N-P2-P3-5′, RNA 2 of MelRoV1 has four genes in the order 3′-N-P2-P3-P4 -5′ and RNA 2 of PhPiV1 likely has five genes in the order 3′-N-P2-P3-P4-P5-5′ (Figure 1C). Based on our assemblies, the PiFleV1 genome does not appear to be segmented like the genomes of other varicosaviruses and has five ORFs in the order 3′-N-P2-P3-P4-L-5′ (Figure 1C), suggesting that varicosa-like viruses may also have unsegmented genomes.
The predicted LoPV1 N protein is the longest varicosavirus N protein described so far, whereas the MelRoV1 L protein is the smallest L protein described so far among the varicosaviruses, while the PiFleV1 L protein is the largest varicosavirus L protein described so far (Table S4). Interestingly, the PiFLeV1 N protein is basic, whereas the N proteins of AAnV1, BrRV1, LoPV1, MelRoV1, PhPiV1 and SpV1 are acidic. Moreover, AAnV1 and BrR1 P2s are basic, while P2s of LoPV1, MelRoV1, PiFleV1 and SpV1 are acidic. Furthermore, P3s of AAnV1, MelRoV1, PhPiV1 and PiFleV1 are basic, whereas the BrRV1 and LoPV1 P3s are acidic (Table S3).
Based on the NLS scores, AAnV1 L and P2, LoPV1 N and P2 and the MelRoV1 L protein are expected to be exclusively located in the nucleus. The BrRV1 N; BrRv1, PiFleV1 and SpV1 P2; PiFleV1 P3 and BrRv1, LoPV1 and SpV1 L proteins are predicted to localize to both the nucleus and the cytoplasm. Leucine-rich NESs are predicted in the BrRV1, MelRoV1, PhPiV1, PiFleV1 and SpV1 N proteins; the AAnV1, BrRV1, LoPV1, MelRoV1, PiFleV1 and SpV1 P2; the BrRV1, LoPV1, MelRoV1, PhPiV1, PiFleV1 and SpV1 P3 and the MelRoV1, PhPiV1 and PiFleV1 P4 (Table S3). No transmembrane domain or signal peptide was detected in any of the proteins encoded by AAnV1, BrRV1, LoPV1, MelRoV1, PhPiV1, PiFleV1 and SpV1.
The consensus gene junction sequences 3′ AU(N)5UUUUUGCUCU 5′ of AAnV1, LoPV1, MelRoV1, PhP1V1 and SpV1 are identical and the same as in the genomes of the known varicosaviruses Alopecurus myosuroides varicosavirus 1 (AMVV1), lettuce big-vein associated virus (LBVaV, red clover-associated varicosavirus (RCaVV) and vitis varicosavirus (VVV). The gene junction sequences of BrRV1 and PiFleV1 differ slightly: the BrRV1 5′-end sequence is CUCA instead of CUCU, while the PiFleV1 3′-end sequence is GU(N)5UUUUU instead of AU(N)5UUUUU (Table 2).
The pairwise aa sequence identities between the cognate-encoded proteins of AAnV1, BrRV1, LoPV1, MelRoV1, PhPiV1, PiFleV1 and SpV1 and those from other varicosaviruses vary significantly (Table 7). The phylogenetic analysis based on the deduced N protein aa sequences placed all seven novel varicosa-like viruses into a clade with the known varicosaviruses. BrRV1 and SpV1 clustered with RCaVV, and these viruses have a similar genomic organization with the three ORFs on RNA 2. AAnVi clustered with the clade formed by LoPV1 and AMVV1, and these viruses infect monocots. PhPV1 clustered with LBVaV, while the unique PiFleV1 was placed in a sister clade to the clade composed by LBVaV and PhPV1, and MelRoV1 clustered with vitis varicosavirus (VVV) (Figure 2).

4. Discussion

In the last few years, several novel plant rhabdoviruses, which may be asymptomatic, have been reported in HTS studies [47,48,49,50,51,52,53,54,55]. Thus, it is tempting to speculate that some plant rhabdovirus-like sequences may be hidden in published plant transcriptome databases, generated with diverse objectives beyond virus research, where viral RNA was inadvertently copurified with endogenous plant RNA and sequenced. To prove this point, we and others have previously identified the sequences of a small number of plant rhabdoviruses contained in public transcriptome databases [8,9,16]; however, an extensive search has not been conducted to date. Therefore, we queried the publicly available plant transcriptome datasets in the transcriptome shotgun assembly (TSA) database hosted at NCBI, which resulted in the identification of 27 novel plant rhabdoviruses.

4.1. Discovery of Novel Plant Rhabdoviruses Expands the Diversity and Evolutionary History of Rhabdovirids

The coding-complete or complete genomic sequences of 66 plant rhabdoviruses were reported by early 2021. Thirty-three of them are cytorhabdoviruses, thirteen alphanucleorhabdoviruses, ten betanucleorhabdoviruses, five dichorhaviruses, four varicosaviruses and one gammanucleorhabdovirus. Similarly, half of the novel sequences reported in this study are those of putative cytorhabdoviruses, indicating the extensive diversification of this group of viruses. The novel 27 viruses discovered in this study all appear to be members of new species and account for nearly half of the plant rhabdovirus species reported so far. In addition, five new putative betanucleorhabdoviruses shed more light on the diversity and evolution of this recently created genus. One newly identified putative alphanucleorhabdovirus and one novel putative dichorhavirus expand the diversity of their respective genera. Finally, we detected seven novel varicosa-like viruses, which doubles the number of known varicosaviruses, providing new information about the genomic diversity and evolution of these little-studied fungi-transmitted viruses. Thus, our study provides the most complete insight to date about the genomic diversity and evolution of plant rhabdoviruses, complementing the status quo of these viruses with additional data on genomic organization and highlighting their apparent genetic flexibility. Overall, the sequence identity between the newly discovered viruses and those plant rhabdoviruses already described was low, a common feature among these taxa characterized by a high level of diversity in both genome sequence and organization [20]. The low sequence identity of the novel viruses with the closest previously described virus may indicate that there is still a significant amount of virus “dark matter” within the plant rhabdovirus space worth exploring that potentially contains a significant number of yet to be discovered plant rhabdoviruses. Future works should focus on the analysis of additional RNA datasets of diverse potential hosts of the partial virus genomes identified in this study.
Viruses assigned to different species within the genera Alphanucleorhabdovirus, Betanucleorhabdovirus and Cytorhabdovirus have a nucleotide sequence identity lower than 75% in the complete genome sequence and occupy different ecological niches as evidenced by differences in the host species and/or arthropod vectors (https://talk.ictvonline.org/ictv-reports/ictv_online_report/negative-sense-rna-viruses/w/rhabdoviridae) (accessed 11 April 2021). Cytorhabdoviruses may also have an aa sequence identity of less than 80% in all cognate ORFs. Viruses assigned to different species in the genus Dichorhavirus have less than 80% nucleotide sequence identity in the L gene. Based on these species demarcation criteria, ATV1 should be classified in a new Alphanucleorhabdovirus species and AscSyV2, PleArV1 and RhoDeV1 in a new species in the genus Betanucleorhabdovirus; due to incomplete sequences, PerMiV1 and CusReV1 cannot be classified based on these criteria. AntAmnV1, AscSyV1, BeTaV1, GlLV1, GymDenV1, LotCorV1, NymAV1, TaEV and TrAV1 should be classified in a new species in the genus Cytorhabdovirus, whereas AChV1, DiCoV1, PelRaV1 and SuSV1, due to incomplete genome sequence data, cannot be classified. The dichorha-like VVeV1 also awaits at least coding-complete sequence data before it can be formally classified in this genus. The species demarcation criteria recently proposed for the genus Varicosavirus by the ICTV Rhabdoviridae Study Group included a minimum nucleotide sequence divergence of 50% in the L protein. Based on this genetic distance, the varicosa-like viruses AAnV1, BrRV1, LoPV1, MelRoV1, PiFleV1 and SpV1 would not be considered different species, and PhPiV1 cannot be classified due to incomplete sequencing data. Considering the diverse host plant species these viruses were identified in and their phylogenetic relationships, these viruses should likely be classified in different species. Thus, we propose a nucleotide sequence identity of 75% across the genome and in the N gene as thresholds for species demarcation in the Varicosavirus genus, in line with the species demarcation criteria for the other genera of plant rhabdoviruses.

4.2. Host Range of the Novel Plant Rhabdoviruses

Most of the plant hosts in which the novel viruses of this study were identified are dicots; nevertheless, AChV1, which was detected in Chinese onions, is likely the first monocot-infecting cytorhabdovirus that belongs to a clade of aphid-transmitted viruses. Furthermore, most of the source hosts in which the novel viruses were identified are herbaceous plants, which, overall, are the most common hosts of plant rhabdoviruses [18]. However, one novel putative varicosavirus, Pinus flexilis virus 1, was associated with a gymnosperm host. Few viruses were identified previously from gymnosperms [12,56,57,58,59]. Interestingly, a putative RdRp sequence was identified in the gymnosperm Sciadopitys verticillata and proposed to belong to a varicosavirus [14]. Therefore, to our knowledge, this study is the first to report the complete coding sequence of a plant rhabdovirus associated with a gymnosperm host and presenting a unique unsegmented genome, redefining the genome architecture of varicosaviruses. Future virus discovery studies should focus on gymnosperm viromes, which have been understudied and likely hide a rich and diverse number of novel viruses.

4.3. Diverse Genome Organizations of Plant Rhabdoviruses

Interestingly, the genome sequence of four novel viruses discovered in this study, GymDenV1 (only four predicted genes), TaEV and TrAV1 (both with only five predicted genes) and PiFleV1 (an unsegmented varicosa-like virus), have a unique genome architecture among plant rhabdoviruses that differs from the consensus genome organization reported for previously known cytorhabdoviruses and varicosaviruses [21]. The genomes of the cytorhabdoviruses GymDenV1, TaEV1 and TrAV1 lack a glycoprotein ORF, which, for rhabdoviruses, is not essential for replication and systemic movement, as demonstrated using an infectious clone of Sonchus yellow net virus [60]. Furthermore, isolates of citrus-associated rhabdovirus (CiaRV) were recently shown to have an impaired G ORF [61].
Five canonical structural protein genes 3′-N-P-M-G-L-5′ are thought to be conserved among all rhabdoviruses [20], while plant rhabdovirus genomes contain at least six ORFs [21]. However, only four genes were predicted in the genome of the varicosavirus RCaVV [51]. Four genes were also identified in some of the varicosa-like virus genomes assembled in this study, suggesting that a minimal set of four genes may be sufficient for varicosaviruses to replicate in a plant host. This hypothetical minimal requirement appears to also apply to cytorhabdoviruses. For example, GymDenV1 encodes only the nucleocapsid core (NC) proteins N, P and L that are essential for virus replication and transcription and the M protein that is required for condensation of the core during virion assembly [20]. Thus, how GymDenV1 moves from cell to cell remains to be unraveled. Interestingly, no cell-to-cell movement protein has been identified so far in the varicosaviruses.
The discovery of diverse novel rhabdo-like viruses in metagenomics studies of arthropods [62] supports the assumption that arthropods have been fundamental to rhabdovirus evolution [63], and the G protein was found to be essential for virus attachment to predict cellular receptors in the midgut of arthropod vectors that facilitate virus uptake [20]. This agrees with the hypothesis that cytorhabdoviruses, nucleorhabdoviruses and dichorhaviruses evolved from viruses of plant-feeding arthropods that acquired movement proteins and assorted RNAi suppressors through recombination with preexisting plant viruses [3]. The viruses in these three genera appear to be the least plant-specialized among the negative-sense RNA viruses. However, the evolution of the nonenveloped negative-sense RNA plant viruses, such as the fungal-transmitted varicosaviruses, which likely do not encode a G protein, clearly reflects the adaptation to a plant-specific lifestyle, raising the possibility that their origin is via a trans-kingdom horizontal transfer between fungi and plants [3]. A G protein-defective genome was recently identified in citrus isolates of CiaRV, and the authors speculated that the eventual “simplification” of viral genomes to adapt to plants without requiring an arthropod vector could provide an evolutionary advantage, especially in fruit trees that are propagated artificially by asexual modes, such as cutting and grafting [61]. Nevertheless, the tentative hosts of GymDenV1, TaEV1 and TrAV1 are herbaceous plants; thus, an evolutionary advantage linked to the lack of a G protein is not obvious for these viruses. Strikingly, TaEV1 is phylogenetically related to arthropod-transmitted cytorhabdoviruses. Thus, further studies should focus on the potential vector and the mode of transmission of GymDenV1, TaEV1 and TrAV1 to complement their peculiar minimalist genome organization and evolutionary links with biological data.

4.4. Phylogenetic Relationships among Plant Rhabdoviruses as Predictors for Vector Types

Among all plant rhabdoviruses studied so far, there is a strong correlation between phylogenetic relationships and vector types [18,20]. We therefore predict that the novel betanucleorhabdoviruses are likely aphid-transmitted, while the putative dichorhavirus VVeV1 is likely transmitted by Brevipalpus mites, and the varicosa-like viruses may be transmitted by chytrid fungi. Based on its phylogenetic clustering, the alphanucleorhabdovirus ATV1 is likely transmitted by a planthopper. Among the novel cytorhabdoviruses identified in this study, AChV1, AscSyV1, GlLV1, LotCorV1, NymAV1, PelRaV1 and SuSV1 are likely aphid-transmitted, while the vectors for AntAmV1, DiCoV1, GymDenV1, TaEV1 and TrAV1 cannot be predicted. The BeTaV1 genome was assembled from whitefly transcriptome data and, therefore, is likely a whitefly-transmitted cytorhabdovirus. The L protein-like TSA sequences included in the assembly of the BeTaV1 genome were previously reported to be 95–98% identical to soybean blotchy mosaic virus partial L gene sequences [18]; thus, it is tempting to speculate that the plant host of BeTaV1 may be soybeans. Recently, it was reported that the bean-associated cytorhabdovirus is efficiently transmitted by whiteflies, which was the first report of a whitefly-transmitted rhabdovirus [64], thus supporting our hypothesis that soybean blotchy mosaic virus may represent the second whitefly-transmitted rhabdovirus.
At least one transmembrane domain was identified in each P′ protein predicted in the genomes of a group of cytorhabdoviruses discovered in this study, consistent with a previous analysis of other cytorhabdoviruses, which also identified at least one transmembrane domain in every P′ protein [65]. Interestingly, this overprinted accessory protein is encoded by every cytorhabdovirus that appears to be aphid-transmitted. On the other hand, the P′ protein is not encoded in the genomes of aphid-transmitted betanucleorhabdoviruses. Therefore, the potential function of the P′ protein is unlikely to be directly associated with the vector specificity.

4.5. Cytorhabdoviruses

Two of the cytorhabdoviruses, AChV1 and SuSV1, whose genomes were only partially assembled, clustered phylogenetically in a monophyletic clade with ADV, ChYDaV, RVCV and SCV, all the viruses that contain a P6 accessory ORF between the G and L genes [66,67,68,69]. Given this clustering, it is likely that AChV1 and SuSV1 will have a similar genomic organization. At least one transmembrane domain has been predicted in every cytorhabdovirus with an accessory P6 ORF. Transmembrane domains were also predicted in the accessory ORF between the G and L genes of other cytorhabdoviruses [49,65,70]. Thus, the protein encoded by this small ORF may have membrane-associated functions similar to viroporins in vertebrate rhabdoviruses [71].
The phylogenetic relationships of the now expanded number of known cytorhabdoviruses provide some support for splitting the genus Cytorhabdovirus to establish three genera that represent different evolutionary lineages: (I) Alphacytohabdovirus would include species for all aphid-transmitted cytorhabdoviruses; (II) Betacytorhabdovirus would include species for all those cytorhabdoviruses likely transmitted by leafhoppers, planthoppers and whiteflies, as well as AntAmnV1, DiCoV1 and TaEV1, and (III) Gammacytorhabdovirus would include species for GymDenV1 and TrAV1 (Figure 2).

4.6. Nucleorhabdoviruses

The ATV1 genome that was assembled from the transcriptome data of the monocot agave does not encode any accessory ORFs, like most of the alphanucleorhabdoviruses, except the group of PYDV-like viruses that encode an X ORF of unknown function [18]. ATV1, while branching into a sister clade, appears to have a close evolutionary relationship with the cluster of planthopper-transmitted monocot-infecting alphanucleorhabdoviruses MIMV, MMV, MMaV and TaVCV. It is tempting to speculate that this clade evolved from a common ancestor that adapted to infect monocots and to be transmitted by planthoppers. Furthermore, the isoelectric point (IEP) of ATV1-predicted proteins is similar to that of MIMV, MMV, MMaV and TaVCV-predicted proteins, thus supporting a link between these viruses and ATV1.
All the betanucleorhabdoviruses identified in this study, AscSyV2, CusReV1, PerMiV1, PleArV1 and RhoDeV1, appear to be associated with dicot hosts, which is consistent with every betanucleorhabdovirus reported so far [18]. The dicot host range of the betanucleorhabdoviruses is likely linked to their insect vector, given that every betanucleorhabdovirus where a vector has been experimentally determined is transmitted by aphids [18]. Although aphids can colonize both dicots and monocots [72], it has been suggested that these insects more successfully feed on dicots [73]. Regarding their genome organization, the five novel betanucleorhabdoviruses identified in this study encode six plant rhabdovirus ORFs in the conserved order 3′-N-P-P3-M-G-L-5′ without any accessory ORFs, like the majority of previously known betanucleorhabdoviruses, such as the well-studied SYNV. CusReV1, PerMiV1, PleArV1 and RhoDeV1 likely belong to the same evolutionary lineage as BFTaV1, BCaRV, CdVCV1, DYVV, GSPNuV, SYNV and SYVV, which may represent the ancestral clade within the betanucleorhabdoviruses. However, AscSyV2′s closest evolutionary relationship is with the betanucleorhabdoviruses ApRVA and AaNV, which differ in their genome organization by having an accessory ORF located between the M and G genes [46,74]. Interestingly, the IEP of the AscSyV2, CusReV1, PerMiV1, PleArV1 and RhoDeV1 N proteins is different to the IEP of the cognate proteins encoded by the phylogenetically related betanucleorhabdoviruses. It is unknown if this difference in the IEP may have a biological effect in terms of RNA or protein binding. Moreover, the RhoDeV1 M protein is acidic, while the BFTaV1, BCaRV, BmV2, CdVCV1, CusReV1, DYVV, GSPNuV, PleArV1, SYNV and SYVV N proteins are basic. It has been suggested that charge differences in the M proteins may be associated with differences in their abilities to interact with the negatively charged lipids of the membrane [75]. NLS and NES were predicted for most of the AscSyV2, ATV1, CusReV1, PerMiV1, PleArV1 and RhoDeV1-encoded proteins, which can be expected, since nucleorhabdoviruses replicate in the nuclei of infected cells [20].
The phylogenetic relationships of nucleorhabdoviruses, including the novel alpha- and betanucleorhabdoviruses discovered in our analysis, clearly support the recent split of the previous genus Nucleorhabdovirus into the three genera Alphanucleorhabdovirus, Betanucleorhabdovirus and Gammanucleorhabdovirus [18].

4.7. Varicosaviruses

Currently, there are only three varicosaviruses recognized by the ICTV, LBVaV, AMVV1 and RCaVV, and a novel varicosavirus, VVV, was recently identified [52]. Nevertheless, the available information regarding varicosavirus gene functions is generally scarce, and the functional roles of the varicosavirus P2, P3, P4, P5 and P6 proteins remain to be elucidated. No transmembrane domain was predicted in any of the proteins encoded by the varicosavirus genomes identified in this study, as well as in those already described; thus, it appears that no varicosavirus-encoded protein has a membrane-associated function. The AAnV1, L and P2 proteins; LoPV1 N and P2 proteins and AMVV1 and MelRoV1 L proteins are predicted to be exclusively located in the nucleus, whereas NES are predicted in most N, P2 and P3 proteins and some P4 proteins (Table S3). This suggests a potential role of these proteins in the cell nucleus.
The N protein encoded by the unsegmented PiFLeV1 is basic; on the other hand, the N proteins encoded by the bisegmented varicosaviruses AAnV1, AMVV1, BrRV1, LBVaV, LoPV1, MelRoV1, PhPiV1, RCaVV, SpV1 and VVV are acidic. This difference in the IEP may be associated with a different replication mechanism of the unsegmented PiFLeV1 compared to the segmented varicosaviruses. Moreover, AAnV1 and BrRV1 P2s are basic, while the P2s of AMVV1, LBVaV, LoPV1, MelRoV1, PiFleV1, RCaVV, SpV1 and VVV are neutral or acidic. The biological significance of this difference is unknown, because the functional role of P2 still needs to be unraveled.
LoPV1 and AMVV1 are phylogenetically closely related; their N and P3 aa sequences are >50% identical, and both are associated with grasses in the family Poaceae. However, LoPV1 RNA1 has one ORF, while AMVV1 RNA1 has two [76]. These monocot-infecting varicosaviruses are phylogenetically related to AAnV1, which also is associated with a monocot host and shares a similar genomic organization with LoPV1. PhPiV1 clustered with LBVaV; thus, it is likely that its genome, which was partially assembled, has a similar genomic organization to LBVaV, with two ORFs encoded in RNA1 and five in RNA2 [77]. Interestingly, the RNA2 of four of the novel varicosa-like viruses identified in this study have three ORFs, similar to RCaVV and AMVV1 RNA2 [51,76]. On the other hand, the RNA2 of one novel varicosa-like virus identified in our study has four ORFs. Thus, the number of ORFs identified in the RNA2 of every novel varicosa-like virus reported in this study, except for PhPiV1, is different from the first identified varicosavirus, LBVaV, and the recently described varicosavirus, VVV, which have five ORFs [52,77]. The RNA1 segments of AAnV1, BrRV1, LoPV1, SpV1 and MelRoV1 are similar to the RNA1 of RCaVV and VVV in that they only encode the L protein, whereas a small ORF before the L gene was identified in PhPiV1 and previously in AMVV1 and LBVaV [51,52].
Among the varicosa-like viruses identified in this study, PiFleV1 is unique in terms of genome organization, since its genome is unsegmented, a characteristic that differs from the currently known bisegmented genome architecture of varicosaviruses [21]. This may be an adaptation to its gymnosperm host. Unfortunately, we were not able to assemble the complete genome of the varicosavirus associated with the gymnosperm Sciadopitys verticillata previously reported by Mushegian and colleagues [14] to support this hypothesis. Based on genome organization and phylogenetic placement, PiFleV1 appears to be the first example where rhabdoviruses with segmented and unsegmented genomes are closely related and may be classified in the same genus.

5. Conclusions

In summary, this study illustrates the complexity and diversity of plant rhabdoviruses genomes and demonstrates that analyzing SRA public data is a valuable tool not only to accelerate the discovery of novel viruses but, also, to gain insight into their evolution and to refine virus taxonomy. However, the inability to go back to the biological material to confirm viral genome sequences and to link the presence of the viruses to a specific host is the main drawback of the data mining approach for virus discovery. This limitation could lead to the potential misidentification of host species linked to viruses. Thus, researchers need to be cautious when analyzing SRA public data for virus discoveries.

Supplementary Materials

The following are available online at https://0-www-mdpi-com.brum.beds.ac.uk/article/10.3390/v13071304/s1. Table S1: Virus names, abbreviations and NCBI accession numbers of the plant rhabdovirus sequences used in this study. Table S2: Summary of the assembly statistics of the plant rhabdovirus sequences identified from the transcriptome data available in the NCBI database. Table S3: Characteristics of the deduced proteins encoded by each assembled rhabdovirid sequence determined by predictive algorithms. Table S4: Size of each encoded protein of every publicly available alpha-, beta and gamma nucleorhabdovirus, cytorhabdovirus, varicosavirus and dichorhavirus.

Author Contributions

N.B., conceptualization; N.B., R.G.D. and H.D., data analysis; N.B., writing—original draft and N.B., R.G.D. and H.D., writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable for studies not involving humans or animals.

Informed Consent Statement

Not applicable for studies not involving humans.

Data Availability Statement

The sequences deposited in GenBank with accession numbers BK014297-014366 and BK059208-BK059209 are available as Supplementary Materials.

Acknowledgments

We would like to express sincere gratitude to the generators of the underlying data used for this work, which are cited in Table 1. By following open-access practices and supporting accessible raw sequence data in public repositories available to the research community, they promoted the generation of new knowledge and ideas. This work was jointly supported by the Queensland Government Department of Agriculture and Fisheries and the University of Queensland.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Gilbert, K.B.; Holcomb, E.E.; Allscheid, R.L.; Carrington, J.C. Hiding in plain sight: New virus genomes discovered via a systematic analysis of fungal public transcriptomes. PLoS ONE 2019, 14, e0219207. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Bejerman, N.; Debat, H.; Dietzgen, R.G. The Plant Negative-Sense RNA Virosphere: Virus Discovery Through New Eyes. Front. Microbiol. 2020, 11. [Google Scholar] [CrossRef] [PubMed]
  3. Dolja, V.V.; Krupovic, M.; Koonin, E.V. Deep Roots and Splendid Boughs of the Global Plant Virome. Annu. Rev. Phytopathol. 2020, 58, 23–53. [Google Scholar] [CrossRef] [PubMed]
  4. Lefeuvre, P.; Martin, D.P.; Elena, S.F.; Shepherd, D.N.; Roumagnac, P.; Varsani, A. Evolution and ecology of plant viruses. Nat. Rev. Microbiol. 2019, 17, 632–644. [Google Scholar] [CrossRef]
  5. Roossinck, M.J.; Martin, D.P.; Roumagnac, P. Plant Virus Metagenomics: Advances in Virus Discovery. Phytopathology 2015, 105, 716–727. [Google Scholar] [CrossRef] [Green Version]
  6. Koonin, E.V.; Dolja, V.V.; Krupovic, M.; Varsani, A.; Wolf, Y.I.; Yutin, N.; Zerbini, F.M.; Kuhn, J.H. Global Organization and Proposed Megataxonomy of the Virus World. Microbiol. Mol. Biol. Rev. 2020, 84, 1–33. [Google Scholar] [CrossRef] [PubMed]
  7. Bejerman, N.; Roumagnac, P.; Nemchinov, L.G. High-Throughput Sequencing for Deciphering the Virome of Alfalfa (Medicago sativa L.). Front. Microbiol. 2020, 11. [Google Scholar] [CrossRef]
  8. Debat, H.J.; Bejerman, N. Novel bird’s-foot trefoil RNA viruses provide insights into a clade of legume-associated enamoviruses and rhabdoviruses. Arch. Virol. 2019, 164, 1419–1426. [Google Scholar] [CrossRef]
  9. Goh, C.J.; Park, D.; Hahn, Y. Identification of Trichosanthes associated rhabdovirus 1, a novel member of the genus Cytorhabdovirus of the family Rhabdoviridae, in the Trichosanthes kirilowii transcriptome. Acta Virol. 2020, 64, 36–43. [Google Scholar] [CrossRef]
  10. Jiang, P.; Shao, J.; Nemchinov, L.G. Identification of emerging viral genomes in transcriptomic datasets of alfalfa (Medicago sativa L.). Virol. J. 2019, 16, 1–12. [Google Scholar] [CrossRef]
  11. Kim, H.; Park, D.; Hahn, Y. Identification of novel RNA viruses in alfalfa (Medicago sativa): An Alphapartitivirus, a Deltapartitivirus, and a Marafivirus. Gene 2018, 638, 7–12. [Google Scholar] [CrossRef] [PubMed]
  12. Lauber, C.; Seifert, M.; Bartenschlager, R.; Seitz, S. Discovery of highly divergent lineages of plant-associated astro-like viruses sheds light on the emergence of potyviruses. Virus Res. 2019, 260, 38–48. [Google Scholar] [CrossRef]
  13. Longdon, B.; Murray, G.G.R.; Palmer, W.J.; Day, J.P.; Parker, D.J.; Welch, J.J.; Obbard, D.; Jiggins, F.M. The evolution, diversity, and host associations of rhabdoviruses. Virus Evol. 2015, 1, vev014. [Google Scholar] [CrossRef] [Green Version]
  14. Mushegian, A.; Shipunov, A.; Elena, S.F. Changes in the composition of the RNA virome mark evolutionary transi-tions in green plants. BMC Biol. 2016, 14, 68. [Google Scholar] [CrossRef] [Green Version]
  15. Nibert, M.L.; Vong, M.; Fugate, K.K.; Debat, H.J. Evidence for contemporary plant mitoviruses. Virology 2018, 518, 14–24. [Google Scholar] [CrossRef]
  16. Sidharthan, V.K.; Baranwal, V.K. Mining of the water hyssop (Bacopa monnieri) transcriptome reveals genome sequences of two putative novel rhabdoviruses and a solendovirus. Arch. Virol. 2021, 166, 1985–1990. [Google Scholar] [CrossRef] [PubMed]
  17. Simmonds, P.; Adams, M.J.; Benkő, M.; Breitbart, M.; Brister, J.R.; Carstens, E.B.; Davison, A.J.; Delwart, E.; Gorbalenya, A.E.; Harrach, B. Consensus statement: Virus taxonomy in the age of metagenomics. Nat. Rev. Microbiol. 2017, 15, 161–168. [Google Scholar] [CrossRef] [PubMed]
  18. Dietzgen, R.G.; Bejerman, N.E.; Goodin, M.M.; Higgins, C.M.; Huot, O.B.; Kondo, H.; Martin, K.M.; Whitfield, A.E. Diversity and epidemiolo-gy of plant rhabdoviruses. Virus Res. 2020, 281, 197942. [Google Scholar] [CrossRef] [PubMed]
  19. Walker, P.J.; Blasdell, K.R.; Breyta, R.; Dietzgen, R.G.; Fooks, A.R.; Freitas-Astúa, J.; Kondo, H.; Kurath, G.; Kuzmin, I.V.; Stone, D.M.; et al. Create Three New Subfamilies in the Family Rhabdoviridae (Mononegavirales). Retrieved 9 March 2021 from International Committee on Taxonomy of Viruses Website. 2020. Available online: https://talk.ictvonline.org/files/ictv_official_taxonomy_updates_since_the_8th_report/m/animal-dsrna-and-ssrna--viruses/11045 (accessed on 9 March 2021).
  20. Dietzgen, R.G.; Kondo, H.; Goodin, M.M.; Kurath, G.; Vasilakis, N. The family Rhabdoviridae: Mono- and bi-partite negative-sense RNA viruses with diverse genome organization and common evolutionary origins. Virus Res. 2017, 227, 158–170. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  21. Walker, P.J.; Blasdell, K.R.; Calisher, C.H.; Dietzgen, R.G.; Kondo, H.; Kurath, G.; Longdon, B.; Stone, D.M.; Tesh, R.B.; Tordo, N. ICTV virus taxonomy pro-file: Rhabdoviridae. J. Gen. Virol. 2018, 99, 447–448. [Google Scholar] [CrossRef]
  22. Walker, P.J.; Dietzgen, R.G.; Joubert, D.A.; Blasdell, K. Rhabdovirus accessory genes. Virus Res. 2011, 162, 110–125. [Google Scholar] [CrossRef]
  23. Kumar, S.; Stecher, G.; Li, M.; Knyaz, C.; Tamura, K. MEGA X: Molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 2018, 35, 1547–1549. [Google Scholar] [CrossRef] [PubMed]
  24. Gross, S.M.; Martin, J.A.; Simpson, J.; Abraham, J.; Wang, Z.; Visel, A. De novo transcriptome assembly of drought tolerant CAM plants, Agave deserti and Agave tequilana. BMC Genom. 2013, 14, 563. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Weitemier, K.; Straub, S.C.; Fishbein, M.; Bailey, C.D.; Cronn, R.C.; Liston, A. A draft genome and transcriptome of common milkweed (Asclepias syriaca) as resources for evolutionary, ecological, and molecular studies in milkweeds and Apocynaceae. PeerJ 2019, 7, e7649. [Google Scholar] [CrossRef]
  26. Olsen, S.; Striberny, B.; Hollmann, J.; Schwacke, R.; Popper, Z.; Krause, K. Getting ready for host invasion: Elevated expression and action of xyloglucan endotransglucosylases/hydrolases in developing haustoria of the holoparasitic angio-sperm Cuscuta. J. Exp. Bot. 2016, 67, 695–708. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Loke, K.K.; Rahnamaie-Tajadod, R.; Yeoh, C.C.; Goh, H.H.; Mohamed-Hussein, Z.A.; Zainal, Z.; Ismail, I.; Noor, N.M. Transcriptome analysis of Polygonum minus reveals candidate genes involved in important secondary metabolic pathways of phenylpropanoids and flavonoids. Peer J. 2017, 5, e2938. [Google Scholar] [CrossRef] [Green Version]
  28. Zhang, L.; Xu, P.; Cai, Y.; Ma, L.; Li, S.; Li, S.; Xie, W.; Song, J.; Peng, L.; Yan, H.; et al. The draft genome assembly of Rhododendron delavayi Franch. var. delavayi. GigaScience 2017, 6, 1–11. [Google Scholar] [CrossRef] [Green Version]
  29. Zhu, S.; Tang, S.; Tan, Z.; Yu, Y.; Dai, Q.; Liu, T. Comparative transcriptomics provide insight into the morphogenesis and evolution of fistular leaves in Allium. BMC Genom. 2017, 18, 1–9. [Google Scholar] [CrossRef] [Green Version]
  30. Suzuki, J.Y.; Amore, T.D.; Calla, B.; Palmer, N.A.; Scully, E.D.; Sattler, S.E.; Sarath, G.; Lichty, J.S.; Myers, R.Y.; Keith, L.M. Organ-specific transcriptome profiling of metabolic and pigment biosynthesis pathways in the floral ornamental progenitor species Anthurium amnicola Dressler. Sci. Rep. 2017, 7, 1596. [Google Scholar] [CrossRef] [Green Version]
  31. Upadhyay, S.K.; Sharma, S.; Singh, H.B.; Dixit, S.; Kumar, J.; Verma, P.C.; Chandrashekar, K. Whitefly Genome Expression Reveals Host-Symbiont Interaction in Amino Acid Biosynthesis. PLoS ONE 2015, 10, e0126751. [Google Scholar] [CrossRef] [Green Version]
  32. Wang, X.; Chen, D.; Wang, Y.; Xie, J. De novo transcriptome assembly and the putative biosynthetic pathway of steroidal sapogenins of Dioscorea composite. PLoS ONE 2015, 10, e0124560. [Google Scholar]
  33. Li, L.; Li, M.; Qi, X.; Tang, X.; Zhou, Y. De novo transcriptome sequencing and analysis of genes related to salt stress response in Glehnia littoralis. PeerJ 2018, 6, e5681. [Google Scholar] [CrossRef] [Green Version]
  34. Fernández, L.P.; Byers, K.J.R.P.; Cai, J.; Sedeek, K.E.M.; Kellenberger, R.T.; Russo, A.; Qi, W.; Fournier, C.A.; Schlüter, P.M. A Phylogenomic Analysis of the Floral Transcriptomes of Sexually Deceptive and Rewarding European Orchids, Ophrys and Gymnadenia. Front. Plant. Sci. 2019, 10. [Google Scholar] [CrossRef] [PubMed]
  35. Wang, Y.; Hua, W.; Wang, J.; Hannoufa, A.; Xu, Z.; Wang, Z. Deep sequencing of Lotus corniculatus L. reveals key enzymes and potential transcription factors related to the flavonoid biosynthesis pathway. Mol. Genet. Genom. 2013, 288, 131–139. [Google Scholar] [CrossRef]
  36. Xu, Y.G.; Zhao, Y.Q.; Duan, H.M.; Sui, N.; Yuan, F.; Song, J. Transcriptomic profiling of genes in matured dimorphic seeds of euhalophyte Suaeda salsa. BMC Genom. 2017, 18, 727. [Google Scholar] [CrossRef] [PubMed]
  37. Feng, G.; Huang, S.; Liu, Y.; Xiao, F.; Liu, J.; Zhang, Z.; Chen, Q.; Mao, Y.; Cao, X.; Wang, Y.; et al. The transcriptome analyses of Tagetes erecta provides novel insights into secondary metabolite biosynthesis during flower development. Gene 2018, 660, 18–27. [Google Scholar] [CrossRef]
  38. Howyzeh, M.S.; Noori, S.A.S.; Amiripour, M. Comparative transcriptome analysis to identify putative genes involved in thymol biosynthesis pathway in medicinal plant Trachyspermum ammi L. Sci. Rep. 2018, 8, 13405. [Google Scholar] [CrossRef]
  39. Park, S.; Yoo, K.-O.; Marcussen, T.; Backlund, A.; Jacobsson, E.; Rosengren, K.J.; Doo, I.; Göransson, U. Cyclotide Evolution: Insights from the Analyses of Their Precursor Sequences, Structures and Distribution in Violets (Viola). Front. Plant. Sci. 2017, 8. [Google Scholar] [CrossRef] [Green Version]
  40. Fajkus, P.; Peška, V.; Závodník, M.; Fojtová, M.; Fulnečková, J.; Dobias, Š.; Kilar, A.; Dvořáčková, M.; Zachová, D.; Nečasová, I.; et al. Telomerase RNAs in land plants. Nucleic Acids Res. 2019, 47, 9842–9856. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  41. Chen, J.; Li, R.; Xia, Y.; Bai, G.; Guo, P.; Wang, Z.; Zhang, H.; Siddique, K.H. Development of EST-SSR markers in flowering Chinese cab-bage (Brassica campestris L. ssp. chinensis var. utilis Tsen et Lee) based on de novo transcriptomic assemblies. PLoS ONE 2017, 12, e0184736. [Google Scholar]
  42. Farrell, J.D.; Byrne, S.; Paina, C.; Asp, T. De Novo Assembly of the Perennial Ryegrass Transcriptome Using an RNA-Seq Strategy. PLoS ONE 2014, 9, e103567. [Google Scholar] [CrossRef]
  43. Kado, T.; Innan, H. Horizontal Gene Transfer in Five Parasite Plant Species in Orobanchaceae. Genome Biol. Evol. 2018, 10, 3196–3210. [Google Scholar] [CrossRef] [Green Version]
  44. Roda, F.; Mendes, F.K.; Hahn, M.W.; Hopkins, R. Genomic evidence of gene flow during reinforcement in Texas Phlox. Mol. Ecol. 2017, 26, 2317–2330. [Google Scholar] [CrossRef] [Green Version]
  45. Liu, J.-J.; Schoettle, A.W.; Sniezko, R.A.; Sturrock, R.N.; Zamany, A.; Williams, H.; Ha, A.; Chan, D.; Danchok, B.; Savin, D.P.; et al. Genetic mapping of Pinus flexilis major gene (Cr4) for resistance to white pine blister rust using transcriptome-based SNP genotyping. BMC Genom. 2016, 17, 1–12. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  46. Takahata, S.; Yago, T.; Iwabuchi, K.; Hirakawa, H.; Suzuki, Y.; Onodera, Y. Comparison of Spinach Sex Chromo-somes with Sugar Beet Autosomes Reveals Extensive Synteny and Low Recombination at the Male-Determining Locus. J. Hered. 2016, 107, 679–685. [Google Scholar] [CrossRef] [Green Version]
  47. Baek, D.; Lim, S.; Ju, H.; Kim, H.; Lee, S.; Moon, J. The complete sequence of a novel nucleorhabdovirus (apple root-stock virus A) identified in apple rootstock. Arch. Virol. 2019, 164, 2641–2644. [Google Scholar] [CrossRef]
  48. Bejerman, N.; Acevedo, R.M.; De Breuil, S.; Ruiz, O.A.; Sansberro, P.; Dietzgen, R.G.; Nome, C.; Debat, H. Molecular characterization of a novel cy-torhabdovirus with a unique genomic organization infecting yerba mate (Ilex paraguariensis) in Argentina. Arch. Virol. 2020, 165, 1475–1479. [Google Scholar] [CrossRef] [PubMed]
  49. Bolus, S.; Al Rwahnih, M.; Grinstead, S.C.; Mollov, D. Rose virus R, a cytorhabdovirus infecting rose. Arch. Virol. 2021, 166, 655–658. [Google Scholar] [CrossRef] [PubMed]
  50. Ding, X.; Chen, D.; Du, Z.; Zhang, J.; Wu, Z. The complete genome sequence of a novel cytorhabdovirus identified in strawberry (Fragaria ananassa Duch.). Arch. Virol. 2019, 164, 3127–3131. [Google Scholar] [CrossRef] [PubMed]
  51. Koloniuk, I.; Fránová, J.; Sarkisova, T.; Přibylová, J.; Lenz, O.; Petrzik, K.; Špak, J. Identification and molecular characterization of a novel varicosa-like virus from red clover. Arch. Virol. 2018, 163, 2213–2218. [Google Scholar] [CrossRef]
  52. Nabeshima, T.; Abe, J. High-throughput sequencing indicates novel Varicosavirus, Emaravirus and Deltapartitvirus infections in Vitis coignetiae. Viruses 2021, 13, 827. [Google Scholar] [CrossRef] [PubMed]
  53. Orfanidou, C.; Beta, C.; Reynard, J.-S.; Tsiolakis, G.; Katis, N.; Maliogka, V. Identification, molecular characterization and prevalence of a novel cytorhabdovirus infecting zucchini crops in Greece. Virus Res. 2020, 287, 198095. [Google Scholar] [CrossRef] [PubMed]
  54. Read, D.A.; Featherston, J.; Rees, J.; Thompson, G.D.; Roberts, R.; Flett, B.C.; Mashingaidze, K.; Pietersen, G.; Kiula, B.; Kullaya, A.; et al. Molecular characterization of Morogoro maize-associated virus, a nucleorhabdovirus detected in maize (Zea mays) in Tanzania. Arch. Virol. 2019, 164, 1711–1715. [Google Scholar] [CrossRef]
  55. Wu, L.P.; Yang, T.; Liu, H.W.; Postman, J.; Li, R. Molecular characterization of a novel rhabdovirus infecting black-currant identified by high-throughput sequencing. Arch. Virol. 2018, 162, 2493–2494. [Google Scholar]
  56. Alvarez-Quinto, R.A.; Lockhart, B.E.L.; Fetzer, J.L.; Olszewski, N.E. Genomic characterization of cycad leaf necrosis virus, the first badnavirus identified in a gymnosperm. Arch. Virol. 2020, 165, 1671–1673. [Google Scholar] [CrossRef]
  57. Han, S.S.; Karasev, A.V.; Ieki, H.; Iwanami, T. Nucleotide sequence and taxonomy of Cycas necrotic stunt virus. Arch. Virol. 2002, 147, 2207–2214. [Google Scholar] [CrossRef]
  58. Koh, S.H.; Li, H.; Admiraal, R.; Jones, M.G.K.; Wylie, S.J. Catharanthus mosaic virus: A potyvirus from a gymno-sperm, Welwitschia mirabilis. Virus Res. 2015, 203, 41–46. [Google Scholar] [CrossRef] [Green Version]
  59. Nibert, M.L.; Ghabrial, S.A.; Maiss, E.; Lesker, T.; Vainio, E.J.; Jiang, D.; Suzuki, N. Taxonomic reorganization of family Partitiviridae and other recent progress in partitivirus research. Virus Res. 2014, 188, 128–141. [Google Scholar] [CrossRef]
  60. Wang, Q.; Ma, X.; Qian, S.; Zhou, X.; Sun, K.; Chen, X.; Zhou, X.; Jackson, A.O.; Li, Z. Rescue of a Plant Negative-Strand RNA Virus from Cloned cDNA: Insights into Enveloped Plant Virus Movement and Morphogenesis. PLOS Pathog. 2015, 11, e1005223. [Google Scholar] [CrossRef]
  61. Zhang, S.; Huang, A.; Zhou, X.; Li, Z.; Dietzgen, R.G.; Zhou, C.Y.; Cao, M. Natural Defect of a Plant Rhabdovirus Glycoprotein Gene: A Case Study of Virus–Plant Coevolution. Phytopathology 2021, 111, 227–236. [Google Scholar] [CrossRef]
  62. Li, C.X.; Shi, M.; Tian, J.H.; Lin, X.D.; Kang, Y.J.; Chen, L.J.; Qin, X.C.; Xu, J.; Holmes, E.C.; Zhang, Y.Z. Unprecedented genomic diversity of RNA viruses in arthro-pods reveals the ancestry of negative-sense RNA viruses. eLife 2015, 4, e05378. [Google Scholar] [CrossRef]
  63. Whitfield, A.E.; Huot, O.B.; Martin, K.M.; Kondo, H.; Dietzgen, R.G. Plant rhabdoviruses—Their origins and vector interactions. Curr. Opin. Virol. 2018, 33, 198–207. [Google Scholar] [CrossRef]
  64. Pinheiro-Lima, B.; Pereira-Carvalho, R.C.; Alves-Freitas, D.M.; Kitajima, E.W.; Vidal, A.H.; Lacorte, C.; Godinho, M.T.; Fontenele, R.S.; Faria, J.C.; Abreu, E.F.; et al. Transmission of the vean-associated cytorhabdovirus by the whitefly Bemisia tabaci MEAM1. Viruses 2020, 12, 1028. [Google Scholar] [CrossRef]
  65. Franova, J.; Sarkisová, T.; Jakešová, H.; Koloniuk, I. Molecular and biological properties of two putative new cy-torhabdoviruses infecting Trifolium pratense. Plant. Pathol. 2019, 11, 90–1286. [Google Scholar]
  66. Bejerman, N.; Giolitti, F.; de Breuil, S.; Trucco, V.; Nome, C.; Lenardon, S.; Dietzgen, R.G. Complete genome sequence and integrated protein localization and interaction map for alfalfa dwarf virus, which combines properties of both cytoplasmic and nuclear plant rhabdoviruses. Virology 2015, 483, 275–283. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  67. Jones, S.; McGavin, W.; MacFarlane, S. The complete sequences of two divergent variants of the rhabdovirus raspber-ry vein chlorosis virus and the design of improved primers for virus detection. Virus Res. 2019, 265, 162–165. [Google Scholar] [CrossRef] [PubMed]
  68. Koloniuk, I.; Fránová, J.; Sarkisova, T.; Přibylová, J. Complete genome sequences of two divergent isolates of straw-berry crinkle virus coinfecting a single strawberry plant. Arch. Virol. 2018, 163, 2539–2542. [Google Scholar] [CrossRef] [PubMed]
  69. Liu, Q.; Jin, J.; Yang, L.; Zhang, S.; Cao, M. Molecular characterization of a novel cytorhabdovirus infecting chrysan-themum with yellow dwarf disease. Arch. Virol. 2021, 166, 1253–1257. [Google Scholar] [CrossRef]
  70. Yang, X.; Huang, J.; Liu, C.; Chen, B.; Zhang, T.; Zhou, G. Rice Stripe Mosaic Virus, a Novel Cytorhabdovirus Infecting Rice via Leafhopper Transmission. Front. Microbiol. 2017, 7, 2140. [Google Scholar] [CrossRef] [Green Version]
  71. Joubert, D.A.; Blasdell, K.R.; Audsley, M.D.; Trinidad, L.; Monaghan, P.; Dave, K.A.; Lieu, K.G.; Amos-Ritchie, R.; Jans, D.A.; Moseley, G.W.; et al. Bovine Ephemeral Fever Rhabdovirus 1 Protein Has Viroporin-Like Properties and Binds Importin 1 and Importin 7. J. Virol. 2013, 88, 1591–1603. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  72. Döring, T.F. How aphids find their host plants, and how they don’t. Ann. Appl. Biol. 2014, 165, 3–26. [Google Scholar] [CrossRef]
  73. Van Bell, A.J.; Will, T. Functional evaluation of proteins in watery and gel saliva of aphids. Front. Plant. Sci. 2016, 7, 1840. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  74. Gaafar, Y.; Richert-Pöggeler, K.R.; Maaß, C.; Vetten, H.J.; Ziebell, H. Characterization of a novel nucleorhabdovirus infecting alfalfa (Medicago sativa). Virol. J. 2019, 16, 55. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  75. Sun, K.; Zhou, X.; Lin, W.; Zhou, X.; Jackson, A.O.; Li, Z. Matrix-glycoprotein interactions required for budding of a plant nucleorhabdovirus and induction of inner nuclear membrane invagination. Mol. Plant. Pathol. 2018, 19, 2288–2301. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  76. Sabbadin, F.; Glover, R.; Stafford, R.; Rozado-Aguirre, Z.; Boonham, N.; Adams, I.; Mumford, R.; Edwards, R. Transcriptome sequencing identifies novel persistent viruses in herbicide resistant wild-grasses. Sci. Rep. 2017, 7, srep41987. [Google Scholar] [CrossRef] [Green Version]
  77. Sasaya, T.; Kusaba, S.; Ishikawa, K.; Koganezawa, H. Nucleotide sequence of RNA2 of Lettuce big-vein virus and evidence for a possible transcription termination/initiation strategy similar to that of rhabdoviruses. J. Gen. Virol. 2004, 85, 2709–2717. [Google Scholar] [CrossRef]
Figure 1. Diagrams depicting the genomic organization of each (A) alpha- and betanucleorhabdovirus, (B) cytorhabdovirus, (C) varicosavirus and (D) dichorhavirus sequence assembled in this study. Abbreviations: N, nucleoprotein coding sequence (CDS); P, phosphoprotein CDS; P3, putative cell-to-cell movement protein CDS; M, matrix protein CDS; G, glycoprotein CDS; L, RNA-dependent RNA polymerase CDS.
Figure 1. Diagrams depicting the genomic organization of each (A) alpha- and betanucleorhabdovirus, (B) cytorhabdovirus, (C) varicosavirus and (D) dichorhavirus sequence assembled in this study. Abbreviations: N, nucleoprotein coding sequence (CDS); P, phosphoprotein CDS; P3, putative cell-to-cell movement protein CDS; M, matrix protein CDS; G, glycoprotein CDS; L, RNA-dependent RNA polymerase CDS.
Viruses 13 01304 g001
Figure 2. Maximum Likelihood phylogenetic tree based on a multiple amino acid alignment of the N protein sequences of plant rhabdoviruses constructed with the WAG + G + F model. Bootstrap values following 1000 replicates are given at the nodes, but only the values above 40% are shown. The scale bar indicates the number of substitutions per site. The viruses identified in this study are noted with green rectangles. The accession numbers of every virus used to construct the ML tree are listed in Supplementary Table S1.
Figure 2. Maximum Likelihood phylogenetic tree based on a multiple amino acid alignment of the N protein sequences of plant rhabdoviruses constructed with the WAG + G + F model. Bootstrap values following 1000 replicates are given at the nodes, but only the values above 40% are shown. The scale bar indicates the number of substitutions per site. The viruses identified in this study are noted with green rectangles. The accession numbers of every virus used to construct the ML tree are listed in Supplementary Table S1.
Viruses 13 01304 g002
Table 2. Consensus plant rhabdovirus gene junction sequences.
Table 2. Consensus plant rhabdovirus gene junction sequences.
GenusVirus *3′End mRNAIntergenic Spacer5′End mRNA
AlphanucleorhabdovirusATV1AUUCUUUUUGGGUUG
CYDVAUUAUUUUUGGGUUG
EMDVAUUAUUUUUGGGUUG
JYBaVAUUAUUUUUGGGUUG
MIMVAUUCUUUUUGGGUUG
MMVAUUCUUUUUGGGUUG
MMaVAUUCUUUUUGGGUUG
PeV1AUUU(A/C)UUUUG(N)nUUG
PhCMoVAUUAUUUUUGGGUUG
PYDVAUUAUUUUUGGGUUG
RYSVAUUAUUUUUGGGUUG
TaVCVAUUCUUUUUGGGUUG
WYSVUAAAUUUUUGGGGUUG
BetanucleorhabdovirusAscSyV2AUUCUUUUUGGUUG
CusReV1AUUCUUUUUGGUUG
PerMiV1AUUCUUUUUGGUUG
PleArV1AUUCUUUUUGGUUG
RhoDeV1AUUCUUUUUGGUUG
AaNVAUUCUUUUUGGUUG
ApRVAAUUCUUUUUGGUUG
BFTaVAUUCUUUUUGGUUG
BCaRVAUUCUUUUUGGUUG
BmV2AUUCUUUUUGGUUG
CdVCV1AUUCUUUUUGGUUG
DYVVAUUCUUUUUGGUUG
GSPNuVAUUCUUUUUGGUUG
SYNVAUUCUUUUUGGUUG
SYVV AUUCUUUUUGGUUG
GammanucleorhabdovirusMFSVAUUUAUUUUGUAGUUG
DichorhavirusVVeV1---
CiCSVAUUUAUUUUGUAGUU
CiLV-NAUUUAUUUUGUAGUU
ClCSVAUUUAUUUUGUAGUU
CoRSVAUUUAUUUUGUAGUU
OFVAUUUAUUUUGUUGUU
CytorhabdovirusAChV1AAUUAUUUUGAUCUA
AntAmV1AUUAUUUUUGCUCUU
AscSyV1AAUUAUUUUGNUCNN
BeTaV1UUAUUUUUUGACUC
DiCoV1AUAUUUUUUGG(N)nCNN
GlLV1AAUUAUUUUGAUCUU
GymDenV1AAUCUUUUUA(N)nCNN
LotCorV1AAUUAUUUUGGU(N)nCUG
NymAV1AUUAAUUUUGAUCUN
PelRaV1AAUUAUUUUGGU(N)nCUG
SuSV1AAUUAUUUUGAUCUU
TaEV1AUUCUUUUUGG(N)nCUN
TrAV1AUUCUUUUUA(N)nCNU
AcCV1AAUUAUUUUGAUCUG
ADVAAUUAUUUUGAUCUU
BmV1AAUUAUUUUGANCUG
BYSMVAUUAUUUUUGACUC
CCyV1AAUUCUUUUG(N)nCUU
CBDaVAUUCUUUUUGGCUC
ChYDaVAAUUAUUUUGAUCUN
CuCV1AUUAUUUUUGACUC
KePCyVAAUUAUUUUGAUCUU
LNYVAAUUCUUUUG(N)nCUU
LYMoVAAUUCUUUUG(N)nCUN
MaCyVAUUCUUUUUGACUC
MYSVAUUAUUUUUGACUC
NCMVAUUCUUUUUGACUC
PMuMaVAUUAUUUUUG(N)nCUA
PpVEAUUCUUUUUGACCCU
PeVAAAUUAUUUUG(N)nCUN
RVRAUUUAUUUUGACUCUA
RSMVAUUCUUUUUGCUCUG
RVCVAUUUAUUUUGAUCUU
SCVAAUUAUUUUGAUCUU
StrV1AAUUAUUUUGAUCUU
TpVAAAUUAUUUUGAUCUU
TpVBAAUUCUUUUG(N)nCUN
TrARV1AAUUAUUUUGAUCUU
TYMaVAAUUAUUUUGAUCUU
WhIV4AAUUAUUUUGNUCUU
WhIV5AAUUAUUUUGAUCNN
WhIV6AAUUAUUUUGAUCUN
YmCaVUUAUUUUUUGA CUC
YmVAAUUCUUUUUGGUCCU
VaricosavirusAAnV1AU(N)5UUUUUGCUCU
BrRV1AU(N)5UUUUUGCUCA
LoPV1AU(N)5UUUUUGCUCU
MelRoV1AU(N)5UUUUUGCUCU
PhPiV1AU(N)5UUUUUGCUCU
PiFleV1GU(N)5UUUUUGCUCU
SpV1AU(N)5UUUUUGCUCU
AMVV1AU(N)5UUUUUGCUCU
LBVaVAU(N)5UUUUUGCUCU
RCaVVAU(N)5UUUUUGCUCU
VVVAU(N)5UUUUUGCUCU
The consensus gene junction sequences of the viruses identified in this study are highlighted in light grey. * Names and abbreviations of newly identified viruses are listed in Table 1, while the names and abbreviations of known viruses are listed in Table S1.
Table 3. Pairwise identity percentages between ATV1 and the selected alphanucleorhabdoviruses.
Table 3. Pairwise identity percentages between ATV1 and the selected alphanucleorhabdoviruses.
Genome aN bP bP3 bM bG bL b
ATV1 vs.MMaV47.032.416.717.517.529.335.0
MMV49.329.113.720.516.429.134.5
PeV149.126.918.820.114.328.732.9
PYDV48.824.110.918.213.623.531.8
a Nucleotide percentages; b amino acid percentages; N/C: not complete; virus names are listed in Table S1 and Table 1.
Table 4. Pairwise identity percentages between AscSyV2, CusReV1, PleArV1 and RhoDeV1 and the selected betanucleorhabdoviruses.
Table 4. Pairwise identity percentages between AscSyV2, CusReV1, PleArV1 and RhoDeV1 and the selected betanucleorhabdoviruses.
Genome aN bP bP3 bM bG bL b
AscSyV2 vs.ApRVA51.347.524.326.323.633.244.2
BCaRV50.440.523.425.022.830.338.5
CusReV1N/C40.923.018.1N/CN/CN/C
DYVV50.642.319.522.220.527.038.1
GSPNuV46.342.721.719.920.129.038.7
PleArV149.341.119.619.319.126.437.2
RhoDeV151.440.718.125.519.327.139.9
CusReV1 vs.ApRVAN/C42.520.819.6N/CN/CN/C
AscSyV2N/C40.923.018.1N/CN/CN/C
BCaRVN/C48.327.628.4N/CN/CN/C
DYVVN/C59.032.641.7N/CN/CN/C
GSPNuVN/C59.030.844.0N/CN/CN/C
PleArV1N/C60.531.640.8N/CN/CN/C
RhoDeV1N/C38.624.823.0N/CN/CN/C
PleArV1 vs.ApRVA49.844.022.220.616.727.237.4
AscSyV249.341.119.619.319.126.437.2
BCaRV52.149.930.126.531.636.447.3
CusReV1N/C60.531.640.8N/CN/CN/C
DYVV62.170.745.663.248.161.062.2
GSPNuV45.864.534.843.741.048.455.4
RhoDeV150.942.522.729.825.431.942.8
RhoDeV1 vs.ApRVA51.740.423.222.422.230.240.8
AscSyV251.440.718.125.519.327.139.9
BCaRV53.142.027.234.527.433.645.0
CusReV1N/C38.624.823.0N/CN/CN/C
DYVV51.342.223.227.526.534.443.7
GSPNuV46.341.122.425.428.632.843.3
PleArV150.942.522.729.825.431.942.8
a Nucleotide percentages; b amino acid percentages; N/C: not complete; virus names are listed in Table S1 and Table 1.
Table 5. Pairwise identity percentages between the cytorhabdoviruses assembled in this study and the selected cytorhabdovirus.
Table 5. Pairwise identity percentages between the cytorhabdoviruses assembled in this study and the selected cytorhabdovirus.
Genome aN bP bP3 bP4 bM bG bP6 bL b
AChV1 vs.ADVN/C44.330.834.5 27.9N/CN/CN/C
ChYDaVN/C41.538.344.9 32.4N/CN/CN/C
RVCVN/C42.229.635.1 22.8N/CN/CN/C
SCVN/C48.429.938.2 26.9N/CN/CN/C
SuSV1N/CN/CN/C42.7 36.7N/CN/CN/C
AntAmV1 vs.CBDaV51.231.721.721.0 18.821.5-39.7
MaCyV50.828.319.317.9 16.422.1-40.2
PMuMaV51.841.025.629.3 35.336.4-45.5
AscSyV1 vs.BmV154.850.437.655.3 34.544.327.653.9
PelRaV1N/C39.731.935.6 22.028.920.2N/C
WhIV453.248.735.354.2 30.841.822.850.1
BeTaV1 vs.CuCVi161.868.264.853.545.661.260.4-72.5
DiCoV1N/C19.4N/CN/C-14.917.2-N/C
YmCaV51.530.620.331.719.921.230.8-42.4
DiCoV1 vs.MYSVN/C23.3N/CN/C-20.221.017.3N/C
CBDaVN/C25.2N/CN/C-23.822.5-N/C
RVRN/C20.7N/CN/C-17.618.719.0N/C
GlLV1 vs.NymAV149.740.023.827.3-24.037.624.150.0
StrV149.938.725.037.0-26.542.122.450.5
TpVA65.667.860.978.1-66.771.675.876.1
TYMaV45.936.221.634.2-20.744.111.949.1
GymDenV1 vs.LYMoV43.522.019.0--19.4--29.5
SCV40.023.517.2--18.7--30.1
TrAV153.834.423.1--26.8--56.3
WhIV544.419.915.9--10.7--30.6
LotCorV1 vs.AscSyV150.339.928.540.5-20.230.118.543.5
PelRaV1N/C61.844.758.6-55.947.931.6N/C
WhIV452.040.929.739.7-20.830.712.543.8
NymAV1 vs.GlLV149.740.023.827.3-24.037.624.150.0
KePCyV52.939.323.632.9-23.740.719.447.9
StrV151.238.622.624.5-19.840.518.048.0
TpVA53.839.224.930.8-23.637.717.050.1
PelRaV1 vs.AscSyV1N/C39.731.935.6-22.028.920.2N/C
LotCorV1N/C61.844.758.6-55.947.931.6N/C
WhIV4N/C40.528.539.6-20.427.511.7N/C
SuSV1 vs.AChV1N/CN/CN/C42.7-36.7N/CN/CN/C
ADVN/CN/CN/C37.0-30.2N/CN/CN/C
ChYDaVN/CN/CN/C37.9-35.6N/CN/CN/C
RVCVN/CN/CN/C40.2-30.8N/CN/CN/C
SCVN/CN/CN/C36.8-26.5N/CN/CN/C
TaEV1 vs.MYSV50.323.315.216.2-15.0--41.7
PpVE47.421.317.615.1-14.1--34.9
RSMV48.725.015.717.8-16.3--38.4
RVR48.523.816.826.7-21.0--38.6
TrAV1 vs.CCyV146.323.016.018.7-16.3--30.2
GymDenV153.834.423.1--26.8--56.3
StrV144.724.616.716.1-15.9--30.0
WhIV546.920.519.311.2-13.3--29.6
a Nucleotide percentages; b amino acid percentages; N/C: not complete; virus names are listed in Table S1 and Table 1.
Table 6. Pairwise identity percentages between VVeV1 and the dichorhaviruses.
Table 6. Pairwise identity percentages between VVeV1 and the dichorhaviruses.
Genome aN bP bP3 bM bG bL b
VVeV1 vs.CiCSVN/C21.3N/C18.5N/C19.0N/C
CiLV-NN/C20.8N/C18.8N/C19.6N/C
ClCSVN/C21.1N/C18.2N/C19.2N/C
CoRSVN/C20.5N/C18.4N/C18.7N/C
OFVN/C23.1N/C19.1N/C20.1N/C
a Nucleotide percentages; b amino acid percentages; N/C: not complete; virus names are listed in Table S1 and Table 1.
Table 7. Pairwise identity percentages between the varicosaviruses assembled in this study and the reported varicosaviruses.
Table 7. Pairwise identity percentages between the varicosaviruses assembled in this study and the reported varicosaviruses.
N b2 b3 b4 bL a/b
AAV1 vs.AMVV125.920,220.2-54.4/37.8
BrRV120.519.418.3-58.2/40.7
LBVaV20.919.712.6-54.1/37.7
LoPV123.219.919.1-57.3/39.4
MelRoV120.317.48.4-57.6/39.5
PhPiV118.3N/C7.4-N/C
PiFleV117.317.17.2-52.2/32.8
RCaVV21.419.818.7-56.2/38.9
SpV119.318.517.4-57.1/39.3
BrRV1 vs.AAnV120.519.418.3-58.2/40.7
AMVV122.510.918.3-52.8/36.9
LBVaV19.915.46.7-53.6/37.3
LoPV121.521.019.0-55.2/37.5
MelRoV123.016.47.8-55.8/42.0
PhPiV118.8N/C6.9-N/C
PiFleV122.917.56.6-52.3/32.2
RCaVV26.917.721.8-59.6/50.5
SpV124.618.1N/C-59.8/51.5
LoPV1 vs.AAnV123.219.919.1-57.3/39.4
AMVV152.032.750.9-67.6/71.5
BrRV121.521.019.0-55.2/37.5
LBVaV19.115.911.3-54.8/37.0
MelRoV119.416.65.9-55.1/37.8
PhPiV121.2N/C6.3-N/C
PiFleV117.117.18.8-52.5/32.1
RCaVV18.89.618.5-54.5/37.5
SpV118.816.8N/C-54.1/35.9
MelRoV1 vs.AAnV120.317.48.4-57.6/39.5
AMVV122.616.86.7-54.8/38.0
BrRV123.016.47.9-55.8/42.0
LBVaV19.616.024.011.153.6/37.7
LoPV119.416.65.9-55.1/37.8
PhPiV120.8N/C29.1N/CN/C
PiFleV119.717.427.714.250.9/31.5
RCaVV22.515.39.9-55.2/41.0
SpV121.920.5N/C-56.3/41.0
PhPiV1 vs.AAnV118.3N/C7.4-N/C
AMVV122.5N/C7.1-N/C
BrRV118.8N/C6.9-N/C
LBVaV30.4N/C30.4N/CN/C
LoPV121.2N/C6.3-N/C
MelRoV120.8N/C29.1N/CN/C
PiFleV126.8N/C25.5N/CN/C
RCaVV24.3N/C13.7-N/C
SpV121.4N/CN/C-N/C
PiFleV1 vs.AAnV117.317.17.2-52.2/32.8
AMVV119.115.39.5-51.3/32.7
BrRV122.917.56.6-52.3/32.2
LBVaV23.416.425.414.551.6/32.4
LoPV117.117.18.8-52.5/32.1
MelRoV119.717.427.714.250.9/31.5
PhPiV126.8N/C25.5N/CN/C
RCaVV20.415.96.8-52.0/31.1
SpV120.515.0N/C-51.7/32.4
SpV1 vs.AAnV119.318.517.4-57.1/39.3
AMVV118.316.6N/C-53.1/35.5
BrRV124.618.1N/C-59.8/51.5
LBVaV18.816.7N/C-53.5/36.1
LoPV118.816.8N/C-54.1/35.9
MelRoV121.920.5N/C-56.3/41.0
PhPiV121.4N/CN/C-N/C
PiFleV120.515.0N/C-51.7/32.4
RCaVV27.718.5N/C-60.8/54.8
a Nucleotide percentages; b amino acid percentages; N/C: not complete; virus names are listed in Table S1 and Table 1.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Bejerman, N.; Dietzgen, R.G.; Debat, H. Illuminating the Plant Rhabdovirus Landscape through Metatranscriptomics Data. Viruses 2021, 13, 1304. https://0-doi-org.brum.beds.ac.uk/10.3390/v13071304

AMA Style

Bejerman N, Dietzgen RG, Debat H. Illuminating the Plant Rhabdovirus Landscape through Metatranscriptomics Data. Viruses. 2021; 13(7):1304. https://0-doi-org.brum.beds.ac.uk/10.3390/v13071304

Chicago/Turabian Style

Bejerman, Nicolás, Ralf G. Dietzgen, and Humberto Debat. 2021. "Illuminating the Plant Rhabdovirus Landscape through Metatranscriptomics Data" Viruses 13, no. 7: 1304. https://0-doi-org.brum.beds.ac.uk/10.3390/v13071304

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop