Review on Viral Metagenomics as Powerful Tool in Veterinary Medicine

Review Article

Austin J Vet Sci & Anim Husb. 2024; 11(3): 1146.

Review on Viral Metagenomics as Powerful Tool in Veterinary Medicine

Tewodros Legesse*

Addis Ababa University College of Veterinary Medicine and Agriculture, Ethiopia

*Corresponding author: Tewodros Legesse Addis Ababa University College of Veterinary Medicine and Agriculture, Ethiopia. Email: tewoderos.legesse@aau.edu.et

Received: March 28, 2024 Accepted: May 01, 2024 Published: May 08, 2024

Summary

New diseases continue to emerge in both human and animal populations, and the importance of animals, as reservoirs for viruses that can cause zoonosis are evident. Due to intensive globalization on, climatic changes, and viral evolution, among other factors, the emergence of viruses and new viral diseases has increased in the last decades. Thus, an increased knowledge of the viral flora in animals, both in healthy and diseased individuals, is important both for animal and human health. In this situation, it is crucial to apply powerful methods for the broad-range detection and identification of the emerging viruses. In combination with classical methods, the molecular based techniques provide sensitive and rapid means of virus detection and identification. However, most of the conventional diagnostic tests are designed to be virus specific or aimed at a limited group of infectious agents. In contrast, the novel viral metagenomics approaches allow unbiased detection of a very wide range of infectious agents in a culture independent manner. There are different possible steps of a viral metagenomics study utilizing sequence-independent amplifi cation, high- throughput sequencing, and bioinformatics to identify viruses. The objective of this review is to importance of the applications of viral metagenomics in veterinary science and discusses some of the viruses discovered within this fi eld.

Keywords: Bioinformatics; High-throughput sequencing; Sequence-independent amplification; Viral metagenomics

Abbreviations: cDNA: Complementary DNA; DNA: Deoxyribose Nuclear Acid; DNase: Deoxy-Ribonuclease; dsDNA: Double Stranded DNA; NGS: Next Generation Sequencing; PCR: Polymerase Chain Reaction; RNA: Ribosomal Ribonucleic Acid; RNase: Ribonuclease; rPCR: Random Polymerase Chain Reaction; SISPA: Sequence-Independent Single-Primer Amplification; ssDNA: Single Stranded DNA

Introduction

Throughout recorded history, viruses have caused diseases in animals and humans, with descriptions of possible viral infections appearing long before viruses were fi rst discovered. However, the concept of a virus (meaning poison or toxic) was introduced in 1898 by Beijerinck, who found that the agent causing tobacco mosaic disease in plants could not, as bacteria, be fi ltered out and could not grow by itself but required living and growing cells in which to replicate [7]. They can infect a wide range of hosts including plants, bacteria, fungi, algae, protozoa, vertebrate or non-vertebrate animals. In nature, around 1 × 1031 numbers of different viruses are present. The number it-self suggests the diversity of viruses in nature [35].

The abundance of viruses in nature is around 1000 times more than observed through cell culture dependent technique suggesting that the large pool of viruses is still unknown, only around 219 viral species (belonging to 23 families) that are known to infect humans, among which more than two-thirds are of zoonotic origin [10]. New diseases continue to emerge in both human and animal populations, and the importance of animals, as reservoirs for viruses that can cause zoonosis are evident. Because of this, an increased knowledge of the viral flora in animals, both in healthy and diseased individuals, is important both for animal and human health [15].

Thus, an improved detection of newly emerging and re-emerging viruses and a systematic characterization of the full range of viruses that infect humans are needed [4]. In this situation, it is crucial to apply powerful methods for the broad-range detection and identification of the emerging viruses. In combination with classical methods, the molecular-based techniques provide sensitive and rapid means of virus detection and identification [8]. However, most of the conventional diagnostic tests are designed to be virus-specific or aimed at a limited group of infectious agents. This makes them unsuitable for the detection of unexpected and/or completely new viruses, as well as novel viral variants [8]. In contrast, viral metagenomics approaches allow unbiased detection of a very wide range of infectious agents in a culture-independent manner and novel and highly divergent viruses can be discovered and genetically characterized for the fi rst time [14]. This technology, hold the promise to significantly improve diagnosis and disease control, in line with the ‘‘One World, One Health’’ principles [8].

Therefore, the objective of this seminar is:

• To review metagenomics as a powerful tool for the application of viral metagenomics in veterinary science and discuss some of the viruses discovered within this fi eld.

History of Viral Metagenomics

The genomic age began in 1977 when FX174, a virus that infects Escherichia coli, was sequenced [35]. The metagenomics of viruses began in 2002 with the publication of two uncultured marine viral communities [19]. Mya Breitbart, Forest Rohwer, and colleagues used environmental shotgun sequencing to show that 200 liters of seawater contains over 5000 different viruses [19]. More than 65% of the sequences could not be identifi ed, but a large portion of the identifi ed sequences was from phages, covering most of the major dsDNA phage families [19]. Since then, other studies have been performed, including the viral RNA world [20]. Similar techniques have also been used to study the viral population in various environments, such as feces [18], blood [16] and potential viral reservoirs (e.g., bats) [48]. Because these techniques investigate the complete viral community within a sample, it is possible to study both the natural viral flora and emerging pathogens in disease complexes.

Even though historically it was first associated with the study of uncultured microbial organisms (bacteria and archaea) in environmental samples [37], more recently, it has also been applied to the characterization of viral communities, a task that it is particularly suited because the small size of viral genomes makes their coverage more comprehensive using the same number of metagenomic sequences. Today, viruses are considered the most abundant and diverse living forms on earth [20,72]. Their diversity has been explored by metagenomics in a wide variety of environments: oceans [37], stromatolites [24], acidic hot springs and subterranean and hypersaline environments [24]. The first metagenomic surveys performed on environmental viral communities showed that more than 60% of the sequences had no significant similarity to sequences stored in public databases [16]. This listed benefi t is critical for developing countries to participate to the metagenomics race [15], in particular because of the applications to genomic medicine by implemen ting molecular diagnostics and molecular epidemiology. Science and technology, in particular the life sciences, are increasingly recognized as vital components for national progress in developing countries (Virgin and Todd, 2011).

Workflow of Viral Metagenomics

In the laboratory space, viral metagenomics consists of four major steps: (1) viral enrichment to minimize background of prokaryotic and eukaryotic nucleic acids thus increasing the relative proportion of viral nucleic acids, (2) amplifi cation of viral nucleic acids, (3) sequencing with or without cloning of amplifi ed polymerase chain reaction (PCR) fragments, and (4) bioinformatics analysis of the resulting sequence output [42].

Viral Enrichment

For accurate results, sample collection, proper handling, transportation, stage of the sample is very crucial. There are many standard protocols available for collection of diferent samples to laboratory and its storage techniques. Diferent protocols are used for fluid sample and for tissue samples. The tissue sample is generally homogenized in autoclaved saline and collected supernatant fi ltered through 0.8, 0.45 and 0.2 μm liters, this serial fi ltration procedure is used to separate larger particles and bacteria from viruses. If necessary, the fi rst step of sample procession is homogenization (physical, e.g., homogenizer, mortar and pestle, freeze-thaw cycles; or enzymatic techniques, e.g., salt solution, detergents, alkaline lysis), centrifugation of the sample and fi ltration of the supernatant (through 0.22 and 0.45 μm pore membranes) to remove non-viral nucleic acids (i.e., host cellular debris and bacteria). The fi ltrates can be treated with a mixture of DNases and RNases to further reduce background nucleic acids originating from the host cells and bacteria. The method chosen in this step depends on the physical properties and other characteristics of the sample type [21,54].

Following sample homogenization and reduction of the amount of debris and background nucleic acids, viral particles can be concentrated at various efficacies. Commonly used methods include tangential-flow fi ltration, poly-ethylene glycol precipitation, and ultracentrifugation. Density gradient ultracentrifugation using Cesium Chloride (CsCl) gradient provides highly purifi ed virus particles. Concerning the step of viral nucleic acid extraction, the picture is more complex. Viral particles are disrupted by using “lysis buffer,” which may contain chaotropic acids (e.g., guanidine hydrochloride), detergents (sodium dodecyl sulfate, Triton X-100), and/or proteases (e.g., proteinase K). During the subsequent separation phase, the nucleic acids could be isolated from other components. This can be done by liquid-liquid extraction or liquid-solid extraction.

During the liquid phase extraction different types of alcohol are used (e.g., phenol-chloroform- isoamyl alcohol, isopropanol, etc.). The solid phase extraction may include one of the following procedures: gel fi ltration, where nucleic acid is separated through gel- matrix (e.g., Sephadex), ion exchange chromatography (e.g., anion exchange resin, DEAE-C), and affinity chromatography (silica surface, paramagnetic beads). In general, liquid-solid extraction methods use less hazardous chemicals and provide increased throughput. Various formats have been marketed providing flexible, fast, and scalable viral nucleic acid extraction [21,55,75].

Nucleic Acid Amplification

Amplifi cation of the nucleic acids is performed in a sequence-independent manner to show the true genetic composition of the sample. This step is able to simultaneously multiply several viral genomes including highly divergent and completely novel viruses and thus enable their discovery and genetic characterization [3,9,23]. Some of the more commonly used sequence-independent strategies are briefly described in the following sections as well as a number of different sequencing technologies.

Sequence-independent single-primer amplification: The ligation of adaptors to cDNA and DNA, either directly or after restriction digestion, enables sequence-independent amplifi cation of all nucleic acids in a sample. Two decades ago, a version of this strategy, called sequence-independent single-primer amplifi cation (SISPA) [64], was shown in combination with immunoscreening to retrieve and aid in the genetic characterization of a Norwalk virus in fecal samples [51]. This strategy and modifi cations to it, such as DNase-SISPA [2] and VIDISCA [79], have successfully been used to identify a number of viruses, such as new Anello- and Parvoviruses in humans [41] and Bungowannah virus in pigs with Porcine Myocarditis (PMC) syndrome [44].

Random PCR: An alternative amplifi cation technology is random PCR (rPCR), which neither requires the digestion of the DNA/cDNA nor the ligation of adaptors. The rPCR utilizes a primer that consists of a known adaptor sequence at the 5-prime end followed by a degenerate hexa- or heptamer at the 3-prime end. Using this primer in the cDNA synthesis step, the cDNA is labeled with the adaptor sequence at both ends (a similar step can be performed on the DNA), which enables the creation of primer target sites for the subsequent PCR reaction. This strategy has been widely used for the investigation of viral metagenomes, and a number of novel viruses have been discovered using this strategy in combination with sequencing [30].

Displacement amplification: Another efficient amplifi cation strategy is the use of random primers in combination with a displacement polymerase [40]. Phi29 is a high-fi delity displacement polymerase with high processivity that can incorporate over 70,000 bases prior to detaching from its template [12,27], and only a few nanograms of starting material can produce several micrograms of DNA in one reaction. Using random primers and phi29, circular targets are efficiently amplifi ed through a reaction termed ‘‘rolling-circle amplifi cation,’’ which creates long concatemers of multiple copies of the target [22].

Circular targets are ideal for this amplifi cation strategy. However, linear targets can also be amplifi ed in a similar manner [22]. For short linear DNA viruses and cDNA, amplifi cation is less efficient, and additional steps, such as ligation, may have to be included for amplifi cation (Berthet et al., 2008). This strategy has also been successful in studying different viral metagenomes, and several novel viruses have been discovered by this method, such as a fi bropapilloma virus in sea turtles [55], Anellovirus in harbor seals [56], Bocavirus in pigs [15], and Papillomaviruses in humans [63].

Sequencing Technologies

Owing to efforts in research and innovation (Figure 3), sequencing capacity and speed have dramatically increased over the past decade, while the cost is continuously decreasing. These achievements together led to the capacity to produce billions of nucleotide bases in a single sequencing run, which was unconceivable some time ago [26].