Benchmarking of virome metagenomic analysis approaches using a large, 60+ members, viral synthetic community

In contrast to microbial metagenomics, there has still been only limited efforts to benchmark performance of virome analysis approaches in terms of faithfulness to community structure and of completeness of virome description. While natural communities are more readily accessible, synthetic communities assembled using well-characterized isolates allow more accurate performance evaluation. Starting from authenticated, quality-controlled reference isolates from the DSMZ Plant Virus Collection, we have assembled synthetic communities of varying complexity up to a highly complex community of 72 viral agents (115 viral molecules) comprising isolates from 21 viral families and 61 genera. These communities were then analyzed using two approaches frequently used in ecology-oriented plant virus metagenomics: a virion-associated nucleic acids (VANA)-based strategy and a highly purified double-stranded RNAs (dsRNAs)-based one. The results obtained allowed to compare diagnostic sensitivity of these two approaches for groups of viruses and satellites with different genome types and confirmed that the dsRNA-based approach provides a more complete representation of the RNA virome. However, for viromes of low to medium complexity, VANA appears a reasonable alternative and would be the preferred choiceif analysis of DNA viruses is of importance. They also allowed to identify several important parameters and to propose hypotheses to explain differences in performance, in particular, differences in the imbalance in the representation of individual viruses using each approach. Remarkably, these analyses highlight a strong direct relationship between the completeness of virome description and sample sequencing depth which should prove useful in further virome analysis efforts.

Saved in:
Bibliographic Details
Main Authors: Schönegger, Deborah, Moubset, Oumaima, Margaria, Paolo, Menzel, Wulf, Winter, Stephan, Roumagnac, Philippe, Marais, Armelle, Candresse, Thierry
Format: article biblioteca
Language:eng
Subjects:H20 - Maladies des plantes, virus des végétaux, ARN satellite, virologie, écologie microbienne, virus bunchy top bananier, dsRNA viruses, indexage des virus des plantes, identification, séquence nucléotidique, phylogénie, virus, http://aims.fao.org/aos/agrovoc/c_5985, http://aims.fao.org/aos/agrovoc/c_4cf70821, http://aims.fao.org/aos/agrovoc/c_8259, http://aims.fao.org/aos/agrovoc/c_24111, http://aims.fao.org/aos/agrovoc/c_32385, http://aims.fao.org/aos/agrovoc/c_081270de, http://aims.fao.org/aos/agrovoc/c_27034, http://aims.fao.org/aos/agrovoc/c_3791, http://aims.fao.org/aos/agrovoc/c_27583, http://aims.fao.org/aos/agrovoc/c_13325, http://aims.fao.org/aos/agrovoc/c_8262, http://aims.fao.org/aos/agrovoc/c_3081,
Online Access:http://agritrop.cirad.fr/607230/
http://agritrop.cirad.fr/607230/7/607230ed.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In contrast to microbial metagenomics, there has still been only limited efforts to benchmark performance of virome analysis approaches in terms of faithfulness to community structure and of completeness of virome description. While natural communities are more readily accessible, synthetic communities assembled using well-characterized isolates allow more accurate performance evaluation. Starting from authenticated, quality-controlled reference isolates from the DSMZ Plant Virus Collection, we have assembled synthetic communities of varying complexity up to a highly complex community of 72 viral agents (115 viral molecules) comprising isolates from 21 viral families and 61 genera. These communities were then analyzed using two approaches frequently used in ecology-oriented plant virus metagenomics: a virion-associated nucleic acids (VANA)-based strategy and a highly purified double-stranded RNAs (dsRNAs)-based one. The results obtained allowed to compare diagnostic sensitivity of these two approaches for groups of viruses and satellites with different genome types and confirmed that the dsRNA-based approach provides a more complete representation of the RNA virome. However, for viromes of low to medium complexity, VANA appears a reasonable alternative and would be the preferred choiceif analysis of DNA viruses is of importance. They also allowed to identify several important parameters and to propose hypotheses to explain differences in performance, in particular, differences in the imbalance in the representation of individual viruses using each approach. Remarkably, these analyses highlight a strong direct relationship between the completeness of virome description and sample sequencing depth which should prove useful in further virome analysis efforts.