Data from: Assignment of virus and antimicrobial resistance genes to microbial hosts in a complex microbial community by combined long-read assembly and proximity ligation
<p>We describe a method that adds long-read sequencing to a mix of technologies used to assemble a highly complex cattle rumen microbial community, and provide a comparison to short read-based methods. Long-read alignments and Hi-C linkage between contigs support the identification of 188 novel virus-host associations and the determination of phage life cycle states in the rumen microbial community. The long-read assembly also identifies 94 antimicrobial resistance genes, compared to only seven alleles in the short-read assembly. We demonstrate novel techniques that work synergistically to improve characterization of biological features in a highly complex rumen microbial community.</p> <p>We demonstrate the benefits of using multiple sequencing technologies and proximity ligation in identifying unique biological facets of the cattle rumen metagenome, and we present data that suggests that each has a unique niche in downstream analysis. Our comparison identified biases in the sampling of different portions of the community by each sequencing technology, suggesting that a single DNA sequencing technology is insufficient to characterize complex metagenomic samples. Using a combination of long-read alignments and proximity ligation, we identified putative hosts for assembled bacteriophage at a resolution previously unreported in other rumen surveys. These host-phage assignments support previous work that revealed increased viral predation of sulfur-metabolizing bacterial species; however, we were able to provide a higher resolution of this association, identify potential auxiliary metabolic genes related to sulfur metabolism, and identify phage that may target a diverse range of different bacterial species. Furthermore, we found evidence to support that these viruses have a lytic life cycle due to a higher proportion of Hi-C intercontig link association data in our analysis. Finally, it appears that there may be a high degree of mobile DNA that was heretofore uncharacterized in the rumen and that this mobile DNA may be shuttling antimicrobial resistance gene alleles among distantly related species. These unique characteristics of the rumen microbial community would be difficult to detect without the use of several different methods and techniques that we have refined in this study, and we recommend that future surveys incorporate these techniques to further characterize complex metagenomic communities.</p> <p>Datasets generated and/or analyzed during the current study are available in the NCBI SRA repository under Bioproject: PRJNA507739. Assemblies, bins, and ORF predictions are available on Figshare. A description of commands, scripts, and other materials used to analyze the data in this project are available in the GitHub repository: <a href="https://github.com/njdbickhart/RumenLongReadASM">https://github.com/njdbickhart/RumenLongReadASM</a> and also on Zenodo. </p><div><br>Resources in this dataset:</div><br><ul><li><p>Resource Title: Availability of data and materials.</p> <p>File Name: Web Page, url: <a href="https://genomebiology.biomedcentral.com/articles/10.1186/s13059-019-1760-x#availability-of-data-and-materials">https://genomebiology.biomedcentral.com/articles/10.1186/s13059-019-1760-x#availability-of-data-and-materials</a> </p><p>The datasets generated and/or analyzed during the current study are available in the NCBI SRA repository under Bioproject: PRJNA507739. The assemblies, bins, and ORF predictions are available on Figshare. A description of commands, scripts, and other materials used to analyze the data in this project can be found in the following GitHub repository: <a href="https://github.com/njdbickhart/RumenLongReadASM">https://github.com/njdbickhart/RumenLongReadASM</a> and also on Zenodo.</p></li></ul><p></p>
Main Authors: | , , , , , , , , , , , , , , , , , , , , |
---|---|
Format: | Dataset biblioteca |
Published: |
2020
|
Subjects: | Animal production, Sequence analysis, Ecology, Genetics, Animal structure and function, bacterophage interactions, allele predictions, allele similarities, ARG alleles, prodical ORF predictions, |
Online Access: | https://figshare.com/articles/dataset/Data_from_Assignment_of_virus_and_antimicrobial_resistance_genes_to_microbial_hosts_in_a_complex_microbial_community_by_combined_long-read_assembly_and_proximity_ligation/24853515 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | <p>We describe a method that adds long-read sequencing to a mix of technologies used to assemble a highly complex cattle rumen microbial community, and provide a comparison to short read-based methods. Long-read alignments and Hi-C linkage between contigs support the identification of 188 novel virus-host associations and the determination of phage life cycle states in the rumen microbial community. The long-read assembly also identifies 94 antimicrobial resistance genes, compared to only seven alleles in the short-read assembly. We demonstrate novel techniques that work synergistically to improve characterization of biological features in a highly complex rumen microbial community.</p>
<p>We demonstrate the benefits of using multiple sequencing technologies and proximity ligation in identifying unique biological facets of the cattle rumen metagenome, and we present data that suggests that each has a unique niche in downstream analysis. Our comparison identified biases in the sampling of different portions of the community by each sequencing technology, suggesting that a single DNA sequencing technology is insufficient to characterize complex metagenomic samples. Using a combination of long-read alignments and proximity ligation, we identified putative hosts for assembled bacteriophage at a resolution previously unreported in other rumen surveys. These host-phage assignments support previous work that revealed increased viral predation of sulfur-metabolizing bacterial species; however, we were able to provide a higher resolution of this association, identify potential auxiliary metabolic genes related to sulfur metabolism, and identify phage that may target a diverse range of different bacterial species. Furthermore, we found evidence to support that these viruses have a lytic life cycle due to a higher proportion of Hi-C intercontig link association data in our analysis. Finally, it appears that there may be a high degree of mobile DNA that was heretofore uncharacterized in the rumen and that this mobile DNA may be shuttling antimicrobial resistance gene alleles among distantly related species. These unique characteristics of the rumen microbial community would be difficult to detect without the use of several different methods and techniques that we have refined in this study, and we recommend that future surveys incorporate these techniques to further characterize complex metagenomic communities.</p>
<p>Datasets generated and/or analyzed during the current study are available in the NCBI SRA repository under Bioproject: PRJNA507739. Assemblies, bins, and ORF predictions are available on Figshare. A description of commands, scripts, and other materials used to analyze the data in this project are available in the GitHub repository: <a href="https://github.com/njdbickhart/RumenLongReadASM">https://github.com/njdbickhart/RumenLongReadASM</a> and also on Zenodo. </p><div><br>Resources in this dataset:</div><br><ul><li><p>Resource Title: Availability of data and materials.</p> <p>File Name: Web Page, url: <a href="https://genomebiology.biomedcentral.com/articles/10.1186/s13059-019-1760-x#availability-of-data-and-materials">https://genomebiology.biomedcentral.com/articles/10.1186/s13059-019-1760-x#availability-of-data-and-materials</a> </p><p>The datasets generated and/or analyzed during the current study are available in the NCBI SRA repository under Bioproject: PRJNA507739. The assemblies, bins, and ORF predictions are available on Figshare. A description of commands, scripts, and other materials used to analyze the data in this project can be found in the following GitHub repository: <a href="https://github.com/njdbickhart/RumenLongReadASM">https://github.com/njdbickhart/RumenLongReadASM</a> and also on Zenodo.</p></li></ul><p></p> |
---|