A roadmap for high-throughput sequencing studies of wild animal populations using noninvasive samples and hybridization capture

Large-scale genomic studies of wild animal populations are often limited by access to high-quality DNA. Although noninvasive samples, such as faeces, can be readily collected, DNA from the sample producers is usually present in low quantities, fragmented, and contaminated by microorganism and dietary DNAs. Hybridization capture can help to overcome these impediments by increasing the proportion of subject DNA prior to high-throughput sequencing. Here we evaluate a key design variable for hybridization capture, the number of rounds of capture, by testing whether one or two rounds are most appropriate, given varying sample quality (as measured by the ratios of subject to total DNA). We used a set of 1,780 quality-assessed wild chimpanzee (Pan troglodytes schweinfurthii) faecal samples and chose 110 samples of varying quality for exome capture and sequencing. We used multiple regression to assess the effects of the ratio of subject to total DNA (sample quality), rounds of capture and sequencing effort on the number of unique exome reads sequenced. We not only show that one round of capture is preferable when the proportion of subject DNA in a sample is above ~2%–3%, but also explore various types of bias introduced by capture, and develop a model that predicts the sequencing effort necessary for a desired data yield from samples of a given quality. Thus, our results provide a useful guide and pave a methodological way forward for researchers wishing to plan similar hybridization capture studies.

Saved in:
Bibliographic Details
Main Authors: White, Lauren C., Fontsere, Claudia, Lizano, Esther, Hughes, David A., Angedakin, Samuel, Arandjelovic, Mimi, Granjon, Anne‐Céline, Hans, Jörg B., Lester, Jack D., Rabanus-Wallace, M. Timothy, Rowney, Carolyn, Städele, Veronika, Marqués-Bonet, Tomàs, Langergraber, Kevin E., Vigilant, Linda
Other Authors: Agencia Estatal de Investigación (España)
Format: artículo biblioteca
Published: John Wiley & Sons 2019-05
Subjects:Chimpanzees, Conservation genomics, Faecal samples, Population genomics, Target enrichment,
Online Access:http://hdl.handle.net/10261/201293
http://dx.doi.org/10.13039/501100011033
http://dx.doi.org/10.13039/501100004189
http://dx.doi.org/10.13039/501100000780
http://dx.doi.org/10.13039/100000011
http://dx.doi.org/10.13039/501100002809
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Large-scale genomic studies of wild animal populations are often limited by access to high-quality DNA. Although noninvasive samples, such as faeces, can be readily collected, DNA from the sample producers is usually present in low quantities, fragmented, and contaminated by microorganism and dietary DNAs. Hybridization capture can help to overcome these impediments by increasing the proportion of subject DNA prior to high-throughput sequencing. Here we evaluate a key design variable for hybridization capture, the number of rounds of capture, by testing whether one or two rounds are most appropriate, given varying sample quality (as measured by the ratios of subject to total DNA). We used a set of 1,780 quality-assessed wild chimpanzee (Pan troglodytes schweinfurthii) faecal samples and chose 110 samples of varying quality for exome capture and sequencing. We used multiple regression to assess the effects of the ratio of subject to total DNA (sample quality), rounds of capture and sequencing effort on the number of unique exome reads sequenced. We not only show that one round of capture is preferable when the proportion of subject DNA in a sample is above ~2%–3%, but also explore various types of bias introduced by capture, and develop a model that predicts the sequencing effort necessary for a desired data yield from samples of a given quality. Thus, our results provide a useful guide and pave a methodological way forward for researchers wishing to plan similar hybridization capture studies.