TY - JOUR
T1 - A roadmap for high-throughput sequencing studies of wild animal populations using noninvasive samples and hybridization capture
AU - White, Lauren C.
AU - Fontsere, Claudia
AU - Lizano, Esther
AU - Hughes, David A.
AU - Angedakin, Samuel
AU - Arandjelovic, Mimi
AU - Granjon, Anne Céline
AU - Hans, Jörg B.
AU - Lester, Jack D.
AU - Rabanus-Wallace, M. Timothy
AU - Rowney, Carolyn
AU - Städele, Veronika
AU - Marques-Bonet, Tomas
AU - Langergraber, Kevin
AU - Vigilant, Linda
N1 - Funding Information:
We thank Anette Nicklisch, Roland Schroeder, Amy Heilman and Sebastian Ramirez Amaya for assistance in the laboratory, Barbara Schellbach and Antje Weihmann for conducting the Illumina sequencing, Janet Kelso, Johann Visagie and Joshua Schmidt for bio-informatic help and guidance, Thomas Gilbert for sharing library preparation protocols prior to their publication, Richard McElreath and Dieter Lukas for helpful discussion, and Prof. CZvilaArcos and two anonymous reviewers for constructive comments. Collection of samples would not have been possible without the invaluable assistance of the Ngogo Chimpanzee Project Field assistants and snare team members, Lawrence Ndangizi, Godfrey Mbabazi, Alfred Tumusiime, Ambrose Twineomujuni, Brian Kamugyisha, Charles Birungi, Christopher Aliganyira, James Tibisimwa, William Sunday, Braise Mugyisha, David Sunday, Ronald Mugume, Justus Byamugisha, Joseph Tumusiime, James Kyeyune and Snofrex Turyakyira. Funding was provided by the Max Planck Society and the President's Strategic Initiative Fund of ASU. T.M.B. is supported by BFU2017-86471-P (MINECO/FEDER, UE), U01 MH106874 grant, Howard Hughes International Early Career, Obra Social “La Caixa” and Secretaria d'Universitats i Recerca and CERCA Programme del Departament d'Economia i Coneixement de la Generalitat de Catalunya (GRC 2017 SGR 880) and C.F. is supported by a La Caixa PhD Fellowship.
Publisher Copyright:
© 2019 John Wiley & Sons Ltd
PY - 2019/5
Y1 - 2019/5
N2 - Large-scale genomic studies of wild animal populations are often limited by access to high-quality DNA. Although noninvasive samples, such as faeces, can be readily collected, DNA from the sample producers is usually present in low quantities, fragmented, and contaminated by microorganism and dietary DNAs. Hybridization capture can help to overcome these impediments by increasing the proportion of subject DNA prior to high-throughput sequencing. Here we evaluate a key design variable for hybridization capture, the number of rounds of capture, by testing whether one or two rounds are most appropriate, given varying sample quality (as measured by the ratios of subject to total DNA). We used a set of 1,780 quality-assessed wild chimpanzee (Pan troglodytes schweinfurthii) faecal samples and chose 110 samples of varying quality for exome capture and sequencing. We used multiple regression to assess the effects of the ratio of subject to total DNA (sample quality), rounds of capture and sequencing effort on the number of unique exome reads sequenced. We not only show that one round of capture is preferable when the proportion of subject DNA in a sample is above ~2%–3%, but also explore various types of bias introduced by capture, and develop a model that predicts the sequencing effort necessary for a desired data yield from samples of a given quality. Thus, our results provide a useful guide and pave a methodological way forward for researchers wishing to plan similar hybridization capture studies.
AB - Large-scale genomic studies of wild animal populations are often limited by access to high-quality DNA. Although noninvasive samples, such as faeces, can be readily collected, DNA from the sample producers is usually present in low quantities, fragmented, and contaminated by microorganism and dietary DNAs. Hybridization capture can help to overcome these impediments by increasing the proportion of subject DNA prior to high-throughput sequencing. Here we evaluate a key design variable for hybridization capture, the number of rounds of capture, by testing whether one or two rounds are most appropriate, given varying sample quality (as measured by the ratios of subject to total DNA). We used a set of 1,780 quality-assessed wild chimpanzee (Pan troglodytes schweinfurthii) faecal samples and chose 110 samples of varying quality for exome capture and sequencing. We used multiple regression to assess the effects of the ratio of subject to total DNA (sample quality), rounds of capture and sequencing effort on the number of unique exome reads sequenced. We not only show that one round of capture is preferable when the proportion of subject DNA in a sample is above ~2%–3%, but also explore various types of bias introduced by capture, and develop a model that predicts the sequencing effort necessary for a desired data yield from samples of a given quality. Thus, our results provide a useful guide and pave a methodological way forward for researchers wishing to plan similar hybridization capture studies.
KW - chimpanzees
KW - conservation genomics
KW - faecal samples
KW - population genomics
KW - target enrichment
UR - http://www.scopus.com/inward/record.url?scp=85064595943&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85064595943&partnerID=8YFLogxK
U2 - 10.1111/1755-0998.12993
DO - 10.1111/1755-0998.12993
M3 - Article
C2 - 30637963
AN - SCOPUS:85064595943
SN - 1755-098X
VL - 19
SP - 609
EP - 622
JO - Molecular Ecology Resources
JF - Molecular Ecology Resources
IS - 3
ER -