Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus)
The Peruvian creole cattle (PCC) is a neglected breed and an essential livestock resource in the Andean region of Peru. To develop a modern breeding program and conservation strategies for the PCC, a better understanding of the genetics of this breed is needed. We sequenced the whole genome of the PCC using a de novo assembly approach with a paired-end 150 strategy on the Illumina HiSeq 2500 platform, obtaining 320 GB of sequencing data. A reference scaffolding was used to improve the draft genome. The obtained genome size of the PCC was 2.81 Gb with a contig N50 of 108 Mb and 92.59% complete BUSCOs. This genome size is similar to the genome references of Bos taurus and B. indicus. In addition, we identified 40.22% of repetitive DNA of the genome assembly, of which retroelements occupy 32.39% of the total genome. A total of 19,803 protein-coding genes were annotated in the PCC genome. For SSR data mining, we detected similar statistics in comparison with other breeds. The PCC genome will contribute to a better understanding of the genetics of this species and its adaptation to tough conditions in the Andean ecosystem.
Main Authors: | , , , , , , , , |
---|---|
Format: | info:eu-repo/semantics/article biblioteca |
Language: | eng |
Published: |
MDPI
|
Subjects: | NGS, Neglected breed, Genome, Reference scaffolding, Microsatellites, https://purl.org/pe-repo/ocde/ford#4.03.01, High-throughput sequencing, Breeds (animals), Genomes, |
Online Access: | https://hdl.handle.net/20.500.12955/2054 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
dig-inia-pe-20.500.12955-2054 |
---|---|
record_format |
koha |
spelling |
dig-inia-pe-20.500.12955-20542023-08-23T22:23:32Z Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus) Estrada Cañari, Richard Corredor Arizapana, Flor Anita Figueroa, Deyanira Salazar Coronel, Wilian Quilcate Pairazamán, Carlos Enrique Vásquez Pérez, Héctor Vladimir Maicelo Quintana, Jorge Luis Gonzales, Jhony Arbizu Berrocal, Carlos Irvin NGS Neglected breed Genome Reference scaffolding Microsatellites https://purl.org/pe-repo/ocde/ford#4.03.01 High-throughput sequencing Breeds (animals) Genomes Microsatellites The Peruvian creole cattle (PCC) is a neglected breed and an essential livestock resource in the Andean region of Peru. To develop a modern breeding program and conservation strategies for the PCC, a better understanding of the genetics of this breed is needed. We sequenced the whole genome of the PCC using a de novo assembly approach with a paired-end 150 strategy on the Illumina HiSeq 2500 platform, obtaining 320 GB of sequencing data. A reference scaffolding was used to improve the draft genome. The obtained genome size of the PCC was 2.81 Gb with a contig N50 of 108 Mb and 92.59% complete BUSCOs. This genome size is similar to the genome references of Bos taurus and B. indicus. In addition, we identified 40.22% of repetitive DNA of the genome assembly, of which retroelements occupy 32.39% of the total genome. A total of 19,803 protein-coding genes were annotated in the PCC genome. For SSR data mining, we detected similar statistics in comparison with other breeds. The PCC genome will contribute to a better understanding of the genetics of this species and its adaptation to tough conditions in the Andean ecosystem. 2022-11-09 info:eu-repo/semantics/article https://hdl.handle.net/20.500.12955/2054 eng Data info:eu-repo/semantics/openAccess Attribution-NonCommercial-NoDerivs 3.0 United States http://creativecommons.org/licenses/by-nc-nd/3.0/us/ application/pdf MDPI Instituto Nacional de Innovación Agraria Repositorio Institucional - INIA |
institution |
INIA PE |
collection |
DSpace |
country |
Perú |
countrycode |
PE |
component |
Bibliográfico |
access |
En linea |
databasecode |
dig-inia-pe |
tag |
biblioteca |
region |
America del Sur |
libraryname |
Biblioteca del INIA Perú |
language |
eng |
topic |
NGS Neglected breed Genome Reference scaffolding Microsatellites https://purl.org/pe-repo/ocde/ford#4.03.01 High-throughput sequencing Breeds (animals) Genomes Microsatellites NGS Neglected breed Genome Reference scaffolding Microsatellites https://purl.org/pe-repo/ocde/ford#4.03.01 High-throughput sequencing Breeds (animals) Genomes Microsatellites |
spellingShingle |
NGS Neglected breed Genome Reference scaffolding Microsatellites https://purl.org/pe-repo/ocde/ford#4.03.01 High-throughput sequencing Breeds (animals) Genomes Microsatellites NGS Neglected breed Genome Reference scaffolding Microsatellites https://purl.org/pe-repo/ocde/ford#4.03.01 High-throughput sequencing Breeds (animals) Genomes Microsatellites Estrada Cañari, Richard Corredor Arizapana, Flor Anita Figueroa, Deyanira Salazar Coronel, Wilian Quilcate Pairazamán, Carlos Enrique Vásquez Pérez, Héctor Vladimir Maicelo Quintana, Jorge Luis Gonzales, Jhony Arbizu Berrocal, Carlos Irvin Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus) |
description |
The Peruvian creole cattle (PCC) is a neglected breed and an essential livestock resource in the Andean region of Peru. To develop a modern breeding program and conservation strategies for the PCC, a better understanding of the genetics of this breed is needed. We sequenced the whole genome of the PCC using a de novo assembly approach with a paired-end 150 strategy on the Illumina HiSeq 2500 platform, obtaining 320 GB of sequencing data. A reference scaffolding was used to improve the draft genome. The obtained genome size of the PCC was 2.81 Gb with a contig N50 of 108 Mb and 92.59% complete BUSCOs. This genome size is similar to the genome references of Bos taurus and B. indicus. In addition, we identified 40.22% of repetitive DNA of the genome assembly, of which retroelements occupy 32.39% of the total genome. A total of 19,803 protein-coding genes were annotated in the PCC genome. For SSR data mining, we detected similar statistics in comparison with other breeds. The PCC genome will contribute to a better understanding of the genetics of this species and its adaptation to tough conditions in the Andean ecosystem. |
format |
info:eu-repo/semantics/article |
topic_facet |
NGS Neglected breed Genome Reference scaffolding Microsatellites https://purl.org/pe-repo/ocde/ford#4.03.01 High-throughput sequencing Breeds (animals) Genomes Microsatellites |
author |
Estrada Cañari, Richard Corredor Arizapana, Flor Anita Figueroa, Deyanira Salazar Coronel, Wilian Quilcate Pairazamán, Carlos Enrique Vásquez Pérez, Héctor Vladimir Maicelo Quintana, Jorge Luis Gonzales, Jhony Arbizu Berrocal, Carlos Irvin |
author_facet |
Estrada Cañari, Richard Corredor Arizapana, Flor Anita Figueroa, Deyanira Salazar Coronel, Wilian Quilcate Pairazamán, Carlos Enrique Vásquez Pérez, Héctor Vladimir Maicelo Quintana, Jorge Luis Gonzales, Jhony Arbizu Berrocal, Carlos Irvin |
author_sort |
Estrada Cañari, Richard |
title |
Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus) |
title_short |
Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus) |
title_full |
Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus) |
title_fullStr |
Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus) |
title_full_unstemmed |
Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus) |
title_sort |
reference-guided draft genome assembly, annotation and ssr mining data of the peruvian creole cattle (bos taurus) |
publisher |
MDPI |
url |
https://hdl.handle.net/20.500.12955/2054 |
work_keys_str_mv |
AT estradacanaririchard referenceguideddraftgenomeassemblyannotationandssrminingdataoftheperuviancreolecattlebostaurus AT corredorarizapanafloranita referenceguideddraftgenomeassemblyannotationandssrminingdataoftheperuviancreolecattlebostaurus AT figueroadeyanira referenceguideddraftgenomeassemblyannotationandssrminingdataoftheperuviancreolecattlebostaurus AT salazarcoronelwilian referenceguideddraftgenomeassemblyannotationandssrminingdataoftheperuviancreolecattlebostaurus AT quilcatepairazamancarlosenrique referenceguideddraftgenomeassemblyannotationandssrminingdataoftheperuviancreolecattlebostaurus AT vasquezperezhectorvladimir referenceguideddraftgenomeassemblyannotationandssrminingdataoftheperuviancreolecattlebostaurus AT maiceloquintanajorgeluis referenceguideddraftgenomeassemblyannotationandssrminingdataoftheperuviancreolecattlebostaurus AT gonzalesjhony referenceguideddraftgenomeassemblyannotationandssrminingdataoftheperuviancreolecattlebostaurus AT arbizuberrocalcarlosirvin referenceguideddraftgenomeassemblyannotationandssrminingdataoftheperuviancreolecattlebostaurus |
_version_ |
1781883856948822016 |