Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus)

Bibliographic Details
Authors: Estrada Cañari, Richard; Corredor Arizapana, Flor Anita; Figueroa, Deyanira; Salazar Coronel, Wilian; Quilcate Pairazamán, Carlos Enrique; Vásquez Pérez, Héctor Vladimir; Maicelo Quintana, Jorge Luis; Gonzales, Jhony; Arbizu Berrocal, Carlos Irvin
Format: article
Publication Date: 2022
Institution: Instituto Nacional de Innovación Agraria
Repository: INIA-Institucional
Language: English
OAI Identifier: oai:null:20.500.12955/2054
Resource link: https://hdl.handle.net/20.500.12955/2054
https://doi.org/10.3390/data7110155
Access level: open access
Subject: NGS
Neglected breed
Genome
Reference scaffolding
Microsatellites
https://purl.org/pe-repo/ocde/ford#4.03.01
High-throughput sequencing
Breeds (animals)
Genomes
Description
Summary: The Peruvian creole cattle (PCC) is a neglected breed and an essential livestock resource in the Andean region of Peru. To develop a modern breeding program and conservation strategies for the PCC, a better understanding of the genetics of this breed is needed. We sequenced the whole genome of the PCC using a de novo assembly approach with a paired-end 150 strategy on the Illumina HiSeq 2500 platform, obtaining 320 GB of sequencing data. A reference scaffolding was used to improve the draft genome. The obtained genome size of the PCC was 2.81 Gb with a contig N50 of 108 Mb and 92.59% complete BUSCOs. This genome size is similar to the genome references of Bos taurus and B. indicus. In addition, we identified 40.22% of repetitive DNA of the genome assembly, of which retroelements occupy 32.39% of the total genome. A total of 19,803 protein-coding genes were annotated in the PCC genome. For SSR data mining, we detected similar statistics in comparison with other breeds. The PCC genome will contribute to a better understanding of the genetics of this species and its adaptation to tough conditions in the Andean ecosystem.
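For readers unfamiliar with the N50 statistic quoted in the summary, the following is a minimal illustrative sketch in Python, not the authors' pipeline. N50 is the length L such that sequences of length L or longer together account for at least half of the total assembly length. The function name `n50` and the toy lengths are hypothetical, chosen only for illustration; they are not data from the PCC assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length of the sequence at which the running total,
    taken from longest to shortest, first reaches half of the
    combined length of all sequences.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    ordered = sorted(lengths, reverse=True)   # longest first
    half_total = sum(ordered) / 2             # half of the assembly size
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical toy lengths (arbitrary units), not PCC data.
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs))
```

In practice the lengths would be read from the assembly FASTA file. A higher N50 generally indicates a more contiguous assembly, although it says nothing about correctness, which is why completeness measures such as BUSCO are reported alongside it.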