Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus)

Article Description

The Peruvian creole cattle (PCC) is a neglected breed and an essential livestock resource in the Andean region of Peru. To develop a modern breeding program and conservation strategies for the PCC, a better understanding of the genetics of this breed is needed. We sequenced the whole genome of the P...

Bibliographic Details
Authors: Estrada Cañari, Richard; Corredor Arizapana, Flor Anita; Figueroa, Deyanira; Salazar Coronel, Wilian; Quilcate Pairazamán, Carlos Enrique; Vásquez Pérez, Héctor Vladimir; Maicelo Quintana, Jorge Luis; Gonzales, Jhony; Arbizu Berrocal, Carlos Irvin
Format: article
Publication Date: 2022
Institution: Instituto Nacional de Innovación Agraria
Repository: INIA-Institucional
Language: English
OAI Identifier: oai:null:20.500.12955/2054
Resource link: https://hdl.handle.net/20.500.12955/2054
https://doi.org/10.3390/data7110155
Access level: open access
Subject: NGS
Neglected breed
Genome
Reference scaffolding
Microsatellites
https://purl.org/pe-repo/ocde/ford#4.03.01
High-throughput sequencing
Breeds (animals)
Genomes
id INIA_4d08ab5a1021f08e5aa83eda422cb0a9
oai_identifier_str oai:null:20.500.12955/2054
network_acronym_str INIA
network_name_str INIA-Institucional
repository_id_str 4830
dc.title.en.fl_str_mv Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus)
title Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus)
author Estrada Cañari, Richard
author_facet Estrada Cañari, Richard
Corredor Arizapana, Flor Anita
Figueroa, Deyanira
Salazar Coronel, Wilian
Quilcate Pairazamán, Carlos Enrique
Vásquez Pérez, Héctor Vladimir
Maicelo Quintana, Jorge Luis
Gonzales, Jhony
Arbizu Berrocal, Carlos Irvin
author_role author
author2 Corredor Arizapana, Flor Anita
Figueroa, Deyanira
Salazar Coronel, Wilian
Quilcate Pairazamán, Carlos Enrique
Vásquez Pérez, Héctor Vladimir
Maicelo Quintana, Jorge Luis
Gonzales, Jhony
Arbizu Berrocal, Carlos Irvin
author2_role author
author
author
author
author
author
author
author
dc.contributor.author.fl_str_mv Estrada Cañari, Richard
Corredor Arizapana, Flor Anita
Figueroa, Deyanira
Salazar Coronel, Wilian
Quilcate Pairazamán, Carlos Enrique
Vásquez Pérez, Héctor Vladimir
Maicelo Quintana, Jorge Luis
Gonzales, Jhony
Arbizu Berrocal, Carlos Irvin
dc.subject.en.fl_str_mv NGS
Neglected breed
Genome
Reference scaffolding
Microsatellites
topic NGS
Neglected breed
Genome
Reference scaffolding
Microsatellites
https://purl.org/pe-repo/ocde/ford#4.03.01
High-throughput sequencing
Breeds (animals)
Genomes
Microsatellites
dc.subject.ocde.none.fl_str_mv https://purl.org/pe-repo/ocde/ford#4.03.01
dc.subject.agrovoc.en.fl_str_mv High-throughput sequencing
Breeds (animals)
Genomes
Microsatellites
description The Peruvian creole cattle (PCC) is a neglected breed and an essential livestock resource in the Andean region of Peru. To develop a modern breeding program and conservation strategies for the PCC, a better understanding of the genetics of this breed is needed. We sequenced the whole genome of the PCC using a de novo assembly approach with a paired-end 150 strategy on the Illumina HiSeq 2500 platform, obtaining 320 GB of sequencing data. A reference scaffolding was used to improve the draft genome. The obtained genome size of the PCC was 2.81 Gb with a contig N50 of 108 Mb and 92.59% complete BUSCOs. This genome size is similar to the genome references of Bos taurus and B. indicus. In addition, we identified 40.22% of repetitive DNA of the genome assembly, of which retroelements occupy 32.39% of the total genome. A total of 19,803 protein-coding genes were annotated in the PCC genome. For SSR data mining, we detected similar statistics in comparison with other breeds. The PCC genome will contribute to a better understanding of the genetics of this species and its adaptation to tough conditions in the Andean ecosystem.
publishDate 2022
dc.date.accessioned.none.fl_str_mv 2022-12-30T16:07:17Z
dc.date.available.none.fl_str_mv 2022-12-30T16:07:17Z
dc.date.issued.fl_str_mv 2022-11-09
dc.type.none.fl_str_mv info:eu-repo/semantics/article
format article
dc.identifier.citation.es_PE.fl_str_mv Estrada, R.; Corredor, F.; Figueroa, D.; Salazar, W.; Quilcate, C.; Vásquez, H.; Maicelo, J.; Gonzales, J. & Arbizu, C. (2022). Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus). Data 2022, 7, 155. doi: 10.3390/data7110155
dc.identifier.uri.none.fl_str_mv https://hdl.handle.net/20.500.12955/2054
dc.identifier.doi.none.fl_str_mv https://doi.org/10.3390/data7110155
identifier_str_mv Estrada, R.; Corredor, F.; Figueroa, D.; Salazar, W.; Quilcate, C.; Vásquez, H.; Maicelo, J.; Gonzales, J. & Arbizu, C. (2022). Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus). Data 2022, 7, 155. doi: 10.3390/data7110155
url https://hdl.handle.net/20.500.12955/2054
https://doi.org/10.3390/data7110155
dc.language.iso.none.fl_str_mv eng
language eng
dc.relation.ispartofseries.en.fl_str_mv Data
dc.rights.none.fl_str_mv info:eu-repo/semantics/openAccess
Attribution-NonCommercial-NoDerivs 3.0 United States
dc.rights.uri.none.fl_str_mv http://creativecommons.org/licenses/by-nc-nd/3.0/us/
eu_rights_str_mv openAccess
rights_invalid_str_mv Attribution-NonCommercial-NoDerivs 3.0 United States
http://creativecommons.org/licenses/by-nc-nd/3.0/us/
dc.format.none.fl_str_mv application/pdf
dc.publisher.en.fl_str_mv MDPI
dc.publisher.country.none.fl_str_mv CH
dc.source.es_PE.fl_str_mv Instituto Nacional de Innovación Agraria
dc.source.none.fl_str_mv reponame:INIA-Institucional
instname:Instituto Nacional de Innovación Agraria
instacron:INIA
instname_str Instituto Nacional de Innovación Agraria
instacron_str INIA
institution INIA
reponame_str INIA-Institucional
collection INIA-Institucional
dc.source.uri.es_PE.fl_str_mv Repositorio Institucional - INIA
bitstream.url.fl_str_mv https://repositorio.inia.gob.pe/bitstreams/42393529-1d52-405b-9259-439a6d598dd8/download
https://repositorio.inia.gob.pe/bitstreams/07bd7762-c59b-489b-8e83-ec5059b43f7b/download
https://repositorio.inia.gob.pe/bitstreams/39896f56-b00e-4b6b-9a96-1996b7f748c6/download
https://repositorio.inia.gob.pe/bitstreams/b88cf00e-5cbd-4013-aeeb-3730986c738f/download
https://repositorio.inia.gob.pe/bitstreams/7e336c2d-d375-499c-9949-57ce1c5ffb46/download
bitstream.checksum.fl_str_mv a65407a7e9e5287500676d22714b473e
73abee61e377f73f1d5fc0522cf9cde0
8a4605be74aa9ea9d79846c1fba20a33
f9b54215e6c26c33ec8e3caceaf9d323
fe62bc17bbf484a853206d8e959b4dbc
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
MD5
MD5
repository.name.fl_str_mv Repositorio Institucional INIA
repository.mail.fl_str_mv repositorio@inia.gob.pe
_version_ 1833331627085791232
Important note:
The information contained in this record is the sole responsibility of the institution that manages the institutional repository where this document or dataset is held. CONCYTEC is not responsible for the content (publications and/or data) accessible through the National Digital Repository of Science, Technology and Innovation of Open Access (ALICIA).