Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus)
Descripción del Articulo
The Peruvian creole cattle (PCC) is a neglected breed and an essential livestock resource in the Andean region of Peru. To develop a modern breeding program and conservation strategies for the PCC, a better understanding of the genetics of this breed is needed. We sequenced the whole genome of the P...
Autores: | , , , , , , , , |
---|---|
Formato: | artículo |
Fecha de Publicación: | 2022 |
Institución: | Instituto Nacional de Innovación Agraria |
Repositorio: | INIA-Institucional |
Lenguaje: | inglés |
OAI Identifier: | oai:null:20.500.12955/2054 |
Enlace del recurso: | https://hdl.handle.net/20.500.12955/2054 https://doi.org/10.3390/data7110155 |
Nivel de acceso: | acceso abierto |
Materia: | NGS Neglected breed Genome Reference scaffolding Microsatellites https://purl.org/pe-repo/ocde/ford#4.03.01 High-throughput sequencing Breeds (animals) Genomes |
id |
INIA_4d08ab5a1021f08e5aa83eda422cb0a9 |
---|---|
oai_identifier_str |
oai:null:20.500.12955/2054 |
network_acronym_str |
INIA |
network_name_str |
INIA-Institucional |
repository_id_str |
4830 |
dc.title.en.fl_str_mv |
Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus) |
title |
Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus) |
spellingShingle |
Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus) Estrada Cañari, Richard NGS Neglected breed Genome Reference scaffolding Microsatellites https://purl.org/pe-repo/ocde/ford#4.03.01 High-throughput sequencing Breeds (animals) Genomes Microsatellites |
title_short |
Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus) |
title_full |
Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus) |
title_fullStr |
Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus) |
title_full_unstemmed |
Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus) |
title_sort |
Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus) |
author |
Estrada Cañari, Richard |
author_facet |
Estrada Cañari, Richard Corredor Arizapana, Flor Anita Figueroa, Deyanira Salazar Coronel, Wilian Quilcate Pairazamán, Carlos Enrique Vásquez Pérez, Héctor Vladimir Maicelo Quintana, Jorge Luis Gonzales, Jhony Arbizu Berrocal, Carlos Irvin |
author_role |
author |
author2 |
Corredor Arizapana, Flor Anita Figueroa, Deyanira Salazar Coronel, Wilian Quilcate Pairazamán, Carlos Enrique Vásquez Pérez, Héctor Vladimir Maicelo Quintana, Jorge Luis Gonzales, Jhony Arbizu Berrocal, Carlos Irvin |
author2_role |
author author author author author author author author |
dc.contributor.author.fl_str_mv |
Estrada Cañari, Richard Corredor Arizapana, Flor Anita Figueroa, Deyanira Salazar Coronel, Wilian Quilcate Pairazamán, Carlos Enrique Vásquez Pérez, Héctor Vladimir Maicelo Quintana, Jorge Luis Gonzales, Jhony Arbizu Berrocal, Carlos Irvin |
dc.subject.en.fl_str_mv |
NGS Neglected breed Genome Reference scaffolding Microsatellites |
topic |
NGS Neglected breed Genome Reference scaffolding Microsatellites https://purl.org/pe-repo/ocde/ford#4.03.01 High-throughput sequencing Breeds (animals) Genomes Microsatellites |
dc.subject.ocde.none.fl_str_mv |
https://purl.org/pe-repo/ocde/ford#4.03.01 |
dc.subject.agrovoc.en.fl_str_mv |
High-throughput sequencing Breeds (animals) Genomes Microsatellites |
description |
The Peruvian creole cattle (PCC) is a neglected breed and an essential livestock resource in the Andean region of Peru. To develop a modern breeding program and conservation strategies for the PCC, a better understanding of the genetics of this breed is needed. We sequenced the whole genome of the PCC using a de novo assembly approach with a paired-end 150 strategy on the Illumina HiSeq 2500 platform, obtaining 320 GB of sequencing data. A reference scaffolding was used to improve the draft genome. The obtained genome size of the PCC was 2.81 Gb with a contig N50 of 108 Mb and 92.59% complete BUSCOs. This genome size is similar to the genome references of Bos taurus and B. indicus. In addition, we identified 40.22% of repetitive DNA of the genome assembly, of which retroelements occupy 32.39% of the total genome. A total of 19,803 protein-coding genes were annotated in the PCC genome. For SSR data mining, we detected similar statistics in comparison with other breeds. The PCC genome will contribute to a better understanding of the genetics of this species and its adaptation to tough conditions in the Andean ecosystem. |
publishDate |
2022 |
dc.date.accessioned.none.fl_str_mv |
2022-12-30T16:07:17Z |
dc.date.available.none.fl_str_mv |
2022-12-30T16:07:17Z |
dc.date.issued.fl_str_mv |
2022-11-09 |
dc.type.none.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
dc.identifier.citation.es_PE.fl_str_mv |
Estrada, R.; Corredor, F.; Figueroa, D.; Salazar, W.; Quilcate, C.; Vásquez, H.; Maicelo, J.; Gonzales, J. & Arbizu, C. (2022). Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus). Data 2022, 7, 155. doi: 10.3390/data7110155 |
dc.identifier.uri.none.fl_str_mv |
https://hdl.handle.net/20.500.12955/2054 |
dc.identifier.doi.none.fl_str_mv |
https://doi.org/10.3390/data7110155 |
identifier_str_mv |
Estrada, R.; Corredor, F.; Figueroa, D.; Salazar, W.; Quilcate, C.; Vásquez, H.; Maicelo, J.; Gonzales, J. & Arbizu, C. (2022). Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus). Data 2022, 7, 155. doi: 10.3390/data7110155 |
url |
https://hdl.handle.net/20.500.12955/2054 https://doi.org/10.3390/data7110155 |
dc.language.iso.none.fl_str_mv |
eng |
language |
eng |
dc.relation.ispartofseries.en.fl_str_mv |
Data |
dc.rights.none.fl_str_mv |
info:eu-repo/semantics/openAccess Attribution-NonCommercial-NoDerivs 3.0 United States |
dc.rights.uri.none.fl_str_mv |
http://creativecommons.org/licenses/by-nc-nd/3.0/us/ |
eu_rights_str_mv |
openAccess |
rights_invalid_str_mv |
Attribution-NonCommercial-NoDerivs 3.0 United States http://creativecommons.org/licenses/by-nc-nd/3.0/us/ |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.en.fl_str_mv |
MDPI |
dc.publisher.country.none.fl_str_mv |
CH |
dc.source.es_PE.fl_str_mv |
Instituto Nacional de Innovación Agraria |
dc.source.none.fl_str_mv |
reponame:INIA-Institucional instname:Instituto Nacional de Innovación Agraria instacron:INIA |
instname_str |
Instituto Nacional de Innovación Agraria |
instacron_str |
INIA |
institution |
INIA |
reponame_str |
INIA-Institucional |
collection |
INIA-Institucional |
dc.source.uri.es_PE.fl_str_mv |
Repositorio Institucional - INIA |
bitstream.url.fl_str_mv |
https://repositorio.inia.gob.pe/bitstreams/42393529-1d52-405b-9259-439a6d598dd8/download https://repositorio.inia.gob.pe/bitstreams/07bd7762-c59b-489b-8e83-ec5059b43f7b/download https://repositorio.inia.gob.pe/bitstreams/39896f56-b00e-4b6b-9a96-1996b7f748c6/download https://repositorio.inia.gob.pe/bitstreams/b88cf00e-5cbd-4013-aeeb-3730986c738f/download https://repositorio.inia.gob.pe/bitstreams/7e336c2d-d375-499c-9949-57ce1c5ffb46/download |
bitstream.checksum.fl_str_mv |
a65407a7e9e5287500676d22714b473e 73abee61e377f73f1d5fc0522cf9cde0 8a4605be74aa9ea9d79846c1fba20a33 f9b54215e6c26c33ec8e3caceaf9d323 fe62bc17bbf484a853206d8e959b4dbc |
bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 MD5 MD5 MD5 |
repository.name.fl_str_mv |
Repositorio Institucional INIA |
repository.mail.fl_str_mv |
repositorio@inia.gob.pe |
_version_ |
1833331627085791232 |
spelling |
Estrada Cañari, RichardCorredor Arizapana, Flor AnitaFigueroa, DeyaniraSalazar Coronel, WilianQuilcate Pairazamán, Carlos EnriqueVásquez Pérez, Héctor VladimirMaicelo Quintana, Jorge LuisGonzales, JhonyArbizu Berrocal, Carlos Irvin2022-12-30T16:07:17Z2022-12-30T16:07:17Z2022-11-09Estrada, R.; Corredor, F.; Figueroa, D.; Salazar, W.; Quilcate, C.; Vásquez, H.; Maicelo, J.; Gonzales, J. & Arbizu, C. (2022). Reference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus). Data 2022, 7, 155. doi: 10.3390/data7110155https://hdl.handle.net/20.500.12955/2054https://doi.org/10.3390/data7110155The Peruvian creole cattle (PCC) is a neglected breed and an essential livestock resource in the Andean region of Peru. To develop a modern breeding program and conservation strategies for the PCC, a better understanding of the genetics of this breed is needed. We sequenced the whole genome of the PCC using a de novo assembly approach with a paired-end 150 strategy on the Illumina HiSeq 2500 platform, obtaining 320 GB of sequencing data. A reference scaffolding was used to improve the draft genome. The obtained genome size of the PCC was 2.81 Gb with a contig N50 of 108 Mb and 92.59% complete BUSCOs. This genome size is similar to the genome references of Bos taurus and B. indicus. In addition, we identified 40.22% of repetitive DNA of the genome assembly, of which retroelements occupy 32.39% of the total genome. A total of 19,803 protein-coding genes were annotated in the PCC genome. For SSR data mining, we detected similar statistics in comparison with other breeds. The PCC genome will contribute to a better understanding of the genetics of this species and its adaptation to tough conditions in the Andean ecosystem.application/pdfengMDPICHDatainfo:eu-repo/semantics/openAccessAttribution-NonCommercial-NoDerivs 3.0 United Stateshttp://creativecommons.org/licenses/by-nc-nd/3.0/us/Instituto Nacional de Innovación AgrariaRepositorio Institucional - INIAreponame:INIA-Institucionalinstname:Instituto Nacional de Innovación Agrariainstacron:INIANGSNeglected breedGenomeReference scaffoldingMicrosatelliteshttps://purl.org/pe-repo/ocde/ford#4.03.01High-throughput sequencingBreeds (animals)GenomesMicrosatellitesReference-Guided Draft Genome Assembly, Annotation and SSR Mining Data of the Peruvian Creole Cattle (Bos taurus)info:eu-repo/semantics/article711ORIGINALEstrada-et-al_2022_Bos-taurus_Genome.pdfEstrada-et-al_2022_Bos-taurus_Genome.pdfapplication/pdf1796441https://repositorio.inia.gob.pe/bitstreams/42393529-1d52-405b-9259-439a6d598dd8/downloada65407a7e9e5287500676d22714b473eMD51CC-LICENSElicense_rdflicense_rdfapplication/rdf+xml; charset=utf-8810https://repositorio.inia.gob.pe/bitstreams/07bd7762-c59b-489b-8e83-ec5059b43f7b/download73abee61e377f73f1d5fc0522cf9cde0MD52LICENSElicense.txtlicense.txttext/plain; charset=utf-81748https://repositorio.inia.gob.pe/bitstreams/39896f56-b00e-4b6b-9a96-1996b7f748c6/download8a4605be74aa9ea9d79846c1fba20a33MD53TEXTEstrada-et-al_2022_Bos-taurus_Genome.pdf.txtEstrada-et-al_2022_Bos-taurus_Genome.pdf.txtExtracted texttext/plain40976https://repositorio.inia.gob.pe/bitstreams/b88cf00e-5cbd-4013-aeeb-3730986c738f/downloadf9b54215e6c26c33ec8e3caceaf9d323MD54THUMBNAILEstrada-et-al_2022_Bos-taurus_Genome.pdf.jpgEstrada-et-al_2022_Bos-taurus_Genome.pdf.jpgGenerated Thumbnailimage/jpeg1604https://repositorio.inia.gob.pe/bitstreams/7e336c2d-d375-499c-9949-57ce1c5ffb46/downloadfe62bc17bbf484a853206d8e959b4dbcMD5520.500.12955/2054oai:repositorio.inia.gob.pe:20.500.12955/20542023-08-23 17:23:32.311http://creativecommons.org/licenses/by-nc-nd/3.0/us/info:eu-repo/semantics/openAccessopen.accesshttps://repositorio.inia.gob.peRepositorio Institucional INIArepositorio@inia.gob.peTk9URTogUExBQ0UgWU9VUiBPV04gTElDRU5TRSBIRVJFClRoaXMgc2FtcGxlIGxpY2Vuc2UgaXMgcHJvdmlkZWQgZm9yIGluZm9ybWF0aW9uYWwgcHVycG9zZXMgb25seS4KCk5PTi1FWENMVVNJVkUgRElTVFJJQlVUSU9OIExJQ0VOU0UKCkJ5IHNpZ25pbmcgYW5kIHN1Ym1pdHRpbmcgdGhpcyBsaWNlbnNlLCB5b3UgKHRoZSBhdXRob3Iocykgb3IgY29weXJpZ2h0Cm93bmVyKSBncmFudHMgdG8gRFNwYWNlIFVuaXZlcnNpdHkgKERTVSkgdGhlIG5vbi1leGNsdXNpdmUgcmlnaHQgdG8gcmVwcm9kdWNlLAp0cmFuc2xhdGUgKGFzIGRlZmluZWQgYmVsb3cpLCBhbmQvb3IgZGlzdHJpYnV0ZSB5b3VyIHN1Ym1pc3Npb24gKGluY2x1ZGluZwp0aGUgYWJzdHJhY3QpIHdvcmxkd2lkZSBpbiBwcmludCBhbmQgZWxlY3Ryb25pYyBmb3JtYXQgYW5kIGluIGFueSBtZWRpdW0sCmluY2x1ZGluZyBidXQgbm90IGxpbWl0ZWQgdG8gYXVkaW8gb3IgdmlkZW8uCgpZb3UgYWdyZWUgdGhhdCBEU1UgbWF5LCB3aXRob3V0IGNoYW5naW5nIHRoZSBjb250ZW50LCB0cmFuc2xhdGUgdGhlCnN1Ym1pc3Npb24gdG8gYW55IG1lZGl1bSBvciBmb3JtYXQgZm9yIHRoZSBwdXJwb3NlIG9mIHByZXNlcnZhdGlvbi4KCllvdSBhbHNvIGFncmVlIHRoYXQgRFNVIG1heSBrZWVwIG1vcmUgdGhhbiBvbmUgY29weSBvZiB0aGlzIHN1Ym1pc3Npb24gZm9yCnB1cnBvc2VzIG9mIHNlY3VyaXR5LCBiYWNrLXVwIGFuZCBwcmVzZXJ2YXRpb24uCgpZb3UgcmVwcmVzZW50IHRoYXQgdGhlIHN1Ym1pc3Npb24gaXMgeW91ciBvcmlnaW5hbCB3b3JrLCBhbmQgdGhhdCB5b3UgaGF2ZQp0aGUgcmlnaHQgdG8gZ3JhbnQgdGhlIHJpZ2h0cyBjb250YWluZWQgaW4gdGhpcyBsaWNlbnNlLiBZb3UgYWxzbyByZXByZXNlbnQKdGhhdCB5b3VyIHN1Ym1pc3Npb24gZG9lcyBub3QsIHRvIHRoZSBiZXN0IG9mIHlvdXIga25vd2xlZGdlLCBpbmZyaW5nZSB1cG9uCmFueW9uZSdzIGNvcHlyaWdodC4KCklmIHRoZSBzdWJtaXNzaW9uIGNvbnRhaW5zIG1hdGVyaWFsIGZvciB3aGljaCB5b3UgZG8gbm90IGhvbGQgY29weXJpZ2h0LAp5b3UgcmVwcmVzZW50IHRoYXQgeW91IGhhdmUgb2J0YWluZWQgdGhlIHVucmVzdHJpY3RlZCBwZXJtaXNzaW9uIG9mIHRoZQpjb3B5cmlnaHQgb3duZXIgdG8gZ3JhbnQgRFNVIHRoZSByaWdodHMgcmVxdWlyZWQgYnkgdGhpcyBsaWNlbnNlLCBhbmQgdGhhdApzdWNoIHRoaXJkLXBhcnR5IG93bmVkIG1hdGVyaWFsIGlzIGNsZWFybHkgaWRlbnRpZmllZCBhbmQgYWNrbm93bGVkZ2VkCndpdGhpbiB0aGUgdGV4dCBvciBjb250ZW50IG9mIHRoZSBzdWJtaXNzaW9uLgoKSUYgVEhFIFNVQk1JU1NJT04gSVMgQkFTRUQgVVBPTiBXT1JLIFRIQVQgSEFTIEJFRU4gU1BPTlNPUkVEIE9SIFNVUFBPUlRFRApCWSBBTiBBR0VOQ1kgT1IgT1JHQU5JWkFUSU9OIE9USEVSIFRIQU4gRFNVLCBZT1UgUkVQUkVTRU5UIFRIQVQgWU9VIEhBVkUKRlVMRklMTEVEIEFOWSBSSUdIVCBPRiBSRVZJRVcgT1IgT1RIRVIgT0JMSUdBVElPTlMgUkVRVUlSRUQgQlkgU1VDSApDT05UUkFDVCBPUiBBR1JFRU1FTlQuCgpEU1Ugd2lsbCBjbGVhcmx5IGlkZW50aWZ5IHlvdXIgbmFtZShzKSBhcyB0aGUgYXV0aG9yKHMpIG9yIG93bmVyKHMpIG9mIHRoZQpzdWJtaXNzaW9uLCBhbmQgd2lsbCBub3QgbWFrZSBhbnkgYWx0ZXJhdGlvbiwgb3RoZXIgdGhhbiBhcyBhbGxvd2VkIGJ5IHRoaXMKbGljZW5zZSwgdG8geW91ciBzdWJtaXNzaW9uLgo= |
score |
13.95948 |
Nota importante:
La información contenida en este registro es de entera responsabilidad de la institución que gestiona el repositorio institucional donde esta contenido este documento o set de datos. El CONCYTEC no se hace responsable por los contenidos (publicaciones y/o datos) accesibles a través del Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).
La información contenida en este registro es de entera responsabilidad de la institución que gestiona el repositorio institucional donde esta contenido este documento o set de datos. El CONCYTEC no se hace responsable por los contenidos (publicaciones y/o datos) accesibles a través del Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).