Neural Network Strategies and Models for Voice Cloning in a Multi-speaker Mode: An Overview
Descripción del Articulo
The evolution of data science and the constant challenge of carrying out different processes using a few resources with simultaneous personalization has promoted interest in the development of voice cloning. Nowadays, different machine learning techniques are used, given their efficiency in generati...
| Autores: | , , , , , |
|---|---|
| Formato: | artículo |
| Fecha de Publicación: | 2023 |
| Institución: | Universidad Peruana de Ciencias Aplicadas |
| Repositorio: | UPC-Institucional |
| Lenguaje: | inglés |
| OAI Identifier: | oai:repositorioacademico.upc.edu.pe:10757/669499 |
| Enlace del recurso: | https://doi.org/10.1007/978-981-99-1912-3_21 http://hdl.handle.net/10757/669499 |
| Nivel de acceso: | acceso embargado |
| Materia: | multi-speaker neural networks Voice cloning https://purl.org/pe-repo/ocde/ford#3.00.00 |
| id |
UUPC_fb9c05cf37f86e8f09491abe1be9b26b |
|---|---|
| oai_identifier_str |
oai:repositorioacademico.upc.edu.pe:10757/669499 |
| network_acronym_str |
UUPC |
| network_name_str |
UPC-Institucional |
| repository_id_str |
2670 |
| dc.title.es_PE.fl_str_mv |
Neural Network Strategies and Models for Voice Cloning in a Multi-speaker Mode: An Overview |
| title |
Neural Network Strategies and Models for Voice Cloning in a Multi-speaker Mode: An Overview |
| spellingShingle |
Neural Network Strategies and Models for Voice Cloning in a Multi-speaker Mode: An Overview Fura-Mendoza, Marco multi-speaker neural networks Voice cloning https://purl.org/pe-repo/ocde/ford#3.00.00 |
| title_short |
Neural Network Strategies and Models for Voice Cloning in a Multi-speaker Mode: An Overview |
| title_full |
Neural Network Strategies and Models for Voice Cloning in a Multi-speaker Mode: An Overview |
| title_fullStr |
Neural Network Strategies and Models for Voice Cloning in a Multi-speaker Mode: An Overview |
| title_full_unstemmed |
Neural Network Strategies and Models for Voice Cloning in a Multi-speaker Mode: An Overview |
| title_sort |
Neural Network Strategies and Models for Voice Cloning in a Multi-speaker Mode: An Overview |
| author |
Fura-Mendoza, Marco |
| author_facet |
Fura-Mendoza, Marco Moscol-Albañil, Isabel Rodriguez, Ciro Lezama, Pedro Rodriguez, Diego Pomachagua, Yuri |
| author_role |
author |
| author2 |
Moscol-Albañil, Isabel Rodriguez, Ciro Lezama, Pedro Rodriguez, Diego Pomachagua, Yuri |
| author2_role |
author author author author author |
| dc.contributor.author.fl_str_mv |
Fura-Mendoza, Marco Moscol-Albañil, Isabel Rodriguez, Ciro Lezama, Pedro Rodriguez, Diego Pomachagua, Yuri |
| dc.subject.es_PE.fl_str_mv |
multi-speaker neural networks Voice cloning |
| topic |
multi-speaker neural networks Voice cloning https://purl.org/pe-repo/ocde/ford#3.00.00 |
| dc.subject.ocde.none.fl_str_mv |
https://purl.org/pe-repo/ocde/ford#3.00.00 |
| description |
The evolution of data science and the constant challenge of carrying out different processes using a few resources with simultaneous personalization has promoted interest in the development of voice cloning. Nowadays, different machine learning techniques are used, given their efficiency in generating relationships across multiple parameters. In this regard, we evaluated the best-performing models and the different process optimization strategies within this sector, where through neural network models separated modularly by their functionality, it is possible to generate independent processes taking into account the most significant number of linguistic factors in the generation of the voice, thus obtaining significant results of a clear improvement in the whole process of synthesizing the voice of a target speaker. |
| publishDate |
2023 |
| dc.date.accessioned.none.fl_str_mv |
2023-11-28T15:38:34Z |
| dc.date.available.none.fl_str_mv |
2023-11-28T15:38:34Z |
| dc.date.issued.fl_str_mv |
2023-01-01 |
| dc.type.es_PE.fl_str_mv |
info:eu-repo/semantics/article |
| dc.type.version.none.fl_str_mv |
http://purl.org/coar/version/c_970fb48d4fbd8a1738 |
| format |
article |
| dc.identifier.issn.none.fl_str_mv |
23673370 |
| dc.identifier.doi.none.fl_str_mv |
https://doi.org/10.1007/978-981-99-1912-3_21 |
| dc.identifier.uri.none.fl_str_mv |
http://hdl.handle.net/10757/669499 |
| dc.identifier.eissn.none.fl_str_mv |
23673389 |
| dc.identifier.journal.es_PE.fl_str_mv |
Lecture Notes in Networks and Systems |
| dc.identifier.eid.none.fl_str_mv |
2-s2.0-85171139460 |
| dc.identifier.scopusid.none.fl_str_mv |
SCOPUS_ID:85171139460 |
| dc.identifier.isni.none.fl_str_mv |
0000 0001 2196 144X |
| dc.identifier.ror.none.fl_str_mv |
047xrr705 |
| identifier_str_mv |
23673370 23673389 Lecture Notes in Networks and Systems 2-s2.0-85171139460 SCOPUS_ID:85171139460 0000 0001 2196 144X 047xrr705 |
| url |
https://doi.org/10.1007/978-981-99-1912-3_21 http://hdl.handle.net/10757/669499 |
| dc.language.iso.es_PE.fl_str_mv |
eng |
| language |
eng |
| dc.rights.es_PE.fl_str_mv |
info:eu-repo/semantics/embargoedAccess |
| eu_rights_str_mv |
embargoedAccess |
| dc.format.es_PE.fl_str_mv |
application/html |
| dc.publisher.es_PE.fl_str_mv |
Springer Science and Business Media Deutschland GmbH |
| dc.source.es_PE.fl_str_mv |
Universidad Peruana de Ciencias Aplicadas (UPC) Repositorio Academico - UPC |
| dc.source.none.fl_str_mv |
reponame:UPC-Institucional instname:Universidad Peruana de Ciencias Aplicadas instacron:UPC |
| instname_str |
Universidad Peruana de Ciencias Aplicadas |
| instacron_str |
UPC |
| institution |
UPC |
| reponame_str |
UPC-Institucional |
| collection |
UPC-Institucional |
| dc.source.journaltitle.none.fl_str_mv |
Lecture Notes in Networks and Systems |
| dc.source.volume.none.fl_str_mv |
685 LNNS |
| dc.source.beginpage.none.fl_str_mv |
229 |
| dc.source.endpage.none.fl_str_mv |
237 |
| bitstream.url.fl_str_mv |
https://repositorioacademico.upc.edu.pe/bitstream/10757/669499/1/license.txt |
| bitstream.checksum.fl_str_mv |
8a4605be74aa9ea9d79846c1fba20a33 |
| bitstream.checksumAlgorithm.fl_str_mv |
MD5 |
| repository.name.fl_str_mv |
Repositorio Académico UPC |
| repository.mail.fl_str_mv |
upc@openrepository.com |
| _version_ |
1863824046512144384 |
| spelling |
27a4dbd0a508eb478a0c2760c3d763f43006e7e5523ab88f904603e7ee314b055265001481cf04c578bc015c4403f7826ae2f2062a7deb0d67c2c464054d24853cd7215008001a598412e418a24a4afd84d68f628500040bf31ff65f6806d23fcaf71e9ea5d3500Fura-Mendoza, MarcoMoscol-Albañil, IsabelRodriguez, CiroLezama, PedroRodriguez, DiegoPomachagua, Yuri2023-11-28T15:38:34Z2023-11-28T15:38:34Z2023-01-0123673370https://doi.org/10.1007/978-981-99-1912-3_21http://hdl.handle.net/10757/66949923673389Lecture Notes in Networks and Systems2-s2.0-85171139460SCOPUS_ID:851711394600000 0001 2196 144X047xrr705The evolution of data science and the constant challenge of carrying out different processes using a few resources with simultaneous personalization has promoted interest in the development of voice cloning. Nowadays, different machine learning techniques are used, given their efficiency in generating relationships across multiple parameters. In this regard, we evaluated the best-performing models and the different process optimization strategies within this sector, where through neural network models separated modularly by their functionality, it is possible to generate independent processes taking into account the most significant number of linguistic factors in the generation of the voice, thus obtaining significant results of a clear improvement in the whole process of synthesizing the voice of a target speaker.Revisión por paresODS 9: Industria, Innovación e InfraestructuraODS 4: Educación de CalidadODS 10: Reducción de las Desigualdadesapplication/htmlengSpringer Science and Business Media Deutschland GmbHinfo:eu-repo/semantics/embargoedAccessUniversidad Peruana de Ciencias Aplicadas (UPC)Repositorio Academico - UPCLecture Notes in Networks and Systems685 LNNS229237reponame:UPC-Institucionalinstname:Universidad Peruana de Ciencias Aplicadasinstacron:UPCmulti-speakerneural networksVoice cloninghttps://purl.org/pe-repo/ocde/ford#3.00.00Neural Network Strategies and Models for Voice Cloning in a Multi-speaker Mode: An Overviewinfo:eu-repo/semantics/articlehttp://purl.org/coar/version/c_970fb48d4fbd8a1738LICENSElicense.txtlicense.txttext/plain; charset=utf-81748https://repositorioacademico.upc.edu.pe/bitstream/10757/669499/1/license.txt8a4605be74aa9ea9d79846c1fba20a33MD51false10757/669499oai:repositorioacademico.upc.edu.pe:10757/6694992026-02-17 17:46:17.067Repositorio Académico UPCupc@openrepository.comTk9URTogUExBQ0UgWU9VUiBPV04gTElDRU5TRSBIRVJFClRoaXMgc2FtcGxlIGxpY2Vuc2UgaXMgcHJvdmlkZWQgZm9yIGluZm9ybWF0aW9uYWwgcHVycG9zZXMgb25seS4KCk5PTi1FWENMVVNJVkUgRElTVFJJQlVUSU9OIExJQ0VOU0UKCkJ5IHNpZ25pbmcgYW5kIHN1Ym1pdHRpbmcgdGhpcyBsaWNlbnNlLCB5b3UgKHRoZSBhdXRob3Iocykgb3IgY29weXJpZ2h0Cm93bmVyKSBncmFudHMgdG8gRFNwYWNlIFVuaXZlcnNpdHkgKERTVSkgdGhlIG5vbi1leGNsdXNpdmUgcmlnaHQgdG8gcmVwcm9kdWNlLAp0cmFuc2xhdGUgKGFzIGRlZmluZWQgYmVsb3cpLCBhbmQvb3IgZGlzdHJpYnV0ZSB5b3VyIHN1Ym1pc3Npb24gKGluY2x1ZGluZwp0aGUgYWJzdHJhY3QpIHdvcmxkd2lkZSBpbiBwcmludCBhbmQgZWxlY3Ryb25pYyBmb3JtYXQgYW5kIGluIGFueSBtZWRpdW0sCmluY2x1ZGluZyBidXQgbm90IGxpbWl0ZWQgdG8gYXVkaW8gb3IgdmlkZW8uCgpZb3UgYWdyZWUgdGhhdCBEU1UgbWF5LCB3aXRob3V0IGNoYW5naW5nIHRoZSBjb250ZW50LCB0cmFuc2xhdGUgdGhlCnN1Ym1pc3Npb24gdG8gYW55IG1lZGl1bSBvciBmb3JtYXQgZm9yIHRoZSBwdXJwb3NlIG9mIHByZXNlcnZhdGlvbi4KCllvdSBhbHNvIGFncmVlIHRoYXQgRFNVIG1heSBrZWVwIG1vcmUgdGhhbiBvbmUgY29weSBvZiB0aGlzIHN1Ym1pc3Npb24gZm9yCnB1cnBvc2VzIG9mIHNlY3VyaXR5LCBiYWNrLXVwIGFuZCBwcmVzZXJ2YXRpb24uCgpZb3UgcmVwcmVzZW50IHRoYXQgdGhlIHN1Ym1pc3Npb24gaXMgeW91ciBvcmlnaW5hbCB3b3JrLCBhbmQgdGhhdCB5b3UgaGF2ZQp0aGUgcmlnaHQgdG8gZ3JhbnQgdGhlIHJpZ2h0cyBjb250YWluZWQgaW4gdGhpcyBsaWNlbnNlLiBZb3UgYWxzbyByZXByZXNlbnQKdGhhdCB5b3VyIHN1Ym1pc3Npb24gZG9lcyBub3QsIHRvIHRoZSBiZXN0IG9mIHlvdXIga25vd2xlZGdlLCBpbmZyaW5nZSB1cG9uCmFueW9uZSdzIGNvcHlyaWdodC4KCklmIHRoZSBzdWJtaXNzaW9uIGNvbnRhaW5zIG1hdGVyaWFsIGZvciB3aGljaCB5b3UgZG8gbm90IGhvbGQgY29weXJpZ2h0LAp5b3UgcmVwcmVzZW50IHRoYXQgeW91IGhhdmUgb2J0YWluZWQgdGhlIHVucmVzdHJpY3RlZCBwZXJtaXNzaW9uIG9mIHRoZQpjb3B5cmlnaHQgb3duZXIgdG8gZ3JhbnQgRFNVIHRoZSByaWdodHMgcmVxdWlyZWQgYnkgdGhpcyBsaWNlbnNlLCBhbmQgdGhhdApzdWNoIHRoaXJkLXBhcnR5IG93bmVkIG1hdGVyaWFsIGlzIGNsZWFybHkgaWRlbnRpZmllZCBhbmQgYWNrbm93bGVkZ2VkCndpdGhpbiB0aGUgdGV4dCBvciBjb250ZW50IG9mIHRoZSBzdWJtaXNzaW9uLgoKSUYgVEhFIFNVQk1JU1NJT04gSVMgQkFTRUQgVVBPTiBXT1JLIFRIQVQgSEFTIEJFRU4gU1BPTlNPUkVEIE9SIFNVUFBPUlRFRApCWSBBTiBBR0VOQ1kgT1IgT1JHQU5JWkFUSU9OIE9USEVSIFRIQU4gRFNVLCBZT1UgUkVQUkVTRU5UIFRIQVQgWU9VIEhBVkUKRlVMRklMTEVEIEFOWSBSSUdIVCBPRiBSRVZJRVcgT1IgT1RIRVIgT0JMSUdBVElPTlMgUkVRVUlSRUQgQlkgU1VDSApDT05UUkFDVCBPUiBBR1JFRU1FTlQuCgpEU1Ugd2lsbCBjbGVhcmx5IGlkZW50aWZ5IHlvdXIgbmFtZShzKSBhcyB0aGUgYXV0aG9yKHMpIG9yIG93bmVyKHMpIG9mIHRoZQpzdWJtaXNzaW9uLCBhbmQgd2lsbCBub3QgbWFrZSBhbnkgYWx0ZXJhdGlvbiwgb3RoZXIgdGhhbiBhcyBhbGxvd2VkIGJ5IHRoaXMKbGljZW5zZSwgdG8geW91ciBzdWJtaXNzaW9uLgo= |
| score |
13.911679 |
Nota importante:
La información contenida en este registro es de entera responsabilidad de la institución que gestiona el repositorio institucional donde esta contenido este documento o set de datos. El CONCYTEC no se hace responsable por los contenidos (publicaciones y/o datos) accesibles a través del Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).
La información contenida en este registro es de entera responsabilidad de la institución que gestiona el repositorio institucional donde esta contenido este documento o set de datos. El CONCYTEC no se hace responsable por los contenidos (publicaciones y/o datos) accesibles a través del Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).