Quantum exordium for natural language processing: a novel approach to sample on decoders
Descripción del Articulo
The sampling task of Seq2Seq models in Natural Language Processing (NLP) is based on heuristics because of the Non-Deterministic Polynomial Time (NP) nature of this problem. The goal of this research is to develop a quantum sampler for Seq2Seq models, and give evidence that Quantum Annealing (QA) ca...
| Autor: | |
|---|---|
| Formato: | tesis de grado |
| Fecha de Publicación: | 2021 |
| Institución: | Universidad Católica San Pablo |
| Repositorio: | UCSP-Institucional |
| Lenguaje: | inglés |
| OAI Identifier: | oai:repositorio.ucsp.edu.pe:20.500.12590/16617 |
| Enlace del recurso: | https://hdl.handle.net/20.500.12590/16617 |
| Nivel de acceso: | acceso abierto |
| Materia: | Quantum Annealing ISING Model Sampling Natural Language Processing Seq2Seq https://purl.org/pe-repo/ocde/ford#1.02.01 |
| id |
UCSP_9699c99966e183a4434b632800906ff3 |
|---|---|
| oai_identifier_str |
oai:repositorio.ucsp.edu.pe:20.500.12590/16617 |
| network_acronym_str |
UCSP |
| network_name_str |
UCSP-Institucional |
| repository_id_str |
3854 |
| dc.title.es_PE.fl_str_mv |
Quantum exordium for natural language processing: a novel approach to sample on decoders |
| title |
Quantum exordium for natural language processing: a novel approach to sample on decoders |
| spellingShingle |
Quantum exordium for natural language processing: a novel approach to sample on decoders Muroya Lei, Stefanie Quantum Annealing ISING Model Sampling Natural Language Processing Seq2Seq https://purl.org/pe-repo/ocde/ford#1.02.01 |
| title_short |
Quantum exordium for natural language processing: a novel approach to sample on decoders |
| title_full |
Quantum exordium for natural language processing: a novel approach to sample on decoders |
| title_fullStr |
Quantum exordium for natural language processing: a novel approach to sample on decoders |
| title_full_unstemmed |
Quantum exordium for natural language processing: a novel approach to sample on decoders |
| title_sort |
Quantum exordium for natural language processing: a novel approach to sample on decoders |
| author |
Muroya Lei, Stefanie |
| author_facet |
Muroya Lei, Stefanie |
| author_role |
author |
| dc.contributor.advisor.fl_str_mv |
Ochoa Luna, Jose Eduardo |
| dc.contributor.author.fl_str_mv |
Muroya Lei, Stefanie |
| dc.subject.es_PE.fl_str_mv |
Quantum Annealing ISING Model Sampling Natural Language Processing Seq2Seq |
| topic |
Quantum Annealing ISING Model Sampling Natural Language Processing Seq2Seq https://purl.org/pe-repo/ocde/ford#1.02.01 |
| dc.subject.ocde.es_PE.fl_str_mv |
https://purl.org/pe-repo/ocde/ford#1.02.01 |
| description |
The sampling task of Seq2Seq models in Natural Language Processing (NLP) is based on heuristics because of the Non-Deterministic Polynomial Time (NP) nature of this problem. The goal of this research is to develop a quantum sampler for Seq2Seq models, and give evidence that Quantum Annealing (QA) can guide the search space of these samplers. The contribution of this work is given by showing an architecture to represent Recurrent Neural Networks (RNN) in a quantum computer to finally develop a quantum sampler. The individual architectures (i.e. summation, multiplication, argmax, and activation functions) achieve optimal accuracies in both simulated and quantum environments. While the results of the overall proposal show that it can either outperform or match greedy approaches. As the very first steps of quantum NLP, these are tested against simple RNN with a synthetic data set of random numbers, and a real quantum computer is utilized. Since affine functions are the basis of most Artificial Intelligence (AI) models, this method can be applied to more complex architectures in the future. |
| publishDate |
2021 |
| dc.date.accessioned.none.fl_str_mv |
2021-02-23T23:44:38Z |
| dc.date.available.none.fl_str_mv |
2021-02-23T23:44:38Z |
| dc.date.issued.fl_str_mv |
2021 |
| dc.type.none.fl_str_mv |
info:eu-repo/semantics/bachelorThesis |
| dc.type.version.es_PE.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
| format |
bachelorThesis |
| status_str |
publishedVersion |
| dc.identifier.other.none.fl_str_mv |
1073022 |
| dc.identifier.uri.none.fl_str_mv |
https://hdl.handle.net/20.500.12590/16617 |
| identifier_str_mv |
1073022 |
| url |
https://hdl.handle.net/20.500.12590/16617 |
| dc.language.iso.es_PE.fl_str_mv |
eng |
| language |
eng |
| dc.relation.ispartof.fl_str_mv |
SUNEDU |
| dc.rights.es_PE.fl_str_mv |
info:eu-repo/semantics/openAccess |
| dc.rights.uri.es_PE.fl_str_mv |
https://creativecommons.org/licenses/by/4.0/ |
| eu_rights_str_mv |
openAccess |
| rights_invalid_str_mv |
https://creativecommons.org/licenses/by/4.0/ |
| dc.format.es_PE.fl_str_mv |
application/pdf |
| dc.publisher.es_PE.fl_str_mv |
Universidad Católica San Pablo |
| dc.publisher.country.es_PE.fl_str_mv |
PE |
| dc.source.es_PE.fl_str_mv |
Universidad Católica San Pablo Repositorio Institucional - UCSP |
| dc.source.none.fl_str_mv |
reponame:UCSP-Institucional instname:Universidad Católica San Pablo instacron:UCSP |
| instname_str |
Universidad Católica San Pablo |
| instacron_str |
UCSP |
| institution |
UCSP |
| reponame_str |
UCSP-Institucional |
| collection |
UCSP-Institucional |
| bitstream.url.fl_str_mv |
https://repositorio.ucsp.edu.pe/backend/api/core/bitstreams/37163235-fa01-47cc-b7c9-7dd216a6a048/download https://repositorio.ucsp.edu.pe/backend/api/core/bitstreams/91426b2b-cb00-4541-85b1-de9248e8b3d1/download https://repositorio.ucsp.edu.pe/backend/api/core/bitstreams/8ce1d559-e762-4422-b909-8be9d7b3c993/download https://repositorio.ucsp.edu.pe/backend/api/core/bitstreams/8fe74d97-2fef-4def-b056-140feee0fc61/download |
| bitstream.checksum.fl_str_mv |
37fd37812b28ab4b8ce1bb132387cb38 8a4605be74aa9ea9d79846c1fba20a33 21d4128416d93029988eda4fdd880868 168b4ed8b7071bab88179cf4caf5b363 |
| bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 MD5 MD5 |
| repository.name.fl_str_mv |
Repositorio Institucional de la Universidad Católica San Pablo |
| repository.mail.fl_str_mv |
dspace@ucsp.edu.pe |
| _version_ |
1851053041960091648 |
| spelling |
Ochoa Luna, Jose EduardoMuroya Lei, Stefanie2021-02-23T23:44:38Z2021-02-23T23:44:38Z20211073022https://hdl.handle.net/20.500.12590/16617The sampling task of Seq2Seq models in Natural Language Processing (NLP) is based on heuristics because of the Non-Deterministic Polynomial Time (NP) nature of this problem. The goal of this research is to develop a quantum sampler for Seq2Seq models, and give evidence that Quantum Annealing (QA) can guide the search space of these samplers. The contribution of this work is given by showing an architecture to represent Recurrent Neural Networks (RNN) in a quantum computer to finally develop a quantum sampler. The individual architectures (i.e. summation, multiplication, argmax, and activation functions) achieve optimal accuracies in both simulated and quantum environments. While the results of the overall proposal show that it can either outperform or match greedy approaches. As the very first steps of quantum NLP, these are tested against simple RNN with a synthetic data set of random numbers, and a real quantum computer is utilized. Since affine functions are the basis of most Artificial Intelligence (AI) models, this method can be applied to more complex architectures in the future. Trabajo de investigaciónapplication/pdfengUniversidad Católica San PabloPEinfo:eu-repo/semantics/openAccesshttps://creativecommons.org/licenses/by/4.0/Universidad Católica San PabloRepositorio Institucional - UCSPreponame:UCSP-Institucionalinstname:Universidad Católica San Pabloinstacron:UCSPQuantum AnnealingISING ModelSamplingNatural Language ProcessingSeq2Seqhttps://purl.org/pe-repo/ocde/ford#1.02.01Quantum exordium for natural language processing: a novel approach to sample on decodersinfo:eu-repo/semantics/bachelorThesisinfo:eu-repo/semantics/publishedVersionSUNEDUBachiller en Ciencia de la ComputaciónUniversidad Católica San Pablo. Departamento de Ciencia de la ComputaciónBachillerCiencia de la ComputaciónPrograma Profesional de Ciencia de la Computación76923844https://orcid.org/0000-0002-8979-378529738760https://purl.org/pe-repo/renati/type#trabajoDeInvestigacionhttps://purl.org/pe-repo/renati/level#bachiller611016Alex Jesús Cuadros VargasJuan Carlos Gutiérrez CáceresORIGINALMUROYA_LEI_STE_QUA.pdfMUROYA_LEI_STE_QUA.pdfapplication/pdf3473862https://repositorio.ucsp.edu.pe/backend/api/core/bitstreams/37163235-fa01-47cc-b7c9-7dd216a6a048/download37fd37812b28ab4b8ce1bb132387cb38MD51LICENSElicense.txtlicense.txttext/plain; charset=utf-81748https://repositorio.ucsp.edu.pe/backend/api/core/bitstreams/91426b2b-cb00-4541-85b1-de9248e8b3d1/download8a4605be74aa9ea9d79846c1fba20a33MD52TEXTMUROYA_LEI_STE_QUA.pdf.txtMUROYA_LEI_STE_QUA.pdf.txtExtracted texttext/plain99586https://repositorio.ucsp.edu.pe/backend/api/core/bitstreams/8ce1d559-e762-4422-b909-8be9d7b3c993/download21d4128416d93029988eda4fdd880868MD53THUMBNAILMUROYA_LEI_STE_QUA.pdf.jpgMUROYA_LEI_STE_QUA.pdf.jpgGenerated Thumbnailimage/jpeg4473https://repositorio.ucsp.edu.pe/backend/api/core/bitstreams/8fe74d97-2fef-4def-b056-140feee0fc61/download168b4ed8b7071bab88179cf4caf5b363MD5420.500.12590/16617oai:repositorio.ucsp.edu.pe:20.500.12590/166172023-10-31 12:15:49.779https://creativecommons.org/licenses/by/4.0/info:eu-repo/semantics/openAccessopen.accesshttps://repositorio.ucsp.edu.peRepositorio Institucional de la Universidad Católica San Pablodspace@ucsp.edu.peTk9URTogUExBQ0UgWU9VUiBPV04gTElDRU5TRSBIRVJFClRoaXMgc2FtcGxlIGxpY2Vuc2UgaXMgcHJvdmlkZWQgZm9yIGluZm9ybWF0aW9uYWwgcHVycG9zZXMgb25seS4KCk5PTi1FWENMVVNJVkUgRElTVFJJQlVUSU9OIExJQ0VOU0UKCkJ5IHNpZ25pbmcgYW5kIHN1Ym1pdHRpbmcgdGhpcyBsaWNlbnNlLCB5b3UgKHRoZSBhdXRob3Iocykgb3IgY29weXJpZ2h0Cm93bmVyKSBncmFudHMgdG8gRFNwYWNlIFVuaXZlcnNpdHkgKERTVSkgdGhlIG5vbi1leGNsdXNpdmUgcmlnaHQgdG8gcmVwcm9kdWNlLAp0cmFuc2xhdGUgKGFzIGRlZmluZWQgYmVsb3cpLCBhbmQvb3IgZGlzdHJpYnV0ZSB5b3VyIHN1Ym1pc3Npb24gKGluY2x1ZGluZwp0aGUgYWJzdHJhY3QpIHdvcmxkd2lkZSBpbiBwcmludCBhbmQgZWxlY3Ryb25pYyBmb3JtYXQgYW5kIGluIGFueSBtZWRpdW0sCmluY2x1ZGluZyBidXQgbm90IGxpbWl0ZWQgdG8gYXVkaW8gb3IgdmlkZW8uCgpZb3UgYWdyZWUgdGhhdCBEU1UgbWF5LCB3aXRob3V0IGNoYW5naW5nIHRoZSBjb250ZW50LCB0cmFuc2xhdGUgdGhlCnN1Ym1pc3Npb24gdG8gYW55IG1lZGl1bSBvciBmb3JtYXQgZm9yIHRoZSBwdXJwb3NlIG9mIHByZXNlcnZhdGlvbi4KCllvdSBhbHNvIGFncmVlIHRoYXQgRFNVIG1heSBrZWVwIG1vcmUgdGhhbiBvbmUgY29weSBvZiB0aGlzIHN1Ym1pc3Npb24gZm9yCnB1cnBvc2VzIG9mIHNlY3VyaXR5LCBiYWNrLXVwIGFuZCBwcmVzZXJ2YXRpb24uCgpZb3UgcmVwcmVzZW50IHRoYXQgdGhlIHN1Ym1pc3Npb24gaXMgeW91ciBvcmlnaW5hbCB3b3JrLCBhbmQgdGhhdCB5b3UgaGF2ZQp0aGUgcmlnaHQgdG8gZ3JhbnQgdGhlIHJpZ2h0cyBjb250YWluZWQgaW4gdGhpcyBsaWNlbnNlLiBZb3UgYWxzbyByZXByZXNlbnQKdGhhdCB5b3VyIHN1Ym1pc3Npb24gZG9lcyBub3QsIHRvIHRoZSBiZXN0IG9mIHlvdXIga25vd2xlZGdlLCBpbmZyaW5nZSB1cG9uCmFueW9uZSdzIGNvcHlyaWdodC4KCklmIHRoZSBzdWJtaXNzaW9uIGNvbnRhaW5zIG1hdGVyaWFsIGZvciB3aGljaCB5b3UgZG8gbm90IGhvbGQgY29weXJpZ2h0LAp5b3UgcmVwcmVzZW50IHRoYXQgeW91IGhhdmUgb2J0YWluZWQgdGhlIHVucmVzdHJpY3RlZCBwZXJtaXNzaW9uIG9mIHRoZQpjb3B5cmlnaHQgb3duZXIgdG8gZ3JhbnQgRFNVIHRoZSByaWdodHMgcmVxdWlyZWQgYnkgdGhpcyBsaWNlbnNlLCBhbmQgdGhhdApzdWNoIHRoaXJkLXBhcnR5IG93bmVkIG1hdGVyaWFsIGlzIGNsZWFybHkgaWRlbnRpZmllZCBhbmQgYWNrbm93bGVkZ2VkCndpdGhpbiB0aGUgdGV4dCBvciBjb250ZW50IG9mIHRoZSBzdWJtaXNzaW9uLgoKSUYgVEhFIFNVQk1JU1NJT04gSVMgQkFTRUQgVVBPTiBXT1JLIFRIQVQgSEFTIEJFRU4gU1BPTlNPUkVEIE9SIFNVUFBPUlRFRApCWSBBTiBBR0VOQ1kgT1IgT1JHQU5JWkFUSU9OIE9USEVSIFRIQU4gRFNVLCBZT1UgUkVQUkVTRU5UIFRIQVQgWU9VIEhBVkUKRlVMRklMTEVEIEFOWSBSSUdIVCBPRiBSRVZJRVcgT1IgT1RIRVIgT0JMSUdBVElPTlMgUkVRVUlSRUQgQlkgU1VDSApDT05UUkFDVCBPUiBBR1JFRU1FTlQuCgpEU1Ugd2lsbCBjbGVhcmx5IGlkZW50aWZ5IHlvdXIgbmFtZShzKSBhcyB0aGUgYXV0aG9yKHMpIG9yIG93bmVyKHMpIG9mIHRoZQpzdWJtaXNzaW9uLCBhbmQgd2lsbCBub3QgbWFrZSBhbnkgYWx0ZXJhdGlvbiwgb3RoZXIgdGhhbiBhcyBhbGxvd2VkIGJ5IHRoaXMKbGljZW5zZSwgdG8geW91ciBzdWJtaXNzaW9uLgo= |
| score |
13.43108 |
Nota importante:
La información contenida en este registro es de entera responsabilidad de la institución que gestiona el repositorio institucional donde esta contenido este documento o set de datos. El CONCYTEC no se hace responsable por los contenidos (publicaciones y/o datos) accesibles a través del Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).
La información contenida en este registro es de entera responsabilidad de la institución que gestiona el repositorio institucional donde esta contenido este documento o set de datos. El CONCYTEC no se hace responsable por los contenidos (publicaciones y/o datos) accesibles a través del Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).