Efficient algorithms for convolutional dictionary learning via accelerated proximal gradient

Descripción del Articulo

Convolutional sparse representations and convolutional dictionary learning are mathematical models that consist in representing a whole signal or image as a sum of convolutions between dictionary filters and coefficient maps. Unlike the patch-based counterparts, these convolutional forms are receivi...

Descripción completa

Detalles Bibliográficos
Autor: Silva Obregón, Gustavo Manuel
Formato: tesis de maestría
Fecha de Publicación:2019
Institución:Pontificia Universidad Católica del Perú
Repositorio:PUCP-Institucional
Lenguaje:inglés
OAI Identifier:oai:repositorio.pucp.edu.pe:20.500.14657/146518
Enlace del recurso:http://hdl.handle.net/20.500.12404/13903
Nivel de acceso:acceso abierto
Materia:Visión por computadoras
Aprendizaje automático (Inteligencia artificial)
https://purl.org/pe-repo/ocde/ford#2.02.05
id RPUC_1d67faee16a8ab46c532c79fc6163113
oai_identifier_str oai:repositorio.pucp.edu.pe:20.500.14657/146518
network_acronym_str RPUC
network_name_str PUCP-Institucional
repository_id_str 2905
spelling Rodríguez Valderrama, Paul AntonioSilva Obregón, Gustavo Manuel2019-04-06T00:44:28Z2019-04-06T00:44:28Z2019-04-06T00:44:28Z20192019-04-05http://hdl.handle.net/20.500.12404/13903Convolutional sparse representations and convolutional dictionary learning are mathematical models that consist in representing a whole signal or image as a sum of convolutions between dictionary filters and coefficient maps. Unlike the patch-based counterparts, these convolutional forms are receiving an increase attention in multiple image processing tasks, since they do not present the usual patchwise drawbacks such as redundancy, multi-evaluations and non-translational invariant. Particularly, the convolutional dictionary learning (CDL) problem is addressed as an alternating minimization between coefficient update and dictionary update stages. A wide number of different algorithms based on FISTA (Fast Iterative Shrinkage-Thresholding Algorithm), ADMM (Alternating Direction Method of Multipliers) and ADMM consensus frameworks have been proposed to efficiently solve the most expensive steps of the CDL problem in the frequency domain. However, the use of the existing methods on large sets of images is computationally restricted by the dictionary update stage. The present thesis report is strategically organized in three parts. On the first part, we introduce the general topic of the CDL problem and the state-of-the-art methods used to deal with each stage. On the second part, we propose our first computationally efficient method to solve the entire CDL problem using the Accelerated Proximal Gradient (APG) framework in both updates. Additionally, a novel update model reminiscent of the Block Gauss-Seidel (BGS) method is incorporated to reduce the number of estimated components during the coefficient update. On the final part, we propose another alternative method to address the dictionary update stage based on APG consensus approach. This last method considers particular strategies of theADMMconsensus and our first APG framework to develop a less complex solution decoupled across the training images. In general, due to the lower number of operations, our first approach is a better serial option while our last approach has as advantage its independent and highly parallelizable structure. Finally, in our first set of experimental results, which is composed of serial implementations, we show that our first APG approach provides significant speedup with respect to the standard methods by a factor of 1:6 5:3. A complementary improvement by a factor of 2 is achieved by using the reminiscent BGS model. On the other hand, we also report that the second APG approach is the fastest method compared to the state-of-the-art consensus algorithm implemented in serial and parallel. Both proposed methods maintain comparable performance as the other ones in terms of reconstruction metrics, such as PSNR, SSIM and sparsity, in denoising and inpainting tasks.TesisengPontificia Universidad Católica del PerúPEinfo:eu-repo/semantics/openAccesshttp://creativecommons.org/licenses/by/2.5/pe/Visión por computadorasAprendizaje automático (Inteligencia artificial)https://purl.org/pe-repo/ocde/ford#2.02.05Efficient algorithms for convolutional dictionary learning via accelerated proximal gradientinfo:eu-repo/semantics/masterThesisTesis de maestríareponame:PUCP-Institucionalinstname:Pontificia Universidad Católica del Perúinstacron:PUCPMaestro en Procesamiento de Señales e Imágenes Digitales.MaestríaPontificia Universidad Católica del Perú. Escuela de PosgradoProcesamiento de Señales e Imágenes Digitales07754238613077https://purl.org/pe-repo/renati/level#maestrohttp://purl.org/pe-repo/renati/type#tesis20.500.14657/146518oai:repositorio.pucp.edu.pe:20.500.14657/1465182024-06-10 10:10:41.771http://creativecommons.org/licenses/by/2.5/pe/info:eu-repo/semantics/openAccessmetadata.onlyhttps://repositorio.pucp.edu.peRepositorio Institucional de la PUCPrepositorio@pucp.pe
dc.title.es_ES.fl_str_mv Efficient algorithms for convolutional dictionary learning via accelerated proximal gradient
title Efficient algorithms for convolutional dictionary learning via accelerated proximal gradient
spellingShingle Efficient algorithms for convolutional dictionary learning via accelerated proximal gradient
Silva Obregón, Gustavo Manuel
Visión por computadoras
Aprendizaje automático (Inteligencia artificial)
https://purl.org/pe-repo/ocde/ford#2.02.05
title_short Efficient algorithms for convolutional dictionary learning via accelerated proximal gradient
title_full Efficient algorithms for convolutional dictionary learning via accelerated proximal gradient
title_fullStr Efficient algorithms for convolutional dictionary learning via accelerated proximal gradient
title_full_unstemmed Efficient algorithms for convolutional dictionary learning via accelerated proximal gradient
title_sort Efficient algorithms for convolutional dictionary learning via accelerated proximal gradient
author Silva Obregón, Gustavo Manuel
author_facet Silva Obregón, Gustavo Manuel
author_role author
dc.contributor.advisor.fl_str_mv Rodríguez Valderrama, Paul Antonio
dc.contributor.author.fl_str_mv Silva Obregón, Gustavo Manuel
dc.subject.es_ES.fl_str_mv Visión por computadoras
Aprendizaje automático (Inteligencia artificial)
topic Visión por computadoras
Aprendizaje automático (Inteligencia artificial)
https://purl.org/pe-repo/ocde/ford#2.02.05
dc.subject.ocde.es_ES.fl_str_mv https://purl.org/pe-repo/ocde/ford#2.02.05
description Convolutional sparse representations and convolutional dictionary learning are mathematical models that consist in representing a whole signal or image as a sum of convolutions between dictionary filters and coefficient maps. Unlike the patch-based counterparts, these convolutional forms are receiving an increase attention in multiple image processing tasks, since they do not present the usual patchwise drawbacks such as redundancy, multi-evaluations and non-translational invariant. Particularly, the convolutional dictionary learning (CDL) problem is addressed as an alternating minimization between coefficient update and dictionary update stages. A wide number of different algorithms based on FISTA (Fast Iterative Shrinkage-Thresholding Algorithm), ADMM (Alternating Direction Method of Multipliers) and ADMM consensus frameworks have been proposed to efficiently solve the most expensive steps of the CDL problem in the frequency domain. However, the use of the existing methods on large sets of images is computationally restricted by the dictionary update stage. The present thesis report is strategically organized in three parts. On the first part, we introduce the general topic of the CDL problem and the state-of-the-art methods used to deal with each stage. On the second part, we propose our first computationally efficient method to solve the entire CDL problem using the Accelerated Proximal Gradient (APG) framework in both updates. Additionally, a novel update model reminiscent of the Block Gauss-Seidel (BGS) method is incorporated to reduce the number of estimated components during the coefficient update. On the final part, we propose another alternative method to address the dictionary update stage based on APG consensus approach. This last method considers particular strategies of theADMMconsensus and our first APG framework to develop a less complex solution decoupled across the training images. In general, due to the lower number of operations, our first approach is a better serial option while our last approach has as advantage its independent and highly parallelizable structure. Finally, in our first set of experimental results, which is composed of serial implementations, we show that our first APG approach provides significant speedup with respect to the standard methods by a factor of 1:6 5:3. A complementary improvement by a factor of 2 is achieved by using the reminiscent BGS model. On the other hand, we also report that the second APG approach is the fastest method compared to the state-of-the-art consensus algorithm implemented in serial and parallel. Both proposed methods maintain comparable performance as the other ones in terms of reconstruction metrics, such as PSNR, SSIM and sparsity, in denoising and inpainting tasks.
publishDate 2019
dc.date.accessioned.es_ES.fl_str_mv 2019-04-06T00:44:28Z
dc.date.available.none.fl_str_mv 2019-04-06T00:44:28Z
dc.date.available.es_ES.fl_str_mv 2019-04-06T00:44:28Z
dc.date.created.es_ES.fl_str_mv 2019
dc.date.issued.fl_str_mv 2019-04-05
dc.type.es_ES.fl_str_mv info:eu-repo/semantics/masterThesis
dc.type.other.none.fl_str_mv Tesis de maestría
format masterThesis
dc.identifier.uri.none.fl_str_mv http://hdl.handle.net/20.500.12404/13903
url http://hdl.handle.net/20.500.12404/13903
dc.language.iso.es_ES.fl_str_mv eng
language eng
dc.rights.es_ES.fl_str_mv info:eu-repo/semantics/openAccess
dc.rights.uri.*.fl_str_mv http://creativecommons.org/licenses/by/2.5/pe/
eu_rights_str_mv openAccess
rights_invalid_str_mv http://creativecommons.org/licenses/by/2.5/pe/
dc.publisher.es_ES.fl_str_mv Pontificia Universidad Católica del Perú
dc.publisher.country.es_ES.fl_str_mv PE
dc.source.none.fl_str_mv reponame:PUCP-Institucional
instname:Pontificia Universidad Católica del Perú
instacron:PUCP
instname_str Pontificia Universidad Católica del Perú
instacron_str PUCP
institution PUCP
reponame_str PUCP-Institucional
collection PUCP-Institucional
repository.name.fl_str_mv Repositorio Institucional de la PUCP
repository.mail.fl_str_mv repositorio@pucp.pe
_version_ 1835638821559992320
score 13.888049
Nota importante:
La información contenida en este registro es de entera responsabilidad de la institución que gestiona el repositorio institucional donde esta contenido este documento o set de datos. El CONCYTEC no se hace responsable por los contenidos (publicaciones y/o datos) accesibles a través del Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).