A Sequential Optimisation Framework for Adaptive Model Predictive Control in Robotics

Descripción del Articulo

State-of-the-art control and robotics challenges have long been tackled using model-based control methods like model predictive control (MPC) and reinforcement learning (RL). These methods excel in complex dynamic domains, such as manipulation tasks, but struggle with real-world issues like wear-and...

Descripción completa

Detalles Bibliográficos
Autor: Guzmán Apaza, Rel
Formato: tesis doctoral
Fecha de Publicación:2023
Institución:Superintendencia Nacional de Educación Superior Universitaria
Repositorio:Registro Nacional de Trabajos conducentes a Grados y Títulos - RENATI
Lenguaje:inglés
OAI Identifier:oai:renati.sunedu.gob.pe:renati/9331
Enlace del recurso:https://renati.sunedu.gob.pe/handle/sunedu/3694610
https://hdl.handle.net/2123/32053
Nivel de acceso:acceso abierto
Materia:Robótica
Control predictivo
Control óptimo
Estadística bayesiana
Gauss, Modelo de
https://purl.org/pe-repo/ocde/ford#2.02.02
id RENATI_9d7a134203d7a78e32c7129871769036
oai_identifier_str oai:renati.sunedu.gob.pe:renati/9331
network_acronym_str RENATI
network_name_str Registro Nacional de Trabajos conducentes a Grados y Títulos - RENATI
repository_id_str
dc.title.es_PE.fl_str_mv A Sequential Optimisation Framework for Adaptive Model Predictive Control in Robotics
dc.title.alternative.es_PE.fl_str_mv Un marco de optimización secuencial para el control predictivo adaptativo en robótica
title A Sequential Optimisation Framework for Adaptive Model Predictive Control in Robotics
spellingShingle A Sequential Optimisation Framework for Adaptive Model Predictive Control in Robotics
Guzmán Apaza, Rel
Robótica
Control predictivo
Control óptimo
Estadística bayesiana
Gauss, Modelo de
https://purl.org/pe-repo/ocde/ford#2.02.02
title_short A Sequential Optimisation Framework for Adaptive Model Predictive Control in Robotics
title_full A Sequential Optimisation Framework for Adaptive Model Predictive Control in Robotics
title_fullStr A Sequential Optimisation Framework for Adaptive Model Predictive Control in Robotics
title_full_unstemmed A Sequential Optimisation Framework for Adaptive Model Predictive Control in Robotics
title_sort A Sequential Optimisation Framework for Adaptive Model Predictive Control in Robotics
author Guzmán Apaza, Rel
author_facet Guzmán Apaza, Rel
author_role author
dc.contributor.advisor.fl_str_mv Ramos, Fabio
Oliveira, Rafael
dc.contributor.author.fl_str_mv Guzmán Apaza, Rel
dc.subject.es_PE.fl_str_mv Robótica
Control predictivo
Control óptimo
Estadística bayesiana
Gauss, Modelo de
topic Robótica
Control predictivo
Control óptimo
Estadística bayesiana
Gauss, Modelo de
https://purl.org/pe-repo/ocde/ford#2.02.02
dc.subject.ocde.es_PE.fl_str_mv https://purl.org/pe-repo/ocde/ford#2.02.02
description State-of-the-art control and robotics challenges have long been tackled using model-based control methods like model predictive control (MPC) and reinforcement learning (RL). These methods excel in complex dynamic domains, such as manipulation tasks, but struggle with real-world issues like wear-and-tear, uncalibrated sensors, and misspecifications. These factors often perturb system dynamics, leading to the 'reality gap' problem when robots transition from simulations to real-world environments. This work aims to bridge this gap by combining RL and control in a learning framework that adapts MPC to robot decisions, optimizing performance despite uncertainties in dynamics model parameters. This thesis presents three key contributions to robotics control. The first is a novel reward-based framework for refining stochastic Model Predictive Control (MPC). It utilizes Bayesian Optimization (BO) for efficient data handling and heteroscedastic noise, linking controller hyperparameters with expected rewards through a Gaussian Process (GP). This approach demonstrates success in simulated control environments and robotic tasks. The second contribution addresses the 'reality gap' in robotics, enhancing controller performance in real-world dynamics. It builds on the first by developing an adaptive stochastic MPC that optimizes hyperparameters while estimating physical parameter distributions, employing a randomized dynamics model. This method is validated in both simulations and with robotic manipulators. Finally, the thesis proposes an innovative alternative to BO, merging it with supervised classification for a surrogate-based optimization technique. This method adeptly adjusts control hyperparameters in the face of model uncertainty and noise, optimizing complex functions through a binary classifier. Tested on simulated control problems and manipulators, it offers a promising solution to complex robotics and control challenges.
publishDate 2023
dc.date.accessioned.none.fl_str_mv 2024-07-02T20:35:19Z
dc.date.available.none.fl_str_mv 2024-07-02T20:35:19Z
dc.date.issued.fl_str_mv 2023-11
dc.type.es_PE.fl_str_mv info:eu-repo/semantics/doctoralThesis
format doctoralThesis
dc.identifier.uri.none.fl_str_mv https://renati.sunedu.gob.pe/handle/sunedu/3694610
https://hdl.handle.net/2123/32053
url https://renati.sunedu.gob.pe/handle/sunedu/3694610
https://hdl.handle.net/2123/32053
dc.language.iso.es_PE.fl_str_mv eng
language eng
dc.rights.es_PE.fl_str_mv info:eu-repo/semantics/openAccess
dc.rights.uri.es_PE.fl_str_mv https://creativecommons.org/licenses/by/4.0/deed.es
eu_rights_str_mv openAccess
rights_invalid_str_mv https://creativecommons.org/licenses/by/4.0/deed.es
dc.format.es_PE.fl_str_mv application/pdf
dc.publisher.es_PE.fl_str_mv University of Sydney
dc.publisher.country.es_PE.fl_str_mv AU
dc.source.es_PE.fl_str_mv Superintendencia Nacional de Educación Superior Universitaria - SUNEDU
dc.source.none.fl_str_mv reponame:Registro Nacional de Trabajos conducentes a Grados y Títulos - RENATI
instname:Superintendencia Nacional de Educación Superior Universitaria
instacron:SUNEDU
instname_str Superintendencia Nacional de Educación Superior Universitaria
instacron_str SUNEDU
institution SUNEDU
reponame_str Registro Nacional de Trabajos conducentes a Grados y Títulos - RENATI
collection Registro Nacional de Trabajos conducentes a Grados y Títulos - RENATI
dc.source.uri.es_PE.fl_str_mv Registro Nacional de Trabajos de Investigación - RENATI
bitstream.url.fl_str_mv https://renati.sunedu.gob.pe/bitstream/renati/9331/1/GuzmanApazaR.pdf
https://renati.sunedu.gob.pe/bitstream/renati/9331/2/Autorizacion.pdf
https://renati.sunedu.gob.pe/bitstream/renati/9331/3/license.txt
bitstream.checksum.fl_str_mv ef9068d20d4f6e6b001620d6d9d8d8ee
b0b8f363f32388ea2e16c0e4641b2178
b39fb1e1cb23db8e93fd74de238cfcd9
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
repository.name.fl_str_mv Registro Nacional de Trabajos de Investigación
repository.mail.fl_str_mv renati@sunedu.gob.pe
_version_ 1816177360303554560
spelling Ramos, FabioOliveira, RafaelGuzmán Apaza, Rel2024-07-02T20:35:19Z2024-07-02T20:35:19Z2023-11https://renati.sunedu.gob.pe/handle/sunedu/3694610https://hdl.handle.net/2123/32053State-of-the-art control and robotics challenges have long been tackled using model-based control methods like model predictive control (MPC) and reinforcement learning (RL). These methods excel in complex dynamic domains, such as manipulation tasks, but struggle with real-world issues like wear-and-tear, uncalibrated sensors, and misspecifications. These factors often perturb system dynamics, leading to the 'reality gap' problem when robots transition from simulations to real-world environments. This work aims to bridge this gap by combining RL and control in a learning framework that adapts MPC to robot decisions, optimizing performance despite uncertainties in dynamics model parameters. This thesis presents three key contributions to robotics control. The first is a novel reward-based framework for refining stochastic Model Predictive Control (MPC). It utilizes Bayesian Optimization (BO) for efficient data handling and heteroscedastic noise, linking controller hyperparameters with expected rewards through a Gaussian Process (GP). This approach demonstrates success in simulated control environments and robotic tasks. The second contribution addresses the 'reality gap' in robotics, enhancing controller performance in real-world dynamics. It builds on the first by developing an adaptive stochastic MPC that optimizes hyperparameters while estimating physical parameter distributions, employing a randomized dynamics model. This method is validated in both simulations and with robotic manipulators. Finally, the thesis proposes an innovative alternative to BO, merging it with supervised classification for a surrogate-based optimization technique. This method adeptly adjusts control hyperparameters in the face of model uncertainty and noise, optimizing complex functions through a binary classifier. Tested on simulated control problems and manipulators, it offers a promising solution to complex robotics and control challenges.Este trabajo aborda los desafíos en control y robótica, utilizando métodos de control basados en modelos como el Control Predictivo Modelo (MPC) y el Aprendizaje por Refuerzo (RL) para optimizar el rendimiento de robots en entornos dinámicos complejos. A pesar de su eficacia, la transferencia de controladores aprendidos de simuladores a entornos reales a menudo se ve obstaculizada por el desgaste, sensores no calibrados y otros factores que distorsionan la dinámica del sistema, lo que conduce a lo que se conoce como la brecha de la realidad. Este trabajo introduce un marco de aprendizaje que utiliza RL y control para ajustar MPC estocástico, abordando la brecha de la realidad y optimizando secuencialmente el rendimiento del robot mediante la adaptación del MPC a las decisiones del robot y el aprendizaje bajo incertidumbre en los parámetros del modelo de dinámica. El enfoque principal incluye un marco basado en recompensas para la optimización eficiente de datos del MPC usando optimización Bayesiana (BO) y modelos de Procesos Gaussianos (GP) para mapear hiperparámetros del controlador a recompensas acumulativas esperadas. Además, se extiende este marco para permitir una optimización de MPC estocástica adaptativa que ajusta los hiperparámetros mientras estima probabilidades de parámetros físicos mediante modelos de dinámica aleatorizados. Finalmente, se exploran las limitaciones de BO y se proponen métodos de optimización alternativos basados en clasificación supervisada para ajustar los hiperparámetros del control en presencia de incertidumbre y ruido heterocedástico, optimizando funciones costosas y mejorando la precisión del control en simulaciones y tareas robóticas.Perú. Programa Nacional de Becas y Crédito Educativo (Pronabec). Beca Generación del BicentenarioTesis doctoralapplication/pdfengUniversity of SydneyAUinfo:eu-repo/semantics/openAccesshttps://creativecommons.org/licenses/by/4.0/deed.esSuperintendencia Nacional de Educación Superior Universitaria - SUNEDURegistro Nacional de Trabajos de Investigación - RENATIreponame:Registro Nacional de Trabajos conducentes a Grados y Títulos - RENATIinstname:Superintendencia Nacional de Educación Superior Universitariainstacron:SUNEDURobóticaControl predictivoControl óptimoEstadística bayesianaGauss, Modelo dehttps://purl.org/pe-repo/ocde/ford#2.02.02A Sequential Optimisation Framework for Adaptive Model Predictive Control in RoboticsUn marco de optimización secuencial para el control predictivo adaptativo en robóticainfo:eu-repo/semantics/doctoralThesisUniversity of Sydney. Faculty of EngineeringCiencias de la ComputaciónDoctor en Filosofíahttp://purl.org/pe-repo/renati/level#doctorhttps://orcid.org/0000-0002-6094-761346822759http://purl.org/pe-repo/renati/type#tesisORIGINALGuzmanApazaR.pdfGuzmanApazaR.pdfTesisapplication/pdf2372748https://renati.sunedu.gob.pe/bitstream/renati/9331/1/GuzmanApazaR.pdfef9068d20d4f6e6b001620d6d9d8d8eeMD51Autorizacion.pdfAutorizacion.pdfAutorización del registroapplication/pdf1007875https://renati.sunedu.gob.pe/bitstream/renati/9331/2/Autorizacion.pdfb0b8f363f32388ea2e16c0e4641b2178MD52LICENSElicense.txtlicense.txttext/plain; charset=utf-8956https://renati.sunedu.gob.pe/bitstream/renati/9331/3/license.txtb39fb1e1cb23db8e93fd74de238cfcd9MD53renati/9331oai:renati.sunedu.gob.pe:renati/93312024-07-02 15:38:42.695Registro Nacional de Trabajos de Investigaciónrenati@sunedu.gob.peTGFzIHVuaXZlcnNpZGFkZXMsIGluc3RpdHVjaW9uZXMgeSBlc2N1ZWxhcyBkZSBlZHVjYWNpw7NuIHN1cGVyaW9yIHRpZW5lbiBjb21vIG9ibGlnYWNpw7NuIHJlZ2lzdHJhciB5IGRlcG9zaXRhciB0b2RvcyBsb3MgdHJhYmFqb3MgY29uZHVjZW50ZXMgYSBncmFkb3MgeSB0w610dWxvcywgaW5jbHV5ZW5kbyBsb3MgbWV0YWRhdG9zIGVuIHN1cyByZXBvc2l0b3Jpb3MgaW5zdGl0dWNpb25hbGVzIHByZWNpc2FuZG8gc2kgc29uIGRlIGFjY2VzbyBhYmllcnRvLCByZXN0cmluZ2lkbywgY29uIHVuIHBlcmlvZG8gZGUgZW1iYXJnbyBvIGNvbiBsYSBjb25kaWNpw7NuIGNlcnJhZGEsIGNvbnRhbmRvIGNvbiBsYSBhdXRvcml6YWNpw7NuIHByZXZpYSB5IHBvciBlc2NyaXRvIGRlIGxvcyBhdXRvcmVzIGRlIGxvcyB0cmFiYWpvcyBwYXJhIHN1IHJlcHJvZHVjY2nDs24geSBjb211bmljYWNpw7NuIHDDumJsaWNhIGNvbiBlbCBmaW4gZGUgZXZpdGFyIGN1YWxxdWllciBhZmVjdGFjacOzbiBhIGxvcyBkZXJlY2hvcyBkZSBhdXRvciBjb25mb3JtZSBlbCBtYXJjbyBub3JtYXRpdm8gdmlnZW50ZSAoUmVnbGFtZW50byBSRU5BVEksIGFydMOtY3VsbyAxMi4yKS4gCgpMYXMgcGVyc29uYXMgbmF0dXJhbGVzIHF1ZSBzb2xpY2l0YW4gZWwgcmVjb25vY2ltaWVudG8gZGUgdW4gZ3JhZG8geS9vIHTDrXR1bG8gb3RvcmdhZG8gZW4gZWwgZXh0cmFuamVybyBwdWVkZW4gcmVxdWVyaXIgZWwgYWxvamFtaWVudG8gZGVsIHRyYWJham8gcG9yIGVsIGN1YWwgb2J0dXZpZXJvbiBkaWNobyBncmFkbyBvIHTDrXR1bG8gZW4gZWwgUmVjb2xlY3RvciBEaWdpdGFsIFJFTkFUSSwgZW4gY2FzbyBzZSBoYXlhIHV0aWxpemFkbyBlc3RhIG1vZGFsaWRhZCBwYXJhIGxhIG9idGVuY2nDs24gZGVsIGdyYWRvIG8gdMOtdHVsbyByZXNwZWN0aXZvIChSZWdsYW1lbnRvIFJFTkFUSSwgYXJ0w61jdWxvIDE5LjEpLgo=
score 13.924177
Nota importante:
La información contenida en este registro es de entera responsabilidad de la institución que gestiona el repositorio institucional donde esta contenido este documento o set de datos. El CONCYTEC no se hace responsable por los contenidos (publicaciones y/o datos) accesibles a través del Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).