Sistema de planeación de trayectorias basado en aprendizaje por refuerzo para vehículo 4WS con Lidar2D.

Salazar Chavarro, Robinson

Sistema de planeación de trayectorias basado en aprendizaje por refuerzo para vehículo 4WS con Lidar2D.

dc.contributor.advisor	Garcia Bedoya, Olmer
dc.creator	Salazar Chavarro, Robinson
dc.date.accessioned	2024-06-05T16:38:14Z
dc.date.available	2024-06-05T16:38:14Z
dc.date.created	2022-09-22
dc.description.abstract	El objetivo de este trabajo fue analizar un sistema de inteligencia artificial para el entrenamiento de un agente y simulación con diferentes modelos de aprendizaje por refuerzo en el ambiente para identificar cuáles son las características de aprendizaje por cada uno de los algoritmos con los diferentes métodos. Los agentes son los responsables de enviar acciones al ambiente y el ambiente entrega un estado con una recompensa dependiendo del objetivo que quiere alcanzar el agente. Después de hacer la evaluación de los modelos con una infraestructura definida para el desarrollo del proyecto, se evaluará la eficiencia del modelo, tiempo de entrenamiento y optimización de los recursos físicos, con todo lo anterior se espera entregar la evaluación de los modelos en un ambiente simulado y características de cada uno para identificar cual es la eficiencia con los recursos de infraestructura y la optimización de las respuestas obtenidas por cada uno de los modelos.	spa
dc.description.abstractenglish	The objective of this work was to analyze an artificial intelligence system for training an agent and simulating different reinforcement learning models in the environment. The goal was to identify the learning characteristics of each algorithm using various methods. Agents are responsible for sending actions to the environment, which then provides a state and a reward based on the agent’s objective. After evaluating the models using a predefined infrastructure for the project, we will assess the model’s efficiency, training time, and optimization of physical resources. With all this information, we aim to deliver an evaluation of the models in a simulated environment, highlighting the characteristics of each one to determine their efficiency with infrastructure resources and the optimization of responses obtained by each model.	spa
dc.format.extent	54 páginas	spa
dc.format.mimetype	application/pdf	spa
dc.identifier.uri	https://hdl.handle.net/20.500.12010/34506
dc.language.iso	spa	spa
dc.relation.references	Proyecto wandb total. [Online]. Available: https://wandb.ai/robinson/GPO 2021? workspace=user-robinson
dc.relation.references	K. Rana, V. Dasagi, J. Haviland, B. Talbot, M. Milford, and N. S¨underhauf, “Bayesian Controller Fusion: Leveraging Control Priors in Deep Reinforcement Learning for Robotics,” pp. 1–19, 2021. [Online]. Available: http://arxiv.org/abs/2107.09822
dc.relation.references	Bayesian controller fusion: Leveraging control priors in deep reinforcement learning for robotics. [Online]. Available: https://arxiv.org/abs/2107.09822
dc.relation.references	A. Moreno, E. Armengol, J. B´ejar Alonso, L. A. Belanche Mu˜noz, C. U. Cort´es Garc´ıa, R. Gavald`a Mestre, J. M. Gimeno, M. Mart´ın Mu˜noz, and M. S`anchez-Marr`e, “Aprendizaje autom´atico,” 1994.
dc.relation.references	I. N. S. Rativa, “T´ecnicas de aprendizaje autom´atico aplicadas en los sistemas de predicci´on,” Tecnolog´ıa Investigaci´on y Academia, vol. 8, no. 1, pp. 37–53, 2020.
dc.relation.references	J. C. d. J. M. Rodrıguez, R. R. Biswal, and E. S. de la Cruz, “Algoritmos de aprendizaje autom´atico de vanguardia para el diagn´ostico de enfermedades,” Res. Comput. Sci, vol. 148, no. 7, pp. 455–468, 2019.
dc.relation.references	F. J. G. Quesada, M. A. F. Graciani, M. T. L. Bonal, and M. A. D´ıaz-Mata, “Aprendizaje con redes neuronales artificiales,” Ensayos: Revista de la Facultad de Educaci´on de Albacete, no. 9, pp. 169–180, 1994.
dc.relation.references	S. P. Singh, “Reinforcement learning algorithms for average-payoff markovian decision processes,” in AAAI, vol. 94, 1994, pp. 700–705.
dc.relation.references	R. Gij´on Revelles, “T´ecnicas de aprendizaje por refuerzo para el problema de cobertura de ´areas con robots a´ereos,” Ph.D. dissertation, ETSI Informatica, 2022.
dc.relation.references	M. Wunder, M. L. Littman, and M. Babes, “Classes of multiagent q-learning dynamics with epsilon-greedy exploration,” in ICML, 2010.
dc.relation.references	] X. Wang, S. Zhang, Z. Lei, S. Liu, X. Guo, and S. Z. Li, “Ensemble soft-margin softmax loss for image classification,” arXiv preprint arXiv:1805.03922, 2018.
dc.relation.references	K. Rana, V. Dasagi, J. Haviland, B. Talbot, M. Milford, and N. S¨underhauf, “Bayesian controller fusion: Leveraging control priors in deep reinforcement learning for robotics,” arXiv preprint arXiv:2107.09822, 2021.
dc.relation.references	J. C. Carranza Caro et al., “Implementaci´on de un agente capaz de aprender las acciones ´optimas de un ascensor utilizando aprendizaje por refuerzo,” 2014.
dc.relation.references	] M. L´opez Boada, B. L´opez Boada, and V. D´ıaz L´opez, “Algoritmo de aprendizaje por refuerzo continuo para el control de un sistema de suspensi´on semi-activa,” Algoritmo de aprendizaje por refuerzo continuo para el control de un sistema de suspensi´on semiactiva, vol. 9, no. 2, pp. 77–92, 2005.
dc.relation.references	J. D. Chuga Perugachi, “Modelos de aprendizaje profundo por refuerzo (deep reinforcement learning) para el movimiento aut´onomo de veh´ıculos a´ereos no tripulados,” 2020.
dc.relation.references	J. Liberal Huarte, “Deep reinforcement learning for autonomous collision avoidance,” 2020.
dc.relation.references	K. I. Lobos Tsunekawa, “Aplicaciones del aprendizaje reforzado en rob´otica m´ovil,” 2018.
dc.relation.references	A. Roshanineshat, “Evolutionary optimization for safe navigation of an autonomous robot in cluttered dynamic unknown environments,” Ph.D. dissertation, Cleveland State University, 2018.
dc.relation.references	A. S. D´ıaz Latorre et al., “Aprendizaje por refuerzo para control de sistemas din´amicos,” 2019
dc.relation.references	W. Zhang, Y. Zhang, N. Liu, K. Ren, and P. Wang, “Ipaprec: A promising tool for learning high-performance mapless navigation skills with deep reinforcement learning,” IEEE/ASME Transactions on Mechatronics, 2022.
dc.relation.references	E. Rohmer, S. P. Singh, and M. Freese, “V-rep: A versatile and scalable robot simulation framework,” in 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE, 2013, pp. 1321–1326.
dc.relation.references	S. James, M. Freese, and A. J. Davison, “Pyrep: Bringing v-rep to deep robot learning,” arXiv preprint arXiv:1906.11176, 2019
dc.relation.references	G. Pratx and L. Xing, “Gpu computing in medical physics: A review,” Medical physics, vol. 38, no. 5, pp. 2685–2697, 2011.
dc.relation.references	A. Boroumand, S. Ghose, M. Patel, H. Hassan, B. Lucia, R. Ausavarungnirun, K. Hsieh, N. Hajinazar, K. T. Malladi, H. Zheng et al., “Conda: Efficient cache coherence support for near-data accelerators,” in Proceedings of the 46th International Symposium on Computer Architecture, 2019, pp. 629–642.
dc.relation.references	Anaconda distribution -luigys toro — — actualizado el 15/09/2017 15:40 — aplicaciones, programaci´on. [Online]. Available: https://blog.desdelinux.net/ ciencia-de-datos-con-python/
dc.relation.references	E. Kharitonov, J. Copet, K. Lakhotia, T. A. Nguyen, P. Tomasello, A. Lee, A. Elkahky, W.-N. Hsu, A. Mohamed, E. Dupoux et al., “textless-lib: a library for textless spoken language processing,” arXiv preprint arXiv:2202.07359, 2022.
dc.relation.references	L. Rouhiainen, “Inteligencia artificial,” Madrid: Alienta Editorial, 2018.	spa
dc.relation.references	R. A. C. Ortiz, “Agente de aprendizaje por refuerzo con retroalimentaci´on interactiva.” Ph.D. dissertation, UNIVERSIDAD CENTRAL DE CHILE, 2020.	spa
dc.relation.references	M. Garc´ıa Pascual, “Aprendizaje por refuerzo profundo con openai gym,” B.S. thesis, 2021.	spa
dc.relation.references	Kuka-youbot-technical-specs.pdf. [Online]. Available: https://gaz.wiki/wiki/es/ Mecanum wheel	spa
dc.relation.references	B. Bogaerts, S. Sels, S. Vanlanduit, and R. Penne, “Connecting the coppeliasim robotics simulator to virtual reality,” SoftwareX, vol. 11, p. 100426, 2020.	spa
dc.relation.references	U. I. Mart´ın et al., “Generaci´on de trayectorias rob´oticas mediante aprendizaje profundo por refuerzo,” Master’s thesis, Universitat Polit`ecnica de Catalunya, 2018.	spa
dc.relation.references	Bayesian controller fusion: Leveraging control priors in deep reinforcement learning for robotics. [Online]. Available: https://arxiv.org/abs/2107.09822	spa
dc.subject	Aprendizaje por refuerzo	spa
dc.subject	vehículo 4WS	spa
dc.subject.keyword	4WS vehicle
dc.subject.keyword	Machine learning	spa
dc.subject.lemb	Aprendizaje por refuerzo - Modelos de simulación
dc.subject.lemb	Inteligencia artificial - Entrenamiento de agentes
dc.subject.lemb	Evaluación de modelos - Optimización de recursos.
dc.title	Sistema de planeación de trayectorias basado en aprendizaje por refuerzo para vehículo 4WS con Lidar2D.	spa
dc.type.coar	http://purl.org/coar/resource_type/c_8042	spa

Archivos

Bloque original

Mostrando 1 - 1 de 1

Nombre:: ProyectoGradoRobinson (1) (1).pdf
Tamaño:: 4.44 MB
Formato:: Adobe Portable Document Format
Descripción:: Ver documento

Descargar

Bloque de licencias

Mostrando 1 - 2 de 2

Nombre:: license.txt
Tamaño:: 2.87 KB
Formato:: Item-specific license agreed upon to submission
Descripción:

Descargar

Nombre:: autorizacionbibliotecaRobinsonFirmada robinson salazar.pdf
Tamaño:: 797.64 KB
Formato:: Adobe Portable Document Format
Descripción:: Carta de autorización

Descargar

Colecciones

Maestría en Ingeniería y Analítica de Datos