Reinforcement Learning to calculate routes for simulated robotic safety cones

Eva Murio; Jesus Balado; Pedro Arias

doi:10.3390/ASEC2023-15962

Previous Article in event

Royal jelly suppresses invasive potential of colorectal cancer cells by attenuating Vimentin and Snail

Previous Article in session

IDENTIFYING OF PEST ATTACK ON CORN CROP USING MACHINE LEARNING TECHNIQUES

Next Article in event

An Analysis of Artificial Intelligence Adoption Behavior Applying Extended UTAUT Framework in Urban Cities: The Context of Collectivistic Culture

Reinforcement Learning to calculate routes for simulated robotic safety cones

Eva Murio

^*,

Jesus Balado

Pedro Arias

¹ GeoTECH Group, CINTECX, Universidade de Vigo, 36310 Vigo, Spain;

Academic Editor: Nunzio Cennamo

Published: 09 November 2023 by MDPI in The 4th International Electronic Conference on Applied Sciences session Computing and Artificial Intelligence

https://doi.org/10.3390/ASEC2023-15962

Abstract:

The importance of transportation cannot be overstated, with road maintenance and construction being among the most crucial sectors. However, this area has been slow to update its tools and procedures, despite the benefits of automation. By embracing automation, the road construction industry can realize benefits such as increased efficiency, reduced physical strain on workers, shorter construction times, and less economic loss. In the road construction environment, traffic cones are commonly used to delimit work areas. These cones must be placed by workers and moved as the project progresses. Automation can greatly accelerate this process, freeing up workers for more complex tasks. However, conventional robots require an operator to control the device, limiting the efficiency gains.

To address this inefficiency, we propose a solution based on a robot that can autonomously reach the desired position. Our objective is to develop a model of a robotic cone using reinforcement learning, enabling it to operate independently and improve the efficiency of road construction projects. The self-learning is based on a system of rewards and punishments to achieve the desired position. The cone is rewarded if it approaches or reaches the goal, but it is penalized if it moves away, exceeds the goal or is exploring a wrong quadrant. By using this method, the cone must choose between a 0º or 90º each step-time to maximize the long-term reward. The simulated robotic safety cones reach the target, but the large number of variables involved long training times.

Keywords: Reinforcement Learning; pathfinding, simulation; road environment, work zone.

View paper

0 Reads
0 Recommendations

Eva Murio

Jesus Balado

Pedro Arias