Comparative Analysis of Loss Functions in TD3 for Autonomous Parking

Authors

  • Chan Ka Heng Universiti Tun Hussein Onn Malaysia
  • Aida Mustapha Universiti Tun Hussein Onn Malaysia
  • Mohammed Ahmed Jubair Imam Ja'afar Al-Sadiq University

Keywords:

Autonomous Parking, Reinforcement Learning

Abstract

Autonomous parking is a revolutionary technology that has transformed the automotive industry with the rise of deep reinforcement learning, in particular, the Twin-Delayed Deep Deterministic Policy Gradient Algorithm (TD3). Nonetheless, the robustness of TD3 remains a significant challenge due to bias in Q-value estimates when determining how good an Action, A, taken at a particular state, S. To investigate this gap, this paper analyzes different loss functions in TD3 to better approximate the true Q-value, which is necessary for optimal decision making. Three loss functions are evaluated; Mean Squared Error (MSE), Mean Absolute Error (MAE) and Huber Loss via a simulation experiment for autonomous parking. The results showed that TD3 with Huber Loss has the highest convergence speed with the fastest Actor and Critic loss convergence. The Huber Loss function is found to be more robust and efficient than either loss function such MSE or MAE used in isolation, making it a suitable replacement for existing loss functions in the TD3 algorithm. In the future, TD3 with Huber Loss will be used as the base model to solve overestimation problem in TD3 when the estimated Q-values that represent the expected rewards of taking an action in a particular state, are higher than their true values.

Downloads

Published

21-06-2024

Issue

Section

Articles

How to Cite

Chan, K. H., Mustapha, A., & Ahmed Jubair, M. (2024). Comparative Analysis of Loss Functions in TD3 for Autonomous Parking. Journal of Soft Computing and Data Mining, 5(1), 1-14. https://publisher.uthm.edu.my/ojs/index.php/jscdm/article/view/17678