This paper proposes two new algorithms for the lane keeping system (LKS) in autonomous vehicles (AVs) operating under snowy road conditions. These algorithms use deep reinforcement learning (DRL) to handle uncertainties and slippage. They include Action-Robust Recurrent Deep Deterministic Policy Gradient (AR-RDPG) and end-to-end Action-Robust convolutional neural network Attention Deterministic Policy Gradient (AR-CADPG), two action-robust approaches for decision-making. In the AR-RDPG method, within the perception layer, camera images are first denoised using multi-scale neural networks. Then, the centerline coefficients are extracted by a pre-trained deep convolutional neural network (DCNN). These coefficients, concatenated with the driving characteristics, are used as input to the control layer. The AR-CADPG method presents an end-to-end approach in which a convolutional neural network (CNN) and an attention mechanism are integrated within a DRL framework. Both methods are first trained in the CARLA simulator and validated under various snowy scenarios. Real-world experiments on a Jetson Nano-based autonomous vehicle confirm the feasibility and stability of the learned policies. Among the two models, the AR-CADPG approach demonstrates superior path-tracking accuracy and robustness, highlighting the effectiveness of combining temporal memory, adversarial resilience, and attention mechanisms in AVs.
翻译:本文针对自动驾驶车辆在积雪道路条件下的车道保持系统,提出了两种新算法。这些算法采用深度强化学习处理不确定性与滑移问题,包括动作鲁棒循环深度确定性策略梯度与端到端动作鲁棒卷积神经网络注意力确定性策略梯度两种动作鲁棒决策方法。在AR-RDPG方法中,感知层首先通过多尺度神经网络对摄像头图像进行去噪,随后通过预训练的深度卷积神经网络提取车道中心线系数。这些系数与驾驶特征拼接后作为控制层的输入。AR-CADPG方法提出了一种端到端方案,将卷积神经网络与注意力机制集成于深度强化学习框架内。两种方法均在CARLA仿真器中完成训练,并在多种积雪场景下验证。基于Jetson Nano的实车实验证实了所学策略的可行性与稳定性。两种模型中,AR-CADPG方法展现出更优的路径跟踪精度与鲁棒性,凸显了在自动驾驶系统中融合时序记忆、对抗鲁棒性与注意力机制的有效性。