Autonomous Surface Vehicles (ASVs) play a crucial role in maritime operations, yet their navigation in shallow-water environments remains challenging due to dynamic disturbances and depth constraints. Traditional navigation strategies struggle with limited sensor information, making safe and efficient operation difficult. In this paper, we propose a reinforcement learning (RL) framework for ASV navigation under depth constraints, where the vehicle must reach a target while avoiding unsafe areas with only a single depth measurement per timestep from a downward-facing Single Beam Echosounder (SBES). To enhance environmental awareness, we integrate Gaussian Process (GP) regression into the RL framework, enabling the agent to progressively estimate a bathymetric depth map from sparse sonar readings. This approach improves decision-making by providing a richer representation of the environment. Furthermore, we demonstrate effective sim-to-real transfer, ensuring that trained policies generalize well to real-world aquatic conditions. Experimental results validate our method's capability to improve ASV navigation performance while maintaining safety in challenging shallow-water environments.
翻译:自主水面航行器(ASV)在海上作业中发挥着关键作用,但由于动态扰动与深度约束,其在浅水环境中的导航仍具挑战性。传统导航策略难以应对有限的传感器信息,导致安全高效运行受阻。本文提出一种深度约束下的ASV强化学习(RL)导航框架,其中航行器需在每时间步仅通过向下单波束回声测深仪(SBES)获取单次深度测量的条件下抵达目标,同时避开危险区域。为增强环境感知能力,我们将高斯过程(GP)回归集成至RL框架中,使智能体能从稀疏声纳读数逐步估计海底地形深度图。该方法通过提供更丰富的环境表征来改进决策。此外,我们展示了有效的仿真到现实迁移,确保训练后的策略能良好泛化至真实水域条件。实验结果验证了本方法在提升ASV导航性能的同时,能在挑战性浅水环境中保障安全性。