Path planning and real-time obstacle avoidance methods of intelligent ships in complex open water environment

doi:10.13196/j.cims.2022.07.009

Computer Integrated Manufacturing System ›› 2022, Vol. 28 ›› Issue (7): 2030-2040. DOI: 10.13196/j.cims.2022.07.009

YANG Qisen, WANG Shenzhi, SANG Jinnan, WANG Chaofei, HUANG Gao, WU Cheng, SONG Shiji⁺

Department of Automation, Tsinghua University

Online: 2022-07-31 Published: 2022-08-03
Supported by:
Project supported by the Key Research and Development Program of Guangdong Province, China (No. 2020B1111500002).

Abstract

Abstract: To address path planning and obstacle avoidance for intelligent ships navigating in complex, dynamic open-water environments, a simulation platform was built that incorporates nautical charts and the International Regulations for Preventing Collisions at Sea, and the task was modeled as a Markov decision process. Deep reinforcement learning methods and traditional deterministic algorithms were analyzed theoretically, and a potential-guided reward suited to the ship navigation task was designed for the deep reinforcement learning algorithms. The path planning and real-time obstacle avoidance capabilities of the two classes of methods were compared experimentally under different numbers and motion states of obstacles. In simulation, the deep reinforcement learning methods outperformed the traditional methods at every difficulty setting. As the environment became more difficult, the performance of the traditional methods gradually degraded, whereas that of the deep reinforcement learning methods remained stable. For real-time collision-avoidance decision making, the deep reinforcement learning methods offered high safety, short sailing time, and stable performance.

Key words: intelligent ship, path planning, obstacle avoidance, deep reinforcement learning
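
The abstract does not state the form of the potential-guided reward. As a minimal sketch of the general idea, assuming the standard potential-based shaping form (a base reward plus γΦ(s') − Φ(s)), the following Python uses a hypothetical potential equal to the negative Euclidean distance from the ship to its goal. The names `potential`, `shaped_reward`, and `GAMMA`, and the choice of potential, are illustrative assumptions and are not taken from the paper.

```python
import math

GAMMA = 0.99  # discount factor; an assumed value, not from the paper


def potential(pos, goal):
    """Hypothetical potential: negative Euclidean distance to the goal."""
    return -math.dist(pos, goal)


def shaped_reward(base_reward, pos, next_pos, goal, gamma=GAMMA):
    """Add the shaping term gamma * Phi(s') - Phi(s) to the base reward."""
    return base_reward + gamma * potential(next_pos, goal) - potential(pos, goal)


if __name__ == "__main__":
    goal = (10.0, 0.0)
    start = (0.0, 0.0)
    # A step toward the goal earns a larger shaped reward than a step away.
    print(shaped_reward(0.0, start, (1.0, 0.0), goal))
    print(shaped_reward(0.0, start, (-1.0, 0.0), goal))
```

Under these assumptions, a step that reduces the distance to the goal is rewarded more than one that increases it, which gives the agent a dense guidance signal even when the base reward is only issued on arrival or collision.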

CLC number:

U675.79

YANG Qisen, WANG Shenzhi, SANG Jinnan, WANG Chaofei, HUANG Gao, WU Cheng, SONG Shiji. Path planning and real-time obstacle avoidance methods of intelligent ships in complex open water environment[J]. Computer Integrated Manufacturing System, 2022, 28(7): 2030-2040.

