收起
Topic1: Tackling Safe and Efficient Multi-Agent Reinforcement Learning via Dynamic Shielding
Speaker:Wenli Xiao
Abstract: Multi-Agent Reinforcement Learning (MARL) has been increasingly used in safety-critical applications but has no safety guarantees, especially during training. In this paper, we propose dynamic shielding, a novel decentralized MARL framework to ensure safety in both training and deployment phases. Our framework leverages Shield, a reactive system running in parallel with the reinforcement learning algorithm to monitor and correct agents’ behavior. In our algorithm, shields dynamically split and merge according to the environment state in order to maintain decentralization and avoid conservative behaviors while enjoying formal safety guarantees. We demonstrate the effectiveness of MARL with dynamic shielding in the mobile navigation scenario.
Topic2: Less is more: A Robust Visual Inertial Odometry with Active Feature Extraction
Speaker: Muhan Lin
Abstract: Abstract— To achieve robust performance, it is common for visual odometry and SLAM to track more features, like several hundreds of points in real time. Although this strategy performs well on high-end desktop PCs, it is difficult to apply it to some mobile platforms with limited computation resources, such as VR, Micro UAV, and multi-camera systems. Additionally, noisy visual feature points may decrease the accuracy of visual odometry and SLAM. Therefore, fewer but more informative features can boost efficiency and accuracy compared to extracting more features. It means that less is more. To achieve this target, we propose a new criterion for the active feature selection based on singular values and then incorporate this method into an advanced VIO system, TP-TIO [1]. With the new system, using half of the features required by the original TP-TIO, the residuals can be reduced to 56.23% of the ones generated by the original TP-TIO without increasing the processing time to a large degree. The new system was verified with the mmpug datasets [2], which were extracted in a long and dark corridor.
1、本活动具体服务及内容由主办方【AIRS 研究院】提供,活动行仅提供票务技术支持,请仔细阅读活动内容后参与。
2、如在活动参与过程中遇到问题或纠纷,双方应友好协商沟通,也可联络活动行进行协助。