Aerial manipulators, which combine robotic arms with multi-rotor drones, face strict constraints on arm weight and mechanical complexity. In this work, we study a lightweight 2-degree-of-freedom (DoF) arm mounted on a quadrotor via a differential mechanism, capable of full six-DoF end-effector pose control. While the minimal design enables simplicity and reduced payload, it also introduces challenges such as underactuation and sensitivity to external disturbances, including manipulation of heavy loads and pushing tasks. To address these, we employ reinforcement learning, training a Proximal Policy Optimization (PPO) agent in simulation to generate feedforward commands for quadrotor acceleration and body rates, along with joint angle targets. These commands are tracked by an incremental nonlinear dynamic inversion (INDI) attitude controller and a PID joint controller, respectively. Flight experiments demonstrate centimeter-level position accuracy and degree-level orientation precision, with robust performance under external force disturbances. The results highlight the potential of learning-based control strategies for enabling contact-rich aerial manipulation using simple, lightweight platforms.
翻译:空中机械臂将机械臂与多旋翼无人机相结合,面临臂体重量和机械复杂性的严格限制。本研究探讨了一种通过差动机构安装在四旋翼飞行器上的轻量化二自由度机械臂,该机构能够实现完整的六自由度末端执行器位姿控制。这种极简设计虽然实现了结构简化并降低了有效载荷,但也带来了欠驱动和对包括重载操作与推压任务在内的外部干扰敏感等挑战。为解决这些问题,我们采用强化学习方法,在仿真环境中训练近端策略优化智能体,以生成四旋翼加速度与机体角速率的前馈指令,以及关节角度目标值。这些指令分别由增量非线性动态逆姿态控制器和PID关节控制器进行跟踪。飞行实验表明,该系统在外部力扰动下仍能保持厘米级的位置精度与度级的姿态精度,展现出稳健的性能。研究结果凸显了基于学习的控制策略在利用简单轻量化平台实现密集接触式空中操作的潜力。