Skypilot：基于物理接地的LLM微调用于AAV覆盖搜索 (Skypilot: Fine-Tuning LLM with Physical Grounding for AAV Coverage Search)

Autonomous aerial vehicles (AAVs) have played a pivotal role in coverage operations and search missions. Recent advances in large language models (LLMs) offer promising opportunities to augment AAV intelligence. These advances help address complex challenges like area coverage optimization, dynamic path planning, and adaptive decision-making. However, the absence of physical grounding in LLMs leads to hallucination and reproducibility problems in spatial reasoning and decision-making. To tackle these issues, we present Skypilot, an LLM-enhanced two-stage framework that grounds language models in physical reality by integrating monte carlo tree search (MCTS). In the first stage, we introduce a diversified action space that encompasses generate, regenerate, fine-tune, and evaluate operations, coupled with physics-informed reward functions to ensure trajectory feasibility. In the second stage, we fine-tune Qwen3-4B on 23,000 MCTS-generated samples, achieving substantial inference acceleration while maintaining solution quality. Extensive numerical simulations and real-world flight experiments validate the efficiency and superiority of our proposed approach. Detailed information and experimental results are accessible at https://sky-pilot.top.

翻译：自主空中飞行器（AAV）在覆盖作业和搜索任务中发挥着关键作用。大语言模型（LLM）的最新进展为增强AAV智能提供了广阔前景，有助于应对区域覆盖优化、动态路径规划和自适应决策等复杂挑战。然而，LLM缺乏物理接地，导致其在空间推理和决策过程中出现幻觉和可复现性问题。为解决这些问题，我们提出了Skypilot，一种基于LLM增强的两阶段框架，通过集成蒙特卡洛树搜索（MCTS）将语言模型锚定于物理现实。第一阶段，我们引入了包含生成、重新生成、微调和评估操作的多样化动作空间，并结合基于物理知识的奖励函数以确保轨迹可行性。第二阶段，我们在23,000个MCTS生成的样本上对Qwen3-4B模型进行微调，在保持解质量的同时实现了显著的推理加速。大量数值模拟和真实飞行实验验证了所提方法的效率和优越性。详细信息和实验结果可通过https://sky-pilot.top获取。