Industrial processes generate complex data that challenge fault detection systems, often yielding opaque or underwhelming results despite advanced machine learning techniques. This study tackles such difficulties using the Tennessee Eastman Process, a well-established benchmark known for its intricate dynamics, to develop an innovative fault detection framework. Initial attempts with standard models revealed limitations in both performance and interpretability, prompting a shift toward a more tractable approach. By employing SHAP (SHapley Additive exPlanations), we transform the problem into a more manageable and transparent form, pinpointing the most critical process features driving fault predictions. This reduction in complexity unlocks the ability to apply causal analysis through Directed Acyclic Graphs, generated by multiple algorithms, to uncover the underlying mechanisms of fault propagation. The resulting causal structures align strikingly with SHAP findings, consistently highlighting key process elements-like cooling and separation systems-as pivotal to fault development. Together, these methods not only enhance detection accuracy but also provide operators with clear, actionable insights into fault origins, a synergy that, to our knowledge, has not been previously explored in this context. This dual approach bridges predictive power with causal understanding, offering a robust tool for monitoring complex manufacturing environments and paving the way for smarter, more interpretable fault detection in industrial systems.
翻译:工业过程产生的复杂数据对故障检测系统构成挑战,尽管采用了先进的机器学习技术,结果往往仍不透明或效果有限。本研究以田纳西-伊斯曼过程(一个以其复杂动态特性著称的成熟基准测试)为对象,开发了一种创新的故障检测框架。初期采用标准模型的尝试在性能和可解释性方面均显不足,促使我们转向更易处理的方法。通过运用SHAP(SHapley可加性解释)技术,我们将问题转化为更易管理且透明的形式,精准识别驱动故障预测的关键过程特征。这种复杂度的降低使我们能够通过有向无环图(由多种算法生成)进行因果分析,从而揭示故障传播的内在机制。所得的因果结构与SHAP分析结果高度吻合,一致突显冷却系统、分离系统等关键工艺要素在故障发展中的核心作用。这两种方法的结合不仅提升了检测精度,还为操作人员提供了清晰、可操作的故障根源洞察。据我们所知,这种协同作用在此前的研究中尚未被探索。这种双重方法将预测能力与因果理解相融合,为监控复杂制造环境提供了强大工具,并为工业系统实现更智能、更可解释的故障检测开辟了新途径。