Data assimilation (DA) methods combine model predictions with observational data to improve state estimation in dynamical systems, inspiring their increasingly prominent role in geophysical and climate applications. Classical DA methods assume that the governing equations modeling the dynamics are known, which is unlikely for most real world applications. Machine learning (ML) provides a flexible alternative by learning surrogate models directly from data, but standard ML methods struggle in noisy and data-scarce environments, where meaningful extrapolation requires incorporating physical constraints. Recent advances in structure-preserving ML architectures, such as the development of the entropy-stable conservative flux form network (ESCFN), highlight the critical role of physical structure in improving learning stability and accuracy for unknown systems of conservation laws. Structural information has also been shown to improve DA performance. Gradient-based measures of spatial variability, in particular, can help refine ensemble updates in discontinuous systems. Motivated by both of these recent innovations, this investigation proposes a new non-intrusive, structure-preserving sequential data assimilation (NSSDA) framework that leverages structure at both the forecast and analysis stages. We use the ESCFN to construct a surrogate model to preserve physical laws during forecasting, and a structurally informed ensemble transform Kalman filter (SETKF) to embed local statistical structure into the assimilation step. Our method operates in a highly constrained environment, using only a single noisy trajectory for both training and assimilation. Numerical experiments where the unknown dynamics correspond respectively to the shallow water and Euler equations demonstrate significantly improved predictive accuracy.
翻译:数据同化(DA)方法将模型预测与观测数据相结合,以改进动态系统中的状态估计,这使其在地球物理和气候应用中的作用日益凸显。经典DA方法假设描述动态的控制方程已知,但这在大多数实际应用中并不成立。机器学习(ML)通过学习直接从数据中构建代理模型,提供了一种灵活的替代方案,但标准ML方法在噪声大、数据稀缺的环境中表现不佳,因为有意义的外推需要融入物理约束。结构保持ML架构的最新进展,例如熵稳定保守通量形式网络(ESCFN)的开发,突显了物理结构在提高未知守恒律系统学习稳定性和准确性方面的关键作用。结构信息也被证明可以改善DA性能。特别是空间变异性的梯度度量,有助于改进不连续系统中的集合更新。受这些最新创新的启发,本研究提出了一种新的非侵入式、结构保持序列数据同化(NSSDA)框架,该框架在预报和分析阶段均利用结构信息。我们使用ESCFN构建代理模型以在预报过程中保持物理定律,并采用结构感知的集合变换卡尔曼滤波器(SETKF)将局部统计结构嵌入同化步骤。我们的方法在高度受限的环境中运行,仅使用单个噪声轨迹进行训练和同化。数值实验中,未知动态分别对应于浅水方程和欧拉方程,结果证明了预测精度的显著提升。