Structural Equation Modeling (SEM) is an umbrella term that includes numerous multivariate statistical techniques that are employed throughout a plethora of research areas, ranging from social to natural sciences. Until recently, SEM software was either commercial or restricted to niche languages, and the lack of SEM packages compatible with more mainstream programming languages was dire. To combat that, we introduced a Python package semopy 1 that surpassed other state-of-the-art software in terms of performance and estimation accuracy. Yet, it was lacking in functionality and its usage was burdened with unnecessary boilerplate code. Here, we introduce a complete overhaul of semopy that improves upon the previous results and comes with lots of new capabilities. Furthermore, we propose a novel SEM model that combines in itself a notion of random effects from linear mixed models (LMMs) to model numerous phenomena, such as spatial data, time series or population stratification in genetics.
翻译:结构等式模型(SEM)是一个总括术语,包括社会科学到自然科学等众多研究领域使用的多种多变统计技术。直到最近,SEM软件要么是商业软件,要么仅限于特殊语言,缺乏与较主流编程语言相容的SEM软件包非常糟糕。为此,我们引入了一个Python软件模版1,该软件在性能和估计准确性方面超过了其他最先进的软件。然而,它缺乏功能,其使用也伴随着不必要的锅炉板代码。在这里,我们引入了对模版的彻底修改,改进了先前的结果,并带来了许多新的能力。此外,我们提出了一个新的SEM模型,将线性混合模型(LMMs)的随机效应概念本身结合到多种现象的模型中,例如空间数据、时间序列或基因人口分层。