Symmetry considerations are at the core of the major frameworks used to provide an effective mathematical representation of atomic configurations that is then used in machine-learning models to predict the properties associated with each structure. In most cases, the models rely on a description of atom-centered environments, and are suitable to learn atomic properties, or global observables that can be decomposed into atomic contributions. Many quantities that are relevant for quantum mechanical calculations, however -- most notably the single-particle Hamiltonian matrix when written in an atomic-orbital basis -- are not associated with a single center, but with two (or more) atoms in the structure. We discuss a family of structural descriptors that generalize the very successful atom-centered density correlation features to the N-centers case, and show in particular how this construction can be applied to efficiently learn the matrix elements of the (effective) single-particle Hamiltonian written in an atom-centered orbital basis. These N-centers features are fully equivariant -- not only in terms of translations and rotations, but also in terms of permutations of the indices associated with the atoms -- and are suitable to construct symmetry-adapted machine-learning models of new classes of properties of molecules and materials.
翻译:对称考虑是主要框架的核心,主要框架用于提供原子配置的有效数学表示,然后在机器学习模型中用来预测每个结构的属性。在多数情况下,模型依赖于原子中心环境的描述,适合学习原子特性,或可分解成原子贡献的全球性观测结果。许多数量与量子机械计算有关,但最显著的是单粒汉密尔顿矩阵,如果以原子轨道为基础写成,它们与一个中心无关,而是与结构中的两个(或更多)原子相联系。我们讨论的是结构解说器的组合,它将非常成功的原子中心密度关联特征概括到N中心案例,并特别说明如何运用这种构造来有效学习原子中心轨道基础上写成的单粒汉密尔密尔顿仪的矩阵要素。这些N中心特征完全不均匀 -- -- 不仅在翻译和旋转方面,而且在与原子模型模型相关的指数和分子模型的透化方面 -- -- 与原子模型的模型和分子模型的正确性模型的模型和模型的正确性。