Genetic Algorithm (GA) is a popular meta-heuristic evolutionary algorithm that uses stochastic operators to find optimal solution and has proved its effectiveness in solving many complex optimization problems (such as classification, optimization, and scheduling). However, despite its performance, popularity and simplicity, not much attention has been paid towards reproducibility and reusability of GA. In this paper, we have extended Findable, Accessible, Interoperable and Reusable (FAIR) data principles to enable the reproducibility and reusability of algorithms. We have chosen GA as a usecase to the demonstrate the applicability of the proposed principles. Also we have presented an overview of methodological developments and variants of GA that makes it challenging to reproduce or even find the right source. Additionally, to enable FAIR algorithms, we propose a vocabulary (i.e. $evo$) using light weight RDF format, facilitating the reproducibility. Given the stochastic nature of GAs, this work can be extended to numerous Optimization and machine learning algorithms/methods.
翻译:遗传算法(Genetic Algorithm,GA)是一种流行的元启发式演化算法,使用随机操作来寻找最优解,并已被证明在解决许多复杂的优化问题(如分类、优化和排程)方面具有有效性、流行性和简单性。然而,尽管其性能、流行和简单性,但并没有多少关注被付诸GA的再现性和可重用性。在本文中,我们扩展了可发现、可访问、可互用和可重用的(FAIR)数据的原则,以实现算法的再现性和可重用性。我们选择遗传算法作为一个应用案例来展示所提出的原则的适用性。同时,我们还提出了GA的方法论发展和变体的概述,这些使其难以重现甚至找到正确的源。此外,为了实现FAIR算法,我们提出了使用轻量级RDF格式的术语表(evo),以促进算法的可重现性。考虑到GA的随机性质,这项工作可以扩展到众多的优化和机器学习算法/方法。