How to evaluate Artificial General Intelligence (AGI) is a critical problem that is discussed and unsolved for a long period. In the research of narrow AI, this seems not a severe problem, since researchers in that field focus on some specific problems as well as one or some aspects of cognition, and the criteria for evaluation are explicitly defined. By contrast, an AGI agent should solve problems that are never-encountered by both agents and developers. However, once a developer tests and debugs the agent with a problem, the never-encountered problem becomes the encountered problem, as a result, the problem is solved by the developers to some extent, exploiting their experience, rather than the agents. This conflict, as we call the trap of developers' experience, leads to that this kind of problems is probably hard to become an acknowledged criterion. In this paper, we propose an evaluation method named Artificial Open World, aiming to jump out of the trap. The intuition is that most of the experience in the actual world should not be necessary to be applied to the artificial world, and the world should be open in some sense, such that developers are unable to perceive the world and solve problems by themselves before testing, though after that they are allowed to check all the data. The world is generated in a similar way as the actual world, and a general form of problems is proposed. A metric is proposed aiming to quantify the progress of research. This paper describes the conceptual design of the Artificial Open World, though the formalization and the implementation are left to the future.
翻译:如何评价人工通用情报(AGI)是一个关键问题,在很长一段时间内讨论和未解决。在狭义的AI研究中,这似乎不是一个严重问题,因为该领域的研究人员侧重于某些具体问题以及认知的某一或某些方面,而评估的标准则有明确规定。相比之下,AGI代理商应当解决从未被代理人和开发商所预见过的问题。然而,一旦开发商测试并解答一个问题,从未遇到的问题就成为遇到的问题,因此,问题在某种程度上由开发商解决,利用他们的经验,而不是代理人来解决。这场冲突,正如我们称之为开发商经验的陷阱,导致这类问题可能很难成为公认的标准。在本文件中,我们建议一种名为人工开放世界,目的是跳出陷阱。 直觉是,现实世界的大部分经验不应该被应用到人造世界,因此,从某种意义上说,开发商应该在某种程度上解决问题,从某种意义上说,在现实设计上,在现实中,一个真实的世界是无法被验证的,在现实中,在现实中,在现实中,一个真实地说,在现实中,一个过程中,一个过程是无法被验证出来的。