We must recognize that natural language is a way of encoding information, and that it encodes not only information but also the procedures by which that information is processed. To understand natural language, just as when we conceive and design computer languages, the first step is to separate information (or data) from the procedures that process it. In natural language, some data-processing procedures are encoded directly as structure chunks and pointer chunks (this paper reclassifies lexical chunks into data chunks, structure chunks, and pointer chunks); some are implied by sentence structure; and some are requests expressed by the information sender and carried out by the information receiver. For the data part, the paper discusses the classification encoding system for attribute information and the information organization architecture (including the constituent structures of information sets and the hierarchy among information sets). In section 3, the theory elaborated in section 2 is verified with examples, which show that the approach achieves the goal of enabling machines to understand the information conveyed in a dialogue. In section 4, the author summarizes the basic conditions of "Understanding" and reconsiders what "Understanding" is and how to proceed. The study provides a practical theoretical basis and research methods for natural language understanding (NLU). It can also be applied to large-scale, multi-type information processing in artificial intelligence (AI).