As the demands for large-scale information processing have grown, knowledge graph-based approaches have gained prominence for representing general and domain knowledge. The development of such general representations is essential, particularly in domains such as manufacturing which intelligent processes and adaptive education can enhance. Despite the continuous accumulation of text in these domains, the lack of structured data has created information extraction and knowledge transfer barriers. In this paper, we report on work towards developing robust knowledge graphs based upon entity and relation data for both commercial and educational uses. To create the FabKG (Manufacturing knowledge graph), we have utilized textbook index words, research paper keywords, FabNER (manufacturing NER), to extract a sub knowledge base contained within Wikidata. Moreover, we propose a novel crowdsourcing method for KG creation by leveraging student notes, which contain invaluable information but are not captured as meaningful information, excluding their use in personal preparation for learning and written exams. We have created a knowledge graph containing 65000+ triples using all data sources. We have also shown the use case of domain-specific question answering and expression/formula-based question answering for educational purposes.
翻译:随着大规模信息处理需求的增加,以知识图表为基础的方法在代表一般和领域知识方面变得日益突出。这种一般性表述的发展至关重要,特别是在智能过程和适应教育能够加强的制造等领域。尽管这些领域的文本不断积累,但结构化数据的缺乏造成了信息提取和知识转让障碍。在本文件中,我们报告了根据实体和商务和教育用途关系数据编制稳健的知识图表的工作。为创建FabKG(制造知识图表),我们使用了教科书索引词、研究用纸字、FabNER(制造NER),以提取维基数据内包含的次级知识库。此外,我们提议采用新的众包方法来创建KG,利用学生笔记,其中载有宝贵的信息,但没有被记录为有意义的信息,没有将其用于个人准备学习和书面考试。我们用所有数据源制作了一个包含65000+三倍的知识图表。我们还展示了用于教育目的的特定问题回答和表达/形式问题解答的事例。