通过神经示例拆分实现神经- 神经- 双曲矩形综合框架 (Neuro-Symbolic Regex Synthesis Framework via Neural Example Splitting) - 专知论文

会员服务 ·

0

正则表达式 · 样例 · 学成 · 连结 · 推断 ·

2022 年 5 月 20 日

Neuro-Symbolic Regex Synthesis Framework via Neural Example Splitting

翻译：通过神经示例拆分实现神经- 神经- 双曲矩形综合框架

Su-Hyeon Kim,Hyunjoon Cheon,Yo-Sub Han,Sang-Ki Ko

Due to the practical importance of regular expressions (regexes, for short), there has been a lot of research to automatically generate regexes from positive and negative string examples. We tackle the problem of learning regexes faster from positive and negative strings by relying on a novel approach called `neural example splitting'. Our approach essentially split up each example string into multiple parts using a neural network trained to group similar substrings from positive strings. This helps to learn a regex faster and, thus, more accurately since we now learn from several short-length strings. We propose an effective regex synthesis framework called `SplitRegex' that synthesizes subregexes from `split' positive substrings and produces the final regex by concatenating the synthesized subregexes. For the negative sample, we exploit pre-generated subregexes during the subregex synthesis process and perform the matching against negative strings. Then the final regex becomes consistent with all negative strings. SplitRegex is a divided-and-conquer framework for learning target regexes; split (=divide) positive strings and infer partial regexes for multiple parts, which is much more accurate than the whole string inferring, and concatenate (=conquer) inferred regexes while satisfying negative strings. We empirically demonstrate that the proposed SplitRegex framework substantially improves the previous regex synthesis approaches over four benchmark datasets.

翻译：由于常规表达式的实际重要性(regexes, 短短的), 有很多研究可以自动生成正和负字符串示例的正和负字符串实例。我们通过依赖所谓的“ 神经示例分裂” 的新颖方法,解决了从正和负字符串中更快学习正和负字符串的问题。我们的方法基本上将每个例字符串分成多个部分, 使用经训练的神经网络将相似的子字符串从正字符串中分组。这有助于学习正和更快的regex, 因此, 自从我们从一些短字符里学习以来, 更准确地说, 更准确地说, 自我们从一些短字符串中学习了正和负的正的正对数化合成框架。我们建议了一个有效的正正正正正的反正对数化和正反正对比框架。分解后, 右对正对正的反正对正对正对正的反比度框架( 在正对正的正对正对正的对正对正对立中, ) 和对正对正的反正的对正对立框架的对立框架框架( 。分化前和正的对正的对正的对立框架是整的对立和正的对正的对正的对立和对正的对立框架框架框架框架,对立和对准。

0

相关内容

正则表达式

正则表达式

正则表达式（Regular Expression，一般简写为RegEx或者RegExp），也译为正规表示法、常规表示法，台湾译「规则运算式」，在计算机科学中，是指一个用来描述或者匹配一系列符合某个句法规则的字符串的单个字符串。

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

专知会员服务

74+阅读 · 2020年8月2日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

【ACL2020放榜!】事件抽取、关系抽取、NER、Few-Shot 相关论文整理

【ACL2020放榜!】事件抽取、关系抽取、NER、Few-Shot 相关论文整理

深度学习自然语言处理

18+阅读 · 2020年5月22日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

极端条件下元素液体（Se和I2）的结构和性质研究

国家自然科学基金

0+阅读 · 2015年12月31日

电热诱导抓捕分析物的高灵敏度复合纳米结构表面增强拉曼散射系统

国家自然科学基金

0+阅读 · 2014年12月31日

水稻膜联蛋白OsANN7调控种子发育和耐盐反应的分子机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

具有曲界面表面流与多孔介质流耦合问题的有限元法

国家自然科学基金

0+阅读 · 2013年12月31日

剧烈塑性变形条件下金属间化合物相变研究

国家自然科学基金

0+阅读 · 2012年12月31日

岩石渗流应力耦合流变力学实验及本构模型研究

国家自然科学基金

0+阅读 · 2011年12月31日

利用固体中含结晶水盐受热产生纳米气泡及气泡破裂与水热合成研究

国家自然科学基金

0+阅读 · 2011年12月31日

陶瓷材料热震失效机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于MSLS插值的无网格流形方法及裂纹扩展模拟研究

国家自然科学基金

0+阅读 · 2009年12月31日

戊型肝炎病毒ORF2蛋白与宿主细胞蛋白的相互作用及其生物学意义

国家自然科学基金

0+阅读 · 2008年12月31日

Fast-Vid2Vid: Spatial-Temporal Compression for Video-to-Video Synthesis

Arxiv

0+阅读 · 2022年7月11日

Reconstructing Missing EHRs Using Time-Aware Within- and Cross-Visit Information for Septic Shock Early Prediction

Reconstructing Missing EHRs Using Time-Aware Within- and Cross-Visit Information for Septic Shock Early Prediction

Arxiv

0+阅读 · 2022年7月11日

A Lexicon and Depth-wise Separable Convolution Based Handwritten Text Recognition System

Arxiv

0+阅读 · 2022年7月11日

Isochrony-Aware Neural Machine Translation for Automatic Dubbing

Arxiv

0+阅读 · 2022年7月8日

Sequence tagging for biomedical extractive question answering

Arxiv

0+阅读 · 2022年7月7日

Contrastive Triple Extraction with Generative Transformer

Arxiv

13+阅读 · 2021年2月4日

Subgraph Neural Networks

Arxiv

27+阅读 · 2020年6月19日

Text Generation from Knowledge Graphs with Graph Transformers

Arxiv

35+阅读 · 2019年4月4日

Event Extraction with Generative Adversarial Imitation Learning

Arxiv

13+阅读 · 2018年4月21日

Order-Free RNN with Visual Attention for Multi-Label Classification

Arxiv

16+阅读 · 2017年12月20日

VIP会员

文章信息

相关主题

正则表达式

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

专知会员服务

74+阅读 · 2020年8月2日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

从社会学实验到行为仿真：理解基于Agent的观点动力学建模思维

中英文版《GPT-5 System Card速览》报告

ACL 2025 | 大模型结构化知识提示的泛化能力研究

【普林斯顿博士论文】大型模型的高效推理

相关资讯

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

【ACL2020放榜!】事件抽取、关系抽取、NER、Few-Shot 相关论文整理

【ACL2020放榜!】事件抽取、关系抽取、NER、Few-Shot 相关论文整理

深度学习自然语言处理

18+阅读 · 2020年5月22日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

相关论文

Fast-Vid2Vid: Spatial-Temporal Compression for Video-to-Video Synthesis

Arxiv

0+阅读 · 2022年7月11日

Reconstructing Missing EHRs Using Time-Aware Within- and Cross-Visit Information for Septic Shock Early Prediction

Reconstructing Missing EHRs Using Time-Aware Within- and Cross-Visit Information for Septic Shock Early Prediction

Arxiv

0+阅读 · 2022年7月11日

A Lexicon and Depth-wise Separable Convolution Based Handwritten Text Recognition System

Arxiv

0+阅读 · 2022年7月11日

Isochrony-Aware Neural Machine Translation for Automatic Dubbing

Arxiv

0+阅读 · 2022年7月8日

Sequence tagging for biomedical extractive question answering

Arxiv

0+阅读 · 2022年7月7日

Contrastive Triple Extraction with Generative Transformer

Arxiv

13+阅读 · 2021年2月4日

Subgraph Neural Networks

Arxiv

27+阅读 · 2020年6月19日

Text Generation from Knowledge Graphs with Graph Transformers

Arxiv

35+阅读 · 2019年4月4日

Event Extraction with Generative Adversarial Imitation Learning

Arxiv

13+阅读 · 2018年4月21日

Order-Free RNN with Visual Attention for Multi-Label Classification

Arxiv

16+阅读 · 2017年12月20日

相关基金

极端条件下元素液体（Se和I2）的结构和性质研究

国家自然科学基金

0+阅读 · 2015年12月31日

电热诱导抓捕分析物的高灵敏度复合纳米结构表面增强拉曼散射系统

国家自然科学基金

0+阅读 · 2014年12月31日

水稻膜联蛋白OsANN7调控种子发育和耐盐反应的分子机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

具有曲界面表面流与多孔介质流耦合问题的有限元法

国家自然科学基金

0+阅读 · 2013年12月31日

剧烈塑性变形条件下金属间化合物相变研究

国家自然科学基金

0+阅读 · 2012年12月31日

岩石渗流应力耦合流变力学实验及本构模型研究

国家自然科学基金

0+阅读 · 2011年12月31日

利用固体中含结晶水盐受热产生纳米气泡及气泡破裂与水热合成研究

国家自然科学基金

0+阅读 · 2011年12月31日

陶瓷材料热震失效机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于MSLS插值的无网格流形方法及裂纹扩展模拟研究

国家自然科学基金

0+阅读 · 2009年12月31日

戊型肝炎病毒ORF2蛋白与宿主细胞蛋白的相互作用及其生物学意义

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员