反车辆地雷:扩展和语义保护防病毒实验室采矿方法 (AVMiner: Expansible and Semantic-Preserving Anti-Virus Labels Mining Method) - 专知论文

会员服务 ·

0

MINE · 知识 (knowledge) · 标注 · 词元分析器 · Analysis ·

2022 年 8 月 30 日

AVMiner: Expansible and Semantic-Preserving Anti-Virus Labels Mining Method

翻译：反车辆地雷:扩展和语义保护防病毒实验室采矿方法

Ligeng Chen,Zhongling He,Hao Wu,Yuhang Gong,Bing Mao

With the increase in the variety and quantity of malware, there is an urgent need to speed up the diagnosis and the analysis of malware. Extracting the malware family-related tokens from AV (Anti-Virus) labels, provided by online anti-virus engines, paves the way for pre-diagnosing the malware. Automatically extract the vital information from AV labels will greatly enhance the detection ability of security enterprises and equip the research ability of security analysts. Recent works like AVCLASS and AVCLASS2 try to extract the attributes of malware from AV labels and establish the taxonomy based on expert knowledge. However, due to the uncertain trend of complicated malicious behaviors, the system needs the following abilities to face the challenge: preserving vital semantics, being expansible, and free from expert knowledge. In this work, we present AVMiner, an expansible malware tagging system that can mine the most vital tokens from AV labels. AVMiner adopts natural language processing techniques and clustering methods to generate a sequence of tokens without expert knowledge ranked by importance. AVMiner can self-update when new samples come. Finally, we evaluate AVMiner on over 8,000 samples from well-known datasets with manually labeled ground truth, which outperforms previous works.

翻译：随着恶意软件的种类和数量的增加,迫切需要加快对恶意软件的诊断和分析。从在线反病毒引擎提供的AV(Anti-Virus)标签上提取与恶意软件有关的家庭标记,为预先诊断恶意软件铺平了道路。自动从AV标签上提取重要信息将大大增强安全企业的检测能力,并装备安全分析员的研究能力。最近的一些工作,如AVLACASS和AVLACASS2, 试图从AV标签上提取恶意软件的属性,并根据专家知识建立分类学。然而,由于复杂的恶意行为的不确定趋势,该系统需要以下能力来应对挑战:保存关键的语义,可以推广,并且没有专家知识。在这项工作中,我们介绍AV标签上最关键符号的防恶意标记系统。AViner采用自然语言处理技术和组合方法,以生成没有专家知识的标志序列,最后,从AVILA样本中进行我们所了解的样本排序。

0

相关内容

MINE

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

抑制Hedgehog信号通路的植物C21甾体化合物的构效关系、结构优化及抗肿瘤作用研究

国家自然科学基金

0+阅读 · 2014年12月31日

PDCD5对多发性骨髓瘤survivin表达的影响及分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

基于1,6-二氮杂萘骨架的新型c-Met激酶小分子抑制剂的发现和结构功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

几种蕨类植物抗早老性痴呆症活性成分的发现及其构效关系研究

国家自然科学基金

0+阅读 · 2012年12月31日

水稻细菌性基腐病菌zeamine毒素基因簇的克隆与功能分析

国家自然科学基金

0+阅读 · 2012年12月31日

Elf3及其天然反义转录本在大肠癌发生中的作用及其机制

国家自然科学基金

0+阅读 · 2012年12月31日

辐射流体不稳定性数值分析及高效数值方法研究

国家自然科学基金

0+阅读 · 2011年12月31日

细胞跨膜糖蛋白EphA2、CD44v5、RECK和c-met基因及表达在食管癌侵袭、转移中的作用研究

国家自然科学基金

0+阅读 · 2011年12月31日

玉簪属植物中胆碱酯酶抑制剂的发现、半合成及构效关系

国家自然科学基金

0+阅读 · 2009年12月31日

靶向抑制Hedgehog/EGFR对胰腺癌的治疗作用及其交叉对话机制研究

国家自然科学基金

0+阅读 · 2009年12月31日

STOP: A dataset for Spoken Task Oriented Semantic Parsing

STOP: A dataset for Spoken Task Oriented Semantic Parsing

Arxiv

0+阅读 · 2022年10月18日

RibSeg v2: A Large-scale Benchmark for Rib Labeling and Anatomical Centerline Extraction

Arxiv

0+阅读 · 2022年10月18日

D2SLAM: Semantic visual SLAM based on the influence of Depth for Dynamic environments

Arxiv

0+阅读 · 2022年10月16日

EventGraph: Event Extraction as Semantic Graph Parsing

Arxiv

0+阅读 · 2022年10月16日

Trailers12k: Evaluating Transfer Learning for Movie Trailer Genre Classification

Arxiv

0+阅读 · 2022年10月14日

Multitask kernel-learning parameter prediction method for solving time-dependent linear systems

Arxiv

0+阅读 · 2022年10月14日

How Does Knowledge Graph Embedding Extrapolate to Unseen Data: a Semantic Evidence View

Arxiv

15+阅读 · 2022年1月5日

Deep Neural Network Based Relation Extraction: An Overview

Arxiv

14+阅读 · 2021年1月6日

Explainable Recommender Systems via Resolving Learning Representations

Arxiv

13+阅读 · 2020年8月21日

Deep Learning in Video Multi-Object Tracking: A Survey

Deep Learning in Video Multi-Object Tracking: A Survey

Arxiv

58+阅读 · 2019年7月31日

VIP会员

文章信息

相关主题

知识 (knowledge)

词元分析器

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】扩展可扩展会话推荐的边界

别想太多：高效 R1 风格大型推理模型综述

【ACMMM2025】EvoVLMA: 进化式视觉-语言模型自适应

智能体网络：用AI智能体编织下一代网络

相关资讯

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

STOP: A dataset for Spoken Task Oriented Semantic Parsing

STOP: A dataset for Spoken Task Oriented Semantic Parsing

Arxiv

0+阅读 · 2022年10月18日

RibSeg v2: A Large-scale Benchmark for Rib Labeling and Anatomical Centerline Extraction

Arxiv

0+阅读 · 2022年10月18日

D2SLAM: Semantic visual SLAM based on the influence of Depth for Dynamic environments

Arxiv

0+阅读 · 2022年10月16日

EventGraph: Event Extraction as Semantic Graph Parsing

Arxiv

0+阅读 · 2022年10月16日

Trailers12k: Evaluating Transfer Learning for Movie Trailer Genre Classification

Arxiv

0+阅读 · 2022年10月14日

Multitask kernel-learning parameter prediction method for solving time-dependent linear systems

Arxiv

0+阅读 · 2022年10月14日

How Does Knowledge Graph Embedding Extrapolate to Unseen Data: a Semantic Evidence View

Arxiv

15+阅读 · 2022年1月5日

Deep Neural Network Based Relation Extraction: An Overview

Arxiv

14+阅读 · 2021年1月6日

Explainable Recommender Systems via Resolving Learning Representations

Arxiv

13+阅读 · 2020年8月21日

Deep Learning in Video Multi-Object Tracking: A Survey

Deep Learning in Video Multi-Object Tracking: A Survey

Arxiv

58+阅读 · 2019年7月31日

相关基金

抑制Hedgehog信号通路的植物C21甾体化合物的构效关系、结构优化及抗肿瘤作用研究

国家自然科学基金

0+阅读 · 2014年12月31日

PDCD5对多发性骨髓瘤survivin表达的影响及分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

基于1,6-二氮杂萘骨架的新型c-Met激酶小分子抑制剂的发现和结构功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

几种蕨类植物抗早老性痴呆症活性成分的发现及其构效关系研究

国家自然科学基金

0+阅读 · 2012年12月31日

水稻细菌性基腐病菌zeamine毒素基因簇的克隆与功能分析

国家自然科学基金

0+阅读 · 2012年12月31日

Elf3及其天然反义转录本在大肠癌发生中的作用及其机制

国家自然科学基金

0+阅读 · 2012年12月31日

辐射流体不稳定性数值分析及高效数值方法研究

国家自然科学基金

0+阅读 · 2011年12月31日

细胞跨膜糖蛋白EphA2、CD44v5、RECK和c-met基因及表达在食管癌侵袭、转移中的作用研究

国家自然科学基金

0+阅读 · 2011年12月31日

玉簪属植物中胆碱酯酶抑制剂的发现、半合成及构效关系

国家自然科学基金

0+阅读 · 2009年12月31日

靶向抑制Hedgehog/EGFR对胰腺癌的治疗作用及其交叉对话机制研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员