This study examines the categories of harm associated with Large Language Models (LLMs) in the field of artificial intelligence. It organizes harms arising before, during, and after the development of AI applications into four categories: pre-development harms, direct output harms, misuse and malicious application, and downstream application harms. It underscores the need to define the risks of the current landscape so that accountability and transparency can be ensured, and bias navigated, when adapting LLMs for practical applications. It then proposes mitigation strategies and future directions for specific domains, together with a dynamic auditing system, as a standardized proposal guiding the responsible development and integration of LLMs.