Efficient Approximate Search for Sets of Vectors - 专知论文

August 30, 2021

Efficient Approximate Search for Sets of Vectors

Michael Leybovich, Oded Shmueli

From arXiv, 8 pages, 1 figure

We consider a similarity measure between two sets of vectors, $A$ and $B$, that balances the average and maximum cosine distance between pairs of vectors, one from $A$ and one from $B$. As motivation for this measure, we present lineage tracking in a database. To realize this measure in practice, we need an approximate search algorithm that, given a set of vectors $A$ and sets of vectors $B_1,\ldots,B_n$, quickly locates the set $B_i$ that maximizes the similarity measure. For the case where all sets are singletons, i.e., each is essentially a single vector, efficient approximate search algorithms are known, e.g., approximate versions of tree search algorithms, locality-sensitive hashing (LSH), vector quantization (VQ), and proximity graph algorithms. In this work, we present approximate search algorithms for the general case. The underlying idea in these algorithms is to encode a set of vectors as a single "long" vector. The proposed approximate approach achieves significant performance gains over an optimized, exact search on vector sets.
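To make the search problem concrete, here is a minimal Python sketch of a set-to-set similarity and the exact (brute-force) search that an approximate method would be compared against. The abstract does not give the formula, so the blend `(w_max * max + w_avg * mean) / (w_max + w_avg)` over pairwise cosine similarities, the default weights, and all function names are assumptions made for illustration. The last helper shows a standard linear-algebra fact: the mean pairwise cosine similarity equals the dot product of the two sets' mean unit vectors, so the average term alone reduces to a single-vector comparison. This is not claimed to be the authors' "long" vector encoding, and it does not cover the maximum term.

```python
import math


def unit(v):
    """Scale a vector to unit L2 norm."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(u, v))


def set_similarity(a, b, w_max=0.5, w_avg=0.5):
    """Blend of the maximum and mean pairwise cosine similarity.

    Assumed form (not taken from the paper):
    (w_max * max + w_avg * mean) / (w_max + w_avg).
    """
    a_unit = [unit(v) for v in a]
    b_unit = [unit(v) for v in b]
    pairwise = [dot(u, v) for u in a_unit for v in b_unit]
    mean = sum(pairwise) / len(pairwise)
    return (w_max * max(pairwise) + w_avg * mean) / (w_max + w_avg)


def exact_search(a, candidates):
    """Brute-force baseline: score every B_i, return the best index and all scores."""
    scores = [set_similarity(a, b) for b in candidates]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores


def mean_unit_vector(vectors):
    """Encode a set as the component-wise mean of its unit vectors."""
    units = [unit(v) for v in vectors]
    return [sum(col) / len(units) for col in zip(*units)]


# Hypothetical toy data: one query set A and three candidate sets B_1..B_3.
A = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
Bs = [
    [[2.0, 0.0, 0.0], [0.0, 3.0, 0.0]],  # same directions as A
    [[0.0, 0.0, 1.0]],                   # orthogonal to A
    [[1.0, 1.0, 0.0]],                   # halfway between A's vectors
]

best, scores = exact_search(A, Bs)
avg_via_means = dot(mean_unit_vector(A), mean_unit_vector(Bs[0]))
print(best, scores, avg_via_means)
```

On this toy data the first candidate wins with a score of 0.75, ahead of roughly 0.707 for the third. The exact search costs one pairwise comparison per vector pair per candidate set, which is the cost an approximate index over single encoded vectors aims to avoid.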
