【导读】音频文本生成是一个新颖而令人兴奋的研究方向,着眼于自动生成常规音频的文本描述。本文整理了音频文本生成的论文列表。
The SJTU Submission for DCASE2020 Task 6: A CRNN-GRU Based Reinforcement Learning Approach to Audiocaption
Audio Captioning Based on Transformer and Pre-Training for 2020 DCASE Audio Captioning Challenge
Automatic Audio Captioning System Based on Convolutional Neural Network
Automated Audio Captioning With Temporal Attention
Audio Captioning With the Transformer
Automated Audio Captioning
IRIT-UPS DCASE 2020 audio captioning system
Task 6 DCASE 2020: Listen Carefully and Tell: An Audio Captioning System Based on Residual Learning and Gammatone Audio Representation
Automated Audio Captioning
The NTT DCASE2020 Challenge Task 6 System: Automated Audio Captioning With Keywords and Sentence Length Estimation
Audio Captioning using Gated Recurrent Units
Clotho: An Audio Captioning Dataset
Crowdsourcing a Dataset of Audio Captions
Neural Audio Captioning Based On Conditional Sequence-to-Sequence Model
AudioCaps: Generating captions for audios in the wild
Audio caption: Listen and tell
Automated Audio Captioning with Recurrent Neural Networks