Progress in automatic chord recognition has been slow since the advent of deep learning in the field. To understand why, I conduct experiments on existing methods and test hypotheses enabled by recent developments in generative models. Findings show that chord classifiers perform poorly on rare chords and that pitch augmentation boosts accuracy. Features extracted from generative models do not help and synthetic data presents an exciting avenue for future work. I conclude by improving the interpretability of model outputs with beat detection, reporting some of the best results in the field and providing qualitative analysis. Much work remains to solve automatic chord recognition, but I hope this thesis will chart a path for others to try.
翻译:自深度学习进入该领域以来,自动和弦识别的进展一直较为缓慢。为探究其原因,我对现有方法进行了实验,并基于生成模型的最新发展测试了若干假设。研究发现,和弦分类器在罕见和弦上表现不佳,而音高数据增强能有效提升准确率。从生成模型中提取的特征并无助益,而合成数据则为未来工作提供了一个令人兴奋的方向。最后,我通过结合节拍检测提升了模型输出的可解释性,报告了该领域目前最佳的部分结果,并提供了定性分析。要解决自动和弦识别问题仍有许多工作待完成,但我希望本论文能为后续研究者探索出一条可行的路径。