WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research
Propose a three-stage processing pipeline for filtering noisy data and generating high-quality captions, where ChatGPT.
Конвейер обработки для фильтрации зашумленных данных и создания высококачественных титров.
Github: https://github.com/xinhaomei/wavcaps
Paper: https://arxiv.org/abs/2303.17395v1
Dataset: https://paperswithcode.com/dataset/sounddescs
ai_machinelearning_big_data