Language-based audio retrieval DCASE 2022 evaluation dataset

Description

This is the evaluation dataset for Task 6, Subtask B (Language-based Audio Retrieval) of the DCASE 2022 Challenge. It is intended solely for the purposes of that subtask in the 2022 challenge and is not meant to be used for developing language-based audio retrieval methods. For development, use the development dataset, i.e., the Clotho v2.1 dataset, which is also available on Zenodo at: https://zenodo.org/record/4783391.

== License ==

The audio files in the archive retrieval_audio.7z and the associated metadata in the CSV file retrieval_audio_metadata.csv are under the corresponding licenses of the Freesound [1] platform, stated explicitly in the CSV file for each audio file. That is, each audio file in the archive is listed in the CSV file together with its metadata. The metadata for each file are:

- File name
- Keywords
- URL for the original audio file
- Start and end samples for the excerpt that is used in the dataset
- Uploader/user in the Freesound platform (manufacturer)
- Link to the license of the file

The caption queries in the file retrieval_captions.csv are under the Tampere University license, described in the LICENSE file.

== References ==

[1] Frederic Font, Gerard Roma, and Xavier Serra. 2013. Freesound technical demo. In Proceedings of the 21st ACM international conference on Multimedia (MM '13). ACM, New York, NY, USA, 411-412. DOI: https://doi.org/10.1145/2502081.2502245
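
Because the license differs from one audio file to the next, it can help to tally the licenses recorded in retrieval_audio_metadata.csv before reusing any audio. The following is a minimal sketch, not part of the dataset itself. The function name `license_counts` is hypothetical, and the column header `license` is an assumption: the description lists the metadata fields but not their exact header names, so check the CSV header and pass the actual column name if it differs.

```python
import csv
from collections import Counter
from typing import TextIO


def license_counts(csv_file: TextIO, license_column: str = "license") -> Counter:
    """Count how many rows (audio files) point to each license link.

    `license_column` is an assumed header name; adjust it to match the
    header row of the actual metadata CSV.
    """
    reader = csv.DictReader(csv_file)
    # Fail early with a readable message if the assumed column is absent.
    if reader.fieldnames is None or license_column not in reader.fieldnames:
        raise KeyError(
            f"column {license_column!r} not found; header is {reader.fieldnames}"
        )
    # Rows with a missing value are counted under the empty string.
    return Counter((row[license_column] or "").strip() for row in reader)
```

Assuming the CSV has been downloaded to the working directory, the function could be called as `license_counts(open("retrieval_audio_metadata.csv", newline="", encoding="utf-8"))`. The UTF-8 encoding is also an assumption.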


Year of publication

2022

Type of dataset

Authors

Tampere University

Samuel Lipping - Author

Zenodo - Publisher

Project

Other information

Fields of science

Computer and information sciences

Language

English

Availability

Open

License

Not specified

Keywords

Computer and information sciences

Subject headings

Temporal coverage
