SynthSOD: Developing an Heterogeneous Dataset for Orchestra Music Source Separation

Kuvaus

The SynthSOD dataset contains more than 47 hours of multitrack music obtained by synthesizing orchestra and ensemble pieces from the Symbolic Orchestral Database (SOD) using Spitfire BBC Symphony Orchestra Professional Library. To synthesize the MIDI files from the SOD, we needed to fix the original files into the General MIDI standard, select a subsect of files that fitted into our requirements (e.g., containing only instruments that we could synthesize), and develop a new system to generate musically-motivated random annotations about tempo, dynamic, and articulation. The code to replicate this process is available in our repository and all the details can be read in our paper: https://doi.org/10.48550/arXiv.2409.10995 We have also published the code to train and evaluate the baseline and the pre-trained models in a GitHub repository: https://github.com/repertorium/SynthSOD-Baseline
Näytä enemmän

Julkaisuvuosi

2024

Aineiston tyyppi

Tekijät

Archontis Politis - Tekijä

David Diaz-Guerra - Tekijä

Tuomas Virtanen - Tekijä

Tuntematon organisaatio

J. J. Carabias-Orti - Tekijä

Jaime García-Martínez - Tekijä

Pedro Vera-Candeas - Tekijä

Zenodo - Julkaisija

Projekti

Muut tiedot

Tieteenalat

Tietojenkäsittely ja informaatiotieteet

Kieli

englanti

Saatavuus

Avoin

Lisenssi

Creative Commons Nimeä 4.0 Kansainvälinen (CC BY 4.0)

Avainsanat

Audio and Speech Processing, Symphony Orchestra

Asiasanat

Ajallinen kattavuus

undefined

Liittyvät aineistot