AVID: Aalto Vocal Intensity Database

Kuvaus

Data description: AVID includes speech and EGG produced by 50 speakers (25 males, 25 females) who varied their vocal intensity in four categories (soft, normal, loud, and very loud). Recordings were conducted using a constant mouth-to-microphone distance and by recording a calibration tone. The speech data was labeled sentence-wise using a total of 19 labels that support the utilisation of the data in ML-based studies of vocal intensity based on supervised learning. Further information can be found in the 'readme.docx' file from the upload. when collected the data: Data is collected in 2021 Citation: P. Alku, M. Kodali, L. Laaksonen, S.R. Kadiri, AVID: A speech database for machine learning studies on vocal intensity, Computer Speech and Language (in review).
Näytä enemmän

Julkaisuvuosi

2023

Aineiston tyyppi

Tekijät

Department of Information and Communications Engineering

Manila Kodali Orcid -palvelun logo - Tekijä

Paavo Alku Orcid -palvelun logo - Tekijä

Sudarsana Reddy Kadiri Orcid -palvelun logo - Tekijä

Zenodo - Julkaisija

Projekti

Muut tiedot

Tieteenalat

Tietojenkäsittely ja informaatiotieteet

Kieli

Saatavuus

Avoin

Lisenssi

Creative Commons Nimeä 4.0 Kansainvälinen (CC BY 4.0)

Avainsanat

Asiasanat

Ajallinen kattavuus

undefined

Liittyvät aineistot