Papers
arxiv:1806.09514

The Emotional Voices Database: Towards Controlling the Emotion Dimension in Voice Generation Systems

Published on Jun 25, 2018
Authors:
,
,
,
,

Abstract

A dataset of emotional speech for synthesis and transformation systems demonstrates effectiveness in a perception test, showing promise for future applications.

AI-generated summary

In this paper, we present a database of emotional speech intended to be open-sourced and used for synthesis and generation purpose. It contains data for male and female actors in English and a male actor in French. The database covers 5 emotion classes so it could be suitable to build synthesis and voice transformation systems with the potential to control the emotional dimension in a continuous way. We show the data's efficiency by building a simple MLP system converting neutral to angry speech style and evaluate it via a CMOS perception test. Even though the system is a very simple one, the test show the efficiency of the data which is promising for future work.

Community

Sign up or log in to comment

Get this paper in your agent:

hf papers read 1806.09514
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/1806.09514 in a model README.md to link it from this page.

Datasets citing this paper 2

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/1806.09514 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.