emotion tags not working
I tried adding emotion tags that described in repo also added those in tags but still now worked, model just ignored that and synthesized speech. anyone knows why that happened or its my mistake?
this problem is happening to me too! i thought it was only me at first.
same here brother like i tried to pass in different different way( laugh, laughs,laughing inside <> )but no one did not worked.
Thanks for reporting this, and sorry for the confusion.
You are right that the current expression tag support is still limited. We added tags such as <laugh>, <breath>, and <sigh> because many users asked for non-verbal and expressive sounds after the previous release, and we did include tagged examples in the training data. However, the tagged data is currently available mostly for Korean, English, and Japanese, and even within those languages the consistency of the labels is not yet as strong as we would like.
Because of that, the model may sometimes ignore the tag or simply synthesize the surrounding text normally. This is a known limitation of the current Supertonic 3 release, and we should have communicated it more clearly. Sorry about that.
A few things may help in the current version:
- Tags tend to work better in Korean, English, and Japanese.
- Tags often work better when placed at the beginning or end of the sentence.
- If a single tag is ignored, repeating it two or three times can sometimes make the expression more likely to appear.
For example, you may get better results with inputs like:
<laugh> <laugh> That was not what I expected. <laugh> <laugh>
We are planning to improve this with additional synthetic data and supervised fine-tuning, so expression tags should become more reliable in future updates.
Thanks again for trying the model and pointing this out. We appreciate the feedback, and weโll share updates when this improves.