oto

Team

company

https://www.oto.earth/

otodotearth

otoearth

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

consome2 published an article 9 days ago

Speech-to-Speech AI: A Primer

leq6c published a dataset 21 days ago

otoearth/otoSpeech-HQ-full-duplex-samples

leq6c updated a dataset 21 days ago

otoearth/otoSpeech-HQ-full-duplex-samples

View all activity

Articles

Speech-to-Speech AI: A Primer

9 days ago

•

consome2

published an article 9 days ago

Article

Speech-to-Speech AI: A Primer

9 days ago

•

consome2

posted an update 9 days ago

Post

3221

Built a small site for tracking speech-to-speech, full-duplex, and audio foundation model work.
It covers models, benchmarks, datasets, and some blog posts to organize the landscape in one place.

Still early, but sharing in case it is useful:
https://www.fullduplex.ai/

If you spot missing entries or mistakes, I would really appreciate corrections.

2 replies

leq6c

published a dataset 21 days ago

otoearth/otoSpeech-HQ-full-duplex-samples

Viewer • Updated 21 days ago • 1 • 14 • 1

leq6c

updated a dataset 21 days ago

otoearth/otoSpeech-HQ-full-duplex-samples

Viewer • Updated 21 days ago • 1 • 14 • 1

leq6c

updated 2 datasets 3 months ago

otoearth/otoSpeech-full-duplex-processed-141h

Preview • Updated Feb 6 • 204 • 25

otoearth/otoSpeech-full-duplex-280h

Preview • Updated Feb 6 • 82 • 10

consome2

posted an update 3 months ago

Post

5289

We’ve released two conversational speech datasets from oto on Hugging Face 🤗
Both are based on real, casual, full-duplex conversations, but with slightly different focuses.

Dataset 1: Processed / curated subset
otoearth/otoSpeech-full-duplex-processed-141h
* Full-duplex, spontaneous multi-speaker conversations
* Participants filtered for high audio quality
* PII removal and audio enhancement applied
* Designed for training and benchmarking S2S or dialogue models

Dataset 2: Larger raw(er) release
otoearth/otoSpeech-full-duplex-280h
* Same collection pipeline, with broader coverage
* More diversity in speakers, accents, and conversation styles
* Useful for analysis, filtering, or custom preprocessing experiments

We intentionally split the release to support different research workflows:
clean and ready-to-use vs. more exploratory and research-oriented use.

The datasets are currently private, but we’re happy to approve access requests — feel free to request access if you’re interested.

If you’re working on speech-to-speech (S2S) models or are curious about full-duplex conversational data, we’d love to discuss and exchange ideas together.

Feedback and ideas are very welcome!