Sinapsis AI

community

https://sinapsisai.cloud

Activity Feed

AI & ML interests

Agentic Platform Completely Open Source, Three Phases: - Agentic - Agents - Tools

Recent Activity

JorgeAV updated a collection 6 days ago

Models (Audio Classification)

JorgeAV updated a collection 6 days ago

Models (Audio Classification)

JorgeAV updated a collection 6 days ago

Models (Audio Classification)

View all activity

SinapsisAI 's collections 13

Models (Text-to-Speech)

Best open-source Text-to-Speech (TTS) models — SOTA neural voice synthesis, zero-shot cloning, multilingual & expressive speech generation.

hexgrad/Kokoro-82M

Text-to-Speech • Updated Apr 10, 2025 • 9.28M • • 5.84k
OpenMOSS-Team/MOSS-TTS

Text-to-Speech • 8B • Updated 6 days ago • 88.8k • 352
elbruno/Qwen3-TTS-12Hz-0.6B-Base-ONNX

Text-to-Speech • Updated Feb 23 • 8
coqui/XTTS-v2

Text-to-Speech • Updated Dec 11, 2023 • 6.28M • 3.44k

Models (Audio Classification)

Best Open Source models for Audio Classification (emotion, music genre, language ID, etc.)

MIT/ast-finetuned-audioset-10-10-0.4593

Audio Classification • 86.6M • Updated Sep 6, 2023 • 913k • 348
speechbrain/emotion-recognition-wav2vec2-IEMOCAP

Audio Classification • Updated Jul 23, 2024 • 550k • 183
laion/clap-htsat-fused

Audio Classification • 0.2B • Updated Jan 12 • 26.2M • 65
m-a-p/MERT-v1-330M

Audio Classification • Updated May 25, 2025 • 31.2k • 83

Embeddings

BAAI/bge-m3

Sentence Similarity • Updated Jul 3, 2024 • 15.3M • • 2.85k
sentence-transformers/all-MiniLM-L6-v2

Sentence Similarity • 22.7M • Updated Mar 6, 2025 • 207M • • 4.61k
google/embeddinggemma-300m

Sentence Similarity • 0.3B • Updated Sep 25, 2025 • 1.58M • • 1.55k
Qwen/Qwen3-Embedding-8B

Feature Extraction • 8B • Updated Jul 7, 2025 • 1.58M • • 622

Models (Diffusion I2I)

Image to Image

black-forest-labs/FLUX.2-dev

Image-to-Image • Updated Feb 17 • 990k • • 1.47k
Qwen/Qwen-Image-Edit-2509

Image-to-Image • Updated Sep 22, 2025 • 220k • • 1.09k

Models (Diffusion I2V)

Image To Video

artificialguybr/360Rotation-Redmond-WAN2-I2V-14B

Image-to-Video • Updated Dec 12, 2025 • 226 • • 2

Models (Text Generation Instruct)

zai-org/GLM-5

Text Generation • 754B • Updated 2 days ago • 167k • • 1.87k

Models (Vision)

Qwen/Qwen-Image-Edit

Image-to-Image • Updated Aug 25, 2025 • 66.4k • • 2.35k

Models (Text-to-Audio)

Music & Sound Generation - Best Open Source models (MusicGen, Stable Audio, etc.)

stabilityai/stable-audio-open-1.0

Text-to-Audio • Updated Jun 19, 2025 • 30.8k • 1.43k
facebook/musicgen-large

Text-to-Audio • Updated Nov 17, 2023 • 22.7k • 526
ACE-Step/ACE-Step-v1-3.5B

Text-to-Audio • Updated May 22, 2025 • 722
facebook/musicgen-melody

Text-to-Audio • 2B • Updated Apr 24, 2024 • 5.14k • 251

STT (Speech To Text)

Speech to Text (ASR) - Best Open Source models

openai/whisper-large-v3

Automatic Speech Recognition • Updated Aug 12, 2024 • 4.92M • • 5.51k
openai/whisper-large-v3-turbo

Automatic Speech Recognition • Updated Oct 4, 2024 • 5.01M • • 2.87k
nvidia/parakeet-tdt-0.6b-v2

Automatic Speech Recognition • Updated Nov 27, 2025 • 162k • 1.44k
facebook/seamless-m4t-v2-large

Automatic Speech Recognition • 2B • Updated Jan 4, 2024 • 84.6k • 965

Models (Diffusion T2I)

Text to Image

Tongyi-MAI/Z-Image-Turbo

Text-to-Image • Updated Jan 30 • 757k • • 4.33k
dx8152/Qwen-Edit-2509-Multiple-angles

Image-to-Image • Updated Nov 28, 2025 • 79.4k • • 919
black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27, 2025 • 766k • • 12.5k
stabilityai/stable-diffusion-xl-base-1.0

Text-to-Image • Updated Oct 30, 2023 • 2.09M • • 7.56k

Models (Diffusion T2V)

Text to Video

tencent/HunyuanVideo-1.5

Text-to-Video • Updated Dec 25, 2025 • 695 • • 585
meituan-longcat/LongCat-Video

Text-to-Video • Updated Oct 29, 2025 • 718 • • 450
akhaliq/veo3.1-fast

Text-to-Video • Updated Oct 15, 2025 • • 21
akhaliq/sora-2

Text-to-Video • Updated Oct 14, 2025 • • 29

Models (Text Generation Thinking)

The idea of this Collection is to gather those interesting models that are Open Source and I can use them in the webpage

moonshotai/Kimi-K2-Thinking

Text Generation • 1.1T • Updated Jan 30 • 59.8k • • 1.68k
meta-llama/Llama-3.1-8B-Instruct

Text Generation • 8B • Updated Sep 25, 2024 • 8.46M • • 5.61k
allenai/Olmo-3-32B-Think

Text Generation • 1.05M • Updated Jan 5 • 5.39k • 169
allenai/Olmo-3-7B-Instruct

Text Generation • 528k • Updated Jan 5 • 96.6k • • 123

Models (VLMs)

Qwen/Qwen3-VL-235B-A22B-Thinking

Image-Text-to-Text • 236B • Updated Nov 26, 2025 • 319k • • 385
Qwen/Qwen3-235B-A22B-Instruct-2507

Text Generation • Updated Sep 17, 2025 • 183k • • 769
Qwen/Qwen-Image-Edit-2509

Image-to-Image • Updated Sep 22, 2025 • 220k • • 1.09k
Qwen/Qwen3-VL-8B-Instruct

Image-Text-to-Text • 9B • Updated Oct 15, 2025 • 6.96M • • 834

Models (Text-to-Speech)

Best open-source Text-to-Speech (TTS) models — SOTA neural voice synthesis, zero-shot cloning, multilingual & expressive speech generation.

hexgrad/Kokoro-82M

Text-to-Speech • Updated Apr 10, 2025 • 9.28M • • 5.84k
OpenMOSS-Team/MOSS-TTS

Text-to-Speech • 8B • Updated 6 days ago • 88.8k • 352
elbruno/Qwen3-TTS-12Hz-0.6B-Base-ONNX

Text-to-Speech • Updated Feb 23 • 8
coqui/XTTS-v2

Text-to-Speech • Updated Dec 11, 2023 • 6.28M • 3.44k

Models (Text-to-Audio)

Music & Sound Generation - Best Open Source models (MusicGen, Stable Audio, etc.)

stabilityai/stable-audio-open-1.0

Text-to-Audio • Updated Jun 19, 2025 • 30.8k • 1.43k
facebook/musicgen-large

Text-to-Audio • Updated Nov 17, 2023 • 22.7k • 526
ACE-Step/ACE-Step-v1-3.5B

Text-to-Audio • Updated May 22, 2025 • 722
facebook/musicgen-melody

Text-to-Audio • 2B • Updated Apr 24, 2024 • 5.14k • 251

Models (Audio Classification)

Best Open Source models for Audio Classification (emotion, music genre, language ID, etc.)

MIT/ast-finetuned-audioset-10-10-0.4593

Audio Classification • 86.6M • Updated Sep 6, 2023 • 913k • 348
speechbrain/emotion-recognition-wav2vec2-IEMOCAP

Audio Classification • Updated Jul 23, 2024 • 550k • 183
laion/clap-htsat-fused

Audio Classification • 0.2B • Updated Jan 12 • 26.2M • 65
m-a-p/MERT-v1-330M

Audio Classification • Updated May 25, 2025 • 31.2k • 83

STT (Speech To Text)

Speech to Text (ASR) - Best Open Source models

openai/whisper-large-v3

Automatic Speech Recognition • Updated Aug 12, 2024 • 4.92M • • 5.51k
openai/whisper-large-v3-turbo

Automatic Speech Recognition • Updated Oct 4, 2024 • 5.01M • • 2.87k
nvidia/parakeet-tdt-0.6b-v2

Automatic Speech Recognition • Updated Nov 27, 2025 • 162k • 1.44k
facebook/seamless-m4t-v2-large

Automatic Speech Recognition • 2B • Updated Jan 4, 2024 • 84.6k • 965

Embeddings

BAAI/bge-m3

Sentence Similarity • Updated Jul 3, 2024 • 15.3M • • 2.85k
sentence-transformers/all-MiniLM-L6-v2

Sentence Similarity • 22.7M • Updated Mar 6, 2025 • 207M • • 4.61k
google/embeddinggemma-300m

Sentence Similarity • 0.3B • Updated Sep 25, 2025 • 1.58M • • 1.55k
Qwen/Qwen3-Embedding-8B

Feature Extraction • 8B • Updated Jul 7, 2025 • 1.58M • • 622

Models (Diffusion T2I)

Text to Image

Tongyi-MAI/Z-Image-Turbo

Text-to-Image • Updated Jan 30 • 757k • • 4.33k
dx8152/Qwen-Edit-2509-Multiple-angles

Image-to-Image • Updated Nov 28, 2025 • 79.4k • • 919
black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27, 2025 • 766k • • 12.5k
stabilityai/stable-diffusion-xl-base-1.0

Text-to-Image • Updated Oct 30, 2023 • 2.09M • • 7.56k

Models (Diffusion I2I)

Image to Image

black-forest-labs/FLUX.2-dev

Image-to-Image • Updated Feb 17 • 990k • • 1.47k
Qwen/Qwen-Image-Edit-2509

Image-to-Image • Updated Sep 22, 2025 • 220k • • 1.09k

Models (Diffusion T2V)

Text to Video

tencent/HunyuanVideo-1.5

Text-to-Video • Updated Dec 25, 2025 • 695 • • 585
meituan-longcat/LongCat-Video

Text-to-Video • Updated Oct 29, 2025 • 718 • • 450
akhaliq/veo3.1-fast

Text-to-Video • Updated Oct 15, 2025 • • 21
akhaliq/sora-2

Text-to-Video • Updated Oct 14, 2025 • • 29

Models (Diffusion I2V)

Image To Video

artificialguybr/360Rotation-Redmond-WAN2-I2V-14B

Image-to-Video • Updated Dec 12, 2025 • 226 • • 2

Models (Text Generation Thinking)

The idea of this Collection is to gather those interesting models that are Open Source and I can use them in the webpage

moonshotai/Kimi-K2-Thinking

Text Generation • 1.1T • Updated Jan 30 • 59.8k • • 1.68k
meta-llama/Llama-3.1-8B-Instruct

Text Generation • 8B • Updated Sep 25, 2024 • 8.46M • • 5.61k
allenai/Olmo-3-32B-Think

Text Generation • 1.05M • Updated Jan 5 • 5.39k • 169
allenai/Olmo-3-7B-Instruct

Text Generation • 528k • Updated Jan 5 • 96.6k • • 123

Models (Text Generation Instruct)

zai-org/GLM-5

Text Generation • 754B • Updated 2 days ago • 167k • • 1.87k

Models (VLMs)

Qwen/Qwen3-VL-235B-A22B-Thinking

Image-Text-to-Text • 236B • Updated Nov 26, 2025 • 319k • • 385
Qwen/Qwen3-235B-A22B-Instruct-2507

Text Generation • Updated Sep 17, 2025 • 183k • • 769
Qwen/Qwen-Image-Edit-2509

Image-to-Image • Updated Sep 22, 2025 • 220k • • 1.09k
Qwen/Qwen3-VL-8B-Instruct

Image-Text-to-Text • 9B • Updated Oct 15, 2025 • 6.96M • • 834

Models (Vision)

Qwen/Qwen-Image-Edit

Image-to-Image • Updated Aug 25, 2025 • 66.4k • • 2.35k

AI & ML interests

Recent Activity

Team members 1

SinapsisAI 's collections 13