microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated Dec 10, 2025 • 335k • 1.58k
WavJourney: Compositional Audio Creation with Large Language Models Paper • 2307.14335 • Published Jul 26, 2023 • 45