Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Ran's picture
2 9 35

Ran

Ran-Mewo
theminji's profile picture Mi6paulino's profile picture 21world's profile picture
·
  • Ran-Mewo

AI & ML interests

None yet

Organizations

Hugging Face Discord Community's profile picture

upvoted 2 articles 4 months ago
view article
Article

SmolVLM2: Bringing Video Understanding to Every Device

  • +5
Feb 20, 2025
•
337
view article
Article

LLM based Audio models

Dec 18, 2025
•
58
upvoted 3 papers 7 months ago

TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion

Paper • 2303.09057 • Published Mar 16, 2023 • 3

Voice Separation with an Unknown Number of Multiple Speakers

Paper • 2003.01531 • Published Feb 29, 2020 • 3

MulliVC: Multi-lingual Voice Conversion With Cycle Consistency

Paper • 2408.04708 • Published Aug 8, 2024 • 8
upvoted 2 papers 9 months ago

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Paper • 2408.16532 • Published Aug 29, 2024 • 50

WavReward: Spoken Dialogue Models With Generalist Reward Evaluators

Paper • 2505.09558 • Published May 14, 2025 • 10
upvoted 2 papers over 1 year ago

MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation

Paper • 2412.03558 • Published Dec 4, 2024 • 20

MV-Adapter: Multi-view Consistent Image Generation Made Easy

Paper • 2412.03632 • Published Dec 4, 2024 • 24
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs