3 11 22

lulavc PRO

lulavc

AI & ML interests

None yet

Recent Activity

liked a Space 22 days ago

openai/README

upvoted an article about 2 months ago

On the Shifting Global Compute Landscape

upvoted a collection about 2 months ago

HY-MT1.5

View all activity

Organizations

liked a Space 22 days ago

README

🦀

upvoted an article about 2 months ago

Article

On the Shifting Global Compute Landscape

Oct 29, 2025

•

upvoted 6 collections about 2 months ago

New activity in huggingface/openapi about 2 months ago

Missing list APIs for models, datasets and spaces

#5 opened 2 months ago by

pengqun

liked 2 Spaces about 2 months ago

Test

🖼

stet

HuggingDiscussions

🏢

Discuss and provide feedback on Hugging Face Hub features

liked a model about 2 months ago

openai/circuit-sparsity

Text Generation • 0.4B • Updated Dec 12, 2025 • 633 • 202

liked a Space about 2 months ago

OpenAPI

🦀

Hub API Documentation

liked a model about 2 months ago

Qwen/Qwen3-Next-80B-A3B-Thinking-GGUF

Text Generation • 80B • Updated Dec 3, 2025 • 2.36k • 29

upvoted a collection about 2 months ago

Qwen3

Collection

84 items • Updated Dec 31, 2025 • 1.68k

liked a Space 2 months ago

VideoCoF

🎥

Unified Video Editing with Temporal Reasoner

reacted to sergiopaniego's post with 🤗 3 months ago

Post

2302

We just released TRL v0.26.0!

It comes packed with updates:
> Agent training with tools in GRPO
> New CISPO & SAPO losses + reasoning rewards
> vLLM quantization in colocate mode
> Dataset shuffling in SFT
> Lots of NEW examples
> Tons of fixes and documentation improvements

3 replies

reacted to melvindave's post with 🚀 3 months ago

Post

2585

Currently having a blast learning the transformers library.

I noticed that model cards usually have Transformers code as usage examples.

So I tried to figure out how to load a model just using the transformers library without using ollama, lmstudio, or llamacpp.

Learned how to install dependencies required to make it work like pytorch and CUDA. I also used Conda for python environment dependencies.

Once I got the model loaded and sample inference working, I made an API to serve it.

I know it's very basic stuff for machine learning experts here in HF but I'm completely new to this so I'm happy to get it working!

Model used: Qwen/Qwen3-VL-8B-Instruct
GPU: NVIDIA GeForce RTX 3090

Here's the result of my experimentation