Commit History

Switch to transformers with Qwen2.5-7B-Instruct
15dcc64
verified

Ngixdev committed on

Use pre-built llama-cpp-python wheel for cu124
85cfd66
verified

Ngixdev committed on

Switch to Gradio + ZeroGPU with llama-cpp-python
2dad00a
verified

Ngixdev committed on

Use llama.cpp server with OpenAI-compatible API
d7860c8
verified

Ngixdev committed on

Switch to Docker SDK with CUDA for llama-cpp
31b5080
verified

Ngixdev committed on

Use pre-built llama-cpp-python wheel for faster build
634a67a
verified

Ngixdev committed on

Switch to ZeroGPU with llama-cpp for GGUF model
b4cb8c4
verified

Ngixdev committed on

Initial commit: Qwen3.5-9B API interface
13d1862
verified

Ngixdev committed on

initial commit
c16b401
verified

Ngixdev committed on