Stanisław Szymczyk

sszymczyk

AI & ML interests

None yet

Recent Activity

updated a model 5 days ago

sszymczyk/DeepSeek-V3.2-Speciale-light-GGUF

published a model 5 days ago

sszymczyk/DeepSeek-V3.2-Speciale-light-GGUF

updated a model 5 days ago

sszymczyk/DeepSeek-V3.2-Speciale-nolight-GGUF

Organizations

None yet

New activity in deepseek-ai/DeepSeek-V3.2 11 days ago

Running the model with a dense attention

#35 opened 3 months ago by

New activity in ubergarm/Qwen3.5-397B-A17B-GGUF 21 days ago

Quick Start section in README.md is a bit misleading

#10 opened 21 days ago by

New activity in internlm/Intern-S1-Pro about 2 months ago

Temperature and top_p values are swapped in the example code:

#2 opened about 2 months ago by

New activity in stepfun-ai/Step-3.5-Flash about 2 months ago

Recommended sampling parameters?

#3 opened about 2 months ago by

New activity in zai-org/GLM-4.7-Flash about 2 months ago

Problems with logical reasoning performance of GLM-4.7-Flash

#35 opened about 2 months ago by

Recommended sampling parameters

#6 opened 2 months ago by

New activity in sszymczyk/DeepSeek-V3.2-nolight-GGUF 2 months ago

Thinking mode

#2 opened 2 months ago by

Feedback

#1 opened 2 months ago by

New activity in inclusionAI/Ring-1T 4 months ago

Responses of Ring-1T available on zenmux.ai often end abruptly after 14-16k tokens without generating complete answer

#13 opened 4 months ago by

New activity in allenai/ZebraLogic 11 months ago

Please test the QwQ-32B-Preview model

#3 opened over 1 year ago by

New activity in perplexity-ai/r1-1776 about 1 year ago

This model performs worse in complex problems compared to the DeepSeek R1

#254 opened about 1 year ago by

New activity in MiniMaxAI/MiniMax-Text-01 about 1 year ago

Requesting Support for GGUF Quantization of MiniMax-Text-01 through llama.cpp

#1 opened about 1 year ago by Doctor-Chad-PhD

New activity in RUC-AIBOX/Virgo-72B about 1 year ago

Missing tokenizer.json and tokenizer_config.json files

#2 opened about 1 year ago by

Please add the "tokenizer.model" file

#3 opened about 1 year ago by

New activity in Qwen/QwQ-32B-preview over 1 year ago

Hardware Requirements

#1 opened over 1 year ago by

New activity in AIDC-AI/Marco-o1 over 1 year ago

Can you provide code for inference with MCTS?

#3 opened over 1 year ago by

New activity in allenai/Llama-3.1-Tulu-3-70B over 1 year ago

Reason behind not using special tokens in the prompt format?

#2 opened over 1 year ago by

New activity in mistralai/Mistral-Large-Instruct-2411 over 1 year ago

The curse of the Consolidated Safetensors strikes again...

#4 opened over 1 year ago by

New activity in meta-llama/Llama-3.1-8B-Instruct over 1 year ago

The model often enters infinite generation loops

#32 opened over 1 year ago by

New activity in nvidia/Nemotron-4-340B-Instruct over 1 year ago

Gguf

#5 opened almost 2 years ago by