Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
293
Jeff
xujfcn
Follow
0 followers
·
3 following
https://crazyrouter.com/
AI & ML interests
None yet
Recent Activity
new
activity
1 day ago
deepseek-ai/DeepSeek-R1:
[Alignment Analysis] R1 hallucinates medical false equivalencies unless strictly constrained (Diabetes vs Psychiatry)
new
activity
1 day ago
mistralai/Mixtral-8x22B-Instruct-v0.1:
Is extreme context size version Mixtral available in the future?
new
activity
1 day ago
NousResearch/Hermes-3-Llama-3.1-8B:
📋 Documentation Enhancement Suggestion
View all activity
Organizations
None yet
xujfcn
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
deepseek-ai/DeepSeek-R1
1 day ago
[Alignment Analysis] R1 hallucinates medical false equivalencies unless strictly constrained (Diabetes vs Psychiatry)
3
#237 opened about 2 months ago by
felps333
New activity in
mistralai/Mixtral-8x22B-Instruct-v0.1
1 day ago
Is extreme context size version Mixtral available in the future?
2
#56 opened over 1 year ago by
StationaryWeaver
New activity in
NousResearch/Hermes-3-Llama-3.1-8B
1 day ago
📋 Documentation Enhancement Suggestion
4
#22 opened 11 days ago by
CroviaTrust
New activity in
deepseek-ai/DeepSeek-R1
1 day ago
Using DeepSeek R1 via API Gateway
3
#244 opened 3 days ago by
xujfcn
New activity in
Qwen/Qwen2.5-Coder-32B-Instruct
1 day ago
IDE Agent Kit v0.1.0 — Let your IDE AI join the team
1
#41 opened 4 days ago by
petruspennanen
New activity in
zai-org/glm-4-9b-chat
1 day ago
Please upgrade to THUDM/glm-4-9b-chat-hf model.
4
#86 opened over 1 year ago by
ZHANGYUXUAN-zR
New activity in
01-ai/Yi-1.5-34B-Chat
1 day ago
Request: DOI
2
#16 opened 12 months ago by
qsegjukdgqegzeghehrthergr
New activity in
google/gemma-2-27b-it
1 day ago
Generate unknown output
👀
1
6
#42 opened about 1 year ago by
raminh921
New activity in
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
1 day ago
Default parameters for the model
1
#67 opened 4 months ago by
skylord
New activity in
microsoft/phi-4
1 day ago
IDE Agent Kit v0.1.0 — Let your IDE AI join the team
2
#56 opened 4 days ago by
petruspennanen
New activity in
meta-llama/Llama-3.3-70B-Instruct
1 day ago
Request: DOI
3
#151 opened 9 days ago by
Extet
New activity in
deepseek-ai/DeepSeek-V3
1 day ago
Production deployment considerations
3
#111 opened 2 months ago by
Cagnicolas
New activity in
Qwen/Qwen2.5-72B-Instruct
1 day ago
Inconsistent Output: First API call differs from subsequent identical calls with temperature=0 on Qwen models
➕
1
2
#33 opened 10 months ago by
ericshijian
New activity in
microsoft/phi-4
3 days ago
Any tips to speed up inference?
3
#35 opened about 1 year ago by
LinoHong
New activity in
Qwen/Qwen2.5-72B-Instruct
3 days ago
model is too busy
➕
7
5
#23 opened over 1 year ago by
XuemeiTang
Inference API Body Structure
4
#28 opened about 1 year ago by
Shivkumar27
New activity in
deepseek-ai/DeepSeek-V3
3 days ago
Benchmark: DeepSeek V3 vs GPT-4o vs Claude for coding tasks
#117 opened 3 days ago by
xujfcn
New activity in
microsoft/phi-4
3 days ago
Create باشگاه فوتبال تراکتورسازی تبریز
2
#32 opened about 1 year ago by
Hadinemati
Rename README.md to ideas de videos que podrían captar la atención y ser tendencia, alineadas con el enfoque de Tech Explora en innovación, ciencia y tecnología
1
#33 opened about 1 year ago by
antusti
Suggested tokenizer changes by Unsloth.ai
1
#36 opened about 1 year ago by
gugarosa
Load more