api / backend

Commit History

Add tuned lens as supplementary projection mode for logit lens
6f48db0

gary-boon Claude Opus 4.6 commited on

Rename 'counterfactual' terminology to 'input ablation' in docstrings
0d76811

gary-boon Claude Opus 4.6 commited on

Fix margin winner_token in margin_data and extract _classify_stability helper
ecf49a2

gary-boon Claude Opus 4.6 commited on

Add margin-based decision analysis, interventional counterfactuals, and run comparison (v3.0)
121a2d9

gary-boon Claude Opus 4.6 commited on

Add deep inspection: data-driven pattern classification, attention/MLP tracking, logit lens
54d9b6e

gary-boon Claude Opus 4.6 commited on

Change default model to Devstral and optimise attention extraction
d8d197a

gary-boon Claude Opus 4.6 commited on

Fix MistralTokenizer loading and token section boundaries
76b614d

gary-boon Claude Opus 4.5 commited on

Fix MistralTokenizer not loaded during model switch
2bdf299

gary-boon Claude Opus 4.5 commited on

feat: add top-k/top-p sampling and detailed logits/probability tracking
2c6343b

gary-boon Claude Opus 4.5 commited on

fix: Improve token alternatives numerical stability and temperature control
bc1f0e0

gary-boon Claude Opus 4.5 commited on

Store matrices as numpy arrays instead of Python lists
f2e89c2

gary-boon Claude Opus 4.5 commited on

Add per-step memory cleanup for large model support
a94eb19

gary-boon Claude Opus 4.5 commited on

Fix RAM exhaustion for large token generation
959074d

gary-boon Claude Opus 4.5 commited on

feat: add auto_complete parameter for token generation
bb689ce

gary-boon Claude Opus 4.5 commited on

fix: add QKV extraction support for Mistral/Devstral architecture
d1d37a8

gary-boon Claude Opus 4.5 commited on

feat: implement lazy-loading for attention matrices
929ba88

gary-boon Claude Opus 4.5 commited on

Add avg_entropy calculation for attention heads
66a46b6

gary-boon Claude Opus 4.5 commited on

Revert QKV visualization fixes - need better approach for data streaming
d0b7e29

gary-boon Claude Opus 4.5 commited on

Add safety checks for missing QKV keys
a79cb83

gary-boon Claude Opus 4.5 commited on

Limit QKV matrices to top 5 heads per layer to reduce response size
decb5ab

gary-boon Claude Opus 4.5 commited on

Fix QKV matrix extraction for Mistral/Devstral architecture
9056859

gary-boon Claude Opus 4.5 commited on

Fix QKV visualization for Mistral/Devstral architecture
4ec134b

gary-boon Claude Opus 4.5 commited on

Fix: Import time module at top level for SSE events
15a862b

gary-boon Claude Opus 4.5 commited on

Add SSE streaming endpoint for real-time analysis progress
172a186

gary-boon Claude Opus 4.5 commited on

feat: Include token metadata in analysis response
ee0f6c9

gary-boon Claude Opus 4.5 commited on

feat: Implement tier-based model filtering by device type
6bf9f5c

gary-boon Claude Opus 4.5 commited on

Fix: Add attn_implementation="eager" to model switch function
f94a7ae

gary-boon Claude Opus 4.5 commited on

Add tokenSections boundaries and update system prompt
c6f4cc5

gary-boon Claude Opus 4.5 commited on

Fix: Handle MistralCommonTokenizer pad_token setter
e20ccaf

gary-boon Claude Opus 4.5 commited on

Integrate mistral-common for correct Devstral tokenization
ed06dcb

gary-boon Claude Opus 4.5 commited on

Remove mistral_common to fix dependency conflict
3d9d9ee

gary-boon Claude Opus 4.5 commited on

Use mistral_common for proper Devstral prompt formatting
3e80769

gary-boon Claude Opus 4.5 commited on

Add system prompt support for instruction-tuned models
2860768

gary-boon Claude Opus 4.5 commited on

fix: Simpler prompt format and temperature=0 for Devstral
76020ee

gary-boon Claude Opus 4.5 commited on

fix: Sanitize JSON response for NaN/Inf float values
99f6209

gary-boon Claude Opus 4.5 commited on

fix: Check chat_template is set before using apply_chat_template
474927d

gary-boon Claude Opus 4.5 commited on

fix: Add chat template support for Devstral instruct model
8d85da8

gary-boon Claude Opus 4.5 commited on

fix: Convert bfloat16 to float32 for numpy compatibility
cb6f39c

gary-boon Claude Opus 4.5 commited on

fix: Use eager attention for output_attentions support
5333b21

gary-boon Claude Opus 4.5 commited on

Add vocabSize to modelInfo response
499afba

gary-boon Claude Opus 4.5 commited on

Add recommended_dtype to model configs
62525b2

gary-boon Claude Opus 4.5 commited on

Phase 2: Add Devstral backend support
9080f28

gary-boon Claude Opus 4.5 commited on

Add DEVICE env var to force CPU mode on DGX Spark
5f122aa

gary-boon Claude Opus 4.5 commited on

Make zarr/numcodecs imports optional for ARM64 compatibility
6435a75

gary-boon Claude Opus 4.5 commited on

Phase 1: DGX Spark infrastructure
a2bd186

gary-boon Claude Opus 4.5 commited on

Make QKV hook robust against shape mismatches
343dd57

gary-boon Claude commited on

Fix research attention endpoint model compatibility
f5ba954

gary-boon Claude commited on

Add research attention analysis endpoint with real CodeGen tokenization
8f63685

gary-boon Claude commited on

Add research attention analysis endpoints with Q/K/V extraction
37ed739

gary-boon Claude commited on

Fix ablation study for Code Llama compatibility
cd300ee

gary-boon Claude commited on