French PII & De-Identification Collection 33 models for French PII detection & de-identification. 55+ entity types. HIPAA & GDPR compliant. Apache 2.0. • 33 items • Updated about 8 hours ago • 2
Italian PII & De-Identification Collection 33 models for Italian PII detection & de-identification. 55+ entity types. HIPAA & GDPR compliant. Apache 2.0. • 33 items • Updated about 8 hours ago • 1
German PII & De-Identification Collection 33 models for German PII detection & de-identification. 55+ entity types. HIPAA & GDPR compliant. Apache 2.0. • 33 items • Updated about 8 hours ago • 1
Multilingual PII & De-Identification Collection Multilingual models for extracting PII entities and de-identifying clinical text, with support for HIPAA and GDPR compliance. • 113 items • Updated about 8 hours ago • 12
Article From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output 5 days ago • 18
PII & De-Identification Collection Models for extracting PII entities and de-identifying clinical text, with support for HIPAA and GDPR compliance. • 146 items • Updated about 8 hours ago • 32
Nemotron-Pre-Training-Datasets Collection Large scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 8 days ago • 96
Article Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective 17 days ago • 52
gliner2 family Collection GLiNER2 extends the original GLiNER architecture to support multi-task information extraction with a schema-driven interface. This base model provid • 4 items • Updated 2 days ago • 25
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models Paper • 2601.10387 • Published 28 days ago • 12
Article LeRobot goes to driving school: World’s largest open-source self-driving dataset Mar 11, 2025 • 105
Teacher Logits Collection Logits captured from large models to act as the teacher for distillation • 3 items • Updated Dec 15, 2025 • 11