Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
MikeDoesΒ 
posted an update 1 day ago
Post
95
Ai4Privacy has been working on this for the past year. πŸ™

Today we're releasing the PII Masking 2M Series, the world's largest open source privacy masking dataset. (Again. πŸš€πŸš€)

πŸ”’ 2M+ synthetic examples
🌍 32 locales across Europe
🏷️ 98 entity types
πŸ₯πŸ’¬πŸ¦πŸ’ΌπŸ“ 5 industry verticals: Health, Finance, Digital, Work, Location
βœ… 1M+ entries freely available on Hugging Face

Every example is 100% synthetic. No real personal data. Built so you can train and evaluate PII detection models without the legal headaches. πŸ”’

Thank you for 15,000,000+ downloads across our datasets, models, and libraries. This one's for you. ❀️


hashtag#privacy hashtag#ai hashtag#opensource hashtag#nlp hashtag#gdpr hashtag#pii hashtag#huggingface hashtag#machinelearning
In this post