Pedro Ortiz Suarez
pjox
17 followers · 21 following
https://portizs.eu/
pjox13
pjox
pjox
pjox.bsky.social
AI & ML interests
Language modeling, parsing, sequence tagging, NER, historical languages.
Recent Activity
published a dataset 6 days ago: commoncrawl/CommonLID
updated a dataset 6 days ago: commoncrawl/CommonLID
authored a paper 18 days ago: SciLaD: A Large-Scale, Transparent, Reproducible Dataset for Natural Scientific Language Processing
Organizations
pjox's datasets (2)
pjox/tmp4c-index • Viewer • Updated Aug 15, 2025 • 37.5M • 1
pjox/tmp4c-simple-index • Viewer • Updated Jul 19, 2024 • 34.8M • 2