firas snake
abol3z
AI & ML interests
None yet
Recent Activity
liked
a model 2 months ago
asafaya/bert-base-arabic liked
a model 2 months ago
TomoroAI/tomoro-colqwen3-embed-4b commented on an article 4 months ago
Supercharge your OCR Pipelines with Open Models Organizations
None yet
Fill-Mask • 0.1B • Updated
• 11.9k • • 40
TomoroAI/tomoro-colqwen3-embed-4b
Visual Document Retrieval • 4B • Updated
• 21.2k • 23
commented on Supercharge your OCR Pipelines with Open Models 4 months ago
@doladoo yes. I tried Paddle, Miner, Marker, OlmOCR, Chandra-OCR, Docling without VL.
Overall for Arabic, VLM approach showed better performance, and the best was OlmOCR.
Note that my documents are mostly scanned text and tables, nothing more.
commented on Supercharge your OCR Pipelines with Open Models 4 months ago
commented on Supercharge your OCR Pipelines with Open Models 4 months ago
If only this came last week! I spent the last week learning about about and benchmarking all these plus extra models, and I wanna point out a correction. OlmOCR isn't an English language only model, in fact, it produced the best results across all VLM and none VLM frameworks on my Arabic language corpus.
upvoted an article 4 months ago
Article
Supercharge your OCR Pipelines with Open Models
- +5
•
306
upvoted a paper 7 months ago
upvoted a paper 10 months ago
upvoted an article 12 months ago
Article
Open-Source Handwritten Signature Detection Model
•
120