Pramit Choudhary's picture

Pramit Choudhary

maverick84

·

https://github.com/pramitchoudhary

AI & ML interests

None yet

Recent Activity

upvoted a collection about 2 hours ago

PaddleOCR-VL-1.5

upvoted a collection about 2 hours ago

liked a model about 2 hours ago

PaddlePaddle/PaddleOCR-VL-1.5

View all activity

Organizations

upvoted 2 collections about 2 hours ago

PaddleOCR-VL-1.5

Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing • 7 items • Updated Mar 6 • 18

PaddleOCR-VL

Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model • 5 items • Updated Feb 11 • 30

upvoted 2 collections 7 days ago

Gemma 4

Gemma 4 is Google's new model family including including E2B, E4B, 26B-A4B, and 31B. • 28 items • Updated 5 days ago • 151

Gemma 3

All versions of Google's new multimodal models including QAT in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 54 items • Updated 5 days ago • 114

upvoted an article 18 days ago

Article

PP-OCRv5 on Hugging Face: A Specialized Approach to OCR

Sep 10, 2025

•

111

upvoted a paper 26 days ago

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Paper • 2603.23483 • Published 27 days ago • 62

upvoted a paper 3 months ago

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published Jan 18 • 204

upvoted an article 5 months ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Jul 9, 2025

•

793

upvoted a collection 6 months ago

Orpheus TTS

TTS Towards Human-Sounding Speech • 2 items • Updated Mar 18, 2025 • 78

upvoted a paper 6 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 513

upvoted 2 articles 9 months ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

+10

Aug 5, 2025

•

513

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

Jul 8, 2025

•

769

upvoted an article 10 months ago

Article

Introducing smolagents: simple agents that write actions in code.

+1

Dec 31, 2024

•

1.19k

upvoted an article 11 months ago

Article

TTS Arena: Benchmarking Text-to-Speech Models in the Wild

+5

Feb 27, 2024

•

72

upvoted a paper 12 months ago

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21, 2025 • 88

upvoted an article about 1 year ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

+2

Mar 12, 2025

•

495

upvoted a paper about 1 year ago

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13, 2025 • 172

upvoted an article about 1 year ago

Article

Welcome to Inference Providers on the Hub 🔥

+5

Jan 28, 2025

•

495

upvoted a paper about 1 year ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19, 2025 • 217

upvoted a collection about 1 year ago

GAIA release

Gather the items of the GAIA release • 4 items • Updated Nov 23, 2023 • 35