AI & ML interests

Benchmarking LLMs on Persian language tasks.

prithivMLmods
posted an update 3 days ago
The Map-Anything v1 (Universal Feed-Forward Metric 3D Reconstruction) demo is now available on Hugging Face Spaces. Built with Gradio and integrated with Rerun, it supports multi-image and video-based 3D reconstruction, depth and normal map estimation, and interactive measurements.

🤗 Demo: prithivMLmods/Map-Anything-v1
🤗 Model: facebook/map-anything-v1
🤗 HF Papers: MapAnything: Universal Feed-Forward Metric 3D Reconstruction (2509.13414)
prithivMLmods
posted an update 7 days ago
Introducing QIE-Bbox-Studio! 🔥🤗

The QIE-Bbox-Studio demo is now live, more precise and packed with more options. Users can manipulate images by removing objects, adding designs, and even moving objects from one place to another, all with fast 4-step inference.

🤗 Demo: prithivMLmods/QIE-Bbox-Studio
🔗 GitHub: https://github.com/PRITHIVSAKTHIUR/QIE-Bbox-Studio

🚀 Models [LoRA]:

● QIE-2511-Object-Mover-Bbox: prithivMLmods/QIE-2511-Object-Mover-Bbox
● QIE-2511-Object-Remover-Bbox-v3: prithivMLmods/QIE-2511-Object-Remover-Bbox-v3
● QIE-2511-Outfit-Design-Layout: prithivMLmods/QIE-2511-Outfit-Design-Layout
● QIE-2509-Object-Remover-Bbox-v3: prithivMLmods/QIE-2509-Object-Remover-Bbox-v3
● QIE-2509-Object-Mover-Bbox: prithivMLmods/QIE-2509-Object-Mover-Bbox

🚀 Collection:

● Qwen Image Edit [Layout Bbox]: https://huggingface.co/collections/prithivMLmods/qwen-image-edit-layout-bbox

To learn more, visit the app page or the respective model pages.
prithivMLmods
posted an update 9 days ago
QIE-2509-Object-Remover-Bbox-v3 is a more stable version of the Qwen Image Edit visual grounding-based object removal model. The app was previously featured in HF Spaces of the Week and is now updated with the latest Bbox-v3 LoRA adapter.

🤗 Demo: prithivMLmods/QIE-Object-Remover-Bbox
🤗 LoRA: prithivMLmods/QIE-2509-Object-Remover-Bbox-v3
🤗 Collection: https://huggingface.co/collections/prithivMLmods/qwen-image-edit-layout-bbox

To learn more, visit the app page or the respective model pages.
prithivMLmods
posted an update 17 days ago
The Qwen3.5 Multimodal Understanding Demo, powered by Qwen3.5-2B, is now available on HF Spaces! Qwen3.5-2B is a lightweight model designed for fast image and video reasoning. Built with Gradio, the demo showcases Image QA, Video QA, object detection, and 2D point tracking, along with real-time token streaming.

🤗 Demo: prithivMLmods/Qwen-3.5-HF-Demo
✅ Collection: https://huggingface.co/collections/prithivMLmods/multimodal-implementations
🔗 Qwen3.5-2B: Qwen/Qwen3.5-2B

To learn more, visit the app page or the respective model pages.
prithivMLmods
posted an update 21 days ago
The QIE-Object-Remover-Bbox demo removes objects and artifacts from selected regions using bounding box grounding. Built on Qwen-Image-Edit-2509 with Rapid Diffusers acceleration, it delivers fast 4-step inference via the QIE-2509 adapter. 🤗🔥

🔗 Demo Space: prithivMLmods/QIE-Object-Remover-Bbox
🔗 Qwen-Image-Edit-Rapid-AIO: prithivMLmods/Qwen-Image-Edit-Rapid-AIO-V4
🔗 Adapter (LoRA): prithivMLmods/QIE-2509-Object-Remover-Bbox

🔗 Collection: https://huggingface.co/collections/prithivMLmods/qwen-image-edit-layout-bbox

To learn more, visit the app page or the respective model pages.
prithivMLmods
posted an update 27 days ago
The FireRed-Image-Edit-1.0 (Rapid) fast experimental demo is out! 🚀🤗

Demo: prithivMLmods/FireRed-Image-Edit-1.0-Fast

-> Paired the EditPlusPipeline with the Diffusers-compatible transformer weights of Rapid AIO from Qwen-Image-Edit. (experimental)
-> This fusion delivers more accurate instruction following, higher image quality, and consistent visual coherence at 4-step fast inference.
-> Better maintains text styles with high fidelity, along with high-quality old photo restoration, enhancement, and best-in-class virtual try-on.

prithivMLmods
posted an update about 1 month ago
Dropping the Qwen3 VL Series of Unredacted MAX-VL models. These models have undergone multi-stage training to minimize refusal rates through continuous abliterated optimization. You can find the models in BF16, FP8-Dynamic, and GGUF formats at the links below. 🔥🚀

Unredacted MAX - VL:
➜ prithivMLmods/Qwen3-VL-4B-Instruct-Unredacted-MAX
➜ prithivMLmods/Qwen3-VL-4B-Thinking-Unredacted-MAX
➜ prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAX
➜ prithivMLmods/Qwen3-VL-8B-Thinking-Unredacted-MAX

Unredacted MAX - VL [FP8]
➜ prithivMLmods/Qwen3-VL-4B-Instruct-Unredacted-MAX-FP8
➜ prithivMLmods/Qwen3-VL-4B-Thinking-Unredacted-MAX-FP8
➜ prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAX-FP8
➜ prithivMLmods/Qwen3-VL-8B-Thinking-Unredacted-MAX-FP8

Unredacted MAX - VL [GGUF]
➜ prithivMLmods/Qwen3-VL-4B-Instruct-Unredacted-MAX-GGUF
➜ prithivMLmods/Qwen3-VL-4B-Thinking-Unredacted-MAX-GGUF
➜ prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAX-GGUF
➜ prithivMLmods/Qwen3-VL-8B-Thinking-Unredacted-MAX-GGUF

Unredacted MAX - VL [Collection]
➜ https://huggingface.co/collections/prithivMLmods/unredacted-max-vl-fp8
➜ https://huggingface.co/collections/prithivMLmods/unredacted-max-vl
➜ https://huggingface.co/collections/prithivMLmods/unredacted-max-vl-gguf

To learn more, visit the app page or the respective model pages.
prithivMLmods
posted an update about 1 month ago
Introducing FLUX.2-Klein-LoRA-Studio, a demo for image editing using specialized LoRA adapters built for the FLUX.2-Klein-Distilled model. It features an edit-style gallery for multi-style image editing, including de-light, face swap, mannequin, and more. Try the demo below.

🤗 Demo: prithivMLmods/FLUX.2-Klein-LoRA-Studio
🤗 Collection: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection
🤗 GitHub: https://github.com/PRITHIVSAKTHIUR/FLUX.2-Klein-LoRA-Studio

To learn more, visit the app page or the respective model pages.
prithivMLmods
posted an update about 2 months ago
GLM OCR is a multimodal OCR model for complex document understanding, built on the GLM-V encoder-decoder architecture. It delivers high accuracy and strong generalization with a fast inference pipeline. The demo is live. Try it now. 🤗🚀

✨ Demo: prithivMLmods/GLM-OCR-Demo
✨ Multimodal Implementations: https://huggingface.co/collections/prithivMLmods/multimodal-implementations
✨ GitHub: https://github.com/PRITHIVSAKTHIUR/GLM-OCR-Demo
prithivMLmods
posted an update about 2 months ago
Introducing the Qwen-Image-Edit-3D-Lighting-Control app, featuring 8× horizontal and 3× elevational lighting positions for precise 3D lighting control. It enables studio-level lighting using fast Qwen Image Edit inference, paired with Multi-Angle-Lighting adapters. 🔦

🔥 Space: prithivMLmods/Qwen-Image-Edit-3D-Lighting-Control
✅ Collection: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection
📂 GitHub: https://github.com/PRITHIVSAKTHIUR/Qwen-Image-Edit-3D-Lighting-Control
prithivMLmods
posted an update about 2 months ago
Daggr UI version of the Qwen3-TTS demo. 🔥
It includes nodes for custom voice, voice design, Qwen3-ASR, and voice cloning.
No remote Spaces are used for API inference; all functions run in-app.
Powered by t4-m and built with daggr@0.5.2 and gradio@6.

👉 Demo: prithivMLmods/Qwen3-TTS-Daggr-UI
⭐ GitHub: https://github.com/PRITHIVSAKTHIUR/Qwen3-TTS-Daggr-UI
prithivMLmods
posted an update about 2 months ago
Qwen-Image-Edit-Object-Manipulator Space is now featured in Hugging Face Space of the Week. It enables object manipulation such as extracting objects, adding designs, and removing objects or designs from the red highlighted area using specialized adapters.

🔥 Do enjoy the demo! ~ prithivMLmods/Qwen-Image-Edit-Object-Manipulator

Collections:
🧨 Adapters-1: https://huggingface.co/collections/prithivMLmods/qwen-image-edit-exps
🧨 Adapters-2: https://huggingface.co/collections/prithivMLmods/qie-jan-23-26
🧨 Adapters-3: https://huggingface.co/collections/prithivMLmods/qwen-image-edit-object-manipulator

⭐ GitHub: https://github.com/PRITHIVSAKTHIUR/Qwen-Image-Edit-Object-Manipulator

To learn more, visit the app page or the respective model pages.
prithivMLmods
posted an update about 2 months ago
Introducing QIE-2511-Zoom-Master for highlight-guided area zoom-in, enabling lossless zooming within a drawn square area, and QIE-2511-Object-Remover-v2 for precise object or highlight-guided area cleanup. These experimental adapters are trained on top of QIE-2511. Find the adapters below.

🕹️ QIE-2511-Zoom-Master: prithivMLmods/QIE-2511-Zoom-Master
🕹️ QIE-2511-Object-Remover-v2: prithivMLmods/QIE-2511-Object-Remover-v2

🤗 Demo: prithivMLmods/Qwen-Image-Edit-Object-Manipulator

📂 Collection: https://huggingface.co/collections/prithivMLmods/qwen-image-edit-exps

To learn more, visit the app page or the respective model pages.
prithivMLmods
posted an update 2 months ago
LTX-2 Camera-Control LoRA demo with dolly-in/out and dolly-left/right is now available on Hugging Face, paired with ltx-2-19b-distilled-lora for fast inference. It also includes dynamic GPU duration adjustments for long video generations. Click the related Space links below.

🤗 Try it now on: prithivMLmods/LTX-2-LoRAs-Camera-Control-Dolly
⭐ GitHub: https://github.com/PRITHIVSAKTHIUR/LTX-2-LoRAs-Camera-Control-Dolly
🕹️ Collection: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection

To learn more, visit the app page or the respective model pages.
prithivMLmods
posted an update 3 months ago
Dropping Image Edit (Object Manipulator): Add or remove specified objects/designs, with flexible support for both single-image and multi-image modes.

🤗 Demo: prithivMLmods/Qwen-Image-Edit-Object-Manipulator

Qwen-Image-Edit-2511-Object-Remover is an adapter (LoRA) developed for Qwen's Qwen-Image-Edit-2511 image-to-image model. It is specifically designed for precise object removal from images.

⭐ Model: prithivMLmods/Qwen-Image-Edit-2511-Object-Remover

Qwen-Image-Edit-2511-Object-Adder is an adapter (LoRA) developed for Qwen's Qwen-Image-Edit-2511 image-to-image model. It is specifically designed for precise object addition to images.

⭐ Model: prithivMLmods/Qwen-Image-Edit-2511-Object-Adder

🕹️ Collection: https://huggingface.co/collections/prithivMLmods/qwen-image-edit-object-manipulator
🕹️ GitHub: https://github.com/PRITHIVSAKTHIUR/Qwen-Image-Edit-Object-Manipulator

To learn more, visit the app page or the respective model pages.
prithivMLmods
posted an update 3 months ago
Update: The TRELLIS.2 (Text-to-3D, Image-to-3D) Gradio demo with embedded Rerun, featuring improved visualization in the 3D model previewer, is now available on Hugging Face. Generate assets and view them in the 3D viewer, powered by Microsoft's TRELLIS.2 and Tongyi-MAI's Z-Image-Turbo models.

🤗 TRELLIS.2 (Demo): prithivMLmods/TRELLIS.2-Text-to-3D
🕹️ GitHub: https://github.com/PRITHIVSAKTHIUR/TRELLIS.2-Text-to-3D-RERUN
🕹️ Collection: https://huggingface.co/collections/prithivMLmods/multimodal-implementations

To know more about it, visit the app page or the respective model page!
prithivMLmods
posted an update 3 months ago
Introducing the Qwen-Image-Edit-2511-LoRAs-Fast demo, featuring image property comparison and contrast, built on top of Gradio and the combined Rerun SDK. It supports single and multi-image edits with existing LoRAs that are lazily loaded. (Note: This is still an experimental Space for Qwen-Image-Edit-2511.)

โญ Space Demo: prithivMLmods/Qwen-Image-Edit-2511-LoRAs-Fast
โญ GitHub: https://github.com/PRITHIVSAKTHIUR/Qwen-Image-Edit-2511-LoRAs-Fast-Multi-Image-Rerun
โญ Collection: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection

To know more about it, visit the app page or the respective model page!
prithivMLmods
posted an update 3 months ago
Introducing demos for new SOTA models from AI2: SAGE-MM (Smart Any-Horizon Agents for Long-Video Reasoning) and Molmo-2, an open vision-language model that supports multi-image (QA and pointing) and video (QA, pointing, and tracking). The respective demo-related collections are listed below. 🎃🔥

✨ SAGE-MM [Video-Reasoning]: prithivMLmods/SAGE-MM-Video-Reasoning
✨ Molmo2 [Demo]: prithivMLmods/Molmo2-HF-Demo

🎃 GitHub [SAGE-MM]: https://github.com/PRITHIVSAKTHIUR/SAGE-MM-Video-Reasoning
🎃 GitHub [Molmo2]: https://github.com/PRITHIVSAKTHIUR/Molmo2-HF-Demo
🎃 Multimodal Implementations: https://huggingface.co/collections/prithivMLmods/multimodal-implementations

To know more about it, visit the app page or the respective model page!
prithivMLmods
posted an update 3 months ago
Introducing TRELLIS.2 Text-to-3D. The demo for the TRELLIS.2-4B (Image-to-3D) model is paired with the Z-Image-Turbo image generation model to enable Text-to-3D functionality. No input assets are needed, which makes for a small leap forward in ideation. It also supports Image-to-3D inference from direct image assets. Find the demo and related collections below. 🤗🔥

✨ TRELLIS.2-Text-to-3D [Demo]: prithivMLmods/TRELLIS.2-Text-to-3D
✨ Multimodal Collection: https://huggingface.co/collections/prithivMLmods/multimodal-implementations
✨ GitHub: https://github.com/PRITHIVSAKTHIUR/TRELLIS.2-Text-to-3D

To know more about it, visit the app page or the respective model page!