Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Pritam Sarkar's picture
2 1 1

Pritam Sarkar

pritamqu
dark-pen's profile picture
·
https://pritamsarkar.com
  • pritam94
  • pritamqu
  • sarkarpritam

AI & ML interests

multimodal learning with vision, language, and audio; generative modeling; large multimodal models (LMMs); multimodal LLMs (MLLMs); AI agents; alignments; representation learning; self-supervised and unsupervised learning; vision-language models; audio-visual models; foundation models; computer vision

Recent Activity

liked a dataset 17 days ago
WHB139426/Grounded-VideoLLM
commented on a paper 10 months ago
VCRBench: Exploring Long-form Causal Reasoning Capabilities of Large Video Language Models
updated a dataset 10 months ago
pritamqu/VCRBench
View all activity

Organizations

None yet

pritamqu 's collections 1

HALVA
Model weights for the paper "Data-Augmented Phrase-Level Alignment for Mitigating Object Hallucination"
  • pritamqu/halva13b384-lora

    Updated Jan 29, 2025 • 3
  • pritamqu/halva7b-lora

    Updated Jan 29, 2025 • 1
  • pritamqu/halva13b-lora

    Updated Jan 29, 2025
HALVA
Model weights for the paper "Data-Augmented Phrase-Level Alignment for Mitigating Object Hallucination"
  • pritamqu/halva13b384-lora

    Updated Jan 29, 2025 • 3
  • pritamqu/halva7b-lora

    Updated Jan 29, 2025 • 1
  • pritamqu/halva13b-lora

    Updated Jan 29, 2025
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs