CompVis Community

university

AI & ML interests

None defined yet.

Recent Activity

seravee008 authored a paper about 1 month ago

Helios: Real Real-Time Long Video Generation Model

seravee008 authored a paper about 1 month ago

Adaptive 1D Video Diffusion Autoencoder

seravee008 authored a paper about 1 month ago

FSVideo: Fast Speed Video Diffusion Model in a Highly-Compressed Latent Space

authored a paper 2 months ago

TensorLens: End-to-End Transformer Analysis via High-Order Attention Tensors

Paper • 2601.17958 • Published Jan 25 • 3

authored a paper 4 months ago

In-Context Representation Hijacking

Paper • 2512.03771 • Published Dec 3, 2025 • 4

posted an update 6 months ago

Want to iterate on a Hugging Face Space with an LLM?

Now you can easily convert an entire HF repo (Model, Dataset or Space) to a text file and feed it to a language model!

multimodalart/repo2txt

authored a paper 6 months ago

UniFusion: Vision-Language Model as Unified Encoder in Image Generation

Paper • 2510.12789 • Published Oct 14, 2025 • 19

authored a paper 6 months ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published Oct 9, 2025 • 39

authored a paper 6 months ago

When Judgment Becomes Noise: How Design Failures in LLM Judge Benchmarks Silently Undermine Validity

Paper • 2509.20293 • Published Sep 24, 2025 • 8

authored a paper 6 months ago

DiffusionNFT: Online Diffusion Reinforcement with Forward Process

Paper • 2509.16117 • Published Sep 19, 2025 • 23

authored 2 papers 9 months ago

When Do Neural Nets Outperform Boosted Trees on Tabular Data?

Paper • 2305.02997 • Published May 4, 2023

MARVIS: Modality Adaptive Reasoning over VISualizations

Paper • 2507.01544 • Published Jul 2, 2025 • 13

authored a paper 9 months ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26, 2025 • 78

authored a paper 9 months ago

How to Train your Text-to-Image Model: Evaluating Design Choices for Synthetic Training Captions

Paper • 2506.16679 • Published Jun 20, 2025 • 2

posted an update 10 months ago

Self-Forcing, a real-time video model distilled from Wan 2.1 by @adobe, is out, and they open-sourced it 🐐

I've built a live real-time demo on Spaces 📹💨

multimodalart/self-forcing

authored a paper 10 months ago

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Paper • 2506.05209 • Published Jun 5, 2025 • 60

authored 4 papers 10 months ago

LiveBench: A Challenging, Contamination-Free LLM Benchmark

Paper • 2406.19314 • Published Jun 27, 2024 • 23

TuneTables: Context Optimization for Scalable Prior-Data Fitted Networks

Paper • 2402.11137 • Published Feb 17, 2024

OpenThoughts: Data Recipes for Reasoning Models

Paper • 2506.04178 • Published Jun 4, 2025 • 54

ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models

Paper • 2310.18208 • Published Oct 27, 2023

authored a paper 10 months ago

Revisiting LRP: Positional Attribution as the Missing Ingredient for Transformer Explainability

Paper • 2506.02138 • Published Jun 2, 2025 • 1

authored a paper 10 months ago

Negative-Guided Subject Fidelity Optimization for Zero-Shot Subject-Driven Generation

Paper • 2506.03621 • Published Jun 4, 2025 • 22

authored a paper 10 months ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2, 2025 • 157