Papers
Fast Chain-of-Thought: A Glance of Future from Parallel Decoding Leads to Answers Faster
Paper
• 2311.08263
• Published • 16
Exponentially Faster Language Modelling
Paper
• 2311.10770
• Published • 119
Memory Augmented Language Models through Mixture of Word Experts
Paper
• 2311.10768
• Published • 19
VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models
Paper
• 2312.00845
• Published • 39
DiffiT: Diffusion Vision Transformers for Image Generation
Paper
• 2312.02139
• Published • 15
COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training
Paper
• 2401.00849
• Published • 17
Deconstructing Denoising Diffusion Models for Self-Supervised Learning
Paper
• 2401.14404
• Published • 18
SliceGPT: Compress Large Language Models by Deleting Rows and Columns
Paper
• 2401.15024
• Published • 73
Larimar: Large Language Models with Episodic Memory Control
Paper
• 2403.11901
• Published • 33
VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis
Paper
• 2403.08764
• Published • 36
Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers
Paper
• 2403.12943
• Published • 15