Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Mark Redito's picture

59 18

Mark Redito

markredito

Jervist's profile picture

victor's profile picture

krispyATL's profile picture

·

https://markredito.com

markredito
markredito
markredito

AI & ML interests

Generative AI, Multimodal AI, Deep Learning

Organizations

markredito 's collections 9

Image Generation

InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation

Paper • 2309.06380 • Published Sep 12, 2023 • 33
PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models

Paper • 2309.05793 • Published Sep 11, 2023 • 51
Generative Image Dynamics

Paper • 2309.07906 • Published Sep 14, 2023 • 55
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Paper • 2309.15818 • Published Sep 27, 2023 • 19

Retrieval-Augmented Text-to-Audio Generation

Paper • 2309.08051 • Published Sep 14, 2023 • 7
A Large-scale Dataset for Audio-Language Representation Learning

Paper • 2309.11500 • Published Sep 20, 2023 • 9
End-to-End Speech Recognition Contextualization with Large Language Models

Paper • 2309.10917 • Published Sep 19, 2023 • 9
FoleyGen: Visually-Guided Audio Generation

Paper • 2309.10537 • Published Sep 19, 2023 • 8

Compositional Foundation Models for Hierarchical Planning

Paper • 2309.08587 • Published Sep 15, 2023 • 11
DreamLLM: Synergistic Multimodal Comprehension and Creation

Paper • 2309.11499 • Published Sep 20, 2023 • 60
VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning

Paper • 2309.15091 • Published Sep 26, 2023 • 35
Context-Aware Meta-Learning

Paper • 2310.10971 • Published Oct 17, 2023 • 17

meta-llama/Llama-2-70b

Text Generation • Updated Apr 17, 2024 • 15 • 538

Garment3DGen: 3D Garment Stylization and Texture Generation

Paper • 2403.18816 • Published Mar 27, 2024 • 25

Agents: An Open-source Framework for Autonomous Language Agents

Paper • 2309.07870 • Published Sep 14, 2023 • 43
Clinical Text Summarization: Adapting Large Language Models Can Outperform Human Experts

Paper • 2309.07430 • Published Sep 14, 2023 • 28
Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers

Paper • 2309.08532 • Published Sep 15, 2023 • 54
Investigating Answerability of LLMs for Long-Form Question Answering

Paper • 2309.08210 • Published Sep 15, 2023 • 15

Interpretability

Sparse Autoencoders Find Highly Interpretable Features in Language Models

Paper • 2309.08600 • Published Sep 15, 2023 • 15

Music Generation

skytnt/midi-model

0.2B • Updated Oct 31, 2024 • 217 • 39

Interactive Task Planning with Language Models

Paper • 2310.10645 • Published Oct 16, 2023 • 12
Humanoid Locomotion as Next Token Prediction

Paper • 2402.19469 • Published Feb 29, 2024 • 29

Image Generation

InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation

Paper • 2309.06380 • Published Sep 12, 2023 • 33
PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models

Paper • 2309.05793 • Published Sep 11, 2023 • 51
Generative Image Dynamics

Paper • 2309.07906 • Published Sep 14, 2023 • 55
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Paper • 2309.15818 • Published Sep 27, 2023 • 19

Agents: An Open-source Framework for Autonomous Language Agents

Paper • 2309.07870 • Published Sep 14, 2023 • 43
Clinical Text Summarization: Adapting Large Language Models Can Outperform Human Experts

Paper • 2309.07430 • Published Sep 14, 2023 • 28
Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers

Paper • 2309.08532 • Published Sep 15, 2023 • 54
Investigating Answerability of LLMs for Long-Form Question Answering

Paper • 2309.08210 • Published Sep 15, 2023 • 15

Retrieval-Augmented Text-to-Audio Generation

Paper • 2309.08051 • Published Sep 14, 2023 • 7
A Large-scale Dataset for Audio-Language Representation Learning

Paper • 2309.11500 • Published Sep 20, 2023 • 9
End-to-End Speech Recognition Contextualization with Large Language Models

Paper • 2309.10917 • Published Sep 19, 2023 • 9
FoleyGen: Visually-Guided Audio Generation

Paper • 2309.10537 • Published Sep 19, 2023 • 8

Interpretability

Sparse Autoencoders Find Highly Interpretable Features in Language Models

Paper • 2309.08600 • Published Sep 15, 2023 • 15

Compositional Foundation Models for Hierarchical Planning

Paper • 2309.08587 • Published Sep 15, 2023 • 11
DreamLLM: Synergistic Multimodal Comprehension and Creation

Paper • 2309.11499 • Published Sep 20, 2023 • 60
VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning

Paper • 2309.15091 • Published Sep 26, 2023 • 35
Context-Aware Meta-Learning

Paper • 2310.10971 • Published Oct 17, 2023 • 17

Music Generation

skytnt/midi-model

0.2B • Updated Oct 31, 2024 • 217 • 39

meta-llama/Llama-2-70b

Text Generation • Updated Apr 17, 2024 • 15 • 538

Interactive Task Planning with Language Models

Paper • 2310.10645 • Published Oct 16, 2023 • 12
Humanoid Locomotion as Next Token Prediction

Paper • 2402.19469 • Published Feb 29, 2024 • 29

Garment3DGen: 3D Garment Stylization and Texture Generation

Paper • 2403.18816 • Published Mar 27, 2024 • 25

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs