Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant Paper • 2410.15316 • Published Oct 20, 2024 • 12
Vision Language Models Papers Collection Papers about vision-language models; the most important ones are at the top of the list. • 27 items • Updated Apr 30, 2024 • 40
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference Paper • 2407.14057 • Published Jul 19, 2024 • 46
Block Transformer: Global-to-Local Language Modeling for Fast Inference Paper • 2406.02657 • Published Jun 4, 2024 • 41
ShareGPT4Video: Improving Video Understanding and Generation with Better Captions Paper • 2406.04325 • Published Jun 6, 2024 • 74
Papers about model merging Collection referenced in the mergekit repo: https://github.com/cg123/mergekit • 4 items • Updated Feb 13, 2024 • 14