2 95 10

Ju He

turkeyju

https://tacju.github.io/

TACJu

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

upvoted a paper 4 days ago

Compositional Generalization Requires Linear, Orthogonal Representations in Vision Embedding Models

upvoted a paper 4 days ago

Enhancing Spatial Understanding in Image Generation via Reward Modeling

View all activity

Organizations

upvoted a paper 2 days ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published 3 days ago • 69

upvoted 3 papers 4 days ago

Compositional Generalization Requires Linear, Orthogonal Representations in Vision Embedding Models

Paper • 2602.24264 • Published 7 days ago • 14

Enhancing Spatial Understanding in Image Generation via Reward Modeling

Paper • 2602.24233 • Published 7 days ago • 47

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published 9 days ago • 116

upvoted 2 papers 16 days ago

BitDance: Scaling Autoregressive Generative Models with Binary Tokens

Paper • 2602.14041 • Published 20 days ago • 52

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published 17 days ago • 105

upvoted a paper 18 days ago

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Paper • 2602.08683 • Published 26 days ago • 50

authored a paper 22 days ago

Autoregressive Image Generation with Masked Bit Modeling

Paper • 2602.09024 • Published 25 days ago • 6

upvoted a paper 22 days ago

Autoregressive Image Generation with Masked Bit Modeling

Paper • 2602.09024 • Published 25 days ago • 6

upvoted 2 papers about 1 month ago

PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss

Paper • 2602.02493 • Published Feb 2 • 44

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published Feb 2 • 254

upvoted a paper about 2 months ago

BabyVision: Visual Reasoning Beyond Language

Paper • 2601.06521 • Published Jan 10 • 197

upvoted a paper 2 months ago

DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models

Paper • 2512.24165 • Published Dec 30, 2025 • 51

upvoted 6 papers 3 months ago

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published Dec 18, 2025 • 87

Towards Scalable Pre-training of Visual Tokenizers for Generation

Paper • 2512.13687 • Published Dec 15, 2025 • 106

From Pixels to Feelings: Aligning MLLMs with Human Cognitive Perception of Images

Paper • 2511.22805 • Published Nov 27, 2025 • 4

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 240

REASONEDIT: Towards Reasoning-Enhanced Image Editing Models

Paper • 2511.22625 • Published Nov 27, 2025 • 47

Vision Bridge Transformer at Scale

Paper • 2511.23199 • Published Nov 28, 2025 • 46

updated a model 3 months ago

turkeyju/FlowTok

Updated Nov 26, 2025

Ju He

AI & ML interests

Recent Activity

Organizations

turkeyju's activity