Ju He

turkeyju

https://tacju.github.io/

TACJu

AI & ML interests

None yet

Recent Activity

upvoted 2 papers 10 days ago

BitDance: Scaling Autoregressive Generative Models with Binary Tokens

Paper • 2602.14041 • Published 14 days ago • 50

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published 12 days ago • 99

upvoted a paper 13 days ago

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Paper • 2602.08683 • Published 20 days ago • 49

authored a paper 16 days ago

Autoregressive Image Generation with Masked Bit Modeling

Paper • 2602.09024 • Published 19 days ago • 6

upvoted a paper 17 days ago

Autoregressive Image Generation with Masked Bit Modeling

Paper • 2602.09024 • Published 19 days ago • 6

upvoted 2 papers 26 days ago

PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss

Paper • 2602.02493 • Published 26 days ago • 42

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published 27 days ago • 251

upvoted 2 papers about 2 months ago

BabyVision: Visual Reasoning Beyond Language

Paper • 2601.06521 • Published Jan 10 • 196

DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models

Paper • 2512.24165 • Published Dec 30, 2025 • 51

upvoted 2 papers 2 months ago

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published Dec 18, 2025 • 87

Towards Scalable Pre-training of Visual Tokenizers for Generation

Paper • 2512.13687 • Published Dec 15, 2025 • 106

upvoted 4 papers 3 months ago

From Pixels to Feelings: Aligning MLLMs with Human Cognitive Perception of Images

Paper • 2511.22805 • Published Nov 27, 2025 • 4

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 239

REASONEDIT: Towards Reasoning-Enhanced Image Editing Models

Paper • 2511.22625 • Published Nov 27, 2025 • 47

Vision Bridge Transformer at Scale

Paper • 2511.23199 • Published Nov 28, 2025 • 46

updated a model 3 months ago

turkeyju/FlowTok

Updated Nov 26, 2025

published a model 3 months ago