arxiv:2511.19365
ZehongMa
zehongma
AI & ML interests
MLLMs, Image/Video Generation, Multi-modal Representation Learning
Recent Activity
updated
a model
about 10 hours ago
zehongma/PixelGen
published
a model
about 11 hours ago
zehongma/PixelGen
upvoted
a
paper
20 days ago
BabyVision: Visual Reasoning Beyond Language
Organizations
None yet