TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models • arXiv 2512.02014 • Published Dec 2025 • 60 upvotes
Guided Self-Evolving LLMs with Minimal Human Supervision • arXiv 2512.02472 • Published Dec 2025 • 48 upvotes
VisPlay: Self-Evolving Vision-Language Models from Images • arXiv 2511.15661 • Published Nov 2025 • 42 upvotes
VisCoder2: Building Multi-Language Visualization Coding Agents • arXiv 2510.23642 • Published Oct 24, 2025 • 21 upvotes
BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions • arXiv 2510.10666 • Published Oct 12, 2025 • 27 upvotes
UniVideo: Unified Understanding, Generation, and Editing for Videos • arXiv 2510.08377 • Published Oct 9, 2025 • 70 upvotes
VideoScore2: Think before You Score in Generative Video Evaluation • arXiv 2509.22799 • Published Sep 26, 2025 • 25 upvotes
Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning • arXiv 2509.22824 • Published Sep 26, 2025 • 20 upvotes
Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions? • arXiv 2509.04292 • Published Sep 4, 2025 • 57 upvotes
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning • arXiv 2509.02544 • Published Sep 2, 2025 • 124 upvotes
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning • arXiv 2509.02479 • Published Sep 2, 2025 • 83 upvotes
OpenVision 2: A Family of Generative Pretrained Visual Encoders for Multimodal Learning • arXiv 2509.01644 • Published Sep 1, 2025 • 33 upvotes
VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use • arXiv 2509.01055 • Published Sep 1, 2025 • 75 upvotes
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency • arXiv 2508.18265 • Published Aug 25, 2025 • 208 upvotes
MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents • arXiv 2508.13186 • Published Aug 14, 2025 • 18 upvotes