Jiale Zhao

Heisenburger2000

·

https://scholar.google.com/citations?user=rtVg_VUAAAAJ&hl=en

Heisenburger2020

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

DOPD: Dual On-policy Distillation

upvoted a paper 2 days ago

BlockPilot: Instance-Adaptive Policy Learning for Diffusion-based Speculative Decoding

upvoted a paper 3 days ago

TUA-Bench: A Benchmark for General-Purpose Terminal-Use Agents

View all activity

Organizations

upvoted 2 papers 2 days ago

DOPD: Dual On-policy Distillation

Paper • 2606.30626 • Published 4 days ago • 89

BlockPilot: Instance-Adaptive Policy Learning for Diffusion-based Speculative Decoding

Paper • 2606.31315 • Published 3 days ago • 68

upvoted 2 papers 3 days ago

TUA-Bench: A Benchmark for General-Purpose Terminal-Use Agents

Paper • 2606.28480 • Published 7 days ago • 44

OSWorld2.0: Benchmarking Computer Use Agents on Long-Horizon Real-World Tasks

Paper • 2606.29537 • Published 5 days ago • 18

upvoted a paper 7 days ago

The Verification Horizon: No Silver Bullet for Coding Agent Rewards

Paper • 2606.26300 • Published 9 days ago • 46

upvoted 3 papers 8 days ago

Beyond NL2Code: A Structured Survey of Multimodal Code Intelligence

Paper • 2606.15932 • Published 17 days ago • 38

Improved Large Language Diffusion Models

Paper • 2606.25331 • Published 9 days ago • 43

Autodata: An agentic data scientist to create high quality synthetic data

Paper • 2606.25996 • Published 9 days ago • 18

upvoted a paper 9 days ago

Qwen-AgentWorld: Language World Models for General Agents

Paper • 2606.24597 • Published 10 days ago • 144

upvoted 3 papers 10 days ago

EnterpriseClawBench: Benchmarking Agents from Real Workplace Sessions

Paper • 2606.23654 • Published 11 days ago • 79

PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems

Paper • 2606.22388 • Published 12 days ago • 96

CLI-Universe: Towards Verifiable Task Synthesis Engine for Terminal Agents

Paper • 2606.22883 • Published 11 days ago • 37

upvoted 5 papers 17 days ago

VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

Paper • 2606.16140 • Published 18 days ago • 121

BadWorld: Adversarial Attacks on World Models

Paper • 2606.16519 • Published 18 days ago • 18

BRDFusion: Physics Meets Generation for Urban Scene Inverse Rendering

Paper • 2606.17049 • Published 18 days ago • 27

Who Should Lead Decoding Now? Tracking Reliable Trajectories for Ensembling Masked Diffusion Language Models

Paper • 2606.16281 • Published 18 days ago • 34

CODA-BENCH: Can Code Agents Handle Data-Intensive Tasks?

Paper • 2606.15300 • Published 20 days ago • 13

upvoted a paper 21 days ago

FORT-Searcher: Synthesizing Shortcut-Resistant Search Tasks for Training Deep Search Agents

Paper • 2606.12087 • Published 23 days ago • 77

upvoted a paper 22 days ago

DeNovoSWE: Scaling Long-Horizon Environments for Generating Entire Repositories from Scratch

Paper • 2606.10728 • Published 24 days ago • 34

upvoted a paper 23 days ago

Workflow-GYM: Towards Long-Horizon Evaluation of Computer-use Agentic tasks in Real-World Professional Fields

Paper • 2606.11042 • Published 24 days ago • 22