2 573

Lei Wang

demolei

https://demoleiwang.github.io/HomePage/

AI & ML interests

LLMs

Recent Activity

upvoted a paper 1 day ago

MARS: Enabling Autoregressive Models Multi-Token Generation

upvoted a paper 3 days ago

ClawArena: Benchmarking AI Agents in Evolving Information Environments

upvoted a paper 8 days ago

MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome

View all activity

Organizations

upvoted a paper 1 day ago

MARS: Enabling Autoregressive Models Multi-Token Generation

Paper • 2604.07023 • Published 2 days ago • 25

upvoted a paper 3 days ago

ClawArena: Benchmarking AI Agents in Evolving Information Environments

Paper • 2604.04202 • Published 5 days ago • 30

upvoted a paper 8 days ago

MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome

Paper • 2603.28407 • Published 11 days ago • 68

upvoted a paper 9 days ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published 21 days ago • 330

upvoted a paper 15 days ago

WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG

Paper • 2603.23497 • Published 16 days ago • 91

upvoted a paper 17 days ago

Omni-WorldBench: Towards a Comprehensive Interaction-Centric Evaluation for World Models

Paper • 2603.22212 • Published 17 days ago • 125

upvoted a paper 18 days ago

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

Paper • 2603.17024 • Published 23 days ago • 108

upvoted 3 papers 19 days ago

ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents

Paper • 2603.18815 • Published 22 days ago • 14

Memento-Skills: Let Agents Design Agents

Paper • 2603.18743 • Published 22 days ago • 56

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published 21 days ago • 66

upvoted a paper 23 days ago

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

Paper • 2603.15726 • Published 24 days ago • 184

upvoted 3 papers 24 days ago

EvoScientist: Towards Multi-Agent Evolving AI Scientists for End-to-End Scientific Discovery

Paper • 2603.08127 • Published Mar 9 • 14

daVinci-Env: Open SWE Environment Synthesis at Scale

Paper • 2603.13023 • Published 28 days ago • 30

Attention Residuals

Paper • 2603.15031 • Published 25 days ago • 176

upvoted a paper 25 days ago

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

Paper • 2603.12201 • Published 28 days ago • 53

upvoted an article 26 days ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Mar 10

•

122

upvoted a paper 30 days ago

MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data

Paper • 2603.09206 • Published about 1 month ago • 53

upvoted a collection 30 days ago

MiroThinker-1.7

Collection

2 items • Updated 30 days ago • 54

upvoted 2 papers about 1 month ago

On-Policy Self-Distillation for Reasoning Compression

Paper • 2603.05433 • Published Mar 5 • 8

Interactive Benchmarks

Paper • 2603.04737 • Published Mar 5 • 19

Lei Wang

AI & ML interests

Recent Activity

Organizations

demolei's activity

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries