yujia

yujia0913

AI & ML interests

None yet

Organizations

Recent Activity

upvoted a paper 3 days ago

Self-Distilled Agentic Reinforcement Learning

Paper • 2605.15155 • Published 5 days ago • 94

upvoted a paper 5 days ago

AlphaGRPO: Unlocking Self-Reflective Multimodal Generation in UMMs via Decompositional Verifiable Reward

Paper • 2605.12495 • Published 7 days ago • 35

upvoted a paper 25 days ago

A Self-Evolving Framework for Efficient Terminal Agents via Observational Context Compression

Paper • 2604.19572 • Published 28 days ago • 22

upvoted a paper 27 days ago

AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model

Paper • 2604.19747 • Published 28 days ago • 39

upvoted a paper about 1 month ago

HiVLA: A Visual-Grounded-Centric Hierarchical Embodied Manipulation System

Paper • 2604.14125 • Published Apr 15 • 21

upvoted 2 papers about 2 months ago

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

Paper • 2604.02268 • Published Apr 2 • 101

FASTER: Rethinking Real-Time Flow VLAs

Paper • 2603.19199 • Published Mar 19 • 59

upvoted 2 papers 2 months ago

BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning

Paper • 2603.04918 • Published Mar 5 • 56

Advancing Block Diffusion Language Models for Test-Time Scaling

Paper • 2602.09555 • Published Feb 10 • 4

upvoted 7 papers 3 months ago

Utonia: Toward One Encoder for All Point Clouds

Paper • 2603.03283 • Published Mar 3 • 185

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Paper • 2602.17684 • Published Feb 4 • 22

Prism: Spectral-Aware Block-Sparse Attention

Paper • 2602.08426 • Published Feb 9 • 38

Self-Improving World Modelling with Latent Actions

Paper • 2602.06130 • Published Feb 5 • 32

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

Paper • 2602.05843 • Published Feb 5 • 61

HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing

Paper • 2601.21459 • Published Jan 29 • 10

SPARKLING: Balancing Signal Preservation and Symmetry Breaking for Width-Progressive Learning

Paper • 2602.02472 • Published Feb 2 • 47

upvoted a paper 5 months ago

Alchemist: Unlocking Efficiency in Text-to-Image Model Training via Meta-Gradient Data Selection

Paper • 2512.16905 • Published Dec 18, 2025 • 32

upvoted a paper 6 months ago

Visual Spatial Tuning

Paper • 2511.05491 • Published Nov 7, 2025 • 53

upvoted 2 papers 7 months ago

OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows

Paper • 2510.24411 • Published Oct 28, 2025 • 73

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published Oct 27, 2025 • 98
