
Minki Kang

Nardien


AI & ML interests

None yet

Recent Activity


upvoted 2 papers 5 days ago

TCOD: Exploring Temporal Curriculum in On-Policy Distillation for Multi-turn Autonomous Agents

Paper • 2604.24005 • Published Apr 27 • 9

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

Paper • 2606.18216 • Published 7 days ago • 57

upvoted a paper 11 days ago

SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning

Paper • 2606.13673 • Published 12 days ago • 104

upvoted 2 papers 17 days ago

TIDE: Proactive Multi-Problem Discovery via Template-Guided Iteration

Paper • 2606.04743 • Published 20 days ago • 46

HINT-SD: Targeted Hindsight Self-Distillation for Long-Horizon Agents

Paper • 2605.17873 • Published May 18 • 12

upvoted 2 papers 18 days ago

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published 22 days ago • 132

Benchmarking Visual State Tracking in Multimodal Video Understanding

Paper • 2606.03920 • Published 21 days ago • 49

upvoted a paper 25 days ago

OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources

Paper • 2605.29250 • Published 26 days ago • 78

authored a paper 25 days ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published 27 days ago • 93

commented on a paper 25 days ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published 27 days ago • 93

upvoted a paper 25 days ago

Learn from Weaknesses: Automated Domain Specialization for Small Computer-Use Agents

Paper • 2605.28775 • Published 27 days ago • 38

upvoted a paper 26 days ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published 27 days ago • 93

upvoted a paper about 1 month ago

It Takes Two: Complementary Self-Distillation for Contextual Integrity in LLMs

Paper • 2605.20258 • Published May 18 • 30

submitted a paper to Daily Papers about 1 month ago

Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR

Paper • 2605.15726 • Published May 15 • 34

upvoted 2 papers about 1 month ago

Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR

Paper • 2605.15726 • Published May 15 • 34

PREPING: Building Agent Memory without Tasks

Paper • 2605.13880 • Published May 11 • 28

liked a dataset 2 months ago

nvidia/Nemotron-Terminal-Corpus

Viewer • Updated Feb 27 • 366k • 6.24k • 133

upvoted 2 papers 2 months ago

OpenGame: Open Agentic Coding for Games

Paper • 2604.18394 • Published Apr 20 • 84

Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents

Paper • 2604.14004 • Published Apr 15 • 30

upvoted a paper 3 months ago

T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search

Paper • 2603.22341 • Published Mar 21 • 37
