Harper Young

harperyoung70

AI & ML interests

None yet

Recent Activity

liked a model 7 days ago

clear-blue-sky/evolai-tfm-005

liked a dataset 11 days ago

Organizations

None yet

upvoted a paper 4 days ago

Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex

Paper • 2605.06139 • Published 11 days ago • 65

upvoted a paper 24 days ago

Near-Future Policy Optimization

Paper • 2604.20733 • Published 26 days ago • 76

upvoted 6 papers about 1 month ago

R3PM-Net: Real-time, Robust, Real-world Point Matching Network

Paper • 2604.05060 • Published Apr 6 • 7

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published Apr 9 • 262

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 324

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

Paper • 2604.07430 • Published Apr 8 • 187

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 503

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 342

upvoted 3 papers about 2 months ago

GEMS: Agent-Native Multimodal Generation with Memory and Skills

Paper • 2603.28088 • Published Mar 30 • 85

Can MLLMs Read Students' Minds? Unpacking Multimodal Error Analysis in Handwritten Math

Paper • 2603.24961 • Published Mar 26 • 4

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 371

upvoted 2 papers 3 months ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 523

From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models

Paper • 2602.22859 • Published Feb 26 • 151