🤝 Open to Collab

42 94 44

Kevin Lin

KevinQHLin

https://qhlin.me/

AI & ML interests

Vision-Language Model, Video Understanding, Agent

Recent Activity

upvoted a paper 9 days ago

Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories

submitted a paper 9 days ago

Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories

upvoted a paper 10 days ago

RhymeFlow: Training-Free Acceleration for Video Generation with Asynchronous Denoising Flow Scheduling

View all activity

Organizations

submitted a paper to Daily Papers 9 days ago

Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories

Paper • 2606.11176 • Published 17 days ago • 126

authored 2 papers 14 days ago

Agents' Last Exam

Paper • 2606.05405 • Published 23 days ago • 363

Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories

Paper • 2606.11176 • Published 17 days ago • 126

submitted a paper to Daily Papers 15 days ago

Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories

Paper • 2606.11176 • Published 17 days ago • 126

submitted a paper to Daily Papers 30 days ago

Soap2Soap: Long Cinematic Video Remaking via Multi-Agent Collaboration

Paper • 2605.17423 • Published May 17 • 34

submitted a paper to Daily Papers about 1 month ago

Soap2Soap: Long Cinematic Video Remaking via Multi-Agent Collaboration

Paper • 2605.17423 • Published May 17 • 34

authored a paper about 1 month ago

AI for Auto-Research: Roadmap & User Guide

Paper • 2605.18661 • Published May 18 • 69

authored 2 papers about 2 months ago

Egocentric Video-Language Pretraining

Paper • 2206.01670 • Published Jun 3, 2022

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Paper • 2604.22748 • Published Apr 24 • 231

authored a paper 2 months ago

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

Paper • 2604.07429 • Published Apr 8 • 123

authored a paper 3 months ago

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published Mar 25 • 99

authored 2 papers 4 months ago

Learning Video Context as Interleaved Multimodal Sequences

Paper • 2407.21757 • Published Jul 31, 2024

Code2World: A GUI World Model via Renderable Code Generation

Paper • 2602.09856 • Published Feb 10 • 201

authored 2 papers 5 months ago

FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection

Paper • 2601.03928 • Published Jan 7 • 16

ShowUI-$π$: Flow-based Generative Models as GUI Dexterous Hands

Paper • 2512.24965 • Published Dec 31, 2025 • 43

submitted a paper to Daily Papers 5 months ago

ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands

Paper • 2512.24965 • Published Dec 31, 2025 • 43

authored a paper 6 months ago

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Paper • 2512.13281 • Published Dec 15, 2025 • 65

submitted a paper to Daily Papers 6 months ago

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Paper • 2512.13281 • Published Dec 15, 2025 • 65

authored 2 papers 6 months ago

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation

Paper • 2408.12528 • Published Aug 22, 2024 • 51

UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction

Paper • 2503.15661 • Published Mar 19, 2025 • 3

Kevin Lin

AI & ML interests

Recent Activity

Organizations

KevinQHLin's activity