Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yusu Qian's picture
2 8

Yusu Qian

YusuQian
apple-intelligence's profile picture kunkkkaa's profile picture hmb's profile picture
·

AI & ML interests

multimodal llm research

Recent Activity

upvoted a paper 22 days ago
MosaicMem: Hybrid Spatial Memory for Controllable Video World Models
upvoted a paper 5 months ago
PRISM-Bench: A Benchmark of Puzzle-Based Visual Tasks with CoT Error Detection
upvoted a paper 6 months ago
Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing
View all activity

Organizations

Apple's profile picture

commented a paper 11 months ago

GIE-Bench: Towards Grounded Evaluation for Text-Guided Image Editing

Paper • 2505.11493 • Published May 16, 2025 • 3 •
2
commented a paper over 1 year ago

How Easy is It to Fool Your Multimodal LLMs? An Empirical Analysis on Deceptive Prompts

Paper • 2402.13220 • Published Feb 20, 2024 • 14 •
3
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs