Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Shuangrui Ding's picture
3 25 18

Shuangrui Ding

Mar2Ding
svjack's profile picture russelljohnson's profile picture Fayaz's profile picture
·
https://mark12ding.github.io/
  • ShuangruiDing
  • mark12ding

AI & ML interests

None yet

Recent Activity

liked a model 5 days ago
facebook/sam3
liked a Space 15 days ago
Tongyi-MAI/Z-Image-Turbo
upvoted a paper 28 days ago
Scalable Multi-Task Reinforcement Learning for Generalizable Spatial Intelligence in Visuomotor Agents
View all activity

Organizations

None yet

authored a paper 10 months ago

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

Paper • 2502.13128 • Published Feb 18 • 41
authored 2 papers 11 months ago

OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?

Paper • 2501.05510 • Published Jan 9 • 43

Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction

Paper • 2501.03218 • Published Jan 6 • 36
authored a paper 12 months ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published Dec 12, 2024 • 98
authored a paper about 1 year ago

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree

Paper • 2410.16268 • Published Oct 21, 2024 • 69
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs