1 38 32

Zhisheng Zheng

zhisheng01

https://zhishengzheng.com/

zhisheng147

AI & ML interests

LLM, Speech and Audio Processing

Recent Activity

upvoted a paper 8 days ago

Qwen3-TTS Technical Report

upvoted a paper 24 days ago

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

liked a dataset 28 days ago

SparkAudio/voxbox

View all activity

Organizations

None yet

upvoted a paper 8 days ago

Qwen3-TTS Technical Report

Paper • 2601.15621 • Published 9 days ago • 55

upvoted a paper 24 days ago

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

Paper • 2601.01554 • Published 27 days ago • 57

liked a dataset 28 days ago

SparkAudio/voxbox

Viewer • Updated Apr 15, 2025 • 23.8M • 9.84k • 70

upvoted a paper 2 months ago

VIDEOP2R: Video Understanding from Perception to Reasoning

Paper • 2511.11113 • Published Nov 14, 2025 • 111

upvoted a paper 3 months ago

STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence

Paper • 2510.24693 • Published Oct 28, 2025 • 19

liked a model 3 months ago

jordand/whisper-d-v1a

Updated Nov 1, 2024 • 11.3k • 45

upvoted 2 papers 4 months ago

Efficient Multi-modal Large Language Models via Progressive Consistency Distillation

Paper • 2510.00515 • Published Oct 1, 2025 • 40

StableToken: A Noise-Robust Semantic Speech Tokenizer for Resilient SpeechLLMs

Paper • 2509.22220 • Published Sep 26, 2025 • 65

liked a dataset 4 months ago

maimai11/MNV_17

Viewer • Updated Oct 13, 2025 • 2.6k • 235 • 17

New activity in zhisheng01/SpatialAudio 5 months ago

Validation Dataset

#1 opened 5 months ago by

guardian233

upvoted 3 papers 5 months ago

liked 2 models 6 months ago

nvidia/canary-1b-v2

Automatic Speech Recognition • Updated Dec 3, 2025 • 176k • 343

mistralai/Voxtral-Mini-3B-2507

5B • Updated Jul 28, 2025 • 441k • 610

liked a Space 6 months ago

Open ASR Leaderboard

🏆

1.21k

View and request speech models benchmark data

liked a model 6 months ago

nvidia/canary-qwen-2.5b

Automatic Speech Recognition • 3B • Updated Dec 15, 2025 • 102k • 363

updated a model 7 months ago

zhisheng01/VoiceCraft-X

Updated Jul 17, 2025 • 4 • 2

liked a model 7 months ago

zhisheng01/VoiceCraft-X

Updated Jul 17, 2025 • 4 • 2

published a model 7 months ago

zhisheng01/VoiceCraft-X

Updated Jul 17, 2025 • 4 • 2

Zhisheng Zheng

AI & ML interests

Recent Activity

Organizations

zhisheng01's activity

Validation Dataset

Open ASR Leaderboard