Yasunori Ozaki's picture

In a Training Loop 🔄

Yasunori Ozaki PRO

alfredplpl

·

https://alfredplpl.github.io/en/index.html

AI & ML interests

Computer Vision, LLM

Recent Activity

liked a model 1 day ago

LiquidAI/LFM2.5-1.2B-JP-GGUF

liked a Space 2 days ago

zerogpu-aoti/wan2-2-fp8da-aoti-faster

upvoted a paper 3 days ago

Atlas: Orchestrating Heterogeneous Models and Tools for Multi-Domain Complex Reasoning

View all activity

Organizations

upvoted a paper 3 days ago

Atlas: Orchestrating Heterogeneous Models and Tools for Multi-Domain Complex Reasoning

Paper • 2601.03872 • Published 4 days ago • 36

upvoted a paper 4 days ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published 5 days ago • 87

upvoted a paper 12 days ago

Yume-1.5: A Text-Controlled Interactive World Generation Model

Paper • 2512.22096 • Published 16 days ago • 57

upvoted a paper 20 days ago

LLaDA2.0: Scaling Up Diffusion Language Models to 100B

Paper • 2512.15745 • Published Dec 10, 2025 • 78

upvoted a paper 23 days ago

DeContext as Defense: Safe Image Editing in Diffusion Transformers

Paper • 2512.16625 • Published 24 days ago • 24

upvoted a paper 24 days ago

IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning

Paper • 2512.15635 • Published 25 days ago • 19

upvoted a collection 24 days ago

Qwen-Image

14 items • Updated 11 days ago • 50

upvoted 2 papers about 1 month ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published Dec 4, 2025 • 167

Self-Improving VLM Judges Without Human Annotations

Paper • 2512.05145 • Published Dec 2, 2025 • 19

upvoted a collection about 1 month ago

Z-Image

4 items • Updated Dec 1, 2025 • 105

upvoted a changelog about 1 month ago

Changelog

Duplicate Datasets

Dec 3, 2025

• 95

upvoted a paper about 1 month ago

Glance: Accelerating Diffusion Models with 1 Sample

Paper • 2512.02899 • Published Dec 2, 2025 • 29

upvoted a paper about 2 months ago

Back to Basics: Let Denoising Generative Models Denoise

Paper • 2511.13720 • Published Nov 17, 2025 • 67

upvoted a collection 2 months ago

WAON

WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models • 4 items • Updated Oct 28, 2025 • 1

upvoted 2 papers 2 months ago

WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models

Paper • 2510.22276 • Published Oct 25, 2025 • 3

FARMER: Flow AutoRegressive Transformer over Pixels

Paper • 2510.23588 • Published Oct 27, 2025 • 58

upvoted 2 papers 3 months ago

UltraGen: High-Resolution Video Generation with Hierarchical Attention

Paper • 2510.18775 • Published Oct 21, 2025 • 17

Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

Paper • 2510.15742 • Published Oct 17, 2025 • 50

upvoted a collection 3 months ago

RAE

Collection for Diffusion Transformers with Representation Autoencoders • 1 item • Updated Oct 14, 2025 • 10

upvoted a paper 3 months ago

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

Paper • 2510.02283 • Published Oct 2, 2025 • 96