apple/DFN5B-CLIP-ViT-H-14-378 • Updated • 366k • 98
Learning Unmasking Policies for Diffusion Language Models
One Layer Is Enough: Adapting Pretrained Visual Encoders for Image Generation