On Token's Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models Paper • 2603.27481 • Published 10 days ago • 35
Post: LWDetr is available in 🤗 transformers! Check out the collection to find the original paper, model weights, and a demo space: https://huggingface.co/collections/stevenbucaille/lwdetr
HyperAlign: Hypernetwork for Efficient Test-Time Alignment of Diffusion Models Paper • 2601.15968 • Published Jan 22 • 9