Debajyati Dey's picture

Debajyati Dey

DayK0n

·

Debajyati

AI & ML interests

None yet

Recent Activity

liked a Space 2 days ago

ideogram-ai/ideogram4

liked a model 2 days ago

Comfy-Org/Ideogram-4

liked a model 2 days ago

ideogram-ai/ideogram-4-nf4

View all activity

Organizations

None yet

upvoted 4 collections 7 days ago

💻 Qwopus-Coder

Reasoning-distilled coding models optimized for specialized domains like agentic workflows. • 4 items • Updated 13 days ago • 15

Qwopus3.5-v3.5/v3

🌟Qwopus3.5-v3.5 is the latest model in the Claude series. • 14 items • Updated 18 days ago • 106

🍎 Qwopus3.6

This collection features the advanced Qwopus3.6 series of multimodal large models, which are fine-tuned from the Qwen3.6 base models with a focus on e • 10 items • Updated 17 days ago • 63

Qwen3.5-Claude-4.6-Opus-Reasoning-Distilled

18 items • Updated 18 days ago • 213

upvoted a collection 10 days ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5, 2025 • 309

upvoted 5 papers 11 days ago

SpatialBench: Is Your Spatial Foundation Model an All-Round Player?

Paper • 2605.27367 • Published 15 days ago • 71

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Paper • 2605.30280 • Published 13 days ago • 140

WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation

Paper • 2605.25874 • Published 16 days ago • 102

Rethinking Cross-Layer Information Routing in Diffusion Transformers

Paper • 2605.20708 • Published 21 days ago • 110

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published 15 days ago • 139

upvoted 3 papers 15 days ago

Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models

Paper • 2605.21573 • Published 21 days ago • 110

PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion

Paper • 2605.23902 • Published 19 days ago • 46

VGenST-Bench: A Benchmark for Spatio-Temporal Reasoning via Active Video Synthesis

Paper • 2605.22570 • Published 20 days ago • 24

upvoted a collection about 1 month ago

Pixio

5 items • Updated Dec 19, 2025 • 16

upvoted 4 papers about 1 month ago

VOID: Video Object and Interaction Deletion

Paper • 2604.02296 • Published Apr 2 • 56

SketchVLM: Vision language models can annotate images to explain thoughts and guide users

Paper • 2604.22875 • Published Apr 23 • 37

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Paper • 2604.22748 • Published Apr 24 • 227

Recursive Multi-Agent Systems

Paper • 2604.25917 • Published Apr 28 • 276

upvoted a collection about 2 months ago

V-JEPA 2

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13, 2025 • 222

upvoted a paper about 2 months ago

Learning Transferable Visual Models From Natural Language Supervision

Paper • 2103.00020 • Published Feb 26, 2021 • 22