Eni Grand

Enigrand

AI & ML interests

None yet

Recent Activity

liked a model about 2 hours ago

mistralai/Devstral-Small-2-24B-Instruct-2512

upvoted a paper about 4 hours ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

liked a model 1 day ago

tencent/DRIVE-RL

upvoted a paper about 4 hours ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published 1 day ago • 46

upvoted 2 collections 7 days ago

MiroThinker-v1.0

Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling • 7 items • Updated 5 days ago • 40

Ministral 3

A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated 7 days ago • 116

upvoted a paper 7 days ago

Rectifying LLM Thought from Lens of Optimization

Paper • 2512.01925 • Published 8 days ago • 23

upvoted 2 papers 8 days ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13 • 176

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published 12 days ago • 170

upvoted a collection 17 days ago

SenseNova-SI

Scaling Spatial Intelligence with Multimodal Foundation Models • 8 items • Updated 1 day ago • 11

upvoted a collection 21 days ago

DR Tulu

Models and data associated with DR Tulu, http://allenai-web/papers/drtulu • 5 items • Updated 15 days ago • 31

upvoted a collection 22 days ago

Cerebras REAP

Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 19 items • Updated about 4 hours ago • 52

upvoted a paper 25 days ago

RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Paper • 2511.07317 • Published 29 days ago • 13

upvoted a paper 26 days ago

Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs

Paper • 2511.07003 • Published 29 days ago • 32

upvoted 2 collections 27 days ago

Recurrent Models

These are checkpoints for recurrent LLMs developed to scale test-time compute by recurring in latent space. • 15 items • Updated May 21 • 11

Retrofitting Recurrence

40 items • Updated 28 days ago • 6

upvoted a paper 28 days ago

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Paper • 2511.07384 • Published 29 days ago • 16

upvoted 6 papers about 1 month ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5 • 124

The Collaboration Gap

Paper • 2511.02687 • Published Nov 4 • 21

Limits of Generalization in RLVR: Two Case Studies in Mathematical Reasoning

Paper • 2510.27044 • Published Oct 30 • 5

Continuous Autoregressive Language Models

Paper • 2510.27688 • Published Oct 31 • 70

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 272

Defeating the Training-Inference Mismatch via FP16

Paper • 2510.26788 • Published Oct 30 • 29