Article Efficient LLM Pretraining: Packed Sequences and Masked Attention Oct 7, 2024 • 69
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published Oct 6, 2025 • 512
Set Block Decoding is a Language Model Inference Accelerator Paper • 2509.04185 • Published Sep 4, 2025 • 54
Article 🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It? Mar 17, 2025 • 355
Article How to build a custom text classifier without days of human labeling Oct 17, 2024 • 57
Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy Sep 18, 2024 • 278
Article Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging Aug 19, 2024 • 79