Shyam Sudhakaran's picture

Shyam Sudhakaran

shyamsn97

·

AI & ML interests

Reinforcement Learning, Open-Ended Algorithms, Neural Cellular Automata

Recent Activity

updated a model 12 days ago

shyamsn97/tinkerbell-chat-sft

liked a dataset 12 days ago

blanchon/opencs2_dataset

upvoted a paper about 2 months ago

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

View all activity

Organizations

upvoted a paper about 2 months ago

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

Paper • 2603.25158 • Published Mar 26 • 53

upvoted an article 10 months ago

Article

Introducing Command A Vision: Multimodal AI built for Business

CohereLabs

•

Jul 31, 2025

• 64

upvoted a collection over 1 year ago

3D Modelization

58 items • Updated 14 days ago • 12

upvoted a paper over 1 year ago

Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale

Paper • 2409.08264 • Published Sep 12, 2024 • 48

upvoted a collection over 1 year ago

WebInstruct 🌐 Embeddings 🧱 Models

A collection of SoTA embeddings model fine-tuned on WebInstruct dataset to learn to pair instructions with its responses • 3 items • Updated Sep 4, 2024 • 11

upvoted an article over 1 year ago

Article

Selective fine-tuning of Language Models with Spectrum

anakin87

•

Sep 3, 2024

• 36

upvoted a collection almost 2 years ago

💻 Local SmolLMs

SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated May 5, 2025 • 59

upvoted 2 collections about 2 years ago

Mixture-of-preference-reward-modeling

The mixture of preference datasets used for reward modeling. • 2 items • Updated Apr 29, 2024 • 3

Standard-format-preference-dataset

We collect the open-source datasets and process them into the standard format. • 12 items • Updated Mar 2 • 26

upvoted a paper about 2 years ago

Data-Efficient Multimodal Fusion on a Single GPU

Paper • 2312.10144 • Published Dec 15, 2023 • 6

upvoted 2 collections about 2 years ago

Fine-Tuned

37 items • Updated Mar 2 • 7

Merges

Experimental LLM merging • 1270 items • Updated Mar 2 • 8

upvoted a paper over 2 years ago

Transformers are Multi-State RNNs

Paper • 2401.06104 • Published Jan 11, 2024 • 39

upvoted a collection over 2 years ago

Preference Datasets for DPO

This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs • 7 items • Updated Dec 11, 2024 • 49

upvoted a paper over 2 years ago

An Emulator for Fine-Tuning Large Language Models using Small Language Models

Paper • 2310.12962 • Published Oct 19, 2023 • 13

upvoted a collection over 2 years ago

🚂 SD-XL Training Suite

All the steps to train your own SD-XL custom model • 8 items • Updated Apr 24 • 24

upvoted a paper almost 3 years ago

HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models

Paper • 2307.06949 • Published Jul 13, 2023 • 52