Guille Pérez-Torró

guishe

AI & ML interests

Information Retrieval, Few-Shot Learning, Named Entity Recognition, Named Entity Disambiguation, Semantic Search, Aspect-based Sentiment Analysis

Recent Activity

upvoted a paper 7 days ago

Reinforcement Learning for Self-Improving Agent with Skill Library

liked a Space 15 days ago

HuggingFaceFW/finephrase

upvoted a paper about 2 months ago

ORPO: Monolithic Preference Optimization without Reference Model

Organizations

None yet

upvoted a paper 7 days ago

Reinforcement Learning for Self-Improving Agent with Skill Library

Paper • 2512.17102 • Published Dec 18, 2025 • 41

liked a Space 15 days ago

The Synthetic Data Playbook: Generating Trillions of the Finest Tokens

📝

207

Explore synthetic data experiments as an interactive bookshelf

upvoted a paper about 2 months ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 72

liked 2 models 2 months ago

BAAI/bge-reranker-v2-m3

Text Classification • 0.6B • Updated Jun 24, 2024 • 5.36M • 930

fastino/gliner2-large-v1

Updated 26 days ago • 102k • 61

liked a model 3 months ago

unsloth/Qwen3-VL-8B-Instruct-unsloth-bnb-4bit

Image-Text-to-Text • 9B • Updated Oct 31, 2025 • 34.1k • 19

liked a Space 3 months ago

Evaluation Guidebook

📝

292

Explore LLM benchmark trends over time

upvoted 2 articles 5 months ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Article • Feb 7, 2025 • 280

Merge Large Language Models with mergekit

Article • Jan 9, 2024 • 152

liked a Space 5 months ago

The Smol Training Playbook

📚

3.06k

The secrets to building world-class LLMs

upvoted an article 5 months ago

Introducing MTEB v2: Evaluation of embedding and retrieval systems for more than just text

Article • Oct 20, 2025

upvoted an article 6 months ago

Welcome EmbeddingGemma, Google's new efficient embedding model

Article • Sep 4, 2025 • 273

liked a model 7 months ago

unsloth/Qwen3-4B-Instruct-2507-unsloth-bnb-4bit

Text Generation • Updated Aug 6, 2025 • 177k • 13

liked a model 8 months ago

unsloth/gpt-oss-20b-bnb-4bit

Text Generation • 21B • Updated Aug 6, 2025 • 2.78k • 13

upvoted a collection 8 months ago

gpt-oss

Collection

Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7, 2025 • 420

New activity in Qwen/Qwen3-Embedding-0.6B 8 months ago

task description for clustering

#33 opened 8 months ago by guishe

updated a collection 10 months ago

Small LLMs

Collection

6 items • Updated Jun 10, 2025

upvoted an article 10 months ago

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

Article • Oct 14, 2024 • 103

updated 2 collections 10 months ago

Instruct LLMs

Collection

6 items • Updated May 26, 2025

Multi-Vector Embedding Models

Collection

2 items • Updated May 16, 2025
