alexcombessie (Alex Combessie)

upvoted 2 articles 4 months ago

Article

How the LiteLLM PyPI Supply Chain Attack Happened — and What to Do If You're Affected

davidberenstein1957

•

Mar 25

• 2

Article

Announcing Giskard v3

davidberenstein1957

•

Apr 2

• 2

upvoted a collection 5 months ago

NVIDIA Nemotron v3

Collection

Open, Production-ready Enterprise Models • 23 items • Updated 15 days ago • 344

upvoted an article 8 months ago

Article

Phare LLM benchmark V2: Reasoning models don't guarantee better security

davidberenstein1957

•

Dec 16, 2025

• 10

upvoted an article 10 months ago

Article

LLM vulnerability scanner for dynamic & multi-turn Red Teaming

JMJM

•

Sep 25, 2025

• 2

upvoted 3 articles about 1 year ago

Article

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs

davidberenstein1957

•

May 7, 2025

• 42

Article

RealPerformance, A Dataset of Language Model Business Compliance Issues

davidberenstein1957

•

Jul 21, 2025

• 4

Article

LLMs recognise bias but also reproduce harmful stereotypes: an analysis of bias in leading LLMs

davidberenstein1957

•

Jul 2, 2025

• 16

upvoted a paper about 1 year ago

Phare: A Safety Probe for Large Language Models

Paper • 2505.11365 • Published May 16, 2025 • 7

upvoted a paper over 1 year ago

RealHarm: A Collection of Real-World Language Model Application Failures

Paper • 2504.10277 • Published Apr 14, 2025 • 10

upvoted a collection over 1 year ago

The Big Benchmarks Collection

Collection

Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18, 2024 • 268

upvoted an article about 2 years ago

Article

License to Call: Introducing Transformers Agents 2.0

+1

m-ric, lysandre, pcuenq

•

May 13, 2024

• 137

upvoted a paper over 2 years ago

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Paper • 2404.14619 • Published Apr 22, 2024 • 126

Alex Combessie

AI & ML interests

Organizations

How the LiteLLM PyPI Supply Chain Attack Happened — and What to Do If You're Affected

Announcing Giskard v3

NVIDIA Nemotron v3

Phare LLM benchmark V2: Reasoning models don't guarantee better security

LLM vulnerability scanner for dynamic & multi-turn Red Teaming

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs

RealPerformance, A Dataset of Language Model Business Compliance Issues

LLMs recognise bias but also reproduce harmful stereotypes: an analysis of bias in leading LLMs

Phare: A Safety Probe for Large Language Models

RealHarm: A Collection of Real-World Language Model Application Failures

The Big Benchmarks Collection

License to Call: Introducing Transformers Agents 2.0

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Alex Combessie

AI & ML interests

Organizations

alexcombessie's activity

How the LiteLLM PyPI Supply Chain Attack Happened — and What to Do If You're Affected

**Announcing Giskard v3**

Phare LLM benchmark V2: Reasoning models don't guarantee better security

LLM vulnerability scanner for dynamic & multi-turn Red Teaming

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs

RealPerformance, A Dataset of Language Model Business Compliance Issues

LLMs recognise bias but also reproduce harmful stereotypes: an analysis of bias in leading LLMs

License to Call: Introducing Transformers Agents 2.0

Announcing Giskard v3