Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
araag2 's Collections
OpenCTEval Benchmark Datasets
Medical-LLMs
TAI-P2

OpenCTEval Benchmark Datasets

updated Oct 15, 2025

A collection that supports the development of the OpenCTEval Benchmark, a medical dataset catered towards LLM reasoning over Clinical Trial (CT) data

Upvote
1

  • araag2/MedNLI

    Viewer • Updated Jul 28, 2025 • 42.1k • 63

  • araag2/MedQA

    Viewer • Updated Jul 28, 2025 • 38.2k • 70

  • araag2/MedMCQA

    Viewer • Updated Jul 31, 2025 • 579k • 76

  • araag2/PubMedQA

    Viewer • Updated Jul 31, 2025 • 821k • 30

  • araag2/RCT_Summary

    Viewer • Updated Jul 31, 2025 • 154k • 28

  • araag2/Evidence_Inference_v2

    Viewer • Updated Nov 4, 2025 • 37.5k • 22 • 1

  • araag2/HINT

    Viewer • Updated Nov 4, 2025 • 37.4k • 46

  • araag2/Trial_Meta-Analysis

    Viewer • Updated Oct 17, 2025 • 3.5k • 57 • 1

  • araag2/TREC_Clinicial-Decision-Support

    Viewer • Updated Jul 31, 2025 • 309k • 44

  • araag2/TREC_Precision-Medicine

    Viewer • Updated Jul 31, 2025 • 121k • 51

  • araag2/TREC_Clinical-Trials

    Viewer • Updated Nov 4, 2025 • 307k • 110 • 1

  • araag2/SemEval_NLI4CT

    Viewer • Updated Jul 31, 2025 • 29.4k • 28

  • araag2/NLI4PR

    Viewer • Updated Sep 7, 2025 • 35k • 30
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs