Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
HiTZ 's Collections
Latxa Instruct
TTS
ASR Datasets
Pyannote
Nvidia NeMo
Speech Collection
Whisper
Latxa
Multilingual TruthfulQA
GoLLIE
Ask2Transformers
Metaphor Processing
MATE
EusCrawl
BERnaT
Alpaca LoRA MT
Lemmatization
Pretraining Datasets
Evaluation Datasets
Instruction Datasets
Basque Encoders
OPT RM
Composite Corpus
Medical-mT5
Lessons in Evaluation of Spanish Encoder-only Models
BasqueParl
This is not a dataset
Speech to Text
CONAN-EUS: Counternarrative Generation in Basque and Spanish
EriBERTa
BERTeus
IXAmBERT
Antidote Project
Machine Translation
XNLIeu
Odesia Challenge 2024
Medical MT

ASR Datasets

updated 3 days ago

Collection with datasets for training and benchmark-evaluating ASR in Basque, Spanish and Bilingual Basque-Spanish

Upvote
-

  • HiTZ/composite_corpus_eseu_v1.0

    Viewer • Updated May 12 • 742k • 483 • 2

    Note Dataset for training ASR models in Bilingual Basque-Spanish


  • HiTZ/composite_corpus_eu_v2.1

    Viewer • Updated Dec 19, 2024 • 407k • 288 • 2

    Note Dataset for training ASR models in Basque


  • HiTZ/composite_corpus_es_v1.0

    Viewer • Updated May 12 • 526k • 341

    Note Dataset for training ASR models in Spanish


  • HiTZ/benchmark_eseu_testsets

    Updated Apr 19 • 45

    Note Dataset for making benchmark evaluations in Basque, Spanish or Bilingual Basque-Spanish ASR models

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs