Dutch Open Speech Recognition Benchmark

Results of Dutch ASR models, collected by the community

WER results
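
For readers unfamiliar with the metric, word error rate (WER) is the word-level edit distance between the reference transcript and the ASR output, divided by the number of reference words. The sketch below shows the standard definition only. It is not the scoring script used for this benchmark: the function name `word_error_rate` and the lowercase-and-split-on-whitespace normalization are assumptions made for illustration, and different normalization choices can change the resulting numbers.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER in percent: (substitutions + deletions + insertions) / reference words.

    Assumption: text is normalized by lowercasing and splitting on whitespace.
    The benchmark's actual normalization is not described here.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # turning i reference words into an empty hypothesis costs i deletions
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,                      # deletion of a reference word
                curr[j - 1] + 1,                  # insertion of a hypothesis word
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr

    return 100.0 * prev[-1] / len(ref)


# Made-up example sentences, not taken from any benchmark dataset:
# one substitution ("patiënt" -> "patient") and one deletion ("het") over 5 reference words.
print(word_error_rate("de patiënt neemt het medicijn", "de patient neemt medicijn"))  # 40.0
```

Because insertions are counted against the reference length, WER can exceed 100% when the hypothesis is much longer than the reference.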


1. Medicijnjournaal (MJ): 35 files

Ground truth: https://doi.org/10.34973/dpjc-0v85

| ASR system | WER (%) | GPUs |
|---|---|---|
| Wav2vec2.0 | 12.8 | Yes |
| Kaldi-NL | 16.1 | Yes |


2. Medical video material (MV): 11 files

Ground truth: Available on request by email: cristian.tejedorgarcia@ru.nl

Notes: Monologues and dialogues, some with background music; good audio quality overall.

| ASR system | WER (%) | GPUs |
|---|---|---|
| Whisper-large-v2 | 10.9 | Yes |
| Wav2vec2.0 | 24.2 | Yes |
| Kaldi-NL | 28.4 | Yes |


3. Patient-provider medical conversations at Nivel originally intended for testing purposes (Nivel): 38 files (~10 hours)

Ground truth: You need permission to access the Nivel files. Ask via email: cristian.tejedorgarcia@ru.nl

| ASR system | WER (%) | GPUs |
|---|---|---|
| Whisper-large-v2 | 57.1 | No |
| Kaldi-NL (fine-tuned) | 68.0 | No |
| Kaldi-NL | 71.2 | No |


4. Patient-provider medical conversations at Nivel originally intended for training purposes (Nivel_Train): 110 files (~40 hours)

Ground truth: You need permission to access the Nivel files. Ask via email: cristian.tejedorgarcia@ru.nl

| ASR system | WER (%) | GPUs |
|---|---|---|
| Whisper-large-v2 | 34.1 | No |
| Kaldi-NL | 68.5 | No |
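
When comparing two systems on the same dataset, the relative WER reduction is often easier to interpret than the absolute difference. The sketch below applies it to the two Nivel_Train figures reported above. The helper name `relative_wer_reduction` is hypothetical and introduced only for this illustration. The calculation says nothing about statistical significance.

```python
def relative_wer_reduction(baseline_wer: float, system_wer: float) -> float:
    """Return the percentage by which system_wer improves on baseline_wer."""
    if baseline_wer <= 0:
        raise ValueError("baseline WER must be positive")
    return 100.0 * (baseline_wer - system_wer) / baseline_wer


# WER values (%) copied from the Nivel_Train results above.
nivel_train = {"Whisper-large-v2": 34.1, "Kaldi-NL": 68.5}

reduction = relative_wer_reduction(
    baseline_wer=nivel_train["Kaldi-NL"],
    system_wer=nivel_train["Whisper-large-v2"],
)
print(f"{reduction:.1f}%")  # 50.2%
```

A negative return value means the second system has a higher WER than the baseline.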