View on GitHub

Dutch Open Speech Recognition Benchmark

Results of Dutch ASR models, collected by the community

This file presents the average (avg.) WER scores of the four datasets tested within the HoMed project (2021-2024).
Kaldi_NL was fine-tuned with 50 hours of real conversations between patient and medical providers at Nivel facilities.
The concrete results (INS, DEL, SUB) of each one of the files of the three datasets are included in the .sys files generated with SCLITE.

1.Medicijnjournaal (MJ): 35 files

ASR system	WER (%)	GPUs
Wav2vec2.0	12.8	Yes
Kaldi-NL	16.1	Yes

2.Medical video material (MV): 11 files.

Ground truth: Ask via email: cristian.tejedorgarcia@ru.nl

Notes: Monologues, dialogues, music in background, good audio quality overall.

3.Patient-provider medical conversations at Nivel originally inteded for testing purposes (Nivel): 38 files (~10 hours)

Ground truth: You need permission to access to Nivel files. Ask via email: cristian.tejedorgarcia@ru.nl

4.Patient-provider medical conversations at Nivel originally inteded for training purposes (Nivel_Train): 110 files (~40 hours)

Ground truth: You need permission to access to Nivel files. Ask via email: cristian.tejedorgarcia@ru.nl

ASR system	WER (%)	GPUs
Whisper-large-v2	34.1	No
Kaldi-NL	68.5	No