View on GitHub

Dutch Open Speech Recognition Benchmark

Results of Dutch ASR models, collected by the community

N-Best 2008 Dutch

The N-Best 2008 Dutch Evaluation corpus is a corpus designed to evaluate Dutch/Flemish Speech Recognition systems in 2008. The corpus consists of 4 subsets:

bn_nl: Broadcast News programmes in the Netherlands;
cts_nl: Conversational Telephone Speech in the Netherlands;
bn_vl: Broadcast News programmes in Belgium;
cts_vl: Conversational Telephone Speech in Belgium.

For more details about the corpus, click here.

The subset used in this benchmark is bn_nl (Broadcast News programmes in the Netherlands).

Detailed results per pipeline component for WhisperX

Click here

Hardware setup

A high-performance computing cluster was used. The cluster’s hardware consists of 2 x Nvidia Quadro RTX 6000 with 24 GiB VRAM each, using CUDA version 12.4, with an Intel(R) Xeon(R) Gold 5220 CPU @ 2.20GHz and 256 GB of RAM available.

The OS installed on the cluster is RHEL 9.3.