N-Best 2008 Dutch
The N-Best 2008 Dutch Evaluation corpus is a corpus designed to evaluate Dutch/Flemish Speech Recognition systems in 2008. The corpus consists of 4 subsets:
bn_nl: Broadcast News programmes in the Netherlands;cts_nl: Conversational Telephone Speech in the Netherlands;bn_vl: Broadcast News programmes in Belgium;cts_vl: Conversational Telephone Speech in Belgium.
For more details about the corpus, click here.
The subset used in this benchmark is bn_nl (Broadcast News programmes in the Netherlands).
Detailed results per pipeline component for WhisperX
Hardware setup
A high-performance computing cluster was used. The cluster’s hardware consists of 2 x Nvidia Quadro RTX 6000 with 24 GiB VRAM each, using CUDA version 12.4, with an Intel(R) Xeon(R) Gold 5220 CPU @ 2.20GHz and 256 GB of RAM available.
The OS installed on the cluster is RHEL 9.3.