N-Best 2008 Dutch
The N-Best 2008 Dutch Evaluation corpus was designed to evaluate Dutch/Flemish speech recognition systems in 2008. The corpus consists of 4 subsets:
bn_nl: Broadcast News programmes in the Netherlands;
cts_nl: Conversational Telephone Speech in the Netherlands;
bn_vl: Broadcast News programmes in Belgium;
cts_vl: Conversational Telephone Speech in Belgium.
For more details about the corpus, click here.
The subset used in this benchmark is bn_nl (Broadcast News programmes in the Netherlands).
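The subset identifiers follow a `<domain>_<region>` pattern, so they can be expanded mechanically. The following is a minimal sketch of that naming scheme only; the names `DOMAINS`, `REGIONS`, `describe_subset`, and `BENCHMARK_SUBSET` are hypothetical and not part of the corpus or of any benchmark tooling.

```python
# Hypothetical helper illustrating the subset naming scheme described above.
# None of these names come from the corpus distribution itself.
DOMAINS = {
    "bn": "Broadcast News programmes",
    "cts": "Conversational Telephone Speech",
}
REGIONS = {
    "nl": "the Netherlands",
    "vl": "Belgium",
}


def describe_subset(subset_id: str) -> str:
    """Expand an identifier such as 'bn_nl' into its description."""
    domain, sep, region = subset_id.partition("_")
    if not sep or domain not in DOMAINS or region not in REGIONS:
        raise ValueError(f"unknown subset identifier: {subset_id!r}")
    return f"{DOMAINS[domain]} in {REGIONS[region]}"


BENCHMARK_SUBSET = "bn_nl"  # the subset used in this benchmark
print(describe_subset(BENCHMARK_SUBSET))
# prints: Broadcast News programmes in the Netherlands
```

Rejecting unknown identifiers with `ValueError` keeps a typo such as `bn_be` from silently selecting the wrong data.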
Detailed results per pipeline component for WhisperX
Hardware setup
The benchmark was run on a high-performance computing cluster with 2 x Nvidia Quadro RTX 6000 GPUs (24 GiB VRAM each) running CUDA 12.4, an Intel(R) Xeon(R) Gold 5220 CPU @ 2.20GHz, and 256 GB of RAM.
The OS installed on the cluster is RHEL 9.3.
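When reproducing the benchmark on other hardware, it can help to record the same kind of details alongside the results. The sketch below is one possible way to do that and is not the script used for this benchmark. The function name `describe_environment` is hypothetical, and the sketch assumes `nvidia-smi` is on the `PATH` when an Nvidia GPU is present. It does not capture the CUDA version.

```python
import platform
import shutil
import subprocess


def describe_environment() -> dict:
    """Collect basic OS, Python, and GPU details for a benchmark report."""
    info = {
        "os": platform.platform(),
        "machine": platform.machine(),
        "python": platform.python_version(),
        "gpus": [],
    }
    # nvidia-smi ships with the NVIDIA driver; skip GPU details if it is absent.
    if shutil.which("nvidia-smi"):
        result = subprocess.run(
            [
                "nvidia-smi",
                "--query-gpu=name,memory.total,driver_version",
                "--format=csv,noheader",
            ],
            capture_output=True,
            text=True,
            check=False,
        )
        if result.returncode == 0:
            # One CSV line per GPU, e.g. name, total memory, driver version.
            info["gpus"] = [
                line.strip() for line in result.stdout.splitlines() if line.strip()
            ]
    return info


if __name__ == "__main__":
    for key, value in describe_environment().items():
        print(f"{key}: {value}")
```

On a machine without an Nvidia driver the `gpus` list is simply empty, so the same script can run on CPU-only nodes.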