Presentation

· Presenters · Organizations · Search Program · Flagged · Happening Now · Maps · Notifications

Workshop

: Training Speech Recognition Models on HPC Infrastructure

SessionMachine Learning in HPC Environments

Author/Presenters

Deepthi Karkada

Vikram A. Saletore

Event Type

Workshop

Registration Categories

Tags

TimeSunday, November 11th5pm - 5:30pm

LocationD167/174

DescriptionAutomatic speech recognition is used extensively in speech interfaces and spoken dialogue systems. To accelerate the development of new speech recognition models and techniques, developers at Mozilla have open sourced a deep learning based Speech-To-Text engine known as project DeepSpeech based on Baidu’s DeepSpeech research. In order to make model training time quicker on CPUs for DeepSpeech distributed training, we have developed optimizations on the Mozilla DeepSpeech code to scale the model training to a large number of Intel® CPU system, including Horovod integration into DeepSpeech. We have also implemented a novel dataset partitioning scheme to mitigate compute imbalance across multiple nodes of an HPC cluster. We demonstrate that we are able to train the DeepSpeech model using the LibriSpeech clean dataset to its state-of-the-art accuracy in 6.45Hrs on 16-Node Intel® Xeon® based HPC cluster.

Program November 11–16, 2018

Exhibits November 12–15, 2018

KAY BAILEY HUTCHISON CONVENTION CENTER DALLAS

The International Conference for High Performance
Computing, Networking, Storage, and Analysis

Presentation