Introduction to the Special Issue on End-to-End Speech and Language Processing

Accepted version
Peer-reviewed

Type

Article

Authors

Ramabhadran, Bhuvana 
Chen, Nancy F 
Harper, Mary P 
Kingsbury, Brian 
Knill, KM 

Abstract

The eleven papers in this special section focus on end-to-end speech and language processing (SLP), which can be viewed as a series of sequence-to-sequence learning problems. Conventional SLP systems map input sequences to output sequences through module-based architectures in which each module is trained independently. These systems have a number of limitations, including local optima, assumptions about intermediate models and features, and complex steps driven by expert knowledge. As a result, they can be difficult for non-experts to use and to extend to new applications. Integrated end-to-end (E2E) systems aim to simplify the solution to these problems by using a single network architecture to map an input sequence directly to the desired output sequence, without the need for intermediate module representations. E2E models rely on flexible and powerful machine learning models such as recurrent neural networks. The emergence of models for end-to-end speech processing has lowered the barriers to entry into serious speech research. This special issue showcases the power of novel machine learning methods in end-to-end speech and language processing.

Keywords

46 Information and Computing Sciences, 4006 Communications Engineering, 40 Engineering, 4603 Computer Vision and Multimedia Computation

Journal Title

IEEE Journal of Selected Topics in Signal Processing

Journal ISSN

1932-4553
1941-0484

Volume Title

11

Publisher

Institute of Electrical and Electronics Engineers (IEEE)