Introduction to the Special Issue on End-to-End Speech and Language Processing

Ramabhadran, Bhuvana; Chen, Nancy F; Harper, Mary P; Kingsbury, Brian; Knill, KM

Introduction to the Special Issue on End-to-End Speech and Language Processing

Accepted version

Peer-reviewed

Repository URI

https://www.repository.cam.ac.uk/handle/1810/288414

Repository DOI

https://doi.org/10.17863/CAM.21627

Files

Accepted version (479.35 KB)

Type

Article

Authors

Ramabhadran, Bhuvana

Chen, Nancy F

Harper, Mary P

Kingsbury, Brian

Knill, KM

Abstract

The eleven papers in this special section focus on end-to-end speech and language processing (SLP) which is a series of sequence-to-sequence learning problems. Conventional SLP systems map input to output sequences through module-based architectures where each module is independently trained. These have a number of limitations including local optima, assumptions about intermediate models and features, and complex expert knowledge driven steps. It can be difficult for non-experts to use and develop new applications. Integrated End-to-End (E2E) systems aim to simplify the solution to these problems through a single network architecture to map an input sequence directly to the desired output sequence without the need for intermediate module representations. E2E models rely on flexible and powerful machine learning models such as recurrent neural networks. The emergence of models for end-to-end speech processing has lowered the barriers to entry into serious speech research. This special issue showcases the power of novel machine learning methods in end-to-end speech and language processing.

Keywords

46 Information and Computing Sciences, 4006 Communications Engineering, 40 Engineering, 4603 Computer Vision and Multimedia Computation

Journal Title

IEEE Journal of Selected Topics in Signal Processing

Journal ISSN

1932-4553
1941-0484

Volume Title

11

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Publisher DOI

https://doi.org/10.1109/jstsp.2017.2767938

Rights

http://www.rioxx.net/licenses/all-rights-reserved

Collections

Cambridge University Research Outputs