Supplementary data for "Improved DNN-based Segmentation for Multi-genre Broadcast Audio"

Name: Supplementary data for "Improved DNN-based Segmentation for Multi-genre Broadcast Audio"
Published: 2016-01-21T22:34:04Z
Keywords: Speech Recognition, Deep Neural Networks, Speech Segmentation

Wang, L.; Zhang, C.; Woodland, P. C.; Gales, M. J. F.; Karanasou, P.; Lanchantin, P.; Liu, X.; Qian, Y.

Supplementary data for "Improved DNN-based Segmentation for Multi-genre Broadcast Audio"

Repository URI

https://www.repository.cam.ac.uk/handle/1810/253408

Repository DOI

https://doi.org/10.17863/CAM.69120

Files

wang-icassp16-dnnseg.zip (21.25 MB)

README (1.16 KB)

Type

Dataset

Authors

Wang, L.

Zhang, C.

Woodland, P. C.

https://orcid.org/0000-0001-9069-0225

Gales, M. J. F.

Karanasou, P.

Show 3 more

Description

Description of the Speech Recognition Training and Test Data and its Availability used for Experiments. Key Speech Recognition Outputs/Detailed Scoring Results used in the paper.

Software / Usage instructions

Speech Recognition outputs are in NIST ctm format (a standard text format used for speech recognition) and .sys files are output of speech recognition scoring process

Keywords

Speech Recognition, Deep Neural Networks, Speech Segmentation

Publisher

University of Cambridge

Rights

Attribution-NonCommercial-NoDerivs 2.0 UK: England & Wales

Sponsorship

This work was supported by the EPSRC [grant number EP/I031022/1] and by Cambridge Commonwealth, European & International Trust.

Relationships

Supplements:

https://doi.org/10.1109/ICASSP.2016.7472769

Collections

Research Data - Engineering
Symplectic mapped items for data match

Supplementary data for "Improved DNN-based Segmentation for Multi-genre Broadcast Audio"

Repository URI

Repository DOI

Files

Type

Change log

Authors

Description

Version

Software / Usage instructions

Keywords

Publisher

Rights

Sponsorship

Relationships

Collections