Supplementary data for "Improved DNN-based Segmentation for Multi-genre Broadcast Audio"
Gales, M. J. F.
University of Cambridge
MetadataShow full item record
Wang, L., Zhang, C., Woodland, P. C., Gales, M. J. F., Karanasou, P., Lanchantin, P., Liu, X., & et al. (2016). Supplementary data for "Improved DNN-based Segmentation for Multi-genre Broadcast Audio" [Dataset]. https://www.repository.cam.ac.uk/handle/1810/253408
Description of the Speech Recognition Training and Test Data and its Availability used for Experiments. Key Speech Recognition Outputs/Detailed Scoring Results used in the paper.
Speech Recognition outputs are in NIST ctm format (a standard text format used for speech recognition) and .sys files are output of speech recognition scoring process
Speech Recognition, Deep Neural Networks, Speech Segmentation
Publication Reference: https://doi.org/10.1109/ICASSP.2016.7472769
This work was supported by the EPSRC [grant number EP/I031022/1] and by Cambridge Commonwealth, European & International Trust.
This record's URL: https://www.repository.cam.ac.uk/handle/1810/253408
Attribution-NonCommercial-NoDerivs 2.0 UK: England & Wales
Licence URL: http://creativecommons.org/licenses/by-nc-nd/2.0/uk/
Recommended or similar items
The following licence files are associated with this item: