CAMB at CWI Shared Task 2018: Complex Word Identification with
            Ensemble-Based Voting

Gooding, Sian; Kochmar, Ekaterina

CAMB at CWI Shared Task 2018: Complex Word Identification with Ensemble-Based Voting

Accepted version

Peer-reviewed

Repository URI

https://www.repository.cam.ac.uk/handle/1810/298179

Repository DOI

https://doi.org/10.17863/CAM.45233

Files

Accepted version (173.59 KB)

Type

Conference Object

Authors

Gooding, Sian

Kochmar, Ekaterina

Abstract

This paper presents the winning systems we submitted to the Complex Word Identification Shared Task 2018. We describe our best performing systems’ implementations and discuss our key findings from this research. Our best-performing systems achieve an F1 score of 0.8736 on the NEWS, 0.8400 on the WIKINEWS and 0.8115 on the WIKIPEDIA test sets in the monolingual English binary classification track, and a mean absolute error of 0.0558 on the NEWS, 0.0674 on the WIKINEWS and 0.0739 on the WIKIPEDIA test sets in the probabilistic track.

Journal Title

Proceedings of the Thirteenth Workshop on Innovative Use of NLP for Building Educational Applications

Conference Name

Proceedings of the Thirteenth Workshop on Innovative Use of NLP for Building Educational Applications

Publisher

Association for Computational Linguistics

Publisher DOI

https://doi.org/10.18653/v1/w18-0520

Rights

Sponsorship

Cambridge Assessment (unknown)

Collections

Cambridge University Research Outputs