CAMB at CWI Shared Task 2018: Complex Word Identification with Ensemble-Based Voting
View / Open Files
Authors
Gooding, Sian
Kochmar, Ekaterina
Publication Date
2018Journal Title
Proceedings of the Thirteenth Workshop on Innovative Use of NLP for
Building Educational Applications
Conference Name
Proceedings of the Thirteenth Workshop on Innovative Use of NLP for
Building Educational Applications
ISBN
978-1-948087-11-7
Publisher
Association for Computational Linguistics
Type
Conference Object
This Version
AM
Metadata
Show full item recordCitation
Gooding, S., & Kochmar, E. (2018). CAMB at CWI Shared Task 2018: Complex Word Identification with
Ensemble-Based Voting. Proceedings of the Thirteenth Workshop on Innovative Use of NLP for
Building Educational Applications https://doi.org/10.18653/v1/w18-0520
Abstract
This paper presents the winning systems we submitted to the Complex Word Identification Shared Task 2018. We describe our best performing systems’ implementations and discuss our key findings from this research. Our best-performing systems achieve an F1 score of 0.8736 on the NEWS, 0.8400 on the WIKINEWS and 0.8115 on the WIKIPEDIA test sets in the monolingual English binary classification track, and a mean absolute error of 0.0558 on the NEWS, 0.0674 on the WIKINEWS and 0.0739 on the WIKIPEDIA test sets in the probabilistic track.
Sponsorship
Cambridge Assessment (unknown)
Identifiers
External DOI: https://doi.org/10.18653/v1/w18-0520
This record's URL: https://www.repository.cam.ac.uk/handle/1810/298179
Rights
All rights reserved
Licence:
http://www.rioxx.net/licenses/all-rights-reserved
Statistics
Total file downloads (since January 2020). For more information on metrics see the
IRUS guide.