Variable typing: Assigning meaning to variables in mathematical text
View / Open Files
Publication Date
2018-01-01Journal Title
NAACL HLT 2018 - 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference
ISBN
9781948087278
Volume
1
Pages
303-312
Type
Conference Object
Metadata
Show full item recordCitation
Stathopoulos, Y., Baker, S., Rei, M., & Teufel, S. (2018). Variable typing: Assigning meaning to variables in mathematical text. NAACL HLT 2018 - 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference, 1 303-312. https://doi.org/10.17863/CAM.30845
Abstract
Information about the meaning of mathematical variables in text is useful in NLP/IR tasks such as symbol disambiguation, topic modeling and mathematical information retrieval (MIR). We introduce variable typing, the task of assigning one mathematical type (multi-word technical terms referring to mathematical concepts) to each variable in a sentence of mathematical text. As part of this work, we also introduce a new annotated data set composed of 33,524 data points extracted from scientific documents published on arXiv. Our intrinsic evaluation demonstrates that our data set is sufficient to successfully train and evaluate current classifiers from three different model architectures. The best performing model is evaluated on an extrinsic task: MIR, by producing a typed formula index. Our results show that the best performing MIR models make use of our typed index, compared to a formula index only containing raw symbols, thereby demonstrating the usefulness of variable typing.
Identifiers
This record's DOI: https://doi.org/10.17863/CAM.30845
This record's URL: https://www.repository.cam.ac.uk/handle/1810/283479
Rights
Licence:
http://www.rioxx.net/licenses/all-rights-reserved