A distributional semantic methodology for enhanced search in historical records: A case study on smell
View / Open Files
Authors
McGregor, S
McGillivray, B
Publication Date
2018-12-13Journal Title
KONVENS 2018 - Conference on Natural Language Processing / Die Konferenz zur Verarbeitung Naturlicher Sprache
Conference Name
14th Conference on Natural Language Processing (KONVENS 2018)
Publisher
Austrian Academy of Sciences Press
Pages
1-11
Type
Conference Object
This Version
AM
Metadata
Show full item recordCitation
McGregor, S., & McGillivray, B. (2018). A distributional semantic methodology for enhanced search in historical records: A case study on smell. KONVENS 2018 - Conference on Natural Language Processing / Die Konferenz zur Verarbeitung Naturlicher Sprache, 1-11. https://doi.org/10.17863/CAM.33896
Abstract
In this paper we present a methodology based on distributional semantic models that can be flexibly adapted to the specific challenges posed by historical texts and that allow users to retrieve semantically relevant text without the need to close-read the documents. We focus on a case study concerned with detecting smell-related sentences in historical medical reports. We demonstrate a process for moving from generic domain label input to a more nuanced evaluation of the semantics of smell in a set of sentences extracted from this corpus, and then develop a machine learning technique for compounding scores on a variety of modelling parameters into more effective classifications.
Sponsorship
This work was supported by the Chist-ERA Atlantis project. This work was supported by The Alan Turing Institute under the EPSRC grant EP/N510129/1.
Funder references
Alan Turing Institute (EP/N510129/1)
Identifiers
External DOI: https://doi.org/10.17863/CAM.33896
This record's URL: https://www.repository.cam.ac.uk/handle/1810/287205
Rights
Licence:
http://creativecommons.org/licenses/by/4.0/
Statistics
Total file downloads (since January 2020). For more information on metrics see the
IRUS guide.
Recommended or similar items
The current recommendation prototype on the Apollo Repository will be turned off on 03 February 2023. Although the pilot has been fruitful for both parties, the service provider IKVA is focusing on horizon scanning products and so the recommender service can no longer be supported. We recognise the importance of recommender services in supporting research discovery and are evaluating offerings from other service providers. If you would like to offer feedback on this decision please contact us on: support@repository.cam.ac.uk