A distributional semantic methodology for enhanced search in historical records: A case study on smell

cam.issuedOnline: 2018-12-13
dc.contributor.author: McGregor, S
dc.contributor.author: McGillivray, B
dc.contributor.orcid: McGillivray, Barbara [0000-0003-3426-8200]
dc.date.accessioned: 2018-12-20T00:30:18Z
dc.date.available: 2018-12-20T00:30:18Z
dc.date.issued: 2018-12-13
dc.description.abstract: In this paper we present a methodology based on distributional semantic models that can be flexibly adapted to the specific challenges posed by historical texts and that allows users to retrieve semantically relevant text without the need to close-read the documents. We focus on a case study concerned with detecting smell-related sentences in historical medical reports. We demonstrate a process for moving from generic domain label input to a more nuanced evaluation of the semantics of smell in a set of sentences extracted from this corpus, and then develop a machine learning technique for compounding scores on a variety of modelling parameters into more effective classifications.
dc.description.sponsorship: This work was supported by the Chist-ERA Atlantis project. This work was supported by The Alan Turing Institute under the EPSRC grant EP/N510129/1.
dc.identifier.doi: 10.17863/CAM.33896
dc.identifier.uri: https://www.repository.cam.ac.uk/handle/1810/287205
dc.language.iso: eng
dc.publisher: Austrian Academy of Sciences Press
dc.publisher.url: https://austriaca.at/8437-9
dc.title: A distributional semantic methodology for enhanced search in historical records: A case study on smell
dc.type: Conference Object
dcterms.dateAccepted: 2018-07-18
prism.endingPage: 11
prism.publicationDate: 2018
prism.publicationName: KONVENS 2018 - Conference on Natural Language Processing / Die Konferenz zur Verarbeitung Natürlicher Sprache
prism.startingPage: 1
pubs.conference-finish-date: 2018-09-21
pubs.conference-name: 14th Conference on Natural Language Processing (KONVENS 2018)
pubs.conference-start-date: 2018-09-19
pubs.funder-project-id: Alan Turing Institute (EP/N510129/1)
rioxxterms.licenseref.startdate: 2018-01-01
rioxxterms.licenseref.uri: http://creativecommons.org/licenses/by/4.0/
rioxxterms.type: Conference Paper/Proceeding/Abstract
rioxxterms.version: AM
rioxxterms.versionofrecord: 10.17863/CAM.33896

Files

Original bundle
Name: main.pdf
Size: 163.39 KB
Format: Adobe Portable Document Format
Description: Accepted version
Licence: http://creativecommons.org/licenses/by/4.0/
License bundle
Name: DepositLicenceAgreementv2.1.pdf
Size: 150.9 KB
Format: Adobe Portable Document Format