Topic or style? Exploring the most useful features for authorship attribution
dc.contributor.author | Sari, Y | |
dc.contributor.author | Stevenson, M | |
dc.contributor.author | Vlachos, Andreas | |
dc.date.accessioned | 2021-12-09T00:32:16Z | |
dc.date.available | 2021-12-09T00:32:16Z | |
dc.date.issued | 2018-01-01 | |
dc.identifier.isbn | 9781948087506 | |
dc.identifier.uri | https://www.repository.cam.ac.uk/handle/1810/331299 | |
dc.description.abstract | Approaches to authorship attribution, the task of identifying the author of a document, are based on analysis of individuals’ writing style and/or preferred topics. Although the problem has been widely explored, no previous studies have analysed the relationship between dataset characteristics and effectiveness of different types of features. This study carries out an analysis of four widely used datasets to explore how different types of features affect authorship attribution accuracy under varying conditions. The results of the analysis are applied to authorship attribution models based on both discrete and continuous representations. We apply the conclusions from our analysis to an extension of an existing approach to authorship attribution and outperform the prior state-of-the-art on two out of the four datasets used. | |
dc.publisher | Association for Computational Linguistics | |
dc.rights | Attribution 4.0 International | |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
dc.title | Topic or style? Exploring the most useful features for authorship attribution | |
dc.type | Conference Object | |
dc.publisher.department | Department of Computer Science And Technology | |
dc.date.updated | 2021-12-08T11:10:47Z | |
prism.endingPage | 353 | |
prism.publicationDate | 2018 | |
prism.publicationName | COLING 2018 - 27th International Conference on Computational Linguistics, Proceedings | |
prism.startingPage | 343 | |
dc.identifier.doi | 10.17863/CAM.78746 | |
rioxxterms.versionofrecord | 10.17863/CAM.78746 | |
rioxxterms.version | VoR | |
dc.contributor.orcid | Vlachos, Andreas [0000-0003-2123-5071] | |
dc.publisher.url | https://aclanthology.org/C18-1029/ | |
pubs.conference-name | The 27th International Conference on Computational Linguistics | |
pubs.conference-start-date | 2018-08-20 | |
cam.orpheus.success | 2022-05-11: embargo success field applied | |
cam.orpheus.counter | 6 | |
cam.depositDate | 2021-12-08 | |
pubs.conference-finish-date | 2018-08-26 | |
pubs.licence-identifier | apollo-deposit-licence-2-1 | |
pubs.licence-display-name | Apollo Repository Deposit Licence Agreement |
Files in this item
This item appears in the following Collection(s)
-
Cambridge University Research Outputs
Research outputs of the University of Cambridge