Repository logo
 

Leveraging type descriptions for zero-shot named entity recognition and classification

cam.orpheus.successMon Nov 15 07:30:18 GMT 2021 - Embargo updated
dc.contributor.authorAly, R
dc.contributor.authorVlachos, A
dc.contributor.authorMcDonald, R
dc.contributor.orcidVlachos, Andreas [0000-0003-2123-5071]
dc.date.accessioned2021-11-09T00:31:14Z
dc.date.available2021-11-09T00:31:14Z
dc.date.issued2021
dc.description.abstractA common issue in real-world applications of named entity recognition and classification (NERC) is the absence of annotated data for target entity classes during training. Zeroshot learning approaches address this issue by learning models that can transfer information from observed classes in the training data to unseen classes. This paper presents the first approach for zero-shot NERC, introducing a novel architecture that leverage the fact that textual descriptions for many entity classes occur naturally. Our architecture addresses the zero-shot NERC specific challenge that the not-an-entity class is not well defined, since different entity classes are considered in training and testing. For evaluation, we adapt two datasets, OntoNotes and MedMentions, emulating the difficulty of real-world zero-shot learning by testing models on the rarest entity classes. Our proposed approach outperforms baselines adapted from machine reading comprehension and zero-shot text classification. Furthermore, we assess the effect of different class descriptions for this task.
dc.identifier.doi10.17863/CAM.77922
dc.identifier.isbn9781954085527
dc.identifier.urihttps://www.repository.cam.ac.uk/handle/1810/330478
dc.language.isoeng
dc.publisherAssociation for Computational Linguistics
dc.publisher.urlhttp://dx.doi.org/10.18653/v1/2021.acl-long.120
dc.rightsAttribution 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.titleLeveraging type descriptions for zero-shot named entity recognition and classification
dc.typeConference Object
dcterms.dateAccepted2021-05-05
prism.endingPage1528
prism.publicationDate2021
prism.publicationNameACL-IJCNLP 2021 - 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Proceedings of the Conference
prism.startingPage1516
pubs.conference-finish-date2021-08
pubs.conference-nameProceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
pubs.conference-start-date2021-08
pubs.funder-project-idEuropean Commission Horizon 2020 (H2020) ERC (865958)
pubs.funder-project-idEngineering and Physical Sciences Research Council (EP/R021643/2)
rioxxterms.licenseref.startdate2021-01-01
rioxxterms.licenseref.urihttp://www.rioxx.net/licenses/all-rights-reserved
rioxxterms.typeConference Paper/Proceeding/Abstract
rioxxterms.versionVoR
rioxxterms.versionofrecord10.18653/v1/2021.acl-long.120

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
2021.acl-long.120.pdf
Size:
431.83 KB
Format:
Adobe Portable Document Format
Description:
Published version
Licence
https://creativecommons.org/licenses/by/4.0/
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
DepositLicenceAgreementv2.1.pdf
Size:
150.9 KB
Format:
Adobe Portable Document Format