Show simple item record

dc.contributor.authorAntonazzo, Giuliaen
dc.contributor.authorUrbano, Jose Men
dc.contributor.authorMarygold, Stevenen
dc.contributor.authorMillburn, Gillianen
dc.contributor.authorBrown, Nicholasen
dc.date.accessioned2019-12-20T00:30:21Z
dc.date.available2019-12-20T00:30:21Z
dc.date.issued2020-01en
dc.identifier.issn1758-0463
dc.identifier.urihttps://www.repository.cam.ac.uk/handle/1810/300148
dc.description.abstractBrief summaries describing the function of each gene’s product(s) are of great value to the research community, especially when interpreting genome-wide studies that reveal changes to hundreds of genes. However, manually writing such summaries, even for a single species, is a daunting task; for example, the Drosophila melanogaster genome contains almost 14,000 protein-coding genes. One solution is to use computational methods to generate summaries, but this often fails to capture the key functions or express them eloquently. Here, we describe how we solicited help from the research community to generate manually written summaries of D. melanogaster gene function. Based on the data within the FlyBase database, we developed a computational pipeline to identify researchers who have worked extensively on each gene. We e-mailed these researchers to ask them to draft a brief summary of the main function(s) of the gene’s product, which we edited for consistency to produce a “gene snapshot”. This approach yielded 1,800 gene snapshot submissions within a three-month period. We discuss the general utility of this strategy for other databases that capture data from the research literature. Database URL: https://flybase.org/
dc.description.sponsorshipMedical Research Council (UK) [G1000968 and MR/N030117/1] National Human Genome Research Institute at the National Institutes of Health [U41 HG00739]
dc.format.mediumPrinten
dc.languageengen
dc.publisherOxford University Press
dc.rightsAll rights reserved
dc.rights.uri
dc.subjectAnimalsen
dc.subjectDrosophila melanogasteren
dc.subjectData Collectionen
dc.subjectSoftwareen
dc.subjectDatabases, Geneticen
dc.subjectGenome, Insecten
dc.titleBuilding a pipeline to solicit expert knowledge from the community to aid gene summary curation.en
dc.typeArticle
prism.publicationDate2020en
prism.publicationNameDatabase : the journal of biological databases and curationen
prism.volume2020en
dc.identifier.doi10.17863/CAM.47219
dcterms.dateAccepted2019-12-12en
rioxxterms.versionofrecord10.1093/database/baz152en
rioxxterms.versionAM
rioxxterms.licenseref.urihttp://www.rioxx.net/licenses/all-rights-reserveden
rioxxterms.licenseref.startdate2020-01en
dc.contributor.orcidAntonazzo, Giulia [0000-0003-0086-5621]
dc.contributor.orcidUrbano, Jose M [0000-0002-6644-8129]
dc.contributor.orcidMarygold, Steven [0000-0003-2759-266X]
dc.contributor.orcidBrown, Nicholas [0000-0002-8958-7017]
dc.identifier.eissn1758-0463
rioxxterms.typeJournal Article/Reviewen
pubs.funder-project-idMRC (G1000968)
pubs.funder-project-idMRC (MR/N030117/1)
pubs.funder-project-idNational Institutes of Health (NIH) (via Harvard School of Public Health) (132685-5104589)
pubs.funder-project-idNational Institutes of Health (NIH) (via Harvard University) (132685-5104589)
pubs.funder-project-idNational Institutes of Health (NIH) (via Harvard University) (132626-5085854)
cam.orpheus.successMon Jun 08 08:20:05 BST 2020 - The item has an open VoR version.*
rioxxterms.freetoread.startdate2022-12-19


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record