ThaleMine: A Warehouse for Arabidopsis Data Integration and Discovery.
Authors
Krishnakumar, Vivek
Contrino, Sergio
Cheng, Chia-Yi
Belyaeva, Irina
Ferlanti, Erik S
Miller, Jason R
Vaughn, Matthew W
Town, Christopher D
Chan, Agnes P
Publication Date
2017-01-01
Journal Title
Plant and Cell Physiology
ISSN
0032-0781
Publisher
Oxford University Press (OUP)
Volume
58
Issue
1
Pages
e4-e4
Language
eng
Type
Article
Citation
Krishnakumar, V., Contrino, S., Cheng, C., Belyaeva, I., Ferlanti, E. S., Miller, J. R., Vaughn, M. W., et al. (2017). ThaleMine: A Warehouse for Arabidopsis Data Integration and Discovery. Plant and Cell Physiology, 58 (1), e4-e4. https://doi.org/10.1093/pcp/pcw200
Abstract
ThaleMine (https://apps.araport.org/thalemine/) is a comprehensive data warehouse that integrates a wide array of genomic information of the model plant Arabidopsis thaliana. The data collection currently includes the latest structural and functional annotation from the Araport11 update, the Col-0 genome sequence, RNA-seq and array expression, co-expression, protein interactions, homologs, pathways, publications, alleles, germplasm and phenotypes. The data are collected from a wide variety of public resources. Users can browse gene-specific data through Gene Report pages, identify and create gene lists based on experiments or indexed keywords, and run GO enrichment analysis to investigate the biological significance of selected gene sets. Developed by the Arabidopsis Information Portal project (Araport, https://www.araport.org/), ThaleMine uses the InterMine software framework, which builds well-structured data and provides powerful data query and analysis functionality. The warehoused data can be accessed by users via graphical interfaces, as well as programmatically via web services. Here we describe recent developments in ThaleMine including new features and extensions, and discuss future improvements. InterMine has been broadly adopted by the model organism research community including nematode, rat, mouse, zebrafish, budding yeast, the modENCODE project, as well as being used for human data. ThaleMine is the first InterMine developed for a plant model. As additional new plant InterMines are developed by the legume and other plant research communities, the potential of cross-organism integrative data analysis will be further enabled.
Keywords
Arabidopsis thaliana; InterMine; data integration; data warehouse; genomics; web services; Arabidopsis; Arabidopsis Proteins; Computational Biology; Databases, Genetic; Gene Expression Profiling; Gene Expression Regulation, Plant; Gene Ontology; Genomics; Information Storage and Retrieval; Internet; Protein Interaction Mapping; Protein Interaction Maps; Reproducibility of Results; Sequence Analysis, RNA
Sponsorship
Biotechnology and Biological Sciences Research Council (BB/L027151/1)
Identifiers
External DOI: https://doi.org/10.1093/pcp/pcw200
This record's URL: https://www.repository.cam.ac.uk/handle/1810/284117
Rights
Licence:
http://www.rioxx.net/licenses/all-rights-reserved