Direct Human-AI Comparison in the Animal-AI Environment.
Authors
Voudouris, Konstantinos
Crosby, Matthew
Beyret, Benjamin
Hernández-Orallo, José
Shanahan, Murray
Halina, Marta
Cheke, Lucy G
Publication Date
2022Journal Title
Front Psychol
ISSN
1664-1078
Publisher
Frontiers Media SA
Volume
13
Language
en
Type
Article
This Version
VoR
Metadata
Show full item recordCitation
Voudouris, K., Crosby, M., Beyret, B., Hernández-Orallo, J., Shanahan, M., Halina, M., & Cheke, L. G. (2022). Direct Human-AI Comparison in the Animal-AI Environment.. Front Psychol, 13 https://doi.org/10.3389/fpsyg.2022.711821
Abstract
Artificial Intelligence is making rapid and remarkable progress in the development of more sophisticated and powerful systems. However, the acknowledgement of several problems with modern machine learning approaches has prompted a shift in AI benchmarking away from task-oriented testing (such as Chess and Go) towards ability-oriented testing, in which AI systems are tested on their capacity to solve certain kinds of novel problems. The Animal-AI Environment is one such benchmark which aims to apply the ability-oriented testing used in comparative psychology to AI systems. Here, we present the first direct human-AI comparison in the Animal-AI Environment, using children aged 6-10 (n = 52). We found that children of all ages were significantly better than a sample of 30 AIs across most of the tests we examined, as well as performing significantly better than the two top-scoring AIs, "ironbar" and "Trrrrr," from the Animal-AI Olympics Competition 2019. While children and AIs performed similarly on basic navigational tasks, AIs performed significantly worse in more complex cognitive tests, including detour tasks, spatial elimination tasks, and object permanence tasks, indicating that AIs lack several cognitive abilities that children aged 6-10 possess. Both children and AIs performed poorly on tool-use tasks, suggesting that these tests are challenging for both biological and non-biological machines.
Keywords
AI benchmarks, Animal-AI Olympics, artificial intelligence, cognitive AI, comparative cognition, human-AI comparison, out-of-distribution testing
Identifiers
External DOI: https://doi.org/10.3389/fpsyg.2022.711821
This record's URL: https://www.repository.cam.ac.uk/handle/1810/337881
Rights
Licence:
http://creativecommons.org/licenses/by/4.0/
Statistics
Total file downloads (since January 2020). For more information on metrics see the
IRUS guide.
Recommended or similar items
The current recommendation prototype on the Apollo Repository will be turned off on 03 February 2023. Although the pilot has been fruitful for both parties, the service provider IKVA is focusing on horizon scanning products and so the recommender service can no longer be supported. We recognise the importance of recommender services in supporting research discovery and are evaluating offerings from other service providers. If you would like to offer feedback on this decision please contact us on: support@repository.cam.ac.uk