Can large language models help predict results from a complex behavioural science study?
dc.citation.issue | 9 | |
dc.citation.volume | 11 | |
dc.contributor.author | Lippert S | |
dc.contributor.author | Dreber A | |
dc.contributor.author | Johannesson M | |
dc.contributor.author | Tierney W | |
dc.contributor.author | Cyrus-Lai W | |
dc.contributor.author | Uhlmann EL | |
dc.contributor.author | Emotion Expression Collaboration | |
dc.contributor.author | Pfeiffer T | |
dc.coverage.spatial | England | |
dc.date.accessioned | 2024-10-30T21:52:40Z | |
dc.date.available | 2024-10-30T21:52:40Z | |
dc.date.issued | 2024-09 | |
dc.description.abstract | We tested whether large language models (LLMs) can help predict results from a complex behavioural science experiment. In study 1, we investigated the performance of the widely used LLMs GPT-3.5 and GPT-4 in forecasting the empirical findings of a large-scale experimental study of emotions, gender, and social perceptions. We found that GPT-4, but not GPT-3.5, matched the performance of a cohort of 119 human experts, with correlations of 0.89 (GPT-4), 0.07 (GPT-3.5) and 0.87 (human experts) between aggregated forecasts and realized effect sizes. In study 2, providing participants from a university subject pool the opportunity to query a GPT-4 powered chatbot significantly increased the accuracy of their forecasts. Results indicate promise for artificial intelligence (AI) to help anticipate-at scale and minimal cost-which claims about human behaviour will find empirical support and which ones will not. Our discussion focuses on avenues for human-AI collaboration in science. | |
dc.description.confidential | false | |
dc.edition.edition | Sep 2024 | |
dc.format.pagination | 240682- | |
dc.identifier.author-url | https://www.ncbi.nlm.nih.gov/pubmed/39323554 | |
dc.identifier.citation | Lippert S, Dreber A, Johannesson M, Tierney W, Cyrus-Lai W, Uhlmann EL, Emotion Expression Collaboration , Pfeiffer T. (2024). Can large language models help predict results from a complex behavioural science study?. R Soc Open Sci. 11. 9. (pp. 240682-). | |
dc.identifier.doi | 10.1098/rsos.240682 | |
dc.identifier.eissn | 2054-5703 | |
dc.identifier.elements-type | journal-article | |
dc.identifier.issn | 2054-5703 | |
dc.identifier.pii | rsos240682 | |
dc.identifier.uri | https://mro.massey.ac.nz/handle/10179/71876 | |
dc.language | eng | |
dc.publisher | The Royal Society | |
dc.publisher.uri | https://royalsocietypublishing.org/doi/10.1098/rsos.240682 | |
dc.relation.isPartOf | R Soc Open Sci | |
dc.rights | (c) 2024 The Author/s | |
dc.rights | CC BY 4.0 | |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
dc.subject | forecasting | |
dc.subject | large language models | |
dc.subject | meta-research | |
dc.title | Can large language models help predict results from a complex behavioural science study? | |
dc.type | Journal article | |
pubs.elements-id | 491662 | |
pubs.organisational-group | College of Health |
Files
License bundle
1 - 1 of 1