Neural networks for abstraction and reasoning: Towards broad generalization in machines

doi:10.21203/rs.3.rs-4296928/v1

Download PDF

Article

Neural networks for abstraction and reasoning: Towards broad generalization in machines

https://doi.org/10.21203/rs.3.rs-4296928/v1

This work is licensed under a CC BY 4.0 License

You are reading this latest preprint version

For half a century, artificial intelligence research has attempted to reproduce the human qualities of abstraction and reasoning - creating computer systems that can learn new concepts from a minimal set of examples, in settings where humans find this easy. While specific neural networks are able to solve an impressive range of problems, broad generalisation to situations outside their training data has proved elusive.

In this work, we look at several novel approaches for solving the Abstraction and Reasoning Corpus (ARC). This is a dataset of abstract visual reasoning tasks introduced to test algorithms on broad generalization. Despite three international competitions with $100,000 in prizes, the best algorithms still fail to solve a majority of ARC tasks. The best solvers today rely on complex hand-crafted rules, without using machine learning at all. We revisit whether recent advances in neural networks allow progress on this task, or whether an entirely different class of models are required.

First, we adapt the DreamCoder Neurosymbolic reasoning solver to ARC. DreamCoder automatically writes programs in a bespoke domain-specific language to perform reasoning, using a neural network to mimic human intuition. We present the Perceptual Abstraction and Reasoning Language (PeARL) language, which allow DreamCoder to solve ARC tasks, and propose a new recognition model that allows us to significantly improve on the previous best implementation.

We also propose a new encoding and augmentation scheme that allows large language models (LLMs) to solve ARC tasks, and find that the largest models can solve some ARC tasks. LLMs are able to solve a different group of problems to state-of-the-art solvers, and provide an interesting way to complement other approaches.

We perform an ensemble analysis, combining models to achieve better results than any system alone. Finally, we publish the arckit Python library to make future research on ARC easier.

Physical sciences/Mathematics and computing/Computational science

Physical sciences/Mathematics and computing/Computer science

Physical sciences/Mathematics and computing/Information technology

No competing interests reported.

scirepbroadgeneralizationSUPP.pdf

Download PDF

Editorial decision: Revision requested
26 May, 2024
Reviews received at journal
25 May, 2024
Reviews received at journal
21 May, 2024
Reviews received at journal
21 May, 2024
Reviews received at journal
09 May, 2024
Reviewers agreed at journal
30 Apr, 2024
Reviewers agreed at journal
28 Apr, 2024
Reviewers agreed at journal
25 Apr, 2024
Reviewers invited by journal
25 Apr, 2024
Editor assigned by journal
25 Apr, 2024
Editor invited by journal
24 Apr, 2024
Submission checks completed at journal
24 Apr, 2024
First submitted to journal
20 Apr, 2024

You are reading this latest preprint version

Neural networks for abstraction and reasoning: Towards broad generalization in machines

Status:

Version 1

Abstract

Full Text

Additional Declarations

Supplementary Files

Status:

Version 1