Publications
121 results found
Fountas Z, Sylaidi A, Nikiforou K, et al., 2022, A Predictive Processing Model of Episodic Memory and Time Perception, NEURAL COMPUTATION, Vol: 34, Pages: 1501-1544, ISSN: 0899-7667
- Citations: 2
Voudouris K, Crosby M, Beyret B, et al., 2022, Direct Human-AI Comparison in the Animal-AI Environment, FRONTIERS IN PSYCHOLOGY, Vol: 13, ISSN: 1664-1078
- Citations: 1
Shanahan M, Mitchell M, 2022, Abstraction for Deep Reinforcement Learning, International Joint Conference on Artificial Intelligence (IJCAI-ECAI 2022), Publisher: IJCAI, Pages: 5588-5596, ISSN: 1045-0823
We characterise the problem of abstraction in the context of deep reinforcement learning. Various well established approaches to analogical reasoning and associative memory might be brought to bear on this issue, but they present difficulties because of the need for end-to-end differentiability. We review developments in AI and machine learning that could facilitate their adoption.
Mediano PAM, Rosas FE, Farah JC, et al., 2022, Integrated information as a common signature of dynamical and information-processing complexity, Chaos: an interdisciplinary journal of nonlinear science, Vol: 32, Pages: 1-12, ISSN: 1054-1500
The apparent dichotomy between information-processing and dynamical approaches to complexity science forces researchers to choose between two diverging sets of tools and explanations, creating conflict and often hindering scientific progress. Nonetheless, given the shared theoretical goals between both approaches, it is reasonable to conjecture the existence of underlying common signatures that capture interesting behavior in both dynamical and information-processing systems. Here, we argue that a pragmatic use of integrated information theory (IIT), originally conceived in theoretical neuroscience, can provide a potential unifying framework to study complexity in general multivariate systems. By leveraging metrics put forward by the integrated information decomposition framework, our results reveal that integrated information can effectively capture surprisingly heterogeneous signatures of complexity—including metastability and criticality in networks of coupled oscillators as well as distributed computation and emergent stable particles in cellular automata—without relying on idiosyncratic, ad hoc criteria. These results show how an agnostic use of IIT can provide important steps toward bridging the gap between informational and dynamical approaches to complex systems.
Originally conceived within theoretical neuroscience, integrated information theory (IIT) has been rarely used in other fields—such as complex systems or non-linear dynamics—despite the great value it has to offer. In this article, we inspect the basics of IIT, dissociating it from its contentious claims about the nature of consciousness. Relieved of this philosophical burden, IIT presents itself as an appealing formal framework to study complexity in biological or artificial systems, applicable in a wide range of domains. To illustrate this, we present an exploration of integrated information in complex systems and relate it to other notions of complexity commonly used in sys…
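In its simplest form, the integrated-information idea above amounts to comparing the temporal mutual information of a whole system against the sum over its parts. Below is a minimal numpy sketch of that "whole-minus-sum" quantity under a Gaussian approximation; the function names and the two-channel demo are illustrative, not taken from the paper's analysis code.

```python
import numpy as np

def gaussian_mi(x, y):
    """Mutual information (nats) between jointly Gaussian samples.
    x: (n_samples, dx), y: (n_samples, dy)."""
    def logdet_cov(a):
        return np.linalg.slogdet(np.atleast_2d(np.cov(a, rowvar=False)))[1]
    return 0.5 * (logdet_cov(x) + logdet_cov(y) - logdet_cov(np.hstack([x, y])))

def phi_whole_minus_sum(data, tau=1):
    """'Whole-minus-sum' integrated information of a stationary
    multivariate time series, Gaussian approximation.
    data: (n_timesteps, n_channels)."""
    past, future = data[:-tau], data[tau:]
    whole = gaussian_mi(past, future)
    parts = sum(gaussian_mi(past[:, [i]], future[:, [i]])
                for i in range(data.shape[1]))
    return whole - parts

# Two cross-coupled noisy channels: the whole predicts its own future
# better than the parts do separately, so phi comes out positive here.
rng = np.random.default_rng(0)
x = np.zeros((5000, 2))
for t in range(1, 5000):
    x[t, 0] = 0.6 * x[t - 1, 1] + 0.5 * rng.normal()
    x[t, 1] = 0.6 * x[t - 1, 0] + 0.5 * rng.normal()
print(phi_whole_minus_sum(x))
```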
Shanahan M, Crosby M, Beyret B, et al., 2020, Artificial Intelligence and the Common Sense of Animals, TRENDS IN COGNITIVE SCIENCES, Vol: 24, Pages: 862-872, ISSN: 1364-6613
- Citations: 7
Mittal S, Lamb A, Goyal A, et al., 2020, Learning to combine top-down and bottom-up signals in recurrent neural networks with attention over modules, Pages: 6928-6942
Robust perception relies on both bottom-up and top-down signals. Bottom-up signals consist of what is directly observed through sensation. Top-down signals consist of beliefs and expectations based on past experience and short-term memory, such as how the phrase 'peanut butter and…' will be completed. The optimal combination of bottom-up and top-down information remains an open question, but the manner of combination must be dynamic and both context and task dependent. To effectively utilize the wealth of potential top-down information available, and to prevent the cacophony of intermixed signals in a bidirectional architecture, mechanisms are needed to restrict information flow. We explore deep recurrent neural net architectures in which bottom-up and top-down signals are dynamically combined using attention. Modularity of the architecture further restricts the sharing and communication of information. Together, attention and modularity direct information flow, which leads to reliable performance improvements in perceptual and language tasks, and in particular improves robustness to distractions and noisy data. We demonstrate on a variety of benchmarks in language modeling, sequential image classification, video prediction and reinforcement learning that the bidirectional information flow can improve results over strong baselines.
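As a rough illustration of the mechanism described, the PyTorch sketch below combines a bottom-up input with a set of top-down module states via scaled dot-product attention before a recurrent state update. This is a hedged reconstruction; the layer sizes and the single GRU cell are assumptions, not the authors' architecture.

```python
import torch
import torch.nn as nn

class TopDownAttention(nn.Module):
    """A recurrent layer whose bottom-up input attends over top-down
    signals; attention restricts which top-down information flows in."""
    def __init__(self, d_model=64):
        super().__init__()
        self.q = nn.Linear(d_model, d_model)   # query from bottom-up input
        self.k = nn.Linear(d_model, d_model)   # keys from top-down states
        self.v = nn.Linear(d_model, d_model)   # values from top-down states
        self.cell = nn.GRUCell(2 * d_model, d_model)

    def forward(self, bottom_up, top_down, h):
        # bottom_up: (B, d), top_down: (B, n_modules, d), h: (B, d)
        q = self.q(bottom_up).unsqueeze(1)                  # (B, 1, d)
        k, v = self.k(top_down), self.v(top_down)           # (B, n, d)
        attn = torch.softmax(q @ k.transpose(1, 2) / k.shape[-1] ** 0.5, -1)
        context = (attn @ v).squeeze(1)                     # (B, d)
        # the state update sees both what is sensed and what is expected
        return self.cell(torch.cat([bottom_up, context], -1), h)

layer = TopDownAttention()
h = layer(torch.randn(8, 64), torch.randn(8, 3, 64), torch.zeros(8, 64))
print(h.shape)  # torch.Size([8, 64])
```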
Shanahan M, Nikiforou K, Creswell A, et al., 2020, An explicitly relational neural network architecture
With a view to bridging the gap between deep learning and symbolic AI, we present a novel end-to-end neural network architecture that learns to form propositional representations with an explicitly relational structure from raw pixel data. In order to evaluate and analyse the architecture, we introduce a family of simple visual relational reasoning tasks of varying complexity. We show that the proposed architecture, when pre-trained on a curriculum of such tasks, learns to generate reusable representations that better facilitate subsequent learning on previously unseen tasks when compared to a number of baseline architectures. The workings of a successfully trained model are visualised to shed some light on how the architecture functions.
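A minimal PyTorch sketch of the core idea, soft-selecting pairs of entities by attention and reporting a small vector of comparisons between them, might look as follows. The free-parameter queries and the difference-based relation are simplifications for brevity; this is not the paper's exact architecture.

```python
import torch
import torch.nn as nn

class RelationHeads(nn.Module):
    """Each head soft-selects a pair of entities by attention and emits a
    small vector of scalar comparisons between them, yielding an
    explicitly relational, proposition-like representation."""
    def __init__(self, d_entity=32, d_key=16, n_heads=8, n_relations=4):
        super().__init__()
        self.key = nn.Linear(d_entity, d_key)
        self.queries = nn.Parameter(torch.randn(n_heads, 2, d_key))
        self.embed = nn.Linear(d_entity, n_relations)  # shared comparison space

    def forward(self, entities):            # entities: (B, N, d_entity)
        k = self.key(entities)              # (B, N, d_key)
        # attention logits for both slots of every head: (B, heads, 2, N)
        logits = torch.einsum('hsk,bnk->bhsn', self.queries, k)
        attn = torch.softmax(logits / k.shape[-1] ** 0.5, dim=-1)
        pair = torch.einsum('bhsn,bnd->bhsd', attn, entities)  # (B, h, 2, d)
        e1, e2 = self.embed(pair[:, :, 0]), self.embed(pair[:, :, 1])
        return (e1 - e2).flatten(1)         # (B, heads * n_relations)

heads = RelationHeads()
out = heads(torch.randn(4, 10, 32))
print(out.shape)  # torch.Size([4, 32])
```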
Dilokthanakul N, Kaplanis C, Pawlowski N, et al., 2019, Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning, IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, Vol: 30, Pages: 3409-3418, ISSN: 2162-237X
- Citations: 27
Garnelo M, Shanahan M, 2019, Reconciling deep learning with symbolic artificial intelligence: representing objects and relations, Current Opinion in Behavioral Sciences, Vol: 29, Pages: 17-23, ISSN: 2352-1546
In the history of the quest for human-level artificial intelligence, a number of rival paradigms have vied for supremacy. Symbolic artificial intelligence was dominant for much of the 20th century, but currently a connectionist paradigm is in the ascendant, namely machine learning with deep neural networks. However, both paradigms have strengths and weaknesses, and a significant challenge for the field today is to effect a reconciliation. A central tenet of the symbolic paradigm is that intelligence results from the manipulation of abstract compositional representations whose elements stand for objects and relations. If this is correct, then a key objective for deep learning is to develop architectures capable of discovering objects and relations in raw data, and learning how to represent them in ways that are useful for downstream processing. This short review highlights recent progress in this direction.
Roseboom W, Fountas Z, Nikiforou K, et al., 2019, Activity in perceptual classification networks as a basis for human subjective time perception, Nature Communications, Vol: 10, ISSN: 2041-1723
Despite being a fundamental dimension of experience, how the human brain generates the perception of time remains unknown. Here, we provide a novel explanation for how human time perception might be accomplished, based on non-temporal perceptual classification processes. To demonstrate this proposal, we build an artificial neural system centred on a feed-forward image classification network, functionally similar to human visual processing. In this system, input videos of natural scenes drive changes in network activation, and accumulation of salient changes in activation is used to estimate duration. Estimates produced by this system match human reports made about the same videos, replicating key qualitative biases, including differentiating between scenes of walking around a busy city or sitting in a cafe or office. Our approach provides a working model of duration perception from stimulus to estimation and presents a new direction for examining the foundations of this central aspect of human experience.
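The pipeline described, accumulating salient changes in a classifier's activations and reading duration off the accumulator, can be caricatured in a few lines of numpy. The threshold and seconds-per-event constants below are illustrative stand-ins, not the paper's fitted components.

```python
import numpy as np

def estimate_duration(features, threshold=10.0, seconds_per_event=0.5):
    """Count salient frame-to-frame changes in per-frame activations
    and map the accumulated count to seconds.
    features: (n_frames, d) activations for successive video frames."""
    changes = np.linalg.norm(np.diff(features, axis=0), axis=1)
    salient_events = int((changes > threshold).sum())   # the accumulator
    return salient_events * seconds_per_event

# A rapidly changing activation stream yields a longer duration estimate
# than a near-static one, mirroring the busy-city vs. cafe bias.
rng = np.random.default_rng(1)
busy = np.cumsum(rng.normal(scale=1.2, size=(240, 512)), axis=0)
quiet = np.cumsum(rng.normal(scale=0.2, size=(240, 512)), axis=0)
print(estimate_duration(busy), estimate_duration(quiet))
```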
Kaplanis C, Shanahan M, Clopath C, 2019, Policy Consolidation for Continual Reinforcement Learning, INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, Vol: 97, ISSN: 2640-3498
- Citations: 8
Dilokthanakul N, Shanahan M, 2018, Deep Reinforcement Learning with Risk-Seeking Exploration, 15th International Conference on the Simulation of Adaptive Behavior (SAB), Publisher: SPRINGER INTERNATIONAL PUBLISHING AG, Pages: 201-211, ISSN: 0302-9743
- Citations: 4
Kaplanis C, Shanahan M, Clopath C, 2018, Continual Reinforcement Learning with Complex Synapses, 35th International Conference on Machine Learning (ICML), Publisher: JMLR-JOURNAL MACHINE LEARNING RESEARCH, ISSN: 2640-3498
- Citations: 12
Garnelo M, Rosenbaum D, Maddison CJ, et al., 2018, Conditional Neural Processes, 35th International Conference on Machine Learning (ICML), Publisher: JMLR-JOURNAL MACHINE LEARNING RESEARCH, ISSN: 2640-3498
- Citations: 50
Fountas Z, Shanahan M, 2017, The role of cortical oscillations in a spiking neural network model of the basal ganglia., PLoS ONE, Vol: 12, ISSN: 1932-6203
Although brain oscillations involving the basal ganglia (BG) have been the target of extensive research, the main focus lies disproportionally on oscillations generated within the BG circuit rather than other sources, such as cortical areas. We remedy this here by investigating the influence of various cortical frequency bands on the intrinsic effective connectivity of the BG, as well as the role of the latter in regulating cortical behaviour. To do this, we construct a detailed neural model of the complete BG circuit based on fine-tuned spiking neurons, with both electrical and chemical synapses as well as short-term plasticity between structures. As a measure of effective connectivity, we estimate information transfer between nuclei by means of transfer entropy. Our model successfully reproduces firing and oscillatory behaviour found in both the healthy and Parkinsonian BG. We found that, indeed, effective connectivity changes dramatically for different cortical frequency bands and phase offsets, which are able to modulate (or even block) information flow in the three major BG pathways. In particular, alpha (8-12Hz) and beta (13-30Hz) oscillations activate the direct BG pathway, and favour the modulation of the indirect and hyper-direct pathways via the subthalamic nucleus-globus pallidus loop. In contrast, gamma (30-90Hz) frequencies block the information flow from the cortex completely through activation of the indirect pathway. Finally, below alpha, all pathways decay gradually and the system gives rise to spontaneous activity generated in the globus pallidus. Our results indicate the existence of a multimodal gating mechanism at the level of the BG that can be entirely controlled by cortical oscillations, and provide evidence for the hypothesis of cortically-entrained but locally-generated subthalamic beta activity. These two findings suggest new insights into the pathophysiology of specific BG disorders.
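Transfer entropy, the effective-connectivity measure used here, is the mutual information between a source's past and a target's next state, conditioned on the target's own past. A plug-in histogram estimate for scalar time series can be sketched as follows; the binning scheme and demo are illustrative.

```python
import numpy as np

def transfer_entropy(x, y, n_bins=8):
    """Plug-in estimate (nats) of TE(x -> y) = I(y_{t+1}; x_t | y_t)
    from quantile-binned scalar time series."""
    def digitize(s):
        edges = np.quantile(s, np.linspace(0, 1, n_bins + 1)[1:-1])
        return np.digitize(s, edges)
    xs, ys = digitize(x), digitize(y)
    trip = np.stack([ys[1:], xs[:-1], ys[:-1]], axis=1)  # (y+, x, y)

    def entropy(cols):
        _, counts = np.unique(trip[:, cols], axis=0, return_counts=True)
        p = counts / counts.sum()
        return -(p * np.log(p)).sum()

    # TE = H(y+, y) + H(x, y) - H(y+, x, y) - H(y)
    return entropy([0, 2]) + entropy([1, 2]) - entropy([0, 1, 2]) - entropy([2])

rng = np.random.default_rng(0)
x = rng.normal(size=10000)
y = np.roll(x, 1) + 0.3 * rng.normal(size=10000)  # y follows x with lag 1
print(transfer_entropy(x, y), transfer_entropy(y, x))  # first is much larger
```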
Fountas Z, Shanahan M, 2017, Assessing Selectivity in the Basal Ganglia: The “Gearbox” Hypothesis, Publisher: Cold Spring Harbor Laboratory
Despite experimental evidence, the literature so far contains no systematic attempt to address the impact of cortical oscillations on the ability of the basal ganglia (BG) to select. In this study, we employed a state-of-the-art spiking neural model of the BG circuitry and investigated the effectiveness of this circuitry as an action selection device. We found that cortical frequency, phase, dopamine and the examined time scale all have a very important impact on this process. Our simulations resulted in a canonical profile of selectivity, termed selectivity portraits, which suggests that the cortex is the structure that determines whether selection will be performed in the BG and what strategy will be utilized. Some frequency ranges promote the exploitation of highly salient actions, others promote the exploration of alternative options, while the remaining frequencies halt the selection process. Based on this behaviour, we propose that the BG circuitry can be viewed as the “gearbox” of action selection. Coalitions of rhythmic cortical areas are able to switch between a repertoire of available BG modes which, in turn, change the course of information flow within the cortico-BG-thalamo-cortical loop. Dopamine, akin to “control pedals”, either stops or initiates a decision, while cortical frequencies, as a “gear lever”, determine whether a decision can be triggered and what type of decision this will be. Finally, we identified a selection cycle with a period of around 200ms, which was used to assess the biological plausibility of popular cognitive architectures.
Author summary: Our brains are continuously called to select the most appropriate action between alternative competing choices. A plethora of evidence and theoretical work indicates that a fundamental brain region called the basal ganglia…
Tax T, Martinez Mediano PA, Shanahan M, 2017, The Partial Information Decomposition of Generative Neural Network Models, Entropy, Vol: 19, ISSN: 1099-4300
In this work we study the distributed representations learnt by generative neural network models. In particular, we investigate the properties of redundant and synergistic information that groups of hidden neurons contain about the target variable. To this end, we use an emerging branch of information theory called partial information decomposition (PID) and track the informational properties of the neurons through training. We find two differentiated phases during the training process: a first short phase in which the neurons learn redundant information about the target, and a second phase in which neurons start specialising and each of them learns unique information about the target. We also find that in smaller networks individual neurons learn more specific information about certain features of the input, suggesting that learning pressure can encourage disentangled representations.
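For readers unfamiliar with PID bookkeeping: given two sources and a target, ordinary mutual-information terms combine with a redundancy measure to yield redundant, unique and synergistic atoms. The sketch below uses the simple minimum-mutual-information redundancy for concreteness; this is an assumption, since the paper's chosen redundancy measure may differ, but the decomposition arithmetic is the same.

```python
import numpy as np

def mutual_info(a, b):
    """Plug-in discrete mutual information (nats) between integer arrays."""
    def H(*cols):
        _, c = np.unique(np.stack(cols, 1), axis=0, return_counts=True)
        p = c / c.sum()
        return -(p * np.log(p)).sum()
    return H(a) + H(b) - H(a, b)

def pid_mmi(s1, s2, t):
    """Two-source PID with the 'minimum mutual information' redundancy."""
    i1, i2 = mutual_info(s1, t), mutual_info(s2, t)
    i12 = mutual_info(s1 * (s2.max() + 1) + s2, t)   # joint source variable
    red = min(i1, i2)
    return dict(redundant=red, unique_1=i1 - red, unique_2=i2 - red,
                synergy=i12 - i1 - i2 + red)

rng = np.random.default_rng(0)
s1, s2 = rng.integers(0, 2, 100000), rng.integers(0, 2, 100000)
print(pid_mmi(s1, s2, s1 ^ s2))  # XOR target: almost all information is synergy
```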
Nikiforou K, Mediano PAM, Shanahan M, 2017, An Investigation of the Dynamical Transitions in Harmonically Driven Random Networks of Firing-Rate Neurons, Cognitive Computation, Vol: 9, Pages: 351-363, ISSN: 1866-9956
Continuous-time recurrent neural networks are widely used as models of neural dynamics and also have applications in machine learning. But their dynamics are not yet well understood, especially when they are driven by external stimuli. In this article, we study the response of stable and unstable networks to different harmonically oscillating stimuli by varying a parameter ρ, the ratio between the timescale of the network and the stimulus, and use the dimensionality of the network’s attractor as an estimate of the complexity of this response. Additionally, we propose a novel technique for exploring the stationary points and locally linear dynamics of these networks in order to understand the origin of input-dependent dynamical transitions. Attractors in both stable and unstable networks show a peak in dimensionality for intermediate values of ρ, with the latter consistently showing a higher dimensionality than the former, which exhibit a resonance-like phenomenon. We explain changes in the dimensionality of a network’s dynamics in terms of changes in the underlying structure of its vector field by analysing stationary points. Furthermore, we uncover the coexistence of underlying attractors with various geometric forms in unstable networks. As ρ is increased, our visualisation technique shows the network passing through a series of phase transitions with its trajectory taking on a sequence of qualitatively distinct figure-of-eight, cylinder, and spiral shapes. These findings bring us one step closer to a comprehensive theory of this important class of neural networks by revealing the subtle structure of their dynamics under different conditions.
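A toy version of the experiment, driving a random firing-rate network with a sinusoid and summarising the response's dimensionality with the PCA participation ratio, can be sketched as below. The gain, network size and integration scheme are illustrative assumptions rather than the paper's exact setup.

```python
import numpy as np

def ctrnn_response_dim(g=1.5, n=100, rho=1.0, T=200.0, dt=0.01, seed=0):
    """Drive a random continuous-time recurrent network with a sinusoid
    and estimate response dimensionality via the participation ratio.
    rho plays the role of the network/stimulus timescale ratio; gain
    g > 1 puts the autonomous network in the unstable regime."""
    rng = np.random.default_rng(seed)
    W = rng.normal(scale=g / np.sqrt(n), size=(n, n))
    w_in = rng.normal(size=n)
    x = np.zeros(n)
    steps = int(T / dt)
    traj = np.empty((steps, n))
    for t in range(steps):
        u = np.sin(2 * np.pi * rho * t * dt)          # harmonic stimulus
        x += dt * (-x + W @ np.tanh(x) + w_in * u)    # leaky rate dynamics
        traj[t] = x
    lam = np.linalg.eigvalsh(np.cov(traj[steps // 2:], rowvar=False))
    return lam.sum() ** 2 / (lam ** 2).sum()          # participation ratio

for rho in (0.1, 1.0, 10.0):
    print(rho, round(ctrnn_response_dim(rho=rho), 2))
```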
Dilokthanakul N, Mediano PAM, Garnelo M, et al., 2016, Deep unsupervised clustering with Gaussian mixture variational autoencoders
We study a variant of the variational autoencoder model with a Gaussian mixture as a prior distribution, with the goal of performing unsupervised clustering through deep generative models. We observe that the standard variational approach in these models is unsuited for unsupervised clustering, and mitigate this problem by leveraging a principled information-theoretic regularisation term known as consistency violation. Adding this term to the standard variational optimisation objective yields networks with both meaningful internal representations and well-defined clusters. We demonstrate the performance of this scheme on synthetic data, MNIST and SVHN, showing that the obtained clusters are distinct, interpretable, and yield higher unsupervised clustering performance than previous approaches.
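In outline, the model places a mixture-of-Gaussians prior over the latent code, with an inferred cluster variable selecting the component. A compact PyTorch sketch of that structure follows; it omits the paper's additional regularisation term, and the layer sizes and unit-variance components are assumptions.

```python
import torch
import torch.nn as nn

class GMVAE(nn.Module):
    """Minimal VAE with a Gaussian-mixture prior: an inferred cluster
    variable picks which mixture component the latent code is tied to."""
    def __init__(self, d_x=784, d_z=16, k=10, d_h=256):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(d_x, d_h), nn.ReLU(),
                                 nn.Linear(d_h, 2 * d_z + k))
        self.dec = nn.Sequential(nn.Linear(d_z, d_h), nn.ReLU(),
                                 nn.Linear(d_h, d_x))
        self.prior_mu = nn.Parameter(torch.randn(k, d_z))  # component means
        self.d_z, self.k = d_z, k

    def forward(self, x):
        h = self.enc(x)
        mu, logvar = h[:, :self.d_z], h[:, self.d_z:2 * self.d_z]
        q_k = torch.softmax(h[:, -self.k:], dim=-1)      # soft cluster assignment
        z = mu + torch.randn_like(mu) * (0.5 * logvar).exp()
        recon = ((self.dec(z) - x) ** 2).sum(-1)         # squared error ~ Gaussian NLL
        # KL from q(z|x) to each unit-variance component, averaged under q(k|x)
        kl_z = 0.5 * ((mu.unsqueeze(1) - self.prior_mu) ** 2
                      + logvar.exp().unsqueeze(1) - 1 - logvar.unsqueeze(1)).sum(-1)
        # plus KL of the categorical posterior from a uniform prior over k
        kl = (q_k * kl_z).sum(-1) + (q_k * (q_k.clamp_min(1e-8).log()
              + torch.log(torch.tensor(float(self.k))))).sum(-1)
        return (recon + kl).mean(), q_k.argmax(-1)       # loss, cluster labels

model = GMVAE()
loss, clusters = model(torch.rand(32, 784))
loss.backward()
```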
Arulkumaran K, Dilokthanakul N, Shanahan M, et al., 2016, Classifying options for deep reinforcement learning, Publisher: IJCAI
Deep reinforcement learning is the learning of multiple levels of hierarchical representations for reinforcement learning. Hierarchical reinforcement learning focuses on temporal abstractions in planning and learning, allowing temporally-extended actions to be transferred between tasks. In this paper we combine one method for hierarchical reinforcement learning - the options framework - with deep Q-networks (DQNs) through the use of different "option heads" on the policy network, and a supervisory network for choosing between the different options. We show that in a domain where we have prior knowledge of the mapping between states and options, our augmented DQN achieves a policy competitive with that of a standard DQN, but with much lower sample complexity. This is achieved through a straightforward architectural adjustment to the DQN, as well as an additional supervised neural network.
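The architectural adjustment described, per-option Q-value heads over a shared torso plus a supervisory network that selects among them, is simple to write down. A hedged PyTorch sketch follows; the fully-connected torso and the layer sizes are assumptions, not the paper's network.

```python
import torch
import torch.nn as nn

class OptionHeadDQN(nn.Module):
    """Shared torso, one Q-value 'option head' per option, and a
    supervisory head that picks which option head controls behaviour."""
    def __init__(self, d_obs=84, n_actions=6, n_options=4, d_h=128):
        super().__init__()
        self.torso = nn.Sequential(nn.Linear(d_obs, d_h), nn.ReLU())
        self.option_heads = nn.ModuleList(
            [nn.Linear(d_h, n_actions) for _ in range(n_options)])
        self.supervisor = nn.Linear(d_h, n_options)  # trained with supervision

    def forward(self, obs):
        h = self.torso(obs)
        option = self.supervisor(h).argmax(-1)                 # (B,)
        q_all = torch.stack([head(h) for head in self.option_heads], 1)
        q = q_all[torch.arange(obs.shape[0]), option]          # chosen head's Q
        return q.argmax(-1), q                                 # greedy action, Q-values

net = OptionHeadDQN()
action, q = net(torch.randn(5, 84))
print(action.shape, q.shape)  # torch.Size([5]) torch.Size([5, 6])
```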
Bhowmik D, Nikiforou K, Shanahan M, et al., 2016, A RESERVOIR COMPUTING MODEL OF EPISODIC MEMORY, International Joint Conference on Neural Networks (IJCNN), Publisher: IEEE, Pages: 5202-5209, ISSN: 2161-4393
- Citations: 1
Shanahan MP, Hellyer P, Sharp DJ, et al., 2015, Cognitive flexibility through metastable neural dynamics is disrupted by damage to the structural connectome, Journal of Neuroscience, Vol: 35, Pages: 9050-9063, ISSN: 0270-6474
Current theory proposes that healthy neural dynamics operate in a metastable regime, where brain regions interact to simultaneously maximize integration and segregation. Metastability may confer important behavioral properties, such as cognitive flexibility. It is increasingly recognized that neural dynamics are constrained by the underlying structural connections between brain regions. An important challenge is, therefore, to relate structural connectivity, neural dynamics, and behavior. Traumatic brain injury (TBI) is a pre-eminent structural disconnection disorder whereby traumatic axonal injury damages large-scale connectivity, producing characteristic cognitive impairments, including slowed information processing speed and reduced cognitive flexibility, that may be a result of disrupted metastable dynamics. Therefore, TBI provides an experimental and theoretical model to examine how metastable dynamics relate to structural connectivity and cognition. Here, we use complementary empirical and computational approaches to investigate how metastability arises from the healthy structural connectome and relates to cognitive performance. We found reduced metastability in large-scale neural dynamics after TBI, measured with resting-state functional MRI. This reduction in metastability was associated with damage to the connectome, measured using diffusion MRI. Furthermore, decreased metastability was associated with reduced cognitive flexibility and information processing. A computational model, defined by empirically derived connectivity data, demonstrates how behaviorally relevant changes in neural dynamics result from structural disconnection. Our findings suggest how metastable dynamics are important for normal brain function and contingent on the structure of the human connectome.
Váša F, Shanahan M, Hellyer P, et al., 2015, Effects of lesions on synchrony and metastability in cortical networks, Neuroimage, Vol: 118, Pages: 456-467, ISSN: 1095-9572
At the macroscopic scale, the human brain can be described as a complex network of white matter tracts integrating grey matter assemblies — the human connectome. The structure of the connectome, which is often described using graph theoretic approaches, can be used to model macroscopic brain function at low computational cost. Here, we use the Kuramoto model of coupled oscillators with time-delays, calibrated with respect to empirical functional MRI data, to study the relation between the structure of the connectome and two aspects of functional brain dynamics — synchrony, a measure of general coherence, and metastability, a measure of dynamical flexibility. Specifically, we investigate the relationship between the local structure of the connectome, quantified using graph theory, and the synchrony and metastability of the model's dynamics. By removing individual nodes and all of their connections from the model, we study the effect of lesions on both global and local dynamics. Of the nine nodal graph-theoretical properties tested, two were able to predict effects of node lesion on the global dynamics. The removal of nodes with high eigenvector centrality leads to decreases in global synchrony and increases in global metastability, as does the removal of hub nodes joining topologically segregated network modules. At the level of local dynamics in the neighbourhood of the lesioned node, structural properties of the lesioned nodes hold more predictive power, as five nodal graph theoretical measures are related to changes in local dynamics following node lesions. We discuss these results in the context of empirical studies of stroke and functional brain dynamics.
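Synchrony and metastability as defined here are just the time-mean and time-variance of the Kuramoto order parameter, and a node lesion is the removal of a row and column of the coupling matrix. Below is a delay-free numpy sketch; the paper's model includes time-delays and empirical calibration, which are omitted for brevity.

```python
import numpy as np

def kuramoto_stats(A, coupling=2.0, T=2000, dt=0.05, seed=0):
    """Simulate the (delay-free) Kuramoto model on coupling matrix A and
    return (synchrony, metastability): the time-mean and time-variance
    of the order parameter R(t)."""
    rng = np.random.default_rng(seed)
    n = len(A)
    omega = rng.normal(1.0, 0.1, n)            # natural frequencies
    theta = rng.uniform(0, 2 * np.pi, n)
    R = np.empty(T)
    for t in range(T):
        diff = theta[None, :] - theta[:, None]
        theta = theta + dt * (omega + (coupling / n) * (A * np.sin(diff)).sum(1))
        R[t] = np.abs(np.exp(1j * theta).mean())
    return R[T // 2:].mean(), R[T // 2:].var()

def lesion(A, node):
    """Remove one node and all of its connections."""
    keep = [i for i in range(len(A)) if i != node]
    return A[np.ix_(keep, keep)]

rng = np.random.default_rng(1)
A = (rng.random((30, 30)) < 0.3).astype(float)
A = np.triu(A, 1); A = A + A.T                 # undirected, no self-loops
print('intact  :', kuramoto_stats(A))
print('lesioned:', kuramoto_stats(lesion(A, node=0)))
```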
Teixeira FPP, Shanahan M, 2015, Local and Global Criticality within Oscillating Networks of Spiking Neurons, International Joint Conference on Neural Networks (IJCNN), Publisher: IEEE, ISSN: 2161-4393
- Citations: 1
Bhowmik D, Shanahan M, 2015, STDP Produces Well Behaved Oscillations and Synchrony, 4th International Conference on Cognitive Neurodynamics (ICCN), Publisher: SPRINGER, Pages: 241-252
Fountas Z, Shanahan M, 2015, GPU-based Fast Parameter Optimization for Phenomenological Spiking Neural Models, International Joint Conference on Neural Networks (IJCNN), Publisher: IEEE, ISSN: 2161-4393
Carhart-Harris RL, Leech R, Hellyer PJ, et al., 2014, The entropic brain: a theory of conscious states informed by neuroimaging research with psychedelic drugs, Frontiers in Human Neuroscience, Vol: 8, Pages: 1-22, ISSN: 1662-5161
Entropy is a dimensionless quantity that is used for measuring uncertainty about the state of a system, but it can also imply physical qualities, where high entropy is synonymous with high disorder. Entropy is applied here in the context of states of consciousness and their associated neurodynamics, with a particular focus on the psychedelic state. The psychedelic state is considered an exemplar of a primitive or primary state of consciousness that preceded the development of modern, adult, human, normal waking consciousness. Based on neuroimaging data with psilocybin, a classic psychedelic drug, it is argued that the defining feature of “primary states” is elevated entropy in certain aspects of brain function, such as the repertoire of functional connectivity motifs that form and fragment across time. Indeed, since there is a greater repertoire of connectivity motifs in the psychedelic state than in normal waking consciousness, this implies that primary states may exhibit “criticality,” i.e., the property of being poised at a “critical” point in a transition zone between order and disorder where certain phenomena such as power-law scaling appear. Moreover, if primary states are critical, then this suggests that entropy is suppressed in normal waking consciousness, meaning that the brain operates just below criticality. It is argued that this entropy suppression furnishes normal waking consciousness with a constrained quality and associated metacognitive functions, including reality-testing and self-awareness. It is also proposed that entry into primary states depends on a collapse of the normally highly organized activity within the default-mode network (DMN) and a decoupling between the DMN and the medial temporal lobes (which are normally significantly coupled). These hypotheses can be tested by examining brain activity and associated cognition in other candidate primary states such as rapid eye movement (REM) sleep and early psychosis…
Hellyer PJ, Shanahan MP, Scott G, et al., 2014, The control of global brain dynamics: opposing actions of frontoparietal control and default mode networks on attention, Journal of Neuroscience, Vol: 34, Pages: 451-461, ISSN: 1529-2401
Understanding how dynamic changes in brain activity control behavior is a major challenge of cognitive neuroscience. Here, we consider the brain as a complex dynamic system and define two measures of brain dynamics: the synchrony of brain activity, measured by the spatial coherence of the BOLD signal across regions of the brain; and metastability, which we define as the extent to which synchrony varies over time. We investigate the relationship among brain network activity, metastability, and cognitive state in humans, testing the hypothesis that global metastability is “tuned” by network interactions. We study the following two conditions: (1) an attentionally demanding choice reaction time task (CRT); and (2) an unconstrained “rest” state. Functional MRI demonstrated that increased synchrony and decreased metastability were associated with increased frontoparietal control/dorsal attention network (FPCN/DAN) activity and decreased default mode network (DMN) activity during the CRT compared with rest. Using a computational model of neural dynamics that is constrained by white matter structure to test whether simulated changes in FPCN/DAN and DMN activity produce similar effects, we demonstrate that activation of the FPCN/DAN increases global synchrony and decreases metastability. DMN activation had the opposite effects. These results suggest that the balance of activity in the FPCN/DAN and DMN might control global metastability, providing a mechanistic explanation of how attentional state is shifted between an unfocused/exploratory mode characterized by high metastability, and a focused/constrained mode characterized by low metastability.
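The two measures defined in this abstract can be computed directly from band-limited regional signals: instantaneous phases via the Hilbert transform, then the order parameter R(t), with synchrony as its mean and metastability as its variation over time. A minimal sketch follows; it leaves out the filtering and confound removal a real BOLD analysis needs.

```python
import numpy as np
from scipy.signal import hilbert

def synchrony_metastability(signals):
    """signals: (n_regions, n_timepoints) band-limited regional signals.
    Returns (synchrony, metastability) as mean and std of the Kuramoto
    order parameter over time."""
    phases = np.angle(hilbert(signals, axis=1))        # instantaneous phases
    R = np.abs(np.exp(1j * phases).mean(axis=0))       # order parameter R(t)
    return R.mean(), R.std()

rng = np.random.default_rng(0)
t = np.linspace(0, 60, 3000)
drift = np.cumsum(rng.normal(scale=0.02, size=(8, t.size)), axis=1)
signals = np.sin(2 * np.pi * 0.1 * t + drift)   # 8 weakly wandering oscillators
print(synchrony_metastability(signals))
```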
Fountas Z, Shanahan M, 2014, Phase Offset Between Slow Oscillatory Cortical Inputs Influences Competition in a Model of the Basal Ganglia, International Joint Conference on Neural Networks (IJCNN), Publisher: IEEE, Pages: 2407-2414, ISSN: 2161-4393
- Citations: 3
This data is extracted from the Web of Science and reproduced under a licence from Thomson Reuters. You may not copy or re-distribute this data in whole or in part without the written consent of the Science business of Thomson Reuters.