• Computer programs that learn to perform tasks typically forget them very quickly. We show that the learning rule can be modified so that a program can remember old tasks when learning a new one. This is an important step towards more intelligent programs that are able to learn progressively and adaptively.


    Deep neural networks are currently the most successful machine learning technique for solving a variety of tasks including language translation, image classification and image generation. However, they have typically been designed to learn multiple tasks only if the data is presented all at once. As a network trains on a particular task its parameters are adapted to solve the task. When a new task is introduced,  new adaptations overwrite the knowledge that the neural network had previously acquired. This phenomenon is known in cognitive science as ‘catastrophic forgetting’, and is considered one of the fundamental limitations of neural networks.

    Our brains, by contrast, work very differently. We are able to learn incrementally, acquiring skills one at a time and applying our previous knowledge when learning new tasks. As a starting point for our recent paper, in which we propose an approach to overcome catastrophic forgetting in neural networks, we took inspiration from neuroscience-based theories about the consolidation of previously acquired skills and memories in mammalian and human brains.

    Neuroscientists have distinguished two kinds of consolidation that occur in the brain: systems consolidation and synaptic consolidation. Systems consolidation is the process by which memories that have been acquired by the quick-learning parts of our brain are imprinted into the slow-learning parts. This imprinting is known to be mediated by conscious and unconscious recall – for instance, this can happen during dreaming. In the second mechanism, synaptic consolidation, connections between neurons are less likely to be overwritten if they have been important in previously learnt tasks. Our algorithm specifically takes inspiration from this mechanism to address the problem of catastrophic forgetting.

    Like a brain, a neural network consists of many connections between its units. After learning a task, we compute how important each connection is to that task. When we then learn a new task, each connection is protected from modification by an amount proportional to its importance to the old tasks. It is therefore possible to learn the new task without overwriting what was learnt in previous tasks, and without incurring a significant computational cost. In mathematical terms, we can think of the protection applied to each connection as a spring that anchors its new value to the old value, with a stiffness proportional to the connection’s importance. For this reason, we called our algorithm Elastic Weight Consolidation (EWC).
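    To make this concrete, the sketch below shows the two ingredients in PyTorch-style Python: an estimate of how important each parameter is to a finished task (here an averaged-squared-gradient approximation to the diagonal Fisher information used in the paper), and the quadratic “spring” penalty that pulls parameters back towards their values after that task. The helper names (estimate_fisher, ewc_penalty), the regularisation strength lam and the surrounding objects are illustrative rather than taken from our implementation.

    import torch

    def estimate_fisher(model, loss_fn, data_loader):
        # Importance of each parameter for the task just learnt, estimated as the
        # average squared gradient of the task loss (a diagonal Fisher approximation).
        fisher = {n: torch.zeros_like(p) for n, p in model.named_parameters()}
        n_batches = 0
        for batch in data_loader:
            model.zero_grad()
            loss_fn(model, batch).backward()
            for n, p in model.named_parameters():
                if p.grad is not None:
                    fisher[n] += p.grad.detach() ** 2
            n_batches += 1
        return {n: f / max(n_batches, 1) for n, f in fisher.items()}

    def ewc_penalty(model, old_params, fisher, lam=1000.0):
        # Quadratic penalty: each parameter is tied to its value after the old task
        # by a "spring" whose stiffness is that parameter's estimated importance.
        penalty = torch.zeros(())
        for n, p in model.named_parameters():
            penalty = penalty + (fisher[n] * (p - old_params[n]) ** 2).sum()
        return 0.5 * lam * penalty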

    Illustration of the learning process for two tasks using EWC

    To test our algorithm, we exposed an agent to Atari games sequentially. Learning an individual game from the score alone is a challenging task, but learning multiple games sequentially is even more challenging as each game requires an individual strategy.  As shown in the figure below, without EWC, the agent quickly forgets each game after it stops playing it (blue). This means that on average, the agent barely learns a single game. However, if we use EWC (brown and red), the agent does not forget as easily and can learn to play several games, one after the other.
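    In outline, sequential training with EWC looks roughly like the sketch below. It reuses the illustrative helpers from the earlier snippet, and the model, optimizer and per-task loss are stand-ins for the full reinforcement-learning setup used in our experiments.

    anchors = []  # one (old_params, fisher) pair for each task already learnt
    for task in tasks:
        for batch in task.data:
            loss = task.loss(model, batch)
            # Add a spring penalty for every previously learnt task.
            for old_params, fisher in anchors:
                loss = loss + ewc_penalty(model, old_params, fisher)
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()
        # Remember where the parameters ended up, and how important each one was.
        anchors.append((
            {n: p.detach().clone() for n, p in model.named_parameters()},
            estimate_fisher(model, task.loss, task.data),
        ))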

    Today’s computer programs still struggle to learn from data adaptively and in real time. However, we have shown that catastrophic forgetting is not an insurmountable challenge for neural networks. We hope that this research represents a step towards programs that can learn in a more flexible and efficient way.

    Our research also advances our understanding of how consolidation happens in the human brain. The neuroscientific theories on which our work is based have so far been demonstrated mainly in very simple examples. By showing that the same theories can be applied in a more realistic and complex machine learning context, we hope to give further weight to the idea that synaptic consolidation is key to retaining memories and know-how.

     
