ALL topUMass
Home
People
Research
Pubs
Contact
Links
Restricted
RL Repository

Department of Computer Science
University of Massachusetts Amherst

Research

Computational Neuroscience

Biological motor control demonstrates an amazing ability to deal with complex and ever-changing bodies and environments. Motor control systems are ideal for us to study for several reasons. The problem of manupulating a complex system such as a multi-link arm to solve a task is very useful in testing artificial intelligence algorithms and applications. Since reinforcement learning has biological parallels, we can compare how an RL learning agent controls a limb to how humans and animals control their limbs. This comparison is much more direct than emotional or motivational aspects of biological systems. In addition, animals control their limbs in a stereotyped manner, indicating that the central nervous system of a certain animal has evolved to control motor functions in a specific way. Using computational neuroscience methods such as neural networks and physiologically based models, we investigate how the brain learns and controls movements such as reaching and grasping. We also collaborate with neuroscientist specialists in motor control from UMass and other universities. Our computational neuroscience research has connections with not only artificial intelligence, but also the developmental psychology and robotics research we are involved with. What follows are brief descriptions of some of the projects to which members of our lab have contributed. Specific papers, proceedings, and posters are referenced in the ALL publications page.


4musc We use neural network models to theorize possible mechanisms the brain use for selecting muscles and how the primary cortex (MI) encodes movement. Specifically, we use a gradient descent algorithm to train a neural network to select muscles based on their ability to produce the desired movement while minimizing the total effort required by all muscles to implement the movement. We also show that MI neurons that encode in extrinsic space (such as direction of movement) can directly activate muscles appropriately. This is in contrast to other theories in which only MI neurons that encode movement in muscle space can directly activate muscles. The illustration on the left graphs muscle activity as a function of movement direction as produced by the models (thick dark line) and as recorded from monkey subjects (thin light lines). The agreement between model and monkey data suggest that these theories are reasonable


arm The stretch reflex can be described by non-linear equations of length and velocity. In the fractional power damping model of the elbow musculature, antagonist muscle bursts that function to decelerate the ongoing movement are generated by the stretch reflex, with central commands preserving some influence over the precise timing and magnitude of this response. Although descending control is achieved by setting equilibrium points of the viscoelastic muscle-reflex system, the nonlinear damping behavior of opposing reflexes interacts to produce a region of stiction around the specified equilibrium. This slows movement so dramatically that the equilibrium point is effectively never reached in most movements. The complexities of these nonlinearities do not necessarily increase the difficulty of the central control problem. Effective control can be achieved by precisely timing the offset of the agonist burst, and the tendency for oscillations around the endpoint is greatly diminished by the stiction that results from fractional power damping.


RL_BG Rapid human arm movements often have velocity profiles consisting of several bell-shaped acceleration-deceleration phases, sometimes overlapping in time and sometimes appearing separately. We show how such sub-movement sequences can emerge naturally as a result of an optimal control policy learned by a reinforcement learning system in the face of uncertainty and feedback delay. The system learns to generate sequences of pulse-step commands, producing fast initial sub-movements followed by several slow corrective sub-movements that often begin before the initial sub-movement has completed. These results suggest how the nervous system might efficiently control a stochastic motor plant under uncertainty and feedback delay.


Cerebellum A simplified model of the cerebellum was developed to explore its potential for adaptive, predictive control based on delayed feedback information. An abstract representation of a single Purkinje cell with multistable properties was interfaced, via a formalized premotor network, with a simulated single degree-of-freedom limb. The limb actuator was a nonlinear spring-mass system based on the nonlinear velocity dependence of the stretch reflex. By including realistic mossy fiber signals, as well as realistic conduction delays in afferent and efferent pathways, the model allowed the investigation of timing and predictive processes relevant to cerebellar involvement in the control of movement. The model regulates movement by learning to react in an anticipatory fashion to sensory feedback. Learning depends on training information generated from corrective movements and uses a temporally-asymmetric form of plasticity for the parallel fiber synapses on Purkinje cells


[ Top of page ]   [ ALL Home ]   [ Department of Computer Science ]   [ University of Massachusetts Amherst ]