







|
Department of Computer Science
University of Massachusetts Amherst
ALL Publications, 1981-1995 (September)
Following is a list of publications in chronological order.
Some of these items appear both as published papers and as technical
reports, but the title may have changed. In such cases, to avoid
ambiguity, the citation shown in this list is the proper
citation. We prefer that you cite the published version over the
technical report.
Please contact the lab or a specific
author if you have any questions. To obtain copies of specific papers, please
send email to mitchell [at] cs [dot] umass [dot] edu
- A. G. Barto and R. S. Sutton. Landmark learning: An illustration of
associative search. Biological Cybernetics, 42: 1-8, 1981.
- A. G. Barto, R. S. Sutton and P. S. Brouwer. Associative search
network: A reinforcement learning associative memory. Biological Cybernetics, 40: 201-211,
1981.
- S. Bozinovski. Teaching space: A representation concept for adaptive
pattern classification. COINS Technical Report 81-18, University of
Massachusetts, 1981.
- R. S. Sutton and A. G. Barto. An adaptive network that constructs
and uses an internal model of its environment. Cognition and
Brain Theory, 4: 217-246, 1981.
- R. S. Sutton and A. G. Barto. Toward a modern theory of adaptive
networks: Expectation and prediction. Psychological Review,
88: 135-171, 1981.
- C. W. Anderson. Feature generation and selection by a layered
network of reinforcement learning elements: Some initial experiments.
COINS Technical Report 82-12, University of Massachusetts, 1982.
- A. G. Barto, C. W. Anderson, and R. S. Sutton. Synthesis of nonlinear
control surfaces by a layered associative search network.
Biological Cybernetics, 43: 175-185, 1982.
- A. G. Barto and R. S. Sutton. Simulation of anticipatory responses
in classical conditioning by a neuron-like adaptive element.
Behavioural Brain Research, 4: 221-235, 1982.
- S. Bozinovski. A self-learning system using secondary reinforcement. In
R. Trappl, editor, Cybernetics and Systems '82, Elsevier Science
Publishers (North Holland), 1982.
- A. G. Barto and R. S. Sutton. Neural problem solving. COINS
Technical Report 83-03, University of Massachusetts, 1983.
- A. G. Barto, R. S. Sutton and C. W. Anderson. Neuronlike elements
that can solve difficult learning control problems. IEEE
Transactions on Systems, Man, and Cybernetics, 13:
835-846, 1983.
- A. G. Barto. Adaptive neural networks for learning control:
Some computational experiments. In Proceedings of the
IEEE Workshop on Intelligent Control, Rensselaer Polytechnic
Institute, Troy, NY, August 1985.
- A. G. Barto. Learning by statistical cooperation of self-interested
neuron-like computing elements. Human Neurobiology,
4: 229-256, 1985. [pdf]
- A. G. Barto and P. Anandan. Pattern recognizing stochastic learning
automata. IEEE Transactions on Systems, Man, and Cybernetics,
15: 360-375, 1985.
- A. G. Barto and C. W. Anderson. Structural learning in connectionist
systems. In Proceedings of the Seventh Annual Conference
of the Cognitive Science Society, Irvine, CA, August 1985.
- O. Selfridge, R. S. Sutton and A. G. Barto. Training and tracking
in robotics. Proceedings of the Ninth International Joint
Conference on Artificial Intelligence, pp. 670-672.
San Mateo, CA: Morgan Kaufmann, 1985.
- R. S. Sutton and B. Pinette. The learning of world models by
connectionist networks. Proceedings of the Seventh Annual
Conference of the Cognitive Science Society, Irvine, CA, 1985.
- A. G. Barto. Game-theoretic cooperativity in networks of
self-interested units. In J. S. Denker, editor, Neural
Networks for Computing, American Institute of Physics, New York, 1986.
- A. G. Barto, P. Anandan, and C. W. Anderson. Cooperativity in networks
of pattern recognizing stochastic learning automata. In K. S.
Narendra, editor, Adaptive and Learning Systems: Theory and
Applications, Plenum, New York, 1986.
- M. I. Jordan. Attractor dynamics and parallelism in a connectionist
sequential machine. Proceedings of the Eighth Annual Conference
of the Cognitive Science Society, Amherst, MA, 1986.
- J. W. Moore, J. E. Desmond, N. E. Berthier, E. J. Blazis, R. S. Sutton
and A. G. Barto. Simulation of the classically conditioned
nictitating membrane response by a neuron-like adaptive element.
Response topography, neuronal firing, and interstimulus intervals.
Behavioural Brain Research, 21: 143-154, 1986.
- A. G. Barto. An approach to learning control surfaces by
connectionist systems. In M. A. Arbib and A. R. Hanson, editors,
Vision, Brain and Cooperative Computation, MIT Press/
Bradford Books, Cambridge, MA, 1987.
- A. G. Barto and M. I. Jordan. Gradient following without
back-propagation in layered networks. Proceedings of the
First IEEE Annual Conference on Neural Networks, San Diego, CA,
June 1987, pp. II-629-II-636.
- D.E.J. Blazis and J.W. Moore. Simulation of a classically
conditioned response: Components of the input trace and a
cerebellar neural network implementation of the Sutton-Barto-Desmond
model. COINS Technical Report 87-74, University of Massachusetts,
1987.
- S. Judd. Learning in networks is hard. Proceedings of the
First IEEE Annual Conference on Neural Networks, San Diego, CA,
June 1987.
- J. S. Judd. Complexity of connectionist learning with various
node functions. COINS Technical Report 87-60, University of
Massachusetts, 1987.
- N. A. Schmajuk and J. W. Moore. Two attentional models of
classical conditioning: Variations in CS effectiveness revisited.
COINS Technical Report 87-29, University of Massachusetts, 1987.
- R. S. Sutton and A. G. Barto. A temporal-difference model of
classical conditioning. TR87-509.2, Computer &Intelligent
Systems Laboratory, GTE Laboratories Inc., Waltham, MA, 1987.
[Also in Proceedings of the Ninth Annual Conference of the
Cognitive Science Society, July, 1987.]
- J. E. Desmond. Temporally adaptive conditioned responses:
Representation of the stimulus trace in neural-network models.
COINS Technical Report 88-80, University of Massachusetts, 1988.
- V. Gullapalli. A stochastic algorithm for learning real-valued
functions via reinforcement feedback.
COINS Technical Report 88-91, University of Massachusetts, 1988.
- R. A. Jacobs. Increased rates of convergence through learning rate
adaptation. Neural Networks, 1: 295-307, 1988.
- R. A. Jacobs. Initial experiments on constructing domains of
expertise and hierarchies in connectionist systems. Proceedings of the 1988 Connectionist Model Summer School,
San Mateo, CA: Morgan Kaufmann, 1988.
- M. I. Jordan. Supervised learning and systems with excess degrees
of freedom. COINS Technical Report 88-27, University of Massachusetts.
- M. I. Jordan and D. A. Rosenbaum. Action. COINS Technical Report
88-26, University of Massachusetts, 1988. [Also to appear in
M. I. Posner, editor, Handbook of Cognitive Science,
Cambridge, MA: MIT Press.]
- S. Judd. On the complexity of loading shallow neural networks.
Journal of Complexity, 1988.
- R. S. Sutton. Learning to predict by the methods of temporal
differences. Machine Learning, 3: 9-44, 1988.
- A. Barto. From chemotaxis to cooperativity: Abstract exercises
in neuronal learning strategies. In R. Durbin, C. Miall and
G. Mitchison, editors, The Computing Neuron, pp. 73-98.
Wokingham, England: Addison-Wesley, 1989. [pdf]
- A. G. Barto. Connectionist learning for control.
In W. T. Miller, R. S. Sutton and P. J. Werbos,
editors, Neural Networks for Control, pp. 5-58.
Cambridge, MA: The MIT Press, 1990. [Also appeared as
COINS Technical Report 89-89, University of Massachusetts, 1989.]
-
A. G. Barto and S. P. Singh. Reinforcement learning and dynamic programming.
In Proceedings of the Sixth Yale Workshop on Adaptive and
Learning Systems, New Haven, CT, August 1990. pp. 83-88.
- A. G. Barto and S. P. Singh. On the computational economics of
reinforcement learning. In D.S. Touretzky, J.L. Elman,
T.J. Sejnowski and G.E. Hinton, editors, Proceedings of the
1990 Connectionist Models Summer School. San Mateo, CA:
Morgan Kaufmann, 1990. pp. 35-44.
-
A. G. Barto, R. S. Sutton and C. J. C. H. Watkins. Learning and
sequential decision making. In M. Gabriel and J. Moore, editors,
Learning and Computational Neuroscience,The MIT Press, Cambridge, MA, 1990, pp. 539-602. [Also appeared as
COINS Technical Report 89-95, University
of Massachusetts, Amherst, MA, 1989.]
-
A. G. Barto, R. S. Sutton and C. J. C. H. Watkins. Sequential decision
problems and neural networks. In D. S. Touretzky, editor, Advances
in Neural Information Processing Systems 2, pp. 686-693.
Morgan Kaufmann Publishers, San Mateo, CA, 1990.
- V. Gullapalli. Associative reinforcement learning of real-valued
functions. COINS Technical Report 90-129, University of
Massachusetts, May 1990.
- V. Gullapalli. Modeling cortical area 7a using stochastic
real-valued (SRV) units. In D.S. Touretzky, J.L. Elman,
T.J. Sejnowski and G.E. Hinton, editors, Proceedings of the
1990 Connectionist Models Summer School, pp. 363-368. San Mateo, CA:
Morgan Kaufmann, 1990.
- V. Gullapalli. A stochastic reinforcement learning algorithm for
learning real-valued functions.
Neural Networks, 3: 671-692, 1990.
- J. C. Houk, S. P. Singh, C. Fisher and A. G. Barto. An adaptive
sensorimotor network inspired by the anatomy and physiology of
the cerebellum. In W. T. Miller,
R. S. Sutton and P. J. Werbos, editors, Neural Networks
for Control, pp. 301-348. Cambridge, MA: MIT Press, 1990.
[Also appeared as COINS Technical Report 89-108, University of
Massachusetts, 1989.]
- J. C. Houk. Cooperative control of limb movements by the motor
cortex, brainstem and cerebellum. COINS Technical Report 89-118,
University of Massachusetts, 1989. [Also in R.M.J.
Cotterill, editor, Models of Brain Function, Cambridge
University Press, 1990.]
-
R. A. Jacobs. Task Decomposition Through Competition in a Modular
Connectionist Architecture. (Ph.D. Thesis) COINS Technical Report 90-44,
University of Massachusetts at Amherst, May 1990.
-
R. A. Jacobs, M. I. Jordan and A. G. Barto. Task decomposition through
competition in a modular connectionist architecture: The what and
where vision tasks. Cognitive Science, 15: 219-250, 1991. [Also
appeared as COINS Technical Report 90-27,
University of Massachusetts at Amherst, March 1990.]
-
M. I. Jordan and R. A. Jacobs. Learning to control an unstable system
with forward modeling. In D. S. Touretzky, editor, Advances
in Neural Information Processing Systems 2, pp. 324-331.
Morgan Kaufmann Publishers, San Mateo, CA, 1990.
- M. C. Mozer and J. Bachrach. Discovering the structure of a
reactive environment by exploration. In D. S. Touretzky, editor,
Advances in Neural Information Processing Systems 2,
pp. 439-446. Morgan Kaufmann Publishers, San Mateo, CA, 1990.
- M. C. Mozer and J. R. Bachrach. Discovering the structure of a
reactive environment by exploration. Neural Computation,
2: 447-457, 1990.
-
R. S. Sutton and A. G. Barto. Time-derivative models of Pavlovian
reinforcement. In M. Gabriel and J. Moore, editors, Learning and
Computational Neuroscience,The MIT Press: Cambridge, MA, 1990,
pp. 497-537.
- B. E. Ydstie. Forecasting and control using adaptive connectionist
networks. Computers in Chemical Engineering.,
14: 583-299, 1990.
- R. C. Yee, S. Saxena, P. E. Utgoff and A. G. Barto. Explaining
temporal differences to create useful concepts for evaluating
states. In Proceedings of the 8th National Conference
on Artificial Intelligence, pp. 882-888. AAAI Press/MIT Press, 1990.
- M. C. Mozer and J. Bachrach. SLUG: A connectionist architecture
for inferring the structure of finite-state environments.
Machine Learning, 7 (2/3): 139-160, 1991.
- J.R. Bachrach. A connectionist learning control architecture
for navigation. In R. Lippmann, J. Moody and D. Touretzky, editors,
Advances in Neural Information Processing 3. Morgan Kaufmann:
San Mateo, CA, 1991. pp. 457-463.
- A.G. Barto. Some learning tasks from a control perspective. In
L. Nadel and D. Stein, editors, 1990 Lectures in Complex
Systems, Addison-Wesley, 1991. pp. 195-223. [Also appeared as
COINS Technical Report 90-122, University of Massachusetts at Amherst,
December 1990.]
- S.P. Singh. Transfer of learning across compositions of
sequential tasks. In L.A. Barnbaum and G.C. Collins, editors, Machine
Learning: Proceedings of the Eighth International Workshop (ML91), Morgan
Kaufmann: San Mateo, CA, 1991. pp. 348-352.
- R. S. Sutton, A. G. Barto and R. J. Williams. Reinforcement learning
is direct adaptive optimal control. Proceedings
of the 1991 American Control Conference, June 26-28, Boston, MA.
pp. 2143-2146.
- J. C. Houk and A. G. Barto. Distributed sensorimotor learning.
In G. E. Stelmach and J. Requin, editors,
Tutorials in Motor Behavior II, Elsevier Science Publishers B.V.:
Amsterdam, 1992, pp. 71-100.
(Also appeared as NPB Technical Report #1,
Center for Neuroscience Research on Neuronal Populations and Behavior,
Northwestern University, Dec. 1991.)
- V. Gullapalli. A comparison of supervised and reinforcement learning
methods on a reinforcement learning task. Proceedings of the
1991 IEEE International Symposium on Intelligent Control, Arlington, VA,
August 1991, pp. 394-399.
- V. Gullapalli. Associative reinforcement learning of real-valued
functions. Proceedings of the 1991 IEEE International
Conference on Systems, Man, and Cybernetics, Charlottesville, VA, October 1991.
- V. Gullapalli, R. A. Grupen and A. G. Barto. Learning reactive
admittance control. In Proceedings of the 1992 IEEE Conference on
Robotics and Automation. Nice, France, May 1992, pp. 1475-1480.
- J. W. Moore. A mechanism for timing conditioned responses. Technical
Report 92-3, Computer Science Dept., University of Massachusetts, January 1992.
- S.P. Singh. Transfer of learning by composing solutions for
elemental sequential tasks. Machine Learning, 8:
323-339, May 1992.
- N.E. Berthier, S.P. Singh, A.G. Barto and J.C. Houk. A
cortico-cerebellar model that learns to generate distributed motor commands to
control a kinematic arm. In Neural Information Processing
Systems 4, J.E. Moody, S.J. Hanson, and R.P. Lippmann (Eds.), Morgan Kaufmann:
San Mateo, 1992, pp. 611-618.
- V. Gullapalli and A. G. Barto. Shaping as a method for accelerating
reinforcement learning. In Proceedings of the 1992 IEEE
International Symposium on Intelligent Control, Glasgow, Scotland, August
1992.
- S. P. Singh. Scaling reinforcement learning algorithms by learning
variable temporal resolution models. In
Proceedings of the Ninth Machine Learning Conference, Aberdeen, Scotland,
1992. Morgan Kaufmann, pp. 406-415.
- J. R. Bachrach. Connectionist Modeling and Control of Finite State
Environments. (Ph.D. Thesis) COINS Technical Report 92-6, University of
Massachusetts, Amherst. January 1992.
- V. Gullapalli, Reinforcement Learning and its Application to Control.
(Ph.D. Thesis) COINS Technical Report 92-10, University of
Massachusetts, Amherst. January 1992.
- S. P. Singh. Reinforcement learning with a hierarchy of abstract
models. In Proceedings of the Tenth National
Conference on Artificial Intelligence (AAAI-92), San Jose, CA, July 1992.
AAAI Press/MIT Press, pp. 202-207.
- S. P. Singh. The efficient learning of multiple task sequences.
In Advances in Neural Information Processing
Systems 4, J.E. Moody, S.J. Hanson, and R.P. Lippmann, editors, Morgan
Kaufmann: San Mateo, 1992. pp. 251-258.
- V. Gullapalli. Dynamic systems control via associative reinforcement
learning. In B. Soucek and the IRIS Group, editors,
Dynamic, Genetic, and Chaotic Programming: The Sixth
Generation. New York, NY: John Wiley &Sons, 1992, pp. 27-64.
- R. Yee. Abstraction in control learning. COINS Technical Report
92-16, University of Massachusetts, March 1992.
- A. G. Barto. Reinforcement learning and adaptive critic methods.
In Handbook of Intelligent Control, D.A. White and D.A. Sofge, editors.
New York: Van Nostrand Reinhold, 1992, pp. 469-491.
- A. G. Barto and S. J. Bradtke. Learning to solve stochastic optimal
path problems using real-time dynamic programming. Proceedings of the
Seventh Yale Workshop on Adaptive and Learning Systems, New Haven, CT, May
1992, pp. 143-148.
- A. G. Barto and V. Gullapalli. Neural networks and adaptive control.
In P. Rudomin, M. A. Arbib, F. Cervantes-Perez, and R. Romo,
editors, Neuroscience: From Neural Networks to Artificial
Intelligence, Research Notes in Neural
Computation, Vol. 4, Springer-Verlag, 1993. pp. 471-493.
[Preprint appeared as NPB Technical Report 6, Center for Neuroscience
Research on Neuronal Populations and Behavior, Northwestern University,
March 1992.]
- V. Gullapalli. Robust control under extreme uncertainty. In Advances in Neural Information Processing Systems 5, C. L. Giles,
S. J. Hanson, and J. D. Cowan (Eds), San Mateo: Morgan Kauffmann, 1993. pp.
327-334.
- N. E. Berthier, S. P. Singh, A. G. Barto, and J. C. Houk. Distributed
representation of limb motor programs in arrays of adjustable pattern
generators. Journal of Cognitive Neuroscience, 5 (1): 56-78,
1993. [Also
appeared as NPB Technical Report #3, Center for Neuroscience
Research on Neuronal Populations and Behavior, Northwestern University, 1991.]
- S. J. Bradtke. Reinforcement Methods Applied to Linear Quadratic
Regulation. In Advances in Neural Information Processing Systems 5,
C. L. Giles, S. J. Hanson, and J. D. Cowan (Eds), San Mateo: Morgan Kauffmann,
1993. pp. 295-302.
- S. Guzmán-Lara.
Adjusting connections using reflexes as guidance.
Technical Report 8. Center for the Study of Neuronal Populations
and Behavior. August 1993.
- J. C. Houk, J. Kiefer, and A. G. Barto. Distributed motor
commands in the limb premotor network. Trends in Neuroscience
16 (1): 27-33, 1993.
- S. P. Singh. Learning to Solve Markovian Decision Processes. (Ph.D.
Thesis) CMPSCI Technical Report 93-77, University of Massachusetts, November
1993.
- V. Gullapalli, J. A. Franklin and H. Benbrahim. Acquiring robot skills
via reinforcement learning. IEEE Control Systems Special Issue
on Robotics: Capturing Natural Motion, 4(1): 13-24, Feb. 1994.
- V. Gullapalli, A. G. Barto and R. A. Grupen. Learning admittance
mappings for force-guided assembly. Proceedings of the 1994
International Conference on Robotics and Automation,1994, pp. 2633-2638.
- S. P. Singh, A. G. Barto, R. Grupen, and C. Connolly. Robust
reinforcement learning in motion planning. In Advances in Neural
Information Processing Systems 6,J.D. Cowan, G. Tesauro and J. Alspector
(Eds.), San Francisco: Morgan Kauffmann, 1994. pp. 655-662.
- A. Barto and M. Duff. Monte Carlo matrix inversion and reinforcement
learning. In Advances in Neural Information
Processing Systems 6,J.D. Cowan, G. Tesauro and J. Alspector (Eds.), San
Francisco: Morgan Kauffmann, 1994. pp. 687-694.
- V. Gullapalli and A. Barto. Convergence of indirect adaptive
asynchronous value iteration algorithms. In Advances in Neural Information
Processing Systems 6,J.D. Cowan, G. Tesauro and J. Alspector (Eds.), San
Francisco: Morgan Kauffmann, 1994. pp. 695-702.
- S. J. Bradtke, A. G. Barto, and B. E. Ydstie. A reinforcement learning
method for direct adaptive linear quadratic control. 8th Yale Workshop on
Adaptive and Learning Systems, Yale University, June 1994. pp. 85-96.
-
A. G. Barto. Adaptive critics and the basal ganglia. In Models of
Information Processing in the Basal Ganglia, J. C. Houk, J.
Davis and D. Beiser (Eds.), Cambridge, MA: MIT Press, 1995, pp. 215-232.
-
J. T. Buckingham, J. C. Houk and A. G. Barto. Controlling a nonlinear
spring-mass system with a cerebellar model. 8th Yale Workshop on Adaptive
and Learning Systems, Yale University, June 1994. pp. 1-6.
-
J. C. Houk, J. L. Adams, and A. G. Barto. A model of how the basal ganglia
generates and uses neural signals that predict reinforcement. In Models of
Information Processing in the Basal Ganglia, J. C. Houk, J.
Davis and D. Beiser (Eds.), Cambridge, MA: MIT Press, 1995, pp. 249-270.
- R. Grupen, J. Coelho, C. Connolly, V. Gullapalli, M. Huber, and K.
Souccar. Towards Physical Interaction and Manipulation: Screwing in a Light
Bulb. AAAI Spring Symposium Series. Stanford, 1994.
- S. Bradtke and M. Duff. Reinforcement learning methods for
continuous-time Markov decision problems. 7th Annual
Conference on Neural Information Processing Systems (NIPS7), November 1994.
- R. Crites and A. Barto. A class of actor/critic architectures that are
equivalent to Q-learning. 7th Annual Conference on Neural
Information Processing Systems (NIPS7), Denver, November 1994.
- S. J. Bradtke, B. E. Ydstie, and A. G. Barto. Adaptive linear quadratic
control using policy iteration. CMPSCI Technical Report 94-49, University
of Massachusetts, June 1994. Submitted to IEEE Transactions on Automatic
Control, April 1994.
- M. Duff. Solving Bellman's Equation by the method of continuity.
Proceedings of the 1994 American Control Conference, Baltimore, June
1994.
- A. G. Barto. Reinforcement learning. In Handbook of Brain Theory
and Neural Networks, M.A. Arbib (Ed.), Cambridge: MIT Press, in press.
- A. G. Barto. Reinforcement learning in motor control. In Handbook
of Brain Theory and Neural Networks, M.A. Arbib (Ed.), Cambridge: MIT Press, in
press.
- A. G. Barto. Learning as hillclimbing in weight space. In Handbook
of Brain Theory and Neural Networks, M.A. Arbib (Ed.), Cambridge: MIT Press, in
press.
- S. J. Bradtke. Incremental Dynamic Programming for On-Line Adaptive
Optimal Control. (Ph.D. Thesis) CMPSCI Technical Report 94-62, University of
Massachusetts, August 1994.
- V. Gullapalli. Direct associative reinforcement learning methods
for dynamic systems control. (Submitted to Neurocomputing.)
- N. Berthier, R. Clifton, V. Gullapalli, D. McCall, and D. Rubin. Visual
information and object size in the control of reaching. (Submitted.)
- V. Gullapalli. Skillful Control Under Uncertainty via Direct
Reinforcement Learning. (Submitted to Robotics and Autonomous
Systems.)
- A. G. Barto. Reinforcement learning control. Current Opinion in
Neurobiology, 4: 888-893, 1994.
- S. P. Singh and R. C. Yee. An upper bound on the loss from
approximate optimal-value functions. Machine
Learning, 16 (3): 227-233, 1994.
- S. J. Bradtke and A. G. Barto. New Algorithms for Temporal Difference
Learning. Machine Learning, 108: Special Issue on Reinforcement
Learning, to appear.
- T. W. Sandholm and R. H. Crites. Multiagent Reinforcement Learning in
the Iterated Prisoner's Dilemma. Submitted to Biosystems Journal,
November 1994.
- A. G. Barto, S. J. Bradtke and S. P. Singh. Learning to act
using real-time dynamic programming. Artificial Intelligence, Special
Volume on Computational Research on Interaction and Agency, 72(1): 81-138,
1995. [Also appeared as CMPSCI Technical Report 93-02, University of
Massachusetts, January 1993. (Supercedes TR 91-57.)]
- A. G. Barto. Reinforcement learning and dynamic programming.
To be presented at IFAC'95, Conference on Man-Machine Systems, Cambridge,
MA, June 1995.
- M. Duff. Q-Learning for bandit problems. In A. Prieditis and S. Russell, editors, Machine Learning: Proceedings of the Twelfth International Conference on Machine Learning (ML95), Morgan Kaufmann: Tahoe City, CA, 1995, pp. 209-217.
- J. T. Buckingham, A. G. Barto, and J. C. Houk. Adaptive Predictive
Control with a Cerebellar Model. In Proceedings of the 1995 World
Congress on Neural Networks, Volume 1, Lawrence Erlbaum
Associates, Inc: Mahwah, NJ, 1995, pp. 373-380
|