The Analytical Engine has no pretensions whatever to originate anything. It can do whatever we know how to order it to perform.[Lovelace, 1961]
Definition (of learning): Learning denotes changes in the system that are adaptive in the sense that they enable the system to do the same task or tasks drawn from the same population more efficiently and more effectively the next time. (Simon, in Machine Learning An AI Approach, 1983) Note: Often humans are interested in learning not so that the same task can be done over and over, but so that we acquire the ability to perform a wide range of tasks (similar problems)
Paradigm History:
(related research: simulation of evolutionary processes, that through random mutation and "natural selection" might create a system capable of some intelligent behavior.[Friedberg, 1958, 1959; Holland, 1980] )
This paradigm utilized logic or graphic structure representations rather than numeric or statistical methods
Most agree that a required characteristic of learning systems be that the system is adaptive, capable of improving their performance on a problem as a function of previous experience. "...changes in the system that are adaptive in the sense that they enable the system to do the same task or tasks drawn from the same population more efficiently and more effectively the next time. " One adapts by changing their knowledge and acquiring and using new knowledge (of both task and domain)
Learning covers a wide range of phenomena. At one end of the spectrum is skill refinement; at the other end is knowledge acquisition.
What is knowledge acquisition? In Computer Science, we note different meanings (levels). "When we say that someone learned physics, we mean that this person acquired significant concepts of physics, understood their meaning, and understood their relationship to each other and the physical world. The essence of learning in this case is the acquisition of new knowledge, including descriptions and models of physical systems and their behaviors, incorporating a variety of representations - from simple intuitive mental models, examples and images, to completely tested mathematical equations and physics laws. A person is said to have learned more if his knowledge explains a broader scope of situations, is more accurate, and is better able to predict the behavior of the physical world.This form of learning is ... generally termed knowledge acquisition ... defined as learning new symbolic information coupled with the ability to apply that information in an effective way.
A second kind of learning is the gradual improvement of motor and cognitive skills through practice, such as learning to ride a bicycle or to play the piano. Acquiring textbook knowledge on how to perform these activities represents only the initial phase in developing the requisite skills. The bulk of the learning process consists of refining the learned skills, whether mental or motor coordination, by repeated practice and by correcting deviations from desired behavior. This form of learning, often called skill refinement, differs in many ways from knowledge acquisition. Whereas the essence of knowledge acquisition may be a conscious process whose result is the creation of new symbolic knowledge structures and mental models, skill refinement occurs at a subconscious level by virtue of repeated practice.[Carbonell et al, 1983]"
Note:intellectual endeavors favor the former, motor coordination tasks favor the latter
* skill refinement comes closer to non-symbolic processes
(neural nets and genetic algorithms)
AI programs draw heavily on knowledge as their source of power. Generally, knowledge is acquired through experience.
(i.e., a taxonomy of learning strategies):
most basic database systems can be said to "learn" in this sense
*learning by being programmed, constructed or modified by an external entity - requiring no effort on the part of the learner
*learning by memorization of given facts and data with no inferences drawn from the incoming information Samuel's checkers program used rote learning (minimax search algorithm) and parameter adjustment (generalizing from its own experience) Capabilities needed:
(e.g., textbook, teachers) Learner is required to do some inference, but a large fraction of the burden is on the teacher, who must present and organize the knowledge in a way that incrementally augments the student's existing knowledge The ability to operationalize knowledge -- an important component in explanation-based learning
Concept acquisition Given a set of examples and counterexamples of a concept, the learner induces a general concept description that describes all the positive examples and none of the counterexamples. (alternatively said)
Classification is the process of assigning, to a particular input, the name of a class to which it belongs. The classes to which the classification procedure can choose can be described in a variety of ways:
(e.g. animals (Rich 458))
Either way, not easy to construct these class definitions.
Want a system to be able to produce a classification program that can evolve its own class definitions.
This task of constructing class definitions is called concept learning, or induction.
Techniques of concept learning from example: coefficient adjustment [Samuel, 1963] -- Credit assignment problem Winston's structural concept learning program [Winston, 1975] Version Spaces [Mitchell, 1977, 1978] Decision Trees [Quinlan, 1986]
Inductive Learning:
(We will consider distinctions between these in the next session)
For machine learning, knowledge structures are changed...
rules of behavior, descriptions of objects, problem-solving heuristics, classification taxonomies, etc. How this knowledge is acquired will depend on how it is represented. How it is represented depends on the phsychological model on which it is based.
parameters in algebraic expressions
neural nets - adjust weights
decision trees can discriminate among classes of objects
nodes correspond to attributes; leaves are objects
Quinlan
language recognition
Context-free grammars, transformation rules
Chomsky
condition-action pair
Rule-based systems
Post
related formalisms
propositions, predicates
probability theory
neat vs scruffy (Marr?)
semantic nets
Augmented Transistion Networks (ATN's)
collections of labeled entities - slots
scripts
learn for efficiency ... or reason about internal structure
hierarchies
Interactions (e.g., levels of abstraction)