UI logo
CS 440/ECE 448
Margaret Fleck

Introduction


AI systems

AI applications vary massively in complexity. The simplest ones hardly seem like AI these days because AI is often defined to include only cutting edge (often buggy) methods. Simple systems would include things like consumer-grade chess tutors, spelling correction and autocompletion for text input, and robot vacuum cleaners:

Roomba (from IRobot)

We often imagine intelligent systems that have a sophisticated physical presence, like a human or animal. Current robots are starting to move like people/animals but have only modest high-level planning ability.

Boston dynamics robot ( video)

Other AI systems live mostly or entirely inside the computer, communicating via text or simple media (e.g. voice commands). This would be true, for example of an intelligent assistant to book airplane tickets. A well-known fictional example is the HAL 9000 system from the movie "2001: A Space Odyssey" which appears mostly as a voice plus a camera lens (below).

(from Wikipedia) ( dialog) ( video)

When designing AI systems, we need to consider what kind of development it is intended to operate in:

Many AI systems work well only because they operate in limited environments. A famous sheep-shearingn robot from the 1980's depended on the sheep being tied down to limit their motion. Face identification systems at security checkpoints may depend on the fact that people consistently look straight forwards when herded through the gates.

AI researchers often plan to build a fully-featured end-to-end system, e.g. a robot child. However, most research projects and published papers consider only one small component of the task, e.g.

Layout of the field

Core reasoning techniques

Application areas

Mathematical applications have largely become their own separate field, separate from AI. The same will probably happen soon to game playing. However, both applications played a large role in early AI.

We're doing to see popular current techniques (e.g. neural nets) but also selected historical techniques that may still be relevant.

History: major trends

AI results have consistently been overly hyped. E.g. Tesla claiming to have "fully autonomous driving" when a human still has to be at the wheel. This has led to boom and bust cycles, where optimisim leads to extensive funding and then failure to deliver leads to collapse in funding.

Over the decades, AI algorithms have gradually improved. Two main reasons:

The second has been the most important. Some approaches popular today were originally proposed in the 1940's but could not be made to work without sufficient computing power.

AI has cycled between two general approaches to algorithm design:

Most likely neither extreme position is correct.

A final tension involves the definition of what makes a system "intelligent"? This could involve doing tasks that humans find hard, e.g. playing chess, proving theorems. These often turn out to be easier for computers because the tasks are very well defined. By contrast, simulating a toddler is much harer for an AI program, because AI algorithms tend not to be robust. Sensing and decision-making algorithms have trouble adapting to new environments. Robots may work well in only limited environments, be unable to pick themselves up if they fall over, etc.

History

Since the Dawn of Time

Stories of artifically created beings date back at least to the 19th century.


Prague golem from livingprague.com

1930's and 1940's

First computers (Atanasoff 1941, Eniac 1943-44), able to do almost nothing.


ENIAC, mid 1940's (from Wikipedia)

On-paper models that are insightful but can't be implemented. Some of these foreshadow approaches that work now.

1950-1970's

Computers with functioning CPU but (by today's standards) very slow with absurdly little memory. E.g. the IBM 1130 (1965-72) came with up to 11M of main memory. The VAX 11/780 had many terminals sharing one CPU.


IBM 1130, late 1960's (from Wikipedia)


The Plato system from Grainger web pages, 2023

AI algorithms are very stripped down and discrete/symbolic. Chomsky's Syntactic Structures (1957) proposed very tight constraints on linguistic models in an attempt to make them learnable from tiny datasets.

1980's-1990's

Computers now look like tower PC's, eventually also laptops. Memory and disk space are still tight. Horrible peripherals like binary (black/white) screens and monochrome surveillance cameras. HTTP servers and internet browsers appear in the 1990's:

Lisp Machine, 1980's (from Wikipedia

NCSA Mosaic, 1990's (from Wikipedia)

AI starts to use more big-data algorithms, especially at sites with strong resources. First LDC (linguistic data consortium) datasets: 1993 Switchboard 1, TIMIT. Fred Jelinek starts to get speech recognition working usefully using statistical ngram models. Linguistic theories were still very discrete/logic based. Jelinek famously said in the mid/late 1980's: "Every time I fire a linguist, the performance of our speech recognition system goes up."

This century: it works except when it doesn't

Algorithms achieve strong performance by gobbling vast quantities of training data, e.g. Google's algorithms use the entire web to simulate a toddler. Neural nets "learn to the test," i.e. learn what's in the dataset but not always with constraints that we assume implicitly as humans. So strange unexpected lossage ("adversarial examples"). Speculation that learning might need to be guided by some structural constraints.


ChatGPT crochet pattern for a banana, from CNN, 2023


Meta's chatbot critizing its company's CEO (from the BBC, 2022)


Cruise car stuck in wet concrete from SFGate, 2023

Boston Dynamics robot falling down (from The Guardian, 2017)

AI systems tend to be excessively confident of their answers and their understanding of the context (e.g. what should they talk about). So far, there hasn't been enough progress on helping them figure out when they don't know the answer. Attempts to impose "guardrails" on their responses haven't worked well, because it has proven too easy to get around them.

Perhaps more worrying, training state of the art AI algorithms requires fantastic amounts of data, energy, and water. Not green. Also, there isn't any obvious way to significantly increase the amount of training data. So we may be hitting the limits of how much performance can be improved by brute force.

In this course, we're going to look at the history of the field, the state of the art, and also ideas for getting past some of the current limitations.