IT3160E
Introduction to Artificial Intelligence
Chapter 3: Problem solving
Advanced search methods
Lê Thanh Hương
School of Information and Communication Technology - HUST
Outline
• Local beam search
• Game and search
• Alpha-beta pruning
Local beam search
• Like greedy search, but keep k states at all times:
• Initially: k random states
• Next: determine all successors of the k states
• If any successor is a goal → finished
• Else select the k best successors and repeat.
(Figure: greedy search vs. beam search)
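A minimal Python sketch of this loop, assuming we maximize a state's value; `random_state`, `successors`, `value` and `is_goal` are illustrative callables, since the slides do not fix an interface:

```python
def local_beam_search(random_state, successors, value, is_goal, k=10, max_iters=1000):
    beam = [random_state() for _ in range(k)]        # initially: k random states
    for _ in range(max_iters):
        # determine all successors of the k states, pooled together
        candidates = [s for state in beam for s in successors(state)]
        for s in candidates:
            if is_goal(s):                           # any successor a goal -> finished
                return s
        if not candidates:
            break
        # else select the k best successors and repeat
        beam = sorted(candidates, key=value, reverse=True)[:k]
    return max(beam, key=value)                      # best state found so far
```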
Local beam search
• Major difference from random-restart search:
• Information is shared among the k search threads: if one state generates a good successor but the others do not → “come here, the grass is greener!”
• Can suffer from lack of diversity.
• Stochastic variant: choose the k successors randomly, with probability proportional to their value.
• The best choice in MANY practical settings
Games and search
• Why study games?
• Why is search a good idea?
• Major assumptions about games:
• Only an agent’s actions change the world
• World is deterministic and accessible
Why study games?
Machines are better than humans at: Othello
Humans are better than machines at: Go
Here: perfect-information zero-sum games
Why study games?
• Games are a form of multi-agent environment
• What do other agents do and how do they affect our success?
• Cooperative vs. competitive multi-agent environments.
• Competitive multi-agent environments give rise to adversarial search a.k.a. games
• Why study games?
• Fun; historically entertaining
• Interesting subject of study because they are hard
• Easy to represent; agents are restricted to a small number of actions
Relation of Games to Search
• Search – no adversary
• Solution is (heuristic) method for finding goal
• Heuristics and CSP techniques can find optimal solution
• Evaluation function: estimate of cost from start to goal through given node
• Examples: path planning, scheduling activities
• Games – adversary
• Solution is strategy (strategy specifies move for every possible opponent reply).
• Time limits force an approximate solution
• Evaluation function: evaluate “goodness” of game position
• Examples: chess, checkers, Othello, backgammon
• Ignoring computational complexity, games are a perfect application for a complete search.
• Of course, ignoring complexity is a bad idea, so games are a good place to study resource-bounded searches.
Types of Games
                        deterministic                    chance
perfect information     chess, checkers, go, othello     backgammon, monopoly
imperfect information   battleships, blind tic-tac-toe   bridge, poker, scrabble, nuclear war
Minimax
• Two players: MAX and MIN
• MAX moves first and the players take turns until the game is over. The winner gets a reward, the loser gets a penalty.
• Games as search:
• Initial state: e.g. board configuration of chess
• Successor function: list of (move,state) pairs specifying legal moves.
• Terminal test: Is the game finished?
• Utility function: Gives numerical value of terminal states.
• E.g. win (+1), lose (-1) and draw (0) in tic-tac-toe
• MAX uses search tree to determine next move.
• Perfect play for deterministic games
Minimax
• From among the moves available to you, take the best one
• The best one is determined by a search using the MiniMax strategy
Optimal strategies
◼ MAX maximizes a function: find a move corresponding to max value
◼ MIN minimizes the same function: find a move corresponding to min value
At each step:
◼ If a state/node corresponds to a MAX move, its value is the maximum of its children's values
◼ If a state/node corresponds to a MIN move, its value is the minimum of its children's values
Given a game tree, the optimal strategy can be determined by using the minimax value of
each node:
MINIMAX-VALUE(n) =
    UTILITY(n)                                       if n is a terminal node
    max over s ∈ successors(n) of MINIMAX-VALUE(s)   if n is a MAX node
    min over s ∈ successors(n) of MINIMAX-VALUE(s)   if n is a MIN node
Minimax algorithm
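A minimal Python sketch of the algorithm; the `game` object, with `is_terminal`, `utility`, `to_move` and `successors` (returning the (move, state) pairs defined earlier), is an assumed interface, not fixed by the slides:

```python
def minimax_value(state, game):
    """Minimax value of a state, per the recursive definition above."""
    if game.is_terminal(state):
        return game.utility(state)
    values = [minimax_value(s, game) for _, s in game.successors(state)]
    # MAX nodes take the maximum of their children, MIN nodes the minimum
    return max(values) if game.to_move(state) == 'MAX' else min(values)

def minimax_decision(state, game):
    """MAX's move: the successor with the highest minimax value."""
    return max(game.successors(state),
               key=lambda ms: minimax_value(ms[1], game))[0]
```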
Properties of minimax
• Complete? Yes (if tree is finite)
• Optimal? Yes (against an optimal opponent)
• Time complexity? O(b^m)
• Space complexity? O(b·m) (depth-first exploration)
• For chess, b ≈ 35, m ≈ 100 for "reasonable" games
→ exact solution completely infeasible
Problem of minimax search
• The number of game states is exponential in the number of moves.
➢Solution: Do not examine every node
Alpha-beta pruning:
• Remove branches that do not influence final decision
• Revisit example …
α-β pruning
◼ Alpha value: the best value achievable for MAX so far, hence a lower bound on MAX's outcome
◼ Beta value: the best value achievable for MIN so far, hence an upper bound on MAX's outcome
◼ At a MIN node: compare the node's current value V with alpha. If V ≤ alpha, pass the value to the parent node and BREAK (prune the remaining successors)
◼ At a MAX node: compare the node's current value V with beta. If V ≥ beta, pass the value to the parent node and BREAK (prune the remaining successors)
α-β pruning
α: the best value achievable for MAX
β: the best value achievable for MIN
α-β pruning example
At a MAX node: compare the node's current value V with β. If V ≥ β, pass the value to the parent node and BREAK
α-β pruning example
(Figures: α-β pruning applied step by step to an example game tree)
Properties of α-β
• Pruning does not affect final result
• Entire sub-trees can be pruned.
• Good move ordering improves the effectiveness of pruning. With "perfect ordering":
➢ time complexity = O(b^(m/2))
→ doubles the depth reachable in the same time
➢ effective branching factor of sqrt(b)!
➢ alpha-beta pruning can look ahead twice as far as minimax in the same amount of time
• Repeated states are again possible.
➢ Store them in memory: a transposition table
• A simple example of the value of reasoning about which computations are relevant (a
form of metareasoning)
Why is it called α-β?
• α is the value of the best (i.e., highest-value) choice found so far at any
choice point along the path for max
• If v is worse than α, max will avoid it
→ prune that branch
• Define β similarly for min
The α-β algorithm
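A minimal Python sketch of minimax search with alpha-beta pruning, reusing the assumed `game` interface from the minimax sketch above:

```python
import math

def alphabeta(state, game, alpha=-math.inf, beta=math.inf, maximizing=True):
    """Minimax value of `state`, pruning branches outside [alpha, beta]."""
    if game.is_terminal(state):
        return game.utility(state)
    if maximizing:
        value = -math.inf
        for _, s in game.successors(state):
            value = max(value, alphabeta(s, game, alpha, beta, False))
            if value >= beta:            # MIN would never let play reach here: prune
                break
            alpha = max(alpha, value)
    else:
        value = math.inf
        for _, s in game.successors(state):
            value = min(value, alphabeta(s, game, alpha, beta, True))
            if value <= alpha:           # MAX would never let play reach here: prune
                break
            beta = min(beta, value)
    return value
```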
Imperfect, real-time decisions
• Minimax and alpha-beta pruning require too many leaf-node evaluations.
• This may be impractical within a reasonable amount of time.
• Suppose we have 100 secs and can explore 10^4 nodes/sec
→ 10^6 nodes per move
• Standard approach (SHANNON, 1950):
• Cut off search earlier (replace TERMINAL-TEST by CUTOFF-TEST)
• Apply heuristic evaluation function EVAL (replacing utility function of alpha-beta)
Cut-off search
• Change:
if TERMINAL-TEST(state) then return UTILITY(state)
into:
if CUTOFF-TEST(state,depth) then return EVAL(state)
• Introduces a fixed depth limit
• The limit is selected so that the amount of time used will not exceed what the rules of the game allow.
• When cut-off occurs, the evaluation is performed.
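A sketch of this change applied to the alpha-beta sketch above; `DEPTH_LIMIT` and `game.eval` are assumed, illustrative names:

```python
import math

DEPTH_LIMIT = 4                      # illustrative fixed depth limit

def cutoff_test(state, depth, game):
    """CUTOFF-TEST: stop at terminal states or at the depth limit."""
    return game.is_terminal(state) or depth >= DEPTH_LIMIT

def h_alphabeta(state, game, depth=0, alpha=-math.inf, beta=math.inf, maximizing=True):
    if cutoff_test(state, depth, game):
        return game.eval(state)      # EVAL: heuristic estimate, not true utility
    if maximizing:
        value = -math.inf
        for _, s in game.successors(state):
            value = max(value, h_alphabeta(s, game, depth + 1, alpha, beta, False))
            if value >= beta:        # β-cutoff
                break
            alpha = max(alpha, value)
    else:
        value = math.inf
        for _, s in game.successors(state):
            value = min(value, h_alphabeta(s, game, depth + 1, alpha, beta, True))
            if value <= alpha:       # α-cutoff
                break
            beta = min(beta, value)
    return value
```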
Heuristic evaluation (EVAL)
• Idea: produce an estimate of the expected utility of the game from a given
position.
• Requirements:
➢ EVAL should order terminal-nodes in the same way as UTILITY.
➢ Computation must not take too long.
➢ For non-terminal states the EVAL should be strongly correlated with the actual chance of
winning.
• Example (tic-tac-toe):
Evaluation e(p) for each position p:
e(p) = (# rows, columns, diagonals still open for MAX)
     - (# rows, columns, diagonals still open for MIN)
A line is open for MAX if it contains no O, and open for MIN if it contains no X (MAX plays X, MIN plays O).
Reduced state space of tic-tac-toe, based on the symmetry of the states
(Figure: the first two plies of the game tree. MAX goes first, MIN replies; each position p is scored with e(p). The backed-up values at the MIN nodes are -1, 1 and -2, so MAX opens with the move whose backed-up value is 1.)
→ A kind of depth-first search
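A small Python sketch of the e(p) heuristic, assuming MAX plays x, MIN plays o, and a position is a 3x3 grid of 'x', 'o' or None:

```python
LINES = (
    [[(r, c) for c in range(3)] for r in range(3)]                  # rows
    + [[(r, c) for r in range(3)] for c in range(3)]                # columns
    + [[(i, i) for i in range(3)], [(i, 2 - i) for i in range(3)]]  # diagonals
)

def e(p):
    """(# lines still open for MAX) - (# lines still open for MIN)."""
    open_for_max = sum(all(p[r][c] != 'o' for r, c in line) for line in LINES)
    open_for_min = sum(all(p[r][c] != 'x' for r, c in line) for line in LINES)
    return open_for_max - open_for_min

# Example: x in the centre of an empty board leaves all 8 lines open for MAX
# but only 4 open for MIN, so e(p) = 8 - 4 = 4.
```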
Evaluation function example
• For chess, typically linear weighted sum of features
Eval(s) = w1 f1(s) + w2 f2(s) + … + wn fn(s)
• e.g., w1 = 9 with
f1(s) = (number of white queens) – (number of black queens), etc.
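A sketch of such a weighted sum; the material features, weights, and the `count` helper below are illustrative, not prescribed by the slides:

```python
# Classic illustrative piece values as weights w_i
WEIGHTS = {'queen': 9, 'rook': 5, 'bishop': 3, 'knight': 3, 'pawn': 1}

def eval_linear(board, count):
    """Eval(s) = w1*f1(s) + ... + wn*fn(s),
    with f_i = (# white pieces of type i) - (# black pieces of type i);
    `count(board, piece, colour)` is an assumed helper."""
    return sum(w * (count(board, piece, 'white') - count(board, piece, 'black'))
               for piece, w in WEIGHTS.items())
```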
Chess complexity
• A PC can search 200 million nodes per 3 minutes.
• Branching factor: ~35
• 35^5 ≈ 50 million
➢ with minimax we could look ahead only about 5 plies; a 5-ply program is defeated by an average player, who can plan 6-8 plies.
• Does it work in practice?
• 4-ply ≈ human novice → hopeless chess player
• 8-ply ≈ typical PC, human master
• 12-ply ≈ Deep Blue, Kasparov
• To reach grandmaster level, one needs an extensively tuned evaluation function and a large database of optimal opening and endgame play
Deterministic games in practice
• Checkers: Chinook ended 40-year-reign of human world champion Marion Tinsley in 1994.
Used a precomputed endgame database defining perfect play for all positions involving 8 or
fewer pieces on the board, a total of 444 billion positions.
• Chess: Deep Blue defeated human world champion Garry Kasparov in a six-game match in
1997. Deep Blue searches 200 million positions per second, uses very sophisticated
evaluation, and undisclosed methods for extending some lines of search up to 40 ply.
• Othello: human champions refuse to compete against computers, who are too good.
• Go: human champions refuse to compete against computers, who are too bad. In go, b > 300,
so most programs use pattern knowledge bases to suggest plausible moves.
Nondeterministic games
• Chance is introduced by dice, card-shuffling, coin-flipping, ...
• Example with coin-flipping:
(Figure: a game tree with chance nodes for the coin flips)
Backgammon
Possible moves: (5-10, 5-11), (5-11, 19-24), (5-10, 10-16) and (5-11, 11-16)
Expected minimax value
EXPECTED-MINIMAX-VALUE(n) =
    UTILITY(n)                                                     if n is a terminal node
    max over s ∈ successors(n) of EXPECTED-MINIMAX-VALUE(s)        if n is a MAX node
    min over s ∈ successors(n) of EXPECTED-MINIMAX-VALUE(s)        if n is a MIN node
    Σ over s ∈ successors(n) of P(s) · EXPECTED-MINIMAX-VALUE(s)   if n is a chance node
where P(s) is the probability that s occurs
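A Python sketch of this definition; `node_type` and `probability` are assumed extensions of the `game` interface used earlier:

```python
def expectiminimax(state, game):
    """Expected minimax value over MAX, MIN, and chance nodes."""
    if game.is_terminal(state):
        return game.utility(state)
    values = [(s, expectiminimax(s, game)) for _, s in game.successors(state)]
    kind = game.node_type(state)          # 'MAX', 'MIN', or 'CHANCE' (assumed)
    if kind == 'MAX':
        return max(v for _, v in values)
    if kind == 'MIN':
        return min(v for _, v in values)
    # chance node: probability-weighted average of successor values
    return sum(game.probability(s) * v for s, v in values)
```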
Games of imperfect information
• E.g., card games, where opponent's initial cards are unknown
• Typically we can calculate a probability for each possible deal
• Seems just like having one big dice roll at the beginning of the game
• Idea: compute the minimax value of each action in each deal, then choose the action with
highest expected value over all deals
• Special case: if an action is optimal for all deals, it's optimal.
• GIB, the current best bridge program, approximates this idea by
➢ generating 100 deals consistent with bidding information
➢ picking the action that wins most tricks on average
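A sketch of this sampling idea; `sample_deal`, `legal_actions` and `minimax_value_after` are assumed, illustrative helpers (the slides do not describe GIB's internals):

```python
def choose_action(observation, game, n_deals=100):
    """Score each action by its average minimax value over sampled deals."""
    deals = [game.sample_deal(observation) for _ in range(n_deals)]

    def avg_value(action):
        return sum(game.minimax_value_after(action, deal) for deal in deals) / n_deals

    return max(game.legal_actions(observation), key=avg_value)
```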