-
Value Iteration and Q-LearningMDPs, Bellman equations, batch value iteration, tabular Q-learning with epsilon-greedy exploration, and approximate Q-learning with feature weights.
9 min -
Minimax, Alpha-Beta, and ExpectimaxMulti-agent search in Pacman. Minimax with depth-limited game trees, alpha-beta pruning, expectimax for random ghosts, and evaluation functions.
8 min -
DFS, BFS, UCS, and A* in PacmanImplementing graph search algorithms in the Berkeley Pacman framework. DFS, BFS, UCS, A*, plus heuristics for corners and food.
8 min
Back