Programming Assignment 2 - Adversarial Search

Use Piazza if you have any questions or problems with the assignment. Start early, so you still have time to ask in case of problems!

Problem Description

Implement an agent that is able to play the game of Breakthrough. This game is a simplified version of chess.

The game is played on an grid if width W (<latex>$2 \leq W \leq 10$</latex>) and height H (<latex>$4 \leq H \leq 10$</latex>). The initial state is setup such that both players have two rows of pawns of their respective color. White's paws are on rows 1 and 2 while black's pawns are on rows H-1 and H. The two players “white” and “black” take turns in moving one of their pawns. White moves first. Pawns move like in chess, that is, they can move one spot straight forward (up for white or down for black) onto an empty square or they can capture an opponents piece by moving one spot diagonally forward. As opposed to chess, pawns on the second row can not move two spaces at once. The goal of the game is to advance any one pawn to the opposite side of the board (i.e., to promote a pawn). The game ends, if one of the players has reached his goal or if the player who's turn it is does not have any legal move. This can happen if he does not have any pieces left, or if all his pawns are in positions where they cannot move. The game ends in a draw if none of the players wins. The scores are 100 points for winning, 50 points for a draw and 0 points for losing.

The legal moves for the agent are called “(move x1 y1 x2 y2)” meaning that the pawn at x1,y1 is moved to x2,y2. x1,x2 are integers between 1 and W. y1,y2 are integers between 1 and H.

Tournament

The agents that were created in class participated in a tournament against each other. There were 16 agents competing in the first round. In that round, every agent played on the 3×5 and the 5×5 boards against everyone else, once as white and once as black. The top 7 of the first round (because there was a distinctive gap between 7th and 8th place) got to compete in a final round on the 8×8 board.

The results of the first round can be seen here. The results of the final round can be seen here.

Group	Player Name	Average Score
Fanney Sigurðardóttir	ai16_bt_10	(82.76) FINALIST - 79.2
Guðni Fannar Kristjánsson
Hrafn Orri Hrafnkelsson
Kristinn Þorri Þrastarson
Guðmundur Harðarson	ai16_bt_2	(76.72) FINALIST - 75.0
Andri Ívarsson
Björn Ingi Baldvinsson
Andri Már Þórhallsson	ai16_bt_4	(74.14) FINALIST - 62.5
Ásgeir Þór Másson
Karl Ingi Karlsson
Sindri Már Kaldal Sigurjónsson	ai16_bt_15	(70.69) FINALIST - 58.3
Eysteinn Gunnlaugsson
Magnús Sigurðarson
Brynjar Ólafsson	ai16_bt_5	(62.93) FINALIST - 54.2
Tryggvi Þór Guðmundsson
Hörður Már Hafsteinsson
Ari Þórðarson
Eva Segarra Raro	ai16_bt_9	(64.66) FINALIST - 12.5
Jakub Mackovic
Roser Sanchez Todo
Kári Eiríksson	ai16_bt_3	(66.38) FINALIST - 8.3
Magnús Vilhelm Björnsson
Andri Már Ómarsson
Sverrir Magnússon
Eiður Sveinn Gunnarsson	ai16_bt_8	48.28
Orri Ólafsson
Jóhannes Páll Magnússon
Ingimar Örn Oddsson	ai16_bt_1	44.83
Alexander Björnsson
Johan Ejstrud
Kristján Hreinn Bergsson	ai16_bt_14	41.38
Egill Anton Hlöðversson	ai16_bt_13	40.52
Jón Gísli Björgvinsson	ai16_bt_13	40.52
Atli Freyr Einarsson	ai16_bt_7	26.72
Guðjón Geir Jónsson
Gunnar Karl Pálmason
Steinar Ágúst Steinarsson
Grímur Kristinsson	ai16_bt_11	25.86
Natan Örn Ólafsson
Sölvi Hjaltason
Kristján Harðarson
Guðrún Inga Baldursdóttir	ai16_bt_12	24.14
Björn Ingemar Elfström
Davíð Freyr Jónsson
Quang Van Nguyen	ai16_bt_6	20.69
Einar Hallberg Ragnarsson
Ásgeir Frímannsson
Janus Þór Kristjánsson
Tinna Frímann Jökulsdóttir	ai16_bt_16	10.0
Dagur Arinbjörn Daníelsson	ai16_bt_16	10.0

Tasks

Develop a model of the environment. What constitutes a state of the environment? What is a successor state resulting of executing an action in a certain state? Which action is legal under which conditions? (Note that the board size is flexible and your agent should be able to handle games with different board sizes within the given restrictions for width and height).
Implement a state evaluation function for the game. You can start with the following simple evaluation function for white (and use the negation for black): 50 - <distance of most advanced white pawn to row H> + <distance of most advanced black pawn to row 1>
Implement iterative deepening alpha-beta search and use this state evaluation function to evaluate the leaf nodes of the tree.
Keep track of and output the number of state expansions, current depth limit of your iterative deepening loop and runtime of the search for each iteration of iterative deepening and in total.
Improve the state evaluation function or implement a better one.
Test if it is really better by pitching two agents (one with each evaluation function) against each other or by pitching each evaluation function against a random agent. If you run the experiments with the random agent, you need to repeat the experiment a decent number of times to get significant results. Don't forget to switch sides because white has an advantage in the game.
Run the experiments with different board sizes and time constraints (play clock) of 1s and 10s.
Make your code fast! The more state expansions you get per second, the better the player. Ideally, you should be able to solve the small boards (3×5, 5×5) in 10s.

Material

java project for the agent development

The files in the archive are similar to those in the first programming assignment. The archive contains code for implementing an agent in the src directory. The agent is actually a server process which listens on some port and waits for a game simulator or a game playing robot to send a message. It will then reply with the next action the agent wants to execute.

The zip file also contains the description of the environment for different board sizes (breakthrough_XxY.gdl) and a two simulators (gamecontroller-gui.jar and kiosk.jar). To test your agent:

Start the simulator (execute gamecontroller-gui.jar with either double-click or using the command “java -jar gamecontroller-gui.jar” on the command line)
Setup the simulator as shown in this picture:
You can use your player as both the first and the second role of the game, just not at the same time. To let two instances of your agent play against each other, start your agent twice with different ports to listen on and use the respective ports in the simulator.
Run the “Main” class in the project. If you added your own agent class, make sure that it is used in the main method of Main.java. You can also execute the “ant run” on the command line, if you have Ant installed. The output of the agent should say “NanoHTTPD is listening on port 4001”, which indicates that your agent is ready and waiting for messages to arrive on the specified port.
Now push the “Start” button in the simulator and your agent should get some messages and reply with the actions it wants to execute. At the end, the output of the simulator tells you how many points both players got: “Game over! results: 0 100”, the first number is for white and the second for black.
If the output of the simulator contains any line starting with “SEVERE”, something is wrong. The two most common problems are the network connection (e.g., due to a firewall) between the simulator and the agent or the agent sending illegal moves.
You can also use the kiosk to play against your agent. The kiosk does have visualization of the game, but it does not allow you to let two instances of the agent play against each other.

Hints

For implementing your agent:

Add a new class that implements the “Agent” interface. Look at RandomAgent.java to see how this is done.
You have to implement the methods “init” and “nextAction”. “init” will be called once at the start and should be used to initialize the agent. You will get the information, which role your agent is playing (white or black) and how much time the agent has for computing each move. “nextAction” gets the previous move as input and has to return the next action the agent is supposed to execute within the given time limit. “nextAction” is called for every step of the game. If it is not your players turn return “NOOP”.
Make sure your agent is able to play both roles (white and black)!
You can make sure to be on time by regularly checking whether there is time left during the search process and stopping the search just before you run out of time, e.g., by throwing an exception that you catch where you call the search function the first time.
Your agent should be able to play the small boards perfectly (at least the 3×5, probably also the 5×5), with a playclock of around 10 seconds. You will need a decent heuristics to play the bigger boards well.
To specify the port your agent is running on change the build.xml file as follows and use the command line ant -Dport=PORT run with PORT being the port number:

build.xml

      ...
      <target name="run" depends="dist">
              <java jar="${dist}/${projectname}.jar" fork="true">
                      <arg value="${port}"/> <!-- add this line here! -->
                      <jvmarg value="-Xmx500m" />
              </java>        
              <antcall target="clean" />
      </target>
      ...

Handing In

Please hand in a zip file containing:

the complete src folder containing the source code for your agent, but also the other classes that came with the project
a pdf with a short description of your heuristic and the results of the experiments and the conclusions you draw from them

The deadline is 29.02.2016. We will have a tournament between your agents afterwards. Extra points for the top players!

Center for Analysis and Design of Intelligent Agents

Table of Contents