public:t-720-atai:atai-20:engineering_assignment_2
Differences
This shows you the differences between two versions of the page.
public:t-720-atai:atai-20:engineering_assignment_2 [2020/09/10 09:30] – thorisson | public:t-720-atai:atai-20:engineering_assignment_2 [2024/04/29 13:33] (current) – external edit 127.0.0.1 | ||
---|---|---|---|
Line 54: | Line 54: | ||
- omega: 0.2 rad/s | - omega: 0.2 rad/s | ||
- Set the environment to run asynchronously | - Set the environment to run asynchronously | ||
- | - Play the game on Conditions 1, 2, 3, and 4 in that order for at least 10 runs each and note for each condition your | + | - Play the game on Conditions 1, 2, 3, and 4 in that order for at least 10 epochs (or better phrased: until you are confident in playing in the condition, but for at least 10 epochs) |
- highest score, | - highest score, | ||
- average score and standard deviation, | - average score and standard deviation, | ||
- median score. | - median score. | ||
- | - Invert the forces by pressing the “i” key on your keyboard during a run (after 5-10 restarts/ fails) and continue for another 5-10 episodes. Do this in all four conditions in the same order as previously. (Redo instruction 3 with this inversion). What can you say about your learning speed with force inversion. | + | - Invert the forces by pressing the “i” key on your keyboard during a run (after 5-10 restarts/ fails) and continue for another 5-10 episodes. Do this in all four conditions in the same order as previously. (Redo instruction 3 with this inversion). What can you say about your learning speed with force inversion? |
- Apply the following settings to the environment (all of them at the same time): | - Apply the following settings to the environment (all of them at the same time): | ||
- Only the variables x, v, omega are observables. | - Only the variables x, v, omega are observables. | ||
Line 67: | Line 67: | ||
- Reset the settings back to the ones from the beginning and replay the game (as described in the third instruction). | - Reset the settings back to the ones from the beginning and replay the game (as described in the third instruction). | ||
- Compare your results from the first tries of instruction 3 with the others, especially the last tries of instruction 7. What can you conclude about the possibilities of cumulative, life-long learning? | - Compare your results from the first tries of instruction 3 with the others, especially the last tries of instruction 7. What can you conclude about the possibilities of cumulative, life-long learning? | ||
- | - Write a report on your results including the different scores and comparisons between the different tries. Compare your results with the results from the last assignment and discuss. Discuss the advantages, and disadvantages of human learning (and human nature) this might include (but is not restricted to): | + | - **Report**. |
- Previously acquired knowledge used in this game. | - Previously acquired knowledge used in this game. | ||
- Cumulative learning. | - Cumulative learning. |
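
The per-condition statistics requested in the instructions (highest score, average score and standard deviation, median score) can be computed with a short script. The following is a minimal sketch in Python using only the standard library. The function name `summarize_scores` and the scores in `example_scores` are hypothetical placeholders, not part of the assignment or real results, and the choice of the sample standard deviation (rather than the population one) is an assumption.

```python
import statistics


def summarize_scores(scores):
    """Summarize the scores of one condition (hypothetical helper).

    Returns the highest score, the mean, the sample standard deviation,
    and the median.
    """
    if len(scores) < 2:
        raise ValueError("need at least two scores to compute a standard deviation")
    return {
        "highest": max(scores),
        "mean": statistics.mean(scores),
        # Assumption: sample standard deviation (n - 1 in the denominator).
        "stdev": statistics.stdev(scores),
        "median": statistics.median(scores),
    }


# Placeholder values only; replace them with the scores you recorded.
example_scores = {
    "Condition 1": [12, 30, 25, 41, 18, 33, 27, 45, 38, 50],
    "Condition 2": [8, 15, 22, 19, 30, 28, 35, 31, 40, 37],
}

for condition, scores in example_scores.items():
    summary = summarize_scores(scores)
    print(
        f"{condition}: highest={summary['highest']}, "
        f"mean={summary['mean']:.2f}, stdev={summary['stdev']:.2f}, "
        f"median={summary['median']}"
    )
```

Keeping one list of scores per condition (and a separate list for the runs after force inversion) makes it straightforward to compare the early and late tries later on.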
Last modified: 2024/04/29 13:32 (external edit)