public:t-720-atai:atai-20:engineering_assignment_1
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
public:t-720-atai:atai-20:engineering_assignment_1 [2020/08/24 09:49] – thorisson | public:t-720-atai:atai-20:engineering_assignment_1 [2024/04/29 13:33] (current) – external edit 127.0.0.1 | ||
---|---|---|---|
Line 20: | Line 20: | ||
- Run the learner repeatedly; collect the data. Stop each run when either 1000 epochs are reached or the agent manages to get more than 200 iterations in average per epoch over at least 100 continuous epochs (this is usually the case at around 400-500 epochs). | - Run the learner repeatedly; collect the data. Stop each run when either 1000 epochs are reached or the agent manages to get more than 200 iterations in average per epoch over at least 100 continuous epochs (this is usually the case at around 400-500 epochs). | ||
- Plot its improvement in performance over time. | - Plot its improvement in performance over time. | ||
- | - **Modified Version.** Evaluate the learner’s performance on a modified version of the cart-pole task. For this you should evaluate at least 3 of the following modifications of the environments: | + | - **Modified Version.** Evaluate the learner’s performance on a modified version of the cart-pole task. For this you should evaluate at least 3 of the following modifications of the environments |
- Noise on observation/ | - Noise on observation/ | ||
- Hide each variable once (x, v, theta, omega) and run the setup with only three observables. | - Hide each variable once (x, v, theta, omega) and run the setup with only three observables. | ||
Line 26: | Line 26: | ||
- Change the task after a certain amount of epochs. Think of at least three different changes, one is given as an example in the code. | - Change the task after a certain amount of epochs. Think of at least three different changes, one is given as an example in the code. | ||
- Change the discreteness of time/ observables increasing or decreasing the variable resolution. | - Change the discreteness of time/ observables increasing or decreasing the variable resolution. | ||
- | - And compare them to the results from 1. | ||
- **New Task-Environment.** Design your own simple task-environment in which you can show your own ideas of complexity of task-environments which might not have been included in the cart-pole. | - **New Task-Environment.** Design your own simple task-environment in which you can show your own ideas of complexity of task-environments which might not have been included in the cart-pole. | ||
- **Report.** Write a 1-2 page report where you describe your results. Draw some insights in relation to learning in general and try to make some generalizations based on them, and discuss, e.g.: | - **Report.** Write a 1-2 page report where you describe your results. Draw some insights in relation to learning in general and try to make some generalizations based on them, and discuss, e.g.: | ||
Line 32: | Line 31: | ||
- Which changes will be // | - Which changes will be // | ||
- What is your opinion of the // | - What is your opinion of the // | ||
- | - Is this in nay way similar to how humans learn? If ' | + | - Is this in any way similar to how humans learn? If ' |
- ...more | - ...more | ||
Line 50: | Line 49: | ||
$ python main.py | $ python main.py | ||
- | Zip File: {{: | + | Zip Files:\\ |
+ | {{: | ||
+ | {{: | ||
+ | |||
+ | \\ | ||
+ | \\ | ||
=== Further information === | === Further information === |
/var/www/cadia.ru.is/wiki/data/attic/public/t-720-atai/atai-20/engineering_assignment_1.1598262543.txt.gz · Last modified: 2024/04/29 13:32 (external edit)