public:t-720-atai:atai-21:engineering_assignment_1
Differences
This shows you the differences between two versions of the page.
public:t-720-atai:atai-21:engineering_assignment_1 [2021/09/16 10:45] – [Assignment 1.1: Deep Reinforcement Learning] thorisson | public:t-720-atai:atai-21:engineering_assignment_1 [2024/04/29 13:33] (current) – external edit 127.0.0.1 | ||
---|---|---|---|
Line 29: | Line 29: | ||
Run the code:\\ | Run the code:\\ | ||
$ python main.py | $ python main.py | ||
+ | | ||
+ | For the first task (Deep-Reinforcement-Learning) you will need to install PyTorch. Since the installation differs depending on your OS and on whether you have a GPU that supports CUDA (NVIDIA GPUs only), you should follow the installation instructions [[https:// | ||
+ | |here]]. | ||
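Once the installation has finished, a quick check can confirm that PyTorch is importable and whether it sees a CUDA device. The snippet below is only a minimal sketch and not part of the provided assignment code; the helper name `describe_torch_install` is hypothetical.

```python
import importlib.util


def describe_torch_install():
    """Report whether PyTorch is importable and which device it would use."""
    # find_spec returns None if the package cannot be found, so this
    # function also runs on a machine without PyTorch installed.
    if importlib.util.find_spec("torch") is None:
        return "PyTorch is not installed"
    import torch

    # torch.cuda.is_available() is True only with a CUDA-enabled build
    # and a supported NVIDIA GPU.
    device = "cuda" if torch.cuda.is_available() else "cpu"
    return f"PyTorch {torch.__version__} found, using device: {device}"


print(describe_torch_install())
```

If the message reports `cpu` on a machine with an NVIDIA GPU, the installed build is probably a CPU-only one.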
Zip Files:\\ | Zip Files:\\ | ||
{{: | {{: | ||
+ | {{: | ||
{{: | {{: | ||
Line 43: | Line 47: | ||
- Plot its improvement in performance over time. | - Plot its improvement in performance over time. | ||
- **Modified Version.** Evaluate the learner’s performance on a modified version of the cart-pole task. For this you should evaluate at least two of the following modifications of the environments and compare them to the results from 1.: | - **Modified Version.** Evaluate the learner’s performance on a modified version of the cart-pole task. For this you should evaluate at least two of the following modifications of the environments and compare them to the results from 1.: | ||
- | - Noise on observation/ action/ | + | - Noise on observation and action. |
- Hide each variable once (x, v, theta, omega) and run the setup with only three observables. | - Hide each variable once (x, v, theta, omega) and run the setup with only three observables. | ||
- Introduce extremely high noise on one observable at a time, for each of the four observables (three normal variables, one noisy variable). | - Introduce extremely high noise on one observable at a time, for each of the four observables (three normal variables, one noisy variable). | ||
- Change the task after a certain number of epochs. Think of at least three different changes; one is given as an example in the code. | - Change the task after a certain number of epochs. Think of at least three different changes; one is given as an example in the code. | ||
- Change the discreteness of time/observables by increasing or decreasing the variable resolution. | - Change the discreteness of time/observables by increasing or decreasing the variable resolution. | ||
+ | - Calculate the average score, median score, maximum score, and standard deviation of each task. | ||
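The summary statistics requested in the last item can be computed with Python's standard library. The following is a minimal sketch, assuming the scores of one task have been collected into a plain list; the function name `summarize_scores` and the example values are hypothetical and not taken from the provided code.

```python
import statistics


def summarize_scores(scores):
    """Return the mean, median, maximum and standard deviation of the scores."""
    if not scores:
        raise ValueError("scores must contain at least one value")
    return {
        "mean": statistics.fmean(scores),
        "median": statistics.median(scores),
        "max": max(scores),
        # Population standard deviation; use statistics.stdev instead
        # if the runs are treated as a sample of a larger population.
        "std": statistics.pstdev(scores),
    }


# Made-up scores, for illustration only.
example_scores = [10, 20, 30, 40]
summary = summarize_scores(example_scores)
print(summary)
```

Running the same function on the scores of the unmodified task and of each modified task gives directly comparable numbers.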
\\ | \\ | ||
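Several of the modifications listed above (noise on observations, hiding a variable, changing the resolution) amount to transforming the observation before the learner sees it. The following is a minimal sketch under that assumption; the helper names and the variable order (x, v, theta, omega) are illustrative and not taken from the provided code, which may structure this differently.

```python
import random

# Assumed order of the cart-pole observables, as listed in the assignment.
VARIABLES = ("x", "v", "theta", "omega")


def add_noise(obs, sigma, rng, only=None):
    """Add zero-mean Gaussian noise; if `only` is set, perturb just that variable."""
    return [
        value + rng.gauss(0.0, sigma) if only is None or name == only else value
        for name, value in zip(VARIABLES, obs)
    ]


def hide_variable(obs, hidden):
    """Drop one variable so that the learner sees only three observables."""
    return [value for name, value in zip(VARIABLES, obs) if name != hidden]


def discretize(obs, step):
    """Round every variable to a multiple of `step` to coarsen its resolution."""
    return [round(value / step) * step for value in obs]


rng = random.Random(0)  # seeded so that runs are repeatable
obs = [0.02, -0.31, 0.04, 0.27]  # made-up observation, for illustration only

noisy_theta = add_noise(obs, sigma=1.0, rng=rng, only="theta")
three_obs = hide_variable(obs, hidden="omega")
coarse = discretize(obs, step=0.1)
print(noisy_theta, three_obs, coarse)
```

Keeping each modification in its own small function makes it easy to switch them on one at a time and to compare each result against the unmodified baseline.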
Last modified: 2024/04/29 13:32 (external edit)