public:t_720_atai:atai-18:lecture_notes_evaluation

Differences

This shows you the differences between two versions of the page.

public:t_720_atai:atai-18:lecture_notes_evaluation [2018/10/08 13:00] – [State of the Art] thorisson
public:t_720_atai:atai-18:lecture_notes_evaluation [2024/04/29 13:33] (current) – external edit 127.0.0.1
Line 107: Line 107:
 \\ \\
  
-====State of the Art====
-|  Summary  |  Practically all proposals to date for evaluating intelligence leave out some major aspects of intelligence. Virtually no proposals exist for evaluation of knowledge transfer, attentional capabilities, knowledge acquisition, knowledge capacity, knowledge retention, multi-goal learning, social intelligence, creativity, reasoning, cognitive growth, and meta-learning / integrated cognitive control -- all of which are quite likely vital to achieving general intelligence on par with humans.  |
-|  What is needed  |  A theory of intelligence that allows us to construct adequate, thorough, and comprehensive tests of intelligence and intelligent behavior.  |
-|  What can be done  |  In lieu of such a theory (which still is not forthcoming after over 100 years of psychology and 60 years of AI) we could use a multi-dimensional "Lego" kit for exploring various means of measuring intelligence and intelligent performance, so as to be able to evaluate the pros and cons of various approaches, methods, scales, etc.  |
-
+
+====Example Frameworks for Evaluating AI Systems====
+|  \\ \\ Merlin  |  A significant problem facing researchers in reinforcement and multi-objective learning is the lack of good benchmarks. Merlin (for Multi-objective Environments for Reinforcement LearnINg) is a software tool and method for enabling the creation of random problem instances, including multi-objective learning problems, with specific structural properties. Merlin provides the ability to control task features in predictable ways, allowing researchers to build a more detailed understanding of which features of a problem interact with a given learning algorithm, improving or degrading its performance.  |  [[http://alumni.media.mit.edu/~kris/ftp/Tunable-generic-Garrett-etal-2014.pdf|Paper]] by Garrett et al.  |
+|  \\ FRaMoTEC  |  Framework that allows modular construction of physical task-environments for evaluating intelligent control systems. A proto-task theory on which the framework is built aims for a deeper understanding of tasks in general, with a future goal of providing a theoretical foundation for all resource-bounded real-world tasks. Tasks constructed in the framework can be rooted in physics, allowing us to analyze, through their execution, the performance of control systems in terms of expended time and energy.  |  [[http://alumni.media.mit.edu/~kris/ftp/EGPAI_2016_paper_8.pdf|Paper]] by Thorarensen et al.  |
+|  AI Gym  |  Gym is a toolkit developed by OpenAI for developing and comparing reinforcement learning algorithms. It supports teaching agents everything from walking to playing games like Pong or Pinball.  |  [[https://gym.openai.com|Link]] to Website.  |
  
 \\ \\
 \\ \\
  
-====Example Frameworks for Evaluating AI Systems====
-|  Merlin  |  A significant problem facing researchers in reinforcement and multi-objective learning is the lack of good benchmarks. Merlin (for Multi-objective Environments for Reinforcement LearnINg) is a software tool and method for enabling the creation of random problem instances, including multi-objective learning problems, with specific structural properties. Merlin provides the ability to control task features in predictable ways, allowing researchers to build a more detailed understanding of which features of a problem interact with a given learning algorithm, improving or degrading its performance.  |  [[http://alumni.media.mit.edu/~kris/ftp/Tunable-generic-Garrett-etal-2014.pdf|Paper]] by Garrett et al.  |
-|  FRaMoTEC  |  Framework that allows modular construction of physical task-environments for evaluating intelligent control systems. A proto-task theory on which the framework is built aims for a deeper understanding of tasks in general, with a future goal of providing a theoretical foundation for all resource-bounded real-world tasks. Tasks constructed in the framework can be rooted in physics, allowing us to analyze, through their execution, the performance of control systems in terms of expended time and energy.  |  [[http://alumni.media.mit.edu/~kris/ftp/EGPAI_2016_paper_8.pdf|Paper]] by Thorarensen et al.  |
-|  AI Gym  |  Gym is a toolkit developed by OpenAI for developing and comparing reinforcement learning algorithms. It supports teaching agents everything from walking to playing games like Pong or Pinball.  |  [[https://gym.openai.com|Link]] to Website.  |
-
+
+====State of the Art====
+|  Summary  |  Practically all proposals to date for evaluating intelligence leave out some major aspects of intelligence. Virtually no proposals exist for evaluation of knowledge transfer, attentional capabilities, knowledge acquisition, knowledge capacity, knowledge retention, multi-goal learning, social intelligence, creativity, reasoning, cognitive growth, and meta-learning / integrated cognitive control -- all of which are quite likely vital to achieving general intelligence on par with humans.  |
+|  What is needed  |  A theory of intelligence that allows us to construct adequate, thorough, and comprehensive tests of intelligence and intelligent behavior.  |
+|  What can be done  |  In lieu of such a theory (which still is not forthcoming after over 100 years of psychology and 60 years of AI) we could use a multi-dimensional "Lego" kit for exploring various means of measuring intelligence and intelligent performance, so as to be able to evaluate the pros and cons of various approaches, methods, scales, etc. \\ Some sort of kit meeting part or all of the requirements listed above would go a long way to bridging the gap, and possibly generate some ideas that could speed up theoretical development.  |
  
 \\ \\
/var/www/cadia.ru.is/wiki/data/attic/public/t_720_atai/atai-18/lecture_notes_evaluation.1539003626.txt.gz · Last modified: 2024/04/29 13:33 (external edit)
