public:t-720-atai:atai-19:lecture_notes_architectures
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
public:t-720-atai:atai-19:lecture_notes_architectures [2019/10/04 07:32] – thorisson | public:t-720-atai:atai-19:lecture_notes_architectures [2024/04/29 13:33] (current) – external edit 127.0.0.1 | ||
---|---|---|---|
Line 52: | Line 52: | ||
====Self-Programming ==== | ====Self-Programming ==== | ||
- | ==== Self-Programming ==== | + | |
- | | What it is | // | + | | What it is | // |
| Self-Generated Program | | Self-Generated Program | ||
| Historical note | Concept of self-programming is old (J. von Neumann one of the first to talk about self-replication in machines). However, few if any proposals for how to achieve this has been fielded. | | Historical note | Concept of self-programming is old (J. von Neumann one of the first to talk about self-replication in machines). However, few if any proposals for how to achieve this has been fielded. | ||
Line 69: | Line 69: | ||
| Why? | Building a machine that can write (sensible, meaningful!) programs means that machine is smart enough to understand the code it produces. If the purpose of its programming is to // | | Why? | Building a machine that can write (sensible, meaningful!) programs means that machine is smart enough to understand the code it produces. If the purpose of its programming is to // | ||
| What can we do? | We must create a programming language with //simple enough// semantics so that a simple machine (perhaps with some clever emergent properties) can use it to bootstrap itself in learning to write programs. | | What can we do? | We must create a programming language with //simple enough// semantics so that a simple machine (perhaps with some clever emergent properties) can use it to bootstrap itself in learning to write programs. | ||
- | | Does such a language exist? | + | | Does such a language exist? |
\\ | \\ | ||
\\ | \\ | ||
- | [S] State-space search (example: GPS (Newell 1963). The atomic actions are state-changing operators, and a program is represented as a path from the initial state to a final state. Variants of this approach include program search (examples: Gödel Machine (Schmidhuber 2006)): Given the action set A, in principle all programs formed by it can be exhaustively listed and evaluated to find an optimal one according to certain criteria. | ||
- | |||
- | [P] Production system (example: SOAR (Laird 1987)). Each production rule specifies the condition for a sequence of actions that correspond to a program. Mechanisms that produce new production rules, such as chunking, can be considered self-programming. | ||
- | [R] Reinforcement learning (example: AIXI (Hutter 2007)). When an action of an agent changes the state of the environment, | ||
- | |||
- | [G] Genetic programming (example: Koza’s Invention Machine (Koza et al. 2000). A program is formed from the system’s actions, initially randomly but subsequently via genetic operators over the best performers from prior solutions, possibly by using the output of some actions as input of some other actions. An evolution process provides a utility function that is used to select the best programs, and the process is repeated. | ||
- | |||
- | |||
- | [I] Inductive logic programming (c.f. Muggleton 1994). A program is a statement with a procedural interpretation, | ||
- | Besides the above fairly well-known AI approaches, we add here two less known ones that have recently become relevant in the context of self-programming: | ||
- | [E] Evidential reasoning (example: NARS (Wang 2006)). A program is a statement with a procedural interpretation, | ||
- | |||
- | [A] Autocatalysis (example: Ikon Flux (Nivel 2007)). In this context the architecture is in large part comprised of a large collection of models, acting as hierarchically organized controllers, | ||
\\ | \\ | ||
Line 94: | Line 81: | ||
====The SOAR Architecture==== | ====The SOAR Architecture==== | ||
- | |||
- | ===SOAR=== | ||
| What it is | One of the oldest cognitive architectures in history. | | What it is | One of the oldest cognitive architectures in history. | ||
Line 112: | Line 97: | ||
2013(c)Helgi P. Helgason | 2013(c)Helgi P. Helgason | ||
+ | \\ | ||
\\ | \\ | ||
+ | \\ | ||
+ | \\ | ||
+ | |||
+ | ====The AERA System==== | ||
+ | |||
+ | The Auto-catalytic Endogenous Reflective Architecture – AERA – is an AGI-aspiring architectural blueprint that was produced as part of the HUMANOBS FP7 project. It encompasses several fundamentally new ideas in the history of AI, including a new programming language specifically conceived to solve some major limitations of prior efforts in this respect, including self-inspection and self-representation, | ||
+ | |||
+ | AERA's knowledge is stored in models, which essentially encode transformations on input, to produce output. Models have a trigger side (left-hand side) and a result side (right-hand side). In a forward-chaining scenario, when a particular piece of data matches on the left hand of a model (it is only allowed to test the match if the data has high enough saliency and the program has sufficient activation) the model fires, producing the output specified by its left-hand side and injecting it into a global memory store. The semantics of the output is prediction, and the semantics of the input is either fact or prediction. Notice that a model in AERA is not a production rule; a model relating A to B does not mean “A entails B”, it means A predicts B, and it has an associated confidence value. Such models stem invariably (read: most of the time) from the system' | ||
+ | |||
+ | In backward-chaining – to implement the process of abduction – models act the other way around, namely, when some data match the right-hand side, a model produces new data patterned after its left side, whose semantics essentially state that “if you want a B (on the right-hand side) perhaps it would help to get an A (term on the left-hand side). The semantics of either (both the input and output) is “goal”. | ||
+ | |||
+ | A key principle of AERA operation is that of a distributed production process: Each program in AERA has a level of activation that determines if it is allowed to run or not. Every piece of data has a corresponding saliency level that determines how visible it is inside the system. In AERA there is a single memory, but it (typically) embeds groups that allow sets of data and programs to be addressed, e.g. changing their activation, saliency, or existence (via creation or deletion). | ||
+ | |||
\\ | \\ | ||
\\ | \\ | ||
Line 131: | Line 130: | ||
\\ | \\ | ||
- | ====Autonomous Model Acquisition==== | ||
- | | What it is | The ability to create a model of some target phenomenon // | ||
- | | Challenge | ||
\\ | \\ | ||
\\ | \\ | ||
- | ====Model Acquisition Function==== | ||
- | | {{public: | ||
- | | The agent has a model generation function < | ||
- | | {{public: | ||
- | | A learning agent is situated so as to perceive the effects of the relationships between variables. \\ The agent observes the interaction between the variables for a while, rendering some data about their relations (but not enough to be certain about it, and certainly not enough to create a complete model of it). \\ This generates hypotheses about the relation between variables, in the form of candidate relational models of the observed events. | ||
- | |||
- | \\ | ||
- | \\ | ||
- | ==== Model Generation & Evaluation ==== | ||
- | |||
- | | {{public: | ||
- | | Based on prior observations, | ||
- | | {{public: | ||
- | | The agent' | ||
- | | {{public: | ||
- | | The result of feedback (reinforcement) may result in the deletion, rewriting, or some other modification of the original model selected for prediction. Here the feedback has resulted in a modified model < | ||
- | |||
- | \\ | ||
- | \\ | ||
- | |||
- | |||
- | |||
- | \\ | ||
- | \\ | ||
- | |||
- | |||
- | |||
- | ====Demo Of AERA In Action==== | ||
- | | Demos | The most complex demo of an AERA system was the S1 agent learning to do an interview (in the EU-funded HUMANOBS research project). [[http:// | ||
- | | TV Interview | ||
- | | Data | S1 received realtime timestamped data from the 3D movement of the humans (digitized via appropriate tracking methods at 20 Hz), words generated by a speech recognizer, and prosody (fundamental pitch of voice at 60 Hz, along with timestamped starts and stops). | ||
- | | Seed | The seed consisted of a handful of top-level goals for each agent in the interview (interviewer and interviewee), | ||
- | | What Was Given | * actions: grab, release, point-at, look-at (defined as event types constrained by geometric relationships) \\ * stopping the interview clock ends the session \\ * objects: glass-bottle, | ||
- | | What Had To Be Learned | ||
- | | Result | ||
- | |||
- | |||
- | \\ | ||
- | \\ | ||
- | |||
- | ====The X Architecture==== | ||
- | |||
- | TBD | ||
- | \\ | ||
- | \\ | ||
- | \\ | ||
- | \\ | ||
- | 2018(c)K. R. Thórisson | + | 2019(c)K. R. Thórisson |
\\ | \\ | ||
\\ | \\ | ||
//EOF// | //EOF// |
/var/www/cadia.ru.is/wiki/data/attic/public/t-720-atai/atai-19/lecture_notes_architectures.1570174364.txt.gz · Last modified: 2024/04/29 13:32 (external edit)