ORAUE - Optimal Rational Agents in Unknown Environments

People

Hutter F.

(Responsible)

Abstract

Goal of this project is to extend and deepen a recent theory of theoretically optimal universal agents interacting with unknown environments. We build on Solomonoff´s celebrated universal theory of induction to derive an optimal reinforcement learning agent, called AIXI, embedded in a world whose responses to the agent´s actions are sampled from a computable probability distribution -- this is the only very weak assumption. From an algorithmic complexity perspective, the AIXI model generalizes optimal passive universal induction to the case of active agents. From a decision theoretic perspective, the AIXI model is a suggestion of a new (implicit) "learning" algorithm, which may overcome all (except computational) problems of previous reinforcement learning algorithms. If the optimality theorems of universal induction and decision theory generalize to the unified AIXI model, we would have, for the first time, a universal (parameterless) model of an optimal rational agent in any computable but unknown environment with reinforcement feedback.

Additional information

Acronym

ORAUE

Start date

01.04.2003

End date

01.04.2005

Duration

25 Months

Funding sources

SNSF, Swiss National Science Foundation

Status

Ended

People

Education

Research

Organisation

ORAUE - Optimal Rational Agents in Unknown Environments

People

Abstract

Additional information

Faculties

Organizational units

Additional information

Maps and directions

Stay in touch