2015
DOI: 10.1007/978-3-319-21365-1_10
|View full text |Cite
|
Sign up to set email alerts
|

Stochastic Tasks: Difficulty and Levin Search

Abstract: Abstract. We establish a setting for asynchronous stochastic tasks that account for episodes, rewards and responses, and, most especially, the computational complexity of the algorithm behind an agent solving a task. This is used to determine the difficulty of a task as the (logarithm of the) number of computational steps required to acquire an acceptable policy for the task, which includes the exploration of policies and their verification. We also analyse instance difficulty, task compositions and decomposit… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2016
2016
2020
2020

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 11 publications
(12 reference statements)
0
4
0
Order By: Relevance
“…The key idea was defining intelligence test items using algorithmic information theory (Hernández-Orallo and Minaya-Collado, 1998), an approach that was followed by many other proposals in the next two decades, from the very influential "universal intelligence" (Legg and Hutter, 2007) to the recent "measure of intelligence" (Chollet, 2019). However, while some of these proposals have had an important impact on the understanding of what intelligence is, its relation to compression (Dowe et al, 2011), difficulty (Hernández-Orallo, 2015Hernandez-Orallo, 2015) and generality (Martinez-Plumed and Hernandez-Orallo, 2018), the adoption of some of these tests (or associated definitions) in practice has been very limited.…”
Section: Anymentioning
confidence: 99%
See 1 more Smart Citation
“…The key idea was defining intelligence test items using algorithmic information theory (Hernández-Orallo and Minaya-Collado, 1998), an approach that was followed by many other proposals in the next two decades, from the very influential "universal intelligence" (Legg and Hutter, 2007) to the recent "measure of intelligence" (Chollet, 2019). However, while some of these proposals have had an important impact on the understanding of what intelligence is, its relation to compression (Dowe et al, 2011), difficulty (Hernández-Orallo, 2015Hernandez-Orallo, 2015) and generality (Martinez-Plumed and Hernandez-Orallo, 2018), the adoption of some of these tests (or associated definitions) in practice has been very limited.…”
Section: Anymentioning
confidence: 99%
“…Whereas the development of measurement instruments that follow the adversarial testing is still incipient, and has not progressed significantly since (Hernández-Orallo and Dowe, 2010;, it adapts according to one or more dimensions, as per the transitional and universal cases in Figure 2. Assuming each dimension is defined by a difficulty metric (Mishra et al, 2013;Hernandez-Orallo, 2015;Martinez-Plumed and Hernandez-Orallo, 2018;Martínez-Plumed et al, 2019;Hernández-Orallo, 2020), we have a multidimensional space for which the adversarial testing can derive the location of the testee in this space. By doing this, similarities and clustering are calculated in this space, with no need of exploring all the n¢pn¡1q 2 combinations when n agents are being analysed.…”
Section: Building Behavioural Taxonomiesmentioning
confidence: 99%
“…We evaluate the more able agents with more difficult tasks. In order to do this, we calculate difficulty of a task as the complexity of the simplest policy that is successful for the task (Hernández-Orallo, 2015b). Complexity/simplicity is measured as a combination of the size of the policy and its execution time.…”
Section: Analysis Of Subpopulations Binned By Abilities and Difficultiesmentioning
confidence: 99%
“…general (video) game playing of a handful of 1 Classical planning has hierarchical task networks [4], but subtask decomposition is almost always done manually and there is no real analysis of tasks on a general level. Some people working on AI evaluation -one of task theory's primary applications -attempt to analyze some properties of task-environments, but they don't go beyond complexity and difficultyrelated measures [6].…”
Section: Introductionmentioning
confidence: 99%