Comparing Humans and AI Agents

Insa-Cabrera, Javier; Dowe, David L.; España, Sergio; Hernández-Lloreda, Marı́a Victoria; Hernández-Orallo, José

doi:10.1007/978-3-642-22887-2_13

Cited by 33 publications

(59 citation statements)

References 16 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…The first evaluations using these tests [7] show that they work well at evaluating very different agents (humans and RL algorithms), but they do not properly reflect their supposed difference in intelligence. Many possible explanations are suggested in [7], with incremental knowledge acquisition and social intelligence being two of the abilities which this test is not giving enough importance.…”

Section: Universal Tests and Social Intelligencementioning

confidence: 99%

“…Some preliminary results of this evaluation [7] show that the setting is able to compare and evaluate different kinds of agents, but it fails at placing them on the same scale, since humans usually get similar scores to those of other relatively simple agents. One possible explanation for these results is that it is virtually impossible to find other agents in the test, so social intelligence is not measured.…”

Section: I2mentioning

confidence: 99%

“…Many possible explanations are suggested in [7], with incremental knowledge acquisition and social intelligence being two of the abilities which this test is not giving enough importance.…”

Section: Universal Tests and Social Intelligencementioning

confidence: 99%

“…Environments are composed of a space of cells (a graph of nodes) and the patterns for Good and Evil (a simplified adaptation of [7]). Once an environment has been constructed, evaluation is performed in the following way.…”

Section: Intelligence Tests Considering Several Agentsmentioning

confidence: 99%

“…Good and Evil cannot share a cell with other agents. This re-introduces some degree of reactivity (with respect to the prototype in [7]), even in the single agent case.…”

Section: Intelligence Tests Considering Several Agentsmentioning

confidence: 99%

See 4 more Smart Citations

On Measuring Social Intelligence: Experiments on Competition and Cooperation

Insa-Cabrera

Benacloch-Ayuso

Hernández-Orallo

2012

Artificial General Intelligence

Self Cite

View full text Add to dashboard Cite

Abstract. Evaluating agent intelligence is a fundamental issue for the understanding, construction and improvement of autonomous agents. New intelligence tests have been recently developed based on an assessment of task complexity using algorithmic information theory. Some early experimental results have shown that these intelligence tests may be able to distinguish between agents of the same kind, but they do not place very different agents, e.g., humans and machines, on a correct scale. It has been suggested that a possible explanation is that these tests do not measure social intelligence. One formal approach to incorporate social environments in an intelligence test is the recent notion of Darwin-Wallace distribution. Inspired by this distribution we present several new test settings considering competition and cooperation, where we evaluate the "social intelligence" of several reinforcement learning algorithms. The results show that evaluating social intelligence raises many issues that need to be addressed in order to devise tests of social intelligence.

show abstract

Section: Universal Tests and Social Intelligencementioning

confidence: 99%

Section: I2mentioning

confidence: 99%

“…Many possible explanations are suggested in [7], with incremental knowledge acquisition and social intelligence being two of the abilities which this test is not giving enough importance.…”

Section: Universal Tests and Social Intelligencementioning

confidence: 99%

Section: Intelligence Tests Considering Several Agentsmentioning

confidence: 99%

“…Good and Evil cannot share a cell with other agents. This re-introduces some degree of reactivity (with respect to the prototype in [7]), even in the single agent case.…”

Section: Intelligence Tests Considering Several Agentsmentioning

confidence: 99%

See 3 more Smart Citations