Learning to Reason in Reinforcement Learning. Deep Reinforcement Learning (RL) uses deep neural networks to represent and learn optimal decision-making policies for intelligent agents in complex envir

Monash University — Discovery Projects

Amount

Up to $629,638

Closes

Tuesday 30 November 2027

Status

unknown

Type

open opportunity

Apply Now →

Description

Learning to Reason in Reinforcement Learning. Deep Reinforcement Learning (RL) uses deep neural networks to represent and learn optimal decision-making policies for intelligent agents in complex environments. However, most RL approaches require millions of episodes to converge to good policies, making it difficult for RL to be applied in real-world scenarios taking significant resources. This project aims to equip RL with capabilities such as counterfactual reasoning and outcome anticipation to significantly reduce the number of interactions required, improve generalisation, and provide the agent with the capability to consider the cause-effects. These improvements would narrow the gap between AI and human capabilities and broaden the adoption of RL in real-world applications.. Scheme: Discovery Projects. Field: 4611 - Machine Learning. Lead: A/Prof Ehsan Abbasnejad

Foundations Supporting This Area

World Vision Australia

educationhealthcommunity

$514.1M/yr

The University of Sydney

educationresearchcommunity

$340.7M/yr

Catholic Education Centre

educationindigenous

$281.5M/yr

Monash University

educationresearchhealth

$273.7M/yr

Geoffrey Cumming Foundation

healtheducationcommunity

$250.0M/yr

Discovery method: arc-grants

Last verified: Monday 2 March 2026

Added: Saturday 28 February 2026