← Back to Grants

Learning to Reason in Reinforcement Learning. Deep Reinforcement Learning (RL) uses deep neural networks to represent and learn optimal decision-making policies for intelligent agents in complex envir

Monash University — Discovery Projects
Amount
Up to $629,638
Closes
Tuesday 30 November 2027
Status
unknown
Type
open opportunity
Apply Now →

Description

Learning to Reason in Reinforcement Learning. Deep Reinforcement Learning (RL) uses deep neural networks to represent and learn optimal decision-making policies for intelligent agents in complex environments. However, most RL approaches require millions of episodes to converge to good policies, making it difficult for RL to be applied in real-world scenarios taking significant resources. This project aims to equip RL with capabilities such as counterfactual reasoning and outcome anticipation to significantly reduce the number of interactions required, improve generalisation, and provide the agent with the capability to consider the cause-effects. These improvements would narrow the gap between AI and human capabilities and broaden the adoption of RL in real-world applications.. Scheme: Discovery Projects. Field: 4611 - Machine Learning. Lead: A/Prof Ehsan Abbasnejad

Categories
regenerativeeducation
Target Recipients
researchersuniversities

Foundations Supporting This Area

Discovery method: arc-grants
Last verified: Monday 2 March 2026
Added: Saturday 28 February 2026