arXiv 2309.06687

Self-Refined Large Language Model as Automated Reward Function Designer for Deep Reinforcement Learning in Robotics

By Jiayang Song, Zhehua Zhou, et al.

Published 2023-09-13

Mindmap

Browse the paper's core ideas, clusters, and relationships in a structured outline.

Although Deep Reinforcement Learning (DRL) has achieved notable success in numerous robotic applications, designing a high-performing reward function remains a challenging task that often requires substantial manual input. Recently, Large Language Models (LLMs) have been extensively adopted to address tasks demanding in-depth common-sense knowledge, such as reasoning and planning. Recognizing that reward function de…

View the original paper on arXiv