Back to Main Conference 2024
LREC-COLING 2024main

MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

DOI:10.63317/2uxyyxbkftyz

Abstract

Drawing upon the intuition that aligning different modalities to the same semantic embedding space would allow models to understand states and actions more easily, we propose a new perspective to the offline reinforcement learning (RL) challenge. More concretely, we transform it into a supervised learning task by integrating multimodal and pre-trained language models. Our approach incorporates state information derived from images and action-related data obtained from text, thereby bolstering RL training performance and promoting long-term strategic thinking. We emphasize the contextual understanding of language and demonstrate how decision-making in RL can benefit from aligning states’ and actions’ representation with languages’ representation. Our method significantly outperforms current baselines as evidenced by evaluations conducted on Atari and OpenAI Gym environments. This contributes to advancing offline RL performance and efficiency while providing a novel perspective on offline RL.

Details

Paper ID
lrec2024-main-1013
Pages
pp. 11593-11604
BibKey
zheng-etal-2024-3s
Editor
N/A
Publisher
European Language Resources Association (ELRA) and ICCL
ISSN
2522-2686
ISBN
979-10-95546-34-4
Conference
Joint International Conference on Computational Linguistics, Language Resources and Evaluation
Location
Turin, Italy
Date
20 May 2024 25 May 2024

Authors

  • TZ

    Tianyu Zheng

  • GZ

    Ge Zhang

  • XQ

    Xingwei Qu

  • MK

    Ming Kuang

  • WH

    Wenhao Huang

  • ZH

    Zhaofeng He

Links