Back to Main Conference 2022
LREC 2022main

The VoxWorld Platform for Multimodal Embodied Agents

Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)

DOI:10.63317/3pm6p4pwhj6i

Abstract

We present a five-year retrospective on the development of the VoxWorld platform, first introduced as a multimodal platform for modeling motion language, that has evolved into a platform for rapidly building and deploying embodied agents with contextual and situational awareness, capable of interacting with humans in multiple modalities, and exploring their environments. In particular, we discuss the evolution from the theoretical underpinnings of the VoxML modeling language to a platform that accommodates both neural and symbolic inputs to build agents capable of multimodal interaction and hybrid reasoning. We focus on three distinct agent implementations and the functionality needed to accommodate all of them: Diana, a virtual collaborative agent; Kirby, a mobile robot; and BabyBAW, an agent who self-guides its own exploration of the world.

Details

Paper ID
lrec2022-main-164
Pages
pp. 1529-1541
BibKey
krishnaswamy-etal-2022-voxworld
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-38-2
Conference
Thirteenth Language Resources and Evaluation Conference
Location
Marseille, France
Date
20 June 2022 25 June 2022

Authors

  • NK

    Nikhil Krishnaswamy

  • WP

    William Pickard

  • BC

    Brittany Cates

  • NB

    Nathaniel Blanchard

  • JP

    James Pustejovsky

Links