HomeLREC 2026WorkshopsGAZE4NLPlrec2026-ws-gaze4nlp-09
Back to GAZE4NLP 2026
LREC 2026workshop

Predicting Gaze Location without Camera or Eye-Tracker

Proceedings fo the Second International Workshop on Eye-Tracking Resources and Evaluation for Human-Aligned NLP

DOI:10.63317/2hpa22f63k2t

Abstract

The task of identifying the location that a user looks at, commonly known as gaze estimation, has various HCI and NLP applications. Traditional gaze estimation methods use special hardware such as eye-trackers or ordinary cameras such as webcams to perform this. However, they are not applicable to the majority of web users either because the user does not have them or does not want to use them due to privacy reasons. In this paper, we propose the idea of using multimodal LLMs to analyze the content of the user’s screen along with mouse location to estimate the gaze location. It primarily uses the results of studies that extract common reading patterns such as the F-pattern and Z-pattern. Our experimental results on The Eye Of The Typer (EOTT) dataset provide promising results for estimating gaze location.

Details

Paper ID
lrec2026-ws-gaze4nlp-09
Pages
pp. 58-63
BibKey
rezapoor-etal-2026-predicting
Editors
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings fo the Second International Workshop on Eye-Tracking Resources and Evaluation for Human-Aligned NLP
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • SR

    Saman Rezapoor

  • SS

    Sajad Shirali-Shahreza

  • GP

    Gerald Penn

Links