Probing Large Language Models from a Human Behavioral Perspective

Proceedings of the Workshop: Bridging Neurons and Symbols for Natural Language Processing and Knowledge Graphs Reasoning (NeusymBridge) @ LREC-COLING-2024

DOI:10.63317/2r7fmikuo4ec

Abstract

Large Language Models (LLMs) have emerged as dominant foundational models in modern NLP. However, the understanding of their prediction processes and internal mechanisms, such as feed-forward networks (FFN) and multi-head self-attention (MHSA), remains largely unexplored. In this work, we probe LLMs from a human behavioral perspective, correlating values from LLMs with eye-tracking measures, which are widely recognized as meaningful indicators of human reading patterns. Our findings reveal that LLMs exhibit a similar prediction pattern with humans but distinct from that of Shallow Language Models (SLMs). Moreover, with the escalation of LLM layers from the middle layers, the correlation coefficients also increase in FFN and MHSA, indicating that the logits within FFN increasingly encapsulate word semantics suitable for predicting tokens from the vocabulary.

Resources

Details

Paper ID

lrec2024-ws-neusymbridge-1

Pages

pp. 1-7

DOI

10.63317/2r7fmikuo4ec

BibKey

wang-etal-2024-probing

Editors

Tiansi Dong, Erhard Hinrichs, Zhen Han, Kang Liu, Yangqiu Song, Yixin Cao, Christian F. Hempelmann, Rafet Sifa

Publisher

European Language Resources Association (ELRA) and ICCL

ISSN

N/A

ISBN

N/A

Workshop

Proceedings of the Workshop: Bridging Neurons and Symbols for Natural Language Processing and Knowledge Graphs Reasoning (NeusymBridge) @ LREC-COLING-2024

Location

Turin, Italy

Date

20 - 25 May 2024

Authors

XW
Xintong Wang
XL
Xiaoyu Li
XL
Xingshan Li
CB
Chris Biemann

Links

URL

DOI