Back to ECNLP 2024
LREC-COLING 2024workshop

Explicit Attribute Extraction in e-Commerce Search

Proceedings of the Seventh Workshop on e-Commerce and NLP @ LREC-COLING 2024

DOI:10.63317/2eu2xkhydhif

Abstract

This paper presents a model architecture and training pipeline for attribute value extraction from search queries. The model uses weak labels generated from customer interactions to train a transformer-based NER model. A two-stage normalization process is then applied to deal with the problem of a large label space: first, the model output is normalized onto common generic attribute values, then it is mapped onto a larger range of actual product attribute values. This approach lets us successfully apply a transformer-based NER model to the extraction of a broad range of attribute values in a real-time production environment for e-commerce applications, contrary to previous research. In an online test, we demonstrate business value by integrating the model into a system for semantic product retrieval and ranking.

Details

Paper ID
lrec2024-ws-ecnlp-13
Pages
pp. 125-135
BibKey
loughnane-etal-2024-explicit
Editor
N/A
Publisher
European Language Resources Association (ELRA) and ICCL
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the Seventh Workshop on e-Commerce and NLP @ LREC-COLING 2024
Location
undefined, undefined
Date
20 May 2024 25 May 2024

Authors

  • RL

    Robyn Loughnane

  • JL

    Jiaxin Liu

  • ZC

    Zhilin Chen

  • ZW

    Zhiqi Wang

  • JG

    Joseph Giroux

  • TD

    Tianchuan Du

  • BS

    Benjamin Schroeder

  • WS

    Weiyi Sun

Links