The Lexometer: A Shiny Application for Exploratory Analysis and Visualization of Corpus Data

Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)

DOI:10.63317/4zganysxf4n9

Abstract

Performing even simple data science tasks with corpus data often requires significant expertise in data science and in programming languages such as R and Python. With the aim of making quantitative research more accessible to researchers in the language sciences, we present the Lexometer, a Shiny application that integrates numerous data analysis and visualization functions into an easy-to-use graphical user interface. The Lexometer's functions include filtering large databases to generate subsets of the data and variables of interest, offering a range of graphing techniques for both single- and multiple-variable analysis, and presenting the data in a table that can be filtered further and that provides methods for cleaning the data. The Lexometer aims to be useful to language researchers with differing levels of programming expertise and to help broaden the inclusion of corpus-based empirical evidence in the language sciences.

Details

Paper ID
lrec2022-main-684
Pages
pp. 6370-6376
BibKey
hai-etal-2022-lexometer
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-38-2
Conference
Thirteenth Language Resources and Evaluation Conference
Location
Marseille, France
Date
20-25 June 2022

Authors

  • Oufan Hai
  • Matthew Sundberg
  • Katherine Trice
  • Rebecca Friedman
  • Scott Grimm
