PyLexibench — Generating Data for Lexibench with a Python Package

Authors

  • Luise Häuser Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies
  • Robert Forkel Max Planck Institute for Evolutionary Anthropology Leipzig
  • Johann-Mattis List University of Passau image/svg+xml

DOI:

https://doi.org/10.15475/calcip.2025.1.4

Keywords:

Python package, computational historical linguistics, benchmark database, character matrices, phylogenetic reconstruction

Abstract

With PyLexibench we introduce a small Python package that can be used to populate the Lexibench benchmark for computational historical linguistics with benchmark data. Here, we introduce the package and show how it helps to access and expand Lexibench. We also introduce new data for character matrices in various forms and formats and lay out how we intend to use the package to manage Lexibench releases in the future.

Downloads

Published

2025-04-22

How to Cite

PyLexibench — Generating Data for Lexibench with a Python Package. (2025). Computer-Assisted Language Comparison in Practice: Tutorials on Computational Approaches to the History and Diversity of Languages, 8(1), 25-37. https://doi.org/10.15475/calcip.2025.1.4