Handling Non-Standard Datasets in NoRaRe: A Practical Guide

Authors

  • Mira Ahmedović Department of Linguistic and Cultural Evolution, Max Planck Institute for Evolutionary Anthropology, Leipzig

DOI:

https://doi.org/10.15475/calcip.2025.1.3

Abstract

NoRaRe, the Database of Cross-Linguistic Norms, Ratings, and Relations, is a resource that curates multiple datasets containing information on various properties of words and concepts. When researchers contribute their data, the format and structure can vary widely, presenting challenges for seamless integration. Here, I offer practical guidance for addressing common issues such as data being placed in different sheets, headers in unexpected rows, or datasets contained within zip-files. The strategies shared here offer a foundational approach to understanding and adapting NoRaRe’s flexibility to accommodate the idiosyncrasy of each dataset.

Downloads

Published

2025-03-12

How to Cite

Ahmedović, M. (2025). Handling Non-Standard Datasets in NoRaRe: A Practical Guide. Computer-Assisted Language Comparison in Practice, 8(1). https://doi.org/10.15475/calcip.2025.1.3