Richard Littauer

@richlitt@mastodon.social

#OpenSource #OpenScience #OpenAccess, #birds and #birding, #inaturalist #ebird #Latin, #languages, travel and politics.

- Co-Organizer of CURIOSS, SustainOSS, and OpenSustain.tech
- PhD student at Te Herenga Waka Victoria University of Wellington in Pōneke Wellington, Aotearoa NZ
- Professional conlanger
- Linguist, Latinist, and taxonomist
- eBird reviewer

He/him.

Grew up on unceded Abenaki land.

My first solo-authored publication just appeared in Linguistic Typology: "The over-representation of phonological features in basic vocabulary doesn’t replicate when controlling for spatial and phylogenetic effects"

Running a #Bayesian model with #Lexibank data, I show that most previously observed effects that have been claimed to be sound symbolism do not replicate. A handful of effects emerge as highly stable, though, mostly related to body parts and the pronominal system.

#linguistics #replication #typology #science #statistics

> doi.org/10.1515/lingty-2025-00

De Gruyter Brill

The over-representation of phonological features in basic vocabulary doesn’t replicate when controlling for spatial and phylogenetic effects

The statistical over-representation of certain phonological features in the basic vocabulary of languages is often interpreted as reflecting potentially universal sound symbolic patterns. However, most of these cases have not been tested explicitly for reproducibility and might be prone to biases in the study samples or models. Many studies on the topic do not adequately control for genealogical and areal dependencies between sampled languages, casting doubts on the robustness of the results. In this study, I test the robustness of a recent study on sound symbolism in basic vocabulary concepts which analyzed 245 languages. This paper adds a new sample of 2,864 languages from Lexibank. I modify the original model by adding statistical controls for spatial and phylogenetic dependencies between languages. The new results show that most of the previously observed patterns are not robust, and in fact many patterns disappear completely when adding the genealogical and areal controls. A small number of patterns, however, emerges as highly stable even with the new sample. Through the new analysis, it is possible to assess the distribution of sound symbolism on a larger scale than previously. The study further highlights the need for testing all universal claims on language for robustness on various levels.
