A Knowledge-Based Language Model: Deducing Grammatical Knowledge in a Multi-Agent Language Acquisition Simulation

David Ph. Shakouri, Crit Cremers, Niels O. Schiller

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

This paper presents an initial study performed by the MODOMA system. The MODOMA is a computational multi-agent laboratory environment for unsupervised language acquisition experiments such that acquisition is based on the interaction between two language models, an adult and a child agent. Although this framework employs statistical as well as rule-based procedures, the result of language acquisition is a knowledge-based language model, which can be used to generate and parse new utterances of the target language. This system is fully parametrized and researchers can control all aspects of the experiments while the results of language acquisition, that is, the acquired grammatical knowledge, are explicitly represented and can be consulted. Thus, this system introduces novel possibilities for conducting computational language acquisition experiments. The experiments presented by this paper demonstrate that functional and content categories can be acquired and represented by the daughter agent based on training and test data containing different amounts of exemplars generated by the adult agent. Interestingly, similar patterns, which are well-established for human-generated data, are also found for these machine-generated data. As the procedures resulted in the successful acquisition of discrete grammatical categories by the child agent, these experiments substantiate the validity of the MODOMA approach to modelling language acquisition. © 2025 David Ph. Shakouri, Crit Cremers, and Niels O. Schiller.
Original languageEnglish
Pages (from-to)167-189
Number of pages23
JournalComputational Linguistics in the Netherlands Journal
Volume14
Online published15 Jul 2025
Publication statusPublished - 2025
Event34th Meeting of Computational Linguistics in The Netherlands (CLIN 34) - Leiden University, Leiden, Netherlands
Duration: 30 Aug 202430 Aug 2024

Bibliographical note

Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).

Funding

The authors would like to thank Maarten Hijzelendoorn for his assistance. This paper presents results of the MODOMA project, which was funded by the Dutch Research Council (NWO).

Fingerprint

Dive into the research topics of 'A Knowledge-Based Language Model: Deducing Grammatical Knowledge in a Multi-Agent Language Acquisition Simulation'. Together they form a unique fingerprint.

Cite this