Publication:

Model of the Dynamic Construction Process of Texts and Scaling Laws of Words Organization in Language Systems

Date

2016

Publisher

Public Library of Science

Citation

Li, Shan, Ruokuang Lin, Chunhua Bian, Qianli D. Y. Ma, and Plamen Ch. Ivanov. 2016. “Model of the Dynamic Construction Process of Texts and Scaling Laws of Words Organization in Language Systems.” PLoS ONE 11 (12): e0168971. doi:10.1371/journal.pone.0168971. http://dx.doi.org/10.1371/journal.pone.0168971.

Abstract

Scaling laws characterize diverse complex systems in a broad range of fields, including physics, biology, finance, and social science. The human language is another example of a complex system of words organization. Studies on written texts have shown that scaling laws characterize the occurrence frequency of words, words rank, and the growth of distinct words with increasing text length. However, these studies have mainly concentrated on the western linguistic systems, and the laws that govern the lexical organization, structure and dynamics of the Chinese language remain not well understood. Here we study a database of Chinese and English language books. We report that three distinct scaling laws characterize words organization in the Chinese language. We find that these scaling laws have different exponents and crossover behaviors compared to English texts, indicating different words organization and dynamics of words in the process of text growth. We propose a stochastic feedback model of words organization and text growth, which successfully accounts for the empirically observed scaling laws with their corresponding scaling exponents and characteristic crossover regimes. Further, by varying key model parameters, we reproduce differences in the organization and scaling laws of words between the Chinese and English language. We also identify functional relationships between model parameters and the empirically observed scaling exponents, thus providing new insights into the words organization and growth dynamics in the Chinese and English language.
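As a rough illustration of the quantities the abstract refers to, the sketch below measures two of them on a toy token list: the frequency of words against their rank (commonly known as Zipf's law) and the growth of distinct words with text length (commonly known as Heaps' law). It is not the authors' stochastic feedback model or their analysis pipeline. The function names, the whitespace tokenization, and the plain least-squares fit on log-log axes are all assumptions made for this example.

```python
import math
from collections import Counter


def rank_frequency(tokens):
    """Return word counts sorted from most to least frequent (rank 1 first)."""
    return sorted(Counter(tokens).values(), reverse=True)


def vocabulary_growth(tokens):
    """Return the number of distinct words seen after each token."""
    seen = set()
    growth = []
    for tok in tokens:
        seen.add(tok)
        growth.append(len(seen))
    return growth


def loglog_slope(xs, ys):
    """Ordinary least-squares slope of log(y) against log(x)."""
    lx = [math.log(x) for x in xs]
    ly = [math.log(y) for y in ys]
    n = len(lx)
    mx = sum(lx) / n
    my = sum(ly) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(lx, ly))
    var = sum((a - mx) ** 2 for a in lx)
    return cov / var


# Toy input (hypothetical); a real analysis would tokenize whole books.
tokens = "the cat saw the dog and the dog saw the cat run".split()

freqs = rank_frequency(tokens)
growth = vocabulary_growth(tokens)

ranks = range(1, len(freqs) + 1)
lengths = range(1, len(growth) + 1)

# Frequency falls with rank, so the sign is flipped to report a positive exponent.
zipf_exponent = -loglog_slope(ranks, freqs)
heaps_exponent = loglog_slope(lengths, growth)
```

On a text this short the fitted exponents carry no meaning; the point is only the shape of the computation. Detecting the crossover regimes mentioned in the abstract would require fitting separate ranges of the log-log curves, which this sketch does not attempt.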

Keywords

Biology and Life Sciences, Neuroscience, Cognitive Science, Cognitive Psychology, Language, Psychology, Social Sciences, Linguistics, Computational Linguistics, Physical Sciences, Mathematics, Probability Theory, Probability Distribution, Phonology, Syntax, Simulation and Modeling, Languages, Natural Language, Computer and Information Sciences, Systems Science, Complex Systems, Behavior

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service
