(PDF) Evolutionary paths of language
About
Press
Papers
We're Hiring!
Outline
Title
Abstract
Key Takeaways
Figures
References
FAQs
Evolutionary paths of language
Bert Vaux
2020, EPL (Europhysics Letters)
October 11, 2025
visibility
description
7 pages
Sign up for access to the world's latest research
check
Get notified about relevant papers
check
Save papers to use in your research
check
Join the discussion with peers
check
Track your impact
Abstract
We introduce a stochastic model of language change in a population of speakers who are divided into social or geographical groups. We assume that sequences of language changes are driven by the inference of grammatical rules from memorised linguistic patterns. These paths of inference are controlled by an inferability matrix which can be structured to model a wide range of linguistic change processes. The extent to which speakers are able to determine the dominant linguistic patterns in their speech community is captured by a temperature-like parameter. This can induce symmetry breaking phase transitions, where communities select one of two or more possible branches in the evolutionary tree of language. We use the model to investigate a grammatical change (the rise of the phrasal possessive) which took place in English and Continental North Germanic languages during the Middle Ages. Competing hypotheses regarding the sequences of precursor changes which allowed this to occur each generate a different structure of inference matrix. We show that the inference matrix of a "Norway Hypothesis" is consistent with Norwegian historical data, and because of the close relationships between these languages, we suggest that this hypothesis might explain similar changes in all of them.
Key takeaways
AI
The model explores stochastic language evolution influenced by social and geographical group structures.
Inferability matrices capture the dynamics of language change, notably during the rise of phrasal possessives.
The 'Norway Hypothesis' aligns with historical data, suggesting commonality in language evolution across similar languages.
Conformity and inference interact in shaping linguistic transitions, potentially trapping changes until random fluctuations occur.
Symmetry breaking may result in communities selecting distinct grammatical paths based on conformity levels.
Figures (7)
Fig. 1: A linguistic subspace in which stick people are evolving toward one of two grammars. Arrows from state 7 to 7 indicate that Siz = €, Sii = déwithe > dand Sj; = 1 es Sij- Dashed lines indicate transitions not based on inference.
Fig. 2: Symmetry breaking at a grammar branch. Parameter values T = 5,€ = 0.1,6 = 0.01, G = 1.05. Inset plot has 6 = 1.0. speakers cannot independently adopt grammars which are not inferable from their native (parental) grammar, they can do so if sufficiently large numbers of the speech com- munity are using the target grammar. In fig. 1, a tran- sition of this kind is illustrated between 4 and 6. Fig. 2 shows the evolution of a population beginning with gram- mar 0. When conformity is sufficiently strong the popula- tion arbitrarily select one final dominant grammar. Fig. 2 (inset) shows the evolution of a similar community with lower conformity. Here the symmetry between grammars 4 and 6 is unbroken. At much longer time scales, fluctua- tions may lead to one state dominating: a different form of symmetry breaking. We can understand the interaction between conformity
Fig. 3: Equilibrium of two state model (10) where «; = 0.05,€2 = 0.06 and 6 = 1.3. Each dot is a group and Ki; x exp(—rj;/o7) where ri; is the distance between groups i and j, and o = 5. Blue groups are dominated by state Y and red by X. The spatial distribution of groups takes the form of two gaussian blobs centred at (50,50) and (150, 50).
Fig. 4: (a) Transitions in a linguistic subspace of two features; each can exist in C' = 2 contexts. (b) Inference matrices for the three hypotheses. Red: Norwegian, Green: English, Blue: Swedish. An arrow from states i to j implies that Si; > Si; > 0. No arrow implies S;; = 0.
Fig. 5: The evolution of two equivalent features, each of which can occur in C = 5 contexts. Speakers start in state (0,0). €1 = 0.035,€ = 0.075,6 = 0.02,7 = 5 and 6 = 1.04 where 6 is the inferability of reverse transitions. Inset shows detail of spontaneous transition in feature X.
Fig. 6: Solid lines show features evolving according to the Nor- way hypothesis with 7 = 5. Parameter values found by least squares optimization @ = 1.1,So0g = 0.19,Sg9 = 0.19, S913 = 0.40, Soir = 0.25, Sitis = 0.06,Si315 = 0.11. The inference values for all reversals (taken all equal) were constrained to match the incomplete transition in concord and genitive, yield- ing an optimized value 0.03. Crosses show historical data col- lected for the same features. Features 2 and 3 have smaller data sets (table 1), and greater ambiguity in identification than 1. Time origin adjusted to match start of historical changes.
Fig. 7: Evolution of features amongst a single group (NV = 100) evolving according to the English hypothesis. In these examples tT = 20 epochs, 8 = 1.06 and all green (English) arrows in fig. 4 correspond to inferability « = 0.0175.
Related papers
The major transitions in the evolution of language
Willem Zuidema
The origins of human language, with its extraordinarily complex structure and multitude of functions, remains among the most challenging problems for evolutionary biology and the cognitive sciences. Although many will agree progress on this issue would have important consequences for linguistic theory, many remain sceptical about whether the topic is amenable to rigorous, scientific research at all. Complementing recent developments toward better empirical validation, this thesis explores how formal models from both linguistics and evolutionary biology can help to constrain the many theories and scenarios in this field. I first review a number of foundational mathematical models from three branches of evolutionary biology -- population genetics, evolutionary game theory and social evolution theory -- and discuss the relation between them. This discussion yields a list of ten requirements on evolutionary scenarios for language, and highlights the assumptions implicit in the various formalisms. I then look in more details at one specific step-by-step scenario, proposed by Ray Jackendoff, and consider the linguistic formalisms that could be used to characterise the evolutionary transitions from one stage to the next. I conclude from this review that the main challenges in evolutionary linguistics are to explain how three major linguistic innovations -- combinatorial phonology, compositional semantics and hierarchical phrase-structure -- could have spread through a population where they are initially rare. In the second part of the thesis, I critically evaluate some existing formal models of each of these major transitions and present three novel alternatives. In an abstract model of the evolution of speech sounds (viewed as trajectories through an acoustic space), I show that combinatorial phonology is a solution for robustness against noise and the only evolutionary stable strategy (ESS). In a model of the evolution of simple lexicons in a noisy environment, I show that the optimal lexicon uses a structured mapping from meanings to sounds, providing a rudimentary compositional semantics. Lexicons with this property are also ESS's. Finally, in a model of the evolution and acquisition of context-free grammars, I evaluate the conditions under which hierarchical phrase-structure will be favoured by natural selection, or will be the outcome of a process of cultural evolution. In the last chapter of the thesis, I discuss the implications of these models for the debates in linguistics on innateness and learnability, and on the nature of language universals. A mainly negative point to make is that formal learnability results cannot be used as evidence for an innate, language-specific specialisation for language. A positive point is that with the evolutionary models of language, we can begin to understand how universal properties and tendencies in natural languages can result from the intricate interaction between innate learning biases and a process of cultural evolution over many generations.
Download free PDF
View PDF
chevron_right
Dynamic Models of Language Evolution: the Linguistic Perspective
Andrew Smith
This chapter gives an overview of language variation and how the dynamics of language are explored through formal models. It briefly outlines the dimensions over which language structures can vary, then looks at some of the very different ways in which language change has been investigated (sociolinguistics, historical linguistics, evolutionary linguistics). It describes how dynamic models of change have been successfully used in all of these fields, and how they have shed light on many aspects of language dynamics, from the properties of language change through phylogenetic analyses of language history to computational and experimental models of cultural evolution.
Download free PDF
View PDF
chevron_right
A Grammatical View of Language Evolution
M. Dolores Jiménez-López
akira.ruc.dk
Abstract. Language evolves gradually through its use: over time, new forms come into fashion and others become obsolete. While traditionally a grammar provides a snapshot of an individual's or a society's linguistic competence at a given point in time, our aim is to extend ...
Download free PDF
View PDF
chevron_right
Linking the processes of language evolution and language change : a five-level hierarchy
Jérôme Michaud
2016
The class of problems that can be categorized under language evolution and change is very heterogeneous and involves many different timescales and spatial/social scales. In order to better understand the underlying evolutionary processes and the link between language evolution and language change, we introduce a five-system hierarchy based on social structures of the population and apply it to the study of the transition from protolanguage to language. This problem is composed of a biological subproblem and a cultural subproblem. Using our hierarchy, we argue that Bickerton’s “living fossils” of language can give some insight only on the cultural subproblem, but not on the biological one or on the coevolution of the two.
Download free PDF
View PDF
chevron_right
Modelling language evolution: Examples and predictions
Lan SHUAI
Physics of Life Reviews, 2014
General rights Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights.-Users may download and print one copy of any publication from the public portal for the purpose of private study or research-You may not further distribute the material or use it for any profit-making activity or commercial gain-You may freely distribute the URL identifying the publication in the public portal Take down policy If you believe that this document breaches copyright, please contact us providing details, and we will remove access to the work immediately and investigate your claim.
Download free PDF
View PDF
chevron_right
STOCHASTIC MODELS OF LANGUAGE EVOLUTION AND AN APPLICATION TO THE INDO-EUROPEAN FAMILY OF LANGUAGES
Don Ringe
We propose several models of how languages evolve, and discuss statistical estimation of evolution under these models. We also discuss issues of identifiability and statistical consistency under these models.
Download free PDF
View PDF
chevron_right
A Look at Linguistic Evolution
Anastasia Thanukos
Evolution: Education and Outreach, 2008
Download free PDF
View PDF
chevron_right
Major transitions in language evolution
Martin Nowak
2001
Language is the most important evolutionary invention of the last few million years. How human language evolved from animal communication is a challenging question for evolutionary biology. In this paper we use mathematical models to analyze the major transitions in language evolution. We begin by discussing the evolution of coordinated associations between signals and objects in a population. We then analyze word-formation and its relationship to Shannon's noisy coding theorem. Finally, we model the population dynamics of words and the adaptive emergence of syntax.
Download free PDF
View PDF
chevron_right
Language Evolution: Computer models for Empirical Data
Paul Vogt
Mathematical Programming, 2010
This methodological article serves as an introduction to this special issue, which is meant to encourage more and better interaction between empirical studies and computer modeling with regard to the study of language evolution. We argue that research into the field of language evolution is so complex that computer modeling forms an essential tool to generate predictions based on some,
Download free PDF
View PDF
chevron_right
The Evolutionary Dynamics of Natural Language Dimacs Workshop on Computation and Language , 1998
Robert Berwick
2014
Essential to Darwinian evolution are the concepts of popultional and variational thinking [15, 13]. The variant properties among individuals in a population allow the forces of evolution such as natural selection to operate. Evolutionary changes are therefore changes in the distribution of variant individuals in the population. This paper reports two projects that draw insights from these perspectives to investigate the evolutionary dynamics of natural language. The first project [19] studies the problem of child language acquisition. We propose that the state of the learner can be viewed a population of variant hypotheses/grammars with a probabilistic distribution, and that language acquisition is a selectional process where the fitness of a hypothesis is be defined as its compatibility with the linguistic data. We prove some convergence results and show that the model predicts certain psychological findings that are otherwise hard to obtain.
Download free PDF
View PDF
chevron_right
epl draft

Evolutionary paths of language

J. Burridge1 , T. Blaxter2 and B. Vaux2
School of Mathematics and Physics - University of Portsmouth
Faculty of Modern and Medieval Languages and Linguistics - University of Cambridge

PACS 87.23.Ge – Dynamics of social systems
PACS 89.75.-k – Complex Systems
PACS 89.75.Hc – Networks and genealogical trees

Abstract – We introduce a stochastic model of language change in a population of speakers who
are divided into social or geographical groups. We assume that sequences of language changes are
driven by the inference of grammatical rules from memorised linguistic patterns. These paths of
inference are controlled by an inferability matrix which can be structured to model a wide range
of linguistic change processes. The extent to which speakers are able to determine the domi-
nant linguistic patterns in their speech community is captured by a temperature-like parameter.
This can induce symmetry breaking phase transitions, where communities select one of two or
more possible branches in the evolutionary tree of language. We use the model to investigate a
grammatical change (the rise of the phrasal possessive) which took place in English and Conti-
nental North Germanic languages during the Middle Ages. Competing hypotheses regarding the
sequences of precursor changes which allowed this to occur each generate a different structure
of inference matrix. We show that the inference matrix of a “Norway Hypothesis” is consistent
with Norwegian historical data, and because of the close relationships between these languages,
we suggest that this hypothesis might explain similar changes in all of them.

Introduction. – The last several decades have seen lution [11, 12]. Recognising language as a phenomenon
an explosion in cross-disciplinary applications of statisti- which emerges from interactions between large numbers
cal physics. This work encompasses a wide range of topics, of individuals, statistical physicists have constructed de-
from the motion of crowds [1] and the flocking of birds [2] liberately minimalist models to understand properties of
to the distribution of wealth [3], the behaviour of social its evolution [4–7, 13–16]. Simplicity is also recognised
systems [4] and, the focus of this letter, the evolution of as a virtue in the field of statistical (machine) learning,
language [5–9]. Linguistics (the study of human language) where sparsity can increase predictive power [17]. This
seeks to rigorously describe how the building blocks of lan- was also the goal of the locus classicus for phonological
guage may be combined to convey meanings [10]. Each theory: Chomsky and Halle (1968) [18]. Our aim is to
spoken language has a set of sounds or phones which are develop a simple model, which is flexible enough to cap-
typically generated by shaping airflow through the vocal ture a wide range of realistic linguistic processes, and to
tract in various ways. Phones are derived from the un- be calibrated to linguistic data sets which may be time
derlying inventory of contrastive elements in a language, dependent, geographically distributed and/or labelled by
called phonemes, by a set of phonological rules. Above social, gender or other factors.
the level of individual sounds, languages have morphologi-
Model definition. – We assume it is possible to write
cal rules for building words from morphemes (the smallest
down a detailed inventory of phones, phonological, mor-
units of meaning), and syntactic rules for combining words
phological, and syntactic rules, and lexical entries (with
together to form sentences. The world’s languages are de-
meanings) which fully define each speaker’s own version
fined by a vast array of sounds, generalizations (i.e. rules,
of their language. We refer to this as the full language
constraints, and/or associations) and lexicons, which are
state of a speaker. This inventory may be viewed as a
all constantly evolving, and it is well known to linguists
high dimensional feature vector [17]. Every speaker will
that the structures of social groups, identities, hierarchies
occupy a different location in feature space, but groups
and geography play powerful roles in controlling this evo-
of speakers may share many features in common, form-

p-1

J. Burridge et al.

ing clusters which represent distinct languages or dialects. Letting X(n, p) be a generic multinomial random vector
The process of language evolution is simply the progressive with n trials and probability vector p we have
alteration of the components of individual feature vectors. d
We divide time into epochs indexed by t ∈ {0, 1, . . .}. Ni (t) = X(ni , pi (t)) (5)
We consider a population of speakers, and focus on a sub- d
space of their language which has p different linguistic where = denotes equality in distribution. This completes
states or variants (possible feature vectors). At each time the specification of our dynamics.
t, each speaker will be in one of these p states. Language In this letter we focus on linguistic change which is
evolution is intrinsically coupled to the social and geo- driven by the inference of grammatical rules from cur-
graphical structure of society, and we may account for rent linguistic patterns observable in the PLD. In this case
this by dividing our population into n groups by some cri- speaker memories for grammar k may be viewed as the
terion or set of criteria (sex, location, social class [11,12]). PLD or patterns in the PLD generated by those rules,
Let Nik (t) be the number of speakers in group i who are rather than the rules themselves. Different rule sets may
in linguistic state k at epoch t. We define the relative generate similar patterns, so speakers may infer one gram-
frequency mar from the surface forms of another [21]. Typically, in-
Nik (t) ference is carried out by younger speakers as they learn
fik (t) = (1) their language [22]. The entries of S have the following
Ni
interpretation: subconsciously, a speaker selects the pat-
where Ni is the size of the ith group. Language evolu-
tern she is going to emulate, and then selects (infers) the
tion can be driven by children inferring altered versions
rules she believes are behind it. We have, for a speaker in
of the linguistic rules which generated the speech they
group i
have heard, and by adults switching their linguistic be-
haviour over time [19]. We allow for the possibility that P(attempt to emulate surface forms of k) = [mi ]k (6)
speakers preferentially select the most common patterns; P(infer grammar j in attempt to emulate k) = Skj (7)
an optimal strategy if matching language patterns with
interlocutors confers some advantage to a speaker. Such where [mi ]k denotes the kth component of mi . Gram-
non-linearity in decision making has been observed in hu- mar selection may therefore be viewed as a two step pro-
man social learning [20] and may be modelled by defining cess in our model. The first step – pattern selection – is
an adjusted frequency [13] driven by conformity, and the second – rule selection –
by inference from linguistic patterns. Expression (4) fol-
fik
vik = P β (2) lows by considering all possible ways in which grammar
j ij k could be inferred by this two step process using proba-
bilities (6) and (7). For other forms of linguistic change
which emphasizes more popular states when β > 1. When
like lexical change or vowel shifts its interpretation will
β < 1 higher frequencies are reduced, and lower ones em-
alter. In these cases S may be viewed as a linear re-
phasised. We call β the conformity number. We write
T weighting of frequency/conformity based selection proba-
vi = (vi1 , vi2 , . . . vip ) for the set of adjusted frequencies
bilities which captures bias effects originating from struc-
in group i. In principle each group can exhibit different
tural principles like maximal dispersion [23], automation
levels of conformity to each of the groups it is exposed to,
of production [24], ratchet effects [25] (in which changes
but for simplicity we set all conformity numbers equal.
acquire social momentum), phonetically systematic trans-
We assume that the speakers in each group are all ex-
mission errors (channel bias) and cognitive predispositions
posed to approximately the same primary linguistic data
(analytic bias) [26].
(PLD), and that this experience is captured by an n × n
We are motivated by the desire to model, in a simple
stochastic (unit row sums) matrix K where Kij is the frac-
and computationally efficient way, the evolutionary routes
tional observational weight placed on group j by speakers
of language change amongst populations that tend to con-
in group i. We call K the importance matrix. It encodes
form (find consensus) and are socially and spatially la-
all information that we wish to capture about social or
belled. Our computational implementation requires five
geographical groupings and their connectivity. We endow
lines of vectorized Python [27]. The modelling of language
each group with a time decaying memory for the confor-
change is by now well established and our model has its
mity adjusted states of other groups
X roots in existing approaches. A matrix with a similar in-
mi (t) = e−1/τ mi (t − 1) + (1 − e−1/τ ) Kij vj (t). (3) terpretation to our S appears in work on the evolution of
j Universal Grammar [16]. Time decaying speaker memo-
ries similar to (3) were studied in the utterance selection
Memories are related to current behaviour via a p × p
model [7], which also allowed linear re-weighting of selec-
stochastic inference matrix S: at each epoch we take the
tion probabilities, and our model of consensus may be seen
distribution of speakers using each variant to be multino-
as a modified majority rule [28]. The emergence of con-
mial with probability vector
sensus in language, including in spatial settings, is also
pi (t + 1) = mi (t)S. (4) explored in the naming game [15, 29].

p-2

Evolution of grammar

Symmetry breaking transition
0 1 2 impossible by inference alone
(conformity driven)

Fig. 1: A linguistic subspace in which stick people are evolving
toward one of two grammars. Arrows from state P i to j indicate
that Sij = ǫ, Sji = δ with ǫ > δ and Sii = 1− j6=i Sij . Dashed
lines indicate transitions not based on inference.

Fig. 2: Symmetry breaking at a grammar branch. Parameter
When S is the identity matrix then the probability that values τ = 5, ǫ = 0.1, δ = 0.01, β = 1.05. Inset plot has β = 1.0.
grammar k will be inferred by a speaker in group i is equal
to her memory [mi ]k for the frequency of that grammar.
In this case groups of speakers can become fixed in one speakers cannot independently adopt grammars which are
grammar state when all others have been forgotten. Fix- not inferable from their native (parental) grammar, they
ation occurs because it is impossible to recreate any new can do so if sufficiently large numbers of the speech com-
grammars by mistaken inference. In the case β = 1 this munity are using the target grammar. In fig. 1, a tran-
process generates neutral evolution [30, 31], where vari- sition of this kind is illustrated between 4 and 6. Fig. 2
ants in future generations appear with a probability that shows the evolution of a population beginning with gram-
is proportional to their frequency in previous generations. mar 0. When conformity is sufficiently strong the popula-
Under some circumstances it is useful to consider the tion arbitrarily select one final dominant grammar. Fig. 2
large population (deterministic) limit of the model, which (inset) shows the evolution of a similar community with
for large τ may be approximated by the following set of lower conformity. Here the symmetry between grammars
coupled differential equations 4 and 6 is unbroken. At much longer time scales, fluctua-
  tions may lead to one state dominating: a different form
1 X of symmetry breaking.
ṁi = Kij vj − mi  (8) We can understand the interaction between conformity
τ j and inference transitions by considering a system with two
[mi S]βk grammar states X and Y . Such a system will have an
[vi ]k = P β
. (9) inferability matrix with general form
j [mi S]j
X Y
From this we see that in the deterministic setting, mem- X
1 − ǫ1 ǫ1
ory length τ linearly scales the rate of change of linguistic S= . (10)
Y ǫ2 1 − ǫ2
memory. In the stochastic case it also functions as the
window length of a time average, and so dampens fluctu- Suppose that our speech community consists of a single
ations. group, and let m = (v, 1 − v) then
Branching and symmetry breaking. – We now
p = (v(1 − ǫ1 − ǫ2 ) + ǫ2 , 1 − ǫ2 − v(1 − ǫ1 − ǫ2 )) . (11)
explore a simple inferability matrix which illustrates how
a speech community can select from two or more alterna- In statistical equilibrium the expected values of the entries
tive routes of linguistic evolution. Consider the linguistic of this vector are equal to the expectations of the current
subspace illustrated in fig. 1, where arrows between states frequencies of the two grammars in the population. Fluc-
indicate that one grammar is inferable from the other (the tuations of the true frequencies about these expectations
direction i → j indicates that j is easier to infer from i will be of order N −1/2 where N is the size of the commu-
than vice versa). In fig. 1 most speakers will eventually end nity. For large communities we can therefore approximate
up in state 4 or 6, so in a large and socially disjointed or ge- the true equilibrium frequencies with their expectations.
ographically dispersed speech community we would even- Replacing the memory values v with their expression in
tually expect to see both kinds of grammar. However, in a terms of the frequency, f , of X, we have the equilibrium
more strongly connected community with sufficiently large condition
conformity number, β, we may see a symmetry breaking fβ f − ǫ2
phase transition. To understand this we recall that while = . (12)
f β + (1 − f )β 1 − ǫ1 − ǫ2

p-3

J. Burridge et al.

(a) (b)
100
(2,0)
15
80 13
(1,0) (2,1)
60 9 11

10
40 14
(0,0) (1,1) (2,2)
12
20 8

1 4 6
0 (0,1) (1,2)

0 50 100 150 200 2
(0,2) 0

Fig. 3: Equilibrium of two state model (10) where ǫ1 =
0.05, ǫ2 = 0.06 and β = 1.3. Each dot is a group and Fig. 4: (a) Transitions in a linguistic subspace of two features;
Kij ∝ exp(−rij /σ 2 ) where rij is the distance between groups each can exist in C = 2 contexts. (b) Inference matrices for
i and j, and σ = 5. Blue groups are dominated by state Y and the three hypotheses. Red: Norwegian, Green: English, Blue:
red by X. The spatial distribution of groups takes the form of Swedish. An arrow from states i to j implies that Sij > Sij >
two gaussian blobs centred at (50, 50) and (150, 50). 0. No arrow implies Sij = 0.

For low conformity this equation has only one solution, but of dense population. Isogloss dynamics of this kind, which
for sufficiently large β a second appears, and the speech is driven by surface tension, is explored in [13, 14].
community must select one, but we cannot tell in advance
which. Even if there is an inferability bias toward one The timing of linguistic changes. – We now con-
state, conformity may keep the community closer to the sider the conditions under which two or more linguis-
other by emphasising it in speakers’ memories. If ǫ1 = tic changes which are equally likely to be inferred from
ǫ2 = ǫ then we have a symmetry between the two states, the current grammar can spontaneously occur at different
and for low conformity the solution is f = 1/2. When times. Often grammatical changes spread though multiple
linguistic contexts over time (examples include the spread
β> (13) of do-support through different clause types in Middle En-
1 − 2ǫ
glish [33] and the spread of the new pronoun chdi between
then this single solution is replaced with two f = 1/2 ± a different syntactic contexts in northern Welsh [34]). To
for some a ∈ [0, 1/2], only one of which is selected by capture this we consider features which occur in C ∈ N
the community. Hence the term symmetry breaking [32], contexts, and simultaneously evolve two equally inferable
which we use to refer generically to the appearance of features X and Y . The state of a single speaker is then
multiple steady states from which the community selects. specified by the integer pair (i, j) indicating that she uses
In thermodynamic systems symmetry breaking generates feature X in i contexts, and Y in j contexts. Because the
macroscopic ordering, and is induced by non-linearity in spread of a feature from one context to another may be
the response of the microscopic constituents of the sys- easier to infer than its first appearance, we set the infer-
tem to their local environment. The strength of this non- ability of the first appearance of a features to be ǫ1 , with
linearity is typically controlled by temperature. In our later changes of the form (i, j) → (i+1, j) or (i, j +1) hav-
case the non-linearity which induces symmetry breaking ing inferability ǫ > ǫ1 . Such acceleration effects may also
lies in the relationship between v and f , and is controlled be caused by social factors [25,35]. A schematic diagram of
by β, which plays a role analogous to inverse temperature the square state space in the C = 2 case is shown in fig. 4
in thermodynamic systems. (a). The spread of a feature may be tracked by calculating
If we imagine the branch in fig. 1 being part of a much the average location of the population as its members nav-
larger network of grammars, then we see that symmetry igate the square. The appearance of new features may be
breaking will cause communities to select one particular prevented or delayed if conformity to the current dominant
route among many through the network. In spatially dis- linguistic state overwhelms any inferability bias toward
tributed cases, we might expect to see different routes new features. This conformity trapping may be overcome
taken in different places or amongst certain well defined by random fluctuations, where a large enough number of
and separated groups, leading to dialects and eventually speakers spontaneously adopt the new feature, reducing
distinct languages. A simple illustration of how a spatially the dominance of the (0, 0) state. A simulation example is
distributed system can support two equilibrium states is shown in fig. 5. Here, both features remain trapped at low
given in fig. 3, where the importance matrix K is struc- levels until fluctuations spontaneously allow one to emerge
tured to model a gaussian spatial interaction kernel be- fully, following the classic S-curve evolution [36, 37].
tween groups. Here we see that a domain wall or isogloss At any moment in time a language has the potential
(in linguistic terminology) has formed between two regions to undergo changes, but their timing appears difficult to

p-4

Evolution of grammar

Table 1: Features for a subspace of Norwegian / Swedish /
Danish / English grammar, and numbers of texts in datasets.

Label Feature Abbrev. Texts
1 uniform genitive endings Noun 3076
2 loss of concord for genitive Concord 933
3 loss of lexical genitives Genitive 1437
4 -s is a phrasal affix Affix NA

feature 4
{1, 2, 3} → 4. (14)
In another hypothesis (the ‘Swedish hypothesis’ [39]) the
reverse is true, with the reanalysis happening first in a
Fig. 5: The evolution of two equivalent features, each of which very restricted context and causing the latter changes
can occur in C = 5 contexts. Speakers start in state (0, 0).
ǫ1 = 0.035, ǫ = 0.075, δ = 0.02, τ = 5 and β = 1.04 where δ 4 → {1, 2, 3}. (15)
is the inferability of reverse transitions. Inset shows detail of
spontaneous transition in feature X. Our main focus is on a ‘Norwegian hypothesis’: that just
one of these changes, the change to uniform genitive end-
ings, is a necessary precondition for the reanlaysis, and
predict. One proposed explanation for major changes [35]
the other changes result from it
is the trigger event: some relatively small change in part of
the linguistic system which facilitates a much wider shift, 1 → 4 → {2, 3}. (16)
which may then accelerate. The natural question then
is: what triggered the trigger? In our simple model, it is Our model of these processes is simplified in a number
possible for changes to be trapped and waiting to happen of ways. First, we examine only a very small subsystem
with their release an inherently unpredictable event. of the grammar. In reality these changes took place in
The rise of phrasal marking of possessive. – We the context of a much more complex grammatical system
now use our model to provide a simplified account of his- and there are other changes, involving other grammati-
torical changes within a linguistic subspace relating to the cal features, which might be argued to have played a role.
genitive case. One function of the genitive is to signify Second, our four labelled features are abstractions from
possession (e.g. in modern English ‘Anna’s shoes’ ). The more complex sets of sub-features (loss of lexical genitives
change we are interested in took place in the grammar of in different contexts; spread of the uniform -(e)s ending to
English and Continental North Germanic (Swedish, Dan- different classes of words, or even to different individual
ish, Norwegian). In each language, a reanalysis of the words; loss of concord in different grammatical contexts
genitive ending -(e)s takes place where it goes from hav- and on different classes of words). Third, we assume that
ing scope over the preceding word to having scope over inferential changes (from the current grammar) occur in
the preceding phrase; for example one feature at a time, although speakers can of course
jump between any two grammar states provided there are
þe [kyng]is of ffraunce → [the king of France]’s. others in those states within the community. These sim-
| {z } | {z } plifications allow us to describe the model and its output
Middle English Modern English
in graspable terms.
In all four languages, other changes affect the genitive. We assume that each feature is either present or ab-
Among these: all endings are replaced by a uniform end- sent in each speaker, so that each speaker’s grammar
ing, -(e)s; all functions of this ending are lost except mark- has a representation of the form (σ1 , σ2 , σ3 , σ4 ) where
ing of possessive (loss of ‘lexical genitives’); a pattern by σk ∈ {0, 1} is the indicator of feature k. For brevity
which all elements of the phrase are independently marked we treat this state as the binary representation of an in-
for genitive (‘concord’) is lost. These changes are listed teger (σ1 , σ2 , σ3 , σ4 ) ≡ 8σ1 + 4σ2 + 2σ3 + σ4 . Within
and labelled in table 1. Different hypotheses about the this state space there are 16 possible grammars and all
relationships among these changes have been proposed, speakers start in state (0, 0, 0, 0) ≡ 0 and eventually reach
which can be expressed as different inferability matrices. (1, 1, 1, 1) ≡ 15. The set of inferable transitions for each
In one hypothesis (the ‘English hypothesis’ [38]), these lat- hypothesis is shown in fig. 4 (b).
ter changes create the ambiguity in surface forms which Fig. 6 shows linguistic data from the Annotated DN on-
are necessary for the reanalysis. That is, with reference line [40], a corpus of medieval Norwegian charters. To gen-
to table 1, features 1, 2 and 3 are needed in order to infer erate the data, for each change, a restricted context that

p-5

J. Burridge et al.

1.0 noun 1.0 noun
affix affix
0.8 concord 0.8 concord
1.0 noun
genitive genitive
affix
0.6 0.6
concord
0.8 genitive
0.4 0.4
Fraction of Speakers

0.2 0.2
0.6
0.0 0.0
0 200 400 600 0 200 400 600
0.4 1.0 noun 1.0 noun
affix affix
0.8 concord 0.8 concord
0.2 genitive genitive
0.6 0.6

0.4 0.4
0.0
1300 1350 1400 1450 1500 1550 0.2 0.2
Year
0.0 0.0
0 200 400 600 0 200 400 600

Fig. 6: Solid lines show features evolving according to the Nor-
way hypothesis with τ = 5. Parameter values found by least Fig. 7: Evolution of features amongst a single group (N =
squares optimization β = 1.1, S0 8 = 0.19, S8 9 = 0.19, S9 13 = 100) evolving according to the English hypothesis. In these
0.40, S9 11 = 0.25, S11 15 = 0.06, S13 15 = 0.11. The inference examples τ = 20 epochs, β = 1.06 and all green (English)
values for all reversals (taken all equal) were constrained to arrows in fig. 4 correspond to inferability ǫ = 0.0175.
match the incomplete transition in concord and genitive, yield-
ing an optimized value 0.03. Crosses show historical data col-
lected for the same features. Features 2 and 3 have smaller dataWe earlier demonstrated that the timing of linguistic
sets (table 1), and greater ambiguity in identification than 1.
changes in the stochastic model can be unpredictable un-
Time origin adjusted to match start of historical changes.
der certain conditions, and we now consider this possibility
along with the other two hypotheses. When the inferabil-
could be exhaustively searched and quantified was identi- ity values corresponding to each arrow in fig. 4 are small
fied: for uniform genitive endings, patronymics in the form in comparison to the conformity number (in the sense of
name-GEN+sonr (GEN stands for standard genitive end- eq. (13)) then the majority of the population may become
ing and sonr ≡ son); for the loss of concord, agreement be- trapped by conformity in its current state. In order to
tween genitive forename and patronymic; and for the loss escape this state and continue linguistic evolution, they
of lexical genitives, the case taken by the preposition mil- must be freed by random fluctuations. To demonstrate
lum ‘between’. Feature 4, the change in scope from word- the combined effect of conformity and fluctuations, fig. 7
to phrasal-level affix, is very hard to quantify from occur- shows four simulations of the English hypothesis, accord-
rences in texts as most potential examples exhibit surface ing to which features 1,2,3 must evolve before the phrasal
ambiguity between the older and newer structures; for this affix. Although the inference matrix treats each of these
reason, no linguistic data are presented for this feature. preliminary features as equivalent, because of the trap-
For simplicity we compare our data to the deterministic ping effects of conformity, the order in which they appear
form (8) of the model. We use calendar years as our unit is randomized, as in fig. 5. In the lower conformity, non-
of time, and set τ = 5, equivalent to ≈ 90% turnover of trapping regime all three features would evolve at the same
memory every 10 years. Although τ cannot be directly time, consistent with the deterministic model. Therefore,
measured, we are motivated by the fact that substantial in the presence of trapping, if we cannot be sure exactly
linguistic changes can take place over a decade for speakers when the phrasal affix arose in comparison to the other
across a wide range of ages [19]. To mimic the embedding features, then all three hypotheses may be consistent with
of the inference matrix (fig. 4 (b)) in a larger network, we any observed sequence of changes. We qualify this state-
incorporated a chain of three precursor states to state 0, ment as follows. Trapping of features which are all equally
and began the entire community in the first of these. To likely to evolve at a given time requires quite a delicate
generate the model fit in fig. 6 we performed a least squares balance between conformity and inferability, and features
optimization of the conformity number and elements of the tend to arise rapidly once they escape the trap. The time
inference matrix, restricted to the Norway form. The cor- intervals between feature appearance can also be highly
respondence between model and data in fig. 6 highlights variable. For these reasons the close fit to experimental
the plausibility of the Norway hypothesis. However the data shown in fig. 6 would be hard to achieve in the trap-
lack data on the rise of phrasal affix means that a range ping regime, both in terms of feature timing and S-curve
of inferability values, consistent with the hypothesis, yield shape. We therefore suggest that the non-trapping case
similarly good fits to data. We have therefore only estab- is the more likely of the two explanations that our model
lished that the structure of S implied by the hypothesis is provides for observations, at least in the case of Norway.
consistent with data; there is uncertainty as to the mag- What are the implications of this for the three hypothe-
nitudes of its non-zero entries. ses? Since each hypothesis applies to a different language,

p-6

Evolution of grammar

and Norwegian is the only one for which we currently have [11] Chambers, J. K. and Trudgill, P., Dialectology (Cam-
high quality quantitative data, then it is possible that all bridge University Press, Cambridge) 1998
three are correct. However, these languages are closely [12] Labov W., Principles of Linguistic Change (Blackwell,
related: medieval Danish, Swedish and Norwegian were Oxford) 1994
mutually comprehensible and formed a single, continuous [13] Burridge J., Phys. Rev. X, 7 (2017) 031008.
[14] Burridge J., Royal Society Open Science, 5 (2018)
dialect area [41]; English was a near geographical neigh-
171446.
bour and close relative. Given that such similar changes
[15] Baronchelli, A. and Felici, M. and Loreto, V. and
happened in all these languages at a similar time, it would Caglioti, E. and Steels, L., Journal of Statistical Me-
be surprising if the mechanisms by which they arose were chanics: Theory and Experiment, 2006 (P06014) .
different in each case. We have provided a simple model [16] Nowak, M. A. and Komarova, N. L. and Niyogi, P. ,
which matches the historical data for Norway, and sup- Nature, 291 (2001) 114.
ports the Norway hypothesis. While we offer no firm con- [17] Hastie T., Tibshirani R. and Friedman J. , The Ele-
clusion about the veracity of the three hypotheses, we sug- ments of Statistical Learning (Springer, New York) 1994
gest that the possibility of a single unifying explanation [18] Chomsky, N. and Halle, M., The Sound Pattern of En-
should be considered, requiring further data collection, glish (Harper and Row, New York) 1968
and a more sophisticated application of our model, involv- [19] Sankoff, G. and Blondeau, H. , Language, 83 (2007)
560.
ing geographical and cross-linguistic interactions, which
[20] Morgan, T. J. H., Rendell, L. E. Ehn, M. Hoppitt,
may be implemented using the importance matrix, K.
W. and K. N. Laland, Proc. R. Soc. B, 279 (2012) 653.
Summary. – We have introduced a model of language [21] Hale, M. and Reiss, C., The phonological enterprise
change which is driven by social conformity and the infer- (Oxford University Press, Oxford) 2008
ence of new linguistic rules from those currently in use. We [22] Yang, C., Language variation and change, 12 (2000) 231.
[23] Liljencrants, J. and Lindblom, B. , Language, 48
have explored some essential properties of the model, in-
(1972) 839.
cluding symmetry breaking and conformity trapping, and [24] Bybee, J., Language variation and change, 14 (2002) 261.
demonstrated how it may be used to quantitatively ad- [25] Lieberson, S., A Matter of Taste:How Names, Fashions
dress a specific linguistic question. The model in principle and Culture Change (Yale University Press, New Haven)
allows exploration of a variety of different processes and 2000
changes in greater detail (possibly requiring a reinterpre- [26] Moreton, E. , Phonology, 25 (2008) 25.
tation of S), including the effects of geographical distribu- [27] Van der Walt, S., Colbert, S. C., and Varoquaux,
tion, and interconnectivity of social groups. G. , Computing in Science and Engineering, 13 (2011) 22.
[28] Krapivsky, P. L. and Redner, S., Physcial Review Let-
∗∗∗ ters, 90 (2003) 238701.
[29] Castello, X. and Baronchelli, A. and Loreto V.,
The authors are grateful to the Royal Society for an The European Physical Journal B, 71 (2009) 557.
APEX award (2018-2020), funded by the Leverhulme [30] Blythe R. A. and McKane A. J., Journal of Statistical
Mechanics: Theory and Experiment, 2007 (P07018) .
Trust.
[31] Kauhanen H. , Journal of Linguistics, 53 (2016) 327.
[32] Blundell S. J. and Blundell K. M., Concepts in Ther-
REFERENCES mal Physics (Oxford University Press, Oxford) 2006
[33] Kroch A. S. , Language Variation and Change, 1 (1989)
[1] Helbing, D., Farkas, I. and Vicsek, T., Nature, 407 199.
(2000) 487. [34] Willis D., Journal of Linguistic Geography, 5 (2017) 41–
[2] Bialek W. et. al., PNAS, 109 (2012) 4786. 66.
[3] Yakovenko, V. M. and Rosser J. B. , Rev. Mod. Phys., [35] Labov, W., Principles of Linguistic Change: Cognitive
81 (2009) 1703. and Cultural Factors, Volume 3 (Wiley, Chichester) 2010
[4] Castellano, C. Fortunato, S. and Loreto, V., Rev. [36] Blythe R. A. and Croft W. , Language, 88 (2012) 269.
Mod. Phys., 81 (2009) 591. [37] Ghanbarnejad F., Gerlach M., Miotto J. M. and
[5] Baxter G. J. , Blythe R. A. , Croft W. and McKane Altmann E. G., J. R. Soc. Interface, 11 (2014) 20141044.
A. J., Language Variation and Change, 21 (2009) 257. [38] Allen C. L., Genitives in Early English: Typology and
[6] Schulze C., Stauffer D., and Wichmann S., Commu- Evidence (Oxford University Press, Oxford) 2008
nications in Computational Physics, 3 (2009) 271. [39] Norde M. , The History of the Genitive in Swedish. A
[7] Baxter G. J. , Blythe R. A. , Croft W. and McKane Case Study in Degrammaticalization (Universiteit van Am-
A. J., Phys. Rev. E, 73 (2006) 046118. sterdam, Amsterdam) 1997
[8] Fort J., and Perez-Losada J, Journal of The Royal So- [40] Blaxter T., Annotated DN online (http://www.icge.
ciety Interface, 13 (2016) 20160185. co.uk/dn_online/) 2017
[9] Goncalves B., Loureiro-Porto L., Ramasco J. J. and [41] Vikor L. S. , The Nordic languages: An international
Sanchez, D., PLoS ONE, 13 (2018) e0197741. handbook of the history of the North Germanic languages,
[10] Fasold, R. W. and Connor-Linton J., An Introduction edited by Bandle, O. et al., Vol. 1 (de Gruyter, New
to Language and Linguistics (Cambridge University Press, York) 2002, p. 6–7.
Cambridge) 2014

p-7
References (41)
Helbing, D., Farkas, I. and Vicsek, T., Nature, 407 (2000) 487.
Bialek W. et. al., PNAS, 109 (2012) 4786.
Yakovenko, V. M. and Rosser J. B. , Rev. Mod. Phys., 81 (2009) 1703.
Castellano, C. Fortunato, S. and Loreto, V., Rev. Mod. Phys., 81 (2009) 591.
Baxter G. J. , Blythe R. A. , Croft W. and McKane A. J., Language Variation and Change, 21 (2009) 257.
Schulze C., Stauffer D., and Wichmann S., Commu- nications in Computational Physics, 3 (2009) 271.
Baxter G. J. , Blythe R. A. , Croft W. and McKane A. J., Phys. Rev. E, 73 (2006) 046118.
Fort J., and Perez-Losada J, Journal of The Royal So- ciety Interface, 13 (2016) 20160185.
Goncalves B., Loureiro-Porto L., Ramasco J. J. and Sanchez, D., PLoS ONE, 13 (2018) e0197741.
Fasold, R. W. and Connor-Linton J., An Introduction to Language and Linguistics (Cambridge University Press, Cambridge) 2014
Chambers, J. K. and Trudgill, P., Dialectology (Cam- bridge University Press, Cambridge) 1998
Labov W., Principles of Linguistic Change (Blackwell, Oxford) 1994
Burridge J., Phys. Rev. X, 7 (2017) 031008.
Burridge J., Royal Society Open Science, 5 (2018) 171446.
Baronchelli, A. and Felici, M. and Loreto, V. and Caglioti, E. and Steels, L., Journal of Statistical Me- chanics: Theory and Experiment, 2006 (P06014) .
Nowak, M. A. and Komarova, N. L. and Niyogi, P. , Nature, 291 (2001) 114.
Hastie T., Tibshirani R. and Friedman J. , The Ele- ments of Statistical Learning (Springer, New York) 1994
Chomsky, N. and Halle, M., The Sound Pattern of En- glish (Harper and Row, New York) 1968
Sankoff, G. and Blondeau, H. , Language, 83 (2007) 560.
Morgan, T. J. H., Rendell, L. E. Ehn, M. Hoppitt, W. and K. N. Laland, Proc. R. Soc. B, 279 (2012) 653.
Hale, M. and Reiss, C., The phonological enterprise (Oxford University Press, Oxford) 2008
Yang, C., Language variation and change, 12 (2000) 231.
Liljencrants, J. and Lindblom, B. , Language, 48 (1972) 839.
Bybee, J., Language variation and change, 14 (2002) 261.
Lieberson, S., A Matter of Taste:How Names, Fashions and Culture Change (Yale University Press, New Haven) 2000
Moreton, E. , Phonology, 25 (2008) 25.
Van der Walt, S., Colbert, S. C., and Varoquaux, G. , Computing in Science and Engineering, 13 (2011) 22.
Krapivsky, P. L. and Redner, S., Physcial Review Let- ters, 90 (2003) 238701.
Castello, X. and Baronchelli, A. and Loreto V., The European Physical Journal B, 71 (2009) 557.
Blythe R. A. and McKane A. J., Journal of Statistical Mechanics: Theory and Experiment, 2007 (P07018) .
Kauhanen H. , Journal of Linguistics, 53 (2016) 327.
Blundell S. J. and Blundell K. M., Concepts in Ther- mal Physics (Oxford University Press, Oxford) 2006
Kroch A. S. , Language Variation and Change, 1 (1989) 199.
Willis D., Journal of Linguistic Geography, 5 (2017) 41- 66.
Labov, W., Principles of Linguistic Change: Cognitive and Cultural Factors, Volume 3 (Wiley, Chichester) 2010
Blythe R. A. and Croft W. , Language, 88 (2012) 269.
Ghanbarnejad F., Gerlach M., Miotto J. M. and Altmann E. G., J. R. Soc. Interface, 11 (2014) 20141044.
Allen C. L., Genitives in Early English: Typology and Evidence (Oxford University Press, Oxford) 2008
Norde M. , The History of the Genitive in Swedish. A Case Study in Degrammaticalization (Universiteit van Am- sterdam, Amsterdam) 1997
Blaxter T., Annotated DN online (http://www.icge. co.uk/dn_online/) 2017
Vikor L. S. , The Nordic languages: An international handbook of the history of the North Germanic languages, edited by Bandle, O. et al., Vol. 1 (de Gruyter, New York) 2002, p. 6-7.
FAQs
AI
What explains the role of conformity in language evolution?
add
The model reveals that a high conformity number (β > 1) drives language communities towards dominant grammar states, while lower β allows for multiple grammatical evolutions to coexist.
How do speaker memories influence language change dynamics?
add
The study finds that speaker memories decay over time, influencing linguistic states' probabilities, which significantly affects the timing and nature of language evolution.
When did key changes in the genitive case occur in English?
add
The model suggests that major shifts in the genitive case began around the 14th century, coinciding with phonological simplifications observed in historical texts.
How did the model approach inferability and its impact on changes?
add
The research demonstrates that different inferability matrices dictate the sequence of linguistic changes, highlighting how speakers navigate and adopt grammatical features in evolving contexts.
What implications arise from exploring dialect formation through this model?
add
The findings indicate that spatial and social group interconnectivity substantially shapes dialect emergence, suggesting potential pathways for distinct language development across regions.
Bert Vaux
University of Cambridge, Faculty Member
Papers
224
Followers
9,410
View all papers from
Bert Vaux
arrow_forward
Related papers
Statisical Physics of Language Evolution: The Grammaticalization Phenomenon
Quentin Feltgen
Cette thèse se propose d'étudier la grammaticalisation, processus d'évolution linguistique par lequel les éléments fonctionnels de la langue se trouvent remplacés au cours du temps par des mots ou des constructions de contenu, c'est-à-dire servant à désigner des entités plus concrètes. La grammaticalisation est donc un cas particulier de remplacement sémantique. Or, la langue faisant l'objet d'un consensus social bien établi, il semble que le changement sémantique s'effectue à contre-courant de la bonne efficacité de la communication ; pourtant, il est attesté dans toutes les langues, toutes les époques et, comme le montre la grammaticalisation, toutes les catégories linguistiques. Dans cette thèse, nous étudions d'abord le phénomène de grammaticalisation d'un point de vue empirique, en analysant les fréquences d'usage de plusieurs centaines de constructions du langage connaissant une ou plusieurs grammaticalisations au cours de l'histoire de la langue française. Ces profils de fréquence sont extraits de la base de données de Frantext, qui permet de couvrir une période de sept siècles. L'augmentation de fréquence en courbe en S concomitante du remplacement sémantique, attestée dans la littérature, est confirmée, mais aussi complétée par l'observation d'une période de latence, une stagnation de la fréquence d'usage de la construction alors même que celle-ci manifeste déjà son nouveau sens. Les distributions statistiques des observables décrivant ces deux phénomènes sont obtenues et quantifiées. Un modèle de marche aléatoire est ensuite proposé reproduisant ces deux phénomènes. La latence s'y trouve expliquée comme un phénomène critique, au voisinage d'une bifurcation point-col. Une extension de ce modèle articulant l'organisation du réseau sémantique et les formes possibles de l'évolution est ensuite discutée.
Download free PDF
View PDF
chevron_right
Modelling language evolution: Examples and predictions (target article, commentaries and authors' replies)
Tao Gong
Lan SHUAI
We survey recent computer modelling research of language evolution, focusing on a rule-based model simulating the lexiconsyntax coevolution and an equation-based model quantifying the language competition dynamics. We discuss four predictions of these models: (a) correlation between domain-general abilities (e.g. sequential learning) and language-specific mechanisms (e.g. word order processing); (b) coevolution of language and relevant competences (e.g. joint attention); (c) effects of cultural transmission and social structure on linguistic understandability; and (d) commonalities between linguistic, biological, and physical phenomena. All these contribute significantly to our understanding of the evolutions of language structures, individual learning mechanisms, and relevant biological and socio-cultural factors. We conclude the survey by highlighting three future directions of modelling studies of language evolution: (a) adopting experimental approaches for model evaluation; (b) consolidating empirical foundations of models; and (c) multi-disciplinary collaboration among modelling, linguistics, and other relevant disciplines.
Download free PDF
View PDF
chevron_right
Evolution of grammatical forms: some quantitative approaches
Anita Mehta
Eur. Phys. J. B 96:19, 2023
Grammatical forms are said to evolve via two main mechanisms. These are, respectively, the 'descent' mechanism, where current forms can be seen to have descended (albeit with occasional modifications) from their roots in ancient languages, and the 'contact' mechanism, where evolution in a given language occurs via borrowing from other languages with which it is in contact. We use ideas and concepts from statistical physics to formulate a series of static and dynamical models which illustrate these issues in general terms. The static models emphasise the relative numbers of rules and exceptions, while the dynamical models focus on the emergence of exceptional forms. These unlikely survivors among various competing grammatical forms are winners against the odds. Our analysis suggests that they emerge when the influence of neighbouring languages exceeds the generic tendency towards regularisation within individual languages.
Download free PDF
View PDF
chevron_right
Language change as an evolutionary process: Theoretic background and computational applications
Jyri Lehtinen
2010
In the thesis it is discussed in what ways concepts and methodology developed in evolutionary biology can be applied to the explanation and research of language change. The parallel nature of the mechanisms of biological evolution and language change is explored along with the history of the exchange of ideas between these two disciplines. Against this background computational methods developed in evolutionary biology are taken into consideration in terms of their applicability to the study of historical relationships between languages. Different phylogenetic methods are explained in common terminology, avoiding the technical language of statistics. The thesis is on one hand a synthesis of earlier scientific discussion, and on the other an attempt to map out the problems of earlier approaches in addition to finding new guidelines in the study of language change on their basis. Primarily literature about the connections between evolutionary biology and language change, along with research articles describing applications of phylogenetic methods into language change have been used as source material. The thesis starts out by describing the initial development of the disciplines of evolutionary biology and historical linguistics, a process which right from the beginning can be seen to have involved an exchange of ideas concerning the mechanisms of language change and biological evolution. The historical discussion lays the foundation for the handling of the generalised account of selection developed during the recent few decades. This account is aimed for creating a theoretical framework capable of explaining both biological evolution and cultural change as selection processes acting on self-replicating entities. This thesis focusses on the capacity of the generalised account of selection to describe language change as a process of this kind. In biology, the mechanisms of evolution are seen to form populations of genetically related organisms through time. One of the central questions explored in this thesis is whether selection theory makes it possible to picture languages are forming populations of a similar kind, and what a perspective like this can offer to the understanding of language in general. In historical linguistics, the comparative method and other, complementing methods have been traditionally used to study the development of languages from a common ancestral language. Computational, quantitative methods have not become widely used as part of the central methodology of historical linguistics. After the fading of a limited popularity enjoyed by the lexicostatistical method since the 1950s, only in the recent years have also the computational methods of phylogenetic inference used in evolutionary biology been applied to the study of early language history. In this thesis the possibilities offered by the traditional methodology of historical linguistics and the new phylogenetic methods are compared. The methods are approached through the ways in which they have been applied to the Indo-European languages, which is the most thoroughly investigated language family using both the traditional and the phylogenetic methods. The problems of these applications along with the optimal form of the linguistic data used in these methods are explored in the thesis. The mechanisms of biological evolution are seen in the thesis as parallel in a limited sense to the mechanisms of language change, however sufficiently so that the development of a generalised account of selection is deemed as possibly fruiful for understanding language change. These similarities are also seen to support the validity of using phylogenetic methods in the study of language history, although the use of linguistic data and the models of language change employed by these models are seen to await further development.
Download free PDF
View PDF
chevron_right
Quantifying the evolutionary dynamics of language
Martin Nowak
Nature, 2007
Human language is based on grammatical rules. Cultural evolution allows these rules to change over time. Rules compete with each other: as new rules rise to prominence, old ones die away. To quantify the dynamics of language evolution, we studied the regularization of English verbs over the past 1,200 years. Although an elaborate system of productive conjugations existed in English&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;#39;s proto-Germanic ancestor, Modern English uses the dental suffix, &amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;#39;-ed&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;#39;, to signify past tense. Here we describe the emergence of this linguistic rule amidst the evolutionary decay of its exceptions, known to us as irregular verbs. We have generated a data set of verbs whose conjugations have been evolving for more than a millennium, tracking inflectional changes to 177 Old-English irregular verbs. Of these irregular verbs, 145 remained irregular in Middle English and 98 are still irregular today. We study how the rate of regularization depends on the frequency of word usage. The half-life of an irregular verb scales as the square root of its usage frequency: a verb that is 100 times less frequent regularizes 10 times as fast. Our study provides a quantitative analysis of the regularization process by which ancestral forms gradually yield to an emerging linguistic rule.
Download free PDF
View PDF
chevron_right
Exemplar models, evolution and language change (2006) The Linguistic Review 23: 247–274
Andrew Wedel
Evidence supporting a rich memory for associations suggests that people can store perceptual details in the form of exemplars. The resulting particulate model of category contents allows the application of evolution theory in mod-eling category change, because variation in categorized percepts is reflected in the distribution of exemplars in a category. Within a production-perception feedback loop, variation within an exemplar-based category provides a reserve of variants that can serve as the seeds for shifts in the system over time through random or selection-driven asymmetries in production and perception. Here, three potential pathways for evolutionary change are identified in linguistic categories: pruning of lines of inheritance, blending inheritance and natural selection. Simulations of each of these pathways are shown within a simple exemplar-based model of category production and perception, showing how consideration of evolutionary processes may contribute to our understanding of linguistic category change over time.
Download free PDF
View PDF
chevron_right
Did language evolve through language change? On language change, language evolution and grammaticalization theory
Jose-Luis Mendivil-Giro
Glossa: a journal of general linguistics 4(1): 124. 1–30., 2019
The relation between language change and the process of language evolution is controversial in current linguistic theory. Some authors believe that the two processes are completely unrelated, while for others the evolution of language is (at least in part) a consequence of linguistic changes. Both models imply a very different assessment of what is changing when languages themselves change. I present an explicit model of what changes when languages change, and I show that the claim that language change is a crucial factor in explaining the evolution of human language, although suggestive and very popular, faces problems of a theoretical and empirical nature.
Download free PDF
View PDF
chevron_right
The Evolutionary-Emergence Model: Explaining Linguistic Variation in a Transition Zone.
Douglas Bigham
Download free PDF
View PDF
chevron_right
Evolutionary dynamics in language form and language
Monica Tamariz
2010
This paper describes evolutionary dynamics in language and presents a genetic framework of language akin to those of and Mufwene , where language is a complex system that inhabits, interacts with and evolves in communities of human speakers. The novelty of the present framework resides in the separation between form (phonology and syntax) and meaning (semantics), which are described as two different selection systems, connected by symbolic association and by probabilistic encoding of information.
Download free PDF
View PDF
chevron_right
Evaluating the role of quantitative modelling in language evolution
Maurício de Jesus Dias Martins
The Past, Present and Future of Language Evolution, 2014
Models are a flourishing and indispensable area of research in language evolution. Here we highlight critical issues in using and interpreting models and suggests viable approaches. First, contrasting models can explain the same data and similar modelling techniques can lead to diverging conclusions. This should act as a reminder to use the extreme malleability of modelling parsimoniously when interpreting results. Second, quantitative techniques similar to those used in modelling language evolution have proven themselves inadequate in other disciplines. Cross-disciplinary fertilization is crucial to avoid mistakes previously occurred in other areas. Finally, experimental validation is necessary both to sharpen models' hypotheses, and to support their conclusions. Our belief is that models should be interpreted as quantitative demonstrations of logical possibilities, rather than direct sources of evidence. Only an integration of theoretical principles, quantitative proofs and empirical validation can allow research in the evolution of language to progress.
Download free PDF
View PDF
chevron_right
Cited by
Using spatial patterns of English folk speech to infer the universality class of linguistic copying
James Burridge
Physical Review Research, 2020
Both linguistic and genetic evolution involve copying and mutation of variants. The simplest copying process assumes that variants are reproduced at a rate equal to their current frequency, exemplified by Kimura's stepping stone model of neutral evolution, and the voter model. In this case, spatial patterns are driven by noise. In the linguistic context, an alternative possibility is that speakers preferentially select variants which are already popular, yielding patterns driven by surface tension, exemplified by the Ising model. In this paper, we model language change using a spatial network of speakers, inspired by the Hopfield neural network. The model's universality class-Voter or Ising-is determined by speakers' learning function. We view maps generated by the Survey of English Dialects as samples from our network. Maximum likelihood analysis, and comparison of spatial auto-correlations between real and simulated maps, indicates that the underlying copying processes is more likely to belong to the conformity-driven Ising class.
Download free PDF
View PDF
chevron_right
Explore
Papers
Topics
Features
Mentions
Analytics
PDF Packages
Advanced Search
Search Alerts
Journals
Academia.edu Journals
My submissions
Reviewer Hub
Why publish with us
Testimonials
Company
About
Careers
Press
Content Policy
580 California St., Suite 400
San Francisco, CA, 94104