Academia.edu uses cookies to personalize content, tailor ads and improve the user experience. By using our site, you agree to our collection of information through the use of cookies. To learn more, view our Privacy Policy.
Regression Analysis of Lexical and Morpho-Syntactic Properties of Kiezdeutsch
2021
…
7 pages
Sign up for access to the world's latest research
Abstract
Kiezdeutsch is a variety of German predominantly spoken by teenagers from multi-ethnic urban neighborhoods in casual conversations with their peers. In recent years, the popularity of Kiezdeutsch has increased among young people, independently of their socio-economic origin, and has spread in social media, too. While previous studies have extensively investigated this language variety from a linguistic and qualitative perspective, not much has been done from a quantitative point of view. We perform the first large-scale data-driven analysis of the lexical and morpho-syntactic properties of Kiezdeutsch in comparison with standard German. At the level of results, we confirm predictions of previous qualitative analyses and integrate them with further observations on specific linguistic phenomena such as slang and self-centered speaker attitude. At the methodological level, we provide logistic regression as a framework to perform bottom-up feature selection in order to quantify differen...
Key takeaways
AI
AI
- This study introduces logistic regression for quantifying lexical and morpho-syntactic differences in Kiezdeutsch.
- Kiezdeutsch shows significant syntactic variations, including verb-first declaratives and absence of determiners.
- Three studies analyze unigram and trigram POS distributions, revealing distinctive patterns between Kiezdeutsch and standard German.
- The KiDKo corpus contains 63,604 sentences from Kiezdeutsch, while GRAIN has 14,097 sentences from standard German.
- Findings support previous qualitative claims while highlighting Kiezdeutsch's unique linguistic features and slang.
Related papers
Education and Linguistics Research, 2018
Facebook (FB) is one of the social networks that allow its users to interact freely by posting short messages, pictures and videos. FB has a forum where people write and post their opinions, pictures and videos to see their friends’ reactions. FB also allows anonymity thus giving users the freedom to use a language of their choice without restrictions. Given the fact that FB is an informal context, users employ certain patterns of language in their interactions. This paper endeavors to examine the manner in which these patterns of language are used on FB with special focus on Kiswahili language. Kiswahili is now an official language in Kenya and there is a paradigm shift concerning patterns of texts that are sent on FB interaction. The objective of the study was to analyze the linguistic features used in selected social interactions on FB (SSIFB). The units of analysis in this study were texts that were sent as reactions to the news and pictures that were posted on the FB forums suc...
In the last few years, microblogging platforms such as Twitter have given rise to a deluge of textual data that can be used for the analysis of informal communication between millions of individuals. In this work, we propose an information-theoretic approach to geographic language variation using a corpus based on Twitter. We test our models with tens of concepts and their associated keywords detected in Spanish tweets geolocated in Spain. We employ dialectometric measures (cosine similarity and Jensen-Shannon divergence) to quantify the linguistic distance on the lexical level between cells created in a uniform grid over the map. This can be done for a single concept or in the general case taking into account an average of the considered variants. The latter permits an analysis of the dialects that naturally emerge from the data. Interestingly, our results reveal the existence of two dialect macrovarieties. The first group includes a region-specific speech spoken in small towns and rural areas whereas the second cluster encompasses cities that tend to use a more uniform variety. Since the results obtained with the two different met-rics qualitatively agree, our work suggests that social media corpora can be efficiently used for dialectometric analyses.
Journal of Language Contact, 2016
In the past decade there is a growing interest in Urban Youth Speech Styles (uyss). In this article Dutch uyss is the focus of attention. The basic question to be addressed is whether the identifying characteristics and functions of spoken uyss can be used and recognized in written form on the Internet as well. There is no standardized form of uyss and the use of it is restricted to members of specific subcultures, not necessarily linked to specific ethnic groups. First, linguistic and functional characteristics of uyss as they are used in the Netherlands will be described. Linguistically, a distinction is made between lexical, grammatical and phonetic/prosodic aspects. Furthermore, a closer look will be taken at the use of uyss on the Internet (mostly through rap) and examples of the use of uyss in written comments on the rap videos will be presented and compared to the spoken varieties. It will be shown how written clues are used for identification purposes that are usually non-li...
2022
Languages are continuously undergoing changes, and the mechanisms that underlie these changes are still a matter of debate. In this work, we approach language evolution through the lens of causality in order to model not only how various distributional factors associate with language change, but how they causally affect it. In particular, we study slang, which is an informal language that is typically restricted to a specific group or social setting. We analyze the semantic change and frequency shift of slang words and compare them to those of standard, nonslang words. With causal discovery and causal inference techniques, we measure the effect that word type (slang/nonslang) has on both semantic change and frequency shift, as well as its relationship to frequency, polysemy and part of speech. Our analysis provides some new insights in the study of language change, e.g., we show that slang words undergo less semantic change but tend to have larger frequency shifts over time. 1
Jurnal Ilmu Pendidikan dan Humaniora, 2022
This research delves into the dynamic world of informal language usage among students on the popular messaging platform, WhatsApp. As digital communication becomes an integral part of daily life, the study examines the frequency, variability, motivations, and social dynamics of slang usage among students. Through surveys, interviews, and, where possible, data analysis of WhatsApp conversations, the research uncovers the complex interplay between language, technology, and human connection in the digital realm. The findings reveal that slang is not merely a linguistic phenomenon but a reflection of the adaptability of language in the digital age. It serves as a linguistic bridge that enables informal communication in digital interactions. The lexicon of slang is diverse and ever-evolving, reflecting the cultural and social context in which it thrives. Motivations for slang usage go beyond humor and informality, extending to self-expression, emotional connection, and the formation of digital identities. Slang enhances social bonds and fosters a sense of belonging among peers, shaping the quality of digital interactions. Demographic variations in slang usage demonstrate its context-dependent nature, influenced by factors such as age, gender, and geographical location. Slang's impact on digital communication is significant, enhancing informal exchanges while presenting challenges, particularly in cross-cultural interactions. This research underscores the importance of digital literacy and cross-cultural understanding in online interactions and has implications for education, linguistic research, and cross-cultural communication. As the digital landscape continues to evolve, this research offers a deeper understanding of the role of language in shaping human connections in the digital age. It calls for ongoing exploration into the everchanging linguistic dynamics of digital communication and its profound impact on contemporary society.
Scriptora International Journal of Research and Innovation, 2025
The swift development of social media sites has not only altered the way of communication but also the form and role of language as such. This paper discusses the ways in which online communication using platforms like Twitter (X) Instagram, Tik Tok and WhatsApp has enhanced the speed of linguistic change in modern societies. Based on sociolinguistic and discourse-analytic approaches, the study analyzes lexical change, morphological decline, and code-mixing behaviors that can be observed in the online communication. To examine the ways in which users form meaning, identity and community by using changing forms of language, a total of 2,000 public posts and comments in English, Hindi-English and regional vernaculars were collected. The discussion shows that there is a massive trend of moving towards being brief, creative, and visually hybrid, with emojis, hashtags, abbreviations, and multimodal cues taking over or adding to the conventional syntax. Besides, the results point to the fact that social media promotes linguistic democratization through the degradation of prescriptive norms and the enhancement of non-standard varieties. Yet, the same dynamics bring along the issue to do with clarity, intergenerational communication, and linguistic fragmentation. The paper states that the language in the digital world is not being ruined but evolving-it is much more immediate, informal, and global in interaction. This paper will contribute to the general discourse of digital literacy and cultural identity, as well as the future direction of linguistic evolution in networked societies by following the patterns of language variation and change. Finally, the study highlights the fact that the social media context as both a trigger and a reflection of the current alterations in the language use, shows how communication technologies transform the linguistic behaviour in the XXI century.
JOURNAL OF LANGUAGE AND COMMUNICATION, 2024
The internet is an environment where users display the prowess in language and all forms of semiotic resources for different forms of communication. The current study examines Generation Z's use of language in order to highlight the emerging forms of language structures on TikTok, as a result of the innate ability of humans to create new forms, mostly in verbal communication. The research questions are: What are the forms of construction of Generation Z's lingo on TikTok? What semantic peculiarities are Generation Z's lingo on TikTok characterized by? The premise of this study is that, Generation Z's creative use of language on TikTok, shows the innate ability of humans to create new forms, and adapt to the affordances of new platforms in using language for communication. Crystal's (2005) propositions concerning Internet Linguistics-the Stylistic perspective is applied in this study. Fifty Tik Tok videos were downloaded from the TikTok platform, some of the videos had the same lingos, in which case only one of such videos was chosen. Seventeen videos were used, each video was played; the part of the video that had the Generation Z's word(s) was screen shot and used as the data for the analysis. In all, seventeen images were used. Findings show that the Generation Z's lingo is replete with: acronyms used as full words in speech, combination of words and image in speech, the use of neologisms, esoteric words, and laconic structures. Some words and phrases have new meanings in addition to their previous meanings; all these show a new variety of the English language. The study recommends that language users should deploy the affordance of new internet platforms to freely express themselves, in doing so, naturally, new varieties will spring up and widen the lexicon of the English language.
2021
Background : Language is the most important thing in life, because it can unite one and other from different regions although different countries. That is why it also has a problem. Language is not just communication but also it becomes an identity for some region or country, as like in Indonesia they have different languages from different regions that make them unique. But unfortunately, most millennial people now choose to use another language as slang that can affect their own language. This study makes to discuss this problem because many people does not matter with this, so we try to let them know that this is important to know because concerning our identity. Purpose : the purpose on this article is to explore a slang language in Indonesia especially in younger generation that affect to Indonesian language in this digital era. Method : This study used a qualitative descriptive method that concerned a questioner as main data, articles then other websites. Results : The result ...
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016
Though dialectal language is increasingly abundant on social media, few resources exist for developing NLP tools to handle such language. We conduct a case study of dialectal language in online conversational text by investigating African-American English (AAE) on Twitter. We propose a distantly supervised model to identify AAE-like language from demographics associated with geo-located messages, and we verify that this language follows well-known AAE linguistic phenomena. In addition, we analyze the quality of existing language identification and dependency parsing tools on AAE-like text, demonstrating that they perform poorly on such text compared to text associated with white speakers. We also provide an ensemble classifier for language identification which eliminates this disparity and release a new corpus of tweets containing AAE-like language.
The emergence of digital technology has brought profound changes in the ways language is produced, shared, and interpreted. With the widespread use of social media platforms, instant messaging applications, blogs, and online discussion forums, communication has become faster, more interactive, and less bound by traditional linguistic norms. These digital environments encourage new linguistic practices that influence vocabulary, grammar, spelling, and discourse patterns. As a result, language in the digital age is constantly evolving and reshaping conventional forms of communication. This study investigates language change in the digital age by conducting a linguistic analysis of online communication. The research focuses on identifying key linguistic features commonly used in digital contexts, such as abbreviations, acronyms, emoji's, creative spellings, grammatical simplification, and code-switching. A qualitative research approach is employed to analyze selected online texts in order to understand how technological and social factors contribute to linguistic variation and innovation. The findings of the study reveal that digital communication promotes linguistic creativity and flexibility rather than linguistic decay. Online language reflects users' identities, social relationships, and communicative needs within digital spaces. The study concludes that language change driven by digital communication is a natural process of linguistic evolution and highlights the importance of recognizing online discourse as a significant and legitimate area of linguistic research. Language is a living and dynamic system that continuously evolves in response to social, cultural, political, and technological changes. Throughout history, major transformations in modes of communication such as the invention of writing, the printing press, and mass media have played a crucial role in shaping linguistic practices. In the contemporary world, digital technology has emerged as one of the most powerful forces influencing language use and change. The rapid expansion of the internet, smartphones, and digital platforms has transformed the way people communicate, resulting in new linguistic forms that differ significantly from traditional spoken and written language. The digital age has introduced a wide range of communication platforms, including social media networks, instant messaging applications, emails, blogs, online forums, and virtual communities. These platforms have reshaped interpersonal communication by making it faster, more interactive, and more informal. Unlike traditional written communication, which often follows standardized grammatical rules and formal structures, online communication allows greater flexibility and creativity. Users frequently adapt language to meet the demands of speed, space limitations, and audience engagement, leading to noticeable linguistic variation and innovation. One of the most striking features of online communication is the emergence of new linguistic forms. Abbreviations, acronyms, shortened spellings, hashtags, emoji's, and gifs have become integral parts of digital discourse. These features serve multiple communicative functions, such as expressing emotions, emphasizing meaning, maintaining social relationships, and enhancing clarity in text-based interaction. From a linguistic perspective, such features demonstrate how language adapts to technological environments rather than deteriorates because of them. The digital medium encourages efficiency and expressiveness, which directly influence language structure and usage. Another important aspect of language change in the digital age is the blurring of boundaries between spoken and written language. Online communication often combines characteristics of both modes. For example, instant messages and social media posts may appear in written form but reflect the spontaneity, informality, and conversational tone of speech. This hybrid nature of digital language challenges traditional linguistic classifications and invites scholars to reconsider established concepts of language norms, correctness, and standardization. Social factors also play a significant role in shaping digital language. Online platforms bring together users from diverse linguistic, cultural, and social backgrounds, creating multilingual and multicultural spaces. As a result, practices such as code-switching and codemixing are commonly observed in digital communication, especially in multilingual societies. Users often alternate between languages to express identity, solidarity, humor, or social belonging. These practices highlight the close relationship between language, identity, and social interaction in digital contexts. The role of young people in driving digital language change is particularly significant. Younger generations are often early adopters of new technologies and platforms, and they actively experiment with language to create new expressions, slang, and stylistic trends. These innovations frequently spread beyond digital spaces and influence offline communication as well. Over time, some digital linguistics features become normalized and integrated into mainstream language use, demonstrating how digital communication contributes to long-term language change. Despite the growing presence of digital language in everyday life, it has often been viewed negatively, especially in educational and formal contexts. Critics argue that excessive use of online language leads to the deterioration of grammar, spelling, and writing skills. However, from a linguistic standpoint, such concerns overlook the adaptive and rulegoverned nature of language change. Linguists argue that variation and change are natural processes, and digital language represents an expansion of communicative resources rather than
References (24)
- Jannis Androutsopoulos. 1998a. Deutsche Jugend- sprache: Untersuchungen zu ihren Strukturen und Funktionen. Peter Lang, Frankfurt am Main.
- Jannis Androutsopoulos. 1998b. Forschungsperspek- tiven auf Jugendsprache: Ein integrativer Überblick. In Jannis Androutsopoulos and Arno Scholz, editors, Jugendsprache -Langue des Jeunes -Young Peo- ple's Language. Peter Lang, Frankfurt am Main.
- Jannis Androutsopoulos. 2001. Ultra korregd Alder! Zur medialen Stilisierung und Aneignung von 'Türkendeutsch'. Deutsche Sprache, 29:321-339.
- Peter Auer. 2003. "Türkenslang": Ein jugendsprach- licher Ethnolekt des Deutschen und seine Trans- formationen. In Annelies Häcki-Buhofer, edi- tor, Spracherwerb und Lebensalter, pages 255-264. Tübingen: Francke.
- Ulrike Freywald, Katharina Mayr, Tiner Özc ¸elik, and Heike Wiese. 2011. Kiezdeutsch as a multiethnolect. Ethnic Styles of Speaking in European Metropolitan Areas, pages 45-73.
- Susanne Fuchs, Jelena Krivokapic, and Stefanie Jannedy. 2010. Prosodic boundaries in German: Final lengthening in spontaneous speech. Journal of the Acoustical Society of America, 127(3):1851- 1851.
- Stefanie Jannedy. 2010. The usage and distribution of "so" in spontaneous Berlin Kiezdeutsch. ZASPiL Pa- pers from the Linguistics Laboratory, 43(52).
- Inken Keim and Ralf Knöbl. 2011. Linguistic vari- ation and linguistic virtuosity of young "ghetto"- migrants in Mannheim. In Friederike Kern and Mar- gret Selting, editors, Ethnic Styles of Speaking in Eu- ropean Metropolitan Areas, pages 239-264. Amster- dam: Benjamins.
- Paul S. Levy and Stanley Lemeshow. 2013. Sampling of populations: Methods and applications. John Wi- ley & Sons.
- Lindsay Preseau. 2018. Kiezdeutsch, Kiezenglish: En- glish in German Multilingual/-ethnic Speech Com- munities. Ph.D. thesis, UC Berkeley.
- Ines Rehbein and Sören Schalowski. 2013. STTS goes Kiez-Experiments on annotating and tagging urban youth language. Journal for Language Technology and Computational Linguistics, 28(1).
- Ines Rehbein, Sören Schalowski, and Heike Wiese. 2014. The KiezDeutsch Korpus (KiDKo) Release 1.0.
- Helmut Schmid. 1994. Probabilistic part-of-speech tagging using decision trees. In International Con- ference on New Methods in Language Processing, pages 44-49, Manchester, UK.
- Katrin Schweitzer, Kerstin Eckart, Markus Gärtner, Ag- nieszka Falenska, Arndt Riester, Ina Rösiger, An- tje Schweitzer, Sabrina Stehwien, and Jonas Kuhn. 2018. German radio interviews: The GRAIN re- lease of the SFB732 Silver Standard Collection. In Proceedings of the 11th International Conference on Language Resources and Evaluation.
- Patrick Stevenson, Kristine Horner, Nils Langer, and Gertrud Reershemius. 2017. The German-speaking world: A practical introduction to sociolinguistic is- sues. Routledge.
- Sali A. Tagliamonte and R. Harald Baayen. 2012. Mod- els, forests, and trees of York English: Was/were variation as a case study for statistical practice. Lan- guage Variation and Change, 24(2):135-178.
- Hermann Tertilt. 1996. Turkish Power Boys. Ethnogra- phie einer Jugendbande. Suhrkamp, Frankfurt am Main.
- John R. te Velde. 2017. German V2 and the PF- interface: Evidence from dialects. Journal of Ger- manic Linguistics, 29(2):147-194.
- Heike Wiese. 2012. Kiezdeutsch: Ein neuer Dialekt entsteht. C.H. Beck.
- Heike Wiese. 2013. What can new urban dialects tell us about internal language dynamics? The power of language diversity. Linguistische Berichte, 19:208- 245.
- Heike Wiese. 2017. Urban contact dialects. In Sa- likoko S. Mufwene and Anna Maria Escobar, editors, Cambridge Handbook of Language Contact. Cam- bridge: Cambridge University Press.
- Heike Wiese, Ulrike Freywald, and Katharina Mayr. 2009. Kiezdeutsch as a test case for the interaction between grammar and information structure. Inter- disciplinary Studies on Information Structure. Work- ing Papers of the SFB 632, 12.
- Heike Wiese and Maria Pohle. 2016. "Ich geh Kino" oder "... ins Kino"? Zeitschrift für Sprachwis- senschaft, 35(2):171-216.
- Heike Wiese and Ines Rehbein. 2016. Coherence in new urban dialects: A case study. Lingua, 172:45- 61. Feridun Zaimoglu. 1995. Kanak Sprak. Rotbuch Ver- lag, Berlin.
FAQs
AI
What morphological differences were found between Kiezdeutsch and standard German?add
The analysis reveals that Kiezdeutsch frequently employs bare noun phrases and lacks copula verbs, which contrasts with standard German's syntactic structures.
How do logistic regression results inform understanding of Kiezdeutsch features?add
Logistic regression models showed positive predictive power for five POS types indicative of Kiezdeutsch, including pronouns and verbs, while standard German was associated with nouns and determiners.
What role does lexical variation play in Kiezdeutsch compared to standard German?add
Kiezdeutsch features verbs related to obligation and existence, while standard German includes more formal nouns and complex structures, exemplifying different conversational contexts inherent to each variety.
What are the implications of slang usage in Kiezdeutsch from a sociolinguistic perspective?add
The presence of slang, such as 'Alter' for informal address, highlights the self-referential nature of Kiezdeutsch, reflecting the identity and daily experiences of its teenage speakers.
When was the KiDKo corpus collected and what distinguishes its composition?add
The KiDKo corpus was collected from 2008 to 2015 and consists of recordings of spontaneous conversations among teenagers from multi-ethnic communities, differing significantly from the controlled settings of standard German corpora.
Related papers
Journal of Open Humanities Data, 2025
The Kölner Korpus des Kiezdeutschen/Cologne Corpus of Kiezdeutsch is a dataset documenting the urban youth language variety known as Kiezdeutsch as spoken in Cologne (North Rhine-Westphalia), Germany. It includes audio recordings and GAT 2-transcribed conversations among adolescent male speakers recorded in 2023. The data were collected in a vocational school and comprise approximately three hours of conversation across three speaker groups: monolingual, multilingual, and mixed. The corpus is pseudonymized and published under a CC BY 4.0 license. It is intended as a broadly reusable linguistic resource and provides empirical data for research in sociolinguistics, morphosyntax, grammatical variation, lexical innovation, discoursepragmatics and interactional linguistics. Its structure and basic annotation also make it suitable for applications in language contact research, corpus-based analysis and language pedagogy.
Journal of English as a Foreign Language Teaching and Research, 2023
Filipino Generation Z's creative ability to experiment with morphemes is a factor in the emergence of new words. These lexical items possess the capacity to function as nouns, verbs, and adjectives within the Filipino language. This study aims to examine how Filipino Generation Z slang terminologies are formed if such terms undergo a particular process that adheres to a specific morphological structure and if new sets of rules are observed that are not included in the current morphological rules. Since the researchers used words to analyze the gathered data, a qualitative research design utilizing content analysis was employed in this study. This study employed homogeneous sampling, with researchers extracting only Generation Z slang from the Facebook posts of PNUV first-to fourth-year students as the corpus. The framework matrix was used to determine the morphological processes underlying Generation Z slang. Consequently, it was determined that word formation processes such as coinage, borrowing, compounding, blending, clipping, acronyms, affixation, conversion, and multiple processes were utilized to create Generation Z slang. In addition, the data revealed additional word formation processes, such as contraction, reduplication, and spelling change. Most Filipino Generation Z slang word formation was categorized as a change in spelling. Future researchers can gain a deeper understanding of how the Filipino generation z's language is shaped and how it differs from previous generations by examining linguistic patterns and word forms.
Spanish in Context
Study of speech and written texts has provided significant insight regarding linguistic variation and its social correlates. Variation in the representation or display of language, however, remains a relatively understudied phenomenon. With this in mind, we present a quantitative and qualitative analysis of the variation observed in the Linguistic Landscape (LL) of Pilsen, Chicago. A community undergoing perceived processes of gentrification, Pilsen is an active site of economic, sociocultural change as well as newly intensified language contact. To investigate Pilsen’s displayed language variation, we implement a series of logistic regression models that analyze the distribution of both language and contextual framing observed on signs in four key areas in Pilsen. In doing so, we present an informed means with which to understand the sociolinguistic context of Pilsen as a community undergoing change and provide a replicable framework for future study of LLs that experience similar ...
ArXiv, 2016
We present a corpus-based analysis of the effects of age, gender and region of origin on the production of both "netspeak" or "chatspeak" features and regional speech features in Flemish Dutch posts that were collected from a Belgian online social network platform. The present study shows that combining quantitative and qualitative approaches is essential for understanding non-standard linguistic variation in a CMC corpus. It also presents a methodology that enables the systematic study of this variation by including all non-standard words in the corpus. The analyses resulted in a convincing illustration of the Adolescent Peak Principle. In addition, our approach revealed an intriguing correlation between the use of regional speech features and chatspeak features.
Glossa, 2020
Data gathered from social media have been used extensively to examine lexical dialect variation in widely used languages such as English and Spanish, but their use to date in mor-phosyntax and for lesser-used languages has been more limited. This paper tests the usefulness of using data derived from Twitter to address traditional questions in dialect syntax and sociolinguistics. It uses two cases studies from Welsh-the form of the second-person singular pronoun in various syntactic contexts, and the availability of auxiliary deletion-to assess whether datasets based on Twitter data can successfully replicate and enhance results derived by traditional means. The results of the case studies coincide to a large extent with distributions established in existing studies, even ones using entirely different methods, such as dialect questionnaires or acceptability judgment tests. Twitter data also show considerable success in establishing implicational hierarchies and conditioning factors comparable to those typical of the field. Where the results differ from existing studies, the differences may be due to the younger demographics of Twitter users, or to differences in the quantity of data provided by different methodologies. The results produce patterns closer to spoken data than to written data, giving us reasonable confidence in such data as a relatively good proxy for spoken usage of large numbers of language users.
PloS one, 2011
In this study we examine linguistic variation and its dependence on both social and geographic factors. We follow dialectometry in applying a quantitative methodology and focusing on dialect distances, and social dialectology in the choice of factors we examine in building a model to predict word pronunciation distances from the standard Dutch language to 424 Dutch dialects. We combine linear mixed-effects regression modeling with generalized additive modeling to predict the pronunciation distance of 559 words. Although geographical position is the dominant predictor, several other factors emerged as significant. The model predicts a greater distance from the standard for smaller communities, for communities with a higher average age, for nouns (as contrasted with verbs and adjectives), for more frequent words, and for words with relatively many vowels. The impact of the demographic variables, however, varied from word to word. For a majority of words, larger, richer and younger com...
Diego Frassinelli