Wikidata:Lexicographical data - Wikidata
Jump to content
From Wikidata
Translate this page
Other languages:
Bahasa Indonesia
Bahasa Melayu
British English
Cymraeg
Esperanto
Nederlands
Tiếng Việt
Türkçe
asturianu
brezhoneg
català
dansk
eesti
euskara
galego
italiano
latviešu
lietuvių
magyar
norsk bokmål
polski
português
português do Brasil
română
slovenčina
suomi
svenska
čeština
Ελληνικά
български
македонски
русский
српски / srpski
українська
հայերեն
עברית
العربية
بهاس ملايو
فارسی
مصرى
پنجابی
अंगिका
हिन्दी
বাংলা
ਪੰਜਾਬੀ
ગુજરાતી
தமிழ்
తెలుగు
മലയാളം
ไทย
中文
閩南語(傳統漢字)
한국어
Overview
Documentation
Development
Tools
Support for Wiktionary
How to help
Statistics
Lexemes
Discussion
Wikidata:Lexicographical data
a logo based on
ama
(L1)
in Wikimedia colors
Welcome to the project page for lexicographical data!
What is lexicographical data?
edit
Since the start of Wikidata in 2012, the multilingual knowledge base was mainly focused on
concepts
Q-items
are related to a thing or an idea, not to the
word
describing it. Since 2018, Wikidata has also stored a new type of data: words, phrases and sentences, in many languages, described in many languages. This information is stored in new types of entities, called Lexemes (L), Forms (F) and Senses (S). You can learn more about the data model on the
documentation page
The structured description of the words will be directly connected to the concepts. It will allow editors to describe precisely all words in all languages, and will be reusable, just like the whole content of Wikidata, by multiple
tools
and
queries
—everything that the community creates to play with words. Lexicographical data can be reused on the Wikimedia projects, and can provide
support for Wiktionary
Timeline
edit
2012: first discussions about including lexicographical data into Wikidata
2013–2016: many discussions with editors and developers, leading to several versions of the
development plan
2016: start of the development
2017: continuing the development of the structure (Wikibase/Lexeme), development of several tools for Wiktionary (
Sitelinks
May 23rd, 2018: deployment of the first version of lexicographical data
Done
October 2018: enabling lexicographical data in the Query Service and enabling Senses
Done
2018–2022: iteration of the project, maintenance
External users
edit
Unicode quotes Wikidata lexicographical data as a key data source for
Unicode Inflexion Library
(Q136796507)
in their blogpost presentation:
Introducing the Unicode Inflection Library Technical Preview Release
, or Wednesday, November 5, 2025.
Useful links
edit
Best practices
Documentation
Glossary
Data Model
How to help?
Create a new Lexeme
Lexeme Statistics
Lexicographical properties
Property proposals
Maintenance reports
Useful queries
Existing tools
Ideas on tools based on lexicographical data
Support for Wiktionary
Wikidata + Wiktionary - Frequently asked questions
Development plan
First lexeme created on Wikidata
Language-specific resources:
Documentation pages for single languages
subpage for English
A list of resources (primarily dictionaries, but other linguistic finds as well) that might be useful when looking for sense glosses that are in the public domain
Lexicographical properties
General
item for this sense
grammatical gender
conjugation class
word stem
derived from lexeme
mode of derivation
object form
object sense
Wikidata property example for lexemes
Wikidata property example for forms
officialized by
attested in
first attested from
stroke count
combines lexemes
auxiliary verb
homograph lexeme
Han character in this lexeme
valency
requires grammatical feature
usage example
subject lexeme form
subject sense
paradigm class
root
creates lexeme type
translation
synonym
antonym
troponym of
false friend
Wikidata property example for senses
classifier
location of sense usage
language style
collective noun for animals
variety of lexeme, form or sense
grammatical aspect
gloss quote
pertainym of
predicate for
said to be the same as lexeme
semantic derivation of
kigo of
Phonetics
pronunciation audio
IPA transcription
X-SAMPA code
pronunciation variety
Slavistic Phonetic Alphabet transcription
hyphenation
tone or pitch accent class
position of accent nucleus
position of devoiced vowel
position of nasal sonant
UPA transcription
pronunciation
IAST transliteration
Other properties useful in lexicography
image
described at URL
described by source
quotation
Bharati Braille
Values for property
instance of
or
has characteristic
of the lexeme
plurale tantum
collective noun
singulare tantum
inanimate
animate
reconstructed word
acronym
Values for property
instance of
or
has characteristic
of the form
obsolete form
depreciative form
rare form
potential form
non-depreciative form
vocalic form
non-vocalic form
colloquial form
strong form
weak form
incorrect form
former form
spelling recommended by Duden
alternative spelling
Values for property
language style
of the sense
outdatedness
colloquial language
archaism
rare
idiomatic
humorous
euphemism
vulgarism
pejorative
neologism
profanity
Sandboxes
sandbox
(L123)
sandbox 2
(L1234)
Sandbox-Lexeme
Sandbox-Form
Sandbox-Sense
Dictionaries and databases
list per language
Lexicographical external identifiers
French
TLFi ID
Littré ID
Dictionnaire de l'Académie française ID (9th edition)
Bob ID
Grand dictionnaire terminologique ID
Larousse Online French Dictionary ID
Dictionnaire de l'Académie française ID (1st edition)
Dictionnaire de l'Académie française ID (2nd edition)
Dictionnaire de l'Académie française ID (3rd edition)
Dictionnaire de l'Académie française ID (4th edition)
Dictionnaire de l'Académie française ID (5th edition)
Dictionnaire de l'Académie française ID (6th edition)
Dictionnaire de l'Académie française ID (7th edition)
Dictionnaire de l'Académie française ID (8th edition)
Oxford French-English Dictionary ID
FranceTerme identifier
Usito ID
German
Duden lexeme ID
Adelung lemma ID
DWB lemma ID
DWB2 lemma ID
GWB lemma ID
Meyers lemma ID
RDWB1 lemma ID
Wander lemma ID
elexiko ID
OWID Neologismenwörterbuch ID
OWID Deutsches Fremdwörterbuch ID
OWID Sprichwörterbuch ID
OWID Kommunikationsverben ID
Kleines Wörterbuch der Verlaufsformen im Deutschen ID
Oxford German-English Dictionary ID
Italian
Treccani Vocabulary ID
Encyclopedia of Italian ID
Il Nuovo De Mauro ID
Tesoro della Lingua Italiana delle Origini ID
Tommaseo-Bellini Online ID
Il Nuovo DOP ID
Oxford Italian-English Dictionary ID
Polish
SJP Online ID
SGJP Online ID
Doroszewski Online ID
Kopaliński Online ID
WSO Online ID
WSJP ID
Dobry słownik ID
Słownik języka polskiego XVII i XVIII wieku ID
SPXVI ID
Finnish
Kielitoimiston sanakirja ID
Suomen etymologinen sanakirja ID
Suomen murteiden sanakirja ID
Vanhan kirjasuomen sanakirja ID
Suomi–ruotsi-suursanakirja ID
Oxford University Press
Oxford English Dictionary entry ID (pre-July 2023)
Oxford English Dictionary object ID (post-July 2023)
The Oxford Dictionary of Phrase and Fable ID
Australian Oxford Dictionary ID
The New Zealand Oxford Dictionary ID
Canadian Oxford Dictionary ID
New Oxford Rhyming Dictionary ID
A Dictionary of Biology ID
A Dictionary of Plant Sciences ID
A Dictionary of Zoology ID
A Dictionary of Sociology entry ID
A Dictionary of Public Health entry ID
A Dictionary of Genetics entry ID
A Dictionary of Sports Studies entry ID
Dictionary of American Family Names ID
A Dictionary of Genetics entry ID
The Concise Oxford Dictionary of Art Terms entry ID
The Oxford Essential Dictionary of the U.S. Military entry ID
A Dictionary of Travel and Tourism entry ID
A Dictionary of Dentistry entry ID
The Oxford Dictionary of Architecture entry ID
A Dictionary of Food and Nutrition entry ID
The Oxford Dictionary of Dance entry ID
other
IHO Hydrographic Dictionary (S-32) Number
Techopedia ID
ODLIS ID
APA Dictionary of Psychology entry
Biology Online Biology Dictionary entry
IGI Global Dictionary ID
Investopedia term ID
Glossary of Astronomical Terms ID
Mindat.org Glossary of Mineralogical Terms ID
Merriam-Webster online dictionary entry
Dictionary.com entry
Collins Online English Dictionary entry
The Britannica Dictionary entry
Cambridge Dictionary entry (British English)
Cambridge Dictionary entry (American English)
NCI Drug Dictionary ID
Dictionary of South African English entry ID
Jewish English Lexicon ID
Law Insider Legal Dictionary entry
NCI Dictionary of Cancer Terms entry
NCI Dictionary of Genetics Terms entry
The Law Dictionary entry
Vocabulary.com word ID
Antique Chinese and Japanese Porcelain Dictionary and Glossary of Terms entry
Japanese
Digital Daijisen ID
JMdict sequence number
wadoku ID
Russian
Little Academic Dictionary ID
18th Century Russian Dictionary ID
Ushakov Dictionary ID
Swedish
Svenska Akademiens Ordbok entry ID
Svensk ordbok ID
Svenska Akademiens ordlista ID
Ordbok över Finlands svenska folkmål ID
Danish
DanNet 2.2 word ID
Den Danske Ordbog article ID
Den Danske Ordbog idiom ID
Ordbog over det danske sprog ID
DAKA Danish-Greenlandic Dictionary ID
Korean
Basic Korean Dictionary ID
Standard Korean Language Dictionary ID
Open Korean Knowledge Dictionary sense ID
Cantonese
CantoDict word ID
CantoDict character ID
Mandarin-Cantonese Comparative Study ID
Mongolian
toli.query.mn lexeme ID
toli.gov.mn lexeme ID
mongoltoli.mn lexeme ID
Hebrew
Ma'agarim ID
milog.co.il entry ID
milononline.net entry ID
PBY Ben-Yehuda dictionary identifier
Malay
Kamus Dewan Edisi Keempat ID
Kamus Pelajar ID
Kamus Dewan Edisi Tiga ID
Arabic
Arabic Ontology lemma ID
Quranic Arabic Corpus root ID
Arabic Ontology lexical concept ID
ARABTERM entry ID
Hawramani Arabic Lexicon entry ID
Southern Min
Taiwanese-Japanese Dictionary ID
Dictionary of Frequently-Used Taiwanese Taigi ID (deprecated)
Sutian entry ID
New Persian
Dehkhoda ID
Spesalay Pashto (Dari/Persian Dictionary) ID
Farhang-i forsī ba rusī ID
multiple languages
Álgu lexeme ID
(Northern Sami, Finnish, Inari Sami, and so on)
Oqaasileriffik online dictionary ID
(Greenlandic, Danish, English)
Sõnaveeb entry ID
(Estonian, English, Russian, and so on)
Strong's number
(Hebrew, Ancient Greek)
Intercontinental Dictionary Series unit ID
Online Aboriginal Language Dictionary ID
Tatoeba sentence ID
Hebrew Academy term ID
(Hebrew, English)
‎qamus.inoor.ir entry ID
(Arabic, New Persian)
Presisov večjezični slovar ID
(Slovene, English, German, French, Albanian, Serbo-Croatian)
other
Uralonet ID
Ġabra lexeme ID
Oudnederlands Woordenboek GTB ID
Vroegmiddelnederlands Woordenboek GTB ID
Middelnederlandsch Woordenboek GTB ID
Bantu Lexical Reconstructions ID
Elhuyar Dictionary ID
ePSD ID
Ahotsak lexeme
Sri Granth word ID
Diccionario de la lengua española word (non-ID)
Punjabipedia ID
PIV Online ID
Reta Vortaro ID
synonymer.se ID
sense on DHLE
Lur Encyclopedic Dictionary ID
VerbaAlpina ID
Revised Mandarin Chinese Dictionary ID
Vocabulário Ortográfico Comum da Língua Portuguesa lemma ID
JLect entry ID
Urdu Lughat ID
Middle English Dictionary entry ID
STEDT ID
Infopédia entry
Bosworth-Toller's Anglo-Saxon Dictionary Online ID
Sindhi English Dictionary ID
Jeju Dialect Dictionary ID
Michaelis ID (Brazilian Portuguese)
LSJ Wiki ID
woordenlijst.org ID
Woordenboek der Nederlandsche Taal GTB ID
OSL ID
Explanatory Ukrainian Dictionary ID
Slovenian Etymological Dictionary ID
Meurgorf identifier
Oxford Irish-English Dictionary ID
‎Spanish-German Dictionary ID
Wikidata:Lexicographical data
Examples & resources
Property proposals
Create a new Lexeme
Wikidata Lexeme Forms
Property used with senses
Template:Language properties
and (
with their qualifiers
Retrieved from "
Category
Wikidata:Lexicographical data
Wikidata
Lexicographical data
Add topic