UD_Low_Saxon-LSDC
This page pertains to UD version 2.
UD Low Saxon LSDC
Language: Low Saxon (code: nds)
Family: IE
This treebank has been part of Universal Dependencies since the UD v2.8 release.
The following people have contributed to making this treebank part of UD: Janine Siewert.
Repository: UD_Low_Saxon-LSDC
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.17
License: CC BY-SA 4.0
Genre: fiction, nonfiction
Questions, comments?
General annotation questions (either Low Saxon-specific or cross-linguistic) can be raised in the main UD issue tracker.
You can report bugs in this treebank in the treebank-specific issue tracker on GitHub.
If you want to collaborate, please contact [janine • siewert (æt) helsinki • fi].
Development of the treebank happens outside the UD repository.
If there are bugs, either the original data source or the conversion procedure must be fixed.
Do not submit pull requests against the UD repository.
Annotation: Source
Lemmas: annotated manually
UPOS: annotated manually, natively in UD style
XPOS: annotated manually
Features: annotated manually, natively in UD style
Relations: annotated manually, natively in UD style
Description
The UD Low Saxon LSDC dataset consists of sentences in 8 major Low Saxon dialect groups from both Germany and the Netherlands. These sentences are (or are to become) part of the LSDC dataset and represent the language from mostly the 19th and early 20th century in genres such as short stories, novels, speeches, letters and fairytales.
The first version of the UD Low Saxon LSDC dataset contained 18 Low Saxon (sub-)dialects from both Germany and the Netherlands, represented by 2 sentences each and belonging to the domains of short stories, novels, speeches, letters and fairytales. Each sentence was chosen from a different text to present some of the variation within the different dialect groups. In the second version, 40 sentences from four Westphalian dialects, two from Germany and two from the Netherlands, were added. The coverage of other dialect groups will be improved in future releases. For the third version, we raised the number of sentences to 190 and made slight modifications to the subgrouping of the dialects. The current fourth version of the dataset contains 1,000 sentences, of which 500 are placed in the train set and 500 in the test set. The dialect distribution between the train and the test set is not yet balanced, and we plan to improve this in future releases.
The major dialect group is shown as the third segment of the sentence ID. The following dialects are included:
BRA = Brandenburgish
DNS = German North Saxon
DWF = German Westphalian
MVP = Mecklenburgish – West Pomeranian
NNS = Dutch North Saxon
NPR = Low Prussian
NWF = Dutch Westphalian
OFL = Eastphalian
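The dialect code can be read off a sentence ID programmatically. The following is a minimal sketch, assuming the ID segments are separated by hyphens or underscores. The separator, the function name and the sample ID are hypothetical and not taken from the treebank; only the code table comes from the list above.

```python
import re

# Dialect-group codes as listed above.
DIALECT_GROUPS = {
    "BRA": "Brandenburgish",
    "DNS": "German North Saxon",
    "DWF": "German Westphalian",
    "MVP": "Mecklenburgish – West Pomeranian",
    "NNS": "Dutch North Saxon",
    "NPR": "Low Prussian",
    "NWF": "Dutch Westphalian",
    "OFL": "Eastphalian",
}


def dialect_group(sent_id: str) -> str:
    """Return the dialect-group name encoded in the third segment of a sentence ID."""
    # Assumption: segments are separated by "-" or "_"; adjust to the real ID format.
    segments = re.split(r"[-_]", sent_id)
    if len(segments) < 3:
        raise ValueError(f"sentence ID has fewer than three segments: {sent_id!r}")
    code = segments[2].upper()
    if code not in DIALECT_GROUPS:
        raise ValueError(f"unknown dialect code {code!r} in {sent_id!r}")
    return DIALECT_GROUPS[code]


# Hypothetical sentence ID, for illustration only.
print(dialect_group("nds_1885_DWF_0001"))
```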
Since there is no official interregional spelling, the interregional spelling suggestion used by e.g. the Dutch Low Saxon Wikipedia (Nysassiske Skryvwyse, described in more detail here: https://skryvwyse.eu/ (only in Low Saxon)) is used as a compromise for normalisation. The original spelling of the source is included in the line “text_orig =”, and a Middle Low Saxon lemma is added in the tenth column (“lemma_gml=xxx”) in order to make the Modern Low Saxon data more easily comparable with the Middle Low Saxon data in the reference corpus “Referenzkorpus Mittelniederdeutsch/Niederrheinisch”. For this reason, the Middle Low Saxon lemma forms largely follow the “Mittelniederdeutsches Handwörterbuch” by Agathe Lasch et al., as in the reference corpus. Middle Low Saxon lemmata are only added in the cases where there is an attestation in Middle Low Saxon, i.e. the word is either listed in the Handwörterbuch or is found in the reference corpus. Middle Low Saxon lemmata are still included if the word’s meaning has changed, and in addition, we have done our best to create new complex word lemmata from known simplex words and reconstruct potential Middle Low Saxon forms for words which have not yet been attested at that stage of the language. The last few hundred sentences in the train set either do not contain Middle Low Saxon lemmata yet or have had them added automatically.
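Both additions can be read with a few lines of plain Python. The sketch below assumes the standard CoNLL-U layout (comment lines starting with "#", ten tab-separated columns, the tenth column holding "|"-separated key=value pairs). The function name and the toy sentence are invented for illustration, and the lemma_gml values are placeholders rather than real Middle Low Saxon lemmata.

```python
def read_sentence(block: str):
    """Parse one CoNLL-U sentence block into (comments, tokens)."""
    comments = {}
    tokens = []
    for line in block.strip().splitlines():
        if line.startswith("#"):
            # Comment lines look like "# key = value".
            key, sep, value = line[1:].partition("=")
            if sep:
                comments[key.strip()] = value.strip()
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            continue  # skip anything that is not a ten-column token line
        misc = {}
        if cols[9] != "_":
            for item in cols[9].split("|"):
                k, _, v = item.partition("=")
                misc[k] = v
        tokens.append({
            "id": cols[0],
            "form": cols[1],
            "lemma": cols[2],
            "lemma_gml": misc.get("lemma_gml"),  # None if not annotated
        })
    return comments, tokens


# Toy sentence, NOT from the treebank; lemma_gml values are placeholders.
SAMPLE = "\n".join([
    "# sent_id = toy-1",
    "# text = dat huus",
    "# text_orig = dat Hus",
    "\t".join(["1", "dat", "dat", "DET", "_", "_", "2", "det", "_", "lemma_gml=xxx"]),
    "\t".join(["2", "huus", "huus", "NOUN", "_", "_", "0", "root", "_",
               "lemma_gml=yyy|SpaceAfter=No"]),
])

comments, tokens = read_sentence(SAMPLE)
print(comments["text_orig"])
print([(t["form"], t["lemma_gml"]) for t in tokens])
```

Tokens without a Middle Low Saxon lemma simply yield None for "lemma_gml", which matches the note that some train-set sentences do not carry these lemmata yet.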
The first version of the dataset contained only sentences from copyright-free material from the 19th and early 20th century. Some of the sentences are already included in the first release of the LSDC dataset found here: https://github.com/Helsinki-NLP/LSDC/ See there for further information on the origin of the data. The other sentences originate mostly from Joh. A. Leopold’s work ‘Van de Schelde tot de Weichsel’, a digitised version of which is accessible here: https://www.dbnl.org/titels/titel.php?id=leop008sche00 An exception is the text ‘Krisjaon Klaover’, which can be found in the Twentse Taalbank: http://www.twentsetaalbank.nl/docs/TWA.1894-Heinink-Krisjaon_Klaover-150.pdf These other sentences will be added to the next release of the LSDC dataset. Starting from the third version, the dataset also contains a few sentences from works by modern authors who have given us permission to include small parts of their work in annotated corpora.
Acknowledgments
The following people were involved in the creation of this dataset:
Janine Siewert (data collection, selection and annotation)
Jack Michael Rueter (annotation-related advice)
References
If you use this treebank, please cite this paper:
@inproceedings{siewert-rueter-2024,
  author = {Siewert, Janine and Rueter, Jack},
  title = {{The Low Saxon LSDC Dataset at Universal Dependencies}},
  booktitle = {Proceedings of the 2024 Joint International Conference on Computational
               Linguistics, Language Resources and Evaluation},
  series = {LREC-COLING 2024},
  year = {2024},
  month = {05},
  address = {Torino, Italia},
  organization = {ELRA Language Resources Association (ELRA) and the
                  International Committee on Computational Linguistics
                  (ICCL)},
  pubstate = {forthcoming},
  note = {accepted}
}
References used for the creation of this dataset:
Lasch, Agathe et al. 1928 ff. Mittelniederdeutsches Handwörterbuch. Neumünster: Wachholtz.
ReN-Team. 2019. Referenzkorpus Mittelniederdeutsch/Niederrheinisch (1200-1650). Archived in Hamburger Zentrum für Sprachkorpora. Version 1.0. Publication date 2019-08-14. http://hdl.handle.net/11022/0000-0007-D829-8.
Statistics of UD Low Saxon LSDC
POS Tags
ADJ
ADP
ADV
AUX
CCONJ
DET
INTJ
NOUN
NUM
PART
PRON
PROPN
PUNCT
SCONJ
VERB
X
Features
AdpType
Aspect
Case
Definite
Degree
Foreign
Gender
Gender[psor]
Mood
Number
Number[psor]
NumType
PartType
Person
Person[psor]
Polite
Poss
PronType
Reflex
Tense
VerbForm
VerbType
Relations
acl
acl:relcl
advcl
advmod
amod
appos
aux
aux:pass
case
cc
ccomp
compound
compound:prt
compound:redup
conj
cop
csubj
csubj:outer
dep
det
det:poss
discourse
dislocated
expl
expl:pv
fixed
flat
iobj
list
mark
nmod
nmod:poss
nsubj
nsubj:pass
nummod
obj
obl
obl:agent
obl:arg
orphan
parataxis
punct
reparandum
root
vocative
xcomp
Tokenization and Word Segmentation
This corpus contains 1000 sentences, 22615 tokens and 22639 syntactic words.
This corpus contains 3382 tokens (15%) that are not followed by a space.
This corpus does not contain words with spaces.
This corpus contains 35 types of words that contain both letters and punctuation. Examples: 'n, 'm, 't, 'r, aller-, knee'e, pöäkel-up-stelten, 'e, 'ne, A., Bähnhas', CDU-fraktioon, E., G., Grote-Oog, Hiem-Hannes, Kasper-oum, Krus’, Luoden-heide, R., Röm., S., St., West-Indys, ao., botter-, c.q., dree'en, e-mailvorkeyr, gir-af-geskigde, kümmel-akwavit, midwochs-, twey-en-twintig, v., vyv-
This corpus contains 15 multi-word tokens. On average, one multi-word token consists of 2.60 syntactic words.
There are 10 types of multi-word tokens. Examples: to'm, im, to'n, Kumste, am, bym, in'en, in't, ten, van'er.
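The token, word and multi-word-token figures above are mutually consistent, as a quick check shows. This is only a worked arithmetic sketch using the numbers reported in this section; the variable names are illustrative.

```python
# Figures reported above.
tokens = 22615              # surface tokens
words = 22639               # syntactic words
mwt_count = 15              # multi-word tokens
avg_words_per_mwt = 2.60    # average syntactic words per multi-word token
no_space_tokens = 3382      # tokens not followed by a space

# Each multi-word token counts once as a token but contributes several words.
words_inside_mwts = round(mwt_count * avg_words_per_mwt)   # 39
extra_words = words_inside_mwts - mwt_count                # 24

# Words should equal tokens plus the extra words introduced by splitting.
assert tokens + extra_words == words

# Share of tokens not followed by a space, reported as 15%.
share = no_space_tokens / tokens
print(f"{share:.1%}")
```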
Morphology
Tags
This corpus uses 16 UPOS tags out of 17 possible:
ADJ
ADP
ADV
AUX
CCONJ
DET
INTJ
NOUN
NUM
PART
PRON
PROPN
PUNCT
SCONJ
VERB
X
This corpus does not use the following tags: SYM
This corpus contains 7 word types tagged as particles (PART): en, ne, neet, nich, nit, te, to
This corpus contains 44 lemmas tagged as pronouns (PRON): al, allebeide, allens, alles, ander, dat, dee, disse, dit, du, dyn, dår, elk, elkeyn, et, eyn, eynander, eyner, eynig, geyn, hee, ichts, ik, jenne, jy, keyn, keynminske, man, mekare, my, myn, männig, niks, nüms, see, sik, uns, wat, wee, wek, wy, yder, ydereyn, yts
This corpus contains 25 lemmas tagged as determiners (DET): al, allerley, de, dee, desülve, disse, dyn, ear, elk, en, ergenden, et, eyn, eynig, geyn, juw, keyn, myn, männig, neyn, syn, sük, uns, yder, −
Out of the above, 14 lemmas occurred sometimes as PRON and sometimes as DET: al, dee, disse, dyn, elk, et, eyn, eynig, geyn, keyn, myn, männig, uns, yder
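The overlap reported above is a plain set intersection of the two lemma lists. The sketch below recomputes it from the lists as printed in this section; the variable names are illustrative.

```python
# Lemma lists copied from the PRON and DET lines above.
PRON = (
    "al, allebeide, allens, alles, ander, dat, dee, disse, dit, du, dyn, dår, "
    "elk, elkeyn, et, eyn, eynander, eyner, eynig, geyn, hee, ichts, ik, jenne, "
    "jy, keyn, keynminske, man, mekare, my, myn, männig, niks, nüms, see, sik, "
    "uns, wat, wee, wek, wy, yder, ydereyn, yts"
).split(", ")

DET = (
    "al, allerley, de, dee, desülve, disse, dyn, ear, elk, en, ergenden, et, "
    "eyn, eynig, geyn, juw, keyn, myn, männig, neyn, syn, sük, uns, yder, −"
).split(", ")

# Lemmas that occur with both tags.
overlap = set(PRON) & set(DET)

print(len(PRON), len(DET), len(overlap))
print(sorted(overlap))
```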
This corpus contains 10 lemmas tagged as auxiliaries (AUX): doon, dörven, hebben, künnen, möten, möägen, sköälen, weasen, werden, willen
Out of the above, 10 lemmas occurred sometimes as AUX and sometimes as VERB: doon, dörven, hebben, künnen, möten, möägen, sköälen, weasen, werden, willen
There are 2 (de)verbal forms:
Inf
AUX: syn, weasen, werden, hebben, hevven, können, künnen, mütten, süllen, warden
VERB: seggen, maken, seen, gån, hebben, doon, holden, eaten, höyren, stån
Part
ADJ: vorkeyrd, vorknüpped, Doudeslån, Ofgeloupen, anedån, anvroaren, anweasen, bedrämmelden, bekend, dalevyrd
AUX: west, worden, ewesd, must, weasd, weasen, ewest, kund, möcht, müst
VERB: gån, dån, koamen, maked, höyrd, worden, giaven, had, seen, segd
Nominal Features
Gender
Fem
ADJ: olde, ganse, groute, ander, gode, grout, houge, lest, olden, andere
ADJ-Part: terreatene, vordörde
DET: de, der, en, ne, dee, syne, eyne, myn, syn, düsse
NOUN: vrouwe, tyd, stad, döäre, syde, hand, werld, nacht, aerde, dochter
NUM: eyne
PRON: see, dee, ear, höär, haar, andere, diaser, mynde, änder
PROPN: Hente, CDU, Havel, Luoden-heide, Marigge, Nicolaikarke, Slaumayerske, St., Trina
Fem,Masc
ADJ: olde, Golden, heyle, lest, olden, smalle, vorweerden, weinig
ADJ-Part: vorweerden
DET: de, dee, en, Eynes, des, dissen, gyn, ne
NOUN: tyd, heyrskop, nachts, tyden, Ploege, gek, gråten, kansen, last, leavtyden
PRON: dee, eyne, Geyn, ander, andere, eyn, wek
PROPN: Strüwingken
Fem,Masc,Neut
DET: Myne, syn
NOUN: andächtige, gardynen
Fem,Neut
ADJ: lakensk
DET: en
NOUN: jakke
Masc
ADJ: goden, grouten, olde, olden, anderen, gode, groute, andere, beiden, eyrste
ADJ-Part: Ofgeloupen, bedrämmelden, gepokkeneerden, köften, uutsochten, vorgangen, vorkneapen, vorvealende
DET: de, den, en, dem, dee, synen, myn, eynen, syne, syn
NOUN: dag, man, god, her, buur, åvend, doud, junge, möller, kearl
NUM: eynen, veer
PRON: hee, dee, em, den, man, hum, en, üm, iame, eyner
PROPN: Hiärmen, Andrees, Bennad, Claus, Friedrich, Gravenes, Harms, Hein, Henrick, Krisjaon
Masc,Neut
ADJ: krütslike, olde, vorstandig
DET: keyn, de, Alle, al, en, neyn, synen
NOUN: menske, minske, minsken, bast, hokuspokus, lyv, mensken, minsker, noorden, vlas
PRON: geyn
Neut
ADJ: eyrste, old, ander, andere, anders, gode, grout, heyle, leste, leve
ADJ-Part: gegeaven, gemästed, terspleaten, uutgereaten, vordraides, vorvreaten, vöärgånde
DET: dat, en, et, de, syn, myn, det, dem, 'n, den
NOUN: lüde, huus, kinder, mål, geld, ougen, woord, ende, jår, leaven
NUM: eyn, eynen, twey
PRON: et, dat, wat, niks, det, alles, allens, dee, dit, nist
PROPN: Eykertyn, Grote-Oog
Number
Plur
ADJ: olden, beiden, beide, gode, olde, andere, lange, eyrsten, goden, grouten
ADJ-Part: vordörde, vöärgånde
AUX: hadden, weren, sint, sünt, willet, hebbet, köänet, hebben, wassen, kunnen
DET: de, dee, syne, den, alle, eare, keyne, myn, syn, uw
NOUN: lüde, kinder, ougen, dage, buren, dagen, jåren, minsken, tyden, dinge
NUM: dree'en, veer
PRON: see, wy, dee, jy, sik, juw, uns, uus, alle, y
PROPN: Berbiessies, Drüüksken, Dörchläuchten, Slaumayers
VERB: saeten, sean, hebbet, gåt, hebben, hevvet, kaemen, ståt, doot, gungen
Plur,Sing
AUX: kün, bint, künnet, heb, sünt, kan, mag, mö, müttet, sin
NOUN: mark
PRON: y, See, u, jy, uw, hee, höär, sik, wat, ü
VERB: hebbet, doot, gå, gåt, seggen, weat, wordet, Hold, Koup, Låt
Sing
ADJ: olde, goden, grouten, groute, eyrste, ganse, gode, old, heyle, olden
ADJ-Part: Ofgeloupen, bedrämmelden, gegeaven, gemästed, gepokkeneerden, köften, terreatene, terspleaten, uutgereaten, uutsochten
AUX: is, was, het, hadde, weer, kan, wil, kun, wul, sal
DET: de, en, den, dat, et, dem, myn, syn, dee, der
NOUN: dag, vrouwe, man, tyd, god, her, buur, åvend, doud, huus
NUM: eynen, eyn, eyne
PRON: ik, hee, et, dat, my, dee, wat, sik, see, em
PROPN: Pölz, Hiärmen, Anna, Koch, Andries, Gassen, Jesus, Willem, Hein, Annegyn
VERB: hadde, sea, kam, het, sead, segt, gung, geit, kaem, segge
Case
Acc
ADJ: goden, groute, gode, grouten, olde, anderen, ganse, grout, heyle, lange
ADJ-Part: Ofgeloupen, bedrämmelden, gemästed, gepokkeneerden, terspleaten, uutgereaten, uutsochten, vordörde, vorgangen, vorkneapen
DET: de, en, den, dat, et, syn, syne, dee, myn, ne
NOUN: dag, åvend, tyd, werld, woord, geld, leaven, ouge, ougen, dage
NUM: veer, dree, eynen, twaalv, twey
PRON: dat, wat, et, niks, see, dee, den, en, andere, ne
PROPN: Eykertyn, Garrelt, dalailama
Acc,Dat
ADJ: andere, grouten, olden, eyrste, ander, houge, gode, golden, lest, leste
ADJ-Part: gegeaven, vorweerden, vöärgånde
DET: de, den, en, et, dat, myn, syn, syne, synen, dee
NOUN: huus, man, stad, tyd, God, dag, döäre, syde, aerde, hüüs
NUM: eyn, hunderd, dree, dree'en, eynen
PRON: my, sik, em, dy, u, juw, uns, uus, hum, üm
PROPN: CDU, Grote-Oog, Havel, Luoden-heide, St., Trina, Zütphen
Dat
ADJ: besten, goden, eyrsten, olden, anderen, grouten, grönen, leven, minste, 31.
DET: dem, der, den, 'n, m, eynem, synem, 'm, dear, mynem
NOUN: tyd, doude, möller, dag, ende, god, heyren, houpe, lande, ougen
NUM: eyne, eynen, veer
PRON: em, iame, dem, ear, öäme, iam, mik, my, nen, öäm
PROPN: Marigge, Nicolaikarke
Gen
ADJ: anders, goder, nys, wits, öldesten
DET: des, eynes, en, synes, 't, deas, der
NOUN: åvends, nachts, dages, anderdages, broders, hoapeninge, junges, maargens, mensken, moders
PRON: anders
PROPN: Esaias, Mariekens, Reinekens, Winkels
Nom
ADJ: olde, leve, old, beiden, gode, groute, olden, beide, ganse, grouten
ADJ-Part: köften, terreatene, vordraides, vorvealende
DET: de, en, dat, myn, dee, et, ne, syn, keyn, syne
NOUN: lüde, vrouwe, her, man, God, buur, doud, junge, kearl, vader
NUM: tein, eyn
PRON: ik, hee, see, dee, et, dat, wy, y, du, wat
PROPN: Hiärmen, Andrees, Gravenes, Heem, Hein, Henrick, Jouke, Krisjaon, Lulef, Slaumayerske
Definite
Def
DET: de, den, dat, et, dem, dee, der, det, en, des
Ind
DET: en, ne, eyne, eynen, eyn, nen, den, e, eynem, 'm
Degree and Polarity
Degree
Cmp
ADJ: meyr, beater, wyder, naeger, Later, lever, länger, minder, slechter, vröer
ADV: meyr, eyrder
Pos
ADJ: good, gans, olde, recht, goden, richtig, grouten, doud, vul, bange
ADJ-Part: vorkeyrd, vorknüpped, Doudeslån, anedån, anvroaren, anweasen, bedrämmelden, bekend, egolded, ervroid
ADV: völle, lange, heyldal, seyre, stille, tovrea, veal, vial
Sup
ADJ: best, lest, besten, eyrste, leste, allerbelangrykste, beste, grötsten, leevst, letste
Verbal Features
Aspect
Perf
AUX-Part: weasd
VERB-Part: ebracht, gån, maked, Beskreaven, afemaked, ankündigd, antwoorded, anvungen, beknütted, bewysd
Mood
Imp
AUX: syt, Sy
VERB: see, sü, låt, giv, gåt, höyr, Haal, Maak, Seg, gå
Ind
AUX: is, was, het, hadde, weer, kan, wil, sint, kun, hadden
VERB: hadde, sea, kam, het, sead, segt, gung, geit, segge, kaem
Ind,Sub
AUX: hadde, sul, was, weer, weren, wul, Solst, had, hadden, hädden
VERB: wol, Seggen, angung, dea, had, kaemen, kreyg, plükkede, reisen, sat
Sub
AUX: weer, möchte, sül, wöre, hädde, wolde, würd, dead, drövden, können
VERB: hädde, höyren, kaem, kaeme, Bestünde, Låt, Låten, Skämen, deade, geave
Tense
Past
ADJ-Part: vorkeyrd, vorknüpped, Doudeslån, Ofgeloupen, anedån, anvroaren, anweasen, bedrämmelden, bekend, dalevyrd
AUX: was, hadde, weer, west, kun, hadden, weren, wul, had, kon
AUX-Inf: west
AUX-Part: west, worden, ewesd, must, weasen, ewest, kund, möcht, müst, wesd
VERB: hadde, sea, kam, sead, gung, kaem, stund, had, kwam, sagde
VERB-Inf: wedded
VERB-Part: dån, koamen, gån, höyrd, maked, worden, giaven, segd, bleaven, geaven
Pres
ADJ-Part: helderseend, hülpbehövend, smunselend, spottend, vansülvspreakend, vorvealende, vöärgånde
AUX: is, het, kan, wil, sint, sal, sünt, hev, heb, bin
VERB: het, segt, geit, segge, weyt, hebbet, hevt, höyrt, heyt, ligt
VERB-Part: wüst
Pronouns, Determiners, Quantifiers
PronType
Art
DET: de, en, den, dat, et, dem, dee, der, ne, det
PRON: et, dat
Dem
DET: dee, düsse, dissen, düssen, dease, disse, düäse, düäsen, dat, dit
PRON: dat, dee, det, den, dit, dem, disse, Düsse, dean, deane
Dem,Prs
PRON: dat
Ind
DET: eyn, eyne, eynem, eynes, eynige, mannig, männig, ergendeyn, eynen, sük
PRON: man, wat, anderen, eyne, eyner, eyn, andere, men, ander, eynen
Ind,Int
PRON: wat
Ind,Neg,Tot
DET: gin
Int
PRON: wat, wee, wel, wer, Hwekke, Wekke, hwat, hwekker, wen
Int,Rel
PRON: wat
Neg
DET: gyn, keyn, keyne, keynen, kyn, nin, geyn, gin, kyne, kynen
PRON: niks, nist, keynen, Geyn, geynt, keymes, keyn, keyner, nüms
Prs
DET: myn, syn, syne, synen, uw, ear, dyn, myne, eare, dyne
PRON: ik, hee, see, et, my, sik, wy, dat, em, y
Rcp
PRON: enander, eynander, mekare, sik
Rel
PRON: dee, den, wat, dat, dem, hwekken, wek, welke
Tot
DET: alle, al, olle, alles, elken, ydem, yder, ydere, allen, elke
PRON: alle, alles, allens, al, allers, allet, olles, ydereyn, Elkeyn, allen
NumType
Card
NUM: twey, dree, eyn, veer, 14, acht, dusend, sös, tweyhunderd, veertein
Ord
ADJ: eyrste, eyrsten, tweyde, siavende, twölvden, vövd
Poss
Yes
DET: myn, syn, syne, synen, ear, uw, dyn, myne, eare, dyne
PRON: myn, dyn, höär, mynde
Reflex
Yes
PRON: sik, sek, sich, süch, sük, sy
Person
1
AUX: wil, hev, kan, bin, heb, was, had, hebbe, hevve, sal
PRON: ik, my, wy, ek, uns, uus, mik, myn, ikke, we
VERB: segge, hev, weyt, dacht, do, gelöyve, gå, hadde, heb, sat
1,3
PRON: See
2
AUX: büst, hest, kün, heb, bint, bis, hes, künnet, kanst, künnen
PRON: y, du, jy, dy, u, juw, uw, dik, jit, See
VERB: hest, hevvet, weytst, Sü, do, doot, geavet, gå, gåt, hebben
2,3
AUX: sünt
PRON: See, sik
VERB: seggen, Låten, Skämen, Weaten, gelöyvet, gån, hebbet, höyren, kaemen, koamen
3
AUX: is, was, het, hadde, weer, kan, hadden, kun, sint, weren
PRON: hee, see, et, sik, dat, em, det, hum, ear, dee
VERB: hadde, sea, het, kam, sead, segt, gung, geit, kaem, stund
Polite
Form
AUX: sünt, künnen, syt, willet
PRON: See, Jy, sik, ju
VERB: seggen, Låten, Skämen, Weaten, geavet, gelöyvet, gån, gåt, hebbet, höyren
Gender[psor]
Fem
DET: Syn, höären
Masc
DET: syn, syne, ear, synen
Number[psor]
Plur
DET: unsen, eare, Unse, ear
Sing
DET: syn, syne, myn, myne, mynen, dyn, höären, synen, uw
Other Features
AdpType
Post
ADP: to, an
Prep
ADP: in, van, up, mid, an, vöär, to, by, nå, med
Foreign
Yes
X: decipi, mundus, vult, Amicorum, Batavorum, De, Feststellung, Iovivat, Personalien, Pompadour
PartType
Inf
PART: to, te
Neg
PART: nich, neet, nit, ne, en
Person[psor]
1
DET: myn, unsen, myne, mynen, Unse
2
DET: dyn, juwen, ouw, uw
3
DET: syn, syne, eare, ear, höären, synen
VerbType
Aux
AUX: het, is, wil, hadde, kan, skölde, willet, hadden, hebben, hevt
AUX-Inf: hebben
Cop
AUX: is, was, weer, sint, weren
Mod
AUX: kun, künne, möcht, sul, wul
Syntax
Auxiliary Verbs and Copula
This corpus uses 1 lemma as a copula (cop). Examples: weasen.
This corpus uses 10 lemmas as auxiliaries (aux). Examples: hebben, weasen, künnen, willen, sköälen, möten, werden, möägen, doon, dörven.
This corpus uses 2 lemmas as passive auxiliaries (aux:pass). Examples: werden, weasen.
Core Arguments, Oblique Arguments and Adjuncts
Here we consider only relations between verbs (parent) and nouns or pronouns (child).
nsubj
VERB--NOUN (3)
VERB--NOUN-Acc (4)
VERB--NOUN-Dat (2)
VERB--NOUN-Nom (308)
VERB--PRON (5)
VERB--PRON-Acc (4)
VERB--PRON-Acc,Dat (6)
VERB--PRON-Nom (862)
VERB-Inf--NOUN (1)
VERB-Inf--NOUN-Acc (2)
VERB-Inf--NOUN-Dat (1)
VERB-Inf--NOUN-Nom (46)
VERB-Inf--PRON (1)
VERB-Inf--PRON-Acc,Dat (4)
VERB-Inf--PRON-Nom (259)
VERB-Part--NOUN-Acc (2)
VERB-Part--NOUN-Nom (69)
VERB-Part--PRON-Acc,Dat (3)
VERB-Part--PRON-Nom (182)
obj
VERB--NOUN (3)
VERB--NOUN-Acc (308)
VERB--NOUN-Acc,Dat (3)
VERB--NOUN-Acc,Dat-ADP(mid) (1)
VERB--NOUN-Acc-ADP(dale) (2)
VERB--NOUN-Dat (8)
VERB--NOUN-Dat-ADP(in) (1)
VERB--NOUN-Dat-ADP(nå) (1)
VERB--NOUN-Gen (1)
VERB--NOUN-Nom (13)
VERB--PRON (3)
VERB--PRON-Acc (194)
VERB--PRON-Acc,Dat (121)
VERB--PRON-Dat (6)
VERB--PRON-Nom (9)
VERB-Inf--NOUN-Acc (125)
VERB-Inf--NOUN-Acc,Dat (2)
VERB-Inf--NOUN-Acc,Dat-ADP(mid) (1)
VERB-Inf--NOUN-Acc-ADP(vöär) (1)
VERB-Inf--NOUN-Acc-ADP(åne) (1)
VERB-Inf--NOUN-Dat (1)
VERB-Inf--NOUN-Nom (3)
VERB-Inf--PRON (1)
VERB-Inf--PRON-Acc (65)
VERB-Inf--PRON-Acc,Dat (39)
VERB-Inf--PRON-Dat (1)
VERB-Inf--PRON-Nom (5)
VERB-Part--NOUN (1)
VERB-Part--NOUN-Acc (76)
VERB-Part--NOUN-Acc,Dat-ADP(in) (1)
VERB-Part--NOUN-Acc,Dat-ADP(mid) (1)
VERB-Part--NOUN-Dat (1)
VERB-Part--NOUN-Nom (5)
VERB-Part--PRON-Acc (61)
VERB-Part--PRON-Acc,Dat (20)
VERB-Part--PRON-Dat (1)
VERB-Part--PRON-Nom (2)
iobj
VERB--NOUN-Acc (3)
VERB--NOUN-Acc,Dat (2)
VERB--NOUN-Dat (5)
VERB--PRON-Acc (2)
VERB--PRON-Acc,Dat (57)
VERB--PRON-Dat (9)
VERB-Inf--NOUN-Acc,Dat (6)
VERB-Inf--NOUN-Dat (2)
VERB-Inf--PRON-Acc,Dat (22)
VERB-Inf--PRON-Dat (3)
VERB-Part--NOUN-Acc (1)
VERB-Part--NOUN-Acc,Dat (3)
VERB-Part--NOUN-Dat (2)
VERB-Part--PRON-Acc,Dat (20)
VERB-Part--PRON-Dat (3)
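Pattern counts of this shape can be tallied from parsed tokens. The sketch below is a simplified approximation and not the script that generated the figures above: it assumes that the verb form is appended only for Inf and Part, and it ignores the ADP(...) suffix for case-marking children. The function names and the three-token input are invented for illustration.

```python
from collections import Counter


def pattern(parent: dict, child: dict) -> str:
    """Build a label such as 'VERB-Inf--NOUN-Acc' for a verb and its dependent."""
    vform = parent["feats"].get("VerbForm")
    # Assumption: finite verbs are shown as bare 'VERB', as in the listing above.
    left = "VERB" + (f"-{vform}" if vform in ("Inf", "Part") else "")
    case = child["feats"].get("Case")
    right = child["upos"] + (f"-{case}" if case else "")
    return f"{left}--{right}"


def tally(tokens: list, relations=("nsubj", "obj", "iobj")) -> dict:
    """Count verb-to-noun/pronoun patterns per relation within one sentence."""
    by_id = {t["id"]: t for t in tokens}
    counts = {rel: Counter() for rel in relations}
    for child in tokens:
        parent = by_id.get(child["head"])
        if (
            parent is not None
            and parent["upos"] == "VERB"
            and child["upos"] in ("NOUN", "PRON")
            and child["deprel"] in counts
        ):
            counts[child["deprel"]][pattern(parent, child)] += 1
    return counts


# Toy sentence, NOT from the treebank.
toy = [
    {"id": 1, "form": "ik", "upos": "PRON", "feats": {"Case": "Nom"}, "head": 2, "deprel": "nsubj"},
    {"id": 2, "form": "segge", "upos": "VERB", "feats": {"VerbForm": "Fin"}, "head": 0, "deprel": "root"},
    {"id": 3, "form": "dat", "upos": "PRON", "feats": {"Case": "Acc"}, "head": 2, "deprel": "obj"},
]

result = tally(toy)
print(dict(result["nsubj"]), dict(result["obj"]))
```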
Reflexive Verbs
This corpus contains 39 lemmas that occur at least once with an expl:pv child. Examples: skamen sik, vorwunderen sik, bangen sik, bekyken sik, besinnen juw, besinnen sik, besnorgelen sik, besteaden sik, denken sik, eaten sik, geaven et, holden uus, inslån et, keyren sik, kyken sik, lägeren sik, låten my, låten sek, låten sik, maken dy, maken sik, neamen sik, nådenken my, resolveren sik, setten ju, setten sik, smegen sik, stellen sik, swearen my, trekken sik, tyren sük, vallen et, vordragen dy, vorlåten sik, vorvlöken sik, vorwylen sik, vörderen sik, wearen sik, weaten sik
Verbs with Reflexive Core Objects
This corpus contains 42 lemmas that occur at least once with a reflexive core object (obj or iobj). Examples: låten sik, stellen sik, draien sik, låten sek, maken sik, richten sik, setten sik, vortellen sik, vorwandelen sik, wunderen sik, anhandelen sik, bemöien süch, beperken sich, bringen sik, byten sik, entsluten sik, geaven sik, hebben sik, heisteren süch, helpen sik, höägen sik, inslachten sik, koaken sik, köypen sik, köypen sy, leggen sek, rekken süch, rennen sik, rögen sik, sitten sik, sküddelen sik, stöyten sik, tröysten sik, underholden sik, vorswearen sik, vroien sik, vöärneamen sik, weaten sik, wenden sik, winnen sik, wärmen sik, öäverdenken sik
Out of those, 1 lemma occurred more than once, but never without a reflexive dependent. Example: vorwandelen
Relations Overview
This corpus uses 11 relation subtypes:
acl:relcl
aux:pass
compound:prt
compound:redup
csubj:outer
det:poss
expl:pv
nmod:poss
nsubj:pass
obl:agent
obl:arg
The following 2 relation types are not used in this corpus at all:
clf
goeswith