Documenting the Southern Sámi lexicon file
Introduction
The sma lexicon is organised the same way as the Northern Sámi
lexicon, cf. the flowchart of
Northern Sámi.
So far (May 2002) the sma lexicon contains nouns and some closed
classes (auxiliary verbs, adverbs, adpositions and personal and
demonstrative pronouns.
Morphophonology in the lexical representations
The nouns are entered with a morphophonolgical underlying form, and a
corresponding simple morphological structure, with few morpholocial
sublexica. Before the system is enlarged, we should evaluate it, and
compare it to the other languages, and to Xerox practice.
The nouns
The noun stems are stored in noun-sma-lex,txt, whereas the
morphology is found in sma-lex.txt. It is made by converting
the original Moshagen / Trosterud sma lexicon to the Xerox format. The
lexical rules are taken from Karttunen's alternative formualtion of
our file. His original is found in the archive of original files
(Contact Trond for reference, if needed).
The nouns are divided into three stem classes, the N_IE nouns (gåetie,
etc.), the N_OE nouns (bearkoe, etc) and the N_OTHER nouns (all other
ones, bi- and trisyllabic alike).
The case forms fall in three different groups: Forms unique to each of
the three stem classes (listed under each sublexicon), forms with -j-
in the N_OE class and -i- in the other ones (with separate i- and j-
sublexica and a common suffix lexicon), and forms common to all stem
classes (covered in a common continuation lexicon).
The proper nouns
The proper nouns file contains appr. 1000 Southern Sami place names
(achieved from Statens Kartverk), and appr. 23000 general names. Both
the general names and the Sami names are divided in two groups, VNAME
and CNAME. This is most certainly not correct for the latter group,
probably also not for the former (the syllable structure will be
relevant). Work is needed.
The adjectives
Nothing has been done on the adjectives.
The verbs
The auxiliary lea and the negative verbs have been added. These
verbs are irregular, and have thus been added without the use of any
morphophonological processes.
Verb entries have been added for all stem classes and all Umlaut types
in Bergsland's grammar.
stemcl. lexicon umlaut row
---------------------------
odd DÅERIEDIDH -
I BÅETEDH row A
II TJEARODH row C
III GUARKEDH row B
IV TJOEHPEDH row D
V VÅÅJNEDH row E
VI GÖÖLEDH row F
---------------------------
Thus, the framework is now ready to add all the verbs. In addition,
most inflection has been added (present and past tense, infinitive,
verb genitive, actio, gerund, the participles, conneg form and
verbabbessive).
No derivation is included.
The adpositions and prepositions
The adpositions in Bergsland's grammar have been listed, in two groups, pure postpositions and combined pre/postpositions (named "adpositions").
The adverbs
The adv-sma-lex.txt file lists all the adverbs mentioned in
Bergsland's grammar. In some cases, it is unclear whether it is an
adverb or just a noun prototypically used as an adverb.
Trond Trosterud
Last modified: Mon May 6 00:44:21 CEST 2002