Documentation of the twol-smn.txt file

This file is written in the Koskenniemi-style twolc format. Given the frequent many-to-many-alternations in Inari Sá grammar, it would probably be better to write it in xfst.

Needless to say, the rule set is still preliminary.

Alphabet and sets

The parser uses Latin 1 plus digraphs c1 d1 n1 s1 t1 z1 for the Sá letters, as in Northern Sámi. In addition, some morphographemes (h2, b3, etc) are added, this is for letter not undergoing the same changes as other h, b, etc.

The X1, Q1 etc. are diacritical symbols triggering morphophonological processes such as consonant gradation, etc.

The rules

Vowel lengthening

tbw.

Stem vowel alternation

There are rules for vowel lenghtening (inserting V) and for â to a triggered by WeG and a lenghtening diacriticon X4.

Consonant gradation

Deletion rule

The quantitative gradation is covered by the rule "Gradation: Double Consonant", deleting the second of a double consonant in WeG environments.

Change rules

The parser contains the following alternation rules:

"Qualitative Gradation of Simple Consonant"
"III-I Gradation"
"t:d1"
"h:v"


Last modified: Mon Nov 15 09:12:30 2004