Documentation of the twol-smn.txt file
This file is written in the Koskenniemi-style twolc format. Given the
frequent many-to-many-alternations in Inari Sá grammar, it
would probably be better to write it in xfst.
Needless to say, the rule set is still preliminary.
Alphabet and sets
The parser uses Latin 1 plus digraphs c1 d1 n1 s1 t1 z1 for the
Sá letters, as in Northern Sámi. In addition, some
morphographemes (h2, b3, etc) are added, this is for letter not
undergoing the same changes as other h, b, etc.
The X1, Q1 etc. are diacritical symbols triggering morphophonological
processes such as consonant gradation, etc.
The rules
Vowel lengthening
tbw.
Stem vowel alternation
There are rules for vowel lenghtening (inserting V) and for â to
a triggered by WeG and a lenghtening diacriticon X4.
Consonant gradation
Deletion rule
The quantitative gradation is covered by the rule "Gradation: Double
Consonant", deleting the second of a double consonant in WeG
environments.
Change rules
The parser contains the following alternation rules:
"Qualitative Gradation of Simple Consonant"
"III-I Gradation"
"t:d1"
"h:v"
Last modified: Mon Nov 15 09:12:30 2004