A spell-checker for Sami
Intro
The Sami Parliament in Norway plans to make spell checkers for North,
Lule and South Sami, within the eSápmi program.
The Sami Parliament plans
Our lexica from a spell-checker perspective
At the moment, our lexica (*-sm?-lex.txt) are descriptive rather than
normative: they recognise both standard and substandard spelling
conventions. If the lexica are to be used for spell checkers, they must
distinguish between standard and substandard forms, in the way
described in Beesley and Karttunen's book.
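
As a rough illustration of what this distinction could look like, here is
a minimal Python sketch. It assumes each lexicon entry carries a flag
saying whether the form is standard. The entries, the flag and the
function names are hypothetical placeholders (English toy words, not
taken from the actual lexica), and the sketch does not claim to reproduce
the method in the book.

```python
# Toy lexicon: (surface form, lemma, is_standard).
# All entries are illustrative placeholders, not real lexicon data.
LEXICON = [
    ("colour", "colour", True),
    ("colours", "colour", True),
    ("color", "colour", False),  # marked substandard in this toy example
]


def descriptive_forms(lexicon):
    """Every form the lexicon recognises, standard or not."""
    return {form for form, _lemma, _standard in lexicon}


def normative_forms(lexicon):
    """Only the forms a spell checker should accept."""
    return {form for form, _lemma, standard in lexicon if standard}


def check(word, lexicon=LEXICON):
    """Classify a word as 'ok', 'substandard' or 'unknown'."""
    if word in normative_forms(lexicon):
        return "ok"
    if word in descriptive_forms(lexicon):
        return "substandard"
    return "unknown"


print(check("colour"), check("color"), check("colr"))
```

The same lexicon then serves both purposes: an analyser can keep using
the descriptive set, while a spell checker accepts only the normative
one.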
Different technologies
The spell-checker plans involve different technologies. Basically, the
distinction is between automata and transducers, i.e. in this
context, between systems that are unaware of grammatical information
and systems that are aware of it.
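
The difference can be made concrete with a small Python sketch. It is a
toy under stated assumptions: the two-word language ("cat", "cats"), the
state numbers and the tags "+N+Sg" and "+N+Pl" are invented for
illustration. The automaton only answers yes or no, while the transducer
also returns an analysis string.

```python
# Automaton (acceptor): (state, symbol) -> next state.
ACCEPTOR = {(0, "c"): 1, (1, "a"): 2, (2, "t"): 3, (3, "s"): 4}
ACCEPTOR_FINALS = {3, 4}


def accepts(word):
    """Return True if the automaton recognises the word."""
    state = 0
    for ch in word:
        state = ACCEPTOR.get((state, ch))
        if state is None:
            return False
    return state in ACCEPTOR_FINALS


# Transducer: (state, surface symbol) -> (next state, emitted output).
TRANSDUCER = {
    (0, "c"): (1, "c"),
    (1, "a"): (2, "a"),
    (2, "t"): (3, "t"),
    (3, "s"): (4, ""),
}
# Output appended when the word ends in a final state (hypothetical tags).
FINAL_OUTPUT = {3: "+N+Sg", 4: "+N+Pl"}


def analyse(word):
    """Return an analysis string, or None if the word is not recognised."""
    state, emitted = 0, []
    for ch in word:
        step = TRANSDUCER.get((state, ch))
        if step is None:
            return None
        state, out = step
        emitted.append(out)
    if state not in FINAL_OUTPUT:
        return None
    return "".join(emitted) + FINAL_OUTPUT[state]


print(accepts("cats"), analyse("cats"))
```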
transducer-based spell checkers
hmm...
automaton-based spell checkers
Some applications will need an ispell-type interface. This should be
provided either by using the original ispell method or by automatically
compressing a full-form list into an automaton.
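
The second option can be sketched in Python as follows. This is one
possible approach, not necessarily the one the project would use: build
a trie from the word list, then merge states that accept the same set of
suffixes. The word list and all function names are hypothetical.

```python
def build_trie(words):
    """Build a trie; each node is a dict with a final flag and edges."""
    root = {"final": False, "edges": {}}
    for word in words:
        node = root
        for ch in word:
            node = node["edges"].setdefault(ch, {"final": False, "edges": {}})
        node["final"] = True
    return root


def minimise(node, registry):
    """Merge equivalent states bottom-up and return the canonical node."""
    for ch, child in list(node["edges"].items()):
        node["edges"][ch] = minimise(child, registry)
    # Two nodes are equivalent if they agree on finality and on their
    # (symbol, canonical target) pairs.
    key = (
        node["final"],
        tuple(sorted((ch, id(child)) for ch, child in node["edges"].items())),
    )
    return registry.setdefault(key, node)


def count_states(root):
    """Count the distinct states reachable from the root."""
    seen, stack = set(), [root]
    while stack:
        node = stack.pop()
        if id(node) not in seen:
            seen.add(id(node))
            stack.extend(node["edges"].values())
    return len(seen)


def accepts(root, word):
    """Return True if the automaton recognises the word."""
    node = root
    for ch in word:
        node = node["edges"].get(ch)
        if node is None:
            return False
    return node["final"]


# Hypothetical full-form list.
WORDS = ["walk", "walks", "walked", "talk", "talks", "talked"]
trie = build_trie(WORDS)
states_before = count_states(trie)
automaton = minimise(trie, {})
print(states_before, count_states(automaton))
```

Because the toy words share their endings, the merged automaton needs
far fewer states than the trie while accepting exactly the same words.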
Trond Trosterud
Last modified: Sat Mar 27 23:55:19 2004