! Divvun & Giellatekno - open source grammars for Sámi and other languages ! Copyright © 2000-2010 The University of Tromsø & the Norwegian Sámi Parliament ! http://giellatekno.uit.no & http://divvun.no ! ! This program is free software; you can redistribute and/or modify ! this file under the terms of the GNU General Public License as published by ! the Free Software Foundation, either version 3 of the License, or ! (at your option) any later version. The GNU General Public License ! is found at http://www.gnu.org/licenses/gpl.html. It is ! also available in the file $GTHOME/LICENSE.txt. ! ! Other licensing options are available upon request, please contact ! giellatekno@hum.uit.no or divvun@hum.uit.no Multichar_Symbols +Use/NG ! Do not generate, for isme-ped.fst and apertium LEXICON Root !Setting aside the thousands and millions !#! Lexica for 1001 - hundreds of millions are moved to the end !#! of this document, and commented out with !#! !#! Move it up here if higher numbers are needed. ! Modeling 1-1000 only 1:+Use/NGpêyak% kihci% mitâtahtomtanaw THOUSAND ; ! 1000 1:pêyak% kihci% mitâtahtomitanaw THOUSAND ; ! 1000 UNDERTHOUSAND ; ! 100-999 TENS ; ! 20-99 NEARLY ; ! 29-99 TEENS ; ! 10-19 ONES ; ! 1-9 LEXICON THOUSAND :% ayiwâk% UNDERTHOUSAND ; ! 1100-1999 %0:% ayiwâk% TENS ; ! 1020-1099 %0:% ayiwâk% TEENS ; ! 1010-1019 %0%0:% ayiwâk% ONES ; ! 1001-1009 %0%0%0: ENDLEX ; ! 1000 !=========================== !Here starts the 999 numbers !=========================== LEXICON UNDERTHOUSAND HUNDREDS ; 1:mitâtahtomitanaw HUNDRED ; 1:+Use/NGmitâtahtomtanaw HUNDRED ; LEXICON HUNDREDS 2:nîso% CUODI ; 3:nisto% CUODI ; 4:nêwo% CUODI ; 5:niyânano% CUODI ; !OBS 6:nikotwâsiko% CUODI ; 7:têpakohpo% CUODI ; 8:ayinânêwo% CUODI ; 9:kêkâmitâtahto% CUODI ; LEXICON CUODI :mitâtahtomitanaw HUNDRED ; :+Use/NGmitâtahtomtanaw HUNDRED ; LEXICON HUNDRED :% ayiwâk% TENS ; :% ayiwâk% NEARLY ; :% ayiwâk% TEENS ; %0:% ayiwâk% ONES ; %0%0: ENDLEX ; ! !!!!!!!!!!!!!!!!!!!!!!!!!!!!! LEXICON TENS 1%0:mitâtaht ENDLEX ; 2:nîsi TEN ; 2:+Use/NGnîs TEN ; 2:+Use/NGnîsomi TEN; 3:nistomi TEN ; 4:nêmi TEN ; 5:niyânanomi TEN ; 6:nikotwâsomi TEN ; 7:têpakohpomi TEN ; 7:+Use/NGtêpakohpimi TEN ; 8:ayinânêwomi TEN ; 9:kêkâ-mitâtahtomi TEN ; 9:+Use/NGkêkâmitâtahtomi TEN ; LEXICON TEN :tanaw% ONESTONEXT ; %0:tanaw ENDLEX ; LEXICON NEARLY 29:kêkâ-nistomi NEARLY-TEN ; 29:+Use/NGkêkâc% nistom NEARLY-TEN ; 29:+Use/NGkêkâc% nistomi NEARLY-TEN ; 29:+Use/NGkêkâc-nistomi NEARLY-TEN ; 39:kêkâ-nêmi NEARLY-TEN ; 39:+Use/NGkêkâc% nêwom NEARLY-TEN ; 39:+Use/NGkêkâc% nêwomi NEARLY-TEN ; 39:+Use/NGkêkâc-nêwomi NEARLY-TEN ; 39:+Use/NGkêkâc% nêmi NEARLY-TEN ; 39:+Use/NGkêkâc-nêmi NEARLY-TEN ; 49:kêkâ-niyânanomi NEARLY-TEN ; 49:+Use/NGkêkâc% nîyânom NEARLY-TEN ; 49:+Use/NGkêkâc% nîyânomi NEARLY-TEN ; 49:+Use/NGkêkâc-niyânanomi NEARLY-TEN ; 59:kêkâ-nikotwâsomi NEARLY-TEN ; 59:+Use/NGkêkâc% nikotwâsom NEARLY-TEN ; 59:+Use/NGkêkâc-nikotwâsomi NEARLY-TEN ; 69:kêkâ-têpakohpomi NEARLY-TEN ; 69:+Use/NGkêkâ-têpakohpimi NEARLY-TEN ; 69:+Use/NGkêkâc% têpakohpim NEARLY-TEN ; 69:+Use/NGkêkâc-têpakohpimi NEARLY-TEN ; 69:+Use/NGkêkâc-têpakohpomi NEARLY-TEN ; 79:kêkâ-ayinânêwomi NEARLY-TEN ; 79:+Use/NGkêkâc% ayinânêwom NEARLY-TEN ; 79:+Use/NGkêkâc-ayinânêwomi NEARLY-TEN ; 89:kêkâc-kêkâ-mitâtahtomi NEARLY-TEN ; 89:+Use/NGkêkâc% kêkâ% mitâtahtom NEARLY-TEN ; 89:+Use/NGkêkâc% kêkâ% mitâtahtomi NEARLY-TEN ; 9%0:+Use/NGkêkâ% mitâtahtom NEARLY-TEN ; 9%0:+Use/NGkêkâ% mitâtahtomi NEARLY-TEN ; 99:kêkâ-mitâtahtomitanaw% kêkâ-mitâtaht ENDLEX ; 99:+Use/NGkêkâ% mitatahtomitanaw% ayiwâk% kêkâ% mitâtaht ENDLEX ; LEXICON NEARLY-TEN :tanaw ENDLEX ; LEXICON ONESTONEXT 1:pêyakosâp ENDLEX ; 2:nîsosâp ENDLEX ; 3:nistosâp ENDLEX ; 4:nêwosâp ENDLEX ; 5:niyânanosâp ENDLEX ; 6:nikotwâsosâp ENDLEX ; 7:têpakohposâp ENDLEX ; 8:ayinânêwosâp ENDLEX ; 9:+Use/NGkêkâ-mitâtahtosâp ENDLEX ; ! use the NEARLY instead (check) 9:+Use/NGkêkâmitâtahtosâp ENDLEX ; 1:+Use/NGpêyako-sâp ENDLEX ; 2:+Use/NGnîso-sâp ENDLEX ; 3:+Use/NGnisto-sâp ENDLEX ; 4:+Use/NGnêwo-sâp ENDLEX ; 5:+Use/NGniyânano-sâp ENDLEX ; 6:+Use/NGnikotwâso-sâp ENDLEX ; 7:+Use/NGtêpakohp-sâp ENDLEX ; 8:+Use/NGayinânêwo-sâp ENDLEX ; !9:kêkâc% nîsitanaw ENDLEX ; ! only for 19 (almost 20) :+Use/NGayiwâk% CARDINAL ; LEXICON TEENS 11:pêyakosâp ENDLEX ; 12:nîsosâp ENDLEX ; 13:nistosâp ENDLEX ; 14:nêwosâp ENDLEX ; 15:niyânanosâp ENDLEX ; 16:+Use/NGnikotwâsikosâp ENDLEX ; 16:nikotwâsosâp ENDLEX ; 17:têpakohposâp ENDLEX ; 18:ayinânêwosâp ENDLEX ; 19:kêkâ-nîsitanaw ENDLEX ; 19:+Use/NGkêkâc% nîsitanaw ENDLEX ; 19:+Use/NGkêkâmitâtahtosâp ENDLEX ; 19:+Use/NGkêkâ-mitâtahtosâp ENDLEX ; LEXICON ONES CARDINAL ; !ORDINAL ; ! not done yet, unfortunately LEXICON CARDINAL 1:pêyak ENDLEX ; 2:nîso ENDLEX ; 3:nisto ENDLEX ; 4:nêwo ENDLEX ; 5:niyânan ENDLEX ; 6:nikotwâsik ENDLEX ; 7:têpakohp ENDLEX ; 8:ayinânêw ENDLEX ; 9:kêkâ-mitâtaht ENDLEX ; 9:+Use/NGkêkâmitâtaht ENDLEX ; 9:+Use/NGkêkâ% mitâtaht ENDLEX ; LEXICON ENDLEX # ; ! Down here comes outcommented stuff, it may be needed some day. ! +String ! Tag to denote non-numeric strings ! +NumNum ! Tag to denote real numbers in one form or another !These 4 lines are for TTS, they are commented out. !< %+String [a|b|c|d|e|f|g|h|i|j|k|l|m|n|o|p|q|r|s|t|u|v|w|q|y|z|â|ê|î|ô|A|B|C|D|E|F|G|H|I|J|K|L|M|N|O|P|Q|R|S|T|U|V|W|X|Y|Z|Â|Ê|Î|Ô]* > ENDLEX ; ! This first line is to allow all letter strings. !+NumNum NUMBERSECTION ; !+NumNum ENDLEXSECTION ; !LEXICON NUMBERSECTION !#! This shall be part of the Root lexicon: !#! HUNDREDSM ; ! 200M !#! 1:mitâtahtomitanaw% HUNDREDM ; ! 100M !#! 1:mitâtahtomtanaw% HUNDREDM ; ! 100M !#! TENSM ; ! 20-99M !#! TEENSM ; ! 10-19M !#! ONESM ; ! 1-9M !#! !HUNDREDST ; ! 200000-999999 NB !#! 1:mitâtahtomitanaw HUNDREDT ; ! 100000-100999 !#! 1:mitâtahtomtanaw HUNDREDT ; ! 100000-100999 !#! TENST ; ! 20000-99999,10000-10999 !#! TEENST ; ! 11000-19999 !#! ONEST ; ! 2000-9999 !#! Here shall come the rest of what is today under LEXICON Root !#! !#! Then the rest of this !#! section shall come between the Root !#! lexicon and the rest of the file. !#! LEXICON HUNDREDSM !#! 2:nîso CUODIM ; !#! 3:nisto CUODIM ; !#! 4:nêwo CUODIM ; ! Google: 459 !#! 5:niyânan CUODIM ; !#! 6:nikotwâsik CUODIM ; !#! 7:têpakohp CUODIM ; !#! 8:ayinânêw CUODIM ; !#! 9:kêkâ-mitâtaht CUODIM ; !#! !#! LEXICON CUODIM !#! :mitâtahtomitanaw HUNDREDM ; !#! :+Use/NGmitâtahtomtanaw HUNDREDM ; !#! !#! LEXICON HUNDREDM !#! TENSM ; !#! TEENSM ; !#! %0: ONESM ; !#! %0%0: MILJON ; !#! !#! LEXICON TEENSM !#! 1:luhkie TEENM ; ! South Saami (!), to be replaced !#! !#! LEXICON TEENM !#! 1:pêyak LOHKAIM ; !#! 2:nîso LOHKAIM ; !#! 3:nisto LOHKAIM ; !#! 4:nêwo LOHKAIM ; !#! 5:niyânan LOHKAIM ; !#! 6:nikotwâsik LOHKAIM ; !#! 7:têpakohp LOHKAIM ; !#! 8:ayinânêw LOHKAIM ; !#! 9:kêkâmitâtaht LOHKAIM ; !#! !#! LEXICON LOHKAIM !#! MILJON ; !#! !#! LEXICON TENSM !#! 1%0:luhkie MILJON ; ! South Saami (!), to be replaced !#! 2:nîso TENM ; !#! 3:nisto TENM ; !#! 4:nêwo TENM ; !#! 5:niyânan TENM ; !#! 6:nikotwâsik TENM ; !#! 7:têpakohp TENM ; !#! 8:ayinânêw TENM ; !#! 9:kêkâmitâtaht TENM ; !#! !#! LEXICON TENM !#! %0:luhkie MILJON ; ! South Saami (!), to be replaced !#! :luhkie ONESM ; ! South Saami (!), to be replaced !#! !#! LEXICON ONESM !#! 1:akte% miljon% OVERTHOUSANDS ; ! South Saami (!), to be replaced !#! 2:nîso MILJON ; !#! 3:nisto MILJON ; !#! 4:nêwo MILJON ; !#! 5:niyânan MILJON ; !#! 6:nikotwâsik MILJON ; !#! 7:têpakohp MILJON ; !#! 8:ayinânêw MILJON ; !#! 9:kêkâmitâtaht MILJON ; !#! !#! LEXICON MILJON !#! :% miljona% OVERTHOUSANDS ; ! South Saami (!), to be replaced !#! !#! ! ================= !#! ! Under the million !#! ! ================= !#! !#! LEXICON OVERTHOUSANDS !#! HUNDREDST ; ! x00.000.000 !#! 1:mitâtahtomitanaw HUNDREDT ; ! 100.000.000 !#! 1:tahtomimitâtahtomtanaw HUNDREDT ; ! 100.000.000 !#! %0: TENST ; ! 20.000.000 !#! %0: TEENST ; ! 10.000.000 !#! %0%0: ONEST ; ! 2.000.000 !#! %0%01:tahtomipêyak% kihci% mitâtahtomtanaw THOUSAND ; !#! %0%01:pêyak% kihci% mitâtahtomitanaw THOUSAND ; !#! %0%0%0: UNDERTHOUSAND ; !#! %0%0%0%0: TENS ; !#! %0%0%0%0: TEENS ; !#! %0%0%0%0%0: ONES ; !#! %0%0%0%0%0%0: ENDLEX ; ! x000000 !#! !#! LEXICON HUNDREDST !#! 2:nîso CUODIT ; ! 200000-299999 !#! 3:nisto CUODIT ; ! 300000-399999 !#! 4:nêwo CUODIT ; ! 400000-499999 !#! 5:niyânan CUODIT ; ! 500000-599999 !#! 6:nikotwâsik CUODIT ; ! 600000-699999 !#! 7:têpakohp CUODIT ; ! 700000-799999 !#! 8:ayinânêw CUODIT ; ! 800000-899999 !#! 9:kêkâmitâtaht CUODIT ; ! 900000-999999 !#! !#! LEXICON CUODIT !#! :mitâtahtomitanaw HUNDREDT ; ! 100000-199999 !#! :+Use/NGmitâtahtomtanaw HUNDREDT ; ! 100000-199999 !#! !#! LEXICON HUNDREDT ! X = 1-9, Y = 0-9 !#! :% TENST ; ! X2XYYY, X10YYY !#! TEENST ; ! X1XYYY !#! %01:akte THOUSANDS ; ! South Saami (!), to be replaced !#! %0: ONEST ; ! XX0YYY !#! %0%0: THOUSANDS ; ! X00YYY !#! !#! LEXICON TEENST !#! 1:luhkie TEENT ; ! South Saami (!), to be replaced !#! 1%0:luhkie THOUSANDS ; ! South Saami (!), to be replaced !#! !#! LEXICON TEENT !#! 1:pêyak LOHKAIT ; !#! 2:nîso LOHKAIT ; !#! 3:nisto LOHKAIT ; !#! 4:nêwo LOHKAIT ; !#! 5:niyânan LOHKAIT ; !#! 6:nikotwâsik LOHKAIT ; !#! 7:têpakohp LOHKAIT ; !#! 8:ayinânêw LOHKAIT ; !#! 9:kêkâmitâtaht LOHKAIT ; !#! !#! LEXICON LOHKAIT !#! THOUSANDS ; !#! !#! LEXICON TENST !#! 2:nîsi TANAW ; !#! 2:+Use/NGnîs TANAW ; !#! 2:+Use/NGnîsomi TANAW ; !#! 3:nistomi TANAW ; !#! 4:nêmi TANAW ; !#! 5:niyânanomi TANAW ; !#! 6:nikotwâsomi TANAW ; !#! 7:têpakohpimi TANAW ; !#! 8:ayinânêwomi TANAW ; !#! 9:kêkâmitâtahtomi TANAW ; !#! !9:kâmitâtahtomi TANAW ; !#! !#! LEXICON TANAW !#! %0:tanaw ENDLEX ; !#! %0:tanaw% ONESTONEXT ; !#! !#! LEXICON TENT !#! %0: THOUSANDS ; !#! 1:akte THOUSANDS ; ! error sma !#! ONEST ; !#! !#! LEXICON ONEST !#! 2:nîso% THOUSANDS ; !#! 3:nisto% THOUSANDS ; !#! 4:nêwo% THOUSANDS ; !#! 5:niyânan% THOUSANDS ; !#! 6:nikotwâsik% THOUSANDS ; !#! 7:têpakohp% THOUSANDS ; !#! 8:ayinânêw% THOUSANDS ; !#! 9:kêkâmitâtaht% THOUSANDS ; !#! !#! LEXICON THOUSANDS ! x > 1 !#! :kihci% mitâtahtomitanaw THOUSAND ; !#! :+Use/NGkihci% mitâtahtomtanaw THOUSAND ; !#!