!!!Kick-off meeting * antti keeps the system * the parts we will have to do for new languages (they are not modules yet) ** letter-to-sound ** how we will model prosody ** phrasing ** how to split training sentences * a big issue is the handling of numbers and abbreviations * inflect numbers for case * tts for prosody * html marking as clues * hmm synthesis * does not glue * make a reverse speech recognition * not tobi * perceptual prominence model * accent * stress on the first syllable The Antti model for now: * 0 - deaccented * 1 - weak cont wo * 2 - strong foc * 3 - emphatic As comparision: the Trondheim model: * 1 - nonfoc * 2 - foc * 3 - acc {{{ 3Pekkakin sen möi Minä puhun 2Pekastakin }}} cg-stype rule set: * prom of -1 0 1 on word level * prom of -1 0 1 on syll level word level {{{ funct w deacc pron v weak num adv N adj strong NP cost can be considered having focus and background theme rheme verb .... if theme has two constituents the first is sprominent and others are secontary or olc 3Pekka 2kalan 1osta 3-keskiviikkona 2-kaikki tutkijat 1-læhtivæt kotiin }}} rheme {{{ S O systemic ordering deviations from syystemic ordering has prosodic contecequence if A B in text sys B A act A B => prominent-B, less-prominent-A hypothesis for finnish SO subj < obj < loc < manner < osurce < goal if any of these is moved to the left in the string it will be part of the theme and therefor less prominent. pekka osti KALAN-obj TORILTA-src Pekka osti torilta-src KALAN-obj }}} * if word order follows the systemic order we cannot say anything * if word order deviates from systemic order (as indicated) we can draw conclusions on theme status and therefore (de)accent accordingly rule against data driven approch: * pro ** insight to phon ** motivated ** not huge c ** rare * con