Error report for improving the cg2visl.pl script First of all: Good work, Saara! Then: comments to things to improve: - Coordination ============== S:n('juolgi',pl,nom) Juolggit CO:conj-s('ja') ja S:n('giehta',pl,nom) gieđat P:v('leat',IV,ind,pr,3pl) leat =H:v('seargat',IV,pcp2) seargan . Coordination structures should be collected into one phrase, of type X:par (not X:g), like this: S:par =CJT:n('juolgi',pl,nom) Juolggit =CO:conj-s('ja') ja =CJT:n('giehta',pl,nom) gieđat Pg =D:v('leat',IV,ind,pr,3pl) leat =H:v('seargat',IV,pcp2) seargan . Rule: For a constituent CO, identify the identical constituents X before and after it, and make a par group X:par. - Discontinuous constituents: ============================= Trond: More text here!! SOURCE: text SME42 In monge háliidivčče. A1 P:g =D:vaux:v('ii',IV,neg,ind,1sg) In S:pron('mun',,1sg,nom,foc/ge) monge =H:v('háliidit',TV,cond,pr,conneg) háliidivčče . - P:v as a group In several cases we have this pattern: S:n('ruhta',pl,nom) Ruđat P:v('leat',IV,ind,pr,3pl) leat =H:v('nohkat',IV,pcp2) nohkan P:v('leat',IV,cond,pr,1sg) livččen =H:v('boahtit',IV,pcp2) boahtán . Target, in each case, is: P:g =D:v('leat',IV,cond,pr,1sg) livččen =H:v('boahtit',IV,pcp2) boahtán . So, the quasicode is: In a structure P:v =H:v expand to P:g =D:v =H:v or, more general A:x =H:y . assume that A is the group and go for A:g =D:x =H:y This seems to be a problem with head-final phrases: A1 A1 =D:pron('mun',,1sg,gen) Mu =D:adj('ođas',attr) ođđa S:n('ustit',sg,nom) ustit P:v('boahtit',IV,ind,pr,3sg) boahtá . Here, we want the head of the phrase to be picked up by the group: A1 S:g =D:pron('mun',,1sg,gen) Mu =D:adj('ođas',attr) ođđa =H:n('ustit',sg,nom) ustit P:v('boahtit',IV,ind,pr,3sg) boahtá . Sentence-initial =D-s ===================== Since they are sentence-initial, their H comes to the right. Thus, the following structures A1 :g =D:pron('buot',) Buot S:n('oahppi',pl,nom) oahppit P:v('leat',IV,ind,pr,3pl) leat =H:v('mannat',IV,pcp2) mannan A1 :g =D:pron('juohke',) Juohke S:n('loddi',sg,nom) loddi A1 =D:pron('mun',,1sg,gen) Mu =D:adj('boaris',sup,attr) boarráseamos S:n('viellja',sg,nom) viellja may safely be made into S:g =D:pron('buot',) Buot =H:n('oahppi',pl,nom) oahppit P:v('leat',IV,ind,pr,3pl) leat =H:v('mannat',IV,pcp2) mannan A1 S:g =D:pron('juohke',) Juohke =H:n('loddi',sg,nom) loddi A1 S:g =D:pron('mun',,1sg,gen) Mu =D:adj('boaris',sup,attr) boarráseamos =H:n('viellja',sg,nom) viellja BOS and chain of =D: Take the first non-=D constituent and make into head of the :g today :g =D:n Pekan Det @N> =D:adj vanha Adj @N> S:n kaveri @SUBJ on tullut S:g =D:n Pekan @DN> =D:adj vanha @AN> =H:n kaveri @SUBJ Finding the head of preposition and postposition @ADVL tänään @GP> ilman @ADVL maitoa @ADVL maidon @GP< takia @ADVL tänään A:g =D:n =H:po A:adv A:adv A:g =D:n =H:po Neutralised homonymy ==================== Words are represented twice when there are several homonym analyses. But in several cases these are neutralised during cg2visl, e.g. SOURCE: text SME85 Orun stuora stuora vilges vilges viesus. A1 P:v('orrut',IV,ind,pr,1sg) Orun =D:adj('stuoris',attr) stuora =D:adj('stuoris',attr) stuora =D:adj('vielgat',attr) vilges =D:adj('vielgat',attr) vilges A:n('viessu',sg,loc) viesus . In those cases, the resulting structure may be uniq-ed. =D:pron('dat',,sg,acc) Dan Od:n('oassi',sg,acc) oasi =D:n('joavku',sg,gen) joavkku A:n('mandáhta',sg,loc) mandáhtas =H:v('ollašuhttit',TV,inf) ollašuhttit A:pron('mii',,sg,loc) mas P:g =D:vaux:v('galgat',IV,ind,pr,3sg) galgá " =H:v('čielggadit',TV,inf) čielggadit Od:n('stáhta',sg,acc) stáhta A:adv('nu') nu =D:TV('gohčodit',Der/PassL,v,pcp2,) gohčoduvvon S:n('stáhta',sg,gen) stáhta =D:TV('matrikuleret',vabe,adj,attr,) matrikulerekeahtes =D:n('eana',pl,gen) eatnamiid A:v('háhkat',TV,actio,gen) háhkama " , P:g =D:vaux:v('leat',IV,ind,pr,3sg) lea A:adv('liikká') liikká S:n('riekte#ovdáneapmi',sg,nom) riekteovdáneapmi =H:TV('váldit',Der/PassS,v,pcp2,) váldon A:adv('mielde') mielde A:adv('eanet',comp) eanet :cl =SUB:conj-c('go') go @COMP-CS<:pron('mii',,sg,nom) mii A:adv('duođaid') duođaid P:v('leat',IV,cond,pr,3sg) livččii =D:vaux:v('leat',IV,pcp2) leamaš Cs:adj('dárbbašlaš',sg,nom) dárbbašlaš S:v('čielggadit',TV,inf) čielggadit =D:adj('dálá',attr) dálá Od:n('njuolggadus',pl,acc) njuolggadusaid , Od:cl =SUB:conj-c('go') go =P:v('leat',IV,ind,pr,3sg) lea =A:adv('maid') maid ==D:vaux:v('fertet',IV,pcp2) ferten ==H:v('geahčadit',TV,inf) geahčadit ==D:prop('Finnmárku',plc,sg,gen) Finnmárkku =Od:n('vuoigatvuohta',sg,acc) vuoigatvuođa =CO:conj-s('ja') ja ==D:n('ráđđejupmi',sg,gen) ráđđejumi ==D:n('dilálašvuohta',pl,gen) dilálašvuođaid =Od:num('20',sg,acc) 20 =Co:n('ovdáneapmi',sg,acc) ovdáneami .