This dir contains the meta-data sets for the CLARINO project. Is this data shareable via CLARINO? DONE: - sjeswe: 6547 entries (ask Josh) - sjdrus: 7616 entries (Kurutch ==> ask Michael) - smsfin: (ask Michael and Jack) DONE: - 10 dict data sets from the GT/DV-group have been uploaded on the Clarino Repository UB Berger https://repo.clarino.uib.no/xmlui/ DONE: - all free corpora: sme, smj, sma, fkv - frequency lists - ngrams: 2- and 3-grams DONE: - upload the corpus/freq lists/n-grams on the Bergen repository DONE: - sent a representative sample for the Trolling repository for testing purposes TODO - all u_korp corpora (wikipedia texts): Komi texts|Udmurt texts|Moksha texts|Erzya texts|Hill Mari texts|Meadow Mari texts - upload the freq lists/n-grams for smn on the Bergen repository - correct metadata as discussed with Gunn Inger via email