JTDH 2022 PROCEEDINGS

Proceedings of the Conference on Language Technologies & Digital Humanities 2022


Invited lectures

Eetu Mäkelä: Designing computational systems to support humanities and social sciences research [PDF] [Video]

Benoît Sagot: Large-scale language models: challenges and perspective [PDF] [Video]


Papers

Amaury Flávio Silva: The impact of a one-session-phonetic training on the improvement of non-native speakers’ pronunciation of English [PDF] [Video]

Špela Arhar Holdt, Polona Gantar, Iztok Kosem, Eva Pori, Nataša Logar, Vojko Gorjanc, Simon Krek: Sovražno in grobo besedišče v odzivnem Slovarju sopomenk sodobne slovenščine [PDF] [Video]

Saša Babič, Tomaž Erjavec: Izdelava in analiza digitalizirane zbirke paremioloških enot [PDF]

Petra Bago, Virna Karlić: DirKorp: A Croatian Corpus of Directive Speech Acts [PDF] [Video]

Kaja Dobrovoljc, Luka Terčon, Nikola Ljubešić: Universal Dependencies za slovenščino: nadgradnja smernic, učnih podatkov in razčlenjevalnega modela [PDF] [Video]

Gregor Donaj, Mirjam Sepesy Maučec: Primerjava načinov razcepljanja besed v strojnem prevajanju slovenščina–angleščina [PDF] [Video]

Tomaž Erjavec, Kaja Dobrovoljc, Darja Fišer, Jan Jona Javoršek, Simon Krek, Taja Kuzman, Cyprian Laskowski, Nikola Ljubešić, Katja Meden: Raziskovalna infrastruktura CLARIN.SI [PDF] [Video]

Simon Gonzalez: ILiAD: An Interactive Corpus for Linguistic Annotated Data from Twitter Posts [PDF] [Video]

Martin Anton Grad, Nataša Hirci: Raba Kolokacijskega slovarja sodobne slovenščine pri prevajanju kolokacij [PDF] [Video]

Lucija Gril, Simon Dobrišek, Andrej Žgank: Akustično modeliranje z različnimi osnovnimi enotami za avtomatsko razpoznavanje slovenskega govora [PDF]

Boshko Koloski, Senja Pollak, Matej Martinc: What works for Slovenian? A comparative study of different keyword extraction systems [PDF] [Video]

Iztok Kosem, Jaka Čibej, Kaja Dobrovoljc, Nikola Ljubešić: Spremljevalni korpus Trendi: metode, vsebina in kategorizacija besedil [PDF] [Video]

Sara Košutar, Dario Karl, Matea Kramarić, Gordana Hržica: Automatic Text Analysis in Language Assessment: Developing a MultiDis Web Application [PDF] [Video]

Taja Kuzman, Nikola Ljubešić, Senja Pollak: Assessing Comparability of Genre Datasets via Cross-Lingual and Cross-Dataset Experiments [PDF] [Video]

Jakob Lenardič, Kristina Pahor de Maiti: Slovenian Epistemic and Deontic Modals in Socially Unacceptable Discourse Online [PDF] [Video]

Nikola Ljubešić, Peter Rupnik: The ParlaSpeech-HR benchmark for speaker profiling in Croatian [PDF] [Video]

Maja Miličević Petrović, Vuk Batanović, Radoslava Trnavac, Borko Kovačević: Cross-Level Semantic Similarity in Newswire Texts and Software Code Comments: Insights from Serbian Data in the AVANTES Project [PDF] [Video]

Michal Mochtak, Peter Rupnik, Nikola Ljubešić: The ParlaSent-BCS Dataset of Sentiment-annotated Parliamentary Debates from Bosnia and Herzegovina, Croatia, and Serbia [PDF] [Video]

Marta Petrak, Mia Uremović, Bogdanka Pavelin Lešić: Fine-grained human evaluation of NMT applied to literary text: case study of a French-to-Croatian translation [PDF] [Video]

Aleksandar Petrovski: A Bilingual English-Ukrainian Lexicon of Named Entities Extracted from Wikipedia [PDF] [Video]

Vladimir Polomac: Serbian Early Printed Books: Towards Generic Model for Automatic Text Recognition using Transkribus [PDF]

Eva Pori, Jaka Čibej, Tina Munda, Luka Terčon, Špela Arhar Holdt: Lematizacija in oblikoskladenjsko označevanje korpusa SentiCoref [PDF] [Video]

Ajda Pretnar Žagar, Nikola Ðukić, Rajko Muršić: Document Enrichment as a Tool for Automated Interview Coding [PDF] [Video]

Jure Skubic, Darja Fišer: Parliamentary Discourse Research in History: Literature Review [PDF] [Video]

Mojca Šorli, Andrejka Žejn: Annotation of Named Entities in the May68 Corpus: NEs in modernist literary texts [PDF] [Video]

Thi Hong Hanh Tran, Matej Martinc, Andraž Repar, Antoine Doucet, Senja Pollak: A Transformer-based Sequence-labeling Approach to the Slovenian Cross-domain Automatic Term Extraction [PDF] [Video]

Darinka Verdonik, Andreja Bizjak, Andrej Žgank, Simon Dobrišek: Metapodatki o posnetkih in govorcih v govornih virih: primer baze Artur [PDF] [Video]

Maja Veselič, Dunja Zorman: Uporaba Europeaninega podatkovnega modela (EDM) pri digitalizaciji kulturne dediščine: primer Skuškove zbirke iz Slovenskega etnografskega muzeja v projektu PAGODE-Europeana China [PDF] [Video]

Špela Vintar, Andraž Repar: Human Evaluation of Machine Translations by Semi-Professionals: Lessons Learnt [PDF] [Video]

Branko Žitko, Lucija Bročić, Angelina Gašpar, Ani Grubišić, Daniel Vasić, Ines Šarić-Grgić: Automatic Predicate Sense Disambiguation Using Syntactic and Semantic Features [PDF]


Abstracts

Petra Bago: Progress of the RETROGRAM Project: Developing a TEI-like Model for Croatian Grammar Books before Illyrism [PDF] [Video]

Tvrtko Balić: The CCRU as an Attempt of Doing Philosophy in a Digital World [PDF]

Darja Fišer, Tjaša Konovšek, Andrej Pančur: Referencing the Public by Populist and Non-Populist Parties in the Slovene Parliament [PDF] [Video]

Janez Križaj, Simon Dobrišek, Aleš Mihelič, Jerneja Žganec Gros: Uporaba postopkov strojnega učenja pri samodejni slovenski grafemsko-fonemski pretvorbi [PDF] [Video]

Matija Marolt, Mark Žakelj, Alenka Kavčič, Matevž Pesek: Poravnava zvočnih posnetkov s transkripcijami narečnega govora in petja [PDF] [Video]

Petra Matović, Katarina Radić: A Parallel Corpus of the New Testament: Digital Philology and Teaching the Classical Languages in Croatia [PDF]

Petya Osenova, Kiril Simov, Yura Konstantinova: Pre-Processing Terms in Bulgarian from Various Social Sciences and Humanities (SSH) Domains: Status and Challenges [PDF] [Video]

Henna Paakki, Faeze Ghorbanpour, Nitin Sawhney: An Approach to Computational Crisis Narrative Analysis: A Case-study of Social Media Narratives Around the COVID-19 Crisis in India [PDF]

Tadeja Rozman, Špela Arhar Holdt: Gradnja Korpusa študentskih besedil KOŠ [PDF] [Video]


Student papers

Špela Antloga: Korpusni pristopi za identifikacijo metafore in metonimije: primer metonimije v korpusu gKOMET [PDF]

Eva Boneš, Teja Hadalin, Meta Jazbinšek, Sara Sever, Erika Stanković: Neural Translation Model Specialized in Translating English TED Talks into Slovene [PDF]

David Bordon: Govoriš nevronsko? Kako ljudje razumemo jezik sodobnih strojnih prevajalnikov [PDF]

Jasna Cindrič, Lara Kuhelj, Sara Sever, Živa Simonišek, Miha Šemen: Data Collection and Definition Annotation for Semantic Relation Extraction [PDF]

Ružica Farmakovski, Natalija Tomić: Serbo-Croatian Wikipedia Between Serbian and Croatian Wikipedia [PDF]

Magdalena Gapsa: Ocenjevanje uporabniško dodanih sopomenk v Slovarju sopomenk sodobne slovenščine – pilotna študija [PDF]

Vili Grdič, Alja Križanec, Kaja Perme, Lea Turšič: Angleško-slovenska šahovska terminološka baza [PDF]

Katja Meden: Speech-level Sentiment Analysis of Parliamentary Debates using Lexicon-based Approaches [PDF]

Tina Mozetič, Miha Sever, Martin Justin, Jasmina Pegan: Evalvacijska kategorizacija strojno izluščenih protipomenskih parov [PDF]

Nina Sangawa Hmeljak, Anna Sangawa Hmeljak, Jan Hrastnik: Ilukana – aplikacija za učenje japonskih zlogovnih pisav hiragana in katakana s pomočjo asociacij [PDF]

Anja Vrečer: Filter nezaželene elektronske pošte za akademski svet [PDF] [Video]

Matjaž Zupanič, Maj Zirkelbach, Uroš Šmajdek, Meta Jazbinšek: Preparing a Corpus and a Question Answering System for Slovene [PDF]