September 24-25 2020
The Slovenian Language Technologies Society (SDJT), the Centre for Language Resources and Technologies at the University of Ljubljana (CJVT), the Institute of Contemporary History (INZ) and the research infrastructures CLARIN.SI and DARIAH-SI have organised the conference “Language Technologies and Digital Humanities” on 24th and 25th September 2020. The conference has more than a 20-year tradition, and in 2016, the thematic expansion to digital humanities was introduced. The virtual conference was supported by CLARIN ERIC.
- Proceedings of the conference
- Conference schedule and a photo of the virtual participants
- Invited speakers
- Panel “Development of Slovene in a Digital Environment”
- Programme committee
Before the conference:
- Thematic areas of the conference
- Important dates
- Instructions for authors
- Information for presenters
In the scope of JT-DH 2020 the following awards were given:
- The comittee award for the best student paper:
Zoran Fijavž: Ambivalence of Queer Visibility in Video-Based Social Media Content. [PDF] [Video]
- The award by the public for the best recorded video presentation:
Špela Antloga: Korpus metafor KOMET 1.0. [PDF] [Video]
“Abusive Language Detection: Too much Digital, not enough Humanities?”
Social media messages are often written to attack specific groups of users based on their religion, ethnicity or social status, and they can be particularly threatening to vulnerable users such as teenagers. While it is important to develop tools and algorithms that automatically detect abusive messages, so that they can be deleted and hateful accounts can be blocked, it is also important to understand the shared language used by online communities, to what extent offensiveness can be generalized, and what is the role of dataset annotation in avoiding biases in online abuse detection. This talk gives an overview of current state-of-the-art approaches in abusive language detection and discusses ongoing work to better understand this phenomenon, analyzing the role of communities, textual context and stakeholders’ involvement.
Sara Tonelli is the head of the Digital Humanities research group at Fondazione Bruno Kessler in Trento, and adjunct professor of Language Interfaces at the Dept. of Psychology and Cognitive Science, University of Trento. She got a PhD in Language Sciences from Università Ca’ Foscari in Venice in 2010, after which she joined FBK first as a post-doc in the Natural Language Processing group and then as a tenured researcher leading the newly founded DH group. She has been involved in several national and European projects dealing with historical archives, event and temporal information extraction and political stance detection and, more recently, with social media monitoring and hate speech detection. Her research interests are highly interdisciplinary, trying to apply and adapt advanced text analysis approaches to social sciences and historical investigation.
Speaking on behalf of others
Why the digital humanities should care about parliamentary data
Over the last decades, these (almost verbatim) proceedings have become increasingly available in digital format, spawning immense historical corpora, which contain millions of words, often spanning multiple centuries of political debate. These data give a unique insight into the language and worldview of members of parliaments, and the constituents they aimed to represent, thus providing a detailed account on almost every issue that moved public opinion at some point in time.Digital parliamentary proceedings constitute an invaluable, but still under-explored collection for digital historians. This talk gives an overview of emerging approaches to parliamentary data that combine computer science and history. It firstly covers how speeches of MPs allow scholars to study different aspects of language, form sentiment, or sophistication, to use of humour. Secondly, it discusses how the computational analysis of parliamentary debates contributes to answering critical historical questions, such as the role of women in national politics or the changing shape of political representation.
Kaspar Beelen is a digital historian, who explores the application of machine learning to humanities research. After obtaining his PhD in History (2014) at the University of Antwerp he worked as postdoctoral fellow at the University of Toronto. As researcher on the Digging into Linked Parliamentary Data (Dilipad) project, he published several papers situated at the interface of data science, political science and history, which explored a wide range of topics, including: the representation of women in Westminster, the evolution of public health discourse, and the use of affect in parliamentary language. In 2016, Kaspar moved to the University of Amsterdam where he first worked as a postdoc at the Informatics Institute, and later became assistant professor in Digital Humanities (Media Studies). Since February 2019, he works at the Alan Turing Institute as research associate for the Living with Machines project.
Panel presentation of the project “Development of Slovene in a Digital Environment”
Between 2020 and 2022, the national project “Development of Slovene in a Digital Environment” is being carried out. Its main focus is the development of computational tools and services in the field of language technologies for Slovene. Its outcomes will include new language resources, tools and applications for processing Slovene, such as speech recognition, transcription, machine translation, a terminology portal, and terminology extractors.
The panel presented the goals and activities of the project, with some time for discussion. The panel was held in Slovene, but the video recording with English subtitles is planned to be made available later for an international audience.
Thematic areas of the conference
The conference aimes to bring together researchers from various backgrounds and methodological frameworks. The main topics will include but are not limited to:
- Speech and other mono- and multilingual language technologies
- Digital linguistics: translation studies, corpus linguistics, lexicology and lexicography, standardisation
- Digital humanities and historical studies, ethnology, literary studies, musicology, cultural heritage, archaeology, and fine arts
- Digital humanities in education and digital publishing
We welcome submissions that present guidelines, research, good practices, projects and results in these areas. The conference will also include invited lectures, a student section, and roundtables on topics related to the conference. The official languages of the conference will be Slovene and English.
Authors of selected papers will be invited to extend their JTDH 2020 papers for a special English issue of the journal Slovenščina 2.0.. The journal is indexed in the Scopus database. Each of the selected papers will go through the journal’s regular review procedure and should be prepared according to the journal’s guidelines.
- June 1st, 2020: Extended deadline for submission of papers and extended abstracts
- July 15th, 2020: Notification of acceptance
- September 1st, 2020: Notification of about the format of the conference
- September 10th, 2020: Submission of final papers
- September 24th-25th, 2020: CONFERENCE
Instructions for authors
The authors are invited to submit either a full paper or an extended abstract describing work to be presented at the conference. The extended abstract will be published in the book of abstracts and the full papers in the conference proceedings, which will be published on the conference website under the Creative Commons license at the beginning of the conference. We leave it up to the authors whether to submit their contribution anonymized or not.
The official languages of the conference are Slovene and English.
The extended abstracts should be 2-4 pages long and the full papers 6–8 pages. The camera-ready papers and extended abstracts should be formatted according to the conference guidelines:
- extended abstract: example, Word template
- full paper: example, Word template, LaTeX template
- templates are also avilable for papers written in Slovene; you can find them on the Slovene page of the conference.
The contributions are collected using EasyChair by clicking on this link.
The authors of the contributions should indicate if it is a student contribution by adding “student paper” to the list of keywords. All the co-authors of student papers should be students. These papers will be presented in a separate student session and will be eligible for the best student paper award.
Attendance at the virtual conference is free of charge for all participants but registration is required in order to receive an invitation to the Zoom meeting room: registration form.
To ensure that the conference is as accessible as possible to everyone, the conference will have an asynchronous and a synchronous component:
- The conference proceedings as well as the recordings of the presentations of the accepted papers will be published on the conference website in advance so that everyone can view them on their own time. All paper presentations have to be pre-recorded and sent to the organizers. For more information how to submit a pre-recorded video see information below.
- During the conference, there will be a series of live moderated thematic discussion sessions on the Zoom platform during which registered participants will be able to ask questions and discuss the papers.
The schedule of the conference is available here.
Information for presenters
Submitting pre-recorded video presentations
Simple solutions, such as Zoom video recording or PowerPoint record slideshow can be used for creating video of your presentation. Other means of recording can also be used as long as you meet their requirements. For recording presentations in Zoom or Powerpoint see the tutorials:
The length of presentations:
- Short presentations: pre-recorded presentation no longer than 10 min
- Long papers: pre-recorded presentation no longer than 15 min
The language of presentations:
- The presentation of your paper can be recorded in either Slovene or English, whichever you prefer
- The live discussion session will be conducted in English
The creation of concise, focused, thought-provoking presentations is encouraged. Speakers should introduce themselves briefly at the beginning of the recording.The slides should be seen in the video. Although it is important to see slides, we will be happy to see you too.
While recording the video, please use a microphone (the sound is even more important than camera).
- Simon Dobrišek (SDJT, FE)
- Nataša Frank (INZ)
- Darja Fišer (CJVT)
- Jerneja Fridl (DARIAH-SI, ZRC SAZU)
- Jurij Hadalin (DARIAH-SI, INZ)
- Katja Meden (CLARIN)
- Mihael Ojsteršek (DARIAH-SI, INZ)
- Mojca Šorn (predsednik, DARIAH-SI, INZ)
- Darja Fišer, Faculty of Arts, University of Ljubljana (CJVT, chair)
- Simon Dobrišek, Faculty of Electrical Engineering, University of Ljubljana (SDJT)
- Tomaž Erjavec, Jožef Stefan Institute (CLARIN.SI)
- Andrej Pančur, Inštitut for contemporary history (DARIAH-SI)
- Ajda Pretnar, Faculty for computer science and informatics, University of Ljubljana
- Špela Arhar Holdt, PhD, Faculty of Arts, University of Ljubljana
- Assoc. Prof. Zoran Bosnić, Faculty of Computer Information Science, University of Ljubljana
- Asst. Prof. Narvika Bovcon, Faculty of Computer Information Science, University of Ljubljana
- Asst. Prof. Václav Cvrček, Institute of the Czech National Corpus, Charles University in Prague
- Assoc. Prof. Simon Dobrišek, Faculty of Electrical Enginnering, University of Ljubljana
- Asst. Prof. Helena Dobrovoljc, Fran Ramovš Institute of the Slovenian Language, ZRC SAZU
- Asst. Prof. Darja Fišer, Faculty of Arts, University of Ljubljana and Jožef Stefan Institute (SDJT, chair)
- Jerneja Fridl, PhD, ZRC SAZU
- Polona Gantar, PhD, Faculty of Arts, University of Ljubljana
- Vojko Gorjanc, PhD, Faculty of Arts, University of Ljubljana
- Jurij Hadalin, PhD, Institute of Contemporary History
- Asst. Prof. Mario Hibert, Faculty of Arts, University of Sarajevo
- Prof. Miran Hladnik, Faculty of Arts, University of Ljubljana
- Prof. Ivo Ipšić, Faculty of Engineering, University of Rijeka
- Asst. Prof. Mateja Jemec Tomazin, Fran Ramovš Institute of the Slovenian Language, ZRC SAZU
- Prof. Zdravko Kačič, Faculty of Electrical Engineering and Computer Science, University of Maribor
- Mojca Kotar, PhD, University of Ljubljana
- Simon Krek, PhD, Jožef Stefan Institute / Centre for Language Resources and Technologies, University of Ljubljana
- Prof. Cvetana Krstev, Faculty of Philology, University of Belgrade
- Asst. Prof. Drago Kunej, Institute of Ethnomusicology, ZRC SAZU
- Asst. Prof. Nikola Ljubešić, Department of Information and Communication Sciences, Faculty of Humanities and Social Sciences, University of Zagreb
- Asst. Prof. Nataša Logar, Faculty of Social Sciences, University of Ljubljana
- Assoc. Prof. Matija Marolt, Faculty of Computer Information Science, University of Ljubljana
- Prof. France Mihelič, Faculty of Electrical Engineering, University of Ljubljana
- Asst. Prof. Maja Miličević, Faculty of Philology, University of Belgrade
- Prof. Dunja Mladenić, Artificial Intelligence Laboratory, Jožef Stefan Institute
- Asst. Prof. Matija Ogrin, PhD, Institute of Slovene Literature and Literary Sciences, ZRC SAZU
- Andrej Pančur, PhD, Institute of Contemporary History
- Asst. Prof. Dan Podjed, Institute of Slovenian Ethnology, ZRC SAZU
- Senja Pollak, PhD, Jožef Stefan Institute
- Prof. Marko Robnik-Šikonja, Faculty of Computer Information Science, University of Ljubljana
- Tanja Samardžić, PhD, University of Zurich
- Miha Seručnik, PhD, Milko Kos Historical Institute, ZRC SAZU
- Prof. Marko Stabej, Faculty of Arts, University of Ljubljana
- Mojca Šorn, PhD, Institute of Contemporary History
- Asst. Prof. Janez Štebe, Faculty of Social Sciences, University of Ljubljana
- Assoc. Prof. Aleš Vaupotič, Research Centre for Humanities, University of Nova Gorica
- Prof. Špela Vintar, Faculty of Arts, University of Ljubljana
- Prof. Jerneja Žganec Gros, Alpineon Ltd., Slovenia