The SSH Training Discovery Toolkit provides an inventory of training materials relevant for the Social Sciences and Humanities.

Use the search bar to discover materials or browse through the collections. The filters will help you identify your area of interest.

 

English

Item
Title Body
Voices of the Parliament: A Corpus Approach to Parliamentary Discourse Research

While corpus methods are widely used in linguistics, including gender analysis, this tutorial shows the potential of richly annotated language corpora for research of the socio-cultural context and changes over time that are reflected through language use. The tutorial encourages students and scholars of modern languages, as well as users from other fields of digital humanities and social sciences who are interested in the study of socio-cultural phenomena through language, to engage with user-friendly digital tools for the analysis of large text collections. The tutorial is designed in such a way that it takes full advantage of both linguistic annotations and the available speaker and text metadata to formulate powerful quantitative queries that are then further extended with manual qualitative analysis in order to ensure adequate framing and interpretation of the results.

The tutorial demonstrates the potential of parliamentary corpora research via concordancers without the need for programming skills. No prior experience in using language corpora and corpus querying tools is required in order to follow this tutorial. While the same analysis could be carried out on any parliamentary corpus with similar annotations and metadata, in this tutorial we will use the siParl 2.0 corpus which contains parliamentary debates of the National Assembly of the Republic of Slovenia from 1990 to 2018. Knowledge of Slovenian is not required to follow the tutorial. To reproduce the analyses in other languages, we invite you to explore a parliamentary corpus of your choice from those available through CLARIN.

 

Taken from: Teaching with CLARIN: 

Privacy by Design in Research

This is privacy by design in research! The aim of this training material is to show researchers how to perform a data protection impact assessment for an innovative research scenario to enable responsible re-use of archived speech corpora.

The project website provides help to develop learning goals.

 

Taken from: Teaching with CLARIN: https://www.clarin.eu/content/privacy-design-research

Introduction to Speech Analysis

This course offers a general picture of managing speech corpora and of the methods that are available for the acoustic-phonetic study of speech. During the course, students use a speech analysis program called Praat and learn to apply the main features of the program in their own work with speech recordings. In addition, students will learn the basics of another program called ELAN that can be used for transcribing and annotating audio as well as video material.

Taken from: Teaching with CLARIN: https://www.clarin.eu/content/introduction-speech-analysis

Introduction to Digital Humanities

The aim of the course is to introduce digital humanities and to describe various aspects of digital content processing. The practical aims consist of introducing current data sources, annotation, pre-processing methods, software tools for data analysis and visualisation, and evaluation methods.

Currently, we identified that students are somewhat aware of digital humanities but it is difficult for them to dive in and, mainly, to anticipate what they should learn for their future research. A more detailed goal of this course is to present some current projects, show the datasets and technologies behind, and encourage students to explore the datasets and use the technologies on data they already know. A high level goal is to set the knowledge of the technologies and available datasets into the research iteration loop (create hypotheses -> design instruments -> collect data -> analyze and evaluate).

 

Taken from: Teaching with CLARIN: https://www.clarin.eu/content/introduction-digital-humanities

GATE Training Course

The training materials are all based around teaching the use of GATE, a freely available open-source toolkit for Natural Language Processing that has been widely used in both academia and industry for many different tasks.

The modules provide instruction on how to get to grips with the GATE toolkit for basic language processing, as well as more advanced techniques, and include a number of different scenarios, such as processing social media, hate speech and misinformation detection. They include modules both for programmers who want to further develop their own tools within the toolkit, and for non-programmers who want to just make use of existing tools. The modules teach not only the use of GATE itself, but also how to adapt it to one’s own needs (for example, to adapt English tools to a different language, or how to customise existing tools), and also the basic concepts around a number of language processing tasks including both low-level (tokenisation, POS tagging, parsing) to more sophisticated (information extraction, social media analysis, hate speech detection, misinformation detection), as well as how to interpret and integrate the results of the processing. Finally, it teaches programmers how to extend the toolkit itself, by adding new tools or integrating it into other systems.

 

Taken from Teaching with CLARIN: https://www.clarin.eu/content/gate-training-course 

Computational Morphology with HFST

The course demonstrates how HFST tools can be used for generating finite-state morphologies. Through practical exercises, students will learn how to use finite-state methods to develop a morphology for a language. This online course is suitable as a complement to a more theory or linguistics-oriented course on morphology.

After successfully completing the course:

- you can explain the basic theory on finite-state automata and transducers,

- you can design morphological lexica using finite-state technology,

- you know how to write morpho-phonological rules in a finite-state framework,

- you understand the diversity of morphological structure in different languages

 and you know how to take these differences into account when designing computational models of morphology.

 

Taken from Teaching with CLARIN: https://www.clarin.eu/content/computational-morphology-hfst 

Archilochus of Paros: Elegiac Fragments – XML Archive

Goals and objectives of the training materials:

  • to improve textual criticism on ancient Greek fragmentary texts (research skills and data acquisition skills: research data management + text analytics)
  • to improve competence on text annotation (research skills and data acquisition skills: analytical thinking + text annotation);
  • to reach a reliable corpus for a digital scholarly edition of an ancient Greek poet (research skills and data handling: research methods + data repositories + data formats and standards).

Taken from Teaching with CLARIN: https://www.clarin.eu/content/archilochus-paros-elegiac-fragments-xml-archive

Applied Language Technology

These learning materials consist of a two-course module that seeks to provide humanities majors with a basic understanding of language technology and the practical skills needed to apply language technology using Python. The module is intended to empower the students by showing that language technology is both accessible and applicable to research in the humanities.

 

Taken from: Teaching with CLARIN 

https://www.clarin.eu/content/applied-language-technology 

Teaching ideas: Guides for teaching data analysis

This resource is a collection of short guides designed to make lesson planning more efficient for those teaching data analysis skills. Drawing on real classroom experiences, each guide includes suggested research questions, dataset and exercises:

  • Gender differences in sexual attitudes (PDF) (using the National Survey of Sexual Attitudes and Lifestyles).
  • Risk factors associated with increased levels of systolic blood pressure (PDF) (using the Health Survey for England).
  • The gender gap in life satisfaction (PDF) (using the Opinions and Lifestyle Survey).
  • Public confidence in the police (PDF) (using the Crime Survey for England and Wales).
Building skills in quantitative methods and statistical software

A collection of quantitative methods e-books and accompanying quizzes for direct use in teaching students or for self-study. E-books aim to build skills in quantitative methods and statistical software and use the Living Costs and Food Survey.

The e-books have been developed through a collaboration of the UK Data Service, National Centre for Research Methods (NCRM), and the Centre for Multi-Level Modelling at the University of Bristol and were created using the StatJR software based on original outputs from the project Using Statistical E-books to teach undergraduate students quantitative methods and statistical software funded by the British Academy.