Visegrad Fund project description

 

Output title: Gentle Introduction into Natural Language Processing and Corpus Linguistics
Output type: Event: International Training School
Description: This training school will contain:

search in text corpora via user interface export of results

– manipulation of texts by simple Python scripts

– access to selected linguistic tools and services via their APIs

– crash course in Python for linguists

On the 5th day we organize poster sessions/doctoral consortiums/mentoring sessions for students toto help them display their knowledge in the scientific discourse.

We contribute to the costs of the three students submitting the best application.

Venue: Czech Republic, Prague
Publicity: Public
Delivery date from: June 2019
Delivery date to: October 2019 with the actual training school week around September 2019

 

 

Output title: Gentle Introduction into Natural Language Processing for Scholars
Output type: Product: online study material
Description: This online course/study material will contain the scripts and instructions from the live training school on NLP and Corpus Linguistics. It will concentrate on NLP, since materials on corpus linguistics will be available from a different project soon (DigiLing from Erasmus+, http://www.digiling.eu/)
Venue: Czech Republic
Publicity: Public
Delivery date from: 01/06/2019
Delivery date to: 01/10/2019

 

 

Output title: Linked Data and the Semantic Web for Humanities research
Output type: Event: International Training School
Description: This international training school will contain:

An introduction to the concepts, technologies and tools of the Semantic Web and Linked data with a special focus on the Humanities

– Identifying standards and technologies

– RDF and triples

– Semantic querying: SPARQL and triple stores

– Knowledge Representation and Ontologies

– Ontologies for the Humanities (CIDOC-CRM, OLiA… )

On the 5th day we organize poster sessions/doctoral consortiums/mentoring sessions for students to help them display their knowledge in the scientific discourse.

We contribute to the costs of the three students submitting the best application.

Venue: Graz, Austria
Publicity: Public
Delivery date from: November 2018
Delivery date to: March 2019 (with the actual training school week around February 2019 )

 

 

Output title: Linked Data and the Semantic Web for Humanities research
Output type: Product: online study material
Description: This online course will contain the scripts and instructions from the live training school on Linked Data and the Semantic Web.
Venue: Austria
Publicity: Public
Delivery date from: 01/11/2018
Delivery date to: 01/03/2019

 

 

Output title: Digital Philology
Output type: Event: International training school
Description: This training school will contain:

An introduction to digital philology.

Scholarly/critical textual philology has a history dated back to the famous Ancient Library of Alexandria. The process of comparing text versions and providing references to other texts is one of the earliest example of building a network to make the archived cultural heritage available and reusable to a wider public. Still, this kind of textual matrix is underrepresented in current projects of digital cultural heritage. The reason: while in principle digital technology is eminently capable of producing and displaying textual networks, the practice shows us that the fast changing of digital infrastructure and environment makes it very hard to find stable and durable solutions that are suitable to provide standardized framework for such textual practices.

We tackle:

– Theoretical and practical issues – a praxeological perspective on recent debates on the materiality of literature.

– Old and new philology:

  • Versions, variability and editorial interventions
  • Three basic concepts: the “work”, the “text” and the “document”

– Encoding, markup and the logic of the XML

  • Standard Generalized Markup Language (SGML)
  • Text Encoding Initiative (TEI) XML
  • XSLT and search queries

On the 5th day we organize poster sessions/doctoral consortiums/mentoring sessions for students to help them display their knowledge in the scientific discourse.

We contribute to the costs of the three students submitting the best application.

Venue: Budapest, Hungary
Publicity: Public
Delivery date from: November 2019
Delivery date to: March 2020 (with the actual training school week around February 2020 )

 

 

Output title: Digital Philology
Output type: product: online study material
Description: This online course and online study material introduces theories, exercises and methods used in humanities to encode, research, preserve and distribute scholarly texts online. It focuses on such text encoding methodologies as the Text Encoding Initiative based XML, its historical background and practical usage.

The online study material will consist of three main parts, which will be divided into further lessons:

The first part will be the introduction to digital scholarly editing, to metadata standards and mark-up languages: what these concepts mean, what their backgrounds are, and what their main uses and features are. Formation, structure and form of the XML, as well as the TEI standard used in the rest of the course will also be presented.

The second part will describe the XML, its building blocks, rules, and structure and will introduce the concept of schema and DTDs to illustrate how to model a document using DTD and XML. This part of the course will also give practical knowledge on such XML operations as transformations via stylesheets (e.g. XSLT) and search tools (e.g. SOLR, XPath).

The third part will provide an opportunity to apply the previously learned knowledge. It will begin with a brief overview of the TEI Guide and with exercises that serve the practical application of previously learned principles.

Venue: Hungary
Publicity: Public
Delivery date from: 01/11/2019
Delivery date to: 01/03/2020

 

 

Output title: Digital Projects in the Humanities in V4 Countries
Output type: online survey
Description:
  • The aim of this task is to conduct an online survey in V4 countries to gather the information about the existing and accomplished digital projects in the humanities. Since the lack of information may be perceived as the main obstacle for the progress of Digital Humanities, as many similar projects are often developed in isolation. The proposed task offers to bridge this gap. Moreover, a comprehensive survey of such projects in Visegrad countries, will stimulate a regional cooperation between similar projects developed in different V4 countries.
  • The questionnaire on digital projects in V4 will be based on an earlier survey conducted in Poland by PSNC & IBL PAN in 2014. The questions will be revised in order to cover:
  • 1. basic data about the project (key personnel, website, title, description, contact person);
  • 2. basic information in English (title, description)
  • 3. information about the project encoded onto standardised NeMO ontology of tools and methods
  • 4. Geographical information which would help in mapping the projects.
  • In order to receive the best results, the survey should be conducted in national languages, with some information collected also in English.
  • As a result a comprehensive dataset about DH projects will be gathered and made available via a trusted repository. The standardised format of the data will allow for its further reuse in other projects mapping DH tools, courses and services across the Europe (e.g. http://teresah.dariah.eu/) and will contribute to the overall visibility of V4 countries scientific output in ERA.
  • The survey will be conducted by IBL PAN and PSNC in cooperation with other partners.
Venue: Poland
Publicity: Public
Delivery date from: 01/10/2018
Delivery date to: 01/02/2020