CLARIN Tool Portal

Active filters:

Project: CLARIN in the Netherlands

72 record(s) found

Search results

DSS: Dutch Ships and Sailors

A tool chain and methodology for converting legacy datasets in the area of maritime history. Set up to facilitate over 25 data sets, the initial population consists of 4 selected maritime-historical datasets. The maritime industry has been central to regional and global economic, social and cultural exchange. It is also one of the best historically documented sectors of human activity. Many aspects of it have been recorded by shipping companies, governments, newspapers and other institutions. In the past few decades, much of the data in the preserved historical source material has been digitized. Among the most interesting data are those on shipping movement and crew members. The Dutch Republic in the 17th and 18th centuries had to rely to a large extent on immigration to man its fleet. Especially in Asian waters, it also relied on Asian crews. Often information that deals with the same shipping movements and crew composition is spread over several historical sources and hence over several databases. The data often refers to the same ‘places’, ‘ships’, ‘persons’ and ‘events’. By linking the different available databases, the data complements and amplifies each other, and new research possibilities open up. Ideally, we would want to follow a ship from port to port, and crew members pursuing their careers from ship to ship. The Dutch Ships and Sailors project provides a tool chain and methodology for converting legacy datasets. The infrastructure includes common vocabularies to normalize and enrich existing data. Links are established between the datasets and to other relevant datasets. In doing so, Dutch Ships and Sailors builds a (semantic) web-based structure that aims to function as a future platform and infrastructure for maritime historical datasets. Initially, this portal contains the following datasets: - Historische Kranten of the Koninklijke Bibliotheek; - The Monsterrollen databases contains elaborate data on the crew composition of ships from the Northern Netherlands (c. 1800-1930) and provides information on the sailors involved, such as the places of origin, wage and age; - The databases VOC Opvarenden, providing extensive data on crews of VOC ships leaving the Republic; - The database Dutch-Asiatic Shipping, providing data on all inter-continental voyages of VOC ships; - The database Generale Zeemonsterrollen, providing data on the crew composition and sometimes location of VOC ships stationed in Asia and not engaged in inter-continental shipping.

Victor de Boer, Jur Leinenga, Matthias van Rossum and Rik Hoekstra. Dutch Ships and Sailors Linked Data Cloud. AcIn Proceedings of the International Semantic Web Conference (ISWC 2014), 19-23 October, Riva del Garda, Italy, 2014.

A. Bravo Balado. Information extraction on newspaper archives for historical research. a dutch maritime history case study. M.Sc. thesis VU University Amsterdam (forthcoming), 2014.

Andrea Bravo Balado, Victor de Boer, and Guus Schreiber. Linking historical ship records to a newspaper archive. Proceedings of the 6th International Conference on Social Informatics (workshops). LNCS. ed. Luca Maria Aiello, Daniel McFarland, 2014.

R.Ponstein. Reconciling dutch ships and sailors. M.Sc. thesis VU University Amsterdam, 2014.

Use "DSS: Dutch Ships and Sailors"
WAHSP/BILAND: web application for (bilingual) historical sentiment mining in public media

WAHSP/BLAND has been succeeded by TexCavator: http://texcavator.surfsaralabs.nl/

WAHSP/BILAND is a research tool for historians that uses textual data of news media from the period 1863-1940 of the Koninklijke Bibliotheek and Staatsbibliothek zu Berlin as input material. One can search with single query terms or with combinations thereof. Apart from showing the articles that match the query, the results can be visualized by word clouds of single articles together with sentiment words highlighted, or by a word cloud of the whole result set together with newspaper statistics derived from their metadata. WAHSP/BILAND enables historians to collect and process large bi-lingual (Dutch and German) sets of opinionated text-data from news media and extract discourse identity and intensity patterns in two different countries with different scripts (e.g. Latin and Gothic). This tool offers a unique opportunity for non-technical humanities researchers to perform a new kind of historical e-research for studying changing opinions, notions and perceptions regarding public health and policy issues. The text mining tools for opinion/sentiment extraction that form the technological base for WAHSP/BILAND have been developed within the NTU/STEVIN DuOMAn project. The technology includes algorithms and tools for identification of polarity (positive/support or negative/criticism), sources (opinion-holders), frequency of items and specific targets of discourses. The tools and subjectivity lexicons are implemented as modules of ‘Fietstas’ 2, an web service for text analysis. Fietstas also provides other essential text processing modules (morphological normalization, format and encoding reconciliation, named entity recognition and normalization, etc.) and visualization modules (interactive word clouds and timelines). Fietstas has been developed and is being used for processing of large-scale datasets in the context of several projects, such as DuOMAn. A text translation service based on Machine Learning can be used to translate existing lexicons and documents between Dutch and German (both directions). The web application uses this functionality of Fietstas to leverage interactive creation, expansion and refinement of lexicons specific to the user’s research questions and needs. For BILAND new bilingual and biscriptural lexicons have been developed. The application uses the visualization features of Fietstas to allow users to examine the research domain along the dimensions of time, context, and the identity and frequency of the discourse. WAHSP/BILAND is meant to be generic and testable in all domains, where analysis of topics, contexts and attitudes in large volumes of text is needed.

Snelders, S, Huijnen, P, Verheul, J, de Rijke, M and Pieters. T. 2017. A Digital Humanities Approach to the History of Culture and Science: Drugs and Eugenics Revisited in Early 20th-Century Dutch Newspapers, Using Semantic TextMining. In:Odijk, J and van Hessen, A. (eds.) CLARIN in the Low Countries, Pp. 325–336. London: Ubiquity Press. DOI: https://doi.org/10.5334/bbi.27. License: CC-BY 4.0

Result filters

Metadata provider

Language

Resource type

Availability

Organisation

Project

Active filters:

Search results

DSS: Dutch Ships and Sailors

WAHSP/BILAND: web application for (bilingual) historical sentiment mining in public media

Result filters

Metadata provider

Language

Resource type

Availability

Organisation

Project

Active filters:

Search results

DSS: Dutch Ships and Sailors

WAHSP/BILAND: web application for (bilingual) historical sentiment mining in public media

Session recording