Portes Ouvertes de l'EPFL

You need to register and log in to access this functionality

Impresso: explore 200 years of newspaper archives

Apr 30, 20237:00 AM - 3:00 PM

STCC - niveau Garden, halls 3 & 4

Description

Historical newspapers constitute an extremely rich and varied historical source, but they are difficult to exploit (there are many pages to read!) and they very often remain isolated in 'institutional silos' (the archival collections are disconnected from one others). ‘Impresso. Media Monitoring of the Past' used text mining techniques to enrich a corpus of almost 100 newspapers in French, German and Luxembourgish, developing a new web interface to facilitate data exploration inspired by historical research practices . To accomplish this, an interdisciplinary team composed of computer scientists, designers and historians worked closely together to meet the challenges of accessing data from the past, namely: obtaining good performance despite historical documents that are difficult to process with automatic tools, applying the tools to very large volumes of data, and designing an appropriate interface to facilitate the work of historians. Beyond these specific challenges, the question of how best to adapt text mining tools and their use by humanities researchers is at the heart of the Impresso project. The interface is accessible free of charge to all and can also be useful to journalists, genealogists or teachers who are interested in information extracted from historical newspapers. All project results (interface, annotations, tools and system architecture) are published under a free license. And the adventure continues from September 2023, with a second project and the addition of radio sources!

Activity Type

Booth
Demo

Target audience

Children (4-6 years)
Children (7-10 years)
Children (11-13 years)
Young people (14-16 ans)
Adults

Language

French

Organized by

component bloc not found

Where to find this activity on campus?

Activities sessions