French vital records data gathering and analysis through image processing and machine learning algorithms - Université de Technologie de Belfort-Montbeliard Accéder directement au contenu
Article Dans Une Revue Journal of Data Mining and Digital Humanities Année : 2021

French vital records data gathering and analysis through image processing and machine learning algorithms

Résumé

Vital records are rich of meaningful historical data concerning city as well as countryside inhabitants that can be used, among others, to study former populations and then reveal the social, economic and demographic characteristics of those populations. However, these studies encounter a main difficulty for collecting the data needed since most of these records are scanned documents that need a manual transcription step in order to gather all the data and start exploiting it from a historical point of view. This step consequently slows down the historical research and is an obstacle to a better knowledge of the population habits depending on their social conditions. Therefore in this paper, we present a modular and self-sufficient analysis pipeline using state-of-the-art algorithms mostly regardless of the document layout that aims to automate this data extraction process.
Fichier principal
Vignette du fichier
French_vital_records_data_gathering_and_analysis_through_image_processing_and_machine_learning_algorithms.pdf (13.16 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03189188 , version 1 (02-04-2021)
hal-03189188 , version 2 (18-06-2021)
hal-03189188 , version 3 (14-07-2021)

Identifiants

Citer

Cyprien Plateau-Holleville, Enzo Bonnot, Franck Gechter, Laurent Heyberger. French vital records data gathering and analysis through image processing and machine learning algorithms. Journal of Data Mining and Digital Humanities, In press, 2021, ⟨10.46298/jdmdh.7327⟩. ⟨hal-03189188v3⟩
577 Consultations
583 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More