Exploring Large-Scale Digital Archives – Opportunities and Limits to Use Unsupervised Machine Learning for the Extraction of Semantics

Van Hooland, Seth <Vrije Universiteit Brussel>; Coeckelbergs, Mathias <Université libre de Bruxelles>

dc.date.accessioned	2022-06-01T14:42:40Z
dc.date.available	2022-06-01T14:42:40Z
dc.description.abstract	The current excitement in regards to machine learning has spurred enthusiasm amongst collection holders and historians alike to rely on algorithms to reduce the amount of manual labor required for management and appraisal of large volumes of non-structured archival content. The Digital Humanities and commercial archival software promote out-of-the-box tools for auto-classification, but is the adoption of machine learning as straightforward as it is currently presented in both the popular press and the Digital Humanities literature? This chapter brings a sense of pragmatism to the debate by giving an overview of both possibilities and limits of machine learning to extract semantics from large collections of digitized textual archives. Two methods have gained substantial popularity: Topic Modeling (TM) and Word Embeddings (WE). This chapter introduces these non-supervised machine learning methods to the community of historians, based on an experimental case-study of digitized archival holdings of the European Commission (EC).	it_IT
dc.language.iso	en	it_IT
dc.relation.ispartof	De Gruyter Reference	it_IT
dc.rights	All rights reserved, Walter de Gruyter GmbH, Berlin/Boston	it_IT
dc.identifier.citation	Seth van Hooland, Mathias Coeckelbergs, "Exploring Large-Scale Digital Archives – Opportunities and Limits to Use Unsupervised Machine Learning for the Extraction of Semantics", in Handbook of Digital Public History, edited by Serge Noiret, Mark Tebeau and Gerben Zaagsma, Berlin, Boston: De Gruyter Oldenbourg, 2022, pp. 517-530	it_IT
dc.title	Exploring Large-Scale Digital Archives – Opportunities and Limits to Use Unsupervised Machine Learning for the Extraction of Semantics	it_IT
dc.source	UniSa. Sistema Bibliotecario di Ateneo	it_IT
dc.contributor.author	Van Hooland, Seth <Vrije Universiteit Brussel>
dc.contributor.author	Coeckelbergs, Mathias <Université libre de Bruxelles>
dc.date.issued	2022
dc.identifier.uri	http://dx.doi.org/10.14273/unisa-4256
dc.identifier.uri	http://elea.unisa.it:8080/xmlui/handle/10556/6164
dc.identifier.uri	https://doi.org/10.1515/9783110430295-046	it_IT
dc.type	Book chapter	it_IT
dc.format.extent	P. 517-530	it_IT
dc.identifier.isbn	e-ISBN: 978-3-11-043029-5
dc.identifier.isbn	978-3-11-043922-9	it_IT
dc.subject	Machine learning	it_IT
dc.subject	Metadata	it_IT
dc.subject	Information extraction	it_IT
dc.subject	Big data	it_IT
dc.subject	Digital humanities	it_IT
dc.subject	Digital archives	it_IT
dc.publisher.alternative	S. van Hooland, M. Coeckelbergs, "Exploring Large-Scale Digital Archives – Opportunities and Limits to Use Unsupervised Machine Learning for the Extraction of Semantics", in Handbook of Digital Public History, Berlin, Boston: De Gruyter Oldenbourg, 2022, pp. 517-530	it_IT

This item appears in the following collections

Contributi in volume / Contributions in books
