Mostra i principali dati dell'item
Exploring Large-Scale Digital Archives – Opportunities and Limits to Use Unsupervised Machine Learning for the Extraction of Semantics
dc.contributor.author | Van Hooland, Seth <Vrije Universiteit Brussel> | |
dc.contributor.author | Coeckelbergs, Mathias <Université libre de Bruxelles> | |
dc.date.accessioned | 2022-06-01T14:42:40Z | |
dc.date.available | 2022-06-01T14:42:40Z | |
dc.date.issued | 2022 | |
dc.identifier.citation | Seth van Hooland, Mathias Coeckelbergs, "Exploring Large-Scale Digital Archives – Opportunities and Limits to Use Unsupervised Machine Learning for the Extraction of Semantics", in Handbook of Digital Public History, edited by Serge Noiret, Mark Tebeau and Gerben Zaagsma, Berlin, Boston: De Gruyter Oldenbourg, 2022, pp. 517-530 | it_IT |
dc.identifier.isbn | 978-3-11-043922-9 | it_IT |
dc.identifier.isbn | e-ISBN: 978-3-11-043029-5 | |
dc.identifier.uri | https://doi.org/10.1515/9783110430295-046 | it_IT |
dc.identifier.uri | http://elea.unisa.it:8080/xmlui/handle/10556/6164 | |
dc.identifier.uri | http://dx.doi.org/10.14273/unisa-4256 | |
dc.description.abstract | The current excitement in regards to machine learning has spurred enthu-siasm amongst collection holders and historians alike to rely on algorithms to re-duce the amount of manual labor required for management and appraisal of largevolumes of non-structured archival content. The Digital Humanities and commer-cial archival software promote out-of-the-box tools for auto-classification, but is theadoption of machine learning as straightforward as it is currently presented in boththe popular press and the Digital Humanities literature? This chapter brings a senseof pragmatism to the debate by giving an overview of both possibilities and limitsof machine learning to extract semantics from large collections of digitized textualarchives. Two methods have gained substantial popularity: Topic Modeling (TM)and Word Embeddings (WE). This chapter introduces these non-supervised ma-chine learning methods to the community of historians, based on an experimentalcase-study of digitized archival holdings of the European Commission (EC). | it_IT |
dc.format.extent | P. 517-530 | it_IT |
dc.language.iso | en | it_IT |
dc.publisher | S. van Hooland, M. Coeckelbergs, "Exploring Large-Scale Digital Archives – Opportunities and Limits to Use Unsupervised Machine Learning for the Extraction of Semantics", in Handbook of Digital Public History, Berlin, Boston: De Gruyter Oldenbourg, 2022, pp. 517-530 | it_IT |
dc.relation.ispartof | De Gruyter Reference | it_IT |
dc.rights | Diritti riservati Walter de Gruyter GmbH, Berlin/Boston | it_IT |
dc.source | UniSa. Sistema Bibliotecario di Ateneo | it_IT |
dc.subject | Machine learning | it_IT |
dc.subject | Metadata | it_IT |
dc.subject | Information extraction | it_IT |
dc.subject | Big data | it_IT |
dc.subject | Digital humanities | it_IT |
dc.subject | Digital archives | it_IT |
dc.title | Exploring Large-Scale Digital Archives – Opportunities and Limits to Use Unsupervised Machine Learning for the Extraction of Semantics | it_IT |
dc.type | Book chapter | it_IT |