Using Machine Learning to Enhance Archival Processing of Social Media Archives

Titel: Using Machine Learning to Enhance Archival Processing of Social Media Archives
verantwortlich: Anne Gilliland; Zhanyuan Yin; Huizi Yu; Lizhou Fan
Erscheinungsjahr: 2020
Medientyp: Preprint
Datenquelle: LISSA
sid-179-col-lissa
Tags: Tag hinzufügen

Zugang

Diese Ressource ist frei verfügbar.

Weblinks

author_facet	Anne Gilliland Zhanyuan Yin Huizi Yu Lizhou Fan Anne Gilliland Zhanyuan Yin Huizi Yu Lizhou Fan
author	Anne Gilliland Zhanyuan Yin Huizi Yu Lizhou Fan
spellingShingle	Anne Gilliland Zhanyuan Yin Huizi Yu Lizhou Fan Using Machine Learning to Enhance Archival Processing of Social Media Archives Archival Science Social and Behavioral Sciences Collection Development and Management hate speech generative adversarial network archival processing bepress LIS Scholarship Archive covid-19 machine learning Library and Information Science
author_sort	anne gilliland
spelling	Anne Gilliland Zhanyuan Yin Huizi Yu Lizhou Fan Archival Science Social and Behavioral Sciences Collection Development and Management hate speech generative adversarial network archival processing bepress LIS Scholarship Archive covid-19 machine learning Library and Information Science http://dx.doi.org/10.31229/OSF.IO/GKYDM http://osf.io/gkydm/ This paper reports on a study using machine learning to identify incidences and shifting dynamics of hate speech in social media archives. To better cope with the archival processing need for such large scale and fast evolving archives, we propose the Data-driven and Circulating Archival Processing (DCAP) method. As a proof-of-concept, our study focuses on an English language Twitter archive relating to COVID-19: tweets were repeatedly scraped between February and June 2020, ingested and aggregated within the COVID-19 Hate Speech Twitter Archive (CHSTA) and analyzed for hate speech using the Generative Adversarial Network (GAN)-inspired DCAP Method. Outcomes suggest that it is possible to use machine learning and data analytics to surface and substantiate trends from CHSTA and similar social media archives that could provide immediately useful knowledge for crisis response, in controversial situations, or for public policy development, as well as for subsequent historical analysis. The approach shows potential for integrating multiple aspects of the archival workflow, and supporting automatic iterative redescription and reappraisal activities in ways that make them more accountable and more rapidly responsive to changing societal interests and unfolding developments. Using Machine Learning to Enhance Archival Processing of Social Media Archives
doi_str_mv	10.31229/OSF.IO/GKYDM
facet_avail	Online
format	Preprint
fullrecord	blob:ai-179-E01EC-A62-C0E
id	ai-179-E01EC-A62-C0E
institution	FID-BBI-DE-23
imprint	2020
imprint_str_mv	2020
language	English
mega_collection	LISSA
match_str	gilliland2020usingmachinelearningtoenhancearchivalprocessingofsocialmediaarchives
publishDateSort	2020
record_id	E01EC-A62-C0E
recordtype	ai
record_format	ai
source_id	179
title	Using Machine Learning to Enhance Archival Processing of Social Media Archives
title_unstemmed	Using Machine Learning to Enhance Archival Processing of Social Media Archives
title_full	Using Machine Learning to Enhance Archival Processing of Social Media Archives
title_fullStr	Using Machine Learning to Enhance Archival Processing of Social Media Archives
title_full_unstemmed	Using Machine Learning to Enhance Archival Processing of Social Media Archives
title_short	Using Machine Learning to Enhance Archival Processing of Social Media Archives
title_sort	using machine learning to enhance archival processing of social media archives
topic	Archival Science Social and Behavioral Sciences Collection Development and Management hate speech generative adversarial network archival processing bepress LIS Scholarship Archive covid-19 machine learning Library and Information Science
url	http://dx.doi.org/10.31229/OSF.IO/GKYDM http://osf.io/gkydm/
publishDate	2020
physical
description	This paper reports on a study using machine learning to identify incidences and shifting dynamics of hate speech in social media archives. To better cope with the archival processing need for such large scale and fast evolving archives, we propose the Data-driven and Circulating Archival Processing (DCAP) method. As a proof-of-concept, our study focuses on an English language Twitter archive relating to COVID-19: tweets were repeatedly scraped between February and June 2020, ingested and aggregated within the COVID-19 Hate Speech Twitter Archive (CHSTA) and analyzed for hate speech using the Generative Adversarial Network (GAN)-inspired DCAP Method. Outcomes suggest that it is possible to use machine learning and data analytics to surface and substantiate trends from CHSTA and similar social media archives that could provide immediately useful knowledge for crisis response, in controversial situations, or for public policy development, as well as for subsequent historical analysis. The approach shows potential for integrating multiple aspects of the archival workflow, and supporting automatic iterative redescription and reappraisal activities in ways that make them more accountable and more rapidly responsive to changing societal interests and unfolding developments.
collection	sid-179-col-lissa
format_de105
format_de14
format_de15	Preprint
format_de520
format_de540
format_dech1
format_ded117
format_degla1
format_del152
format_del189
format_dezi4
format_dezwi2
format_finc	Preprint
format_nrw
_version_	1792366090290987029
geogr_code	not assigned
last_indexed	2024-03-01T22:51:42.957Z
geogr_code_person	not assigned
openURL	url_ver=Z39.88-2004&ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fkatalog.fid-bbi.de%3Agenerator&rft.title=Using+Machine+Learning+to+Enhance+Archival+Processing+of+Social+Media+Archives&rft.date=2020-11-10&genre=article&rft_id=info%3Adoi%2F10.31229%2FOSF.IO%2FGKYDM&atitle=Using+Machine+Learning+to+Enhance+Archival+Processing+of+Social+Media+Archives&au=Lizhou+Fan&rft.language%5B0%5D=eng
SOLR
_version_	1792366090290987029
author	Anne Gilliland, Zhanyuan Yin, Huizi Yu, Lizhou Fan
author_facet	Anne Gilliland, Zhanyuan Yin, Huizi Yu, Lizhou Fan, Anne Gilliland, Zhanyuan Yin, Huizi Yu, Lizhou Fan
author_sort	anne gilliland
collection	sid-179-col-lissa
description	This paper reports on a study using machine learning to identify incidences and shifting dynamics of hate speech in social media archives. To better cope with the archival processing need for such large scale and fast evolving archives, we propose the Data-driven and Circulating Archival Processing (DCAP) method. As a proof-of-concept, our study focuses on an English language Twitter archive relating to COVID-19: tweets were repeatedly scraped between February and June 2020, ingested and aggregated within the COVID-19 Hate Speech Twitter Archive (CHSTA) and analyzed for hate speech using the Generative Adversarial Network (GAN)-inspired DCAP Method. Outcomes suggest that it is possible to use machine learning and data analytics to surface and substantiate trends from CHSTA and similar social media archives that could provide immediately useful knowledge for crisis response, in controversial situations, or for public policy development, as well as for subsequent historical analysis. The approach shows potential for integrating multiple aspects of the archival workflow, and supporting automatic iterative redescription and reappraisal activities in ways that make them more accountable and more rapidly responsive to changing societal interests and unfolding developments.
doi_str_mv	10.31229/OSF.IO/GKYDM
facet_avail	Online
format	Preprint
format_de105
format_de14
format_de15	Preprint
format_de520
format_de540
format_dech1
format_ded117
format_degla1
format_del152
format_del189
format_dezi4
format_dezwi2
format_finc	Preprint
format_nrw
geogr_code	not assigned
geogr_code_person	not assigned
id	ai-179-E01EC-A62-C0E
imprint	2020
imprint_str_mv	2020
institution	FID-BBI-DE-23
language	English
last_indexed	2024-03-01T22:51:42.957Z
match_str	gilliland2020usingmachinelearningtoenhancearchivalprocessingofsocialmediaarchives
mega_collection	LISSA
physical
publishDate	2020
publishDateSort	2020
record_format	ai
record_id	E01EC-A62-C0E
recordtype	ai
source_id	179
spelling	Anne Gilliland Zhanyuan Yin Huizi Yu Lizhou Fan Archival Science Social and Behavioral Sciences Collection Development and Management hate speech generative adversarial network archival processing bepress LIS Scholarship Archive covid-19 machine learning Library and Information Science http://dx.doi.org/10.31229/OSF.IO/GKYDM http://osf.io/gkydm/ This paper reports on a study using machine learning to identify incidences and shifting dynamics of hate speech in social media archives. To better cope with the archival processing need for such large scale and fast evolving archives, we propose the Data-driven and Circulating Archival Processing (DCAP) method. As a proof-of-concept, our study focuses on an English language Twitter archive relating to COVID-19: tweets were repeatedly scraped between February and June 2020, ingested and aggregated within the COVID-19 Hate Speech Twitter Archive (CHSTA) and analyzed for hate speech using the Generative Adversarial Network (GAN)-inspired DCAP Method. Outcomes suggest that it is possible to use machine learning and data analytics to surface and substantiate trends from CHSTA and similar social media archives that could provide immediately useful knowledge for crisis response, in controversial situations, or for public policy development, as well as for subsequent historical analysis. The approach shows potential for integrating multiple aspects of the archival workflow, and supporting automatic iterative redescription and reappraisal activities in ways that make them more accountable and more rapidly responsive to changing societal interests and unfolding developments. Using Machine Learning to Enhance Archival Processing of Social Media Archives
spellingShingle	Anne Gilliland, Zhanyuan Yin, Huizi Yu, Lizhou Fan, Using Machine Learning to Enhance Archival Processing of Social Media Archives, Archival Science, Social and Behavioral Sciences, Collection Development and Management, hate speech, generative adversarial network, archival processing, bepress, LIS Scholarship Archive, covid-19, machine learning, Library and Information Science
title	Using Machine Learning to Enhance Archival Processing of Social Media Archives
title_full	Using Machine Learning to Enhance Archival Processing of Social Media Archives
title_fullStr	Using Machine Learning to Enhance Archival Processing of Social Media Archives
title_full_unstemmed	Using Machine Learning to Enhance Archival Processing of Social Media Archives
title_short	Using Machine Learning to Enhance Archival Processing of Social Media Archives
title_sort	using machine learning to enhance archival processing of social media archives
title_unstemmed	Using Machine Learning to Enhance Archival Processing of Social Media Archives
topic	Archival Science, Social and Behavioral Sciences, Collection Development and Management, hate speech, generative adversarial network, archival processing, bepress, LIS Scholarship Archive, covid-19, machine learning, Library and Information Science
url	http://dx.doi.org/10.31229/OSF.IO/GKYDM, http://osf.io/gkydm/

Using Machine Learning to Enhance Archival Processing of Social Media Archives

Bibliographische Detailangaben

Zugang

Weblinks