Text mining and big textual data: Relevant statistical models

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

A general overview is provided through examples and case studies, retrieved from research experiences, to foster description and debate on effectiveness in Big Data environments. At issue are early stage case studies relating to: research publishing and research impact; literature, narrative and foundational emotional tracking; and social media, here Twitter, with a social science orientation. Central relevance and importance will be associated with the following aspects of analytical methodology: context, leading to availing of semantics; focus, motivating homology between fields of analytical orientation; resolution scale, which can incorporate a concept hierarchy and aggregation in general; and acknowledging all that is implied by this expression: correlation is not causation. Application areas are: quantitative and also qualitative assessment, narrative analysis and assessing impact, and baselining and contextualizing, statistically and in related aspects such as visualization.

LanguageEnglish
Title of host publicationNew Statistical Developments in Data Science - SIS 2017
EditorsAlessandra Petrucci, Filomena Racioppi, Rosanna Verde
PublisherSpringer New York LLC
Pages39-52
Number of pages14
Volume288
ISBN (Print)9783030211578
DOIs
Publication statusPublished - 21 Aug 2019
EventSIS Conference on Statistics and Data Science: New Challenges, New Generations - University of Florence, Florence, Italy
Duration: 28 Jun 201730 Jun 2017
http://meetings3.sis-statistica.org/index.php/sis2017/sis2017

Conference

ConferenceSIS Conference on Statistics and Data Science
CountryItaly
CityFlorence
Period28/06/1730/06/17
Internet address

Fingerprint

Text Mining
Data Model
Statistical Model
Concept Hierarchy
Causation
Social Media
Social Sciences
Homology
Aggregation
Visualization
Methodology
Narrative
Relevance
Semantics
Context
Experience
Emotion

Cite this

Murtagh, F. (2019). Text mining and big textual data: Relevant statistical models. In A. Petrucci, F. Racioppi, & R. Verde (Eds.), New Statistical Developments in Data Science - SIS 2017 (Vol. 288, pp. 39-52). Springer New York LLC. https://doi.org/10.1007/978-3-030-21158-5_4
Murtagh, Fionn. / Text mining and big textual data : Relevant statistical models. New Statistical Developments in Data Science - SIS 2017. editor / Alessandra Petrucci ; Filomena Racioppi ; Rosanna Verde. Vol. 288 Springer New York LLC, 2019. pp. 39-52
@inproceedings{e76b037c8868439185f3772d0987b2af,
title = "Text mining and big textual data: Relevant statistical models",
abstract = "A general overview is provided through examples and case studies, retrieved from research experiences, to foster description and debate on effectiveness in Big Data environments. At issue are early stage case studies relating to: research publishing and research impact; literature, narrative and foundational emotional tracking; and social media, here Twitter, with a social science orientation. Central relevance and importance will be associated with the following aspects of analytical methodology: context, leading to availing of semantics; focus, motivating homology between fields of analytical orientation; resolution scale, which can incorporate a concept hierarchy and aggregation in general; and acknowledging all that is implied by this expression: correlation is not causation. Application areas are: quantitative and also qualitative assessment, narrative analysis and assessing impact, and baselining and contextualizing, statistically and in related aspects such as visualization.",
keywords = "Chronological hierarchical clustering, Correspondence analysis, Emotion tracking, Mapping narrative, Significance of style",
author = "Fionn Murtagh",
year = "2019",
month = "8",
day = "21",
doi = "10.1007/978-3-030-21158-5_4",
language = "English",
isbn = "9783030211578",
volume = "288",
pages = "39--52",
editor = "Alessandra Petrucci and Filomena Racioppi and Rosanna Verde",
booktitle = "New Statistical Developments in Data Science - SIS 2017",
publisher = "Springer New York LLC",
address = "United States",

}

Murtagh, F 2019, Text mining and big textual data: Relevant statistical models. in A Petrucci, F Racioppi & R Verde (eds), New Statistical Developments in Data Science - SIS 2017. vol. 288, Springer New York LLC, pp. 39-52, SIS Conference on Statistics and Data Science, Florence, Italy, 28/06/17. https://doi.org/10.1007/978-3-030-21158-5_4

Text mining and big textual data : Relevant statistical models. / Murtagh, Fionn.

New Statistical Developments in Data Science - SIS 2017. ed. / Alessandra Petrucci; Filomena Racioppi; Rosanna Verde. Vol. 288 Springer New York LLC, 2019. p. 39-52.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Text mining and big textual data

T2 - Relevant statistical models

AU - Murtagh, Fionn

PY - 2019/8/21

Y1 - 2019/8/21

N2 - A general overview is provided through examples and case studies, retrieved from research experiences, to foster description and debate on effectiveness in Big Data environments. At issue are early stage case studies relating to: research publishing and research impact; literature, narrative and foundational emotional tracking; and social media, here Twitter, with a social science orientation. Central relevance and importance will be associated with the following aspects of analytical methodology: context, leading to availing of semantics; focus, motivating homology between fields of analytical orientation; resolution scale, which can incorporate a concept hierarchy and aggregation in general; and acknowledging all that is implied by this expression: correlation is not causation. Application areas are: quantitative and also qualitative assessment, narrative analysis and assessing impact, and baselining and contextualizing, statistically and in related aspects such as visualization.

AB - A general overview is provided through examples and case studies, retrieved from research experiences, to foster description and debate on effectiveness in Big Data environments. At issue are early stage case studies relating to: research publishing and research impact; literature, narrative and foundational emotional tracking; and social media, here Twitter, with a social science orientation. Central relevance and importance will be associated with the following aspects of analytical methodology: context, leading to availing of semantics; focus, motivating homology between fields of analytical orientation; resolution scale, which can incorporate a concept hierarchy and aggregation in general; and acknowledging all that is implied by this expression: correlation is not causation. Application areas are: quantitative and also qualitative assessment, narrative analysis and assessing impact, and baselining and contextualizing, statistically and in related aspects such as visualization.

KW - Chronological hierarchical clustering

KW - Correspondence analysis

KW - Emotion tracking

KW - Mapping narrative

KW - Significance of style

UR - http://www.scopus.com/inward/record.url?scp=85071667120&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-21158-5_4

DO - 10.1007/978-3-030-21158-5_4

M3 - Conference contribution

SN - 9783030211578

VL - 288

SP - 39

EP - 52

BT - New Statistical Developments in Data Science - SIS 2017

A2 - Petrucci, Alessandra

A2 - Racioppi, Filomena

A2 - Verde, Rosanna

PB - Springer New York LLC

ER -

Murtagh F. Text mining and big textual data: Relevant statistical models. In Petrucci A, Racioppi F, Verde R, editors, New Statistical Developments in Data Science - SIS 2017. Vol. 288. Springer New York LLC. 2019. p. 39-52 https://doi.org/10.1007/978-3-030-21158-5_4