Data Mining and Big Data Analytics: Exploiting Resolution Scale, Addressing Bias, Having Analytical Focus

Research output: Contribution to journalArticle

Abstract

The key theme of the analytics here encompasses data mining and knowledge discovery in data, and that comprises unsupervised classification, analytics of semantics, and what can be well considered as cross-disciplinarity and multi-disciplinarity. In analyzing data, there are requirements and also possible additional perspectives to have. This allows coverage of both quantitative and qualitative themes and aspects. A basis for much of the Correspondence Analysis, latent semantics, methodology here is the mapping of data into the Euclidean metric endowed factor space. The latter expresses and represents the information space, that can also be well displayed and visualized. New methods in this paper include: process convergence and its application; how analytical focus and contextualization are very important and how these are implemented, and further aspects of semantic analytics. Since semantics are underlying meanings, this indicates the importance here for decision support and for the well known saying that "correlation is not causation". The latter expression means that understanding causal actions and events cannot be purely reduced to the input data and starting point relative to the output data or finalization. Interesting and important results and outcomes, at issue here, include social media analytics; incorporating context in mental health analytics; and large-scale social media analytics, being Twitter text mining.
LanguageEnglish
Number of pages7
JournalInternational Journal of Computer and Software Engineering
Volume3
Issue number1
Publication statusPublished - 17 Feb 2018

Fingerprint

Data mining
Semantics
Health
Big data

Cite this

@article{529d31b2762d41ab8ccae3c4b27bd5fc,
title = "Data Mining and Big Data Analytics: Exploiting Resolution Scale, Addressing Bias, Having Analytical Focus",
abstract = "The key theme of the analytics here encompasses data mining and knowledge discovery in data, and that comprises unsupervised classification, analytics of semantics, and what can be well considered as cross-disciplinarity and multi-disciplinarity. In analyzing data, there are requirements and also possible additional perspectives to have. This allows coverage of both quantitative and qualitative themes and aspects. A basis for much of the Correspondence Analysis, latent semantics, methodology here is the mapping of data into the Euclidean metric endowed factor space. The latter expresses and represents the information space, that can also be well displayed and visualized. New methods in this paper include: process convergence and its application; how analytical focus and contextualization are very important and how these are implemented, and further aspects of semantic analytics. Since semantics are underlying meanings, this indicates the importance here for decision support and for the well known saying that {"}correlation is not causation{"}. The latter expression means that understanding causal actions and events cannot be purely reduced to the input data and starting point relative to the output data or finalization. Interesting and important results and outcomes, at issue here, include social media analytics; incorporating context in mental health analytics; and large-scale social media analytics, being Twitter text mining.",
keywords = "computational complexity, Correspondence Analysis, latent semantic analysis, mental health analysis, multivariate statistics, social media analysis, Twitter analysis",
author = "Fionn Murtagh",
year = "2018",
month = "2",
day = "17",
language = "English",
volume = "3",
journal = "International Journal of Computer and Software Engineering",
issn = "2456-4451",
number = "1",

}

TY - JOUR

T1 - Data Mining and Big Data Analytics

T2 - International Journal of Computer and Software Engineering

AU - Murtagh,Fionn

PY - 2018/2/17

Y1 - 2018/2/17

N2 - The key theme of the analytics here encompasses data mining and knowledge discovery in data, and that comprises unsupervised classification, analytics of semantics, and what can be well considered as cross-disciplinarity and multi-disciplinarity. In analyzing data, there are requirements and also possible additional perspectives to have. This allows coverage of both quantitative and qualitative themes and aspects. A basis for much of the Correspondence Analysis, latent semantics, methodology here is the mapping of data into the Euclidean metric endowed factor space. The latter expresses and represents the information space, that can also be well displayed and visualized. New methods in this paper include: process convergence and its application; how analytical focus and contextualization are very important and how these are implemented, and further aspects of semantic analytics. Since semantics are underlying meanings, this indicates the importance here for decision support and for the well known saying that "correlation is not causation". The latter expression means that understanding causal actions and events cannot be purely reduced to the input data and starting point relative to the output data or finalization. Interesting and important results and outcomes, at issue here, include social media analytics; incorporating context in mental health analytics; and large-scale social media analytics, being Twitter text mining.

AB - The key theme of the analytics here encompasses data mining and knowledge discovery in data, and that comprises unsupervised classification, analytics of semantics, and what can be well considered as cross-disciplinarity and multi-disciplinarity. In analyzing data, there are requirements and also possible additional perspectives to have. This allows coverage of both quantitative and qualitative themes and aspects. A basis for much of the Correspondence Analysis, latent semantics, methodology here is the mapping of data into the Euclidean metric endowed factor space. The latter expresses and represents the information space, that can also be well displayed and visualized. New methods in this paper include: process convergence and its application; how analytical focus and contextualization are very important and how these are implemented, and further aspects of semantic analytics. Since semantics are underlying meanings, this indicates the importance here for decision support and for the well known saying that "correlation is not causation". The latter expression means that understanding causal actions and events cannot be purely reduced to the input data and starting point relative to the output data or finalization. Interesting and important results and outcomes, at issue here, include social media analytics; incorporating context in mental health analytics; and large-scale social media analytics, being Twitter text mining.

KW - computational complexity

KW - Correspondence Analysis

KW - latent semantic analysis

KW - mental health analysis

KW - multivariate statistics

KW - social media analysis

KW - Twitter analysis

UR - https://www.graphyonline.com/journal/journal_article_inpress.php?journalid=IJCSE

M3 - Article

VL - 3

JO - International Journal of Computer and Software Engineering

JF - International Journal of Computer and Software Engineering

SN - 2456-4451

IS - 1

ER -