TY - JOUR
T1 - Data Mining and Big Data Analytics
T2 - Exploiting Resolution Scale, Addressing Bias, Having Analytical Focus
AU - Murtagh, Fionn
PY - 2018/2/17
Y1 - 2018/2/17
N2 - The key theme of the analytics here encompasses data mining and knowledge discovery in data, and that comprises unsupervised classification, analytics of semantics, and what can be well considered as cross-disciplinarity and multi-disciplinarity. In analyzing data, there are requirements and also possible additional perspectives to have. This allows coverage of both quantitative and qualitative themes and aspects. A basis for much of the Correspondence Analysis, latent semantics, methodology here is the mapping of data into the Euclidean metric endowed factor space. The latter expresses and represents the information space, that can also be well displayed and visualized. New methods in this paper include: process convergence and its application; how analytical focus and contextualization are very important and how these are implemented, and further aspects of semantic analytics. Since semantics are underlying meanings, this indicates the importance here for decision support and for the well known saying that "correlation is not causation". The latter expression means that understanding causal actions and events cannot be purely reduced to the input data and starting point relative to the output data or finalization. Interesting and important results and outcomes, at issue here, include social media analytics; incorporating context in mental health analytics; and large-scale social media analytics, being Twitter text mining.
AB - The key theme of the analytics here encompasses data mining and knowledge discovery in data, and that comprises unsupervised classification, analytics of semantics, and what can be well considered as cross-disciplinarity and multi-disciplinarity. In analyzing data, there are requirements and also possible additional perspectives to have. This allows coverage of both quantitative and qualitative themes and aspects. A basis for much of the Correspondence Analysis, latent semantics, methodology here is the mapping of data into the Euclidean metric endowed factor space. The latter expresses and represents the information space, that can also be well displayed and visualized. New methods in this paper include: process convergence and its application; how analytical focus and contextualization are very important and how these are implemented, and further aspects of semantic analytics. Since semantics are underlying meanings, this indicates the importance here for decision support and for the well known saying that "correlation is not causation". The latter expression means that understanding causal actions and events cannot be purely reduced to the input data and starting point relative to the output data or finalization. Interesting and important results and outcomes, at issue here, include social media analytics; incorporating context in mental health analytics; and large-scale social media analytics, being Twitter text mining.
KW - computational complexity
KW - Correspondence Analysis
KW - latent semantic analysis
KW - mental health analysis
KW - multivariate statistics
KW - social media analysis
KW - Twitter analysis
U2 - 10.15344/2456-4451/2018/127
DO - 10.15344/2456-4451/2018/127
M3 - Article
VL - 3
JO - International Journal of Computer and Software Engineering
JF - International Journal of Computer and Software Engineering
SN - 2456-4451
IS - 1
M1 - 127
ER -