Big data scaling through metric mapping: exploiting the remarkable simplicity of very high dimensional spaces using correspondence analysis

Fionn Murtagh

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Citations (Scopus)

Abstract

We present new findings in regard to data analysis in very high dimensional spaces. We use dimensionalities up to around one million. A particular benefit of Correspondence Analysis is its suitability for carrying out an orthonormal mapping, or scaling, of power law distributed data. Power law distributed data are found in many domains. Correspondence factor analysis provides a latent semantic or principal axes mapping. Our experiments use data from digital chemistry and finance, and other statistically generated data.

Original languageEnglish
Title of host publicationData Science
EditorsFrancesco Palumbo, Angela Montanari, Maurizio Vichi
Pages295-306
Number of pages12
Edition195089
ISBN (Electronic)9783319557236
DOIs
Publication statusPublished - 5 Jul 2017
Externally publishedYes
Event15th Conference of the International Federation of Classification Societies - Bologna, Italy
Duration: 6 Jul 20158 Jul 2015
Conference number: 15
https://studylib.net/doc/10711915/ifcs-2015-call-for-papers-conference-of-the-international... (Link to Call for Papers)
https://web.archive.org/web/20160717055549/http://ifcs.boku.ac.at/_conference/public/conferences/1/schedConfs/1/program-en_US.pdf (Program)

Publication series

NameStudies in Classification, Data Analysis, and Knowledge Organization
ISSN (Print)1431-8814

Conference

Conference15th Conference of the International Federation of Classification Societies
Abbreviated titleIFCS 2015
Country/TerritoryItaly
CityBologna
Period6/07/158/07/15
Internet address

Fingerprint

Dive into the research topics of 'Big data scaling through metric mapping: exploiting the remarkable simplicity of very high dimensional spaces using correspondence analysis'. Together they form a unique fingerprint.

Cite this