TY - JOUR
T1 - From data to the p-adic or ultrametric model
AU - Murtagh, F.
PY - 2009/3/1
Y1 - 2009/3/1
N2 - We model anomaly and change in data by embedding the data in an ultrametric space. Taking our initial data as cross-tabulation counts (or other input data formats), Correspondence Analysis allows us to endow the information space with a Euclidean metric. We then model anomaly or change by an induced ultrametric. The induced ultrametric that we are particularly interested in takes a sequential — e.g. temporal — ordering of the data into account. We apply this work to the flow of narrative expressed in the film script of the Casablanca movie; and to the evolution between 1988 and 2004 of the Colombian social conflict and violence.
AB - We model anomaly and change in data by embedding the data in an ultrametric space. Taking our initial data as cross-tabulation counts (or other input data formats), Correspondence Analysis allows us to endow the information space with a Euclidean metric. We then model anomaly or change by an induced ultrametric. The induced ultrametric that we are particularly interested in takes a sequential — e.g. temporal — ordering of the data into account. We apply this work to the flow of narrative expressed in the film script of the Casablanca movie; and to the evolution between 1988 and 2004 of the Colombian social conflict and violence.
KW - applications
KW - Euclidean metric
KW - hierarchic clustering
KW - multivariate data analysis
KW - ultrametric topology
UR - http://www.scopus.com/inward/record.url?scp=77950938697&partnerID=8YFLogxK
U2 - 10.1134/S2070046609010063
DO - 10.1134/S2070046609010063
M3 - Article
AN - SCOPUS:77950938697
VL - 1
SP - 58
EP - 68
JO - P-Adic Numbers, Ultrametric Analysis, and Applications
JF - P-Adic Numbers, Ultrametric Analysis, and Applications
SN - 2070-0466
IS - 1
ER -