TY - JOUR
T1 - A Language Modelling approach to linking criminal styles with offender characteristics
AU - Bache, R.
AU - Crestani, F.
AU - Canter, D.
AU - Youngs, D.
PY - 2010/3/1
Y1 - 2010/3/1
N2 - The ability to infer the characteristics of offenders from their criminal behaviour ('offender profiling') has only been partially successful since it has relied on subjective judgments based on limited data. Words and structured data used in crime descriptions recorded by the police relate to behavioural features. Thus Language Modelling was applied to an existing police archive to link behavioural features with significant characteristics of offenders. Both multinomial and multiple Bernoulli models were used. Although categories selected are gender, age group, ethnic appearance and broad occupation (employed or not), in principle this can be applied to any characteristic recorded. Results indicate that statistically significant relationships exist between all characteristics for many types of crime. Bernoulli models tend to perform better than multinomial ones. It is also possible to identify automatically specific terms which when taken together give insight into the style of offending related to a particular group.
AB - The ability to infer the characteristics of offenders from their criminal behaviour ('offender profiling') has only been partially successful since it has relied on subjective judgments based on limited data. Words and structured data used in crime descriptions recorded by the police relate to behavioural features. Thus Language Modelling was applied to an existing police archive to link behavioural features with significant characteristics of offenders. Both multinomial and multiple Bernoulli models were used. Although categories selected are gender, age group, ethnic appearance and broad occupation (employed or not), in principle this can be applied to any characteristic recorded. Results indicate that statistically significant relationships exist between all characteristics for many types of crime. Bernoulli models tend to perform better than multinomial ones. It is also possible to identify automatically specific terms which when taken together give insight into the style of offending related to a particular group.
KW - Information Retrieval
KW - Investigative psychology
KW - Language Modelling
KW - Offender profiling
UR - http://www.scopus.com/inward/record.url?scp=74849103623&partnerID=8YFLogxK
U2 - 10.1016/j.datak.2009.10.009
DO - 10.1016/j.datak.2009.10.009
M3 - Article
AN - SCOPUS:74849103623
VL - 69
SP - 303
EP - 315
JO - Data and Knowledge Engineering
JF - Data and Knowledge Engineering
SN - 0169-023X
IS - 3
ER -