An interactive machine-learning method to obtain safety information from free text

Peter Hughes, Coen Van Gulijk, Rawia El Rashidy

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

This paper describes the continued development of natural language processing (NLP) techniques to support safety management on the GB railways. The work considers machine reading techniques to obtain information from more than 800,000 free text hazard records in the Close Call System (CCS). Our research has found that the non-standard nature of the text means that standard NLP techniques yield accuracies between 0% and 60% when categorising hazard reports. To improve accuracy, a workflow was developed that uses the results from individual techniques in an iterative cycle with a human analyst. The technique, dubbed interactive learning, has achieved substantially improved accuracy of up to 98% and is currently being integrated by the GB railway industry as part of its SMS.
Original language: English
Title of host publication: Proceedings of the 29th European Safety and Reliability Conference (ESREL 2019)
Editors: Michael Beer, Enrico Zio
Pages: 46-54
Number of pages: 9
ISBN (Electronic): 9789811127243
Publication status: Published - Sep 2019
Event: 29th European Safety and Reliability Conference - Leibniz Universität, Hannover, Germany
Duration: 22 Sep 2019 to 26 Sep 2019
Conference number: 29
https://esrel2019.org/#/

Conference

Conference: 29th European Safety and Reliability Conference
Abbreviated title: ESREL 2019
Country: Germany
City: Hannover
Period: 22/09/19 to 26/09/19
Internet address: https://esrel2019.org/#/

Fingerprint

Learning systems
Hazards
Processing
Industry

Cite this

Hughes, P., Van Gulijk, C., & El Rashidy, R. (2019). An interactive machine-learning method to obtain safety information from free text. In M. Beer & E. Zio (Eds.), Proceedings of the 29th European Safety and Reliability Conference (ESREL 2019) (pp. 46-54).
Hughes, Peter ; Van Gulijk, Coen ; El Rashidy, Rawia. / An interactive machine-learning method to obtain safety information from free text. Proceedings of the 29th European Safety and Reliability Conference (ESREL 2019). editor / Michael Beer ; Enrico Zio. 2019. pp. 46-54
@inproceedings{714db834ba7843c2859f0ce2dd29f4c5,
title = "An interactive machine-learning method to obtain safety information from free text",
abstract = "This paper describes the continued development of natural language processing (NLP) techniques to support safety management on the GB railways. The work considers machine reading techniques to obtain information from more than 800,000 free text hazard records in the Close Call System (CCS). Our research has found that the non-standard nature of the text means that standard NLP techniques yield accuracies between 0{\%} and 60{\%} when categorising hazard reports. To improve accuracy, a workflow was developed that uses the results from individual techniques in an iterative cycle with a human analyst. The technique, dubbed interactive learning, has achieved substantially improved accuracy of up to 98{\%} and is currently being integrated by the GB railway industry as part of its SMS.",
keywords = "close calls, natural language processing, interactive learning, text analysis",
author = "Peter Hughes and {Van Gulijk}, Coen and {El Rashidy}, Rawia",
year = "2019",
month = "9",
language = "English",
pages = "46--54",
editor = "Michael Beer and Enrico Zio",
booktitle = "Proceedings of the 29th European Safety and Reliability Conference (ESREL 2019)",

}

Hughes, P, Van Gulijk, C & El Rashidy, R 2019, An interactive machine-learning method to obtain safety information from free text. in M Beer & E Zio (eds), Proceedings of the 29th European Safety and Reliability Conference (ESREL 2019). pp. 46-54, 29th European Safety and Reliability Conference, Hannover, Germany, 22/09/19.

An interactive machine-learning method to obtain safety information from free text. / Hughes, Peter; Van Gulijk, Coen; El Rashidy, Rawia.

Proceedings of the 29th European Safety and Reliability Conference (ESREL 2019). ed. / Michael Beer; Enrico Zio. 2019. p. 46-54.

TY - GEN

T1 - An interactive machine-learning method to obtain safety information from free text

AU - Hughes, Peter

AU - Van Gulijk, Coen

AU - El Rashidy, Rawia

PY - 2019/9

Y1 - 2019/9

N2 - This paper describes the continued development of natural language processing (NLP) techniques to support safety management on the GB railways. The work considers machine reading techniques to obtain information from more than 800,000 free text hazard records in the Close Call System (CCS). Our research has found that the non-standard nature of the text means that standard NLP techniques yield accuracies between 0% and 60% when categorising hazard reports. To improve accuracy, a workflow was developed that uses the results from individual techniques in an iterative cycle with a human analyst. The technique, dubbed interactive learning, has achieved substantially improved accuracy of up to 98% and is currently being integrated by the GB railway industry as part of its SMS.

AB - This paper describes the continued development of natural language processing (NLP) techniques to support safety management on the GB railways. The work considers machine reading techniques to obtain information from more than 800,000 free text hazard records in the Close Call System (CCS). Our research has found that the non-standard nature of the text means that standard NLP techniques yield accuracies between 0% and 60% when categorising hazard reports. To improve accuracy, a workflow was developed that uses the results from individual techniques in an iterative cycle with a human analyst. The technique, dubbed interactive learning, has achieved substantially improved accuracy of up to 98% and is currently being integrated by the GB railway industry as part of its SMS.

KW - close calls

KW - natural language processing

KW - interactive learning

KW - text analysis

UR - https://esrel2019.org/#/

UR - http://itekcmsonline.com/rps2prod/esrel2019/e-proceedings/index.html

M3 - Conference contribution

SP - 46

EP - 54

BT - Proceedings of the 29th European Safety and Reliability Conference (ESREL 2019)

A2 - Beer, Michael

A2 - Zio, Enrico

ER -

Hughes P, Van Gulijk C, El Rashidy R. An interactive machine-learning method to obtain safety information from free text. In Beer M, Zio E, editors, Proceedings of the 29th European Safety and Reliability Conference (ESREL 2019). 2019. p. 46-54.