An interactive machine-learning method to obtain safety information from free text

Peter Hughes, Coen Van Gulijk, Rawia El Rashidy

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper describes the continued development of natural language processing (NLP) techniques to support safety management on the GB railways. The work considers machine reading techniques to obtain information from more than 800,000 free text hazard records in the Close Call System (CCS) Our research has found that the non-standard nature of the text means that standard NLP techniques yield accuracies between 0% and 60% when categorising hazard reports. To improve accuracy, a workflow was developed that uses the results from individual techniques in an iterative cycle with a human analyst. The technique, dubbed interactive learning, has achieved substantially improved accuracy of up to 98% and is currently being integrated by the GB railway industry as part of its SMS.
LanguageEnglish
Title of host publicationProceedings of the 29th European Safety and Reliability Conference (ESREL 2019)
Publication statusAccepted/In press - 2019
Event29th European Safety and Reliability Conference - Leibniz Universität, Hannover, Germany
Duration: 22 Sep 201926 Sep 2019
Conference number: 29
https://esrel2019.org/#/

Conference

Conference29th European Safety and Reliability Conference
Abbreviated titleESREL 2019
CountryGermany
CityHannover
Period22/09/1926/09/19
Internet address

Fingerprint

Learning systems
Hazards
Processing
Industry

Cite this

Hughes, P., Van Gulijk, C., & El Rashidy, R. (Accepted/In press). An interactive machine-learning method to obtain safety information from free text. In Proceedings of the 29th European Safety and Reliability Conference (ESREL 2019)
Hughes, Peter ; Van Gulijk, Coen ; El Rashidy, Rawia. / An interactive machine-learning method to obtain safety information from free text. Proceedings of the 29th European Safety and Reliability Conference (ESREL 2019). 2019.
@inproceedings{714db834ba7843c2859f0ce2dd29f4c5,
title = "An interactive machine-learning method to obtain safety information from free text",
abstract = "This paper describes the continued development of natural language processing (NLP) techniques to support safety management on the GB railways. The work considers machine reading techniques to obtain information from more than 800,000 free text hazard records in the Close Call System (CCS) Our research has found that the non-standard nature of the text means that standard NLP techniques yield accuracies between 0{\%} and 60{\%} when categorising hazard reports. To improve accuracy, a workflow was developed that uses the results from individual techniques in an iterative cycle with a human analyst. The technique, dubbed interactive learning, has achieved substantially improved accuracy of up to 98{\%} and is currently being integrated by the GB railway industry as part of its SMS.",
keywords = "close calls, natural language processing, interactive learning, text analysis",
author = "Peter Hughes and {Van Gulijk}, Coen and {El Rashidy}, Rawia",
year = "2019",
language = "English",
booktitle = "Proceedings of the 29th European Safety and Reliability Conference (ESREL 2019)",

}

Hughes, P, Van Gulijk, C & El Rashidy, R 2019, An interactive machine-learning method to obtain safety information from free text. in Proceedings of the 29th European Safety and Reliability Conference (ESREL 2019). 29th European Safety and Reliability Conference, Hannover, Germany, 22/09/19.

An interactive machine-learning method to obtain safety information from free text. / Hughes, Peter; Van Gulijk, Coen; El Rashidy, Rawia.

Proceedings of the 29th European Safety and Reliability Conference (ESREL 2019). 2019.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - An interactive machine-learning method to obtain safety information from free text

AU - Hughes, Peter

AU - Van Gulijk, Coen

AU - El Rashidy, Rawia

PY - 2019

Y1 - 2019

N2 - This paper describes the continued development of natural language processing (NLP) techniques to support safety management on the GB railways. The work considers machine reading techniques to obtain information from more than 800,000 free text hazard records in the Close Call System (CCS) Our research has found that the non-standard nature of the text means that standard NLP techniques yield accuracies between 0% and 60% when categorising hazard reports. To improve accuracy, a workflow was developed that uses the results from individual techniques in an iterative cycle with a human analyst. The technique, dubbed interactive learning, has achieved substantially improved accuracy of up to 98% and is currently being integrated by the GB railway industry as part of its SMS.

AB - This paper describes the continued development of natural language processing (NLP) techniques to support safety management on the GB railways. The work considers machine reading techniques to obtain information from more than 800,000 free text hazard records in the Close Call System (CCS) Our research has found that the non-standard nature of the text means that standard NLP techniques yield accuracies between 0% and 60% when categorising hazard reports. To improve accuracy, a workflow was developed that uses the results from individual techniques in an iterative cycle with a human analyst. The technique, dubbed interactive learning, has achieved substantially improved accuracy of up to 98% and is currently being integrated by the GB railway industry as part of its SMS.

KW - close calls

KW - natural language processing

KW - interactive learning

KW - text analysis

UR - https://esrel2019.org/#/

M3 - Conference contribution

BT - Proceedings of the 29th European Safety and Reliability Conference (ESREL 2019)

ER -

Hughes P, Van Gulijk C, El Rashidy R. An interactive machine-learning method to obtain safety information from free text. In Proceedings of the 29th European Safety and Reliability Conference (ESREL 2019). 2019