Information Extraction for Additive Manufacturing Using News Data

Neha Sehgal, Andrew Crampton

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Recognizing named entities like Person, Organization, Locations and Date are very useful for web mining. Named Entity Recognition (NER) is an emerging research area which aims to address problems such as Machine Translation, Question Answering Systems and Semantic Web Search. The study focuses on proposing a methodology based on the integration of an NER system and Text Analytics to provide information necessary for business in Additive Manufacturing. The study proposes a foundation of utilizing the Stanford NER system for tagging news data related to the keywords “Additive Manufacturing”. The objective is to first derive the organization names from news data. This information is useful to define the digital footprints of an organization in the Additive Manufacturing sector. The existence of an organization derived using the NER approach is validated by matching their names with companies listed on the Companies House portal. The organization names will be matched using a Fuzzy-based text matching algorithm. Further information on company profile, officers and key financial data is extracted to provide information about companies interested and working within the Additive Manufacturing sector. This data gives an insight into which companies have digital footprints in the Additive Manufacturing sector within the UK.
Original languageEnglish
Title of host publicationAdvanced Information Systems Engineering Workshops
Subtitle of host publicationCAiSE 2019 International Workshops, Rome, Italy, June 3-7, 2019, Proceedings
EditorsHenderik A. Proper, Janis Stirna
Place of PublicationCham
PublisherSpringer, Cham
Pages132-138
Number of pages7
VolumeLNBIP 349
Edition1st
ISBN (Electronic)9783030209483
ISBN (Print)9783030209476, 3030209474
DOIs
Publication statusPublished - 17 Jul 2019
Event31st International Conference on Advanced Information Systems Engineering 2019 - Rome, Italy
Duration: 3 Jun 20197 Jun 2019

Publication series

NameLecture Notes in Business Information Processing
PublisherSpringer, Cham
Volume349
ISSN (Print)1865-1348
ISSN (Electronic)1865-1356

Conference

Conference31st International Conference on Advanced Information Systems Engineering 2019
Abbreviated titleCAiSE 2019
CountryItaly
CityRome
Period3/06/197/06/19

Fingerprint

3D printers
Industry
Semantic Web

Cite this

Sehgal, N., & Crampton, A. (2019). Information Extraction for Additive Manufacturing Using News Data. In H. A. Proper, & J. Stirna (Eds.), Advanced Information Systems Engineering Workshops: CAiSE 2019 International Workshops, Rome, Italy, June 3-7, 2019, Proceedings (1st ed., Vol. LNBIP 349, pp. 132-138). (Lecture Notes in Business Information Processing; Vol. 349). Cham: Springer, Cham. https://doi.org/10.1007/978-3-030-20948-3_12
Sehgal, Neha ; Crampton, Andrew. / Information Extraction for Additive Manufacturing Using News Data. Advanced Information Systems Engineering Workshops: CAiSE 2019 International Workshops, Rome, Italy, June 3-7, 2019, Proceedings. editor / Henderik A. Proper ; Janis Stirna. Vol. LNBIP 349 1st. ed. Cham : Springer, Cham, 2019. pp. 132-138 (Lecture Notes in Business Information Processing).
@inproceedings{9fd5a1f490434afda7bc9b37c8f13f6a,
title = "Information Extraction for Additive Manufacturing Using News Data",
abstract = "Recognizing named entities like Person, Organization, Locations and Date are very useful for web mining. Named Entity Recognition (NER) is an emerging research area which aims to address problems such as Machine Translation, Question Answering Systems and Semantic Web Search. The study focuses on proposing a methodology based on the integration of an NER system and Text Analytics to provide information necessary for business in Additive Manufacturing. The study proposes a foundation of utilizing the Stanford NER system for tagging news data related to the keywords “Additive Manufacturing”. The objective is to first derive the organization names from news data. This information is useful to define the digital footprints of an organization in the Additive Manufacturing sector. The existence of an organization derived using the NER approach is validated by matching their names with companies listed on the Companies House portal. The organization names will be matched using a Fuzzy-based text matching algorithm. Further information on company profile, officers and key financial data is extracted to provide information about companies interested and working within the Additive Manufacturing sector. This data gives an insight into which companies have digital footprints in the Additive Manufacturing sector within the UK.",
keywords = "Named Entity Recognition, Additive Manufacturing, Text matching, Open data, News data",
author = "Neha Sehgal and Andrew Crampton",
year = "2019",
month = "7",
day = "17",
doi = "10.1007/978-3-030-20948-3_12",
language = "English",
isbn = "9783030209476",
volume = "LNBIP 349",
series = "Lecture Notes in Business Information Processing",
publisher = "Springer, Cham",
pages = "132--138",
editor = "Proper, {Henderik A.} and Janis Stirna",
booktitle = "Advanced Information Systems Engineering Workshops",
edition = "1st",

}

Sehgal, N & Crampton, A 2019, Information Extraction for Additive Manufacturing Using News Data. in HA Proper & J Stirna (eds), Advanced Information Systems Engineering Workshops: CAiSE 2019 International Workshops, Rome, Italy, June 3-7, 2019, Proceedings. 1st edn, vol. LNBIP 349, Lecture Notes in Business Information Processing, vol. 349, Springer, Cham, Cham, pp. 132-138, 31st International Conference on Advanced Information Systems Engineering 2019, Rome, Italy, 3/06/19. https://doi.org/10.1007/978-3-030-20948-3_12

Information Extraction for Additive Manufacturing Using News Data. / Sehgal, Neha; Crampton, Andrew.

Advanced Information Systems Engineering Workshops: CAiSE 2019 International Workshops, Rome, Italy, June 3-7, 2019, Proceedings. ed. / Henderik A. Proper; Janis Stirna. Vol. LNBIP 349 1st. ed. Cham : Springer, Cham, 2019. p. 132-138 (Lecture Notes in Business Information Processing; Vol. 349).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Information Extraction for Additive Manufacturing Using News Data

AU - Sehgal, Neha

AU - Crampton, Andrew

PY - 2019/7/17

Y1 - 2019/7/17

N2 - Recognizing named entities like Person, Organization, Locations and Date are very useful for web mining. Named Entity Recognition (NER) is an emerging research area which aims to address problems such as Machine Translation, Question Answering Systems and Semantic Web Search. The study focuses on proposing a methodology based on the integration of an NER system and Text Analytics to provide information necessary for business in Additive Manufacturing. The study proposes a foundation of utilizing the Stanford NER system for tagging news data related to the keywords “Additive Manufacturing”. The objective is to first derive the organization names from news data. This information is useful to define the digital footprints of an organization in the Additive Manufacturing sector. The existence of an organization derived using the NER approach is validated by matching their names with companies listed on the Companies House portal. The organization names will be matched using a Fuzzy-based text matching algorithm. Further information on company profile, officers and key financial data is extracted to provide information about companies interested and working within the Additive Manufacturing sector. This data gives an insight into which companies have digital footprints in the Additive Manufacturing sector within the UK.

AB - Recognizing named entities like Person, Organization, Locations and Date are very useful for web mining. Named Entity Recognition (NER) is an emerging research area which aims to address problems such as Machine Translation, Question Answering Systems and Semantic Web Search. The study focuses on proposing a methodology based on the integration of an NER system and Text Analytics to provide information necessary for business in Additive Manufacturing. The study proposes a foundation of utilizing the Stanford NER system for tagging news data related to the keywords “Additive Manufacturing”. The objective is to first derive the organization names from news data. This information is useful to define the digital footprints of an organization in the Additive Manufacturing sector. The existence of an organization derived using the NER approach is validated by matching their names with companies listed on the Companies House portal. The organization names will be matched using a Fuzzy-based text matching algorithm. Further information on company profile, officers and key financial data is extracted to provide information about companies interested and working within the Additive Manufacturing sector. This data gives an insight into which companies have digital footprints in the Additive Manufacturing sector within the UK.

KW - Named Entity Recognition

KW - Additive Manufacturing

KW - Text matching

KW - Open data

KW - News data

UR - http://www.scopus.com/inward/record.url?scp=85066844977&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-20948-3_12

DO - 10.1007/978-3-030-20948-3_12

M3 - Conference contribution

SN - 9783030209476

SN - 3030209474

VL - LNBIP 349

T3 - Lecture Notes in Business Information Processing

SP - 132

EP - 138

BT - Advanced Information Systems Engineering Workshops

A2 - Proper, Henderik A.

A2 - Stirna, Janis

PB - Springer, Cham

CY - Cham

ER -

Sehgal N, Crampton A. Information Extraction for Additive Manufacturing Using News Data. In Proper HA, Stirna J, editors, Advanced Information Systems Engineering Workshops: CAiSE 2019 International Workshops, Rome, Italy, June 3-7, 2019, Proceedings. 1st ed. Vol. LNBIP 349. Cham: Springer, Cham. 2019. p. 132-138. (Lecture Notes in Business Information Processing). https://doi.org/10.1007/978-3-030-20948-3_12