Natural Language Processing for Classification of Narratives from MSHA Data

- Organization:
- Society for Mining, Metallurgy & Exploration
- Pages:
- 10
- File Size:
- 225 KB
- Publication Date:
- Jun 25, 2023
Abstract
Mining can be categorized as a hazardous activity, considering some factors such as environmental conditions with a considerable presence of humidity, suspended particles, or falling rocks have affected the severity and number of accidents compared with other economic sectors. The industry analyzes incident reports to narrow the rate of severe injuries and fatalities, conducting root cause analysis and identifying leading indicators. As the International Council on Mining and Metals noted, the vast trove of incident data is not analyzed as much as possible due to a lack of analytics expertise at mine sites. However, machine learning could solve the problem of analyzing all the incident data in a no-timeconsuming way, considering the abundant data, and without using expert personnel in data science. Thus, a Convolutional Neural Network and a Naïve Bayes model were introduced to perform classification in the MSHA database. The database from 2020 consists of 60 fields to describe safety incidents; these fields include mine I.D., accident date, subunit (mill, surface), material extracted, and other metadata.
Citation
APA:
(2023) Natural Language Processing for Classification of Narratives from MSHA DataMLA: Natural Language Processing for Classification of Narratives from MSHA Data. Society for Mining, Metallurgy & Exploration, 2023.