Article Text

Download PDFPDF

P.1.11 ‘Creation of classification model using machine learning; to detect dysphonia work-related’
  1. Natalia Gilbert1,
  2. Rodrigo Assar2,
  3. Rodrigo Martinez2
  1. 1Instituto de Salud Pública de Chile, Santiago, Chile
  2. 2Universidad de Chile, Santiago, Chile


The underdiagnosis of occupational disease causes severe damage to the health system. The classification of a disease as a professional is based on the decision on whether the present labor factors are sufficient for the generation of the disease, and this function is carried out by a qualified professional or committee.

Occupational dysphonia is one of the 5 most frequent occupational diseases in Chile, whose condition impact on the labor productivity and the quality of life of the patient. Today there are no unified criteria among the occupational qualification decisión makers to decide on the sufficient of laboral factors of occupational dysphonia disease.

Computerized systems have been developed to support clinical diagnosis decision-making process; among these, Machine Learning methods have been used to simulate the reasoning of the expert from the analysis and identification of complex patterns in large databases, so in this study it is suggested that the creation of a dysphonia classification model is possible employing Machine Learning tools. For this purpose, 103 cases obtained from patients with qualification results cause by dysphonia was analize in relation to the number of variables studied and their distribution for the observation of the characteristics that give identity to the groups studied. Subsequently, different classification models were developed using Machine Learning and the one that presented the best performance was chosen.

Statistical analyzes show that of the 6 models of Machine Learning elaborated, Random Forest was the one that presented the best performance (accuracity=0.83 and Kappa value=0.61), variables that manage to establish identity to each group represent 26.5% of the total of studied variables. The results in this work show the potential of the use of computer tools can be useful as a support tool for diagnosis of occupational disease.

Statistics from

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.