Uma proposta para identificação de outliers multivariados

Ciência E Natura

Endereço:
Revista Ciência e Natura | Campus Sede-Cidade Universitária | Av. Roraima nº 1000, Prédio 13, Sala 1122 | Fone/Fax +55(55) 3220-8735 | Bairro Camobi
Santa Maria / RS
97105-900
Site: http://www.ufsm.br/cienciaenatura
Telefone: (55) 3220-8735
ISSN: 2179-460X
Editor Chefe: Marcelo Barcellos da Rosa
Início Publicação: 30/11/1979
Periodicidade: Quadrimestral

Uma proposta para identificação de outliers multivariados

Ano: 2018 | Volume: 40 | Número: 1
Autores: Josino José Barbosa , Tiago Martins Pereira e Fernando Luiz Pereira de Oliveira
Autor Correspondente: Josino José Barbosa | [email protected]

Palavras-chave: outlier, grouping analysis, monte carlo method

Resumos Cadastrados

Resumo Inglês:

ehe identification of outliers plays an important role in the statistical analysis, since such observations may contain important informationgarding the hypotheses of the study. If classical statistical models are blindly applied to data containing atypical values, the results may be misleading and mistaken decisions can be made. Moreover, in practical situations, the outliers themselves are often the special points of interest and their identification may be the main objective of the investigation. In this way, it was proposed to propose a technique of detection of multivariate outliers, based on cluster analysis and to compare this technique with the method of identification of outliers via Mahalanobis Distance. For data generation, Monte Carlo method simulation and the mixed multivariate normal distribution technique were used. The results presented in the simulations showed that the proposed method was superior to the Mahalanobis method for both sensitivity and specificity, that is, it presented greater ability to correctly diagnose outliers and non-outliers individuals. In addition, the proposed methodology was illustrated with an application in real data from the health are