Statistical techniques vs. SEE5 algorithman application to a small business environment

  1. Andrés Suárez, Javier de 1
  1. 1 Universidad de Oviedo
    info

    Universidad de Oviedo

    Oviedo, España

    ROR https://ror.org/006gksa02

Journal:
The International Journal of Digital Accounting Research

ISSN: 1577-8517

Year of publication: 2001

Volume: 1

Issue: 2

Pages: 153-179

Type: Article

More publications in: The International Journal of Digital Accounting Research

Abstract

The aim of this research is to compare the accuracy of a rule induction classifier system -Quinlan's SEE5- with linear discriminant analysis and logit. The classification task chosen is the differentiation of the most efficient companies from the least efficient ones on the basis of a set of financial variables. The sample consists of a database containing the annual accounts of the companies located in the Principality of Asturias (Spain), which are mainly small businesses. The main results indicate that SEE5 outperforms logit, but it is not clearly better than discriminant analysis. However, SEE5 models suffer from bigger increases in error rates when tested with validation samples. Another interesting finding is that in SEE5 systems both the number of variables selected and the number of rules inferred grow when sample size increases.