Data mining in bioinformatics using Weka

Bioinformatics. 2004 Oct 12;20(15):2479-81. doi: 10.1093/bioinformatics/bth261. Epub 2004 Apr 8.

Abstract

The Weka machine learning workbench provides a general-purpose environment for automatic classification, regression, clustering and feature selection-common data mining problems in bioinformatics research. It contains an extensive collection of machine learning algorithms and data pre-processing methods complemented by graphical user interfaces for data exploration and the experimental comparison of different machine learning techniques on the same problem. Weka can process data given in the form of a single relational table. Its main objectives are to (a) assist users in extracting useful information from data and (b) enable them to easily identify a suitable algorithm for generating an accurate predictive model from it.

Availability: http://www.cs.waikato.ac.nz/ml/weka.

MeSH terms

  • Algorithms*
  • Artificial Intelligence*
  • Computational Biology / methods*
  • Database Management Systems*
  • Databases, Factual*
  • Information Storage and Retrieval / methods*
  • Natural Language Processing
  • Software
  • User-Computer Interface*