Using WEKA in the CSLab

Tom Bylander

Department of Computer Science



Weka is a collection of machine learning algorithms that is implemented in Java. Weka includes algorithms for classification, numeric prediction, clustering, and association rules. This document provides information about the installation and usage for faculty and students, especially for the Data Mining course.

Installed Software

Weka is installed on on the CS Sun Network in the directory /home/bylander/weka. Documentation for weka is in this directory and in the doc subdirectory. The documentation includes the following:

Environment Variables

This describes how to run Weka in combination with DB2.

tcsh and csh

When you do:

source /db2home/db2inst1/sqllib/db2cshrc
as suggested by the DB2 document, it does bad things to your CLASSPATH environmental variable. I suggest that you put the following sequence of commands in your .cshrc file.
source /db2home/db2inst1/sqllib/db2cshrc
setenv WEKAHOME /home/bylander/weka
unset CLASSPATH
setenv CLASSPATH .:/home/bylander/weka/weka.jar:/db2home/db2inst1/sqllib/java/sqlj.zip:/db2home/db2inst1/sqllib/function:/db2home/db2inst1/sqllib/java/db2java.zip:/db2home/db2inst1/sqllib/java/runtime.zip
This should get you all the Java objects for Weka and DB2, though I have to admit that I haven't tried to use any of the DB2 Java.

ksh

For ksh, all you should need to do is to add:

WEKAHOME=/home/bylander/weka
CLASSPATH=/home/bylander/weka/weka.jar:$CLASSPATH
to your .profile file, but I haven't tested this.

Running Weka

The README file has several examples of commands to try.



Last updated on 8/30/01 by bylander@cs.utsa.edu