Faculty:
Tom Bylander (email: bylander@cs.utsa.edu
)
Stephen Kwek (email: kwek@cs.utsa.edu
)
Current Students:
-
Chau Nguyen (Thesis: iBoost, Boosting Using an instance
Based Exponential Weighting Scheme)
-
Michael Jon Maltrud (Thesis: Multiclass Predictions Using Weighted Error
Correcting Output Codes)
-
John Andrichak IV
-
Matthew Grantz
-
Trung Nguyen
-
Lisa Tate
Seminar Schedule:
Projects:
Technical Reports:
Useful Computational Research Links:
Funding: National Science Foundation, CCR-0208935 (PI: Kwek)
Projects
Theoretical Project 1: Foundation of Multiple Classifier
Systems and Active Learning
Computational learning theory (COLT) research has made significant progress
in our understanding of concept learning from sampled labeled instances.
Although many efficient algorithms were constructed for learning various
classes of concepts, these concept classes are extremely simple. Further,
numerous hardness results seem to suggest that learning more expressive
concept classes seems to be extremely difficult. Two research directions
have evolved in an effort to address the difficulty of learning more expressive
concepts. One is to construct multiple classifier systems (MCS) and the
other is to consider the possibility of actively interacting with the environment
(a.k.a. active learning). Currently, due to the success of Adaboost,
theoretical research on MCS (and in fact machine learning in general) tends
to focus rather narrowly on ensemble methods. Similarly, active learning
research has been concentrated on the use of membership queries, and to
some extent the teaching model. Other types of MCS and active learning
that may lead to more intelligent learning systems have not been well studied.
A good understanding of how to build more sophisticated MCS and exploit
other possibilities of extracting information from the environment will
move us one step closer to achieving the original intent of machine learning,
which is to automate the knowledge acquisition process.
There are many research problems to consider. The following is a list
of some of the problems that this project will examine. The list is
not intended to be exhaustive, but rather gives some concrete starting
points that are illustrative of the type of research problems to be pursued.
Other problems may also be studied in this project. This list also serves
to demonstrate some of the work that the PI has done in MCS and active
learning. The PI has previously investigated these problems, but many important
issues remain to be investigated. Although the main approach in this research
is that of COLT, empirical studies will be performed to test the efficacy
of the algorithms and heuristics that are derived from the research.
MCS 1: Learning When to Trust Which Experts: If we have a number
of experts (or learning algorithms) each having its own strengths and weaknesses,
how can a learner learn the conditions in which each expert's predictions
can be trusted? Such an understanding will give rise to better ways of
combining the experts' advice than simply taking a weighted vote as done
in ensemble methods.
MCS 2: Learning Multiple Intermediate Concepts: The target concept
may be dependent on or correlated to some other related intermediate concepts.
How do we exploit such dependencies and learn these intermediate concepts
so that we can use them to enhance our approximation of the target concept?
Although such situations occur quite often in practice, most research work
concentrates on learning a single target concept.
Active Learning 1: Learning from Data with Unspecified Attribute
Values: Almost all work in COLT assumes that the attribute values of the
instances are totally specified. However, it is common in practice that
the instances are often partially specified. How can a learner learn in
the presence of unspecified attribute values where an instance can be classified
as 'undeterminable' (based only on those attributes with specified values)?
Active Learning 2: Learning by Observing an Expert Perform the
Classification Task: Instead of having a teacher who actively teaches the
learner, how can a learner learn by observing an expert perform the classification
task? This provides a better alternative than the current teaching model
which assumes that the student is unwilling to learn (so as to circumvent
the "outright-encoding'' problem).
For more details, please click here
.
References:
-
Sally Goldman, Stephen Kwek and Stephen Scott, Learning
From Examples With Unspecified Attribute Values. To appear in Information
and Computation 2002. An extended abstract appeared in Proceedings
of the10th Annual ACM conference on Computational Learning Theory,
page 231-242, ACM Press, New York, NY 1997.
-
Stephen Kwek, Learning Intermediate Concepts.
Proc . of the 12th International Conference on Algorithmic Learning
Theory . Volume 2225 of Lecture Notes in Artificial Intelligence. November
25-28, 2001. Pages 151-166, 2001
-
Stephen Kwek, An Apprentice Learning
Model . Proceedings of the 12th Annual ACM conference on Computational
Learning Theory. pages 63-74, ACM Press, New York, NY 1999.
-
David Helmbold, Stephen Kwek and Leonard Pitt, Learning
When to Trust Which Experts. Proceedings of the 3rd European Conference
on Computational Learning Theory, page 134-150, Lecture Notes in Artificial
Intelligence, No. 1208, Springer, 1997.
Theoretical Project 2: Geometric Concept Learning
Geometric concepts play a major role in machine learning. Neural networks
with linear threshold gates can be interpreted as geometrical concepts
with linear bounding hyper-planes. Early work on perceptron learning was
in essence an investigation into the learnability of linear half-spaces.
Further, geometrical concepts like convex polyhedra and unions of axis-parallel
boxes can be viewed as a continuous-valued relaxation of boolean functions
that have short representations as DNF or CNF formula. The efficient learnability
of the latter concept classes is a challenging open problem in computational
learning theory.
Various results have shown that geometric concepts in arbitrary dimensional
space, in particular convex polytopes, are hard to learn in most traditional
learning models. One objective of this project is to examine how convex
polytopes can be learned with a richer way of interacting with the environment
besides using membership queries. Another objective is to investigate how
low dimensional geometric concepts can be learned.
References:
-
Sally Goldman, Stephen Kwek and Stephen Scott, Agnostic
Learning of Geometric Patterns . Journal of Computer and System
Science, 62(1):123-151, 2001.
-
Stephen Kwek and Leonard Pitt, PAC Learning Intersections
of Halfspaces with Membership Queries . Algorithmica, Special Issue
on Computational Learning Theory, 22 (1998) pp. 53-75.
-
Paul Goldberg and Stephen Kwek, The
Precision of Query Points as a Resource for Learning Convex Polytopes with
Membership Queries . Proc. of 13th Annual Anniversary on Computational
Learning Theory , Morgan Kauffman, pages 225-235, 2000.
-
Stephen Kwek, An Efficient Algorithm for
Learning Upper Convex Polytopes Using Membership Queries. In Proc.
of 6th International Symposium on Math and Artificial Intelligence
, 2000.
-
Peter Auer, Stephen Kwek, Wolfgang Maass and Manfred Warmuth, Learning
of Depth Two Neural Networks with Constant Fan-in at the Hidden Nodes
. Proceedings of the 9th Annual ACM Conference on Computational Learning
Theory, page 244-254, ACM Press, New York, NY 1996.
-
Stephen Kwek, Geometric Concept Learning
and Related Topics (Ph. D. Thesis). University of Illinois at Urbana-Champaign,
Department of Computer Science, Technical report UIUCDCS-R-97-1980. 1997
For more details, please click here
.
Theoretical Project 3: Learning Linear Threshold
Functions
... Later ...
References:
-
Bylander, The Binary Exponentiated Gradient Algorithm for Learning Linear
Functions, Proc. 10th Annu. Conf. on Comput. Learning Theory, pp. 184-192,
ACM Press, New York, NY, 1997.
-
Bylander, Estimating Generalization Error on Two-Class Datasets Using Out-of-Bag
Estimates, Machine Learning, Vol. 48, Number 1/3, pp. 287-297, Kluwer Academic
Publishers, Boston, 2002.
-
Bylander, Learning linear threshold functions in the presence of classification
noise, Proc. 7th Annu. ACM Conf. on Comput. Learning Theory, pp.
340-347, ACM Press, New York, NY, 1994.
-
Bylander, Learning Probabilistically Consistent Linear Threshold Functions,
Proc. 10th Annu. Conf. on Comput. Learning Theory, pp. 62-71, ACM Press,
New York, NY, 1997.
-
Bylander, Polynomial learnability of linear threshold approximations, Proceedings
of the Sixth Annual ACM Conference on Computational Learning Theory, pp.
297-302, ACM Press, New York, NY, 1993
Theoretical Projects: Miscellaneous Topics
... Later ...
References:
-
Sally Goldman and Stephen Kwek, On
Learning Unions of Pattern Languages and Tree Patterns in the Mistake Bound
Model . To appear in Journal of Theoretical Computer Science special
issue on Algorithmic Learning Theory 2002. An extended abstract appeared
in Proceedings of the 10th International Conference on Algorithmic Learning
Theory . page 347-363, Lecture Notes in Artificial Intelligence, No.
1720, Springer, 1999.
-
Daniel Dooly, Sally Goldman and Stephen Kwek, Real-Valued
Multiple-Instance Learning with Queries . Proc. of 12th International
Conference on Algorithmic Learning Theory . Volume 2225 of Lecture
Notes in Artificial Intelligence. November 25-28, 2001. Pages 167-180,
2001.
-
Stephen Kwek, On a Simple Depth-First
Search Strategy for Exploring Unknown Graph . Proceedings of the
5th International Workshop in Algorithms and Data Structures, page
345-353, Lecture Notes in Computer Science, No. 1272, Springer, 1997.
-
Stephen Kwek, Learning Disjunctions of
Features . Proceedings of the 8th Int. Conf. on Algorithmic Learning
Theory , page 401-415, Lecture Notes in Artificial Intelligence, No.1316,
Springer, 1997.
Empirical Project: Instance Based Weighting
Schemes in Ensemble Methods
Recent research in classification problems has mostly concentrated on ensemble
methods that construct a set of base classifiers instead of a single classifier.
An unlabeled instance is then classified by taking a vote of the base classifiers'
predictions of its class label. Ensemble methods like Bagging and AdaBoost
have been shown to outperform the individual base classifiers when the
base
inducer that produces the base classifiers is unstable. An inducer
is said to be unstable if a slight change in the training examples
results in a very different classifier being constructed. Further, the
idea of ensemble methods also gives rise to the use of error-correcting
output code technique in enhancing accuracy in multi-class classification
problems.
In ensemble methods, the vote of each base classifier either receives
the same weight (e.g. Bagging and Arcing) or is weighted according to the
estimated error rate (e.g. AdaBoost). Based on my recent work, I am proposing
an instance-based approach of assigning weights to the base classifiers
that may enhance the performance of ensemble methods.
Instead of assigning a weight to a base classifier that is fixed for
all instances, we attempt to assign a weight that is based upon how well
we think the base classifier is going to predict on the label of the test
instance. Intuitively, given an unlabeled instance, if there is some indication
that the base classifier's prediction is correct then we should increase
its weight. Otherwise, we should reduce its weight. In our preliminary
investigation, on nine UC Irvine datasets where AdaBoost did not perform
well, with accuracy between 70% to 80%, we manage to improve the prediction
accuracy on all datasets with an average improvement of more than 1%. This
suggests that such instance-based weighting scheme does enhance the accuracy
of ensemble methods. The implementation is currently still quite crude.
We believe it can be further fine-tuned to achieve an even higher accuracy.
This project explores instance-based weighting schemes further so as to
enhance existing ensemble methods and their derivatives.
References:
-
Stephen Kwek and Chau Nguyen. iBoost:
Boosting with an instance-based voting scheme
. To appear in the 13th European Conference on Machine Learning 2002.
-
David Helmbold, Stephen Kwek and Leonard Pitt, Learning
When to Trust Which Experts . Proceedings of the 3rd European Conference
on Computational Learning Theory, page 134-150, Lecture Notes in Artificial
Intelligence, No. 1208, Springer, 1997.
For more information, please click here
.
Application Projects:
Our research group is actively seeking interesting applications.
References:
-
Dragoljub Pokrajac, Stephen Kwek, Zoran Obradovic, Tim Fiez and Dragan
Obradovic, Distribution Comparison
for Site-specific Regression Modeling in Agriculture. Proceedings
of the International Joint Conference on Neural Networks , page 346-352,
1999.
Seminar Schedule
Summer 2002 (TENTATIVE)
Meeting Time : Friday 12:30 - 2 pm at the department conference room SB
3.02.01
The topics and presenters listed below are tentative and may subject
to change .
|
Date
|
Speaker |
Topics
|
| 5 / 24 / 02 |
Kwek |
An Overview of Boosting |
| 5 / 31 / 02 |
Bylander |
Introduction to Naive Bayes Method |
| 6 / 7 / 02 |
Andrichak |
Bagging Predictors |
| 6 / 14 / 02 |
Maltrud |
Enhancing Multiclass Predictions Using Error Correcting
Output Codes |
| 6 / 21 / 02 |
Chau |
iBoost: Boosting using an instance-based exponential
weighting scheme |
| 6 / 28 / 02 |
Grantz |
Boosting Methodology for Regression Problems |
| 7 / 5 / 02 |
Tate ??? |
TBA (Dr. Kwek will be out of town) |
| 7 / 12 / 02 |
Trung ??? |
TBA (Dr. Kwek will be out of town) |
| 7 / 19 / 02 |
TBA |
TBA |
| 7 / 26 / 02 |
TBA |
TBA |
| 8 / 2 / 02 |
TBA |
TBA |
| 8 / 9 / 02 |
TBA |
TBA |
| 8 / 16 / 02 |
Maltrud |
Comparative Studies on Boosting, Bagging and Stacking ??? |
| 8 / 23 / 02 |
|
BREAK before Fall 02 semester starts. |
Technical Reports