Department of Computer Science
University of Texas at San Antonio
Schedule:
Alt, H. & Godau, M. Computing the Frechet Distance Between Two Polygonal Curves International Journal of Computational Geometry and Applications, 1995, 5, 75-91
Ben-Or, M. Lower bounds for algebraic computation trees STOC '83: Proceedings of the fifteenth annual ACM symposium on Theory of computing, ACM Press, 1983, 80-86
Maheshwari, A. & Yi, J. On Computing Frechet Distance of Two Paths on a Convex Polyhedron EWCG 2005, 2005 , 41-4
Mitchell, J. Geometric shortest paths and network optimization Handbook of Computational Geometry, 1998
Abstract: Similarity metrics measure how "similar" two objects are to each other. They are useful for authentication, map matching, and shape recognition. These applications rely on using similarity metrics to find an input object's "closest match" in a known collection of objects. For example, given a thumbprint one can find the closest matching thumbprint in a thumbprint database. Although there are many ways to compare objects, both the Hausdorff and Fréchet similarity metrics compare objects based on distance. This means that "similar" objects are close together while "different" objects are far apart. Given that "distance" is key to calculating similarity, it is interesting to change how "distance" is measured. For example, distance can be measured using Lp metrics or with shortest obstacle-avoiding paths (geodesics).
Abstract: Publishing data about individuals without revealing sensitive information about them is an important problem. In recent years, a new definition of privacy called k-anonymity has gained popularity. In a k-anonymized dataset, each record is indistinguishable from at least k-1 other records with respect to certain "identifying" attributes. However, there are two simple attacks that a k-anonymized dataset has some subtle, but severe privacy problems: First, an attacker can discover the values of sensitive attributes when there is little diversity in those sensitive attributes; second, attackers often have background knowledge. Unfortunately k-anonymity does not guarantee privacy against such attackers. Based on the k-anonymity, they propose a novel and powerful privacy definition called l-diversity. In addition to building a formal foundation for l-diversity, they show how to inject utilities to randomized datasets.
Abstract: This paper addresses privacy-preserving classification problem in a distributed system. Randomization has been the approach proposed to preserve privacy in such scenario. However, this approach has been proven to be insecure as it has been discovered that some privacy intrusion techniques can be used to reconstruct private information from the randomized data tuples. So it introduces an algebraic technique- based scheme. Compared to the randomization approach, this scheme can build classifiers more accurately but disclose less private information.
Please send emails to carola@cs.utsa.edu, or seminar co-organizers: Kay Robbins, Weining Zhang, Yufei Huang, Carola Wenk, and Qi Tian.