## Schedule for Seminar on Massive Data Sets

Please check the special announcements frequently.
If you would like copies of papers presented here, send
a request to
matias+seminar@math.tau.ac.il

### Association rules and their generalizations

**24/3/98**

Fast Algorithms for Mining Association Rules

R. Agrawal and R. Srikant.

*Proc. of the 20th Int'l Conference on Very Large Databases*, Santiago,
Chile, Sept. 1994. Expanded version
available as IBM Research Report RJ9839, June 1994.

Lecturer: **Rami Citrom**

Dynamic Itemset Counting and Implication Rules for Market Basket Data

S. Brin, R. Motwani, J.D. Ullman, S. Tsur.

*1997 ACM SIGMOD Conference on Management of Data*, 1997, pp. 255-264.

Lecturer: **Ranen Goren**

**31/3/98**

Sampling large databases for association rules

H. Toivonen

*Proc. of the Int'l Conf. on Very Large Data Bases (VLDB)*, 1996.

Lecturer: **Amitai Irron**
The presentation is available in
ppt;
handouts are given in
doc.

Beyond Market Baskets: Generalizing
Association Rules to Correlations.

S. Brin, R. Motwani and C. Silverstein.

*1997 ACM SIGMOD Conference on Management of Data*, 1997, pp. 265-276.
Also in *Data Mining and Knowledge Discovery*, 1998.

Lecturer: **Micky Frankel**

### Time Sequences

**7/4/98**

Efficiently Supporting Ad Hoc Queries in
Large Datasets of Time Sequences.

F. Korn, H.V. Jagadish, and C. Faloutsos.

*Proc. SIGMOD*, 1997.

Lecturer: **Zipi Fligelman**

Efficient Retrieval of Similar Time Sequences
Under Time Warping

B.-K. Yi, H.V. Jagadish, and C. Faloutsos.

*Proc. ICDE*, 1998.

Lecturer: **Zipi Fligelman**
A summary is available
here.

**14/4/98**

Happy Passover!

### Histograms

**21/4/98**

Improved Histograms for Selectivity
Estimation of Range Predicates.

V. Poosala, Y.E. Ioannidis, P.J. Haas and E.J. Shekita.

*Proc. 1996 ACM SIGMOD Intl. Conf. Management of Data*, pages
294-305, 1996.

Lecturer: **Ran Shaham**
The presentation is available in
doc.

Fast Incremental Maintenance of
Approximate Histograms.

P.B. Gibbons, Y. Matias and V. Poosala.

*Proceedings of the 23rd International Conference on Very Large
Databases* (VLDB),
Athens, Greece, August 1997, pp. 466-475.

Lecturer: **Uri Stav**

### Special Talk (note the unusual schedule)

**22/4/98, 2:15-3:15**

Compressing Finite Strings:
Optimal algorithms for non-asymptotic/non-probabilistic data

Guest Lecturer: **Dr. S. Cenk Sahinalp**

### Histograms (cont.)

**28/4/98**

Approximate Order Statistics in One Pass and with Limited Memory

Sridhar Rajagopalan, Gurmeet Singh Manku, and Bruce Lindsay

*Proc. ACM SIGMOD*, 1998 (to appear).

Lecturer: **Gil Arditi**
The presentation is available in
doc.

**5/5/98**

Wavelet-Based Histograms for
Selectivity Estimation.

Y. Matias, J. S. Vitter, and M. Wang.

*Proc. of the 1998 ACM SIGMOD International Conference on
Management of Data (SIGMOD '98)*, Seattle, Washington, June 1998 (to appear).

Lecturer: **Ran Adler**

### Clustering techniques

**12/5/98**

BIRCH: an Efficient Data Clustering
Method for Very Large Databases.

T. Zhang, R. Ramakrishnan and M. Livny.

*Proc. 1996 SIGMOD*, pp. 103-114, 1996.

See also the BIRCH
project home page

Lecturer: **Boaz Shaham**
The presentation is available in
ppt.

CURE: An Efficient Clustering Algorithm for Large Databases.

S. Guha, R. Rastogi and K. Shim.

*Proc. of the ACM SIGMOD Conference*, 1998.

Lecturer: **Yakov Zakai**
The presentation is available in
doc.

### Index Trees

**19/5/98**

Generalized Search Trees for Database Systems.

J.M. Hellerstein, J.F. Naughton, and A. Pfeffer.

*Proc. 21st International Conference on Very Large Data Bases*
(VLDB), Zurich, September 1995.

See also the GIST
project home page

Lecturer: **Assaf Almaz**

The speaker will also present indexing problems and solutions
from (real-life) products related to massive document processing.
The presentation is available in
doc.

### Parallel and External Memory Algorithms

**26/5/98**

Asynchronous Parallel Algorithms for Mining Association Rules on a
Shared-memory Multi-processors

D.W. Cheung, K. Hu, and S. Xia.

*Proc. 10th Annual ACM Symposium on Parallel Algorithms and
Architectures (SPAA '98)*, June 1998 (to appear).

Lecturer: **Saar Cohen**

Simple Randomized Mergesort on Parallel Disks

R. Barve, E. F. Grove and J. S. Vitter.

*Proc. 8th Annual ACM Symposium on Parallel Algorithms and
Architectures (SPAA '96), Padua, Italy, June 1996, 109-118.*

Lecturer: **Ido Safruti**
The presentation is available in
doc.

**2/6/98**

No Seminar.

**9/6/98**

High-Performance Sorting on Networks of Workstations

A. C. Arpaci-Dusseau, R. H. Arpaci-Dusseau, D. E. Culler,
J. M. Hellerstein, D. A. Patterson.

*Proc. of the ACM SIGMOD Conference*, 1997.

Lecturer: **Nadav Grossaug**


For requests or corrections, contact
matias+seminar@math.tau.ac.il

Last updated *June 7, 1998*