Seminar: Managing Information on the Web


Tova Milo, Winter 2009


Seminar Information

The seminar focuses on managing, analyzing, sharing, and integrating data and applications across multiple sources, whether on the Internet or within enterprises. This topic has received much attention in the database, AI, Web, IR, and verification communities. We shall read recent papers in this area, focusing on several specific issues, and then explore possible future directions. A tentative list of topics and papers follows.




         Web Search

1.      Using Trees to Depict a Forest, Bin Liu, H.V. Jagadish, VLDB '09 [ Slava Novgorodov, 5/11 ]

2.      Automated creation of a forms-based database query interface, Jayapandian, Jagadish, VLDB '08 [ Gorelik Ilia 12/11 ]

3.      IRLbot: Scaling to 6 Billion Pages and Beyond, Lee et al., WWW '08 Best Paper [ Momi Sabag 19/11 ]




1.      Annotated XML: queries and provenance, Foster, Green, Tannen, PODS '08 [ Afengar Eli 26/11 ]

2.      Believe It or Not: Adding Belief Annotations to Databases, Gatterbauer, Balazinska, Khoussainova, Suciu, VLDB '09 [ Yuval Rochman 3/12 ]

3.      On the Expressiveness of Implicit Provenance in Query and Update Languages, Buneman, Cheney, Vansummeren, ICDT '07


         Emerging Technologies

1.      Consistency Rationing in the Cloud: Pay only when it matters, Kraska, Hentschel, Alonso, Kossmann, VLDB '09 [ Meni Livne 10/12 ]

2.      Group Recommendation: Semantics and Efficiency, Amer-Yahia, Roy, Chawla, Das, Yu, VLDB '09 [ Dionis Teshler 17/12 ]

3.      Class-based graph anonymization for social network data, Cormode, Srivastava, Bhagat, Krishnamurthy, VLDB '09 [ Shay Houri 24/12 ]

4.      Improved Search for Socially Annotated Data, Sarkas, Das, Koudas, VLDB '09 [ Alexey Zagalsky 31/12 ]

         Probabilistic Data

1.      MCDB: A Monte Carlo Approach to Managing Uncertain Data, Haas, Wu, Xu, Jampani, Jermaine, Perez, SIGMOD '08 [ Alon Margalit 7/1 ]

2.      Access control over uncertain data, Vibhor Rastogi, Dan Suciu, Evan Welbourne, VLDB '08 [ Eyal Widder 14/1 ]

3.      A Unified Approach to Ranking in Probabilistic Databases, VLDB '09 Best Paper, Li, Saha, Deshpande [ Shaine (Eugene) Mednikov 21/1 ]

4.      Approximate lineage for probabilistic databases, Re, Suciu, VLDB '08

5.      BayesStore: Managing Large, Uncertain Data Repositories with Probabilistic Graphical Models, Wang et al., VLDB '08



1.      Dictionary-based Order-preserving String Compression for Main Memory Column Stores, Carsten Binnig, Hildenbrand, Faerber, SIGMOD '09

2.      Self-organizing Tuple Reconstruction in Column-stores, Stratos Idreos, Martin Kersten, Stefan Manegold, SIGMOD '09