Monday, 4 September 2006
9.00 – 9.30 Opening
Welcome by the Programme Committee Chairs
Session 1 ETL Processing
9.30 – 11.00 Chair: Juan Trujillo
ETLDiff: A Semi-Automatic Framework for Regression Test of ETL Software
Christian Thomsen, Torben Bach Pedersen
Applying Transformations to Model Driven Data Warehouses
Jose-Norberto Mazon, Jesus Pardillo, Juan Trujillo
Bulk loading a Linear Hash File
Davood Rafiei, Cheng Hu
11.00 – 11.30 Coffee Break
Invited Talk DEXA Conference
11.30 – 12.30 Chair:
From Extreme Programming to Extreme Non-Programming: Is It Really the Time of Model Transformation Technology?
Oscar Pastor
12.30 – 14.00 Lunch
Session 2 Materialized View
14.00 – 15.30 Chair: Ladjel Bellatreche
Dynamic View Selection for OLAP
Michael Lawrence, Andrew Rau-Chaplin
Preview: Optimizing View Materialization Cost in Spatial Data Warehouses
Songmei Yu, Vijay Atluri, Nabil Adam
Preprocessing for Fast Refreshing Materialized Views in DB2
Wugang Xu, Calisto Zuzarte, Dimitri Theodoratos, Wenbin Ma
15.30 – 16.00 Coffee Break
Session 3 Multidimensional Design
16.00 – 17.30 Chair: Matteo Golfarelli
A multiversion-based multidimensional model
Franck Ravat, Olivier Teste, Gilles Zurfluh
Towards Multidimensional Requirement Design
Estella Annoni, Franck Ravat, Olivier Teste, Gilles Zurfluh
Multidimensional Design by Examples
Oscar Romero and Alberto Abelló
Tuesday, 5 September 2006
Session 4 OLAP & Multidimensional Model
9.00 – 11.00 Chair: A Min Tjoa
Extending Visual OLAP for Handling Irregular Dimensional Hierarchies
Svetlana Mansmann and Marc H. Scholl
A Hierarchy-Driven Compression Technique for Advanced OLAP Visualization of Multidimensional Data Cubes
Alfredo Cuzzocrea, Domenico Saccà, Paolo Serafino
Analysing multi-dimensional data across autonomous Data Warehouses
Stefan Berger and Michael Schrefl
What Time is it in the Data Warehouse?
Stefano Rizzi, Matteo Golfarelli
11.00 – 11.30 Coffee Break
Keynote EC-Web Conference
11.30 – 12.30 Chair:
12.30 – 14.00 Lunch
Session 5 Cubes Processing
14.00 – 15.30 Chair: Pedro Furtado
Computing Iceberg Quotient Cubes with Bounding
Xiuzhen Zhang, Pauline Chou, Kotagiri Ramamohanarao
An Effective Algorithm to Extract Dense Sub-Cubes from a Large Sparse Cube
Seok-Lyong Lee
On the Computation of Maximal-Correlated Cuboids Cells
Ronnie Alves, Orlando Belo
15.30 – 16.00 Coffee Break
Session 6 Data Warehouse Applications
16.00 – 18.00 Chair: Andrew Rau-Chaplin
Warehousing dynamic XML documents
Laura Irina Rusu, Wenny Rahayu, David Taniar
Integrating Different Grain Levels in a Medical Data Warehouse Federation
Marko Banek, A Min Tjoa, Nevena Stolba
A Versioning Management Model for Ontology-Based Data Warehouses
Dung Nguyen Xuan, Ladjel Bellatreche, Guy Pierra
Large Data Warehouses in Grids with high QoS
Rogério Luis de Carvalho Costa, Pedro Furtado
Wednesday, 6 September 2006
Session 7 Mining Techniques (1)
9.00 – 11.00 Chair: Vladimir Estivill-Castro
Mining Direct Marketing Data by Ensembles of Weak Learners and Rough Set Methods
Jerzy Blaszczynski, Krzysztof Dembczynski, Wojciech Kotlowski, Mariusz Pawlowski
Efficient Mining of Dissociation Rules
Mikolaj Morzy
Optimized Rule Mining Through a Unified Framework for Interestingness Measures
Céline Hébert, Bruno Crémilleux
An Information-Theoretic Framework for Process Structure and Data Mining
Antonio D. Chiaravalloti, Gianluigi Greco, Antonella Guzzo, Luigi Pontieri
11.00 – 11.30 Coffee Break
Keynote TrustBus Conference
11.30 – 12.30 Chair:
12.30 – 14.00 Lunch
Session 8 Mining Techniques (2)
14.00 – 15.30 Chair: Mikolaj Morzy
Mixed Decision Trees: An Evolutionary Approach
Marek Kretowski, Marek Grzes
ITER: an Algorithm for Predictive Regression Rule Extraction
Johan Huysmans, Bart Baesens, Jan Vanthienen
COBRA: Closed Sequential Pattern Mining Using Bi-phase Reduction Approach
Kuo-Yu Huang, Chia-Hui Chang, Jiun-Hung Tung, Cheng-Tao Ho
15.30 – 16.00 Coffee Break
Session 9 Frequent Itemsets
16.00 – 17.30 Chair: Krzysztof Dembczynski
A Greedy Approach to Concurrent Processing of Frequent Itemset Queries
Pawel Boinski, Marek Wojciechowski, Maciej Zakrzewicz
Two New Techniques for Hiding Sensitive Itemsets and Their Empirical Evaluation
Ahmed HajYasien, Vladimir Estivill-Castro
EStream: Online Mining of Frequent Sets with Precise Error Guarantee
Xuan Hong Dang, Wee-Keong Ng, Kok-Leong Ong
Thursday, 7 September 2006
Session 10 Mining Data Streams
9.00 – 11.00 Chair: Illhoi Yoo
Granularity Adaptive Density Estimation and on Demand Clustering of Concept-Drifting Data Streams
Weiheng Zhu, Jian Pei, Jian Yin, Yihuang Xie
Classification of Hidden Network Streams
Matthew Gebski, Alex Penev, Raymond K. Wong
Adaptive Load Shedding for Mining Frequent Patterns from Data Streams
Xuan Hong Dang, Wee-Keong Ng, Kok-Leong Ong
An Approximate Approach for Mining Recently Frequent Itemsets from Data Streams
Jia-Ling Koh, Shu-Ning Shin
11.00 – 11.30 Coffee Break
Keynote DaWaK Conference
11.30 – 12.30 Chair: A Min Tjoa
xStreaming your BI process
Thierry Winckelmans, Lead Area Architect, Sybase Inc
Roman Miller, Business Development Manager, EMEA Country Distributors, Sybase Inc.
12.30 – 14.00 Lunch
Session 11 Ontology-based Mining
14.00 – 15.30 Chair: Tho Manh Nguyen
Learning Classifiers from Distributed, Ontology-Extended Data Sources
Doina Caragea, Jun Zhang, Jyotishman Pathak, Vasant Honavar
A Coherent Biomedical Literature Clustering and Summarization Approach through Ontology-enriched Graphical Representation
Illhoi Yoo, Xiaohua Hu, Il-Yeol Song
Creating a Lexical Ontology of Abbreviations in the Biomedical literature
Min Song, Il-Yeol Song, KiJung Lee, Michael Bieber
15.30 – 16.00 Coffee Break
Session 12 Clustering
16.00 – 18.00 Chair: Jinyan Li
Priority-based k-anonymity accomplished by dynamic generalization structures
Konrad Stark, Johann Eder, Kurt Zatloukal
Achieving k-Anonymity by Clustering in Attribute Hierarchical Structures
Jiuyong Li, Raymond Chi-Wing Wong, Ada Wai-Chee Fu, Jian Pei
Calculation of Density-Based Clustering Parameters Supported with Distributed Processing
Marcin Gorawski, Rafal Malczok
Cluster-based Sampling Approaches to Imbalanced Data Distributions
Show-Jane Yen, Yue-Shi Lee
Friday, 8 September 2006
Session 13 Advanced Mining Techniques
9.00 – 10.30 Chair: Juan Trujillo
Efficient Mining of Large Maximal Bicliques
Guimei Liu, Kelvin S.H. Sim, Jinyan Li
Automatic Image Annotation by Mining the Web
Zhiguo Gong, Qian Liu, Jingbai Zhang
Privacy Preserving Spatio-Temporal Clustering on Horizontally Partitioned Data
Ali Inan, Yucel Saygin
10.30 – 11.00 Coffee Break
Session 14 Association Rules
11.00 – 12.30 Chair: Reda Alhajj
Discovering Semantic Sibling Associations from Web Documents with XTREEM-SP
Marko Brunzel, Myra Spiliopoulou
Difference Detection between Two Contrast Sets
Shichao Zhang, Yongsong Qin, Chengqi Zhang
EGEA: A new hybrid approach towards extracting reduced generic association rule set (Application to AML blood cancer therapy)
M. A. Esseghir, G. Gasmi, S. Ben Yahia, E. Mephu Nguifo
12.30 – 14.00 Lunch
Session 15 Miscellaneous Applications
14.00 – 15.30 Chair: Jose Zubcoff
AISS: An Index for Non-TimeStamped Set Subsequence Queries
Witold Andrzejewski, Tadeusz Morzy
A method for feature selection on microarray data using support vector machine
Xiaobin Huang, Jian Tang
Providing Persistence for Sensor Data Streams by Remote WAL
Hideyuki Kawashima, Michita Imai, Yuichiro Anzai
15.30 – 16.00 Coffee Break
Session 16 Classification
16.00 – 18.00 Chair: A Min Tjoa
Support Vector Machine Approach for Fast Classification
Keivan Kianmehr, Reda Alhajj
Document Representations for Classification of Short Web-Page Descriptions
Milos Radovanović, Mirjana Ivanović
Garc: a new associative classification approach
Ines Bouzouita, Samir Elloumi, Sadok Ben Yahia
Conceptual Modeling for Classification Mining In Data Warehouses
Jose Zubcoff, Juan Trujillo