DEXA 2006

DaWaK Program

Monday, 4 September 2006

9.00 – 9.30 Opening

Welcome by the Programme Committee Chairs

Session 1 ETL Processing
9.30 – 11.00 Chair: Juan Trujillo

ETLDiff: A Semi-Automatic Framework for Regression Test of ETL Software
Christian Thomsen, Torben Bach Pedersen

Applying Transformations to Model Driven Data Warehouses
Jose-Norberto Mazon, Jesus Pardillo, Juan Trujillo

Bulk loading a Linear Hash File
Davood Rafiei, Cheng Hu

11.00 – 11.30 Coffee Break

Invited Talk DEXA Conference
11.30 – 12.30 Chair:
From Extreme Programming to Extreme Non-Programming: Is It Really the Time of Model Transformation Technology?
Oscar Pastor

12.30 – 14.00 Lunch

Session 2 Materialized View
14.00 – 15.30 Chair: Ladjel Bellatreche

Dynamic View Selection for OLAP
Michael Lawrence, Andrew Rau-Chaplin

Preview: Optimizing View Materialization Cost in Spatial Data Warehouses
Songmei Yu, Vijay Atluri, Nabil Adam

Preprocessing for Fast Refreshing Materialized Views in DB2
Wugang Xu, Calisto Zuzarte, Dimitri Theodoratos, Wenbin Ma

15.30 – 16.00 Coffee Break

Session 3 Multidimensional Design
16.00 – 17.30 Chair: Matteo Golfarelli

A multiversion-based multidimensional model
Franck Ravat, Olivier Teste, Gilles Zurfluh

Towards Multidimensional Requirement Design
Estella Annoni, Franck Ravat, Olivier Teste, Gilles Zurfluh

Multidimensional Design by Examples
Oscar Romero and Alberto Abelló

Tuesday, 5 September 2006

Session 4 OLAP & Multidimensional Model
9.00 – 11.00 Chair: A Min Tjoa

Extending Visual OLAP for Handling Irregular Dimensional Hierarchies
Svetlana Mansmann and Marc H. Scholl

A Hierarchy-Driven Compression Technique for Advanced OLAP Visualization of Multidimensional Data Cubes
Alfredo Cuzzocrea, Domenico Saccà, Paolo Serafino

Analysing multi-dimensional data across autonomous Data Warehouses
Stefan Berger and Michael Schrefl

What Time is it in the Data Warehouse?
Stefano Rizzi, Matteo Golfarelli

11.00 – 11.30 Coffee Break

Keynote EC-Web Conference
11.30 – 12.30 Chair:

12.30 – 14.00 Lunch

Session 5 Cubes Processing
14.00 – 15.30 Chair: Pedro Furtado

Computing Iceberg Quotient Cubes with Bounding
Xiuzhen Zhang, Pauline Chou, Kotagiri Ramamohanarao

An Effective Algorithm to Extract Dense Sub-Cubes from a Large Sparse Cube
Seok-Lyong Lee

On the Computation of Maximal-Correlated Cuboids Cells
Ronnie Alves, Orlando Belo

15.30 – 16.00 Coffee Break

Session 6 Data Warehouse Applications
16.00 – 18.00 Chair: Andrew Rau-Chaplin

Warehousing dynamic XML documents
Laura Irina Rusu, Wenny Rahayu, David Taniar

Integrating Different Grain Levels in a Medical Data Warehouse Federation
Marko Banek, A Min Tjoa, Nevena Stolba

A Versioning Management Model for Ontology-Based Data Warehouses
Dung Nguyen Xuan, Ladjel Bellatreche, Guy Pierra

Large Data Warehouses in Grids with high QoS
Rogério Luis de Carvalho Costa, Pedro Furtado

Wednesday, 6 September 2006

Session 7 Mining Techniques (1)
9.00 – 11.00 Chair: Vladimir Estivill-Castro

Mining Direct Marketing Data by Ensembles of Weak Learners and Rough Set Methods
Jerzy Blaszczynski, Krzysztof Dembczynski, Wojciech Kotlowski, Mariusz Pawlowski

Efficient Mining of Dissociation Rules
Mikolaj Morzy

Optimized Rule Mining Through a Unified Framework for Interestingness Measures
Céline Hébert, Bruno Crémilleux

An Information-Theoretic Framework for Process Structure and Data Mining
Antonio D. Chiaravalloti, Gianluigi Greco, Antonella Guzzo, Luigi Pontieri

11.00 – 11.30 Coffee Break

Keynote TrustBus Conference
11.30 – 12.30 Chair:

12.30 – 14.00 Lunch

Session 8 Mining Techniques (2)
14.00 – 15.30 Chair: Mikolaj Morzy

Mixed Decision Trees: An Evolutionary Approach
Marek Kretowski, Marek Grzes

ITER: an Algorithm for Predictive Regression Rule Extraction
Johan Huysmans, Bart Baesens, Jan Vanthienen

COBRA: Closed Sequential Pattern Mining Using Bi-phase Reduction Approach
Kuo-Yu Huang, Chia-Hui Chang, Jiun-Hung Tung, Cheng-Tao Ho

15.30 – 16.00 Coffee Break

Session 9 Frequent Itemsets
16.00 – 17.30 Chair: Krzysztof Dembczynski

A Greedy Approach to Concurrent Processing of Frequent Itemset Queries
Pawel Boinski, Marek Wojciechowski, Maciej Zakrzewicz

Two New Techniques for Hiding Sensitive Itemsets and Their Empirical Evaluation
Ahmed HajYasien, Vladimir Estivill-Castro

EStream: Online Mining of Frequent Sets with Precise Error Guarantee
Xuan Hong Dang, Wee-Keong Ng, Kok-Leong Ong

Thursday, 7 September 2006

Session 10 Mining Data Streams
9.00 – 11.00 Chair: Illhoi Yoo

Granularity Adaptive Density Estimation and on Demand Clustering of Concept-Drifting Data Streams
Weiheng Zhu, Jian Pei, Jian Yin, Yihuang Xie

Classification of Hidden Network Streams
Matthew Gebski, Alex Penev, Raymond K. Wong

Adaptive Load Shedding for Mining Frequent Patterns from Data Streams
Xuan Hong Dang, Wee Keong Ng, Kok Leong Ong

An Approximate Approach for Mining Recently Frequent Itemsets from Data Streams
Jia-Ling Koh, Shu-Ning Shin

11.00 – 11.30 Coffee Break

Keynote DaWaK Conference
11.30 – 12.30 Chair: A Min Tjoa

xStreaming your BI process
Thierry Winckelmans, Lead Area Architect, Sybase Inc
Roman Miller, Business Development Manager, EMEA Country Distributors, Sybase Inc.

12.30 – 14.00 Lunch

Session 11 Ontology-based Mining
14.00 – 15.30 Chair: Tho Manh Nguyen

Learning Classifiers from Distributed, Ontology-Extended Data Sources
Doina Caragea, Jun Zhang, Jyotishman Pathak, Vasant Honavar

A Coherent Biomedical Literature Clustering and Summarization Approach through Ontology-enriched Graphical Representation
Illhoi Yoo, Xiaohua Hu, Il-Yeol Song

Creating a Lexical Ontology of Abbreviations in the Biomedical literature
Min Song, Il-Yeol Song, KiJung Lee, Michael Bieber

15.30 – 16.00 Coffee Break

Session 12 Clustering
16.00 – 18.00 Chair: Jinyan Li

Priority-based k-anonymity accomplished by dynamic generalization structures
Konrad Stark, Johann Eder, Kurt Zatloukal

Achieving k-Anonymity by Clustering in Attribute Hierarchical Structures
Jiuyong Li, Raymond Chi-Wing Wong, Ada Wai-Chee Fu, Jian Pei

Calculation of Density-Based Clustering Parameters Supported with Distributed Processing
Marcin Gorawski, Rafal Malczok

Cluster-based Sampling Approaches to Imbalanced Data Distributions
Show-Jane Yen, Yue-Shi Lee

Friday, 8 September 2006

Session 13 Advanced Mining Techniques
9.00 – 10.30 Chair: Juan Trujillo

Efficient Mining of Large Maximal Bicliques
Guimei Liu, Kelvin S.H. Sim, Jinyan Li

Automatic Image Annotation by Mining the Web
Zhiguo Gong, Qian Liu, Jingbai Zhang

Privacy Preserving Spatio-Temporal Clustering on Horizontally Partitioned Data
Ali Inan, Yucel Saygin

10.30 – 11.00 Coffee Break

Session 14 Association Rules
11.00 – 12.30 Chair: Reda Alhajj

Discovering Semantic Sibling Associations from Web Documents with XTREEM-SP
Marko Brunzel, Myra Spiliopoulou

Difference Detection between Two Contrast Sets
Shichao Zhang, Yongsong Qin, Chengqi Zhang

EGEA: A new hybrid approach towards extracting reduced generic association rule set (Application to AML blood cancer therapy)
M. A. Esseghir, G. Gasmi, S. Ben Yahia, E. Mephu Nguifo

12.30 – 14.00 Lunch

Session 15 Miscellaneous Applications
14.00 – 15.30 Chair: Jose Zubcoff

AISS: An Index for Non-TimeStamped Set Subsequence Queries
Witold Andrzejewski, Tadeusz Morzy

A method for feature selection on microarray data using support vector machine
Xiaobin Huang, Jian Tang

Providing Persistence for Sensor Data Streams by Remote WAL
Hideyuki Kawashima, Michita Imai, Yuichiro Anzai

15.30 – 16.00 Coffee Break

Session 16 Classification
16.00 – 18.00 Chair: A Min Tjoa

Support Vector Machine Approach for Fast Classification
Keivan Kianmehr, Reda Alhajj

Document Representations for Classification of Short Web-Page Descriptions
Milos Radovanović, Mirjana Ivanović

Garc: a new associative classification approach
Ines Bouzouita, Samir Elloumi, Sadok Ben Yahia

Conceptual Modeling for Classification Mining In Data Warehouses
Jose Zubcoff, Juan Trujillo
