Where academic tradition
meets the exciting future

Evaluating Multidimensional Visualization Techniques in Data Mining Tasks

Dorina Marghescu, Evaluating Multidimensional Visualization Techniques in Data Mining Tasks. TUCS Dissertations 107. Turku Centre for Computer Science, 2008.

Abstract:

Visual data mining (VDM) tools employ information visualization techniques in order to represent large amounts of high-dimensional data graphically and to involve the user in exploring data at different levels of detail. The users are looking for outliers, patterns and models – in the form of clusters, classes, trends, and relationships – in different categories of data, i.e., financial, business information, etc.

The focus of this thesis is the evaluation of multidimensional visualization techniques, especially from the business user’s perspective. We address three research problems. The first problem is the evaluation of projection-based visualizations with respect to their effectiveness in preserving the original distances between data points and the clustering structure of the data. In this respect, we propose the use of existing clustering validity measures. We illustrate their usefulness in evaluating five visualization techniques: Principal Components Analysis (PCA), Sammon’s Mapping, Self-Organizing Map (SOM), Radial Coordinate Visualization and Star Coordinates. The second problem is concerned with evaluating different visualization techniques as to their effectiveness in visual data mining of business data. For this purpose, we propose an inquiry evaluation technique and conduct the evaluation of nine visualization techniques. The visualizations under evaluation are Multiple Line Graphs, Permutation Matrix, Survey Plot, Scatter Plot Matrix, Parallel Coordinates, Treemap, PCA, Sammon’s Mapping and the SOM. The third problem is the evaluation of quality of use of VDM tools. We provide a conceptual framework for evaluating the quality of use of VDM tools and apply it to the evaluation of the SOM. In the evaluation, we use an inquiry technique for which we developed a questionnaire based on the proposed framework.

The contributions of the thesis consist of three new evaluation techniques and the results obtained by applying these evaluation techniques. The thesis provides a systematic approach to evaluation of various visualization techniques. In this respect, first, we performed and described the evaluations in a systematic way, highlighting the evaluation activities, and their inputs and outputs. Secondly, we integrated the evaluation studies in the broad framework of usability evaluation.

The results of the evaluations are intended to help developers and researchers of visualization systems to select appropriate visualization techniques in specific situations. The results of the evaluations also contribute to the understanding of the strengths and limitations of the visualization techniques evaluated and further to the improvement of these techniques.

Files:

Full publication in PDF-format

BibTeX entry:

@PHDTHESIS{phdMarghescu08a,
  title = {Evaluating Multidimensional Visualization Techniques in Data Mining Tasks},
  author = {Marghescu, Dorina},
  number = {107},
  series = {TUCS Dissertations},
  school = {Turku Centre for Computer Science},
  year = {2008},
  keywords = {multidimensional visualization techniques, visual data mining, evaluation of visualization techniques},
  ISBN = {978-952-12-2152-8},
}

Belongs to TUCS Research Unit(s): Data Mining and Knowledge Management Laboratory

Edit publication