File(s) under embargo

9

month(s)

7

day(s)

until file(s) become available

Analysis of Visual Encodings Effectiveness for Multivariate Data Similarity Identification

thesis
posted on 01.05.2020, 00:00 by Mirko Mantovani
Similarity detection seeks to identify items which resemble other items without being identical to them, sometimes over relatively large collections of multivariate items. Oftentimes, similarity cannot be defined computationally over a dataset, leading to a need for visual analysis. Such situations arise commonly in the analysis of ensemble simulations, of multiple computational models, of patient data repositories, or of geospatial data. In this research, we examine, in the context of similarity detection, the effectiveness of several visual encodings for multivariate data. We conducted a user study with 40 participants to measure similarity detection accuracy and response time under two conditions: moderate-scale (16 items) and large-scale (36 items). Our statistical analysis shows that there are significant differences in encoding performance, especially in the large-scale setting of the experiment. In all settings, we found that plain parallel coordinate plots are slower to read and lead to lower accuracy than juxtaposed star glyph approaches. When the number of items grows, the contour star plot (Kiviat diagram) outperforms other variations, including data lines star plots, and is therefore suitable for similarity identification when dealing with relatively large multivariate datasets.

History

Advisor

Marai, Elisabeta Georgeta

Chair

Marai, Elisabeta Georgeta

Department

Computer Science

Degree Grantor

University of Illinois at Chicago

Degree Level

Masters

Degree name

MS, Master of Science

Committee Member

Johnson, Andrew Edward Lanzi, Pier Luca

Submitted date

May 2020

Thesis type

application/pdf

Language

en