Conference
Proceedings Graphics Interface 2019 (GI 2019), 2019, pp. 1-8
APA
Click to copy
Nie, S., Healey, C. G., Chirkova, R. Y., & Reutter, J. L. (2019). Rapid sequence matching for visualization recommender systems. In Proceedings Graphics Interface 2019 (GI 2019) (pp. 1–8).
Chicago/Turabian
Click to copy
Nie, S., C. G. Healey, R. Y. Chirkova, and J. L. Reutter. “Rapid Sequence Matching for Visualization Recommender Systems.” In Proceedings Graphics Interface 2019 (GI 2019), 1–8, 2019.
MLA
Click to copy
Nie, S., et al. “Rapid Sequence Matching for Visualization Recommender Systems.” Proceedings Graphics Interface 2019 (GI 2019), 2019, pp. 1–8.
BibTeX Click to copy
@conference{s2019a,
title = {Rapid sequence matching for visualization recommender systems},
year = {2019},
pages = {1-8},
author = {Nie, S. and Healey, C. G. and Chirkova, R. Y. and Reutter, J. L.},
booktitle = {Proceedings Graphics Interface 2019 (GI 2019)}
}
We present a method to support high quality visualization recommendations for analytic tasks. Visualization converts large datasets into images that allow viewers to efficiently explore, discover, and validate within their data. Visualization recommenders have been proposed that store past sequences: an ordered collection of design choices leading to successful task completion; then match them against an ongoing visualization construction. Based on this matching, a system recommends visualizations that better support the analysts’ tasks. A problem of scalability occurs when many sequences are stored. One solution would be to index the sequence database. However, during matching we require sequences that are similar to the partially constructed visualization, not only those that are identical. We implement a locality sensitive hashing algorithm that converts visualizations into set representations, then uses Jaccard similarity to store similar sequence nodes in common hash buckets. This allows us to match partial sequences against a database containing tens of thousands of full sequences in less than 100ms. Experiments show that our algorithm locates 95% or more of the sequences found in an exhaustive search, producing high-quality visualization recommendations.