Rapid sequence matching for visualization recommender systems


Conference


S. Nie, C. G. Healey, R. Y. Chirkova, J. L. Reutter
Proceedings Graphics Interface 2019 (GI 2019), 2019, pp. 1-8

View PDF Semantic Scholar DOI DBLP
Cite

Cite

APA   Click to copy
Nie, S., Healey, C. G., Chirkova, R. Y., & Reutter, J. L. (2019). Rapid sequence matching for visualization recommender systems. In Proceedings Graphics Interface 2019 (GI 2019) (pp. 1–8).


Chicago/Turabian   Click to copy
Nie, S., C. G. Healey, R. Y. Chirkova, and J. L. Reutter. “Rapid Sequence Matching for Visualization Recommender Systems.” In Proceedings Graphics Interface 2019 (GI 2019), 1–8, 2019.


MLA   Click to copy
Nie, S., et al. “Rapid Sequence Matching for Visualization Recommender Systems.” Proceedings Graphics Interface 2019 (GI 2019), 2019, pp. 1–8.


BibTeX   Click to copy

@conference{s2019a,
  title = {Rapid sequence matching for visualization recommender systems},
  year = {2019},
  pages = {1-8},
  author = {Nie, S. and Healey, C. G. and Chirkova, R. Y. and Reutter, J. L.},
  booktitle = {Proceedings Graphics Interface 2019 (GI 2019)}
}

Abstract

We present a method to support high quality visualization recommendations for analytic tasks. Visualization converts large datasets into images that allow viewers to efficiently explore, discover, and validate within their data. Visualization recommenders have been proposed that store past sequences: an ordered collection of design choices leading to successful task completion; then match them against an ongoing visualization construction. Based on this matching, a system recommends visualizations that better support the analysts’ tasks. A problem of scalability occurs when many sequences are stored. One solution would be to index the sequence database. However, during matching we require sequences that are similar to the partially constructed visualization, not only those that are identical. We implement a locality sensitive hashing algorithm that converts visualizations into set representations, then uses Jaccard similarity to store similar sequence nodes in common hash buckets. This allows us to match partial sequences against a database containing tens of thousands of full sequences in less than 100ms. Experiments show that our algorithm locates 95% or more of the sequences found in an exhaustive search, producing high-quality visualization recommendations.