VIS30K: A Collection of Figures and Tables from IEEE Visualization Conference Publications

Description:

We present the VIS30K dataset, a collection of 29,689 images that represents 30 years of figures and tables from each track of the IEEE Visualization conference series (Vis, SciVis, InfoVis, VAST). VIS30K’s comprehensive coverage of the scientific literature in visualization not only reflects the progress of the field but also enables researchers to study the evolution of the state-of-the-art and to find relevant work based on graphical content. We describe the dataset and our semi-automatic collection process, which couples convolutional neural networks (CNN) with curation. Extracting figures and tables semi-automatically allows us to verify that no images are overlooked or extracted erroneously. To improve quality further, we engaged in a peer-search process for high-quality figures from early IEEE Visualization papers. With the resulting data, we also contribute VISImageNavigator (VIN, visimagenavigator.github.io), a web-based tool that facilitates searching and exploring VIS30K by author names, paper keywords, title and abstract, and years.

Paper download: (21.4 MB)

Data explorer: https://visimagenavigator.github.io/

The source code for the VISImageNavigator is available on GitHub.

Datasets:

IEEE dataport doi: 10.21227/4hy6-vh52,
meta data,
CNN algorithm training data and validation data,
image corpus and text corpus

Videos:

Presentation at IEEE :

Fast-Forward for presentation at IEEE :

Get the videos:

Main paper reference:

Jian Chen, Meng Ling, Rui Li, Petra Isenberg, Tobias Isenberg, Michael Sedlmair, Torsten Möller, Robert S. Laramee, Han-Wei Shen, Katharina Wünsche, and Qiru Wang (2021) VIS30K: A Collection of Figures and Tables from IEEE Visualization Conference Publications. IEEE Transactions on Visualization and Computer Graphics, 27(9):3826–3833, September 2021.

BibTeX entry:



@ARTICLE{Chen:2021:VCF,
  author      = {Jian Chen and Meng Ling and Rui Li and Petra Isenberg and Tobias Isenberg and Michael Sedlmair and Torsten M{\"o}ller and Robert S. Laramee and Han-Wei Shen and Katharina W{\"u}nsche and Qiru Wang},
  title       = {{VIS30K}: A Collection of Figures and Tables from {IEEE} Visualization Conference Publications},
  journal     = {IEEE Transactions on Visualization and Computer Graphics},
  year        = {2021},
  volume      = {27},
  number      = {9},
  month       = sep,
  pages       = {3826--3833},
  doi         = {10.1109/TVCG.2021.3054916},
  doi_url     = {https://doi.org/10.1109/TVCG.2021.3054916},
  oa_hal_url  = {https://hal.science/hal-03123279},
  preprint    = {https://doi.org/10.48550/arXiv.2101.01036},
  github_url  = {https://github.com/tobiasisenberg/VIS30KLink},
  github_url2 = {https://github.com/VisImageNavigator/VisImageNavigator.Release},
  url         = {https://tobias.isenberg.cc/p/Chen2021VCF},
  url2        = {https://visimagenavigator.github.io/},
  pdf         = {https://tobias.isenberg.cc/personal/papers/Chen_2021_VCF.pdf},
}

Main data reference:

Jian Chen, Meng Ling, Rui Li, Petra Isenberg, Tobias Isenberg, Michael Sedlmair, Torsten Möller, Robert Laramee, Han-Wei Shen, Katharina Wünsche, and Qiru Wang (2020) IEEE VIS Figures and Tables Image Dataset. Dataset and online search, https://visimagenavigator.github.io/, 2020.

BibTeX entry:



@MISC{Chen:2020:VCF,
  author      = {Jian Chen and Meng Ling and Rui Li and Petra Isenberg and Tobias Isenberg and Michael Sedlmair and Torsten M{\"o}ller and Robert Laramee and Han-Wei Shen and Katharina W{\"u}nsche and Qiru Wang},
  title       = {{IEEE} {VIS} Figures and Tables Image Dataset},
  howpublished= {Dataset and online search, https://visimagenavigator.github.io/},
  year        = {2020},
  doi         = {10.21227/4hy6-vh52},
  doi_url     = {https://doi.org/10.21227/4hy6-vh52},
  oa_gold_url = {https://doi.org/10.21227/4hy6-vh52},
  url         = {https://tobias.isenberg.cc/p/Chen2021VCF},
  url2        = {https://visimagenavigator.github.io/},
}