@parkerziegler/reviz

v0.2.0

Published

3 years ago

An engine for reverse engineering data visualizations from the DOM.

Downloads

0High
0Medium
0Low

parkerziegler

`reviz`

reviz is a lightweight engine for reverse engineering data visualizations from the DOM. Its core goal is to assist in rapid visualization sketching and prototyping by automatically generating partial programs written using Observable Plot from input svg subtrees.

Installation

yarn add @parkerziegler/reviz

# If using npm:
npm install --save @parkerziegler/reviz

API

The reviz API is very small; in fact, it consists of only a single function, analyzeVisualization!

import { analyzeVisualization } from `@parkerziegler/reviz`;

const viz = document.querySelector('#my-viz');

const { spec, program } = analyzeVisualization(viz);

`analyzeVisualization`

export declare const analyzeVisualization: (root: SVGElement) => {
  spec: VizSpec;
  program: string;
};

analyzeVisualization is a function that takes in an SVGElement as input and returns an Object containing two properties, spec and program.

spec refers to the intermediate representation used by reviz to generate partial Observable Plot programs. It encodes semantic information about the input svg subtree, including its inferred visualization type, geometric attributes of its marks (currently either circle or rect elements), and presentational attributes of its marks. reviz's architecture mimics that of a traditional compiler, with spec representing the IR. It can be useful to inspect spec to see whether or not reviz has inferred the correct visualization type for your input svg subtree.

program refers to the partial Observable Plot program that reviz generates. These programs are intentionally incomplete and contain "holes" represented by the string '??'. The presence of a hole indicates that the value for a particular attribute (e.g. the r attribute of a bubble chart or the fill attribute of a stacked bar chart) should be mapped to a column in a user's input dataset rather than kept static across all data elements. After filling in holes with column names from your input dataset, you'll have a complete visualization program ready to run in the browser!

An Example

Let's look at an example to see how reviz works in practice. We'll use this visualization from the New York Times:

If we point reviz at the root svg Node of this visualization, it generates the following (partial) program:

const plot = Plot.plot({
  color: {
    scale: 'ordinal',
    range: ['#C67371', '#ccc', '#709DDE', '#A7B9D3', '#C23734'],
  },
  marks: [
    Plot.dot(data, {
      fill: '??',
      stroke: '??',
      fillOpacity: 0.8,
      strokeOpacity: 1,
      'stroke-width': '1px',
      x: '??',
      y: '??',
      r: 7,
    }),
  ],
});

Notice that fill, stroke, x and y are all inferred to be holes (indicated by'??') that must be mapped to columns of an input dataset. Conversely, attributes like fill-opacity and stroke-width are automatically inferred because they are found to be consistent across all mark elements. We can also see that reviz has inferred that the visualization is using an ordinal color scale and automatically configures the scale for us.

We can now apply this partial program to a new dataset. Let's use this delightful dataset about penguins from the Observable Plot docs. We can choose input columns from this dataset to "fill in" the holes like so:

const plot = Plot.plot({
  color: {
    scale: 'ordinal',
    range: ['#C67371', '#ccc', '#709DDE', '#A7B9D3', '#C23734'],
  },
  marks: [
    Plot.dot(data, {
-     fill: '??',
+     fill: 'island',
-     stroke: '??',
+     stroke: 'island',
      fillOpacity: 0.8,
      strokeOpacity: 1,
      'stroke-width': '1px',
-     x: '??',
+     x: 'flipper_length_mm',
-     y: '??',
+     y: 'body_mass_g',
      r: 7,
    }),
  ],
});

The result that we get is a new visualization that takes the appearance of the original New York Times piece and applies it to our data.

In this way, reviz allows end users to quickly experiment with seeing their data in the form of a visualization they encounter anywhere in the wild.

To see more examples of the partial programs reviz generates, check out our example site. To go into depth into how reviz works, consider reading our paper.

Supported Visualization Types

reviz is restricted to only work on a small subset of visualization types. We hope to extend reviz to include more visualization types in the future.

| Visualization Type | Description | | ------------------------------------------------------------------ | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | | Bar Chart | Old trusty. The bar chart represents data values using the height of each rect mark. The data values mapped to the x-axis must be discrete, not continuous. | | Bubble Chart | The bubble chart is similar to the scatterplot, with the radius of each circle mark mapped to the square root of a data value. | | Histogram | Similar to a bar chart, but the data values mapped to the x-axis must be continuous, not discrete. Histograms are typically used to visualize distributions in a dataset. | | Scatterplot | The scatterplot places circle marks in an x-y coordinate plane, often to show a correlation between two variables. | | Stacked Bar Chart | A dressed up version of the bar chart in which subcategories of data can be compared across groups. | | Strip Plot | Many rows of circle marks are placed on the same continous scale to visualize distributions in a dataset. |

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

reviz

Installation

API

analyzeVisualization

An Example

Supported Visualization Types

`reviz`

`analyzeVisualization`