
raster mark #1196

Merged: mbostock merged 137 commits into main from mbostock/image-data on Jan 11, 2023

Conversation

@mbostock (Member) commented Dec 21, 2022

This is an alternative to the pixel mark #1185; the raster mark takes a set of discrete {x, y, fill} samples and produces a corresponding raster grid (image) using putImageData. The canvas is then converted to a URL for use as an svg:image, which is positioned using {x1, y1, x2, y2} abstract coordinates likewise bound to the x and y scales.
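For illustration, a minimal sketch of that form (the sample data and field names below are made up for this sketch; the option names follow the examples later in this thread):

// A minimal sketch of the form described above: discrete {x, y, fill} samples
// rasterized into a width × height grid spanning the abstract extent
// [x1, x2] × [y1, y2]. The sample data is illustrative only.
const samples = [
  {x: 0.5, y: 0.5, v: 1},
  {x: 1.5, y: 0.5, v: 2},
  {x: 0.5, y: 1.5, v: 3},
  {x: 1.5, y: 1.5, v: 4}
];

Plot.plot({
  marks: [
    Plot.raster(samples, {
      x: "x", // sample centroids in abstract coordinates
      y: "y",
      fill: "v",
      width: 2, // raster grid resolution, in grid pixels
      height: 2,
      x1: 0, y1: 0, x2: 2, y2: 2 // abstract extent of the grid
    }),
    Plot.frame()
  ]
})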

@mbostock mbostock requested a review from Fil December 21, 2022 21:44
@mbostock (Member, Author)

I’m not sure why CI is failing; presumably it’s a non-visible difference in the result of canvas.toDataURL. 😞

@Fil (Contributor) commented Dec 22, 2022

I'm not sure this mark works for me: I don't see the use case (besides copying a raster with the same dimensions?), and I don't see how it fits with the general approach of Plot's scales and so on. To help clarify what we want to cover, I've started a list of use cases in this notebook: https://observablehq.com/@observablehq/pixel-or-imagedata--dev

Two unrelated remarks:

  • Re: opacity, I think we might have to compose it with the A of the rgbA color channel? (See the sketch after this list.)
  • I still want to pass the facet information (the actual value of fx, fy) somewhere in the call to mark.render (render API #501); this would help make this mark facet-dependent. I'm using "facet-dependent" and not "faceted", in the sense that they would not have a partition index, but could act differently depending on the facet they're drawing.
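
Re the first point, a hypothetical sketch of what that composition could look like when writing RGBA bytes into the ImageData buffer (the function and its parameters are illustrative, not the actual implementation):

// Hypothetical sketch: compose the color's own alpha (the "A" of rgbA) with the
// fillOpacity channel when writing RGBA bytes into an ImageData buffer.
// writePixel and its parameters are illustrative names, not the PR's code.
function writePixel(data, k, color, fillOpacity = 1) {
  const {r, g, b, opacity = 1} = d3.rgb(color); // d3.rgb exposes alpha as "opacity"
  data[k + 0] = r;
  data[k + 1] = g;
  data[k + 2] = b;
  data[k + 3] = 255 * opacity * fillOpacity; // color alpha × fillOpacity
}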

@mbostock mbostock changed the title image data mark raster mark Dec 23, 2022
@mbostock (Member, Author)

Okay, here’s my third attempt… I hope you like it better. 🙏 It feels very convenient for the continuous f(x, y) case, and supports all (continuous) scale types. And it’s still convenient for the simple case of rendering an existing raster grid (e.g., volcano) with linear scales. I would like to implement a contour mark with the same type of x/y/value specification… but I’m not sure how much work I’ll do over Christmas.
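
For instance, a minimal sketch of that continuous case (assuming, per the discussion below, that data may be omitted when fill is given as a function of x and y; the sampled function itself is made up):

// A minimal sketch of the continuous f(x, y) case: fill is a function of the
// abstract x and y coordinates, sampled over the extent [x1, x2] × [y1, y2].
Plot.plot({
  color: {type: "diverging"},
  marks: [
    Plot.raster({
      x1: -3, x2: 3,
      y1: -3, y2: 3,
      fill: (x, y) => Math.sin(x) * Math.cos(y)
    }),
    Plot.frame()
  ]
})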

(Review thread on a code excerpt from one of the test plots:)

type: "diverging"
},
marks: [
Plot.raster(d3.range(width * height), {
@Fil (Contributor):

would this be easier if the signature for fill was f({x,y}, i)?

@mbostock (Member, Author):

I don’t think this test needs to be “easy”—the easy form is shown above, and this test isn’t supposed to be representative of recommended usage. The point here is to test the explicit form where fill is a function of data.

@mbostock added the "enhancement" (New feature or request) label Dec 30, 2022
@Fil (Contributor) left a review:

I really like where this is going. As mentioned in the comments, there's more we could do (in terms of performance, compatibility with opacity, faceting, projections…), but those could be done in follow-ups.

Note that Safari currently doesn't support image-rendering: pixelated on svg images, which makes it a bit awkward for us. Should we expect WebKit to fix itself, or look for a workaround (foreignObject+img)?

I'm adding a bit of documentation. I must say that I'm still struggling to understand/explain how the x and y channels work in conjunction with width, x1 and x2 (?). I've been trying as an exercise to create the image of the volcano with x, y, fill as channels, but not very successfully (seems like I need to specify all of x1, x2, width…?).

if (xi < 0 || xi >= width) continue;
const yi = Math.floor((Y[i] - y2) * ky);
if (yi < 0 || yi >= height) continue;
const {r, g, b} = rgb(F[i]);
@Fil (Contributor) suggested change:

-const {r, g, b} = rgb(F[i]);
+// TODO: memoize for performance? We'll usually have a maximum of 128 different shades, but 100x more pixels.
+const {r, g, b} = rgb(F[i]);

@mbostock (Member, Author) replied Jan 3, 2023:

I think it would be better to do the optimization earlier when the scale is applied, like how Yuri does here:

// Convenience function to create a cached color interpolator that
// returns cached rgb objects, avoiding color string parsing.
cacheInterpolator = (interpolator, n = 250) =>
  d3.scaleQuantize(d3.quantize(pc => d3.rgb(interpolator(pc)), n))

https://observablehq.com/@twitter/density-plot@4159

Like maybe there’s some hint that the mark wants {r, g, b} objects instead of color strings and can instruct Plot to materialize those efficiently when applying the color scale. But in any case I think we should do optimizations as follow-up, as you say.
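
For context, a hypothetical usage of that cached interpolator (the names come from Yuri's notebook quoted above, not from this PR):

// Hypothetical usage of the cacheInterpolator helper quoted above: lookups
// return pre-parsed {r, g, b} objects rather than color strings, so a hot
// per-pixel loop avoids color-string parsing entirely.
const color = cacheInterpolator(d3.interpolateTurbo);
const {r, g, b} = color(0.42); // one of ~250 cached d3.rgb objects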

@mbostock (Member, Author) commented Jan 2, 2023

> I've been trying as an exercise to create the image of the volcano with x, y, fill as channels, but not very successfully (seems like I need to specify all of x1, x2, width…?).

Yes, you must specify all those options. Like this:

(screenshot: the rendered volcano raster)

Plot.plot({
  marks: [
    Plot.raster(volcano.values, {
      width: volcano.width,
      height: volcano.height,
      x1: 0,
      y1: 0,
      x2: volcano.width,
      y2: volcano.height,
      x: (_, i) => (i % volcano.width) + 0.5,
      y: (_, i) => Math.floor(i / volcano.width) + 0.5,
      fill: (d) => d
    }),
    Plot.frame()
  ]
})

Alternatively, if you want the samples at integer locations, you have to offset the bounds by 0.5:

(screenshot: the rendered volcano raster, with samples at integer locations)

Plot.plot({
  marks: [
    Plot.raster(volcano.values, {
      width: volcano.width,
      height: volcano.height,
      x1: -0.5,
      y1: -0.5,
      x2: volcano.width - 0.5,
      y2: volcano.height - 0.5,
      x: (_, i) => (i % volcano.width),
      y: (_, i) => Math.floor(i / volcano.width),
      fill: (d) => d
    }),
    Plot.frame()
  ]
})

The x and y channels specify the centroids of the pixels, so you need x1, y1, x2, and y2 to specify the extent of the pixel grid; otherwise the raster mark cannot know how wide a pixel is in abstract coordinates. And you need to specify width and height because this is the size of the raster grid (in grid coordinates, i.e., pixel indexes) which might be a different resolution than the samples.

@mbostock (Member, Author) commented Jan 2, 2023

If you would prefer that the samples are taken at integer locations by default (rather than offset by 0.5) then we could apply this patch:

% git diff -p
diff --git a/src/marks/raster.js b/src/marks/raster.js
index 71a0af1c..52864d10 100644
--- a/src/marks/raster.js
+++ b/src/marks/raster.js
@@ -19,10 +19,10 @@ export class Raster extends Mark {
       // If X and Y are not given, we assume that F is a dense array of samples
       // covering the entire grid in row-major order. These defaults allow
       // further shorthand where x and y represent grid column and row index.
-      x1 = x == null ? 0 : undefined,
-      y1 = y == null ? 0 : undefined,
-      x2 = x == null ? width : undefined,
-      y2 = y == null ? height : undefined,
+      x1 = x == null ? -0.5 : undefined,
+      y1 = y == null ? -0.5 : undefined,
+      x2 = x == null ? width - 0.5 : undefined,
+      y2 = y == null ? height - 0.5 : undefined,
       imageRendering,
       pixelRatio = 1,
       fill,
@@ -156,7 +156,6 @@ function sampleFill({fill, fillOpacity, pixelRatio = 1, ...options} = {}) {
     if (h === undefined) h = Math.round(Math.abs(y2 - y1) / pixelRatio);
     const kx = (x2 - x1) / w;
     const ky = (y1 - y2) / h;
-    (x1 += kx / 2), (y2 += ky / 2);
     let F, FO;
     if (fill) {
       F = new Array(w * h);

Do you have a preference? I think it looks better to have the grid start and end on integer boundaries.

@Fil (Contributor) commented Jan 2, 2023

Integer boundaries look better, I agree.

My difficulty is in precisely articulating how the various options must be combined, or on the contrary are "not compatible". For example, if you try to replace width: volcano.width with a "finger in the air" pixelRatio in the example above, you will get banding (unless pixelRatio=580 / 87). To document this, we need to mention that there is rounding, and that the pixelRatio is imputed from the width of the frame divided by the width of the data.
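
To make that relation concrete, a rough sketch of the arithmetic (the 580px frame width is an assumption; the rounding follows the patch earlier in this thread):

// Rough arithmetic behind "pixelRatio = 580 / 87": the grid width is derived by
// rounding frameWidth / pixelRatio, so this pixelRatio yields exactly one raster
// column per volcano column and hence no banding.
const frameWidth = 580; // assumed frame width in screen pixels
const pixelRatio = 580 / 87; // ≈ 6.67
const gridWidth = Math.round(frameWidth / pixelRatio); // 87 = volcano.width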

@mbostock (Member, Author) commented Jan 2, 2023

> My difficulty is in precisely articulating how the various options must be combined, or on the contrary are "not compatible".

It’s best to think about this as multiple coordinate systems.

First, there is a discrete set of samples in abstract coordinates x and y, with fill or fillOpacity (or both). These correspond to the raster mark’s data. (The samples are imputed when fill or fillOpacity is specified as a function and data is null.) These samples are typically on an axis-aligned grid, but not necessarily so; arbitrary sample positions will be more useful in the future if and when the raster mark supports different methods of interpolation.

Second, there is a raster grid (a.k.a. canvas) with its own pixel coordinates. The aforementioned samples are mapped to a canvas that is width pixels by height pixels, with the origin [0, 0] in the top-left corner. The extent of the canvas in pixel coordinates [0, 0, width, height] corresponds to an abstract extent [x1, y1, x2, y2] in the same abstract coordinate system as the samples. (In some cases x1 and x2 can be flipped, and likewise y1 and y2.) During rendering, the raster mark assigns (bins using Math.floor) each sample in x and y to a pixel.

Lastly, there are screen coordinates (really Plot frame coordinates, in the range of the x and y scales). These are needed to place the svg:image in the correct position within the Plot frame.
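
To make the second mapping concrete, a rough sketch of the binning step (not the exact source; the function and its conversion factors are defined here for the sketch):

// Rough sketch of assigning an abstract sample (x, y) to a raster pixel.
// y1 and y2 are swapped because the canvas y-axis grows downward.
function sampleToPixel(x, y, {x1, y1, x2, y2, width, height}) {
  const kx = width / (x2 - x1);
  const ky = height / (y1 - y2);
  const xi = Math.floor((x - x1) * kx); // column index in the raster grid
  const yi = Math.floor((y - y2) * ky); // row index
  return xi >= 0 && xi < width && yi >= 0 && yi < height ? [xi, yi] : null; // null = outside the grid
}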

If you specify an incorrect width in the volcano example, then you expect to see banding because you no longer have exactly one sample per pixel in the raster grid. For example, here three columns are missing samples:

(screenshot: the volcano raster with three columns missing samples)

Plot.plot({
  marks: [
    Plot.raster(volcano.values, {
      width: volcano.width + 3,
      height: volcano.height,
      x1: 0,
      y1: 0,
      x2: volcano.width,
      y2: volcano.height,
      x: (_, i) => (i % volcano.width) + 0.5,
      y: (_, i) => Math.floor(i / volcano.width) + 0.5,
      fill: (d) => d
    }),
    Plot.frame()
  ]
})

(They should really be transparent though… I think that’s a regression I introduced when adding support for fillOpacity. Fixed.) If and when the raster mark supports better interpolation—something smarter than just binning the samples into rectangular pixels—then we could e.g. fill those gaps with the closest sample, or a blend of nearby samples.

@mbostock mbostock marked this pull request as ready for review January 2, 2023 22:04
@mbostock (Member, Author)

Last step here is to figure out faceting, or at least document that it doesn’t work yet.

@mbostock mbostock requested a review from Fil January 11, 2023 01:12
@mbostock (Member, Author)

@Fil The latest changes are in 1e22d3d...a745cc3; documentation aside, this should be ready to go! 🚢

@Fil (Contributor) commented Jan 11, 2023

I'm tempted to say "ship", but this morning I tried to plug in https://observablehq.com/@jobleonard/pseudo-blue-noise instead of randomLcg(42), and the improvement in quality is really remarkable:

  • walk-on-spheres (before, after)
  • barycentric (before, after)
  • contours with random walk (before, after)

@mbostock (Member, Author)

The pseudo blue noise looks nice but I would like to focus on documenting and releasing this and returning to the axis mark. I’d be willing to expose the built-in spatial interpolation methods as functions so that you can pass in a custom RNG.

@mbostock (Member, Author)

The latest two commits expose the built-in spatial interpolation methods:

  • Plot.interpolateNone
  • Plot.interpolateNearest
  • Plot.interpolatorBarycentric({random})
  • Plot.interpolatorRandomWalk({random, minDistance, maxSteps})

So you can now provide your own RNG, e.g. one based on pseudo blue noise.
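
For example, a sketch of plugging a custom generator into one of these (the data, channel names, and myBlueNoise generator are placeholders; the interpolate option name follows the raster documentation added in this PR):

// Sketch: replace the default seeded RNG mentioned above (randomLcg(42)) with a
// custom generator. myBlueNoise is a placeholder for a pseudo-blue-noise RNG
// returning numbers in [0, 1); data, "x", "y" and "value" are illustrative.
Plot.raster(data, {
  x: "x",
  y: "y",
  fill: "value",
  interpolate: Plot.interpolatorRandomWalk({random: myBlueNoise})
})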

@mbostock mbostock merged commit 44b4a1d into main Jan 11, 2023
@mbostock mbostock deleted the mbostock/image-data branch January 11, 2023 16:53
chaichontat pushed a commit to chaichontat/plot that referenced this pull request on Jan 14, 2024. The commit message, listing this pull request's commits, follows:
* image data mark

* PreTtiER

* handle invalid data; stride, offset

* handle flipped images

* archive test failure artifacts

* skip image data tests, for now

* PreTtiER

* only ignore generated images in CI

* only ignore large generated images

* fillOpacity

* tweak

* fix formula

* PreTtiER

* volcano

* more idiomatic heatmap

* fill as f(x, y)

* pixel midpoints

* PreTtiER

* not pixelated, again

* PreTtiER

* raster

* pixelRatio

* fix aria-label; comments

* Goldstein–Price

* tentative documentation for Plot.raster

* fix partial coverage of sample fill

* raster fillOpacity

* require x1, y1, x2, y2

* validate width, height

* fix for sparse samples

* better error on missing scales

* document

* floor rounded (or floored?)

* exploration for a "nearest" raster interpolate method

* barycentric interpolation
see https://observablehq.com/@visionscarto/igrf-90

* raster tuple shorthand

* barycentric interpolate and extrapolate

* only maybeTuple if isTuples

* allow marks to apply scales selectively (like we do with projections)

* interpolate on values

* 3 interpolation methods for the nearest neighbor: voronoi renderCell, quadtree.find, delaunay.find. This is completely gratuitous since they all run in less than 1ms… It's even hard to know which one is the fastest, because if I loop on 100s of them the browser starts to thrash (allocating so much memory for images it immediately discards, I guess…)

* barycentric walmart

* fold mark.project into mark.scale

* fix barycentric extrapolation

* materialize fewer arrays

* use channel names

* don’t pass {r, g, b, a}

* don’t overload x & y channels

* fix inverted x or y; simplify example

* simpler

* fix grid orientation

* only stroke if opaque

* optional x1, y1, x2, y2

* shorten

* fix order

* const

* rasterize

* The performance measurements I had done were just rubbish (I forgot to await on the promises!).
Measuring the three methods on the ca55 dataset I see this order: voronoi renderCell (180ms), delaunay.find (220ms), quadtree.find (500ms).

* rasterize

* tolerance for points that are on a triangle's edge

* use a symbol for values that need extrapolation, simplify and fix a few issues, use a mixing function for categorical interpolation

* rasterize with walk on spheres

* document rasterize

* pixelSize

* default to full frame

* remove ignored options

* reformat options

* fix the ca55 tests (the coordinates represent a planar projection)

* caveat about webkit/safari

* remove console.log

* more built-in rasterizers

* fix walk-on-spheres implementation; remove blur

* port fixes to wos

* adaptive extrapolation

* fillOpacity fixes

* renames walk-on-spheres to random-walk; documents the rasterize option

rationale for the renaming: "random-walk" is more commonly known, and expresses well enough what's happening. Walk on spheres converges much faster than a basic random walk would, and makes it feasible, but it is a question of implementation.

* a constant fillOpacity informs the opacity property on the g element, not the opacity of each pixel

* fix bug with projection clip in indirectStyles

* performance optimizations for random-walk:
1. use rasterizeNull to bootstrap; if we have more samples (and a costlier delaunay), at least we have fewer pixels to impute.
2. cache the result of delaunay.find more aggressively: at the beginning of each line, for each pixel, and for each step of the walk.
In actual tests it can be up to 2x faster.

* sample pixel centroids

* fix handling of undefined values

* use transform for equirectangular coordinates

* don’t delete

* stroke if constant fillOpacity

* fix test snapshots

* fix typo in test name

* note potential bias caused by stroke

* rename tests

* don’t bootstrap random-walk with none

* terminate walk when minimum distance is reached

* comment re. opacity

* comment re. none order bias

* contour mark

* dense grid contours

* consolidate code

* more code consolidation

* cleaner

* cleaner deferred channels

* interpolate, not rasterize

* blur

* cleaner

* use typed array when possible

* optimize barycentric interpolation

* nicer contours for ca55 with barycentric+blur 3; support raster blur

Contour blurring is unchanged, and blurs the abstract data (with a linear interpolation).
Raster blurring is made with d3.blurImage. Two consequences:
* we can now blur “categorical” colors, if we want to smooth out the image and give it a polished look in the higher variance regions. (This works very well when we have two colors, but with more categories there is a risk of hiding the components of a color, making the image more difficult to understand. Anyway, it’s available as an option to play with.)
* for quantitative data, and with a color scale with continuous scheme and linear transform, this is very close to linear interpolation; but if the underlying data is better rendered with a log color scale, the color interpolation takes this into account (which IMO is better).

* ignore negative blur

* cleaner tests

* for contours, filter points with missing X and Y before calling the interpolate function, and ignore x and y filters on geometries

* fix barycentric interpolate for filtered points

note: the penguins dataset is full of surprises since some points are occluded by others of a different species…

* contour shorthands

* fix contour filtering

* filter value, too

* materialize x and y when needed

* default to nearest

* comment

* remove obsolete opacity trick

* better contour thresholds; fix test

* nullish instead of undefined

* renderBounds

* fix circular import

* a hand-written Peters projection seemed more fun than the sqrt scale; tests the same thing

* update raster documentation with interpolate; document contour

* document Plot.identity

* peters axes

* symmetric Peters

* style tweak

* NaN instead of null

* avoid error when empty quantile domain

* faceted sampler raster

* fix test snapshot

* faceted contour; fix dense faceted raster

… and fix default contour thresholds

* expose spatial interpolators

* pass x, y, step

* error when data undefined, but not null

* d3 7.8.1

Co-authored-by: Philippe Rivière <[email protected]>