Improve performance for 'bitmap' render method #680

almarklein · 2025-03-11T13:27:51Z

I did some experiments, and this is a relatively easy way to improve performance by about 20%, going from ~24 to ~30 fps in a fullscreen window on a 4k monitor on my M1 MacBook.

Tricks applied:

- ~~Re-use the target texture~~ -> let's not, because of memory concerns; it also does not seem to help much.
- Reuse (and share) the buffer needed to copy the texture data to the CPU.
- Avoid a data copy when bytes-per-row is not a multiple of 256.
- ~~Use numpy for copying~~ -> does not seem to help much, and would introduce a new optional dependency.
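For readers unfamiliar with the 256-byte rule: WebGPU requires the bytes-per-row of a texture-to-buffer copy to be a multiple of 256, so rows of most image widths come back padded. The following is a minimal sketch of that arithmetic, not the code in this PR; the function names `aligned_bytes_per_row` and `unpad_rows` are hypothetical, and the 4 bytes per pixel default is an assumption (e.g. an rgba8 format). It writes each row straight into the final tightly packed array, which is one way to avoid an extra intermediate copy of the whole padded buffer.

```python
def aligned_bytes_per_row(width: int, bytes_per_pixel: int = 4, alignment: int = 256) -> int:
    """Round the tight row size up to the next multiple of `alignment`."""
    tight = width * bytes_per_pixel
    return (tight + alignment - 1) // alignment * alignment


def unpad_rows(padded, width: int, height: int, bytes_per_pixel: int = 4) -> bytearray:
    """Copy only the meaningful bytes of each padded row into a packed result.

    `padded` is any bytes-like object (hypothetically, the mapped copy-buffer).
    Each row is sliced from a memoryview, so no intermediate full copy is made.
    """
    view = memoryview(padded)
    tight = width * bytes_per_pixel
    stride = aligned_bytes_per_row(width, bytes_per_pixel)
    out = bytearray(tight * height)
    for row in range(height):
        start = row * stride
        out[row * tight:(row + 1) * tight] = view[start:start + tight]
    return out
```

A 100-pixel-wide rgba8 row is 400 bytes and gets padded to 512, whereas a 3840-pixel-wide row (15360 bytes) is already aligned and needs no unpadding at all.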

We can go much faster, but for that we need to have our async stuff sorted out better.
More details here: pygfx/rendercanvas#40 (comment)

wgpu/_classes.py

hmaarrfk · 2025-03-11T13:29:33Z

Is this "bitmap" method the current path for Wayland + Qt?

almarklein · 2025-03-11T13:31:37Z

> Is this "bitmap" method the current path for Wayland + Qt?

Yes.

almarklein · 2025-03-11T13:44:43Z

mmm ... I'm a bit worried about what this means for memory.

Previously, if you had say 10 canvases, they'd be rendered to one by one, and after each draw, the texture is released, allowing the GPU to re-use that memory for e.g. the texture target for the next canvas. But now all canvases hold onto their texture and a buffer of the same size. 🤔 This easily happens in e.g. a notebook.

almarklein · 2025-03-11T14:18:14Z

Solved this by using a shared copy-buffer. It looks like re-using the texture does not do much; re-using the temporary buffer was what contributed by far the most to the speed-up.
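The idea of a shared copy-buffer can be sketched roughly as follows. This is an illustration only, not the implementation in `wgpu/backends/wgpu_native/_api.py`: the class name `SharedCopyBuffer` is hypothetical, a `bytearray` stands in for a real GPU buffer, and the grow-when-too-small policy is an assumption rather than something stated in this thread.

```python
class SharedCopyBuffer:
    """Hypothetical holder for one staging buffer shared by all canvases."""

    def __init__(self, allocate=bytearray):
        # `allocate` stands in for a real GPU buffer allocation (assumption).
        self._allocate = allocate
        self._buffer = None
        self.allocations = 0  # counts how often a new buffer had to be created

    def get(self, nbytes: int):
        """Return a buffer of at least `nbytes`, reusing the existing one if it fits."""
        if self._buffer is None or len(self._buffer) < nbytes:
            self._buffer = self._allocate(nbytes)
            self.allocations += 1
        return self._buffer


# Ten canvases drawn one by one share a single buffer instead of holding ten.
shared = SharedCopyBuffer()
buffers = [shared.get(1024) for _ in range(10)]
```

Because the canvases are rendered one after another, a single buffer can serve all of them, so the per-frame allocation cost goes away without each canvas holding on to its own memory.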

hmaarrfk · 2025-03-11T14:19:10Z

> re-using the temporary buffer was what contributed by far the most to the speed-up.

Can you point to the one you are referring to in the diff (for my own learning)?

wgpu/backends/wgpu_native/_api.py

almarklein · 2025-03-11T14:42:29Z

This does not seem to clash with #673, except for a change in tests_mem/testutils.py which seems to be exactly the same.

almarklein · 2025-03-12T09:40:48Z

I tested performance on Windows and Linux, and did not observe a difference. That's on a machine with integrated graphics. Will test on a Windows machine with a GPU later today.

edit: tested again, now with the same 4k monitor. The integrated graphics have trouble rendering that many pixels already ... it looks like maybe there is a tiny improvement from 18-ish to 19-ish fps, compared to 35 fps when rendering to screen.

almarklein · 2025-03-12T15:32:22Z

Tested on a laptop with an NVIDIA GPU. I can observe a performance improvement similar to the one on macOS.

Improve performance for 'bitmap' render method

3090cd2

almarklein requested a review from Korijn as a code owner March 11, 2025 13:27

add comment

be2fd30

codegen

4b2e764

almarklein added 2 commits March 11, 2025 15:11

Only store buffer, and share between textures

3b9fc2c

fix memtest

116a0ec

codegen

b28743e

leftover print call

0e97913

almarklein mentioned this pull request Mar 11, 2025

[Qt] multiple window/docking support pygfx/rendercanvas#55

Open

Korijn approved these changes Mar 11, 2025

almarklein enabled auto-merge (squash) March 12, 2025 15:32

almarklein merged commit 1f86cd4 into main Mar 12, 2025
20 checks passed

almarklein deleted the bitmap-permformance branch March 12, 2025 15:32
