[d3d9] Optimize SWVP devices #4274

K0bin · 2024-09-18T22:19:29Z

Needs lots of testing.

This makes D3D9 devices that are configured to always use software vertex processing (so not MIXED) always use the late per-draw buffer upload path. We copy the vertex data that each specific draw accesses to a temporary buffer and render from that, similar to UP draws.
This makes sense because games that use pure SWVP expect vertex processing to be synchronous, which has led to both bugs and performance problems. For example, we used to run into issues when respecting NOOVERWRITE, or had dozens or even hundreds of queue syncs per frame. Considering that SWVP is supposed to run on the CPU, the number of vertices is hopefully small.
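
To illustrate the idea of a per-draw upload, here is a minimal sketch, not the code from this PR. It assumes a draw is described by a first vertex, a vertex count and a stride; `DrawRange` and `CopyDrawRange` are hypothetical names made up for this example.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical description of the vertex range a single draw reads.
struct DrawRange {
  uint32_t firstVertex;
  uint32_t vertexCount;
  uint32_t stride; // bytes per vertex
};

// Copies only the bytes the draw accesses into a temporary buffer.
// Returns an empty vector if the range falls outside the source data.
std::vector<uint8_t> CopyDrawRange(const std::vector<uint8_t>& source,
                                   const DrawRange& draw) {
  const size_t offset = size_t(draw.firstVertex) * draw.stride;
  const size_t length = size_t(draw.vertexCount) * draw.stride;

  // Reject ranges that would read past the end of the source buffer.
  if (offset > source.size() || length > source.size() - offset)
    return {};

  std::vector<uint8_t> temp(length);
  if (length != 0)
    std::memcpy(temp.data(), source.data() + offset, length);
  return temp;
}
```

The temporary copy is what would then be bound for the draw, so the size of the upload scales with the draw rather than with the whole buffer.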

I hope this won't impact more modern or demanding games.

The game that inspired this was Phantasmat, from this comment:
#4263 (comment)

It uses a single 96,000-byte vertex buffer (POOL = DEFAULT, USAGE = WRITEONLY, FVF != 0) and writes data to it before every single draw. Of course it also doesn't specify a lock range, so we end up uploading the entire 96 KB buffer over and over again, run out of staging memory, and then stall. It is a 2D game, so with this PR we upload only 4 vertices for every draw.
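
For a rough sense of scale, the arithmetic can be sketched as below. Only the 96,000-byte buffer size and the 4 vertices per draw come from the description above. The draw count (500 per frame) and the vertex stride (24 bytes) are assumptions picked purely for illustration, and the function names are hypothetical.

```cpp
#include <cstdint>

// Buffer size taken from the description above.
constexpr uint64_t kBufferSize = 96000;

// Bytes uploaded per frame when every draw re-uploads the whole buffer.
constexpr uint64_t FullUploadBytes(uint64_t drawCount) {
  return drawCount * kBufferSize;
}

// Bytes uploaded per frame when every draw uploads only its own vertices.
constexpr uint64_t PerDrawUploadBytes(uint64_t drawCount,
                                      uint64_t verticesPerDraw,
                                      uint64_t stride) {
  return drawCount * verticesPerDraw * stride;
}

// Assumed workload: 500 draws per frame, 4 vertices per draw, 24-byte vertices.
static_assert(FullUploadBytes(500) == 48000000, "48 MB per frame");
static_assert(PerDrawUploadBytes(500, 4, 24) == 48000, "48 KB per frame");
```

Under those assumed numbers the per-draw path moves a thousandth of the data per frame, which is why the staging memory no longer runs out.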

K0bin · 2024-09-18T23:30:48Z

cc @WinterSnowfall

WinterSnowfall · 2024-09-19T06:49:46Z

I can of course throw this at a bunch of games, but I think it would be very helpful (since enhancing the HUD is a trend now) to also add the type of vertex processing (based on the device type, and on m_isSWVP in the case of Mixed) as an element of the D3D9 HUD.

The use of D3DCREATE_SOFTWARE_VERTEXPROCESSING devices is AFAIK very limited even in d3d8 and generally only used as a fallback in case HW or Mixed modes fail. A very limited set of games let you pick which to use.
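
As a sketch of what such a HUD label could be derived from, assuming the device keeps its creation behavior flags and a boolean like m_isSWVP around: the function name and label strings below are hypothetical, the constants mirror the D3DCREATE_* values from d3d9.h, and the HUD item actually added in this PR may look different.

```cpp
#include <cstdint>
#include <string>

// Local copies of the D3DCREATE_* vertex processing flags from d3d9.h,
// redefined here so the sketch compiles without the Windows SDK.
constexpr uint32_t kCreateSoftwareVP = 0x00000020;
constexpr uint32_t kCreateHardwareVP = 0x00000040;
constexpr uint32_t kCreateMixedVP    = 0x00000080;

// Hypothetical helper: picks a HUD label from the device's behavior flags.
// isSWVP only matters for Mixed devices, which can switch at runtime.
std::string VertexProcessingLabel(uint32_t behaviorFlags, bool isSWVP) {
  if (behaviorFlags & kCreateSoftwareVP)
    return "Software";
  if (behaviorFlags & kCreateMixedVP)
    return isSWVP ? "Mixed (Software)" : "Mixed (Hardware)";
  if (behaviorFlags & kCreateHardwareVP)
    return "Hardware";
  return "Unknown";
}
```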

src/d3d9/d3d9_hud.cpp

WinterSnowfall · 2024-09-19T16:44:00Z

This PR also properly fixes AlpyneDreams#179, which we had more or less given up on in d8vk. The Supreme Ruler d3d8 games can now be played with correct text rendering even without the "Nvidia driver workaround" configuration option (which affected performance very negatively).

K0bin · 2024-09-19T20:04:36Z

Now you know what is a problem, as NINE have SVPs optimized.

It has nothing to do with that.

src/d3d9/d3d9_common_buffer.cpp

K0bin added the d3d9 label Sep 18, 2024

This was referenced Sep 18, 2024

[d3d8] Old EverQuest client (takproject.net) still needs dgVoodoo #4263

Closed

[d3d9] Optimize late buffer uploads with dirty bitmasks #4275

Merged

K0bin force-pushed the swvp-opt branch from 256957e to 88b035f Compare September 18, 2024 23:29

K0bin force-pushed the swvp-opt branch from 88b035f to 44fb07d Compare September 18, 2024 23:59

K0bin force-pushed the swvp-opt branch 2 times, most recently from b06b8fe to ccaf5be Compare September 19, 2024 14:19

K0bin requested review from misyltoad and doitsujin September 19, 2024 14:40

K0bin force-pushed the swvp-opt branch from 491e38a to 8b16639 Compare September 19, 2024 15:46

WinterSnowfall reviewed Sep 19, 2024

src/d3d9/d3d9_hud.cpp (resolved)

K0bin force-pushed the swvp-opt branch 2 times, most recently from bbba13c to 88c6e82 Compare September 20, 2024 11:23

K0bin changed the title ~~[d3d9] Optimize pure SWVP devices~~ [d3d9] Optimize SWVP devices Sep 20, 2024

misyltoad reviewed Sep 20, 2024

src/d3d9/d3d9_common_buffer.cpp (resolved)

K0bin force-pushed the swvp-opt branch from 88c6e82 to 9f6491d Compare September 20, 2024 11:37

WinterSnowfall suggested changes Sep 20, 2024

src/d3d9/d3d9_common_buffer.cpp (outdated, resolved)

K0bin force-pushed the swvp-opt branch 4 times, most recently from 19792a4 to 115e9d6 Compare September 21, 2024 16:23

K0bin added 3 commits September 21, 2024 18:26

[d3d9] Always use per-draw buffer uploads on pure SWVP devices

6f67d46

[d3d9] Cleanup buffer memory flag selection

d12e424

[d3d9] Add SWVP HUD item

f7696c5

K0bin force-pushed the swvp-opt branch from 115e9d6 to f7696c5 Compare September 21, 2024 16:26

WinterSnowfall approved these changes Sep 21, 2024

doitsujin merged commit 04ad986 into doitsujin:master Sep 22, 2024
4 checks passed

K0bin deleted the swvp-opt branch September 22, 2024 19:07
