upgrade to wgpu 24.0.0 #455

ygdrasil-io · 2025-01-25T19:01:24Z

This PR is based on #445 to upgrade wgpu to 24.0.0 and remove the memory leak.

Feel free to comment regarding the last changes.

I found one memory leak here: eliemichel@452b21e, but there is another one.

After 5-10 minutes of running, this decreases the memory leak by 50%, but it is still leaking, while the same sample is memory stable on the wgpu Rust repository.

This goes up to webgpu-native/webgpu-headers@2b59747 Things I *didn't* do: * I didn't update the library to make sure "instance dropped" callback error codes are guaranteed to happen, like they seem to be in Dawn. List of changes (roughly in order of header commits): Various enum and struct renames Updated callbacks to use the new *CallbackInfo structs and 2-userdata system. Also updated functions to return WGPUFuture, though the WGPUFuture thing is just stubbed out at the moment as I don't think wgpu-core has the necessary functionality for it. wgpuInstanceWaitAny is unimplemented!() DepthClipControl merged into PrimitiveState, related code simplified. Updated depthWriteEnabled to use an optional bool, mostly matters due to added validation. Add TODOs for missing features (sliced 3D compressed textures) *Reference() became *AddRef() Added unorm10-10-10-2 vertex format Usage field in TextureViewDescriptor, just used for validation as wgpu-core doesn't allow specifying it anyways. Removed maxInterStageShaderComponents Added clang_macro_fallback to bindgen config, since the headers switched to using UINT32_MAX etc. UINT64_MAX still doesn't work so I had to manually define those. Renamed flags enums. Added a conversion helper function to convert them from u64 -> u32 for mapping. (means added direct dependency on bitflags crate) Removed device argument from (unimplemented) wgpuGetProcAddress Suboptimal surface texture acquisition moved to enum return value, was easy since wgpu-core already returns it like that. "Undefined" present mode added, it just selects FIFO.

This enum was replaced with WGPUMapAsyncStatus, but I didn't quite notice. The error codes map different.

Replaces *EnumerateFeatures with *GetFeatures. Also fixes CI due to fix in headers.

Upstream removed the "Flags" suffix from flags types and moved them to no longer be C enums. This matches that change. WGPUInstanceFlag still has "Flag" in the name because, well, there'd be nothing left to distinguish it from WGPUInstance, and it makes sense for it.

Also updates wgpu.h to use WGPUStringView everywhere.

All stubs, since we don't have WaitAny at the moment.

Updates to webgpu-native/webgpu-headers@6a23100

Update to webgpu-native/webgpu-headers@f1cdc3f Also went through a bunch of existing enum conversions and fixed up some seemingly spec-incorrect cases of undefined enums not being handled properly. As part of this, I made a new helper map_enum_with_undefined!() which distinguishes undefined and unknown enum values. Previously, much code relied on undefined being caught in the same net as an unknown value. This is no longer the case.

This updates the headers to webgpu-native/webgpu-headers@af63d34 These changes are exclusively in the header enum values, so no Rust code needs changing. Making this a separate commit for easier review.

Update headers to webgpu-native/webgpu-headers@b7656d0 Adds dual source blending. Other two features (float32 blendable and clip distances feature in wgsl) are not supported by wgpu. Also made map_blend_factor use the shorter macro form, as all the enum names match (they didn't 3 years ago when this code was originally written, according to Git history).

Oops that's not how these chained structs work.

Updates headers to webgpu-native/webgpu-headers@6f549cc Also made matching changes to wgpu.h

…aders

This commit updates the wgpu-core, wgpu-types, wgpu-hal, and naga crates to version 24.0.0. It also upgrades several other dependencies, including bitflags to 2.8.0 and serde to 1.0.217, while introducing new crates like js-sys, strum, and ordered-float. These changes ensure compatibility with the latest ecosystem updates and resolve potential build issues.

Previously, the R64Uint texture format was not accounted for, resulting in a lack of proper mapping. This change ensures the format is explicitly handled and returns `None`, aligning with other unimplemented or unsupported formats.

Added mapping for `wgt::SurfaceStatus::Unknown` to return `native::WGPUSurfaceGetCurrentTextureStatus_Error`. This ensures the application can appropriately handle unexpected surface status values and reduces potential runtime issues. A TODO comment was also added to include logging for better context in the future.

Replaced `ShaderBoundChecks` with `ShaderRuntimeChecks` in `ShaderModuleDescriptor` for clarity and consistency with naming conventions. This adjusts the descriptor to align with updated semantics in the wgt module.

This change introduces the mapping of texture usage flags during the construction of a TextureViewDescriptor. It ensures correct usage information is passed, improving compatibility and correctness in texture handling.

The removed check and comment were outdated and no longer necessary. This cleanup simplifies the code and avoids confusion regarding usage validation, which is now handled elsewhere.

The `texture_usage` variable was removed as it was defined but never used. This change simplifies the code and eliminates redundant assignments, improving readability and maintainability.

The error variant `Unsupported` was replaced with a more specific `FailedToRetrieveSurfaceCapabilitiesForAdapter`. This improves clarity and aligns with the updated error handling semantics.

Replaced `shader_bound_checks` with `runtime_checks` to match updates in the wgt API. This ensures compatibility with the latest naming conventions and improves code clarity.

Replaced `ImageCopy*` types with `TexelCopy*` equivalents for consistency with updated API naming. Adjusted feature detection to use experimental ray tracing and ray query feature names. These changes improve clarity and align with updated standards.

Simplifies the code by eliminating the unnecessary `return` keyword. This improves readability and aligns with idiomatic Rust practices. No functionality is affected by this change.

Reorganized load/store op mapping to include clear values and introduced `map_load_op_and_color` for better readability. Updated DX12 compiler handling to differentiate between dynamic and static Dxc paths. These changes improve code clarity and make handling of load/store operations and DX12 configurations more robust.

Corrects the incorrect use of `dxilPath` for `dxcPath` in DXC path assignment. Refactors instance descriptor to use structured `BackendOptions` for improved clarity and maintainability.

Simplified and streamlined the syntax for handling callbacks and closures. This enhances readability and consistency in the code by removing unnecessary wrapping and renaming variables for better alignment with Rust conventions.

Simplified the code by eliminating unnecessary `return` keywords before `NULL_FUTURE`. This improves code readability and aligns with Rust's idiomatic style for returning expressions. No functional changes were introduced.

Simplify conditional branches by reducing nested matches and removing redundant code. This improves readability and maintains the intended functionality for surface texture management and resource cleanup.

…-to-wgpu-24.0.0 # Conflicts: # src/lib.rs

Updated the shader stage type from WGPUShaderStageFlags to WGPUShaderStage to align with API requirements. Adjusted conversion logic to handle the new type safely and ensure compatibility with associated functionality.

Corrected the initial state of `open` and adjusted its updates to ensure proper command buffer lifecycle management. This addresses potential inconsistencies in how command buffers are handled during operations.

ygdrasil-io · 2025-01-26T00:10:57Z

With 70a8705 memory is stable. If someone could test on other OSs and architectures than Mac Arm.

ygdrasil-io · 2025-01-26T00:22:11Z

src/conv.rs

-            native::WGPUDx12Compiler_Dxc => wgt::Dx12Compiler::Dxc {
-                dxil_path: ptr_into_pathbuf(extras.dxilPath),
-                dxc_path: ptr_into_pathbuf(extras.dxcPath),
+            // TODO add specific value to cover dynamic and static Dxc


On wgpu.h should we convert

typedef enum WGPUDx12Compiler { WGPUDx12Compiler_Undefined = 0x00000000, WGPUDx12Compiler_Fxc = 0x00000001, WGPUDx12Compiler_Dxc = 0x00000002, WGPUDx12Compiler_Force32 = 0x7FFFFFFF } WGPUDx12Compiler;

To

typedef enum WGPUDx12Compiler { WGPUDx12Compiler_Undefined = 0x00000000, WGPUDx12Compiler_Fxc = 0x00000001, WGPUDx12Compiler_StaticDxc = 0x00000002, WGPUDx12Compiler_DynamicDxc = 0x00000004, WGPUDx12Compiler_Force32 = 0x7FFFFFFF } WGPUDx12Compiler;

?

Refactored code to use `map_load_op` directly, replacing the removed `map_load_op_and_color` function. Adjusted call sites to explicitly map clear values, improving clarity and reducing redundant abstractions.

Capati · 2025-01-26T18:20:27Z

With 70a8705 memory is stable. If someone could test on other OSs and architectures than Mac Arm.

Thanks for this!

I just tested a simple example (rotating cube) on Windows 11 x64, and it appears to be stable now. The memory usage no longer increases over time. I recall previous tests where the global report indicated numerous command buffers were not being released, but the issue seems to be resolved:

Global_Report {
  Surfaces:
        Surfaces:num_allocated = 0
        Surfaces:num_kept_from_user = 0
        Surfaces:num_released_from_user = 1
        Surfaces:element_size = 8
  Overview:
        adapters.num_allocated = 0
        adapters.num_kept_from_user = 0
        adapters.num_released_from_user = 1
        adapters.element_size = 8
        ----------
        devices.num_allocated = 0
        devices.num_kept_from_user = 0
        devices.num_released_from_user = 1
        devices.element_size = 8
        ----------
        pipeline_layouts.num_allocated = 0
        pipeline_layouts.num_kept_from_user = 0
        pipeline_layouts.num_released_from_user = 0
        pipeline_layouts.element_size = 16
        ----------
        shader_modules.num_allocated = 0
        shader_modules.num_kept_from_user = 0
        shader_modules.num_released_from_user = 1
        shader_modules.element_size = 16
        ----------
        bind_group_layouts.num_allocated = 0
        bind_group_layouts.num_kept_from_user = 0
        bind_group_layouts.num_released_from_user = 1
        bind_group_layouts.element_size = 16
        ----------
        bind_groups.num_allocated = 0
        bind_groups.num_kept_from_user = 0
        bind_groups.num_released_from_user = 1
        bind_groups.element_size = 16
        ----------
        command_buffers.num_allocated = 0
        command_buffers.num_kept_from_user = 0
        command_buffers.num_released_from_user = 1
        command_buffers.element_size = 8
        ----------
        render_bundles.num_allocated = 0
        render_bundles.num_kept_from_user = 0
        render_bundles.num_released_from_user = 0
        render_bundles.element_size = 16
        ----------
        render_pipelines.num_allocated = 0
        render_pipelines.num_kept_from_user = 0
        render_pipelines.num_released_from_user = 1
        render_pipelines.element_size = 16
        ----------
        compute_pipelines.num_allocated = 0
        compute_pipelines.num_kept_from_user = 0
        compute_pipelines.num_released_from_user = 0
        compute_pipelines.element_size = 16
        ----------
        query_sets.num_allocated = 0
        query_sets.num_kept_from_user = 0
        query_sets.num_released_from_user = 0
        query_sets.element_size = 16
        ----------
        textures.num_allocated = 0
        textures.num_kept_from_user = 0
        textures.num_released_from_user = 2
        textures.element_size = 16
        ----------
        texture_views.num_allocated = 0
        texture_views.num_kept_from_user = 0
        texture_views.num_released_from_user = 2
        texture_views.element_size = 16
        ----------
        samplers.num_allocated = 0
        samplers.num_kept_from_user = 0
        samplers.num_released_from_user = 0
        samplers.element_size = 16
}

PJB3005 and others added 30 commits September 19, 2024 17:53

Update C examples to new headers

0c60733

Fix WGPUBufferMapAsyncStatus tomfoolery

980c2ff

This enum was replaced with WGPUMapAsyncStatus, but I didn't quite notice. The error codes map different.

Fix macOS typos in examples

8f34e48

Update headers again

b36e558

Replaces *EnumerateFeatures with *GetFeatures. Also fixes CI due to fix in headers.

Update headers again, WGPUStringView

a256544

Also updates wgpu.h to use WGPUStringView everywhere.

Only specify major version for bitflags dependency

8947cf2

Remove redundant unsafe block

fe965c8

Start upgrading, try to use Box<ComputePass>

f5040ed

Switch to *mut for ComputePass and RenderPass

2afb356

Adapt C examples

2293063

Clean up dependencies to point to wgpu repo

eaa44b1

Format code

23d8907

Add wgpuGetInstanceFeatures stuff

482bc24

All stubs, since we don't have WaitAny at the moment.

I forgot to cargo fmt again

8f80a2c

Implement "NotUsed" bind group entry types

77dfe52

Updates to webgpu-native/webgpu-headers@6a23100

Update for other enum changes

aa6b617

This updates the headers to webgpu-native/webgpu-headers@af63d34 These changes are exclusively in the header enum values, so no Rust code needs changing. Making this a separate commit for easier review.

Fix getFeatures chained struct handling.

b6b6903

Oops that's not how these chained structs work.

Flatten limits structures

4d49d7f

Updates headers to webgpu-native/webgpu-headers@6f549cc Also made matching changes to wgpu.h

Merge remote-tracking branch 'upstream/trunk' into 24-09-19-update-he…

24ec452

…aders

Some day I will learn to run cargo fmt and check compile before merges

620405f

Merge branch 'trunk' into eliemichel/2024-10-14-upgrade-wgpu

ab33119

Upgrade timestamp functions

2d1888b

Fix push_constant example

e81dd63

Upgrade to what is almost wgpu 23.0.0

76df580

Upgrade to official v23.0.0

ecc0266

Upgrade to latest webgpu.h

e36359f

eliemichel and others added 21 commits November 15, 2024 15:53

Add missing unimplemented symbols

6eb8945

Update shader bound checks to runtime checks

b5136f8

Replaced `ShaderBoundChecks` with `ShaderRuntimeChecks` in `ShaderModuleDescriptor` for clarity and consistency with naming conventions. This adjusts the descriptor to align with updated semantics in the wgt module.

Add usage mapping to TextureViewDescriptor construction

ab9338b

This change introduces the mapping of texture usage flags during the construction of a TextureViewDescriptor. It ensures correct usage information is passed, improving compatibility and correctness in texture handling.

Remove unused texture view usage check and TODO comment

6a7cd2c

The removed check and comment were outdated and no longer necessary. This cleanup simplifies the code and avoids confusion regarding usage validation, which is now handled elsewhere.

Refactor to remove unused texture_usage variable.

345e112

The `texture_usage` variable was removed as it was defined but never used. This change simplifies the code and eliminates redundant assignments, improving readability and maintainability.

Update error variant for surface capabilities retrieval

8e8de3c

The error variant `Unsupported` was replaced with a more specific `FailedToRetrieveSurfaceCapabilitiesForAdapter`. This improves clarity and aligns with the updated error handling semantics.

Update ShaderModuleDescriptor to use runtime_checks field

69aabec

Replaced `shader_bound_checks` with `runtime_checks` to match updates in the wgt API. This ensures compatibility with the latest naming conventions and improves code clarity.

Remove redundant return statement in NULL_FUTURE usage

8e70d82

Simplifies the code by eliminating the unnecessary `return` keyword. This improves readability and aligns with idiomatic Rust practices. No functionality is affected by this change.

Fix DXC path assignment and update backend options structure

e4cb7b5

Corrects the incorrect use of `dxilPath` for `dxcPath` in DXC path assignment. Refactors instance descriptor to use structured `BackendOptions` for improved clarity and maintainability.

Refactor callback and closure handling for clarity

90d7ce2

Simplified and streamlined the syntax for handling callbacks and closures. This enhances readability and consistency in the code by removing unnecessary wrapping and renaming variables for better alignment with Rust conventions.

Remove redundant return statements for NULL_FUTURE

da90331

Simplified the code by eliminating unnecessary `return` keywords before `NULL_FUTURE`. This improves code readability and aligns with Rust's idiomatic style for returning expressions. No functional changes were introduced.

Refactor surface handling logic in texture release.

452b21e

Simplify conditional branches by reducing nested matches and removing redundant code. This improves readability and maintains the intended functionality for surface texture management and resource cleanup.

Merge remote-tracking branch 'origin-real/trunk' into feature/upgrade…

61c9e75

…-to-wgpu-24.0.0 # Conflicts: # src/lib.rs

Fix type mismatch in wgpuRenderBundleEncoderSetPushConstants

2411bb8

Updated the shader stage type from WGPUShaderStageFlags to WGPUShaderStage to align with API requirements. Adjusted conversion logic to handle the new type safely and ensure compatibility with associated functionality.

Run cargo fmt

9ab239b

Fix incorrect initialization and state updates for command buffers

70a8705

Corrected the initial state of `open` and adjusted its updates to ensure proper command buffer lifecycle management. This addresses potential inconsistencies in how command buffers are handled during operations.

ygdrasil-io commented Jan 26, 2025

View reviewed changes

Remove map_load_op_and_color and update load operation mapping.

df33227

Refactored code to use `map_load_op` directly, replacing the removed `map_load_op_and_color` function. Adjusted call sites to explicitly map clear values, improving clarity and reducing redundant abstractions.

ygdrasil-io marked this pull request as ready for review January 26, 2025 00:35

Merge branch 'trunk' into feature/upgrade-to-wgpu-24.0.0

4fbdf65

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

upgrade to wgpu 24.0.0 #455

upgrade to wgpu 24.0.0 #455

ygdrasil-io commented Jan 25, 2025

ygdrasil-io commented Jan 26, 2025

ygdrasil-io Jan 26, 2025

Capati commented Jan 26, 2025

upgrade to wgpu 24.0.0 #455

Are you sure you want to change the base?

upgrade to wgpu 24.0.0 #455

Conversation

ygdrasil-io commented Jan 25, 2025

ygdrasil-io commented Jan 26, 2025

ygdrasil-io Jan 26, 2025

Choose a reason for hiding this comment

Capati commented Jan 26, 2025