mirrors/vkd3d-proton

Commit Graph

Author	SHA1	Message	Date
Hans-Kristian Arntzen	5017b3723c	vkd3d: Enable VK_AMD_device_coherent_memory. For breadcrumbs support, along with buffer marker. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-11 13:07:56 +01:00
Hans-Kristian Arntzen	6a4f2842cb	cache: Move d3d12_pipeline_library to internal references. Allow us to hold internal magic pipeline libraries without creating cycles. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-11 12:29:32 +01:00
Hans-Kristian Arntzen	18a5315db4	cache: Refactor lock strategy of internal hashmaps. Rather than having to take writer lock on serialize calls from the outside, we should just take locks when accessing the internal hashmaps instead. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-11 12:29:32 +01:00
Hans-Kristian Arntzen	17b1ffb41a	vkd3d: Add path to use GENERAL depth-stencil images. On some implementations, it doesn't matter for performance what we use, and we can avoid a lot of ugly barriers this way. Opt-in to use this extensions on GPUs we know handles it well, otherwise, keep using the tracking paths. With VK_KHR_dynamic_rendering, this is now feasible to do since we no longer have to deal with shenanigans related to VkRenderPass layouts and complicated compatibility rules. To make this work with the existing framework, just need to consider that GENERAL can be a common layout alongside DEPTH_STENCIL_OPTIMAL, which are both common layouts that do not need to be tracked at all. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-10 15:14:55 +01:00
Hans-Kristian Arntzen	f9da3bf564	vkd3d: Add VK_KHR_driver_properties. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-10 15:14:55 +01:00
Hans-Kristian Arntzen	c6149b47cd	cache: Handle ref-count rules for multiple LoadPipeline/StorePipeline. In pipeline libraries, the library holds on to private references of the libraries so that they can be rapidly loaded on-demand. This behavior is verifed by API tests. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-09 18:35:09 +01:00
Hans-Kristian Arntzen	cc08339624	vkd3d: Use internal_refcounts for pipeline state. When we store pipeline state in libraries we have to manage lifetime a bit differently, which requires internal refcounts of some sort. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-09 18:35:09 +01:00
Hans-Kristian Arntzen	422f6804fb	vkd3d: Enable VK_KHR_create_renderpass2. Required extension by VK_KHR_fragment_shading_rate and VK_KHR_separate_depth_stencil_layouts, but we don't care about enabling any features or use it directly. Needed to silence validation errors. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-09 16:35:05 +01:00
Georg Lehmann	14a06680d9	vkd3d: Remove unused renderpass remains. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com>	2022-03-08 18:34:18 +01:00
Philip Rebohle	9a408367dc	vkd3d: Remove render pass cache. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-08 17:44:47 +01:00
Philip Rebohle	51e6b2bbbe	vkd3d: Remove render pass from command list state. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-08 17:44:47 +01:00
Philip Rebohle	94f82d1085	vkd3d: Get rid of pipeline variant flags. These only existed for VRS attachment, which is no longer necessary with VK_KHR_dynamic_rendering. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-08 17:44:47 +01:00
Philip Rebohle	1a68267962	vkd3d: Remove framebuffer list from d3d12_command_allocator. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-08 17:44:47 +01:00
Philip Rebohle	c4f88951fc	vkd3d: Use dynamic rendering for regular draw calls. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-08 17:44:47 +01:00
Philip Rebohle	9673ac173d	vkd3d: Use dynamic rendering for pipeline creation. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-08 17:44:47 +01:00
Philip Rebohle	3783eaf4f7	vkd3d: Implement swap chain blits using dynamic rendering. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-08 17:44:47 +01:00
Philip Rebohle	024ef02f9b	vkd3d: Implement meta image copies using dynamic rendering. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-08 17:44:47 +01:00
Philip Rebohle	549d4ee63f	vkd3d: Remove render pass list from d3d12_command_allocator. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-08 17:44:47 +01:00
Philip Rebohle	2c92ab7d1e	vkd3d: Enable and require VK_KHR_dynamic_rendering. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-08 17:44:47 +01:00
Hans-Kristian Arntzen	9a63df07b8	vkd3d: Add punchthrough path for descriptor copies. Proves out the viability of this style of implementation. Ideally we'd have a more officially sanctioned way of doing similar things later :) Unfortunately, the overhead removal is too great to ignore on target platform. Makes use of a private (reserved) extension for now ... Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-04 13:34:18 +01:00
Hans-Kristian Arntzen	54fbadcc94	vkd3d: Recycle command pools. Elden Ring in particular spam frees and allocates command pools despite this being a very bad idea. Add a simple 8-entry cache which seems to take care of it. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-25 13:59:08 +01:00
Hans-Kristian Arntzen	4b07535909	vkd3d: Optimize memory access pattern for single descriptor copies. We can mark a descriptor as being SINGLE_DESCRIPTOR, which means we only need one descriptor copy. This way, we can avoid doing somewhat expensive work (every nanosecond counts here): - Bitscan loop - Read deep into d3d12_device guts (often a cache miss). The memory index depends on the bitscan, which causes bubble. When we have a single descriptor, we can just store the binding information inline and avoid this jank. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-25 13:04:43 +01:00
Hans-Kristian Arntzen	84d632f194	vkd3d: Rewrite memory layout for resource descriptors. Tune memory layout so that we can deduce various information without making a single pointer dereference: - d3d12_descriptor_heap* - heap offset - Pointer to various side data structures we need to keep around. Instead of having one big 64 byte data structure with tons of padding, tune it down to 32 + 8 bytes per descriptor of extra dummy data. To make all of this work, use a somewhat clever encoding scheme for CPU VA where lower bits store number of active bits used to encode descriptor offset. From there, we can mask away bits to recover d3d12_descriptor_heap. Metadata is stored inline in one big allocation, and we can just offset from there based on extracted log2i_ceil(descriptor count). Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-25 13:04:43 +01:00
Hans-Kristian Arntzen	b309913b6d	vkd3d: Use unsafe_impl in CopyDescriptorsSimple. This is an ultra-hot path and seems to show up somehow on profile. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-25 13:04:43 +01:00
Hans-Kristian Arntzen	83c4e62660	vkd3d: Bump suballocation limit to 2 MiB. This is a more principled limit since that's the huge page size. Avoids some allocation spam. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-24 12:14:22 +01:00
Hans-Kristian Arntzen	edbf49aad4	vkd3d: Support opt-in to single MUTABLE set. Useful for Intel since Intel hardware cannot support more than 1M descriptors in general, and opting in to correct behavior should improve CPU overhead as well when copying descriptors. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-21 17:08:25 +01:00
Hans-Kristian Arntzen	15704b2419	vkd3d: Optimize descriptor copies for common code paths. The common path that we really need to optimize for is CBV_SRV_UAV + Simple + 1 descriptor. Descriptor benchmark shows an almost 50% reduction in overhead now. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-21 16:35:36 +01:00
Hans-Kristian Arntzen	c725c29bb6	vkd3d: Inline query for set/binding from set_index. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-21 16:35:36 +01:00
Hans-Kristian Arntzen	2f6a91e772	vkd3d: De-virtualize query for descriptor size. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-21 16:35:36 +01:00
Hans-Kristian Arntzen	5d345f47cc	vkd3d: Rewrite the pipeline library implementation. This became basically a rewrite in the end, and it got too awkward to split these commits in any meaningful way. The goals here were primarily to: - Support serializing SPIR-V and load SPIR-V. To do this robustly requires a lot more validation and checks to make sure end up compiling the same SPIR-V that we load from cache. This is critical for performance when games have primed their pipeline libraries and expect that loading a PSO should be fast. Without this, we will hit vkd3d-shader for every PSO, causing very long load times. - Implement the required validation for mismatched PSO descriptions. - Rewrite the binary layout of the pipeline library for flexibility concerns and performance. If the pipeline library is mmap-ed from disk - which appears to be the intended use - we only need to scan through the TOC to fully parse the library contents. From a flexibility concern, a blob needs to support inlined data, but a library can use referential links. We introduce separate hashmaps which store deduplicated SPIR-V and pipeline cache blobs, which significantly drop memory and storage requirements. For future improvements, it should be fairly easy to add information which lets us avoid SPIR-V or pipeline cache data altogether if relevant changes to Vulkan/drivers are made. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-17 11:00:03 +01:00
Hans-Kristian Arntzen	33f17cc74d	vkd3d: Add VK_EXT_pipeline_creation_feedback. Useful when used together with pipeline library logging. Confirms that we can load pipeline caches as expected. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-04 14:31:34 +01:00
Hans-Kristian Arntzen	e5e662ce22	vkd3d: Record root signature compatibility hashes. For pipeline libraries and DXR to some extent later, we'll need an easy way to compare root signature objects. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-04 14:31:34 +01:00
Hans-Kristian Arntzen	1d39c25a59	vkd3d: Properly invalidate pipeline when binding NULL DSV. We did not test the scenario where we first render with depth enabled, and then bind a NULL DSV with the same pipeline. Also fix issues if we bind NULL RTVs with same pipeline bound. Fixes crash in Guardians of the Galaxy. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-04 13:10:16 +01:00
Hans-Kristian Arntzen	81a215d0bf	vkd3d: Implement COLOR -> STENCIL copy if stencil export is supported. Fallback is a bit more involved. Cleans up the FIXME to not report benign issues. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-03 15:43:41 +01:00
Hans-Kristian Arntzen	86f8f41490	vkd3d: Compute a global shader interface key for a D3D12 device. This key represents the variations of SPIR-V which would be generated from otherwise identical inputs like DXBC blobs and root signatures. Typically, changing VKD3D_CONFIG flags or enabled extensions will affect this key. This ensures that we will not attempt to use a cached SPIR-V file unless we can trust that the SPIR-V interface will match. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-01-25 14:07:07 +01:00
Hans-Kristian Arntzen	6e697a54b6	vkd3d: Add d3d12_cached_pipeline_state. Wraps the D3D12 struct with a pipeline library handle. This is needed if the blob contains references to external data, which then needs to be resolved. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-01-25 14:07:07 +01:00
Hans-Kristian Arntzen	41c977d616	cache: Move cache implementation over to read-writer locks. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-01-25 14:07:07 +01:00
Hans-Kristian Arntzen	5c492e9e6c	vkd3d: Handle overlapped transfer writes. D3D12 expects drivers to implicitly synchronize transfer operations, since there is no TRANSFER barrier ala UAV barriers. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-01-19 14:44:33 +01:00
Hans-Kristian Arntzen	6cba8b9945	vkd3d: Workaround broken barriers in DEATHLOOP. In DEATHLOOP, there is a render pass which renders out a simple image, which is then directly followed by a compute dispatch, reading that image. The image is still in RENDER_TARGET state, and color buffers are not flushed properly on at least RADV, manifesting as a very distracting glitch pattern. This is a game bug, but for the time being, we have to workaround it, sigh. For a simple workaround, we can detect patterns where we see these events in succession: - Color RT is started - StateBefore == RENDER_TARGET is not observed - Dispatch() In particular, when entering the options menu, highly distracting glitches are observed in the background. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-01-12 12:20:03 +01:00
Samuel Pitoiset	f6fe3e0183	vkd3d: Require VK_KHR_copy_commands2 This extension is trivial to implement for vendors and should be widely supported. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2022-01-12 12:06:06 +01:00
Samuel Pitoiset	b42a7193fc	vkd3d: Require VK_KHR_bind_memory2 This extension is trivial to implement for vendors and should be widely supported. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2022-01-12 12:06:06 +01:00
Philip Rebohle	4000397570	vkd3d: Remove legacy format compatibility info. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-11-26 16:51:01 +01:00
Philip Rebohle	9624102dcb	vkd3d: Rework format compatibility lists. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-11-26 16:51:01 +01:00
Philip Rebohle	9185edb42a	vkd3d: Implement ID3D12GraphicsCommandList6. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-11-19 14:57:51 +01:00
Philip Rebohle	b03c1fcb5f	vkd3d: Implement ID3D12Device9. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-11-19 14:57:51 +01:00
Philip Rebohle	3b6a4ab988	vkd3d: Implement ID3D12Device8 and ID3D12Resource2. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-11-19 14:57:51 +01:00
Philip Rebohle	d61f562a3e	vkd3d: Implement ID3D12Device7. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-11-19 14:57:51 +01:00
Joshua Ashton	046524f2a1	vkd3d: Implement MinLODClamp using VK_EXT_image_view_min_lod Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-11-17 20:51:20 +01:00
Hans-Kristian Arntzen	b53a4a98a6	vkd3d: Enable per component robustness on AMD. Tested and verified to work as expected, not so much on NV. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-11-09 14:47:10 +01:00
Hans-Kristian Arntzen	3210832ad9	vkd3d: Enable VK_EXT_scalar_block_layout. dxil-spirv can take advantage of this now. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-11-09 14:47:10 +01:00

1 2 3 4 5 ...

802 Commits