We can mark a descriptor as being SINGLE_DESCRIPTOR, which means we
only need one descriptor copy. This way, we can avoid doing somewhat
expensive work (every nanosecond counts here):
- Bitscan loop
- Read deep into d3d12_device guts (often a cache miss). The memory
index depends on the bitscan result, which causes a pipeline bubble.
When we have a single descriptor, we can just store the binding
information inline and avoid this jank.
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
Tune the memory layout so that we can deduce various information without
a single pointer dereference:
- d3d12_descriptor_heap*
- heap offset
- Pointer to various side data structures we need to keep around.
Instead of having one big 64-byte data structure with tons of padding,
tune it down to 32 bytes, plus 8 bytes of extra dummy data, per descriptor.
To make all of this work, use a somewhat clever encoding scheme for CPU
VA where lower bits store number of active bits used to encode
descriptor offset. From there, we can mask away bits to recover
d3d12_descriptor_heap. Metadata is stored inline in one big allocation,
and we can just offset from there based on extracted log2i_ceil(descriptor count).
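The encoding scheme can be sketched as follows. This is illustrative C, not the actual vkd3d-proton code: the 6-bit log2 field width, the shift amount, and the alignment requirement on the heap base are assumptions for the sketch.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical sketch of the CPU VA encoding described above.
 * Assumption: the heap's backing allocation is aligned to at least
 * 1 << (log2_bits + 6), so the low bits of the base address are free. */

static inline unsigned int log2i_ceil(unsigned int value)
{
    unsigned int bits = 0;
    while ((1u << bits) < value)
        bits++;
    return bits;
}

#define VA_LOG2_MASK 0x3fu /* low 6 bits store the offset bit count */

/* Encode: heap base | (offset << 6) | number of offset bits. */
static inline uintptr_t encode_cpu_va(uintptr_t heap_base,
        unsigned int offset, unsigned int log2_bits)
{
    return heap_base | ((uintptr_t)offset << 6) | log2_bits;
}

/* Mask away the offset and log2 fields to recover the heap pointer. */
static inline uintptr_t decode_heap_base(uintptr_t va)
{
    unsigned int log2_bits = (unsigned int)(va & VA_LOG2_MASK);
    return va & ~(((uintptr_t)1 << (log2_bits + 6)) - 1);
}

/* Extract the descriptor offset from the middle bits. */
static inline unsigned int decode_offset(uintptr_t va)
{
    unsigned int log2_bits = (unsigned int)(va & VA_LOG2_MASK);
    return (unsigned int)((va >> 6) & (((uintptr_t)1 << log2_bits) - 1));
}
```

For a heap of 1000 descriptors, log2i_ceil gives 10 offset bits, so masking away the low 16 bits of the handle recovers the heap pointer without touching memory.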
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
This is a more principled limit since that's the huge page size.
Avoids some allocation spam.
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
Useful for Intel since Intel hardware cannot support more than 1M
descriptors in general, and opting in to correct behavior should improve
CPU overhead as well when copying descriptors.
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
The common path that we really need to optimize for is CBV_SRV_UAV +
Simple + 1 descriptor.
Descriptor benchmark shows an almost 50% reduction in overhead now.
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
This became basically a rewrite in the end, and it got too awkward to
split these commits in any meaningful way.
The goals here were primarily to:
- Support serializing SPIR-V and load SPIR-V.
Doing this robustly requires a lot more validation and checks to make
sure we end up compiling the same SPIR-V that we load from the cache.
This is critical for performance when games have primed their pipeline
libraries and expect that loading a PSO should be fast. Without this,
we will hit vkd3d-shader for every PSO, causing very long load times.
- Implement the required validation for mismatched PSO descriptions.
- Rewrite the binary layout of the pipeline library for flexibility
concerns and performance.
If the pipeline library is mmap-ed from disk - which appears to be
the intended use - we only need to scan through the TOC to fully parse
the library contents.
For flexibility, a blob needs to support inlined data,
but a library can use referential links. We introduce separate
hashmaps which store deduplicated SPIR-V and pipeline cache blobs,
which significantly drops memory and storage requirements.
For future improvements, it should be fairly easy to add information
which lets us avoid SPIR-V or pipeline cache data altogether if
relevant changes to Vulkan/drivers are made.
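To illustrate the TOC-driven layout, a minimal sketch follows. The struct names, field widths, and layout here are hypothetical, not the actual vkd3d-proton on-disk format; the point is that blob data is referenced by (hash, offset, size) entries, so parsing a mmap-ed file only requires one pass over the TOC.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical on-disk layout: a mmap-ed pipeline library whose table of
 * contents references deduplicated blob data elsewhere in the file. */

struct toc_entry
{
    uint64_t hash;   /* key identifying the blob */
    uint64_t offset; /* byte offset of blob data within the file */
    uint64_t size;   /* blob size in bytes */
};

struct library_header
{
    uint32_t magic;
    uint32_t toc_count;
    /* struct toc_entry toc[toc_count] follows in the file. */
};

/* Look up a blob by hash with a linear scan of the TOC;
 * returns its index, or -1 if not found. */
static int find_blob(const struct toc_entry *toc, uint32_t count,
        uint64_t hash)
{
    for (uint32_t i = 0; i < count; i++)
        if (toc[i].hash == hash)
            return (int)i;
    return -1;
}
```

Since entries only reference data by offset, the blob payloads never need to be touched until a specific PSO is actually requested.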
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
Useful when used together with pipeline library logging. Confirms that
we can load pipeline caches as expected.
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
For pipeline libraries and DXR to some extent later, we'll need an easy
way to compare root signature objects.
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
We did not test the scenario where we first render with depth enabled,
and then bind a NULL DSV with the same pipeline.
Also fix issues when we bind NULL RTVs with the same pipeline bound.
Fixes crash in Guardians of the Galaxy.
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
This key represents the variations of SPIR-V which would be generated
from otherwise identical inputs like DXBC blobs and root signatures.
Typically, changing VKD3D_CONFIG flags or enabled extensions will affect
this key. This ensures that we will not attempt to use a cached SPIR-V
file unless we can trust that the SPIR-V interface will match.
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
Wraps the D3D12 struct with a pipeline library handle.
This is needed if the blob contains references to external data,
which then needs to be resolved.
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
D3D12 expects drivers to implicitly synchronize transfer operations,
since there is no TRANSFER barrier analogous to UAV barriers.
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
In DEATHLOOP, there is a render pass which renders out a simple image,
which is then directly followed by a compute dispatch reading that
image. The image is still in RENDER_TARGET state, and color buffers are
*not* flushed properly on at least RADV, manifesting as a very
distracting glitch pattern. In particular, when entering the options
menu, highly distracting glitches are observed in the background.
This is a game bug, but for the time being, we have to work around it,
*sigh*.
As a simple workaround, we can detect patterns where we see these
events in succession:
- A color RT render pass is started.
- No barrier with StateBefore == RENDER_TARGET is observed.
- Dispatch() is called.
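The detection heuristic can be sketched as a tiny state machine. The function and struct names here are illustrative, not the vkd3d-proton API; the assumption is that a flush is injected before a dispatch whenever a color RT was written without a subsequent RENDER_TARGET barrier.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical tracker for the workaround pattern described above. */
struct rt_glitch_tracker
{
    bool rt_pass_active;      /* a color RT render pass was started */
    bool rt_barrier_observed; /* StateBefore == RENDER_TARGET was seen */
};

static void on_color_rt_begin(struct rt_glitch_tracker *t)
{
    t->rt_pass_active = true;
    t->rt_barrier_observed = false;
}

static void on_barrier_from_render_target(struct rt_glitch_tracker *t)
{
    t->rt_barrier_observed = true;
}

/* Returns true if a color flush should be injected before the dispatch. */
static bool on_dispatch_needs_flush(struct rt_glitch_tracker *t)
{
    bool need_flush = t->rt_pass_active && !t->rt_barrier_observed;
    t->rt_pass_active = false;
    return need_flush;
}
```

If the game does transition the RT properly, on_barrier_from_render_target fires and no extra flush is emitted, so well-behaved passes pay nothing.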
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
If we need to fall back in both VRS and non-VRS scenarios, we need to
key on it. Fixes a segfault in DIRT 5 when toggling VRS.
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
With RTPSOs we might have to create static sampler sets for local root
signatures. In this case we will have to create a compatible pipeline
layout which is equal to the global pipeline layout, except for an
extra set.
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
When allocating dedicated memory, ignore heap_flag requirements we
deduce from memory info. Any memory type is allowed. This is important
on NV when allocating fallback render targets.
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
Don't attempt to enter memory allocation when we can invalidate a heap
allocation up front. Avoids some dumb edge cases later.
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
Need to use the fallback pipeline system here.
Keep track of active masks for the PSO and the current render target.
The intersection of those sets is the set of attachments which should
be active in the render pass.
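The mask rule can be stated in one line of C (illustrative sketch; the mask names are not the actual vkd3d-proton identifiers):

```c
#include <assert.h>
#include <stdint.h>

/* The attachments active in a render pass are the intersection of the
 * PSO's render-target mask and the currently bound render-target mask. */
static inline uint32_t active_attachment_mask(uint32_t pso_mask,
        uint32_t rtv_mask)
{
    return pso_mask & rtv_mask;
}
```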
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
For resizable BAR, we don't want to endlessly promote UPLOAD heaps to
BAR since VRAM is precious. The aim is to set a fixed budget where we
can keep allocating until full, at which point we fall back to plain HOST.
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
With BAR budgets, what will happen is that:
- A small allocation is requested.
- A new chunk is requested.
- try_suballocate_memory ends up calling allocate_memory, which
allocates a fallback memory type.
- Subsequent small allocations will always end up allocating a new
fallback memory block, never reusing existing blocks.
- System memory is rapidly exhausted once apps start hitting the
budget.
The fix is to add flags which explicitly do not attempt a fallback
allocation. This makes it possible to handle fallbacks at the
appropriate level in try_suballocate_memory instead.
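A minimal sketch of the flag-based fix, under the assumption that a NO_FALLBACK flag makes the inner allocator fail rather than silently pick a fallback type (the flag name, struct, and signatures are illustrative, not the real vkd3d-proton code):

```c
#include <assert.h>
#include <stdbool.h>

#define ALLOC_FLAG_NO_FALLBACK 0x1u

struct chunk { bool is_fallback; };

/* Toy allocate_memory: succeeds in the preferred type unless it is
 * exhausted; with NO_FALLBACK set it fails instead of falling back. */
static bool allocate_memory(bool preferred_type_full, unsigned int flags,
        struct chunk *out)
{
    if (!preferred_type_full)
    {
        out->is_fallback = false;
        return true;
    }
    if (flags & ALLOC_FLAG_NO_FALLBACK)
        return false;
    out->is_fallback = true;
    return true;
}

/* The suballocator now owns the fallback decision, so fallback chunks
 * go through the same reuse path as preferred-type chunks. */
static bool try_suballocate_memory(bool preferred_type_full,
        struct chunk *out)
{
    /* First attempt: a new chunk in the preferred type only. */
    if (allocate_memory(preferred_type_full, ALLOC_FLAG_NO_FALLBACK, out))
        return true;
    /* Fallback handled here, at the level that caches chunks for reuse. */
    return allocate_memory(preferred_type_full, 0, out);
}
```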
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
We will need to consider some form of budgeting, so make sure that all
allocation and freeing is done in a central place.
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
Need to consider that, based on host visibility requirements, we need
to select either LINEAR or OPTIMAL image tiling, and those tiling
modes can have different memory requirements.
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
Need to initialize the set mask so that copies happen properly
on default-initialized descriptors. Also, move the current_null_type to
metadata so that it's properly copied on descriptor copy.
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
There are titles clearing the same descriptors constantly.
This leads to unnecessary updates that can become costly.
This commit introduces a new flag to track when D3D12 descriptors are
not null, and skips clearing them if they are already null.
Descriptors are assumed to be null by default.
This fixes a performance regression introduced by
9983a1720f
Signed-off-by: Rodrigo Locatti <rlocatti@nvidia.com>
Emitting render pass clears while we're in the process of starting
a render pass overrides dsv layout tracking info.
Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>
Get information directly from vkd3d_format and allow for subsampled
formats in the future.
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
Goal here is to avoid unnecessary image layout transitions when render
passes toggle depth-stencil PSO states. Since we cannot know which
states a resource is in, we have to be conservative, and assume that
shader reads *could* happen.
The best effort we can do is to detect when writes happen to a DSV
resource. In this scenario, we can deduce that the aspect cannot be
read, since DEPTH_WRITE | RESOURCE state is not allowed.
To keep the tracking somewhat sane, we only promote to OPTIMAL if an
entire image's worth of subresources for a given aspect is transitioned.
The common case for depth-stencil images is 1 mip / 1 layer anyway.
Some other changes are required here:
- Instead of common_layout for the depth image, we need to consult the
command list, which might promote the layout to optimal.
- We make use of render pass compatibility rules which state that we can
change attachment reference layouts as well as initial/finalLayout.
To make this change, a pipeline will fill in a
vkd3d_render_pass_compat struct.
- A command list has a dsv_plane_optimal_mask which keeps track
of the plane aspects we have promoted to OPTIMAL, and we know cannot
be read by shaders.
The desired optimal mask is (existing optimal | PSO write).
The initial existing optimal is inherited from the command list's
tracker.
- RTV/DSV/views no longer keep track of VkImageLayout. This is
unnecessary since we always deduce image layout based on context.
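The optimal-mask rule above reduces to a bitwise union over plane aspects (illustrative sketch; the plane bit values are assumptions, not the actual vkd3d-proton constants):

```c
#include <assert.h>
#include <stdint.h>

#define DSV_PLANE_DEPTH   0x1u
#define DSV_PLANE_STENCIL 0x2u

/* Desired optimal plane mask = planes already tracked as OPTIMAL
 * unioned with the planes the bound PSO writes. */
static inline uint32_t desired_dsv_optimal_mask(uint32_t existing_optimal,
        uint32_t pso_write_mask)
{
    return existing_optimal | pso_write_mask;
}
```

Since a promoted plane can never be demoted within the command list, the mask only grows, which is what makes the per-list tracking cheap.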
Overall, this shows a massive gain in the HZD benchmark (RADV, 1440p
Ultimate preset: ~16% FPS on RX 6800).
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
The idea is to keep track of scenarios where a resource's aspect is
known to be in an OPTIMAL state. Based on this, we can override the
image layout from the common_layout in order to avoid unnecessary full
barriers.
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
Not correct, will need spec additions to handle it properly.
Fixes ground rendering in DIRT 5.
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
- Honor resource barriers for resource states which cannot automatically
decay or promote. This includes COLOR_ATTACHMENT, UNORDERED_ACCESS and
VRS image. If SIMULTANEOUS_ACCESS is used, we can still promote, and
we handle that by setting common layout to GENERAL for these resources.
- Avoid redundant barriers in render passes since normal resource
barriers will always make sure we are already in
COLOR_ATTACHMENT_OPTIMAL.
- Do not force GENERAL layout if resource has UNORDERED_ACCESS flag set.
As this is not a promotable state, we have to explicitly transition
into it. I tested this on validation layers, where even COMMON state
refuses to promote to UAV state. The exception here, of course, is
SIMULTANEOUS_ACCESS, but we handle that properly now.
- Verify that UAV or SIMULTANEOUS access is not used together with DSV
state. This is explicitly banned in the API docs.
- Actually emit image barriers. Batch the image transitions, as that's
what the D3D12 docs encourage app developers to do, and they also
expect that drivers can optimize this. Ensure that we respect the
in-order resource barrier rules by splitting batches if there are
overlaps in the transitions.
- Ensure that correct image layout is used when clearing a suspended
render pass attachment.
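The batch-splitting rule can be sketched like this (illustrative only; the structs, the batch size, and the flush hook are assumptions, not the actual vkd3d-proton implementation):

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define MAX_BATCH 16

struct transition { unsigned int resource_id, subresource; };

struct barrier_batch
{
    struct transition pending[MAX_BATCH];
    size_t count;
    unsigned int flushes; /* number of times the batch was submitted */
};

/* Does the new transition touch a subresource already in the batch? */
static bool batch_overlaps(const struct barrier_batch *b,
        const struct transition *t)
{
    for (size_t i = 0; i < b->count; i++)
        if (b->pending[i].resource_id == t->resource_id &&
                b->pending[i].subresource == t->subresource)
            return true;
    return false;
}

static void batch_flush(struct barrier_batch *b)
{
    if (b->count)
    {
        /* A real implementation would emit one vkCmdPipelineBarrier here. */
        b->flushes++;
        b->count = 0;
    }
}

/* Batch transitions, but flush first on overlap so barriers touching
 * the same subresource execute in submission order. */
static void batch_add(struct barrier_batch *b, struct transition t)
{
    if (b->count == MAX_BATCH || batch_overlaps(b, &t))
        batch_flush(b);
    b->pending[b->count++] = t;
}
```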
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
Avoid using the separate layouts if we're only using formats with a
single aspect. This makes it more likely to match the common layout,
and we can avoid awkward transition barriers.
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>
Some games end up writing the wrong descriptor type when using null
descriptors, and to be robust against that, we have to clear out
all descriptors when creating null descriptors.
If we copy a null descriptor, we will also have to copy from all sets.
Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>