mirrors/vkd3d-proton

Commit Graph

Author	SHA1	Message	Date
Hans-Kristian Arntzen	7ee8eac818	vkd3d: Add allocation flag for DEDICATED. When allocating dedicated memory, ignore heap_flag requirements we deduce from memory info. Any memory type is allowed. This is important on NV when allocating fallback render targets. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-07 15:32:54 +02:00
Hans-Kristian Arntzen	cddb98acc6	vkd3d: Consider that we might attempt to free NULL memory. For deferred heaps, we will accept NULL allocations. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-07 15:32:54 +02:00
Hans-Kristian Arntzen	4075809a91	vkd3d: Make error message more precise when failing to allocate memory. There are situations where we cannot fallback to system memory, so don't log that we're going to do so. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-07 15:32:54 +02:00
Hans-Kristian Arntzen	9065f312d5	vkd3d: Refactor out validation of CUSTOM heap types. Don't attempt to enter memory allocation when we can invalidate a heap allocation up front. Avoids some dumb edge cases later. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-07 15:32:54 +02:00
Hans-Kristian Arntzen	9415191111	vkd3d: Add LOG_MEMORY_BUDGET logging for non-budget as well. Useful to be able to debug which allocations happen. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-07 15:32:54 +02:00
Joshua Ashton	c9ff20d4ac	vkd3d: Make a generic UE4 shader quirk collection Many UE4 games have this broken bloom shader that samples a texture with implicit lod in divergent control flow. Fixes Bus Simulator 21 Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-10-07 10:18:47 +01:00
Joshua Ashton	7a66669e92	vkd3d: Add empty element to shader quirks If we ever remove these, we need this for MSVC. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-10-07 10:18:47 +01:00
Joshua Ashton	d91d47d827	vkd3d: Use vkd3d_string_compare for shader quirks Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-10-07 10:18:47 +01:00
Joshua Ashton	70ee02bce0	vkd3d: Use vkd3d_string_compare for application overrides Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-10-07 10:18:47 +01:00
Hans-Kristian Arntzen	cd3d759b95	vkd3d: Enable VK_KHR_shader_integer_dot_product. Accelerates SM 6.4 packed ops if present. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-05 15:38:59 +02:00
Hans-Kristian Arntzen	af822939fb	vkd3d: Implement support for rendering to NULL/unbound RTV. Need to use fallback pipeline system here. Keep track of active masks for PSO and current render target. The intersection of those sets are the attachments which should be active in the render pass. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-30 16:50:02 +02:00
Hans-Kristian Arntzen	173b565ccf	vkd3d: Optimize DiscardResource when all subresources are discarded. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-29 14:17:31 +02:00
Hans-Kristian Arntzen	0b11fad67c	vkd3d: Allow discarding UAV resources. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-29 14:17:31 +02:00
Hans-Kristian Arntzen	6f0677eb2e	vkd3d: Refactor out queue flags -> stages conversion. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-29 14:17:31 +02:00
Hans-Kristian Arntzen	0c2ddb89cd	vkd3d: Add CONFIG for forced CACHED memory. Very useful for capturing. Speeds up a ton. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-27 14:48:26 +02:00
Hans-Kristian Arntzen	6863f1c6a8	vkd3d: Fix test suite regression on NV. Fix failure in test_create_heap where a TIER_2 host visible heap was attempted, but failed due to recent DEATHLOOP fixes. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-24 16:48:34 +02:00
Joshua Ashton	bde3ad8e01	vkd3d: Move ID3D12StateObject impl_froms to header Basic casts should not be function calls. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-09-23 12:12:13 +02:00
Joshua Ashton	cabc31fc4c	vkd3d: Move ID3D12Device impl_froms to header Basic casts should not be function calls.	2021-09-23 12:12:13 +02:00
Joshua Ashton	bfaf72386f	vkd3d: Move ID3D12CommandSignature impl_froms to header Basic casts should not be function calls. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-09-23 12:12:13 +02:00
Joshua Ashton	b84c3ff163	vkd3d: Move ID3D12PipelineState impl_froms to header Basic casts should not be function calls. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-09-23 12:12:13 +02:00
Joshua Ashton	7c993ae1a6	vkd3d: Move ID3D12RootSignature impl_froms to header Basic casts should not be function calls. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-09-23 12:12:13 +02:00
Joshua Ashton	875fbe5f50	vkd3d: Move ID3D12QueryHeap impl_froms to header Basic casts should not be function calls. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-09-23 12:12:13 +02:00
Joshua Ashton	2334c136e3	vkd3d: Move ID3D12DescriptorHeap impl_froms to header Basic casts should not be function calls. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-09-23 12:12:13 +02:00
Joshua Ashton	8d5308c9a1	vkd3d: Move ID3D12Resource impl_froms to header Basic casts should not be function calls. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-09-23 12:12:13 +02:00
Joshua Ashton	27e66b5c4a	vkd3d: Move ID3D12Heap impl_froms to header Basic casts should not be function calls. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-09-23 12:12:13 +02:00
Joshua Ashton	26d8011b06	vkd3d: Move ID3D12Fence impl_froms to header Basic casts should not be function calls. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-09-23 12:12:13 +02:00
Joshua Ashton	e597adb83a	vkd3d: Move d3d12_query_heap_type_get_data_size to header This should be inlined. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-09-23 12:12:13 +02:00
Joshua Ashton	3b3bd37f93	vkd3d: Avoid tracking + ending render passes when calling ResolveQueryData with 0 queries Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-09-23 12:12:13 +02:00
Conor McCarthy	446c7423ce	vkd3d: Return E_INVALIDARG for texture creation if SampleDesc.Count == 0. Windows returns E_INVALIDARG at least on AMD and Intel. Psychonaughts 2 seems to use this as a de facto "do not create" value, and reasonable vram usage depends on the call failing. Signed-off-by: Conor McCarthy <cmccarthy@codeweavers.com>	2021-09-23 11:00:04 +01:00
Conor McCarthy	d366ba47ac	Revert "vkd3d: Support SAMPLE_DESC.Count of 0" Windows returns E_INVALIDARG in this case. Signed-off-by: Conor McCarthy <cmccarthy@codeweavers.com>	2021-09-23 11:00:04 +01:00
Georg Lehmann	cf4fb44629	vkd3d: Remove almost unused variable. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com>	2021-09-21 11:22:34 +01:00
Georg Lehmann	edeb0658b7	vkd3d: Fix memory leak on failure. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com>	2021-09-21 11:22:34 +01:00
Georg Lehmann	0afa6732ad	vkd3d: Cleanup weird assignment. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com>	2021-09-21 11:22:34 +01:00
David McCloskey	a19619ccbf	vkd3d: Fixing compile errors on Windows.	2021-09-18 21:40:30 +01:00
Hans-Kristian Arntzen	173b8ecef0	vkd3d: Add workaround for DEATHLOOP. Game attempts to create a host visible resource with ALLOW_RENDER_TARGET flag. We cannot make this work on NVIDIA, but the game never seems to actually create an RTV, so as a workaround, nop out the flag, which does make it work after all :3 Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-17 14:21:09 +02:00
Hans-Kristian Arntzen	fa4d2182b1	vkd3d: Copy all aspects in CopyResource. Just like we're promoting layer count, also promote aspect mask. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-17 14:21:09 +02:00
Hans-Kristian Arntzen	e687d489ab	vkd3d: Validate blend state against output signature. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-16 16:57:28 +02:00
Hans-Kristian Arntzen	1d51818d8f	vkd3d: Fix compile error introduced by bad rebase. Somehow the rebase got really screwed up :\ Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-16 16:42:30 +02:00
Hans-Kristian Arntzen	a8f623e60d	vkd3d: Negate upload_hvv config. Enable resizable BAR style allocations by default, and add option to disable it. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-16 16:10:57 +02:00
Hans-Kristian Arntzen	12066a2b67	vkd3d: Add debug config to log resizable BAR allocations. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-16 16:10:57 +02:00
Hans-Kristian Arntzen	710fa98918	vkd3d: Setup resizable bar budget. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-16 16:10:57 +02:00
Hans-Kristian Arntzen	cec741706d	vkd3d: Refactor out memory topology queries. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-16 16:10:57 +02:00
Hans-Kristian Arntzen	abdaeb136d	vkd3d: Add a memory budget per memory type. For resizable BAR, we don't want to endlessly promote UPLOAD heaps to BAR since VRAM is precious. The aim is to set a fixed budget where we can keep allocating until full, at which point we fall back to plain HOST. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-16 16:10:57 +02:00
Hans-Kristian Arntzen	e0451bb541	vkd3d: Handle fallbacks properly in suballocator. With BAR budgets, what will happen is that - Small allocation is requested - A new chunk is requested - try_suballocate_memory will end up calling allocate_memory, which allocates a fallback memory type - Subsequent small allocators will always end up allocating a new fallback memory block, never reusing existing blocks. - System memory is rapidly exhausted once apps start hitting against budget. The fix is to add flags which explicitly do not attempt to fallback allocate. This makes it possible to handle fallbacks at the appropriate level in try_suballocate_memory instead. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-16 16:10:57 +02:00
Hans-Kristian Arntzen	cb94cfd10c	vkd3d: Fix silly typo in global mask. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-16 16:10:57 +02:00
Hans-Kristian Arntzen	426cdc9218	vkd3d: Destroy GLOBAL_BUFFER for some early error out paths. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-16 16:10:57 +02:00
Hans-Kristian Arntzen	69d4f55219	vkd3d: Refactor VkDeviceMemory allocation to keep track of type/size. We will need to consider some form of budgeting, so make sure that all allocation and freeing is done in a central place. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-16 16:10:57 +02:00
Hans-Kristian Arntzen	8d49d3e9ae	vkd3d: Add extra validation for mapping textures. D3D12 validation layers complain if you try to map mipmapped 3D volumes for ... some reason. The error is very explicit, so I assume it's intentional :) Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-16 15:35:57 +02:00
Hans-Kristian Arntzen	9fd422a0fd	vkd3d: Fix default layout check when using LINEAR tiled images. Match behavior of d3d12_resource_pick_layout. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-16 15:35:57 +02:00
Hans-Kristian Arntzen	41295eff6c	vkd3d: Consider CPU availibility when selecting memory types. Need to consider that based on host visibility requirements, we need to select either LINEAR or OPTIMAL image types, and those tiling modes can have different memory requirements. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-16 15:35:57 +02:00
Hans-Kristian Arntzen	132638be67	vkd3d: Add more logging when linear image allocation fails. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-16 15:35:57 +02:00
Hans-Kristian Arntzen	50f2c35b44	vkd3d: Add stricter ROW_MAJOR texture validation. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-16 15:35:57 +02:00
Hans-Kristian Arntzen	961fef84de	vkd3d: Allow map of texture as long as ppData is NULL. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-16 15:35:57 +02:00
Joshua Ashton	9c0fa91ca5	vkd3d: Add shader quirks for Psychonauts 2 Works around a game bug. It uses texture() inside divergent control flow. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-09-15 11:52:39 +02:00
Hans-Kristian Arntzen	3081887757	vkd3d: Add 12_2 to list of valid feature levels. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-14 21:18:29 +02:00
Hans-Kristian Arntzen	0e216b2b10	vkd3d: Narrow workaround for global pipeline cache. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-08 18:37:55 +02:00
Hans-Kristian Arntzen	11086a94e0	vkd3d: Add macros to parse/build NV driver versions. The bit offsets are a bit different from Vulkan API. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-08 18:37:55 +02:00
Hans-Kristian Arntzen	fcaeca8d27	vkd3d: Allow typeless depth-stencil formats without ALLOW_DEPTH_STENCIL. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-07 13:31:28 +02:00
Hans-Kristian Arntzen	403d1f9743	vkd3d: Workaround huge memory overhead for individual VkPipelineCaches. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-07 13:21:54 +02:00
Hans-Kristian Arntzen	a3267ba8e5	vkd3d: Fix copies between footprint and DS aspects. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-06 17:00:51 +02:00
Hans-Kristian Arntzen	fa1d82e141	vkd3d: Fix regressions when introducing null-copy elision. Need to initialize the set mask so that copies happen properly on default-initialized descriptors. Also, move the current_null_type to metadata so that it's properly copied on descriptor copy. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-03 12:24:26 +02:00
Rodrigo Locatti	b4cb5a37f8	vkd3d: Optimize repeated null descriptor updates There are titles clearing the same descriptors constantly. This leads to unnecessary updates that can become costly. This commit introduces a new flag to track when D3D12 descriptors are not null, and skips clearing them if they are already null. Descriptors are assumed to be null by default. This fixes a performance regression introduced by `9983a1720f` Signed-off-by: Rodrigo Locatti <rlocatti@nvidia.com>	2021-09-02 21:21:34 +02:00
Philip Rebohle	7fea3527ed	vkd3d: Remove deferred clears. Emitting render pass clears while we're in the process of starting a render pass overrides dsv layout tracking info. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-09-02 17:11:35 +02:00
Hans-Kristian Arntzen	ff74ad0ec5	vkd3d: Skip draw call if doing depth test on null DSV. D3D12 validation layer errors out, so unless we can prove that specific behavior is relied upon, we should be okay to just ignore. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-02 17:10:47 +02:00
Hans-Kristian Arntzen	b54a1a6c2b	vkd3d: Fix MSVC build. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-02 16:56:39 +02:00
Hans-Kristian Arntzen	00e4397467	vkd3d: Ignore depth/stencil test if DSVFormat does not have that aspect. Fix some validation errors in F1 2021. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-02 16:25:27 +02:00
Hans-Kristian Arntzen	bc9bd9c482	vkd3d: Fix member types in vkd3d_format. No need to use size_t. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-02 12:21:22 +02:00
Hans-Kristian Arntzen	7b67de7d0e	vkd3d: Generalize get_plane_footprints. Get information directly from vkd3d_format and allow for subsampled formats in the future. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-02 12:21:22 +02:00
Hans-Kristian Arntzen	3d5010555e	vkd3d: Add d3d12_resource_desc_get_sub_resource_count. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-02 12:21:22 +02:00
Hans-Kristian Arntzen	5c2376faf5	vkd3d: Handle multiplanar formats in GetCopyableFootprints. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-02 12:21:22 +02:00
Hans-Kristian Arntzen	c1f848ed3b	vkd3d: Only look at SourceRTAS when updating. Be more robust against garbage inputs. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-08-28 12:16:42 +02:00
rochaudhari	0828aec4f6	vkd3d: Implement new interfaces required for DX12 DLSS support. Adds ID3D12GraphicsCommandListExt and ID3D12DeviceExt interfaces. Signed-off-by: Roshan Chaudhari <rochaudhari@nvidia.com>	2021-08-27 11:37:15 +02:00
Joshua Ashton	e9f04e8e0e	vkd3d: Support SAMPLE_DESC.Count of 0 Psychonauts 2 uses a SAMPLE_DESC.Count of 0 for some things, which previously was forcing it down the MSAA alignment placement path. Found from playing a native D3D12 apitrace back and seeing the log spam. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-08-26 14:23:37 +02:00
Philip Rebohle	715eca1b95	vkd3d: Reimplement frame latency event as a semaphore. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-08-26 14:21:38 +02:00
Philip Rebohle	fef30f5037	vkd3d: Support releasing semaphores from a D3D12 fence. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-08-26 14:21:38 +02:00
Hans-Kristian Arntzen	f3fd2bf70b	vkd3d: Use BAR memory type for descriptor heap helpers. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-08-23 13:24:43 +02:00
Hans-Kristian Arntzen	7e165238e6	vkd3d: Allow all memory types if UPLOAD_HVV is used. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-08-23 13:24:43 +02:00
Joshua Ashton	1b957a1f74	vkd3d: Add config to use host-visible vram for UPLOAD heap Adds the "upload_hvv" config flag, which will make D3D12_HEAP_TYPE_UPLOAD attempt to use host-visible VRAM for allocations. This takes advantage of large or resizable BAR if available. I see a perf delta of 83-84 -> 92-94 (~12%) when using this in Horizon Zero Dawn. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-08-23 13:24:43 +02:00
Hans-Kristian Arntzen	05e31bfba9	vkd3d: Ensure we do not fallback device allocations to BAR. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-08-23 13:24:43 +02:00
Robin Kertels	76f37c3cbf	vkd3d: Only disable raster based on SO stream if SO is used. Signed-off-by: Robin Kertels <robin.kertels@gmail.com>	2021-08-23 13:10:14 +02:00
Hans-Kristian Arntzen	b2c99b035a	vkd3d: Allow SM 6.2 on NV. FloatControlProperties struct appears to be broken, and it does seem to work just fine. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-30 15:19:35 +00:00
Hans-Kristian Arntzen	093a8c49f3	vkd3d: Expose shader model 6.5. WaveMatch and WaveMultiPrefix are implemented and pass test. Other features are gated behind feature bits. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-29 20:42:32 +02:00
David McCloskey	a2a7d78c27	vkd3d: Fixing CopyTextureRegion going out of bounds when src_box is null. Signed-off-by: David McCloskey <davmcclo@gmail.com>	2021-07-29 17:28:52 +02:00
Hans-Kristian Arntzen	e1bb5f3b77	vkd3d: Handle NULL event handles in ID3D12Fence::SetEvent*(). We need to block here for whatever reason. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-29 17:21:20 +02:00
Hans-Kristian Arntzen	455f00fe26	vkd3d: Log failures when signaling external events. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-29 17:21:20 +02:00
Hans-Kristian Arntzen	435a087047	vkd3d: Rework how shader model versions are exposed. From native testing, we can expose higher shader models if cap bits features are not supported. E.g. Polaris exposes SM 6.5, even when 16-bit and barycentrics are not supported. With latest dxil-spirv updates we can support the required SM 6.4 features. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-28 15:28:19 +02:00
Hans-Kristian Arntzen	5b013d0b02	vkd3d: Validate shader meta against features. We're supposed to validate and fail compilation if certain features are not supported. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-28 15:28:19 +02:00
Hans-Kristian Arntzen	ab9e99cbfa	vkd3d: Check for Int16 capability as well as extended subgroup types when exposing 16-bit ops. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-28 15:28:19 +02:00
Joshua Ashton	1d23bdbab7	vkd3d: Don't store pointer to QA info when not building with QA This is entirely unnecessary and a waste of space as it will never be used. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-07-08 16:52:58 +02:00
Joshua Ashton	309fc817e8	vkd3d: Fix RT local root signature interface flags This was passing through flags of the root signature not the shader interface flags of it. Need to get the shader interface flags of the root signature instead. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-07-08 16:52:58 +02:00
Hans-Kristian Arntzen	29a9ccd356	vkd3d: Basic implementation of ResolveSubresourceRegion. Used by DIRT5. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-08 13:54:05 +02:00
Hans-Kristian Arntzen	f3c3e53f7a	vkd3d: Add resolve mode argument to resolve helper. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-08 13:54:05 +02:00
Hans-Kristian Arntzen	591d47a6c5	vkd3d: Refactor out ResolveSubresource. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-08 13:54:05 +02:00
Hans-Kristian Arntzen	37e8f42f4a	vkd3d: Move patch vertex count to meta struct. Will make it easier to implement for DXIL. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-07 15:58:45 +02:00
Hans-Kristian Arntzen	3915090c12	vkd3d: Track depth-stencil image layouts over a command buffer. Goal here is to avoid unnecessary image layout transitions when render passes toggle depth-stencil PSO states. Since we cannot know which states a resource is in, we have to be conservative, and assume that shader reads could happen. The best effort we can do is to detect when writes happen to a DSV resource. In this scenario, we can deduce that the aspect cannot be read, since DEPTH_WRITE \| RESOURCE state is not allowed. To make the tracking somewhat sane, we only promote to OPTIMAL if an entire image's worth of subresources for a given aspect is transitioned. The common case for depth-stencil images is 1 mip / 1 layer anyways. Some other changes are required here: - Instead of common_layout for the depth image, we need to consult the command list, which might promote the layout to optimal. - We make use of render pass compatibility rules which state that we can change attachment reference layouts as well as initial/finalLayout. To make this change, a pipeline will fill in a vkd3d_render_pass_compat struct. - A command list has a dsv_plane_optimal_mask which keeps track of the plane aspects we have promoted to OPTIMAL, and we know cannot be read by shaders. The desired optimal mask is (existing optimal \| PSO write). The initial existing optimal is inherited from the command list's tracker. - RTV/DSV/views no longer keep track of VkImageLayout. This is unnecessary since we always deduce image layout based on context. Overall, this shows a massive gain in HZD benchmark (RADV, 1440p ultimate, ~16% FPS on RX 6800). Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-07 15:45:46 +02:00
Hans-Kristian Arntzen	515ed7fbd1	vkd3d: Make sure memory is available before change image layout. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-07 15:45:46 +02:00
Hans-Kristian Arntzen	8f05ac298c	vkd3d: Add implementation for plane optimal tracker. Idea is to keep track of scenarios where we know a resource's aspect is known to be in a OPTIMAL state. Based on this, we can override the image layout from the common_layout in order to avoid unnecessary full barriers. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-07 15:45:46 +02:00
Hans-Kristian Arntzen	1288d0f9b1	vkd3d: Remove obsolete all_aspect parameter. For copies, we can always use the intended aspects, since we have separate DS layouts now. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-07 15:31:52 +02:00
Hans-Kristian Arntzen	68ce7bd324	vkd3d: Handle separate DS layout for destination copies. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-07 15:31:52 +02:00
Hans-Kristian Arntzen	81d472242b	vkd3d: Clear single depth-stencil aspect correctly. When clearing a DSV, we must get aliasing guarantees, so we must transition away from UNDEFINED. This is only possible when using separate_ds_layouts and for render pass clears we need to use renderpass2 mechanisms to do this. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-07 15:31:52 +02:00
Hans-Kristian Arntzen	35c555c479	vkd3d: Use more correct fallback path for minLODClamp. The clamp is absolute, not relative to baseMip. Also avoids validation error and potential crash when LODClamp > numLevels. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-07 12:50:23 +02:00
Joshua Ashton	61ccdb9037	vkd3d: Make invalid RTV for attachment FIXME_ONCE This spams constantly in Dirt 5. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-07-07 11:49:18 +02:00
Hans-Kristian Arntzen	cf632186fd	vkd3d: Add workaround for MinLODClamp. Not correct, will need spec additions to handle it properly. Fixes ground rendering in DIRT 5. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-06 16:45:19 +02:00
Hans-Kristian Arntzen	3090ae01c1	vkd3d: Support discarding single aspects as required. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-02 15:18:16 +02:00
Hans-Kristian Arntzen	398724cd6e	vkd3d: Require VK_KHR_separate_depth_stencil_layouts. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-02 15:18:16 +02:00
Hans-Kristian Arntzen	419790ac77	vkd3d: Add wave size workaround for GravityMark. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-02 15:15:42 +02:00
Hans-Kristian Arntzen	7a00e56792	vkd3d: Handle multiple planes in d3d12_resource_get_subresource_count. Separate out an explicit per_plane query for the cases where we need it. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-02 14:16:18 +02:00
rochaudhari	be2362268c	vkd3d: Return format2 information for d3d12_device_CheckFeatureSupport Currently only format1 information is being returned for D3D12_FORMAT_SUPPORT. Signed-off-by: Roshan Chaudhari <rochaudhari@nvidia.com>	2021-07-02 14:07:39 +02:00
Hans-Kristian Arntzen	3ea20a91ad	vkd3d: Handle zero viewports. This can be used for rasterizer discard, just bind dummy viewport and scissor. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-07-01 13:53:19 +02:00
Hans-Kristian Arntzen	cb5283b6fb	vkd3d: Allow dynamic vertex stride == 0 to go through. Eliminates all late pipeline compiles in Scarlet Nexus DX12 (and several other games). Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-29 16:00:33 +02:00
Hans-Kristian Arntzen	c1860a1ead	vkd3d: Add VKD3D_CONFIG flags for forcing EXCLUSIVE queue modes. Helps in some cases, but we cannot do this by default :( Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-29 12:24:24 +02:00
Joshua Ashton	5e3ec4337b	vkd3d: Fix top-most handling when restoring from fullscreen Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-06-25 17:28:35 +02:00
Hans-Kristian Arntzen	ba7c2b7c5f	swapchain: Log window rects for leaving and entering fullscreen. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-25 08:01:26 -07:00
Paul Gofman	ca2ae195fb	swapchain: Update original_window_rect in d3d12_swapchain_SetFullscreenState(). Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-25 08:01:26 -07:00
Hans-Kristian Arntzen	84f4b893ee	swapchain: Use VK_CALL macro. There's a mix and match of vk_procs-> and CALL conventions. Harmonize this. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-25 15:18:27 +02:00
Hans-Kristian Arntzen	b5023bab32	swapchain: Synchronize before resetting blit command buffer. Randomly appears in GravityMark, odd that validation didn't find this in other cases. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-25 15:18:27 +02:00
Hans-Kristian Arntzen	7c80c92304	vkd3d: Use ALLOW_VARYING_SUBGROUP_SIZE flag as appropriate. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-25 15:08:53 +02:00
Hans-Kristian Arntzen	27fdc39e67	vkd3d: Be more robust with out of bounds clear/discard rects. GravityBench ends up using ClearView with too large dimensions. This is a validation error in Vulkan, so just clamp the extents. To make full rect detection a bit more robust, do a range check instead of memcmp(). Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-24 16:18:38 +02:00
Georg Lehmann	a7922a7c85	vkd3d: Introduce vkd3d_internal_get_vk_format. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com>	2021-06-24 12:55:17 +02:00
Georg Lehmann	0d9c7bc3ad	vkd3d: Index formats by format. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com>	2021-06-24 12:55:17 +02:00
Georg Lehmann	c915f237e3	vkd3d: Index depth stencil formats by format. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com>	2021-06-24 12:55:17 +02:00
Georg Lehmann	1af017c284	include: Add some new dxgi formats. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com>	2021-06-24 12:55:17 +02:00
Hans-Kristian Arntzen	c108bec58f	vkd3d: Fix trivial indentation nit. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-22 14:41:09 +02:00
Hans-Kristian Arntzen	9900301886	vkd3d: Use read-write lock for fallback pipeline cache. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-22 14:41:09 +02:00
Hans-Kristian Arntzen	bb723e859b	vkd3d: Use read-write locks for render pass cache. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-22 14:41:09 +02:00
Hans-Kristian Arntzen	5fe135f3fb	vkd3d: Ensure shader visibility happens for DEPTH_READ \| RESOURCE scenarios. If we're doing a layout transition of depth-stencil aspects, we need to ensure all potential accesses are made visible. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-22 14:32:48 +02:00
Hans-Kristian Arntzen	8225edc726	vkd3d: Rewrite resource state implementation. - Honor resource barriers for resource states which cannot automatically decay or promote. This includes COLOR_ATTACHMENT, UNORDERED_ACCESS and VRS image. If SIMULTANEOUS_ACCESS is used, we can still promote, and we handle that by setting common layout to GENERAL for these resources. - Avoid redundant barriers in render passes since normal resource barriers will always make sure we are already in COLOR_ATTACHMENT_OPTIMAL. - Do not force GENERAL layout if resource has UNORDERED_ACCESS flag set. As this is not a promotable state, we have to explicitly transition into it. I tested this on validation layers, where even COMMON state refuses to promote to UAV state. The exception here of course is SIMULTANOUS_ACCESS, but we handle that properly now. - Verify that UAV or SIMULTANEOUS access is not used together with DSV state. This is explicitly banned in the API docs. - Actually emit image barriers. Batch the image transitions as that's what D3D12 docs encourage app developers to do, and it also expects that drivers can optimize this. Ensure that we respect the in-order resource barrier rules by splitting batches if there are overlaps in the transitions. - Ensure that correct image layout is used when clearing a suspended render pass attachment. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-22 14:32:48 +02:00
Hans-Kristian Arntzen	177679a766	vkd3d: Add VKD3D_RESOURCE_SIMULTANEOUS_ACCESS. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-22 14:32:48 +02:00
Hans-Kristian Arntzen	02398c4eef	vkd3d: Normalize depth-stencil layouts if only one aspect is used. Avoid using the separate layouts if we're only using formats with one aspects. This makes it more likely to match layouts with common layout, and we can avoid awkward transition barriers. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-22 14:32:48 +02:00
Philip Rebohle	014a3c0b94	vkd3d: Handle plane slice index in descriptor creation. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-06-21 21:23:03 +02:00
Samuel Pitoiset	72d9b322b8	vkd3d: reject creating a resource that is placed if the heap is too small The spec is pretty clear that it's invalid usage. Return E_INVALIDARG like native drivers. This is a workaround for the inventory GPU hang with Cyberpunk 2077 which is actually a game bug. Luckily the game handles this error properly. The problem is that the game always assume that an image with 2 mips is smaller than the same image but with 6 mips. This is not always true if the swizzle mode is different and a recent Mesa update changed that. Then the game creates a D3D12 heap that is too small and this triggered a memory violation and then a GPU hang with RADV. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2021-06-17 16:42:23 +02:00
Hans-Kristian Arntzen	1ea31701c5	vkd3d: Move F1 2020 workaround over to quirks system. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-17 16:42:14 +02:00
Hans-Kristian Arntzen	28c8a595fa	vkd3d: Pass down shader quirks for Necromunda. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-17 16:42:14 +02:00
Hans-Kristian Arntzen	9207d4f019	vkd3d: Ignore BlendEnable if write mask is 0. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-17 16:42:14 +02:00
Hans-Kristian Arntzen	5c971f216e	vkd3d: Invalidate binding state on query resolve. Fixes random broken AO in Necromunda on RADV. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-17 15:59:05 +02:00
Philip Rebohle	b97a012787	vkd3d: Enable tiled resources tier 3. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-06-14 15:53:33 +02:00
Hans-Kristian Arntzen	42fb018d85	vkd3d: Fix leak of command pools on device destruction. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-11 15:17:45 +02:00
Hans-Kristian Arntzen	d7843fa012	vkd3d: Fix potential deadlock in debug ring. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-11 11:04:38 +02:00
Hans-Kristian Arntzen	58854b0a9c	vkd3d: Fix potential deadlock in descriptor QA checks. If we destroy device right after creating it, we risk a deadlock. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-11 11:04:38 +02:00
Hans-Kristian Arntzen	76a8914d6b	vkd3d: Add validation error workaround. Our internal copy shaders are fine, but we get benign errors about sample count being wrong since we alias descriptors. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-10 14:19:04 +02:00
Hans-Kristian Arntzen	abe0995e88	vkd3d: Use correct allocation size for memory block. We cannot use the memory requirement output, since we will zero-clear memory with a size that might be larger than the VkBuffer size. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-10 14:16:01 +02:00
Hans-Kristian Arntzen	b922292852	vkd3d: Fix view object leak when creating fallback UAV clear view. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-10 13:50:54 +02:00
rochaudhari	1699743c37	vkd3d: Enable binary import and image view handle extensions Signed-off-by: Roshan Chaudhari <rochaudhari@nvidia.com> Reviewed-by: Liam Middlebrook <lmiddlebrook@nvidia.com>	2021-06-10 11:26:34 +02:00
Hans-Kristian Arntzen	9983a1720f	vkd3d: Splat null descriptors to all sets. Some games end up writing the wrong descriptor type when using null descriptors, and to be robust against that, we have to clear out all descriptors when creating null descriptors. If we copy a null descriptor, we will also have to copy from all sets. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-07 13:21:31 +02:00
Hans-Kristian Arntzen	969776c1f8	vkd3d: Ignore NULL descriptor ClearUAV. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-07 13:21:31 +02:00
Hans-Kristian Arntzen	c7c17d05ed	vkd3d: Fix descriptor QA checks for CBV_AS_SSBO. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-07 13:21:06 +02:00
Hans-Kristian Arntzen	ec5b4ccecf	vkd3d: Ensure that swapchain is eventually recreated. Latch SUBOPTIMAL state. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-02 19:46:05 +02:00
Joshua Ashton	efa0eccc59	vkd3d: Low latency presentation and acquire semaphores In cases where acquire image is blocking, we should call that after presentation to avoid latency when the app calls present. This avoids weird inverse frame cadences with Mesa WSI right now, as acquiring an image is always a blocking call until it is complete. In cases when we aren't blocking, this kicks off the acquisition so it can be waited upon by the next present blit pass. Use another set of semaphores to wait for the image acquisition on the GPU. In the non-blocking vkAcquireNextImageKHR case, this means that a potential bubble of time between waiting on the fence and submitting the blit + presentation is eliminated. Runaway presentation in this setup is avoided by frame latency objects and normal frame latency which is always 3 according to documentation. Be careful about handling SUBOPTIMAL. Semaphores will be signaled, but we might want to tear down the swapchain. In these cases, we need to wait for the semaphore to be signaled first, which can only be done by submitting a wait, since QueueWaitIdle or DeviceWaitIdle don't cover WSI. Signed-off-by: Joshua Ashton <joshua@froggi.es> Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no> Co-authored-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-02 19:46:05 +02:00
Joshua Ashton	92ed98ccea	vkd3d: Handle frame latency without WAITABLE_OBJECT Documentation says that this should always be 3 without WAITABLE_OBJECT unlike in D3D11 where it will use the DXGI device's frame latency. This stops runaway presentations in the non-blocking acquire image case with the new semaphore setup. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-06-02 19:46:05 +02:00
Hans-Kristian Arntzen	6f5f55c84a	vkd3d: Avoid oldSwapchain. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-06-02 19:46:05 +02:00

1 2 3 4 5 ...

1912 Commits