mirrors/vkd3d-proton

Commit Graph

Author	SHA1	Message	Date
Hans-Kristian Arntzen	365dd05557	vkd3d: Add breadcrumbs support. AMD path for this commit. Idea is that we can automatically instrument markers with command list information we can make some sense of in vkd3d-proton. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-11 13:07:56 +01:00
Hans-Kristian Arntzen	5017b3723c	vkd3d: Enable VK_AMD_device_coherent_memory. For breadcrumbs support, along with buffer marker. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-11 13:07:56 +01:00
Hans-Kristian Arntzen	6a4f2842cb	cache: Move d3d12_pipeline_library to internal references. Allow us to hold internal magic pipeline libraries without creating cycles. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-11 12:29:32 +01:00
Hans-Kristian Arntzen	18a5315db4	cache: Refactor lock strategy of internal hashmaps. Rather than having to take writer lock on serialize calls from the outside, we should just take locks when accessing the internal hashmaps instead. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-11 12:29:32 +01:00
Hans-Kristian Arntzen	7c228139c3	cache: Refactor out pipeline library serialization. If outer code has taken a reader lock, we don't need to lock again. Also allows a reader lock to go GetSerializedSize + Serialize with one reader lock. This will be relevant for magic cache implementation. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-11 12:29:32 +01:00
Hans-Kristian Arntzen	30b4abcea1	vkd3d: Do not discard images in Clear*View() unless we have to. It's redundant to add an UNDEFINED transition here for committed resources. We need it for sparse and placed resources to handle aliasing rules, but that's it. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-10 15:14:55 +01:00
Hans-Kristian Arntzen	17b1ffb41a	vkd3d: Add path to use GENERAL depth-stencil images. On some implementations, it doesn't matter for performance what we use, and we can avoid a lot of ugly barriers this way. Opt-in to use this extensions on GPUs we know handles it well, otherwise, keep using the tracking paths. With VK_KHR_dynamic_rendering, this is now feasible to do since we no longer have to deal with shenanigans related to VkRenderPass layouts and complicated compatibility rules. To make this work with the existing framework, just need to consider that GENERAL can be a common layout alongside DEPTH_STENCIL_OPTIMAL, which are both common layouts that do not need to be tracked at all. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-10 15:14:55 +01:00
Hans-Kristian Arntzen	f9da3bf564	vkd3d: Add VK_KHR_driver_properties. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-10 15:14:55 +01:00
Hans-Kristian Arntzen	c6149b47cd	cache: Handle ref-count rules for multiple LoadPipeline/StorePipeline. In pipeline libraries, the library holds on to private references of the libraries so that they can be rapidly loaded on-demand. This behavior is verifed by API tests. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-09 18:35:09 +01:00
Hans-Kristian Arntzen	cc08339624	vkd3d: Use internal_refcounts for pipeline state. When we store pipeline state in libraries we have to manage lifetime a bit differently, which requires internal refcounts of some sort. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-09 18:35:09 +01:00
Hans-Kristian Arntzen	422f6804fb	vkd3d: Enable VK_KHR_create_renderpass2. Required extension by VK_KHR_fragment_shading_rate and VK_KHR_separate_depth_stencil_layouts, but we don't care about enabling any features or use it directly. Needed to silence validation errors. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-09 16:35:05 +01:00
Georg Lehmann	14a06680d9	vkd3d: Remove unused renderpass remains. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com>	2022-03-08 18:34:18 +01:00
Hans-Kristian Arntzen	409dc57645	vkd3d: Properly decay depth-stencil images. When performing a decay of a DSV resource, make sure to transition all subresources, not just the particular aspect being transitioned. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-08 18:11:50 +01:00
Hans-Kristian Arntzen	b330900659	vkd3d: Do not transition all aspects for single subresource. We require separate DS layouts. Fixes validation errors where we transition from read-only, but our neighbor aspect might have been optimal. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-08 18:11:50 +01:00
Philip Rebohle	9a408367dc	vkd3d: Remove render pass cache. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-08 17:44:47 +01:00
Philip Rebohle	51e6b2bbbe	vkd3d: Remove render pass from command list state. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-08 17:44:47 +01:00
Philip Rebohle	94f82d1085	vkd3d: Get rid of pipeline variant flags. These only existed for VRS attachment, which is no longer necessary with VK_KHR_dynamic_rendering. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-08 17:44:47 +01:00
Philip Rebohle	1a68267962	vkd3d: Remove framebuffer list from d3d12_command_allocator. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-08 17:44:47 +01:00
Philip Rebohle	c4f88951fc	vkd3d: Use dynamic rendering for regular draw calls. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-08 17:44:47 +01:00
Philip Rebohle	9673ac173d	vkd3d: Use dynamic rendering for pipeline creation. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-08 17:44:47 +01:00
Philip Rebohle	3783eaf4f7	vkd3d: Implement swap chain blits using dynamic rendering. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-08 17:44:47 +01:00
Philip Rebohle	024ef02f9b	vkd3d: Implement meta image copies using dynamic rendering. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-08 17:44:47 +01:00
Philip Rebohle	549d4ee63f	vkd3d: Remove render pass list from d3d12_command_allocator. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-08 17:44:47 +01:00
Philip Rebohle	6186cc1f0e	vkd3d: Implement clears using dynamic rendering. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-08 17:44:47 +01:00
Philip Rebohle	2c92ab7d1e	vkd3d: Enable and require VK_KHR_dynamic_rendering. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-08 17:44:47 +01:00
Hans-Kristian Arntzen	9fbae668fe	vkd3d: Ensure that all SPIR-V modules are properly cached. When we require inter-stage fixups, we need a solution for partial validity of the cache. Accept the modules all or nothing. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-08 16:43:30 +01:00
Hans-Kristian Arntzen	ce45297695	vkd3d: Enable debug_utils if vk_debug is enabled. Allows debug callbacks to go through in Wine. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-08 16:40:51 +01:00
LemiSt24	c411d0d0c2	vkd3d: Add case for D3D12_STATE_SUBOBJECT_TYPE_GLOBAL_ROOT_SIGNATURE Signed-off-by: LemiSt24 <lennard.strohmeyer@gmail.com>	2022-03-07 16:15:22 +01:00
Hans-Kristian Arntzen	9a63df07b8	vkd3d: Add punchthrough path for descriptor copies. Proves out the viability of this style of implementation. Ideally we'd have a more officially sanctioned way of doing similar things later :) Unfortunately, the overhead removal is too great to ignore on target platform. Makes use of a private (reserved) extension for now ... Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-04 13:34:18 +01:00
Mike Blumenkrantz	1d76803aff	vkd3d: optimize memory access pattern for sampler descriptors this removes them from the bitscan path Signed-off-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com>	2022-03-01 22:50:45 +01:00
Hans-Kristian Arntzen	dc622fc715	vkd3d: Recycle command pools in Elden Ring. Very churny. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-25 18:40:52 +01:00
Hans-Kristian Arntzen	9817c52d24	vkd3d: Add workaround to ignore mismatch driver/device in PSO library. Elden Ring does not detect the proper error code and create a new pipeline library. Instead, create a fresh new library, which works around the issue. The game has a pattern of LoadPipeline -> if fail -> CreatePSO -> StorePipeline. Sometimes, in the same process it will LoadLibrary from its own cache (could explain some stutters), so it's very useful to have this either way. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-25 14:50:57 +01:00
Hans-Kristian Arntzen	a8229390f9	vkd3d: Add more pipeline_library_log snippets. Hook GetCachedBlob and various attempts to use LoadPipeline. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-25 14:50:57 +01:00
Hans-Kristian Arntzen	12c73ee18a	swapchain: More gracefully handle SURFACE_LOST. Just like handling min/maxImageExtent of 0, we can just fall back to user buffers. Elden Ring hits this case on application teardown. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-25 14:04:06 +01:00
Hans-Kristian Arntzen	f39ece9a7c	vkd3d: Enable performance workarounds for Elden Ring. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-25 13:59:08 +01:00
Hans-Kristian Arntzen	c19eaac376	vkd3d: Add VKD3D_CONFIG option for command pool recycling. Normal behaving apps should not benefit from any of this. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-25 13:59:08 +01:00
Hans-Kristian Arntzen	54fbadcc94	vkd3d: Recycle command pools. Elden Ring in particular spam frees and allocates command pools despite this being a very bad idea. Add a simple 8-entry cache which seems to take care of it. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-25 13:59:08 +01:00
Hans-Kristian Arntzen	4b07535909	vkd3d: Optimize memory access pattern for single descriptor copies. We can mark a descriptor as being SINGLE_DESCRIPTOR, which means we only need one descriptor copy. This way, we can avoid doing somewhat expensive work (every nanosecond counts here): - Bitscan loop - Read deep into d3d12_device guts (often a cache miss). The memory index depends on the bitscan, which causes bubble. When we have a single descriptor, we can just store the binding information inline and avoid this jank. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-25 13:04:43 +01:00
Hans-Kristian Arntzen	84d632f194	vkd3d: Rewrite memory layout for resource descriptors. Tune memory layout so that we can deduce various information without making a single pointer dereference: - d3d12_descriptor_heap* - heap offset - Pointer to various side data structures we need to keep around. Instead of having one big 64 byte data structure with tons of padding, tune it down to 32 + 8 bytes per descriptor of extra dummy data. To make all of this work, use a somewhat clever encoding scheme for CPU VA where lower bits store number of active bits used to encode descriptor offset. From there, we can mask away bits to recover d3d12_descriptor_heap. Metadata is stored inline in one big allocation, and we can just offset from there based on extracted log2i_ceil(descriptor count). Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-25 13:04:43 +01:00
Hans-Kristian Arntzen	b309913b6d	vkd3d: Use unsafe_impl in CopyDescriptorsSimple. This is an ultra-hot path and seems to show up somehow on profile. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-25 13:04:43 +01:00
Hans-Kristian Arntzen	c29d005ef4	vkd3d: Don't enable fast descriptor copy path for descriptor QA. The hooks are in the generic function. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-24 16:42:00 +01:00
Hans-Kristian Arntzen	8a46c21254	vkd3d: Add VKD3D_CONFIG to skip memory allocator clears. For cases where games spam committed allocations and don't use NOT_ZEROED. We still rely on zerovram behavior for initial backing which should be enough in most cases. Strictly speaking however, we are forced to clear the allocations every time if application does not use the flag correctly. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-24 12:52:05 +01:00
Hans-Kristian Arntzen	76ca492a39	vkd3d: Add some debug logging for when clear passes happen. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-24 12:52:05 +01:00
Hans-Kristian Arntzen	83c4e62660	vkd3d: Bump suballocation limit to 2 MiB. This is a more principled limit since that's the huge page size. Avoids some allocation spam. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-24 12:14:22 +01:00
Hans-Kristian Arntzen	4bea653504	vkd3d: Fix CopyTiles for suballocated linear resources. Forgot to offset buffer offset. Fun! Found when bumping VA allocation limit to 2 MiB instead of 1 MiB. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-24 12:14:22 +01:00
Hans-Kristian Arntzen	edbf49aad4	vkd3d: Support opt-in to single MUTABLE set. Useful for Intel since Intel hardware cannot support more than 1M descriptors in general, and opting in to correct behavior should improve CPU overhead as well when copying descriptors. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-21 17:08:25 +01:00
Hans-Kristian Arntzen	e0af8f2810	vkd3d: Make error message for buffer alignment more direct. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-21 16:37:12 +01:00
Hans-Kristian Arntzen	b066e72243	swapchain: Add env-var to override swapchain images. For perf debug mostly. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-21 16:36:36 +01:00
Hans-Kristian Arntzen	15704b2419	vkd3d: Optimize descriptor copies for common code paths. The common path that we really need to optimize for is CBV_SRV_UAV + Simple + 1 descriptor. Descriptor benchmark shows an almost 50% reduction in overhead now. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-21 16:35:36 +01:00
Hans-Kristian Arntzen	c725c29bb6	vkd3d: Inline query for set/binding from set_index. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-21 16:35:36 +01:00
Hans-Kristian Arntzen	2f6a91e772	vkd3d: De-virtualize query for descriptor size. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-21 16:35:36 +01:00
Hans-Kristian Arntzen	1cc8afcc8e	vkd3d: Fix potential crashes when VK_KHR_dynamic_rendering is added. Checking for pNext here is too brittle and causes crashes when dynamic rendering path is added. Also need to chain in existing pNexts. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-17 11:27:25 +01:00
Hans-Kristian Arntzen	5d345f47cc	vkd3d: Rewrite the pipeline library implementation. This became basically a rewrite in the end, and it got too awkward to split these commits in any meaningful way. The goals here were primarily to: - Support serializing SPIR-V and load SPIR-V. To do this robustly requires a lot more validation and checks to make sure end up compiling the same SPIR-V that we load from cache. This is critical for performance when games have primed their pipeline libraries and expect that loading a PSO should be fast. Without this, we will hit vkd3d-shader for every PSO, causing very long load times. - Implement the required validation for mismatched PSO descriptions. - Rewrite the binary layout of the pipeline library for flexibility concerns and performance. If the pipeline library is mmap-ed from disk - which appears to be the intended use - we only need to scan through the TOC to fully parse the library contents. From a flexibility concern, a blob needs to support inlined data, but a library can use referential links. We introduce separate hashmaps which store deduplicated SPIR-V and pipeline cache blobs, which significantly drop memory and storage requirements. For future improvements, it should be fairly easy to add information which lets us avoid SPIR-V or pipeline cache data altogether if relevant changes to Vulkan/drivers are made. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-17 11:00:03 +01:00
Hans-Kristian Arntzen	33f17cc74d	vkd3d: Add VK_EXT_pipeline_creation_feedback. Useful when used together with pipeline library logging. Confirms that we can load pipeline caches as expected. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-04 14:31:34 +01:00
Hans-Kristian Arntzen	47337d5e0b	vkd3d: Add VKD3D_CONFIG flags for various pipeline library logging. Additionally, add option to ignore cached SPIR-V. Will be useful for debugging, and also required for VKD3D_SHADER_OVERRIDE. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-04 14:31:34 +01:00
Hans-Kristian Arntzen	f03940ef4b	vkd3d: Add global_pipeline_cache option. Avoids saving out pipeline cache blobs which are likely going to be cached by on-disk cache anyways. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-04 14:31:34 +01:00
Hans-Kristian Arntzen	e5e662ce22	vkd3d: Record root signature compatibility hashes. For pipeline libraries and DXR to some extent later, we'll need an easy way to compare root signature objects. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-04 14:31:34 +01:00
Hans-Kristian Arntzen	1d39c25a59	vkd3d: Properly invalidate pipeline when binding NULL DSV. We did not test the scenario where we first render with depth enabled, and then bind a NULL DSV with the same pipeline. Also fix issues if we bind NULL RTVs with same pipeline bound. Fixes crash in Guardians of the Galaxy. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-04 13:10:16 +01:00
Hans-Kristian Arntzen	5e526d506b	vkd3d: Remove warning for setting NULL index buffer. This is benign and easily gets spammed a TON. We will warn if an indexed draw is actually made like this. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-03 18:16:36 +01:00
Hans-Kristian Arntzen	81a215d0bf	vkd3d: Implement COLOR -> STENCIL copy if stencil export is supported. Fallback is a bit more involved. Cleans up the FIXME to not report benign issues. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-03 15:43:41 +01:00
Hans-Kristian Arntzen	29d956c6c4	vkd3d: Fix memory leak of D3D12 device singleton. Fairly trivial, caught by ASAN. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-02 13:56:36 +01:00
Hans-Kristian Arntzen	49d0eb37e3	vkd3d: Properly align d3d12_command_list allocations. UBSAN found a bug here since we store RTV descriptors inline, the compiler can assume the pointer is 64 byte aligned. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-02-02 13:56:36 +01:00
Philip Rebohle	8f81aaa710	vkd3d: Fix reporting of WriteBufferImmediateSupportFlags. Oversight from when we added bundles. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-02-01 16:21:43 +01:00
Hans-Kristian Arntzen	833f56154c	cache: Store shader interface key in pipeline library as well. If we're going to create different SPIR-V files from what the VkPipelineCache represents, it's meaningless to load it. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-01-25 14:07:07 +01:00
Hans-Kristian Arntzen	86f8f41490	vkd3d: Compute a global shader interface key for a D3D12 device. This key represents the variations of SPIR-V which would be generated from otherwise identical inputs like DXBC blobs and root signatures. Typically, changing VKD3D_CONFIG flags or enabled extensions will affect this key. This ensures that we will not attempt to use a cached SPIR-V file unless we can trust that the SPIR-V interface will match. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-01-25 14:07:07 +01:00
Hans-Kristian Arntzen	a3f1a0e3cd	vkd3d-shader: Add mechanism to get vkd3d-shader implementation revision. Not immediately useful, might be nuked later in development. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-01-25 14:07:07 +01:00
Hans-Kristian Arntzen	e90b573896	vkd3d-shader: Use flag for vkd3d_shader_meta bools. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-01-25 14:07:07 +01:00
Hans-Kristian Arntzen	8196b85408	vkd3d-shader: Make vkd3d_shader_hash public. Prepare for meta struct to be serialized to a cache. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-01-25 14:07:07 +01:00
Hans-Kristian Arntzen	a2c1527acd	vkd3d-shader: Reuse hashmap.h hasher for shader hash. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-01-25 14:07:07 +01:00
Hans-Kristian Arntzen	6e697a54b6	vkd3d: Add d3d12_cached_pipeline_state. Wraps the D3D12 struct with a pipeline library handle. This is needed if the blob contains references to external data, which then needs to be resolved. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-01-25 14:07:07 +01:00
Hans-Kristian Arntzen	41c977d616	cache: Move cache implementation over to read-writer locks. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-01-25 14:07:07 +01:00
Hans-Kristian Arntzen	1409ebab1f	vkd3d: Consider sparse buffers to alias any other buffer. Technically cannot alias committed buffers, but 🤷 ... Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-01-20 15:14:27 +01:00
Hans-Kristian Arntzen	7d0743345a	vkd3d: Remove useless buffer barrier tracking. This copy is to a scratch buffer, which needs no tracking. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-01-20 15:14:27 +01:00
Philip Rebohle	1af62abfe7	vkd3d: Enable quirk for further UE4 shaders. Fixes artifacts in The Ascent. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-01-19 16:49:42 +01:00
Hans-Kristian Arntzen	5c492e9e6c	vkd3d: Handle overlapped transfer writes. D3D12 expects drivers to implicitly synchronize transfer operations, since there is no TRANSFER barrier ala UAV barriers. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-01-19 14:44:33 +01:00
Hans-Kristian Arntzen	68ce4b4116	vkd3d: MSVC build fix. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-01-19 14:21:09 +01:00
Hans-Kristian Arntzen	6cba8b9945	vkd3d: Workaround broken barriers in DEATHLOOP. In DEATHLOOP, there is a render pass which renders out a simple image, which is then directly followed by a compute dispatch, reading that image. The image is still in RENDER_TARGET state, and color buffers are not flushed properly on at least RADV, manifesting as a very distracting glitch pattern. This is a game bug, but for the time being, we have to workaround it, sigh. For a simple workaround, we can detect patterns where we see these events in succession: - Color RT is started - StateBefore == RENDER_TARGET is not observed - Dispatch() In particular, when entering the options menu, highly distracting glitches are observed in the background. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-01-12 12:20:03 +01:00
Robin Kertels	35be1329ed	vkd3d: Don't do layout transition in aliasing barrier. HZD issues an aliasing barrier for an alias of a resource that it still needs. Because D3D12 requires you to call DiscardResource or a full resource clear/copy, we can just rely on those to do the actual image layout transition and treat the aliasing barrier as a pure sync + flush. This behavior is also observed in a test case where D3D12 drivers do not seem to discard / fast-clear anything in an aliasing barrier. Signed-off-by: Robin Kertels <robin.kertels@gmail.com> Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no> Co-authored-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-01-12 12:16:52 +01:00
Samuel Pitoiset	f6a4e0fb71	vkd3d: Use VK_KHR_copy_commands2 Mesa RADV translates these legacy entrypoints to the 2 variants. Using them directly will cost a bit less CPU cycles. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2022-01-12 12:06:06 +01:00
Samuel Pitoiset	f6fe3e0183	vkd3d: Require VK_KHR_copy_commands2 This extension is trivial to implement for vendors and should be widely supported. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2022-01-12 12:06:06 +01:00
Samuel Pitoiset	870dda927d	vkd3d: Use VK_KHR_bind_memory2 Mesa RADV translates these legacy entrypoints to the 2 variants. Using them directly will cost a bit less CPU cycles. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2022-01-12 12:06:06 +01:00
Samuel Pitoiset	b42a7193fc	vkd3d: Require VK_KHR_bind_memory2 This extension is trivial to implement for vendors and should be widely supported. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2022-01-12 12:06:06 +01:00
Philip Rebohle	5923c53111	vkd3d: Only use VK_IMAGE_CREATE_EXTENDED_USAGE_BIT if necessary. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-01-11 15:14:30 +01:00
Philip Rebohle	1354ecabb4	vkd3d: Consider query pool when merging query ranges. Otherwise, we accidentally merge ranges from different pools if the indices happen to align. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-01-06 14:27:36 +01:00
Hans-Kristian Arntzen	c0a3fa8adc	vkd3d: Attempt to create linear image without EXTENDED_USAGE. NVIDIA drivers apparently cannot support EXTENDED_USAGE linear images for whatever reason, so attempt to create these images without the creation flag. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-12-03 12:47:09 +01:00
Hans-Kristian Arntzen	459cae5673	vkd3d: Fix redundant return from void. Fix MSVC warning. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-12-02 22:48:48 +01:00
Hans-Kristian Arntzen	7502b4c4c8	vkd3d: Fix MSVC build. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-12-02 22:48:48 +01:00
Hans-Kristian Arntzen	fffd6e935c	vkd3d: Add R64_UINT to format compatibility list when needed. For 64-bit image atomics, we should at the very least add 64-bit format to compatibility list to avoid potential problems. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-12-02 22:40:32 +01:00
Hans-Kristian Arntzen	72f26c5699	vkd3d: Remove misleading FIXME. We can bind texel buffers at scalar alignment now. The warning is misleading for placed resources, since 64k never aligns with a float3. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-12-02 22:40:21 +01:00
Arkadiusz Hiler	93d105adae	vkd3d: Retry to create Vk device without NVX extensions. The creation with those extensions may fail in few cases: * older 32 bit drivers * missing or inaccessible /dev/nvidia-uvm There's also a mysterious crash that some Debian users experience with 64bit titles and a correct /dev/nvidia-uvm. Signed-off-by: Arkadiusz Hiler <ahiler@codeweavers.com>	2021-12-02 12:44:37 +01:00
Hans-Kristian Arntzen	d2fd3de7c1	vkd3d: Handle somewhat common VkResult. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-11-26 20:02:14 +01:00
Hans-Kristian Arntzen	d9636d5c67	vkd3d: Fix check for vkBindImageMemory. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-11-26 20:02:14 +01:00
Hans-Kristian Arntzen	9a59ded1c4	vkd3d: Simplify MinLod setup. Only bother if we actually need to clamp LOD. Simplifies some clamping logic as well. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-11-26 16:51:18 +01:00
Philip Rebohle	a99914b6ea	vkd3d: Fix clear color swizzle for various UAV formats. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-11-26 16:51:01 +01:00
Philip Rebohle	4000397570	vkd3d: Remove legacy format compatibility info. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-11-26 16:51:01 +01:00
Philip Rebohle	0de25ac3cd	vkd3d: Do not use vkd3d_find_uint_format in ClearUAV. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-11-26 16:51:01 +01:00
Philip Rebohle	ab111dcdbe	vkd3d: Don't use vkd3d_get_typeless_format to determine shader copy usage. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-11-26 16:51:01 +01:00
Philip Rebohle	99d949f5fb	vkd3d: Fix enablement of MUTABLE_FORMAT_BIT and EXTENDED_USAGE_BIT. We previously did not take into account the new relaxed format compatibility rules that we allow with CastingFullyTypedFormatSupported being supported. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-11-26 16:51:01 +01:00
Philip Rebohle	9624102dcb	vkd3d: Rework format compatibility lists. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-11-26 16:51:01 +01:00
Philip Rebohle	42b8fc3338	vkd3d: Introduce new format compatibility table. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-11-26 16:51:01 +01:00
Georg Lehmann	4240ab7559	vkd3d: Allow B8G8R8A8 UAVs. This is now allowed according to https://microsoft.github.io/DirectX-Specs/d3d/RelaxedCasting.html Signed-off-by: Georg Lehmann <dadschoorse@gmail.com>	2021-11-24 15:15:14 +01:00
Hans-Kristian Arntzen	7391e38602	vkd3d: Fix some type errors after idl update. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-11-22 16:25:13 +01:00
Philip Rebohle	9185edb42a	vkd3d: Implement ID3D12GraphicsCommandList6. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-11-19 14:57:51 +01:00
Philip Rebohle	b03c1fcb5f	vkd3d: Implement ID3D12Device9. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-11-19 14:57:51 +01:00
Philip Rebohle	3b6a4ab988	vkd3d: Implement ID3D12Device8 and ID3D12Resource2. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-11-19 14:57:51 +01:00
Philip Rebohle	d61f562a3e	vkd3d: Implement ID3D12Device7. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-11-19 14:57:51 +01:00
Joshua Ashton	046524f2a1	vkd3d: Implement MinLODClamp using VK_EXT_image_view_min_lod Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-11-17 20:51:20 +01:00
Robin Kertels	19a1dce393	vkd3d: Set GetCopyableFootprints total_bytes late. Halo Infinite uses &desc->Width for total_bytes. We can't set total_bytes early because code after this relies on desc->Width. Signed-off-by: Robin Kertels <robin.kertels@gmail.com>	2021-11-16 11:53:18 +01:00
Hans-Kristian Arntzen	3fefc540c8	vkd3d: Handle 64KB_UNDEFINED_SWIZZLE. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-11-12 10:32:13 +01:00
Hans-Kristian Arntzen	3937e1a298	vkd3d: Handle illegal rendering to NULL DSV. Guardians of the Galaxy hits this case. Fallback is to disable depth attachment entirely in a fallback pipeline. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-11-09 15:56:45 +01:00
Hans-Kristian Arntzen	45ae742526	vkd3d: Pretend that SSBO alignment on NV is 4 bytes. The 16-byte requirement is kind of a lie. The real requirement is tied to how vectorized load-store instructions are emitted in the shader itself since I guess it allows compiler to assume something about alignment of the base pointer. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-11-09 14:47:10 +01:00
Hans-Kristian Arntzen	b53a4a98a6	vkd3d: Enable per component robustness on AMD. Tested and verified to work as expected, not so much on NV. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-11-09 14:47:10 +01:00
Hans-Kristian Arntzen	3210832ad9	vkd3d: Enable VK_EXT_scalar_block_layout. dxil-spirv can take advantage of this now. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-11-09 14:47:10 +01:00
Hans-Kristian Arntzen	58aab78a5b	vkd3d-shader: Add PER_COMPONENT_ROBUSTNESS shader extension. Signals that we can use vectorized vec3 byte address buffers. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-11-09 14:47:10 +01:00
Hans-Kristian Arntzen	e605d19ef7	vkd3d-shader: Add shader extension for scalar block layout. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-11-09 14:47:10 +01:00
Hans-Kristian Arntzen	35d2f1e87f	vkd3d: Correctly check for SM 6.6 required features. Remove the experimental flag and unconditionally enable SM 6.6 if available. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-11-01 14:20:38 +01:00
Hans-Kristian Arntzen	2b11c70129	vkd3d: Hook up WaveSize implementation. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-11-01 14:20:38 +01:00
Hans-Kristian Arntzen	6966cd2f33	vkd3d-shader: Reflect CS WaveSize. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-11-01 14:20:38 +01:00
Hans-Kristian Arntzen	7cc435c0bc	vkd3d: Enable feature bits for 64-bit atomics. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-11-01 14:20:38 +01:00
Hans-Kristian Arntzen	de64ebd1d1	vkd3d: Expose Int64 feature. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-11-01 14:20:38 +01:00
Hans-Kristian Arntzen	23ad0247e3	vkd3d: Enable 64-bit atomics extensions. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-11-01 14:20:38 +01:00
Hans-Kristian Arntzen	6255eaec32	vkd3d: Stub out the more recent FEATURE_DATA structs. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-11-01 14:20:38 +01:00
David Gow	2a8b5471ca	vkd3d: Handle D3D12_APPEND_ALIGNED_ELEMENT for <4 byte wide elements In d3d12, input element alignment needs to be the _minimum_ of 4 and the size of the type. See the D3D11 spec, section 4.4.6, which behaves similarly: https://microsoft.github.io/DirectX-Specs/d3d/archive/D3D11_3_FunctionalSpec.htm#4.4.6%20Element%20Alignment This is correctly taken into account when generating, e.g., the vertex_buffer_stride_align_mask used for validation, but is not taken into account when D3D12_APPEND_ALIGNED_ELEMENT is used to automatically place input elements. Currently, vkd3d always assumes the alignment is 4. This means that, for example, bytes or shorts should be packed tightly together when D3D12_APPEND_ALIGNED_ELEMENT is used, but are instead padded to 4 bytes. Fixing this makes units appear in Age of Empires IV (see vkd3d-proton issue #880 for examples.) Signed-off-by: David Gow <david@ingeniumdigital.com>	2021-11-01 13:30:04 +01:00
Robin Kertels	430c77d3b3	vkd3d: Don't add xfb struct to rasterization state when NumEntries is 0. Wine VKD3D version of my original commit. Co-authored-by: Conor McCarthy <cmccarthy@codeweavers.com> Signed-off-by: Robin Kertels <robin.kertels@gmail.com>	2021-10-29 18:06:31 +02:00
Hans-Kristian Arntzen	85c75a042f	vkd3d: Enable VK_NV_compute_shader_derivatives. Supported on more implementations too :) Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-27 17:00:33 +02:00
Georg Lehmann	eb48213bfa	vkd3d: Follow the new shaderStorageImage{Read, Write}WithoutFormat rules. The Vulkan spec update 1.2.195 restricted these features to a very limited format subset, and somehow this is supposed to not be an API break? Anyway, let's follow the new rules. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com>	2021-10-27 17:00:21 +02:00
Georg Lehmann	fd690e3831	vkd3d: Enable typed uav loads based on KHR_format_feature_flags2. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com>	2021-10-27 17:00:21 +02:00
Georg Lehmann	07d53a82cc	vkd3d: Init shader extensions later. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com>	2021-10-27 17:00:21 +02:00
Georg Lehmann	4c37b4c341	vkd3d: Use vkGetPhysicalDeviceFormatProperties2. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com>	2021-10-27 17:00:21 +02:00
Georg Lehmann	c8d633cb51	vkd3d: Enable VK_KHR_format_feature_flags2. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com>	2021-10-27 17:00:21 +02:00
Hans-Kristian Arntzen	8ff91b23d6	vkd3d-shader: Hook up global descriptor heap for DXIL. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-26 15:32:36 +02:00
Hans-Kristian Arntzen	aadccb66cf	vkd3d: Add more root signature flags to the list of flags we recognize. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-26 15:32:36 +02:00
Hans-Kristian Arntzen	8977eaef88	vkd3d: Initialize global heap bindings for SM 6.6. Refactor code which emits SRV/UAV bindings to common code. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-26 15:32:36 +02:00
Hans-Kristian Arntzen	cbef48f90a	vkd3d: Refactor out how binding counts are parsed. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-26 15:32:36 +02:00
Hans-Kristian Arntzen	6548e4fd00	vkd3d: Add VKD3D_CONFIG for experimentally enabling SM 6.6. To be used for bringup and removed when we complete the support. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-26 15:32:36 +02:00
Hans-Kristian Arntzen	a0a29bae43	vkd3d: Use correct formats for image -> buffer copies. Need to use placed format explicitly if we're copying planar resources. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-20 15:48:12 +02:00
ifedorov	0abe8a21dd	Fixed row length calculation in CopyTextureRegion() Signed-off-by: Ivan Fedorov <ifedorov@nvidia.com>	2021-10-20 14:28:35 +02:00
Philip Rebohle	890ba87a7c	vkd3d-shader: Merge i/o variables using the same location. Fixes a number of issues observed in tessellation shaders, and potentially geometry shaders, when inputs and/or outputs are array variables. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-10-18 17:25:18 +02:00
Hans-Kristian Arntzen	740e23ea8a	vkd3d: Add VKD3D_CONFIG to force non-invariant position. It's common enough that new games break on RDNA2 because of this that we should enable this by default. This matches DXVK behavior. SOTTR gets a special weird exception, just like DXVK. The shaders are broken enough that the proper fix is actually precise, not invariant. This will be addressed at some later point. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-18 15:51:20 +02:00
Hans-Kristian Arntzen	be8d6ec7ad	vkd3d: Make global quirks info struct a value. Allows us to fiddle with it after the fact. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-18 15:51:20 +02:00
Hans-Kristian Arntzen	26bd08bbde	vkd3d-shader: Add global quirks for vkd3d_shader_quirk_info. Will be used for VKD3D_CONFIG overrides. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-18 15:51:20 +02:00
Hans-Kristian Arntzen	32c5abf496	vkd3d-shader: Add INVARIANT_POSITION quirk. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-18 15:51:20 +02:00
Hans-Kristian Arntzen	2152500014	vkd3d-shader: Refactor out quirk selection. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-18 15:51:20 +02:00
Hans-Kristian Arntzen	3b415dbc89	vkd3d: Don't spam error if ReleaseSemaphore fails. This function fails if the counter overflows. CP77 hits this case a lot and we should just warn the specific failure instead of a random error. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-18 14:58:49 +02:00
Hans-Kristian Arntzen	dda02faf89	vkd3d: Pad reserved resources to 64k alignment. Fix GPU crashes when attempting to bind non-aligned reserved resource. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-18 14:58:34 +02:00
Hans-Kristian Arntzen	8beb7dde89	vkd3d: Handle NULL pointers in GetResourceTiling in more places. DEATHLOOP uses all NULL at some point ... Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-15 15:12:41 +02:00
Philip Rebohle	dd23492348	vkd3d: Reduce memset overhead for query map. Potentially reduces the size of the query map, and makes each entry versioned so that we no longer have to clear the entire map for multiple dispatches even if it is sparsely populated. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2021-10-14 17:48:13 +02:00
Hans-Kristian Arntzen	0c60791bb1	vkd3d: Pass down PrimitiveCulling extension to vkd3d-shader. DXR 1.1 only feature. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-12 16:04:30 +02:00
Hans-Kristian Arntzen	f98702603d	vkd3d-shader: Add SPIR-V extension for PrimitiveCulling. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-12 16:04:30 +02:00
Hans-Kristian Arntzen	6866b45637	vkd3d: Add CONFIG flag for enabling DXR 1.1. We cannot support ExecuteIndirect with TraceRays() for time being. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-12 16:04:30 +02:00
Hans-Kristian Arntzen	e6836c6255	vkd3d: Support RTPSO CONFIG1 flags. DXR 1.1 and requires PrimitiveCullingFlags feature. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-12 16:04:30 +02:00
Hans-Kristian Arntzen	105882466b	vkd3d: Validate that we cannot mix and match geom types in BLAS. Runtime will error out and return 0 size. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-12 16:04:30 +02:00
Hans-Kristian Arntzen	a3202444c8	vkd3d: Fix stack deduction for anyhit shaders. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-12 16:04:30 +02:00
Hans-Kristian Arntzen	a36b987bf1	vkd3d: Add static pipeline variant flag to pipeline key. If we need to fallback in both VRS and non-VRS scenarios, we need to key on it. Fixes segfault in DIRT5 when toggling VRS. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-12 12:35:58 +02:00
Hans-Kristian Arntzen	3182882e21	d3d12: Do not export ordinals for most symbols. The ordinals except for D3D12CreateDevice and GetDebugInterface are not part of the ABI apparently. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-12 11:09:16 +01:00
Hans-Kristian Arntzen	99365bcaec	vkd3d: Enable VK_NV_fragment_shader_barycentric. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-11 13:53:19 +01:00
Hans-Kristian Arntzen	08a7d7a165	vkd3d: Bind local root signature static set. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-08 11:51:47 +02:00
Hans-Kristian Arntzen	d83ce4392b	vkd3d: Check root signature associations in hit groups as well. If we don't find a clear association to an entry point, we can also find it in the hit group. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-08 11:51:47 +02:00
Hans-Kristian Arntzen	c672429c70	vkd3d: Fix demangling of RT entry points. Digits are of course also valid identifiers. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-08 11:51:47 +02:00
Hans-Kristian Arntzen	235541ace5	vkd3d: Build local static sampler set/pipeline layouts and allocate set. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-08 11:51:47 +02:00
Hans-Kristian Arntzen	f605b88e90	vkd3d: Make some RS related functions non-static. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-08 11:51:47 +02:00
Hans-Kristian Arntzen	90d52abe94	vkd3d: Parse local RS static samplers. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-08 11:51:47 +02:00
Hans-Kristian Arntzen	74f62784e4	vkd3d: Cleanup redundant parameter_count assignment. parameter_count == NumParameters for local RS since hoisting is explicitly ignored for those. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-08 11:51:47 +02:00
Hans-Kristian Arntzen	393ef6261b	vkd3d: Add local root signature objects to RTPSO. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-08 11:51:47 +02:00
Hans-Kristian Arntzen	6802d9e5a3	vkd3d: Add helper to create augmented pipeline layout. For local root signature static samplers, this is handy. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-08 11:51:47 +02:00
Hans-Kristian Arntzen	67be905421	vkd3d: Bump max number of descriptor sets. Need one potentially for local root signature static samplers. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-08 11:51:47 +02:00
Hans-Kristian Arntzen	b661c9b8ba	vkd3d: Store set layout array in root signature. With RTPSOs we might have to create static sampler sets for local root signatures. In this case we will have to create a compatible pipeline layout which is equal to global pipeline layout, except for an extra set. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-08 11:51:47 +02:00
Hans-Kristian Arntzen	1e42acf492	vkd3d: Allow BUILD_MODE_UPDATE in PrebuildInfo check. Metro Exodus Enhanced Edition hits this a lot. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-08 07:21:47 +01:00
Hans-Kristian Arntzen	0f2e448659	vkd3d: Handle CreatePipelineLibrary with NULL ppData. Supposed to return S_FALSE. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-07 17:55:57 +02:00
Hans-Kristian Arntzen	ab4e847e74	renderdoc: Add global capture support. Useful for test suite since a test can be comprised of several smaller submissions, and it's easier to debug if we have one trace. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-07 15:33:06 +02:00
Hans-Kristian Arntzen	385c3dc012	vkd3d: Add bug reference for split fallback types. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-07 15:32:54 +02:00
Hans-Kristian Arntzen	26dc9e7da5	vkd3d: Allow CreateHeap to fail in certain fallback situations. If we deduce that fallback heap allocation is impossible, we will accept this, and defer allocation to CreatePlacedResource() instead where we make a committed resource. This breaks aliasing, but in practice, this situation will only arise for render targets, and it's not like we have a choice in the matter here on NV :\ Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-07 15:32:54 +02:00
Hans-Kristian Arntzen	7ee8eac818	vkd3d: Add allocation flag for DEDICATED. When allocating dedicated memory, ignore heap_flag requirements we deduce from memory info. Any memory type is allowed. This is important on NV when allocating fallback render targets. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-07 15:32:54 +02:00
Hans-Kristian Arntzen	cddb98acc6	vkd3d: Consider that we might attempt to free NULL memory. For deferred heaps, we will accept NULL allocations. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-07 15:32:54 +02:00
Hans-Kristian Arntzen	4075809a91	vkd3d: Make error message more precise when failing to allocate memory. There are situations where we cannot fallback to system memory, so don't log that we're going to do so. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-07 15:32:54 +02:00
Hans-Kristian Arntzen	9065f312d5	vkd3d: Refactor out validation of CUSTOM heap types. Don't attempt to enter memory allocation when we can invalidate a heap allocation up front. Avoids some dumb edge cases later. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-07 15:32:54 +02:00
Hans-Kristian Arntzen	9415191111	vkd3d: Add LOG_MEMORY_BUDGET logging for non-budget as well. Useful to be able to debug which allocations happen. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-07 15:32:54 +02:00
Joshua Ashton	c9ff20d4ac	vkd3d: Make a generic UE4 shader quirk collection Many UE4 games have this broken bloom shader that samples a texture with implicit lod in divergent control flow. Fixes Bus Simulator 21 Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-10-07 10:18:47 +01:00
Joshua Ashton	7a66669e92	vkd3d: Add empty element to shader quirks If we ever remove these, we need this for MSVC. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-10-07 10:18:47 +01:00
Joshua Ashton	d91d47d827	vkd3d: Use vkd3d_string_compare for shader quirks Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-10-07 10:18:47 +01:00
Joshua Ashton	70ee02bce0	vkd3d: Use vkd3d_string_compare for application overrides Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-10-07 10:18:47 +01:00
Hans-Kristian Arntzen	0f802b151e	vkd3d-shader: Avoid undefined result for Ibfe/Ubfe/Bfi. Width + offset must not overflow in SPIR-V. SM 5+ is well-defined here. It's enough to just clamp the width against 32 - offset in all cases. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-05 15:45:02 +02:00
Hans-Kristian Arntzen	cd3d759b95	vkd3d: Enable VK_KHR_shader_integer_dot_product. Accelerates SM 6.4 packed ops if present. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-05 15:38:59 +02:00
Hans-Kristian Arntzen	d9cd18b1ca	vkd3d-shader: Handle vectorized FIRSTBIT_HI. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-10-01 16:23:52 +02:00
Hans-Kristian Arntzen	af822939fb	vkd3d: Implement support for rendering to NULL/unbound RTV. Need to use fallback pipeline system here. Keep track of active masks for PSO and current render target. The intersection of those sets are the attachments which should be active in the render pass. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-30 16:50:02 +02:00
Hans-Kristian Arntzen	173b565ccf	vkd3d: Optimize DiscardResource when all subresources are discarded. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-29 14:17:31 +02:00
Hans-Kristian Arntzen	0b11fad67c	vkd3d: Allow discarding UAV resources. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-29 14:17:31 +02:00
Hans-Kristian Arntzen	6f0677eb2e	vkd3d: Refactor out queue flags -> stages conversion. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-29 14:17:31 +02:00
Hans-Kristian Arntzen	0c2ddb89cd	vkd3d: Add CONFIG for forced CACHED memory. Very useful for capturing. Speeds up a ton. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-27 14:48:26 +02:00
Hans-Kristian Arntzen	6863f1c6a8	vkd3d: Fix test suite regression on NV. Fix failure in test_create_heap where a TIER_2 host visible heap was attempted, but failed due to recent DEATHLOOP fixes. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2021-09-24 16:48:34 +02:00
Joshua Ashton	bde3ad8e01	vkd3d: Move ID3D12StateObject impl_froms to header Basic casts should not be function calls. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-09-23 12:12:13 +02:00
Joshua Ashton	cabc31fc4c	vkd3d: Move ID3D12Device impl_froms to header Basic casts should not be function calls.	2021-09-23 12:12:13 +02:00
Joshua Ashton	bfaf72386f	vkd3d: Move ID3D12CommandSignature impl_froms to header Basic casts should not be function calls. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-09-23 12:12:13 +02:00
Joshua Ashton	b84c3ff163	vkd3d: Move ID3D12PipelineState impl_froms to header Basic casts should not be function calls. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-09-23 12:12:13 +02:00
Joshua Ashton	7c993ae1a6	vkd3d: Move ID3D12RootSignature impl_froms to header Basic casts should not be function calls. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-09-23 12:12:13 +02:00
Joshua Ashton	875fbe5f50	vkd3d: Move ID3D12QueryHeap impl_froms to header Basic casts should not be function calls. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-09-23 12:12:13 +02:00
Joshua Ashton	2334c136e3	vkd3d: Move ID3D12DescriptorHeap impl_froms to header Basic casts should not be function calls. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-09-23 12:12:13 +02:00
Joshua Ashton	8d5308c9a1	vkd3d: Move ID3D12Resource impl_froms to header Basic casts should not be function calls. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-09-23 12:12:13 +02:00
Joshua Ashton	27e66b5c4a	vkd3d: Move ID3D12Heap impl_froms to header Basic casts should not be function calls. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-09-23 12:12:13 +02:00
Joshua Ashton	26d8011b06	vkd3d: Move ID3D12Fence impl_froms to header Basic casts should not be function calls. Signed-off-by: Joshua Ashton <joshua@froggi.es>	2021-09-23 12:12:13 +02:00

... 2 3 4 5 6 ...

2839 Commits