mirrors/vkd3d-proton

Commit Graph

Author	SHA1	Message	Date
Hans-Kristian Arntzen	d3a76eee90	idl: Fix const correctness of UpdateTileMappings. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-07-25 18:10:08 +02:00
Derek Lesho	df1829e407	vkd3d: Implement ID3D12Fence sharing on top of D3D12-Fence exportable Vulkan timeline semaphores. Signed-off-by: Derek Lesho <dlesho@codeweavers.com>	2022-07-25 11:16:53 +02:00
Hans-Kristian Arntzen	be2aafff1a	vkd3d: Resolve fence waiters early. Temporarily abandons the idea to fuse waiters with execution. For whatever reason, this seemed to cause random flicker in Halo Infinite with async compute on, and I have failed to figure out exactly why. By playing around with how commands are fused, the results changed dramatically, which means I doubt vkd3d-proton was actually at fault here. There is some questionable code around UpdateTileMappings in the game where a COPY queue is used, and it does not seem to synchronize this with other queues as far as I can tell. It is uncertain at this time if D3D12 requires a tile update to synchronize with every queue or just the queue being submitted to. We assume the latter, as it's the only behavior that makes sense. It is possible that submitting waits as they are queued up affects synchronization between queues in unexpected ways. When separating out the wait operations, everything appears to work. It is also simpler code. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-07-21 21:10:34 +02:00
Derek Lesho	a2439e766f	vkd3d: Flush queued waiters before waiting for the sparse binding semaphore. Fixes a bug in the logic trying to combine the waits by simplifying the code. Problem discovered by HK. Signed-off-by: Derek Lesho <dlesho@codeweavers.com>	2022-07-20 01:27:20 +02:00
Hans-Kristian Arntzen	4ff504b52d	vkd3d: Match native runtime better in command allocator reset. Even when misusing the API, S_OK is still returned on native runtimes. Keep the error log, and add an error report to command allocator release if there are still pending submissions. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-07-18 19:00:25 +02:00
Hans-Kristian Arntzen	6335e411bb	vkd3d: Rewrite submission logic for wait fences. D3D12 has some unfortunate rules around CommandQueue::Wait(). It's legal to release the fence early, before the fence actually completes its wait operation. The behavior on D3D12 is just to release all waiters. For out of order signal/wait, we hold off submissions, so we can implement this implicitly through CPU signal to UINT64_MAX on fence release. If we have submitted a wait which depends on the fence, it will complete in finite time, so it still works fine. We cannot release the semaphores early in Vulkan, so we must hold on to a private reference of the ID3D12Fence object until we have observed that the wait is complete. To make this work, we refactor waits to use the vkd3d_queue wait list. On other submits, we resolve the wait. This is a small optimization since we don't have to perform dummy submits that only performs the wait. At that time, we signal a timeline semaphore and queue up a d3d12_fence_dec_ref(). Since we're also adding this system where normal submissions signal timelines, handle the submission counters more correctly by deferring the decrements until we have waited for the submission itself. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-07-18 19:00:25 +02:00
Hans-Kristian Arntzen	11c943dd7e	vkd3d: Unblock all fence waiters when public ref-count hits 0. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-07-18 19:00:25 +02:00
Hans-Kristian Arntzen	5b73139f18	vkd3d: Fail creation of command signature if DGC is not supported. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-07-12 14:31:53 +02:00
Hans-Kristian Arntzen	f704cb9776	vkd3d: Use index type LUT for DGC. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-07-11 15:14:13 +02:00
Hans-Kristian Arntzen	e17a7cb40c	vkd3d: Attempt to reuse application indirect command buffer. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-07-11 15:14:13 +02:00
Hans-Kristian Arntzen	4a07d9c038	debug: Add concept of implicit instance index to debug ring. For internal debug shaders, it is helpful to ensure in-order logs when sorted for later inspection. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-07-11 14:59:00 +02:00
Hans-Kristian Arntzen	e138a5117a	vkd3d: Encode in detail which commands we're emitting in template. Feed this back to debug ring for less cryptic logs. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-07-11 14:59:00 +02:00
Hans-Kristian Arntzen	96fdb71ae4	vkd3d: Refactor out patch command token enum. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-07-11 14:59:00 +02:00
Hans-Kristian Arntzen	f93a581dae	vkd3d: Trace breadcrumbs for execute indirect templates. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-07-11 14:59:00 +02:00
Hans-Kristian Arntzen	03fdbac59e	vkd3d: Dump TraceRays parameters to breadcrumbs. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-07-11 14:04:38 +02:00
Hans-Kristian Arntzen	8a94c3ce0e	vkd3d: Add more detailed breadcrumb logging for TraceRays. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-07-11 14:04:38 +02:00
Philip Rebohle	1d869e3e21	vkd3d: Do not execute indirect commands if count buffer is unsupported. Also be a bit more uniform with using break/return on fail conditions. Otherwise, the indirect command will read data from the count buffer instead, which may lead to bugs or GPU hangs. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-06-28 14:57:11 +02:00
Tatsuyuki Ishi	02c7ec404c	vkd3d: Fix transfer batch clobbering state in begin_render_pass. Transfer batch can clobber graphics pipeline for e.g. depth->color copies. Hence, flushing the batches after applying the graphics pipeline set by the app can cause correctness issues. To prevent that, do the transfer batch flush first before we apply any render-related states. Signed-off-by: Tatsuyuki Ishi <ishitatsuyuki@gmail.com>	2022-06-28 13:53:03 +02:00
Hans-Kristian Arntzen	bc759be2af	vkd3d: Optimize ExecuteIndirect() if no INDIRECT transitions happened. The D3D12 docs outline this as an implementation detail explicitly, so we should do the same thing. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-06-24 14:55:39 +02:00
Hans-Kristian Arntzen	18f1d1c72e	vkd3d: Implement ExecuteIndirect with state update. Implements the most basic iteration where we don't try to take advantage of index LUT, hoisting CS patching or attempting to reuse application indirect buffer directly. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-06-24 14:55:39 +02:00
Hans-Kristian Arntzen	619a54810d	vkd3d: Pass down required memory types to scratch allocators. Separate scratch pools by their intended usage. Allows e.g. preprocess buffers to be allocated differently from normal buffers, which is necessary on implementations that use special memory types to implement preprocess buffers. Potentially can also allow for separate pools for host visible scratch memory down the line. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-06-23 14:39:22 +02:00
Hans-Kristian Arntzen	8ae391e675	vkd3d: Add more stringent validation for CreateCommandSignature. The runtime is specified to validate certain things. Also, be more robust against unsupported command signatures, since we might need to draw/dispatch at an offset. Avoids hard GPU crashes. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-06-23 12:52:29 +02:00
Hans-Kristian Arntzen	abdef77695	vkd3d: Add helper to invalidate all state. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-06-23 12:52:29 +02:00
Hans-Kristian Arntzen	c132073df8	vkd3d: Refactor index buffer state to be flushed late. With ExecuteIndirect state we'll need to modify or refresh index buffer state. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-06-23 12:52:29 +02:00
Tatsuyuki Ishi	39d07dea2c	vkd3d: Check for alias and batch barriers in CopyTextureRegion batches. Signed-off-by: Tatsuyuki Ishi <ishitatsuyuki@gmail.com>	2022-06-16 11:54:26 +02:00
Tatsuyuki Ishi	3577ca3144	vkd3d: Introduce transfer batches. Transfer batches buffers CopyTextureRegion calls for batching. The flushes needs to happen in a few places: 1. ResourceBarrier: This is where the transition from COPY_DEST to other might happen, at which point the writes must be visible. This might also transition away from COPY_SRC which invalidates the precondition. 2. Copy operations. Copies to the same resource are implicitly ordered. 3. Draws and dispatches. These are not strictly necessary, but we don't want too much command reordering so flushing here seems good. 4. Close. So that we don't throw commands into the void. Signed-off-by: Tatsuyuki Ishi <ishitatsuyuki@gmail.com>	2022-06-16 11:54:26 +02:00
Tatsuyuki Ishi	829ac72e3d	vkd3d: Break up CopyTextureRegion into three stages. A parameter preparation stage, a pre-execution barrier stage, then finally the execution and post-execution barrier stage. Signed-off-by: Tatsuyuki Ishi <ishitatsuyuki@gmail.com>	2022-06-13 14:40:23 +02:00
Hans-Kristian Arntzen	c64916686d	vkd3d: Clear SUSPENDED flag properly. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-06-13 13:46:49 +02:00
Hans-Kristian Arntzen	467db76f90	vkd3d: Remove obsolete COLOR -> COMPUTE workaround for Deathloop. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-05-31 15:59:35 +02:00
Robin Kertels	cdabda7805	vkd3d: Implement indirect ray tracing. Signed-off-by: Robin Kertels <robin.kertels@gmail.com>	2022-05-11 19:11:01 +02:00
Hans-Kristian Arntzen	71940797d1	vkd3d: Check for redundant dynamic state in some cases. Some dynamic state is at risk of being spammed with same arguments many times. For the dynamic state that is trivial to check, do so. Ghostwire: Tokyo has been observed to spam the same OMSetStencilRef value causing some context rolls, also RSSetShadingRate has been set redundantly. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-05-03 16:30:42 +02:00
Philip Rebohle	beaedbd857	vkd3d: Use UAV clear fallback based on format compatibility. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-04-21 13:51:58 +02:00
Philip Rebohle	81927c5895	vkd3d: Fix handling of non-zero base layer in ClearUAV fallback path. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-04-21 13:51:58 +02:00
Philip Rebohle	e7a6af4971	vkd3d: Use texel buffer views for UAV clears with buffer to image copy. Allows this to more easily work with more formats. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-04-21 13:51:58 +02:00
Philip Rebohle	e4184830c5	vkd3d: Add ClearUAV path that uses buffer-to-image copies. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-04-05 11:52:23 +02:00
Philip Rebohle	d1425ee4d1	vkd3d: Use VK_ACCESS_MEMORY_{READ,WRITE}_BIT where appropriate Buggy RADV versions no longer work due to missing extension support. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-04-05 11:52:23 +02:00
Hans-Kristian Arntzen	6f43f450c8	vkd3d: Disable primitive restart when using non-compatible topologies. Primitive restart is only used for strip primitive types, and must be ignored for lists. Use and require extended_dynamic_state2 for this purpose. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-30 16:12:16 +02:00
Hans-Kristian Arntzen	da63f0beac	vkd3d: Compute range_end after sparse checks in copy tracking. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-30 12:13:25 +02:00
Philip Rebohle	6378f1b880	vkd3d: Optimize WriteBufferImmediate for consecutive writes. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-30 11:51:10 +02:00
Hans-Kristian Arntzen	2e8fb27182	vkd3d: Correctly handle dynamic depth/stencil attachment infos. {depth,stencil}AttachmentFormat and p{Depth,Stencil}Attachment are only allowed if the format contains that aspect. Check this explicitly. Fixes some validation errors. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-24 17:55:32 +01:00
Hans-Kristian Arntzen	1b5f7e8fc3	vkd3d: Use VkImageViewCreateInfo correctly. For EXTENDED_USAGE, we still need to restrict image usage when creating concrete views. Use VkImageViewUsageCreateInfo to restrict usage flags to the kind of view we're creating. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-24 17:55:32 +01:00
Hans-Kristian Arntzen	cf65a78570	vkd3d: Rename DSV UNKNOWN workaround query. Make it more obvious what it's really trying to check. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-23 22:36:00 +01:00
Philip Rebohle	1d3957fe6d	vkd3d: Do not create pipeline variants for NULL DSV. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-23 22:22:09 +01:00
Hans-Kristian Arntzen	6e915dd2c0	vkd3d: Use rt_count as basis for binding RTVs. Found some validation errors where rt_count != rtv_active_mask, and blending used rt_count instead of rtv_active_mask. If shader renders to a NULL attachment, we must make sure that it's part of the PSO interface. Also, use rt_count rather than active mask when beginning render pass. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-23 14:29:51 +01:00
Philip Rebohle	34f5fc6a31	vkd3d: Do not create pipeline variants for NULL RTVs. Signed-off-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2022-03-22 13:06:00 +01:00
Hans-Kristian Arntzen	63530501a5	vkd3d: Require VK_EXT_extended_dynamic_state. This is basically required for not horrible stutter and performance and is widely supported. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-16 17:48:21 +01:00
Hans-Kristian Arntzen	e61cc0234a	vkd3d: Allow debug ring to know about device lost scenarios. For this case, we want to block and teardown the debug ring thread. It's okay to fish for dead messages in the ring, since we know there won't be more GPU work submitted. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-11 13:26:27 +01:00
Hans-Kristian Arntzen	a6700d3d85	vkd3d: Make debug ring aware of potential crash scenarios. If we expect device losts (breadcrumb debug), we need to use DEVICE uncached/coherent, since we might not be able to flush GPU caches properly. We also need to remove the idea of being able to copy out the control block back to host. This is too brittle and we should instead just place the control block in PCI-e BAR instead. Rethink how we pass messages from GPU to CPU to make it more robust. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-11 13:26:27 +01:00
Hans-Kristian Arntzen	972ce74ac6	vkd3d: When using breadcrumbs, consider that WaitSemaphore can be buggy. Spec says that in device lost, driver must return DEVICE_LOST in finite time, but this does not happen on NV drivers. Use a long timeout instead in this scenario. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-11 13:07:56 +01:00
Hans-Kristian Arntzen	365dd05557	vkd3d: Add breadcrumbs support. AMD path for this commit. Idea is that we can automatically instrument markers with command list information we can make some sense of in vkd3d-proton. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no>	2022-03-11 13:07:56 +01:00

1 2 3 4 5 ...

746 Commits