mirrors/mesa - Frog Git

Commit Graph

Author	SHA1	Message	Date
Samuel Pitoiset	bb9b7d0a68	radv: fix missing initialization of the predication value It's expected to be 0. Fixes: `62d9ca696e` ("radv: use 32-bit predication for conditional rendering on GFX10.3+") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7789>	2020-11-26 12:30:27 +01:00
Samuel Pitoiset	8da98beb5d	radv: always use 32-bit predication on compute queues It seems that only gfx queue doesn't support it, except on GFX10.3 which supports all queues. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7732>	2020-11-25 08:13:43 +00:00
Samuel Pitoiset	62d9ca696e	radv: use 32-bit predication for conditional rendering on GFX10.3+ It's now supported. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7732>	2020-11-25 08:13:43 +00:00
Bas Nieuwenhuizen	025cb90042	radv: Fix RB+ blending for VK_FORMAT_E5B9G9R9_UFLOAT_PACK32. Fixes: `e893102bcf` ("radv: Add VK_FORMAT_E5B9G9R9_UFLOAT_PACK32 rendering support.") Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7716>	2020-11-24 21:25:57 +00:00
Tony Wasserka	cba6ec309a	radv: Fix -Wshadow warnings Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7430>	2020-11-20 09:29:19 +00:00
Marek Olšák	603b5340b9	ac: rename num_render_backends -> max_render_backends Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7542>	2020-11-18 06:19:59 +00:00
Samuel Pitoiset	0790105f2f	radv: do VGT_FLUSH when switching NGG -> legacy on Sienna Cichlid Ported from RadeonSI. Cc: 20.2 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7566>	2020-11-17 10:34:28 +00:00
Bas Nieuwenhuizen	8943c80c9b	radv: Fix variable name collision. idx was aliased, and `eb104e949e` started using the outer var in the inner scope ... Fixes: `eb104e949e` Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3701 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7388>	2020-10-30 23:44:48 +01:00
Bas Nieuwenhuizen	eb104e949e	radv: Do not access set layout during vkCmdBindDescriptorSets. The spec says: " VkDescriptorSetLayout objects may be accessed by commands that operate on descriptor sets allocated using that layout " So our behavior is valid here, but this is a temporary workaround for an issue with Baldur's Gate 3. CC: mesa-stable Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3607 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7207>	2020-10-28 03:06:20 +00:00
Samuel Pitoiset	48e83f7665	radv: do not perform a FMASK expand for non-writeable MSAA images It should only be required for writeable MSAA images. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7292>	2020-10-27 13:16:50 +01:00
James Park	28d02b9d3e	ac,amd/llvm,radv: Initialize structs with {0} Necessary to compile with MSVC. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7123>	2020-10-14 12:15:23 +00:00
Bas Nieuwenhuizen	ea778693bf	radv: Fix event write cmdbuffer allocation when tracing. The trace emit is another 7 words. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7091>	2020-10-12 10:55:08 +00:00
Bas Nieuwenhuizen	da132d802b	radv: Set fce metadata correctly on DCC initialization. The fce metadata can always be set to false as we don't care about the compressed clear color. Avoiding useless fast clear eliminates improves basemark performance by 1%-1.5%. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7005>	2020-10-09 13:46:49 +00:00
Rhys Perry	19561f31a8	radv: remove trailing whitespace Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7043>	2020-10-07 11:53:23 +00:00
Bas Nieuwenhuizen	24f19f409d	radv: Write correct dispatch size for RGP. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6875>	2020-09-29 20:14:40 +00:00
Bas Nieuwenhuizen	78165ea3e2	radv: Record cache flushes for RGP. Not doing the EOP TS cacheflush event because that break wave counting in RGP for some reason. But the rest looks to be all there. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6550>	2020-09-28 15:46:08 +00:00
Bas Nieuwenhuizen	cc73182152	radv: Include flushes in the barrier. Since the flushes really happen on the next draw delay the barrier end to include the flushes. This fixes the barrier duration in RGP. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6550>	2020-09-28 15:46:08 +00:00
Bas Nieuwenhuizen	e893102bcf	radv: Add VK_FORMAT_E5B9G9R9_UFLOAT_PACK32 rendering support. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6831>	2020-09-23 09:22:03 +00:00
Samuel Pitoiset	2b99e15d0a	radv: fix transform feedback crashes if pCounterBufferOffsets is NULL From the Vulkan 1.2.154 spec: "If pCounterBufferOffsets is NULL, then it is assumed the offsets are zero." Fix new CTS dEQP-VK.transform_feedback.simple.backward_dependency_no_offset_array. CC: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6798>	2020-09-21 15:02:02 +00:00
Bas Nieuwenhuizen	8ae4cec95f	Revert "radv: emit {CB,DB}_RMI_L2_CACHE_CONTROL at framebuffer time" This reverts commit `d6bc0f26c9`. These registers are now constant. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6726>	2020-09-21 10:34:46 +00:00
Bas Nieuwenhuizen	0a84c595c2	Revert "radv: set BIG_PAGE to improve performance on GFX10.3" This reverts commit `f4d861696d`. Turns out we cannot use BIG_PAGE with GTT and we can't tell when a buffer is spilled to GTT. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6726>	2020-09-21 10:34:46 +00:00
Pierre-Loup A. Griffais	7b4eaac6a9	radv: fix vertex buffer null descriptors Fixes: `0f1ead7b53` "radv: handle NULL vertex bindings" Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6773>	2020-09-18 17:12:40 +00:00
Pierre-Loup A. Griffais	ec13622ff4	radv: fix null descriptor for dynamic buffers Fixes: `c1ef225d18` "radv: handle NULL descriptors" Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6772>	2020-09-18 16:30:17 +00:00
Rhys Perry	85cc2950a0	radv: initialize with expanded cmask if the destination layout needs it If radv_layout_can_fast_clear() is false, 028C70_COMPRESSION is unset when the image is rendered to and CMASK isn't updated. This appears to cause FMASK to be ignored and the 0th sample to always be used. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3449 Fixes: `7b21ce401f` ('radv: disable FMASK compression when drawing with GENERAL layout') Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6745>	2020-09-17 10:28:29 +00:00
Marek Olšák	b7a6333ee4	amd/registers: switch to new generated register definitions Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6423>	2020-09-01 08:45:54 -04:00
Samuel Pitoiset	aa675cdc91	radv: improve reporting faulty pipelines when a GPU hang is detected Because the driver now waits for idle after every draw/dispatch calls, we shouldn't report gfx pipelines when the GPU hang happens after a dispatch (or the opposite). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6471>	2020-09-01 08:27:48 +02:00
Samuel Pitoiset	f4d861696d	radv: set BIG_PAGE to improve performance on GFX10.3 It reduces traffic between CB, DB and TCP blocks if buffers respect a certain alignment. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6482>	2020-08-28 05:53:41 +00:00
Samuel Pitoiset	d6bc0f26c9	radv: emit {CB,DB}_RMI_L2_CACHE_CONTROL at framebuffer time The upcoming patch will set BIG_PAGE if needed. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6482>	2020-08-28 05:53:41 +00:00
Bas Nieuwenhuizen	18fe130ec9	radv: Fix uninitialized variable in renderpass. Fixes some dEQP-VK.renderpass2.* flakes. Valgrind: Test case 'dEQP-VK.renderpass2.dedicated_allocation.attachment.8.724'.. ==754520== Conditional jump or move depends on uninitialised value(s) ==754520== at 0x575B21C: radv_layout_is_htile_compressed (radv_image.c:1690) ==754520== by 0x572F470: radv_handle_depth_image_transition (radv_cmd_buffer.c:5855) ==754520== by 0x572F2F2: radv_handle_image_transition (radv_cmd_buffer.c:6123) ==754520== by 0x572EEC6: radv_handle_subpass_image_transition (radv_cmd_buffer.c:3385) ==754520== by 0x572A104: radv_cmd_buffer_begin_subpass (radv_cmd_buffer.c:4843) ==754520== by 0x572A007: radv_CmdBeginRenderPass (radv_cmd_buffer.c:4913) ==754520== by 0x572A197: radv_CmdBeginRenderPass2 (radv_cmd_buffer.c:4921) Why false? A renderloop happens when the same attachment is both used as input attachment and output (color, ds) attachment in a subpass. Of course this doesn't happen outside of a renderpass and hence we can initialize it to false at the start of the renderpass. Fixes: `66131ceb8b` "radv: Pass through render loop detection to internal layout decisions." Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3074 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6068>	2020-07-26 13:35:16 +00:00
Samuel Pitoiset	6ced98c94e	radv: disable CPU caching for the upload BO to reduce fetch latency AMDGPU_GEM_CREATE_CPU_GTT_USWC should be faster when CPU reads are unexpected (because they aren't cached). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5978>	2020-07-21 11:54:39 +00:00
Samuel Pitoiset	b3eae4e037	radv: do not perform read-modify-write with the upload BO To disable CPU caching. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5978>	2020-07-21 11:54:39 +00:00
Samuel Pitoiset	50fdefc025	radv: destroy the base object if VkAllocateCommandBuffers() failed Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5868>	2020-07-15 13:53:35 +02:00
Samuel Pitoiset	b262284300	radv: add support for dynamic vertex input binding stride Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5718>	2020-07-13 08:31:54 +00:00
Samuel Pitoiset	9cc99baa4a	radv: add support for dynamic depth/stencil states Out-of-order rasterization is disabled if a pipeline uses an extended dynamic depth/stencil state because the driver doesn't support enabling/disabling out-of-order dynamically. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5718>	2020-07-13 08:31:54 +00:00
Samuel Pitoiset	e8a69b782d	radv: add support for dynamic and scissor count Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5718>	2020-07-13 08:31:54 +00:00
Samuel Pitoiset	d6c1e5051e	radv: add support for dynamic primitive topology Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5718>	2020-07-13 08:31:54 +00:00
Samuel Pitoiset	52bf1035a6	radv: add support for dynamic cull mode and front face Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5718>	2020-07-13 08:31:54 +00:00
Samuel Pitoiset	ac575f4215	radv: rework dynamic viewports/scissors support The number of viewports/scissors is currently static because it can only be specified at pipeline creation, but it doesn't hurt to assume it's dynamic. Will help for supporting setting the number of viewports/scissors dynamically. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5718>	2020-07-13 08:31:54 +00:00
Samuel Pitoiset	9f561feecc	radv: store the primitive topology hardware value in the pipeline Will help for upcoming changes. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5801>	2020-07-09 06:31:39 +00:00
Samuel Pitoiset	6f734324a5	radv: implement missing VK_ACCESS_MEMORY_{READ,WRITE}_BIT From the Vulkan spec 1.2.146: "VK_ACCESS_MEMORY_READ_BIT specifies all read accesses. It is always valid in any access mask, and is treated as equivalent to setting all READ access flags that are valid where it is used." "VK_ACCESS_MEMORY_WRITE_BIT specifies all write accesses. It is always valid in any access mask, and is treated as equivalent to setting all WRITE access flags that are valid where it is used." Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3241 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5807>	2020-07-09 08:05:20 +02:00
Bas Nieuwenhuizen	ad913a18b1	radv: Always enable PERFECT_ZPASS_COUNTS. We have an issue with early depth testing and discard, where non-perfect counts count the tile if the early depth test succeeds. We could spend a lot of effort to set this conditionally based on the presence of the two conditions, but in the presence of inherited queries let's try this first. Changing PERFECT_ZPASS_COUNTS since I'm pretty sure this has a lower performance impact than always using late depth testing. CC: <mesa-stable@lists.freedesktop.org> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3218 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5757>	2020-07-06 13:54:38 +00:00
Samuel Pitoiset	7b21ce401f	radv: disable FMASK compression when drawing with GENERAL layout Fixes: `96063100` "radv: enable shaderStorageImageMultisample feature on GFX8+" Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3219 Closes: https://gitlab.freedesktop.org/mesa/mesa/issues/855 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3165>	2020-07-06 13:26:58 +00:00
Samuel Pitoiset	53372175c9	radv: fix wide points and lines The maximum value for both points and lines is 65536. This doesn't fix anything known (just found this while looking in that area). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5696>	2020-07-02 08:26:03 +02:00
Daniel Schürmann	db0afb3800	radv: change use_aco -> use_llvm We are about to make ACO the default backend. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5445>	2020-06-25 15:16:28 +02:00
Bas Nieuwenhuizen	64a92ef7a2	radv/winsys: Distinguish device/host memory errors. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5578>	2020-06-24 13:00:02 +00:00
Rhys Perry	841fdfcd45	radv/aco,aco: allow SMEM SSBO loads on GFX6/7 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5207>	2020-06-24 10:52:28 +00:00
Bas Nieuwenhuizen	81dee6cf8f	radv: Use offsets in surface struct. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5194>	2020-06-05 13:27:55 +00:00
Bas Nieuwenhuizen	cd0c5b64cc	radv: Remove dead code. pool is always non-NULL, and is also accessed before this check in the function, so remove the pool = NULL case. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5181>	2020-05-25 11:12:07 +00:00
Marek Olšák	3509d3bd53	ac: update register and packet definitions for preemption Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5095>	2020-05-23 03:45:07 -04:00
Samuel Pitoiset	178adfa6a8	radv: use the base object struct types Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4886>	2020-05-13 08:23:23 +02:00
Samuel Pitoiset	65458528fc	radv: use the common base object type for VkDevice Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4886>	2020-05-13 08:23:23 +02:00
Joshua Ashton	24f9aea770	radv: Remove RANGE_SIZE usage These were removed from the latest Vulkan headers https://github.com/KhronosGroup/Vulkan-Docs/issues/1230 Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4878>	2020-05-05 00:28:00 +00:00
Samuel Pitoiset	0f1ead7b53	radv: handle NULL vertex bindings With VK_EXT_robustness2, an element of pBuffers can be NULL. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4775>	2020-04-29 07:29:54 +00:00
Samuel Pitoiset	ff3f775476	radv: simplify checking for Navi1x chips Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4702>	2020-04-23 15:54:32 +02:00
Albert Astals Cid	06c5875fd6	Fix promotion of floats to doubles Use the f variants of the math functions if the input parameter is a float, saves converting from float to double and running the double variant of the math function for gaining no precision at all Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3969>	2020-04-18 19:55:45 +00:00
Samuel Pitoiset	849eb0a776	radv: use RMW packets for updating the maximum sample distance Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4531>	2020-04-14 11:31:37 +02:00
Bas Nieuwenhuizen	a7e2efa7c9	radv: Use correct buffer count with variable descriptor set sizes. Fixes dEQP-VK.binding_model.descriptorset_random.sets16.noarray.ubolimitlow.sbolimitlow.imglimitlow.iublimitlow.frag.ialimitlow.0 CC: <mesa-stable@lists.freedesktop.org> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2607 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4489>	2020-04-08 15:26:50 +00:00
Bas Nieuwenhuizen	a3682670c8	radv: Consider maximum sample distances for entire grid. The other pixels in the grid might have samples with a larger distance than the (0,0) pixel. Fixes dEQP-VK.pipeline.multisample.sample_locations_ext.verify_location.samples_8_packed when CTS is compiled with clang. CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4480>	2020-04-08 10:53:33 +00:00
Samuel Pitoiset	cd99ea7318	radv: remove radv_layout_has_htile() helper The goal of this function was to return whether a depth-stencil image has HTILE, in comparison to radv_layout_is_htile_compressed() which is used to know whether a depth-stencil image has HTILE compressed. These two functions are actually similar and they have never been used for what they were supposed to. Remove radv_layout_has_htile() in favour of radv_layout_is_htile_compressed() for now. If it's needed in the future, I will re-introduce this concept properly. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4389>	2020-04-08 07:55:16 +02:00
Samuel Pitoiset	8b7586655f	radv: rename decompress/resummarize depth/stencil functions Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4389>	2020-04-08 07:55:10 +02:00
Samuel Pitoiset	2d3223ca90	radv: fix optional pSizes parameter when binding streamout buffers The Vulkan spec 1.2.135 says: "pSizes is an optional array of buffer sizes, specifying the maximum number of bytes to capture to the corresponding transform feedback buffer. If pSizes is NULL, or the value of the pSizes array element is VK_WHOLE_SIZE, then the maximum bytes captured will be the size of the corresponding buffer minus the buffer offset." Closes: https://gitlab.freedesktop.org/mesa/mesa/issues/2650 Fixes: `b4eb029062` ("radv: implement VK_EXT_transform_feedback") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4232> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4232>	2020-03-20 09:25:14 +01:00
Samuel Pitoiset	e6e97ea92e	radv/sqtt: describe layout transitions with user markers Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4138> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4138>	2020-03-12 17:04:55 +00:00
Samuel Pitoiset	b229302b96	radv/sqtt: describe begin/end subpass barriers with user markers Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4138>	2020-03-12 17:04:55 +00:00
Samuel Pitoiset	b6cebf6439	radv: do not recursively begin/end render pass for meta operations To avoid breaking SQTT user markers that are emitted to report barriers and layout transitions to RGP. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4136> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4136>	2020-03-11 07:54:43 +00:00
Samuel Pitoiset	24db276d11	radv/sqtt: describe pipeline and wait events barriers Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4031> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4031>	2020-03-10 10:05:40 +01:00
Samuel Pitoiset	b829fbb7f0	radv/sqtt: describe draw/dispatch and emit event markers Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4031>	2020-03-10 10:05:40 +01:00
Samuel Pitoiset	dcfc08f5b8	radv/sqtt: describe begin/end command buffers with user markers Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4031>	2020-03-10 09:58:02 +01:00
Samuel Pitoiset	b3ef07db96	radv: emit thread trace markers after every draw/dispatch call Thread trace markers (also called events in Radeon GPU Profiler) should be emitted after every draw/dispatch calls to collect data. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3900>	2020-02-28 08:11:02 +01:00
Samuel Pitoiset	12a22da683	radv: add the trace BO to the BO list at submit time Instead of adding it in every command buffer. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3891> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3891>	2020-02-24 12:43:53 +01:00
Samuel Pitoiset	556c940149	radv: implement VK_EXT_line_rasterization Only Bresenham lines are supported. GFX9 is currently disabled because there is some CTS failures for some weird reasons. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/2982> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/2982>	2020-02-13 08:14:01 +01:00
Bas Nieuwenhuizen	5b335e1599	radv: Do not redundantly set the RB+ regs on pipeline switch. No significant perf changes seen on Bayonetta. (Changes are in the noise on my Raven Laptop) Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3735> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3735>	2020-02-11 04:39:42 +00:00
Bas Nieuwenhuizen	7792d774e0	radv: Optimize emitting index buffer changes. Since the direct indexed draw packet has the address/count info inline, there is no sense in emitting the base and size. No real significant changes found during benchmarks. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3466> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3466>	2020-02-11 03:07:11 +00:00
Bas Nieuwenhuizen	65a6dc5139	radv: Do not set SX DISABLE bits for RB+ with unused surfaces. The extra bits in CB_SHADER_MASK break dual source blending in SkQP on a Stoney device. However: - As far as I can tell, some other dual source blend tests are passing before and after the change. - A hacked around skqp passes on my Vega desktop and Raven laptop - Getting Skqp to give any useful info or to run it outside of Android on ChromeOS is proving difficult. I have confirmed 3 strategies that seem to work: - The old radv behavior of setting CB_SHADER_MASK to 0xF - AMDVLK: CB_SHADER_MASK = 0xFF, and the 3 RB+ regs are 0. - radeonsi: CB_SHADER_MASK = 0xFF, but does not set DISABLE bits in SX_BLEND_OPT_CONTROL for CB 1-7. Let us use the radeonsi solution as that solution also seems like the correct thing to do for holes. I have tested on my Raven laptop that setting the high surfaces to not disabled and downconvert to 32_R does not imply a performance penalty. Fixes: `e9316fdfd4` "radv: fix setting CB_SHADER_MASK for dual source blending" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3670> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3670>	2020-02-04 21:22:30 +00:00
Samuel Pitoiset	e4752dafed	radv/gfx10: implement NGG GS queries The number of generated primitives is only counted by the hardware if GS uses the legacy path. For NGG GS, we need to accumulate that value in the NGG GS itself. To achieve that, we use a plain GDS atomic operation. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3380>	2020-01-29 17:40:48 +01:00
Samuel Pitoiset	3c1f657f35	radv/gfx10: add a separate flag for creating a GDS OA buffer For implementing NGG GS queries, we decided to use GDS but GDS OA is only required for NGG streamout. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3380>	2020-01-29 17:40:46 +01:00
Samuel Pitoiset	83d1773a57	radv: update VK_KHR_imageless_framebuffer for Vulkan 1.2 Promoted to Vulkan 1.2 with the KHR suffix omitted. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2020-01-15 08:42:25 -06:00
Samuel Pitoiset	af883bf3dc	radv: update VK_KHR_draw_indirect_count for Vulkan 1.2 Promoted to Vulkan 1.2 with the KHR suffix omitted. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2020-01-15 08:42:25 -06:00
Samuel Pitoiset	5993f13b27	radv: update VK_KHR_create_renderpass2 for Vulkan 1.2 Promoted to Vulkan 1.2 with the KHR suffix omitted. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2020-01-15 08:42:25 -06:00
Bas Nieuwenhuizen	7cc0702bbb	radv: Emit a BATCH_BREAK when changing pixel shaders or CB_TARGET_MASK. Fixes a hang on Raven with Resident Evil 2. I did not find anything more restricted to fix it: - Setting persistent_states_per_bin to 1 fixes it too, but likely does an internal break on any descriptor set changes too. - Only breaking the batch when cb_target_mask changes does not fix it (and looking at AMDVLK comments, I suspect the code in radeonsi should really be doing a FLUSH_DFSM). - Always doing a FLUSH_DFSM on shader switch helps, but that is more often than this and I don't think we should be doing that when DFSM is disabled. - Also emitting the existing break on framebuffer change when DFSM is disabled does not fix the issue. Closes: https://gitlab.freedesktop.org/mesa/mesa/issues/2315 CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2020-01-07 22:44:31 +01:00
Samuel Pitoiset	13b4e9adcf	ac: declare an enum for the OOB select field on GFX10 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3147> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3147>	2019-12-19 15:15:32 +01:00
Samuel Pitoiset	f3cccd05d9	radv/gfx10: fix the out-of-bounds check for vertex descriptors When stride is 0, it should check against the offset not the index. This fixes black character models with Beat Saber and missing snow with Dragon Quest. Closes: https://gitlab.freedesktop.org/mesa/mesa/issues/2233 Closes: https://gitlab.freedesktop.org/mesa/mesa/issues/1975 Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3147>	2019-12-19 15:15:30 +01:00
Samuel Pitoiset	e4c8491bdf	radv: implement VK_KHR_separate_depth_stencil_layouts Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-12-10 13:16:17 +01:00
Samuel Pitoiset	41cebfc9c1	radv: do not init HTILE as compressed state when dst layout allows it I don't think this makes much differences and a potential clear following the initialization will overwrite HTILE anyways. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-12-10 13:09:26 +01:00
Samuel Pitoiset	008fe909ca	radv: fix possibly wrong PA_SC_AA_CONFIG value for conservative rast PA_SC_AA_CONFIG might be updated when conversative rasterization is enabled. Because the driver only re-emits the multisample state if the number of samples is different, that register value might not be updated correctly. Found by inspection, doesn't fix anything known. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-12-10 11:04:43 +01:00
Samuel Pitoiset	4f659224c8	radv: move emission of two PA_SC_* registers to the pipeline CS They don't have to be updated dynamically. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-12-10 11:04:40 +01:00
Daniel Schürmann	21f67a3bdc	radv: only flush scalar cache for SSBO writes with ACO on GFX8+ Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>	2019-12-07 11:23:11 +01:00
Bas Nieuwenhuizen	25bc9102d8	radv: Allocate cmdbuffer space for buffer marker write. Fixes: `946193ae00` "radv: add support for VK_AMD_buffer_marker" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-11-26 09:35:02 +00:00
Samuel Pitoiset	9dec90b7bc	radv: set the image view aspect mask during subpass transitions No functional changes because the aspect mask is still not used during image transitions but it will be needed for the separate depth/stencil aspects logic. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-25 16:29:13 +01:00
Bas Nieuwenhuizen	4eb2a1dc6f	radv: Do not change scratch settings while shaders are active. When the scratch ringbuffer settings are changed, the shader unit has to be idle or we will have shaders using old and new settings. That combination is not supported on the HW (likely the offset is ringbuffer idx * WAVESIZE * 1024). CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-11-20 01:18:36 +00:00
Samuel Pitoiset	f010b90ac5	radv/gfx10: enable wave32 for compute based on shader's wavesize This will allow to change wavesize on-demand. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-06 09:20:30 +01:00
Timothy Arceri	7f106a2b5d	util: rename list_empty() to list_is_empty() This makes it clear that it's a boolean test and not an action (eg. "empty the list"). Reviewed-by: Eric Engestrom <eric@engestrom.ch>	2019-10-28 11:24:38 +00:00
Samuel Pitoiset	4b17311e52	radv: compute the number of records correctly for vertex buffers On GFX8 the number of records is in bytes while on other chips it's in units of "stride". Fixes dEQP-VK.robustness.vertex_access..draw.vertex_ on RAVEN. Tested on GFX6, GFX8, GFX10 and RAVEN. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-10-24 17:14:43 +02:00
Samuel Pitoiset	956d825ed8	radv: do not emit rbplus if attachments are undefined Fixes some crashes with dEQP-VK.geometry.layered.*.secondary_cmd_buffer on Raven and other chips that allow rbplus. This just prevents a crash and rbplus probaby needs more work. Cc: 19.2 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-10-23 08:57:31 +02:00
Samuel Pitoiset	a13320370e	radv: fix updating bound fast ds clear values with different aspects On GFX9, the driver is able to do an optimized fast depth/stencil clear with only one aspect (ie. clear the stencil part of a depth/stencil image). When this happens, the driver should only update the clear values of the given aspect. Note that it's currently only supported on GFX9 but I have some local patches that extend this optimized path for other gens. Closes: https://gitlab.freedesktop.org/mesa/mesa/issues/1967 Cc: 19.2 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-10-22 11:16:13 +02:00
Bas Nieuwenhuizen	fd21ee8b52	radv: Fix single stage constant flush with merged shaders. e.g. a VERTEX only flush with tess on Vega should look at the TCS to see which bits are needed. CC: <mesa-stable@lists.freedesktop.org> Closes: https://gitlab.freedesktop.org/mesa/mesa/issues/1953 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-10-18 10:49:29 +00:00
Bas Nieuwenhuizen	c837872fba	radv: Fix warning in 32-bit build. uintptr_t is 32 bits in a 32-bits build, resulting in shifting out of bounds. Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-10-03 13:06:08 +00:00
Samuel Pitoiset	56e1b1ff0c	radv/gfx10: add missing counter buffer to the BO list The buffer isn't necessarily used before. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-10-02 18:09:25 +02:00
Daniel Schürmann	a70a998718	radv/aco: Setup alternate path in RADV to support the experimental ACO compiler LLVM remains default and ACO can be enabled with RADV_PERFTEST=aco. Co-authored-by: Daniel Schürmann <daniel@schuermann.dev> Co-authored-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-19 12:10:00 +02:00
Bas Nieuwenhuizen	f2dffb395f	radv: Only break batch on framebuffer change with dfsm. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-09-18 21:28:51 +00:00
Samuel Pitoiset	46b7512b0a	radv: fix writing depth/stencil clear values to image Use the fastest way only if both aspects are used. Oops. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=111728 Fixes: `218ce34962` ("radv: add mipmap support for the clear depth/stencil values") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-18 13:27:46 +02:00
Samuel Pitoiset	63b20fb0cf	radv/gfx10: make sure to wait for idle before clearing GDS Otherwise the next streamout operation will overwrite GDS. This can be improved by tracking if there is a streamout operation in flight. Currently the driver unconditionally flushes but that doesn't matter much as NGG streamout is disabled by default. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-16 12:08:22 +02:00
Samuel Pitoiset	7314f6ef97	radv/gfx10: make GDS idle when leaving the IB NGG streamout uses GDS and we have to make sure that another process isn't going to overwrite GDS while our shaders are busy. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-16 12:08:22 +02:00
Samuel Pitoiset	b617156621	radv/gfx10: compute the correct buffer size for NGG streamout It's used to determined the max emit per buffer. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-16 12:08:22 +02:00
Samuel Pitoiset	e1dc3ab753	radv/gfx10: allocate GDS/OA buffer objects for NGG streamout This allocates two BOs for GFX10 NGG streamout. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-16 12:08:22 +02:00
Samuel Pitoiset	957c3436fa	radv/gfx10: implement NGG streamout begin/end functions Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-16 12:08:22 +02:00
Samuel Pitoiset	a15b3bcf1a	radv/gfx10: add an option to switch from legacy to NGG streamout This internal option is turned off by default because NGG streamout still hangs. It seems like it's related to GDS as RadeonSI. That option will be turned on once all issues are resolved. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-16 12:08:22 +02:00
Samuel Pitoiset	83499ac765	radv: merge radv_shader_variant_info into radv_shader_info Having two different structs is useless. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-06 15:52:03 +02:00
Samuel Pitoiset	021feb1bf6	ac: add rbplus_allowed to ac_gpu_info Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-27 08:04:41 +02:00
Samuel Pitoiset	20c5db02b5	ac: add has_tc_compat_zrange_bug to ac_gpu_info Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-27 08:04:36 +02:00
Samuel Pitoiset	b55919cf2a	ac: add has_gfx9_scissor_bug to ac_gpu_info Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-27 08:04:32 +02:00
Samuel Pitoiset	ed720af46d	ac: add has_load_ctx_reg_pkt to ac_gpu_info Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-27 08:04:22 +02:00
Samuel Pitoiset	44a46c09de	ac: add has_dcc_constant_encode to ac_gpu_info Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-27 08:04:16 +02:00
Samuel Pitoiset	218ce34962	radv: add mipmap support for the clear depth/stencil values Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-08-26 15:56:59 +02:00
Samuel Pitoiset	e36e260c42	radv: add mipmap support for the TC-compat zrange bug Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-08-26 15:56:55 +02:00
Samuel Pitoiset	76812339f7	radv: decompress mipmapped depth/stencil images during transitions Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-08-26 15:56:48 +02:00
Samuel Pitoiset	89671ef205	radv: fix getting the index type size for uint8_t 16-bit and 32-bit values match hardware values but 8-bit doesn't. This fixes dEQP-VK.pipeline.input_assembly.* with 8-bit index. Fixes: `372c3dcfdb` ("radv: implement VK_EXT_index_type_uint8") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl	2019-08-26 09:23:23 +02:00
Bas Nieuwenhuizen	e040c1b274	radv: Do not setup attachments without a framebuffer. Test that found this: dEQP-VK.geometry.layered.1d_array.secondary_cmd_buffer Fixes: `49e6c2fb78` "radv: Store color/depth surface info in attachment info instead of framebuffer." Reviewed-by: Dave Airlie <airlied@redhat.com>	2019-08-12 17:19:24 +02:00
Bas Nieuwenhuizen	3a5950f501	radv: Add device argument for dcc compression check. Because it is about to be generation dependent. Reviewed-by: Dave Airlie <airlied@redhat.com>	2019-08-07 02:13:07 +02:00
Bas Nieuwenhuizen	66131ceb8b	radv: Pass through render loop detection to internal layout decisions. And do nothing with it yet. Everything outside a renderpass has no render loop. Reviewed-by: Dave Airlie <airlied@redhat.com>	2019-08-07 02:13:07 +02:00
Bas Nieuwenhuizen	9475782eac	radv: Implement VK_KHR_imageless_framebuffer. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-08-02 22:35:19 +02:00
Bas Nieuwenhuizen	a7041f3b4e	radv: Store image view also outside framebuffer. So we can use it with imageless framebuffers. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-08-02 22:19:16 +02:00
Bas Nieuwenhuizen	49e6c2fb78	radv: Store color/depth surface info in attachment info instead of framebuffer. That way we can use it for imageless framebuffers. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-08-02 22:18:51 +02:00
Samuel Pitoiset	7368000868	radv: re-apply "Optimize rebinding the same descriptor set." This makes it cheaper to just change the dynamic offsets with the same descriptor sets. This optimization has been reverted a while back because of random GPU hangs on GFX9, no it looks fine, at least CTS no longer hangs on GFX9 and it doesn't hang on GFX10 as well. It fixes a performance problem with Wolfenstein Youngblood. Suggested-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2019-08-02 09:56:55 +02:00
Samuel Pitoiset	0e1724af61	radv/gfx10: implement a bug workaround for NGG -> legacy transitions Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-31 12:14:29 +02:00
Samuel Pitoiset	29cca5f381	radv: skip draw calls with 0-sized index buffers Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-31 12:14:29 +02:00
Eric Engestrom	abc226cf41	tree-wide: replace MAYBE_UNUSED with ASSERTED Suggested-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-31 09:41:05 +01:00
Samuel Pitoiset	372c3dcfdb	radv: implement VK_EXT_index_type_uint8 Natively supported on VI+. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-29 23:36:53 +02:00
Samuel Pitoiset	fd195d8085	radv/gfx10: update streamout descriptors Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-24 08:23:27 +02:00
Bas Nieuwenhuizen	4058b354c5	radv: Set FLUSH_ON_BINNING_TRANSITION. Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-07-23 21:26:59 +02:00
Bas Nieuwenhuizen	906fcfccfd	radv: Use pbb_allow for framebuffer BREAK_BATCH. Ported from radeonsi. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2019-07-23 21:26:59 +02:00
Samuel Pitoiset	8c97a07967	radv/gfx10: do not allocate space for the ZPASS_DONE bug GFX10 isn't affected. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-22 09:02:35 +02:00
Samuel Pitoiset	e7c356866e	radv: change a bunch of >= GFX9 to == GFX9 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-22 09:02:26 +02:00
Samuel Pitoiset	ed53d2c4be	radv/gfx10: disable the TC compat zrange workaround Unnecessary. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-17 08:32:36 +02:00
Samuel Pitoiset	ae4b1fc095	radv/gfx10: always build the GS copy shader but uses it on-demand It should be possible to build it on-demand too but it requires more work. On GFX10, the GS copy shader is required when tess is enabled with extreme geometry. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-17 08:32:30 +02:00
Samuel Pitoiset	afa102d65b	radv: add radv_emit_streamout_{begin,end} helpers Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2019-07-16 11:17:00 +02:00
Samuel Pitoiset	4dcdc4cdc5	radv: allow to select DST_SEL with RELEASE_MEM Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2019-07-16 11:16:57 +02:00
Samuel Pitoiset	b393b2ce95	radv/gfx10: emit DISABLE_CONSERVATIVE_ZPASS_COUNTS Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-12 17:47:12 +02:00
Samuel Pitoiset	37aefb2be1	radv/gfx10: invalidate everything in L2 when shaders read data This includes metadata as well. On GFX10, we have to invalidate the L2 metadata cache when shaders read DCC. Note that we still have to implement GFX10 coherency by introducing INV_L2_METATADA but for now just flush L2. This fixes a corruption with DCC and Talos. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-12 14:08:12 +02:00
Samuel Pitoiset	ffd6a979bf	radv/gfx10: update OVERWRITE_COMBINER_{MRT_SHARING,WATERMARK} DCC related, mirror RadeonSI. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com	2019-07-12 08:19:53 +02:00
Bas Nieuwenhuizen	45b73b3aa9	radv/gfx10: Do not allocate a gs_copy_shader on gfx10. Will use ngg for any gs anyway. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-07-11 15:45:47 +02:00
Bas Nieuwenhuizen	d0978427cb	radv/gfx10: Use new uconfig reg index packet for GFX10+. Otherwise the hardware/firmware seems to not set the registers. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-07-07 17:51:32 +02:00
Samuel Pitoiset	67b6888d8b	radv/gfx10: emit GE_CNTL instead of IA_MULTI_VGT_PARAM for legacy mode Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-07 17:51:32 +02:00
Samuel Pitoiset	12a42c2d9f	radv/gfx10: implement radv_flush_vertex_descriptors() change Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-07 17:51:32 +02:00
Samuel Pitoiset	ebeb319f0e	radv/gfx10: implement radv_CmdBindDescriptorSets() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-07 17:51:31 +02:00
Samuel Pitoiset	2435b571de	radv/gfx10: update DB_Z_INFO register GFX10 uses the same register as GFX8. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-07 17:03:38 +02:00
Samuel Pitoiset	17048c1765	radv/gfx10: implement radv_emit_fb_ds_state() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-07 17:03:38 +02:00
Samuel Pitoiset	c2a5d98148	radv/gfx10: implement radv_emit_fb_color_state() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-07 17:03:38 +02:00
Bas Nieuwenhuizen	c6cb9b197d	radv: Support VK_EXT_queue_family_foreign. Basically same as external for now. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Only case we might need to handle differently in the near future is Raven's case of displayable DCC which is not renderable. But we don't support that yet.	2019-07-03 10:56:21 +00:00
Samuel Pitoiset	6baa453dd5	radv: remove unused code in radv_update_tc_compat_zrange_metadata() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-03 08:51:58 +02:00
Samuel Pitoiset	e41e575e24	radv: implement clearing DCC layers on GFX8 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-02 09:37:56 +02:00

1 2 3 4 5 ...

766 Commits