KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Timur Kristóf	c9f8be9d58	ac/llvm: Implement load_num_subgroups for NGG shaders. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17706>	2022-07-22 08:17:39 +00:00
Konstantin Seurer	1a95d43e55	radv: Simplify the meta init fail path Move most of the the cleanup into radv_device_init_meta. Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17652>	2022-07-22 07:51:14 +00:00
Konstantin Seurer	b06e5535c0	radv: Use RADV_META_SUSPEND_PREDICATING Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17625>	2022-07-22 07:25:09 +00:00
Konstantin Seurer	fdeca2ca0b	radv: Add meta saving/restoring for predicating There are a bunch of places, where this is done manually. Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17625>	2022-07-22 07:25:09 +00:00
Timur Kristóf	dd781c1ccb	ac/nir/ngg: Create output variable for primitive ID export. This makes the RADV/LLVM backend happy and mitigates a crash. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17581>	2022-07-21 21:53:29 +00:00
Timur Kristóf	22796d91ea	aco: Remove hack for primitive ID export. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17581>	2022-07-21 21:53:29 +00:00
Timur Kristóf	b0a7db1d3b	ac/nir/ngg: Move primitive ID workgroup barrier to proper place. Previously, it was in a divergent branch, therefore it could hang the GPU when a workgroup had a primitive-only wave. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17581>	2022-07-21 21:53:29 +00:00
Qiang Yu	754e43369d	ac/nir/ngg: Decouple primitive ID store and primitive export. There's no dependency between them. This can simplify the compiler backend translation by always storing prim id before vertex export, which also benefits the LLVM backend in latter changes. Signed-off-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17581>	2022-07-21 21:53:29 +00:00
Georg Lehmann	87e3277b82	nir: Rewrite and merge 16bit tex folding pass with 16bit image folding pass. Allow folding constants/undef sources by sharing more code with the image_store 16bit folding pass. Allow more than one set of sources because RADV wants two, one for G16 (ddx/ddy) and one for A16 (all other sources). Allow folding cube sampling destination conversions on radeonsi/radv because I think the limitation only applies to sources. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16978>	2022-07-21 19:15:03 +00:00
Konstantin Seurer	7e7e01fe0c	radv: Use desc->format Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17490>	2022-07-21 12:48:01 +00:00
Konstantin Seurer	ae67294479	radv: Remove format desc null checks Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17490>	2022-07-21 12:48:01 +00:00
Samuel Pitoiset	85e859ac5f	radv: only force 1x sample for Bresenham lines when pipeline draws lines Otherwise, this would affect non-line draws. While we are at it, adjust a comment. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6303 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17657>	2022-07-21 08:05:35 +02:00
Timur Kristóf	4eb7e53963	radv: Enable task shader feature for NV_mesh_shader. Still hidden behind RADV_PERFTEST=nv_ms but now advertises task shader support too. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:31 +00:00
Timur Kristóf	28db493cdc	radv: Copy BO list to ACE internal CS. This is necessary to make sure the ACE internal cmdbuf can access the same memory as the GFX cmdbuf. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	5a1cbafd9d	radv: Submit internal compute cmdbuf. Use scheduled dependencies to create two submissions: first we submit ACE then GFX. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	eeb8366bbd	radv: Workaround MEC taskmesh dispatch hang when count buffer has zero. The DISPATCH_TASKMESH_INDIRECT_MULTI_ACE packet has a firmware bug, it hangs the GPU when the draw count is zero. This commit adds a workaround sequence using COND_EXEC packets which make sure that this indirect packet is never executed when the draw count is zero. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	7b8f087f6b	radv: Support task shaders in secondary cmd buffers. Special consideration is needed to keep ACE and GFX in sync. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	4c6f83006d	radv: Synchronization for task shaders. Add a separate flush_bits field for tracking cache flushes in the ACE internal cmdbuf. In barriers and image transitions we add these flush bits to ACE. Create a semaphore in the upload BO which makes it possible for ACE to wait for GFX for the purpose of synchronization. This is necessary when a barrier needs to block task shaders. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	2479b62869	radv: Implement mesh shading draw calls with task shaders. This implements NV_mesh_shader draw calls with task shaders. - On the GFX side: DISPATCH_TASKMESH_GFX for all draws - On the ACE side: DISPATCH_TASKMESH_DIRECT_ACE for direct draws DISPATCH_TASKMESH_INDIRECT_MULTI_ACE for indirect draws Additionally, the NV_mesh_shader indirect BO layout is incompatible with AMD HW, so we add a function that copies that into a suitable layout. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	99179f8490	radv: Introduce radv_before_taskmesh_draw. This includes additional code that takes care of handling the internal ACE cmdbuf. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	79287f5ae6	radv: Flush descriptors and push constants for task shaders. Task shaders are executed on the internal compute cmdbuf, so they need special consideration. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	ef07c3272a	radv: Create internal cmdbuf when a graphics pipeline needs compute. This is mainly going to be used by task shaders, because the HW implementation mismatches the API: - In the API, task shaders are considered graphics shaders which are part of a graphics pipeline and the draws are submitted to a graphics queue. - The HW requires the driver to dispatch task shaders on an async compute queue. When a pipeline is bound that has a task shader, create a driver-internal ACE (async compute engine) cmdbuf which we are going to submit to an ACE queue. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	44a71594c8	radv: Add dispatch_initiator_task field to radv_device. This is going to be used with task shader dispatches. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	822e370390	radv: Allow reusing pipeline compute state emit functions. We are going to reuse them outside of radv_pipeline. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	bc4012d7ed	radv: Minor cleanup of radv_queue_submit_normal. - Move the uses_perf_counters ternary expression out of the loop into a variable called cs_offset. - Constify cmd_buffer_count. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	d718bea92d	radv: Move inline push constants to a new function. This cleans up radv_flush_constants and also the new function will be reused later. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	c4d60df380	radv: Refactor view index emit to use a per-stage function. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	99b7521504	radv: Refactor radv_emit_inline_push_consts to work with radeon_cmdbuf. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	19fcf5a4cb	radv: Refactor radv_emit_descriptor_pointers to work with radeon_cmdbuf. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	21e6d596ed	radv: Refactor radv_emit_userdata_address to work with radeon_cmdbuf. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	aadb3d69d1	radv: Refactor some CP DMA functions to work with radeon_cmdbuf. Allow emitting these packets without a radv_cmd_buffer object. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	e10cbb5d98	radv: Refactor predication for compute queues. Initialize the inverted predication VA only when it is used for the first time. This is needed to get conditional rendering work correctly with task shaders because the internal compute cmdbuf may not exist yet when conditional rendering starts. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Georg Lehmann	333f056edf	radv, aco: Don't lower 16bit isign. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17440>	2022-07-20 14:31:15 +00:00
Georg Lehmann	4be6164ac9	radv: Remove ineg from lower_bit_size_callback callback. It's always lowered to isub. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17440>	2022-07-20 14:31:15 +00:00
Georg Lehmann	b96126ee95	radv,aco: Don't lower and vectorize 16bit iabs. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17440>	2022-07-20 14:31:15 +00:00
Rhys Perry	37dda55f46	aco: remove unnecessary exp_cnt increments update_counters() already does this. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17257>	2022-07-20 14:04:15 +00:00
Vinson Lee	7384aab3ee	aco: Initialize spill_ctx members in constructor. Fix defects reported by Coverity Scan. Uninitialized scalar field (UNINIT_CTOR) uninit_member: Non-static class member sgpr_spill_slots is not initialized in this constructor nor in any functions that it calls. uninit_member: Non-static class member vgpr_spill_slots is not initialized in this constructor nor in any functions that it calls. Fixes: `7d34044908` ("aco: refactor VGPR spill/reload lowering") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17583>	2022-07-20 13:39:12 +00:00
Rhys Perry	0fb3aa005f	radv: don't update misaligned_mask in CmdBindVertexBuffers2 This can't do it in the loop because it doesn't easily know what attributes use a binding. We could do it in a separate loop, but there's no point, especially since zink does CmdSetVertexInputEXT() after CmdBindVertexBuffers2(). Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Fixes: `c335a4d70e` ("radv: dynamically calculate misaligned_mask for dynamic vertex input") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17521>	2022-07-19 20:40:20 +00:00
Rhys Perry	1d019d2ab7	radv: fix vbo_bound_mask indexing Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6812 Fixes: `1b8bdecf6e` ("radv: add a mask of bound descriptor buffers for dynamic vertex input") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17521>	2022-07-19 20:40:20 +00:00
Jason Ekstrand	b510ee0d22	Use vk_foreach_struct_const where needed We're about to make it so that the compiler warns/errors if you use the wrong iterator macro. Fix up a bunch of places where someone used the wrong one before we break anything. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17630>	2022-07-19 19:55:17 +00:00
Samuel Pitoiset	e840ba9ed8	aco: requires Exact for p_jump_to_epilog Otherwise, in presence of p_exit_early_if the main FS will always jump to the PS epilog regardless the exact mask. This fixes dEQP-VK.draw.renderpass.shader_invocation.helper_invocation and few vkd3d-proton regressions when PS epilogs are forced. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17617>	2022-07-19 17:52:36 +00:00
Daniel Schürmann	ac39e7bf23	aco: fix assertion in insert_exec_mask The exec mask might also be of type mask_type_loop. Fixes: `d068eb53e8` ('aco/insert_exec_mask: optimize top-level transition to exact before demote') Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17402>	2022-07-19 16:56:37 +00:00
Daniel Schürmann	6de68c5dca	aco: Avoid live-range splits in Exact mode Because the data register of atomic VMEM instructions is shared between src and dst, it might be necessary to create live-range splits during RA. Make the live-range splits explicit in WQM mode. Totals from 7 (0.01% of 134913) affected shaders: (GFX10.3) Latency: 17209 -> 17210 (+0.01%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15347>	2022-07-19 16:30:49 +00:00
Daniel Schürmann	f12eb5c213	aco: avoid unnecessary copies in emit_wqm() No fossil-db changes. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15347>	2022-07-19 16:30:49 +00:00
Konstantin Seurer	d4ca5942be	ac/llvm: Remove load_vertex_id handling Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17539>	2022-07-19 13:26:09 +00:00
Konstantin Seurer	83ccc810b4	aco: Remove dead nir_intrinsic_load_vertex_id case This intrinsic is lowered in NIR. Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17539>	2022-07-19 13:26:09 +00:00
Konstantin Seurer	f90babb567	radv: Use nir_gen_rect_vertices Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17535>	2022-07-19 12:47:30 +00:00
Samuel Pitoiset	203afc9351	radv: disable viewport depth clamping only when necessary When the application uses depth values outside of the [0.0,1.0] range with VK_EXT_depth_range_unrestricted, or when explicitly disabled. Otherwise, the driver can clamp to [0.0,1.0] internally for optimal performance. From the Vulkan spec "in 28.10.1. Depth Clamping and Range Adjustment": "If depth clamping is not enabled and zf is not in the range [0, 1] and either VK_EXT_depth_range_unrestricted is not enabled, or the depth attachment has a fixed-point format, then zf is undefined following this step." Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16349>	2022-07-19 09:00:27 +00:00
Konstantin Seurer	fc26fbde3d	vulkan: Common vk_format_get_component_bits RADV and PowerVR use the same implementation. Turnip does use a slightly modified version but the helper only has one use -> just inline it and get rid of turnip's vk_format.h. Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17515>	2022-07-18 22:14:06 +00:00
Samuel Pitoiset	5ee5c73d2d	radv: implement PS epilogs Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	270cc39648	aco: add support for compiling PS epilogs Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	8d13392969	aco: refactor export_fs_mrt_color() for PS epilogs preparation Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	df8fb721a5	radv,aco: rename radv_aco_build_prolog to radv_aco_build_shader_part Will be re-used for PS epilogs. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	897561b7b9	aco: add aco_postprocess_shader() helper Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	d9ffff09b0	aco: prevent adding DONE/VM to the last export if the FS has an epilog If the fragment shader exports MRTZ and the epilog some color exports, DONE/VM should be added to the last export. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	2784bfe93f	aco: do not abort if the FS doesn't export anything but has an epilog The main fragment shader can only export MRTZ (if present) and the epilog will export colors. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	a6dff6caa1	aco: emit p_jump_to_epilog if the main fragment shader has an epilog MRTZ is still exported from the main shader. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	8bdcc20815	aco: add new pseudo instruction p_jump_to_epilog The first operand of this new pseudo-instruction is a 64-bit SGPR for the continue PC, followed by a variable list of fixed VGPRS for the color exports which are the PS epilog inputs. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	0fd3754c26	radv: add a function that declares PS epilog shader arguments The PS epilog would be a "normal" compiled shader using RA, etc. It will declare up to 8x4 VGPRs for all color exports. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	a38db1a94e	radv: declare a new user SGPR arg in FS for the epilog PC The main FS would have to jump to the PC of the PS epilog. Given that shaders are allocated in the 32-bit addr space, one user SGPR is fine. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	0db7a0b6e8	radv,aco: introduce {radv,aco}_ps_epilog_key To pass the necessary pipeline information for compiling PS epilogs. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	eee098486a	radv,aco: track if a fragment shader needs an epilog This is currently disabled but it will be used for testing first, and then for graphics pipeline libraries. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Konstantin Seurer	0580910aa9	radv: Only set rt stack size for dynamic stacks When using a static callable stack, the required scratch has already been allocated. Dynamic stacks are located at the end of scratch memory and are allocated on demand using radv_set_rt_stack_size. Static stacks live at the start of scratch memory and are allocated in create_rt_shader by setting scratch_size. Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-By: Tatsuyuki Ishi <ishitatsuyuki@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17579>	2022-07-18 12:05:35 +00:00
Qiang Yu	eeaf0b1888	ac/nir/ngg: add a barrier before prim id export When culling enabled, it will use LDS space, which overlap with the prim id export. Fixes: `e97f0463a8` ("ac/nir: Implement NGG deferred attribute culling in NIR.") Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17593>	2022-07-18 09:50:09 +00:00
Qiang Yu	0b7ef846b3	ac/nir/ngg: fix nogs culling scratch size Should be in bytes not dwords. Fixes: `e97f0463a8` ("ac/nir: Implement NGG deferred attribute culling in NIR.") Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17593>	2022-07-18 09:50:09 +00:00
Timur Kristóf	5050db0b55	radv: Remove trailing whitespace introduced by DGC commits. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17580>	2022-07-18 08:23:41 +00:00
Timur Kristóf	b732285312	radv: Only initialize DGC state when DGC is enabled. This function causes a crash with RADV_DEBUG=llvm and this commit works around that crash. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17580>	2022-07-18 08:23:41 +00:00
Arvind Yadav	8adbd2a964	ac/llvm: Implement nir_intrinsic_load_point_coord_maybe_flipped opcodes Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15117>	2022-07-16 07:08:10 -04:00
Arvind Yadav	689559d10f	ac/llvm : Adding Number of all interpolated inputs in ac_shader_abi Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15117>	2022-07-16 07:08:10 -04:00
Tatsuyuki Ishi	648731e2bd	radv: Only set pstate for the first hw_ctx. We used to do it for every queue, which was duplicate work as pstate is per-device. It could also cause trouble when multiple hw_ctx are created as the call will succeed for only one of them and the rest will return -EBUSY. Simplify and fix this by only setting for the first non-null hw_ctx. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17541>	2022-07-15 15:08:07 +00:00
Bas Nieuwenhuizen	2d2591bbb7	radv: Expose VK_NV_device_generated_commands. Closes: #6736 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	05bf39238b	radv: Add stub for vkCmdBindPipelineShaderGroupNV. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	3f09bd5a0e	radv: Implement CmdExecuteGeneratedCommandsNV. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	d7bd9db9fc	radv: Implement DGC cmdbuffer generation. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	37a619f517	radv: Implement DGC generated command layout structure. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	0c7bb92a78	radv: Add DGC meta shader. This generated the cmd and upload buffers. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	848d3fdeb6	radv: Add flushing for DGC. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	df69b34450	radv: Add helper to write scissors. For use by DGC shader. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	57017b494d	radv: Make radv_get_vgt_index_size non-static. For DGC cmdbuffer generation use. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	f6a21fccf9	radv: Expose helper for base pa_su_sc_mode_cntl. So that we can feed it to the DGC shader for front face overrides. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	b213de12d3	radv: Require 32bit memory for indirect buffers. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	cca680bab1	radv: Always store stride in the vbo descriptor. So we can use it in the DGC shader. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	bce3af2cb3	radv: Expose function to write vertex descriptors for dgc. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	82c2e99102	radv: Skip setting empty index buffers to avoid hang In the direct path we already skipped draws, but in DGC I noticed that just emitting these packets can cause issues ... Cc: mesa-stable Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	f27f06d72c	radv: Add a 32bit memory type. Got to put the commandbuffers & uploadbuffers there. With DGC those can be allocated by the application. Excluding it from all other buffers/images to avoid using the precious 32bit address space. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Konstantin Seurer	82283de717	radv: Use a global address for sbt_base Required for indirect(2) ray tracing to work. Fixes the following tests: dEQP-VK.ray_tracing_pipeline.trace_rays_indirect2.indirect_* dEQP-VK.ray_tracing_pipeline.trace_rays_cmds_maintenance_1.indirect2_* Fixes: `16585664` ("radv: vkCmdTraceRaysIndirect2KHR") Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17316>	2022-07-14 16:35:31 +00:00
Konstantin Seurer	69daa3f762	radv: Use a global address for ray_launch_size Required for indirect(2) ray tracing to work. Fixes: `b30f96dd` ("radv,aco: Use ray_launch_size_addr") Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17316>	2022-07-14 16:35:31 +00:00
Lionel Landwerlin	a41e8dc588	spirv: switch to uint64 for rayquery internal type Fixes dEQP-VK.ray_query.advanced.using_wrapper_function.comp.* An empty struct is causing problems because when passing it as argument the spirv parser will just drop the argument, considering it does not hold any data. v2: update radv CI Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `4c703686db` ("spirv: handle ray query intrinsics") Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17420>	2022-07-14 09:15:52 +00:00
Jason Ekstrand	4f3bf712cf	radv: Set uses_sample_shading for copy shaders Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Dave Airlie	105279e989	radv: add a dynamic vertex format cache. With dynamic vertex bindings the vertex format lookups are a lot more frequent (vs being baked in the pipeline). Add a simple lookup cache using a dynamic array to keep track of the hw values, and avoid repeated translation. This also reduces the memset to just the bitfields since all the others will be overwritten. Seen in perf traces gputest gimark with zink on radv. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15846>	2022-07-13 01:10:09 +00:00
Eric Engestrom	9035408d62	radv: use updated tokens from vk.xml Signed-off-by: Eric Engestrom <eric@igalia.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17342>	2022-07-12 15:53:11 +00:00
Konstantin Seurer	e8d12bc2b7	radv: Fix acceleration structure size queries From looking at the CTS, VK_QUERY_TYPE_ACCELERATION_STRUCTURE_SIZE_KHR refers to the serialization size and not to the actual, current size. Fixes the following CTS: dEQP-VK.ray_tracing_pipeline.acceleration_structures.query_pool_results.cpu.buffer.size dEQP-VK.ray_tracing_pipeline.acceleration_structures.query_pool_results.cpu.memory.size dEQP-VK.ray_tracing_pipeline.acceleration_structures.query_pool_results.gpu.buffer.size dEQP-VK.ray_tracing_pipeline.acceleration_structures.query_pool_results.gpu.memory.size Fixes: `5d56c2c` ("radv: Add accel struct queries for maintenance1") Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17444>	2022-07-12 12:30:42 +00:00
Bas Nieuwenhuizen	97641e5c94	radv: Add ability to override the build id for the cache. This would allow us to keep the shader cache key the same for updates that we know won't impact compilation on SteamOS. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17426>	2022-07-12 12:08:12 +00:00
Konstantin Seurer	d528289c74	radv: Skip pipeline_no_null_shaders_flag tests Those tests either fail or hang so just exclude all of them for now to make ray tracing CTS usable again. Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17443>	2022-07-12 09:36:29 +00:00
Samuel Pitoiset	e9b2fa6527	radv: fix wide points/lines by configuring the guardband correctly Fixes all remaining wide points/lines failures with Zink. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6121 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17392>	2022-07-12 09:11:40 +02:00
Daniel Schürmann	66d46a23fb	aco: fix packed 16bit fneg/fsat optimization Make sure that the Operand is '1.0.xx'. Fixes: `b03be30e07` ('aco: optimize packed fneg') Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17395>	2022-07-11 10:10:00 +00:00
Tatsuyuki Ishi	11fd0e1dda	amd: Revert gfx10 addrlib changes These broke a bunch of RADV VK CTS tests. Revert the ADDR_SW_4KB_R_X changes from the commit to get CTS passing again. Fixes: `3514b73244` ("amd: update addrlib - trivial changes") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6844 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17453>	2022-07-11 09:10:16 +00:00
Daniel Schürmann	676700d660	radv/shader_info: fix load_frag_coord and load_sample_pos read masks Fixes: `a8c471f962` ('radv: gather more information about PS in the shader info pass') Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17417>	2022-07-11 06:36:45 +00:00
Samuel Pitoiset	30846f1596	radv: remove the radv_report_apu_as_dgpu workaround for Red Dead Redemption 2 This workaround looks actually broken. We added it in the past because otherwise the game would just report 3GiB of video memory (ie. size of GTT on SD). Though, with this workaround enabled, the game explodes in memory easily. One theory is that because we fake integrated GPUs as discrete GPUS, and because we report 6GiB of VRAM (ie. driver redistributes memory for small carveout), the game thinks there is 6GiB of VRAM only and then keep allocating stuff. People reported that the memory explosion is gone without this workaround applied and I confirmed this myself. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17421>	2022-07-11 06:04:03 +00:00
Marek Olšák	9a39da359e	ac/surface: expose all 64K_R_X and 256K_R_X modifiers on gfx11 Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17410>	2022-07-09 21:00:51 +00:00

1 2 3 4 5 ...

9687 Commits