KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Timur Kristóf	4c6f83006d	radv: Synchronization for task shaders. Add a separate flush_bits field for tracking cache flushes in the ACE internal cmdbuf. In barriers and image transitions we add these flush bits to ACE. Create a semaphore in the upload BO which makes it possible for ACE to wait for GFX for the purpose of synchronization. This is necessary when a barrier needs to block task shaders. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	2479b62869	radv: Implement mesh shading draw calls with task shaders. This implements NV_mesh_shader draw calls with task shaders. - On the GFX side: DISPATCH_TASKMESH_GFX for all draws - On the ACE side: DISPATCH_TASKMESH_DIRECT_ACE for direct draws DISPATCH_TASKMESH_INDIRECT_MULTI_ACE for indirect draws Additionally, the NV_mesh_shader indirect BO layout is incompatible with AMD HW, so we add a function that copies that into a suitable layout. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	99179f8490	radv: Introduce radv_before_taskmesh_draw. This includes additional code that takes care of handling the internal ACE cmdbuf. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	79287f5ae6	radv: Flush descriptors and push constants for task shaders. Task shaders are executed on the internal compute cmdbuf, so they need special consideration. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	ef07c3272a	radv: Create internal cmdbuf when a graphics pipeline needs compute. This is mainly going to be used by task shaders, because the HW implementation mismatches the API: - In the API, task shaders are considered graphics shaders which are part of a graphics pipeline and the draws are submitted to a graphics queue. - The HW requires the driver to dispatch task shaders on an async compute queue. When a pipeline is bound that has a task shader, create a driver-internal ACE (async compute engine) cmdbuf which we are going to submit to an ACE queue. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	44a71594c8	radv: Add dispatch_initiator_task field to radv_device. This is going to be used with task shader dispatches. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	822e370390	radv: Allow reusing pipeline compute state emit functions. We are going to reuse them outside of radv_pipeline. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	bc4012d7ed	radv: Minor cleanup of radv_queue_submit_normal. - Move the uses_perf_counters ternary expression out of the loop into a variable called cs_offset. - Constify cmd_buffer_count. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	d718bea92d	radv: Move inline push constants to a new function. This cleans up radv_flush_constants and also the new function will be reused later. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	c4d60df380	radv: Refactor view index emit to use a per-stage function. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	99b7521504	radv: Refactor radv_emit_inline_push_consts to work with radeon_cmdbuf. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	19fcf5a4cb	radv: Refactor radv_emit_descriptor_pointers to work with radeon_cmdbuf. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	21e6d596ed	radv: Refactor radv_emit_userdata_address to work with radeon_cmdbuf. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	aadb3d69d1	radv: Refactor some CP DMA functions to work with radeon_cmdbuf. Allow emitting these packets without a radv_cmd_buffer object. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Timur Kristóf	e10cbb5d98	radv: Refactor predication for compute queues. Initialize the inverted predication VA only when it is used for the first time. This is needed to get conditional rendering work correctly with task shaders because the internal compute cmdbuf may not exist yet when conditional rendering starts. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16531>	2022-07-20 19:00:30 +00:00
Juan A. Suarez Romero	7131a05f92	v3d/ci: update expected list Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17660>	2022-07-20 18:37:26 +00:00
Mike Blumenkrantz	ea92421298	zink: scale number of shader cache-get threads this should improve startup times for short tests Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17659>	2022-07-20 18:19:10 +00:00
Mike Blumenkrantz	153983328d	zink: handle null samplerview/image rebinds more gracefully fixes a crash in nine cc: mesa-stable Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17656>	2022-07-20 17:48:03 +00:00
Mike Blumenkrantz	3c66617213	zink: move layout-setting for dynamic render attachments after no-op case this avoids unnecessarily terminating a renderpass to insert unnecessary barriers Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17640>	2022-07-20 17:30:19 +00:00
Mike Blumenkrantz	edab34f68d	zink: allow no-op renderpass updates in zink_batch_rp() in some cases it becomes desirable to "maybe" stop and start the current renderpass, such as when updates MAY result in layout changes for attachments for such cases, avoid splitting the renderpass unless it actually needs to be split Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17640>	2022-07-20 17:30:19 +00:00
Mike Blumenkrantz	4aec761596	zink: improve granularity of renderpass switching this should ensure that (future) renderpass recalcs will never split a renderpass unnecessarily Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17640>	2022-07-20 17:30:19 +00:00
Mike Blumenkrantz	f78919d36d	zink: add a bits union for zink_render_pass_state comparisons Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17640>	2022-07-20 17:30:19 +00:00
Mike Blumenkrantz	13951385b3	zink: reuse rt layout functions to refactor zink_render_pass_attachment_get_barrier_info() maybe slightly easier to read? Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17640>	2022-07-20 17:30:19 +00:00
Mike Blumenkrantz	c602e877a3	zink: break out color rt layout calc Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17640>	2022-07-20 17:30:19 +00:00
Mike Blumenkrantz	903060787e	zink: break out zs renderpass layout calc Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17640>	2022-07-20 17:30:19 +00:00
Mike Blumenkrantz	1ff18f0879	zink: break out renderpass attachment load op calc no functional changes Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17640>	2022-07-20 17:30:19 +00:00
Mike Blumenkrantz	1bc9db6879	zink: split out rp_changed to be more granular for dynamic render sometimes a state change MAY require a renderpass change, but this change will not require splitting the current renderpass Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17640>	2022-07-20 17:30:19 +00:00
Mike Blumenkrantz	9b087dcccd	zink: add an assert to verify that deferred clears aren't added in renderpass this would be very, very broken Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17640>	2022-07-20 17:30:19 +00:00
Mike Blumenkrantz	d9a320b62c	zink: remove redundant renderpass update check for clears this should be handled already in the clear code Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17640>	2022-07-20 17:30:19 +00:00
Mike Blumenkrantz	06b04a5c71	zink: unflag ctx->rp_changed after dynamic render update Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17640>	2022-07-20 17:30:19 +00:00
Mike Blumenkrantz	b31a8dfc85	zink: do dynamic render implicit clear updates on rp_changed this should be flagged correctly now Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17640>	2022-07-20 17:30:19 +00:00
Mike Blumenkrantz	5ae1c76baa	zink: flag renderpass change if renderpass clears change Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17640>	2022-07-20 17:30:18 +00:00
Mike Blumenkrantz	ca3d696f80	zink: don't flag renderpass change on depth write change as often this is only "sometimes" required and only if dynamic render isn't being used Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17640>	2022-07-20 17:30:18 +00:00
Mike Blumenkrantz	d079e3c9b6	zink: don't try to start a renderpass before CmdClearAttachments it should be impossible to reach this point if a renderpass is not active Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17640>	2022-07-20 17:30:18 +00:00
Mike Blumenkrantz	644f059a8d	llvmpipe: ignore multisample state for alpha_to_one this is more accurate and fixes usage with lavapipe cc: mesa-stable Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17627>	2022-07-20 15:41:14 +00:00
Mike Blumenkrantz	a884d1eb0e	vulkan/wsi: fix multiple acquires for sw without mit-shm in this case, lying about having multiple images and then returning the same image every time doesn't work, so use the busy flag and return an available image when possible Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17590>	2022-07-20 15:10:15 +00:00
Georg Lehmann	333f056edf	radv, aco: Don't lower 16bit isign. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17440>	2022-07-20 14:31:15 +00:00
Georg Lehmann	4be6164ac9	radv: Remove ineg from lower_bit_size_callback callback. It's always lowered to isub. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17440>	2022-07-20 14:31:15 +00:00
Georg Lehmann	b96126ee95	radv,aco: Don't lower and vectorize 16bit iabs. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17440>	2022-07-20 14:31:15 +00:00
Rhys Perry	37dda55f46	aco: remove unnecessary exp_cnt increments update_counters() already does this. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17257>	2022-07-20 14:04:15 +00:00
Vinson Lee	7384aab3ee	aco: Initialize spill_ctx members in constructor. Fix defects reported by Coverity Scan. Uninitialized scalar field (UNINIT_CTOR) uninit_member: Non-static class member sgpr_spill_slots is not initialized in this constructor nor in any functions that it calls. uninit_member: Non-static class member vgpr_spill_slots is not initialized in this constructor nor in any functions that it calls. Fixes: `7d34044908` ("aco: refactor VGPR spill/reload lowering") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17583>	2022-07-20 13:39:12 +00:00
Mike Blumenkrantz	687a5d8690	zink: use gfx_barrier for synchronization when binding buffer descriptors this will preserve existing shader stages in the access mask and avoid desync later on Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17639>	2022-07-20 09:21:30 -04:00
Mike Blumenkrantz	268011e8c6	zink: fix viewport count tracking the number of viewports in use is based on the outputs of the last vertex stage, not the viewports passed by the state tracker cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17639>	2022-07-20 09:21:30 -04:00
Mike Blumenkrantz	eaf11d3cd9	zink: call bind_last_vertex_stage() when binding vs ensure all the necessary updates are flagged cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17639>	2022-07-20 09:21:30 -04:00
Mike Blumenkrantz	011c02ce3c	zink: improve oom error message Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17639>	2022-07-20 09:21:30 -04:00
Mike Blumenkrantz	2aab0a9c16	zink: fix crashing on bo allocation failure list init occurs after mem allocation, so check this first cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17639>	2022-07-20 09:21:30 -04:00
Mike Blumenkrantz	19eddeb226	zink: handle null samplerview in get_imageview_for_binding() this is legal cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17639>	2022-07-20 09:21:30 -04:00
Mike Blumenkrantz	5b58f8df53	zink: handle max_vertices=0 in geometry shader this is a weird corner case where glsl permits a zero value, so clamp to 1 and then don't emit any vertices to avoid driver hangs affects: dEQP-GL45-ES31.functional.geometry_shading.emit.points_emit_0_end_0 cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17639>	2022-07-20 09:21:30 -04:00
Alejandro Piñeiro	efc827ceea	v3d/v3dv: use NIR_PASS(_ Instead of NIR_PASS_V, when possible. This was done recently on anv (see commit `ce60195ec` and MR#17014) Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17609>	2022-07-20 11:35:25 +00:00
Alejandro Piñeiro	0a50330c3d	broadcom/compiler: make several passes to return a progress Two advantages: * When using NIR_DEBUG=nir_print_xx, will print outcome only if there is a change * We can use NIR_PASS(_, ...) instead of NIR_PASS_V, that has slightly more validation checks. This includes: * v3d_nir_lower_image_load_store * v3d_nir_lower_io * v3d_nir_lower_line_smooth * v3d_nir_lower_load_store_bitsize * v3d_nir_lower_robust_buffer_access * v3d_nir_lower_scratch * v3d_nir_lower_txf_ms As we are here we also simplify some of them by using the nir_shader_instructions_pass helper. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17609>	2022-07-20 11:35:25 +00:00
Alejandro Piñeiro	81ca0b4191	broadcom/compiler: removed unused function It is not even implemented. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17609>	2022-07-20 11:35:25 +00:00
Alejandro Piñeiro	22022dfa04	v3dv/pipeline: use nir_shader_instructions_pass when lowering layout The trigger for this commit was when we found that we were not calling nir_metadata_preserve when lowering the layout code. But then I found that it would be better to just update the code to use nir_shader_instructions_pass, so we can avoid to manually: * Initialize the nir_builder * Call nir_foreach functions (we pass the callback) * Call nir_metadata_preserve functions (that as mentioned we were not calling) We also get a nice cleanup of several functions by reducing the number of parameters (we pass a state struct). Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17609>	2022-07-20 11:35:24 +00:00
Alejandro Piñeiro	8d3ce4eb06	nir: call nir_metadata_preserve at nir_remove_unused_io_vars Without it we got a metadata assert: deqp-vk: ../src/compiler/nir/nir_metadata.c:108: nir_metadata_check_validation_flag: Assertion `!(function->impl->valid_metadata & nir_metadata_not_properly_reset)' failed if we try to use NIR_PASS(_, instead of NIR_PASS_V (that among other things, do more validations). Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17609>	2022-07-20 11:35:24 +00:00
Alejandro Piñeiro	d8fee4cdaa	broadcom/compiler: use NIR_PASS for nir_lower_vars_to_ssa at v3d_optimize_nir There's no reason to not take into account progress at that point. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17609>	2022-07-20 11:35:24 +00:00
Alejandro Piñeiro	dea0fe8a06	broadcom/compiler: wrap nir_convert_to_lcssa with NIR_PASS_V So we get it included with the NIR_DEBUG=print_xx debug options. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17609>	2022-07-20 11:35:24 +00:00
Alejandro Piñeiro	72cc268858	broadcom: document which debug options are specific for a given driver Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17609>	2022-07-20 11:35:24 +00:00
Alejandro Piñeiro	e4d7c3b179	broadcom: add no_merge_jobs debug option To avoid having a custom getenv on v3dv. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17609>	2022-07-20 11:35:24 +00:00
Alejandro Piñeiro	bdf23805ab	v3dv/device: add comment on depthClamp feature As we can't support it, and I need to recheck now and then when I read that code. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17609>	2022-07-20 11:35:24 +00:00
Ricardo Garcia	4058984b63	vulkan: allow null descriptor set layouts in pipeline layouts VK_NULL_HANDLE descriptor set layouts are allowed when creating pipeline layouts without VK_PIPELINE_LAYOUT_CREATE_INDEPENDENT_SETS_BIT_EXT. From VUID-VkPipelineLayoutCreateInfo-graphicsPipelineLibrary-06753: > If graphicsPipelineLibrary is not enabled, elements of pSetLayouts > must be valid VkDescriptorSetLayout objects From VUID-VkPipelineLayoutCreateInfo-pSetLayouts-parameter: > If setLayoutCount is not 0, pSetLayouts must be a valid pointer to an > array of setLayoutCount valid or VK_NULL_HANDLE VkDescriptorSetLayout > handles Signed-off-by: Ricardo Garcia <rgarcia@igalia.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17629>	2022-07-20 10:59:56 +00:00
Iago Toral Quiroga	b54544852c	v3dv: fix offset reported by vkGetImageSubresourceLayout This represents an offset from the actual start of the image data, not from the start of the memory allocation bound to the image. Fixes: dEQP-VK.image.subresource_layout.* Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17648>	2022-07-20 10:33:57 +00:00
Iago Toral Quiroga	d2d2fdf934	v3dv: remove assert that poolSizeCount must be larger than 0 From the Vulkan spec: "If poolSizeCount is not 0, pPoolSizes must be a valid pointer to an array of poolSizeCount valid VkDescriptorPoolSize structures" So 0 is actually allowed and there is a CTS to check it is handled gracefully. Fixes: dEQP-VK.api.descriptor_pool.zero_pool_size_count Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17648>	2022-07-20 10:33:57 +00:00
Iago Toral Quiroga	19ef41b181	v3dv: handle VK_STRUCTURE_TYPE_DRM_FORMAT_MODIFIER_PROPERTIES_LIST_2_EXT Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17616>	2022-07-20 10:05:59 +00:00
Marcin Ślusarz	5e14445430	nir: convert unused mesh outputs to shared memory Otherwise reads from output in one subgroup may not see writes from other subgroups. Temp variables are later converted to scratch, so even within one subgroup we may not see correct values. Test case in https://gitlab.freedesktop.org/mesa/crucible/-/merge_requests/115 Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17517>	2022-07-20 09:22:06 +00:00
Juan A. Suarez Romero	95fe540139	v3d/ci: (partially) revert expected list This reverts the traces expected list changed in `f3579a62e9` ("v3d/v3dv/ci: update expected results"). Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17647>	2022-07-20 09:37:43 +02:00
Mike Blumenkrantz	2e7ecc02bc	zink: default to lazy/templated descriptors cached mode was great 2 years ago when template support was less widespread, but now that templates are everywhere, caching is less performant in every scenario Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17636>	2022-07-20 04:24:19 +00:00
Mike Blumenkrantz	6493f6ef79	zink: add an explicit ZINK_DESCRIPTOR_MODE_CACHED no functional changes, but now AUTO is no longer a mode that gets used Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17636>	2022-07-20 04:24:19 +00:00
Mike Blumenkrantz	9172127057	zink: delete ZINK_DESCRIPTOR_MODE_NOFALLBACK this was useful for testing but is going to be less useful soon Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17636>	2022-07-20 04:24:19 +00:00
Mike Blumenkrantz	1c2bd27d81	zink: move descriptor_mode to be a global this can't change once it's been set, so having it per-object is pointless Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17636>	2022-07-20 04:24:19 +00:00
Timothy Arceri	d1e36634bd	nir/loop_unroll: clean up after complex_unroll_single_terminator() Previously we would just unroll the loop one extra iteration and let other optimisation passes clean up the mess. This worked to a degree but if the loop happened to be nested inside another loop we would end up with phi chains that would block other passes from being able to do the cleanup. With this commit we explicitly clone the variables create by lcsaa and insert them directly in the last continue branch after we are done unrolling. With this optimisation passes can recognise both sides of the if output the same values and can progress further. Help with the issues described in: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6051 Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17611>	2022-07-20 03:47:45 +00:00
Dave Airlie	bfebf51571	lavapipe: state latest conformance tests passed Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Cc: mesa-stable "22.1" Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17643>	2022-07-20 03:36:13 +00:00
Danylo Piliaiev	7b0fcd8932	turnip: Disable LRZ fast-clear for gen1 and gen2 LRZ fast-clear works on all gens, however blob disables it on gen1 and gen2. We also elect to disable fast-clear on these gens because for close to none gains it adds complexity and seem to work a bit differently from gen3+. Which creates at least one edge case: if first draw which uses LRZ fast-clear doesn't lock LRZ direction the fast-clear value is undefined. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6829 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17599>	2022-07-20 02:58:44 +00:00
Emma Anholt	7e381ba9fc	ci/freedreno: Switch a630 to manual/disabled for lab maintenance. We're getting several more 630s in the lab, but need a bit of time to swap out some broken old ones and stabilize the new ones. Fritz thinks this should be done in an hour or so, but I want to turn off the CI for main so that we don't block anyone else. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17641>	2022-07-20 02:00:43 +00:00
Yiwei Zhang	4ce29c5568	venus: reenable VK_EXT_extended_dynamic_state2 Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Ryan Neph <ryanneph@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17582>	2022-07-20 01:53:16 +00:00
Yiwei Zhang	4eaa780944	venus: check dynamic state for raster enablement We should not scrub raster dedicated states when dynamic state includes VK_DYNAMIC_STATE_RASTERIZER_DISCARD_ENABLE. Test: - dEQP-VK.pipeline.extended_dynamic_state.*_raster - dEQP-VK.api.pipeline.pipeline_invalid_pointers_unused_structs.graphics Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Ryan Neph <ryanneph@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17582>	2022-07-20 01:53:16 +00:00
Dave Airlie	e14022c052	crocus: fail query begin if upload allocation fails. This is the only place I think I can see this crashing. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17615>	2022-07-20 01:46:32 +00:00
SoroushIMG	2159a377c0	Zink: Fix clear being missed when using emulated draws in zink_blit zink_kopper_acquire_readback will flush any outstanding clears, this means that the current clears need to be applied first before calling zink_kopper_acquire_readback. This was already done for native_blit and native_resolve, also do this for the emulated draw path. Seen as intermittent failures in cts case GTF-GL33.gtf21.GL2FixedTests.buffer_clear.buffer_clear. Cc: mesa-stable Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17631>	2022-07-20 00:39:33 +00:00
SoroushIMG	3dfd8e4d7d	zink: Fix spirv stream 0 vertex emit for multistream shaders Spirv spec does not allow the use of OpEmitVertex or OpEndPrimitive when there are multiple streams. Instead emit the multi-stream version of these with stream set to 0. This issue was seen when testing cts case KHR-GL46.transform_feedback.draw_xfb_stream_test Fixes: `35e346f428` ("zink: handle vertex streams") Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17513>	2022-07-20 00:19:38 +00:00
Jason Ekstrand	9ecc26ff27	anv: Do depth/stencil optimization for dynamic depth/stencil When we made depth/stencil dynamic, we lost the optimization. This is particularly important for cases where the stencil test is enabled but never writes anything as certain combinations with discard can cause the stencil write (which doesn't do anything) to get moved late which can be a measurable perf hit. According to `028e1137e6` ("anv/pipeline: Be smarter about depth/stencil state", it was a couple percent for DOTA2 on Broadwell back in the day. No idea how it affects current titles. This may also improve the depth/stncil PMA workarounds on Gen8 and Gen9 since they're now looking at optimized depth/stencil state. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17564>	2022-07-19 23:16:45 +00:00
Jason Ekstrand	07f5f15963	vulkan/graphics_state: Improve the depth/stencil optimization code We now disable stencil writes when the stencil test is disabled or when both stencil write masks are zero. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17564>	2022-07-19 23:16:45 +00:00
Jason Ekstrand	7d25c04236	anv: Switch to using common dynamic state tracking Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17564>	2022-07-19 23:16:45 +00:00
Jason Ekstrand	cc89232f2e	anv: Stop merging DEPTH_STENCIL state Now that we've stopped trying to do dynamic stuff up-front, we're only merging in one bit: DoubleSidedStencilEnable. There's no point in all the merging code for one bit which is a constant anyway. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17564>	2022-07-19 23:16:45 +00:00
Jason Ekstrand	97dd1a100d	anv: Rework setting primitive topology For one thing, we were deceptively setting it wrong in genX_cmd_buffer.c and then overwriting it in each of of gfx7_cmd_buffer.c and gfx8_cmd_buffer.c. Pull it all into genX_cmd_buffer.c so it's no longer duplicated. Also, stop doing the PATCHLIST conversion in anv_pipeline.c and just store the number of patch vertices. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17564>	2022-07-19 23:16:45 +00:00
Jason Ekstrand	b656957c56	anv: Convert to using vk_graphics_pipeline_state This attempts to be a pretty minimal refactor. We just switch all the state emit code to use the new graphics pipeline state object. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17564>	2022-07-19 23:16:45 +00:00
Lionel Landwerlin	44cbb7d9c8	anv: remove pipeline setting dynamic state Following our rule that anything that can be dynamic should be set (and is this case is already) in the following files : - genX_cmd_buffer.c - gfx8_cmd_buffer.c - gfx7_cmd_buffer.c Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17564>	2022-07-19 23:16:45 +00:00
Jason Ekstrand	18868f1c76	anv: Stop recording sample locations per-sample-count The only reason why we recorded them per-sample-count is because Intel hardware is weird starting with Broadwell. The API, requires that the dynamic sample pattern be reset every time the sample count changes so we only need to record the pattern for the current sample count. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17564>	2022-07-19 23:16:45 +00:00
Jason Ekstrand	36417bd05b	anv: Handle vertex buffer sizes in anv_CmdBindVertexBuffers2 There's no good reason to defer figuring out the size until we emit the packet. We know everything when the bind happens. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17564>	2022-07-19 23:16:45 +00:00
Jason Ekstrand	1faba01c2b	anv: Add an instance multiplier to anv_pipeline This lets us get rid of the view_mask and pipeline replication logic repeated all over everywhere. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17564>	2022-07-19 23:16:45 +00:00
Daniel Schürmann	6f25d45877	spirv: handle phis decorated with RelaxedPrecision If the driver can do 16-bit ALU ops, then store RelaxedPrecision phi values into 16-bit NIR variables with downconverts/upconverts on the way in/out. This has no impact on shader-db on freedreno (not that we have a ton of GLES content there), but it does cause an ANGLE-translated CTS shader on vulkan to get consistent conversions between two copies of a value, and avoid a test bug. Reviewed-by: Emma Anholt <emma@anholt.net> Closes: #6585 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14018>	2022-07-19 21:58:01 +00:00
Rhys Perry	0fb3aa005f	radv: don't update misaligned_mask in CmdBindVertexBuffers2 This can't do it in the loop because it doesn't easily know what attributes use a binding. We could do it in a separate loop, but there's no point, especially since zink does CmdSetVertexInputEXT() after CmdBindVertexBuffers2(). Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Fixes: `c335a4d70e` ("radv: dynamically calculate misaligned_mask for dynamic vertex input") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17521>	2022-07-19 20:40:20 +00:00
Rhys Perry	1d019d2ab7	radv: fix vbo_bound_mask indexing Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6812 Fixes: `1b8bdecf6e` ("radv: add a mask of bound descriptor buffers for dynamic vertex input") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17521>	2022-07-19 20:40:20 +00:00
Pavel Ondračka	ae003ef78c	r300: disable PIPE_CAP_PREFER_IMM_ARRAYS_AS_CONSTBUF We can't do uniform buffer objects and from the hardware perspective constants (uniforms) and immediates are treated in the same way. They are uploaded together and fit together into the (rather low) total constant limit. Therefore, there is actually no advantage in converting immediates to uniforms, and a whole lot of disadvantages (less possible optimizations and no inlining). Fixes the dEQP regressions from https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16770 The tests failed because when there is an indirect array access the compiler inserts a big if ladder and converting the temp to a uniform means more instructions because the ifs cant be lowered at least partially to selects. It is particularly visible, because the code NIR currently emits for the indirect access doesn't really fit the hardware, see: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6366 No change in my shader-db, so this concerns the tests only. Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17576>	2022-07-19 20:31:37 +00:00
Jason Ekstrand	3f19a60869	vulkan: Detect pNext chain loops in vk_foreach_struct() (v2) This implements the "tortoise and the hare" algorithm for detecting cycles in graphs. We use the caller's iterator as the hare and our own internal copy as the tortoise. Conveniently, VkBaseOutStructure (and VkBaseInStructure which is identical except the pointer type on pNext) have a pointer we can use for the tortoise and an sType which we can use for a counter to ensure we only increment the tortose every other loop iteration. There are more efficient algorithms than tortoise and hare but they require allocating memory for something like a hash set of seen nodes. Since this for debug purposes only, it's ok for it to be a bit inefficient in the case where it hits the assert. In the usual case of no loops, it's the same runtime efficiency as the unchecked version except that it does a tiny bit of math and 50% more pointer chases. Version 1 worked fine with clang and with GCC 12.1 with -O0 but not with optimizations. See https://gitlab.freedesktop.org/mesa/mesa/-/issues/6895. Unfortunately, the first version required modifying a temporary declared const inside the for loop and that seems to have been the problem. This version instead has an iterator struct which is managed by an outer for loop and the inner for loop exists to declare the user's requested iteration variable and manage the actual iteration. Because the outer for loop is effectively `for (bool done = false; !done; done = true)`, it will execute exactly once, regardless of the inner loop, so break and continue inside the inner loop should work the same as if it's a single for loop. The other major difference with the new version is that the code is the same for debug and release except the half_iter and loop check are gone. I've verified by hand that this produces virtually identical code to the old simple iterators on both GCC andl clang with an optimized build. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17630>	2022-07-19 19:55:17 +00:00
Jason Ekstrand	b510ee0d22	Use vk_foreach_struct_const where needed We're about to make it so that the compiler warns/errors if you use the wrong iterator macro. Fix up a bunch of places where someone used the wrong one before we break anything. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17630>	2022-07-19 19:55:17 +00:00
Jason Ekstrand	9066cc86bb	vulkan: Copy the depth/stencil state optimization code from ANV Instead of having stencil writes as an out parameter of the optimization function, we add a new write_enable field for stencil that's equivalent to the similarly named field for depth. This doesn't mean drivers must actually support disabling stencil writes independently but the information may be helpful on some hardware. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17328>	2022-07-19 19:19:34 +00:00
Jason Ekstrand	4ad149a8fc	vulkan: Add a common implementation of vkCmdSet* Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17328>	2022-07-19 19:19:34 +00:00
Jason Ekstrand	fcedb1250b	vulkan: Add a common data structure for dynamic states Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17328>	2022-07-19 19:19:34 +00:00
Jason Ekstrand	09188c9659	util: Constify __bitset_test_range Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17328>	2022-07-19 19:19:33 +00:00
Jason Ekstrand	7ca8dcb05f	vulkan: Add data structures to store all graphics state Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17328>	2022-07-19 19:19:33 +00:00
Jason Ekstrand	fed81dc306	vulkan: Add standard sample locations This looks a bit funny because we define the struct in the .c file but that will be fixed in the next commit. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17328>	2022-07-19 19:19:33 +00:00
Jason Ekstrand	fb3f9c0aac	vulkan: Add an enum for all dynamic graphics states Also, add a helper function to turn VkPipelineDynamicStateCreateInfo into a bitset. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17328>	2022-07-19 19:19:33 +00:00
Jason Ekstrand	6b6c1a7ddd	vulkan: Add a vk_limits.h file for runtime limits Individual driver limits may be smaller than these. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17328>	2022-07-19 19:19:33 +00:00
Samuel Pitoiset	e840ba9ed8	aco: requires Exact for p_jump_to_epilog Otherwise, in presence of p_exit_early_if the main FS will always jump to the PS epilog regardless the exact mask. This fixes dEQP-VK.draw.renderpass.shader_invocation.helper_invocation and few vkd3d-proton regressions when PS epilogs are forced. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17617>	2022-07-19 17:52:36 +00:00
Juan A. Suarez Romero	f3579a62e9	v3d/v3dv/ci: update expected results Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17632>	2022-07-19 17:26:52 +00:00
Daniel Schürmann	ac39e7bf23	aco: fix assertion in insert_exec_mask The exec mask might also be of type mask_type_loop. Fixes: `d068eb53e8` ('aco/insert_exec_mask: optimize top-level transition to exact before demote') Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17402>	2022-07-19 16:56:37 +00:00
Daniel Schürmann	6de68c5dca	aco: Avoid live-range splits in Exact mode Because the data register of atomic VMEM instructions is shared between src and dst, it might be necessary to create live-range splits during RA. Make the live-range splits explicit in WQM mode. Totals from 7 (0.01% of 134913) affected shaders: (GFX10.3) Latency: 17209 -> 17210 (+0.01%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15347>	2022-07-19 16:30:49 +00:00
Daniel Schürmann	f12eb5c213	aco: avoid unnecessary copies in emit_wqm() No fossil-db changes. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15347>	2022-07-19 16:30:49 +00:00
Hyunjun Ko	4bccee123f	turnip: expose VK_EXT_shader_module_identifier Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17614>	2022-07-19 16:12:15 +00:00
Hyunjun Ko	d046d6e9e0	turnip: Remove an unnecessary assert. The assertion is already in the common implementation. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17614>	2022-07-19 16:12:15 +00:00
Konstantin Seurer	d4ca5942be	ac/llvm: Remove load_vertex_id handling Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17539>	2022-07-19 13:26:09 +00:00
Konstantin Seurer	83ccc810b4	aco: Remove dead nir_intrinsic_load_vertex_id case This intrinsic is lowered in NIR. Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17539>	2022-07-19 13:26:09 +00:00
Konstantin Seurer	eb19640d61	radeonsi: Set vertex_id_zero_based Let NIR lower load_vertex_id instead of LLVM. Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17539>	2022-07-19 13:26:09 +00:00
Konstantin Seurer	d316d24d74	v3dv: Use nir_gen_rect_vertices Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17535>	2022-07-19 12:47:31 +00:00
Konstantin Seurer	f90babb567	radv: Use nir_gen_rect_vertices Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17535>	2022-07-19 12:47:30 +00:00
Konstantin Seurer	fab0050223	nir: Add a common gen_rect_vertices implementation Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17535>	2022-07-19 12:47:30 +00:00
Samuel Pitoiset	203afc9351	radv: disable viewport depth clamping only when necessary When the application uses depth values outside of the [0.0,1.0] range with VK_EXT_depth_range_unrestricted, or when explicitly disabled. Otherwise, the driver can clamp to [0.0,1.0] internally for optimal performance. From the Vulkan spec "in 28.10.1. Depth Clamping and Range Adjustment": "If depth clamping is not enabled and zf is not in the range [0, 1] and either VK_EXT_depth_range_unrestricted is not enabled, or the depth attachment has a fixed-point format, then zf is undefined following this step." Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16349>	2022-07-19 09:00:27 +00:00
Iago Toral Quiroga	bec3c83e19	v3dv: implement VK_KHR_buffer_device_address This feature allows shaders to use pointers to buffers which may not be bound via descriptor sets. Access to these buffers is done via global intrinsics. Because the buffers are not accessed through descriptor sets, any live buffer flagged with VK_BUFFER_USAGE_SHADER_DEVICE_ADDRESS_BIT_KHR can be accessed by any shader using global intrinsics, so the driver needs to make sure all these buffers are mapped by the kernel when it submits the job for execution. We handle this by tracking if any draw call or compute dispatch in a job uses a pipeline that has any such shaders. If so, the job is flagged as using buffer device address and the kernel submission for that job will add all live BOs bound to buffers flagged with the buffer device address usage flag. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17275>	2022-07-19 09:47:34 +02:00
Iago Toral Quiroga	90054e9c5d	broadcom/compiler: track if a shader uses global intrinsics Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17275>	2022-07-19 09:47:34 +02:00
Iago Toral Quiroga	fa03d9c8be	broadcom/compiler: implement 2x32 global intrinsics Notice we ignore the high 32-bit component of the address because we know it must be 0. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17275>	2022-07-19 09:47:34 +02:00
Iago Toral Quiroga	b18cecbfb6	nir: add nir_address_format_2x32bit_global This adds support for global 64-bit GPU addresses as a pair of 32-bit values. This is useful for platforms with 32-bit GPUs that want to support VK_KHR_buffer_device_address, which makes GPU addresses explicitly 64-bit. With the new format we also add new global intrinsics with 2x32 suffix that consume the new address format. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17275>	2022-07-19 09:47:34 +02:00
Iago Toral Quiroga	ea3acbef8d	v3dv: remove duplicate condition Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17536>	2022-07-19 05:46:04 +00:00
Iago Toral Quiroga	706f1252ba	v3dv: explain why we clear certain state after a draw call Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17536>	2022-07-19 05:46:04 +00:00
Iago Toral Quiroga	702b685b07	v3dv: add a dirty state for pending push constants UBO updates If we have 2 pipelines that consume the same push constant data but where one of them only uses direct access and the other has indirect access, a draw with the first pipeline would clear the dirty flag without updating the UBO and by the time we bind and draw with the second pipeline we won't upload the constants either because the first draw cleared the dirty flag. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17536>	2022-07-19 05:46:04 +00:00
Iago Toral Quiroga	3898bf6971	v3dv: allocate more push constant buffers if needed Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17536>	2022-07-19 05:46:04 +00:00
Iago Toral Quiroga	e451c612df	v3dv: stop tracking push constant buffer references Since we allocate this ourselves we can immediately add it to the job at the time we allocate it. This also fixes a bug we introduced when we implemented inline uniforms because since that commit, if we had an inline uniform buffer at index 1 which happend to have indirect access we would track it in slot 0 instead of slot 1, potentially overwriting the push constant buffer reference. Fixes: `ea3223e7a4` ('v3dv: implement VK_EXT_inline_uniform_block') Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17536>	2022-07-19 05:46:04 +00:00
Iago Toral Quiroga	45b8dc667a	v3dv: don't allocate MAX_PUSH_CONSTANTS_SIZE bytes for the push constants UBO We have code in there to allocate various segments of MAX_PUSH_CONSTANTS_SIZE to handle the case of various draw calls in the same command buffer requiring different push constants, so we are implicitly expecting it to be larger than this. In fact, this only works now because when we allocate a BO we are always at least allocating a full page, so the least we ever allocate is 4096 bytes, so be explicit about it to avoid confusion. Also, since we were always mapping MAX_PUSH_CONSTANTS_SIZE and the mapping always starts at the beginning of the BO, it looks like after the first copy when the resource offset is not zero, we would be writing outside the mapped range. Always map the full size of the BO instead to ensure this doesn't happen. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17536>	2022-07-19 05:46:04 +00:00
Iago Toral Quiroga	51a45f9315	v3dv: limit upload of indirect push constant data We have been always uploading MAX_PUSH_CONSTANTS_SIZE but now that we track the actual size of the push constant buffer we can use this instead. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17536>	2022-07-19 05:46:04 +00:00
Iago Toral Quiroga	005542f0e3	v3dv: move push constant data to the command buffer state Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17536>	2022-07-19 05:46:04 +00:00
Iago Toral Quiroga	41a0c89d9f	v3dv: only save/restore push constant data for meta operations if needed If the command buffer didn't have any push constants or the meta operation didn't write any new constants we don't need to restore the state. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17536>	2022-07-19 05:46:04 +00:00
Jason Ekstrand	669daa37b1	Revert "vulkan: Detect pNext chain loops in vk_foreach_struct()" This reverts commit `4c56b535f5`.	2022-07-18 23:48:59 -05:00
Lionel Landwerlin	2bfcd29155	anv: move restart index to gfx state Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	a9abf1dd93	anv: fix primitive topology dynamic state emission on gfx7 This dynamic state uses the pipeline information so it needs to be reemitted whenever the pipeline changes. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	25de752234	anv: name non dynamic state fields correctly Those fields have confusing names. They hold non dynamic state, specified at pipeline creation that we copy into the dynamic state of the command buffer whenever we bind a pipeline. This non dynamic state might get picked up in the dynamically emitted instructions of the 3D pipeline because our HW packets are not exactly splitted like the Vulkan API. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	5b6e6a672c	anv: reorder & document fields of anv_graphics_pipeline Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	aea9abd71b	anv: move CreateRayTracingPipelines to common code Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	ffc798c364	anv: move CreateComputePipelines to common code Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	2c816b4f2e	anv: move CreateGraphicsPipelines to common code Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	1ba89d35ab	anv: rename internal function for consistency Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Jason Ekstrand	cb682a1cdd	anv: Don't use the wrong ARRAY_SIZE Even though this doesn't change anything, it's not good to use an ARRAY_SIZE for one array to iterate over another. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	f66192a4b3	anv: split graphics nir loading Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	36aa0f668f	anv: break up anv_pipeline_compile_graphics() This function is pretty overwhelming. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	c806d1e5ed	anv: simplify dynamic buffer count in pipeline layout anv_descriptor_set_layout already has the information we're gather here. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	5b561b501a	anv: remove local computation of dynamic states This bit mask is already computed in anv_graphics_pipeline::dynamic_states in anv_graphics_pipeline_init(). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Jason Ekstrand	4c56b535f5	vulkan: Detect pNext chain loops in vk_foreach_struct() This implements the "tortoise and the hare" algorithm for detecting cycles in graphs. We use the caller's iterator as the hare and our own internal copy as the tortoise. Conveniently, VkBaseOutStructure (and VkBaseInStructure which is identical except the pointer type on pNext) have a pointer we can use for the tortoise and an sType which we can use for a counter to ensure we only increment the tortose every other loop iteration. There are more efficient algorithms than tortoise and hare but they require allocating memory for something like a hash set of seen nodes. Since this for debug purposes only, it's ok for it to be a bit inefficient in the case where it hits the assert. In the usual case of no loops, it's the same runtime efficiency as the unchecked version except that it does a tiny bit of math and 50% more pointer chases. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17596>	2022-07-19 02:11:07 +00:00
Emma Anholt	94bd06256a	intel/fs: Simplify brw_barycentric_mode() args. Reduce a bit of mode lookup noise I was tracing through trying to resolve the previous bug. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17381>	2022-07-19 01:25:47 +00:00
Lionel Landwerlin	2d1f021e16	intel/fs: Set NonPerspectiveBarycentricEnable when the interpolator needs it. [anholt: changed to make all drivers do the right thing by moving the payload barycentric check into the compiler] Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17381>	2022-07-19 01:25:47 +00:00
Emma Anholt	3d62a41dcc	freedreno/ir3: Enable core NIR's 16-bit ALU optimizations. In addition to hopefully generating shorter code, this optimizes out a comparison of a mediump-cast value in dEQP-GLES2.functional.shaders.algorithm.rgb_to_hsl_fragment passed through ANGLE, and allows the test to pass. We believe it to be a test bug, but emitting better code like apparently everyone else does is also a fine result. No change on GLES gfxbench shaders. Fixes: #6585 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17546>	2022-07-18 22:41:18 +00:00
Konstantin Seurer	fc26fbde3d	vulkan: Common vk_format_get_component_bits RADV and PowerVR use the same implementation. Turnip does use a slightly modified version but the helper only has one use -> just inline it and get rid of turnip's vk_format.h. Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17515>	2022-07-18 22:14:06 +00:00
jheaff1	2e71e23188	build(glx): Fix build by adding missing deps dri3_glx.c includes xshmfence and glxcmds.c includes xf86vm, neither of which are listed as dependencies of the glx lib in the meson.build file. Consequently, those files would fail to compile on machines that did not have xshmfence and xf86vm installed globally. This commit rectifies the issue by adding the missing dependencies Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17585>	2022-07-18 21:12:26 +00:00
Mike Blumenkrantz	48491386ff	mesa/st: add implicit zeroing of clipdistance array GL drivers have an implicit default of "in bounds" for unwritten clipdistance values, but some (layered) drivers have to deal with api mismatch which prevents that implicit value from being used after the shader gets mangled by various compiler passes to avoid issues here, write out all members of the clipdistance array with zeroes at the very start of the shader such that any values the shader actually writes will naturally overwrite the implicit zero and any unwritten values will now be written as zero fixes #6845 Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17498>	2022-07-18 20:33:11 +00:00
Mike Blumenkrantz	071d335ca2	zink: tu a630 baseline update Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17606>	2022-07-18 20:15:05 +00:00
Mike Blumenkrantz	b6df410d26	zink: nv baseline update Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17606>	2022-07-18 20:15:05 +00:00
Adam Jackson	c123ab2137	kopper: Implement {EGL,GLX}_EXT_buffer_age Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17527>	2022-07-18 19:31:29 +00:00
Mike Blumenkrantz	81d83e81db	zink: break out tc/trace context unwrapping Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17527>	2022-07-18 19:31:28 +00:00
Yiwei Zhang	a211d74096	venus: filter out VK_EXT_physical_device_drm on the driver side Fixes: `a1a22862c6` ("venus: implement VK_EXT_physical_device_drm") Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Ryan Neph <ryanneph@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17591>	2022-07-18 19:23:53 +00:00
Dave Airlie	50e3303b3d	kms/dri: add mutex lock around map/unmap this can get called from multiple threads with the recent llvmpipe overlapping rendering changes, so make sure to lock around the map/unmapping so they can't race. This should fixes some crashes seen with kwin. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Tested-by: Adam Williamson (Fedora) Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17531>	2022-07-18 19:06:30 +00:00
Samuel Pitoiset	5ee5c73d2d	radv: implement PS epilogs Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	270cc39648	aco: add support for compiling PS epilogs Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	8d13392969	aco: refactor export_fs_mrt_color() for PS epilogs preparation Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	df8fb721a5	radv,aco: rename radv_aco_build_prolog to radv_aco_build_shader_part Will be re-used for PS epilogs. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	897561b7b9	aco: add aco_postprocess_shader() helper Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	d9ffff09b0	aco: prevent adding DONE/VM to the last export if the FS has an epilog If the fragment shader exports MRTZ and the epilog some color exports, DONE/VM should be added to the last export. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	2784bfe93f	aco: do not abort if the FS doesn't export anything but has an epilog The main fragment shader can only export MRTZ (if present) and the epilog will export colors. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	a6dff6caa1	aco: emit p_jump_to_epilog if the main fragment shader has an epilog MRTZ is still exported from the main shader. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	8bdcc20815	aco: add new pseudo instruction p_jump_to_epilog The first operand of this new pseudo-instruction is a 64-bit SGPR for the continue PC, followed by a variable list of fixed VGPRS for the color exports which are the PS epilog inputs. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	0fd3754c26	radv: add a function that declares PS epilog shader arguments The PS epilog would be a "normal" compiled shader using RA, etc. It will declare up to 8x4 VGPRs for all color exports. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	a38db1a94e	radv: declare a new user SGPR arg in FS for the epilog PC The main FS would have to jump to the PC of the PS epilog. Given that shaders are allocated in the 32-bit addr space, one user SGPR is fine. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	0db7a0b6e8	radv,aco: introduce {radv,aco}_ps_epilog_key To pass the necessary pipeline information for compiling PS epilogs. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	eee098486a	radv,aco: track if a fragment shader needs an epilog This is currently disabled but it will be used for testing first, and then for graphics pipeline libraries. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Ruijing Dong	a585d95803	frontends/va: WA for ffmpeg 10bit encoding crash When doing 10bit encoding in ffmpeg it uses VaDeriveImage, and that could result in missing mapping the chroma buffer of the input frame. This WA to disallow ffmpeg using VaDeriveImage function, so that VaCreateImage and VaPutImage can be used and WA the chroma buffer mapping issue. Reviewed-by: Leo Liu <leo.liu@amd.com> Signed-off-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17472>	2022-07-18 17:23:22 +00:00
Ruijing Dong	cd653e5cc7	frontends/va: do texture_map when needed When map buffer, and its target is texture, texture_map/unmap need to be used. Reviewed-by: Leo Liu <leo.liu@amd.com> Signed-off-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17472>	2022-07-18 17:23:22 +00:00
Erik Faye-Lund	e630637eab	dzn: expose VK_KHR_driver_properties We're not quite conformant with the extension, because we don't have a valid conformance version. That's not a quick-fix, so we should probably just accept some failures for now. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16998>	2022-07-18 17:10:52 +02:00
Erik Faye-Lund	e5da067384	dzn: fill misc props This is just a bag of misc properties that we should fill in. Not all of them are filled out super accurately, but this is the best we can do for now. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16998>	2022-07-18 16:49:42 +02:00
Erik Faye-Lund	053e2fd9d0	dzn: fill in minmax props This should be possible to support, but we don't support minmax blending at all yet, so let's leave these as unsupported for now. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16998>	2022-07-18 16:49:42 +02:00
Erik Faye-Lund	18c590e0b3	dzn: fill in depth/stencil resolve props Before enabling Vulkan 1.2 support, we need to fix the TODO in here. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16998>	2022-07-18 16:49:42 +02:00
Erik Faye-Lund	141e715f29	dzn: fill in bindless props These might not be exactly right, but they are good enough for now. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16998>	2022-07-18 16:49:42 +02:00
Erik Faye-Lund	34b0828cdc	dzn: fill in non-uniform-indexing props Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16998>	2022-07-18 16:49:42 +02:00
Erik Faye-Lund	8e9191cd24	dzn: fill in float-control details We can do better here in the future, but this is what's supported right now. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16998>	2022-07-18 16:49:42 +02:00
Erik Faye-Lund	4d7403d4dc	dzn: fill in driver name and info Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16998>	2022-07-18 16:49:42 +02:00
Konstantin Seurer	0580910aa9	radv: Only set rt stack size for dynamic stacks When using a static callable stack, the required scratch has already been allocated. Dynamic stacks are located at the end of scratch memory and are allocated on demand using radv_set_rt_stack_size. Static stacks live at the start of scratch memory and are allocated in create_rt_shader by setting scratch_size. Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-By: Tatsuyuki Ishi <ishitatsuyuki@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17579>	2022-07-18 12:05:35 +00:00
Qiang Yu	eeaf0b1888	ac/nir/ngg: add a barrier before prim id export When culling enabled, it will use LDS space, which overlap with the prim id export. Fixes: `e97f0463a8` ("ac/nir: Implement NGG deferred attribute culling in NIR.") Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17593>	2022-07-18 09:50:09 +00:00
Qiang Yu	0b7ef846b3	ac/nir/ngg: fix nogs culling scratch size Should be in bytes not dwords. Fixes: `e97f0463a8` ("ac/nir: Implement NGG deferred attribute culling in NIR.") Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17593>	2022-07-18 09:50:09 +00:00
Timur Kristóf	5050db0b55	radv: Remove trailing whitespace introduced by DGC commits. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17580>	2022-07-18 08:23:41 +00:00
Timur Kristóf	b732285312	radv: Only initialize DGC state when DGC is enabled. This function causes a crash with RADV_DEBUG=llvm and this commit works around that crash. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17580>	2022-07-18 08:23:41 +00:00
Mike Blumenkrantz	7ea7d0687b	zink: inject a 0,0,0,1 clear for RGBX formats this ensures the alpha component is full if it must be read for fbfetch fixes (RGBX swapchain config): KHR-GL46.blend_equation_advanced* Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:38 +00:00
Mike Blumenkrantz	ac38139c34	zink: simplify zink_framebuffer_clear_data union no functional changes Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:38 +00:00
Mike Blumenkrantz	7754041ed6	zink: delete srgb tracking for clears no longer used Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:38 +00:00
Mike Blumenkrantz	b0e62adbcc	zink: delete zink_fb_clear_util_unpack_clear_color no longer used Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:38 +00:00
Mike Blumenkrantz	d8fa4e6797	zink: remove out-of-renderpass clears these are only ever going to hurt tiler perf, so remove the footgun this also means there's no more srgb format conversion needed, so delete all of that too Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:38 +00:00
Mike Blumenkrantz	66ceea7ed9	zink: lift clearing on fb state change up a level in the scenario where: * at least 1 color buffer was bound and a depth buffer was bound * no color clear was enabled * a zs clear was enabled * the zs clear was never flushed * the zs clear needs a renderpass * the fb state changes the color buffer(s) would be unbound, following which the depth buffer unbind would trigger a renderpass, which would utilize the just-unbound color buffers, which have no batch tracking, thus creating a case where the surface was destroyed while it was still in use cc: mesa-stable Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:38 +00:00
Mike Blumenkrantz	e81d6cec27	zink: clamp color clear values based on format formats like GL_RGB10_A2UI can be cleared with out of range values, so to ensure consistent driver behavior, pre-clamp to the valid range affects: KHR-GL46.direct_state_access.renderbuffers_storage_multisample Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:38 +00:00
Mike Blumenkrantz	95f1c84739	zink: add explicit (awful) handling for fb layer mismatch clears this is terrible and (hopefully?) rare, so just force it out early to avoid any kind of issue Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:38 +00:00
Mike Blumenkrantz	631db579af	zink: track a bitmask of fb attachments with mismatched layer counts these need special handling for clears Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:38 +00:00
Mike Blumenkrantz	9d68684240	zink: always use storeOp=STORE for depth renderpass it's unknown whether there may be clears to the depth attachment at the start of a renderpass, so always assume there will be Fixes: `c132a28745` ("zink: use store op NONE when necessary for depth usage") Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:38 +00:00
Mike Blumenkrantz	442281447a	zink: remove u_blitter usage from zink_clear_render_target this is more operations than needed Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:38 +00:00
Mike Blumenkrantz	d53395ad06	zink: remove non-renderpass clear path from zink_clear_texture this should always be faster Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:37 +00:00
Mike Blumenkrantz	f1f08e3529	zink: massively simplify zink_clear_depth_stencil this now just uses renderpass clears Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:37 +00:00
Mike Blumenkrantz	3488dc7dd0	zink: improve zink_clear_depth_stencil check for current attachment this is technically more correct since it accounts for multi-context usage Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:37 +00:00
Mike Blumenkrantz	c1cc2bd48e	zink: stop using u_blitter for texture clears this was really stupid: instead of just binding a new fb and firing off a clear, the code was calling u_blitter to bind a new fb and do actual draws Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:37 +00:00
Mike Blumenkrantz	04a5471b5e	zink: fix coverage check for texture clears this wasn't actively harmful, but it was potentially differently-performant Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:37 +00:00
Mike Blumenkrantz	b33f041810	zink: remove format check from clear texture this is going to be a render target no matter what Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:37 +00:00

... 2 3 4 5 6 ...

145368 Commits