KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Jason Ekstrand	6b6c1a7ddd	vulkan: Add a vk_limits.h file for runtime limits Individual driver limits may be smaller than these. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17328>	2022-07-19 19:19:33 +00:00
Samuel Pitoiset	e840ba9ed8	aco: requires Exact for p_jump_to_epilog Otherwise, in presence of p_exit_early_if the main FS will always jump to the PS epilog regardless the exact mask. This fixes dEQP-VK.draw.renderpass.shader_invocation.helper_invocation and few vkd3d-proton regressions when PS epilogs are forced. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17617>	2022-07-19 17:52:36 +00:00
Juan A. Suarez Romero	f3579a62e9	v3d/v3dv/ci: update expected results Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17632>	2022-07-19 17:26:52 +00:00
Daniel Schürmann	ac39e7bf23	aco: fix assertion in insert_exec_mask The exec mask might also be of type mask_type_loop. Fixes: `d068eb53e8` ('aco/insert_exec_mask: optimize top-level transition to exact before demote') Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17402>	2022-07-19 16:56:37 +00:00
Daniel Schürmann	6de68c5dca	aco: Avoid live-range splits in Exact mode Because the data register of atomic VMEM instructions is shared between src and dst, it might be necessary to create live-range splits during RA. Make the live-range splits explicit in WQM mode. Totals from 7 (0.01% of 134913) affected shaders: (GFX10.3) Latency: 17209 -> 17210 (+0.01%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15347>	2022-07-19 16:30:49 +00:00
Daniel Schürmann	f12eb5c213	aco: avoid unnecessary copies in emit_wqm() No fossil-db changes. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15347>	2022-07-19 16:30:49 +00:00
Hyunjun Ko	4bccee123f	turnip: expose VK_EXT_shader_module_identifier Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17614>	2022-07-19 16:12:15 +00:00
Hyunjun Ko	d046d6e9e0	turnip: Remove an unnecessary assert. The assertion is already in the common implementation. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17614>	2022-07-19 16:12:15 +00:00
Konstantin Seurer	d4ca5942be	ac/llvm: Remove load_vertex_id handling Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17539>	2022-07-19 13:26:09 +00:00
Konstantin Seurer	83ccc810b4	aco: Remove dead nir_intrinsic_load_vertex_id case This intrinsic is lowered in NIR. Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17539>	2022-07-19 13:26:09 +00:00
Konstantin Seurer	eb19640d61	radeonsi: Set vertex_id_zero_based Let NIR lower load_vertex_id instead of LLVM. Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17539>	2022-07-19 13:26:09 +00:00
Konstantin Seurer	d316d24d74	v3dv: Use nir_gen_rect_vertices Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17535>	2022-07-19 12:47:31 +00:00
Konstantin Seurer	f90babb567	radv: Use nir_gen_rect_vertices Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17535>	2022-07-19 12:47:30 +00:00
Konstantin Seurer	fab0050223	nir: Add a common gen_rect_vertices implementation Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17535>	2022-07-19 12:47:30 +00:00
Samuel Pitoiset	203afc9351	radv: disable viewport depth clamping only when necessary When the application uses depth values outside of the [0.0,1.0] range with VK_EXT_depth_range_unrestricted, or when explicitly disabled. Otherwise, the driver can clamp to [0.0,1.0] internally for optimal performance. From the Vulkan spec "in 28.10.1. Depth Clamping and Range Adjustment": "If depth clamping is not enabled and zf is not in the range [0, 1] and either VK_EXT_depth_range_unrestricted is not enabled, or the depth attachment has a fixed-point format, then zf is undefined following this step." Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16349>	2022-07-19 09:00:27 +00:00
Iago Toral Quiroga	bec3c83e19	v3dv: implement VK_KHR_buffer_device_address This feature allows shaders to use pointers to buffers which may not be bound via descriptor sets. Access to these buffers is done via global intrinsics. Because the buffers are not accessed through descriptor sets, any live buffer flagged with VK_BUFFER_USAGE_SHADER_DEVICE_ADDRESS_BIT_KHR can be accessed by any shader using global intrinsics, so the driver needs to make sure all these buffers are mapped by the kernel when it submits the job for execution. We handle this by tracking if any draw call or compute dispatch in a job uses a pipeline that has any such shaders. If so, the job is flagged as using buffer device address and the kernel submission for that job will add all live BOs bound to buffers flagged with the buffer device address usage flag. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17275>	2022-07-19 09:47:34 +02:00
Iago Toral Quiroga	90054e9c5d	broadcom/compiler: track if a shader uses global intrinsics Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17275>	2022-07-19 09:47:34 +02:00
Iago Toral Quiroga	fa03d9c8be	broadcom/compiler: implement 2x32 global intrinsics Notice we ignore the high 32-bit component of the address because we know it must be 0. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17275>	2022-07-19 09:47:34 +02:00
Iago Toral Quiroga	b18cecbfb6	nir: add nir_address_format_2x32bit_global This adds support for global 64-bit GPU addresses as a pair of 32-bit values. This is useful for platforms with 32-bit GPUs that want to support VK_KHR_buffer_device_address, which makes GPU addresses explicitly 64-bit. With the new format we also add new global intrinsics with 2x32 suffix that consume the new address format. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17275>	2022-07-19 09:47:34 +02:00
Iago Toral Quiroga	ea3acbef8d	v3dv: remove duplicate condition Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17536>	2022-07-19 05:46:04 +00:00
Iago Toral Quiroga	706f1252ba	v3dv: explain why we clear certain state after a draw call Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17536>	2022-07-19 05:46:04 +00:00
Iago Toral Quiroga	702b685b07	v3dv: add a dirty state for pending push constants UBO updates If we have 2 pipelines that consume the same push constant data but where one of them only uses direct access and the other has indirect access, a draw with the first pipeline would clear the dirty flag without updating the UBO and by the time we bind and draw with the second pipeline we won't upload the constants either because the first draw cleared the dirty flag. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17536>	2022-07-19 05:46:04 +00:00
Iago Toral Quiroga	3898bf6971	v3dv: allocate more push constant buffers if needed Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17536>	2022-07-19 05:46:04 +00:00
Iago Toral Quiroga	e451c612df	v3dv: stop tracking push constant buffer references Since we allocate this ourselves we can immediately add it to the job at the time we allocate it. This also fixes a bug we introduced when we implemented inline uniforms because since that commit, if we had an inline uniform buffer at index 1 which happend to have indirect access we would track it in slot 0 instead of slot 1, potentially overwriting the push constant buffer reference. Fixes: `ea3223e7a4` ('v3dv: implement VK_EXT_inline_uniform_block') Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17536>	2022-07-19 05:46:04 +00:00
Iago Toral Quiroga	45b8dc667a	v3dv: don't allocate MAX_PUSH_CONSTANTS_SIZE bytes for the push constants UBO We have code in there to allocate various segments of MAX_PUSH_CONSTANTS_SIZE to handle the case of various draw calls in the same command buffer requiring different push constants, so we are implicitly expecting it to be larger than this. In fact, this only works now because when we allocate a BO we are always at least allocating a full page, so the least we ever allocate is 4096 bytes, so be explicit about it to avoid confusion. Also, since we were always mapping MAX_PUSH_CONSTANTS_SIZE and the mapping always starts at the beginning of the BO, it looks like after the first copy when the resource offset is not zero, we would be writing outside the mapped range. Always map the full size of the BO instead to ensure this doesn't happen. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17536>	2022-07-19 05:46:04 +00:00
Iago Toral Quiroga	51a45f9315	v3dv: limit upload of indirect push constant data We have been always uploading MAX_PUSH_CONSTANTS_SIZE but now that we track the actual size of the push constant buffer we can use this instead. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17536>	2022-07-19 05:46:04 +00:00
Iago Toral Quiroga	005542f0e3	v3dv: move push constant data to the command buffer state Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17536>	2022-07-19 05:46:04 +00:00
Iago Toral Quiroga	41a0c89d9f	v3dv: only save/restore push constant data for meta operations if needed If the command buffer didn't have any push constants or the meta operation didn't write any new constants we don't need to restore the state. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17536>	2022-07-19 05:46:04 +00:00
Jason Ekstrand	669daa37b1	Revert "vulkan: Detect pNext chain loops in vk_foreach_struct()" This reverts commit `4c56b535f5`.	2022-07-18 23:48:59 -05:00
Lionel Landwerlin	2bfcd29155	anv: move restart index to gfx state Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	a9abf1dd93	anv: fix primitive topology dynamic state emission on gfx7 This dynamic state uses the pipeline information so it needs to be reemitted whenever the pipeline changes. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	25de752234	anv: name non dynamic state fields correctly Those fields have confusing names. They hold non dynamic state, specified at pipeline creation that we copy into the dynamic state of the command buffer whenever we bind a pipeline. This non dynamic state might get picked up in the dynamically emitted instructions of the 3D pipeline because our HW packets are not exactly splitted like the Vulkan API. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	5b6e6a672c	anv: reorder & document fields of anv_graphics_pipeline Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	aea9abd71b	anv: move CreateRayTracingPipelines to common code Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	ffc798c364	anv: move CreateComputePipelines to common code Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	2c816b4f2e	anv: move CreateGraphicsPipelines to common code Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	1ba89d35ab	anv: rename internal function for consistency Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Jason Ekstrand	cb682a1cdd	anv: Don't use the wrong ARRAY_SIZE Even though this doesn't change anything, it's not good to use an ARRAY_SIZE for one array to iterate over another. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	f66192a4b3	anv: split graphics nir loading Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	36aa0f668f	anv: break up anv_pipeline_compile_graphics() This function is pretty overwhelming. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	c806d1e5ed	anv: simplify dynamic buffer count in pipeline layout anv_descriptor_set_layout already has the information we're gather here. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	5b561b501a	anv: remove local computation of dynamic states This bit mask is already computed in anv_graphics_pipeline::dynamic_states in anv_graphics_pipeline_init(). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Jason Ekstrand	4c56b535f5	vulkan: Detect pNext chain loops in vk_foreach_struct() This implements the "tortoise and the hare" algorithm for detecting cycles in graphs. We use the caller's iterator as the hare and our own internal copy as the tortoise. Conveniently, VkBaseOutStructure (and VkBaseInStructure which is identical except the pointer type on pNext) have a pointer we can use for the tortoise and an sType which we can use for a counter to ensure we only increment the tortose every other loop iteration. There are more efficient algorithms than tortoise and hare but they require allocating memory for something like a hash set of seen nodes. Since this for debug purposes only, it's ok for it to be a bit inefficient in the case where it hits the assert. In the usual case of no loops, it's the same runtime efficiency as the unchecked version except that it does a tiny bit of math and 50% more pointer chases. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17596>	2022-07-19 02:11:07 +00:00
Emma Anholt	94bd06256a	intel/fs: Simplify brw_barycentric_mode() args. Reduce a bit of mode lookup noise I was tracing through trying to resolve the previous bug. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17381>	2022-07-19 01:25:47 +00:00
Lionel Landwerlin	2d1f021e16	intel/fs: Set NonPerspectiveBarycentricEnable when the interpolator needs it. [anholt: changed to make all drivers do the right thing by moving the payload barycentric check into the compiler] Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17381>	2022-07-19 01:25:47 +00:00
Emma Anholt	3d62a41dcc	freedreno/ir3: Enable core NIR's 16-bit ALU optimizations. In addition to hopefully generating shorter code, this optimizes out a comparison of a mediump-cast value in dEQP-GLES2.functional.shaders.algorithm.rgb_to_hsl_fragment passed through ANGLE, and allows the test to pass. We believe it to be a test bug, but emitting better code like apparently everyone else does is also a fine result. No change on GLES gfxbench shaders. Fixes: #6585 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17546>	2022-07-18 22:41:18 +00:00
Konstantin Seurer	fc26fbde3d	vulkan: Common vk_format_get_component_bits RADV and PowerVR use the same implementation. Turnip does use a slightly modified version but the helper only has one use -> just inline it and get rid of turnip's vk_format.h. Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17515>	2022-07-18 22:14:06 +00:00
jheaff1	2e71e23188	build(glx): Fix build by adding missing deps dri3_glx.c includes xshmfence and glxcmds.c includes xf86vm, neither of which are listed as dependencies of the glx lib in the meson.build file. Consequently, those files would fail to compile on machines that did not have xshmfence and xf86vm installed globally. This commit rectifies the issue by adding the missing dependencies Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17585>	2022-07-18 21:12:26 +00:00
Mike Blumenkrantz	48491386ff	mesa/st: add implicit zeroing of clipdistance array GL drivers have an implicit default of "in bounds" for unwritten clipdistance values, but some (layered) drivers have to deal with api mismatch which prevents that implicit value from being used after the shader gets mangled by various compiler passes to avoid issues here, write out all members of the clipdistance array with zeroes at the very start of the shader such that any values the shader actually writes will naturally overwrite the implicit zero and any unwritten values will now be written as zero fixes #6845 Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17498>	2022-07-18 20:33:11 +00:00
Mike Blumenkrantz	071d335ca2	zink: tu a630 baseline update Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17606>	2022-07-18 20:15:05 +00:00
Mike Blumenkrantz	b6df410d26	zink: nv baseline update Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17606>	2022-07-18 20:15:05 +00:00
Adam Jackson	c123ab2137	kopper: Implement {EGL,GLX}_EXT_buffer_age Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17527>	2022-07-18 19:31:29 +00:00
Mike Blumenkrantz	81d83e81db	zink: break out tc/trace context unwrapping Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17527>	2022-07-18 19:31:28 +00:00
Yiwei Zhang	a211d74096	venus: filter out VK_EXT_physical_device_drm on the driver side Fixes: `a1a22862c6` ("venus: implement VK_EXT_physical_device_drm") Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Ryan Neph <ryanneph@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17591>	2022-07-18 19:23:53 +00:00
Dave Airlie	50e3303b3d	kms/dri: add mutex lock around map/unmap this can get called from multiple threads with the recent llvmpipe overlapping rendering changes, so make sure to lock around the map/unmapping so they can't race. This should fixes some crashes seen with kwin. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Tested-by: Adam Williamson (Fedora) Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17531>	2022-07-18 19:06:30 +00:00
Samuel Pitoiset	5ee5c73d2d	radv: implement PS epilogs Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	270cc39648	aco: add support for compiling PS epilogs Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	8d13392969	aco: refactor export_fs_mrt_color() for PS epilogs preparation Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	df8fb721a5	radv,aco: rename radv_aco_build_prolog to radv_aco_build_shader_part Will be re-used for PS epilogs. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	897561b7b9	aco: add aco_postprocess_shader() helper Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	d9ffff09b0	aco: prevent adding DONE/VM to the last export if the FS has an epilog If the fragment shader exports MRTZ and the epilog some color exports, DONE/VM should be added to the last export. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	2784bfe93f	aco: do not abort if the FS doesn't export anything but has an epilog The main fragment shader can only export MRTZ (if present) and the epilog will export colors. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	a6dff6caa1	aco: emit p_jump_to_epilog if the main fragment shader has an epilog MRTZ is still exported from the main shader. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	8bdcc20815	aco: add new pseudo instruction p_jump_to_epilog The first operand of this new pseudo-instruction is a 64-bit SGPR for the continue PC, followed by a variable list of fixed VGPRS for the color exports which are the PS epilog inputs. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	0fd3754c26	radv: add a function that declares PS epilog shader arguments The PS epilog would be a "normal" compiled shader using RA, etc. It will declare up to 8x4 VGPRs for all color exports. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	a38db1a94e	radv: declare a new user SGPR arg in FS for the epilog PC The main FS would have to jump to the PC of the PS epilog. Given that shaders are allocated in the 32-bit addr space, one user SGPR is fine. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	0db7a0b6e8	radv,aco: introduce {radv,aco}_ps_epilog_key To pass the necessary pipeline information for compiling PS epilogs. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Samuel Pitoiset	eee098486a	radv,aco: track if a fragment shader needs an epilog This is currently disabled but it will be used for testing first, and then for graphics pipeline libraries. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Ruijing Dong	a585d95803	frontends/va: WA for ffmpeg 10bit encoding crash When doing 10bit encoding in ffmpeg it uses VaDeriveImage, and that could result in missing mapping the chroma buffer of the input frame. This WA to disallow ffmpeg using VaDeriveImage function, so that VaCreateImage and VaPutImage can be used and WA the chroma buffer mapping issue. Reviewed-by: Leo Liu <leo.liu@amd.com> Signed-off-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17472>	2022-07-18 17:23:22 +00:00
Ruijing Dong	cd653e5cc7	frontends/va: do texture_map when needed When map buffer, and its target is texture, texture_map/unmap need to be used. Reviewed-by: Leo Liu <leo.liu@amd.com> Signed-off-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17472>	2022-07-18 17:23:22 +00:00
Erik Faye-Lund	e630637eab	dzn: expose VK_KHR_driver_properties We're not quite conformant with the extension, because we don't have a valid conformance version. That's not a quick-fix, so we should probably just accept some failures for now. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16998>	2022-07-18 17:10:52 +02:00
Erik Faye-Lund	e5da067384	dzn: fill misc props This is just a bag of misc properties that we should fill in. Not all of them are filled out super accurately, but this is the best we can do for now. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16998>	2022-07-18 16:49:42 +02:00
Erik Faye-Lund	053e2fd9d0	dzn: fill in minmax props This should be possible to support, but we don't support minmax blending at all yet, so let's leave these as unsupported for now. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16998>	2022-07-18 16:49:42 +02:00
Erik Faye-Lund	18c590e0b3	dzn: fill in depth/stencil resolve props Before enabling Vulkan 1.2 support, we need to fix the TODO in here. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16998>	2022-07-18 16:49:42 +02:00
Erik Faye-Lund	141e715f29	dzn: fill in bindless props These might not be exactly right, but they are good enough for now. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16998>	2022-07-18 16:49:42 +02:00
Erik Faye-Lund	34b0828cdc	dzn: fill in non-uniform-indexing props Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16998>	2022-07-18 16:49:42 +02:00
Erik Faye-Lund	8e9191cd24	dzn: fill in float-control details We can do better here in the future, but this is what's supported right now. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16998>	2022-07-18 16:49:42 +02:00
Erik Faye-Lund	4d7403d4dc	dzn: fill in driver name and info Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16998>	2022-07-18 16:49:42 +02:00
Konstantin Seurer	0580910aa9	radv: Only set rt stack size for dynamic stacks When using a static callable stack, the required scratch has already been allocated. Dynamic stacks are located at the end of scratch memory and are allocated on demand using radv_set_rt_stack_size. Static stacks live at the start of scratch memory and are allocated in create_rt_shader by setting scratch_size. Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-By: Tatsuyuki Ishi <ishitatsuyuki@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17579>	2022-07-18 12:05:35 +00:00
Qiang Yu	eeaf0b1888	ac/nir/ngg: add a barrier before prim id export When culling enabled, it will use LDS space, which overlap with the prim id export. Fixes: `e97f0463a8` ("ac/nir: Implement NGG deferred attribute culling in NIR.") Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17593>	2022-07-18 09:50:09 +00:00
Qiang Yu	0b7ef846b3	ac/nir/ngg: fix nogs culling scratch size Should be in bytes not dwords. Fixes: `e97f0463a8` ("ac/nir: Implement NGG deferred attribute culling in NIR.") Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17593>	2022-07-18 09:50:09 +00:00
Timur Kristóf	5050db0b55	radv: Remove trailing whitespace introduced by DGC commits. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17580>	2022-07-18 08:23:41 +00:00
Timur Kristóf	b732285312	radv: Only initialize DGC state when DGC is enabled. This function causes a crash with RADV_DEBUG=llvm and this commit works around that crash. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17580>	2022-07-18 08:23:41 +00:00
Mike Blumenkrantz	7ea7d0687b	zink: inject a 0,0,0,1 clear for RGBX formats this ensures the alpha component is full if it must be read for fbfetch fixes (RGBX swapchain config): KHR-GL46.blend_equation_advanced* Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:38 +00:00
Mike Blumenkrantz	ac38139c34	zink: simplify zink_framebuffer_clear_data union no functional changes Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:38 +00:00
Mike Blumenkrantz	7754041ed6	zink: delete srgb tracking for clears no longer used Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:38 +00:00
Mike Blumenkrantz	b0e62adbcc	zink: delete zink_fb_clear_util_unpack_clear_color no longer used Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:38 +00:00
Mike Blumenkrantz	d8fa4e6797	zink: remove out-of-renderpass clears these are only ever going to hurt tiler perf, so remove the footgun this also means there's no more srgb format conversion needed, so delete all of that too Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:38 +00:00
Mike Blumenkrantz	66ceea7ed9	zink: lift clearing on fb state change up a level in the scenario where: * at least 1 color buffer was bound and a depth buffer was bound * no color clear was enabled * a zs clear was enabled * the zs clear was never flushed * the zs clear needs a renderpass * the fb state changes the color buffer(s) would be unbound, following which the depth buffer unbind would trigger a renderpass, which would utilize the just-unbound color buffers, which have no batch tracking, thus creating a case where the surface was destroyed while it was still in use cc: mesa-stable Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:38 +00:00
Mike Blumenkrantz	e81d6cec27	zink: clamp color clear values based on format formats like GL_RGB10_A2UI can be cleared with out of range values, so to ensure consistent driver behavior, pre-clamp to the valid range affects: KHR-GL46.direct_state_access.renderbuffers_storage_multisample Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:38 +00:00
Mike Blumenkrantz	95f1c84739	zink: add explicit (awful) handling for fb layer mismatch clears this is terrible and (hopefully?) rare, so just force it out early to avoid any kind of issue Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:38 +00:00
Mike Blumenkrantz	631db579af	zink: track a bitmask of fb attachments with mismatched layer counts these need special handling for clears Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:38 +00:00
Mike Blumenkrantz	9d68684240	zink: always use storeOp=STORE for depth renderpass it's unknown whether there may be clears to the depth attachment at the start of a renderpass, so always assume there will be Fixes: `c132a28745` ("zink: use store op NONE when necessary for depth usage") Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:38 +00:00
Mike Blumenkrantz	442281447a	zink: remove u_blitter usage from zink_clear_render_target this is more operations than needed Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:38 +00:00
Mike Blumenkrantz	d53395ad06	zink: remove non-renderpass clear path from zink_clear_texture this should always be faster Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:37 +00:00
Mike Blumenkrantz	f1f08e3529	zink: massively simplify zink_clear_depth_stencil this now just uses renderpass clears Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:37 +00:00
Mike Blumenkrantz	3488dc7dd0	zink: improve zink_clear_depth_stencil check for current attachment this is technically more correct since it accounts for multi-context usage Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:37 +00:00
Mike Blumenkrantz	c1cc2bd48e	zink: stop using u_blitter for texture clears this was really stupid: instead of just binding a new fb and firing off a clear, the code was calling u_blitter to bind a new fb and do actual draws Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:37 +00:00
Mike Blumenkrantz	04a5471b5e	zink: fix coverage check for texture clears this wasn't actively harmful, but it was potentially differently-performant Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:37 +00:00
Mike Blumenkrantz	b33f041810	zink: remove format check from clear texture this is going to be a render target no matter what Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:37 +00:00
Mike Blumenkrantz	8bb5a11503	zink: fix transient attachment rp assert cc: mesa-stable Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:37 +00:00
Mike Blumenkrantz	d1db078619	zink: remove bogus range tracking from texture clear too much copy/paste Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17366>	2022-07-18 03:40:37 +00:00
Dave Airlie	af2e4a23c9	lavapipe: enable variablePointers This passes the CTS with no regressions. Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17587>	2022-07-17 23:16:40 +00:00
Dave Airlie	dbd9caae5e	lavapipe: drop unreachable pNext checks. These are reachable, and dEQP-VK.api.smoke.triangle_ext_structs,Crash is why. Signed-off-by: Dave Airlie <airlied@redhat.com> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17586>	2022-07-17 23:02:19 +00:00
Mihai Preda	bc78b861ca	gallium: LLVM-15 contexts use non-opaque pointers LLVM-15 enables opaque pointers by default. We temporarilly request non-opaque pointers while we migrate our code to support non-opaque pointers. This workaround needs to be removed before LLVM-16. See #6615 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Kai Wasserbäch <kai@dev.carbon-project.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17514>	2022-07-17 21:44:37 +00:00
Rob Clark	81d85be9a5	freedreno/gmem: Reverse order of alternative tile rows Similar motivation as `c426e21ff1` ("turnip: Reverse the order of walking pipes or tiles on odd rows."), but instead we just swap the order of alternate rows of fd_tile the the gmem stateobj. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17303>	2022-07-16 15:37:34 +00:00
Daniel Stone	cdb7a3b0e2	Revert "CI: Disable Collabora lab" This reverts commit 7a336c97ef692ed96cc93394596a7d0650983874. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17563>	2022-07-16 11:44:26 +00:00
Arvind Yadav	1fbc7337a1	radeonsi: Enable nir_lower_point_smooth lowering pass for point smoothing Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15117>	2022-07-16 07:08:33 -04:00
Arvind Yadav	8adbd2a964	ac/llvm: Implement nir_intrinsic_load_point_coord_maybe_flipped opcodes Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15117>	2022-07-16 07:08:10 -04:00
Arvind Yadav	689559d10f	ac/llvm : Adding Number of all interpolated inputs in ac_shader_abi Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15117>	2022-07-16 07:08:10 -04:00
Arvind Yadav	30865756db	nir: Add a lowering pass for point smoothing When point smoothing is enabled then this lowering pass will modifies the alpha component of every write to fragment output. Anti-aliased points get rounded with respect to their radius instead of square. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15117>	2022-07-16 07:08:09 -04:00
Arvind Yadav	cad4908fa0	nir: add load_point_coord_maybe_flipped intrinsics for point smoothing gl_PointCoord can be flipped upside down via a state. To avoid this adding new load_point_coord_maybe_flipped intrinsics. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15117>	2022-07-16 07:07:32 -04:00
Arvind Yadav	25204d89a6	radeonsi: Add nir_lower_poly_line_smooth pass for polygon and line smoothing Added a new NIR pass for handling polygon and line smoothing and Removed previous smoothing changes. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16245>	2022-07-16 10:15:22 +00:00
Arvind Yadav	2709786bde	nir: Add a lowering pass for polygon and line smoothing When poly_line smoothing is enabled then this lowering pass will modify the alpha component of every write to fragment output using sample coverage mask. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16245>	2022-07-16 10:15:22 +00:00
Emma Anholt	a43b96ab1a	ci/crocus: Drop xfails for the recent image external fix. Fixes: `8856379a03` ("mesa/st: don't guess the internal format if it's known") Acked-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17572>	2022-07-16 04:31:56 +00:00
Emma Anholt	c0930b552d	ci/crocus: Disable the blender trace. It gives inconsistent sha1s. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17572>	2022-07-16 04:31:56 +00:00
Emma Anholt	f96688fec2	ci/crocus: Update portal 2 trace shas for the recent fix. they render correctly now. Fixes: `4e797ac530` ("st/glsl: fix broken vertex attrib mapping") Acked-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17572>	2022-07-16 04:31:56 +00:00
Emma Anholt	c7c265892a	mesa/arbprog: Stop doing optimization in the ARB program IR. You'll get all this and more anyway once you're in NIR. This lets us GC a bunch more ARB program transformation code. No effect in shader-db on softpipe. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17528>	2022-07-16 04:08:32 +00:00
Emma Anholt	c13dbf6ae9	mesa/arbprog: Use nir_lower_io_to_temporaries. This replaces our mesa_remove_output_reads(), which in turn GCs some other ARB program transformation code. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17528>	2022-07-16 04:08:32 +00:00
Emma Anholt	153f7b8852	mesa/arbprog: Move the GLSLFragCoordIsSysVal handling to prog_to_nir. We don't need to go grubbing around in the ARB program when we can use the right variable type at prog_to_nir time. This does leave fp->system_values_read/inputs_read as they were, but I don't see anywhere that that matters (the NIR will have its info gathered appropriately, and other lowering may also cause mismatch between the gl_program and the NIR). Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17528>	2022-07-16 04:08:32 +00:00
Jesse Natalie	c002bbeb2f	util: Add a Win32 futex impl This uses APIs that are not available on Win7. Since this is a build-time configuration, and since we can't use the SDK version as an indicator (since you can support Win7 via new SDKs), a new option is added to allow disabling it, to maintain Win7 support if desired. Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17431>	2022-07-15 21:31:51 +00:00
Yiwei Zhang	62f79f9ec1	venus: add more tracepoints for perf analysis This change adds the tracepoints that can help understand app behavior for debugging and performance optimization purposes. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Ryan Neph <ryanneph@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17497>	2022-07-15 20:57:41 +00:00
Yiwei Zhang	f96e25ae05	venus: suballocate more for layering Previously we suballocate only for host visible memory type to reduce the kvm mem slot usage. That is no longer an issue given the limit has been raised. However, we should still suballocate to make layering clients performant. So we just suballocate regardless of mem type. This change also increases the allowed suballocation size request from 64K to 128K, which makes layering clients happier. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Ryan Neph <ryanneph@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17497>	2022-07-15 20:57:41 +00:00
Mike Blumenkrantz	3ff058ed0b	mesa: update GL_CLAMP emulation when binding/unbinding textures binding/unbinding a texture affects the previously specified parameters, so ensure the driver flag for clamp emulation is also set to perform updates as needed Fixes: `e8f71f6ac4` ("mesa/st: add PIPE_CAP_GL_CLAMP") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17459>	2022-07-15 18:58:15 +00:00
Mike Blumenkrantz	068239dad0	mesa: track which sampler wrap params use GL_CLAMP this adds a bitmask to sampler objects for tracking whether GL_CLAMP is active Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17459>	2022-07-15 18:58:15 +00:00
Mike Blumenkrantz	c17bfc5309	mesa: move is_wrap_gl_clamp() to samplerobj.h and deduplicate Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17459>	2022-07-15 18:58:15 +00:00
Mike Blumenkrantz	56e5eaeba1	zink: fix xfb emit check in compiler nir->info.has_transform_feedback_varyings is set for all stages in the pipeline when xfb is present, so it can't be used for this harmless, but this is more correct Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17404>	2022-07-15 17:32:27 +00:00
Mike Blumenkrantz	bd2eaaeb7d	zink: add a compiler pass to split xfb block outputs this splits all the members of a struct into separate variables to improve xfb inlining and reduce the number of locations consumed by xfb outputs, reducing the chances of running out of shader outputs Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17404>	2022-07-15 17:32:27 +00:00
Mike Blumenkrantz	924145c7b5	zink: bitcast extracted streamout components to uint before creating uvec spirv can't create a uvec from float components, so pre-cast here Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17404>	2022-07-15 17:32:27 +00:00
Mike Blumenkrantz	c189b7f585	zink: use right glsl length getter for ntv partial stores why does glsl_get_length exist if it returns 0 for the most common cases? Fixes: `31ba19ff68` ("zink: fix ntv partial stores") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17404>	2022-07-15 17:32:27 +00:00
Mike Blumenkrantz	5245290565	zink: fix xfb array inlining get_slot_components() returns the total number of output components for arrays for initial evaluation phase, but during the packed->inlined conversion the arrayed size must be normalized to the slot's component count in order to effectively catch and inline the array Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17404>	2022-07-15 17:32:27 +00:00
Mike Blumenkrantz	042cc6e6e6	zink: split xfb block emission from array/matrix handling these are not necessarily the same case even if in glsl they are the same, and by splitting it out a bunch of redundant array[scalar] code can be deleted Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17404>	2022-07-15 17:32:27 +00:00
Mike Blumenkrantz	76cc519866	zink: handle bare matrix types in xfb emission these have no inherent slot index since they aren't block members Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17404>	2022-07-15 17:32:27 +00:00
Mike Blumenkrantz	a0771cd4ab	zink: always use 32bit floats for so output types doubles may be the output variable type, but the xfb output will always be 32bit Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17404>	2022-07-15 17:32:27 +00:00
Jesse Natalie	f4f1914cd2	microsoft/clc: Add helpers to build with correct ABI for MinGW Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17548>	2022-07-15 16:27:11 +00:00
Jesse Natalie	e14beaf05b	d3d12: Add helpers to build with correct ABI for MinGW Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17548>	2022-07-15 16:27:11 +00:00
Jesse Natalie	489ab1aa3b	dzn: Remove the cast when the SDK version is high enough Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17548>	2022-07-15 16:27:11 +00:00
Jesse Natalie	b9fb14da06	dzn: Missed ABI fixes for GetCustomHeapProperties Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17548>	2022-07-15 16:27:11 +00:00
Tatsuyuki Ishi	648731e2bd	radv: Only set pstate for the first hw_ctx. We used to do it for every queue, which was duplicate work as pstate is per-device. It could also cause trouble when multiple hw_ctx are created as the call will succeed for only one of them and the rest will return -EBUSY. Simplify and fix this by only setting for the first non-null hw_ctx. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17541>	2022-07-15 15:08:07 +00:00
Bas Nieuwenhuizen	2d2591bbb7	radv: Expose VK_NV_device_generated_commands. Closes: #6736 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	05bf39238b	radv: Add stub for vkCmdBindPipelineShaderGroupNV. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	3f09bd5a0e	radv: Implement CmdExecuteGeneratedCommandsNV. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	d7bd9db9fc	radv: Implement DGC cmdbuffer generation. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	37a619f517	radv: Implement DGC generated command layout structure. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	0c7bb92a78	radv: Add DGC meta shader. This generated the cmd and upload buffers. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	848d3fdeb6	radv: Add flushing for DGC. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	df69b34450	radv: Add helper to write scissors. For use by DGC shader. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	57017b494d	radv: Make radv_get_vgt_index_size non-static. For DGC cmdbuffer generation use. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	f6a21fccf9	radv: Expose helper for base pa_su_sc_mode_cntl. So that we can feed it to the DGC shader for front face overrides. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	b213de12d3	radv: Require 32bit memory for indirect buffers. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	cca680bab1	radv: Always store stride in the vbo descriptor. So we can use it in the DGC shader. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	bce3af2cb3	radv: Expose function to write vertex descriptors for dgc. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	82c2e99102	radv: Skip setting empty index buffers to avoid hang In the direct path we already skipped draws, but in DGC I noticed that just emitting these packets can cause issues ... Cc: mesa-stable Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Bas Nieuwenhuizen	f27f06d72c	radv: Add a 32bit memory type. Got to put the commandbuffers & uploadbuffers there. With DGC those can be allocated by the application. Excluding it from all other buffers/images to avoid using the precious 32bit address space. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17269>	2022-07-15 14:45:13 +00:00
Mike Blumenkrantz	b1c1d099a9	zink: always update sampler descriptor layouts on fb surface unbind this will affect the layout Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17524>	2022-07-15 14:20:24 +00:00
Mike Blumenkrantz	f79d71f59e	zink: break out samplerview layout reset code Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17524>	2022-07-15 14:20:24 +00:00
Mike Blumenkrantz	960a6316d4	zink: use sampler_bind_count to simplify some code Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17524>	2022-07-15 14:20:24 +00:00
Mike Blumenkrantz	3a47576687	zink: add a compiler pass to match up tex op dest types this handles bitsize conversions and depth component splatting to enable simplifying some of the related ntv code Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17427>	2022-07-15 14:08:56 +00:00
Mike Blumenkrantz	49d5fa12f2	zink: always use 32bit sample ops while some (tg4) sample ops can use different bit sizes in spirv, most cannot, and all the shader variables are always emitted as 32bit, so ensure the 32bit type is always what's being used for sampling cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17427>	2022-07-15 14:08:56 +00:00
Mike Blumenkrantz	35a4b8989f	zink: allow multiple tex components for depth tg4 this returns a vec4, so don't break the return type by clamping 1 component cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17427>	2022-07-15 14:08:56 +00:00
Daniel Stone	02d9d1557b	CI: Disable Collabora lab It's everyone's favourite day, infrastructure maintenance Friday. This includes manual disables for a618-vk and zink-anv-tgl, because apparently the disable-on-variable rules don't carry through to those jobs for ... some reason. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17553>	2022-07-15 13:26:42 +00:00
Lucas Stach	612b99d721	etnaviv: fix use after free in async shader compile When the shader state is destroyed before the async shader compile is done, we get a use after free in the async thread, as the shader state it is operating on is gone. Fix this by dropping any pending job from the async queue or wait for it to finish before destroying the state by calling util_queue_drop_job. Also call util_queue_fence_destroy, which would have caught this issue by asserting that the queue_fence is in signalled state when the shader state is destroyed. Fixes: `1141ed5859` ("etnaviv: async shader compile") Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17534>	2022-07-15 11:07:12 +00:00
Matt Coster	20350a73a7	pvr: Add helper macros for creating pvr_dev_addr_t instances The two macros introduced here form a (hopefully) unobjectionable subset of those added in !17203. Signed-off-by: Matt Coster <matt.coster@imgtec.com> Reviewed-by: Karmjit Mahil <Karmjit.Mahil@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17488>	2022-07-15 10:39:21 +01:00
Matt Coster	282f0a9330	pvr: Split pvr_dev_addr_t into a separate header This type is useful beyond the scope of winsys. It can now be used without being lumbered with a dependency on pvr_winsys.h. Since pvr_winsys.h is used by pvr_private.h, this can be a common cause for circular dependencies during development. Signed-off-by: Matt Coster <matt.coster@imgtec.com> Reviewed-by: Karmjit Mahil <Karmjit.Mahil@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17488>	2022-07-15 10:39:21 +01:00
Karmjit Mahil	f2e2e66e42	pvr: Update pvrsrv build version for fixed size fw. It's still 1.17 but the version is changed due to the fixed size fw struct update. Signed-off-by: Karmjit Mahil <Karmjit.Mahil@imgtec.com> Reviewed-by: Rajnesh Kanwal <rajnesh.kanwal@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17491>	2022-07-15 00:52:14 +00:00
Jason Ekstrand	cb6375d40c	anv: Stop compacting surface state tables Instead of trying to compact the surface state table to get rid of any unused render targets, emit MAX(1, colorAttachmentCount) surface states always. This ensures that secondaries will always match with primaries when we go to do the copy since there's no rule requiring the secondary to have VK_FORMAT_UNDEFINED when the primary has a NULL image view. Fixes: `3501a3f9ed` ("anv: Convert to 100% dynamic rendering") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17543>	2022-07-14 20:36:24 +00:00
Jesse Natalie	ddbbe96c88	Fix static glapi on Windows On Linux, the static glapi path sees libGL.so implementing the static glapi, and the drivers (libgallium_dri.so) updating/reading the TLS vars. On Windows, to allow libgallium_wgl.dll to be a full ICD, it's responsible for implementing the actual static glapi. However, before this change, OpenGL32.dll was also implementing the static glapi, meaning that GL API calls from OpenGL32.dll didn't route to the driver correctly because the TLS vars were never actually set - the driver set its copy, and OpenGL32.dll read its own copy. Now, always build a bridge and static version of glapi when not using shared. The bridge version is linked into OpenGL32.dll, and the static version is linked into the driver on Windows. GLES only builds with shared glapi - but after this, shared glapi is not really needed on Windows for GLES, since the driver has all of the data. Fixes: `f36921ef` ("wgl: Refactor drivers to a libgallium_wgl.dll") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6560 Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16713>	2022-07-14 20:01:22 +00:00
Brian Paul	29ec6372cc	lavapipe: fix incorrect sv[] array size The sampler views array needs to be dimensioned by PIPE_MAX_SHADER_SAMPLER_VIEWS, not PIPE_MAX_SAMPLERS. This fixes out-of-bounds array writes when using more than 32 textures in a shader. Also add some assertions to check array indexing elsewhere. And change loop limits to be based on ARRAY_SIZE(). Signed-off-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17062>	2022-07-14 19:16:07 +00:00
Brian Paul	a0ea45fb68	llvmpipe: initialize a local var to fix compiler warning in release build Signed-off-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17062>	2022-07-14 19:16:07 +00:00
Brian Paul	dfb2ea3531	llvmpipe: don't allow texture/resource swizzles on linear path This fixes another VMware test (dx9-stretch-formats-copy-a8r8g8b8-x8r8g8b8). Signed-off-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17062>	2022-07-14 19:16:07 +00:00
Brian Paul	2b14a7658a	lavapipe: fix logicop, independent blend enable/disable The logicop_enable and independent_blend_enable vars need to always be assigned, otherwise, once turned on, they could never be disabled. This fixes a number of failures in VMware's test suite. Signed-off-by: Brian Paul <brianp@vmware.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17062>	2022-07-14 19:16:07 +00:00
Brian Paul	98593360c4	gallivm: increase LP_MAX_TGSI_SHADER_IMAGES from 16 to 32 This allows a VMware test to pass. The comments in lp_bld_limits.h mention SM 3.0 requirements, but we're in the SM 5.0 era... Signed-off-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17062>	2022-07-14 19:16:07 +00:00
Brian Paul	c9fd8abe50	llvmpipe: replace LP_RAST_OP_ #defines with enum type Signed-off-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17062>	2022-07-14 19:16:07 +00:00
Brian Paul	90d011de8e	llvmpipe: fix texcoord analysis in llvmpipe_nir_fn_is_linear_compat() For the linear rendering fast path, we need to know whether the texcoord argument to tex instructions comes directly from an FS input (swizzling OK, but no arithmetic, etc). Use the input register info to fill in the tex_info object. This is part of a fix for some linear rendering bugs. Signed-off-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17062>	2022-07-14 19:16:07 +00:00
Brian Paul	c6f9a91015	llvmpipe: fix invalid memory used in lp_fs_linear_run We were saving the address of the constants[] and nir_constants[] arrays in the jit structure. But those arrays went out of scope before use. This patch moves the constants[] array to the function scope and consolidates the TGSI/NIR paths. Signed-off-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17062>	2022-07-14 19:16:07 +00:00
Brian Paul	3743f74d30	llvmpipe: add missing tex_info->texture_unit assignment The texture_unit field needs to be set like the sampler_unit field. Also, add a swizzle initialization and some comments. Signed-off-by: Brian Paul <brianp@vmare.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17062>	2022-07-14 19:16:07 +00:00
Brian Paul	53cd3f331f	llvmpipe: replace GET_A0() macro w/ inline function And GET_DADX(), GET_DADY(), GET_PLANES(). This is a bit more readable, making the expected parameter types explicit. Signed-off-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17062>	2022-07-14 19:16:07 +00:00
Brian Paul	26775d7ca7	gallivm: s/0/LP_BLD_TEX_MODIFIER_NONE/ A minor readability improvement. Signed-off-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17062>	2022-07-14 19:16:07 +00:00
Brian Paul	2d4473c2ac	llvmpipe: replace if/then with switch in llvmpipe_nir_fn_is_linear_compat() To simplify the logic a bit. Signed-off-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17062>	2022-07-14 19:16:07 +00:00
Brian Paul	3e50112861	nir: add const qualifiers, move some decls in nir_to_tgsi_info.c Signed-off-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17062>	2022-07-14 19:16:07 +00:00
Brian Paul	36c22b5dfb	llvmpipe: minor code re-org in lp_state_fs_analysis.c And add comments. Signed-off-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17062>	2022-07-14 19:16:07 +00:00
Brian Paul	923c73290e	lavapipe: rework code to compute textures_used, samplers_used The code did not handle more than 32 textures. We have a test that exercises 128 textures (views) and crashed w/ memory corruption down in the llvm code generator because of this. Signed-off-by: Brian Paul <brianp@vmware.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17062>	2022-07-14 19:16:07 +00:00
Brian Paul	8bd6feaca5	util/bitset: add BITSET_SIZE() To get the size (in bits) of a bitset. And minor clean-up in __bitset_ffs(). Signed-off-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17062>	2022-07-14 19:16:07 +00:00
Brian Paul	938767e835	llvmpipe: add simple assertion in generate_fragment() Signed-off-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17062>	2022-07-14 19:16:07 +00:00
Brian Paul	0c80bdf758	gallivm: s/unsigned/enum pipe_swizzle/ Signed-off-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17062>	2022-07-14 19:16:07 +00:00
Brian Paul	3f0a6c7ac1	llvmpipe: remove lp_rast_cmd_arg::state field Use the existing 'set_state' field. Some code was using 'state' and other code was using 'set_state'. It didn't really matter since lp_rast_cmd_arg is a union, but this removes some potential confusion. Signed-off-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17062>	2022-07-14 19:16:07 +00:00
Brian Paul	4d5d7d16dc	llvmpipe: add minor comments in lp_rast.h, lp_setup-rect.c Signed-off-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17062>	2022-07-14 19:16:07 +00:00
Lionel Landwerlin	2cac3b3817	anv: ensure tile flush before streamout writes Streamout is not L3 coherent so previous writes to the same address might be pending and overwrite the SO writes later when they get flushed from L3, even though the SO write happened later in the batch. v2: Use the right flag (not COUNTER) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6680 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17503>	2022-07-14 18:28:52 +00:00
Jordan Justen	4246a1ff47	intel/compiler: Don't create vec4 reg-set for gen8+ After `60e1d0f028`, we know that vec4 will never be used for gen >= 8. Ref: `60e1d0f028` ("intel/compiler: Remove INTEL_SCALAR_... env variables") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17437>	2022-07-14 17:49:01 +00:00
Connor Abbott	67c9ca2319	tu: Use incoherent CCU write for buffer accesses Unlike image writes, buffer writes may access the same memory in different ways, which we've seen in the past can cause problems. Use an incoherent access to force flush/invalidate between accesses to the same buffer, unless we know the barrier applies to images only. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17193>	2022-07-14 17:23:39 +00:00
Konstantin Seurer	82283de717	radv: Use a global address for sbt_base Required for indirect(2) ray tracing to work. Fixes the following tests: dEQP-VK.ray_tracing_pipeline.trace_rays_indirect2.indirect_* dEQP-VK.ray_tracing_pipeline.trace_rays_cmds_maintenance_1.indirect2_* Fixes: `16585664` ("radv: vkCmdTraceRaysIndirect2KHR") Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17316>	2022-07-14 16:35:31 +00:00
Konstantin Seurer	69daa3f762	radv: Use a global address for ray_launch_size Required for indirect(2) ray tracing to work. Fixes: `b30f96dd` ("radv,aco: Use ray_launch_size_addr") Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17316>	2022-07-14 16:35:31 +00:00
Boyuan Zhang	3962555db8	radeonsi/vcn: use calculated max hierarchy depth for hevc enc Certain player has hard coded max_transform_hierarchy_depth_inter and max_transform_hierarchy_depth_intra values set through VA-API, which doesn't work on radeon HW. Until properly fixing it on player side, temporarily adding this workaround to use calculated values instead. Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17489>	2022-07-14 14:23:02 +00:00
Jesse Natalie	13e73e39cc	simple_mtx: Replace GCC sync intrinsics with u_atomic ops Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17529>	2022-07-14 12:49:51 +00:00
Jesse Natalie	4845bc7072	zink: Use p_atomic_fetch_add Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17529>	2022-07-14 12:49:51 +00:00
Jesse Natalie	3245d3a219	u_atomic: Add p_atomic_fetch_add which returns the old value Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17529>	2022-07-14 12:49:51 +00:00
Jesse Natalie	81bbfab5df	u_atomic: Fix MSVC p_atomic_add_return InterlockedExchangeAdd returns the old value, not the new one Cc: mesa-stable Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17529>	2022-07-14 12:49:51 +00:00
Jesse Natalie	104c301658	u_atomic: Implement p_atomic_xchg for Windows Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17529>	2022-07-14 12:49:51 +00:00
Yogesh Mohan Marimuthu	1c2ca3cfb7	radeonsi: no need to call si_pm4_clear_state() in si_pm4_free_state() the si_pm4_clear_state() initializes the variable in struct si_pm4_state which anyway gets freed in si_pm4_free_state(). Hence no need to call si_pm4_clear_state() in si_pm4_free_state(). Signed-off-by: Yogesh Mohan Marimuthu <yogesh.mohanmarimuthu@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17504>	2022-07-14 10:30:09 +00:00
Yogesh Mohan Marimuthu	2330c71751	radeonsi: remove tabs from code v2: fix indentation after if (Marek Olšák) Signed-off-by: Yogesh Mohan Marimuthu <yogesh.mohanmarimuthu@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17504>	2022-07-14 10:30:09 +00:00
Kai Wasserbäch	301bcbac0e	fix(gallivm): Replace LLVMConstF* with LLVMBuild* methods. LLVM 15 removed support for LLVMConstF in commit 4bb7b6fae3be02031878b2aa3be584c6627ad8ec. Reference: `4bb7b6fae3` Reference: `07146a9e64/llvm/docs/ReleaseNotes.rst (changes-to-the-c-api)` Closes: mesa/mesa#6863 Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Mihai Preda <mhpreda@gmail.com> Tested-by: Marcus Seyfarth <m.seyfarth@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17518>	2022-07-14 10:02:14 +00:00
Lionel Landwerlin	a41e8dc588	spirv: switch to uint64 for rayquery internal type Fixes dEQP-VK.ray_query.advanced.using_wrapper_function.comp.* An empty struct is causing problems because when passing it as argument the spirv parser will just drop the argument, considering it does not hold any data. v2: update radv CI Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `4c703686db` ("spirv: handle ray query intrinsics") Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17420>	2022-07-14 09:15:52 +00:00
Mike Blumenkrantz	05552b4688	lavapipe: support inlined shader spirv for compute no testing, full send Fixes: `d4d5a7abba` ("lavapipe: implement EXT_graphics_pipeline_library") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17365>	2022-07-14 04:34:18 +00:00
Jesse Natalie	333432310a	d3d12: Fix up resource import validation Format on buffers is irrelevant and bind flag validation needs to be disabled. Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-by: Giancarlo Devich <gdevich@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17446>	2022-07-14 03:45:17 +00:00
Jesse Natalie	d119e6a46f	d3d12: Implement fence opening and value setting Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-by: Giancarlo Devich <gdevich@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17446>	2022-07-14 03:45:17 +00:00
Jesse Natalie	ea243ef1d5	d3d12: Implement server signal/wait Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-by: Giancarlo Devich <gdevich@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17446>	2022-07-14 03:45:17 +00:00
Jesse Natalie	416f10fc3f	d3d12: Support importing fences / timeline semaphores Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-by: Giancarlo Devich <gdevich@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17446>	2022-07-14 03:45:17 +00:00
Jesse Natalie	e3e22ce882	d3d12: Support opening resources and memobj by name Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-by: Giancarlo Devich <gdevich@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17446>	2022-07-14 03:45:17 +00:00
Jesse Natalie	d462325895	d3d12: Implement resource_from_memobj If the memobj wraps a resource, then we only succeed the mapping operation if the gallium desc matches the D3D12 resource desc. If the memobj wraps a heap, then we can place whatever gallium is describing on the heap. Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-by: Giancarlo Devich <gdevich@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17446>	2022-07-14 03:45:17 +00:00
Jesse Natalie	bd0407a4a6	d3d12: Support creating memory objects They can wrap either an opened heap or resource Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-by: Giancarlo Devich <gdevich@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17446>	2022-07-14 03:45:17 +00:00
Jesse Natalie	c87d7cfaad	d3d12: PIPE_BIND_SHARED doesn't mean linear and is always on opened resources Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-by: Giancarlo Devich <gdevich@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17446>	2022-07-14 03:45:17 +00:00
Jesse Natalie	e02b11af57	d3d12: Get adapter LUID after device creation Otherwise it's only set if the GL frontend gave us one Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-by: Giancarlo Devich <gdevich@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17446>	2022-07-14 03:45:17 +00:00
Jesse Natalie	d6fa0a20b0	d3d12: Support B4G4R4A4 format Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-by: Giancarlo Devich <gdevich@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17446>	2022-07-14 03:45:17 +00:00
Jesse Natalie	5a9cc96784	d3d12: Add pipe getters for Win32 and base external objects device matching Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-by: Giancarlo Devich <gdevich@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17446>	2022-07-14 03:45:17 +00:00
Jesse Natalie	5f6795309f	d3d12: Compute UUIDs required by external objects extension Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-by: Giancarlo Devich <gdevich@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17446>	2022-07-14 03:45:17 +00:00
Jesse Natalie	1b59b542ba	d3d12: Store the rest of the device IDs in the screen Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-by: Giancarlo Devich <gdevich@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17446>	2022-07-14 03:45:17 +00:00
Jesse Natalie	acba2bbb0e	gallium, mesa: Support setting timeline semaphore values Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Marek Olák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17446>	2022-07-14 03:45:17 +00:00
Jesse Natalie	cd01e71999	mesa: Implement ImportSemaphoreWin32NameEXT Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Marek Olák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17446>	2022-07-14 03:45:16 +00:00
Jesse Natalie	c760f0e8b8	mesa: Support importing D3D12 fences as timeline semaphores Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Marek Olák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17446>	2022-07-14 03:45:16 +00:00
Jesse Natalie	8cd391a63e	gallium: Add a new fence type with a pipe cap to indicate it can be imported Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Marek Olák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17446>	2022-07-14 03:45:16 +00:00
Jesse Natalie	633a469a7a	driver_noop: Remove infinite recursion from create_fence_win32 Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Marek Olák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17446>	2022-07-14 03:45:16 +00:00
Jesse Natalie	7c4fa79bfa	gallium: Add 'name' field to Win32 semaphore import Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Marek Olák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17446>	2022-07-14 03:45:16 +00:00
Jesse Natalie	51408dfec4	mesa: Implement ImportMemoryWin32NameEXT Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Marek Olák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17446>	2022-07-14 03:45:16 +00:00
Jesse Natalie	91b14d4a77	gallium: Add a 'name' field to winsys_handle Win32 memory objects can be imported by name (const void * that will be interpreted as const wchar_t *) Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Marek Olák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17446>	2022-07-14 03:45:16 +00:00
Jesse Natalie	8f11c0553c	mapi: Add more EXT_external_objects_win32 functions/enums Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Marek Olák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17446>	2022-07-14 03:45:16 +00:00
Jesse Natalie	78ba74cfda	mesa: Support D3D11/D3D12 memory imports Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Marek Olák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17446>	2022-07-14 03:45:16 +00:00
Sami Kyöstilä	0ff4f5a7e9	util: Shut down Perfetto before driver unload Shut down Perfetto before unloading the driver to fix a crash caused by an internal Perfetto thread that kept running after dlclose() took away its code. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4909 Reviewed-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17454>	2022-07-14 00:41:51 +00:00
Mike Blumenkrantz	89a9220cbf	zink: reject swizzled format blits e.g., R8G8B8A8 -> B8G8R8A8 is invalid, so use u_blitter fixes (various gl configs): KHR-GL46.blend_equation_advanced* cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17523>	2022-07-13 22:14:39 +00:00
Alyssa Rosenzweig	3a0a8688d3	panfrost: Use early-ZS helpers Remove the previous compile-time early-ZS implementation and replace it with the decoupled early-ZS implementation. This uses more efficient settings in some cases (depth/stencil tests always passes or do not write), and fixes the settings used in another case (alpha-to-coverage enabled with an otherwise early-ZS shader.) Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Closes: #6206 Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17428>	2022-07-13 21:05:35 +00:00
Alyssa Rosenzweig	2454531de4	panfrost: Add zsa->zs_always_passes flag If we know ahead-of-time that depth/stencil testing will always pass, it's better to use weak_early than force_early. However, if depth/stencil testing could fail (discarding pixels), we'd rather use force_early. Determine which case we're in at CSO create time. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17428>	2022-07-13 21:05:35 +00:00
Alyssa Rosenzweig	fe875c0144	panfrost: Unit test early-ZS helpers The new early-ZS helpers are pure functions, leaf nodes of the call graph, and implemented with a different algorithm from the "oracle" table of correct values for various combinations of states. Further, incorrect settings often still pass CTS while causing game bugs or inefficiencies. That combination makes the helpers an excellent candidate for unit tests. Add some. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17428>	2022-07-13 21:05:35 +00:00
Alyssa Rosenzweig	e96292bc07	panfrost: Add decoupled early-ZS helpers Bifrost (and Valhall) separate early-ZS configuration into two fields: when does the depth/stencil buffer update happen? and when are pixels killed by the depth/stencil tests? The driver separately configures these to occur early (before the shader executes) or late (after the ATEST instruction executes at the end of the shader). Early tests are generally more efficient, but various combinations of API state and fragment shader properties can require late updates and/or late kills for correctness. Determining how to configure these fields is nontrivial. Our current implementation (on Bifrost) configures these fields at fragment shader compile time and bakes the settings into the RSD. This is both wrong (using early testing when late testing is required) and suboptimal (using late testing when early testing would suffice). We need to defer this configuration until draw time, when we know rasterizer and Z/S state. Reclassifying at draw time (as we currently do on Valhall) would be expensive, especially with the extra terms added in here. To cope, decouple the shader classification from the draw-time configuration. Since there are only a few bits of draw state involved, this implementation just calculates all possible states. Then the draw time classification is just indexing into a lookup table. The actual algorithm used to classify is written with correctness and clarity in mind. Unlike the current classification algorithm (which tries to match what the DDK does, poorly), this algorithm embeds its proofs of correctness. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17428>	2022-07-13 21:05:35 +00:00
Alyssa Rosenzweig	c89a3ad857	panfrost: Fix shader_modifies_coverage on Valhall Alpha-to-coverage should set this flag. This is a hardware change since Bifrost. Fixes: `26d339ef8a` ("panfrost: Generate Valhall Malloc IDVS jobs") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17428>	2022-07-13 21:05:35 +00:00
Alyssa Rosenzweig	29c33f75d3	pan/va: Stall after ATEST In theory this wait is required for correct behaviour of discarded threads with ATEST. Mesa usually waits before the instruction after ATEST, so this wait will get optimized out by va_merge_flow, but as our scheduler gets more sophisticated this could become an issue. Let's stay on the safe side and insert the recommended wait. No shader-db changes. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17428>	2022-07-13 21:05:35 +00:00
Alyssa Rosenzweig	db2bdc1dc3	pan/bi: Require ATEST coverage mask input in R60 In theory, ATEST can take any combination of registers for inputs. Experimentally, however, ATEST requires the coverage mask in R60. This avoids regressing the following dEQP tests, which write their coverage mask with pixel-frequency-shading but without writing to the depth/stencil buffer. dEQP-GLES31.functional.shaders.sample_variables.sample_mask.discard_half_per_pixel.* This issue is known to affect both Mali-G52 (v7) and Mali-G57 (v9). I am unsure if this is a silicon bug or just an obscure implementation detail. No shader-db changes. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17428>	2022-07-13 21:05:35 +00:00
Jason Ekstrand	1b3777ee0f	panfrost: Simplify sample_shading Nos that glsl_to_nir is setting sample_shading_enable whenever FB fetch is used, we don't need to duplicate it here. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	3346f6918f	intel/fs,anv: Rework handling of coarse and sample shading Now that this information is accurately gathered by spirv_to_nir, we no longer need the hack. We just need to fix up the way we handle some of the key bits. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	d0b154319d	intel/fs: Simplify persample_dispatch Thanks to the previous commit, we no longer need this check. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	1124bee4ba	glsl/nir: Set sample_shading if a FS output ever shows up as an rvalue If framebuffer fetch is used, we have to enable sample shading because the fetched framebuffer value is per-sample. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	3cf103f23d	nir/gather_info: Stop gathering uses_sample_shading Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	b6543470fe	spirv/nir: Set uses_sample_shading from spirv_to_nir We don't really want to base this on a late nir_gather_info for two reasons: 1) The Vulkan spec says that if a sample-qualified input, SampleID, or SamplePosition are in the entry-point's interface, you get per-sample dispatch. This means we really should gather this information before dead-code has a chance to delete anything. 2) We want to be able to add nir_intrinsic_load_sample_pos intrinsics as part of lowering passes without causing per-sample interpolation. This means nir_gather_info needs to stop gathering it. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	830654b7b0	glsl/nir: Set uses_sample_shading from glsl_to_nir We don't really want to base this on a late nir_gather_info for two reasons: 1) The GL spec says that any static use of a sample-qualified input, gl_SampleID, or gl_SamplePosition causes per-sample dispatch. This means we really should gather this information before dead-code has a chance to delete anything. 2) We want to be able to add nir_intrinsic_load_sample_pos intrinsics as part of lowering passes without causing per-sample interpolation. This means nir_gather_info needs to stop gathering it. For 1, this doesn't actually get us quite there as GLSL IR may have deleted something already. However, it does get us closer. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	fd17aaf430	intel/fs: Use nir_lower_single_sampled This lets us drop demote_sample_qualifiers as well as a back-end check for key->multisample_fbo. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	23b2d625dd	nir: Add a pass for lowering shaders to single-sampled On Intel, we have to do this because we can't ask for the per-sample barycentrics without setting the per-sample dispatch bit or the GPU will hang. However, nothing we're doing in this pass is Intel-specific and it may be a useful optimization for someone else so we may as well make it a generic NIR pass. This version actually does a bit more than the current brw_nir_demote_sample_qualifiers() pass as it also handles pre-nir_lower_io interp_dref_at* as well as a couple system values which we can easily constant-fold. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	ca9f0f72db	intel/fs: Use shader_info::fs::uses_sample_shading NIR constructs this information for us as part of nir_gather_info these days so we can simplify our logic a bit. This will also let us be more correct once we move uses_sample_shading scraping earlier. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	4f3bf712cf	radv: Set uses_sample_shading for copy shaders Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	9d438799c8	intel/blorp: Set uses_sample_shading for MSAA blit shaders Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Alyssa Rosenzweig	cc980ee0ed	panfrost: Protect pandecode by a mutex Pandecode is not thread-safe (for a large number of reasons) and does not even try to be. This is a problem when tracing (or just using PAN_MESA_DEBUG=sync) multithreaded applications. The most common symptom of the problem are assertion failures deep in the red-black tree implementation, which is not thread-safe. Just protect the whole thing by a "in pandecode?" mutex, since this is not performance sensitive code and we don't really care about the extra serialization incurred. As pandecode does not recurse into itself, we may simply lock at the beginning and unlock at the end of each entrypoint in pandecode, which is thread-safe regardless of how pandecode is used. A few entrypoints are refactored to avoid early returns to keep the lock/unlock calls in obvious visual pairs. Fixes flakes when running the CL CTS with PAN_MESA_DEBUG=sync like we would in CI (e.g: events.event_flush) Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Tested-by: Icecream95 <ixn@disroot.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17409>	2022-07-13 19:15:13 +00:00
Alyssa Rosenzweig	96d65b47c7	panfrost: Use implementation-specific tile size The physical tile buffer size (and hence the maximum available tilebuffer size) are implementation-defined. Track this information on the device so we can correctly select tile sizes, instead of hardcoding the value for Midgard. Implementation values are pulled from the "Tile bits/pixel" row of the public Mali data sheet [1]. That row lists the maximum number of bits available for a pixel given the maximum tile size and pipelining. For currently supported hardware (v9 and older), that maximum tile size is 16x16. So those values should be multiplied by (16 * 16 * 2) / 8 to get the physical size in bytes. This may improve Bifrost/Valhall performance on workloads using multiple render targets. It also gets us ready for the dazzling array of tile sizes available with v10. [1] https://developer.arm.com/documentation/102849/latest/ Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17432>	2022-07-13 19:00:41 +00:00
Alyssa Rosenzweig	d67681c4ea	panfrost: Make pan_select_max_tile_size O(1) Separate out "calculating the size of each pixel", "selecting a tile size", and "calculating the colour buffer allocation". Then implement the middle (selecting a tile size) with a simple constant time expression, rather than a loop. There's a bit of related clean up in here. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17432>	2022-07-13 19:00:41 +00:00
Alyssa Rosenzweig	d458384883	pan/va: Handle BIFROST_MESA_DEBUG=nosb For debugging flakes that might be caused due to wrong scoreboarding. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17223>	2022-07-13 18:46:15 +00:00
Emma Anholt	43579901be	ir3: Fix the no-emitted-vertex condition emission in geom lowering. The if statement we insert would insert a new block before the end block (and remove the old pre-end-block). If the new block ended up later in the HT due to its pointer's hash value, you'd emit another copy of the if statement after the last one. I saw this happen up to 4 times in testing. The worst case would be if all those additions and removals ended up reallocating the HT, at which point we might use-after-free. Fixes inconsistent shader-db results with geometry shaders. Cc: mesa-stable. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17501>	2022-07-13 18:16:45 +00:00
Alyssa Rosenzweig	2171412c66	pan/va: Print instructions with pack assert fails va_pack asserts a large number of invariants about the instruction being packed. If one of these fails (due to an invalid instruction), it's helpful to inspect the failing instruction, as it may not be apparent in a large shader. Pass the instruction through with all the assertions in va_pack for easier debugging. Now assertion failures when packing are easier to debug: Invalid invariant lo.type == hi.type: = STORE.i32.flow2.wls br5, br2, wls_ptr[1], byte_offset:0 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17224>	2022-07-13 18:01:56 +00:00
Alyssa Rosenzweig	cfeafef755	pan/va: Use invalid_instruction in more places By passing the instruction pointer through the packer, we can get better error messages with invalid_instruction. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17224>	2022-07-13 18:01:56 +00:00
Alyssa Rosenzweig	cc94409d70	pan/va: Dump unencodable instructions When we assert out due to certain invalid encoding, it's helpful to know what instruction is causing the failure, since it may not be obvious from the assembly for large shaders. Now we get nice errors when failing: Invalid opcode: br0 = VAR_TEX.f32.flow8.store.skip.lod_mode.center , texture_index:0, varying_index:0 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17224>	2022-07-13 18:01:56 +00:00
Alyssa Rosenzweig	fd1edbc6e5	panfrost: Only key points to point coord origin Apparently, the point coord origin within a batch can change with Gallium (seemingly even with GLES? where that's impossible at an API level?) but that doesn't matter if we're not drawing points. This might have to do with internal Gallium CSOs (e.g. u_blitter). Issue noticed in SuperTuxKart, which was getting state change flushes. Performance on one level on a Valhall GPU improved from 26fps to 29fps. Fixes: `3641dfe436` ("panfrost: Flip point coords in hardware") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17430>	2022-07-13 17:43:50 +00:00
Alyssa Rosenzweig	44d9c41b6b	panfrost: Revert provoking vertex assertion `b6a30b72ab` ("panfrost: Implement provoking vertices on Valhall") added an assertion that every draw selects a particular provoking vertex. The intent was to ensure provoking vertex selection actually happened. Unfortunately, the assertion is too strong, as the provoking vertex is irrelevant for some (most) draws. For those, we don't want to commit to a particular provoking vertex for those to avoid flushing. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17430>	2022-07-13 17:43:50 +00:00
Adam Jackson	768238fdc0	glx: Fix drawable refcounting for naked Windows driFetchDrawable is only ever called from the MakeCurrent path, which means it has to handle the case of pre-GLX-1.3 Windows being named as the drawable. When it finds the drawable in the hash, it increments its refcount before returning it, so for a GLXWindow it would be 2 on first return, one from glXCreateWindow and one from glXMakeCurrent. But when it does not find the drawable and creates one for the naked Window, the reference count on first return would only be 1. As a result, if this context was then ever bound to a different drawable, the old Window's DRI drawable state (like the back buffer) would be destroyed. Fixes piglit's glx-multi-window-single-context and glx-make-current for a variety of drivers. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6713 Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17479>	2022-07-13 12:25:30 -04:00
Marcin Ślusarz	585d81e3ec	intel/compiler: print shaders after nir_remove_unused_varyings Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17516>	2022-07-13 15:50:02 +00:00
Lucas Stach	0d13dfcf7c	etnaviv: tex_desc: remove descriptor patch TODO comment There is nothing more TODO here. With softpin, which is available on all GPUs using texture descriptors, there is no need for the kernel to patch the descriptor, as the proper GPU virtual address is filled in by userspace. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17448>	2022-07-13 15:00:33 +00:00
Lucas Stach	8ddaca1633	etnaviv: tex_desc: make error handling more consistent There already is a error handling label to free the sampler view struct and return failure. Consistently use this label to make error handling more uniform. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17448>	2022-07-13 15:00:33 +00:00
Lucas Stach	9a48b1bdb2	etnaviv: add texture descriptor suballocator Texture descriptors currently waste a massive ammount of memory, as every one is allocated via a separate BO. As the allocation granularity of the kernel is 4KB and the descriptor is only 256B, 93.75% of the allocated memory is wasted. Add a simple suballocator for the texture descriptors, to allocate multiple ones out of a single kernel BO. This isn't perfect, as freed slots in the suballocated resource are not reused, but worst-case we end up with the same waste as we had before. This also potentially improves efficiency at the kernel side, as this reduces the number of BOs needed for the sampler views in each submit. As the BO is now used by multiple descriptors, avoid syncing with the GPU via the cpu_prep/fini calls, as to not introduce stalls between pending rendering and new descriptors being filled. This is safe, as each suballocation slot is only used once, so newly filled slots are certainly not in use by the GPU. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17448>	2022-07-13 15:00:33 +00:00
Lucas Stach	2c08decc8f	etnaviv: move dummy BOs to screen The dummy texture descriptor and the dummy render target relocs are not ever changed by a context operation, so we can save some space by moving them to the screen and potentially share them and the BOs backing them between multiple contexts. Also don't hold two pointers to the same BO, one in the reloc and one raw, but always just use the reloc one. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17448>	2022-07-13 15:00:33 +00:00
Eric Engestrom	f7f74a984b	zink: add missing guards around `have_{ext}` Signed-off-by: Eric Engestrom <eric@igalia.com> Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17466>	2022-07-13 14:41:59 +00:00
Eric Engestrom	672df4d0fe	zink: drop unused VkPhysicalDevicePortabilitySubsetPropertiesKHR Signed-off-by: Eric Engestrom <eric@igalia.com> Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17466>	2022-07-13 14:41:59 +00:00
Eric Engestrom	282013fe86	zink: fix portability_subset usage after rename from EXTX to KHR Signed-off-by: Eric Engestrom <eric@igalia.com> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17466>	2022-07-13 14:41:59 +00:00
Gert Wollny	643623e1a3	r600/sfn: emulate pmr::monotonic_buffer_resource if needed libc++ does not yet implement the c++17 monotonic_buffer_resource, so emulate it by doing normal allocations that are cleaned up when the resource is destroyed. v2: - Use C include and version without namespace aligned_alloc - add include for sstream needed with clang++ (maurossi) Closes: #6836 Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Filip Gawin <filip@gawin.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17452>	2022-07-13 13:32:33 +00:00
Gert Wollny	3340c7ce35	r600/sfn: lower CLIPVERTEX to clip planes With that most piglits for compatibility contexts are passing, so enable higher compatibility profile support. v2: fix formatting ahd fallthrough tag (Filip) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Filip Gawin <filip@gawin.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17484>	2022-07-13 15:17:17 +02:00
Gert Wollny	19ba29d996	r600/sfn: Add support for fdph Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Filip Gawin <filip@gawin.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17484>	2022-07-13 15:10:39 +02:00
Gert Wollny	0b0a04635b	r600/sfn: Never consider an op with register dest as dead Another hot-fix: when a local register is written to, it is actually unlikely that the value is never used, so just make sure that this is never done. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Filip Gawin <filip@gawin.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17484>	2022-07-13 15:10:34 +02:00
Gert Wollny	8222840e3f	r600: limit loops when trying to merge alu groups On Cayman bank_swizzle[4] is never counted up, so add an additional condition to make sure the loop is finished at one point. This is a hot-fix, the logic below should be improved. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Filip Gawin <filip@gawin.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17484>	2022-07-13 15:10:28 +02:00
Matt Coster	6165701b2e	pvr: Implicitly assert that the correct sub-command type is present Now that we have separate C types for the different sub-command types, we can require a pointer to that type to be passed into functions which expect the current sub-command to be of a specific type. Signed-off-by: Matt Coster <matt.coster@imgtec.com> Reviewed-by: Rajnesh Kanwal <rajnesh.kanwal@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17458>	2022-07-13 12:30:10 +01:00
Matt Coster	b9d6ed445d	pvr: Split out unioned structs from struct pvr_sub_cmd This is a simple optimization to make type-specific uses of struct pvr_sub_cmd slightly less verbose. Signed-off-by: Matt Coster <matt.coster@imgtec.com> Reviewed-by: Rajnesh Kanwal <rajnesh.kanwal@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17458>	2022-07-13 12:28:05 +01:00
Rajnesh Kanwal	1df16b5f22	pvr: Implement vkCmdDraw API. Signed-off-by: Rajnesh Kanwal <rajnesh.kanwal@imgtec.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17487>	2022-07-13 10:35:10 +00:00
Iago Toral Quiroga	40976356f2	v3d,v3dv: stop copying and pasting the translate_swizzle helper Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17509>	2022-07-13 10:09:34 +00:00
Iago Toral Quiroga	8d8491df5e	v3d: stop using a smaller texture limit in OpenGL The compiler has improved significantly since we found this issue and this is no longer required. Notice that because we are increasing the number of samplers supported beyond what we can loop unroll (currently capped at 16), some piglit tests that test the maximum number of samplers supported start to fail because they use indirect indexing on a sampler array and we don't support that (previously the indirect indexing was removed by loop unrolling). This is a bug in tests which the GLSL linker detects, failing to compile the shaders. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17509>	2022-07-13 10:09:34 +00:00
Iago Toral Quiroga	9b74f4218f	v3d,v3dv: stop hardcoding various image limits Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17509>	2022-07-13 10:09:34 +00:00
Iago Toral Quiroga	25fc388d7e	v3dv: clean up get_internal_type_bpp_for_image_aspects Also, remove the FIXME to pre-compute this in images. We only use this helper from copy/clear operations where we may be working with a compatible framebuffer format instead of the original image. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17509>	2022-07-13 10:09:34 +00:00
newbluemoon	6a4fc6f6ca	nine: replace ulimit with sysconf call __UL_GETOPENMAX seems to be glibc specific and not portable. In glibc’s sysdeps/posix/ulimit.c it is assigned the return value of sysconf(_SC_OPEN_MAX). So use the latter in the first place. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5176 CC: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17471>	2022-07-13 08:34:18 +00:00
Chuansheng Liu	39f8c61f32	iris,anv: correct the max thread number for DG2+ Correct the max thread number for DG2+ platforms according to below bspec. Ref: Bspec: 47202 Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Chuansheng Liu <chuansheng.liu@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17506>	2022-07-13 08:11:19 +00:00
Georg Lehmann	aac8ddae2f	nir/opt_algebraic: Optimize [ui](add\|sub)_sat with 0. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17468>	2022-07-13 07:34:09 +00:00
Georg Lehmann	90a8fb0355	nir/lower_io: Fix array length of buffers larger than INT32_MAX. Before, if the ssbo is too large this would always return 0. Also, this code is easier to optimize, so the common case of offset 0 and pot stride results in one ushr instead of 5+ instructions. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17468>	2022-07-13 07:34:09 +00:00
Georg Lehmann	d9fb1b05eb	ir3: Implement [iu]sub_sat. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17468>	2022-07-13 07:34:09 +00:00
Georg Lehmann	9a83ccf1fa	r600: Lower uadd_sat/usub_sat. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17468>	2022-07-13 07:34:09 +00:00
Georg Lehmann	e09e04a2c0	zink: Lower uadd_sat/usub_sat. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17468>	2022-07-13 07:34:09 +00:00
Georg Lehmann	d472c45810	nir_to_tgsi: Lower uadd_sat/usub_sat. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17468>	2022-07-13 07:34:09 +00:00
Panagiotis Apostolou	6f0aba42ad	util: Don't block SIGSEGV for new threads SIGSEGV is used by Vulkan API trace layers to track user changes in device memory mapped to user space. Now with drivers such as Zink, GLES applications are translated into Vulkan API calls and therefore it is possible to be tracked by Vulkan api trace layers. Blocking SIGSEGV hinders one of the memory tracking mechanisms used by such layers. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17273>	2022-07-13 06:51:27 +00:00
Iago Toral Quiroga	1442861141	v3dv: fix comment for point_sprite_mask filed in shader key Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17486>	2022-07-13 05:20:31 +00:00
Lionel Landwerlin	3a8ad28524	anv: skip flush/invalidate faster Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17481>	2022-07-13 01:33:27 +00:00
Lionel Landwerlin	1aeb11cde1	intel: protect against empty invalidate ranges It's legal for an application to call vkInvalidateMappedMemoryRanges() / vkFlushMappedMemoryRanges() with zero sized ranges. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `b91971c240` ("anv: use the right helper to invalidate memory") Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6852 Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17481>	2022-07-13 01:33:27 +00:00
Dave Airlie	105279e989	radv: add a dynamic vertex format cache. With dynamic vertex bindings the vertex format lookups are a lot more frequent (vs being baked in the pipeline). Add a simple lookup cache using a dynamic array to keep track of the hw values, and avoid repeated translation. This also reduces the memset to just the bitfields since all the others will be overwritten. Seen in perf traces gputest gimark with zink on radv. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15846>	2022-07-13 01:10:09 +00:00
Lionel Landwerlin	af1ecbeb0a	anv: add a comment about handling buffer view swizzles on gfx7 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17385>	2022-07-13 00:45:36 +00:00
Lionel Landwerlin	a9edc268b9	anv: validate image view lowered storage formats for storage Ensure that if we have swizzle on the initial format, that the component bits are identical with the lowered format. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17385>	2022-07-13 00:45:36 +00:00
Lionel Landwerlin	57a8efa222	anv: deal with isl format swizzles for buffer views For some formats like VK_FORMAT_B5G6R5_UNORM_PACK16, we have no direct matching HW format. We can support it by swizzling. We already apply those swizzles for image views. We just forgot to deal with buffer views. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6235 Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17385>	2022-07-13 00:45:36 +00:00
Danylo Piliaiev	bfd3c43aa9	freedreno: Add FD_GPU_TRACEPOINT envvar to toggle tracepoints All tracepoints are enabled by default. Example: FD_GPU_TRACEPOINT=-flush_batch Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16781>	2022-07-12 22:24:19 +00:00
Danylo Piliaiev	7a62219d33	freedreno: Refactor tracepoints generation to reduce duplication This way we will not need to repeat arguments for "start" and "end" tracepoints. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16781>	2022-07-12 22:24:19 +00:00
Danylo Piliaiev	82e79f6cdf	freedreno: Add the rest of tracepoints with start/end to perfetto Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16781>	2022-07-12 22:24:19 +00:00
Danylo Piliaiev	b059cdad40	turnip: Add TU_GPU_TRACEPOINT envvar to toggle tracepoints All tracepoints are enabled by default. Example: TU_GPU_TRACEPOINT=-sysmem_clear Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16781>	2022-07-12 22:24:19 +00:00
Danylo Piliaiev	d903c6c7f3	turnip: Refactor tracepoints generation to reduce duplication This way we will not need to repeat arguments for "start" and "end" tracepoints. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16781>	2022-07-12 22:24:19 +00:00
Danylo Piliaiev	8d34cc2471	util/u_trace: Fix iteration over config_control Fixes: `e1811af75d` ("util/perf: add options to enable/disable tracepoints") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16781>	2022-07-12 22:24:19 +00:00

... 4 5 6 7 8 ...

145368 Commits