KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Boris Brezillon	370ca07482	nir/serialize: Support texop >= 16 Extend the packed_instr struct to support texops above nir_texop_fragment_fetch_amd. Fixes: `603e6ba972` ("nir: add two new texture ops for multisample fragment color/mask fetches") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17186>	2022-06-28 09:07:32 +00:00
Lionel Landwerlin	cf44282deb	clc: add new feature options for intel_clc Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17253>	2022-06-27 15:31:49 +00:00
Boris Brezillon	ee536ea633	nir/serialize: Put dest last in packed_instr::tex packed_instr::tex::dest must be last to match the packed_instr::any::dest position. Fixes: `35655865cb` ("nir/serialize: pack instructions better") Cc: stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17212>	2022-06-27 09:39:22 +00:00
Qiang Yu	fdf589321c	ac/nir: add nir_intrinsic_load_hs_out_patch_data_offset_amd Also add radv and radeonsi implementation. Will be used in tess lowering. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16705>	2022-06-27 02:38:21 +00:00
Jason Ekstrand	7c127ca018	nir/opt_memcpy: Add another case for function_temp Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> (1.5 years later) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13166>	2022-06-24 19:21:26 +00:00
Jason Ekstrand	dc85065944	nir: Add an options parameter to deref_instr_has_complex_use Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> (1.5 years later) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13166>	2022-06-24 19:21:26 +00:00
Jason Ekstrand	d6123460fd	nir/opt_memcpy: lower copies to/from tightly packed types v2: Add comment by Jason (Lionel) Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> (1.5 years later) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13166>	2022-06-24 19:21:26 +00:00
Ian Romanick	6689fa2ab4	nir/range_analysis: Teach range analysis about fdot opcodes This really, really helps on platforms where fabs() isn't free. A great many shaders use a * frsq(fabs(fdot(a, a))) to normalize a vector. Since the result of the fdot must be non-negative, the fabs can be eliminated by an existing algebraic rule. shader-db results: r300 (run on R420 - X800XL) total instructions in shared programs: 1369807 -> 1368550 (-0.09%) instructions in affected programs: 59986 -> 58729 (-2.10%) helped: 609 HURT: 0 total vinst in shared programs: 512899 -> 512861 (<.01%) vinst in affected programs: 1522 -> 1484 (-2.50%) helped: 36 HURT: 0 total sinst in shared programs: 260690 -> 260570 (-0.05%) sinst in affected programs: 1419 -> 1299 (-8.46%) helped: 120 HURT: 0 total consts in shared programs: 957295 -> 957230 (<.01%) consts in affected programs: 849 -> 784 (-7.66%) helped: 65 HURT: 0 LOST: 0 GAINED: 3 The 3 gained shaders are all vertex shaders from XCom: Enemy Unknown. I'm guessing that game is never going to run on my X800XL. :) i915 total instructions in shared programs: 791121 -> 780843 (-1.30%) instructions in affected programs: 220170 -> 209892 (-4.67%) helped: 2085 HURT: 0 total temps in shared programs: 47765 -> 47766 (<.01%) temps in affected programs: 9 -> 10 (11.11%) helped: 0 HURT: 1 total const in shared programs: 93048 -> 92983 (-0.07%) const in affected programs: 784 -> 719 (-8.29%) helped: 65 HURT: 0 LOST: 0 GAINED: 36 Haswell, Ivy Bridge, and Sandy Bridge had similar results. (Haswell shown) total instructions in shared programs: 16702250 -> 16697908 (-0.03%) instructions in affected programs: 119277 -> 114935 (-3.64%) helped: 1065 HURT: 0 helped stats (abs) min: 1 max: 20 x̄: 4.08 x̃: 4 helped stats (rel) min: 0.48% max: 10.17% x̄: 3.66% x̃: 3.94% 95% mean confidence interval for instructions value: -4.26 -3.89 95% mean confidence interval for instructions %-change: -3.76% -3.56% Instructions are helped. total cycles in shared programs: 880772068 -> 880734134 (<.01%) cycles in affected programs: 2134456 -> 2096522 (-1.78%) helped: 941 HURT: 324 helped stats (abs) min: 2 max: 2180 x̄: 123.06 x̃: 44 helped stats (rel) min: 0.04% max: 49.96% x̄: 7.08% x̃: 3.81% HURT stats (abs) min: 2 max: 2098 x̄: 240.33 x̃: 35 HURT stats (rel) min: 0.04% max: 77.07% x̄: 12.34% x̃: 3.00% 95% mean confidence interval for cycles value: -47.93 -12.04 95% mean confidence interval for cycles %-change: -2.87% -1.34% Cycles are helped. No shader-db changes on any other Intel platform. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17181>	2022-06-23 18:46:27 +00:00
Ian Romanick	fd1f2d3b5a	nir: Add and use algebraic property "is selection" There are several places that should have supported the various sized versions of bcsel and the various nir_op_[fi]csel_* opcodes. Rather than enumerate the whole list, add a property. v2: Make the comment for NIR_OP_IS_SELECTION more descriptive. Suggested by Jason. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17048>	2022-06-22 19:26:59 +00:00
Ian Romanick	a2a2fbc510	nir/algebraic: Fix NaN-unsafe fcsel patterns For example, the proof for this pattern (('bcsel', ('flt', 'a@32', 0), 'b@32', 'c@32'), ('fcsel_ge', a, c, b)), would be bcsel(a < 0, b, c) bcsel(!(a < 0), c, b) bcsel(a >= 0, c, b) fcsel_ge(a, c, b) However, !(a < 0) => (a >= 0) is well known to produce different results if `a` is NaN. Instead of that replacement, use this replacement: bcsel(a < 0, b, c) bcsel(-0 < -a, b, c) bcsel(0 < -a, b, c) fcsel_gt(-a, b, c) This is NaN-safe and exact. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Fixes: `0f5b3c37c5` ("nir: Add opcodes for fused comp + csel and optimizations") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17048>	2022-06-22 19:26:59 +00:00
Ian Romanick	ccd18ec4f3	nir: i32csel opcodes should compare with integer zero Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Noticed-by: Georg Lehmann <dadschoorse@gmail.com> Fixes: `0f5b3c37c5` ("nir: Add opcodes for fused comp + csel and optimizations") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17048>	2022-06-22 19:26:59 +00:00
Mike Blumenkrantz	4830cc77cb	nir/lower_point_size: apply point size clamping point size min/max values are provided through the state vars, so ensure these are always applied in order to respect ARB_point_parameters cc: mesa-stable Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17145>	2022-06-22 13:27:29 +00:00
Timur Kristóf	e5970fe22a	nir/lower_task_shader: don't use base index for shared memory intrinsics Intel backend doesn't handle them very well. Fixes: `8aff8d3dd4` ("nir: Add common task shader lowering to make the backend's job easier.") Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17143>	2022-06-22 10:32:13 +00:00
Marcin Ślusarz	49b8fffeed	nir/lower_task_shader: insert barrier before/after shared memory read/write Fixes: `8aff8d3dd4` ("nir: Add common task shader lowering to make the backend's job easier.") Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17143>	2022-06-22 10:32:13 +00:00
Pavel Asyutchenko	959b748038	glsl: add language support for GL_ARM_shader_framebuffer_fetch_depth_stencil This extension adds built-in variables gl_LastFragDepthARM and gl_LastFragStencilARM which can be implemented almost the same as gl_LastFragData from color fetch extension. Signed-off-by: Pavel Asyutchenko <sventeam@yandex.ru> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13979>	2022-06-22 04:32:44 +00:00
Marcin Ślusarz	97b53ad759	nir/opt_load_store_vectorize: handle task payloads Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17000>	2022-06-20 17:38:20 +00:00
Andres Calderon Jaramillo	a5d09d7a0a	nir: Account for YUV range. This patch expands on what commit `d8fdb8dab4` did. It adds support for YUV-to-RGB conversions depending on the range of the YUV samples. The conversion matrices and offsets are derived from https://gist.github.com/yohhoy/dafa5a47dade85d8b40625261af3776a. Tested-by: Andres Calderon Jaramillo <andrescj@chromium.org> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16651>	2022-06-17 17:25:44 +00:00
Christian Gmeiner	15f394cc7a	nir: Fix unused-variable compile warnings Fixes: `8492e78f9d` ("nir/deref: Handle SSBO array bindings specially") Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17021>	2022-06-15 19:43:27 +00:00
Boris Brezillon	cdbc8a8c85	spirv: Add a dst/src type to image deref loads/stores coming from OpAtomic{Load,Store} nir_to_dxil() uses those types to pick the right operation overload, and atomic loads/stores are no different from their non-atomic counterpart apart from the atomicity property, so it makes sense to pass a type to the deref_{load,store} intrinsic in that case too. Suggested-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16926>	2022-06-14 22:44:42 +00:00
Erik Faye-Lund	ec9d7872ac	glcpp: remove outdated msvc hack While MSVC doesn't do __STDC_VERSION__ correctly for C99, it does for C11, which is what we now require. So we can remove this hack. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16908>	2022-06-14 15:08:37 +00:00
Alejandro Piñeiro	481df13f27	nir: get res binding using component 0, instead of asssumig an uint Needed to be able to call nir_opt_gcm on the v3dv driver. This change is needed as on v3dv we honor vulkan resource index returning a vec2. See commit `21b0a4c80c` for more info. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16986>	2022-06-14 13:12:46 +00:00
Christian Gmeiner	d364d445ad	isaspec: Handle patterns bigger then 64 bit Currently uint64_t_to_bitmask(..) is used in combination with the pattern 'match'. This only works for values smaller then 64 bit. Add support for bigger isa sizes. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16996>	2022-06-14 12:35:39 +00:00
Christian Gmeiner	119d8488dd	isaspec: Extend split_bits(..) to accept a bitsize Make split_bits(..) more generic. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16996>	2022-06-14 12:35:39 +00:00
Boris Brezillon	d9ec7df2f4	nir: Fix flat new_var assignment in create_new_io_vars() If the type is not an array, glsl_get_length() returns 0 and we don't update the new_vars[]/flat_vars[] entries. Fixes: `bcd14756ee` ("nir/lower_io_to_vector: add flat mode") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16960>	2022-06-10 08:06:46 +00:00
Emma Anholt	cf265c6606	nir: Rename is_arb_asm to use_legacy_math_rules and document its meaning. On iris and crocus, this flag is used to set "alt mode" math on the shader as a whole. Some other drivers have a similar mode for DX9/ARB-program behavior, so document what it does so we can start using it. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16176>	2022-06-10 03:26:32 +00:00
Jason Ekstrand	133620196d	compiler/types: Don't place members in the previous substruct's end padding With the following structures : struct StructA { uint64_t value0; uint8_t value1; }; struct TopStruct { struct StructA a; uint8_t value3; }; Currently offsetof(struct TopStruct, value3) = 9. While the same code on the CPU gives offsetof(struct TopStruct, value3) = 16. This is impacting OpenCL kernels we're trying to use to build acceleration structures. v2: Add comment/link to some description of the alignment/size computation Cc: mesa-stable Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16940>	2022-06-09 22:55:37 +00:00
Konstantin Seurer	08577bbb70	nir/nir_lower_io: Optimize 32-bit inbounds access Perform address calculation in 32 bits when dealing with inbounds array derefs. Closes: #6562 Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16729>	2022-06-09 16:20:16 +00:00
Konstantin Seurer	f19cbe98e3	nir,spirv: Preserve inbounds access information Preserving information about inbounds access and the required bit size for the bounds will help with avoiding 64-bit operations when lowering io. Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16729>	2022-06-09 16:20:16 +00:00
Timothy Arceri	893b4d98f8	glsl: inline do_common_optimization() call The function was previously a helper for when some drivers still called the GLSL IR optimisations in a loop. No drivers do that anymore. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16924>	2022-06-08 22:58:50 +00:00
Timothy Arceri	d09a37ef54	glsl: remove never true do_dead_code() parameter Since we have now switched all drivers to using NIR and therefore the NIR based uniform linker this param never needs to be set to true so remove it. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16924>	2022-06-08 22:58:50 +00:00
Jason Ekstrand	4655ff1f5b	nir/deref: Handle RESTRICT for SSBO deref bindings Tested-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: M Henning <drawoc@darkrefraction.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Cc: mesa-stable@lists.freedesktop.org Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16894>	2022-06-08 21:30:59 +00:00
Jason Ekstrand	8492e78f9d	nir/deref: Handle SSBO array bindings specially Instead of just checking for the variables to match, check that the entire deref up to the interface type matches. Tested-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: M Henning <drawoc@darkrefraction.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Cc: mesa-stable@lists.freedesktop.org Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16894>	2022-06-08 21:30:59 +00:00
Jason Ekstrand	a5b1274275	nir/vars_tests: Use nir_var_mem_global instead of ssbo We're about to add a bunch of SSBO special cases which will depend on SSBOs always being either structs or arrays of structs. All those little vector SSBOs we're creating will no longer be valid. Switch to nir_var_mem_global to avoid this. Cc: mesa-stable@lists.freedesktop.org Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16894>	2022-06-08 21:30:59 +00:00
Jason Ekstrand	2d221c64e7	nir: Increase nir_variable_data::mode to 16 bits This is required if we want to have variables with nir_var_mem_global which we will for CL eventually. Also, they're useful in unit tests because they're the most generic thing imaginable and can't get eliminated by normal means. Cc: mesa-stable@lists.freedesktop.org Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16894>	2022-06-08 21:30:59 +00:00
Jason Ekstrand	0ad2dfe942	nir/deref: Re-arrange variable checks in compare_deref_paths Instead of having a bunch of mode checks as special cases, assert that the modes equal and then switch on the mode. This should make the special cases a bit easier to understand. Handling of `a_var == b_var` looks redundant now but it won't be in the next patch. Tested-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: M Henning <drawoc@darkrefraction.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Cc: mesa-stable@lists.freedesktop.org Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16894>	2022-06-08 21:30:59 +00:00
Jason Ekstrand	130d9d80db	nir/deref: Make compare_deref_paths take a stop callback This will let us use it to compare only the first part of a pair of deref paths and continue the comparison later. Tested-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: M Henning <drawoc@darkrefraction.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Cc: mesa-stable@lists.freedesktop.org Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16894>	2022-06-08 21:30:59 +00:00
Jason Ekstrand	7ebcdada00	nir/deref: Use an index in compare_deref_paths Instead of incrementing pointers, use an integer index. This makes it clear that we always increment them together. It'll also make the next change a bit easier. We use a pointer to an integer because the next patch is going to let us abort the walk and we want to be able to continue where we left off. Tested-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: M Henning <drawoc@darkrefraction.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Cc: mesa-stable@lists.freedesktop.org Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16894>	2022-06-08 21:30:59 +00:00
Jason Ekstrand	4d80b3217e	nir/deref: Break out a helper for compare_deref_paths Tested-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: M Henning <drawoc@darkrefraction.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Cc: mesa-stable@lists.freedesktop.org Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16894>	2022-06-08 21:30:59 +00:00
Rhys Perry	cb5c1bcb7c	nir/deref: stop assuming coherent accesses of different SSBOs may alias Whether it's coherent should be irrelevant and the ACCESS_RESTRICT check above should consider all cases aliasing unless NIR makes it clear they're not. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Tested-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: M Henning <drawoc@darkrefraction.com> Cc: mesa-stable@lists.freedesktop.org Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16894>	2022-06-08 21:30:59 +00:00
Georg Lehmann	1b68d3e43a	nir/lower_tex: Add lower_array_layer_round_even option. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16871>	2022-06-08 20:57:22 +00:00
Konstantin Seurer	16585664cd	radv: vkCmdTraceRaysIndirect2KHR This changes the trace rays logic to always use VkTraceRaysIndirectCommand2KHR and implements vkCmdTraceRaysIndirect2KHR. I renamed the load_sbt_amd to sbt_base_amd and moved the SBT load lowering from ACO to NIR. Note that we can not just upload one pointer to all the trace parameters because that would be incompatible with traceRaysIndirect. Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16430>	2022-06-08 20:20:21 +00:00
Konstantin Seurer	3aa0ea8279	nir: Handle ray_launch_size_addr in opt_preamble Found this while working on traceRaysIndirect2. I don't think this is relevant for now at least since we don't use the pass in RADV. Fixes: `938c9d9` ("nir: Add a ray launch size addr intrinsic") Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16430>	2022-06-08 20:20:21 +00:00
Konstantin Seurer	2e0e150e69	spirv: Add plumbing for ray_cull_mask Add a new cull_mask system value that is exposed by the ray_cull_mask capability of SPV_KHR_ray_cull_mask. Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16430>	2022-06-08 20:20:21 +00:00
Konstantin Seurer	7c44cb6f5e	vulkan: Spec update to 1.3.216 Update the vulkan headers and xml to 1.3.216 including the spirv headers. The new spirv spec added new OpAlias*INTEL ops, which we ignore in vtn_gather_types_c.py. Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16430>	2022-06-08 20:20:21 +00:00
Timur Kristóf	02c87e66e9	nir: Introduce new intrinsics for AMD specific mesh shader task ring. The mesh shader task ring is a buffer in VRAM which we will use to store some mesh shader outputs that don't fit into LDS. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16737>	2022-06-08 08:43:51 +00:00
Emma Anholt	ab3a1d41c5	glsl: Drop INT_DIV_TO_MUL_RCP lowering. nir_lower_int_to_float() does this at the end of compilation, no need to do it up front. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16823>	2022-06-07 02:38:42 +00:00
Emma Anholt	5c499d6d1a	nir: Fix idiv lowering on !NativeIntegers when lower_fdiv is also set. Avoids a regression when turning off GLSL's int div lowering. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16823>	2022-06-07 02:38:42 +00:00
Emma Anholt	464b32c030	glsl: Drop the div-to-mul-rcp lowering for floats. NIR has fdiv, and all the NIR backends have to have lower_fdiv set appropriately already since various passes (format conversions, tgsi_to_nir, nir_fast_normalize(), etc.) might generate one. This causes softpipe and llvmpipe to now do actual divides, since lower_fdiv is not set there. Note that llvmpipe's rcp implementation is a divide of 1.0 by x, so now we're going to be just doing div(x, y) instead of mul(x, div(1.0, y)). Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16823>	2022-06-07 02:38:42 +00:00
Emma Anholt	0fbd1b1f4c	glsl: Move exp/log-to-exp2/log2 lowering to glsl-to-NIR. It's way more concise to write as nir_builder calls. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16823>	2022-06-07 02:38:42 +00:00
Emma Anholt	d024eb6fab	glsl: Remove stale lower_instructions comments. Should have been in `3a42e92a4f` ("glsl: Drop the dead MOD_TO_FLOOR path.") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16823>	2022-06-07 02:38:42 +00:00
Emma Anholt	8c4b88ee48	gallium+glsl: Remove EmitNoSat/PIPE_CAP_VERTEX_SHADER_SATURATE The drivers not setting it were: - nv30, which gets lowering using NIR's lower_fsat flag. - r300, which gets lowering using NIR's lower_fsat flag. - a2xx, which has was getting it optimized back to fsat anyway. This drops the check for the cap from gallium nine. While nine does have a non-nir path, I think it's safe to assume that if you have SM3 texturing, you can do fsat. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16823>	2022-06-07 02:38:42 +00:00
Timur Kristóf	f7f2770e72	ac/nir: Add remappability to tess and ESGS I/O lowering passes. This will be used for radeonsi to map common I/O location to fixed slots agreed by different shader stages. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16418>	2022-06-07 01:40:14 +00:00
Qiang Yu	33b4b923ee	nir: add nir_intrinsic_load_lshs_vertex_stride_amd For loading LS-HS vertex stride by shader argument in radeonsi. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16418>	2022-06-07 01:40:14 +00:00
Timothy Arceri	4237932685	glsl: tidy up link_varyings_and_uniforms() All uniform linking is now done via nir based linker not via this code so we drop that from its name. We also drop a bunch of unused parameters. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16880>	2022-06-07 01:11:19 +00:00
Timothy Arceri	f00be793e4	glsl: drop extra optimise swizzles call As per the comment this was meant to tidy things up after varying linking but varying linking has been moved into a nir based linker so this extra call is no longer needed. This optimisation pass is still called in the regular glsl ir optimisation loop. No shader-db change on Iris (BDW). Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16880>	2022-06-07 01:11:19 +00:00
Qiang Yu	19f3737262	mesa: pass select result buffer offset as attribute/varying Will be used by geometry shader to store hit result. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15765>	2022-06-06 18:23:49 +00:00
Qiang Yu	ff8ae4e589	nir/builder: add load/store array variable helper functions Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15765>	2022-06-06 18:23:49 +00:00
Mike Blumenkrantz	06859ba69c	mesa: handle atomic counter lowering for drivers with big ssbo offset aligns according to the spec, atomic counters can be bound at any offset divisible by 4, which means that any driver that uses the ssbo lowering pass and doesn't have a min offset align of 4 is potentially broken to handle this, use a statevar to inject the misaligned remainder of the offset into the shader as a uniform. for well-aligned counter binds, the uniform offset will be 0 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16749>	2022-06-05 23:16:36 +00:00
Vinson Lee	3e679219a1	clc: Fix build with llvm-15. opencl_c_h is defined only for llvm < 15. Fixes: `bcc2df4890` ("clc: speed up compilation by not relying on opencl-c.h") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16808>	2022-06-04 22:27:55 -07:00
Timothy Arceri	5aec67a1e1	glsl: remove the now unused GLSL IR loop unrolling code This code was slow, buggy and hard to understand. All drivers have now switched to using the NIR unrolling code \o/ Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16366>	2022-06-04 16:11:49 +00:00
Alyssa Rosenzweig	dc2d8a643f	nir: Export nir_io_add_intrinsic_xfb_info This is useful for drivers which wish to consume XFB information. These hopefully-uncontroversial hunks are extracted from the much more controversial "st,nir,radeons: Move nir_lower_io_passes to si_nir_lower_io" by Jason. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15720>	2022-06-04 14:35:56 +00:00
Alyssa Rosenzweig	5c79d649af	nir: Add transform feedback system values These will be used to facilitate transform feedback lowering for Panfrost, although other backends could use the sysvals in the future. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15720>	2022-06-04 14:35:56 +00:00
Timothy Arceri	87aaa0f915	glsl: remove now unused lower_const_arrays_to_uniforms() We now use a NIR version instead. Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16770>	2022-06-04 03:13:36 +00:00
Timothy Arceri	c573260c9b	glsl: switch to NIR based implementation of lower_const_arrays_to_uniforms() Shader-db results iris (BDW): total instructions in shared programs: 17523543 -> 17513909 (-0.05%) instructions in affected programs: 218091 -> 208457 (-4.42%) helped: 69 HURT: 327 helped stats (abs) min: 2 max: 2919 x̄: 160.84 x̃: 12 helped stats (rel) min: 0.21% max: 96.88% x̄: 14.87% x̃: 6.40% HURT stats (abs) min: 1 max: 47 x̄: 4.48 x̃: 1 HURT stats (rel) min: 0.10% max: 22.02% x̄: 3.33% x̃: 0.18% 95% mean confidence interval for instructions value: -45.02 -3.63 95% mean confidence interval for instructions %-change: -1.16% 1.47% Inconclusive result (%-change mean confidence interval includes 0). total loops in shared programs: 4875 -> 4868 (-0.14%) loops in affected programs: 7 -> 0 helped: 7 HURT: 0 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 100.00% max: 100.00% x̄: 100.00% x̃: 100.00% 95% mean confidence interval for loops value: -1.00 -1.00 95% mean confidence interval for loops %-change: -100.00% -100.00% Loops are helped. total cycles in shared programs: 858032406 -> 857984712 (<.01%) cycles in affected programs: 22940290 -> 22892596 (-0.21%) helped: 155 HURT: 312 helped stats (abs) min: 1 max: 49696 x̄: 1697.70 x̃: 62 helped stats (rel) min: <.01% max: 70.84% x̄: 5.60% x̃: 0.82% HURT stats (abs) min: 1 max: 19640 x̄: 690.54 x̃: 100 HURT stats (rel) min: <.01% max: 217.23% x̄: 33.57% x̃: 0.92% 95% mean confidence interval for cycles value: -436.09 231.84 95% mean confidence interval for cycles %-change: 15.39% 25.75% Inconclusive result (value mean confidence interval includes 0). total spills in shared programs: 16289 -> 15205 (-6.65%) spills in affected programs: 2753 -> 1669 (-39.38%) helped: 9 HURT: 1 total fills in shared programs: 20347 -> 20324 (-0.11%) fills in affected programs: 1642 -> 1619 (-1.40%) helped: 9 HURT: 1 total sends in shared programs: 972151 -> 971960 (-0.02%) sends in affected programs: 1910 -> 1719 (-10.00%) helped: 25 HURT: 20 helped stats (abs) min: 1 max: 50 x̄: 9.00 x̃: 2 helped stats (rel) min: 0.87% max: 53.76% x̄: 13.89% x̃: 6.25% HURT stats (abs) min: 1 max: 8 x̄: 1.70 x̃: 1 HURT stats (rel) min: 8.33% max: 200.00% x̄: 52.36% x̃: 33.33% 95% mean confidence interval for sends value: -8.19 -0.29 95% mean confidence interval for sends %-change: -1.07% 32.18% Inconclusive result (%-change mean confidence interval includes 0). LOST: 3 GAINED: 27 Note a small number of tests fail on lima and r300 after this patch. However since we are doing the correct thing here and they only fail due to a slight increase in instruction count pushing them over their instruction count limit, we are defering that issue to a different bug report for further discussion. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6540 Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16770>	2022-06-04 03:13:36 +00:00
Timothy Arceri	1805ee8d7b	glsl: move gl_nir_link_opts() call out of the st code Calling this directly in the linker code allows us to place it between the varying linker and uniform linker calls which allows for better optimisation/removal of uniforms. Also in a later patch it allows us to insert a new nir based lower_const_arrays_to_uniforms() call after the gl_nir_link_opts() call. This is important because it allows the linking opts to move constant arrays to later stages if possible before lower_const_arrays_to_uniforms() turns them into uniforms. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6541 Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16770>	2022-06-04 03:13:36 +00:00
Timothy Arceri	a14e2733ce	glsl: move common link time optimisation calls to linker code In the following patch we will move the users of this function to this file too and make it static again. Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16770>	2022-06-04 03:13:36 +00:00
Timothy Arceri	64dbc3f03a	glsl/nir: allow the nir linker to remove dead uniforms we created Some backends lower constant arrays to uniforms in GLSL IR. These create so called hidden uniforms. Since we know these are added per stage it is safe to remove them if we detect they are dead. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16770>	2022-06-04 03:13:36 +00:00
Timothy Arceri	4488b577a1	glsl/nir: skip adding hidden uniforms to the remap tables The remap tables are used with the GL API so there is no need to add hidden uniforms to them. Also when we switch to lowering some constant arrays to uniforms in NIR in a following patch there will no longer be enough room in the tables as we assign their size in the GLSL IR linker not the NIR linker currently. Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16770>	2022-06-04 03:13:36 +00:00
Timothy Arceri	44d6068c5b	nir: add nir based version of the lower_const_arrays_to_uniforms pass Doing this in NIR should give better results, but also allows us to stop calling more GLSL IR optimisations passes. v2: Skip 8bit and 16bit type that would require further processing I believe this is an existing bug in the GLSL IR pass also. v3: rebuild constant initialisers as we want to call this pass after nir has already lowered them and performed optimisations. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> (v1) Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16770>	2022-06-04 03:13:36 +00:00
Daniel Schürmann	b56fcefa0f	nir/opt_vectorize: refactor src rewriting to avoid unnecessary mov instructions Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15647>	2022-06-03 08:53:18 +00:00
Danylo Piliaiev	eb5f4c2f6b	spirv: Workaround for RelaxedPrecision on OpLogical* in 3DMark Per spec RelaxedPrecision cannot be applied to bool types, however 3DMark Wild Life does it: OpDecorate %171 RelaxedPrecision ... %171 = OpLogicalAnd %bool %169 %170 Fixes crash in 3DMark Wild Life on Android. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16746>	2022-06-03 07:50:53 +00:00
Jason Ekstrand	d8df87056c	nir: xfb_buffer_info::stride is in bytes For the NIR XFB gathering as well as all the Vulkan drivers, buffer strides in nir_xfb_info are in bytes. When Marek started using nir_xfb_info for GLSL on radeonsi, he copied directly from the GLSL struct which has strides in dwords. This inconsistency didn't show up until I went through and started us using the NIR passes for GL drivers directly without going through the GLSL structs. We could change the nir_xfb_buffer_info field to be in dwords to be consistent with shader_info but that would mean changing all the Vulkan drivers but, for now, it's easier to always use bytes in nir_xfb_info. Fixes: `2a22885a45` ("st,nir: Use nir_shader::xfb_info in nir_lower_io_passes") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16819>	2022-06-02 14:06:31 +00:00
Jason Ekstrand	7c876a6b2f	nir/glsl: Use rzalloc for nir_xfb_info A lot of the fields get fully overwritten but outputs/buffers_written are both bitfields that we set one bit at a time. Fixes: `7c5dc0b11a` ("glsl/nir: Populate nir_shader::xfb_info after linking varyings") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16819>	2022-06-02 14:06:31 +00:00
Erik Faye-Lund	18246ed06a	include: drop c99_math.h Since we now depend on C11, we know that we have support for the C99 math functionality. So let's drop the c99_math.h compatibility wrapper, and just include <math.h> directly. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16812>	2022-06-02 13:09:16 +00:00
Emma Anholt	6e087f96c9	nir_lower_mediump: Drop assertion about not containing movs. A 1D texture operation may need to do a mov to turn a reference to a channel of an SSA value into a scalar value to be passed as the texture coordinate (since texture srcs can't do swizzles). Seen in amnesia-the-dark-descent/low/46.shader_test() for example, where a 1D texture is used to remap each of r,g,b from a previous texture result. Besides, the nir_op_is_vec() case will (perhaps surprisingly) look through a mov, anyway. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16616>	2022-06-01 22:19:44 +00:00
Georg Lehmann	bfc25d6ec9	nir: Add optional lowering for mul_32x16. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13895>	2022-06-01 17:09:25 +00:00
Daniel Schürmann	be01e8711b	nir: introduce new nir_alu_alu_width() with nir_vectorize_cb callback This function allows to only scalarize instructions down to a desired vectorization width. nir_lower_alu_to_scalar() was changed to use the new function with a width of 1. Swizzles outside vectorization width are considered and reduce the target width. This prevents ending up with code like vec2 16 ssa_2 = iadd ssa_0.xz, ssa_1.xz which requires to emit shuffle code in backends and usually is not beneficial. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13080>	2022-06-01 11:41:44 +00:00
Daniel Schürmann	bd151a256e	nir/opt_vectorize: add callback for max vectorization width The callback allows to request different vectorization factors per instruction depending on e.g. bitsize or opcode. This patch also removes using the vectorize_vec2_16bit option from nir_opt_vectorize(). Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13080>	2022-06-01 11:41:44 +00:00
Emma Anholt	7472bb4bad	glsl,nir: Move i/umulExtended lowering to NIR. NIR already has the necessary lowering, and the GLSL lowering violates GLSL IR validation rules. Once quadop lowering was turned off, the IR validation at the end of the compile path on DEBUG builds caught the problem. In order to move the lowering to NIR, though, we need to make sure that drivers supporting these functions actually have the lowering flag set. xfails added for t860, where apparently this tickles a variety of existing 64-bit bugs in the backend. Fixes: #6461 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Mykhailo Skorokhodov <mykhailo.skorokhodov@globallogic.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16437>	2022-06-01 10:56:35 +00:00
Lionel Landwerlin	5078b4fff1	nir/divergence: handle load_ray_num_dss_rt_stacks_intel Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16797>	2022-06-01 04:58:50 +00:00
Lionel Landwerlin	d3c1b0ac28	nir/divergence: handle load_scratch_base_ptr v2: divergent (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16797>	2022-06-01 04:58:50 +00:00
Jason Ekstrand	2a22885a45	st,nir: Use nir_shader::xfb_info in nir_lower_io_passes Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Jason Ekstrand	16b0719441	glsl/nir: Stash the xfb_info in the nir_shader when linking XFB This pass is used for shaders coming in from SPIR-V. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Jason Ekstrand	36d8a2f1d7	glsl/nir: Stop leaking varyings_info Fixes: `34b3b92bbe` ("nir/xfb: move varyings info out of nir_xfb_info") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Jason Ekstrand	7c5dc0b11a	glsl/nir: Populate nir_shader::xfb_info after linking varyings Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Jason Ekstrand	64cc35d2ac	nir: Drop nir_shader_get_xfb_info Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Jason Ekstrand	23b55dcff4	nir: Add a nir_xfb_info to nir_shader We want to be able to carry this along with the shader instead of always having to re-generate it from scratch. A new nir_gather_xfb_info() helper is also added which, instead of returning it, adds it to the shader. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Jason Ekstrand	3e04432b3a	nir: Rename nir_gather_xfb_info to nir_shader_get_xfb_info Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Jesse Natalie	f812cc0fe6	nir: Consider PNTC to be a varying Fixes: `3528dcdf` ("nir: add nir_io_semantics::no_varying, no_sysval_output, and helpers") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6091 Reviewed-by: Marek Olák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16761>	2022-05-31 20:51:22 +00:00
Jesse Natalie	f61788d7d3	nir_lower_task_shader: Fix return from lower_task_intrin (bool, not void*) Fixes: `8aff8d3d` ("nir: Add common task shader lowering to make the backend's job easier.") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16756>	2022-05-31 18:32:59 +00:00
Jason Ekstrand	eb0d571ce4	nir: Add a correctness note for nir_lower_phis_to_regs_block Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16111>	2022-05-31 14:12:21 +00:00
Jason Ekstrand	4a4d6cdc80	nir: Handle register sources in lower_phis_to_regs_block During certain control-flow manipulation passes, we go out-of-SSA temporarily in certain areas of the code to make control-flow manipulation easier. This can result in registers being in phi sources temporarily. If two sub-passes run before we get a chance to do clean-up, we can end up doing some out-of-SSA and then a bit more out-of-SSA and trigger this case. It's easy enough to handle. Fixes: `a620f66872` ("nir: Add a couple quick-and-dirty out-of-SSA helpers") Fixes: `79a987ad2a` ("nir/opt_if: also merge break statements with ones after the branch") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6370 Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16111>	2022-05-31 14:12:21 +00:00
Karol Herbst	9ff04985b9	nir/gce: pin call instructions Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16202>	2022-05-31 12:36:48 +00:00
Karol Herbst	ad34d81c48	nir/gather_info: allow to run it before inlining Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16202>	2022-05-31 12:36:48 +00:00
Timothy Arceri	00313effdb	nir/gcm: fix pushing instructions into if blocks The previous logic would just set the block to the instructions original location if we couldn't evict it from a loop. For now we only push const loads to a later block inside ifs but we can add more heuristics later. This change helps a hand full of shaders but also stops a CTS regression caused by excess spilling after a series I'm working on to disable more of the GLSL IR optimisation passes. Shader-db results iris (BDW): total instructions in shared programs: 17529759 -> 17529749 (<.01%) instructions in affected programs: 15929 -> 15919 (-0.06%) helped: 5 HURT: 2 helped stats (abs) min: 1 max: 5 x̄: 2.40 x̃: 2 helped stats (rel) min: 0.06% max: 0.15% x̄: 0.11% x̃: 0.12% HURT stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 HURT stats (rel) min: 0.06% max: 0.06% x̄: 0.06% x̃: 0.06% 95% mean confidence interval for instructions value: -3.34 0.49 95% mean confidence interval for instructions %-change: -0.14% 0.02% Inconclusive result (value mean confidence interval includes 0). total cycles in shared programs: 861109994 -> 861099681 (<.01%) cycles in affected programs: 7027698 -> 7017385 (-0.15%) helped: 95 HURT: 72 helped stats (abs) min: 1 max: 7995 x̄: 138.54 x̃: 9 helped stats (rel) min: <.01% max: 15.96% x̄: 0.54% x̃: 0.11% HURT stats (abs) min: 1 max: 474 x̄: 39.56 x̃: 12 HURT stats (rel) min: <.01% max: 1.17% x̄: 0.20% x̃: 0.11% 95% mean confidence interval for cycles value: -159.05 35.54 95% mean confidence interval for cycles %-change: -0.45% 0.01% Inconclusive result (value mean confidence interval includes 0). total spills in shared programs: 17606 -> 17605 (<.01%) spills in affected programs: 323 -> 322 (-0.31%) helped: 1 HURT: 0 total fills in shared programs: 22599 -> 22598 (<.01%) fills in affected programs: 1348 -> 1347 (-0.07%) helped: 1 HURT: 0 Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14940>	2022-05-31 01:03:43 +00:00
Mike Blumenkrantz	3394e81eb1	vtn: assert that composite members have the same bit size as the result Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16667>	2022-05-27 14:06:32 +00:00
Mike Blumenkrantz	54e1072ff6	vtn: assert that vector shuffle indices are in-bounds Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16667>	2022-05-27 14:06:32 +00:00
Timur Kristóf	112a856813	nir: Keep track of cross-invocation mesh shader output access. On some implementations eg. AMD RDNA2 the driver can generate a more optimal code path knowing whether outputs are indexed using the local invocation index or not. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16736>	2022-05-27 11:22:07 +00:00
Timur Kristóf	8aff8d3dd4	nir: Add common task shader lowering to make the backend's job easier. 1. Lowers NV_mesh_shader TASK_COUNT output to launch_mesh_workgroups. 2. Removes all code after launch_mesh_workgroups, enforcing the fact that it's a terminating instruction. 3. Ensures that task shaders always have at least one launch_mesh_workgroups instruction, so the backend doesn't need to implement a special case when the shader doesn't have it. 4. Optionally, implements task_payload using shared memory when task_payload atomics are used. This is useful when the backend is otherwise not capable of handling the same atomic features as it can for shared memory. If this is used, the backend only has to implement the basic load/store operations for task_payload. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16720>	2022-05-27 07:52:03 +00:00
Timur Kristóf	9eaf918ed2	nir: Add new launch_mesh_workgroups intrinsic. The new intrinsic launches mesh shader workgroups from a task shader, with explicit task_payload. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16720>	2022-05-27 07:52:03 +00:00

1 2 3 4 5 ...

7169 Commits