KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Marcin Ślusarz	b95d9bca1d	nir: add load_task_payload intrinsic to nir_divergence_analysis It's divergent depending on sources. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16668>	2022-05-24 17:53:29 +00:00
Marcin Ślusarz	95dbdbf063	nir: add load_mesh_inline_data_intel intrinsic to nir_divergence_analysis It's not divergent. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16668>	2022-05-24 17:53:29 +00:00
Timur Kristóf	47da245ff2	nir: Add explicit task payload atomic intrinsics. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16693>	2022-05-24 17:21:22 +00:00
Icecream95	9f9ed959bd	nir: Add store_combined_output_pan BASE back It's meaningful for this intrinsic and so does not add noise to the lowering pass. (Although dual-source writes must be to RT 0, depth and stencil writes, which store_combined_output_pan is also used for, can still be done with MRT enabled.) Fixes: `5c168f09eb` ("nir: Eliminate store_combined_output_pan BASE") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16685>	2022-05-24 16:13:33 +00:00
Jason Ekstrand	836ff4b586	nir/algebraic: Add two more pack/unpack rules Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16591>	2022-05-23 14:10:54 +00:00
Boris Brezillon	c79451e23c	spirv: Fix windows build Looks like MSVC doesn't like VLAs: src/compiler/spirv/spirv_to_nir.c(3879): error C2057: expected constant expression src/compiler/spirv/spirv_to_nir.c(3879): error C2466: cannot allocate an array of constant size 0 src/compiler/spirv/spirv_to_nir.c(3879): error C2133: 'srcs': unknown size so let's use a static array size. Fixes: `87d7431198` ("spirv: Use nir_vec_scalars() to simplify matrix transpose.") Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16632>	2022-05-23 08:17:02 +00:00
Karol Herbst	cd8c083ab5	clc: disable opaque pointers until they are supported Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16479>	2022-05-21 12:26:37 +00:00
Karol Herbst	b6ed3c6ea2	clc: fix compiler features_macro CTS Test Even with that alone we can't pass the test, as LLVM enables some extensions based on the SPIR target we choose. Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16479>	2022-05-21 12:26:37 +00:00
Karol Herbst	bcc2df4890	clc: speed up compilation by not relying on opencl-c.h This depends on LLVM change: https://reviews.llvm.org/D125401 Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16479>	2022-05-21 12:26:37 +00:00
Karol Herbst	e5a052f75f	clc: drop parsingComplete check This relies too much on the properties of the SPIRV-LLVM-Translator and is required to load SPIR-Vs found in the OpenCL CTS. Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16479>	2022-05-21 12:26:37 +00:00
Karol Herbst	c0cf7f578a	clc: parse localSize and localSizeHint Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16479>	2022-05-21 12:26:37 +00:00
Emma Anholt	260559050a	spirv_to_nir: Cast RelaxedPrecision ALU op dests to mediump. This is controlled by spirv_to_nir_options.relaxed_precision_alu, because some drivers don't want it. This gets us mostly 16-bit math on turnip in vk-5-normal. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16465>	2022-05-19 19:43:36 +00:00
Emma Anholt	87d7431198	spirv: Use nir_vec_scalars() to simplify matrix transpose. This should emit fewer instructions that need to be copy-propagated away. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16465>	2022-05-19 19:43:36 +00:00
Rhys Perry	6087f1951e	nir: call nir_metadata_preserve in nir_lower_memory_model Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12448>	2022-05-19 13:37:20 +00:00
Rhys Perry	3eed871f41	nir: call nir_metadata_preserve in nir_vectorize_tess_levels This is necessary to use this pass with the NIR_PASS() macro. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12448>	2022-05-19 13:37:20 +00:00
Rhys Perry	f10d4bf963	nir: call nir_metadata_preserve in nir_io_add_const_offset_to_base This is necessary to use this pass with the NIR_PASS() macro. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12448>	2022-05-19 13:37:20 +00:00
Rhys Perry	0d9ead8ca2	nir: print file when validation fails This should make it clear whether a validation failure happens in RADV or zink. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12448>	2022-05-19 13:37:20 +00:00
Rhys Perry	836470d433	nir: allow NIR_PASS(_, ) If a user wants to skip printing the shader if no changes were made without declaring a dummy variable for the progress. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12448>	2022-05-19 13:37:20 +00:00
Timothy Arceri	c4cec84231	nir/i915g/r300/nv30: skip marking varyings as flat in some drivers Some older drivers don't support GLSL versions with the concept of flat varyings and also don't support integers. Here we add a new setting to make sure we don't use the optimisation that sets varyings to flat. This setting helps us avoid marking varyings as flat and therefore potentially having them changed to ints via varying packing. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6500 Fixes: `7647023f3b` ("glsl: enable the use of the nir based varying linker") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16573>	2022-05-19 01:05:32 +00:00
Jason Ekstrand	fa67f119f2	glsl: Drop this != NULL assertions If this == NULL, we'll get a crash which is pretty much the same thing when it comes to ease of debugging. This fixes a giant pile of warnings with GCC 12 of the form: ../src/compiler/glsl/ir.h: In member function ‘ir_dereference* ir_instruction::as_dereference()’: ../src/util/macros.h:149:30: warning: ‘nonnull’ argument ‘this’ compared to NULL [-Wnonnull-compare] 149 \| #define assume(expr) ((expr) ? ((void) 0) \ \| ~~~~~~~~^~~~~~~~~~~~~~ 150 \| : (assert(!"assumption failed"), \ \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 151 \| __builtin_unreachable())) \| ~~~~~~~~~~~~~~~~~~~~~~~~~ ../src/compiler/glsl/ir.h:150:7: note: in expansion of macro ‘assume’ 150 \| assume(this != NULL); \ \| ^~~~~~ ../src/compiler/glsl/ir.h:160:4: note: in expansion of macro ‘AS_BASE’ 160 \| AS_BASE(dereference) Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16558>	2022-05-18 00:37:50 +00:00
Mike Blumenkrantz	8c8e6e953f	spirv: fix barrier scope assert glslang generates barriers with QueueFamily, so this is totally legal cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16462>	2022-05-17 02:55:20 +00:00
Timothy Arceri	792c9a0a24	glsl: move validation of sampler indirects to the nir linker This will allow us to disable the GLSL IR loop unroller in a following patch and rely on the NIR loop unroller instead. This allows the piglit test spec@!opengl 2.0@max-samplers border to pass on the v3d rpi4 driver. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16543>	2022-05-17 02:12:21 +00:00
Timothy Arceri	ff8ddcb23e	nir: add support for forced sampler indirect loop unrolling Some drivers don't support these indirects and therefore require loop unrolling if a shader uses a loop induction variable to access a sampler array. Here we add a new nir shader compiler option that drivers can set, this will be the equivalent of the EmitNoIndirectSampler setting used in the GLSL IR unrolling pass. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16543>	2022-05-17 02:12:21 +00:00
Ian Romanick	5c90eb1c53	glsl: Delete lower_extracts code The single caller of this function (in st_glsl_to_ir.cpp) always passes false, so this is dead code. v2: Delete convert_vec_index_to_cond_assign method because all the callers are deleted too. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16440>	2022-05-16 16:06:01 +00:00
Ian Romanick	bd665fdd7f	nir: Use nir_vector_extract to generate code for ir_binop_vector_extract Tiger Lake and Ice Lake had similar results. (Ice Lake shown) total cycles in shared programs: 861153442 -> 861153533 (<.01%) cycles in affected programs: 14748 -> 14839 (0.62%) helped: 5 HURT: 10 helped stats (abs) min: 1 max: 2 x̄: 1.80 x̃: 2 helped stats (rel) min: 0.09% max: 0.18% x̄: 0.16% x̃: 0.17% HURT stats (abs) min: 2 max: 18 x̄: 10.00 x̃: 10 HURT stats (rel) min: 0.17% max: 1.54% x̄: 1.06% x̃: 1.24% 95% mean confidence interval for cycles value: 1.15 10.99 95% mean confidence interval for cycles %-change: 0.25% 1.07% Cycles are HURT. Skylake and Broadwell had similar results. (Skylake shown) total cycles in shared programs: 844405063 -> 844405073 (<.01%) cycles in affected programs: 1710 -> 1720 (0.58%) helped: 0 HURT: 4 HURT stats (abs) min: 2 max: 4 x̄: 2.50 x̃: 2 HURT stats (rel) min: 0.35% max: 1.16% x̄: 0.88% x̃: 1.00% 95% mean confidence interval for cycles value: 0.91 4.09 95% mean confidence interval for cycles %-change: 0.30% 1.45% Cycles are HURT. Haswell and all earlier Intel GPUs had similar results. (Haswell shown) total instructions in shared programs: 16710016 -> 16709769 (<.01%) instructions in affected programs: 5842 -> 5595 (-4.23%) helped: 64 HURT: 0 helped stats (abs) min: 3 max: 4 x̄: 3.86 x̃: 4 helped stats (rel) min: 3.36% max: 7.69% x̄: 4.52% x̃: 4.17% 95% mean confidence interval for instructions value: -3.95 -3.77 95% mean confidence interval for instructions %-change: -4.83% -4.22% Instructions are helped. total cycles in shared programs: 881088472 -> 881086722 (<.01%) cycles in affected programs: 68696 -> 66946 (-2.55%) helped: 58 HURT: 6 helped stats (abs) min: 10 max: 202 x̄: 36.41 x̃: 18 helped stats (rel) min: 0.81% max: 16.42% x̄: 4.15% x̃: 1.51% HURT stats (abs) min: 2 max: 88 x̄: 60.33 x̃: 68 HURT stats (rel) min: 0.17% max: 7.06% x̄: 4.94% x̃: 5.60% 95% mean confidence interval for cycles value: -42.14 -12.54 95% mean confidence interval for cycles %-change: -4.66% -1.94% Cycles are helped. No fossil-db changes on any Intel platform. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16440>	2022-05-16 16:06:01 +00:00
Ian Romanick	e944a98826	glsl: Add flag to disable part of do_vec_index_to_cond_assign As of `ca63a5ed3e` ("glsl: fix interpolateAtXxx(some_vec[idx], ...) with dynamic idx"), this lowering pass does two things. It converts ir_binop_vector_extract to an if-ladder to select the dynamically indexed component, and it extracts a ir_binop_vector_extract from the source of an interpolateAt function and applies to the result instead. This change adds a flag to disable the former behavior. The latter is still useful, but NIR has better (and soon even better) ways of doing the former. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16440>	2022-05-16 16:06:01 +00:00
Ian Romanick	4eff1e6481	glsl: Fix mixed tabs and spaces in lower_mat_op_to_vec.cpp This was originally part of a series that made other changes to this file, but all of those changes got dropped. Since the typing was already done, there's no reason to not fix the formatting. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16440>	2022-05-16 16:06:01 +00:00
Gert Wollny	3749a6ecd2	nir: honor lower_double options for ffloor and ffract v2: Don't lower ffloor@64 to ffract@64 when both ops are to be lowered. Settle on ffloor in opt_algebraic because in can be lowered to other ops in lower_double_ops. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>(v1) Jason Ekstrand <jason.ekstrand@collabora.com> (v1) Reviewed-by: Emma Anholt <emma@anholt.net> (v1) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16431>	2022-05-16 15:03:05 +00:00
Timothy Arceri	9b14636876	glsl: simplify finding cursor in varying packing code This is simpler and also avoids an assert() when the last block is empty. Fixes: `e3a45a4778` ("glsl: implement lower_packed_varyings() as a NIR pass") Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16527>	2022-05-16 14:40:14 +00:00
Timothy Arceri	318d8ce6fc	glsl: remove now unused GLSL IR varying linker code Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15731>	2022-05-16 03:33:18 +00:00
Timothy Arceri	7647023f3b	glsl: enable the use of the nir based varying linker Here as well as calling the pass we need to switch the order of some of the information gathering and optimisation calls. We also need to create a custom callback for the dead variables removal pass to clean up dead builtin varying in SSO programs without causing piglit regressions. shader-db results IRIS (BDW): total instructions in shared programs: 17487900 -> 17477072 (-0.06%) instructions in affected programs: 128682 -> 117854 (-8.41%) helped: 587 HURT: 82 helped stats (abs) min: 1 max: 145 x̄: 18.82 x̃: 20 helped stats (rel) min: 0.21% max: 77.78% x̄: 17.41% x̃: 8.85% HURT stats (abs) min: 1 max: 6 x̄: 2.68 x̃: 2 HURT stats (rel) min: 0.25% max: 9.76% x̄: 2.94% x̃: 2.16% 95% mean confidence interval for instructions value: -17.71 -14.66 95% mean confidence interval for instructions %-change: -16.40% -13.42% Instructions are helped. total cycles in shared programs: 857442520 -> 857170199 (-0.03%) cycles in affected programs: 112252720 -> 111980399 (-0.24%) helped: 13733 HURT: 13349 helped stats (abs) min: 1 max: 7293 x̄: 81.44 x̃: 10 helped stats (rel) min: <.01% max: 90.32% x̄: 3.30% x̃: 0.62% HURT stats (abs) min: 1 max: 7424 x̄: 63.38 x̃: 8 HURT stats (rel) min: <.01% max: 192.23% x̄: 3.28% x̃: 0.54% 95% mean confidence interval for cycles value: -14.01 -6.10 95% mean confidence interval for cycles %-change: -0.17% 0.06% Inconclusive result (%-change mean confidence interval includes 0). total sends in shared programs: 971443 -> 970010 (-0.15%) sends in affected programs: 4596 -> 3163 (-31.18%) helped: 446 HURT: 39 helped stats (abs) min: 1 max: 6 x̄: 3.40 x̃: 4 helped stats (rel) min: 3.03% max: 85.71% x̄: 46.48% x̃: 50.00% HURT stats (abs) min: 1 max: 3 x̄: 2.15 x̃: 2 HURT stats (rel) min: 6.67% max: 25.00% x̄: 15.16% x̃: 10.53% 95% mean confidence interval for sends value: -3.13 -2.78 95% mean confidence interval for sends %-change: -44.16% -38.88% Sends are helped. LOST: 235 GAINED: 262 Shader-db results radeonsi (RX580): 169505 shaders in 102144 tests Totals: SGPRS: 7698832 -> 7696552 (-0.03 %) VGPRS: 5547296 -> 5545280 (-0.04 %) Spilled SGPRs: 14795 -> 14773 (-0.15 %) Spilled VGPRs: 3782 -> 3782 (0.00 %) Private memory VGPRs: 1152 -> 1152 (0.00 %) Scratch size: 3872 -> 3872 (0.00 %) dwords per thread Code Size: 162946528 -> 162895264 (-0.03 %) bytes Max Waves: 2449334 -> 2449736 (0.02 %) Totals from affected shaders: SGPRS: 215024 -> 212744 (-1.06 %) VGPRS: 151976 -> 149960 (-1.33 %) Spilled SGPRs: 162 -> 140 (-13.58 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 5249916 -> 5198652 (-0.98 %) bytes Max Waves: 54588 -> 54990 (0.74 %) Panfrost trace checksum is updated as per discussion in: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6343 Some virpipe tess shader piglit tests are added as failures to CI these failures are not a regression but an uncovered existing bug exposed due to the linker no longer sorting internally facing shader interfaces in alphabetical order. See details in: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6481 Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15731>	2022-05-16 03:33:18 +00:00
Timothy Arceri	fa9cee4247	glsl: implement lower_xfb_varying() as a NIR pass This just converts the GLSL IR pass to NIR. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15731>	2022-05-16 03:33:18 +00:00
Timothy Arceri	4600108ddf	glsl: implement opt_dead_builtin_varyings() as a NIR pass And also call it via the NIR varying linker. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15731>	2022-05-16 03:33:18 +00:00
Timothy Arceri	e5122a5543	glsl: add a NIR based varying linker With a NIR based linker we get better xfb packing, and we no longer depend on the GLSL IR optimisations to be able to link shaders with a large amount of dead input/outputs. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15731>	2022-05-16 03:33:18 +00:00
Timothy Arceri	e3a45a4778	glsl: implement lower_packed_varyings() as a NIR pass This is essentially the old GLSL IR packing pass rewritten as a NIR based pass. Doing this packing in NIR after we have preformed NIRs optimisation passes can give us better packing results. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15731>	2022-05-16 03:33:18 +00:00
Timothy Arceri	c1fbd0b8ab	nir: skip lowering io to scalar for must_be_shader_input These varyings cannot be packed by the GLSL linkers packing pass so we need to skip this lowering until later when we can properly handle them. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15731>	2022-05-16 03:33:18 +00:00
Timothy Arceri	99ab530617	nir: abort io info gathering if location is not set or is a temp value Unlike spirv glsl varyings might not have explicit locations set. nir_shader_gather_info() was once only called at the end of linking but these days it even gets called in NIR optimisation loops via nir_opt_phi_precision. In the following patches we implement a NIR version of the GLSL varying linker which means we will have varyings with no location set when nir_shader_gather_info() gets called the first few times, and temp values set only for the purpose of removing unmatched varyings between shaders for some calls after that. Here rather than asserting we simply abort the io info gathering, when we hit these values. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15731>	2022-05-16 03:33:18 +00:00
Timothy Arceri	cba2fd51a2	nir: add variable data fields required for NIR glsl varying linking These will be used in the following patches that add a NIR based varying linker. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15731>	2022-05-16 03:33:18 +00:00
Timothy Arceri	43a8454ea8	glsl: add new build program resource helpers These will be used by a new nir based glsl varying linker that will add varyings directly to the list before the are packed and we lose the information. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15731>	2022-05-16 03:33:18 +00:00
Timothy Arceri	23ea24e11f	glsl/mesa: move parse_program_resource_name() to common linker_util code This will be shared by a new NIR varying linking pass in following patches but probably fits better here anyway considering its also used by shader_query.cpp Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15731>	2022-05-16 03:33:18 +00:00
Timothy Arceri	5d57bd0345	nir/glsl: wrap component_slots_aligned() Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15731>	2022-05-16 03:33:18 +00:00
Timothy Arceri	6dbe075f92	nir/glsl: wrapper field_index() Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15731>	2022-05-16 03:33:18 +00:00
Timothy Arceri	42a97a0aef	nir/glsl: wrapper contains_{double,interger}() Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15731>	2022-05-16 03:33:18 +00:00
Timothy Arceri	7af9459670	nir/glsl: add glsl_record_compare() wrapper Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15731>	2022-05-16 03:33:18 +00:00
Jason Ekstrand	98cc4c3a20	nir: Use nir_shader_instructions_pass in nir_lower_input_attachments This simplifies things a bit and also fixes metadata handling. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16482>	2022-05-13 22:51:38 +00:00
Jason Ekstrand	a170448a18	nir: Put the builder first in lower_input_attachments helpers This is more idiomatic. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16482>	2022-05-13 22:51:38 +00:00
Jason Ekstrand	5410f4ee89	mesa/st: Use lower_indirect_var_derefs in st_nir_lower_builtin Instead of having a special NIR helper for GL stuff, we can now use the more generic helper and do so directly. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16482>	2022-05-13 22:51:38 +00:00
Jason Ekstrand	e16197c46e	nir: Add a var set version of lower_indirect_derefs This version takes a set of variables and totally lowers indirects on any variable in the set. We also rewrite the builtin_uniform version to use the new helper internally. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16482>	2022-05-13 22:51:38 +00:00
Jason Ekstrand	c23b20d43a	nir: Preserve metadata if remove_dead_derefs makes no progress Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16482>	2022-05-13 22:51:38 +00:00
Marcin Ślusarz	7446acf4b4	compiler: add VARYING_SLOT_CULL_PRIMITIVE Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16493>	2022-05-13 09:43:02 +00:00
Georg Lehmann	bc5c68fc08	nir/opt_algebraic: Optimize Doom Eternal's word extract by LSB. Foz-db GFX10_3: Totals from 419 (0.31% of 134913) affected shaders: CodeSize: 4126032 -> 4121756 (-0.10%) Instrs: 783608 -> 782541 (-0.14%) Latency: 7889664 -> 7888521 (-0.01%); split: -0.02%, +0.00% InvThroughput: 1315690 -> 1314863 (-0.06%); split: -0.06%, +0.00% VClause: 11826 -> 11830 (+0.03%) SClause: 27736 -> 27734 (-0.01%) Copies: 50493 -> 50428 (-0.13%); split: -0.13%, +0.01% PreSGPRs: 23264 -> 23265 (+0.00%) Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16436>	2022-05-12 17:10:41 +00:00
Konstantin Seurer	938c9d9615	nir: Add a ray launch size addr intrinsic Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15712>	2022-05-12 15:04:31 +00:00
Timothy Arceri	0f98ed4afe	nir: remove unreachable loop terminators Remove the conditional break statements associated with all terminators that are associated with a fixed iteration count, except for the one associated with the limiting terminator. This logic matches similiar functionality that exists in the old GLSL IR unrolling code. This change helps a piglit test pass on the r300 driver once we switch off the old GLSL IR unrolling code. Shader-db results IRIS (BDW): total instructions in shared programs: 17538619 -> 17538595 (<.01%) instructions in affected programs: 216 -> 192 (-11.11%) helped: 3 HURT: 0 helped stats (abs) min: 7 max: 10 x̄: 8.00 x̃: 7 helped stats (rel) min: 10.00% max: 12.07% x̄: 11.38% x̃: 12.07% total cycles in shared programs: 858674910 -> 858672810 (<.01%) cycles in affected programs: 79540 -> 77440 (-2.64%) helped: 3 HURT: 0 helped stats (abs) min: 620 max: 800 x̄: 700.00 x̃: 680 helped stats (rel) min: 2.45% max: 2.83% x̄: 2.63% x̃: 2.62% Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16399>	2022-05-12 02:06:31 +00:00
Timothy Arceri	4c3d138e5d	nir: always set the exact_trip_count_unknown loop terminator property Previously we only cared if this was set for the limiting terminator. However in the following patch we will make use of this information on other terminators to decide if we can eliminate them. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16399>	2022-05-12 02:06:31 +00:00
Timur Kristóf	7de3034897	ac/nir: Add I/O lowering for task and mesh shaders. Task shaders store their output payload to VRAM where mesh shaders read from. There are two ring buffers: 1. Draw ring: this is where mesh dispatch sizes and the ready bit are stored. 2. Payload ring: this is where the optional payload is stored (up to 16K per task workgroup). Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14929>	2022-05-12 00:29:51 +00:00
Jason Ekstrand	df1876f615	nir: Mark negative re-distribution on fadd as imprecise Otherwise, it would mutate `fneg(fadd(-0, 0))` into `fadd(0, -0)` which isn't correct since -0 + (+0) = +0 + (-0) = +0. This fixes the OpenCL contraction tests on Iris. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16041>	2022-05-12 00:05:10 +00:00
Jason Ekstrand	25249e8be2	nir/lower_blend: Expand or shrink output variables as needed Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16309>	2022-05-11 14:47:33 +00:00
Jason Ekstrand	1d22465362	nir/builder: Add a nir_resize_vector helper We're about to use this a couple of places. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16309>	2022-05-11 14:47:33 +00:00
Jason Ekstrand	352e32e5ba	nir/builder: Add a nir_trim_vector helper This pattern pops up a bunch and the semantics of nir_channels() aren't very convenient much of the time. Let's add a nir_trim_vector() which matches nir_pad_vector(). Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16309>	2022-05-11 14:47:33 +00:00
Jason Ekstrand	244b654de6	nir/lower_blend: Support SNORM and integer formats for logic ops This fixes 158 of the dEQP-VK.pipeline.logic_op.* tests, once we turn the feature on. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16309>	2022-05-11 14:47:33 +00:00
Jason Ekstrand	730d2b7660	nir/lower_blend: Stop passing the whole options object around Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16309>	2022-05-11 14:47:33 +00:00
Jason Ekstrand	dcfffdcad1	nir/lower_blend: Be more explicit about deref assumptions Because we pull the RT from the variable location and use that to look up formats, we need a constant RT index. To deal with arrays (possibly of arrays), we would either need to handle array derefs (we don't today) or we need to require the variables to be split into one variable per RT. Given that we have to lower indirect derefs anyway (to get constant indices), we may as well require the client to split output variables by calling nir_lower_io_arrays_to_elements_no_indirect(). Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16309>	2022-05-11 14:47:33 +00:00
Michael Skorokhodov	fd75be7986	glsl: Fix ir_quadop_vector validation Some glcts tests have failed due to incorrect processing of `ir_quadop_vector` in `ir_validation`. e.g: `GLES31.functional.shaders.builtin_functions.integer.imulextended.int_highp_geometry` Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6461 Fixes: `23cde71b` ("glsl: Stop lowering ir_quadop_vector.") Reviewed-by: Emma Anholt <emma@anholt.net> Signed-off-by: Mykhailo Skorokhodov <mykhailo.skorokhodov@globallogic.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16420>	2022-05-10 23:53:33 +00:00
Georg Lehmann	60c9a45562	nir/opt_algebraic: Simple xor/ishr optimizations. The first pattern here removes the xor-swap pattern. Foz-DB GFX10_3: Totals from 305 (0.23% of 134913) affected shaders: CodeSize: 1589040 -> 1585164 (-0.24%) Instrs: 284344 -> 283375 (-0.34%) Latency: 4205148 -> 4198472 (-0.16%); split: -0.16%, +0.00% InvThroughput: 708745 -> 708739 (-0.00%) Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16411>	2022-05-10 19:29:31 +00:00
Georg Lehmann	66e917fff6	nir/opt_algebraic: Fix mask in shift by constant combining. The comment above is correct, but the code to calculate the mask was broken. No Foz-db changes outside of noise. Fixes: `0e6581b87d` ("nir/algebraic: Reassociate shift-by-constant of shift-by-constant") Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15990>	2022-05-10 18:47:21 +00:00
Timur Kristóf	7f189e3467	nir: Add upper bound for AMD shader arg intrinsics. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13155>	2022-05-10 17:16:03 +00:00
Jason Ekstrand	aea935264a	shader_info: Bump the number of images and textures supported OpenCL requires up to 128 read-only images and up to 64 write images. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15988>	2022-05-10 11:23:15 -05:00
Jason Ekstrand	b37831c606	nir: Gather samplers_used separately from textures Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15988>	2022-05-10 11:23:12 -05:00
Jason Ekstrand	3c07c3e16d	shader_info: Make images_used a bitset Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15988>	2022-05-10 11:23:11 -05:00
Jason Ekstrand	28f534350c	nir: Stop assuming shader_info::textures_used is 32-bit This isn't a hot path. We don't need to be manually using the INSIDE_WORD version which will assert if we ever get a bigger texture index. Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15988>	2022-05-10 11:23:07 -05:00
Jason Ekstrand	625b352f14	nir: Set image_buffers and msaa_images in lower_samplers_as_deref This is where we set images_used so it's less likely that things will accidentally get out-of-sync. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15988>	2022-05-10 11:21:39 -05:00
Jordan Justen	1c3e584dfa	nir/divergence: handle more *_intel intrinsics v2: fix topo/btd (Lionel) Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16421>	2022-05-10 08:49:58 +00:00
Emma Anholt	f3df3d4c80	glsl: Make all drivers take the GLSLOptimizeConservatively path. Now that all consumers of GLSL use NIR, make the remaining drivers take the path that relies on NIR to really do optimization. nouveau steam shader-db runtime -6.69631% +/- 1.29235% (n=12). No change on shader-db there. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16364>	2022-05-10 05:03:34 +00:00
Karol Herbst	9c5fd100cc	nir: add a nir_remove_non_entrypoints helper This code just got duplicated a lot. There is still more, but the remaining instances do a bit more than just removing other functions. Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16348>	2022-05-10 03:37:44 +00:00
Jason Ekstrand	4b67d70d22	nir: Fix constant folding for non-32-bit ifind_msb and clz Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16348>	2022-05-10 03:37:44 +00:00
Emma Anholt	23cde71bb9	glsl: Stop lowering ir_quadop_vector. Now that everybody goes through NIR, glsl_to_nir is happy to handle the instruction and turn it into nir_op_vec4 instead of going to a temp variable and back. No changes on freedreno shader-db. Reviewed-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16363>	2022-05-09 22:13:31 +00:00
Lionel Landwerlin	35d82ecf1e	nir/lower_shader_calls: put inserted instructions into a dummy block When moving code into the main block or loop blocks, put the code into its own : if(true) { ... } block so that we avoid break/continue/return issues. v2: Also take care of the main block with return instructions v3: Make deletion more obvious with dummy if blocks (Jason) v4: Fixup assert for loops (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `8dfb240b1f` ("nir: Add raytracing shader call lowering pass.") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16036>	2022-05-09 08:43:40 +00:00
Lionel Landwerlin	9cf986dcff	nir/lower_shader_calls: don't insert code after break/continue When moving code from below to the insertion cursor point, if the cursor points to a jump instruction, don't bother inserting the code. It would break the break/continue assumptions of NIR and would not be executed anyway. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `8dfb240b1f` ("nir: Add raytracing shader call lowering pass.") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16036>	2022-05-09 08:43:40 +00:00
Lionel Landwerlin	51dea59eb4	nir/lower_shader_calls: don't use nop instructions as cursors Stop using nop instructions which are causing issues with break/continue, instead use a nir_cursor (which brings its share of pain). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `8dfb240b1f` ("nir: Add raytracing shader call lowering pass.") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16036>	2022-05-09 08:43:40 +00:00
Jason Ekstrand	25661ea028	nir/cf: Return a cursor from nir_cf_extract as well Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16036>	2022-05-09 08:43:40 +00:00
Lionel Landwerlin	d65cf403f3	nir/cf: return cursor after insertion of cf_list This will be useful to cut code from one location and paste it at another place and later keep pasting after the previous insertions. v2: update comment (Jason) deal with stiching 2 empty blocks (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16036>	2022-05-09 08:43:40 +00:00
Icecream95	ad864a7c15	nir/lower_tex: Copy more fields in lower_tex_to_txd and friends Fixes NIR validation errors for OpenMW on Panfrost. Fixes: `1f97819fbe` ("panfrost: Emulate GL_CLAMP on Bifrost") Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15939>	2022-05-07 10:51:10 +00:00
Mike Blumenkrantz	5c24eb721a	nir/gather_info: flag fbfetch on subpass image loads might not be able to determine which output is being read, but these are definitely fbfetch uses (from lavapipe) Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16346>	2022-05-06 17:04:34 +00:00
Emma Anholt	dd3179aff0	glsl: Remove unused lower_variable_index_to_cond_assign. It's been replaced by nir_lower_indirect_derefs(). Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8044>	2022-05-05 22:25:03 +00:00
Emma Anholt	2529690ee3	glsl: Remove EmitNoLoops and the associated lower_jumps(lower_break=true) code. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8044>	2022-05-05 22:25:03 +00:00
Emma Anholt	c03cc83ef1	compiler/glsl: Remove the dead parts of build_program_resource_list(). These have all moved to NIR linking. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8044>	2022-05-05 22:25:03 +00:00
Emma Anholt	3a42e92a4f	glsl: Drop the dead MOD_TO_FLOOR path. It's now called lower_fmod in NIR. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8044>	2022-05-05 22:25:03 +00:00
Emma Anholt	7f13763690	glsl: Remove the unused lower_if_to_cond_assign. Now that everything goes through NIR, nir_opt_peephole_select has replaced it. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8044>	2022-05-05 22:25:03 +00:00
Emma Anholt	9617184bc2	glsl: Retire the non-NIR GLSL linking paths. Now that we have only GLSL->NIR as a path in the frontend, we can rely on the NIR linking support. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8044>	2022-05-05 22:25:03 +00:00
Eric Anholt	e566b54a59	glsl: Remove UBO reference lowering. All UBO-supporting drivers now go through the NIR path, which does a better job of it. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8044>	2022-05-05 22:25:03 +00:00
Georg Lehmann	5833fab766	nir/lower_mediump: Add a new pass to fold 16bit image load/store. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15179>	2022-05-04 09:58:03 +00:00
Emma Anholt	695587413b	nir: Don't assert on tg4 offset range. From the GL 4.6 spec: "If the value of any non-ignored component of the offset vector operand is outside implementation-dependent limits, the results of the texture lookup are undefined." We shouldn't assertion fail, then. GLSL-to-NIR shouldn't be enforcing limits on TG4 offsets, since it doesn't for non-TG4 tex_src_offset either. Leave it up to the driver to handle it. Fixes a crash in a piglit test on nouveau that supplies a negative random number up to 10,000 as the first coordinate for some reason. Other NIR drivers lowered TG4 explicit offsets to tex_src_offset, so they weren't affected. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16261>	2022-05-03 21:45:49 +00:00
Alyssa Rosenzweig	ca280b2283	nir: Don't set writes_memory for reading XFB That's a read, not a write. Fixes optimizations getting disabled for fragment shaders when linked with a shader producing transform feedback varyings. Fixes: `85a723975b` ("nir: add and gather shader_info::writes_memory") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16285>	2022-05-03 19:02:17 +00:00
Emma Anholt	e3607e96bb	nir: Eliminate out-of-bounds read/writes in local lowering. Avoids nir validation assertion failures, and it's not like backend drivers would want to see definitely-out-of-bounds read/writes either. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16066>	2022-05-03 18:32:47 +00:00
Timothy Arceri	180398f785	nir: fix sorting before assigning varying driver locations We need to make sure we also properly sort varyings sharing a single slot otherwise we can end up assigning earlier components to the next slot if we have already processed later components. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6392 Fixes: `1e93b0caa1` ("mesa/st: add support for NIR as possible driver IR") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16208>	2022-05-03 00:04:30 +00:00
Karol Herbst	93144175fa	vtn: clamp SpvOpImageQuerySize dest to 32 bit CL image arrays slice is 64 bit for whatever reason... Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16205>	2022-04-29 00:01:20 +00:00
Jason Ekstrand	c31db58f65	nir/deref: Add an alu-of-cast optimization Casts shouldn't change the bit pattern of the deref and you have to cast again after you're done with the ALU anyway so we can ignore casts on ALU sources. This means we can actually start constant folding NULL checks even if there are annoying casts in the way. Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15673>	2022-04-28 23:05:48 +00:00
Emma Anholt	536c8ee96d	nir/lower_tex: Make the adding a 0 LOD to nir_op_tex in the VS optional. This controls the whole lowering of "make tex ops with implicit derivatives on non-implicit-derivative stages be tex ops with an explicit lod of 0 instead", but it's really hard to describe that in a git commit summary. All existing callers get it added except: - nir_to_tgsi which didn't want it. - nouveau, which didn't want it (fixes regressions in shadowcube and shadow2darray with NIR, since the shading languages don't expose txl of those sampler types and thus it's not supported in HW) - optional lowering passes in mesa/st (lower_rect, YUV lowering, etc) Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16156>	2022-04-28 21:26:08 +00:00
Karol Herbst	a2c9e1cb50	nir: add 16 and 64 bit fisnormal lowering Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16206>	2022-04-28 18:36:52 +00:00
Gert Wollny	47d3f7c69f	nir: Don't optimize to 64 bit fsub if the driver doesn't support it Fixes: `a4840e15ab` r600: Use nir-to-tgsi instead of TGSI when the NIR debug opt is disabled. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16130>	2022-04-27 00:01:20 +00:00
Jason Ekstrand	e24d8760e9	nir: Constant fold sampler/texture offsets Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16171>	2022-04-26 22:34:39 +00:00
Jason Ekstrand	9332598b26	nir/constant_folding: Break TXB folding into a helper function Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16171>	2022-04-26 22:34:39 +00:00
Yevhenii Kolesnikov	65caf46b3b	nir: Remove single-source phis before opt_if_loop_last_continue We might have some single-source phis leftover after prior optimizations. We want to get rid of them before merging the blocks. Fixes: `5921a19d4b` ("nir: add if opt opt_if_loop_last_continue()") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6312 Signed-off-by: Yevhenii Kolesnikov <yevhenii.kolesnikov@globallogic.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16095>	2022-04-26 17:06:07 +00:00
Jason Ekstrand	ef9d97ec1f	spirv: Handle Op*MulExtended for non-32-bit types Fixes: `58bcebd987` ("spirv: Allow [i/u]mulExtended to use new nir opcode") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6306 Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16060>	2022-04-26 15:16:11 +00:00
Alyssa Rosenzweig	94b01ddcdd	nir: Use u_worklist to back nir_block_worklist u_worklist is nir_block_worklist, suitably generalized. All we need to do is define the macros to translate between the APIs. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16046>	2022-04-25 23:50:57 +00:00
Jason Ekstrand	1755730362	nir: Lower all bit sizes of usub_borrow It's not clear why this is restricted to 32-bit besides that being the only bit size where GLSL has an intrinsic for this. All drivers that set this probably want it lowered for all bit sizes as far as I can tell. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6353 Fixes: `8a3e344180` ("nir/opt_algebraic: Fix some expressions with ambiguous bit sizes") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16146>	2022-04-25 21:27:09 +00:00
Samuel Pitoiset	4ebb5391ac	nir: mark XFB varyings as unmoveable to prevent them to be remapped XFB varyings are considered as always active IO to prevent them to be removed or compacted. Though, if the NIR linker doesn't mark XFB varyings as unmoveable it still possible to remap other varyings to the same location/component. Fixes KHR-Single-GL46.enhanced_layouts.xfb_override_qualifiers_with_api with Zink and a bunch of other dEQP XFB tests. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6301 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16092>	2022-04-25 07:56:27 +00:00
Samuel Pitoiset	26f74f17d9	nir: fix marking XFB varyings as always active IO Components need to be handled, otherwise if a shader has two XFB varyings at the same location, only one will be marked as always active. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16092>	2022-04-25 07:56:27 +00:00
Mike Blumenkrantz	a6a4bf0f1e	glsl/nir: set new_style_shadow for sparse tex ops as necessary this needs the sparse result type, which is not the ir type Fixes: `f4a972b748` ("glsl/nir: convert sparse ir_texture to nir") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16097>	2022-04-24 15:56:05 +00:00
Marek Olšák	f7a77ff900	nir: fix an uninitialized variable valgrind warning in nir_group_loads pass_flags is only initialized for grouped loads, so change the order Fixes: `33b4eb149e` - nir: add new SSA instruction scheduler grouping loads into indirection groups Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16090>	2022-04-22 18:18:09 +00:00
Lionel Landwerlin	9f44a26462	nir/divergence: handle load_global_block_intel Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `dd39e311b3` ("nir: Add nir_intrinsic_{load,store}_deref_block_intel") Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16075>	2022-04-21 17:41:04 +00:00
Gert Wollny	496fd59d71	nir: Add pass to split 64 bit vec3 and vec4 variable access and phis NTT can't convert local 64 variables of type vec3 or vec4, therefore, they need to be split into vec2 + double or vec2 + vec2. At the same time deal splitting the phi nodes. The pass goes into the global namespace because it is also useful for r600. v2: only lower function_temps (Emma) and handle array of arrays (Jason) v3: - remove bool parameter in function to merge vec3 and vec4 - simplify load/store_deref_(array\|var) - use a pointer hash table - simplify PHI splitting (all Emma) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15945>	2022-04-21 16:57:11 +00:00
Erik Faye-Lund	30aab0af07	nir/lower_int64: do not try to clamp floats to int-range The clamping isn't correct, because the exact values ended up getting rounded off a bit when converting back to floats. But, converting floats to integers have undefined results when the float value doesn't fit in the integer. So let's not try to clamp the value here. This was caught by digging at a Clang warning, see this thread for details: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15547#note_1329769 Acked-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16022>	2022-04-21 14:12:34 +00:00
Alexey Bozhenko	25acf1d869	spirv: fix OpBranchConditional when both branches are the same Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6246 Signed-off-by: Bozhenko Alexey <oleksii.bozhenko@globallogic.com> Fixes: `64cb143b92` ("spirv: Fix handling of OpBranchConditional with same THEN and ELSE") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15929>	2022-04-21 13:41:24 +00:00
Rhys Perry	dab745f3b4	nir/copy_prop_vars: fix non-vector shader call payloads Fixes RADV+Q2RTX. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Fixes: `ff05137c2d` ("nir: introduce and use nir_component_mask") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16029>	2022-04-20 21:03:03 +00:00
Jason Ekstrand	1b8a43a0ba	util: Remove util_cpu_detect util_cpu_detect is an anti-pattern: it relies on callers high up in the call chain initializing a local implementation detail. As a real example, I added: ...a Mali compiler unit test ...that called bi_imm_f16() to construct an FP16 immediate ...that calls _mesa_float_to_half internally ...that calls util_get_cpu_caps internally, but only on x86_64! ...that relies on util_cpu_detect having been called before. As a consequence, this unit test: ...crashes on x86_64 with USE_X86_64_ASM set ...passes on every other architecture ...works on my local arm64 workstation and on my test board ...failed CI which runs on x86_64 ...needed to have a random util_cpu_detect() call sprinkled in. This is a bad design decision. It pollutes the tree with magic, it causes mysterious CI failures especially for non-x86_64 developers, and it is not justified by a micro-optimization. Instead, let's call util_cpu_detect directly from util_get_cpu_caps, avoiding the footgun where it fails to be called. This cleans up Mesa's design, simplifies the tree, and avoids a class of a (possibly platform-specific) failures. To mitigate the added overhead, wrap it all in a (fast) atomic load check and declare the whole thing as ATTRIBUTE_CONST so the compiler will CSE calls to util_cpu_detect. Co-authored-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15580>	2022-04-20 18:44:35 +00:00
Daniel Schürmann	90a0675989	nir/lower_alu_to_scalar: don't set the nir_builder cursor This ensures recursive lowering in a single pass. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15977>	2022-04-20 17:53:48 +00:00
Mike Blumenkrantz	27a43b531b	nir/fold_16bit_sampler_conversions: add a mask for supported sampler dims AMD might not support cubes, but that doesn't mean cubes can't be used on other drivers Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15852>	2022-04-20 12:12:36 +00:00
Erik Faye-Lund	ff05137c2d	nir: introduce and use nir_component_mask The BITFIELD_MASK() macro is intended for using with actual bitfields, not with nir_component_mask_t. This means we do some extra work to handle values that are invalid for nir_component_mask_t in the first place. This eliminates some warnings on Clang, where the compiler complains about casting UINT32_MAX to UINT16_MAX. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15547>	2022-04-19 06:54:47 +00:00
Timothy Arceri	4b4bb46af4	nir: fix setting varying from uniform as flat Here we just make sure we match the interpolation type on both sides of the shader interface. Drivers like d3d12 are expecting this. Fixes: `9401990e6f` ("nir/linker: set varying from uniform as flat") Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16003>	2022-04-18 11:45:56 +00:00
Emma Anholt	66a0f318fd	nir: Avoid generating extra ftruncs for array handling. It's quite likely that the source of the f2i32 was already an integer, in which case we can skip the ftrunc (particularly useful on the int-to-float class of hardware that's unlikely to just have a native trunc opcode!). Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15870>	2022-04-16 13:07:09 -07:00
Emma Anholt	e4aa5f7889	nir: Skip fround_even on already-integral values. Just like the other make-the-float-an-integer opcodes. Noticed in a gallium nine shader run through TGSI-to-NIR, where the array index had been floored by the user, but got implicitly rounded by DX9 array indexing. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15870>	2022-04-16 13:07:09 -07:00
Emma Anholt	6947016b46	nir: Add lowering for fround_even on r300. When we put NIR in the compiler stack for r300, indirect addressing broke for gallium nine. DX's array indirects round the float value, so the DX shader gets mapped to a TGSI "ARR ADDR[0] src.x" instruction. Translating that to NIR maps to r0[f2i32(fround(src.x))]. While we might hope that in translation back using nir-to-tgsi after optimization we would recognize the construct and emit ARR again, that's going to be error prone (think "what if src.x is in a NIR register?") so we need a fallback plan. r300 will be able to handle this lowering, so get it in place first to fix the regression. Fixes: #6297 Fixes: `7d2ea9b0ed` ("r300: Request NIR shaders from mesa/st and use NIR-to-TGSI.") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15870>	2022-04-16 13:07:09 -07:00
Jason Ekstrand	5c9e4d400a	nir/opcodes: fisfinite32 should return bool32 Otherwise constant-folding will fold it to 0/1 instead of 0/~0. Fixes: `330e28155f` ("nir: add 32-bit bool of fisfinite") Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15984>	2022-04-16 02:46:12 +00:00
Jason Ekstrand	319d87846c	nir,microsoft: Move scale_fdiv into a common NIR pass While we're at it, convert to nir_shader_instructions_pass() to get rid of some boilerplate and get metadata correct. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15983>	2022-04-16 02:10:25 +00:00
Rhys Perry	46d14abeae	nir/builder: add nir_{ine,ibfe,ubfe}_imm() helper Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15854>	2022-04-15 23:56:11 +00:00
Rhys Perry	9baa45c189	nir/gather_info: fix system_value_read for rt/mesh system values Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Fixes: `c7eaf03068` ("radv: use shader_info::system_values_read") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15952>	2022-04-15 11:10:22 +00:00
Mike Blumenkrantz	5b0634d735	nir/lower_tex: fix rect queries with lower_rect set queries still need the sampler_dim changed Fixes: `682e14d3ea` ("nir: lower_tex: Don't normalize coordinates for TXF with RECT") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15895>	2022-04-14 22:57:23 +00:00
Jason Ekstrand	46d9b0e431	clc: Declare LLVMContexts on the stack This prevents more use-after-free errors. Passing them around using std::unique_ptr ensures that the LLVMContext gets destroyed but doesn't ensure destruction order. Declaring it on the stack ensures that the context doesn't get destroyed until right before the the function returns which is after any other LLVM stuff is destroyed. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Icecream95 <ixn@disroot.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15937>	2022-04-14 21:19:56 +00:00
Jason Ekstrand	6099e6ce9a	clc: Rework logging a bit First, separate out the LLVM context logging to make it take a clc_logger instead of passing in a string stream. Currently, the LLVM context may outlive the string stream which we assign which may lead to use-after-free errors. Second, use a separate string stream for clang diagnosticl logging which we intentionally declare before the compiler so the compiler can't outlive it. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Icecream95 <ixn@disroot.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15937>	2022-04-14 21:19:56 +00:00
Jason Ekstrand	6e3b9b1b1d	clc: Only initialize LLVM once Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Icecream95 <ixn@disroot.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15937>	2022-04-14 21:19:56 +00:00
Dave Airlie	fdab872224	clc: initialise one more llvm stage Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Icecream95 <ixn@disroot.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15937>	2022-04-14 21:19:56 +00:00
Dave Airlie	b518020f64	clc: add simple llvm initialise API This just calls some of the LLVM init functions in a common place Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Icecream95 <ixn@disroot.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15937>	2022-04-14 21:19:56 +00:00
Icecream95	f226222846	clc: Use stringstream for printing spirv errors The type of the spv_position_t components can differ across platforms, it's simpler to just let C++ overloading handle it. Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15437>	2022-04-14 00:14:43 +00:00
Rhys Perry	778fc176b1	nir/opt_load_store_vectorize: create load_shared2_amd/store_shared2_amd Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13778>	2022-04-13 23:08:07 +00:00
Rhys Perry	dc835626b3	nir/opt_load_store_vectorize: fix broken indentation Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13778>	2022-04-13 23:08:07 +00:00
Rhys Perry	8ff122f8b8	nir: add load_shared2_amd and store_shared2_amd Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13778>	2022-04-13 23:08:07 +00:00
Rhys Perry	5c038b3f02	nir: add _amd global access intrinsics These are the same as the normal ones, but they take an unsigned 32-bit offset in BASE and another unsigned 32-bit offset in the last source. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14124>	2022-04-13 16:23:35 +00:00
Timur Kristóf	a7147ef1e8	nir: Handle out of bounds access in nir_vectorize_tess_levels. Replace out of bounds loads with undef. Then, delete instructions with out of bounds access. Fixes: `f5adf27fb9` "nir,radv: add and use nir_vectorize_tess_levels()" Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6264 Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15775>	2022-04-13 13:25:10 +00:00
Lionel Landwerlin	3394680368	nir/lower_shader_calls: name resume shaders Helpful when lost in a sea of NIR :) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15887>	2022-04-13 06:59:29 +00:00
Jason Ekstrand	d0ace28790	nir/lower_int64: Fix [iu]mul_high handling `e551040c60`, which added a new mechanism for 64-bit imul which is more efficient on BDW and later Intel hardware also introduced a bug where we weren't properly walking both X and Y. No idea how testing didn't find this. Fixes: `e551040c60` ("nir/glsl: Add another way of doing lower_imul64 for gen8+" Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6306 Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15829>	2022-04-12 23:19:38 +00:00
Marcin Ślusarz	9b23aaf3cf	nir: remove gl_PrimitiveID output from MS when it's not used in FS Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15340>	2022-04-12 09:35:26 +00:00
Mike Blumenkrantz	9c212e117d	nir/lower_point_size_mov: handle case where gl_Position isn't written Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15821>	2022-04-10 16:45:15 +00:00
Mike Blumenkrantz	310903d096	nir/lower_point_size_mov: fix check for overwriting existing pointsize this should match the comment and allow overwriting injected pointsize variables regardless of whether xfb is flagged Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15699>	2022-04-07 21:56:09 -04:00
Timothy Arceri	7d216f296a	glsl: fix needs_lowering() call in varying packing pass Here we remove the outer arrays on geom and tess shaders where needed. Without this the pass can sometimes attempt to pack a varying on only one side of the shader interface where it is not actually needed. The result can be mismatching varying types. Fixes: `d6b9202873` ("glsl: disable varying packing when its not safe") Tested-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15761>	2022-04-07 23:57:40 +00:00
Mike Blumenkrantz	6cfcf891c1	nir/lower_tex: avoid adding invalid LOD to RECT textures this is illegal Fixes: `74ec2b12be` ("nir/lower_tex: Rework invalid implicit LOD lowering") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15804>	2022-04-07 21:37:58 +00:00
Jason Ekstrand	4cd260590c	nir: Dont set coord_components on txs Fixes: `e1fc23265f` ("nir: Add a pass for lowering CL-style image ops to texture ops") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15758>	2022-04-07 21:13:35 +00:00
Emma Anholt	03549f3bf3	spirv: Silence "Decoration not allowed on struct members: SpvDecorationRestrict" VK-GL-CTS causes tons of these due to a bug in glslang, to the point where it's hard to find actual issues in test logs. Disable the warning for now, with a link to the issue we're waiting on being resolved. Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15332>	2022-04-05 21:37:46 +00:00
Erik Faye-Lund	ed399a179e	nir/tests: do not use designated initializers in c++ code Designated initializers require C++20, which is a bit easier said than done to support well across meson versions. Let's avoid using them for now instead. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15706>	2022-04-05 16:58:56 +00:00
Ian Romanick	7fd1955412	nir: intel/compiler: Lower TXD on array surfaces on DG2+ DG2 can only do sample_d and sample_d_c on 1D and 2D surfaces. Cube maps and 3D surfaces were already handled, but 1D array and 2D array surfaces were not. Fixes the following Vulkan CTS failures on DG2: dEQP-VK.glsl.texture_functions.texturegradclamp.isampler1darray_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.isampler2darray_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.sampler1darray_fixed_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.sampler1darray_float_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.sampler2darray_fixed_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.sampler2darray_float_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.usampler1darray_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.usampler2darray_fragment The Fixes: tag below is a bit misleading. This commit adds another lowering, similar to the one in the Fixes: commit, that probably should have been added at the same time. I just want to make sure this commit gets applied everywhere that commit was also applied. Fixes: `635ed58e52` ("intel/compiler: Lower txd for 3D samplers on XeHP.") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15681>	2022-03-31 12:59:18 -07:00
Emma Anholt	97f17d4b38	glsl: Delete dont_lower_swz path of lower_quadop_vector. This was last used with Mesa classic, in _mesa_ir_link_shader(). Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15623>	2022-03-30 22:26:15 +00:00
Emma Anholt	761eb7e539	glsl: Delete unused EmitNoPow path. This was last used with i915c, now lower_fpow covers this class of lowering. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15623>	2022-03-30 22:26:15 +00:00
Yonggang Luo	a1814067cd	nir: Move the define of snprintf to header nir.h The define of snprintf in nir_lower_atomics_to_ssbo.c is duplicated, so remove it from this file Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14014>	2022-03-30 00:45:11 +08:00
Yonggang Luo	153cb830c4	vtn: Fixes compiling error for mingw/ucrt by using setjmp/longjmp function instead compiler builtin The compiling error are: ``` [369/1463] Compiling C object src/compiler/nir/libnir.a.p/.._spirv_gl_spirv.c.obj FAILED: src/compiler/nir/libnir.a.p/.._spirv_gl_spirv.c.obj "cc" "-Isrc/compiler/nir/libnir.a.p" "-Isrc/compiler/nir" "-I../../src/compiler/nir" "-Iinclude" "-I../../include" "-Isrc" "-I../../src" "-Isrc/mapi" "-I../../src/mapi" "-Isrc/mesa" "-I../../src/mesa" "-I../../src/gallium/include" "-Isrc/gallium/auxiliary" "-I../../src/gallium/auxiliary" "-Isrc/compiler" "-I../../src/compiler" "-Isrc/compiler/spirv" "-I../../src/compiler/spirv" "-fvisibility=hidden" "-fcolor-diagnostics" "-D_FILE_OFFSET_BITS=64" "-Wall" "-Winvalid-pch" "-std=c11" "-O2" "-g" "-D__STDC_CONSTANT_MACROS" "-D__STDC_FORMAT_MACROS" "-D__STDC_LIMIT_MACROS" "-DPACKAGE_VERSION=\"22.1.0-devel\"" "-DPACKAGE_BUGREPORT=\"https://gitlab.freedesktop.org/mesa/mesa/-/issues\"" "-DHAVE_WINDOWS_PLATFORM" "-DHAVE_SURFACELESS_PLATFORM" "-DUSE_ELF_TLS" "-DUSE_TLS_BEHIND_FUNCTIONS" "-DENABLE_ST_OMX_BELLAGIO=0" "-DENABLE_ST_OMX_TIZONIA=0" "-DEGL_NO_X11" "-DHAVE___BUILTIN_BSWAP32" "-DHAVE___BUILTIN_BSWAP64" "-DHAVE___BUILTIN_CLZ" "-DHAVE___BUILTIN_CLZLL" "-DHAVE___BUILTIN_CTZ" "-DHAVE___BUILTIN_EXPECT" "-DHAVE___BUILTIN_FFS" "-DHAVE___BUILTIN_FFSLL" "-DHAVE___BUILTIN_POPCOUNT" "-DHAVE___BUILTIN_POPCOUNTLL" "-DHAVE___BUILTIN_UNREACHABLE" "-DHAVE___BUILTIN_TYPES_COMPATIBLE_P" "-DHAVE_FUNC_ATTRIBUTE_CONST" "-DHAVE_FUNC_ATTRIBUTE_FLATTEN" "-DHAVE_FUNC_ATTRIBUTE_MALLOC" "-DHAVE_FUNC_ATTRIBUTE_PURE" "-DHAVE_FUNC_ATTRIBUTE_UNUSED" "-DHAVE_FUNC_ATTRIBUTE_WARN_UNUSED_RESULT" "-DHAVE_FUNC_ATTRIBUTE_WEAK" "-DHAVE_FUNC_ATTRIBUTE_FORMAT" "-DHAVE_FUNC_ATTRIBUTE_PACKED" "-DHAVE_FUNC_ATTRIBUTE_RETURNS_NONNULL" "-DHAVE_FUNC_ATTRIBUTE_ALIAS" "-DHAVE_FUNC_ATTRIBUTE_NORETURN" "-DHAVE_FUNC_ATTRIBUTE_VISIBILITY" "-DHAVE_UINT128" "-D_WINDOWS" "-D_WIN32_WINNT=0x0A00" "-DWINVER=0x0A00" "-DPIPE_SUBSYSTEM_WINDOWS_USER" "-D_USE_MATH_DEFINES" "-DUSE_SSE41" "-DUSE_GCC_ATOMIC_BUILTINS" "-DHAS_SCHED_H" "-DHAVE_CET_H" "-DHAVE_STRTOF" "-DHAVE_TIMESPEC_GET" "-DHAVE_STRTOK_R" "-DHAVE_QSORT_S" "-DHAVE_ZLIB" "-DHAVE_ZSTD" "-DHAVE_COMPRESSION" "-DLLVM_AVAILABLE" "-DMESA_LLVM_VERSION_STRING=\"13.0.1\"" "-DLLVM_IS_SHARED=1" "-DDRAW_LLVM_AVAILABLE" "-DMESA_EXECMEM" "-DVK_USE_PLATFORM_WIN32_KHR" "-Werror=implicit-function-declaration" "-Werror=missing-prototypes" "-Werror=return-type" "-Werror=empty-body" "-Werror=incompatible-pointer-types" "-Werror=int-conversion" "-Wimplicit-fallthrough" "-Wno-missing-field-initializers" "-fno-math-errno" "-fno-trapping-math" "-Qunused-arguments" "-fno-common" "-Wno-microsoft-enum-value" "-Werror=format" "-Wformat-security" "-Werror=thread-safety" "-ffunction-sections" "-fdata-sections" "-Werror=pointer-arith" "-Werror=gnu-empty-initializer" "-Wno-override-init" "-Wno-initializer-overrides" -MD -MQ src/compiler/nir/libnir.a.p/.._spirv_gl_spirv.c.obj -MF "src/compiler/nir/libnir.a.p/.._spirv_gl_spirv.c.obj.d" -o src/compiler/nir/libnir.a.p/.._spirv_gl_spirv.c.obj "-c" ../../src/compiler/spirv/gl_spirv.c ../../src/compiler/spirv/gl_spirv.c:241:19: error: incompatible pointer types passing 'jmp_buf' (aka '_JBTYPE [16]') to parameter of type 'void ' [-Werror,-Wincompatible-pointer-types] if (vtn_setjmp(b->fail_jump)) { ^~~~~~~~~~~~ 1 error generated. [376/1463] Compiling C object src/compiler/nir/libnir.a.p/.._spirv_spirv_to_nir.c.obj FAILED: src/compiler/nir/libnir.a.p/.._spirv_spirv_to_nir.c.obj "cc" "-Isrc/compiler/nir/libnir.a.p" "-Isrc/compiler/nir" "-I../../src/compiler/nir" "-Iinclude" "-I../../include" "-Isrc" "-I../../src" "-Isrc/mapi" "-I../../src/mapi" "-Isrc/mesa" "-I../../src/mesa" "-I../../src/gallium/include" "-Isrc/gallium/auxiliary" "-I../../src/gallium/auxiliary" "-Isrc/compiler" "-I../../src/compiler" "-Isrc/compiler/spirv" "-I../../src/compiler/spirv" "-fvisibility=hidden" "-fcolor-diagnostics" "-D_FILE_OFFSET_BITS=64" "-Wall" "-Winvalid-pch" "-std=c11" "-O2" "-g" "-D__STDC_CONSTANT_MACROS" "-D__STDC_FORMAT_MACROS" "-D__STDC_LIMIT_MACROS" "-DPACKAGE_VERSION=\"22.1.0-devel\"" "-DPACKAGE_BUGREPORT=\"https://gitlab.freedesktop.org/mesa/mesa/-/issues\"" "-DHAVE_WINDOWS_PLATFORM" "-DHAVE_SURFACELESS_PLATFORM" "-DUSE_ELF_TLS" "-DUSE_TLS_BEHIND_FUNCTIONS" "-DENABLE_ST_OMX_BELLAGIO=0" "-DENABLE_ST_OMX_TIZONIA=0" "-DEGL_NO_X11" "-DHAVE___BUILTIN_BSWAP32" "-DHAVE___BUILTIN_BSWAP64" "-DHAVE___BUILTIN_CLZ" "-DHAVE___BUILTIN_CLZLL" "-DHAVE___BUILTIN_CTZ" "-DHAVE___BUILTIN_EXPECT" "-DHAVE___BUILTIN_FFS" "-DHAVE___BUILTIN_FFSLL" "-DHAVE___BUILTIN_POPCOUNT" "-DHAVE___BUILTIN_POPCOUNTLL" "-DHAVE___BUILTIN_UNREACHABLE" "-DHAVE___BUILTIN_TYPES_COMPATIBLE_P" "-DHAVE_FUNC_ATTRIBUTE_CONST" "-DHAVE_FUNC_ATTRIBUTE_FLATTEN" "-DHAVE_FUNC_ATTRIBUTE_MALLOC" "-DHAVE_FUNC_ATTRIBUTE_PURE" "-DHAVE_FUNC_ATTRIBUTE_UNUSED" "-DHAVE_FUNC_ATTRIBUTE_WARN_UNUSED_RESULT" "-DHAVE_FUNC_ATTRIBUTE_WEAK" "-DHAVE_FUNC_ATTRIBUTE_FORMAT" "-DHAVE_FUNC_ATTRIBUTE_PACKED" "-DHAVE_FUNC_ATTRIBUTE_RETURNS_NONNULL" "-DHAVE_FUNC_ATTRIBUTE_ALIAS" "-DHAVE_FUNC_ATTRIBUTE_NORETURN" "-DHAVE_FUNC_ATTRIBUTE_VISIBILITY" "-DHAVE_UINT128" "-D_WINDOWS" "-D_WIN32_WINNT=0x0A00" "-DWINVER=0x0A00" "-DPIPE_SUBSYSTEM_WINDOWS_USER" "-D_USE_MATH_DEFINES" "-DUSE_SSE41" "-DUSE_GCC_ATOMIC_BUILTINS" "-DHAS_SCHED_H" "-DHAVE_CET_H" "-DHAVE_STRTOF" "-DHAVE_TIMESPEC_GET" "-DHAVE_STRTOK_R" "-DHAVE_QSORT_S" "-DHAVE_ZLIB" "-DHAVE_ZSTD" "-DHAVE_COMPRESSION" "-DLLVM_AVAILABLE" "-DMESA_LLVM_VERSION_STRING=\"13.0.1\"" "-DLLVM_IS_SHARED=1" "-DDRAW_LLVM_AVAILABLE" "-DMESA_EXECMEM" "-DVK_USE_PLATFORM_WIN32_KHR" "-Werror=implicit-function-declaration" "-Werror=missing-prototypes" "-Werror=return-type" "-Werror=empty-body" "-Werror=incompatible-pointer-types" "-Werror=int-conversion" "-Wimplicit-fallthrough" "-Wno-missing-field-initializers" "-fno-math-errno" "-fno-trapping-math" "-Qunused-arguments" "-fno-common" "-Wno-microsoft-enum-value" "-Werror=format" "-Wformat-security" "-Werror=thread-safety" "-ffunction-sections" "-fdata-sections" "-Werror=pointer-arith" "-Werror=gnu-empty-initializer" "-Wno-override-init" "-Wno-initializer-overrides" -MD -MQ src/compiler/nir/libnir.a.p/.._spirv_spirv_to_nir.c.obj -MF "src/compiler/nir/libnir.a.p/.._spirv_spirv_to_nir.c.obj.d" -o src/compiler/nir/libnir.a.p/.._spirv_spirv_to_nir.c.obj "-c" ../../src/compiler/spirv/spirv_to_nir.c ../../src/compiler/spirv/spirv_to_nir.c:196:16: error: incompatible pointer types passing 'jmp_buf' (aka '_JBTYPE [16]') to parameter of type 'void ' [-Werror,-Wincompatible-pointer-types] vtn_longjmp(b->fail_jump, 1); ^~~~~~~~~~~~ ``` Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14014>	2022-03-30 00:45:08 +08:00
Jason Ekstrand	4a08ee7ecf	spirv/libclc: Add generic versions of arithmetic functions Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15622>	2022-03-29 15:02:07 +00:00
Georg Lehmann	16be909936	nir: Add an option to lower 64bit iadd_sat. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15421>	2022-03-28 20:02:52 +00:00
Georg Lehmann	922916bf64	nir: Move lower_usub_sat64 to nir_lower_int64_options. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15421>	2022-03-28 20:02:52 +00:00
Pierre-Eric Pelloux-Prayer	2bc933f7d5	glsl/nir/linker: fix shader_storage_blocks_write_access shader_storage_blocks_write_access was computed using the buffer indices in the program but ShaderStorageBlocksWriteAccess is used with the shader buffers. So if a VS had 3 SSBOs and a FS had 4, the mask for VS was 0x3 (correct) but the mask for the FS was 0x78 instead of 0x15. Fix this by substracting the index of the first shader buffer in the program's buffers. Fixes: `79127f8d5b` ("glsl: set ShaderStorageBlocksWriteAccess in the nir linker") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6184 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15552>	2022-03-28 11:06:31 +02:00
Pierre-Eric Pelloux-Prayer	61ee560bc5	glsl/nir/linker: update shader_storage_blocks_write_access for SPIR-V Most of the code inside the "!prog->data->spirv" blocks shouldn't be executed for SPIR-V except the part updating the writable mask. See https://gitlab.freedesktop.org/mesa/mesa/-/issues/6184 Cc: mesa-stable Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15552>	2022-03-28 10:37:45 +02:00
Kenneth Graunke	af529b545a	nir: Teach nir_divergence_analysis about Intel-specific intrinsics - load_reloc_const is just an immediate constant load, it's convergent. - nir_intrinsic_load_global_const_block_intel should be convergent, it says the address must be uniform, and we uniformize the predicate - Lowered image intrinsics: image_deref_load_param_intel just reads information about an image, as long as the image variable is convergent it should be too. load_raw_intel...if the address we come up with is convergent, it ought to be as well. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15484>	2022-03-26 00:28:19 +00:00
Christian Gmeiner	2648ccb341	nir: Use const for nir_shader_get_entrypoint(..) nir_shader_get_entrypoint(..) should not modify the passed nir_shader object. Enforce this by marking shader paramenter as const. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15362>	2022-03-25 07:25:37 +00:00
Mike Blumenkrantz	4e35ed8c67	nir/lower_tex: add txp lowering option for arrays this is illegal in vulkan, so zink needs to be able to lower these Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15510>	2022-03-23 23:18:47 +00:00
Georg Lehmann	81b2008af9	nir/legalize_16bit_sampler_srcs: Don't guess source type. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14895>	2022-03-23 20:55:39 +00:00
Georg Lehmann	b5fe1187ec	nir/fold_16bit_sampler_conversions: Fix src type mismatches. Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5996 Fixes: `fb29cef8` ("nir: add many passes that lower and optimize 16-bit input/outputs and samplers") Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14895>	2022-03-23 20:55:39 +00:00
Georg Lehmann	88ec73e5e8	nir/fold_16bit_sampler_conversions: Fix dest type mismatches. Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5996 Fixes: `fb29cef8dd` ("nir: add many passes that lower and optimize 16-bit input/outputs and samplers") Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14895>	2022-03-23 20:55:39 +00:00
Georg Lehmann	798e47be51	nir/fold_16bit_sampler_conversions: Don't fold dest upcasts. This is not a valid optimization. Fixes: `fb29cef8dd` ("nir: add many passes that lower and optimize 16-bit input/outputs and samplers") Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14895>	2022-03-23 20:55:39 +00:00
Daniel Schürmann	832d67e99d	nir: rename nir_src_is_dynamically_uniform to nir_src_is_always_uniform As this function doesn't check for any control-flow dependence, it only returns true for statically (or globally) uniform values. The same holds true for is_binding_dynamically_uniform() in nir_opt_gcm(). Rename to better reflect that property. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14994>	2022-03-23 14:02:08 +00:00
Jason Ekstrand	4b2c78c08a	spirv: Implement the function portion of the Linkage capability Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15486>	2022-03-23 10:24:31 +00:00
Jason Ekstrand	80a076382d	nir: Allow nir_var_mem_global variables Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15486>	2022-03-23 10:24:31 +00:00
Rhys Perry	e82aba88dc	nir: allow bindless image/texture/sampler handles to be vectors Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12773>	2022-03-22 16:33:27 +00:00
Rhys Perry	ff52a724d4	nir: add load_{scalar,vector}_arg_amd and load_smem_amd intrinsics load_smem_gcn is similar to load_global/load_global_constant, but it's guaranteed to use SMEM and it's much easier to utilize the format's 32-bit offset source. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12773>	2022-03-22 16:33:27 +00:00
Kenneth Graunke	85d30846db	nir: Print divergence status of SSA values if analysis was ever run. After running divergence analysis, we include "div" or "con" for each SSA def's divergence/convergence status: vec1 32 div ssa_35 = fddy ssa_34 vec1 32 con ssa_36 = fddy ssa_6.x We omit this before the first time divergence analysis has been run. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15445>	2022-03-21 22:46:34 -07:00
Qiang Yu	9401990e6f	nir/linker: set varying from uniform as flat Flat varying can save some rasterization compute cost and register needed by shader. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15341>	2022-03-22 01:33:23 +00:00
Qiang Yu	2617e6c028	nir/linker: disable varying from uniform lowering by default This fixes performance regression for Specviewperf/Energy on AMD GPU. Other GPUs passing varying by memory may choose to re-enable it as need. Fixes: `2604625043` ("nir/linker: support uniform when optimizing varying") Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15341>	2022-03-22 01:33:23 +00:00
Karol Herbst	43c3f4386b	nir: fix nir_sweep for printf I hit a memory corruption trying to implement printf for Rusticl Signed-off-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15474>	2022-03-21 13:23:04 +00:00
Jason Ekstrand	e9ff6f4f06	nir/print: Add support for generic pointers The way they're handled is that deref->modes is treated as a bitfield of possible modes. Variables are required to have a specific mode and derefs with deref_type_var are as well. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Mike Blumenkrantz	cdcfcb7916	nir/lower_is_helper_invocation: create load_helper_invocation instr with bitsize=1 the specification stipulates that this is a bool value, so don't load it as an int or else nir_validate explodes Fixes: `f17b41ab4f` ("nir: add lowering pass for helperInvocationEXT()") Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15402>	2022-03-21 03:20:33 +00:00
Jason Ekstrand	7030d14e0d	spirv: Properly mangle generic pointers Fixes: `a8e53a772f` ("spirv: Add generic pointer support") Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15470>	2022-03-18 21:52:05 +00:00
Connor Abbott	acba08b58f	ir3: Implement and document ldc.k Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	ccc64b7e00	ir3: Plumb through store_uniform_ir3 intrinsic Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	3244e659e0	ir3: Implement basic shader preamble intrinsics These will be used to implement the ir3-specific shader preamble lowering in NIR. shps is conceptually similar to getone (although it technically can't be duplicated) and shpe is similar to other barriers, since it has to happen after any stores to the constant file in the preamble. Add NIR intrinsics and plumbs them through ir3. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	3b96ad70ee	nir: Add a preamble optimization pass This pass tries to move computations that are uniform for the entire draw to the preamble. There's also an API for backends to insert their own instructions into the preamble, for porting existing UBO pushing passes. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	31221ee556	nir: Add a "deep" instruction clone For the shader preamble, we need to add support for cloning one instruction at a time into the preamble, but we also need to rewrite sources. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	d1b017d479	nir: Add preamble functions These are functions that run before the entrypoint at least once per draw and write their results via store_preamble, and then are loaded in the rest of the shader via load_preamble. We will add users in the following commits. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Konstantin Seurer	357bd1829f	nir,spirv: Preserve ray_query_value Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14565>	2022-03-13 12:02:05 +01:00
Iago Toral Quiroga	fed51585c4	nir/schedule: allow drivers to decide about instruction latency On V3D reading UBOs from uniform addresses uses a more efficient mechanism with lower latency. On other platforms there may be simular scenarios. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:04 +00:00
Iago Toral Quiroga	e7a4e97076	nir/schedule: use larger delay for non-filtered memory reads This has been pending for a long time. It is not very consistent to add a significant delay for textures and not do it for UBOs, etc The reason we have not been doing this so far is the accumulated effect on register pressure for V3D as shown by shader-db results below, but from the point of view of a generic scheduler it makes sense to do this. Later patches will address V3D specific issues with register pressure derived from this by letting the driver control its instruction delay settings. total instructions in shared programs: 12662138 -> 13126587 (3.67%) instructions in affected programs: 1813091 -> 2277540 (25.62%) helped: 2410 HURT: 10499 total threads in shared programs: 415858 -> 407208 (-2.08%) threads in affected programs: 17348 -> 8698 (-49.86%) helped: 8 HURT: 4333 total uniforms in shared programs: 3711483 -> 3812698 (2.73%) uniforms in affected programs: 128012 -> 229227 (79.07%) helped: 3474 HURT: 2143 total max-temps in shared programs: 2138763 -> 2318430 (8.40%) max-temps in affected programs: 318780 -> 498447 (56.36%) helped: 588 HURT: 11997 total spills in shared programs: 3860 -> 49086 (1171.66%) spills in affected programs: 709 -> 45935 (6378.84%) helped: 23 HURT: 1595 total fills in shared programs: 5573 -> 55810 (901.44%) fills in affected programs: 1067 -> 51304 (4708.25%) helped: 23 HURT: 1595 LOST: 3 GAINED: 0 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:04 +00:00
Iago Toral Quiroga	3bd041e2fb	nir/schedule: handle nir_intrinsic_group_memory_barrier Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:04 +00:00
Iago Toral Quiroga	46e330c07e	nir/schedule: fix handling of generic memory barrier We can get a generic nir_intrinsic_memory_barrier to represent a barrier involving multiple semantics (instead of getting individual specific barriers for each semantic). This means that we need to consider these as potentially affecting shared memory access as well. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:04 +00:00
Mike Blumenkrantz	0d80aed363	nir/gather_info: check copy_deref instrs for writing outputs this is a valid way to write an output even though it usually gets rewritten to some other instruction later on Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15228>	2022-03-09 05:10:21 +00:00
Mike Blumenkrantz	53cbba83eb	glsl: store OES/EXT point_size extension enablement to shader struct Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15228>	2022-03-09 05:10:21 +00:00
Timur Kristóf	4b99b528f5	nir: Introduce workgroup_index and ability to lower workgroup_id to it. The workgroup_index is intended for situations when a 3 dimensional workgroup_id is not available on the HW, but a 1 dimensional index is. In this case, we can use lower the 3D ID to use this. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15103>	2022-03-08 17:36:31 +00:00
Timur Kristóf	6a4c01f3ef	nir: Extract lower_id_to_index into a separate function. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15103>	2022-03-08 17:36:31 +00:00
Timur Kristóf	64acec0ef9	nir: Fix lowering terminology of compute system values: "from"->"to". This is to match other NIR terminology. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15103>	2022-03-08 17:36:31 +00:00
Timur Kristóf	5b9bf3434f	nir: Fix handling of NV_mesh_shader PRIMITIVE_INDICES output. PRIMITIVE_INDICES is a flat array in NV_mesh_shader, not a proper arrayed output, as opposed to D3D-style mesh shaders where it's addressed by the primitive index. Prevent assigning several slots to primitive indices, to avoid issues. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15160>	2022-03-08 13:44:10 +00:00
Georg Lehmann	6731460194	nir: Fix source type for fragment_fetch_amd. Like txf_ms, these take integers not floats. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15242>	2022-03-07 12:21:12 +00:00
Samuel Pitoiset	6532307555	nir: introduce nir_pack_{sint,uint}_2x16 instructions These instructions have AMD hardware equivalent and they will be used to lower fragment shader outputs in NIR. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15231>	2022-03-04 08:06:56 +00:00
Daniel Schürmann	ca4595e01a	nir/opt_shrink_vectors: update docstring in order to reflect the various recent improvements. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12468>	2022-03-04 00:18:58 +00:00
Daniel Schürmann	405829cd85	nir/opt_shrink_vectors: remove duplicate components from vecN vecN instructions which are only used by other ALU will now get duplicate channels removed. i915g: total instructions in shared programs: 396309 -> 396294 (<.01%) instructions in affected programs: 186 -> 171 (-8.06%) r300: total instructions in shared programs: 1165059 -> 1164354 (-0.06%) instructions in affected programs: 35884 -> 35179 (-1.96%) total temps in shared programs: 165497 -> 165326 (-0.10%) temps in affected programs: 2990 -> 2819 (-5.72%) softpipe: total instructions in shared programs: 2860028 -> 2859084 (-0.03%) instructions in affected programs: 55539 -> 54595 (-1.70%) total temps in shared programs: 516939 -> 516546 (-0.08%) temps in affected programs: 6623 -> 6230 (-5.93%) Acked-by: Emma Anholt <emma@anholt.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12468>	2022-03-04 00:18:58 +00:00
Daniel Schürmann	e5963478c2	nir/opt_shrink_vectors: shrink load_const properly This patch enables removal of arbitrary channels in load_const instructions, if they are either unused or duplicates of other channels and only used by ALU. Totals from 692 (0.51% of 134913) affected shaders: (GFX10.3) VGPRs: 21832 -> 21544 (-1.32%) CodeSize: 1322016 -> 1313080 (-0.68%); split: -0.68%, +0.01% Instrs: 243635 -> 242231 (-0.58%); split: -0.58%, +0.00% Latency: 1856138 -> 1857237 (+0.06%); split: -0.09%, +0.15% InvThroughput: 424298 -> 421671 (-0.62%); split: -0.62%, +0.01% VClause: 4580 -> 4583 (+0.07%); split: -0.02%, +0.09% SClause: 14336 -> 14354 (+0.13%); split: -0.04%, +0.17% Copies: 8897 -> 8859 (-0.43%); split: -0.45%, +0.02% PreSGPRs: 20439 -> 20437 (-0.01%) PreVGPRs: 16011 -> 15907 (-0.65%); split: -0.97%, +0.32% i915g: total instructions in shared programs: 396471 -> 396309 (-0.04%) instructions in affected programs: 6408 -> 6246 (-2.53%) total const in shared programs: 56458 -> 56422 (-0.06%) const in affected programs: 407 -> 371 (-8.85%) LOST: shaders/closed/steam/trine-2/fp-3.shader_test FS r300: total instructions in shared programs: 1164421 -> 1165059 (0.05%) instructions in affected programs: 143981 -> 144619 (0.44%) total temps in shared programs: 165488 -> 165497 (<.01%) temps in affected programs: 318 -> 327 (2.83%) total consts in shared programs: 922140 -> 921952 (-0.02%) consts in affected programs: 12438 -> 12250 (-1.51%) softpipe: total instructions in shared programs: 2859978 -> 2860028 (<.01%) instructions in affected programs: 183355 -> 183405 (0.03%) total temps in shared programs: 517071 -> 516939 (-0.03%) temps in affected programs: 1416 -> 1284 (-9.32%) total imm in shared programs: 103601 -> 102767 (-0.81%) imm in affected programs: 3928 -> 3094 (-21.23%) Acked-by: Emma Anholt <emma@anholt.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12468>	2022-03-04 00:18:58 +00:00

... 2 3 4 5 6 ...

7169 Commits