KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Jason Ekstrand	fa717a202c	docs,nir: Document NIR texture instructions Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11775>	2021-07-23 15:53:57 +00:00
Jason Ekstrand	4465ca296d	nir: Suffix all the MCS texture stuff _intel It's intel-specific, used to get at MSAA compression information. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11775>	2021-07-23 15:53:57 +00:00
Jason Ekstrand	60b5faf572	nir/lower_tex: Add a lower_txs_cube_array option Several bits of hardware require the division by 6 to happen in the shader. May as well have common lowering for it. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12005>	2021-07-22 14:22:35 -05:00
Jordan Justen	6898549d56	nir: Add nir_lower_image() to lower cube image sizes Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9466>	2021-07-21 11:02:15 -07:00
Sagar Ghuge	06ab737686	nir: Add optimizations for iadd3 This patch also adds has_iadd3 bit to give more control if backend supports ternary add instruction or not. v2: - Add patterns in late optimization (Connor Abbott) Suggested-by: Alyssa/Jason Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11596>	2021-07-16 15:59:56 +00:00
Jason Ekstrand	624e799cc3	nir: Drop nir_ssa_def::name and nir_register::name We say that they're for debug only but we don't really have a good policy around when to set them and when not to. In particular, nir_lower_system_values and nir_lower_vars_to_ssa which are the chief producers of SSA values which might reasonably have a name do not bother to set one. We have some names set from things like BLORP and RADV's meta shaders but AFAICT, they're setting a name more because it's there than because they actually care. Also, most things other than nir_clone and nir_serialize don't bother to try and preserve them. You can see in the diffstat of this commit exactly what passes attempt to preserve names. Notably missing from the list is opt_algebraic which is the single largest source of SSA def churn and it happily throws names away. These observations lead me to question whether or not names are actually useful at all or if they're just taking up space (8B per instruction) and wasting CPU cycles (to ralloc_strdup on the off chance we do have one). I don't think I can think of a single time in recent history where I've been debugging a shader issue and a SSA value name has been there and been useful. If anything, the few times they are there, they just throw me off because they mess up the indentation in nir_print. iris shader-db on my system gets runtime -2.07734% +/- 1.26933% (n=5) Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5439>	2021-07-08 17:34:41 +00:00
Connor Abbott	cc514bfa0e	nir: Add read_invocation_cond_ir3 intrinsic On qualcomm, we have shared registers similar to SGPR's on AMD. However, there is no readlane or readfirstlane primitive. shared registers can only be written to when just one lane is active. This means that we have to lower readInvocation(val, id) to something like: if (gl_SubgroupInvocation == id) { scalar_reg = val; } return scalar_reg; However it's a bit difficult to actually get the value of gl_SubgroupInvocation in the backend, because for compute it requires some calculations and we don't have any CSE support in the backend. This intrinsic lets us turn it into "readInvocationCond(val, id == gl_SubgroupInvocation)" in NIR at which point the backend code generation is a lot easier. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6752>	2021-07-08 16:02:41 +00:00
Connor Abbott	e4e79de2a4	nir/subgroups: Support > 1 ballot components Qualcomm has a mode with a subgroup size of 128, so just emitting larger integer operations and then lowering them later isn't an option. This makes the pass able to handle the lowering itself, so that we don't have to go down to 64-thread wavefronts when ballots are used. (The GLSL and legacy SPIR-V extensions only support a maximum of 64 threads, but I guess we'll cross that bridge when we come to it...) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6752>	2021-07-08 16:02:41 +00:00
Connor Abbott	90819b9b0e	nir/subgroups: Replace lower_vote_eq_to_ballot with lower_vote_eq Lower it to a vote instead of a ballot. This was only used for AMD, and in that case they're pretty much the same. However Qualcomm has a vote builtin, which we want to use instead of ballots. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6752>	2021-07-08 16:02:41 +00:00
Emma Anholt	4118264643	nir: Free the instructions in a DCE instr removal. No significant change in shader-db time (n=11), but should be a little win for memory usage by the compiler. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11628>	2021-07-06 11:24:48 -07:00
Emma Anholt	5618445d45	nir: Use remove_and_dce for nir_shader_lower_instructions(). Reduces the work that other shader passes have to do to look at dead code, and possibly extra rounds around the optimization loop if dce wasn't the last pass in it. shader-db runtime -1.12919% +/- 0.264337% (n=49) on SKL. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11628>	2021-07-06 11:24:45 -07:00
Emma Anholt	5251548572	nir: Add a nir_instr_remove that recursively removes dead code. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11628>	2021-07-06 11:24:43 -07:00
Rob Clark	c7b935962b	nir: Add pass to lower phi precision In addition to register pressure benefits from getting more fp16/int16, this avoids i2imp's from standing in the way of loop unrolling. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11545>	2021-06-29 23:27:28 +00:00
Emma Anholt	0afab39af9	nir: Add a helper for chasing movs with nir_ssa_scalar(). Sometimes you might want to find a constant source without going through all the copy prop and constant folding to make your source be a load_const. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11613>	2021-06-28 16:26:24 +00:00
Enrico Galli	8a5333c105	nir: Add modes filter to nir_sort_variables Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10989>	2021-06-24 20:05:13 +00:00
Jason Ekstrand	81cb20bd17	nir: Add a function for sorting variables Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10989>	2021-06-24 20:05:13 +00:00
Bas Nieuwenhuizen	8dfb240b1f	nir: Add raytracing shader call lowering pass. Really copying Jason's pass. Changes: - Instead of all the intel lowering introduce rt_{execute_callable,trace_ray,resume} - Add the ability to use scratch intrinsics directly. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10339>	2021-06-21 21:23:51 +00:00
Jason Ekstrand	73188c6954	nir,docs: Add docs for NIR ALU instructions About half or more of the text here is actually from Connor Abbot. I've edited it a bit to bring it up-to-date and make a few things more clear. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11438>	2021-06-21 16:46:59 +00:00
Rhys Perry	ea68d4a676	nir/propagate_invariant: add invariant_prim option Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11035>	2021-06-21 15:13:05 +00:00
Emma Anholt	990c232603	nir: Add an interface for logging shaders with mesa_log*. For debug on Android, it's useful to be able to print shaders to the android log interface, since you don't usually have stdout/stderr. Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9262>	2021-06-18 18:18:35 +00:00
Rhys Perry	1cbcfb8b38	nir, nir/algebraic: add byte/word insertion instructions Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3151>	2021-06-08 08:57:42 +00:00
Caio Marcelo de Oliveira Filho	c8a7bd0dc8	nir: Rename WORK_GROUP (and similar) to WORKGROUP Be consistent with other usages in Vulkan and SPIR-V, and the recently added workgroup_size field. Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11190>	2021-06-07 22:34:42 +00:00
Hoe Hao Cheng	90a5fef85c	nir: define NIR_ALU_MAX_INPUTS Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11172>	2021-06-04 19:33:13 +00:00
Ian Romanick	880b00dc59	nir/lower_tex: Add support for lowering YUYV formats v2: Rebase on `bc438c91d9` ("nir/lower_tex: ignore texture_index if tex_instr has deref src") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9610>	2021-05-21 01:40:22 +00:00
Ian Romanick	1358d93650	nir/lower_tex: Add support for lowering Y41x formats These are similar to AYUV, but the channel ordering is different... in such a way that there's no RGBA format that will make the channels line up right. v2: Rebase on `bc438c91d9` ("nir/lower_tex: ignore texture_index if tex_instr has deref src") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9610>	2021-05-21 01:40:22 +00:00
Jason Ekstrand	b447f5049b	nir: Add a discard optimization pass Many fragment shaders do a discard using relatively little information but still put the discard fairly far down in the shader for no good reason. If the discard is moved higher up, we can possibly avoid doing some or almost all of the work in the shader. When this lets us skip texturing operations, it's an especially high win. One of the biggest offenders here is DXVK. The D3D APIs have different rules for discards than OpenGL and Vulkan. One effective way (which is what DXVK uses) to implement DX behavior on top of GL or Vulkan is to wait until the very end of the shader to discard. This ends up in the pessimal case where we always do all of the work before discarding. This pass helps some DXVK shaders significantly. v2 (Jason Ekstrand): - Fix a couple of typos (Grazvydas, Ian) - Use the new nir_instr_move helper - Find all movable discards before moving anything so we don't accidentally re-order anything and break dependencies v3 (Pierre-Eric): remove the call to nir_opt_conditional_discard based on Daniel Schürmann comment. v4 (Pierre-Eric): - handle demote intrinsics and drop derivatives_safe_after_discard - add early return if discards/demotes aren't used v5 (Pierre-Eric): - use pass_flags instead of instr set (Daniel Schürmann) v6 (Daniel Schürmann): - cleanup and fix pass_flags handling Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10522>	2021-05-19 18:04:44 +00:00
Jason Ekstrand	3033410b10	nir/gather_info: Expose a nir_intrinsic_writes_external_memory helper Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10522>	2021-05-19 18:04:44 +00:00
Jason Ekstrand	f97fb1fa55	nir: Add a nir_instr_move helper Removes an instruction from one place and inserts it at another while working around a weird cursor corner-case. v2: change return value to bool (Daniel Schürmann) Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> (v1) Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10522>	2021-05-19 18:04:44 +00:00
Connor Abbott	a40714abf7	nir/lower_phis_to_scalar: Add "lower_all" option We don't want to have to deal with vector phis in freedreno, because vectors are always split/unsplit around vectorized instructions anyways, and the stated reason for not scalarising them (it hurting coalescing) won't apply to us because we won't be using nir_from_ssa. Add this option so that we don't have to do the equivalent thing while translating from NIR. Reviewed-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10809>	2021-05-17 09:59:45 +00:00
Gert Wollny	4c045ad11e	nir/linker: add option to ignore the IO precisions for better varying packing Backends that don't handle IO component precision can pack more varyings into one slot if the linker ignores the precision. If the IO is vectorized then this can save IO instructions. Related: `165a69d2f7` nir: handle mediump varyings in varying compaction helpers Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10722>	2021-05-15 09:58:27 +02:00
Caio Marcelo de Oliveira Filho	09984fd02f	nir: Rename nir_is_per_vertex_io to nir_is_arrayed_io VS outputs are "per vertex" but not the kind of I/O we want to match with this helper. Change to a name that covers the "arrayness" required by the type. Name inspired by the GLSL spec definition of arrayed I/O. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10493>	2021-05-14 16:17:45 +00:00
Gert Wollny	b4600d9352	nir: Add filter callback for lower_to_scalar to the options Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9943>	2021-05-07 12:09:03 +00:00
Iago Toral Quiroga	aebb47b7d1	compiler/nir: add a divergence analysis option for non-uniform workgroup id The V3D hardware allows us to pack multiple workgroups together to avoid wasting execution lanes in shader cores. For example, if we dispatch 16 workgroups with a local size of 1 element, we can pack all 16 workgroups in a single 16-wide dispatch where each lane executes a different workgroup, instead of 16 1-wide dispatches. When we do this, we don't have a uniform workgroup id any more. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10541>	2021-05-04 15:53:23 +00:00
Rhys Perry	7a7838529a	nir/lower_non_uniform: allow lowering with vec2 handles Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9523>	2021-04-27 15:56:07 +00:00
Jesse Natalie	fa677c8644	nir_lower_readonly_images_to_tex: Support non-CL semantics For non-CL, intrinsic access isn't set, because the image type doesn't have access qualifier. Instead, the access qualifier is set on the variable. So, add a mode to this pass which can chase back to the variable in addition to the intrinsic access. Also, update the variable type and the deref chain types so everything is consistent, that the tex is accessing a sampler. Note we can't do this for CL, because void-typed samplers don't exist. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10356>	2021-04-23 23:16:15 +00:00
Jesse Natalie	29c9731400	nir: Rename nir_lower_cl_images_to_tex, replace 'cl' with 'readonly' Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10356>	2021-04-23 23:16:15 +00:00
Alyssa Rosenzweig	c84804f167	nir/lower_fragcolor: Take max cbufs as argument One step closer to generalizing this pass to more drivers. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10411>	2021-04-23 17:20:43 +00:00
Danylo Piliaiev	f17b41ab4f	nir: add lowering pass for helperInvocationEXT() Some hardware doesn't have a way to check if invocation was demoted, in such case we have to track it ourselves. OpIsHelperInvocationEXT is specified as: "An invocation is currently a helper invocation if it was originally invoked as a helper invocation or if it has been demoted to a helper invocation by OpDemoteToHelperInvocationEXT." Therefore we: - Set gl_IsHelperInvocationEXT = gl_HelperInvocation - Add "gl_IsHelperInvocationEXT = true" right before each demote - Add "gl_IsHelperInvocationEXT = gl_IsHelperInvocationEXT \|\| condition" right before each demote_if Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9460>	2021-04-19 17:11:36 +00:00
Eric Anholt	5de3cbbb2e	nir: Generate load_ubo_vec4 directly for !PIPE_CAP_NATIVE_INTEGERS The prog_to_nir->NIR-to-TGSI change ended up causing regressions on r300, and svga against r300-class hardware, because nir_lower_uniforms_to_ubo() introduced shifts that nir_lower_ubo_vec4() tried to reverse, but that NIR couldn't prove are no-ops (since shifting up and back down may drop bits), and the hardware can't do the integer ops. Instead, make it so that nir_lower_uniforms_to_ubo can generate nir_intrinsic_load_ubo_vec4 directly for !INTEGER hardware. Fixes: `cf3fc79cd0` ("st/mesa: Replace mesa_to_tgsi() with prog_to_nir() and nir_to_tgsi().") Closes: #4602 Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10194>	2021-04-16 21:58:00 +00:00
Alyssa Rosenzweig	5d32cf642f	nir: Add varying precision linking helper (v2) It is useful for the precisions of varyings to match across shader stages at link-time to enable precision lowering optimizations, which would otherwise require costly draw-time fixups. The goal is to enable `producer->precision == consumer->precision` to be an invariant drivers may rely on for linked shaders. v2: keep transform feedback outputs at mediump - mareko Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> (v1) Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9050>	2021-04-13 05:07:42 +00:00
Marek Olšák	fb29cef8dd	nir: add many passes that lower and optimize 16-bit input/outputs and samplers Added: * a pass that renumbers bases of IO intrinsics * a pass that converts mediump IO to 16 bits, optionally using the new packed varying slots * a pass that sets (forces) mediump in IO intrinsics (for testing) * a pass that remaps VARYING_SLOT_VAR[0..15]_16BIT to VARYING_SLOT_VAR[0..31] (if some shader stages don't want packed varyings) * a pass that folds type conversions around texture opcodes into those opcodes (e.g. tex(f2f32(coord), ..) is changed into tex accepting f16) * a pass that changes (legalizes) sampler src and dst types based on specified hw constraints (e.g. derivatives must be the same type as coordinates) Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9050>	2021-04-13 05:07:42 +00:00
Marek Olšák	73f532e5bf	nir: add new VARYING_SLOTs and shader info for packed 16-bit varyings This allows mediump inputs and outputs to be trivially lowered into packed 16-bit varyings where 1 slot is occupied by 2 16-bit vec4s, without any packing instructions in NIR and without any conflicts with 32-bit varyings. The only thing that is changed is IO semantics in intrinsics to get packed 16-bit varyings. This simplifies supporting 16-bit types for drivers that have 32-bit slots everywhere except the fragment shader where they can do 16-bit interpolation on either the low or high half of each slot. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9050>	2021-04-13 05:07:42 +00:00
Rhys Perry	7050896be0	nir: add nir_block_get_predecessors_sorted() helper Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3364>	2021-04-12 18:17:19 +00:00
Rhys Perry	a2619b97f5	nir/lower_idiv: add options to use fp32 for 8-bit division lowering Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10081>	2021-04-12 16:19:46 +00:00
Bas Nieuwenhuizen	4ca4de50f7	nir: Remove nir_shader->shared_size. The same info is in shader_info. Dedupe. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10094>	2021-04-08 14:39:28 +00:00
Danylo Piliaiev	2bff8fd53b	nir: add nir_shader_as_str function It would be later used by Turnip in implementation of VK_KHR_pipeline_executable_properties. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8877>	2021-03-25 13:53:33 +00:00
Mike Blumenkrantz	6900498faa	nir: add nir_lower_indirect_builtin_uniform_derefs() this is a special version of indirect deref lowering which is used by mesa/st to remove dynamic indexing from builtin uniforms for the lowering pass in non-packed uniform case Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9741>	2021-03-23 14:44:48 +00:00
Gert Wollny	0f5b3c37c5	nir: Add opcodes for fused comp + csel and optimizations Some backends, like r600 support a fused version of int and float compare against zero and and csel. Adding these opcodes here makes it possible to optimize this in nir. v2: Add rules for float compare + csel Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9452>	2021-03-22 15:19:46 +01:00
Gert Wollny	a5747f8ab3	nir: add opcodes for find_msb_rev and lowering Some hardware supports a version of find_msb where the bits are counted starting at the high bit, and this needs some lowering to obtain the value that is expected by find_msb Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9452>	2021-03-22 15:19:46 +01:00
Dave Airlie	01dfd65a2d	nir: port fp16 casting code from dxil This moves the dxil pass to common code and makes dxil use the new code. Acked-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9643>	2021-03-22 12:16:59 +10:00

1 2 3 4 5 ...

831 Commits