KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Dave Airlie	37d6ce4ebb	llvmpipe: swizzle image stores for CL BGRA OpenCL requires image stores to BGRA to work, so add the swizzle code here. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13404>	2021-10-19 03:11:08 +00:00
Dave Airlie	1d48022dab	gallivm/format: clamp SINT conversion rather than truncate. KHR-GL32.packed_pixels.pbo_rectangle.r16i on zink on lavapipe ends up using a pbo that does an SINT image write. This was producing truncated rather than clamped values. Fix the calculations for 8/16-bit signed ints to clamp not truncate. Fixes: `13e5f331db` ("gallivm/nir: fix image store conversions") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13187>	2021-10-05 23:11:57 +00:00
Pavel Asyutchenko	7ffb152276	llvmpipe: fix crash when doing FB fetch + gl_FragDepth write in one shader Reproducible by piglit test from this MR: https://gitlab.freedesktop.org/mesa/piglit/-/merge_requests/576 Signed-off-by: Pavel Asyutchenko <sventeam@yandex.ru> Reviewed-by: Dave Airlie <airlied@redhat.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12705>	2021-10-05 20:45:38 +00:00
Dave Airlie	b9aee98912	gallivm: use pmulhrsw to make aos sampling more accurate. This uses pmulhrsw avx2 and ssse3 variants. It fixes the precision of texture filtering calculations. However it does leave these paths inaccurate on platforms that don't support it. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13075>	2021-09-29 21:55:23 +00:00
Dave Airlie	6f6f7d2680	gallivm/nir: fix subgroup invocation read. Again if you get passed an invoc but the exec mask has the active lane somewhere other than at 0, then if we have an invoc we should find the active lane and extract the value from invoc rather than using the idx. This fixes a bunch of VK 1.2 subgroup tests once 1.2 is enabled: dEQP-VK.subgroups.ballot_broadcast.compute.subgroupbroadcast_nonconst* Reviewed-by: Roland Scheidegger <sroland@vmware.com> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12953>	2021-09-27 16:35:45 +10:00
Dave Airlie	143167f2a0	gallivm/nir: handle subgroup reduction across all types Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11816>	2021-09-16 04:15:41 +00:00
Dave Airlie	f814a2449e	llvmpipe: enable FP16 and update CL + traces piglit results. The fails will be addressed later. This adds a fail in GLSL compiler that is due to a workaround that fails when fp16 constants are lowered Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11816>	2021-09-16 04:15:41 +00:00
Dave Airlie	0d3b285360	gallivm: use llvm intrinsics for 16-bit round/trunc/roundeven Otherwise the inf translations don't seem to work, and the VK CTS fails Fixes VK CTS dEQP-VK.spirv_assembly.instruction.graphics.float16.arithmetic* Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11816>	2021-09-16 04:15:41 +00:00
Dave Airlie	2277386565	gallivm: increase tgsi nesting call stack size Some VK CTS tests are topping this out around 76, increase it to 80 for now. Fixes: dEQP-VK.spirv_assembly.instruction.graphics.float16.opvectorshuffle.44 Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11816>	2021-09-16 04:15:41 +00:00
Dave Airlie	c118888f92	gallivm/nir: pass the correct float builder to ddx/y Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11816>	2021-09-16 04:15:41 +00:00
Dave Airlie	bceae73b3f	gallivm/nir: call pow with correct flt builder Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11816>	2021-09-16 04:15:41 +00:00
Dave Airlie	836b0ace10	gallivm/nir: handle 16-bit exp/lod using intrinsics. This just passes the 16-bit float versions to the llvm intrinsics Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11816>	2021-09-16 04:15:41 +00:00
Dave Airlie	6decb1b896	gallivm: add 16-bit sin/cos via llvm intrinsic Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11816>	2021-09-16 04:15:41 +00:00
Dave Airlie	af49f9697a	gallivvm/nir: handle non-32bit mask scatter stores Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11816>	2021-09-16 04:15:41 +00:00
Dave Airlie	0d4f17fe1f	gallivm/nir: fix f2b32 Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11816>	2021-09-16 04:15:41 +00:00
Dave Airlie	0776628d1d	gallivm/nir: handle conversion to 16-bit texel fetch Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11816>	2021-09-16 04:15:41 +00:00
Dave Airlie	c396067366	gallivm: add initial support for 16-bit float builder. This is an initial patch that is needed for OpenCL and Vulkan support for proper 16-bit floats. This doesn't enable the cap bit yet Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11816>	2021-09-16 04:15:41 +00:00
Dave Airlie	259e26e5e3	llvmpipe/cs: rework coroutine context handling (v2) Get comfy. llvmpipe coroutines have a stack frame. This is created by hooking in malloc and coro.alloc and coro.size intrinsics. LLVM has an CoroElide pass that is meant to allow that stack frame to be done as an alloca in the caller instead of using the malloc path. The CoroElide pass relies on the coroutine being inlined (fixed that). The CoroElide pass relies on there being a direct connect between coro.destroy(i8 *arg) and arg = coro.begin(id). However due to the way the compute shaders are launched, there is no way to ensure that link. Fixing the CoroElide pass seems quite difficult, I considered having a force CoroElide always flag to make it dtrt, however I'm not sure how ugly that would end up. My first attempt tried to preallocate the stacks at a fixed size, this turned out to be naive as the stack frame size was not sized like I expected. Instead the first coro to run allocs enough for everyone, so avoid the massive amounts of small allocations. This remove coro malloc from a lot of profiles and shaves another 30s or so from OpenCL ./conversions/test_conversions uchar_uin (from 4.40m to just under 4m on my ryzen 7 1800x) Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12432>	2021-09-16 13:21:34 +10:00
Dave Airlie	4ccee031e9	gallivm/coro: use a phi instead of alloca this just matches what the docs recommend Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12432>	2021-09-16 13:21:27 +10:00
Dave Airlie	fc0bf57632	gallivm/ssbo: cast ssbo index to int type. Since these can be loaded from ubos or other places now. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12689>	2021-09-16 08:05:56 +10:00
Dave Airlie	1ccac4abff	gallivm/ssbo: fix up dynamic indexed ssbo load/stores/atomics Although the index has to be dynamically uniform, if we don't ever execute a few lanes then we'll have 0, so it important to read the ssbo index from the first active lane. Just loop over them all. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12689>	2021-09-16 08:05:51 +10:00
Erik Faye-Lund	45a61f1782	gallivm: fix texture-mapping with 16-bit result 16bit integer support also implies using 16-bit results when sampling textures. Because we're returning the results in float SSA values instead of int, we need to bitcast back to integers before truncating the values. Fixes: `00ff60f799` ("gallivm: add 16-bit integer support") Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12413>	2021-08-18 07:55:34 +00:00
Dave Airlie	ff99270923	gallivm: fix non-32 bit popcounts. Fixes OpenCL CTS integer_ops popcount Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12285>	2021-08-16 23:20:00 +00:00
Dave Airlie	9922ea7e66	gallivm: fix idiv/irem for 8/16/64-bit and 32-bit INT_MIN/-1 This fixes integer division for non-32bit but also fixes the 32-bit case where INT_MIN/-1 causes an exception. Fixes CL CTS ./integer_ops/test_integer_ops quick_long_math Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12285>	2021-08-16 23:20:00 +00:00
Dave Airlie	c3bede9c96	gallivm: don't lower local invocation index in frontend The frontend can't handle variable block sizes properly, so just lower it here in the backend. Fixes CTS basic local_linear_id + get_linear_ids Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12285>	2021-08-16 23:20:00 +00:00
Erik Faye-Lund	2f06642b06	gallivm: remove code to force nearest s/t interpolation These two bits were added in 2012, but never got wired up. Let's cut our losses, and remove them again. 9 years unused seems sufficient. While we're at it, remove reduction_mode from the hacks-section, because this isn't a hack at all, rather normal state. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12240>	2021-08-09 06:42:59 +00:00
Dave Airlie	78b4e417d4	gallivm: handle fisfinite/fisnormal lower one, do the other. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12207>	2021-08-06 14:28:10 +10:00
Erik Faye-Lund	08a12feb6e	gallivm: use lp_build_log2_safe for pow lp_build_log2 isn't robust enough to handle special cases for pow, so let's use lp_build_log2_safe instead. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11800>	2021-08-05 17:18:57 +00:00
Erik Faye-Lund	f395057a82	gallivm: remove pointless no_filter_hacks flag This flag now does the same thing as the no_quad_lod flag, so let's get rid of it to reduce confusion. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12125>	2021-08-03 12:08:38 +00:00
Erik Faye-Lund	34423391e0	gallivm: make rho-approximation opt-in instead of opt-out The RHO approximation code has issues passing the OpenGL ES 3.0 CTS, so let's make correctness the default instead. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12125>	2021-08-03 12:08:38 +00:00
Erik Faye-Lund	175bc33da7	gallivm: let nir_lower_tex handle projectors There's no need for us to do this lowering ourselves while emitting code, when there's already a helper that can do this for us that we're even using. Let's just set the right flag, and not worry about projectors any more. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12126>	2021-07-30 11:19:50 +00:00
Emma Anholt	d413fd0219	gallivm: Always take the per-pixel LOD path for cubemaps. The VK spec requires that derivatives are mapped to the face, and given that the face is per pixel that implies that we need per-pixel derivatives. This also seems to match hardware behavior. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10219>	2021-07-28 23:38:38 +00:00
Emma Anholt	371cf5d06a	gallivm: Default brilinear filtering to off. It's non-conformant for GL and Vulkan, and not expected to be a desired quality/speed tradeoff for applications using llvmpipe. The option is left in place using GALLIVM_PERF=brilinear. Causes minor rasterization changes in our traces: - improved sharpness of the trees in CS - more consistent mipmap filtering of the ground in CS:Source, STK and the RaytracedShadows demo. - changing some aliasing on shadow maps in 0ad (not consistently visibly better or worse) Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10219>	2021-07-28 23:38:38 +00:00
suijingfeng	44aa7e25dd	llvmpipe: correct the debug information printed with GALLIVM_PERF=nopt GALLIVM_PERF_NO_OPT is "belond" to GALLIVM_PERF environment variable, not GALLIVM_DEBUG. when GALLIVM_PERF=nopt is passed to llvmpipe, "-sroa -early-cse -simplifycfg -reassociate -constprop -instcombine -gvn" should not be printed. Those llvm optimation techniques should only be printed when the default(-O2) optimation is enabled. $ GALLIVM_PERF=nopt GALLIVM_DEBUG=dumpbc glmark2 Before apply this patch: Invoke as "opt -sroa -early-cse -simplifycfg -reassociate -mem2reg -constprop -instcombine -gvn ir_setup_variant_0.bc \| llc -O2 [-mcpu=<-mcpu option>] [-mattr=<-mattr option(s)>]" After apply this patch: Invoke as "opt -mem2reg ir_fs304_variant0.bc \| llc -O0 [-mcpu=<-mcpu option>] [-mattr=<-mattr option(s)>]" Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Signed-off-by: suijingfeng <suijingfeng@loongson.cn> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11994>	2021-07-27 19:57:21 +00:00
Roland Scheidegger	cac5711d43	llvmpipe: fix nir dot products (fsum op) When the dot product uses a source which can be optimized to a scalar, after a bunch of nir optimization steps the source to fsum will be a scalar with a x replicate swizzle. Hence nir_src_num_components is just 1 and the fsum was just a no-op which is not correct. Arguably this could be optimized a bit better, but just determine the number of addends by using nir_op_infos instead (the operand fetch was fixed already by `39a938ecf4` doing the same). Fixes: `4eb0475b5a` ("gallivm/nir: add fsum support") Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12066>	2021-07-27 00:50:55 +00:00
Jason Ekstrand	74ec2b12be	nir/lower_tex: Rework invalid implicit LOD lowering Only fragment and some compute shaders support implicit derivatives. They're totally meaningless without helper invocations and some understanding of the dispatch pattern. We've got code to lower nir_texop_tex in these shader stages to use an explicit derivative of 0 but it was pretty badly broken: 1. It only handled nir_texop_tex, not nir_texop_txb or nir_texop_lod. 2. It didn't take min_lod into account 3. It was conflated with adding a missing LOD parameter to opcodes which expect one such as nir_texop_txf. While not really a bug, this does make it way harder to reason about the code. 4. Unless you set a flag (which most drivers don't), it left the opcode nir_texop_tex instead of nir_texop_txl which it should have been. This reworks it to go through roughly the same path as other LOD lowering only with a constant lod of 0 instead of calling out to nir_texop_lod. We also get rid of the lower_tex_without_implicit_lod flag because most drivers set it and those that don't are probably subtly broken. If someone really wants to get nir_texop_tex in their vertex shaders, they can write a new patch to add the flag back in. Fixes: `e382890e25` "nir: set default lod to texture opcodes that..." Fixes: `d5ac5d6e83` "nir: Add option to lower tex to txl when..." Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11775>	2021-07-23 15:53:57 +00:00
Dave Airlie	1087cf680a	gallivm/img: use uint for image coord builder. VK CTS has a test that does -256 for sample index but since we use signed >= we fail and write to random memory, changing to uint here for comparisons for out of bounds and address calcs should ensure, we stay within bounds even for negative vals. Remove unused int_type/bld while here. Fixes dEQP-VK.texture.multisample.invalid_sample_index.sample_count_4 Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11913>	2021-07-22 20:47:48 +00:00
Kai Wasserbäch	3a2d317b99	gallivm: fix FTBFS on i386 with LLVM >= 13, StackAlignmentOverride is gone Use the newly added wrapper lp_set_module_stack_alignment_override() to achieve the same end result. Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Closes: mesa/mesa#4906 Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11940>	2021-07-22 17:30:42 +00:00
Kai Wasserbäch	c1b4c64a28	gallivm: add new wrapper around Module::setOverrideStackAlignment() We need it in the next commit to replace setting the stack alignment on i386 with LLVM >= 13 through the TargetOption::StackAlignmentOverride, which was removed in the upstream commit <`3787ee4571`>. Unfortunately Module::setOverrideStackAlignment() is not available through the C API and we need to wrap it ourselves. Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reference: mesa/mesa#4906 Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11940>	2021-07-22 17:30:42 +00:00
Jose Fonseca	5a1d3bcf26	llvmpipe: Add a linear rasterizer optimized for 2D rendering. This change adds: - an alternative rasterizer, which rasterizes bins in a left->right & top->bottom linear fashion; - triangle -> rectangle detection; - 1:1 blit detection; - a special TGSI -> LLVM IR code generation that uses 8-bit SSE integers in AoS fashion (as opposed to 32bits floats.) Altogether these changes yield a 2x to 3x performance improvement for 2D workloads. It was designed to render Windows 7 Aero and other Windows built-in 3D applications (like Windows Media Player, Internet Explorer 11, UWP applications) with minimum CPU utilization, but it should be generally applicable to other 2D-on-3D applications, like desktop compositors, HTML browsers, 3D based UI toolkits, etc. This was mostly the brainchild of Keith Whitwell back in 2010. I wrote TGSI -> AoS translation. And many others added bug-fixes and enhancements over the years: Roland Scheidegger, Brian Paul, and James Benton. Known issues: - piglit spec@!opengl 1.1@quad-invariance will warn that "left and right half should match" due to rounding error difference - These optimized paths to kick in is that depth-buffer must not be used, so some applications which want to benefit from these improvements might need to be modified to ensure they use painter's algorithm instead of depth-buffers. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com> Acked-by: Keith Whitwell <keithw@vmware.com> v2: Incorporate Dave Airlie feedback: cleanup LP_DEBUG_xx; shrink 3+ empty lines. v3: silence unused var warning, adapt to new upstream code (point setup) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11969>	2021-07-21 22:40:18 +02:00
suijingfeng	7c81a9d56f	gallivm: fix pass init order on mips64 with llvm 8 llvm 8 has some missing pass dependencies, fix the mips64 case as well. See similiar fix: [1] `f59ff014b1` [2] https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3805 Reviewed-by: Adam Jackson <ajax@redhat.com> Signed-off-by: suijingfeng <suijingfeng@loongson.cn> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11955>	2021-07-21 13:14:05 +00:00
suijingfeng	88b234d7a7	gallivm: add basic mips64 support and set mcpu to mips64r5 on ls3a4000 ls3a4000 and ls2k1000 cpu is mips64r5 compatible with MSA SIMD instruction set implemented, while ls3a3000 is mips64r2 compatible only. Due to lacking llvm support for loongson CPU, llvm::sys::getHostCPUName(). return "generic" on all loongson mips CPU. So we override the MCPU to mips64r5 if MSA is implemented, feedback to mips64r2 for all other ordinaries. Reviewed-by: Adam Jackson <ajax@redhat.com> Signed-off-by: suijingfeng <suijingfeng@loongson.cn> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11955>	2021-07-21 13:14:05 +00:00
Dave Airlie	ce2b711c0a	gallivm: add support for anisotropic sampling. This is a port of the softpipe anisotropic filtering to llvmpipe. It should produce pretty similiar results. This contains the proposed fix to the softpipe calculating dq after scaling. It also contains a number of other fixes around vector lengths etc caught during test. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8804>	2021-07-21 10:19:51 +10:00
Dave Airlie	848d4e7e43	gallivm: add anisotropic filter weight table. The anisotropic filtering needs access to a table of weights, to make the calculations easier. This routes the table through from shader parameter and makes it available for the sampler code. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8804>	2021-07-21 10:19:51 +10:00
Connor Abbott	e4e79de2a4	nir/subgroups: Support > 1 ballot components Qualcomm has a mode with a subgroup size of 128, so just emitting larger integer operations and then lowering them later isn't an option. This makes the pass able to handle the lowering itself, so that we don't have to go down to 64-thread wavefronts when ballots are used. (The GLSL and legacy SPIR-V extensions only support a maximum of 64 threads, but I guess we'll cross that bridge when we come to it...) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6752>	2021-07-08 16:02:41 +00:00
Jason Ekstrand	f0f713960b	nir,amd: Suffix nir_op_cube_face_coord/index with _amd Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11463>	2021-06-21 09:03:34 -05:00
Michel Zou	fe625241f7	llvmpipe: restrict optim bug workaround to gcc 10.x seems fixed in 11.x, see https://gitlab.freedesktop.org/mesa/mesa/-/issues/3906 Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11209>	2021-06-09 16:19:16 +00:00
Caio Marcelo de Oliveira Filho	c8a7bd0dc8	nir: Rename WORK_GROUP (and similar) to WORKGROUP Be consistent with other usages in Vulkan and SPIR-V, and the recently added workgroup_size field. Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11190>	2021-06-07 22:34:42 +00:00
Caio Marcelo de Oliveira Filho	a71a780598	nir: Rename nir_intrinsic_load_local_group_size to nir_intrinsic_load_workgroup_size Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11190>	2021-06-07 22:34:42 +00:00
Mike Blumenkrantz	5275ab1bf3	gallivm: fix oob imageLoad with formats that have <4 components it's expected that these loads will have full alpha fixes spec@arb_shader_image_load_store@invalid Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11049>	2021-05-28 20:46:14 +00:00
Timur Kristóf	b778564c76	gallivm: Fix a few uninitialized variable warnings. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10806>	2021-05-20 17:11:22 +00:00
Adam Jackson	37e1526479	gallivm: Fix a signature mismatch warning gcc 11 says: ../src/gallium/auxiliary/gallivm/lp_bld_format_soa.c:84:49: warning: argument 3 of type ‘struct LLVMOpaqueValue * const’ declared as a pointer [-Warray-parameter=] 84 \| const LLVMValueRef unswizzled, \| ~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~ In file included from ../src/gallium/auxiliary/gallivm/lp_bld_format_soa.c:42: ../src/gallium/auxiliary/gallivm/lp_bld_format.h:126:48: note: previously declared as an array ‘struct LLVMOpaqueValue * const[4]’ 126 \| const LLVMValueRef unswizzled[4], \| ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~ Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10738>	2021-05-11 09:14:30 -04:00
Erik Faye-Lund	39a938ecf4	lavapipe: fix fsum with swizzle We can do stuff like this: vec1 32 ssa_207 = fsum3 ssa_209.xxx In this case, we'd end up not swizzling in get_alu_src, and reading components out-of-bounds, which LLVM isn't very happy about, and thus takes punitive actions, in the form of a segfault. We don't want that, and we already know from the opcode what the component counts should be here. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10690>	2021-05-07 18:48:28 +00:00
Dave Airlie	83a05caaf2	gallivm: handle texture arrays in non-fragment shaders with lod. We have to unwind the lod into the scalar path correctly. Fixes a crash with renderdoc demo Fixes: `e168d148d7` ("gallivm/nir: handle non-uniform texture offsets") Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10676>	2021-05-07 02:41:29 +00:00
Jesse Natalie	9410eb7e39	llvmpipe: Fix optimization loop to actually loop Reviewed-by: Dave Airlie <airlied@redhat.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10549>	2021-05-05 22:06:42 +00:00
Ian Romanick	2d5b64818f	gallivm: Remove unused GALLIVM_NAN_RETURN_NAN In the review, Roland says, "I think the unused nan behaviors was there just for completeness, so it can easily go." Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10532>	2021-05-04 00:13:34 +00:00
Ian Romanick	61624934f6	gallivm: Use GALLIVM_NAN_RETURN_OTHER_SECOND_NONNAN for norm clamping Since the second source is always a constant that is known to be a number, this should have the same performance as GALLIVM_NAN_BEHAVIOR_UNDEFINED. A lofty goal is to eventually remove GALLIVM_NAN_BEHAVIOR_UNDEFINED. There's still a lot of (mostly implicit) users, and I don't feel like tackling that right now. :) Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10532>	2021-05-04 00:13:34 +00:00
Ian Romanick	aaeff52bbe	gallivm: Use range analysis to generate better fmin and fmax code If it is known that one of the source must be a number, then the (more efficient) GALLIVM_NAN_RETURN_OTHER_SECOND_NONNAN path can be used. v2: s/know to be/known to be/. Noticed by Roland. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10532>	2021-05-04 00:13:34 +00:00
Ian Romanick	b3f3287eac	gallivm: Fix NaN behavior of min and max Like softpipe in mesa!10419, llvmpipe suffers from improper handling of NaN in nir_op_fmax and nir_op_fmin. nir_op_fsat is already handled correctly. OpenCL strictly requires the "NaN cleansing" behavior, so all of the functionality is in place. Just make the graphics APIs use the OpenCL path. The majority of the possible performance penalty incurred here should be resolved in the next commit. v2: Add updated checksum for bgfx/39-assao.rdc trace. Rendering goes from mostly garbage to looking correct to me. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10532>	2021-05-04 00:13:34 +00:00
Erik Faye-Lund	00ff60f799	gallivm: add 16-bit integer support The new failure here is due to a bug in the test-case, and a fix has been submitted here: https://gitlab.freedesktop.org/mesa/piglit/-/merge_requests/502 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10185>	2021-04-20 07:49:21 +00:00
Erik Faye-Lund	1c82c05d58	gallivm: run nir_opt_algebraic_late Without this, we're left with a bunch of medium-precision residue, like i2imp. And to avoid that the lowering intrudces left-over instructions, also run other lowering passes, similar to what other drivers do. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10185>	2021-04-20 07:49:21 +00:00
Erik Faye-Lund	da5f346e6c	gallivm: handle 16-bit input in i2b32 Fixes: `dac8cb981f` ("gallivm/nir: allow 8/16-bit conversion and comparison.") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10185>	2021-04-20 07:49:21 +00:00
Michel Zou	54deb1010f	gallium: fix uninitialized variable warning Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10276>	2021-04-19 22:46:30 +00:00
Michel Dänzer	9ef5372441	Guard FALLTHROUGH annotations after assert() clang warns if it can determine that the assert() never returns and there's a fall-through annotation below. Acked-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10220>	2021-04-15 16:01:22 +00:00
Michel Dänzer	2928c21eb7	Convert most remaining free-form fall-through comments to FALLTHROUGH One exception is src/amd/addrlib/, for which -Wimplicit-fallthrough is explicitly disabled. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10220>	2021-04-15 16:01:22 +00:00
Marek Olšák	d7250b0b08	gallium: add PIPE_SHADER_CAP_FP16_CONST_BUFFERS for FP16 uniforms Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9050>	2021-04-13 05:07:42 +00:00
Dave Airlie	bf6fd9a2eb	gallivm: add subgroup lowering support Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9645>	2021-04-07 19:56:17 +00:00
Dave Airlie	8dc398848a	gallivm: add subgroup read invocation support Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9645>	2021-04-07 19:56:17 +00:00
Dave Airlie	2d6a0a8620	gallivm: add subgroup ballot support Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9645>	2021-04-07 19:56:17 +00:00
Dave Airlie	7b3073ad44	gallivm: add subgroup reduction + in/ex scan support Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9645>	2021-04-07 19:56:17 +00:00
Dave Airlie	2e1266d1fe	gallivm: add subgroup elect intrinsic support. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9645>	2021-04-07 19:56:17 +00:00
Dave Airlie	52415cd39e	gallivm: add subgroup system values support This just adds support for retrieving subgroup system values. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9645>	2021-04-07 19:56:17 +00:00
Dave Airlie	1b8145eee9	gallivm: move get_flt_bld to header. Rewrite a little as we have to add 16-bit support later in life Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9645>	2021-04-07 19:56:17 +00:00
Dave Airlie	2e23108274	gallivm: add subgroup vote 64-bit and feq support. These are both required for vulkan subgroup support. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9645>	2021-04-07 19:56:17 +00:00
Charmaine Lee	a442e3ff55	gallivm: increase size of texture target enum bitfield Need to bump up the size of texture target bitfield for MSVC. Fixes: `0ce7c4a7c9` ("gallivm: Use the proper enum for the texture target bitfield.") Reviewed-by: Neha Bhende <bhenden@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9928>	2021-03-31 18:18:13 +00:00
Dave Airlie	8c1d5fcb7c	gallivm: add 64-bit atomic global support Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9591>	2021-03-29 20:32:09 +00:00
Dave Airlie	92eafe42af	gallivm: add 64-bit atomic support for ssbo/shared. This just fixes things up to handle bit-size Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9591>	2021-03-29 20:32:09 +00:00
Dave Airlie	b03cfad77a	llvmpipe: fix cube image size query The 3rd coord wasn't being handled properly Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9746>	2021-03-29 08:04:59 +00:00
Dave Airlie	27822a6f0b	gallivm: use fp16 casts lowering Acked-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9643>	2021-03-22 12:17:31 +10:00
Dave Airlie	224069cefd	gallivm/nir: handle bool registers. lowering to 32-bit bools doesn't get rid of register stores, so handle those. Acked-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9643>	2021-03-22 12:16:45 +10:00
Dave Airlie	2a9e98130b	gallivm: fix non-32bit ubo loads 8/16-bit storage requires ubo loads for the smaller types, fix the ubo loading and bounds checking. Acked-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9643>	2021-03-22 12:16:14 +10:00
Dave Airlie	c16f63b343	gallivm: convert packing to uint64 not double Fixes some issues with later inttopotr casting. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9616>	2021-03-16 23:05:10 +00:00
Dave Airlie	6adbf6c86c	llvmpipe: add reduction mode support Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9423>	2021-03-12 16:02:25 +10:00
Dave Airlie	9f0fd85474	gallivm: add support for load_view_index intrinsic This just adds the system value Reviewed-by: Roland Scheidegger <sroland@vmware.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9399>	2021-03-12 05:05:51 +00:00
Dave Airlie	974f2e6c6a	gallivm: mark subpass input attachments as 2d arrays This matters when multiview is enabled. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9399>	2021-03-12 05:05:51 +00:00
Rob Clark	a9618e7c42	util: Add accessor for util_cpu_caps In release builds, there should be no change, but in debug builds the assert will help us catch undefined behavior resulting from using util_cpu_caps before it is initialized. With fix for u_half_test for MSVC from Jesse Natalie squashed in. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9266>	2021-02-26 18:31:19 +00:00
Adam Jackson	ba4b9cc0c7	gallivm: Silence a warning at -Og ../src/gallium/auxiliary/gallivm/lp_bld_sample_soa.c: In function ‘lp_build_sample_image_linear’: ../src/gallium/auxiliary/gallivm/lp_bld_sample_soa.c:1412:24: warning: ‘have_corners’ may be used uninitialized in this function [-Wmaybe-uninitialized] 1412 \| have_corner = LLVMBuildLoad(builder, have_corners, ""); I don't think you can get here and really not have it initialized, but if you _could_ then it would clearly be wrong to do anything, so do nothing instead. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8724>	2021-02-18 20:59:43 +00:00
Adam Jackson	10bcf25766	gallivm: Silence a warning at -Og ../src/gallium/auxiliary/gallivm/lp_bld_arit.c: In function ‘lp_build_round_arch’: ../src/gallium/auxiliary/gallivm/lp_bld_arit.c:2042:7: warning: ‘intrinsic_root’ may be used uninitialized in this function [-Wmaybe-uninitialized] 2042 \| lp_format_intrinsic(intrinsic, sizeof intrinsic, intrinsic_root, bld->vec_type); Can't happen, mark it unreachable. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8724>	2021-02-18 20:59:43 +00:00
Dave Airlie	ae2edc8127	llvmpipe: handle firstvertex for vulkan draw parameters Vulkan defines this a bit differently, so add support for load_first_vertex. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8219>	2021-01-27 23:36:52 +00:00
Eric Anholt	1fc79592b7	gallium/tgsi: Remove support for f64 src modifiers. The tgsi.rst was not very clear but didn't indicate that they were supported, and llvmpipe only supported double abs and not negate. glsl_to_tgsi never generated them, and neither did the new nir_to_tgsi, so just garbage collect it. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8369>	2021-01-13 22:52:10 +00:00
Eric Anholt	0ce7c4a7c9	gallivm: Use the proper enum for the texture target bitfield. This gets you nice output in gdb when trying to figure out why your 2D sample shader is hitting the cube paths. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8118>	2020-12-24 00:18:19 +00:00
Michel Zou	c4342755cc	llvmpipe: work around mingw compiler optimization bug Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3906 Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7914>	2020-12-07 17:11:19 +00:00
Eric Anholt	d90107a200	gallivm: Fix max const buffer count. llvmpipe was reporting 32 max const buffers, while sizing its arrays to 16 according to gallivm's #define. Fixes: `1d35f77228` ("gallivm,llvmpipe,draw: Support multiple constant buffers.") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7695>	2020-12-02 20:43:33 +00:00
Pierre-Eric Pelloux-Prayer	fecbc1e541	gallivm: update fallthrough comments Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7747>	2020-12-01 10:04:41 +01:00
Dave Airlie	f586a8efb7	gallivm: fix float atomic exchange. for atomic exchange floats are valid. Fixes CL CTS test_atomic fails Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7595>	2020-11-16 06:53:20 +10:00
Dave Airlie	2a3fd242b0	gallivm/nir: add fsum support This is needed for lowered dot products, this opcode just sums all the vector elements. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7595>	2020-11-16 06:53:09 +10:00
Dave Airlie	53064ce6b5	gallivm: add float to 8/16 int Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7595>	2020-11-16 06:52:59 +10:00
Jason Ekstrand	9d377c01d0	nir: Make nir_deref_instr::mode a bitfield We rename it to "modes" to make it clear that it may contain more than one mode and adjust all the uses of nir_deref_instr::modes to attempt to handle multiple modes. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	3cc58e6470	nir: Add and use some deref mode helpers NIR derefs currently have exactly one variable mode. This is about to change so we can handle OpenCL generic pointers. In order to transition safely, we need to audit every deref->mode check. This commit adds a set of helpers that provide more nuanced mode checks and converts most of NIR to use them. For simple cases, we add nir_deref_mode_is and nir_deref_mode_is_one_of helpers. These can be used in passes which don't have to bother with generic pointers and just want to know what mode a thing is. If the pass ever encounters generic pointers in a way that this check would be unsafe, it will assert-fail to alert developers that they need to think harder about things and fix the pass. For more complex passes which require a more nuanced understanding of modes, we add nir_deref_mode_may_be and nir_deref_mode_must_be helpers which accurately describe the compiler's best knowledge about the given deref. Unfortunately, we may not be able to exactly identify the mode in a generic pointers scenario so we have to be very careful when we use these. Conversion of these passes is left to later commits. For the case of mass lowering of a particular mode (nir_lower_explicit_io is one good example), we add nir_deref_mode_is_in_set. This is also pretty assert-happy like nir_deref_mode_is but is for a set containment comparison on deref modes where you expect the deref to either be all-in or all-out. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Dave Airlie	621c4f816f	gallivm/nir: handle dvec3/4 inputs properly. This code works but isn't entirely correct, for a dvec3 it would fetch loc 0,1 2,3 4,5 but really each loc only has 4 entries, instead catch this and read loc 0,1 2,3 loc+1 0,1 Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7416>	2020-11-03 21:35:08 +00:00
Dave Airlie	ad3849259a	gallivm: fix f16 quantize. Add the correct flush to 0 behaviour. Fixes: dEQP-VK.spirv_assembly.instruction.compute.opquantize.flush_to_zero Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7416>	2020-11-03 21:35:08 +00:00
Dave Airlie	f7d1460418	gallivm: zero init the temporary register storage. Due to flow control we can end up with random values in here having side effects. This fixes a crash in gtk4-demo. Fixes: `44a6b0107b` ("gallivm: add nir->llvm translation (v2)") Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7327>	2020-10-30 05:30:35 +10:00
Dave Airlie	82512a163b	gallivm: lower flrp for all sizes. This fixes: builtin-float-mix-1.0.generated.cl Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7309>	2020-10-26 11:33:28 +10:00
Dave Airlie	b8a9bd9b93	gallivm: get correct min/max behaviour for kernels. NaN handling for CL is harsher than GLSL Fixes piglit min/max CL tests Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7309>	2020-10-26 11:23:05 +10:00
Dave Airlie	9845c1636c	gallivm: add support for 8/16-bit mul_hi This 32x32 code only needs small tweaks for this case. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7309>	2020-10-26 11:03:51 +10:00
Dave Airlie	4b6d332843	gallivm: handle sub-32 bit masked stores. This is used for register stores < 32 bit size. Fixes: builtin-char-rotate-1.0.generated.cl Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7309>	2020-10-26 10:53:00 +10:00
Dave Airlie	81d8ca70eb	gallivm: add b2i8/b216 support This is part of the fix for builtin-*-rotate-1.0.generated.cl Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7309>	2020-10-26 10:09:58 +10:00
Dave Airlie	0f78ca9d54	gallivm: add 16-bit split/merge support. Fixes piglit load-hi16.cl, load-lo16.cl Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7309>	2020-10-26 10:08:34 +10:00
Dave Airlie	4ecdc5ec4e	gallivm: fix 64->16 f2f16 llvm appears to callout to a library to do 64-bit->16-bit fp trunc, just trunc to 32-bit first to avoid it. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7309>	2020-10-26 09:55:35 +10:00
Dave Airlie	842a53913c	llvmpipe: fix 8/16 bit global stores Fixes: CL vstore_half-float-global* Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7309>	2020-10-26 08:30:04 +10:00
Dave Airlie	fb56fb02a1	gallivm: add load/store scratch support. Scratch space is per-thread space, so allocate the scratch size * vector width, and add a per-thread base offset to each load/store. This is needed for OpenCL private memory space Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7304>	2020-10-25 08:02:06 +10:00
Tapani Pälli	c83d6ffa32	gallivm/nir: handle nir_op_flt in lp_build_nir_llvm Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3663 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7248>	2020-10-22 05:32:25 +00:00
Marek Olšák	f5f0c012ad	gallium/util: remove empty file u_half.h Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6987>	2020-10-06 21:07:11 -04:00
Marek Olšák	b42c6ff6f6	util: remove util_float_to_half and util_half_to_float wrappers Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6987>	2020-10-06 21:07:07 -04:00
Matt Turner	1aac47db69	Revert F16C series (MR 6774) This reverts commit `4fb2eddfdf`. This reverts commit `7a1deb16f8`. This reverts commit `2b6a172343`. This reverts commit `5af81393e4`. This reverts commit `87900afe5b`. A couple of problems were discovered after this series was merged that cause breakage in different configurations: (1) It seems that using -mf16c also enables AVX, leading to SIGILL on platforms that do not support AVX. (2) Since clang only warns about unknown flags, and as I understand it Meson's handling in cc.has_argument() is broken, the F16C code is wrongly enabled when clang is used, even for example on ARM, leading to a compilation error. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3583 Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6969>	2020-10-01 21:08:12 +00:00
Dave Airlie	4c70f1ba2f	gallivm/nir: fix non-32 bit find lsb/msb fixes piglit cl get-global-id Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6954>	2020-10-02 04:17:49 +10:00
Dave Airlie	35b162eb2c	gallivm/nir: make sure to mask global reads. Make the driver only read values for the active lanes, otherwise it can cause unwanted oob accesses that aren't the apps fault. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6954>	2020-10-02 04:17:41 +10:00
Marek Olšák	4fb2eddfdf	gallium/util: remove empty file u_half.h Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6774>	2020-09-30 16:28:24 +00:00
Marek Olšák	2b6a172343	util: remove util_float_to_half and util_half_to_float wrappers Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6774>	2020-09-30 16:28:24 +00:00
Dave Airlie	760ba65733	gallivm/nir: handle non-32-bit mul high Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6928>	2020-09-30 16:54:07 +10:00
Dave Airlie	5d4502c085	gallivm/nir: fix up non 32-bit load stores This fixes a bunch of opencl tests. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6928>	2020-09-30 16:54:04 +10:00
Dave Airlie	2891136050	gallivm: fix 64-bit CL intrinsics. This fixes a bunch of bad casts in piglit tests Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6928>	2020-09-30 16:53:55 +10:00
Dave Airlie	d12cdc9374	gallivm: fix pow(0, y) to be 0 The log2(0) was producing bad results. Fixes: piglit pow tests on llvmpipe. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6907>	2020-09-29 21:51:22 +00:00
Marek Olšák	4cb6a8d2de	gallivm: add support for lowered IO in vertex shaders for Selection/Feedback/RasterPos in st/mesa. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6698>	2020-09-25 21:55:07 +00:00
Jason Ekstrand	9750164c09	nir: Rename get_buffer_size to get_ssbo_size This makes it explicit that this intrinsic is only for SSBOs. For the v3dv driver, we'll be adding a get_ubo_size intrinsic and we want to be able to distinguish between the two. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6812>	2020-09-22 13:34:12 +00:00
Dave Airlie	b0722cb670	gallivm: disable brilinear for lod bias and explicit lod. This allows GL 4.5 CTS to pass in full with no flags, other than that I'm not sure if it's a good or bad idea. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Cc: mesa-stable@lists.freedesktop.org Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6638>	2020-09-10 11:38:19 +10:00
Dave Airlie	4df0eef188	llvmpipe: include gallivm perf flags in shader cache. Otherwise if you set perf flags, then don't set them, they won't take affect. Fixes: `6c0c61cb48` ("llvmpipe: add infrastructure for disk cache support") Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6630>	2020-09-09 09:21:41 +10:00
Roland Scheidegger	1d018bc7fd	gallivm: add InstSimplify pass This is the recommended replacement for the removed ConstantPropagation pass, and llvm now added c binding for it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6585>	2020-09-04 18:19:07 +00:00
Eric Anholt	6e7c006062	gallivm: Report the unsupported intrinsic instead of just assert(0); Tripped over this in rendermode with softpipe NIR. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6567>	2020-09-02 09:58:44 -07:00
Jason Ekstrand	dfa63f2656	llvmpipe: Add support for load_global_constant Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6379>	2020-09-01 20:50:04 +00:00
Marek Olšák	52cac06862	gallivm: fix build on LLVM 12 due to LLVMAddConstantPropagationPass removal Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3465 Cc: 20.1 20.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6531>	2020-09-01 06:59:33 +00:00
Dave Airlie	8f6eb35e09	gallivm/nir: add some f16 support GLSL lowers packhalf2x16 itself, but for SPIRV we don't have that option. For packing when NIR lowers it uses f2f16 and for unpack it needs the casting and f2f32 Fixes: dEQP-VK.glsl.builtin.function.pack_unpack.packhalf2x16* dEQP-VK.glsl.builtin.function.pack_unpack.unpackhalf2x16* Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6381>	2020-08-31 13:20:11 +10:00
Dave Airlie	b31e8460a6	gallivm/nir: allow 64-bit arit ops Fixes: dEQP-VK.glsl.builtin.precision_double.round.* dEQP-VK.glsl.builtin.precision_double.roundeven.* dEQP-VK.glsl.builtin.precision_double.trunc.* Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6381>	2020-08-31 13:20:11 +10:00
Dave Airlie	727bb217be	gallivm: use common code to select texel type This was suggested by Roland, and fixes stencil images. Fixes: dEQP-VK.renderpass.dedicated_allocation.formats.d24_unorm_s8_uint.* dEQP-VK.renderpass.dedicated_allocation.formats.d32_sfloat_s8_uint.* Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6381>	2020-08-31 13:20:11 +10:00
Dave Airlie	6596957641	gallivm/nir: add subpass sampler type support Fixes: dEQP-VK.renderpass.dedicated_allocation.attachment_allocation.input_output.* dEQP-VK.renderpass.dedicated_allocation.formatsinput Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6381>	2020-08-31 13:20:11 +10:00
Dave Airlie	21324c4036	gallivm/nir: lower frexp/ldexp Fixes: dEQP-VK.glsl.builtin.precision.frexp* dEQP-VK.glsl.builtin.precision.ldexp* Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6381>	2020-08-31 13:20:11 +10:00
Dave Airlie	d7aed02bcc	gallivm/nir: fix const compact Since adding support for compacts this crashes with indirects don't get the const address unless it's a const. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6381>	2020-08-31 13:20:11 +10:00
Dave Airlie	96a5ced65f	gallivm/nir: add quantize to f16 support Fixes: dEQP-VK.spirv_assembly.instruction.graphics.opquantize.* Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6381>	2020-08-31 13:20:11 +10:00
Dave Airlie	1d4a560ba2	gallivm/nir: add indirect swizzle output loading support Fixes: dEQP-VK.clipping.user_defined.clip_distance.* Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6381>	2020-08-31 13:20:11 +10:00
Dave Airlie	5177ffed65	gallivm/nir: lower tg4 offsets. Fixes: dEQP-VK.glsl.texture_gather.offsets.* Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6381>	2020-08-31 10:28:30 +10:00
Karol Herbst	e5899c1e88	nir: rename nir_op_fne to nir_op_fneu It was always fneu but naming it fne causes confusion from time to time. So lets rename it. Later we also want to add other unordered and fne, this is a smaller preparation for that. Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6377>	2020-08-21 17:26:21 +00:00
Dave Airlie	3cc5b75577	gallivm/nir: split tex/sampler indirect offsets vulkan has these separate, should be fine for GL as well as the values will be the same anyways. Fixes: dEQP-VK.binding_model.shader_access.secondary_cmd_buf.uniform_texel_buffer.* dEQP-VK.binding_model.descriptorset_random.sets4.noarray.ubolimitlow.sbolimitlow.sampledimglow.outimgtexlow.noiub.uab.vert.noia* dEQP-VK.binding_model.descriptor_copy.graphics.uniform_texel_buffer.* Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6339>	2020-08-19 10:16:40 +10:00
Dave Airlie	e616223024	gallivm/nir: fixup compact TCS variable storage. This fixes a lot of tessellation shaders, since tess factors get emitted with spir-v now. Fixes a bunch of: dEQP-VK.tessellation* along with 7000 others. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6339>	2020-08-19 10:16:36 +10:00
Dave Airlie	7893dfa247	gallivm/sample: fix lod query on array textures. The lod query doesn't take a layer, but the code tries to use one, detect lodq and don't use a layer in those cases. There appears to be no GL tests for this behaviour, but the vulkan CTS hits it. Fixes: dEQP-VK.glsl.texture_functions.query.texturequerylod.* Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6339>	2020-08-19 10:16:27 +10:00
Dave Airlie	da4f2215aa	gallivm/nir: add imod support Just adds support for the imod instruction Fixes: dEQP-VK.glsl.operator.binary_operator.mod.* Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6339>	2020-08-19 10:16:24 +10:00
Dave Airlie	3abf548270	gallivm/nir: add load push constant support This convets the push constant load to a ubo load against the 0 constant buffer, which is where the vallium layer will bind things. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6082>	2020-08-17 14:30:49 +10:00
Eric Anholt	5b8d67cb64	util: Move fetch_rgba to a separate function table. Only llvmpipe and translate_generic use it, and only in fallbacks, so if you're not building that then let's not bloat our binaries with it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6305>	2020-08-16 21:25:14 +00:00
Eric Anholt	35b22b5da0	util: Make all 3 fetch_rgba functions occupy the same function slot. A single format either had the float, the sint, or the uint version. Making the dst be void * lets us store them in the same slot and not have logic in the callers to call the right one. -6kb on gallium drivers Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6305>	2020-08-16 21:25:14 +00:00
Eric Anholt	4064a6cd20	util: Split the pack/unpack functions out of the format desc. This gives the compiler a chance to GC pack/unpack functions separate from the format descriptions. For drivers that use everything, this is +10-20kb, while for libvulkan_intel it's -1.3MB. Bug: https://bugs.chromium.org/p/chromium/issues/detail?id=1048434 Acked-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5826>	2020-08-12 13:13:39 -07:00
Rob Clark	c4e0cae90c	gallium: replace 16BIT_TEMPS cap with 16BIT_CONSTS All drivers that support mediump lowering should support 16BIT_TEMPS, but some do not also want 16b consts to be lowered. Replace the pipe cap in preperation to remove LowerPrecisionTemporaries. Note: also updates reference checksums for the arm64_a630_traces job, due to lowering more to 16b Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6189>	2020-08-05 21:00:44 +00:00

1 2 3 4 5 ...

1642 Commits