KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Lionel Landwerlin	fdb74a77d2	intel/ds: fix compilation with perfetto Forgot to test with perfetto... Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `9da3d714b8` ("anv: add dynamic rendering traces") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5992 Reviewed-by: Rohan Garg <rohan.garg@intel.com> Tested-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14914>	2022-02-08 12:29:21 +00:00
Dylan Baker	a52e4871fe	meson: add radv to meson devenv I either rebased this out of the original PR, just failed to commit it and then reset it. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14879>	2022-02-07 21:22:12 -08:00
Mike Blumenkrantz	8335fdfeaf	vk/sync: add asserts for timeline semaphore count matching spec requires that the number of timeline waits/signals matches the base number of waits/signals if there are any timeline semaphores being processed by the submit, so asserting here is in line with what validation will yield failure to match these will also hang every driver I've tested, so asserting here potentially saves some people their desktop session Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14741>	2022-02-08 04:09:13 +00:00
Mike Blumenkrantz	388f23eabe	zink: min/max blit region in coverage functions these regions might not have the coords in the correct order, which will cause them to fail intersection tests, resulting in clears that are never applied cc: mesa-stable fixes: GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_all_buffer_blit GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_color_and_depth_blit GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_color_and_stencil_blit GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_linear_filter_color_blit GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_magnifying_blit GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_minifying_blit GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_missing_buffers_blit GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_nearest_filter_color_blit GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_negative_dimensions_blit GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_negative_height_blit GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_negative_width_blit GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_scissor_blit Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14867>	2022-02-08 03:42:01 +00:00
Mike Blumenkrantz	b656ab75a6	zink: reject invalid draws cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14859>	2022-02-08 03:29:45 +00:00
Mike Blumenkrantz	e38c13830f	zink: fix PIPE_CAP_TGSI_BALLOT export conditional this requires VK_EXT_shader_subgroup_ballot cc: mesa-stable fixes (lavapipe): KHR-GL46.shader_ballot_tests.ShaderBallotAvailability KHR-GL46.shader_ballot_tests.ShaderBallotFunctionRead Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14858>	2022-02-08 03:16:06 +00:00
Mike Blumenkrantz	8907964dcd	zink: export PIPE_SHADER_CAP_TGSI_CONT_SUPPORTED this is supported and has been for a while Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14858>	2022-02-08 03:16:06 +00:00
Pierre-Eric Pelloux-Prayer	413bf889e7	radeonsi/blit: relax conditions to use sdma copy for prime buffers We don't need to check if it's imported: PIPE_BIND_DRI_PRIME is enough. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14615>	2022-02-08 00:13:07 +00:00
Pierre-Eric Pelloux-Prayer	3b27ad1504	radeonsi: create prime buffers as uncached `8791e831b1` marked imported prime buffers as uncached (useful when prime buffer is allocated by the display GPU), but they should also be created as uncached (useful when allocated by the render GPU). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14615>	2022-02-08 00:13:07 +00:00
Pierre-Eric Pelloux-Prayer	18c38bf78f	gallium: rename PIPE_BIND_DRI_PRIME The new name PIPE_BIND_PRIME_BLIT_DST is more precise. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14615>	2022-02-08 00:13:07 +00:00
Pierre-Eric Pelloux-Prayer	42c149e36b	gallium/dri: add missing PIPE_BIND_DRI_PRIME handling `e9c3dbd046` added PIPE_BIND_DRI_PRIME but it was only set when importing a prime buffer. This commit adds handling of this flag in the other codepath = the one where the prime buffer is allocated by the render GPU. With this change PIPE_BIND_DRI_PRIME is still only set for the render GPU - the display GPU will never see this flag; a future commit will rename it. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14615>	2022-02-08 00:13:07 +00:00
Kenneth Graunke	3926be368e	ci/iris: Mark qbo tests as flakes These appear to have some kind of race condition and usually fail, but sometimes pass. We had already attempted to mark them as flakes on amly, but need to mark them as flakes on KBL+ too. See https://gitlab.freedesktop.org/mesa/mesa/-/jobs/18522605 and https://gitlab.freedesktop.org/mesa/mesa/-/jobs/18527737 where these unexpectedly passed on KBL, and also where the top-level test not being caught by the regex led to failures. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14916>	2022-02-07 14:30:52 -08:00
Zoltán Böszörményi	df1751a2bb	crocus: Enable compat profile the same way as core profile Signed-off-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11756>	2022-02-07 19:41:44 +00:00
Kenneth Graunke	604d97671b	iris: Add support for flushing the blitter (hackily) To flush the blitter, we need to use MI_FLUSH_DW rather than the usual PIPE_CONTROL we use on the 3D engine. Most of our code is set up to suggest flushes via PIPE_CONTROL commands, however, so we hackily just emit MI_FLUSH_DW when they ask for any kind of PIPE_CONTROL flush. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14912>	2022-02-07 09:50:01 -08:00
Kenneth Graunke	9c5dc4985b	blorp: Assert that blorp_copy() on the blitter can handle it Safeguards against callers that don't guarantee the necessary things. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14912>	2022-02-07 09:50:01 -08:00
Kenneth Graunke	d2646e147b	intel/genxml: Add missing MI_FLUSH_DW::Flush CCS field Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14912>	2022-02-07 09:50:01 -08:00
Rhys Perry	7ddad1b93a	radv: fix R_02881C_PA_CL_VS_OUT_CNTL with mixed cull/clip distances Matches radeonsi. Seems Vulkan CTS doesn't really test cull distances. Removing VARYING_SLOT_CULL_DIST0/VARYING_SLOT_CULL_DIST1 variables doesn't break any of dEQP-VK.clipping.*, except for tests which read the variables in the fragment shader. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5984 Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14882>	2022-02-07 12:36:38 +00:00
Danylo Piliaiev	44bdac9849	tu: Implement VK_AMD_buffer_marker to support Graphics Flight Recorder Graphics Flight Recorder is: "The Graphics Flight Recorder (GFR) is a Vulkan layer to help trackdown and identify the cause of GPU hangs and crashes. It works by instrumenting command buffers with completion tags." This is a nice little tool which could help quickly identify the call which hanged. Or if command buffer is executed for too long. The tiling nature of our GPU shouldn't be a big issue aside from lower performance. For non-segfault case, if: - Hang happens at the same place in cmdbuf and draw/dispatch is not finished at that point - it is likely that there is an infinite loop in some of the shaders in this draw. - Hang happens always in different place - likely there is nothing wrong and command buffer just takes too long to execute and you should try increasing hangcheck_period_ms. If it doesn't help it is likely a synchronization issue. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13553>	2022-02-07 12:53:34 +02:00
Daniel Stone	b561946497	egl/wayland: Don't replace existing backbuffer in get_buffers If the surface already has a current backbuffer - say through a buffer_age query - we do not want to replace it in get_buffers, because it means the result we'd previously returned them is stale. If we already have a backbuffer set on the surface, keep it locked in no matter what until we hit SwapBuffers. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14873>	2022-02-07 09:57:41 +00:00
Daniel Stone	3da8300562	egl/wayland: Reset buffer age when destroying buffers A buffer age of 0 means that the buffer is uninitialised or has unknown content. We rely on the buffer age initially being 0 through zalloc when the surface is first created; when they are first used for a swap, we set their age to 1, and then we increment the age of every buffer in the chain with a non-zero age when we swap. Now that we can release buffers, both through dmabuf-feedback as well as detecting when we're using a deeper swapchain than the compositor needs, make sure to reset their age as they are released. Without doing this, the age will stay as it was before it was released and be incremented, returning the wrong age to the user the first time a previously-released buffer slot has been reused. Signed-off-by: Daniel Stone <daniels@collabora.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5977 Fixes: `22d796feb8` ("egl/wayland: break double/tripple buffering feedback loops") Fixes: `b5848b2dac` ("egl/wayland: use surface dma-buf feedback to allocate surface buffers") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14873>	2022-02-07 09:57:41 +00:00
Emma Anholt	fa4390f7bf	ci/iris: Add skips and flakes notes for recent #intel-ci logs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14904>	2022-02-07 09:33:59 +00:00
Emma Anholt	0cf32b5079	ci/crocus: Add recent flakes from #intel-ci Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14904>	2022-02-07 09:33:59 +00:00
Emma Anholt	6423045957	ci/softpipe,llvmpipe: Disable Xvfb server reset on piglit runs. The resets take time that we don't need to spend. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14904>	2022-02-07 09:33:59 +00:00
Samuel Pitoiset	9ea4029f9f	Revert "radv: re-apply "Do not access set layout during vkCmdBindDescriptorSets."" The most famous RADV revert over the past months. This was an issue in RADV and not an use-after-free (descriptor set layouts can be destroyed almost at any time). This reverts commit `b775aaff1e`. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14621>	2022-02-07 08:24:36 +01:00
Samuel Pitoiset	66f7289d56	radv: add reference counting for descriptor set layouts The spec states that descriptor set layouts can be destroyed almost at any time: "VkDescriptorSetLayout objects may be accessed by commands that operate on descriptor sets allocated using that layout, and those descriptor sets must not be updated with vkUpdateDescriptorSets after the descriptor set layout has been destroyed. Otherwise, descriptor set layouts can be destroyed any time they are not in use by an API command." Based on ANV. Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5893 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14621>	2022-02-07 08:24:36 +01:00
Dave Airlie	37c3be6947	crocus: find correct relocation target for the bo. If we have batch a + b, and writing to batch b, causes batch a to flush, all the bo->index get reset, and we try to submit a -1 to the kernel. Look the bo index up when creating relocations. Fixes crash seen in KHR-GL46.compute_shader.pipeline-post-fs and a trace from Wasteland 3 Fixes: `f3630548f1` ("crocus: initial gallium driver for Intel gfx 4-7") Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14905>	2022-02-07 18:01:36 +10:00
Zoltán Böszörményi	d774059a0c	crocus: enable GL46 tests for HSW in ci Signed-off-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14889>	2022-02-06 23:46:28 +00:00
Alyssa Rosenzweig	0299600efb	asahi: Fix memory unsafety in delete_sampler_state The type is wrong, masked by a void*, meaning the free is completely wrong. ASan is rightfully unhappy. Fixes crashes destroying the context. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14901>	2022-02-06 15:46:55 -05:00
Alyssa Rosenzweig	786871c87e	agx: Don't kill helper threads in ld_var Apparently this is yet another .kill bit. Fixes: dEQP-GLES3.functional.shaders.derivate.dfdx.linear.* dEQP-GLES3.functional.shaders.derivate.dfdy.linear.* Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14899>	2022-02-06 15:02:39 +00:00
Alyssa Rosenzweig	367d93bcd4	agx: Handle texture array indices These need to be converted to integers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14899>	2022-02-06 15:02:39 +00:00
Alyssa Rosenzweig	b459473bb9	agx: Implement nir_op_txb Like explicit LODs, biases must be 16-bit, so add a lowering rule for this. With the LOD mode selection updated for txb, we can then ingest biases like explicit LODs and allowlist txb. Passes: dEQP-GLES2.functional.shaders.texture_functions.fragment.texture2d_bias dEQP-GLES2.functional.texture.mipmap.2d.bias.* Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14899>	2022-02-06 15:02:39 +00:00
Alyssa Rosenzweig	e2903f66ec	agx: Translate LOD modes more generically Now includes support for auto_load_bias mode. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14899>	2022-02-06 15:02:39 +00:00
Alyssa Rosenzweig	d5b7d629d7	agx: Add AUTO_LOD_BIAS mode Automatic load with a bias. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14899>	2022-02-06 15:02:39 +00:00
Alyssa Rosenzweig	93f2ae1205	asahi: Correctly set IOGPU_ATTACHMENT::size Not sure what this is used for, but let's not lie to the kernel. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14898>	2022-02-06 09:48:34 -05:00
Alyssa Rosenzweig	daab41b80b	asahi: Identify IOGPU_ATTACHMENT::size Oops. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14898>	2022-02-06 09:48:34 -05:00
Charmaine Lee	945a1e0b8c	mesa: fix misaligned pointer returned by dlist_alloc In cases where the to-be-allocated node size with padding exceeds BLOCK_SIZE but without padding doesn't, a new block is not created and no padding is done to the previous instruction, causing a misaligned pointer to be returned. v2: Per Ilia Mirkin's suggestion, remove the extra condition in the first if statement, let it unconditionally pad the last instruction if needed. The updated currentPos will then be taken into account in the block size checking. This fixes crash seen with lightsmark and Optuma apitraces Fixes: `05605d7f53` (' mesa: remove display list OPCODE_NOP') Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Neha Bhende <bhenden@vmware.com> Tested-by: Neha Bhende <bhenden@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14871>	2022-02-05 22:45:01 +00:00
Neha Bhende	9230b28533	svga: store shared_mem_size in svga_compute_shader instead of svga_context When new context was created, shared_mem_size was getting overwritten. This fixes glretrace failure seen with manhattan, aztec and BASS2_intro apitraces Fixes: `247c61f2d0` ('svga: Add support for compute shader, shader buffers and image views') Tested with glretrace, piglit Reviewed-by: Charmaine Lee <charmainel@vmware.com> (cherry picked from commit dd6793ec9218782b1b716a87582d7219bae4e75f) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14870>	2022-02-05 22:17:32 +00:00
Kenneth Graunke	c7a357787f	anv: Increase maxUniformBufferRange to 2^30 when not using the sampler The limit here is from the RENDER_SURFACE_STATE height/width/depth fields - it's 2^30 for ISL_FORMAT_RAW buffers, and 2^27 otherwise. anv_isl_format_for_descriptor_type() uses ISL_FORMAT_R32G32B32A32_FLOAT for uniform buffers when compiler->indirect_ubos_use_sampler is set (Icelake and earlier), but ISL_FORMAT_RAW when it isn't (Tigerlake+). So we can increase the limit on Tigerlake and later. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14892>	2022-02-05 11:23:16 -08:00
Pavel Ondračka	49f4f1ec22	r300: fix deadcode elimination in loops with breaks We are updating the deadcode state while walking the program backwards. When encountering ENDLOOP, we scan the loop, mark everything in the loop as used and than continue as usuall. We were previously trying to be smart with the breaks. This was however not working as expected. Instead, save the most pesimistic deadcode state from the ENDLOOP and just restore it anytime we see a break. This keeps the code simple and more importantly does not touch the flat and IF(-ELSE)-ENDIF paths at all so reduces the chances of regression. No changes with my shader-db. Fixes piglits on RV530: shaders/ssa/fs-if-def-else-break.shader_test spec/glsl-1.10/execution/vs-loop-array-index-unroll.shader_test Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5832 Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14661>	2022-02-05 15:39:26 +01:00
Lionel Landwerlin	9da3d714b8	anv: add dynamic rendering traces v2: Get rid of subpass_count Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14798>	2022-02-04 23:43:48 +00:00
Lionel Landwerlin	d0811ca046	anv: flush utrace before at device destroy Ensuring any remaining traces are displayed. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14798>	2022-02-04 23:43:48 +00:00
Mike Blumenkrantz	960e72417f	zink: use scanout obj when returning resource param info embarrassing typo since the base obj has no modifier data available cc: mesa-stable fixes #5980 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14875>	2022-02-04 23:25:36 +00:00
Boris Brezillon	a8fbfcfbd3	pan/midg: Support 8/16 bit load/store Needed for panvk copy shaders to support 8 or 16bit formats. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14885>	2022-02-04 17:12:35 -05:00
Boris Brezillon	59ea6e2e27	pan/midg: Add a pass to lower non-logbase2 global/shared loads Compute shaders might do vec3(Xbits) loads which are translated to LD.<next_pow2(3 * X)> by the midgard compiler. This might cause out-of-bound accesses potentially leading to pagefaults if the access is at the end of a BO. One solution to avoid that (maybe not the best) is to split non-log2 loads to make sure we only read what's requested. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14885>	2022-02-04 17:12:35 -05:00
Boris Brezillon	3f9bce08e1	pan/midg: Fix swizzle packing on 64bit instructions with src-expansion + dst-shrinking In that case, the mask is specified on 32bit lanes, so we need to shift it if it's > 0x3. The expand modifier will take care of selecting the right side of the 32bit vector. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14885>	2022-02-04 17:12:35 -05:00
Boris Brezillon	da474d5d14	pan/midg: Fix the upper/lower limit on 8bit vectors If I'm correct, the lower/upper split on 8bit vectors is 8, not 4. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14885>	2022-02-04 17:12:35 -05:00
Boris Brezillon	b58c262144	pan/midg: Fix 64-bit swizzle printer Swizzling happens in 2 steps on Midgard: 1. Vector expansion/shuffling 2. Swizzling at the instruction-size granularity, but defined using the source size. Those size are different if the source is expanded. So, when we print 64 bit swizzles on an expanded source, we first need to apply an offset if the high part of the 32bit vector was selected, and then divide the result by 2 to account for vector expansion. To sum-up, swizzling on midgard is complicated, and I'm not sure I got it right, but it seems to print what I expect on the few compute shaders using 64bit arithmetic I debugged. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14885>	2022-02-04 17:12:35 -05:00
Boris Brezillon	39e4b7279d	pan/midg: Fix swizzling on 8-bit sources Even though 8-bit ALUs are not supported, we can have [un]pack_32_4x8 instructions which translate to IMOVs, and those operate on 8-bit vectors. The problem is, the swizzling granularity is 16 bit, which means we don't support MOV.i8 R0.xyzw, TMP0.xxxx, R1.zyxw and the compiler doesn't even complain, it just applies 8 bit swizzling directly, which obviously doesn't work. This is probably not the right way to fix that, but I thought I'd raised the issue with a hack to fix, so we can get the discussion started. (Found while debugging FB store lowering on Midgard). Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14885>	2022-02-04 17:12:35 -05:00
Boris Brezillon	65209b1adb	pan/midg: Prefix scalar immediates with '#' instead of '<' We already do that for scalar instructions, so let's do it for vector instructions with a single component too. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14885>	2022-02-04 17:12:35 -05:00
Boris Brezillon	36bb1ac453	pan/midg: Remove spurious printf() in print_vector_constants() Also tried to replace that one by an fprintf(fp, ...), but it pollutes the dump too. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14885>	2022-02-04 17:12:35 -05:00

... 2 3 4 5 6 ...

150037 Commits All Branches Search

150037 Commits

All Branches