mirrors/mesa - Frog Git

Commit Graph

Author	SHA1	Message	Date
Jason Ekstrand	cf8cf8a827	vulkan/cmd_queue: Re-flow MANUAL_COMMANDS This just makes it all a bit easier to read. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15311>	2022-03-10 15:52:10 +00:00
Boris Brezillon	290e33ab20	vulkan/cmd_queue: Remove duplicate entries in MANUAL_COMMANDS Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15311>	2022-03-10 15:52:10 +00:00
Louis-Francis Ratté-Boulianne	6bd8a3c7e4	vulkan/runtime: Add a vk_cmd_queue object to vk_command_buffer This is paving the road for generic secondary command buffer support, where commands are simply recorded in a software queue and replayed on the primary command buffer when vkCmdExecuteCommands() is called. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15311>	2022-03-10 15:52:10 +00:00
Louis-Francis Ratté-Boulianne	ad4d2da90a	vulkan/cmd_queue: Add an initializer for the vk_cmd_queue object Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15311>	2022-03-10 15:52:10 +00:00
Boris Brezillon	dd0f6cb45b	vulkan/cmd_queue: Constify vk_cmd_queue.alloc The implementation shouldn't modify the allocator. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15311>	2022-03-10 15:52:10 +00:00
Mike Blumenkrantz	a3d096f4ba	lavapipe: add the full list of cts fails easier to keep track this way Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15294>	2022-03-10 10:03:16 -05:00
Akihiko Odaki	b70f14188d	virgl: Check texture multisample compatibility v2: Support VIRGL_FORMAT_NONE (Gert Wollny) Signed-off-by: Akihiko Odaki <akihiko.odaki@gmail.com> Suggested-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15115>	2022-03-10 10:34:12 +00:00
Akihiko Odaki	571c5e8fdc	virgl/ci: Uprev virglrenderer Suggested-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Signed-off-by: Akihiko Odaki <akihiko.odaki@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15115>	2022-03-10 10:34:12 +00:00
Danylo Piliaiev	c4703cd846	tu: Implement VK_EXT_depth_clip_control Since negativeOneToOne is a static property of the pipeline and viewport state could be dynamic, we have to defer viewport state emission until negativeOneToOne value is known. See: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6070 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14363>	2022-03-10 11:08:50 +02:00
Iago Toral Quiroga	49b5431197	broadcom/compiler: remove unused functions Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15302>	2022-03-10 07:25:37 +00:00
Dylan Baker	45770ac286	docs: add release notes for 22.0.0 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15312>	2022-03-09 22:47:56 +00:00
Dylan Baker	8474817253	docs: Add calendar entries for 22.0 release. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15312>	2022-03-09 22:47:56 +00:00
Dylan Baker	b7e1df14f0	docs: update calendar and link releases notes for 22.0.0 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15312>	2022-03-09 22:47:56 +00:00
Timur Kristóf	75a783ea73	ac: Query the amdgpu MEC firmware version. MEC (Micro Engine Compute) is the firmware which is responsible for the compute-only queues on AMD GPUs. It is present on GFX7 and newer. This patch will query the version of this firmware and print it among the others. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15283>	2022-03-09 21:31:48 +00:00
Rob Clark	f4ec900953	mesa: Fix discard_framebuffer for fbo vs winsys GL is annoying when it comes to having different enums for winsys vs fbo. Note that the issue this closes was only accidentially exposed by a change the resulted in sysmem vs GMEM path taken. Fixes: `db2ae51121` ("mesa: Skip partial InvalidateFramebuffer of packed depth/stencil.") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6103 Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15308>	2022-03-09 20:40:53 +00:00
Emma Anholt	d5d8519cb5	docs/ci: Add docs for using a POE switch to control boards, like nouveau. Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15201>	2022-03-09 19:47:04 +00:00
Emma Anholt	e8da28d5e8	docs/ci: Update some bare-metal CI docs. We haven't been using initramfs in a long time, don't point people that direction. Do point people at existing instances of these CI variants, though. Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15201>	2022-03-09 19:47:04 +00:00
Emma Anholt	5497d60639	ci/nouveau: Add a manual run for the Jetson Nano (GM20B). The test suite is full of flakes around transform feedback, atomics, and tess. But, I hope it can be useful for regression testing core Mesa reworks. This required updating the kernel to 5.16.12 to get a more stable boot process. That kernel rebuild caused an update of the container with piglit which that was missed in a previous MR, so we got new xfails in x86 swrast. Acked-by: Ilia Mirkin <imirkin@alum.mit.edu> (nouveau) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15201>	2022-03-09 19:47:04 +00:00
Emma Anholt	1b374f8c91	ci/nouveau: Add nouveau support to the rootfs. This required updating the kernel to 5.16.12 to get a more stable boot process. That kernel rebuild caused an update of the container with piglit which that was missed in a previous MR, so we got new xfails in x86 swrast. Also, including modules on arm64 exposed a bug in v3d's poe-powered.sh rsyncing of modules. Acked-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15201>	2022-03-09 19:47:04 +00:00
Emma Anholt	a9e67738d6	ci: Stop xz-compressing firmware for ramdisks. This ends up breaking nouveau because the renames break symlinks in the firmware directory structure. We don't need it any more since we stopped doing ramdisks. Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15201>	2022-03-09 19:47:04 +00:00
Emma Anholt	9b918c4df2	ci/bare-metal: Increase maximum retry count for POE boots. The manual jetson CI job I'm introducing has serious boot reliability trouble, but also we've seen frequent intermittent failures on bcm where at least 2 boots don't seem to be enough (#6041). Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15201>	2022-03-09 19:47:04 +00:00
Emma Anholt	45b7648cb1	ci/bare-metal: Drop the BM_POE_USERNAME/PASSWORD env var checks. They're unused since the transition to SNMP in the rpi test farm. Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15201>	2022-03-09 19:47:04 +00:00
Mike Blumenkrantz	c24bca2d3a	zink: lower dmod on AMD hardware this hardware won't return the correct value from dmod instructions, so lower it to ensure that cts passes nobody else will ever hit this, so perf isn't an issue and regular fmod can be left alone fixes (amd): KHR-GL46.gpu_shader_fp64.builtin.mod_d* Fixes: `5fae35fb17` ('zink: fix 64bit float shader ops ') Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15306>	2022-03-09 19:13:02 +00:00
Mike Blumenkrantz	1845957a31	zink: add another radv fail it looks like this one was erroneously excluded Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15307>	2022-03-09 14:00:06 -05:00
Mike Blumenkrantz	e70b6be117	zink: update radv fails Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15307>	2022-03-09 14:00:06 -05:00
Chia-I Wu	889d050739	venus: add VK_EXT_vertex_attribute_divisor Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15265>	2022-03-09 17:24:49 +00:00
Chia-I Wu	4752429e36	venus: add VK_EXT_shader_stencil_export Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15265>	2022-03-09 17:24:49 +00:00
Chia-I Wu	1ecd481bd7	venus: add VK_EXT_robustness2 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15265>	2022-03-09 17:24:49 +00:00
Chia-I Wu	25795308ef	venus: add VK_EXT_depth_clip_enable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15265>	2022-03-09 17:24:49 +00:00
Chia-I Wu	bebe5e3925	venus: add VK_EXT_conservative_rasterization Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15265>	2022-03-09 17:24:49 +00:00
Chia-I Wu	f0e0daf46b	venus: add VK_EXT_shader_demote_to_helper_invocation Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15265>	2022-03-09 17:24:49 +00:00
Chia-I Wu	99473f610a	venus: update venus-protocol headers Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15265>	2022-03-09 17:24:49 +00:00
Marcin Ślusarz	823cffbe1c	anv: include Primitive Header in mesh shader per-primitive output Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Marcin Ślusarz	f410c1142f	anv: set number of viewports in clip state (mesh) Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Marcin Ślusarz	81df66bfff	intel/compiler: mark some variables as per-primitive in FS if they come from MS Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Marcin Ślusarz	8c16ce53a9	intel/compiler: handle ViewportIndex, PrimitiveID and Layer in MUE setup Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Marcin Ślusarz	bc4f8c073a	intel/compiler: inject MUE initialization Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Marcin Ślusarz	333a490e32	intel/compiler: shift mesh urb read/write window when offset is too large Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Samuel Pitoiset	6c1c9067d9	aco: always emit vk_cvt_pkrtz_f16_f32 for nir_op_pack_half_2x16_split From the VK_KHR_shader_float_controls extension: "5) Do any of the “Pack” GLSL.std.450 instructions count as conversion instructions and have the rounding mode applied?" "RESOLVED: No, only instructions listed in “section 3.32.11. Conversion Instructions” of the SPIR-V specification count as conversion instructions." This is also the same logic as the LLVM backend. No fossils-db changes on Sienna Cichlid. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15301>	2022-03-09 16:24:20 +00:00
Erik Faye-Lund	fa41bd0687	docs: improve language in zink article Turns out, this was not proper use of language! Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15300>	2022-03-09 16:19:16 +00:00
Erik Faye-Lund	e666134975	docs: fixup zink gl 4.3 requirements The multiViewport feature isn't required for GL 4.3, it's required for GL 4.1. Technically speaking, we could have just dropped it because we already list the maxViewports requirement. But it seems better to be very clear here to me. Fixes: `29f8f21bff` ("docs: document zink GL 4.3 requirements") Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15300>	2022-03-09 16:19:16 +00:00
Iago Toral Quiroga	44feff93c2	broadcom/compiler: don't always assign r5 if available Instead, only favor assigning r5 if we have first decided to assign an accumulator. This helps with assining r5 to short lived uniforms, favoring accumulator rotation to facilitate QPU merges. total instructions in shared programs: 12656164 -> 12628339 (-0.22%) instructions in affected programs: 5368373 -> 5340548 (-0.52%) helped: 17420 HURT: 9996 total uniforms in shared programs: 3704776 -> 3704863 (<.01%) uniforms in affected programs: 12247 -> 12334 (0.71%) helped: 23 HURT: 78 total max-temps in shared programs: 2153505 -> 2152684 (-0.04%) max-temps in affected programs: 26468 -> 25647 (-3.10%) helped: 569 HURT: 328 total fills in shared programs: 4656 -> 4657 (0.02%) fills in affected programs: 43 -> 44 (2.33%) helped: 0 HURT: 1 total sfu-stalls in shared programs: 34728 -> 34403 (-0.94%) sfu-stalls in affected programs: 3411 -> 3086 (-9.53%) helped: 842 HURT: 534 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:04 +00:00
Iago Toral Quiroga	77f58b46d9	broadcom/compiler: add comment on why we don't use r5 with ldunifa Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:04 +00:00
Iago Toral Quiroga	5b140428b0	broadcom/compiler: adjust register threshold for 2-thread compiles We have twice the registers in this case so it makes sense to double this as well. While this causes slight regressions in shader-db stats (due to additional register pressure), it helps us hide latency of memory reads better on 2-thread compiles, where the thread switch mechanism will be less effective. This shows a ~3% performance improvement on the UE4 SunTemple demo. total instructions in shared programs: 12642413 -> 12656164 (0.11%) instructions in affected programs: 2272652 -> 2286403 (0.61%) helped: 2924 HURT: 3389 total uniforms in shared programs: 3703861 -> 3704776 (0.02%) uniforms in affected programs: 213729 -> 214644 (0.43%) helped: 823 HURT: 1272 total max-temps in shared programs: `2150686` -> 2153505 (0.13%) max-temps in affected programs: 191332 -> 194151 (1.47%) helped: 1900 HURT: 1891 total spills in shared programs: 3255 -> 3274 (0.58%) spills in affected programs: 166 -> 185 (11.45%) helped: 3 HURT: 6 total fills in shared programs: 4630 -> 4656 (0.56%) fills in affected programs: 367 -> 393 (7.08%) helped: 7 HURT: 15 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:04 +00:00
Iago Toral Quiroga	a35b47a0b1	broadcom/compiler: add a strategy to disable scheduling of general TMU reads This can add quite a bit of register pressure so it makes sense to disable it to prevent us from dropping to 2 threads or increase spills: total instructions in shared programs: 12672813 -> 12642413 (-0.24%) instructions in affected programs: 256721 -> 226321 (-11.84%) helped: 719 HURT: 77 total threads in shared programs: 415534 -> 416322 (0.19%) threads in affected programs: 788 -> 1576 (100.00%) helped: 394 HURT: 0 total uniforms in shared programs: 3711370 -> 3703861 (-0.20%) uniforms in affected programs: 28859 -> 21350 (-26.02%) helped: 204 HURT: 455 total max-temps in shared programs: 2159439 -> `2150686` (-0.41%) max-temps in affected programs: 32945 -> 24192 (-26.57%) helped: 585 HURT: 47 total spills in shared programs: 5966 -> 3255 (-45.44%) spills in affected programs: 2933 -> 222 (-92.43%) helped: 192 HURT: 4 total fills in shared programs: 9328 -> 4630 (-50.36%) fills in affected programs: 5184 -> 486 (-90.62%) helped: 196 HURT: 0 Compared to the stats before adding scheduling of non-filtered memory reads we see we that we have now gotten back all that was lost and then some: total instructions in shared programs: 12663186 -> 12642413 (-0.16%) instructions in affected programs: 2051803 -> 2031030 (-1.01%) helped: 4885 HURT: 3338 total threads in shared programs: 415870 -> 416322 (0.11%) threads in affected programs: 896 -> 1348 (50.45%) helped: 300 HURT: 74 total uniforms in shared programs: 3711629 -> 3703861 (-0.21%) uniforms in affected programs: 158766 -> 150998 (-4.89%) helped: 1973 HURT: 499 total max-temps in shared programs: 2138857 -> `2150686` (0.55%) max-temps in affected programs: 177920 -> 189749 (6.65%) helped: 2666 HURT: 2035 total spills in shared programs: 3860 -> 3255 (-15.67%) spills in affected programs: 2653 -> 2048 (-22.80%) helped: 77 HURT: 21 total fills in shared programs: 5573 -> 4630 (-16.92%) fills in affected programs: 3839 -> 2896 (-24.56%) helped: 81 HURT: 15 total sfu-stalls in shared programs: 39583 -> 38154 (-3.61%) sfu-stalls in affected programs: 8993 -> 7564 (-15.89%) helped: 1808 HURT: 1038 total nops in shared programs: 324894 -> 323685 (-0.37%) nops in affected programs: 30362 -> 29153 (-3.98%) helped: 2513 HURT: 2077 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:04 +00:00
Iago Toral Quiroga	f783bd0d2a	broadcom/compiler: define v3d-specific delays for NIR instructions We do a few changes over NIR's defaults: 1. Lower delay for texture reads. Empirically, we don't observe any benefits with delays over 50 and since this delay value is still used by the scheduler in the "favor register pressure" case it is benefitial to avoid overestimating it too much. 2. Adjust delay for non-filtered TMU reads to the delay selected for texture reads. 3. In our case, UBO reads from dynamically uniform addresses don't use the TMU and have a latency of 1 instruction in the best case scenario or 4 at worse, so we go with 1 so we don't try to move this early. This helps us get back some of what we lost when updating the default scheduler configuration to add a delay for non-filtered memory reads: total instructions in shared programs: 13126587 -> 12671765 (-3.46%) instructions in affected programs: 3764097 -> 3309275 (-12.08%) helped: 14664 HURT: 4244 total threads in shared programs: 407208 -> 415522 (2.04%) threads in affected programs: 8716 -> 17030 (95.39%) helped: 4224 HURT: 67 total uniforms in shared programs: 3812698 -> 3711224 (-2.66%) uniforms in affected programs: 335170 -> 233696 (-30.28%) helped: 2816 HURT: 3551 total max-temps in shared programs: 2318430 -> 2159345 (-6.86%) max-temps in affected programs: 539991 -> 380906 (-29.46%) helped: 13173 HURT: 1440 total spills in shared programs: 49086 -> 5966 (-87.85%) spills in affected programs: 48306 -> 5186 (-89.26%) helped: 1655 HURT: 28 total fills in shared programs: 55810 -> 9328 (-83.29%) fills in affected programs: 54821 -> 8339 (-84.79%) helped: 1659 HURT: 22 LOST: 0 GAINED: 3 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:04 +00:00
Iago Toral Quiroga	fed51585c4	nir/schedule: allow drivers to decide about instruction latency On V3D reading UBOs from uniform addresses uses a more efficient mechanism with lower latency. On other platforms there may be simular scenarios. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:04 +00:00
Iago Toral Quiroga	e7a4e97076	nir/schedule: use larger delay for non-filtered memory reads This has been pending for a long time. It is not very consistent to add a significant delay for textures and not do it for UBOs, etc The reason we have not been doing this so far is the accumulated effect on register pressure for V3D as shown by shader-db results below, but from the point of view of a generic scheduler it makes sense to do this. Later patches will address V3D specific issues with register pressure derived from this by letting the driver control its instruction delay settings. total instructions in shared programs: 12662138 -> 13126587 (3.67%) instructions in affected programs: 1813091 -> 2277540 (25.62%) helped: 2410 HURT: 10499 total threads in shared programs: 415858 -> 407208 (-2.08%) threads in affected programs: 17348 -> 8698 (-49.86%) helped: 8 HURT: 4333 total uniforms in shared programs: 3711483 -> 3812698 (2.73%) uniforms in affected programs: 128012 -> 229227 (79.07%) helped: 3474 HURT: 2143 total max-temps in shared programs: 2138763 -> 2318430 (8.40%) max-temps in affected programs: 318780 -> 498447 (56.36%) helped: 588 HURT: 11997 total spills in shared programs: 3860 -> 49086 (1171.66%) spills in affected programs: 709 -> 45935 (6378.84%) helped: 23 HURT: 1595 total fills in shared programs: 5573 -> 55810 (901.44%) fills in affected programs: 1067 -> 51304 (4708.25%) helped: 23 HURT: 1595 LOST: 3 GAINED: 0 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:04 +00:00
Iago Toral Quiroga	3bd041e2fb	nir/schedule: handle nir_intrinsic_group_memory_barrier Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:04 +00:00
Iago Toral Quiroga	46e330c07e	nir/schedule: fix handling of generic memory barrier We can get a generic nir_intrinsic_memory_barrier to represent a barrier involving multiple semantics (instead of getting individual specific barriers for each semantic). This means that we need to consider these as potentially affecting shared memory access as well. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:04 +00:00

1 2 3 4 5 ...

150975 Commits All Branches Search

150975 Commits

All Branches