mirrors/mesa - Frog Git

Commit Graph

Author	SHA1	Message	Date
Samuel Pitoiset	59d3a8ea07	ci: uprev CTS to 1.3.8.2 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28871>	2024-04-24 10:48:11 +00:00
Karol Herbst	cd5c9870ea	rusticl/program: handle -cl-no-subgroup-ifp As per spec we don't have to do anything with that flag. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28873>	2024-04-24 10:25:41 +00:00
Corentin Noël	ca861e8f75	ci: Add zink-venus-lvp job Test Zink on Venus on Lavapipe. Acked-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: David Heidelberg <david.heidelberg@collabora.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Signed-off-by: Corentin Noël <corentin.noel@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27790>	2024-04-24 09:01:15 +00:00
Corentin Noël	e9dacca3f7	ci: Allow to pass LIBGL_ALWAYS_SOFTWARE to the guest environment Acked-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: David Heidelberg <david.heidelberg@collabora.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Signed-off-by: Corentin Noël <corentin.noel@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27790>	2024-04-24 09:01:15 +00:00
Iago Toral Quiroga	708a635902	broadcom/ci: document external causes for some CTS 1.3.8 failures Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28891>	2024-04-24 06:59:53 +00:00
Yonggang Luo	1de805e986	nouveau: Fixes error: unused import: `crate::nvh_classes_cl906f::` Full error message: error: unused import: `crate::nvh_classes_cl906f::` --> src/nouveau/headers/lib.rs:184:9 \| 184 \| pub use crate::nvh_classes_cl906f::*; \| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ \| = note: `-D unused-imports` implied by `-D warnings` = help: to override `-D warnings` add `#[allow(unused_imports)]` Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28855>	2024-04-24 06:37:39 +00:00
Yiwei Zhang	4fc3f11545	venus: fix VkDeviceGroupSubmitInfo::deviceMask for feedback cmds Unlike sync2, a legacy deviceMask of zero is indeed to skip. Fixes: `80f532a636` ("venus: fix VkDeviceGroupSubmitInfo cmd counts from feedback") Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28888>	2024-04-24 02:43:46 +00:00
Sagar Ghuge	46e4354940	intel/compiler: Disassemble mlen/rlen/ex_mlen in units of registers Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28637>	2024-04-23 23:46:26 +00:00
Caio Oliveira	ff89e83178	intel/brw: Lower VGRFs to FIXED_GRFs earlier Moves the lowering of VGRFs into FIXED_GRFs from the code generation to (almost) right after the register allocation. This will allow: (1) later passes not worry about VGRFs (and what they mean in a post reg alloc phase) and (2) make easier to add certain types of validation post reg alloc phase using the backend IR. Note that a couple of passes still take advantage of seeing "allocated VGRFs", so perform lowering after they run. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28604>	2024-04-23 23:17:57 +00:00
Caio Oliveira	5b3d4c757d	intel/brw: Support FIXED_GRF when generating code for CLUSTER_BROADCAST Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28604>	2024-04-23 23:17:57 +00:00
Pierre-Eric Pelloux-Prayer	b926cd3dd9	radv: don't use python 3.9 feature in radv_annotate_layer_gen.py This commit adds an implementation of removesuffix so we don't need the 'str' one which was added in 3.9. Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28831>	2024-04-23 22:45:51 +00:00
Pierre-Eric Pelloux-Prayer	27a3880ada	aco: don't use python 3.7+ feature in aco_opcodes.py Use the suggestion from https://stackoverflow.com/questions/11351032/named-tuple-and-default-values-for-optional-keyword-arguments so the script works on older Python. Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28831>	2024-04-23 22:45:51 +00:00
Sagar Ghuge	fe4f6dd18f	isl: Update shader channel select for missing components Bspec 57023: RENDER_SURFACE_STATE::Shader Channel Select Red "For channels not present in the surface format, the corresponding Surface Channel Select is either SCS_ZERO or SCS_ONE." This restriction applies to alpha channel as well if an associated resource is not used as a render target. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28791>	2024-04-23 22:08:30 +00:00
Sagar Ghuge	2d8686ccd5	isl: Update isl_swizzle_supports_rendering comment Bspec 57023: RENDER_SURFACE_STATE:: Shader Channel Select Red "Render Target messages do not support swapping of colors with alpha. The Red, Green, or Blue Shader Channel Selects do not support SCS_ALPHA. The Shader Channel Select Alpha does not support SCS_RED, SCS_GREEN, or SCS_BLUE." Cc: mesa-stable Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28791>	2024-04-23 22:08:30 +00:00
Mike Blumenkrantz	3a868970a2	zink: disable command reordering for compute-only contexts this is pointless Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28880>	2024-04-23 21:45:40 +00:00
Mike Blumenkrantz	ffb082f811	zink: make NOREORDER mode context-based Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28880>	2024-04-23 21:45:40 +00:00
Mike Blumenkrantz	ef0c9231a7	mesa/st: don't use serialized_nir for cached shaders serialized_nir doesn't exist here, so just use the cached nir Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11051 Fixes: `5eb0136a3c` ("mesa/st: when creating draw shader variants, use the base nir and skip driver opts") Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28857>	2024-04-23 21:06:31 +00:00
Leo Liu	dc85832c35	ac/gpu_info: Fix broken UVD firmware query UVD and VCE are separated engines, and not co-exist with VCNs Fixes: `c34cfc1a3b` (ac/gpu_info: update multimedia info) Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Boyuan Zhang <Boyuan.Zhang@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28863>	2024-04-23 20:26:14 +00:00
Job Noorman	f0ddba819f	freedreno/drm-shim: remove duplicate entry for a630 Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28459>	2024-04-23 20:03:51 +00:00
Job Noorman	1ffae320a8	freedreno/drm-shim: add a730, a740, and a750 Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28459>	2024-04-23 20:03:50 +00:00
Job Noorman	39088571f0	ir3: add support for predication Use predication instead of branching whenever possible and profitable: all divergent leaf branches are replaced with predication. Non-divergent branches are kept since for those a branch might be more performant when it jumps over all instructions. Although it might be possible to support a limited form of nested predication, this is more difficult to implement so we only support leaf branches for now. When translating from NIR to ir3, predication is emitted just like normal branches except that the branch is replaced with pred[tf] and the opposite (pred[ft]) is inserted at the end of the then-block. This pattern is then recognized during legalization at which point the closing prede is inserted. We don't insert this right away to allow opt_jump to optimize jumps out of the else-block. Since the branches we support for predication always have exactly one block in each arm, the then-block is emitted first, and blocks are never reordered, this way of emitting predicated branches ensures they have the correct memory layout. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27982>	2024-04-23 19:18:29 +00:00
Job Noorman	bbc78e92ff	ir3: add support for precolored sources in predicate RA To support predt/predf which always read from p0.x, we need to support precolored sources for the predicates RA. This patch implements this as follows: whenever a precolored source is encountered whose def isn't live in the correct register, reload it into the correct one. To make sure we don't reload too often, two precautions are made. First, we precolor all defs of precolored sources and try do use that register when allocating one for a def. Second, since currently only p0.x is used for precoloring, we try not to allocate it whenever there are outstanding precolored defs. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27982>	2024-04-23 19:18:29 +00:00
Job Noorman	2288ef916c	ir3: model predt/predf without sources We used to model predt/predf as taking a predicate register source. The blob disassembler shows them taking a label argument. However, it seems that both are incorrect: the condition is always taken from p0.x and I have not been able to construct a test case were the label makes any difference. This patch changes predt/predf to not take any arguments and adds documentation about how predicated execution works. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27982>	2024-04-23 19:18:29 +00:00
Job Noorman	d56f1abd72	ir3: remove unnecessary tessellation epilogue The tessellation epilogue was emitted as an empty predt/prede pair which has no functional use so can be removed. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27982>	2024-04-23 19:18:29 +00:00
David Heidelberg	44b080af07	meson: implement split-debug split-debug uses C args `--gsplit-dwarf` and linker args `--gdb-index` to achieve split debug, speed up the CI linking, and allow us to distribute debug symbols standalone. Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28576>	2024-04-23 18:31:39 +00:00
Juan A. Suarez Romero	9d5af35318	nir/lower_clip: update inputs/ouputs read/written bitmask Set the proper bit when adding clipdist load/store. It also sets the variable name to match with the CLIPDISTn created. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28798>	2024-04-23 17:52:09 +00:00
Samuel Pitoiset	2e79234f9d	ac: allow to use 64K of LDS for tessellation on GFX9+ This is the hardware limit and it's supposed to be working. GFX7-8 also support 64KiB but Stoney used to hang in the past and using 32KiB was the only known solution. fossils-db (NAVI21): Totals from 326 (0.41% of 79395) affected shaders: MaxWaves: 6352 -> 6378 (+0.41%); split: +0.50%, -0.09% Instrs: 232575 -> 232827 (+0.11%); split: -0.04%, +0.15% CodeSize: 1256940 -> 1258744 (+0.14%); split: -0.04%, +0.18% VGPRs: 17552 -> 17384 (-0.96%); split: -1.09%, +0.14% LDS: 2828800 -> 3899392 (+37.85%) Latency: 2937650 -> 2934667 (-0.10%); split: -0.30%, +0.20% InvThroughput: 704214 -> 700854 (-0.48%); split: -0.51%, +0.04% VClause: 4398 -> 4442 (+1.00%); split: -0.20%, +1.21% SClause: 5297 -> 5292 (-0.09%); split: -0.32%, +0.23% Copies: 14892 -> 14921 (+0.19%); split: -0.44%, +0.63% PreVGPRs: 13294 -> 13293 (-0.01%); split: -0.06%, +0.05% VALU: 156536 -> 156793 (+0.16%); split: -0.03%, +0.20% SALU: 21806 -> 21795 (-0.05%) Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28015>	2024-04-23 17:20:40 +00:00
Samuel Pitoiset	fb323ae46b	radv: rework the number of tess patches computation This uses the same helper as RadeonSI which seems more robust and more optimal (eg. it reduces the number of patches to increase occupancy). fossils-db (NAVI21): Totals from 638 (0.80% of 79395) affected shaders: MaxWaves: 13182 -> 13142 (-0.30%) Instrs: 419446 -> 419322 (-0.03%); split: -0.08%, +0.05% CodeSize: 2261408 -> 2261200 (-0.01%); split: -0.06%, +0.05% VGPRs: 32560 -> 32600 (+0.12%) LDS: 4648960 -> 5343232 (+14.93%); split: -1.67%, +16.61% Latency: 4812105 -> 4811141 (-0.02%); split: -0.04%, +0.02% InvThroughput: 1159924 -> 1153998 (-0.51%); split: -0.60%, +0.09% VClause: 7837 -> 7871 (+0.43%); split: -0.36%, +0.79% SClause: 9378 -> 9381 (+0.03%); split: -0.21%, +0.25% Copies: 28451 -> 28211 (-0.84%); split: -0.97%, +0.13% PreVGPRs: 25404 -> 25411 (+0.03%); split: -0.06%, +0.09% VALU: 278086 -> 277975 (-0.04%); split: -0.11%, +0.07% SALU: 43657 -> 43617 (-0.09%) Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28015>	2024-04-23 17:20:40 +00:00
Samuel Pitoiset	758e6d9005	ac,radeonsi: add helpers to compute the number of tess patches/lds size Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28015>	2024-04-23 17:20:40 +00:00
Samuel Pitoiset	8b8d194bfb	radv: advertise VK_EXT_nested_command_buffer Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28826>	2024-04-23 16:41:57 +00:00
Samuel Pitoiset	7de95e7742	radv: track if nested command buffers uses indirect draws IB2 packets should be avoided when a cmdbuf executes nested cmdbufs. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28826>	2024-04-23 16:41:57 +00:00
Samuel Pitoiset	0d18a2f4fb	radv/amdgpu: do not use IB2 for nested command buffers This should be enough to support executing nested command buffers. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28826>	2024-04-23 16:41:56 +00:00
José Roberto de Souza	1763d1aab1	iris: Avoid allocation of not needed iris_bucket_cache Following the previous patch and allocating just the number of iris_bucket_cache that will be used by giving platform. While at it also adding util_vma_heap_finish() call in the iris_bufmgr_create() error path. Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28864>	2024-04-23 15:59:01 +00:00
José Roberto de Souza	c473a156dc	iris: Avoid creation of slabs and cache buckets of lmem heaps in integrated gpus It was allocating slabs and cache buckets data structs of lmem heaps but those will never be used in integrated gpus, so lets avoid waste cpu time and memory with those. This will also remove slabs and cache buckets for IRIS_HEAP_DEVICE_LOCAL_CPU_VISIBLE_SMALL_BAR for discrete GPUs in systems with resizeble bar enabled. Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28864>	2024-04-23 15:59:01 +00:00
José Roberto de Souza	a51c64ac5c	iris: Add comments to BO_ALLOC flags Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28864>	2024-04-23 15:59:01 +00:00
Connor Abbott	7a1779edc7	ir3: Don't pack FS inlocs Thanks to transform feedback, we don't know which varying components will be used when compiling the FS. The VS could use additional components for xfb, and packing the inlocs per-component would result in overlapping varyings. In order to do this properly, we'd need to create a variant for the FS when used with xfb. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28626>	2024-04-23 15:22:19 +00:00
Connor Abbott	56607fafc2	ir3: Don't use non-contiguous component masks for FS I think this isn't necessary, and when we disable packing inlocs we will start actually using the compmask computed here tests like KHR-Single-GL46.enhanced_layouts.varying_components on zink will fail unless we add the extra unused components at the beginning. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28626>	2024-04-23 15:22:19 +00:00
Bas Nieuwenhuizen	d0c4b9144a	radv: Fix differing aspect masks for multiplane image copies. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11050 CC: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28867>	2024-04-23 13:11:49 +00:00
Rhys Perry	37e9e8b06c	aco: split vop3p results Removes copies in the case of: a = fmul b = fmul c = vec4(a.x, a.y, b.x, b.y) fossil-db (navi31): Totals from 21 (0.03% of 79395) affected shaders: Instrs: 96481 -> 96338 (-0.15%) CodeSize: 548452 -> 548196 (-0.05%); split: -0.13%, +0.09% Latency: 1514460 -> 1514238 (-0.01%); split: -0.02%, +0.00% InvThroughput: 683048 -> 682942 (-0.02%); split: -0.02%, +0.00% VClause: 1611 -> 1613 (+0.12%) Copies: 21326 -> 21190 (-0.64%) Branches: 2427 -> 2426 (-0.04%) PreVGPRs: 2289 -> 2298 (+0.39%) VALU: 59090 -> 58954 (-0.23%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28763>	2024-04-23 12:31:59 +00:00
Rhys Perry	88e03feb27	aco: schedule LDS instructions fossil-db (navi31): Totals from 1823 (2.30% of 79395) affected shaders: MaxWaves: 53845 -> 53827 (-0.03%); split: +0.02%, -0.05% Instrs: 1736317 -> 1731200 (-0.29%); split: -0.38%, +0.09% CodeSize: 8876760 -> 8857908 (-0.21%); split: -0.29%, +0.08% VGPRs: 91688 -> 92276 (+0.64%); split: -0.03%, +0.67% Latency: 11743095 -> 11698872 (-0.38%); split: -0.42%, +0.04% InvThroughput: 2070526 -> 2067440 (-0.15%); split: -0.17%, +0.02% VClause: 39048 -> 39058 (+0.03%); split: -0.01%, +0.03% SClause: 35371 -> 35406 (+0.10%); split: -0.02%, +0.12% Copies: 104335 -> 104384 (+0.05%); split: -0.21%, +0.26% Branches: 29769 -> 29794 (+0.08%); split: -0.00%, +0.09% VALU: 970925 -> 970974 (+0.01%); split: -0.01%, +0.02% SALU: 146222 -> 146345 (+0.08%); split: -0.01%, +0.09% VOPD: 1119 -> 1162 (+3.84%); split: +4.29%, -0.45% fossil-db (navi21): Totals from 37078 (46.70% of 79395) affected shaders: MaxWaves: 990093 -> 990025 (-0.01%) Instrs: 21130662 -> 21182543 (+0.25%); split: -0.01%, +0.26% CodeSize: 110205364 -> 110415032 (+0.19%); split: -0.01%, +0.20% VGPRs: 1407168 -> 1410768 (+0.26%) Latency: 90024839 -> 89929196 (-0.11%); split: -0.11%, +0.01% InvThroughput: 17170356 -> 17167412 (-0.02%); split: -0.02%, +0.00% VClause: 392830 -> 392825 (-0.00%); split: -0.01%, +0.01% SClause: 463150 -> 463188 (+0.01%); split: -0.00%, +0.01% Copies: 1768433 -> 1768483 (+0.00%); split: -0.02%, +0.02% Branches: 605989 -> 606011 (+0.00%); split: -0.00%, +0.00% VALU: 11614810 -> 11614912 (+0.00%); split: -0.00%, +0.00% SALU: 3794531 -> 3794655 (+0.00%); split: -0.00%, +0.00% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28763>	2024-04-23 12:31:59 +00:00
Rhys Perry	0ee4fa33bc	aco: schedule LDSDIR instructions fossil-db (navi31): Totals from 33850 (42.63% of 79395) affected shaders: MaxWaves: 1011236 -> 1011204 (-0.00%) Instrs: 23589117 -> 23559185 (-0.13%); split: -0.21%, +0.08% CodeSize: 126099716 -> 125968376 (-0.10%); split: -0.17%, +0.07% VGPRs: 1348632 -> 1356012 (+0.55%); split: -0.09%, +0.63% Latency: 183233795 -> 180997751 (-1.22%); split: -1.33%, +0.11% InvThroughput: 27081576 -> 27056383 (-0.09%); split: -0.15%, +0.06% VClause: 386453 -> 386551 (+0.03%); split: -0.11%, +0.13% SClause: 811941 -> 813023 (+0.13%); split: -0.38%, +0.52% Copies: 1279706 -> 1280051 (+0.03%); split: -0.46%, +0.49% Branches: 416940 -> 416938 (-0.00%); split: -0.02%, +0.02% VALU: 13566410 -> 13567367 (+0.01%); split: -0.04%, +0.04% SALU: 1835804 -> 1835652 (-0.01%); split: -0.02%, +0.01% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11013 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28763>	2024-04-23 12:31:59 +00:00
Rhys Perry	0bc8a9be67	aco: make store clauses more aggressively Apparently this significantly improves performance of a radeonsi resolve shader. fossil-db (navi31): Totals from 2372 (2.99% of 79395) affected shaders: MaxWaves: 59903 -> 59863 (-0.07%) Instrs: 3508838 -> 3506178 (-0.08%); split: -0.10%, +0.02% CodeSize: 18516272 -> 18505956 (-0.06%); split: -0.07%, +0.02% VGPRs: 152708 -> 154604 (+1.24%) Latency: 27881253 -> 27861445 (-0.07%); split: -0.07%, +0.00% InvThroughput: 4076649 -> 4076220 (-0.01%); split: -0.03%, +0.02% VClause: 92696 -> 89409 (-3.55%); split: -3.55%, +0.01% Copies: 310787 -> 311697 (+0.29%); split: -0.03%, +0.32% VALU: 1891048 -> 1891933 (+0.05%); split: -0.01%, +0.05% VOPD: 2534 -> 2559 (+0.99%); split: +1.07%, -0.08% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11014 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28763>	2024-04-23 12:31:59 +00:00
Rhys Perry	1bce498bbf	aco: include LDSDIR in latency/etc stats Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28763>	2024-04-23 12:31:59 +00:00
Iago Toral Quiroga	6c73c9bb16	v3d/simulator: size counter_values array correctly on V3D 7.x sim_state.perfcnt_total provides the total number of counters supported by the underlying simulated platform and is what we use when we create a perform to validate that the counters requested are valid, so we should use this. V3D_PERFCNT_NUM is a fixed enum value that is only valid for V3D 4.2 at present and is not sufficiently large for all the counters available in V3D 7.x. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28870>	2024-04-23 11:20:08 +00:00
Tomeu Vizoso	0c0d62ba70	etnaviv/nn: Implement zero run length encoding of weights Check how much smaller can the weight+bias buffers be with different amount of bits to encode runs of zeroes and choose the smallest one. This reduces the bandwidth considerably, which is at present the bottleneck with useful models. On a Libre Computer Alta AML-A311D-CC, I see these improvements: MobileNetV1: 15.650ms -> 9.991ms SSDLite MobileDet: 56.149ms -> 32.692ms Acked-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27513>	2024-04-23 10:55:24 +02:00
Erik Faye-Lund	1e78d9aaca	panfrost: use util_debug_message for perf_debug This way, applications can get to know about performance issues when they happen, using the debug callback mechanism. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28693>	2024-04-23 10:09:41 +02:00
Erik Faye-Lund	ef4c6e9345	panfrost: perf_debug_ctx -> perf_debug Now that we only call one of these, the other one is superfluous. So let's combine them and use the shorter name for the result. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28693>	2024-04-23 10:09:37 +02:00
Erik Faye-Lund	7655257c82	panfrost: use perf_debug_ctx instead of perf_debug This allows us to use perf_debug_ctx() instead of perf_debug(), which will help make things a bit cleaner down the line. In order to do this, we also need to make sure we always have access to the context, so let's also pass ctx to panfrost_should_linear_convert while we're at it. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28693>	2024-04-23 10:09:32 +02:00
Samuel Pitoiset	e4f945cd4a	vulkan: pass cmdbuf level to vk_command_buffer_ops::create() RADV needs to know the command buffer level in the create() helper. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28861>	2024-04-23 06:33:31 +00:00
Christian Gmeiner	1fb9e67f7e	etnaviv: drm: Drop NPU-related params All of the NPU related DRM_ETNAVIV_GET_PARAM values, which got introduced in 6.9-rc1 of the kernel got removed before the 6.9 release. Clean-up our code base. NPU support _NEEDS_ hwdb support and a recent stable kernel. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Tomeu Vizoso <tomeu@tomeuvizoso.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28837>	2024-04-23 05:39:57 +00:00

1 2 3 4 5 ...

188274 Commits All Branches Search

188274 Commits

All Branches