mirrors/mesa - Frog Git

Commit Graph

Author	SHA1	Message	Date
Mark Collins	dd19da31f2	tu: Expose VK_EXT_tooling_info using common implementation Signed-off-by: Mark Collins <mark@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18390>	2022-09-08 08:14:40 +00:00
Mark Collins	c82249aa68	tu: Clamp priority in DRM submitqueue creation The kernel driver has a range of valid priority values that can be supplied to it, submitting any priority value outside these bounds will result in `-EINVAL`. To avoid this, the priority value is now clamped to the range that the kernel supports. Fixes: `0c6fbfca0c` Signed-off-by: Mark Collins <mark@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18389>	2022-09-08 08:04:10 +00:00
Danylo Piliaiev	dda28a3e35	tu: Trivially expose VK_EXT_rasterization_order_attachment_access It is just a renamed VK_ARM_rasterization_order_attachment_access. Zink depends on it to expose KHR_blend_equation_advanced_coherent Passes GL tests via Zink: dEQP-GLES31.functional.blend_equation_advanced.* KHR-GLES31.core.blend_equation_advanced.* Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18420>	2022-09-06 16:51:54 +00:00
Danylo Piliaiev	1dfc5929d1	tu: Use newly obtained magic reg values Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18229>	2022-09-06 16:18:58 +00:00
Danylo Piliaiev	c22444ebcc	freedreno: Add all variable magic regs to device-info tables There are more magic regs which have different values between GPU subgenerations than we specified. The updated list and values where obtained by using libwrapfake with v631 blob and dEQP-VK.draw.renderpass.basic_draw.draw.triangle_list.1 vk cts test. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18229>	2022-09-06 16:18:58 +00:00
Danylo Piliaiev	df51e96c33	freedreno: Name more _DBG_ECO_CNTL regs There is known pattern of DBG_ECO_CNTL being right before *_ADDR_MODE_CNTL, name such regs that we are sure about. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18229>	2022-09-06 16:18:58 +00:00
Christian Gmeiner	912d0383b4	isaspec: Move isa_decode(..) declaration The implementation of isa_decode(..) is already part of isaspec. So lets move the function declaration and some related structs to a src/isaspec. Also make the header C++ safe. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18403>	2022-09-03 19:26:04 +00:00
David Heidelberg	250c684750	ci: uprev piglit 2022-08-30 Acked-by: Juan A. Suarez <jasuarez@igalia.com> # for broadcom Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> # for zink Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18318>	2022-09-02 20:15:28 +00:00
Emma Anholt	3ef13ef234	turnip: Treating non-d/s-write pipelines as not having d/s feedback loops. A subpass in gfxbench has the depth buffer present, but not written to, for a render pass using the depth buffer as an input attachment. We can skip single-prim-mode and the associated "oh no don't use sysmem" in that case. Improves gfxbench vk-5-normal perf by 1.56193% +/- 0.0743035% (n=14). Part of #6327. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18241>	2022-09-02 16:47:02 +00:00
Emma Anholt	08548650bd	turnip: Enable lowering of mediump temps/CS shared to 16-bit. In Aztec Ruins, we end up storing some big shared-mem arrays as 16-bit, cutting shared mem size in half across many shaders while also reducing conversions. gfxbench vk-5-normal perf +0.364983% +/- 0.189764% (n=4). fossil-db: Totals from 448 (2.99% of 14988) affected shaders: MaxWaves: 6154 -> 6390 (+3.83%); split: +3.96%, -0.13% Instrs: 174554 -> 165045 (-5.45%); split: -6.45%, +1.01% CodeSize: 364224 -> 345558 (-5.12%); split: -6.03%, +0.90% NOPs: 48224 -> 48024 (-0.41%); split: -3.33%, +2.91% MOVs: 6985 -> 6104 (-12.61%); split: -19.11%, +6.50% Full: 4577 -> 4101 (-10.40%); split: -11.08%, +0.68% (ss): 3428 -> 3335 (-2.71%); split: -4.17%, +1.46% (sy): 1250 -> 1205 (-3.60%); split: -4.72%, +1.12% (ss)-stall: 14695 -> 14528 (-1.14%); split: -2.25%, +1.12% (sy)-stall: 19565 -> 17998 (-8.01%); split: -9.55%, +1.54% STPs: 1086 -> 870 (-19.89%) LDPs: 162 -> 108 (-33.33%) Cat0: 51400 -> 51120 (-0.54%); split: -3.31%, +2.76% Cat1: 16861 -> 14688 (-12.89%); split: -18.18%, +5.30% Cat2: 71161 -> 68454 (-3.80%); split: -4.52%, +0.72% Cat3: 29572 -> 25306 (-14.43%); split: -14.49%, +0.06% Cat4: 3128 -> 3131 (+0.10%) Cat5: 1502 -> 1506 (+0.27%) Cat6: 840 -> 750 (-10.71%) aztec ruins is a big winner with the ldp/stp reductions. summoners_war racks up an astounding 41% reduction in instructions and +15% max_waves. Most affected apps show a minor win in instrs, with fallout_shelter_online, and aztec ruins on ANGLE taking minor hits. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18259>	2022-09-01 22:39:39 +00:00
Jason Ekstrand	c052c6a333	tu: Move to the common command pool framework Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18324>	2022-09-01 20:17:25 +00:00
Jason Ekstrand	44ab076fea	vulkan: Add a vk_command_buffer_ops struct This is the standard pattern in the kernel for providing vfunc tables for C objects. We're using it in the pipeline cache code but we're about to start adding more stuff and so it really helps if we have it for command buffers as well. Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18324>	2022-09-01 20:17:25 +00:00
Jason Ekstrand	5c143b132a	vulkan: Re-order arguments to vk_command_buffer_init Most other init functions follow the Vulkan API convention of putting the parent object first. Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18324>	2022-09-01 20:17:25 +00:00
Jason Ekstrand	f90e2af996	vulkan: Re-order arguments to vk_command_pool_init Most other init functions follow the Vulkan API convention of putting the parent object first. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18324>	2022-09-01 20:17:25 +00:00
Chia-I Wu	63b1290c2f	turnip: add some cpu tracepionts Add tracepoints for pipeline creations and queue submits. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18262>	2022-09-01 19:30:13 +00:00
Connor Abbott	f640661e54	tu/lrz: Fix multiple depth attachment case with secondaries This is a counterpart to the previous commit. When we have multiple depth attachments, in the secondary we currently don't disable LRZ and so we may need a valid LRZ fast-clear base. Fixes: `4b5f0d98` ("tu: Overhaul LRZ, implement on-GPU dir tracking and LRZ fast-clear") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18369>	2022-09-01 19:11:19 +00:00
Connor Abbott	bf09a5881b	tu/lrz: Fix multiple subpass case with secondaries In dEQP-VK.renderpass.dedicated_allocation.attachment_allocation.input_output.94 we have the following: - There is more than one subpass, but only one depth attachment. - The first subpass doesn't use depth. - The subpass that does use depth has a draw call in a secondary. We wouldn't hit the case where there's more than one depth attachment, but because tu_begin_resumed_renderpass() only looked at the first subpass it wouldn't find the depth attachment and would leave LRZ invalid and thus a NULL LRZ fast-clear base. Then tu_begin_secondary_cmdbuf() would leave LRZ enabled and the draw would have LRZ enabled, leading to a hang. Fix this by making tu_begin_resumed_renderpass() match tu_begin_renderpass() with how it finds the depth attachment. Fixes: `4b5f0d98` ("tu: Overhaul LRZ, implement on-GPU dir tracking and LRZ fast-clear") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18369>	2022-09-01 19:11:19 +00:00
Dave Airlie	3c092f5cd8	turnip: use common command record result. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16918>	2022-09-01 14:11:55 +00:00
Emma Anholt	4bdd226ab6	freedreno/ir3: Switch to NIR for a3xx/a4xx's vertex id lowering. We already have the compiler pass, just need to set the flag. We were the last consumer of glsl's lower_vertex_id. Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18327>	2022-08-31 22:57:03 +00:00
Emma Anholt	78fce4d9e3	freedreno/ir3: Move chip-specific nir compiler options to C code. We had a mix of common-macro-and-chip-overrides in static decls and plus more overrides later in C code. It's way cleaner to just have a static decl for the base options and chip overrides in C code. This moves a few things (lower_cs_local_index_to_id, lower_wpos_pntc, lower_int64_options) to the common static decl that had been pasted into both a3xx-a5xx and a6xx. Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18327>	2022-08-31 22:57:03 +00:00
Emma Anholt	b4aa46ff89	turnip: Make sure bandwidth config per pixel starts from 0. We'd keep incrementing the costs in a cmd buffer's dynamic_pass on each BeginRendering. This fixes the main renderpass of aztec ruins on zink to use gmem, taking fps from ~8 to ~10. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18352>	2022-08-31 22:38:04 +00:00
Konstantin Seurer	6c452aa1ad	turnip: Use the common physical device enumeration Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17750>	2022-08-30 19:34:47 +00:00
Emma Anholt	f69820468c	turnip: Fix enabling of IB2 skipping. We weren't setting LOCAL, so unless freedreno GL had set it since the GPU woke up, we wouldn't get it. This requires moving the GLOBAL unsetting out of tile_store's IB, since it would never be executed when it mattered, anyway. No perf difference detected on gfxbench vk-5-normal, or ANGLE minecraft, genshin, and pubg. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18187>	2022-08-29 18:34:10 +00:00
Emma Anholt	554cd8bcc2	freedreno/regs: Add a bit of documentation of what SKIP_IB2 does. Based on @cwabbott's research. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18187>	2022-08-29 18:34:09 +00:00
Chia-I Wu	022cd156d4	util/perf: add cpu_trace.h Move MESA_TRACE_* to the new file. Acked-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Antonio Caggiano <antonio.caggiano@collabora.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18260>	2022-08-26 21:47:44 +00:00
Chia-I Wu	19b7d5284b	util/perf: move u_perfetto to here Acked-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Antonio Caggiano <antonio.caggiano@collabora.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18260>	2022-08-26 21:47:44 +00:00
Rob Clark	4703efd23f	freedreno/drm: Add offset param for fd_bo_upload() Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18258>	2022-08-26 20:48:36 +00:00
Rob Clark	a9e79dd88a	freedreno/drm: Let backend hint about upload vs map For the virtgpu backend, immediately mmap'ing a buffer can be expensive (ie. require a sync with host), so for small transfers we'd prefer to go the upload path. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18258>	2022-08-26 20:48:36 +00:00
Chia-I Wu	b1cb764316	ir3: fix predicate splitting in scheduler Fix up src->def->instr, not src->instr. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7014 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18096>	2022-08-26 17:36:18 +00:00
Danylo Piliaiev	1eb7a85b55	tu: Update HS_WAVE_INPUT_SIZE formula A better explanation for SP_HS_WAVE_INPUT_SIZE is that it is the size of local memory to allocate per wave (which can be more than one patch), in 256B units. Then the maximum of 64 makes sense because only 16KB of local memory is reserved for VS<->HS linkage. The resulting formula matches the blob behaviour, even when patch_control_points and tcs_vertices_out have different values, while the past formula gave wrong answers on gen3+. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Suggested-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17957>	2022-08-26 15:18:42 +03:00
Danylo Piliaiev	a7db1da37d	tu: Fix streamout with tess_use_shared Mirrors `31835ac3b8` change in freedreno. Together with "tu: Fix HS input size formula for gen3+" fixes following tests from GL CTS running via Zink: dEQP-GLES31.functional.tessellation.invariance.inner_triangle_set.quads_fractional_odd_spacing dEQP-GLES31.functional.tessellation.invariance.inner_triangle_set.triangles_fractional_odd_spacing dEQP-GLES31.functional.tessellation.invariance.primitive_set.triangles_fractional_odd_spacing_ccw dEQP-GLES31.functional.tessellation.invariance.primitive_set.triangles_fractional_odd_spacing_cw dEQP-GLES31.functional.tessellation.invariance.triangle_set.triangles_fractional_odd_spacing dEQP-GLES31.functional.tessellation.primitive_discard.quads_fractional_odd_spacing_ccw dEQP-GLES31.functional.tessellation.primitive_discard.quads_fractional_odd_spacing_cw dEQP-GLES31.functional.tessellation.primitive_discard.triangles_fractional_odd_spacing_ccw dEQP-GLES31.functional.tessellation.primitive_discard.triangles_fractional_odd_spacing_cw Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17957>	2022-08-26 15:14:10 +03:00
Danylo Piliaiev	0120e7b9d9	freedreno: PC_SO_STREAM_CNTL_STREAM_ENABLE has per-stream enable bits PC_SO_STREAM_CNTL.STREAM_ENABLE mirrors VPC_SO_STREAM_CNTL.STREAM_ENABLE Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17957>	2022-08-26 15:14:10 +03:00
Danylo Piliaiev	0bf2033e0d	tu: Implement VK_EXT_attachment_feedback_loop_layout Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18064>	2022-08-26 10:29:00 +00:00
Chia-I Wu	6abadd27ec	turnip: improve tracing of secondary cmd buffers This visualizes secondary cmd buffers in perfetto. I did not test dynamic rendering, which appears to call tu_clone_trace_range already. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18238>	2022-08-25 21:00:14 +00:00
Chia-I Wu	4b37439764	turnip: add cmd_buffer tracepoint It is only used for primary cmd buffers for the moment. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18238>	2022-08-25 21:00:14 +00:00
Chia-I Wu	4c03e40991	turnip: rename some tracing stages Rename SURFACE_STAGE_ID to RENDER_PASS_STAGE_ID. Indicate whether gmem or bypass is used in the stage name. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18238>	2022-08-25 21:00:14 +00:00
Chia-I Wu	57d2d75237	turnip: clean up tu_perfetto.h Move enums, stages, queues, and some function declarations to tu_perfetto.cc. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18238>	2022-08-25 21:00:14 +00:00
Chia-I Wu	c867753c94	turnip: convert tu_perfetto_state to a stack A stage does not end until its nested stages end. tu_perfetto_state can be a stack. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18238>	2022-08-25 21:00:14 +00:00
Chia-I Wu	c6d488814c	turnip: add tu_clone_trace_range helper Remove some duplicated code. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18238>	2022-08-25 21:00:14 +00:00
Chia-I Wu	0d57cf8cad	turnip: tidy up tracepoint header includes Remove unused util/u_dump.h. Add missing forward declarations. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18238>	2022-08-25 21:00:14 +00:00
Chia-I Wu	b1ba0791e3	turnip: fix gem_store tracepoint Set cmd->trace_renderpass_end after tu6_emit_tile_store in case of gmem. To be able to do that, we push the update of cmd->trace_renderpass_end down into tu_cmd_render_tiles/tu_cmd_render_sysmem. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18238>	2022-08-25 21:00:14 +00:00
Chia-I Wu	f539bd7a03	turnip: move trace_start_gmem_store before cond exec Suggested by Danylo. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18238>	2022-08-25 21:00:14 +00:00
Chia-I Wu	3f045bd176	turnip: fix a missing trace_end_gmem_clear Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18238>	2022-08-25 21:00:14 +00:00
Chia-I Wu	91a0411d2a	turnip: improve perfetto sync_timestamp tu_device_get_gpu_timestamp takes >100us on my otherwise idle sc7180. Read the cpu block again after the call returns. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18238>	2022-08-25 21:00:13 +00:00
Eric Engestrom	c66622de3a	meson: replace manual compiler flags with meson arguments These would only have worked in GCC and Clang, which so far wasn't an issue, but let's clean it up anyway. Cc: mesa-stable Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18190>	2022-08-24 22:13:19 +00:00
Mike Blumenkrantz	1e7a131fd1	tu: fix invalid free on alloc failure this is not an allocated pointer cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18230>	2022-08-24 17:29:53 +00:00
Mark Collins	9e31b0fba1	tu: Allocate private memory per-device instead of per-pipeline It was discovered that a substantial amount (several GiB) of private memory was being used by Skyline Emulator as it used a substantial amount of pipelines with private memory that were never deleted throughout the lifetime of the application. These private memory allocations are now pooled into per-device BOs shared among several pipelines instead of a single BO for every pipeline, this reduces the memory footprint of private memory allocations from several GiB to 8 MiB in Skyline Emulator on certain titles. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7033 Signed-off-by: Mark Collins <mark@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18073>	2022-08-23 08:07:58 +00:00
Danylo Piliaiev	c9d9155787	tu: Disable userspace iova allocations If BO is freed while kernel considers it busy, our VMA state gets desynchronized from kernel's VMA state because kernel waits until BO stops being busy. And whether BO is busy kernel decides at submission granularity. On the other hand in Vulkan we may free resource as soon as we know it won't be used. Not completely reverting the changes in hope that proper resolution would be found soon. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7106 Fixes: `e23c4fbd9b` ("tu: Switch to userspace iova allocations if kernel supports it") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18201>	2022-08-22 23:28:30 +00:00
Emma Anholt	d0401f02fd	ci/tu: Add another cwe_after_bind fail that appeared in the reshuffle. gmem is a fractional run of the full caselist, and one of them showed up crashing on a630_vk_full after the deqp-runner uprev. Add all of them so we don't fail on the next reshuffle either. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17932>	2022-08-22 22:26:29 +00:00
Emma Anholt	b0a74776d1	tu: Emit only as many VBs as we've ever seen bound on the command buffer. A later CmdBindPipeline would shrink the two draw states' sizes to the number of VBs the pipeline actually uses, but we can save some CPU overhead and memory by not emitting all the unused VBs as well. Improves zink drawoverhead throughput on test 5 (1 VB change) by 38.5178% +/- 0.48738% (n=18). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17932>	2022-08-22 22:26:29 +00:00

1 2 3 4 5 ...

3591 Commits