KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Marek Olšák	a9f7744cfe	radeonsi: rework how vs_state_bits is set and unpacked Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	c9c7dcb619	radeonsi: rename and regroup VS_STATE definitions Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	091617002f	radeonsi: rework how VS_STATE_BITS are set for VS, TES, and GS We need more GS/NGG bits, so we need to add current_gs_state for that. This simplifies the logic in the draw code. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	928e5f240d	radeonsi: simplify how pipeline statistic offsets are computed Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	39800f0fa3	amd: change chip_class naming to "enum amd_gfx_level gfx_level" This aligns the naming with PAL. Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Pierre-Eric Pellou-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16469>	2022-05-13 14:56:22 -04:00
Pierre-Eric Pelloux-Prayer	f8d205c400	radeonsi: fix gs_invocation query with NGG When NGG is active, the GS invocation counter is always incremented, even if there's no explicit GS. Implementing the counter manually fixes it: * in emit_gs_epilogue for the legacy path * in gfx10_ngg_gs_emit_prologue for the ngg path This fixes piglit's arb_query_buffer_object-qbo test. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15861>	2022-05-12 09:16:11 +02:00
Pierre-Eric Pelloux-Prayer	061f64f351	radeonsi/ngg: reuse the pipeline stats buffer when using atomics To support PIPE_STAT_QUERY_GS_INVOCATIONS and PIPE_STAT_QUERY_GS_PRIMITIVES being used at the same time we have to reuse the same buffer. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15861>	2022-05-12 09:16:11 +02:00
Pierre-Eric Pelloux-Prayer	0cb6fd0b00	radeonsi/query: use the qbo correct size Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15861>	2022-05-12 09:16:11 +02:00
Pierre-Eric Pelloux-Prayer	38e8a73e14	radeonsi: implement GL_GEOMETRY_SHADER_PRIMITIVES_EMITTED_ARB in shaders Statistics only work in non-NGG mode. If screen->use_ngg is true, we can't know if the draw will actually use NGG or not, so this commit switch to a shader based implementation of this counter. To avoid modifying si_query, the shader implementation behaves like the hw one: it uses the same buffer size and offset. The emulation path activation in the shader is controlled by vs_state_bit[31]. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15861>	2022-05-12 09:16:11 +02:00
Pierre-Eric Pelloux-Prayer	bbaf4f1954	radeonsi: store the pipeline stats index Will be used in later commits. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15861>	2022-05-12 09:16:11 +02:00
Pierre-Eric Pelloux-Prayer	637f09f10e	radeonsi: deduplicate query offsets Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15861>	2022-05-12 09:16:11 +02:00
Yogesh Mohanmarimuthu	6e537680c4	radeonsi/gfx11: use PIXEL_PIPE_STATE_DUMP event instead of ZPASS_DONE Use PIXEL_PIPE_STATE_CONTROL/DUMP event instead of ZPASS_DONE for gfx11. Signed-off-by: Yogesh Mohanmarimuthu <yogesh.mohanmarimuthu@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16328>	2022-05-10 04:29:55 +00:00
Dave Airlie	127bcbed18	gallivm/st/lvp: add flags arg to get_query_result_resource api. Currently this just has wait, but in order to get the right answer for vulkan partial, lavapipe/llvmpipe need to pass a partial flag through here in the future. This just changes the API so that's possible. v2: use an enum (zmike) Acked-by: Ilia Mirkin <imirkin@alum.mit.edu> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15009>	2022-02-15 10:12:01 +10:00
Marek Olšák	61bd8ec043	gallium/radeon: merge BO read/write usage flags with priority flags Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13478>	2021-10-29 06:54:21 +00:00
Marek Olšák	57bb89fdc5	radeonsi: remove the unused cs parameter from radeon_emit Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13015>	2021-09-25 08:32:03 +00:00
Marek Olšák	576f8394db	radeonsi: remove the primitive discard compute shader It doesn't always work, it's only useful on gfx9 and older, and it's too complicated. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4011 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12812>	2021-09-10 23:32:03 +00:00
Samuel Pitoiset	380ac28891	ac: import performance counters from RadeonSI Performance counters will be used by RADV for VK_KHR_performance_query and also for adding SPM support. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11140>	2021-06-03 07:15:21 +00:00
Marek Olšák	f9b527a9a5	radeonsi: unify internal compute with SSBOs in si_launch_grid_internal_ssbos just deduplicate the code Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10003>	2021-04-13 03:17:42 +00:00
Marek Olšák	65495e6caa	radeon_winsys.h: add a winsys parameter to most winsys buffer functions This will allow removing the winsys pointer from buffers. The amdgpu winsys adds dummy_ws to get radeon_winsys because there can be no radeon_winsys around (e.g. while amdgpu_winsys is being destroyed), but we still need some way to call buffer functions. Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9809>	2021-04-06 22:31:15 +00:00
Marek Olšák	c53261645d	radeonsi: add SI_CONTEXT_PFP_SYNC_ME to skip syncing PFP for image operations DCC/CMASK/HTILE clears will not set this. We could do a better job at not setting this in other cases too Image copies also don't set this. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	965c6445ad	winsys/amdgpu,radeonsi: add HUD counters for how much memory is wasted by slabs Slabs always allocate the next power of two size from their pools. This wastes memory if the size is not a power of two. bo->base.size is overwritten because the default is the allocated power of two size, but we need the real size to compute the wasted size in amdgpu_bo_slab_destroy. entry_size is added to the hole in pb_slab_entry to hold the real entry size. Like other memory stats, no atomics are used. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8683>	2021-02-03 21:53:33 +00:00
Marek Olšák	a51d4b10f1	gallium: add take_ownership param into set_constant_buffer to eliminate atomics We often do this: pipe->set_constant_buffer(pipe, shader, slot, &cb); pipe_resource_reference(&cb->buffer, NULL); That results in atomic increment in set_constant_buffer followed by atomic decrement after set_constant_buffer. This new interface eliminates those atomics. For the case above, this should be used instead: pipe->set_constant_buffer(pipe, shader, slot, true, &cb); cb->buffer = NULL; // if cb is not a local variable, else do nothing AMD Zen benefits from this. The perf improvement is ~3% for Viewperf13/Catia. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8298>	2021-01-27 23:53:34 +00:00
Marek Olšák	ca2062a394	radeonsi: simplify determining whether render condition is enabled at draw time Read one bool instead of reading one bool and one pointer. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8653>	2021-01-22 16:45:30 +00:00
Marek Olšák	a0978fffb8	radeonsi: add new possibly faster command submission helpers This decreases the release libgallium_dri.so size without debug symbols by 16384 bytes. The CPU time spent in si_emit_draw_packets decreased from 4.5% to 4.1% in viewperf13/catia/plane01. The previous code did: cs->current.buf[cs->current.cdw++] = ...; cs->current.buf[cs->current.cdw++] = ...; cs->current.buf[cs->current.cdw++] = ...; cs->current.buf[cs->current.cdw++] = ...; The new code does: unsigned num = cs->current.cdw; uint32_t *buf = cs->current.buf; buf[num++] = ...; buf[num++] = ...; buf[num++] = ...; buf[num++] = ...; cs->current.cdw = num; The code is the same (radeon_emit is redefined as a macro) except that all set and emit functions must be surrounded by radeon_begin(cs) and radeon_end(). radeon_packets_added() returns whether there has been any new packets added since radeon_begin. radeon_end_update_context_roll(sctx) sets sctx->context_roll = true if there has been any new packets added since radeon_begin. For now, the "cs" parameter is intentionally unused in radeon_emit and radeon_emit_array. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8653>	2021-01-22 16:45:29 +00:00
Marek Olšák	73709143d2	radeonsi: remove MRT-draw-calls, spill-draw-calls, spill-compute-calls due to limited usefulness and overhead in si_draw_vbo. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8548>	2021-01-18 01:17:19 +00:00
Rob Clark	790144e65a	util+treewide: container_of() cleanup Replace mesa's slightly different container_of() with one more aligned to the linux kernel's version which takes a type as the 2nd param. This avoids warnings like: freedreno_context.c:396:44: warning: variable 'batch' is uninitialized when used within its own initialization [-Wuninitialized] At the same time, we can add additional build-time type-checking asserts Signed-off-by: Rob Clark <robdclark@chromium.org> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7941>	2020-12-10 16:48:36 +00:00
Marek Olšák	1f31a21664	radeonsi: remove SDMA support There are many issues with SDMA across many generations of hardware. A recent example is that gfx10.3 suffers from random GPU hangs if userspace uses SDMA. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7908>	2020-12-09 00:52:26 +00:00
Marek Olšák	5b81194fee	radeonsi: rename buffer functions so as not to reference rings Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7908>	2020-12-09 00:52:26 +00:00
Indrajit Kumar Das	6df572532d	radeonsi/gfx10: added support for gfx10 conditional rendering Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7526>	2020-12-05 16:11:28 +00:00
Marek Olšák	3bd9db5be3	r300,r600,radeonsi: inline struct radeon_cmdbuf to remove dereferences It's straightforward except that the amdgpu winsys had to be cleaned up to allow this. radeon_cmdbuf is inlined and optionally the winsys can save the pointer to it. radeon_cmdbuf::priv points to the winsys cs structure. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7907>	2020-12-05 10:52:17 -05:00
Marek Olšák	8904fcca6d	gallium: inline struct u_suballocator to remove dereferences Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7901>	2020-12-03 21:41:19 +00:00
Pierre-Eric Pelloux-Prayer	ead225bb6f	radeonsi: update fallthrough comments Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7747>	2020-12-01 10:04:41 +01:00
Marek Olšák	603b5340b9	ac: rename num_render_backends -> max_render_backends Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7542>	2020-11-18 06:19:59 +00:00
Marek Olšák	7cc939f7dd	radeonsi: add num_draws parameter into si_need_gfx_cs_space Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7056>	2020-10-31 00:18:11 +00:00
Marek Olšák	22253e6b65	gallium: rename PIPE_TRANSFER_* -> PIPE_MAP_* Acked-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5749>	2020-09-22 03:20:54 +00:00
Timothy Arceri	cb5fafd617	radeonsi: add missing fallthrough comment Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5705>	2020-07-02 23:52:52 +00:00
Marek Olšák	71794567f9	radeonsi: remove tabs Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5603>	2020-06-26 07:02:57 +00:00
Pierre-Eric Pelloux-Prayer	d7008fe46a	radeonsi: switch to 3-spaces style Generated automatically using clang-format and the following config: AlignAfterOpenBracket: true AlignConsecutiveMacros: true AllowAllArgumentsOnNextLine: false AllowShortCaseLabelsOnASingleLine: false AllowShortFunctionsOnASingleLine: false AlwaysBreakAfterReturnType: None BasedOnStyle: LLVM BraceWrapping: AfterControlStatement: false AfterEnum: true AfterFunction: true AfterStruct: false BeforeElse: false SplitEmptyFunction: true BinPackArguments: true BinPackParameters: true BreakBeforeBraces: Custom ColumnLimit: 100 ContinuationIndentWidth: 3 Cpp11BracedListStyle: false Cpp11BracedListStyle: true ForEachMacros: - LIST_FOR_EACH_ENTRY - LIST_FOR_EACH_ENTRY_SAFE - util_dynarray_foreach - nir_foreach_variable - nir_foreach_variable_safe - nir_foreach_register - nir_foreach_register_safe - nir_foreach_use - nir_foreach_use_safe - nir_foreach_if_use - nir_foreach_if_use_safe - nir_foreach_def - nir_foreach_def_safe - nir_foreach_phi_src - nir_foreach_phi_src_safe - nir_foreach_parallel_copy_entry - nir_foreach_instr - nir_foreach_instr_reverse - nir_foreach_instr_safe - nir_foreach_instr_reverse_safe - nir_foreach_function - nir_foreach_block - nir_foreach_block_safe - nir_foreach_block_reverse - nir_foreach_block_reverse_safe - nir_foreach_block_in_cf_node IncludeBlocks: Regroup IncludeCategories: - Regex: '<[[:alnum:].]+>' Priority: 2 - Regex: '.*' Priority: 1 IndentWidth: 3 PenaltyBreakBeforeFirstCallParameter: 1 PenaltyExcessCharacter: 100 SpaceAfterCStyleCast: false SpaceBeforeCpp11BracedList: false SpaceBeforeCtorInitializerColon: false SpacesInContainerLiterals: false Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4319>	2020-03-30 11:05:52 +00:00
Marek Olšák	0366c8c5b7	radeonsi: expose shader cache stats to the HUD Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/2929> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/2929>	2020-01-24 20:29:29 -05:00
Marek Olšák	c046551e60	radeonsi: print shader cache stats with AMD_DEBUG=cache_stats Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/2929>	2020-01-24 20:29:29 -05:00
Timothy Arceri	c976b427c4	util: remove LIST_DELINIT macro Just use the inlined function directly. The macro was replaced with the function in `ebe304fa54`. Reviewed-by: Eric Engestrom <eric@engestrom.ch>	2019-10-28 11:24:38 +00:00
Timothy Arceri	255de06c59	util: remove LIST_ADDTAIL macro Just use the inlined function directly. The macro was replaced with the function in `ebe304fa54`. Reviewed-by: Eric Engestrom <eric@engestrom.ch>	2019-10-28 11:24:38 +00:00
Timothy Arceri	7ae1be1028	util: remove LIST_INITHEAD macro Just use the inlined function directly. The macro was replaced with the function in `ebe304fa54`. Reviewed-by: Eric Engestrom <eric@engestrom.ch>	2019-10-28 11:24:38 +00:00
Marek Olšák	91227a1e17	radeonsi/gfx10: add global use_ngg and use_ngg_streamout flags Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-06 17:09:02 -04:00
Ilia Mirkin	0e30c6b8a7	gallium: switch boolean -> bool at the interface definitions This is a relatively minimal change to adjust all the gallium interfaces to use bool instead of boolean. I tried to avoid making unrelated changes inside of drivers to flip boolean -> bool to reduce the risk of regressions (the compiler will much more easily allow "dirty" values inside a char-based boolean than a C99 _Bool). This has been build-tested on amd64 with: Gallium drivers: nouveau r300 r600 radeonsi freedreno swrast etnaviv v3d vc4 i915 svga virgl swr panfrost iris lima kmsro Gallium st: mesa xa xvmc xvmc vdpau va Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-07-22 22:13:51 -04:00
Nicolai Hähnle	792a638b03	radeonsi/gfx10: implement streamout-related queries The NGG hardware pipeline doesn't track these statistics automatically, and in fact cannot track them automatically when API geometry shaders are involved, so we accumulate statistics in the shader using atomic adds. This implementation accumulates statistics via the memory system and the RW buffer descriptor setup. We could use GDS, but since these atomics aren't latency-sensitive, that basically just trades off L2$ bandwidth vs. export bus bandwidth. One single memory transaction per shader workgroup doesn't seem too bad. The result ring buffer in memory is needed either way to avoid pipeline stalls. The shader code contains the atomic unconditionally, though the GFX10_GS_QUERY_BUF is a null buffer when no queries are active. The atomic is simply discarded by the shader hardware in that case. Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-03 15:51:13 -04:00
Nicolai Hähnle	e241b405ca	radeonsi: pass the context to query destroy functions We'll need this in the future. Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-03 15:51:12 -04:00
Nicolai Hähnle	064f195ef0	radeonsi: make si_restore_qbo_state externally available Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-03 15:51:12 -04:00
Marek Olšák	abe9a51d27	ac: add radeon_info::is_amdgpu instead of checking drm_major == 3 and clean up Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-06-14 13:31:18 -04:00
Marek Olšák	bb5d82bd06	radeonsi: allow query functions for compute-only contexts	2019-05-27 15:26:06 -04:00

1 2

89 Commits