KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Marek Olšák	0e64252912	radeonsi: add AMD_DEBUG=ib to print IBs Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12812>	2021-09-10 23:32:03 +00:00
Marek Olšák	576f8394db	radeonsi: remove the primitive discard compute shader It doesn't always work, it's only useful on gfx9 and older, and it's too complicated. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4011 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12812>	2021-09-10 23:32:03 +00:00
Marek Olšák	34a2c75310	radeonsi: enable DCC stores on gfx10.3 APUs for better performance There is just one hw bug that we need to handle. NO_DCC_FB was unused. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12449>	2021-09-01 07:51:30 +00:00
Marek Olšák	8c845d4cb4	radeonsi: rename DCC_WRITE -> ALLOW_DCC_STORE Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12449>	2021-09-01 07:51:30 +00:00
Marek Olšák	6cb2f07e90	radeonsi: add si_print_current_ib function for debugging Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12656>	2021-09-01 00:42:58 +00:00
Marek Olšák	9fb77745f5	radeonsi: inline si_need_gfx_cs_space Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12656>	2021-09-01 00:42:58 +00:00
Marek Olšák	b15c413947	radeonsi: simplify memory usage checking by merging vram and gtt counters no change in behavior Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12656>	2021-09-01 00:42:58 +00:00
Marek Olšák	c005b2cd4b	radeonsi: move as_ls/es/ngg setting out of si_shader_selector_key Do it when we bind shaders. The advantages are: - no need to memset the fields when any shader variant state is changed (e.g. culling on/off) - no need to recompute the fields every time that happens Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12656>	2021-09-01 00:42:57 +00:00
Marek Olšák	08310f85ae	radeonsi: remove instancing support from the prim discard compute shader It's not important for workstation apps on Vega. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12656>	2021-09-01 00:42:57 +00:00
Marek Olšák	10a46226b1	gallium: remove vertices_per_patch, add pipe_context::set_patch_vertices We would like draw-only display lists to have immutable draw info and this is the only GL non-draw state in pipe_draw_info (not counting view_mask). It also allows removing some code from draw_vbo for tessellation. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12351>	2021-08-21 00:08:11 +00:00
Marek Olšák	6fc38d3b07	radeonsi: allow arbitrary swizzle modes for displayable DCC by adding retile shader variants Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12430>	2021-08-20 14:28:36 +00:00
Marek Olšák	59fe704c45	gallium: simplify VRAM uploads by adding PIPE_RESOURCE_FLAG_DONT_MAP_DIRECTLY When this flag is set, u_threaded_context will try not to map it directly for better buffer placement. It's set by drivers when visible VRAM is too small. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12257>	2021-08-09 11:58:48 +00:00
Marek Olšák	6546f28cc8	radeonsi: drop smoothing quality to 4xAA for better performance Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11754>	2021-07-08 18:37:41 +00:00
Marek Olšák	b141e50282	radeonsi: add optimal multi draws and draw-level splitting for prim discard CS This is a partial rewrite of some parts of the code. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11510>	2021-06-28 13:23:14 +00:00
Marek Olšák	9fa0d2cf35	radeonsi: change how the prim discard CS is enabled and splitting limits Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11510>	2021-06-28 13:23:14 +00:00
Marek Olšák	06da711350	radeonsi: remove the GDS variants of compute-based primitive discard The GDS ordered append variant is unstable due to kernel and firmware bugs. The unordered GDS variant isn't faster than the memory-based variant. Only the memory-based variant is kept. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11510>	2021-06-28 13:23:14 +00:00
Marek Olšák	a448074d05	radeonsi: don't compile TES and GS draw_vbo variants for the prim discard CS This also fixes the incorrect emit_draw_packets template argument. The condition should be inverted. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11102>	2021-06-21 19:03:29 +00:00
Marek Olšák	72a395b6de	radeonsi: remove the chip_class dimension from the draw_vbo array We don't use/initialize draw_vbo callbacks for other generations anymore. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11384>	2021-06-16 21:29:13 +00:00
Marek Olšák	24895f020a	radeonsi: move a few functions from si_state_draw.cpp into si_gfx_cs.c Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11384>	2021-06-16 21:29:13 +00:00
Pierre-Eric Pelloux-Prayer	83250036be	radeonsi/nir: add si_nir_is_output_const_if_tex_is_const Determine if a given shader write the same constant value to its output if a specific input texture is replaced by constant load. It's done by checking if the store_output intrinsics only depends on constant and a texture. If it's true, the given texture is replaced by a constant load in cloned shader and this clone is optimized. Then the output is checked (= is it constant or not). Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10979>	2021-06-15 11:18:02 +02:00
Pierre-Eric Pelloux-Prayer	b2bd9c5ccd	radeonsi: add si_install_draw_wrapper This allows to implement custom draw_vbo code-path without touching si_draw_vbo. As an example, skipped all draw calls with an odd new_draws could be done like this: void mywrapper(...) { if (new_draws % 2) return; return sctx->real_draw_vbo(...); } if (some_condition_is_met) si_install_draw_wrapper(sctx, mywrapper); Instead of having to add the "if ()" condition inside si_draw_vbo. Note that a single wrapper may be installed so care must be taken to not override an existing wrapper. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10979>	2021-06-15 10:19:04 +02:00
Pierre-Eric Pelloux-Prayer	ff8a930cf7	radeonsi: add _once suffix to depth_cleared_level_mask And add a new variable to disambiguate between "has been cleared once" and "is cleared". Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10979>	2021-06-15 10:19:02 +02:00
Mike Blumenkrantz	74abd5df0e	aux/tc: pass rebind count and rebind bitmask with replace_buffer_storage func tc already calculates all the rebinding that needs to be done on a given context, so (some of) this info can be passed on to drivers to enable optimizations Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11245>	2021-06-14 20:42:47 +00:00
Marek Olšák	7844bdadac	radeonsi: remove DFSM after we discovered how bad it is Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	9ba17ec21a	radeonsi: generate buffer_id_unique for u_threaded_context Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	9dc7fff448	radeonsi: allow changing the NGG subgroup size to 256 but don't change it yet Currently, 128 seems to have the best performance. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	712f74f590	radeonsi: remove 8 bytes from si_resource, turn other 4 bytes into padding Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	5af124c92c	radeonsi: change si_resource::alignment to alignment_log2 for better packing Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	36e07198a7	radeonsi: always use the L2 LRU cache policy for faster clears and copies Waves and CP DMA can finish sooner if L2 doesn't do any evictions, which is hard to predict. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	c7e731c737	radeonsi: remove unused SI_IMAGE_ACCESS_AS_BUFFER Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	94a1f45e15	ac/llvm: set target features per function instead of per target machine This is a cleanup that allows the removal of the wave32 target machine and the wave32 pass manager. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	b04044b350	radeonsi: stop using u_resource_vtbl::resource_destroy Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10659>	2021-05-21 17:38:04 +00:00
Marek Olšák	ec77a2d43a	gallium/u_threaded: add callbacks and documentation for resource busy checking Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10662>	2021-05-17 10:37:24 +00:00
Marek Olšák	967757a208	gallium+(u_threaded,r300,r600,radeonsi): move transfer offset into pipe_transfer Let's use the 4 bytes of unused padding usefully in pipe_transfer. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10527>	2021-05-01 17:38:42 +00:00
Mike Blumenkrantz	dae3113c3d	gallium: split drawid out of pipe_draw_info and as a separate draw_vbo param the only case in which this is nonzero is if a multidraw gets split by the frontend, i.e., mesa core, and in all other cases it can be ignored. the value can also be ignored for all indirect draws, though it seems many (most?) gallium drivers are not aware of this Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10166>	2021-04-30 03:59:19 +00:00
Mike Blumenkrantz	4566383ae4	gallium: move pipe_draw_info::index_bias to pipe_draw_start_count_bias this moves index_bias into the multidraw struct, enabling draws where the value changes to be merged; the draw_info struct member is renamed and moved to the end of the struct for tc use u_vbuf still has some checks to split draws if index_bias changes, maybe this can be removed at some point? Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10166>	2021-04-30 03:59:19 +00:00
Mike Blumenkrantz	4fe6c85526	gallium: rename pipe_draw_start_count -> pipe_draw_start_count_bias and add an index_bias member no functional changes yet, just the rename and unused struct member Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10166>	2021-04-30 03:59:19 +00:00
Marek Olšák	804e292440	radeonsi: remove the separate DCC optimization for Stoney This removes some complexity from the driver. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10343>	2021-04-26 22:53:30 +00:00
Marek Olšák	1f8fa96412	radeonsi: make the gfx9 DCC MSAA clear shader depend on the number of samples because different DCC equations are used. Fixes: `3120113ee7` - radeonsi: implement DCC MSAA 4x/8x fast clear using DCC equations on gfx9 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10343>	2021-04-26 22:53:30 +00:00
Simon Ser	4a6b87ceab	radeonsi: implement pipe_context.create_video_buffer_with_modifiers Just pass down the modifier list to vl_video_buffer_create_as_resource, filtering out DCC modifiers because we don't support these for now. Signed-off-by: Simon Ser <contact@emersion.fr> Reviewed-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10237>	2021-04-22 15:57:29 +00:00
Marek Olšák	a1653854f5	radeonsi: fix automatic DCC retiling after compute image stores Only internal compute shaders use DCC stores, so the TODOs are not critical yet. Fixes: `1d64a1045e` - radeonsi: enable dcc image stores on gfx10+ Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10261>	2021-04-17 02:37:49 +00:00
Marek Olšák	f9b527a9a5	radeonsi: unify internal compute with SSBOs in si_launch_grid_internal_ssbos just deduplicate the code Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10003>	2021-04-13 03:17:42 +00:00
Marek Olšák	ec60526035	radeonsi: move binding the internal compute shader into si_launch_grid_internal instead of doing it in each function Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10003>	2021-04-13 03:17:42 +00:00
Marek Olšák	3120113ee7	radeonsi: implement DCC MSAA 4x/8x fast clear using DCC equations on gfx9 MSAA 4x and 8x should only clear the first 2 samples because other samples are uncompressed. The compute shader only clears that subset of DCC. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10003>	2021-04-13 03:17:42 +00:00
Marek Olšák	8b95f51ef1	radeonsi: fix and enable full DCC with MSAA 2x on gfx9 This enables fast clear with any clear color (not just 0/1) for bpp >= 32. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10003>	2021-04-13 03:17:42 +00:00
Marek Olšák	7e68fae25f	ac,radeonsi: rewrite DCC retiling without the DCC retile map The retile map is removed and replaced by direct DCC address computations in the retile shader using the new function ac_nir_dcc_addr_from_coord. The RADV code is disabled. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10003>	2021-04-13 03:17:42 +00:00
Marek Olšák	06b6af596c	radeonsi: do Z-only or S-only HTILE clear using a compute shader doing RMW This adds a clear_buffer compute shader that does read-modify-write to update a subset of bits in HTILE. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10003>	2021-04-13 03:17:42 +00:00
Marek Olšák	4dd8d58ad5	radeonsi: clean up some mess around htile_stencil_disabled Set the final value in si_texture_create_object, so that other places don't have to derive it redundantly. The only thing to remember is that HTILE stencil can be enabled when stencil is not present, and it can be disabled when stencil is present due to various workarounds. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10003>	2021-04-13 03:17:42 +00:00
Marek Olšák	bcd1a69f79	radeonsi: parallelize Z/S conversion into TC-compatible with fast color clears It's not really a fast clear, but it's the next logical step towards doing HTILE clears here. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10003>	2021-04-13 03:17:42 +00:00
Marek Olšák	fb72d41b18	radeonsi: implement Z/S fast clear for non-zero mipmap levels Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10003>	2021-04-13 03:17:42 +00:00
Marek Olšák	faf10bd49d	ac/surface: use named "color and "zs" structures in unions Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10083>	2021-04-12 20:53:45 +00:00
Marek Olšák	468836317b	ac/surface: unify htile_* and dcc_* fields as meta_* fields Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10083>	2021-04-12 20:53:45 +00:00
Pierre-Eric Pelloux-Prayer	8c6a64c9b0	radeonsi/rgp: export compute shader programs Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10105>	2021-04-12 14:27:29 +02:00
Pierre-Eric Pelloux-Prayer	aa077ba3a2	radeonsi/rgp: export barriers Wrap the si_cp_wait_mem call to emit RGP_SQTT_MARKER_IDENTIFIER_BARRIER_START and RGP_SQTT_MARKER_IDENTIFIER_BARRIER_END events. Only for gfx9+ for now. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10105>	2021-04-12 14:27:26 +02:00
Marek Olšák	0580d4c1a2	radeonsi: enable HTILE with mipmapping on gfx9+ Everything seems to be there except fast clears. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	3345e32de7	radeonsi: group and parallelize all clears in si_texture_create_object This reduces aux_context flushes significantly. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	8cd61d1248	radeonsi: parallelize CMASK and DCC clears Clearing 8 RTs with both DCC and CMASK caused 16 synchronized clears where we also did 16 times WAIT_REG_MEM for CB flushes that were 15 times useless. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	d0f06e5c47	radeonsi: remove si_screen::dcc_msaa_allowed Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	4707dc6a64	radeonsi: determine accurately whether the framebuffer state has DCC MSAA We only need to check storage samples, which is what affects DCC. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	034c1e4845	radeonsi: decrease the maximum variable block size to allow packing the block size in 1 user SGPR with 10 bits per component, so that block sizes such as 512x1x1 fit in there. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	c53261645d	radeonsi: add SI_CONTEXT_PFP_SYNC_ME to skip syncing PFP for image operations DCC/CMASK/HTILE clears will not set this. We could do a better job at not setting this in other cases too Image copies also don't set this. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	b1a73ec99b	radeonsi: rename and apply SI_OP_CPDMA_SKIP_CACHE_FLUSH to compute as well Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	7e2b5ce722	radeonsi: set compute/cpdma sync flags in the outermost caller This allows us to control syncing everywhere. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	1af99a28a0	radeonsi: merge CP DMA flags with internal compute flags Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	dd5e9af78f	radeonsi: remove unused SI_CP_DMA_SKIP_* definitions The existing uses had no effect. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	938dc0e291	radeonsi: rename internal compute sync flags Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	28d065d3e5	radeonsi: don't insert start/stop pipeline stat events if it has no effect Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	e5ea9a3baa	radeonsi: add a fast path for MSAA resolving with RGB -> BGR swizzling When we encounter a situation when we need to swizzle, which the CB can't resolve in one pass, swap the channel order on the next clear, so that we don't have to swizzle. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9615>	2021-03-19 16:05:03 +00:00
Marek Olšák	a94bd9033d	radeonsi: use pipe_sampler_state::border_color_is_integer to simplify stuff We don't need the separate integer sampler state if we know the border color type. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9577>	2021-03-17 22:36:42 +00:00
Marek Olšák	32eb74e1e1	ac/gpu_info: fix more non-coherent RB and GL2 combinations It ignored non-harvested chips with a non-power-of-two memory bus. Fixes: `abed921ce7` - amd: add support for Navy Flounder Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9568>	2021-03-17 14:40:54 +00:00
Axel Davy	8283ed65cf	radeonsi: Limit the size of the in-memory shader cache The in-memory shader cache can get significantly huge in some rare cases. Limit its size to 64MB on 32 bits, and 1GB else. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9578>	2021-03-13 21:51:38 +00:00
Marek Olšák	e6a0f243ea	radeonsi: update pipe_screen::num_contexts This allows skipping mutex locking. Don't take the aux context into account. Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9356>	2021-03-11 05:05:39 +00:00
Pierre-Eric Pelloux-Prayer	c276bde34a	radeonsi/sqtt: export shader code to RGP With these changes the shader code is visible in RGP. Vk pipeline feature is emulated using si_update_shaders: when shaders are updated we compute a sha1 of their code and use it as a pipeline hash. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9277>	2021-03-05 13:10:11 +00:00
Marek Olšák	c97ebe1461	radeonsi: don't index si_context::shaders with enum gl_shader_stage Fixes: `a8373b3d38` "radeonsi: store si_context::xxx_shader members in union" Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9313>	2021-03-02 01:14:44 +00:00
Pierre-Eric Pelloux-Prayer	1d64a1045e	radeonsi: enable dcc image stores on gfx10+ This was implemented in `1d3bffaf9c`, but missing the WRITE_COMPRESS_ENABLE bit, then disabled by 4dc6ed2a59040f04648eadbffeb1522587d00f3. This commits reimplements it to: - avoid disabling dcc when uploading FP16 textures (see si_use_compute_copy_for_float_formats) - being able to use compute to upload textures in more cases, rather than using the blit path Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8958>	2021-02-17 14:57:26 +01:00
Pierre-Eric Pelloux-Prayer	f18bceac72	radeonsi: replace force_cp_dma arg of si_clear_buffer by enum The new enum has 3 values: - SI_CP_DMA_CLEAR_METHOD: equivalent to force_cp_dma = true - SI_COMPUTE_CLEAR_METHOD: to force the clear to use compute - SI_AUTO_SELECT_CLEAR_METHOD: equivalent to force_cp_dma = false No functional change yet, but this will be used later. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8958>	2021-02-17 14:57:26 +01:00
Pierre-Eric Pelloux-Prayer	bddc0e023c	radeonsi: fix read from compute / write from draw sync A compute dispatch should see the result of a previous draw command. radeonsi was missing this implicit sync, causing rendering artifacts: the compute shader was reading from a texture still being written to by the previous draw. Framebuffer BOs are marked with RADEON_USAGE_NEEDS_IMPLICIT_SYNC, so compute jobs will sync. v2: use RADEON_USAGE_NEEDS_IMPLICIT_SYNC v3: unconditionally make CB coherent after a flush Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> (v3) Reviewed-by: Marek Olšák <marek.olsak@amd.com> (v3) Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4032 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2878 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/1336 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8869>	2021-02-17 09:11:46 +00:00
Pierre-Eric Pelloux-Prayer	a8373b3d38	radeonsi: store si_context::xxx_shader members in union This allows to access them individually (sctx->shader.ps) or using array indexing (sctx->shaders[PIPE_SHADER_FRAGMENT]). Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8869>	2021-02-17 09:11:46 +00:00
Marek Olšák	0408279e8c	radeonsi: add debug options nodisplaytiling and nodisplaydcc Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8892>	2021-02-13 04:56:05 +00:00
Marek Olšák	47587758f2	radeonsi: prefetch VB descriptors right after uploading This skips the logic that sets and checks prefetch_L2_mask. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8794>	2021-01-30 15:41:23 -05:00
Marek Olšák	e93b42c214	ac,radeonsi: track memory usage in KB to reduce types from uint64 to uint32 Decreasing the time spent in radeon_cs_memory_below_limit is the motivation. Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8794>	2021-01-30 15:38:15 -05:00
Pierre-Eric Pelloux-Prayer	5dc823304b	radeonsi/sqtt: forward string markers to sqtt Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8746>	2021-01-29 08:44:12 +00:00
Pierre-Eric Pelloux-Prayer	f2d57d28ed	radeonsi/sqtt: use more event identifier Using event identifiers allows to add a bit more context to the RGP trace. Without this all draw calls are identified as vkCmdDraw. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8746>	2021-01-29 08:44:11 +00:00
Marek Olšák	dd9801a918	radeonsi: rename SI_SGPR_RW_BUFFERS to SI_SGPR_INTERNAL_BINDINGS They are just internal buffers and images. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8653>	2021-01-22 16:45:30 +00:00
Marek Olšák	26d785fbbd	radeonsi: move y_inverted out of si_viewports for better packing Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8653>	2021-01-22 16:45:30 +00:00
Marek Olšák	c1957e58a6	radeonsi: inline si_blend_color and si_clip_state structures better packing Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8653>	2021-01-22 16:45:30 +00:00
Marek Olšák	ca2062a394	radeonsi: simplify determining whether render condition is enabled at draw time Read one bool instead of reading one bool and one pointer. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8653>	2021-01-22 16:45:30 +00:00
Marek Olšák	1a2dde8f86	radeonsi: add internal blitter_running flag to skip the indirection in si_decompress_textures Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8653>	2021-01-22 16:45:30 +00:00
Marek Olšák	76d6351dab	radeonsi: don't validate inlinable uniforms at draw time Let's trust the state tracker that it sets inlinable uniforms only when shaders can use them. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8600>	2021-01-20 21:53:13 +00:00
Marek Olšák	888a45a362	radeonsi: evaluate si_get_vs in si_draw_vbo at compile time Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8600>	2021-01-20 21:53:13 +00:00
Marek Olšák	c5d3341b6e	radeonsi: inline the last use of si_get_vs_state Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8600>	2021-01-20 21:53:13 +00:00
Pierre-Eric Pelloux-Prayer	41d22eb68e	radeonsi: inhibit clockgating when using SQTT Ported from PAL. Fixes: `07c1504d1b` ("radeonsi: implement SQTT support") Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8480>	2021-01-19 09:52:08 +01:00
Marek Olšák	b06f3c52bf	radeonsi: trim the size of si_vgt_param_key and si_vgt_stages_key These are the minimum sizes we can use. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8548>	2021-01-18 01:17:19 +00:00
Marek Olšák	73709143d2	radeonsi: remove MRT-draw-calls, spill-draw-calls, spill-compute-calls due to limited usefulness and overhead in si_draw_vbo. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8548>	2021-01-18 01:17:19 +00:00
Marek Olšák	f2a5148701	radeonsi: make sctx->vertex_elements always non-NULL Bind a state with 0 vertex elements there. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8548>	2021-01-18 01:17:19 +00:00
Marek Olšák	961aa67adf	radeonsi: add a specialized function for CP DMA L2 prefetch This radically simplifies the code to decrease CPU overhead in si_draw_vbo. The generic CP DMA copy function is too complicated. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8548>	2021-01-18 01:17:19 +00:00
Marek Olšák	0eca4660a5	radeonsi: make cik_emit_prefetch_L2 templated and move it to si_state_draw.cpp This is a great candidate for a template. There are a lot of conditions that are already templated in si_draw_vbo. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8548>	2021-01-18 01:17:19 +00:00
Marek Olšák	4056e953fe	radeonsi: move emit_cache_flush functions into si_gfx_cs.c This is a better place for them. They are not inlined anyway. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8548>	2021-01-18 01:17:19 +00:00
Pierre-Eric Pelloux-Prayer	07c1504d1b	radeonsi: implement SQTT support Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8002>	2021-01-07 10:10:17 +01:00
Pierre-Eric Pelloux-Prayer	b94104c0c0	radeonsi: pass radeon_cmdbuf to si_cp_dma_wait_for_idle Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8002>	2021-01-07 10:09:25 +01:00
Pierre-Eric Pelloux-Prayer	aa9fe1e423	radeonsi: pass radeon_cmdbuf to emit_cache_flush Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8002>	2021-01-07 10:09:25 +01:00
Yogesh mohan marimuthu	8a22fc9502	radeonsi: enable vrs2x2 coarse shading if flat shading (v9) Enable vrs2x2 coarse shading if flat shading as per idea and guidance given by Marek. is_flat_shading variable in struct si_shader_info is set based on the data from gather_intrinsic_info() function and struct si_state_rasterizer. If is_flat_shading_variable is set, then in function si_emit_db_render_state() vrs2x2 shading is enabled in hardware. v2: Fix review comments from Pierre-Eric. Code optimizations. v3: Fix indentation style issue. v4: Fix review comments from Marek. Fixed logical issue pointed by Marek where info->is_flat_shading variable can be corrupted and other code cleanup. v5: Make the code compact as suggested by Pierre-Eric. v6: Fix new review comments from Marek. v7: use info->uses_interp_color variable fix from Marek. v8: Fix coding style comment from Marek. v9: Add uses_fbfetch_output check as suggested by Marek. Signed-off-by: Yogesh Mohan Marimuthu <yogesh.mohanmarimuthu@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8161>	2021-01-06 10:12:10 +05:30
Vinson Lee	8457be1497	radeonsi: Fix typos. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8289>	2021-01-05 02:25:36 +00:00
Marek Olšák	dffc27e5e1	radeonsi: fix small primitive culling with MSAA force-disabled and smoothing The problem was that the shader constants were based on the framebuffer sample count and ignored the multisample enable state and the line/polygon smoothing state, which uses MSAA rasterization that only sets SampleMaskIn to get the coverage for alpha-blended smoothing (the PS epilog computes the alpha channel from SampleMaskIn and blending generates the AA results). - This is a complete rework that adds a new state for NGG cull constants. - It fixes the same thing for the prim discard compute shader. - It documents how VS_STATE.SMALL_PRIM_PRECISION is encoded. It fixes blue corruption in Unigine Heaven with MSAA and Medium details or better. Fixes: `7648060dc0` - radeonsi: enable NGG culling by default on gfx10.3 dGPUs Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8022>	2020-12-16 00:43:45 -05:00
Marek Olšák	2b09bde1f5	radeonsi: use a C++ template to decrease draw_vbo overhead by 13 % With GALLIUM_THREAD=0 to disable draw merging. Before: 1, DrawElements ( 1 VBO\| 0 UBO\| 0 ) w/ no state change, 8736 After: 1, DrawElements ( 1 VBO\| 0 UBO\| 0 ) w/ no state change, 10059 Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7807>	2020-12-09 16:01:32 -05:00
Marek Olšák	fe839baf6a	radeonsi: fix future C++ compile failures and warnings Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7807>	2020-12-09 16:01:29 -05:00
Marek Olšák	85af48b0ee	radeonsi: allow including a few files from C++ Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7807>	2020-12-09 16:01:21 -05:00
Marek Olšák	21b97ef013	radeonsi: rename SI_TEST_DMA to SI_TEST_BLIT Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7908>	2020-12-09 00:52:26 +00:00
Marek Olšák	1f31a21664	radeonsi: remove SDMA support There are many issues with SDMA across many generations of hardware. A recent example is that gfx10.3 suffers from random GPU hangs if userspace uses SDMA. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7908>	2020-12-09 00:52:26 +00:00
Marek Olšák	5b81194fee	radeonsi: rename buffer functions so as not to reference rings Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7908>	2020-12-09 00:52:26 +00:00
Marek Olšák	ab1377cf92	radeonsi: move si_screen_clear_buffer into si_compute_blit.c w/o SDMA option Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7908>	2020-12-09 00:52:26 +00:00
Marek Olšák	3bd9db5be3	r300,r600,radeonsi: inline struct radeon_cmdbuf to remove dereferences It's straightforward except that the amdgpu winsys had to be cleaned up to allow this. radeon_cmdbuf is inlined and optionally the winsys can save the pointer to it. radeon_cmdbuf::priv points to the winsys cs structure. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7907>	2020-12-05 10:52:17 -05:00
Marek Olšák	8904fcca6d	gallium: inline struct u_suballocator to remove dereferences Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7901>	2020-12-03 21:41:19 +00:00
Marek Olšák	c7470c1760	radeonsi: don't set DrawID and StartInstance if they are unused Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7721>	2020-12-01 15:33:03 -05:00
Marek Olšák	c4ddf67ee1	radeonsi: don't invalidate emitted NUM_INSTANCES for u_blitter invalidate_draw_sh_constants should invalidate only SGPRs. invalidate_draw_constants invalidates SGPRs and NUM_INSTANCES. u_blitter called invalidate_draw_sh_constants, which previously invalidated NUM_INSTANCES as well. This commit fixes that. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7721>	2020-12-01 15:33:03 -05:00
Marek Olšák	509142876b	radeonsi: add AMD_DEBUG=nofastlaunch for debugging Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7721>	2020-12-01 15:33:03 -05:00
Indrajit Kumar Das	5d14562da8	radeonsi/gfx10: fix overflow and primitive queries This aligns the offsets to match the memory layout of the query buffer defined by gfx10_sh_query_buffer_mem and calls si_launch_grid_internal to flush caches and wait for completion of shaders prior to retrieving results. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7181>	2020-12-01 11:34:16 +00:00
Marek Olšák	1190808eca	radeonsi: if VS and TCS have the same number of threads, merge the conditonals Instead of: if (VS) { VS; } if (TCS) { TCS; } Do this if the number of threads is the same in VS and TCS: exec = enabled_threads; VS; TCS; Skipping declare_vb_descriptor_input_sgprs is needed to match the VS return values. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7623>	2020-11-23 02:22:21 +00:00
Marek Olšák	602d4a78bc	radeonsi: handle pipe_draw_info::increment_draw_id Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7441>	2020-11-18 01:41:25 +00:00
Pierre-Eric Pelloux-Prayer	6e7e208867	radeonsi: remove AMD_DEBUG=zerovram flag The same feature is available by using: radeonsi_zerovram=true Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7525>	2020-11-13 11:19:58 +00:00
Pierre-Eric Pelloux-Prayer	b9605f1a74	radeonsi: remove unused NO_RB_PLUS flag It's not used since https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/1751. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7525>	2020-11-13 11:19:58 +00:00
Bas Nieuwenhuizen	d4f7962d48	radeonsi: Add displayable DCC flushing without explicit flushes. Flushes non-explicit shared textures that need retiling on * glFlush * glSync * glSignalSemaphoreEXT * DRI fences. * The first time we create a non-explicit handle for it. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6176>	2020-11-13 03:27:28 +00:00
Marek Olšák	a44868beda	radeonsi: implement multi_draw for compute-based primitive culling Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7056>	2020-10-31 00:18:11 +00:00
Marek Olšák	0ce68852c1	radeonsi: implement multi_draw but supporting only 1 draw just adapting to the new interface Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7056>	2020-10-31 00:18:11 +00:00
Marek Olšák	7cc939f7dd	radeonsi: add num_draws parameter into si_need_gfx_cs_space Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7056>	2020-10-31 00:18:11 +00:00
Marek Olšák	b7501184b9	radeonsi: implement inlinable uniforms This improves performance for uber shaders. It must be enabled using the new driconf option. The driver compiles the specialized shaders in another thread without stalls, same as all other optimizations. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7057>	2020-10-30 11:07:22 +00:00
Marek Olšák	ed3c5fe469	radeonsi: implement GL_INTEL_blackhole_render Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7031>	2020-10-06 15:59:08 +00:00
Marek Olšák	30c3b2c0b6	radeonsi: simplify NGG culling enablement and add radeonsi_shader_culling option Add a vertex count threshold into si_shader_selector to simplify the draw_vbo code. The new option is supposed to be used in 00-mesa-defaults.conf and should be tweaked for best performance unlike the AMD_DEBUG experimental options. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6948>	2020-10-01 16:29:46 +00:00
Eleni Maria Stea	03af98abe2	radeonsi: support for external buffers (ext_external_objects) So far, the callback to create a resource from a memory object had code for importing textures only. Modified it to allow importing buffers too. Fixes the following piglit tests: - ext_external_objects/vk-buf-exchange - ext_external_objects/vk-pix-buf-update-errors - ext_external_objects/vk-vert-buf-update-errors - ext_external_objects/vk-vert-buf-reuse v2: Used si_alloc_buffer_struct instead of CALLOC v3: Fixed indentation issue, removed free in case of unsuccessful allocation, joined two if conditions together Signed-off-by: Eleni Maria Stea <estea@igalia.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6364>	2020-10-01 15:35:07 +00:00
Pierre-Eric Pelloux-Prayer	2c6643546a	radeonsi/tmz: add a tmz variant for sctx::eop_bug_scratch Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6049>	2020-09-24 14:51:16 +00:00
Pierre-Eric Pelloux-Prayer	8e2768bbfb	radeonsi/tmz: add tmz variant for sctx::tess_rings tess_rings must be encrypted when used in a secure job so this commit introduces a tess_rings_tmz resource. The cs_preamble_state doesn't contain the tess_rings address anymore since it can change. The tess_rings related registers go in a separate preamble. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6049>	2020-09-24 14:51:16 +00:00
Pierre-Eric Pelloux-Prayer	2589888ce9	radeonsi/tmz: add tmz variant of sctx::wait_mem_scratch Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6049>	2020-09-24 14:51:16 +00:00
Pierre-Eric Pelloux-Prayer	1b0d660cbc	radeonsi/tmz: allow secure job if the app made a tmz allocation This commit makes TMZ always allowed instead of being either off or forced-on with AMD_DEBUG=tmz. With this change: - secure job can be used as soon as the application made a tmz allocation. Driver internal allocations are not enough to enable secure jobs (if tmz is supported and enabled by the kernel) - AMD_DEBUG=tmz forces all scanout/depth/stencil buffers to be allocated as TMZ. This is useful to test app thats don't explicitely support protected content. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6049>	2020-09-24 14:51:16 +00:00
Pierre-Eric Pelloux-Prayer	5e4aecec93	radeonsi: introduce SI_RESOURCE_FLAG_INTERNAL / RADEON_FLAG_DRIVER_INTERNAL Tag allocations as driver internal. Some of these allocations will need to be doubled to handle TMZ (one secure bo, one normal bo) but these allocations shouldn't switch the winsys in "the app is using TMZ". Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6049>	2020-09-24 14:51:16 +00:00
Marek Olšák	972fb0368c	radeonsi: move binning parameters into si_screen it will be used in the next commit Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6822>	2020-09-24 11:55:06 +00:00
Marek Olšák	40a50e9398	radeonsi: remove KILL_PS_INF_INTERP/CLAMP_DIV_BY_ZERO, use screen::options Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6810>	2020-09-22 15:58:51 +00:00
Bas Nieuwenhuizen	017ca86b22	radeonsi: Move display dcc dirty tracking to framebuffer emission. To improve performance. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6783>	2020-09-19 03:15:28 -04:00
Bas Nieuwenhuizen	c6c1fa9a26	radeonsi: Put retile map in separate buffers. The retile maps are a software mechanism and hence very suceptible to change. As such I'd like to avoid making it part of the cross driver ABI. Ideally we'd just use the cached tile info + a shader to avoid these buffers altogether. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6783>	2020-09-19 03:15:25 -04:00
Marek Olšák	b23013db0a	Revert "radeonsi: set BIG_PAGE fields on gfx10.3" This reverts commit `430d384c31`. BIT_PAGE can't be set for GTT and we don't know if a buffer has been evicted to GTT. Fixes: `430d384c31` Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6722>	2020-09-16 02:54:01 +00:00
Marek Olšák	cb7bc983ae	radeonsi: stop using TGSI_PROPERTY_FS_COLOR0_WRITES_ALL_CBUFS Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6624>	2020-09-07 11:15:40 +00:00
Marek Olšák	a407123789	radeonsi: move nir_shader_compiler_options into si_screen so that they can be different depending on the GPU (for 16-bit support) Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6284>	2020-09-06 14:36:20 +00:00
Marek Olšák	3c54d73e4b	radeonsi: change PIPE_SHADER to MESA_SHADER (debug flags) Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6340>	2020-09-02 23:03:00 -04:00
Marek Olšák	b1cb72c449	radeonsi: change PIPE_SHADER to MESA_SHADER (si_shader_selector::type) Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6340>	2020-09-02 23:03:00 -04:00
Pierre-Eric Pelloux-Prayer	b8445520cb	radeonsi,driconf: add clamp_div_by_zero option Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6259>	2020-09-02 11:53:16 +02:00
Marek Olšák	b8892bc818	radeonsi: don't restore states at the beginning of IBs if they're shadowed Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5798>	2020-07-22 12:08:33 -04:00
Marek Olšák	69014d8c94	radeonsi: implement CP register shadowing Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5798>	2020-07-22 12:08:19 -04:00
Timothy Arceri	4686a95621	r600/radeonsi: silence zero-length-bounds gcc warnings Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5766>	2020-07-08 03:04:03 +00:00
Marek Olšák	50d7553600	radeonsi: add a debug option to enable NGG culling for tessellation Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5524>	2020-06-30 10:56:41 +00:00
Marek Olšák	9049e39804	radeonsi: always use Wave32 for GS fast launch, because Wave64 hangs Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5524>	2020-06-30 10:56:41 +00:00
Marek Olšák	1c1d34a67a	radeonsi: rename init_config states to cs_preamble states Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5603>	2020-06-26 07:02:57 +00:00

1 2 3 4 5 ...

796 Commits