KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Marek Olšák	3345e32de7	radeonsi: group and parallelize all clears in si_texture_create_object This reduces aux_context flushes significantly. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	8cd61d1248	radeonsi: parallelize CMASK and DCC clears Clearing 8 RTs with both DCC and CMASK caused 16 synchronized clears where we also did 16 times WAIT_REG_MEM for CB flushes that were 15 times useless. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	d0f06e5c47	radeonsi: remove si_screen::dcc_msaa_allowed Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	4707dc6a64	radeonsi: determine accurately whether the framebuffer state has DCC MSAA We only need to check storage samples, which is what affects DCC. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	034c1e4845	radeonsi: decrease the maximum variable block size to allow packing the block size in 1 user SGPR with 10 bits per component, so that block sizes such as 512x1x1 fit in there. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	c53261645d	radeonsi: add SI_CONTEXT_PFP_SYNC_ME to skip syncing PFP for image operations DCC/CMASK/HTILE clears will not set this. We could do a better job at not setting this in other cases too Image copies also don't set this. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	b1a73ec99b	radeonsi: rename and apply SI_OP_CPDMA_SKIP_CACHE_FLUSH to compute as well Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	7e2b5ce722	radeonsi: set compute/cpdma sync flags in the outermost caller This allows us to control syncing everywhere. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	1af99a28a0	radeonsi: merge CP DMA flags with internal compute flags Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	dd5e9af78f	radeonsi: remove unused SI_CP_DMA_SKIP_* definitions The existing uses had no effect. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	938dc0e291	radeonsi: rename internal compute sync flags Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	28d065d3e5	radeonsi: don't insert start/stop pipeline stat events if it has no effect Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9795>	2021-04-02 12:05:00 +00:00
Marek Olšák	e5ea9a3baa	radeonsi: add a fast path for MSAA resolving with RGB -> BGR swizzling When we encounter a situation when we need to swizzle, which the CB can't resolve in one pass, swap the channel order on the next clear, so that we don't have to swizzle. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9615>	2021-03-19 16:05:03 +00:00
Marek Olšák	a94bd9033d	radeonsi: use pipe_sampler_state::border_color_is_integer to simplify stuff We don't need the separate integer sampler state if we know the border color type. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9577>	2021-03-17 22:36:42 +00:00
Marek Olšák	32eb74e1e1	ac/gpu_info: fix more non-coherent RB and GL2 combinations It ignored non-harvested chips with a non-power-of-two memory bus. Fixes: `abed921ce7` - amd: add support for Navy Flounder Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9568>	2021-03-17 14:40:54 +00:00
Axel Davy	8283ed65cf	radeonsi: Limit the size of the in-memory shader cache The in-memory shader cache can get significantly huge in some rare cases. Limit its size to 64MB on 32 bits, and 1GB else. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9578>	2021-03-13 21:51:38 +00:00
Marek Olšák	e6a0f243ea	radeonsi: update pipe_screen::num_contexts This allows skipping mutex locking. Don't take the aux context into account. Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9356>	2021-03-11 05:05:39 +00:00
Pierre-Eric Pelloux-Prayer	c276bde34a	radeonsi/sqtt: export shader code to RGP With these changes the shader code is visible in RGP. Vk pipeline feature is emulated using si_update_shaders: when shaders are updated we compute a sha1 of their code and use it as a pipeline hash. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9277>	2021-03-05 13:10:11 +00:00
Marek Olšák	c97ebe1461	radeonsi: don't index si_context::shaders with enum gl_shader_stage Fixes: `a8373b3d38` "radeonsi: store si_context::xxx_shader members in union" Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9313>	2021-03-02 01:14:44 +00:00
Pierre-Eric Pelloux-Prayer	1d64a1045e	radeonsi: enable dcc image stores on gfx10+ This was implemented in `1d3bffaf9c`, but missing the WRITE_COMPRESS_ENABLE bit, then disabled by 4dc6ed2a59040f04648eadbffeb1522587d00f3. This commits reimplements it to: - avoid disabling dcc when uploading FP16 textures (see si_use_compute_copy_for_float_formats) - being able to use compute to upload textures in more cases, rather than using the blit path Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8958>	2021-02-17 14:57:26 +01:00
Pierre-Eric Pelloux-Prayer	f18bceac72	radeonsi: replace force_cp_dma arg of si_clear_buffer by enum The new enum has 3 values: - SI_CP_DMA_CLEAR_METHOD: equivalent to force_cp_dma = true - SI_COMPUTE_CLEAR_METHOD: to force the clear to use compute - SI_AUTO_SELECT_CLEAR_METHOD: equivalent to force_cp_dma = false No functional change yet, but this will be used later. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8958>	2021-02-17 14:57:26 +01:00
Pierre-Eric Pelloux-Prayer	bddc0e023c	radeonsi: fix read from compute / write from draw sync A compute dispatch should see the result of a previous draw command. radeonsi was missing this implicit sync, causing rendering artifacts: the compute shader was reading from a texture still being written to by the previous draw. Framebuffer BOs are marked with RADEON_USAGE_NEEDS_IMPLICIT_SYNC, so compute jobs will sync. v2: use RADEON_USAGE_NEEDS_IMPLICIT_SYNC v3: unconditionally make CB coherent after a flush Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> (v3) Reviewed-by: Marek Olšák <marek.olsak@amd.com> (v3) Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4032 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2878 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/1336 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8869>	2021-02-17 09:11:46 +00:00
Pierre-Eric Pelloux-Prayer	a8373b3d38	radeonsi: store si_context::xxx_shader members in union This allows to access them individually (sctx->shader.ps) or using array indexing (sctx->shaders[PIPE_SHADER_FRAGMENT]). Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8869>	2021-02-17 09:11:46 +00:00
Marek Olšák	0408279e8c	radeonsi: add debug options nodisplaytiling and nodisplaydcc Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8892>	2021-02-13 04:56:05 +00:00
Marek Olšák	47587758f2	radeonsi: prefetch VB descriptors right after uploading This skips the logic that sets and checks prefetch_L2_mask. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8794>	2021-01-30 15:41:23 -05:00
Marek Olšák	e93b42c214	ac,radeonsi: track memory usage in KB to reduce types from uint64 to uint32 Decreasing the time spent in radeon_cs_memory_below_limit is the motivation. Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8794>	2021-01-30 15:38:15 -05:00
Pierre-Eric Pelloux-Prayer	5dc823304b	radeonsi/sqtt: forward string markers to sqtt Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8746>	2021-01-29 08:44:12 +00:00
Pierre-Eric Pelloux-Prayer	f2d57d28ed	radeonsi/sqtt: use more event identifier Using event identifiers allows to add a bit more context to the RGP trace. Without this all draw calls are identified as vkCmdDraw. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8746>	2021-01-29 08:44:11 +00:00
Marek Olšák	dd9801a918	radeonsi: rename SI_SGPR_RW_BUFFERS to SI_SGPR_INTERNAL_BINDINGS They are just internal buffers and images. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8653>	2021-01-22 16:45:30 +00:00
Marek Olšák	26d785fbbd	radeonsi: move y_inverted out of si_viewports for better packing Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8653>	2021-01-22 16:45:30 +00:00
Marek Olšák	c1957e58a6	radeonsi: inline si_blend_color and si_clip_state structures better packing Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8653>	2021-01-22 16:45:30 +00:00
Marek Olšák	ca2062a394	radeonsi: simplify determining whether render condition is enabled at draw time Read one bool instead of reading one bool and one pointer. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8653>	2021-01-22 16:45:30 +00:00
Marek Olšák	1a2dde8f86	radeonsi: add internal blitter_running flag to skip the indirection in si_decompress_textures Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8653>	2021-01-22 16:45:30 +00:00
Marek Olšák	76d6351dab	radeonsi: don't validate inlinable uniforms at draw time Let's trust the state tracker that it sets inlinable uniforms only when shaders can use them. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8600>	2021-01-20 21:53:13 +00:00
Marek Olšák	888a45a362	radeonsi: evaluate si_get_vs in si_draw_vbo at compile time Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8600>	2021-01-20 21:53:13 +00:00
Marek Olšák	c5d3341b6e	radeonsi: inline the last use of si_get_vs_state Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8600>	2021-01-20 21:53:13 +00:00
Pierre-Eric Pelloux-Prayer	41d22eb68e	radeonsi: inhibit clockgating when using SQTT Ported from PAL. Fixes: `07c1504d1b` ("radeonsi: implement SQTT support") Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8480>	2021-01-19 09:52:08 +01:00
Marek Olšák	b06f3c52bf	radeonsi: trim the size of si_vgt_param_key and si_vgt_stages_key These are the minimum sizes we can use. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8548>	2021-01-18 01:17:19 +00:00
Marek Olšák	73709143d2	radeonsi: remove MRT-draw-calls, spill-draw-calls, spill-compute-calls due to limited usefulness and overhead in si_draw_vbo. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8548>	2021-01-18 01:17:19 +00:00
Marek Olšák	f2a5148701	radeonsi: make sctx->vertex_elements always non-NULL Bind a state with 0 vertex elements there. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8548>	2021-01-18 01:17:19 +00:00
Marek Olšák	961aa67adf	radeonsi: add a specialized function for CP DMA L2 prefetch This radically simplifies the code to decrease CPU overhead in si_draw_vbo. The generic CP DMA copy function is too complicated. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8548>	2021-01-18 01:17:19 +00:00
Marek Olšák	0eca4660a5	radeonsi: make cik_emit_prefetch_L2 templated and move it to si_state_draw.cpp This is a great candidate for a template. There are a lot of conditions that are already templated in si_draw_vbo. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8548>	2021-01-18 01:17:19 +00:00
Marek Olšák	4056e953fe	radeonsi: move emit_cache_flush functions into si_gfx_cs.c This is a better place for them. They are not inlined anyway. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8548>	2021-01-18 01:17:19 +00:00
Pierre-Eric Pelloux-Prayer	07c1504d1b	radeonsi: implement SQTT support Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8002>	2021-01-07 10:10:17 +01:00
Pierre-Eric Pelloux-Prayer	b94104c0c0	radeonsi: pass radeon_cmdbuf to si_cp_dma_wait_for_idle Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8002>	2021-01-07 10:09:25 +01:00
Pierre-Eric Pelloux-Prayer	aa9fe1e423	radeonsi: pass radeon_cmdbuf to emit_cache_flush Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8002>	2021-01-07 10:09:25 +01:00
Yogesh mohan marimuthu	8a22fc9502	radeonsi: enable vrs2x2 coarse shading if flat shading (v9) Enable vrs2x2 coarse shading if flat shading as per idea and guidance given by Marek. is_flat_shading variable in struct si_shader_info is set based on the data from gather_intrinsic_info() function and struct si_state_rasterizer. If is_flat_shading_variable is set, then in function si_emit_db_render_state() vrs2x2 shading is enabled in hardware. v2: Fix review comments from Pierre-Eric. Code optimizations. v3: Fix indentation style issue. v4: Fix review comments from Marek. Fixed logical issue pointed by Marek where info->is_flat_shading variable can be corrupted and other code cleanup. v5: Make the code compact as suggested by Pierre-Eric. v6: Fix new review comments from Marek. v7: use info->uses_interp_color variable fix from Marek. v8: Fix coding style comment from Marek. v9: Add uses_fbfetch_output check as suggested by Marek. Signed-off-by: Yogesh Mohan Marimuthu <yogesh.mohanmarimuthu@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8161>	2021-01-06 10:12:10 +05:30
Vinson Lee	8457be1497	radeonsi: Fix typos. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8289>	2021-01-05 02:25:36 +00:00
Marek Olšák	dffc27e5e1	radeonsi: fix small primitive culling with MSAA force-disabled and smoothing The problem was that the shader constants were based on the framebuffer sample count and ignored the multisample enable state and the line/polygon smoothing state, which uses MSAA rasterization that only sets SampleMaskIn to get the coverage for alpha-blended smoothing (the PS epilog computes the alpha channel from SampleMaskIn and blending generates the AA results). - This is a complete rework that adds a new state for NGG cull constants. - It fixes the same thing for the prim discard compute shader. - It documents how VS_STATE.SMALL_PRIM_PRECISION is encoded. It fixes blue corruption in Unigine Heaven with MSAA and Medium details or better. Fixes: `7648060dc0` - radeonsi: enable NGG culling by default on gfx10.3 dGPUs Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8022>	2020-12-16 00:43:45 -05:00
Marek Olšák	2b09bde1f5	radeonsi: use a C++ template to decrease draw_vbo overhead by 13 % With GALLIUM_THREAD=0 to disable draw merging. Before: 1, DrawElements ( 1 VBO\| 0 UBO\| 0 ) w/ no state change, 8736 After: 1, DrawElements ( 1 VBO\| 0 UBO\| 0 ) w/ no state change, 10059 Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7807>	2020-12-09 16:01:32 -05:00
Marek Olšák	fe839baf6a	radeonsi: fix future C++ compile failures and warnings Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7807>	2020-12-09 16:01:29 -05:00
Marek Olšák	85af48b0ee	radeonsi: allow including a few files from C++ Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7807>	2020-12-09 16:01:21 -05:00
Marek Olšák	21b97ef013	radeonsi: rename SI_TEST_DMA to SI_TEST_BLIT Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7908>	2020-12-09 00:52:26 +00:00
Marek Olšák	1f31a21664	radeonsi: remove SDMA support There are many issues with SDMA across many generations of hardware. A recent example is that gfx10.3 suffers from random GPU hangs if userspace uses SDMA. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7908>	2020-12-09 00:52:26 +00:00
Marek Olšák	5b81194fee	radeonsi: rename buffer functions so as not to reference rings Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7908>	2020-12-09 00:52:26 +00:00
Marek Olšák	ab1377cf92	radeonsi: move si_screen_clear_buffer into si_compute_blit.c w/o SDMA option Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7908>	2020-12-09 00:52:26 +00:00
Marek Olšák	3bd9db5be3	r300,r600,radeonsi: inline struct radeon_cmdbuf to remove dereferences It's straightforward except that the amdgpu winsys had to be cleaned up to allow this. radeon_cmdbuf is inlined and optionally the winsys can save the pointer to it. radeon_cmdbuf::priv points to the winsys cs structure. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7907>	2020-12-05 10:52:17 -05:00
Marek Olšák	8904fcca6d	gallium: inline struct u_suballocator to remove dereferences Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7901>	2020-12-03 21:41:19 +00:00
Marek Olšák	c7470c1760	radeonsi: don't set DrawID and StartInstance if they are unused Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7721>	2020-12-01 15:33:03 -05:00
Marek Olšák	c4ddf67ee1	radeonsi: don't invalidate emitted NUM_INSTANCES for u_blitter invalidate_draw_sh_constants should invalidate only SGPRs. invalidate_draw_constants invalidates SGPRs and NUM_INSTANCES. u_blitter called invalidate_draw_sh_constants, which previously invalidated NUM_INSTANCES as well. This commit fixes that. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7721>	2020-12-01 15:33:03 -05:00
Marek Olšák	509142876b	radeonsi: add AMD_DEBUG=nofastlaunch for debugging Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7721>	2020-12-01 15:33:03 -05:00
Indrajit Kumar Das	5d14562da8	radeonsi/gfx10: fix overflow and primitive queries This aligns the offsets to match the memory layout of the query buffer defined by gfx10_sh_query_buffer_mem and calls si_launch_grid_internal to flush caches and wait for completion of shaders prior to retrieving results. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7181>	2020-12-01 11:34:16 +00:00
Marek Olšák	1190808eca	radeonsi: if VS and TCS have the same number of threads, merge the conditonals Instead of: if (VS) { VS; } if (TCS) { TCS; } Do this if the number of threads is the same in VS and TCS: exec = enabled_threads; VS; TCS; Skipping declare_vb_descriptor_input_sgprs is needed to match the VS return values. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7623>	2020-11-23 02:22:21 +00:00
Marek Olšák	602d4a78bc	radeonsi: handle pipe_draw_info::increment_draw_id Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7441>	2020-11-18 01:41:25 +00:00
Pierre-Eric Pelloux-Prayer	6e7e208867	radeonsi: remove AMD_DEBUG=zerovram flag The same feature is available by using: radeonsi_zerovram=true Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7525>	2020-11-13 11:19:58 +00:00
Pierre-Eric Pelloux-Prayer	b9605f1a74	radeonsi: remove unused NO_RB_PLUS flag It's not used since https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/1751. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7525>	2020-11-13 11:19:58 +00:00
Bas Nieuwenhuizen	d4f7962d48	radeonsi: Add displayable DCC flushing without explicit flushes. Flushes non-explicit shared textures that need retiling on * glFlush * glSync * glSignalSemaphoreEXT * DRI fences. * The first time we create a non-explicit handle for it. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6176>	2020-11-13 03:27:28 +00:00
Marek Olšák	a44868beda	radeonsi: implement multi_draw for compute-based primitive culling Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7056>	2020-10-31 00:18:11 +00:00
Marek Olšák	0ce68852c1	radeonsi: implement multi_draw but supporting only 1 draw just adapting to the new interface Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7056>	2020-10-31 00:18:11 +00:00
Marek Olšák	7cc939f7dd	radeonsi: add num_draws parameter into si_need_gfx_cs_space Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7056>	2020-10-31 00:18:11 +00:00
Marek Olšák	b7501184b9	radeonsi: implement inlinable uniforms This improves performance for uber shaders. It must be enabled using the new driconf option. The driver compiles the specialized shaders in another thread without stalls, same as all other optimizations. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7057>	2020-10-30 11:07:22 +00:00
Marek Olšák	ed3c5fe469	radeonsi: implement GL_INTEL_blackhole_render Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7031>	2020-10-06 15:59:08 +00:00
Marek Olšák	30c3b2c0b6	radeonsi: simplify NGG culling enablement and add radeonsi_shader_culling option Add a vertex count threshold into si_shader_selector to simplify the draw_vbo code. The new option is supposed to be used in 00-mesa-defaults.conf and should be tweaked for best performance unlike the AMD_DEBUG experimental options. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6948>	2020-10-01 16:29:46 +00:00
Eleni Maria Stea	03af98abe2	radeonsi: support for external buffers (ext_external_objects) So far, the callback to create a resource from a memory object had code for importing textures only. Modified it to allow importing buffers too. Fixes the following piglit tests: - ext_external_objects/vk-buf-exchange - ext_external_objects/vk-pix-buf-update-errors - ext_external_objects/vk-vert-buf-update-errors - ext_external_objects/vk-vert-buf-reuse v2: Used si_alloc_buffer_struct instead of CALLOC v3: Fixed indentation issue, removed free in case of unsuccessful allocation, joined two if conditions together Signed-off-by: Eleni Maria Stea <estea@igalia.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6364>	2020-10-01 15:35:07 +00:00
Pierre-Eric Pelloux-Prayer	2c6643546a	radeonsi/tmz: add a tmz variant for sctx::eop_bug_scratch Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6049>	2020-09-24 14:51:16 +00:00
Pierre-Eric Pelloux-Prayer	8e2768bbfb	radeonsi/tmz: add tmz variant for sctx::tess_rings tess_rings must be encrypted when used in a secure job so this commit introduces a tess_rings_tmz resource. The cs_preamble_state doesn't contain the tess_rings address anymore since it can change. The tess_rings related registers go in a separate preamble. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6049>	2020-09-24 14:51:16 +00:00
Pierre-Eric Pelloux-Prayer	2589888ce9	radeonsi/tmz: add tmz variant of sctx::wait_mem_scratch Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6049>	2020-09-24 14:51:16 +00:00
Pierre-Eric Pelloux-Prayer	1b0d660cbc	radeonsi/tmz: allow secure job if the app made a tmz allocation This commit makes TMZ always allowed instead of being either off or forced-on with AMD_DEBUG=tmz. With this change: - secure job can be used as soon as the application made a tmz allocation. Driver internal allocations are not enough to enable secure jobs (if tmz is supported and enabled by the kernel) - AMD_DEBUG=tmz forces all scanout/depth/stencil buffers to be allocated as TMZ. This is useful to test app thats don't explicitely support protected content. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6049>	2020-09-24 14:51:16 +00:00
Pierre-Eric Pelloux-Prayer	5e4aecec93	radeonsi: introduce SI_RESOURCE_FLAG_INTERNAL / RADEON_FLAG_DRIVER_INTERNAL Tag allocations as driver internal. Some of these allocations will need to be doubled to handle TMZ (one secure bo, one normal bo) but these allocations shouldn't switch the winsys in "the app is using TMZ". Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6049>	2020-09-24 14:51:16 +00:00
Marek Olšák	972fb0368c	radeonsi: move binning parameters into si_screen it will be used in the next commit Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6822>	2020-09-24 11:55:06 +00:00
Marek Olšák	40a50e9398	radeonsi: remove KILL_PS_INF_INTERP/CLAMP_DIV_BY_ZERO, use screen::options Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6810>	2020-09-22 15:58:51 +00:00
Bas Nieuwenhuizen	017ca86b22	radeonsi: Move display dcc dirty tracking to framebuffer emission. To improve performance. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6783>	2020-09-19 03:15:28 -04:00
Bas Nieuwenhuizen	c6c1fa9a26	radeonsi: Put retile map in separate buffers. The retile maps are a software mechanism and hence very suceptible to change. As such I'd like to avoid making it part of the cross driver ABI. Ideally we'd just use the cached tile info + a shader to avoid these buffers altogether. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6783>	2020-09-19 03:15:25 -04:00
Marek Olšák	b23013db0a	Revert "radeonsi: set BIG_PAGE fields on gfx10.3" This reverts commit `430d384c31`. BIT_PAGE can't be set for GTT and we don't know if a buffer has been evicted to GTT. Fixes: `430d384c31` Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6722>	2020-09-16 02:54:01 +00:00
Marek Olšák	cb7bc983ae	radeonsi: stop using TGSI_PROPERTY_FS_COLOR0_WRITES_ALL_CBUFS Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6624>	2020-09-07 11:15:40 +00:00
Marek Olšák	a407123789	radeonsi: move nir_shader_compiler_options into si_screen so that they can be different depending on the GPU (for 16-bit support) Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6284>	2020-09-06 14:36:20 +00:00
Marek Olšák	3c54d73e4b	radeonsi: change PIPE_SHADER to MESA_SHADER (debug flags) Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6340>	2020-09-02 23:03:00 -04:00
Marek Olšák	b1cb72c449	radeonsi: change PIPE_SHADER to MESA_SHADER (si_shader_selector::type) Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6340>	2020-09-02 23:03:00 -04:00
Pierre-Eric Pelloux-Prayer	b8445520cb	radeonsi,driconf: add clamp_div_by_zero option Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6259>	2020-09-02 11:53:16 +02:00
Marek Olšák	b8892bc818	radeonsi: don't restore states at the beginning of IBs if they're shadowed Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5798>	2020-07-22 12:08:33 -04:00
Marek Olšák	69014d8c94	radeonsi: implement CP register shadowing Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5798>	2020-07-22 12:08:19 -04:00
Timothy Arceri	4686a95621	r600/radeonsi: silence zero-length-bounds gcc warnings Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5766>	2020-07-08 03:04:03 +00:00
Marek Olšák	50d7553600	radeonsi: add a debug option to enable NGG culling for tessellation Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5524>	2020-06-30 10:56:41 +00:00
Marek Olšák	9049e39804	radeonsi: always use Wave32 for GS fast launch, because Wave64 hangs Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5524>	2020-06-30 10:56:41 +00:00
Marek Olšák	1c1d34a67a	radeonsi: rename init_config states to cs_preamble states Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5603>	2020-06-26 07:02:57 +00:00
Marek Olšák	430d384c31	radeonsi: set BIG_PAGE fields on gfx10.3 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5383>	2020-06-09 16:17:36 +00:00
Marek Olšák	85a6bcca61	radeonsi: pass at most 3 images and/or shader buffers via user SGPRs for compute This should slightly decrease shader lifetime. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5209>	2020-06-02 20:47:49 +00:00
Marek Olšák	7b6b35c6b5	radeonsi: move resetting tracked registers into a new function Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5095>	2020-05-23 03:45:07 -04:00
Marek Olšák	7356144fe4	radeonsi: disable the L2 cache for most CPU mappings of textures for faster blits over PCIe and no need to flush L2 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4935>	2020-05-15 22:12:35 +00:00
Marek Olšák	2c4c1b0499	radeonsi: rename SI_RESOURCE_FLAG_TRANSFER to FORCE_LINEAR Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4935>	2020-05-15 22:12:35 +00:00

1 2 3 4 5 ...

691 Commits