KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Arvind Yadav	1fbc7337a1	radeonsi: Enable nir_lower_point_smooth lowering pass for point smoothing Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15117>	2022-07-16 07:08:33 -04:00
Qiang Yu	a1763ad4b3	radeonsi: replace llvm based fixed tcs with nir Create nir passthrough shader with explicit input/output and vertex output count so that it can be handled by compiler same as user tcs. The drawback is we create more si_shader_selector with different input/output and vertex output count which was handled by compiler backend before. As fixed function tcs can be handled like user tcs, we don't need the dedicated fixed_func_tcs_shader state either. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16705>	2022-06-27 02:38:21 +00:00
Qiang Yu	05b829cd0c	radeonsi: deserialize nir binary in si_check_blend_dst_sampler_noop We can do this parse with original nir instead of shader key pass applied nir in si_get_nir_shader. This can free si_get_nir_shader to just use si_shader as parameter. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16705>	2022-06-27 02:38:21 +00:00
Marek Olšák	bdf3797aeb	ac,radeonsi: don't export null from PS if it has no effect on gfx10+ We just need to pass the uses_discard flag to the epilog. The hw skips the export anyway. This will hang if SPI registers declare an output format or KILL_ENABLE is set because those cases require an export with done=1. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	dfa8dcf80e	radeonsi: remove streamout code from shaders if no streamout buffers are bound This is an optimization using asynchronous shader compilation. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	a9f7744cfe	radeonsi: rework how vs_state_bits is set and unpacked Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	c2342e6770	radeonsi: move GS_STATE bits to the end to make space at the beginning Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	c9c7dcb619	radeonsi: rename and regroup VS_STATE definitions Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	4f3c74ddfb	radeonsi: determine DB_SHADER_CONTROL in si_shader_ps This is cleaner and more flexible. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	af880e591e	radeonsi: remove GFX9_MERGED_NUM_USER_SGPR definition Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16509>	2022-05-17 10:27:04 +00:00
Pierre-Eric Pelloux-Prayer	d4e4aa997a	radeonsi: store shader variants in an array Instead of storing them in a linked list, put them in an array in si_shader_selector. The keys are also stored separatly, to avoid pointer chasing when searching a variant in si_shader_select_with_key. This main point here is to simplify the code by storing everything in the selector instead of splitting the list storage between the selector and the shaders; this shouldn't affect performance in a meaningful way. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16273>	2022-05-13 14:40:56 +00:00
Marek Olšák	2a1c711052	ac/llvm: skip s_barrier if tess patches don't cross a wave boundary If tess patches are wholly in one wave, "s_waitcnt lgkm(0)" is sufficient. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16304>	2022-05-13 09:15:20 +00:00
Pierre-Eric Pelloux-Prayer	38e8a73e14	radeonsi: implement GL_GEOMETRY_SHADER_PRIMITIVES_EMITTED_ARB in shaders Statistics only work in non-NGG mode. If screen->use_ngg is true, we can't know if the draw will actually use NGG or not, so this commit switch to a shader based implementation of this counter. To avoid modifying si_query, the shader implementation behaves like the hw one: it uses the same buffer size and offset. The emulation path activation in the shader is controlled by vs_state_bit[31]. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15861>	2022-05-12 09:16:11 +02:00
Marek Olšák	c749b6f6ae	radeonsi/gfx11: update the initialization of SGPR0/1 registers for HS and GS Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16328>	2022-05-10 04:29:55 +00:00
Yogesh mohan marimuthu	6531ec8922	ac,radeonsi/gfx11: swizzle MRT0/1 for dual source blending If dual source blending is enabled, use export targets 21 and 22. Also we have to swap odd/even lanes between export target 21 and 22. Signed-off-by: Yogesh Mohan Marimuthu <yogesh.mohanmarimuthu@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16328>	2022-05-10 04:29:55 +00:00
Yogesh Mohan Marimuthu	12a606c1bd	radeonsi/gfx11: export alpha through mrtz for alpha-to-coverage if mrtz is there If both mrtz and alpha-to-coverage are enabled, the alpha channel must be exported through mrtz. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16328>	2022-05-10 04:29:55 +00:00
Marek Olšák	afc110a1f6	radeonsi/gfx11: implement attributes through memory Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16328>	2022-05-10 04:29:55 +00:00
Marek Olšák	3777a5d715	radeonsi: assign param export indices before compilation This moves the logic out of LLVM-specific codepaths. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14414>	2022-04-22 22:21:11 +00:00
Marek Olšák	4e8b8555c6	radeonsi: move si_shader_info::stage into si_shader_selector This will help me see all places where we use "info", which will be moved from si_shader_selector to shader variants. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14414>	2022-04-22 22:21:11 +00:00
Marek Olšák	8de5b11b29	radeonsi: move most "info" fields from si_shader_selector into si_shader_info It's where they should be, and future commits might require this. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14414>	2022-04-22 22:21:11 +00:00
Marek Olšák	b57a163b7d	radeonsi: gather pipe_stream_output_info from NIR intrinsics This stops pipe_stream_output_info from create_*s_state context functions because NIR contains everything and can do more advanced shader linking this way. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14414>	2022-04-22 22:21:11 +00:00
Yonggang Luo	2ca6ef22f7	util: Rename pipe_debug_callback to util_debug_callback Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15657>	2022-04-01 01:52:43 +00:00
Marek Olšák	12e00be09b	radeonsi: apply the LLVM discard bug workaround to LLVM 13 only It was fixed in LLVM 14. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Pierre-Eric Pelloux-Prayer	9685b5785b	radeonsi: add SI_PROFILE_CLAMP_DIV_BY_ZERO To enable divide by zero clamping per shader, instead of per app. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14931>	2022-02-16 11:13:25 +00:00
Marek Olšák	afdfcdd542	radeonsi: determine MEM_ORDERED after generating a shader variant because si_get_nir_shader runs NIR passes and some of them can introduce new loads. Fixes: `3fb77ef2e0` - radeonsi: do opt_large_constants & lower_indirect_derefs after uniform inlining Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14528>	2022-01-18 11:11:08 +00:00
Marek Olšák	08cc73a218	radeonsi: rename uses_vmem_* flags Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14528>	2022-01-18 11:11:08 +00:00
Marek Olšák	d4a1766a5a	radeonsi: move the GS copy shader into shader variants This will allow further optimizations for shader variants that change GS outputs (affecting the copy shader), and this is mainly about sharing optimizations with NGG instead of having a totally separate codepath for legacy GS. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:31 +00:00
Marek Olšák	8ed9d38e73	radeonsi: move si_nir_scan_shader into si_shader_info.c Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:31 +00:00
Marek Olšák	198ad7e4dc	radeonsi: move smoothing to the main shader part to remove 1 live VGPR The samplemask VGPR that we had to pass to the epilog increased VGPR usage by 1 for all shaders. Do it in the main function by using the mono key structure, which causes on-demand compilation and stall, but we'll save the VGPR. 57794 shaders in 35145 tests Totals: SGPRS: 2715856 -> 2716272 (0.02 %) VGPRS: 1776168 -> 1718432 (-3.25 %) Spilled SGPRs: 3704 -> 3630 (-2.00 %) Spilled VGPRs: 1727 -> 1733 (0.35 %) Private memory VGPRs: 256 -> 256 (0.00 %) Scratch size: 2008 -> 2016 (0.40 %) dwords per thread Code Size: 61429584 -> 61393288 (-0.06 %) bytes Max Waves: 838645 -> 840484 (0.22 %) Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:31 +00:00
Marek Olšák	12b942bd16	radeonsi: pass sample_coverage VGPR index to the PS prolog instead of guessing The code was correct, but little confusing. This is cleaner. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:30 +00:00
Marek Olšák	3283df1425	radeonsi: remove unused si_shader::prolog2 This became unused when the GS prolog was removed. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:30 +00:00
Marek Olšák	af9ec3c45d	radeonsi: add shader profiles that disable binning Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13966>	2021-12-11 20:07:35 +00:00
Marek Olšák	b3b2f97f2e	radeonsi: add Wave32 heuristics and shader profiles This generally works well. There are new cases that select Wave32, and there are shader profiles which adjust that. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13966>	2021-12-11 20:07:35 +00:00
Marek Olšák	cd86f1dc2b	radeonsi: rename si_get_shader_wave_size and make it non-inline Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13878>	2021-11-26 11:35:05 +00:00
Marek Olšák	1ef027851d	radeonsi: propagate si_shader::wave_size to VGT_SHADER_STAGES instead of hardcoding them Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13878>	2021-11-26 11:35:05 +00:00
Marek Olšák	bc57488936	radeonsi: add si_shader::wave_size because it will vary Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13878>	2021-11-26 11:35:05 +00:00
Marek Olšák	41523773f5	radeonsi: add wave32 flag into prolog/epilog keys It will vary between shaders. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13878>	2021-11-26 11:35:05 +00:00
Marek Olšák	ba6d389fa7	radeonsi: don't use GS SGPR6 for the small prim cull info use a user SGPR instead. This will be needed in the future. Also don't upload small_prim_precision because it's passed via VS_STATE_BITS. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13811>	2021-11-16 19:41:07 +00:00
Marek Olšák	513bd6acca	radeonsi: cull against clip planes, clipvertex, clip/cull distances in shader The downside is that this duplicates shader code for clip/cull distances in both the position and parameter portions of the shader. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13811>	2021-11-16 19:41:07 +00:00
Marek Olšák	881c459191	radeonsi: unify how ngg_cull_flags are set Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13811>	2021-11-16 19:41:07 +00:00
Marek Olšák	5a5263d65d	radeonsi: unify GFX9_VSGS_NUM_USER_SGPR and GFX9_TESGS_NUM_USER_SGPR Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>	2021-11-16 02:11:46 +00:00
Marek Olšák	9151ac3531	ac,radeonsi: cull small lines in the shader using the diamond exit rule It also splits clip_half_line_width into X and Y components for tighter view culling. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>	2021-11-16 02:11:46 +00:00
Marek Olšák	8cf802e8ef	radeonsi: replace the GS prolog with a monolithic shader variant It only exists because of the hw bug and is used very rarely. Let's simplify it. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13393>	2021-10-18 18:08:59 +00:00
Marek Olšák	8c5a32b5fe	radeonsi: split si_shader_key into ps and ge parts to minimize memcmp overhead ps is for the pixel shader, while ge is for VS, TCS, TES, and GS. si_shader_key: 68 bytes si_shader_key_ge: 68 bytes si_shader_key_ps: 28 bytes The only notable change is that si_shader_select_with_key is changed to a C++ template. Other changes are trivial. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13285>	2021-10-16 10:41:51 +00:00
Marek Olšák	385c9e1caf	radeonsi: si_state_shaders.c -> cpp We'll add some templates here. Why is `extern "C"` not needed for exported functions? Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13285>	2021-10-16 10:41:51 +00:00
Arvind Yadav	8f9945a75b	radeonsi: remove the use of PKT3_CONTEXT_REG_RMW This patch is to to remove PKT3_CONTEXT_REG_RMW from radeonsi. and avoid multiple command buffer(PM4 packet)creation for R_02881C_PA_CL_VS_OUT_CNTL. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Signed-off-by: Arvind Yadav <arvind.yadav@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12789>	2021-10-13 10:28:14 +00:00
Marek Olšák	844f66bf38	radeonsi: remove GS fast launch It regresses the first snx test because it adds CPU overhead, and there is no way to work around it. The average effect on viewperf is 0, meaning that a few cases improve, while a few others regress. Acked-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13279>	2021-10-11 07:13:48 +00:00
Marek Olšák	f00d3e2909	radeonsi: implement shader-based culling for lines This helps some viewperf subtests. Only view XY culling is done. Edgeflags are always disabled with lines. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13048>	2021-09-28 17:30:06 +00:00
Marek Olšák	0030bdf9a6	radeonsi: add gfx10 helpers for determining whether edgeflags are enabled They will return false when culling lines. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13048>	2021-09-28 17:30:06 +00:00
Marek Olšák	edb5fa4d59	radeonsi: eliminate redundant SPI_SHADER_PGM_RSRC3/4_GS register writes They don't change much. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12343>	2021-09-14 15:24:11 +00:00

1 2 3 4 5 ...

465 Commits