KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Marek Olšák	19e6601413	radeonsi: do late NIR optimizations after uniform inlining This was missing. Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9028>	2021-02-17 04:49:24 -05:00
Marek Olšák	ffbf3a5f8b	radeonsi: simplify the NGG culling condition in si_draw_vbo Changes: - disallow NGG culling for GS, fast launch for tess using template args (GS can't do NGG culling, tess can't do fast launch) - skip checking current_rast_prim with tessellation (bake the condition into ngg_cull_vert_threshold) - use only 1 vertex count threshold for enabling NGG shader culling to simplify it. I think it doesn't have a big impact. The threshold computation depends on more parameters than just fast launch. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8434>	2021-02-02 05:42:32 +00:00
Marek Olšák	dd9801a918	radeonsi: rename SI_SGPR_RW_BUFFERS to SI_SGPR_INTERNAL_BINDINGS They are just internal buffers and images. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8653>	2021-01-22 16:45:30 +00:00
Marek Olšák	62703b79a5	radeonsi: remove si_gs_prolog_bits::gfx9_prev_is_vs It didn't do anything useful. GS doesn't use the other user SGPRs. If we decrease the number of user SGPRs we declare for the GS prolog, we can remove gfx9_prev_is_vs. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8344>	2021-01-06 23:28:04 -05:00
Marek Olšák	b6b6d1ff3c	radeonsi: fix hang caused by for loop with exec=0 in LS and ES LLVM expects that exec != 0 when entering loops and generates this code that becomes an infinite loop if exec == 0: BB5_1: vcc_lo = (inverted terminating condition) s_and_b32 vcc_lo, exec_lo, vcc_lo s_cbranch_vccnz BB5_3 // jump if vcc != 0 (break statement) // ... loop body ... s_branch BB5_1 BB5_3: For non-monolithic VS before TCS, VS before GS, and TES before GS, we set exec = (thread enabledmask), which sets 0 for HS-only and GS-only waves, causing the infinite loop condition above. Fix it as follows: - set exec = ~0 at the beginning - wrap the whole shader (LS and ES) in a conditional block, so that HS-only and GS-only waves jump over it and never enter such a loop The TES before GS hang can be reproduced by gfxbench: testfw_app --gfx egl -w 1920 -h 1080 --gl_api gles -t gl_tess Fixes: `68d6d097f1` - radeonsi/gfx9: add GFX9 and VEGA10 enums Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8344>	2021-01-06 23:28:01 -05:00
Yogesh mohan marimuthu	8a22fc9502	radeonsi: enable vrs2x2 coarse shading if flat shading (v9) Enable vrs2x2 coarse shading if flat shading as per idea and guidance given by Marek. is_flat_shading variable in struct si_shader_info is set based on the data from gather_intrinsic_info() function and struct si_state_rasterizer. If is_flat_shading_variable is set, then in function si_emit_db_render_state() vrs2x2 shading is enabled in hardware. v2: Fix review comments from Pierre-Eric. Code optimizations. v3: Fix indentation style issue. v4: Fix review comments from Marek. Fixed logical issue pointed by Marek where info->is_flat_shading variable can be corrupted and other code cleanup. v5: Make the code compact as suggested by Pierre-Eric. v6: Fix new review comments from Marek. v7: use info->uses_interp_color variable fix from Marek. v8: Fix coding style comment from Marek. v9: Add uses_fbfetch_output check as suggested by Marek. Signed-off-by: Yogesh Mohan Marimuthu <yogesh.mohanmarimuthu@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8161>	2021-01-06 10:12:10 +05:30
Marek Olšák	76eb3478cf	radeonsi: take color interpolation into account for shader variants Fixes: - Sample shading now uses per-sample interpolation for colors if colors are the only inputs. (this is the only case that was broken) Optimizations: - BC_OPTIMIZE (barycentric optimization) is now enabled with MSAA if colors are qualified with both center and centroid. (BC_OPTIMIZE means that the hardware skips initializing centroid (i,j) if they are equal to center (i,j)) - If MSAA is disabled and at least 2 out of (center, centroid, sample) are used by all inputs now including colors, center is forced for all inputs. - If INTERP_MODE_COLOR is not used and the legacy GL shade model is flat, the shader variant for flat shading is not generated. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8225>	2021-01-05 02:43:55 +00:00
Marek Olšák	fe839baf6a	radeonsi: fix future C++ compile failures and warnings Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7807>	2020-12-09 16:01:29 -05:00
Marek Olšák	85af48b0ee	radeonsi: allow including a few files from C++ Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7807>	2020-12-09 16:01:21 -05:00
Marek Olšák	c7470c1760	radeonsi: don't set DrawID and StartInstance if they are unused Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7721>	2020-12-01 15:33:03 -05:00
Marek Olšák	623ea81530	radeonsi: don't update provoking vertex and outprim states in SGPR if unused Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7721>	2020-12-01 15:33:03 -05:00
Marek Olšák	4641dca269	radeonsi: don't update indexed flag in SGPR if it's unused to skip the register update when switching between indexed and non-indexed Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7721>	2020-12-01 15:33:03 -05:00
Marek Olšák	509142876b	radeonsi: add AMD_DEBUG=nofastlaunch for debugging Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7721>	2020-12-01 15:33:03 -05:00
Marek Olšák	aaed7a29be	radeonsi: implement GS fast launch for indexed triangle strips This increases performance for indexed triangle strips up to +100%. In practice, it's limited by memory bandwidth and compute power, so 256-bit memory bus and a lot of CUs are recommended. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7681>	2020-11-27 06:16:59 +00:00
Marek Olšák	61fe66a2e4	radeonsi: pass VS->TCS IO via VGPRs if VS and TCS have the same thread count It can only be done if a TCS input is accessed without indirect indexing and with gl_InvocationID as the vertex index, and the number of VS and TCS threads is the same. This eliminates LDS stores and loads for VS->TCS IO, reducing shader lifetime and LDS traffic. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7623>	2020-11-23 02:22:21 +00:00
Marek Olšák	1190808eca	radeonsi: if VS and TCS have the same number of threads, merge the conditonals Instead of: if (VS) { VS; } if (TCS) { TCS; } Do this if the number of threads is the same in VS and TCS: exec = enabled_threads; VS; TCS; Skipping declare_vb_descriptor_input_sgprs is needed to match the VS return values. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7623>	2020-11-23 02:22:21 +00:00
Marek Olšák	c4310f70aa	radeonsi: swap DrawId and StartInstance SGPR locations We need to change both values at the same time, so they need to be next to each other. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7441>	2020-11-18 01:41:25 +00:00
Marek Olšák	b7501184b9	radeonsi: implement inlinable uniforms This improves performance for uber shaders. It must be enabled using the new driconf option. The driver compiles the specialized shaders in another thread without stalls, same as all other optimizations. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7057>	2020-10-30 11:07:22 +00:00
Marek Olšák	1de0bf0a56	radeonsi: remove indirection when loading position at the end for NGG culling If we store the position into LDS after we know the new thread ID, we don't need to remember the old thread ID. The culling code only needs W, X/W, Y/W, so we have to keep those. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7172>	2020-10-17 01:58:19 +00:00
Marek Olšák	f5912c6d32	radeonsi: kill disabled clip distances and planes at per-channel granularity Apps often enable only 1 plane for gl_ClipVertex, which means 1 scalar clip distance. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6948>	2020-10-01 16:29:46 +00:00
Marek Olšák	30c3b2c0b6	radeonsi: simplify NGG culling enablement and add radeonsi_shader_culling option Add a vertex count threshold into si_shader_selector to simplify the draw_vbo code. The new option is supposed to be used in 00-mesa-defaults.conf and should be tweaked for best performance unlike the AMD_DEBUG experimental options. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6948>	2020-10-01 16:29:46 +00:00
Marek Olšák	d1d27e9db4	radeonsi: remove redundant info.uses_fbfetch Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6782>	2020-09-25 04:37:23 -04:00
Marek Olšák	98a52fecda	radeonsi: implement 16-bit FS color outputs This removes type conversions from 16 bits to 32 bits in the main function and then back to 16 bits in the epilog. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6622>	2020-09-22 02:44:53 +00:00
Marek Olšák	c56fbed99b	radeonsi: kill point size VS output if it's not used by the rasterizer Fixed-func shaders can contain the output, because their generator doesn't consider the current primitive type into account. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6620>	2020-09-07 11:27:30 +00:00
Marek Olšák	1dd243d4f5	radeonsi: use shader_info::cs::local_size_variable to clean up some code Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6624>	2020-09-07 11:15:41 +00:00
Marek Olšák	757f790ad8	radeonsi: remove redundant si_shader_info::uses_derivatives Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6624>	2020-09-07 11:15:41 +00:00
Marek Olšák	f3f08bca23	radeonsi: remove redundant si_shader_selector::max_gs_stream Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6624>	2020-09-07 11:15:41 +00:00
Marek Olšák	2b4fa68808	radeonsi: remove redundant GS variables in si_shader_selector Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6624>	2020-09-07 11:15:40 +00:00
Marek Olšák	7960668dc9	radeonsi: remove redundant si_shader_info::writes_memory Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6624>	2020-09-07 11:15:40 +00:00
Marek Olšák	83cdffd435	radeonsi: rename num_memory_instructions -> num_memory_stores it only counts stores Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6624>	2020-09-07 11:15:40 +00:00
Marek Olšák	c8ab5899c1	radeonsi: reduce type sizes in si_shader_selector Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6624>	2020-09-07 11:15:40 +00:00
Marek Olšák	99c4e61084	radeonsi: remove redundant si_shader_info::uses_kill Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6624>	2020-09-07 11:15:40 +00:00
Marek Olšák	8df349a31e	radeonsi: merge uses_persp_opcode_interp_sample/uses_linear_opcode_interp_sample Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6624>	2020-09-07 11:15:40 +00:00
Marek Olšák	7b3e24c2d8	radeonsi: remove unused si_shader_info::uses_(vertexid\|basevertex) Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6624>	2020-09-07 11:15:40 +00:00
Marek Olšák	f02cd0e027	radeonsi: remove redundant si_shader_info:(clip\|cull) fields Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6624>	2020-09-07 11:15:40 +00:00
Marek Olšák	d15a7d16d6	radeonsi: remove redundant si_shader_info::const_buffers_declared Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6624>	2020-09-07 11:15:40 +00:00
Marek Olšák	0dabcb9e53	radeonsi: remove redundant si_shader_info::images_declared Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6624>	2020-09-07 11:15:40 +00:00
Marek Olšák	c1af2f4bee	radeonsi: remove redundant si_shader_info::shader_buffers_declared Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6624>	2020-09-07 11:15:40 +00:00
Marek Olšák	cb63e4afc9	radeonsi: remove info::samplers_declared, image_buffers, msaa_images_declared They are redundant with shader_info. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6624>	2020-09-07 11:15:40 +00:00
Marek Olšák	cb7bc983ae	radeonsi: stop using TGSI_PROPERTY_FS_COLOR0_WRITES_ALL_CBUFS Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6624>	2020-09-07 11:15:40 +00:00
Marek Olšák	89cf8789cd	radeonsi: stop using TGSI_PROPERTY_CS_LOCAL_SIZE Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6624>	2020-09-07 11:15:40 +00:00
Marek Olšák	46bb051bc2	radeonsi: stop using TGSI_PROPERTY_NEXT_SHADER Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6624>	2020-09-07 11:15:40 +00:00
Marek Olšák	98e866c669	radeonsi: optimize out the loop in si_get_ps_input_cntl Use a remap table from a semantic to an index instead of searching for the correct index. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6340>	2020-09-02 23:03:00 -04:00
Marek Olšák	6ecb8b6899	radeonsi: replace TGSI_SEMANTIC with VARYING_SLOT and FRAG_RESULT Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6340>	2020-09-02 23:03:00 -04:00
Marek Olšák	6925401a38	radeonsi: remove si_shader_selector::type Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6340>	2020-09-02 23:03:00 -04:00
Marek Olšák	966307983b	radeonsi: precompute si_*_descriptors_idx in si_shader_selector It helps remove one use of sel->type. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6340>	2020-09-02 23:03:00 -04:00
Marek Olšák	b1cb72c449	radeonsi: change PIPE_SHADER to MESA_SHADER (si_shader_selector::type) Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6340>	2020-09-02 23:03:00 -04:00
Marek Olšák	562b8c1a47	radeonsi: don't execute LDS stores for TCS outputs that are never read This is a per-component version of the previous mechanism. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6340>	2020-09-02 23:03:00 -04:00
Marek Olšák	81d106d6ec	radeonsi: lower IO intrinsics - complete rewrite of input/output scanning Input and output info is gathered from intrinsics. nir_variables are ignored (and we'll remove them anyway). This is a prerequisite for ACO, but also makes the IR prettier. The ac_nir_to_llvm change has to be in this commit. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6445>	2020-09-02 22:45:38 -04:00
Marek Olšák	ed9391df3f	radeonsi: get color interpolation info from shader_info Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6445>	2020-09-02 22:45:38 -04:00
Pierre-Eric Pelloux-Prayer	5a05f9714b	radeonsi: bump SI_NUM_SHADER_BUFFERS to 32 Some app uses more than 8 SSBOs (https://gitlab.freedesktop.org/mesa/mesa/-/issues/2946), so increase SI_NUM_SHADER_BUFFERS to 32 (which allows 16 SSBOs). Since we're now using a 64 bits number to track buffers, we could bump SI_NUM_SHADER_BUFFERS to 48 but that would conflict with Mesa's MAX_COMBINED_ATOMIC_BUFFERS limit (= 90). Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2122 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5632>	2020-06-30 09:23:14 +02:00
Marek Olšák	85a6bcca61	radeonsi: pass at most 3 images and/or shader buffers via user SGPRs for compute This should slightly decrease shader lifetime. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5209>	2020-06-02 20:47:49 +00:00
Pierre-Eric Pelloux-Prayer	d7008fe46a	radeonsi: switch to 3-spaces style Generated automatically using clang-format and the following config: AlignAfterOpenBracket: true AlignConsecutiveMacros: true AllowAllArgumentsOnNextLine: false AllowShortCaseLabelsOnASingleLine: false AllowShortFunctionsOnASingleLine: false AlwaysBreakAfterReturnType: None BasedOnStyle: LLVM BraceWrapping: AfterControlStatement: false AfterEnum: true AfterFunction: true AfterStruct: false BeforeElse: false SplitEmptyFunction: true BinPackArguments: true BinPackParameters: true BreakBeforeBraces: Custom ColumnLimit: 100 ContinuationIndentWidth: 3 Cpp11BracedListStyle: false Cpp11BracedListStyle: true ForEachMacros: - LIST_FOR_EACH_ENTRY - LIST_FOR_EACH_ENTRY_SAFE - util_dynarray_foreach - nir_foreach_variable - nir_foreach_variable_safe - nir_foreach_register - nir_foreach_register_safe - nir_foreach_use - nir_foreach_use_safe - nir_foreach_if_use - nir_foreach_if_use_safe - nir_foreach_def - nir_foreach_def_safe - nir_foreach_phi_src - nir_foreach_phi_src_safe - nir_foreach_parallel_copy_entry - nir_foreach_instr - nir_foreach_instr_reverse - nir_foreach_instr_safe - nir_foreach_instr_reverse_safe - nir_foreach_function - nir_foreach_block - nir_foreach_block_safe - nir_foreach_block_reverse - nir_foreach_block_reverse_safe - nir_foreach_block_in_cf_node IncludeBlocks: Regroup IncludeCategories: - Regex: '<[[:alnum:].]+>' Priority: 2 - Regex: '.*' Priority: 1 IndentWidth: 3 PenaltyBreakBeforeFirstCallParameter: 1 PenaltyExcessCharacter: 100 SpaceAfterCStyleCast: false SpaceBeforeCpp11BracedList: false SpaceBeforeCtorInitializerColon: false SpacesInContainerLiterals: false Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4319>	2020-03-30 11:05:52 +00:00
Marek Olšák	4ef1c8d60b	radeonsi/gfx10: fix the wave size for compute-based culling Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4269>	2020-03-28 00:58:34 +00:00
Daniel Schürmann	9d64ad2fe7	radeonsi: lower discard to demote when FS_CORRECT_DERIVS_AFTER_KILL is enabled Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4047>	2020-03-09 12:29:32 +00:00
Marek Olšák	0db74f479b	radeonsi: use the live shader cache Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/2929>	2020-01-24 20:29:29 -05:00
Marek Olšák	7ce84b256e	radeonsi: make si_compile_shader return bool Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3421>	2020-01-23 19:10:21 +00:00
Marek Olšák	735a3ba007	radeonsi/gfx10: enable GS fast launch for triangles and strips with NGG culling Only non-indexed triangle lists and strips are supported. This increases performance if there is something to cull. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2020-01-20 16:16:11 -05:00
Marek Olšák	8db00a51f8	radeonsi/gfx10: implement NGG culling for 4x wave32 subgroups Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2020-01-20 16:16:11 -05:00
Marek Olšák	aa2d846604	radeonsi/gfx10: move GE_PC_ALLOC setting to shader states The value is not changed. I just use a different way to compute it. The value will vary with NGG culling. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2020-01-20 16:16:11 -05:00
Marek Olšák	68586bdd21	radeonsi: remove useless #includes Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3399>	2020-01-15 21:54:55 +00:00
Marek Olšák	da2c12af4b	radeonsi: move geometry shader code into si_shader_llvm_gs.c Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3399>	2020-01-15 21:54:55 +00:00
Marek Olšák	cf65c6f0d2	radeonsi: move VS_STATE.LS_OUT_PATCH_SIZE a few bits higher to make space there Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2020-01-15 15:06:31 -05:00
Marek Olšák	42112010a3	radeonsi: rename si_shader_create -> si_create_shader_variant for clarity Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2020-01-14 18:46:07 -05:00
Marek Olšák	f4ba457e1e	radeonsi: clean up si_shader_info Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2020-01-14 18:46:07 -05:00
Marek Olšák	03950473df	radeonsi: merge si_tessctrl_info into si_shader_info Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2020-01-14 18:46:07 -05:00
Marek Olšák	5fa2ab831e	radeonsi: fork tgsi_shader_info and tgsi_tessctrl_info Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2020-01-14 18:46:07 -05:00
Marek Olšák	18aaceae8d	radeonsi: rename si_shader_info -> si_shader_binary_info Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2020-01-14 18:46:07 -05:00
Marek Olšák	7f4a54d5bd	radeonsi: remove TGSI from comments Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2020-01-14 18:46:07 -05:00
Marek Olšák	363b4027fc	radeonsi: put up to 5 VBO descriptors into user SGPRs gfx6-8: 1 VBO descriptor in user SGPRs gfx9-10: 5 VBO descriptors in user SGPRs We no longer pull up to 5 VBO descriptors from GTT when SDMA is disabled. Totals from affected shaders: SGPRS: 1110528 -> 1170528 (5.40 %) VGPRS: 952896 -> 951936 (-0.10 %) Spilled SGPRs: 83 -> 61 (-26.51 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 23766296 -> 22843920 (-3.88 %) bytes LDS: 0 -> 0 (0.00 %) blocks Max Waves: 179344 -> 179344 (0.00 %) Wait states: 0 -> 0 (0.00 %) Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2020-01-13 15:57:07 -05:00
Marek Olšák	312e04689a	radeonsi: don't allow draw calls with uninitialized VS inputs These always hang, because vertex buffer descriptors are not set up. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2020-01-13 15:57:07 -05:00
Marek Olšák	420fe1e7f9	radeonsi: remove TGSI Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2020-01-06 15:57:20 -05:00
Marek Olšák	17164d4e27	radeonsi/gfx10: don't declare any LDS for NGG if it's not used Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-12-27 13:50:57 -05:00
Marek Olšák	378444ce90	radeonsi: allow generating VS prologs with 0 inputs If "ls_vgpr_fix" is set, we use a prolog, but it can have 0 inputs. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3095>	2019-12-16 20:06:07 +00:00
Marek Olšák	442ef8c3e3	radeonsi: keep serialized NIR instead of nir_shader in si_shader_selector This decreases memory usage, because serialized NIR is more compact. The main shader part is compiled from nir_shader. Monolithic shader variants are compiled from nir_binary. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-11-05 23:28:45 -05:00
Marek Olšák	fff884e09d	radeonsi/nir: implement pipe_screen::finalize_nir Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-23 21:12:52 -04:00
Marek Olšák	268e0e01f3	radeonsi/nir: simplify si_lower_nir signature just a cleanup	2019-10-15 21:52:09 -04:00
Marek Olšák	eec7b0a865	radeonsi: use simple_mtx_t instead of mtx_t Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-07 20:05:07 -04:00
Marek Olšák	4dde40908f	radeonsi/gfx10: set PA_CL_VS_OUT_CNTL with CONTEXT_REG_RMW to fix edge flags We need two different values of the register, one for NGG and one for legacy, in order to fix edge flags for the legacy pipeline. Passing the ngg flag to emit_clip_regs would be too complicated, so CONTEXT_REG_RMW is used for partial register updates. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-27 16:16:08 -04:00
Marek Olšák	1426acf9e7	radeonsi/gfx10: remove incorrect ngg/pos_writes_edgeflag variables It varies depending on si_shader_key::as_ngg. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-27 16:16:08 -04:00
Marek Olšák	223b3174bd	radeonsi/nir: always lower ballot masks as 64-bit, codegen handles it This fixes KHR-GL45.shader_ballot_tests.ShaderBallotBitmasks. This solution is better, because the IR isn't dependent on wave32.	2019-08-19 17:23:38 -04:00
Marek Olšák	5167ca27fa	gallium: add TGSI_SEMANTIC_DEFAULT_OUTER/INNER_LEVEL for radeonsi NIR support.	2019-08-12 14:52:17 -04:00
Marek Olšák	902dd50cf0	gallium: add AMD-specific compute TGSI enums for tgsi_to_nir	2019-08-12 14:52:17 -04:00
Marek Olšák	6a2bdb8d01	gallium: add TGSI_PROPERTY_VS_BLIT_SGPRS_AMD for tgsi_to_nir needed by radeonsi NIR support	2019-08-12 14:52:17 -04:00
Marek Olšák	f064b530f6	radeonsi/gfx10: remove an obsolete VGT_REUSE_OFF workaround Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-06 17:09:01 -04:00
Marek Olšák	e777720173	radeonsi/nir: lower PS inputs before scanning the shader Lowering PS inputs can eliminate some of them, which messes up persp/linear barycentric coord usage info. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-06 17:08:46 -04:00
Marek Olšák	9ac7d0a0e2	radeonsi: fix packing of key.mono.u.ps	2019-07-30 22:06:23 -04:00
Marek Olšák	8f72f137ad	radeonsi/gfx10: add as_ngg variant for TES as ES to select Wave32/64 Legacy GS has to use Wave64, so TES before GS has to use Wave64 too. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-07-19 20:16:19 -04:00
Marek Olšák	88efb63caf	radeonsi/gfx10: implement Wave32 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-07-19 20:16:19 -04:00
Marek Olšák	a8a526c5cb	radeonsi/gfx10: set as_ngg for GS prolog as_ngg is required by Wave32. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-07-19 20:16:19 -04:00
Marek Olšák	0f30223cf4	radeonsi/gfx10: combine hw edgeflags with user edgeflags for correct behavior Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-07-19 20:16:19 -04:00
Marek Olšák	985a59e0d1	radeonsi/gfx10: don't compile the GS copy shader if it's 100% not needed Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-07-19 20:16:19 -04:00
Marek Olšák	1c99a13f89	radeonsi: decrease maximum supported GENERIC varying index from 42 to 31 This can decrease LDS and/or memory usage for shader outputs when geometry shaders or tessellation is used. Only PS inputs support higher indices and those aren't eliminated by kill_outputs. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Acked-by: Dave Airlie <airlied@redhat.com>	2019-07-09 17:24:16 -04:00
Marek Olšák	3be4ed2fe1	radeonsi: fix and clean up shader_type passing - don't pass it via a parameter if it can be derived from other parameters - set shader_type for ac_rtld_open - use enum pipe_shader_type instead of unsigned Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Acked-by: Dave Airlie <airlied@redhat.com>	2019-07-09 17:24:16 -04:00
Marek Olšák	f66ee5af2f	radeonsi: determine the rasterization primitive type accurately (v2) v2: reworked version to fix bugs and make it more efficient Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-03 15:51:13 -04:00
Marek Olšák	32694456f7	radeonsi/gfx10: jump over the shader query atomic if the queries are disabled Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-03 15:51:13 -04:00
Marek Olšák	b680f723f8	radeonsi/gfx10: export correct PrimitiveID from NGG vertex shaders Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-03 15:51:13 -04:00
Marek Olšák	6920f09f4b	radeonsi/gfx10: fix GL_LINE polygon mode for decomposed primitives We need to tell PA to accept edge flags generated by the input assembler, because decomposed primitives shouldn't draw inner edges. Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-03 15:51:13 -04:00
Nicolai Hähnle	77e715541c	radeonsi/gfx10: emit VGT_GS_OUT_PRIM_TYPE from draw and add it to VS_STATE With NGG, the VGT_GS_OUT_PRIM_TYPE can change without a shader change. The VS_STATE is required for both streamout and culling from a vertex shader without pre-compiling outprim-specific variants. We could consider compiling specialized variants in the future. We could also consider compiling the NGG logic as an epilog. Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-03 15:51:12 -04:00
Nicolai Hähnle	4ecc39e1aa	radeonsi/gfx10: NGG geometry shader PM4 and upload Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-03 15:51:12 -04:00

1 2 3 4 5 ...

442 Commits