KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Jason Ekstrand	e83fe65cd8	radv,radeonsi: Do cube size divide-by-6 lowering in NIR No point in carrying all this code around twice each in two back-ends. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12005>	2021-07-22 14:22:35 -05:00
Timur Kristóf	aa24740370	radv: Don't compile NGG culling into shaders that write viewport index. We don't support NGG culling with multiple viewports yet. Fixes: `f30e4351de` Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11910>	2021-07-16 13:00:36 +00:00
Timur Kristóf	8341af5109	radv, aco, ac/nir: Tweak position export scheduling for NGG culling. The result is about +5-ish fps in Doom Eternal. It turns out that the location of position exports matters more than we thought, and it's actually better to keep them at the bottom for culling shaders rather than schedule it up to the top. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Timur Kristóf	d18920e03a	radv: Run algebraic optimizations before NGG lowering. This makes culling shaders more efficient because they split the shader in two parts. It is better to optimize before this split happens. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Timur Kristóf	f30e4351de	radv: Support NGG culling with new perftest environment variable. Currently we don't enable it on any chip by default, but we plan to enable it soon on GFX10.3 when we are comfortable with its performance. RADV_PERFTEST=nggc environment variable enables it on GFX10+ GPUs. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Timur Kristóf	9a95f5487f	radv: New shader args for NGG culling settings and viewport. Add new shader arguments in RADV for: - NGG culling settings - Viewport transform These will be used by NGG culling shaders. Additionally, some tweaks are made to some config registers in order to make culling shaders more efficient. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Samuel Pitoiset	ee79b87c62	radv: lower primitive shading rate in NIR This allows more potential compiler optimizations if the value is a constant or from a scalar load. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11579>	2021-07-12 17:54:07 +00:00
Connor Abbott	e4e79de2a4	nir/subgroups: Support > 1 ballot components Qualcomm has a mode with a subgroup size of 128, so just emitting larger integer operations and then lowering them later isn't an option. This makes the pass able to handle the lowering itself, so that we don't have to go down to 64-thread wavefronts when ballots are used. (The GLSL and legacy SPIR-V extensions only support a maximum of 64 threads, but I guess we'll cross that bridge when we come to it...) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6752>	2021-07-08 16:02:41 +00:00
Connor Abbott	90819b9b0e	nir/subgroups: Replace lower_vote_eq_to_ballot with lower_vote_eq Lower it to a vote instead of a ballot. This was only used for AMD, and in that case they're pretty much the same. However Qualcomm has a vote builtin, which we want to use instead of ballots. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6752>	2021-07-08 16:02:41 +00:00
Timur Kristóf	32fafa43ae	radv: Allow enabling vertex grouping, fix NGG info with it disabled. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11711>	2021-07-08 16:54:29 +02:00
Timur Kristóf	fd64c18214	radv: Move radv_optimize_nir_algebraic to a separate function. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11711>	2021-07-08 16:54:29 +02:00
Timur Kristóf	00d11c2065	radv: Add last_vgt_api_stage and use it to simplify some code. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11711>	2021-07-08 16:54:26 +02:00
Bas Nieuwenhuizen	f9cc94af7b	radv/winsys: Add support for a fixed VA address for replay. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10570>	2021-07-05 17:27:51 +00:00
Bas Nieuwenhuizen	8025b4120f	radv/winsys: Return vulkan errors for buffer creation. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10570>	2021-07-05 17:27:51 +00:00
Rhys Perry	ed123a03be	radv: enable VK_KHR_shader_subgroup_uniform_control_flow Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11626>	2021-07-01 10:01:52 +00:00
Samuel Pitoiset	4a031a01ba	radv: fix RADV_FORCE_VRS for 2x1 and 1x2 The rates were inverted. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11578>	2021-06-29 12:47:49 +02:00
Rhys Perry	ea68d4a676	nir/propagate_invariant: add invariant_prim option Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11035>	2021-06-21 15:13:05 +00:00
Bas Nieuwenhuizen	9920eadf83	radv: Implement load_vulkan_descriptor for acceleration structures. It always uses nir_address_format_64bit_global. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11078>	2021-06-18 22:16:27 +00:00
Bas Nieuwenhuizen	85e39cb325	radv: Convert lower_intrinsics to a switch statement Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11078>	2021-06-18 22:16:27 +00:00
Caio Marcelo de Oliveira Filho	b5f6fc442c	nir: Move zero_initialize_shared_memory into common shader_info Move it out the "cs" sub-struct, since the bit will be used for other shader stages in the future. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11225>	2021-06-08 09:23:55 -07:00
Rhys Perry	63659fc15c	radv: use byte/word extract/insert instructions ACO doesn't yet combine extract/insert into instructions, but it seems to already generate less instructions because NIR optimizes shift+and to these instructions. Code size is worse in some cases though because we have to always use a literal when masking. fossil-db (Sienna Cichlid): Totals from 14361 (9.58% of 149839) affected shaders: VGPRs: 850152 -> 850304 (+0.02%); split: -0.02%, +0.04% SpillSGPRs: 7979 -> 7989 (+0.13%); split: -0.03%, +0.15% CodeSize: 88031216 -> 88162520 (+0.15%); split: -0.01%, +0.16% MaxWaves: 269414 -> 269426 (+0.00%) Instrs: 16695182 -> 16662852 (-0.19%); split: -0.21%, +0.01% Latency: 375592693 -> 375544364 (-0.01%); split: -0.04%, +0.03% InvThroughput: 75627700 -> 75607720 (-0.03%); split: -0.07%, +0.04% fossil-db (Polaris): Totals from 13816 (9.13% of 151365) affected shaders: SGPRs: 984896 -> 982512 (-0.24%); split: -0.29%, +0.05% VGPRs: 809220 -> 809112 (-0.01%); split: -0.02%, +0.01% SpillSGPRs: 9181 -> 9185 (+0.04%); split: -0.04%, +0.09% CodeSize: 82017952 -> 82123484 (+0.13%); split: -0.01%, +0.14% MaxWaves: 65721 -> 65723 (+0.00%) Instrs: 16008744 -> 15988007 (-0.13%); split: -0.18%, +0.05% Latency: 439911623 -> 439869622 (-0.01%); split: -0.04%, +0.03% InvThroughput: 185898770 -> 185841742 (-0.03%); split: -0.08%, +0.05% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3151>	2021-06-08 08:57:43 +00:00
Rhys Perry	1cbcfb8b38	nir, nir/algebraic: add byte/word insertion instructions Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3151>	2021-06-08 08:57:42 +00:00
Daniel Schürmann	dc807dff3e	radv,aco: scalarize all phis via nir_lower_phis_to_scalar() This allows to remove some ACO code which did so previously. Totals from 93 (0.06% of 149839) affected shaders (Navi2): CodeSize: 582424 -> 582348 (-0.01%); split: -0.10%, +0.08% Instrs: 107083 -> 107011 (-0.07%); split: -0.08%, +0.01% Latency: 483338 -> 484881 (+0.32%); split: -0.09%, +0.40% InvThroughput: 101129 -> 101532 (+0.40%); split: -0.03%, +0.42% Copies: 9893 -> 9774 (-1.20%); split: -1.28%, +0.08% Branches: 2862 -> 2858 (-0.14%) PreSGPRs: 3342 -> 3339 (-0.09%) PreVGPRs: 4567 -> 4565 (-0.04%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11181>	2021-06-04 16:47:01 +00:00
Samuel Pitoiset	6d56c16c9c	radv: dump the trap handler shader with RADV_DEBUG=metashaders Help debugging it. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10656>	2021-05-17 15:18:16 +00:00
Connor Abbott	a40714abf7	nir/lower_phis_to_scalar: Add "lower_all" option We don't want to have to deal with vector phis in freedreno, because vectors are always split/unsplit around vectorized instructions anyways, and the stated reason for not scalarising them (it hurting coalescing) won't apply to us because we won't be using nir_from_ssa. Add this option so that we don't have to do the equivalent thing while translating from NIR. Reviewed-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10809>	2021-05-17 09:59:45 +00:00
Timur Kristóf	bb127c2130	radv: Use new NIR lowering of NGG GS when ACO is used. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10740>	2021-05-12 13:47:04 +00:00
Timur Kristóf	9732881729	radv: Use new NGG NIR lowering for VS/TES when ACO is used. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10740>	2021-05-12 13:47:04 +00:00
Rhys Perry	20a0744e22	Revert "radv,aco: don't use MUBUF for multi-channel loads on GFX8 with robustness2" This reverts commit a8a6b9fb2fdcb1bea55707fa0c2b8e96f03c6b5b. This is no longer necessary now that we fixup the size when creating the descriptors. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7871>	2021-05-10 12:09:14 +00:00
Caio Marcelo de Oliveira Filho	dd48683cfd	nir: Move shared_memory_explicit_layout bit into common shader_info Move it out of the "cs" sub-struct, since the bit can be used for other shader stages in the future. This also removes a subtle issue in spirv_to_nir: info.cs.shared_memory_explicit_layout was used without checking for the CS shader stage. It ended up being "harmless" since the effects also depended on presence of shared variables. Fixes: `5de6c5973a` ("spirv: Implement SPV_KHR_workgroup_memory_explicit_layout") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10529>	2021-05-04 20:54:58 +00:00
Rhys Perry	ee9b744cb5	radv,aco: use nir_address_format_vec2_index_32bit_offset The vec2 index helps the compiler make use of SMEM's SOFFSET field when loading descriptors. fossil-db (GFX10.3): Totals from 126326 (86.37% of 146267) affected shaders: VGPRs: 4898704 -> 4899088 (+0.01%); split: -0.02%, +0.03% SpillSGPRs: 13490 -> 14404 (+6.78%); split: -1.10%, +7.87% CodeSize: 306442996 -> 302277700 (-1.36%); split: -1.36%, +0.01% MaxWaves: 3277108 -> 3276624 (-0.01%); split: +0.01%, -0.02% Instrs: 58301101 -> 57469370 (-1.43%); split: -1.43%, +0.01% VClause: 1208270 -> 1199264 (-0.75%); split: -1.02%, +0.28% SClause: 2517691 -> 2432744 (-3.37%); split: -3.75%, +0.38% Copies: 3518643 -> 3161097 (-10.16%); split: -10.45%, +0.29% Branches: 1228383 -> 1228254 (-0.01%); split: -0.12%, +0.11% PreSGPRs: 3973880 -> 4031099 (+1.44%); split: -0.19%, +1.63% PreVGPRs: 3831599 -> 3831707 (+0.00%) Cycles: 1785250712 -> 1778222316 (-0.39%); split: -0.42%, +0.03% VMEM: 52873776 -> 50663317 (-4.18%); split: +0.18%, -4.36% SMEM: 8534270 -> 8361666 (-2.02%); split: +1.79%, -3.82% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9523>	2021-04-27 15:56:07 +00:00
Rhys Perry	00d52492bf	radv: implement vulkan_resource_reindex Fixes dEQP-VK.spirv_assembly.instruction.compute.variable_pointers.dynamic_offset.select_descriptor_array Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9523>	2021-04-27 15:56:07 +00:00
Samuel Pitoiset	9a0098ee50	radv: cleanup LLVM related includes Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10403>	2021-04-23 11:52:01 +02:00
Samuel Pitoiset	1c702a8239	ac: move ac_lower_indirect_derefs() outside of the LLVM dir Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10403>	2021-04-23 11:52:01 +02:00
Samuel Pitoiset	0b7c442d39	radv: remove old comment about LLVM <= 8 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10403>	2021-04-23 11:52:01 +02:00
Bas Nieuwenhuizen	59c501ca35	radv: Format. Using find ./src/amd/vulkan -regex './.\.\(c\\|h\\|cpp\)' \| xargs -P8 -n1 clang-format --style=file -i Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10091>	2021-04-10 03:31:58 +02:00
Samuel Pitoiset	1ad295ed6f	radv: allow to force VRS rates on GFX10.3 with RADV_FORCE_VRS This allows to force the VRS rates via RADV_FORCE_VRS, the supported values are 2x2, 1x2 and 2x1. This supports the primitive shading rate mode for non GUI elements. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7794>	2021-04-09 14:47:53 +02:00
Bas Nieuwenhuizen	580f1ac473	nir: Extract shader_info->cs.shared_size out of union. It is valid for all stages, just 0 for most of them. In particular mesh/task shaders might be using it. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10094>	2021-04-08 14:39:28 +00:00
Samuel Pitoiset	65bca137bd	aco: implement a workaround for the image load DCC hw bug on GFX10.3 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9919>	2021-04-05 08:54:55 +00:00
Rhys Perry	28d116c889	radv: lower view_index to zero if multiview is disabled Apparently, gl_ViewIndex can be used if multiview is disabled. See https://gitlab.freedesktop.org/mesa/mesa/-/issues/4446 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9718>	2021-03-22 13:38:43 +00:00
Rhys Perry	854b33de2f	radv: lower variables to ssa before nir_propagate_invariant Variables make the propagation more conservative than it needs to be. fossil-db (GFX10.3, based on !8056): Totals from 22102 (15.86% of 139391) affected shaders: SGPRs: 1103833 -> 1104265 (+0.04%); split: -0.06%, +0.10% VGPRs: 851316 -> 849396 (-0.23%); split: -0.58%, +0.35% CodeSize: 34422452 -> 34206052 (-0.63%); split: -0.72%, +0.09% MaxWaves: 291050 -> 291186 (+0.05%); split: +0.23%, -0.18% Instrs: 6749188 -> 6662804 (-1.28%); split: -1.37%, +0.09% Cycles: 63456312 -> 62995828 (-0.73%); split: -0.92%, +0.20% fossil-db (GFX10.3): Totals from 841 (0.60% of 139391) affected shaders: VGPRs: 44444 -> 44500 (+0.13%) CodeSize: 3985604 -> 3987188 (+0.04%); split: -0.11%, +0.15% Instrs: 748847 -> 749174 (+0.04%); split: -0.13%, +0.18% Cycles: 35801628 -> 35825676 (+0.07%); split: -0.23%, +0.30% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8493>	2021-03-19 18:04:00 +00:00
Timur Kristóf	16021e3bae	radv/llvm: Only store TCS outputs where they are really needed. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9201>	2021-03-17 12:42:23 +00:00
Timur Kristóf	540168fd15	radv: Use new, NIR-based I/O lowering. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9201>	2021-03-17 12:42:23 +00:00
Mike Blumenkrantz	1d1c7a965c	radv: use common interfaces for shader modules Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9508>	2021-03-15 21:47:44 +00:00
Marek Olšák	230a6dc55d	ac,radeonsi: add sampler changes for Aldebaran - no 3D and cube textures - no mipmapping - no border color - image_sample is the only supported opcode with a sampler (behaves like _lz) Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9389>	2021-03-10 18:02:27 +00:00
Jason Ekstrand	117668b811	nir: Make nir_ssa_def_rewrite_uses take an SSA value This commit replaces the new_src parameter of nir_ssa_def_rewrite_uses() with an SSA def, removes nir_ssa_def_rewrite_uses_ssa(), and rewrites all the users as needed. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@collabora.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9383>	2021-03-08 16:59:55 +00:00
Rhys Perry	21697082ec	radv: don't shrink image stores for The Surge 2 The game seems to declare the wrong format. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Fixes: `e4d75c22` ("nir/opt_shrink_vectors: shrink image stores using the format") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4347 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9229>	2021-03-03 14:18:37 +00:00
Rhys Perry	cbb5ed476c	nir/opt_shrink_vectors: add option to skip shrinking image stores Some games declare the wrong format, so we might want to disable this optimization in that case. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Fixes: `e4d75c22` ("nir/opt_shrink_vectors: shrink image stores using the format") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9229>	2021-03-03 14:18:37 +00:00
Samuel Pitoiset	859dbf953d	radv: fix exporting SQTT pipelines with LLVM Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9300>	2021-03-01 11:20:19 +00:00
Samuel Pitoiset	884e14d784	radv: store a pointer to the code in radv_shader_variant This will be used by SQTT to dump the disassembly. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9130>	2021-02-23 13:03:39 +00:00
Timur Kristóf	72c348f85c	radv/llvm: Fix reporting LDS stats of tess control shaders. The LLVM backend forgot to set config->lds_size, which is used for reporting LDS stats. Fixes: `cf89bdb9ba` "radv: align the LDS size in calculate_tess_lds_size()" Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9098>	2021-02-18 11:57:22 +00:00
Samuel Pitoiset	f502bdf1ab	radv: only apply the MRT output NaN fixup to non-meta shaders We only want this workaround to be applied for game shaders. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4163 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9048>	2021-02-16 09:03:31 +01:00
Rhys Perry	7ff805a19d	radv,aco: add radv_nir_compiler_options::wgp_mode Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8761>	2021-02-15 13:35:36 +00:00
Rhys Perry	6a770cae4b	radv: round up max_lds_per_simd / lds_per_wave If each SIMD has to get an different number of waves, report the maximum. One example of a situation is when a single-wave workgroup uses more than max_lds_per_simd. This change causes radv_get_max_waves() to report a single wave per SIMD instead of none. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8761>	2021-02-15 13:34:30 +00:00
Rhys Perry	267d7074d9	radv: use lds_{encode,alloc}_granularity This fixes a issue in radv_get_max_waves() where it aligned the LDS allocation to 512 bytes instead of 1024 on GFX10.3. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8761>	2021-02-15 13:34:30 +00:00
Rhys Perry	df61444ac4	radv: switch MaxWaves statistic to wave32 waves Always return the wave32 waves instead of wave64 waves because the wave32 wave count is more precise in the case of wave32. This also fixes usage of lds_per_wave in wave32. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8761>	2021-02-15 13:34:30 +00:00
Rhys Perry	43108824ec	radv: fix max_lds_per_simd on GFX10 num_simd_per_compute_unit was the number of SIMDs per compute unit, but lds_size_per_workgroup was the bytes of LDS per WGP. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8761>	2021-02-15 13:34:30 +00:00
Samuel Pitoiset	e02b1577d0	radv/winsys: remove the radv_amdgpu_winsys_bo::ws indirection This saves a 64-bit pointer from radv_amdgpu_winsys_bo and it's also common to pass a winsys pointer as the first parameter. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8859>	2021-02-08 08:45:38 +01:00
Bas Nieuwenhuizen	fdfd316d5b	radv: Implement VK_KHR_zero_initialize_workgroup_memory. Reuses the pass that was implemented for ANV. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8751>	2021-02-04 01:29:58 +00:00
Rhys Perry	0602d4ec69	radv: correctly enable WGP_MODE for tessellation control Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8811>	2021-02-03 11:27:50 +00:00
Rhys Perry	2338e4ad36	radv: correctly enable WGP_MODE for NGG and GS Previously, we would set WGP_MODE on GFX10+ and then only on GFX10. Because we used bitwise or, the result was WGP_MODE being set on GFX10+. We also set the wrong bit, S_00B848_WGP_MODE instead of S_00B228_WGP_MODE. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8811>	2021-02-03 11:27:50 +00:00
Jason Ekstrand	23ba48a0c7	vulkan: Make the debug_report implementation internal Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8676>	2021-02-01 18:54:25 +00:00
Jason Ekstrand	41318a5819	vulkan: Use vk_object_base::type for debug_report Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8676>	2021-02-01 18:54:25 +00:00
Jason Ekstrand	19d7cf0457	radv: Switch to the common VK_EXT_debug_report Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8676>	2021-02-01 18:54:25 +00:00
Bas Nieuwenhuizen	d938fcefb9	radv: Expose VK_KHR_workgroup_memory_explicit_layout. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8752>	2021-01-29 00:05:36 +01:00
James Park	2e81ed2a47	radv: Pointer arithmetic on char/uint8_t, not void Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7793>	2021-01-26 09:16:15 +00:00
Rhys Perry	af4c6605a8	radv: use nir_opt_access fossil-db (GFX10.3): Totals from 3231 (2.32% of 139391) affected shaders: SGPRs: 168654 -> 167454 (-0.71%); split: -0.72%, +0.00% VGPRs: 152352 -> 152416 (+0.04%) CodeSize: 13872836 -> 13806376 (-0.48%); split: -0.50%, +0.02% MaxWaves: 36640 -> 36634 (-0.02%) Instrs: 2639959 -> 2626852 (-0.50%); split: -0.52%, +0.03% Cycles: 77706000 -> 77496792 (-0.27%); split: -0.28%, +0.01% VMEM: 809496 -> 790847 (-2.30%); split: +2.06%, -4.36% SMEM: 267843 -> 253187 (-5.47%); split: +0.76%, -6.23% VClause: 61353 -> 60426 (-1.51%); split: -1.86%, +0.35% SClause: 95409 -> 92355 (-3.20%); split: -3.24%, +0.04% Copies: 194951 -> 196702 (+0.90%); split: -0.53%, +1.43% Branches: 84320 -> 84331 (+0.01%); split: -0.00%, +0.02% PreSGPRs: 110162 -> 110203 (+0.04%); split: -0.04%, +0.07% PreVGPRs: 127021 -> 127037 (+0.01%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6490>	2021-01-21 18:07:03 +00:00
Rhys Perry	dc19fe0e9f	radv,aco: use deref_buffer_array_length Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3993 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8163>	2021-01-21 11:53:12 +00:00
Rhys Perry	914c61d6c0	radv,aco: don't use MUBUF for multi-channel loads on GFX8 with robustness2 Fixes several dEQP-VK.robustness.robustness2.* tests on GFX8. Generations other than GFX8 don't fail the tests because bounds-checking is done using the index (making it per-vertex). fossil-db (Polaris): Totals from 1387 (0.99% of 140385) affected shaders: (no statistics affected) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Fixes: `03a0d39366` ("aco: use MUBUF in some situations instead of splitting vertex fetches") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7834>	2021-01-20 17:57:56 +00:00
Rhys Perry	12ea0143de	radv: fix max_waves estimation on GFX10.3 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8523>	2021-01-20 16:46:54 +00:00
Rhys Perry	dfe429eb41	nir/loop_unroll: unroll more aggressively if it can improve load scheduling Significantly improves performance of a Control compute shader. Also seems to increase FPS at the very start of the game by ~5% (RX 580, 1080p, medium settings, no MSAA). fossil-db (Sienna): Totals from 81 (0.06% of 139391) affected shaders: SGPRs: 3848 -> 4362 (+13.36%); split: -0.99%, +14.35% VGPRs: 4132 -> 4648 (+12.49%) CodeSize: 275532 -> 659188 (+139.24%) MaxWaves: 986 -> 906 (-8.11%) Instrs: 54422 -> 126865 (+133.11%) Cycles: 1057240 -> 750464 (-29.02%); split: -42.61%, +13.60% VMEM: 26507 -> 61829 (+133.26%); split: +135.56%, -2.30% SMEM: 4748 -> 5895 (+24.16%); split: +31.47%, -7.31% VClause: 1933 -> 6802 (+251.89%); split: -0.72%, +252.61% SClause: 1179 -> 1810 (+53.52%); split: -3.14%, +56.66% Branches: 1174 -> 1157 (-1.45%); split: -23.94%, +22.49% PreVGPRs: 3219 -> 3387 (+5.22%); split: -0.96%, +6.18% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6538>	2021-01-13 18:54:18 +00:00
Daniel Schürmann	fcd2ef23e5	radv: vectorize 16bit instructions Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6680>	2021-01-13 17:46:56 +00:00
Daniel Schürmann	d495a5c183	radv: enable .lower_ineg We already emit ineg as isub most of the time. The results are a bit mixed, but shouldn't really make a difference. A couple of additional copies are needed as isub writes scc. Totals from 5975 (4.29% of 139391) affected shaders: CodeSize: 31508648 -> 31509264 (+0.00%); split: -0.00%, +0.00% Instrs: 6073379 -> 6073531 (+0.00%); split: -0.00%, +0.00% Cycles: 47186280 -> 47187116 (+0.00%); split: -0.00%, +0.00% VMEM: 2528515 -> 2529139 (+0.02%); split: +0.03%, -0.01% SMEM: 596842 -> 596924 (+0.01%); split: +0.02%, -0.00% SClause: 280596 -> 280594 (-0.00%) Copies: 288554 -> 288669 (+0.04%); split: -0.00%, +0.04% PreSGPRs: 240390 -> 240397 (+0.00%) PreVGPRs: 349630 -> 349749 (+0.03%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8425>	2021-01-12 16:14:00 +00:00
Rhys Perry	f17de6a803	radv: add RADV_DEBUG=invariantgeom This can be used to work around a common class of bugs appearing as flickering. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8104>	2021-01-12 15:11:49 +00:00
Daniel Schürmann	bd8e84eb8d	nir: replace .lower_sub with .has_fsub and .has_isub This allows a more fine-grained control about whether a backend supports one of these instructions. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6597>	2021-01-11 19:13:51 +00:00
Rhys Perry	d95fe8a25e	radv: support SpvCapabilitySparseResidency Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7775>	2021-01-08 14:27:07 +00:00
Rhys Perry	4c67423e99	radv: implement is_sparse_texels_resident and sparse_residency_code_and Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7775>	2021-01-08 14:27:07 +00:00
Samuel Pitoiset	7a464f4296	radv: track if VRS is enabled to apply a workaround on GFX10.3 On some chips, gl_FragCoord.z has to be adjusted. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7837>	2020-12-14 16:22:38 +00:00
Samuel Pitoiset	bf69d89b5a	radv: implement VK_KHR_fragment_shading_rate Only supported on GFX10.3+. Attachment Fragment Shading Rate is for later. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7837>	2020-12-14 16:22:38 +00:00
James Park	fe67fe688a	radv: Wrap pragmas with __GNUC__ to fix MSVC Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7866>	2020-12-02 11:27:01 +00:00
Samuel Pitoiset	04ea3d6501	radv: disable WGP_MODE for NGG on GFX10.3 Ported from RadeonSI, reducing the CU mask probably broke WGP mode. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7769>	2020-11-30 09:31:29 +00:00
Tony Wasserka	cba6ec309a	radv: Fix -Wshadow warnings Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7430>	2020-11-20 09:29:19 +00:00
Marek Olšák	cb20d58f45	nir: optimize nir_lower_discard_to_demote to lower discard/demote both ways This is smarter and also lowers demote to discard if helper invocations are not needed. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7586>	2020-11-12 21:02:05 +00:00
Eric Anholt	eda3e4e055	nir/builder: Add a name format arg to nir_builder_init_simple_shader(). This cleans up a bunch of gross sprintfs and keeps the caller from needing to remember to ralloc_strdup. I added a couple of '"%s", name ? name : ""' to radv where I didn't fully trace through whether a non-null name was being passed in. I also took the liberty of adding a basic name to a few shaders (pan_blit, unit tests) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7323>	2020-11-11 08:50:29 -08:00
Eric Anholt	5f992802f5	nir/builder: Drop the mem_ctx arg from nir_builder_init_simple_shader(). This looks a lot more simple now! Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7323>	2020-11-11 08:50:29 -08:00
Eric Anholt	4e9328e3b6	nir_builder: Return a new builder from nir_builder_init_simple_shader(). It's a little inline function, so we can just RAII it for better ergonomics. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7323>	2020-11-11 08:49:49 -08:00
Rhys Perry	86ef139bf4	radv: implement VK_EXT_shader_image_atomic_int64 The extension is only exposed on ACO and LLVM 11+ because of a LLVM bug. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7234>	2020-11-09 18:28:59 +00:00
Daniel Schürmann	fef8a4befd	radv: remove call to nir_lower_pack() The pack_* instructions are now lowered via nir_lower_alu_to_scalar() and unpack_* are not lowered anymore. These bitcasts are no-ops, and lowering prevents some optimizations like vectorization. Note: There are still some *_split variations remaining from different other NIR passes. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6527>	2020-10-28 10:14:26 +00:00
Daniel Schürmann	212be2a04e	radv: lower pack_[64/32]_* via nir_lower_alu_to_scalar() Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6527>	2020-10-28 10:14:26 +00:00
Samuel Pitoiset	6d32fcaaaf	Revert "radv/aco: disable NGG GS support because it randomly hangs the GPU" This reverts commit `b84d1a0c42`. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7213>	2020-10-20 07:11:29 +00:00
James Park	af8d488ea5	util,ac,aco,radv: Cross-platform memstream API POSIX memstream is not available on Windows. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7143>	2020-10-19 03:37:42 -07:00
Daniel Schürmann	2f125908b3	radv,aco: lower_pack_half_2x16 This patch also optimizes pack_half_2x16(a, 0.0). Totals from 1949 (1.43% of 136546) affected shaders (RAVEN): SGPRs: 83376 -> 83336 (-0.05%) CodeSize: 3532144 -> 3512352 (-0.56%) Instrs: 660746 -> 660682 (-0.01%); split: -0.01%, +0.00% Cycles: 6780716 -> 6780472 (-0.00%); split: -0.00%, +0.00% VMEM: 990886 -> 990883 (-0.00%); split: +0.00%, -0.00% SMEM: 150506 -> 150538 (+0.02%); split: +0.05%, -0.03% SClause: 30595 -> 30594 (-0.00%); split: -0.01%, +0.00% Copies: 40801 -> 40729 (-0.18%) PreSGPRs: 52335 -> 52341 (+0.01%); split: -0.03%, +0.04% PreVGPRs: 45104 -> 45097 (-0.02%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6777>	2020-10-14 15:31:38 +00:00
Samuel Pitoiset	e3e8d13ada	radv: move compiler statistics to ACO They are really specific to ACO. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7061>	2020-10-14 15:09:34 +00:00
James Park	28d02b9d3e	ac,amd/llvm,radv: Initialize structs with {0} Necessary to compile with MSVC. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7123>	2020-10-14 12:15:23 +00:00
Samuel Pitoiset	b84d1a0c42	radv/aco: disable NGG GS support because it randomly hangs the GPU Disable ACO NGG GS until the random GPU hangs are fixed (one CTS run == one GPU hang here). No hangs so far after 5 full CTS runs with this disabled. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7108>	2020-10-14 13:52:42 +02:00
Rhys Perry	e1120f274f	nir: move divergence analysis options to nir_shader_compiler_options Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6558>	2020-10-13 12:47:21 +00:00
Rhys Perry	8850a63161	radv/aco,nir/lower_subgroups: don't lower elect ACO can implement this better. fossil-db (Navi): Totals from 33 (0.02% of 135946) affected shaders: SGPRs: 1736 -> 1744 (+0.46%) VGPRs: 1680 -> 1656 (-1.43%) CodeSize: 246160 -> 245916 (-0.10%); split: -0.14%, +0.04% MaxWaves: 449 -> 461 (+2.67%) Instrs: 48301 -> 48266 (-0.07%); split: -0.12%, +0.05% Cycles: 469740 -> 469240 (-0.11%); split: -0.18%, +0.08% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6558>	2020-10-13 12:47:20 +00:00
Rhys Perry	8e981453ed	radv: use radv_optimize_nir() less in radv_link_shaders() fossil-db (Navi): Totals from 11 (0.01% of 137413) affected shaders: CodeSize: 99372 -> 99480 (+0.11%) Instrs: 19119 -> 19110 (-0.05%) Cycles: 222144 -> 222000 (-0.06%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6891>	2020-10-09 15:48:00 +00:00
Rhys Perry	55254f241f	radv: move optimizations in shader_compile_to_nir() to after io_to_scalar This results in at least one less radv_optimize_nir() iteration. No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6891>	2020-10-09 15:47:59 +00:00
Timur Kristóf	17ad2ade82	radv/aco: Use new GS lowering options for ACO with NGG GS. This makes it easier for ACO to implement NGG GS: 1. No need to keep track of vertex and primitive counts. 2. No need to discard incomplete primitives. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6964>	2020-10-09 15:26:14 +02:00
Timur Kristóf	2be99012e9	nir: Add ability to count emitted GS primitives. Add an option to nir_lower_gs_intrinsics which tells it to track the number of emitted primitives, not just vertices. Additionally, also make it per-stream. Also rename the set_vertex_count intrinsic to set_vertex_and_primitive_count. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6964>	2020-10-09 15:26:14 +02:00
Samuel Pitoiset	9aa5c7ce72	radv: use the same NIR compiler options for both compiler backends No changes, they are already similar. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6938>	2020-10-09 11:06:36 +02:00
Samuel Pitoiset	63049b0444	radv/llvm: do not lower sub To match ACO. Totals from 268 (0.20% of 136420) affected shaders: CodeSize: 1214060 -> 1214096 (+0.00%); split: -0.05%, +0.06% Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6938>	2020-10-09 11:06:34 +02:00
Samuel Pitoiset	a41bed243e	radv/llvm: do not lower nir_op_fsat To match ACO. fossilds-db (Navi10): Totals from 20869 (15.30% of 136420) affected shaders: SGPRs: 1851128 -> 1851920 (+0.04%); split: -0.41%, +0.46% VGPRs: 1607360 -> 1608212 (+0.05%); split: -0.20%, +0.25% SpillSGPRs: 267331 -> 261350 (-2.24%); split: -3.67%, +1.43% CodeSize: 155460104 -> 155303508 (-0.10%); split: -0.21%, +0.11% MaxWaves: 179156 -> 178928 (-0.13%); split: +0.48%, -0.60% Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6932>	2020-10-08 12:38:04 +00:00
Tony Wasserka	76add3565e	radv: Fix unaligned memory access when writing specialization map entries Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6728>	2020-10-07 19:50:01 +00:00
Rhys Perry	19561f31a8	radv: remove trailing whitespace Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7043>	2020-10-07 11:53:23 +00:00
Samuel Pitoiset	6b0695c42a	radv/llvm: enable lower_unpack_half_2x16 To match ACO. fossils-db (Navi10): Totals from 294 (0.22% of 136420) affected shaders: SGPRs: 16504 -> 16496 (-0.05%) VGPRs: 19008 -> 19124 (+0.61%); split: -0.06%, +0.67% SpillVGPRs: 511 -> 476 (-6.85%); split: -7.63%, +0.78% CodeSize: 1688852 -> `1687932` (-0.05%); split: -0.10%, +0.05% Scratch: 305152 -> 307200 (+0.67%) MaxWaves: 2877 -> 2878 (+0.03%) Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6937>	2020-10-05 12:42:42 +02:00
Samuel Pitoiset	cdf6d93498	radv/llvm: lower VS IO Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6912>	2020-10-05 08:06:12 +00:00
Samuel Pitoiset	1c4a21328e	radv/llvm: lower TCS IO Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6912>	2020-10-05 08:06:12 +00:00
Samuel Pitoiset	9615273907	radv/llvm: lower TES IO Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6912>	2020-10-05 08:06:12 +00:00
Samuel Pitoiset	6e339418a7	radv/llvm: lower GS IO The LLVM bakend expects 64-bit IO to be lowered to 32-bit but it's unclear if we want to do that for ACO at this point. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6912>	2020-10-05 08:06:12 +00:00
Samuel Pitoiset	df63491594	radv/aco: lower IO for all stages outside of ACO Lowering IO for VS, TCS, TES and GS still have to be done for LLVM. No fossils-db change on NAVI10. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6897>	2020-10-01 14:58:25 +00:00
Jason Ekstrand	d3fa7451a6	anv,radv,tu,val: Call nir_lower_io for push constants Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5275>	2020-09-30 07:20:39 +00:00
Samuel Pitoiset	291cfb1e41	radv: move lowering of FS outputs outside of ACO This enables lowering of FS outputs for RADV/LLVM. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6865>	2020-09-29 14:44:05 +00:00
Samuel Pitoiset	4dae9e53f6	radv: call nir_io_add_const_offset_to_base for FS outputs The store_output of RADV/LLVM expects the const offset to be 0. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6865>	2020-09-29 14:44:05 +00:00
Samuel Pitoiset	778fe02f3b	radv/llvm: call nir_lower_io_to_vector with FS to fix array tests Fixes dEQP-VK.glsl.440.linkage.varying.component.frag_out.*. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6883>	2020-09-29 10:00:50 +00:00
Samuel Pitoiset	1588644543	radv: lower deref operations for global memory for both backends To match ACO. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5316>	2020-09-29 07:24:35 +00:00
Kenneth Graunke	140f53e646	Revert "nir: replace lower_ffma and fuse_ffma with has_ffma" This reverts commit `939ddf3f67`. Intel has a separate pass for fusing FFMAs selectively. We split these flags in commit `1b72c31e1f` and the reasoning still stands. The patch being reverted was just a cleanup, so there should be no issue with reverting it. Acked-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6849>	2020-09-24 13:11:50 -07:00
Marek Olšák	939ddf3f67	nir: replace lower_ffma and fuse_ffma with has_ffma Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6756>	2020-09-24 12:29:11 +00:00
Marek Olšák	771aad3027	nir: split lower_ffma into lower_ffma16/32/64 AMD wants different behavior for each bit size Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6756>	2020-09-24 12:29:11 +00:00
Samuel Pitoiset	05b6612b4e	radv: do not lower UBO/SSBO access to offsets Use nir_lower_explicit_io instead of lowering to offsets. Extra (useless) additions are removed by lowering load_vulkan_descriptor to vec2(src.x, 0). fossils-db (Navi): Totals from 18236 (13.21% of 138013) affected shaders: SGPRs: 1172766 -> 1168278 (-0.38%); split: -0.89%, +0.50% VGPRs: 940156 -> 952232 (+1.28%); split: -0.08%, +1.37% SpillSGPRs: 30286 -> 31109 (+2.72%); split: -0.78%, +3.50% SpillVGPRs: 1893 -> 1909 (+0.85%) CodeSize: 87910396 -> 88113592 (+0.23%); split: -0.35%, +0.58% Scratch: 819200 -> 823296 (+0.50%) MaxWaves: 205535 -> 202102 (-1.67%); split: +0.05%, -1.72% Instrs: 17052527 -> 17113484 (+0.36%); split: -0.32%, +0.67% Cycles: 670794876 -> 669084540 (-0.25%); split: -0.38%, +0.13% VMEM: 5274728 -> 5388556 (+2.16%); split: +3.10%, -0.94% SMEM: 1196146 -> 1165850 (-2.53%); split: +2.06%, -4.60% VClause: 381463 -> 399217 (+4.65%); split: -1.08%, +5.73% SClause: 666216 -> 631135 (-5.27%); split: -5.44%, +0.18% Copies: 1292720 -> 1289318 (-0.26%); split: -1.28%, +1.01% Branches: 467336 -> 473028 (+1.22%); split: -0.67%, +1.89% PreSGPRs: 766459 -> 772175 (+0.75%); split: -0.53%, +1.28% PreVGPRs: 819746 -> 825327 (+0.68%); split: -0.05%, +0.73% Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6202>	2020-09-21 15:37:11 +00:00
Marek Olšák	ac55b1a9a6	nir: get ffma support from NIR options for nir_lower_flrp This also fixes the inverted last parameter of nir_lower_flrp in most drivers. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6599>	2020-09-04 17:06:22 +00:00
Samuel Pitoiset	ebf2576862	radv,aco: disable opts if VK_PIPELINE_CREATE_DISABLE_OPTIMIZATION_BIT Sounds useful to determine if ACO breaks a specific pipeline because of various optimizations. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6487>	2020-09-04 06:59:45 +00:00
Marek Olšák	b7a6333ee4	amd/registers: switch to new generated register definitions Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6423>	2020-09-01 08:45:54 -04:00
Samuel Pitoiset	8301a43f27	radv: dump shader stats with VK_KHR_pipeline_executable_properties Instead of duplicating shader statistics in two different parts in the driver. This also now reports the LDS size in bytes instead of blocks with VK_AMD_shader_info. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6406>	2020-08-31 09:42:25 +02:00
Samuel Pitoiset	0d8ae4ac15	radv: fix setting EXCP_EN for different shader stages While TRAP_PRESENT is always at the same place, EXCP_EN can be different between shader stages. This sets it properly. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6452>	2020-08-26 08:12:22 +02:00
Samuel Pitoiset	8e97a61cfb	radv: enable the trap handler and configure the shader exceptions When TRAP_PRESENT is not enabled, all traps and exceptions are ignored. Only EXCP_EN.mem_viol is currently supported because the other exceptions have to be tested/validated first. EXCP_EN.mem_viol is used to detect any sort of invalid memory access like VM fault. When a memory violation is reported, the hw jumps to the trap handler. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6384>	2020-08-24 11:08:24 +00:00
Samuel Pitoiset	8fd2f5c16d	radv: add a small interface for creating the trap handler shader Similar to the GS copy shader except that NIR is unused because the shader is written directly using ACO IR. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6384>	2020-08-24 11:08:24 +00:00
Jesse Natalie	d3faac7a15	nir: Add options to nir_lower_compute_system_values to control compute ID base lowering If no options are provided, existing intrinsics are used. If the lowering pass indicates there should be offsets used for global invocation ID or work group ID, then those instructions are lowered to include the offset. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5891>	2020-08-21 22:07:05 +00:00
Jesse Natalie	2e1df6a17f	nir: Move compute system value lowering to a separate pass The actual variable -> intrinsic lowering stays where it is, but ops which convert one intrinsic to be implemented in terms of another have moved. Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5891>	2020-08-21 22:07:05 +00:00
Eric Anholt	b3c822a0a8	radv: Move nir_opt_shrink_vectors() into the opt loop. Upcoming changes to opt_undef will result in this pass doing more work and generating vector MOVs that need re-scalarizing (which is inside of the main opt loop). Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6054>	2020-08-20 16:44:08 +00:00
Samuel Pitoiset	e901b901cb	radv,aco: report ACO errors/warnings back via VK_EXT_debug_report To help developers. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6318>	2020-08-20 08:15:08 +02:00
Connor Abbott	c77716294b	radv: Use an input for the layer when lowering input attachments Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5719>	2020-08-19 16:36:43 +00:00
Connor Abbott	d243bf1032	nir/lower_input_attachments: Support loading layer id as an input Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5719>	2020-08-19 16:36:43 +00:00
Connor Abbott	e72895767b	nir/lower_input_attachments: Refactor to use an options struct While we're at it, fold the details of how to load the fragcoord into load_fragcoord(). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5719>	2020-08-19 16:36:43 +00:00
Samuel Pitoiset	11781c0e49	radv: report the spirv-nir logs back to the application Via VK_EXT_debug_report to help debugging various SPIRV->NIR issues. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6223>	2020-08-10 13:19:21 +02:00
Samuel Pitoiset	bea8930468	radv: allow to force-enable LLVM internally for a specific shader stage For ACO debugging purposes, developers only. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6208>	2020-08-07 07:45:06 +00:00
Rhys Perry	6e2e77557e	radv/llvm: enable VK_KHR_memory_model Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6063>	2020-08-05 09:45:54 +00:00
Rhys Perry	da38e99eda	radv/aco: enable VK_KHR_memory_model Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6063>	2020-08-05 09:45:54 +00:00
Eric Anholt	d8c2f896db	amd: Swap from nir_opt_shrink_load() to nir_opt_shrink_vectors(). This should do much more trimming than shrink_load, and is a win on i965's vec4 and nir-to-tgsi. For scalar backends like this that don't need ALU shrinking, it still gets more load intrinsics covered. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6050>	2020-08-03 21:26:45 +00:00
Rhys Perry	cc3bc9493c	radv: use scoped barriers fossil-db (LLVM, Navi): Totals from 843 (0.62% of 135820) affected shaders: SGPRs: 40456 -> 40480 (+0.06%); split: -0.10%, +0.16% VGPRs: 39648 -> 39688 (+0.10%); split: -0.01%, +0.11% CodeSize: 2936164 -> 2932508 (-0.12%); split: -0.21%, +0.09% MaxWaves: 10828 -> 10827 (-0.01%) fossil-db changes seem to be due to SPIR-V -> NIR emitting a workgroup scope shared memory barrier instead of a group_memory_barrier. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5980>	2020-07-29 17:57:13 +00:00
Jason Ekstrand	5c5555a862	nir: Add a find_variable_with_[driver_]location helper We've hand-rolled this loop 10 places and those are just the ones I found easily. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5966>	2020-07-29 17:38:58 +00:00
Jason Ekstrand	caab46c1e9	nir: Take a shader and variable mode in nir_assign_io_var_locations Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5966>	2020-07-29 17:38:58 +00:00
Jason Ekstrand	2956d53400	nir: Add nir_foreach_shader_in/out_variable helpers Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5966>	2020-07-29 17:38:57 +00:00
Rhys Perry	cd392a10d0	radv/aco,aco: use scoped barriers fossil-db (Navi): Totals from 109 (0.08% of 132058) affected shaders: SGPRs: 5416 -> 5424 (+0.15%) CodeSize: 460500 -> 460508 (+0.00%); split: -0.07%, +0.07% Instrs: 87278 -> 87272 (-0.01%); split: -0.09%, +0.09% Cycles: 2241996 -> 2241852 (-0.01%); split: -0.04%, +0.04% VMEM: 33868 -> 35539 (+4.93%); split: +5.14%, -0.20% SMEM: 7183 -> 7184 (+0.01%); split: +0.36%, -0.35% VClause: 1857 -> 1882 (+1.35%) SClause: 2052 -> 2055 (+0.15%); split: -0.05%, +0.19% Copies: 6377 -> 6380 (+0.05%); split: -0.02%, +0.06% PreSGPRs: 3391 -> 3392 (+0.03%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4905>	2020-07-28 16:56:34 +00:00
Daniel Schürmann	af0bc71015	radv: call radv_nir_lower_ycbcr_textures after first optimizations There might still be tex instructions with undef texture/sampler before the first round of optimizations. No pipelinedb changes. Fixes: `14a12b771d` ('spirv: Rework our handling of images and samplers') Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6083>	2020-07-27 10:03:20 +00:00
Samuel Pitoiset	a1b237b9ef	radv: set LDS TCS size at shaders creation for GFX9+ Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5837>	2020-07-24 12:30:03 +00:00
Jason Ekstrand	196db51fc2	anv,turnip,radv,clover,glspirv: Run nir_copy_prop before nir_opt_deref We're about to make the SPIR-V -> NIR path generate a bit more complex SSA chains for certain derefs. This will ensure we don't regress anyone when we start making vec2's of derefs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5278>	2020-07-23 22:43:21 -05:00
Samuel Pitoiset	6c1108d25b	radv: advertise VK_EXT_shader_atomic_float No hw support for float atomic add for buffer and (sparse) images. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6000>	2020-07-22 10:20:58 +02:00
Rhys Perry	ec9920e72b	radv: use lower_shuffle_to_swizzle_amd Affects a few shaders in Detroit: Become Human and Doom Eternal. fossil-db (Navi): Totals from 9 (0.01% of 135946) affected shaders: CodeSize: 31188 -> 25096 (-19.53%) Instrs: 6136 -> 4999 (-18.53%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5695>	2020-07-13 14:11:50 +00:00
Samuel Pitoiset	7324977e42	radv: remove the secure compile support feature Steam was the only client of this feature and it seems no longer used. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5869>	2020-07-13 08:56:44 +02:00
Samuel Pitoiset	26a48d8d35	radv: enable VK_AMD_shader_ballot on GFX6-7 with both compiler backends It gives +1-2 FPS with Doom Eternal on Pitcairn. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5659>	2020-06-29 07:40:05 +00:00
Daniel Schürmann	db0afb3800	radv: change use_aco -> use_llvm We are about to make ACO the default backend. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5445>	2020-06-25 15:16:28 +02:00
Samuel Pitoiset	a102896cff	radv: lower 64-bit dfloor on GFX6 for fixing precision issues GFX6 doesn't support v_floor_f64 and the precision of v_fract_f64 which is used to implement 64-bit floor is less than what Vulkan requires. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5609>	2020-06-25 12:09:08 +00:00
Samuel Pitoiset	c84f11e7b6	radv: lower 64-bit drcp/dsqrt/drsq for fixing precision issues The hardware precision of v_rcp_f64, v_sqrt_f64 and v_rsq_f64 is less than what Vulkan requires. This lowers using the Goldschmidt's algorithm to improve precision. Fixes dEQP-VK.glsl.builtin.precision_double.* on both compiler backends. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5609>	2020-06-25 12:09:08 +00:00
Bas Nieuwenhuizen	aa35670fd0	radv: Make radv_alloc_shader_memory static. Just a cleanup. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5578>	2020-06-24 13:00:02 +00:00
Bas Nieuwenhuizen	a5cb88eea4	radv: Handle mmap failures. Which can happen if we have to many mmaps active in the process. CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5578>	2020-06-24 13:00:02 +00:00
Daniel Schürmann	f03a5f6cac	radv/aco: implement logic64 instead of lowering to make use of the scalar ALU Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5527>	2020-06-22 10:59:45 +00:00
Samuel Pitoiset	51fb3b09dc	radv/aco: enable FP16 features/extensions on GFX9+ This enables shaderFloat16, VK_AMD_gpu_shader_half_float and VK_AMD_gpu_shader_int16. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5347>	2020-06-17 18:12:51 +02:00
Samuel Pitoiset	6f21995f98	radv: add new drirc option radv_enable_mrt_output_nan_fixup To replace NaN from FS with zeros to fix game bugs. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5359>	2020-06-12 14:43:31 +02:00
Samuel Pitoiset	64f2d45c3b	radv/aco: enable shaderInt8 and VK_KHR_shader_float16_int8 on GFX6-GFX7 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5226>	2020-06-09 21:25:38 +00:00
Samuel Pitoiset	be4dd6abd1	radv/aco: enable shaderInt16 on GFX6-GFX7 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5226>	2020-06-09 21:25:38 +00:00
Samuel Pitoiset	b3aee3aa23	radv/aco: enable 8-bit/16-bit storage on GFX6-GFX7 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5226>	2020-06-09 21:25:38 +00:00
Marek Olšák	789cdab3b6	ac: align num_vgprs for gfx10.3 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5383>	2020-06-09 16:17:36 +00:00
Samuel Pitoiset	d7923c74d4	radv/llvm: expose VK_EXT_shader_demote_to_helper_invocation with LLVM 9+ It should already work with the LLVM backend. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5361>	2020-06-09 08:04:23 +02:00
Timothy Arceri	04dbf709ed	nir: add callback to nir_remove_dead_variables() This allows us to do API specific checks before removing variable without filling nir_remove_dead_variables() with API specific code. In the following patches we will use this to support the removal of dead uniforms in GLSL. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4797>	2020-06-03 02:22:23 +00:00
Marek Olšák	116ec85012	ac: rename has_double_rate_fp16 -> has_packed_math_16bit Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5003>	2020-06-02 16:29:25 -04:00
Samuel Pitoiset	b3c0f82841	radv: advertise VK_AMD_texture_gather_bias_lod Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5147>	2020-05-25 08:51:10 +02:00
Samuel Pitoiset	b1f0233077	radv: enable shaderResourceMinLod This feature was missing for unknown reasons. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4989>	2020-05-14 10:05:44 +00:00
Samuel Pitoiset	178adfa6a8	radv: use the base object struct types Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4886>	2020-05-13 08:23:23 +02:00
Samuel Pitoiset	65458528fc	radv: use the common base object type for VkDevice Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4886>	2020-05-13 08:23:23 +02:00
Rhys Perry	5c5c2dd48f	radv/aco: enable 8/16-bit storage and int8/int16 on GFX8+ With this, Doom Eternal should now run with ACO on GFX8+. The generated 8/16-bit storage code is okay but the generated int8/int16 code is currently pretty bad but it works and apparently Doom Eternal doesn't actually use it (even though it requires it). Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4707>	2020-04-24 20:04:39 +01:00
Rhys Perry	03568249f9	radv: allocate larger shader memory slabs if needed Fixes dEQP-VK.ssbo.phys.layout.random.16bit.scalar.13 hang with ACO (features needed for the test are implemented in a later commit) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4639>	2020-04-24 18:52:54 +00:00
Jason Ekstrand	f4addfdde3	spirv: Use nir_const_value for spec constants When we originally wrote spirv_to_nir we didn't have a good scalar value union to handily use so we rolled our own thing for spec constants. Now that we have nir_const_value, we can use that and simplify a bunch of the spec constant logic. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4675>	2020-04-24 09:23:59 +00:00
Jason Ekstrand	a4885df9f8	radv: Properly handle all sizes of specialization constants cc: mesa-stable@lists.freedesktop.org Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4675>	2020-04-24 09:23:59 +00:00
Samuel Pitoiset	19aa68ae31	radv: set missing SHARED_VGPR_CNT for NGG VS and ACO shuffle is implemented with shared VGPRs with ACO and Wave64. Fixes dEQP-VK.subgroups.shuffle.framebuffer.subgroupshuffle*_vertex with Wave64. Fixes: `c24d9522da` ("radv: Enable ACO for NGG VS/TES, but disable NGG for ACO GS.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4595>	2020-04-17 16:11:17 +00:00
Samuel Pitoiset	1d74c6565d	radv: only expose shaderFloat16 for chips with double rate fp16 This disables shaderFloat16 on GFX8 because only GFX9+ supports double rate packed math. This improves consistency regarding other AMD Vulkan drivers and it makes no sense to enable that feature without packed math. This also reduces performance with Wolfeinstein Youngblood if fp16 is forced enabled on GFX8, while it's similar on GFX9. We might re-introduce that feature in the future with ACO support if it ends up being faster and correct. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4453>	2020-04-09 13:34:36 +02:00
Samuel Pitoiset	9f005f1f85	radv: enable lowering of GS intrinsics for the LLVM backend This replaces emit_vertex with: if (vertex_count < max_vertices) { emit_vertex_with_counter vertex_count ... vertex_count += 1 } Which is exactly what NIR->LLVM was doing but at NIR level. This pass is already called by ACO. pipeline-db changes on GFX10: Totals from affected shaders: SGPRS: 1952 -> 1912 (-2.05 %) VGPRS: 2112 -> 2044 (-3.22 %) Code Size: 189368 -> 185620 (-1.98 %) bytes Max Waves: 494 -> 491 (-0.61 %) No pipeline-db changes on other generations. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4182>	2020-04-08 08:24:05 +02:00
Timur Kristóf	db2ee3686d	radv: Print shader stage before disassembly. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3576> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3576>	2020-04-07 11:29:35 +00:00
Rhys Perry	7e6aec6687	radv, aco: collect statistics if requested but executables are not Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/2965> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/2965>	2020-04-03 12:12:08 +00:00
Rhys Perry	ad2703653f	radv: add code for exposing compiler statistics Statistics will be added to ACO in later commits. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/2965>	2020-04-03 12:12:08 +00:00
Marek Olšák	56cc10bd27	ac: unify denorm setting enforcement Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4196>	2020-03-17 20:47:48 +00:00
Samuel Pitoiset	c923de68dd	radv/gfx10: fix required ballot size with VK_EXT_subgroup_size_control If compute shaders require a specific subgroup size (ie. Wave32), we have to use the correct ballot size. Fixes dEQP-VK.subgroups.ballot_other.compute.*_requiredsubgroupSize. Fixes: `fb07fd4e6c` ("radv: implement VK_EXT_subgroup_size_control") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4215> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4215>	2020-03-17 12:45:01 +00:00
Samuel Pitoiset	672d106199	radv/gfx10: fix required subgroup size with VK_EXT_subgroup_size_control If compute shaders require a specific subgroup size (ie. Wave32), we have to return the correct one. Fixes dEQP-VK.subgroups.size_control.compute.required_subgroup_size_*. Fixes: `fb07fd4e6c` ("radv: implement VK_EXT_subgroup_size_control") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4215>	2020-03-17 12:45:01 +00:00
Samuel Pitoiset	2d295ab3f3	radv: add llvm_compiler_shader() helper To match aco_compile_shader(). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4163> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4163>	2020-03-13 10:22:13 +00:00
Samuel Pitoiset	4d991c2de4	radv: remove unnecessary LLVM includes They are already included from src/amd/llvm. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4163>	2020-03-13 10:22:13 +00:00
Samuel Pitoiset	5ea32a6201	radv: remove radv_shader_variant::aco_used Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4163>	2020-03-13 10:22:13 +00:00
Samuel Pitoiset	3fea948177	radv: cleanup occurences of use_aco everywhere Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4163>	2020-03-13 10:22:13 +00:00
Timur Kristóf	967eb23261	radv: Enable lowering dynamic quad broadcasts. This will lower dynamic quad broadcasts into something that both LLVM and ACO can understand. On hardware which supports shuffles, they are lowered to shuffle, on older hardware (GFX6-7) they will get lowered to constant quad broadcasts. Fixes dEQP-VK.subgroups.quad..subgroupquadbroadcast_nonconst_ Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4147> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4147>	2020-03-12 13:16:07 +00:00
Daniel Schürmann	bdd7587414	radv: use nir_lower_discard_to_demote to work around game bugs Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4047>	2020-03-09 12:29:32 +00:00
Samuel Pitoiset	9432eb3e9c	ac: rename lds_size_per_cu to lds_size_per_workgroup It's more accurate. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3975>	2020-03-03 08:16:56 +01:00
Samuel Pitoiset	9204ad70f2	radv/gfx10: adjust the number of VGPRs used to compute waves Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3899>	2020-02-26 07:58:47 +00:00
Samuel Pitoiset	568f150409	radv/gfx10: adjust the LDS size used to compute waves It's 128KB per CU in WGP. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3899>	2020-02-26 07:58:47 +00:00
Samuel Pitoiset	b2531370c9	radv: remove RADV_DEBUG=nosisched and RADV_PERFTEST=sisched They are no longer useful. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3789>	2020-02-13 08:09:13 +00:00
Arcady Goldmints-Orlov	e9f83185a2	Rename nir_lower_constant_initializers to nir_lower_variable_initalizers This is naming is more clear as nir_variables can be initializes not just with a nir_constant but with a pointer to another nir_variable. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3047> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3047>	2020-02-12 15:41:49 +00:00
Samuel Pitoiset	401bfe0283	radv: implement VK_AMD_shader_explicit_vertex_parameter Closes: https://gitlab.freedesktop.org/mesa/mesa/issues/2402 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3578> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3578>	2020-01-29 09:49:50 +00:00
Rhys Perry	72e9a23443	radv/aco: use ACO for GS copy shaders Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/2421>	2020-01-24 13:35:07 +00:00
Rhys Perry	8bad100f83	aco: implement GS on GFX7-8 GS is the same on GFX6, but GFX6 isn't fully supported yet. v4: fix regclass v7: rebase after shader args MR Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/2421>	2020-01-24 13:35:07 +00:00
Rhys Perry	40bb81c9dd	radv/aco,aco: implement GS on GFX9+ v2: implement GFX10 v3: rebase v7: rebase after shader args MR v8: fix gs_vtx_offset usage on GFX9/GFX10 v8: use unreachable() instead of printing intrinsic v8: rename output_state to ge_output_state v8: fix formatting around nir_foreach_variable() v8: rename some helpers in the scheduler v8: rename p_memory_barrier_all to p_memory_barrier_common v8: fix assertion comparing ctx.stage against vertex_geometry_gs Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/2421>	2020-01-24 13:35:07 +00:00
Samuel Pitoiset	12fe19ba3b	radv: advertise VK_AMD_shader_fragment_mask Only for GFX8+ because it's untested on older generations. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3304> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3304>	2020-01-23 10:48:02 +00:00
Samuel Pitoiset	e298e78a01	radv: advertise VK_AMD_shader_image_load_store_lod This extension allows to use LOD with image read/write operations. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2020-01-09 07:58:34 +01:00
Samuel Pitoiset	eda1b77cc2	radv: enable SpvCapabilityImageMSArray The Vulkan spec says that StorageImageMultisample and ImageMSArray SPIRV-V capabilities must be enabled if the shaderStorageImageMultisample feature is supported. This fixes a warning with RenderDoc. Closes: https://gitlab.freedesktop.org/mesa/mesa/issues/2212 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-12-12 18:52:08 +01:00
Samuel Pitoiset	3b51259f06	radv: remove dead shader input/output variables No pipeline-db changes. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-12-04 08:04:05 +01:00
Samuel Pitoiset	c105e6169c	radv,ac/nir: lower deref operations for shared memory This shouldn't introduce any functional changes for RadeonSI when NIR is enabled because these operations are already lowered. pipeline-db (NAVI10/LLVM): SGPRS: 9043 -> 9051 (0.09 %) VGPRS: 7272 -> 7292 (0.28 %) Code Size: 638892 -> 621628 (-2.70 %) bytes LDS: 1333 -> 1331 (-0.15 %) blocks Max Waves: 1614 -> 1608 (-0.37 %) Found this while glancing at some F12019 shaders. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-29 21:58:18 +01:00
Connor Abbott	e7f4cadd02	radv: Replace supports_spill with explict_scratch_args The former was always true and hence dead code. We will want to explicitly declare the ring offset register with ACO, but we also want to declare the scratch offset too, and we can't try to disable it since ACO also supports spilling and the determination of whether spilling has to happen occurs well after setting up registers. So replace supports_spill with something that will actually be used for ACO. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-11-25 14:17:51 +01:00
Connor Abbott	b45c54ff8d	aco: Use radv_shader_args in aco_compile_shader() Reviewed-by: Daniel Schürmann <daniel@schuermann.dev>	2019-11-25 14:17:51 +01:00
Connor Abbott	66c703b3e8	radv: Move argument declaration out of nir_to_llvm Now it's executed for ACO too. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-11-25 14:17:51 +01:00
Rhys Perry	d7b0d9a8d8	radv: enable FP16/FP64 denormals earlier and only for LLVM ACO sets this itself and will have to set it differently in the future to support shaderDenormFlushToZeroFloat64. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-11-15 17:36:21 +00:00
Samuel Pitoiset	519d9b30de	radv: remove useless RADV_DEBUG=unsafemath debug option This option is useless and shouldn't be used at all. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-15 09:07:34 +01:00
Rhys Perry	76544f632d	radv: adjust loop unrolling heuristics for int64 In particular, increase the cost of 64-bit integer division. Fixes huge shaders with dEQP-VK.spirv_assembly.type.scalar.i64.mod_geom , with ACO used for GS this creates shaders requiring a branch with >32767 dword offset. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-07 23:29:12 +00:00
Samuel Pitoiset	d3f9957de4	radv: determine shaders wavesize at pipeline level Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-06 09:20:34 +01:00
Samuel Pitoiset	d4e0bef1bb	radv: fix dumping SPIR-V into hang reports Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-10-30 13:02:08 +00:00
Timothy Arceri	07692f703f	radv: for secure compile exit early from radv_shader_variant_create() We don't have permission to be creating shared memory etc. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-10-26 13:04:12 +11:00
Rhys Perry	7453c1adff	radv: round vgprs/sgprs before calculating max_waves Note that ACO doesn't correctly round SGPR counts on GFX8/GFX9. pipeline-db (ACO/Vega): SGPRS: 11000 -> 11000 (0.00 %) VGPRS: 3120 -> 3120 (0.00 %) Spilled SGPRs: 0 -> 0 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 164328 -> 164328 (0.00 %) bytes LDS: 0 -> 0 (0.00 %) blocks Max Waves: 1125 -> 1000 (-11.11 %) v2: consider wave32 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev>	2019-10-23 19:11:20 +01:00
Samuel Pitoiset	7c50214aab	radv: implement VK_KHR_shader_float_controls This exposes what's required for DX and this is what we already configure. The driver flushes denorms for FP32 and preserves them for FP16/FP64. Note that we can't allow both preserving and flushing denorms because this won't work for merged shaders. This will require LLVM to update the float mode register to make it work. Only enabled on GFX8+ with the LLVM path because it's untested on previous chips and ACO doesn't support it. This extension is required for SPIRV 1.4. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-10-18 16:55:58 +02:00
Samuel Pitoiset	cbd6f0a0c2	radv: implement VK_KHR_shader_clock NIR->LLVM and ACO already support nir_intrinsic_shader_clock. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-10-09 08:43:14 +02:00
Rhys Perry	a87b0f5141	radv/aco,aco: set lower_fmod This simplifies ACO and allows the lowered code to be optimized (in particular, constant folded). Totals from affected shaders: SGPRS: 1776 -> 1776 (0.00 %) VGPRS: 1436 -> 1436 (0.00 %) Spilled SGPRs: 0 -> 0 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 203452 -> 203564 (0.06 %) bytes LDS: 0 -> 0 (0.00 %) blocks Max Waves: 103 -> 103 (0.00 %) At least some of the code size increase seems to be from literals being applied to instructions as a result of constant folding. v2: remove fmod/frem handling in init_context() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev>	2019-10-04 14:00:46 +00:00
Samuel Pitoiset	5ebe1a17e9	radv: enable lower_fmod for the LLVM path This lowers fmod and frem at NIR level like RadeonSI. fmod is already lowered directly in NIR->LLVM, and frem will be lowered by LLVM anyways. This fixes a LLVM crash with: dEQP-VK.glsl.builtin.precision_fp16_storage32b.frem.compute.scalar. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-10-03 18:15:14 +02:00
Samuel Pitoiset	a2a68d551c	radv/gfx10: fix the ESGS ring size symbol Random hangs no longer happen, I'm actually not sure if they were related to this. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-10-02 21:50:40 +02:00
Daniel Schürmann	0fb27f1e5a	radv/aco: Don't lower subtractions 40228 shaders in 20236 tests Totals: SGPRS: 2045512 -> 2046496 (0.05 %) VGPRS: 1430856 -> 1430464 (-0.03 %) Spilled SGPRs: 1077 -> 1077 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 10348 -> 10348 (0.00 %) dwords per thread Code Size: 77202840 -> 77151832 (-0.07 %) bytes LDS: 863 -> 863 (0.00 %) blocks Max Waves: 260729 -> 260754 (0.01 %) Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2019-09-30 09:44:10 +00:00
Timur Kristóf	30f0c0ea7d	radv: Add debug option to dump meta shaders. This new option can help debug shader compiler problems when there are issues with the meta shaders. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-26 13:36:49 +00:00
Timur Kristóf	a4fd8ba7e3	amd/common: Introduce ac_get_fs_input_vgpr_cnt. Add a function called ac_get_fs_input_vgpr_cnt which will return the number of input VGPRs used by an AMD shader. Previously, radv and radeonsi had the same code duplicated, but this commit also allows them to share this code. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-09-26 13:36:49 +00:00
Timur Kristóf	83eebdb507	radv: Set shared VGPR count in radv_postprocess_config. This commit allows RADV to set the shared VGPR count according to the shader config. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-26 13:36:49 +00:00
Rhys Perry	3c966fd688	aco,radv: rename record_llvm_ir/llvm_ir_string to record_ir/ir_string Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-26 11:08:47 +01:00
Rhys Perry	ec8ced9123	radv/aco: return a correct name and description for the backend IR Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-26 11:08:43 +01:00
Rhys Perry	6613b81327	aco,radv/aco: get dissassembly for release builds if requested Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-26 11:08:09 +01:00
Daniel Schürmann	8b78cce433	radv: remove dead shared variables LLVM does this anyway, but for ACO we need to do it in NIR. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-19 12:10:00 +02:00
Daniel Schürmann	281262281b	radv/aco: enable VK_EXT_shader_demote_to_helper_invocation For now, this extension will only be enabled for ACO. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-19 12:10:00 +02:00
Daniel Schürmann	a70a998718	radv/aco: Setup alternate path in RADV to support the experimental ACO compiler LLVM remains default and ACO can be enabled with RADV_PERFTEST=aco. Co-authored-by: Daniel Schürmann <daniel@schuermann.dev> Co-authored-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-19 12:10:00 +02:00
Marek Olšák	0692ae34e9	ac: move ac_get_num_physical_sgprs into radeon_info Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-09-18 14:39:06 -04:00
Marek Olšák	ca43006fd2	ac: move ac_get_max_wave64_per_simd into radeon_info Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-09-18 14:39:06 -04:00
Samuel Pitoiset	5ebc76471c	radv/gfx10: adjust the GS NGG scratch size for streamout It needs more space for multiple streams. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-16 12:08:22 +02:00
Samuel Pitoiset	a15b3bcf1a	radv/gfx10: add an option to switch from legacy to NGG streamout This internal option is turned off by default because NGG streamout still hangs. It seems like it's related to GDS as RadeonSI. That option will be turned on once all issues are resolved. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-16 12:08:22 +02:00
Samuel Pitoiset	538766792d	radv/gfx10: declare a LDS symbol for the NGG emit space This fixes some interactions when NGG GS is enabled. It fixes: - dEQP-VK.clipping.user_defined.clip_cull_distance_dynamic_index.geom - dEQP-VK.tessellation.geometry_interaction.passthrough.* For some reasons, using the computed ESGS ring size randomly hangs with CTS. For now, just use the maximum LDS size for ESGS. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-10 09:27:01 +02:00
Samuel Pitoiset	a9af11f1fa	radv: fill shader info for all stages in the pipeline This shouldn't be in NIR->LLVM because ACO also needs the shader info. This will also help for computing some NGG values that are necessary for declaring LDS symbols. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-10 09:26:45 +02:00
Marek Olšák	d95afd8b9e	radeonsi/gfx10: fix wave occupancy computations Cc: 19.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-09-09 23:43:03 -04:00
Samuel Pitoiset	83499ac765	radv: merge radv_shader_variant_info into radv_shader_info Having two different structs is useless. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-06 15:52:03 +02:00
Vasily Khoruzhick	9367d2ca37	nir: allow specifying filter callback in lower_alu_to_scalar Set of opcodes doesn't have enough flexibility in certain cases. E.g. Utgard PP has vector conditional select operation, but condition is always scalar. Lowering all the vector selects to scalar increases instruction number, so we need a way to filter only those ops that can't be handled in hardware. Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-09-06 01:51:28 +00:00
Connor Abbott	3f5b541fc8	radv: Call nir_propagate_invariant() Without this, invariant qualifiers don't do anything. Together with a fix to the game, this fixes flickering in No Man's Sky. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-09-05 14:05:46 +02:00
Connor Abbott	71a6794200	ac/nir: Enable nir_opt_large_constants vkpipeline-db numbers: Totals: SGPRS: 1740306 -> 1741322 (0.06 %) VGPRS: 1331124 -> 1331712 (0.04 %) Spilled SGPRs: 21201 -> 21316 (0.54 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 256 -> 256 (0.00 %) dwords per thread Code Size: 79022628 -> 78694788 (-0.41 %) bytes LDS: 6500 -> 6500 (0.00 %) blocks Max Waves: 301413 -> 301302 (-0.04 %) Wait states: 0 -> 0 (0.00 %) Totals from affected shaders: SGPRS: 53633 -> 54649 (1.89 %) VGPRS: 53000 -> 53588 (1.11 %) Spilled SGPRs: 3454 -> 3569 (3.33 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 5284232 -> 4956392 (-6.20 %) bytes LDS: 2 -> 2 (0.00 %) blocks Max Waves: 4239 -> 4128 (-2.62 %) Wait states: 0 -> 0 (0.00 %) (The biggest VGPR and max wave regression is due to unrolling a loop, which made the scheduler more aggressive, but in this case it's able to effectively hide latency so it's actually probably a win.) shader-db numbers with radeonsi NIR: Totals: SGPRS: 3526496 -> 3526512 (0.00 %) VGPRS: 2198576 -> 2198576 (0.00 %) Spilled SGPRs: 10463 -> 10463 (0.00 %) Spilled VGPRs: 86 -> 86 (0.00 %) Private memory VGPRs: 3182 -> 2528 (-20.55 %) Scratch size: 3308 -> 2640 (-20.19 %) dwords per thread Code Size: 74117280 -> 74106140 (-0.02 %) bytes LDS: 0 -> 0 (0.00 %) blocks Max Waves: 775846 -> 775844 (-0.00 %) Wait states: 0 -> 0 (0.00 %) Totals from affected shaders: SGPRS: 856 -> 872 (1.87 %) VGPRS: 680 -> 680 (0.00 %) Spilled SGPRs: 0 -> 0 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 654 -> 0 (-100.00 %) Scratch size: 668 -> 0 (-100.00 %) dwords per thread Code Size: 49652 -> 38512 (-22.44 %) bytes LDS: 0 -> 0 (0.00 %) blocks Max Waves: 182 -> 180 (-1.10 %) Wait states: 0 -> 0 (0.00 %) Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-09-05 12:21:46 +02:00
Connor Abbott	5dadbabb47	radv/radeonsi: Don't count read-only data when reporting code size We usually use these counts as a simple way to figure out if a change reduces the number of instructions or shrinks an instruction. However, since .rodata sections aren't executed, we shouldn't be counting their size for this analysis. Make the linker return the total executable size, and use it to report the more useful size in both drivers. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-09-05 12:21:35 +02:00
Samuel Pitoiset	cc3d36b5dd	radv: remove radv_init_llvm_target() helper RADV no longer uses specific LLVM options compared to the common code. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2019-08-30 09:33:21 +02:00
Samuel Pitoiset	8d44f83844	radv: move lowering PS inputs/outputs at the right place At shaders creation, just after NIR linking. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2019-08-30 09:29:31 +02:00
Samuel Pitoiset	151d6990ec	radv: gather info about PS inputs in the shader info pass It's the right place to do that. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2019-08-30 09:29:29 +02:00
Samuel Pitoiset	49f5ddd3ae	radv: make use of has_ls_vgpr_init_bug Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-08-27 08:04:51 +02:00
Samuel Pitoiset	2b9c371575	ac: add cpdma_prefetch_writes_memory to ac_gpu_info Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-27 08:04:29 +02:00
Samuel Pitoiset	1fd60db4a1	ac,radv,radeonsi: remove LLVM 7 support Now that LLVM 9 will be released soon, we will only support LLVM 8, 9 and master (10). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-23 08:12:34 +02:00
Samuel Pitoiset	e73d863a66	radv: allow to enable VK_AMD_shader_ballot only on GFX8+ Scans aren't implemented on SI/CIK. Cc: 19.2 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-08-21 15:14:29 +02:00
Bas Nieuwenhuizen	2e763f7c87	radv: Use correct vgpr_comp_cnt for VS if both prim_id and instance_id are needed. Should take the max of the 2. Fixes: `ea337c8b7e` "radv/gfx10: fix VS input VGPRs with the legacy path" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-08-21 09:38:46 +00:00
Rhys Perry	7740149852	nir: merge and extend nir_opt_move_comparisons and nir_opt_move_load_ubo v2: add to series v3: update Makefile.sources v4: don't remove a comment and break statement v4: use nir_can_move_instr Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-08-12 22:01:30 +00:00
Bas Nieuwenhuizen	8874af8ef4	radv: Keep shader info when needed. This allows enabling the shader info keeping on a per shader basis. Also disables the cache on a per shader basis. Reviewed-by: Dave Airlie <airlied@redhat.com>	2019-08-12 23:00:24 +02:00

... 3 4 5 6 7 ...

654 Commits