KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Rhys Perry	8ff122f8b8	nir: add load_shared2_amd and store_shared2_amd Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13778>	2022-04-13 23:08:07 +00:00
Emma Anholt	16c064dfaf	nir: Add a helper for setting up a nir_ssa_scalar struct. Trivial, but will help users avoid some struct constructions that can be awkward in C. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14865>	2022-03-02 22:28:58 +00:00
Emma Anholt	cac6f633b2	nir/opt_offsets: Use nir_ssa_scalar to chase offset additions. For nir_to_tgsi, I want to be able to fold into the base from a vector load_const, which the ad-hoc scalar chasing couldn't handle. r300: total instructions in shared programs: 1278731 -> 1256502 (-1.74%) instructions in affected programs: 457909 -> 435680 (-4.85%) total flowcontrol in shared programs: 8316 -> 8313 (-0.04%) flowcontrol in affected programs: 5 -> 2 (-60.00%) total temps in shared programs: 213687 -> 213774 (0.04%) temps in affected programs: 13140 -> 13227 (0.66%) total consts in shared programs: 952850 -> 949929 (-0.31%) consts in affected programs: 386352 -> 383431 (-0.76%) Fixes: #5781 Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14309>	2022-01-19 22:28:34 +00:00
Emma Anholt	645ca56425	nir/opt_offsets: Also apply the max offset to top-level constant folding. nir_to_tgsi wants this for disabling folding into shared var accesses at all. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14309>	2022-01-19 22:28:34 +00:00
Emma Anholt	ec4b9909f0	nir/opt_offsets: Disable unsigned wrap checks on non-native-integers HW. Since we don't have 32-bit ints, these checks for 32-bit unsigned wrapping don't help and just reduce optimization opportunities (particularly for DX9 addressing math). Doesn't affect any current consumers. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14309>	2022-01-19 22:28:34 +00:00
Emma Anholt	700d2fbd0a	nir: Add a .base field to nir_load_ubo_vec4. This lets nir-to-tgsi fold the constant offset of addressing calculations into the CONST[] reference, which is important for D3D9-era compatibility: HW of that age has limited uniform space, and if we do the addressing math as math in the shader for dynamic indexing, the nir_load_consts end up taking up uniforms we don't have available. r300: total instructions in shared programs: 1279699 -> 1279167 (-0.04%) instructions in affected programs: 134796 -> 134264 (-0.39%) total instructions in shared programs: 1279699 -> 1279167 (-0.04%) instructions in affected programs: 134796 -> 134264 (-0.39%) total temps in shared programs: 213912 -> 213736 (-0.08%) temps in affected programs: 2166 -> 1990 (-8.13%) total consts in shared programs: 953237 -> 952973 (-0.03%) consts in affected programs: 45980 -> 45716 (-0.57%) Acked-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14309>	2022-01-19 22:28:34 +00:00
Emma Anholt	f6ffefba3e	nir: Apply nir_opt_offsets to nir_intrinsic_load_uniform as well. Doing this for ir3 required adding a struct for limits of how much base to fold in (which NTT wants as well for its case of shared vars), otherwise the later work to lower to the 1<<9 word limit would emit more instructions. The shader-db results are that sometimes the reduction in NIR instruction count results in the fewer sampler prefetches due to the shader being estimated to be shorter (dota2, nexuiz): total instructions in shared programs: 8996651 -> 8996776 (<.01%) total cat5 in shared programs: 86561 -> 86577 (0.02%) Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14023>	2022-01-16 19:11:29 +00:00
Emma Anholt	b024102d7c	freedreno/ir3: Use nir_opt_offset for removing constant adds for shared vars. Saves some work in carchase and manhattan31: instructions in affected programs: 2842 -> 2818 (-0.84%) nops in affected programs: 1131 -> 1105 (-2.30%) non-nops in affected programs: 1236 -> 1238 (0.16%) mov in affected programs: 57 -> 61 (7.02%) dwords in affected programs: 2144 -> 2150 (0.28%) cat0 in affected programs: 1195 -> 1169 (-2.18%) cat1 in affected programs: 151 -> 155 (2.65%) cat2 in affected programs: 142 -> 140 (-1.41%) sstall in affected programs: 190 -> 178 (-6.32%) (ss) in affected programs: 63 -> 63 (0.00%) systall in affected programs: 532 -> 511 (-3.95%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14023>	2022-01-16 19:11:29 +00:00
Rhys Perry	2368c36427	nir/opt_offsets: remove need to loop try_extract_const_addition fossil-db (Sienna Cichlid): Totals from 1 (0.00% of 134572) affected shaders: no stat changes Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14009>	2021-12-03 11:51:49 +00:00
Rhys Perry	5c0fe11072	nir/opt_offsets: fix try_extract_const_addition recursion This initially looks like a miscompilation bug, but I don't think it's actually possible for it to create incorrect code. fossil-db (Sienna Cichlid): Totals from 32 (0.02% of 134572) affected shaders: VGPRs: 1336 -> 1320 (-1.20%) CodeSize: 90552 -> 89468 (-1.20%) Instrs: 17007 -> 16852 (-0.91%); split: -0.92%, +0.01% Latency: 429040 -> 428136 (-0.21%); split: -0.21%, +0.00% InvThroughput: 84966 -> 84572 (-0.46%); split: -0.47%, +0.00% Copies: 1458 -> 1468 (+0.69%); split: -0.07%, +0.75% Branches: 382 -> 384 (+0.52%) PreSGPRs: 970 -> 968 (-0.21%) PreVGPRs: 1029 -> 1011 (-1.75%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14009>	2021-12-03 11:51:49 +00:00
Rhys Perry	49add985ff	nir/unsigned_upper_bound: don't require dominance metadata Instead, determine if it's a merge or loop exit phi. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9808>	2021-06-04 14:14:00 +00:00
Timur Kristóf	744dc74078	nir: Add nir_opt_offsets to fold const adds into load/store offsets. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9201>	2021-03-17 12:42:23 +00:00

12 Commits