KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Georg Lehmann	df4b5914cd	nir/fold_16bit_tex_image: Default to only_fold_all. No driver doesn't use this option. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17757>	2022-07-27 18:57:12 +00:00
Chia-I Wu	ba461f897b	ir3: fix tess param allocation primitive_param takes up 2 vec4's. Remove an align that I don't understand. The align upset Test case 'dEQP-VK.subgroups.ballot_broadcast.graphics.subgroupbroadcast_vec4'.. deqp-vk: ../src/freedreno/ir3/ir3_nir.c:1039: void ir3_setup_const_state(nir_shader , struct ir3_shader_variant , struct ir3_const_state ): Assertion `constoff <= ir3_max_const(v)' failed. with an older version (android11-tests-dev branch) of deqp-vk. This is because ir3_nir_opt_preamble uses the function for the worst case but the function fails to replace the align by the worst case. No regression with dEQP-VK.tess*. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17570>	2022-07-26 01:04:56 +00:00
Chia-I Wu	e3ba8a2f07	ir3: increment constoff right after it is assigned Minor improvement to readability. No real change. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17570>	2022-07-26 01:04:56 +00:00
Georg Lehmann	775578b885	ir3: Stop using nir_legalize_16bit_sampler_srcs. nir_fold_16bit_tex_image's only_fold_all option ensures that there is never a mix of bit sizes. Closes https://gitlab.freedesktop.org/mesa/mesa/-/issues/6899 Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16978>	2022-07-21 19:15:04 +00:00
Georg Lehmann	87e3277b82	nir: Rewrite and merge 16bit tex folding pass with 16bit image folding pass. Allow folding constants/undef sources by sharing more code with the image_store 16bit folding pass. Allow more than one set of sources because RADV wants two, one for G16 (ddx/ddy) and one for A16 (all other sources). Allow folding cube sampling destination conversions on radeonsi/radv because I think the limitation only applies to sources. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16978>	2022-07-21 19:15:03 +00:00
Georg Lehmann	06b33770b6	ir3: Lower alu to scalar if nir_legalize_16bit_sampler_srcs made progress. Fixes: `003327dd95` ("freedreno/ir3: Pass 16-bit sampler coordinates when possible.") Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16978>	2022-07-21 19:15:03 +00:00
Georg Lehmann	9fe382ba96	ir3: Only run 16bit tex NIR passes on a5xx+. 16bit types aren't yet supported on older hardware. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16978>	2022-07-21 19:15:03 +00:00
Marek Olšák	c9ca8abe4f	Change all debug_assert calls to assert Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17403>	2022-07-10 00:50:35 +00:00
Rob Clark	8f77187e3e	freedreno/ir3: Fix GS clip-plane lowering And also handle tess. In all cases, we want to use the VS lowering pass on the last geometry stage. We don't make a special exception for GS like other drivers, because GS gets lowered into a quasi-VS. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17341>	2022-07-08 20:32:35 +00:00
Connor Abbott	7d706af76b	ir3: Fix vectorizer condition for SSBOs SSBO access works very differently from UBO access. Straddling loads/stores isn't an issue, loads/stores instead must be aligned to the element size and can have up to 4 components. We support 16-bit access with SSBOs on a650+, and sometimes the vectorizer tries to create a misaligned 32-bit access when combining 32-bit and 16-bit accesses. The UBO-focused logic didn't reject this, which is now fixed. This fixes a number of VK-CTS regressions on a650+. Fixes: `bf49d4a084` ("freedreno/ir3: Enable load/store vectorization for SSBO access, too.") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17040>	2022-06-23 10:46:31 +00:00
Emma Anholt	6cf2b24eaf	freedreno/ir3: Disable image/ssbo 16-bit conversion folding pre-a6xx. I don't see it in blob dumps, and the reordered args tripped up validation. Fixes: `49dc60efa1` ("freedreno/ir3: Fold 16-bit conversions into image load/store src/dsts.") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17004>	2022-06-22 20:07:36 +00:00
Emma Anholt	49dc60efa1	freedreno/ir3: Fold 16-bit conversions into image load/store src/dsts. Shaves 5 instructions off of one manhattan31 shader. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16616>	2022-06-01 22:19:44 +00:00
Matt Turner	003327dd95	freedreno/ir3: Pass 16-bit sampler coordinates when possible. shader-db highlights from Rob's android shaders: total instructions in shared programs: 769641 -> 767536 (-0.27%) instructions in affected programs: 151139 -> 149034 (-1.39%) total last-baryf in shared programs: 55908 -> 55607 (-0.54%) last-baryf in affected programs: 35219 -> 34918 (-0.85%) total sstall in shared programs: 67074 -> 65767 (-1.95%) total full in shared programs: 36115 -> 36080 (-0.10%) full in affected programs: 203 -> 168 (-17.24%) sstall in affected programs: 9510 -> 8203 (-13.74%) total (ss) in shared programs: 14380 -> 14239 (-0.98%) (ss) in affected programs: 2965 -> 2824 (-4.76%) total systall in shared programs: 92425 -> 91522 (-0.98%) systall in affected programs: 13146 -> 12243 (-6.87%) total (sy) in shared programs: 4330 -> 4314 (-0.37%) (sy) in affected programs: 167 -> 151 (-9.58%) total waves in shared programs: 71580 -> 71584 (<.01%) waves in affected programs: 12 -> 16 (33.33%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16616>	2022-06-01 22:19:44 +00:00
Matt Turner	edb0904775	freedreno/ir3: Move the texture array coord fixup to nir We're going to optimize sampler coordinates to FP16, so we'll need to add the appropriately typed 0.5. Move this to NIR where that information is readily available. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16616>	2022-06-01 22:19:44 +00:00
Emma Anholt	bf49d4a084	freedreno/ir3: Enable load/store vectorization for SSBO access, too. Saves a few ldib/stib instructions in gfxbench vk-5-normal compute shaders by grouping vec4 accesses together. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16616>	2022-06-01 22:19:44 +00:00
Hyunjun Ko	16ea41c901	ir3: handle intrinsic_load_draw_id when scanning driver constants Fixes: #6567 Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16769>	2022-05-31 01:33:55 +00:00
Emma Anholt	7938ce4af3	freedreno/ir3: Lower texture instructions used only for f2f16 to 16-bit. 2.5% improvement in gfxbench vk-5-normal. No obvious change on gl-5-normal. shader-db on Rob's android shaders: total instructions in shared programs: 770644 -> 770595 (<.01%) instructions in affected programs: 14880 -> 14831 (-0.33%) total nops in shared programs: 167784 -> 167860 (0.05%) nops in affected programs: 3351 -> 3427 (2.27%) total non-nops in shared programs: 602860 -> 602735 (-0.02%) non-nops in affected programs: 10523 -> 10398 (-1.19%) total mov in shared programs: 19313 -> 19286 (-0.14%) mov in affected programs: 365 -> 338 (-7.40%) total cov in shared programs: 18075 -> 17978 (-0.54%) cov in affected programs: 566 -> 469 (-17.14%) total dwords in shared programs: 1612848 -> 1612596 (-0.02%) dwords in affected programs: 13882 -> 13630 (-1.82%) total last-baryf in shared programs: 56144 -> 55975 (-0.30%) last-baryf in affected programs: 482 -> 313 (-35.06%) total full in shared programs: 36094 -> 36092 (<.01%) full in affected programs: 10 -> 8 (-20.00%) total sstall in shared programs: 66986 -> 66923 (-0.09%) sstall in affected programs: 1392 -> 1329 (-4.53%) total systall in shared programs: 91244 -> 91072 (-0.19%) systall in affected programs: 1194 -> 1022 (-14.41%) total (sy) in shared programs: 4316 -> 4321 (0.12%) (sy) in affected programs: 19 -> 24 (26.32%) Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16465>	2022-05-19 19:43:36 +00:00
Connor Abbott	69f5be8bad	ir3: Add ir3_shader_variant::compiler And replace uses of ->shader->compiler. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	21e3dd57d3	ir3: Use ir3_shader_variant::type more often Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	f45c86dfb7	ir3, fd, tu: Copy misc. info from ir3_shader to ir3_shader_variant The shader won't be available for deserialized variants, so we need to include all the info we need for compiling variants to be in the variant. Most of the things we dug out of the shader were various bits from nir_shader_info which we move into ir3_shader_variant. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	3e30608ceb	ir3, freedreno, tu: Make ir3_shader_variant store stream_output This reduces the number of uses of ir3_shader which will be gone when we deserialize the variant directly. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Emma Anholt	536c8ee96d	nir/lower_tex: Make the adding a 0 LOD to nir_op_tex in the VS optional. This controls the whole lowering of "make tex ops with implicit derivatives on non-implicit-derivative stages be tex ops with an explicit lod of 0 instead", but it's really hard to describe that in a git commit summary. All existing callers get it added except: - nir_to_tgsi which didn't want it. - nouveau, which didn't want it (fixes regressions in shadowcube and shadow2darray with NIR, since the shading languages don't expose txl of those sampler types and thus it's not supported in HW) - optional lowering passes in mesa/st (lower_rect, YUV lowering, etc) Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16156>	2022-04-28 21:26:08 +00:00
Emma Anholt	1bcd848816	freedreno/ir3: Call nir_opt_find_array_copies(). gfxbench vk-5-normal has a shader that sampels into a texels[] array at the top, then in a loop calls a GLSL function passing texels[] in by value. This resulted in a copy to a temp inside the loop, which got lowered to scratch stores since it was pretty big. By doing find_array_copies(), we notice that it's equivalent to copy_deref, then get to copy-propagate from the array at the top. Then we only have to set up the scratch array outside of the loop and load_scratch from it in the called function inside the loop. This also causes there to be less spilling, stps 1144 -> 354 and ldps 826->36. However, it doesn't seem to change performance on the test. So, while this seems to be an improvement for the shader, and we could maybe even do better by rematerializing the txl samples inside the loop instead of storing the texture fetches to scratch in the first place, it doesn't currently seem worth pursuing more optimization of this shader. No change on freedreno shader-db. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15982>	2022-04-19 18:45:29 +00:00
Connor Abbott	fccc35c2de	ir3: Add preamble optimization pass Now that everything is plumbed through, we can tie it together. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	7ad57d9af1	ir3: Don't count reserved user consts in ubo_state::size Previously we included the reserved user consts (for Vulkan push constants) as part of the pushed UBO contents, but that led to a problem because when calculating the worst-case space for UBOs we didn't factor in the reserved user consts. We'll have the same problem when doing the same thing in the preamble optimization pass. Stop including the reserved size in ubo_state::size, and have ir3_setup_consts() add it in instead, so we won't forget to add it anywhere. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Ilia Mirkin	aac7028b58	freedreno/ir3: support a4xx compute differences Mainly the workgroup id comes injected via consts by the hardware (or CP), and we must make room for it, otherwise the driver won't know where to put it. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14794>	2022-03-05 03:21:05 -05:00
Ilia Mirkin	6fb5e64ead	freedreno/ir3: support a4xx in load/store buffer/image emission Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14794>	2022-03-05 03:21:05 -05:00
Ilia Mirkin	37306ba3f1	freedreno/ir3: remove bogus tg4 -> tex lowering pass It can't be done. This just provides bad results. The blob had a comparable approach where they fixed up coordinates, but that also can't work with a separate texture definition with nearest filtering. By then, might as well provide a unswizzled variant instead, and using native functionality. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14670>	2022-03-03 18:26:43 +00:00
Danylo Piliaiev	0b2da9d795	ir3: Limit the maximum imm offset in nir_opt_offset for shared vars STL/LDL have 13 bits to store imm offset. Fixes crash in CS compilation in Monster Hunter World. Fixes: `b024102d7c` ("freedreno/ir3: Use nir_opt_offset for removing constant adds for shared vars.") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14968>	2022-02-14 20:56:46 +00:00
Danylo Piliaiev	f917c73528	ir3: opt_deref in opt loop to remove unnecessary tex casts Otherwise we may be left with such casts: vec1 32 ssa_72 = deref_var &shadow_map (uniform sampler2D) vec1 32 ssa_73 = deref_cast (texture2D )ssa_72 (uniform texture2D) vec1 32 ssa_74 = deref_cast (sampler )ssa_72 (uniform sampler) vec1 32 ssa_76 = (float32)tex ssa_73 (texture_deref), ssa_74 (sampler_deref), ssa_75 (coord), ssa_64 (comparator) And crash in ycbcr lowering since we aren't able to follow deref chain. Fixes crash in GFXBench Aztec Ruins Vulkan tests. See issue: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5945 Cc: mesa-stable Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14819>	2022-02-01 17:02:36 +00:00
Connor Abbott	0248644c89	ir3,tu: Enable subgroup shuffles and relative shuffles We still don't use the fast path for relative shuffles, that's left for future work. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14412>	2022-02-01 16:27:46 +00:00
Emma Anholt	f6ffefba3e	nir: Apply nir_opt_offsets to nir_intrinsic_load_uniform as well. Doing this for ir3 required adding a struct for limits of how much base to fold in (which NTT wants as well for its case of shared vars), otherwise the later work to lower to the 1<<9 word limit would emit more instructions. The shader-db results are that sometimes the reduction in NIR instruction count results in the fewer sampler prefetches due to the shader being estimated to be shorter (dota2, nexuiz): total instructions in shared programs: 8996651 -> 8996776 (<.01%) total cat5 in shared programs: 86561 -> 86577 (0.02%) Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14023>	2022-01-16 19:11:29 +00:00
Emma Anholt	b024102d7c	freedreno/ir3: Use nir_opt_offset for removing constant adds for shared vars. Saves some work in carchase and manhattan31: instructions in affected programs: 2842 -> 2818 (-0.84%) nops in affected programs: 1131 -> 1105 (-2.30%) non-nops in affected programs: 1236 -> 1238 (0.16%) mov in affected programs: 57 -> 61 (7.02%) dwords in affected programs: 2144 -> 2150 (0.28%) cat0 in affected programs: 1195 -> 1169 (-2.18%) cat1 in affected programs: 151 -> 155 (2.65%) cat2 in affected programs: 142 -> 140 (-1.41%) sstall in affected programs: 190 -> 178 (-6.32%) (ss) in affected programs: 63 -> 63 (0.00%) systall in affected programs: 532 -> 511 (-3.95%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14023>	2022-01-16 19:11:29 +00:00
Danylo Piliaiev	e1f89a1da2	ir3: Make nir compiler options a part of ir3_compiler This would allow for sub-gens to have different options. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13986>	2022-01-10 13:20:39 +02:00
Connor Abbott	1a1e25dcce	tu, ir3: Support runtime gl_SubgroupSize in FS We already supported it in the CS for computing the subgroup ID, but soon we'll need it in the FS too. Vertex stages will always have it lowered. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13960>	2022-01-10 10:58:28 +00:00
Connor Abbott	e6e34883a9	ir3: Add wavesize control This allows the wavesize to be controlled per-shader. This will be used by VK_EXT_subgroup_size_control, and freedreno will also need it if legacy ARB_shader_ballot is to be supported (since it forces a wavesize of 64 or less). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13960>	2022-01-10 10:58:28 +00:00
Connor Abbott	30237b3d9c	ir3: Pass shader to ir3_nir_post_finalize() We'll need to add shader-specific lowering for gl_SubgroupSize. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13960>	2022-01-10 10:58:28 +00:00
Danylo Piliaiev	3dfd4230bb	ir3,turnip: Enable subgroup ops support in all stages on gen4 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13817>	2021-12-07 20:45:53 +00:00
Ilia Mirkin	13fb587b8a	freedreno/ir3: indicate that clipdist arrays are in use We expose the compact array cap, which means that we get compact clipdist arrays. Indicate this to the lowering pass so that it works for gl_ClipDistance from fs, among others. Fixes, among others, on a420, tests/spec/glsl-1.30/execution/clipping/fs-clip-distance-interpolated.shader_test Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13891>	2021-11-28 02:55:58 -05:00
Rob Clark	f5ce806ed7	freedreno/ir3: Add wide load/store lowering Lower load/store for vectors wider than 4. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	0a35ba5c43	freedreno/ir3: Move lower_idiv_options Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	e544a9db16	freedreno/ir3: Add support for load_kernel_input Used for function arguments to compute kernels (ie. OpenCL). Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	1e9f27f37f	freedreno/ir3: Handle MESA_SHADER_KERNEL Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	064c806d23	freedreno/ir3: Add load/store_global lowering Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	f45b7c58c4	freedreno/ir3: Lower 64b phis Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Danylo Piliaiev	bee9212efb	ir3/freedreno: add 64b undef lowering Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	2d65e6f56d	freedreno/ir3: 64b intrinsic lowering Both for OpenCL and VK_KHR_buffer_device_address Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	96b37b9546	freedreno/ir3: Remove used unused Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13160>	2021-10-04 15:10:07 +00:00
Emma Anholt	1cc8523c5c	freedreno/ir3: Use LDIB for coherent image loads on a5xx. If the coherent flag is present, then we need to not have an incoherent cache between us and previous stores to the image that were also decorated as coherent. isam apparently (unsurprisingly) goes through a texture cache. Use ldib instead, so that we don't get the wrong result. We would need a similar fix for a4xx, but that uses ldgb and I don't have hardware to test on. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12704>	2021-09-03 18:17:07 +00:00
Danylo Piliaiev	6373dd814a	ir3/a6xx,freedreno: account for resinfo return size dependency on IBO_0_FMT On a6xx resinfo returns size in bytes divided by IBO_0_FMT format size (not just size in dwords), we have to shift it back to NIR meaning which is size in bytes. Make freedreno use 16b buffers when they are supported in order to be able to depend on hardware capabilities when lowering ssbo size. Fixes: `ce1a381e57` "turnip: enable VK_KHR_16bit_storage on A650" Fixes cts tests: dEQP-VK.ssbo.unsized_array_length.float_offset_explicit_size dEQP-VK.ssbo.unsized_array_length.float_no_offset_whole_size dEQP-VK.compute.basic.write_multiple_unsized_arr_single_invocation and many more Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12485>	2021-09-01 16:09:20 +03:00

1 2 3 4

193 Commits