KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Danylo Piliaiev	4f0a7dbdea	tu: implement dynamic primitive restart enable Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10434>	2021-08-12 08:01:30 +00:00
Danylo Piliaiev	0b8ed9b55c	tu: implement dynamic depth bias enable Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10434>	2021-08-12 08:01:30 +00:00
Danylo Piliaiev	804f1b5664	tu: declare VK_EXT_extended_dynamic_state2 but leave it disabled Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10434>	2021-08-12 08:01:30 +00:00
Roland Scheidegger	513fb5438b	aux/cso: try harder to keep cso state in sync on cso context unbind Before `a73cb106a6`, cso contexts were never reused, but now that they are we need to be extra careful that the state in the cso context and in the pipe context matches even after an unbind, since when the cso context is reused the state might otherwise get out of sync (as there is no concept of "initial state", basically cso always relied on the default values being the same both in cso and the drivers). This fixes some errors we've seen internally with lavapipe. Fixes: `a73cb106a6` ("aux/cso: split cso_destroy_context into unbind and a destroy functions") Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12261>	2021-08-12 01:57:34 +00:00
Hyunjun Ko	4b8f4bae01	tu: allow dynamic primitive topology with tessellation This allows setting VK_PRIMITIVE_TOPOLOGY_PATCH_LIST dynamically when tessellation is used. If other values are set via vkCmdSetPrimitiveTopologyEXT in that case, the validation layer can detect the issue. Fixes dEQP-VK.pipeline.extended_dynamic_state..topology_patch Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12299>	2021-08-12 01:37:01 +00:00
Jason Ekstrand	0bbc3afb9a	ci: Build ANV on Android Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5211 Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12308>	2021-08-11 23:57:52 +00:00
Jason Ekstrand	f3aabe068c	anv/android: Drop unused device variables Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12308>	2021-08-11 23:57:52 +00:00
Jason Ekstrand	16a9cf6915	anv/android: Pass the correct pointer type to vk_errorf Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12308>	2021-08-11 23:57:52 +00:00
Jason Ekstrand	279fe1ae6d	intel/perf: Use a char array for OA perf query data drm_i915_query_perf_config::data is an unsized array and declaring a struct containing an unsized array that isn't at the end is a GNU extension which trips up Android builds. Instead, stuff both into a char array of the appropriate size. This emulates what you'd normally do to allocate one of these with malloc only on the stack. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12308>	2021-08-11 23:57:52 +00:00
Jason Ekstrand	c858d30833	intel/vec4: Don't override emit_urb_write_opcode for SNB GS The gfx6_gs_visitor overrides emit_urb_write_opcode but with a different function signature. This causes warnings with -Woverloaded-virtual. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12308>	2021-08-11 23:57:52 +00:00
Jason Ekstrand	a4d1ae7017	meson/glsl: Only run GLSL tests if can_run_host_binaries() Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12308>	2021-08-11 23:57:52 +00:00
Jason Ekstrand	dd88fb33fe	meson: Intel drivers don't require expat on Android Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12308>	2021-08-11 23:57:52 +00:00
Jason Ekstrand	ce1a66fe3e	meson/intel: Don't build genxml tests on Android They require expat which we don't have on Android. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12308>	2021-08-11 23:57:52 +00:00
Ilia Mirkin	fdc5138256	st/mesa: fix pbo download store image type There's generally not too big of a difference between 1D (default) and buffer, but can't hurt to be accurate. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12319>	2021-08-11 23:32:39 +00:00
Paulo Zanoni	03d17461f3	iris: use add_bo_to_batch() when adding batch->bo Again, we don't need all the dependency checking, seqno incrementing and duplicate tracking for batch->bo. Just use the unchecked version. This commit is not particularly significant since it really just saves us a check in the iris_use_pinned_bo() hot path, but since we already have the helper function, why not? v2: - (turns out the answer to "why not?" is because the patch had a bug) - Call ensure_exec_obj_space() since batch chaining can happen and doesn't guarantee pre-reserved space (Ken). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12194>	2021-08-11 15:04:52 -07:00
Paulo Zanoni	155a7a9b0f	iris: add the workaround_bo directly to the batch Don't use iris_use_pinned_bo(), go directly with add_bo_to_batch(), skipping every check. This allows us to early return from iris_use_pinned_bo when the workaround bo is used, saving us the call to find_validation_entry() which ends up doing nothing except iterating over every bo in the batch. Also don't bother with ensure_exec_obj_space() since we just reset the batch and this is the second BO we're adding to it. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12194>	2021-08-11 15:04:52 -07:00
Paulo Zanoni	d07fa0ef60	iris: extract the code that adds BOs to the batch lists We want to add a new caller, so extract this first. v2: kflags can never contain EXEC_OBJECT_WRITE (Ken). v3: Rebase after s/gtt_offset/address/. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12194>	2021-08-11 15:04:52 -07:00
Paulo Zanoni	805c5dcd59	iris: assign bo->index to the aux map BOs too I don't see these BOs being searched for in the benchmarks I tested so I don't think this should improve anything. On the other hand, it shouldn't hurt either since it's just an extra assignment. I want to unify both places where we have this code into a single function and the lack of the bo->index assignment was the only difference between the two places. So first we make both functions the same and in the next commit we'll unify things. This should make bisecting easier in case I'm wrong. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12194>	2021-08-11 15:04:52 -07:00
Paulo Zanoni	f7102bed69	iris: don't bump the seqno for the workaround_bo The last_seqnos list is used by iris_emit_buffer_barrier_for() and as far as I can understand we don't emit barriers for the workaround bo, so don't even bother doing the atomic operations required to bump the workaround_bo seqno list. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12194>	2021-08-11 15:04:52 -07:00
Eric Engestrom	89c22ef230	docs: update calendar and link releases notes for 21.1.7 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12331>	2021-08-11 21:16:24 +00:00
Eric Engestrom	d6cfc97156	docs: add release notes for 21.1.7 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12331>	2021-08-11 21:16:24 +00:00
Dave Airlie	8a81d14271	intel/vec4: sel.cond writes the flags on Gfx4 and Gfx5 This is the equivalent of idr's intel/fs: sel.cond writes the flags on Gfx4 and Gfx5 except for the vec4 backend. This fixes buggy rendering seen with crocus on a qt trace. v2 (idr): Trivial whitespace change. Add unit tests. v3: Fix type in comment in unit tests. Noticed by Jason and Priit. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Iron Lake total instructions in shared programs: 8183077 -> 8184543 (0.02%) instructions in affected programs: 198990 -> 200456 (0.74%) helped: 0 HURT: 1355 HURT stats (abs) min: 1 max: 8 x̄: 1.08 x̃: 1 HURT stats (rel) min: 0.29% max: 6.00% x̄: 0.99% x̃: 0.70% 95% mean confidence interval for instructions value: 1.04 1.12 95% mean confidence interval for instructions %-change: 0.96% 1.03% Instructions are HURT. total cycles in shared programs: 238967672 -> 238962784 (<.01%) cycles in affected programs: 4666014 -> 4661126 (-0.10%) helped: 406 HURT: 314 helped stats (abs) min: 4 max: 54 x̄: 22.46 x̃: 18 helped stats (rel) min: <.01% max: 12.80% x̄: 1.82% x̃: 0.65% HURT stats (abs) min: 2 max: 112 x̄: 13.48 x̃: 12 HURT stats (rel) min: <.01% max: 7.82% x̄: 0.81% x̃: 0.16% 95% mean confidence interval for cycles value: -8.60 -4.98 95% mean confidence interval for cycles %-change: -0.87% -0.49% Cycles are helped. GM45 total instructions in shared programs: 4986888 -> 4988354 (0.03%) instructions in affected programs: 198990 -> 200456 (0.74%) helped: 0 HURT: 1355 HURT stats (abs) min: 1 max: 8 x̄: 1.08 x̃: 1 HURT stats (rel) min: 0.29% max: 6.00% x̄: 0.99% x̃: 0.70% 95% mean confidence interval for instructions value: 1.04 1.12 95% mean confidence interval for instructions %-change: 0.96% 1.03% Instructions are HURT. 
total cycles in shared programs: 153577826 -> 153572938 (<.01%) cycles in affected programs: 4666014 -> 4661126 (-0.10%) helped: 406 HURT: 314 helped stats (abs) min: 4 max: 54 x̄: 22.46 x̃: 18 helped stats (rel) min: <.01% max: 12.80% x̄: 1.82% x̃: 0.65% HURT stats (abs) min: 2 max: 112 x̄: 13.48 x̃: 12 HURT stats (rel) min: <.01% max: 7.82% x̄: 0.81% x̃: 0.16% 95% mean confidence interval for cycles value: -8.60 -4.98 95% mean confidence interval for cycles %-change: -0.87% -0.49% Cycles are helped. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12191>	2021-08-11 13:09:32 -07:00
Ian Romanick	38807ceeae	intel/fs: sel.cond writes the flags on Gfx4 and Gfx5 On Gfx4 and Gfx5, sel.l (for min) and sel.ge (for max) are implemented using a separate cmpn and sel instruction. This lowering occurs in fs_visitor::lower_minmax which is called very, very late... a long, long time after the first calls to opt_cmod_propagation. As a result, conditional modifiers can be incorrectly propagated across sel.cond on those platforms. No tests were affected by this change, and I find that quite shocking. After just changing flags_written(), all of the atan tests started failing on ILK. That required the change in cmod_propagation (and the addition of the prop_across_into_sel_gfx5 unit test). Shader-db results for ILK and GM45 are below. I looked at a couple before and after shaders... and every case that I looked at had experienced incorrect cmod propagation. This affected a LOT of apps! Euro Truck Simulator 2, The Talos Principle, Serious Sam 3, Sanctum 2, Gang Beasts, and on and on... :( I discovered this bug while working on a couple new optimization passes. One of the passes attempts to remove condition modifiers that are never used. The pass made no progress except on ILK and GM45. After investigating a couple of the affected shaders, I noticed that the code in those shaders looked wrong... investigation led to this cause. v2: Trivial changes in the unit tests. v3: Fix type in comment in unit tests. Noticed by Jason and Priit. v4: Tweak handling of BRW_OPCODE_SEL special case. Suggested by Jason. 
Fixes: `df1aec763e` ("i965/fs: Define methods to calculate the flag subset read or written by an fs_inst.") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Tested-by: Dave Airlie <airlied@redhat.com> Iron Lake total instructions in shared programs: 8180493 -> 8181781 (0.02%) instructions in affected programs: 541796 -> 543084 (0.24%) helped: 28 HURT: 1158 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 0.35% max: 0.86% x̄: 0.53% x̃: 0.50% HURT stats (abs) min: 1 max: 3 x̄: 1.14 x̃: 1 HURT stats (rel) min: 0.12% max: 4.00% x̄: 0.37% x̃: 0.23% 95% mean confidence interval for instructions value: 1.06 1.11 95% mean confidence interval for instructions %-change: 0.31% 0.38% Instructions are HURT. total cycles in shared programs: 239420470 -> 239421690 (<.01%) cycles in affected programs: 2925992 -> 2927212 (0.04%) helped: 49 HURT: 157 helped stats (abs) min: 2 max: 284 x̄: 62.69 x̃: 70 helped stats (rel) min: 0.04% max: 6.20% x̄: 1.68% x̃: 1.96% HURT stats (abs) min: 2 max: 48 x̄: 27.34 x̃: 24 HURT stats (rel) min: 0.02% max: 2.91% x̄: 0.31% x̃: 0.20% 95% mean confidence interval for cycles value: -0.80 12.64 95% mean confidence interval for cycles %-change: -0.31% <.01% Inconclusive result (value mean confidence interval includes 0). GM45 total instructions in shared programs: 4985517 -> 4986207 (0.01%) instructions in affected programs: 306935 -> 307625 (0.22%) helped: 14 HURT: 625 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 0.35% max: 0.82% x̄: 0.52% x̃: 0.49% HURT stats (abs) min: 1 max: 3 x̄: 1.13 x̃: 1 HURT stats (rel) min: 0.12% max: 3.90% x̄: 0.34% x̃: 0.22% 95% mean confidence interval for instructions value: 1.04 1.12 95% mean confidence interval for instructions %-change: 0.29% 0.36% Instructions are HURT. 
total cycles in shared programs: 153827268 -> 153828052 (<.01%) cycles in affected programs: 1669290 -> 1670074 (0.05%) helped: 24 HURT: 84 helped stats (abs) min: 2 max: 232 x̄: 64.33 x̃: 67 helped stats (rel) min: 0.04% max: 4.62% x̄: 1.60% x̃: 1.94% HURT stats (abs) min: 2 max: 48 x̄: 27.71 x̃: 24 HURT stats (rel) min: 0.02% max: 2.66% x̄: 0.34% x̃: 0.14% 95% mean confidence interval for cycles value: -1.94 16.46 95% mean confidence interval for cycles %-change: -0.29% 0.11% Inconclusive result (value mean confidence interval includes 0). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12191>	2021-08-11 13:09:20 -07:00
Dave Airlie	593ad9294b	crocus: align staging resource pitch on gen4/5 to allow BLT usage. Aligning the pitch to 4 bytes allows the BLT engine to be used for transfers to/from these surfaces. Fixes: `f3630548f1` ("crocus: initial gallium driver for Intel gfx 4-7") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12329>	2021-08-11 19:21:01 +00:00
Dave Airlie	58e95f99bd	crocus/blt: add pitch/offset checks to fix blt corruption I lost these in my conversion from i965 but they are necessary. This should fix corruption in qt fonts as seen in the minecraft launcher. Fixes: `f3630548f1` ("crocus: initial gallium driver for Intel gfx 4-7") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12329>	2021-08-11 19:21:01 +00:00
Alyssa Rosenzweig	cb2e712b9e	pan/bi: Unit test DISCARD+FCMP fusing Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12205>	2021-08-11 14:59:26 -04:00
Alyssa Rosenzweig	d74ab1e4d9	pan/bi: Fuse DISCARD with conditions Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12205>	2021-08-11 14:59:26 -04:00
Alyssa Rosenzweig	4f347e96b8	pan/bi: Add fclamp unit tests The negative cases here did not pass before this series, showing the bug in the clamp optimization. By introducing the FCLAMP pseudo op, the bug is fixed. Let's ensure we don't regress. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12205>	2021-08-11 14:59:26 -04:00
Alyssa Rosenzweig	ac636f5adb	pan/bi: Use FCLAMP pseudo op for clamp prop Map nir_op_fsat/etc to FCLAMP pseudo ops, instead of FADD. There are significantly fewer knobs on FCLAMP, meaning significantly fewer things to get wrong. This fixes two(!) classes of bugs: * Swizzles (failing to lower/compose swizzles on clamps) * Numerical bugs (incorrectly treating +0.0 as an additive identity) Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12205>	2021-08-11 14:59:26 -04:00
Alyssa Rosenzweig	a98790fa96	pan/bi: Add optimizer unit tests Writing these tests brought to light the cluster of bugs fixed in the previous commits. Now that things work, let's ensure they stay working. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12205>	2021-08-11 14:59:26 -04:00
Alyssa Rosenzweig	89e452883a	pan/bi: Use FABSNEG pseudo ops for modifier prop Simplifies pattern matching. This commit by itself fixes multiple numerical issues -- the previous fabsneg check failed to check the round mode or the sign of the zero. That will break Vulkan/OpenCL. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12205>	2021-08-11 14:59:26 -04:00
Alyssa Rosenzweig	b2b0dca4ce	pan/bi: Add shader equality helper for unit tests Optimizer tests really are global. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12205>	2021-08-11 14:59:26 -04:00
Alyssa Rosenzweig	f3d8ec1484	pan/bi: Fuse abs/neg more on Valhall Some of these Bifrost restrictions may be skipped on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12205>	2021-08-11 14:59:25 -04:00
Alyssa Rosenzweig	c27af4ad36	pan/bi: Simplify bi_compose_clamp Realized this trick when reversing Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12205>	2021-08-11 14:59:25 -04:00
Alyssa Rosenzweig	cfd73b4542	pan/bi: Unit test new constant folding patterns Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12205>	2021-08-11 14:59:25 -04:00
Alyssa Rosenzweig	ec76119dfb	pan/bi: Constant fold texturing lowerings This ensures we can constant fold the ALU ops used to lower: * explicit LOD calculations * array textures * texture offsets * multisample indices Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12205>	2021-08-11 14:59:24 -04:00
Alyssa Rosenzweig	6fa1479479	pan/va: Document IEEE 754 conformance of clamps These rules are not obvious. But they turn out to be exactly what's required by the spec. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12205>	2021-08-11 14:58:58 -04:00
Alyssa Rosenzweig	cdfb5d3788	panfrost: Test srcdst + dstsrc blending Validates the prior commit. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12152>	2021-08-11 18:15:52 +00:00
Alyssa Rosenzweig	0228ccf857	panfrost: Leverage Bifrost's 2src blend factor Bifrost adds a value for the C factor equaling 2src. This does not correspond directly to API blend modes so it is not too useful in general. However, it's required for srcdest + destsrc blending to be done in hardware instead of a blend shader. GFXbench uses that blend mode, so it must be important ;-) Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12152>	2021-08-11 18:15:52 +00:00
Alyssa Rosenzweig	95e306dc84	panfrost: Add basic fixed-function blending tests Add unit tests for the fixed-function blending helpers in pan_blend.c. Each test consists of a Porter-Duff blend mode and the associated hardware state. In this commit, we add tests for the most common modes. For motivation, this code has NOT been properly tested in CI. True, functional correctness of the blend module as a whole is tested by dEQP-GLES3.functional.fragment_ops.blend.* among other integration tests. However, this testing is insufficient to check for regressions. Crucially, the following broken patch would clear CI: bool pan_can_fixed_function(...) { return false; } In that case, blend shaders are used 100% of the time, which will regress performance horribly but still pass dEQP. The only clue something went wrong would be some traces changing checksum due to the fixed-function blender producing slightly different output than equivalent blend shaders. By unit testing the fixed blend path, we ensure we always use the fixed-function path when we expect it to. Similarly, using incorrect values for the blend metadata may not affect functional correctness but will increase power consumption. Let's check all the data we export to drivers. Note: due to additive commutativity, there are many pairs of equivalent Mali blend modes. Unfortunately, the vendor is... inconsistent about how to resolve ambiguous modes. Our algorithm for computing modes is correct; the "preferred" values are left in comments since otherwise our tests fail despite correct code. I want to blame Bifrost for this, but Midgard was patient zero. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12152>	2021-08-11 18:15:52 +00:00
Alyssa Rosenzweig	95187c03f1	panfrost: Simplify blend_factor_constant_mask Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12152>	2021-08-11 18:15:52 +00:00
Alyssa Rosenzweig	01a1b253b4	panfrost: Fix is_opaque when blend_enable=false Needed to pass the "replace" unit test. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12152>	2021-08-11 18:15:52 +00:00
Alyssa Rosenzweig	b7af56f2ad	panfrost: Add blend helper packing the equation This is more convenient for the Gallium driver and easier to test. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12152>	2021-08-11 18:15:52 +00:00
Alyssa Rosenzweig	7eb2559198	panfrost: Use _PU for non-dithered formats This is required to disable dithering on a per-draw basis when OPAQUE output is used (bypassing the blender which normally uses the round_to_framebuffer_precision flag to do the same). This functionally reverts: `ebc07f4b2f` ("panfrost: Remove padded unorm blendable formats") `fae90a7940` ("panfrost: Always pick dithered tb formats") while adding the functionality to make them useful. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12152>	2021-08-11 18:15:52 +00:00
Alyssa Rosenzweig	07607c8443	panfrost: Remove unused #defines Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12328>	2021-08-11 18:00:45 +00:00
Alyssa Rosenzweig	3ec5e2b6a7	panfrost: Add LINEAR debug option Useful to cross off CPU texture tiling as the source of bugs. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12328>	2021-08-11 18:00:45 +00:00
Alyssa Rosenzweig	3958f00215	pan/bi: Add a noopt debug option To rule out buggy optimization passes when debugging. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12328>	2021-08-11 18:00:45 +00:00
Alyssa Rosenzweig	ff03f096bf	pan/bi: Make bi_opt_push_ubo optional It's an optimization pass -- omitting it should not cause MMU faults (!). Make sure the UBO push mask is set regardless of whether the pass is called, and just call the pass when required. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12328>	2021-08-11 18:00:45 +00:00
Lionel Landwerlin	01b0935d31	nir/lower_shader_calls: remove empty phis This is confusing opt_cse. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `8dfb240b1f` ("nir: Add raytracing shader call lowering pass.") Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11953>	2021-08-11 15:10:07 +03:00
Marcin Ślusarz	a1b7a5fad6	zink: use nir_shader_instructions_pass in nir_lower_dynamic_bo_access Changes: - nir_metadata_preserve(..., nir_metadata_dominance) is called only when pass makes progress - nir_metadata_preserve(..., nir_metadata_all) is called when pass doesn't make progress Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12324>	2021-08-11 11:23:30 +00:00

143588 Commits
