KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Alyssa Rosenzweig	5ee4852239	panfrost: Add texture features enum to v9.xml Required to query texture features on Valhall. It's technically the same as previous Malis (except for narrow ASTC), but conceptually it's different as plane descriptors have superseded indexed pixel formats for block compressed textures. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16035>	2022-04-26 14:56:22 +00:00
Erik Faye-Lund	2d597c9f9d	panvk: quiet non-conformant warning on ci This helper has built-in support to be quieted, which seems like a good idea to do on ci. We're already setting the env var in the CI environment, so no need to do that here. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16033>	2022-04-26 13:47:34 +00:00
Alyssa Rosenzweig	2ca8b014d1	pan/bi: Implement pack_uvec[24]_to_uint This maps nicely to Mali's weirdo MKVEC, so implement it rather than scalarizing. The scalarization wants an extract implemented which we don't have. Fixes dEQP-VK.glsl.builtin.function.pack_unpack.* Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16120>	2022-04-26 00:18:19 +00:00
Vinson Lee	7898721cce	panfrost: Fix memory leaks on unit test failures. Change ASSERT_EQ to EXPECT_EQ to avoid aborting before freeing memory. Fix defects reported by Coverity Scan. Resource leak (RESOURCE_LEAK) leaked_storage: Variable tiled going out of scope leaks the storage it points to. leaked_storage: Variable linear going out of scope leaks the storage it points to. leaked_storage: Variable ref going out of scope leaks the storage it points to. Fixes: `bb6c14a697` ("panfrost: Unit test u-interleaved tiling routines") Suggested-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Signed-off-by: Vinson Lee <vlee@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16127>	2022-04-25 20:18:42 +00:00
Alyssa Rosenzweig	4e111c259c	panvk: Lower shared memory Copy the code. Fixes workgroup tests, now compute kernels should work properly on Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16123>	2022-04-25 16:29:31 +00:00
Alyssa Rosenzweig	179d9e3511	panvk: Handle empty shaders gracefully Fixes dEQP-VK.spirv_assembly.instruction.compute.shader_default_output.int.uninitialized Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16123>	2022-04-25 16:29:31 +00:00
Alyssa Rosenzweig	c9b33fe7dc	pan/bi: Implement fquantize2f16 Implement as f2f32(f2f16(x)) with the conversions in flush-to-zero mode. Accessing flush-to-zero mode on Bifrost is nontrivial: it is specified per-clause, rather than per-instruction. I've opted to pipe support for ftz clauses through the scheduler. This solution has two nice properties: * It uses the native hardware for flushing subnormals, avoiding extra lowering. * It's "smart" about scheduling around FTZ requirements, meaning we get good code generated even for a shader that e.g. quantizes a vector. With an unrelated scheduler fix, the V2F32_TO_V2F16/+F16_TO_F32 operation fits in a single tuple, minimizing the overhead of the special FTZ clause. We'll have to do something a bit different for Valhall (FLUSH.f32), but we'll worry about when we actually have PanVK brought up on Valhall. Fixes dEQP-VK.spirv_assembly.instruction.compute.opquantize. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16123>	2022-04-25 16:29:31 +00:00
Alyssa Rosenzweig	dfcb2f0699	panvk: Consider primitive restart in index buffer walks Fixes: dEQP-VK.pipeline.input_assembly.primitive_restart.index_type_uint32.line_strip Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16140>	2022-04-25 16:10:41 +00:00
Alyssa Rosenzweig	68c05c660b	panvk: Respect line width Fixes: dEQP-VK.pipeline.input_assembly.primitive_topology.index_type_uint16.line_list Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16140>	2022-04-25 16:10:41 +00:00
Alyssa Rosenzweig	6bef59f56b	pan/bi: Simplify bi_propagate_pass_flags Worklist-based dataflow analysis is overkill, we can just do DFS. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16045>	2022-04-23 18:16:08 +00:00
illiliti	67af7e2b40	Use proper types for meson objects Fix invalid usage of meson objects which violates official meson specification and thus breaks muon, an implementation of meson written in C. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15715>	2022-04-18 13:03:08 +03:00
Icecream95	f267000240	panfrost: Stop overallocating compressed textures The line stride uses the number of bytes in the entire block, so both the width and height need to be reduced for compressed textures so that the surface stride is calculated correctly. Fixes: `051d62cf04` ("panfrost: Add a pan_image_layout_init() helper") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6286 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15989>	2022-04-16 16:14:11 +00:00
Alyssa Rosenzweig	4d460c9fdd	panfrost: Rework PAN_GPU_ID mechanism Originally, PAN_GPU_ID was checked in the driver itself. I added the mechanism to run Bifrost shader-db on my Midgard laptop. There was no drm-shim support at this point, and this was a reasonable stop gap at the time. Nowadays, we have a competent drm-shim implementation, which wholly replaces this use case. So PAN_GPU_ID is only useful for drm-shim. Let's pull the code into drm-shim and get it out of the driver. This allows NDEBUG drm-shim builds to work properly. While we're at it, the default emulated GPU is changed from Mali-T860 to Mali-G52. This reflects our shifting development priorities. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Suggested-by: Icecream95 <ixn@disroot.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15930>	2022-04-16 11:50:51 -04:00
Alyssa Rosenzweig	fc1397d1d7	docs/panfrost: Document u-interleaved tiling The optimized routine documented the tiling format together with the software algorithm. The reference implementation wants the tiling format alone documented. Let's break out the high level documentation into somewhere centrally accessible, and refocus the comments in the optimized file on the optimization. This documentation is linked bidirectionally with both implementations, so it should be easy to find. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15803>	2022-04-16 13:43:05 +00:00
Alyssa Rosenzweig	bb6c14a697	panfrost: Unit test u-interleaved tiling routines These are complex and not used in all dEQP paths. They're also easy to unit test, so add some tests to prevent regressions. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15803>	2022-04-16 13:43:05 +00:00
Alyssa Rosenzweig	944049dedc	panfrost: Document arguments to tiling routines The exact semantics of these routines are subtle, although they match what Gallium wants. We're about to add unit tests. Add some comments that make it obvious what it is we expect these routines to do. (In particular, it's not a general region-of-interest copy, it's a region-of-interest of the tiled image and the entire linear staging image.) Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15803>	2022-04-16 13:43:05 +00:00
Alyssa Rosenzweig	1d8ea77645	panfrost: Assert alignment in tiling routine We depend on this invariant implicitly. Make that dependence explicit so we don't get confused and add broken unit tests. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15803>	2022-04-16 13:43:05 +00:00
Alyssa Rosenzweig	4c31795f7a	panfrost: Add 48-bit and 96-bit tiling routines Thanks to our macros and some type trickery, our generic tiling routines are type-generic. So we just need to add 48-bit and 96-bit texel types to tile. Note we only support power-of-two bit sizes in the specialized tile routines for the sake of replacing a multiplication with a shift. With this change, all pixel formats supported in Panfrost are tileable. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15803>	2022-04-16 13:43:05 +00:00
Icecream95	5da8c280b7	panfrost: Remove BO mapping from import BOs will be mapped when needed, so there is no need to mmap BOs when importing them. Fixes crashes when exporting a non-AFBC resource and importing it back in the same context. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15818>	2022-04-10 01:51:04 +00:00
Alyssa Rosenzweig	1fb4427a7a	pan/bi: Imply round mode most of the time Much less noisy, and provides a path to further improvements. There is a slight behaviour change: int-to-float conversions now use RTE instead of RTZ. For 32-bit opcodes, this affects conversions of integers with magnitude greater than 2^23 by at most 1 ulp. As this behaviour is unspecified in GLSL, this change is believed to be acceptable. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15187>	2022-04-07 18:03:57 +00:00
Alyssa Rosenzweig	a747708b9d	pan/bi: Use should_skip in bi_builder generation To avoid further code duplication. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15187>	2022-04-07 18:03:57 +00:00
Alyssa Rosenzweig	de37f75554	pan/bi: Mark some opcodes as default round-to-zero Conversions to integer have different rounding rules. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15187>	2022-04-07 18:03:57 +00:00
Alyssa Rosenzweig	24d072fd6a	pan/bi: Don't use funny round modes in tests To prepare for defeaturing round modes, replace uses of round-to-positive with round-to-even in our unit tests. This doesn't meaningfully impact test coverage; there is no way to generate that round mode. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15187>	2022-04-07 18:03:57 +00:00
Alyssa Rosenzweig	9a521433ae	panfrost: Don't set a default for blend count Unnecessary. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15797>	2022-04-07 17:43:37 +00:00
Alyssa Rosenzweig	e39f9aa883	panfrost: Hide AFBC on Valhall The relevant data structures have been shuffled a bit. We need to wire up AFBC for Valhall; however, that's out of scope for the initial bring up. Just hide it so we can build. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15795>	2022-04-07 15:11:04 +00:00
Alyssa Rosenzweig	76e0a7c49e	panfrost: Adapt pan_shader.h for Valhall Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15795>	2022-04-07 15:11:04 +00:00
Alyssa Rosenzweig	4d04437a3b	panfrost: Add shader_stage helper For Valhall, which specifies these in the shader program descriptor. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15795>	2022-04-07 15:11:04 +00:00
Alyssa Rosenzweig	783d27645c	panfrost: Add panfrost_make_resource_table helper For Valhall drivers. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15795>	2022-04-07 15:11:04 +00:00
Alyssa Rosenzweig	3baceb0ca4	panfrost: Hide parts of pan_encoder.h for Valhall These pertain to data structures that no longer exist. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15795>	2022-04-07 15:11:04 +00:00
Alyssa Rosenzweig	d11351c616	panfrost: Control tiler memory usage Ensure we don't hit OOM when rendering at 8192x8192 on Valhall by disabling the smallest bin size of the hierarchy mask. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15795>	2022-04-07 15:11:04 +00:00
Alyssa Rosenzweig	586c3b9e35	panfrost: Handle stencil texturing on Valhall Use a Bifrost compatible path. It's not clear this is optimal but it passes the tests and is no worse than what we do on Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15795>	2022-04-07 15:11:04 +00:00
Alyssa Rosenzweig	db20152c8a	panfrost: Handle Valhall texturing Surface descriptors have been replaced by plane descriptors, which facilitate the intermediate layout of textures. This allows for more sophisticated handling of texture compressions, of particular to interest to copy_image. However, it requires a considerable amount of new logic to handle. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15795>	2022-04-07 15:11:04 +00:00
Alyssa Rosenzweig	813d355e9e	pan/va: Add LD_TILE.v3.f16 packing test This tests the staging register behaviour. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	2b57303eaf	pan/bi: Consider flow control in DCE We don't want to remove instructions like `NOP.wait` on Valhall; this would be tantamount to deleting barriers. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	6e69c3369c	pan/bi: Don't lower vertex_id for malloc IDVS Based on hardware behaviour, it appears vertex_id is zero-based with the legacy geometry flow but not with the new malloc IDVS flow. Since the geometry flow is per-shader (not per-machine), there's not a good way to communicate this to NIR. Rather than trying to shoehorn this obscure detail into NIR, just do the lowering ourselves instead of in NIR. It's not much more code anyway. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	ccdec68aee	pan/bi: Report whether workgroups can be merged This flag gates a Valhall hardware optimization for compute shaders. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	170d5a012e	pan/bi: Avoid masked writes for now Our swizzle lowering optimizations depend on replication of scalar fp16. This holds on Bifrost (at least for now), but not on Valhall which has proper support for write masks. For now, enforce Bifrost-compatible behaviour as we do not make use of the write masks on Valhall yet. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	ba5b63f642	pan/bi: Generate LD_BUFFER on Valhall Replace LOAD.ubo with LD_BUFFER since the .ubo segment doesn't exist on Valhall. We could do this with a lowering pass instead but this is probably fine. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	f487c09045	pan/bi: Make psiz variants Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	9497a6a3c9	pan/bi: Lower gl_PointSize to FP16 on Valhall It is unclear if FP32 point sizes are supported on Valhall -- I can't get the DDK to use them at any rate. Always lower them to FP16 and store them as FP16 for hardware use. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	8e6f97b5fc	pan/bi: Force psiz to mediump To match driver behaviour. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	90d3f55aff	pan/bi: Set table for Valhall LD_ATTR Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	f79e33f82f	pan/bi: Emit Valhall-style varying stores Varying stores was changed in Valhall. Rather than using attribute descriptors like on Bifrost and Midgard, on Valhall we store to memory directly with hardware-allocated buffers. This requires a new implementation of store_output, with special provisions for writing gl_PointSize from a position shader. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	14e7796d4c	pan/bi: Emit Valhall-style varying loads Memory-allocated IDVS requires special varying load instructions that take an offset into the hardware-allocated varying buffer, as opposed to a varying slot. Emit these instructions. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	a1d5bf0a7a	pan/bi: Track whether the malloc IDVS flow is used This affects what instructions the fragment shader uses. Will be used for the legacy geometry flow in blit shaders. Whether that is a good idea remains to be seen, admittedly. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	9758555481	pan/bi: Handle Valhall texturing in helper analysis Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	ae79f6765a	pan/bi: Emit Valhall texture instructions Valhall uses an updated version fo the TEXC path. To avoid disrupting the existing Bifrost code, add a new Valhall-specific texture path that generates the new-style texture instructions. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	9091b6261b	pan/bi: Specialize BLEND emit for Valhall Fewer arguments compared to Bifrost; the corresponding information is encoded in a Valhall-specific blend shader prologue instead. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	a8afe6f7fb	pan/bi: Waits before tilebuffer access on Valhall On Bifrost, this is handled in the scheduler. Until we grow a Valhall scheduler, add a NOP with the appropriate flow control. This is correct but carries a small performance cost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	fe9cf1d0a4	pan/bi: Fix spilling on Valhall We need a slightly different idiom on Valhall, since the segment modifiers no longer exist but we now have an immediate offset. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	a2916aa934	pan/bi: Mark LD_TILE as w=format This tracks register usage more precisely for LD_TILE, which is an encoding difference on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:44 +00:00
Alyssa Rosenzweig	b371e509da	panfrost: Add a table for images For the default Valhall ABI. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:44 +00:00
Alyssa Rosenzweig	0864b15047	pan/va: Allow small constants in register pairs They are zero extended 32->64-bit. Allow this. Noticed debugging spilling on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15756>	2022-04-07 09:28:43 -04:00
Alyssa Rosenzweig	862a19aa4b	pan/va: Add flow control lowering pass Something an instruction has two logic flow controls, namely wait + reconverge. These are orthogonal -- we need to insert a NOP to handle this. Add a lowering pass that works out flow control to replace the ad hoc previous va_pack_flow. Fixes dEQP-GLES31.functional.ssbo.layout.single_basic_type.shared.lowp_vec3. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15756>	2022-04-07 09:27:32 -04:00
Alyssa Rosenzweig	4f5e0e1874	pan/va: Don't truncate slots Causes BARRIER not to work. Fixes: `f45654af59` ("pan/va: Add packing routines") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15756>	2022-04-07 09:27:32 -04:00
Alyssa Rosenzweig	9b727944a0	pan/va: Model image load instructions These use the attribute pipe, the new versions of LD_ATTR_TEX, but reading texture descriptors instead of attribute descriptors unlike their Bifrost predecessors. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15756>	2022-04-07 09:27:32 -04:00
Alyssa Rosenzweig	12da32c31f	pan/va: Pack LEA_TEX_IMM Mostly automatic. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15756>	2022-04-07 09:27:32 -04:00
Alyssa Rosenzweig	1f4cb6d99f	pan/va: Add indirect LEA_{ATTR, TEX} For parity with Bifrost. We might need these for images. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15756>	2022-04-07 09:27:32 -04:00
Alyssa Rosenzweig	c6fdafe5ea	pan/bi: Model Valhall image loads Like LD_ATTR_TEX. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15756>	2022-04-07 09:27:32 -04:00
Jason Ekstrand	cc78a3a820	panvk: Enable VK_EXT_debug_report and VK_EXT_debug_utils They're both implemented in common code as long as you use vk_command_buffer. Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15560>	2022-04-06 01:18:23 +00:00
Roman Stratiienko	61f94fff0d	panfrost: Don't crash on panfrost_bo_create() with size==0 invocation 1. Clamp bucket_index from both ends to avoid returning negative index. 2. Return NULL in case BO allocation/fetching failure to prevent invalid bo mapping. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6247 Signed-off-by: Roman Stratiienko <r.stratiienko@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15748>	2022-04-05 13:08:51 +03:00
Tomeu Vizoso	51ab4ef4be	Revert "ci/panfrost: Disable some jobs due to a lab failure" Machines are back. This reverts commit `b5fd1fddd9`. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15730>	2022-04-04 08:59:38 +02:00
Tomeu Vizoso	b5fd1fddd9	ci/panfrost: Disable some jobs due to a lab failure A dispatcher has had a hard disk failure and these devices are offline now. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15711>	2022-04-01 14:37:03 +00:00
Alyssa Rosenzweig	0c1fde956b	panfrost: Add Valhall compressed formats We need to map to the interchange format, since there is no longer a pixel format for the memory layout. Use this new format table on v9. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	42b9295fa6	panfrost: Restrict Z/S formats for Valhall Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	ac51142bab	panfrost: Handle Valhall IDVS in job_uses_tiling Valhall-style IDVS uses a distinct job type which has to be handled separately. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	5b056971a3	pan/bi: Preload r60/r61 for MSAA + blend shader This is the sort of leakiness I hate about blend shaders. MSAA + blend shader is somewhat obscure but gets hit in the CTS. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	913a7ed41a	pan/bi: Use ID accessors for LEA_ATTR This is more portable. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	3e08672369	pan/bi: Split out load/store to thread storage We need a slightly different idiom on Valhall, so let's first split the helpers for encapsulation. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	5e76467d5d	pan/bi: Use nir_tex_instr_has_implicit_derivative Rather tracking it ourselves. Slightly shorter. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	fc81415f47	pan/bi: Call Valhall backend passes on v9 These are required to lower the IR into something suitable for Valhall packing. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	ac5eb4934b	pan/bi: Fix write_mask size We really need to stop tying the IR to Bifrost... Fixes: `3c817ed511` ("pan/bi: Model Valhall texture instructions") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	12edaae64a	pan/bi: Add .shadow modifier to TEX_GATHER Although TEX_GATHER looks like TEX_FETCH, it does support shadow comparators like TEX_SINGLE. Model this in the IR so we can use it. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	f31208f778	pan/va: Lower BLEND to call blend shaders Do this as late as possible. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	cb76cc1f1d	pan/va: Add packing unit tests Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	18bf478f1e	pan/va: Add shader-db support Reports the common subset from Bifrost, as well as Mali offline compiler style normalized cycle counts. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	8bc268f2d5	pan/va: Implement the cycle model Will feed into shader-db reporting, and maybe other things eventually. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	8a258a685c	pan/va: Test instruction selection lowerings Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	1745c89312	pan/va: Lower branch offsets Logic is lifted from bi_layout.c, adapted to work on instructions (not clauses) and for Valhall's off-by-one semantic which is annoyingly different than Bifrost. (But the same as Midgard -- Bifrost was annoyingly different than Midgard!) Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	9a9b20e652	pan/va: Add instruction selection lowering pass Valhall removes certain instructions from Bifrost, requiring a canonical lowering. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	b796d32564	pan/va: Add constant lowering pass Valhall has a lookup table for common constants. Add a pass to take advantage of it, lowering away immediate indices. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	b8f912e547	pan/va: Validate FAU before packing These are pre-conditions required for packing. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	fd1906afea	pan/va: Add FAU validation Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	676d9c9441	pan/va: Add unit tests for ADD_IMM optimizations Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	13d7ca1300	pan/va: Optimize add with imm to ADD_IMM Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	f45654af59	pan/va: Add packing routines Mostly manual since Valhall is regular. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	edf284215d	pan/va: Add helpers for swapping bitwise sources Annoyingly different from Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	619566dea1	pan/va: Generate header containing enums We already collect enums in the ISA description XML. Export them for use in the compiler backend, particularly the packing code. Usually we'd use Mako for templating. In this case, the script is so trivial a template engine didn't seem worth it. (The obvious version with Mako was about 10 lines longer than just prints and f-strings used here.) Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Suggested-by: Icecream95 <ixn@disroot.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	7ad98ae96e	pan/va: Build opcode info structures Filled out the new structures from XML. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	40ed485e32	pan/va: Permit encoding more flags Missed the first time around. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	76487c7eb4	pan/va: Unify flow control Group together dependency waits and flow control into a single enum. This simplifies the code, clarifies some detail, and ensures consistency moving forward. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	cf6d1a81f6	pan/va: Add Bifrost-style LD_VAR instructions For use in the legacy non-MALLOC_IDVS flow. Especially useful in blit shaders. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	295b802f64	pan/va: Add LD_VAR_BUF instructions Like LD_VAR_BUF_IMM but indirect. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	e8590e0d04	pan/va: Add ST_TILE instruction Encoded like LD_TILE, required for some MSAA blend shaders. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	fa841273d4	pan/bi: Rename I->action to I->flow For consistency with the Valhall ISA. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	f5585700be	pan/bi: Model LD_VAR_BUF instructions These are indirect versions of LD_VAR_BUF_IMM, taking their index in bytes. Used for indirect varying loads (the NIR lowering is inefficient). Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	97a13d6424	pan/bi: Augment ST_TILE with register format To model its Valhall incarnation. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	c7f6b973b2	pan/bi: Check return addresses in blend shaders Required on Valhall, where jumping to 0x0 doesn't automatically terminate the program. Luckily the check is free there too. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	1b7d7ebbab	pan/bi: Allow branch_offset on BLEND Required to model BLEND accurately on Valhall, where it encodes a special relative branch... Midgard style! Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	cfde0275e4	pan/bi: Model Valhall-style A(CMP)XCHG Handled consistently with computational atomics. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	90867e8204	pan/bi: Add ATOM_RETURN pseudo-instruction Allows modeling Valhall's atomics better. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	7983a0d0dc	pan/bi: Rename PATOM_C to ATOM This is basically what's native on Valhall. Use the Valhall naming for the pseudo-instruction on Bifrost for consistency. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	b70a7c97bb	pan/bi: Gate late DCE/CSE on "optimize" Otherwise we can end up with unlowered ATOM.i32 on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	3485b8dc78	pan/bi: Use consistent modifier lists in packing If there are modifiers only used by pseudo instructions, not the real instructions, bi_packer can get out-of-sync with bi_opcodes, causing hard-to-debug issues. Do the stupid-simple thing to ensure this doesn't happen. This may be a temporary issue, depending whether ISA.xml and the IR get split out for better Valhall support. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Boris Brezillon	face6f6ddc	panvk: Stop using VK_OUTARRAY_MAKE() We're trying to replace VK_OUTARRAY_MAKE() by VK_OUTARRAY_MAKE_TYPED() so people don't get tempted to use it and make things incompatible with MSVC (which doesn't support typeof()). Suggested-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15522>	2022-03-25 11:00:03 +00:00
Alyssa Rosenzweig	d8b5d45dc1	pan/va: Add atomic instructions Equivalent to their Bifrost counterparts, with a much more sensible encoding. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15515>	2022-03-23 15:50:41 +00:00
Alyssa Rosenzweig	0ac9841809	pan/va: Allow omitting staging registers It's not usually valid, but sr_count == 0 is encodable and used for the non-RETURN variant of ATOM1. Allow dis/assembling this syntax. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15515>	2022-03-23 15:50:41 +00:00
Alyssa Rosenzweig	e6ca668d45	pan/va: Allow forcing staging flags to read-write Required for the correct encoding of atomics. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15515>	2022-03-23 15:50:41 +00:00
Alyssa Rosenzweig	d2fb6879a2	panfrost: Process scissor state earlier Otherwise, if batch->scissor_culls_everything is set for a single draw, every draw after it in the batch will be skipped because the new scissor/viewport state will never be processed. Process scissor state early in draw_vbo to fix this interaction. We do need to be careful: setting something on the batch can only happen when we've decided on a batch. If we have to select a fresh batch due to too many draws, that must happen first. This is pretty clear in the code but worth noting for the diff. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reported-by: Icecream95 <ixn@disroot.org> Reviewed-by: Icecream95 <ixn@disroot.org> Fixes: `79356b2e` ("panfrost: Skip rasterizer discard draws without side effects") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5839 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6136 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15365>	2022-03-22 19:44:40 +00:00
Alyssa Rosenzweig	0cbe4dd4c4	pan/bi: Use bi_dontcare for ZS_EMIT This is more portable and avoids special casing. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:06 +00:00
Alyssa Rosenzweig	1b934d5962	pan/bi: Emit arch-specific code for bi_dontcare We use bi_dontcare() to specify any encoding where we don't care about the value, with a preference for power-efficient encodings. On Bifrost, a (possibly nonexistant) FAU read is the best encoding. On Valhall, that encoding doesn't exist so just use a zero. That should be good enough in practice. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:06 +00:00
Alyssa Rosenzweig	222d17fc67	pan/bi: Model Valhall action on bi_instr Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:06 +00:00
Alyssa Rosenzweig	38625af010	pan/bi: Add Valhall-specific zero builder When emitting code during or after register allocation, we need to be able to emit constants without running the constant->{LUT, move, uniform} pass running after. In particular, we need to access the constant 0 to implement spill code. Add a Valhall-specific zero for this purpose. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:06 +00:00
Alyssa Rosenzweig	666b714a37	pan/bi: Don't analyze helper reqs in !frag shaders Waste of time, and possibly invalid too. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:06 +00:00
Alyssa Rosenzweig	a16163a9fd	pan/bi: Print Valhall-specific FAU indices We'll emit these shortly, prepare the printer. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:06 +00:00
Alyssa Rosenzweig	32ca920023	pan/bi: Use vertex/instance ID helpers Enables portability to Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	1e37113ede	pan/bi: Add helpers to get vertex/instance ID These are preloaded in different places across Bifrost and Valhall. Abstract that away so code using the builder isn't littered with "is Valhall?" checks. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	76a09b8cd3	pan/va: Fix ST_CVT definitions They are basicallly just STORE with an extra source and the memory access modifier in a different place. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	9b7a45e3dc	pan/va: Align error messages in disassembler tests Makes it easier to spot the difference, less eye scanning. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	3a4b864197	pan/va: Add missing .auto32 register format Clipped to .auto for consistency with Bifrost (and the existing IR). Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	263c5ef194	pan/va: Add LEA_ATTR_IMM instruction Encoded like LEA_TEX_IMM. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	459c6ac23f	pan/va: Model LEA_TEX_IMM more accurately The unknown field is a descriptor type, which we model as an opcode2 since it's a fixed constant. This allows us to disambiguate LEA_TEX_IMM from LEA_ATTR_IMM. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	17caccd15d	pan/va: Correct definition of ZS_EMIT It's a message instruction, not an ALU one... duh. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	508335c927	panfrost: Add Tiler Job to v9 XML Legacy tiling job, semantics are the same as on Midgard. Useful for blits and transform feedback. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	e635dc9ea5	panfrost: Refactor XML to permit non-IDVS jobs Tiler jobs look similar, but don't have the Allocations fields. Refactor to make this possible to express. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	53f1fa9219	panfrost: Fix definition of DCD on v9 The position and varying shader environment descriptors are additional sections of the job, rather than part of the (fragment only) DCD. This distinction matters for non-IDVS jobs. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	02f519601a	panfrost: Correct ASTC decode mode XML The narrow/wide bit was backwards. Fixes: `bfba7533c7` ("panfrost: Add Valhall Plane Descriptor XML") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	fd87135852	pan/decode: Unify tiler job handling Instead of adding a third Valhall path, let's use GenXML to unify our paths. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	a9ca751a8f	pan/decode: Handle blend arrays on Valhall Required for MRT. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Jason Ekstrand	012bfde7f3	panvk: Hook up emulated secondary command buffers Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14406>	2022-03-18 17:29:16 +00:00
Boris Brezillon	18fced0226	panvk: Refcount the descriptor set and pipeline layouts Lifetime of descriptor sets and pipeline layouts are odd. Let's refcount them so we don't end up with use-after-free patterns. That means we can't use custom allocators for those objects. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14406>	2022-03-18 17:29:16 +00:00
Jason Ekstrand	91cb714dc1	panvk: Drop GetPhysicalDeviceQueueFamilyProperties Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15459>	2022-03-18 11:19:15 -05:00
Jason Ekstrand	a929bafc77	panvk: Only implement Get*MemoryRequirements2 The runtime code will provide the 1.0 entrypoints for us. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15436>	2022-03-18 08:15:50 +00:00
Jason Ekstrand	bc8b30ba55	panvk: Drop QueueBindSparse Now that we've switched to the common sync/submit framework, this is implemented in runtime/vk_queue.c. We don't need to provide the stub. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15436>	2022-03-18 08:15:50 +00:00
Jason Ekstrand	2fc2ec17db	panvk: Drop BindImage/BufferMemory We already provide the 2 versions and the Vulkan runtime will map the 1.0 entrypoints to the 2 versions for us. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15436>	2022-03-18 08:15:50 +00:00
Jason Ekstrand	f9b773a417	panvk: Implement VK_KHR_copy_commands2 This is just 2 versions of all the copy/blit entrypoings. The common Vulkan runtime code will implement the 1.0 versions in terms of the 2 versions. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15436>	2022-03-18 08:15:50 +00:00
Jason Ekstrand	b573b22628	panvk: Implement VK_KHR_synchronization2 It's easier to switch to sync2 before CmdPipelineBarrier gets any more complicated. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15436>	2022-03-18 08:15:50 +00:00
Jason Ekstrand	39c395d1d2	panvk: Move core properties into their respective core structs Currently, we only support a few features from 1.1. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15436>	2022-03-18 08:15:50 +00:00
Jason Ekstrand	ff30dd11a7	panvk: Re-arrange GetPhysicalDeviceProperties2 Put the 1.0 properties and limits first. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15436>	2022-03-18 08:15:50 +00:00
Jason Ekstrand	34139d9f51	panvk: Add a 1.3 features struct The only thing that gets pulled into this is private data. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15436>	2022-03-18 08:15:50 +00:00
Jason Ekstrand	dd03dba7fd	panvk: Re-arrange GetPhysicalDeviceFeatures2 Put the 1.0 features at top followed by 1.1 and then 1.2. For filling out the actual 1.1 and 1.2 structs, use the helpers. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15436>	2022-03-18 08:15:50 +00:00
Alyssa Rosenzweig	b0faf422b7	pan/va: Use XML for special FAU page 0 Now all special FAU handling is unified, which makes both assembler and disassembler considerably nicer. This adds some more special FAU indices from page 0 that were previously missing, allowing them to be assembled and disasembled. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15364>	2022-03-17 18:06:17 +00:00
Alyssa Rosenzweig	31a171d92d	pan/va: Use boring names for FAU special pages 1/3 There's no magic underlying interpretation, be.. uniform. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15364>	2022-03-17 18:06:17 +00:00
Alyssa Rosenzweig	76159ee379	pan/va: Remove immediate modes from XML/asm Now replaced by inference in the assembler, as they should be. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15364>	2022-03-17 18:06:17 +00:00
Alyssa Rosenzweig	81498f1538	pan/va: Use 64-bit special FAU for pages 1 and 3 This aligns with how the hardware actually sees special FAU. Also fix the names while we're at it. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15364>	2022-03-17 18:06:17 +00:00
Alyssa Rosenzweig	139867cb43	pan/va: Rename imm_mode -> fau_page In accordance with new information on the hardware. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15364>	2022-03-17 18:06:17 +00:00
Alyssa Rosenzweig	3bd1401075	pan/va: Handle uniforms from page 1 Like Bifrost, Valhall can access 2x as many fast acess uniforms as previously thought. However, on Valhall this requires using the pagination mechanism. Support this in the dis/assembler. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15364>	2022-03-17 18:06:17 +00:00
Alyssa Rosenzweig	cf43a1cc58	pan/va: Rewrite FAU handling in dis/assembler FAU pages do not need to be specified explicitly in the assembly. Rather, they should be inferred by the assembler by the instructions used. Rewrite the code handling this in alignment with new information about the hardware. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15364>	2022-03-17 18:06:17 +00:00
Alyssa Rosenzweig	95b7908d2d	pan/va: Fix BLEND instruction There's only one staging register, the other register is just offset due to the Msg64 source. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15364>	2022-03-17 18:06:17 +00:00
Alyssa Rosenzweig	c7e8e8b319	pan/va: Handle 64-bit sources in message instrs These take up two slots, reading an aligned register pair, even though they are in a 32-bit instruction. Required to correctly model BLEND. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15364>	2022-03-17 18:06:17 +00:00
Alyssa Rosenzweig	9878469833	pan/va: Add start property to source The bit position of sources is more complicated than (8 * index). Make it a part of the Valhall reflection information. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15364>	2022-03-17 18:06:17 +00:00
Alyssa Rosenzweig	5a759140b0	pan/va: Fix typo in BLEND text Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15364>	2022-03-17 18:06:17 +00:00
Jason Ekstrand	0f048c5782	panvk: Convert to the common sync/submit framework Acked-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15296>	2022-03-17 16:22:10 +00:00
Jason Ekstrand	acbb0d86f7	panvk: Implement VK_EXT_vertex_attribute_divisor Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15295>	2022-03-16 09:58:55 -05:00
Boris Brezillon	58587c32cb	panvk: Implement indexed rendering Since we can do 8-bit index buffers, also advertise VK_EXT_type_uint8. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15295>	2022-03-16 09:58:46 -05:00
Boris Brezillon	a08b695386	panvk: Fix per-instance attribute handling We were assuming per-vertex attributes so far. Let's extend the logic to support per-instance attributes with or without custom instance divisors. Now that we've got it all hooked up, we can enable VK_EXT_vertex_attribute_divisor. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15295>	2022-03-16 09:57:51 -05:00
Boris Brezillon	417cf3d35e	panvk: No-op zero-vertex draws Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15295>	2022-03-16 09:57:51 -05:00
Jason Ekstrand	864f3c0ee0	panvk: Fix SSBO buffer offsets Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15398>	2022-03-16 01:27:28 +00:00
Jason Ekstrand	6214cce382	panvk: Require 16B alignment for UBOs This is required by MALI_UNIFORM_BUFFER. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15398>	2022-03-16 01:27:28 +00:00
Daniel Stone	2221e3d487	ci: Add new Panfrost G52 skip This started failing for some reason, has been seen in https://gitlab.freedesktop.org/mesa/mesa/-/jobs/19776551 and others. Signed-off-by: Daniel Stone <daniels@collabora.com> Suggested-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15396>	2022-03-16 00:41:46 +00:00
Jason Ekstrand	5a0e081e00	panvk: Use vk_shader_module_to_nir() Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15305>	2022-03-15 23:13:16 +00:00
Jason Ekstrand	0c871d89ae	panvk: Use vk_shader_module Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15305>	2022-03-15 23:13:16 +00:00
Jason Ekstrand	2170c3ac63	panvk: Use the correct integer border colors Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15382>	2022-03-14 23:04:09 +00:00
Jason Ekstrand	8dd917b9f0	panvk: Rework texture, sampler, and image binding index calculation This adds a new get_resource_deref_binding helper which decodes a resource deref into set, binding, and index. To make texture instructions nicer, the index can optionally be split into immediate and SSA parts. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15382>	2022-03-14 23:04:09 +00:00
Jason Ekstrand	17e79b044e	panvk: Skip ZS setup if there is no depth/stencil attachment Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15382>	2022-03-14 23:04:09 +00:00
Jason Ekstrand	4f843db0a1	panvk: Make panvk_image_view derive from vk_image_view Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15382>	2022-03-14 23:04:09 +00:00
Jason Ekstrand	1865b7a93e	panvk: Make panvk_image derive from vk_image Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15382>	2022-03-14 23:04:09 +00:00
Boris Brezillon	ff8aa15fa0	panvk: Add support for texel buffers Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15334>	2022-03-14 18:06:01 +00:00
Boris Brezillon	9dc8382de8	panvk: Add a dummy sampler for NIR tex operations that don't take one In the NIR domain, some texture operations don't require a sampler, but Bifrost/Midgard always want one. Let's add a dummy sampler to handle that case. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15334>	2022-03-14 18:06:01 +00:00
Jason Ekstrand	a35e721162	panvk: Stop advertising Vulkan 1.1 We're nowhere close to even having Vulkan 1.0 working yet, there's no reason to get too excited about 1.1. It just means piles more test crashes for features we're claiming to support but don't. If we want to enable more tests, we can turn on the extensions for those features once we actually have them working. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15334>	2022-03-14 18:06:01 +00:00
Alyssa Rosenzweig	8c6cb15200	panfrost: Handle txs of cube arrays We need to divide the array length by 6 to match what OpenGL expects. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15254>	2022-03-12 17:34:01 +00:00
Alyssa Rosenzweig	53f1e57ee7	pan/bi: Handle non-2D arrays Handle arrays generically by using the last component of the coordinate source as the array index. That works for both 2D arrays and cube arrays, fixing cube arrays. Cube arrays were already handled correctly in core Panfrost code. This code path is not tested in dEQP-GLES31 without exposing OES_cube_map_array, which depends on OES_geometry_shader, which we don't have. Yet we do expose PIPE_CAP_CUBE_ARRAY, so ARB_cube_map_array is exposed. Disabling PIPE_CAP_CUBE_ARRAY would be an easy band-aid fix, but it's easy enough to handle correctly. dEQP-GLES31 passes with a hack enabling OES_cube_map_array [without geometry shaders]. Also fixes 1D arrays on Bifrost for the same reasons. Fixes: `70d6c5675d` ("pan/bi: Emit TEXC with builder") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15254>	2022-03-12 17:34:01 +00:00
Alyssa Rosenzweig	1f97819fbe	panfrost: Emulate GL_CLAMP on Bifrost Hardware support was removed with Midgard. Use mesa/st to emulate GL_CLAMP with nir_lower_tex automatically (the Zink lowering), and disable GL_MIRROR_CLAMP which isn't lowered correctly. Fixes texwrap Piglit tests on G52. Fixes: `f9ceab7b23` ("panfrost: Fix CLAMP wrap mode") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15253>	2022-03-12 17:16:00 +00:00
Jason Ekstrand	65db6b0e7c	bifrost: Constant fold after lower_explicit_io nir_lower_explicit_io generates mul+add chains even for constants. One round of constant folding should get rid of these. This fixes all of the dEQP-VK.glsl.conversions.* tests on panvk. GoGoGoGo'd-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15349>	2022-03-12 03:51:54 +00:00
Jason Ekstrand	1aa120b10f	bifrost: Handle nir_op_frexp* and nir_op_ldexp Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15352>	2022-03-12 02:27:02 +00:00
Jason Ekstrand	d2a09f3dd3	bifrost: Implement fine and coarse derivatives We leave the undecorated ops as fine so we don't disturb panfrost. For coarse derivatives, we use a lane ID of 0 for the first lane and 1 or 2 for the second depending on axis. This ensures that coarse derivatives are quad-uniform. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15352>	2022-03-12 02:27:02 +00:00
Jason Ekstrand	83010c57a6	bifrost: Simplify derivatives a bit Instead of two magic ternary operations, define a new `axis` temporary which is 1 for X and 2 for Y. Then define everything else in terms of this variable. In particular, the mask operation we do on LANE_ID is a mask so it makes more sense to use ~axis than 1/2 but in the other order. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15352>	2022-03-12 02:27:02 +00:00
Jason Ekstrand	c043e93ca5	bifrost: Lower usub_borrow Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15352>	2022-03-12 02:27:02 +00:00
Tapani Pälli	adea096029	ci: update various ci result files Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12936>	2022-03-11 09:58:28 +00:00
Boris Brezillon	02906baba7	panvk: Implement vkCmdDispatch() Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15248>	2022-03-09 04:50:41 +00:00
Boris Brezillon	1056b3e39e	panvk: Add support for storage image Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15248>	2022-03-09 04:50:41 +00:00
Boris Brezillon	eca0a0e29e	panvk: Move dummy attribute buffer emission out of emit_{attribute,varying}_bufs So we can easily add entries after the standard varyings/attributes (like image descriptors in the attribute array). Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15248>	2022-03-09 04:50:41 +00:00
Boris Brezillon	0e7e1b64fc	panvk: Add support for storage/uniform buffers with dynamic offsets The idea of storing offsets in a separate UBO and lowering accesses to UBOs/SSBOs with a dynamic offset was not great. Let's apply the offset at UBO/SSBO emission time instead. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15248>	2022-03-09 04:50:41 +00:00
Boris Brezillon	13378e4129	panvk: Support creation of compute pipelines Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15248>	2022-03-09 04:50:41 +00:00
Boris Brezillon	b05ffc9fec	panvk: Add support for storage buffers Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15248>	2022-03-09 04:50:41 +00:00
Boris Brezillon	9a20467cf2	panvk: Add support for push constants Push constants are stored in a separate UBO. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15248>	2022-03-09 04:50:41 +00:00
Timur Kristóf	64acec0ef9	nir: Fix lowering terminology of compute system values: "from"->"to". This is to match other NIR terminology. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15103>	2022-03-08 17:36:31 +00:00
Jason Ekstrand	541f08cd4c	panvk: Non-destructively stub GetRenderAreaGranularity Don't crash. Just print a warning and return 1x1. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15285>	2022-03-08 17:03:47 +00:00
Jason Ekstrand	afe2ef9afc	panvk: Advertise zero sparse format properties This is the correct implementation when you don't support sparse and fixes piles of CTS crashes. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15285>	2022-03-08 17:03:47 +00:00
Jason Ekstrand	8feed1f114	panvk: Advertise VK_KHR_get_physical_device_properties2 All the entrypooints are already implemented and a bunch of Vulkan CTS tests assume this extension exists. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15285>	2022-03-08 17:03:47 +00:00
Icecream95	ba18799ca1	pan/bi: Don't assign slots for the blend second source Another instruction might write to the second source, and then an INSTR_INVALID_ENC fault will be raised because the tuple will write to and read from the register at the same time. Fixes: `795638767d` ("pan/bi: Use fused dual source blending") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:55:00 -05:00
Icecream95	66a604efb5	pan/bi: Skip psuedo sources in ISA.xml The second staging register source for the +BLEND instruction should not be packed nor disassembled, so skip it when include_pseudo is not set. Fixes: `795638767d` ("pan/bi: Use fused dual source blending") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:55:00 -05:00
Icecream95	9d4441c71a	panfrost: Fix ubo_mask calculation BITSET_MASK returns ~0 when given an input of zero, when we need it to return 0 instead. Fixes shaders with only sysvals but no UBOs when push constants are disabled. This breaks when 31 or 32 UBOs are used, but PAN_MAX_CONST_BUFFERS is currently set to 16. Fixes: `c246af0dd8` ("panfrost: Only upload UBOs when needed") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:55:00 -05:00
Icecream95	24101d944b	pan/bi: Add documentation for bifrost_nir_lower_store_component Taken from the commit that introduced the function, `95458c4033` ("pan/bi: Lower stores with component != 0"). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:55:00 -05:00
Icecream95	42caddcf6b	pan/bi: Make disassembler build reproducibly Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:55:00 -05:00
Icecream95	d6c431c2e3	panfrost: Re-emit descriptors after resource shadowing This could be made slightly more efficient by only setting the dirty state that is needed, but eventually you reach a point where it's cheaper to re-emit everything than work out what can or can't be kept. Fixes rendering issues in Duckstation. Fixes: `cd2c1ef9da` ("panfrost: Dirty track textures/samplers") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:54:58 -05:00
Icecream95	cb8c47b15e	pan/bi: Check dependencies of both destinations of instructions TEXC can have two destinations; the value for neither of them can be used in the same bundle, so extend the code to check for this to iterate over both destinations. Fixes artefacts in the game "LIMBO". Fixes: `a303076c1a` ("pan/bi: Add bi_instr_schedulable predicate") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:50:09 -05:00
Icecream95	9e714f7455	pan/bi: Add interference between destinations Trying to write to overlapping register ranges from a single instruction is undefined behaviour, so add interference between the nodes to avoid this. Hit in a dual-texture instruction in LIMBO. Fixes: `9146bafbb4` ("pan/bi: Add dual texture fusing pass") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:50:09 -05:00
Icecream95	198cb4a77a	panfrost: Disable point size upper limit clamping The hardware already clamps this, there is no need to do it in the shader. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:50:09 -05:00
Icecream95	d54efebf04	panfrost: Set PIPE_CAP_QUADS_FOLLOW_PROVOKING_VERTEX_CONVENTION Fixes arb-provoking-vertex-render Piglit test. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:47:24 -05:00
Icecream95	948300da27	pan/mdg: Use util_logbase2 instead of C99 log2 log2 operates on double, we only need the integer util/ function. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:27:44 -05:00
Alyssa Rosenzweig	7bda838c56	panfrost: Push twice as many uniforms The limit for Bifrost is twice as high as previously thought -- the limit is 64 slots of FAU, not 64 words. Each slot is 2 words. We can push twice as much, saving a considerable number of cycles in some cases. total instructions in shared programs: 2454260 -> 2431502 (-0.93%) instructions in affected programs: 845176 -> 822418 (-2.69%) helped: 3376 HURT: 304 helped stats (abs) min: 1.0 max: 60.0 x̄: 7.92 x̃: 6 helped stats (rel) min: 0.13% max: 45.45% x̄: 4.60% x̃: 4.11% HURT stats (abs) min: 1.0 max: 60.0 x̄: 13.06 x̃: 8 HURT stats (rel) min: 0.16% max: 35.09% x̄: 7.58% x̃: 6.52% 95% mean confidence interval for instructions value: -6.50 -5.87 95% mean confidence interval for instructions %-change: -3.75% -3.43% Instructions are helped. total tuples in shared programs: 1963383 -> 1951560 (-0.60%) tuples in affected programs: 638622 -> 626799 (-1.85%) helped: 2959 HURT: 573 helped stats (abs) min: 1.0 max: 54.0 x̄: 5.61 x̃: 4 helped stats (rel) min: 0.15% max: 28.57% x̄: 3.61% x̃: 3.12% HURT stats (abs) min: 1.0 max: 50.0 x̄: 8.35 x̃: 6 HURT stats (rel) min: 0.25% max: 27.34% x̄: 6.24% x̃: 4.92% 95% mean confidence interval for tuples value: -3.61 -3.08 95% mean confidence interval for tuples %-change: -2.18% -1.85% Tuples are helped. total clauses in shared programs: 387817 -> 365111 (-5.85%) clauses in affected programs: 135527 -> 112821 (-16.75%) helped: 3489 HURT: 25 helped stats (abs) min: 1.0 max: 43.0 x̄: 6.52 x̃: 5 helped stats (rel) min: 0.82% max: 58.33% x̄: 17.48% x̃: 15.87% HURT stats (abs) min: 1.0 max: 3.0 x̄: 1.56 x̃: 1 HURT stats (rel) min: 2.94% max: 11.11% x̄: 6.87% x̃: 6.67% 95% mean confidence interval for clauses value: -6.67 -6.26 95% mean confidence interval for clauses %-change: -17.65% -16.96% Clauses are helped. total cycles in shared programs: 201842.21 -> 168754.04 (-16.39%) cycles in affected programs: 84035.50 -> 50947.33 (-39.37%) helped: 3547 HURT: 136 helped stats (abs) min: 0.041665999999999315 max: 54.0 x̄: 9.33 x̃: 8 helped stats (rel) min: 0.17% max: 80.77% x̄: 36.10% x̃: 36.84% HURT stats (abs) min: 0.041665999999999315 max: 1.0 x̄: 0.12 x̃: 0 HURT stats (rel) min: 0.18% max: 12.24% x̄: 1.18% x̃: 0.61% 95% mean confidence interval for cycles value: -9.26 -8.71 95% mean confidence interval for cycles %-change: -35.34% -34.11% Cycles are helped. total arith in shared programs: 74918.46 -> 75022.62 (0.14%) arith in affected programs: 22471.04 -> 22575.21 (0.46%) helped: 1571 HURT: 1492 helped stats (abs) min: 0.041665999999999315 max: 1.125 x̄: 0.17 x̃: 0 helped stats (rel) min: 0.17% max: 40.00% x̄: 2.50% x̃: 1.96% HURT stats (abs) min: 0.041665999999999315 max: 2.375 x̄: 0.25 x̃: 0 HURT stats (rel) min: 0.16% max: 100.00% x̄: 5.35% x̃: 2.37% 95% mean confidence interval for arith value: 0.02 0.05 95% mean confidence interval for arith %-change: 1.08% 1.56% Arith are HURT. total ldst in shared programs: 174812 -> 137889 (-21.12%) ldst in affected programs: 81319 -> 44396 (-45.41%) helped: 3722 HURT: 0 helped stats (abs) min: 1.0 max: 62.0 x̄: 9.92 x̃: 8 helped stats (rel) min: 1.82% max: 100.00% x̄: 47.18% x̃: 43.75% 95% mean confidence interval for ldst value: -10.20 -9.64 95% mean confidence interval for ldst %-change: -47.97% -46.39% Ldst are helped. total quadwords in shared programs: 1757124 -> 1714130 (-2.45%) quadwords in affected programs: 584065 -> 541071 (-7.36%) helped: 3474 HURT: 173 helped stats (abs) min: 1.0 max: 90.0 x̄: 12.66 x̃: 9 helped stats (rel) min: 0.26% max: 34.18% x̄: 8.78% x̃: 8.33% HURT stats (abs) min: 1.0 max: 26.0 x̄: 5.76 x̃: 4 HURT stats (rel) min: 0.45% max: 20.66% x̄: 4.48% x̃: 2.63% 95% mean confidence interval for quadwords value: -12.21 -11.37 95% mean confidence interval for quadwords %-change: -8.36% -7.95% Quadwords are helped. total threads in shared programs: 52898 -> 53142 (0.46%) threads in affected programs: 262 -> 506 (93.13%) helped: 250 HURT: 6 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 100.00% max: 100.00% x̄: 100.00% x̃: 100.00% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 50.00% max: 50.00% x̄: 50.00% x̃: 50.00% 95% mean confidence interval for threads value: 0.92 0.99 95% mean confidence interval for threads %-change: 93.69% 99.28% Threads are helped. total spills in shared programs: 161 -> 107 (-33.54%) spills in affected programs: 54 -> 0 helped: 27 HURT: 0 total fills in shared programs: 1386 -> 796 (-42.57%) fills in affected programs: 590 -> 0 helped: 27 HURT: 0 Fixes: `d4dccea0ba` ("panfrost: Add UBO push data structure") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15239>	2022-03-04 15:22:04 +00:00
Alyssa Rosenzweig	e7cfe18099	pan/bi: Run CSE after lowering FAU Lowering FAU can add moves from uniforms. If a uniform is moved out to a register mulitple times in a basic block, these moves can be CSE'd, saving instructions at the cost of register pressure. 854 shaders in my shader-db are helped on cycle count (average 2.94% reduction in cycles). Only 9 shaders have hurt thread count, and there is no change in spills or fills. Overall, this seems to be a win. Prevents instruction count regressions from the next commit. total instructions in shared programs: 2454423 -> 2444690 (-0.40%) instructions in affected programs: 386274 -> 376541 (-2.52%) helped: 2105 HURT: 0 helped stats (abs) min: 1.0 max: 116.0 x̄: 4.62 x̃: 2 helped stats (rel) min: 0.04% max: 27.27% x̄: 3.64% x̃: 1.92% 95% mean confidence interval for instructions value: -4.91 -4.33 95% mean confidence interval for instructions %-change: -3.83% -3.45% Instructions are helped. total tuples in shared programs: 1963534 -> 1957106 (-0.33%) tuples in affected programs: 233562 -> 227134 (-2.75%) helped: 1491 HURT: 117 helped stats (abs) min: 1.0 max: 63.0 x̄: 4.44 x̃: 2 helped stats (rel) min: 0.04% max: 24.53% x̄: 4.39% x̃: 2.59% HURT stats (abs) min: 1.0 max: 5.0 x̄: 1.61 x̃: 1 HURT stats (rel) min: 0.18% max: 8.33% x̄: 1.44% x̃: 1.05% 95% mean confidence interval for tuples value: -4.28 -3.71 95% mean confidence interval for tuples %-change: -4.20% -3.73% Tuples are helped. total clauses in shared programs: 387848 -> 387079 (-0.20%) clauses in affected programs: 13718 -> 12949 (-5.61%) helped: 583 HURT: 60 helped stats (abs) min: 1.0 max: 16.0 x̄: 1.42 x̃: 1 helped stats (rel) min: 1.11% max: 25.00% x̄: 8.28% x̃: 6.67% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 0.86% max: 20.00% x̄: 4.58% x̃: 4.00% 95% mean confidence interval for clauses value: -1.29 -1.10 95% mean confidence interval for clauses %-change: -7.57% -6.58% Clauses are helped. total cycles in shared programs: 201866.21 -> 201682.92 (-0.09%) cycles in affected programs: 6241.79 -> 6058.50 (-2.94%) helped: 952 HURT: 98 helped stats (abs) min: 0.04166399999999726 max: 2.625 x̄: 0.20 x̃: 0 helped stats (rel) min: 0.12% max: 26.00% x̄: 4.05% x̃: 2.38% HURT stats (abs) min: 0.041665999999999315 max: 0.16666700000000034 x̄: 0.07 x̃: 0 HURT stats (rel) min: 0.18% max: 8.70% x̄: 1.60% x̃: 1.43% 95% mean confidence interval for cycles value: -0.19 -0.16 95% mean confidence interval for cycles %-change: -3.80% -3.24% Cycles are helped. total arith in shared programs: 74924.00 -> 74660.12 (-0.35%) arith in affected programs: 9303.67 -> 9039.79 (-2.84%) helped: 1513 HURT: 118 helped stats (abs) min: 0.04166399999999726 max: 2.625 x̄: 0.18 x̃: 0 helped stats (rel) min: 0.07% max: 33.33% x̄: 4.68% x̃: 2.67% HURT stats (abs) min: 0.041665999999999315 max: 0.16666800000000137 x̄: 0.07 x̃: 0 HURT stats (rel) min: 0.18% max: 8.70% x̄: 1.55% x̃: 1.37% 95% mean confidence interval for arith value: -0.17 -0.15 95% mean confidence interval for arith %-change: -4.48% -3.98% Arith are helped. total quadwords in shared programs: 1757254 -> 1751978 (-0.30%) quadwords in affected programs: 197399 -> 192123 (-2.67%) helped: 1464 HURT: 110 helped stats (abs) min: 1.0 max: 51.0 x̄: 3.73 x̃: 2 helped stats (rel) min: 0.04% max: 21.95% x̄: 4.16% x̃: 2.52% HURT stats (abs) min: 1.0 max: 7.0 x̄: 1.71 x̃: 1 HURT stats (rel) min: 0.21% max: 13.04% x̄: 1.65% x̃: 0.93% 95% mean confidence interval for quadwords value: -3.58 -3.13 95% mean confidence interval for quadwords %-change: -3.97% -3.53% Quadwords are helped. total threads in shared programs: 52899 -> 52890 (-0.02%) threads in affected programs: 18 -> 9 (-50.00%) helped: 0 HURT: 9 HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 50.00% max: 50.00% x̄: 50.00% x̃: 50.00% 95% mean confidence interval for threads value: -1.00 -1.00 95% mean confidence interval for threads %-change: -50.00% -50.00% Threads are HURT. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15239>	2022-03-04 15:22:04 +00:00
Alyssa Rosenzweig	b48236ea3e	pan/bi: Add arithmetic flag to RSHIFT ops Models ops like ARSHIFT_OR.i32 on Valhall without adding piles of new instructions to the IR. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:44 +00:00
Alyssa Rosenzweig	0b0e74ae82	pan/bi: Extend LD_TILE with a register format Required for Valhall. NIR has the information anyway, pass it along. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:44 +00:00
Alyssa Rosenzweig	74107abfc6	pan/bi: Add BRANCHZI instruction Technically this is just JUMP on Valhall, but the semantic is an indirect branch based on comparing with zero. It can also be used as a conservative branch (like BRANCHC), but this isn't modeled. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:44 +00:00
Alyssa Rosenzweig	3dc2095b07	pan/bi: Model LD_BUFFER instructions We'll use these to read from UBOs on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:44 +00:00
Alyssa Rosenzweig	5796777889	pan/bi: Model offset for LOAD/STORE Needed to model the immediate offset on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	039bb4e68c	pan/bi: Model pos/vary segments in STORE instructions For Bifrost, we model load/store segments, for example for thread local storage. We need something similar on Valhall -- access modifiers. There are four access modifiers on Valhall, controlling memory subsystem optimizations for the access: none: Nothing may be assumed. Corresponds to "global". istream: Internally streaming within the GPU. Corresponds to "pos", as it's used for position stores. estream: Externally streaming outside the GPU. Corresponds to "vary", as it's used for varying stores. force: Force access in discarded threads. Corresponds to "tl", as it's required for correct behaviour of helper invocations that use the stack. If these access modifiers end up being useful outside these fixed purposes, we may need to rework this part of the IR. For now, this should suffice. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	aaa39f0e60	pan/bi: Model LEA_BUF_IMM in the IR Required for varying stores in malloced IDVS jobs on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	eba9ef4c25	pan/bi: Add LD_VAR_BUF_IMM.f16/f32 instructions For use on Valhall with memory-allocated IDVS jobs. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	48a398bf5b	pan/bi: Generalize I->table for Valhall Can be reused for resource tables in a natural way. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	20891e75c2	pan/bi: Extend BLEND to take a register format Needed on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	3c817ed511	pan/bi: Model Valhall texture instructions These act like a TEXC+immediate. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	234d3efb9b	pan/va: Add memory access modifier to LOADs Might be required for correct spilling in some circumstances. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	79aa4af078	pan/va: Remap "store segment" to "memory access" For now, the difference does not matter. However it's better to model the actual hardware behaviour, rather than isomorphic driver behaviour, when we can do so. So fix the names. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	254a641290	pan/va: Fix LEA_BUF_IMM definition Technically the table is folded, too; the 0xD refers to table 61. But this instruction is more general than previously thought. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	7c798fbb9f	pan/va: Fix definitions of LD_VAR_BUF_IMM So close! However, LD_VAR_IMM is something else -- Bifrost-style varying interpolation, without a hardware buffer. For ES3, we'll need to support both. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	c62836661e	pan/va: Add TEX_GATHER instruction Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	65cb3af38a	pan/va: Add TEX_DUAL instruction Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	47b70ca584	pan/va: Add modifiers required for gathers Mostly isomorphic to Bifrost-style gathers. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	431e7e54a6	pan/va: Handle force_enum differing from name Needed for secondary register width, for dual texturing. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	c3eee6327c	pan/va: Add missing copyright notice Minor. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:23 +00:00
Alyssa Rosenzweig	eda00fd39d	pan/bi: Extract INSTRUCTION_CASE macro Useful across multiple optimization tests. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:23 +00:00
Alyssa Rosenzweig	ffde1f359b	pan/bi: Adapt bi_lower_branch for Valhall Disable the Bifrost optimization; it's not portable. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:23 +00:00
Alyssa Rosenzweig	f3937d9874	pan/bi: Trade off registers/threads on Valhall It's only v6 that's missing this feature. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:23 +00:00
Alyssa Rosenzweig	7637502c8d	pan/bi: Add BI_SUBGROUP_SUBGROUP16 option Valhall uses 16-wide warps. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	ec9c1f8fa6	pan/bi: Wire Valhall disassembler into compiler Useful when we grow Valhall support (soon!) Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	31e991d801	pan/bi: Support standalone Valhall disassembly $ bifrost_compiler disasm --gpu=G78 foo.bin Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	600f689a98	pan/bi: Allow CSE of preloaded registers Needed to CSE `LEA_VARY` in varying shaders on Valhall. No shader-db changes on Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	3154df232b	pan/bi: Use a progress loop for constant folding Needed to fold the dependent patterns produced by texture instructions during NIR->Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	e5582710f3	pan/bi: Mark NOP as having no destinations More accurate and more convenient. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	2604c65174	panfrost: Unify barrier+helper handling These are unified in the hardware, so let's unify them in pan_shader_info. Hoisting this logic to pan_shader.c avoids the need to duplicate this logic for Midgard/Bifrost (RSD packing) and Valhall (SPD packing). Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	30d0c2e390	panfrost: Set texel_interleave on Valhall Instead of specifying the tiling on the texture descriptor, Valhall specifies it on the plane descriptor. There is a new flag on the texture descriptor specifying only whether the planes are interleaved or not. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	407bda4d8c	panfrost: Adapt estimate_texture_payload_size to Valhall The plane descriptor is larger than earlier surface descriptors, so we need to be somewhat careful here. This removes a memory micro-optimization in the interest of simplifying the code. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	469a36d071	panfrost: Don't emit compression tags on Valhall Unnecessary. To avoid even more #if/#endif soup, merge the v4, v5-v8, and v9 paths together -- by returning 0 as the compression tag on v4 or v9. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	36a2b8d039	panfrost: Add PAN_MESA_DEBUG=dump option To dump all graphics memory via the new pandecode_dump_mappings function(), since for Valhall I have to do this often enough to warrant a dynamic flag. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	631c01fc42	panfrost: Add an enum for Valhall resource tables Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	f3c971e0fe	panfrost: Make Divisor E an integer on v9 For consistency with previous architecture's XML files. Logically this is an 1-bit unsigned integer, not a boolean. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	b19afaf307	panfrost: Clarify contains descriptor? bit Influences cache prefetching. I don't see a good reason to put anything other than descriptors inside shader resources, meaning always setting this bit is appropriate (at least for GLES). Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	1df6b0d7e2	panfrost: Remove Invalidate Cache from Valhall job header Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	217e038289	panfrost: Add Tile Render Order enum to fragment jobs Not sure what this is needed for yet. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	52ccd21e6b	panfrost: Extend SPD size There is software-defined state at the end we don't need. Model in the XML for correct behaviour. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Daniel Stone	d07df90bf4	Revert "CI: Disable Panfrost T720 jobs" This reverts commit 35209b94a6c7d88fb67b6446fda8f8daf556c911. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15191>	2022-02-28 14:08:34 +00:00
Daniel Stone	bd55458304	Revert "CI: Disable panfrost-t760" This reverts commit b9b444e0b8bc318cea2a93ec04b0a383c444180e. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15191>	2022-02-28 14:08:34 +00:00
Adrián Larumbe	6d0824abcc	panfrost: fix segfault in pandecode The structure wrapped around the rb tree node was being freed, but not the node itself, which caused a segmentation fault when accessing its parent node. Add rb tree node remove call to fix it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15188>	2022-02-28 12:40:32 +00:00
Daniel Stone	af0f9a31b3	CI: Disable Panfrost T720 jobs Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15189>	2022-02-28 07:08:38 +00:00
Daniel Stone	114e48e923	CI: Disable panfrost-t760 The DUTs are extremely tempremental for some reason. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15186>	2022-02-27 20:26:17 +00:00
Alyssa Rosenzweig	216da26b3f	pan/va: Add TEX_FETCH assembler case Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15182>	2022-02-25 21:53:03 +00:00
Alyssa Rosenzweig	794836daf0	pan/va: Handle sr_write_count in the disassembler Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15182>	2022-02-25 21:53:02 +00:00
Alyssa Rosenzweig	eee6dad0c9	pan/va: Fix definitions of TEX_SINGLE and TEX_FETCH Fix the definitions of the basic texturing instructions. In particular, a register format and a write mask were previously missing, as well as incorrect handling of staging registers. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15182>	2022-02-25 21:53:02 +00:00
Alyssa Rosenzweig	a58807fa95	pan/va: Don't use staging index as a sideband It would cause us to get incorrect disassembly when the syntax is flipped. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15182>	2022-02-25 21:53:02 +00:00
Alyssa Rosenzweig	49a4cc6af8	pan/va: Handle extended staging counts in assembler Needed for texturing. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15182>	2022-02-25 21:53:02 +00:00
Alyssa Rosenzweig	142ba9fea6	pan/va: Allow forcing enums for 1-bit modifiers Ocassionally the 0 value has a meaningful value that's not meaningfully default, so we want an enum to encode both possible states. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15182>	2022-02-25 21:53:02 +00:00
Alyssa Rosenzweig	20fce28dfd	pan/va: Add MUX.v2i16 and MUX.v4i8 opcodes Basically identical to MUX.i32, slight differences in opcode and swizzling only. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15182>	2022-02-25 21:53:02 +00:00
Alyssa Rosenzweig	97f8fad37b	pan/va: Remove incorrect TEX test cases Not close enough to salvage; TEX is going to be redefined. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15182>	2022-02-25 21:53:02 +00:00
Alyssa Rosenzweig	988d5aae74	panfrost: Flush resources when shadowing When we shadow a resource, the backing BO is changed; as such, existing references to the resource become invalid. So batches accessing the resource need to be flushed (or otherwise have their references invalidated). The wrong behaviour change (not flushing) was introduced when we started tracking resources instead of BOs. The issue manifested as a severe performance regression in glmark2's -bbuffer test, particular the subdata subtest. The issue is magnified on slow CPUs; without the fix, the test becomes completely CPU bound Relevant glmark2 -bbuffer test from 43fps to 84fps. Apparently, this causes functional issues too -- this performance-minded change also fixes a few piglits. Fixes: `cecb889481` ("panfrost: Do tracking of resources, not BOs") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reported-by: Chris Healy <cphealy@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13502>	2022-02-24 23:11:20 +00:00
Alyssa Rosenzweig	5536852d60	panfrost: Handle NULL samplers Fixes a NULL dereference in Piglit fp-fragment-position, getting the test to pass. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13203>	2022-02-24 22:48:30 +00:00
Alyssa Rosenzweig	4b2769493e	panfrost/ci: Update xfails list These tests seem to be passing now. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13203>	2022-02-24 22:48:30 +00:00
Alyssa Rosenzweig	cd2a4cc47c	pan/bi: Unit test message preloading optimization To make sure it is applied in the cases we expect it to be, to avoid code generation regressions. Functional regressions are expected to be caught by integration-testing, so that is not focused on here. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9438>	2022-02-24 14:13:21 -05:00
Alyssa Rosenzweig	eb1479bda2	pan/bi: Support message preloading Preload LD_VAR_IMM or VAR_TEX instructions in the first block of fragment shaders on v7. Preloaded messages write to fixed registers; when replacing instructions we insert moves from the registers at the start of the program and hope coalescing goes to town. (Admittedly we don't do any coalescing yet...) The extra moves hurts instruction count in some cases; the win for cycle count should cancel this out. When we get smarter copy prop or RA, those moves should go away anyway. This optimization may hurt register pressure by extending the lifetime of up to eight registers written in the first block. This is expected to be acceptable: on a large shader-db, there are no additional spills/fills, and only two shaders are hurt on thread count. This optimization only applies to v7, as the hardware was not introduced on v6 and was removed for Valhall. total instructions in shared programs: 2451624 -> 2454286 (0.11%) instructions in affected programs: 909046 -> 911708 (0.29%) helped: 4719 HURT: 3341 helped stats (abs) min: 1.0 max: 10.0 x̄: 1.49 x̃: 1 helped stats (rel) min: 0.08% max: 33.33% x̄: 6.79% x̃: 3.92% HURT stats (abs) min: 1.0 max: 50.0 x̄: 2.90 x̃: 2 HURT stats (rel) min: 0.12% max: 66.67% x̄: 6.39% x̃: 3.45% 95% mean confidence interval for instructions value: 0.27 0.39 95% mean confidence interval for instructions %-change: -1.55% -1.11% Inconclusive result (value mean confidence interval and %-change mean confidence interval disagree). total tuples in shared programs: 1969529 -> 1963429 (-0.31%) tuples in affected programs: 601327 -> 595227 (-1.01%) helped: 5907 HURT: 1297 helped stats (abs) min: 1.0 max: 8.0 x̄: 1.41 x̃: 1 helped stats (rel) min: 0.07% max: 33.33% x̄: 7.25% x̃: 5.26% HURT stats (abs) min: 1.0 max: 40.0 x̄: 1.73 x̃: 1 HURT stats (rel) min: 0.16% max: 31.75% x̄: 3.38% x̃: 2.02% 95% mean confidence interval for tuples value: -0.88 -0.81 95% mean confidence interval for tuples %-change: -5.52% -5.15% Tuples are helped. total clauses in shared programs: 401689 -> 387830 (-3.45%) clauses in affected programs: 136944 -> 123085 (-10.12%) helped: 8427 HURT: 4 helped stats (abs) min: 1.0 max: 4.0 x̄: 1.65 x̃: 2 helped stats (rel) min: 0.49% max: 50.00% x̄: 19.88% x̃: 18.18% HURT stats (abs) min: 1.0 max: 4.0 x̄: 2.50 x̃: 2 HURT stats (rel) min: 1.96% max: 19.05% x̄: 14.18% x̃: 17.86% 95% mean confidence interval for clauses value: -1.66 -1.63 95% mean confidence interval for clauses %-change: -20.15% -19.58% Clauses are helped. total cycles in shared programs: 202735.83 -> 201862.21 (-0.43%) cycles in affected programs: 16295.46 -> 15421.83 (-5.36%) helped: 3349 HURT: 1962 helped stats (abs) min: 0.041665999999999315 max: 1.0 x̄: 0.32 x̃: 0 helped stats (rel) min: 0.24% max: 100.00% x̄: 40.77% x̃: 33.33% HURT stats (abs) min: 0.041665999999999315 max: 1.5833329999999997 x̄: 0.10 x̃: 0 HURT stats (rel) min: 0.09% max: 31.40% x̄: 2.95% x̃: 1.94% 95% mean confidence interval for cycles value: -0.17 -0.16 95% mean confidence interval for cycles %-change: -25.48% -23.76% Cycles are helped. total arith in shared programs: 74665.50 -> 74920.00 (0.34%) arith in affected programs: 16059.92 -> 16314.42 (1.58%) helped: 860 HURT: 3409 helped stats (abs) min: 0.041665999999999315 max: 0.25 x̄: 0.06 x̃: 0 helped stats (rel) min: 0.24% max: 37.50% x̄: 4.73% x̃: 2.56% HURT stats (abs) min: 0.041665999999999315 max: 1.5833329999999997 x̄: 0.09 x̃: 0 HURT stats (rel) min: 0.09% max: 100.00% x̄: 8.99% x̃: 4.21% 95% mean confidence interval for arith value: 0.06 0.06 95% mean confidence interval for arith %-change: 5.83% 6.62% Arith are HURT. total texture in shared programs: 13083.50 -> 11877 (-9.22%) texture in affected programs: 1663 -> 456.50 (-72.55%) helped: 2377 HURT: 3 helped stats (abs) min: 0.5 max: 1.0 x̄: 0.51 x̃: 0 helped stats (rel) min: 6.25% max: 100.00% x̄: 87.12% x̃: 100.00% HURT stats (abs) min: 0.5 max: 0.5 x̄: 0.50 x̃: 0 HURT stats (rel) min: 0.00% max: 25.00% x̄: 16.67% x̃: 25.00% 95% mean confidence interval for texture value: -0.51 -0.50 95% mean confidence interval for texture %-change: -87.98% -86.00% Texture are helped. total vary in shared programs: 10220.62 -> 4183.88 (-59.06%) vary in affected programs: 10126.50 -> 4089.75 (-59.61%) helped: 8538 HURT: 0 helped stats (abs) min: 0.125 max: 1.0 x̄: 0.71 x̃: 0 helped stats (rel) min: 7.14% max: 100.00% x̄: 74.74% x̃: 87.50% 95% mean confidence interval for vary value: -0.71 -0.70 95% mean confidence interval for vary %-change: -75.32% -74.16% Vary are helped. total quadwords in shared programs: 1766717 -> 1757161 (-0.54%) quadwords in affected programs: 553801 -> 544245 (-1.73%) helped: 6760 HURT: 711 helped stats (abs) min: 1.0 max: 11.0 x̄: 1.58 x̃: 1 helped stats (rel) min: 0.09% max: 29.41% x̄: 5.31% x̃: 4.84% HURT stats (abs) min: 1.0 max: 33.0 x̄: 1.54 x̃: 1 HURT stats (rel) min: 0.10% max: 31.13% x̄: 2.53% x̃: 1.61% 95% mean confidence interval for quadwords value: -1.31 -1.25 95% mean confidence interval for quadwords %-change: -4.67% -4.46% Quadwords are helped. total threads in shared programs: 52899 -> 52897 (<.01%) threads in affected programs: 4 -> 2 (-50.00%) helped: 0 HURT: 2 total preloads in shared programs: 0 -> 116492 preloads in affected programs: 0 -> 116492 helped: 0 HURT: 8604 HURT stats (abs) min: 2.0 max: 24.0 x̄: 13.54 x̃: 14 HURT stats (rel) min: 0.00% max: 0.00% x̄: 0.00% x̃: 0.00% 95% mean confidence interval for preloads value: 13.45 13.63 95% mean confidence interval for preloads %-change: 0.00% 0.00% Preloads are HURT. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9438>	2022-02-24 14:09:14 -05:00
Alyssa Rosenzweig	c8437cd415	pan/bi: Account for message preloading in shaderdb If a message-passing instruction like LD_VAR is preloaded, it will no longer be counted in the shader cycle counts. Add a special message preload counter that approximates the cost of preloading, so this information doesn't get a lost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9438>	2022-02-24 12:51:04 -05:00
Alyssa Rosenzweig	19541dc8c8	pan/bi: Add bi_before_nonempty_block helper To be used in the message preloading pass. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9438>	2022-02-24 12:51:04 -05:00
Alyssa Rosenzweig	6618697e0e	panfrost: Pack message preloads from compiler Include full message preload descriptors in the RSD on v7, and do the obvious packing for fragment shader message preloads. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9438>	2022-02-24 12:51:04 -05:00
Alyssa Rosenzweig	bd06a26662	panfrost: Add an unpacked message preload struct The compiler will soon produce preloaded messages, but it should not pack them itself, as this would require depending on GenXML or handcoding bitfields / bit packs in the compiler. Instead, add a struct encoding the unpacked form of the message, used as ABI between the compiler and the common driver. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9438>	2022-02-24 12:51:04 -05:00
Alyssa Rosenzweig	2d0c4973dc	panfrost: Remove Message Preload Descriptor from v6.xml It is an anachronism, as this descriptor was added in v7 and, seemingly, removed immediately after. Good work. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9438>	2022-02-24 12:50:58 -05:00
Alyssa Rosenzweig	43bbe367ea	panfrost/ci: Move T860 flake to skip Actually an xfail but occassionally passes and gives us no new information, only noise. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Suggested-and-acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15154>	2022-02-24 14:51:31 +00:00
Alyssa Rosenzweig	5c07f7c427	panfrost/ci: Move T720 flakes to skips Doesn't seem like these will be resolved anytime soon.. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Suggested-and-acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15154>	2022-02-24 14:51:31 +00:00
Tomeu Vizoso	c0695bb473	ci: Allow disabling the whole of the Collabora farm Add a global-level variable that allows disabling all jobs that would have gone to the Collabora lab, to be used in case of outages. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15150>	2022-02-24 07:33:45 +01:00
Alyssa Rosenzweig	6b2eda6b72	pan/bi: Reorder pushed uniforms to avoid moves On Bifrost and Valhall, push uniforms are loaded into Fast Access Uniform Random Access Memory (FAU-RAM). FAU-RAM is organized as an array of 64-bit slots. A given tuple (Bifrost) or instruction (Valhall) may access at most a single 64-bit slot. If an instruction requires uniforms from multiple 64-bit slots, a uniform-to-register move must be inserted to avoid the hazard. However, if an instruction requires a pair of 32-bit uniforms from the same 64-bit slot, no move is required. To reduce the number of moves we emit, this commit adds an optimization pass that reorders pushed uniforms, trying to group uniforms used by the same instruction. The pass works by creating a graph of pushed uniforms, where edges denote the "both 32-bit uniforms required by the same instruction" relationship. We perform depth-first search on this graph to find the connected components, where each connected component is a cluster of uniforms that are used together. We then select pairs of uniforms from each connected component. The remaining unpaired uniforms (from components of odd sizes) are paired together arbitrarily. In principle, we should weight the graph by number of occurences and choose pairs that maximize the total selected edge weight. This is left for future work, as it is nontrivial -- selecting these edges optimally appears to be NP-hard at first blush. Implementation note: As position and varying shaders share FAU on Bifrost, extra care is taken with a `push_offset` shader stage info parameter that ensures varying shaders do not reorder uniforms selected by the previous position shader. total instructions in shared programs: 2503343 -> 2451758 (-2.06%) instructions in affected programs: 1553309 -> 1501724 (-3.32%) helped: 14256 HURT: 8 helped stats (abs) min: 1.0 max: 80.0 x̄: 3.62 x̃: 3 helped stats (rel) min: 0.06% max: 36.36% x̄: 7.31% x̃: 6.67% HURT stats (abs) min: 1.0 max: 2.0 x̄: 1.38 x̃: 1 HURT stats (rel) min: 1.30% max: 12.50% x̄: 4.99% x̃: 3.85% 95% mean confidence interval for instructions value: -3.66 -3.58 95% mean confidence interval for instructions %-change: -7.41% -7.20% Instructions are helped. total tuples in shared programs: 2008399 -> 1969627 (-1.93%) tuples in affected programs: 1146344 -> 1107572 (-3.38%) helped: 12867 HURT: 147 helped stats (abs) min: 1.0 max: 61.0 x̄: 3.03 x̃: 2 helped stats (rel) min: 0.17% max: 42.86% x̄: 6.79% x̃: 4.65% HURT stats (abs) min: 1.0 max: 3.0 x̄: 1.20 x̃: 1 HURT stats (rel) min: 0.29% max: 20.00% x̄: 2.12% x̃: 1.19% 95% mean confidence interval for tuples value: -3.03 -2.93 95% mean confidence interval for tuples %-change: -6.82% -6.57% Tuples are helped. total clauses in shared programs: 408005 -> 401708 (-1.54%) clauses in affected programs: 90760 -> 84463 (-6.94%) helped: 6006 HURT: 164 helped stats (abs) min: 1.0 max: 9.0 x̄: 1.08 x̃: 1 helped stats (rel) min: 0.45% max: 33.33% x̄: 12.44% x̃: 14.29% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 1.64% max: 25.00% x̄: 9.81% x̃: 5.26% 95% mean confidence interval for clauses value: -1.03 -1.01 95% mean confidence interval for clauses %-change: -12.03% -11.66% Clauses are helped. total cycles in shared programs: 203308.37 -> 202737.83 (-0.28%) cycles in affected programs: 19264.71 -> 18694.17 (-2.96%) helped: 3024 HURT: 41 helped stats (abs) min: 0.041665999999999315 max: 2.5416680000000014 x̄: 0.19 x̃: 0 helped stats (rel) min: 0.17% max: 33.33% x̄: 3.83% x̃: 2.83% HURT stats (abs) min: 0.041665999999999315 max: 0.125 x̄: 0.06 x̃: 0 HURT stats (rel) min: 0.30% max: 5.88% x̄: 1.41% x̃: 0.93% 95% mean confidence interval for cycles value: -0.19 -0.18 95% mean confidence interval for cycles %-change: -3.89% -3.64% Cycles are helped. total arith in shared programs: 76265.67 -> 74669.25 (-2.09%) arith in affected programs: 45001.50 -> 43405.08 (-3.55%) helped: 12945 HURT: 97 helped stats (abs) min: 0.041665999999999315 max: 2.5416680000000014 x̄: 0.12 x̃: 0 helped stats (rel) min: 0.17% max: 50.00% x̄: 8.06% x̃: 4.88% HURT stats (abs) min: 0.041665999999999315 max: 0.125 x̄: 0.05 x̃: 0 HURT stats (rel) min: 0.21% max: 33.33% x̄: 2.16% x̃: 0.96% 95% mean confidence interval for arith value: -0.12 -0.12 95% mean confidence interval for arith %-change: -8.16% -7.81% Arith are helped. total quadwords in shared programs: 1796563 -> 1766803 (-1.66%) quadwords in affected programs: 948830 -> 919070 (-3.14%) helped: 12078 HURT: 219 helped stats (abs) min: 1.0 max: 42.0 x̄: 2.49 x̃: 2 helped stats (rel) min: 0.10% max: 33.33% x̄: 5.57% x̃: 5.26% HURT stats (abs) min: 1.0 max: 4.0 x̄: 1.21 x̃: 1 HURT stats (rel) min: 0.33% max: 6.67% x̄: 2.00% x̃: 1.14% 95% mean confidence interval for quadwords value: -2.46 -2.38 95% mean confidence interval for quadwords %-change: -5.52% -5.36% Quadwords are helped. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14163>	2022-02-24 01:35:33 +00:00
Alyssa Rosenzweig	31b7ebcbc7	pan/mdg: Fix overflow in intra-bundle interference There are up to 4 instructions in the latter stage (if a branch is included), not 3. Bump the limit to fix memory corruption. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reported-by: Icecream95 <ixn@disroot.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15147>	2022-02-23 20:42:33 +00:00
Alyssa Rosenzweig	abb7f04674	panfrost: Inline pan_emit_sfbd_tiler Easier to read, the common code was already common. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	910d4f8245	panfrost: Remove pan_emit_fbd thunking Use a common interface. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	099d61c95d	panfrost: Use txl instead of tex in the blitter We always blit from a particular level, so it's a waste to compute the LOD. This corresponds to a simple texture instruction with implement 0 LOD, which is the optimal texturing path on Bifrost -- it maps to TEXS_2D but does not require helper invocations. Functional change on Bifrost: Blit shaders no longer set .computed_lod or shader_contains_barrier. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	5b1a00c565	panfrost: Inline pan_blit_emit_dcd Easier to follow the logic without having a million arguments passed around. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	c9784c9512	panfrost: Decouple tiler job and DCD emit We can share the "emit quad" logic, even though the DCDs differ. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	1eb3dbafdb	panfrost: Set defaults for deprecated DCD fields There are always set to true. Don't pollute the driver code with them, make their existence a local detail to pre-Valhall XML and that's it. Functional change: "four components per vertex" is now set on vertex job DCDs. This should be a no-op. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	bd3d7e33b6	panfrost: Use pan_shader_prepare_rsd in blitter This reduces code duplication and will ease Valhall porting. Functional changes on v7: * Shader contains barrier is now set (perf loss, fixed later in series) * Shader register allocation is now set (perf win) * Point sprite inverted, no-op for blit shaders Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	6fc81f163e	pan/mdg: Fix partial execution mode names cont -> skip, last -> kill, and fix the special case handling. It's just an enum. Makes the disassembly easier to read and closer to Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	2e86767370	pan/bi: Add BIFROST_MESA_DEBUG=nosb option To disable the new scoreboarding optimizations when debugging. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	c81c022e66	pan/bi: Implement basic scoreboarding pass Extend our existing bi_scoreboard infrastructure with a simple data flow analysis pass that calculates which dependency slots need waiting. We still lack a heuristic for selecting dependency slots. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	8f25d88d90	pan/bi: Print scoreboarding state Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	6ad9a7f650	pan/bi: Add scoreboard state to IR To a limited degree, scoreboarding must be global, so add the data structures for tracking this to the IR. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	91c02893d8	pan/bi: Clean up nits in liveness analysis Fix minor silly things. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	734a8bdc5d	pan/bi: Use bi_exit_block The "generic" one is a vestige of Midgard. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	75406a561f	pan/bi: Add bi_{start, exit}_block helpers Useful for data flow analysis. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	e5423bb129	pan/bi: Do not cull post-RA staging writes Bifrost post-RA dead code elimination can cull the destinations of regular ALU instructions, by weakening from a register write to a temporary write. However, there is no way to suppress staging writes, so culling the destinations will result in invalid code generation. Fixes a regression in dEQP-GLES3.functional.shaders.switch.switch_in_for_loop_static_vertex with scoreboarding. The root cause there is the backend dead code elimination not being sufficiently aggressive in the presence of control flow. Usually this does not matter, since the backend optimizations are intended to be local with global optimizations happening in NIR. Unfortunately, our implementation of IDVS hits this hard. That will need to be optimized (probably by specializing IDVS shaders in NIR instead of the backend). In the mean time, let's fix the actual bug affecting scoreboarding. No shader-db changes. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	87d46f40c8	pan/bi: Cull DTSEL_IMM dests in post-RA DCE They are useless (given the semantics of DTSEL_IMM) and complicate scoreboarding. Just remove them in the pass that removes all the other silly register destinations. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	956b969616	pan/bi: Clarify requirement for barriers Barriers need to wait on all outstanding messages. This is more of an API requirement than a hardware requirement, but it's still an invariant the scoreboarding pass must respect. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	606ac8d61e	pan/bi: Enable nir_opt_shrink_vectors total instructions in shared programs: 1939513 -> 1935815 (-0.19%) instructions in affected programs: 809066 -> 805368 (-0.46%) helped: 3195 HURT: 865 helped stats (abs) min: 1.0 max: 15.0 x̄: 1.99 x̃: 1 helped stats (rel) min: 0.10% max: 25.00% x̄: 2.26% x̃: 1.28% HURT stats (abs) min: 1.0 max: 22.0 x̄: 3.09 x̃: 2 HURT stats (rel) min: 0.10% max: 83.33% x̄: 2.67% x̃: 1.39% 95% mean confidence interval for instructions value: -1.00 -0.82 95% mean confidence interval for instructions %-change: -1.34% -1.08% Instructions are helped. total tuples in shared programs: 1523194 -> 1521789 (-0.09%) tuples in affected programs: 745526 -> 744121 (-0.19%) helped: 2947 HURT: 1844 helped stats (abs) min: 1.0 max: 18.0 x̄: 2.06 x̃: 1 helped stats (rel) min: 0.15% max: 25.00% x̄: 2.65% x̃: 1.59% HURT stats (abs) min: 1.0 max: 29.0 x̄: 2.54 x̃: 1 HURT stats (rel) min: 0.09% max: 40.00% x̄: 2.32% x̃: 1.52% 95% mean confidence interval for tuples value: -0.39 -0.20 95% mean confidence interval for tuples %-change: -0.85% -0.62% Tuples are helped. total clauses in shared programs: 329158 -> 325350 (-1.16%) clauses in affected programs: 111654 -> 107846 (-3.41%) helped: 2787 HURT: 498 helped stats (abs) min: 1.0 max: 17.0 x̄: 1.57 x̃: 1 helped stats (rel) min: 0.76% max: 40.00% x̄: 6.92% x̃: 5.26% HURT stats (abs) min: 1.0 max: 3.0 x̄: 1.14 x̃: 1 HURT stats (rel) min: 0.87% max: 50.00% x̄: 4.73% x̃: 3.77% 95% mean confidence interval for clauses value: -1.21 -1.10 95% mean confidence interval for clauses %-change: -5.39% -4.93% Clauses are helped. total cycles in shared programs: 172084.50 -> 166827.62 (-3.05%) cycles in affected programs: 74698.83 -> 69441.96 (-7.04%) helped: 3706 HURT: 568 helped stats (abs) min: 0.041665999999999315 max: 19.0 x̄: 1.44 x̃: 1 helped stats (rel) min: 0.24% max: 75.00% x̄: 9.48% x̃: 6.90% HURT stats (abs) min: 0.041665999999999315 max: 1.0 x̄: 0.15 x̃: 0 HURT stats (rel) min: 0.25% max: 50.00% x̄: 2.21% x̃: 1.42% 95% mean confidence interval for cycles value: -1.28 -1.18 95% mean confidence interval for cycles %-change: -8.18% -7.67% Cycles are helped. total arith in shared programs: 57145.04 -> 57211.37 (0.12%) arith in affected programs: 27595.12 -> 27661.46 (0.24%) helped: 1933 HURT: 2259 helped stats (abs) min: 0.041665999999999315 max: 0.75 x̄: 0.09 x̃: 0 helped stats (rel) min: 0.16% max: 33.33% x̄: 2.74% x̃: 1.52% HURT stats (abs) min: 0.04166399999999726 max: 1.3333329999999997 x̄: 0.11 x̃: 0 HURT stats (rel) min: 0.10% max: 100.00% x̄: 2.79% x̃: 1.62% 95% mean confidence interval for arith value: 0.01 0.02 95% mean confidence interval for arith %-change: 0.07% 0.40% Arith are HURT. total texture in shared programs: 12857 -> 12857 (0.00%) texture in affected programs: 0 -> 0 helped: 0 HURT: 0 total vary in shared programs: 11157.75 -> 10222 (-8.39%) vary in affected programs: 5643 -> 4707.25 (-16.58%) helped: 3196 HURT: 0 helped stats (abs) min: 0.125 max: 1.875 x̄: 0.29 x̃: 0 helped stats (rel) min: 2.78% max: 75.00% x̄: 18.49% x̃: 15.00% 95% mean confidence interval for vary value: -0.30 -0.29 95% mean confidence interval for vary %-change: -18.88% -18.11% Vary are helped. total ldst in shared programs: 146420 -> 140270 (-4.20%) ldst in affected programs: 66027 -> 59877 (-9.31%) helped: 2942 HURT: 10 helped stats (abs) min: 1.0 max: 19.0 x̄: 2.09 x̃: 2 helped stats (rel) min: 0.90% max: 100.00% x̄: 16.81% x̃: 8.33% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 2.22% max: 50.00% x̄: 13.03% x̃: 3.33% 95% mean confidence interval for ldst value: -2.15 -2.02 95% mean confidence interval for ldst %-change: -17.53% -15.89% Ldst are helped. total quadwords in shared programs: 1398329 -> 1392117 (-0.44%) quadwords in affected programs: 704641 -> 698429 (-0.88%) helped: 3677 HURT: 1299 helped stats (abs) min: 1.0 max: 26.0 x̄: 2.51 x̃: 1 helped stats (rel) min: 0.10% max: 26.92% x̄: 2.64% x̃: 1.89% HURT stats (abs) min: 1.0 max: 20.0 x̄: 2.31 x̃: 1 HURT stats (rel) min: 0.11% max: 44.44% x̄: 2.34% x̃: 1.55% 95% mean confidence interval for quadwords value: -1.34 -1.16 95% mean confidence interval for quadwords %-change: -1.44% -1.25% Quadwords are helped. total threads in shared programs: 35234 -> 35311 (0.22%) threads in affected programs: 119 -> 196 (64.71%) helped: 91 HURT: 14 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 100.00% max: 100.00% x̄: 100.00% x̃: 100.00% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 50.00% max: 50.00% x̄: 50.00% x̃: 50.00% 95% mean confidence interval for threads value: 0.60 0.87 95% mean confidence interval for threads %-change: 70.08% 89.92% Threads are helped. total loops in shared programs: 125 -> 125 (0.00%) loops in affected programs: 0 -> 0 helped: 0 HURT: 0 total spills in shared programs: 149 -> 144 (-3.36%) spills in affected programs: 22 -> 17 (-22.73%) helped: 1 HURT: 0 total fills in shared programs: 966 -> 956 (-1.04%) fills in affected programs: 44 -> 34 (-22.73%) helped: 1 HURT: 0 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15090>	2022-02-22 15:21:09 +00:00
Alyssa Rosenzweig	e0e63c2a8e	pan/bi: Specialize IDVS in NIR It's a bit more code, but it's needed to chew through control flow since we don't have a backend version of dead_cf. Results are really good, meaning I really screwed this up the first time around (hence the cc mesa-stable). total instructions in shared programs: 1963576 -> 1939513 (-1.23%) instructions in affected programs: 671053 -> 646990 (-3.59%) helped: 4436 HURT: 729 helped stats (abs) min: 1.0 max: 43.0 x̄: 5.75 x̃: 6 helped stats (rel) min: 0.21% max: 100.00% x̄: 6.47% x̃: 5.17% HURT stats (abs) min: 1.0 max: 22.0 x̄: 2.01 x̃: 1 HURT stats (rel) min: 0.50% max: 50.00% x̄: 10.45% x̃: 9.09% 95% mean confidence interval for instructions value: -4.77 -4.55 95% mean confidence interval for instructions %-change: -4.36% -3.80% Instructions are helped. total tuples in shared programs: 1533335 -> 1523194 (-0.66%) tuples in affected programs: 483167 -> 473026 (-2.10%) helped: 3414 HURT: 1288 helped stats (abs) min: 1.0 max: 20.0 x̄: 3.73 x̃: 2 helped stats (rel) min: 0.27% max: 100.00% x̄: 4.87% x̃: 3.03% HURT stats (abs) min: 1.0 max: 19.0 x̄: 2.02 x̃: 1 HURT stats (rel) min: 0.24% max: 38.10% x̄: 8.10% x̃: 5.88% 95% mean confidence interval for tuples value: -2.28 -2.03 95% mean confidence interval for tuples %-change: -1.62% -1.02% Tuples are helped. total clauses in shared programs: 351432 -> 329158 (-6.34%) clauses in affected programs: 142237 -> 119963 (-15.66%) helped: 5328 HURT: 3 helped stats (abs) min: 1.0 max: 43.0 x̄: 4.18 x̃: 4 helped stats (rel) min: 0.74% max: 100.00% x̄: 19.44% x̃: 17.24% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 9.09% max: 12.50% x̄: 10.90% x̃: 11.11% 95% mean confidence interval for clauses value: -4.25 -4.11 95% mean confidence interval for clauses %-change: -19.72% -19.12% Clauses are helped. total cycles in shared programs: 202830.92 -> 172084.50 (-15.16%) cycles in affected programs: 117078.42 -> 86332 (-26.26%) helped: 5450 HURT: 1 helped stats (abs) min: 0.083333 max: 49.0 x̄: 5.64 x̃: 5 helped stats (rel) min: 1.42% max: 100.00% x̄: 27.94% x̃: 25.64% HURT stats (abs) min: 0.25 max: 0.25 x̄: 0.25 x̃: 0 HURT stats (rel) min: 2.46% max: 2.46% x̄: 2.46% x̃: 2.46% 95% mean confidence interval for cycles value: -5.74 -5.54 95% mean confidence interval for cycles %-change: -28.30% -27.58% Cycles are helped. total arith in shared programs: 57274.29 -> 57145.04 (-0.23%) arith in affected programs: 16418.33 -> 16289.08 (-0.79%) helped: 2442 HURT: 1784 helped stats (abs) min: 0.041665999999999315 max: 0.75 x̄: 0.14 x̃: 0 helped stats (rel) min: 0.23% max: 100.00% x̄: 5.51% x̃: 2.87% HURT stats (abs) min: 0.041665999999999315 max: 0.9166670000000003 x̄: 0.12 x̃: 0 HURT stats (rel) min: 0.00% max: 100.00% x̄: 25.13% x̃: 9.09% 95% mean confidence interval for arith value: -0.04 -0.03 95% mean confidence interval for arith %-change: 6.61% 8.24% Inconclusive result (value mean confidence interval and %-change mean confidence interval disagree). total texture in shared programs: 12857 -> 12857 (0.00%) texture in affected programs: 0 -> 0 helped: 0 HURT: 0 total vary in shared programs: 11157.75 -> 11157.75 (0.00%) vary in affected programs: 0 -> 0 helped: 0 HURT: 0 total ldst in shared programs: 177208 -> 146420 (-17.37%) ldst in affected programs: 117098 -> 86310 (-26.29%) helped: 5447 HURT: 0 helped stats (abs) min: 1.0 max: 49.0 x̄: 5.65 x̃: 5 helped stats (rel) min: 1.92% max: 100.00% x̄: 27.91% x̃: 25.64% 95% mean confidence interval for ldst value: -5.75 -5.55 95% mean confidence interval for ldst %-change: -28.27% -27.56% Ldst are helped. total quadwords in shared programs: 1436507 -> 1398329 (-2.66%) quadwords in affected programs: 515101 -> 476923 (-7.41%) helped: 5150 HURT: 111 helped stats (abs) min: 1.0 max: 39.0 x̄: 7.46 x̃: 6 helped stats (rel) min: 0.17% max: 100.00% x̄: 10.02% x̃: 8.24% HURT stats (abs) min: 1.0 max: 9.0 x̄: 2.01 x̃: 1 HURT stats (rel) min: 0.43% max: 21.62% x̄: 3.57% x̃: 1.94% 95% mean confidence interval for quadwords value: -7.41 -7.11 95% mean confidence interval for quadwords %-change: -9.98% -9.49% Quadwords are helped. total threads in shared programs: 35025 -> 35228 (0.58%) threads in affected programs: 218 -> 421 (93.12%) helped: 208 HURT: 5 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 100.00% max: 100.00% x̄: 100.00% x̃: 100.00% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 50.00% max: 50.00% x̄: 50.00% x̃: 50.00% 95% mean confidence interval for threads value: 0.91 0.99 95% mean confidence interval for threads %-change: 93.40% 99.55% Threads are helped. total loops in shared programs: 128 -> 125 (-2.34%) loops in affected programs: 3 -> 0 helped: 3 HURT: 0 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 100.00% max: 100.00% x̄: 100.00% x̃: 100.00% total spills in shared programs: 158 -> 149 (-5.70%) spills in affected programs: 15 -> 6 (-60.00%) helped: 9 HURT: 0 total fills in shared programs: 1133 -> 966 (-14.74%) fills in affected programs: 197 -> 30 (-84.77%) helped: 9 HURT: 0 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15090>	2022-02-22 15:21:09 +00:00
Alyssa Rosenzweig	3c1021cd1e	panvk: Use more reliable assert for UBO pushing The important thing isn't the number of words pushed, it's that there are no UBOs required for us to upload. Check that instead. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15090>	2022-02-22 15:21:09 +00:00
Alyssa Rosenzweig	e392dd8237	pan/bi: Promote MUX to CSEL in the scheduler Helps scheduling, and makes scheduling more predictable when deciding between MUX and CSEL. total tuples in shared programs: 1523328 -> 1516256 (-0.46%) tuples in affected programs: 509800 -> 502728 (-1.39%) helped: 1977 HURT: 181 helped stats (abs) min: 1.0 max: 48.0 x̄: 3.71 x̃: 2 helped stats (rel) min: 0.04% max: 14.29% x̄: 1.98% x̃: 1.28% HURT stats (abs) min: 1.0 max: 5.0 x̄: 1.43 x̃: 1 HURT stats (rel) min: 0.14% max: 7.69% x̄: 1.40% x̃: 0.70% 95% mean confidence interval for tuples value: -3.47 -3.08 95% mean confidence interval for tuples %-change: -1.79% -1.60% Tuples are helped. total clauses in shared programs: 350552 -> 349906 (-0.18%) clauses in affected programs: 34839 -> 34193 (-1.85%) helped: 570 HURT: 49 helped stats (abs) min: 1.0 max: 16.0 x̄: 1.22 x̃: 1 helped stats (rel) min: 0.67% max: 20.00% x̄: 3.26% x̃: 2.22% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 0.92% max: 16.67% x̄: 4.31% x̃: 4.17% 95% mean confidence interval for clauses value: -1.13 -0.96 95% mean confidence interval for clauses %-change: -2.95% -2.38% Clauses are helped. total cycles in shared programs: 202589.37 -> 202512.25 (-0.04%) cycles in affected programs: 7644.46 -> 7567.33 (-1.01%) helped: 771 HURT: 147 helped stats (abs) min: 0.041665999999999315 max: 1.8333360000000027 x̄: 0.11 x̃: 0 helped stats (rel) min: 0.16% max: 14.29% x̄: 2.10% x̃: 1.35% HURT stats (abs) min: 0.041665999999999315 max: 0.3333340000000007 x̄: 0.07 x̃: 0 HURT stats (rel) min: 0.24% max: 7.41% x̄: 1.49% x̃: 1.11% 95% mean confidence interval for cycles value: -0.09 -0.07 95% mean confidence interval for cycles %-change: -1.69% -1.36% Cycles are helped. total arith in shared programs: 56755.96 -> 56585.50 (-0.30%) arith in affected programs: 18746.29 -> 18575.83 (-0.91%) helped: 1605 HURT: 352 helped stats (abs) min: 0.04166399999999726 max: 1.8333360000000027 x̄: 0.12 x̃: 0 helped stats (rel) min: 0.07% max: 20.00% x̄: 1.92% x̃: 1.12% HURT stats (abs) min: 0.041665999999999315 max: 0.3333340000000007 x̄: 0.06 x̃: 0 HURT stats (rel) min: 0.17% max: 33.33% x̄: 2.09% x̃: 1.08% 95% mean confidence interval for arith value: -0.09 -0.08 95% mean confidence interval for arith %-change: -1.34% -1.07% Arith are helped. total quadwords in shared programs: 1429737 -> 1424670 (-0.35%) quadwords in affected programs: 418175 -> 413108 (-1.21%) helped: 1682 HURT: 198 helped stats (abs) min: 1.0 max: 35.0 x̄: 3.17 x̃: 2 helped stats (rel) min: 0.04% max: 13.33% x̄: 1.72% x̃: 1.29% HURT stats (abs) min: 1.0 max: 5.0 x̄: 1.38 x̃: 1 HURT stats (rel) min: 0.15% max: 7.41% x̄: 1.30% x̃: 0.92% 95% mean confidence interval for quadwords value: -2.86 -2.53 95% mean confidence interval for quadwords %-change: -1.48% -1.32% Quadwords are helped. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	a8418abd74	pan/bi: Revert "Fix load_const of 1-bit booleans" This reverts commit `29d319c767`. Now that we use nir_lower_bool_to_bitsize, we don't see 1-bit booleans anymore, so the issue this fixed doesn't apply. Actually, that issue was (in part) why I started looking into boolean handling in the first place. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	21bdee7bcc	pan/bi: Switch to lower_bool_to_bitsize Instead of ingesting 1-bit booleans and trying to force everything to be 16-bit, except when it isn't, and creating a mess in the backend... just use the NIR pass designed to select bitsize for booleans. Yes, this means we need to handle more NIR instructions, but the handling is easier and the conversion is more obvious (except for some edge cases like 16-bit vectorized b32csel). This generates noticeably better code, and the generated code will be easier to optimize. total instructions in shared programs: 90257 -> 88941 (-1.46%) instructions in affected programs: 49145 -> 47829 (-2.68%) helped: 201 HURT: 2 helped stats (abs) min: 1.0 max: 40.0 x̄: 6.57 x̃: 3 helped stats (rel) min: 0.29% max: 13.89% x̄: 2.57% x̃: 1.90% HURT stats (abs) min: 2.0 max: 2.0 x̄: 2.00 x̃: 2 HURT stats (rel) min: 2.15% max: 2.74% x̄: 2.45% x̃: 2.45% 95% mean confidence interval for instructions value: -7.71 -5.26 95% mean confidence interval for instructions %-change: -2.84% -2.20% Instructions are helped. total tuples in shared programs: 73740 -> 72922 (-1.11%) tuples in affected programs: 36564 -> 35746 (-2.24%) helped: 184 HURT: 7 helped stats (abs) min: 1.0 max: 74.0 x̄: 4.49 x̃: 2 helped stats (rel) min: 0.30% max: 16.67% x̄: 2.86% x̃: 1.89% HURT stats (abs) min: 1.0 max: 2.0 x̄: 1.29 x̃: 1 HURT stats (rel) min: 0.12% max: 12.50% x̄: 4.26% x̃: 3.33% 95% mean confidence interval for tuples value: -5.29 -3.28 95% mean confidence interval for tuples %-change: -3.06% -2.13% Tuples are helped. total clauses in shared programs: 15993 -> 15928 (-0.41%) clauses in affected programs: 2464 -> 2399 (-2.64%) helped: 35 HURT: 16 helped stats (abs) min: 1.0 max: 27.0 x̄: 2.31 x̃: 1 helped stats (rel) min: 0.49% max: 18.88% x̄: 7.63% x̃: 5.88% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 0.79% max: 6.25% x̄: 1.91% x̃: 1.01% 95% mean confidence interval for clauses value: -2.46 -0.09 95% mean confidence interval for clauses %-change: -6.38% -2.90% Clauses are helped. total cycles in shared programs: 7622.13 -> 7594.75 (-0.36%) cycles in affected programs: 1078.67 -> 1051.29 (-2.54%) helped: 103 HURT: 4 helped stats (abs) min: 0.041665999999999315 max: 3.0833319999999986 x̄: 0.27 x̃: 0 helped stats (rel) min: 0.32% max: 21.05% x̄: 3.62% x̃: 2.44% HURT stats (abs) min: 0.0416669999999999 max: 0.0833330000000001 x̄: 0.05 x̃: 0 HURT stats (rel) min: 0.13% max: 7.14% x̄: 2.94% x̃: 2.25% 95% mean confidence interval for cycles value: -0.33 -0.19 95% mean confidence interval for cycles %-change: -4.14% -2.61% Cycles are helped. total arith in shared programs: 2762.46 -> 2728.08 (-1.24%) arith in affected programs: 1550.12 -> 1515.75 (-2.22%) helped: 197 HURT: 6 helped stats (abs) min: 0.041665999999999315 max: 3.0833319999999986 x̄: 0.18 x̃: 0 helped stats (rel) min: 0.32% max: 21.05% x̄: 2.93% x̃: 1.61% HURT stats (abs) min: 0.0416669999999999 max: 0.0833330000000001 x̄: 0.06 x̃: 0 HURT stats (rel) min: 0.13% max: 20.00% x̄: 5.78% x̃: 3.37% 95% mean confidence interval for arith value: -0.21 -0.13 95% mean confidence interval for arith %-change: -3.20% -2.15% Arith are helped. total quadwords in shared programs: 68155 -> 67555 (-0.88%) quadwords in affected programs: 27944 -> 27344 (-2.15%) helped: 151 HURT: 9 helped stats (abs) min: 1.0 max: 52.0 x̄: 4.09 x̃: 3 helped stats (rel) min: 0.23% max: 12.35% x̄: 2.87% x̃: 2.17% HURT stats (abs) min: 1.0 max: 5.0 x̄: 1.89 x̃: 1 HURT stats (rel) min: 0.20% max: 6.76% x̄: 1.91% x̃: 1.13% 95% mean confidence interval for quadwords value: -4.67 -2.83 95% mean confidence interval for quadwords %-change: -2.99% -2.21% Quadwords are helped. total threads in shared programs: 2232 -> 2233 (0.04%) threads in affected programs: 1 -> 2 (100.00%) helped: 1 HURT: 0 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	a64534754d	pan/bi: Handle vectorized u2f16/i2f16 Will be useful when we enable int16, I guess... No shader-db changes. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	6a05852f5b	pan/bi: Handle trivial i2i32 lower_bool_to_bitsize can generate i2i32 from a 32-bit source, which is trivial but needs to be handled explicitly to avoid going down the 8-bit conversion path. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	f7d44a46cd	pan/bi: Optimize replication Bifrost's 16-bit support comes in the form of vectorized instructions, so when we manipulate scalars, we usually replicate to both bottom and top halves of 32-bit registers. Add an analysis pass that detects replication. Then, use that replication pass to optimize out useless swizzle instructions (by changing them to plain moves, which can be copypropped). This optimization is a slight shader-db win on its own, and allows us to transition to lower_bool_to_bitsize without regressing shader-db. total instructions in shared programs: 90323 -> 90257 (-0.07%) instructions in affected programs: 2513 -> 2447 (-2.63%) helped: 20 HURT: 0 helped stats (abs) min: 1.0 max: 16.0 x̄: 3.30 x̃: 2 helped stats (rel) min: 1.25% max: 11.11% x̄: 4.80% x̃: 4.29% 95% mean confidence interval for instructions value: -5.05 -1.55 95% mean confidence interval for instructions %-change: -6.06% -3.54% Instructions are helped. total tuples in shared programs: 73769 -> 73740 (-0.04%) tuples in affected programs: 1611 -> 1582 (-1.80%) helped: 17 HURT: 0 helped stats (abs) min: 1.0 max: 9.0 x̄: 1.71 x̃: 1 helped stats (rel) min: 0.58% max: 16.67% x̄: 4.80% x̃: 3.33% 95% mean confidence interval for tuples value: -2.70 -0.71 95% mean confidence interval for tuples %-change: -7.06% -2.54% Tuples are helped. total clauses in shared programs: 15997 -> 15993 (-0.03%) clauses in affected programs: 27 -> 23 (-14.81%) helped: 4 HURT: 0 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 7.69% max: 25.00% x̄: 18.17% x̃: 20.00% 95% mean confidence interval for clauses value: -1.00 -1.00 95% mean confidence interval for clauses %-change: -29.91% -6.44% Clauses are helped. total cycles in shared programs: 7623.13 -> 7622.13 (-0.01%) cycles in affected programs: 64.83 -> 63.83 (-1.54%) helped: 13 HURT: 0 helped stats (abs) min: 0.0416660000000002 max: 0.375 x̄: 0.08 x̃: 0 helped stats (rel) min: 1.02% max: 5.56% x̄: 2.82% x̃: 2.50% 95% mean confidence interval for cycles value: -0.13 -0.02 95% mean confidence interval for cycles %-change: -3.79% -1.85% Cycles are helped. total arith in shared programs: 2763.75 -> 2762.46 (-0.05%) arith in affected programs: 67.17 -> 65.88 (-1.92%) helped: 18 HURT: 0 helped stats (abs) min: 0.0416660000000002 max: 0.375 x̄: 0.07 x̃: 0 helped stats (rel) min: 1.02% max: 22.22% x̄: 5.68% x̃: 3.16% 95% mean confidence interval for arith value: -0.11 -0.03 95% mean confidence interval for arith %-change: -8.56% -2.80% Arith are helped. total quadwords in shared programs: 68173 -> 68155 (-0.03%) quadwords in affected programs: 1258 -> 1240 (-1.43%) helped: 14 HURT: 0 helped stats (abs) min: 1.0 max: 3.0 x̄: 1.29 x̃: 1 helped stats (rel) min: 0.42% max: 8.70% x̄: 3.88% x̃: 3.67% 95% mean confidence interval for quadwords value: -1.64 -0.93 95% mean confidence interval for quadwords %-change: -5.27% -2.49% Quadwords are helped. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	35ff537814	pan/bi: Constant fold swizzles on constants This lets us avoid generating SWZ instructions. Those instructions could be constant folded but that complicates the replication analysis introduced in the next commit. Almost no shader-db changes. quadwords HURT: shaders/glmark/1-22.shader_test MESA_SHADER_FRAGMENT: 718 -> 722 (0.56%) total quadwords in shared programs: 68169 -> 68173 (<.01%) quadwords in affected programs: 718 -> 722 (0.56%) helped: 0 HURT: 1 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	62533a6e64	pan/bi: Lower swizzles on MUX.v2i16 We'll generate this in a moment. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	8bd4976d98	pan/bi: Lower swizzles on CSEL.i32/MUX.i32 This is counter-intuitive, but required for correct operation when CSEL.i32 takes a 1-bit (stored 16-bit) boolean argument. The impedance mismatch ultimately is between CSEL.b32 (nir's bcsel, nonexistant in the hardware) and the lowering CSEL.i32. However, a similar problem exists even with MUX.i32 which lacks a good way of zero/sign-extending booleans. Cherry-picked from my Valhall branch though the issue also affects Bifrost. Fixes piglit shaders@glsl-vs-if-bool on Bifrost. Unfortunately, shader-db is quite unhappy :-( The proper fix is to use lower_bool_to_bitsize, but that can't be backported to mesa-stable. total instructions in shared programs: 157539 -> 158953 (0.90%) instructions in affected programs: 55621 -> 57035 (2.54%) helped: 2 HURT: 259 helped stats (abs) min: 2.0 max: 2.0 x̄: 2.00 x̃: 2 helped stats (rel) min: 2.11% max: 2.67% x̄: 2.39% x̃: 2.39% HURT stats (abs) min: 1.0 max: 40.0 x̄: 5.47 x̃: 2 HURT stats (rel) min: 0.36% max: 16.13% x̄: 2.55% x̃: 1.59% 95% mean confidence interval for instructions value: 4.44 6.40 95% mean confidence interval for instructions %-change: 2.21% 2.82% Instructions are HURT. total tuples in shared programs: 132322 -> 132907 (0.44%) tuples in affected programs: 31806 -> 32391 (1.84%) helped: 5 HURT: 152 helped stats (abs) min: 1.0 max: 2.0 x̄: 1.40 x̃: 1 helped stats (rel) min: 0.39% max: 3.03% x̄: 1.70% x̃: 1.61% HURT stats (abs) min: 1.0 max: 42.0 x̄: 3.89 x̃: 2 HURT stats (rel) min: 0.29% max: 18.18% x̄: 2.50% x̃: 1.79% 95% mean confidence interval for tuples value: 2.88 4.58 95% mean confidence interval for tuples %-change: 1.87% 2.85% Tuples are HURT. total clauses in shared programs: 28672 -> 28698 (0.09%) clauses in affected programs: 869 -> 895 (2.99%) helped: 1 HURT: 24 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 5.88% max: 5.88% x̄: 5.88% x̃: 5.88% HURT stats (abs) min: 1.0 max: 2.0 x̄: 1.12 x̃: 1 HURT stats (rel) min: 0.49% max: 33.33% x̄: 8.46% x̃: 3.59% 95% mean confidence interval for clauses value: 0.82 1.26 95% mean confidence interval for clauses %-change: 3.84% 11.93% Clauses are HURT. total cycles in shared programs: 15119.04 -> 15137.88 (0.12%) cycles in affected programs: 922.87 -> 941.71 (2.04%) helped: 4 HURT: 79 helped stats (abs) min: 0.0416669999999999 max: 0.0833330000000001 x̄: 0.05 x̃: 0 helped stats (rel) min: 0.40% max: 3.17% x̄: 1.57% x̃: 1.35% HURT stats (abs) min: 0.041665999999999315 max: 1.75 x̄: 0.24 x̃: 0 HURT stats (rel) min: 0.30% max: 20.00% x̄: 2.83% x̃: 2.12% 95% mean confidence interval for cycles value: 0.17 0.29 95% mean confidence interval for cycles %-change: 1.86% 3.37% Cycles are HURT. total arith in shared programs: 4922.71 -> 4947.71 (0.51%) arith in affected programs: 1423.79 -> 1448.79 (1.76%) helped: 5 HURT: 177 helped stats (abs) min: 0.0416669999999999 max: 0.0833330000000001 x̄: 0.06 x̃: 0 helped stats (rel) min: 0.40% max: 3.17% x̄: 1.82% x̃: 1.67% HURT stats (abs) min: 0.041665999999999315 max: 1.75 x̄: 0.14 x̃: 0 HURT stats (rel) min: 0.30% max: 22.22% x̄: 2.50% x̃: 1.52% 95% mean confidence interval for arith value: 0.11 0.17 95% mean confidence interval for arith %-change: 1.86% 2.90% Arith are HURT. total quadwords in shared programs: 120605 -> 120956 (0.29%) quadwords in affected programs: 26535 -> 26886 (1.32%) helped: 6 HURT: 143 helped stats (abs) min: 1.0 max: 7.0 x̄: 2.83 x̃: 1 helped stats (rel) min: 0.93% max: 6.33% x̄: 2.29% x̃: 1.71% HURT stats (abs) min: 1.0 max: 21.0 x̄: 2.57 x̃: 2 HURT stats (rel) min: 0.34% max: 13.79% x̄: 2.02% x̃: 1.22% 95% mean confidence interval for quadwords value: 1.86 2.86 95% mean confidence interval for quadwords %-change: 1.45% 2.24% Quadwords are HURT. total threads in shared programs: 4670 -> 4669 (-0.02%) threads in affected programs: 2 -> 1 (-50.00%) helped: 0 HURT: 1 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	9168dcbbc1	pan/bi: Disambiguate IDVS variants in shader-db Label IDVS variants as being MESA_SHADER_{POSITION, VARYING} stages; reserve the MESA_SHADER_VERTEX label for non-IDVS shaders. This reduces confusion where a single shader compiles to two MESA_SHADER_VERTEX shaders with different stats. While we're at it, de-vendor the blend shader stage name; these stats are internal anyway. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15086>	2022-02-19 00:01:07 +00:00
Alyssa Rosenzweig	8f4b3c749e	pan/bi: Test avoiding FADD.v2f16 hazards in scheduler There are many of them, and integration testing of the scheduler won't hit every case. Add targeted unit tests for the various scheduling hazards of this funny instruction. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15072>	2022-02-18 16:15:04 +00:00
Alyssa Rosenzweig	9d95561c93	pan/bi: Test avoiding *FADD.v2f16 hazard in optimizer This hazard exists but is obscure enough to be missed on our existing test coverage (e.g the conformance tests). Add piles of unit tests for it. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15072>	2022-02-18 16:15:04 +00:00
Alyssa Rosenzweig	24d2bdb1e0	pan/bi: Avoid *FADD.v2f16 hazard in scheduler Obscure encoding restriction. Fixes crash (assertion fail when instruction packing) in asphalt9/2659.shader_test on Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15072>	2022-02-18 16:15:04 +00:00
Alyssa Rosenzweig	8e0eb592d5	pan/bi: Avoid *FADD.v2f16 hazard in optimizer This is a very obscure encoding restriction in the Bifrost ISA. Unknown if any real apps or tests hit this, but we still need to get it right sadly. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15072>	2022-02-18 16:15:04 +00:00
Alyssa Rosenzweig	c2178d09d0	pan/va: Identify LEA_TEX_IMM table Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15069>	2022-02-18 15:51:46 +00:00
Alyssa Rosenzweig	839f15259a	pan/va: Fix conservative branch handling Mixed up lanes and conservative branch combine. Fix that. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15069>	2022-02-18 15:51:46 +00:00
Alyssa Rosenzweig	81a9c857c8	pan/va: Make subgroup 4-bits Future proofing. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15069>	2022-02-18 15:51:46 +00:00
Alyssa Rosenzweig	9e851e75de	pan/va: Fix some units Remove the todos. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15069>	2022-02-18 15:51:46 +00:00
Alyssa Rosenzweig	47733ad1e1	pan/va: Parse units from the XML We need this information for cycle counting in Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15069>	2022-02-18 15:51:46 +00:00
Alyssa Rosenzweig	239d59ecdd	panvk: Don't use UBOs for meta_clear It must always be pushed, so constructing a uniform remap table is useless. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14913>	2022-02-18 15:29:48 +00:00
Alyssa Rosenzweig	030dadb5f4	pan/mdg: Remove todo we'll probably never get to Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14888>	2022-02-18 15:04:48 +00:00
Alyssa Rosenzweig	0e726d918f	pan/mdg: Assert that we don't see unknown jumps I still don't understand why we don't see continues. But in case we do, scream loudly so it can't be fixed. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14888>	2022-02-18 15:04:48 +00:00
Alyssa Rosenzweig	8b70e7491a	pan/mdg: Delete dedicated fdot2 lowering It's just lower_alu_to_scalar total instructions in shared programs: 72542 -> 72528 (-0.02%) instructions in affected programs: 673 -> 659 (-2.08%) helped: 4 HURT: 1 helped stats (abs) min: 1.0 max: 11.0 x̄: 3.75 x̃: 1 helped stats (rel) min: 0.28% max: 6.79% x̄: 3.07% x̃: 2.60% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 3.03% max: 3.03% x̄: 3.03% x̃: 3.03% 95% mean confidence interval for instructions value: -8.65 3.05 95% mean confidence interval for instructions %-change: -6.32% 2.62% Inconclusive result (value mean confidence interval includes 0). total bundles in shared programs: 32051 -> 32036 (-0.05%) bundles in affected programs: 207 -> 192 (-7.25%) helped: 3 HURT: 0 helped stats (abs) min: 1.0 max: 10.0 x̄: 5.00 x̃: 4 helped stats (rel) min: 3.28% max: 13.89% x̄: 8.29% x̃: 7.69% total quadwords in shared programs: 56496 -> 56487 (-0.02%) quadwords in affected programs: 422 -> 413 (-2.13%) helped: 2 HURT: 0 total registers in shared programs: 5106 -> 5104 (-0.04%) registers in affected programs: 8 -> 6 (-25.00%) helped: 1 HURT: 0 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14888>	2022-02-18 15:04:47 +00:00
Alyssa Rosenzweig	28cd2c9cca	pan/mdg: Delete stray comment Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14888>	2022-02-18 15:04:47 +00:00
Alyssa Rosenzweig	eb0ef85cb6	pan/mdg: Clarify some ISA unknowns Nothing usefully new here, just trying to improve signal:noise ratio on the disassembly. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14888>	2022-02-18 15:04:47 +00:00
Alyssa Rosenzweig	3a53e46fcd	pan/mdg: Handle 8/16-bit UBO loads These will be seen by the compiler when we enable fp16 constant buffers. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14888>	2022-02-18 15:04:47 +00:00
Alyssa Rosenzweig	8d949ecd3a	pan/mdg: Model zero/sign extension for 8/16-bit loads The destinations are packed as if 32-bit even for 8/16-bit loads, so the mask needs to be constructed accordingly. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14888>	2022-02-18 15:04:47 +00:00
Alyssa Rosenzweig	ff970767a3	pan/mdg: Print optimized and scheduled shader To help identify problems across the compiler, print more forms of the shader with MIDGARD_MESA_DEBUG=shaders. Roughly matches the Bifrost compiler. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14888>	2022-02-18 15:04:47 +00:00
Alyssa Rosenzweig	b707dabbac	pan/mdg: Pull out skip_internal boolean Aligns with Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14888>	2022-02-18 15:04:47 +00:00
Alyssa Rosenzweig	294a357b33	panfrost,asahi,radv: Don't set internal=true manually nir_builder_init_simple_shader does this automatically now. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14936>	2022-02-17 23:30:46 +00:00
Ian Romanick	a01b262990	nir: Add missing dependency on nir_opcodes.py Commit `38800b38` changed nir_opcodes.py, but that doesn't seem to have triggered nir_opt_algebraic.py. The change in `75ef5991` depends on opt_algebraic lowering 16-bit versions of slt, but if opt_algebraic is not rebuilt, this may not happen. This resulted in some people seeing assertion failures in, for example, dEQP-VK.spirv_assembly.instruction.compute.float16.arithmetic_3.step, due to the backend seeing nir_op_slt that it didn't know how to handle. v2: Add nir_opcodes.py to nir_algebraic_py so that all the per-driver algebraic passes pick up the dependency too. Rename it to nir_algebraic_depends. Suggested by Emma. Closes: #6047 Fixes: `d1992255bb` ("meson: Add build Intel "anv" vulkan driver") Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15050>	2022-02-17 22:57:33 +00:00
Alyssa Rosenzweig	3697907231	panfrost: Fix Malloc Vertex definition A few missing things and a few wrong things, nothing major. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15047>	2022-02-16 22:05:55 +00:00
Alyssa Rosenzweig	1ca2358d6b	panfrost: Flesh out compute jobs Valhall has a new twist on Mali's task splitting voodoo, plus compute offset support. On Bifrost + Vulkan, compute offsets needed lowering on Bifrost (gl_GlobalID). Valhall saves a few instructions here. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15047>	2022-02-16 22:05:55 +00:00
Alyssa Rosenzweig	6d5ddf69e2	panfrost: Update Shader Environment descriptor Disambiguate the name, add a missing field, shorten a field, remove a dated comment. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15047>	2022-02-16 22:05:55 +00:00
Alyssa Rosenzweig	cf95a1c308	panfrost: Add Valhall fields to tiler descriptor Mostly to support layered rendering. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15047>	2022-02-16 22:05:55 +00:00
Alyssa Rosenzweig	c011ea6c26	panfrost: Shuffle render target AFBC for Valhall I'm not sure why this is different, although it adds support for new AFBC modifiers. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15047>	2022-02-16 22:05:55 +00:00
Alyssa Rosenzweig	1ee09eaca8	panfrost: Add Valhall additions to the framebuffer There are a few minor changes. Nothing fundamanetal. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15047>	2022-02-16 22:05:55 +00:00
Alyssa Rosenzweig	9dc30f99ae	panfrost: Flesh out the Shader Program Descriptor Only breaking change since Bifrost is that the shader contains barrier? flag is now fragment-only, meaning it is just a spawn helper threads flag. This affects compute shaders slightly. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15003>	2022-02-12 09:32:55 -05:00
Alyssa Rosenzweig	60b37424d9	panfrost: Simplify Valhall preload descriptor Honestly, we could stand to do the same to Bifrost... Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15003>	2022-02-12 09:32:55 -05:00
Alyssa Rosenzweig	1e9a35648a	panfrost: Clarify unknowns in z/stencil descriptor Depth culling and clamping. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15003>	2022-02-12 09:32:55 -05:00
Alyssa Rosenzweig	733d5f061d	panfrost: Add more fields to Attribute Descriptor More XML Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15003>	2022-02-12 09:32:35 -05:00
Alyssa Rosenzweig	b31f6a821d	panfrost: Update primitive descriptor for Valhall Contains stuff needed for layered rendering. Unfortunately, there's no more provoking vertex per draw -- ugh! That's fine for Vulkan (just don't set provokingVertexModePerPipeline), but requires inserting extra flushes on desktop OpenGL. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15003>	2022-02-12 09:32:18 -05:00
Jason Ekstrand	bda4c4f6b6	vulkan: Take a vk_command_pool in vk_command_buffer_init() Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:25 +00:00
Jason Ekstrand	2a5ae138b4	panvk: Use vk_command_pool Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:24 +00:00
Louis-Francis Ratté-Boulianne	5e263cc324	vulkan/runtime: Add a level field to vk_command_buffer Looks like 3 implementations already have that field in their private command_buffer struct, and having it at the vk_command_buffer opens the door for generic (but suboptimal) secondary command buffer support. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:24 +00:00
Emma Anholt	ef112db311	ci: Bump VK-GL-CTS to 1.3.1.0. The main thing is VK 1.3 testing, but also includes test bugfixes. The 1.3 CTS required an uprev of deqp-runner to handle a new style of test output, and that deqp-runner brings in some neat new features, too (piglit in your deqp-runner suite, and extension list checking). A bunch of VK tests got renamed, so I replaced panvk's custom test list with simple include filters on the main test list. Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> (panvk) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14920>	2022-02-08 22:16:36 +00:00
Emma Anholt	648dd03e32	ci/panfrost: Add a flake a few of us have run into in the last couple days. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14920>	2022-02-08 22:16:35 +00:00
Alyssa Rosenzweig	12446491c1	panfrost: Fix Depth Source enum As I suspected... sigh. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14935>	2022-02-08 20:27:09 +00:00
Alyssa Rosenzweig	32e58c2dd4	panfrost: Remove unused layout enums Folded into Valhall-specific plane descriptor enums. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14935>	2022-02-08 20:27:09 +00:00
Alyssa Rosenzweig	d27d46a266	panfrost: Remove some indexed formats on Valhall Block compressed formats like ETC2 are now indicated in the plane descriptor, rather than the pixel format descriptor. Various other minor formats were removed in Valhall; remove them from the XML so we don't accidentally try to use them. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14935>	2022-02-08 20:27:09 +00:00
Alyssa Rosenzweig	8c51b54bd1	panfrost: Update supported job types Remove a few that no longer exist, and rename IDVS helper to Malloc Vertex. The distinction between Malloc Vertex jobs and regular Indexed Vertex jobs is that the hardware allocates varying buffers dynamically for Malloc Vertex jobs. Regular IDVS and even legacy tiler jobs are also supported where desired. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14935>	2022-02-08 20:27:09 +00:00
Alyssa Rosenzweig	d70a48a706	panfrost: Flesh out tiler heap descriptor Merged with the Buffer descriptor, hence why it shares a type nibble. However, Bifrost uses a dedicated tiler heap descriptor, and I see no benefit to merging. So pretending it's a dedicated descriptor on Valhall too allows us to reuse the Bifrost code with no modifications. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14935>	2022-02-08 20:27:09 +00:00
Alyssa Rosenzweig	62173fa532	panfrost: Strip % in GenXML names A new Valhall enum will represent percentages, so allow that. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14935>	2022-02-08 20:27:09 +00:00
Alyssa Rosenzweig	e514f4c0b1	panfrost: Flesh out Buffer descriptor Add fields required for structured buffers. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14935>	2022-02-08 20:27:09 +00:00
Boris Brezillon	a8fbfcfbd3	pan/midg: Support 8/16 bit load/store Needed for panvk copy shaders to support 8 or 16bit formats. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14885>	2022-02-04 17:12:35 -05:00
Boris Brezillon	59ea6e2e27	pan/midg: Add a pass to lower non-logbase2 global/shared loads Compute shaders might do vec3(Xbits) loads which are translated to LD.<next_pow2(3 * X)> by the midgard compiler. This might cause out-of-bound accesses potentially leading to pagefaults if the access is at the end of a BO. One solution to avoid that (maybe not the best) is to split non-log2 loads to make sure we only read what's requested. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14885>	2022-02-04 17:12:35 -05:00
Boris Brezillon	3f9bce08e1	pan/midg: Fix swizzle packing on 64bit instructions with src-expansion + dst-shrinking In that case, the mask is specified on 32bit lanes, so we need to shift it if it's > 0x3. The expand modifier will take care of selecting the right side of the 32bit vector. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14885>	2022-02-04 17:12:35 -05:00
Boris Brezillon	da474d5d14	pan/midg: Fix the upper/lower limit on 8bit vectors If I'm correct, the lower/upper split on 8bit vectors is 8, not 4. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14885>	2022-02-04 17:12:35 -05:00

... 5 6 7 8 9 ...

4179 Commits