KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Ian Romanick	377246318a	intel/fs: Eliminate "masked" and "per slot offset" URB messages All of this information can be inferred from the sources. v2: Fix "error: unused variable 'opcode'" detected by marge-bot. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17605>	2022-07-26 17:25:19 +00:00
Ian Romanick	1b17f8fc5a	intel/fs: Make logical URB read instructions more like other logical instructions No shader-db changes on any Intel platform Fossil-db results: Tiger Lake Instructions in all programs: 156926440 -> 156926470 (+0.0%) Instructions hurt: 15 Cycles in all programs: 7513099349 -> 7513099402 (+0.0%) Cycles hurt: 15 Ice Lake and Skylake had similar results. (Ice Lake shown) Cycles in all programs: 9099036492 -> 9099036489 (-0.0%) Cycles helped: 1 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17605>	2022-07-26 17:25:19 +00:00
Ian Romanick	349a040f68	intel/fs: Make logical URB write instructions more like other logical instructions The changes to fs_visitor::validate() helped track down a place where I initially forgot to convert a message to the new sources layout. This had caused a different validation failure in dEQP-GLES31.functional.tessellation.tesscoord.triangles_equal_spacing, but this were not detected until after SENDs were lowered. Tiger Lake, Ice Lake, and Skylake had similar results. (Ice Lake shown) total instructions in shared programs: 19951145 -> 19951133 (<.01%) instructions in affected programs: 2429 -> 2417 (-0.49%) helped: 8 / HURT: 0 total cycles in shared programs: 858904152 -> 858862331 (<.01%) cycles in affected programs: 5702652 -> 5660831 (-0.73%) helped: 2138 / HURT: 1255 Broadwell total cycles in shared programs: 904869459 -> 904835501 (<.01%) cycles in affected programs: 7686744 -> 7652786 (-0.44%) helped: 2861 / HURT: 2050 Tiger Lake, Ice Lake, and Skylake had similar results. (Ice Lake shown) Instructions in all programs: 141442369 -> 141442032 (-0.0%) Instructions helped: 337 Cycles in all programs: 9099270231 -> 9099036492 (-0.0%) Cycles helped: 40661 Cycles hurt: 28606 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17605>	2022-07-26 17:25:18 +00:00
Jason Ekstrand	530de844ef	intel,anv,iris,crocus: Drop subgroup size from the shader key Use nir->info.subgroup_size instead. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17337>	2022-07-08 22:47:22 +00:00
Ian Romanick	a477587b4a	intel/fs: Add _LOGICAL versions of URB messages The lowering is currently fake. It just changes the opcode from the _LOGICAL version to the non-_LOGICAL version. v2: Remove some rebase cruft. 's/gfx8_//;s/simd8_/' in brw_instruction_name. Both suggested by Ken. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17379>	2022-07-08 19:45:34 +00:00
Marcin Ślusarz	42b551fe7f	intel/compiler: adjust task payload offsets as late as possible Otherwise passes which expect offsets to be in bytes (like brw_nir_lower_mem_access_bit_sizes, called from brw_postprocess_nir) may produce incorrect results. Fixes 64-bit load/stores in task/mesh shaders. Fixes: `c36ae42e4c` ("intel/compiler: Use nir_var_mem_task_payload") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16196>	2022-06-27 14:14:41 +00:00
Erik Faye-Lund	2a134347cb	intel/compiler: use macro for power-of-two check This will allow the use of static_assert here instead of our compiler-specific implementation. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16670>	2022-06-03 07:14:43 +00:00
Marcin Ślusarz	9acb30c8c4	intel/compiler: implement primitive shading rate for mesh Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16030>	2022-05-13 13:05:51 +00:00
Marcin Ślusarz	040062df41	intel/compiler: handle VARYING_SLOT_CULL_PRIMITIVE in mesh It's needed for gl_MeshPerPrimitiveNV[].gl_ViewportMask Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16493>	2022-05-13 09:43:02 +00:00
Marcin Ślusarz	5dace41c10	intel/compiler: invalidate metadata in brw_nir_initialize_mue New "if" blocks may have been inserted. Fixes: `bc4f8c073a` ("intel/compiler: inject MUE initialization") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15924>	2022-04-19 11:43:55 +00:00
Caio Oliveira	c32d386ce2	intel/compiler: Inline TUE map computation into TUE Input lowering Refactor since the TUE compute function is simpler now and the comments make sense being near the lowering. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15022>	2022-03-25 23:29:19 +00:00
Caio Oliveira	c36ae42e4c	intel/compiler: Use nir_var_mem_task_payload Instead of reusing the in/out slot mechanism, use a separated NIR variable mode. This will make easier later to implement staging the output in shared memory (and storing all at the end to the URB). Note to get 64-bit type support we currently rely on the brw_nir_lower_mem_access_bit_sizes() pass. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15022>	2022-03-25 23:29:19 +00:00
Ernst Sjöstrand	e5f3689cff	intel/compiler: Fix non-trivial designated initializer Not supported by GCC 7. src/compiler/nir/nir_builder_opcodes.h:14156:118: sorry, unimplemented: non-trivial designated initializers not supported src/intel/compiler/brw_mesh.cpp:515:7: note: in expansion of macro ‘nir_store_per_primitive_output’ Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Fixes: `bc4f8c073a` ("intel/compiler: inject MUE initialization") Signed-off-by: Ernst Sjöstrand <ernstp@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15360>	2022-03-14 09:56:04 +00:00
Marcin Ślusarz	8c16ce53a9	intel/compiler: handle ViewportIndex, PrimitiveID and Layer in MUE setup Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Marcin Ślusarz	bc4f8c073a	intel/compiler: inject MUE initialization Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Marcin Ślusarz	333a490e32	intel/compiler: shift mesh urb read/write window when offset is too large Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Lionel Landwerlin	96c8880900	intel/fs: fix total_scratch computation We only have a single prog_data::total_scratch for all shader variants (SIMD 8, 16, 32). Therefore we should always max the total_scratch on top of existing variant. We probably haven't run into that issue before because we compile by increasing SIMD size and higher SIMD size is more likely to spill. But for bindless shaders with return shaders, if the last return part doesn't spill, we completely ignore the previous parts' scratch computation. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15193>	2022-03-02 13:13:03 +00:00
Caio Oliveira	dc77542ed2	intel/compiler: Use pass helper in brw_nir_adjust_offset_for_arrayed_indices Also change the code to preserve certain metadata: control flow is not changed so both block indices and dominance information is preserved. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15206>	2022-03-02 10:46:23 +00:00
Caio Oliveira	7460199a2f	intel/compiler: Lower Task/Mesh I/O before SIMD specific lowering These are the same for all variants, so just lower it before cloning the nir_shader for each of them. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15019>	2022-03-01 07:35:13 +00:00
Marcin Ślusarz	b6557b80a5	intel/compiler: fix array & struct IO lowering in mesh shaders We really need offsets to be in dwords, not in vec4s. The bug manifests as random failure of func.mesh.clipdistance.5 crucible test, where stores to gl_MeshVerticesNV[x].gl_ClipDistance[4+n] actually write to gl_MeshVerticesNV[x].gl_ClipDistance[1+n]. Fixes: `1f438eb033` ("intel/compiler: Implement Mesh Output") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14997>	2022-02-14 19:18:23 +00:00
Timur Kristóf	0445802ab2	compiler: Extract num_mesh_vertices_per_primitive function. Prevent code duplication. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15005>	2022-02-14 11:13:42 +01:00
Marcin Ślusarz	24fef8f33d	intel/compiler: Use Task/Mesh InlineData for the first few push constants Replace load_mesh_global_arg_addr_intel with a more general intrinsic load_mesh_inline_data_intel, since inline data now hold both a pointer descriptor information and the first few push constants. Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14788>	2022-01-29 06:32:19 +00:00
Marcin Ślusarz	baa17865de	intel/compiler: handle gl_[Clip\|Cull]Distance in mesh shaders Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14788>	2022-01-29 06:32:19 +00:00
Dave Airlie	1352e0ba0c	mesa/*: add a shader primitive type to get away from GL types. This creates an internal shader_prim enum, I've fixed up most users to use it instead of GL types. don't store the enum in shader_info as it changes size, and confuses other things. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14605>	2022-01-19 21:54:58 +00:00
Jason Ekstrand	4fa58d27a5	intel/fs,vec4: Drop support for shader time Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14056>	2021-12-10 21:20:47 +00:00
Marcin Ślusarz	bd2c11dfa8	intel/compiler: Load draw_id from XP0 in Task/Mesh shaders Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Marcin Ślusarz	b717872e08	intel/compiler: Get mesh_global_addr from the Inline Parameter for Task/Mesh Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	1f438eb033	intel/compiler: Implement Mesh Output Use the same URB access helpers that were added for Task Output. The Arrayed I/O (per-primitive and per-vertex) is handled by applying the pitch from the MUE layout into the NIR intrinsics and including the non-arrayed offset on top of it. After that, the index src can be used directly for lowering. Because we keep around the non-arrayed offset AND the pitch is aligned, we can identify cases where the access is indirect but guaranteed to be aligned, and dispatch a single message. Added a TODO to explore that later. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	70ace2bbcd	intel/compiler: Implement Task Output and Mesh Input Implement the output written by the task workgroup and available to all the mesh workgroups dispatched from that task. We currently ignore any layout annotations (since they are not really testable) and produce a (packed) layout ourselves. The URB messages are only SIMD8, so for larger SIMDs, the functions will produce multiple messages. Making this lowering here instead of the generic lower_simd_width() since it is not just a matter of zip/unzip, e.g. the offset must be adjusted. Indirect writes/reads are implemented by handling one component at a time and using the PER_SLOT variant of the messages. Note that VK_NV_mesh_shader allows reading outputs, so add support for that as well. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	171bdd2ec6	intel/compiler: Lower Task/Mesh local_invocation_{id,index} The Invocation index is provided by the payload, so we can skip the usual math done to get to it. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	db23c41537	intel/compiler: Add backend compiler basics for Task/Mesh Task/Mesh stages are CS-like stages, and include many builtins (e.g. workgroup ID/index) and intrinsics (e.g. workgroup memory primitives) originally present only in CS. This commit add two new stages (task and mesh) that 'inherit' from CS by embedding a brw_cs_prog_data in their own prog_data structure, so that CS functionality can be easily reused. They also currently use the same helpers to select the SIMD variant to use -- that was recently added for CS. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00

31 Commits