KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Mark Janes	34a130fedf	anv: fix performance bug in INTEL_MEASURE Re-allocating the buffer object for snapshots carries a heavy penalty at run-time. When resetting a command buffer, the buffer object that is allocated for snapshots may be re-used directly on subsequent renders. Stale snapshot data will persist in the buffer object. To verify that rendering is complete, zero the final timestamp value and check that it has been written before gathering data. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16571>	2022-06-16 02:58:08 +00:00
Mark Janes	c4c096e66e	intel: relax assertion in INTEL_MEASURE It is possible that a secondary command buffer was submitted with no renders in it. For that case, no timestamp will be collected. Only verify that timestamps if the index is nonzero. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16571>	2022-06-16 02:58:08 +00:00
Mark Janes	3c53c6b247	intel: parse intel_measure environment without side effects If an application links agaist both iris and anv, they will clash when parsing the INTEL_MEASURE environment variable. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16571>	2022-06-16 02:58:08 +00:00
Emma Anholt	979f213110	ci/iris: Disable blender-demo-cube_diorama on APL. It has timed out on 3 jobs today. Fixes: `96f0944a69` ("ci/panfrost: add Blender, Warzone2100, Freedoom and Unvanquished traces") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17041>	2022-06-15 23:28:23 +00:00
Yonggang Luo	0f3064ee44	intel: using C++11 keyword thread_local Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15087>	2022-06-15 17:37:16 +00:00
Jordan Justen	81d6ae31d6	anv, iris: Enable compute engine with INTEL_COMPUTE_CLASS=1 If this environment variable is set, then a detected compute engine will be used as described in docs/envvars.rst. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14395>	2022-06-15 08:58:20 +00:00
Jordan Justen	0c90c695f5	anv, iris: Add support for I915_ENGINE_CLASS_COMPUTE Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14395>	2022-06-15 08:58:20 +00:00
Jordan Justen	b27720f2a1	anv: Move STATE_BASE_ADDRESS programming into init_common_queue_state() This is now needed following Ken's `8831cb38aa`. Ref: `8831cb38aa` ("anv: Stop updating STATE_BASE_ADDRESS on XeHP") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14395>	2022-06-15 08:58:20 +00:00
Jordan Justen	09d12e6727	anv: Add support for I915_ENGINE_CLASS_COMPUTE in init_device_state() Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14395>	2022-06-15 08:58:20 +00:00
Jordan Justen	60e29fc7c5	intel/gem: Add support for I915_ENGINE_CLASS_COMPUTE Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14395>	2022-06-15 08:58:20 +00:00
Lionel Landwerlin	b0cd7bc8c1	anv: don't expose EXT_border_color_swizzle on gfx7 This requires EXT_custom_border_color which isn't supported on gfx7. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `fbcf65bfea` ("anv: VK_EXT_border_color_swizzle") Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17025>	2022-06-15 07:30:52 +00:00
David Heidelberg	f58168850f	ci/iris: add Blender, Warzone2100, Freedoom and Unvanquished traces Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16909>	2022-06-14 11:52:45 +00:00
Mike Blumenkrantz	fbcf65bfea	anv: VK_EXT_border_color_swizzle Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16992>	2022-06-14 01:52:50 +00:00
Jason Ekstrand	ce60195ecd	anv: Use NIR_PASS(_, ...) I don't know when this was added but it's really neat and we should use it instead of NIR_PASS_V since NIR_DEBUG=print and a few validation things will work better. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17014>	2022-06-13 22:31:25 +00:00
Jason Ekstrand	844a70f439	intel/compiler: Use NIR_PASS(_, ...) I don't know when this was added but it's really neat and we should use it instead of NIR_PASS_V since NIR_DEBUG=print and a few validation things will work better. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17014>	2022-06-13 22:31:25 +00:00
Francisco Jerez	96e7e92f0d	intel/fs/xehp+: Emit scheduling fence for all NIR barriers on platforms with LSC. Tested-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15743>	2022-06-12 12:56:47 +03:00
Tapani Pälli	47773a5d7c	intel/fs: setup SEND message descriptor from nir scope This fixes many tests in following groups on DG2: dEQP-VK.memory_model.* dEQP-VK.fragment_shader_interlock.* v2: use memory scope and setup descriptor also for barriers without defined scope (Curro), use local scope and flush type none with NIR_SCOPE_NONE scope, cleanups (Lionel) v3: use LSC_FENCE_THREADGROUP for NIR_SCOPE_WORKGROUP, remove default case (Curro), use eviction if scope was not defined, use LSC_FENCE_GPU scope for vertex stage v4: use LSC_FENCE_TILE independent of stage for device scope (Curro) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15743>	2022-06-12 12:29:47 +03:00
Georg Lehmann	9ccc683973	anv: Implement VK_EXT_non_seamless_cube_map. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12730>	2022-06-10 18:31:57 +00:00
Kenneth Graunke	a8e718c7e5	intel/compiler: Fix A64 header construction with a uniform address fs_visitor::assign_curb_setup() maps UNIFORM registers to HW regs, and contains the following assert: assert(inst->src[i].stride == 0); emit_a64_oword_block_header's striding tricks run afoul of this restriction, by producing stride 1 values on a 64-bit UNIFORM source. Work around this by copying the UNIFORM value to a VGRF first. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16938>	2022-06-10 02:14:57 +00:00
Jason Ekstrand	a820dc4a8e	anv/wsi: Stop resetting semaphores This will happen automatically when they're waited on by the dummy submit in wsi_common_queue_present(). Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4037>	2022-06-10 01:33:12 +00:00
Kenneth Graunke	18b3ad5a09	intel: Set a more useful fake devinfo->gtt_size in no-hw mode With the old value, anv didn't think that the hardware supported 48-bit addresses, and hit this assert: assert(device->supports_48bit_addresses == !device->use_relocations); The new value of 1ull << 48 is the one reported on my Icelake machine. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16933>	2022-06-10 00:56:36 +00:00
Ian Romanick	65d6708bc3	anv: Remove FS executables when applying the null FS optimization If the executables are still hanging out, anv_GetPipelineExecutableStatisticsKHR will try to dereference NULL pointers in pipeline->shaders[MESA_SHADER_FRAGMENT]. At least in terms of fossil-db output, this matches the behavior from before `73b3efcd59`. Fixes: `73b3efcd59` ("anv: Handle the null FS optimization after compiling shaders") Closes: #6590 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16898>	2022-06-10 00:22:05 +00:00
Emma Anholt	e8d4eaf172	ci/iris: Disable skqp until it can be stabilized. It keeps blocking marge with flakes across many different tests. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16951>	2022-06-09 18:35:24 +00:00
Jordan Justen	ffb0c97caf	intel: Build mi_builder_test whenever build-tests is set Previously `install-intel-gpu-tests` controlled this, but now `install-intel-gpu-tests` will only be used to decide if it should be installed. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16902>	2022-06-07 18:26:02 +00:00
Jason Ekstrand	81603e7dc2	anv: Use the common image<->buffer copy helper Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16873>	2022-06-07 17:57:42 +00:00
Jason Ekstrand	2c2b3e68e1	vulkan,anv: Move the image offset/extent sanitize helpers to common code Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16873>	2022-06-07 17:57:41 +00:00
Jordan Justen	8381f64251	intel: Fix build of mi_builder_tests by including c99_compat.h We need this so C++ will understand "restrict" which is used in the genxml output. Fixes: `9f717b5f23` ("util: remove needless c99_compat.h includes") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16899>	2022-06-07 08:27:19 +00:00
Emma Anholt	464b32c030	glsl: Drop the div-to-mul-rcp lowering for floats. NIR has fdiv, and all the NIR backends have to have lower_fdiv set appropriately already since various passes (format conversions, tgsi_to_nir, nir_fast_normalize(), etc.) might generate one. This causes softpipe and llvmpipe to now do actual divides, since lower_fdiv is not set there. Note that llvmpipe's rcp implementation is a divide of 1.0 by x, so now we're going to be just doing div(x, y) instead of mul(x, div(1.0, y)). Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16823>	2022-06-07 02:38:42 +00:00
Tapani Pälli	d07ec3f038	anv: use anv_cmd_dirty_mask_t type for dynamic state We were using both uint32_t and anv_cmd_dirty_mask_t, this is a cleanup making type usage consistent. Commit also changes type of the mask to be enum anv_cmd_dirty_bits. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16849>	2022-06-03 14:11:04 +03:00
Erik Faye-Lund	2a134347cb	intel/compiler: use macro for power-of-two check This will allow the use of static_assert here instead of our compiler-specific implementation. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16670>	2022-06-03 07:14:43 +00:00
Paulo Zanoni	72a7d7d7a8	intel/compiler: call ordered_unit() only once at update_inst_scoreboard() Call it once instead of calling the very same function for each source and destination. This should make those ternary operators a little easier to read, IMHO. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15835>	2022-06-02 23:04:39 +00:00
Paulo Zanoni	2256314b08	intel/compiler: split handling of 64 bit floats and ints In opt_algebraic(), handle TYPE_DF in a different check than TYPE_Q. We have a separate flag for each type, use separate checks so platforms where one is true and the other is not can work properly. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15835>	2022-06-02 23:04:39 +00:00
Paulo Zanoni	8f02e6cb19	intel/compiler: compute int64_options based on devinfo->has_64bit_int Don't compute it based on devinfo->has_64bit_float. Othwerwise we may end up emitting 64bit-int (Q) instructions on platforms with 64bit floats but not 64bit integers. Right now, the only platforms where has_64bit_int is different from has_64bit_float are the platforms that use GFX7_FEATURES. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15835>	2022-06-02 23:04:39 +00:00
Kenneth Graunke	26bb81f3f6	intel/compiler: Fix uncompaction of signed word immediates on Tigerlake This expression accidentally performs a 32-bit sign-extension when processing the second half of the expression (the low 16 bits). Consider -7W, which is represented as 0xfff9fff9 in our encoding (the 16-bit word is replicated to both halves of the 32-bit dword). Tigerlake's compaction stores the low 11-bits of an immediate as-is, and replicates the 12th bit. So here, compacted_imm will be 0xff9. ( (int)(0xff9 << 20) >> 4) \| ((short)(0xff9 << 4) >> 4)) 0xfff90000 \| (0xff90 >> 4) 0xfff90000 \| 0xfffffff9 ...oops... 0xfffffff9 By casting the second line of the expression to unsigned short, we prevent the sign-extension when it combines both parts, so we get: 0xfff90000 \| 0x0000fff9 0xfff9fff9 Fixes: `12d3b11908` ("intel/compiler: Add instruction compaction support on Gen12") Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16833>	2022-06-02 13:59:38 -07:00
Erik Faye-Lund	df4fe7c4a2	intel/isl: remove needless c99_compat.h includes Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16812>	2022-06-02 13:09:16 +00:00
Erik Faye-Lund	a8605db504	intel: remove stale makefile When this landed, the Autotools build system was already removed. Why was this file added in the first place? Probably a rebase-mistake... Fixes: `134e750e16` ("i965: extract performance query metrics") Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16790>	2022-06-02 09:13:23 +00:00
Marcin Ślusarz	fd132f25ba	anv: mask out not applicable state flags when setting up mesh pipeline Fixes tests matching: dEQP-VK.pipeline.extended_dynamic_state.cmd_buffer_start.*unused_ms These tests bind mesh pipeline, immediately after that bind non-mesh pipeline and expect that binding mesh pipeline was a no-op. v2: do it in one place & add comment (Lionel) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16811>	2022-06-01 14:13:54 +00:00
Sagar Ghuge	7e098db1ae	anv: Disable storage image compression for possible atomic ops It looks like atomics are slow on compressed surfaces so when enabling compression for storage images that can be possibly used for atomic operation hinders performance. Lets just disable compression in this scenario. v2: Reword comment (Ken) Allow mutable with 16/32/64 bits (Ken) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14712>	2022-06-01 10:05:19 +00:00
Kenneth Graunke	f052e00a58	isl: Add an isl_format_supports_typed_atomics() helper. v2: Add a fields in isl_format with per gen support (Lionel) v3: Fixup R32_FLOAT from 80 to 90 Fixup R32_[SU]INT from 80 to 70 (Ken) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14712>	2022-06-01 10:05:19 +00:00
Timothy Arceri	abe4536c51	ci: uprev piglit 2022-05-31 Also document additional piglit failures and passes. Multiple changes, mostly notable: - few new tests - fixed test for upcoming mesa MR Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16785>	2022-06-01 03:14:29 +00:00
Jason Ekstrand	e5ff2c2242	anv: Use nir_shader_gather_xfb_info Now that the resulting xfb_info is stashed on the shader, we can put this with all the other NIR stuff and only fetch it out at the last minute when we upload the kernel. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Jason Ekstrand	3e04432b3a	nir: Rename nir_gather_xfb_info to nir_shader_get_xfb_info Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Juan A. Suarez Romero	836ce97f5e	ci: bump VK-GL-CTS to 1.3.2.0 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@collabora.com> Acked-by: Alejandro Piñeiro <apinheiro@igalia.com> Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16689>	2022-05-31 15:02:08 +00:00
Jason Ekstrand	faa51a10ed	isl: Add some asserts about multisampled surfaces This isn't really necessary because the API doesn't allow MSAA and mipmapping at the same time but people forget that pretty often so it's good to have it as documentation if nothing else. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14129>	2022-05-31 13:42:28 +00:00
Jason Ekstrand	8d8fb6429c	anv: Implement VK_EXT_image_view_min_lod Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14129>	2022-05-31 13:42:28 +00:00
Jason Ekstrand	a19ed1f46a	intel/isl: Add isl_view::min_lod_clamp for IVB+ Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14129>	2022-05-31 13:42:28 +00:00
David Heidelberg	2cf7f08b04	ci: traces: temporarily disable nheko trace Disable nheko trace until apitrace gets fixed. apitrace currently fails with this trace, when more than 1 run is requested. Upstream issue: https://github.com/apitrace/apitrace/issues/800 Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16774>	2022-05-31 00:00:25 +00:00
Marcin Ślusarz	0f46a8fbfe	anv: remove invalid copy/pasted comment Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16773>	2022-05-30 11:46:13 +00:00
Marcin Ślusarz	34b5a717c0	anv: remove redundant code calculating dynamic states mask pipeline->dynamic_states is already set by anv_graphics_pipeline_init since `231651fd89`. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16773>	2022-05-30 11:46:13 +00:00
David Heidelberg	092d03a90e	ci/iris: skqp: remove flaking atlastext for TGL (gl version) gles version of atlastext was already removed due to same behavior Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16772>	2022-05-30 10:50:12 +00:00
Lionel Landwerlin	09caa8902c	anv: move internal RT shaders to the internal cache Those shaders are just like the blorp ones. v2: Use a single internal cache for blorp/RT (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `7f1e82306c` ("anv: Switch to the new common pipeline cache") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16741>	2022-05-28 10:14:03 +00:00
Jason Ekstrand	5d0b09be5b	anv: Use the base vk_buffer struct This mostly gets us the vk_buffer_range() helper but may be useful in the future. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16607>	2022-05-27 18:38:57 -05:00
Jason Ekstrand	dfedeccc13	intel: Only set VectorMaskEnable when needed For cases with lots of very small primitives, this may improve performance because we're not executing those dead channels all the time. Shader-db reports no instruction or cycle-count changes. However, by hacking up the driver to report when this optimization triggers, it appears to affect about 10% of shader-db. v2 (Kenneth Graunke): Always enable VMask prior to XeHP for now, because using VMask on those platforms allows us to perform the eliminate_find_live_channel() optimization. However, XeHP doesn't seem to have packed fragment shader dispatch, so we lose that optimization regardless, and there's no reason not to avoid vmask. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/1054>	2022-05-27 21:52:48 +00:00
Jason Ekstrand	0d28de212a	anv: Don't disable the fragment shader if XFB is enabled It turns out that we need a fragment shader for streamout. Whh? From Lionel's reading of simulator sources, it seems the streamout unit is looking at enabled next stages. It'll generate output to the clipper in the following cases : - 3DSTATE_STREAMOUT::ForceRendering = ON - PS enabled - Stencil test enabled - depth test enabled - depth write enabled - some other depth/hiz clear condition Forcing rendering without a PS seems like a recipe for hangs so it's probably better to just enable the PS in this case. Fixes: `36ee2fd61c` ("anv: Implement the basic form of VK_EXT_transform_feedback") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16506>	2022-05-27 14:33:53 +00:00
Jason Ekstrand	73b3efcd59	anv: Handle the null FS optimization after compiling shaders Actually compile and cache the no-op fragment shader but remove it from the pipeline if we determine it's a no-op. This way we always have it even if it's not strictly needed. Fixes: `36ee2fd61c` ("anv: Implement the basic form of VK_EXT_transform_feedback") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16506>	2022-05-27 14:33:53 +00:00
Jason Ekstrand	9fe6caf4e7	anv: Drop alpha_to_coverage from the NULL FS optimization Starting with Ivy Bridge, we implement alpha-to-coverage by writting gl_SampleMask with a pattern based on alpha. This will show up in wm_prog_data::uses_omask so we don't need to look at the key. Fixes: `36ee2fd61c` ("anv: Implement the basic form of VK_EXT_transform_feedback") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16506>	2022-05-27 14:33:53 +00:00
Jason Ekstrand	1b9248e761	intel/fs: Copy color_outputs_valid into wm_prog_data Fixes: `36ee2fd61c` ("anv: Implement the basic form of VK_EXT_transform_feedback") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16506>	2022-05-27 14:33:53 +00:00
Jason Ekstrand	8379993223	intel/fs: Drop fs_visitor::emit_alpha_to_coverage_workaround() It no longer exists. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16506>	2022-05-27 14:33:53 +00:00
David Heidelberg	b19c858f3d	ci/intel: add RoR and Nheko traces and reenable most of Valve traces Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16633>	2022-05-27 06:51:38 +00:00
Lionel Landwerlin	e666089082	intel/disasm: add missing handling of <1;1,0> Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `7cd9adeb41` ("intel/compiler: In XeHP prefer <1;1,0> regions before compacting") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16704>	2022-05-26 06:42:16 +00:00
Kenneth Graunke	9886615958	intel/compiler: Move spill/fill tracking to the register allocator Originally, we had virtual opcodes for scratch access, and let the generator count spills/fills separately from other sends. Later, we started using the generic SHADER_OPCODE_SEND for spills/fills on some generations of hardware, and simply detected stateless messages there. But then we started using stateless messages for other things: - anv uses stateless messages for the buffer device address feature. - nir_opt_large_constants generates stateless messages. - XeHP curbe setup can generate stateless messages. So counting stateless messages is not accurate. Instead, we move the spill/fill accounting to the register allocator, as it generates such things, as well as the load/store_scratch intrinsic handling, as those are basically spill/fills, just at a higher level. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16691>	2022-05-25 06:56:01 +00:00
Michael Skorokhodov	10b6d9230c	anv: Update line range This commit increases the maximum line width to 8.0 for SLK+ and to 7.9921875 for BDW and earlier. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6234 Fixes: `fce0027d` ("anv: Unbreak wide lines on HSW/BDW") Signed-off-by: Mykhailo Skorokhodov <mykhailo.skorokhodov@globallogic.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15879>	2022-05-24 23:09:26 +00:00
Kenneth Graunke	59bfc9c6cb	intel: Fix analysis invalidation in eliminate_find_live_channel If we saw a HALT instruction, we would forget to invalidate our analysis pass information before returning progress. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16677>	2022-05-24 22:36:39 +00:00
Marcin Ślusarz	21d3630cbc	intel/tools: fix 32-bit build Fixes: `0aac3b1009` ("intel/tools/aubinator: add support for 2 "new" subopcodes") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6553 Reviewed-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16695>	2022-05-24 18:27:32 +00:00
Viktoriia Palianytsia	e39a5f2b9f	anv: Add workaround for sample mask with multisampling The game Batman: Arkham Knight expects OpenGL behavior with sample mask and multisampling which is different from the Vulkan one. This workaround fix changes key->ignore_sample_mask_out value that is used for prog_data->uses_omask definition in brv_fs.cpp(9740) In that way prog_data->uses_omask also changes it value and the cloak stops flickering. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6078 Signed-off-by: Viktoriia Palianytsia <v.palianytsia@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16551>	2022-05-24 14:43:57 +00:00
Marcin Ślusarz	8187716b55	intel/tools: add macros for gfx12+ variant of VCSUNIT0 Not used for now. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16664>	2022-05-24 08:03:45 +00:00
Marcin Ślusarz	ba80c36708	intel/tools/aubinator: list all platforms in help message Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16664>	2022-05-24 08:03:45 +00:00
Marcin Ślusarz	0aac3b1009	intel/tools/aubinator: add support for 2 "new" subopcodes ... and add macros for subopcodes we haven't seen yet Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16664>	2022-05-24 08:03:44 +00:00
Marcin Ślusarz	43ad5fd9b7	intel/tools: drop wrappers around mmio regs macros Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16664>	2022-05-24 08:03:44 +00:00
Marcin Ślusarz	b916b30f58	intel/tools: clean up mmio regs definitions Each unit has the same regs at the same offsets. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16664>	2022-05-24 08:03:44 +00:00
Marcin Ślusarz	3910736f29	intel/tools: add support for GEM_CREATE_EXT in intel_dump_gpu Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16664>	2022-05-24 08:03:44 +00:00
Jason Ekstrand	c24aa449d0	vulkan,anv,turnip: Add a common CmdBindVertexBuffers wrapper Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16611>	2022-05-20 02:12:37 +00:00
Kenneth Graunke	27314718a3	intel: Drop Wa_1409226450 (stall before instruction cache invalidation) Production Tigerlake and DG1 hardware shouldn't need this workaround. It was only needed on the very first steppings which never went public. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16575>	2022-05-19 21:31:45 +00:00
Lionel Landwerlin	1c077ca9c0	u_trace/anv/iris: drop cs argument for recording traces Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16605>	2022-05-19 19:04:28 +00:00
Lionel Landwerlin	5398c9183e	intel/ds: fix compilation Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6518 Fixes: `efc2782f97` ("intel/perf: store a copy of devinfo") Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16601>	2022-05-19 16:42:41 +00:00
Lionel Landwerlin	9d0db8d4c4	intel/perf: deal with OA reports timestamp values on DG2 OA reports on XeHP have their timestamp shifted to the left by 1. To get that back in the same time domain as the REG_READ you need to shift it back to the right and you're loosing the top bit. v2: use ull for 64bit constant (Ian) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16144>	2022-05-17 19:55:10 +00:00
Lionel Landwerlin	773f41e3e4	intel/perf: disable sseu setting on Gfx12.5+ This is rejected by i915. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16144>	2022-05-17 19:55:10 +00:00
Lionel Landwerlin	d2834dd626	intel/perf: add new layout for Gfx12.5 products Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16144>	2022-05-17 19:55:10 +00:00
Lionel Landwerlin	66045acdf9	intel/perf: add max vfuncs New counters will use those from inside their read function to generate percentage numbers. v2: Forgot to update Iris (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> (v1) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16144>	2022-05-17 19:55:10 +00:00
Lionel Landwerlin	c740ca6000	intel/perf: add support new variable counting the number of EUs in slice0-3 v2: MIN2(4, max_slices) (Marcin) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16144>	2022-05-17 19:55:10 +00:00
Lionel Landwerlin	6f63bc38e7	intel/perf: add OA A counter type On Gfx12.5 products, we'll need to capture a couple of A counters that are not captured in MI_RPC reports. Those are actually global, previously all A counters were per context. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16144>	2022-05-17 19:55:10 +00:00
Lionel Landwerlin	376e420abb	intel/perf: stop overriding oa_format This already set in the intel_perf_setup.h file at metric set creation. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16144>	2022-05-17 19:55:10 +00:00
Lionel Landwerlin	aa04b47c6e	intel/perf: add support for GtSlice/GtSliceXDualsubsliceY variables For those, we'll fish the information out of the devinfo. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16144>	2022-05-17 19:55:10 +00:00
Lionel Landwerlin	d134a62345	intel/perf: add support for dualsubslice count variable This is the same as the subslice count. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16144>	2022-05-17 19:55:10 +00:00
Lionel Landwerlin	efc2782f97	intel/perf: store a copy of devinfo In the future we'll pull more information off devinfo. v2: Constify pointers (Ian) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16144>	2022-05-17 19:55:10 +00:00
Lionel Landwerlin	0df4b96062	intel/perf: add support for new opcodes in code generation Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16144>	2022-05-17 19:55:10 +00:00
Jason Ekstrand	fc8d2543fc	vulkan,v3dv: Add a driver_internal flag to vk_image_view_init/create We already had a little workaround for v3dv where, for some if its meta ops, it had to bind a depth/stenicil image as color. Instead of special-casing binding depth/stencil as color, let's flip on the drier_internal flag and get rid of most of the checks in that case. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16376>	2022-05-17 18:14:55 +00:00
Kenneth Graunke	b637f6c3db	intel/decoder: Fix binding table pointer decoding with large offsets XeHP supports a 20:5 pointer format, so the offset can legitimately be more than UINT16_MAX. Likewise, with 256B binding table mode on Icelake/Tigerlake, we might have 18:8 pointers that exceed UINT16_MAX. Thanks to Felix DeGrood for catching this! Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16538>	2022-05-17 08:52:00 +00:00
David Heidelberg	d22eeb5ae0	ci/iris: skqp: remove flaking atlastext for TGL Example: - https://gitlab.freedesktop.org/mesa/mesa/-/jobs/22380389#L4349 - https://mesa.pages.freedesktop.org/-/mesa/-/jobs/22380389/artifacts///results/gles/report.html Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6460 Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16407>	2022-05-17 06:57:19 +00:00
David Heidelberg	317496ba8a	ci/iris: skqp: add default GLES rendertests for TGL Import the intact whole rendertest file from skqp (branch android-cts-12.1_r1) to be able remove the offending test line in the following commit. Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16407>	2022-05-17 06:57:19 +00:00
Timothy Arceri	d7a071a28f	gallium/drivers: set force_indirect_unrolling_sampler for all required drivers This is set to true for all drivers that have a GLSL level of support lower than 4.00. This matches the rule for setting the GLSL IR option EmitNoIndirectSampler. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16543>	2022-05-17 02:12:21 +00:00
Lionel Landwerlin	17fc7b20b1	anv: fix primitives generated queries values Numbers in some situations are incorrect because we don't stall properly before capturing the register value. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6412 Fixes: `a468f26ca5` ("anv: implement VK_EXT_primitives_generated_query") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16505>	2022-05-14 10:47:29 +00:00
Marcin Ślusarz	1542ab70eb	anv: handle primitive shading rate for mesh Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16030>	2022-05-13 13:05:51 +00:00
Marcin Ślusarz	9acb30c8c4	intel/compiler: implement primitive shading rate for mesh Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16030>	2022-05-13 13:05:51 +00:00
Marcin Ślusarz	aa1c128b54	anv: disable streamout before emitting mesh shading state Fixes tests which use secondary command buffers. Fixes: `ef04caea9b` ("anv: Implement Mesh Shading pipeline") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16493>	2022-05-13 09:43:02 +00:00
Marcin Ślusarz	29a778fa6b	intel/compiler: print name of the unhandled intrinsic Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16493>	2022-05-13 09:43:02 +00:00
Marcin Ślusarz	f083df8710	anv: update task/mesh distribution with the recommended values Fixes: `ef04caea9b` ("anv: Implement Mesh Shading pipeline") Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16493>	2022-05-13 09:43:02 +00:00
Marcin Ślusarz	65ff6932dc	intel/compiler: handle gl_Viewport and gl_Layer in FS URB setup Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16493>	2022-05-13 09:43:02 +00:00
Marcin Ślusarz	040062df41	intel/compiler: handle VARYING_SLOT_CULL_PRIMITIVE in mesh It's needed for gl_MeshPerPrimitiveNV[].gl_ViewportMask Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16493>	2022-05-13 09:43:02 +00:00
Vadym Shovkoplias	55c71217ec	driconf: Add a limit_trig_input_range option With this option enabled range of input values for fsin and fcos is limited to [-2pi : 2pi] by calculating the reminder after 2*pi modulo division. This helps to improve calculation precision for large input arguments on Intel. -v2: Add limit_trig_input_range option to prog_key to update shader cache (Lionel) Signed-off-by: Vadym Shovkoplias <vadym.shovkoplias@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16388>	2022-05-13 06:47:53 +00:00
Kenneth Graunke	ad537edc7c	anv: Fix INTEL_DEBUG=bat on XeHP We no longer emit STATE_BASE_ADDRESS in every batch on XeHP, so the decoder might not know what the various base addresses are if it's only looking at a single batch. Fortunately, they also never change, so we can just emit them once here. On earlier platforms, initializing them here should be harmless. We'll emit STATE_BASE_ADDRESS if we change them, which will update these. Thanks to Iván Briano for catching this. Fixes: `8831cb38aa` ("anv: Stop updating STATE_BASE_ADDRESS on XeHP") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16287>	2022-05-12 11:10:25 -07:00
Jordan Justen	ad565f6b70	intel/dev: Enable first set of DG2 PCI IDs Mostly Matt Roper's kernel patch commit message: The IDs added here are the subset reserved for 'motherboard down' designs of DG2. We have all the necessary support upstream to enable these now. The remaining DG2 IDs for add-in cards will be enabled in a future patch once some additional required functionality has fully landed. Ref: https://patchwork.freedesktop.org/patch/msgid/20220425211251.77154-3-matthew.d.roper@intel.com Cc: 22.1 <mesa-stable> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16449>	2022-05-12 03:03:57 -07:00
Jordan Justen	4456209ce5	intel/dev: Add INTEL_PLATFORM_DG2_G12 Cc: 22.1 <mesa-stable> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16449>	2022-05-12 03:03:57 -07:00
Jason Ekstrand	352e32e5ba	nir/builder: Add a nir_trim_vector helper This pattern pops up a bunch and the semantics of nir_channels() aren't very convenient much of the time. Let's add a nir_trim_vector() which matches nir_pad_vector(). Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16309>	2022-05-11 14:47:33 +00:00
Iván Briano	2e46f38902	anv: re-alloc push constants after secondary command buffers If the secondary command buffer executed used push constants on a different set of stages than the primary is using, we may end up not reallocating them for the primary, getting misrender artifacts at best, or a nice GPU hang at worst. Fixes the tests from a CTS from the future: dEQP-VK.dynamic_rendering.random.* Cc: mesa-stable Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16439>	2022-05-10 21:56:49 +00:00
Karol Herbst	9c5fd100cc	nir: add a nir_remove_non_entrypoints helper This code just got duplicated a lot. There is still more, but the remaining instances do a bit more than just removing other functions. Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16348>	2022-05-10 03:37:44 +00:00
Emma Anholt	af76f0bcfc	ci/iris: Cut the glk-deqp test coverage in half. It's taking 13-14 minutes of deqp-runner time, not counting booting, or the LAVA-side job getting being queued behind other jobs. Well past our 10-minute runtime target, and we saw load on these boards causing the queue to get quite long (https://gitlab.freedesktop.org/mesa/mesa/-/issues/6409#note_1368750) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16359>	2022-05-10 02:16:04 +00:00
Chia-I Wu	b2b810ebff	anv: advertise rectangularLines only for Gen10+ We use the non-strict algorithm (with parallelograms) prior to Gen10 for wide lines. We can not advertise rectangularLines. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Fixes: `f6e7de41d7` ("anv: Implement VK_EXT_line_rasterization") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15432>	2022-05-06 18:22:19 +00:00
Lionel Landwerlin	969512d696	intel: fix stall debug option Missing the parsing bit. Fixes: `317512e038` ("anv/intel: add a new debug flag for stalling after every draw/dispatch") Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16338>	2022-05-06 08:27:47 +00:00
Emma Anholt	3a42e92a4f	glsl: Drop the dead MOD_TO_FLOOR path. It's now called lower_fmod in NIR. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8044>	2022-05-05 22:25:03 +00:00
Emma Anholt	72dba615be	ci/iris: Add a bunch of APL and KBL flakes recently. I got hit by one of them trying to merge !8044. Just update the list. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8044>	2022-05-05 22:25:03 +00:00
Emma Anholt	3c0e4be89b	ci/iris: Demote APL deqp to manual-only for now. it's been flaking with "2022-05-05 16:29:49.055151: [0m[31mERROR - Failure getting run results: parsing results: Reading from dEQP: timed out waiting for fd to be ready (See \"//results/c32.r1.log\")" and a pile of missings since the brief "whoops, HW CI failed to listen to the test exit code" regression. The only ways I know of to hit this case would be: 1) The deqp binary abruptly wedges on its own. This happens with NFS failures sometimes, but the rest of the run went fine and we never got the kernel complaining about NFS, so that seems unlikely. 2) The stderr pipe filled up before stdout was completed, and deqp got wedged trying to output stderr (happens sometimes when you do like NIR_DEBUG=print in your run). Both of these seem unlikely, given that we've got a big .qpa file that made it all the way to writing out test case durations at the end of the run before abruptly terminating. Why didn't we have at least some of the test results parsed? The next deqp-runner release we integrate will solve #2, and cleans up these error paths a bunch, so I'm hoping we get more information soon. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16350>	2022-05-05 18:20:12 +00:00
Lionel Landwerlin	797a8850b9	anv: remove static_state_mask This is now unnecessary. Either an instruction is never dynamic and it's emitted in genX_pipeline.c or it can be and it's emitted in genX_cmd_buffer.c/gfx8_cmd_buffer/gfx7_cmd_buffer.c Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16220>	2022-05-03 17:12:45 +00:00
Lionel Landwerlin	74a27a6ccb	anv: don't emit 3DSTATE_VF_TOPOLOGY in pipeline batch v2: drop primitive_topology = 0xffffffff (Tapani) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16220>	2022-05-03 17:12:45 +00:00
Lionel Landwerlin	48229d11ba	anv: don't emit 3DSTATE_DEPTH_BOUNDS in pipeline batch Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16220>	2022-05-03 17:12:45 +00:00
Lionel Landwerlin	76e735d09c	anv: don't emit 3DSTATE_BLEND_STATE_POINTERS in pipeline batch Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16220>	2022-05-03 17:12:45 +00:00
Lionel Landwerlin	e9d000a831	anv: don't emit 3DSTATE_WM in pipeline batch Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16220>	2022-05-03 17:12:44 +00:00
Lionel Landwerlin	065242d623	anv: don't emit 3DSTATE_STREAMOUT in pipeline batch Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16220>	2022-05-03 17:12:44 +00:00
Lionel Landwerlin	ce8bb29342	anv: never emit 3DSTATE_CPS in the pipeline batch Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16220>	2022-05-03 17:12:44 +00:00
Lionel Landwerlin	168b13364f	anv: rework sample location On Gfx7 we can only give the sample location for a given multisample number. This means everytime the multisampling value changes, we have to re-emit the locations. It's fine because it's also where (3DSTATE_MULTISAMPLE) the number of samples is stored. On Gfx8+ though, 3DSTATE_MULTISAMPLE only holds the number of samples and all the sample locations for all number of samples are located in 3DSTATE_SAMPLE_PATTERN. So to be more effecient there, we need to track the locations for all sample numbers and compare new values with the relevant sample count when touching the dynamic state. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16220>	2022-05-03 17:12:44 +00:00
Lionel Landwerlin	810518fda7	Revert "anv: fix dynamic state emission" This reverts commit `f348103fce`. The change was causing performance regressions. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16220>	2022-05-03 17:12:44 +00:00
Lionel Landwerlin	69e6417e19	anv: add missing logic op set in pipeline dyn state v2: add ANV_CMD_DIRTY_DYNAMIC_LOGIC_OP check (Tapani) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `75ad0e4b08` ("anv: support blending logic op dynamic state") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16220>	2022-05-03 17:12:44 +00:00
Lionel Landwerlin	5048f15737	anv: reset all dynamic state after secondary execution We don't know in what state the secondary buffer will leave the HW when it ends. It's easier to consider everything needs to be reemitted for now. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16220>	2022-05-03 17:12:44 +00:00
Lionel Landwerlin	4efc997472	anv: fix invalid utrace memcpy l3 config on gfx < 11 device->l3_config is only valid on Gfx11+ This only fixes using GPU_TRACE=1 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `02a4d622ed` ("anv: expose a couple of emit helper to build utrace buffer copies") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16291>	2022-05-03 13:18:48 +00:00
Rob Clark	c4b5ebe1fc	drm-shim: Better mmap offsets Using the bo pointer address as the offset doesn't go over well when someone is fuzzing you. But we already have the mem_addr, we can simply use that instead. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16250>	2022-05-02 19:50:33 +00:00
Caio Oliveira	7cd9adeb41	intel/compiler: In XeHP prefer <1;1,0> regions before compacting Ken performed some tests with shader-db to evaluate the effects ``` Across all 145,848 shaders generated, the results were: Total bytes compacted before: 3,326,224 Total bytes compacted after: 60,963,280 ``` Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15399>	2022-05-02 18:03:01 +00:00
Lionel Landwerlin	0be9cac742	anv: limit clflush usage Discrete platforms don't have LLC, but on those, we mmap our buffers with WC. So we shouldn't need to clflush there. Anv already had a boolean field on the physical device to know whether we need to use clflush(), based off the memory heaps available. So use that instead. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15780>	2022-05-02 12:07:01 +00:00
Lionel Landwerlin	44e93b4c6f	anv: fix clflush usage on utrace copy batch Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `cc5843a573` ("anv: implement u_trace support") Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15780>	2022-05-02 12:07:01 +00:00
Francisco Jerez	14cad38b19	intel/dev: Compute pixel pipe information based on geometry topology DRM query. This changes the intel_device_info calculation to call an additional DRM query requesting the geometry topology from the kernel, which may differ from the result of the current topology query on XeHP+ platforms with compute-only and 3D-only DSSes. This seems more reliable than the current guesswork done in intel_device_info.c trying to figure out which DSSes are available for the render CS. Cc: 22.1 <mesa-stable> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14143>	2022-04-30 00:00:58 +00:00
Jordan Justen	de99a11172	intel_dev_info: Add --hwconfig command line parameter Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14511>	2022-04-28 21:56:32 +00:00
Jordan Justen	d9ff9ea9c3	intel/dev: Read hwconfig from i915 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14511>	2022-04-28 21:56:32 +00:00
Emma Anholt	536c8ee96d	nir/lower_tex: Make the adding a 0 LOD to nir_op_tex in the VS optional. This controls the whole lowering of "make tex ops with implicit derivatives on non-implicit-derivative stages be tex ops with an explicit lod of 0 instead", but it's really hard to describe that in a git commit summary. All existing callers get it added except: - nir_to_tgsi which didn't want it. - nouveau, which didn't want it (fixes regressions in shadowcube and shadow2darray with NIR, since the shading languages don't expose txl of those sampler types and thus it's not supported in HW) - optional lowering passes in mesa/st (lower_rect, YUV lowering, etc) Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16156>	2022-04-28 21:26:08 +00:00
Nanley Chery	b023f18bad	isl,iris: Add DG2 CCS modifier support for XeHP Cc: 22.1 <mesa-stable> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14521>	2022-04-28 20:02:14 +00:00
Nanley Chery	a53abeb7fb	intel/isl: Add a score for I915_FORMAT_MOD_4_TILED Enables the modifier in anv. Cc: 22.1 <mesa-stable> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14521>	2022-04-28 20:02:14 +00:00
Anuj Phogat	ac441d0953	isl,iris: Add I915_FORMAT_MOD_4_TILED support for XeHP This patch adds Tile 4 modifier support to Mesa and allows Mesa to use Tile 4 on gen12-hp with GBM. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Cc: 22.1 <mesa-stable> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14521>	2022-04-28 20:02:14 +00:00
Tapani Pälli	d3ef3657b2	isl: disable mcs (and mcs+ccs) for color msaa on DG2 Fixes lots of various test failures in: dEQP-VK.pipeline.multisample.min_sample_shading_disabled.* dEQP-GLES3.functionalmultisample. KHR-GLsample_variables. Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13991>	2022-04-28 05:31:52 +00:00
Lionel Landwerlin	f4f350a06c	anv: reemit 3DSTATE_STREAMOUT after memcpy This doesn't fix anything because memcpy is only used before secondary buffer execution and we dirty everything after that. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16189>	2022-04-27 18:43:00 +00:00
David Heidelberg	657b0ff861	ci/iris: Enable SKQP on Tiger Lake boards - SKQP gets included now in all amd64 LAVA builds. - add test job for Tiger Lake (tgl) - add manual test job for Whiskey Lake (whl), because all runners are already used - document that we have 13 tgl machines Tests failed (on tgl): - gl_simpleaaclip_aaclip, 1 pixel off : https://okias.pages.freedesktop.org/-/mesa/-/jobs/21790629/artifacts///results/gl/report.html Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16048>	2022-04-27 12:35:13 +00:00
David Heidelberg	c1e59bea05	ci: intel: Merge anv and iris into src/intel/ci This commit make simple adding tests which use both GL(ES) and VK. Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16048>	2022-04-27 12:35:13 +00:00
Erik Faye-Lund	3620e7e71c	vulkan: drop empty vulkan_wsi_args This is always empty, so let's just get rid of it. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16162>	2022-04-27 11:51:26 +00:00
Sviatoslav Peleshko	28ca5636f6	anv: workaround apps that assume full subgroups without specifying it Without this we might choose 8 or 16 width, while the app assumes 32. With subgroup operations it may cause wrong calculations and thus bugs. Examples of such games are Aperture Desk Job and DOOM Eternal. v2: Make it a driconf option instead of applying unconditionally, move from brw_required_dispatch_width to brw_compile_cs v3: Rename allow_assuming_full_subgroups -> assume_full_subgroups. Include assume_full_subgroups value in anv_pipeline_hash_compute(). v4: Move actual workaround code from brw_fs.c -> anv_pipeline.c. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6171 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15708>	2022-04-26 13:21:43 +00:00
Lionel Landwerlin	fe413962b4	anv: skip acceleration structure in binding table emission With mutable descriptor types, we can end up in a situation where a binding can be, for instance, both a UBO and an acceleration structure. While we can promote the UBO to a binding table entry and the shader can use it, this isn't true of acceleration structures that have no surface state. In that case just skip the entry. The shader is already compiled to use the descriptor entry. In the non mutable case, the entry will not be created by anv_nir_apply_pipeline_layout. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `63e91148b7` ("anv: Enable VK_VALVE_mutable_descriptor_type") Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15969>	2022-04-25 13:19:28 +00:00
Lionel Landwerlin	b7828f56ba	anv: fix acceleration structure descriptor template writes Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `d258b0bf0e` ("anv: Add support for binding acceleration structures") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16058>	2022-04-25 11:01:56 +00:00
Lionel Landwerlin	ace22edd30	anv: remove unused enum Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16058>	2022-04-25 11:01:56 +00:00
Lionel Landwerlin	107acf5a4a	intel: fixup number of threads per EU on XeHP Computations for indexing in-memory data structures for ray queries depend on this. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `4f9141607f` ("intel: Add device info for DG2") Acked-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15925>	2022-04-25 10:06:02 +00:00
Lionel Landwerlin	5a52cfd88b	anv: fix INTEL_DEBUG=sync Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `3684012770` ("anv: implement DEBUG_SYNC") Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16101>	2022-04-22 21:59:50 +00:00
Jason Ekstrand	2d3b3b757a	anv: Clean up pipeline cache helpers a bit Instead of having two different helpers, delete the pipeline_cache ones. Also, instead of manually handling the cache == NULL case in every vkCreateFooPipelines call, handle it inside the helpers. This means that BLORP can use them too by passing cache=NULL. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13184>	2022-04-22 19:38:52 +00:00
Jason Ekstrand	7f1e82306c	anv: Switch to the new common pipeline cache This patch is intended to be somewhat minimal. There's a lot of cleanup work that can be done but we'll leave that to later patches. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13184>	2022-04-22 19:38:52 +00:00
Jason Ekstrand	c551f6c4df	anv: Rename a fail label in CreateDevice The rest of them are labeled with the thing they need to destroy first, not the thing that failed. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13184>	2022-04-22 19:38:52 +00:00
Vadym Shovkoplias	785b6579ae	anv: Fix geometry flickering issue when compute and 3D passes are combined Call flush_pipeline_select_3d in CmdBeginRendering() to emit a dummy MEDIA_VFE_STATE before switching from GPGPU to 3D. Original commit with the fix: `bc612536` ("anv: Emit a dummy MEDIA_VFE_STATE before switching from GPGPU to 3D") Fixes: `3501a3f9ed` ("anv: Convert to 100% dynamic rendering") Signed-off-by: Vadym Shovkoplias <vadym.shovkoplias@globallogic.com> Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6201 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15954>	2022-04-21 11:00:07 +00:00
Jordan Justen	d257494ec4	intel/dev: Add device info for RPL-P Cc: mesa-stable Ref: https://patchwork.freedesktop.org/series/102701/ Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16054>	2022-04-21 01:32:53 -07:00
Lionel Landwerlin	a468f26ca5	anv: implement VK_EXT_primitives_generated_query Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15638>	2022-04-20 10:37:24 +03:00
Lionel Landwerlin	8ef8e72aac	intel/fs: tidy up lower of ray queries We already expect a single function. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15946>	2022-04-19 12:56:06 +00:00
Marcin Ślusarz	5dace41c10	intel/compiler: invalidate metadata in brw_nir_initialize_mue New "if" blocks may have been inserted. Fixes: `bc4f8c073a` ("intel/compiler: inject MUE initialization") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15924>	2022-04-19 11:43:55 +00:00
Marcin Ślusarz	4fddef33d5	intel/compiler: invalidate all metadata in brw_nir_lower_intersection_shader New "if" blocks were inserted. Fixes: `303378e1dd` ("intel/rt: Add lowering for combined intersection/any-hit shaders") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15924>	2022-04-19 11:43:55 +00:00
Marcin Ślusarz	5bd3ba5b67	anv: invalidate all metadata in anv_nir_lower_ubo_loads lower_ubo_load_instr may insert "if" blocks. Fixes: `61749b5a15` ("anv: Add a pass for lowering A64 UBO access") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15924>	2022-04-19 11:43:55 +00:00
Lionel Landwerlin	184084e21c	anv: allow getting the address of the beginning of the batch There is no reason not to be able to get it. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `34a0ce58c7` ("anv: add a new execution mode for secondary command buffers") Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15968>	2022-04-19 10:43:29 +00:00
Alexey Bozhenko	2d7d907ad1	intel/compiler: fix singleton pointer coverity warning fix brw_kernel::stats member that was declared as a variable but used as a pointer to array of 3 elements CID: 1503279 Signed-off-by: Bozhenko Alexey <oleksii.bozhenko@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15975>	2022-04-19 12:36:10 +03:00
Lionel Landwerlin	3684012770	anv: implement DEBUG_SYNC Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15950>	2022-04-19 07:32:01 +00:00
Lionel Landwerlin	317512e038	anv/intel: add a new debug flag for stalling after every draw/dispatch Useful for hang debugging. Previously Anv incorrectly used DEBUG_SYNC for this. v2: Update documentations for sync/stall (Jordan) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15950>	2022-04-19 07:32:01 +00:00
Lionel Landwerlin	a1969fa777	anv: improve INTEL_DEBUG for submit Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15950>	2022-04-19 07:32:01 +00:00
illiliti	67af7e2b40	Use proper types for meson objects Fix invalid usage of meson objects which violates official meson specification and thus breaks muon, an implementation of meson written in C. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15715>	2022-04-18 13:03:08 +03:00
Lionel Landwerlin	04bd007757	intel/fs: require memory fence commit bit on Gfx9 Fixes a hang on Gfx9 GT1 : dEQP-VK.compute.zero_initialize_workgroup_memory.max_workgroup_memory.128 Tested-by: Mark Janes <markjanes@swizzler.org> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15596>	2022-04-17 21:24:17 +00:00
Lionel Landwerlin	b07c215c35	intel: fix URB programming for GT1s We're missing a programming restriction. Hopefully fixing dEQP-VK.spirv_assembly.instruction.graphics.float16.arithmetic_1.* on Gfx9atoms Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6216 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com>. Tested-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15596>	2022-04-17 21:24:17 +00:00
Jason Ekstrand	ad0dc8e4ab	intel/compiler: Set lower_fisnormal Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15985>	2022-04-16 00:26:43 +00:00
Lionel Landwerlin	7be6632f7d	anv: use shadow surface for stencil input attachment on gfx7 This fixes a number of tests like : dEQP-VK.renderpass.suballocation.multisample.s8_uint. dEQP-VK.renderpass.suballocation.multisample.separate_stencil_usage.d24_unorm_s8_uint..test_stencil dEQP-VK.renderpass.suballocation.multisample.d24_unorm_s8_uint. dEQP-VK.renderpass.suballocation.multisample.d32_sfloat_s8_uint. Because the driver asserts when generating RENDER_SURFACE_STATE with a 8 Valign value for stencil buffer (only 2 & 4 are supported). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12670>	2022-04-15 09:46:40 +03:00
Lionel Landwerlin	08f3950d6b	anv: stop using old entrypoint/struct/enum names for 1.3 v2: More replacements Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> (v1) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15920>	2022-04-13 21:13:56 +00:00
Lionel Landwerlin	e11bedb9f5	intel/fs: add a note on possible optimization of root node address Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15910>	2022-04-13 11:24:49 +00:00
Lionel Landwerlin	9c0805ef91	intel/fs: fix metadata preserve on trace_ray intrinsic `c78be5da30` ("intel/fs: lower ray query intrinsics") introduced a helper function using nir_(push\|pop)_if which invalidated dominance & block_index for the replacement of nir_intrinsic_rt_trace_ray. We can still keep dominance/block_index metadata for the lowering of nir_intrinsic_rt_execute_callable though. This change uses 2 different lowering function with correct metadata preservation. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `c78be5da30` ("intel/fs: lower ray query intrinsics") Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15910>	2022-04-13 11:24:49 +00:00
Jason Ekstrand	69b5424ea4	intel/nir: Lower 8 and 16-bit bitwise unops Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15829>	2022-04-12 23:19:38 +00:00
Jason Ekstrand	a482877c70	intel/fs: Implement 16-bit [ui]mul_high Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15829>	2022-04-12 23:19:38 +00:00
Mykhailo Skorokhodov	9c7e750ffe	intel/fs: Enable b2f(inot(a)) and b2i(inot(a)) optimization for Gfx12+ The commit enables the optimization for Intel Gfx12+ graphics. Tigerlake ``` total instructions in shared programs: 1289326 -> 1289015 (-0.02%) instructions in affected programs: 37841 -> 37530 (-0.82%) helped: 78 HURT: 9 helped stats (abs) min: 1 max: 26 x̄: 4.69 x̃: 3 helped stats (rel) min: 0.10% max: 12.50% x̄: 2.07% x̃: 1.21% HURT stats (abs) min: 1 max: 18 x̄: 6.11 x̃: 4 HURT stats (rel) min: 0.16% max: 1.95% x̄: 0.94% x̃: 0.61% 95% mean confidence interval for instructions value: -4.95 -2.20 95% mean confidence interval for instructions %-change: -2.34% -1.18% Instructions are helped. total cycles in shared programs: 105606388 -> 105606442 (<.01%) cycles in affected programs: 620119 -> 620173 (<.01%) helped: 49 HURT: 28 helped stats (abs) min: 2 max: 3618 x̄: 228.63 x̃: 12 helped stats (rel) min: 0.02% max: 23.31% x̄: 4.60% x̃: 1.11% HURT stats (abs) min: 1 max: 2142 x̄: 402.04 x̃: 29 HURT stats (rel) min: 0.01% max: 36.42% x̄: 5.01% x̃: 0.46% 95% mean confidence interval for cycles value: -151.80 153.20 95% mean confidence interval for cycles %-change: -3.00% 0.79% Inconclusive result (value mean confidence interval includes 0). ``` Related-to: `7725d60938` Signed-off-by: Mykhailo Skorokhodov <mykhailo.skorokhodov@globallogic.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14017>	2022-04-12 10:55:05 +00:00
Marcin Ślusarz	65600a34c2	anv: initialize 3DMESH_1D.ExtendedParameter0 when ExtendedParameter0Present When IndirectParameterEnable==true it's not actually used by the hardware, but if it's not initialized and INTEL_DEBUG=bat is set, then Valgrind complains. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15850>	2022-04-12 09:10:31 +00:00
Marcin Ślusarz	f844ce66c8	anv: fix push constant lowering for task/mesh Fixes: `a6031cd9bd` ("anv: fix push constant lowering with bindless shaders") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15850>	2022-04-12 09:10:31 +00:00
Francisco Jerez	e858da39e5	intel/perf: Fix OA report accumulation on Gfx12+. The intel_perf_query path used for performance queries on GL was passing a bogus "end" pointer to intel_perf_query_result_accumulate(), causing it to accumulate garbage values. This was causing the values of many performance counters to be corrupted. The "end" pointer was incorrect because the current code was assuming that different OA reports were located TOTAL_QUERY_DATA_SIZE bytes apart, which is a hard-coded preprocessor define. However recent (Gfx12+) hardware generations use a variable query size determined by the query layout. Use the size derived from it instead, and remove the stale define. Fixes: `3c51325025` ("intel/perf: switch query code to use query layout") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15783>	2022-04-12 00:11:47 +00:00
Kenneth Graunke	b05ac36f01	intel/genxml: Add SAMPLER_MODE bits for enabling Small PL on Icelake This enables a lower power mode in the sampler hardware in certain common scenarios. On Tigerlake, SAMPLER_MODE is not programmable by userspace but the kernel already sets this bit for us. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15628>	2022-04-11 19:17:07 +00:00
Kenneth Graunke	e3defe7ae7	intel/genxml: Delete SAMPLER_MODE register definition on Gfx12+ While this register still exists, it's no longer a per-context register. Instead, on Gfx12+, SAMPLER_MODE exists per dual-subslice and is accessed as a "multicast" register, where you write control which version is accessed by the "steering control register". At any rate, userspace cannot write it any longer, and so there's not much point to it existing in our genxml (which was missing most of the fields anyway). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15628>	2022-04-11 19:17:07 +00:00
Kenneth Graunke	8092704705	intel/genxml: Add new "Low Quality Filter" field on Gfx12+. This allows the sampler to perform faster filtering of 8-bit UNORM textures by filtering them at a different precision. The filtering is intended to still be OpenGL and DirectX spec compliant. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15628>	2022-04-11 19:17:07 +00:00
Kenneth Graunke	9a70385e2b	intel/genxml: Add SAMPLER_STATE::Allow Low Quality LOD Calculation field This allows the hardware to perform a faster LOD calculation in many simple cases. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15628>	2022-04-11 19:17:07 +00:00
Vitalii.Lomaka	1407a4db69	intel/batch-decoder: Fix uninitialized scalar variables CID: 1498516 CID: 1498560 Signed-off-by: Vitalii Lomaka <vitalii.lomaka@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15685>	2022-04-08 18:35:34 +00:00
Benjamin Cheng	0666b7fecc	anv: drop from_wsi bit from anv_image It was originally introduced in `ca791f5c` but it was never actually set anywhere. It doesn't serve any purpose other than some sanity checking so let's clean it up for now. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15799>	2022-04-07 18:46:50 +00:00
Ian Romanick	b5fa43952a	intel/fs: Better handle constant sources of FS_OPCODE_PACK_HALF_2x16_SPLIT I noticed that a LOT of fragment shaders in Shadow of the Tomb Raider, for instance, end up with a sequence of NIR like: vec1 32 ssa_2 = load_const (0x00000000 = 0.000000) ... vec1 32 ssa_191 = pack_half_2x16_split ssa_188, ssa_2 vec1 32 ssa_192 = pack_half_2x16_split ssa_189, ssa_2 vec1 32 ssa_193 = pack_half_2x16_split ssa_190, ssa_2 This results in an assembly sequence like: mov(8) g28<1>UD 0x00000000UD mov(8) g21<2>HF g28<8,8,1>F shl(8) g21<1>UD g21<8,8,1>UD 0x00000010UD mov(8) g21<2>HF g25<8,8,1>F mov(8) g19<2>HF g28<8,8,1>F shl(8) g19<1>UD g19<8,8,1>UD 0x00000010UD mov(8) g19<2>HF g23<8,8,1>F mov(8) g20<2>HF g28<8,8,1>F shl(8) g20<1>UD g20<8,8,1>UD 0x00000010UD mov(8) g20<2>HF g24<8,8,1>F After this commit, the generated assembly is: mov(8) g21<1>UD 0x00000000UD mov(8) g21<2>HF g23<8,8,1>F mov(8) g19<1>UD 0x00000000UD mov(8) g19<2>HF g17<8,8,1>F mov(8) g20<1>UD 0x00000000UD mov(8) g20<2>HF g18<8,8,1>F Tiger Lake, Ice Lake, Skylake, and Haswell had similar results. (Ice Lake shown) total instructions in shared programs: 20119086 -> 20119034 (<.01%) instructions in affected programs: 9056 -> 9004 (-0.57%) helped: 8 HURT: 0 helped stats (abs) min: 2 max: 16 x̄: 6.50 x̃: 4 helped stats (rel) min: 0.29% max: 1.75% x̄: 1.00% x̃: 0.98% 95% mean confidence interval for instructions value: -11.01 -1.99 95% mean confidence interval for instructions %-change: -1.56% -0.44% Instructions are helped. total cycles in shared programs: 861019414 -> 861021044 (<.01%) cycles in affected programs: 279862 -> 281492 (0.58%) helped: 4 HURT: 2 helped stats (abs) min: 6 max: 936 x̄: 239.00 x̃: 7 helped stats (rel) min: 0.03% max: 8.13% x̄: 2.09% x̃: 0.09% HURT stats (abs) min: 18 max: 2568 x̄: 1293.00 x̃: 1293 HURT stats (rel) min: 0.36% max: 1.14% x̄: 0.75% x̃: 0.75% 95% mean confidence interval for cycles value: -972.56 1515.89 95% mean confidence interval for cycles %-change: -4.77% 2.49% Inconclusive result (value mean confidence interval includes 0). Broadwell total instructions in shared programs: 17812327 -> 17812263 (<.01%) instructions in affected programs: 9867 -> 9803 (-0.65%) helped: 8 HURT: 0 helped stats (abs) min: 2 max: 28 x̄: 8.00 x̃: 4 helped stats (rel) min: 0.32% max: 1.80% x̄: 1.00% x̃: 0.95% 95% mean confidence interval for instructions value: -15.46 -0.54 95% mean confidence interval for instructions %-change: -1.54% -0.47% Instructions are helped. total cycles in shared programs: 904768620 -> 904773291 (<.01%) cycles in affected programs: 454799 -> 459470 (1.03%) helped: 4 HURT: 4 helped stats (abs) min: 36 max: 586 x̄: 344.50 x̃: 378 helped stats (rel) min: 0.47% max: 4.04% x̄: 2.01% x̃: 1.77% HURT stats (abs) min: 1 max: 5572 x̄: 1512.25 x̃: 238 HURT stats (rel) min: <.01% max: 2.77% x̄: 1.46% x̃: 1.53% 95% mean confidence interval for cycles value: -1122.40 2290.15 95% mean confidence interval for cycles %-change: -2.26% 1.71% Inconclusive result (value mean confidence interval includes 0). total spills in shared programs: 18581 -> 18579 (-0.01%) spills in affected programs: 323 -> 321 (-0.62%) helped: 1 HURT: 0 total fills in shared programs: 24985 -> 24981 (-0.02%) fills in affected programs: 1348 -> 1344 (-0.30%) helped: 1 HURT: 0 Tiger Lake, Ice Lake, and Skylake had similar results. (Ice Lake shown) Instructions in all programs: 143585431 -> 143513657 (-0.0%) Instructions helped: 14403 Cycles in all programs: 8439312778 -> 8439371578 (+0.0%) Cycles helped: 10570 Cycles hurt: 3290 Gained: 146 Lost: 74 All of the lost and gained fossil-db shaders are SIMD32 fragment shaders. 14,247 of the affected shaders are from Shadow of the Tomb Raider. 154 are from Batman Arkham Origins, and the remaining two are from Octopath Traveler. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15089>	2022-04-07 18:26:23 +00:00
Ian Romanick	c08302670b	intel/compiler: Fix sample_d messages on DG2 DG2 can only do sample_d and sample_d_c on 1D and 2D surfaces. The maximum number of gradient components and coordinate components should be 2. In spite of this limitation, the Bspec lists a mysterious R component before the min_lod, so the maximum coordinate components is 3. Fixes the following Vulkan CTS failures on DG2: dEQP-VK.glsl.texture_functions.texturegradclamp.isampler1d_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.isampler2d_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.sampler1d_fixed_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.sampler1d_float_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.sampler2d_fixed_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.sampler2d_float_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.usampler1d_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.usampler2d_fragment The Fixes: tag below is a bit misleading. This commit fixes some test cases similar to ones fixed by the Fixes: commit. I just want to make sure this commit gets applied everywhere that commit was also applied. Fixes: `635ed58e52` ("intel/compiler: Lower txd for 3D samplers on XeHP.") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15781>	2022-04-07 17:09:28 +00:00
Jason Ekstrand	13fc698cef	anv/formats: Relax usage checks if EXTENDED_USAGE_BIT is set Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14153>	2022-04-07 15:56:33 +00:00
Lionel Landwerlin	b5031bd6f7	intel/nir: don't report progress on rayqueries if no queries Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `c78be5da30` ("intel/fs: lower ray query intrinsics") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15769>	2022-04-07 08:24:19 +00:00
Lionel Landwerlin	56ef501e3a	blorp: disable depth bounds Otherwise the driver setting interacts with it. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `939ddccb7a` ("anv: Add support for depth bounds testing.") Fixes: `1df871f8ff` ("iris: Add support for depth bounds testing.") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15763>	2022-04-06 19:00:50 +00:00
Lionel Landwerlin	3069337144	anv: remove unused 3DSTATE_DEPTH_BOUNDS fields Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15763>	2022-04-06 19:00:50 +00:00
Lionel Landwerlin	88f77aa811	anv: disable preemption on 3DPRIMITIVE on gfx12 To workaround a push constant corruption issue. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5963 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5662 Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15753>	2022-04-06 12:51:15 +00:00
Vadym Shovkoplias	04a6693871	anv: fix EXT_depth_clip_control This fixes arb_clip_control-clip-control and depth_clamp piglit tests on zink. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6186 Signed-off-by: Vadym Shovkoplias <vadym.shovkoplias@globallogic.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15561>	2022-04-06 13:26:52 +03:00
Jason Ekstrand	29b8097408	anv: Enable VK_EXT_debug_utils It's implemented in common code as long as you use vk_command_buffer. Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15560>	2022-04-06 01:18:23 +00:00
Mike Blumenkrantz	6fd344ff98	anv: expose VK_EXT_image_2d_view_of_3d sampling only available on gen9+ Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15754>	2022-04-05 20:30:31 +00:00
Omar Akkila	4208895175	ci: bump VK-GL-CTS to 1.3.1.1 Signed-off-by: Omar Akkila <omar.akkila@collabora.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15668>	2022-04-04 23:04:33 +00:00
Jason Ekstrand	94ce812497	anv: Advertise two more formats These both require swizzling so border colors won't work. However, they're conveniently in the list of formats for which custom border colors require you to specify a format in the sampler. That list constists of: - VK_FORMAT_B4G4R4A4_UNORM_PACK16 - VK_FORMAT_B5G6R5_UNORM_PACK16 - VK_FORMAT_B5G5R5A1_UNORM_PACK16 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6226 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15624>	2022-04-04 21:42:23 +00:00
Jason Ekstrand	e32b9e5c3f	anv: Generalize border color swizzles Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15624>	2022-04-04 21:42:23 +00:00
Jason Ekstrand	54509d27d9	anv: Disallow blending on swizzled formats Fixes: `c20f78dc5d` ("anv: Support swizzled formats.") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15624>	2022-04-04 21:42:23 +00:00
Jason Ekstrand	257a20f40d	intel/isl: Add a helper for swizzling color values Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15624>	2022-04-04 21:42:23 +00:00
Ian Romanick	7fd1955412	nir: intel/compiler: Lower TXD on array surfaces on DG2+ DG2 can only do sample_d and sample_d_c on 1D and 2D surfaces. Cube maps and 3D surfaces were already handled, but 1D array and 2D array surfaces were not. Fixes the following Vulkan CTS failures on DG2: dEQP-VK.glsl.texture_functions.texturegradclamp.isampler1darray_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.isampler2darray_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.sampler1darray_fixed_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.sampler1darray_float_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.sampler2darray_fixed_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.sampler2darray_float_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.usampler1darray_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.usampler2darray_fragment The Fixes: tag below is a bit misleading. This commit adds another lowering, similar to the one in the Fixes: commit, that probably should have been added at the same time. I just want to make sure this commit gets applied everywhere that commit was also applied. Fixes: `635ed58e52` ("intel/compiler: Lower txd for 3D samplers on XeHP.") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15681>	2022-03-31 12:59:18 -07:00
Rohan Garg	d876abeaa8	anv: Drop dead code in anv_UpdateDescriptorSets Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15666>	2022-03-30 15:19:47 +02:00
Lionel Landwerlin	684a4ea30c	intel/clc: fix missing pointer write Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `346a7f14fb` ("intel/compiler: Add code for compiling CL-style SPIR-V kernels") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15611>	2022-03-30 07:56:25 +00:00
Mike Blumenkrantz	65ec846f77	intel/isl: fix 2d view of 3d textures according to KHR_gl_texture_3D_image: If <target> is EGL_GL_TEXTURE_3D_KHR, <buffer> must be the name of a complete, nonzero, GL_TEXTURE_3D (or equivalent in GL extensions) target texture object, cast into the type EGLClientBuffer. <attr_list> should specify the mipmap level (EGL_GL_TEXTURE_LEVEL_KHR) and z-offset (EGL_GL_TEXTURE_ZOFFSET_KHR) which will be used as the EGLImage source; the specified mipmap level must be part of <buffer>, and the specified z-offset must be smaller than the depth of the specified mipmap level. thus a 2d view of a 3d surface is not only legal, it's part of the spec and must be supported when available cc: mesa-stable Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15584>	2022-03-29 21:44:51 +00:00

... 2 3 4 5 6 ...

8177 Commits