KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Yiwei Zhang	31727f114a	venus: use linear modifier for legacy common wsi path Towards the renderer, venus better uses VK_EXT_image_drm_format_modifier to force linear with tiling modifier and mod_linear. Doing so won't make any difference on the mesa implementations we care about given we have required VK_EXT_image_drm_format_modifier for wsi support. A lucky side effect of this is to allow common wsi to work with host implementations not supporting dma_buf export. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15993>	2022-04-21 01:29:21 +00:00
Yiwei Zhang	09cee71e80	venus: override aspectMask for internal tiling modifier WSI images and Android AHBs can have tiling modifier overrides, thus we must override the aspectMask upon image subresource layout query. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15993>	2022-04-21 01:29:21 +00:00
Mike Blumenkrantz	d7256043b3	zink: handle device-local unsynchronized maps this is only possible when tc determines the buffer is not in use and decides to return a pointer immediately, so just give back a staging buffer cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15979>	2022-04-21 00:48:19 +00:00
Mike Blumenkrantz	e509598470	zink: remove xfb_barrier flag this was an attempt to minimize the number of xfb barriers being emitted, but really xfb barriers need to always be emitted in order for xfb to work cc: mesa-stable fixes (nv): KHR-GL46.texture_view.reference_counting KHR-GL46.transform_feedback_overflow_query_ARB.multiple-streams-multiple-buffers-per-stream KHR-GL46.transform_feedback_overflow_query_ARB.multiple-streams-one-buffer-per-stream Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16065>	2022-04-21 00:27:50 +00:00
Mike Blumenkrantz	fc5edf9b68	zink: fix xfb counter buffer barriers a read barrier is needed for resume, yes, but the counter buffer is always being written to, so write access must always be set cc: mesa-stable fixes (nv): KHR-GL46.transform_feedback.draw_xfb_test Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16065>	2022-04-21 00:27:50 +00:00
Mike Blumenkrantz	a056cbc691	zink: fix synchronization when drawing from streamout this was well-documented, but ultimately wrong: the synchronization being used was for binding streamout buffers (not counter buffers) as vertex buffers, which was already handled just fine in the normal vertex buffer binding drawing from streamout ONLY uses the counter buffer, which means the counter buffer needs to be synchronized for reading cc: mesa-stable fixes (nv): KHR-GL46.transform_feedback.draw_xfb_feedbackk_test KHR-GL46.transform_feedback.draw_xfb_instanced_test KHR-GL46.transform_feedback.draw_xfb_stream_instanced_test Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16065>	2022-04-21 00:27:50 +00:00
Mike Blumenkrantz	dd783d7144	zink: nv ci update cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16065>	2022-04-21 00:27:50 +00:00
Mike Blumenkrantz	7af76d1aae	zink: NV_linear_color_attachment this fixes staging blits on nvidia Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16016>	2022-04-21 00:14:48 +00:00
Mike Blumenkrantz	373c8001d6	zink: set VK_QUERY_RESULT_WAIT_BIT when copying to qbo according to spec, this ensures that drivers will accurately return results relative to when the query was ended cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16049>	2022-04-21 00:04:27 +00:00
Emma Anholt	02370e22f7	nir_to_tgsi: Make vec_to_movs avoid unsupported coalescing for 64-bit. I had some workarounds in ALU op emits trying to fix up when we were asked to store to unsupported channels when the ALU op had 64bit srcs (so only vec2 supported) but a 32-bit dest with a >vec2 writemask. Those workarounds had some bugs breaking 64-bit uniform initializer tests on virgl, and also set up too wide of a writemask such that they triggered assertion failures on nvc0. We can avoid the need for those workarounds at emit time by just having nir_lower_vec_to_movs not generate unsupported writemasks in the first place. Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15934>	2022-04-20 23:21:06 +00:00
Boris Brezillon	b8fd1e8844	dzn: Report actual device limits Report actual device limits instead of pseudo-random numbers. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15911>	2022-04-20 22:47:29 +00:00
Boris Brezillon	6c877cb00f	dzn: Use core helpers to fill physical device features/properties The core provides generic helpers to turn Vulkan minor version features/properties into their KHR counterparts. Let's declare those core features/properties structs and use those helpers so we get ready to support newer spec versions without too much pain. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15911>	2022-04-20 22:47:29 +00:00
Marek Olšák	69e3f35435	gallium/ddebug: implement pipe_vertex_state callbacks Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15964>	2022-04-20 22:15:43 +00:00
Emma Anholt	f29706a25f	nouveau/nir: Set the input for vertex/instance ID like TGSI does. Doesn't seem to help tests, but clears a TODO about differences between them. Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16063>	2022-04-20 21:58:33 +00:00
Emma Anholt	801dca3c40	nouveau/nir: Fix edgeflag input detection. VERT_ATTRIB_EDGEFLAG is above GENERIC0, so it was being offset and thus not recognized by vert_attrib_to_tgsi_semantic(). Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16063>	2022-04-20 21:58:33 +00:00
Emma Anholt	af718674ab	nouveau/nir: Fix the inverted sense of usesSampleMaskIn. Fixes: `9f3d5e99ea` ("compiler: Use util/bitset.h for system_values_read") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16063>	2022-04-20 21:58:33 +00:00
Emma Anholt	d9b6b2acd7	nouveau/nv50: Set the primid sysval flag if it's in the sysval list, too. It's declared an input in TGSI, even though it's an SV in the backend. In NIR, it shows up as an SV, so it's in this list. Fixes NIR regressions in primitive-id-in and primitive-id-restart. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16063>	2022-04-20 21:58:33 +00:00
Emma Anholt	814b0edae5	nouveau/nv50: Enable mesa/st alpha test lowering on nv50 with NIR. With TGSI, the driver allocates space for the alpha ref as a uniform and adds a conditional discard to the shader. We could either replicate that with NIR, or just set the flag saying we need the shader lowering and get the same thing. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16063>	2022-04-20 21:58:33 +00:00
Emma Anholt	6040107dc1	nouveau/nir: Disable bitfield ops pre-nvc0. There are no hardware instructions for them until then. These chips don't expose the extension providing the GLSL builtins for operations like bfrev, but NIR can recognize the construct and optimize it to bitfield_reverse, which pre-nvc0 would then fail to codegen. Prevents a regression when moving to nir-to-tgsi. Other lower_bitfield flags are set as well for when someone comes along and adds optimizations for them too. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16063>	2022-04-20 21:58:33 +00:00
Emma Anholt	adb6d7fe9a	ci/nouveau: Add nv92 xfails. Acked-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16063>	2022-04-20 21:58:33 +00:00
Emma Anholt	ea5873f787	ci/nouveau: Add expectations files for GM206. I'm using this in place of jetson for regression testing NTT on nvc0+. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16063>	2022-04-20 21:58:33 +00:00
M Henning	c0c198ffc1	nouveau/nir: Split fewer 64-bit loads Also adjust the lowering pass to handle wide SSBO loads that we now emit for the nir case. This improves generated code quality since memoryopt can't merge SSBO loads that end up predicated on a bounds check. This also happens to fix a few test cases, only because the simpler generated IR is less likely to trigger other compiler bugs. Eg on kepler with NV50_PROG_USE_NIR=1, this fixes arb_gpu_shader_fp64-fs-non-uniform-control-flow-ubo Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16063>	2022-04-20 21:58:33 +00:00
Emma Anholt	1b32d4b7d4	nouveau/nv50: Print the number of loops in shader-db output. This is important so you don't go comparing the number of instructions emitted when you unrolled loops differently. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16063>	2022-04-20 21:58:33 +00:00
Emma Anholt	a4840e15ab	r600: Use nir-to-tgsi instead of TGSI when the NIR debug opt is disabled. For !8044 I'm working on getting all drivers to accept NIR. The NIR compiler in the driver is apparently not quite ready, so use NIR-to-TGSI instead. This is a net win in testcases working on my RV770 and Turks cards (especially in some important piglit tests involving YUV dma-buf decode), though it's not regression-free. shader-db (R600): total dw in shared programs: 8553412 -> 8358918 (-2.27%) dw in affected programs: 7476702 -> 7282208 (-2.60%) total gprs in shared programs: 217286 -> 213217 (-1.87%) gprs in affected programs: 72747 -> 68678 (-5.59%) total loops in shared programs: 398 -> 330 (-17.09%) loops in affected programs: 68 -> 0 total cf in shared programs: 558835 -> 332768 (-40.45%) cf in affected programs: 420475 -> 194408 (-53.76%) shader-db (Turks): total dw in shared programs: 14104598 -> 13556782 (-3.88%) dw in affected programs: 12161972 -> 11614156 (-4.50%) total gprs in shared programs: 321068 -> 313690 (-2.30%) gprs in affected programs: 114899 -> 107521 (-6.42%) total loops in shared programs: 736 -> 651 (-11.55%) loops in affected programs: 111 -> 26 (-76.58%) total cf in shared programs: 925771 -> 581226 (-37.22%) cf in affected programs: 678600 -> 334055 (-50.77%) total stack in shared programs: 27853 -> 27855 (<.01%) stack in affected programs: 5 -> 7 (40.00%) glmark2 terrain: 0.137649% +/- 0.0511938% (n=6) glmark2 jellyfish: no change (n=8) unigine valley (extreme) 5.36 -> 5.45 (n=1 it takes so long to run) unigine heaven (basic) 16.13 -> 16.15 (n=1) Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14319>	2022-04-20 21:46:09 +00:00
Emma Anholt	0879c15666	r600/sb: Avoid causing an exception when getting the reciprocal of 0u. I'm not sure what the hardware would return in this circumstance, so just don't fold it. Avoids regressions on transition to NIR. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14319>	2022-04-20 21:46:09 +00:00
Emma Anholt	25836895f3	r600: Fix reading back from a temp array immediately after writing on RV770. KHR-GL33.shaders.indexing.tmp_array.vertexid regressed with the switch to NIR-to-TGSI because the shader got optimized enough to emit a read just after writing to the array (the kind of situation where a non-rel write would have been followed by a PV/PS read). The R600 and EG docs say you always need to do this, but apparently some hardware gives you the right answer anyway so we don't flag it on all of them. Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14319>	2022-04-20 21:46:09 +00:00
Emma Anholt	26189cdb1d	ci/r600: Manual run updates. Various fixes have happened, update status. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14319>	2022-04-20 21:46:09 +00:00
Emma Anholt	04a6d7b380	r600: Fix up some mis-indentation of blocks. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14319>	2022-04-20 21:46:09 +00:00
Rhys Perry	dab745f3b4	nir/copy_prop_vars: fix non-vector shader call payloads Fixes RADV+Q2RTX. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Fixes: `ff05137c2d` ("nir: introduce and use nir_component_mask") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16029>	2022-04-20 21:03:03 +00:00
Jason Ekstrand	1b8a43a0ba	util: Remove util_cpu_detect util_cpu_detect is an anti-pattern: it relies on callers high up in the call chain initializing a local implementation detail. As a real example, I added: ...a Mali compiler unit test ...that called bi_imm_f16() to construct an FP16 immediate ...that calls _mesa_float_to_half internally ...that calls util_get_cpu_caps internally, but only on x86_64! ...that relies on util_cpu_detect having been called before. As a consequence, this unit test: ...crashes on x86_64 with USE_X86_64_ASM set ...passes on every other architecture ...works on my local arm64 workstation and on my test board ...failed CI which runs on x86_64 ...needed to have a random util_cpu_detect() call sprinkled in. This is a bad design decision. It pollutes the tree with magic, it causes mysterious CI failures especially for non-x86_64 developers, and it is not justified by a micro-optimization. Instead, let's call util_cpu_detect directly from util_get_cpu_caps, avoiding the footgun where it fails to be called. This cleans up Mesa's design, simplifies the tree, and avoids a class of (possibly platform-specific) failures. To mitigate the added overhead, wrap it all in a (fast) atomic load check and declare the whole thing as ATTRIBUTE_CONST so the compiler will CSE calls to util_cpu_detect. Co-authored-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15580>	2022-04-20 18:44:35 +00:00
Daniel Schürmann	90a0675989	nir/lower_alu_to_scalar: don't set the nir_builder cursor This ensures recursive lowering in a single pass. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15977>	2022-04-20 17:53:48 +00:00
Emma Anholt	7f01299c40	nine: Disable optional use of TTN when MUL_ZERO_WINS is available. NIR doesn't have that knob currently, so we end up throwing errors about it being ignored. This should fix cases of "tgsi_to_nir: unhandled TGSI property 23 = 1", and presumably do better at DX9 muls on nv50 and r600. Acked-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14883>	2022-04-20 13:47:50 +00:00
Emma Anholt	09fd1e94fd	tgsi_to_nir: Emit load_ubo_vec4 instead of load_ubo on non-integer HW. Otherwise, we get an ishl that the HW can't support, and a ushr if the NIR ends up being lowered to ubo_vec4, which may not get constant-folded if the offset was non-constant. This matches what mesa/st uses for this arg to uniform lowering. Fixes: #5971 Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14883>	2022-04-20 13:47:50 +00:00
Gert Wollny	535f0b9391	ntt: Add option to not optimize register allocation On virglrenderer it is of interest to not re-use temporaries when we want to handle precise, invariant, and highp/mediump with better possibility for optimization. v2: Force optimized RA if the number of registers is too large (Emma: only 16 bit signed int are reserved for register indices) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16051>	2022-04-20 13:05:57 +00:00
Mike Blumenkrantz	b043d4c4c6	lavapipe: run nir_fold_16bit_sampler_conversions big cleanup for all shaders coming from zink Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15852>	2022-04-20 12:12:36 +00:00
Mike Blumenkrantz	27a43b531b	nir/fold_16bit_sampler_conversions: add a mask for supported sampler dims AMD might not support cubes, but that doesn't mean cubes can't be used on other drivers Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15852>	2022-04-20 12:12:36 +00:00
Konstantin Seurer	324b2ae5f2	radv: Enable rt primitive culling for spirv2nir Fixes: `c8fe408fcc` ("radv: Advertise ray primitive culling") Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16028>	2022-04-20 11:38:52 +00:00
Konstantin Seurer	b3896fa8c7	radv: Do not discard hits with t=tmax Fixes dEQP-VK.ray_tracing_pipeline.inside_aabbs.chit.ray_end_tmax_zero.* Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16034>	2022-04-20 10:46:29 +00:00
Lionel Landwerlin	a468f26ca5	anv: implement VK_EXT_primitives_generated_query Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15638>	2022-04-20 10:37:24 +03:00
Emma Anholt	30daa7d6d8	tgsi: Emit ureg HW_ATOMIC decls in range order. It turns out r600 has a dependency on it. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16043>	2022-04-19 20:05:41 +00:00
Emma Anholt	73e1a54623	nir_to_tgsi: Allocate the primid sysval to num_inputs, not num_outputs. r600 would end up looking for it past the end of its array of inputs (which expected 1:1 ordering from declarations to driver locations). Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16043>	2022-04-19 20:05:41 +00:00
Emma Anholt	fc96397256	nir_to_tgsi: Avoid swizzling from undefined channels in load_output. virglrenderer emits GLSL referencing all the swizzles, even if the write mask doesn't contain them. This is a problem when the output is TessLevelInner, which has only 2 elements. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16043>	2022-04-19 20:05:41 +00:00
Emma Anholt	bac7ec1a89	nir_to_tgsi: Don't forget to split 64-bit store_per_vertex_output. Same splitting method as store_output. Fixes regressions in virgl with nir-to-tgsi. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16043>	2022-04-19 20:05:41 +00:00
Emma Anholt	21282879f9	nir_to_tgsi: Fix assertion failures handling 64-bit vec3/vec4 ssa undefs. Found in virgl, where a glslparsertest accidentally gets its inputs lowered to undefs, and 64-bit undefs don't get split by the normal alu/intrinsic splitter (and would be hard to split because other passes would see reconstruction of the vec4 from undefs and turn it back into vec3/vec4 undef). Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16043>	2022-04-19 20:05:41 +00:00
Emma Anholt	4850dbb3f9	nir_to_tgsi: Add a workaround for virglrenderer TG4. I've tried to keep virglrenderer workarounds out of ntt, but this one would be bothersome to do with tgsi_translate and TG4 is pretty low-stakes for NTT consumers. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16043>	2022-04-19 20:05:41 +00:00
Yonggang Luo	a3a43e5fa8	win32: Do not use BUILD_GL32, we use def file to export win32 dll symbols. Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14041>	2022-04-19 19:38:47 +00:00
Yonggang Luo	4ead2f6579	win32: Fix 32-bit Visual Studio module definition files by adding script gen_vs_module_defs.py Make opengl32.def consistent with the Windows SDK. Make osmesa.mingw.def's gl functions consistent with the Windows SDK. stw_* functions are cdecl, not stdcall, so there is no need to mangle the symbol. Fixes egl.def for x86 d3d10sw: Move d3d10_sw.def to d3d10_sw.def.in Fixes vulkan_lvp.def for x86 Fixes #5552 Remove stdcall-fixup Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14041>	2022-04-19 19:38:47 +00:00
Emma Anholt	550975f229	turnip: Don't disable LRZ in subpasses after the first in the easy case. If it's the same depth/stencil attachment, then there's no need to turn off LRZ just because the subpass changed. Doesn't help gfxbench perf yet, but will with !16014. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15982>	2022-04-19 18:45:30 +00:00
Emma Anholt	7ba63f516a	turnip: Ignore TOP/BOTTOM_OF_PIPE bits in subpass src/dst dep flags. gfxbench sets these between the gbuffer subpass and the following ones. They should be no-ops as subpass dependencies. gfxbench vk-5-debug perf 12.8 -> 14.6 fps thanks to getting gmem on the gbuffer rendering. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15982>	2022-04-19 18:45:30 +00:00
Emma Anholt	1bcd848816	freedreno/ir3: Call nir_opt_find_array_copies(). gfxbench vk-5-normal has a shader that samples into a texels[] array at the top, then in a loop calls a GLSL function passing texels[] in by value. This resulted in a copy to a temp inside the loop, which got lowered to scratch stores since it was pretty big. By doing find_array_copies(), we notice that it's equivalent to copy_deref, then get to copy-propagate from the array at the top. Then we only have to set up the scratch array outside of the loop and load_scratch from it in the called function inside the loop. This also causes there to be less spilling, stps 1144 -> 354 and ldps 826->36. However, it doesn't seem to change performance on the test. So, while this seems to be an improvement for the shader, and we could maybe even do better by rematerializing the txl samples inside the loop instead of storing the texture fetches to scratch in the first place, it doesn't currently seem worth pursuing more optimization of this shader. No change on freedreno shader-db. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15982>	2022-04-19 18:45:29 +00:00
Emma Anholt	7ba0c44607	turnip: Add nir_opt_conditional_discard. We can easily do discard_if in the backend without control flow, but it wasn't done in ir3 because the GL frontend already did it for us. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15982>	2022-04-19 18:45:29 +00:00
Emma Anholt	d60282f5d2	freedreno/ir3: Make sched nodes before adding deps. The mark_kill_path() during dep setup follows SSA srcs, which when a phi is involved may include a def from later in the same block, that we hadn't created yet. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15982>	2022-04-19 18:45:29 +00:00
Emma Anholt	ce15bf19fb	turnip: Add TU_DEBUG=layout for dumping image layouts. This was useful for comparing image allocations between gfxbench gl_5_normal and vk_5_normal to see if rendering was generally equivalent (formats, MSAA, UBWC choices, and notably gfxbench vk was choosing DXT5 instead of ASTC on non-android builds!) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15982>	2022-04-19 18:45:29 +00:00
Danylo Piliaiev	2c683519e2	turnip: Try harder to keep LRZ valid and fix a few edge cases Refactored tu6_calculate_lrz_state and added comments. 1) If there is no depth write we could keep LRZ valid with any compare op, we just have to temporarily disable LRZ for incompatible ops in such case. 2) Found that VK_COMPARE_OP_EQUAL is not compatible with LRZ, and since it doesn't change LRZ buffer - LRZ could be just temporarily disabled. This fixes rendering of grass/trees in PUBG mobile on angle. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6127 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16014>	2022-04-19 18:06:58 +00:00
M Henning	8313a9231c	nouveau: Skip cctl for atomic counters in tgsi The tgsi path already marked all aliasing loads of atomic counters with CACHE_CG, so we don't need to emit a cctl. This patch uses the cache flag on the atomic to model whether the L1 cache needs the stale values to be flushed or not. Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14386>	2022-04-19 16:33:36 +00:00
M Henning	850197b3e0	nouveau: Emit cctl to flush L1 cache for atomics We were previously only emitting these for CAS, but all of the atomics seem to need it. Fixes spec@glsl-es-3.10@execution@fs-simple-atomic-counter-inc-dec-read on kepler with NV50_PROG_USE_NIR=1 Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14386>	2022-04-19 16:33:36 +00:00
Boris Brezillon	9eace7f2e4	dzn: refactor error-handling Here's a couple of cleanups to the error-handling code, now that we're no longer using ComPtr<T>. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15816>	2022-04-19 15:39:48 +00:00
Erik Faye-Lund	cfdaf1af9b	dzn: remove needless defines Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15816>	2022-04-19 15:39:48 +00:00
Erik Faye-Lund	2ca4e21df7	dzn: merge util sources There, no more C and C++ sources of the same base-name. We can do both in one source. This is our last C++ source file, so let's also clean away the C++20 mess in meson.build. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15816>	2022-04-19 15:39:48 +00:00
Erik Faye-Lund	0551f8ed62	dzn: port code to plain c This does quite a lot in one go, simply because C and C++ are too different to cleanly move from one language to another. But hopefully this won't create too many rebase-issues. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15816>	2022-04-19 15:39:48 +00:00
Erik Faye-Lund	b369e10d08	dzn: do not set unused default member initializer These objects aren't allocated using C++ constructors, so these default member initializers do nothing. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15816>	2022-04-19 15:39:48 +00:00
Erik Faye-Lund	c5e979f632	dzn: c-style casts Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15816>	2022-04-19 15:39:48 +00:00
Erik Faye-Lund	3d608de882	dzn: use c-style initialization Here's a few cases where we can use C-style initialization up-front, which reduces the diffs later on. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15816>	2022-04-19 15:39:48 +00:00
Erik Faye-Lund	05af6f0434	dzn: use c-style for-statement Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15816>	2022-04-19 15:39:48 +00:00
Erik Faye-Lund	502c36c07d	dzn: use define instead of constexpr Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15816>	2022-04-19 15:39:48 +00:00
Erik Faye-Lund	5a9571ee2c	dzn: no more reinterpret_cast Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15816>	2022-04-19 15:39:48 +00:00
Erik Faye-Lund	79119ac478	dzn: drop using references Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15816>	2022-04-19 15:39:48 +00:00
Erik Faye-Lund	bd8e8537cc	dzn: drop auto usage The auto keyword isn't available in C, so let's drop it and just use explicit types instead. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15816>	2022-04-19 15:39:48 +00:00
Erik Faye-Lund	d61c2e965b	dzn: add a bunch of missing struct-keywords If we're going to have any chance of porting this code to C, we're going to have to be better at spelling out structs. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15816>	2022-04-19 15:39:48 +00:00
Erik Faye-Lund	4903a7c051	dzn: port to d3d12 c-api Using the vulkan-helpers from C++ code has turned out to have a lot of friction, because no other driver uses C++ for this. So let's bite the bullet and call the D3D12 C-API instead. The C-API wasn't really around when we started out, but it's there now. This is still far from ideal; we should really create some wrapping macros to generate the extremely verbose COM calls. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15816>	2022-04-19 15:39:48 +00:00
Erik Faye-Lund	4753222e62	dzn: pass IDXGIAdapter1 to d3d12_create_device The D3D12 C API doesn't know about the relationship between IDXGIAdapter1 and IUnknown. And there's no good reason to care about it here either. So let's just pass the right type all the way. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15816>	2022-04-19 15:39:48 +00:00
Erik Faye-Lund	3ba021cdd0	dzn: use ID3D10Blob instead of ID3DBlob In the C interface, there's no such alias. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15816>	2022-04-19 15:39:48 +00:00
Erik Faye-Lund	8c6f50efdb	dzn: always use ID3D12GraphicsCommandList1 In the C-interface, ID3D12GraphicsCommandList1 and ID3D12GraphicsCommandList are unrelated types. So let's make sure we consistently use the most up-to-date version. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15816>	2022-04-19 15:39:48 +00:00
Erik Faye-Lund	411dfc574c	dzn: always use ID3D12Device1 In the C-interface, ID3D12Device1 and ID3D12Device are unrelated types. So let's make sure we consistently use the most up-to-date version. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15816>	2022-04-19 15:39:48 +00:00
Erik Faye-Lund	5f17d070a9	dzn: remove all usage of ComPtr<T> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15816>	2022-04-19 15:39:48 +00:00
Erik Faye-Lund	74228c32ee	dzn: fixup indent Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15816>	2022-04-19 15:39:48 +00:00
Georg Lehmann	d12b5e7633	aco: Reuse previous -1 result in find_msb to avoid using VOP3. Totals: CodeSize: 388934388 -> 388933712 (-0.00%) Totals from 208 (0.15% of 134913) affected shaders: CodeSize: 2008016 -> 2007340 (-0.03%) Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16011>	2022-04-19 15:18:58 +00:00
Yonggang Luo	ebb099a9b0	zink: Remove redundant framebuffer_mtx from zink_screen.h Fixes: `beb71504f4` ("zink: remove the worst part of basic framebuffer support") Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16025>	2022-04-19 15:02:33 +00:00
Lionel Landwerlin	2ab57e056d	ci/iris: mark another test as flaky Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16032>	2022-04-19 14:27:26 +00:00
Lionel Landwerlin	8ef8e72aac	intel/fs: tidy up lower of ray queries We already expect a single function. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15946>	2022-04-19 12:56:06 +00:00
Boris Brezillon	9fd02d49b8	dzn: Pass the right type to CreateCommandList() in the reset path The Command allocator and command list type must match, but we are forcing it to D3D12_COMMAND_LIST_TYPE_DIRECT in the reset path. Fixes: `a012b21964` ("microsoft: Initial vulkan-on-12 driver") Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16023>	2022-04-19 12:07:38 +00:00
Marcin Ślusarz	5dace41c10	intel/compiler: invalidate metadata in brw_nir_initialize_mue New "if" blocks may have been inserted. Fixes: `bc4f8c073a` ("intel/compiler: inject MUE initialization") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15924>	2022-04-19 11:43:55 +00:00
Marcin Ślusarz	4fddef33d5	intel/compiler: invalidate all metadata in brw_nir_lower_intersection_shader New "if" blocks were inserted. Fixes: `303378e1dd` ("intel/rt: Add lowering for combined intersection/any-hit shaders") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15924>	2022-04-19 11:43:55 +00:00
Marcin Ślusarz	5bd3ba5b67	anv: invalidate all metadata in anv_nir_lower_ubo_loads lower_ubo_load_instr may insert "if" blocks. Fixes: `61749b5a15` ("anv: Add a pass for lowering A64 UBO access") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15924>	2022-04-19 11:43:55 +00:00
Lionel Landwerlin	184084e21c	anv: allow getting the address of the beginning of the batch There is no reason not to be able to get it. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `34a0ce58c7` ("anv: add a new execution mode for secondary command buffers") Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15968>	2022-04-19 10:43:29 +00:00
Alexey Bozhenko	2d7d907ad1	intel/compiler: fix singleton pointer coverity warning fix brw_kernel::stats member that was declared as a variable but used as a pointer to array of 3 elements CID: 1503279 Signed-off-by: Bozhenko Alexey <oleksii.bozhenko@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15975>	2022-04-19 12:36:10 +03:00
Karmjit Mahil	4c6bec2c0c	pvr: Fix clang-format errors caused by vk outarrays. Signed-off-by: Karmjit Mahil <Karmjit.Mahil@imgtec.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15944>	2022-04-19 09:13:07 +00:00
Boris Brezillon	3e97d37c63	dzn: Add support for sampleRateShading Forward the sample-rate shading info to spirv_to_dxil() so we can claim to support sampleRateShading. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15916>	2022-04-19 08:49:50 +00:00
Boris Brezillon	80a5deee62	microsoft/spirv_to_dxil: Allow forcing per-sample shading Needed to support VkPipelineMultisampleStateCreateInfo::sampleShadingEnable. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15916>	2022-04-19 08:49:50 +00:00
Boris Brezillon	cacc3f03e6	microsoft/compiler: Add a dummy SV_SampleIndex when needed When per-sample shading is forced and all input variables have a flat interpolation, DXIL validation detects a mismatch between the SampleFrequency property and the fact that no variables are per-sample and SV_SampleIndex is never read. When that happens, add a dummy SV_SampleIndex. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15916>	2022-04-19 08:49:50 +00:00
Juan A. Suarez Romero	04fb31a420	v3d: enable GL_ARB_copy_image extension Enable the proper capability to activate this extension. Fixes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4588 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15693>	2022-04-19 08:03:42 +00:00
Juan A. Suarez Romero	e40cbd3438	v3d: define our own canonical supported formats Some of the canonical formats defined by Gallium are not TLB compatible, so we need to provide an alternative. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15693>	2022-04-19 08:03:42 +00:00
Juan A. Suarez Romero	606e42027e	gallium: add hook on getting canonical format On swizzled copies canonical formats are used to reduce the formats to a simpler subset. Nevertheless, it is possible that some of the canonical formats defined in Gallium are actually not supported by the drivers themselves. This provides a driver-defined hook that can be used to provide an alternative canonical format in case the canonical one defined by Gallium is not supported by the driver. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15693>	2022-04-19 08:03:42 +00:00
Juan A. Suarez Romero	21bfbc74ee	v3d: use surface format defined on pipe_blit When trying to perform a TLB-based blit, we need to create a surface out of the src and dst resources. But instead of using the same formats as the resources, we need to use the format that is passed through pipe_blit_info. This was causing some cases to use the render-based blit instead of the TLB-based blit, which is more performant. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15693>	2022-04-19 08:03:42 +00:00
Juan A. Suarez Romero	e6bcb8ad15	v3d: do not tile 1D textures Hardware already supports 1D untiled textures, so no need to convert them to tiled for render-based blit. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15693>	2022-04-19 08:03:42 +00:00
Juan A. Suarez Romero	18f8e3e7bd	v3d: report the correct unsupported blit format We were reporting the resource format instead of the surface format for unsupported render blit formats. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15693>	2022-04-19 08:03:42 +00:00
Lionel Landwerlin	3684012770	anv: implement DEBUG_SYNC Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15950>	2022-04-19 07:32:01 +00:00
Lionel Landwerlin	317512e038	anv/intel: add a new debug flag for stalling after every draw/dispatch Useful for hang debugging. Previously Anv incorrectly used DEBUG_SYNC for this. v2: Update documentations for sync/stall (Jordan) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15950>	2022-04-19 07:32:01 +00:00
Lionel Landwerlin	a1969fa777	anv: improve INTEL_DEBUG for submit Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15950>	2022-04-19 07:32:01 +00:00
Erik Faye-Lund	ff05137c2d	nir: introduce and use nir_component_mask The BITFIELD_MASK() macro is intended for using with actual bitfields, not with nir_component_mask_t. This means we do some extra work to handle values that are invalid for nir_component_mask_t in the first place. This eliminates some warnings on Clang, where the compiler complains about casting UINT32_MAX to UINT16_MAX. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15547>	2022-04-19 06:54:47 +00:00
Erik Faye-Lund	b27a2ba4fc	vulkan: explicitly cast object-type enum VkObjectType and VkDebugReportObjectTypeEXT have the same enum-values. Why the Vulkan WG thought this was a good idea beats me. But it's what we have to live with now. Anyway, instead of having a statement that implicitly casts two different values from the former to the latter, let's fully resolve the type as the former, and cast the value when using it instead. Fixes: `41318a5819` ("vulkan: Use vk_object_base::type for debug_report") Acked-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15547>	2022-04-19 06:54:47 +00:00
Samuel Pitoiset	90db834603	radv: do not support UNIFORM_TEXEL_BUFFER with SRGB Looks like it can't be supported. Also disabled by PRO/AMDVLK. Fixes new CTS dEQP-VK.texture.texel_buffer.uniform.srgb.*. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16010>	2022-04-19 06:35:50 +00:00
Samuel Pitoiset	443034c1ec	radv: initialize the vertex input interface state in only one place Instead of copying states from these structures at many different places, do it only once. Will help VK_EXT_graphics_pipeline_library. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15967>	2022-04-19 06:15:52 +00:00
Samuel Pitoiset	ea6eaa4c19	radv: use the hardware primitive topology everywhere Instead of mixing the VK type vs HW type everywhere. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15967>	2022-04-19 06:15:52 +00:00
Samuel Pitoiset	984b6c037c	radv: mark all active stages earlier in the pipeline creation path Few pCreateInfo structs have to be ignored based on the active stages and this will be used to make a union of stages from graphics libraries. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15967>	2022-04-19 06:15:52 +00:00
Mike Blumenkrantz	1eada1b02d	zink: selectively disable dynamic vertex stride if the vertex state doesn't meet the requirements to use this feature, fall back to fully-baked pipelines instead of violating spec Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16018>	2022-04-19 03:44:59 +00:00
Mike Blumenkrantz	d46774f8e6	zink: store min required stride values on the vertex state this will be useful shortly Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16018>	2022-04-19 03:44:59 +00:00
Mike Blumenkrantz	75e4a861cb	zink: always bind gfx pipeline at the top of draw at one point I thought it'd be cool to try and async compile a pipeline between shader bind and draw emit, but this is an unrealistic pipe dream that just makes things more complicated Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16018>	2022-04-19 03:44:59 +00:00
Mike Blumenkrantz	3d97367a60	zink: rework zink_kopper_update() assert the dt might have been killed, so just assert that it's a display target fixes #6317 Fixes: `8ade5588e3` ("zink: add kopper api") Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16017>	2022-04-19 00:59:29 +00:00
Mike Blumenkrantz	9ecdc2e985	zink: make a kopper debug print into an error Fixes: `8ade5588e3` ("zink: add kopper api") Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16017>	2022-04-19 00:59:29 +00:00
Mike Blumenkrantz	452a2fb995	zink: remove ZINK_NO_TIMELINES Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15904>	2022-04-18 23:45:30 +00:00
Mike Blumenkrantz	beb71504f4	zink: remove the worst part of basic framebuffer support this was one of the most complex interactions in zink, and now it's finally gone thanks to @jekstrand for licensing his patented Delete The Code methodology for this project Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15904>	2022-04-18 23:45:30 +00:00
Mike Blumenkrantz	00f2517391	zink: rename imageless framebuffer functions Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15904>	2022-04-18 23:45:30 +00:00
Mike Blumenkrantz	623de06056	zink: remove framebuffer indirection Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15904>	2022-04-18 23:45:30 +00:00
Mike Blumenkrantz	fe8212791f	zink: delete all non-imageless framebuffer code hooray it's finally gone Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15904>	2022-04-18 23:45:30 +00:00
Mike Blumenkrantz	0067641d3c	zink: require KHR_imageless_framebuffer this allows for deleting tons of code Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15904>	2022-04-18 23:45:30 +00:00
Mike Blumenkrantz	8c539328fd	zink: require renderpass2 drivers should be able to support this, and it allows for deleting a lot of untested code Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15904>	2022-04-18 23:45:30 +00:00
Mike Blumenkrantz	d461b1f722	zink: only use VK_DEPENDENCY_BY_REGION_BIT if sync2 is available this breaks texture barriers since non-sync2 barriers don't have this available Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15904>	2022-04-18 23:45:30 +00:00
Mike Blumenkrantz	4f1ecbd7b7	zink: hook up VK_KHR_create_renderpass2 Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15904>	2022-04-18 23:45:30 +00:00
Mike Blumenkrantz	b0cbe3d419	zink: remove driver-based max_fences throttling there are no more fence objects, so there's no need to do driver-specific clamping on them the mechanism remains intact to handle ETOOMANYSUBMITS Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15904>	2022-04-18 23:45:30 +00:00
Mike Blumenkrantz	21fb0a3473	zink: rename zink_query::batch_id this conflicts with zink_fence::batch_id and is confusing in grep Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15904>	2022-04-18 23:45:30 +00:00
Mike Blumenkrantz	b5d7f61e0c	zink: remove batch lock this is no longer needed and allows deleting some awful code Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15904>	2022-04-18 23:45:30 +00:00
Mike Blumenkrantz	bc2e29accd	zink: require timeline semaphores this allows the removal of tons of awful code Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15904>	2022-04-18 23:45:30 +00:00
Mike Blumenkrantz	07c86e99b1	zink: do not create fences at all if timeline semaphores are supported there's no point in doing this, as it's just extra objects that don't need to ever be used Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15904>	2022-04-18 23:45:30 +00:00
Mike Blumenkrantz	8806f444a5	zink: fix extended restart prim types without dynamic state2 these are all allowed with the ext cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15978>	2022-04-18 22:20:36 +00:00
Mike Blumenkrantz	cd9424d93f	zink: support restart with PIPE_PRIM_LINES_ADJACENCY if ext is available cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15978>	2022-04-18 22:20:36 +00:00
Mike Blumenkrantz	d8b66fcbf9	zink: unconditionally set line width on rasterizer state change the pipe cap is used for gating wideline support, so this will always be 1.0 when not supported furthermore, the previous code wasn't accurately checking line width for tess shaders, breaking tests cc: mesa-stable fixes (nv): KHR-GL46.tessellation_shader.tessellation_control_to_tessellation_evaluation.gl_PatchVerticesIn Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15960>	2022-04-18 22:10:07 +00:00
Mike Blumenkrantz	9409756ee3	zink: use mixed zs renderpass for depth read/write this is triggered by u_blitter when doing src==dst blits Fixes: `7781a75229` ("zink: add a renderpass flag for mixed zs layout") affects: GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality* Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15960>	2022-04-18 22:10:07 +00:00
Mike Blumenkrantz	37ac8647fc	zink: reject resource creation if format features don't match attachment if a rendertarget-specified image can't be a rendertarget or a blit dst then it can't be used for the designated functionality and must be rejected cc: mesa-stable fixes hangs on various nv driver versions: dEQP-GLES2.functional.texture.mipmap.2d.generate.rgba5551_fastest Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15960>	2022-04-18 22:10:07 +00:00
Mike Blumenkrantz	44ad45fa06	zink: add baseline for amdpro got some work to do here Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15980>	2022-04-18 21:56:46 +00:00
Mike Blumenkrantz	c7122814c5	zink: disable EXT_extended_dynamic_state2 on AMDPRO this is broken beyond space and time in 22.10-1395274 Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15980>	2022-04-18 21:56:46 +00:00
Mike Blumenkrantz	12cf9a1544	zink: remove tcs patch slot map this is illegal, and we'll just have to eat some piglit fails until indirects are handled Fixes: `f7ade1f188` ("zink: simplify shader i/o assignment") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15976>	2022-04-18 21:32:40 +00:00
Erik Faye-Lund	7ca1253932	gallium: rename ldexp shader-cap This is no longer TGSI specific, so let's rename it to reflect reality. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15922>	2022-04-18 20:43:18 +00:00
Erik Faye-Lund	439c212a3c	gallium: rename dfracexp/dldexp shader-cap This is no longer TGSI specific, so let's rename it to reflect reality. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15922>	2022-04-18 20:43:18 +00:00
Erik Faye-Lund	3efd6d4bfe	gallium: rename dround shader-cap This is no longer TGSI specific, so let's rename it to reflect reality. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15922>	2022-04-18 20:43:18 +00:00
Erik Faye-Lund	9b545ea691	gallium: rename continue shader-cap This is no longer TGSI specific, so let's rename it to reflect reality. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15922>	2022-04-18 20:43:18 +00:00
Mike Blumenkrantz	d275d6c32f	zink: clamp max shader images to 32 NO MATTER WHAT. Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16013>	2022-04-18 17:54:20 +00:00
Konstantin Seurer	b761b51451	radv: Fix ray queries with !15854 Fixes: `b62e90a` ("radv: use nir_op_imm helpers") Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16004>	2022-04-18 16:37:54 +00:00
Georg Lehmann	a8b29094c2	aco: Remove some old comments in aco_opcodes.py. s_cmovk_i32 isn't GFX8_GFX9 only and s_version doesn't need a comment to say it's GFX10+ exclusive. The encoding list is enough to provide this information, as for other GFX10+ instructions. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16006>	2022-04-18 15:59:38 +00:00
Sviatoslav Peleshko	dd7278aa10	mesa: flush bitmap caches when changing scissors or window rects state If we change the state without flushing the bitmap cache, the cache might be rendered with the new scissor, which excludes some parts that should've been rendered with the old state, and vice versa. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6233 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15881>	2022-04-18 12:39:03 +00:00
Juan A. Suarez Romero	f9e424f98d	ci/v3dv: remove fixed test `dEQP-VK.api.external.semaphore.opaque_fd.info_timeline` is already fixed. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16005>	2022-04-18 12:16:52 +00:00
Timothy Arceri	4b4bb46af4	nir: fix setting varying from uniform as flat Here we just make sure we match the interpolation type on both sides of the shader interface. Drivers like d3d12 are expecting this. Fixes: `9401990e6f` ("nir/linker: set varying from uniform as flat") Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16003>	2022-04-18 11:45:56 +00:00
illiliti	67af7e2b40	Use proper types for meson objects Fix invalid usage of meson objects which violates official meson specification and thus breaks muon, an implementation of meson written in C. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15715>	2022-04-18 13:03:08 +03:00
Samuel Pitoiset	ed7d831525	radv: fix initializing pipeline_key::topology for GFX9 and older This is used to determine the geometry shader info on GFX9, and it looks like it was broken for topologies that use adjacency. This is also used to remove PSIZ from shaders that don't need it. Found by inspection. fossils-db (Polaris10): Totals from 140 (0.10% of 135960) affected shaders: SGPRs: 10448 -> 9696 (-7.20%) VGPRs: 4376 -> 4264 (-2.56%) CodeSize: 164316 -> 161028 (-2.00%) Instrs: 26449 -> 25767 (-2.58%) Latency: 184448 -> 180468 (-2.16%) InvThroughput: 80772 -> 79092 (-2.08%) VClause: 337 -> 328 (-2.67%); split: -2.97%, +0.30% SClause: 859 -> 813 (-5.36%); split: -5.70%, +0.35% Copies: 1027 -> 790 (-23.08%) PreSGPRs: 2751 -> 2331 (-15.27%) PreVGPRs: 3887 -> 3836 (-1.31%) Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15948>	2022-04-18 06:42:39 +00:00
Timothy Arceri	3dae5442ef	glsl/st: vectorise interfaces of SSO shader programs For example the SSO program may consist of just tcs -> gs or even just a vs. In these cases we want to vectorise the externally facing shader interfaces just like we would in non SSO programs. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15941>	2022-04-18 02:34:24 +00:00
Lionel Landwerlin	04bd007757	intel/fs: require memory fence commit bit on Gfx9 Fixes a hang on Gfx9 GT1 : dEQP-VK.compute.zero_initialize_workgroup_memory.max_workgroup_memory.128 Tested-by: Mark Janes <markjanes@swizzler.org> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15596>	2022-04-17 21:24:17 +00:00
Lionel Landwerlin	b07c215c35	intel: fix URB programming for GT1s We're missing a programming restriction. Hopefully fixing dEQP-VK.spirv_assembly.instruction.graphics.float16.arithmetic_1.* on Gfx9atoms Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6216 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com>. Tested-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15596>	2022-04-17 21:24:17 +00:00
Josh Billingsley	ee9997e932	driconf: add SD Gundam G Generation Cross Rays Required to avoid blank white screen on game launch Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15998>	2022-04-17 18:36:14 +00:00
Gert Wollny	ef75752ef8	r600/sfn: Fix store_shared_r600 write masks The error was caught by the new nir_validation code. Fixes: `73ef225fc2` nir: validate write_mask for all intrinsics that have it Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15994>	2022-04-17 13:21:09 +00:00
Marek Olšák	11c462534b	gallium/winsys: move {amdgpu,radeon_drm}_public.h contents into radeon_winsys.h header file simplification Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Mihai Preda <mhpreda@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15907>	2022-04-17 01:27:34 +00:00

141258 Commits