KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
David Heidelberg	e8cf79f664	ci/freedreno: add Blender, Warzone2100, Freedoom and Unvanquished traces Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16909>	2022-06-14 11:52:45 +00:00
Connor Abbott	3cd39c2b32	tu: Enable VK_KHR_swapchain_mutable_format This is already implemented entirely in common code. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16976>	2022-06-13 14:58:32 +00:00
Danylo Piliaiev	19682028eb	tu/autotune: Prevent division by zero src/freedreno/vulkan/tu_autotune.c:509:48: runtime error: division by zero Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16967>	2022-06-10 14:09:59 +00:00
Danylo Piliaiev	4846d53b18	turnip: Fix out-of-bounds builder->shader_iova access src/freedreno/vulkan/tu_pipeline.c:1722:72: runtime error: index 5 out of bounds for type 'uint64_t [5]' Fixes: `05329d7f9a` ("tu: Implement pipeline caching with shared Vulkan cache") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16967>	2022-06-10 14:09:59 +00:00
Chia-I Wu	3933752c7f	turnip: force sample interpolations for sample shading Sample shading has similiar definitions in Vulkan and OpenGL, and they both require unique associated data. While the definition for Vulkan might change, we should stick to the current definition until the change takes place and until apps (i.e., ANGLE) are updated. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16700>	2022-06-10 12:08:32 +00:00
Jason Ekstrand	0c764afac1	turnip: Set supported_sync_types before wsi_init The WSI code is about to start querying for available semaphore handle types via GetPhysicalDeviceExternalSemaphoreProperties in wsi_init(). For drivers that use vk_sync, supported_sync_types needs to be initialized before GetPhysicalDeviceExternalSemaphoreProperties is called. Really, wsi_init() should be the very last step of physical device setup. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4037>	2022-06-10 01:33:12 +00:00
Jason Ekstrand	4c95b574d1	turnip: Use the common QueuePresent implementation Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4037>	2022-06-10 01:33:12 +00:00
Jason Ekstrand	513d73b160	turnip: Use the common AcquireNextImage implementation The only reason for the wrapper was so that we could dummy signal the semaphore and fence. Now that the WSI code always dos this for us, we can drop our wrapper. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4037>	2022-06-10 01:33:12 +00:00
Emma Anholt	c426e21ff1	turnip: Reverse the order of walking pipes or tiles on odd rows. This improves the cache locality compared to raster order. Improves gfxbench vk-5-normal perf by 3.3009% +/- 0.105934% (n=3). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16917>	2022-06-09 20:36:42 +00:00
Chia-I Wu	790fc8455f	ci/turnip: add a link to VkPhysicalDeviceMemoryProperties failure Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16939>	2022-06-09 19:55:32 +00:00
Chia-I Wu	5c17a04282	turnip: consider render pass costs in autotune To be able to sum drawcall cost and render pass cost, the units of costs are changed to bytes. With that, tu_autotune_use_bypass can make decisions by comparing the costs of sysmem rendering and gmem rendering. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16733>	2022-06-08 12:48:08 +00:00
Chia-I Wu	ce118a7002	turnip: estimate render pass costs They will be used by autotuner. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16733>	2022-06-08 12:48:08 +00:00
Chia-I Wu	fe9a2374e6	turnip: include stencil test for drawcall costs Stencil test reads from and writes to the stencil buffer. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16733>	2022-06-08 12:48:08 +00:00
Chia-I Wu	2dc10165a1	turnip: base pipeline draw costs on colorWriteMask drawcall_base_cost estimates the memory bandwidth per sample. colorWriteMask should be more accurate than fs outputs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16733>	2022-06-08 12:48:08 +00:00
Chia-I Wu	6fe7b92114	turnip: if-checks autotune debug macros This avoids bitrot while the compiler can easily optimize away those checks. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16733>	2022-06-08 12:48:08 +00:00
Emma Anholt	7af5929b54	turnip: Move tile loads back into the draw CS. Now that we don't need to know if HW binning actually will get used or not, we can just emit the tile loads into the start of the draw CS. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16826>	2022-06-07 00:00:28 +00:00
Danylo Piliaiev	ecabd3b5a9	turnip: Allow nested CP_COND_REG_EXEC This ends up being needed for moving tile loads into the draw cs. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16826>	2022-06-07 00:00:28 +00:00
Emma Anholt	a92fad45e9	turnip: Allow load/store skipping in vkCmdClearAttachments(). We have to use a 3D draw to make it possible (so it goes through the binner's visibility calcs), but hopefully the increased overhead for apps with non-skippable rendering balances against skipping in others. The real motivation is to get draw-time state out of tile load setup. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16826>	2022-06-07 00:00:28 +00:00
Emma Anholt	b8619ef343	turnip: Refactor a bit of subpass attachment processing. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16826>	2022-06-07 00:00:28 +00:00
Emma Anholt	83ae4a5ed4	turnip: Include 3d-based CmdClearAttachments() in binning visibility. It means the clear's draw can get skipped when it doesn't intersect with the tile. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16826>	2022-06-07 00:00:28 +00:00
Emma Anholt	48403628a2	turnip: Refactor a bit of repeated code for subpass setup. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16826>	2022-06-07 00:00:28 +00:00
Emma Anholt	5b119c0148	ci/turnip: Add a little forced touch-testing of XFB with no binning requested. This is just a couple of seconds of runtime. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16826>	2022-06-07 00:00:28 +00:00
Emma Anholt	046438b7a4	turnip: Use fb->binning_possible to decide on conditional tile load/stores. When !fb->binning but fb->binning_possible, we can just set the VSC per-tile visibility reg to all visible in the "whoops, we'd rather not bin but we had to anyway for XFB" case. This gets that EndRenderPass state out of tile_load_cs/store_cs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16826>	2022-06-07 00:00:28 +00:00
Emma Anholt	6c37b4ded1	turnip: Move binning decisions from FB usage time to FB creation time. This is mostly about helping me understand which choices are constant for the object as opposed to runtime decisions. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16826>	2022-06-07 00:00:28 +00:00
Emma Anholt	ceeaac340a	turnip: Refactor a bit of tu6_emit_tile_select(). Reduce redundant code, make the used SET_VISIBILITY_OVERRIDE value clearer. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16826>	2022-06-07 00:00:28 +00:00
Emma Anholt	2cad0dd03b	turnip: Don't bother creating tile_load/store_cs for sysmem rendering. They won't get called, so don't bother. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16826>	2022-06-07 00:00:28 +00:00
Timothy Arceri	57dee95b85	freedreno: switch to NIR loop unrolling Force unroll setting based on GLSL IR settings: case PIPE_SHADER_CAP_INDIRECT_INPUT_ADDR: case PIPE_SHADER_CAP_INDIRECT_OUTPUT_ADDR: case PIPE_SHADER_CAP_INDIRECT_TEMP_ADDR: case PIPE_SHADER_CAP_INDIRECT_CONST_ADDR: /* a2xx compiler doesn't handle indirect: */ return is_ir3(screen) ? 1 : 0; Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16366>	2022-06-04 16:11:49 +00:00
Timothy Arceri	34e868d882	freedreno/ir3: tidy up duplication of common nir options Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16366>	2022-06-04 16:11:49 +00:00
Erik Faye-Lund	200091aad0	freedreno: do not use variable in STATIC_ASSERT Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16670>	2022-06-03 07:14:43 +00:00
Erik Faye-Lund	08f8ccfc60	turnip: do not do STATIC_ASSERT on a variable Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16670>	2022-06-03 07:14:43 +00:00
Hyunjun Ko	8eb5178a6d	turnip: clamp to 1 for negative bottom-right of viewport. This fixes a crash of spec@!opengl 3.0@viewport-clamp in piglit(with zink). Also fixes a crash of negativeviewportheight in Sascha's examples. Closes: #6583 Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16831>	2022-06-03 02:40:08 +00:00
Connor Abbott	bd68559d36	tu: Enable shaderTessellationAndGeometryPointSize This already works and AFAICT there was no reason to disable it other than that the blob does. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16830>	2022-06-02 16:32:57 +00:00
Danylo Piliaiev	eba8671e40	tu: Fix D32S8 resolves into D32 or S8 in sysmem mode Fixes CTS tests in sysmem mode: dEQP-VK.renderpass2.depth_stencil_resolve.image_2d_32_32.samples_2.d24_unorm_s8_uint.compatibility_depth_zero_stencil_zero_testing_depth dEQP-VK.renderpass2.depth_stencil_resolve.image_2d_32_32.samples_2.d24_unorm_s8_uint_separate_layouts.compatibility_depth_zero_stencil_zero_testing_depth dEQP-VK.renderpass2.depth_stencil_resolve.image_2d_32_32.samples_2.d24_unorm_s8_uint_separate_layouts.compatibility_depth_zero_stencil_zero_testing_stencil dEQP-VK.renderpass2.depth_stencil_resolve.image_2d_32_32.samples_2.d32_sfloat_s8_uint.compatibility_depth_zero_stencil_zero_testing_depth dEQP-VK.renderpass2.depth_stencil_resolve.image_2d_32_32.samples_2.d32_sfloat_s8_uint_separate_layouts.compatibility_depth_zero_stencil_zero_testing_depth dEQP-VK.renderpass2.depth_stencil_resolve.image_2d_32_32.samples_2.d32_sfloat_s8_uint_separate_layouts.compatibility_depth_zero_stencil_zero_testing_stencil Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16834>	2022-06-02 16:01:46 +00:00
Erik Faye-Lund	9bcd538643	freedreno: remove stale makefile We don't use this makefile, so there's no point in keeping it in the repo. It's also a generated one, so it contains a lot of... mess. Fixes: `536f43cb96` ("freedreno: slurp in afuc") Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16790>	2022-06-02 09:13:23 +00:00
Emma Anholt	49dc60efa1	freedreno/ir3: Fold 16-bit conversions into image load/store src/dsts. Shaves 5 instructions off of one manhattan31 shader. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16616>	2022-06-01 22:19:44 +00:00
Emma Anholt	f005a2d786	freedreno/ir3: Fix validation of half-precision image store values. `ce1a381e57` ("turnip: enable VK_KHR_16bit_storage on A650") determined that the type of the instr decided the type of the value being stored in the ".b" case. But it would be surprising if image stores had the type determine the coordinates' precision instead of the value's, and once we turned on image instruction precision lowering we ran into asserts. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16616>	2022-06-01 22:19:44 +00:00
Matt Turner	003327dd95	freedreno/ir3: Pass 16-bit sampler coordinates when possible. shader-db highlights from Rob's android shaders: total instructions in shared programs: 769641 -> 767536 (-0.27%) instructions in affected programs: 151139 -> 149034 (-1.39%) total last-baryf in shared programs: 55908 -> 55607 (-0.54%) last-baryf in affected programs: 35219 -> 34918 (-0.85%) total sstall in shared programs: 67074 -> 65767 (-1.95%) total full in shared programs: 36115 -> 36080 (-0.10%) full in affected programs: 203 -> 168 (-17.24%) sstall in affected programs: 9510 -> 8203 (-13.74%) total (ss) in shared programs: 14380 -> 14239 (-0.98%) (ss) in affected programs: 2965 -> 2824 (-4.76%) total systall in shared programs: 92425 -> 91522 (-0.98%) systall in affected programs: 13146 -> 12243 (-6.87%) total (sy) in shared programs: 4330 -> 4314 (-0.37%) (sy) in affected programs: 167 -> 151 (-9.58%) total waves in shared programs: 71580 -> 71584 (<.01%) waves in affected programs: 12 -> 16 (33.33%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16616>	2022-06-01 22:19:44 +00:00
Emma Anholt	ac24c49c37	freedreno/ir3: Make the types of tex coord padding match. Required for lowering of coords to be 16-bit. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16616>	2022-06-01 22:19:44 +00:00
Matt Turner	edb0904775	freedreno/ir3: Move the texture array coord fixup to nir We're going to optimize sampler coordinates to FP16, so we'll need to add the appropriately typed 0.5. Move this to NIR where that information is readily available. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16616>	2022-06-01 22:19:44 +00:00
Emma Anholt	bf49d4a084	freedreno/ir3: Enable load/store vectorization for SSBO access, too. Saves a few ldib/stib instructions in gfxbench vk-5-normal compute shaders by grouping vec4 accesses together. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16616>	2022-06-01 22:19:44 +00:00
Emma Anholt	5f64bc7a3e	ci/turnip: Add missing xfails for a618 full VK run Fixes: `836ce97f5e` ("ci: bump VK-GL-CTS to 1.3.2.0") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16616>	2022-06-01 22:19:44 +00:00
Danylo Piliaiev	79e266fffc	ir3: Force late-z if FS has global store/atomic No known tests are fixed. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16816>	2022-06-01 16:30:36 +00:00
Connor Abbott	68b10b39f7	tu: Implement VK_EXT_image_2d_view_of_3d This is already allowed by the gallium driver, which uses the same code for image layout and image views, so everything Just Works and the tests pass. radv doesn't enable the sampler feature, but I don't see any reason it wouldn't work and the tests pass. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16806>	2022-06-01 15:04:31 +00:00
Connor Abbott	2e9c53549b	tu: Expose VK_KHR_synchronization2 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16374>	2022-06-01 04:06:46 +00:00
Connor Abbott	59259a0167	tu: Convert to sync2 entrypoints Use the common Vulkan fallbacks to implement the now-deprecated original entrypoints. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16374>	2022-06-01 04:06:46 +00:00
Timothy Arceri	abe4536c51	ci: uprev piglit 2022-05-31 Also document additional piglit failures and passes. Multiple changes, mostly notable: - few new tests - fixed test for upcoming mesa MR Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16785>	2022-06-01 03:14:29 +00:00
Jason Ekstrand	c8e191e369	turnip: Use nir_gather_xfb_info Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Jason Ekstrand	3e04432b3a	nir: Rename nir_gather_xfb_info to nir_shader_get_xfb_info Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Juan A. Suarez Romero	836ce97f5e	ci: bump VK-GL-CTS to 1.3.2.0 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@collabora.com> Acked-by: Alejandro Piñeiro <apinheiro@igalia.com> Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16689>	2022-05-31 15:02:08 +00:00
Erico Nunes	3d7b2f0186	ci: update linux kernel to v5.17 In addition to pushing it to the current latest stable, the v5.17 kernel for mesa CI pulls a patch to address a regression in drm that affects at least the lima jobs. The dtb for sc7180-trogdor-lazor-limozeen-nots is also updated since the old one no longer exists in v5.17. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16641>	2022-05-31 06:31:15 +00:00
Hyunjun Ko	16ea41c901	ir3: handle intrinsic_load_draw_id when scanning driver constants Fixes: #6567 Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16769>	2022-05-31 01:33:55 +00:00
David Heidelberg	2cf7f08b04	ci: traces: temporarily disable nheko trace Disable nheko trace until apitrace gets fixed. apitrace currently fails with this trace, when more than 1 run is requested. Upstream issue: https://github.com/apitrace/apitrace/issues/800 Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16774>	2022-05-31 00:00:25 +00:00
David Heidelberg	b8381aaa37	ci/freedreno: enable ROR and Nheko traces Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16633>	2022-05-27 06:51:38 +00:00
Danylo Piliaiev	713f504033	ir3: handle gl_Layer and gl_ViewportIndex when there is TES + GS Fixes CTS tests: KHR-GL46.shader_viewport_layer_array.ShaderViewportIndexTestCase KHR-GL46.shader_viewport_layer_array.ShaderLayerFramebufferLayeredTestCase KHR-GL46.shader_viewport_layer_array.ShaderLayerFramebufferNonLayeredTestCase Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6497 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16696>	2022-05-26 08:47:02 +00:00
Mike Blumenkrantz	aa32b96c51	turnip: fix assert for max xfb outputs this is a counter, not an index, so use <= cc: mesa-stable Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16620>	2022-05-21 16:48:54 +00:00
Chia-I Wu	2a8e6a4d1a	turnip: disable UBWC for SNORM formats In copy_format, we treat snorm as unorm to avoid clamping. But snorm and unorm are UBWC incompatible for special values such as all 0's or all 1's. Disable UBWC for snorm. For reference, I dumped the first byte of an UBWC blocks and it was color UNORM SNORM all black 0x01 0x31 all white 0x0d 0x11 @flto clarified that bit 4 is unset for fast clear encoded blocks. It looks like fast clear is not used for SNORM. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6480 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16534>	2022-05-21 15:27:42 +00:00
Chia-I Wu	e8eb6d13a5	turnip: fix tu6_pack_border_color for z24 The value should be at the bottom 24 bits, not at the top. dEQP-VK.pipeline.sampler.* still passes. This fixes most of dEQP-GLES31.functional.texture_border_clamp.formats.depth on angle. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16570>	2022-05-21 00:54:28 +00:00
Hyunjun Ko	f2635ca47b	turnip: add an assertion for max descriptor set count. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16624>	2022-05-20 09:49:00 +00:00
Jason Ekstrand	c24aa449d0	vulkan,anv,turnip: Add a common CmdBindVertexBuffers wrapper Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16611>	2022-05-20 02:12:37 +00:00
Emma Anholt	7938ce4af3	freedreno/ir3: Lower texture instructions used only for f2f16 to 16-bit. 2.5% improvement in gfxbench vk-5-normal. No obvious change on gl-5-normal. shader-db on Rob's android shaders: total instructions in shared programs: 770644 -> 770595 (<.01%) instructions in affected programs: 14880 -> 14831 (-0.33%) total nops in shared programs: 167784 -> 167860 (0.05%) nops in affected programs: 3351 -> 3427 (2.27%) total non-nops in shared programs: 602860 -> 602735 (-0.02%) non-nops in affected programs: 10523 -> 10398 (-1.19%) total mov in shared programs: 19313 -> 19286 (-0.14%) mov in affected programs: 365 -> 338 (-7.40%) total cov in shared programs: 18075 -> 17978 (-0.54%) cov in affected programs: 566 -> 469 (-17.14%) total dwords in shared programs: 1612848 -> 1612596 (-0.02%) dwords in affected programs: 13882 -> 13630 (-1.82%) total last-baryf in shared programs: 56144 -> 55975 (-0.30%) last-baryf in affected programs: 482 -> 313 (-35.06%) total full in shared programs: 36094 -> 36092 (<.01%) full in affected programs: 10 -> 8 (-20.00%) total sstall in shared programs: 66986 -> 66923 (-0.09%) sstall in affected programs: 1392 -> 1329 (-4.53%) total systall in shared programs: 91244 -> 91072 (-0.19%) systall in affected programs: 1194 -> 1022 (-14.41%) total (sy) in shared programs: 4316 -> 4321 (0.12%) (sy) in affected programs: 19 -> 24 (26.32%) Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16465>	2022-05-19 19:43:36 +00:00
Emma Anholt	1cf0736f1c	freedreno/ir3: Add support for 16-bit nir_texop_lod. Same basic path, just do the rescaling in half float. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16465>	2022-05-19 19:43:36 +00:00
Emma Anholt	a28d2e87d3	turnip: Make RelaxedPrecision-decorated ALU ops 16-bit. Improves gfxbench vk-5-normal performance 5.5%. Fixes: #6346 Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16465>	2022-05-19 19:43:36 +00:00
Emma Anholt	633cf4eca1	freedreno/ir3: Fix 16-bit bit_count. No need to do the 16-bit lowering if it already is. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16465>	2022-05-19 19:43:36 +00:00
Connor Abbott	9f67fa368e	tu: Implement VK_EXT_pipeline_creation_cache_control Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16562>	2022-05-18 13:14:55 +00:00
Connor Abbott	49827da6fa	tu: Implement VK_EXT_pipeline_creation_feedback Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16562>	2022-05-18 13:14:55 +00:00
Connor Abbott	e348f2fb38	tu: Zero-initialize compute driver key Fixes: `05329d7` ("tu: Implement pipeline caching with shared Vulkan cache") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16562>	2022-05-18 13:14:55 +00:00
Danylo Piliaiev	5d377f435b	freedreno/a6xx: Add EARLYPREAMBLE flag to all a6xx_sp_xs_ctrl_reg0 Each shader stage has its own "early preamble" flag. Early preamble is likely an optimization to hide some of latency when loading UBOs into consts in the preamble. Early preamble has the following limitations: - Only shared, a1, and consts regs could be used (accessing other regs would result in GPU fault); - No cat5/cat6, only stc/ldc variants are working; - Values writen to shared regs are not accessible by the rest of the shader; - Instructions before shps are also considered to be a part of early preamble. Note, for all shaders from d3d11 games blob produced preambles compatible with early preamble mode. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15901>	2022-05-18 11:17:47 +00:00
Chia-I Wu	2410993ef6	turnip: fix off-by-one in border color bitset BITSET_FFS reserves 0 for no bit set. BITSET_CLEAR just below cleared the wrong bit. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16567>	2022-05-17 23:29:15 +00:00
Timothy Arceri	d7a071a28f	gallium/drivers: set force_indirect_unrolling_sampler for all required drivers This is set to true for all drivers that have a GLSL level of support lower than 4.00. This matches the rule for setting the GLSL IR option EmitNoIndirectSampler. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16543>	2022-05-17 02:12:21 +00:00
Chia-I Wu	cb50fe7110	ir3: fix mem_ctx for ir3_disasm_info::nir Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6494 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16535>	2022-05-16 22:09:13 +00:00
Chia-I Wu	d3d34ad476	turnip: emit VPC_SO_DISABLE in xfb begin/end SO was always enabled before this change. That meant, after a call to tu_CmdBindTransformFeedbackBuffersEXT to emit VPC_SO_BUFFER_SIZE, any draw call (from the same render pass, in a different render pass, or in a different cmdbuf) could potentially cause writes to the SO buffers regardless of whether the draw is inside xfb begin/end or not. I choose to emit VPC_SO_DISABLE instead of using stateobjs like freedreno does only because it is simpler. It is not clear to me which is more efficient to HW. This also fixes double SO writes for gmem rendering. While tu6_tile_render_begin was careful to disable SO for the draw pass, tu6_emit_tile_select re-enabled it. dEQP-VK.transform_feedback.* still passes. It fixes dEQP-GLES3.functional.transform_feedback.* on angle. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16502>	2022-05-16 20:46:59 +00:00
Chia-I Wu	0b7751babf	turnip: fix sampledImageIntegerSampleCounts It seems fine to advertise msaa in sampledImageIntegerSampleCounts. dEQP-VK.rasterization.rasterization_order_attachment_access.format_integer.* goes from NotSupported to Pass for more test cases. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16487>	2022-05-16 19:26:46 +00:00
David Heidelberg	875643feeb	ci: uprev piglit 2022-05-10 Also document additional piglit failures and crashes with new tests. Multiple changes, mostly notable: - few new tests - traces downloader improvements Reviewed-by: Emma Anholt <emma@anholt.net> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16428>	2022-05-16 06:33:36 +00:00
Chia-I Wu	e9e8c649cd	freedreno/fdperf: support dumping counters This is useful for comparing two workloads. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16488>	2022-05-14 22:18:52 +00:00
Chia-I Wu	267786be60	freedreno/fdperf: make refresh rate configurable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16488>	2022-05-14 22:18:52 +00:00
Chia-I Wu	cd42f63c43	turnip: let modifier takes precedence over TU_DEBUG=noubwc TU_DEBUG=noubwc is not very usable on sway/xwayland where the wsi uses modifiers. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16507>	2022-05-14 21:56:38 +00:00
Connor Abbott	05329d7f9a	tu: Implement pipeline caching with shared Vulkan cache Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	43981f0f58	tu: Include turnip debug flags in pipeline cache UUID Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	d023ae4686	tu: Rewrite cache UUID based on radv Switch to using sha1 so that we can add as many other flags as we need to easily. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	410d59943d	tu: Hash pipeline layout contents Mostly adapted from anv. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	3e3f8b1639	ir3: Add ir3_shader_create_variant() This is similar to ir3_shader_get_variant(), but always compiles the variant from scratch and returns a variant that's owned by the user rather than the shader. We'll need this because when variants are stored in the Vulkan pipeline cache they will outlive their shader. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	ea646ac9af	ir3: Support disabling the pipeline cache Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	c7a6293635	ir3: Add functions to serialize variants This will be used by turnip to create free-floating variant objects that integrate into the Vulkan cache system. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	ceae844794	ir3: Remove ir3_shader_variant::shader Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	91160dab97	tu: Keep original blit shaders separately We won't be able to access them once the ->shader link is gone. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	69f5be8bad	ir3: Add ir3_shader_variant::compiler And replace uses of ->shader->compiler. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	4509b49fb8	ir3: Allocate disasm_info under variant This shouldn't matter much because it gets stolen later, but the shader is going away. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	21e3dd57d3	ir3: Use ir3_shader_variant::type more often Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	f45c86dfb7	ir3, fd, tu: Copy misc. info from ir3_shader to ir3_shader_variant The shader won't be available for deserialized variants, so we need to include all the info we need for compiling variants to be in the variant. Most of the things we dug out of the shader were various bits from nir_shader_info which we move into ir3_shader_variant. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	3e30608ceb	ir3, freedreno, tu: Make ir3_shader_variant store stream_output This reduces the number of uses of ir3_shader which will be gone when we deserialize the variant directly. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	3cad11d84a	tu: Delete unused tu_clear_blit GS handling This has been unused for a while since we switched to writing the array index in the VS. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Rob Clark	7292b35da0	freedreno/devices: Add another SKU Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16477>	2022-05-12 22:12:24 +00:00
Rob Clark	a31c34e0d6	freedreno/drm/virtio: Don't try to mmap imported bo's Previously it would fail, and then we'd fall back to the transfer path for things like readpix. But it would spam logcat w/ bo_mmap fail messages. Since gralloc allocated buffers for GPU usage are allocate without _USE_MAPPABLE, let's just assume we can't map imported bo's. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16477>	2022-05-12 22:12:24 +00:00
Rob Clark	62f3e703c8	freedreno/drm: Use DEBUG_GET_ONCE_OPTION() In particular this uses os_get_option() so the android setprop fallback works. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16477>	2022-05-12 22:12:24 +00:00
Danylo Piliaiev	9a11ad7efd	tu: Fix indices of drm_msm_gem_submit_cmd when filling them For some reason CTS doesn't trigger the issue... When submit entry is not filled - kernel says: [drm:msm_ioctl_gem_submit] ERROR invalid type: 00000000 Fixes: `dbae9fa7d8` ("tu: implement sysmem vs gmem autotuner") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16474>	2022-05-12 16:44:09 +00:00
Emma Anholt	b282d504a4	turnip: Add a TU_DEBUG=perf debug option. For doing performance investigation, I often find it useful to have a "are we tripping over any of our performance TODOs?" flag, so add it and use it in a few of the TODOs. This also greatly cleans up the deqp-vk logs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16316>	2022-05-12 01:00:25 +00:00
Jason Ekstrand	352e32e5ba	nir/builder: Add a nir_trim_vector helper This pattern pops up a bunch and the semantics of nir_channels() aren't very convenient much of the time. Let's add a nir_trim_vector() which matches nir_pad_vector(). Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16309>	2022-05-11 14:47:33 +00:00
Danylo Piliaiev	187d3df52c	tu: Do not flush ccu in clear/blits during renderpass For clear/blits ccu flush not only worse for perf, but also messes up flush_bits when executed in a conditional set of commands. We already don't flush for 3d blits. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6419 Fixes: `487aa807bd` ("tu: Rewrite flushing to use barriers") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16352>	2022-05-11 08:07:50 +00:00
Danylo Piliaiev	db69218cbe	tu: Implement VK_EXT_image_view_min_lod Relevant tests: dEQP-VK.texture.mipmap..image_view_min_lod. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16292>	2022-05-09 07:53:41 +00:00
Rob Clark	409b76511c	freedreno/drm-shim: Better iova handling We actually want to use util_vma to handle this. But fortunately core drm-shim alredy does this for mem offset, we can just delete a bunch of code and re-use that. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16250>	2022-05-02 19:50:33 +00:00
Rob Clark	97f4e48717	freedreno/drm-shim: Robustify error handling We can't be so sloppy if we are using drm-shim for fuzzing. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16250>	2022-05-02 19:50:33 +00:00
Rob Clark	d06fc7bb4f	freedreno/drm-shim: Update to latest uapi version Needed for fuzzing virgl drm native context. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16250>	2022-05-02 19:50:33 +00:00
Rob Clark	69edfcaa20	freedreno/drm: Fix bos_on_stack calculation Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16263>	2022-05-01 15:53:10 +00:00
Chia-I Wu	53d87865ca	turnip: fix drm modifier support with planar formats We need to advertise the results of tu6_plane_count and handle VK_IMAGE_ASPECT_MEMORY_PLANE_*_BIT. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6374 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16169>	2022-04-29 22:30:45 +00:00
Danylo Piliaiev	6e6ba85fd9	turnip: Fix tu_debug_flags values clashing Was not caught during rebase... Fixes: `725ae34458` ("turnip: Add debug option to print gmem load/store skip stats") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16246>	2022-04-29 15:09:36 +00:00
Danylo Piliaiev	725ae34458	turnip: Add debug option to print gmem load/store skip stats TU_DEBUG=log_skip_gmem_ops would print stats about skipped gmem/load every second. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15974>	2022-04-29 09:29:55 +00:00
Danylo Piliaiev	0c489f18cb	turnip: Skip load/stores for tiles with no geometry When HW binning is used tile loads/stores could be skipped if there is no geometry in the tile. Loads could be skipped when: - The attachment won't be resolved, otherwise if load is skipped there would be holes in the resolved attachment; - There is no vkCmdClearAttachments afterwards since it is likely a partial clear done via 2d blit (2d blit doesn't produce geometry). Stores could be skipped when: - The attachment was not cleared, which may happen by load_op or vkCmdClearAttachments; - When store is not a resolve. I chose to predicate each load/store separately to allow them to be skipped when only some attachments are cleared or resolved. Gmem loads are moved into separate cs because whether to emit CP_COND_REG_EXEC depends on HW binning being enabled and usage of vkCmdClearAttachments. CP_COND_REG_EXEC predicate could be changed during draw_cs only by perf query, in such case the predicate should be re-emitted. (At the moment it is always re-emitted before stores) Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15974>	2022-04-29 09:29:55 +00:00
Danylo Piliaiev	d5debf0d8a	freedreno/a6xx: Add UNK fields to CP_REG_TEST and CP_COND_REG_EXEC Their meaning is unknown, however they DO change the behavior. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15974>	2022-04-29 09:29:55 +00:00
Emma Anholt	536c8ee96d	nir/lower_tex: Make the adding a 0 LOD to nir_op_tex in the VS optional. This controls the whole lowering of "make tex ops with implicit derivatives on non-implicit-derivative stages be tex ops with an explicit lod of 0 instead", but it's really hard to describe that in a git commit summary. All existing callers get it added except: - nir_to_tgsi which didn't want it. - nouveau, which didn't want it (fixes regressions in shadowcube and shadow2darray with NIR, since the shading languages don't expose txl of those sampler types and thus it's not supported in HW) - optional lowering passes in mesa/st (lower_rect, YUV lowering, etc) Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16156>	2022-04-28 21:26:08 +00:00
Rob Clark	e42cea4db6	freedreno/drm/virtio: Split up large uploads Might be useful if host cached mmaps.. but OTOH we don't want to burn up address space. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	0aab310439	freedreno/drm/virtio: Async ccmd batching This could be a bit more clever an avoid extra memcpy.. but that seems to be in the noise at this point. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	528fa581c1	freedreno/drm/virtio: Pass guest handles to execbuf This is needed for the VIRTGPU_WAIT ioctl to work. TODO we could perhaps limit this, since it is not needed for residency, but only fencing. Ie. we could omit cmdstream, and probably anything that has FD_BO_NOMAP flag. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	cb5f25ea71	freedreno/drm/virtio: Protocol updates This syncs up with the protocol of what eventually landed in virglrender. 1) Move all static params to capset to avoid having to query host (reduce synchronous round trips at startup) 2) Use res_id instead of host_handle.. costs extra hashtable lookups in host during submit, but this lets us (with userspace allocated IOVA) make bo alloc and import completely async. 3) Require userspace allocated IOVA to simplify the protocol and not have to deal with GEM_NEW/GEM_INFO potentially being synchronous. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	fa23ddf258	freedreno/drm/virtio: Fix SHAREABLE+MAPPABLE A shareable bo should also be mappable if FD_BO_NOMAP is not set. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	441f01e778	freedreno/drm/virtio: Drop blocking in host These paths should be corner cases, but still it is a bad idea to block in the host (because it is single threaded), so instead just turn waits in the host into polling in the guest. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	e6b2785811	freedreno/drm/virtio: Use userspace IOVA allocation If supported by host virglrenderer and host kernel, use userspace allocated GPU virtual addresses. This lets us avoid stalling on waiting for response from host kernel until we need to know the host handle (which is usually not until submit time). Handling the async response from host to get host_handle is done thru the submit_queue, so that in the submit path (hot) we do not need any additional synchronization to know that the host_handle is valid. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	ae01c27ac0	freedreno/drm/virtio: Support ring_idx ring_idx zero is the CPU ring, others map to the priority level, as each priority level for a given drm_file on the host kernel side maps to a single fence timeline. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	122cedf98c	freedreno/drm: Move bo common init We'll need this to happen before virtio_bo_new() returns in the next patch. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	d52455a962	freedreno/drm: Close bo handle after bo->destroy() For userspace allocated iova, we want to give the backend a chance to release the iova before the handle is closed. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	4ed346c6fb	freedreno/drm: Drop FD_PP_PGTABLE Unused. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	0004cae638	freedreno/drm/virtio: Appease valgrind Valgrind isn't seeing that the kernel is initializing the caps (or returning an error). Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	d79c71c705	freedreno: Misc indent fixes Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Emma Anholt	550975f229	turnip: Don't disable LRZ in subpasses after the first in the easy case. If it's the same depth/stencil attachment, then there's no need to turn off LRZ just because the subpass changed. Doesn't help gfxbench perf yet, but will with !16014. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15982>	2022-04-19 18:45:30 +00:00
Emma Anholt	7ba63f516a	turnip: Ignore TOP/BOTTOM_OF_PIPE bits in subpass src/dst dep flags. gfxbench sets these between the gbuffer subpass and the following ones. They should be no-ops as subpass dependencies. gfxbench vk-5-debug perf 12.8 -> 14.6 fps thanks to getting gmem on the gbuffer rendering. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15982>	2022-04-19 18:45:30 +00:00
Emma Anholt	1bcd848816	freedreno/ir3: Call nir_opt_find_array_copies(). gfxbench vk-5-normal has a shader that sampels into a texels[] array at the top, then in a loop calls a GLSL function passing texels[] in by value. This resulted in a copy to a temp inside the loop, which got lowered to scratch stores since it was pretty big. By doing find_array_copies(), we notice that it's equivalent to copy_deref, then get to copy-propagate from the array at the top. Then we only have to set up the scratch array outside of the loop and load_scratch from it in the called function inside the loop. This also causes there to be less spilling, stps 1144 -> 354 and ldps 826->36. However, it doesn't seem to change performance on the test. So, while this seems to be an improvement for the shader, and we could maybe even do better by rematerializing the txl samples inside the loop instead of storing the texture fetches to scratch in the first place, it doesn't currently seem worth pursuing more optimization of this shader. No change on freedreno shader-db. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15982>	2022-04-19 18:45:29 +00:00
Emma Anholt	7ba0c44607	turnip: Add nir_opt_conditional_discard. We can easily do discard_if in the backend without control flow, but it wasn't done in ir3 because the GL frontend already did it for us. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15982>	2022-04-19 18:45:29 +00:00
Emma Anholt	d60282f5d2	freedreno/ir3: Make sched nodes before adding deps. The mark_kill_path() during dep setup follows SSA srcs, which when a phi is involved may include a def from later in the same block, that we hadn't created yet. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15982>	2022-04-19 18:45:29 +00:00
Emma Anholt	ce15bf19fb	turnip: Add TU_DEBUG=layout for dumping image layouts. This was useful for comparing image allocations between gfxbench gl_5_normal and vk_5_normal to see if rendering was generally equivalent (formats, MSAA, UBWC choices, and notably gfxbench vk was choosing DXT5 instead of ASTC on non-android builds!) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15982>	2022-04-19 18:45:29 +00:00
Danylo Piliaiev	2c683519e2	turnip: Try harder to keep LRZ valid and fix a few edge cases Refactored tu6_calculate_lrz_state and added comments. 1) If there is no depth write we could keep LRZ valid with any compare op, we just have to temporary disable LRZ for incompatible ops in such case. 2) Found that VK_COMPARE_OP_EQUAL is not compatible with LRZ, and since it doesn't change LRZ buffer - LRZ could be just temporary disabled. This fixes rendering of grass/trees in PUBG mobile on angle. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6127 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16014>	2022-04-19 18:06:58 +00:00
illiliti	67af7e2b40	Use proper types for meson objects Fix invalid usage of meson objects which violates official meson specification and thus breaks muon, an implementation of meson written in C. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15715>	2022-04-18 13:03:08 +03:00
Emma Anholt	835704e669	turnip: Move autotune buffers to suballoc. Now the ANGLE trex_200 trace replay does a single BO allocation at startup for autotune results instead of one per frame (~350 for the whole replay). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Emma Anholt	7c636acd53	turnip: Get autotune off of ralloc destructors. We've wanted to remove destructors from ralloc's API for a long time (it's an extra storage cost per ralloc for a rarely-used feature), and for the suballoc change we'd need to spend more storage on storing the tu_device pointer per result since destructors don't get anything else but the pointer passed into them. Fixes use-after-frees: ================================================================= ==2383==ERROR: AddressSanitizer: heap-use-after-free on address 0xffff88fe1940 at pc 0xffff934f427c bp 0xfffff5481e90 sp 0xfffff5481ea8 WRITE of size 8 at 0xffff88fe1940 thread T0 #0 0xffff934f4278 in list_del ../src/util/list.h:108 #1 0xffff934f4278 in result_destructor ../src/freedreno/vulkan/tu_autotune.c:237 #2 0xffff9377793c in unsafe_free ../src/util/ralloc.c:300 #3 0xffff9377793c in ralloc_free ../src/util/ralloc.c:265 #4 0xffff934f4368 in history_destructor ../src/freedreno/vulkan/tu_autotune.c:229 #5 0xffff9377793c in unsafe_free ../src/util/ralloc.c:300 #6 0xffff9377793c in ralloc_free ../src/util/ralloc.c:265 #7 0xffff934f5990 in tu_autotune_on_submit ../src/freedreno/vulkan/tu_autotune.c:442 [...] 0xffff88fe1940 is located 80 bytes inside of 112-byte region [0xffff88fe18f0,0xffff88fe1960) freed by thread T0 here: #0 0xffff9c1c90d8 in __interceptor_free ../../../../src/libsanitizer/asan/asan_malloc_linux.cpp:127 #1 0xffff934f4368 in history_destructor ../src/freedreno/vulkan/tu_autotune.c:229 #2 0xffff9377793c in unsafe_free ../src/util/ralloc.c:300 #3 0xffff9377793c in ralloc_free ../src/util/ralloc.c:265 #4 0xffff934f5990 in tu_autotune_on_submit ../src/freedreno/vulkan/tu_autotune.c:442 #5 0xffff935cf2ac in tu_queue_submit_locked ../src/freedreno/vulkan/tu_drm.c:997 [...] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Emma Anholt	435d4f08b2	turnip: Reduce the pipeline's CS allocation a bit. We don't return unused space to the suballocator, so it's a little useful to limit how much we overallocate to reduce memory footprint. I took a look through the tu_cs_emit_array() calls and accounted for a couple of them in the variant-specific space calculation, then dropped the base allocation by factors of 2 until we started throwing asserts. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Emma Anholt	58f6331eec	turnip: Skip telling the kernel the BO list when we don't need any. In fencing, we sometimes do a dummy submit with no nr_cmds. If we don't have commands to execute, we don't need to pin or fence any BOs either. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Emma Anholt	dc3203b087	turnip: Sub-allocate pipelines out of a device-global BO pool. Allocating a BO for each pipeline meant that for apps with many pipelines (such as Asphalt9 under ANGLE), we would end up spending too much time in the kernel tracking the BO references. Looking at CS:Source on zink, before we had 85 BOs for the pipelines for a total of 1036 kb, and now we have 7 BOs for a total of 896 kb. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Emma Anholt	e0fbdd3eda	turnip: Stop allocating unused pvtmem space in the pipeline CS. The pvtmem was split off to a separate read/write BO. Fixes: `931ad19a18` ("turnip: make cmdstream bo's read-only to GPU") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Emma Anholt	80c44a6626	turnip: Track refcounts on BOs in kgsl as well. I'm going to be using the BO refcount for the pipeline and autotune buffer suballocation. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Tomeu Vizoso	9d5fa59322	Revert "ci/freedreno: Disable a618 jobs" This reverts commit `96e17287b4`. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15791>	2022-04-08 14:07:31 +00:00
Connor Abbott	32af90d96f	freedreno/a6xx: Fix SP_DS_CTRL_REG0 definition Bit 20 isn't actually MERGEDREGS, the mode for the entire geometry pipeline is controlled by SP_VS_CTRL_REG0::MERGEDREGS and it appears to be something preamble-related instead since writing any register in the preamble hangs if it's set. This fixes those hangs on freedreno and turnip since we no longer set it. Fixes: `fccc35c2de` ("ir3: Add preamble optimization pass") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15801>	2022-04-08 04:40:17 +00:00
Emma Anholt	75a4e3f0e8	Revert "ci/freedreno: Reduce concurrency when replaying traces on a630" This reverts commit `d948f32365`. I think that fixing the timeout will have resolved this problem. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15805>	2022-04-08 00:24:20 +00:00
Emma Anholt	d51aea7f57	freedreno: Fix the cpu-prep wait to be "infinite". We don't need to restrict our timeout to 5 seconds, because the kernel's hangcheck will ensure that the wait completes in finite time if the GPU gets wedged. If the GPU is making progress, we don't want to time out early and have pipe_transfer_map() return an error, causing glReadPixels() to throw a confusing GL_OOM even though we're not out memory. The INFINITE arg to this function isn't actually infinite, it's limited to an hour. But an hour of GPU processing to wait on is probably plenty. This 5s timeout has caused problems with the CTS on freedreno at high parallelism, and I suspect is the cause of recent issues in the closed traces replay jobs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15805>	2022-04-08 00:24:20 +00:00
Danylo Piliaiev	dde1623ed2	turnip: Implement VK_EXT_primitives_generated_query Similar to pipeline statistics but done for a single counter. We use REG_A6XX_RBBM_PRIMCTR_7 to get generated primitives and not PRIMCTR_8 because PRIMCTR_7 counts pre-clipped prims while PRIMCTR_8 counts them after clipping. OpenGL spec for GL_PRIMITIVES_GENERATED says: "Subsequent rendering will increment the counter once for every vertex that is emitted from the geometry shader, or from the vertex shader if no geometry shader is present." Passes tests: dEQP-VK.transform_feedback.primitives_generated_query.* Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15746>	2022-04-07 08:01:59 +00:00
Emma Anholt	3cf28d16f6	ci: Uprev deqp-runner and piglit. deqp-runner uprevved to reduce memory usage on HW runners, let us experiment with shader cache on tmpfs, and hopefully provide a tool for virgl to be able to plausibly run piglit under crosvm instead of vtest. piglit uprevved to avoid a flake in softpipe in glx-multithread-texture, and improve performance of the test, too. This also brings in the fbo-blending-format-quirks fix to properly initialize the buffers, fixing some fails/flakes. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15419>	2022-04-06 20:43:53 +00:00
Emma Anholt	cd39523c53	ci/turnip: Drop xfails for create_list_modifiers. These were fixed in `5ce06f8474` ("turnip: Use correct type for OUTARRAY in FormatProperties2"), but they aren't included in the pre-merge CI run. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15419>	2022-04-06 20:43:52 +00:00
Danylo Piliaiev	25202b5861	ci/freedreno: Add fractional test of forced unaligned gmem store Unaligned gmem store is a mostly untested path since most of the times faster path is chosen. We have to force unaligned store to really test it. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15773>	2022-04-06 19:53:27 +00:00
Danylo Piliaiev	a5a97f0b77	turnip: Fix subpassLoad from CUBE input attachments Cube descriptors require a different sampling instruction in shader, however we don't know whether image is a cube or not until the start of a renderpass. We have to patch the descriptor to make it compatible with how it is sampled in shader. For the reference subpassLoad is currently translated into isaml.a Blob v615 also doesn't handle this case correctly. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15734>	2022-04-06 19:42:30 +03:00
Danylo Piliaiev	6c18602164	turnip: Add "unaligned_store" debug option to better test gmem stores Unaligned store is incredibly rare in CTS, we have to force it to actually test it. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15532>	2022-04-06 08:44:28 +00:00
Danylo Piliaiev	e255305e84	turnip: Ignore aspectMask for D32S8 framebuffer attachment Vulkan spec says: "When an image view of a depth/stencil image is used as a depth/stencil framebuffer attachment, the aspectMask is ignored and both depth and stencil image subresources are used." Since we use two planes for D32S8 format we have to add a special case for depth in addition to already existing case for stencil. Fixes hang in CTS: dEQP-VK.renderpass.depth_stencil_write_conditions.stencil_kill_write_d32sf_s8ui Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15532>	2022-04-06 08:44:28 +00:00
Danylo Piliaiev	72716993b2	turnip: Correctly store separate stencil in gmem store - When resolving d32s8 to s8 we stored stencil with a wrong format. - For unaligned multi-sample store we used wrong gmem offset for stencil. If unaligined store is forced this change fixes a hang in: dEQP-VK.renderpass2.depth_stencil_resolve.image_2d_32_32.samples_2.d32_sfloat_s8_uint_separate_layouts.compatibility_depth_zero_stencil_zero_testing_stencil Fixes: `b157a5d0d6` ("tu: Implement non-aligned multisample GMEM STORE_OP_STORE") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15532>	2022-04-06 08:44:28 +00:00
Jason Ekstrand	bdf52654ac	turnip: Enable VK_EXT_debug_utils It's implemented in common code as long as you use vk_command_buffer. Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15560>	2022-04-06 01:18:23 +00:00
Connor Abbott	b91b90c256	tu: Expose VK_KHR_maintenance4 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15488>	2022-04-05 17:46:35 +00:00
Connor Abbott	5eb63d825f	tu: Remove tu_pipeline::layout This makes it more obvious that the layout is never used after creating the pipeline, which is required by VK_KHR_maintenance4. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15488>	2022-04-05 17:46:35 +00:00
Connor Abbott	7455a7a44c	tu: Fill out maxBufferSize It seems this is really a workaround for silly issues in GetBufferMemoryRequirements when you ask for a really large buffer. Just expose the maximum possible size ATM. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15488>	2022-04-05 17:46:35 +00:00
Connor Abbott	d1762b7df0	tu: Implement GetDevice*MemoryRequirements() Based mostly on anv, which is a bit more optimized than radv - we at allocate the image on the stack. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15488>	2022-04-05 17:46:35 +00:00
Omar Akkila	4208895175	ci: bump VK-GL-CTS to 1.3.1.1 Signed-off-by: Omar Akkila <omar.akkila@collabora.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15668>	2022-04-04 23:04:33 +00:00
Tomeu Vizoso	d948f32365	ci/freedreno: Reduce concurrency when replaying traces on a630 We are running out of memory when replaying traces sometimes, reduce the number of concurrent retrace processes. Mesa: User error: GL_OUT_OF_MEMORY in glReadPixels warning: GL_OUT_OF_MEMORY while getting snapshot 1074335: warning: failed to get snapshot https://gitlab.freedesktop.org/mesa/mesa/-/jobs/20519522 Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15682>	2022-04-04 12:48:40 +00:00
Emma Anholt	e1de9b0de5	turnip: Allow image access on swapped formats. This is apparently something that gamescope would like to have, and the CTS's test coverage is happy with it. Fixes: #6011 (we hope) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15293>	2022-04-02 19:55:40 +00:00
Emma Anholt	4cd51efedb	turnip: Disable tiling on 1D images. If we know the height is 1, then it would be a waste to align each miplevel to tile height. For non-mipmapped textures, it doesn't save us memory (since you still align to 4 on the last miplevel), but it should be better cache locality by not loading those unused lines. Incidentally, this gets us some more coverage of swap != WZYX cases in CTS tests, which often use optimal tiling without also testing linear. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15293>	2022-04-02 19:55:40 +00:00
Emma Anholt	71fcb751eb	freedreno/a6xx: Set the color_swap field for storage descriptors. This field does appear to work as expected: with 1D/1DArray turnip storage images switched to be always linear, it fixes the dEQP-VK.image.store tests using a color swapped format (once we allow color swap). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15293>	2022-04-02 19:55:40 +00:00
Emma Anholt	51b04a7dfb	turnip: Add support for VK_KHR_format_feature_flags2. This reports all of our storage formats as supporting read/write without format, since we don't have any in-shader format conversions. Similarly, shadow comparisons were already supported on all the depth formats. This extension is required for VK 1.3. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15293>	2022-04-02 19:55:40 +00:00
Danylo Piliaiev	5ce06f8474	turnip: Use correct type for OUTARRAY in FormatProperties2 Fixes: `799a9db24c` ("turnip: Stop using VK_OUTARRAY_MAKE()") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15694>	2022-04-02 09:51:45 +00:00
Vinod Koul	28ae397be1	freedreno/registers: update dsi registers to support dsc Display Stream compression (DSC) compresses the display stream in host which is later decoded by panel. This requires addition of 3 new DSI registers to support DSC over DSI. Signed-off-by: Vinod Koul <vkoul@kernel.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14967>	2022-04-01 21:56:40 +00:00
Rajnesh Kanwal	d5405c1608	vulkan: Move common format function to vulkan/util/vk_format.h Moving duplicate vk_format helper functions to common vulkan/util/vk_format.h and also renaming vk_format_get_component_size_in_bits to match how amd and freedreno name the same function. Not moving this function to common code as freedreno's implementation is a bit different. Signed-off-by: Rajnesh Kanwal <rajnesh.kanwal@imgtec.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15696>	2022-03-31 17:18:22 +00:00
Danylo Piliaiev	10734fb748	turnip: enable has_ccu_flush_bug workaround for a660 It seems that a660 has the same bug. Without the workaround there are a lot of flakes with depth-stencil tests, e.g. in: dEQP-VK.pipeline.extended_dynamic_state.* dEQP-VK.renderpass.depth_stencil_write_conditions.* dEQP-VK.pipeline.stencil.format.d24_unorm_s8_uint.states.* Or guaranteed failures like of: dEQP-VK.pipeline.render_to_image.core.2d.huge.width.r8g8b8a8_unorm_d32_sfloat_s8_uint Enabling the workaround fixes all of them. cc: mesa-stable Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15548>	2022-03-29 08:34:18 +00:00
Connor Abbott	0b0b9274b6	freedreno/ci: Fix skip comment This test was never supposed to be skipped, and the referenced commit just exposed a bug in turnip fixed by the previous commit. It was hanging due to a CTS bug making the submit take way too long, which will be fixed once the CTS change lands. Also, add it to the a630 skips. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15563>	2022-03-28 17:16:54 +00:00
Connor Abbott	9d081d7561	tu: Correctly handle VK_IMAGE_CREATE_EXTENDED_USAGE_BIT In this case we should relax checks based on the format, since the user will be responsible for them when creating an image view. This gets dEQP-VK.image.sample_texture._bit_compressed_format_ not skipping again after VK-GL-CTS 736eec57dc0c ("Fix checkSupport in compressed texture sampling tests"). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15563>	2022-03-28 17:16:54 +00:00
Danylo Piliaiev	37939e9c54	turnip: Fix the lack of WFM before indirect draws We have to add WFM to pending bits when we are flushing into CP for indirect draw to know when they should apply WFM workaround. Fixes CTS tests: dEQP-VK.draw.renderpass.indirect_draw._data_from_compute.indirect_draw_count Fixes: `abf0ae014a` ("tu: Properly handle waiting on an earlier pipeline stage") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15577>	2022-03-28 16:09:07 +00:00
Boris Brezillon	799a9db24c	turnip: Stop using VK_OUTARRAY_MAKE() We're trying to replace VK_OUTARRAY_MAKE() by VK_OUTARRAY_MAKE_TYPED() so people don't get tempted to use it and make things incompatible with MSVC (which doesn't support typeof()). Suggested-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15522>	2022-03-25 11:00:02 +00:00
Rob Clark	c0f52f08a1	freedreno/ci: Update a306 expectations These have started to flakey UnexpectedPass somewhere along the way. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	802f4da5ee	freedreno/drm: Add virtio backend Add a new backend to enable using native driver in a VM guest, via a new virtgpu context type which (indirectly) makes host kernel interface available in guest and handles the details of mapping buffers to guest, etc. Note that fence-fd's are currently a bit awkward, in that they get signaled by the guest kernel driver (drm/virtio) once virglrenderer in the host has processed the execbuf, not when host kernel has signaled the submit fence. For passing buffers to the host (virtio-wl) the egl context in virglrenderer is used to create a fence on the host side. But use of out-fence-fd's in guest could have slightly unexpected results. For this reason we limit all submitqueues to default priority (so they cannot be preepmted by host egl context). AFAICT virgl and venus have a similar problem, which will eventually be solveable once we have RESOURCE_CREATE_SYNC. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	2200d674e4	freedreno/drm: Reorder device destroy Call backend specific cleanup fxn earlier. This is needed if the backend has things like bo's to delete, otherwise the handle_table will already be destroyed causing problems in bo_del() Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	ea339137b0	freedreno/drm: Extract out "softpin" submit/ringbuffer base class We are going to want basically the identical thing, other than flush_submit_list, for virtio backend. Now that we've moved various other dependencies into the base classes, extract out an abstract base class for submit/ringbuffer. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	72a427244f	freedreno/drm: Move ring_pool slab parent to base Prep to move most of sp submit/ringbuffer to something that can be re-used by virtio backend. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	877f9049c3	freedreno/drm: Move bo idx to base The virtio backend will want this too, and it will make it easier to share most of the submit/ringbuffer implementation with the virtio backend. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	2ac9b23f78	freedreno/drm: Move submit_queue to base The virtio backend will want this too. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	88a10c6216	freedreno/drm: Avoid CPU_PREP ioctl if bo is idle With userspace fences, if we know definitely that the buffer is idle (which implies that it is not shared with other processes, etc), then skip the ioctl. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	9bcc983256	freedreno/drm: Add fd_bo_upload() There are some buffers that we mmap just to write to them a single time. Add the possibility of the drm backend to provide an alternate upload path to avoid these mmap's. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	115518ec35	freedreno/drm: Add FD_BO_SHARED hint With the virtio backend we will need to pass an extra flag when allocating buffers that will be shared cross-device (such as with virtio-wl for passing between host and guest) Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	f846181fe5	freedreno/drm: Add FD_BO_NOMAP hint Add a hint for buffers that we won't need to mmap. With the virtio backend, virglrenderer needs to create a dmabuf fd for mapping into the host, which we want to avoid when possible. Low hanging fruit is to use this hint for anything tiled/ubwc. There are probably more bo's that can be flagged as such. TODO add fd_bo_upload() for memcpy to bo.. this would be useful for uploads, for example, shaders which we just write once and never touch again.. for virtio this could be implemented with a TRANSFER_TO_HOST ioctl. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	598405c91f	freedreno/drm: Rework bo creation path Decoupling handle and fd_bo creation simplifies things for "normal" drm drivers, avoiding duplication for the create vs import paths. But this is awkward for the virtio backend when wants to do multiple things in the same guest<->host round trip. So instead, split the paths in the interface backend and move the code sharing for the two different paths into the msm backend itself. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	9ea36968d3	freedreno/drm: Add fd_device_open() helper Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	2bc815878c	freedreno/drm: Split msm backend into subdir Let's keep things a bit better organized when we add a new backend. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Tomeu Vizoso	9f43dac0ca	ci/freedreno: Increase console timeout for perf jobs Piglit is very sparse in its status output and downloads of big traces can take a while. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15527>	2022-03-24 05:33:54 +00:00
Tomeu Vizoso	d0e99e566f	ci/freedreno: Update checksum for GolfWithYourFriends trace The MR below changed the rendering slightly and the checksum isn't valid any more: "ir3, turnip, freedreno: Shader preambles" https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148 Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15526>	2022-03-24 00:04:20 +00:00
Danylo Piliaiev	5d151ddfba	turnip: Disallow non-linear tiling when casting R8G8 to other fmts R8G8 have a different block width/height and height alignment from other formats that would normally be compatible (like R16), and so if we are trying to, for example, sample R16 as R8G8 we need to demote to linear. Follows the fix in Freedreno: `b97e3bb2e1` Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15465>	2022-03-22 13:47:21 +00:00
Danylo Piliaiev	a70b197741	turnip: Force linear mode for non-ubwc R8G8 formats Non-UBWC tiled R8G8 is probably buggy since media formats are always either linear or UBWC. There is no simple test to reproduce the bug. However it was observed in the wild leading to an unrecoverable hang on a650/a660. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5926 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15465>	2022-03-22 13:47:21 +00:00
Andrey Konovalov	ed2f496ce4	ir3: set local_size for shaders of MESA_SHADER_KERNEL type ir3_compile_shader_nir() should set local_size[] and local_size_variable fields not only for compute shaders, but for the OpenCL kernels too. v2: use gl_shader_stage_is_compute() instead of explicit comparison with MESA_SHADER_[COMPUTE,KERNEL]. Signed-off-by: Andrey Konovalov <andrey.konovalov@linaro.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14863>	2022-03-18 23:20:25 +00:00
Emma Anholt	f831ba238f	ci/turnip: Increase the hangcheck timer to 2 seconds. We get a lot of useful coverage from running graphicsfuzz with spilling enabled, but it's also pretty slow and can cause intermittent hangcheck failures. I thought I'd categorized them when merging !14839 (device loss on reset), but it looks like not all of them and we're now more likely to have flakes take out the whole test run when a single flake makes the rest of the caselist a flake. This is a little unfortunate in that it means our test environment is not the same as a stock system you would want to run deqp on to submit conformance, but I think it's an improvement in the test maintenance work vs needing to fix things up later. We have some other tests besides turnip that can trigger hangchecks which we might also like this increase for (some disabled traces, for example). However, freedreno GL has a 5-second timeout waiting for idle when mapping, and a couple of 2-second timeouts in a row can result in spurious failures in other tests! Fixes: #6163 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15435>	2022-03-18 19:07:24 +00:00
Connor Abbott	fc381fa1e3	tu: Actually expose VK_EXT_texel_buffer_alignment Oops... Fixes: `3d04c435` ("tu: Trivially implement VK_EXT_texel_buffer_alignment") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15451>	2022-03-18 18:30:20 +00:00
Jason Ekstrand	2a779f98dc	turnip: Drop tu_legacy.c The remaining three helpers all have helpers in the common code. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15459>	2022-03-18 11:19:08 -05:00
Connor Abbott	3d04c43576	tu: Trivially implement VK_EXT_texel_buffer_alignment The previous alignment of 64 bytes, which we got from the blob, indicates that single-texel alignment isn't supported. So just do a trivial no-op implementation that returns the same alignment as before. This matches what newer blobs that expose this extension do. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15427>	2022-03-17 20:45:19 +00:00
Tomeu Vizoso	96e17287b4	ci/freedreno: Disable a618 jobs Some of these machines are experiencing networking problems currently. Disable for now so people aren't blocked. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15430>	2022-03-17 17:43:06 +00:00
Connor Abbott	072fdcabcd	tu: Enable UniformBufferUpdateAfterBind UBOs are now read at run-time via the preamble so this can be enabled. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	9932ca8a3f	ir3, turnip: Use ldc.k to push UBOs This reuses the same UBO analysis to do the pushing in the shader preamble via the ldc.k instruction instead of in the driver via CP_LOAD_STATE6. The const_data UBO is exempted as it uses a different codepath that isn't as critical. Don't do this on gallium because there are some regressions. Aztec Ruins in particular regresses a bit, and nothing I've benchmarked benefits. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	221a912b8c	ir3: Refactor ir3_compiler_create() to take an options struct This will let us add more options without creating too much churn. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	acba08b58f	ir3: Implement and document ldc.k Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	fccc35c2de	ir3: Add preamble optimization pass Now that everything is plumbed through, we can tie it together. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	986f7adfee	ir3: Don't include preamble instructions in stats Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	42e21c751b	ir3: Insert frag coord code after preamble To match the pre-preamble behavior, and so that we can better schedule it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	b6fe69d855	ir3: Support prefetching with preambles Since the NIR pass runs very late, it needs to be aware of preambles, and when creating the instruction we need to move it to the start block so that RA doesn't overwrite it in the preamble. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00

... 2 3 4 5 6 ...

3456 Commits