KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Hyunjun Ko	16ea41c901	ir3: handle intrinsic_load_draw_id when scanning driver constants Fixes: #6567 Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16769>	2022-05-31 01:33:55 +00:00
David Heidelberg	2cf7f08b04	ci: traces: temporarily disable nheko trace Disable nheko trace until apitrace gets fixed. apitrace currently fails with this trace, when more than 1 run is requested. Upstream issue: https://github.com/apitrace/apitrace/issues/800 Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16774>	2022-05-31 00:00:25 +00:00
David Heidelberg	b8381aaa37	ci/freedreno: enable ROR and Nheko traces Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16633>	2022-05-27 06:51:38 +00:00
Danylo Piliaiev	713f504033	ir3: handle gl_Layer and gl_ViewportIndex when there is TES + GS Fixes CTS tests: KHR-GL46.shader_viewport_layer_array.ShaderViewportIndexTestCase KHR-GL46.shader_viewport_layer_array.ShaderLayerFramebufferLayeredTestCase KHR-GL46.shader_viewport_layer_array.ShaderLayerFramebufferNonLayeredTestCase Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6497 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16696>	2022-05-26 08:47:02 +00:00
Mike Blumenkrantz	aa32b96c51	turnip: fix assert for max xfb outputs this is a counter, not an index, so use <= cc: mesa-stable Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16620>	2022-05-21 16:48:54 +00:00
Chia-I Wu	2a8e6a4d1a	turnip: disable UBWC for SNORM formats In copy_format, we treat snorm as unorm to avoid clamping. But snorm and unorm are UBWC incompatible for special values such as all 0's or all 1's. Disable UBWC for snorm. For reference, I dumped the first byte of an UBWC blocks and it was color UNORM SNORM all black 0x01 0x31 all white 0x0d 0x11 @flto clarified that bit 4 is unset for fast clear encoded blocks. It looks like fast clear is not used for SNORM. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6480 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16534>	2022-05-21 15:27:42 +00:00
Chia-I Wu	e8eb6d13a5	turnip: fix tu6_pack_border_color for z24 The value should be at the bottom 24 bits, not at the top. dEQP-VK.pipeline.sampler.* still passes. This fixes most of dEQP-GLES31.functional.texture_border_clamp.formats.depth on angle. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16570>	2022-05-21 00:54:28 +00:00
Hyunjun Ko	f2635ca47b	turnip: add an assertion for max descriptor set count. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16624>	2022-05-20 09:49:00 +00:00
Jason Ekstrand	c24aa449d0	vulkan,anv,turnip: Add a common CmdBindVertexBuffers wrapper Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16611>	2022-05-20 02:12:37 +00:00
Emma Anholt	7938ce4af3	freedreno/ir3: Lower texture instructions used only for f2f16 to 16-bit. 2.5% improvement in gfxbench vk-5-normal. No obvious change on gl-5-normal. shader-db on Rob's android shaders: total instructions in shared programs: 770644 -> 770595 (<.01%) instructions in affected programs: 14880 -> 14831 (-0.33%) total nops in shared programs: 167784 -> 167860 (0.05%) nops in affected programs: 3351 -> 3427 (2.27%) total non-nops in shared programs: 602860 -> 602735 (-0.02%) non-nops in affected programs: 10523 -> 10398 (-1.19%) total mov in shared programs: 19313 -> 19286 (-0.14%) mov in affected programs: 365 -> 338 (-7.40%) total cov in shared programs: 18075 -> 17978 (-0.54%) cov in affected programs: 566 -> 469 (-17.14%) total dwords in shared programs: 1612848 -> 1612596 (-0.02%) dwords in affected programs: 13882 -> 13630 (-1.82%) total last-baryf in shared programs: 56144 -> 55975 (-0.30%) last-baryf in affected programs: 482 -> 313 (-35.06%) total full in shared programs: 36094 -> 36092 (<.01%) full in affected programs: 10 -> 8 (-20.00%) total sstall in shared programs: 66986 -> 66923 (-0.09%) sstall in affected programs: 1392 -> 1329 (-4.53%) total systall in shared programs: 91244 -> 91072 (-0.19%) systall in affected programs: 1194 -> 1022 (-14.41%) total (sy) in shared programs: 4316 -> 4321 (0.12%) (sy) in affected programs: 19 -> 24 (26.32%) Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16465>	2022-05-19 19:43:36 +00:00
Emma Anholt	1cf0736f1c	freedreno/ir3: Add support for 16-bit nir_texop_lod. Same basic path, just do the rescaling in half float. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16465>	2022-05-19 19:43:36 +00:00
Emma Anholt	a28d2e87d3	turnip: Make RelaxedPrecision-decorated ALU ops 16-bit. Improves gfxbench vk-5-normal performance 5.5%. Fixes: #6346 Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16465>	2022-05-19 19:43:36 +00:00
Emma Anholt	633cf4eca1	freedreno/ir3: Fix 16-bit bit_count. No need to do the 16-bit lowering if it already is. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16465>	2022-05-19 19:43:36 +00:00
Connor Abbott	9f67fa368e	tu: Implement VK_EXT_pipeline_creation_cache_control Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16562>	2022-05-18 13:14:55 +00:00
Connor Abbott	49827da6fa	tu: Implement VK_EXT_pipeline_creation_feedback Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16562>	2022-05-18 13:14:55 +00:00
Connor Abbott	e348f2fb38	tu: Zero-initialize compute driver key Fixes: `05329d7` ("tu: Implement pipeline caching with shared Vulkan cache") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16562>	2022-05-18 13:14:55 +00:00
Danylo Piliaiev	5d377f435b	freedreno/a6xx: Add EARLYPREAMBLE flag to all a6xx_sp_xs_ctrl_reg0 Each shader stage has its own "early preamble" flag. Early preamble is likely an optimization to hide some of latency when loading UBOs into consts in the preamble. Early preamble has the following limitations: - Only shared, a1, and consts regs could be used (accessing other regs would result in GPU fault); - No cat5/cat6, only stc/ldc variants are working; - Values writen to shared regs are not accessible by the rest of the shader; - Instructions before shps are also considered to be a part of early preamble. Note, for all shaders from d3d11 games blob produced preambles compatible with early preamble mode. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15901>	2022-05-18 11:17:47 +00:00
Chia-I Wu	2410993ef6	turnip: fix off-by-one in border color bitset BITSET_FFS reserves 0 for no bit set. BITSET_CLEAR just below cleared the wrong bit. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16567>	2022-05-17 23:29:15 +00:00
Timothy Arceri	d7a071a28f	gallium/drivers: set force_indirect_unrolling_sampler for all required drivers This is set to true for all drivers that have a GLSL level of support lower than 4.00. This matches the rule for setting the GLSL IR option EmitNoIndirectSampler. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16543>	2022-05-17 02:12:21 +00:00
Chia-I Wu	cb50fe7110	ir3: fix mem_ctx for ir3_disasm_info::nir Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6494 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16535>	2022-05-16 22:09:13 +00:00
Chia-I Wu	d3d34ad476	turnip: emit VPC_SO_DISABLE in xfb begin/end SO was always enabled before this change. That meant, after a call to tu_CmdBindTransformFeedbackBuffersEXT to emit VPC_SO_BUFFER_SIZE, any draw call (from the same render pass, in a different render pass, or in a different cmdbuf) could potentially cause writes to the SO buffers regardless of whether the draw is inside xfb begin/end or not. I choose to emit VPC_SO_DISABLE instead of using stateobjs like freedreno does only because it is simpler. It is not clear to me which is more efficient to HW. This also fixes double SO writes for gmem rendering. While tu6_tile_render_begin was careful to disable SO for the draw pass, tu6_emit_tile_select re-enabled it. dEQP-VK.transform_feedback.* still passes. It fixes dEQP-GLES3.functional.transform_feedback.* on angle. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16502>	2022-05-16 20:46:59 +00:00
Chia-I Wu	0b7751babf	turnip: fix sampledImageIntegerSampleCounts It seems fine to advertise msaa in sampledImageIntegerSampleCounts. dEQP-VK.rasterization.rasterization_order_attachment_access.format_integer.* goes from NotSupported to Pass for more test cases. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16487>	2022-05-16 19:26:46 +00:00
David Heidelberg	875643feeb	ci: uprev piglit 2022-05-10 Also document additional piglit failures and crashes with new tests. Multiple changes, mostly notable: - few new tests - traces downloader improvements Reviewed-by: Emma Anholt <emma@anholt.net> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16428>	2022-05-16 06:33:36 +00:00
Chia-I Wu	e9e8c649cd	freedreno/fdperf: support dumping counters This is useful for comparing two workloads. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16488>	2022-05-14 22:18:52 +00:00
Chia-I Wu	267786be60	freedreno/fdperf: make refresh rate configurable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16488>	2022-05-14 22:18:52 +00:00
Chia-I Wu	cd42f63c43	turnip: let modifier takes precedence over TU_DEBUG=noubwc TU_DEBUG=noubwc is not very usable on sway/xwayland where the wsi uses modifiers. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16507>	2022-05-14 21:56:38 +00:00
Connor Abbott	05329d7f9a	tu: Implement pipeline caching with shared Vulkan cache Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	43981f0f58	tu: Include turnip debug flags in pipeline cache UUID Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	d023ae4686	tu: Rewrite cache UUID based on radv Switch to using sha1 so that we can add as many other flags as we need to easily. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	410d59943d	tu: Hash pipeline layout contents Mostly adapted from anv. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	3e3f8b1639	ir3: Add ir3_shader_create_variant() This is similar to ir3_shader_get_variant(), but always compiles the variant from scratch and returns a variant that's owned by the user rather than the shader. We'll need this because when variants are stored in the Vulkan pipeline cache they will outlive their shader. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	ea646ac9af	ir3: Support disabling the pipeline cache Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	c7a6293635	ir3: Add functions to serialize variants This will be used by turnip to create free-floating variant objects that integrate into the Vulkan cache system. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	ceae844794	ir3: Remove ir3_shader_variant::shader Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	91160dab97	tu: Keep original blit shaders separately We won't be able to access them once the ->shader link is gone. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	69f5be8bad	ir3: Add ir3_shader_variant::compiler And replace uses of ->shader->compiler. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	4509b49fb8	ir3: Allocate disasm_info under variant This shouldn't matter much because it gets stolen later, but the shader is going away. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	21e3dd57d3	ir3: Use ir3_shader_variant::type more often Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	f45c86dfb7	ir3, fd, tu: Copy misc. info from ir3_shader to ir3_shader_variant The shader won't be available for deserialized variants, so we need to include all the info we need for compiling variants to be in the variant. Most of the things we dug out of the shader were various bits from nir_shader_info which we move into ir3_shader_variant. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	3e30608ceb	ir3, freedreno, tu: Make ir3_shader_variant store stream_output This reduces the number of uses of ir3_shader which will be gone when we deserialize the variant directly. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Connor Abbott	3cad11d84a	tu: Delete unused tu_clear_blit GS handling This has been unused for a while since we switched to writing the array index in the VS. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16147>	2022-05-13 17:07:05 +00:00
Rob Clark	7292b35da0	freedreno/devices: Add another SKU Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16477>	2022-05-12 22:12:24 +00:00
Rob Clark	a31c34e0d6	freedreno/drm/virtio: Don't try to mmap imported bo's Previously it would fail, and then we'd fall back to the transfer path for things like readpix. But it would spam logcat w/ bo_mmap fail messages. Since gralloc allocated buffers for GPU usage are allocate without _USE_MAPPABLE, let's just assume we can't map imported bo's. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16477>	2022-05-12 22:12:24 +00:00
Rob Clark	62f3e703c8	freedreno/drm: Use DEBUG_GET_ONCE_OPTION() In particular this uses os_get_option() so the android setprop fallback works. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16477>	2022-05-12 22:12:24 +00:00
Danylo Piliaiev	9a11ad7efd	tu: Fix indices of drm_msm_gem_submit_cmd when filling them For some reason CTS doesn't trigger the issue... When submit entry is not filled - kernel says: [drm:msm_ioctl_gem_submit] ERROR invalid type: 00000000 Fixes: `dbae9fa7d8` ("tu: implement sysmem vs gmem autotuner") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16474>	2022-05-12 16:44:09 +00:00
Emma Anholt	b282d504a4	turnip: Add a TU_DEBUG=perf debug option. For doing performance investigation, I often find it useful to have a "are we tripping over any of our performance TODOs?" flag, so add it and use it in a few of the TODOs. This also greatly cleans up the deqp-vk logs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16316>	2022-05-12 01:00:25 +00:00
Jason Ekstrand	352e32e5ba	nir/builder: Add a nir_trim_vector helper This pattern pops up a bunch and the semantics of nir_channels() aren't very convenient much of the time. Let's add a nir_trim_vector() which matches nir_pad_vector(). Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16309>	2022-05-11 14:47:33 +00:00
Danylo Piliaiev	187d3df52c	tu: Do not flush ccu in clear/blits during renderpass For clear/blits ccu flush not only worse for perf, but also messes up flush_bits when executed in a conditional set of commands. We already don't flush for 3d blits. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6419 Fixes: `487aa807bd` ("tu: Rewrite flushing to use barriers") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16352>	2022-05-11 08:07:50 +00:00
Danylo Piliaiev	db69218cbe	tu: Implement VK_EXT_image_view_min_lod Relevant tests: dEQP-VK.texture.mipmap..image_view_min_lod. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16292>	2022-05-09 07:53:41 +00:00
Rob Clark	409b76511c	freedreno/drm-shim: Better iova handling We actually want to use util_vma to handle this. But fortunately core drm-shim alredy does this for mem offset, we can just delete a bunch of code and re-use that. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16250>	2022-05-02 19:50:33 +00:00
Rob Clark	97f4e48717	freedreno/drm-shim: Robustify error handling We can't be so sloppy if we are using drm-shim for fuzzing. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16250>	2022-05-02 19:50:33 +00:00
Rob Clark	d06fc7bb4f	freedreno/drm-shim: Update to latest uapi version Needed for fuzzing virgl drm native context. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16250>	2022-05-02 19:50:33 +00:00
Rob Clark	69edfcaa20	freedreno/drm: Fix bos_on_stack calculation Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16263>	2022-05-01 15:53:10 +00:00
Chia-I Wu	53d87865ca	turnip: fix drm modifier support with planar formats We need to advertise the results of tu6_plane_count and handle VK_IMAGE_ASPECT_MEMORY_PLANE_*_BIT. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6374 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16169>	2022-04-29 22:30:45 +00:00
Danylo Piliaiev	6e6ba85fd9	turnip: Fix tu_debug_flags values clashing Was not caught during rebase... Fixes: `725ae34458` ("turnip: Add debug option to print gmem load/store skip stats") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16246>	2022-04-29 15:09:36 +00:00
Danylo Piliaiev	725ae34458	turnip: Add debug option to print gmem load/store skip stats TU_DEBUG=log_skip_gmem_ops would print stats about skipped gmem/load every second. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15974>	2022-04-29 09:29:55 +00:00
Danylo Piliaiev	0c489f18cb	turnip: Skip load/stores for tiles with no geometry When HW binning is used tile loads/stores could be skipped if there is no geometry in the tile. Loads could be skipped when: - The attachment won't be resolved, otherwise if load is skipped there would be holes in the resolved attachment; - There is no vkCmdClearAttachments afterwards since it is likely a partial clear done via 2d blit (2d blit doesn't produce geometry). Stores could be skipped when: - The attachment was not cleared, which may happen by load_op or vkCmdClearAttachments; - When store is not a resolve. I chose to predicate each load/store separately to allow them to be skipped when only some attachments are cleared or resolved. Gmem loads are moved into separate cs because whether to emit CP_COND_REG_EXEC depends on HW binning being enabled and usage of vkCmdClearAttachments. CP_COND_REG_EXEC predicate could be changed during draw_cs only by perf query, in such case the predicate should be re-emitted. (At the moment it is always re-emitted before stores) Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15974>	2022-04-29 09:29:55 +00:00
Danylo Piliaiev	d5debf0d8a	freedreno/a6xx: Add UNK fields to CP_REG_TEST and CP_COND_REG_EXEC Their meaning is unknown, however they DO change the behavior. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15974>	2022-04-29 09:29:55 +00:00
Emma Anholt	536c8ee96d	nir/lower_tex: Make the adding a 0 LOD to nir_op_tex in the VS optional. This controls the whole lowering of "make tex ops with implicit derivatives on non-implicit-derivative stages be tex ops with an explicit lod of 0 instead", but it's really hard to describe that in a git commit summary. All existing callers get it added except: - nir_to_tgsi which didn't want it. - nouveau, which didn't want it (fixes regressions in shadowcube and shadow2darray with NIR, since the shading languages don't expose txl of those sampler types and thus it's not supported in HW) - optional lowering passes in mesa/st (lower_rect, YUV lowering, etc) Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16156>	2022-04-28 21:26:08 +00:00
Rob Clark	e42cea4db6	freedreno/drm/virtio: Split up large uploads Might be useful if host cached mmaps.. but OTOH we don't want to burn up address space. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	0aab310439	freedreno/drm/virtio: Async ccmd batching This could be a bit more clever an avoid extra memcpy.. but that seems to be in the noise at this point. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	528fa581c1	freedreno/drm/virtio: Pass guest handles to execbuf This is needed for the VIRTGPU_WAIT ioctl to work. TODO we could perhaps limit this, since it is not needed for residency, but only fencing. Ie. we could omit cmdstream, and probably anything that has FD_BO_NOMAP flag. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	cb5f25ea71	freedreno/drm/virtio: Protocol updates This syncs up with the protocol of what eventually landed in virglrender. 1) Move all static params to capset to avoid having to query host (reduce synchronous round trips at startup) 2) Use res_id instead of host_handle.. costs extra hashtable lookups in host during submit, but this lets us (with userspace allocated IOVA) make bo alloc and import completely async. 3) Require userspace allocated IOVA to simplify the protocol and not have to deal with GEM_NEW/GEM_INFO potentially being synchronous. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	fa23ddf258	freedreno/drm/virtio: Fix SHAREABLE+MAPPABLE A shareable bo should also be mappable if FD_BO_NOMAP is not set. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	441f01e778	freedreno/drm/virtio: Drop blocking in host These paths should be corner cases, but still it is a bad idea to block in the host (because it is single threaded), so instead just turn waits in the host into polling in the guest. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	e6b2785811	freedreno/drm/virtio: Use userspace IOVA allocation If supported by host virglrenderer and host kernel, use userspace allocated GPU virtual addresses. This lets us avoid stalling on waiting for response from host kernel until we need to know the host handle (which is usually not until submit time). Handling the async response from host to get host_handle is done thru the submit_queue, so that in the submit path (hot) we do not need any additional synchronization to know that the host_handle is valid. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	ae01c27ac0	freedreno/drm/virtio: Support ring_idx ring_idx zero is the CPU ring, others map to the priority level, as each priority level for a given drm_file on the host kernel side maps to a single fence timeline. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	122cedf98c	freedreno/drm: Move bo common init We'll need this to happen before virtio_bo_new() returns in the next patch. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	d52455a962	freedreno/drm: Close bo handle after bo->destroy() For userspace allocated iova, we want to give the backend a chance to release the iova before the handle is closed. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	4ed346c6fb	freedreno/drm: Drop FD_PP_PGTABLE Unused. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	0004cae638	freedreno/drm/virtio: Appease valgrind Valgrind isn't seeing that the kernel is initializing the caps (or returning an error). Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Rob Clark	d79c71c705	freedreno: Misc indent fixes Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16086>	2022-04-27 23:10:00 +00:00
Emma Anholt	550975f229	turnip: Don't disable LRZ in subpasses after the first in the easy case. If it's the same depth/stencil attachment, then there's no need to turn off LRZ just because the subpass changed. Doesn't help gfxbench perf yet, but will with !16014. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15982>	2022-04-19 18:45:30 +00:00
Emma Anholt	7ba63f516a	turnip: Ignore TOP/BOTTOM_OF_PIPE bits in subpass src/dst dep flags. gfxbench sets these between the gbuffer subpass and the following ones. They should be no-ops as subpass dependencies. gfxbench vk-5-debug perf 12.8 -> 14.6 fps thanks to getting gmem on the gbuffer rendering. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15982>	2022-04-19 18:45:30 +00:00
Emma Anholt	1bcd848816	freedreno/ir3: Call nir_opt_find_array_copies(). gfxbench vk-5-normal has a shader that sampels into a texels[] array at the top, then in a loop calls a GLSL function passing texels[] in by value. This resulted in a copy to a temp inside the loop, which got lowered to scratch stores since it was pretty big. By doing find_array_copies(), we notice that it's equivalent to copy_deref, then get to copy-propagate from the array at the top. Then we only have to set up the scratch array outside of the loop and load_scratch from it in the called function inside the loop. This also causes there to be less spilling, stps 1144 -> 354 and ldps 826->36. However, it doesn't seem to change performance on the test. So, while this seems to be an improvement for the shader, and we could maybe even do better by rematerializing the txl samples inside the loop instead of storing the texture fetches to scratch in the first place, it doesn't currently seem worth pursuing more optimization of this shader. No change on freedreno shader-db. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15982>	2022-04-19 18:45:29 +00:00
Emma Anholt	7ba0c44607	turnip: Add nir_opt_conditional_discard. We can easily do discard_if in the backend without control flow, but it wasn't done in ir3 because the GL frontend already did it for us. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15982>	2022-04-19 18:45:29 +00:00
Emma Anholt	d60282f5d2	freedreno/ir3: Make sched nodes before adding deps. The mark_kill_path() during dep setup follows SSA srcs, which when a phi is involved may include a def from later in the same block, that we hadn't created yet. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15982>	2022-04-19 18:45:29 +00:00
Emma Anholt	ce15bf19fb	turnip: Add TU_DEBUG=layout for dumping image layouts. This was useful for comparing image allocations between gfxbench gl_5_normal and vk_5_normal to see if rendering was generally equivalent (formats, MSAA, UBWC choices, and notably gfxbench vk was choosing DXT5 instead of ASTC on non-android builds!) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15982>	2022-04-19 18:45:29 +00:00
Danylo Piliaiev	2c683519e2	turnip: Try harder to keep LRZ valid and fix a few edge cases Refactored tu6_calculate_lrz_state and added comments. 1) If there is no depth write we could keep LRZ valid with any compare op, we just have to temporary disable LRZ for incompatible ops in such case. 2) Found that VK_COMPARE_OP_EQUAL is not compatible with LRZ, and since it doesn't change LRZ buffer - LRZ could be just temporary disabled. This fixes rendering of grass/trees in PUBG mobile on angle. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6127 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16014>	2022-04-19 18:06:58 +00:00
illiliti	67af7e2b40	Use proper types for meson objects Fix invalid usage of meson objects which violates official meson specification and thus breaks muon, an implementation of meson written in C. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15715>	2022-04-18 13:03:08 +03:00
Emma Anholt	835704e669	turnip: Move autotune buffers to suballoc. Now the ANGLE trex_200 trace replay does a single BO allocation at startup for autotune results instead of one per frame (~350 for the whole replay). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Emma Anholt	7c636acd53	turnip: Get autotune off of ralloc destructors. We've wanted to remove destructors from ralloc's API for a long time (it's an extra storage cost per ralloc for a rarely-used feature), and for the suballoc change we'd need to spend more storage on storing the tu_device pointer per result since destructors don't get anything else but the pointer passed into them. Fixes use-after-frees: ================================================================= ==2383==ERROR: AddressSanitizer: heap-use-after-free on address 0xffff88fe1940 at pc 0xffff934f427c bp 0xfffff5481e90 sp 0xfffff5481ea8 WRITE of size 8 at 0xffff88fe1940 thread T0 #0 0xffff934f4278 in list_del ../src/util/list.h:108 #1 0xffff934f4278 in result_destructor ../src/freedreno/vulkan/tu_autotune.c:237 #2 0xffff9377793c in unsafe_free ../src/util/ralloc.c:300 #3 0xffff9377793c in ralloc_free ../src/util/ralloc.c:265 #4 0xffff934f4368 in history_destructor ../src/freedreno/vulkan/tu_autotune.c:229 #5 0xffff9377793c in unsafe_free ../src/util/ralloc.c:300 #6 0xffff9377793c in ralloc_free ../src/util/ralloc.c:265 #7 0xffff934f5990 in tu_autotune_on_submit ../src/freedreno/vulkan/tu_autotune.c:442 [...] 0xffff88fe1940 is located 80 bytes inside of 112-byte region [0xffff88fe18f0,0xffff88fe1960) freed by thread T0 here: #0 0xffff9c1c90d8 in __interceptor_free ../../../../src/libsanitizer/asan/asan_malloc_linux.cpp:127 #1 0xffff934f4368 in history_destructor ../src/freedreno/vulkan/tu_autotune.c:229 #2 0xffff9377793c in unsafe_free ../src/util/ralloc.c:300 #3 0xffff9377793c in ralloc_free ../src/util/ralloc.c:265 #4 0xffff934f5990 in tu_autotune_on_submit ../src/freedreno/vulkan/tu_autotune.c:442 #5 0xffff935cf2ac in tu_queue_submit_locked ../src/freedreno/vulkan/tu_drm.c:997 [...] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Emma Anholt	435d4f08b2	turnip: Reduce the pipeline's CS allocation a bit. We don't return unused space to the suballocator, so it's a little useful to limit how much we overallocate to reduce memory footprint. I took a look through the tu_cs_emit_array() calls and accounted for a couple of them in the variant-specific space calculation, then dropped the base allocation by factors of 2 until we started throwing asserts. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Emma Anholt	58f6331eec	turnip: Skip telling the kernel the BO list when we don't need any. In fencing, we sometimes do a dummy submit with no nr_cmds. If we don't have commands to execute, we don't need to pin or fence any BOs either. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Emma Anholt	dc3203b087	turnip: Sub-allocate pipelines out of a device-global BO pool. Allocating a BO for each pipeline meant that for apps with many pipelines (such as Asphalt9 under ANGLE), we would end up spending too much time in the kernel tracking the BO references. Looking at CS:Source on zink, before we had 85 BOs for the pipelines for a total of 1036 kb, and now we have 7 BOs for a total of 896 kb. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Emma Anholt	e0fbdd3eda	turnip: Stop allocating unused pvtmem space in the pipeline CS. The pvtmem was split off to a separate read/write BO. Fixes: `931ad19a18` ("turnip: make cmdstream bo's read-only to GPU") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Emma Anholt	80c44a6626	turnip: Track refcounts on BOs in kgsl as well. I'm going to be using the BO refcount for the pipeline and autotune buffer suballocation. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Tomeu Vizoso	9d5fa59322	Revert "ci/freedreno: Disable a618 jobs" This reverts commit `96e17287b4`. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15791>	2022-04-08 14:07:31 +00:00
Connor Abbott	32af90d96f	freedreno/a6xx: Fix SP_DS_CTRL_REG0 definition Bit 20 isn't actually MERGEDREGS, the mode for the entire geometry pipeline is controlled by SP_VS_CTRL_REG0::MERGEDREGS and it appears to be something preamble-related instead since writing any register in the preamble hangs if it's set. This fixes those hangs on freedreno and turnip since we no longer set it. Fixes: `fccc35c2de` ("ir3: Add preamble optimization pass") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15801>	2022-04-08 04:40:17 +00:00
Emma Anholt	75a4e3f0e8	Revert "ci/freedreno: Reduce concurrency when replaying traces on a630" This reverts commit `d948f32365`. I think that fixing the timeout will have resolved this problem. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15805>	2022-04-08 00:24:20 +00:00
Emma Anholt	d51aea7f57	freedreno: Fix the cpu-prep wait to be "infinite". We don't need to restrict our timeout to 5 seconds, because the kernel's hangcheck will ensure that the wait completes in finite time if the GPU gets wedged. If the GPU is making progress, we don't want to time out early and have pipe_transfer_map() return an error, causing glReadPixels() to throw a confusing GL_OOM even though we're not out memory. The INFINITE arg to this function isn't actually infinite, it's limited to an hour. But an hour of GPU processing to wait on is probably plenty. This 5s timeout has caused problems with the CTS on freedreno at high parallelism, and I suspect is the cause of recent issues in the closed traces replay jobs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15805>	2022-04-08 00:24:20 +00:00
Danylo Piliaiev	dde1623ed2	turnip: Implement VK_EXT_primitives_generated_query Similar to pipeline statistics but done for a single counter. We use REG_A6XX_RBBM_PRIMCTR_7 to get generated primitives and not PRIMCTR_8 because PRIMCTR_7 counts pre-clipped prims while PRIMCTR_8 counts them after clipping. OpenGL spec for GL_PRIMITIVES_GENERATED says: "Subsequent rendering will increment the counter once for every vertex that is emitted from the geometry shader, or from the vertex shader if no geometry shader is present." Passes tests: dEQP-VK.transform_feedback.primitives_generated_query.* Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15746>	2022-04-07 08:01:59 +00:00
Emma Anholt	3cf28d16f6	ci: Uprev deqp-runner and piglit. deqp-runner uprevved to reduce memory usage on HW runners, let us experiment with shader cache on tmpfs, and hopefully provide a tool for virgl to be able to plausibly run piglit under crosvm instead of vtest. piglit uprevved to avoid a flake in softpipe in glx-multithread-texture, and improve performance of the test, too. This also brings in the fbo-blending-format-quirks fix to properly initialize the buffers, fixing some fails/flakes. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15419>	2022-04-06 20:43:53 +00:00
Emma Anholt	cd39523c53	ci/turnip: Drop xfails for create_list_modifiers. These were fixed in `5ce06f8474` ("turnip: Use correct type for OUTARRAY in FormatProperties2"), but they aren't included in the pre-merge CI run. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15419>	2022-04-06 20:43:52 +00:00
Danylo Piliaiev	25202b5861	ci/freedreno: Add fractional test of forced unaligned gmem store Unaligned gmem store is a mostly untested path since most of the times faster path is chosen. We have to force unaligned store to really test it. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15773>	2022-04-06 19:53:27 +00:00
Danylo Piliaiev	a5a97f0b77	turnip: Fix subpassLoad from CUBE input attachments Cube descriptors require a different sampling instruction in shader, however we don't know whether image is a cube or not until the start of a renderpass. We have to patch the descriptor to make it compatible with how it is sampled in shader. For the reference subpassLoad is currently translated into isaml.a Blob v615 also doesn't handle this case correctly. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15734>	2022-04-06 19:42:30 +03:00
Danylo Piliaiev	6c18602164	turnip: Add "unaligned_store" debug option to better test gmem stores Unaligned store is incredibly rare in CTS, we have to force it to actually test it. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15532>	2022-04-06 08:44:28 +00:00
Danylo Piliaiev	e255305e84	turnip: Ignore aspectMask for D32S8 framebuffer attachment Vulkan spec says: "When an image view of a depth/stencil image is used as a depth/stencil framebuffer attachment, the aspectMask is ignored and both depth and stencil image subresources are used." Since we use two planes for D32S8 format we have to add a special case for depth in addition to already existing case for stencil. Fixes hang in CTS: dEQP-VK.renderpass.depth_stencil_write_conditions.stencil_kill_write_d32sf_s8ui Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15532>	2022-04-06 08:44:28 +00:00
Danylo Piliaiev	72716993b2	turnip: Correctly store separate stencil in gmem store - When resolving d32s8 to s8 we stored stencil with a wrong format. - For unaligned multi-sample store we used wrong gmem offset for stencil. If unaligined store is forced this change fixes a hang in: dEQP-VK.renderpass2.depth_stencil_resolve.image_2d_32_32.samples_2.d32_sfloat_s8_uint_separate_layouts.compatibility_depth_zero_stencil_zero_testing_stencil Fixes: `b157a5d0d6` ("tu: Implement non-aligned multisample GMEM STORE_OP_STORE") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15532>	2022-04-06 08:44:28 +00:00
Jason Ekstrand	bdf52654ac	turnip: Enable VK_EXT_debug_utils It's implemented in common code as long as you use vk_command_buffer. Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15560>	2022-04-06 01:18:23 +00:00
Connor Abbott	b91b90c256	tu: Expose VK_KHR_maintenance4 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15488>	2022-04-05 17:46:35 +00:00
Connor Abbott	5eb63d825f	tu: Remove tu_pipeline::layout This makes it more obvious that the layout is never used after creating the pipeline, which is required by VK_KHR_maintenance4. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15488>	2022-04-05 17:46:35 +00:00
Connor Abbott	7455a7a44c	tu: Fill out maxBufferSize It seems this is really a workaround for silly issues in GetBufferMemoryRequirements when you ask for a really large buffer. Just expose the maximum possible size ATM. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15488>	2022-04-05 17:46:35 +00:00
Connor Abbott	d1762b7df0	tu: Implement GetDevice*MemoryRequirements() Based mostly on anv, which is a bit more optimized than radv - we at allocate the image on the stack. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15488>	2022-04-05 17:46:35 +00:00
Omar Akkila	4208895175	ci: bump VK-GL-CTS to 1.3.1.1 Signed-off-by: Omar Akkila <omar.akkila@collabora.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15668>	2022-04-04 23:04:33 +00:00
Tomeu Vizoso	d948f32365	ci/freedreno: Reduce concurrency when replaying traces on a630 We are running out of memory when replaying traces sometimes, reduce the number of concurrent retrace processes. Mesa: User error: GL_OUT_OF_MEMORY in glReadPixels warning: GL_OUT_OF_MEMORY while getting snapshot 1074335: warning: failed to get snapshot https://gitlab.freedesktop.org/mesa/mesa/-/jobs/20519522 Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15682>	2022-04-04 12:48:40 +00:00
Emma Anholt	e1de9b0de5	turnip: Allow image access on swapped formats. This is apparently something that gamescope would like to have, and the CTS's test coverage is happy with it. Fixes: #6011 (we hope) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15293>	2022-04-02 19:55:40 +00:00
Emma Anholt	4cd51efedb	turnip: Disable tiling on 1D images. If we know the height is 1, then it would be a waste to align each miplevel to tile height. For non-mipmapped textures, it doesn't save us memory (since you still align to 4 on the last miplevel), but it should be better cache locality by not loading those unused lines. Incidentally, this gets us some more coverage of swap != WZYX cases in CTS tests, which often use optimal tiling without also testing linear. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15293>	2022-04-02 19:55:40 +00:00
Emma Anholt	71fcb751eb	freedreno/a6xx: Set the color_swap field for storage descriptors. This field does appear to work as expected: with 1D/1DArray turnip storage images switched to be always linear, it fixes the dEQP-VK.image.store tests using a color swapped format (once we allow color swap). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15293>	2022-04-02 19:55:40 +00:00
Emma Anholt	51b04a7dfb	turnip: Add support for VK_KHR_format_feature_flags2. This reports all of our storage formats as supporting read/write without format, since we don't have any in-shader format conversions. Similarly, shadow comparisons were already supported on all the depth formats. This extension is required for VK 1.3. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15293>	2022-04-02 19:55:40 +00:00
Danylo Piliaiev	5ce06f8474	turnip: Use correct type for OUTARRAY in FormatProperties2 Fixes: `799a9db24c` ("turnip: Stop using VK_OUTARRAY_MAKE()") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15694>	2022-04-02 09:51:45 +00:00
Vinod Koul	28ae397be1	freedreno/registers: update dsi registers to support dsc Display Stream compression (DSC) compresses the display stream in host which is later decoded by panel. This requires addition of 3 new DSI registers to support DSC over DSI. Signed-off-by: Vinod Koul <vkoul@kernel.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14967>	2022-04-01 21:56:40 +00:00
Rajnesh Kanwal	d5405c1608	vulkan: Move common format function to vulkan/util/vk_format.h Moving duplicate vk_format helper functions to common vulkan/util/vk_format.h and also renaming vk_format_get_component_size_in_bits to match how amd and freedreno name the same function. Not moving this function to common code as freedreno's implementation is a bit different. Signed-off-by: Rajnesh Kanwal <rajnesh.kanwal@imgtec.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15696>	2022-03-31 17:18:22 +00:00
Danylo Piliaiev	10734fb748	turnip: enable has_ccu_flush_bug workaround for a660 It seems that a660 has the same bug. Without the workaround there are a lot of flakes with depth-stencil tests, e.g. in: dEQP-VK.pipeline.extended_dynamic_state.* dEQP-VK.renderpass.depth_stencil_write_conditions.* dEQP-VK.pipeline.stencil.format.d24_unorm_s8_uint.states.* Or guaranteed failures like of: dEQP-VK.pipeline.render_to_image.core.2d.huge.width.r8g8b8a8_unorm_d32_sfloat_s8_uint Enabling the workaround fixes all of them. cc: mesa-stable Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15548>	2022-03-29 08:34:18 +00:00
Connor Abbott	0b0b9274b6	freedreno/ci: Fix skip comment This test was never supposed to be skipped, and the referenced commit just exposed a bug in turnip fixed by the previous commit. It was hanging due to a CTS bug making the submit take way too long, which will be fixed once the CTS change lands. Also, add it to the a630 skips. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15563>	2022-03-28 17:16:54 +00:00
Connor Abbott	9d081d7561	tu: Correctly handle VK_IMAGE_CREATE_EXTENDED_USAGE_BIT In this case we should relax checks based on the format, since the user will be responsible for them when creating an image view. This gets dEQP-VK.image.sample_texture._bit_compressed_format_ not skipping again after VK-GL-CTS 736eec57dc0c ("Fix checkSupport in compressed texture sampling tests"). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15563>	2022-03-28 17:16:54 +00:00
Danylo Piliaiev	37939e9c54	turnip: Fix the lack of WFM before indirect draws We have to add WFM to pending bits when we are flushing into CP for indirect draw to know when they should apply WFM workaround. Fixes CTS tests: dEQP-VK.draw.renderpass.indirect_draw._data_from_compute.indirect_draw_count Fixes: `abf0ae014a` ("tu: Properly handle waiting on an earlier pipeline stage") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15577>	2022-03-28 16:09:07 +00:00
Boris Brezillon	799a9db24c	turnip: Stop using VK_OUTARRAY_MAKE() We're trying to replace VK_OUTARRAY_MAKE() by VK_OUTARRAY_MAKE_TYPED() so people don't get tempted to use it and make things incompatible with MSVC (which doesn't support typeof()). Suggested-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15522>	2022-03-25 11:00:02 +00:00
Rob Clark	c0f52f08a1	freedreno/ci: Update a306 expectations These have started to flakey UnexpectedPass somewhere along the way. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	802f4da5ee	freedreno/drm: Add virtio backend Add a new backend to enable using native driver in a VM guest, via a new virtgpu context type which (indirectly) makes host kernel interface available in guest and handles the details of mapping buffers to guest, etc. Note that fence-fd's are currently a bit awkward, in that they get signaled by the guest kernel driver (drm/virtio) once virglrenderer in the host has processed the execbuf, not when host kernel has signaled the submit fence. For passing buffers to the host (virtio-wl) the egl context in virglrenderer is used to create a fence on the host side. But use of out-fence-fd's in guest could have slightly unexpected results. For this reason we limit all submitqueues to default priority (so they cannot be preepmted by host egl context). AFAICT virgl and venus have a similar problem, which will eventually be solveable once we have RESOURCE_CREATE_SYNC. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	2200d674e4	freedreno/drm: Reorder device destroy Call backend specific cleanup fxn earlier. This is needed if the backend has things like bo's to delete, otherwise the handle_table will already be destroyed causing problems in bo_del() Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	ea339137b0	freedreno/drm: Extract out "softpin" submit/ringbuffer base class We are going to want basically the identical thing, other than flush_submit_list, for virtio backend. Now that we've moved various other dependencies into the base classes, extract out an abstract base class for submit/ringbuffer. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	72a427244f	freedreno/drm: Move ring_pool slab parent to base Prep to move most of sp submit/ringbuffer to something that can be re-used by virtio backend. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	877f9049c3	freedreno/drm: Move bo idx to base The virtio backend will want this too, and it will make it easier to share most of the submit/ringbuffer implementation with the virtio backend. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	2ac9b23f78	freedreno/drm: Move submit_queue to base The virtio backend will want this too. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	88a10c6216	freedreno/drm: Avoid CPU_PREP ioctl if bo is idle With userspace fences, if we know definitely that the buffer is idle (which implies that it is not shared with other processes, etc), then skip the ioctl. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	9bcc983256	freedreno/drm: Add fd_bo_upload() There are some buffers that we mmap just to write to them a single time. Add the possibility of the drm backend to provide an alternate upload path to avoid these mmap's. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	115518ec35	freedreno/drm: Add FD_BO_SHARED hint With the virtio backend we will need to pass an extra flag when allocating buffers that will be shared cross-device (such as with virtio-wl for passing between host and guest) Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	f846181fe5	freedreno/drm: Add FD_BO_NOMAP hint Add a hint for buffers that we won't need to mmap. With the virtio backend, virglrenderer needs to create a dmabuf fd for mapping into the host, which we want to avoid when possible. Low hanging fruit is to use this hint for anything tiled/ubwc. There are probably more bo's that can be flagged as such. TODO add fd_bo_upload() for memcpy to bo.. this would be useful for uploads, for example, shaders which we just write once and never touch again.. for virtio this could be implemented with a TRANSFER_TO_HOST ioctl. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	598405c91f	freedreno/drm: Rework bo creation path Decoupling handle and fd_bo creation simplifies things for "normal" drm drivers, avoiding duplication for the create vs import paths. But this is awkward for the virtio backend when wants to do multiple things in the same guest<->host round trip. So instead, split the paths in the interface backend and move the code sharing for the two different paths into the msm backend itself. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	9ea36968d3	freedreno/drm: Add fd_device_open() helper Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	2bc815878c	freedreno/drm: Split msm backend into subdir Let's keep things a bit better organized when we add a new backend. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Tomeu Vizoso	9f43dac0ca	ci/freedreno: Increase console timeout for perf jobs Piglit is very sparse in its status output and downloads of big traces can take a while. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15527>	2022-03-24 05:33:54 +00:00
Tomeu Vizoso	d0e99e566f	ci/freedreno: Update checksum for GolfWithYourFriends trace The MR below changed the rendering slightly and the checksum isn't valid any more: "ir3, turnip, freedreno: Shader preambles" https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148 Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15526>	2022-03-24 00:04:20 +00:00
Danylo Piliaiev	5d151ddfba	turnip: Disallow non-linear tiling when casting R8G8 to other fmts R8G8 have a different block width/height and height alignment from other formats that would normally be compatible (like R16), and so if we are trying to, for example, sample R16 as R8G8 we need to demote to linear. Follows the fix in Freedreno: `b97e3bb2e1` Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15465>	2022-03-22 13:47:21 +00:00
Danylo Piliaiev	a70b197741	turnip: Force linear mode for non-ubwc R8G8 formats Non-UBWC tiled R8G8 is probably buggy since media formats are always either linear or UBWC. There is no simple test to reproduce the bug. However it was observed in the wild leading to an unrecoverable hang on a650/a660. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5926 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15465>	2022-03-22 13:47:21 +00:00
Andrey Konovalov	ed2f496ce4	ir3: set local_size for shaders of MESA_SHADER_KERNEL type ir3_compile_shader_nir() should set local_size[] and local_size_variable fields not only for compute shaders, but for the OpenCL kernels too. v2: use gl_shader_stage_is_compute() instead of explicit comparison with MESA_SHADER_[COMPUTE,KERNEL]. Signed-off-by: Andrey Konovalov <andrey.konovalov@linaro.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14863>	2022-03-18 23:20:25 +00:00
Emma Anholt	f831ba238f	ci/turnip: Increase the hangcheck timer to 2 seconds. We get a lot of useful coverage from running graphicsfuzz with spilling enabled, but it's also pretty slow and can cause intermittent hangcheck failures. I thought I'd categorized them when merging !14839 (device loss on reset), but it looks like not all of them and we're now more likely to have flakes take out the whole test run when a single flake makes the rest of the caselist a flake. This is a little unfortunate in that it means our test environment is not the same as a stock system you would want to run deqp on to submit conformance, but I think it's an improvement in the test maintenance work vs needing to fix things up later. We have some other tests besides turnip that can trigger hangchecks which we might also like this increase for (some disabled traces, for example). However, freedreno GL has a 5-second timeout waiting for idle when mapping, and a couple of 2-second timeouts in a row can result in spurious failures in other tests! Fixes: #6163 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15435>	2022-03-18 19:07:24 +00:00
Connor Abbott	fc381fa1e3	tu: Actually expose VK_EXT_texel_buffer_alignment Oops... Fixes: `3d04c435` ("tu: Trivially implement VK_EXT_texel_buffer_alignment") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15451>	2022-03-18 18:30:20 +00:00
Jason Ekstrand	2a779f98dc	turnip: Drop tu_legacy.c The remaining three helpers all have helpers in the common code. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15459>	2022-03-18 11:19:08 -05:00
Connor Abbott	3d04c43576	tu: Trivially implement VK_EXT_texel_buffer_alignment The previous alignment of 64 bytes, which we got from the blob, indicates that single-texel alignment isn't supported. So just do a trivial no-op implementation that returns the same alignment as before. This matches what newer blobs that expose this extension do. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15427>	2022-03-17 20:45:19 +00:00
Tomeu Vizoso	96e17287b4	ci/freedreno: Disable a618 jobs Some of these machines are experiencing networking problems currently. Disable for now so people aren't blocked. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15430>	2022-03-17 17:43:06 +00:00
Connor Abbott	072fdcabcd	tu: Enable UniformBufferUpdateAfterBind UBOs are now read at run-time via the preamble so this can be enabled. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	9932ca8a3f	ir3, turnip: Use ldc.k to push UBOs This reuses the same UBO analysis to do the pushing in the shader preamble via the ldc.k instruction instead of in the driver via CP_LOAD_STATE6. The const_data UBO is exempted as it uses a different codepath that isn't as critical. Don't do this on gallium because there are some regressions. Aztec Ruins in particular regresses a bit, and nothing I've benchmarked benefits. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	221a912b8c	ir3: Refactor ir3_compiler_create() to take an options struct This will let us add more options without creating too much churn. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	acba08b58f	ir3: Implement and document ldc.k Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	fccc35c2de	ir3: Add preamble optimization pass Now that everything is plumbed through, we can tie it together. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	986f7adfee	ir3: Don't include preamble instructions in stats Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	42e21c751b	ir3: Insert frag coord code after preamble To match the pre-preamble behavior, and so that we can better schedule it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	b6fe69d855	ir3: Support prefetching with preambles Since the NIR pass runs very late, it needs to be aware of preambles, and when creating the instruction we need to move it to the start block so that RA doesn't overwrite it in the preamble. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	00d7ad334a	ir3/legalize: Handle inserting (ei) with preamble Make sure that shaders with a preamble are still considered early-release so that we don't regress them. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	ccc64b7e00	ir3: Plumb through store_uniform_ir3 intrinsic Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	944f4e6f8a	ir3: Better assemble/disassemble stc Add in the type, even though it turns out to not be that useful. Add in support for assembling it. Add some notes based on computerator experiments. And add support for the indirect a1.x mode that's needed for storing c64.x and later. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	3244e659e0	ir3: Implement basic shader preamble intrinsics These will be used to implement the ir3-specific shader preamble lowering in NIR. shps is conceptually similar to getone (although it technically can't be duplicated) and shpe is similar to other barriers, since it has to happen after any stores to the constant file in the preamble. Add NIR intrinsics and plumbs them through ir3. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	7ad57d9af1	ir3: Don't count reserved user consts in ubo_state::size Previously we included the reserved user consts (for Vulkan push constants) as part of the pushed UBO contents, but that led to a problem because when calculating the worst-case space for UBOs we didn't factor in the reserved user consts. We'll have the same problem when doing the same thing in the preamble optimization pass. Stop including the reserved size in ubo_state::size, and have ir3_setup_consts() add it in instead, so we won't forget to add it anywhere. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	e274354204	ir3: Fix scan.macro valid flags Right now we don't support any. We could probably support const, but that's not worth it because we could optimize a reduce of a const better anyway. Fixes: `1a78604d20` ("ir3: Add support for subgroup arithmetic") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Emma Anholt	9e9a366cad	ci/turnip: Drop alpha_to-coverage flake note on a618. It's only ever been seen on a630. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14839>	2022-03-16 19:28:04 +00:00
Emma Anholt	2f25d16653	turnip: Use the DRM or KGSL GPU reset status ioctls to report device loss. ANGLE-on-venus-on-turnip and zink-on-turnip want real data here for EGL's reset tests. This required moving the remaining GPU-reset-causing tests from flakes or xfails to skips. Otherwise, the rest of the caselist associated with them ends up being marked as fails as well. The alternative would be to put these tests in their own test groups with tests_per_group = 1, but that didn't seem worth the effort. Or, we could finally do something with https://gitlab.freedesktop.org/anholt/deqp-runner/-/issues/14. Fixes: #5955 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14839>	2022-03-16 19:28:04 +00:00
Emma Anholt	add2121969	ci/freedreno: Remove some xfails for tests that now skip. The last CTS uprev correctly turned them into NotSupported. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14839>	2022-03-16 19:28:04 +00:00
Emma Anholt	ffa9438183	ci/freedreno: Drop the skips of spirv_ids_abuse in pre-merge. The crash was fixed in `62a7acee93`, and runtime of the tests locally is 5-17s each with a hot shader cache, 11-25s without. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14839>	2022-03-16 19:28:04 +00:00
Emma Anholt	5dd74533b2	ci: Drop skips of spv-stable-pillars-volatile-nontemporal-store The runtime was fixed in VK-GL-CTS 7cc65f6c02276767407233e74c7174d88dab7919. Now it's .6s on lvp, 20ms on turnip a618. Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14839>	2022-03-16 19:28:04 +00:00
Emma Anholt	3b90d3997a	turnip: use vk_shader_module_to_nir(). Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15305>	2022-03-15 23:13:16 +00:00
Connor Abbott	a83ea0253f	ir3: Use isam for bindless readonly ssbo loads Since this isn't hooked up in gallium, only do it for bindless for now. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15288>	2022-03-15 21:36:38 +00:00
Connor Abbott	625ebb977f	ir3: Actually use wrmask in emit_sam I noticed that isam emitted for SSBO loads was writing all 4 components, which this fixes. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15288>	2022-03-15 21:36:38 +00:00
Connor Abbott	5f020bcc8d	ir3/lower_spill: Fix corner case with oob offsets If the base register is killed, it may be reused as the destination of a ldp. In that case we should just skip resetting it afterwards. Fixes regressions in dEQP-VK.ssbo.layout.random.scalar.38 later. Fixes: `9912c61362` ("ir3/spill: Support larger spill slot offset") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15288>	2022-03-15 21:36:38 +00:00
Connor Abbott	6304c7cb82	ir3/parser: Don't use right recursion This fixes memory exhaustion errors when doing shader replacement with very large shaders. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15288>	2022-03-15 21:36:38 +00:00
Connor Abbott	f9d9c0172a	tu: Add an extra storage descriptor for isam Based on a workaround the blob does. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15288>	2022-03-15 21:36:38 +00:00
Connor Abbott	1ec3d39407	tu: Handle UBO/SSBO descriptors with different sizes We reuse the otherwise-unused offset channel to represent the array stride, so that reindexing works properly. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15288>	2022-03-15 21:36:38 +00:00
Connor Abbott	5ba3ea1eb3	tu: Rewrite dynamic descriptor handling We need to prepare for storage buffers having different sizes from uniform buffers. This switches dynamic_offset_offset to have units of bytes, the same as offset, and as a nice bonus we can more easily combine the dynamic and non-dynamic paths in various different places. This also entails rewriting the code that patches dynamic descriptors, since we can no longer assume a linear mapping between indices in dynamicOffsets and descriptor locations which the previous approach heavily relied on. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15288>	2022-03-15 21:36:38 +00:00
Emma Anholt	eb9b092001	turnip: Enable VK_EXT_display_control using the common code. It's all implemented now, so we can turn it back on. Passes 15/16 tests when X11 isn't running, and 1/16 when it is, with no failures in either mode. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15351>	2022-03-15 20:08:58 +00:00
Rob Clark	05d6877235	freedreno/ir3: Don't try re-swapping cat3 srcs This can lead us to endless loops of "progress".. Note fixes commit commit really just exposed an existing problem. Fixes: `9c9e8c3349` ("nir: Reorder ffma and fsub combining") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6133 Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15336>	2022-03-12 16:42:00 +00:00
Rob Clark	fa59556e1a	freedreno/ir3: Remove unused define Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15336>	2022-03-12 16:42:00 +00:00
Tapani Pälli	adea096029	ci: update various ci result files Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12936>	2022-03-11 09:58:28 +00:00
Connor Abbott	cdee38a57b	tu: Expose subgroup arithmetic Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	1a78604d20	ir3: Add support for subgroup arithmetic Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	a433db60c1	ir3: Track physical edges when inserting (ss) for shared regs Normally this wouldn't matter, but it will matter for the upcoming scan macro because the running tally is communicated through a shared register across a physical edge. It may also matter if a live-range split occurs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	72b32d83fb	ir3/spill: Mark reload destination as early-clobber Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	2ff5826f09	ir3/ra: Add IR3_REG_EARLY_CLOBBER We'll need this to model the subgroup reduction macros. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	34803d15ab	ir3/ra: Add proper support for multiple destinations We weren't considering the other destinations when allocating a destination, so we could allocate overlapping destinations. This wasn't done before because we never had a need for it, but the subgroup reduction macros will need it. The trickiest part of this is that we have to rewrite the compress_regs_left fallback, because we may have to move around the other already-allocated destinations. We now have a list of destinations to (re)allocate in addition to the popped live intervals. For the rest of the destination handling, we can just bail out if the proposed spot for something overlaps another destination, but for the fallback we have to handle all the cases gracefully. I also added support for odd combinations of multiple destinations where some of them are tied, which we'll use in the next commit to handle early-clobber destinations and which will actually be used because one of the destinations of the subgroup reduction macro will be early-clobber. The result is that the order of intervals to allocate is now a lot more complicated. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	ab0ed4ff3f	ir3/ra: Sanitize parallel copy flags better For pcopies we only care about the register's type, i.e. whether its a half-register and whether it's an array (plus its size). Copying over other flags like IR3_REG_RELATIV just leads to sadness and validator assertions. Fixes: `0ffcb19b9d` ("ir3: Rewrite register allocation") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	0135660dfc	ir3/ra: Fix ra_foreach_dst_n Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	077d07a983	ir3/ra: Fix tied destination handling with multiple destinations Before, we were careful to 1. Get the source physreg. 2. Allocate the destination. 3. Insert a copy with the source being the physreg from step 1. and this guaranteed that if the tied source were moved in step 2 we'd still insert a copy from the correct place. However this won't work with multiple destinations because an earlier destination could've already moved the tied source around. Instead flip steps 2 and 3 (we'll insert the copy before we allocate the interval, but that's ok) and run the first two steps in a separate loop before any destinations are allocated. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	9cc42242d5	ir3/sched: Support multiple destinations Note: this is a behavior change for arrays, because it will count the entire array instead of just the components written in the register pressure calculation. However this is more accurate since this matches how RA works. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	a6be8fd0ea	ir3/dce: Support multiple destinations Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	89217f636c	ir3/cp_postsched: Support multiple destinations Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Danylo Piliaiev	c4703cd846	tu: Implement VK_EXT_depth_clip_control Since negativeOneToOne is a static property of the pipeline and viewport state could be dynamic, we have to defer viewport state emission until negativeOneToOne value is known. See: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6070 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14363>	2022-03-10 11:08:50 +02:00
Danylo Piliaiev	2e878293f4	turnip: Make autotuner work with reusable command buffers To achieve it each command buffer now has its own GPU memory. However the BOs usage by autotuner is not optimal, the ideal pattern would be to use some memory pool to suballocate small GPU memory chunks, since most command buffers have only a few renderpasses. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5990 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14996>	2022-03-09 12:56:31 +00:00
Danylo Piliaiev	2cd30266f1	tu: Refactor VS DECODE/DEST to be emitted in two pkt4 Refactor to emit VFD_DECODE and VFD_DEST_CNTL in two packets regardless of attribute count. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14584>	2022-03-09 08:21:40 +00:00
Rob Clark	711f0d1df4	turnip: Don't call getenv() directly I noticed it was using getenv directly when I tried to use 'setprop mesa.tu.debug ..' on android. Use os_get_option() instead so we get sysprop fallback on android. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15289>	2022-03-09 00:22:36 +00:00
Timur Kristóf	64acec0ef9	nir: Fix lowering terminology of compute system values: "from"->"to". This is to match other NIR terminology. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15103>	2022-03-08 17:36:31 +00:00
Ilia Mirkin	4e45847d0a	freedreno: add a420 deqp-runner files This doesn't actually get run in CI, but this helps track outstanding issues / expectations. This is from a run on my IFC6540 with A420. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15251>	2022-03-08 01:23:05 +00:00
Ilia Mirkin	0277f491ae	freedreno/ir3: disable conversion folding on a4xx Experiments suggest that e.g. add.u r0.y, hr0.x, hr0.y will result in the summed value in both the high and low words of r0.y. This only happens with odd registers, not even ones (r0.x works fine). Seen in the bit_count lowering (which turns out to be unnecessary, but this is still a larger problem). Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15251>	2022-03-08 01:23:05 +00:00
Ilia Mirkin	4684425150	freedreno/ir3: no need to count bits 16b at a time for a4xx This also works out nicely since a4xx has some sort of problem with the 16b-based lowering. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15251>	2022-03-08 01:23:05 +00:00
Ilia Mirkin	d11543ec52	freedreno/a4xx: extend astc and tg4 workarounds to compute shaders Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15251>	2022-03-08 01:23:05 +00:00
Danylo Piliaiev	e2fc99b188	turnip: Add "rast_order" debug option to force rast order access Enables rasterization order attachment access for all pipelines, see VK_ARM_rasterization_order_attachment_access for details. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15262>	2022-03-07 17:07:18 +00:00
Ilia Mirkin	e42a8a5b92	a4xx: add emission of compute state, and compute dispatch Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14794>	2022-03-05 03:21:05 -05:00
Ilia Mirkin	aac7028b58	freedreno/ir3: support a4xx compute differences Mainly the workgroup id comes injected via consts by the hardware (or CP), and we must make room for it, otherwise the driver won't know where to put it. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14794>	2022-03-05 03:21:05 -05:00
Ilia Mirkin	6fb5e64ead	freedreno/ir3: support a4xx in load/store buffer/image emission Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14794>	2022-03-05 03:21:05 -05:00
Rob Clark	e9cd4fba6f	freedreno/perfetto+fdperf: Set SYSPROF param No need to check error return and deal with older kernels. Older kernels won't have this param but their default behavior allows for systemwide perfcntr collection. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15236>	2022-03-04 16:06:34 -08:00
Rob Clark	af4b7f74b2	freedreno/drm: Add SYSPROF param Add new param for putting kernel in system-profiling mode and add corresponding fd_pipe_set_param() mechanism. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15236>	2022-03-04 16:06:34 -08:00
Ilia Mirkin	539fae796a	freedreno/a4xx: fix integer tg4 Something is slightly off in the integer values returned. It passes many tests without the fixup, but the dEQP-GLES31 tests complain. The blob ends up doing 3x gathers, and selects between them based on getinfo results. Since we already have a per-sampler key with some spare bits, just stick the bit-size info in there. And we can derive signedness from the associated type info. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14670>	2022-03-03 18:26:43 +00:00
Ilia Mirkin	96211adf77	freedreno/a4xx: add swizzles to shader keys for tg4 workaround Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14670>	2022-03-03 18:26:43 +00:00
Ilia Mirkin	37306ba3f1	freedreno/ir3: remove bogus tg4 -> tex lowering pass It can't be done. This just provides bad results. The blob had a comparable approach where they fixed up coordinates, but that also can't work with a separate texture definition with nearest filtering. By then, might as well provide a unswizzled variant instead, and using native functionality. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14670>	2022-03-03 18:26:43 +00:00
Rob Clark	7e63fa2bb1	freedreno/registers: Add a couple regs we need for kernel Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15221>	2022-03-03 02:19:47 +00:00
Emma Anholt	221ce1b35a	ci/freedreno: Consolidate some information about an a630 flake. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15197>	2022-03-02 19:53:26 +00:00
Emma Anholt	a64408dcd5	ir3: Don't assert on not finding the VS output for an FS input. It should return undefined data, not terminate the program. Fixes some piglit tests poking at the UB handling. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15197>	2022-03-02 19:53:26 +00:00
Danylo Piliaiev	549e861dc1	turnip: Implement VK_EXT_physical_device_drm Copied from ANV and V3DV. v1. Fix a build error for clang "unannotated fall-through between switch labels" ( Hyunjun Ko <zzoon.ko@igalia.com> ) Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6011 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14971>	2022-03-01 07:10:40 +00:00
Connor Abbott	ed9a0d48a9	ir3: Use isam for bindless images In the bindless case, we don't have to keep any shadow descriptors and can just reuse the IBO descriptor as a texture descriptor. Now that we're emitting the swizzle we can just flip this on. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15114>	2022-02-28 23:33:22 +00:00
Connor Abbott	06485f7d3d	tu: Call nir_opt_access This adds some small optimizations, and enables lowering to isam in more cases where the app didn't specify readonly. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15114>	2022-02-28 23:33:22 +00:00
Connor Abbott	58d72f45e5	ir3/nir: Fix 1d array readonly images ncoords includes the array index, and the NIR source has the array index as its last component, so we have to insert the extra y coordinate in the middle in this case. Fixes: `0bb0cac` ("freedreno/ir3: handle image buffer") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15114>	2022-02-28 23:33:22 +00:00
Connor Abbott	21ac044c3e	ir3: Don't always set bindless_tex with readonly images Fixes: `274f381` ("ir3: Plumb through bindless support") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15114>	2022-02-28 23:33:22 +00:00
Connor Abbott	bb1e0eba08	freedreno/fdl: Set swizzle on storage descriptor It appears to be unused by ldib/stib, but it will let us use isam on IBO descriptors for bindless images. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15114>	2022-02-28 23:33:22 +00:00
Connor Abbott	00be8c4619	freedreno: Replace A6XX_IBO with A6XX_TEX_CONST Since these were reverse-engineered, it's become clear that IBO descriptors are just a subset of texture descriptors, and bindless reads of readonly images actually use isam on the IBO descriptor, further confirming that the two are always compatible, even if not all of the texture fields exist for IBOs. It's pointless to have a separate type for IBOs, and just leads to things getting out-of-sync unnecessarily which has already happened. Just remove it and use TEX_CONST insted. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15114>	2022-02-28 23:33:22 +00:00
Connor Abbott	e1c4c2ac60	ir3: Use CAN_REORDER instead of NON_WRITEABLE CAN_REORDER takes volatile into account, and is closer to what we actually require to use texture instructions, which is that we can arbitrarily reorder loads. Fixes: `aa93896` ("freedreno/ir3: adjust condition for when to use ldib") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15114>	2022-02-28 23:33:22 +00:00
Danylo Piliaiev	95fabff8de	turnip: Set drmFormatModifierTilingFeatures From Vulkan spec for VkDrmFormatModifierProperties2EXT: "drmFormatModifierTilingFeatures is a bitmask of VkFormatFeatureFlagBits that are supported by any image created with format and drmFormatModifier." "The returned drmFormatModifierTilingFeatures must contain at least one bit." "Therefore, if the returned drmFormatModifier is DRM_FORMAT_MOD_LINEAR, then drmFormatModifierPlaneCount must equal the format planecount, and drmFormatModifierTilingFeatures must be identical to the VkFormatProperties2::linearTilingFeatures returned in the same pNext chain." Relevant tests: dEQP-VK.drm_format_modifiers.* Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15032>	2022-02-28 22:53:40 +00:00
Guilherme Gallo	d1c6185b5a	ci: skqp: Add Vulkan support for a630_skqp job This commit adds support for Vulkan backend on a630_skqp job. = Needed changes - Needed to install libvulkan-dev package on system - Refactored the way the available skqp reports are printed tested in development builds with skia tools Piglit expectations had to be updated in various drivers due to !14750 not having bumped the tags when it tried to uprev. Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14686>	2022-02-25 05:50:06 +00:00
Guilherme Gallo	8bfef8bf6b	ci: skqp: Build skqp from android-cts-10.0_r11 tag with Clang The Android CTS 10 version is relative old when compared with skia main branch, which was being used before. Some modifications in the skqp build/runner scripts were needed to make it run on CI. - skqp versions from android-cts have already all assets inside platform_tools folder. - along with the assets, are the render and unit files which are expected to pass in the Android CTS execution. - removed custom test files from the a630 folder, to make it comply with the CTS expectations. - include new patches to remove Python2 dependencies and avoid the installation of it in rootfs. - strip binariesthe built binaries `skqp` and `list_gpu_unit_tests`, as `is_debug = false` gn argument did not work, maybe it is not well tested in development builds with skia tools - use Clang instead of GCC. The GCC support is not so graceful as it is in the skia main branch, some NEON instructions needs to be turned off in the GCC compilation, causing different tests result. This change does not imply a bigger rootfs, since the built skqp binary uses GCC libc++ and other library runtimes. So clang is just a build dependency. = Changes in skqp results = Some errors were found for GL backend and unit tests. GLES and VK tests are green. All the failed tests were classified as expected to fail in the render and unit tests list. ``` gl_blur2rectsnonninepatch gl_bug339297_as_clip gl_bug6083 gl_dashtextcaps ``` ``` SRGBReadWritePixels (../../tests/SRGBReadWritePixelsTest.cpp:214 Could not create sRGB surface context. [OpenGL]) ``` Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14686>	2022-02-25 05:50:06 +00:00
Tomeu Vizoso	c0695bb473	ci: Allow disabling the whole of the Collabora farm Add a global-level variable that allows disabling all jobs that would have gone to the Collabora lab, to be used in case of outages. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15150>	2022-02-24 07:33:45 +01:00
Dmitry Baryshkov	b4bef890ee	freedreno/regs: remove 5nm DSI PHY regs 5nm PHY is a variation of 7nm PHY, they use the same register definitions. To remove duplication, drop 5nm defs. Cc: Robert Foss <robert.foss@linaro.org> Signed-off-by: Dmitry Baryshkov <dmitry.baryshkov@linaro.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15051>	2022-02-23 21:25:22 +00:00
Dmitry Baryshkov	22efeec399	freedreno/registers: add new register for 7nm DSI PHY v4.3 (sm8450) Signed-off-by: Dmitry Baryshkov <dmitry.baryshkov@linaro.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15052>	2022-02-23 17:28:17 +00:00
Danylo Piliaiev	7e703e4428	turnip: Always use GMEM for feedback loops in autotuner For ordinary feedback loops GMEM is a lot faster than sysmem since we don't set SINGLE_PRIM mode. For feedback loops with ordered rasterization GMEM should also be faster. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15106>	2022-02-23 11:31:59 +00:00
Danylo Piliaiev	ebc23ac963	turnip: Implement VK_ARM_rasterization_order_attachment_access Trivially implemented by using A6XX_GRAS_SC_CNTL_SINGLE_PRIM_MODE. This extension is useful for emulators e.g. AetherSX2 PS2 emulator and could drastically improve performance when blending is emulated. Relevant tests: dEQP-VK.rasterization.rasterization_order_attachment_access.* Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15106>	2022-02-23 11:31:59 +00:00
Danylo Piliaiev	d6c89e1e4a	turnip: Merge LRZ and DEPTH_PLANE draw states They were emitted at the same time. Frees 1 draw state for us to use. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15106>	2022-02-23 11:31:59 +00:00
Danylo Piliaiev	dab34bd5c8	turnip: Use LATE_Z when there might be depth/stencil feedback loop Otherwise a shader invocation would read the value which should have been set AFTER this shader invocation. Fixes tests: dEQP-VK.rasterization.rasterization_order_attachment_access.depth.samples_1.multi_draw_barriers dEQP-VK.rasterization.rasterization_order_attachment_access.stencil.samples_1.multi_draw_barriers Fixes: `71595a189a` ("tu: Fix feedback loops in sysmem mode") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15106>	2022-02-23 11:31:59 +00:00
Emma Anholt	59bc17d57a	turnip: Request no implicit sync when we have no implicit-sync WSI BOs. I chose to implement this as a global flag in the device, because otherwise we would end up with extra draw overhead trying to avoid it in the implicit-sync WSI case, and you're probably going to end up needing implicit sync anyway because you used one of the BOs in any of the submitted cmdbufs. To do better than this, we would probably want a skip-implicit-sync flag on the BOs in the BO list, rather than global on the submit. Reports about venus on turnip say that this flag reduces worst-case QueueSubmit time in a game workload from ~10ms to ~4ms. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14838>	2022-02-22 17:36:05 +00:00
Ilia Mirkin	65c4b6a4c6	freedreno/ir3: document GETINFO's x/y results The zw were already known, but throw them in here too. I'm not extremely happy with the description of "y", feels like there's a simpler explanation there, but I couldn't find it. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14672>	2022-02-21 02:09:19 +00:00
Danylo Piliaiev	a814a4f9db	turnip: Add a refcount mechanism to BOs Until now we have lived without a refcount mechanism in the driver because in Vulkan the user is responsible for handling the life span of memory allocations for all Vulkan objects, however, imported BOs are tricky because the kernel doesn't refcount so user-space needs to make sure that: 1. When importing a BO into the same device used to create it (self-importing) it does not double free the same BO. 2. Frees imported BOs that were not allocated through the same device. Our initial implementation always freed BOs when requested, so we handled 2) correctly but not 1) on drm and we would double-free self-imported BOs because kernel doesn't return a unique gem_handle on each import. Beside this the submit ioctl checks for duplicates in the BO list and returns an error if there is one. This fixes the problem for good by adding refcounts to BOs so that self-imported BOs have a refcnt > 1 and are only freed when all references are freed. KGSL on the other hand does not have the same problems, at least not with ION buffers which are used for exportable BOs on pre 5.10 android kernels. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5936 Fixes CTS tests: dEQP-VK.drm_format_modifiers.export_import.* Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15031>	2022-02-19 15:16:55 +00:00
Emma Anholt	a2b7d9b9cd	ci/freedreno: Add a known spilling hangcheck flake. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15085>	2022-02-19 02:37:13 +00:00
Emma Anholt	b39d5e9705	ci/freedreno: Cut down pre-merge a630 VK coverage. We've got lots of VK coverage on 618, so take some of the load off (but leave a little bit of testing just to make sure we don't totally break 630). This should help with our Marge times since we've added some other coverage to 630 that's started overloading the runners. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15085>	2022-02-19 02:37:13 +00:00
Emma Anholt	04790ec8bb	ci/freedreno: Move a 60s timeout test to skips instead of flakes. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15085>	2022-02-19 02:37:13 +00:00
Ian Romanick	a01b262990	nir: Add missing dependency on nir_opcodes.py Commit `38800b38` changed nir_opcodes.py, but that doesn't seem to have triggered nir_opt_algebraic.py. The change in `75ef5991` depends on opt_algebraic lowering 16-bit versions of slt, but if opt_algebraic is not rebuilt, this may not happen. This resulted in some people seeing assertion failures in, for example, dEQP-VK.spirv_assembly.instruction.compute.float16.arithmetic_3.step, due to the backend seeing nir_op_slt that it didn't know how to handle. v2: Add nir_opcodes.py to nir_algebraic_py so that all the per-driver algebraic passes pick up the dependency too. Rename it to nir_algebraic_depends. Suggested by Emma. Closes: #6047 Fixes: `d1992255bb` ("meson: Add build Intel "anv" vulkan driver") Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15050>	2022-02-17 22:57:33 +00:00
Connor Abbott	3ef858a6f6	ir3/spill: Fix simplify_phi_nodes with multiple loop nesting Once we simplified a phi node, we never updated the definition it points to, which meant that it could become out of date if that definition were also simplified, and we didn't check that when rewriting sources. That could happen when there are multiple nested loops with phi nodes at the header. Fix it by updating the phi's pointer. Since we always update sources after visiting the definition it points to, when we go to rewrite a source, if that source points to a simplified phi, the phi's pointer can't be pointing to a simplified phi because we already visited the phi earlier in the pass and updated it, or else it's been simplified in the meantime and this isn't the last pass. This way we don't need to keep recursing when rewriting sources. Fixes: `613eaac7b5` ("ir3: Initial support for spilling non-shared registers") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15035>	2022-02-16 09:55:39 +00:00
Yiwei Zhang	2a87a741ae	turnip: advertise VK_EXT_queue_family_foreign Both Venus and Android AHB requires this extension. Turnip ignores VK_SHARING_MODE_EXCLUSIVE so this is a no-op. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Acked-by: Rob Clark <robdclark@chromium.org> Acked-by: Emma Anholt <emma@anholt.net> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14836>	2022-02-14 21:27:35 +00:00
Danylo Piliaiev	0b2da9d795	ir3: Limit the maximum imm offset in nir_opt_offset for shared vars STL/LDL have 13 bits to store imm offset. Fixes crash in CS compilation in Monster Hunter World. Fixes: `b024102d7c` ("freedreno/ir3: Use nir_opt_offset for removing constant adds for shared vars.") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14968>	2022-02-14 20:56:46 +00:00
Ilia Mirkin	3d41414d26	freedreno/ir3: split up load/store/atomic by generation Some bits are slightly different on a4xx. Use the encodings that work. Perhaps these can be combined at some point if we get a proper understanding of what they mean. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14789>	2022-02-12 13:46:11 -05:00
Ilia Mirkin	b91b036322	isaspec: add gen-based leaf bitset separation This is necessary for some ops which have slightly different encoding on a4xx/a5xx, but are otherwise identical. This helps keeping the compiler from having to worry about these details and creating separate ops. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14789>	2022-02-12 13:46:07 -05:00
Jason Ekstrand	bda4c4f6b6	vulkan: Take a vk_command_pool in vk_command_buffer_init() Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:25 +00:00
Jason Ekstrand	d59caf5d11	turnip: Use vk_command_pool Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:24 +00:00
Louis-Francis Ratté-Boulianne	5e263cc324	vulkan/runtime: Add a level field to vk_command_buffer Looks like 3 implementations already have that field in their private command_buffer struct, and having it at the vk_command_buffer opens the door for generic (but suboptimal) secondary command buffer support. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:24 +00:00
Danylo Piliaiev	b84f059680	freedreno/pps: Expose same counters as blob Expose most of the counters exposed by blob. By faking the value of counters returned from kgsl I found the exact underlying counters and constant coefficients being used. Note, coefficients for counters that depend on time are NOT verified. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14323>	2022-02-10 15:15:33 +00:00
Danylo Piliaiev	97c90c514f	turnip: Depth/stencil formats should not expose any bufferFeatures From the Vulkan 1.3.205 spec, section 19.3 "43.3. Required Format Support": Mandatory format support: depth/stencil with VkImageType VK_IMAGE_TYPE_2D [...] bufferFeatures must not support any features for these formats See https://gitlab.khronos.org/vulkan/vulkan/-/merge_requests/4849 Fixes CTS tests: dEQP-VK.api.buffer.invalid_buffer_features.* Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14927>	2022-02-09 20:11:22 +00:00
Emma Anholt	ef112db311	ci: Bump VK-GL-CTS to 1.3.1.0. The main thing is VK 1.3 testing, but also includes test bugfixes. The 1.3 CTS required an uprev of deqp-runner to handle a new style of test output, and that deqp-runner brings in some neat new features, too (piglit in your deqp-runner suite, and extension list checking). A bunch of VK tests got renamed, so I replaced panvk's custom test list with simple include filters on the main test list. Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> (panvk) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14920>	2022-02-08 22:16:36 +00:00
Emma Anholt	6faaeca584	ci/freedreno: Add another unsizedArrayLength flake. Started appearing on Feb 1, but given that the rest of this test group flakes, I assume it's similar. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14934>	2022-02-08 18:04:37 +00:00
Danylo Piliaiev	44bdac9849	tu: Implement VK_AMD_buffer_marker to support Graphics Flight Recorder Graphics Flight Recorder is: "The Graphics Flight Recorder (GFR) is a Vulkan layer to help trackdown and identify the cause of GPU hangs and crashes. It works by instrumenting command buffers with completion tags." This is a nice little tool which could help quickly identify the call which hanged. Or if command buffer is executed for too long. The tiling nature of our GPU shouldn't be a big issue aside from lower performance. For non-segfault case, if: - Hang happens at the same place in cmdbuf and draw/dispatch is not finished at that point - it is likely that there is an infinite loop in some of the shaders in this draw. - Hang happens always in different place - likely there is nothing wrong and command buffer just takes too long to execute and you should try increasing hangcheck_period_ms. If it doesn't help it is likely a synchronization issue. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13553>	2022-02-07 12:53:34 +02:00
Danylo Piliaiev	183bc15bdb	turnip: Unconditionaly remove descriptor set from pool's list on free We didn't remove desc set from the pool's list if pool was host_memory_base. On the other hand in there is no point in removing desc set from the list in DestroyDescriptorPool/ResetDescriptorPool. Fixes: `da7a4751` ("turnip: Drop references to layout of all sets on pool reset/destruction") Fixes cts tests: dEQP-VK.api.buffer_marker.graphics.default_mem.bottom_of_pipe.memory_dep.draw dEQP-VK.api.buffer_marker.graphics.default_mem.bottom_of_pipe.memory_dep.dispatch Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14855>	2022-02-04 21:07:30 +00:00
Dylan Baker	2f916f2be6	meson: add support for `meson devenv` with vulkan Meson devenv is a feature added in meson 0.58 (thus the features is version guarded) that allows creating a shell environment with environment variables automatically setup for running the project inside the build dir. Some variables (such as LD_LIBRARY_PATH and PATH) are set automatically, others must be added by the project. For vulkan is is relativley simple, we create a new, uninstalled, icd file for each driver and set the VK_ICD_FILENAMES variable appropriately. This can be used with: ```sh meson devenv -C $builddir ``` then, vulkan applications will automatically use the uninstall vulkan driver, no need to install. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14826>	2022-02-04 09:08:47 -08:00
Danylo Piliaiev	fded7a95c5	turnip: Expose VK_KHR_shader_non_semantic_info This is entirely implemented in the SPIR-V frontend. Relevant CTS tests: dEQP-VK.spirv_assembly.instruction.compute.non_semantic_info.* Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14829>	2022-02-04 09:24:06 +00:00
Danylo Piliaiev	ff059605aa	turnip: Implement VK_KHR_zero_initialize_workgroup_memory Moved nir_lower_compute_system_values to lower load_local_invocation_index which could be emitted by nir_zero_initialize_shared_memory. Relevant CTS tests: dEQP-VK.compute.zero_initialize_workgroup_memory.* Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14829>	2022-02-04 09:24:06 +00:00
Danylo Piliaiev	c6d1cac6e5	turnip: Expose VK_EXT_image_robustness VK_EXT_image_robustness is a strict subset of VK_EXT_robustness2 so we could just expose it. Relevant CTS tests: dEQP-VK.robustness.image_robustness.* Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14829>	2022-02-04 09:24:06 +00:00
Danylo Piliaiev	03f9deecb8	turnip: Use the shared helpers to expose 1.3 core extensions/limits Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14829>	2022-02-04 09:24:06 +00:00

... 3 4 5 6 7 ...

3456 Commits