KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Emma Anholt	38dff02bfb	ci/deqp-runner: Rename the deqp-drivername-.txt files to drivername-.txt We have two testsuites with the same format for fails/flakes/skips files, and test names that are definitely unique. As I'm about to add a third testsuite (gtest for libva-utils), so let's have just one file each for fails/flakes/skips instead of one per type of testsuite. This starts the move with just the bulk rename of deqp. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13517>	2021-10-27 20:54:11 +00:00
Matt Turner	2822b1345c	tu: Expose required VK_FORMAT_FEATURE bits for planar YUV formats Specifically this enables these VK_FORMAT_FEATURE bits: VK_FORMAT_FEATURE_TRANSFER_SRC_BIT VK_FORMAT_FEATURE_TRANSFER_DST_BIT VK_FORMAT_FEATURE_SAMPLED_IMAGE_YCBCR_CONVERSION_LINEAR_FILTER_BIT VK_FORMAT_FEATURE_SAMPLED_IMAGE_FILTER_MINMAX_BIT VK_FORMAT_FEATURE_SAMPLED_IMAGE_FILTER_LINEAR_BIT VK_FORMAT_FEATURE_SAMPLED_IMAGE_BIT VK_FORMAT_FEATURE_MIDPOINT_CHROMA_SAMPLES_BIT VK_FORMAT_FEATURE_COSITED_CHROMA_SAMPLES_BIT Fixes the following tests: dEQP-VK.api.info.format_properties.g8_b8_r8_3plane_420_unorm dEQP-VK.api.info.format_properties.g8_b8r8_2plane_420_unorm dEQP-VK.api.info.image_format_properties.2d.optimal.g8_b8_r8_3plane_420_unorm dEQP-VK.api.info.image_format_properties.2d.optimal.g8_b8r8_2plane_420_unorm Additionally allows 339 tests in dEQP-VK.ycbcr.* to go from Skip to Pass. [ Connor: Fake support for 3-plane formats, fixup modifiers path ] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6792>	2021-10-22 11:25:31 +00:00
Emma Anholt	613b9c8c47	ci: Update deqp to vulkan-cts-1.2.7.1. Brings in some VK fixes for tilers. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13253>	2021-10-13 00:37:59 +00:00
Emma Anholt	c0142ddf3a	turnip: Disable VK_EXT_display_control. The common code fails dEQP-VK.wsi.display_control.register_device_event due to having a stub NOT_IMPLEMENTED return, and thus fails the CTS. This is one of our last failures, so disable the extension until it can get finished off, so we can unblock passing the CTS. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13010>	2021-09-27 17:41:43 +00:00
Emma Anholt	0e997d42a2	turnip: Swizzle in 0, 1 for D24S8 STENCIL_ASPECT sampling. This seems to be what is expected by the CTS, but I haven't found a citation. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13009>	2021-09-24 19:26:57 +00:00
Emma Anholt	c0c5c0d557	tu: Support VK_STRUCTURE_TYPE_PHYSICAL_DEVICE_PROTECTED_MEMORY_PROPERTIES. We had missed this struct previously, causing a CTS fail. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12944>	2021-09-21 22:43:43 +00:00
Emma Anholt	23f7e06cd8	freedreno/ir3: Don't use isam for coherent image loads on a6xx. If the coherent flag is present, then we need to not have an incoherent cache between us and previous stores to the image that were also decorated as coherent. isam apparently (unsurprisingly) goes through a texture cache. Use ldib instead, so that we don't get the wrong result. We need a similar fix for pre-a6xx, but we don't have emit_intrinsic_load_image for those (yet). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12704>	2021-09-03 18:17:07 +00:00
Danylo Piliaiev	46def1814e	turnip: consider shader's immediates size for sub-stream allocation Otherwise we could exceed pre-allocated space. Fixes: dEQP-VK.spirv_assembly.instruction.compute.opphi.wide Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8841>	2021-09-03 14:34:53 +00:00
Connor Abbott	88142d8a5a	ir3/ra: Fix type mismatch when comparing intervals This was once a physreg, back in the very beginning of the new RA, but now the caller passes an unsigned int. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5163 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12487>	2021-08-31 20:33:49 +00:00
Connor Abbott	6d76112624	tu: Remove some stale bypass xfails These were fixed by `09e0b29bb6` which was missed during the suite conversion. For the remaining still-valid fail, there is a CTS patch in progress. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12488>	2021-08-20 22:28:56 +00:00
Matt Turner	e0bc11d9d2	tu: Free device->bo_idx and device->bo_list on init failure Two related changes: - in tu_device.c:tu_CreateDevice we need to free both pointers in the teardown path after tu_bo_finish(global_bo), which uses the pointers. They are allocated in the first call to tu_bo_init(), which happens when global_bo is allocated. - in tu_drm.c:tu_bo_init we need to free bo_list if the bo_idx allocation fails. Convert to the goto teardown pattern as well. Fixes the following dEQP-VK tests: dEQP-VK.api.device_init.create_instance_device_intentional_alloc_fail dEQP-VK.api.object_management.alloc_callback_fail.device dEQP-VK.api.object_management.alloc_callback_fail.device_group Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12481>	2021-08-20 20:30:14 +00:00
Connor Abbott	abf0ae014a	tu: Properly handle waiting on an earlier pipeline stage I never really implemented this properly, because I wasn't aware of the clusters when doing the original pipeline barrier implementation. It turns out that the Vulkan stages we get as part of the barriers are actually good for something, because it turns out that the pipeline state is split into stages, so earlier stages can run ahead of later stages and sometimes we need to wait when an earlier stage depends on the result of a later stage. This happens most often whenever a shader reads the result of a color/depth attachment write, because attachment writes happen in a logically later stage. However this could also happen for a FS -> VS dependency. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12475>	2021-08-20 18:03:26 +00:00
Connor Abbott	613eaac7b5	ir3: Initial support for spilling non-shared registers Support for spilling shared registers to normal registers is still TODO. There are also several improvements to be made, like rematerialization. Note, there is one behavior change to register pressure accounting: we now include half registers in the current full pressure directly in mergedregs mode, rather than adding the max half pressure to the max full pressure afterwards, which might result in lower calculated max pressure in some cases with half registers. This is needed for spilling, since we need to make sure the total pressure including half registers is below the maximum at each instruction. Because the entire pass is rewritten, including the register pressure calculating parts, it didn't seem worth it to separate out this change. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12033>	2021-08-20 10:37:36 +00:00
Emma Anholt	e6776148c1	freedreno/a6xx: Sync TFB BO access against prior TFB writes. CTS draw_indirect usage of TFB output was flaking due to the TFB writes possibly not having completed. Since GL TFB doesn't require any other barrier between TFB and use of the BO (as seen by the CTS not emitting any memory barrier), we have to do it ourselves. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12457>	2021-08-19 23:27:15 +00:00
Emma Anholt	83e9a7fbcf	freedreno/ir3: Align driver param upload size/offset for indirect uploads. For indirect draws, we have to upload some of the params as indirect references, which have a more strict size requirement. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12455>	2021-08-19 14:43:06 -07:00
Emma Anholt	5d192c2036	freedreno/ir3: Apply the a6xx samgq workaround to TES/TCS/GS as well. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12454>	2021-08-19 21:21:45 +00:00
Eric Anholt	547a2aa051	freedreno/ir3: Use the resinfo path for ssbo sizes on GL, too. Less state walking at draw time, in exchange for a SHL in the lookup. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12258>	2021-08-18 00:15:18 +00:00
Emma Anholt	a48fc88571	freedreno/a6xx: Apply the cube image size lowering to GL, too. Fixes KHR-GLES31.core.texture_cube_map_array.texture_size_compute_sh. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12256>	2021-08-17 20:00:49 +00:00
Emma Anholt	09942227f6	ci/freedreno: Move freedreno's deqp testing to suite support. Using suites makes load-balancing our jobs much easier, keeps the CPU busy handling the a630_gles_others.sh test sets (and improves the output and baseline handling for them), and makes it trivial to add in more short test sets. a306: still 5 jobs, and we add KHR-GLES2 (KHR-GLES3 is unstable) a530: still 5 jobs, added KHR-GLES* a630_gl: 5 jobs becomes 4, and we add KHR-GLES* a630_vk: still 3 jobs, now 1/3 of all VK instead of 1/4. a630_vk_full: still 2 jobs, now includes full bypass testing, partial no-force testing, and testing of pre-merge-skipped tests. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12256>	2021-08-17 20:00:49 +00:00
Connor Abbott	1564ff72a0	tu: Fix xfb when there is a hole at the end We were handling the case where we had an unassigned output in the middle of the outputs array, but v->outputs can be smaller than the shader's info.num_outputs when an output at the end isn't assigned. This lead to us reading garbage after the end, and assuming that it corresponded to r0.x and overwriting the xfb entry for some other random output with the unassigned output's entry. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12310>	2021-08-16 11:06:23 +00:00
Connor Abbott	380d4904ea	tu: Read some input attachments directly It can happen that the user reads an input attachment as the first use of that attachment. In that case there are no subpass dependencies required at all, because there could be a pipeline barrier before the renderpass instead, and in any case we assume that dependencies with the first subpass as a destination can be executed only once outside the renderpass. The result is that we only do a CACHE_INVALIDATE once before the entire renderpass, but it's actually required after each GMEM load, because input attachments read GMEM through UCHE and those writes to GMEM invalidate UCHE. While we could add the missing CACHE_INVALIDATE "by hand" somehow, it turns out it's actually just as easy to do an optimization the blob does, where it simply doesn't patch those input attachments and reads them directly instead. This means we can skip allocating memory in GMEM for them entirely in some circumstances. This fixes e.g. dEQP-VK.api.copy_and_blit.core.resolve_image.whole_array_image.4_bit with TU_DEBUG=forcebin. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12213>	2021-08-10 16:45:53 +02:00
Emma Anholt	fd29117551	ci/freedreno: Organize, fill out, and document our VK xfails. This is the full set of xfails for 1.2.7.0 currently. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12150>	2021-08-06 20:03:02 +00:00
Connor Abbott	8115cde3ba	tu, freedreno/a6xx, ir3: Rewrite tess PrimID handling The previous handling conflated RelPatchID and PrimID, which would result in incorrect gl_PrimitiveID when doing draw splitting and didn't work with PrimID passthrough which fills the VPC slot with the "correct" PrimID value from the tess factor BO which we left 0. Replace PrimID in the tess lowering pass with a new RelPatchID sysval, and relace PrimID with RelPatchID in the VS input code in turnip/freedreno at the same time so that there is no net change in the tess lowering code. However, now we have to add new mechanisms for getting the user-level PrimID: - In the TCS it comes from the VS, just like gl_PrimitiveIDIn in the GS. This means we have to add another register to our VS->TCS ABI. I decided to put PrimID in r0.z, after the TCS header and RelPatchID, because it might not be read in the TCS. - If any stage after the TCS uses PrimID, the TCS stores it in the first dword of the tess factor BO, and it is read by the fixed-function tessellator and accessed in the TES via the newly-uncovered DSPRIMID field. If we have tess and GS, the TES passes this value through to the GS in the same way as the VS does. PrimID passthrough for reading it in the FS when there's tess but no GS also "just works" once we start storing it in the TCS. In particular this fixes dEQP-VK.pipeline.misc.primitive_id_from_tess which tests exactly that. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12166>	2021-08-05 16:35:41 +00:00
Emma Anholt	c2a6143755	turnip: Fix assertions on checking mutable combined samplers support. We would determine that it was unsupported, then ask for the size and triggered the assertion checking that we never ask for the size of a combined sampler. Fixes: `ee3495e465` ("turnip: Add support for VK_VALVE_mutable_descriptor_type") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12148>	2021-08-02 17:41:54 +00:00
Danylo Piliaiev	ea7a42775b	turnip: reduce maxComputeWorkGroupSize Blob advertises { 1024, 1024, 64 }, but from tests they all could be 1024. Fixes tests: dEQP-VK.compute.basic.max_local_size_x dEQP-VK.compute.basic.max_local_size_y dEQP-VK.compute.basic.max_local_size_z Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9409>	2021-08-02 20:19:18 +03:00
Connor Abbott	b157a5d0d6	tu: Implement non-aligned multisample GMEM STORE_OP_STORE We have to a bit careful here when disabling draw states. This also necessitates moving the actual recording of the stores to the end so that we set the dirty flag correctly. Closes: #4462 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12102>	2021-08-02 11:20:25 +00:00
Juan A. Suarez Romero	dfccbdff98	ci: update to VK-GL-CTS 1.2.7.0 v2: - Read mustpass files from vk-default.txt (Matt) - Remove freedreno atomic geom tests from fail list (Emma) - Move freedreno flake to separated line (Emma) Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12069>	2021-07-30 20:02:13 +00:00
Connor Abbott	d9a4a0aebd	tu: Handle multisample vkCmdCopyColorImage() There was a bit of code already to select the 3d path, but we actually need another shader variant for it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12080>	2021-07-29 23:54:29 +00:00
Connor Abbott	7a14484bca	ir3: Preserve gl_ViewportIndex in the binning shader Fixes dEQP-VK.draw.shader_viewport_index.* with TU_DEBUG=forcebin. Fixes: `efff734220` ("turnip: multiViewport and VK_EXT_shader_viewport_index_layer") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12104>	2021-07-28 16:49:46 +00:00
Connor Abbott	d903f51631	tu: Triage some CTS failures These should be fixed by the linked VK-GL-CTS change. I couldn't reproduce the other renderpass failure on a650 though. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12065>	2021-07-27 13:55:46 +00:00
Connor Abbott	266d3d5814	tu: Update subgroup properties Everything should be in place for this to actually work. Support a size of 128, unlike the blob. I've also plumbed through ballot support, so enable that. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6752>	2021-07-08 16:02:41 +00:00
Emma Anholt	ea5707c52f	turnip: Disable buffer texturing on 422 formats. Fixes: dEQP-VK.api.info.format_properties.g8b8g8r8_422_unorm dEQP-VK.api.info.format_properties.b8g8r8g8_422_unorm and part of: dEQP-VK.api.info.format_properties.g8_b8_r8_3plane_420_unorm dEQP-VK.api.info.format_properties.g8_b8r8_2plane_420_unorm Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11562>	2021-06-24 17:34:06 +00:00
Emma Anholt	6bc88c26b6	ci/turnip: Document create_instance_device_intentional_alloc_fail's fail. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11562>	2021-06-24 17:34:06 +00:00
Emma Anholt	31f8b70481	turnip: Link more MRs and issues related to our xfails. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11562>	2021-06-24 17:34:06 +00:00
Danylo Piliaiev	f38fd3c577	turnip: place a limit on the growth of BOs There is a limit on IB size, which on freedreno is set to 0x100000. Going beyond it results in hangs, however I found that the last 0x100000 packet just doesn't get executed. Thus the real limit is 0x0FFFFF. This could be tested by appending nops to the cmdstream and placing e.g. CP_INTERRUPT at the end, at any position other than being 0x100000 packet it results in a hang. Fixes: dEQP-VK.api.command_buffers.record_many_draws_secondary_2 dEQP-VK.api.command_buffers.record_many_draws_primary_2 However these tests could trigger hangcheck timeouts. Also this fixes hangs when opening captures of games in RenderDoc. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10786>	2021-05-31 17:38:26 +00:00
Emma Anholt	9d28bac9d0	turnip: Make sure that SNORM blits don't clamp ambiguous -1.0 values. The CTS expects that some paths transfer SNORM data exactly, but the HW will clamp 0x80 values to 0x81 in the process. We can treat snorm as unorm, though, and get working compression without the clamping happening. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10735>	2021-05-27 19:27:40 +00:00
Emma Anholt	307139c7f9	vulkan: Avoid stomping array padding in the MemoryProperties wrapper. The deqp test for it expects that the unused array elements are untouched, so make sure they don't get replaced with random stack data. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10737>	2021-05-20 21:41:06 +00:00
Connor Abbott	de9f2170cc	ir3: Use round-to-nearest-even for fquantize2f16 We're supposed to map a floating-point value too large to be represented as fp16 to infinity, however round-to-zero naturally rounds it down to the largest representable fp16 number instead. The blob emits a bunch of fixup code to work around this, but instead we can just do what all the other drivers seem to do and use round-to-nearest-even instead. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10897>	2021-05-20 18:45:59 +00:00
Connor Abbott	e894e83e47	ir3/cf: Rewrite pass The old pass had a few bugs: - It tried to avoid folding f2f32 into f2f16, but didn't consider conversions that were already folded in. - It didn't prevent folding an f2f16 or f2f32 into a non-floating-point op. In addition it wasn't written in a manner which made handling integer conversions practical. This rewrites the pass to instead calculate the "type" of the conversion source and then check whether folding the conversion is allowed. This allows us to cleanly separate the declarative part where we describe how the HW works from the policy part where we decide whether the transform is allowed, and makes it simple to add support for folding integer conversions. Closes: #3208 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10859>	2021-05-19 20:03:19 +00:00
Danylo Piliaiev	3a29e45a90	turnip: do not ignore early_fragment_tests Specifying "early_fragment_tests" in fragment shader takes precedence over our internal conditions. Fixes test: dEQP-VK.fragment_operations.early_fragment.early_fragment_tests_stencil Fixes: `b2a60c157e` "turnip: add LRZ early-z support" Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10803>	2021-05-17 11:56:32 +03:00
Juan A. Suarez Romero	629e8347ad	ci: Update VK-GL-CTS to 1.2.6.1 Reviewed-by: Emma Anholt <emma@anholt.net> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10800>	2021-05-14 20:35:24 +00:00
Danylo Piliaiev	5a133ef1f2	ci/turnip: drop fail annotation for image.extend_operands_spirv1p4.* They were fixed in `ed20e69b` "vtn: Handle ZeroExtend/SignExtend image operands" Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10783>	2021-05-13 17:18:48 +00:00
Danylo Piliaiev	9a477ccbea	ci/turnip: drop fail annotation for float_control tests These tests are NotSupported and therefore cannot fail. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10783>	2021-05-13 17:18:48 +00:00
Emma Anholt	9a5c9ff342	turnip: Drop fail annotation for driver_properties. These subtests weren't run in CI, and the whole set is skipped since dropping to 1.1. Fixes: `7bcda21441` ("turnip: Demote API version to 1.1.") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10734>	2021-05-11 14:16:25 +00:00
Emma Anholt	63a3d18ae1	ci/turnip: Add some links to issues and MRs for some test failures. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10734>	2021-05-11 14:16:25 +00:00
Emma Anholt	90b08175b7	ci/turnip: Clean up some stale fail annotations. This test group was fixed in the deqp 1.2.6.0 uprev, but we do a fractional run that didn't include these tests. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10734>	2021-05-11 14:16:25 +00:00
Eric Anholt	7bcda21441	turnip: Demote API version to 1.1. We don't support major 1.2 required extensions like timeline semaphores. Fixes many complaints in the dEQP-VK.info.vulkan1p2.* group. We were originally bumped to 1.2 in `75755e0eba` ("turnip: Pretend to support Vulkan 1.2") but hopefully that build issue has been fixed in the entrypoint reworks since then. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10471>	2021-05-05 17:09:09 +00:00
Danylo Piliaiev	cb8a00791c	ir3: memory_barrier also controls shared memory access order nir_intrinsic_memory_barrier has the same semantic as memoryBarrier() in GLSL, which is: GLSL 4.60, 4.10. "Memory Qualifiers": "The built-in function memoryBarrier() can be used if needed to guarantee the completion and relative ordering of memory accesses performed by a single shader invocation." GLSL 4.60, 8.17. "Shader Memory Control Functions": "The built-in functions memoryBarrier() and groupMemoryBarrier() wait for the completion of accesses to all of the above variable types." Fixes tests: dEQP-VK.memory_model.message_passing.core11.u32.coherent.fence_fence.atomicwrite.device.payload_local.image.guard_nonlocal.workgroup.comp dEQP-VK.memory_model.message_passing.core11.u32.coherent.fence_fence.atomicwrite.device.payload_nonlocal.workgroup.guard_local.image.comp Fixes: `819a613a` ("freedreno/ir3: moar better scheduler") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9054>	2021-05-05 10:05:38 +00:00
Danylo Piliaiev	9402d5a6b5	ir3: make possible to specify branchstack up to 64 On a6xx/a5xx there is such dependency between branchstack bitfield and the amount of nested ifs, which could be seen with blob: IFs BRANCHSTACK 0 0 1 1 2 2 3 2 4 3 5 3 6 4 ... 59 30 60 31 61 31 62 32 63 32 64 32 Remove open-coded branchstack for a5xx compute along the way. Fixes tests: dEQP-VK.spirv_assembly.instruction.compute.float16.opvectorshuffle.344 dEQP-VK.spirv_assembly.instruction.graphics.float16.opvectorshuffle.344_vert dEQP-VK.spirv_assembly.instruction.graphics.float16.opvectorshuffle.444_geom dEQP-VK.spirv_assembly.instruction.graphics.float16.opvectorshuffle.244_tessc dEQP-VK.spirv_assembly.instruction.graphics.float16.opvectorshuffle.344_frag Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9859>	2021-04-21 11:57:07 +00:00
Samuel Iglesias Gonsálvez	54cf12774a	turnip/lrz: add support for VK_EXT_extended_dynamic_state When the depth or stencil state changes dynamically, that might affect LRZ state and we need to recalculate it and emit it again. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8615>	2021-04-15 09:51:20 +02:00

1 2

59 Commits