KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Rhys Perry	816b7fb5cb	aco: fix unreachable() for uniform 8/16-bit nir_op_mov from VGPR Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Fixes: `d20a752c0d` ("aco: use Builder::copy more") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8380>	2021-01-08 12:54:36 +00:00
James Park	3fb4755d48	util: Disable memstream for Apple builds Not all SDK versions support open_memstream. Maybe some other day. Fixes: `af8d488ea5` ("util,ac,aco,radv: Cross-platform memstream API") Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8269>	2021-01-08 09:37:14 +00:00
Samuel Pitoiset	f40a7d3c93	radv: fix performance regression by restoring TC-compat HTILE in GENERAL This fixes a performance regression for games (eg. Youngblood) that declare all images as concurrent. This is likely buggy for compute queues but this just restores the previous behaviour for now. Fixes: `f4f096805b` ("radv: fix TC-compat HTILE images with DST_OPTIMAL on the compute queue") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8351>	2021-01-08 09:22:32 +00:00
Samuel Pitoiset	0ae1cf46a6	radv: fix enabling TC-compat HTILE in GENERAL for writes on GFX10+ It wasn't expected to also enable inside render loops. Fixes: `4bb92d9145` ("radv: enable TC-compat HTILE in GENERAL on GFX10+") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8351>	2021-01-08 09:22:32 +00:00
Samuel Pitoiset	20683461e3	radv: configure the texture descriptor for TC-compat CMASK on GFX10+ This was missing, it can be enabled with RADV_PERFTEST=tccompatcmask. Note that this feature is still experimental. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8350>	2021-01-08 08:21:17 +01:00
Vinson Lee	e248119a82	r300: Fix typos. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8296>	2021-01-07 20:42:36 -08:00
Erik Faye-Lund	f1b51d472a	gallium/ntt: lower uniforms to ubo NTT doesn't handle uniforms, and requires them to have been lowered to UBOs. But for drivers that don't set nir_shader_compiler_options::lower_uniforms_to_ubo to true, this won't have happened yet. Neither Zink nor V3D sets this option, and in the case of Zink this isn't trivial to change. So let's lower uniforms to UBOs in this case in NTT instead. Fixes: `03c60762f5` ("gallium/ntt: Fix load_ubo_vec4 buffer index setup.") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4047 Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8365>	2021-01-07 23:09:49 +00:00
Nanley Chery	28a141e325	iris: Blit stencil according to aspect_mask With this change, stencil picks up the fix for 3D texture blits introduced with commit `382451ff9d`. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8340>	2021-01-07 23:00:32 +00:00
Nanley Chery	1148da3436	iris: Use single-aspect formats more in iris_blit In order to handle blitting the stencil aspect of a depth-stencil resource, use aspect-specific pipe formats in the aspect_mask loop. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8340>	2021-01-07 23:00:32 +00:00
Nanley Chery	db2cdc4277	iris: Blit non-stencil according to aspect_mask When blitting just the stencil aspect, the source and destination resources are prepared/setup twice. Move the unconditional resource setup into the aspect_mask loop to avoid this. In addition, use the aspect provided by the loop instead of the mask provided by the info parameter. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8340>	2021-01-07 23:00:32 +00:00
Nanley Chery	b73e903f96	iris: Loop through an aspect mask in iris_blit Enables dropping the stencil-specific blit later on. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8340>	2021-01-07 23:00:31 +00:00
Nanley Chery	776074d66c	iris: Increase use of pipe_resources in iris_blit Allows the affected code to avoid being moved into a while loop later on. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8340>	2021-01-07 23:00:31 +00:00
Nanley Chery	51d26e2edf	iris: Use texture preparation helper in iris_blit Use iris_resource_prepare_texture in iris_blit to avoid partial resolves for sRGB <-> linear texture views. This affects a trace of L4D2. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8340>	2021-01-07 23:00:31 +00:00
Nanley Chery	04d73e2dc2	iris: Move depth-format assertion out of iris_blit Instead of having a depth-specific assertion in a generic portion of iris_blit, move it into the depth-specific cases of iris_resource_texture_aux_usage. Since iris_blit calls that function, the test still occurs. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8340>	2021-01-07 23:00:31 +00:00
Nanley Chery	ce3a6dfa79	iris: Don't prepare depth for stencil-aspect blits Before this change, iris_blit would prepare the depth buffer in a depth-stencil resource even when only the stencil aspect was used for the blit. Use the aspect mask to prepare the correct resource. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8340>	2021-01-07 23:00:31 +00:00
Adam Jackson	a7762daa67	mesa: Don't make building tests conditional on building DRI drivers These tests should work, and be built, even if you're only building gallium drivers. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8353>	2021-01-07 17:39:26 -05:00
Adam Jackson	df4a7d67aa	mesa: Fix array-format-to-format table on big-endian The table constructor and the table lookup were doing different things for big-endian. This fixes MesaFormatsTest.FormatFromFormatAndType and MesaFormatsTest.FormatMatchesFormatAndType failing to round-trip for GL_RGBA / GL_SHORT, which we're not currently running in CI for s390x, but which a subsequent commit will enable. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8353>	2021-01-07 17:39:24 -05:00
Adam Jackson	ab0d17338f	tests: Fix memory leaks in DispatchSanity Needed to pass asan CI. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8353>	2021-01-07 17:39:14 -05:00
Samuel Pitoiset	d2f4934121	radv/llvm,aco: always split typed vertex buffer loads on GFX6 and GFX10+ To avoid any alignment issues that triggers memory violations and eventually a GPU. This can happen if the stride (static or dynamic) is unaligned and also if the VBO offset is aligned to scalar (eg. stride is 8 and VBO offset is 2 for R16G16B16A16_SNORM). The AMD Windows driver also always splits typed vertex fetches. fossils-db (Sienna Cichlid): Totals from 56508 (40.54% of 139391) affected shaders: SGPRs: 2643545 -> 2664516 (+0.79%); split: -0.19%, +0.98% VGPRs: 2007472 -> 1995408 (-0.60%); split: -0.74%, +0.13% CodeSize: 70596372 -> 73913312 (+4.70%); split: -0.00%, +4.70% MaxWaves: 772653 -> 774916 (+0.29%); split: +0.37%, -0.08% Instrs: 14074162 -> 14567072 (+3.50%); split: -0.00%, +3.51% Cycles: 69281276 -> 71253252 (+2.85%); split: -0.00%, +2.85% VMEM: 22047039 -> 25554196 (+15.91%); split: +17.20%, -1.29% SMEM: 4120370 -> 4360820 (+5.84%); split: +7.41%, -1.58% VClause: 416913 -> 438361 (+5.14%); split: -1.86%, +7.01% SClause: 536739 -> 542637 (+1.10%); split: -0.33%, +1.43% Copies: 977194 -> 970015 (-0.73%); split: -2.43%, +1.69% Branches: 241205 -> 241193 (-0.00%); split: -0.06%, +0.06% PreVGPRs: 1505645 -> 1505379 (-0.02%) This fixes GPU hangs with bin/draw-vertices from Piglit on GFX10+ with Zink. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8363>	2021-01-07 17:28:00 +00:00
Samuel Pitoiset	68c2537062	aco: fix creating the dest vector when 16-bit vertex fetches are splitted Compute the number of components of the destination vector from the bitsize when eg. a 16-bit vec2 vertex fetches is splitted. This is because the dst will be a v1, so the p_create_vector should be created from two v2b fro both sizes to match. This prevents a regression from the next change which will split typed vertex buffer loads on GFX6 and GFX10+. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8363>	2021-01-07 17:28:00 +00:00
Rhys Perry	f5adf27fb9	nir,radv: add and use nir_vectorize_tess_levels() fossil-db (Sienna): Totals from 1342 (0.97% of 138791) affected shaders: CodeSize: 3287996 -> 3269572 (-0.56%); split: -0.56%, +0.00% Instrs: 629896 -> 628191 (-0.27%); split: -0.31%, +0.04% Cycles: 2619244 -> 2612424 (-0.26%); split: -0.30%, +0.04% VMEM: 388807 -> 389273 (+0.12%); split: +0.14%, -0.02% SMEM: 90655 -> 90700 (+0.05%); split: +0.06%, -0.01% VClause: 21831 -> 21812 (-0.09%) PreVGPRs: 44155 -> 44058 (-0.22%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4202>	2021-01-07 16:34:53 +00:00
Rhys Perry	bfc777f83e	radv: vectorize shader I/O Fixes code size regressions after enabling TCS/TES for ACO. fossil-db (Sienna): Totals from 2588 (1.86% of 138791) affected shaders: SGPRs: 109950 -> 108480 (-1.34%); split: -1.43%, +0.09% VGPRs: 107764 -> 112060 (+3.99%); split: -0.03%, +4.02% CodeSize: 5957760 -> 5321656 (-10.68%) MaxWaves: 31718 -> 30358 (-4.29%); split: +0.03%, -4.32% Instrs: 1116300 -> 1029000 (-7.82%) Cycles: 4600344 -> 4251072 (-7.59%) VMEM: 980024 -> 812978 (-17.05%); split: +1.14%, -18.18% SMEM: 275458 -> 258227 (-6.26%); split: +2.34%, -8.60% VClause: 42925 -> 30533 (-28.87%); split: -31.02%, +2.15% SClause: 31554 -> 31362 (-0.61%); split: -1.79%, +1.18% Branches: 15689 -> 15697 (+0.05%) PreVGPRs: 80399 -> 83953 (+4.42%); split: -0.00%, +4.42% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4202>	2021-01-07 16:34:53 +00:00
Rhys Perry	f199b7188b	nir/load_store_vectorize: add data as callback args Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4202>	2021-01-07 16:34:53 +00:00
Rhys Perry	00c8bec47b	nir: add nir_load_store_vectorize_options Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4202>	2021-01-07 16:34:53 +00:00
Rhys Perry	f4eb833a12	nir/load_store_vectorize: don't ignore subgroup memory barriers Not sure why I thought this was correct, but we should consider them for optimization purposes. Fixes: `ce9205c03b` ('nir: add a load/store vectorization pass') Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4202>	2021-01-07 16:34:53 +00:00
Juan A. Suarez Romero	4d0b8a9d32	v3d: reinterpret stencil data as uint texture in stencil blit path There is a path to blit stencil buffers reinterpreting the stencil data as an RGBA8888 or R8 float texture. This works fine except for the case when the stencil buffer is multisampled, and the blit operation needs to resolve it: an average of the samples is done, which is incorrect, as only one sample must be used. This can be observed n the piglit test `ext_framebuffer_multisample-unaligned-blit 2 stencil downsample -auto -fbo`, specifically in the triangles border. To avoid this averaging, let's reinterpret the stencil data as RGBA8888 or R8 uint texture. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8361>	2021-01-07 16:09:31 +00:00
Rhys Perry	cacce76db9	radv: workaround games which assume full subgroups if cswave32 is enabled This assumption becomes incorrect with RADV_PERFTEST=cswave32. Games include Detroit: Become Human and Doom Eternal. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7918>	2021-01-07 15:01:02 +00:00
Rhys Perry	c73c246e05	nir: gather whether a compute shader uses non-quad subgroup intrinsics Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7918>	2021-01-07 15:01:02 +00:00
Rhys Perry	5bb94ab050	radv: implement CREATE_REQUIRE_FULL_SUBGROUPS_BIT with cswave32 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7918>	2021-01-07 15:01:02 +00:00
Michel Dänzer	e8f50bd600	wsi/x11: Treat IMMEDIATE present mode the same as MAILBOX for Xwayland Two main reasons: As described in the previous commit, sending buffers to the Wayland compositor as quickly as possible effectively results in mailbox behaviour. Also, doing the same as for MAILBOX present mode provides the following benefits: * We use more images in the swapchain, which avoids stalls on the client side if the Wayland compositor directly uses the client buffers for scanout. * We wait for fences to signal before submitting a new buffer, which avoids missing frames in the Wayland compositor due to fences not signalling in time for a flip. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3673 Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8197>	2021-01-07 15:00:45 +01:00
Michel Dänzer	4292fb2139	wsi/x11: Use PresentOptionAsync for MAILBOX present mode with Xwayland This allows Xwayland to forward buffers to the Wayland compositor ASAP for fullscreen / undecorated windows, which in turn allows true mailbox behaviour in the Wayland compositor. Without this, Xwayland has to emulate the mailbox behaviour itself, which it cannot do as well as the Wayland compositor by design. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8197>	2021-01-07 15:00:07 +01:00
Michel Dänzer	b5268d532a	wsi/x11: Detect Xwayland The following commits will introduce different behaviour for Xwayland. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8197>	2021-01-07 14:58:24 +01:00
Michel Dänzer	1de2fd0cf2	wsi/x11: Always link against xcb-xrandr The next commit will make use of it even without VK_USE_PLATFORM_XLIB_XRANDR_EXT. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8197>	2021-01-07 14:57:45 +01:00
Michel Dänzer	1cce8e1101	wsi/x11: Set recognizable name for WSI swapchain queue thread This makes it easier to recognize the thread e.g. in a debugger. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8197>	2021-01-07 14:56:41 +01:00
Pierre-Eric Pelloux-Prayer	07c1504d1b	radeonsi: implement SQTT support Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8002>	2021-01-07 10:10:17 +01:00
Pierre-Eric Pelloux-Prayer	a46e830444	radeonsi: add radeon_set_uconfig_reg_seq_perfctr Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8002>	2021-01-07 10:10:17 +01:00
Pierre-Eric Pelloux-Prayer	df5233b977	ac/sqtt: move radv_get_expected_buffer_size to ac Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8002>	2021-01-07 10:10:16 +01:00
Pierre-Eric Pelloux-Prayer	ea6176e63e	ac/sqtt: move ac_is_thread_trace_complete to ac Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8002>	2021-01-07 10:10:14 +01:00
Pierre-Eric Pelloux-Prayer	ffdfe136e6	ac/sqtt: move rgp/sqtt def to ac Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8002>	2021-01-07 10:09:57 +01:00
Pierre-Eric Pelloux-Prayer	4ec5cf5318	ac/radv: move radv_rgp.c to ac Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8002>	2021-01-07 10:09:49 +01:00
Pierre-Eric Pelloux-Prayer	bbc245ab2e	ac/radv: move sqtt structs and helpers to amd/common Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8002>	2021-01-07 10:09:47 +01:00
Pierre-Eric Pelloux-Prayer	04f6ba113c	ac/sqtt: add ac_thread_trace_data Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8002>	2021-01-07 10:09:45 +01:00
Pierre-Eric Pelloux-Prayer	b94104c0c0	radeonsi: pass radeon_cmdbuf to si_cp_dma_wait_for_idle Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8002>	2021-01-07 10:09:25 +01:00
Pierre-Eric Pelloux-Prayer	aa9fe1e423	radeonsi: pass radeon_cmdbuf to emit_cache_flush Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8002>	2021-01-07 10:09:25 +01:00
Alistair Popple	7f9a084e7e	gv100/ir: Use system wide atomics Increase the scope of atomic operations from GPU to system. This is required for support of SVM to ensure atomic access is maintained for memory buffers that are not local to the current GPU. Signed-off-by: Alistair Popple <apopple@nvidia.com> Reviewed-by: Ben Skeggs <bskeggs@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7955>	2021-01-07 08:59:10 +00:00
Alistair Popple	b02e3053ea	gv100/ir: Make emitATOM consistent with emitRED GV100 code generation uses ATOM instructions for compare-and-swap and RED instructions for other atomic operations. Make the scope consistent for both types of operations. Signed-off-by: Alistair Popple <apopple@nvidia.com> Reviewed-by: Ben Skeggs <bskeggs@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7955>	2021-01-07 08:59:10 +00:00
Marek Olšák	62703b79a5	radeonsi: remove si_gs_prolog_bits::gfx9_prev_is_vs It didn't do anything useful. GS doesn't use the other user SGPRs. If we decrease the number of user SGPRs we declare for the GS prolog, we can remove gfx9_prev_is_vs. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8344>	2021-01-06 23:28:04 -05:00
Marek Olšák	b6b6d1ff3c	radeonsi: fix hang caused by for loop with exec=0 in LS and ES LLVM expects that exec != 0 when entering loops and generates this code that becomes an infinite loop if exec == 0: BB5_1: vcc_lo = (inverted terminating condition) s_and_b32 vcc_lo, exec_lo, vcc_lo s_cbranch_vccnz BB5_3 // jump if vcc != 0 (break statement) // ... loop body ... s_branch BB5_1 BB5_3: For non-monolithic VS before TCS, VS before GS, and TES before GS, we set exec = (thread enabledmask), which sets 0 for HS-only and GS-only waves, causing the infinite loop condition above. Fix it as follows: - set exec = ~0 at the beginning - wrap the whole shader (LS and ES) in a conditional block, so that HS-only and GS-only waves jump over it and never enter such a loop The TES before GS hang can be reproduced by gfxbench: testfw_app --gfx egl -w 1920 -h 1080 --gl_api gles -t gl_tess Fixes: `68d6d097f1` - radeonsi/gfx9: add GFX9 and VEGA10 enums Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8344>	2021-01-06 23:28:01 -05:00
Eric Anholt	e1b7c42cc6	freedreno: Add missing dep on u_tracepoints. Fixes: `a02dcb970f` ("freedreno: Add GPU tracepoints") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8355>	2021-01-06 23:55:30 +00:00
Eric Anholt	3efbc47c83	freedreno: Mark a615/a618 as also lacking Z24_UINT_S8_UINT support. Rob says it's also the case on 618, and presumably 615 as well then, so make it take the same path as a630. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8319>	2021-01-06 22:54:14 +00:00
Eric Anholt	1633c9735a	freedreno: Disable UBWC on z24s8 on a630. Stencil texture sampling (such as what we have to do for BlitFramebuffer) is broken with UBWC enabled. We can't just take the fd_resource_uncompress() path, because that's a blit just like BlitFramebuffer. Fixes failure in dEQP-GLES3.functional.fbo.msaa.2_samples.stencil_index8, but also the uncaught rendering fails of 4_samples.stencil_index8 and depth24_stencil8. Prior to "911ce374caf0 freedreno/a6xx: Fix MSAA clear" we would usually pass and sometimes flake fail on this test occasionally, thus it being listed as a flake (though the rendering was actually broken). Since that commit, though, we consistently fail on a pixel of the broken rendering, and thus this was brought to my attention by the #freedreno-ci channel spam. Rob took a look at the performance impact of this, and the worst was maybe up to .5% fps hit on trex. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8319>	2021-01-06 22:54:14 +00:00
Eric Anholt	6a8601fb24	freedreno/a6xx: Fix z24s8 non-ubwc blits on a630. Taken from the logic in turnip (though they have the format table have 8888_unorm and override to z24s8_as_rgba for blits in the ubwc case) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8319>	2021-01-06 22:54:14 +00:00
Eric Anholt	d6e307dcc5	freedreno/a6xx: Move the IBO pipe2tex down to where it's used. For the texture path, the pipe2tex happens in fd6_tex_const_0(). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8319>	2021-01-06 22:54:14 +00:00
Eric Anholt	1c4613f5d4	turnip: Move the limited_z24s8 flag to the shared device info. I want to do the same logic in freedreno, so use the same flag. On suggestion by robclark, rename it to what it specifically means. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8319>	2021-01-06 22:54:14 +00:00
Neil Armstrong	11f32b6a05	kmsro: sync Android.mk GALLIUM_TARGET_DRIVERS Sync Android.mk GALLIUM_TARGET_DRIVERS names from kmsro meson.build, notably adding the missing mediatek, meson and rockchip display drivers names. It also fixes the imx name into imx-drm as referenced in meson.build and src/gallium/targets/dri/target.c Signed-off-by: Neil Armstrong <narmstrong@baylibre.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7443>	2021-01-06 22:34:45 +00:00
Rhys Perry	f7a5b8ed35	vtn: support SpvCapabilitySparseResidency Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7774>	2021-01-06 20:36:38 +00:00
Rhys Perry	7d1d4acbd5	nir/lower_tex: fix lower_tg4_offsets with sparse fetches Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7774>	2021-01-06 20:36:38 +00:00
Rhys Perry	2d2decc905	nir: add sparse_residency_code_and Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7774>	2021-01-06 20:36:38 +00:00
Rhys Perry	4cbdf9ec4d	nir,spirv: implement SpvOpImageSparseTexelsResident Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7774>	2021-01-06 20:36:38 +00:00
Rhys Perry	1fd8b46667	nir,spirv: add sparse image loads Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7774>	2021-01-06 20:36:38 +00:00
Rhys Perry	3a7972f72a	nir,spirv: add sparse texture fetches Like SPIR-V and GL_ARB_sparse_texture2, these return a residency code. It is placed in the destination after the rest of the result. If it's zero, then the texel is resident. Otherwise, it's not resident. Besides the larger destination and the residency code, sparse fetches work the same as normal fetches. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7774>	2021-01-06 20:36:38 +00:00
Rhys Perry	95819663b7	nir: allow 5 component vectors These will be useful for sparse texture instructions and image load intrinsics. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7774>	2021-01-06 20:36:38 +00:00
Rhys Perry	ba4a73a502	nir/tests: fix callback for load/store vectorizer tests Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7774>	2021-01-06 20:36:38 +00:00
Daniel Schürmann	22b89d9a52	nir/opt_vectorize: fix call to filter function Due to the typo, it could happen that instructions got further vectorized than intended. Fixes: `8eaf9c61d1` ('nir/opt_vectorize: don't hash filtered instructions') Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8352>	2021-01-06 19:03:07 +00:00
Vinson Lee	03999595e7	freedreno/afuc: Replace readfile with os_read_file. Tested afuc-disasm produced same output. $ ./builddir/src/freedreno/afuc/afuc-disasm -g 6 src/freedreno/.gitlab-ci/reference/afuc_test.fw > /tmp/afuc_test.asm $ diff ./src/freedreno/.gitlab-ci/reference/afuc_test.asm /tmp/afuc_test.asm $ echo $? 0 Suggested-by: Rob Clark <robdclark@chromium.org> Signed-off-by: Vinson Lee <vlee@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8290>	2021-01-06 18:12:34 +00:00
Erik Faye-Lund	96ceca33c1	gallium/util: do not perform n^2 stencil blits We already loop n times here, no point in doing n instances as well. Fixes: `e8a40715a8` ("gallium/util: add blitter-support for stencil-fallback") Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8301>	2021-01-06 17:09:16 +00:00
Rob Clark	32a6a13052	freedreno/ir3/parser: Fix pre-a6xx stib parsing Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:53 +00:00
Rob Clark	859c92d7ee	freedreno/ir3/parser: a6xx ldib/stib parsing Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:53 +00:00
Rob Clark	b7ea6ec178	freedreno/ir3: Fix pre-a6xx ldgb/stib parsing Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:53 +00:00
Rob Clark	050a449dbb	freedreno/ir3: Explicitly flag disasm test vectors that don't parse Mark the test cases which aren't supported by ir3_parser.y explicitly, so we notice future regressions. And likewise, fail when we see an unexpected pass, so we don't forget to update the test vectors in the future as ir3_parser improves. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:53 +00:00
Rob Clark	b073dae5f0	freedreno/ir3: Fix ldg decoding/parsing Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:53 +00:00
Rob Clark	a7e88787f6	freedreno/ir3/parser: Fixup stg parsing and add more tests The offset can also be a register, in which case we need to shuffle around the src order. Add a few more test vectors to cover each permutation (no offset, immed offset, gpr offset). Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	d6fa130dda	freedreno/ir3/parser: Add stgb support Note that this conflicts with `stc` on a6xx+, so a good test that the (new) disasm can handle both cases properly. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	eddfafae6a	freedreno/ir3/parser: Add ldgb support Gives us at least better coverage of pre-a6xx-bindless-ibo instructions. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	1746c4d211	freedreno/ir3/parser: Fix pre-a6xx resinfo Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	32539c1afc	freedreno/ir3/parser: Fix atomic support 1) Handle a6xx bindless form 2) Fix shared vs global encoding Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	c5479d1d8d	freedreno/ir3/parser: Add ldc support Note that this shows up a slight encoding difference compared to test vector extracted from blob deqp runs. We think these should be dontcare bits. For now, add a note and replace the encoded value in the disasm test. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	d7f141bb35	freedreno/ir3: Add cat5/cat6 nonuniform flag Not yet used by the compiler, but needed so we don't loose information between ir3 parser and instruction encoding. Currently ignored for cat5, because the uniform vs non-uniform default is swapped there. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	101bf686ee	freedreno/ir3: Disambiguate a6xx+ "bindless" instructions Add a `.b`.. for the atomic instructions it should be `atomic.b.op` but for now put the `.b` at the end to simplify life for the existing disasm Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	c55737902c	freedreno/ir3: Don't leak disk_cache Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	71f902bab9	freedreno/ir3: Add parsing and assembler testing In theory we should be able to round-trip from disasm->asm and get a bitwise match. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	b91319d952	freedreno/ir3: Tweak ldib/resinfo encoding The blob is using '0' for the low bit in these (except for ldib where it seems to randomly use either '0' or '1'). The upcoming xml based ISA spec maps this bit to 'dontcare' in the ldib case. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	99908c8d6d	freedreno/ir3/parser: Add initial cat6 IBO instructions Well, really just resinfo.. dealing with the different ldib/stib syntax for a6xx+ vs earlier seems a bit too painful to deal with. But resinfo at least gives us some encoding test coverage of this group of instrs. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	f9c76fba9d	freedreno/ir3/parser: Relative gpr/const can have modifiers too Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	594b004e00	freedreno/ir3/parser: Add missing (sat) modifier Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	77552cbdda	freedreno/ir3: Don't set bit for dest conversion for p0.c This appears to be ignored when writing to predicate registers (which I guess makes sense, since they are boolean). So no real harm in setting it, other than it makes some of the ir3_parser test vectors not match the expected result for encoding. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	1cdff35361	freedreno/ir3/parser: Fixup cat5 s2en instructions Currently ir3 (incl emit_cat5()) expects the samp/tex src register to be first.. which requires some fixup for the parser to match. TODO we might want to revisit the src reg order when adding new instr packing/encoding. For now, lets just make the parser match the rest of ir3. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	d35c79614e	freedreno/ir3/parser: Fix dsxpp/dsypp encoding Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	e9b3234915	freedreno/ir3/parser: Fix cat6 store encoding Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	b90a1cf747	freedreno/ir3: Cleanup cat6 load instructions There was some src2 vs src3 confusion, but since the syntax is like: ldl.f32 rDst, l[rBase+off], ncomp it makes more sense to call the offset src2 and ncomp src3, than the way we had it. This is also easier to deal with for the ir3 assembly parser. Also, src_offset was only ever used by the assembly parser, and was handled incorrectly in emit_cat6(), resulting that cat6 load instrs would not work properly in (for ex) computerator. Since we are cleaning things up, drop src_offset and make the asm parser work in the same way as the nir->ir3 frontend. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	4e272003b1	freedreno/ir3: Clean up instruction creation Convert everything remaining over to the version which takes # of register (src + dst) and drop the ir3_instr_create2() version. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	d968f46997	freedreno/ir3/parser: Handle half-immed Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	68be24dd6c	freedreno/ir3/parser: cat1 updates (mova1, movmsk) Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	647d7fc36d	freedreno/ir3/parser: cat1 instructions can write relative GPR Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	0b36044d4f	freedreno/ir3/parser: Add new cat0 instructions Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	2dc6458563	freedreno/ir3: Various cat0 updates Update the IR and packer to handle the additional cat0 fields, in prep for adding support in the assembler (in prep for adding round trip parsing/packing test coverage). We don't actually use these yet from the ir3 compiler, but at least this is one less thing to worry about when we start trying to use them. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	eec183c159	freedreno/ir3/parser: Reset lexer when input changes Otherwise, in case of parse errors, the lexer state can still contain buffered input from the previous parse. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	7b2d2bafe4	freedreno/ir3: Move assembler error handling Move out of ir3_parse_asm() so we can re-use it in disasm test for round-tripping asm/disasm. We don't want failures to be fatal (yet) as there are still some things missing from the assembler. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	a928d0ab46	freedreno/ir3: Add some more disasm test vectors Various things that I noticed which were initially wrong with the xml based disasm. These were extracted from a collection of unique instructions extracted from deqp traces, which unfortunately looses the link back to the original test case. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	2933d54992	freedreno/ir3: Fix mova1 disasm Yet another mnemonic for mov Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00

1 2 3 4 5 ...

122835 Commits