KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Mike Blumenkrantz	b27e729c0d	zink: add vertex buffer barriers during bind now we have tracking for vbo binds and can automatically reapply the correct barrier just before draw if the resource is modified after bind Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10849>	2021-05-25 21:16:44 -04:00
Mike Blumenkrantz	be40886521	zink: emit descriptor barriers and references during bind there's not actually any reason to do these during draw other than wasting cpu cycles, and it has the side benefit of providing a bunch more info for rebinds image resources for gfx shaders need to have their barriers deferred until draw time, however, as it's unknown what layout they'll require until that point due to potentially being bound to multiple descriptor types Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10849>	2021-05-25 21:16:43 -04:00
Mike Blumenkrantz	6eb290341b	zink: improve samplerview update flagging we shouldn't actually need to use the hash value here now that the surfaces and bufferviews have been deduplicated Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10849>	2021-05-25 20:53:01 -04:00
Mike Blumenkrantz	425642bec1	zink: abstract descriptor functionality and make descriptor structs private the first step to adding modular descriptor managers is to isolate the existing one so that it can be easily swapped out Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10849>	2021-05-25 20:53:00 -04:00
Mike Blumenkrantz	d54c21d279	zink: replace has_descriptors program member with a util function Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10849>	2021-05-25 20:52:38 -04:00
Mike Blumenkrantz	9d128e6aed	zink: pass descriptor type to set layout create() this will be useful in the future Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10849>	2021-05-25 20:52:38 -04:00
Mike Blumenkrantz	6dcad430dd	zink: unify pipeline layout creation and move to descriptor_program_init the descriptor layouts are the last component needed to create the pipeline layout, so it makes sense to streamline setup by having the pipeline layout created from a single place for both types of pipelines Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10849>	2021-05-25 20:52:38 -04:00
Mike Blumenkrantz	a98e11ce21	zink: make a public util function for allocating descriptor sets Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10849>	2021-05-25 20:52:38 -04:00
Mike Blumenkrantz	02f8e860e5	zink: make descriptor_layout_get a public util function Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10849>	2021-05-25 20:52:38 -04:00
Mike Blumenkrantz	f20b3c5aa9	zink: rename ptr_add_usage -> batch_ptr_add_usage Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10849>	2021-05-25 20:52:37 -04:00
Yiwei Zhang	f61337b549	egl/android: check front rendering support for cros gralloc Enable EGL_KHR_mutable_render_buffer when below conditions are met: 1. Driver part implementation of this extension is available 2. Loader part implementation of this extension is available 3. ClientAPIs must be OpenGL ES bits (ES, ES2 or ES3) 4. Gralloc is cros gralloc and it supports front render usage query (4) is optional as long as another gralloc supports similar query. Upon window surface creation, if the surface type has mutable render buffer, then append the cached front rendering usage to the existing usage and properly set to the ANativeWindow. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10685>	2021-05-26 00:05:33 +00:00
Yiwei Zhang	abec42c9a3	gallium/dri: implement EGL_KHR_mutable_render_buffer Tested with low-lantency stylus apps with this extension enabled, no regression on the cts. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10685>	2021-05-26 00:05:33 +00:00
Yiwei Zhang	fca4b6877a	gallium/st: add a back buffer fallback for front rendering Unlike front buffer used by big GL API for front rendering, EGL_KHR_mutable_render_buffer together with ES redirects GL_BACK to the front buffer. This patch adds a fallback to use back buffer and ensures no behavior change for unrelated frontends. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10685>	2021-05-26 00:05:33 +00:00
Nanley Chery	19a8bd4c63	intel/isl: Fix HiZ+CCS comment about ambiguates Note that CCS isn't ambiguated during a HiZ ambiguate. Dumping the CCS surface after a HiZ ambiguate shows that the CCS is unchanged. Fixes: `98dc7f56b7` ("intel/isl: Add a separate ISL_AUX_USAGE_HIZ_CCS_WT") Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9112>	2021-05-25 22:24:36 +00:00
Nanley Chery	34dbbfdd14	anv,iris: Port the D16 workaround stalls to BLORP Commit `cd40110420` added stalls before register writes that occur when drivers emit depth stencil packets. However, it only did so for non-BLORP draw calls. Since those packets are sometimes emitted during BLORP calls, add stalls there too. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4574 Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10939>	2021-05-25 20:55:27 +00:00
Nanley Chery	afe3f373a4	intel: Limit the D16 workarounds to Gfx12.0 The workarounds introduced in `cd40110420` are no longer needed on Gfx12.5. Suggested-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10939>	2021-05-25 20:55:27 +00:00
Andres Gomez	07b86e64a5	ci: add VKD3D-Proton testsuite job for radv's Navy Flounder Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Martin Peres <martin.peres@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10870>	2021-05-25 17:03:25 +00:00
Andres Gomez	44b085375c	ci: add VKD3D-Proton testsuite runner Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Martin Peres <martin.peres@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10870>	2021-05-25 17:03:25 +00:00
Andres Gomez	fb8a4d89a2	ci: include VKD3D-Proton tests into the VK test container Instead of installing the distribution package, build and install locally, including the tests. Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Martin Peres <martin.peres@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10870>	2021-05-25 17:03:25 +00:00
Andres Gomez	537c9460fa	ci: add radv's trace job for Navy Flounder Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Martin Peres <martin.peres@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10870>	2021-05-25 17:03:25 +00:00
Andres Gomez	a71ffa4592	ci: uprev DXVK to 1.8.1 Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Martin Peres <martin.peres@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10870>	2021-05-25 17:03:25 +00:00
Andres Gomez	d0e6863bf1	ci: uprev apitrace to 10.0 Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Martin Peres <martin.peres@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10870>	2021-05-25 17:03:25 +00:00
Andres Gomez	fa8ca10e27	ci: remove radv's trace job for Polaris10 Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Martin Peres <martin.peres@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10870>	2021-05-25 17:03:25 +00:00
Andres Gomez	f0f812dbe7	ci: update radv's trace job tag for Raven Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Martin Peres <martin.peres@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10870>	2021-05-25 17:03:25 +00:00
Chia-I Wu	62014aeccd	util/u_thread: fix u_thread_setname for long names "WSI swapchain queue", set by vulkan/wsi/x11, is longer than 15 characters. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10935>	2021-05-25 16:39:02 +00:00
Marek Olšák	7844bdadac	radeonsi: remove DFSM after we discovered how bad it is Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	56a450e984	radeonsi: disable DFSM on gfx9 by default because it decreases performance a lot Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	2fad90dc4a	radeonsi: implement threaded context callbacks for resource busy checking Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	9ba17ec21a	radeonsi: generate buffer_id_unique for u_threaded_context Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	9dc7fff448	radeonsi: allow changing the NGG subgroup size to 256 but don't change it yet Currently, 128 seems to have the best performance. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	13acbaecd8	radeonsi: rewrite the prefix sum computation for shader culling Instead of storing the vertex mask per wave into LDS and then computing the prefix sum, store 8-bit bitcounts (vertex counts) of the vertex masks into LDS. This allows us to compute the sum using v_sad_u8, which computes a sum of 4 i8vec4 components in one instruction. Each i8vec4 of vertex counts is loaded in parallel threads (one dword per thread) instead of all being loaded in thread 0, and readlane copies them to SGPRs instead of readfirstlane. LDS is no longer initialized before culling. Instead, the counts for inactive waves are masked with AND later. Incorrect old comments are also fixed. This change removes 80 bytes from the code size, and it allows increasing the workgroup size from 128 to 256. (which is the main motivation for this) Now changing the workgroup size with wave64 has no effect on the code size. Switching to wave32 with 8 waves even generates slightly smaller code than wave64 with 4 waves. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	27c9e77c6a	radeonsi: add missing threaded_resource_deinit calls in fail paths Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	64b75cc12e	radeonsi: add a gfx10 hw bug workaround with the barrier before gs_alloc_req Fixes: `8845a23698` - amd: add NAVI10 PCI IDs Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	712f74f590	radeonsi: remove 8 bytes from si_resource, turn other 4 bytes into padding Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	5af124c92c	radeonsi: change si_resource::alignment to alignment_log2 for better packing Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	0c8d65a39a	radeonsi: improve generated culling code by adding optimization barriers This removes a lot of instructions and 16 bytes from the code size. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	980a70573f	radeonsi: re-enable fast launch with indexed tri strips because it doesn't hang I don't know which change fixed this, but I can't reproduce the hang anymore. Acked-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	a0fcd37731	radeonsi: remove a twice duplicated workaround for VERT_GRP_SIZE This enables better lane occupancy. Acked-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	c8e8979d6b	radeonsi: fix the fast launch vert/prim thread counts if they are trimmed This fixes the case when the counts were out of sync because one of them was decreased. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	f6e19fd831	radeonsi: don't use GS fast launch with small instances Acked-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	e33e525e16	radeonsi: don't disable L2 caching for staging textures Uncached access can be slow if the box is not aligned nicely. Also, caching in L2 might enable bigger PCIe bursts. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	36e07198a7	radeonsi: always use the L2 LRU cache policy for faster clears and copies Waves and CP DMA can finish sooner if L2 doesn't do any evictions, which is hard to predict. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	805c785314	radeonsi: fix a coherency issue when VS memory stores are not visible in PS If a shader has no param exports (no varyings), the pixel shader can start after the VS position is written before the vertex shader finishes. The fix is to wait for the memory stores before the position export. The code needs to be restructured. First prepare param exports to get nr_param_exports, then emit position exports with the wait, and then emit param exports. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	d8fefb1322	radeonsi: add a gfx10 bug workaround for NOT_EOP Fixes: `cc24ec8c07` - radeonsi: set NOT_EOP for back-to-back draws on gfx10+ Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	7effb291e6	radeonsi: handle PIPE_CAP_MAX_VERTEX_BUFFERS no change in behavior because the value is the same as the default Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	c7e731c737	radeonsi: remove unused SI_IMAGE_ACCESS_AS_BUFFER Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	66c8e40d69	winsys/amdgpu: don't hold a mutex while accessing is_shared It adds overhead to amdgpu_bo_wait and I'm not sure whether the mutex is even needed. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	4602147561	ac/llvm: don't draw the primitive for the dummy export workaround for Navi1x for conservative rasterization Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	f33500b187	ac/llvm: set range metadata on mbcnt and deduplicate get_thread_id Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00
Marek Olšák	57e182c75b	ac/llvm: allow ac_build_optimization_barrier with SGPRs, pointers, and metadata sgpr=true prevents moving the value to a VGPR. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10813>	2021-05-25 16:15:44 +00:00

1 2 3 4 5 ...

140061 Commits All Branches Search

140061 Commits

All Branches