KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Connor Abbott	89263fde20	tu: Use common vk_image struct This eliminates some boilerplate, and will be necessary to use the common render pass implementation for debugging purposes. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17378>	2022-07-27 19:40:44 +00:00
Connor Abbott	3aa20a4409	tu: Split out some state into a separate struct These bits of state will have to be treated specially when suspending/resuming a render pass, because they will need to be tracked across command buffers. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17378>	2022-07-27 19:40:44 +00:00
Danylo Piliaiev	19682028eb	tu/autotune: Prevent division by zero src/freedreno/vulkan/tu_autotune.c:509:48: runtime error: division by zero Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16967>	2022-06-10 14:09:59 +00:00
Chia-I Wu	5c17a04282	turnip: consider render pass costs in autotune To be able to sum drawcall cost and render pass cost, the units of costs are changed to bytes. With that, tu_autotune_use_bypass can make decisions by comparing the costs of sysmem rendering and gmem rendering. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16733>	2022-06-08 12:48:08 +00:00
Chia-I Wu	6fe7b92114	turnip: if-checks autotune debug macros This avoids bitrot while the compiler can easily optimize away those checks. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16733>	2022-06-08 12:48:08 +00:00
Emma Anholt	835704e669	turnip: Move autotune buffers to suballoc. Now the ANGLE trex_200 trace replay does a single BO allocation at startup for autotune results instead of one per frame (~350 for the whole replay). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Emma Anholt	7c636acd53	turnip: Get autotune off of ralloc destructors. We've wanted to remove destructors from ralloc's API for a long time (it's an extra storage cost per ralloc for a rarely-used feature), and for the suballoc change we'd need to spend more storage on storing the tu_device pointer per result since destructors don't get anything else but the pointer passed into them. Fixes use-after-frees: ================================================================= ==2383==ERROR: AddressSanitizer: heap-use-after-free on address 0xffff88fe1940 at pc 0xffff934f427c bp 0xfffff5481e90 sp 0xfffff5481ea8 WRITE of size 8 at 0xffff88fe1940 thread T0 #0 0xffff934f4278 in list_del ../src/util/list.h:108 #1 0xffff934f4278 in result_destructor ../src/freedreno/vulkan/tu_autotune.c:237 #2 0xffff9377793c in unsafe_free ../src/util/ralloc.c:300 #3 0xffff9377793c in ralloc_free ../src/util/ralloc.c:265 #4 0xffff934f4368 in history_destructor ../src/freedreno/vulkan/tu_autotune.c:229 #5 0xffff9377793c in unsafe_free ../src/util/ralloc.c:300 #6 0xffff9377793c in ralloc_free ../src/util/ralloc.c:265 #7 0xffff934f5990 in tu_autotune_on_submit ../src/freedreno/vulkan/tu_autotune.c:442 [...] 0xffff88fe1940 is located 80 bytes inside of 112-byte region [0xffff88fe18f0,0xffff88fe1960) freed by thread T0 here: #0 0xffff9c1c90d8 in __interceptor_free ../../../../src/libsanitizer/asan/asan_malloc_linux.cpp:127 #1 0xffff934f4368 in history_destructor ../src/freedreno/vulkan/tu_autotune.c:229 #2 0xffff9377793c in unsafe_free ../src/util/ralloc.c:300 #3 0xffff9377793c in ralloc_free ../src/util/ralloc.c:265 #4 0xffff934f5990 in tu_autotune_on_submit ../src/freedreno/vulkan/tu_autotune.c:442 #5 0xffff935cf2ac in tu_queue_submit_locked ../src/freedreno/vulkan/tu_drm.c:997 [...] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Emma Anholt	dc3203b087	turnip: Sub-allocate pipelines out of a device-global BO pool. Allocating a BO for each pipeline meant that for apps with many pipelines (such as Asphalt9 under ANGLE), we would end up spending too much time in the kernel tracking the BO references. Looking at CS:Source on zink, before we had 85 BOs for the pipelines for a total of 1036 kb, and now we have 7 BOs for a total of 896 kb. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Danylo Piliaiev	2e878293f4	turnip: Make autotuner work with reusable command buffers To achieve it each command buffer now has its own GPU memory. However the BOs usage by autotuner is not optimal, the ideal pattern would be to use some memory pool to suballocate small GPU memory chunks, since most command buffers have only a few renderpasses. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5990 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14996>	2022-03-09 12:56:31 +00:00
Danylo Piliaiev	7e703e4428	turnip: Always use GMEM for feedback loops in autotuner For ordinary feedback loops GMEM is a lot faster than sysmem since we don't set SINGLE_PRIM mode. For feedback loops with ordered rasterization GMEM should also be faster. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15106>	2022-02-23 11:31:59 +00:00
Danylo Piliaiev	a814a4f9db	turnip: Add a refcount mechanism to BOs Until now we have lived without a refcount mechanism in the driver because in Vulkan the user is responsible for handling the life span of memory allocations for all Vulkan objects, however, imported BOs are tricky because the kernel doesn't refcount so user-space needs to make sure that: 1. When importing a BO into the same device used to create it (self-importing) it does not double free the same BO. 2. Frees imported BOs that were not allocated through the same device. Our initial implementation always freed BOs when requested, so we handled 2) correctly but not 1) on drm and we would double-free self-imported BOs because kernel doesn't return a unique gem_handle on each import. Beside this the submit ioctl checks for duplicates in the BO list and returns an error if there is one. This fixes the problem for good by adding refcounts to BOs so that self-imported BOs have a refcnt > 1 and are only freed when all references are freed. KGSL on the other hand does not have the same problems, at least not with ION buffers which are used for exportable BOs on pre 5.10 android kernels. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5936 Fixes CTS tests: dEQP-VK.drm_format_modifiers.export_import.* Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15031>	2022-02-19 15:16:55 +00:00
Danylo Piliaiev	dbae9fa7d8	tu: implement sysmem vs gmem autotuner The implementation is separate from Freedreno due to multithreading support. In Vulkan application may fill command buffer from many threads and expect no locking to occur. We do introduce the possibility of locking on renderpass end, however assuming that application doesn't have a huge amount of slightly different renderpasses, there would be minimal to none contention. Other assumptions are: - Application does submit command buffers soon after their creation. Breaking the above may lead to some decrease in performance or autotuner turning itself off. The heuristic is too simplistic at the moment, to find a proper one - we should run a bunch of traces with sysmem and gmem, and build better heuristic from gathered data. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12128>	2022-01-31 15:26:35 +00:00

12 Commits