KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Yonggang Luo	a9e2c699aa	util/c11: Update function u_thread_create to be c11 conformance Do not assume thrd_t to be a pointer or integer, as the C11 standard tells us: thrd_t: implementation-defined complete object type identifying a thread At https://en.cppreference.com/w/c/thread So we always return the thread creation return code instead of thrd_t value, and judge the return code properly. Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15087>	2022-06-15 17:37:17 +00:00
Yonggang Luo	672a93cd6d	util: Use timespec_get directly, it's always present Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15497>	2022-06-09 17:23:34 +00:00
Greg Depoire--Ferrer	969dfabc77	util/queue: handle thread cration failure in util_queue_adjust_num_threads If a thread cannot be created, make sure the num_threads field is updated to reflect the actual number of threads. Signed-off-by: Greg Depoire--Ferrer <greg.depoire@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15071>	2022-05-13 19:35:11 +00:00
Greg Depoire--Ferrer	9ed34cf9ff	util/queue: add missing space to comment in util_queue_destroy Signed-off-by: Greg Depoire--Ferrer <greg.depoire@gmail.com> Reviewed-By: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15071>	2022-05-13 19:35:10 +00:00
Pierre-Eric Pelloux-Prayer	7357ce19a2	util/u_queue: rework UTIL_QUEUE_INIT_SCALE_THREADS to scale faster The original code waiting for the queue to be full before adding more threads. This makes the thread count grow slowly, especially if the queue also uses UTIL_QUEUE_INIT_RESIZE_IF_FULL. This commit changes this behavior: now a new thread is spawned if we're adding a job to a non-empty queue because this means that the existing threads fail to process jobs faster than they're queued. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16273>	2022-05-13 14:40:56 +00:00
Jason Ekstrand	1b8a43a0ba	util: Remove util_cpu_detect util_cpu_detect is an anti-pattern: it relies on callers high up in the call chain initializing a local implementation detail. As a real example, I added: ...a Mali compiler unit test ...that called bi_imm_f16() to construct an FP16 immediate ...that calls _mesa_float_to_half internally ...that calls util_get_cpu_caps internally, but only on x86_64! ...that relies on util_cpu_detect having been called before. As a consequence, this unit test: ...crashes on x86_64 with USE_X86_64_ASM set ...passes on every other architecture ...works on my local arm64 workstation and on my test board ...failed CI which runs on x86_64 ...needed to have a random util_cpu_detect() call sprinkled in. This is a bad design decision. It pollutes the tree with magic, it causes mysterious CI failures especially for non-x86_64 developers, and it is not justified by a micro-optimization. Instead, let's call util_cpu_detect directly from util_get_cpu_caps, avoiding the footgun where it fails to be called. This cleans up Mesa's design, simplifies the tree, and avoids a class of a (possibly platform-specific) failures. To mitigate the added overhead, wrap it all in a (fast) atomic load check and declare the whole thing as ATTRIBUTE_CONST so the compiler will CSE calls to util_cpu_detect. Co-authored-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15580>	2022-04-20 18:44:35 +00:00
Samuel Pitoiset	04c90f292e	util/queue: fix a data race detected by TSAN when finishing the queue Thread sanitizer complains if it detects that the pthread_barrier is destroyed when a thread might still blocked on the barrier. Fix this by destroying the barrier only if pthread_barrier_wait returns PTHREAD_BARRIER_SERIAL_THREAD which is the value for success. In practice this shouldn't fix anything serious given that this code is only called when the disk cache is destroyed. Original patch from Timothy Arceri. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4342 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13861>	2021-11-19 09:02:23 +01:00
Marek Olšák	b4afe25ebf	util/queue: use simple_mtx_t for finish_lock Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13152>	2021-10-05 23:46:14 +00:00
Ian Romanick	d78e980523	util/queue: Don't crash in util_queue_destroy when init failed This simplifies the error exit paths for drivers that use these queues. v2: Move allocation of queue->jobs after initializing the mutxes and condition variables. Noticed by Ken. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11229>	2021-07-28 17:32:44 +00:00
Pierre-Eric Pelloux-Prayer	3713dc6b2a	util/u_queue: add UTIL_QUEUE_INIT_SCALE_THREADS flag This flag allow to create a single thread initially, but set max_thread to the request thread count. If the queue is full and num_threads is lower than max_threads, we spawn a new thread to help process the queue faster. This avoid creating N threads at queue creation time. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11296>	2021-06-17 09:11:59 +02:00
Pierre-Eric Pelloux-Prayer	0c88df1f6a	util/u_queue: move function definition up Will be used by the next commit. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11296>	2021-06-17 09:11:58 +02:00
Mike Blumenkrantz	a3a6611e96	util/queue: add a global data pointer for the queue object this better enables object-specific (e.g., context) queues where the owner of the queue will always be needed and various pointers will be passed in for tasks Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11312>	2021-06-16 15:10:09 -04:00
Mike Blumenkrantz	776ddfc858	util/queue: don't require a fence when adding a job sometimes we just want to fire jobs off into the void and don't care if or when they finish Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10719>	2021-05-14 06:49:31 +00:00
Jose Fonseca	3ba7784b1e	util: Always use timespec_get on Windows. include/c11/threads_win32.h provides a fallback implementation of timespec_get when necessary. Fixes https://gitlab.freedesktop.org/mesa/mesa/-/issues/4109 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9280>	2021-03-02 14:37:46 +00:00
Rob Clark	a9618e7c42	util: Add accessor for util_cpu_caps In release builds, there should be no change, but in debug builds the assert will help us catch undefined behavior resulting from using util_cpu_caps before it is initialized. With fix for u_half_test for MSVC from Jesse Natalie squashed in. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9266>	2021-02-26 18:31:19 +00:00
Rob Clark	9fb9019beb	util/u_queue: Ensure num_cpu_mask_bits is valid I noticed that we were hitting this before st_create_context() called util_cpu_detect() and so num_cpu_mask_bits was zero. But there is no harm in calling util_cpu_detect(), so lets just call it here to be safe. Fixes: `d877451b48` ("util/u_queue: add UTIL_QUEUE_INIT_SET_FULL_THREAD_AFFINITY") Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9266>	2021-02-26 18:31:19 +00:00
Witold Baryluk	65ef4a2e02	util: Use explicit relaxed reads for u_queue These are no-op, but make clang thread sanitizer happy. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8230>	2021-01-28 18:07:09 +00:00
Jeremy Huddleston Sequoia	68a785e63f	Fall back on clock_gettime when timespec_get() is unavailable Fixes: `e3a8013de8` "util/u_queue: add util_queue_fence_wait_timeout" Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/1020 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4088 Signed-off-by: Jeremy Huddleston Sequoia <jeremyhu@apple.com> Signed-off-by: Yurii Kolesnykov <root@yurikoles.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8482>	2021-01-16 00:14:46 +00:00
Marek Olšák	a0467b7fa1	util: replace UTIL_MAX_CPUS by util_cpu_caps.num_cpu_mask_bits to reduce overhead when setting thread affinity. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8017>	2021-01-05 03:47:16 +00:00
Marek Olšák	9758b1d416	util: add util_set_thread_affinity helpers including Windows support Acked-by: Jose Fonseca <jfonseca@vmware.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7054>	2020-10-30 05:07:57 +00:00
Marek Olšák	96d9f7761d	util: consolidate thread_get_time functions Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Acked-by: Jose Fonseca <jfonseca@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7054>	2020-10-30 05:07:57 +00:00
Con Kolivas	78d267e6da	Linux: Change minimum priority threads from SCHED_IDLE to nice 19 SCHED_BATCH. SCHED_IDLE on linux can lead to extraordinarily long periods of no scheduling leading to starvation of minimum priority threads for such an extended period that it can eventually lead to GUI stalls. Switch to renicing the threads to the lowest priority and use the SCHED_BATCH scheduling policy which is a hint to the scheduler that this is latency insensitive thread instead. This change has been confirmed to address unexpected GUI related stalls in mesa applications across a range of different linux kernels. Signed-off-by: Con Kolivas <kernel@kolivas.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4912>	2020-05-08 10:14:40 +00:00
Rhys Perry	1ef9658906	util/u_queue: fix race in total_jobs_size access Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> CC: <mesa-stable@lists.freedesktop.org> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4335> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4335>	2020-03-30 20:17:43 +00:00
Timothy Arceri	c578600489	util: remove LIST_DEL macro Just use the inlined function directly. The macro was replaced with the function in `ebe304fa54`. Reviewed-by: Eric Engestrom <eric@engestrom.ch>	2019-10-28 11:24:38 +00:00
Timothy Arceri	40258fb8b8	util: remove LIST_ADD macro Just use the inlined function directly. The macro was replaced with the function in `ebe304fa54`. Reviewed-by: Eric Engestrom <eric@engestrom.ch>	2019-10-28 11:24:38 +00:00
Timothy Arceri	7ae1be1028	util: remove LIST_INITHEAD macro Just use the inlined function directly. The macro was replaced with the function in `ebe304fa54`. Reviewed-by: Eric Engestrom <eric@engestrom.ch>	2019-10-28 11:24:38 +00:00
Marek Olšák	c2efd2cbfb	util/u_queue: skip util_queue_finish if num_threads is 0 This fixes a deadlock in pthread_barrier_destroy. Cc: 19.1 19.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-10-23 21:11:17 -04:00
Timothy Arceri	896885025f	util/u_queue: track job size and limit the size of queue growth When both UTIL_QUEUE_INIT_RESIZE_IF_FULL and UTIL_QUEUE_INIT_USE_MINIMUM_PRIORITY are set, we can get into a situation where the queue never executes and grows to a huge size due to all other threads being busy. This is the case with the shader cache when attempting to compile a huge number of shaders up front. If all threads are busy compiling shaders the cache queues memory use can climb into the many GBs very fast. The use of these two flags with the shader cache is intended to allow shaders compiled at runtime to be compiled as fast as possible. To avoid huge memory use but still allow the queue to perform optimally in the run time compilation case, we now add the ability to track memory consumed by the jobs in the queue and limit it to a hardcoded 256MB which should be more than enough. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-09-19 15:03:27 +10:00
Lionel Landwerlin	9d3fc737af	util: fix compilation on macos timespec_get() is not available on macos, we need to pull in the include/c11/threads_posix.h helper. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=103674 Fixes: `e2d761de03` ("util: drop final reference to p_compiler.h") Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-08-23 23:45:25 +03:00
Eric Engestrom	dffeaa55dd	util: use standard name for snprintf() Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-07-19 22:39:38 +01:00
Marek Olšák	050fae3983	util/queue: add util_queue_adjust_num_threads for ARB_parallel_shader_compile Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2019-04-01 12:37:52 -04:00
Marek Olšák	b7317b6ce0	util/queue: hold a lock when reading num_threads in util_queue_finish Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2019-04-01 12:37:52 -04:00
Marek Olšák	bb111559f2	util/queue: add ability to kill a subset of threads for ARB_parallel_shader_compile	2019-04-01 12:37:52 -04:00
Marek Olšák	d99cdc9d59	util/queue: move thread creation into a separate function Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2019-04-01 12:37:52 -04:00
Marek Olšák	d877451b48	util/u_queue: add UTIL_QUEUE_INIT_SET_FULL_THREAD_AFFINITY Initial version discussed with Rob Clark under a different patch name. This approach leaves his driver unaffected.	2018-10-06 22:05:58 -04:00
Andres Gomez	d7694136d3	kutil/queue: use util_snprintf() in util_queue_init Instead of plain snprintf(). To fix the MSVC 2013 build: Compiling src\util\u_queue.c ... u_queue.c src\util\u_queue.c(325) : warning C4013: 'snprintf' undefined; assuming extern returning int ... mesautil.lib(u_queue.obj) : error LNK2001: unresolved external symbol _snprintf scons: building terminated because of errors. Fixes: `b238e33bc9` ("kutil/queue: add a process name into a thread name") Cc: Marek Olšák <marek.olsak@amd.com> Cc: Brian Paul <brianp@vmware.com> Cc: Roland Scheidegger <sroland@vmware.com> Cc: Timothy Arceri <tarceri@itsqueeze.com> Cc: Eric Engestrom <eric.engestrom@intel.com> Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-08-02 10:06:44 +03:00
Dylan Baker	17f49950da	util: move process.[ch] to u_process.[ch] On windows process.h is a system provided header, and it's required in include/c11/threads_win32.h. This header interferes with searching for that header, and results in windows build warnings with scons, but errors in meson which doesn't allow implicit function declarations. Just rename process to u_process, which follows the style of utils anyway. Fixes: `2e1e6511f7` ("util: extract get_process_name from xmlconfig.c") Signed-off-by: Dylan Baker <dylan.c.baker@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-01 12:47:16 -07:00
Lionel Landwerlin	78d5c1c82a	util: u_queue: fix android build error mesa/src/util/u_queue.c:242:15: error: address of array 'queue->name' will always evaluate to 'true' [-Werror,-Wpointer-bool-conversion] Fixes: `b238e33bc9` "kutil/queue: add a process name into a thread name" Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2018-07-05 15:42:26 +01:00
Marek Olšák	9b4c4fe334	util/queue: remove leftover debug code	2018-07-04 22:19:47 -04:00
Marek Olšák	b238e33bc9	kutil/queue: add a process name into a thread name v2: simplifications Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> (v1) Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> (v1)	2018-07-04 21:54:39 -04:00
Marek Olšák	7083ac7290	util/u_queue: fix a deadlock in util_queue_finish Cc: 18.0 18.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 13:28:17 -04:00
Nicolai Hähnle	a6e8311723	util/u_queue: fix timeout handling in util_queue_fence_wait_timeout Fixes: `e3a8013de8` ("util/u_queue: add util_queue_fence_wait_timeout") Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-11-20 18:15:49 +01:00
Nicolai Hähnle	e3a8013de8	util/u_queue: add util_queue_fence_wait_timeout v2: - style fixes - fix missing timeout handling in futex path Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-11-09 13:58:10 +01:00
Nicolai Hähnle	185061aef4	u_queue: add util_queue_finish for waiting for previously added jobs Schedule one job for every thread, and wait on a barrier inside the job execution function. v2: avoid alloca (fixes Windows build error) Reviewed-by: Marek Olšák <marek.olsak@amd.com> (v1)	2017-11-09 11:53:19 +01:00
Nicolai Hähnle	d1ff082637	u_queue: add a futex-based implementation of fences Fences are now 4 bytes instead of 96 bytes (on my 64-bit system). Signaling a fence is a single atomic operation in the fast case plus a syscall in the slow case. Testing if a fence is signaled is the same as before (a simple comparison), but waiting on a fence is now no more expensive than just testing it in the fast (already signaled) case. v2: - style fixes - use p_atomic_xxx macros with the right barriers Acked-by: Marek Olšák <marek.olsak@amd.com>	2017-11-09 11:37:39 +01:00
Nicolai Hähnle	574c59d4f9	u_queue: add util_queue_fence_reset Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-11-09 11:37:39 +01:00
Nicolai Hähnle	1b9d5ece55	u_queue: export util_queue_fence_signal Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-11-09 11:37:38 +01:00
Nicolai Hähnle	a208cd7ae4	util/queue: fix a race condition in the fence code A tempting alternative fix would be adding a lock/unlock pair in util_queue_fence_is_signalled. However, that wouldn't actually improve anything in the semantics of util_queue_fence_is_signalled, while making that test much more heavy-weight. So this lock/unlock pair in util_queue_fence_destroy for "flushing out" other threads that may still be in util_queue_fence_signal looks like the better fix. v2: rephrase the comment Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Gustaw Smolarczyk <wielkiegie@gmail.com>	2017-09-29 11:52:41 +02:00
Roland Scheidegger	c92fe8a8c5	util: only use SCHED_IDLE in pthread_setschedparam() when it's defined Fixes build error when it's not. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2017-09-01 01:10:32 +02:00
Marek Olšák	59ad769770	util/u_queue: add an option to resize the queue when it's full Consider the following situation: mtx_lock(mutex); do_something(); util_queue_add_job(...); mtx_unlock(mutex); If the queue is full, util_queue_add_job will wait for a free slot. If the job which is currently being executed tries to lock the mutex, it will be stuck forever, because util_queue_add_job is stuck. The deadlock can be trivially resolved by increasing the queue size (reallocating the queue) in util_queue_add_job if the queue is full. Then util_queue_add_job becomes wait-free. radeonsi will use it. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 10:57:20 -04:00

1 2

55 Commits