KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Yonggang Luo	9aa094d1b1	misc: Replace `#ifdef\t__cplusplus` with `#ifdef\s\s__cplusplus` Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15762>	2022-04-21 14:43:39 +00:00
Jason Ekstrand	1b8a43a0ba	util: Remove util_cpu_detect util_cpu_detect is an anti-pattern: it relies on callers high up in the call chain initializing a local implementation detail. As a real example, I added: ...a Mali compiler unit test ...that called bi_imm_f16() to construct an FP16 immediate ...that calls _mesa_float_to_half internally ...that calls util_get_cpu_caps internally, but only on x86_64! ...that relies on util_cpu_detect having been called before. As a consequence, this unit test: ...crashes on x86_64 with USE_X86_64_ASM set ...passes on every other architecture ...works on my local arm64 workstation and on my test board ...failed CI which runs on x86_64 ...needed to have a random util_cpu_detect() call sprinkled in. This is a bad design decision. It pollutes the tree with magic, it causes mysterious CI failures especially for non-x86_64 developers, and it is not justified by a micro-optimization. Instead, let's call util_cpu_detect directly from util_get_cpu_caps, avoiding the footgun where it fails to be called. This cleans up Mesa's design, simplifies the tree, and avoids a class of a (possibly platform-specific) failures. To mitigate the added overhead, wrap it all in a (fast) atomic load check and declare the whole thing as ATTRIBUTE_CONST so the compiler will CSE calls to util_cpu_detect. Co-authored-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15580>	2022-04-20 18:44:35 +00:00
Yonggang Luo	d9c3601e29	util: trim trailing space for files src/util/*/ Using the following bash script doing that ``` cd src/util find . -type f -print0 | xargs -0 -n1 sed -i 's/[ \t]*$//' ``` Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15093>	2022-03-21 17:57:15 +00:00
Marius Hillenbrand	a46d155329	util/cpu_detect, gallium: use cpu_family CPU_S390X instead of separate flag to also get rid of the additional function that I introduced before. Fixes: `82b261417e` ("util/cpu_detect: Add flag for IBM Z (s390x)") Signed-off-by: Marius Hillenbrand <mhillen@linux.ibm.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13958>	2021-11-25 12:57:20 +00:00
Marius Hillenbrand	82b261417e	util/cpu_detect: Add flag for IBM Z (s390x) As preparation for changing the behavior of LLVMpipe on IBM Z, add a flag to detect that platform. As it is always known at compile-time, we do not add it to the struct for cpu flags to avoid inflating that struct's size. Signed-off-by: Marius Hillenbrand <mhillen@linux.ibm.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13927>	2021-11-23 17:49:02 +00:00
Marek Olšák	386e5371a7	util/cpu_detect: add/guess support for next Zen CPUs so that we don't have to update this anymore Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12335>	2021-08-31 22:29:21 +00:00
suijingfeng	88b234d7a7	gallivm: add basic mips64 support and set mcpu to mips64r5 on ls3a4000 The ls3a4000 and ls2k1000 CPUs are mips64r5 compatible with the MSA SIMD instruction set implemented, while ls3a3000 is mips64r2 compatible only. Due to lacking llvm support for loongson CPUs, llvm::sys::getHostCPUName() returns "generic" on all loongson mips CPUs. So we override the MCPU to mips64r5 if MSA is implemented, and fall back to mips64r2 for all others. Reviewed-by: Adam Jackson <ajax@redhat.com> Signed-off-by: suijingfeng <suijingfeng@loongson.cn> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11955>	2021-07-21 13:14:05 +00:00
Ian Romanick	59ca535576	util: Use maximum number of CPUs for determining cache topology This prevents problems when some CPUs are offline. In a four CPU system, if CPUs 1 and 2 are offline, the cache topology code would only examine CPUs 0 and 1... giving incorrect information. The types are changed to int16_t so that the offset of num_L3_caches does not change. This triggered a STATIC_ASSERT failure: STATIC_ASSERT(offsetof(struct util_cpu_caps_t, num_L3_caches) == 5 * sizeof(uint32_t)); I'm assuming there's some assembly code or something that depends on this offset, and I don't feel like messing with it. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11228>	2021-06-15 20:01:53 +00:00
Marek Olšák	48d2ac4e88	util: fix (re-enable) L3 cache pinning cores_per_L3 was uninitialized, so it was always disabled. Remove the variable and do it differently. Fixes: `11d2db17c5` - util: rework AMD cpu L3 cache affinity code. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10526>	2021-05-04 01:02:07 -04:00
Dave Airlie	f7acdb1d1d	st/glthread: allow for invalid L3 cache id. If we get 0xffffffff consider L3 cache info invalid and don't continue. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4496 Fixes: `d8ea509965` ("util: completely rewrite and do AMD Zen L3 cache pinning correctly") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9782>	2021-03-29 08:31:09 +00:00
Rob Clark	a9618e7c42	util: Add accessor for util_cpu_caps In release builds, there should be no change, but in debug builds the assert will help us catch undefined behavior resulting from using util_cpu_caps before it is initialized. With fix for u_half_test for MSVC from Jesse Natalie squashed in. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9266>	2021-02-26 18:31:19 +00:00
Marek Olšák	a0467b7fa1	util: replace UTIL_MAX_CPUS by util_cpu_caps.num_cpu_mask_bits to reduce overhead when setting thread affinity. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8017>	2021-01-05 03:47:16 +00:00
Marek Olšák	e4fa7c440d	util: add AMD CPU family enums and enable L3 cache pinning on Zen3 Based on: https://en.wikichip.org/wiki/amd/cpuid The only reason it's nominated as a fix is because Zen3 might underperform because the CPU detection ignored it. Fixes: `15fa2c5e35` - gallium/u_cpu_detect: get the number of cores per L3 cache for AMD Zen Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8225>	2021-01-05 02:43:55 +00:00
Marek Olšák	d8ea509965	util: completely rewrite and do AMD Zen L3 cache pinning correctly This queries the CPU cache topology correctly. Acked-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7054>	2020-10-30 05:07:57 +00:00
Lionel Landwerlin	e2d761de03	util: drop final reference to p_compiler.h Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Eric Engestrom <eric.engestrom@intel.com>	2019-08-09 22:59:43 +03:00
Dylan Baker	fb02bd3d1c	util: move u_cpu_detect to util CC: vlee@freedesktop.org Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107870 Fixes: `80825abb5d` ("move u_math to src/util") Tested-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-10-30 14:32:52 -07:00

16 Commits
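
Commit 1b8a43a0ba describes moving CPU detection behind the capability accessor, guarded by a fast atomic load so that callers no longer need to initialize anything themselves. A minimal sketch of that pattern follows. It is not Mesa's actual code: every `example_*` name, the struct fields, and the placeholder values are hypothetical, and `pthread_once` is assumed here as the one-time-initialization primitive.

```c
#include <assert.h>
#include <pthread.h>
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical stand-in for a CPU capability struct. */
struct example_cpu_caps {
   int nr_cpus;
   bool has_sse2;
};

static struct example_cpu_caps example_caps;
static atomic_bool example_detect_done = false;
static pthread_once_t example_once = PTHREAD_ONCE_INIT;
static int example_detect_calls = 0; /* demonstration only: counts detections */

/* Runs at most once; a real implementation would query the CPU and OS here. */
static void
example_cpu_detect_once(void)
{
   example_caps.nr_cpus = 1;      /* placeholder value */
   example_caps.has_sse2 = false; /* placeholder value */
   example_detect_calls++;

   /* Publish the filled-in struct to other threads. */
   atomic_store_explicit(&example_detect_done, true, memory_order_release);
}

/* Accessor: callers never have to remember to run detection first. */
const struct example_cpu_caps *
example_get_cpu_caps(void)
{
   /* Fast path: a single atomic load once detection has completed. */
   if (!atomic_load_explicit(&example_detect_done, memory_order_acquire))
      pthread_once(&example_once, example_cpu_detect_once);

   assert(atomic_load_explicit(&example_detect_done, memory_order_acquire));
   return &example_caps;
}
```

With this shape, a unit test that reaches the accessor indirectly gets initialized capabilities on every architecture, which is the failure mode the commit message describes. The commit additionally mentions marking the accessor `ATTRIBUTE_CONST` so the compiler can merge repeated calls; that annotation is omitted from this sketch.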