KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Emil Velikov	abe110df01	radv: use correct .specVersion for extensions Analogous to previous commit. Cc: "13.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com> (IRC)	2016-11-09 21:36:36 +00:00
Emil Velikov	190bae7685	amd/addrlib: limit fastcall/regparm to GCC i386 The use of regparm causes an error on arm/arm64 builds with clang. fastcall is allowed, but still throws a warning. As both options only have effect on 32-bit x86 builds, limit them to that case. v2: keep the __i386__ within GCC (Nicolai) Cc: 13.0 <mesa-stable@lists.freedesktop.org> Cc: Rob Herring <robh@kernel.org> Cc: Nicolai Hähnle <nhaehnle@gmail.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Rob Herring <robh@kernel.org>	2016-11-09 21:36:35 +00:00
Dave Airlie	fb50245ac1	radv: fix GetFenceStatus for signaled fences if a fence is created pre-signaled we should return that in GetFenceStatus even if it hasn't been submitted. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Gustaw Smolarczyk <wielkiegie@gmail.com> Cc: "13.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-11-09 19:49:26 +00:00
Dave Airlie	3c9af7578f	radv: enable conditional discard optimisation on radv. This fixes a bunch of GPU hangs introduced in some CTS tests like dEQP-VK.memory.pipeline_barrier.host_write_uniform_buffer.65536 It works around an issue seen in the LLVM backend, but also makes the radv code work more like the radeonsi stack. Cc: "13.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-11-10 05:46:49 +10:00
Dave Airlie	dd77faeca2	ac/nir: add support for discard_if intrinsic (v2) We are going to start lowering to this in NIR code, so prepare radv for it. v2: handle conversion to kilp properly (nha) Cc: "13.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-11-10 05:46:20 +10:00
Dave Airlie	bafc75b437	radv: emit correct last export when Z/stencil export is enabled I was getting a random GPU hang in the renderpass simple tests, it turns out sometimes radv emitted the wrong thing "last". This fixes the logic to emit Z/stencil last if they occur, and not mark a color output as last. Also this relies on the Z/STENCIL being the first two fragment outputs, which they are so yay. Fixes: dEQP-VK.renderpass.simple.color_depth (random hangs) Cc: "13.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-11-09 06:05:03 +10:00
Mauro Rossi	0148313ea3	android: amd/common: add support for libmesa_amd_common Fixes the following building error introduced with commit `7115e56` and related amd/common dependencies: external/mesa/src/gallium/drivers/radeonsi/si_shader.c:6861: error: undefined reference to 'ac_is_sgpr_param' external/mesa/src/gallium/drivers/radeonsi/si_shader.c:6951: error: undefined reference to 'ac_is_sgpr_param' clang++: error: linker command failed with exit code 1 (use -v to see invocation) ninja: build stopped: subcommand failed. build/core/ninja.mk:148: recipe for target 'ninja_wrapper' failed make: *** [ninja_wrapper] Error 1 Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2016-11-05 18:42:29 +01:00
Nicolai Hähnle	908100cfae	amd/common: add ac_is_sgpr_param helper Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-11-03 10:06:27 +01:00
Nicolai Hähnle	2ff5df8f50	amd/common: build also for gallium drivers At least when LLVM is used, which is basically always (unless you're only building r600 without OpenCL). Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-11-03 10:06:24 +01:00
Nicolai Hähnle	8eabee9ec0	amd/common: move llvm helper prototype to ac_llvm_util.h Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-11-03 10:05:46 +01:00
Fredrik Höglund	e7b9c5eb74	radv: add support for anisotropic filtering on VI+ Ported from radeonsi. Cc: "13.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-11-03 08:27:21 +10:00
Dave Airlie	73592b9284	radv: fix dual source blending Dolphin tried to use this, but we hadn't had any tests for it properly. All that is required is the shader output format needs to be set for 0 and 1 exports. Cc: "13.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-11-03 08:26:51 +10:00
Dave Airlie	9f0726f3e5	radv: expose xlib platform extension I missed this when I added the xlib code, this allows dolphin emu to start and crash later. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Cc: "13.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-11-02 10:00:38 +10:00
Marek Olšák	d3244c47ce	amd: fix a typo in PIXEL_PIPE_STAT_RESET definition Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-11-01 22:33:13 +01:00
Dave Airlie	f88ea8c72a	radv: drop some unused cmask info members. These were assigned but never used. Inspired by similiar patch in radeonsi. Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-11-01 15:11:35 +10:00
Fredrik Höglund	044ef54d65	radv: split the device local memory heap into two Advertise two device local memory heaps; one that is host visible and one that is not. This makes it possible for clients to tell how much host visible vs. non-host visible memory is available. Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-28 12:27:49 +10:00
Fredrik Höglund	c9675b4e17	radv: add a write-combining host-local memory type Add the new memory type between the two device-local types. This makes the list of supported memory types look like this: 1) DEVICE_LOCAL \| \| \| 2) \| HOST_VISIBLE \| HOST_COHERENT \| 3) DEVICE_LOCAL \| HOST_VISIBLE \| HOST_COHERENT \| 4) \| HOST_VISIBLE \| HOST_COHERENT \| HOST_CACHED With this order a client that searches for a HOST_VISIBLE and HOST_COHERENT memory type using the algorithm described in section 10.2 of the Vulkan specification (revision 32) will find the host- local memory type first. A client that requires the memory type to be HOST_VISIBLE and HOST_COHERENT, but not DEVICE_LOCAL is most likely searching for a memory type suitable for staging buffers / images. Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-28 12:27:46 +10:00
Dave Airlie	d548fa882b	radv/ac/llvm: trim texture return values The intrinsic engine asserts in llvm due to this, as we put a vec4 into a vec1, and the next instruction isn't expecting it. So trim the vector at the end before inserting it. Reported-by: Christoph Haag <haagch+mesadev@frickel.club> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Cc: "13.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-27 11:42:03 +10:00
Marek Olšák	edf56fb428	gallium/radeon: fix a ZPASS comment, EVENT_WRITE_EOP fixups Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-26 13:02:58 +02:00
Timothy Arceri	e1af20f18a	nir/i965/anv/radv/gallium: make shader info a pointer When restoring something from shader cache we won't have and don't want to create a nir_shader this change detaches the two. There are other advantages such as being able to reuse the shader info populated by GLSL IR. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-10-26 14:29:36 +11:00
Fredrik Höglund	0a153f4ee4	radv: mark the fence as submitted and signalled in vkAcquireNextImageKHR This stops the debug layers from complaining when fences are used to throttle image acquisition. Cc: "13.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-26 12:25:35 +10:00
Matt Turner	14aac061e9	radv: Replace "abi_versions" with correct "api_version". git history shows "abi_versions" was used from the outset. Cc: <mesa-stable@lists.freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=98415 Reviewed-by: Dave Airlie <airlied@redhat.com>	2016-10-25 12:55:39 -07:00
Dave Airlie	a969548f59	radv: allow cmask transitions without fast clear This fixes dEQP-VK.pipeline.multisample.sampled_image* These all render to multisampled image, and then sample from it, so we must transition it correctly, since we have a cmask and fmask this will cause the correct transition. Cc: "13.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-24 11:03:09 +10:00
Dave Airlie	d842546ad1	radv: use emit_icmp for samples_identical On a debug llvm build we'd assert on the next compare when the return from samples_identical was i1 instead of i32. Cc: "13.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-20 01:43:55 +01:00
Dave Airlie	86c4575a81	radv: decompress fmask before reading using texture unit Before we can read the fmask using the compute shader, we need to decompress the fmask in place. This fixes a bunch of remaining failure and hopefully multisampling in Talos.	2016-10-19 17:39:47 +10:00
Dave Airlie	67c91ef2a2	radv: fix samples_identical return value. This was returning an inversion, so not doing as it should have. We need to compare the fmask value with 0, and return the result from that.	2016-10-19 17:39:01 +10:00
Dave Airlie	93ba86c307	radv: fix wsi porting regression in swapchain destroy. The code in anv is right, there's a pending patch to fix this up different, but I'll sync the code for now.	2016-10-19 13:54:49 +10:00
Dave Airlie	63406b669e	radv: fix fmask ptr issue We were using the wrong descriptor in the fmask picking code.	2016-10-19 13:16:25 +10:00
Dave Airlie	db7ae14b60	radv: simplify fast clear shaders There is no need for anything but a noop shader here.	2016-10-19 13:16:14 +10:00
Dave Airlie	b0e11a153c	radv: start using defines for the user sgpr offsets This adds some comments and adds defines for the user sgprs, so that we can move them around easier later and not have to change/revalidate every one of these. Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-19 10:17:48 +10:00
Dave Airlie	6c3bd1cdb3	radv: port to common wsi codebase This drops all the radv WSI code in favour of using the new shared code that was ported from anv This regresses Talos for now, Jason has pointed out the bug is in Talos and we should wait for them to fix it. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-10-19 10:15:43 +10:00
Dave Airlie	32d70c0d66	radv/anv/wsi: drop unneeded parameter Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-10-19 10:15:42 +10:00
Dave Airlie	e4df1830e4	radv: drop pointless struct decl. Acked-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-19 09:05:26 +10:00
Dave Airlie	4450f40519	radv: move to using shared vk_alloc inlines. This moves to the shared vk_alloc inlines for vulkan memory allocations. Acked-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-19 09:05:26 +10:00
Dave Airlie	c6f1077e0d	radv: drop local MIN/MAX macros. Use the ones in macros.h instead. Acked-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-19 09:05:25 +10:00
Dave Airlie	f5daaba0fd	radv: make use of shared vector helper. This removes the vector code from radv in favour of sharing code with anv. Acked-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-19 09:05:25 +10:00
Gustaw Smolarczyk	36cb5508e8	radv/winsys: Fail early on overgrown cs. When !use_ib_bos, we can't easily chain ibs one to another. If the required cs size grows over 1Mi - 8 dwords just fail the cs so that we won't assert-fail in radv_amdgpu_winsys_cs_submit later on. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-10-16 12:38:53 +02:00
Emil Velikov	3fd0cafc1c	radv: move AMDGPU_LIBS later in the link chain At the moment (albeit unlikely) one could get link-time issues, since libdrm_amdgpu.so is before it's users in the link chain. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-10-14 11:09:00 +01:00
Emil Velikov	a8a5f0a025	radv: correct variable name VISIBILITY_{, C}FLAGS The letter C was missing, thus in turn all the internal symbols were exported. As a result we hide ~150 symbols and cut ~36K from libvulkan_radeon.so. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-10-14 11:09:00 +01:00
Emil Velikov	753a9c989f	amd/addrlib: hide private symbols via VISIBILITY_CXXFLAGS Private/internal symbols should not be exported. Using the CXXFLAGS cuts ~300 exported symbols and ~23K from libvulkan_radeon.so. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-10-14 11:09:00 +01:00
Dave Airlie	47a7d86fe9	radv: fix the wayland wsi busy bit Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-14 05:10:02 +10:00
Tom Stellard	5c66d46d6a	radv: Use new image load/store intrinsic signatures v2 These were changed in LLVM r284024. v2: - Only use float types for vdata of llvm.amdgcn.image.store. LLVM doesn't support integer types for this intrinsic. Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-14 04:48:11 +10:00
Tom Stellard	30e63fb0e4	radv: Fix incorrect comment Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-14 04:48:11 +10:00
Dave Airlie	060e6f468a	radv: fix identity swizzle handling The identity swizzle should operate exactly like an .r = R, .g = G, .b = B, .a = A swizzle. This fixes a bunch of the 16-bit BGRA blit tests dEQP-VK.api.copy_and_blit.blit_image.all_formats.b4g4r4a4* Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-14 04:45:57 +10:00
Dave Airlie	8bdac874e6	radv/wsi: fix app that acquire multiple images up front dota2 does multiple acquires followed by multiple queues, this bug manifested itself as a hang in the xshmfence code randomly when dota2 was doing it's menus. It also occured when running dota2 under phoronix-test-suite. The fix is once the image is acquired to mark it busy then so nobody else can acquire. We have to trust vulkan apps that they will eventually submit it. Acked-by: Edward O'Callaghan <funfunctor@folklore1984.net> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-14 04:45:11 +10:00
Nicolas Koch	35e2bfa6d9	radv: Return correct result in EnumeratePhysicalDevices If pPhysicalDevices is too small for all physical devices, the driver must return VK_INCOMPLETE. Since only a single physical device is supported, this is only the case when pPhysicalDeviceCount == 0 && pPhysicalDevices != NULL. Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-13 09:11:13 +10:00
Emil Velikov	3c419a941a	radv: add all headers to the sources list Otherwise they'll be missing from the tarball and the build will fail. Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2016-10-12 18:55:20 +01:00
Edward O'Callaghan	cfbf956dfd	radv: trivial case stmt style fixups Relocate a 'default:' to the end of a case stmt and fix an indent issue. Signed-off-by: Edward O'Callaghan <funfunctor@folklore1984.net> Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2016-10-12 20:12:43 +11:00
Gustaw Smolarczyk	c3f3c6b0e8	radv/winsys: Fix radv_amdgpu_cs_grow min_size argument. (v2) It's supposed to be how much at least we want to grow the cs, not the minimum size of the cs after growth. v2: Unbreak use_ib_bos. Don't mask the ib_size when !use_ib_bos, since it's not needed. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-12 09:06:30 +10:00
Grigori Goronzy	a22b5f28fb	radv: fix strict aliasing violation Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-12 09:00:22 +10:00
Grigori Goronzy	0b539abcf4	radv: fix uninitialized variables This gets rid of "may be used uninitialized" compiler warnings. Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-12 09:00:22 +10:00
Grigori Goronzy	7ca44f8a33	radv: add missing unreachable Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-12 09:00:22 +10:00
Dave Airlie	8cc9f89d26	radv: remove the validation layer and some related bits. As pointed out by Emil this isn't used in anv anymore, and it was totally unused in radv anyways. Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-12 08:57:09 +10:00
Dave Airlie	014ec78fb2	radv: drop entrypoint split out. radv really doesn't need different dispatch per gen yet, there really isn't that many differences yet. Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-12 08:56:41 +10:00
Dave Airlie	12301c5418	radv: drop the RADV_CALL macro. This is leftover from anv, and we really never needed it. Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-12 08:56:41 +10:00
Dave Airlie	fc28f89157	radv: check driver name before calling amdgpu. This checks the kernel driver name is amdgpu before calling libdrm_amdgpu. This avoids the following error: amdgpu_device_initialize: DRM version is 1.6.0 but this driver is only compatible with 3.x.x when run on a machine with i915 graphics as well as amdgpu. Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-12 08:56:41 +10:00
Dave Airlie	6215b47648	radv: fix memory leak from physical device if wsi fails Inspired by patch from Edward O'Callaghan <funfunctor@folklore1984.net> which didn't do it right. Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-12 08:53:44 +10:00
Edward O'Callaghan	e0641c61ca	radv/winsys: Fix mem leak at failed do_winsys_init() call site Probably unlikely however ensure we don't leak a heap allocation on the fail path. V.2: also fix missing 'amdgpu_device_deinitialize()' calls (Emil Velikov). Signed-off-by: Edward O'Callaghan <funfunctor@folklore1984.net> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-12 08:46:10 +10:00
Edward O'Callaghan	4a0db58f14	radv/winsys: Trivial style and readability fixups Drop/add a few newlines where appropriate and drop a couple of unnessary braces. Signed-off-by: Edward O'Callaghan <funfunctor@folklore1984.net> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-12 08:24:50 +10:00
Emil Velikov	0b54c022a8	radv: automake: move libamdgpu_addrlib.la to VULKAN_LIB_DEPS The static library is analogous to the intel ISL, which is required for both hardware and (to be added) testing library. Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2016-10-11 13:51:09 +01:00
Emil Velikov	4882476eca	radv: automake: remove unused variables Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2016-10-11 13:51:08 +01:00
Emil Velikov	e2cb253346	radv: automake: include the python scripts/formats table in the tarball Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2016-10-11 13:51:06 +01:00
Edward O'Callaghan	ba43768a1e	radv: Use proper header guards over 'pragma once' directives Signed-off-by: Edward O'Callaghan <funfunctor@folklore1984.net> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com>	2016-10-10 16:10:56 +11:00
Dave Airlie	85a47f647e	radv: drop all uint for unsigned. Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-07 12:09:13 +10:00
Gustaw Smolarczyk	24815bd7b3	radv: Skip already signalled fences. If the user created a fence with VK_FENCE_CREATE_SIGNALED_BIT set, we shouldn't fail to wait for a fence if it was not submitted since that is not necessary. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-07 09:24:09 +10:00
Dave Airlie	f4e499ec79	radv: add initial non-conformant radv vulkan driver This squashes all the radv development up until now into one for merging. History can be found: https://github.com/airlied/mesa/tree/semi-interesting This requires llvm 3.9 and is in no way considered a conformant vulkan implementation. It can run a number of vulkan applications, and supports all GPUs using the amdgpu kernel driver. Thanks to Intel for providing anv and spirv->nir, and Emil Velikov for reviewing build integration. Parts of this are: Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Acked-by: Edward O'Callaghan <funfunctor@folklore1984.net> Authors: Bas Nieuwenhuizen and Dave Airlie Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-07 09:16:09 +10:00
Emil Velikov	03350c9708	amd: add amd_kernel_code_t.h to the sources list Otherwise it won't be picked in the tarball and the build will fail. Fixes: `91ec6e5664` ("radeonsi/compute: Use the HSA abi for non-TGSI compute shaders v3") Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2016-10-06 16:17:51 +01:00
Tom Stellard	91ec6e5664	radeonsi/compute: Use the HSA abi for non-TGSI compute shaders v3 This patch switches non-TGSI compute shaders over to using the HSA ABI described here: https://github.com/RadeonOpenCompute/ROCm-Docs/blob/master/AMDGPU-ABI.md The HSA ABI provides a much cleaner interface for compute shaders and allows us to share more code in the compiler with the HSA stack. The main changes in this patch are: - We now pass the scratch buffer resource into the shader via user sgprs rather than using relocations. - Grid/Block sizes are now passed to the shader via the dispatch packet rather than at the beginning of the kernel arguments. Typically for HSA, the CP firmware will create the dispatch packet and set up the user sgprs automatically. However, in Mesa we let the driver do this work. The main reason for this is that I haven't researched how to get the CP to do all these things, and I'm not sure if it is supported for all GPUs. v2: - Add comments explaining why we are setting certain bits of the scratch resource descriptor. v3: - Use amdgcn-mesa-mesa3d triple instead of amdgcn--mesa3d. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-09-16 23:07:10 +00:00
Mauro Rossi	6b9d7e69ee	android: add support for libmesa_amdgpu_addrlib Android porting of the following commits: `f1f1ba3` "radeonsi: move sid.h/r600d_common.h to a common place." `69fca64` "amd/addrlib: move addrlib from amdgpu winsys to common code" This patch fixes android building errors Reviewed-by: Dave Airlie <airlied@redhat.com>	2016-09-13 10:06:04 +10:00
Dave Airlie	69fca64259	amd/addrlib: move addrlib from amdgpu winsys to common code Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-09-06 10:06:33 +10:00
Dave Airlie	a86be7b6ad	radeon: move radeon_family/chip_class defintions to common This just moves these to a common header file. Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-09-06 10:06:04 +10:00
Dave Airlie	f1f1ba3781	radeonsi: move sid.h/r600d_common.h to a common place. Step one to merging radv would be to move some files around. This only adds the include path to r600/radeonsi, because later we want to avoid having to add it to the generic target paths. Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-09-06 10:05:13 +10:00

1 2 3

122 Commits