KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Marek Olšák	d50bef9831	winsys/amdgpu: remove amdgpu_drm.h definitions trivial	2019-01-30 12:38:56 -05:00
Marek Olšák	e0a6399eb4	winsys/amdgpu: rename rfence, rsrc, rdst -> afence, asrc, adst Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-01-22 12:26:45 -05:00
Marek Olšák	c02f761bdf	winsys/amdgpu: use the new BO list API	2019-01-22 11:59:27 -05:00
Michel Dänzer	9d8395bf0e	winsys/amdgpu: Pull in LLVM CFLAGS Fixes build failure if the LLVM headers aren't in a standard include directory. Fixes: `ec22dd34c8` "radeonsi: move SI_FORCE_FAMILY functionality to winsys" Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-12-19 17:54:18 +01:00
Nicolai Hähnle	ec22dd34c8	radeonsi: move SI_FORCE_FAMILY functionality to winsys This helps some debugging cases by initializing addrlib with slightly more appropriate settings. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-12-19 12:01:25 +01:00
Marek Olšák	39b20b7d4f	Revert "winsys/amdgpu: overallocate buffers for faster address translation on Gfx9" I didn't mean to push this. I don't think it makes any difference. This reverts commit `f737fe00a0`.	2018-11-29 14:46:06 -05:00
Nicolai Hähnle	776b911365	amd/addrlib: update Mesa's copy of addrlib Update to the internal master as of 2018-11-15. This has a lot of gratuitous whitespace change, but on the plus side it's built using the same tooling that's used for AMDVLK, which should help going forward.	2018-11-29 13:18:24 +01:00
Marek Olšák	c1d3c08699	winsys/amdgpu: add support for allocating GDS and OA resources Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-28 20:20:27 -05:00
Marek Olšák	72b2b61d8c	winsys/amdgpu: use optimal VM alignment for CPU allocations Acked-by: Christian König <christian.koenig@amd.com>	2018-11-28 20:20:27 -05:00
Marek Olšák	27f9935075	winsys/amdgpu: use optimal VM alignment for imported buffers Window system buffers didn't use the optimal alignment. Acked-by: Christian König <christian.koenig@amd.com>	2018-11-28 20:20:27 -05:00
Marek Olšák	6b554d863f	winsys/amdgpu,radeon: pass vm_alignment to buffer_from_handle Acked-by: Christian König <christian.koenig@amd.com>	2018-11-28 20:20:27 -05:00
Marek Olšák	f737fe00a0	winsys/amdgpu: overallocate buffers for faster address translation on Gfx9 Sadly, the 3 games I tested (DeusEx:MD, DiRT Rally, DOTA 2) are unaffected by the overallocation, because I guess their buffers don't fall into the small range below a power-of-two size. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-28 20:20:27 -05:00
Marek Olšák	8c00f778fc	winsys/amdgpu: increase the VM alignment to the MSB of the size for Gfx9 Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-28 20:20:27 -05:00
Marek Olšák	a2a6b06d48	winsys/amdgpu: use >= instead of > for VM address alignment Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-28 20:20:27 -05:00
Marek Olšák	98f2312b4f	winsys/amdgpu: clean up code around BO VM alignment Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-28 20:20:27 -05:00
Marek Olšák	5f9ccf827e	winsys/amdgpu: optimize slab allocation for 2 MB amdgpu page tables - the slab buffer size increased from 128 KB to 2 MB (PTE fragment size) - the max suballocated buffer size increased from 64 KB to 256 KB, this increases memory usage because it wastes memory - the number of suballocators increased from 1 to 3 and they are layered on top of each other to minimize unused space in slabs The final increase in memory usage is: DeusEx:MD: 1.8% DOTA 2: 1.75% DiRT Rally: 0.2% The kernel driver will also receive fewer buffers.	2018-11-28 20:20:27 -05:00
Marek Olšák	cf6835485c	radeonsi: generalize the slab allocator code to allow layered slab allocators There is no change in behavior. It just makes it easier to change the number of slab allocators.	2018-11-28 20:20:27 -05:00
Marek Olšák	9576266a37	winsys/amdgpu: always reclaim/release slabs if there is not enough memory	2018-11-28 20:20:27 -05:00
Nicolai Hähnle	eb94b6bd5c	winsys/amdgpu: explicitly declare whether buffer_map is permanent or not Introduce a new driver-private transfer flag RADEON_TRANSFER_TEMPORARY that specifies whether the caller will use buffer_unmap or not. The default behavior is set to permanent maps, because that's what drivers do for Gallium buffer maps. This should eliminate the need for hacks in libdrm. Assertions are added to catch when the buffer_unmap calls don't match the (temporary) buffer_map calls. I did my best to update r600 for consistency (r300 needs no changes because it never calls buffer_unmap), even though the radeon winsys ignores the new flag. As an added bonus, this should actually improve the performance of the normal fast path, because we no longer call into libdrm at all after the first map, and there's one less atomic in the winsys itself (there are now no atomics left in the UNSYNCHRONIZED fast path). Cc: Leo Liu <leo.liu@amd.com> v2: - remove comment about visible VRAM (Marek) - don't rely on amdgpu_bo_cpu_map doing an atomic write Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-11-28 18:24:14 +01:00
Nicolai Hähnle	35eb81987c	winsys/amdgpu: add amdgpu_winsys_bo::lock We'll use it in the upcoming mapping change. Sparse buffers have always had one. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-11-28 18:23:29 +01:00
Marek Olšák	d4e7d8b7f0	winsys/amdgpu: fix a device handle leak in amdgpu_winsys_create Cc: 18.2 18.3 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-23 17:08:44 -05:00
Marek Olšák	82aa07f81f	winsys/amdgpu: fix a buffer leak in amdgpu_bo_from_handle Cc: 18.2 18.3 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-11-23 17:08:42 -05:00
Marek Olšák	d2b2364313	radeonsi: stop command submission with PIPE_CONTEXT_LOSE_CONTEXT_ON_RESET only Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-11-09 14:55:04 -05:00
Michel Dänzer	32b0eb51a3	winsys/amdgpu: Stop using amdgpu_bo_handle_type_kms_noimport It only behaves any different from amdgpu_bo_handle_type_kms with libdrm 2.4.93, and it breaks if an older version is picked up. Bugzilla: https://bugs.freedesktop.org/108096 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-11-07 17:37:47 +01:00
Boyuan Zhang	97c473bb29	winsys/amdgpu: add vcn jpeg cs support Add vcn jpeg cs support, align cs by no-op. Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com>	2018-10-23 08:50:02 -04:00
Marek Olšák	25ffb84016	radeonsi: pin the winsys thread to the requested L3 cache (v2) v2: rebase Reviewed-by: Brian Paul <brianp@vmware.com>	2018-09-07 16:03:36 -04:00
Timothy Arceri	5566dd8a61	radeonsi: add radeonsi_zerovram driconfig option More and more games seem to require this so lets make it a config option. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-08-30 07:57:38 +10:00
Marek Olšák	461a864316	winsys/amdgpu: pass the BO list via the CS ioctl on DRM >= 3.27.0	2018-08-03 18:35:19 -04:00
Marek Olšák	20dd75a926	radeonsi: use storage_samples instead of color_samples in most places and use pipe_resource::nr_storage_samples instead of r600_texture::num_color_samples. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-07-31 18:28:41 -04:00
Marek Olšák	565dacc3d6	winsys/amdgpu: remove RADEON_SURF_FMASK leftover RADEON_SURF_FMASK is never set.	2018-07-19 00:58:51 -04:00
Marek Olšák	51d6b163da	winsys/amdgpu: fix VDPAU interop by having one amdgpu_winsys_bo per BO (v2) Dependencies between rings are inserted correctly if a buffer is represented by only one unique amdgpu_winsys_bo instance. Use a hash table keyed by amdgpu_bo_handle to have exactly one amdgpu_winsys_bo per amdgpu_bo_handle. v2: return offset and stride properly Tested-by: Leo Liu <leo.liu@amd.com> Acked-by: Leo Liu <leo.liu@amd.com>	2018-07-18 11:56:28 -04:00
Marek Olšák	e06b8ec106	winsys/amdgpu: use a better hash_pointer function Tested-by: Leo Liu <leo.liu@amd.com> Acked-by: Leo Liu <leo.liu@amd.com>	2018-07-18 11:56:28 -04:00
Marek Olšák	53684e9163	winsys/amdgpu: clean up error handling in amdgpu_bo_from_handle Tested-by: Leo Liu <leo.liu@amd.com> Acked-by: Leo Liu <leo.liu@amd.com>	2018-07-18 11:56:28 -04:00
Marek Olšák	a73e3d5e00	winsys/amdgpu: shorten bo->ws in amdgpu_bo_destroy Tested-by: Leo Liu <leo.liu@amd.com> Acked-by: Leo Liu <leo.liu@amd.com>	2018-07-18 11:56:28 -04:00
Marek Olšák	f8aa116c3c	winsys/amdgpu: clean up error handling in amdgpu_cs_submit_ib Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-07-16 13:32:33 -04:00
Marek Olšák	6b1e0e51e6	radeonsi: rework RADEON_PRIO flags to be <= 31 This decreases sizeof(struct amdgpu_cs_buffer) from 24 to 16 bytes. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-07-16 13:32:33 -04:00
Marek Olšák	342fff6cbc	winsys/amdgpu: use alloca when using global_bo_list Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-07-16 13:32:33 -04:00
Marek Olšák	6ec44b7055	winsys/amdgpu: remove label bo_list_error Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-07-16 13:32:33 -04:00
Marek Olšák	7346e5296e	winsys/amdgpu: always update gfx_bo_list_counter Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-07-16 13:32:33 -04:00
Marek Olšák	caf41fb96d	winsys/amdgpu: make amdgpu_cs_context::flags & handles local Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-07-16 13:32:33 -04:00
Marek Olšák	7fab8a4b37	Shorten u_queue names There is a 15-character limit for thread names shared by the queue name and process name. Shorten the thread name to make space for the process name. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2018-07-04 22:03:35 -04:00
Grazvydas Ignotas	f966929805	radeonsi: add a debug flag to zero vram allocations This allows to avoid having to see garbage in Dying Light loading screen at least, which probably expects Windows/NV behavior of all allocations being zeroed by default. Analogous to radv flag with the same name. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-06-21 12:18:50 +03:00
Marek Olšák	6703fec58c	amd,radeonsi: rename radeon_winsys_cs -> radeon_cmdbuf Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-06-19 13:08:50 -04:00
Dave Airlie	b7ac0779e0	gallium/winsys: rename DRM_API_HANDLE_* to WINSYS_HANDLE_* This just renames this as we want to add an shm handle which isn't really drm related. Originally by: Marc-André Lureau <marcandre.lureau@gmail.com> (airlied: I used this sed script instead) This was generated with: git grep -l 'DRM_API_' \| xargs sed -i 's/DRM_API_/WINSYS_/g' Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-05-30 09:11:53 +10:00
Marek Olšák	f9eb1ef870	amd: remove support for LLVM 4.0 It doesn't support GFX9. Acked-by: Dave Airlie <airlied@redhat.com> Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-05-17 14:54:41 -04:00
Jan Vesely	58272c1ad7	winsys/amdgpu: Destroy dev_hash table when the last winsys is removed. Fixes memory leak on module unload. CC: <mesa-stable@lists.freedesktop.org> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-05-10 23:23:50 -04:00
Marek Olšák	912b0163dc	ac/surface: add EQAA support Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-05-10 18:34:31 -04:00
Marek Olšák	60299e9abe	radeonsi: don't emit partial flushes for internal CS flushes only Tested-by: Benedikt Schemmer <ben@besd.de> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-16 16:58:10 -04:00
Marek Olšák	692f550740	winsys/amdgpu: always set AMDGPU_IB_FLAG_TC_WB_NOT_INVALIDATE There is a kernel patch that adds the new flag. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Benedikt Schemmer <ben@besd.de> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-16 16:58:10 -04:00
Marek Olšák	e29facff31	ac/surface: don't set the display flag for obviously unsupported cases (v2) This enables the tile swizzle for some cases of the displayable micro mode, and it also fixes an addrlib assertion failure on Vega. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2018-04-10 13:06:03 -04:00
Marek Olšák	7d2079908d	winsys/amdgpu: always allow GTT placements on APUs Reviewed-by: Christian König <christian.koenig@amd.com>	2018-03-26 19:23:30 -04:00
Marek Olšák	769603564e	radeonsi: don't reallocate on DMABUF export if local BOs are disabled	2018-03-26 19:22:12 -04:00
Marek Olšák	f7ffa504a0	ac/surface: compute tile swizzle for GFX9 Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-03-21 13:40:06 -04:00
Marek Olšák	a4a113b5bc	winsys/amdgpu: pad compute IBs v2: pad with PKT2 NOPs on SI Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2018-03-08 14:58:16 -05:00
Marek Olšák	75c5d25f0f	radeonsi: align command buffer starting address to fix some Raven hangs Cc: 17.3 18.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2018-03-08 14:58:16 -05:00
Christian König	33633690aa	winsys/amdgpu: request high addresses We now have hopefully fixed all bugs regarding high addresses on Vega10 and Raven. Start to use the high range to make room for SVM in the low range. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-02-28 13:30:32 +01:00
James Zhu	c6acae22c8	winsys/amdgpu:add uvd hevc enc support in amdgpu cs Support UVD HEVC encode in amdgpu cs Signed-off-by: James Zhu <James.Zhu@amd.com> Reviewed-by: Boyuan Zhang <boyuan.zhang@amd.com>	2018-02-21 13:53:38 -05:00
Marek Olšák	48ecacfefa	winsys/amdgpu: enable 32-bit VM allocations Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-02-17 04:52:17 +01:00
Michal Navratil	4081e08896	winsys/amdgpu: allow non page-aligned size bo creation from pointer Fix INVALID_OPERATION caused by BufferData with target EXTERNAL_VIRTUAL_MEMORY_BUFFER_AMD when the buffer size is not page aligned. Signed-off-by: Marek Olšák <marek.olsak@amd.com> Cc: 17.3 18.0 <mesa-stable@lists.freedesktop.org>	2018-02-06 18:51:12 +01:00
Andres Rodriguez	cc9762d74d	winsys/amdgpu: add support for syncobj signaling v3 Add the ability to signal a syncobj when a cs completes execution. v2: corresponding changes for gallium fence->semaphore rename v3: s/semaphore/fence for pipe objects Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-01-30 15:13:49 -05:00
Marek Olšák	0e40c6a7b7	gallium/radeon: set number of pb_cache buckets = number of heaps Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-01-27 02:09:09 +01:00
Marek Olšák	175549e0e9	pb_cache: let drivers choose the number of buckets Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-01-27 02:09:09 +01:00
Marek Olšák	17423c993d	winsys/amdgpu: fix assertion failure with UVD and VCE rings Cc: 18.0 <mesa-stable@lists.freedesktop.org>	2018-01-26 23:12:11 +01:00
Bas Nieuwenhuizen	5a3404d443	radeonsi: Export signalled sync file instead of -1. -1 is considered an error for EGL_ANDROID_native_fence_sync, so we need to actually create a sync file. Fixes: `f536f45250` "radeonsi: implement sync_file import/export" Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-01-26 01:26:53 +01:00
Dylan Baker	436ed65d38	autotools: include meson build files in tarball This adds the meson.build, meson_options.txt, and a few scripts that are used exclusively by the meson build. v2: - Remove accidentally included changes needed to test make dist with LLVM > 3.9 Signed-off-by: Dylan Baker <dylan.c.baker@intel.com> Acked-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-01-19 16:30:51 -08:00
Marek Olšák	bf0904e31f	winsys/amdgpu: disable local BOs again due to worse performance Cc: 17.3 <mesa-stable@lists.freedesktop.org> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-12-11 19:11:14 +01:00
Marek Olšák	fef51ebcea	winsys/amdgpu: make IBs use read-only memory Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-12-06 15:19:02 +01:00
Nicolai Hähnle	20ccb51ffc	radeonsi: always place sparse buffers in VRAM Together with "radeonsi: fix the R600_RESOURCE_FLAG_UNMAPPABLE check", this ensures that sparse buffers are placed in VRAM. Noticed by an assertion that started triggering with commit `d4fac1e1d7` ("gallium/radeon: enable suballocations for VRAM with no CPU access") Fixes KHR-GL45.sparse_buffer_tests.BufferStorageTest in debug builds. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2017-12-06 11:19:00 +01:00
Marek Olšák	c7f84f6513	winsys/amdgpu: add RADEON_FLAG_READ_ONLY Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-12-05 13:30:34 +01:00
Marek Olšák	9ac5504df5	gallium/radeon: move setting VRAM\|GTT into winsyses The combined VRAM\|GTT heap will be removed. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-12-05 13:30:34 +01:00
Eric Engestrom	13a7a2d455	amd: remove always-true BRAHMA_BUILD define Signed-off-by: Eric Engestrom <eric.engestrom@imgtec.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-12-01 13:49:42 +00:00
Marek Olšák	2c5f2936af	r300,r600,radeonsi: replace RADEON_FLUSH_* with PIPE_FLUSH_* and handle PIPE_FLUSH_HINT_FINISH in r300. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-11-29 18:21:30 +01:00
Boyuan Zhang	c445cdf649	winsys/amdgpu: add vcn enc cs support New cs support is needed for vcn encode Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2017-11-17 12:25:47 -05:00
Dylan Baker	2bfd34c518	meson: don't use build_by_default for specific gallium drivers Using build_by_default : false is convenient for dependencies that can be pulled in by various diverse components of the build system, the gallium hardware/software drivers and state trackers do not fit that description. Instead, these should be guarded using the variable that tracks whether that driver should be enabled. This leaves a few helper libraries: trace, rbug, etc, and the generic winsys bits as `build_by_default : false` because there are a large number of gallium components that pull them in. v2: - remove build_by_default from winsys convenience libs as well. v3: - Always put drivers before winsys for consistency Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com> Tested-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (v1) Reviewed-by: Eric Anholt <eric@anholt.net>	2017-11-13 13:43:12 -08:00
Nicolai Hähnle	e6dbc804a8	winsys/amdgpu: handle cs_add_fence_dependency for deferred/unsubmitted fences The idea is to fix the following interleaving of operations that can arise from deferred fences: Thread 1 / Context 1 Thread 2 / Context 2 -------------------- -------------------- f = deferred flush <------- application-side synchronization -------> fence_server_sync(f) ... flush() flush() We will now stall in fence_server_sync until the flush of context 1 has completed. This scenario was unlikely to occur previously, because applications seem to be doing Thread 1 / Context 1 Thread 2 / Context 2 -------------------- -------------------- f = glFenceSync() glFlush() <------- application-side synchronization -------> glWaitSync(f) ... and indeed they probably have to use this ordering to avoid deadlocks in the GLX model, where all GL operations conceptually go through a single connection to the X server. However, it's less clear whether applications have to do this with other WSI (i.e. EGL). Besides, even this sequence of GL commands can be translated into the Gallium-level sequence outlined above when Gallium threading and asynchronous flushes are used. So it makes sense to be more robust. As a side effect, we no longer busy-wait on submission_in_progress. We won't enable asynchronous flushes on radeon, but add a cs_add_fence_dependency stub anyway to document the potential issue. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-11-09 14:00:22 +01:00
Nicolai Hähnle	222a2fb998	util: move os_time.[ch] to src/util Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-11-09 11:57:21 +01:00
Timothy Arceri	87f02ddfd1	amdgpu: use simple mtx Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-11-09 12:07:48 +11:00
Marek Olšák	7f33e94e43	amd/addrlib: update to latest version This uses C++11 initializer lists. I just overwrote all Mesa files with internal addrlib and discarded hunks that we should probably keep, but I might have missed something. The code depending on ADDR_AM_BUILD is removed. We can add it back next time if needed. Acked-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-11-08 00:55:13 +01:00
Andrey Grodzovsky	19fc3cdcfb	winsys/amdgpu: Add R600_DEBUG flag to reserve VMID per ctx. Fixes reverted patch `f03b7c9` by doing VMID reservation per process and not per context. Also updates required amdgpu libdrm version since the change involved interface updates in amdgpu libdrm. Signed-off-by: Andrey Grodzovsky <andrey.grodzovsky@amd.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2017-11-03 18:06:17 +01:00
Marek Olšák	529cdce799	radeonsi: remove 'Authors:' comments It's inaccurate. Instead, see the copyright and use "git log" and "git blame" to know the authorship. Acked-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-11-02 18:19:03 +01:00
Marek Olšák	1f2640bfa9	Revert "winsys/amdgpu: Add R600_DEBUG flag to reserve VMID per ctx." This reverts commit `f03b7c9ad9`. The libdrm interface is wrong.	2017-11-01 21:42:31 +01:00
Andrey Grodzovsky	f03b7c9ad9	winsys/amdgpu: Add R600_DEBUG flag to reserve VMID per ctx. Signed-off-by: Andrey Grodzovsky <andrey.grodzovsky@amd.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2017-10-31 16:55:24 +01:00
Marek Olšák	0aafedbbb2	radeonsi: add GFX-IB-size query to the HUD It shows the sum of all IBs per frame. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-17 22:03:03 +02:00
Marek Olšák	4d944c72b1	winsys/amdgpu: disable CPU caching for GFX & SDMA IBs This should decrease IB fetch latency. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-17 22:03:03 +02:00
Marek Olšák	49f5ce39c1	winsys/amdgpu: don't do read-modify-write on command buffers i.e. don't use \|= Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-17 22:03:03 +02:00
Dylan Baker	66f97f6640	meson: build radeonsi This builds the radeonsi (and radeon) window system bits and gallium driver bits. Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com> Reviewed-by: Eric Anholt <eric at anholt.net>	2017-10-16 16:32:43 -07:00
Marek Olšák	162502370c	winsys/amdgpu: implement sync_file import/export syncobj is used internally for interactions with command submission. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-12 21:07:41 +02:00
Marek Olšák	4ba20c9473	Revert "winsys/amdgpu: disable local BOs on Raven" This reverts commit `1cda9a2fee`. It works now.	2017-09-12 22:44:02 +02:00
Marek Olšák	a2a326e8f8	winsys/amdgpu: use the new raw CS API This also cleans things up. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-09-11 16:29:52 +02:00
Marek Olšák	3824ca7610	radeonsi: implement pipe_context::fence_server_sync This will be more useful once we have sync_file support. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-09-11 16:29:52 +02:00
Marek Olšák	8843bf6dfd	winsys/amdgpu: factor out some fence dependency code into separate functions Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-09-11 16:29:52 +02:00
Marek Olšák	a6eb164eb2	winsys/amdgpu: rename fence_dependency functions Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-09-11 16:29:52 +02:00
Marek Olšák	7213293fe2	winsys/amdgpu: don't allow interprocess resource sharing for IBs Now we should get IB submissions with bo_list == NULL when DRI buffers aren't referenced. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-09-11 16:29:52 +02:00
Marek Olšák	1cda9a2fee	winsys/amdgpu: disable local BOs on Raven It hangs with a high degree of reproducibility. Acked-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-09-07 12:57:48 +02:00
Christian König	214b565bc2	winsys/amdgpu: set AMDGPU_GEM_CREATE_VM_ALWAYS_VALID if possible v2 When the kernel supports it set the local flag and stop adding those BOs to the BO list. Can probably be optimized much more. v2: rename new flag to AMDGPU_GEM_CREATE_VM_ALWAYS_VALID Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-08-31 14:55:38 +02:00
Marek Olšák	8b3a257851	radeonsi: set a per-buffer flag that disables inter-process sharing (v4) For lower overhead in the CS ioctl. Winsys allocators are not used with interprocess-sharable resources. v2: It shouldn't crash anymore, but the kernel will reject the new flag. v3 (christian): Rename the flag, avoid sending those buffers in the BO list. v4 (christian): Remove setting the kernel flag for now Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-08-31 14:55:21 +02:00
Samuel Pitoiset	0d9117b7bd	winsys/amdgpu: add BO to the global list only when RADEON_ALL_BOS is set Only useful when that debug option is enabled. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-08-30 09:33:59 +02:00
Marek Olšák	113278ee79	radeonsi: remove Constant Engine support We have come to the conclusion that it doesn't improve performance. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-08-22 13:29:47 +02:00
Marek Olšák	1694a8ba8d	gallium/radeon: print all members of radeon_info with R600_DEBUG=info also set max_alignment on amdgpu. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-08-07 21:12:24 +02:00
Marek Olšák	4a758a17da	winsys/amdgpu: enable computation of tile swizzle Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-08-04 02:10:04 +02:00
Marek Olšák	59144d4bf5	ac/surface: increment surf_index only when tile swizzle is allowed Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-08-04 02:10:04 +02:00
Nicolai Hähnle	bc7f41e11d	gallium: add pipe_screen_config to screen_create functions This allows a more generic mechanism for passing user configurations into drivers by accessing the dri options directly. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-08-02 09:50:57 +02:00
Marek Olšák	4cae274116	radeonsi: prevent a deadlock in util_queue_add_job with too many GL contexts If the queue is full, util_queue_add_job will wait while bo_fence_lock is held. It pb_slab wants to reuse a buffer, it will lock the pb_slab mutex and try to check BO fence busyness, but it has to wait for bo_fence_lock to get released. Both bo_fence_lock and pb_slab mutex are locked now. When the CS thread unreferences and releases a suballocated buffer, it will try to lock the pb_slab mutex and has to wait. The CS thread can't finish its job in order to free a queue slot and unblock util_queue_add_job ==> deadlock. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 10:57:25 -04:00
Marek Olšák	aaee0d1bbf	gallium: use "ull" number suffix to keep the QtCreator parser happy It can't parse "llu". Reviewed-by: Thomas Helland <thomashelland90@gmail.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com>	2017-07-10 22:44:48 +02:00
Dave Airlie	edf2acbeb1	radv: add support for using addrlib max alignment. Rather than using 64k, use what addrlib returns as the base alignment for vulkan allocations. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-07-09 22:17:59 +01:00
Marek Olšák	0591df025b	winsys/amdgpu: use 128KB BOs for suballocations of up to 64KB BOs This decreases the number of BOs, but might also increase memory usage. It's better for small textures. The gameplay is on the far right: https://people.freedesktop.org/~mareko/suballoc.svg Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-04 15:40:37 +02:00
Marek Olšák	91f72975ac	gallium/radeon: add radeon_winsys::buffer_is_suballocated Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-04 15:40:37 +02:00
Marek Olšák	0f13451da3	gallium/radeon: clean up pb_cache bucket/usage determination Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-04 15:40:37 +02:00
Marek Olšák	d4fac1e1d7	gallium/radeon: enable suballocations for VRAM with no CPU access Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-04 15:40:37 +02:00
Marek Olšák	64e5577cac	gallium/radeon: clean up (domain, flags) <-> (slab heap) translations This is cleaner, and we are down to 4 slabs. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-04 15:40:37 +02:00
Marek Olšák	b09a22ad21	gallium/radeon: remove RADEON_FLAG_CPU_ACCESS https://lists.freedesktop.org/archives/amd-gfx/2017-June/010591.html Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-04 15:40:37 +02:00
Marek Olšák	03c5ef195d	gallium/radeon: disallow exports of sparse and suballocated BOs I think it's unsafe, because the slabs can reuse exported storage. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-04 15:40:37 +02:00
Marek Olšák	7525c3e123	gallium/radeon: rename RADEON_FLAG_HANDLE -> RADEON_FLAG_NO_SUBALLOC Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-04 15:40:37 +02:00
Marek Olšák	5b373629fc	radeonsi: add a HUD query for getting an average GFX BO list size Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-04 15:40:37 +02:00
Marek Olšák	a98a04ec80	gallium/radeon: pass create_screen flags to r600_common_screen_init Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-06-23 19:50:20 +02:00
Marek Olšák	58af1f6bb0	winsys/amdgpu: fix a deadlock when waiting for submission_in_progress First this happens: 1) amdgpu_cs_flush (lock bo_fence_lock) -> amdgpu_add_fence_dependency -> os_wait_until_zero (wait for submission_in_progress) - WAITING 2) amdgpu_bo_create -> pb_cache_reclaim_buffer (lock pb_cache::mutex) -> pb_cache_is_buffer_compat -> amdgpu_bo_wait (lock bo_fence_lock) - WAITING So both bo_fence_lock and pb_cache::mutex are held. amdgpu_bo_create can't continue. amdgpu_cs_flush is waiting for the CS ioctl to finish the job, but the CS ioctl is trying to release a buffer: 3) amdgpu_cs_submit_ib (CS thread - job entrypoint) -> amdgpu_cs_context_cleanup -> pb_reference -> pb_destroy -> amdgpu_bo_destroy_or_cache -> pb_cache_add_buffer (lock pb_cache::mutex) - DEADLOCK The simple solution is not to wait for submission_in_progress, which we need in order to create the list of dependencies for the CS ioctl. Instead of building the list of dependencies as a direct input to the CS ioctl, build the list of dependencies as a list of fences, and make the final list of dependencies in the CS thread itself. Therefore, amdgpu_cs_flush doesn't have to wait and can continue. Then, amdgpu_bo_create can continue and return. And then amdgpu_cs_submit_ib can continue. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=101294 Cc: 17.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-06-20 12:53:46 +02:00
Samuel Li	c705caaff9	radeonsi: Use libdrm to get chipset name v2: Add a func pointer to radeon_winsys to support radeon later. Change-Id: I614ea71424f9e5c97e4ae68654315d28c89eaa5f Signed-off-by: Samuel Li <Samuel.Li@amd.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2017-06-07 21:53:36 +02:00
Marek Olšák	89b6c93ae3	util/u_queue: add an option to set the minimum thread priority Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-06-07 18:43:42 +02:00
Leo Liu	7ecc244b14	winsys/amdgpu: add vcn dec cs support Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2017-05-25 11:40:20 -04:00
Christian König	5318870f54	winsys/amdgpu: align VA allocations to fragment size v2 BOs larger than the minimum fragment size should have their VA alignet to at least the fragment size for optimal performance. v2: drop unused leftover from initial implementation Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-05-24 10:32:19 +02:00
Marek Olšák	0781b58b3a	gallium/radeon: pipe AMDGPU_INFO_NUM_VRAM_CPU_PAGE_FAULTS into gallium HUD Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-05-23 23:29:16 +02:00
Nicolai Hähnle	98a2492290	ac_surface: use radeon_info from ac_gpu_info Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-05-18 11:48:52 +02:00
Nicolai Hähnle	988c866212	ac/radeonsi: move radeon_info initialization to amd/common v2: update Android.common.mk (Emil) Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-05-18 11:48:52 +02:00
Nicolai Hähnle	4d6e75776d	ac/radeonsi: move some aspects of sanity checking to ac_surface Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-05-18 11:48:52 +02:00
Nicolai Hähnle	00f466bad9	ac/radeonsi: add ac_compute_surface to automatically switch gfx6 vs. gfx9 Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-05-18 11:48:52 +02:00
Nicolai Hähnle	8aabed64c3	ac/radeonsi: move the bulk of gfx9_surface_init to ac_surface We can now merge the two *_surface_init functions. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-05-18 11:48:51 +02:00
Nicolai Hähnle	db77cd879b	ac/radeonsi: move the bulk of gfx6_surface_init to ac_surface Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-05-18 11:48:51 +02:00
Nicolai Hähnle	f187a49322	ac/radeonsi: move amdgpu_addr_create to ac_surface v2: - update Android.common.mk (Emil) - rebase on top of Raven support Reviewed-by: Marek Olšák <marek.olsak@amd.com> (v1)	2017-05-18 11:48:51 +02:00
Marek Olšák	7622181cad	radeonsi/gfx9: add support for Raven Cc: 17.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-05-15 13:00:26 +02:00
Rob Herring	26aee6f4d5	Android: rework LLVM build support Currently, building with "mmma external/mesa3d" which builds all targets and dependencies is broken for targets that require LLVM. This is due to the build settings depending on MESA_ENABLE_LLVM. Instead of using a conditional in the global Android.common.mk, make all the components that need LLVM explicitly include the necessary build settings. GALLIVM_CPP_SOURCES doesn't exist anymore, so remove that as well. Signed-off-by: Rob Herring <robh@kernel.org> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-05-11 13:52:21 +01:00
Rob Herring	3f097396a1	Android: push driver build details to driver makefiles src/gallium/targets/dri/Android.mk contains lots of conditional for individual drivers. Let's move these details into the individual driver makefiles. In the process, align the make driver conditionals with automake (i.e. HAVE_GALLIUM_*). Signed-off-by: Rob Herring <robh@kernel.org> [Emil Velikov: add the radeon winsys for radeonsi] Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2017-05-11 13:52:21 +01:00
Rob Herring	1082501979	Android: amd: use exported include dirs instead of explicit includes Add exported include paths rather than explicitly adding the includes in each user of the common AMD libs. Signed-off-by: Rob Herring <robh@kernel.org> Reviewed-by: Chih-Wei Huang <cwhuang@linux.org.tw> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-05-11 13:52:21 +01:00
Marek Olšák	69e6eab653	winsys/amdgpu: fix Polaris12 (RX 550) breakage reported by Greg White. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100892 Cc: 17.1 <mesa-stable@lists.freedesktop.org>	2017-05-05 01:21:32 +02:00
Samuel Pitoiset	84ed2e1192	winsys/amdgpu: init buffer_indices_hashlist with memset() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-17 11:59:17 +02:00
Samuel Pitoiset	af612816bc	winsys/amdgpu: simplify amdgpu_cs_add_buffer() a bit Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-17 11:59:17 +02:00
Samuel Pitoiset	5bcfe90501	gallium/radeon: add HUD queries for GPU temperature and clocks Only the Radeon kernel driver exposed the GPU temperature and the shader/memory clocks, this implements the same functionality for the AMDGPU kernel driver. These queries will return 0 if the DRM version is less than 3.10, I don't explicitely check the version here because the query codepath is already a bit messy. v2: - rebase on top of master Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-10 23:06:19 +02:00
Nicolai Hähnle	47e59a7e36	winsys/amdgpu: sparse buffer debugging helpers Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-05 10:37:19 +02:00
Nicolai Hähnle	0baee15596	winsys/amdgpu: take fences when freeing a backing buffer We never add fences to backing buffers during submit. When we free a backing buffer, it must inherit the sparse buffer's fences, so that it doesn't get re-used prematurely via the cache. v2: - remove pipe_mutex_* Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-05 10:37:18 +02:00
Nicolai Hähnle	79dae12b41	winsys/amdgpu: add sparse buffers to CS ... and implement the corresponding fence handling. v2: - add missing bit in amdgpu_bo_is_referenced_by_cs_with_usage - remove pipe_mutex_* Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-05 10:37:18 +02:00
Nicolai Hähnle	667da4eaed	winsys/amdgpu: sparse buffer creation / destruction / commitment This is the bulk of the buffer allocation logic. It is fairly simple and stupid. We'll probably want to use e.g. interval trees at some point to keep track of commitments, but Mesa doesn't have an implementation of those yet. v2: - remove pipe_mutex_* - fix total_backing_pages accounting - simplify by using the new VA_OP_CLEAR/REPLACE kernel interface Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-05 10:37:18 +02:00
Nicolai Hähnle	e348248647	winsys/amdgpu: add sparse buffer data structures v2: - remove pipe_mutex_* - use a simple page commitment array Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-05 10:37:18 +02:00
Nicolai Hähnle	f3e514361c	winsys/amdgpu: extend amdgpu_add_fence to allow adding multiple fences Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-05 10:37:18 +02:00
Nicolai Hähnle	ae4f442304	winsys/amdgpu: build handles and flags list late on submit thread This probably has only minor performance effects, but it simplifies some subsequent code slightly. Ideally, it could also be used to simplify the handling of slab buffers in the same way, but unfortunately that's not possible as long as we need indices for relocations. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-05 10:37:17 +02:00
Nicolai Hähnle	0e476f6c03	winsys/amdgpu: share common code in amdgpu_add_fence_dependencies Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-05 10:37:17 +02:00
Nicolai Hähnle	1c125fdef0	winsys/amdgpu: extract amdgpu_do_add_real_buffer We will use it for delayed adding of sparse buffers' backing buffers. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-05 10:37:17 +02:00
Marek Olšák	6ab2042761	radeonsi/gfx9: fix and enable single-sample CMASK fast clear Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-31 21:41:57 +02:00
Marek Olšák	d4bb4583b0	radeonsi/gfx9: fix and enable MSAA compression Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-31 21:41:57 +02:00
Marek Olšák	35aaccaf81	radeonsi/gfx9: fix linear mipmap CPU access Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-31 21:41:57 +02:00
Samuel Pitoiset	7d99f48b5e	winsys/amdgpu: remove AMDGPU_INFO_NUM_EVICTIONS This is now exposed with libdrm_amdgpu 2.4.76. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 15:27:13 +02:00
Leo Liu	6c7870fee8	winsys/surface: add height pitch for gfx9 Signed-off-by: Leo Liu <leo.liu@amd.com> Acked-by: Alex Deucher <alexander.deucher@amd.com>	2017-03-30 14:44:33 +02:00
Marek Olšák	f4ab7a5415	winsys/amdgpu: set/get BO tiling flags for GFX9 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 14:44:33 +02:00
Marek Olšák	bd1da6b339	radeonsi/gfx9: add radeon_surf.gfx9.surf_offset Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 14:44:33 +02:00
Marek Olšák	566defad13	radeonsi/gfx9: add a workaround for 1D depth textures The same workaround is used by Vulkan. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 14:44:33 +02:00
Marek Olšák	b25d7c2cbf	gallium/radeon: move pre-GFX9 radeon_bo_metadata.* to u.legacy.* Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 14:44:33 +02:00
Marek Olšák	9b365d497a	winsys/amdgpu: set num_tile_pipes, pipe_interleave_bytes for GFX9 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 14:44:33 +02:00
Marek Olšák	493de7f935	winsys/amdgpu: wire up new addrlib for GFX9 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 14:44:33 +02:00
Marek Olšák	e572835fea	winsys/amdgpu: update amdgpu_addr_create for GFX9 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 14:44:33 +02:00
Marek Olšák	a71139470c	winsys/amdgpu: rename GFX6 surface functions Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 14:44:33 +02:00
Marek Olšák	ba2e7c68ce	gallium/radeon: move pre-GFX9 radeon_surf.* members to radeon_surf.u.legacy.* Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 14:44:33 +02:00
Marek Olšák	9338ab0afd	radeonsi/gfx9: set the LLVM processor, require LLVM 5.0 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 14:44:33 +02:00
Marek Olšák	68d6d097f1	radeonsi/gfx9: add GFX9 and VEGA10 enums Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 14:44:33 +02:00
Xavi Zhang	3614999878	amdgpu/addrlib: Rewrite tile mode optmization code Note: remove reference to degrade4Space and use opt4Space instead.	2017-03-30 14:44:33 +02:00
Emil Velikov	858170e8a4	winsys/amdgpu: use drmGetDevice2 API Analogous to previous commit Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=98502 Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com> Tested-by: Mike Lothian <mike@fireburn.co.uk>	2017-03-15 11:37:58 +00:00
Timothy Arceri	628e84a58f	gallium/util: replace pipe_mutex_unlock() with mtx_unlock() pipe_mutex_unlock() was made unnecessary with `fd33a6bcd7`. Replaced using: find ./src -type f -exec sed -i -- \ 's:pipe_mutex_unlock(\([^)]*\)):mtx_unlock(\&\1):g' {} \; Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-03-07 08:53:05 +11:00
Timothy Arceri	ba72554f3e	gallium/util: replace pipe_mutex_lock() with mtx_lock() replace pipe_mutex_lock() was made unnecessary with `fd33a6bcd7`. Replaced using: find ./src -type f -exec sed -i -- \ 's:pipe_mutex_lock(\([^)]*\)):mtx_lock(\&\1):g' {} \; Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-03-07 08:52:38 +11:00
Timothy Arceri	be188289e1	gallium/util: replace pipe_mutex_destroy() with mtx_destroy() pipe_mutex_destroy() was made unnecessary with `fd33a6bcd7`. Replace was done with: find ./src -type f -exec sed -i -- \ 's:pipe_mutex_destroy(\([^)]*\)):mtx_destroy(\&\1):g' {} \; Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-03-07 08:52:16 +11:00
Timothy Arceri	75b47dda0c	gallium/util: replace pipe_mutex_init() with mtx_init() pipe_mutex_init() was made unnecessary with `fd33a6bcd7`. Replace was done using: find ./src -type f -exec sed -i -- \ 's:pipe_mutex_init(\([^)]*\)):(void) mtx_init(\&\1, mtx_plain):g' {} \; Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-03-07 08:52:07 +11:00
Timothy Arceri	acdcaf9be4	gallium/util: remove pipe_static_mutex() This was made unnecessary with `fd33a6bcd7`. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-03-07 08:48:16 +11:00
Timothy Arceri	2efddc63ee	gallium/util: replace pipe_mutex with mtx_t pipe_mutex was made unnecessary with `fd33a6bcd7`. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-03-07 08:48:11 +11:00
Marek Olšák	7e1faa79d3	radeonsi: drop support for LLVM 3.6 & 3.7 They are too old. Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-03-06 14:13:04 +01:00
Marek Olšák	24847dd1b5	gallium/u_queue: isolate util_queue_fence implementation it's cleaner this way. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-02-22 20:26:39 +01:00
Nicolai Hähnle	550125e1e7	winsys/amdgpu: reduce max_alloc_size based on GTT limits Allocating huge buffers in VRAM is not a problem, but when those buffers start being migrated, the kernel runs into errors because it cannot split those buffer up for moving through GTT. This should fix intermittent failures of GL45-CTS.texture_buffer.texture_buffer_max_size Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-02-21 10:43:38 +01:00
Marek Olšák	6b73aafceb	radeonsi: use a clever alignment for constant buffer uploads This results in a very tiny decrease in lgkm wait cycles. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-02-18 01:22:08 +01:00
Marek Olšák	d1fae627fa	gallium/radeon: add a HUD query for monitoring the CS thread activity Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-02-15 14:35:52 +01:00
Samuel Pitoiset	af303abcdb	winsys/amdgpu: avoid potential segfault in amdgpu_bo_map() cs can be NULL when it comes from r600_buffer_map_sync_with_rings() to avoid doing the same checks. It was checked for write mappings but not for read mappings. Cc: "17.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-02-03 12:07:14 +01:00
Marek Olšák	2fc5fe0e85	winsys/amdgpu: add a fast exit path into amdgpu_cs_add_buffer The time spent in the function dropped by 37% for torcs. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-30 13:57:09 +01:00
Samuel Pitoiset	86eb52adad	winsys/amdgpu: do not iterate twice when adding fence dependencies The perf difference is very small, 3.25->2.84% in amdgpu_cs_flush() in the DXMD benchmark. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-01-30 13:44:25 +01:00
Samuel Pitoiset	5a6b1aadea	winsys/amdgpu: add one likely() call in amdgpu_cs_flush() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-01-30 13:44:19 +01:00
Marek Olšák	9327780da6	winsys/amdgpu: fix ADDR_REGISTER_VALUE::backendDisables This would be a fix if the value was used anywhere. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-30 13:27:14 +01:00
Samuel Pitoiset	eca96ea308	gallium/radeon: add VRAM-vis-usage HUD query This new query returns the current visible usage of VRAM accessed by the CPU. It will return 0 on radeon because it's unimplemented. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Edmondo Tommasina <edmondo.tommasina@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-01-26 19:40:52 +01:00
Samuel Pitoiset	9f087e1c7c	gallium/radeon: query the CPU accessible size of VRAM R600_DEBUG="info" can be used to display that size, as well as the total amount of VRAM/GTT. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Edmondo Tommasina <edmondo.tommasina@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-01-26 19:40:14 +01:00
Samuel Pitoiset	cff199ceb7	gallium/radeon: add a new HUD query for the number of mapped buffers Useful when debugging applications which map a ton of buffers and also because we used to run into Linux's limit on the number of simultaneous mmap() calls. v2: - update the commit message Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-25 15:19:21 +01:00
Marek Olšák	e248390e93	winsys/amdgpu: drop all IBs if at least one was rejected within the context The corruption is inevitable and hangs are possible too. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-23 23:43:38 +01:00
Marek Olšák	1840800860	winsys/amdgpu: report a rejected IB as a lost context Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-23 23:43:38 +01:00
Marek Olšák	b7699ce07c	winsys/amdgpu: fix a race condition between fence updates and IB submissions The CS thread is needed to ensure proper ordering of operations and can't be disabled (without complicating the code). Discovered by Nine CSMT, which ended up in a deadlock. Acked-by: Edward O'Callaghan <funfunctor@folklore1984.net> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-06 21:05:48 +01:00
Marek Olšák	2b621c47aa	gallium/radeon: add new HUD query num-SDMA-IBs Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-06 21:05:48 +01:00
Marek Olšák	6b8a371e00	gallium/radeon: rename the num-ctx-flushes query to num-GFX-IBs Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-06 21:05:48 +01:00
Junwei Zhang	018ead4266	radeonsi: add Polaris12 support (v3) v2: use gfxip names for llvm 4.0+ v3: use tonga for llvm <= 3.8, drop gfxip name, we can just change that we change the other asics. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Junwei Zhang <Jerry.Zhang@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2016-12-21 15:10:03 -05:00
Marek Olšák	79a8e674ae	winsys/amdgpu: set addrlib flag opt4Space Tested-by: Edmondo Tommasina <edmondo.tommasina@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-11-21 21:44:35 +01:00
Marek Olšák	49fa4a4e60	gallium/radeon: add RADEON_SURF_OPTIMIZE_FOR_SPACE FORCE_TILING should disable it. It has no effect now, but that may change soon. Tested-by: Edmondo Tommasina <edmondo.tommasina@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-11-21 21:44:35 +01:00
Marek Olšák	bf4d102ea3	gallium/radeon: add radeon_surf::is_linear Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-11-01 22:33:13 +01:00
Marek Olšák	e9c76eeeaa	gallium/radeon: remove radeon_surf_level::pitch_bytes Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-11-01 22:33:13 +01:00
Marek Olšák	692f2640ab	gallium/radeon: replace radeon_surf_info::dcc_enabled with num_dcc_levels Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-11-01 22:33:13 +01:00
Marek Olšák	d18bf0b944	gallium/radeon: don't force the same tiling parameters for FMASK GCN can use a completely different tile mode for FMASK. FMASK allocation now skips one unrelated amdgpu_surface_init codepath as hinted by the assertion. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-26 13:02:58 +02:00
Marek Olšák	ecf045b4f7	winsys/amdgpu: allocate FMASK properly I expect no change in behavior, because r600_texture.c forces the same tile mode as the base texture has. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-26 13:02:58 +02:00
Marek Olšák	2a2e537577	gallium/radeon: rename bo_size -> surf_size, bo_alignment -> surf_alignment these names were misleading. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-26 13:02:58 +02:00
Marek Olšák	7e73ff87c0	gallium/radeon: remove unnecessary fields from radeon_surf_level Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-26 13:02:58 +02:00
Marek Olšák	e9590d9092	gallium/radeon: pass pipe_resource and other params to surface_init directly This removes input-only parameters from the radeon_surf structure. Some of the translation logic from pipe_resource to radeon_surf is moved to winsys/radeon. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-26 13:02:58 +02:00
Marek Olšák	b0d8a717a7	winsys/amdgpu: remove unused definitions Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-26 13:02:58 +02:00
Marek Olšák	81a95946da	gallium/radeon: fold radeon_winsys::surface_best into radeon/winsys Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-26 13:02:58 +02:00
Marek Olšák	6ec3b2a4b1	winsys/amdgpu: fix radeon_surf::macro_tile_index for imported textures Maybe this is why SDMA has been broken for many amdgpu users? SDMA is the only block which is used with imported textures and relies on this variable. DB also uses it, but it doesn't get imported textures, so it's unaffected. I do get SDMA failures on Tonga before this patch if R600_DEBUG=testdma is changed to use imported textures. Cc: 11.2 12.0 13.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-26 13:02:58 +02:00
Marek Olšák	d4d9ec55c5	radeonsi: implement TC-compatible HTILE so that decompress blits aren't needed and depth texturing needs less memory bandwidth. Z16 and Z24 are promoted to Z32_FLOAT by the driver, because TC-compatible HTILE only supports Z32_FLOAT. This doubles memory footprint for Z16. The format promotion is not visible to state trackers. This is part of TC-compatible renderbuffer compression, which has 3 parts: DCC, HTILE, FMASK. Only TC-compatible FMASK compression is missing now. I don't see a measurable increase in performance though. (I tested Talos Principle and DiRT: Showdown, the latter is improved by 0.5%, which is almost noise, and it originally used layered Z16, so at least we know that Z16 promoted to Z32F isn't slower now) Tested-by: Edmondo Tommasina <edmondo.tommasina@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-13 19:00:51 +02:00
Marek Olšák	d7e74b52bb	winsys/amdgpu: fix infinite loop w/ RADEON_NOOP=1 caused by unsubmitted fences Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-12 18:29:40 +02:00
Marek Olšák	844f8268e1	gallium/radeon/winsyses: set reasonable max_alloc_size which is returned for GL_MAX_TEXTURE_BUFFER_SIZE. It doesn't have any other use at the moment. Bigger allocations are not rejected. This fixes GL45-CTS.texture_buffer.texture_buffer_max_size on Bonaire. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-05 21:03:54 +02:00
Nicolai Hähnle	de84e99e45	gallium/radeon/winsyses: add radeon_winsys::min_alloc_size Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-29 11:24:52 +02:00
Nicolai Hähnle	4421c0fb0d	gallium/radeon/winsyses: reduce the number of pb_cache buckets Small buffers are now handled via the slabs code, so separate buckets in pb_cache have become redundant. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-27 16:45:41 +02:00
Nicolai Hähnle	ffa1c669dd	winsys/amdgpu: enable buffer allocation from slabs Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-27 16:45:23 +02:00
Nicolai Hähnle	a3832590c6	winsys/amdgpu: add fence and buffer list logic for slab allocated buffers Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-27 16:45:20 +02:00
Nicolai Hähnle	a987e4377a	winsys/amdgpu: add slab entry structures to amdgpu_winsys_bo Already adjust amdgpu_bo_map/unmap accordingly. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-27 16:45:15 +02:00
Nicolai Hähnle	5af9eef719	winsys/amdgpu: do not synchronize unsynchronized buffers When a buffer is added to a CS without the SYNCHRONIZED usage flag, we now no longer add a dependency on the buffer's fence(s). However, we still need to add a fence to the buffer during flush, so that cache reclaim works correctly (and in the hypothetical case that the buffer is later added to a CS _with_ the SYNCHRONIZED flag). It is now possible that the submissions refererring to a buffer are no longer linearly ordered, and so we may have to keep multiple fences around. We keep the fences in a FIFO. It should usually stay quite short (# of contexts * 2, for gfx + dma rings). While we're at it, extract amdgpu_add_fence_dependency for a single buffer, which will make adding the distinction between real buffer and slab cases easier. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-27 16:45:11 +02:00
Nicolai Hähnle	6d89a40676	gallium/radeon: add RADEON_FLAG_HANDLE When passed to winsys->buffer_create, this flag will indicate that we require a buffer that maps 1:1 with a kernel buffer handle. This is currently set for all textures, since textures can potentially be exported to other processes. This is not a huge loss, since the main purpose of this patch series is to deal with applications that allocate many small buffers. A hypothetical application with tons of tiny textures might still benefit from not setting this flag, but that's not a use case I'm worried about just now. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-27 16:45:05 +02:00
Marek Olšák	35d284d08e	winsys/amdgpu: don't assume GTT if the VRAM flag isn't set Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-09-13 20:38:25 +02:00
Mauro Rossi	6b9d7e69ee	android: add support for libmesa_amdgpu_addrlib Android porting of the following commits: `f1f1ba3` "radeonsi: move sid.h/r600d_common.h to a common place." `69fca64` "amd/addrlib: move addrlib from amdgpu winsys to common code" This patch fixes android building errors Reviewed-by: Dave Airlie <airlied@redhat.com>	2016-09-13 10:06:04 +10:00
Nicolai Hähnle	17fff0c2de	winsys/amdgpu: remove amdgpu_cs_lookup_buffer The radeonsi driver doesn't and shouldn't care about the buffer index. Only the virtual addresses matter. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-12 13:55:47 +02:00
Nicolai Hähnle	12657a7abf	winsys/amdgpu: remove unused field domains from amdgpu_cs_buffer Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-12 13:55:07 +02:00
Nicolai Hähnle	3cdeb2a177	winsys/amdgpu: remove initial buffer list allocation It's really not necessary. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-12 13:55:04 +02:00
Nicolai Hähnle	cc53dfda9f	winsys/amdgpu: extract adding a new buffer list entry into its own function While at it, try to be a little more robust in the face of memory allocation failure. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-12 13:55:01 +02:00
Nicolai Hähnle	11cbf4d7ae	winsys/amdgpu: use only one fence per BO The fence that is added to the BO during flush is guaranteed to be signaled after all the fences that were in the fences array of the BO before the flush, because those fences are added as dependencies for the submission (and all this happens atomically under the bo_fence_lock). Therefore, keeping only the last fence around is sufficient. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-12 13:54:59 +02:00
Nicolai Hähnle	480ac143df	winsys/amdgpu: add do_winsys_deinit function The idea is to have matching init/deinit functions so that deinit can be re-used for cleanup in the error path of amdgpu_winsys_create. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-12 13:54:56 +02:00
Nicolai Hähnle	9fb8d354ca	winsys/amdgpu: clean up error paths in amdgpu_winsys_create No need to call pb_cache_deinit, because the cache hasn't been initialized at that point. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-12 13:54:53 +02:00
Nicolai Hähnle	339867c077	gallium/radeon/winsyses: remove #includes of pb_bufmgr.h Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-12 13:54:36 +02:00
Marek Olšák	f9750932ea	winsys/amdgpu: replace OUT_CS with radeon_emit Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-09-09 22:45:06 +02:00
Marek Olšák	53d74e055e	gallium/radeon/winsyses: fix counting mapped memory Not all buffers are unmapped explicitly. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-09-07 11:13:13 +02:00
Dave Airlie	69fca64259	amd/addrlib: move addrlib from amdgpu winsys to common code Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-09-06 10:06:33 +10:00
Dave Airlie	f1f1ba3781	radeonsi: move sid.h/r600d_common.h to a common place. Step one to merging radv would be to move some files around. This only adds the include path to r600/radeonsi, because later we want to avoid having to add it to the generic target paths. Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-09-06 10:05:13 +10:00
Marek Olšák	281f1a5980	winsys/amdgpu: disable IB chaining on SI Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-08-26 15:50:10 +02:00
Marek Olšák	a6869e7c06	winsys/amdgpu: finish up SI addrlib integration Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net>	2016-08-26 15:50:10 +02:00
Ronie Salgado	97b55243fb	winsys/amdgpu: initial SI support Signed-off-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net>	2016-08-26 15:50:10 +02:00
Marek Olšák	971ef7518f	gallium/radeon: add a driver query for AMDGPU_INFO_NUM_EVICTIONS If the kernel driver doesn't support it, it returns 0. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-08-26 15:50:10 +02:00
Marek Olšák	1e04483c22	winsys/amdgpu: track the amount of mapped memory Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-10 01:11:10 +02:00
Marek Olšák	8276776e64	winsys/amdgpu: don't try to unmap userptr buffers no app calls this AFAIK Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-10 01:11:10 +02:00
Nicolai Hähnle	e0736c438c	winsys/amdgpu: query ME/PFP/CE firmware versions The radeon kernel module doesn't have the firmware query interface, so the corresponding values will remain 0. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-08-08 12:52:41 +02:00
Marek Olšák	63b99590db	winsys/amdgpu: implement cs_get_next_fence Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-06 14:29:30 +02:00
Marek Olšák	c5ff0d3e65	gallium/radeon: move radeon_winsys::cs_memory_below_limit to drivers Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-06 13:56:14 +02:00
Marek Olšák	076db67217	gallium/radeon: inline radeon_winsys::query_memory_usage Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-06 13:56:14 +02:00
Marek Olšák	9646ae7799	gallium/radeon/winsyses: expose per-IB used_vram and used_gart to drivers The following patches will use this. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-06 13:56:14 +02:00
Marek Olšák	1c8f17599e	gallium/radeon/winsyses: print CS submission error number Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-06 13:56:14 +02:00
Marek Olšák	0ab47146c9	winsys/amdgpu: use pb_cache buckets for fewer pb_cache misses Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-07-19 23:45:06 +02:00
Marek Olšák	3cdc0e133f	gallium/pb_cache: divide the cache into buckets for reducing cache misses Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-07-19 23:45:06 +02:00
Rob Clark	44bbfedbd9	gallium/u_queue: add optional cleanup callback Adds a second optional cleanup callback, called after the fence is signaled. This is needed if, for example, the queue has the last reference to the object that embeds the util_queue_fence. In this case we cannot drop the ref in the main callback, since that would result in the fence being destroyed before it is signaled. Signed-off-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-07-16 10:00:04 -04:00
Marek Olšák	85388652f9	winsys/amdgpu: return an error on IB submission failures Reviewed-by: Christian König <christian.koenig@amd.com>	2016-07-14 22:00:54 +02:00
Marek Olšák	a7d84f7731	gallium/radeon: add a return value to cs_flush Required by our UVD code. Reviewed-by: Christian König <christian.koenig@amd.com>	2016-07-14 22:00:54 +02:00
Marek Olšák	ed3912d0da	radeonsi: just save buffer sizes instead of buffers while recording IBs whole buffer objects are not needed Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-07-13 19:46:16 +02:00
Nicolai Hähnle	660cd3de4a	winsys/amdgpu: avoid flushed depth when possible If a depth/stencil texture has no mipmaps, we can always get a layout that is compatible with DB and TC. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-07-06 10:43:52 +02:00
Nicolai Hähnle	7000dfd5c3	gallium/radeon: add depth/stencil_adjusted output to surface computation This fixes a rare bug with stencil texturing -- seen on Polaris and Tonga, though it's basically a function of the memory configuration so could affect other parts as well. Fixes piglit "unaligned-blit * stencil downsample" and various "fbo-depth-array stencil" tests. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-07-06 10:43:52 +02:00
Nicolai Hähnle	19f8d2a843	gallium/radeon/winsyses: remove unused stencil_offset Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-07-06 10:42:49 +02:00
Marek Olšák	8a4ace4a47	gallium/radeon: add and use radeon_info::max_alloc_size (v2) v2: - squashed the patches - use INT_MAX - clamp max_const_buffer_size - check the DRM version in radeon Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Vedran Miletić <vedran@miletic.net>	2016-07-05 00:47:13 +02:00
sonjiang	28f85eab49	radeon uvd add uvd fw version for amdgpu Signed-off-by: sonjiang <sonny.jiang@amd.com> Cc: "12.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2016-06-29 15:30:14 -04:00
Marek Olšák	fa7c927625	radeonsi: always calculate DCC info even if it's not used immediately for a later use Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-29 20:12:00 +02:00
Marek Olšák	1c5a10497a	gallium/radeon/winsyses: boolean -> bool, TRUE -> true, FALSE -> false Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Vedran Miletić <vedran@miletic.net> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-25 23:13:42 +02:00

... 3 4 5 6 7 ...

553 Commits