KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Marek Olšák	59144d4bf5	ac/surface: increment surf_index only when tile swizzle is allowed Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-08-04 02:10:04 +02:00
Nicolai Hähnle	bc7f41e11d	gallium: add pipe_screen_config to screen_create functions This allows a more generic mechanism for passing user configurations into drivers by accessing the dri options directly. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-08-02 09:50:57 +02:00
Marek Olšák	4cae274116	radeonsi: prevent a deadlock in util_queue_add_job with too many GL contexts If the queue is full, util_queue_add_job will wait while bo_fence_lock is held. If pb_slab wants to reuse a buffer, it will lock the pb_slab mutex and try to check BO fence busyness, but it has to wait for bo_fence_lock to get released. Both bo_fence_lock and pb_slab mutex are locked now. When the CS thread unreferences and releases a suballocated buffer, it will try to lock the pb_slab mutex and has to wait. The CS thread can't finish its job in order to free a queue slot and unblock util_queue_add_job ==> deadlock. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 10:57:25 -04:00
Marek Olšák	aaee0d1bbf	gallium: use "ull" number suffix to keep the QtCreator parser happy It can't parse "llu". Reviewed-by: Thomas Helland <thomashelland90@gmail.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com>	2017-07-10 22:44:48 +02:00
Dave Airlie	edf2acbeb1	radv: add support for using addrlib max alignment. Rather than using 64k, use what addrlib returns as the base alignment for vulkan allocations. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-07-09 22:17:59 +01:00
Marek Olšák	0591df025b	winsys/amdgpu: use 128KB BOs for suballocations of up to 64KB BOs This decreases the number of BOs, but might also increase memory usage. It's better for small textures. The gameplay is on the far right: https://people.freedesktop.org/~mareko/suballoc.svg Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-04 15:40:37 +02:00
Marek Olšák	91f72975ac	gallium/radeon: add radeon_winsys::buffer_is_suballocated Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-04 15:40:37 +02:00
Marek Olšák	0f13451da3	gallium/radeon: clean up pb_cache bucket/usage determination Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-04 15:40:37 +02:00
Marek Olšák	d4fac1e1d7	gallium/radeon: enable suballocations for VRAM with no CPU access Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-04 15:40:37 +02:00
Marek Olšák	64e5577cac	gallium/radeon: clean up (domain, flags) <-> (slab heap) translations This is cleaner, and we are down to 4 slabs. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-04 15:40:37 +02:00
Marek Olšák	b09a22ad21	gallium/radeon: remove RADEON_FLAG_CPU_ACCESS https://lists.freedesktop.org/archives/amd-gfx/2017-June/010591.html Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-04 15:40:37 +02:00
Marek Olšák	03c5ef195d	gallium/radeon: disallow exports of sparse and suballocated BOs I think it's unsafe, because the slabs can reuse exported storage. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-04 15:40:37 +02:00
Marek Olšák	7525c3e123	gallium/radeon: rename RADEON_FLAG_HANDLE -> RADEON_FLAG_NO_SUBALLOC Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-04 15:40:37 +02:00
Marek Olšák	5b373629fc	radeonsi: add a HUD query for getting an average GFX BO list size Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-04 15:40:37 +02:00
Marek Olšák	a98a04ec80	gallium/radeon: pass create_screen flags to r600_common_screen_init Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-06-23 19:50:20 +02:00
Marek Olšák	58af1f6bb0	winsys/amdgpu: fix a deadlock when waiting for submission_in_progress First this happens: 1) amdgpu_cs_flush (lock bo_fence_lock) -> amdgpu_add_fence_dependency -> os_wait_until_zero (wait for submission_in_progress) - WAITING 2) amdgpu_bo_create -> pb_cache_reclaim_buffer (lock pb_cache::mutex) -> pb_cache_is_buffer_compat -> amdgpu_bo_wait (lock bo_fence_lock) - WAITING So both bo_fence_lock and pb_cache::mutex are held. amdgpu_bo_create can't continue. amdgpu_cs_flush is waiting for the CS ioctl to finish the job, but the CS ioctl is trying to release a buffer: 3) amdgpu_cs_submit_ib (CS thread - job entrypoint) -> amdgpu_cs_context_cleanup -> pb_reference -> pb_destroy -> amdgpu_bo_destroy_or_cache -> pb_cache_add_buffer (lock pb_cache::mutex) - DEADLOCK The simple solution is not to wait for submission_in_progress, which we need in order to create the list of dependencies for the CS ioctl. Instead of building the list of dependencies as a direct input to the CS ioctl, build the list of dependencies as a list of fences, and make the final list of dependencies in the CS thread itself. Therefore, amdgpu_cs_flush doesn't have to wait and can continue. Then, amdgpu_bo_create can continue and return. And then amdgpu_cs_submit_ib can continue. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=101294 Cc: 17.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-06-20 12:53:46 +02:00
Samuel Li	c705caaff9	radeonsi: Use libdrm to get chipset name v2: Add a func pointer to radeon_winsys to support radeon later. Change-Id: I614ea71424f9e5c97e4ae68654315d28c89eaa5f Signed-off-by: Samuel Li <Samuel.Li@amd.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2017-06-07 21:53:36 +02:00
Marek Olšák	89b6c93ae3	util/u_queue: add an option to set the minimum thread priority Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-06-07 18:43:42 +02:00
Leo Liu	7ecc244b14	winsys/amdgpu: add vcn dec cs support Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2017-05-25 11:40:20 -04:00
Christian König	5318870f54	winsys/amdgpu: align VA allocations to fragment size v2 BOs larger than the minimum fragment size should have their VA aligned to at least the fragment size for optimal performance. v2: drop unused leftover from initial implementation Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-05-24 10:32:19 +02:00
Marek Olšák	0781b58b3a	gallium/radeon: pipe AMDGPU_INFO_NUM_VRAM_CPU_PAGE_FAULTS into gallium HUD Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-05-23 23:29:16 +02:00
Nicolai Hähnle	98a2492290	ac_surface: use radeon_info from ac_gpu_info Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-05-18 11:48:52 +02:00
Nicolai Hähnle	988c866212	ac/radeonsi: move radeon_info initialization to amd/common v2: update Android.common.mk (Emil) Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-05-18 11:48:52 +02:00
Nicolai Hähnle	4d6e75776d	ac/radeonsi: move some aspects of sanity checking to ac_surface Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-05-18 11:48:52 +02:00
Nicolai Hähnle	00f466bad9	ac/radeonsi: add ac_compute_surface to automatically switch gfx6 vs. gfx9 Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-05-18 11:48:52 +02:00
Nicolai Hähnle	8aabed64c3	ac/radeonsi: move the bulk of gfx9_surface_init to ac_surface We can now merge the two *_surface_init functions. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-05-18 11:48:51 +02:00
Nicolai Hähnle	db77cd879b	ac/radeonsi: move the bulk of gfx6_surface_init to ac_surface Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-05-18 11:48:51 +02:00
Nicolai Hähnle	f187a49322	ac/radeonsi: move amdgpu_addr_create to ac_surface v2: - update Android.common.mk (Emil) - rebase on top of Raven support Reviewed-by: Marek Olšák <marek.olsak@amd.com> (v1)	2017-05-18 11:48:51 +02:00
Marek Olšák	7622181cad	radeonsi/gfx9: add support for Raven Cc: 17.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-05-15 13:00:26 +02:00
Rob Herring	26aee6f4d5	Android: rework LLVM build support Currently, building with "mmma external/mesa3d" which builds all targets and dependencies is broken for targets that require LLVM. This is due to the build settings depending on MESA_ENABLE_LLVM. Instead of using a conditional in the global Android.common.mk, make all the components that need LLVM explicitly include the necessary build settings. GALLIVM_CPP_SOURCES doesn't exist anymore, so remove that as well. Signed-off-by: Rob Herring <robh@kernel.org> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-05-11 13:52:21 +01:00
Rob Herring	3f097396a1	Android: push driver build details to driver makefiles src/gallium/targets/dri/Android.mk contains lots of conditionals for individual drivers. Let's move these details into the individual driver makefiles. In the process, align the make driver conditionals with automake (i.e. HAVE_GALLIUM_*). Signed-off-by: Rob Herring <robh@kernel.org> [Emil Velikov: add the radeon winsys for radeonsi] Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2017-05-11 13:52:21 +01:00
Rob Herring	1082501979	Android: amd: use exported include dirs instead of explicit includes Add exported include paths rather than explicitly adding the includes in each user of the common AMD libs. Signed-off-by: Rob Herring <robh@kernel.org> Reviewed-by: Chih-Wei Huang <cwhuang@linux.org.tw> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-05-11 13:52:21 +01:00
Marek Olšák	69e6eab653	winsys/amdgpu: fix Polaris12 (RX 550) breakage reported by Greg White. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100892 Cc: 17.1 <mesa-stable@lists.freedesktop.org>	2017-05-05 01:21:32 +02:00
Samuel Pitoiset	84ed2e1192	winsys/amdgpu: init buffer_indices_hashlist with memset() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-17 11:59:17 +02:00
Samuel Pitoiset	af612816bc	winsys/amdgpu: simplify amdgpu_cs_add_buffer() a bit Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-17 11:59:17 +02:00
Samuel Pitoiset	5bcfe90501	gallium/radeon: add HUD queries for GPU temperature and clocks Only the Radeon kernel driver exposed the GPU temperature and the shader/memory clocks; this implements the same functionality for the AMDGPU kernel driver. These queries will return 0 if the DRM version is less than 3.10. I don't explicitly check the version here because the query codepath is already a bit messy. v2: - rebase on top of master Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-10 23:06:19 +02:00
Nicolai Hähnle	47e59a7e36	winsys/amdgpu: sparse buffer debugging helpers Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-05 10:37:19 +02:00
Nicolai Hähnle	0baee15596	winsys/amdgpu: take fences when freeing a backing buffer We never add fences to backing buffers during submit. When we free a backing buffer, it must inherit the sparse buffer's fences, so that it doesn't get re-used prematurely via the cache. v2: - remove pipe_mutex_* Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-05 10:37:18 +02:00
Nicolai Hähnle	79dae12b41	winsys/amdgpu: add sparse buffers to CS ... and implement the corresponding fence handling. v2: - add missing bit in amdgpu_bo_is_referenced_by_cs_with_usage - remove pipe_mutex_* Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-05 10:37:18 +02:00
Nicolai Hähnle	667da4eaed	winsys/amdgpu: sparse buffer creation / destruction / commitment This is the bulk of the buffer allocation logic. It is fairly simple and stupid. We'll probably want to use e.g. interval trees at some point to keep track of commitments, but Mesa doesn't have an implementation of those yet. v2: - remove pipe_mutex_* - fix total_backing_pages accounting - simplify by using the new VA_OP_CLEAR/REPLACE kernel interface Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-05 10:37:18 +02:00
Nicolai Hähnle	e348248647	winsys/amdgpu: add sparse buffer data structures v2: - remove pipe_mutex_* - use a simple page commitment array Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-05 10:37:18 +02:00
Nicolai Hähnle	f3e514361c	winsys/amdgpu: extend amdgpu_add_fence to allow adding multiple fences Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-05 10:37:18 +02:00
Nicolai Hähnle	ae4f442304	winsys/amdgpu: build handles and flags list late on submit thread This probably has only minor performance effects, but it simplifies some subsequent code slightly. Ideally, it could also be used to simplify the handling of slab buffers in the same way, but unfortunately that's not possible as long as we need indices for relocations. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-05 10:37:17 +02:00
Nicolai Hähnle	0e476f6c03	winsys/amdgpu: share common code in amdgpu_add_fence_dependencies Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-05 10:37:17 +02:00
Nicolai Hähnle	1c125fdef0	winsys/amdgpu: extract amdgpu_do_add_real_buffer We will use it for delayed adding of sparse buffers' backing buffers. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-05 10:37:17 +02:00
Marek Olšák	6ab2042761	radeonsi/gfx9: fix and enable single-sample CMASK fast clear Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-31 21:41:57 +02:00
Marek Olšák	d4bb4583b0	radeonsi/gfx9: fix and enable MSAA compression Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-31 21:41:57 +02:00
Marek Olšák	35aaccaf81	radeonsi/gfx9: fix linear mipmap CPU access Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-31 21:41:57 +02:00
Samuel Pitoiset	7d99f48b5e	winsys/amdgpu: remove AMDGPU_INFO_NUM_EVICTIONS This is now exposed with libdrm_amdgpu 2.4.76. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 15:27:13 +02:00
Leo Liu	6c7870fee8	winsys/surface: add height pitch for gfx9 Signed-off-by: Leo Liu <leo.liu@amd.com> Acked-by: Alex Deucher <alexander.deucher@amd.com>	2017-03-30 14:44:33 +02:00
Marek Olšák	f4ab7a5415	winsys/amdgpu: set/get BO tiling flags for GFX9 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 14:44:33 +02:00
Marek Olšák	bd1da6b339	radeonsi/gfx9: add radeon_surf.gfx9.surf_offset Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 14:44:33 +02:00
Marek Olšák	566defad13	radeonsi/gfx9: add a workaround for 1D depth textures The same workaround is used by Vulkan. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 14:44:33 +02:00
Marek Olšák	b25d7c2cbf	gallium/radeon: move pre-GFX9 radeon_bo_metadata.* to u.legacy.* Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 14:44:33 +02:00
Marek Olšák	9b365d497a	winsys/amdgpu: set num_tile_pipes, pipe_interleave_bytes for GFX9 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 14:44:33 +02:00
Marek Olšák	493de7f935	winsys/amdgpu: wire up new addrlib for GFX9 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 14:44:33 +02:00
Marek Olšák	e572835fea	winsys/amdgpu: update amdgpu_addr_create for GFX9 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 14:44:33 +02:00
Marek Olšák	a71139470c	winsys/amdgpu: rename GFX6 surface functions Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 14:44:33 +02:00
Marek Olšák	ba2e7c68ce	gallium/radeon: move pre-GFX9 radeon_surf.* members to radeon_surf.u.legacy.* Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 14:44:33 +02:00
Marek Olšák	9338ab0afd	radeonsi/gfx9: set the LLVM processor, require LLVM 5.0 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 14:44:33 +02:00
Marek Olšák	68d6d097f1	radeonsi/gfx9: add GFX9 and VEGA10 enums Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-03-30 14:44:33 +02:00
Xavi Zhang	3614999878	amdgpu/addrlib: Rewrite tile mode optimization code Note: remove reference to degrade4Space and use opt4Space instead.	2017-03-30 14:44:33 +02:00
Emil Velikov	858170e8a4	winsys/amdgpu: use drmGetDevice2 API Analogous to previous commit Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=98502 Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com> Tested-by: Mike Lothian <mike@fireburn.co.uk>	2017-03-15 11:37:58 +00:00
Timothy Arceri	628e84a58f	gallium/util: replace pipe_mutex_unlock() with mtx_unlock() pipe_mutex_unlock() was made unnecessary with `fd33a6bcd7`. Replaced using: find ./src -type f -exec sed -i -- \ 's:pipe_mutex_unlock(\([^)]*\)):mtx_unlock(\&\1):g' {} \; Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-03-07 08:53:05 +11:00
Timothy Arceri	ba72554f3e	gallium/util: replace pipe_mutex_lock() with mtx_lock() pipe_mutex_lock() was made unnecessary with `fd33a6bcd7`. Replaced using: find ./src -type f -exec sed -i -- \ 's:pipe_mutex_lock(\([^)]*\)):mtx_lock(\&\1):g' {} \; Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-03-07 08:52:38 +11:00
Timothy Arceri	be188289e1	gallium/util: replace pipe_mutex_destroy() with mtx_destroy() pipe_mutex_destroy() was made unnecessary with `fd33a6bcd7`. Replace was done with: find ./src -type f -exec sed -i -- \ 's:pipe_mutex_destroy(\([^)]*\)):mtx_destroy(\&\1):g' {} \; Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-03-07 08:52:16 +11:00
Timothy Arceri	75b47dda0c	gallium/util: replace pipe_mutex_init() with mtx_init() pipe_mutex_init() was made unnecessary with `fd33a6bcd7`. Replace was done using: find ./src -type f -exec sed -i -- \ 's:pipe_mutex_init(\([^)]*\)):(void) mtx_init(\&\1, mtx_plain):g' {} \; Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-03-07 08:52:07 +11:00
Timothy Arceri	acdcaf9be4	gallium/util: remove pipe_static_mutex() This was made unnecessary with `fd33a6bcd7`. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-03-07 08:48:16 +11:00
Timothy Arceri	2efddc63ee	gallium/util: replace pipe_mutex with mtx_t pipe_mutex was made unnecessary with `fd33a6bcd7`. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-03-07 08:48:11 +11:00
Marek Olšák	7e1faa79d3	radeonsi: drop support for LLVM 3.6 & 3.7 They are too old. Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-03-06 14:13:04 +01:00
Marek Olšák	24847dd1b5	gallium/u_queue: isolate util_queue_fence implementation it's cleaner this way. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-02-22 20:26:39 +01:00
Nicolai Hähnle	550125e1e7	winsys/amdgpu: reduce max_alloc_size based on GTT limits Allocating huge buffers in VRAM is not a problem, but when those buffers start being migrated, the kernel runs into errors because it cannot split those buffers up for moving through GTT. This should fix intermittent failures of GL45-CTS.texture_buffer.texture_buffer_max_size Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-02-21 10:43:38 +01:00
Marek Olšák	6b73aafceb	radeonsi: use a clever alignment for constant buffer uploads This results in a very tiny decrease in lgkm wait cycles. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-02-18 01:22:08 +01:00
Marek Olšák	d1fae627fa	gallium/radeon: add a HUD query for monitoring the CS thread activity Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-02-15 14:35:52 +01:00
Samuel Pitoiset	af303abcdb	winsys/amdgpu: avoid potential segfault in amdgpu_bo_map() cs can be NULL when it comes from r600_buffer_map_sync_with_rings() to avoid doing the same checks. It was checked for write mappings but not for read mappings. Cc: "17.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-02-03 12:07:14 +01:00
Marek Olšák	2fc5fe0e85	winsys/amdgpu: add a fast exit path into amdgpu_cs_add_buffer The time spent in the function dropped by 37% for torcs. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-30 13:57:09 +01:00
Samuel Pitoiset	86eb52adad	winsys/amdgpu: do not iterate twice when adding fence dependencies The perf difference is very small, 3.25->2.84% in amdgpu_cs_flush() in the DXMD benchmark. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-01-30 13:44:25 +01:00
Samuel Pitoiset	5a6b1aadea	winsys/amdgpu: add one likely() call in amdgpu_cs_flush() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-01-30 13:44:19 +01:00
Marek Olšák	9327780da6	winsys/amdgpu: fix ADDR_REGISTER_VALUE::backendDisables This would be a fix if the value was used anywhere. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-30 13:27:14 +01:00
Samuel Pitoiset	eca96ea308	gallium/radeon: add VRAM-vis-usage HUD query This new query returns the current visible usage of VRAM accessed by the CPU. It will return 0 on radeon because it's unimplemented. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Edmondo Tommasina <edmondo.tommasina@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-01-26 19:40:52 +01:00
Samuel Pitoiset	9f087e1c7c	gallium/radeon: query the CPU accessible size of VRAM R600_DEBUG="info" can be used to display that size, as well as the total amount of VRAM/GTT. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Edmondo Tommasina <edmondo.tommasina@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-01-26 19:40:14 +01:00
Samuel Pitoiset	cff199ceb7	gallium/radeon: add a new HUD query for the number of mapped buffers Useful when debugging applications which map a ton of buffers and also because we used to run into Linux's limit on the number of simultaneous mmap() calls. v2: - update the commit message Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-25 15:19:21 +01:00
Marek Olšák	e248390e93	winsys/amdgpu: drop all IBs if at least one was rejected within the context The corruption is inevitable and hangs are possible too. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-23 23:43:38 +01:00
Marek Olšák	1840800860	winsys/amdgpu: report a rejected IB as a lost context Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-23 23:43:38 +01:00
Marek Olšák	b7699ce07c	winsys/amdgpu: fix a race condition between fence updates and IB submissions The CS thread is needed to ensure proper ordering of operations and can't be disabled (without complicating the code). Discovered by Nine CSMT, which ended up in a deadlock. Acked-by: Edward O'Callaghan <funfunctor@folklore1984.net> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-06 21:05:48 +01:00
Marek Olšák	2b621c47aa	gallium/radeon: add new HUD query num-SDMA-IBs Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-06 21:05:48 +01:00
Marek Olšák	6b8a371e00	gallium/radeon: rename the num-ctx-flushes query to num-GFX-IBs Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-06 21:05:48 +01:00
Junwei Zhang	018ead4266	radeonsi: add Polaris12 support (v3) v2: use gfxip names for llvm 4.0+ v3: use tonga for llvm <= 3.8, drop gfxip name, we can just change that when we change the other asics. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Junwei Zhang <Jerry.Zhang@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2016-12-21 15:10:03 -05:00
Marek Olšák	79a8e674ae	winsys/amdgpu: set addrlib flag opt4Space Tested-by: Edmondo Tommasina <edmondo.tommasina@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-11-21 21:44:35 +01:00
Marek Olšák	49fa4a4e60	gallium/radeon: add RADEON_SURF_OPTIMIZE_FOR_SPACE FORCE_TILING should disable it. It has no effect now, but that may change soon. Tested-by: Edmondo Tommasina <edmondo.tommasina@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-11-21 21:44:35 +01:00
Marek Olšák	bf4d102ea3	gallium/radeon: add radeon_surf::is_linear Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-11-01 22:33:13 +01:00
Marek Olšák	e9c76eeeaa	gallium/radeon: remove radeon_surf_level::pitch_bytes Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-11-01 22:33:13 +01:00
Marek Olšák	692f2640ab	gallium/radeon: replace radeon_surf_info::dcc_enabled with num_dcc_levels Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-11-01 22:33:13 +01:00
Marek Olšák	d18bf0b944	gallium/radeon: don't force the same tiling parameters for FMASK GCN can use a completely different tile mode for FMASK. FMASK allocation now skips one unrelated amdgpu_surface_init codepath as hinted by the assertion. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-26 13:02:58 +02:00
Marek Olšák	ecf045b4f7	winsys/amdgpu: allocate FMASK properly I expect no change in behavior, because r600_texture.c forces the same tile mode as the base texture has. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-26 13:02:58 +02:00
Marek Olšák	2a2e537577	gallium/radeon: rename bo_size -> surf_size, bo_alignment -> surf_alignment these names were misleading. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-26 13:02:58 +02:00
Marek Olšák	7e73ff87c0	gallium/radeon: remove unnecessary fields from radeon_surf_level Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-26 13:02:58 +02:00
Marek Olšák	e9590d9092	gallium/radeon: pass pipe_resource and other params to surface_init directly This removes input-only parameters from the radeon_surf structure. Some of the translation logic from pipe_resource to radeon_surf is moved to winsys/radeon. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-26 13:02:58 +02:00
Marek Olšák	b0d8a717a7	winsys/amdgpu: remove unused definitions Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-26 13:02:58 +02:00
Marek Olšák	81a95946da	gallium/radeon: fold radeon_winsys::surface_best into radeon/winsys Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-26 13:02:58 +02:00
Marek Olšák	6ec3b2a4b1	winsys/amdgpu: fix radeon_surf::macro_tile_index for imported textures Maybe this is why SDMA has been broken for many amdgpu users? SDMA is the only block which is used with imported textures and relies on this variable. DB also uses it, but it doesn't get imported textures, so it's unaffected. I do get SDMA failures on Tonga before this patch if R600_DEBUG=testdma is changed to use imported textures. Cc: 11.2 12.0 13.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-26 13:02:58 +02:00
Marek Olšák	d4d9ec55c5	radeonsi: implement TC-compatible HTILE so that decompress blits aren't needed and depth texturing needs less memory bandwidth. Z16 and Z24 are promoted to Z32_FLOAT by the driver, because TC-compatible HTILE only supports Z32_FLOAT. This doubles memory footprint for Z16. The format promotion is not visible to state trackers. This is part of TC-compatible renderbuffer compression, which has 3 parts: DCC, HTILE, FMASK. Only TC-compatible FMASK compression is missing now. I don't see a measurable increase in performance though. (I tested Talos Principle and DiRT: Showdown, the latter is improved by 0.5%, which is almost noise, and it originally used layered Z16, so at least we know that Z16 promoted to Z32F isn't slower now) Tested-by: Edmondo Tommasina <edmondo.tommasina@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-13 19:00:51 +02:00
Marek Olšák	d7e74b52bb	winsys/amdgpu: fix infinite loop w/ RADEON_NOOP=1 caused by unsubmitted fences Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-12 18:29:40 +02:00
Marek Olšák	844f8268e1	gallium/radeon/winsyses: set reasonable max_alloc_size which is returned for GL_MAX_TEXTURE_BUFFER_SIZE. It doesn't have any other use at the moment. Bigger allocations are not rejected. This fixes GL45-CTS.texture_buffer.texture_buffer_max_size on Bonaire. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-05 21:03:54 +02:00
Nicolai Hähnle	de84e99e45	gallium/radeon/winsyses: add radeon_winsys::min_alloc_size Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-29 11:24:52 +02:00
Nicolai Hähnle	4421c0fb0d	gallium/radeon/winsyses: reduce the number of pb_cache buckets Small buffers are now handled via the slabs code, so separate buckets in pb_cache have become redundant. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-27 16:45:41 +02:00
Nicolai Hähnle	ffa1c669dd	winsys/amdgpu: enable buffer allocation from slabs Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-27 16:45:23 +02:00
Nicolai Hähnle	a3832590c6	winsys/amdgpu: add fence and buffer list logic for slab allocated buffers Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-27 16:45:20 +02:00
Nicolai Hähnle	a987e4377a	winsys/amdgpu: add slab entry structures to amdgpu_winsys_bo Already adjust amdgpu_bo_map/unmap accordingly. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-27 16:45:15 +02:00
Nicolai Hähnle	5af9eef719	winsys/amdgpu: do not synchronize unsynchronized buffers When a buffer is added to a CS without the SYNCHRONIZED usage flag, we now no longer add a dependency on the buffer's fence(s). However, we still need to add a fence to the buffer during flush, so that cache reclaim works correctly (and in the hypothetical case that the buffer is later added to a CS _with_ the SYNCHRONIZED flag). It is now possible that the submissions referring to a buffer are no longer linearly ordered, and so we may have to keep multiple fences around. We keep the fences in a FIFO. It should usually stay quite short (# of contexts * 2, for gfx + dma rings). While we're at it, extract amdgpu_add_fence_dependency for a single buffer, which will make adding the distinction between real buffer and slab cases easier. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-09-27 16:45:11 +02:00
Nicolai Hähnle	6d89a40676	gallium/radeon: add RADEON_FLAG_HANDLE When passed to winsys->buffer_create, this flag will indicate that we require a buffer that maps 1:1 with a kernel buffer handle. This is currently set for all textures, since textures can potentially be exported to other processes. This is not a huge loss, since the main purpose of this patch series is to deal with applications that allocate many small buffers. A hypothetical application with tons of tiny textures might still benefit from not setting this flag, but that's not a use case I'm worried about just now. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-27 16:45:05 +02:00
Marek Olšák	35d284d08e	winsys/amdgpu: don't assume GTT if the VRAM flag isn't set Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-09-13 20:38:25 +02:00
Mauro Rossi	6b9d7e69ee	android: add support for libmesa_amdgpu_addrlib Android porting of the following commits: `f1f1ba3` "radeonsi: move sid.h/r600d_common.h to a common place." `69fca64` "amd/addrlib: move addrlib from amdgpu winsys to common code" This patch fixes android building errors Reviewed-by: Dave Airlie <airlied@redhat.com>	2016-09-13 10:06:04 +10:00
Nicolai Hähnle	17fff0c2de	winsys/amdgpu: remove amdgpu_cs_lookup_buffer The radeonsi driver doesn't and shouldn't care about the buffer index. Only the virtual addresses matter. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-12 13:55:47 +02:00
Nicolai Hähnle	12657a7abf	winsys/amdgpu: remove unused field domains from amdgpu_cs_buffer Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-12 13:55:07 +02:00
Nicolai Hähnle	3cdeb2a177	winsys/amdgpu: remove initial buffer list allocation It's really not necessary. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-12 13:55:04 +02:00
Nicolai Hähnle	cc53dfda9f	winsys/amdgpu: extract adding a new buffer list entry into its own function While at it, try to be a little more robust in the face of memory allocation failure. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-12 13:55:01 +02:00
Nicolai Hähnle	11cbf4d7ae	winsys/amdgpu: use only one fence per BO The fence that is added to the BO during flush is guaranteed to be signaled after all the fences that were in the fences array of the BO before the flush, because those fences are added as dependencies for the submission (and all this happens atomically under the bo_fence_lock). Therefore, keeping only the last fence around is sufficient. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-12 13:54:59 +02:00
Nicolai Hähnle	480ac143df	winsys/amdgpu: add do_winsys_deinit function The idea is to have matching init/deinit functions so that deinit can be re-used for cleanup in the error path of amdgpu_winsys_create. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-12 13:54:56 +02:00
Nicolai Hähnle	9fb8d354ca	winsys/amdgpu: clean up error paths in amdgpu_winsys_create No need to call pb_cache_deinit, because the cache hasn't been initialized at that point. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-12 13:54:53 +02:00
Nicolai Hähnle	339867c077	gallium/radeon/winsyses: remove #includes of pb_bufmgr.h Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-12 13:54:36 +02:00
Marek Olšák	f9750932ea	winsys/amdgpu: replace OUT_CS with radeon_emit Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-09-09 22:45:06 +02:00
Marek Olšák	53d74e055e	gallium/radeon/winsyses: fix counting mapped memory Not all buffers are unmapped explicitly. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-09-07 11:13:13 +02:00
Dave Airlie	69fca64259	amd/addrlib: move addrlib from amdgpu winsys to common code Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-09-06 10:06:33 +10:00
Dave Airlie	f1f1ba3781	radeonsi: move sid.h/r600d_common.h to a common place. Step one to merging radv would be to move some files around. This only adds the include path to r600/radeonsi, because later we want to avoid having to add it to the generic target paths. Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-09-06 10:05:13 +10:00
Marek Olšák	281f1a5980	winsys/amdgpu: disable IB chaining on SI Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-08-26 15:50:10 +02:00
Marek Olšák	a6869e7c06	winsys/amdgpu: finish up SI addrlib integration Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net>	2016-08-26 15:50:10 +02:00
Ronie Salgado	97b55243fb	winsys/amdgpu: initial SI support Signed-off-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net>	2016-08-26 15:50:10 +02:00
Marek Olšák	971ef7518f	gallium/radeon: add a driver query for AMDGPU_INFO_NUM_EVICTIONS If the kernel driver doesn't support it, it returns 0. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-08-26 15:50:10 +02:00
Marek Olšák	1e04483c22	winsys/amdgpu: track the amount of mapped memory Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-10 01:11:10 +02:00
Marek Olšák	8276776e64	winsys/amdgpu: don't try to unmap userptr buffers no app calls this AFAIK Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-10 01:11:10 +02:00
Nicolai Hähnle	e0736c438c	winsys/amdgpu: query ME/PFP/CE firmware versions The radeon kernel module doesn't have the firmware query interface, so the corresponding values will remain 0. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-08-08 12:52:41 +02:00
Marek Olšák	63b99590db	winsys/amdgpu: implement cs_get_next_fence Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-06 14:29:30 +02:00
Marek Olšák	c5ff0d3e65	gallium/radeon: move radeon_winsys::cs_memory_below_limit to drivers Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-06 13:56:14 +02:00
Marek Olšák	076db67217	gallium/radeon: inline radeon_winsys::query_memory_usage Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-06 13:56:14 +02:00
Marek Olšák	9646ae7799	gallium/radeon/winsyses: expose per-IB used_vram and used_gart to drivers The following patches will use this. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-06 13:56:14 +02:00
Marek Olšák	1c8f17599e	gallium/radeon/winsyses: print CS submission error number Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-06 13:56:14 +02:00
Marek Olšák	0ab47146c9	winsys/amdgpu: use pb_cache buckets for fewer pb_cache misses Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-07-19 23:45:06 +02:00
Marek Olšák	3cdc0e133f	gallium/pb_cache: divide the cache into buckets for reducing cache misses Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-07-19 23:45:06 +02:00
Rob Clark	44bbfedbd9	gallium/u_queue: add optional cleanup callback Adds a second optional cleanup callback, called after the fence is signaled. This is needed if, for example, the queue has the last reference to the object that embeds the util_queue_fence. In this case we cannot drop the ref in the main callback, since that would result in the fence being destroyed before it is signaled. Signed-off-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-07-16 10:00:04 -04:00
Marek Olšák	85388652f9	winsys/amdgpu: return an error on IB submission failures Reviewed-by: Christian König <christian.koenig@amd.com>	2016-07-14 22:00:54 +02:00
Marek Olšák	a7d84f7731	gallium/radeon: add a return value to cs_flush Required by our UVD code. Reviewed-by: Christian König <christian.koenig@amd.com>	2016-07-14 22:00:54 +02:00
Marek Olšák	ed3912d0da	radeonsi: just save buffer sizes instead of buffers while recording IBs whole buffer objects are not needed Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-07-13 19:46:16 +02:00
Nicolai Hähnle	660cd3de4a	winsys/amdgpu: avoid flushed depth when possible If a depth/stencil texture has no mipmaps, we can always get a layout that is compatible with DB and TC. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-07-06 10:43:52 +02:00
Nicolai Hähnle	7000dfd5c3	gallium/radeon: add depth/stencil_adjusted output to surface computation This fixes a rare bug with stencil texturing -- seen on Polaris and Tonga, though it's basically a function of the memory configuration so could affect other parts as well. Fixes piglit "unaligned-blit * stencil downsample" and various "fbo-depth-array stencil" tests. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-07-06 10:43:52 +02:00
Nicolai Hähnle	19f8d2a843	gallium/radeon/winsyses: remove unused stencil_offset Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-07-06 10:42:49 +02:00
Marek Olšák	8a4ace4a47	gallium/radeon: add and use radeon_info::max_alloc_size (v2) v2: - squashed the patches - use INT_MAX - clamp max_const_buffer_size - check the DRM version in radeon Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Vedran Miletić <vedran@miletic.net>	2016-07-05 00:47:13 +02:00
sonjiang	28f85eab49	radeon uvd add uvd fw version for amdgpu Signed-off-by: sonjiang <sonny.jiang@amd.com> Cc: "12.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2016-06-29 15:30:14 -04:00
Marek Olšák	fa7c927625	radeonsi: always calculate DCC info even if it's not used immediately for a later use Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-29 20:12:00 +02:00
Marek Olšák	1c5a10497a	gallium/radeon/winsyses: boolean -> bool, TRUE -> true, FALSE -> false Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Vedran Miletić <vedran@miletic.net> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-25 23:13:42 +02:00
Nicolai Hähnle	49c0b4a0db	winsys/amdgpu: add guard pages when R600_DEBUG=check_vm is enabled This should help flush out GPU VM faults. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-24 12:36:03 +02:00
Marek Olšák	cbb5adb908	gallium/u_queue: allow the execute function to differ per job so that independent types of jobs can use the same queue. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-24 12:24:40 +02:00
Marek Olšák	2fba0aaa70	gallium/u_queue: add an option to name threads for debugging v2: correct the snprintf use Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-24 12:24:40 +02:00
Marek Olšák	404d0d50d8	gallium/u_queue: add an option to have multiple worker threads independent jobs don't have to be stuck on only one thread v2: use CALLOC & FREE Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-24 12:24:40 +02:00
Marek Olšák	d8367e91f2	gallium/u_queue: use a ring instead of a stack and allow specifying its size in util_queue_init. v2: use CALLOC & FREE Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-24 12:24:40 +02:00
Marek Olšák	562cb03d76	gallium/util: import the multithreaded job queue from amdgpu winsys (v2) v2: rename the event to util_queue_fence Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-15 21:07:34 +02:00
Marek Olšák	789618e3b4	gallium/radeon: add micro_tile_mode to radeon_surf for easier access Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-14 20:22:16 +02:00
Rob Herring	112e988329	Android: move libdrm settings to top-level Android.common.mk Fix warnings like these due to HAVE_LIBDRM being inconsistently defined: external/libdrm/include/drm/drm.h:839:30: warning: redefinition of typedef 'drm_clip_rect_t' is a C11 feature [-Wtypedef-redefinition] typedef struct drm_clip_rect drm_clip_rect_t; HAVE_LIBDRM needs to be set project wide to fix this. This change also harmlessly links libdrm with everything, but simplifies the makefiles a bit. Signed-off-by: Rob Herring <robh@kernel.org> Acked-by: Emil Velikov <emil.velikov@collabora.com>	2016-06-13 15:31:29 +01:00
Marek Olšák	00389100b6	winsys/amdgpu: enable DCC for mipmapped textures Also add dcc_fast_clear_size for clearing only the necessary subset of DCC. For no AA, it's equal to the size of the whole DCC level. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	aa7fe70443	radeonsi: add per-level dcc_enabled flags Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	d4d733e39d	gallium/radeon: don't allocate DCC for non-renderable texture formats R9G9B9E5 is the only uncompressed one hopefully. This fixes incorrect rendering not discovered (due to a lack of tests) until DCC mipmapping was enabled. Cc: 11.1 11.2 12.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Nicolai Hähnle	c7877b9dab	winsys/amdgpu: decay max_ib_size over time So that memory use will eventually decrease again after a temporary peak. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:20 +02:00
Nicolai Hähnle	6aff6377b1	winsys/amdgpu: implement IB chaining on the gfx ring As a consequence, CE IB size never triggers a flush anymore. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:20 +02:00
Nicolai Hähnle	45be461f55	winsys/amdgpu: consolidate IB size management in amdgpu_ib_finalize Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:20 +02:00
Nicolai Hähnle	89ba076de4	radeon/winsys: introduce radeon_winsys_cs_chunk We will chain multiple chunks together and will keep pointers to the older chunks to support IB dumping. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:20 +02:00
Nicolai Hähnle	83a01cb498	winsys/amdgpu: start with smaller IBs, growing as necessary This avoids allocating giant IBs from the outset, especially for CE and DMA. Since we now limit max_dw only by the size that the buffer happens to be (which, due to the buffer cache, can be even larger than the rounded-up size we request), the new function amdgpu_ib_max_submit_dwords controls when we submit an IB. With this change, we effectively never flush prematurely due to the CE IB, after an initial warm-up phase. v2: - clean up buffer_size calculation Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:19 +02:00
Nicolai Hähnle	f80c6abb9e	winsys/amdgpu: add amdgpu_ib and amdgpu_cs_from_ib helper functions The latter function allows getting the containing amdgpu_cs from any IB (including non-main ones). Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:19 +02:00
Nicolai Hähnle	9e5ed559ba	winsys/amdgpu: extract IB big buffer allocation for re-use Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:19 +02:00
Nicolai Hähnle	9db851b5ee	winsys/amdgpu: add IB buffer in amdgpu_get_new_ib Adding the buffer when we start using it for the IB makes the logic for chaining a bit simpler. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:19 +02:00
Nicolai Hähnle	46ad3561be	radeon/winsys: add cs_check_space Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:18 +02:00
Nicolai Hähnle	92d5d97b10	winsys/amdgpu: simplify interface of amdgpu_get_new_ib We'll want to have an amdgpu_cs pointer for future changes. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:18 +02:00
Nicolai Hähnle	8396ab4241	winsys/amdgpu: add amdgpu_cs_has_user_fence v2: style change Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:18 +02:00
Marek Olšák	ea68215c54	winsys/amdgpu: set flags correctly when allocating depth-stencil buffers This mimics Vulkan. It also documents how to fix stencil texturing. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2016-06-01 17:35:30 +02:00
Marek Olšák	53f33619a4	winsys/amdgpu: add back multithreaded command submission Ported from the initial amdgpu winsys from the private AMD branch. The thread creates the buffer list, submits IBs, and cleans up the submission context, which can also destroy buffers. 3-5% reduction in CPU overhead is expected for apps submitting a lot of IBs per frame. This is most visible with DMA IBs. v2: use a semaphore instead of a busy loop in amdgpu_ws_queue_cs add another amdgpu_cs_sync_flush call into amdgpu_bo_map Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-05-26 16:43:45 +02:00
Marek Olšák	54737aabb9	gallium/radeon: don't set PB_USAGE in winsyses There is no point. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-05-19 12:35:50 +02:00
Nicolai Hähnle	d464bfd12a	winsys/amdgpu: cleanup error handling in amdgpu_ctx_create Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-05-17 15:28:40 -05:00
Nicolai Hähnle	fef08af99c	winsys/amdgpu: avoid ioctl call when fence_wait is called without timeout When user fences are used, we don't need the kernel for polling. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-05-17 15:28:39 -05:00
Jan Vesely	47b390fe45	Treewide: Remove Elements() macro Signed-off-by: Jan Vesely <jano.vesely@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2016-05-17 15:28:04 -04:00
Marek Olšák	60946c0d60	gallium/radeon: add a heuristic for better (S)DMA performance Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-05-10 17:20:09 +02:00
Marek Olšák	c2377b394b	gallium/radeon: align alignments for better buffer reuse It's for the buffer cache. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-05-10 17:20:09 +02:00
Marek Olšák	544967faf5	gallium/radeon: use gart_page_size instead of hardcoded 4096 Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-05-10 17:20:09 +02:00
Marek Olšák	9d8c283f28	winsys/amdgpu: move gart_page_size to struct radeon_winsys Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-05-10 17:20:09 +02:00
Marek Olšák	20a77397fa	gallium/radeon: remove tile_mode_array_valid flags Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2016-05-02 22:49:25 +02:00
Marek Olšák	c8aac4fc0d	winsys/amdgpu: pass PIPE_CONFIG to addrlib on texture import This hasn't been needed, but I think we should set it. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2016-05-02 22:49:25 +02:00
Marek Olšák	dc970c4f4e	winsys/amdgpu: read NUM_BANKS from buffer metadata Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2016-05-02 22:49:25 +02:00
Marek Olšák	ef45825708	gallium/radeon: add radeon_surf::macro_tile_index for indexing cik_macrotile_mode_array Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2016-05-02 22:49:25 +02:00
Marek Olšák	3e3c43418e	winsys/amdgpu: count buffer size only once Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-05-02 22:49:25 +02:00
Marek Olšák	f98ba4123c	winsys/amdgpu: loosen up requirements for how much memory IBs can use ported from winsys/radeon. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-05-02 22:49:25 +02:00
Marek Olšák	92f6af2c4a	gallium/radeon: drop support for LINEAR_GENERAL layout Unused. All texture imports use LINEAR_ALIGNED regardless of what the DDX does. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2016-04-28 20:16:56 +02:00
Nicolai Hähnle	562c4a17b7	winsys/radeon: remove use_reusable_pool parameter from buffer_create All callers set this parameter to true. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-04-27 11:16:41 -05:00
Jakob Sinclair	ea327dc451	gallium: Remove every double semi-colon Signed-off-by: Jakob Sinclair <sinclair.jakob@openmailbox.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Chad Versace <chad.versace@intel.com>	2016-04-26 14:36:29 -07:00
Bas Nieuwenhuizen	51d1551241	winsys/amdgpu: Silence possibly uninitialized variable warning. Signed-off-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-04-21 13:40:42 +02:00
Thomas Hindoe Paaboel Andersen	d4a21a0de0	winsys/amdgpu: fix preamble IB size The missing break caused the IB size to be overwritten with the size of IB_CONST. This was introduced in: `7201230582` Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2016-04-21 12:14:50 +02:00
Bas Nieuwenhuizen	7201230582	winsys/amdgpu: Enlarge const IB size. Necessary to prevent performance regressions due to extra flushing. Probably should enlarge it even further when also updating uniforms through the CE, but this seems large enough for now. v2: Add preamble IB. Signed-off-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-04-19 18:10:30 +02:00
Marek Olšák	7997b5f005	winsys/amdgpu: Add support for const IB. v2: Use the correct IB to update request (Bas Nieuwenhuizen) v3: Add preamble IB. (Bas Nieuwenhuizen) Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-04-19 18:10:30 +02:00
Marek Olšák	e78170f388	winsys/amdgpu: split IB data into a new structure in preparation for CE Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-04-19 18:10:30 +02:00
Marek Olšák	f4b77c764a	gallium/radeon: move ring_type into winsyses Not used by drivers. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-04-19 18:10:30 +02:00
Marek Olšák	5a4b74d1ba	gallium/radeon: relax requirements on VRAM placements on APUs This makes Tonga with vramlimit=128 2x faster in Heaven. Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com>	2016-04-12 14:29:48 +02:00
Marek Olšák	a57309f807	winsys/amdgpu: remove hack for low VRAM configuration A better solution will be used. Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com>	2016-04-12 14:29:48 +02:00
Marek Olšák	0ba0933f48	winsys/amdgpu: add support for 64-bit buffer sizes v2: fail in radeon_winsys_bo_create if size > 32 bits Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-04-12 14:29:47 +02:00
Sonny Jiang	42e442d888	radeonsi: add support for Polaris (v2) v2: Polaris chips should be defined after Stoney Signed-off-by: Sonny Jiang <sonny.jiang@amd.com> (v1) Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> (v1) Signed-off-by: Leo Liu <leo.liu@amd.com> (v2 diff) Reviewed-by: Alex Deucher <alexander.deucher@amd.com> (v2 diff)	2016-03-24 23:07:32 -04:00
Sonny Jiang	f5e24b19e8	winsys/amdgpu: addrlib - add Polaris support (v2) v2: fix indentation as noted by Michel Signed-off-by: Sonny Jiang <sonny.jiang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2016-03-24 23:06:39 -04:00
Marek Olšák	8140154ae9	gallium/radeon: remove old CS tracing Cons: - it was only integrated in r600g - it doesn't work with GPUVM - it records buffer contents at the end of IBs instead of at the beginning, so the replay isn't exact - it lacks an IB parser and user-friendliness A better solution is apitrace in combination with gallium/ddebug, which has a complete IB parser and can pinpoint hanging CP packets. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-03-20 00:56:35 +01:00
Christian König	5aea0d6919	radeon/winsys: add layer support for BO export Add layer support to export individual array layers. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-17 14:17:06 +01:00
Christian König	04bc082f6a	radeon/winsys: add offset support for BO import/export Add offset support to handle NV12 offsets as well. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-17 14:17:03 +01:00
Christian König	29d26f1522	gallium/winsys/drm: add offset to struct winsys_handle We are going to need this for EGL_EXT_image_dma_buf_import. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-17 14:16:03 +01:00
Emil Velikov	3ffab9a89c	winsys/amdgpu/addrlib: do not wrap header inclusion in extern "C" Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-09 17:16:51 +00:00
Marek Olšák	737b6ed13e	winsys/amdgpu: get PCI info This will be queried by the OpenCL stack using an interop call. I have tested that the values match lspci. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-03-09 15:02:28 +01:00
Marek Olšák	f4aa3256ef	winsys/amdgpu: allow drivers to set/get opaque metadata Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-03-09 15:02:26 +01:00
Marek Olšák	bd1feb2827	gallium/radeon: rename winsys buffer_get/set_tiling to buffer_get/set_metadata Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-03-09 15:02:26 +01:00
Marek Olšák	6011d7cf25	gallium/radeon: remove rcs parameter from radeon_winsys::buffer_set_tiling This was needed for DRM < 2.12.0 where the kernel was rewriting tiling flags in IBs. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-03-09 15:02:25 +01:00
Marek Olšák	260ef9c9be	gallium/radeon: use a structure for passing tiling flags from/to winsys and call it radeon_bo_metadata Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-03-09 15:02:25 +01:00
Bas Nieuwenhuizen	6373845d98	winsys/amdgpu: enlarge buffer_indices_hashlist Enlarge the buffer hashlist to prevent large numbers of misses due to adding more buffers than can be cached in the hashlist. The game I tested had CS's with up to 1500 buffers and the overhead of amdgpu_lookup_buffer for various sizes was: 4096: 1.97% (new value); 2048: 4.37%; 1024: 6.92%; 512: 9.47% (old value) (percentage of CPU usage in render thread as determined by perf). The time spent in amdgpu_add_buffer self is ~4.2% in all cases and for 4096 the time needed to clear the hashlist is still < 0.10%, so I am not expecting significant regressions. Signed-off-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2016-03-09 00:52:07 +01:00
Marek Olšák	100796c15c	gallium/radeon: drop support for LLVM 3.5 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> v2: adjust the comment in the amdgpu winsys	2016-02-11 16:48:30 +01:00
Marek Olšák	c577f2843a	gallium/radeon: remove radeon_info::r600_tiling_config Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2016-02-05 17:29:19 +01:00
Marek Olšák	4f96846d9d	gallium/radeon: get pipe_interleave_bytes AKA group_bytes from the winsys Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2016-02-05 17:28:59 +01:00
Marek Olšák	0f3556d308	winsys/amdgpu: add an assertion to cik_get_num_tile_pipes (v2) v2: print an error to stderr Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2016-02-05 17:28:18 +01:00
Marek Olšák	a2291f7b57	winsys/amdgpu: remove an r600-only setting Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2016-02-05 17:28:12 +01:00
Marek Olšák	1e864d7379	gallium/radeon: rename & reorder members of radeon_info Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2016-02-05 17:28:00 +01:00
Michel Dänzer	30fcf241e1	winsys/amdgpu: Process RADEON_FLAG_* independently from RADEON_DOMAIN_* In particular, AMDGPU_GEM_CREATE_CPU_GTT_USWC can affect even BOs created in VRAM if they get evicted to GTT. In general there's no need to restrict any of the flags to any particular domains. Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2016-01-29 16:06:06 +09:00
Michel Dänzer	62f837e2ea	winsys/amdgpu: Handle RADEON_FLAG_NO_CPU_ACCESS Failing to do this was resulting in the kernel driver unnecessarily leaving open the possibility of CPU access to tiled BOs. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=93862 (This change shouldn't be backported to stable branches, because released versions of xf86-video-amdgpu unnecessarily try to map the front buffer) Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2016-01-29 16:06:06 +09:00
Marek Olšák	e707b9d8ba	winsys/amdgpu: optionally use buffer lists with all allocated buffers Set RADEON_ALL_BOS=1 to use it. Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-01-23 17:01:54 +01:00
Marek Olšák	2b66bc87d4	winsys/amdgpu: compute num_good_compute_units correctly Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-01-22 15:02:40 +01:00
Marek Olšák	0d8e4f958f	gallium/radeon: rename max_compute_units -> num_good_compute_units radeon sets this correctly, but not amdgpu Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-01-22 15:02:40 +01:00
Nicolai Hähnle	e976860638	gallium/radeon: do not reallocate user memory buffers The whole point of AMD_pinned_memory is that applications don't have to map buffers via OpenGL - but they're still allowed to, so make sure we don't break the link between buffer object and user memory unless explicitly instructed to. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-01-14 09:41:24 -05:00
Marek Olšák	369afdb7b6	winsys/amdgpu: clear the buffer cache on mmap failure and try again Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-12-11 15:25:13 +01:00
Marek Olšák	eb1e1af676	winsys/amdgpu: clear the buffer cache on allocation failure and try again Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-12-11 15:25:13 +01:00
Marek Olšák	cf811faeff	gallium/radeon: remove radeon_winsys_cs_handle "radeon_winsys_cs_handle cs_buf" is now equivalent to "pb_buffer buf". Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-12-11 15:25:13 +01:00
Marek Olšák	1e05812fcd	winsys/amdgpu: don't use the "rws" abbreviation for amdgpu_winsys Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-12-11 15:25:12 +01:00
Marek Olšák	6f4e74d165	winsys/amdgpu: use pb_cache instead of pb_cache_manager This is a prerequisite for the removal of radeon_winsys_cs_handle. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-12-11 15:25:12 +01:00
Sonny Jiang	2618886600	winsys/amdgpu: addrlib - port a Fiji bug fix Fiji: Fixed tiled resource failures Signed-off-by: Sonny Jiang <sonny.jiang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> v2: fix a compile failure (typo) - Marek	2015-12-07 21:58:42 +01:00
Sonny Jiang	338d7bf053	winsys/amdgpu: addrlib - port Checks mip 0 for czDispCompatible Signed-off-by: Sonny Jiang <sonny.jiang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-12-07 21:58:42 +01:00
Sonny Jiang	676bc25140	winsys/amdgpu: addrlib - port fix error for workaround for 1D tiling Signed-off-by: Sonny Jiang <sonny.jiang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-12-07 21:58:42 +01:00
Edward O'Callaghan	3edae10601	winsys/amdgpu: Make use of ARRAY_SIZE macro Signed-off-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2015-12-06 17:09:54 +01:00
Marek Olšák	93eb4f9287	winsys/amdgpu: remove the dcc_enable surface flag dcc_size is sufficient and doesn't need a further comment in my opinion. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-10-27 10:49:24 +01:00
Bas Nieuwenhuizen	afa357c3b0	radeonsi: Allocate buffers for DCC. As the alignment requirements can be 32 KiB or more, also adding an aligned buffer creation function. DCC is disabled for textures that can be shared as sharing the DCC buffers has not been implemented yet. Signed-off-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2015-10-24 00:42:01 +02:00
Samuel Li	bf0d0ce0d5	radeonsi: add support for Stoney asics (v3) v2 (agd): rebase on mesa master, split pci ids to separate commit v3 (agd): use carrizo for llvm processor name for llvm 3.7 and older Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Samuel Li <samuel.li@amd.com> Cc: mesa-stable@lists.freedesktop.org	2015-10-23 17:53:14 -04:00
Marek Olšák	6f48e2bee1	winsys/amdgpu: add winsys function cs_get_buffer_list For debugging. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-10-03 22:06:07 +02:00
Marek Olšák	93641f4341	gallium/radeon: stop using "reloc" in a few places Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-10-03 22:06:07 +02:00
Marek Olšák	2edb060639	gallium/radeon: tell the winsys the exact resource binding types Use the priority flags and expand them. This information will be used for debugging. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-10-03 22:06:07 +02:00
Marek Olšák	c6502e880b	winsys/amdgpu: calculate the maximum number of compute units Required for register spilling. Cc: 11.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2015-09-10 17:14:15 +02:00
Marek Olšák	54964c7751	winsys/amdgpu: remove exported buffers from the cache Cc: 11.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2015-09-03 18:41:42 +02:00
Marek Olšák	ba79ff7fa8	winsys/amdgpu: remove IB padding for SI SI is unsupported by amdgpu Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2015-09-01 21:51:15 +02:00
Marek Olšák	b1e5451211	winsys/amdgpu: use small IBs for better performance on VI Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2015-09-01 21:51:13 +02:00
Marek Olšák	6924ecac77	gallium/radeon: read_registers should return bool meaning success or failure Acked-by: Christian König <christian.koenig@amd.com> Acked-by: Alex Deucher <alexander.deucher@amd.com>	2015-08-26 19:25:20 +02:00
Emil Velikov	4a21da709b	winsys/amdgpu: automake: remove missing headers The files are not referenced in any other place in whole of mesa. They are likely remnants of the early development stage. Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com>	2015-08-22 11:23:58 +01:00
Mauro Rossi	6aaa814995	android: fix cflags and includes for amdgpu winsys Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>	2015-08-22 11:22:24 +01:00
Marek Olšák	5fb0180592	winsys/amdgpu: fix the type of memory usage counters If the 32-bit types overflowed, the driver could submit an IB that uses much more memory than is available. Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-08-19 12:03:01 +02:00
Leo Liu	0654a9ca17	radeon/vce: disable VCE dual instance for harvest part Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2015-08-14 15:02:31 +02:00
Alex Deucher	767ad50a10	radeonsi: add support for FIJI (v4) v2: incorporate comments from Marek v3: add missing fiji case in winsys init use tonga raster config (double check this) v4: rebase on harvest patch Reviewed-by: Marek Olšák <marek.olsak@amd.com> (v3) Reviewed-by: Christian König <christian.koenig@amd.com> (v3) Reviewed-by: David Zhang <david1.zhang@amd.com> (v3) Signed-off-by: Alex Deucher <alexander.deucher@amd.com>	2015-08-14 15:02:29 +02:00
Alex Deucher	528a6ff599	winsys/amdgpu: add addrlib support for Fiji (v2) v2: fix tonga chip check Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: David Zhang <david1.zhang@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>	2015-08-14 15:02:29 +02:00
Marek Olšák	e7fc664b91	winsys/amdgpu: add addrlib - texture addressing and alignment calculator This is an internal project that Catalyst uses and now open source will do too. v2: squashed these commits in: - winsys/amdgpu: fix warnings in addrlib - winsys/amdgpu: set PIPE_CONFIG and NUM_BANKS in tiling_flags	2015-08-14 15:02:28 +02:00
Marek Olšák	2eb067db0f	winsys/amdgpu: add a new winsys for the new kernel driver v2: - lots of changes according to Emil Velikov's comments - implemented radeon_winsys::read_registers v3: - a lot of new work, many of them adapt to libdrm interface changes Squashed patches: winsys/amdgpu: implement radeon_winsys context support winsys/amdgpu: add reference counting for contexts winsys/amdgpu: add userptr support winsys/amdgpu: allocate IBs like normal buffers winsys/amdgpu: add IBs to the buffer list, adapt to interface changes winsys/amdgpu: don't use KMS handles as reloc hash keys winsys/amdgpu: sync buffer accesses to different rings winsys/amdgpu: use dependencies instead of waiting for last fence v2 gallium/radeon: unify buffer_wait and buffer_is_busy in the winsys interface (amdgpu part) winsys/amdgpu: track fences per ring and be thread-safe winsys/amdgpu: simplify waiting on a variable in amdgpu_fence_wait gallium/radeon: allow the winsys to choose the IB size (amdgpu part) winsys/amdgpu: switch to new amdgpu_cs_query_fence_status interface winsys/amdgpu: handle fence and dependencies merge winsys/amdgpu follow libdrm change to move user fence into UMD winsys/amdgpu: use amdgpu_bo_va_op for va map/unmap v2 winsys/amdgpu: use the new tiling flags winsys/amdgpu: switch to new GTT_USWC definition winsys/amdgpu: expose amdgpu_cs_query_reset_state to drivers winsys/amdgpu: fix valgrind warnings winsys/amdgpu: don't use VRAM with APUs that don't have much of it winsys/amdgpu: require LLVM 3.6.1 for VI because of bug fixes there winsys/amdgpu: remove amdgpu_winsys::num_cpus winsys/amdgpu: align BO size to page size winsys/amdgpu: reduce BO cache timeout winsys/amdgpu: remove useless flushing and waiting in amdgpu_bo_set_tiling winsys/amdgpu: use amdgpu_device_handle as a unique device ID instead of fd winsys/amdgpu: use safer access to amdgpu_fence_wait::signalled winsys/amdgpu: allow maximum IB size of 4 MB winsys/amdgpu: add ip_instance into amdgpu_fence gallium/radeon: add RING_COMPUTE instead of RADEON_FLUSH_COMPUTE winsys/amdgpu: set the ring type at CS initilization winsys/amdgpu: query the GART page size from the kernel winsys/amdgpu: correctly wait for shared buffers to become idle winsys/amdgpu: set the amdgpu_cs_fence structure only once at fence creation winsys/amdgpu: add a specific error message for cs_submit -> -ENOMEM winsys/amdgpu: check num_active_ioctls before calling amdgpu_bo_wait_for_idle winsys/amdgpu: clear user fence BO after allocating it winsys/amdgpu: fix user fences winsys/amdgpu: make amdgpu_winsys_create public winsys/amdgpu: remove thread offloading winsys/amdgpu: flatten the amdgpu_cs_context structure and simplify more v4: require libdrm 2.4.63	2015-08-14 15:02:28 +02:00

553 Commits