KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Samuel Pitoiset	e2c15ea092	gallium/radeon: add new HUD queries for monitoring the CP There are even more counters in the CP_STAT register but I think these ones are enough for now. v2: only read (and expose) CP_STAT on VI+ Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-01-30 14:37:00 +01:00
Samuel Pitoiset	0e04a078c5	gallium/radeon: add new GPU-sdma-busy HUD query For simplicity, GPU-sdma-busy will return 0 on previous gens. v2: only read SRBM_STATUS2 on Evergreen+ Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-01-30 14:37:00 +01:00
Samuel Pitoiset	eca96ea308	gallium/radeon: add VRAM-vis-usage HUD query This new query returns the current visible usage of VRAM accessed by the CPU. It will return 0 on radeon because it's unimplemented. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Edmondo Tommasina <edmondo.tommasina@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-01-26 19:40:52 +01:00
Samuel Pitoiset	cff199ceb7	gallium/radeon: add a new HUD query for the number of mapped buffers Useful when debugging applications which map a ton of buffers and also because we used to run into Linux's limit on the number of simultaneous mmap() calls. v2: - update the commit message Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-25 15:19:21 +01:00
Samuel Pitoiset	aa2ace8e49	gallium/radeon: add HUD queries for monitoring some hw blocks It's also possible to monitor them via performance counters but the hardware can only use two counters simultaneously. It seems easier to re-use the existing code which reads from MMIO instead of writing a multi-pass approach. v2: - add new lines after ':' Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-01-23 21:19:49 +01:00
Marek Olšák	0d9a4efce9	gallium/radeon: add GPU-shaders-busy HUD query It should be close to the GPU load, but it can be much lower if something is stalling shader execution (e.g. CP DMA). Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-16 15:35:30 +01:00
Marek Olšák	2b621c47aa	gallium/radeon: add new HUD query num-SDMA-IBs Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-06 21:05:48 +01:00
Marek Olšák	6b8a371e00	gallium/radeon: rename the num-ctx-flushes query to num-GFX-IBs Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-06 21:05:48 +01:00
Marek Olšák	5871ebd7f1	radeonsi: add HUD queries for cache flush stats Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-06 21:05:48 +01:00
Marek Olšák	315eb0acb4	radeonsi: add a driver query for counting CP DMA calls CP DMA calls are synchronous with regard to shaders, but can be made asynchronous if needed. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-11-01 22:33:13 +01:00
Marek Olšák	d268b7f95e	radeonsi: add a driver query for shader cache hits This is an 8-month old patch. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-11-01 22:33:13 +01:00
Nicolai Hähnle	15e2661137	gallium/radeon: implement get_query_result_resource (v2) v2: fix a comment (Gustaw Smolarczyk) Acked-by: Edward O'Callaghan <funfunctor@folklore1984.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-29 11:14:54 +02:00
Nicolai Hähnle	2c9d546402	gallium/radeon: zero all query buffers To ensure that fences are properly initialized. Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-29 11:14:51 +02:00
Nicolai Hähnle	51b57a9b5a	radeonsi: add save_qbo_state Save compute shader state that will be used for the ARB_query_buffer_object implementation. Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-09-29 11:14:37 +02:00
Marek Olšák	addca75f4e	radeonsi: add HUD queries for counting VS/PS/CS partial flushes Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-09-05 18:01:15 +02:00
Marek Olšák	1d0593abd7	gallium/radeon: rename the num-cs-flushes query to num-ctx-flushes num-cs-flushes will mean compute shader flushes Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-09-05 18:01:15 +02:00
Marek Olšák	971ef7518f	gallium/radeon: add a driver query for AMDGPU_INFO_NUM_EVICTIONS If the kernel driver doesn't support it, it returns 0. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-08-26 15:50:10 +02:00
Marek Olšák	33a9b4e8a1	gallium/radeon: add HUD queries for mapped VRAM/GTT mainly for monitoring visible VRAM congestion Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-10 01:11:10 +02:00
Marek Olšák	44906101c4	gallium/radeon: don't re-create queries for DCC stat gathering Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-29 20:12:00 +02:00
Marek Olšák	6da92df538	gallium/radeon: add a HUD query for PS draw ratio stats from separate DCC Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-29 20:12:00 +02:00
Marek Olšák	e607a6be2b	gallium/radeon: add flag R600_QUERY_HW_FLAG_BEGIN_RESUMES Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-29 20:12:00 +02:00
Marek Olšák	7db10093d3	gallium/radeon: boolean -> bool, TRUE -> true, FALSE -> false Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Vedran Miletić <vedran@miletic.net> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-25 23:13:42 +02:00
Marek Olšák	4140afd04b	gallium/radeon: add driver queries for compute/dma call stats and spills also print the average count per frame Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-14 20:22:16 +02:00
Nicolai Hähnle	fe3b1e1448	radeon: handle query buffer allocation and mapping failures Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=94984 Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-04-21 22:33:12 -05:00
Nicolai Hähnle	b222580578	radeon: wire end_query return value to sw/hw_end Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-04-21 22:33:07 -05:00
Marek Olšák	0c52caf7b7	gallium/radeon: use enums in r600_query.h Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-04-18 19:51:25 +02:00
Marek Olšák	e241a63512	gallium/radeon: remove R600_QUERY_HW_FLAG_TIMER not used anymore Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-04-12 14:29:47 +02:00
Nicolai Hähnle	988f4b31f3	radeonsi: re-order the SQ_xx performance counter blocks This is yet another change motivated by appeasing AMD GPUPerfStudio's hardcoding of performance counter group numbers. Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Acked-by: Marek Olšák <marek.olsak@amd.com>	2016-02-05 09:25:30 -05:00
Nicolai Hähnle	b0e32548c8	gallium/radeon: add GPIN driver query group This group was used by older versions of AMD GPUPerfStudio (via AMD_performance_monitor) to identify the GPU family, and GPUPerfStudio still complains when it isn't available. Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Acked-by: Marek Olšák <marek.olsak@amd.com>	2016-02-05 09:24:59 -05:00
Nicolai Hähnle	80a16dece6	radeon: delay the generation of driver query names until first use This shaves a bit more time off the startup of programs that don't actually use performance counters. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2015-11-26 10:57:43 +01:00
Nicolai Hähnle	ad22006892	radeonsi: implement AMD_performance_monitor for CIK+ Expose most of the performance counter groups that are exposed by Catalyst. Ideally, the driver will work with GPUPerfStudio at some point, but we are not quite there yet. In any case, this is the reason for grouping multiple instances of hardware blocks in the way it is implemented. The counters can also be shown using the Gallium HUD. If one is interested to see how work is distributed across multiple shader engines, one can set the environment variable RADEON_PC_SEPARATE_SE=1 to obtain finer-grained performance counter groups. Part of the implementation is in radeon because an implementation for older hardware would largely follow along the same lines, but exposing a different set of blocks which are programmed slightly differently. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2015-11-25 15:52:09 +01:00
Nicolai Hähnle	5bda3d0958	radeon: re-prepare query buffers on begin_query for predicate queries The point of prepare_buffer is to ensure that the query buffer contains valid initial data for conditional rendering: as long as the buffer is initialized correctly, the GPU is able to tell whether query results have been written already (and wait or fall back to unconditional rendering if desired). This means prepare_buffer needs to be called again when a buffer is reused. Conversely, for queries that cannot be used for conditional rendering (notably pipeline statistics), we can re-use buffers immediately, and they do not need to be initialized. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Andy Furniss <adf.lists@gmail.com>	2015-11-20 22:46:11 +01:00
Nicolai Hähnle	27ce75ed12	radeon: count cs dwords separately for query begin and end This will be important for perfcounter queries. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2015-11-18 12:27:13 +01:00
Nicolai Hähnle	ffd01b7781	radeon: expose r600_query_hw functions for reuse Reviewed-by: Marek Olšák <marek.olsak@amd.com> [Fixed a rebase conflict and re-tested before pushing.]	2015-11-18 12:27:13 +01:00
Nicolai Hähnle	50f0f938e3	radeon: implement r600_query_hw_get_result via function pointers We will need the clear_result override for the batch query implementation. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2015-11-18 12:27:13 +01:00
Nicolai Hähnle	c207c55fc0	radeon: split hw query buffer handling from cs emit The idea here is that driver queries implemented outside of common code will use the same query buffer handling with different logic for starting and stopping the corresponding counters. Reviewed-by: Marek Olšák <marek.olsak@amd.com> [Fixed a rebase conflict and re-tested before pushing.]	2015-11-18 12:27:13 +01:00
Nicolai Hähnle	1d10b3d01e	radeon: convert hardware queries to the new style Move r600_query and r600_query_hw into the header because we will want to reuse the buffer handling and suspend/resume logic outside of the common radeon code. Reviewed-by: Marek Olšák <marek.olsak@amd.com> [Fixed a rebase conflict and re-tested before pushing.]	2015-11-18 12:27:12 +01:00
Nicolai Hähnle	829a9808a9	radeon: add query handler function pointers The goal here is to be able to move the implementation details of hardware- specific queries (in particular, performance counters) out of the common code. Reviewed-by: Marek Olšák <marek.olsak@amd.com> [Fixed a rebase conflict and re-tested before pushing.]	2015-11-18 12:27:12 +01:00
Nicolai Hähnle	50cab4788d	radeon: move R600_QUERY_* constants into a new query header file More query-related structures will have to be moved into their own header file to support hardware-specific performance counters. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2015-11-18 12:27:12 +01:00

39 Commits