KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Dave Airlie	978285f69a	draw: add disk caching for draw shaders This adds the cache search/insert and compile skipping for cached objects to the VS/GS/TES/TCS stages in draw. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5049>	2020-06-11 06:05:41 +10:00
Dave Airlie	db82faff71	llvmpipe: hook draw disk cache up Connect the draw callbacks into the llvmpipe code. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5049>	2020-06-11 06:05:41 +10:00
Dave Airlie	e07e5137b0	draw: add disk cache callbacks for draw shaders This provides a set of hooks from the driver that draw can use to access the disk cache for the draw shaders. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5049>	2020-06-11 06:05:41 +10:00
Dave Airlie	c2864081e1	llvmpipe/cs: add shader caching As for fragment shader, skip compilation step if we have the shaders Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5049>	2020-06-11 06:05:40 +10:00
Dave Airlie	f0d91c9af3	llvmpipe/fs: add caching support Serialize and check if the object is in the cache, it there is a cached object skip compilation code once we've constructed the function interface. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5049>	2020-06-11 06:05:40 +10:00
Dave Airlie	1b2e345110	gallivm: don't cache shaders that use fetch functions. This needs to be reworked, but it's a bit messy as we have to store all the fetch pointers to be added as globals later once gallivm has been initialised further. For now just refuse to cache shaders that hit these paths (mainly ETC1 and BPTC). Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5049>	2020-06-11 06:05:40 +10:00
Dave Airlie	6c0c61cb48	llvmpipe: add infrastructure for disk cache support This hooks up the gallium API and adds the APIs needed for shader stages to search and add things to the cache. It also adds cache stats debug printing. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5049>	2020-06-11 06:05:40 +10:00
Dave Airlie	4962d3e107	gallivm: add cache interface to mcjit MCJIT uses an ObjectCache object to implement the cache, this creates and instances of it and adds it to the MCJIT instances, it stores the cached object for later use by the outer layers. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5049>	2020-06-11 06:05:40 +10:00
Dave Airlie	b15ecb1717	gallivm: skip operations if we have a cached object. If the object is loaded from the cache, a bunch of gallivm/llvm interactions can be skipped. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5049>	2020-06-11 06:05:40 +10:00
Dave Airlie	7b7c02d161	gallivm: add support for a cache object This plumbs the cache object into the gallivm API, nothing uses it yet. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5049>	2020-06-11 06:05:40 +10:00
Dave Airlie	333ee94285	gallivm: rework debug printf hook to use global mapping. Cached shaders require relinking, so hardcoding the pointer can't work. This switches out the printf code to use new proper API. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5049>	2020-06-11 06:05:40 +10:00
Dave Airlie	f511d2a553	gallivm: rework coroutine malloc/free callouts. When using cached shaders we have to relink the shader with external symbols when it's loaded. However the way gallivm does function calls now hardcodes the function pointer into the shader. LLVM had a mechanism for doing this properly using global mappings, this switches the coroutine alloc/free code to use a global mapping. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5049>	2020-06-11 06:05:40 +10:00
Dave Airlie	d815d74f75	llvmpipe/draw: drop variant number from function names. When we use an object cache for the MCJIT we can have identical cache entries from the same shader variant in different shaders, but the JIT objcache uses the function name to relink things, so it has to be consistent. Just drop the variants from the function names. Note the modules still have the variant info. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5049>	2020-06-11 06:05:40 +10:00
Dave Airlie	e639e311a1	llvmpipe/cs: overhaul cs variant key state. This just realigns it with the fs state, and fixes some issues where shaders weren't getting cached correctly. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5049>	2020-06-11 06:05:28 +10:00
Dave Airlie	8735e96c53	util/disk_cache: add fallback for disk_cache_get_function_identifier Otherwise drivers need to have a ifdef on windows, easier to fix here hopefully. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5049>	2020-06-11 06:05:28 +10:00
Christian Gmeiner	456e8103ef	ci: fix possible spuriously run of jobs Need to list arm_test-base here as well, or jobs using this template may spuriously run if the arm_test-base job fails or is cancelled. Suggested-by: Michel Dänzer <mdaenzer@redhat.com> Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Michel Dänzer <mdaenzer@redhat.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5405>	2020-06-10 16:13:50 +00:00
Marek Olšák	bd553f0546	ac/surface: cache DCC retile maps (v2) This reduces overhead when resizing windows or when allocating similar image sizes over and over again. v2: optimize the memory footprint of the cache Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5398>	2020-06-10 15:35:46 +00:00
Marek Olšák	4cf674c8f7	ac/surface: add a wrapper structure to hold ADDR_HANDLE and more things in the future. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5398>	2020-06-10 15:35:46 +00:00
Marek Olšák	e6996d6fbd	amd/addrlib: remove unused members of ADDR2_COMPUTE_DCC_ADDRFROMCOORD_INPUT Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5398>	2020-06-10 15:35:46 +00:00
Marek Olšák	a99f4d5382	amd/addrlib: don't recompute DCC info for every ComputeDccAddrFromCoord call This decreases the DCC retile map overhead from 23% to 18%. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5398>	2020-06-10 15:35:46 +00:00
Marek Olšák	a1b9eb62f6	ac/surface: don't recompute the DCC retile map for imported textures The retile map is not used in this case, and the retile map computation takes 39% of CPU time when resizing a window. This brings it down to 23%. The dcc_retile_use_uint16 setting has to be derived from DCC sizes. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5398>	2020-06-10 15:35:46 +00:00
Rhys Perry	1b2e1163b2	aco: fix moving sub-dword values out of a register for a fixed definition Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5040>	2020-06-10 15:05:11 +00:00
Rhys Perry	edf863d1d2	aco: use Info::definition_size instead of definition's regclass 16-bit abs/neg creates v_xor_b32/v_and_b32 with v2b definitions. These instructions never do partial writes without SDWA. No shader-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5040>	2020-06-10 15:05:11 +00:00
Rhys Perry	207c35cbe8	aco: add Info::{operand_size,definition_size} No shader-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5040>	2020-06-10 15:05:11 +00:00
Rhys Perry	62ea429a99	aco: prefer 4-byte aligned definitions shader-db (Navi, fp16 enabled): Totals from 42 (0.03% of 127638) affected shaders: CodeSize: 811984 -> 806224 (-0.71%) Instrs: 155733 -> 155939 (+0.13%); split: -0.04%, +0.18% Cycles: 1982568 -> 1984400 (+0.09%); split: -0.06%, +0.15% VMEM: 7187 -> 7121 (-0.92%); split: +0.86%, -1.78% SMEM: 1770 -> 1769 (-0.06%) VClause: 1475 -> 1476 (+0.07%) Copies: 12406 -> 12606 (+1.61%); split: -0.46%, +2.07% Branches: 5901 -> 5900 (-0.02%); split: -0.25%, +0.24% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5040>	2020-06-10 15:05:11 +00:00
Rhys Perry	56345b8c61	aco: allow reading/writing upper halves/bytes when possible Use SDWA, opsel or a different opcode to achieve this. shader-db (Navi, fp16 enabled): Totals from 42 (0.03% of 127638) affected shaders: VGPRs: 3424 -> 3416 (-0.23%) CodeSize: 811124 -> 811984 (+0.11%); split: -0.12%, +0.23% Instrs: 156638 -> 155733 (-0.58%) Cycles: 1994180 -> 1982568 (-0.58%); split: -0.59%, +0.00% VMEM: 7019 -> 7187 (+2.39%); split: +3.45%, -1.05% SMEM: 1771 -> 1770 (-0.06%); split: +0.06%, -0.11% VClause: 1477 -> 1475 (-0.14%) Copies: 13216 -> 12406 (-6.13%) Branches: 5942 -> 5901 (-0.69%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5040>	2020-06-10 15:05:11 +00:00
Rhys Perry	98060ba0f0	aco: p_extract_vector in 64-bit u2f16/i2f16 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5040>	2020-06-10 15:05:11 +00:00
Rhys Perry	d9cfb8ad48	aco: validate instructions reading/writing upper halves/bytes Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5040>	2020-06-10 15:05:11 +00:00
Icecream95	3a1a40b443	panfrost: Add writes_stencil to the EARLY_Z disable list Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5065>	2020-06-10 13:54:03 +00:00
Icecream95	deaef1df15	pan/mdg: Print writeout sources in mir_print_instruction Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5065>	2020-06-10 13:54:03 +00:00
Icecream95	d37e901e35	pan/mdg: Add new depth store lowering This uses the new nir_intrinsic_store_combined_output_pan intrinsic, which can write depth, stencil and color in a single instruction. If there are no color writes, the "depth RT" is written to. Fixes the dEQP GLES3 depth write tests, as well as the piglit tests fragdepth_gles2, glsl-1.10-fragdepth and when modified to not rely on depth/stencil reload, glsl-fs-shader-stencil-export. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5065>	2020-06-10 13:54:03 +00:00
Icecream95	a68063402b	pan/mdg: Add depth/stencil support to emit_fragment_store Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5065>	2020-06-10 13:54:03 +00:00
Icecream95	7534a31a11	pan/mdg: Move search_var to earlier in midgard_compile.c It will be needed by the new zs lowering. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5065>	2020-06-10 13:54:03 +00:00
Icecream95	2f3d60c84b	pan/mdg: Add new depth writeout code We schedule depth writeout to smul and stencil to vlut, so scheduling to smul has to be disabled in these cases. When only writing stencil, scheduling to smul is still disabled to prevent stencil writeout from being scheduled there. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5065>	2020-06-10 13:54:03 +00:00
Icecream95	92d3f1fe59	pan/mdg: Replace writeout booleans with a single value A single value is easier to deal with than three separate booleans. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5065>	2020-06-10 13:54:03 +00:00
Icecream95	bcc8f28b1a	nir: Replace the zs_output_pan intrinsic with combined_output_pan Depth and stencil writes are combined with color writes, so we need this intrinsic which has sources for color, RT, depth and stencil. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5065>	2020-06-10 13:54:03 +00:00
Icecream95	2a5504fb92	pan/mdg: Remove writeout case from bytemask_of_read_components By setting the swizzle for the fragment color, and setting qmask to ~0 for branches, the special case for writeout branches can be removed from mir_bytemask_of_read_components_index. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5065>	2020-06-10 13:54:03 +00:00
Icecream95	8f36904bae	pan/mdg: Remove old depth writeout code We need to be able to do color writeout at the same time as depth writeout. The old code can't do that, so needs to be removed. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5065>	2020-06-10 13:54:03 +00:00
Icecream95	7da8667a7b	pan/mdg: Remove old zs store lowering It is broken for when there are also color writes, and will be replaced with a new lowering which takes that into account. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5065>	2020-06-10 13:54:03 +00:00
Icecream95	ddc2ae32cf	pan/mdg: Move r1.w writeout to branch->dest There will need to be sources for depth and stencil writeout, so something has to be moved to the dest of the writeout branch. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5065>	2020-06-10 13:54:03 +00:00
Icecream95	5f5a973709	pan/mdg: Add a macro for printing instruction source information Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5065>	2020-06-10 13:54:03 +00:00
Alyssa Rosenzweig	dc8bffe999	nir: Remove nir_intrinsic_output_u8_as_fp16_pan Now unused in favour of nir_intrinsic_load_output, happily. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5287>	2020-06-10 09:30:31 +00:00
Pierre-Eric Pelloux-Prayer	8275dc1ed5	ac/surface: fix epitch when modifying surf_pitch This is needed otherwise it can cause bad rendering of UYVY files. The align(..., 256 / surf->bpe) constraint comes from addrlib. Fixes: `69aadc4933` ("radeonsi: fix surf_pitch for subsampled surface") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5314>	2020-06-10 09:11:23 +00:00
Pierre-Eric Pelloux-Prayer	e9826a1bb2	ac/surface: set SCANOUT if surf->is_displayable Fixes: `ba10fb3f7f` ("radeonsi: preserve the scanout flag for shared resources on gfx9 and gfx10") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5314>	2020-06-10 09:11:23 +00:00
Erik Faye-Lund	10f07495f6	zink: only report device-local memory as video-memory While the definition of "video memory" isn't super clear, I think it's pretty reasonable to assume host-memory isn't meant to be included. So let's only count dedicated memory here. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3107 Reviewed-by: Witold Baryluk <witold.baryluk@gmail.com> Tested-by: Witold Baryluk <witold.baryluk@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5409>	2020-06-10 08:58:09 +00:00
Samuel Pitoiset	9b58c4958b	ac/nir: fix integer comparisons with pointers If we get a comparison between a pointer and an integer, LLVM complains if the operands aren't of the same type. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3085 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5397>	2020-06-10 08:18:22 +00:00
Pierre-Eric Pelloux-Prayer	24ceb6a594	radeonsi/ngg: try GS multi-cycling mode if default mode failed If gsprim_lds_size is larger than target_lds_size then gfx10_ngg_calculate_subgroup_info will fail. This commit adds a logic to try the multi-cycling in this case because it's using less memory. This fix glsl-1.50-gs-max-output when using NGG. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5401>	2020-06-10 09:33:58 +02:00
Pierre-Eric Pelloux-Prayer	ce7692fc19	radeonsi: add return value to gfx10_ngg_calculate_subgroup_info gfx10_ngg_calculate_subgroup_info uses assert to detect invalid configuration, but if asserts are disabled it will continue its execution. This commits adds a boolean return value to let the caller know that something went wrong and that the results mustn't be used. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3103 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5401>	2020-06-10 09:33:48 +02:00
Andrii Simiklit	2c711beb5c	glsl: fix crash on glsl macro redefinition In case shader contains two equal macro defines, first one with trailing spaces and the second one without. `#define A 1 ` `#define A 1` The parser crashes Fixes: `0346ad3774` ("glsl: ignore trailing whitespace when define redefined") Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5312>	2020-06-10 03:29:39 +00:00
Jason Ekstrand	0c37cbf807	anv/allocator: Compare to start_offset in state_pool_free_no_vg In `d11e4738a8`, we started using a start_offset to allow us to allocate pools where the base address isn't at the start of the pool. This is useful for binding table pools which want to be relative to surface state base address (more or less), among other things. However, we had a bug where, if you have a negative offset, everything returned to the pool would end up being returned to the "back" of the pool. This isn't what we want for binding tables in the softpin world. This was causing us to never actually re-use any binding table blocks. How this passed CTS, I have no idea. Closes: #3100 Fixes: `d11e4738a8` "anv/allocator: Add a start_offset to anv_state_pool" Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5395>	2020-06-09 22:52:26 +00:00

... 2 3 4 5 6 ...

124956 Commits All Branches Search

124956 Commits

All Branches