KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Bas Nieuwenhuizen	61a9ef4ab1	radv: Compute ac keys from pipeline key. The beginning of the end for the shader keys. Not entirely sure what I'm going to replace them with for the compiler though, so this is the first step. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-10-26 00:28:40 +02:00
Bas Nieuwenhuizen	49d035122e	radv: Add single pipeline cache key. To decouple the key used for info gathering and the cache from whatever we pass to the compiler. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-10-26 00:28:40 +02:00
Bas Nieuwenhuizen	de38491a57	radv: Don't compute as_ls/as_es before hashing. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-10-26 00:28:40 +02:00
Matthew Nicholls	27a0b24bf2	ac/nir: generate correct instruction for atomic min/max on unsigned images v2: fix silly typo Cc: "17.2 17.3" <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-25 20:52:58 +02:00
Samuel Pitoiset	9711979df0	radv: print NIR before LLVM IR and disassembly It's still printed after linking, but it makes more sense to have SPIRV->NIR->LLVM IR->ASM. Fixes: `f0a2bbd1a4` (radv: move nir print after linking is done) Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-25 11:46:53 +02:00
Bas Nieuwenhuizen	5bfbab2fdc	radv: Fix truncation issue hexifying the cache uuid for the disk cache. Going from binary to hex has a 2x blowup. Fixes: `1421625292` 'radv: create on-disk shader cache' Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-25 09:50:05 +02:00
Timothy Arceri	767ca5bdf1	radv: enable lower to scalar nir pass This will allow dead components of varyings to be removed. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-25 17:02:40 +11:00
Timothy Arceri	8ebaf8192a	ac: add support for explicit component packing This is needed for RADV to support explicit component packing. This is also required to use the new NIR component splitting / packing passes. V2: - add component packing support for interpolate_at* intrinsics - improve store packing support when not all varyings are scalar as spotted by Bas the store source was incorrectly offset. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-25 17:02:40 +11:00
Dave Airlie	d8cefaa197	radv: use device name in cache creation like radeonsi. Not sure how useful this is, but it makes it more consistent. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Cc: "17.3" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-10-25 02:26:01 +01:00
Dave Airlie	3cd3035ace	radv: use a define for the transition point between cp and compute shader For certain buffer meta ops we can use the CP or a compute shader, we should use a define rather than hardcoding 4096, allows for easier testing and more consistency. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-10-25 10:01:13 +10:00
Dave Airlie	a5499b639c	radv: only emit dfsm packets if dfsm is allowed. radeonsi only emits these when dfsm is enabled, so for now just hinge them on a flag we never set. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-10-24 23:00:57 +01:00
Marek Olšák	2a414c3961	radeonsi: postponed KILL isn't postponed anymore, but maintains WQM This restores performance for the drirc workaround, i.e. KILL_IF does: visible = src0 >= 0; kill_flag &= visible; // accumulate kills amdgcn_kill(wqm_vote(visible)); // kill fully dead quads only And all helper pixels are killed at the end of the shader: amdgcn_kill(kill_flag); Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-24 14:56:34 +02:00
Marek Olšák	478afbe525	ac: use llvm.amdgcn.kill with LLVM 6.0 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-24 14:56:34 +02:00
Marek Olšák	1ff9e27cbd	ac: replace ac_build_kill with ac_build_kill_if_false This will be a new LLVM intrinsic and will also work nicely with llvm.amdgcn.wqm.vote. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-24 14:56:34 +02:00
Timothy Arceri	f0a2bbd1a4	radv: move nir print after linking is done We now have linking optimisations so we want to delay dumping the nir until after these are complete. Fixes: `06f05040eb` (radv: Link shaders) Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-24 10:41:38 +11:00
Timothy Arceri	013313cf89	radv: clone meta shaders before linking The IR is reused in different pipeline combinations so we need to clone it to avoid link time optimisations messing up the original copy. Fixes: `06f05040eb` (radv: Link shaders) Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-24 09:27:40 +11:00
Alex Smith	fee9d05e21	radv: Update code pointer correctly if a variant is already created This was the actual cause of GPU hangs fixed by `0fdd531457` ("radv: Fix pipeline cache locking issues"), since multiple threads would end up trying to create the variants for a single entry. Now that we're locking around the whole of this function, this isn't really necessary (we either create all or none of the variants), but fix this anyway in case things change later. Signed-off-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> CC: 17.3 <mesa-stable@lists.freedesktop.org>	2017-10-23 22:36:54 +02:00
Eric Anholt	ba85525fce	ac: Silence a compiler warning about results[0]. We know that num_components will be > 0, but it doesn't. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-23 10:14:40 -07:00
Eric Anholt	34c04c734f	ac: Fix a compiler warning for possibly undefined "name" Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-23 10:14:40 -07:00
Nicolai Hähnle	f9ccfda9bc	amd/common/gfx9: workaround DCC corruption more conservatively Fixes KHR-GL45.texture_swizzle.smoke and others on Vega. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102809 Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-10-23 18:10:20 +02:00
Juan A. Suarez Romero	2665d012a8	radv: automake: include radv_extensions.py in the tarball Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-10-23 12:37:01 +02:00
Bas Nieuwenhuizen	a548b727a1	ac/nir: Only clamp shadow reference on radeonsi. Vulkan CTS does not expect the value to be clamped (at least for D32), and it makes a difference even though depth is in [0,1], due to strict inequalities. I couldn't find anything in the Vulkan spec about this, but the test seemed to be copied from GL tests and the GL spec only specifies clamping for fixed point formats. Hence I expect radeonsi to run into this at some point as well, but given that they still have a usecase with the Z16->Z32 promotion, I'll leave that for someone else to clean up. This at least fixes radv dEQP-VK.texture.shadow.* on VI. Fixes: `0f9e32519b` 'ac/nir: clamp shadow texture comparison value on VI' Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-23 09:13:38 +02:00
Bas Nieuwenhuizen	c07d719e8b	radv: Disallow indirect outputs for GS on GFX9 as well. Since it also uses the output vector before writing to memory. Fixes: `e38685cc62` 'Revert "radv: disable support for VEGA for now."' Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-10-23 00:27:44 +02:00
Bas Nieuwenhuizen	2c5b43c87f	ac/nir: Fix nir_texop_lod on GFX for 1D arrays. Fixes: `1bcb953e16` 'radv: handle GFX9 1D textures' Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-23 00:27:44 +02:00
Dave Airlie	da9c3cd3ee	radv/ac/nir: only emit tess factors to storage if tes reads them Otherwise we just need to write them to the tf ring. This seems to improve the tessellation demo on Bonaire ~2190->~2230 fps Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-10-23 07:10:29 +10:00
Bas Nieuwenhuizen	6ce550453f	radv: Don't use vgpr indexing for outputs on GFX9. Due to LLVM bugs. Fixes a bunch of dEQP-VK.glsl.indexing.* tests. Fixes: `e38685cc62` 'Revert "radv: disable support for VEGA for now."' Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-22 02:36:37 +02:00
Bas Nieuwenhuizen	ad727b96b6	ac/nir: Account for compact array index in GS input load from LDS. Mirrors the vram path. Fixes: `d4ecc3c929` 'ac/nir: Add loading from LDS for merged GS.' Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-21 22:29:40 +02:00
Bas Nieuwenhuizen	67648c0faa	radv: Don't compile shaders when they are cached already. When the gs_copy_shader is NULL (due to an incomplete cache), but the main shaders are found, we still do the nir, but we shouldn't compile the shaders again. For merged shaders we should also account for the missing shaders. Fixes: `ce03c119ce` 'radv: Add code to compile merged shaders.' Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-21 22:29:34 +02:00
Bas Nieuwenhuizen	3bf954b28e	radv: Don't check for max GL GS invocations. We specify 127 instead of 32 as the limit in vulkan. Fixes: `6bc42855f9` 'radv: enable GS on GFX9' Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-21 22:29:09 +02:00
Bas Nieuwenhuizen	050f7e2df2	radv: Don't explicitly reference vertex shader for draw_id. With merged shaders the vertex shader may not exist. This got in because the offending patch was written before merged shaders were upstream, but committed after. Fixes: `75dfab24a2` 'radv: refactor indirect draws with radv_draw_info' Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-10-21 20:00:22 +02:00
Bas Nieuwenhuizen	20fb15bfe4	radv: Don't reset cmd_buffer->state.dirty. Otherwise for non-indexed draws we set and immediately unset RADV_CMD_DIRTY_INDEX_BUFFER. As all the set functions should clear their own bit, this is unnecessary. Fixes: `341529dbee` 'radv: use optimal packet order for draws' Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-10-21 20:00:16 +02:00
Bas Nieuwenhuizen	fb55477990	radv: Correctly detect changed shaders for vertex descriptors. As they were emitted after the new pipeline, the changed pipeline detection was not working anymore. Fixes: `341529dbee` 'radv: use optimal packet order for draws' Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-10-21 19:59:44 +02:00
Bas Nieuwenhuizen	24fe4e6143	ac/nir: Set larger workgroup size for GS on GFX9. They don't take a single wave anymore and we need the barriers. Fixes: `6bc42855f9` 'radv: enable GS on GFX9' Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-21 12:46:44 +02:00
Bas Nieuwenhuizen	9e82f2b3ea	ac/nir: Take the max workgroup size of all provided shaders. Fixes: `ffaf4d608a` 'radv: Enable tessellation shaders for GFX9.' Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-21 12:46:28 +02:00
Alex Smith	0fdd531457	radv: Fix pipeline cache locking issues Need to lock around the whole process of retrieving cached shaders, and around GetPipelineCacheData. This fixes GPU hangs observed when creating multiple pipelines in parallel, which appeared to be due to invalid shader code being pulled from the cache. Signed-off-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-21 03:52:43 +02:00
Andres Rodriguez	a2c6fbb3ee	radv: disable implicit sync for radv allocated bos v3 Implicit sync kicks in when a buffer is used by two different amdgpu contexts simultaneously. Jobs that use explicit synchronization mechanisms end up needlessly waiting to be scheduled for long periods of time in order to achieve serialized execution. This patch disables implicit synchronization for all radv allocations except for wsi bos. The only systems that require implicit synchronization are DRI2/3 and PRIME. v2: mark wsi bos as RADV_MEM_IMPLICIT_SYNC v3: Add drm version check (Bas) Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-21 01:15:54 +02:00
Andres Rodriguez	eff2bdbd82	radv: factor out radv_alloc_memory This allows us to pass extra parameters to the memory allocation operation that are not defined in the vulkan spec. This is useful for internal usage. Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-21 01:15:49 +02:00
Andres Rodriguez	92724338ba	radv: Expose VK_EXT_global_priority Expose the extension string as supported Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-21 01:01:44 +02:00
Andres Rodriguez	9f7edf4d1f	radv: don't skip PS/VS partial flush This patch helps lower high priority compute latency. Found by bisecting a perf regression on computeparticles with high priority compute queues enabled. Reverting this micro-optimization doesn't seem to have any negative effect on performance on Dota2 or ssao. Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-21 01:01:44 +02:00
Andres Rodriguez	fd04f3eb86	radv: Implement VK_EXT_global_priority This extension allows the caller to change a queue's system wide priority. This is useful for applications with specific latency constraints. Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-21 01:01:44 +02:00
Andres Rodriguez	986c4b0bd4	radv: hardcode shader WAVE_LIMIT to the maximum value When WAVE_LIMIT is set, a submission will opt-in for SPI based resource scheduling. Because this mechanism is cooperative, we must ensure that all submissions have this field set, otherwise they will bypass resource arbitration. We always hardcode the field to its maximum value, instead of attempting to calculate an approximate usage. In testing, there were no benefits to using anything other than the maximum. Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-21 01:01:44 +02:00
Jason Ekstrand	59fb59ad54	nir: Get rid of nir_shader::stage It's redundant with nir_shader::info::stage. Acked-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2017-10-20 12:49:17 -07:00
Samuel Pitoiset	341529dbee	radv: use optimal packet order for draws Ported from RadeonSI. The time where shaders are idle should be shorter now. This can give a little boost, like +6% with the dynamicubo Vulkan demo. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-20 20:07:53 +02:00
Samuel Pitoiset	af6985b309	radv: add radv_emit_shaders_prefetch() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-20 20:07:53 +02:00
Samuel Pitoiset	0d85f4a9e2	radv: add radv_emit_shader_prefetch() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-20 20:07:53 +02:00
Fredrik Höglund	e2053b8e3d	radv: don't flush the VS when srcStageMask == TOP_OF_PIPE_BIT The Vulkan specification says: "... an execution dependency with only VK_PIPELINE_STAGE_TOP_OF_PIPE_BIT in the source stage mask will effectively not wait for any prior commands to complete." Signed-off-by: Fredrik Höglund <fredrik@kde.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-10-20 11:37:51 +02:00
Samuel Pitoiset	565c22158f	radv: mark total_count as MAYBE_UNUSED in CmdSet{Viewport,Scissor} Fixes two compilation warnings in release build. Trivial. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-10-20 11:22:19 +02:00
Samuel Pitoiset	c8f2b73656	radv: rename radv_cmd_buffer_flush_state() to radv_draw() Similar to the dispatch codepath. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-20 11:20:16 +02:00
Samuel Pitoiset	9e45e5c9fd	radv: emit primitive restart from radv_emit_draw_registers() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-20 11:20:14 +02:00
Samuel Pitoiset	93207a8e89	radv: add radv_emit_draw_registers() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-20 11:20:12 +02:00


1483 Commits
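
Commit `5bfbab2fdc` above notes that going from binary to hex has a 2x blowup. The following is a minimal sketch of that sizing rule, not the radv code: `bytes_to_hex` is a hypothetical helper introduced here for illustration, and the point it shows is that the destination buffer needs two characters per input byte plus one for the terminating NUL.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (an assumption, not taken from Mesa): write `len`
 * bytes from `in` as lowercase hex into `out`.
 * `out` must hold at least 2 * len + 1 chars: two hex digits per byte
 * plus the terminating NUL. A buffer sized for `len` alone would
 * truncate the string. */
static void bytes_to_hex(char *out, size_t out_size,
                         const uint8_t *in, size_t len)
{
    static const char digits[] = "0123456789abcdef";

    /* Guard against the undersized-buffer mistake. */
    assert(out_size >= 2 * len + 1);

    for (size_t i = 0; i < len; i++) {
        out[2 * i]     = digits[in[i] >> 4];   /* high nibble */
        out[2 * i + 1] = digits[in[i] & 0xf];  /* low nibble */
    }
    out[2 * len] = '\0';
}
```

For a 16-byte UUID, a caller would declare the destination as `char hex[2 * 16 + 1]` and pass `sizeof(hex)` as `out_size`.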