KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Tapani Pälli	630cbb45ac	anv: set depth stall enabled when depth flush enabled on gen12 This implements HW workaround #1409600907 for anv driver. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3378> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3378>	2020-01-16 14:05:54 +02:00
Tapani Pälli	3cec148455	iris: set depth stall enabled when depth flush enabled on gen12 This implements HW workaround #1409600907 for iris driver. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3378>	2020-01-16 14:05:54 +02:00
Lionel Landwerlin	308efbf2f3	anv: implement another workaround for non pipelined states Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3408> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3408>	2020-01-16 11:51:30 +02:00
Lionel Landwerlin	9eca823cce	iris: implement another workaround for non pipelined states v2: add comment (Ken) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3408>	2020-01-16 11:51:22 +02:00
Lionel Landwerlin	e6e5cbac04	iris: handle new PIPE_CONTROL field Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3408>	2020-01-16 11:48:11 +02:00
Lionel Landwerlin	31f0af5568	genxml: add new Gen11+ PIPE_CONTROL field PIPE_CONTROL gained a new field in its first DWORD on Gen11. We had no use for it so far, but we start using it on Gen12. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3408>	2020-01-16 11:48:04 +02:00
Kenneth Graunke	e3405f177b	st/mesa: Allocate full miplevels if MaxLevel is explicitly set Some applications explicitly call glTex[ture]Parameteri[v] to set GL_TEXTURE_MAX_LEVEL and GL_TEXTURE_BASE_LEVEL before uploading any texture data. Core Mesa initializes MaxLevel to 1000, so if it isn't that, we know they've set it. (We check for < TEXTURE_MAX_LEVELS to avoid hardcoding that value, however.) If MaxLevel - BaseLevel > 0, then the app is trying to tell us that this texture is going to have multiple miplevels. In that case, go ahead and allocate the space for it. Avoids many resource_copy_region calls at texture finalization time in the Civilization VI benchmark. Reviewed-by: Michel Dänzer <mdaenzer@redhat.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3401> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3401>	2020-01-16 00:06:54 -08:00
Samuel Pitoiset	68abc07317	aco: fix emitting SMEM instructions with no operands on GFX6-GFX7 Like s_memtime. Fixes dEQP-VK.glsl.shader_clock.* on GFX6. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3407> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3407>	2020-01-16 08:18:18 +01:00
Vasily Khoruzhick	e5226cff75	lima: fix handling of reverse depth range Looks like we need to handle cases when near > far and near == far. In first case we just need to swap near and far, and in second we need subtract epsilon from near if it's not zero. Fixes 10 tests in dEQP-GLES2.functional.depth_range.* Reviewed-by: Qiang Yu <yuq825@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3400> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3400>	2020-01-16 01:57:05 +00:00
Ilia Mirkin	784b84d308	nvc0: disable xfb's which don't have a stride No stride / no attributes means that nothing is being written to the buffer. However it might still prevent primitives from being written out to the other buffers. Disabling it entirely seems to fix it. Fixes GTF-GL45.gtf30.GL3Tests.transform_feedback.transform_feedback_overflow Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2020-01-15 19:53:18 -05:00
Erico Nunes	9bf210ba98	lima/ppir: implement full liveness analysis for regalloc The existing liveness analysis in ppir still ultimately relies on a single continuous live_in and live_out range per register and was observed to be the bottleneck for register allocation on complicated examples with several control flow blocks. The use of live_in and live_out ranges was fine before ppir got control flow, but now it ends up creating unnecessary interferences as live_in and live_out ranges may span across entire blocks after blocks get placed sequentially. This new liveness analysis implementation generates a set of live variables at each program point; before and after each instruction and beginning and end of each block. This is a global analysis and propagates the sets of live registers across blocks independently of their sequence. The resulting sets optimally represent all variables that cannot share a register at each program point, so can be directly translated as interferences to the register allocator. Special care has to be taken with non-ssa registers. In order to properly define their live range, their alive components also need to be tracked. Therefore ppir can't use simple bitsets to keep track of live registers. The algorithm uses an auxiliary set data structure to keep track of the live registers. The initial implementation used only trivial arrays, however regalloc execution time was then prohibitive (>1minute on Cortex-A53) on extreme benchmarks with hundreds of instructions, hundreds of registers and several spilling iterations, mostly due to the n^2 complexity to generate the interferences from the live sets. Since the live registers set are only a very sparse subset of all registers at each instruction, iterating only over this subset allows it to run very fast again (a couple of seconds for the same benchmark). Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3358> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3358>	2020-01-15 22:55:31 +00:00
Erico Nunes	7e2765fded	lima/ppir: remove orphan load node after cloning There are some cases in shades using control flow where the varying load is cloned to every block, and then the original node is left orphan. This is not harmful for program execution, but it complicates analysis for register allocation as there is now a case of writing to a register that is never read. While ppir doesn't have a dead code elimination pass for its own optimizations and it is not hard to detect when we cloned the last load, let's remove it early. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3358>	2020-01-15 22:55:31 +00:00
Kristian H. Kristensen	a3a73d116c	iris: Print warning and return *out = NULL when fd to syncobj fails Signed-off-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2020-01-15 14:47:46 -08:00
Kristian H. Kristensen	1ac138694b	iris: Advertise PIPE_CAP_NATIVE_FENCE_FD Enables EGL_ANDROID_native_fence_sync. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2020-01-15 14:47:46 -08:00
Kenneth Graunke	e9f9a944d3	iris: Fix export of fences that have already completed. After flushing batches, iris_fence_flush() asks the kernel whether each batch's last_syncpt has already signalled or not. (The idea is that either the compute or render batch may not have actually had any work queued up, so last_syncpt there might have been signalled a long time ago.) If it's already completed, we don't bother to record it. A strange corner is the case of repeated flushes. For example, we might flush for some reason, and hit a glFlush(), and hit SwapBuffers. It's possible for all the batches to have been flushed previously, -and- for them to have actually completed. In this case, we'll see that there are no syncobj's to wait on, and record fence->count == 0. This works fine internally - fence_finish can see count == 0 and realize that it doesn't need to wait, for example. But when working with native FDs, we may be asked to export a fence with count == 0. So we need an actual synchronization primitive we can hand off. Because all of the relevant batches had been signalled when creating the fence, we want the new dummy fence to be signalled as well. So we just make a signalled syncobj and export it. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2020-01-15 14:47:46 -08:00
Robert Foss	6b9fce5d9e	android: Fix whitespace issue Signed-off-by: Robert Foss <robert.foss@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2020-01-15 22:30:17 +00:00
Robert Foss	62adb6522b	panfrost: Prefix schedule_program to prevent collision Currently the schedule_program implementation being used is picked at compile time, which on the Android platform means that the bifrost compiler & scheduler is used for all targets, including midgard based hardware. This commit disambiguates between the two schedule_program functions. Signed-off-by: Robert Foss <robert.foss@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2020-01-15 22:30:17 +00:00
Marek Olšák	c4daf2b485	radeonsi: merge si_compile_llvm and si_llvm_compile functions Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3399> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3399>	2020-01-15 21:54:55 +00:00
Marek Olšák	68586bdd21	radeonsi: remove useless #includes Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3399>	2020-01-15 21:54:55 +00:00
Marek Olšák	30b14ba67e	radeonsi: move code for shader resources into si_shader_llvm_resources.c Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3399>	2020-01-15 21:54:55 +00:00
Marek Olšák	da2c12af4b	radeonsi: move geometry shader code into si_shader_llvm_gs.c Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3399>	2020-01-15 21:54:55 +00:00
Marek Olšák	57bd73e229	radeonsi: remove llvm_type_is_64bit Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3399>	2020-01-15 21:54:55 +00:00
Marek Olšák	194449a405	radeonsi: move tessellation shader code into si_shader_llvm_tess.c Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3399>	2020-01-15 21:54:55 +00:00
Marek Olšák	d7c86b106c	radeonsi: move si_insert_input_* functions Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3399>	2020-01-15 21:54:55 +00:00
Marek Olšák	8ff8e68e42	radeonsi: work around an LLVM crash when using llvm.amdgcn.icmp.i64.i1 Cc: 19.2 19.3 <mesa-stable@lists.freedesktop.org> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3338> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3338>	2020-01-15 20:17:23 +00:00
Marek Olšák	af3fbb410c	radeonsi: fix si_build_wrapper_function for compute-based primitive culling Fixes: `3b143369a5` "ac/nir, radv, radeonsi: Switch to using ac_shader_args" Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3338>	2020-01-15 20:17:23 +00:00
Marek Olšák	6d4993c942	radeonsi/gfx10: separate code for determining the number of vertices for NGG Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2020-01-15 15:06:34 -05:00
Marek Olšák	7a25521f92	radeonsi/gfx10: separate code for getting edgeflags from the gs_invocation_id VGPR Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2020-01-15 15:06:33 -05:00
Marek Olšák	cf65c6f0d2	radeonsi: move VS_STATE.LS_OUT_PATCH_SIZE a few bits higher to make space there Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2020-01-15 15:06:31 -05:00
Marek Olšák	34ef0c5083	radeonsi: make si_insert_input_* functions non-static Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2020-01-15 15:06:29 -05:00
Marek Olšák	eeb4a11c11	ac/cull: don't read Position.Z if it's not needed for culling It could be NULL. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2020-01-15 15:06:20 -05:00
Marek Olšák	8070402a30	radeonsi: separate code computing info for small primitive culling Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2020-01-15 14:59:11 -05:00
Kenneth Graunke	0a1c47074b	intel/compiler: Fix illegal mutation in get_nir_image_intrinsic_image get_nir_image_intrinsic_image() was incorrectly mutating the value held by the register which holds the intrinsic's first source (image index). If this happened to be the register for an SSA def which is also used elsewhere in the program, this meant that we would clobber that value in subsequent uses. Note that this only affects i965, because neither anv nor iris use the binding table start sections, so nothing is ever added here. Fixes KHR-GL46.compute_shader.resources-max on i965 with Eric Anholt's MR !3240 applied. That MR reorders SSBOs and ABOs, so that test uses image 0 and SSBO 0, causing this code to brilliantly add binding table index 45 to both the image (correct) and the SSBO (bzzt, wrong!). Fixes: `09f1de97a7` ("anv,i965: Lower away image derefs in the driver") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3404> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3404>	2020-01-15 19:25:35 +00:00
Rob Clark	b706a157c5	gitlab-ci: fix missing caselist.css/xsl My best guess is that this was broken by `d62dd8b0` Signed-off-by: Rob Clark <robdclark@chromium.org> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3413> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3413>	2020-01-15 19:03:56 +00:00
Jason Ekstrand	af6c2f4193	relnotes: Add Vulkan 1.2	2020-01-15 09:25:51 -06:00
Samuel Pitoiset	7f5462e349	radv: enable Vulkan 1.2 This bumps the Vulkan version to 1.2.128. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2020-01-15 08:42:25 -06:00
Samuel Pitoiset	68d6bead78	radv: implement Vulkan 1.2 features and properties Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2020-01-15 08:42:25 -06:00
Samuel Pitoiset	b3033198a8	radv: implement Vulkan 1.1 features and properties Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2020-01-15 08:42:25 -06:00
Samuel Pitoiset	a09ab76828	radv: update VK_KHR_timeline_semaphore for Vulkan 1.2 Promoted to Vulkan 1.2 with the KHR suffix omitted. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2020-01-15 08:42:25 -06:00
Samuel Pitoiset	fab0aa9182	radv: update VK_KHR_uniform_buffer_standard_layout for Vulkan 1.2 Promoted to Vulkan 1.2 with the KHR suffix omitted. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2020-01-15 08:42:25 -06:00
Samuel Pitoiset	3ff8d12458	radv: update VK_KHR_shader_subgroup_extended_types for Vulkan 1.2 Promoted to Vulkan 1.2 with the KHR suffix omitted. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2020-01-15 08:42:25 -06:00
Samuel Pitoiset	af25c8d57b	radv: update VK_KHR_shader_float_controls for Vulkan 1.2 Promoted to Vulkan 1.2 with the KHR suffix omitted. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2020-01-15 08:42:25 -06:00
Samuel Pitoiset	5335bb6c39	radv: update VK_KHR_shader_float16_int8 for Vulkan 1.2 Promoted to Vulkan 1.2 with the KHR suffix omitted. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2020-01-15 08:42:25 -06:00
Samuel Pitoiset	a73d01b1db	radv: update VK_KHR_shader_atomic_int64 for Vulkan 1.2 Promoted to Vulkan 1.2 with the KHR suffix omitted. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2020-01-15 08:42:25 -06:00
Samuel Pitoiset	83d1773a57	radv: update VK_KHR_imageless_framebuffer for Vulkan 1.2 Promoted to Vulkan 1.2 with the KHR suffix omitted. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2020-01-15 08:42:25 -06:00
Samuel Pitoiset	b3bdb4e6ff	radv: update VK_KHR_image_format_list for Vulkan 1.2 Promoted to Vulkan 1.2 with the KHR suffix omitted. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2020-01-15 08:42:25 -06:00
Samuel Pitoiset	a80229941f	radv: update VK_KHR_driver_properties for Vulkan 1.2 Promoted to Vulkan 1.2 with the KHR suffix omitted. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2020-01-15 08:42:25 -06:00
Samuel Pitoiset	af883bf3dc	radv: update VK_KHR_draw_indirect_count for Vulkan 1.2 Promoted to Vulkan 1.2 with the KHR suffix omitted. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2020-01-15 08:42:25 -06:00
Samuel Pitoiset	b537be4368	radv: update VK_KHR_depth_stencil_resolve for Vulkan 1.2 Promoted to Vulkan 1.2 with the KHR suffix omitted. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2020-01-15 08:42:25 -06:00
Samuel Pitoiset	5993f13b27	radv: update VK_KHR_create_renderpass2 for Vulkan 1.2 Promoted to Vulkan 1.2 with the KHR suffix omitted. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2020-01-15 08:42:25 -06:00

1 2 3 4 5 ...

119349 Commits All Branches Search

119349 Commits

All Branches