mirrors/mesa - Frog Git

Commit Graph

Author	SHA1	Message	Date
Kenneth Graunke	66b4a7a79e	i965: Call gen6_upload_push_constants() even when the stage is disabled. This properly sets stage_state->push_constant_dirty = true, so that we emit 3DSTATE_CONSTANT_XS to disable the constant buffer for the shader stage. It also sets stage_state->push_const_size = 0. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-10-24 16:14:04 -07:00
Kenneth Graunke	16096e9119	i965: Drop a bunch of downcasting and upcasting of gl_program pointers. We have a gl_program and we want a gl_program. There's no point in converting to brw_program and back again. This probably made more sense in the old days before Tim dropped a layer of subclassing. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-10-24 16:14:02 -07:00
Kenneth Graunke	90ed2a10bb	i965: Move _mesa_shader_write_subroutine_indices down a level. Now we call it in one place instead of making every caller do it. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-10-24 16:13:59 -07:00
Dave Airlie	a5499b639c	radv: only emit dfsm packets if dfsm is allowed. radeonsi only emits these when dfsm is enabled, so for now just hinge them on a flag we never set. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-10-24 23:00:57 +01:00
Rob Clark	4aa69cc425	meson: build freedreno Mostly copy/pasta from Dylan Baker's conversion of nouveau and i965. Signed-off-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2017-10-24 15:33:40 -04:00
Rob Clark	2207af032b	meson: extract out variable for nir_algebraic.py Also needed in freedreno/ir3. Signed-off-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2017-10-24 15:33:40 -04:00
Rob Clark	0ca8d53215	freedreno/ir3: use a flag instead of setting PYTHONPATH Similar to `848da66222`, pass an arg to ir3_nir_trig.py to add to python path, rather than using $PYTHONPATH, to prep for meson build support. Signed-off-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2017-10-24 15:33:40 -04:00
Kenneth Graunke	583ce96c94	i965: Don't disable CCS for RT dependencies when dispatching compute. Compute shaders don't have access to the framebuffer, so there's no point in worrying whether a texture is bound as a render target. This saves a bunch of resolves in GFXBench4 Manhattan 3.1, but doesn't seem to impact performance at all, at least on Apollolake. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2017-10-24 11:31:33 -07:00
Eric Anholt	e91c3540fc	i965: Fix memmem compiler warnings. gcc is throwing this warning in my meson build: ../src/intel/compiler/brw_eu_validate.c:50:11: warning: argument 1 null where non-null expected [-Wnonnull] return memmem(haystack.str, haystack.len, needle.str, needle.len) != NULL; The first check for CONTAINS has a NULL error_msg.str and 0 len. The glibc implementation will exit without looking at any haystack bytes if haystack.len < needle.len, so this was safe, but silence the warning anyway by guarding against implementation variability. Fixes: `122ef3799d` ("i965: Only insert error message if not already present") Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-10-24 10:51:18 -07:00
Rob Clark	eed9685dd6	freedreno: per-context fd_pipe To enable per-context priorities, we need to have per-context pipes. Unfortunately we still need to keep the global screen pipe, mostly just for screen->get_timestamp(). Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-10-24 12:56:51 -04:00
Rob Clark	9c32333a58	freedreno: rename pipe -> vsc_pipe To add context priority support we need to have an fd_pipe per context, rather than per-screen. This conflicts with the existing ctx->pipe (which is actually a visibility stream pipe, a hw resource). So just rename it. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-10-24 12:56:51 -04:00
Rob Clark	7e7096307a	freedreno: pass context flags through to fd_context_init() Prep work for later patch. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-10-24 12:56:51 -04:00
Brian Paul	7a6c6e73a8	gallium/util: use util_snprintf() in u_socket_connect() Instead of plain snprintf(). To fix the MSVC build. snprintf() is used in various places in Mesa/gallium, but apparently, not in code built with MSVC. Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com>	2017-10-24 08:17:15 -06:00
Benjamin Gordon	de3555f834	configure: Allow android as an EGL platform I'm working on radeonsi support in the Chrome OS Android container (ARC++). Mesa in ARC++ uses autotools instead of Android.mk, but all the necessary EGL bits are there, so the existing check is too strict. Signed-off-by: Benjamin Gordon <bmgordon@chromium.org> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com>	2017-10-24 14:46:22 +01:00
Marek Olšák	2a414c3961	radeonsi: postponed KILL isn't postponed anymore, but maintains WQM This restores performance for the drirc workaround, i.e. KILL_IF does: visible = src0 >= 0; kill_flag &= visible; // accumulate kills amdgcn_kill(wqm_vote(visible)); // kill fully dead quads only And all helper pixels are killed at the end of the shader: amdgcn_kill(kill_flag); Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-24 14:56:34 +02:00
Marek Olšák	da0083f123	radeonsi: use postponed KILL only when derivatives are used Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-24 14:56:34 +02:00
Marek Olšák	478afbe525	ac: use llvm.amdgcn.kill with LLVM 6.0 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-24 14:56:34 +02:00
Marek Olšák	1ff9e27cbd	ac: replace ac_build_kill with ac_build_kill_if_false This will be a new LLVM intrinsic and will also work nicely with llvm.amdgcn.wqm.vote. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-24 14:56:34 +02:00
Timothy Arceri	f0a2bbd1a4	radv: move nir print after linking is done We now have linking optimisations so we want to delay dumping the nir until after these are complete. Fixes: `06f05040eb` (radv: Link shaders) Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-24 10:41:38 +11:00
Dave Airlie	11d688d9f0	mesa/bufferobj: don't double negate the range This fixes a regression I introduced refactoring this code, I managed to invert range twice, I moved the inversion into the common code, but forgot to stop doing it in the callee. Fixes: GL45-CTS.multi_bind.dispatch_bind_buffers_base Fixes: `35ac13ed3` (mesa/bufferobj: consolidate some codepaths between ubo/ssbo/atomics.) Reported-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-10-24 08:40:23 +10:00
Timothy Arceri	013313cf89	radv: clone meta shaders before linking The IR is reused in different pipeline combinations so we need to clone it to avoid link time optimisations messing up the original copy. Fixes: `06f05040eb` (radv: Link shaders) Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-24 09:27:40 +11:00
Brian Paul	069211f205	gallium/util: don't call close() on Windows in u_tests.c Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-23 15:10:44 -06:00
Brian Paul	5134c0dedf	mesa: use util_strdup() macro in u_debug_symbol.c Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-23 15:10:38 -06:00
Brian Paul	89372220b3	mesa: use util_strdup() macro in symbol_table.c Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-23 15:10:32 -06:00
Brian Paul	acd6ea0cc0	util: add util_strdup() wrapper macro To work around MSVC warning that strdup() is a deprecated POSIX function. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-23 15:10:24 -06:00
Brian Paul	6230773936	gallium/util: replace gethostbyname() with getaddrinfo() Compiling with MSVC options /we4995 /we4996 (a subset of /sdl) generates a warning that the gethostbyname() function is deprecated in favor of getaddrinfo() or GetAddrInfoW(). Replace the call with getaddrinfo(). Untested. There are no callers to u_socket_connect() in Gallium. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-23 15:10:01 -06:00
Alex Smith	fee9d05e21	radv: Update code pointer correctly if a variant is already created This was the actual cause of GPU hangs fixed by `0fdd531457` ("radv: Fix pipeline cache locking issues"), since multiple threads would end up trying to create the variants for a single entry. Now that we're locking around the whole of this function, this isn't really necessary (we either create all or none of the variants), but fix this anyway in case things change later. Signed-off-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> CC: 17.3 <mesa-stable@lists.freedesktop.org>	2017-10-23 22:36:54 +02:00
Kenneth Graunke	013d331220	i965: Revert absolute mode for constant buffer pointers. The kernel doesn't initialize the value of the INSTPM or CS_DEBUG_MODE2 registers at context initialization time. Instead, they're inherited from whatever happened to be running on the GPU prior to first run of a new context. So, when we started setting these, other contexts in the system started inheriting our values. Since this controls whether 3DSTATE_CONSTANT_* takes a pointer or an offset, getting the wrong setting is fatal for almost any process which isn't expecting this. Unfortunately, VA-API and Beignet don't initialize this (nor does older Mesa), so they will die horribly if we start doing this. UXA and SNA don't use any push constants, so they are unaffected. Until we have some kind of solution to this problem, I'm going to revert this patch and abandon using the feature for now. It will lead to fewer pushed UBO ranges on Broadwell+, which may lead to lower performance, though I don't have any data on the impact. Cc: "17.3 17.2" <mesa-stable@lists.freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102774	2017-10-23 12:03:22 -07:00
Dylan Baker	d4567efa5c	meson: build imx driver Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com>	2017-10-23 11:45:55 -07:00
Dylan Baker	51558a1d6c	meson: build etnaviv driver + winsys Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com>	2017-10-23 11:45:38 -07:00
Eric Anholt	ba85525fce	ac: Silence a compiler warning about results[0]. We know that num_components will be > 0, but it doesn't. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-23 10:14:40 -07:00
Eric Anholt	34c04c734f	ac: Fix a compiler warning for possibly undefined "name" Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-23 10:14:40 -07:00
Dylan Baker	77f7ef0287	meson: fix egl build for meson version < 0.43 Meson 0.43 added the ability to pass nested lists to include_directories, so the code that we have works for 0.43, but not for 0.42. This patch changes the include_directories list to be flat so it works with 0.42. Fixes: `108d257a16` ("meson: build libEGL") Tested-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Rhys Kidd <rhyskidd@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com>	2017-10-23 10:14:40 -07:00
Nicolai Hähnle	f9ccfda9bc	amd/common/gfx9: workaround DCC corruption more conservatively Fixes KHR-GL45.texture_swizzle.smoke and others on Vega. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102809 Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-10-23 18:10:20 +02:00
Emil Velikov	a90b4329df	docs/release-calendar: update - 17.3.0-rc1 is out Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2017-10-23 14:31:15 +01:00
Ilia Mirkin	4d24a7cb97	glsl: fix derived cs variables There are two issues with the current implementation. First, it relies on the layout(local_size_*) happening in the same shader as the main function, and secondly it doesn't work for variable group sizes. In both cases, the simplest fix is to move the setup of these derived values to a later time, similar to how the gl_VertexID workarounds are done. There already exist system values defined for both of the derived values, so we use them unconditionally, and lower them after linking is performed. While we're at it, we move to using gl_LocalGroupSizeARB instead of gl_WorkGroupSize for variable group sizes. Also the dead code elimination avoidance can be removed, since there can be situations where gl_LocalGroupSizeARB is needed but has not been inserted for the shader with main function. As a result, the lowering code has to insert its own copies of the system values if needed. Reported-by: Stephane Chevigny <stephane.chevigny@polymtl.ca> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=103393 Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-10-23 08:34:56 -04:00
Emil Velikov	4302df8c8e	docs: add 17.4.0-devel release notes template Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2017-10-23 13:07:06 +01:00
Emil Velikov	c85d64cf67	mesa: bump version to 17.4.0-devel Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2017-10-23 13:00:43 +01:00
Juan A. Suarez Romero	2665d012a8	radv: automake: include radv_extensions.py in the tarball Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-10-23 12:37:01 +02:00
Bas Nieuwenhuizen	a548b727a1	ac/nir: Only clamp shadow reference on radeonsi. Vulkan CTS does not expect the value to be clamped (at least for D32), and it makes a difference even though depth is in [0,1], due to strict inequalities. I couldn't find anything in the Vulkan spec about this, but the test seemed to be copied from GL tests and the GL spec only specifies clamping for fixed point formats. Hence I expect radeonsi to run into this at some point as well, but given that they still have a usecase with the Z16->Z32 promotion, I'll leave that for someone else to clean up. This at least fixes radv dEQP-VK.texture.shadow.* on VI. Fixes: `0f9e32519b` 'ac/nir: clamp shadow texture comparison value on VI' Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-23 09:13:38 +02:00
Bas Nieuwenhuizen	c07d719e8b	radv: Disallow indirect outputs for GS on GFX9 as well. Since it also uses the output vector before writing to memory. Fixes: `e38685cc62` 'Revert "radv: disable support for VEGA for now."' Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-10-23 00:27:44 +02:00
Bas Nieuwenhuizen	2c5b43c87f	ac/nir: Fix nir_texop_lod on GFX for 1D arrays. Fixes: `1bcb953e16` 'radv: handle GFX9 1D textures' Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-23 00:27:44 +02:00
Dave Airlie	da9c3cd3ee	radv/ac/nir: only emit tess factors to storage if tes reads them Otherwise we just need to write them to the tf ring. This seems to improve the tessellation demo on Bonaire ~2190->~2230 fps Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-10-23 07:10:29 +10:00
Bas Nieuwenhuizen	6ce550453f	radv: Don't use vgpr indexing for outputs on GFX9. Due to LLVM bugs. Fixes a bunch of dEQP-VK.glsl.indexing.* tests. Fixes: `e38685cc62` 'Revert "radv: disable support for VEGA for now."' Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-22 02:36:37 +02:00
Bas Nieuwenhuizen	ad727b96b6	ac/nir: Account for compact array index in GS input load from LDS. Mirrors the vram path. Fixes: `d4ecc3c929` 'ac/nir: Add loading from LDS for merged GS.' Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-21 22:29:40 +02:00
Bas Nieuwenhuizen	67648c0faa	radv: Don't compile shaders when they are cached already. When the gs_copy_shader is NULL (due to an incomplete cache), but the main shaders are found, we still do the nir, but we shouldn't compile the shaders again. For merged shaders we should also account for the missing shaders. Fixes: `ce03c119ce` 'radv: Add code to compile merged shaders.' Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-21 22:29:34 +02:00
Bas Nieuwenhuizen	3bf954b28e	radv: Don't check for max GL GS invocations. We specify 127 instead of 32 as the limit in vulkan. Fixes: `6bc42855f9` 'radv: enable GS on GFX9' Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-21 22:29:09 +02:00
Bas Nieuwenhuizen	050f7e2df2	radv: Don't explicitly reference vertex shader for draw_id. With merged shaders the vertex shader may not exist. This got in because the offending patch was written before merged shaders were upstream, but committed after. Fixes: `75dfab24a2` 'radv: refactor indirect draws with radv_draw_info' Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-10-21 20:00:22 +02:00
Bas Nieuwenhuizen	20fb15bfe4	radv: Don't reset cmd_buffer->state.dirty. Otherwise for non-indexed draws we set and immediately unset RADV_CMD_DIRTY_INDEX_BUFFER. As all the set functions should clear their own bit, this is unnecessary. Fixes: `341529dbee` 'radv: use optimal packet order for draws' Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-10-21 20:00:16 +02:00
Bas Nieuwenhuizen	fb55477990	radv: Correctly detect changed shaders for vertex descriptors. As they were emitted after the new pipeline, the changed pipeline detection was not working anymore. Fixes: `341529dbee` 'radv: use optimal packet order for draws' Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-10-21 19:59:44 +02:00
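
The KILL_IF sequence quoted in commit 2a414c3961 can be traced with a small model. The following is only a sketch in Python, not the radeonsi or LLVM code: lanes are modelled as list entries, a quad is assumed to be 4 consecutive lanes, and the helper names (`wqm_vote`, `kill`, `run_shader`) are hypothetical stand-ins for the intrinsics named in the commit message. It assumes that the quad vote is true for every lane of a quad when any lane in that quad is true, and that a kill stops the lanes whose condition is false.

```python
from typing import List, Tuple

QUAD = 4  # assumed number of lanes per quad in this model


def wqm_vote(visible: List[bool]) -> List[bool]:
    """Model of a whole-quad vote: True for all lanes of a quad if any lane is True."""
    out: List[bool] = []
    for i in range(0, len(visible), QUAD):
        quad = visible[i:i + QUAD]
        out.extend([any(quad)] * len(quad))
    return out


def kill(alive: List[bool], keep: List[bool]) -> List[bool]:
    """Model of a kill: lanes whose condition is False stop executing."""
    return [a and k for a, k in zip(alive, keep)]


def run_shader(kill_if_inputs: List[List[float]]) -> Tuple[List[bool], List[bool]]:
    """Apply each KILL_IF in turn, then the final kill at the end of the shader.

    Returns (lanes alive before the final kill, lanes alive after it).
    """
    n = len(kill_if_inputs[0])
    alive = [True] * n
    kill_flag = [True] * n
    for src0 in kill_if_inputs:
        visible = [s >= 0 for s in src0]
        # Accumulate kills per lane.
        kill_flag = [f and v for f, v in zip(kill_flag, visible)]
        # Kill fully dead quads only; partially dead quads keep all lanes.
        alive = kill(alive, wqm_vote(visible))
    before_final = alive
    # End of shader: remaining killed lanes (the helper pixels) are dropped.
    after_final = kill(alive, kill_flag)
    return before_final, after_final


# Two quads: the first has one killed pixel, the second is fully killed.
before, after = run_shader([[1.0, -1.0, 1.0, 1.0, -1.0, -1.0, -1.0, -1.0]])
print(before)
print(after)
```

In this model the killed pixel in the first quad keeps running as a helper lane until the end of the shader, so its neighbours can still compute derivatives, while the fully dead second quad stops immediately.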


96980 Commits
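
Commit 0ca8d53215 describes passing an argument to a generator script so that it extends the Python path itself instead of relying on $PYTHONPATH. A minimal sketch of that pattern follows. The flag name `--import-path`, the function names, and the example directory are assumptions chosen for illustration; they are not taken from ir3_nir_trig.py.

```python
import argparse
import sys
from typing import List


def parse_args(argv: List[str]) -> argparse.Namespace:
    """Parse a hypothetical import-path flag from the given argument list."""
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "-p", "--import-path", required=True,
        help="directory to add to the module search path (assumed flag name)",
    )
    return parser.parse_args(argv)


def add_import_path(argv: List[str]) -> str:
    """Prepend the requested directory to sys.path and return it."""
    args = parse_args(argv)
    if args.import_path in sys.path:
        sys.path.remove(args.import_path)
    # Put the directory first so modules there are found before any others.
    sys.path.insert(0, args.import_path)
    return args.import_path


# The build system would pass the directory on the command line; a
# hypothetical directory is used here so the sketch runs on its own.
added = add_import_path(["-p", "/tmp/hypothetical_nir_dir"])
print(added)
```

The point of the design is that the build system only has to pass a plain command-line argument, which is easier to express portably than setting an environment variable for a single command.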