KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Marek Olšák	5ac6908263	r300,r600,radeonsi: read winsys_handle::stride,offset in drivers, not winsyses Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-09-09 23:43:03 -04:00
Marek Olšák	d95afd8b9e	radeonsi/gfx10: fix wave occupancy computations Cc: 19.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-09-09 23:43:03 -04:00
Marek Olšák	42ea0b7b52	radeonsi: only support at most 1024 threads per block LLVM 10 won't support 2048. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-09-09 23:43:03 -04:00
Marek Olšák	c1e08cb6d5	radeonsi: disable DCC when importing a texture from an incompatible driver and unify the code. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-09-09 23:43:03 -04:00
Marek Olšák	28adf0d00c	radeonsi/gfx10: don't call gfx10_destroy_query with compute-only contexts This fixes a crash. Cc: 19.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-09-09 23:43:03 -04:00
Marek Olšák	2f42d4cacc	radeonsi/gfx10: use fma for TGSI_OPCODE_FMA Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-09-09 23:43:03 -04:00
Marek Olšák	d64593e3c4	ac: use fma on gfx10 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-09-09 23:43:03 -04:00
Marek Olšák	d979e5bfab	ac: enable LLVM atomic optimizations	2019-09-09 23:43:03 -04:00
Lepton Wu	263136fb5d	virgl: Fix pipe_resource leaks under multi-sample. Fixes: `900a80f9e4` ("virgl: virgl_transfer should own its virgl_resource") Signed-off-by: Lepton Wu <lepton@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com>	2019-09-10 03:42:55 +00:00
Kenneth Graunke	410894c643	iris: Avoid flushing for cache history on transfer range flushes The VBO module maps a buffer with GL_MAP_FLUSH_EXPLICIT, and keeps appending data, and calling glFlushMappedBufferRange(). We were invalidating the VF cache each time it flushed a new range, which results in a ton of VF flushes. If the contents of the destination in the target range are undefined (never even possibly written), this patch makes us assume that it's likely not in the cache and so cache invalidations are required. If the destination range is defined, we continue cache flushing as we may need to expunge stale data. This eliminates 88% of the VF cache invalidates on Manhattan 3.0. Improves performance in Manhattan 3.0 on my Icelake 8x8 with the GPU frequency locked to 700Mhz by 0.376724% +/- 0.0989183% (n=10).	2019-09-09 15:08:22 -07:00
Kenneth Graunke	7d28e9ddd6	iris: Optimize out redundant sampler state binds This cuts roughly 85% of the 3DSTATE_SAMPLER_STATE_POINTERS_PS calls in the J2DBench images test. For some reason, the state tracker is calling bind_sampler_state with the same sampler state in a bunch of cases.	2019-09-09 11:55:27 -07:00
Kenneth Graunke	325e25d689	iris: Add support for the always_flush_cache=true debug option. This can be useful for debugging missing flushes.	2019-09-09 11:55:27 -07:00
Adam Jackson	366b2e5c19	mesa: Eliminate gl_config::rgbMode Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-09-09 14:12:57 -04:00
Adam Jackson	78e0fa6bb2	mesa: Eliminate gl_config::have{Accum,Depth,Stencil}Buffer Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-09-09 14:12:57 -04:00
Adam Jackson	c4990b7b19	mesa: Remove unused gl_config::indexBits Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-09-09 14:12:57 -04:00
Adam Jackson	04bef9a0a6	gallium/xlib: Fix an obvious thinko x == !GLX_DIRECT_COLOR is a fancy way of writing x == 0, which is clearly not what was meant.	2019-09-09 14:12:57 -04:00
Kenneth Graunke	9173459b95	iris: Ignore line stipple information if it's disabled The line stipple pattern and factor only matter if line stippling is actually enabled. Otherwise, we can safely ignore it. PBO upload may give us zero for line stipple information, while normal drawing tends to give us an actual stipple pattern such as 0xffff. This was causing us to flag IRIS_DIRTY_LINE_STIPPLE way too often, leading to useless 3DSTATE_LINE_STIPPLE commands, which are non-pipelined and thus very expensive. Improves performance in Manhattan 3.0 on Skylake GT4e by 0.149261% +/- 0.0380796% (n=210). On an Icelake 8x8 with the GPU frequency locked at 700Mhz, improves by 0.423756% +/- 0.222843% (n=3).	2019-09-09 10:55:20 -07:00
Vasily Khoruzhick	fbd5d9ebb5	lima/ppir: drop fge/flt/feq/fne options These are supposed to be lowered into sge/slt/seq/sne equivalents. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-09-09 10:25:30 -07:00
Vasily Khoruzhick	576341324d	lima: run opt_algebraic between int_to_float and boot_to_float for vs int_to_float emits ftrunc and ftrunc lowering generates bool ops. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-09-09 10:25:30 -07:00
Vasily Khoruzhick	996f1b6174	lima/gpir: fix warning in gpir disassembler Fixes following warning: ../src/gallium/drivers/lima/ir/gp/disasm.c: In function ‘print_src’: ../src/gallium/drivers/lima/ir/gp/disasm.c:241:20: warning: array subscript 28 is above array bounds of ‘char[5]’ [-Warray-bounds] 241 \| "xyzw"[src - gpir_codegen_src_attrib_x]); Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-09-09 10:25:30 -07:00
Vasily Khoruzhick	e6dbf6d948	lima/gpir: lower fceil GP doesn't support fceil so we need to lower it. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-09-09 10:25:30 -07:00
Connor Abbott	c64f30546d	lima/gpir: Disallow moves for schedule_first nodes The entire point of schedule_first is that the node has to be scheduled as soon as possible without any moves because it doesn't produce a proper floating-point value, or its value changes depending on where you read it. We were still introducing a move for preexp2 in some cases though, even if it got scheduled as soon as possible, which broke some exp() tests. Fix that. Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Tested-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-09-09 17:42:19 +07:00
Connor Abbott	8c7ad22adb	lima/gpir: Fix fake dep handling for schedule_first nodes The whole point of schedule_first nodes is that they need to be scheduled as soon as possible, so if a schedule_first node is the successor in a fake dependency that prevents it from being scheduled after its parent, that can cause problems. We need to add these fake dependencies to the parent as well, and we need to guarantee that the pre-RA scheduler puts schedule_first nodes right before their parents in order to prevent this from adding cycles to the dependency graph. Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Tested-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-09-09 17:42:00 +07:00
Connor Abbott	2955875381	lima/gpir: Fix schedule_first insertion logic The idea was to make sure schedule_first nodes were always first in the ready list. I made sure they were inserted first, but not that other nodes wouldn't later be scheduled ahead of them. Fixes spec@glsl-1.10@execution@built-in-functions@vs-exp-float and probably others. Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Tested-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-09-09 17:41:35 +07:00
Connor Abbott	63acdb5ce6	lima/gpir: Ignore unscheduled successors in can_use_complex() The point of the function is to avoid creating a complex move which is used by certain slots in the next instruction, but unscheduled successors will never be in the next instruction. Found while debugging a crash that the previous commit fixed. Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Tested-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-09-09 17:40:58 +07:00
Connor Abbott	ee8cc90e55	lima/gpir: Do all lowerings before rsched The scheduler assumes that load nodes are always duplicated so that they can always be scheduled eventually and therefore they never need to be spilled. But some lowerings were running after the pre-RA scheduler, whereas duplication has to happen before then since it's needed for the scheduler to do a better job reducing register pressure. This meant that lowerings were introducing multiple uses of a load instruction, which broke the scheduler's expectation and resulted in infinite loops in situations where the only nodes available to spill were load nodes. Spilling load nodes would be silly, so we want to fix the lowerings rather than the scheduler. Just do all lowerings before the pre-RA scheduler, which also helps with reducing pressure since the scheduler can more accurately compute the pressure. Fixes lima/mesa#104. Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Tested-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-09-09 17:39:20 +07:00
Mauro Rossi	ae5ac26dfa	android: anv: libmesa_vulkan_common: add libmesa_util static dependency Change needed to fix the following building error: In file included from external/mesa/src/intel/vulkan/anv_device.c:43: external/mesa/src/util/xmlpool.h:115:10: fatal error: 'xmlpool/options.h' file not found ^~~~~~~~~~~~~~~~~~~ 1 error generated. Fixes: `4dcb1ff` ("anv: add support for driconf") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch>	2019-09-08 20:07:56 +02:00
Boris Brezillon	3ce03374b3	panfrost: Rename pan_bo_cache.c into pan_bo.c So we can move all the BO logic into this file instead of having it spread over pan_resource.c, pan_drm.c and pan_bo_cache.c. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-09-08 16:24:54 +02:00
Boris Brezillon	14bfb0cb67	panfrost: Get rid of the now unused SLAB allocator The last users have been converted to use plain BOs. Let's get rid of this abstraction. We can always consider adding it back if we need it at some point. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-09-08 16:24:19 +02:00
Boris Brezillon	2c90045cf2	panfrost: Get rid of unused panfrost_context fields Some fields in panfrost_context are unused (probably leftovers from previous refactor). Let's get rid of them. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-09-08 16:23:34 +02:00
Boris Brezillon	76274bcb5e	panfrost: Convert ctx->{scratchpad, tiler_heap, tiler_dummy} to plain BOs ctx->{scratchpad,tiler_heap,tiler_dummy} are allocated using panfrost_drm_allocate_slab() but they never any of the SLAB-based allocation logic. Let's convert those fields to plain BOs. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-09-08 16:22:59 +02:00
Boris Brezillon	a2bba567ae	panfrost: Make transient allocation rely on the BO cache Right now, the transient memory allocator implements its own BO caching mechanism, which is not really needed since we already have a generic BO cache. Let's simplify things a bit. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-09-08 16:22:26 +02:00
Boris Brezillon	12d8a17957	panfrost: Stop passing a ctx to functions being passed a batch The context can be retrieved from batch->ctx. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-09-08 16:21:44 +02:00
Boris Brezillon	beb18c6172	panfrost: Pass a batch to panfrost_drm_submit_vs_fs_batch() Given the function name it makes more sense to pass it a job batch directly. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-09-08 16:20:59 +02:00
Boris Brezillon	2c526993bc	panfrost: s/job/batch/ What we currently call a job is actually a batch containing several jobs all attached to a rendering operation targeting a specific FBO. Let's rename structs, functions, variables and fields to reflect this fact. Suggested-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2019-09-08 16:19:56 +02:00
Heinrich Fink	3aa4f3a442	egl: Add GL_MESA_EGL_sync support This commit follow OES_EGL_sync to universially enable use of EGL sync objects with desktop OpenGL contexts. Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-09-08 08:01:55 +00:00
Heinrich Fink	8c933c9d96	headers: Add GL_MESA_EGL_sync token to GL Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-09-08 08:01:55 +00:00
Heinrich Fink	17470c4aaa	registry: update gl.xml with GL_MESA_EGL_sync token As added by upstream GL registry changes Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-09-08 08:01:55 +00:00
Heinrich Fink	f4327ce06e	specs: Add GL_MESA_EGL_sync Adds GL_MESA_EGL_sync as defined in upstream OpenGL registry Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-09-08 08:01:55 +00:00
Tapani Pälli	f83f9d7daa	android: fix linking issues with liblog Fixes Android build errors observed in Intel CI. Fixes: `f9f7cbc1aa` "util: android logging support" Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Rob Clark <robdclark@gmail.com>	2019-09-07 13:16:29 +03:00
Kenneth Graunke	dfb86405cf	iris: Support the disable_throttling=true driconf option.	2019-09-06 18:35:24 -07:00
Jason Ekstrand	c832820ce9	nir/dead_cf: Repair SSA if the pass makes progress The dead_cf pass calls into the CF manipulation helpers which attempt to keep NIR's SSA form sane. However, when the only break is removed from a loop, dominance gets messed up anyway because the CF SSA clean-up code only looks at phis and doesn't consider the case of code becoming unreachable. One solution to this would be to put the loop into LCSSA form before we modify any of its contents. Another (and the approach taken by this pass) is to just run the repair_ssa pass afterwards because the CF manipulation helpers are smart enough to keep all the use/def stuff sane; they just don't always preserve dominance properties. While we're here, we clean up some bogus indentation. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=111405 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=111069 Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-09-06 23:39:01 +00:00
Jason Ekstrand	1005272a2b	nir/repair_ssa: Insert deref casts when needed Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-09-06 23:39:01 +00:00
Jason Ekstrand	a3268599f3	nir/repair_ssa: Repair dominance for unreachable blocks NIR currently assumes that unreachable blocks are trivially dominated by everything. However, when considering well-formed SSA, there is no path from any block to an unreachable block. Therefore, we can break any use-def chains where the use is in an unreachable block. This removes any dependencies on code created by uses in unreachable blocks and lets DCE do a better job of cleaning it up. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-09-06 23:39:01 +00:00
Jason Ekstrand	f81a2623d8	nir: Add a block_is_unreachable helper Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-09-06 23:39:01 +00:00
Jason Ekstrand	517142252f	nir: Don't infinitely recurse in lower_ssa_defs_to_regs_block Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-09-06 23:39:01 +00:00
Jason Ekstrand	37cdb7fc44	nir: Handle complex derefs in nir_split_array_vars We already bail and don't split the vars but we were passing a NULL to _mesa_hash_table_search which is not allowed. Fixes: `f1cb3348f1` "nir/split_vars: Properly bail in the presence of ..." Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-09-06 23:39:01 +00:00
Jason Ekstrand	34541be7b0	intel/blorp: Use wide formats for nicely aligned stencil clears In the case where the stencil clear is nicely aligned, we can clear stencil much more efficiently by mapping it as a wide format (say RGBA32_UINT) and blasting out the stencil clear value with a repclear. On Unigine Heaven, this makes one stencil clear go from non-trivial to unnoticeable when looking at per-draw timings. In order for this change to work properly, ANV needs to do a bit more flushing around depth and stencil clears. i965 and iris already have the cache tracking logic to handle this so no changes are required there. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-09-06 23:35:09 +00:00
Jason Ekstrand	d62ca48c31	intel/blorp: Expose surf_fake_interleaved_msaa internally	2019-09-06 23:35:09 +00:00
Jason Ekstrand	caa786e029	intel/blorp: Expose surf_retile_w_to_y internally Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-09-06 23:35:09 +00:00

1 2 3 4 5 ...

115279 Commits All Branches Search

115279 Commits

All Branches