KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Samuel Pitoiset	6391f9ab4c	aco: fix nir_intrinsic_quad_* with 8-bit in GFX6-GFX7 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5327>	2020-06-05 16:04:06 +02:00
Samuel Pitoiset	e1523b34c2	aco: fix sign-extend 8-bit subgroup operations on GFX6-GFX7 SDWA is GFX8+. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5327>	2020-06-05 16:04:05 +02:00
Samuel Pitoiset	ee4bc13de2	aco: use v_bfe_u32 for unsigned reductions sign-extension on GFX6-GFX7 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5327>	2020-06-05 16:04:03 +02:00
Eric Engestrom	a874132cc4	intel/genxml: drop sort_xml.sh and move the loop directly in gen_sort_tags.py Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5353>	2020-06-05 13:44:18 +00:00
Bas Nieuwenhuizen	c67ef7695a	radv: Use ac_surface to allocate aux surfaces. For consistency and a bunch of codesharing. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5194>	2020-06-05 13:27:55 +00:00
Bas Nieuwenhuizen	63db31fdfc	amd/common: Add total alignment calculation. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5194>	2020-06-05 13:27:55 +00:00
Bas Nieuwenhuizen	f70b577683	radv: Allocate values/predicates at the end of the image. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5194>	2020-06-05 13:27:55 +00:00
Bas Nieuwenhuizen	ec671e8718	radv: Disable HTILE in ac_surface. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5194>	2020-06-05 13:27:55 +00:00
Bas Nieuwenhuizen	f84b4e2639	radv: Disable DCC in ac_surface. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5194>	2020-06-05 13:27:55 +00:00
Bas Nieuwenhuizen	81dee6cf8f	radv: Use offsets in surface struct. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5194>	2020-06-05 13:27:55 +00:00
Bas Nieuwenhuizen	ffae3589c9	radv: Rely on ac_surface for avoiding cmask for linear images. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5194>	2020-06-05 13:27:55 +00:00
Bas Nieuwenhuizen	b5488a863c	radv: Enforce the contiguous memory for DCC layers in ac_surface. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5194>	2020-06-05 13:27:55 +00:00
Bas Nieuwenhuizen	d3db633f6d	radv: Pass no_metadata_planes info in to ac_surface. Also do not allocate aux surfaces for multi-plane images. I may have messed up and used plane 1 offsets for the other planes as well. I cannot imagine that sharing aux surfaces between the planes will work well. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5194>	2020-06-05 13:27:55 +00:00
Bas Nieuwenhuizen	599ea341dd	radv: Use ac_surface to determine fmask enable. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5194>	2020-06-05 13:27:55 +00:00
Rob Clark	ef5b8bbc5e	freedreno/computerator: fix missing dependency on generated header Fixes: ``` ../mesa-freedreno-20.2.0_pre/src/freedreno/computerator/ir3_asm.c:25:10: fatal error: 'ir3/ir3_parser.h' file not found #include "ir3/ir3_parser.h" ^~~~~~~~~~~~~~~~~~ 1 error generated. ``` Fixes: `da467817e3` ("freedreno/ir3: Move ir3 assembler to backend compiler") Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5340>	2020-06-05 09:48:47 +00:00
Eric Engestrom	7a68045b5d	glapi: remove deprecated .getchildren() that has been replace with an iterator Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3086 Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Vinson Lee <vlee@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5342>	2020-06-05 09:16:13 +00:00
Samuel Pitoiset	c9a9b363ce	radv/aco: enable 64-bit atomic features if RADV is linked with LLVM 8 Just in case someone links RADV with this old LLVM 8 and wants ACO. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5331>	2020-06-05 07:40:29 +00:00
Neha Bhende	ba37d408da	svga: Performance fixes This is a squash commit of in house performance fixes and misc bug fixes for GL4.1 support. Performance fixes: * started using system memory for constant buffer to gain 3X performance boost with metro redux Misc bug fixes: * fixed usage of vertexid in shader * added empty control point phase in hull shader for zero ouput control point * misc shader signature fixes * fixed clip_distance input declaration * clearing the dirty bit for the surface while using direct map if surface is already flushed and there is no pending primitive This patch also uses SVGA_RETRY macro for commands retries. Part of it is already used in previous patch. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com> Signed-off-by: Neha Bhende <bhenden@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5317>	2020-06-05 06:36:54 +00:00
Neha Bhende	ccb4ea5a43	svga: Add GL4.1(compatibility profile) support in svga driver This patch is a squash commit of a very long in-house patch series. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com> Signed-off-by: Neha Bhende <bhenden@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5317>	2020-06-05 06:36:54 +00:00
Neha Bhende	52ce25be87	svga/include: Headers for GL4.1 support This brings in the new types, enums and #defines for GL 4.1 features in the virtual device. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com> Signed-off-by: Neha Bhende <bhenden@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5317>	2020-06-05 06:36:54 +00:00
Neha Bhende	dc3505f87e	winsys/drm: Add GL4.1 support in drm winsys This is to check whether virtual hardware has SM5 support Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com> Signed-off-by: Neha Bhende <bhenden@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5317>	2020-06-05 06:36:54 +00:00
Neha Bhende	48a7456f4d	util: Add util functionality for GL4.1 support This patch adds the following tgsi utilities * tgsi_dynamic_indexing: This utility flattens out the dyanamic indexing of constant buffers * tgsi_vpos: This utility writes zeros to position at index 0 in vertex shader. This utility can be used if there is no shader output in vertex shader * util_make_tess_ctrl_passthrough_shader: This adds passthough tessellation control shader. Input of passthrough tess ctrl shader is output of vertex shader and output is input of tessellation eval shader. If program has tessellation eval shader but no tessellation control shader, this utility can be used to create passthrough tessellation control shader. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com> Signed-off-by: Neha Bhende <bhenden@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5317>	2020-06-05 06:36:54 +00:00
Rob Clark	f1f81abfd4	freedreno/a6xx: more early-z Technically we only have to do late-z in the alpha-test or discard case if depth-write is enabled. If depth write is disabled, the depth read / test / conditional-write interlock that we need to emulate is not a problem, so we can still use early-z test. There is a slightly weird case when there is no zsbuf attachment (see dEQP-GLES31.functional.fbo.no_attachments.*) where the hw wants us to use LATE_Z.. not entirely sure if this is an interaction with occlusion query or just a pecularity of how the hw works when there is no depth buffer. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5336>	2020-06-05 00:57:44 +00:00
Eric Anholt	ec98cff6a9	turnip: Simplify vertex buffer bindings. We were remapping the bindings so the HW binding points were consecutive, which there's no need for. Now that we don't shuffle, we can mostly drop the dependency on the pipeline for this SDS. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5321>	2020-06-04 19:42:54 +00:00
Eric Anholt	5c9728d960	turnip: Don't bother clamping VB size. From the VK spec: "All elements of pOffsets must be less than the size of the corresponding element in pBuffers" Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5321>	2020-06-04 19:42:54 +00:00
Eric Anholt	52942f18c6	turnip: Move vertex buffer bindings to SET_DRAW_STATE. This means that the HW can skip over the vertex buffer state when it's not used in a bin. The blob also has this behavior. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5321>	2020-06-04 19:42:54 +00:00
Dave Airlie	c8c7450fc7	llvmpipe: move coroutines out of noopt case the virgl CI code was using the noopt path and crashing with a wierd can't select llvm.coro.subfn.addr error, turns out we have to call the cleanup pass no matter what. This enable a lot more virgl gles31 passes, but we have to disable tessellation shaders as now they executed, they crash due to missing OES_gpu_shader5, I should try and reenable them when llvmpipe is further along Fixes: `d32690b43c` ("gallivm: add coroutine pass manager support") Reviewed-by: Roland Scheidegger <sroland@vmware.com> Acked-by: Elie Tournier <elie.tournier@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5320>	2020-06-04 19:08:34 +00:00
Alyssa Rosenzweig	2d1688345a	pan/mdg: Ensure ld_vary_16 is aligned Otherwise packing may fail. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Fixes: `5f8dd413bc` ("pan/mdg: Handle 16-bit ld_vary") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5339>	2020-06-04 17:46:45 +00:00
Kristian H. Kristensen	de8be1de13	freedreno/a6xx: Fix VFD_CONTROL emit The FETCH_CNT field isn't actually the FETCH count. We don't have a lot of data where it's different from DECODE_CNT, so there's not much to go by. It could be number of VFD_DEST_CNTL or maybe DECODE_CNT for binning. For now, setting both to number of DEST_CNTL gets Google Earth working again. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5324>	2020-06-04 15:50:41 +00:00
Clément Guérin	202252566b	radv: Always expose non-visible local memory type on dedicated GPUs DOOM Eternal expects this type, but RADV doesn't expose it when the VRAM is entirely host-visible, in my case on Fiji. Matches AMDVLK behavior. Closes: https://gitlab.freedesktop.org/mesa/mesa/issues/3054 Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5308>	2020-06-04 15:16:30 +00:00
Alyssa Rosenzweig	622e3a8510	pan/mdg: Legalize inverts with constants We need to force src_invert to be in the right place even if we flip when lowering an embedded->inline constant. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Fixes: `449e5ded93` ("pan/mdg: Treat inot as a modifier") Reported-by: Icecream95 <ixn@keemail.me> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5299>	2020-06-04 13:17:11 +00:00
Erik Faye-Lund	e61a98877c	nir: reuse existing psiz-variable For shaders where there's already a psiz-variable, we should rather reuse it than create a second one. This can happen if a shader writes gl_PointSize, but disables GL_PROGRAM_POINT_SIZE. Fixes: `878c94288a` ("nir: add lowering-pass for point-size mov") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5328>	2020-06-04 09:12:54 +00:00
Lionel Landwerlin	57e4d0aa1c	i965: fix export of GEM handles We reuse DRM file descriptors internally. Therefore when we export a GEM handle we must do so in the file descriptor used externally. v2: Fix dmabuf leak Fix GEM handle leaks by tracking exported handles v3: Check os_same_file_description error (Michel) Don't create multiple exports for a given GEM table v4: Add WARN_ONCE (Ken) v5: Remove blank line (Ian) Remove unused field (Ian) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2882 Fixes: `4094558e86` ("i965: share buffer managers across screens") Tested-by: Eric Engestrom <eric@engestrom.ch> Tested-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4861>	2020-06-04 07:31:38 +00:00
Lionel Landwerlin	aba3aed96e	iris: fix export of GEM handles We reuse DRM file descriptors internally. Therefore when we export a GEM handle we must do so in the file descriptor used externally. This change also fixes a file descriptor leak of the FD given at screen creation. v2: Don't bother checking fd equals, they're always different Fix dmabuf leak Fix GEM handle leaks by tracking exported handles v3: Check os_same_file_description error (Michel) Don't create multiple exports for a given GEM table v4: Add WARN_ONCE (Ken) Rename external_fd to winsys_fd v5: Remove export lock in favor of bufmgr's Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2882 Fixes: `7557f16059` ("iris: share buffer managers accross screens") Tested-by: Eric Engestrom <eric@engestrom.ch> Tested-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4861>	2020-06-04 07:31:38 +00:00
Lionel Landwerlin	e41e820648	i965: don't forget to set screen on duped image We'll start using this field more for querying image properties. Without it we run into a crash. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4861>	2020-06-04 07:31:38 +00:00
Lionel Landwerlin	604a86e46f	iris: fix BO destruction in error path Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: <mesa-stable@lists.freedesktop.org> Tested-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4861>	2020-06-04 07:31:38 +00:00
Vinson Lee	c3025bde19	mesa: Fix NetBSD compiler macro. Reported-by: Rafał Mikrut <mikrutrafal54@gmail.com> Fixes: `a63b90712a` ("mesa: also check for __NetBSD__") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3015 Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5191>	2020-06-03 21:09:54 -07:00
Rob Clark	e9cda38031	freedreno/a6xx: also consider alpha-test for ztest-mode Looks like we don't have CI coverage for this (since deqp==GLES) but alpha test is conceptually the same as frag shaders with discard, and should be handled as such. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5298>	2020-06-04 02:34:54 +00:00
Rob Clark	1e3731e711	freedreno/a6xx: add early-lrz-late-z mode Now that we are doing a better job of managing LRZ, add support for the EARLY_LRZ_LATE_Z mode. Since we properly disable LRZ write in cases where we don't know a fragment's z value during the binning pass (or when blend is enabled in a later draw, meaning we will need the earlier fragment's color), we can enable a mode that keeps the early-lrz test when the frag shader has kill/discard. This will only discard geometry that is definitely not visible. This is a pretty big win for games/benchmarks that have a lot of frag shaders with kill/discard. More than 10% gain for gfxbench trex/mh and 40% gain for mh31. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5298>	2020-06-04 02:34:54 +00:00
Rob Clark	07887c9f34	freedreno/a6xx: re-work LRZ state tracking In particular, properly detect reversal of depth-test direction. With that we can remove a lot of cases where we were unnecessarily invalidating LRZ, which was simply papering over the direction- reversal issue in deqp. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5298>	2020-06-04 02:34:54 +00:00
Rob Clark	27e501bcfc	freedreno/a6xx: update depth-plane control regs And document the early-lrz-late-z mode. Initially I thought this would be two bits to control early-lrz vs early-z. But having early-z without early-lrz does not make sense, and the way the values line up makes an enum fit better. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5298>	2020-06-04 02:34:54 +00:00
Rob Clark	f6307426ed	freedreno/a6xx: sync registers from envytools Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5298>	2020-06-04 02:34:54 +00:00
Rob Clark	ebcf3545db	freedreno/ir3: split kill from no_earlyz Unlike other conditions which prevent early-discard of fragments, kill does not prevent early LRZ test. Split `has_kill` from `no_earlyz` so we can take advantage of this. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5298>	2020-06-04 02:34:54 +00:00
Kristian H. Kristensen	5fb7cad95c	freedreno/a6xx: Turn on robustness extensions With UBO access going through LDC, all memory access uses buffer based io primitives. We can then advertise PIPE_CAP_ROBUST_BUFFER_ACCESS_BEHAVIOR and PIPE_CAP_DEVICE_RESET_STATUS_QUERY, which turn on GL_EXT_robustness, GL_KHR_robust_buffer_access_behavior and GL_KHR_robustness. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5319>	2020-06-04 00:56:20 +00:00
Vinson Lee	8b353524b0	vdpau: Fix wrong calloc sizeof argument. Fix warning reported by Coverity Scan. Wrong sizeof argument (SIZEOF_MISMATCH) suspicious_sizeof: Passing argument 3544UL (sizeof (vlVdpPresentationQueue)) to function calloc that returns a pointer of type vlVdpPresentationQueueTarget * is suspicious because a multiple of sizeof (vlVdpPresentationQueueTarget) /16/ is expected. Fixes: `65fe0866ae` ("vl: implemented a few functions and made stubs to get mplayer running") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3026 Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5182>	2020-06-03 17:01:47 -07:00
Francisco Jerez	8252bb0ec6	OPTIONAL: iris: Perform BLORP buffer barriers outside of iris_blorp_exec() hook. The iris_blorp_exec() hook needs to be executed under a single indivisible sync region, which means that in cases where we need to emit a PIPE_CONTROL for a buffer barrier we won't be able to track the subsequent commands separately from the previous commands, which will prevent us from optimizing out subsequent PIPE_CONTROLs if we encounter the same buffers again. In particular I've encountered this situation in some SynMark test-cases which perform lots of BLORP operations with the same buffer bound as both source and destination (in order to generate mipmaps): In such a scenario if the source requires flushing we'd also end up flushing for the destination redundantly, even though a single PIPE_CONTROL would have been sufficient. This avoids a 4.5% FPS regression in SynMark OglHdrBloom and a 3.5% FPS regression in SynMark OglMultithread. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	4b00338bde	iris: Remove iris_flush_depth_and_render_caches(). This helper is unused now. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	46adb83a29	iris: Emit single render target flush PIPE_CONTROL on format mismatch. The big-hammer iris_flush_depth_and_render_caches() is largely redundant whenever a format mismatch is detected from iris_cache_flush_for_render(). There is no need to kick the depth, sampler nor constant caches in that case. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	b928188493	iris: Open-code iris_cache_flush_for_read() and iris_cache_flush_for_depth(). These have become one-liners now so they can be easily inlined. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	74c774dce9	iris: Remove render cache hash table-based synchronization. The render cache hash table is now mostly redundant with the more general seqno matrix-based cache tracking mechanism. Most hash table operations are now gone except for the format mismatch checks done in iris_cache_flush_for_render(). Redundant code removed as a separate patch for bisectability. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	aa78d05a23	iris: Remove depth cache set tracking and synchronization. The depth cache set is now redundant with the more general seqno matrix-based cache tracking mechanism. Removed as a separate patch for bisectability. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	6b98072511	iris: Perform compute predraw flushes from compute batch. Whenever iris_predraw_resolve_inputs() ends up doing a flush or invalidate, we really want it to be on the same batch which is going to consume the result. Any resolves should still be performed from the render batch thanks to the previous patch. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	8e8198f349	iris: Remove batch argument of iris_resource_prepare_access() and friends. The resolves performed by this function are only expected to work from the render batch, so make sure we use it independently of the batch the caller wants to use. This function provides no synchronization guarantees anyway, the caller is expected to insert any cache flushing and synchronization required for the resolved surface to be visible to the target batch. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	878c770d13	iris: Insert buffer barrier in existing cache flush helpers. As a first step to phasing out the current hashtable-based depth and render cache tracking mechanisms. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	e226590898	iris: Implement buffer-local memory barrier based on cache coherency matrix. This takes advantage of the previously introduced cache tracking infrastructure in order to define a multi-purpose barrier operation that allows the caller to order memory operations with respect to previous operations performed on the same buffer from any other cache domain. v2: Assorted CPU overhead micro-optimizations (Francisco). v3: Use C99 designated initializers (Ken). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	8a6349eb86	iris: Update cache coherency matrix on PIPE_CONTROL. This introduces a batch synchronization boundary at every PIPE_CONTROL command, and updates the cache coherency status tracked during batch construction according to the specified control bits. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	fc221875cf	iris: Introduce cache coherency matrix for batch-local memory ordering. This introduces a representation of the cache coherency status of the GPU at any point in the batch. This is done by defining a matrix C of synchronization sequence numbers such that at any point of batch construction, a memory operation from domain i introduced into the batch is guaranteed to be ordered after any memory operation from domain j in a previous batch section with seqno n if the following condition holds: C_i_j >= n This allows us to efficiently determine whether additional flushing and/or invalidation is required in order to access a buffer object from some arbitrary domain. Except for batch buffer reset which requires clearing the whole matrix, all operations on the matrix are either O(n) or O(1) on the number of caching domains (which is basically constant). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	4b7fd91be6	iris: Report use of any in-flight buffers on first draw call after sync boundary. This is the main performance trade-off of this cache tracking mechanism: In order for the seqno vector of buffer objects to be accurate, they need to be marked as used again every time the batch is split into a new synchronization section if they remain bound to the pipeline. This can be achieved easily by re-using iris_restore_render_saved_bos() and iris_restore_compute_saved_bos(), which currently serve a similar purpose across batch buffer boundaries. The impact on Piglit drawoverhead results seems to be within a standard deviation of the current results. XXX - It might be possible to completely remove the current iris_batch::contains_draw flag at a small additional performance cost. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	ae88e79f69	iris: Drop redundant iris_address::write flag. The write flag is redundant since it can be inferred easily from the iris_address::access domain. This allows the iris_address struct to be laid out more efficiently in memory, leading to a measurable improvement in several Piglit Drawoverhead test-cases. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	eb5d1c2722	iris: Annotate all BO uses with domain and sequence number information. Probably the most annoying patch to review from the whole series -- Mark every buffer object use as accessed through some caching domain with the sequence number of the current synchronization section of the batch. The additional argument of iris_use_pinned_bo() makes sure I'd have gotten a compile error if I had missed any buffer added to the batch validation list. There are only a few exceptions where a buffer is left untracked while adding it to the validation list, justified below: - Batch buffers: These are strictly read-only for the moment. - BLORP buffer objects: Their seqnos are bumped manually at the end of iris_blorp_exec() instead, in order to avoid plumbing domain information through BLORP address combining. - Scratch buffers: The contents of these are strictly thread-local. - Shader images and SSBOs: Accesses of these buffers are explicitly synchronized at the API level. v2: Opt out of tracking more aggressively (Ken): In addition to the above, surface states, binding tables, instructions and most dynamic states are now left untracked, which means a lot more BO uses marked IRIS_DOMAIN_NONE which need to be reviewed extremely carefully, since the cache tracker won't be able to provide any coherency guarantees for them. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	e81c07de41	iris: Bracket batch operations which access memory within sync regions. This delimits all batch operations which access memory between iris_batch_sync_region_start() and iris_batch_sync_region_end() calls. This makes sure that any buffer objects accessed within the region are considered in use through the same caching domain until the end of the region. Adding any buffer to the batch validation list outside of a sync region will lead to an assertion failure in a future commit, unless the caller explicitly opted out of the cache tracking mechanism. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	8cbe953548	iris: Add infrastructure to partition batch into sync boundaries. This introduces some minimalistic infrastructure which will be used in order to partition the batch into a series of sections, each one with a unique, monotonically-increasing sequence number. Section boundaries will typically lie at points in the batch where the execution and memory coherency status of some previous commands are known, e.g. at batch buffer boundaries or PIPE_CONTROL commands. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	7878cbec59	iris: Add batch-local synchronization book-keeping to iris_bo. The purpose of this is to represent the cache coherency state of a buffer as a vector of integers (AKA seqnos), one for each incoherent caching domain of the GPU. A seqno will identify a single section of a batch buffer uniquely across the whole pipe_screen (which means that there will be no ambiguity about what context a given seqno belongs to even if there are multiple threads accessing the same buffer in parallel), and is guaranteed to be allocated in monotonically increasing order within any given context. The iris_bo_bump_seqno() helper is provided for marking the last update of a buffer from a given caching domain in a lockless manner. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Alyssa Rosenzweig	b73b339531	panfrost: Mark point sprites as todo on Bifrost Emulating them will be a rather annoying dance. Let's not worry about this until further down the line when we have a better sence of how to do handle them efficiently. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5290>	2020-06-03 22:58:46 +00:00
Alyssa Rosenzweig	0ef527928c	panfrost: Fix gl_PointSize out of GL_POINTS In this case, vs->writes_point_size is true as the VS writes gl_PointSize, but panfrost_writes_points_size() is false as we are not drawing points so the hardware doesn't process it. Thus the varying descriptor is emitted but elements is never written. When the VS runs, it will attempt to write to elements, a NULL pointer. The behaviour is architecture-independent. On Midgard, the write silently fails, hence why this bug was never noticed before. On Bifrost, this raises an MMU fault. The fix is to set the format to VARYING_DISCARD to ignore the write. Noticed on Neverball. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5290>	2020-06-03 22:58:46 +00:00
Alyssa Rosenzweig	3f8abd8676	panfrost: Prefer sysval for gl_PointCoord on Bifrost It's like gl_FragCoord. Still not implemented. This unfortunately makes point sprites a lot more complicated. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5290>	2020-06-03 22:58:46 +00:00
Alyssa Rosenzweig	bc7397f376	pan/bi: Disassemble gl_PointCoord reads. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5290>	2020-06-03 22:58:46 +00:00
Alyssa Rosenzweig	3e4a0c2bca	panfrost: Explicitly convert to 32-bit for logic-ops Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reported-by: Icecream95 <ixn@keemail.me> Fixes: `19b4e586f6` ("panfrost: Switch to pan_lower_framebuffer") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5289>	2020-06-03 22:48:10 +00:00
Alyssa Rosenzweig	6d00eaf733	panfrost: Readd MIDGARD_SHADERLESS quirk to t760 Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reported-by: Icecream95 <ixn@keemail.me> Fixes: `e53d27de61` ("panfrost: Add quirks for blend shader types") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5289>	2020-06-03 22:48:10 +00:00
Francisco Jerez	46183a999b	iris: Extend iris_context dirty state flags to 128 bits. We're nearly out of dirty bits, and some patches pending review on GitLab no longer apply due to that. Make room for them by splitting off shader stage-specific bits into a separate stage_dirty mask. An alternative would be to split compute-related bits into a separate mask, but that would prevent the '<< stage' indexing done in various parts of the driver from working. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5279>	2020-06-03 22:22:19 +00:00
Francisco Jerez	45918e0d8c	iris: Simplify iris_batch_prepare_noop(). This makes iris_batch_prepare_noop() return a boolean instead of passing through the relevant set of dirty flags. It will make it easier to change the representation of dirty flags. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5279>	2020-06-03 22:22:19 +00:00
Rob Clark	26a3c7b363	nir/lower_tex: fixes for fp16 yuv lowering Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3079 Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5318>	2020-06-03 21:24:13 +00:00
Rob Clark	0f3255ef0a	nir/builder: add bitsize conversion helpers Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5318>	2020-06-03 21:24:13 +00:00
Rob Clark	866618c5c8	nir: extract out convert_to_bitsize() helper Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5318>	2020-06-03 21:24:13 +00:00
Rob Clark	924bfb6560	nir: get_base_type() should return enum type Needed by the next patch, for c++ code which is more strict about conversions between integers and enums. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5318>	2020-06-03 21:24:12 +00:00
Alyssa Rosenzweig	dce7722ef8	panfrost: Handle writes_memory correctly We need to pass it thru to EARLY_Z and WRITES_GLOBAL instead of ignoring and assuming respectively. Nontrivial performance fix. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5300>	2020-06-03 20:48:24 +00:00
Alyssa Rosenzweig	2447b3b9d3	panfrost: Document MALI_WRITES_GLOBAL bit We've been setting this unconditionally -- oops! Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5300>	2020-06-03 20:48:24 +00:00
Alyssa Rosenzweig	ee59d1ad77	panfrost: Update MALI_EARLY_Z description Via the ES3.1 early-z testing force, I've confirmed this bit is e-z. I've also confirmed e-z must be disabled for global writes, as expected. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5300>	2020-06-03 20:48:24 +00:00
Marcin Ślusarz	7e26a02e5f	iris: remove unused iris_bo->swizzle_mode Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5313>	2020-06-03 18:38:00 +00:00
Samuel Pitoiset	77f08982af	aco: sign-extend input/identity for 16-bit subgroup ops on GFX6-GFX7 16-bit subgroup ops are implemented with 32-bit instructions on GFX6-GFX7. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5227>	2020-06-03 19:48:43 +02:00
Samuel Pitoiset	f31c9b4edf	aco: fix subdword copies on GFX6-GFX7 SDWA is only GFX8+. Use v_mov_b32 since the upper 16 bits don't matter. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5227>	2020-06-03 19:48:42 +02:00
Samuel Pitoiset	a521c67d22	aco: implement 16-bit nir_intrinsic_quad_* on GFX6-GFX7 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5227>	2020-06-03 19:48:40 +02:00
Samuel Pitoiset	6b08d269bf	aco: implement 16-bit reduce operations on GFX6-GFX7 No fp16 on GFX6-GFX7. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5227>	2020-06-03 19:48:37 +02:00
Alyssa Rosenzweig	0e73d879e3	pan/bi: Handle vectorized load_const In preparation for 16-bit vectors. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5307>	2020-06-03 17:35:10 +00:00
Alyssa Rosenzweig	1b09c6993d	pan/bi: Passthrough second argument of F32_TO_F16 At the NIR level this is a second vector source of the first (only) argument; at the BIR level this is a pair of scalars. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5307>	2020-06-03 17:35:10 +00:00
Alyssa Rosenzweig	8a4efe2d73	pan/bi: Pack second argument of F32_TO_F16 Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5307>	2020-06-03 17:35:10 +00:00
Alyssa Rosenzweig	323eecaf13	pan/bi: Fix SEL.16 swizzle 2 scalar arguments, not 1 vector. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5307>	2020-06-03 17:35:10 +00:00
Alyssa Rosenzweig	9ed1ae4724	pan/bi: Handle SEL with vec3 16-bit Otherwise we end up with a missing argument. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5307>	2020-06-03 17:35:10 +00:00
Alyssa Rosenzweig	afc18c62d7	panfrost: Passthrough NATIVE loads/stores Now that we handle load_output directly, this works for e.g. RGB565 on Midgard. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5285>	2020-06-03 15:36:57 +00:00
Alyssa Rosenzweig	36af05bbde	pan/mdg: Handle regular nir_intrinsic_load_output Instead of the vendored version. Only for blend shaders at the moment, frag shaders fb_fetch has a lot more going on. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5285>	2020-06-03 15:36:57 +00:00
Alyssa Rosenzweig	293d37e19d	pan/mdg: Allow f2u8 and friends thru Now that we can handle destination sizes directly, this keeps us from needing to chew through so many conversions. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5285>	2020-06-03 15:36:57 +00:00
Alyssa Rosenzweig	0ae0141f5b	pan/mdg: Handle f2u8 This is similar to f2u16. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5285>	2020-06-03 15:36:57 +00:00
Alyssa Rosenzweig	f8b881f161	pan/mdg: Fold roundmode into applicable instructions Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5285>	2020-06-03 15:36:57 +00:00
Alyssa Rosenzweig	93513cd9ff	pan/mdg: Implement *_rtz conversions with roundmode Use rte as the canonical type. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5285>	2020-06-03 15:36:57 +00:00
Alyssa Rosenzweig	6290e83190	pan/mdg: Lower roundmodes So now we can use the IR field semantically. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5285>	2020-06-03 15:36:57 +00:00
Alyssa Rosenzweig	1bef784867	pan/mdg: Add opcode roundmode property When the output is rounded in a specified direction. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5285>	2020-06-03 15:36:57 +00:00
Alyssa Rosenzweig	2eb4c85e42	pan/mdg: Add roundmode enum Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5285>	2020-06-03 15:36:57 +00:00
Alyssa Rosenzweig	014d2e46a7	pan/mdg: Distinguish blend shaders in internal shader-db Since these shaders are purely internal, the optimization criteria are a bit different, so it's worth calling attention to this when dumping. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5285>	2020-06-03 15:36:57 +00:00
Icecream95	99446c9f7d	panfrost: Only use AFBC YTR with RGB and RGBA The "lossless colorspace transform" is lossy for R and RG formats. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5293>	2020-06-03 15:19:43 +00:00
Icecream95	9ac106defe	panfrost: Decode AFBC flag bits Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5293>	2020-06-03 15:19:43 +00:00
Timothy Arceri	a34cc97ca3	glsl: when NIR linker enable use it to resize uniform arrays Here we turn on uniform array resizing in the NIR linker and disable the GLSL IR resizing pass when the NIR linker is enabled. This will potentially make uniform arrays smaller due to NIR optimising away more uniform uses. Shader-db results (SKL): total instructions in shared programs: 14947192 -> 14944093 (-0.02%) instructions in affected programs: 138088 -> 134989 (-2.24%) helped: 822 HURT: 4 total cycles in shared programs: 324868402 -> 324794597 (-0.02%) cycles in affected programs: 3904170 -> 3830365 (-1.89%) helped: 2333 HURT: 1485 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4910>	2020-06-03 10:34:22 +00:00
Timothy Arceri	7d1eadb790	glsl: gather uniform dereference info before main linking loop We want to gather information for all stages here before the main linking loop. In the following patch we will use to information to reduce the size of uniform arrays where possible. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4910>	2020-06-03 10:34:22 +00:00
Timothy Arceri	a13d8d48ce	glsl: add update_array_sizes() helper to the NIR uniform linker This will be used to reduce the size of uniform arrays and replace the current glsl ir pass. Doing this in NIR allows us to better optimise the size of uniform arrays. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4910>	2020-06-03 10:34:22 +00:00
Timothy Arceri	6aea287b0a	glsl: add struct to gather more info about uniform array access This will be used in the following patches to allow the linker to resize uniform arrays based on array dereferences. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4910>	2020-06-03 10:34:22 +00:00
Timothy Arceri	d6d78f9b7f	util: add BITSET_LAST_BIT() helper This is the reverse of BITSET_FFS() Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4910>	2020-06-03 10:34:22 +00:00
Timothy Arceri	f518508a81	i965: call brw_nir_lower_uniforms() after uniform linking is complete i965 currently uses the NIR uniform linker for spirv support. Until now the only reason there has been no issue with calling the lowering pass before the linker is because no garbage collection is done between the calls. An upcoming change to the linker will add an optimisation to resize unform arrays where possible. Because lowering causes the array defs to no longer be used the new optimisation ends up resizing the arrays to 0. To fix this we move the lowering call after the linking calls. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4910>	2020-06-03 10:34:22 +00:00
Simon Ser	907bacea13	gbm: document that gbm_bo_map exposes a linear view Drivers (Gallium, i965) expose a linear view of the buffer via gbm_bo_map. Signed-off-by: Simon Ser <contact@emersion.fr> Reviewed-by: Daniel Stone <daniel@fooishbar.org> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5238>	2020-06-03 10:09:52 +00:00
Danylo Piliaiev	9f3956fea0	glsl: Don't replace lrp pattern with lrp if arguments are not floats We don't have "lrp(int, int, int)" and validation of ir_triop_lrp fails down the road. Fixes: `8d37e991` Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3059 Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Tested-by: Witold Baryluk <witold.baryluk@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5257>	2020-06-03 09:06:25 +00:00
Boris Brezillon	3ed2123d77	spirv: Use scoped barriers for SpvOpControlBarrier If use_scoped_barrier is set to true, we don't have to split the control and memory barriers. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4900>	2020-06-03 07:39:52 +00:00
Boris Brezillon	689acc7398	intel/compiler: Extract control barriers from scoped barriers Add a lowering pass extracting all control barriers embedded in scoped barriers into proper control barriers so we can get rid of the logic inserting control barriers when an SpvOpControlBarrier with WorkGroup scope is parsed in spirv_to_nir(). Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4900>	2020-06-03 07:39:52 +00:00
Boris Brezillon	345b5847b4	nir: Replace the scoped_memory barrier by a scoped_barrier SPIRV OpControlBarrier can have both a memory and a control barrier which some hardware can handle with a single instruction. Let's turn the scoped_memory_barrier into a scoped barrier which can embed both barrier types. Note that control-only or memory-only barriers can be supported through this new intrinsic by passing NIR_SCOPE_NONE to the unused barrier type. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Suggested-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4900>	2020-06-03 07:39:52 +00:00
Boris Brezillon	94438a64bf	spirv: Split the vtn_emit_scoped_memory_barrier() logic We are about to add support for scoped control+memory barriers. Let's move the convert from SPIRV to NIR enums logic in helpers so we can easily re-use them. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4900>	2020-06-03 07:39:52 +00:00
Samuel Pitoiset	d3c937c0e4	radv: enable zero VRAM for all VKD3D (DX12->VK) games To fix rendering issues with Metro Exodus, RE2 and 3 and probably more titles. It seems the default behaviour of DX12 anyways. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3064 Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5262>	2020-06-03 08:00:19 +02:00
Samuel Pitoiset	fd5ffd3a83	radv: enable zero VRAM for Doom Eternal That fixes some rendering issues. Probably some unitialized data from the game. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3064 Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5262>	2020-06-03 07:59:57 +02:00
Timothy Arceri	7873276f68	glsl/spirv: remove dead uniforms in spirv nir linker Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4797>	2020-06-03 02:22:23 +00:00
Timothy Arceri	a494b62410	glsl: remove dead uniforms in the nir linker This is now possible as we do uniform linking via a nir based linker. Shader-db results for IRIS (SKL): total instructions in shared programs: 14947192 -> 14946397 (<.01%) instructions in affected programs: 39498 -> 38703 (-2.01%) helped: 230 HURT: 18 total cycles in shared programs: 324868402 -> 324847058 (<.01%) cycles in affected programs: 706701 -> 685357 (-3.02%) helped: 599 HURT: 449 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4797>	2020-06-03 02:22:23 +00:00
Timothy Arceri	60bee4c70c	glsl: add can_remove_uniform() helper to the NIR linker This helper reflects the rules we follow in the GLSL IR linker when deciding if we can remove a dead uniform. This check is required to avoid regressions when turning on NIR dead uniform clean up in the following patch. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4797>	2020-06-03 02:22:23 +00:00
Timothy Arceri	04dbf709ed	nir: add callback to nir_remove_dead_variables() This allows us to do API specific checks before removing variable without filling nir_remove_dead_variables() with API specific code. In the following patches we will use this to support the removal of dead uniforms in GLSL. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4797>	2020-06-03 02:22:23 +00:00
Timothy Arceri	bc79442f3f	nir: add glsl_get_ifc_packing() helper This will be used in the following patch. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4797>	2020-06-03 02:22:23 +00:00
Alyssa Rosenzweig	7ac617c117	pan/mdg: Don't double-replicate blend on T720 We already do this unconditionally in NIR. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5305>	2020-06-03 00:32:24 +00:00
Bas Nieuwenhuizen	edd56bad94	radv: Use common gfx10_format_table.h Save some python code and build time, as well as some code duplication. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5291>	2020-06-03 00:17:00 +00:00
Bas Nieuwenhuizen	560f095dd5	radv: Include gfx10_format_table.h only from a single source file. The radeonsi variant has everything in the header, so lets not include it everywhere. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5291>	2020-06-03 00:17:00 +00:00
Bas Nieuwenhuizen	b351a50763	radeonsi: Define gfx10_format in the common header. So we don't have to have multiple definitions of the struct when sharing with radv. While at it put the table properly in a C file so we don't have to deal with multiple definitions, and the struct definition isn't in generated source. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5291>	2020-06-03 00:17:00 +00:00
Bas Nieuwenhuizen	c98e52f88a	amd/common,radeonsi: Move gfx10_format_table to common. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5291>	2020-06-03 00:17:00 +00:00
Bas Nieuwenhuizen	d936f69677	radeonsi: Explicitly map Z16_UNORM_S8_UINT to None for GFX10. We should always use separate planes for textures with this format. Fixes: `273ead81f1` "util/format: Add VK_FORMAT_D16_UNORM_S8_UINT." Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5291>	2020-06-03 00:17:00 +00:00
Erik Faye-Lund	a21966837a	zink: Use store_dest_raw instead of storing an uint I cleaned up the other similar call-sites, but somehow missed this one. There's nothing different with this, so let's also fix this. Fixes: `16339646f0` ("zink/spirv: rename functions a bit") Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5250>	2020-06-02 21:45:30 +00:00
Oschowa	c310677a75	radv: Explicitly cast TIMESTAMP_NOT_READY value to uin32_t where needed. Fixes a clang warning. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5228>	2020-06-02 21:31:17 +00:00
Oschowa	663e8cb4e6	aco: Use correct reference type in for-range-loop. Fixes a clang warning. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5228>	2020-06-02 21:31:17 +00:00
Oschowa	7b1bc460fd	aco: Don't std::move temporary object. Fixes the following clang warning: mesa/src/amd/compiler/aco_optimizer.cpp:2928:15: warning: moving a temporary object prevents copy elision [-Wpessimizing-move] ctx.uses = std::move(dead_code_analysis(program)); Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5228>	2020-06-02 21:31:17 +00:00
Oschowa	536339b0dd	aco: Don't declare 'Block' as class, but define as struct. Fixes clang warnings. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5228>	2020-06-02 21:31:17 +00:00
Oschowa	c2a778ef0f	radv: Don't take absolute value of unsigned type. Fixes clang warnings. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5228>	2020-06-02 21:31:17 +00:00
Timur Kristóf	7d2fe60f1c	radv/aco: Always enable subgroup shuffle. It is now supported by both backends on all hw. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5223>	2020-06-02 21:12:13 +00:00
Timur Kristóf	045c9ffa7d	aco: Implement subgroup shuffle on GFX6-7. GFX6 and GFX7 don't have the ds_bpermute (or permute) instruction, but we would like to support subgroup shuffle on these old GPUs. So we introduce a new pseudio instruction which will be lowered to an "unrolled loop" that emulates bpermute on GFX6 and GFX7 using readlane instructions, while also respecting the exec mask thanks to v_cmpx. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5223>	2020-06-02 21:12:12 +00:00
Timur Kristóf	14a5021aff	aco/gfx10: Refactor of GFX10 wave64 bpermute. The emulated GFX10 wave64 bpermute no longer needs a linear_vgpr, so we don't consider it a reduction anymore. Additionally, the code is slightly reorganized in preparation for the GFX6 emulated bpermute. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5223>	2020-06-02 21:12:12 +00:00
Marek Olšák	fe3947632c	radeonsi: add a hack to disable TRUNC_COORD for shadow samplers This fixes dEQP-GLES3.functional.shaders.texture_functions.textureprojlodoffset.sampler2dshadow_vertex. This is probably a dEQP bug. Fixes: `d573d1d825` Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5209>	2020-06-02 20:47:49 +00:00
Marek Olšák	85a6bcca61	radeonsi: pass at most 3 images and/or shader buffers via user SGPRs for compute This should slightly decrease shader lifetime. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5209>	2020-06-02 20:47:49 +00:00
Marek Olšák	877c56bfdc	radeonsi: remove const_buffers_declared hacks This was a bug that was uncovered by `4553fc66a5`. Piglit: spec@arb_uniform_buffer_object@maxblocks Fixes: `4553fc66a5` Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5209>	2020-06-02 20:47:49 +00:00
Marek Olšák	ce4575b3b5	radeonsi: remove unused leftover code for INDIRECT_BUFFER inside IBs Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5209>	2020-06-02 20:47:49 +00:00
Marek Olšák	cac24bee62	nir: gather which images are MSAA Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5209>	2020-06-02 20:47:49 +00:00
Marek Olšák	6503e4be13	nir: gather which images are buffers Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5209>	2020-06-02 20:47:49 +00:00
Marek Olšák	f8ef15c061	nir: don't count samplers and images in interface blocks Acked-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5209>	2020-06-02 20:47:49 +00:00
Marek Olšák	c6c8a9bd55	ac/nir: support v2f16 derivatives Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5003>	2020-06-02 16:29:25 -04:00
Marek Olšák	7c423dd721	ac/nir: set the second v_cvt_pkrtz argument to undef if it's unused Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5003>	2020-06-02 16:29:25 -04:00
Marek Olšák	bfb95725aa	ac/nir: select v_cvt_pkrtz for all conversions from f32 to f16 for radeonsi Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5003>	2020-06-02 16:29:25 -04:00
Marek Olšák	1d80015eaf	ac/nir: handle nir_op_[fiu]2[fiu]mp opcodes Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5003>	2020-06-02 16:29:25 -04:00
Marek Olšák	70b6d54011	ac/nir: support 16-bit data in image opcodes Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5003>	2020-06-02 16:29:25 -04:00
Marek Olšák	c3e0ba52a0	ac/nir: support 16-bit data in buffer_load_format opcodes Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5003>	2020-06-02 16:29:25 -04:00
Marek Olšák	b819ba949b	ac/nir: remove type and num_channels args from ac_build_buffer_store_common They were only used for type overloading where we can just use the type of data. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5003>	2020-06-02 16:29:25 -04:00
Marek Olšák	b98df7bf50	ac/nir: support vector types in the type suffix of overloaded intrinsics Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5003>	2020-06-02 16:29:25 -04:00
Marek Olšák	e5ea87cde8	ac/nir: use more types from ac_llvm_context Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5003>	2020-06-02 16:29:25 -04:00
Marek Olšák	116ec85012	ac: rename has_double_rate_fp16 -> has_packed_math_16bit Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5003>	2020-06-02 16:29:25 -04:00
Marek Olšák	1af8fe4ed5	gallium: add shader caps INT16 and FP16_DERIVATIVES Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5002>	2020-06-02 20:01:18 +00:00
Marek Olšák	733bee57eb	glsl: lower samplers with highp coordinates correctly Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5002>	2020-06-02 20:01:18 +00:00
Marek Olšák	0c0803c32f	glsl: lower the precision of imageLoad Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5002>	2020-06-02 20:01:18 +00:00
Marek Olšák	1192989533	glsl: lower mediump partial derivatives Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5002>	2020-06-02 20:01:18 +00:00
Marek Olšák	6fe20ebaaa	glsl: lower mediump integer types to int16 and uint16 Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5002>	2020-06-02 20:01:18 +00:00
Marek Olšák	a052a9c277	glsl: handle int16 and uint16 types and add instructions for mediump v2: add more changes to ir_validate.cpp Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5002>	2020-06-02 20:01:18 +00:00
Marek Olšák	9c14a87839	glsl: treat lowp as mediump when lowering builtins This seems to have been missed. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5002>	2020-06-02 20:01:18 +00:00
Marek Olšák	116e006693	nir: add options::vectorize_vec2_16bit to limit vectorization to vec2 16 for hardware that is scalar but can do 2 16-bit operations on low and high 16 bits of registers at once. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5002>	2020-06-02 20:01:18 +00:00
Marek Olšák	a6916d1ce8	nir: fix lower_wpos for 16-bit fddy Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5002>	2020-06-02 20:01:18 +00:00
Marek Olšák	92333c6d1a	nir: lower int16 and uint16 in nir_lower_mediump_outputs Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5002>	2020-06-02 20:01:18 +00:00
Marek Olšák	6f2e95f24d	nir: add int16 and uint16 type helpers Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5002>	2020-06-02 20:01:18 +00:00
Marek Olšák	f798513f91	nir: add i2imp and u2ump opcodes for conversions to mediump Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5002>	2020-06-02 20:01:18 +00:00
Alyssa Rosenzweig	f3310cb3e1	nir: Fold f2f16(b2f32(x)) to b2f16(x) By definition. This reduces register pressure on freedreno so that the noubo expected failure goes away. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5002>	2020-06-02 20:01:18 +00:00
Jonathan Marek	a2903dd767	turnip: fix RENDER_COMPONENTS value This fixes render_components being 0 when mrt_count=8, because shift by 32 is UB and in arm64 it ends up shifting by 0. This fixes tests with 8 MRTs. Fixes the 3d path sysmem CmdClearAttachments to set RENDER_COMPONENTS, as it was previously relying on tu6_emit_mrt setting it, but it is now part of the pipeline state. Also switch back to the previous behavior of not setting render components for VK_ATTACHMENT_UNUSED attachments: we don't update the MRT state for such attachments so we definitely don't want to be trying writing to those. Fixes: `078aa9df8d` ("tu: Move RENDER_COMPONENTS setting to pipeline state") Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5292>	2020-06-02 18:42:09 +00:00
Dylan Baker	fb62e642ae	vulkan-overlay/meson: use install_data instead of configure_file We don't want to copy the file into the build directory, we want to install it. That's what install_data is for. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2924 Fixes: `56ccea58ae` ("vulkan/overlay: Add basic overlay control script.") Acked-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4740>	2020-06-01 18:59:18 +00:00
Dylan Baker	a63e5cbe48	meson: use 2 space not 3 space indent Acked-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4740>	2020-06-01 18:59:18 +00:00
Dylan Baker	a8e2d79e02	meson: use gnu_symbol_visibility argument This uses a meson builtin to handle -fvisibility=hidden. This is nice because we don't need to track which languages are used, if C++ is suddenly added meson just does the right thing. Acked-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4740>	2020-06-01 18:59:18 +00:00
Dylan Baker	fc7301865e	drm-shim/meson: Use portable override_options for setting C standard Acked-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4740>	2020-06-01 18:59:18 +00:00
Dylan Baker	23df13c988	drm-shim/meson: The name of the target is a string not a list This happens to work, but it's not guaranteed to Acked-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4740>	2020-06-01 18:59:18 +00:00
Dylan Baker	6ef314b4fa	meson: Use build_always_stale instead of build_always which was deprecated in 0.47. This doesn't change behavior, just shuts up a warning. Acked-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4740>	2020-06-01 18:59:18 +00:00
Dylan Baker	c1a290bdd5	meson: Bump required version to 0.52.0 This matches what other graphics space projects require now, and allows us to simplify a number of cases, as well as make use of new features in meson. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2737 Acked-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4740>	2020-06-01 18:59:18 +00:00
Alyssa Rosenzweig	30a393f458	pan/mdg: Enable out-of-order execution after texture ops We don't make great use of it (due to the scheduler not being aware yet), but we can pack for it regardless and maybe pick up some win. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5286>	2020-06-01 18:38:49 +00:00
Alyssa Rosenzweig	7c0e82d4ab	pan/mdg: Add quirk for missing out-of-order support Added in T760, like the other good parts of Midgard. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5286>	2020-06-01 18:38:49 +00:00
Alyssa Rosenzweig	31de10c434	pan/mdg: Disassemble out-of-order bits Optimization for texture instructions, allowing ALU and LD/ST within a single thread while a texture read is still in flight. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5286>	2020-06-01 18:38:49 +00:00
Alyssa Rosenzweig	ca6759c3f9	panfrost: Remove unused nir_lower_framebuffer pass Superseded. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5284>	2020-06-01 18:10:59 +00:00
Alyssa Rosenzweig	7de4b98193	panfrost: Don't flush explicitly when mipmapping The reorder work already takes cares of this nicely. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5284>	2020-06-01 18:10:59 +00:00
Alyssa Rosenzweig	975238dc2a	panfrost: Use VTX tag for vertex texturing Fixes BARRIER faults. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5284>	2020-06-01 18:10:59 +00:00
Alyssa Rosenzweig	89a9cc7645	panfrost: Permit AFBC of RGB8 Ugly but hey. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5284>	2020-06-01 18:10:59 +00:00
Alyssa Rosenzweig	3a8e5eb1b1	panfrost: Fix PRESENT flag mix-up Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5284>	2020-06-01 18:10:59 +00:00
Alyssa Rosenzweig	7c793a4867	pan/mdg: Fuse f2f16 into load_interpolated_input To become a ld_vary intrinsic. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5283>	2020-06-01 12:37:03 -04:00
Alyssa Rosenzweig	5f8dd413bc	pan/mdg: Handle 16-bit ld_vary Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5283>	2020-06-01 12:36:46 -04:00
Alyssa Rosenzweig	e42950fe96	panfrost: Use internal_format throughout Fixes R32F_S8 texturing. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:24 +00:00
Alyssa Rosenzweig	e7765a8c7f	panfrost: Add separate_stencil BO to batch Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:24 +00:00
Alyssa Rosenzweig	6aa7f6792d	panfrost: Check for large tilebuffer requirements Fixes the rest of dEQP-GLES3.functional.fragment_out.array.uint.*, this situation occurs with MRT and large pixels. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:24 +00:00
Alyssa Rosenzweig	c46b11438d	panfrost: Let Gallium pack colours Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	8dc8b66403	panfrost: Account for differing types in blend lower Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	0c9fe82ee9	panfrost: Conditionally allow fp16 blending Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	19b4e586f6	panfrost: Switch to pan_lower_framebuffer It now supports what we need. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	4c286cc0a2	panfrost: Un/pack sRGB via NIR Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	5d14757c03	panfrost: Un/pack R11G11B10 NIR has a helper for it already; we can reuse. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	e24e248b84	panfrost: Un/pack RGB10_A2_UINT It's different. Because forget me. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	91cc678551	panfrost: Un/pack RGB10_A2_UNORM It's a funny one. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	7de0e5500b	panfrost: Un/pack RGB565 and RGB5A1 Basically the same as RGBA4 Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	ff590702da	panfrost: Un/pack UNORM 4 Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	eab8701e7c	panfrost: Flesh out dispatch Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	e937dd521b	panfrost: Un/pack 8-bit UNORM Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	f01aabb829	panfrost: Un/pack pure 8-bit Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	9a6483bb47	panfrost: Un/pack pure 16-bit Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	c31bcca48e	panfrost: Un/pack pure 32-bit Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	e5fcc193f7	panfrost: Stub out lowering boilerplate Structure ourselves as a NIR pass replacing loads/stores with unpacked/packed versions as necessary. Not actually functional yet. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	dbd72a8f94	panfrost: Determine classes for stores Fewer special cases here, thankfully. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	18a767df35	panfrost: Determine load classes for formats Via quirks. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	e53d27de61	panfrost: Add quirks for blend shader types Every hardware has its own set of what it can and can't do... let's document it all as quirks so the lowering code is GPU-agnostic. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	60d647f9de	panfrost: Determine unpacked type for formats Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	5c82f8a097	panfrost: Add theory for new framebuffer lowering We take a somewhat different strategy that should be more flexible. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	5a175e4a1b	pan/mdg: Implement raw colourbuf loads on T720 Uses a similar path to the fp16 cbuf loads on T760. It should make sense given the symmetry with T860. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	4f82aad7a2	pan/mdg: Drop the u8 from the colorbuf op names Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	49840a8a58	pan/mdg: Print 8-bit constants Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	0ff0291896	pan/mdg: Handle bitsize for packs Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	e9c780b1d0	pan/mdg: Treat packs "specially" We maybe would prefer synthetic ops? We'll find out in due time.. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	c495c6c295	pan/mdg: Add pack_unorm_4x8 via 8-bit More efficient than the 32-bit version in NIR. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Alyssa Rosenzweig	551d990a7c	pan/mdg: Handle un/pack opcodes as moves Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5265>	2020-06-01 15:46:23 +00:00
Chris Wilson	605b0e8acf	iris: Fixup copy'n'paste mistake in Makefile.sources In changing iris_seqno.[ch] to iris_fine_fence.[ch] and moving the lines earlier, the newline escape was forgotten. Fixes: `034329128b` ("iris: Rename iris_seqno to iris_fine_fence") Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5264>	2020-05-31 22:01:48 +00:00
Satyeshwar Singh	aaec065f03	intel/dev: Don't consider all TGL SKUs as GT1 only We should be passing _gt instead of 1 to GEN12_FEATURES or else all TGL SKUs will be considered as gt1 only. Fixes: `54996ad492` ("intel/dev: Split .num_subslices out of GEN12_FEATURES macro") Signed-off-by: Satyeshwar Singh <satyeshwar.singh@intel.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5261>	2020-05-30 17:24:58 -07:00
Vinson Lee	d2f8105b60	r300g: Remove extra printf format specifiers. Fix warning reported by Coverity Scan. Missing argument to printf format specifier (PRINTF_ARGS) missing_argument: No argument for format specifier %s. Fixes: `04c1536bf7` ("r300g: rasterizer debug logging") Fixes: `85efb2fff0` ("r300g: try to use color varyings for texcoords if max texcoord limit is exceeded") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5274>	2020-05-30 14:28:01 -07:00
Ilia Mirkin	6e1c47b98d	nouveau: allow invalidating coherent/persistent buffer backings This is needed to support the core's usage of coherent buffers for glVertex-style input. The reason why this was disallowed is that any mappings will be invalidated. Let the state tracker worry about that, and just reallocate when we're told. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Karol Herbst <kherbst@redhat.com> Cc: mesa-stable@lists.freedesktop.org Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5276>	2020-05-30 17:59:24 +00:00
Jason Ekstrand	c48f42e178	intel/fs: Emit HALT for discard on Gen4-5 Using HALT to immediately jump to the end of the shader is required to implement GL_EXT_gpu_shader4 and OpenGL 3.0. However, vanilla OpenGL 1.2 doesn't forbid it and it likely makes something somewhere faster. We should be consistent and implement the same discard behavior on all hardware if we can. The rules for HALT on Gen4-5 are a bit different from Gen6+. On the older hardware, there is no stack for HALT; instead it's up to software to save and restore mask registers. However, there's no real saving needed since we only use HALT to jump to the end of the program where we're about about to do our FB writes. All we need to do is reset AMask to DMask, the value it was initialized to at the start of the thread. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5244>	2020-05-30 06:21:15 +00:00
Jason Ekstrand	94aa7997e4	intel/fs: Fix unused texture coordinate zeroing on Gen4-5 We were inserting the right number of MOVs but, thanks to the way we advanced msg_end earlier in the function, were often writing the zeros past the end of where we actually read in the register file. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5243>	2020-05-30 01:08:50 -05:00
Jason Ekstrand	a7c8811fe4	intel/vec4: Stomp the return type of RESINFO to UINT32 We already do this in the FS back-end; we just weren't doing it in vec4 so RESINFO messages weren't returning the right data. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5243>	2020-05-30 01:08:50 -05:00
Timothy Arceri	e843303d6f	radv: fix regression with builtin cache If the ~/.cache dir already exists continue on without failing. Fixes: `cd61f5234d` ("radv: Handle failing to create .cache dir.") Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5249>	2020-05-30 04:01:28 +00:00
Bas Nieuwenhuizen	7e4c8949c6	gallium/dri: Remove lowered_yuv tracking for plane mapping. Just heard that etnaviv is also compatible with it even in the non-lowered cases, so let us enable it for everyone. Reviewed-by: Lucas Stach <l.stach@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5253>	2020-05-30 02:49:54 +00:00
Vinson Lee	13735c4f47	panfrost: Fix printf format specifier. bifrost_sampler_descriptor.zero1 is of type uint8_t. Fix warning reported by Coverity. Invalid type in argument to printf format specifier (PRINTF_ARGS) invalid_type: Argument s->zero1 to format specifier %lx was expected to have type unsigned long but has type unsigned char. Fixes: `6148d1be4b` ("panfrost: Fix size of bifrost sampler descriptor") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5248>	2020-05-30 02:10:12 +00:00
Marek Olšák	4925fb97f6	glthread: don't upload for glDraw inside a display list and always sync Let the vbo module handle it, not glthread. This handles functions set in vbo_initialize_save_dispatch. Fixes: `2840bc3065` ("glthread: upload non-VBO vertices and indices for non-Indirect non-IBM draws") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3001 Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5246>	2020-05-30 01:42:56 +00:00
Bas Nieuwenhuizen	cf99267147	util/format: Add more multi-planar formats. These don't have a fourcc code as far as I can tell, but we want them for internal Vulkan use. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5195>	2020-05-30 01:22:51 +00:00
Bas Nieuwenhuizen	d491b0dfd9	util/format: Use correct pipe format for VK_FORMAT_G8_B8_R8_3PLANE_420_UNORM. NV12 is UVUVUV (https://wiki.videolan.org/YUV#NV12) and in Vulkan is VK_FORMAT_G8_B8R8_2PLANE_420_UNORM. So U=B and V=R. So plane order in VK_FORMAT_G8_B8_R8_3PLANE_420_UNORM is YUV, which is PIPE_FORMAT_IYUV. Further confirmation: https://fourcc.org/yuv.php U=Cb V=Cr. From the nir ycbcr conversion, B=Cb and R=Cr. Fixes: `75d7ee8029` "util/format: translate 422_UNORM and 420_UNORM vulkan formats" Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5195>	2020-05-30 01:22:51 +00:00
Bas Nieuwenhuizen	273ead81f1	util/format: Add VK_FORMAT_D16_UNORM_S8_UINT. Not participating in packing/unpacking/stencil-only/depth-only, because it doesn't mix well in a single plane. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5195>	2020-05-30 01:22:51 +00:00
Vinson Lee	f047d585ee	etnaviv: Fix memory leak on error path. Fix warning reported by Coverity Scan. Resource leak (RESOURCE_LEAK) leaked_storage: Variable pq going out of scope leaks the storage it points to. Suggested-by: Christian Gmeiner <christian.gmeiner@gmail.com> Fixes: `eed5a00989` ("etnaviv: convert perfmon queries to acc queries") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5220>	2020-05-30 01:04:30 +00:00
Alyssa Rosenzweig	bccb3deee2	panfrost: Probe G31/G52 if PAN_MESA_DEBUG=bifrost We're not quite ready to open the flood gates on Bifrost (a major blocker is CI, which is itself blocked on the lockdowns - expected to be resolved in the coming months..) Nevertheless, let's add a debug option to probe on compatible Bifrost devices to avoid keeping out-of-tree patches around. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5272>	2020-05-29 19:24:45 -04:00
Alyssa Rosenzweig	be8cbe0b41	panfrost: Add GPU IDs for G31/G52 Dvalin/Gondul respectively. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5272>	2020-05-29 19:24:05 -04:00
Alyssa Rosenzweig	229084f5de	panfrost: Disable QUAD_STRIP/POLYGON on Bifrost Support was dropped and now raises a DATA_INVALID_FAULT on G31. Unknown if retained on other devices. GL_QUADS is still ok. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:56 +00:00
Alyssa Rosenzweig	4be2cd604b	pan/bi: Passthrough deps of the branch target Now that we have the infrastructure, follow the branch. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:56 +00:00
Alyssa Rosenzweig	8230a04f51	pan/bi: Allow two successors in header packing We need to take the union of the dependencies. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:56 +00:00
Alyssa Rosenzweig	db2c10d032	pan/bi: Measure backwards branches as well Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:56 +00:00
Alyssa Rosenzweig	a42731536d	pan/bi: Add bi_foreach_block_from_rev helper Needed for next commit. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	c697992ca1	pan/bi: Defer block naming until after emit This ensures names are meaningful. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	bd6ff4f7e1	pan/bi: Pack unconditional branch Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	e4791d2bf8	pan/bi: Set branch conditional bit Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	ffe7a61a46	pan/bi: Set back-to-back bit more accurately See Connor's ISA notes. Basically set unless it's a branch (explicit or fallthrough). Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	3aacfaf87e	pan/bi: Set branch_conditional if b2b is set Match the blob. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	e945d4f79d	pan/bi: Pack proper clause offsets Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	682b63cdc2	pan/bi: Measure distance between blocks For branch offset calculation. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	64c49ab1fc	pan/bi: Add bi_foreach_clause_in_block_from{_rev} helpers Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	64bedbfa67	pan/bi: Link clauses back to their blocks Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	9c32956750	pan/bi: Preliminary branch packing Simple == 0 branch packing. Offset is still to-do. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	cd9a08d4f2	pan/bi: Assign constant port for branch offsets By convention. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	cdff3ebc9a	pan/bi: Set branch_constant if there is a branch Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	b9967ab6da	pan/bi: Pack branch offset constants This is not fully generic but for a single constant it will do. Extensions left for future work. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	627872ef7f	pan/bi: Add branch constant field to IR The offsets used for branches need some extra bits twiddled, so add a field to the clause to indicate this is happening. This is not ambiguous since a clause can only have a single branch. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	f1298ae336	pan/bi: Passthrough ZERO in branch packing There's a special mode for it. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	d619ff009b	pan/bi: Fix branch condition typesize Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	1cdd55a81e	pan/bi: Fix CONVERT component counting Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	d8c6a71878	pan/bi: Only rewrite COMBINE dest if not SSA If it's already a register, there's no point in rewriting and it will disturb the existing register, i.e. for if (..) { r0 = vecN .. } else { r0 = vecN .. } Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	e42a5dfd4f	pan/bi: Fix emit_if successor assignment Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Fixes: `9a00cf3d1e` ("pan/bi: Add support for if-else blocks") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	b34eb94d9c	pan/bi: Allow printing branches without targets Useful for debugging codegen. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	a4fc16a1d4	pan/bi: Remove schedule_barrier Legacy from Midgard. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	b3ae088b96	pan/bi: Add helper to measure clause size Useful for branching. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	2a4e4477fc	pan/bi: Add bi_layout.c for clause layout helpers Figuring out what "shapes" of clauses are kosher happens during scheduling, not packing, but shouldn't distract the scheduler. So let's add a new file for these sorts of questions. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	c3de28bb49	pan/bi: Remove more artefacts of 2-pass scheduling A clause is, by definition, already scheduled. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	4096be05af	pan/bi: Add MUL.i32 to disasm Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	ec8665615f	pan/bi: Disassemble pos=0xe Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	a658a4f7a5	pan/bi: Document constant count invariant constants + instructions <= 13 Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	ac64bf9b20	pan/bi: Move bi_flip_ports out of port assignment It's more of a packing fixup than anything scheduler-y, and port assignment will soon be the domain of the scheduler. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	95e3776d3e	pan/bi: Add FILE* argument to bi_print_registers In case we need it in general IR printing. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	dd96b451f6	pan/bi: Drop `struct` from bi_registers It's a full-fledged part of the IR now. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	b042ddef32	pan/bi: Move bi_registers to bi_bundle Make it a part of the IR itself. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	79f30d8a86	pan/bi: Move bi_registers to common IR structures Port assignments are critical to scheduling, this can't just live in bi_pack. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	59f8f20306	pan/bi: Remove comment about old scheduler design I've realized it really has to be 1-pass to be sane. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	635bf652ed	pan/bi: Remove FMA? parameter from get_src We can lower away zeroes a bit earlier. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5260>	2020-05-29 20:34:55 +00:00
Alyssa Rosenzweig	20f6c7a913	panfrost: Preload gl_FragCoord on Bifrost It's a precoloured register but we do need to specify in the cmdstream that we want the preloading to happen. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5267>	2020-05-29 20:19:46 +00:00
Alyssa Rosenzweig	1d194f8ac4	panfrost: Set reads_frag_coord as a sysval In addition to parsing out the varying. This is needed so it works on Bifrost as well. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5267>	2020-05-29 20:19:46 +00:00
Alyssa Rosenzweig	52875a34aa	panfrost: Don't generate gl_FragCoord varying on Bifrost It's treated as a sysval there, so that's silly. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5267>	2020-05-29 20:19:46 +00:00
Rob Clark	11470fcde2	freedreno/a6xx: fix vsc assert Fixes a debug build assert seeing with an android app. Not quite sure which path was passing us draw_info w/ instance_count==0. But we should just treat non-instanced draws as having a single instance. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5091>	2020-05-29 19:35:08 +00:00
Kristian H. Kristensen	f6f7bc2979	freedreno/a6xx: Program VFD_DEST_CNTL from program stateobj This only depends on the generated shader. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5140>	2020-05-29 18:59:56 +00:00
Kristian H. Kristensen	7aa809e31c	freedreno/a6xx: Create stateobj for VFD_DECODE This now only depends on vertex state and we can create it once up front in pctx->create_vertex_elements_state(). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5140>	2020-05-29 18:59:56 +00:00
Kristian H. Kristensen	8952dd6d99	freedreno/a6xx: Decouple VFD_FETCH and VFD_DECODE We used to output a VFD_FETCH entry for each VFD_DECODE, but we can instead output just one VFD_FETCH per VBO and point multiple VFD_DECODE entries at the same VFD_FETCH entry. There's typically fewer VBOs than vertex elements so this is a small win in itselfs, but more importantly, the VFD_DECODE state now only depends on program state. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5140>	2020-05-29 18:59:56 +00:00
Kristian H. Kristensen	c15db8928f	freedreno/a6xx: Move per element offset to VFD_DECODE Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5140>	2020-05-29 18:59:56 +00:00
Samuel Pitoiset	9d645a19eb	radv/aco: enable VK_KHR_subgroup_extended_types on GFX8+ Should be working now. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5148>	2020-05-29 11:20:58 +00:00
Samuel Pitoiset	e22567089c	aco: sign-extend input/indentity for 32-bit reduce ops on GFX10 Because some 16-bit instructions are already VOP3 on GFX10, we use the 32-bit variants to remove the temporary VGPR and to use DDP with the arithmetic instructions. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5148>	2020-05-29 11:20:58 +00:00
Samuel Pitoiset	83dcd1690b	aco: allow gfx10_wave64_bpermute with 8-bit/16-bit input Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5148>	2020-05-29 11:20:58 +00:00
Samuel Pitoiset	8ece71507d	aco: allocate a temp VGPR for some 8-bit/16-bit reduction ops on GFX10 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5148>	2020-05-29 11:20:58 +00:00
Samuel Pitoiset	2e0ea9bcca	aco: implement 8-bit/16-bit reductions on GFX10 Some 16-bit instructions are VOP3 on GFX10 and we have to emit a 32-bit DPP mov followed by the ALU instruction. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5148>	2020-05-29 11:20:58 +00:00
Samuel Pitoiset	75a730ced5	aco: fix register allocation for subdword instructions on GFX10 Cc: 20.1 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5148>	2020-05-29 11:20:58 +00:00
Bas Nieuwenhuizen	ad609bf55a	frontend/dri: Implement mapping individual planes. It is kinda surprising that image2 = fromPlanar(image, 2, NULL) mapImage(..., image2, ...) does not map the third plane. This implements that behavior in the case where the DRI frontend lowers the multi-planar textures. In the case it doesn't this would need driver support. AFAIU at least etnaviv is impacted, and while it looks possible, I don't have the etnaviv knowledge to implement it. Instead of silently returning weird results (either always plane 0 or possibly something interleaved) this adds an error return on mapping multi-planar textures otherwise. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5200>	2020-05-29 09:12:33 +00:00
Vinson Lee	a2ee293422	zink: Check fopen result. Fix warning reported by Coverity. Dereference null return value (NULL_RETURNS) dereference: Dereferencing a pointer that might be NULL fp when calling fwrite. Fixes: `8d46e35d16` ("zink: introduce opengl over vulkan") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5235>	2020-05-29 08:59:19 +00:00
Samuel Pitoiset	7503863fe2	radv/aco: enable VK_EXT_subgroup_size_control ACO should already support Wave32 on GFX10 with all shader stages and CTS pass. RADV currently only allows Wave32 with the compute shader stage. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5056>	2020-05-29 10:12:26 +02:00
Rob Clark	6f39126200	freedreno/a6xx: document LRZ flag buffer Doesn't seem to be a big win, although I could still be missing something in my implementation. But might as well add the documentation. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5217>	2020-05-29 00:38:28 +00:00
Rob Clark	a3947f9d24	freedreno/a6xx: LRZ fix for alpha-test Similarly to stencil-test, if alpha-test is enabled, we don't know necessarily whether the fragment will pass. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3045 Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5217>	2020-05-29 00:38:28 +00:00
Neha Bhende	838666a41d	util: Initialize pipe_shader_state for passthrough and transform shaders mesa/st is initializing pipe_shader_state for user define shaders. This patch intialized pipe_shader_state for all passthough and transform shaders. This fixes crashes for several opengl apps. Issue is found in vmware internal testing Fixes: `f01c0565bb` ("draw: free the NIR IR.") Reviewed-by: Charmaine Lee <charmainel@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5240>	2020-05-28 23:27:53 +00:00
Chris Wilson	034329128b	iris: Rename iris_seqno to iris_fine_fence Rename iris_seqno to iris_fine_fence, borrowed from si_fine_fence, to avoid introducing any confusion with any other seqno used for tracking pipelines. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5233>	2020-05-28 12:47:19 -07:00
Gert Wollny	682e14d3ea	nir: lower_tex: Don't normalize coordinates for TXF with RECT v2: remove the option to actually request normalization and its application in Intel < Gen6 (Jason) v3: Also don't lower for query operations (Jason) Fixes: `1ce8060c25` nir/lower_tex: support for lowering RECT textures Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5105>	2020-05-28 18:39:29 +00:00
Samuel Pitoiset	10c4a7cf59	spirv,radv,anv: implement no-op VK_GOOGLE_user_type This extension only allows HLSL shader compilers to optionally embed unambiguous type information which can be safely ignored by the driver. This fixes a crash with the recent Vulkan backend of Path Of Exile (it uses the extension without checking if it's supported). Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Edmondo Tommasina <edmondo.tommasina@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5237>	2020-05-28 17:30:24 +02:00
Rhys Perry	01ce7887bf	aco: fix 64-bit shared_atomic_exchange Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4880>	2020-05-28 10:34:03 +00:00
Rhys Perry	1f2fd9c62e	aco: don't reorder barriers in the scheduler Unless we're reordering it around a barrier of the same type No shader-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4880>	2020-05-28 10:34:03 +00:00
Rhys Perry	e1900ee2c7	aco: preserve more fields when combining additions into SMEM Totals from 11 (0.01% of 127638) affected shaders: Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Fixes: `93c8ebfa78` ('aco: Initial commit of independent AMD compiler') Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4880>	2020-05-28 10:34:03 +00:00
Rhys Perry	95d5c1b8a1	aco: check instruction format before waiting for a previous SMEM store Totals from 7 (0.01% of 127638) affected shaders: CodeSize: 40336 -> 40320 (-0.04%) Instrs: 7807 -> 7803 (-0.05%) Cycles: 118588 -> 118344 (-0.21%); split: -0.23%, +0.02% SMEM: 331 -> 339 (+2.42%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Fixes: `1749953ea3` ('aco/gfx10: Wait for pending SMEM stores before loads') Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4880>	2020-05-28 10:34:03 +00:00
Rhys Perry	5ccc7c277c	aco: consider SDWA during value numbering Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Fixes: `23ac24f5b1` ('aco: add missing conversion operations for small bitsizes') Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5164>	2020-05-28 09:55:58 +00:00
Rhys Perry	8aa98cebc1	aco: fix interaction with 3f branch workaround and p_constaddr The offset was incorrect if we inserted a nop before the p_constaddr. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Fixes: `93c8ebfa` ('aco: Initial commit of independent AMD compiler') Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5164>	2020-05-28 09:55:58 +00:00
Caio Marcelo de Oliveira Filho	bccf2a25a8	intel: Add helper to calculate GPGPU_WALKER::RightExecutionMask Suggested by Jason. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5142>	2020-05-27 18:16:31 -07:00
Caio Marcelo de Oliveira Filho	78e400d4a5	iris, i965: Update limits for ARB_compute_variable_group_size The CS compiler now produces multiple SIMD variants, so the previous trade-off between "always using SIMD32" and "having a smaller max invocations" is now gone. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5142>	2020-05-27 18:16:31 -07:00

... 4 5 6 7 8 ...

115280 Commits