KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Erik Faye-Lund	10f07495f6	zink: only report device-local memory as video-memory While the definition of "video memory" isn't super clear, I think it's pretty reasonable to assume host-memory isn't meant to be included. So let's only count dedicated memory here. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3107 Reviewed-by: Witold Baryluk <witold.baryluk@gmail.com> Tested-by: Witold Baryluk <witold.baryluk@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5409>	2020-06-10 08:58:09 +00:00
Samuel Pitoiset	9b58c4958b	ac/nir: fix integer comparisons with pointers If we get a comparison between a pointer and an integer, LLVM complains if the operands aren't of the same type. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3085 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5397>	2020-06-10 08:18:22 +00:00
Pierre-Eric Pelloux-Prayer	24ceb6a594	radeonsi/ngg: try GS multi-cycling mode if default mode failed If gsprim_lds_size is larger than target_lds_size then gfx10_ngg_calculate_subgroup_info will fail. This commit adds a logic to try the multi-cycling in this case because it's using less memory. This fix glsl-1.50-gs-max-output when using NGG. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5401>	2020-06-10 09:33:58 +02:00
Pierre-Eric Pelloux-Prayer	ce7692fc19	radeonsi: add return value to gfx10_ngg_calculate_subgroup_info gfx10_ngg_calculate_subgroup_info uses assert to detect invalid configuration, but if asserts are disabled it will continue its execution. This commits adds a boolean return value to let the caller know that something went wrong and that the results mustn't be used. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3103 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5401>	2020-06-10 09:33:48 +02:00
Andrii Simiklit	2c711beb5c	glsl: fix crash on glsl macro redefinition In case shader contains two equal macro defines, first one with trailing spaces and the second one without. `#define A 1 ` `#define A 1` The parser crashes Fixes: `0346ad3774` ("glsl: ignore trailing whitespace when define redefined") Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5312>	2020-06-10 03:29:39 +00:00
Jason Ekstrand	0c37cbf807	anv/allocator: Compare to start_offset in state_pool_free_no_vg In `d11e4738a8`, we started using a start_offset to allow us to allocate pools where the base address isn't at the start of the pool. This is useful for binding table pools which want to be relative to surface state base address (more or less), among other things. However, we had a bug where, if you have a negative offset, everything returned to the pool would end up being returned to the "back" of the pool. This isn't what we want for binding tables in the softpin world. This was causing us to never actually re-use any binding table blocks. How this passed CTS, I have no idea. Closes: #3100 Fixes: `d11e4738a8` "anv/allocator: Add a start_offset to anv_state_pool" Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5395>	2020-06-09 22:52:26 +00:00
Alyssa Rosenzweig	5d547858da	panfrost: Ensure we have ro before using it Even through the resouce requested has a BIND_SCANOUT or related tag, this does not mean that we have a render-only driver. This can trivially happen as one requests such resource from GBM, while using the panfrost fd (and hence panfrost_dri.so) Forward port of !3000 Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: Robert Foss <robert.foss@collabora.com> Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Closes: #2664 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5410>	2020-06-09 22:09:07 +00:00
Samuel Pitoiset	64f2d45c3b	radv/aco: enable shaderInt8 and VK_KHR_shader_float16_int8 on GFX6-GFX7 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5226>	2020-06-09 21:25:38 +00:00
Samuel Pitoiset	be4dd6abd1	radv/aco: enable shaderInt16 on GFX6-GFX7 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5226>	2020-06-09 21:25:38 +00:00
Samuel Pitoiset	b3aee3aa23	radv/aco: enable 8-bit/16-bit storage on GFX6-GFX7 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5226>	2020-06-09 21:25:38 +00:00
Daniel Schürmann	5cde4989d3	aco: remove unnecessary split- and create_vector instructions for subdword loads This helps GFX6/7 by removing unnecessary shuffle code. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5226>	2020-06-09 21:25:38 +00:00
Samuel Pitoiset	5446e3cf2e	aco: fix alignment of vectors with 4 elements I think this case was just missing. This fixes a bunch of 16-bit storage related CTS failures like dEQP-VK.ssbo.phys.layout.single_basic_type.std430.u16vec4. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5226>	2020-06-09 21:25:38 +00:00
Samuel Pitoiset	c7bd0f8cd5	aco: implement 8-bit/16-bit conversions on GFX6-GFX7 Use v_bfe to implement small bitsize conversions because the compiler probably optimizes this better. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5226>	2020-06-09 21:25:38 +00:00
Daniel Schürmann	db957f9135	aco: optimize packing of 16bit subdword registers on GFX6/7 Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5226>	2020-06-09 21:25:38 +00:00
Daniel Schürmann	2a51840c52	aco: skip partial copies on first iteration when lowering to hw Helps some Detroit : Become Human shaders. Totals from affected shaders: (VEGA) Code Size: 47693912 -> 47670212 (-0.05 %) bytes Instructions: 9183788 -> 9177863 (-0.06 %) Copies: 910052 -> 904127 (-0.65 %) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5226>	2020-06-09 21:25:38 +00:00
Daniel Schürmann	1d6f667193	aco: coalesce copies more aggressively when lowering to hw Helps some Detroit : Become Human shaders. Totals from affected shaders: (VEGA) Code Size: 9880420 -> 9879088 (-0.01 %) bytes Instructions: 1918553 -> 1918220 (-0.02 %) Copies: 177783 -> 177450 (-0.19 %) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5226>	2020-06-09 21:25:38 +00:00
Daniel Schürmann	b21d2d9a9f	aco: add and use scratch SGPR to lower subdword p_create_vector on GFX6/7 This is needed to lower some corner cases correctly, in case the same operand occurs multiple times: e.g. v0 = p_create_vector(v0[0:8], v0[0:8], v0[0:8], v0[0:8]) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5226>	2020-06-09 21:25:38 +00:00
Daniel Schürmann	9e8e12ea6d	aco: adjust GFX6 subdword lowering workarounds for 8bit Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5226>	2020-06-09 21:25:38 +00:00
Daniel Schürmann	b083581010	aco: Workarounds subdword lowering on GFX6/7 As there are no SDWA instructions, we need to take care not to overwrite the upper bits of other copy_operation's operands. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5226>	2020-06-09 21:25:38 +00:00
Daniel Schürmann	942e3c40c3	aco: use full-register instructions to implement subdword packing on GFX6/7 On GFX6/7, there are no SDWA instructions. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5226>	2020-06-09 21:25:38 +00:00
Daniel Schürmann	3f03db848d	aco: simplify statistics collection for copies Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5226>	2020-06-09 21:25:38 +00:00
Daniel Schürmann	0560831593	aco: fix register assignment for p_create_vector on GFX6/7 In case, some operand was already placed in the definition space, it could happen that it wasn't considered for live-range splits. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5226>	2020-06-09 21:25:38 +00:00
Mike Blumenkrantz	98d07bd5a0	zink: emit interpolation decorations for ntv outputs this matches up with nir internal states pre/post ntv Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5384>	2020-06-09 20:56:09 +00:00
Mike Blumenkrantz	ad8e61621b	zink: track program usages for each shader when shaders are created and destroyed in large numbers, the same pointers get reused for different shaders, which can lead to bad lookups in the program_cache hash table. now each shader tracks its program usage to automatically remove itself from that program in order to avoid hash collisions fixes mesa/mesa#3053 Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5315>	2020-06-09 20:30:25 +00:00
Erik Faye-Lund	48925f6927	zink: assert that image-view format isn't undefined Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5407>	2020-06-09 19:35:26 +00:00
Erik Faye-Lund	2d3c6605d6	zink: emulate B8G8R8X8_SRGB with B8G8R8A8_SRGB Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5407>	2020-06-09 19:35:26 +00:00
Eric Anholt	3e11f04d4e	turnip: Expose robustBufferAccess. It is a required device feature, and all enabled tests in dEQP-VK.robustness.* pass. Reviewed-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5266>	2020-06-09 18:28:18 +00:00
Eric Anholt	58dd904c59	turnip: Fix crashes in compute with no descriptors to load. Found when trying to rebase cheza VK CI on top of this change. Fixes: `334204823e` ("tu: Fix context faults loading unused descriptor sets") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5266>	2020-06-09 18:28:17 +00:00
Thong Thai	b2324f4560	frontends/vdpau: Default destination rect to source rect mpv is passing in a NULL destination_video_rect, which results in a black screen when playing videos using VDPAU in some cases. Signed-off-by: Thong Thai <thong.thai@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5386>	2020-06-09 18:02:09 +00:00
Marek Olšák	0795241dde	radeonsi: require LLVM 11 for gfx10.3 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5383>	2020-06-09 16:17:36 +00:00
Marek Olšák	9538b9a68e	radeonsi: add support for Sienna Cichlid Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5383>	2020-06-09 16:17:36 +00:00
Marek Olšák	789cdab3b6	ac: align num_vgprs for gfx10.3 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5383>	2020-06-09 16:17:36 +00:00
Marek Olšák	2cc4bfbe01	radeonsi: don't set any XNACK options on gfx10.3 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5383>	2020-06-09 16:17:36 +00:00
Marek Olšák	430d384c31	radeonsi: set BIG_PAGE fields on gfx10.3 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5383>	2020-06-09 16:17:36 +00:00
Marek Olšák	7edf15ad47	radeonsi: move L2_CACHE_CONTROL registers into si_emit_framebuffer_state the next commit will set more fields. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5383>	2020-06-09 16:17:36 +00:00
Marek Olšák	788696c7b2	radeonsi: implement R9G9B9E5 render target and image store support on gfx10.3 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5383>	2020-06-09 16:17:36 +00:00
Marek Olšák	a54bcb9429	radeonsi: enable larger SDMA clears and copies on gfx10.3 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5383>	2020-06-09 16:17:36 +00:00
Marek Olšák	c4b5fd9ab0	radeonsi: honor a user-specified pitch on gfx10.3 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5383>	2020-06-09 16:17:36 +00:00
Marek Olšák	abe89e1329	ac/surface: add displayable DCC code for gfx10.3 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5383>	2020-06-09 16:17:36 +00:00
Marek Olšák	a23802bcb9	ac,radeonsi: start adding support for gfx10.3 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5383>	2020-06-09 16:17:36 +00:00
Marek Olšák	a1602516d7	ac,radeonsi: replace == GFX10 with >= GFX10 where it's needed Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5383>	2020-06-09 16:17:36 +00:00
Marek Olšák	ceaf848c56	radeonsi: enable ARB_sparse_buffer This seems to be working now, but it wasn't working before. I don't know what fixed this. Tested on Raven and Navi14. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5402>	2020-06-09 16:00:38 +00:00
Connor Abbott	334204823e	tu: Fix context faults loading unused descriptor sets The app is allowed to never bind descriptor sets that are statically unused by the pipeline, which would've caused a context fault since CP_LOAD_STATE6 would try to load the descriptors that don't exist. Fix this by not preloading descriptors from unused descriptor sets. We could do more fine-grained accounting of which descriptors are used, but this is enough to fix the problem. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5400>	2020-06-09 15:35:29 +00:00
Danylo Piliaiev	a751051248	i965: Work around incorrect usage of glDrawRangeElements in UE4 Unreal Engine 4 has a bug in usage of glDrawRangeElements, causing it to be called with a number of vertices in place of "end" parameter (which specifies the maximum array index contained in indices). Since there is unknown amount of games affected and we could not identify that a game is built with UE4 - we are forced to make a blanket workaround, disregarding max_index in range calculations. Fortunately all such calls look like: glDrawRangeElements(GL_TRIANGLES, 0, 3, 3, ...); So we are able to narrow down this workaround. This was uncovered after `b684030c3a` broke a bunch of UE4 games. Cc: 20.1 <mesa-stable@lists.freedesktop.org> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2917 Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5203>	2020-06-09 15:04:35 +00:00
Connor Abbott	487aa807bd	tu: Rewrite flushing to use barriers Replace the various ad-hoc flushes that we've inserted, copied from freedreno, etc. with a unified system that uses the user-supplied information via vkCmdPipelineBarrier() and subpass dependencies. There are a few notable differences in behavior: - We now move setting RB_CCU_CNTL up a little in the gmem case, but hopefully that won't matter too much. This matches what the Vulkan blob does. - We properly implement delayed setting of events, completing our implementaton of events. - Finally, of course, we should be a lot less flush-happy. We won't emit useless CCU/cache flushes with multiple copies, renderpasses, etc. that don't depend on each other, and also won't flush/invalidate the cache around renderpasses unless we actually need to. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4964>	2020-06-09 14:40:52 +00:00
Connor Abbott	29abf49886	tu: Remove useless event_write helpers tu6_emit_cache_flush() was wrongly named, and with the removal of the last parameter both are useless. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4964>	2020-06-09 14:40:52 +00:00
Connor Abbott	f4f6a9be9f	tu: Don't actually track seqno's for events We just dropped the last user which actually cared about the seqno. This never worked anyway, since the seqno was never reset between multiple executions of the same command buffer. Turn the part of the control buffer which used to track the seqno into a dummy dword, and figure out automatically whether we need to include it. We will implement seqnos again eventually, with timline semaphores, but that will likely be totally different. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4964>	2020-06-09 14:40:52 +00:00
Connor Abbott	dfb176a0ac	tu: Remove useless post-binning flushes The Vulkan blob doesn't do this, and based on my understanding of how the blob works this is unnecessary. CACHE_FLUSH is already serialized against all 3d commands so you don't need to wait for rendering commands to finish before issuing it, and the subsequent wfi + WAIT_FOR_ME will cause the CP to wait for the CACHE_FLUSH to finish, so there's also no need to wait for it to complete. The CACHE_INVALIDATE also seems unnecessary, and also isn't done by the blob. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4964>	2020-06-09 14:40:51 +00:00
Icecream95	18c067f9f0	panfrost: Mark PIPE_BUFFER BOs as not renderable Without this, memory usage explodes by 16x due to height alignment. Closes: #2715 Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4451>	2020-06-09 13:52:52 +00:00
Pierre-Eric Pelloux-Prayer	db57624c0c	winsys/radeon: do not cast bo->va as void* Using a util_hash_table_create_ptr_keys to store bo->va address doesn't work on 32 bits. This commit makes radeon_drm_winsys::bo_vas a hash_table_u64 instead. Tested by Miklós Máté. CC: 20.1 <mesa-stable@lists.freedesktop.org> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3056 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5380>	2020-06-09 11:28:16 +02:00
Samuel Pitoiset	d7923c74d4	radv/llvm: expose VK_EXT_shader_demote_to_helper_invocation with LLVM 9+ It should already work with the LLVM backend. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5361>	2020-06-09 08:04:23 +02:00
Marek Olšák	d76e8131ac	glthread: sync in glFlush for multiple contexts See the code comment. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5382>	2020-06-09 05:07:01 +00:00
Marek Olšák	90c34aed1d	gallium/u_vbuf: add a faster path for uploading non-interleaved attribs +1% higher FPS in torcs. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5304>	2020-06-09 00:45:26 -04:00
Marek Olšák	88e8f1a38d	gallium/u_vbuf: get rid of some pointer dereferences Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5304>	2020-06-09 00:45:23 -04:00
Ben Skeggs	a6c747e8e0	nir: use bitfield_insert instead of bfi in nir_lower_double_ops NVIDIA hardware doesn't have an equivilant to bfi, but we do already have a lowering for bitfield_insert->bfi. Signed-off-by: Ben Skeggs <bskeggs@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5373>	2020-06-09 08:38:22 +10:00
Jonathan Marek	7b4f0eadc1	turnip: fix VFD_CONTROL for binning pass Fixes some cases with TU_DEBUG=forcebin, specifically the failures in: dEQP-VK.glsl.*_vertex Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5370>	2020-06-08 16:28:48 -04:00
Jonathan Marek	ab72c07aef	turnip: use common emit_xs_cntl to fill a6xx_sp_xs_ctrl_reg0 Note this changes the value of SP_GS_CTRL_REG0, by using FOUR_QUADS and setting MERGEDREGS. ir3 expects MERGEDREGS, and using FOUR_QUADS instead of TWO_QUADS doesn't seem to hurt. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5370>	2020-06-08 16:28:43 -04:00
Jonathan Marek	e16608e233	turnip: fix HW binning with geometry shader Fixes failures with TU_DEBUG=forcebin and geometry shaders, for example: dEQP-VK.binding_model.geometry dEQP-VK.transform_feedback.simple.query* Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5370>	2020-06-08 16:23:05 -04:00
Jonathan Marek	6ac4d778fa	turnip: correctly emit non-binning vs in transform feedback case The offset given to tu6_emit_shader_object was wrong, binning_vs_offset should only be used when using the binning pass vs. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5370>	2020-06-08 16:23:05 -04:00
Jonathan Marek	6cc95abb27	freedreno/a6xx: use nonbinning VS when GS is used The current "ds = state->bs" seems broken, and the "vs = state->bs" is unnecessary (already set above). Since it was added as part of a GS-related patch, I think this is what was intended. Note: tesselation disables GMEM rendering so we shouldn't have to worry about hs/ds + binning interaction. Fixes: `0eebedb619` ("freedreno/a6xx: Emit program state for GS") Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5370>	2020-06-08 16:23:05 -04:00
Jonathan Marek	88d5917cc1	turnip: clamp sampler minLod/maxLod Otherwise A6XX_TEX_SAMP_1_{MIN,MAX}_LOD silently overflows. This fixes these tests: dEQP-VK.texture.explicit_lod.2d.derivatives.* Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5375>	2020-06-08 19:52:41 +00:00
Jonathan Marek	fecd83a0e8	turnip: update some properties based on blob driver subTexelPrecisionBits/mipmapPrecisionBits change fixes some failures in: dEQP-VK.texture.explicit_lod.* Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5375>	2020-06-08 19:52:41 +00:00
Jonathan Marek	8c26c9eed8	turnip: move HLSQ_UPDATE_CNTL write to before xs config writes This matches the blob and gallium driver more closely, and fixes a rendering issue observed on a650. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5376>	2020-06-08 18:08:23 +00:00
Caio Marcelo de Oliveira Filho	d1f6d2f3e8	nir: Fix logic that ends combine barrier sequence The combination must stop when we see a scoped barrier that have execution scope, i.e. it has control barrier behavior. The code was mistakenly looking at the wrong scope. Fixes: `345b5847b4` ("nir: Replace the scoped_memory barrier by a scoped_barrier") Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5365>	2020-06-08 15:49:24 +00:00
Caio Marcelo de Oliveira Filho	fe214d60bc	intel/fs: Add Fall-through comment Just to clarify the missing break is intentional. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5365>	2020-06-08 15:49:24 +00:00
Caio Marcelo de Oliveira Filho	e5bb4b1ee8	spirv: Memory semantics is optional for OpControlBarrier Fixes: `3ed2123d77` ("spirv: Use scoped barriers for SpvOpControlBarrier") Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5365>	2020-06-08 15:49:24 +00:00
Caio Marcelo de Oliveira Filho	b7a3821a5c	nir: Fix printing execution scope of a scoped barrier Fixes: `345b5847b4` ("nir: Replace the scoped_memory barrier by a scoped_barrier") Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5365>	2020-06-08 15:49:24 +00:00
Christian Gmeiner	7ec2582087	etnaviv: drop translate_blend(..) PIPE_BLEND_* matches 1:1 the hardware defines. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Lucas Stach <l.stach@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4187>	2020-06-08 15:35:13 +00:00
Danylo Piliaiev	9f1cf0e491	glsl: inline functions with unsupported return type before converting to nir glsl_to_nir doesn't expect non-vector/scalar return types in functions. Fixes: `7e60d5a501` Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3058 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3060 Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Tested-by: Witold Baryluk <witold.baryluk@gmail.com> Reviewed-by: Witold Baryluk <witold.baryluk@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5333>	2020-06-08 15:01:50 +00:00
Rhys Perry	43e69475ad	aco: use v_xor3_b32 fossil-db (Navi): Totals from 334 (0.26% of 128321) affected shaders: CodeSize: 3345532 -> 3345484 (-0.00%); split: -0.00%, +0.00% Instrs: 624662 -> 622778 (-0.30%); split: -0.30%, +0.00% Mostly affects some parallel-rdp shaders Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5357>	2020-06-08 13:20:01 +00:00
Rhys Perry	1234faa7bf	ac/gpu_info, radv: set max_wave64_per_simd to 20 on GFX10 Fixes RADV max_waves reporting for GFX10 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5356>	2020-06-08 10:26:59 +00:00
Samuel Pitoiset	86f21e4eba	nir/lower_explicit_io: fix NON_UNIFORM access for UBO loads Make sure to propagate the NON_UNIFORM access for UBO loads, so that non-uniform loads are correctly lowered. Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5311>	2020-06-08 07:35:43 +00:00
Arcady Goldmints-Orlov	b38d3cdcea	nir/spirv/glsl450: increase asin(x) precision asin(x) is now implemented using a piecewise approximation, which improves the precision for \|x\| < 0.5 Previously, we were using a polynomial approximation for both the asin() and acos() functions. Unfortunately, for asin(), this polynomial does not have enough precision to satisfy the Vulkan CTS requiremenents, which define the asin() precision based on the precision of atan2(x, sqrt(1.0 - x*x)). The piecewise approximation gives the needed precision in the problematic range. v2: Skip the piecewise approximation for acos Closes: #1843 Acked-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3809>	2020-06-08 07:10:17 +00:00
Samuel Pitoiset	008b0d1701	ac/nir: adjust an assertion for D16 on GFX6-GFX7 16-bit types can be used with MUBUF on GFX6-GFX7. Fixes: `c3e0ba52a0` ("ac/nir: support 16-bit data in buffer_load_format opcodes") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5325>	2020-06-08 08:45:32 +02:00
Peter Seiderer	b3beb6207f	v3d_bufmgr: fix time_t printf Fixes: error: format ‘%ld’ expects argument of type ‘long int’, but argument 3 has type ‘time_t’ {aka ‘long long int’} Signed-off-by: Peter Seiderer <ps.report@gmx.net> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4279>	2020-06-07 18:08:47 +02:00
Peter Seiderer	d512028d06	pan_bo.h: add time.h include for time_t Fixes: ../src/gallium/drivers/panfrost/pan_bo.h:93:9: error: unknown type name ‘time_t’ Signed-off-by: Peter Seiderer <ps.report@gmx.net> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4279>	2020-06-07 18:08:17 +02:00
Peter Seiderer	07ba5e47e6	vc4_bufmgr: fix time_t printf Fixes: error: format ‘%ld’ expects argument of type ‘long int’, but argument 3 has type ‘time_t’ {aka ‘long long int’} Signed-off-by: Peter Seiderer <ps.report@gmx.net> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4279>	2020-06-07 18:07:47 +02:00
Timothy Arceri	e43ab7bb05	glsl: fix potential slow compile times for GLSLOptimizeConservatively See code comment for full description of the change. Fixes: `0a5018c1a4` ("mesa: add gl_constants::GLSLOptimizeConservatively") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3034 Tested-by: Witold Baryluk <witold.baryluk@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5346>	2020-06-07 03:28:30 +00:00
Charmaine Lee	dd81f4853c	llvmpipe: do not enable tessellation shader without llvm coroutines support Tessellation shader in llvmpipe depends on llvm coroutines support. So do not advertise tessellation shader support in llvmpipe if GALLIVM_HAVE_CORO is FALSE. This fixes assertion in LLVMTokenTypeInContext() running tessellation shader tests with llvm version < 6. Fixes: `eb522717` "llvmpipe: add support for tessellation shaders" Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Neha Bhende <bhenden@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5366>	2020-06-06 22:45:21 +00:00
Marcin Ślusarz	990b3782bc	intel/compiler: fix Android build Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Fixes: `689acc7398` ("intel/compiler: Extract control barriers from scoped barriers") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3087 Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5354>	2020-06-06 18:42:03 +00:00
Vinson Lee	6a841dbf4e	intel/genxml: Migrate from deprecated xml.etree.ElementTree getchildren. xml.etree.ElementTree getchildren was deprecated since Python 2.7 and will be removed in Python 3.9. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5348>	2020-06-06 08:20:13 +00:00
Mauro Rossi	06650a771d	android: svga: fix build for GL4.1 support Fixes the following building errors: external/mesa/src/gallium/drivers/svga/svga_context.c:184: error: undefined reference to 'svga_init_ts_functions' external/mesa/src/gallium/drivers/svga/svga_context.c💯 error: undefined reference to 'svga_cleanup_tcs_state' out/target/product/x86_64/obj_x86/STATIC_LIBRARIES/libmesa_pipe_svga_intermediates/libmesa_pipe_svga.a(svga_state.o):svga_state.c:hw_draw_state_sm5: error: undefined reference to 'svga_hw_tes' out/target/product/x86_64/obj_x86/STATIC_LIBRARIES/libmesa_pipe_svga_intermediates/libmesa_pipe_svga.a(svga_state.o):svga_state.c:hw_draw_state_sm5: error: undefined reference to 'svga_hw_tcs' Fixes: `ccb4ea5a` "svga: Add GL4.1(compatibility profile) support in svga driver" Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5364>	2020-06-06 09:32:25 +02:00
Mauro Rossi	0570c7a7b5	android: util: fix build for GL4.1 support Fixes the following building errors: external/mesa/src/gallium/drivers/svga/svga_state_tgsi_transform.c:154: error: undefined reference to 'tgsi_write_vpos' external/mesa/src/gallium/drivers/svga/svga_state_tgsi_transform.c:201: error: undefined reference to 'tgsi_remove_dynamic_indexing' Fixes: `48a7456f` ("util: Add util functionality for GL4.1 support") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5364>	2020-06-06 09:32:19 +02:00
Vinson Lee	faa339e666	Switch from cElementTree to ElementTree. The xml.etree.cElementTree module will be removed in Python 3.9. Since Python 3.3 the xml.etree.cElementTree module has been deprecated, the xml.etree.ElementTree module uses a fast implementation whenever available. Builds using Python 2.7 can still work but with the slower implementation. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Acked-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5349>	2020-06-05 23:42:54 -07:00
Icecream95	a61532e4db	Revert "panfrost: Keep cached BOs mmap'd" This reverts commit `794c239a99`. A kernel bug causes cached BOs to not be unmapped correctly, triggering "bad page cache" kernel messages and causing short hangs. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5355>	2020-06-05 23:51:32 +00:00
Icecream95	d97aaad155	pan/midgard: Use a signed value for checking inline constants Inline constants are sign extended, so we should use a int16_t instead of an unsigned type. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5351>	2020-06-05 22:51:45 +00:00
Eric Anholt	0bacb280a8	freedreno/ir3: Handle cases where we decide not to lower UBO 0 loads. We advertize 4096 vec4s of GL uniform storage, but the HW can only store 512 vec4s in the const buffer. Closes: #3049 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5273>	2020-06-05 13:43:30 -07:00
Eric Anholt	e349f50279	freedreno/ir3: Drop the max_const on a6xx to 512. The GLES blob on the p3a limits constlen to 512 between VS and FS across a6xx gpu ids (615, 630, 640, and 650). Experimentally, exceeding that limit in any one stage results in rendering corruption or GPU hangs (though my most detailed testing had a loop limit in a uniform, so that may the cause of the hang). Clamp the limit we use inside of a shader so we don't exceed it within a stage. This commit doesn't resovle limiting inter-stage. Experimentally, I've found that I can push up to a total of ~768 vec4s between VS and FS on a630, with or without uniform updates between each draw. We'll need to do some shader key-based limiting of constlen at draw time to respect that limit, but that's left for future work, and this commit is enough for the google earth case that initiated this work. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5273>	2020-06-05 13:36:29 -07:00
Eric Anholt	486b894307	freedreno/ir3: Account for driver params in UBO max const upload. The const state setup needs to be able to push its driver params, so account for them in the analyze_ubo_ranges. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5273>	2020-06-05 13:36:29 -07:00
Eric Anholt	a25347ab92	freedreno/ir3: Stop shifting UBO 1 down to be UBO 0. It turns out the GL uniforms file is larger than the hardware constant file, so we need to limit how many UBOs we lower to constbuf loads. To do actual UBO loads, we'll need to be able to upload UBO 0's pointer or descriptor. No difference on nohw 1 UBO update drawoverhead case (n=35). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5273>	2020-06-05 13:36:29 -07:00
Eric Anholt	9e58ab09ff	freedreno/ir3: Drop unnecessary alignment of pushed UBO size. The analysis pass gives us vec4-aligned size, and all of our other constbuf allocations here are in vec4 units, so we can just divide by 16. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5273>	2020-06-05 13:36:29 -07:00
Eric Anholt	07ec745014	freedreno/ir3: Stop pushing immediates once we've filled the constbuf. If we filled the constbuf up with UBOs, we may need to avoid generating more immediate push constants. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5273>	2020-06-05 13:36:29 -07:00
Eric Anholt	ab29f2da42	freedreno/ir3: Refactor ir3_cp's lower_immed(). There was duplicated handling in the callers that we can just move inside. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5273>	2020-06-05 13:36:29 -07:00
Eric Anholt	4065861807	freedreno: Upload gallium constbufs as needed when referenced as a UBO. For now we never ask to set up UBO 0 as a real UBO, so this doesn't trigger, but it gets us ready for handling the case where UBO 0 is too big to be push constants in the HW. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5273>	2020-06-05 13:36:29 -07:00
Eric Anholt	d1f9d1e26a	freedreno/a6xx: Add support for ALPHA_TO_ONE. Fixes piglit ext_framebuffer_multisample-draw-buffers-alpha-to-one Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5343>	2020-06-05 18:11:22 +00:00
Eric Anholt	ac1ab9294a	turnip: Add support for alphaToOne. Comparing a blob trace using the feature to one not, the difference was pretty obvious and in the spot you'd expect compared to alphaToCoverage. The SP_ reg didn't have a corresponding bit set, though it also has an alphaToCoverage. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5343>	2020-06-05 18:11:22 +00:00
Eric Anholt	79f3003445	turnip: Use tu_cs_emit_regs() for BLEND_CONTROL. Just a cleanup since I was in the area. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5343>	2020-06-05 18:11:22 +00:00
Rhys Perry	5d13c7477e	radv: set keep_statistic_info with RADV_DEBUG=shaderstats Needed for RADV_DEBUG=shaderstats to dump ACO statistics. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5358>	2020-06-05 15:11:01 +00:00
Eric Engestrom	981d07c74a	intel: fix gen_sort_tags.py The script was failing for me (python 3.8), not sure if this is a recent python version break or not as I don't know how often people have been running this script: Processing ./gen9.xml... Traceback (most recent call last): File "./gen_sort_tags.py", line 177, in <module> main() File "./gen_sort_tags.py", line 170, in main genxml[:] = enums + sorted_structs.values() + instructions + registers TypeError: can only concatenate list (not "odict_values") to list Turning the odict into a list fixes it for me, and the resulting xml file are identical to before :) Fixes: `903e142f0d` ("genxml: add a sorting script") Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5352>	2020-06-05 14:31:13 +00:00
Samuel Pitoiset	bfff330f06	radv/aco: enable VK_KHR_shader_subgroup_extended_types on GFX6-GFX7 CTS pass on Pitcairn (GFX6). This extension isn't really useful without 8-bit/16-bit storage though but this is going to be exposed soon. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5327>	2020-06-05 16:04:08 +02:00
Samuel Pitoiset	6391f9ab4c	aco: fix nir_intrinsic_quad_* with 8-bit in GFX6-GFX7 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5327>	2020-06-05 16:04:06 +02:00
Samuel Pitoiset	e1523b34c2	aco: fix sign-extend 8-bit subgroup operations on GFX6-GFX7 SDWA is GFX8+. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5327>	2020-06-05 16:04:05 +02:00
Samuel Pitoiset	ee4bc13de2	aco: use v_bfe_u32 for unsigned reductions sign-extension on GFX6-GFX7 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5327>	2020-06-05 16:04:03 +02:00
Eric Engestrom	a874132cc4	intel/genxml: drop sort_xml.sh and move the loop directly in gen_sort_tags.py Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5353>	2020-06-05 13:44:18 +00:00
Bas Nieuwenhuizen	c67ef7695a	radv: Use ac_surface to allocate aux surfaces. For consistency and a bunch of codesharing. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5194>	2020-06-05 13:27:55 +00:00
Bas Nieuwenhuizen	63db31fdfc	amd/common: Add total alignment calculation. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5194>	2020-06-05 13:27:55 +00:00
Bas Nieuwenhuizen	f70b577683	radv: Allocate values/predicates at the end of the image. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5194>	2020-06-05 13:27:55 +00:00
Bas Nieuwenhuizen	ec671e8718	radv: Disable HTILE in ac_surface. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5194>	2020-06-05 13:27:55 +00:00
Bas Nieuwenhuizen	f84b4e2639	radv: Disable DCC in ac_surface. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5194>	2020-06-05 13:27:55 +00:00
Bas Nieuwenhuizen	81dee6cf8f	radv: Use offsets in surface struct. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5194>	2020-06-05 13:27:55 +00:00
Bas Nieuwenhuizen	ffae3589c9	radv: Rely on ac_surface for avoiding cmask for linear images. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5194>	2020-06-05 13:27:55 +00:00
Bas Nieuwenhuizen	b5488a863c	radv: Enforce the contiguous memory for DCC layers in ac_surface. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5194>	2020-06-05 13:27:55 +00:00
Bas Nieuwenhuizen	d3db633f6d	radv: Pass no_metadata_planes info in to ac_surface. Also do not allocate aux surfaces for multi-plane images. I may have messed up and used plane 1 offsets for the other planes as well. I cannot imagine that sharing aux surfaces between the planes will work well. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5194>	2020-06-05 13:27:55 +00:00
Bas Nieuwenhuizen	599ea341dd	radv: Use ac_surface to determine fmask enable. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5194>	2020-06-05 13:27:55 +00:00
Rob Clark	ef5b8bbc5e	freedreno/computerator: fix missing dependency on generated header Fixes: ``` ../mesa-freedreno-20.2.0_pre/src/freedreno/computerator/ir3_asm.c:25:10: fatal error: 'ir3/ir3_parser.h' file not found #include "ir3/ir3_parser.h" ^~~~~~~~~~~~~~~~~~ 1 error generated. ``` Fixes: `da467817e3` ("freedreno/ir3: Move ir3 assembler to backend compiler") Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5340>	2020-06-05 09:48:47 +00:00
Eric Engestrom	7a68045b5d	glapi: remove deprecated .getchildren() that has been replace with an iterator Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3086 Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Vinson Lee <vlee@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5342>	2020-06-05 09:16:13 +00:00
Samuel Pitoiset	c9a9b363ce	radv/aco: enable 64-bit atomic features if RADV is linked with LLVM 8 Just in case someone links RADV with this old LLVM 8 and wants ACO. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5331>	2020-06-05 07:40:29 +00:00
Neha Bhende	ba37d408da	svga: Performance fixes This is a squash commit of in house performance fixes and misc bug fixes for GL4.1 support. Performance fixes: * started using system memory for constant buffer to gain 3X performance boost with metro redux Misc bug fixes: * fixed usage of vertexid in shader * added empty control point phase in hull shader for zero ouput control point * misc shader signature fixes * fixed clip_distance input declaration * clearing the dirty bit for the surface while using direct map if surface is already flushed and there is no pending primitive This patch also uses SVGA_RETRY macro for commands retries. Part of it is already used in previous patch. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com> Signed-off-by: Neha Bhende <bhenden@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5317>	2020-06-05 06:36:54 +00:00
Neha Bhende	ccb4ea5a43	svga: Add GL4.1(compatibility profile) support in svga driver This patch is a squash commit of a very long in-house patch series. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com> Signed-off-by: Neha Bhende <bhenden@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5317>	2020-06-05 06:36:54 +00:00
Neha Bhende	52ce25be87	svga/include: Headers for GL4.1 support This brings in the new types, enums and #defines for GL 4.1 features in the virtual device. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com> Signed-off-by: Neha Bhende <bhenden@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5317>	2020-06-05 06:36:54 +00:00
Neha Bhende	dc3505f87e	winsys/drm: Add GL4.1 support in drm winsys This is to check whether virtual hardware has SM5 support Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com> Signed-off-by: Neha Bhende <bhenden@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5317>	2020-06-05 06:36:54 +00:00
Neha Bhende	48a7456f4d	util: Add util functionality for GL4.1 support This patch adds the following tgsi utilities * tgsi_dynamic_indexing: This utility flattens out the dyanamic indexing of constant buffers * tgsi_vpos: This utility writes zeros to position at index 0 in vertex shader. This utility can be used if there is no shader output in vertex shader * util_make_tess_ctrl_passthrough_shader: This adds passthough tessellation control shader. Input of passthrough tess ctrl shader is output of vertex shader and output is input of tessellation eval shader. If program has tessellation eval shader but no tessellation control shader, this utility can be used to create passthrough tessellation control shader. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com> Signed-off-by: Neha Bhende <bhenden@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5317>	2020-06-05 06:36:54 +00:00
Rob Clark	f1f81abfd4	freedreno/a6xx: more early-z Technically we only have to do late-z in the alpha-test or discard case if depth-write is enabled. If depth write is disabled, the depth read / test / conditional-write interlock that we need to emulate is not a problem, so we can still use early-z test. There is a slightly weird case when there is no zsbuf attachment (see dEQP-GLES31.functional.fbo.no_attachments.*) where the hw wants us to use LATE_Z.. not entirely sure if this is an interaction with occlusion query or just a pecularity of how the hw works when there is no depth buffer. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5336>	2020-06-05 00:57:44 +00:00
Eric Anholt	ec98cff6a9	turnip: Simplify vertex buffer bindings. We were remapping the bindings so the HW binding points were consecutive, which there's no need for. Now that we don't shuffle, we can mostly drop the dependency on the pipeline for this SDS. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5321>	2020-06-04 19:42:54 +00:00
Eric Anholt	5c9728d960	turnip: Don't bother clamping VB size. From the VK spec: "All elements of pOffsets must be less than the size of the corresponding element in pBuffers" Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5321>	2020-06-04 19:42:54 +00:00
Eric Anholt	52942f18c6	turnip: Move vertex buffer bindings to SET_DRAW_STATE. This means that the HW can skip over the vertex buffer state when it's not used in a bin. The blob also has this behavior. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5321>	2020-06-04 19:42:54 +00:00
Dave Airlie	c8c7450fc7	llvmpipe: move coroutines out of noopt case the virgl CI code was using the noopt path and crashing with a wierd can't select llvm.coro.subfn.addr error, turns out we have to call the cleanup pass no matter what. This enable a lot more virgl gles31 passes, but we have to disable tessellation shaders as now they executed, they crash due to missing OES_gpu_shader5, I should try and reenable them when llvmpipe is further along Fixes: `d32690b43c` ("gallivm: add coroutine pass manager support") Reviewed-by: Roland Scheidegger <sroland@vmware.com> Acked-by: Elie Tournier <elie.tournier@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5320>	2020-06-04 19:08:34 +00:00
Alyssa Rosenzweig	2d1688345a	pan/mdg: Ensure ld_vary_16 is aligned Otherwise packing may fail. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Fixes: `5f8dd413bc` ("pan/mdg: Handle 16-bit ld_vary") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5339>	2020-06-04 17:46:45 +00:00
Kristian H. Kristensen	de8be1de13	freedreno/a6xx: Fix VFD_CONTROL emit The FETCH_CNT field isn't actually the FETCH count. We don't have a lot of data where it's different from DECODE_CNT, so there's not much to go by. It could be number of VFD_DEST_CNTL or maybe DECODE_CNT for binning. For now, setting both to number of DEST_CNTL gets Google Earth working again. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5324>	2020-06-04 15:50:41 +00:00
Clément Guérin	202252566b	radv: Always expose non-visible local memory type on dedicated GPUs DOOM Eternal expects this type, but RADV doesn't expose it when the VRAM is entirely host-visible, in my case on Fiji. Matches AMDVLK behavior. Closes: https://gitlab.freedesktop.org/mesa/mesa/issues/3054 Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5308>	2020-06-04 15:16:30 +00:00
Alyssa Rosenzweig	622e3a8510	pan/mdg: Legalize inverts with constants We need to force src_invert to be in the right place even if we flip when lowering an embedded->inline constant. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Fixes: `449e5ded93` ("pan/mdg: Treat inot as a modifier") Reported-by: Icecream95 <ixn@keemail.me> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5299>	2020-06-04 13:17:11 +00:00
Erik Faye-Lund	e61a98877c	nir: reuse existing psiz-variable For shaders where there's already a psiz-variable, we should rather reuse it than create a second one. This can happen if a shader writes gl_PointSize, but disables GL_PROGRAM_POINT_SIZE. Fixes: `878c94288a` ("nir: add lowering-pass for point-size mov") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5328>	2020-06-04 09:12:54 +00:00
Lionel Landwerlin	57e4d0aa1c	i965: fix export of GEM handles We reuse DRM file descriptors internally. Therefore when we export a GEM handle we must do so in the file descriptor used externally. v2: Fix dmabuf leak Fix GEM handle leaks by tracking exported handles v3: Check os_same_file_description error (Michel) Don't create multiple exports for a given GEM table v4: Add WARN_ONCE (Ken) v5: Remove blank line (Ian) Remove unused field (Ian) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2882 Fixes: `4094558e86` ("i965: share buffer managers across screens") Tested-by: Eric Engestrom <eric@engestrom.ch> Tested-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4861>	2020-06-04 07:31:38 +00:00
Lionel Landwerlin	aba3aed96e	iris: fix export of GEM handles We reuse DRM file descriptors internally. Therefore when we export a GEM handle we must do so in the file descriptor used externally. This change also fixes a file descriptor leak of the FD given at screen creation. v2: Don't bother checking fd equals, they're always different Fix dmabuf leak Fix GEM handle leaks by tracking exported handles v3: Check os_same_file_description error (Michel) Don't create multiple exports for a given GEM table v4: Add WARN_ONCE (Ken) Rename external_fd to winsys_fd v5: Remove export lock in favor of bufmgr's Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2882 Fixes: `7557f16059` ("iris: share buffer managers accross screens") Tested-by: Eric Engestrom <eric@engestrom.ch> Tested-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4861>	2020-06-04 07:31:38 +00:00
Lionel Landwerlin	e41e820648	i965: don't forget to set screen on duped image We'll start using this field more for querying image properties. Without it we run into a crash. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4861>	2020-06-04 07:31:38 +00:00
Lionel Landwerlin	604a86e46f	iris: fix BO destruction in error path Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: <mesa-stable@lists.freedesktop.org> Tested-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4861>	2020-06-04 07:31:38 +00:00
Vinson Lee	c3025bde19	mesa: Fix NetBSD compiler macro. Reported-by: Rafał Mikrut <mikrutrafal54@gmail.com> Fixes: `a63b90712a` ("mesa: also check for __NetBSD__") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3015 Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5191>	2020-06-03 21:09:54 -07:00
Rob Clark	e9cda38031	freedreno/a6xx: also consider alpha-test for ztest-mode Looks like we don't have CI coverage for this (since deqp==GLES) but alpha test is conceptually the same as frag shaders with discard, and should be handled as such. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5298>	2020-06-04 02:34:54 +00:00
Rob Clark	1e3731e711	freedreno/a6xx: add early-lrz-late-z mode Now that we are doing a better job of managing LRZ, add support for the EARLY_LRZ_LATE_Z mode. Since we properly disable LRZ write in cases where we don't know a fragment's z value during the binning pass (or when blend is enabled in a later draw, meaning we will need the earlier fragment's color), we can enable a mode that keeps the early-lrz test when the frag shader has kill/discard. This will only discard geometry that is definitely not visible. This is a pretty big win for games/benchmarks that have a lot of frag shaders with kill/discard. More than 10% gain for gfxbench trex/mh and 40% gain for mh31. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5298>	2020-06-04 02:34:54 +00:00
Rob Clark	07887c9f34	freedreno/a6xx: re-work LRZ state tracking In particular, properly detect reversal of depth-test direction. With that we can remove a lot of cases where we were unnecessarily invalidating LRZ, which was simply papering over the direction- reversal issue in deqp. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5298>	2020-06-04 02:34:54 +00:00
Rob Clark	27e501bcfc	freedreno/a6xx: update depth-plane control regs And document the early-lrz-late-z mode. Initially I thought this would be two bits to control early-lrz vs early-z. But having early-z without early-lrz does not make sense, and the way the values line up makes an enum fit better. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5298>	2020-06-04 02:34:54 +00:00
Rob Clark	f6307426ed	freedreno/a6xx: sync registers from envytools Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5298>	2020-06-04 02:34:54 +00:00
Rob Clark	ebcf3545db	freedreno/ir3: split kill from no_earlyz Unlike other conditions which prevent early-discard of fragments, kill does not prevent early LRZ test. Split `has_kill` from `no_earlyz` so we can take advantage of this. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5298>	2020-06-04 02:34:54 +00:00
Kristian H. Kristensen	5fb7cad95c	freedreno/a6xx: Turn on robustness extensions With UBO access going through LDC, all memory access uses buffer based io primitives. We can then advertise PIPE_CAP_ROBUST_BUFFER_ACCESS_BEHAVIOR and PIPE_CAP_DEVICE_RESET_STATUS_QUERY, which turn on GL_EXT_robustness, GL_KHR_robust_buffer_access_behavior and GL_KHR_robustness. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5319>	2020-06-04 00:56:20 +00:00
Vinson Lee	8b353524b0	vdpau: Fix wrong calloc sizeof argument. Fix warning reported by Coverity Scan. Wrong sizeof argument (SIZEOF_MISMATCH) suspicious_sizeof: Passing argument 3544UL (sizeof (vlVdpPresentationQueue)) to function calloc that returns a pointer of type vlVdpPresentationQueueTarget * is suspicious because a multiple of sizeof (vlVdpPresentationQueueTarget) /16/ is expected. Fixes: `65fe0866ae` ("vl: implemented a few functions and made stubs to get mplayer running") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3026 Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5182>	2020-06-03 17:01:47 -07:00
Francisco Jerez	8252bb0ec6	OPTIONAL: iris: Perform BLORP buffer barriers outside of iris_blorp_exec() hook. The iris_blorp_exec() hook needs to be executed under a single indivisible sync region, which means that in cases where we need to emit a PIPE_CONTROL for a buffer barrier we won't be able to track the subsequent commands separately from the previous commands, which will prevent us from optimizing out subsequent PIPE_CONTROLs if we encounter the same buffers again. In particular I've encountered this situation in some SynMark test-cases which perform lots of BLORP operations with the same buffer bound as both source and destination (in order to generate mipmaps): In such a scenario if the source requires flushing we'd also end up flushing for the destination redundantly, even though a single PIPE_CONTROL would have been sufficient. This avoids a 4.5% FPS regression in SynMark OglHdrBloom and a 3.5% FPS regression in SynMark OglMultithread. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	4b00338bde	iris: Remove iris_flush_depth_and_render_caches(). This helper is unused now. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	46adb83a29	iris: Emit single render target flush PIPE_CONTROL on format mismatch. The big-hammer iris_flush_depth_and_render_caches() is largely redundant whenever a format mismatch is detected from iris_cache_flush_for_render(). There is no need to kick the depth, sampler nor constant caches in that case. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	b928188493	iris: Open-code iris_cache_flush_for_read() and iris_cache_flush_for_depth(). These have become one-liners now so they can be easily inlined. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	74c774dce9	iris: Remove render cache hash table-based synchronization. The render cache hash table is now mostly redundant with the more general seqno matrix-based cache tracking mechanism. Most hash table operations are now gone except for the format mismatch checks done in iris_cache_flush_for_render(). Redundant code removed as a separate patch for bisectability. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	aa78d05a23	iris: Remove depth cache set tracking and synchronization. The depth cache set is now redundant with the more general seqno matrix-based cache tracking mechanism. Removed as a separate patch for bisectability. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	6b98072511	iris: Perform compute predraw flushes from compute batch. Whenever iris_predraw_resolve_inputs() ends up doing a flush or invalidate, we really want it to be on the same batch which is going to consume the result. Any resolves should still be performed from the render batch thanks to the previous patch. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	8e8198f349	iris: Remove batch argument of iris_resource_prepare_access() and friends. The resolves performed by this function are only expected to work from the render batch, so make sure we use it independently of the batch the caller wants to use. This function provides no synchronization guarantees anyway, the caller is expected to insert any cache flushing and synchronization required for the resolved surface to be visible to the target batch. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	878c770d13	iris: Insert buffer barrier in existing cache flush helpers. As a first step to phasing out the current hashtable-based depth and render cache tracking mechanisms. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	e226590898	iris: Implement buffer-local memory barrier based on cache coherency matrix. This takes advantage of the previously introduced cache tracking infrastructure in order to define a multi-purpose barrier operation that allows the caller to order memory operations with respect to previous operations performed on the same buffer from any other cache domain. v2: Assorted CPU overhead micro-optimizations (Francisco). v3: Use C99 designated initializers (Ken). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	8a6349eb86	iris: Update cache coherency matrix on PIPE_CONTROL. This introduces a batch synchronization boundary at every PIPE_CONTROL command, and updates the cache coherency status tracked during batch construction according to the specified control bits. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	fc221875cf	iris: Introduce cache coherency matrix for batch-local memory ordering. This introduces a representation of the cache coherency status of the GPU at any point in the batch. This is done by defining a matrix C of synchronization sequence numbers such that at any point of batch construction, a memory operation from domain i introduced into the batch is guaranteed to be ordered after any memory operation from domain j in a previous batch section with seqno n if the following condition holds: C_i_j >= n This allows us to efficiently determine whether additional flushing and/or invalidation is required in order to access a buffer object from some arbitrary domain. Except for batch buffer reset which requires clearing the whole matrix, all operations on the matrix are either O(n) or O(1) on the number of caching domains (which is basically constant). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	4b7fd91be6	iris: Report use of any in-flight buffers on first draw call after sync boundary. This is the main performance trade-off of this cache tracking mechanism: In order for the seqno vector of buffer objects to be accurate, they need to be marked as used again every time the batch is split into a new synchronization section if they remain bound to the pipeline. This can be achieved easily by re-using iris_restore_render_saved_bos() and iris_restore_compute_saved_bos(), which currently serve a similar purpose across batch buffer boundaries. The impact on Piglit drawoverhead results seems to be within a standard deviation of the current results. XXX - It might be possible to completely remove the current iris_batch::contains_draw flag at a small additional performance cost. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	ae88e79f69	iris: Drop redundant iris_address::write flag. The write flag is redundant since it can be inferred easily from the iris_address::access domain. This allows the iris_address struct to be laid out more efficiently in memory, leading to a measurable improvement in several Piglit Drawoverhead test-cases. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	eb5d1c2722	iris: Annotate all BO uses with domain and sequence number information. Probably the most annoying patch to review from the whole series -- Mark every buffer object use as accessed through some caching domain with the sequence number of the current synchronization section of the batch. The additional argument of iris_use_pinned_bo() makes sure I'd have gotten a compile error if I had missed any buffer added to the batch validation list. There are only a few exceptions where a buffer is left untracked while adding it to the validation list, justified below: - Batch buffers: These are strictly read-only for the moment. - BLORP buffer objects: Their seqnos are bumped manually at the end of iris_blorp_exec() instead, in order to avoid plumbing domain information through BLORP address combining. - Scratch buffers: The contents of these are strictly thread-local. - Shader images and SSBOs: Accesses of these buffers are explicitly synchronized at the API level. v2: Opt out of tracking more aggressively (Ken): In addition to the above, surface states, binding tables, instructions and most dynamic states are now left untracked, which means a lot more BO uses marked IRIS_DOMAIN_NONE which need to be reviewed extremely carefully, since the cache tracker won't be able to provide any coherency guarantees for them. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	e81c07de41	iris: Bracket batch operations which access memory within sync regions. This delimits all batch operations which access memory between iris_batch_sync_region_start() and iris_batch_sync_region_end() calls. This makes sure that any buffer objects accessed within the region are considered in use through the same caching domain until the end of the region. Adding any buffer to the batch validation list outside of a sync region will lead to an assertion failure in a future commit, unless the caller explicitly opted out of the cache tracking mechanism. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	8cbe953548	iris: Add infrastructure to partition batch into sync boundaries. This introduces some minimalistic infrastructure which will be used in order to partition the batch into a series of sections, each one with a unique, monotonically-increasing sequence number. Section boundaries will typically lie at points in the batch where the execution and memory coherency status of some previous commands are known, e.g. at batch buffer boundaries or PIPE_CONTROL commands. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Francisco Jerez	7878cbec59	iris: Add batch-local synchronization book-keeping to iris_bo. The purpose of this is to represent the cache coherency state of a buffer as a vector of integers (AKA seqnos), one for each incoherent caching domain of the GPU. A seqno will identify a single section of a batch buffer uniquely across the whole pipe_screen (which means that there will be no ambiguity about what context a given seqno belongs to even if there are multiple threads accessing the same buffer in parallel), and is guaranteed to be allocated in monotonically increasing order within any given context. The iris_bo_bump_seqno() helper is provided for marking the last update of a buffer from a given caching domain in a lockless manner. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3875>	2020-06-03 23:12:22 +00:00
Alyssa Rosenzweig	b73b339531	panfrost: Mark point sprites as todo on Bifrost Emulating them will be a rather annoying dance. Let's not worry about this until further down the line when we have a better sence of how to do handle them efficiently. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5290>	2020-06-03 22:58:46 +00:00
Alyssa Rosenzweig	0ef527928c	panfrost: Fix gl_PointSize out of GL_POINTS In this case, vs->writes_point_size is true as the VS writes gl_PointSize, but panfrost_writes_points_size() is false as we are not drawing points so the hardware doesn't process it. Thus the varying descriptor is emitted but elements is never written. When the VS runs, it will attempt to write to elements, a NULL pointer. The behaviour is architecture-independent. On Midgard, the write silently fails, hence why this bug was never noticed before. On Bifrost, this raises an MMU fault. The fix is to set the format to VARYING_DISCARD to ignore the write. Noticed on Neverball. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5290>	2020-06-03 22:58:46 +00:00
Alyssa Rosenzweig	3f8abd8676	panfrost: Prefer sysval for gl_PointCoord on Bifrost It's like gl_FragCoord. Still not implemented. This unfortunately makes point sprites a lot more complicated. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5290>	2020-06-03 22:58:46 +00:00
Alyssa Rosenzweig	bc7397f376	pan/bi: Disassemble gl_PointCoord reads. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5290>	2020-06-03 22:58:46 +00:00
Alyssa Rosenzweig	3e4a0c2bca	panfrost: Explicitly convert to 32-bit for logic-ops Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reported-by: Icecream95 <ixn@keemail.me> Fixes: `19b4e586f6` ("panfrost: Switch to pan_lower_framebuffer") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5289>	2020-06-03 22:48:10 +00:00
Alyssa Rosenzweig	6d00eaf733	panfrost: Readd MIDGARD_SHADERLESS quirk to t760 Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reported-by: Icecream95 <ixn@keemail.me> Fixes: `e53d27de61` ("panfrost: Add quirks for blend shader types") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5289>	2020-06-03 22:48:10 +00:00
Francisco Jerez	46183a999b	iris: Extend iris_context dirty state flags to 128 bits. We're nearly out of dirty bits, and some patches pending review on GitLab no longer apply due to that. Make room for them by splitting off shader stage-specific bits into a separate stage_dirty mask. An alternative would be to split compute-related bits into a separate mask, but that would prevent the '<< stage' indexing done in various parts of the driver from working. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5279>	2020-06-03 22:22:19 +00:00
Francisco Jerez	45918e0d8c	iris: Simplify iris_batch_prepare_noop(). This makes iris_batch_prepare_noop() return a boolean instead of passing through the relevant set of dirty flags. It will make it easier to change the representation of dirty flags. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5279>	2020-06-03 22:22:19 +00:00
Rob Clark	26a3c7b363	nir/lower_tex: fixes for fp16 yuv lowering Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3079 Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5318>	2020-06-03 21:24:13 +00:00
Rob Clark	0f3255ef0a	nir/builder: add bitsize conversion helpers Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5318>	2020-06-03 21:24:13 +00:00
Rob Clark	866618c5c8	nir: extract out convert_to_bitsize() helper Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5318>	2020-06-03 21:24:13 +00:00
Rob Clark	924bfb6560	nir: get_base_type() should return enum type Needed by the next patch, for c++ code which is more strict about conversions between integers and enums. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5318>	2020-06-03 21:24:12 +00:00
Alyssa Rosenzweig	dce7722ef8	panfrost: Handle writes_memory correctly We need to pass it thru to EARLY_Z and WRITES_GLOBAL instead of ignoring and assuming respectively. Nontrivial performance fix. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5300>	2020-06-03 20:48:24 +00:00
Alyssa Rosenzweig	2447b3b9d3	panfrost: Document MALI_WRITES_GLOBAL bit We've been setting this unconditionally -- oops! Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5300>	2020-06-03 20:48:24 +00:00
Alyssa Rosenzweig	ee59d1ad77	panfrost: Update MALI_EARLY_Z description Via the ES3.1 early-z testing force, I've confirmed this bit is e-z. I've also confirmed e-z must be disabled for global writes, as expected. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5300>	2020-06-03 20:48:24 +00:00
Marcin Ślusarz	7e26a02e5f	iris: remove unused iris_bo->swizzle_mode Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5313>	2020-06-03 18:38:00 +00:00
Samuel Pitoiset	77f08982af	aco: sign-extend input/identity for 16-bit subgroup ops on GFX6-GFX7 16-bit subgroup ops are implemented with 32-bit instructions on GFX6-GFX7. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5227>	2020-06-03 19:48:43 +02:00
Samuel Pitoiset	f31c9b4edf	aco: fix subdword copies on GFX6-GFX7 SDWA is only GFX8+. Use v_mov_b32 since the upper 16 bits don't matter. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5227>	2020-06-03 19:48:42 +02:00
Samuel Pitoiset	a521c67d22	aco: implement 16-bit nir_intrinsic_quad_* on GFX6-GFX7 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5227>	2020-06-03 19:48:40 +02:00
Samuel Pitoiset	6b08d269bf	aco: implement 16-bit reduce operations on GFX6-GFX7 No fp16 on GFX6-GFX7. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5227>	2020-06-03 19:48:37 +02:00
Alyssa Rosenzweig	0e73d879e3	pan/bi: Handle vectorized load_const In preparation for 16-bit vectors. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5307>	2020-06-03 17:35:10 +00:00
Alyssa Rosenzweig	1b09c6993d	pan/bi: Passthrough second argument of F32_TO_F16 At the NIR level this is a second vector source of the first (only) argument; at the BIR level this is a pair of scalars. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5307>	2020-06-03 17:35:10 +00:00
Alyssa Rosenzweig	8a4efe2d73	pan/bi: Pack second argument of F32_TO_F16 Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5307>	2020-06-03 17:35:10 +00:00
Alyssa Rosenzweig	323eecaf13	pan/bi: Fix SEL.16 swizzle 2 scalar arguments, not 1 vector. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5307>	2020-06-03 17:35:10 +00:00
Alyssa Rosenzweig	9ed1ae4724	pan/bi: Handle SEL with vec3 16-bit Otherwise we end up with a missing argument. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5307>	2020-06-03 17:35:10 +00:00
Alyssa Rosenzweig	afc18c62d7	panfrost: Passthrough NATIVE loads/stores Now that we handle load_output directly, this works for e.g. RGB565 on Midgard. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5285>	2020-06-03 15:36:57 +00:00
Alyssa Rosenzweig	36af05bbde	pan/mdg: Handle regular nir_intrinsic_load_output Instead of the vendored version. Only for blend shaders at the moment, frag shaders fb_fetch has a lot more going on. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5285>	2020-06-03 15:36:57 +00:00
Alyssa Rosenzweig	293d37e19d	pan/mdg: Allow f2u8 and friends thru Now that we can handle destination sizes directly, this keeps us from needing to chew through so many conversions. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5285>	2020-06-03 15:36:57 +00:00
Alyssa Rosenzweig	0ae0141f5b	pan/mdg: Handle f2u8 This is similar to f2u16. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5285>	2020-06-03 15:36:57 +00:00
Alyssa Rosenzweig	f8b881f161	pan/mdg: Fold roundmode into applicable instructions Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5285>	2020-06-03 15:36:57 +00:00
Alyssa Rosenzweig	93513cd9ff	pan/mdg: Implement *_rtz conversions with roundmode Use rte as the canonical type. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5285>	2020-06-03 15:36:57 +00:00
Alyssa Rosenzweig	6290e83190	pan/mdg: Lower roundmodes So now we can use the IR field semantically. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5285>	2020-06-03 15:36:57 +00:00
Alyssa Rosenzweig	1bef784867	pan/mdg: Add opcode roundmode property When the output is rounded in a specified direction. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5285>	2020-06-03 15:36:57 +00:00
Alyssa Rosenzweig	2eb4c85e42	pan/mdg: Add roundmode enum Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5285>	2020-06-03 15:36:57 +00:00
Alyssa Rosenzweig	014d2e46a7	pan/mdg: Distinguish blend shaders in internal shader-db Since these shaders are purely internal, the optimization criteria are a bit different, so it's worth calling attention to this when dumping. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5285>	2020-06-03 15:36:57 +00:00
Icecream95	99446c9f7d	panfrost: Only use AFBC YTR with RGB and RGBA The "lossless colorspace transform" is lossy for R and RG formats. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5293>	2020-06-03 15:19:43 +00:00
Icecream95	9ac106defe	panfrost: Decode AFBC flag bits Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5293>	2020-06-03 15:19:43 +00:00

... 2 3 4 5 6 ...

115280 Commits