KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Rob Clark	1bd38746d5	freedreno/gmem: rework gmem layout algo And try a bit harder to find an optimal layout. Improves on a sub- optimal layout we arrive at in the 4 MRT pass in manhattan, picking up a bit more than 3%. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4976>	2020-05-12 18:16:48 +00:00
Rob Clark	c46f46befe	freedreno/gmem: relax alignment on a6xx The blob only uses single page alignment, and empirically that appears to work just fine. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4976>	2020-05-12 18:16:48 +00:00
Rob Clark	ad6e06621b	freedreno: add gmemtool A simple standalone thing to run through a bunch of GMEM layouts for a given gpu. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4976>	2020-05-12 18:16:48 +00:00
Rob Clark	ef5f238fd0	freedreno/gmem: add helper to dump GMEM layout Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4976>	2020-05-12 18:16:48 +00:00
Rob Clark	6a49d9c396	freedreno/gmem: add div_align() helper Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4976>	2020-05-12 18:16:48 +00:00
Rob Clark	96b5a70f45	freedreno: initialize max_scissor Somehow the initialization of this got lost somewhere along the way, resulting in assuming minx/miny are always zero. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4976>	2020-05-12 18:16:48 +00:00
Rob Clark	1387e77801	freedreno/gmem: don't assume scissor opt when estimating # of bins We potentially don't know yet what the resulting scissor bounds are, so we can't assume this when estimating number of bins per pipe for VSC size calculations. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4976>	2020-05-12 18:16:48 +00:00
Eric Anholt	0f2e44d55b	freedreno: Drop the "write" arg to emit_const_bo now relocs don't care. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4967>	2020-05-12 16:30:57 +00:00
Eric Anholt	51d7a71bd4	freedreno: Replace OUT_RELOCW with OUT_RELOC. Final cleanup commit now that they're the same. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4967>	2020-05-12 16:30:57 +00:00
Eric Anholt	064f395a89	freedreno: Tell the kernel that all BOs are for writing. Using non-write flags is pretty dubious -- it means the kernel tracking an array of read-only consumers of the BO and having exclusive consumers wait on each reader's fence. It allows multiple readers through dma-bufs to do work in parallel, but at the cost of kernel CPU time and memory management of the shared array. Other drivers have dropped this distinction since dma-buf sharing is usually producer-consumer, not producer-two-consumers, and the userspace and kernel space tracking is expensive. For us, this lets us drop the flags passed in for relocs and tracked in the ringbuffer reloc lists. The end result of the flags reduction work is drawoverhead uniforms test throughput 2.37195% +/- 0.365579% (n=15) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4967>	2020-05-12 16:30:57 +00:00
Eric Anholt	554b959df0	freedreno: Replace OUT_RELOCD with permanently flagging shader BOs for it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4967>	2020-05-12 16:30:57 +00:00
Eric Anholt	9d8d936dfc	freedreno: Start moving relocs flags into the BOs. It's silly to have all the reloc emitters passing around FD_RELOC_READ when you have to have it set on all relocs (that don't include WRITE, which implies read) for the kernel to actually track the fences on the BO. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4967>	2020-05-12 16:30:57 +00:00
Lucas Stach	dc6c42dc77	etnaviv: generalize FE stall before loading shader and sampler states It seems that some of the new shader and sampler states added with Halti0 are not self-synchronizing anymore. Make sure to stall the FE before loading those new states to avoid corruption of the in-flight draw state. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3963>	2020-05-12 16:13:31 +02:00
Gert Wollny	50eabb7035	r600: Fix nir compiler options, i.e. don't lower IO to temps for TESS Also fix alignments and add umad24 and umul24 options. Fixes: `6747a984f5` r600: Enable tesselation for NIR Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4982>	2020-05-12 06:34:07 +00:00
Dave Airlie	5743fa6e70	zink: enable conditional rendering if available This doesn't seem to work perfect, but I'm not sure what is possible in GL vs Vulkan here Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2867 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4835>	2020-05-11 09:09:34 +00:00
Erik Faye-Lund	5c7dea394f	zink: add a GET_PROC_ADDR macro to simplify load_device_extensions This doesn't do much for now, but it will keep thing cleaner in the next commit. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4835>	2020-05-11 09:09:34 +00:00
Erik Faye-Lund	b8fd70eef2	zink: load vk_GetMemoryFdKHR while creating screen We're about to load some more extension-pointers as well, so let's create a separate place for doing this. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4835>	2020-05-11 09:09:34 +00:00
Pierre-Eric Pelloux-Prayer	c668bdf05c	radeonsi: do not use cmask with encrypted texture Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4401>	2020-05-11 10:26:05 +02:00
Pierre-Eric Pelloux-Prayer	8873ea0e25	radeonsi: determine secure flag must be set for gfx IB Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4401>	2020-05-11 10:25:53 +02:00
Pierre-Eric Pelloux-Prayer	2c2ab36f53	radeonsi: add support for PIPE_RESOURCE_FLAG_ENCRYPTED Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4401>	2020-05-11 10:25:53 +02:00
Pierre-Eric Pelloux-Prayer	5c58cbe84d	radeonsi/sdma: implement tmz support Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4401>	2020-05-11 10:25:53 +02:00
Pierre-Eric Pelloux-Prayer	5d96c26b67	radeonsi: force using staging texture when uploading to secure texture Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4401>	2020-05-11 10:25:53 +02:00
Pierre-Eric Pelloux-Prayer	2853ed1a24	radeonsi: allocate framebuffer texture as secure when using tmz Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4401>	2020-05-11 10:25:53 +02:00
Pierre-Eric Pelloux-Prayer	5a67b52de4	radeon: add RADEON_CREATE_ENCRYPTED flag Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4401>	2020-05-11 10:25:53 +02:00
Pierre-Eric Pelloux-Prayer	977e19d5cf	amdgpu/radeon: add secure api Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4401>	2020-05-11 10:25:53 +02:00
Erico Nunes	e622e010fd	lima/ppir: rework select conditions This is yet another simple optimization that attemts to save the insertion of an unnecessary mov for a large number of cases. If the node outputting the condition for select satisfies a few requirements (which are common in the case of comparison conditions), it can just be changed to pipeline output and used directly. In case of difficult corner cases, just fall back to the mov as before. The sel_cond op is removed as the scheduler can be smart enough to place nodes that output to ^fmul in the ALU_SCL_MUL slot, and as there can be alu ops other than just mov. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4632>	2020-05-09 14:40:40 +02:00
Erico Nunes	a0c58867cd	lima/ppir: add fallback mov option for const scheduler It turns out that with more aggressive combining, there can be cases where the available const slots are not enough for one instruction. In particular, fcsel can take up to two consts, and a previous alu slot, such as a comparison condition, might require an additional const. So add a fallback for it like for uniforms. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4632>	2020-05-09 14:40:37 +02:00
Erico Nunes	8c47640731	lima/ppir: rework store output In many cases, it is possible to avoid creating a mov for the store output node. Additionally, nodes other than alu, such as load varying, can be valid store output nodes too. This is another small optimization, but helps a vast majority of programs by 1 instruction. Shaders with discard easily become complicated to handle properly. Some example issues: ppir has to rely on instruction ordering; or a node with ssa output could be required both before a discard_if (as a condition) and after it (as the instruction with the 'stop' bit set). So don't try to handle them here. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4632>	2020-05-09 14:40:34 +02:00
Erico Nunes	570f1420db	lima/ppir: rework emit nir to ppir The previous code assumed that a ppir node would be created for each nir instr and used that to add it to the list of nodes and verify success. This didn't make much sense anymore since some emit paths create multiple nodes anyway, and this didn't allow for an emit call to not create any new ppir node while still returning success. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4632>	2020-05-09 14:40:21 +02:00
Erico Nunes	6b21b771f7	lima/ppir: remove unused clone functions With the previous refactors moving these lowering steps to a nir pass, these are no longer needed. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4535>	2020-05-09 11:30:07 +00:00
Erico Nunes	8c4157138f	lima/ppir: duplicate consts in nir Move the duplicate consts step to a nir pass. This makes the nir representation closer to what ppir will have in the result. Additionally, it handles the case where a const is used multiple times by a single node (which can happen in instructions like fcsel). The new implementation will only emit a single load const for that case. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4535>	2020-05-09 11:30:07 +00:00
Erico Nunes	5e6c386118	lima/ppir: duplicate intrinsics in nir Move the duplicate uniform and varying steps to a nir pass, along with some changes in the duplicating strategy. Node duplication is now done per user of the varying/uniform. This is inspired by what the offline shader compiler seems to usually do, and as usual aims to reduce register pressure and better utilize the ld_uni and ld_var instruction slots. It is worth noting that due to a bug/feature, ppir was already duplicating uniforms per successor in ppir_node_add_src even if the comment indicated it was meant to be per-block. Additionally, ppir was duplicating load uniform nodes twice for nodes that use the same uniform in more than one source, resulting in one unnecessary (and unpipelineable) load. This new implementation in nir only creates one load in that case. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4535>	2020-05-09 11:30:07 +00:00
Erico Nunes	09003ba070	lima/ppir: combine varying loads in node_to_instr Varying loads with a single successor have a high potential to be combined with its successor node, like ppir does for uniforms, rather than being in a separate instruction. Even if ppir becomes capable of combining instructions in a separate step, combining varying loads during node_to_instr is trivial enough that it seems to be worth doing it in this stage, and this benefits pretty much every program that uses varyings. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4535>	2020-05-09 11:30:07 +00:00
Erico Nunes	c6a3987f32	lima/ppir: do not assume single src for pipeline outputs Even if a node has pipeline output and a single successor, it is still valid for that successor to have multiple references to that pipeline node. A trivial example is add(u.x,u.y) where u is a uniform. It is even possible for this to occur with consts as operands of fcsel. So remove uses of ppir_node_get_src_for_pred as that would assume a single src in the node that uses the pipeline. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4535>	2020-05-09 11:30:07 +00:00
Erico Nunes	741aa3439d	lima/ppir: fix lod bias register codegen The lod bias register is correctly run through the entire compilation process, but in the end its allocated register value was never being added to the instruction. It seems that most programs were lucky enough that lod bias was assigned register 0.x so that things worked anyway. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4535>	2020-05-09 11:30:07 +00:00
Erico Nunes	cef1c73634	lima/ppir: introduce liveness internal live set The current solution for handling registers that live and die within a single instruction does not handle all cases. In particular, these intra-instruction use register also conflict with registers that are part of the live_in set. Unfortunately, adding them to the live_in set is not an easy solution as that would cause them to be propagated upwards. So, add a separate set to handle these registers in the particular instructions, without propagating them. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4535>	2020-05-09 11:30:07 +00:00
Qiang Yu	727a0a53fd	radeonsi: remove emacs style config file As radeonsi has synced the code style with main mesa, remove the orginal radeonsi spec emacs config file and use the top level dir .dir-locals.el Acked-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4961>	2020-05-09 00:57:26 +00:00
Eric Anholt	c9e8df61dc	freedreno: Initialize the bo's iova at creation time. Avoids repeated conditionals at reloc time checking if we need to go ask the kernel. No statistically significant difference on the drawoverhead case I'm looking at (n=300). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4957>	2020-05-08 12:35:39 -07:00
Eric Anholt	6c688ae81f	freedreno: Deduplicate ringbuffer macros with computerator/fdperf They're sugar around freedreno_ringbuffer.h, so put them there and reuse them. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4957>	2020-05-08 12:35:38 -07:00
Hyunjun Ko	094c7646a3	freedreno,tu: Don't request fragcoord components not being read. v1. Replace the existed bool type with new bitfield and edit register files to take a mask instead of duplicating codes to do masking. v2. Use fragcoord_compmask != 0 instead of fragcoord_compmask > 0 since it represents a bitfield. Tested with dEQP-VK.glsl.builtin_var.simple.fragcoord_xyz/w dEQP-GLES2.functional.shaders.builtin_variable.fragcoord_xyz/w Closes: #2680 Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4723>	2020-05-08 17:45:03 +00:00
Blaž Tomažič	808eb20186	radeonsi: Fix omitted flush when moving suballocated texture Fixes: `5e805cc74b` "radeonsi: flush the context after resource_copy_region for buffer exports" Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4925>	2020-05-07 17:00:08 -04:00
Marek Olšák	441eaef6a9	amd: unify code for overriding offset and stride for imported buffers Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4863>	2020-05-07 20:13:41 +00:00
Marek Olšák	c164ea86e1	ac/surface,radeonsi: move the set/get_umd_metadata code into ac_surface.c The indentation is on purpose. The whole file will be reindented to this code style some other time. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4863>	2020-05-07 20:13:41 +00:00
Marek Olšák	7691de0dce	ac/surface,radeonsi: move the set/get_bo_metadata code to ac_surface.c The indentation is on purpose. The whole file will be reindented to this code style some other time. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4863>	2020-05-07 20:13:41 +00:00
Marek Olšák	56e37374dd	amd: assume HTILE is always rb/pipe_aligned, remove ac_surface.u.gfx9.htile Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4863>	2020-05-07 20:13:41 +00:00
Marek Olšák	cf61f635ff	amd: assume CMASK is always rb/pipe_aligned, remove ac_surface.u.gfx9.cmask Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4863>	2020-05-07 20:13:41 +00:00
Dave Airlie	89d4b6b5c8	llvmpipe: make sample position a global array. I messed this up and LLVM asserts on it. Use the gallivm struct wrappers to make it clearer. Closes: https://gitlab.freedesktop.org/mesa/mesa/issues/2913 Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Tested-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4933>	2020-05-07 18:38:51 +00:00
Jan Zielinski	58dfb38f78	gallium/swr: Fix crashes in sampling code Add missing functions used by the new sampling code in llvmpipe (num_samples and sample_stride) Reviewed-by: Krzysztof Raszkowski <krzysztof.raszkowski@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4947>	2020-05-07 17:31:21 +00:00
Tomeu Vizoso	9c3e82296c	panfrost: Don't trample on top of Bifrost-specific unions Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4944>	2020-05-07 17:16:53 +00:00
Tomeu Vizoso	a4d41a1510	panfrost: Add checksum BOs to batch So they don't get released before the last frame finishes rendering. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4944>	2020-05-07 17:16:52 +00:00
Jose Maria Casanova Crespo	905edc376d	v3d: Include supported DXT formats to enable s3tc/dxt extensions DXT1_RGBA and sRGB variants of DXT[135] formats are enabled as valid format on V3D. Once all S3TC formats supported by V3C are enabled the following extensions become exposed by gallium. * GL_ANGLE_texture_compression_dxt3 * GL_ANGLE_texture_compression_dxt5, * GL_EXT_texture_compression_dxt1 * GL_EXT_texture_compression_s3tc * GL_S3_s3tc * GL_EXT_texture_compression_s3tc_srgb This enables 206 passing piglit test related to gl_compressed.*s3tc_dxt Cc: 20.0 20.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4934>	2020-05-07 14:03:34 +02:00
Jose Maria Casanova Crespo	e3ecf48dda	v3d: Fix swizzle in DXT3 and DXT5 formats Swizzles were ignoring the W component of the format DXT3_RGBA and DXT5_RGBA. This fixes 15 piglit tests: spec/!opengl 1.1/copyteximage 2d spec/!opengl 1.2/copyteximage 3d spec/arb_texture_compression/fbo-generatemipmap-formats/gl_compressed_rgba spec/arb_texture_compression/fbo-generatemipmap-formats/gl_compressed_rgba npot spec/arb_texture_compression/texwrap formats bordercolor-swizzled/gl_compressed_rgba, swizzled, border color only spec/arb_texture_compression/texwrap formats bordercolor/gl_compressed_rgba, border color only spec/arb_texture_cube_map/copyteximage cube spec/arb_texture_cube_map/copyteximage cube samples=2 spec/arb_texture_cube_map/copyteximage cube samples=4 spec/arb_texture_rectangle/copyteximage rect spec/arb_texture_rectangle/copyteximage rect samples=2 spec/arb_texture_rectangle/copyteximage rect samples=4 spec/ext_texture_array/copyteximage 2d_array spec/ext_texture_array/copyteximage 2d_array samples=2 spec/ext_texture_array/copyteximage 2d_array samples=4 Fixes: `469bbd8387` "broadcom/vc5: Move the formats table to per-V3D-version compile." Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4934>	2020-05-07 14:03:34 +02:00
Elie Tournier	2e6bbab9ae	virgl: Enable CAP_CLEAR_TEXTURE if host supports it Signed-off-by: Elie Tournier <elie.tournier@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4345>	2020-05-07 10:21:50 +00:00
Elie Tournier	e705a2a9f4	virgl: implement ARB_clear_texture Signed-off-by: Elie Tournier <elie.tournier@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4345>	2020-05-07 10:21:50 +00:00
Gert Wollny	a6321c4b5a	r600: Fix warning regarding mixing enums and unsigned in ?: expression Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Michel Dänzer <mdaenzer@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4939>	2020-05-07 11:01:02 +02:00
Gert Wollny	5469fcea75	r600: remove some unused variables to silence warnings Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Michel Dänzer <mdaenzer@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4939>	2020-05-07 11:00:54 +02:00
Gert Wollny	79f20eb819	r600/sb: replace memset by using member initialization/assignment Closes #2860 Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Michel Dänzer <mdaenzer@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4939>	2020-05-07 11:00:51 +02:00
Gert Wollny	ee3f4ab2f4	r600: remove unused static functions Related #2860 Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Michel Dänzer <mdaenzer@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4939>	2020-05-07 11:00:47 +02:00
Gert Wollny	9a244778f7	r600: Annotate some case fallthroughs Also fix indentions where aproprate Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Michel Dänzer <mdaenzer@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4939>	2020-05-07 11:00:26 +02:00
Rob Clark	e8cdf12511	freedreno/a6xx: enable tiled compressed textures I wasn't expecting this to be too useful, since compressed textures are already block based.. but gfxbench gl_fill says otherwise. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4868>	2020-05-06 17:11:34 -07:00
Rob Clark	193560c44b	freedreno/a6xx: compressed blit fixes width/height are not necessarily aligned to block boundaries, so we need to round up. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4868>	2020-05-06 17:11:34 -07:00
Kristian H. Kristensen	85f2cd84ac	freedreno/a6xx: Set tfetch correctly for compressed formats The fetchsize is just the blocksize for compressed formats, which gets rid of the ASTC special cases add handles ETC1/2 as well. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4868>	2020-05-06 17:11:34 -07:00
Marek Olšák	29da521280	radeonsi: fix compilation of monolithic PS This was totally broken. Monolithic PS is only used if FBFETCH or interpolateAtSample are used. When the PS prolog was built, it overwrote ctx->main_fn. Discovered by @eefano. Fixes: `8832a88434` "radeonsi: move PS LLVM code into si_shader_llvm_ps.c" Closes: #2814 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4918>	2020-05-06 17:02:23 +00:00
Erik Faye-Lund	7f6a491eec	zink: lower b2b to b2i Zink requires 1-bit booleans, but this requirement was missed before b2b1s started getting automatically inserted. Let's lower these away, to avoid piglit regressions. Fixes the following piglits: - shaders@glsl-vs-if-bool - spec@!opengl 2.0@vertex-program-two-side Fixes: `c217ee8d35` ("nir: Insert b2b1s around booleans in nir_lower_to") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2902 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4903>	2020-05-06 09:20:27 +00:00
Dave Airlie	dab8803af4	llvmpipe: enable ARB_sample_shading Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:38 +00:00
Dave Airlie	8a83db4204	llvmpipe: add min samples support to the fragment shader. This isn't enabled yet until the state gets hooked up Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:38 +00:00
Dave Airlie	d237e03a16	llvmpipe: enable GL_ARB_shader_texture_image_samples Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:38 +00:00
Dave Airlie	3cc50cabf1	llvmpipe: enable 4x sample MSAA + texture multisample This enables proper support for 4xMSAA and for texture mulitsample extension. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:38 +00:00
Dave Airlie	7898978377	llvmpipe: don't choose pixel centers for multisample Don't pick the pixel centers for multisample rendering, fix the setup program. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:38 +00:00
Dave Airlie	8297513aa9	llvmpipe: choose correct position for multisample For multisample we don't want pixel centers at this stage, so don't add them in for that case. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:38 +00:00
Dave Airlie	b72f504e99	llvmpipe: choose multisample rasterizer functions per triangle (v2) This just picks the correct cmds to add to the scene. v2: drop using 32-bit ms (Roland) Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:38 +00:00
Dave Airlie	26cc01cefd	llvmpipe: generate multisample triangle rasterizer functions (v2) This uses the templating to generate multisample version of the tri plane raster functions This doesn't generate any optimised version for lower plane numbers, maybe this is worth doing in the future. v2: drop generating 32-bit msaa (Roland) Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:38 +00:00
Dave Airlie	8611a6b34b	llvmpipe: fixup multisample coverage masks for covered tiles For fully covered tiles just pass in the filled out mask. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:38 +00:00
Dave Airlie	2d13591ba4	llvmpipe: build 64-bit coverage mask in rasterizer This adds the logic to build the per-sample masks at the lowest level of the rasterizer block hierarchy Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:38 +00:00
Dave Airlie	88851c4798	llvmpipe: add fixed point sample positions to scene. These will be used in the rasterizer to generate the coverage masks Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:38 +00:00
Dave Airlie	78b7f22838	llvmpipe: add new rast api to pass full 64-bit mask. The 64-bit mask is a 16-bit mask per sample for up to 4 samples. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:38 +00:00
Dave Airlie	c638a59fa8	llvmpipe: disable opaque variant for multisample Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:38 +00:00
Dave Airlie	c5021ebb15	llvmpipe: fix multisample occlusion queries. This needs to check the per-sample mask inside the loop if multisample is enabled. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:38 +00:00
Dave Airlie	335938cffd	llvmpipe: move color storing earlier in frag shader Move the color storage before the late Z test as for sample shading it needs to be inside a loop with the fragment shader. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:38 +00:00
Dave Airlie	acba9a93ef	llvmpipe: pass mask store into interp for centroid interpolation This enables centroid interpolation to work, using the current coverage masks. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:38 +00:00
Dave Airlie	367332b0fc	llvmpipe: don't allow branch to end for early Z with multisample Don't allow the branching optimisation with multisample enabled as we have to check all samples. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:38 +00:00
Dave Airlie	d9276ae965	llvmpipe: handle gl_SampleMask writing. This is using a load/store to make it easier to add sample shading later. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:38 +00:00
Dave Airlie	69009949e0	llvmpipe: add multisample alpha to one support Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	66a92e5d92	llvmpipe: add multisample alpha to coverage support. Converts alpha into coverage mask. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	38e81938b6	llvmpipe: hook up sample position system value This creates a global static with the current sample positions, and passes it to the fragment shader which uses it for interpolation and sample position support. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	210d714f46	llvmpipe: handle multisample color stores. Extract the final per-sample masks and store to the multisample color buffers using them. This retypes the pointer to a uint8_t at entry to make the GEP simpler, then recasts to the blend type. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	102558912b	llvmpipe: interpolate Z at sample points for early depth test. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	a0195240c4	llvmpipe: handle multisample early depth test/late depth write A set of values have to be passed from the early depth test to the late depth write, when multisampling is enabled, a range of those values have to be stored between stages, so create storage for them and pass the values through the storage. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	9f8c7e232e	llvmpipe: multisample sample mask + early/late depth pass Start adding support for multisample masks and the depth passes The depth passes have to run per-sample, this isn't complete support it adds the loops, and handles the execution masks. One mask is stored per sample, they are combined post the early Z pass into a single shader execution mask, and then the resulting shader execution mask is anded back in for the late Z pass. Init the vars to NULL to avoid gcc warnings Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	f12dac5e10	llvmpipe: move some fs code around this just moves the num_fs loop around for follow on refactors Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	5e949b16c1	llvmpipe: add per-sample depth/stencil test The current depth stencil test code has some optimisations using the mask when there is only one depth value, multisample requires per-sample zstencil testing, and for that case just pass in the mask that needs updating. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	d297f2ecf1	llvmpipe: move getting mask value out of depth code. (v2) In order to add per-sample support to this code, the mask value is needed not the value from the exec mask. v2: update comment Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	18fd62a26e	llvmpipe: add per-sample interpolation. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	8154bdf25b	llvmpipe: add centroid interpolation support. This just adds the implementation and API to the interpolation builders. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	5697b9c00c	llvmpipe: pass interp location into interpolation code. This just tracks the attribute interpolation location into the interp code. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	455c8e3584	llvmpipe: add cbuf/zsbuf + coverage samples to the fragment shader key. These will cause different fragment shaders to be generated. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	d2f488684a	llvmpipe: change mask input to fragment shader to 64-bit. In order to handle a 4xMSAA mask (16-bits per sample) increase the fragment shader API to be 64-bit. v2: drop pointless if (Roland) Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	67ec1760ee	llvmpipe: add multisample bit to fragment shader key. The fragment shader needs to be regenerated when multisample changes. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	f5463576b9	llvmpipe: plumb multisample state bit into setup code. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	e47d39aee1	llvmpipe/rast: fix tile clearing for multisample color and depth tiles Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	01e9779c00	llvmpipe: record sample info for color/depth buffers in scene This adds the nr_samples + sample_stride to the scene records for cbufs and zsbuf. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	a30db60ede	llvmpipe: pass color and depth sample strides into fragment shader. This just adds the interface and passes the depth and sample strides into the fragment shader, nothing uses them yet. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	4c72bb4a96	llvmpipe: handle multisample render target clears Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	782271c0e1	llvmpipe: add clear texture support for multisample textures. This adds the clear paths for multisample textures. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	c8740cbf01	llvmpipe: add multisample resource copy region support. This allows direct copies of all samples between two resources. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	178df06821	llvmpipe: add internal multisample texture mapping path. For clearing and copying textures llvmpipe needs to internally access the per-sample data. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	cab13f9174	llvmpipe: pass incoming sample_mask into fragment shader context. This links up the api changing the sample mask to passing it into the fragment shader. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	c070af8511	llvmpipe/jit: pass fragment sample mask via jit context. The incoming sample mask for the fragment shader can be passed via the jit context Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	0a6150251a	llvmpipe: add get_sample_position support (v2) This just adds the sample values for 4xmsaa, and hooks them up to the get_sample_position API v2: move to vulkan standard sample positions Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	f6383673c9	llvmpipe: fix race between draw and setting fragment shader. There is a race with u_blitter shaders + pipeline shaders (aaline/aapoint) where the draw bind can cause a pipeline flush which can use bind_fs_state to be reenters and llvmpipe->fs gets the wrong value. Fix this by only setting the llvmpipe->fs value after the draw binding is complete. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	bcbe5b3d26	llvmpipe: add a max samples define set to 4. I doubt I'll care about much higher MSAA levels, so 4 it is. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	1b02eb1a4c	llvmpipe: add multisample support to texture allocator. This adds a sample stride field and allocates enough memory for each sample storage. Hook up the sample_stride field to draw and jit textures and images Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	1970390026	llvmpipe: add samples support to image jit Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	2e5cddacf7	llvmpipe: add num_samples/sample_stride support to jit textures This adds the support for num_samples/sample_stride retrieval to the jit texture infrastructure. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	bc3641d616	draw: add support for num_samples + sample_stride to the image paths Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Dave Airlie	026bf26599	draw: introduce sampler num samples + stride members This adds the num samples + sampler stride into the texture mapping paths, currently drivers just pass 0 for now. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4122>	2020-05-06 06:20:37 +00:00
Tomeu Vizoso	b6a20804ad	virgl: Properly check for encode_stride when encoding transfers Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Alexandros Frantzis <alexandros.frantzis@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4763>	2020-05-06 08:04:58 +02:00
Dave Airlie	99fce3a6d7	llvmpipe: simple texture barrier implementation. Just flush. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4774>	2020-05-06 15:09:42 +10:00
Dave Airlie	870b6a6050	llvmpipo/nir: free compute shader NIR I forgot this in the last round. Fixes: `18f896e55d` (llvmpipe: add initial nir support) Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4899>	2020-05-06 05:11:19 +10:00
Marek Olšák	0d83e7f4b9	radeonsi: enable TC-compatible HTILE on demand for best Z/S performance I haven't measured this, but it can only help. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4866>	2020-05-05 16:27:29 +00:00
Marek Olšák	39571d384e	radeonsi: allow tc_compatible_htile to be mutable Move the relevant code from si_init_depth_surface to si_emit_framebuffer_state, so that it can be changed after a pipe_surface is initialized. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4866>	2020-05-05 16:27:29 +00:00
Marek Olšák	04085bedc2	radeonsi/gfx9: always use IMG_DATA_FORMAT_S8_32 for 8-bit stencil I wanna remove dependency on tc_compatible_htile from non-dynamic states. This should be the same as 8_UINT if HTILE is disabled. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4866>	2020-05-05 16:27:29 +00:00
Marek Olšák	266fec1307	radeonsi: don't wait for idle at the end of gfx IBs Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4894>	2020-05-05 11:52:21 -04:00
Pierre-Eric Pelloux-Prayer	ae4379d81e	ac/nir: export some undef as zero NIR already optimizes undef usage. If undef reaches llvm, it's probably because of a broken shader. In this situation, rather than letting llvm use the undef values to do more optimization and probably produce incorrect results, we replace undef values by 0. "undef" values that are directly used in exports are kept as undef, because this allows llvm to optimize them away. This is only enabled for radeonsi. Closes: https://gitlab.freedesktop.org/mesa/mesa/issues/2689 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4607>	2020-05-05 12:26:26 +02:00
Pierre-Eric Pelloux-Prayer	0ee1a724bf	gallium: add a new cap PIPE_CAP_GLSL_ZERO_INIT Allows driver to select a zero init mode between the 3 possible values. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4607>	2020-05-05 12:26:02 +02:00
Pierre-Eric Pelloux-Prayer	547e81655a	radeonsi: don't print gs_copy_shader stats for shaderdb Fixes: `dbc86fa3de` ("radeonsi: dump shader stats when hitting the live cache") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4607>	2020-05-05 12:26:02 +02:00
Pierre-Eric Pelloux-Prayer	64662dd5ba	radeonsi: add workaround for issue 2647 For unknown reasons pixel shaders in KSP game get executed with infinite interpolation coefficients and this causes an infinite loop in the shader. This commit adds a hacky workaround that kills pixel shaders if invalid interp coeffs are detected and enables it for KSP. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2174 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2647 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4700>	2020-05-05 09:41:14 +00:00
Erik Faye-Lund	7983d97174	zink: use nir_lower_uniforms_to_ubo Instead of open-coding uniform -> UBO lowering, let's instead use the one that already exists. This should make things a bit simpler going forward. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4734>	2020-05-05 09:17:52 +00:00
Mauro Rossi	5779694698	android: iris: add iris_seqno.{c,h} to Makefile.sources Fixes the following undefined symbol building errors: ld.lld: error: undefined symbol: iris_seqno_init >>> referenced by iris_batch.c:187 (external/mesa/src/gallium/drivers/iris/iris_batch.c:187) >>> iris_batch.o:(iris_init_batch) in archive out/target/product/x86_64/obj/STATIC_LIBRARIES/libmesa_pipe_iris_intermediates/libmesa_pipe_iris.a Fixes: `e31b703c` ("iris: Place a seqno at the end of every batch") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Acked-by: Tapani Pälli <tapani.palli@intel.com>	2020-05-04 22:33:04 +02:00
Alyssa Rosenzweig	30f07e0d84	panfrost: Setup gl_FragCoord as sysval on Bifrost ..rather than a varying. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4883>	2020-05-04 11:08:14 -04:00
Christian Gmeiner	89a41dae77	etnaviv: do not use int filter when anisotropic filtering is used The blob does not use this combination. This change moves the decision if int filter gets used to state emit time. Fixes: `7aaa0e5908` ("etnaviv: add anisotropic filter support") Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Lucas Stach <l.stach@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4872>	2020-05-04 14:39:24 +00:00
Christian Gmeiner	b38e51bd96	etnaviv: fix SAMP_ANISOTROPY register value This caused some serious problems like shredded output, ~1fps and GPU hungs. Fixes: `7aaa0e5908` ("etnaviv: add anisotropic filter support") Reported-by: Lukas F. Hartmann <lukas@mntmn.com> Tested-by: Lukas F. Hartmann <lukas@mntmn.com> Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Lucas Stach <l.stach@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4872>	2020-05-04 14:39:24 +00:00
Marek Olšák	f1a40a26a9	Revert "ac/surface: remove RADEON_SURF_TC_COMPATIBLE_HTILE and assume it's always set" This reverts commit `f6d87ec8a9`. It breaks RADV. Fixes: `f6d87ec8a9` "ac/surface: remove RADEON_SURF_TC_COMPATIBLE_HTILE and assume it's always set" Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4864>	2020-05-02 20:12:38 +00:00
Caio Marcelo de Oliveira Filho	33c61eb2f1	iris: Implement ARB_compute_variable_group_size Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4794>	2020-05-01 12:50:37 -07:00
Caio Marcelo de Oliveira Filho	e645bc6939	intel: Let drivers call brw_nir_lower_cs_intrinsics() The motivating factor is: this lowering may cause nir_intrinsic_load_local_group_size intrinsics to be added to the shader, and by moving this around we make possible for the drivers to lower that intrinsic by themselves. Iris will do just that in a later patch for implementing variable group size. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4794>	2020-05-01 12:50:37 -07:00
Kenneth Graunke	1800e4b58c	iris: Implement PIPE_FLUSH_DEFERRED support. (Co-authored with Chris Wilson.) Frequently, games create fences and later check them with a timeout of 0 to see if that work has completed yet. They do not want the work to be flushed immediately upon fence creation. This is what PIPE_FLUSH_DEFERRED does - it inhibits the flush at fence creation time, but still guarantees that a flush will occur later on once fence_finish() is called. Since syncpts can only occur at batch boundaries, when deferring a flush, we have to wait for the syncpt at the end of the batch being constructed. This is later than desired, but safe if blocking. To avoid extra delays, we additionally insert a PIPE_CONTROL to write an availability bit at the exact point of the fence. We can poll this on the CPU, allowing us to check whether the fence has gone by, even if the batch hasn't completed. It can also let us skip kernel calls. Improves performance in Bioshock Infinite by 10% on Icelake GT2 on -ForceCompatLevel=5 settings. Thanks to Felix Degrood and Mark Janes for helping notice the extraneous stalls and batches, Marek Olšák for adding deferred flush support to Gallium to solve this issue, and Chris Wilson for reworking a lot of the internals of this work. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3802>	2020-05-01 19:00:02 +00:00
Kenneth Graunke	df09efe8df	iris: Detect DRM_SYNCOBJ_WAIT_FLAGS_WAIT_FOR_SUBMIT kernel support We will use this for implementing deferred flushes in the next commit. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3802>	2020-05-01 19:00:02 +00:00
Kenneth Graunke	07fb925ad8	iris: Flush any current work in iris_fence_await before adding deps Receiving a fence_server_sync (iris_fence_await) means that any future work needs to wait for the fence. But previous work doesn't need to. So flush it now, to avoid delaying it arbitrarily. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3802>	2020-05-01 19:00:02 +00:00
Chris Wilson	3dbde89111	iris: Store a seqno for each batch in the fence In the next patch, we will introduce deferred fences where we will need to flush a fence later. To do this, we need to know which batch requires flushing, so keep a 1:1 mapping between seqno[] and the associated batch. It's also substantially less confusing to have a 1:1 mapping. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3802>	2020-05-01 19:00:02 +00:00
Chris Wilson	fd1907efb3	iris: Convert fences to using lightweight seqno By using the breadcrumbs we inject into the batch, we can build a lightweight fence - that can be evaluated in userspace without having to check in the kernel. In order to pass the fences between processes, and to wait efficiently, we continue to track the syncobj for each batch and use that as a terminator for the fence, and for passing coarse scheduling decisions to the kernel on execbuf. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3802>	2020-05-01 19:00:02 +00:00
Chris Wilson	e31b703c42	iris: Place a seqno at the end of every batch We can use seqno as a basic for fast userspace fences: where we can check a value directly to test for fence completion without having to query using the kernel. To do so we need to write a breadcrumb from the batch and track those writes as the basis for our lightweight fences. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3802>	2020-05-01 19:00:02 +00:00
Kenneth Graunke	fb95ac6855	iris: Destroy transfer slab after batches Batches are going to have an uploader in the next commit, so destroying batches will destroy uploaders, which will unmap transfers, which will return things to the slab allocator. So we need to reorder destroying the slab allocator to the end to avoid crashing. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3802>	2020-05-01 19:00:02 +00:00
Kenneth Graunke	c94379c770	iris: Give up on not passing ice to iris_init_batch We're going to need it to create a uploader in the batch soon. We still avoid storing it, to maintain the charade of separation, and make people think twice about fetching random fields from there and intertwining things even worse. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3802>	2020-05-01 19:00:02 +00:00
Kenneth Graunke	4a1ed75b85	iris: Rename iris_syncpt to iris_syncobj for clarity. This is just a refcounted wrapper around a drm_syncobj. There is enough terminology going on in the area of synchronization (sync objects, sync files, ...) that I'd rather not invent our own. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3802>	2020-05-01 19:00:02 +00:00
Kenneth Graunke	abf8aed680	iris: Include linux/sync_file.h instead of cut and pasting contents Lets us drop some cut and pasted kernel header contents. Linux 4.7 came out 4 years before we the first officially supported release of this driver; iris won't run on kernels older than 4.16, and 4.18.11+ is strongly recommended. So I suspect it's safe to assume that a kernel header from 4.7 will exist at build time. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3802>	2020-05-01 19:00:02 +00:00
D Scott Phillips	65b05ebdda	anv,iris: Fix input vertex max for tcs on gen12 gen12 does away with the single patch dispatch mode for tcs, and increases some limits so that 8_patch mode can always work. Make the necessary changes so we don't try to fall back to single patch mode. Fixes KHR-GL46.tessellation_shader.single.max_patch_vertices and others Fixes: `44754279ac` ("intel/fs/gen12: Use TCS 8_PATCH mode.") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4843>	2020-05-01 16:49:11 +00:00
Eric Anholt	8f01fa1fb3	freedreno/ir3: Set the FS .msaa flag to true during precompiles. If you're going out of your way to do per-sample interpolation, you are almost surely going to be doing so to an MSAA framebuffer. Should reduce recompiles with MSAA enabled. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4562>	2020-05-01 16:26:32 +00:00
Eric Anholt	812c55b079	freedreno: Immediately compile a default variant of shaders. Now that we normalize our keys fairly well, build a variant at shader state creation time so that hopefully you don't have to call the compiler at draw time (as is now the case with glmark2 ES and most of the humus GL demos). Fixes: #2782 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4562>	2020-05-01 16:26:32 +00:00
Eric Anholt	8c1c218909	freedreno/ir3: Improve shader key normalization. We can remove a bunch of conditional code at key comparison time by computing a bitmask of used key bits at ir3_shader creation time. This also gives us a nice place to put additional key simplification to reduce how many variants we create (like skipping rastflat if we don't read colors in the FS, or skipping vclamp_color if we don't write colors). It does mean walking the whole key to AND it, but the key is just 28 bytes so far so that seems pretty fine. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4562>	2020-05-01 16:26:32 +00:00
Eric Anholt	6f1e3235f2	freedreno: Emit debug messages when doing draw-time recompiles of shaders. Right now that's "always" unless you have shaderdb set. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4562>	2020-05-01 16:26:32 +00:00
Eric Anholt	a361567c46	freedreno/ir3: Remove unused half precision shader key flag. The code using it was removed in `4af86bd0b9` ("freedreno/ir3: remove half-precision output") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4562>	2020-05-01 16:26:32 +00:00
Eric Anholt	05be0659fe	freedreno: Fix assertion failures on GS/tess shaders with shader-db enabled. We weren't filling in the tess mode of the key, or setting has_gs on GS shaders, resulting in assertion failures when NIR intrinsics didn't get lowered. We have to make a guess at prim mode for TCS, but it should be better to have some shader-db coverage than none, and it will avoid these failures happening when we start precompiling shaders. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4562>	2020-05-01 16:26:32 +00:00
Eric Anholt	fd8f3b62a4	freedreno: Stop doing binning shaders other than the VS in shader-db. ir3_cache.c only ever asks for binning variants for VS. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4562>	2020-05-01 16:26:32 +00:00
Kristian H. Kristensen	a16ee14f37	freedreno/ir3: Pass stream output info to ir3_shader_from_nir We need shader->stream_output filled out when we layout the push constants in ir3_setup_const_state(). Otherwise const_state->offsets.tfbo ends up as ~0, which doesn't work. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4562>	2020-05-01 16:26:31 +00:00
Tomeu Vizoso	3a81abf3b2	panfrost: Add Bifrost texture trampoline BO to batch Fixes: `d3eb23adb5` ("panfrost: Emit sampler descriptor on bifrost") Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4832>	2020-05-01 16:52:40 +02:00
Tomeu Vizoso	3baf251487	panfrost: mali_attr_meta.unknown1 is zero on Bifrost For unknown1 reasons :) Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4832>	2020-05-01 16:52:32 +02:00
Tomeu Vizoso	c4400b05be	panfrost: GPUs newer than G-71 don't have swizzles... for attributes and varyings. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4832>	2020-05-01 16:52:28 +02:00
Alyssa Rosenzweig	d6588b87bf	panfrost: Update Bifrost fields in mali_shader_meta Not much is known currently about these fields and their values, but this gets things going in the scenarios we have been testing with so far. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4832>	2020-05-01 16:52:19 +02:00
Tomeu Vizoso	4d581a4bc6	panfrost: Create additional BO for the checksum of imported BOs (Bifrost) Similar to what the blob does. My reason for doing this was mainly so traces weren't as different, which makes it more work to spot relevant differences. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4832>	2020-05-01 16:52:03 +02:00
Tomeu Vizoso	28902ba87e	panfrost: Split bit out of format.unk3 On Bifrost traces, we can observe that this bit is always enabled. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4832>	2020-05-01 16:51:36 +02:00
Rob Clark	60912f1ebd	freedreno: we don't need aligned vbo's This gets rid of the last reason that mesa/st would use `u_vbuf` on a6xx. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4812>	2020-05-01 02:05:00 +00:00
Rob Clark	9a7c179473	freedreno/a6xx: add some more formats u_vbuf was translating these for us.. which isn't really necessary. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4812>	2020-05-01 02:05:00 +00:00
Alyssa Rosenzweig	bbecbedb4c	panfrost: Fix norm coords on bifrost sampler Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4844>	2020-05-01 00:27:23 +00:00
Rob Clark	f8424d3b99	freedreno/a6xx: fix LRZ hang In detecting the case where we actually do need to re-emit LRZ state (due to new batch), we were checking `ctx->last.dirty` to detect when we cannot trust previous state. But this is cleared before we check it. Move where it is cleared to the end of the draw_vbo() path. Fixes: `dfa702e94b` ("freedreno/a6xx: limit LRZ state emit") Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4842>	2020-05-01 00:02:28 +00:00
Marek Olšák	bdd2f284d9	radeonsi: revert an accidental change in si_clear_buffer The change was in: `7b0b085c94` Fixes: `7b0b085c94` ("radeonsi: drop the negation from fmask_is_not_identity") Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4761>	2020-04-30 22:27:31 +00:00
Marek Olšák	5afec9bc9f	radeonsi: fix si_compute_clear_render_target with render condition enabled Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4761>	2020-04-30 22:27:31 +00:00
Marek Olšák	19db1a540c	radeonsi: add a workaround to fix KHR-GL45.texture_view.view_classes on gfx9 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4761>	2020-04-30 22:27:31 +00:00
Marek Olšák	d6acdbd935	radeonsi: implement and use compute-based DCC decompression on gfx9-10 DCC_DECOMPRESS doesn't work. Instead of trying to figure out why, use a compute blit where the load is compressed and the store is uncompressed. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4761>	2020-04-30 22:27:31 +00:00
Marek Olšák	d3da73954a	radeonsi: add SI_IMAGE_ACCESS_DCC_OFF to ignore DCC for shader images A shader-based DCC decompress pass will use this. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4761>	2020-04-30 22:27:31 +00:00
Marek Olšák	93d5c86081	radeonsi: bind shader images after DCC is disabled for image stores This prevents an infinite recursion with a compute-based DCC decompression when it restores shader images. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4761>	2020-04-30 22:27:31 +00:00
Marek Olšák	44d27fd6fb	radeonsi: clean up and deduplicate code around internal compute dispatches In addition to the cleanup, there are these changes in behavior: - clear_render_target waits for idle after the dispatch and then flushes L0-L1 caches (this was missing) - sL0 is no longer invalidated before the dispatch, because src resources don't use it - sL0 is no longer invalidated after the dispatch if dst is an image Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4761>	2020-04-30 22:27:31 +00:00
Marek Olšák	e58dcc47c3	radeonsi: unify and align down the max SSBO/TBO/UBO buffer binding size Rounding down the size fixes: KHR-GL45.enhanced_layouts.ssb_member_invalid_offset_alignment Fixes: `03e2adc990` Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4761>	2020-04-30 22:27:31 +00:00
Rob Clark	beb02a781c	freedreno/a6xx: don't set SP_FS_CTRL_REG0.VARYING for fragcoord Similar change to `5785bcc8a0`. It appears on a6xx and in fact this could cause varying corruption before the FS had a chance to consume the varyings from varying storage. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2838 Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4838>	2020-04-30 21:38:52 +00:00
Lionel Landwerlin	612e35c8d9	iris: don't assert on unfinished aux import in copy paths After a resource is created the first command using it could be a copy command. In iris_state we finish the import on surface/view creation but we don't do that for copies. v2: Move finish call to gallium entrypoints (Ken) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: <mesa-stable@lists.freedesktop.org> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2725 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> (v1) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4657>	2020-04-30 21:18:42 +00:00
Rob Clark	d56b8c4554	freedreno: sync registers with envytools Pull in the `SP_xS_BRANCH_COND` regs to keep the mesa and envytools copies from getting out of sync. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4813>	2020-04-30 20:03:17 +00:00
Rob Clark	200765457e	freedreno/a6xx: more OUT_REG() Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4813>	2020-04-30 20:03:17 +00:00
Rob Clark	f62cad6b7f	freedreno: scissor vs disabled scissor micro-opt We don't need to deref and check rast state every time scissor changes, only when rast state changes. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4813>	2020-04-30 20:03:17 +00:00
Rob Clark	373e9ab27c	freedreno/a6xx: convert const emit to OUT_PKT() This is another hot packet. This splits out each of the four cases (geom vs frag, and indirect vs inline) intentionally, to avoid some parity bit calc. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4813>	2020-04-30 20:03:17 +00:00
Rob Clark	710537b19c	freedreno/ir3: inline const emit Drop vfunc callbacks for per-gen packet emit, and instead have a header that is #include'd once per gen. We'll end up with multiple copies of some of this, but since we never have multiple gen's of adreno on a single device, only one copy will be paged in (and hopefully in the I-cache for hot-paths) Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4813>	2020-04-30 20:03:17 +00:00
Rob Clark	aff93f5419	freedreno/a6xx: split out const emit In order to inline the const emit and drop the per-gen vfuncs to emit the correct sort of packet, we should consolidate all of the entry- points to const emit in one object file, otherwise we'll end up with multiple copies per gen. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4813>	2020-04-30 20:03:17 +00:00
Rob Clark	58fd1d7ecd	freedreno/a6xx: convert draw packet to OUT_PKT() This is one of the hotter pkt7 packets, since it is guaranteed to happen on every draw. Switch to OUT_PKT() for less driver overhead in the draw path. Slight bit of cheating for using CP_DRAW_INDX_OFFSET_0 for the first dword in all cases. Possibly gen_header.py could be more clever and use typedef's in the cases of bitsets like vgt_draw_initiator. But this works out because it is always the first dword. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4813>	2020-04-30 20:03:17 +00:00
Rob Clark	ee293160d7	freedreno/a6xx: add OUT_PKT() Similar to OUT_REG(), this has the benefits of: 1. No more messing up pkt size 2. Detects errors of mixing up the order of dwords in the packet 3. Optimizes to more efficient code Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4813>	2020-04-30 20:03:17 +00:00
Rob Clark	a142bb8992	freedreno/a6xx: skip unnecessary MRT blend state To lower CP overhead. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4813>	2020-04-30 20:03:17 +00:00
Rob Clark	5d554987c2	freedreno/a6xx: combine sample mask into blend state This gets rid of one lone register we used to emit directly in IB2 whenever blend state changes, at the expense of needing blend state variants when sample-mask changes. I think typically sample-mask should not change frequently, so this seems like a fair trade-off. To further limit the # of variants, we ignore sample-mask bits that are not relavant for the current # of samples. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4813>	2020-04-30 20:03:17 +00:00
Rob Clark	880edb9dc5	freedreno/a6xx: move blend-color to stateobj To reduce CP overhead for draws skipped in a bin. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4813>	2020-04-30 20:03:17 +00:00
Rob Clark	dfa702e94b	freedreno/a6xx: limit LRZ state emit Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4813>	2020-04-30 20:03:17 +00:00
Rob Clark	3c268afd29	freedreno/a6xx: limit PROG_FB_RAST state emit The dependency on RASTERIZER state is only when rasterizer_discard changes. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4813>	2020-04-30 20:03:17 +00:00
Rob Clark	46e177389f	freedreno/a6xx: move scissor state to stateobj To reduce CP overhead for draws skipped in a given tile. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4813>	2020-04-30 20:03:17 +00:00
Rob Clark	8cfa765049	freedreno/a6xx: move const state to single stateobj In practice, we end up updating all the shader stages at the same time. So collapse this into a single group. Reduces CP overhead. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4813>	2020-04-30 20:03:17 +00:00
Rob Clark	89dbdb806f	freedreno/a6xx: avoid unnecessary clearing VS DP state If there is no (potentially unflushed) VS driver-param state, we don't need to emit a DISABLE on each frame. So avoid that to reduce CP overhead. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4813>	2020-04-30 20:03:17 +00:00
Rob Clark	f583dc68e5	freedreno/a6xx: small query cleanup Don't open-code `fd6_event_write()` Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4813>	2020-04-30 20:03:17 +00:00
Tomeu Vizoso	bc11deb86d	panfrost: Don't leak temporary descriptors array As found by Coverity: >>> CID 1462596: Resource leaks (RESOURCE_LEAK) >>> Variable "descriptors" going out of scope leaks the storage it points to. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4724>	2020-04-30 16:27:38 +02:00
Tomeu Vizoso	3c98c452f0	panfrost: Emit blend descriptors on Bifrost Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4724>	2020-04-30 16:27:34 +02:00
Rob Clark	f78af33721	gallium: extract out logicop helper Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4826>	2020-04-30 03:45:12 +00:00
Rob Clark	a0fe98b478	freedreno: fix buffer import `rsc->layout.cpp` is zero until we `fd_resource_layout_init()` Fixes: `5a8718f01b` ("freedreno: Make the slice pitch be bytes, not pixels.") Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4818>	2020-04-29 22:34:25 +00:00
Rob Clark	27cafa9a51	freedreno: switch to simple_mtx Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4810>	2020-04-29 20:37:00 +00:00
Rob Clark	336a8cd82a	freedreno: add screen lock wrappers This will make it easier to swap out to simple_mtx_t Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4810>	2020-04-29 20:37:00 +00:00
Konrad Dybcio	fc66800032	freedreno/a4xx: enable A405 This patch brings support for Adreno A405 as found on MSM8939. That chip is a cut-down version of A4XX IP and requires no special handling. Tested on Asus Zenfone 2 Laser (Z00T) smartphone. Signed-off-by: Konrad Dybcio <konradybcio@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4753>	2020-04-29 19:15:58 +00:00
Mike Blumenkrantz	328cc00d39	iris: handle PIPE_CAP_CLEAR_SCISSORED this allows passing scissored clear calls through the driver where it can be handled by a repclear shader fix kwg/mesa#61 Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4310>	2020-04-29 18:05:06 +00:00
Mike Blumenkrantz	1c8bcad81a	gallium: add pipe cap for scissored clears and pass scissor state to clear() hook this adds a new pipe cap that drivers can support which enables passing buffer clears with scissor test enabled through to be handled by the driver instead of having mesa draw a quad also adjust all existing clear() hooks to have the new parameter Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4310>	2020-04-29 18:05:06 +00:00
Mike Blumenkrantz	91375f13ce	iris: move iris_vtable to iris_screen instead of inlining this into every context, now a struct is used in the screen struct to reduce memory usage and simplify a couple of the methods Closes: https://gitlab.freedesktop.org/kwg/mesa/-/issues/6 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4376>	2020-04-29 16:59:45 +00:00
Marek Olšák	5e31e4b697	ac/surface: add code for gfx10 displayable DCC Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4697>	2020-04-29 14:53:25 +00:00
Marek Olšák	a3dc7fffbb	ac/surface: don't compute DCC if it's unsupported by DCN on gfx9+ Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4697>	2020-04-29 14:53:25 +00:00
Marek Olšák	3dc2ccc14c	ac/surface: replace RADEON_SURF_OPTIMIZE_FOR_SPACE with !FORCE_SWIZZLE_MODE Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4697>	2020-04-29 14:53:25 +00:00
Marek Olšák	f6d87ec8a9	ac/surface: remove RADEON_SURF_TC_COMPATIBLE_HTILE and assume it's always set So that drivers can enable it without worrying how the texture was allocated. v2: reworked the mechanism, hopefully fixes now added Bas Nieuwenhuizen's diff to fix radv Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> (v1) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4697>	2020-04-29 14:53:25 +00:00
Marek Olšák	25d3cc293e	ac/surface: rename micro tile mode enums like gfx10 uses them Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4697>	2020-04-29 14:53:25 +00:00
Danylo Piliaiev	8f0d387441	iris/bufmgr: Check if iris_bo_gem_mmap failed After refactoring of iris_bo_map_cpu and iris_bo_map_wc - immediate return of NULL on failure to mmap a buffer was lost. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2855 Fixes: `5bc3f52dd8` Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4786>	2020-04-29 08:51:33 +00:00
Kenneth Graunke	506414e837	iris: Fix downcast of bound_vertex_buffers from uint64_t to int This is the wrong data type, the original field - and the values we're adding in - are both 64-bit unsigned. Keep the original data type. Thanks to Dave Airlie for finding this while reading the code. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4802>	2020-04-29 06:50:54 +00:00
Rob Clark	6de01faac5	freedreno/a6xx: invalidate tex state cache entries on rebind When a resource's backing bo changes, its seqno will be incremented. Which would result in a new tex state cache key, and nothing to clean up the old tex state until the sampler view/state is destroyed. But in some games, that may never happen, or at least not happen before we run out of memory. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2830 Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4744>	2020-04-29 00:08:57 +00:00
Rob Clark	ca05e6b04d	freedreno: rebind_resource() before bo changes This will matter in the next patch, where we need the original rsc->seqno. It means slight shuffling of where we call rebind_resource() in the `fd_try_shadow_resource()` path. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4744>	2020-04-29 00:08:57 +00:00
Rob Clark	d9e56d8a69	freedreno: rebind resource in all contexts If the resource is rebound, we need to invalidate in all contexts. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4744>	2020-04-29 00:08:57 +00:00
Rob Clark	f12188ff52	freedreno: optimize rebind_resource() Track how resources are used, ie. which state they may potentially dirty if the backing bo is changed/reallocated, to optimize rebind_resource(). This will be more important in a later patch when we hook up eviction of entries in a6xx tex state cache. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4744>	2020-04-29 00:08:57 +00:00
Rob Clark	1e18c58047	freedreno: mark more state dirty when rebinding resources Plus a bonus typo fix. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4744>	2020-04-29 00:08:57 +00:00
Rob Clark	bf97cc9221	freedreno: don't realloc idle bo's The `DISCARD_WHOLE_RESOURCE` is just a hint. And `rebind_resource()` is a bunch of faffing about (and going to get worse in a later patch), so let's not bother when the bo is already idle. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4744>	2020-04-29 00:08:57 +00:00
Rob Clark	938b6ed645	freedreno: small whitespace fix Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4744>	2020-04-29 00:08:57 +00:00
Jan Zielinski	a93b728bc6	gallium/swr: Fix crashes and failures in vertex fetch This commit fixes two problems: - In some cases SWR does not correctly report to Gallium which formats are supported. - Incorrect LLVM instructions are used in vertex fetch in some situations Reviewed-by: Krzysztof Raszkowski <krzysztof.raszkowski@intel.com> Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4788>	2020-04-28 23:53:08 +00:00
Rob Clark	de0d3d1726	freedreno/log-parser: support to read gzip'd logs ~50MB gzip'd log files are nicer than ~300MB uncompressed Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4750>	2020-04-28 23:31:58 +00:00
Rob Clark	f561e516c8	freedreno/a6xx: pre-calculate expected vsc stream sizes We should only rely on overflow detection for indirect draws, where we have no other option. This doesn't use quite the worst-possible-case sizes, which in practice seem to be ~20x larger than what is required. But instead uses roughly half of that. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4750>	2020-04-28 23:31:58 +00:00
Rob Clark	99d802ccc7	freedreno: add helper to estimate # of bins per pipe For vsc size calculation, we need to know the # of bins per pipe. Or at least the worst-case # of bins, assuming we don't eliminate an unused depth/ stencil buffer. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4750>	2020-04-28 23:31:58 +00:00
Rob Clark	a9c255d70c	freedreno/a6xx+tu: rename VSC_DATA/VSC_DATA2 These are the draw-stream and primitive-stream, so lets give them more descriptive names. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4750>	2020-04-28 23:31:58 +00:00
Bas Nieuwenhuizen	8e03cf15f9	radeonsi: Count planes for imported textures. For the DRI2 lowered YUV import separate pipe_resources get created but in the end the first resource just gets asked for NPLANES. Since 1) (Almost) everything uses the first resource + a plane index in the Gallium interface. 2) This mirrors non-imported textures. lets fix this in the driver. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4779>	2020-04-28 11:16:03 +00:00
Gert Wollny	6747a984f5	r600: Enable tesselation for NIR Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4714>	2020-04-28 08:06:33 +00:00
Gert Wollny	b6d4452661	r600/sfn: Add tesselation shaders Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4714>	2020-04-28 08:06:33 +00:00
Gert Wollny	d77b81ce50	r600/sfn: Add lowering passes for Tesselation IO Lower the input and output intrinsics to r600 specific LDS intrinsics Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4714>	2020-04-28 08:06:33 +00:00
Gert Wollny	1b3e103d0b	r600/sfn: Move removing of unused variables It doesn't make sense to do this in the optimization loop Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4714>	2020-04-28 08:06:33 +00:00
Gert Wollny	74e0a0a723	r600/sfn: Handle LDS output in VS Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4714>	2020-04-28 08:06:33 +00:00
Gert Wollny	f102301cc4	r600/sfn: derive the GS from the vertex stage for a common interface The GS can also provide the primid Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4714>	2020-04-28 08:06:33 +00:00
Gert Wollny	f7df2c57a2	r600/sfn: extract class to handle the VS export to different stages This code can be shared with the TESS_EVAL shader Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4714>	2020-04-28 08:06:33 +00:00
Gert Wollny	38038b369f	r600/sfn: Move some shader base methods to the public interface This will be needed for handling the VS stage export better. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4714>	2020-04-28 08:06:33 +00:00
Gert Wollny	93f5f9e584	r600/sfn: Add methods to valuepool to get a vector of values Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4714>	2020-04-28 08:06:33 +00:00
Gert Wollny	7cbca9cf64	r600/sfn: Move emission of barrier from compute shader to shader base Tess shaders also use these barriers. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4714>	2020-04-28 08:06:33 +00:00
Gert Wollny	46a3033b43	r600/sfn: Emit some LDS instructions Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4714>	2020-04-28 08:06:33 +00:00
Gert Wollny	a122303711	r600/sfn: Handle umul24 and umad24 Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4714>	2020-04-28 08:06:33 +00:00
Gert Wollny	7e064659cb	r600/sfn: Add IR instruction to fetch the TESS parameters Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4714>	2020-04-28 08:06:33 +00:00
Gert Wollny	075ea32e48	r600/sfn: Add TF write instruction Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4714>	2020-04-28 08:06:33 +00:00
Gert Wollny	230beac5f8	r600/sfn: Add LDS instruction to assembly conversion Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4714>	2020-04-28 08:06:33 +00:00
Gert Wollny	b9d175bed2	r600/sfn: Add LDS IO instructions to r600 IR Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4714>	2020-04-28 08:06:33 +00:00
Gert Wollny	172868167e	r600/sfn: Don't emit inline constants in the r600 IR This can be handled when lowering to assembly, and it makes testing for indirect buffer and sampler access easier. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4714>	2020-04-28 08:06:33 +00:00
Gert Wollny	9bc6c135ac	r600/sfn: simplify UBO lowering pass Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4714>	2020-04-28 08:06:33 +00:00
Gert Wollny	096a026354	r600: Handle texcoord semantics in LDS index evaluation With NIR the texcoord semantic is enabled, and hence we have to handle index evaluation differently here. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4714>	2020-04-28 08:06:33 +00:00
Icecream95	b4cc116339	panfrost: Fix GL_EXT_vertex_array_bgra Previously, attributes would always use an RGBA swizzle, even if the format was BGRA. Fixes piglit tests bgra-sec-color-pointer and bgra-vert-attrib-pointer. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4752>	2020-04-28 00:20:53 +00:00
Jan Zielinski	4a523baa00	gallium/swr: Fix LLVM 11 compilation issues Changes needed to adapt to LLVM API changes in vector and pointer types. Reviewed-by: Krzysztof Raszkowski <krzysztof.raszkowski@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4769>	2020-04-27 22:29:52 +00:00
Eric Anholt	69c8dfd49f	freedreno: Fix calculation of the const buffer cmdstream size. The HW packet requires padding the number of pointers you emit, and we would assertion fail about running out of buffer space if the number of UBOs to be uploaded was odd. Fixes: `b4df115d3f` ("freedreno/a6xx: pre-calculate userconst stateobj size") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4621>	2020-04-27 22:10:10 +00:00
Mike Blumenkrantz	acc56300dc	zink: explicitly unref old fb object when setting new one this object has a ref from being created, and its lifetime is expected to be a single frame, so remove that initial ref when we expect to stop using it Closes: #2648 Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4768>	2020-04-27 21:55:51 +00:00
Mike Blumenkrantz	d3f0022a43	zink: remove framebuffer cache this can only match when re-rendering identical frames, which is not a typical case. the lack of cache eviction also leads to memory ballooning. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4768>	2020-04-27 21:55:51 +00:00
Bas Nieuwenhuizen	afd9274d48	st/dri: Set next in template instead of after creation. (v2) This should prevent horrors like Iris has with the delayed calls to iris_resource_finish_aux_import just because info is not available at allocation time. AFAICT all drivers just copy the template except radeonsi/r600 which reset the next pointer. AFAICT there is also no other place we get a state tracker setting next ptrs on a resource. v2: Updated Gallium docs. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3792>	2020-04-27 21:08:01 +00:00
Eric Anholt	b34ee185f4	freedreno: Fix derivatives without texturing on a3xx-a5xx. The shader variant tells us if we should set the PIXLODENABLE flag. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4685>	2020-04-27 19:06:57 +00:00
Samuel Pitoiset	42b1696ef6	ac,radeonsi: fix compilations issues with LLVM 11 Latest LLVM replaced LLVMVectorTypeKind. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2826 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Michel Dänzer <mdaenzer@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4755>	2020-04-27 17:13:36 +00:00
Marek Olšák	19eb89b0f3	gallium: add PIPE_CAP_MAP_UNSYNCHRONIZED_THREAD_SAFE for glthread and add radeonsi support. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4758>	2020-04-27 11:56:06 +00:00
Marek Olšák	f2c2a28073	ac: update and document fast math flags used by radeonsi This should have no effect, because we never use FP division, but it's safer for the future. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4696>	2020-04-27 11:20:16 +00:00

... 3 4 5 6 7 ...

28657 Commits