KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Boris Brezillon	046c154585	panfrost: Rename panfrost_stage_attributes() panfrost_stage_attributes() is emitting mali_attr_meta descriptors, so let's rename it accordingly and move it to pan_cmdstream.c. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4083>	2020-03-10 12:47:34 +01:00
Boris Brezillon	dcc0b1ff01	panfrost: Move the mali_attr.src_offset adjustment to a sub-function Create a panfrost_vertex_state_upd_attr_offs() helper to adjust the attr_meta src_offsets. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4083>	2020-03-10 12:47:34 +01:00
Boris Brezillon	575f62ea02	panfrost: Emit attribute descriptors after patching the templates Patching attribute desc when they are in cacheable memory should be more efficient. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4083>	2020-03-10 12:47:34 +01:00
Boris Brezillon	4a2ee61a22	panfrost: Prepare attribute for builtins at state creation time The attribute meta slots reserved for gl_VertexID and gl_InstanceID can be pre-filled at state creation time. Only the index needs to be adjusted when attributes are generated. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4083>	2020-03-10 12:47:34 +01:00
Boris Brezillon	b692ab076a	panfrost: Ignore BO start addr when adjusting src_offset BOs are guaranteed to be aligned on 4K which inherently guarantees the 64 byte alignment. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4083>	2020-03-10 12:47:34 +01:00
Boris Brezillon	128820b886	panfrost: Drop initial mali_attr_meta.src_offset assignment The mali_attr_meta.src_offset is initialized to pipe_vertex_element.src_offset at vertex element creation time, but this field is then adjusted when the descrptors are emitted. Let's use the pipe_vertex_element data we saved earlier and drop this initial assignment. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4083>	2020-03-10 12:47:34 +01:00
Boris Brezillon	528384cb6d	panfrost: Add an helper to emit a pair of vertex/tiler jobs Add the panfrost_emit_vertex_tiler_jobs() helper and use it in panfrost_queue_draw(). Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4083>	2020-03-10 12:47:34 +01:00
Boris Brezillon	8e0a08bc8e	panfrost: Move sampler/tex descs emission helpers to pan_cmdstream.c Move panfrost_upload_texture_descriptors() and panfrost_upload_sampler_descriptors() to pan_cmdstream.c where other cmdstream related helpers live. While at it, change their prototype and name to make it consistent with the other helpers and prepare things for ctx->payloads[] removal. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4083>	2020-03-10 12:47:34 +01:00
Boris Brezillon	2b946a1d2b	panfrost: Add a panfrost_sampler_desc_init() helper It just makes sense to group all HW descriptor initilization logic in pan_cmdstream.c, so let's move this code out of panfrost_create_sampler_state(). Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4083>	2020-03-10 12:47:34 +01:00
Boris Brezillon	b02f97c875	panfrost: Prepare shader_meta descriptors at emission time This way we avoid potential state leaks and keep the shader_meta initialization in once place. The time spent preparing the shader descriptors should be negligible compared to the time spent pushing those descriptors to the transient buffer (remember we are writing to non-cacheable memory here). Note that we might get back to some sort of shader_meta descriptor caching at some point if that proves necessary, but now we have those panfrost_frag_meta_xxx_update() helpers now where xxx maps directly to a CSO bind, which should ease desc template updates. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4083>	2020-03-10 12:47:34 +01:00
Boris Brezillon	55e014336f	panfrost: Prepare things to get rid of panfrost_shader_state.tripipe panfrost_shader_state.tripipe is used as a template for shader_meta desc emission, but shader_meta desc preparation time should be negligible compared to desc emission time (remember we are writing to non-cacheable memory here). Let's prepare for generating the the shader_meta desc entirely at draw time by adding the necessary fields to panfrost_shader_state. Note that we might brink back some sort of shader_meta desc caching at some point, but let's simplify things a bit for now. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4083>	2020-03-10 12:47:34 +01:00
Boris Brezillon	e94076f8f5	panfrost: Add an helper to update the rasterizer part of a tiler job desc That's part of our attempt to make panfrost_emit_for_draw() a bit more dry and eventually get rid of it by inlining the code in panfrost_draw_vbo(). This is just one step in this direction. Note that we get rid of the panfrost_rasterizer.tiler_gl_enables field along the way, as setting/clearing those bits at draw time instead of doing when the state is created should make a huge difference. We might get back to pre-computed VT descs at some point, but let's keep things simple for now. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4083>	2020-03-10 12:47:34 +01:00
Boris Brezillon	56aeb921e9	panfrost: Add an helper to update the occclusion query part of a tiler job desc That's part of our attempt to make panfrost_emit_for_draw() a bit more dry and eventually get rid of it by inlining the code in panfrost_draw_vbo(). This is just one step in this direction. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4083>	2020-03-10 12:47:34 +01:00
Boris Brezillon	5f043cc776	panfrost: Simplify panfrost_emit_for_draw() and make it private Now that panfrost_launch_grid() no longer calls panfrost_emit_for_draw(), we can keep it private to pan_context.c and drop all compute-related stuff. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4083>	2020-03-10 12:47:34 +01:00
Boris Brezillon	8ac17139b1	panfrost: Stop using panfrost_emit_for_draw() for compute jobs We actually need a small subset of what's done in panfrost_emit_for_draw() when emitting compute jobs, so let's copy what we need directly in panfrost_launch_grid() instead of re-using this function whose initial purpose was to generate vertex/tiler jobs for draw operations. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4083>	2020-03-10 12:47:34 +01:00
Boris Brezillon	0d75eb002e	panfrost: Move panfrost_attach_vt_framebuffer() to pan_cmdstream.c Move panfrost_attach_vt_framebuffer() to pan_cmdstream.c and change its name to panfrost_vt_attach_framebuffer() so we can use a consistent prefix (panfrost_vt_) for all helpers initializing/updating midgard_payload_vertex_tiler fields. Note that the function only initializes one VT object now. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4083>	2020-03-10 12:47:34 +01:00
Boris Brezillon	5d33d42b4d	panfrost: Dissociate shader meta patching from the desc emission Right now we emit two shader descriptors for the fragment shader, one when panfrost_patch_shader_state() is called, and the final one including both the shader_meta and the blend RT descriptors. The first generated fragment shader descriptor is never used, since the second one overrides the postfix.shader pointer. Let's dissociate the state patching logic from the descriptor emission so we don't upload descriptors that are never used. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4083>	2020-03-10 12:47:33 +01:00
Boris Brezillon	36725be4d9	panfrost: Move shared mem desc emission out of panfrost_launch_grid() Let's move the shared memory descriptor emission to a dedicated function living with its pairs in pan_cmdstream.c. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4083>	2020-03-10 12:47:33 +01:00
Boris Brezillon	0b735a2d80	panfrost: Move the const buf emission logic out of panfrost_emit_for_draw() Let's move the constant buffer emission logic in a dedicated helper to make panfrost_emit_for_draw() a bit more dry. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4083>	2020-03-10 12:47:33 +01:00
Boris Brezillon	a72bab1c3e	panfrost: Move viewport desc emission out of panfrost_emit_for_draw() Let's move the viewport descriptor emission logic to a dedicated helper in order to shrink a bit the panfrost_emit_for_draw(). Note that this helper is placed in a new pan_cmdstream.c file where we will group all cmdstream related helpers (everything that's related to HW descriptor initialization emission). Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4083>	2020-03-10 12:47:33 +01:00
Boris Brezillon	79f8850527	panfrost: Move the batch stack size adjustment out of panfrost_queue_draw() That's part of our attempt to sanitize panfrost_queue_draw(), panfrost_draw_vbo() and panfrost_emit_for_draw(). The new panfrost_batch_adjust_stack_size() helper is placed in pan_job.c, where all batch related functions live. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4083>	2020-03-10 12:47:33 +01:00
Boris Brezillon	b28f4bb67c	panfrost: Add an helper to retrieve the currently active shader state Doing that improves readability and helps avoiding code duplication. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4083>	2020-03-10 12:47:33 +01:00
Boris Brezillon	a0402f7960	panfrost: Assign primitive_size.pointer only if writes_point_size() returns true Checking vs->writes_point_size is not enough, as we might have a vertex shader writing point size, but a primitive that's not MALI_POINT. That currently works because emit_varying_descriptor() is called before the primitive_size.constant field is update, but let's make the logic more robust, just in case things are re-ordered at some point. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4083>	2020-03-10 12:47:33 +01:00
Samuel Pitoiset	24db276d11	radv/sqtt: describe pipeline and wait events barriers Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4031> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4031>	2020-03-10 10:05:40 +01:00
Samuel Pitoiset	c04e9befc0	radv/rgp: bump the instrumentation spec version to 1 RGP expects the version to be 1, otherwise it doesn't display the barriers (including layout transitions) correctly. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4031>	2020-03-10 10:05:40 +01:00
Samuel Pitoiset	ac0d5b6b11	radv/sqtt: describe render pass color/depthstencil clears Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4031>	2020-03-10 10:05:40 +01:00
Samuel Pitoiset	b829fbb7f0	radv/sqtt: describe draw/dispatch and emit event markers Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4031>	2020-03-10 10:05:40 +01:00
Samuel Pitoiset	dcfc08f5b8	radv/sqtt: describe begin/end command buffers with user markers Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4031>	2020-03-10 09:58:02 +01:00
Samuel Pitoiset	31ecf0b17d	radv: initial implementation of the driver internal layer SQTT This layer is used to emit SQTT user markers to command buffers. It currently only emits API markers but it will consolidated soon with barrier markers and more. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4031>	2020-03-10 09:57:59 +01:00
Samuel Pitoiset	be700775dc	radv/sqtt: add a helper that emits thread trace userdata markers Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4031>	2020-03-10 09:57:56 +01:00
Samuel Pitoiset	f4fbcfe818	radv: use device entrypoints from the SQTT layer if enabled This allows to override RADV device entrypoints if the prefix is 'sqtt' instead of 'radv'. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4031>	2020-03-10 09:57:53 +01:00
Samuel Pitoiset	9c88e4a272	radv/entrypoints: declare a driver internal layer for SQTT Some Vulkan commands will be overriden to emit user SQTT markers. These markers are then used by the Radeon GPU Profiler to display timings, barrier operations (cache flushes, pipeline stalls, layout transitions) and more. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4031>	2020-03-10 09:57:49 +01:00
Boris Brezillon	a64599a303	panfrost: Pass the sampler view format when creating a tex descriptor A sampler can use a different format than the native texture format. Let's pass the sampler format instead of the native texture format when creating a texture descriptor. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4101> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4101>	2020-03-10 08:43:08 +01:00
Boris Brezillon	ce845f44e9	Revert "panfrost: Z24 variants should be sampled as R32UI" Commit `0406ea4856` ("panfrost: Z24 variants should be sampled as R32UI") causes a regression when depth textures are sampled. It's still not clear how MALI_Z32 can work for for Z32 and Z24{S,X}8, but let's leave that question for later. Reported-by: Icecream95 <ixn@keemail.me> Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4101>	2020-03-10 08:42:05 +01:00
Tomeu Vizoso	8d0ec5b8a6	gallium: Add forgotten docs for new CAPs related to transform feedback These three caps were missing docs. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4115> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4115>	2020-03-10 07:26:13 +01:00
Vasily Khoruzhick	251c6991a3	lima: enable minmax cache for index buffers Re-use minmax cache for index buffers from panfrost. Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4051> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4051>	2020-03-10 02:41:27 +00:00
Vasily Khoruzhick	53d6bb9fc6	panfrost: split index cache into shared part Split it into shared part since we're going to re-use it in lima. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4051>	2020-03-10 02:41:27 +00:00
Marek Olšák	040a7117c3	st/mesa: fix a possible crash with selection and feedback modes The index bounds are always valid without an index buffer, but they won't be. Reviewed-by: Dave Airlie <airlied@redhat.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3986> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3986>	2020-03-09 21:26:55 -04:00
Marek Olšák	7b0e043d48	st/mesa: flush the bitmap cache before st/dri and vbo flushes Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3986>	2020-03-09 21:26:55 -04:00
Francisco Jerez	45d4665dc7	intel/fs: Fix workaround for VxH indirect addressing bug under control flow. The current workaround for this hardware bug involved marking the ADD instruction used to initialize the address register as NoMask on Gen12, which was based on the assumption that the problem was caused by a hardware bug affecting the application of the execution mask to the address register write. However that doesn't seem to be the case: The address register write was working correctly, the real problem leading to hangs on TGL is that the indirect addressing logic is unable to deal with garbage values in the address register (e.g. misaligned offsets), even for channels which are currently inactive due to non-uniform control flow. The current workaround isn't able to avoid that situation in general, since the result of the NoMask ADD instruction for a dead channel is calculated based on the corresponding (dead) component of the indirect_byte_offset source, which would still be undefined in the likely case that the source was initialized under control flow itself. This would lead to hangs whenever MOV_INDIRECT was used under non-uniform control flow in some scenarios like a tessellation shader from GFXBench5/gl_4 (AKA Car Chase) on TGL. In addition I've managed to reproduce the same issue on earlier platforms by initializing the whole address register with garbage before the ADD instruction, so this seems to be a long-standing issue we have avoided mostly by luck. This patch fixes the problem and applies the workaround to all platforms, since even when the hardware is able to deal with garbage address values without hanging there might be a significant performance cost from reading random GRF registers due to the useless extra EU cycles spent fetching registers for dead channels and due to the potential for unintended serialization with respect to other random instructions that could be executed in parallel, which may have had a cost of the order of hundreds of cycles in the worst case scenario. Fixes: `f93dfb509c` "intel/fs: Write the address register with NoMask for MOV_INDIRECT" Tested-by: Rafael Antognolli <rafael.antognolli@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2020-03-10 00:42:50 +00:00
Ian Romanick	c144875f62	intel/fs: Allow NOT instructions in conditional discard optimization I don't know why I explicitly disallowed NOT in the first place. :( All Intel platforms had similar results. (Ice Lake shown) total instructions in shared programs: 14549846 -> 14549770 (<.01%) instructions in affected programs: 12934 -> 12858 (-0.59%) helped: 76 HURT: 0 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 0.13% max: 5.56% x̄: 1.04% x̃: 0.90% 95% mean confidence interval for instructions value: -1.00 -1.00 95% mean confidence interval for instructions %-change: -1.25% -0.84% Instructions are helped. total cycles in shared programs: 203793967 -> 203792696 (<.01%) cycles in affected programs: 77920 -> 76649 (-1.63%) helped: 67 HURT: 1 helped stats (abs) min: 2 max: 36 x̄: 19.00 x̃: 16 helped stats (rel) min: 0.04% max: 4.68% x̄: 2.35% x̃: 2.28% HURT stats (abs) min: 2 max: 2 x̄: 2.00 x̃: 2 HURT stats (rel) min: 0.03% max: 0.03% x̄: 0.03% x̃: 0.03% 95% mean confidence interval for cycles value: -20.75 -16.63 95% mean confidence interval for cycles %-change: -2.57% -2.05% Cycles are helped. Reviewed-by: Matt Turner <mattst88@gmail.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3965> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3965>	2020-03-09 16:46:28 -07:00
Ian Romanick	ba2fa1ceaf	intel/fs: Do cmod prop again after scheduling Pre-RA scheduling can create more opportunities for CMOD propagation. This takes advantage of that. It may be worth doing this again in post-RA scheduling, but there are additional problems there. I'm a little torn about the use of the OPT() macro. On the one hand, it would be confusing to see dumps from INTEL_DEBUG=optimizer that don't match the final output. On the other hand, since register allocation can fail, the same pass can be run multiple times. Each time one or both passes might or might not make progress. This would also lead to incongruous, confusing output. Ice Lake total instructions in shared programs: 14549808 -> 14548529 (<.01%) instructions in affected programs: 231985 -> 230706 (-0.55%) helped: 632 HURT: 0 helped stats (abs) min: 1 max: 32 x̄: 2.02 x̃: 1 helped stats (rel) min: 0.05% max: 2.56% x̄: 0.57% x̃: 0.41% 95% mean confidence interval for instructions value: -2.25 -1.79 95% mean confidence interval for instructions %-change: -0.61% -0.54% Instructions are helped. total cycles in shared programs: 203770850 -> 203776599 (<.01%) cycles in affected programs: 2495653 -> 2501402 (0.23%) helped: 282 HURT: 197 helped stats (abs) min: 1 max: 242 x̄: 20.37 x̃: 16 helped stats (rel) min: <.01% max: 11.65% x̄: 0.91% x̃: 0.64% HURT stats (abs) min: 2 max: 609 x̄: 58.35 x̃: 20 HURT stats (rel) min: <.01% max: 10.97% x̄: 1.35% x̃: 0.66% 95% mean confidence interval for cycles value: 5.27 18.73 95% mean confidence interval for cycles %-change: -0.16% 0.21% Inconclusive result (%-change mean confidence interval includes 0). LOST: 0 GAINED: 2 Skylake total instructions in shared programs: 13447708 -> 13446594 (<.01%) instructions in affected programs: 216813 -> 215699 (-0.51%) helped: 623 HURT: 0 helped stats (abs) min: 1 max: 32 x̄: 1.79 x̃: 1 helped stats (rel) min: 0.06% max: 2.86% x̄: 0.59% x̃: 0.42% 95% mean confidence interval for instructions value: -1.99 -1.59 95% mean confidence interval for instructions %-change: -0.63% -0.55% Instructions are helped. total cycles in shared programs: 193759224 -> 193762726 (<.01%) cycles in affected programs: 2540035 -> 2543537 (0.14%) helped: 249 HURT: 190 helped stats (abs) min: 2 max: 196 x̄: 16.67 x̃: 14 helped stats (rel) min: <.01% max: 4.71% x̄: 0.66% x̃: 0.62% HURT stats (abs) min: 2 max: 614 x̄: 40.27 x̃: 14 HURT stats (rel) min: 0.02% max: 5.78% x̄: 0.86% x̃: 0.37% 95% mean confidence interval for cycles value: 2.57 13.39 95% mean confidence interval for cycles %-change: -0.11% 0.11% Inconclusive result (%-change mean confidence interval includes 0). LOST: 0 GAINED: 1 Broadwell total instructions in shared programs: 13418631 -> 13417393 (<.01%) instructions in affected programs: 243192 -> 241954 (-0.51%) helped: 694 HURT: 0 helped stats (abs) min: 1 max: 31 x̄: 1.78 x̃: 1 helped stats (rel) min: 0.06% max: 2.86% x̄: 0.59% x̃: 0.44% 95% mean confidence interval for instructions value: -1.95 -1.62 95% mean confidence interval for instructions %-change: -0.62% -0.55% Instructions are helped. total cycles in shared programs: 200822940 -> 200829128 (<.01%) cycles in affected programs: 2128651 -> 2134839 (0.29%) helped: 251 HURT: 226 helped stats (abs) min: 1 max: 200 x̄: 14.32 x̃: 12 helped stats (rel) min: <.01% max: 3.56% x̄: 0.60% x̃: 0.50% HURT stats (abs) min: 2 max: 611 x̄: 43.28 x̃: 18 HURT stats (rel) min: 0.02% max: 7.03% x̄: 0.93% x̃: 0.54% 95% mean confidence interval for cycles value: 7.44 18.50 95% mean confidence interval for cycles %-change: 0.02% 0.23% Cycles are HURT. Haswell and Ivy Bridge had similar results. (Haswell shown) total instructions in shared programs: 11569710 -> 11568829 (<.01%) instructions in affected programs: 147862 -> 146981 (-0.60%) helped: 487 HURT: 0 helped stats (abs) min: 1 max: 34 x̄: 1.81 x̃: 1 helped stats (rel) min: 0.12% max: 4.75% x̄: 0.57% x̃: 0.45% 95% mean confidence interval for instructions value: -2.03 -1.59 95% mean confidence interval for instructions %-change: -0.61% -0.54% Instructions are helped. total cycles in shared programs: 187079425 -> 187079437 (<.01%) cycles in affected programs: 1088494 -> 1088506 (<.01%) helped: 234 HURT: 124 helped stats (abs) min: 2 max: 282 x̄: 22.66 x̃: 16 helped stats (rel) min: 0.03% max: 7.88% x̄: 0.93% x̃: 0.75% HURT stats (abs) min: 1 max: 276 x̄: 42.86 x̃: 20 HURT stats (rel) min: 0.03% max: 6.70% x̄: 0.99% x̃: 0.53% 95% mean confidence interval for cycles value: -5.54 5.61 95% mean confidence interval for cycles %-change: -0.41% -0.11% Inconclusive result (value mean confidence interval includes 0). total spills in shared programs: 7746 -> 7740 (-0.08%) spills in affected programs: 6 -> 0 helped: 1 HURT: 0 total fills in shared programs: 6264 -> 6258 (-0.10%) fills in affected programs: 6 -> 0 helped: 1 HURT: 0 Sandy Bridge total instructions in shared programs: 10688576 -> 10688177 (<.01%) instructions in affected programs: 137875 -> 137476 (-0.29%) helped: 358 HURT: 0 helped stats (abs) min: 1 max: 9 x̄: 1.11 x̃: 1 helped stats (rel) min: 0.15% max: 1.43% x̄: 0.35% x̃: 0.28% 95% mean confidence interval for instructions value: -1.18 -1.05 95% mean confidence interval for instructions %-change: -0.37% -0.32% Instructions are helped. total cycles in shared programs: 153397144 -> 153393046 (<.01%) cycles in affected programs: 1220713 -> 1216615 (-0.34%) helped: 255 HURT: 31 helped stats (abs) min: 1 max: 304 x̄: 16.71 x̃: 16 helped stats (rel) min: <.01% max: 6.70% x̄: 0.41% x̃: 0.31% HURT stats (abs) min: 1 max: 41 x̄: 5.29 x̃: 3 HURT stats (rel) min: 0.02% max: 0.65% x̄: 0.16% x̃: 0.11% 95% mean confidence interval for cycles value: -17.44 -11.22 95% mean confidence interval for cycles %-change: -0.40% -0.29% Cycles are helped. Iron Lake total instructions in shared programs: 8106894 -> 8105529 (-0.02%) instructions in affected programs: 287197 -> 285832 (-0.48%) helped: 1099 HURT: 0 helped stats (abs) min: 1 max: 10 x̄: 1.24 x̃: 1 helped stats (rel) min: 0.16% max: 4.55% x̄: 0.67% x̃: 0.61% 95% mean confidence interval for instructions value: -1.29 -1.19 95% mean confidence interval for instructions %-change: -0.70% -0.64% Instructions are helped. total cycles in shared programs: 188347022 -> 188344266 (<.01%) cycles in affected programs: 3740632 -> 3737876 (-0.07%) helped: 758 HURT: 10 helped stats (abs) min: 2 max: 38 x̄: 3.68 x̃: 2 helped stats (rel) min: <.01% max: 1.00% x̄: 0.12% x̃: 0.08% HURT stats (abs) min: 2 max: 4 x̄: 3.20 x̃: 4 HURT stats (rel) min: 0.03% max: 0.07% x̄: 0.06% x̃: 0.07% 95% mean confidence interval for cycles value: -3.82 -3.35 95% mean confidence interval for cycles %-change: -0.13% -0.11% Cycles are helped. GM45 total instructions in shared programs: 4985449 -> 4984768 (-0.01%) instructions in affected programs: 145154 -> 144473 (-0.47%) helped: 547 HURT: 0 helped stats (abs) min: 1 max: 10 x̄: 1.24 x̃: 1 helped stats (rel) min: 0.16% max: 2.86% x̄: 0.66% x̃: 0.61% 95% mean confidence interval for instructions value: -1.31 -1.18 95% mean confidence interval for instructions %-change: -0.69% -0.62% Instructions are helped. total cycles in shared programs: 128835062 -> 128833144 (<.01%) cycles in affected programs: 2720650 -> 2718732 (-0.07%) helped: 517 HURT: 1 helped stats (abs) min: 2 max: 38 x̄: 3.71 x̃: 2 helped stats (rel) min: <.01% max: 0.89% x̄: 0.11% x̃: 0.07% HURT stats (abs) min: 2 max: 2 x̄: 2.00 x̃: 2 HURT stats (rel) min: 0.04% max: 0.04% x̄: 0.04% x̃: 0.04% 95% mean confidence interval for cycles value: -4.02 -3.39 95% mean confidence interval for cycles %-change: -0.12% -0.10% Cycles are helped. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3965>	2020-03-09 16:46:19 -07:00
Eric Engestrom	461ee85248	docs: update calendar, add news item, and link releases notes for 19.3.5 Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4121> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4121>	2020-03-09 23:04:36 +00:00
Eric Engestrom	b06471b77d	docs: add release notes for 19.3.5 Signed-off-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4121>	2020-03-09 23:04:36 +00:00
Vinson Lee	5ffa6eab88	st/nine: Fix incompatible-pointer-types-discards-qualifiers errors. ../src/gallium/state_trackers/nine/nine_ff.c:129:28: error: initializing 'struct nine_ff_vs_key ' with an expression of type 'const void ' discards qualifiers [-Werror,-Wincompatible-pointer-types-discards-qualifiers] struct nine_ff_vs_key vs = key; ^ ~~~ ../src/gallium/state_trackers/nine/nine_ff.c:145:28: error: initializing 'struct nine_ff_ps_key ' with an expression of type 'const void ' discards qualifiers [-Werror,-Wincompatible-pointer-types-discards-qualifiers] struct nine_ff_ps_key ps = key; ^ ~~~ Fixes: `fdd96578ef` ("nine: Add state tracker nine for Direct3D9 (v3)") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Andre Heider <a.heider@gmail.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4015> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4015>	2020-03-09 15:37:54 -07:00
Marek Olšák	c1b8e84961	radeonsi: determine uses_bindless_samplers correctly Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4079> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4079>	2020-03-09 16:08:14 -04:00
Marek Olšák	fc65df5651	ac: add a bug workaround for the 100% NGG culling case Fixes: `8db00a51f8` - radeonsi/gfx10: implement NGG culling for 4x wave32 subgroups Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4079>	2020-03-09 16:08:11 -04:00
Marek Olšák	7481c4be58	radeonsi: add a bug workaround for NGG - LATE_ALLOC_GS Cc: 19.3 20.0 <mesa-stable@lists.freedesktop.org> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4079>	2020-03-09 16:08:10 -04:00
Sonny Jiang	5ea2034f58	radeonsi: enable EXT_texture_shadow_lod Signed-off-by: Sonny Jiang <sonny.jiang@amd.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4079>	2020-03-09 16:08:07 -04:00
Chia-I Wu	f3728816af	egl/android: require ANDROID_native_fence_sync for buffer age Querying buffer age requires a buffer to be dequeued. But dequeuing without ANDROID_native_fence_sync might imply eglClientWaitSync, which results in a deadlock as the display lock is already held by eglQuerySurface. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/221> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/221>	2020-03-09 18:27:11 +00:00

... 2 3 4 5 6 ...

121174 Commits All Branches Search

121174 Commits

All Branches