KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Guilherme Gallo	4783e55039	ci/lava: Add `slow` pytest marker Mark test_full_yaml_log with this new marker to be easily run by the developers. Make `debian-testing` skip this test with `not slow` marker hint. Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17389>	2022-07-08 12:26:05 +00:00
Guilherme Gallo	84abb3df13	ci/lava: Color red for fatal and yellow for warning Fatal errors now have red foreground color and retry messages yellow one. Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17389>	2022-07-08 12:26:05 +00:00
Guilherme Gallo	daff21ef55	ci/lava: Make hung job status yellow It will help to know what happened to a non-successful job. Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17389>	2022-07-08 12:26:05 +00:00
Guilherme Gallo	2c51b7a9c9	ci/lava: Detect R8152 issues preemptively and retry Implement a log-based retry hint for R8152 issue described in #6681, which is based on detecting these two consecutive lines: ``` r8152 <USB> eth0: Tx status -71 nfs: server <IP> not responding, still trying ``` Where <IP> and <USB> could be any IP and USB addresses, respectfully. This commit is a temporary fix since it requires a section-aware log follower, implemented in !16323. When the cited MR is merged, one will make a proper fix on top of that. Closes: #6681 Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17389>	2022-07-08 12:26:05 +00:00
Guilherme Gallo	45a4b01427	ci/lava: Split lava_log into modules This script is getting too big, it been hard to extend it. Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17389>	2022-07-08 12:26:05 +00:00
Mike Blumenkrantz	2f3a233b6f	zink: flush pending clears for fb texture barriers if a texture barrier occurs while clears are pending, these clears should show up if the fb attachments are read in shaders, so trigger a renderpass to flush out the clears cc: mesa-stable fixes #6766 fixes (radv): dEQP-GLES3.functional.draw_buffers_indexed.overwrite_common.common_advanced_blend_eq_buffer_advanced_blend_eq dEQP-GLES3.functional.draw_buffers_indexed.overwrite_common.common_blend_eq_buffer_advanced_blend_eq dEQP-GLES3.functional.draw_buffers_indexed.overwrite_common.common_separate_blend_eq_buffer_advanced_blend_eq dEQP-GLES3.functional.draw_buffers_indexed.overwrite_indexed.common_advanced_blend_eq_buffer_advanced_blend_eq dEQP-GLES3.functional.draw_buffers_indexed.overwrite_indexed.common_advanced_blend_eq_buffer_blend_eq dEQP-GLES3.functional.draw_buffers_indexed.overwrite_indexed.common_advanced_blend_eq_buffer_separate_blend_eq Reviewed-By: Tatsuyuki Ishi <ishitatsuyuki@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17363>	2022-07-08 11:58:11 +00:00
Samuel Pitoiset	6517a2b926	radv: fix dumping VS prologs assembly This got removed by mistake and broke RADV_DEBUG=shaders,nocache,prologs. Fixes: `9fe2b6b748` ("aco/radv: provide a vs prolog callback from aco to radv.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17413>	2022-07-08 10:58:33 +00:00
Tatsuyuki Ishi	768cd5715d	radv: Fix vkCmdCopyQueryResults -> vkCmdResetPool hazard. The Vulkan specification states: > Query commands, for the same query and submitted to the same queue, > execute in their entirety in submission order, relative to each other. In > effect there is an implicit execution dependency from each such query > command to all query commands previously submitted to the same queue. Fixes dEQP-VK.query_pool.statistics_query.reset_after_copy.* Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17400>	2022-07-08 10:35:11 +00:00
Georg Lehmann	4f5e25ea8d	aco/assembler: Fix s_bitreplicate_b64_b32 on GFX9. This seems to be a relic from before aco added per generation opcodes. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17405>	2022-07-08 10:09:19 +00:00
Georg Lehmann	68db0a079b	aco: Fix swapping sources in SOPC -> SOPK optimization. Fixes: `2d6b0a4177` ("aco/optimizer: Optimize SOPC with literal to SOPK.") Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17407>	2022-07-08 09:43:51 +00:00
Georg Lehmann	27526ffad1	r600/sfn: Add missing std::array include. Fixes: `79ca456b48` ("r600/sfn: rewrite NIR backend") Closes https://gitlab.freedesktop.org/mesa/mesa/-/issues/6824 Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17412>	2022-07-08 09:34:53 +00:00
Pierre-Eric Pelloux-Prayer	01314d0880	radeonsi: use LLVMBuildLoad2 for inter-stage outputs loads The PS case was covered by the previous commit, so we can use f32 everywhere. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17361>	2022-07-08 08:41:26 +00:00
Pierre-Eric Pelloux-Prayer	248781dea1	radeonsi: use LLVMBuildLoad2 in llvm PS PS is the only shader type where unpacked 16-bit outputs can be used, so use ac_shader_abi::is_16bit to pass the proper type to LLVMBuildLoad2. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17361>	2022-07-08 08:41:26 +00:00
Pierre-Eric Pelloux-Prayer	326c042491	ac/llvm: use LLVMBuildLoad2 in visit_load Only FS can have f16 outputs, so always use f32 here. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17361>	2022-07-08 08:41:25 +00:00
Pierre-Eric Pelloux-Prayer	dc8d82516b	ac/llvm: handle opaque pointers in visit_store_output Outputs are always f32 or f16. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17361>	2022-07-08 08:41:25 +00:00
Pierre-Eric Pelloux-Prayer	196c4ebe1a	ac: add per output is_16bit flag to ac_shader_abi Outputs are always f32 except for FS that may use unpacked f16. Store this information here to make it available to later processing. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17361>	2022-07-08 08:41:25 +00:00
Pierre-Eric Pelloux-Prayer	c275e69cee	radeonsi: use LLVMBuildLoad2 where possible This commit replaces LLVMBuildLoad usage by LLVMBuildLoad2 where possible (= where the pointee type is known). Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17361>	2022-07-08 08:41:25 +00:00
Pierre-Eric Pelloux-Prayer	940734630d	ac: use LLVMContextSetOpaquePointers if available Disabling opaque pointers in LLVM doesn't fix all the issues but it makes pointers non-opaque by default (eg LLVMPointerType() returns a typed pointer). Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17361>	2022-07-08 08:41:25 +00:00
Danylo Piliaiev	d9296dcbbf	zink: re-enable EXT_primitives_generated_query for Turnip https://gitlab.freedesktop.org/mesa/mesa/-/issues/6602 is resolved so the extension could be re-enabled. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17163>	2022-07-08 11:15:20 +03:00
Danylo Piliaiev	bf4c160909	tu: Fix prim gen query and pipeline stats query interaction Fixed: - VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT was able to stop prim counter when pipeline stats query is running. - This may have happened when prim gen query was in secondary cmdbuf. - VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT counting geometry in each tile. - VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT counting geometry in each tile when pipeline stats query is started inside prim gen query and inside a renderpass. The matter of VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT and pipeline stats interaction is solved by tracking whether pipeline stats query is running both on CPU (for non secondary cmdbuf case) and on GPU (for secondary cmdbuf). Note, prim gen query is not allowed with secondary command buffers, so only pipeline stats query is tracked on gpu. See https://gitlab.khronos.org/vulkan/vulkan/-/issues/3142 Counting geometry per each tile is solved by: - Conditionally executing START/STOP_PRIMITIVE_CTRS to not run in tiling pass. Solves the case when prim gen query is inside a renderpass. - Stop prim counters before executing `draw_cs` and restarting them afterwards. Solves prim gen query being outside a renderpass. Fixes GL CTS tests with Zink + `TU_DEBUG=gmem`: GTF-GL46.gtf30.GL3Tests.transform_feedback.transform_feedback_max_separate GTF-GL46.gtf40.GL3Tests.transform_feedback2.transform_feedback2_basic GTF-GL46.gtf40.GL3Tests.transform_feedback2.transform_feedback2_framebuffer GTF-GL46.gtf40.GL3Tests.transform_feedback3.transform_feedback3_streams_overflow GTF-GL46.gtf40.GL3Tests.transform_feedback3.transform_feedback3_streams_queried GTF-GL46.gtf40.GL3Tests.transform_feedback2.transform_feedback2_states Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6602 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17163>	2022-07-08 11:14:18 +03:00
Danylo Piliaiev	465e7c303b	tu,freedreno: Refactored START/STOP events for pipeline stats For a5xx+ renamed: - RST_PIX_CNT -> START_FRAGMENT_CTRS - RST_VTX_CNT -> STOP_FRAGMENT_CTRS - TILE_FLUSH -> START_COMPUTE_CTRS - STAT_EVENT -> STOP_COMPUTE_CTRS I'm not sure about a5xx itself but I'll take a chance of it being similar to a6xx in this regard. Knowing this emit_begin_stat_query/emit_end_stat_query can now emit only events that are needed for the pool's flags. Also primitive generated query clearly doesn't need fragment and compute counters. Passes tests: dEQP-VK.query_pool.statistics_query.* dEQP-VK.transform_feedback.primitives_generated_query.* Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17163>	2022-07-08 11:14:18 +03:00
Samuel Pitoiset	cf46397aec	aco: fix load_barycentric_at_sample without MSAA It's legal to use this instruction in a fragment shader, even if the graphics pipeline doesn't use MSAA. Fixes dEQP-GLES31.functional.shaders.multisample_interpolation.interpolate_at_sample.non_multisample_buffer.sample_n_*. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17398>	2022-07-08 07:28:33 +00:00
Iago Toral Quiroga	2071804f33	nir/serialize: fix missing divergence info after deserialization Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17397>	2022-07-08 06:15:28 +00:00
Igor Torrente	48be8a9875	venus: Use maintenance4 to get max_size_buffer This should help speedup the device initalization process. Signed-off-by: Igor Torrente <igor.torrente@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17399>	2022-07-08 04:27:51 +00:00
Igor Torrente	1893c81521	venus: Add support to VK_KHR_maintenance4 extension Signed-off-by: Igor Torrente <igor.torrente@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17399>	2022-07-08 04:27:51 +00:00
Jesse Natalie	b16f9f8ba4	dzn: Only support high/normal queue priorities D3D uses an int which seems like it'd support value between 0 and 100, but in reality it only accepts values of exactly 0, or 100. The space is left in case future values were to be added, so that comparisons would work (e.g. MEDIUM_HIGH < HIGH). Treat higher than 0.5 to be HIGH, and anything less to be NORMAL. Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17406>	2022-07-08 03:09:11 +00:00
Alyssa Rosenzweig	638b22354e	panfrost: Clear with a quad to avoid flushing Flushing the batch midframe (splitting a renderpass) is expensive on a tiler, as it requires the GPU to flush the framebuffer contents to main memory and read them back. Clearing the framebuffer should not trigger a flush. Apps expect clears to be (almost) free, flushing for a clear is at the very least unexpected behaviour. The only reason we previously flushed is to ensure we could always use a "fast" clear. But a slow clear is a heck of a lot faster than a flush ;-) Instead of flushing, we should clear with a draw (via u_blitter) in case a fast clear isn't possible. This fixes pathological performance for applications that rely on partial clears within a frame. This issue was identified with Inochi2D, which repeatedly clears the stencil buffer midframe, in order to implement masking efficiently with the stencil buffer. In total, the all-important workload of rendering Asahi Lina is improved from 17fps to 29fps on a panfrost device. Fixes: `c138ca80d2` ("panfrost: Make sure a clear does not re-use a pre-existing batch") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17112>	2022-07-08 01:50:03 +00:00
Alyssa Rosenzweig	35a80418a1	panfrost/ci: Disable 0ad trace on T860 The last few frames of the trace are expensive (in terms of GPU time) and are close to hitting the timeout. With the next commit, they do hit the timeout due to using a larger batch. Nevertheless the next commit should be an overall perf improvement on average, so this remove to unblock CI. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Suggested-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17112>	2022-07-08 01:50:03 +00:00
Icecream95	91d9a34925	pan/decode: Change indent when decoding resources Make the separation between entries in the resource table more obvious. Increase the indent by two levels to keep descriptors distinct from the resource entry itself. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17371>	2022-07-08 01:30:23 +00:00
Icecream95	e05889c8c9	pan/decode: Use tag bits for resource entry count Fixes crashes when decoding the blob, which sometimes uses fewer than 9 entries. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17371>	2022-07-08 01:30:23 +00:00
Icecream95	f7da4eade4	pan/decode: fflush buffers after dumping and before aborts Otherwise trace files or other files being written (dEQP TestResults?) might be truncated. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17371>	2022-07-08 01:30:23 +00:00
Icecream95	bcd85a74e8	pan/va: Use the _safe iterator when adding blend shader calls Otherwise the list 'next' changing will cause the assertion in list_for_each_entry to be hit. This was not hit before because list_assert is defined for debug builds but not debugoptimized. Fixes: `5067a26f44` ("pan/bi: Use flow control lowering on Valhall") Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17371>	2022-07-08 01:30:23 +00:00
Icecream95	dc37ad2a73	panfrost: Enable QUAD_STRIP and POLYGON on v6 I wrote fiction about dreaming that these were supported but after waking finding that they were not. Somehow I came to consider that fiction as fact and I never thought to test if they did work. While reverse engineering the polygon list format, I found that these were supported after all. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17371>	2022-07-08 01:30:23 +00:00
Alyssa Rosenzweig	3d69b65512	panfrost: Fix PIPE_COMPUTE_CAP_SUBGROUP_SIZE Use the new helper to implement the CAP, correctly handling Midgard and Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17265>	2022-07-08 01:14:55 +00:00
Alyssa Rosenzweig	c517ec5939	panfrost: Fix PIPE_COMPUTE_CAP_MAX_COMPUTE_UNITS This maps to CL_DEVICE_MAX_COMPUTE_UNITS, which is defined as: The number of parallel compute cores on the OpenCL device. Since all supported Malis are unified shader cores, the number of parallel compute cores is simply the number of shader cores. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17265>	2022-07-08 01:14:55 +00:00
Alyssa Rosenzweig	fbe430fae9	panfrost: Move bifrost_lanes_per_warp to common Whereas the compiler needs to know the warp size for lowering divergent indirects, the driver needs to know it to report the subgroup size. Move the Bifrost-specific helper to common and add the trivial implementation for Midgard. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17265>	2022-07-08 01:14:55 +00:00
Alyssa Rosenzweig	6f3eea5ddb	panfrost: Separate core ID range from core count To query the core count, the hardware has a SHADERS_PRESENT register containing a mask of shader cores connected. The core count equals the number of 1-bits, regardless of placement. This value is useful for public consumption (like in clinfo). However, internally we are interested in the range of core IDs. We usually query core count to determine how many cores to allocate various per-core buffers for (performance counters, occlusion queries, and the stack). In each case, the hardware writes at the index of its core ID, so we have to allocate enough for entire range of core IDs. If the core mask is discontiguous, this necessarily overallocates. Rename the existing core_count to core_id_range, better reflecting its definition and purpose, and repurpose core_count for the actual core count. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17265>	2022-07-08 01:14:55 +00:00
Alyssa Rosenzweig	5aa740bc8e	pan/bi: Implement f2f16{_rtz, _rtne} Float conversions with explicit rounding modes are required for OpenCL, as well as for Vulkan with the VK_KHR_16bit_storage extension (mandatory in Vulkan 1.1). Since the hardware conversion instructions allow configuring the round mode, this is easy to support :-) Fixes test_half.vstore_half_rtz. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17262>	2022-07-08 00:57:18 +00:00
Alyssa Rosenzweig	5f599fdef6	pan/va: Add missing <roundmode/> to V2F32_TO_V2F16 So we can implement f2f16_rtz. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17262>	2022-07-08 00:57:18 +00:00
Lionel Landwerlin	9680e0e4a2	intel/fs: ray query fix for global address With stages dispatching with a mask, we can run into situations where we don't have the global address in all lanes. The existing code always assumed we had the addres in at least lane0. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `bb40e999d1` ("intel/nir: use a single intel intrinsic to deal with ray traversal") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17330>	2022-07-08 00:36:04 +00:00
Alyssa Rosenzweig	9bd7570e96	pan/bi: Fix unpack_32_2x16 definition This got messed up when scalarizing the IR. Fix the definition of the opcode to return (instead of break, asserting out) and to respect the swizzle (instead of failing validation). Noticed when bringing up OpenCL on Valhall. Fixes: `5febeae58e` ("pan/bi: Emit collect and split") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17222>	2022-07-07 23:16:39 +00:00
Jesse Natalie	6ee9002ff6	dzn: Fix winsys reporting For Windows we don't support using the DISPLAY winsys, and for WSL we should add the ones that support software Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17346>	2022-07-07 23:02:00 +00:00
Rhys Perry	48578713b7	radv,aco,ac/llvm: use nir_op_f{sin,cos}_amd This lets NIR optimize the multiplication, particularly sin/cos(a * #b). fossil-db (Sienna Cichlid): Totals from 12306 (7.58% of 162293) affected shaders: MaxWaves: 224814 -> 224834 (+0.01%) Instrs: 17365273 -> 17338758 (-0.15%); split: -0.16%, +0.00% CodeSize: 93478488 -> 93354912 (-0.13%); split: -0.14%, +0.01% VGPRs: 752080 -> 752072 (-0.00%); split: -0.00%, +0.00% SpillSGPRs: 8440 -> 8410 (-0.36%) Latency: 200402154 -> 200279405 (-0.06%); split: -0.06%, +0.00% InvThroughput: 37588077 -> 37545545 (-0.11%); split: -0.11%, +0.00% VClause: 293863 -> 293874 (+0.00%); split: -0.03%, +0.03% SClause: 619539 -> 619064 (-0.08%); split: -0.09%, +0.01% Copies: 1151591 -> 1151641 (+0.00%); split: -0.04%, +0.05% Branches: 506434 -> 506437 (+0.00%); split: -0.00%, +0.00% PreSGPRs: 877609 -> 877517 (-0.01%); split: -0.01%, +0.00% PreVGPRs: 711938 -> 711940 (+0.00%); split: -0.00%, +0.00% fossil-db (LLVM, Sienna Cichlid): Totals from 4377 (3.59% of 121873) affected shaders: SGPRs: 358960 -> 359176 (+0.06%); split: -0.18%, +0.25% VGPRs: 319832 -> 319720 (-0.04%); split: -0.18%, +0.15% SpillSGPRs: 46983 -> 47007 (+0.05%); split: -0.99%, +1.04% CodeSize: 30872812 -> 30764512 (-0.35%); split: -0.39%, +0.04% MaxWaves: 73814 -> 73904 (+0.12%); split: +0.25%, -0.13% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10587>	2022-07-07 22:18:08 +00:00
Rhys Perry	bb0415b697	nir: allow 16-bit fsin_amd/fcos_amd Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10587>	2022-07-07 22:18:08 +00:00
Rhys Perry	bc1ea2fda9	nir/algebraic: optimize bcsel(c, fsin/cos_amd(a), fsin/cos_amd(b)) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10587>	2022-07-07 22:18:08 +00:00
Rhys Perry	69d21a3dee	nir: rename fsin_r600/fcos_r600 to fsin_amd/fcos_amd GCN has better range, but constant folding is the same. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10587>	2022-07-07 22:18:08 +00:00
Pierre-Eric Pelloux-Prayer	835364ea0c	vulkan/wsi: define pWaitDstStageMask in the blit submission Otherwise we get a crash in vk_common_QueueSubmit when doing: .stageMask = pSubmits[s].pWaitDstStageMask[i], cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6712 Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17310>	2022-07-07 21:49:39 +00:00
Mike Blumenkrantz	3f11ae93be	zink: support PIPE_QUIRK_TEXTURE_BORDER_COLOR_SWIZZLE_FREEDRENO Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17189>	2022-07-07 20:39:31 +00:00
Mike Blumenkrantz	2f46495ac0	zink: don't warn for missing customBorderColorWithoutFormat on turnip Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17189>	2022-07-07 20:39:31 +00:00
Mike Blumenkrantz	6872adda26	zink: disable customBorderColorWithoutFormat on turnip this should "just work" now Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17189>	2022-07-07 20:39:30 +00:00

1 2 3 4 5 ...

156376 Commits All Branches Search

156376 Commits

All Branches