mirrors/mesa - Frog Git

Commit Graph

Author	SHA1	Message	Date
Alyssa Rosenzweig	720ff76de4	asahi: Implement invalidate_resource From Panfrost. This lets us avoid storing depth/stencil attachments at the end of the frame in GLES. On my 4K monitor, glmark2 -btexture at fullscreen goes from 705fps to 1150fps. I assume gains on real workloads will be smaller. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:50:46 -05:00
Alyssa Rosenzweig	28b652af80	asahi: Track batch masks on ZS/blend CSO Adapted from panfrost, with the work happening at CSO create time instead of draw time allowing us to do more sophisticated analysis. We'll use these for accurate masks in a moment. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:50:45 -05:00
Alyssa Rosenzweig	33b1876857	asahi: Dirty track blend state We'll want this to reduce variant lookups eventually. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:50:45 -05:00
Alyssa Rosenzweig	29e6c00e3c	asahi: Enable dirty tracking Whoops. drawoverhead test 1 score from 496 -> 2377. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:50:45 -05:00
Alyssa Rosenzweig	b28fe26d7c	ail: Save level_offsets_compressed_B So we can bind specific mip levels for rendering into compressed Z/S. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:50:45 -05:00
Aleksey Komarov	3895545b83	panfrost: implement clear_depth_stencil Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20238>	2022-12-10 10:56:09 +00:00
David Heidelberg	b19a14a094	nine: enable on panfrost Also, enable required kmsro dependencies. Tested-by: Aleksey Komarov <q4arus@ya.ru> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20238>	2022-12-10 10:56:09 +00:00
David Heidelberg	be841f0e78	panfrost: implement clear_render_target Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Co-authored-by: Aleksey Komarov <q4arus@ya.ru> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Tested-by: Aleksey Komarov <q4arus@ya.ru> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20238>	2022-12-10 10:56:09 +00:00
David Heidelberg	8560c7613d	panfrost: Handle resources without depth in batch_to_fb_info Prevent preloading data from resources which don't exist. Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Tested-by: Aleksey Komarov <q4arus@ya.ru> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20238>	2022-12-10 10:56:09 +00:00
David Heidelberg	d76d791565	panfrost: Implement GL_EXT_clip_control Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Co-authored-by: Aleksey Komarov <q4arus@ya.ru> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Tested-by: Aleksey Komarov <q4arus@ya.ru> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20238>	2022-12-10 10:56:09 +00:00
Paulo Zanoni	a099d6ae4d	intel: add devinfo->has_64bit_float_via_math_pipe Unusual hardware features that require special handling usually get a devinfo field, so do this for MTL's unordered DF types. This will guarantee that any platform based on MTL (thus inheriting from MTL_FEATURES) will automatically be handled in these special cases. v2: s/has_unordered_64bit_float/has_64bit_float_via_math_pipe/ (Curro). Reviewed-by: Francisco Jerez <currojerez@riseup.net> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20072>	2022-12-10 03:59:19 +00:00
Paulo Zanoni	eac00f4ec7	intel/compiler: fix intel_swsb_decode for newer platforms In the previous patch we adjusted the scoreboard pass to take into consideration a new case of unordered operations for TGL. Fix the decoding as well. v2: use intel_device_info_is_mtl() (Curro, Jordan) v3: the part where we export num_sources_from_inst() is now a separate patch (Curro). v4: Work around false positive maybe-uninitialized warning since Marge uses -Werror=maybe-uninitialized (Marge). Reviewed-by: Francisco Jerez <currojerez@riseup.net> (v3) Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20072>	2022-12-10 03:59:19 +00:00
Paulo Zanoni	295c5f59e0	intel/compiler: export brw_num_sources_from_inst We want to call this from brw_disasm.c, so move it out to brw_eu.c since it's about to become more of a shared utility function than something specific to the EU validator. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20072>	2022-12-10 03:59:19 +00:00
Paulo Zanoni	df50add27e	intel/compiler: avoid 64bit SEL_EXEC on MTL On MTL, instructions with DF type are unordered, executed in the math pipe. This means that they require different SWSB dependency handling, and also that in some cases such as MOVs it's generally faster to simply use 2 smaller ordered moves than a single unordered MOV. One problem we have with the current code is that generate_code() is not setting the proper SWSB dependencies for the generated DF MOVs, causing some tests to fail. One solution would be to fix generate_code() by making it set the appropriate dependencies. This was the first patch I wrote. Another solution to this problem, pointed to us by Curro, is to change required_exec_type() so we use UD instructions instead of DF, just like we do with platforms that don't have 64 bit instructions, which means there won't be anything to fix in generate_code(). The second solution is what this patch implements. This fixes at least: - dEQP-VK.subgroups.arithmetic.framebuffer.subgroupmin_double_vertex Thanks to Francisco Jerez for all the major help provided with this problem. Credits-to: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20072>	2022-12-10 03:59:19 +00:00
Paulo Zanoni	951855c349	intel/compiler: avoid (RegDist, SBID) on DF instructions on MTL When we use this form there's no way to specify which pipe RegDist refers to, so there are a few rules to figure this out, which is what inferred_sync_pipe() implements. But for MTL there's no long pipe and the documentation does not explicitly explain what should be the inferred type for its long (DF) instructions - which are out-of-order, by the way. One way to interpret this is that such case should be avoided. So add the extra check to entirely avoid this case. Notice that this is not actually fixing any bug, since returning TGL_PIPE_LONG (what we do today) will actually make these DF instructions incompatible with every in-order instruction, so we'll never opt to use the (RegDist, SBID) form anyway. But still, it's better to have this case explicitly documented instead of having it covered by a semi coincidence. v2: use intel_device_info_is_mtl() (Curro, Jordan) Reviewed-by: Francisco Jerez <currojerez@riseup.net> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20072>	2022-12-10 03:59:19 +00:00
Paulo Zanoni	16b9f87104	intel/compiler: on MTL, DF instructions run in the math pipe Adjust the scoreboard code to take that into account. Fixes at least: - dEQP-VK.glsl.builtin.precision_double.refract.compute.vec3 - dEQP-VK.glsl.builtin.precision_double.matrixcompmult.compute.mat4 v2: use intel_device_info_is_mtl() (Curro, Jordan) Reviewed-by: Francisco Jerez <currojerez@riseup.net> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20072>	2022-12-10 03:59:19 +00:00
Francisco Jerez	051887fbf3	intel/fs: Make the result of is_unordered() dependent on devinfo. Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20072>	2022-12-10 03:59:19 +00:00
Lionel Landwerlin	d608706875	Revert "anv: compile anv_acceleration_structure.c" This reverts commit `74d0be27ae`. Also remove anv_acceleration_structure.c, it was meant to be removed earlier. There was probably a rebase issue somewhere. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20248>	2022-12-10 01:16:16 +00:00
Chia-I Wu	d217883c5c	freedreno/a6xx: fix blend all_mrt_write_mask Fix all_mrt_write_mask when independent_blend_enable is false. Otherwise, lrz write is always disabled with MRT when independent_blend_enable is false. This fixes a 2% perf regression for multiple gfxbench benchmarks. Fixes: `0132c22de7` ("freedreno/a6xx: Don't disable LRZ for invalid channels") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20254>	2022-12-09 22:21:19 +00:00
Kenneth Graunke	bec68a85a2	iris: Improve direct CPU map heuristics We were promoting reads with a valid primary to direct CPU maps even if the mmap mode was IRIS_MMAP_WC, which would mean uncached reads from VRAM. In that case, GPU blits are in fact useful! We were also only checking for !DISCARD_RANGE rather than MAP_READ, which isn't a great idea for image maps, given the discussion in the previous commit about image map semantics. The original code was also just confusingly structured. Make a helper function with clearly defined cases where we want to bail on CPU maps. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19209>	2022-12-09 21:46:03 +00:00
Kenneth Graunke	eafaac2b1e	iris: Only copy existing data into staging images with PIPE_MAP_READ When performing transfer maps on images that require staging buffers (say, for presenting a linear view of tiled memory), we were reading the existing contents of the buffer into the staging resource on map unless PIPE_MAP_DISCARD_RANGE was set. The thinking was to support partial writes. If you map a subrectangle of an image, but then only write selective pixels - should it preserve the existing contents of the mapped region? I believed that it should, unless you pass PIPE_MAP_DISCARD_RANGE to explicitly say that it's okay to invalidate the destination region. However, that does not appear to be the interpretation favored by other Mesa developers (in particular Michel Dänzer and Marek Olšák). The radeonsi driver does not do this readback from the destination region to the staging buffer unless you pass PIPE_MAP_READ. If you want to do a partial write and preserve contents, you need to pass both flags: (PIPE_MAP_READ \| PIPE_MAP_WRITE). Passing READ is expected to come with an associated cost. OpenGL defines GL_MAP_INVALIDATE_RANGE_BIT for mapping buffer objects, which is translated to PIPE_MAP_DISCARD_RANGE. However, unextended OpenGL doesn't define mapping textures. There are two main sources of image maps: our internal MapTextureImage() hook, and gbm_bo_map(). I've audited our internal MapTextureImage() calls, and while some do pass PIPE_MAP_DISCARD_RANGE, almost all of them wholly overwrite the mapped region, and those that care about combining with existing image contents all pass PIPE_MAP_READ. So this should work there. GBM defines three flags: GBM_BO_TRANSFER_READ, WRITE, and READ_WRITE. There is no defined "invalidate range" bit. In issue #6020, Matthias Treydte notes that this extra readback can cause performance problems, and with iris's current interpretation, there's no way to avoid it. 
During that discussion, Michel and Matthias both argued that GBM_BO_TRANSFER_WRITE should invalidate the destination contents and avoid the readback, while GBM_BO_TRANSFER_READ_WRITE would preserve it. This patch makes iris follow that model for image mappings, removing readback on staging maps for both detiling and stall avoidance, unless PIPE_MAP_READ is passed. I believe we can change this with impunity. For buffer objects, Ian Romanick and I both agree that partial writes should be supported, and GL_MAP_INVALIDATE_RANGE_BIT exists precisely to indicate that you shouldn't spend effort preserving existing contents. So we continue doing readback for buffers unless PIPE_MAP_DISCARD_RANGE is flagged, for now. While I think this works, it also seems to be undertested in the CTS and Piglit. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6020 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19209>	2022-12-09 21:46:03 +00:00
Kenneth Graunke	50614d39fe	iris: Return idle status from iris_invalidate_buffer, skip busy checks If we successfully replace the backing storage for a buffer, we know that it's idle, and the transfer map code can mark it unsynchronized right away, letting us skip redundant resource_is_busy() checks. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19209>	2022-12-09 21:46:03 +00:00
Kenneth Graunke	f112add554	iris: Don't replace backing storage for exported buffers. We already gave out the old BO...or acquired it from somewhere which may be affecting it. We simply can't replace the backing store. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19209>	2022-12-09 21:46:03 +00:00
Kenneth Graunke	6954a8ddbe	iris: Promote DISCARD_RANGE to DISCARD_WHOLE_RESOURCE where possible This allows us to replace the backing storage for a buffer, which means we'd have an idle buffer and thus could do an unsynchronized mapping where we otherwise wouldn't. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19209>	2022-12-09 21:46:03 +00:00
Kenneth Graunke	465eb092ed	iris: Use persistent mappings for pinned memory (userptr) This is a port of Nicolai's `b52721e3b6` from radeonsi. Because GL_AMD_pinned_memory guarantees that mappings will refer to the same underlying page, we need to avoid using staging maps. Using a persistent map is a reasonable way to accomplish this. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19209>	2022-12-09 21:46:03 +00:00
Kenneth Graunke	b82d545442	iris: Delete map->dest_had_defined_contents Dead since commit `6cc09699cd`. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19209>	2022-12-09 21:46:03 +00:00
Rhys Perry	907fbf22dd	nir/gather_info: use nir_ssa_scalar_resolved This lets us skip copies. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19597>	2022-12-09 20:56:52 +00:00
Rhys Perry	085828ea4d	vtn: add mesh output and task_payload to vtn_mode_is_cross_invocation This fixes a potential race condition, and removes output loads (which should not exist in the EXT_mesh_shader). Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7391 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19597>	2022-12-09 20:56:52 +00:00
Rhys Perry	e1f5100311	nir: add task_payload and shader_out to nir_var_vec_indexable_modes Since these can be cross-invocation, we need this to write individual components without race conditions or loads. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7391 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19597>	2022-12-09 20:56:52 +00:00
Rhys Perry	a89755d179	radv: fix task payload lowering when shared_memory_explicit_layout=true If shared_memory_explicit_layout=true, we would have skipped lowering task payload variables to explicit types. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19597>	2022-12-09 20:56:52 +00:00
Rhys Perry	e4060752e2	radv: fix mesh shaders with null winsys Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19597>	2022-12-09 20:56:52 +00:00
Rebecca Mckeever	f381187b8f	panvk: Delete panvk_CmdSetDeviceMask, panvk_GetDeviceGroupPeerMemoryFeatures Delete panvk_CmdSetDeviceMask and panvk_GetDeviceGroupPeerMemoryFeatures so that the vk_common_* version will be used instead. This will avoid repeated code. Signed-off-by: Rebecca Mckeever <rebecca.mckeever@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20218>	2022-12-09 14:08:14 -06:00
Rebecca Mckeever	aa76b70751	hasvk: Delete VK_KHR_device_group provided entrypoints Delete anv_CmdDispatch, anv_CmdSetDeviceMask, and anv_GetDeviceGroupPeerMemoryFeatures so that the vk_common_* versions will be used instead. This will avoid repeated code. Signed-off-by: Rebecca Mckeever <rebecca.mckeever@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20218>	2022-12-09 14:07:59 -06:00
Rebecca Mckeever	43f9c66224	anv: Delete VK_KHR_device_group provided entrypoints Delete anv_CmdDispatch, anv_CmdSetDeviceMask, and anv_GetDeviceGroupPeerMemoryFeatures so that the vk_common_* versions will be used instead. This will avoid repeated code. Signed-off-by: Rebecca Mckeever <rebecca.mckeever@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20218>	2022-12-09 14:07:48 -06:00
Rebecca Mckeever	159cf9122e	tu: Delete VK_KHR_device_group provided entrypoints Delete tu_CmdDispatch, tu_CmdSetDeviceMask, and tu_GetDeviceGroupPeerMemoryFeatures so that the vk_common_* versions will be used instead. This will avoid repeated code. Signed-off-by: Rebecca Mckeever <rebecca.mckeever@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20218>	2022-12-09 14:07:33 -06:00
Rebecca Mckeever	6b1e2e9eb6	v3dv: Delete VK_KHR_device_group provided entrypoints Delete v3dv_CmdDispatch, v3dv_CmdSetDeviceMask, and v3dv_GetDeviceGroupPeerMemoryFeatures so that the vk_common_* versions will be used instead. This will avoid repeated code. Signed-off-by: Rebecca Mckeever <rebecca.mckeever@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20218>	2022-12-09 14:07:17 -06:00
Rebecca Mckeever	64d7385e61	radv: Delete VK_KHR_device_group provided entrypoints Delete radv_CmdDispatch, radv_CmdSetDeviceMask, and radv_GetDeviceGroupPeerMemoryFeatures so that the vk_common_* versions will be used instead. This will avoid repeated code. Signed-off-by: Rebecca Mckeever <rebecca.mckeever@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20218>	2022-12-09 14:07:00 -06:00
Rebecca Mckeever	83400af043	vulkan/runtime: Add VK_KHR_device_group provided entrypoints Add entrypoints vk_common_CmdDispatch, vk_common_CmdSetDeviceMask, and vk_common_GetDeviceGroupPeerMemoryFeatures in Mesa Vulkan runtime so that they are available to all drivers. Signed-off-by: Rebecca Mckeever <rebecca.mckeever@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20218>	2022-12-09 14:06:14 -06:00
Thong Thai	2d4a36ce64	gallium: add new variable for video frame statistics Video encoder previously reuses the associated_data variable to output encoding statistics, but it ended up breaking when transcoding. This commit adds a new variable just for statistics. Fixes: `2d1bd619df` ("frontends/va: add ability for encoder to output statistics") Signed-off-by: Thong Thai <thong.thai@amd.com> Reviewed-by: Boyuan Zhang <Boyuan.Zhang@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20190>	2022-12-09 13:37:00 -05:00
Rhys Perry	c872e339a1	radv: remove some unnecessary 64-bit IO handling nir_lower_io() lowers these to 32-bit. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20137>	2022-12-09 17:30:24 +00:00
Rhys Perry	6a5b615ab1	radv: fix streamout with different streams in the same varying slot Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20137>	2022-12-09 17:30:24 +00:00
Rhys Perry	20e670d060	aco/ra: don't swap create_vector operand with definition blocker for SGPRs There is no SGPR swap instruction, we always need 3 XORs. fossil-db (navi21): Totals from 76 (0.06% of 135636) affected shaders: Instrs: 58400 -> 58347 (-0.09%); split: -0.10%, +0.01% CodeSize: 312580 -> 312368 (-0.07%); split: -0.08%, +0.01% Latency: 843333 -> 843180 (-0.02%); split: -0.02%, +0.00% InvThroughput: 126431 -> 126412 (-0.02%) Copies: 4008 -> 3955 (-1.32%); split: -1.47%, +0.15% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20240>	2022-12-09 15:58:43 +00:00
Rhys Perry	a05dd58309	aco/ra: don't swap p_create_vector operand with definition blocker for scc SCC is 1-bit, and we can't copy a 32-bit value into it. Fixes dEQP-VK.spirv_assembly.type.scalar.i32.iequal_tesse with ACO_DEBUG=noopt. No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Fixes: `9476986e6f` ("aco/ra: special-case get_reg_for_create_vector_copy()") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20240>	2022-12-09 15:58:43 +00:00
Georg Lehmann	4dff3ff005	nir/opt_algebraic: Optimize open coded bfm. Foz-DB Navi21: Totals from 1553 (1.15% of 134913) affected shaders: SpillVGPRs: 2246 -> 2223 (-1.02%); split: -1.42%, +0.40% CodeSize: 10409156 -> 10410720 (+0.02%); split: -0.03%, +0.04% Instrs: 1899725 -> 1898773 (-0.05%); split: -0.07%, +0.02% Latency: 71225814 -> 71118314 (-0.15%); split: -0.21%, +0.06% InvThroughput: 13384926 -> 13330369 (-0.41%); split: -0.47%, +0.06% VClause: 38309 -> 38284 (-0.07%); split: -0.17%, +0.11% SClause: 70743 -> 70706 (-0.05%) Copies: 167296 -> 167230 (-0.04%); split: -0.28%, +0.24% Branches: 42446 -> 42444 (-0.00%); split: -0.01%, +0.00% PreVGPRs: 95191 -> 95188 (-0.00%) Some minor instructions count regressions in parallel-rdp because v_bfm_b32 can't use SDWA, but overall an improvement. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18887>	2022-12-09 14:59:16 +00:00
Ruijing Dong	a73e86e0a5	frontends/va: fix gst videotestsrc h264 enc fail issue. problem: when doing "gst-launch-1.0 -v videotestsrc num-buffers=10 ! vaapih264enc ! fakesink" the command fails because gst fetches the first available supported format in the list, which became P010_LE due to the commit [`0b02db3007`] frontends/va: fixed av1 decoding 10bit ffmpeg output YUV issue. fix: move the P010_LE code block to the end of the function, so the sequence of the supported formats is restored to its original order. cc: mesa-stable Reviewed-by: Boyuan Zhang <Boyuan.Zhang@amd.com> Signed-off-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20242>	2022-12-09 09:31:11 -05:00
Yonggang Luo	ee10a5f7a6	frontend/osmesa: inherit pipe_frontend_drawable instead of allocating separately This is required by st/mesa now. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20027>	2022-12-09 13:14:03 +00:00
Yonggang Luo	5be128f67d	frontend/hgl: inherit pipe_frontend_drawable instead of allocating separately This is required by st/mesa now. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20027>	2022-12-09 13:14:03 +00:00
Marek Olšák	3ba24ad153	gallium: rename st_framebuffer_iface -> pipe_frontend_drawable, etc. Also rename: iface -> drawable stfb -> drawable (where it means dri_drawable and not st_framebuffer) stfbi -> drawable or pdrawable (if drawable exists) pipe_frontend_drawable* is really just dri_drawable* for DRI, and WGL/GLX have their own variants. This makes it easier to understand what kind of object is being used. I always wondered what st_framebuffer_iface, iface, stfbi, iface_stamp, and iface_ID actually mean. Now those terms are gone forever. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20027>	2022-12-09 13:14:03 +00:00
Marek Olšák	279dfeff1d	gallium: remove pipe_frontend_screen::destroy callback, call it directly This is the only one implemented by mesa/state_tracker. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20027>	2022-12-09 13:14:03 +00:00
Marek Olšák	ab7a86a0ee	gallium: clean up comments in api.h, cosmetic changes Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20027>	2022-12-09 13:14:03 +00:00

164321 Commits (All Branches)
