mirrors/mesa - Frog Git

Commit Graph

Author	SHA1	Message	Date
Jesse Natalie	33051f1eb4	dzn: Early-out on no-op barriers Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22346>	2023-04-07 19:11:11 +00:00
Mike Blumenkrantz	472fcf74e2	zink: don't trigger shader variants on pcp change if driver supports dynamic pcp this otherwise pointlessly creates and binds shader variants that do nothing Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22365>	2023-04-07 18:32:34 +00:00
Mike Blumenkrantz	172054e305	zink: reuse copy_vars for generated tcs Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22364>	2023-04-07 17:44:29 +00:00
Mike Blumenkrantz	762a29279b	zink: reuse d3d12 variable copying to make passthrough gs more robust Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22364>	2023-04-07 17:44:29 +00:00
Felix DeGrood	4dc7256bf9	anv: reset query pools using blorp Previously we used PC to set query data to 0 during CmdResetQueryPool. This was slow when clearing large query pools. Switching to blorp to clear pools is faster for large query pools. Red Dead Redemption 2: +1.5% speedup Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22178>	2023-04-07 15:51:20 +00:00
Lionel Landwerlin	bb49610973	anv: replace query flush before gpu copy by semaphore wait All the flushes should already have happened, we just need CS to wait for the operations to complete. Just use a MI_SEMAPHORE_WAIT to check the availability bit is set. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22178>	2023-04-07 15:51:20 +00:00
Lionel Landwerlin	abc4111d19	anv: pass steam output as argument for anv_dump_pipe_bits Just if you need to change it at some point ;) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22178>	2023-04-07 15:51:20 +00:00
Felix DeGrood	2415d57a99	anv/blorp: add flush reasons to RT flushes Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22178>	2023-04-07 15:51:20 +00:00
Felix DeGrood	43f93f5043	anv/blorp: implement anv_cmd_buffer_fill_area Implemented function to fill an area at an address. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22178>	2023-04-07 15:51:20 +00:00
Felix DeGrood	0130a4f667	anv/blorp: support surf generation for addresses Already have support for anv_buff. Extended to support addresses. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22178>	2023-04-07 15:51:20 +00:00
Raun	9d38c9ca2f	dzn: Enable VK_KHR_get_memory_requirements2 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22349>	2023-04-07 15:35:10 +00:00
Raun	a9a0dc3cca	dzn: Enable VK_KHR_bind_memory2 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22349>	2023-04-07 15:35:10 +00:00
Samuel Pitoiset	bcd33d2937	radv: import retained NIR shaders later in the compilation process This allows us to remove the intermediate NIR shader pointer. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22356>	2023-04-07 14:38:46 +00:00
Samuel Pitoiset	e909764930	radv: do not retain noop FS for libs when a cache hit happened Determine if the graphics pipeline needs a noop FS later instead of retaining it. This was also suboptimal. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22356>	2023-04-07 14:38:46 +00:00
Samuel Pitoiset	34fa60e138	radv: simplify a check when retaining NIR shaders The RETAIN flag is only allowed with graphics libs. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22356>	2023-04-07 14:38:46 +00:00
Samuel Pitoiset	3b5ea90f1d	radv: move the serialized NIR to radv_graphics_lib_pipeline Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22356>	2023-04-07 14:38:46 +00:00
Samuel Pitoiset	4672c6c43b	radv: add a helper for retaining NIR shaders Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22356>	2023-04-07 14:38:46 +00:00
Mike Blumenkrantz	dc18570c0a	zink: don't access non_fs part of zink_shader from fs Fixes: `a6de15eff5` ("zink: add flags to `zink_gfx_program` and `zink_context`") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22347>	2023-04-07 13:10:03 +00:00
Mike Blumenkrantz	215beee16d	zink: more explicitly track/check rp optimizing per-context if tc creation fails for whatever reason, rp optimizing must be marked as disabled for that context to avoid erroneous assumptions about rp operation fixes #8787 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22319>	2023-04-07 12:29:56 +00:00
Qiang Yu	2c78cbbfe1	ac/llvm: remove some unused code replaced by nir Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22304>	2023-04-07 03:42:25 +00:00
Qiang Yu	a2cecbbc44	ac/nir/ngg: fix store shared alignment For stream!=0, this align_mul=4 is not true. Not observe any problem yet, just for correctness. Fixes: `60ac5dda82` ("ac: Add NIR lowering for NGG GS.") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22304>	2023-04-07 03:42:25 +00:00
Qiang Yu	c082cdacae	ac/nir/ngg: fix gs culling vertex liveness check for odd vertices If vertex does not complete a primitive, it should not set the odd flag which miss lead liveness check when culling is enabled. For example, if odd flag is set regardless of complete flag, when culling is enabled, 3 vertices of a triangle's init prim flag: [0x00 0x04 0x01] then after culling, this triangle has been culled, their prim flag: [0x00 0x04 0x00] the second vertex is miss treat as live because its odd flag (code check prim_flag!=0 for liveness). Fixes: `1bdeb961bd` ("ac/nir/ngg: add gs culling") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8725 Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22304>	2023-04-07 03:42:25 +00:00
Qiang Yu	fc3d8e1125	radeonsi: fix max scrach lds size calculation when ngg Fixes: `028d0590f8` ("radeonsi: replace llvm ngg vs/tes with nir lowering") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22304>	2023-04-07 03:42:25 +00:00
Asahi Lina	9fcadd0c8d	asahi: Allow explicit non-LINEAR modifiers for scanout The compositor is responsible for picking the right supported modifiers for scanout. If we get no modifiers, we have to assume linear, but if we do, just roll with it and don't attempt to force things. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:04 +00:00
Asahi Lina	534a04d557	asahi: Flip kmsro around to allocate on the GPU Our display controller can handle arbitrary GPU imports, so there is no reason to use dumb KMS buffers. Allocate everything on the GPU instead. This also allows us to be lazy about mapping things to the KMS side, so only clients that really want a KMS handle actually do that, which stops us from ending up with a bunch of junk mapped to DCP (e.g. X11 clients always request SCANOUT even under XWayland). Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:04 +00:00
Asahi Lina	9db36376a6	asahi: Fix compressed ZS support Depth/stencil formats are "not renderable" but do support compression. I swear I already fixed this at some point and the commit must've fallen through the cracks... Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:04 +00:00
Asahi Lina	2296f69629	asahi: Print reasons why compression is disabled For resource debug. Found a regression in compressed depth this way. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:04 +00:00
Asahi Lina	888d443f29	asahi: Add resource debugging I keep re-implementing this every time I look at resource-related issues. Let's just make it official so we can turn it on with a flag instead of having to add printfs every time ^^ Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:04 +00:00
Asahi Lina	0a132b0640	asahi: Add a helper macro for debug/error messages This includes the program short name in the message, which is useful when running entire desktop sessions with a single log to figure out who is doing what. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:04 +00:00
Asahi Lina	883ba4b161	asahi: Make BO import path failures more robust These operations can fail for complex reasons through no fault of mesa, so we should have proper runtime checks for them even in release builds. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:04 +00:00
Asahi Lina	fcf594d00b	asahi: Implement valid buffer range tracking A common pattern is to allocate a vertex/etc buffer and write to it in subsets. Some games interleave this with draw calls using the buffer. This causes very expensive flushing for every draw call. Fix this by tracking which range of a buffer has been written to, and elide syncs when the range was previously uninitialized. Fixes Source engine game performance and probably helps a bunch of others. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:04 +00:00
Asahi Lina	00064ba4e3	asahi: Fix style nits Found with a grep abomination which is probably too broken/silly to actually implement in CI... but hey, at least it found some. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:04 +00:00
Asahi Lina	a88b9c5540	asahi: Locate low VA BOs correctly These need the shader_base added to them. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:04 +00:00
Asahi Lina	030b2306a4	asahi: Enable glthread This helps a lot with FEX, since the GPU driver runs emulated (and only 64bit supports thunking). Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:04 +00:00
Asahi Lina	4a5115c47b	asahi: Make agx_alloc_staging() take a screen instead of a context This makes it clear that it is thread-safe. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:04 +00:00
Asahi Lina	75e3212809	Revert "asahi: Advertise dual-source blending" This reverts commit `f4e2b22646`. This is broken until GL3 is enabled, possibly due to a core Mesa bug, but it's a corner case not worth fixing. Fixes Chromium. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:04 +00:00
Alyssa Rosenzweig	8a6d74d15b	agx: Make signal_pix instructions explicit Rather than implicitly packing them with the sample_mask. Again, this is just changing where they're emitted, no functional changes yet. Bug for bug compatibility with the old behaviour. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:04 +00:00
Alyssa Rosenzweig	bb530760a2	agx: Rename writeout to wait_pix This is the name applegpu is currently using, to capture the semantics of a pixel fence. I'm not sure what Apple calls this but wait_pix is closer than writeout for sure. This commit just does the rename. It doesn't fix the broken semantics we've had, this is to ease review and bisection. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:04 +00:00
Alyssa Rosenzweig	2028e7b88b	agx: Tease apart some sample_mask packing magic There's a second instruction here, and a second source in the first instruction. applegpu has known about the encodings for a while but I never updated the packing code. We will need to stop hardcoding this for multisampling support, as preparation tease apart the magic pieces. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:04 +00:00
Alyssa Rosenzweig	13b3da822b	asahi: Clamp texture buffer sizes Per the spec / freedreno. Fixes arb_texture_buffer_object-texture-buffer-size-clamp Fixes: `6b22a02f90` ("asahi,agx: Implement buffer textures with gnarly NIR") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:04 +00:00
Alyssa Rosenzweig	c4175c5fc8	asahi: Dirty track depth bias uploads Reduces how much we upload in SuperTuxKart. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:04 +00:00
Alyssa Rosenzweig	23880daa8d	asahi: Lower 1D to 2D Khronos APIs require that we support mipmapping even for 1D textures. However, it isn't clear if this is supported in the hardware, and how it would work even if it is. But 1D textures are pretty useless, so we just lower 1D textures to 2D textures instead of worrying about that. Fixes piles of Piglits relating to 1D textures. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:03 +00:00
Alyssa Rosenzweig	098295f1a0	asahi: Implement null textures Use the same silly workaround that Metal does, to fill in texture descriptors when there's nothing bound in the interest of robust behaviour. Fixes null pointer dereference in arb_shading_language_420pack-active-sampler-conflict. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:03 +00:00
Alyssa Rosenzweig	1fb4e34020	asahi: Honour sampler count It may not be equal to the texture count. Prevents a regression from the next commit. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:03 +00:00
Alyssa Rosenzweig	203c9c12e2	agx: Don't overallocate registers We need to account for the full vector lengths. Especially important once we start restricting the reg file. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:03 +00:00
Alyssa Rosenzweig	42c5d6140b	agx: Coalesce more collects Try harder to coalesce collects, by trying to allocate collects only to regions of the register file where we actually have a full vector worth of registers free. If we already know that the vector will be blocked later, it's not a good base register to pick since we'd be force to shuffle later. So, this tweak to the collect coalescing heuristic lets us eliminate a pile of pointless copying. shader-db results are excellent. Note that, although we use more registers, none of the shaders tested had their thread count affected, likely because the max HURT isn't too high and most of the scary % here is from using a few more registers when the register pressure is already low. In the near future, that property will become guaranteed thanks to live range splitting, too. total instructions in shared programs: `1507337` -> 1500562 (-0.45%) instructions in affected programs: 428137 -> 421362 (-1.58%) helped: 2658 HURT: 167 helped stats (abs) min: 1.0 max: 34.0 x̄: 2.63 x̃: 2 helped stats (rel) min: 0.10% max: 25.00% x̄: 3.04% x̃: 2.14% HURT stats (abs) min: 1.0 max: 10.0 x̄: 1.24 x̃: 1 HURT stats (rel) min: 0.20% max: 23.81% x̄: 3.90% x̃: 3.57% 95% mean confidence interval for instructions value: -2.49 -2.31 95% mean confidence interval for instructions %-change: -2.76% -2.51% Instructions are helped. total bytes in shared programs: 10333670 -> 10293172 (-0.39%) bytes in affected programs: 2996682 -> 2956184 (-1.35%) helped: 2660 HURT: 175 helped stats (abs) min: 2.0 max: 204.0 x̄: 15.70 x̃: 12 helped stats (rel) min: 0.08% max: 23.08% x̄: 2.64% x̃: 1.83% HURT stats (abs) min: 2.0 max: 60.0 x̄: 7.26 x̃: 6 HURT stats (rel) min: 0.12% max: 22.39% x̄: 3.19% x̃: 2.78% 95% mean confidence interval for bytes value: -14.81 -13.76 95% mean confidence interval for bytes %-change: -2.39% -2.18% Bytes are helped. total halfregs in shared programs: 417284 -> 427363 (2.42%) halfregs in affected programs: 49814 -> 59893 (20.23%) helped: 95 HURT: 3018 helped stats (abs) min: 1.0 max: 8.0 x̄: 2.29 x̃: 2 helped stats (rel) min: 2.44% max: 28.57% x̄: 9.20% x̃: 6.06% HURT stats (abs) min: 1.0 max: 14.0 x̄: 3.41 x̃: 4 HURT stats (rel) min: 2.08% max: 150.00% x̄: 36.54% x̃: 27.27% 95% mean confidence interval for halfregs value: 3.17 3.31 95% mean confidence interval for halfregs %-change: 34.05% 36.23% Halfregs are HURT. total threads in shared programs: 16465280 -> 16465280 (0.00%) threads in affected programs: 0 -> 0 helped: 0 HURT: 0 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:03 +00:00
Alyssa Rosenzweig	43b221cd59	asahi: Set PIPE_CAP_LOAD_CONSTBUF The CAP is a bit of a misnomer, what it really does is relax the alignment requirements for UBO packing. It should work fine and save us some memory. Noticed while debugging piglit fails. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:03 +00:00
Alyssa Rosenzweig	8e501b758a	asahi/decode: Print VDM barriers Instead of just decoding silently. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:03 +00:00
Alyssa Rosenzweig	0bbd8b502a	asahi/decode: Remove agxdecode_dump_bo Now that we have proper parsing this is more of a nuissance than not. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:03 +00:00
Alyssa Rosenzweig	e713983875	agx: Add helper for calculating occupancy Add information about the relationship between program register usage and program occupancy (the maximum number of threads that may execute concurrently on a single shader core). This table is derived from studying the maxTotalThreadsPerThreadgroup property in Metal while varying the register usage, something I blogged about a few years back. It's probably not 100% accurate and it hasn't been tested against hardware, but it matters "only" for performance (not correctness) so I'm not super stressed about the details. In the (near) future, RA will be able to make use of this information to know exactly when it can use more registers without hurting performance. In the present, it's just used for better shader-db statistics. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22353>	2023-04-07 03:23:03 +00:00

... 3 4 5 6 7 ...

169743 Commits All Branches Search

169743 Commits

All Branches