KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Rob Clark	1fdddb1424	freedreno/ir3: Add copy_vars() helper Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17341>	2022-07-08 20:32:35 +00:00
Rob Clark	5434de7ab6	freedreno/ir3: Don't lower_gs multiple times At least with gallium, this can be called multiple times via pipe_screen::finalize_nir(). But it is not designed to be called multiple times, and can result in vertex_flags getting 'optimized' away. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6720 Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17341>	2022-07-08 20:32:35 +00:00
Rob Clark	62c5d428bc	turnip: assert valid vertex_flag reg If this somehow gets optimized out, the GS will run forever. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17341>	2022-07-08 20:32:35 +00:00
Rob Clark	e16c46c6a8	freedreno/a6xx: assert valid vertex_flags reg If this somehow gets optimized out, the GS will run forever. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17341>	2022-07-08 20:32:35 +00:00
Ian Romanick	bbcb881f46	intel/fs: Remove non-_LOGICAL URB messages The _LOGICAL versions are lowered direct to SEND, so nothing can ever generate these messages. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17379>	2022-07-08 19:45:34 +00:00
Ian Romanick	bdc7668008	intel/fs: Lower URB messages to SEND Before rebasing on top of Ken's split-SEND optimization (see !17018), this commit just caused some scheduling changes in various tessellation and geometry shaders. These changes were caused by the addition of real latency information for the URB messages. With the addition of the split-SEND optimization, the changes are... staggering. All of the shaders helped for spills and fills are vertex shaders from Batman Arkham Origins. What surprises me is that these shaders account for such a high percentage of the spills and fills in fossil-db. 85%?!? v2: Use FIXED_GRF instead of BRW_GENERAL_REGISTER_FILE in an assertion. Suggested by Ken. Tiger Lake, Ice Lake, and Skylake had similar results. (Ice Lake shown) total instructions in shared programs: 20013625 -> 19954020 (-0.30%) instructions in affected programs: 4007157 -> 3947552 (-1.49%) helped: 31161 HURT: 0 helped stats (abs) min: 1 max: 400 x̄: 1.91 x̃: 2 helped stats (rel) min: 0.08% max: 59.70% x̄: 2.20% x̃: 1.83% 95% mean confidence interval for instructions value: -1.97 -1.86 95% mean confidence interval for instructions %-change: -2.22% -2.18% Instructions are helped. total cycles in shared programs: 859337569 -> 858636788 (-0.08%) cycles in affected programs: 74168298 -> 73467517 (-0.94%) helped: 13812 HURT: 16846 helped stats (abs) min: 1 max: 291078 x̄: 82.83 x̃: 4 helped stats (rel) min: <.01% max: 37.09% x̄: 3.47% x̃: 2.02% HURT stats (abs) min: 1 max: 1543 x̄: 26.31 x̃: 14 HURT stats (rel) min: <.01% max: 77.97% x̄: 4.11% x̃: 2.58% 95% mean confidence interval for cycles value: -55.10 9.39 95% mean confidence interval for cycles %-change: 0.62% 0.77% Inconclusive result (value mean confidence interval includes 0). Broadwell total cycles in shared programs: 904844939 -> 904832320 (<.01%) cycles in affected programs: 525360 -> 512741 (-2.40%) helped: 215 HURT: 4 helped stats (abs) min: 4 max: 1018 x̄: 60.16 x̃: 39 helped stats (rel) min: 0.14% max: 15.85% x̄: 2.16% x̃: 2.04% HURT stats (abs) min: 79 max: 79 x̄: 79.00 x̃: 79 HURT stats (rel) min: 1.31% max: 1.57% x̄: 1.43% x̃: 1.43% 95% mean confidence interval for cycles value: -75.02 -40.22 95% mean confidence interval for cycles %-change: -2.37% -1.81% Cycles are helped. No shader-db changes on any older Intel platforms. Tiger Lake, Ice Lake, and Skylake had similar results. (Ice Lake shown) Instructions in all programs: 142622800 -> 141461114 (-0.8%) Instructions helped: 197186 Cycles in all programs: 9101223846 -> 9099440025 (-0.0%) Cycles helped: 37963 Cycles hurt: 151233 Spills in all programs: 98829 -> 13695 (-86.1%) Spills helped: 2159 Fills in all programs: 128142 -> 18400 (-85.6%) Fills helped: 2159 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17379>	2022-07-08 19:45:34 +00:00
Ian Romanick	a477587b4a	intel/fs: Add _LOGICAL versions of URB messages The lowering is currently fake. It just changes the opcode from the _LOGICAL version to the non-_LOGICAL version. v2: Remove some rebase cruft. 's/gfx8_//;s/simd8_/' in brw_instruction_name. Both suggested by Ken. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17379>	2022-07-08 19:45:34 +00:00
Ian Romanick	07b9bfacc7	intel/compiler: Move logical-send lowering to a separate file brw_fs.cpp was 10kloc. Now it's only 7.5kloc. Ugh. v2: Rebase on `9680e0e4a2`. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17379>	2022-07-08 19:45:34 +00:00
Ian Romanick	c751ca769f	intel/eu: Validate some aspects of URB messages If these checks had been in place previously, some bugs that... eh-hem... practically took down the Intel CI would have been caught earlier. blush v2: Update to account for split sends. v3: Add some more Gfx version checks. Remove the redundant "src0 is a GRF" check. Both suggested by Ken. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17379>	2022-07-08 19:45:34 +00:00
Ian Romanick	b909ac350f	intel/compiler: Rename vec4 state URB opcodes to have VEC4_ prefix An argument could be made that all stage-specific opcodes for vec4 stages should be prefixed with VEC4_ like the stage-agnostic opcodes. I'll leave those additional sed jobs for another day. egrep -lr '(VS\|GS\|TCS)_OPCODE_URB_WRITE' src \|\ while read f; do sed --in-place 's/$VS\\|GS\\|TCS$_OPCODE_URB_WRITE/VEC4_\1_OPCODE_URB_WRITE/g' $f done egrep -lr 'T.S_OPCODE[_A-Z]URB_OFFSETS' src \|\ while read f; do sed --in-place 's/$T.S_OPCODE[_A-Z]URB_OFFSETS$/VEC4_\1/g' $f done Suggested-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17379>	2022-07-08 19:45:34 +00:00
Jesse Natalie	f7c741c058	dzn: Add for condition to break nested loop Fixes: `d132ec92` ("dzn: Support native image copies when formats are compatible") Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17377>	2022-07-08 19:17:53 +00:00
pal1000	36516b869e	dzn: Fix incompatible pointer type error affecting MSYS2 MINGW32 Suggested-by: Yonggang Luo <luoyonggang@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6807 Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17414>	2022-07-08 18:53:24 +00:00
David Heidelberg	81968e80cb	ci/traces: piglit, be more verbose Print more information about traces testing progress. Reviewed-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17416>	2022-07-08 17:57:36 +00:00
Samuel Pitoiset	e527b41191	radv/ci: enable fossils testing for GFX1100 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Martin Roukala <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16447>	2022-07-08 17:13:40 +02:00
Rhys Perry	98a65eafb7	aco: use scratch_* for VGPR spill/reload on GFX9+ fossil-db (navi21): Totals from 12 (0.01% of 162293) affected shaders: Instrs: 122808 -> 122782 (-0.02%); split: -0.11%, +0.09% CodeSize: 711248 -> 710788 (-0.06%); split: -0.16%, +0.10% SpillSGPRs: 928 -> 831 (-10.45%) SpillVGPRs: 1626 -> 1624 (-0.12%) Latency: 4960285 -> 4932547 (-0.56%) InvThroughput: 2574083 -> 2559953 (-0.55%) VClause: 3404 -> 3402 (-0.06%) Copies: 36992 -> 37181 (+0.51%); split: -0.05%, +0.56% Branches: 3582 -> 3585 (+0.08%) PreVGPRs: 3055 -> 3057 (+0.07%) fossil-db (vega10): Totals from 12 (0.01% of 161355) affected shaders: Instrs: 124817 -> 124383 (-0.35%); split: -0.46%, +0.12% CodeSize: 705116 -> 703664 (-0.21%); split: -0.44%, +0.23% SpillSGPRs: 1012 -> 898 (-11.26%) SpillVGPRs: 1632 -> 1624 (-0.49%) Scratch: 201728 -> 200704 (-0.51%) Latency: 6160115 -> 6266025 (+1.72%); split: -0.34%, +2.06% InvThroughput: 6440203 -> 6544595 (+1.62%); split: -0.35%, +1.97% VClause: 3409 -> 3423 (+0.41%) Copies: 37929 -> 37748 (-0.48%); split: -1.16%, +0.69% Branches: 3851 -> 3855 (+0.10%); split: -0.13%, +0.23% PreVGPRs: 3053 -> 3055 (+0.07%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17079>	2022-07-08 14:49:03 +00:00
Rhys Perry	0e783d687a	aco: use scratch_* for scratch load/store on GFX9+ fossil-db (navi21): Totals from 52 (0.03% of 162293) affected shaders: Instrs: 83190 -> 82145 (-1.26%) CodeSize: 454892 -> 447260 (-1.68%); split: -1.68%, +0.00% VGPRs: 4768 -> 4672 (-2.01%) Latency: 1490887 -> 1487170 (-0.25%); split: -0.68%, +0.43% InvThroughput: 935500 -> 933060 (-0.26%); split: -0.72%, +0.46% VClause: 2715 -> 2632 (-3.06%); split: -4.53%, +1.47% SClause: 1902 -> 1883 (-1.00%) Copies: 8839 -> 8496 (-3.88%) PreSGPRs: 2012 -> 1807 (-10.19%) PreVGPRs: 3282 -> 3192 (-2.74%) fossil-db (vega10): Totals from 41 (0.03% of 161355) affected shaders: Instrs: 35772 -> 35699 (-0.20%) CodeSize: 187040 -> 186584 (-0.24%) VGPRs: 4044 -> 4072 (+0.69%) Latency: 243088 -> 242379 (-0.29%) InvThroughput: 180301 -> 179783 (-0.29%) VClause: 1204 -> 1216 (+1.00%) SClause: 653 -> 637 (-2.45%) Copies: 3736 -> 3704 (-0.86%); split: -0.88%, +0.03% PreSGPRs: 1331 -> 1207 (-9.32%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17079>	2022-07-08 14:49:03 +00:00
Rhys Perry	d2d94b62f2	aco: initialize scratch base registers on GFX9-GFX10.3 fossil-db (navi21): Totals from 1142 (0.70% of 162293) affected shaders: Instrs: 271636 -> 271974 (+0.12%) CodeSize: 1532020 -> 1533792 (+0.12%) Latency: 7484066 -> 7485698 (+0.02%) InvThroughput: 4048824 -> 4049579 (+0.02%) SClause: 4171 -> 4212 (+0.98%) PreSGPRs: 11203 -> 12276 (+9.58%) fossil-db (vega10): Totals from 3327 (2.06% of 161355) affected shaders: Instrs: 257413 -> 257601 (+0.07%) CodeSize: 1424244 -> 1425372 (+0.08%) Latency: 8598402 -> 8600466 (+0.02%) InvThroughput: 7906335 -> 7908234 (+0.02%) SClause: 4932 -> 4973 (+0.83%) PreSGPRs: 22010 -> 25405 (+15.42%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17079>	2022-07-08 14:49:03 +00:00
Rhys Perry	97e9e42e0d	aco: treat flat-like as vmem in some scheduling heuristics fossil-db (navi21): Totals from 12 (0.01% of 162293) affected shaders: Instrs: 48754 -> 48762 (+0.02%) CodeSize: 267092 -> 267124 (+0.01%) Latency: 1293798 -> 1292303 (-0.12%); split: -0.12%, +0.00% InvThroughput: 854599 -> 853578 (-0.12%) VClause: 1623 -> 1619 (-0.25%) SClause: 1187 -> 1188 (+0.08%); split: -0.08%, +0.17% fossil-db (vega10): Totals from 1 (0.00% of 161355) affected shaders: Latency: 18720 -> 18848 (+0.68%) InvThroughput: 5775 -> 5776 (+0.02%) SClause: 12 -> 11 (-8.33%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17079>	2022-07-08 14:49:03 +00:00
Rhys Perry	29953d6048	aco: include scratch/global in VMEM WAW optimization fossil-db (navi21): Totals from 2 (0.00% of 162293) affected shaders: Instrs: 4788 -> 4785 (-0.06%) CodeSize: 25884 -> 25872 (-0.05%) Latency: 255008 -> 252950 (-0.81%) InvThroughput: 170005 -> 168633 (-0.81%) VClause: 206 -> 205 (-0.49%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17079>	2022-07-08 14:49:03 +00:00
Rhys Perry	c66206cbed	aco: avoid WAW hazard with BVH MIMG and other VMEM According to LLVM, image_bvh64_intersect_ray does not write results in order with other VMEM instructions. fossil-db (navi21): Totals from 7 (0.00% of 162293) affected shaders: Instrs: 39978 -> 39985 (+0.02%) CodeSize: 219356 -> 219384 (+0.01%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17079>	2022-07-08 14:49:03 +00:00
Rhys Perry	7d34044908	aco: refactor VGPR spill/reload lowering Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17079>	2022-07-08 14:49:03 +00:00
Rhys Perry	6642f2fd74	aco: handle subtractions in parse_base_offset Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17079>	2022-07-08 14:49:03 +00:00
Rhys Perry	52934f6cdb	aco: combine additions and constants into scratch load/store Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17079>	2022-07-08 14:49:03 +00:00
Rhys Perry	931a456db1	aco: improve support for scratch_* instructions Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17079>	2022-07-08 14:49:03 +00:00
Rhys Perry	cbeb25ce91	aco: make FLAT_instruction::offset signed Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17079>	2022-07-08 14:49:03 +00:00
Rhys Perry	5898afba53	aco: include flat-like in vmem clause statistics Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17079>	2022-07-08 14:49:03 +00:00
Rhys Perry	08ed6ebc55	aco: make flat access latency match mtbuf/mubuf/mimg Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17079>	2022-07-08 14:49:03 +00:00
Corentin Noël	5b683ba19a	virgl: Only progagate the uniform numbers if the numbers are actually right When the field was first introduces, the numbers were reporting the number of vec4 instead of the number of float. Do not propagate them if they are wrong. Fixes: `d92c1ca01b` Signed-off-by: Corentin Noël <corentin.noel@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17415>	2022-07-08 12:58:20 +00:00
Guilherme Gallo	70f1291d8e	ci/lava: Add canceled job status We should be explicit that we are cancelling jobs once the script finds some log messages that are linked with known issues. That means the script preemptively retried the job without giving chances to recover. Adds magenta color to cancelled jobs. Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17389>	2022-07-08 12:26:05 +00:00
Guilherme Gallo	4783e55039	ci/lava: Add `slow` pytest marker Mark test_full_yaml_log with this new marker to be easily run by the developers. Make `debian-testing` skip this test with `not slow` marker hint. Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17389>	2022-07-08 12:26:05 +00:00
Guilherme Gallo	84abb3df13	ci/lava: Color red for fatal and yellow for warning Fatal errors now have red foreground color and retry messages yellow one. Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17389>	2022-07-08 12:26:05 +00:00
Guilherme Gallo	daff21ef55	ci/lava: Make hung job status yellow It will help to know what happened to a non-successful job. Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17389>	2022-07-08 12:26:05 +00:00
Guilherme Gallo	2c51b7a9c9	ci/lava: Detect R8152 issues preemptively and retry Implement a log-based retry hint for R8152 issue described in #6681, which is based on detecting these two consecutive lines: ``` r8152 <USB> eth0: Tx status -71 nfs: server <IP> not responding, still trying ``` Where <IP> and <USB> could be any IP and USB addresses, respectfully. This commit is a temporary fix since it requires a section-aware log follower, implemented in !16323. When the cited MR is merged, one will make a proper fix on top of that. Closes: #6681 Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17389>	2022-07-08 12:26:05 +00:00
Guilherme Gallo	45a4b01427	ci/lava: Split lava_log into modules This script is getting too big, it been hard to extend it. Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17389>	2022-07-08 12:26:05 +00:00
Mike Blumenkrantz	2f3a233b6f	zink: flush pending clears for fb texture barriers if a texture barrier occurs while clears are pending, these clears should show up if the fb attachments are read in shaders, so trigger a renderpass to flush out the clears cc: mesa-stable fixes #6766 fixes (radv): dEQP-GLES3.functional.draw_buffers_indexed.overwrite_common.common_advanced_blend_eq_buffer_advanced_blend_eq dEQP-GLES3.functional.draw_buffers_indexed.overwrite_common.common_blend_eq_buffer_advanced_blend_eq dEQP-GLES3.functional.draw_buffers_indexed.overwrite_common.common_separate_blend_eq_buffer_advanced_blend_eq dEQP-GLES3.functional.draw_buffers_indexed.overwrite_indexed.common_advanced_blend_eq_buffer_advanced_blend_eq dEQP-GLES3.functional.draw_buffers_indexed.overwrite_indexed.common_advanced_blend_eq_buffer_blend_eq dEQP-GLES3.functional.draw_buffers_indexed.overwrite_indexed.common_advanced_blend_eq_buffer_separate_blend_eq Reviewed-By: Tatsuyuki Ishi <ishitatsuyuki@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17363>	2022-07-08 11:58:11 +00:00
Samuel Pitoiset	6517a2b926	radv: fix dumping VS prologs assembly This got removed by mistake and broke RADV_DEBUG=shaders,nocache,prologs. Fixes: `9fe2b6b748` ("aco/radv: provide a vs prolog callback from aco to radv.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17413>	2022-07-08 10:58:33 +00:00
Tatsuyuki Ishi	768cd5715d	radv: Fix vkCmdCopyQueryResults -> vkCmdResetPool hazard. The Vulkan specification states: > Query commands, for the same query and submitted to the same queue, > execute in their entirety in submission order, relative to each other. In > effect there is an implicit execution dependency from each such query > command to all query commands previously submitted to the same queue. Fixes dEQP-VK.query_pool.statistics_query.reset_after_copy.* Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17400>	2022-07-08 10:35:11 +00:00
Georg Lehmann	4f5e25ea8d	aco/assembler: Fix s_bitreplicate_b64_b32 on GFX9. This seems to be a relic from before aco added per generation opcodes. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17405>	2022-07-08 10:09:19 +00:00
Georg Lehmann	68db0a079b	aco: Fix swapping sources in SOPC -> SOPK optimization. Fixes: `2d6b0a4177` ("aco/optimizer: Optimize SOPC with literal to SOPK.") Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17407>	2022-07-08 09:43:51 +00:00
Georg Lehmann	27526ffad1	r600/sfn: Add missing std::array include. Fixes: `79ca456b48` ("r600/sfn: rewrite NIR backend") Closes https://gitlab.freedesktop.org/mesa/mesa/-/issues/6824 Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17412>	2022-07-08 09:34:53 +00:00
Pierre-Eric Pelloux-Prayer	01314d0880	radeonsi: use LLVMBuildLoad2 for inter-stage outputs loads The PS case was covered by the previous commit, so we can use f32 everywhere. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17361>	2022-07-08 08:41:26 +00:00
Pierre-Eric Pelloux-Prayer	248781dea1	radeonsi: use LLVMBuildLoad2 in llvm PS PS is the only shader type where unpacked 16-bit outputs can be used, so use ac_shader_abi::is_16bit to pass the proper type to LLVMBuildLoad2. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17361>	2022-07-08 08:41:26 +00:00
Pierre-Eric Pelloux-Prayer	326c042491	ac/llvm: use LLVMBuildLoad2 in visit_load Only FS can have f16 outputs, so always use f32 here. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17361>	2022-07-08 08:41:25 +00:00
Pierre-Eric Pelloux-Prayer	dc8d82516b	ac/llvm: handle opaque pointers in visit_store_output Outputs are always f32 or f16. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17361>	2022-07-08 08:41:25 +00:00
Pierre-Eric Pelloux-Prayer	196c4ebe1a	ac: add per output is_16bit flag to ac_shader_abi Outputs are always f32 except for FS that may use unpacked f16. Store this information here to make it available to later processing. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17361>	2022-07-08 08:41:25 +00:00
Pierre-Eric Pelloux-Prayer	c275e69cee	radeonsi: use LLVMBuildLoad2 where possible This commit replaces LLVMBuildLoad usage by LLVMBuildLoad2 where possible (= where the pointee type is known). Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17361>	2022-07-08 08:41:25 +00:00
Pierre-Eric Pelloux-Prayer	940734630d	ac: use LLVMContextSetOpaquePointers if available Disabling opaque pointers in LLVM doesn't fix all the issues but it makes pointers non-opaque by default (eg LLVMPointerType() returns a typed pointer). Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17361>	2022-07-08 08:41:25 +00:00
Danylo Piliaiev	d9296dcbbf	zink: re-enable EXT_primitives_generated_query for Turnip https://gitlab.freedesktop.org/mesa/mesa/-/issues/6602 is resolved so the extension could be re-enabled. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17163>	2022-07-08 11:15:20 +03:00
Danylo Piliaiev	bf4c160909	tu: Fix prim gen query and pipeline stats query interaction Fixed: - VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT was able to stop prim counter when pipeline stats query is running. - This may have happened when prim gen query was in secondary cmdbuf. - VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT counting geometry in each tile. - VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT counting geometry in each tile when pipeline stats query is started inside prim gen query and inside a renderpass. The matter of VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT and pipeline stats interaction is solved by tracking whether pipeline stats query is running both on CPU (for non secondary cmdbuf case) and on GPU (for secondary cmdbuf). Note, prim gen query is not allowed with secondary command buffers, so only pipeline stats query is tracked on gpu. See https://gitlab.khronos.org/vulkan/vulkan/-/issues/3142 Counting geometry per each tile is solved by: - Conditionally executing START/STOP_PRIMITIVE_CTRS to not run in tiling pass. Solves the case when prim gen query is inside a renderpass. - Stop prim counters before executing `draw_cs` and restarting them afterwards. Solves prim gen query being outside a renderpass. Fixes GL CTS tests with Zink + `TU_DEBUG=gmem`: GTF-GL46.gtf30.GL3Tests.transform_feedback.transform_feedback_max_separate GTF-GL46.gtf40.GL3Tests.transform_feedback2.transform_feedback2_basic GTF-GL46.gtf40.GL3Tests.transform_feedback2.transform_feedback2_framebuffer GTF-GL46.gtf40.GL3Tests.transform_feedback3.transform_feedback3_streams_overflow GTF-GL46.gtf40.GL3Tests.transform_feedback3.transform_feedback3_streams_queried GTF-GL46.gtf40.GL3Tests.transform_feedback2.transform_feedback2_states Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6602 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17163>	2022-07-08 11:14:18 +03:00
Danylo Piliaiev	465e7c303b	tu,freedreno: Refactored START/STOP events for pipeline stats For a5xx+ renamed: - RST_PIX_CNT -> START_FRAGMENT_CTRS - RST_VTX_CNT -> STOP_FRAGMENT_CTRS - TILE_FLUSH -> START_COMPUTE_CTRS - STAT_EVENT -> STOP_COMPUTE_CTRS I'm not sure about a5xx itself but I'll take a chance of it being similar to a6xx in this regard. Knowing this emit_begin_stat_query/emit_end_stat_query can now emit only events that are needed for the pool's flags. Also primitive generated query clearly doesn't need fragment and compute counters. Passes tests: dEQP-VK.query_pool.statistics_query.* dEQP-VK.transform_feedback.primitives_generated_query.* Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17163>	2022-07-08 11:14:18 +03:00

1 2 3 4 5 ...

156405 Commits All Branches Search

156405 Commits

All Branches