mirrors/mesa - Frog Git

Commit Graph

Author	SHA1	Message	Date
Qiang Yu	b3ba33b6f1	mesa: add _mesa_bufferobj_get_subdata Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15765>	2022-06-06 18:23:49 +00:00
Qiang Yu	2224d6c35d	mesa: add hardware accelerated select constant Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15765>	2022-06-06 18:23:49 +00:00
Qiang Yu	ff8ae4e589	nir/builder: add load/store array variable helper functions Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15765>	2022-06-06 18:23:49 +00:00
Qiang Yu	1ef734cde6	mesa/vbo: remove unused vbo_context->binding Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15765>	2022-06-06 18:23:49 +00:00
Qiang Yu	feea8fed44	mesa/program: fix nir output reg overflow outputs_written is uint64_t, should count max reg number by util_last_bit64(). Otherwise the following access will overflow the allocated array with a smaller size. cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15765>	2022-06-06 18:23:49 +00:00
Alyssa Rosenzweig	28801cfba0	pan/va: Unit test constant lowering pass Like other optimizations, breaking this pass may not affect functional correctness. It's also dead simple to unit test the pass, so we have no excuse not to. Add unit tests for the functionality we currently support, since we just extended it and want to make sure everything still works. This includes tests for use of modifiers to get more small constants. There are lots of subtle gotchas there, so let's add lots of unit tests to make sure we got it right. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16862>	2022-06-06 18:10:24 +00:00
Alyssa Rosenzweig	9cfafbb09b	pan/va: Try widening small constants Many small integers are availabled as small constants, but the table of small constants is tightly packed. Zero and sign extensions are usually required to access small integers. When packing constants, try zero/sign extension for unsigned/signed integer instructions respectively. total instructions in shared programs: 2716912 -> 2707795 (-0.34%) instructions in affected programs: 1045609 -> 1036492 (-0.87%) helped: 4460 HURT: 125 helped stats (abs) min: 1.0 max: 58.0 x̄: 2.14 x̃: 1 helped stats (rel) min: 0.14% max: 23.85% x̄: 1.35% x̃: 0.88% HURT stats (abs) min: 1.0 max: 68.0 x̄: 3.41 x̃: 1 HURT stats (rel) min: 0.34% max: 3.88% x̄: 0.93% x̃: 0.70% 95% mean confidence interval for instructions value: -2.09 -1.89 95% mean confidence interval for instructions %-change: -1.33% -1.25% Instructions are helped. total cycles in shared programs: 141984.06 -> 141932.42 (-0.04%) cycles in affected programs: 552.08 -> 500.44 (-9.35%) helped: 18 HURT: 0 helped stats (abs) min: 0.015625 max: 11.0 x̄: 2.87 x̃: 0 helped stats (rel) min: 0.50% max: 19.64% x̄: 5.36% x̃: 1.53% 95% mean confidence interval for cycles value: -5.17 -0.56 95% mean confidence interval for cycles %-change: -9.28% -1.44% Cycles are helped. total cvt in shared programs: 13805.05 -> 13663.39 (-1.03%) cvt in affected programs: 6127.45 -> 5985.80 (-2.31%) helped: 4460 HURT: 125 helped stats (abs) min: 0.015625 max: 0.90625 x̄: 0.03 x̃: 0 helped stats (rel) min: 0.35% max: 50.00% x̄: 5.19% x̃: 4.00% HURT stats (abs) min: 0.015625 max: 1.0625 x̄: 0.05 x̃: 0 HURT stats (rel) min: 0.77% max: 9.30% x̄: 3.40% x̃: 2.78% 95% mean confidence interval for cvt value: -0.03 -0.03 95% mean confidence interval for cvt %-change: -5.10% -4.81% Cvt are helped. total ls in shared programs: 129545 -> 129494 (-0.04%) ls in affected programs: 495 -> 444 (-10.30%) helped: 6 HURT: 0 helped stats (abs) min: 2.0 max: 11.0 x̄: 8.50 x̃: 11 helped stats (rel) min: 1.49% max: 19.64% x̄: 13.95% x̃: 19.64% 95% mean confidence interval for ls value: -12.68 -4.32 95% mean confidence interval for ls %-change: -23.23% -4.67% Ls are helped. total quadwords in shared programs: 1476416 -> 1469824 (-0.45%) quadwords in affected programs: 121208 -> 114616 (-5.44%) helped: 820 HURT: 16 helped stats (abs) min: 8.0 max: 32.0 x̄: 8.28 x̃: 8 helped stats (rel) min: 1.39% max: 50.00% x̄: 11.00% x̃: 10.00% HURT stats (abs) min: 8.0 max: 32.0 x̄: 12.50 x̃: 8 HURT stats (rel) min: 1.38% max: 10.00% x̄: 6.19% x̃: 7.14% 95% mean confidence interval for quadwords value: -8.14 -7.63 95% mean confidence interval for quadwords %-change: -11.20% -10.15% Quadwords are helped. total threads in shared programs: 53633 -> 53663 (0.06%) threads in affected programs: 39 -> 69 (76.92%) helped: 33 HURT: 3 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 100.00% max: 100.00% x̄: 100.00% x̃: 100.00% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 50.00% max: 50.00% x̄: 50.00% x̃: 50.00% 95% mean confidence interval for threads value: 0.64 1.02 95% mean confidence interval for threads %-change: 73.27% 101.73% Threads are helped. total spills in shared programs: 154 -> 103 (-33.12%) spills in affected programs: 75 -> 24 (-68.00%) helped: 6 HURT: 0 total fills in shared programs: 656 -> 656 (0.00%) fills in affected programs: 148 -> 148 (0.00%) helped: 2 HURT: 4 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16862>	2022-06-06 18:10:23 +00:00
Alyssa Rosenzweig	72146051d5	pan/va: Try negating small constants when lowering If a constant is used with a floating point instruction with a floating-point negate modifier, we can use the modifier to negate constants in the table for free. Each floating point in the table is positive, so this is required for negative small constants. total instructions in shared programs: 2728438 -> 2716912 (-0.42%) instructions in affected programs: 1418220 -> 1406694 (-0.81%) helped: 6053 HURT: 94 helped stats (abs) min: 1.0 max: 43.0 x̄: 1.94 x̃: 1 helped stats (rel) min: 0.06% max: 18.18% x̄: 1.34% x̃: 0.84% HURT stats (abs) min: 1.0 max: 5.0 x̄: 2.34 x̃: 2 HURT stats (rel) min: 0.09% max: 21.43% x̄: 1.87% x̃: 0.91% 95% mean confidence interval for instructions value: -1.93 -1.82 95% mean confidence interval for instructions %-change: -1.34% -1.25% Instructions are helped. total cycles in shared programs: 142103 -> 141984.06 (-0.08%) cycles in affected programs: 766.70 -> 647.77 (-15.51%) helped: 97 HURT: 0 helped stats (abs) min: 0.015625 max: 40.0 x̄: 1.23 x̃: 0 helped stats (rel) min: 0.27% max: 41.24% x̄: 3.63% x̃: 2.08% 95% mean confidence interval for cycles value: -2.41 -0.04 95% mean confidence interval for cycles %-change: -4.68% -2.57% Cycles are helped. total cvt in shared programs: 13983.34 -> 13805.05 (-1.28%) cvt in affected programs: 7952.45 -> 7774.16 (-2.24%) helped: 6049 HURT: 98 helped stats (abs) min: 0.015625 max: 0.359375 x̄: 0.03 x̃: 0 helped stats (rel) min: 0.25% max: 100.00% x̄: 4.74% x̃: 2.52% HURT stats (abs) min: 0.015625 max: 0.078125 x̄: 0.04 x̃: 0 HURT stats (rel) min: 0.17% max: 100.00% x̄: 5.48% x̃: 2.54% 95% mean confidence interval for cvt value: -0.03 -0.03 95% mean confidence interval for cvt %-change: -4.83% -4.32% Cvt are helped. total ls in shared programs: 129660 -> 129545 (-0.09%) ls in affected programs: 601 -> 486 (-19.13%) helped: 7 HURT: 0 helped stats (abs) min: 3.0 max: 40.0 x̄: 16.43 x̃: 8 helped stats (rel) min: 2.88% max: 41.24% x̄: 17.41% x̃: 12.50% 95% mean confidence interval for ls value: -31.42 -1.44 95% mean confidence interval for ls %-change: -29.25% -5.58% Ls are helped. total quadwords in shared programs: 1482728 -> 1476416 (-0.43%) quadwords in affected programs: 131200 -> 124888 (-4.81%) helped: 798 HURT: 15 helped stats (abs) min: 8.0 max: 24.0 x̄: 8.06 x̃: 8 helped stats (rel) min: 0.34% max: 50.00% x̄: 10.15% x̃: 6.67% HURT stats (abs) min: 8.0 max: 8.0 x̄: 8.00 x̃: 8 HURT stats (rel) min: 1.49% max: 100.00% x̄: 11.25% x̃: 2.78% 95% mean confidence interval for quadwords value: -7.92 -7.60 95% mean confidence interval for quadwords %-change: -10.52% -8.99% Quadwords are helped. total threads in shared programs: 53585 -> 53633 (0.09%) threads in affected programs: 51 -> 99 (94.12%) helped: 49 HURT: 1 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 100.00% max: 100.00% x̄: 100.00% x̃: 100.00% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 50.00% max: 50.00% x̄: 50.00% x̃: 50.00% 95% mean confidence interval for threads value: 0.88 1.04 95% mean confidence interval for threads %-change: 90.97% 103.03% Threads are helped. total spills in shared programs: 125 -> 154 (23.20%) spills in affected programs: 75 -> 104 (38.67%) helped: 3 HURT: 4 total fills in shared programs: 800 -> 656 (-18.00%) fills in affected programs: 476 -> 332 (-30.25%) helped: 7 HURT: 0 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16862>	2022-06-06 18:10:23 +00:00
Alyssa Rosenzweig	cecfa0c44a	pan/va: Record which instructions are signed We need to distinguish signed integer instructions from unsigned integer instructions, to distinguish sign-extension and zero-extension of sources. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16862>	2022-06-06 18:10:23 +00:00
Rhys Perry	f4c02d9116	aco: fix SMEM load_global with VGPR address and non-zero offset Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Fixes: `3e9517c757` ("aco: implement _amd global access intrinsics") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16775>	2022-06-06 17:47:59 +00:00
Rhys Perry	4d9f3fcf9c	aco: fix SMEM load_global_amd with non-zero offset Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Fixes: `3e9517c757` ("aco: implement _amd global access intrinsics") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16775>	2022-06-06 17:47:59 +00:00
Juan A. Suarez Romero	695f66cecd	v3d: save only required states in blitter Some blitter operations, like clear, doesn't require to save all the states. This is particular important because, besides saving time, the blitter operation restores the state required for the operation, and if we saved more states than those, these ones won't be restored and will be leak. So this also fixes some leaks when running CTS tests. CC: mesa-stable Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16837>	2022-06-06 16:25:53 +00:00
Juan A. Suarez Romero	92474951a3	v3d: use function to initialize refcount Call proper pipe reference function to initialize the reference counting. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16837>	2022-06-06 16:25:53 +00:00
Alyssa Rosenzweig	e57dfed419	pan/bi: Implement b2i with MUX The result_type modifier propagation looks for MUX instructions, so using this canonical b2i implementation allows the sequence b2i(cmp) to be fused. It's also faster on its own: on Valhall, MUX may be implemented as CSEL on the CVT unit, while AND may only be implemented on the SFU unit. So in case this doesn't get fused, we expect 4x better throughput for b2i with this implementation. Similarly, on Bifrost, MUX may be scheduled to either unit (as CSEL on FMA or MUX on ADD), whereas AND may only be scheduled to FMA. Results on Mali-G52: total instructions in shared programs: 2419171 -> 2414814 (-0.18%) instructions in affected programs: 272203 -> 267846 (-1.60%) helped: 767 HURT: 0 helped stats (abs) min: 1.0 max: 138.0 x̄: 5.68 x̃: 2 helped stats (rel) min: 0.12% max: 15.57% x̄: 2.09% x̃: 0.68% 95% mean confidence interval for instructions value: -6.68 -4.68 95% mean confidence interval for instructions %-change: -2.37% -1.82% Instructions are helped. total tuples in shared programs: 1932822 -> `1929234` (-0.19%) tuples in affected programs: 76485 -> 72897 (-4.69%) helped: 380 HURT: 3 helped stats (abs) min: 1.0 max: 138.0 x̄: 9.46 x̃: 1 helped stats (rel) min: 0.14% max: 15.96% x̄: 3.81% x̃: 0.92% HURT stats (abs) min: 1.0 max: 6.0 x̄: 2.67 x̃: 1 HURT stats (rel) min: 0.38% max: 8.57% x̄: 3.80% x̃: 2.44% 95% mean confidence interval for tuples value: -11.30 -7.44 95% mean confidence interval for tuples %-change: -4.27% -3.22% Tuples are helped. total clauses in shared programs: 356094 -> 355992 (-0.03%) clauses in affected programs: 3264 -> 3162 (-3.12%) helped: 80 HURT: 0 helped stats (abs) min: 1.0 max: 9.0 x̄: 1.27 x̃: 1 helped stats (rel) min: 0.81% max: 50.00% x̄: 4.83% x̃: 3.39% 95% mean confidence interval for clauses value: -1.49 -1.06 95% mean confidence interval for clauses %-change: -6.23% -3.43% Clauses are helped. total cycles in shared programs: 167337.10 -> 167329.19 (<.01%) cycles in affected programs: 510.08 -> 502.17 (-1.55%) helped: 80 HURT: 2 helped stats (abs) min: 0.041665999999999315 max: 0.7916659999999993 x̄: 0.10 x̃: 0 helped stats (rel) min: 0.51% max: 13.64% x̄: 2.12% x̃: 1.34% HURT stats (abs) min: 0.041665999999999315 max: 0.0416669999999999 x̄: 0.04 x̃: 0 HURT stats (rel) min: 0.39% max: 2.78% x̄: 1.58% x̃: 1.58% 95% mean confidence interval for cycles value: -0.12 -0.07 95% mean confidence interval for cycles %-change: -2.59% -1.48% Cycles are helped. total arith in shared programs: 73819.54 -> 73669.25 (-0.20%) arith in affected programs: 2840.54 -> 2690.25 (-5.29%) helped: 383 HURT: 3 helped stats (abs) min: 0.041665999999999315 max: 5.75 x̄: 0.39 x̃: 0 helped stats (rel) min: 0.33% max: 18.81% x̄: 4.39% x̃: 0.98% HURT stats (abs) min: 0.041665999999999315 max: 0.25 x̄: 0.11 x̃: 0 HURT stats (rel) min: 0.39% max: 8.96% x̄: 4.04% x̃: 2.78% 95% mean confidence interval for arith value: -0.47 -0.31 95% mean confidence interval for arith %-change: -4.93% -3.71% Arith are helped. total quadwords in shared programs: 1679798 -> 1676259 (-0.21%) quadwords in affected programs: 72826 -> 69287 (-4.86%) helped: 381 HURT: 15 helped stats (abs) min: 1.0 max: 142.0 x̄: 9.35 x̃: 1 helped stats (rel) min: 0.25% max: 18.87% x̄: 4.33% x̃: 1.13% HURT stats (abs) min: 1.0 max: 6.0 x̄: 1.47 x̃: 1 HURT stats (rel) min: 0.30% max: 6.25% x̄: 0.77% x̃: 0.35% 95% mean confidence interval for quadwords value: -10.76 -7.11 95% mean confidence interval for quadwords %-change: -4.71% -3.56% Quadwords are helped. Results on Mali-G57: total instructions in shared programs: 2704193 -> 2699317 (-0.18%) instructions in affected programs: 293366 -> 288490 (-1.66%) helped: 758 HURT: 5 helped stats (abs) min: 1.0 max: 151.0 x̄: 6.45 x̃: 2 helped stats (rel) min: 0.11% max: 22.22% x̄: 2.05% x̃: 0.64% HURT stats (abs) min: 1.0 max: 7.0 x̄: 2.20 x̃: 1 HURT stats (rel) min: 0.22% max: 1.69% x̄: 0.87% x̃: 1.08% 95% mean confidence interval for instructions value: -7.42 -5.36 95% mean confidence interval for instructions %-change: -2.27% -1.79% Instructions are helped. total cycles in shared programs: 141711.73 -> 141711.84 (<.01%) cycles in affected programs: 214.36 -> 214.47 (0.05%) helped: 4 HURT: 42 helped stats (abs) min: 0.015625 max: 0.359375 x̄: 0.20 x̃: 0 helped stats (rel) min: 1.85% max: 12.78% x̄: 9.12% x̃: 10.93% HURT stats (abs) min: 0.015625 max: 0.09375 x̄: 0.02 x̃: 0 HURT stats (rel) min: 0.17% max: 17.65% x̄: 0.84% x̃: 0.34% 95% mean confidence interval for cycles value: -0.02 0.03 95% mean confidence interval for cycles %-change: -1.23% 1.17% Inconclusive result (value mean confidence interval includes 0). total cvt in shared programs: 14479.14 -> 14474.19 (-0.03%) cvt in affected programs: 2877.05 -> 2872.09 (-0.17%) helped: 508 HURT: 209 helped stats (abs) min: 0.015625 max: 0.453125 x̄: 0.02 x̃: 0 helped stats (rel) min: 0.25% max: 16.67% x̄: 1.23% x̃: 0.37% HURT stats (abs) min: 0.015625 max: 0.296875 x̄: 0.03 x̃: 0 HURT stats (rel) min: 0.15% max: 18.18% x̄: 1.70% x̃: 0.34% 95% mean confidence interval for cvt value: -0.01 -0.00 95% mean confidence interval for cvt %-change: -0.57% -0.18% Cvt are helped. total sfu in shared programs: 7875.69 -> 7590.75 (-3.62%) sfu in affected programs: 1567.38 -> 1282.44 (-18.18%) helped: 906 HURT: 0 helped stats (abs) min: 0.0625 max: 8.625 x̄: 0.31 x̃: 0 helped stats (rel) min: 2.38% max: 100.00% x̄: 16.80% x̃: 5.63% 95% mean confidence interval for sfu value: -0.37 -0.26 95% mean confidence interval for sfu %-change: -18.43% -15.17% Sfu are helped. total quadwords in shared programs: 1468152 -> 1465800 (-0.16%) quadwords in affected programs: 37104 -> 34752 (-6.34%) helped: 161 HURT: 2 helped stats (abs) min: 8.0 max: 80.0 x̄: 14.71 x̃: 8 helped stats (rel) min: 1.67% max: 20.00% x̄: 8.05% x̃: 7.69% HURT stats (abs) min: 8.0 max: 8.0 x̄: 8.00 x̃: 8 HURT stats (rel) min: 3.57% max: 3.85% x̄: 3.71% x̃: 3.71% 95% mean confidence interval for quadwords value: -16.29 -12.57 95% mean confidence interval for quadwords %-change: -8.58% -7.22% Quadwords are helped. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16857>	2022-06-06 16:08:25 +00:00
Alyssa Rosenzweig	8f3b62f87e	pan/va: Add MUX lowering tests Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16857>	2022-06-06 16:08:25 +00:00
Alyssa Rosenzweig	677a66b3eb	pan/va: Lower MUX to CSEL where possible CSEL executes on the conversion unit (CVT), while MUX executes on the special function unit (SFU). Throughput on CVT is 4x higher than SFU, so this is (almost) always an optimization. The "real" MUX is still used for unusual cases, like 8-bit and bitselect. Note that it's easier for us to use MUX everywhere for the IR. This is an easy fixup to get better codegen on Valhall without touching the core Bifrost code. shader-db is a bit of a toss up: register pressure and instruction count are hurt in some cases due to restrictions on FAU access. In particular, a shader that muxes between two uniforms needs an extra move due to extra constant (zero). However, in terms of throughput this is still a win: 2 CVT instructions (MOV + CSEL) have 2x throughput to 1 SFU instruction (MUX). The MOV has opportunities for CSE, but that can hurt pressure in turn. Overall, cycles are helped substantially. total instructions in shared programs: 2728438 -> 2731597 (0.12%) instructions in affected programs: 414391 -> 417550 (0.76%) helped: 87 HURT: 1063 helped stats (abs) min: 1.0 max: 6.0 x̄: 5.17 x̃: 6 helped stats (rel) min: 0.19% max: 15.79% x̄: 4.12% x̃: 4.11% HURT stats (abs) min: 1.0 max: 56.0 x̄: 3.40 x̃: 2 HURT stats (rel) min: 0.11% max: 23.43% x̄: 1.15% x̃: 0.63% 95% mean confidence interval for instructions value: 2.47 3.03 95% mean confidence interval for instructions %-change: 0.61% 0.90% Instructions are HURT. total cycles in shared programs: 142103 -> 142015.75 (-0.06%) cycles in affected programs: 1263.45 -> 1176.20 (-6.91%) helped: 281 HURT: 176 helped stats (abs) min: 0.015625 max: 2.234375 x̄: 0.50 x̃: 0 helped stats (rel) min: 0.71% max: 54.17% x̄: 16.93% x̃: 15.31% HURT stats (abs) min: 0.015625 max: 30.0 x̄: 0.30 x̃: 0 HURT stats (rel) min: 0.84% max: 120.00% x̄: 7.16% x̃: 5.00% 95% mean confidence interval for cycles value: -0.33 -0.05 95% mean confidence interval for cycles %-change: -9.08% -6.22% Cycles are helped. total cvt in shared programs: 13983.34 -> 14891.70 (6.50%) cvt in affected programs: 7498.36 -> 8406.72 (12.11%) helped: 71 HURT: 4711 helped stats (abs) min: 0.0625 max: 0.0625 x̄: 0.06 x̃: 0 helped stats (rel) min: 5.41% max: 40.00% x̄: 10.23% x̃: 9.30% HURT stats (abs) min: 0.015625 max: 2.640625 x̄: 0.19 x̃: 0 HURT stats (rel) min: 0.18% max: 141.18% x̄: 16.21% x̃: 9.52% 95% mean confidence interval for cvt value: 0.18 0.20 95% mean confidence interval for cvt %-change: 15.21% 16.42% Cvt are HURT. total sfu in shared programs: 11320.44 -> 7882.56 (-30.37%) sfu in affected programs: 7618.50 -> 4180.62 (-45.13%) helped: 4782 HURT: 0 helped stats (abs) min: 0.0625 max: 10.5625 x̄: 0.72 x̃: 0 helped stats (rel) min: 1.34% max: 100.00% x̄: 41.91% x̃: 37.50% 95% mean confidence interval for sfu value: -0.75 -0.68 95% mean confidence interval for sfu %-change: -42.68% -41.14% Sfu are helped. total ls in shared programs: 129660 -> 129690 (0.02%) ls in affected programs: 25 -> 55 (120.00%) helped: 0 HURT: 1 total quadwords in shared programs: 1482728 -> 1484128 (0.09%) quadwords in affected programs: 58624 -> 60024 (2.39%) helped: 24 HURT: 195 helped stats (abs) min: 8.0 max: 8.0 x̄: 8.00 x̃: 8 helped stats (rel) min: 3.70% max: 20.00% x̄: 10.34% x̃: 10.00% HURT stats (abs) min: 8.0 max: 24.0 x̄: 8.16 x̃: 8 HURT stats (rel) min: 1.41% max: 50.00% x̄: 4.84% x̃: 2.56% 95% mean confidence interval for quadwords value: 5.70 7.09 95% mean confidence interval for quadwords %-change: 2.22% 4.14% Quadwords are HURT. total spills in shared programs: 125 -> 127 (1.60%) spills in affected programs: 0 -> 2 helped: 0 HURT: 1 total fills in shared programs: 800 -> 828 (3.50%) fills in affected programs: 0 -> 28 helped: 0 HURT: 1 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16857>	2022-06-06 16:08:25 +00:00
Alyssa Rosenzweig	3741606b25	pan/va: Implement more lanes Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16857>	2022-06-06 16:08:25 +00:00
Alyssa Rosenzweig	1768afa5b9	pan/bi: Extract MUX to CSEL optimization It's portable, and useful to both Bifrost and Valhall, in the clause scheduler and in an instruction selection respectively. Move it from the Bifrost clause scheduler to common code so we can share the benefits. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16857>	2022-06-06 16:08:25 +00:00
Frank Binns	fd0f02ec4e	pvr: shorten error to err in label names This is for consistency with the rest of the driver. Signed-off-by: Frank Binns <frank.binns@imgtec.com> Reviewed-by: Karmjit Mahil <Karmjit.Mahil@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16882>	2022-06-06 15:58:33 +00:00
Juan A. Suarez Romero	8f3c60a93d	v3d/ci: Add traces Add a job to run and test traces from Tracies DB. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16809>	2022-06-06 15:18:50 +00:00
Alyssa Rosenzweig	c87629771d	panfrost: Don't calculate min/max indices on v9 On Valhall, we always* use memory-allocated IDVS, which does not require min/max indices. As such, we do not want to calculate min/max indices, as this is quite slow. Skip this step. * except for blit shaders, which don't use an index buffer anyway. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16867>	2022-06-06 14:58:53 +00:00
Alyssa Rosenzweig	ca6d06fa91	panfrost: Extract panfrost_get_index_buffer helper Memory-allocated IDVS does not require min/max indices to be calculated, but it of course requires an index buffer. Extract a helper to upload the index buffer without calculating bounds. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16867>	2022-06-06 14:58:53 +00:00
Alyssa Rosenzweig	e1fb182d90	pan/va: Do not insert NOPs into empty shaders It's unnecessary and breaks the empty shader optimizations. Noticed while inspecting a trace from dEQP-GLES3.functional.color_clear.masked_scissored_rgb, which does not produce any varyings other than gl_Position in its vertex shader and hence should omit the varying shader. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16868>	2022-06-06 14:28:59 +00:00
Konstantin Seurer	e8da8fc5b7	radv: Require an alignment of 64 for accel structs Top level acceleration structures need the bottom 6 bits to store the root ids of instances. If we don't require that alignment, more "advanced" allocators like VMA may sub allocate a buffer which can lead to the 6 getting lost. Fixes the Khronos ray tracing Vulkan samples. Closes: #6598 Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16870>	2022-06-06 13:49:24 +00:00
David Heidelberg	9eb40f57a2	ci/virgl: traces: temporarily disable nheko trace Disable nheko trace until apitrace gets fixed. apitrace currently fails with this trace, when more than 1 run is requested. Upstream issue: https://github.com/apitrace/apitrace/issues/800 Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16887>	2022-06-06 13:29:36 +00:00
Mike Blumenkrantz	de63ccfc1e	zink: remove buffer valid range tracking from blit I copy/pasted too hard. this code could never be reached Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16877>	2022-06-06 00:36:20 +00:00
Mike Blumenkrantz	79685199f4	zink: invalidate blit dsts if fully covered tiling perf++ since there's no need to load Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16877>	2022-06-06 00:36:20 +00:00
Mike Blumenkrantz	de1e67b39d	zink: hook up surface invalidation to LOAD_OP_DONT_CARE this should improve perf for tilers Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16877>	2022-06-06 00:36:20 +00:00
Mike Blumenkrantz	c7ad86b40f	zink: split out a dynamic render ternary this is going to get bigger Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16877>	2022-06-06 00:36:20 +00:00
Mike Blumenkrantz	e6ec9ca0ab	zink: rename renderpass attrib value this never really meant "swapchain", it just meant that load isn't needed Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16877>	2022-06-06 00:36:20 +00:00
Mike Blumenkrantz	5897ade22d	zink: flag renderpass for change if image resource changes valid state the next renderpass instance will need to use different load ops, so flag it here to ensure that gets picked up Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16877>	2022-06-06 00:36:20 +00:00
Mike Blumenkrantz	3e2c65281d	zink: track invalidation for image resources an image only has valid data if: * it's imported * it's written to * it's mapped for write Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16877>	2022-06-06 00:36:20 +00:00
Mike Blumenkrantz	8575080990	zink: disable EXT_primitives_generated_query on turnip this is broken Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16861>	2022-06-06 00:21:02 +00:00
Mike Blumenkrantz	9683de9bc4	zink: remove ANV depth clip control workaround this was fixed a while ago and I forgot Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16861>	2022-06-06 00:21:02 +00:00
Mike Blumenkrantz	06859ba69c	mesa: handle atomic counter lowering for drivers with big ssbo offset aligns according to the spec, atomic counters can be bound at any offset divisible by 4, which means that any driver that uses the ssbo lowering pass and doesn't have a min offset align of 4 is potentially broken to handle this, use a statevar to inject the misaligned remainder of the offset into the shader as a uniform. for well-aligned counter binds, the uniform offset will be 0 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16749>	2022-06-05 23:16:36 +00:00
Mike Blumenkrantz	5b5eb77a87	st/glsl_to_nir: call st_set_prog_affected_state_flags() as late as possible this function should be called late to allow for other passes potentially making changes which affect the states in use by shaders Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16749>	2022-06-05 23:16:36 +00:00
Mike Blumenkrantz	93d9f086a3	mesa: conditionally set constants dirty for atomic counter binds this is necessary for updating the offset uniforms Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16749>	2022-06-05 23:16:36 +00:00
Mike Blumenkrantz	b3fbd498e0	mesa: add statevar for atomic counter offsets some hardware can't do a ssbo offset=4, as required by the atomic->ssbo lowering pass, so for these cases an offset can be passed for the counter as a uniform, and the shaders can be adjusted accordingly Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16749>	2022-06-05 23:16:36 +00:00
Pavel Ondračka	6c2959c025	r300: merge simple movs with constant swizzles together This pass will merge instructions like these MOV output[0].x, temp[5].x___; MOV output[0].yzw, none._001; into MOV output[0].xyzw, temp[5].x001; It is currently very careful with control flow and dependency tracking, so there is still room for improvements. Shader-db stats with RV530: total instructions in shared programs: 132486 -> 132256 (-0.17%) instructions in affected programs: 6186 -> 5956 (-3.72%) helped: 65 HURT: 0 total temps in shared programs: 18035 -> 18014 (-0.12%) temps in affected programs: 295 -> 274 (-7.12%) helped: 22 HURT: 1 Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Reviewed-by: Filip Gawin <filip@gawin.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16657>	2022-06-05 21:38:36 +00:00
Filip Gawin	0fcd423a6a	r300: don't check for unitialized reads when rewriting register This fixes the "Rewrite of inst X failed Can't allocate source for Inst X src_type=X new_index=X new_mask=X" errors. The compiler is quite strict when rewriting registers during the pair allocation and checks that all of the reads of it are initialized. However the spec doesn't enfore that, and specifically with control flow depending on user input we can't really know... In the following example temp[4].x is written only in one branch, that might or might not be taken, but this is enough to keep the compiler happy: IF aluresult.x___; MAD temp[4].x, src0.1__, src0.111, src0.000 ENDIF; src0.xyz = temp[4], src0.w = temp[4] MAD color[0].xyz, src0.xyz, src0.111, src0.000 MAD color[0].w, src0.w, src0.1, src0.0 After switch to ntt, more IFs are converted to CMP, and the color write looks like this. Please note that the CMP here is not TGSI opcode but rather our US_OP_RGB_CMP: src2 >= 0 ? src0 : src1 src0.xyz = temp[4], src0.w = temp[4], src1.xyz = temp[3], src1.w = temp[12], src2.xyz = temp[2] CMP color[0].xyz, src0.xyz, src1.xyz, -src2.xxx CMP color[0].w, src0.w, src1.w, -src2.x At this point temp[4].x is undefined. Now when compiler tries to allocate register for temp[4] at some previous instruction, it will find out that it is used as a source in the final CMP and bail out. Instead of increasing the complexitty even more trying to account for this, just get rid of the check completelly. Fixes: dEQP-GLES2.functional.shaders.indexing.vector_subscript.vec2_dynamic_subscript_write_component_read_fragment,Fail dEQP-GLES2.functional.shaders.indexing.vector_subscript.vec2_dynamic_subscript_write_direct_read_fragment,Fail dEQP-GLES2.functional.shaders.indexing.vector_subscript.vec2_dynamic_subscript_write_dynamic_subscript_read_fragment,Fail dEQP-GLES2.functional.shaders.indexing.vector_subscript.vec2_dynamic_subscript_write_static_loop_subscript_read_fragment,Fail dEQP-GLES2.functional.shaders.indexing.vector_subscript.vec2_dynamic_subscript_write_static_subscript_read_fragment,Fail dEQP-GLES2.functional.shaders.indexing.vector_subscript.vec3_dynamic_subscript_write_component_read_fragment,Fail dEQP-GLES2.functional.shaders.indexing.vector_subscript.vec3_dynamic_subscript_write_direct_read_fragment,Fail dEQP-GLES2.functional.shaders.indexing.vector_subscript.vec3_dynamic_subscript_write_dynamic_subscript_read_fragment,Fail dEQP-GLES2.functional.shaders.indexing.vector_subscript.vec3_dynamic_subscript_write_static_loop_subscript_read_fragment,Fail dEQP-GLES2.functional.shaders.indexing.vector_subscript.vec3_dynamic_subscript_write_static_subscript_read_fragment,Fail dEQP-GLES2.functional.shaders.indexing.vector_subscript.vec4_dynamic_subscript_write_component_read_fragment,Fail dEQP-GLES2.functional.shaders.indexing.vector_subscript.vec4_dynamic_subscript_write_direct_read_fragment,Fail dEQP-GLES2.functional.shaders.indexing.vector_subscript.vec4_dynamic_subscript_write_dynamic_subscript_read_fragment,Fail dEQP-GLES2.functional.shaders.indexing.vector_subscript.vec4_dynamic_subscript_write_static_loop_subscript_read_fragment,Fail dEQP-GLES2.functional.shaders.indexing.vector_subscript.vec4_dynamic_subscript_write_static_subscript_read_fragment,Fail Reviewed-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16657>	2022-06-05 21:38:36 +00:00
Pavel Ondračka	a7f3584d1e	r300: Update list of RV515 dEQP failures and add some flakes The fixes are mostly from `23dfae4c81` dEQP-GLES2.functional.fragment_ops.depth_stencil tests show random flakes. The ones in failures are showing unexpected pass, however other random test failures from the same group keep showing so just mark it all as flakes. Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Reviewed-by: Filip Gawin <filip@gawin.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16657>	2022-06-05 21:38:36 +00:00
Pavel Ondračka	bc9b2f3781	r300: don't try to use inline constants instead of constant swizzles It doesn't make sense and was not working anyway. This was spotted by Filip Gawin in https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13978 however the fix there was IMO just papering over the problem. I don't believe that this could manifest as a real issues, because when all of the swizzles were constant the file would be set to RC_FILE_NONE already. So in theory this could lead to an issue only in the close to impossible circumstance that the out of bounds memory read by constant->u.Immediate[swz] would end with the same exact value as another inlineable constant in different channel. However in some circumstances it would lead to following valgrind warnings: Conditional jump or move depends on uninitialised value(s) at 0x5D4E690: ieee_754_to_r300_float (radeon_inline_literals.c:61) by 0x5D4E690: rc_inline_literals (radeon_inline_literals.c:133) by 0x5D3877A: rc_run_compiler_passes (radeon_compiler.c:436) by 0x5D38821: rc_run_compiler (radeon_compiler.c:458) by 0x5D4AF63: r3xx_compile_fragment_program (r3xx_fragprog.c:139) by 0x5D48377: r300_translate_fragment_shader (r300_fs.c:499) by 0x5D491B0: r300_pick_fragment_shader (r300_fs.c:601) by 0x5D2BFEE: r300_create_fs_state (r300_state.c:1072) by 0x57DDC36: st_create_nir_shader (st_program.c:538) by 0x57DF10E: st_create_fp_variant (st_program.c:1056) by 0x57E057C: st_get_fp_variant (st_program.c:1102) by 0x57E0AB1: st_precompile_shader_variant (st_program.c:1287) by 0x57E0AB1: st_finalize_program (st_program.c:1333) by 0x57CB6F3: st_link_nir (st_glsl_to_nir.cpp:958) Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Reviewed-by: Filip Gawin <filip@gawin.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16657>	2022-06-05 21:38:36 +00:00
Pavel Ondračka	2bdffe7eb2	r300: be less agresive with copy propagate in loops When there are multiple MOVs with the same destination in loop in different branches and some readers after the loop, we would now errorneously copy propagate the last MOV, like in the following snippet: BGNLOOP; ... IF temp[3].x___; MOV temp[2], const[1].yxxy; BRK; ENDIF; IF temp[4].x___; MOV temp[2], const[1].xyxy; BRK; ENDIF; ... MOV temp[2], const[1].xyxy; ENDLOOP; ADD_SAT temp[0], temp[2], temp[1]; into: BGNLOOP; ... IF temp[3].x___; MOV temp[2], const[1].yxxy; BRK; ENDIF; IF temp[3].y___; MOV temp[2], const[1].xyxy; BRK; ENDIF; ... ENDLOOP; ADD_SAT temp[0], const[1].xyxy, temp[1]; We need the copy propagate just for simple cleanups after ttn, anything more complex should have been handled already in NIR. So just bail out if any of the readers is after the loop. No changes in shader-db. Fixes few piglit tests when loop unrolling is disabled: spec@glsl-1.10@execution@vs-loop-complex-unroll spec@glsl-1.10@execution@vs-loop-complex-unroll-nested-break spec@glsl-1.10@execution@vs-loop-complex-unroll-with-else-break Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6467 Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Reviewed-by: Filip Gawin <filip@gawin.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16657>	2022-06-05 21:38:36 +00:00
Pavel Ondračka	5a3be2db24	r300: deduplicate common NIR options Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Reviewed-by: Filip Gawin <filip@gawin.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16657>	2022-06-05 21:38:36 +00:00
Mike Blumenkrantz	5c37320eb6	mesa/st: bump param reservation to 28 now d3d12 is hitting it, so here we go Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16872>	2022-06-05 13:20:25 +00:00
Mike Blumenkrantz	f160a3b2d6	virgl: add some ci flakes issue #6614 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16876>	2022-06-05 13:07:14 +00:00
Vinson Lee	3e679219a1	clc: Fix build with llvm-15. opencl_c_h is defined only for llvm < 15. Fixes: `bcc2df4890` ("clc: speed up compilation by not relying on opencl-c.h") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16808>	2022-06-04 22:27:55 -07:00
Mike Blumenkrantz	4b3afed35a	d3d12: skip time-elapsed piglit tests in ci flaky Acked-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16866>	2022-06-04 17:12:58 +00:00
Timothy Arceri	5aec67a1e1	glsl: remove the now unused GLSL IR loop unrolling code This code was slow, buggy and hard to understand. All drivers have now switched to using the NIR unrolling code \o/ Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16366>	2022-06-04 16:11:49 +00:00
Timothy Arceri	26ff49038c	gallium: remove PIPE_SHADER_CAP_MAX_UNROLL_ITERATIONS_HINT CAP This is used for the old, buggy and slow GLSL IR loop unrolling code. All drivers have now switched to the NIR unrolling code so here we remove the CAP. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16366>	2022-06-04 16:11:49 +00:00

... 2 3 4 5 6 ...

155088 Commits All Branches Search

155088 Commits

All Branches