KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Alyssa Rosenzweig	6b6ace5199	pan/bi: Add option to test spilling BIFROST_MESA_DEBUG=spill now restricts the register file to 1/4 its usual size, useful for testing register spilling (e.g. running CTS) as well as debugging spilling on small shaders. Note blend shaders are exempt, as we don't allow blend shaders to spill. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16314>	2022-05-04 12:48:27 +00:00
Alyssa Rosenzweig	80f8e9da16	pan/bi: Use a dynarray for predecessors This is deterministic, unlike a set. Note we need the extra dereferencing to keep the macro safe, simple, and standards compliant: 1. Nesting two for-loops would cause break/continue to fail. 2. Declaring variables outside the loop would pollute the namespace. 3. Declaring an anonymous struct is not conformant and doesn't compile in clang. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16279>	2022-05-03 17:56:16 +00:00
Alyssa Rosenzweig	d496fe153a	pan/bi: Count blocks For u_worklist. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16279>	2022-05-03 17:56:16 +00:00
Alyssa Rosenzweig	eb0001bf2b	pan/bi: Rename bi_block->name to bi_block->index This is consistent with nir_block and (IMO) less confusing. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16279>	2022-05-03 17:56:16 +00:00
Alyssa Rosenzweig	54412afadc	pan/bi: Handle texture offset + index Fixes dEQP-VK.glsl.opaque_type_indexing.sampler.uniform.vertex.sampler1d Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16283>	2022-05-02 20:28:48 +00:00
Emma Anholt	536c8ee96d	nir/lower_tex: Make the adding a 0 LOD to nir_op_tex in the VS optional. This controls the whole lowering of "make tex ops with implicit derivatives on non-implicit-derivative stages be tex ops with an explicit lod of 0 instead", but it's really hard to describe that in a git commit summary. All existing callers get it added except: - nir_to_tgsi which didn't want it. - nouveau, which didn't want it (fixes regressions in shadowcube and shadow2darray with NIR, since the shading languages don't expose txl of those sampler types and thus it's not supported in HW) - optional lowering passes in mesa/st (lower_rect, YUV lowering, etc) Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16156>	2022-04-28 21:26:08 +00:00
Icecream95	76cea8e27b	panfrost: Fix pack_32_2x16 implementation Fixes: `6f0eff548c` ("pan/bi: Implement packing ops between 32-bit vec1 and 16-bit vec2") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16181>	2022-04-27 15:30:09 +00:00
Alyssa Rosenzweig	2ca8b014d1	pan/bi: Implement pack_uvec[24]_to_uint This maps nicely to Mali's weirdo MKVEC, so implement it rather than scalarizing. The scalarization wants an extract implemented which we don't have. Fixes dEQP-VK.glsl.builtin.function.pack_unpack.* Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16120>	2022-04-26 00:18:19 +00:00
Alyssa Rosenzweig	c9b33fe7dc	pan/bi: Implement fquantize2f16 Implement as f2f32(f2f16(x)) with the conversions in flush-to-zero mode. Accessing flush-to-zero mode on Bifrost is nontrivial: it is specified per-clause, rather than per-instruction. I've opted to pipe support for ftz clauses through the scheduler. This solution has two nice properties: * It uses the native hardware for flushing subnormals, avoiding extra lowering. * It's "smart" about scheduling around FTZ requirements, meaning we get good code generated even for a shader that e.g. quantizes a vector. With an unrelated scheduler fix, the V2F32_TO_V2F16/+F16_TO_F32 operation fits in a single tuple, minimizing the overhead of the special FTZ clause. We'll have to do something a bit different for Valhall (FLUSH.f32), but we'll worry about that when we actually have PanVK brought up on Valhall. Fixes dEQP-VK.spirv_assembly.instruction.compute.opquantize. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16123>	2022-04-25 16:29:31 +00:00
Alyssa Rosenzweig	1fb4427a7a	pan/bi: Imply round mode most of the time Much less noisy, and provides a path to further improvements. There is a slight behaviour change: int-to-float conversions now use RTE instead of RTZ. For 32-bit opcodes, this affects conversions of integers with magnitude greater than 2^23 by at most 1 ulp. As this behaviour is unspecified in GLSL, this change is believed to be acceptable. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15187>	2022-04-07 18:03:57 +00:00
Alyssa Rosenzweig	6e69c3369c	pan/bi: Don't lower vertex_id for malloc IDVS Based on hardware behaviour, it appears vertex_id is zero-based with the legacy geometry flow but not with the new malloc IDVS flow. Since the geometry flow is per-shader (not per-machine), there's not a good way to communicate this to NIR. Rather than trying to shoehorn this obscure detail into NIR, just do the lowering ourselves instead of in NIR. It's not much more code anyway. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	ccdec68aee	pan/bi: Report whether workgroups can be merged This flag gates a Valhall hardware optimization for compute shaders. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	ba5b63f642	pan/bi: Generate LD_BUFFER on Valhall Replace LOAD.ubo with LD_BUFFER since the .ubo segment doesn't exist on Valhall. We could do this with a lowering pass instead but this is probably fine. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	f487c09045	pan/bi: Make psiz variants Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	9497a6a3c9	pan/bi: Lower gl_PointSize to FP16 on Valhall It is unclear if FP32 point sizes are supported on Valhall -- I can't get the DDK to use them at any rate. Always lower them to FP16 and store them as FP16 for hardware use. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	8e6f97b5fc	pan/bi: Force psiz to mediump To match driver behaviour. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	90d3f55aff	pan/bi: Set table for Valhall LD_ATTR Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	f79e33f82f	pan/bi: Emit Valhall-style varying stores Varying stores were changed in Valhall. Rather than using attribute descriptors like on Bifrost and Midgard, on Valhall we store to memory directly with hardware-allocated buffers. This requires a new implementation of store_output, with special provisions for writing gl_PointSize from a position shader. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	14e7796d4c	pan/bi: Emit Valhall-style varying loads Memory-allocated IDVS requires special varying load instructions that take an offset into the hardware-allocated varying buffer, as opposed to a varying slot. Emit these instructions. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	a1d5bf0a7a	pan/bi: Track whether the malloc IDVS flow is used This affects what instructions the fragment shader uses. Will be used for the legacy geometry flow in blit shaders. Whether that is a good idea remains to be seen, admittedly. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	ae79f6765a	pan/bi: Emit Valhall texture instructions Valhall uses an updated version of the TEXC path. To avoid disrupting the existing Bifrost code, add a new Valhall-specific texture path that generates the new-style texture instructions. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	9091b6261b	pan/bi: Specialize BLEND emit for Valhall Fewer arguments compared to Bifrost; the corresponding information is encoded in a Valhall-specific blend shader prologue instead. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	a8afe6f7fb	pan/bi: Waits before tilebuffer access on Valhall On Bifrost, this is handled in the scheduler. Until we grow a Valhall scheduler, add a NOP with the appropriate flow control. This is correct but carries a small performance cost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	5b056971a3	pan/bi: Preload r60/r61 for MSAA + blend shader This is the sort of leakiness I hate about blend shaders. MSAA + blend shader is somewhat obscure but gets hit in the CTS. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	913a7ed41a	pan/bi: Use ID accessors for LEA_ATTR This is more portable. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	5e76467d5d	pan/bi: Use nir_tex_instr_has_implicit_derivative Rather than tracking it ourselves. Slightly shorter. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	fc81415f47	pan/bi: Call Valhall backend passes on v9 These are required to lower the IR into something suitable for Valhall packing. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	18bf478f1e	pan/va: Add shader-db support Reports the common subset from Bifrost, as well as Mali offline compiler style normalized cycle counts. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	13d7ca1300	pan/va: Optimize add with imm to ADD_IMM Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	f45654af59	pan/va: Add packing routines Mostly manual since Valhall is regular. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	97a13d6424	pan/bi: Augment ST_TILE with register format To model its Valhall incarnation. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	c7f6b973b2	pan/bi: Check return addresses in blend shaders Required on Valhall, where jumping to 0x0 doesn't automatically terminate the program. Luckily the check is free there too. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	7983a0d0dc	pan/bi: Rename PATOM_C to ATOM This is basically what's native on Valhall. Use the Valhall naming for the pseudo-instruction on Bifrost for consistency. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	b70a7c97bb	pan/bi: Gate late DCE/CSE on "optimize" Otherwise we can end up with unlowered ATOM.i32 on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	0cbe4dd4c4	pan/bi: Use bi_dontcare for ZS_EMIT This is more portable and avoids special casing. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:06 +00:00
Alyssa Rosenzweig	1b934d5962	pan/bi: Emit arch-specific code for bi_dontcare We use bi_dontcare() to specify any encoding where we don't care about the value, with a preference for power-efficient encodings. On Bifrost, a (possibly nonexistent) FAU read is the best encoding. On Valhall, that encoding doesn't exist so just use a zero. That should be good enough in practice. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:06 +00:00
Alyssa Rosenzweig	666b714a37	pan/bi: Don't analyze helper reqs in !frag shaders Waste of time, and possibly invalid too. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:06 +00:00
Alyssa Rosenzweig	32ca920023	pan/bi: Use vertex/instance ID helpers Enables portability to Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	53f1e57ee7	pan/bi: Handle non-2D arrays Handle arrays generically by using the last component of the coordinate source as the array index. That works for both 2D arrays and cube arrays, fixing cube arrays. Cube arrays were already handled correctly in core Panfrost code. This code path is not tested in dEQP-GLES31 without exposing OES_cube_map_array, which depends on OES_geometry_shader, which we don't have. Yet we do expose PIPE_CAP_CUBE_ARRAY, so ARB_cube_map_array is exposed. Disabling PIPE_CAP_CUBE_ARRAY would be an easy band-aid fix, but it's easy enough to handle correctly. dEQP-GLES31 passes with a hack enabling OES_cube_map_array [without geometry shaders]. Also fixes 1D arrays on Bifrost for the same reasons. Fixes: `70d6c5675d` ("pan/bi: Emit TEXC with builder") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15254>	2022-03-12 17:34:01 +00:00
Jason Ekstrand	65db6b0e7c	bifrost: Constant fold after lower_explicit_io nir_lower_explicit_io generates mul+add chains even for constants. One round of constant folding should get rid of these. This fixes all of the dEQP-VK.glsl.conversions.* tests on panvk. GoGoGoGo'd-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15349>	2022-03-12 03:51:54 +00:00
Jason Ekstrand	1aa120b10f	bifrost: Handle nir_op_frexp* and nir_op_ldexp Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15352>	2022-03-12 02:27:02 +00:00
Jason Ekstrand	d2a09f3dd3	bifrost: Implement fine and coarse derivatives We leave the undecorated ops as fine so we don't disturb panfrost. For coarse derivatives, we use a lane ID of 0 for the first lane and 1 or 2 for the second depending on axis. This ensures that coarse derivatives are quad-uniform. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15352>	2022-03-12 02:27:02 +00:00
Jason Ekstrand	83010c57a6	bifrost: Simplify derivatives a bit Instead of two magic ternary operations, define a new `axis` temporary which is 1 for X and 2 for Y. Then define everything else in terms of this variable. In particular, the mask operation we do on LANE_ID is a mask so it makes more sense to use ~axis than 1/2 but in the other order. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15352>	2022-03-12 02:27:02 +00:00
Icecream95	9d4441c71a	panfrost: Fix ubo_mask calculation BITSET_MASK returns ~0 when given an input of zero, when we need it to return 0 instead. Fixes shaders with only sysvals but no UBOs when push constants are disabled. This breaks when 31 or 32 UBOs are used, but PAN_MAX_CONST_BUFFERS is currently set to 16. Fixes: `c246af0dd8` ("panfrost: Only upload UBOs when needed") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:55:00 -05:00
Icecream95	24101d944b	pan/bi: Add documentation for bifrost_nir_lower_store_component Taken from the commit that introduced the function, `95458c4033` ("pan/bi: Lower stores with component != 0"). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:55:00 -05:00
Icecream95	198cb4a77a	panfrost: Disable point size upper limit clamping The hardware already clamps this, there is no need to do it in the shader. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:50:09 -05:00
Alyssa Rosenzweig	e7cfe18099	pan/bi: Run CSE after lowering FAU Lowering FAU can add moves from uniforms. If a uniform is moved out to a register multiple times in a basic block, these moves can be CSE'd, saving instructions at the cost of register pressure. 854 shaders in my shader-db are helped on cycle count (average 2.94% reduction in cycles). Only 9 shaders have hurt thread count, and there is no change in spills or fills. Overall, this seems to be a win. Prevents instruction count regressions from the next commit. total instructions in shared programs: 2454423 -> 2444690 (-0.40%) instructions in affected programs: 386274 -> 376541 (-2.52%) helped: 2105 HURT: 0 helped stats (abs) min: 1.0 max: 116.0 x̄: 4.62 x̃: 2 helped stats (rel) min: 0.04% max: 27.27% x̄: 3.64% x̃: 1.92% 95% mean confidence interval for instructions value: -4.91 -4.33 95% mean confidence interval for instructions %-change: -3.83% -3.45% Instructions are helped. total tuples in shared programs: 1963534 -> 1957106 (-0.33%) tuples in affected programs: 233562 -> 227134 (-2.75%) helped: 1491 HURT: 117 helped stats (abs) min: 1.0 max: 63.0 x̄: 4.44 x̃: 2 helped stats (rel) min: 0.04% max: 24.53% x̄: 4.39% x̃: 2.59% HURT stats (abs) min: 1.0 max: 5.0 x̄: 1.61 x̃: 1 HURT stats (rel) min: 0.18% max: 8.33% x̄: 1.44% x̃: 1.05% 95% mean confidence interval for tuples value: -4.28 -3.71 95% mean confidence interval for tuples %-change: -4.20% -3.73% Tuples are helped. total clauses in shared programs: 387848 -> 387079 (-0.20%) clauses in affected programs: 13718 -> 12949 (-5.61%) helped: 583 HURT: 60 helped stats (abs) min: 1.0 max: 16.0 x̄: 1.42 x̃: 1 helped stats (rel) min: 1.11% max: 25.00% x̄: 8.28% x̃: 6.67% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 0.86% max: 20.00% x̄: 4.58% x̃: 4.00% 95% mean confidence interval for clauses value: -1.29 -1.10 95% mean confidence interval for clauses %-change: -7.57% -6.58% Clauses are helped. 
total cycles in shared programs: 201866.21 -> 201682.92 (-0.09%) cycles in affected programs: 6241.79 -> 6058.50 (-2.94%) helped: 952 HURT: 98 helped stats (abs) min: 0.04166399999999726 max: 2.625 x̄: 0.20 x̃: 0 helped stats (rel) min: 0.12% max: 26.00% x̄: 4.05% x̃: 2.38% HURT stats (abs) min: 0.041665999999999315 max: 0.16666700000000034 x̄: 0.07 x̃: 0 HURT stats (rel) min: 0.18% max: 8.70% x̄: 1.60% x̃: 1.43% 95% mean confidence interval for cycles value: -0.19 -0.16 95% mean confidence interval for cycles %-change: -3.80% -3.24% Cycles are helped. total arith in shared programs: 74924.00 -> 74660.12 (-0.35%) arith in affected programs: 9303.67 -> 9039.79 (-2.84%) helped: 1513 HURT: 118 helped stats (abs) min: 0.04166399999999726 max: 2.625 x̄: 0.18 x̃: 0 helped stats (rel) min: 0.07% max: 33.33% x̄: 4.68% x̃: 2.67% HURT stats (abs) min: 0.041665999999999315 max: 0.16666800000000137 x̄: 0.07 x̃: 0 HURT stats (rel) min: 0.18% max: 8.70% x̄: 1.55% x̃: 1.37% 95% mean confidence interval for arith value: -0.17 -0.15 95% mean confidence interval for arith %-change: -4.48% -3.98% Arith are helped. total quadwords in shared programs: 1757254 -> 1751978 (-0.30%) quadwords in affected programs: 197399 -> 192123 (-2.67%) helped: 1464 HURT: 110 helped stats (abs) min: 1.0 max: 51.0 x̄: 3.73 x̃: 2 helped stats (rel) min: 0.04% max: 21.95% x̄: 4.16% x̃: 2.52% HURT stats (abs) min: 1.0 max: 7.0 x̄: 1.71 x̃: 1 HURT stats (rel) min: 0.21% max: 13.04% x̄: 1.65% x̃: 0.93% 95% mean confidence interval for quadwords value: -3.58 -3.13 95% mean confidence interval for quadwords %-change: -3.97% -3.53% Quadwords are helped. 
total threads in shared programs: 52899 -> 52890 (-0.02%) threads in affected programs: 18 -> 9 (-50.00%) helped: 0 HURT: 9 HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 50.00% max: 50.00% x̄: 50.00% x̃: 50.00% 95% mean confidence interval for threads value: -1.00 -1.00 95% mean confidence interval for threads %-change: -50.00% -50.00% Threads are HURT. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15239>	2022-03-04 15:22:04 +00:00
Alyssa Rosenzweig	b48236ea3e	pan/bi: Add arithmetic flag to RSHIFT ops Models ops like ARSHIFT_OR.i32 on Valhall without adding piles of new instructions to the IR. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:44 +00:00
Alyssa Rosenzweig	0b0e74ae82	pan/bi: Extend LD_TILE with a register format Required for Valhall. NIR has the information anyway, pass it along. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:44 +00:00
Alyssa Rosenzweig	5796777889	pan/bi: Model offset for LOAD/STORE Needed to model the immediate offset on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
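
The ubo_mask fix in 9d4441c71a hinges on a boundary case: a mask helper that wraps the bit count modulo the word size yields all ones for a count of zero, where an empty mask is wanted. The following is a minimal sketch of that pitfall, assuming 32-bit words. The helpers `wrapping_mask` and `count_mask` are hypothetical stand-ins for illustration, not Mesa's actual macros or the code from the commit.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical stand-in for a word-wrapping mask helper: a count that is a
 * multiple of 32 (including 0) selects every bit of the word. */
static uint32_t wrapping_mask(unsigned count)
{
    unsigned bits = count % 32;
    return bits == 0 ? ~UINT32_C(0) : (UINT32_C(1) << bits) - 1;
}

/* Hypothetical replacement: the low `count` bits, so a count of 0 gives an
 * empty mask. Only valid for count < 32, because shifting a 32-bit value by
 * 32 is undefined behaviour in C. */
static uint32_t count_mask(unsigned count)
{
    assert(count < 32);
    return (UINT32_C(1) << count) - 1;
}
```

With zero UBOs, `wrapping_mask(0)` marks all 32 slots as used while `count_mask(0)` marks none. The `count < 32` limit belongs to this sketch; the commit message itself states that the real fix breaks at 31 or 32 UBOs, which is tolerable while PAN_MAX_CONST_BUFFERS is 16.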


558 Commits
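
The derivative commits d2a09f3dd3 and 83010c57a6 describe lane selection in terms of an `axis` value (1 for X, 2 for Y), a `~axis` mask on the lane ID, and fixed lanes for the coarse case. A rough CPU-side sketch of that selection over one 2x2 quad follows. The quad layout (lane = x + 2*y), the use of `lane | axis` for the second fine lane, and both function names are assumptions made for illustration; the actual compiler emits lane-swizzle instructions rather than array reads.

```c
#include <assert.h>

/* axis is 1 for X and 2 for Y, as in the commit message.
 * Assumed quad layout: lane = x + 2*y, so bit 0 is X and bit 1 is Y. */
enum { AXIS_X = 1, AXIS_Y = 2 };

/* Fine derivative: difference across the axis within the lane's own row or
 * column of the quad. */
static float fine_derivative(const float quad[4], unsigned lane, unsigned axis)
{
    assert(lane < 4 && (axis == AXIS_X || axis == AXIS_Y));
    unsigned first = lane & ~axis;  /* clear the axis bit */
    unsigned second = lane | axis;  /* set the axis bit (assumption) */
    return quad[second] - quad[first];
}

/* Coarse derivative: always lane 0 and lane `axis` (1 or 2), so every lane
 * in the quad computes the same value. */
static float coarse_derivative(const float quad[4], unsigned axis)
{
    assert(axis == AXIS_X || axis == AXIS_Y);
    return quad[axis] - quad[0];
}
```

Because `coarse_derivative` ignores the calling lane entirely, its result is quad-uniform by construction, which is the property the commit message calls out.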