KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Eric Anholt	a703840320	freedreno/ir3: Fix the arg to ir3_get_num_components_for_image_format() GLuint worked fine for storing our enum, but it should be an enum pipe_format since the image-formats merge. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3728>	2020-02-24 18:25:02 +00:00
Eric Anholt	1b7de2d6b8	freedreno/ir3: Reuse glsl_get_sampler_dim_coordinate_components() in tex_info. Now that we have access to the interior switch statement not going through the txs special case for coord_components, we can just use it. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3728>	2020-02-24 18:25:02 +00:00
Kristian H. Kristensen	daa4020948	freedreno/ir3: Lower output precision This lowers mediump FS outputs to fp16 in the ir3 backend. For now this is a modest improvement, which mostly helps us whittle down the full mediump work. Once the GLSL level support lands, then right hand side of the store output intrinsics will be fp16 expressions and we'll cancel out the fp16 -> fp32 -> fp 16 round trip here. We've had different attempts at implementing this: rewriting stores in the GLSL IR, lowering GLSL IR outputs to temporaries and inserting conversions when writing the temporaries to the outputs. In the end, GLSL ends up getting in the way a lot and doing it at the nir level is easier and still possible since we have the output var precisions. This part of the fp16 work is more of a step on the way towards full fp16 support and will add a few extra conversion instructions: total instructions in shared programs: 8151 -> 8163 (0.15%) instructions in affected programs: 1187 -> 1199 (1.01%) helped: 4 HURT: 10 total nops in shared programs: 3146 -> 3152 (0.19%) nops in affected programs: 563 -> 569 (1.07%) helped: 5 HURT: 10 total non-nops in shared programs: 5005 -> 5011 (0.12%) non-nops in affected programs: 92 -> 98 (6.52%) helped: 0 HURT: 3 total dwords in shared programs: 12832 -> 12800 (-0.25%) dwords in affected programs: 96 -> 64 (-33.33%) helped: 1 HURT: 0 total last-baryf in shared programs: 118 -> 115 (-2.54%) last-baryf in affected programs: 21 -> 18 (-14.29%) helped: 1 HURT: 0 total full in shared programs: 424 -> 417 (-1.65%) full in affected programs: 15 -> 8 (-46.67%) helped: 7 HURT: 0 Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3822> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3822>	2020-02-24 17:24:13 +00:00
Hyunjun Ko	c822460f85	freedreno/ir3: handle half registers for arrays during register allocation. So far we only handle full regs of arrays during pre-allocation. This patch is to handle half regs of arrays and also consider the size of half regs when finding out conflicts. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3822>	2020-02-24 17:24:13 +00:00
Hyunjun Ko	6ee375f68d	freedreno/ir3: Add new ir3 pass to fold out fp16 conversions This pass tries to fold f2f16 conversion into alu instructions. This will be useful to help reduce the number of instructions once mesa starts supporting precision lowering. For example: add.f r0.w, r0.w, c0.x cov.f32f16 hr2.x, r0.w to add.f hr2.x, r0.w, c0.x Additionally this pass also tries to fold f2f16 conversion into load_input instruction: bary.f r0.x, 3, r0.w cov.f32f16 hr0.x, r0.x to bary.f hr1.x, 3, r0.x v2: Edit to not fold OPC_MAX_F and OPC_MIN_F, since that's not valid. v3: Add OPC_ABSNEG_F to the blacklist as well. v4: Don't remove dead cov instructions, DCE will do that later; don't iterate through sources when a cov only has one; remove special handling of IR3_REG_ARRAY and IR3_REG_RELATIV. v5: Handle folding into u32.u32 movs of floats correctly, don't bail out on IR3_REG_RELATIV or IR3_REG_ARRAY movs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3822>	2020-02-24 17:24:13 +00:00
Eduardo Lima Mitev	99f2b6144b	turnip/pipeline: Don't assume tu_shader is a valid object Fixes a crash in tu6_emit_fs_config() when 'shader' argument is assumed to be non-null, which is possible. Fixes dEQP test: dEQP-VK.api.descriptor_set.descriptor_set_layout_lifetime.graphics Reviewed-by: Jonathan Marek <jonathan@marek.ca> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3756> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3756>	2020-02-24 12:20:20 +00:00
Jonathan Marek	d795eb207f	turnip: add option to force use of hw binning For running deqp tests which have small render sizes and don't otherwise get coverage of hw binning / multiple tiles. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3851> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3851>	2020-02-19 22:24:44 +00:00
Rob Clark	06dc280a57	freedreno/registers: cleanup CP_SET_MARKER 1) Name RM6_COMPUTE, and rename RM6_ENDVIS (from RM6_BLIT) to better reflect what it actually does 2) Cleanup open-coded mode enum values 3) Removed unused 0x10 Signed-off-by: Rob Clark <robdclark@chromium.org> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3833> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3833>	2020-02-18 20:52:42 +00:00
Rob Clark	7b4d6bb1ec	freedreno: quiet INFO_MSG Probably not useful unless LIBGL_DEBUG is set to something. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3833>	2020-02-18 20:52:42 +00:00
Rob Clark	838ed2885d	freedreno/a6xx: few register updates Nothing used by mesa, but crashdec tool uses a few of these. And since the practice is these days to sync mesa->envytools, adding these on the mesa side first. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3833>	2020-02-18 20:52:42 +00:00
Rob Clark	4fc31e7d33	freedreno/registers: teach gen_header.py about a3xx_regid This is a builtin type (treated as uint, but with special type-aware decoding) in envytools/cffdump. Lets teach gen_header.py about it and drop the enum hack in the xml so I don't have to keep deleting the enum when I sync the xml back to the freedreno envytools tree. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3833>	2020-02-18 20:52:42 +00:00
Jonathan Marek	946eacbafb	freedreno/a6xx: document some unknown bits Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Rob Clark <robdclark@gmail.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3814> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3814>	2020-02-14 08:22:33 -05:00
Jonathan Marek	75fbe089a6	freedreno: name sysmem color/depth flush events Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Rob Clark <robdclark@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3814>	2020-02-14 08:22:33 -05:00
Connor Abbott	da64c35ff9	tu: Force sysmem with mipmapped non-aligned linear stores Fixes hangs with dEQP-VK.api.image_clearing.core.clear_color_image.1d.linear.single_layer.r8g8b8a8_unorm and many others on a640, and presumably silent corruption with a630. Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713>	2020-02-12 21:37:05 -05:00
Connor Abbott	f026982265	tu: Support input attachments with sysmem Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713>	2020-02-12 21:37:05 -05:00
Connor Abbott	c1b3f9e832	tu: Support resolve ops with sysmem rendering Similar to vkCmdClearAttachments(), we use CP_COND_REG_EXEC to conditionally execute both the gmem and sysmem paths, except for after the last subpass where it's known whether we're using sysmem rendering or not. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713>	2020-02-12 21:37:01 -05:00
Connor Abbott	8647a24a8d	tu: Handle vkCmdClearAttachments() with sysmem Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713>	2020-02-12 21:36:41 -05:00
Connor Abbott	07e07daeae	tu: Add helper for CP_COND_REG_EXEC Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713>	2020-02-12 21:36:41 -05:00
Connor Abbott	6a0c4008bf	tu: Sysmem rendering This has only lightly been tested. It passes dEQP-VK.api.smoke.triangle, so at least we're able to show a triangle. For now, it's just enabled under a debug flag. In the future we'll probably want some heuristics like what freedreno has and another debug flag to disable it except when it's forced. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713>	2020-02-12 21:36:36 -05:00
Connor Abbott	041783d49d	tu: Disable linear depth attachments Also, disable importing depth/stencil textures. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713>	2020-02-12 21:31:57 -05:00
Connor Abbott	ab3db20cb5	tu: Support multisample image clears We may need shader workarounds for some formats, but for now this seems to work at least as well as the gmem path for clearing multisample attachments. And soon we'll start calling this even on the gmem path, since we leave the final decision of whether to use sysmem or not up till the end, so we can't have it assert or otherwise working tests would assert. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713>	2020-02-12 21:31:57 -05:00
Connor Abbott	a5fb515301	tu/blit: Support blits in secondary cmdstreams For sysmem rendering we'll have to emit a delayed clear IB to implement LOAD_OP_*, similar to the existing tile_load_ib. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713>	2020-02-12 21:31:44 -05:00
Connor Abbott	a94be3da84	tu: Properly set UBWC flags in RB_RENDER_CNTL Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713>	2020-02-12 21:23:50 -05:00
Connor Abbott	49817cb3ea	tu: Don't emit initial render target state in tile_load_ib Emitting it directly in CmdBeginRenderPass should be around the same, except that now we can easily share it with the sysmem path. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713>	2020-02-12 21:23:50 -05:00
Chad Versace	787b56ac0e	turnip: Add a618 support I merely ported a freedreno patch to turnip which updates some magic regsiter values. commit `ff6e148a3d` Author: Rob Clark <robdclark@chromium.org> CommitDate: Tue Oct 29 09:19:34 2019 -0700 Subject: freedreno/a6xx: add a618 support That's all that Rob did for gallium for a618, so I assume that's we need for turnip also. Tested manually with: dEQP-VK.api.image_clearing.core.clear_color_image.2d.linear.single_layer.* pass 300/555 fail 0/555 skip 255/555 Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3743> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3743>	2020-02-12 23:27:43 +00:00
Chad Versace	ef5da26089	turnip: Add magic register values to tu_physical_device The value of some magic regsiters differ across chipsets. fd6_context manages the differences by initializing them at runtime. Let's do the same. Add to tu_physical_device a subset of those found in fd6_context: RB_UNKNOWN_8E04_blit RB_CCU_CNTL_gmem PC_UNKNOWN_9805 SP_UNKNOWN_A0F8 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3743>	2020-02-12 23:27:43 +00:00
Jonathan Marek	18786cc7d5	freedreno/a6xx: use single format enum Loses some information about which formats can be used in which cases, but we encode that information in the format table anyway. Important notes: * RB6_R10G10B10A2_UNORM becomes FMT6_R10G10B10A2_UNORM_DEST * TFMT6_8_8_8_UNORM becomes FMT6_8_8_8_X8_UNORM (not FMT6_8_8_8_UNORM) Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3798> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3798>	2020-02-12 21:59:59 +00:00
Arcady Goldmints-Orlov	e9f83185a2	Rename nir_lower_constant_initializers to nir_lower_variable_initalizers This is naming is more clear as nir_variables can be initializes not just with a nir_constant but with a pointer to another nir_variable. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3047> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3047>	2020-02-12 15:41:49 +00:00
Hyunjun Ko	7bddaa6136	freedreno/ir3: Fold const only when the type is float Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3737> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3737>	2020-02-07 09:53:48 -08:00
Hyunjun Ko	260bd32b58	freedreno/ir3: put the conversion back for half const to the right place. The previous commit leads to match immed values unexpectedly. This makes constlen for each shader including bvert wrong. Also fixes atan2 for mediump deqp tests. Fixes: `cbd1f47433` ("freedreno/ir3: convert back to 32-bit values for half constant registers.") v2: Move conversion up above fabs/fneg modifier handling as well. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3737>	2020-02-07 09:53:42 -08:00
Hyunjun Ko	d70192e697	freedreno/ir3: Add cat4 mediump opcodes v2: Reworked to assign half-opcodes in ir3_ra.c (krh). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3737>	2020-02-07 09:51:25 -08:00
Rob Clark	3eca6d9ce1	freedreno/ir3: fold const conversion into consumer A sequence like: (nop3)cov.f32f16 hr0.x, c0.x mul.f hr4.y, hr1.z, hr0.x can be turned into: mul.f hr4.y, hr1.z, hc0.x Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3737>	2020-02-07 09:51:25 -08:00
Hyunjun Ko	5e2012d5c7	freedreno/ir3: fix printing half constant registers. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3737>	2020-02-07 09:51:25 -08:00
Kristian H. Kristensen	d55dfef782	freedreno/ir3: Set IR3_REG_HALF flag on src as well in immediate MOV This lets is_same_type_reg() recognize that the dst and src of the immediate MOV are the same and unblocks fp16 constant propagation. Signed-off-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3737>	2020-02-07 09:51:25 -08:00
Martin Fuzzey	d8bae10bfe	freedreno: android: fix build of perfcounters. Some dependencies were missing on android causing a build failure. Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3736> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3736>	2020-02-07 16:34:49 +00:00
Martin Fuzzey	fad9924315	freedreno: android: add a6xx-pack.xml.h generation to android build The generation of a6xx-pack.xml.h was missing in the android build scripts leading to a build failure. Signed-off-by: Martin Fuzzey <martin.fuzzey@flowbird.group> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3736>	2020-02-07 16:34:49 +00:00
Martin Fuzzey	cad400a59e	freedreno: android: fix build failure on android due to python version The freedreno gen_header.py script now only works under python3. It contains a "print()" call which prints a blank line under python3 but prints "()" under python2.7. However the Android build currently uses python2. This leads to incorrect code generation and a later build error. .../STATIC_LIBRARIES/libfreedreno_registers_intermediates/registers/adreno_common.xml.h:163:2: error: expected identifier or '(' () Fix this by adding MESA_PYTHON3 and using it for the freedreno scripts. Signed-off-by: Martin Fuzzey <martin.fuzzey@flowbird.group> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3736>	2020-02-07 16:34:49 +00:00
Eric Anholt	8d07d66180	glsl,nir: Switch the enum representing shader image formats to PIPE_FORMAT. This means you can directly use format utils on it without having to have your own GL enum to number-of-components switch statement (or whatever) in your vulkan backend. Thanks to imirkin for fixing up the nouveau driver (and a couple of core details). This fixes the computed qualifiers for EXT_shader_image_load_store's non-integer sizeNxM qualifiers, which we don't have tests for. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> (v3d) Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3355> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3355>	2020-02-05 10:31:14 -08:00
Connor Abbott	8455648cca	tu: Move vsc_data and vsc_data2 allocation into the device In addition to preparing us for dynamically resizing them, which has to be controlled by the device, this greatly reduces the memory usage when allocating large numbers of command buffers, making dEQP-VK.api.object_management.max_concurrent.command_buffer_primary go from crash -> pass. Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3621> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3621>	2020-02-05 15:27:28 +01:00
Connor Abbott	84bd4da468	freedreno: Fix CP_COND_EXEC Noticed while looking at a trace of the Vulkan blob. Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Rob Clark <robdclark@gmail.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3600> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3600>	2020-02-05 13:14:22 +00:00
Connor Abbott	ed5d1c1c47	freedreno: Add CP_REG_WRITE documentation Document the first DWORD, which at least for the Vulkan blob on a640 isn't always 2. Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Rob Clark <robdclark@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3600>	2020-02-05 13:14:22 +00:00
Connor Abbott	65197a3ac1	freedreno: Fix CP_COND_REG_EXEC bit positions Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Rob Clark <robdclark@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3600>	2020-02-05 13:14:22 +00:00
Eric Anholt	22d2cbe685	freedreno: Allow UBWC on textures with multiple mipmap levels. This is a backport of Jonathan Marek's UBWC work on turnip to GL. Performance highlights from our trace set (320 frames sampled) traces/glmark2/texture-texture-filter=mipmap.rdc: +9.1% +/- 2.2% traces/android/trex.rdc: +8.7% +/- 0.4% traces/glmark2/desktop-effect=shadow:windows=4.rdc: +4.2% +/- 2.5% Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3059> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3059>	2020-02-04 23:18:00 +00:00
Eric Anholt	17312b4a10	freedreno: Rename the UBWC layer size field and store it as bytes. This makes the field description match its usage in the code, matches tu's usage of the field, and avoids storing values in surprising units. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3059>	2020-02-04 23:18:00 +00:00
Eric Anholt	b6b4118bb0	freedreno: Include the layer size in layout debug. It's been many of my bugs so far. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3059>	2020-02-04 23:18:00 +00:00
Eric Anholt	20357dfde8	freedreno: Move the layout debug under FD_MESA_DEBUG=layout. I keep wanting to turn this on while debugging layout stuff, and I suspect krh and robclark could use it too. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3059>	2020-02-04 23:18:00 +00:00
Eric Engestrom	cae6093266	freedreno/perfcntrs: fix fd leak CoverityID: 1110568, 1458071 Fixes: `5a13507164` ("freedreno/perfcntrs: add fdperf") Signed-off-by: Eric Engestrom <eric@engestrom.ch> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3671> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3671>	2020-02-04 19:26:40 +00:00
Kristian H. Kristensen	df6a2a7197	turnip: Be explicit about converting vk compare func to a6xx Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3686> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3686>	2020-02-04 06:03:52 +00:00
Kristian H. Kristensen	67dd51606c	freedreno/fdperf: Cast away some ignored return values This is developer tool, it can crash and burn if it fails to allocate. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3686>	2020-02-04 06:03:52 +00:00
Rob Clark	982d61e2cd	freedreno/ir3: fix a dirty lie Lies, damn lies, and leftover hacks! We no longer hard-code these two, so fix the disasm to print the correct values. Signed-off-by: Rob Clark <robdclark@chromium.org> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3569> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3569>	2020-02-01 02:40:22 +00:00

1 2 3 4 5 ...

780 Commits