KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Rob Clark	cc82521de4	freedreno/ir3: round-robin RA In the second (scalar) pass use the information about # of registers used in the first pass as the target max, and round-robin within that range. This generally gives the post-RA sched pass more opportunities to re-order instructions to remove nops. Also, we can be a bit clever when assigning dest registers for SFU instructions, by picking the register used for its src (if available and already assigned). This avoids some (ss) syncs caused by write after read hazards. (I.e. the SFU instruction will read its own src before writing dest.) Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4071>	2020-03-10 16:01:39 +00:00
Rob Clark	b2b349096f	freedreno/ir3: track register usage in first RA pass We'll use the feedback from the first pass to select a target register usage in the second pass. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4071>	2020-03-10 16:01:39 +00:00
Rob Clark	9ae93be8fb	freedreno/ir3: fix has_latency_to_hide Also count tex-prefetch instructions. And only let the no-latency rule kick in for frag shaders. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4071>	2020-03-10 16:01:39 +00:00
Rob Clark	b6eb11295a	freedreno/ir3: split out has_latency_to_hide() Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4071>	2020-03-10 16:01:39 +00:00
Rob Clark	752b9985be	freedreno/ir3: add simplified stall estimation Doesn't take into account stalls that result from a register written in a different block, etc. But this should be more useful than just using number of (ss)'s by trying to estimate how costly a given sync is. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4071>	2020-03-10 16:01:39 +00:00
Rob Clark	64ae2ef8bb	freedreno/ir3: remove extra nops inserted in scheduler They were inserting a nop between back to back SFU instructions. But that doesn't actually appear to be required. And they get stripped out later anyways before legalize. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4071>	2020-03-10 16:01:39 +00:00
Rob Clark	ad2ff7a278	freedreno/computerator: add hrsq/hlog2/hexp2 Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4071>	2020-03-10 16:01:39 +00:00
Rob Clark	4a8e4c18d2	freedreno/ir3: also lower lowp frag outputs Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4071>	2020-03-10 16:01:39 +00:00
Kristian H. Kristensen	b75a166e68	freedreno/ir3: Don't fold conversions into sign Not supported. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3929>	2020-03-09 16:31:07 +00:00
Jonathan Marek	55dac91adc	turnip: fix tile->slot calculation Fixes HW binning cases when the horizontal number of tiles isn't divisible by the horizontal number of pipes (only happens with more than 32 tiles). Signed-off-by: Jonathan Marek <jonathan@marek.ca> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3142> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3142>	2020-03-05 12:53:29 -05:00
Jonathan Marek	036230341f	turnip: improve binning pipe layout config The old code looks the same as GL driver, but we get things like pipe_count = {32, 1}, which seems bad. This uses similar logic as for tiles which produces a balanced pipe_count width/height. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3142>	2020-03-05 12:48:12 -05:00
Hyunjun Ko	3199b8b9e7	turnip: support indirect draw Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3976> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3976>	2020-03-04 01:20:32 +00:00
Brian Ho	69628ababb	turnip: Execute main cs for secondary command buffers Previously, we only added the secondary command buffer's draw and draw epilogue command streams to the primary command buffer on vkCmdExecuteCommands. However, we also need to merge the primary cs for non-draw operations like vkCmdCopyBuffer and vkCmdBeginQuery. Fixes dEQP-VK.memory.pipeline_barrier.host_write_transfer_src.* and various other tests in dEQP-VK.api.command_buffers.*. Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3988> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3988>	2020-03-03 02:25:25 +00:00
Brian Ho	5715a61fa9	turnip: Promote tu_cs_get_size/is_empty to header These will be used in tu_cmd_buffer.c. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3988>	2020-03-03 02:25:25 +00:00
Rob Clark	8cb9f79413	freedreno/ir3: add assert Catch problems earlier when inputs are not setup correctly. Signed-off-by: Rob Clark <robdclark@chromium.org> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3989> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3989>	2020-02-28 16:53:41 +00:00
Rob Clark	ac705edd82	freedreno/ir3: fix assert with getinfo Fixes: dEQP-VK.glsl.texture_functions.query.texturesamples.sampler2dms_fixed_vertex Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3989>	2020-02-28 16:53:41 +00:00
Rob Clark	c1f4367461	freedreno/ir3: don't precolor unassigned inputs Fixes crash seen in: dEQP-VK.glsl.conversions.matrix_to_matrix.mat4_to_mat3x4_vertex Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3989>	2020-02-28 16:53:41 +00:00
Rob Clark	4b8e198fd2	freedreno/ir3: fix crash with samgq workaround Need to list_delinit() before we clone the instruction to split it into individual samgpN instructions, otherwise we get list corruption. Tested-by: Eduardo Lima Mitev <elima@igalia.com> Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3989>	2020-02-28 16:53:41 +00:00
Rob Clark	56565b7bba	freedreno/ir3: update SFU delay 1) empirically, 10 seems like a more accurate # than 4 2) push "soft" delay handling into ir3_delayslots(), as we should also be using it to calculate the costs that the schedulers use Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3989>	2020-02-28 16:53:41 +00:00
Rob Clark	2cf4b5f29e	freedreno/ir3: track half-precision live values In schedule live value tracking, differentiate between half vs full precision. Half-precision live values are less costly than full precision. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3989>	2020-02-28 16:53:41 +00:00
Rob Clark	4353b3c1c5	freedreno/ir3: don't hide latency when there is none to hide Current scheduler thresholds try to ensure there are warps available to switch to when hiding texture fetch latency. But if there is none to hide, we should allow scheduler to use more registers to reduce nops. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3989>	2020-02-28 16:53:41 +00:00
Rob Clark	9d2aaa589c	freedreno/ir3: rewrite regmask to better support a6xx+ To avoid spurious sync flags, we want to, for a6xx+, operate in terms of half-regs, with a full precision register testing the corresponding two half-regs that it conflicts with. And while we are at it, stop open-coding BITSET Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3989>	2020-02-28 16:53:41 +00:00
Rob Clark	c02cd8afbd	freedreno/ir3: remove regmask_set_if_not() No longer used. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3989>	2020-02-28 16:53:41 +00:00
Rob Clark	141d0d1c25	freedreno/ir3: remove from_tgsi No longer used, other than in ir3 cmdline compiler, where it can be replaced with a local variable. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3989>	2020-02-28 16:53:40 +00:00
Jonathan Marek	c7ac1bcea0	turnip: increase array sizes in tu_descriptor_map Pending the descriptor rework, this allows running the following test: dEQP-VK.renderpass.suballocation.attachment_sparse_filling.input_attachment_127 Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3979> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3979>	2020-02-28 14:04:20 +00:00
Jonathan Marek	d195eef05d	turnip: fall back to sysmem when attachments don't fit into gmem Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3979>	2020-02-28 14:04:20 +00:00
Jonathan Marek	de3230e0a5	turnip: remove unnecessary fb size check Framebuffer with 0 width or height is not valid. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3979>	2020-02-28 14:04:20 +00:00
Jonathan Marek	cf302c9a22	turnip: don't hardcode gmem base for input attachment Newer a6xx no longer has programmable GMEM base, so we can't rely on the kernel driver setting it to 0x100000 (GMEM base is 0 on such GPUs). Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3979>	2020-02-28 14:04:20 +00:00
Jonathan Marek	6420406f19	turnip: fix srgb MRT Register packing macros make this only set the first bit. Set the whole dword to fix srgb for color attachments >0. Fixes: `59f29fc8` ("turnip: Convert the rest of tu_cmd_buffer.c over to the new pack macros.") Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3979>	2020-02-28 14:04:20 +00:00
Jonathan Marek	8f9e1c6047	turnip: fix hw binning + render_area offset interaction Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3979>	2020-02-28 14:04:20 +00:00
Jonathan Marek	de33c23370	turnip: minify image_view extent Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3979>	2020-02-28 14:04:20 +00:00
Jonathan Marek	b18d6575fe	turnip: remove unnecessary MRT_CONTROL fill Hardware won't use MRT_CONTROL after mrt_count Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3979>	2020-02-28 14:04:20 +00:00
Jonathan Marek	33b2db5fb9	turnip: move some constant state to tu6_init_hw Also remove duplicates. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3979>	2020-02-28 14:04:20 +00:00
Jonathan Marek	7d27a9ffb3	turnip: check the right alignment requirement on shader iova I had some trouble because I assumed this was right, tested that the alignment requirement is actually 16. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3979>	2020-02-28 14:04:20 +00:00
Jonathan Marek	0f0662a551	turnip: add r5g5b5a1_unorm/b5g5r5a1_unorm formats r5g5b5a1/b5g5r5a1 tiled/ubwc is the same as a1r5g5b5 (in memory), but linear is read as 1_5_5_5 and written with 5_5_5_1 with swap. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3806> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3806>	2020-02-28 12:48:11 +00:00
Jonathan Marek	80ceebcdd1	turnip: rework format table to support r5g5b5a1_unorm/b5g5r5a1_unorm These formats are an exception that can't be modeled in the current format table. Switch to a table with only a single a6xx_format per vk format, and deal with the exceptions separately (currently the only exception is 10_10_10_2_UNORM which has a different color format). Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3806>	2020-02-28 12:48:11 +00:00
Jason Ekstrand	349898a967	nir: Drop nir_tex_instr::texture_array_size It's set by lots of things and we spend a lot of time maintaining it but no one actually uses the value for anything useful. Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3940> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3940>	2020-02-26 18:29:49 +00:00
Eric Anholt	ec2f905ca8	freedreno/computerator: Fix defined-but-not-used warnings from lex/yacc. Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3954> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3954>	2020-02-26 17:20:24 +00:00
Eric Anholt	bd53f4f56b	turnip: Fix compiler warning about casting a nondispatchable handle. Fixes: `1c5d84fcae` ("turnip: hook up cmdbuffer event set/wait") Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3916> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3916>	2020-02-26 16:58:50 +00:00
Jonathan Marek	87924646db	turnip: enable fullDrawIndexUint32/independentBlend/dualSrcBlend/logicOp These are already implemented but missing from VkPhysicalDeviceFeatures. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3923> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3923>	2020-02-26 00:21:45 +00:00
Jonathan Marek	708c3a5ffd	turnip: enable sampleRateShading feature There's still a TODO related to key->sample_shading, but it doesn't look like it changes anything in ir3, so it works without that. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3923>	2020-02-26 00:21:45 +00:00
Rob Clark	26d42645f9	freedreno/computerator: fix build dependency Ensure the generated register headers are built before computerator uses them. Reported-by: Clayton Craft <clayton.a.craft@intel.com> Signed-off-by: Rob Clark <robdclark@chromium.org> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3939> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3939>	2020-02-25 02:02:06 +00:00
Jonathan Marek	0ee76b90d5	turnip: move tile_load_ib/sysmem_clear_ib into draw_cs Avoids having to calculate reserved sizes for substream cs, also matches what the blob does. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3925> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3925>	2020-02-24 21:52:46 +00:00
Jonathan Marek	a410e64b68	turnip: make cond_exec helper easier to use Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3925>	2020-02-24 21:52:46 +00:00
Jonathan Marek	6ede9749d2	turnip: remove marker seqno Use robclark's new crashdec/devcoredump thing instead. Note: not sure this ever really worked because it didn't WFI. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3925>	2020-02-24 21:52:45 +00:00
Jonathan Marek	cf94124e1c	turnip: automatically reserve cmdstream space in emit_pkt4/emit_pkt7 Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3925>	2020-02-24 21:52:45 +00:00
Jonathan Marek	4b2a7dcd93	turnip: add tu_device pointer to tu_cs Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3925>	2020-02-24 21:52:45 +00:00
Jonathan Marek	a9a990a60b	turnip: fix COND_EXEC reserved size in tu_query Conditionally executed dwords must be in the same bo. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3925>	2020-02-24 21:52:45 +00:00
Rob Clark	2275343ba3	freedreno/computerator: add computerator A standalone tool to compile and run compute shaders from ir3 assembly. Mostly to have an easy way to experiment with instructions. Signed-off-by: Rob Clark <robdclark@chromium.org> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3926> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3926>	2020-02-24 21:31:53 +00:00
Rob Clark	568e948d1f	freedreno/ir3: allow block->predecessors to be null This way we can also use ir3_print from computerator, which mostly bypasses the ir3_block construct (since it doesn't need to do scheduling, etc) Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3926>	2020-02-24 21:31:53 +00:00
Rob Clark	f87d412f08	freedreno/computerator: rename prefix asm->ir3 Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3926>	2020-02-24 21:31:53 +00:00
Rob Clark	6ee68d796e	freedreno/computerator: polish out some of the rust Updates for differences between fdre-a3xx's early version of ir3, and what we have now in mesa. And updates for instruction name and syntax changes. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3926>	2020-02-24 21:31:53 +00:00
Rob Clark	3bb340cf4f	freedreno/computerator: import parser/lexer from fdre-a3xx Import the rusty old parser from freedreno.git Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3926>	2020-02-24 21:31:53 +00:00
Eric Anholt	e4baff9081	freedreno: Switch to using lowered image intrinsics. This cuts out a bunch of deref chain walking that the compiler can do for us. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3728> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3728>	2020-02-24 18:25:02 +00:00
Eric Anholt	a703840320	freedreno/ir3: Fix the arg to ir3_get_num_components_for_image_format() GLuint worked fine for storing our enum, but it should be an enum pipe_format since the image-formats merge. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3728>	2020-02-24 18:25:02 +00:00
Eric Anholt	1b7de2d6b8	freedreno/ir3: Reuse glsl_get_sampler_dim_coordinate_components() in tex_info. Now that we have access to the interior switch statement not going through the txs special case for coord_components, we can just use it. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3728>	2020-02-24 18:25:02 +00:00
Kristian H. Kristensen	daa4020948	freedreno/ir3: Lower output precision This lowers mediump FS outputs to fp16 in the ir3 backend. For now this is a modest improvement, which mostly helps us whittle down the full mediump work. Once the GLSL level support lands, then right hand side of the store output intrinsics will be fp16 expressions and we'll cancel out the fp16 -> fp32 -> fp 16 round trip here. We've had different attempts at implementing this: rewriting stores in the GLSL IR, lowering GLSL IR outputs to temporaries and inserting conversions when writing the temporaries to the outputs. In the end, GLSL ends up getting in the way a lot and doing it at the nir level is easier and still possible since we have the output var precisions. This part of the fp16 work is more of a step on the way towards full fp16 support and will add a few extra conversion instructions: total instructions in shared programs: 8151 -> 8163 (0.15%) instructions in affected programs: 1187 -> 1199 (1.01%) helped: 4 HURT: 10 total nops in shared programs: 3146 -> 3152 (0.19%) nops in affected programs: 563 -> 569 (1.07%) helped: 5 HURT: 10 total non-nops in shared programs: 5005 -> 5011 (0.12%) non-nops in affected programs: 92 -> 98 (6.52%) helped: 0 HURT: 3 total dwords in shared programs: 12832 -> 12800 (-0.25%) dwords in affected programs: 96 -> 64 (-33.33%) helped: 1 HURT: 0 total last-baryf in shared programs: 118 -> 115 (-2.54%) last-baryf in affected programs: 21 -> 18 (-14.29%) helped: 1 HURT: 0 total full in shared programs: 424 -> 417 (-1.65%) full in affected programs: 15 -> 8 (-46.67%) helped: 7 HURT: 0 Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3822> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3822>	2020-02-24 17:24:13 +00:00
Hyunjun Ko	c822460f85	freedreno/ir3: handle half registers for arrays during register allocation. So far we only handle full regs of arrays during pre-allocation. This patch is to handle half regs of arrays and also consider the size of half regs when finding out conflicts. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3822>	2020-02-24 17:24:13 +00:00
Hyunjun Ko	6ee375f68d	freedreno/ir3: Add new ir3 pass to fold out fp16 conversions This pass tries to fold f2f16 conversion into alu instructions. This will be useful to help reduce the number of instructions once mesa starts supporting precision lowering. For example: add.f r0.w, r0.w, c0.x cov.f32f16 hr2.x, r0.w to add.f hr2.x, r0.w, c0.x Additionally this pass also tries to fold f2f16 conversion into load_input instruction: bary.f r0.x, 3, r0.w cov.f32f16 hr0.x, r0.x to bary.f hr1.x, 3, r0.x v2: Edit to not fold OPC_MAX_F and OPC_MIN_F, since that's not valid. v3: Add OPC_ABSNEG_F to the blacklist as well. v4: Don't remove dead cov instructions, DCE will do that later; don't iterate through sources when a cov only has one; remove special handling of IR3_REG_ARRAY and IR3_REG_RELATIV. v5: Handle folding into u32.u32 movs of floats correctly, don't bail out on IR3_REG_RELATIV or IR3_REG_ARRAY movs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3822>	2020-02-24 17:24:13 +00:00
Eduardo Lima Mitev	99f2b6144b	turnip/pipeline: Don't assume tu_shader is a valid object Fixes a crash in tu6_emit_fs_config() when 'shader' argument is assumed to be non-null, which is possible. Fixes dEQP test: dEQP-VK.api.descriptor_set.descriptor_set_layout_lifetime.graphics Reviewed-by: Jonathan Marek <jonathan@marek.ca> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3756> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3756>	2020-02-24 12:20:20 +00:00
Jonathan Marek	d795eb207f	turnip: add option to force use of hw binning For running deqp tests which have small render sizes and don't otherwise get coverage of hw binning / multiple tiles. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3851> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3851>	2020-02-19 22:24:44 +00:00
Rob Clark	06dc280a57	freedreno/registers: cleanup CP_SET_MARKER 1) Name RM6_COMPUTE, and rename RM6_ENDVIS (from RM6_BLIT) to better reflect what it actually does 2) Cleanup open-coded mode enum values 3) Removed unused 0x10 Signed-off-by: Rob Clark <robdclark@chromium.org> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3833> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3833>	2020-02-18 20:52:42 +00:00
Rob Clark	7b4d6bb1ec	freedreno: quiet INFO_MSG Probably not useful unless LIBGL_DEBUG is set to something. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3833>	2020-02-18 20:52:42 +00:00
Rob Clark	838ed2885d	freedreno/a6xx: few register updates Nothing used by mesa, but crashdec tool uses a few of these. And since the practice is these days to sync mesa->envytools, adding these on the mesa side first. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3833>	2020-02-18 20:52:42 +00:00
Rob Clark	4fc31e7d33	freedreno/registers: teach gen_header.py about a3xx_regid This is a builtin type (treated as uint, but with special type-aware decoding) in envytools/cffdump. Let's teach gen_header.py about it and drop the enum hack in the xml so I don't have to keep deleting the enum when I sync the xml back to the freedreno envytools tree. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3833>	2020-02-18 20:52:42 +00:00
Jonathan Marek	946eacbafb	freedreno/a6xx: document some unknown bits Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Rob Clark <robdclark@gmail.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3814> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3814>	2020-02-14 08:22:33 -05:00
Jonathan Marek	75fbe089a6	freedreno: name sysmem color/depth flush events Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Rob Clark <robdclark@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3814>	2020-02-14 08:22:33 -05:00
Connor Abbott	da64c35ff9	tu: Force sysmem with mipmapped non-aligned linear stores Fixes hangs with dEQP-VK.api.image_clearing.core.clear_color_image.1d.linear.single_layer.r8g8b8a8_unorm and many others on a640, and presumably silent corruption with a630. Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713>	2020-02-12 21:37:05 -05:00
Connor Abbott	f026982265	tu: Support input attachments with sysmem Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713>	2020-02-12 21:37:05 -05:00
Connor Abbott	c1b3f9e832	tu: Support resolve ops with sysmem rendering Similar to vkCmdClearAttachments(), we use CP_COND_REG_EXEC to conditionally execute both the gmem and sysmem paths, except for after the last subpass where it's known whether we're using sysmem rendering or not. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713>	2020-02-12 21:37:01 -05:00
Connor Abbott	8647a24a8d	tu: Handle vkCmdClearAttachments() with sysmem Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713>	2020-02-12 21:36:41 -05:00
Connor Abbott	07e07daeae	tu: Add helper for CP_COND_REG_EXEC Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713>	2020-02-12 21:36:41 -05:00
Connor Abbott	6a0c4008bf	tu: Sysmem rendering This has only lightly been tested. It passes dEQP-VK.api.smoke.triangle, so at least we're able to show a triangle. For now, it's just enabled under a debug flag. In the future we'll probably want some heuristics like what freedreno has and another debug flag to disable it except when it's forced. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713>	2020-02-12 21:36:36 -05:00
Connor Abbott	041783d49d	tu: Disable linear depth attachments Also, disable importing depth/stencil textures. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713>	2020-02-12 21:31:57 -05:00
Connor Abbott	ab3db20cb5	tu: Support multisample image clears We may need shader workarounds for some formats, but for now this seems to work at least as well as the gmem path for clearing multisample attachments. And soon we'll start calling this even on the gmem path, since we leave the final decision of whether to use sysmem or not up till the end, so we can't have it assert or otherwise working tests would assert. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713>	2020-02-12 21:31:57 -05:00
Connor Abbott	a5fb515301	tu/blit: Support blits in secondary cmdstreams For sysmem rendering we'll have to emit a delayed clear IB to implement LOAD_OP_*, similar to the existing tile_load_ib. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713>	2020-02-12 21:31:44 -05:00
Connor Abbott	a94be3da84	tu: Properly set UBWC flags in RB_RENDER_CNTL Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713>	2020-02-12 21:23:50 -05:00
Connor Abbott	49817cb3ea	tu: Don't emit initial render target state in tile_load_ib Emitting it directly in CmdBeginRenderPass should be around the same, except that now we can easily share it with the sysmem path. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3713>	2020-02-12 21:23:50 -05:00
Chad Versace	787b56ac0e	turnip: Add a618 support I merely ported a freedreno patch to turnip which updates some magic register values. commit `ff6e148a3d` Author: Rob Clark <robdclark@chromium.org> CommitDate: Tue Oct 29 09:19:34 2019 -0700 Subject: freedreno/a6xx: add a618 support That's all that Rob did for gallium for a618, so I assume that's all we need for turnip also. Tested manually with: dEQP-VK.api.image_clearing.core.clear_color_image.2d.linear.single_layer.* pass 300/555 fail 0/555 skip 255/555 Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3743> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3743>	2020-02-12 23:27:43 +00:00
Chad Versace	ef5da26089	turnip: Add magic register values to tu_physical_device The values of some magic registers differ across chipsets. fd6_context manages the differences by initializing them at runtime. Let's do the same. Add to tu_physical_device a subset of those found in fd6_context: RB_UNKNOWN_8E04_blit RB_CCU_CNTL_gmem PC_UNKNOWN_9805 SP_UNKNOWN_A0F8 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3743>	2020-02-12 23:27:43 +00:00
Jonathan Marek	18786cc7d5	freedreno/a6xx: use single format enum Loses some information about which formats can be used in which cases, but we encode that information in the format table anyway. Important notes: * RB6_R10G10B10A2_UNORM becomes FMT6_R10G10B10A2_UNORM_DEST * TFMT6_8_8_8_UNORM becomes FMT6_8_8_8_X8_UNORM (not FMT6_8_8_8_UNORM) Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3798> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3798>	2020-02-12 21:59:59 +00:00
Arcady Goldmints-Orlov	e9f83185a2	Rename nir_lower_constant_initializers to nir_lower_variable_initializers This naming is clearer, as nir_variables can be initialized not just with a nir_constant but with a pointer to another nir_variable. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3047> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3047>	2020-02-12 15:41:49 +00:00
Hyunjun Ko	7bddaa6136	freedreno/ir3: Fold const only when the type is float Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3737> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3737>	2020-02-07 09:53:48 -08:00
Hyunjun Ko	260bd32b58	freedreno/ir3: put the conversion back for half const to the right place. The previous commit leads to match immed values unexpectedly. This makes constlen for each shader including bvert wrong. Also fixes atan2 for mediump deqp tests. Fixes: `cbd1f47433` ("freedreno/ir3: convert back to 32-bit values for half constant registers.") v2: Move conversion up above fabs/fneg modifier handling as well. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3737>	2020-02-07 09:53:42 -08:00
Hyunjun Ko	d70192e697	freedreno/ir3: Add cat4 mediump opcodes v2: Reworked to assign half-opcodes in ir3_ra.c (krh). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3737>	2020-02-07 09:51:25 -08:00
Rob Clark	3eca6d9ce1	freedreno/ir3: fold const conversion into consumer A sequence like: (nop3)cov.f32f16 hr0.x, c0.x mul.f hr4.y, hr1.z, hr0.x can be turned into: mul.f hr4.y, hr1.z, hc0.x Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3737>	2020-02-07 09:51:25 -08:00
Hyunjun Ko	5e2012d5c7	freedreno/ir3: fix printing half constant registers. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3737>	2020-02-07 09:51:25 -08:00
Kristian H. Kristensen	d55dfef782	freedreno/ir3: Set IR3_REG_HALF flag on src as well in immediate MOV This lets is_same_type_reg() recognize that the dst and src of the immediate MOV are the same and unblocks fp16 constant propagation. Signed-off-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3737>	2020-02-07 09:51:25 -08:00
Martin Fuzzey	d8bae10bfe	freedreno: android: fix build of perfcounters. Some dependencies were missing on android causing a build failure. Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3736> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3736>	2020-02-07 16:34:49 +00:00
Martin Fuzzey	fad9924315	freedreno: android: add a6xx-pack.xml.h generation to android build The generation of a6xx-pack.xml.h was missing in the android build scripts leading to a build failure. Signed-off-by: Martin Fuzzey <martin.fuzzey@flowbird.group> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3736>	2020-02-07 16:34:49 +00:00
Martin Fuzzey	cad400a59e	freedreno: android: fix build failure on android due to python version The freedreno gen_header.py script now only works under python3. It contains a "print()" call which prints a blank line under python3 but prints "()" under python2.7. However the Android build currently uses python2. This leads to incorrect code generation and a later build error. .../STATIC_LIBRARIES/libfreedreno_registers_intermediates/registers/adreno_common.xml.h:163:2: error: expected identifier or '(' () Fix this by adding MESA_PYTHON3 and using it for the freedreno scripts. Signed-off-by: Martin Fuzzey <martin.fuzzey@flowbird.group> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3736>	2020-02-07 16:34:49 +00:00
Eric Anholt	8d07d66180	glsl,nir: Switch the enum representing shader image formats to PIPE_FORMAT. This means you can directly use format utils on it without having to have your own GL enum to number-of-components switch statement (or whatever) in your vulkan backend. Thanks to imirkin for fixing up the nouveau driver (and a couple of core details). This fixes the computed qualifiers for EXT_shader_image_load_store's non-integer sizeNxM qualifiers, which we don't have tests for. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> (v3d) Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3355> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3355>	2020-02-05 10:31:14 -08:00
Connor Abbott	8455648cca	tu: Move vsc_data and vsc_data2 allocation into the device In addition to preparing us for dynamically resizing them, which has to be controlled by the device, this greatly reduces the memory usage when allocating large numbers of command buffers, making dEQP-VK.api.object_management.max_concurrent.command_buffer_primary go from crash -> pass. Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3621> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3621>	2020-02-05 15:27:28 +01:00
Connor Abbott	84bd4da468	freedreno: Fix CP_COND_EXEC Noticed while looking at a trace of the Vulkan blob. Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Rob Clark <robdclark@gmail.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3600> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3600>	2020-02-05 13:14:22 +00:00
Connor Abbott	ed5d1c1c47	freedreno: Add CP_REG_WRITE documentation Document the first DWORD, which at least for the Vulkan blob on a640 isn't always 2. Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Rob Clark <robdclark@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3600>	2020-02-05 13:14:22 +00:00
Connor Abbott	65197a3ac1	freedreno: Fix CP_COND_REG_EXEC bit positions Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Reviewed-by: Rob Clark <robdclark@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3600>	2020-02-05 13:14:22 +00:00
Eric Anholt	22d2cbe685	freedreno: Allow UBWC on textures with multiple mipmap levels. This is a backport of Jonathan Marek's UBWC work on turnip to GL. Performance highlights from our trace set (320 frames sampled) traces/glmark2/texture-texture-filter=mipmap.rdc: +9.1% +/- 2.2% traces/android/trex.rdc: +8.7% +/- 0.4% traces/glmark2/desktop-effect=shadow:windows=4.rdc: +4.2% +/- 2.5% Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3059> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3059>	2020-02-04 23:18:00 +00:00
Eric Anholt	17312b4a10	freedreno: Rename the UBWC layer size field and store it as bytes. This makes the field description match its usage in the code, matches tu's usage of the field, and avoids storing values in surprising units. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3059>	2020-02-04 23:18:00 +00:00
Eric Anholt	b6b4118bb0	freedreno: Include the layer size in layout debug. It's been many of my bugs so far. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3059>	2020-02-04 23:18:00 +00:00
Eric Anholt	20357dfde8	freedreno: Move the layout debug under FD_MESA_DEBUG=layout. I keep wanting to turn this on while debugging layout stuff, and I suspect krh and robclark could use it too. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3059>	2020-02-04 23:18:00 +00:00
Eric Engestrom	cae6093266	freedreno/perfcntrs: fix fd leak CoverityID: 1110568, 1458071 Fixes: `5a13507164` ("freedreno/perfcntrs: add fdperf") Signed-off-by: Eric Engestrom <eric@engestrom.ch> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3671> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3671>	2020-02-04 19:26:40 +00:00
Kristian H. Kristensen	df6a2a7197	turnip: Be explicit about converting vk compare func to a6xx Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3686> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3686>	2020-02-04 06:03:52 +00:00
Kristian H. Kristensen	67dd51606c	freedreno/fdperf: Cast away some ignored return values This is a developer tool, it can crash and burn if it fails to allocate. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3686>	2020-02-04 06:03:52 +00:00
Rob Clark	982d61e2cd	freedreno/ir3: fix a dirty lie Lies, damn lies, and leftover hacks! We no longer hard-code these two, so fix the disasm to print the correct values. Signed-off-by: Rob Clark <robdclark@chromium.org> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3569> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3569>	2020-02-01 02:40:22 +00:00
Rob Clark	752aeb7b3f	freedreno/ir3: simplify split from collect In some cases we need to split components out from what was already a collect. That was making it hard to DCE unused components of the collect. (Ie. unused components of fragcoord, etc) So just detect this case and skip the chained collect+split. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3569>	2020-02-01 02:40:22 +00:00
Rob Clark	8d0e7d9a4c	freedreno/ir3: create fragcoord instructions in input block This was somehow working to create the instructions in a random block, and use the value in other blocks, by dumb luck. But two-pass-RA's better choice of register assignment causes a couple dEQPs to start failing without this fix: dEQP-GLES3.functional.shaders.metamorphic.bubblesort_flag.variant_1 dEQP-GLES3.functional.shaders.metamorphic.bubblesort_flag.variant_2 Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3569>	2020-02-01 02:40:22 +00:00
Rob Clark	fb09020ef2	freedreno/ir3: remove unused tex arg harder Just killing the SSA link isn't enough. It confuses RA, legalize, and postsched to see a bogus unused reg. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3569>	2020-02-01 02:40:22 +00:00
Rob Clark	2ffe44ec0a	freedreno/ir3: add RA sanity check Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3569>	2020-02-01 02:40:22 +00:00
Rob Clark	3e79c4f0ed	freedreno/ir3: two pass register allocation Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3569>	2020-02-01 02:40:22 +00:00
Rob Clark	b0293af7a5	freedreno/ir3: don't precolor unused inputs This apparently can happen with gs/tess. And will cause problems with two-pass-ra, so let's just skip them. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3569>	2020-02-01 02:40:22 +00:00
Rob Clark	ad2587d3c8	freedreno/ir3: add is_tex_or_prefetch() Some of the aspects of tex prefetch are in common with normal tex instructions, such as having a wrmask to control which components are written. Add a helper for this. This should result in actually using the prefetch wrmask to avoid fetching unneeded components. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3569>	2020-02-01 02:40:22 +00:00
Rob Clark	4a7a6c9ef0	freedreno/ir3: number instructions from one ra_block_compute_live_ranges() treats zero as "not yet defined", so probably best to not let this be a valid instruction # Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3569>	2020-02-01 02:40:22 +00:00
Rob Clark	0f78c32492	freedreno/ir3: post-RA sched pass After RA, we can schedule to increase parallelism (reduce nop's) without worrying about increasing register pressure. This pass lets us cut down the instruction count ~10%, and prioritize bary.f, kill, etc, which would tend to increase register pressure if we tried to do that before RA. It should be more useful if RA round-robin'd register choices. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3569>	2020-02-01 02:40:22 +00:00
Rob Clark	3369406e44	freedreno/ir3: fix kill scheduling kill (and other cat0/flow instructions) do not have a dst register. Which was mostly harmless before, other than RA thinking it would need a free register to write. (But nothing consumed it, so the value would be immediately dead.) But this would cause more problems with postsched which would see a bogus dependency. Also, post-RA sched does need to see the dependency on the predicate register. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3569>	2020-02-01 02:40:22 +00:00
Rob Clark	9a9f78f1f9	freedreno/ir3/ra: make use()/def() functions instead of macros Originally these were nested functions, which worked nicely, giving us the function of a local macro that was actual 'c' syntax (ie. not token pasted macro). But these were converted to macros because clang doesn't let us have nice gcc extensions. Extract these back out into functions, before adding more things and making the macros even more cumbersome. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3569>	2020-02-01 02:40:22 +00:00
Rob Clark	a5f24f966a	freedreno/ir3: a bit more optmsgs debug Also dump where arrays are allocated. This was useful for debugging. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3569>	2020-02-01 02:40:22 +00:00
Rob Clark	300d1181c7	freedreno/ir3: move atomic fixup after RA A post-RA sched pass will move the extra mov's to the wrong place, so rework the fixup so it can run after RA (and therefore after postsched) Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3569>	2020-02-01 02:40:22 +00:00
Rob Clark	304b50c9f8	freedreno/ir3: move block-scheduling into legalize We want to do this only once. If we have post-RA sched pass, then we don't want to do it pre-RA. Since legalize is where we resolve the branch/jumps, we might as well move this into legalize. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3569>	2020-02-01 02:40:22 +00:00
Rob Clark	093c94456b	freedreno/ir3: move nop padding to legalize This way we can deal with it in one place, after all the blocks have been scheduled. Which will simplify life for a post-RA sched pass. This has the benefit of already taking into account nop's that legalize has to insert for non-delay related reasons. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3569>	2020-02-01 02:40:22 +00:00
Rob Clark	c803c662f9	freedreno/ir3: split out delay helpers We're going to want these also for a post-RA sched pass. And also to split nop stuffing out into its own pass. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3569>	2020-02-01 02:40:22 +00:00
Rob Clark	54c795f829	freedreno/ir3: fix crash when no non-input instructions This scenario can come up with block-sched and nop-sched moved to after RA. So let's fix it first to keep things bisectable. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3569>	2020-02-01 02:40:22 +00:00
Rob Clark	c1194e10b2	freedreno/ir3: cleanup after lower_locals_to_regs Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3569>	2020-02-01 02:40:22 +00:00
Rob Clark	f0b792ea06	freedreno/ir3: shuffle a few ir3_register fields It makes life easier for postsched to always be able to rely on wrmask. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3569>	2020-02-01 02:40:22 +00:00
Rob Clark	d326d30efe	freedreno/drm: readonly cmdstream Noticed that we weren't consistently making cmdstream buffers gpu-readonly. Fix that and drop the need to pass flags to fd_bo_new_ring(). Signed-off-by: Rob Clark <robdclark@chromium.org> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3663> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3663>	2020-01-31 13:01:52 -08:00
Brian Ho	58fd26c433	turnip: Fix vkCmdCopyQueryPoolResults with available flag Previously, calling vkCmdCopyQueryPoolResults with the VK_QUERY_RESULT_WITH_AVAILABILITY_BIT flag set the query result field in the buffer to 0 if unavailable and the query result if available. This was a misunderstanding of the Vulkan spec, and this commit corrects the behavior to emitting a separate available result in addition to the query result. Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3560> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3560>	2020-01-30 20:30:46 +00:00
Brian Ho	1a3e2a7fa8	turnip: Fix vkGetQueryPoolResults with available flag Previously, calling vkGetQueryPoolResults with the VK_QUERY_RESULT_WITH_AVAILABILITY_BIT flag set the query result field in *pData to 0 if unavailable and the query result if available. This was a misunderstanding of the Vulkan spec, and this commit corrects the behavior to writing a separate available result in addition to the query result. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3560>	2020-01-30 20:30:46 +00:00
Brian Ho	1c3319cf81	turnip: Free event->bo on vkDestroyEvent Fixes a leak from freeing event but not event->bo. Reviewed-by: Jonathan Marek <jonathan@marek.ca> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3639> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3639>	2020-01-30 18:50:06 +00:00
Jonathan Marek	1c5d84fcae	turnip: hook up cmdbuffer event set/wait Gets some basic tests under "dEQP-VK.synchronization.event" passing Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3123> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3123>	2020-01-29 23:13:43 +00:00
Alejandro Piñeiro	d5c32db076	turnip: remove unused descriptor state dirty It was only used to be initialized to zero. Not even updated as descriptor sets are bound. As far as I understand, setting the bit TU_CMD_DIRTY_DESCRIPTOR_SET on tu_cmd_state.dirty is used instead. Reviewed-by: Jonathan Marek <jonathan@marek.ca> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3624> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3624>	2020-01-29 20:52:52 +00:00
Eric Anholt	06b13dfed2	tu: Fix binning address setup after pack macros change. This fixes a regression in "vkcube -m headless" rendering, but upsettingly none of the CTS tests I've been using. Fixes: `59f29fc845` ("turnip: Convert the rest of tu_cmd_buffer.c over to the new pack macros.") Caught-by: Jonathan Marek <jonathan@marek.ca> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3609> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3609>	2020-01-29 19:30:09 +00:00
Brian Ho	3d5bdea2cf	turnip: Enable occlusionQueryPrecise This commit enables the occlusionQueryPrecise feature. No additional work is required as occlusion queries are already implemented to track exact sample counts. Also enables a number of extra tests on the Vulkan CTS. Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3605> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3605>	2020-01-29 19:05:23 +00:00
Samuel Pitoiset	15d53d8294	compiler: add PERSP to the existing barycentric system values We need the LINEAR versions for AMD_shader_explicit_vertex_parameter. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3578>	2020-01-29 09:49:50 +00:00
Eduardo Lima Mitev	e6b531af66	turnip: Fix issues in tu_compute_pipeline_create() that may lead to crash The shader object is destroyed even if its creation failed. It is also not destroyed if its compilation or upload fails, leading to leaks. Finally, tu_compute_pipeline_create() should set output var pPipeline to VK_NULL_HANDLE if it fails. Avoids crash on dEQP-VK.api.object_management.alloc_callback_fail_multiple.compute_pipeline Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3572> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3572>	2020-01-29 09:25:20 +00:00
Eduardo Lima Mitev	0e11e8ba89	turnip: Remove failed command buffer from pool When an error condition occurs during tu_create_cmd_buffer(), the cmd buffer has already been added to a pool, so the cleanup code should remove it. Fixes a crash (assert in tu_device::tu_bo_finish()) in dEQP tests: dEQP-VK.api.object_management.max_concurrent.command_buffer_primary dEQP-VK.api.object_management.max_concurrent.command_buffer_secondary due to pool attempting to destroy an invalid command buffer. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3572>	2020-01-29 09:25:20 +00:00
Rob Clark	63af27bc76	freedreno/drm: fix invalid-cmdstream-size with older kernels A cmdstream of size zero is invalid. But this can appear in various places where we emit a pointer to state. This doesn't show up with newer kernels (newer than v5.0) which use "softpin", but on earlier kernels can result in: [drm:msm_ioctl_gem_submit [msm]] ERROR invalid cmdstream size: 0 Since the pointer value doesn't matter in these cases, the easy solution is just to not emit a cmds table entry in this case. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/2805> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/2805>	2020-01-28 00:09:34 +00:00
Brian Ho	f55e215b8c	turnip: Implement vkCmdCopyQueryPoolResults for occlusion queries Use CP_COND_EXEC and CP_COND_WRITE to conditionally copy the results of a query to a buffer based off the query's availability. Fixes: #2238 Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3279> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3279>	2020-01-24 18:14:01 +00:00
Brian Ho	9a3656b9fd	turnip: Implement vkCmdResetQueryPool Clears the available bit for each requested query on the GPU. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3279>	2020-01-24 18:14:01 +00:00
Brian Ho	97fa4cb3dc	turnip: Implement vkGetQueryPoolResults for occlusion queries Implements fetching the results of a query pool with the VK_QUERY_RESULT_WAIT_BIT, VK_QUERY_RESULT_WITH_AVAILABILITY_BIT, and VK_QUERY_RESULT_PARTIAL_BIT flags. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3279>	2020-01-24 18:14:01 +00:00
Brian Ho	24b95485dc	turnip: Update query availability on render pass end Unlike on an immediate-mode renderer, Turnip only renders tiles on vkCmdEndRenderPass. As such, we need to track all queries that were active in a given render pass and defer setting the available bit on those queries until after all tiles have rendered. This commit adds a draw_epilogue_cs to tu_cmd_buffer that is executed as an IB at the end of tu_CmdEndRenderPass. We then emit packets to this command stream that update the availability bit of a given query in tu_CmdEndQuery. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3279>	2020-01-24 18:14:01 +00:00
Brian Ho	f750dd2ab8	turnip: Implement vkCmdEndQuery for occlusion queries Mostly a translation of freedreno's implementation of glEndQuery for GL_SAMPLES_PASSED query objects with a slight modification to set the availability bit of the query bo (slot->available) if the query was not ended inside a render pass. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3279>	2020-01-24 18:14:01 +00:00
Brian Ho	5824a59ee2	turnip: Implement vkCmdBeginQuery for occlusion queries Mostly a translation of freedreno's implementation of glBeginQuery for GL_SAMPLES_PASSED query objects with special logic for handling tiled render passes. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3279>	2020-01-24 18:14:01 +00:00
Brian Ho	78dea40b1c	turnip: Implement vkCreateQueryPool for occlusion queries General structure is inspired by anv's implementation in genX_query.c. We define a packed struct that tracks sample count at the beginning of the query and at the end; the result of the occlusion query is then slot->end - slot->begin. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3279>	2020-01-24 18:14:01 +00:00
Brian Ho	a155ab93a3	turnip: Update tu_query_pool with turnip-specific fields tu_query_pool was forked from radv_query_pool, but we will need a different set of fields to implement queries in turnip. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3279>	2020-01-24 18:14:01 +00:00
Connor Abbott	b103157a0e	freedreno: Document CP_INDIRECT_BUFFER_CHAIN This will let us use batch chaining instead of growing batches on a5xx and a6xx. Reviewed-by: Rob Clark <robdclark@gmail.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3537> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3537>	2020-01-24 10:03:08 +00:00
Connor Abbott	f58242b56e	freedreno: Document CP_UNK_A6XX_55 Reviewed-by: Rob Clark <robdclark@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3537>	2020-01-24 10:03:08 +00:00
Connor Abbott	3cf1d6b8db	freedreno: Document CP_COND_REG_EXEC more The vulkan blob uses the RENDER_MODE mode to condition a blit on the render mode in traces of a dEQP triangle test. Reviewed-by: Rob Clark <robdclark@gmail.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3182> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3182>	2020-01-24 09:23:27 +00:00
Eric Anholt	59f29fc845	turnip: Convert the rest of tu_cmd_buffer.c over to the new pack macros. There are only a couple of hard cases left using pkt4, where the register number to write is computed. Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3455> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3455>	2020-01-23 22:46:09 +00:00
Eric Anholt	d67100519e	turnip: Convert renderpass setup to the new register packing macros. This gets a lot of the hard code converted over to the new macros, resulting in (I feel) much more readable code with LESS_SHOUTING_ABOUT_THE_REG(). I decided to consistently put the reg on its own line, so that all the register names line up. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3455>	2020-01-23 22:46:09 +00:00
Eric Anholt	08837ea3d2	turnip: Port krh's packing macros from freedreno to tu. This introduces some minor unpacking of the temporary fd_reg_pair structs to code that previously was packing a whole register field. In the pack wrapper in tu_cs.h, I added some explanatory docs, dropped the relocs handling since we don't need it, and removed the extra regs[] in the __ONE_REG() macro (which was causing gcc's optimizer to fall on its face in my release build). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3455>	2020-01-23 22:46:09 +00:00
Eric Anholt	d4bc3c93ea	freedreno: Fix OUT_REG() on address regs without a .bo supplied. Sometimes you want to zero out an address by supplying a NULL BO, but without this we would end up only emitting one dword. Increases size of fd6_gmem.o by .8%, though it's not clear to me why (no obvious terrible codegen happening) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3455>	2020-01-23 22:46:09 +00:00
Eric Anholt	c1327bc283	freedreno: Add some missing a6xx address declarations. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3455>	2020-01-23 22:46:09 +00:00
Eric Anholt	fbd9b4ce08	turnip: Fix execution of secondary cmd bufs with nothing in primary. We want to finish off cmd emission in the primary CS and add its entry to the IB, but regardless of whether there had been anything in the primary CS to emit, we still need a reserved CS entry for the loop below. Fixes crashes in dEQP-VK.binding_model.shader_access.secondary_cmd_buf.* and many more in dEQP-VK.renderpass* Reviewed-by: Jonathan Marek <jonathan@marek.ca> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3524> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3524>	2020-01-23 20:27:26 +00:00
Jonathan Marek	8aa5d96864	turnip: simplify tu_physical_device_get_format_properties Fixes the "bad VkImageTiling" error when tiling is VK_IMAGE_TILING_DRM_FORMAT_MODIFIER_EXT. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Acked-by: Eric Anholt <eric@anholt.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3485> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3485>	2020-01-23 18:34:07 +00:00
Jonathan Marek	b7e22b7a35	vulkan/wsi: remove unused image_get_modifier Signed-off-by: Jonathan Marek <jonathan@marek.ca> Acked-by: Eric Anholt <eric@anholt.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3485>	2020-01-23 18:34:07 +00:00
Jonathan Marek	e8afd40758	turnip: set linear tiling for scanout images Fixes: `210e6887` "vulkan/wsi: Use the interface from the real modifiers extension" Signed-off-by: Jonathan Marek <jonathan@marek.ca> Acked-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3485>	2020-01-23 18:34:07 +00:00
Jonathan Marek	11f6fba1c9	turnip: hook up GetImageDrmFormatModifierPropertiesEXT Fixes: `210e6887` "vulkan/wsi: Use the interface from the real modifiers extension" Signed-off-by: Jonathan Marek <jonathan@marek.ca> Acked-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3485>	2020-01-23 18:34:07 +00:00
Guido Günther	c5334d2943	freedreno/drm: Don't miscalculate timeout The current code overflows (s * 1000000000) for s >= 5 but that is e.g. used in msm_bo_cpu_prep. Signed-off-by: Guido Günther <agx@sigxcpu.org> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3514> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3514>	2020-01-23 18:07:13 +00:00
Eric Anholt	b327501dbf	turnip: Add support for fine derivatives. This does appear to be the required instruction sequence (dsxpp_1 dst src; dsxpp_1.p dst src) as dropping either instruction fails the testsuite. Fixes dEQP-VK.glsl.derivate.* Reviewed-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Rob Clark <robdclark@chromium.org> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3494> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3494>	2020-01-23 17:38:29 +00:00
Eric Anholt	876824908d	freedreno/ir3: Plumb the ir3_shader_variant into legalize. legalize is computing a lot of state that goes in the variant, let's just store it directly instead of passing pointers around. This leaves max_bary in place, which is doing some surprising work (overwriting the original total_in in some cases). Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3494>	2020-01-23 17:38:29 +00:00
Anthony Pesch	f77369086c	util/hash_table: update users to use new optimal integer hash functions Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3475> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3475>	2020-01-23 17:06:57 +00:00
Eric Anholt	65e432695d	turnip: Add support for uniform texel buffers. Pretty straightforward: Port texture descriptor code from freedreno, fill in alignment limits from closed vk, and tu_cmd_buffer.c was already uploading the texture descriptor. This doesn't implement storage texel buffers (required in the compute pipeline) yet, since those will need an IBO descriptor for the store path. Still, making the load path be connected to the texture descriptor won't hurt. Part of #2237 Fixes dEQP-VK.binding_model.shader_access.primary_cmd_buf.uniform_texel_buffer.* Reviewed-by: Jonathan Marek <jonathan@marek.ca> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3522> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3522>	2020-01-23 02:40:09 +00:00
Eric Anholt	3abfde13be	turnip: Add support for non-zero (still constant) UBO buffer indices. This was actually all ready to go at this point, and just needed to increment by the value. Fixes dEQP-VK.binding_model.shader_access.primary_cmd_buf.uniform_buffer.* Reviewed-by: Jonathan Marek <jonathan@marek.ca> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3504> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3504>	2020-01-22 02:13:38 +00:00
Jonathan Marek	5f791df0d0	turnip: fix array/matrix varyings Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3109> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3109>	2020-01-21 20:36:08 -05:00
Jonathan Marek	c171765223	turnip: remove tu_sort_variables_by_location nir_assign_io_var_locations already does sorting. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3109>	2020-01-21 20:36:08 -05:00
Jonathan Marek	1736447f27	freedreno/ir3: allow inputs with the same location turnip can have multiple inputs with the same location, and different location_frac. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3109>	2020-01-21 20:36:08 -05:00
Eric Anholt	d1166a3b3a	turnip: Disable UBWC on images used as storage images. The closed GL driver doesn't use UBWC on any storage images. It does tile mostly (skipping tiling on writeonly images, it seems), but for freedreno we've been enabling tiling in all cases and it's fine. We do need to disable UBWC, as tests fail otherwise and just plugging in the equivalent UBWC regs like we were setting up a texture isn't enough. Fixes dEQP-VK.image.atomic_operations.* Reviewed-by: Jonathan Marek <jonathan@marek.ca> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3433> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3433>	2020-01-21 19:29:59 +00:00
Eric Anholt	e5ce365cde	turnip: Add limited support for storage images. So far this doesn't handle the texture state-based storage image access loads, and doesn't support descriptor arrays (same as SSBOs). The texture side is more tricky, since we have another remapping table to work around. This is enough to get some of dEQP-VK.image.atomic_operations.* working. Reviewed-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3433>	2020-01-21 19:29:59 +00:00
Eric Anholt	85e424c591	turnip: Refactor the intrinsic lowering. Too many things in one function, split them out based on the intrinsic. Reviewed-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3433>	2020-01-21 19:29:59 +00:00
Eric Anholt	3ac662e8df	turnip: Fix some whitespace around binary operators. Conforms to mesa style and the rest of turnip. Reviewed-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3433>	2020-01-21 19:29:59 +00:00
Eric Anholt	fb6fca0037	freedreno: Stop scattered remapping of SSBOs/images to IBOs. Just make it be all SSBOs then all storage images. The remapping table was there to make it so that the big gap present from gallium's atomic lowering would get cleaned up, but that's no longer the case. The table has made it very hard to support Vulkan storage images, so it's time for it to go. This does mean that an SSBO/IBO that is only loaded (or size-queried) will now occupy a slot in the table where it wouldn't before. This seems like a minor cost compared to being able to drop this much logic. With the remapping table gone, SSBO array handling for turnip just falls out. Fixes many array cases of dEQP-VK.binding_model.shader_access.primary_cmd_buf.storage_buffer.* Reviewed-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Jonathan Marek <jonathan@marek.ca> (turnip) Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3240> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3240>	2020-01-21 10:06:23 -08:00
Eric Anholt	2dc2055157	turnip: Refactor linkage state setup. As I touch this for descriptor set reworks, I don't want to have to update it twice. Reviewed-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3240>	2020-01-21 10:06:23 -08:00
Hyunjun Ko	26d93a7495	turnip: fix invalid VK_ERROR_OUT_OF_POOL_MEMORY When VK_DESCRIPTOR_TYPE_SAMPLER is provided, it doesn't need to be counted as a buffer count. Otherwise it leads to mismatch of allocated buffer size, hitting VK_ERROR_OUT_OF_POOL_MEMORY finally. Fixes: `c39afe68f0` Also fixes amber tests: ./tests/cases/address_modes_float.amber ./tests/cases/address_modes_int.amber ./tests/cases/magfilter_linear.amber ./tests/cases/magfilter_nearest.amber Reviewed-by: Jonathan Marek <jonathan@marek.ca>	2020-01-21 10:29:16 +01:00
Jason Ekstrand	210e68874b	vulkan/wsi: Use the interface from the real modifiers extension The anv implementation still isn't quite complete, but we can at least start using the structs from the real extension. v2: Fix circular pNext list (Lionel) Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3434>	2020-01-17 18:27:29 +00:00
Jason Ekstrand	75755e0eba	turnip: Pretend to support Vulkan 1.2 It doesn't really support any Vulkan properly yet so why not claim 1.2? This was an easier way of fixing the build than trying to roll it forward to a later version of ANV's entrypoint generator scripts.	2020-01-15 08:34:57 -06:00
Rob Clark	2629cb627c	freedreno/ir3: rename instructions Turns out this range of opcodes are more general purpose if/else/endif instructions. We should re-work tess to create a basic block and use normal flow control. And possibly (for a6xx+) optimize cases to use if/else/endif when appropriate. Signed-off-by: Rob Clark <robdclark@chromium.org> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3398> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3398>	2020-01-15 00:56:24 +00:00
Jason Ekstrand	d3737002ee	nir/lower_atomics_to_ssbo: Also lower barriers This is more correct for a pass which is supposed to completely lower away atomic counters. It also lets us stop supporting atomic counter barriers in most of the drivers. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3307> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3307>	2020-01-13 17:23:47 +00:00
Jason Ekstrand	e40b11bbcb	nir: Rename nir_intrinsic_barrier to control_barrier This is a more explicit name now that we don't want it to be doing any memory barrier stuff for us. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3307>	2020-01-13 17:23:47 +00:00
Jason Ekstrand	60097cc840	nir: Add a new memory_barrier_tcs_patch intrinsic Right now, it's implemented as a no-op for everyone. For most drivers, it's a switch case in the NIR -> whatever which just breaks. For ir3, they already have code to delete tessellation barriers so we just add a case to also delete memory_barrier_tcs_patch. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3307>	2020-01-13 17:23:47 +00:00
Lasse Lopperi	3de2774dcb	freedreno/drm: Fix memory leak in softpin implementation Free the memory allocated for cmds/reloc_bos array when destroying the associated ringbuffer. For a similar fix for the non-softpin implementation see: `d014af98b7` Closes: https://gitlab.freedesktop.org/mesa/mesa/issues/2324 Fixes: `f3cc0d2` ("freedreno: import libdrm_freedreno + redesign submit") Signed-off-by: Lasse Lopperi <lasse.lopperi@ge.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3342> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3342>	2020-01-10 16:21:35 +00:00
Kristian H. Kristensen	f9d35ea55b	ir3: Set up full/half register conflicts correctly Setting up transitive conflicts between a full register and its two half registers (eg r0.x and hr0.x and hr0.y) will make the half registers conflict. They don't actually conflict and this prevents us from using both at the same time. Add and use a new ra helper that sets up transitive conflicts between a register and its subregisters, except it carefully avoids the subregister conflict. Signed-off-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Rob Clark <robdclark@chromium.org>	2020-01-09 16:03:25 -08:00
Bas Nieuwenhuizen	b72182fcfa	turnip: Use VK_NULL_HANDLE instead of NULL. Only occurrence of implicitly converting pointer->int. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/2570>	2020-01-02 11:47:02 +00:00
Rob Clark	0c32063794	freedreno/ir3: fix flat shading again These days `ctx->inputs` is the split scalar input components and `ir->inputs` is the full vecN. This got fixed in the load_input case, but the load_interpolated_input case was missed. Fixes: `bdf6b7018c` ("freedreno/ir3: re-work shader inputs/outputs") Signed-off-by: Rob Clark <robdclark@chromium.org>	2019-12-24 17:16:31 +00:00
Jonathan Marek	13adce2845	turnip: disable B8G8R8 vertex formats Looks like swap doesn't work as expected on these, disable them. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3170> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3170>	2019-12-19 19:03:02 -05:00
Jonathan Marek	b9d4c10e4b	turnip: minor warning fixes Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3177> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3177>	2019-12-19 23:21:01 +00:00
Jonathan Marek	e9a32af3bf	turnip: implement secondary command buffers Uses a new "tu_cs_add_entries" function because tu_cs_emit_call doesn't work inside draw_cs (which is already called by tu_cs_emit_call). Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3075> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3075>	2019-12-19 20:42:08 +00:00
Jonathan Marek	85fff42d08	turnip: compute gmem offsets at renderpass creation time This makes it easier to implement secondary command buffers, since we no longer need to know the render area to set the gmem offsets for input attachments and CmdClearAttachments. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3075>	2019-12-19 20:42:08 +00:00
Jonathan Marek	f81c41a812	turnip: emit_compute_driver_params fixes Offset was wrong, it is in vec4 not dwords. There's a hole between DP_NUM_WORK_GROUPS_Z and DP_LOCAL_GROUP_SIZE_X so use the IR3 enums. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3162> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3162>	2019-12-19 15:13:40 -05:00
Jonathan Marek	bb134c5316	turnip: emit base instance vs driver param Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3162>	2019-12-19 15:13:40 -05:00
Jonathan Marek	a3a70588c0	freedreno/ir3: support load_base_instance Not supported by hardware, uses same mechanism as base vertex. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3162>	2019-12-19 15:13:40 -05:00
Jonathan Marek	5c17d9b9ca	freedreno/registers: document vertex/instance id offset bits Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3162>	2019-12-19 15:13:40 -05:00
Kristian H. Kristensen	e4c2bb6a93	freedreno/a6xx: RB6_R8G8B8 is actually 32 bit RGBX Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/2848>	2019-12-19 09:56:05 -08:00
Jonathan Marek	fe4a8df9a8	freedreno/ir3: fix vertex shader sysvals with pre_assign_inputs The first pre_assign_inputs loop doesn't pre-assign sysvals, so skip the second part for sysvals. The sysvals don't need to be pre-assigned since the state for those isn't shared between binning / nonbinning shaders. Fixes assert failures in cases where the sysvals didn't end up in the same registers for binning / nonbinning. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Rob Clark <robdclark@gmail.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3168> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3168>	2019-12-19 11:31:12 -05:00
Jonathan Marek	5785bcc8a0	turnip: don't set SP_FS_CTRL_REG0_VARYING if only fragcoord is used Fixes artifacts in the subpasses demo, which has a shader using fragcoord without any varyings. It looks like setting this bit when there are no varyings can cause weirdness in some cases (without this change, if the previous shader had <= 8 varyings it would work, but with 9 varyings it would have artifacts). Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3143> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3143>	2019-12-18 19:03:37 -05:00
Jonathan Marek	4a59bc6df2	turnip: add cache invalidate to fix input attachment cases Fixes artifacts in the subpasses demo. Workaround texture cache with input attachments from GMEM by adding a cache invalidate between subpasses. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3143>	2019-12-18 19:03:37 -05:00
Connor Abbott	648cc22afb	freedreno: Fix CP_MEM_TO_REG flag definitions These actually mean something completely different, at least on A5xx and A6xx. The only other usage of the old flags on something older than A6xx was a typo, so I don't know if it was always this way, but at the same time it means that we don't have to worry too much about that. Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3116> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3116>	2019-12-18 23:09:05 +01:00
Connor Abbott	4c5ac156c3	freedreno: Use new macros for CP_WAIT_REG_MEM and CP_WAIT_MEM_GTE Similar to the existing usage for CP_COND_WRITE5, this makes it clear what each of the magic parameters are for. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3116>	2019-12-18 23:09:00 +01:00
Connor Abbott	cfa1fb895a	a6xx: Add more CP packets And add fields uncovered by looking at the firmware. I think this covers all the memory, register, and scratch manipulation opcodes that exist on A6xx, plus one additional nice find for Vulkan and describing a previously unknown opcode and documenting CP_WAIT_REG_MEM. Note that the bits for the CP_REG_TO_MEM count, as well as the formula for computing the actual count for both CP_REG_TO_MEM and CP_MEM_TO_REG, are changed because the A630 SQE firmware actually does something different. I haven't investigated older microcodes to see whether this extends back to A5xx and A4xx, but the only non-A6xx uses of this field result in the same bit-pattern when using the A6xx bit range and formula, so it should be safe to change the definition universally. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3116>	2019-12-18 23:08:55 +01:00
Jonathan Marek	072e95e07a	freedreno/ir3: update prefetch input_offset when packing inlocs If the input location changes then prefetch input_offset needs to change. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3141> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3141>	2019-12-17 16:41:13 -05:00
Kristian H. Kristensen	9aaa23fbad	freedreno/a6xx: Document the CP_SET_DRAW_STATE enable bits There are bits for binning, gmem and sysmem. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Rob Clark <robdclark@chromium.org> Signed-off-by: Kristian H. Kristensen <hoegsberg@google.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3131> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3131>	2019-12-17 11:45:20 -08:00
Eric Anholt	2da68c8649	turnip: Fix support for immutable samplers. We were setting up the hardware sampler state when updating a combined image sampler, but never looking at the immutable sampler in the separate case. Fixes failures in dEQP-VK.binding_model.shader_access.primary_cmd_buf.sampler_immutable.fragment.* Reviewed-by: Jonathan Marek <jonathan@marek.ca> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3127> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3127>	2019-12-16 19:51:27 -08:00
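
Note on commit `c5334d2943` (freedreno/drm: Don't miscalculate timeout): the message says `s * 1000000000` overflows for `s >= 5`. That threshold matches a product held in an unsigned 32-bit integer, since 5 * 10^9 exceeds 2^32 - 1. The sketch below only models that arithmetic in Python by masking to 32 or 64 bits. The function names are hypothetical, and the 32-bit unsigned type is an assumption inferred from the threshold, not taken from the Mesa source.

```python
# Model of a seconds-to-nanoseconds conversion at two integer widths.
# Assumption: the original product was computed in an unsigned 32-bit type;
# the commit message only states that it overflows for s >= 5.

NSEC_PER_SEC = 1_000_000_000
UINT32_MASK = 0xFFFFFFFF
UINT64_MASK = 0xFFFFFFFFFFFFFFFF


def timeout_ns_32bit(seconds: int) -> int:
    """Product as an unsigned 32-bit integer would hold it (wraps mod 2**32)."""
    return (seconds * NSEC_PER_SEC) & UINT32_MASK


def timeout_ns_64bit(seconds: int) -> int:
    """Product as an unsigned 64-bit integer would hold it."""
    return (seconds * NSEC_PER_SEC) & UINT64_MASK


if __name__ == "__main__":
    for s in (4, 5):
        print(s, timeout_ns_32bit(s), timeout_ns_64bit(s))
```

At 4 seconds both widths agree. At 5 seconds the 32-bit result wraps to a value well under one second's worth of nanoseconds, so a wait computed this way would end far too early.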

984 Commits