KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Eric Anholt	314f0c57e4	vc4: Refactor qir_get_op_nsrc(enum qop) to qir_get_nsrc(struct qinst *). Every caller was dereffing the qinst, and this will let us make the number of sources vary depending on the destination of the qinst so that we can have general ALU ops that store to tex_[strb] and get an implicit uniform.	2016-11-29 08:38:59 -08:00
Eric Anholt	7c7bcbbc7d	vc4: Drop destination register when it's unused. This slightly reduces instructions on shader-db, but I think it's just perturbing register allocation -- the allocator should have always trivially colored these nodes, before. This commit is just to make QIR code failing more intelligible when register allocation fails.	2016-10-06 18:09:24 -07:00
Eric Anholt	ee69cfd11d	vc4: Convert vc4_opt_dead_code to work in the presence of control flow. With control flow, we can't be sure that we'll see the uses of a variable before its def as we walk backwards. Given that NIR is eliminating our long chains of dead code, a simple solution for now seems fine. This slightly changes the order of some optimizations, and so an opt_vpm happens before opt_dce, causing 3 dead MOVs to be turned into dead FMAXes in Minecraft: instructions in affected programs: 52 -> 54 (3.85%)	2016-07-13 23:54:15 -07:00
Eric Anholt	6c1f834a23	vc4: Create a basic block structure and move the instructions into it. The optimization passes and scheduling aren't actually ready for multiple blocks with control flow yet (as seen by the "cur_block" references in them instead of iterating over blocks), but this creates the structures necessary for converting them.	2016-07-12 15:47:26 -07:00
Eric Anholt	200b4e4bd5	vc4: Move SF removal to a separate peephole pass. The DCE pass is going to change significantly to handle control flow, while we don't really need to change it for the SF handling. We also need to add some more SF peephole optimization for SF updates generated by control flow support. No change on shader-db.	2016-07-04 16:33:22 -07:00
Eric Anholt	aa76ba6f2f	vc4: DCE instructions with a NULL destination. I'm going to add an optimization for redundant SF update removal, which will just remove the SF and leave us (in many cases) with an instruction with a NULL destination and no side effects. Rather than teaching that pass whether the whole instruction can be removed, leave that responsibility to this pass.	2016-07-04 16:33:22 -07:00
Eric Anholt	3bee7581e6	vc4: Drop the support for DCE of texture instructions. Now that we're using NIR for our optimization, there's no need for this tricky code.	2016-05-02 11:06:29 -07:00
Eric Anholt	a298fb15af	vc4: Reuse list_for_each_entry_safe_rev(). This didn't exist when I wrote the code.	2016-03-16 11:28:34 -07:00
Eric Anholt	cfa980f493	vc4: convert from tgsi semantic/index to varying-slot (originally part of previous patch, split out to separate patch by Rob) v2: squash in some fixes from Eric v3: Another fix from Eric for point coords. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-09-16 15:07:08 -04:00
Eric Anholt	78c773bb36	vc4: Convert from simple_list.h to list.h list.h is a nicer and more familiar set of list functions/macros.	2015-05-29 22:09:53 -07:00
Eric Anholt	85316d059c	vc4: Keep an array of pointers to instructions defining the temps around. The optimization passes are always regenerating it and throwing it away, but it's not hard to keep track of.	2015-02-19 23:35:17 -08:00
Eric Anholt	3f1e1287fd	vc4: Make SF be a flag on the QIR instructions. Right now the places that used to emit a mov.sf just put the SF on the previous instruction when it generated the source of the SF value. Even without optimization to push the sf up further (and kill thus potentially kill more MOVs), this gets us: total uniforms in shared programs: 13455 -> 13457 (0.01%) uniforms in affected programs: 3 -> 5 (66.67%) total instructions in shared programs: 40296 -> 40198 (-0.24%) instructions in affected programs: 12595 -> 12497 (-0.78%)	2015-02-12 16:33:16 -08:00
Eric Anholt	a8e14c293b	vc4: Allow dead code elimination of VPM reads. This gets a bunch of dead reads out of the CSes, which don't read most attributes generally. total instructions in shared programs: 39753 -> 39487 (-0.67%) instructions in affected programs: 4721 -> 4455 (-5.63%)	2015-01-10 20:55:37 +13:00
Eric Anholt	c772c92153	vc4: Split two notions of instructions having side effects. Some ops can't be DCEd, while some of the ops that are just important due to the args they have can be.	2015-01-10 15:24:46 +13:00
Eric Anholt	f87c700895	vc4: Add support for ARL and indirect register access on TGSI_FILE_CONSTANT. Fixes 14 ARB_vp tests (which had no lowering done), and should improve performance of indirect uniform array access in GLSL.	2014-10-28 17:16:05 -07:00
Eric Anholt	52824811b9	vc4: Allow dead code elimination of unused varyings. total instructions in shared programs: 39022 -> 37341 (-4.31%) instructions in affected programs: 26979 -> 25298 (-6.23%) total uniforms in shared programs: 11242 -> 10523 (-6.40%) uniforms in affected programs: 5836 -> 5117 (-12.32%)	2014-10-24 18:04:26 +01:00
Eric Anholt	40748cf8d9	vc4: Eliminate unused texture instructions.	2014-10-09 21:47:06 +02:00
Eric Anholt	b73cab6826	vc4: Dead code eliminate unused SF instructions.	2014-10-09 21:47:06 +02:00
Eric Anholt	64122b16ce	vc4: Dump constant uniform values in VC4_DEBUG=qir. Definitely helps when trying to understand and optimize a program.	2014-09-29 11:33:34 -07:00
Eric Anholt	2147dd9681	vc4: Fix memory leaks of struct qinst.	2014-09-15 13:12:27 -07:00
Eric Anholt	4bca922878	vc4: Merge qcompile and tgsi_to_qir The split between these two didn't make much sense. I'm going to want the chance to look at uniform contents in optimization passes, and the QPU emit I think is going to end up rewriting the uniforms stream.	2014-09-04 17:00:54 -07:00
Eric Anholt	b0a1e401a9	vc4: Include stdio/stdlib in headers so I don't have to include it per file. There are a few tools I want to have always available, and fprintf() and abort() are among them.	2014-08-22 10:16:57 -07:00
Eric Anholt	d9d1c14430	vc4: Add dead code elimination. This cleans up a bunch of noise in the compiled coordinate shaders (since we don't need the varying outputs), and also from writemasked instructions with negated src operands.	2014-08-08 18:59:47 -07:00

23 Commits