KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Eric Anholt	314f0c57e4	vc4: Refactor qir_get_op_nsrc(enum qop) to qir_get_nsrc(struct qinst *). Every caller was dereffing the qinst, and this will let us make the number of sources vary depending on the destination of the qinst so that we can have general ALU ops that store to tex_[strb] and get an implicit uniform.	2016-11-29 08:38:59 -08:00
Eric Anholt	8ce6526178	vc4: Add support for MUL output rotation. Extracted from a patch by jonasarrow on github.	2016-08-25 17:24:11 -07:00
Eric Anholt	4e797bd98f	vc4: Update copy propagation for control flow. Previously, we could assume that a MOV from a temp was always an available copy, because all temps were SSA in NIR, and their non-SSA state in QIR was just due to the fact that they were from a bcsel or pack_unorm_4x8, so we could use the current value of the temp after that series of QIR instructions to define it. However, this is no longer the case with control flow. Instead, we track a new array of MOVs defined within the block that haven't had their source or dest killed yet, and use that primarily. We fall back to looking through the QIR defs array to handle across-block MOVs, but now require that copies from the SSA defs have an SSA src as well.	2016-07-13 23:54:15 -07:00
Eric Anholt	d3cdbf6fd8	vc4: Add a "qir_for_each_inst_inorder" macro and use it in many places. We have the prior list_foreach() all over the code, but I need to move where instructions live as part of adding support for control flow. Start by just converting to a helper iterator macro. (The simpler "qir_for_each_inst()" will be used for the for-each-inst-in-a-block iterator macro later)	2016-07-12 15:47:25 -07:00
Eric Anholt	3359ad6cda	vc4: Add support for copy propagation with unpack flags present. total instructions in shared programs: 89251 -> 87862 (-1.56%) instructions in affected programs: 52971 -> 51582 (-2.62%)	2015-10-26 16:48:34 -07:00
Eric Anholt	6eb0760f48	vc4: Only copy propagate raw MOVs. No problems being fixed, but needed for the new unpack changes.	2015-10-26 16:48:34 -07:00
Eric Anholt	2002438c91	vc4: Use the SSA defs list for figuring out eligible MOVs for copy prop. I thought I'd converted this over previously. It was copy propagating MOVs badly with the new destination packing flags.	2015-08-21 13:29:26 -07:00
Eric Anholt	cc8fb29046	vc4: Make r4-writes implicitly move to a temp, and allocate temps to r4. Previously, SFU values always moved to a temporary, and TLB color reads and texture reads always lived in r4. Instead, we can have these results just be normal temporaries, and the register allocator can leave the values in r4 when they don't interfere with anything else using r4. shader-db results: total instructions in shared programs: 100809 -> 100040 (-0.76%) instructions in affected programs: 42383 -> 41614 (-1.81%)	2015-08-04 17:19:01 -07:00
Eric Anholt	78c773bb36	vc4: Convert from simple_list.h to list.h list.h is a nicer and more familiar set of list functions/macros.	2015-05-29 22:09:53 -07:00
Eric Anholt	85316d059c	vc4: Keep an array of pointers to instructions defining the temps around. The optimization passes are always regenerating it and throwing it away, but it's not hard to keep track of.	2015-02-19 23:35:17 -08:00
Eric Anholt	14dc281c13	vc4: Enforce one-uniform-per-instruction after optimization. This lets us more intelligently decide which uniform values should be put into temporaries, by choosing the most reused values to push to temps first. total uniforms in shared programs: 13457 -> 13433 (-0.18%) uniforms in affected programs: 1524 -> 1500 (-1.57%) total instructions in shared programs: 40198 -> 40019 (-0.45%) instructions in affected programs: 6027 -> 5848 (-2.97%) I noticed this opportunity because with the NIR work, some programs were happening to make different uniform copy propagation choices that significantly increased instruction counts.	2015-02-19 23:35:17 -08:00
Eric Anholt	a58ae83882	vc4: Redo VPM reads as a read file. This will let us do copy propagation of the VPM reads.	2015-01-10 14:35:24 +13:00
Eric Anholt	3093bfacf0	vc4: Fix the no-copy-propagating-from-TLB_COLOR_READ check. Our MOV's dst obviously won't be the TLB_COLOR_READ's def, because we're ssa.	2015-01-10 13:54:12 +13:00
Eric Anholt	93cac2637b	vc4: Prevent copy propagating out the MOVs from r4. Copy propagating these might result in reading the r4 after some other instruction has written r4. Just prevent all copy propagation of this for now. Fixes bad rendering with upcoming indirect register access support, where the copy propagation was consistently happening across another read.	2014-10-09 21:47:06 +02:00
Eric Anholt	64122b16ce	vc4: Dump constant uniform values in VC4_DEBUG=qir. Definitely helps when trying to understand and optimize a program.	2014-09-29 11:33:34 -07:00
Eric Anholt	71d4fc88d6	vc4: Allow copy propagation of uniforms. Fixes 12 piglit tests (and 8 more crash -> fail) from reducing register pressure.	2014-09-17 14:21:24 -07:00
Eric Anholt	4bca922878	vc4: Merge qcompile and tgsi_to_qir The split between these two didn't make much sense. I'm going to want the chance to look at uniform contents in optimization passes, and the QPU emit I think is going to end up rewriting the uniforms stream.	2014-09-04 17:00:54 -07:00
Eric Anholt	b0a1e401a9	vc4: Include stdio/stdlib in headers so I don't have to include it per file. There are a few tools I want to have always available, and fprintf() and abort() are among them.	2014-08-22 10:16:57 -07:00
Eric Anholt	ec9da314ba	vc4: Add copy propagation between temps. We put in a bunch of extra MOVs for program outputs, and this can clean those up. We should do uniforms, too, though. v2: Fix missing flagging of progress when we actually optimize. Caught by Aaron Watry.	2014-08-08 18:59:47 -07:00

19 Commits