KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Jason Ekstrand	7e1d21edbf	nir: Move get_const_initializer_load from vars_to_ssa to NIR core Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-04-22 18:10:40 -07:00
Jason Ekstrand	02f03fc0f1	nir/tex: Use the correct return size for query_levels and lod Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-04-22 18:10:40 -07:00
Jason Ekstrand	94669cb534	nir: Refactor tex_instr_dest_size to use a switch statement Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-04-22 18:10:40 -07:00
Ian Romanick	67a8610caf	nir: Silence unused parameter warnings nir/nir.h: In function 'nir_validate_shader': nir/nir.h:1567:56: warning: unused parameter 'shader' [-Wunused-parameter] static inline void nir_validate_shader(nir_shader shader) { } ^ nir/nir_opt_cse.c: In function 'src_is_ssa': nir/nir_opt_cse.c:165:32: warning: unused parameter 'data' [-Wunused-parameter] src_is_ssa(nir_src src, void data) ^ nir/nir_opt_cse.c: In function 'dest_is_ssa': nir/nir_opt_cse.c:171:35: warning: unused parameter 'data' [-Wunused-parameter] dest_is_ssa(nir_dest dest, void *data) ^ Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-04-14 16:41:04 -07:00
Kenneth Graunke	b3e286c457	nir: Store num_direct_uniforms in the nir_shader. Storing this here is pretty sketchy - I don't know if any driver other than i965 will want to use it. But this will make it a lot easier to generate NIR code at link time. We'll probably rework it anyway. (Ian suggested making nir_assign_var_locations_scalar_direct_first simply modify the nir_shader's fields, rather than passing pointers to them. If this stays long term, we should do that. But Jason and I suspect we'll be reworking this area again in the near future.) Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-04-11 11:39:48 -07:00
Rob Clark	58add76791	nir: split out lower_sub from lower_negate Originally you had to have one or the other. But actually I don't want either. (Or rather I want whatever is the minimum # of instructions.) TODO: not sure where the best place to insert a check that driver hasn't set both lower_negate and lower_sub? Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-04-11 10:43:16 -04:00
Kenneth Graunke	500da98e0b	nir: Constify nir_lower_sampler's gl_shader_program pointer. Now that we're not generating linker errors, we don't actually modify this. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2015-04-10 02:16:33 -07:00
Kenneth Graunke	99264b7f37	nir: Make nir_lower_samplers take a gl_shader_stage, not a gl_program *. We don't actually need a gl_program struct. We only used it to translate prog->Target (i.e. GL_VERTEX_PROGRAM) to the gl_shader_stage (i.e. MESA_SHADER_VERTEX). We may as well just pass that. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2015-04-10 02:16:29 -07:00
Kenneth Graunke	4b27391cad	nir: Move gl_shader_stage enum from mtypes.h to shader_enums.h. I want to use this in some code that doesn't currently include mtypes.h. It seems like a better place for it anyway. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2015-04-10 02:16:27 -07:00
Jason Ekstrand	11694737fc	nir: Make nir__instr_create take a nir_shader instead of a void context Signed-off-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-04-07 14:34:21 -07:00
Kenneth Graunke	a10d493715	nir: Implement a nir_sweep() pass. This pass performs a mark and sweep pass over a nir_shader's associated memory - anything still connected to the program will be kept, and any dead memory we dropped on the floor will be freed. The expectation is that this will be called when finished building and optimizing the shader. However, it's also fine to call it earlier, and many times, to free up memory earlier. v2: (feedback from Jason Ekstrand) - Skip sweeping impl->start_block, as it's already in the CF list. - Don't sweep SSA defs (they're owned by their defining instruction) - Don't steal phi sources (they're owned by nir_phi_instr). - Don't steal tex->src (it's owned by the tex_inst itself) - Don't sweep dereference chains (top-level dereferences are owned by the instruction; sub-dereferences are owned by the parent deref). - Don't sweep sources and destinations (SSA defs are handled as part of the defining instruction, and registers are handled as part of function implementations). - Just steal instructions; don't walk them (no longer required). v3: (feedback from Jason Ekstrand) - Steal indirect sources from nir_src/nir_dest. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-04-07 14:34:14 -07:00
Rob Clark	f2ecc95e44	nir: add lowering for idiv/udiv/umod Based on the algo from NV50LegalizeSSA::handleDIV() and handleMOD(). See also trans_idiv() in freedreno/ir3/ir3_compiler.c (which was an adaptation of the nv50 code from Ilia Mirkin). A python/numpy script which implements the same algorithm (and is possibly useful for debugging or analysis) can be found here: http://people.freedesktop.org/~robclark/div-lowering.py I've tested this on i965 hacked up to insert the idiv lowering pass, and on freedreno with NIR frontend. Signed-off-by: Rob Clark <robclark@freedesktop.org> Tested-by: Eric Anholt <eric@anholt.net> (vc4)	2015-04-05 09:20:35 -04:00
Rob Clark	6829d76e02	nir: add option to lower slt/sge/seq/sne In freedreno these get implemented as the matching f* instruction plus a u2f to convert the result to float 1.0/0.0. But less lines of code to just let nir_opt_algebraic handle this for us, plus opens up some small window for other opt passes to improve (ie. if some shader ended up with both a flt and slt with same src args, for example). v2: use b2f rather than u2f Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2015-04-05 08:56:24 -04:00
Jason Ekstrand	52e718097f	nir: Add a cubemap normalizing pass This commit adds a pass to L1-normalize cube-map coordinates. Some hardware such as i965 requires that largest cube-map coordinate is +-1. We had a pass to perform this normalization in GLSL IR but we need it in NIR for cube maps on ARB programs to work correctly. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> v2 (Suggested by Eric): - Do a vector fabs and split into components later - Move to core NIR Reviewed-by: Eric Anholt <eric@anholt.net>	2015-04-03 14:12:49 -07:00
Jason Ekstrand	7bdba4a245	nir: Add a src_get_parent_instr function Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2015-04-03 14:04:12 -07:00
Eric Anholt	ea811b7868	nir: Add a lowering pass for texture projectors. Not much hardware wants them these days, and it might give us a chance to do CSE or algebraic at the NIR level. Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-04-03 11:50:24 -07:00
Jason Ekstrand	da294f9b2f	nir/algebraic: Add a seperate section for "late" optimizations i965/nir: Use the late optimizations Reviewed-by: Matt Turner <mattst88@gmail.com>	2015-04-01 12:51:03 -07:00
Eric Anholt	afa9fc1561	nir: Add optional lowering of flrp. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-03-27 13:29:48 -07:00
Jason Ekstrand	8a33f95b7a	nir/lower_io: Add a assign_locations function that sorts by [in]direct use v2: Delete the set of indirectly accessed variables when we're done with it v3: Rename from _packed to _scalar Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-03-19 13:18:39 -07:00
Jason Ekstrand	25db44a845	nir/lower_io: Make variable location assignment a manual operation Previously, we just assigned variable locations in nir_lower_io. Now, we force the user to assign variable locations for us. This gives the backend a bit more control over where variables are placed. v2: Rename from _packed to _scalar Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-03-19 13:18:39 -07:00
Jason Ekstrand	639115123e	nir: Use a list instead of a hash_table for inputs, outputs, and uniforms We never did a single hash table lookup in the entire NIR code base that I found so there was no real benifit to doing it that way. I suppose that for linking, we'll probably want to be able to lookup by name but we can leave building that hash table to the linker. In the mean time this was causing problems with GLSL IR -> NIR because GLSL IR doesn't guarantee us unique names of uniforms, etc. This was causing massive rendering isues in the unreal4 Sun Temple demo. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-03-19 13:18:38 -07:00
Kenneth Graunke	b200cbb0a4	nir: Add native_integers to nir_shader_compiler_options. glsl_to_nir, tgsi_to_nir, and prog_to_nir all want to know whether the driver supports native integers. Presumably other passes may as well. Adding this to nir_shader_compiler_options is an easy way to provide that information, as it's accessible via nir_shader::options. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-03-08 20:03:57 -07:00
Kenneth Graunke	2561aea6b3	nir: Delete nir_shader::user_structures and num_user_structures. Nothing actually uses these, and the only caller of glsl_to_nir() (brw_fs_nir.cpp) always passes NULL for the _mesa_glsl_parse_state pointer, meaning they'll always be NULL and 0, respectively. Just delete them. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-03-08 20:03:44 -07:00
Jason Ekstrand	c750ecaa12	nir/register: Add a parent_instr field This adds a parent_instr field similar to the one for ssa_def. The difference here is that the parent_instr field on a nir_register can be NULL if the register does not have a unique definition or if that definition does not dominate all its uses. We set this field in the out-of-SSA pass so that backends can get SSA-like information even after they have gone out of SSA. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2015-02-24 14:08:04 -08:00
Eric Anholt	bef38f62e0	nir: Drop dependency on mtypes.h for core NIR. One less new directory necessary for gallium code that wants to interact with NIR. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2015-02-20 11:36:34 -08:00
Eric Anholt	b53d035825	util: Move Mesa's bitset.h to util/. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2015-02-20 11:36:34 -08:00
Jason Ekstrand	190073c737	nir: Add a global code motion (GCM) pass v2 Jason Ekstrand <jason.ekstrand@intel.com>: - Use nir_dominance_lca for computing least common anscestors - Use the block index for comparing dominance tree depths - Pin things that do partial derivatives Reviewed-by: Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-02-19 17:06:17 -08:00
Jason Ekstrand	a52a4b5223	nir/instr: Change "live" to a more generic "pass_flags" field Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-02-19 17:06:17 -08:00
Jason Ekstrand	3d25afc51c	nir: Make nir_[cf_node/instr]_[prev/next] return null if at the end Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-02-19 17:06:17 -08:00
Jason Ekstrand	f481a9425c	nir/dominance: Add a constant-time mechanism for comparing blocks This is mostly thanks to Connor. The idea is to do a depth-first search that computes pre and post indices for all the blocks. We can then figure out if one block dominates another in constant time by two simple comparison operations. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-02-19 17:06:17 -08:00
Jason Ekstrand	b4c5489c8a	nir/dominance: Expose the dominance intersection function Being able to find the least common anscestor in the dominance tree is a useful thing that we may want to do in other passes. In particular, we need it for GCM. v2: Handle NULL inputs by returning the other block Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-02-19 17:06:16 -08:00
Eric Anholt	1907a3a7ee	nir: Add a flag for lowering fsat. vc4 cse/algebraic-disabled stats: total instructions in shared programs: 44356 -> 44354 (-0.00%) instructions in affected programs: 55 -> 53 (-3.64%) v2: Rebase to master (no TGSI->NIR present) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (v1)	2015-02-18 14:47:51 -08:00
Eric Anholt	e5ecf8e427	nir: Add a flag for lowering ffma. vc4 cse/algebraic-disabled stats: total uniforms in shared programs: 13966 -> 13791 (-1.25%) uniforms in affected programs: 435 -> 260 (-40.23%) total instructions in shared programs: 44732 -> 44356 (-0.84%) instructions in affected programs: 9599 -> 9223 (-3.92%) v2: Rebase to master (no TGSI->NIR present) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (v1)	2015-02-18 14:47:51 -08:00
Eric Anholt	42a8ace66e	nir: Add a flag for lowering fneg/ineg. vc4 cse/algebraic-disabled stats: total instructions in shared programs: 44911 -> 44732 (-0.40%) instructions in affected programs: 11371 -> 11192 (-1.57%) v2: Fix broken iabs(isub(0, a)) transformation. v3: Rebase to master (no TGSI->NIR present) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (v1)	2015-02-18 14:47:51 -08:00
Eric Anholt	cb95a228e8	nir: Add a flag for lowering fsqrt(x) to frcp(frsqrt(x)). vc4 cse/algebraic-disabled stats: total uniforms in shared programs: 13972 -> 13966 (-0.04%) uniforms in affected programs: 408 -> 402 (-1.47%) total instructions in shared programs: 44973 -> 44911 (-0.14%) instructions in affected programs: 1551 -> 1489 (-4.00%) v2: Rebase to master (no TGSI->NIR present) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (v1)	2015-02-18 14:47:50 -08:00
Eric Anholt	8e9dbfff17	nir: Conditionalize the POW reconstruction on shader compiler options. Mesa has a shader compiler struct flagging whether GLSL IR's opt_algebraic and other passes should try and generate certain types of opcodes or patterns. Extend that to NIR by defining our own struct, which is automatically generated from the Mesa struct in glsl_to_nir and provided directly by the driver in TGSI-to-NIR. v2: Split out the previous two prep patches. v3: Rebase to master (no TGSI->NIR present) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (v2)	2015-02-18 14:47:50 -08:00
Eric Anholt	f90bb54734	nir: Add a nir_shader_compiler_options struct pointed to by the shaders. This will be used to give the optimization passes a chance to customize behavior for the particular target device. v2: Rebase to master (no TGSI->NIR present) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (v1)	2015-02-18 14:47:50 -08:00
Kenneth Graunke	480ee1f0b4	nir: Mark nir_print_instr's instr pointer as const. Printing instructions doesn't modify them, so we can mark the parameter const. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-02-10 03:37:55 -08:00
Connor Abbott	a135f34080	nir: add an optimization to remove useless phi nodes This removes phi nodes whose sources all point to the same thing. Shader-db results: total NIR instructions in shared programs: 2045293 -> 2041209 (-0.20%) NIR instructions in affected programs: 126564 -> 122480 (-3.23%) helped: 615 HURT: 0 total FS instructions in shared programs: 4321840 -> 4320392 (-0.03%) FS instructions in affected programs: 24622 -> 23174 (-5.88%) helped: 138 HURT: 0 Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Tested-by: Jason Ekstrand <jason.ekstrand@intel.com> Signed-off-by: Connor Abbott <cwabbott0@gmail.com>	2015-02-03 16:00:13 -05:00
Jason Ekstrand	f2adcd36cb	nir: Add a pass to lower vector phi nodes to scalar phi nodes v2 Jason Ekstrand <jason.ekstrand@intel.com>: - Add better comments - Use nir_ssa_dest_init and nir_src_for_ssa more places - Fix some void * casts v3 Jason Ekstrand <jason.ekstrand@intel.com>: - Rework the way we determine whether or not to sccalarize a phi node to make the recursion non-bogus - Treat load_const instructions as scalarizable v4 Jason Ekstrand <jason.ekstrand@intel.com>: - Allow uniform and input loads to be scalarizable v5 Jason Ekstrand <jason.ekstrand@intel.com>: - Also consider loads of inputs (varying, uniform, or ubo) to be scalarizable. We were already doing this for load_var on uniforms and inputs. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-02-03 12:33:11 -08:00
Jason Ekstrand	5ab1489ae6	nir: Add an invalid type This allows us to indicate a concept of an invalid type. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-01-29 17:07:45 -08:00
Connor Abbott	816f0515a2	nir: add a helper function for getting the number of source components Unlike with non-SSA ALU instructions, where if they're per-component you have to look at the writemask to know which source channels are being used, SSA ALU instructions always have all the possible channels enabled so we can just look at the number of components in the SSA definition for per-component instructions to say how many source components are being used. v2: use new name nir_ssa_alu_instr_src_components() Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Signed-off-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-26 21:26:36 -05:00
Jason Ekstrand	27c6e3e4ca	nir: Use pointers for nir_src_copy and nir_dest_copy This avoids the overhead of copying structures and better matches the newly added nir_alu_src_copy and nir_alu_dest_copy. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-26 11:24:58 -08:00
Connor Abbott	fa4bc6c130	nir: use Python to autogenerate opcode information Before, we used a system where a file, nir_opcodes.h, defined some macros that were included to generate the enum values and the nir_op_infos structure. This worked pretty well, but for development the error messages were never very useful, Python tools couldn't understand the opcode list, and it was difficult to use nir_opcodes.h to do other things like autogenerate a builder API. Now, we store opcode information in nir_opcodes.py, and we have nir_opcodes_c.py to generate the old nir_opcodes.c and nir_opcodes_h.py to generate nir_opcodes.h, which contains all the enum names and gets included into nir.h like before. In addition to solving the above problems, using Python and Mako to generate everything means that it's much easier to add keep information centralized as we add new things like constant propagation that require per-opcode information. v2: - make Opcode derive from object (Dylan) - don't use assert like it's a function (Dylan) - style fixes for fnoise, use xrange (Dylan) - use iterkeys() in nir_opcodes_h.py (Dylan) - use pydoc-style comments (Jason) - don't make fmin/fmax commutative and associative yet (Jason) Signed-off-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> v3 Jason Ekstrand <jason.ekstrand@intel.com> - Alphabetize source file lists - Generate nir_opcodes.h in the builddir instead of the source dir - Include $(builddir)/src/glsl/nir in the i965 build - Rework nir_opcodes.h generation so it generates a complete header file instead of one that has to be embedded inside an enum declaration	2015-01-24 21:33:56 -08:00
Eric Anholt	0680d170d1	nir: Expose nir_print_instr() for debug prints It's nice to have this present in your default cases so you can see what instruction is triggering an abort. v2: Just pass a NULL state, now that it won't crash when you do. Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-01-23 17:30:11 -08:00
Eric Anholt	447ddfc137	nir: Add nir_lower_alu_to_scalar. This is the equivalent of brw_fs_channel_expressions.cpp, which I wanted for vc4. v2: Use the nir_src_for_ssa() helper, and another instance of nir_alu_src_copy(). v3: Drop the non-SSA support. All intended callers will have SSA-only ALU ops. v4: Use insert_before, drop stale bcsel/fcsel comment, drop now-unused unsupported() function, drop lower_context struct. v5: Completely rename the pass to nir_lower_alu_to_scalar(), add an assert about weird input_sizes[]. Reviewed-by: Jason Ekstrand <jason.ekstrand@iastate.edu>	2015-01-23 16:37:23 -08:00
Eric Anholt	b200127816	nir: Make some helpers for copying ALU src/dests. There aren't many users yet, but I wanted to do this from my scalarizing pass. v2: Constify the src arguments. Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-01-23 16:37:16 -08:00
Eric Anholt	534a4ec82f	nir: Make an easier helper for setting up SSA defs. Almost all instructions we nir_ssa_def_init() for are nir_dests, and you have to keep from forgetting to set is_ssa when you do. Just provide the simpler helper, instead. Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-01-22 13:52:19 -08:00
Matt Turner	28b7c6b285	nir: Replace assert(0) with unreachable(). Fixes a couple of warnings in the process. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-21 21:06:37 -08:00
Jason Ekstrand	7da60eca4f	nir: Add src and dest constructors Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-21 12:21:10 -08:00
Jason Ekstrand	194f6235b3	nir: Add a nir_foreach_phi_src helper macro Reviewed-by: Connor Abbott <cwabbott02gmail.com>	2015-01-20 16:53:29 -08:00
Jason Ekstrand	4c99e3ae78	util: Move main/set to util/hash_set Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2015-01-15 13:21:27 -08:00
Jason Ekstrand	d3636da902	nir: Add a pass for lowering copy instructions Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:24 -08:00
Jason Ekstrand	55b5058e69	nir: Rename lower_variables to lower_vars_to_ssa The original name wasn't particularly descriptive. This one indicates that it actually gives you SSA values as opposed to the old pass which lowered variables to registers. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:24 -08:00
Jason Ekstrand	4aa6162f6e	nir/tex_instr: Add a nir_tex_src struct and dynamically allocate the src array This solves a number of problems. First is the ability to change the number of sources that a texture instruction has. Second, it solves the delema that may occur if a texture instruction has more than 4 sources. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:24 -08:00
Jason Ekstrand	dcb1acdea0	nir/validate: Only build in debug mode Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:24 -08:00
Jason Ekstrand	e4115ca9d8	nir: Make intrinsic flags into an enum This should be much better for debugging as GDB will pick up on the fact that it's an enum and actually tell you what you're looking at instead of giving you some arbitrary hex value you have to go look up. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:23 -08:00
Jason Ekstrand	ed13f4e716	nir: Use static inlines instead of macros for list getters This should make debugging a lot easier as GDB handles static inlines much better than macros. Also, static inlines are typesafe. Reviewed-By: Glenn Kennard <glenn.kennard@gmail.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:23 -08:00
Jason Ekstrand	b95fae034f	nir/variable: Remove the constant_value field This was a left-over relic of GLSL IR that we aren't using for anything. If we ever want that value again, we can add it back, but NIR constant folding should be just as good as GLSL IR's if not better pretty soon, so I'm not worried about it. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:23 -08:00
Jason Ekstrand	8599b30c67	nir: Add some documentation Signed-off-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:23 -08:00
Jason Ekstrand	40ca129ed5	nir: Rename parallel_copy_copy to parallel_copy_entry and add a foreach macro parallel_copy_copy was a silly name. Also, things were getting long and annoying, so I added a foreach macro. For historical reasons, several of the original iterations over parallel copy entries in from_ssa used the _safe variants of the loop. However, all of these no longer ever remove an entry so it's ok to make them all use the normal iterator. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:23 -08:00
Jason Ekstrand	1b720c6ed8	nir/from_ssa: Clean up parallel copy handling and document it better Previously, we were doing a lazy creation of the parallel copy instructions. This is confusing, hard to get right, and involves some extra state tracking of the copies. This commit adds an extra walk over the basic blocks to add the block-end parallel copies up front. This should be much less confusing and, consequently, easier to get right. This commit also adds more comments about parallel copies to help explain what all is going on. As a consequence of these changes, we can now remove the at_end parameter from nir_parallel_copy_instr. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:23 -08:00
Jason Ekstrand	de73d1e173	nir: Rename nir_block_following_if to nir_block_get_following_if The new name is a little longer but less confusing. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:23 -08:00
Jason Ekstrand	813316d150	nir/opcodes: Remove the per_component info field Originally, this field was intended for determining if the given instruction acted per-component or if it had mismatching source and destination sizes that would have to be interpreted specially. However, we can easily derive this from output_size == 0, so it's not really that useful. Also, the values we were setting in nir_opcodes.h for this field were completely bogus and it was never used. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:22 -08:00
Jason Ekstrand	46f3e1ab50	nir/opcodes: Add algebraic properties metadata This commit adds some algebraic properties to the metadata of each opcode in NIR. In particular, you now know, just from the metadata, if a given opcode is commutative or associative. This will be useful for algebraic transformation passes that want to be able to match a + b as well as b + a in one go. v2: Make algebraic properties all caps. This was more consistent with the intrinsics flags and seems better for flags in general. Also, the enums are now declared with (1 << n) rather then hex values. v3: fmin and fmax technically aren't commutative or associative. Things get funny when one of the arguments is a NaN. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:22 -08:00
Jason Ekstrand	2c7da78805	nir: Make load_const SSA-only As it was, we weren't ever using load_const in a non-SSA way. This allows us to substantially simplify the load_const instruction. If we ever need a non-SSA constant load, we can do a load_const and an imov. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:22 -08:00
Jason Ekstrand	675ffdef30	nir: Make nir_ssa_undef_instr_create initialize the destination Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:22 -08:00
Jason Ekstrand	193fea9eb6	nir: Add a foreach_ssa_def function There are some functions whose destinations are SSA-only and so aren't a nir_dest. This provides a function that is capable of iterating over the SSA definitions defined by those functions. If you want registers, you should use the old iterator. v2: Kenneth Graunke <kenneth@whitecape.org>: - Fix nir_foreach_ssa_def's return value. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:22 -08:00
Jason Ekstrand	dfb3abbaec	nir: Remove predication We stopped generating predicates in glsl_to_nir some time ago. Right now, it's all dead untested code that I'm not convinced always worked in the first place. If we decide we want them back, we can revert this patch. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:21 -08:00
Jason Ekstrand	b6c81b3ff4	nir/metadata: Rename metadata_dirty to metadata_preserve nir_metadata_dirty was a terrible name because the parameter it takes is the metadata to be preserved. This is really confusing because it looks like it's doing the opposite of what it is actually doing. Now it's named sensibly. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:21 -08:00
Jason Ekstrand	62ac0ee804	nir/tex_instr: Rename the indirect source type and add an array size In particular, we rename nir_tex_src_sampler_index to _sampler_offset and add a sampler_array_size field to nir_tex_instr. This way we can pass the size of sampler arrays through to backends even after removing the variable information and, with it, the type. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:21 -08:00
Jason Ekstrand	cd4b995254	nir: Make texture instruction names more consistent This commit renames nir_instr_as_texture to nir_instr_as_tex and renames nir_instr_type_texture to nir_instr_type_tex to be consistent with nir_tex_instr. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:21 -08:00
Jason Ekstrand	f77f4c00ce	nir: Add a basic constant folding pass Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:20 -08:00
Jason Ekstrand	d5410bd8f6	nir: Add an algebraic optimization pass This pass uses the previously built algebraic transformations framework and should act as an example for anyone else wanting to make an algebraic transformation pass for NIR. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:20 -08:00
Jason Ekstrand	8edcd1de14	nir: Make the type casting operations static inline functions Previously, the casting operations were macros. While this is usually fine, the casting macro used the input parameter twice leading to strange behavior when you passed the result of another function into it. Since we know the source and destination types explicitly, we don't loose anything by making it a function. Also, this gives us a nice little macro for creating cast function that will hopefully prevent mistyping. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:20 -08:00
Jason Ekstrand	919426631b	nir: Add a lowering pass for adding source modifiers where possible Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:20:20 -08:00
Jason Ekstrand	a3ad7fdf33	nir: Add a helper for getting a constant value from an SSA source Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:03 -08:00
Jason Ekstrand	27663dbe8e	nir: Vectorize intrinsics We used to have the number of components built into the intrinsic. This meant that all of our load/store intrinsics had vec1, vec2, vec3, and vec4 variants. This lead to piles of switch statements to generate the correct intrinsic names, and introspection to figure out the number of components. We can make things much nicer by allowing "vectorized" intrinsics. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:03 -08:00
Jason Ekstrand	d1d12efb36	nir: Remove the old variable lowering code Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:03 -08:00
Jason Ekstrand	6962c332e5	nir: Add a pass to lower global variables to local variables Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:02 -08:00
Jason Ekstrand	619b2e2499	nir: Add a pass for lowering input/output loads/stores Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:02 -08:00
Jason Ekstrand	aff431293b	nir: Add a pass to lower local variables to registers Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:02 -08:00
Jason Ekstrand	d477beab07	nir: Add a pass to lower local variable accesses to SSA values This pass analizes all of the load/store operations and, when a variable is never aliased (potentially used by an indirect operation), it is lowered directly to an SSA value. This pass translates to SSA directly and does not require any fixup by the original to-SSA pass. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:02 -08:00
Jason Ekstrand	615ba5ad04	nir: Add a copy splitting pass Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:02 -08:00
Jason Ekstrand	6a52d2af2f	nir: Don't require a function in ssa_def_init Instead, we give SSA definitions a temporary index of 0xFFFFFFFF if the instruction does not have a block and a proper index when it actually gets added to the list. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:02 -08:00
Jason Ekstrand	829aa98320	nir: Use an integer index for specifying structure fields Previously, we used a string name. It was nice for translating out of GLSL IR (which also does that) but cumbersome the rest of the time. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:02 -08:00
Jason Ekstrand	4f8230e247	nir: Add a concept of a wildcard array dereference Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:02 -08:00
Jason Ekstrand	b5143edaee	nir: Make array deref direct vs. indirect an enum Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:02 -08:00
Jason Ekstrand	cd01de0812	nir: Add a helper for rewriting an instruction source Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:01 -08:00
Jason Ekstrand	6bdce55c44	nir: Add a basic CSE pass This pass is still fairly basic. It only handles ALU operations, constant loads, and phi nodes. No texture ops or intrinsics yet. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:01 -08:00
Jason Ekstrand	20a5812606	nir: Add a fused multiply-add peephole	2015-01-15 07:19:01 -08:00
Jason Ekstrand	13ec15bdbf	nir: Add a peephole select optimization Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:01 -08:00
Jason Ekstrand	4f44120ff5	nir: Add a function for comparing two sources Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:00 -08:00
Jason Ekstrand	366181d826	nir: Add a parallel copy instruction type Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:00 -08:00
Jason Ekstrand	7de6b7fc3e	nir: Add a function for rewriting all the uses of a SSA def Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:00 -08:00
Jason Ekstrand	fbc443ad56	nir: Add an initialization function for SSA definitions Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:00 -08:00
Jason Ekstrand	f86902e75d	nir: Add an SSA-based liveness analysis pass. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:00 -08:00
Jason Ekstrand	c9a21c725d	nir: set reg_alloc and ssa_alloc when indexing registers and SSA values Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:00 -08:00
Jason Ekstrand	d7e482d32c	nir: Add a function to detect if a block is immediately followed by an if Since we don't actually have an "if" instruction, this is a very common pattern when iterating over instructions. This adds a helper function for it to make things a little less painful. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:00 -08:00
Jason Ekstrand	dfdf0c4673	nir: Add a foreach_block_reverse function Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:00 -08:00
Jason Ekstrand	49911cf4db	nir: Add a basic metadata management system Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:00 -08:00
Jason Ekstrand	9d986d19d0	nir: Add a lower_vec_to_movs pass Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:19:00 -08:00
Jason Ekstrand	2943522d80	nir: Add a naieve from-SSA pass This pass is kind of stupidly implemented but it should be enough to get us up and going. We probably want something better that doesn't generate all of the redundant moves eventually. However, the i965 backend should be able to handle the movs, so I'm not too worried about it in the short term.	2015-01-15 07:18:59 -08:00
Jason Ekstrand	4b4f90dbff	nir: Add NIR_TRUE and NIR_FALSE constants and use them for boolean immediates Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-01-15 07:18:59 -08:00
Connor Abbott	7602385ac5	nir: add an SSA-based dead code elimination pass v2: Jason Ekstrand <jason.ekstrand@intel.com>: whitespace fixes	2015-01-15 07:18:58 -08:00
Connor Abbott	8b7cb7674c	nir: add an SSA-based copy propagation pass	2015-01-15 07:18:58 -08:00
Connor Abbott	4553887d4a	nir: add a pass to convert to SSA v2: Jason Ekstrand <jason.ekstrand@intel.com>: whitespace fixes	2015-01-15 07:18:58 -08:00
Connor Abbott	b559ee709b	nir: calculate dominance information	2015-01-15 07:18:58 -08:00
Connor Abbott	cff1deff72	nir: add an optimization to turn global registers into local registers After linking and inlining, this allows us to convert these registers into SSA values and optimise more code.	2015-01-15 07:18:58 -08:00
Connor Abbott	613bf6818a	nir: add a pass to lower atomics v2: Jason Ekstrand <jason.ekstrand@intel.com> whitespace fixes	2015-01-15 07:18:58 -08:00
Connor Abbott	8692c6a023	nir: add a pass to lower system value reads v2: Jason Ekstrand <jason.ekstrand@intel.com>: whitespace fixes	2015-01-15 07:18:58 -08:00
Connor Abbott	8cdcfce5ce	nir: add a pass to lower sampler instructions	2015-01-15 07:18:58 -08:00
Connor Abbott	370e875b32	nir: add a pass to remove unused variables After we lower variables, we want to delete them in order to free up some memory. v2: Jason Ekstrand <jason.ekstrand@intel.com>: whitespace fixes	2015-01-15 07:18:58 -08:00
Connor Abbott	494790b2a9	nir: keep track of the number of input, output, and uniform slots	2015-01-15 07:18:58 -08:00
Connor Abbott	c2f36cf125	nir: add a pass to lower variables for scalar backends	2015-01-15 07:18:58 -08:00
Connor Abbott	dbb76421da	nir: add a validation pass This is similar to ir_validate.cpp. v2: Jason Ekstrand <jason.ekstrand@intel.com>: whitespace fixes	2015-01-15 07:18:58 -08:00
Connor Abbott	98fa28bff7	nir: add a printer This is similar to ir_print_visitor.cpp. v2: Jason Ekstrand <jason.ekstrand@intel.com>: whitespace fixes	2015-01-15 07:18:58 -08:00
Connor Abbott	2812e5de93	nir: add core helper functions These include functions for adding and removing various bits of IR and helpers for iterating over all the sources and destinations of an instruction. This is similar to ir.cpp. v2: Jason Ekstrand <jason.ekstrand@intel.com>: whitespace and automake fixes	2015-01-15 07:18:58 -08:00
Jason Ekstrand	f521a3c543	SQUASH: Use the enum for the variable mode	2015-01-15 07:18:57 -08:00
Connor Abbott	30c4678f64	nir: add the core datastructures This includes all the instructions, ifs, loops, functions, etc. This is similar to the information in ir.h. v2: Jason Ekstrand <jason.ekstrand@intel.com>: Include ralloc and hash_table from the util directory whitespace fixes Signed-off-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-By glenn.kennard <glenn.kennard@gmail.com>	2015-01-15 07:18:57 -08:00

1 2 3 4 5

220 Commits