KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Giuseppe Bilotta	60a27ad122	Remove wrongly repeated words in comments Clean up misrepetitions ('if if', 'the the' etc) found throughout the comments. This has been done manually, after grepping case-insensitively for duplicate if, is, the, then, do, for, an, plus a few other typos corrected in fly-by v2: * proper commit message and non-joke title; * replace two 'as is' followed by 'is' to 'as-is'. v3: * 'a integer' => 'an integer' and similar (originally spotted by Jason Ekstrand, I fixed a few other similar ones while at it) Signed-off-by: Giuseppe Bilotta <giuseppe.bilotta@gmail.com> Reviewed-by: Chad Versace <chad.versace@intel.com>	2016-06-23 13:55:03 -07:00
Jason Ekstrand	bec07b7292	nir/alu_to_scalar: Respect the exact ALU operation qualifier Just setting builder->exact isn't sufficient because that only applies to instructions that are built with the builder but instructions created manually and only inserted using the builder are left alone. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-06-20 12:02:55 -07:00
Jason Ekstrand	202751fbb7	nir: Add a pass for propagating invariant decorations This pass is similar to propagate_invariance in the GLSL compiler. The real "output" of this pass is that any algebraic operations which are eventually consumed by an invariant variable get marked as "exact". Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-06-20 12:02:45 -07:00
Jason Ekstrand	68e308d853	nir/algebraic: Remove imprecise flog2 optimizations While mathematically correct, these two optimizations result in an expression with substantially lower precision than the original. For any positive finite floating-point value, log2(x) is well-defined and finite. More precisely, it is in the range [-150, 150] so any sum of logarithms log2(a) + log2(b) is also well-defined and finite as long as a and b are both positive and finite. However, if a and b are either very small or very large, their product may get flushed to infinity or zero causing log2(a * b) to be nowhere close to log2(a) + log2(b). This imprecision was causing incorrect rendering in Talos Principal because part of its HDR rendering process involves doing 8 texture operations, clamping the result to [0, 65000], taking a dot-product with a constant, and then taking the log2. This is done 6 or 8 times and summed to produce the final result which is written to a red texture. In cases where you have a region of the screen that is very dark, it can end up getting a result value of -inf which is not what is intended. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Matt Turner <mattst88@gmail.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=96425 Cc: "11.1 11.2 12.0" <mesa-stable@lists.freedesktop.org>	2016-06-20 11:56:57 -07:00
Jason Ekstrand	4d3b8318a7	nir/info: Get rid of uses_interp_var_at_offset We were using this briefly in the i965 driver to trigger recompiles but we haven't been using it since we switched to the NIR y-transform lowering pass. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-06-03 19:29:28 -07:00
Jason Ekstrand	45542f554c	nir/lower_indirect_derefs: Use the direct array deref for recursion This fixes about 100 of the new Vulkan CTS tests. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-06-03 19:29:28 -07:00
Rob Clark	dfbae7d64f	nir/algebraic: support for power-of-two optimizations Some optimizations, like converting integer multiply/divide into left/ right shifts, have additional constraints on the search expression. Like requiring that a variable is a constant power of two. Support these cases by allowing a fxn name to be appended to the search var expression (ie. "a#32(is_power_of_two)"). Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-06-03 16:05:03 -04:00
Jordan Justen	8f48d23e0f	i965: Add nir channel_num system value v2: * simd16/32 fixes (curro) Cc: "12.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-06-01 19:29:02 -07:00
Jordan Justen	6f316c9d86	nir: Make lowering gl_LocalInvocationIndex optional Cc: "12.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-06-01 19:29:02 -07:00
Ilia Mirkin	ca135a2612	nir: allow sat on all float destination types With the introduction of fp64 and fp16 to nir, there are now a bunch of float types running around. A F1 2015 shader ends up with an i2f.sat operation, which has a nir_type_float32 destination. Allow sat on all the float destination types. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "12.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-06-01 10:44:40 -04:00
Jason Ekstrand	0482efdc93	nir/inline: Also rewrite param derefs for texture instructions Without this, samplers get left hanging as derefs to variables that don't actually exist. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2016-05-27 10:28:27 -07:00
Jason Ekstrand	2522180845	nir/inline: Break the guts of rewrite_param-derefs into a helper Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2016-05-27 10:28:27 -07:00
Jason Ekstrand	d19c406395	nir/inline: Make the rewrite_param_derefs helper work on instructions Now that we have the better nir_foreach_block macro, there's no reason to use the archaic block version for everything. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2016-05-27 10:28:27 -07:00
Jason Ekstrand	2fcba404f8	nir/inline: Don't use foreach_instr_safe unless we need to Suggested-by: Connor Abbott <cwabbott0@gmail.com>	2016-05-27 10:28:27 -07:00
Jason Ekstrand	15e553daf0	nir: Make nir_const_value a union There's no good reason for it to be a struct of an anonymous union. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=96221 Tested-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-05-26 16:03:44 -07:00
Jason Ekstrand	32210dea8e	compiler: Move glsl_to_nir to libglsl.la Right now libglsl.la depends on libnir.la so putting it in libnir.la adds a dependency on libglsl.la that goes the wrong direction. Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2016-05-26 14:13:38 -07:00
Matt Turner	4a5e92ac70	nir: Strengthen assertion that 'out' is nonnull. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com>	2016-05-25 12:44:34 -07:00
Kristian Høgsberg Kristensen	595224f714	mesa: Add .gitignore entries for make check binaries Signed-off-by: Kristian Høgsberg Kristensen <krh@bitplanet.net> Acked-by: Matt Turner <mattst88@gmail.com>	2016-05-25 09:41:44 -07:00
Kristian Høgsberg Kristensen	a41b57679f	nir: Add a lowering pass for YUV textures This lowers sampling from YUV textures to 1) one or more texture instructions to sample each plane and 2) color space conversion to RGB. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-05-24 10:14:56 -07:00
Kristian Høgsberg Kristensen	50c24c3ff3	nir: Handle NULL in nir_copy_deref() Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-05-24 10:14:56 -07:00
Kristian Høgsberg Kristensen	29921ee987	nir: Add new 'plane' texture source type This will be used to select the plane to sample from for planar textures. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-05-24 10:14:56 -07:00
Jason Ekstrand	66e137ecf1	nir/lower_samplers: Protect against sampler index overflow Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-05-23 19:12:34 -07:00
Rob Clark	5245d845b6	nir/validate: fix null deref coverity warning CID 1265536 (#1 of 2): Explicit null dereferenced (FORWARD_NULL)6. var_deref_op: Dereferencing null pointer parent. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-05-23 10:14:50 -04:00
Iago Toral Quiroga	38b719d624	nir: handle double-precision in fsign, fsat, fnot and frcp I think these are not strictly necessary since the floats in them should be automatically promoted to doubles when operated with double sources, but it makes things more explicit at least. Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-05-23 08:54:37 +02:00
Iago Toral Quiroga	3f73039ade	nir: handle double-precision in fabs, frsq and fsqrt Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-05-23 08:54:28 +02:00
Kenneth Graunke	f7eb95a526	nir: Fix crash in nir_lower_wpos_center(). Otherwise we rewrote the fadd to use itself, causing crashes in validation. Instead, start after the last use like we should. A brown paper bag fix. Fixes crashes in several Vulkan tests. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2016-05-20 16:33:24 -07:00
Kenneth Graunke	6e5d86c07a	nir: Add a simple nir_lower_wpos_center() pass for Vulkan drivers. nir_lower_wpos_ytransform() is great for OpenGL, which allows applications to choose whether their coordinate system's origin is upper left/lower left, and whether the pixel center should be on integer/half-integer boundaries. Vulkan, however, has much simpler requirements: the pixel center is always half-integer, and the origin is always upper left. No coordinate transform is needed - we just need to add <0.5, 0.5>. This means that we can avoid using (and setting up) a uniform. I thought about adding more options to nir_lower_wpos_ytransform(), but making a new pass that never even touched uniforms seemed simpler. v2: Use normal iterator rather than _safe variant (noticed by Matt). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Rob Clark <robdclark@gmail.com>	2016-05-20 14:30:00 -07:00
Kenneth Graunke	12ab7fc6ac	nir: Don't use ffma in nir_lower_wpos_ytransform(). ffma is an explicitly fused multiply add with higher precision. The optimizer will take care of promoting mul/add to fma when it's beneficial to do so. This fixes failures on Gen4-5 when using this pass, as those platforms don't actually implement fma(). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2016-05-20 14:29:04 -07:00
Kenneth Graunke	b8b1b1c34c	nir: Handle fddy_fine and fddy_coarse in nir_lower_wpos_ytransform. These also need flipping! Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Rob Clark <robdclark@gmail.com>	2016-05-20 14:29:04 -07:00
Kenneth Graunke	4b7577fad8	nir: Make lower_wpos_ytransform_block a void function. The return value was used for the old nir_foreach_block callback system, but at this point it no longer means anything. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Rob Clark <robdclark@gmail.com>	2016-05-20 14:29:04 -07:00
Kenneth Graunke	88ea960aa7	nir: Make nir_lower_wpos_ytransform() match FragCoord by location. gl_FragCoord is a shader input with location == VARYING_SLOT_POS. ARB_fragment_programs have an equivalent input at VARYING_SLOT_POS, but it isn't called gl_FragCoord. We do want to transform it. Matching by location guarantees we catch both. Fixes several fp tests on a branch which uses this pass on i965. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Rob Clark <robdclark@gmail.com>	2016-05-20 14:29:04 -07:00
Kenneth Graunke	c9192fcbd2	nir: Add interp_var_at_offset flipping. The Y-offset needs flipping as well, similar to ddy. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Rob Clark <robdclark@gmail.com>	2016-05-20 14:29:04 -07:00
Kenneth Graunke	287f099db1	nir: Fix fddy swizzles in nir_lower_wpos_ytransform(). The original value might have been swizzled. That's taken care of in the fmul source - we don't want to reswizzle it again. Fixes validation failures in glsl-derivs-varyings on a branch of mine which uses this pass in i965. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Rob Clark <robdclark@gmail.com>	2016-05-20 14:29:04 -07:00
Kenneth Graunke	7fe9a19302	nir: Fix wpos_ytransform lowering state_slot swizzle. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Rob Clark <robdclark@gmail.com>	2016-05-20 14:28:30 -07:00
Rob Clark	df361fc58c	nir/validate: assume() that hashtable entry exists At this point, it would require a logic error in nir_validate to not have already populated this hashtable entry, but coverity doesn't realize that: CID 1265547 (#1 of 1): Dereference null return value (NULL_RETURNS)3. dereference: Dereferencing a null pointer entry. CID 1271039 (#1 of 1): Dereference null return value (NULL_RETURNS)3. dereference: Dereferencing a null pointer entry. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-05-20 11:13:50 -04:00
Rob Clark	fcd6b3f42b	nir: coverity unitialized pointer read Not sure how coverity arrives at the conclusion that we can read comp[j] unitialized (around line 204), other than not being aware that ncomp is greater than 1 so it won't underflow in the 'if (tex->is_array)' case. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-05-20 11:13:50 -04:00
Rob Clark	53c48feae0	nir: coverity sign-extension fix Not 100% sure, but I think being an unsigned literal will help: CID 1358505 (#1 of 1): Unintended sign extension (SIGN_EXTENSION)sign_extension: Suspicious implicit sign extension: load1->def.num_components with type unsigned char (8 bits, unsigned) is promoted in load1->def.num_components * (load1->def.bit_size / 8) to type int (32 bits, signed), then sign-extended to type unsigned long (64 bits, unsigned). If load1->def.num_components * (load1->def.bit_size / 8) is greater than 0x7FFFFFFF, the upper bits of the result will all be 1. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-05-20 11:13:50 -04:00
Rob Clark	bb993da795	nir/glsl_to_nir: quell some uninit_member coverity errors Signed-off-by: Rob Clark <robclark@freedesktop.org> Acked-by: Matt Turner <mattst88@gmail.com>	2016-05-20 11:13:50 -04:00
Rob Clark	e8beffb1b3	nir/validate: dump annotated shader with error msgs Log all the errors, and at the end dump the shader w/ error annotations to make it easier to see where the problems are. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2016-05-17 10:05:20 -04:00
Rob Clark	54ecfcc162	nir/validate: assert() -> validate_assert() Prep work for next patch. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2016-05-17 10:05:20 -04:00
Rob Clark	a0ef26c1c2	nir/print: add support for print annotations Caller can pass a hashtable mapping NIR object (currently instr or var, but I guess others could be added as needed) to annotation msg to print inline with the shader dump. As the annotation msg is printed, it is removed from the hashtable to give the caller a way to know about any unassociated msgs. This is used in the next patch, for nir_validate to try to associate error msgs to nir_print dump. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2016-05-17 10:05:20 -04:00
Juan A. Suarez Romero	80535873bb	nir: add double input bitmap This bitmap tracks which input attributes are double-precision. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-05-17 09:05:54 +02:00
Matt Turner	4191551262	nir: Mark nir_start_block()/nir_impl_last_block() with returns_nonnull. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-05-16 11:06:15 -07:00
Kenneth Graunke	6d65b0c6dc	nir: Add a nir->info.uses_interp_var_at_offset flag. I've added this to nir_gather_info(), but also to glsl_to_nir() as a temporary measure, since the i965 GL driver today doesn't use nir_gather_info() yet. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-05-15 23:50:28 -07:00
Rob Clark	f06343d6ea	nir: forward-declare 'struct gl_shader_program' Drop extra #include which is otherwise unneeded (and makes this header difficult to include from outside of src/mesa). Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-05-15 17:25:48 -04:00
Rob Clark	79d6409a14	nir: return progress from lower_idiv With algebraic-opt support for lowering div to shift, the driver would like to be able to run this pass after the main opt-loop, and then conditionally re-run the opt-loop if this pass actually lowered some- thing. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-05-15 17:25:48 -04:00
Rob Clark	8b24f7b440	nir: fix comment typo about f2d/d2f Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-05-15 17:25:47 -04:00
Jason Ekstrand	f47faa4316	nir: Add texture opcodes and source types for multisample compression Intel hardware does a form of multisample compression that involves an auxilary surface called the MCS. When an MCS is in use, you have to first sample from the MCS with a special opcode and then pass the result of that operation into the next sample instrucion. Normally, we just do this ourselves in the back-end, but we want to expose that functionality to NIR so that we can use MCS values directly in NIR-based blorp. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-05-14 13:34:44 -07:00
Jason Ekstrand	87a41e862b	nir/builder: Add a helper for grabbing multiple channels from an ssa def This is similar to nir_channel except that it lets you grab more than one channel by providing a mask. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-05-14 13:34:40 -07:00
Jason Ekstrand	fc58cb543f	nir/builder: Generate the alu helpers directly in python There's no reason for having a macro and a python generator. We can easily just do the whole thing in python. This has the advantage that we are no longer definining ALU# macros which conflict with the ones in brw_fs_builder.h. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-05-14 13:34:38 -07:00
Jason Ekstrand	a2f50d87b6	nir: Add an info bit for uses_sample_qualifier Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-05-14 13:33:52 -07:00
Ian Romanick	8f05a0a4c0	nir: Remove empty visit_call_src and visit_load_const_src functions The guts were removed in `dfb3abba`. It has been almost exactly a year, so I dont think we're going to "decide we want [predication] back." Silences several "unused parameter" warnings: nir/nir.c: In function ‘visit_call_src’: nir/nir.c:1052:32: warning: unused parameter ‘instr’ [-Wunused-parameter] visit_call_src(nir_call_instr instr, nir_foreach_src_cb cb, void state) ^ nir/nir.c:1052:58: warning: unused parameter ‘cb’ [-Wunused-parameter] visit_call_src(nir_call_instr instr, nir_foreach_src_cb cb, void state) ^ nir/nir.c:1052:68: warning: unused parameter ‘state’ [-Wunused-parameter] visit_call_src(nir_call_instr instr, nir_foreach_src_cb cb, void state) ^ nir/nir.c: In function ‘visit_load_const_src’: nir/nir.c:1058:44: warning: unused parameter ‘instr’ [-Wunused-parameter] visit_load_const_src(nir_load_const_instr instr, nir_foreach_src_cb cb, ^ nir/nir.c:1058:70: warning: unused parameter ‘cb’ [-Wunused-parameter] visit_load_const_src(nir_load_const_instr instr, nir_foreach_src_cb cb, ^ nir/nir.c:1059:28: warning: unused parameter ‘state’ [-Wunused-parameter] void *state) ^ v2: Add some comments in nir_foreach_src suggested by Jason. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Cc: Connor Abbott <cwabbott0@gmail.com>	2016-05-12 16:47:14 -07:00
Ian Romanick	098166e1bc	nir: Silence unused parameter warnings These cases had the parameter removed: nir/nir_lower_vec_to_movs.c: In function ‘try_coalesce’: nir/nir_lower_vec_to_movs.c:124:66: warning: unused parameter ‘shader’ [-Wunused-parameter] try_coalesce(nir_alu_instr vec, unsigned start_idx, nir_shader shader) ^ nir/nir_lower_io.c: In function ‘load_op’: nir/nir_lower_io.c:147:32: warning: unused parameter ‘state’ [-Wunused-parameter] load_op(struct lower_io_state state, ^ These cases had the parameter (void) silenced because the parameter was necessary for an interface: nir/glsl_to_nir.cpp:1900:32: warning: unused parameter 'ir' [-Wunused-parameter] nir_visitor::visit(ir_barrier ir) ^ nir/nir.c: In function ‘remove_use_cb’: nir/nir.c:802:35: warning: unused parameter ‘state’ [-Wunused-parameter] remove_use_cb(nir_src src, void state) ^ nir/nir.c: In function ‘remove_def_cb’: nir/nir.c:811:37: warning: unused parameter ‘state’ [-Wunused-parameter] remove_def_cb(nir_dest dest, void state) ^ Number of total warnings in my build reduced from 2543 to 2538 (reduction of 5). Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-05-12 16:46:41 -07:00
Rob Clark	9d3cc80b75	nir: glsl_get_bit_size() should take glsl_type It's what all the call-sites once, so gets rid of a bunch of inlined glsl_get_base_type() at the call-sites. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-05-12 13:39:40 -04:00
Jason Ekstrand	1b72c31e1f	nir/algebraic: Separate ffma lowering from fusing The i965 driver has its own pass for fusing mul+add combinations that's much smarter than what nir_opt_algebraic can do so we don't want to get the nir_opt_algebraic one just because we didn't set lower_ffma. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-05-11 11:44:35 -07:00
Rob Clark	dfbabc6bad	nir/lower-io: add support for lowering inputs Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-05-11 12:20:11 -04:00
Rob Clark	595f9d5476	nir/lower-io: split out some helper fxns Prep work to reduce the noise in the next patch. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-05-11 12:20:11 -04:00
Rob Clark	b085016f94	nir: rename lower_outputs_to_temporaries -> lower_io_to_temporaries Since it will gain support to lower inputs, give it a more generic name. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-05-11 12:20:11 -04:00
Rob Clark	47fcef9a20	nir: move callsite of lower_outputs_to_temporaries Going to convert this pass to parameterized lower_io_to_temporaries, and we want the user to be able to specify whether to lower outputs or inputs or both. The restriction of running this pass before validate to avoid output reads no longer applies. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-05-11 12:20:11 -04:00
Rob Clark	5261947260	nir: lower-io-types pass A pass to lower complex (struct/array/mat) inputs/outputs to primitive types. This allows, for example, linking that removes unused components of a larger type which is not indirectly accessed. In the near term, it is needed for gallium (mesa/st) support for NIR, since only used components of a type are assigned VBO slots, and we otherwise have no way to represent that to the driver backend. But it should be useful for doing shader linking in NIR. v2: use glsl_count_attribute_slots() rather than passing a type_size fxn pointer Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-05-11 12:20:11 -04:00
Rob Clark	b10cc24519	nir: passthrough-edgeflags support Handled by tgsi_emulate for glsl->tgsi case. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2016-05-11 12:20:11 -04:00
Rob Clark	3a939d034e	nir: add lowering pass for glBitmap Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2016-05-11 12:20:11 -04:00
Rob Clark	12c18ce476	nir: add lowering pass for glDrawPixels Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2016-05-11 12:20:11 -04:00
Rob Clark	b26645a00f	nir: add lowering pass for y-transform Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2016-05-11 12:20:11 -04:00
Jose Fonseca	94e8653a3b	Revert "nir: Try to warn when C99 extensions are used in nir headers." This reverts commit `99474dc29b`. -Wpedantic is too verbose, even when applied to just a few includes. We'll just have to deal with the issues as they come. Reviewed-by: Brian Paul <brianp@vmware.com>	2016-05-10 03:29:24 -07:00
Eduardo Lima Mitev	60a5d02416	nir/print: Print memory qualifiers in a variable declaration Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-05-10 06:22:05 +02:00
Rob Clark	f096096b77	nir/search: fix typo Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-05-09 12:46:24 -04:00
Jose Fonseca	8ae78f7d28	nir: Remove spurious return from void function. Left over from `450c061362`. Trivial. Built locally with clang and gcc. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=95296	2016-05-06 12:03:34 +01:00
Connor Abbott	4fab8dd5ea	nir: remove now-unused nir_foreach_block*_call() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-05-05 16:19:42 -07:00
Jason Ekstrand	31fc4a2528	nir/lower_double_ops: fixup for new nir_foreach_block() Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-05-05 16:19:41 -07:00
Jason Ekstrand	450c061362	nir/lower_double_pack: fixup for new nir_foreach_block() Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-05-05 16:19:41 -07:00
Jason Ekstrand	8c807cc2a6	nir/gather_info: fixup for new foreach_block() Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-05-05 16:19:41 -07:00
Connor Abbott	331b9f73a2	nir/lower_two_sided_color: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-05-05 16:19:41 -07:00
Connor Abbott	d40fbbc27e	nir/lower_tex: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-05-05 16:19:41 -07:00
Connor Abbott	8a7fe634d2	nir/lower_outputs_to_temporaries: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-05-05 16:19:41 -07:00
Kenneth Graunke	bc0062c54a	nir: Optimize out stores of undefs. There are a couple of cycle count changes in shader-db, but it's basically a wash. However, with the Broadwell scalar TCS backend enabled, many Shadow of Mordor shaders benefit from this patch. Because we don't batch up output writes for TCS, vec4 outputs might not have all components defined. Many output writes have a value of undef, which is useless. With scalar TCS, stats for tessellation shaders on Broadwell: total instructions in shared programs: 1283000 -> 1280444 (-0.20%) instructions in affected programs: 34302 -> 31746 (-7.45%) helped: 71 HURT: 0 total cycles in shared programs: 10798768 -> 10780682 (-0.17%) cycles in affected programs: 158004 -> 139918 (-11.45%) helped: 71 HURT: 0 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-05-05 14:24:00 -07:00
Kenneth Graunke	c7a8b32700	nir: Replace vecN(undef, undef, ...) with a single undef. shader-db statistics on Broadwell: total instructions in shared programs: 8963409 -> 8962455 (-0.01%) instructions in affected programs: 60858 -> 59904 (-1.57%) helped: 318 HURT: 0 total cycles in shared programs: 71408022 -> 71406276 (-0.00%) cycles in affected programs: 398416 -> 396670 (-0.44%) helped: 199 HURT: 51 GAINED: 1 The only shaders affected were in Dota 2 Reborn. It also sets up for the next optimization. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-05-05 14:24:00 -07:00
Kenneth Graunke	49ea7454a1	nir: Rename opt_undef_alu to opt_undef_csel; update comments. This better reflects what it does. I plan to add other ALU optimizations as well, so the old name would be confusing. In preparation for that, also move the file comments about csels above the opt_undef_csel function, and delete the ones about there not being other optimizations. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-05-05 14:24:00 -07:00
Thomas Hindoe Paaboel Andersen	8698194313	nir: fix assert for wildcard pairs The assert was null checking dest_arr_parent twice. The intention seems to be to check both dest_ and src_. Added in `d3636da9` Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2016-05-05 09:33:02 +02:00
Samuel Iglesias Gonsálvez	2ab2d2e588	nir: Separate 32 and 64-bit fmod lowering Split 32-bit and 64-bit fmod lowering as the drivers might need to lower them separately inside NIR depending on the HW support. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-05-04 08:07:49 +02:00
Samuel Iglesias Gonsálvez	b902377a56	nir/lower_double_ops: lower mod() There are rounding errors with the division in i965 that affect the mod(x,y) result when x = N * y. Instead of returning '0' it was returning 'y'. This lowering pass fixes those cases. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-05-04 08:07:49 +02:00
Rob Clark	dcf8c4425a	nir: make lower_clamp_color pass work after lower i/o Kinda important to work with tgsi_to_nir, which generates nir which already has i/o lowered. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-05-02 14:25:38 -04:00
Thomas Hindoe Paaboel Andersen	cbcd7b60f5	nir/lower_double_ops: fix indentation Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-30 12:16:32 -07:00
Thomas Hindoe Paaboel Andersen	21424e019d	nir/opt_dead_cf: fix indentation Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-30 12:16:29 -07:00
Thomas Hindoe Paaboel Andersen	6935726197	nir/opt_dead_cf: correction of side effect check Parenthesis are needed here as ! takes precedence over the &. The check had the opposite effect than intended. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-30 12:16:22 -07:00
Rob Clark	64abf6d404	nir: clamp-color-output support Handled by tgsi_emulate for glsl->tgsi case. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2016-04-30 14:56:19 -04:00
Jason Ekstrand	6d4a426745	nir/algebraic: Support lowering for both 64 and 32-bit ldexp Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2016-04-28 21:36:52 -07:00
Jason Ekstrand	f0af5b87ec	nir/opcodes: Make ldexp take an explicitly 32-bit int There is no sense in having the double version of ldexp take a 64-bit integer. Instead, let's just take a 32-bit int all the time. This also matches what GLSL does where both variants of ldexp take a regular integer for the exponent argument. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2016-04-28 21:36:52 -07:00
Jason Ekstrand	bee40dd730	nir/opcodes: Simplify the expressions for [un]pack_double The new expressions are more explicit in terms of where the bits go so it's a little easier to tell what's going on. This is the way GLSL specifies things so it's a bit easier to verify too. It also has the benifit that the new expressions easily vectorize so we can constant-fold vector forms of the _split versions correctly. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2016-04-28 21:36:52 -07:00
Jason Ekstrand	70f89dd75e	nir: Switch the arguments to nir_foreach_def This matches the "foreach x in container" pattern found in many other programming languages. Generated by the following regular expression: s/nir_foreach_def(\([^,]\),\s\([^,]*\))/nir_foreach_def(\2, \1)/ Reviewed-by: Eduardo Lima Mitev <elima@igalia.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-04-28 15:54:48 -07:00
Jason Ekstrand	5015260a05	nir: Switch the arguments to nir_foreach_use and friends This matches the "foreach x in container" pattern found in many other programming languages. Generated by the following regular expression: s/nir_foreach_use(\([^,]\),\s\([^,]*\))/nir_foreach_use(\2, \1)/ and similar expressions for nir_foreach_use_safe, etc. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-04-28 15:54:48 -07:00
Jason Ekstrand	9464d8c498	nir: Switch the arguments to nir_foreach_function This matches the "foreach x in container" pattern found in many other programming languages. Generated by the following regular expression: s/nir_foreach_function(\([^,]\),\s\([^,]*\))/nir_foreach_function(\2, \1)/ Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-04-28 15:54:48 -07:00
Jason Ekstrand	e63766fb4b	nir: Switch the arguments to nir_foreach_parallel_copy_entry This matches the "foreach x in container" pattern found in many other programming languages. Reviewed-by: Eduardo Lima Mitev <elima@igalia.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-04-28 15:54:48 -07:00
Jason Ekstrand	8564916d01	nir: Switch the arguments to nir_foreach_phi_src This matches the "foreach x in container" pattern found in many other programming languages. Generated by the following regular expression: s/nir_foreach_phi_src(\([^,]\),\s\([^,]*\))/nir_foreach_phi_src(\2, \1)/ and a similar expression for nir_foreach_phi_src_safe. Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2016-04-28 15:54:48 -07:00
Jason Ekstrand	707e72f13b	nir: Switch the arguments to nir_foreach_instr This matches the "foreach x in container" pattern found in many other programming languages. Generated by the following regular expression: s/nir_foreach_instr(\([^,]\),\s\([^,]*\))/nir_foreach_instr(\2, \1)/ and similar expressions for nir_foreach_instr_safe etc. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-04-28 15:54:48 -07:00
Connor Abbott	3a8688fb41	nir/algebraic: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	1f8c100614	nir/validate: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	a471c161b1	nir/nir_worklist: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	db35177772	nir/remove_dead_variables: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	b3aaae398e	nir/split_var_copies: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	9d41a1ffeb	nir/repair_ssa: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	480a182ccd	nir/opt_peephole_select: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	e5f37701ab	nir/phi_builder: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	1ba40d834b	nir/opt_cp: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	8dd7d78925	nir/opt_remove_phis: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	1a8c17a59e	nir/opt_undef: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	52affdd2e6	nir/opt_dead_cf: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	ddc6639f85	nir/opt_dce: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	3afb3be674	nir/opt_gcm: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	eecf96f530	nir/opt_constant_folding: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	26b4c9ee15	nir/lower_samplers: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	f4ebff89e4	nir/normalize_cubemap_coords: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	492b3554a7	nir/lower_var_copies: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	c1b37c08bf	nir/move_vec_src_uses_to_dest: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	ceed12557d	nir/lower_vars_to_ssa: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	1557344c81	nir/lower_vec_to_movs: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	b1eada04b2	nir/lower_idiv: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	2febb88e6d	nir/lower_to_source_mods: fixup for new foreeach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	c81ca60b41	nir/lower_io: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	7e909972e3	nir/lower_system_values: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	76c74de456	nir/lower_phis_to_scalar: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	b89f0bb58c	nir/lower_indirect_derefs: fixup for new foreach_block() v2 (Jason Ekstrand): Use nir_foreach_block_safe Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	e3c5bda16a	nir/nir_lower_global_vars: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	480d78f55b	nir/lower_atomics: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	06cf73a7ba	nir/lower_load_const: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	15264133d7	nir/lower_locals_to_regs: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	1c6307aab4	nir/lower_gs_intrinsics: fixup for new foreach_block() v2 (Jason Ekstrand): Use nir_foreach_block_safe Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	3bf3100794	nir/nir: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	686f247b21	nir/lower_clip: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	e36fbcfc3f	nir/lower_alu_to_scalar: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	4179a56f42	nir/liveness: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	34af78edb3	nir/inline_functions: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	b23e59e172	nir/from_ssa: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Connor Abbott	d6a6c729ca	nir/dominance: fixup for new foreach_block() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 15:52:17 -07:00
Jose Fonseca	99474dc29b	nir: Try to warn when C99 extensions are used in nir headers. Ideally we'd have nir.h being included with -Wpedantic too, but it fails with: src/compiler/nir/nir.h:754:20: warning: ISO C++ forbids zero-size array ‘src’ [-Wpedantic] nir_alu_src src[]; ^ In file included from src/compiler/nir/glsl_to_nir.cpp:42:0: src/compiler/nir/nir.h:919:16: warning: ISO C++ forbids zero-size array ‘src’ [-Wpedantic] nir_src src[]; Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2016-04-28 16:48:13 +01:00
Jose Fonseca	e7438009af	nir: Remove spurious ; after nir_builder functions. Makes -pedantic happy. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2016-04-28 16:48:12 +01:00
Jose Fonseca	caa5937ebb	nir: Remove spurious ; after namespace. Makes -pedantic happy. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2016-04-28 16:48:12 +01:00
Jose Fonseca	f7854d8227	nir: Avoid C99 field initializers. As they are not standard C++ and are not supported by MSVC C++ compiler. Just have nir_imm_double match nir_imm_float above. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Sinclair Yeh <syeh@vmware.com>	2016-04-28 16:48:12 +01:00
Samuel Iglesias Gonsálvez	db07b46f2c	nir: Add lrp lowering for doubles in opt_algebraic Some hardware (i965 on Broadwell generation, for example) does not support natively the execution of lrp instruction with double arguments. Add 'lower_flrp64' flag to lower this instruction in that case. v2: - Rename lower_flrp_double to lower_flrp64 (Jason) - Fix typo (Jason) - Adapt the code to define bit_size information in the opcodes. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 12:01:40 +02:00
Samuel Iglesias Gonsálvez	443600d51e	nir: rename lower_flrp to lower_flrp32 A later patch will add lower_flrp64 option to NIR. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 12:01:40 +02:00
Iago Toral Quiroga	072613b3f3	nir/lower_double_ops: lower round_even() At least i965 hardware does not have native support for round_even() on doubles. Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-04-28 12:01:40 +02:00
Iago Toral Quiroga	bf91df7f7f	nir/lower_double_ops: lower fract() At least i965 hardware does not have native support for fract() on doubles. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 12:01:40 +02:00
Iago Toral Quiroga	126a1ac03f	nir/lower_double_ops: lower ceil() At least i965 hardware does not have native support for ceil on doubles. v2 (Sam): - Improve the lowering pass to remove one bcsel (Jason). Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 12:01:36 +02:00
Iago Toral Quiroga	29541ec531	nir/lower_double_ops: lower floor() At least i965 hardware does not have native support for floor on doubles. v2 (Sam): - Improve the lowering pass to remove one bcsel (Jason) Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 11:58:35 +02:00
Iago Toral Quiroga	5fab3d178b	nir/lower_double_ops: lower trunc() At least i965 hardware does not have native support for truncating doubles. v2: - Simplified the implementation significantly. - Fixed the else branch, that was not doing what we wanted. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 11:58:13 +02:00
Connor Abbott	2ea3649c63	nir: add a pass to lower some double operations v2: Move to compiler/nir (Iago) v3: Use nir_imm_int() to load the constants (Sam) v4 (Sam): - Undo line-wrap (Jason). - Fix comment (Jason). - Improve generated code for get_signed_inf() function (Connor). Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 11:58:13 +02:00
Connor Abbott	2cf3b28884	nir/builder: add nir_imm_double() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 11:58:13 +02:00
Samuel Iglesias Gonsálvez	3a150683ce	nir/builder: Add bit_size info to nir_build_imm() v2: - Group num_components and bit_size together (Jason) Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-28 11:58:13 +02:00
Connor Abbott	122d27e998	nir: rewrite nir_foreach_block and friends Previously, these were functions which took a callback. This meant that the per-block code had to be in a separate function, and all the data that you wanted to pass in had to be a single void *. They walked the control flow tree recursively, doing a depth-first search, and called the callback in a preorder, matching the order of the original source code. But since each node in the control flow tree has a pointer to its parent, we can implement a "get-next" and "get-previous" method that does the same thing that the recursive function did with no state at all. This lets us rewrite nir_foreach_block() as a simple for loop, which lets us greatly simplify its users in some cases. This does require us to rewrite every user, although the transformation from the old nir_foreach_block() to the new nir_foreach_block() is mostly trivial. One subtlety, though, is that the new nir_foreach_block() won't handle the case where the current block is deleted, which the old one could. There's a new nir_foreach_block_safe() which implements the standard trick for solving this. Most users don't modify control flow, though, so they won't need it. Right now, only opt_select_peephole needs it. The old functions are reimplemented in terms of the new macros, although they'll go away after everything is converted. v2: keep an implementation of the old functions around v3 (Jason Ekstrand): A small cosmetic change and a bugfix in the loop handling of nir_cf_node_cf_tree_last(). v4 (Jason Ekstrand): Use the _safe macro in foreach_block_reverse_call Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-27 15:05:40 -07:00
Connor Abbott	958300137f	nir/opt_cp: use nir_block_get_following_if() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-27 15:05:34 -07:00
Jason Ekstrand	e0806930ad	nir/algebraic: Add a bit-size validator This commit adds a validator that ensures that all expressions passed through nir_algebraic are 100% non-ambiguous as far as bit-sizes are concerned. This way it's a compile-time error rather than a hard-to-trace C exception some time later. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2016-04-27 11:21:06 -07:00
Jason Ekstrand	8a3e344180	nir/opt_algebraic: Fix some expressions with ambiguous bit sizes Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-04-27 11:21:06 -07:00
Jason Ekstrand	7e0ee3a38b	nir/search: Respect the bit_size parameter on nir_search_value Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-04-27 11:21:06 -07:00
Jason Ekstrand	fcc1c8a437	nir/algebraic: Add a mechanism for specifying the bit size of a value Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-04-27 11:21:06 -07:00
Jason Ekstrand	cafb885e45	nir/algebraic: Use "uint" instead of "unsigned" for uint types This is consistent with the rename done for the rest of NIR. Currently, "bool" is the only type specifier used in nir_opt_algebraic.py so this is really a no-op. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-04-27 11:21:06 -07:00
Jason Ekstrand	736ee0bef7	nir/algebraic: Do better error reporting of bad expressions Previously, if an exception was encountered anywhere, nir_algebraic would just die in a fire with no indication whatsoever as to where the actual bug is. This commit makes it print out the particular search-and-replace expression that is causing problems along with the exception. Also, it will now report all of the errors it finds and then exit at the end like a standard C compiler would do. Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-04-27 11:21:06 -07:00
Jason Ekstrand	d800b7daa5	nir: Add a helper for figuring out what channels of an SSA def are read Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-04-26 19:55:04 -07:00
Juha-Pekka Heikkila	940da2ce0e	nir: Add missing break into switch in construct_value() There seemed to be missing one break in nested switchcases. Signed-off-by: Juha-Pekka Heikkila <juhapekka.heikkila@gmail.com> Reviewed-by: Antia Puentes <apuentes@igalia.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-04-26 17:45:56 +02:00
Connor Abbott	b6dc940ec2	nir: rename nir_foreach_block() to nir_foreach_block_call() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-20 09:47:05 -07:00
Rob Clark	eddfc97709	nir/lower-tex: add srgb->linear lowering Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-19 17:13:50 -04:00
Rob Clark	eb00a0fc58	nir/builder: const'ify swiz param No need for it not to be const, and lets caller declare it const if desired. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2016-04-19 17:13:36 -04:00
Rob Clark	52ccc6349f	nir/lower-tex: make options a local var Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-04-19 16:12:49 -04:00
Dave Airlie	b3616f1326	nir: only dereference undef after NULL check. (v2) Pointed out by coverity. v2: nuke line, Jason pointed out the constructor does it. Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-04-18 07:37:48 +10:00
Jason Ekstrand	d8b85c96d1	Merge remote-tracking branch 'public/master' into vulkan	2016-04-15 13:35:16 -07:00
Eduardo Lima Mitev	7e4628da48	nir/print: Fix printing variable mode nir_variable_mode is currently a bitflag enum, while nir_print::print_var_decl() assumes is still a numbered list. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-15 16:41:41 +02:00
Jason Ekstrand	76fa7b16f4	Merge remote-tracking branch 'public/master' into vulkan	2016-04-14 18:30:52 -07:00
Jason Ekstrand	d7cddbd6d6	nir/lower_io: Add UBOs and SSBOs to get_io_offset_src Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-04-14 18:07:40 -07:00
Jason Ekstrand	c825e29a82	nir/intrinsics: Add a vulkan_resource_index intrinsic This is used to facilitate the Vulkan binding model where each resource is described by a (descriptor set, binding, array index) tuple. Reviewed-by: Rob Clark <robdclark@gmail.com>	2016-04-14 17:20:05 -07:00
Jason Ekstrand	1e0012e3e4	nir: Add a descriptor_set field to nir_variable This is needed for supporting the Vulkan binding model Reviewed-by: Rob Clark <robdclark@gmail.com>	2016-04-14 17:20:05 -07:00
Jason Ekstrand	5567ae0547	Merge remote-tracking branch 'public/master' into vulkan	2016-04-14 17:14:28 -07:00
Jason Ekstrand	63101177f3	nir: Add another index to load_uniform to specify the range read Reviewed-by: Kristian Høgsberg <krh@bitplanet.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-04-14 15:59:33 -07:00
Jason Ekstrand	769b5614f8	nir/opt_algebraic: Remove the encoding line This is an unneeded diff between the vulkan and master branches	2016-04-14 10:35:40 -07:00
Jason Ekstrand	c34be07230	spirv: Move to compiler/ While it does rely on NIR, it's not really part of the NIR core. At the moment, it still builds as part of libnir but that can be changed later if desired.	2016-04-14 10:28:47 -07:00
Jason Ekstrand	bfa3a38280	nir: Remove some pointless delta between vulkan and master	2016-04-14 10:24:33 -07:00
Jose Fonseca	feb6732e80	nir: Use _snprintf on Windows. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2016-04-14 16:38:37 +01:00
Jose Fonseca	ba0c0e3940	nir: Avoid structure initalization expressions. Not supported by MSVC, and completely unnecessary -- inline functions work just as well. NIR_SRC_INIT/NIR_DEST_INIT could and probably should be replaced by the inline functions. Acked-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2016-04-14 16:38:37 +01:00
Jose Fonseca	8f96524f13	nir: Remove unistd.h include. It doesn't seem needed, and is not available on MSVC. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2016-04-14 16:38:31 +01:00
Jose Fonseca	f8e2f1fba5	nir: Avoid empty {} struct initializer. Not supported by MSVC and consistent through NIR. [Emil Velikov: rebase] Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2016-04-14 16:33:52 +01:00
Jason Ekstrand	12f88ba32a	Merge remote-tracking branch 'public/master' into vulkan	2016-04-13 20:25:39 -07:00
Jason Ekstrand	b63a98b121	nir/dead_variables: Configurably work with any variable mode The old version of the pass only worked on globals and locals and always left inputs, outputs, uniforms, etc. alone. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-04-13 15:45:10 -07:00
Jason Ekstrand	4455bfa9a0	nir/algebraic: Add lowering for ldexp The algorithm used is different from both the naive suggestion from the GLSL spec and the one used in GLSL IR today. Unfortunately, the GLSL IR implementation that we have today doesn't handle denormals (for those that care) or the case where the float source is +-inf. Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-04-13 15:44:19 -07:00
Jason Ekstrand	745b3d295e	nir: Add more modulus opcodes These are all needed for SPIR-V Reviewed-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-04-13 15:44:00 -07:00
Jason Ekstrand	dd616cab01	nir/lower_io: Allow for a full bitmask of modes Acked-by: Eric Anholt <eric@anholt.net> Reviewed-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-04-13 12:44:10 -07:00
Jason Ekstrand	2caaf0ac5e	nir/lower_indirect: nir_variable_mode is now a bitfield Acked-by: Eric Anholt <eric@anholt.net> Reviewed-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-04-13 12:44:07 -07:00
Jason Ekstrand	ffa0e12e15	nir: Convert nir_variable_mode to a bitfield There are several passes where we need to specify some set of variable modes that the pass needs top operate on. This lets us easily do that. Acked-by: Eric Anholt <eric@anholt.net> Reviewed-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-04-13 12:40:12 -07:00
Jason Ekstrand	8f3b516f2e	nir/clone: Copy bit size when cloning registers Reported-by: Mark Janes <mark.a.janes@intel.com> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2016-04-12 16:41:58 -07:00
Ian Romanick	193a5cee6a	nir: Fix typo in comment Trivial. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2016-04-11 19:24:19 -07:00
Markus Wick	18c8b927e2	nir: Merge redudant integer clamping. Dolphin uses them a lot. Range tracking would be better in the long term, but this two lines works fine for now. Signed-off-by: Markus Wick <markus@selfnet.de> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-11 18:48:50 -07:00
Kenneth Graunke	808d26c771	nir: Silence unused "options" warning in algebraic passes. Some passes may not refer to options->..., at which point the compiler will warn about an unused variable. Just cast to void unconditionally to shut it up. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-04-11 18:44:08 -07:00
Kenneth Graunke	5886cd79a0	nir: Do basic constant reassociation. Many shaders contain expression trees of the form: const_1 * (value * const_2) Reorganizing these to (const_1 * const_2) * value will allow constant folding to combine the constants. Sometimes, these constants are 2 and 0.5, so we can remove a multiply altogether. Other times, it can create more immediate constants, which can actually hurt. Finding a good balance here is tricky. While much more could be done, this simple patch seems to have a lot of positive benefit while having a low downside. shader-db results on Broadwell: total instructions in shared programs: 8963768 -> 8961369 (-0.03%) instructions in affected programs: 438318 -> 435919 (-0.55%) helped: 1502 HURT: 245 total cycles in shared programs: 71527354 -> 71421516 (-0.15%) cycles in affected programs: 11541788 -> 11435950 (-0.92%) helped: 3445 HURT: 1224 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-04-11 18:43:55 -07:00
Jason Ekstrand	a9e6213edd	nir/lower_system_values: Add support for several computed values Reviewed-by: Rob Clark <robdclark@gmail.com>	2016-04-11 13:53:03 -07:00
Emil Velikov	3d67780b80	compiler: remove {glsl,nir}/Makefile.sources No longer used as of last commit. v2: Rebase. Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> (v1)	2016-04-11 19:08:23 +01:00
Jason Ekstrand	3aa1a5ee88	nir/lower_system_values: Simplify the computation of LocalInvocationIndex	2016-04-10 23:43:38 -07:00
Connor Abbott	a89c474157	nir: add a pass for lowering (un)pack_double_2x32 v2: Undo unintended change to the signature of nir_normalize_cubemap_coords (Iago). v3: Move to compiler/nir (Iago) v4: Remove Authors from copyright header (Michael Schellenberger) v5 (Sam): - Use nir_channel() and nir_ssa_for_alu_src() helpers (Jason) - Inline lower_double_pack_instr() code into lower_double_pack_block() (Jason). - Initialize nir_builder at lower_double_pack_impl() (Jason). Signed-off-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-11 08:29:27 +02:00
Connor Abbott	663e6421df	nir: add split versions of (un)pack_double_2x32 v2 (Sam): - Use uint64 instead of float64 for sources and destinations. (Connor) Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-11 08:29:27 +02:00
Connor Abbott	b093808d26	nir: don't try to scalarize unpack_double_2x32 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-11 08:29:27 +02:00
Connor Abbott	9e31e0a21b	nir: add support for (un)pack_double_2x32 v2 (Sam): - Use uint64 instead of float64 for sources and destinations. (Connor) Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-11 08:29:27 +02:00
Iago Toral Quiroga	d5d6260329	nir: add i2d and u2d opcodes v2: - Assert supports_int and don't fallback to nir_fmov (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-11 08:29:27 +02:00
Iago Toral Quiroga	b16d06252e	nir: add d2i, d2u, d2b opcodes Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-11 08:29:27 +02:00
Connor Abbott	a4bce07dc6	nir: add support for d2f and f2d Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-11 08:29:27 +02:00
Iago Toral Quiroga	fab5d4cd95	nir/glsl_to_nir: set bit_size on ssbo_load result v2 (Sam): - Add missing bit_size assignment when ssbo_load destination is a boolean. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-11 08:29:27 +02:00
Samuel Iglesias Gonsálvez	a741378cb5	nir/glsl_to_nir: add bit-size info to add_instr() Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-11 08:28:01 +02:00
Connor Abbott	4b37c64f3b	nir/split_var_copies: handle doubles Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-11 08:27:05 +02:00
Connor Abbott	106a1b5501	nir/instr_set: handle 64-bit bit-sizes v2: Revert spurious change in nir_opt_cse.c (Iago) Signed-off-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-11 08:27:05 +02:00
Connor Abbott	f2ccb63be1	nir: handle doubles in nir_deref_get_const_initializer_load() v2 (Sam): - Use proper bitsize value when calling to nir_load_const_instr_create() (Jason). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-11 08:27:05 +02:00
Connor Abbott	41c2541fc7	nir/print: add support for printing doubles and bitsize v2: - Squash the printing doubles related patches into one patch (Sam). v3: - Print using PRIx64 format: long is 32-bit on some 32-bit platforms but long long is basically always 64-bit (Jason). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-11 08:27:05 +02:00
Connor Abbott	f5551f8a8b	nir/glsl_to_nir: support doubles v2: - Don't set sized types to the destination of texture related opcodes. (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-11 08:27:05 +02:00
Iago Toral Quiroga	8e69782e3e	nir/lower_load_const_to_scalar: support doubles and multiple bit sizes v2 (Sam): - Add assert to detect bitsizes differents than 32 and 64 (Jason). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-11 08:27:05 +02:00
Iago Toral Quiroga	12f628adcb	nir/lower_to_source_mods: Handle different bit sizes v2 (Sam): - Use helper to get base type from nir_alu_type. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-11 08:27:04 +02:00
Samuel Iglesias Gonsálvez	3663a2397e	nir: add bit_size info to nir_load_const_instr_create() Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-11 08:27:04 +02:00
Connor Abbott	a5b17ae745	nir/lower_vec: adapt to different bit sizes Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-11 08:27:04 +02:00
Samuel Iglesias Gonsálvez	e3edaec739	nir: add bit_size info to nir_ssa_undef_instr_create() v2: - Make the users to give the right bit_sizes as arguments (Jason). Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-11 08:27:04 +02:00
Connor Abbott	41a39e3384	nir/locals_to_regs: adapt to different bit sizes Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-11 08:27:04 +02:00
Connor Abbott	40d1b671a9	nir/from_ssa: adapt to different bit sizes Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-04-11 08:27:04 +02:00
Jason Ekstrand	7d58cfa366	nir: Add a pass for gathering various bits of shader info Reviewed-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-04-10 20:43:47 -07:00
Jason Ekstrand	b8f3909b73	nir/gather_info: Handle discard_if Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2016-04-08 15:58:36 -07:00
Jason Ekstrand	e26a978773	Merge remote-tracking branch 'public/master' into vulkan	2016-04-07 16:56:34 -07:00
Kenneth Graunke	3babb7b0a4	nir: Use PRIi64 and PRIu64 instead of %ld and %lu. %ld and %lu aren't the right format specifiers for int64_t and uint64_t on 32-bit (x86) systems. They're %zu on Linux and %Iu on Windows. Use the standard C99 macros in hopes that they work everywhere. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2016-04-04 14:38:48 -07:00
Jason Ekstrand	eb93d6dec8	nir/search: Don't match inexact expressions with exact subexpressions In the first pass of implementing exact handling, I made a mistake with search-and-replace. In particular, we only reallly handled exact/inexact on the root of the tree. Instead, we need to check every node in the tree for an exact/inexact match. As an example of this, consider the following GLSL code precise float a = b + c; if (a < 0) { do_stuff(); } In that case, only the add will be declared "exact" and an expression that looks for "b + c < 0" will still match and replace it with "b < -c" which may yield different results. The solution is to simply bail if any of the values are exact when matching an inexact expression. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-04-04 13:48:10 -07:00
Jason Ekstrand	fe247bbe92	nir: Stop double-printing function arguments	2016-04-04 12:10:20 -07:00
Jason Ekstrand	cc1320220f	nir/gather_info: Add an assert for supported stages	2016-04-01 15:44:43 -07:00
Jason Ekstrand	ebb0bcc11d	nir: Move variable_get_io_mask back into gather_info It used to be in nir_gather_info.c until I moved it out to nir.h so it could be re-used with some linking code that never got merged. We'll move it back out if and when we have real code to share it with.	2016-04-01 15:39:48 -07:00
Jason Ekstrand	95106f6bfb	Merge remote-tracking branch 'public/master' into vulkan	2016-04-01 15:16:21 -07:00
Jason Ekstrand	de60e250f5	nir: Add an opcode for stomping a 32-bit value to 16-bit precision This correlates directly to the SPIR-V opcode OpQuantizeToF16 Reviewed-by: Rob Clark <robdclark@gmail.com>	2016-04-01 13:52:28 -07:00
Ian Romanick	08ff5f4d1f	nir: Simplify a bcsel to logical-or Oddly, this did not affect the shader where I first noticed the pattern. That particular shader doesn't get its if-statement converted to a bcsel because there are two assignments in the else-statement. This led to me submitting https://bugs.freedesktop.org/show_bug.cgi?id=94747. shader-db results: Sandy Bridge total instructions in shared programs: 8467384 -> 8467069 (-0.00%) instructions in affected programs: 36594 -> 36279 (-0.86%) helped: 46 HURT: 0 total cycles in shared programs: 117573448 -> 117568518 (-0.00%) cycles in affected programs: 339114 -> 334184 (-1.45%) helped: 46 HURT: 0 Ivy Bridge / Haswell / Broadwell / Skylake: total instructions in shared programs: 7774258 -> 7773999 (-0.00%) instructions in affected programs: 30874 -> 30615 (-0.84%) helped: 46 HURT: 0 total cycles in shared programs: 65739190 -> 65734530 (-0.01%) cycles in affected programs: 180380 -> 175720 (-2.58%) helped: 45 HURT: 1 No change on G45 or Ironlake. I also tried these expressions, but none of them affected any shaders in shader-db: (('bcsel', a, 'a@bool', 'b@bool'), ('ior', a, b)), (('bcsel', a, 'b@bool', False), ('iand', a, b)), (('bcsel', a, 'b@bool', 'a@bool'), ('iand', a, b)), Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-03-31 14:59:36 -07:00
Matt Turner	05ee6627d6	nir: Fix typo from commit `6702f1acde`.	2016-03-30 19:18:35 -07:00
Matt Turner	6702f1acde	nir: Propagate negates up multiplication chains. total instructions in shared programs: 7112159 -> 7088092 (-0.34%) instructions in affected programs: 1374915 -> 1350848 (-1.75%) helped: 7392 HURT: 621 GAINED: 2 LOST: 2	2016-03-30 13:12:34 -07:00
Jason Ekstrand	cf2257069c	nir/spirv: Set a default number of invocations for geometry shaders The SPIR-V spec says geometry shaders are supposed to have one invocation by default. The execution mode is only required if there are multiple invocations.	2016-03-29 20:30:27 -07:00
Jason Ekstrand	35e2e96b30	nir: Add a helper for getting the current block from a cursor Reviewed-by: Rob Clark <robdclark@gmail.com>	2016-03-28 18:32:48 -07:00
Jason Ekstrand	be98c47528	nir/lower_out_to_temp: Add an "entrypoint" parameter Previously, the pass assumed that the entrypoint would be whatever function happened to have the name "main". We really shouldn't trust in the function names. Reviewed-by: Rob Clark <robdclark@gmail.com>	2016-03-28 18:32:48 -07:00
Jason Ekstrand	31a5bec93f	nir/lower_out_to_temp: Steal the output's constant initializer Reviewed-by: Rob Clark <robdclark@gmail.com>	2016-03-28 18:32:48 -07:00
Jason Ekstrand	38de85f9a5	nir: Add a helper for getting the unique function in a shader Reviewed-by: Rob Clark <robdclark@gmail.com>	2016-03-28 18:32:48 -07:00
Jason Ekstrand	49be812be6	nir/sweep: Sweep function parameters They are no longer in the list of local variables so we need to explicitly sweep them. Reviewed-by: Rob Clark <robdclark@gmail.com>	2016-03-28 18:32:48 -07:00
Jason Ekstrand	1be4c61c95	nir/builder: Add a helper for creating undefs Reviewed-by: Rob Clark <robdclark@gmail.com>	2016-03-28 18:32:48 -07:00
Jason Ekstrand	6a2479d618	nir/builder: Add a helper for storing to variable derefs Reviewed-by: Rob Clark <robdclark@gmail.com>	2016-03-28 18:32:48 -07:00
Jason Ekstrand	77e2ac1da7	nir/builder: Add a helper for building fdot instructions Reviewed-by: Rob Clark <robdclark@gmail.com>	2016-03-28 18:32:48 -07:00
Jason Ekstrand	da422663a6	nir: Add a variable_foreach_safe helper Reviewed-by: Rob Clark <robdclark@gmail.com>	2016-03-28 18:32:48 -07:00
Jason Ekstrand	731870fbe3	nir/Makefile: Fix alphabetization Reviewed-by: Rob Clark <robdclark@gmail.com>	2016-03-28 18:32:48 -07:00
Jason Ekstrand	433cf90650	nir/spirv: Remove the NoContraction hack NIR now just handles this for us by not fusing if the multiply is marked as exact.	2016-03-28 13:07:39 -07:00
Jason Ekstrand	035f66025b	nir/search: Don't match inexact expressions with exact subexpressions In the first pass of implementing exact handling, I made a mistake with search-and-replace. In particular, we only reallly handled exact/inexact on the root of the tree. Instead, we need to check every node in the tree for an exact/inexact match. As an example of this, consider the following GLSL code precise float a = b + c; if (a < 0) { do_stuff(); } In that case, only the add will be declared "exact" and an expression that looks for "b + c < 0" will still match and replace it with "b < -c" which may yield different results. The solution is to simply bail if any of the values are exact when matching an inexact expression.	2016-03-28 13:07:39 -07:00
Jason Ekstrand	fbb9e1f008	spirv/alu: Add support for the NoContraction decoration	2016-03-25 21:35:41 -07:00
Jason Ekstrand	00fa795cd3	spirv/glsl: Add a helper for converting glsl opcodes into nir opcodes This is similar to the way that regular ALU operations are handled.	2016-03-25 21:35:41 -07:00
Jason Ekstrand	98522c1853	nir/spirv: Get rid of the spirv2nir helper binary This was useful once upon a time but now that we have a real Vulkan driver to run our SPIR-V binaries through, there's really no point.	2016-03-25 21:35:41 -07:00
Jason Ekstrand	13bad493b4	nir/algebraic: Get rid of a redundant copy of fdiv lowering	2016-03-25 14:04:05 -07:00
Jason Ekstrand	08fe89864b	nir/algebraic: Add better lowering of ldexp	2016-03-25 14:04:05 -07:00
Jason Ekstrand	b75d770963	nir/builder: Simplify nir_ssa_undef a bit	2016-03-25 14:04:05 -07:00
Jason Ekstrand	ab31951bef	nir/spirv: Use the nir_ssa_undef helper from nir_builder	2016-03-25 14:04:05 -07:00
Jason Ekstrand	d2eee52a65	nir/builder: Add a bit size field to nir_ssa_undef	2016-03-25 14:04:05 -07:00
Jason Ekstrand	b50f7f0011	nir: Add a better comment for INTRINSIC_RANGE	2016-03-25 14:04:05 -07:00
Jason Ekstrand	add8c837b5	nir/glsl: Stop carying a pointer to the nir_shader in the visitor	2016-03-25 14:04:05 -07:00
Jason Ekstrand	2c3f95d6aa	Merge remote-tracking branch 'public/master' into vulkan	2016-03-24 17:30:14 -07:00
Jason Ekstrand	22b343a8ec	nir: Add a pass to inline functions This commit adds a new NIR pass that lowers all function calls away by inlining the functions. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-03-24 15:20:44 -07:00
Jason Ekstrand	debf23ec68	nir/builder: Add helpers for easily inserting copy_var intrinsics Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-03-24 15:20:44 -07:00
Jason Ekstrand	79dec93ead	nir: Add return lowering pass This commit adds a NIR pass for lowering away returns in functions. If the return is in a loop, it is lowered to a break. If it is not in a loop, it's lowered away by moving/deleting code as needed. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-03-24 15:20:44 -07:00
Jason Ekstrand	8d61d72524	nir: Add a cursor helper for getting a cursor after any phi nodes Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-03-24 15:20:44 -07:00
Jason Ekstrand	18b0166749	nir/builder: Add a helper for inserting jump instructions Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-03-24 15:20:44 -07:00
Jason Ekstrand	97b663481c	nir/cf: Make extracting or re-inserting nothing a no-op Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2016-03-24 15:20:44 -07:00
Jason Ekstrand	7022a673cd	nir: Add a function for comparing cursors Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-03-24 15:20:44 -07:00
Jason Ekstrand	124f229ece	nir/cf: Handle relinking top-level blocks This can happen if a function ends in a return instruction and you remove the return. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2016-03-24 15:20:44 -07:00
Jason Ekstrand	364212f1ed	nir: Add a pass to repair SSA form Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-03-24 15:20:44 -07:00
Jason Ekstrand	ea98d415e4	nir/vars_to_ssa: Use the new nir_phi_builder helper The efficiency should be approximately the same. We do a little more work per phi node because we have to sort the predecessors. However, we no longer have to walk the blocks a second time to pop things off the stack. The bigger advantage, however, is that we can now re-use the phi placement and per-block SSA value tracking in other passes. As a side-benifit, the phi builder actually handles unreachable blocks correctly. The original vars_to_ssa code, because of the way it iterated the blocks and added phi sources, didn't add sources corresponding to predecessors of unreachable blocks. The new strategy employed by the phi builder creates a phi source for each predecessor and should correctly handle unreachable blocks by setting those sources to SSA undefs. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-03-24 15:20:44 -07:00
Jason Ekstrand	42ddfc611f	nir/dominance: Handle unreachable blocks Previously, nir_dominance.c didn't properly handle unreachable blocks. This can happen if, for instance, you have something like this: loop { if (...) { break; } else { break; } } In this case, the block right after the if statement will be unreachable. This commit makes two changes to handle this. First, it removes an assert and allows block->imm_dom to be null if the block is unreachable. Second, it properly skips unreachable blocks in calc_dom_frontier_cb. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2016-03-24 15:20:44 -07:00
Jason Ekstrand	e4dc82cfcf	nir: Add a phi node placement helper Right now, we have phi placement code in two places and there are other places where it would be nice to be able to do this analysis. Instead of repeating it all over the place, this commit adds a helper for placing all of the needed phi nodes for a value. v2: Add better documentation Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-03-24 15:20:44 -07:00
Rob Clark	0bea0e7141	nir: fix dangling ssadef->name ptrs In many places, the convention is to pass an existing ssadef name ptr when construction/initializing a new nir_ssa_def. But that goes badly (as noticed by garbage in nir_print output) when the original string gets freed. Just use ralloc_strdup() instead, and add ralloc_free() in the two places that would care (not that the strings wouldn't eventually get freed anyways). Also fixup the nir_search code which was directly setting ssadef->name to use the parent instruction as memctx. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-03-24 08:30:04 -04:00
Jason Ekstrand	a984e44abd	nir/glsl: Propagate invariant into NIR alu ops Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2016-03-23 16:28:07 -07:00
Jason Ekstrand	91d6272c2b	nir/alu_to_scalar: Propagate the "exact" bit Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2016-03-23 16:28:06 -07:00
Jason Ekstrand	5f39e3e165	nir/cse: Properly handle nir_ssa_def.exact Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2016-03-23 16:28:06 -07:00
Jason Ekstrand	0dbda153aa	nir/algebraic: Flag inexact optimizations Many of our optimizations, while great for cutting shaders down to size, aren't really precision-safe. This commit tries to flag all of the inexact floating-point optimizations so they don't get run on values that are flagged "exact". It's a bit conservative and maybe flags some safe optimizations as unsafe but that's better than missing one. Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2016-03-23 16:28:02 -07:00
Jason Ekstrand	ed3a029e80	nir/algebraic: Fix fmin detection to match the spec The previous transformation got the arguments to fmin backwards. When NaNs are involved, the GLSL min/max aren't commutative so it matters. Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2016-03-23 16:28:00 -07:00
Jason Ekstrand	89545b1314	nir/algebraic: Get rid of an invlid fxor optimization The fxor opcode is required to return 1.0f or 0.0f but the input variable may not be 1.0f or 0.0f. Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2016-03-23 16:27:58 -07:00
Jason Ekstrand	3a7cb6534c	nir/algebraic: Allow for flagging operations as being inexact Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2016-03-23 16:27:55 -07:00
Jason Ekstrand	a6f25fa7d7	nir/search: Propagate exactness into newly created expressions Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2016-03-23 16:27:52 -07:00
Jason Ekstrand	ded3133d47	nir/builder: Add a flag for setting exact Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2016-03-23 16:26:34 -07:00
Jason Ekstrand	4ff89377d9	nir: Add an "exact" bit to nir_alu_instr Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2016-03-23 16:26:34 -07:00
Jason Ekstrand	f849f53990	nir/clone: Export nir_variable_clone Reviewed-by: Rob Clark <robclark@gmail.com>	2016-03-23 15:26:11 -07:00
Jason Ekstrand	5fe8959912	nir/clone: Expose nir_constant_clone Reviewed-by: Rob Clark <robclark@gmail.com>	2016-03-23 15:26:08 -07:00
Jason Ekstrand	c4c373f156	nir: Fix whitespace Reviewed-by: Rob Clark <robclark@gmail.com>	2016-03-23 15:25:53 -07:00
Ian Romanick	d7a25a9def	nir: Don't abs slt and friends No shader-db changes, but this is symmetric with the previous commit. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-03-22 14:48:02 -07:00
Ian Romanick	2bb006af68	nir: Don't abs the result of b2f or b2i In the results below, 2 SIMD16 shaders in Trine are lost. G4X total instructions in shared programs: 4012279 -> 4011108 (-0.03%) instructions in affected programs: 116776 -> 115605 (-1.00%) helped: 339 HURT: 0 total cycles in shared programs: 84315862 -> 84313584 (-0.00%) cycles in affected programs: 1767232 -> 1764954 (-0.13%) helped: 274 HURT: 81 Ironlake total instructions in shared programs: 6399073 -> 6396998 (-0.03%) instructions in affected programs: 218050 -> 215975 (-0.95%) helped: 600 HURT: 0 total cycles in shared programs: 128892088 -> 128888810 (-0.00%) cycles in affected programs: 2867452 -> 2864174 (-0.11%) helped: 422 HURT: 137 Sandy Bridge total instructions in shared programs: 8462174 -> 8460759 (-0.02%) instructions in affected programs: 178529 -> 177114 (-0.79%) helped: 596 HURT: 0 total cycles in shared programs: 117542276 -> 117534098 (-0.01%) cycles in affected programs: 1239166 -> 1230988 (-0.66%) helped: 369 HURT: 150 Ivy Bridge total instructions in shared programs: 7775131 -> 7773410 (-0.02%) instructions in affected programs: 162903 -> 161182 (-1.06%) helped: 590 HURT: 0 total cycles in shared programs: 65759882 -> 65747268 (-0.02%) cycles in affected programs: 1004354 -> 991740 (-1.26%) helped: 467 HURT: 141 Haswell total instructions in shared programs: 7107786 -> 7106327 (-0.02%) instructions in affected programs: 140954 -> 139495 (-1.04%) helped: 590 HURT: 0 total cycles in shared programs: 64668028 -> 64655322 (-0.02%) cycles in affected programs: 967080 -> 954374 (-1.31%) helped: 452 HURT: 149 LOST: 2 GAINED: 0 Broadwell total instructions in shared programs: 8980029 -> 8978287 (-0.02%) instructions in affected programs: 197232 -> 195490 (-0.88%) helped: 715 HURT: 0 total cycles in shared programs: 70070448 -> 70055970 (-0.02%) cycles in affected programs: 975724 -> 961246 (-1.48%) helped: 471 HURT: 111 LOST: 2 GAINED: 0 Skylake total instructions in shared programs: 9115178 -> 9113436 (-0.02%) instructions in affected programs: 203012 -> 201270 (-0.86%) helped: 715 HURT: 0 total cycles in shared programs: 68848660 -> 68834004 (-0.02%) cycles in affected programs: 993888 -> 979232 (-1.47%) helped: 473 HURT: 116 LOST: 2 GAINED: 0 Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-03-22 14:48:02 -07:00
Ian Romanick	348e5a71d8	nir: Simplify 0 < fabs(a) Sandy Bridge / Ivy Bridge / Haswell total instructions in shared programs: 8462180 -> 8462174 (-0.00%) instructions in affected programs: 564 -> 558 (-1.06%) helped: 6 HURT: 0 total cycles in shared programs: 117542462 -> 117542276 (-0.00%) cycles in affected programs: 9768 -> 9582 (-1.90%) helped: 12 HURT: 0 Broadwell / Skylake total instructions in shared programs: 8980833 -> 8980826 (-0.00%) instructions in affected programs: 626 -> 619 (-1.12%) helped: 7 HURT: 0 total cycles in shared programs: 70077900 -> 70077714 (-0.00%) cycles in affected programs: 9378 -> 9192 (-1.98%) helped: 12 HURT: 0 G45 and Ironlake showed no change. v2: Modify the comments to look more like a proof. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-03-22 14:47:56 -07:00
Ian Romanick	564a8b8a26	nir: Simplify 0 >= b2f(a) This also prevented some regressions with other patches in my local tree. Broadwell / Skylake total instructions in shared programs: 8980835 -> 8980833 (-0.00%) instructions in affected programs: 45 -> 43 (-4.44%) helped: 1 HURT: 0 total cycles in shared programs: 70077904 -> 70077900 (-0.00%) cycles in affected programs: 122 -> 118 (-3.28%) helped: 1 HURT: 0 No changes on earlier platforms. v2: Modify the comments to look more like a proof. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-03-22 14:44:57 -07:00
Ian Romanick	bf0d60aa11	nir: Simplify i2b with negated or abs operand This enables removing ssa_201 and ssa_202 in sequences like: vec1 ssa_200 = flt ssa_199, ssa_194 vec1 ssa_201 = b2i ssa_200 vec1 ssa_202 = i2b -ssa_201 shader-db results: Sandy Bridge total instructions in shared programs: 8462257 -> 8462180 (-0.00%) instructions in affected programs: 3846 -> 3769 (-2.00%) helped: 35 HURT: 0 total cycles in shared programs: 117542934 -> 117542462 (-0.00%) cycles in affected programs: 20072 -> 19600 (-2.35%) helped: 20 HURT: 1 Ivy Bridge total instructions in shared programs: 7775252 -> 7775137 (-0.00%) instructions in affected programs: 3645 -> 3530 (-3.16%) helped: 35 HURT: 0 total cycles in shared programs: 65760522 -> 65760068 (-0.00%) cycles in affected programs: 21082 -> 20628 (-2.15%) helped: 25 HURT: 2 Haswell total instructions in shared programs: 7108666 -> 7108589 (-0.00%) instructions in affected programs: 3253 -> 3176 (-2.37%) helped: 35 HURT: 0 total cycles in shared programs: 64675726 -> 64675272 (-0.00%) cycles in affected programs: 21034 -> 20580 (-2.16%) helped: 26 HURT: 1 Broadwell / Skylake total instructions in shared programs: 8980912 -> 8980835 (-0.00%) instructions in affected programs: 3223 -> 3146 (-2.39%) helped: 35 HURT: 0 total cycles in shared programs: 70077926 -> 70077904 (-0.00%) cycles in affected programs: 21886 -> 21864 (-0.10%) helped: 21 HURT: 6 G45 and Ironlake showed no change. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Suggested-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-03-22 14:43:28 -07:00
Ian Romanick	a4079f1cb2	nir: Lower flrp with Boolean interpolator to bcsel On Intel platforms that don't set lower_flrp, using bcsel instead of flrp seems to be a small amount worse. On those platforms, the use of flrp, bcsel, and multiply of b2f is still an active area of research. In review, Matt suggested this is because bcsel turns into CMP+SEL, and because of the flag register we can't schedule instructions well. shader-db results: G4X / Ironlake total instructions in shared programs: 4016538 -> 4012279 (-0.11%) instructions in affected programs: 161556 -> 157297 (-2.64%) helped: 1077 HURT: 1 total cycles in shared programs: 84328296 -> 84315862 (-0.01%) cycles in affected programs: 4174570 -> 4162136 (-0.30%) helped: 926 HURT: 53 Unsurprisingly, no changes on later platforms. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-03-22 14:42:42 -07:00
Connor Abbott	58fe7837b8	nir: propagate bitsize information in nir_search When we replace an expresion we have to compute bitsize information for the replacement. We do this in two passes to validate that bitsize information is consistent and correct: first we propagate bitsize from child nodes to parent, then we do it the other way around, starting from the original's instruction destination bitsize. v2 (Iago): - Always use nir_type_bool32 instead of nir_type_bool when generating algebraic optimizations. Before we used nir_type_bool32 with constants and nir_type_bool with variables. - Fix bool comparisons in nir_search.c to account for bitsized types. v3 (Sam): - Unpack the double constant value as unsigned long long (8 bytes) in nir_algrebraic.py. v4 (Sam): - Use helpers to get type size and base type from nir_alu_type. Signed-off-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-03-17 11:54:45 +01:00
Connor Abbott	3124ce699b	nir: add a bit_size parameter to nir_ssa_dest_init v2: Squash multiple commits addressing the new parameter in different files so we don't break the build (Iago) v3: Fix tgsi (Samuel) v4: Fix nir_clone.c (Samuel) v5: Fix vc4 and freedreno (Iago) v6 (Sam) - Fix build errors in nir_lower_indirect_derefs - Use helper to get type size from nir_alu_type. Signed-off-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Samuel Iglesias Gonsalvez <siglesias@igalia.com> Tested-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-03-17 11:54:45 +01:00
Iago Toral Quiroga	084b24f558	nir: rename nir_const_value fields to include bitsize information Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2016-03-17 11:16:33 +01:00
Connor Abbott	9076c4e289	nir: update opcode definitions for different bit sizes Some opcodes need explicit bitsizes, and sometimes we need to use the double version when constant folding. v2: fix output type for u2f (Iago) v3: do not change vecN opcodes to be float. The next commit will add infrastructure to enable 64-bit integer constant folding so this is isn't really necessary. Also, that created problems with source modifiers in some cases (Iago) v4 (Jason): - do not change bcsel to work in terms of floats - leave ldexp generic Squashed changes to handle different bit sizes when constant folding since otherwise we would break the build. v2: - Use the bit-size information from the opcode information if defined (Iago) - Use helpers to get type size and base type of nir_alu_type enum (Sam) - Do not fallback to sized types to guess bit-size information. (Jason) Squashed changes in i965 and gallium/nir drivers to support sized types. These functions should only see sized types, but we can't make that change until we make sure that nir uses the sized versions in all the relevant places. A later commit will address this. Signed-off-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-03-17 11:16:33 +01:00
Connor Abbott	6700d7e423	nir: add nir_{src,dest}_bit_size() helpers v2: use a ternary (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-03-17 11:16:33 +01:00
Jason Ekstrand	e172dbe5d2	nir: Add a bit_size to nir_register and nir_ssa_def This really hacky commit adds a bit size to registers and SSA values. It also adds rules in the validator to validate that they do the right things. It's still an open question as to whether or not we want a bit_size in nir_alu_instr or if we just want to let it inherit from the destination. I'm inclined to just let it inherit from the destination. A similar question needs to be asked about intrinsics. v2 (Connor): - Relax validation: comparisons have explicit destination sizes and implicit source sizes. v3 (Sam): - Use helpers to get size and base types of nir_alu_type enum. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-03-17 11:16:33 +01:00
Jason Ekstrand	78f1919429	nir: Add explicitly sized types v2: Fix size/type mask to properly handle 8-bit types. v3: Add helpers to get the bitsize and base type of a nir_alu_type enum. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-03-17 11:16:33 +01:00
Jordan Justen	3fd308a357	Merge remote-tracking branch 'origin/master' into vulkan	2016-03-17 01:44:07 -07:00
Jordan Justen	b1e7cdfdcf	nir: Lower shared var atomics during nir_lower_io Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-03-17 01:23:40 -07:00
Jordan Justen	e3cbb9d37c	nir: Add support for lowering load/stores of shared variables Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-03-17 01:23:40 -07:00
Jordan Justen	683c359c54	nir: Add atomic operations on variables This allows us to first generate atomic operations for shared variables using these opcodes, and then later we can lower those to the shared atomics intrinsics with nir_lower_io. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-03-17 01:23:40 -07:00
Jordan Justen	3c807607df	nir: Add compute shader shared variable storage class Previously we were receiving shared variable accesses via a lowered intrinsic function from glsl. This change allows us to send in variables instead. For example, when converting from SPIR-V. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-03-17 01:23:40 -07:00
Jordan Justen	26f8262698	nir/print: Add space after shader_storage var mode Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-03-17 01:23:40 -07:00
Jason Ekstrand	7f6a0cb29c	Merge remote-tracking branch 'public/master' into vulkan	2016-03-15 14:09:50 -07:00
Jason Ekstrand	98d58e7320	nir/clone: Add support for cloning a single function_impl Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-03-12 15:48:36 -08:00
Jason Ekstrand	036b209484	nir/validate: Better function validation Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-03-12 15:48:36 -08:00
Jason Ekstrand	f86f3c90aa	nir/print: Better function argument printing Since we aren't going to put the function parameters or the return variable in the list of locals, it won't get a proper declaration. This changes nir_print to print the type along with each parameter or return variable. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-03-12 15:48:36 -08:00

... 4 5 6 7 8 ...

608 Commits