mirrors/mesa - Frog Git

Commit Graph

Author	SHA1	Message	Date
Chris Forbes	5bbb028ef3	glsl: Validate aux storage qualifier combination with other qualifiers. We've been allowing `centroid` and `sample` in all kinds of weird places where they're not valid. Insist that `sample` is combined with `in` or `out`; and that `centroid` is combined with `in`, `out`, or the deprecated `varying`. V2: Validate this in a more sensible place. This does require an extra case for uniform blocks members and struct members, though, since they don't go through the normal path. V3: Improve error message wording; eliminate redundant error generation for inputs in VS or outputs in FS. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-06-10 10:09:31 +12:00
Chris Forbes	b18b4c7d74	glsl: Implement overload resolution for ARB_gpu_shader5 V3: Move spec citation into the code. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-06-04 20:10:27 +12:00
Chris Forbes	c1ceadfc32	glsl: Add support for comparing function parameter conversions The ARB_gpu_shader5 spec says: "To determine whether the conversion for a single argument in one match is better than that for another match, the following rules are applied, in order: 1. An exact match is better than a match involving any implicit conversion. 2. A match involving an implicit conversion from float to double is better than a match involving any other implicit conversion. 3. A match involving an implicit conversion from either int or uint to float is better than a match involving an implicit conversion from either int or uint to double. If none of the rules above apply to a particular pair of conversions, neither conversion is considered better than the other." V3: Add spec citation, including oddball difference between gs5 and GLSL 4.0; comment a bit better as per Jordan's suggestions. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-06-04 20:03:08 +12:00
Chris Forbes	59dd444cac	glsl: Build a list of inexact function matches This will facilitate GLSL 4.0 / ARB_gpu_shader5's enhanced overload resolution rules, and also possibly better error reporting for ambiguous function calls. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-06-04 19:49:34 +12:00
Chris Forbes	6ae787584d	glsl: Allow int -> uint implicit conversions on function parameters V2: Fix crashes during linking, where the parse state is NULL. In this case, all required checks have already been done, so we assume the extension is enabled. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-06-04 19:35:59 +12:00
Chris Forbes	f17428a276	glsl: Pass parse state to can_implicitly_convert_to() Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-06-04 19:35:57 +12:00
Chris Forbes	a78c663c22	glsl: Pass parse state to parameter_lists_match() The available implicit conversions depend on the GLSL version we're compiling. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-06-04 19:35:54 +12:00
Chris Forbes	240974e93f	glsl: Add support for int -> uint implicit conversions This is required for ARB_gpu_shader5. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-06-04 19:35:51 +12:00
Chris Forbes	1ace51f091	glsl: Clean up apply_implicit_conversion We're about to add new implicit conversions, first for ARB_gpu_shader5, and then later for ARB_gpu_shader_fp64. Pull out the opcode determination into its own function, and get rid of the bool -> float case that could never be hit anyway [since it fails the is_numeric() check]. V2: Retain the vector width mangling. It turns out this is necessary for the conversions done (and then thrown away) when determining the return type of arithmetic operators. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-06-04 19:35:47 +12:00
Chris Forbes	345034869e	glsl: Allow `precise` as a parameter qualifier Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-06-04 18:56:09 +12:00
Chris Forbes	d0495c6db8	glsl: Disallow `precise` redeclarations of vars from outer scopes Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-06-04 18:56:08 +12:00
Chris Forbes	5ecffe5a3a	glsl: Add support for `precise` redeclarations This works like glsl-1.20+'s invariant redeclarations, but with fewer restrictions, since `precise` is allowed on pretty much anything. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-06-04 18:56:05 +12:00
Chris Forbes	4b756b20c4	glsl: add support for `precise` in type_qualifier Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-06-04 18:56:03 +12:00
Chris Forbes	37ab3ddbf8	glsl: remove outdated comment, move sample to correct block Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-06-04 18:55:49 +12:00
Matt Turner	ac25cf55af	glsl: Make most ir_instruction::as_subclass() functions non-virtual. There are several common ways to check whether an object is a particular subclass: dynamic_cast<>, the as_subclass() pattern, or explicit enum tags. We originally used the virtual as_subclass methods, but later added enum tags as they are much nicer for debugging. Since we have the enum tags, we don't necessarily need to use virtual functions to implement the as_subclass() methods. We can just check the tag and return the pointer or NULL. This saves 18 entries in the vtable, and instead of two pointer dereferences per as_subclass() call most are only three inline instructions. Compile time of sam3/112.frag (the longest compile in a recent shader-db run) is reduced by 5% from 348 to 329 ms (n=500). perf stat of this workload shows: 24.14% reduction in iTLB-loads: 285,543 -> 216,606 42.55% reduction in iTLB-load-misses: 18,785 -> 10,792 Reviewed-by: Juha-Pekka Heikkila <juhapekka.heikkila@gmail.com>	2014-06-03 17:58:34 -07:00
Matt Turner	773544f0e9	glsl: Move ir_type_unset to end of enumeration. Now that the constructors set a type, ir_type_unset is not very useful. Move it to the end of the enum (specifically out of position 0) so that enum checks for dereferences and rvalues can save an instruction. Reviewed-by: Juha-Pekka Heikkila <juhapekka.heikkila@gmail.com>	2014-06-03 17:58:34 -07:00
Matt Turner	943cc7ff17	glsl: Reorder ir_type_* enum for easier comparisons. Makes checking whether an object is an ir_dereference, an ir_rvalue, or an ir_jump simpler. Since ir_dereference is a subclass of ir_rvalue, list its subtypes first so that they can both generate nice code. Reviewed-by: Juha-Pekka Heikkila <juhapekka.heikkila@gmail.com>	2014-06-03 17:58:34 -07:00
Matt Turner	3540b5eb55	glsl: Remove useless call to as_rvalue(). The type returned by hir() is already an ir_rvalue pointer. Reviewed-by: Juha-Pekka Heikkila <juhapekka.heikkila@gmail.com>	2014-06-03 17:58:34 -07:00
Ian Romanick	963bd99f03	glsl: Set ir_instruction::ir_type in the base class constructor This has the added perk that if you forget to set ir_type in the constructor of a new subclass (or a new constructor of an existing subclass) the compiler will tell you... instead of relying on ir_validate or similar run-time detection. Reviewed-by: Juha-Pekka Heikkila <juhapekka.heikkila@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2014-06-03 17:58:34 -07:00
Tapani Pälli	56bdffe8c1	scons: add common.c as part of glcpp build to have _mesa_error_no_memory function available Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=79440 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Juha-Pekka Heikkila <juhapekka.heikkila@gmail.com>	2014-05-30 10:11:44 +03:00
Tapani Pälli	c692581ae8	glcpp: link with tests/common.c So that prog_hash_table can use _mesa_error_no_memory function. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Juha-Pekka Heikkila <juhapekka.heikkila@gmail.com>	2014-05-30 09:22:24 +03:00
Juha-Pekka Heikkila	19f1d137f8	glsl: Add null check in loop_analysis.cpp Check return value from hash_table_find before using it as a pointer Signed-off-by: Juha-Pekka Heikkila <juhapekka.heikkila@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-05-30 07:21:12 +03:00
Connor Abbott	fc7e7cfabc	glsl/tests: remove generated tests from the repo They were made unnecessary by the last commit. Signed-off-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-05-28 15:07:07 -07:00
Connor Abbott	a1d8322fbb	glsl/tests: call create_test_cases.py in optimization-test This way, when someone modifies create_test_cases.py and forgets to commit their changes again, people will notice. v2: make sure we parse the right directories and check for existence the right way. v3 (Ken): Use $PYTHON2 instead of calling python directly. Signed-off-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-05-28 15:06:45 -07:00
Connor Abbott	6e24111b9c	glsl/tests/lower_jumps: fix generated sexpr's for loops In `088494aa` (as well as other commits in the series) Paul Berry modified the tests for lower_jumps to account for the fact that the s-expression for the loop IR instruction changed from (loop () () () () (statements...)) to (loop (statements...)), but he forgot to update create_test_cases.py which he used to create the tests. Fix that, so that now create_test_cases.py is synced with the generated tests. Signed-off-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-05-28 15:06:16 -07:00
Connor Abbott	bbaec0f76c	glsl: be more consistent about printing constants Make sure that we print the same number of digits when printing 0.0 as any other floating-point number. This will make generating expected output files for tests easier. To avoid breaking "make check," update the generated tests for lower_jumps before the next commit which will bring create_test_cases.py in line with them. Signed-off-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-05-28 15:05:59 -07:00
Brian Paul	a7aca3919b	glsl: replace strncmp("gl_") calls with new is_gl_identifier() helper Makes things a little easier to read. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-05-28 15:06:07 -06:00
Brian Paul	f9cecca7a6	glsl: fix use-after free bug/crash in ast_declarator_list::hir() The call to get_variable_being_redeclared() may delete 'var' so we can't reference var->name afterward. We fix that by examining the var's name before making that call. Fixes valgrind warnings and possible crash when running the piglit tests/spec/glsl-1.30/execution/clipping/vs-clip-distance-in-param.shader_test test (and probably others). Cc: "10.1 10.2" <mesa-stable@lists.freedesktop.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-05-28 15:06:07 -06:00
Matt Turner	9b0108ddc1	glsl: Add C-callable fprint_ir function. Acked-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-05-15 15:45:39 -07:00
Jonathan Gray	0c0bbe77d0	glsl: simplify the M_PIf macros, fixes build on OpenBSD The M_PIf macros used a preprocessor paste to append 'f' to M_PI defines, which works if the values are only numbers but breaks on OpenBSD where M_PI definitions have casts and brackets to meet requirements of a future version of POSIX, http://austingroupbugs.net/view.php?id=801 http://austingroupbugs.net/view.php?id=828 Simplify the M_PI*f macros by using casts directly in the defines as suggested by Kenneth Graunke. Cc: "10.2" <mesa-stable@lists.freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=78665 Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Jonathan Gray <jsg@jsg.id.au>	2014-05-13 22:30:22 -07:00
José Fonseca	1646f4d0fb	ralloc: Omit detailed license information about talloc. That information misleads source code auditing tools to think that ralloc itself is released under LGPL v3. Instead, simply state talloc is not licensed under a permissive license. v2: Use wording suggested by Kenneth. Reviewed-by: Brian Paul <brianp@vmware.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2014-05-13 12:48:38 +01:00
Iago Toral Quiroga	a143fbb322	glsl: Do not call lhs->variable_referenced() multiple times Instead take the result from the first call and use it where needed. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-05-13 10:01:02 +02:00
Timothy Arceri	9c9dd8ca93	glsl: the number of samplers is already calculated so use it Signed-off-by: Timothy Arceri <t_arceri@yahoo.com.au> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-05-13 07:40:08 +10:00
Chris Forbes	417f5ea00d	glsl: Rename linker's is_varying_var Both the ast->IR and linker have functions with this name, but different behavior. Rename the linker's version to var_counts_against_varying_limit to be closer to what it is actually used for. Suggested by Ian a while back. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2014-05-10 09:29:13 +12:00
Ian Romanick	f7bf37cb13	linker: Fix consumer_inputs_with_locations indexing In an earlier incarnation of populate_consumer_input_sets and get_matching_input, the consumer_inputs_with_locations array was indexed using the user-specified location. In that version, only user-defined varyings were included in the array. In the current incarnation, the Mesa location is used to index the array, and built-in varyings are included. This change fixes the unit test to expect gl_ClipDistance in the array, and it resizes the arrays to actually be big enough. It's just dumb luck that the existing piglit tests use small enough locations to not stomp the stack. :( Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=78258 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "10.2" <mesa-stable@lists.freedesktop.org> Cc: Vinson Lee <vlee@freedesktop.org>	2014-05-07 09:50:14 -07:00
Tapani Pälli	e65917f94e	glsl: fix bogus layout qualifier warnings Print out GL_ARB_explicit_attrib_location warnings only when parsing attribute that uses "location" qualifier. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=77245 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Cc: "10.1 10.2" <mesa-stable@lists.freedesktop.org>	2014-05-06 08:36:40 +03:00
Ian Romanick	59ad2e6696	mesa: Add _mesa_error_no_memory for logging out-of-memory messages This can be called from locations that don't have a context pointer handy. This patch also adds enough infrastructure so that the unit tests for the GLSL compiler and the stand-alone compiler will build and function. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Juha-Pekka Heikkila <juhapekka.heikkila@gmail.com>	2014-05-02 11:58:36 -07:00
Chia-I Wu	267e28bb62	glsl: make static constant variables "static const" This allows them to be moved to .rodata, and allow us to be sure that they will not be modified. Signed-off-by: Chia-I Wu <olv@lunarg.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Timothy Arceri <t_arceri@yahoo.com.au>	2014-05-02 10:50:14 -07:00
Ilia Mirkin	31b92aa2fc	glsl: add lowering passes for carry/borrow Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-05-02 12:01:35 -04:00
Ian Romanick	f64bfb2e39	mesa: Eliminate gl_shader_program::InternalSeparateShader This was a work-around to allow linking a program with only a fragment shader in a GLES context. Now that we have GL_EXT_separate_shader_objects in GLES contexts, we can just use that. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-05-02 07:20:11 -07:00
Ian Romanick	7d9adef340	mesa: Enable GL_EXT_separate_shader_objects for OpenGL ES Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-05-02 07:20:10 -07:00
Ian Romanick	507b875cf5	glsl: Sort the list of extensions ARB, OES, then everything else. If there's ever a KHR shading language extension, it should go between ARB and OES. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Acked-by: Eric Anholt <eric@anholt.net>	2014-05-02 07:20:10 -07:00
Ian Romanick	fb615feafb	mesa: Remove support for desktop OpenGL GL_EXT_separate_shader_objects I don't know of any applications that actually use it. Now that Mesa supports GL_ARB_separate_shader_objects in all drivers, this extension is just cruft. The entrypoints for the extension remain in the XML. This is done so that a new libGL will continue to provide dispatch support for old drivers that try to expose this extension. Future patches will add OpenGL ES GL_EXT_separate_shader_objects, but that's a different thing. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-05-02 07:20:10 -07:00
Ian Romanick	e608449d3e	mesa/sso: Enable GL_ARB_separate_shader_objects by default Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-05-02 07:20:08 -07:00
Ian Romanick	7ff937e579	linker: Modify cross_validate_outputs_to_inputs to match using explicit locations This will be used for GL_ARB_separate_shader_objects. That extension not only allows separable shaders to rendezvous by location, but it also allows traditionally linked shaders to rendezvous by location. The spec says: 36. How does the behavior of input/output interface matching differ between separable programs and non-separable programs? RESOLVED: The rules for matching individual variables or block members between stages are identical for separable and non-separable programs, with one exception -- matching variables of different type with the same location, as discussed in issue 34, applies only to separable programs. However, the ability to enforce matching requirements differs between program types. In non-separable programs, both sides of an interface are contained in the same linked program. In this case, if the linker detects a mismatch, it will generate a link error. v2: Make sure consumer_inputs_with_locations is initialized when consumer is NULL. Noticed by Chia-I. v3: Rebase on removal of ir_variable::user_location. v4: Replace a (stale) FINISHME with some good explanation comments from Eric. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-05-02 07:19:40 -07:00
Ian Romanick	d030a3404c	linker: Sort shader I/O variables into a canonical order v2: Rebase on removal of ir_variable::user_location. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-05-02 07:19:40 -07:00
Ian Romanick	c557eb7722	linker: Allow geometry shader without vertex shader for separable programs Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-05-02 07:19:40 -07:00
Ian Romanick	1ff5a2b1ba	linker: Assign varying locations for separable programs Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2014-05-02 07:19:40 -07:00
Ian Romanick	7d73c3e99e	linker: Allow consumer stage or producer stage to be NULL When linking a separable program that contains only a fragment shader, the producer will be NULL. Similar cases will exist with geometry shaders and, eventually, tessellation shaders. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-05-02 07:19:40 -07:00
Ian Romanick	fe37cb0ac6	linker: Refactor code that gets an input matching an output Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2014-05-02 07:19:40 -07:00
Ian Romanick	5699220cd5	glsl: Exit when the shader IR contains an interface block instance While writing the link_varyings::single_interface_input test, I discovered that populate_consumer_input_sets assumes that all shader interface blocks have been lowered to discrete variables. Since there is a pass that does this, it is a reasonable assumption. It was, however, non-obvious. Make the code fail when it encounters such a thing, and add a test to verify that behavior. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2014-05-02 07:19:40 -07:00
Ian Romanick	ba7195d126	glsl/tests: Add first simple tests of populate_consumer_input_sets Four initial tests: * Create an IR list with a single input variable and verify that variable is the only thing in the hash tables. * Same as the previous test, but use a built-in variable (gl_ClipDistance) with an explicit location set. * Create an IR list with a single input variable from an interface block and verify that variable is the only thing in the hash tables. * Create an IR list with a single input variable and a single input variable from an interface block. Verify that each is the only thing in the proper hash tables. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2014-05-02 07:19:39 -07:00
Ian Romanick	8f5852bd2b	linker: Refactor code that builds hash tables of varyings during linking I want to make some changes to this code, but first I want to make some unit tests for it... so that I can capture the pre- and post-invariants. Pulling the code out into its own function in a non-anonymous namespace enables that. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-05-02 07:19:39 -07:00
Ian Romanick	5998fd536a	linker: Make lower_packed_varyings work with explicit locations Don't do anything with variables that have explicitly assigned locations. This is also how built-in varyings are handled. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-05-02 07:16:54 -07:00
Ian Romanick	7016afe25d	glsl: Remove varying "base" parameters In February 2013 Paul unified the values used for shader stage outputs and shader stage inputs. See commits 8a076c5f0^..eed6baf76. Since that time, the location_base parameters are always VARYING_SLOT_VAR0. Instead of passing that around, just hard code it. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-05-02 07:16:54 -07:00
Ian Romanick	03488cd3b9	glsl: Constify parameter to a couple varying_matches methods Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-05-02 07:16:54 -07:00
Anuj Phogat	9bcb0a8532	glsl: Apply the link error conditions to GL_ARB_fragment_coord_conventions Link error conditions added in previous patch are equally applicable to GL_ARB_fragment_coord_conventions implementation. Extension's spec says: "If gl_FragCoord is redeclared in any fragment shader in a program, it must be redeclared in all the fragment shaders in that program that have a static use of gl_FragCoord. All redeclarations of gl_FragCoord in all fragment shaders in a single program must have the same set of qualifiers." Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-05-01 10:58:39 -07:00
Anuj Phogat	35f11e85cb	glsl: Link error if fs defines conflicting qualifiers for gl_FragCoord GLSL 1.50 spec says: "If gl_FragCoord is redeclared in any fragment shader in a program, it must be redeclared in all the fragment shaders in that program that have a static use gl_FragCoord. All redeclarations of gl_FragCoord in all fragment shaders in a single program must have the same set of qualifiers." This patch causes the shader link to fail if we have multiple fragment shaders with conflicting layout qualifiers for gl_FragCoord. V2: Restructure the code and add conditions to correctly handle the following case: fragment shader 1: layout(origin_upper_left) in vec4 gl_FragCoord; void main() { foo(); gl_FragColor = gl_FragData; } fragment shader 2: layout(pixel_center_integer) in vec4 gl_FragCoord; void foo() { } V3: Allow linking in the following case: fragment shader 1: void main() { foo(); gl_FragColor = gl_FragCoord; } fragment shader 2: in vec4 gl_FragCoord; void foo() { ... } Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-05-01 10:58:39 -07:00
Anuj Phogat	a751adf071	glsl: Compile error if fs uses gl_FragCoord before first redeclaration Section 4.3.8.1, page 39 of GLSL 1.50 spec says: "Within any shader, the first redeclarations of gl_FragCoord must appear before any use of gl_FragCoord." The GLSL compiler should generate an error in the following case: vec4 p = gl_FragCoord; layout(origin_upper_left) in vec4 gl_FragCoord; void main() { } Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-05-01 10:58:39 -07:00
Anuj Phogat	581e4acb0d	glsl: Compile error if fs defines conflicting qualifiers for gl_FragCoord GLSL 1.50 spec says: "If gl_FragCoord is redeclared in any fragment shader in a program, it must be redeclared in all the fragment shaders in that program that have a static use gl_FragCoord. All redeclarations of gl_FragCoord in all fragment shaders in a single program must have the same set of qualifiers." This patch makes the GLSL compiler generate an error if we have a fragment shader defined with conflicting layout qualifier declarations for gl_FragCoord. For example: layout(origin_upper_left, pixel_center_integer) in vec4 gl_FragCoord; layout(pixel_center_integer) in vec4 gl_FragCoord; void main() { } V2: Some code refactoring for better readability. Add compiler error conditions for redeclarations like: layout(origin_upper_left) in vec4 gl_FragCoord; layout(origin_upper_left, pixel_center_integer) in vec4 gl_FragCoord; and in vec4 gl_FragCoord; layout(origin_upper_left, pixel_center_integer) in vec4 gl_FragCoord; V3: Simplify function is_conflicting_fragcoord_redeclaration() V4: Check for null pointer before doing strcmp(var->name, "gl_FragCoord"). Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-05-01 10:58:39 -07:00
Anuj Phogat	8c61b6a99b	glsl: Allow overlapping locations for vertex input attributes Currently overlapping locations of input variables are not allowed for all the shader types in OpenGL and OpenGL ES. From OpenGL ES 3.0 spec, page 56: "Binding more than one attribute name to the same location is referred to as aliasing, and is not permitted in OpenGL ES Shading Language 3.00 vertex shaders. LinkProgram will fail when this condition exists. However, aliasing is possible in OpenGL ES Shading Language 1.00 vertex shaders." Taking into account what different versions of OpenGL and OpenGL ES specs say about aliasing: - It is allowed only on vertex shader input attributes in OpenGL (2.0 and above) and OpenGL ES 2.0. - It is explicitly disallowed in OpenGL ES 3.0. Fixes Khronos CTS failing test: explicit_attrib_location_vertex_input_aliased.test See the Khronos bug mentioned below for more details. V2: Fix the case where location exceeds the maximum allowed attribute location. V3: Simplify the condition added in V2. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Cc: "9.2 10.0 10.1" <mesa-stable@lists.freedesktop.org> Bugzilla: Khronos #9609 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-05-01 10:58:39 -07:00
Chris Forbes	151a20dcd4	glsl: fix spelling of derived Signed-off-by: Chris Forbes <chrisf@ijw.co.nz>	2014-04-27 21:37:23 +12:00
Matt Turner	18993f7892	glsl: Use properly typed arguments for bitfieldInsert. bitfieldInsert takes scalar integers for its last two arguments. Since bitfieldInsert is lowered on i965 to two instructions that have more flexible arguments, I didn't notice when I wrote this. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2014-04-25 19:24:39 -07:00
Chris Forbes	0dfa6e7cf5	glsl: Only allow `invariant` on shader in/out between stages. Previously this was special-cased for VS and FS; it never got updated when geometry shaders came along. Generalize using is_varying_var() so this won't be broken again with tessellation. Note that there are two copies of the logic for `invariant`: It can be present as part of a new declaration, and also as a redeclaration of an existing variable or block member. Fixes the four new piglits: spec/glsl-1.50/compiler/invariant-qualifier-*.geom Note for stable: This won't quite pick cleanly due to whitespace and state->target -> state->stage renames. Should be straightforward adjustments though. Cc: "10.0 10.1" <mesa-stable@lists.freedesktop.org> Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-04-22 09:07:05 +12:00
Chris Forbes	9fec560e63	glsl: Fix typo Signed-off-by: Chris Forbes <chrisf@ijw.co.nz>	2014-04-21 16:02:02 +12:00
Chris Forbes	92840aabf7	glsl: Allow explicit binding on atomics again As of `943b2d52bf`, layout(binding) on an atomic would fail the assertion here. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-18 10:35:05 -07:00
Iago Toral Quiroga	cda5e0c25e	glsl: Small optimization for constant conditionals Once the relevant branch has been identified do not iterate over the instructions in the branch, do a linked list insertion instead to avoid the loop. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-16 23:39:57 -07:00
Iago Toral Quiroga	4472ab9e6d	glsl: Fix incorrect indentation. Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-16 23:22:24 -07:00
Anuj Phogat	80b4a36fed	glsl: Fix copy-paste error in linker_warning() Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-16 18:37:06 -07:00
Iago Toral Quiroga	6d0e30c6a3	glsl: Properly handle blocks that define the same field name. Currently we can have name space collisions between blocks that define the same fields. For example: in block { vec4 Color; } In[]; out block { vec4 Color; } Out; These two blocks will assign the same interface name (block.Color) to the Color field in flatten_named_interface_blocks_declarations.cpp, leading to havoc. This was breaking badly the gl-320-primitive-shading test from ogl-samples. The patch uses the block instance name to avoid collisions, producing names like block.In.Color and block.Out.Color to avoid the name clash. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=76394 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-15 22:18:43 -07:00
Matt Turner	d877c643be	glsl: Use M_PI_* macros. Notice our multiple values for M_PI_2, which rounded ...32 up to ...4 and ...5.	2014-04-15 09:24:09 -07:00
Chris Forbes	aeb03f8aea	glsl: Fix typo in interface block comment Signed-off-by: Chris Forbes <chrisf@ijw.co.nz>	2014-04-13 17:02:11 +12:00
Kenneth Graunke	857f3a68ea	glsl: Ignore loop-too-large heuristic if there's bad variable indexing. Many shaders use a pattern such as: for (int i = 0; i < NUM_LIGHTS; i++) { ...access a uniform array, or shader input/output array... } where NUM_LIGHTS is a small constant (such as 2, 4, or 8). The expectation is that the compiler will unroll those loops, turning the array access into constant indexing, which is more efficient, and which may enable array splitting and other optimizations. In many cases, our heuristic fails - either there's another tiny nested loop inside, or the estimated number of instructions is just barely beyond the threshold. So, we fail to unroll the loop, leaving the variable indexing in place. Drivers which don't support the particular flavor of variable indexing will call lower_variable_index_to_cond_assign(), which generates piles and piles of immensely inefficient code. We'd like to avoid generating that. This patch detects unsupported forms of variable-indexing in loops, where the array index is a loop induction variable. In that case, it bypasses the loop-too-large heuristic and forces unrolling. Improves performance in various microbenchmarks: Gl32PSBump8 by 47%, Gl32ShMapVsm by 80%, and Gl32ShMapPcf by 27%. No changes in shader-db. v2: Check ir->array for being an array or matrix, rather than the ir_dereference_array itself. v3: Fix and expand statistics in commit message. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-04-11 17:41:43 -07:00
Kenneth Graunke	2231db5598	glsl: Rename loop_unroll_count::fail to "nested_loop." The "fail" flag is set if loop_unroll_count encounters a nested loop; calling the flag "nested_loop" is a bit clearer. The original reasoning was that count is inaccurate (too small) if there are nested loops, as we don't do any sort of analysis on the inner loop. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-04-11 17:41:41 -07:00
Kenneth Graunke	8268a2f347	glsl: Pass gl_shader_compiler_optimizations to unroll_loops(). Loop unrolling will need to know a few more options in the future. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-04-11 17:41:39 -07:00
Kenneth Graunke	da22221aa3	glsl: Drop do_common_optimization's max_unroll_iterations parameter. Now that we pass in gl_shader_compiler_options, it makes sense to just use options->MaxUnrollIterations, rather than passing a separate parameter. Half of the invocations already passed options->MaxUnrollIterations, while the other half passed in a hardcoded value of 32. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-04-11 17:41:37 -07:00
Kenneth Graunke	ae2a03b573	glsl: Try vectorizing when seeing a repeated assignment to a channel. When considering assignment expressions like: v.x += u.x; v.x += u.x; the vectorizer would incorrectly keep going, attempting to find more instructions to vectorize. It would overwrite the saved assignment to point at the second one, and increment channels a second time, resulting in try_vectorize thinking the expression was a vec2 instead of a float. Instead, if we see a repeated assignment to a channel, just try to vectorize everything we've found so far. This clears the saved state so it will start over. Fixes Piglit's repeated-channel-assignments.vert. Cc: "10.1" <mesa-stable@lists.freedesktop.org> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-04-11 12:39:37 -07:00
Ian Romanick	625cf8c874	glsl: Propagate explicit binding information from the AST all the way to the linker Information about the binding was not being properly communicated from the front-end compiler to the linker. As a result, the linker never knew that any UBOs had explicit bindings! Fixes the piglit test arb_shading_language_420pack-binding-layout. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=76323 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Tested-by: github@socker.lepus.uberspace.de [v0] Cc: "10.1" <mesa-stable@lists.freedesktop.org> Cc: github@socker.lepus.uberspace.de	2014-04-11 12:26:01 -07:00
Ian Romanick	25a6656875	linker: Set binding for all elements of UBO array Previously, a UBO like layout(binding=2) uniform U { ... } my_constants[4]; wouldn't get any bindings set. The code would try to set the binding of U, but that would fail. It should instead set the bindings for U[0], U[1], ... Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=76323 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "10.1" <mesa-stable@lists.freedesktop.org> Cc: github@socker.lepus.uberspace.de	2014-04-11 12:26:01 -07:00
Ian Romanick	cc42717b50	linker: Set block bindings based on UniformBlocks rather than UniformStorage For blocks, gl_shader_program::UniformStorage isn't very useful. The names stored there are the names of the elements of the block, so finding blocks with an instance name is hard. There is also only one entry in ::UniformStorage for each element of a block array, and that is a deal breaker. Using ::UniformBlocks is what _mesa_GetUniformBlockIndex does. I contemplated sharing code between set_block_binding and _mesa_GetUniformBlockIndex, but building the stand-alone compiler and the unit tests make this hard. I plan to return to this effort shortly. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=76323 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "10.1" <mesa-stable@lists.freedesktop.org> Cc: github@socker.lepus.uberspace.de	2014-04-11 12:26:01 -07:00
Ian Romanick	157391a41b	linker: Clean up "unused parameter" warnings ../../src/glsl/link_uniform_initializers.cpp:87:1: warning: unused parameter 'mem_ctx' [-Wunused-parameter] ../../src/glsl/link_uniform_initializers.cpp:87:1: warning: unused parameter 'type' [-Wunused-parameter] ../../src/glsl/link_uniform_initializers.cpp:127:1: warning: unused parameter 'mem_ctx' [-Wunused-parameter] ../../src/glsl/link_uniform_initializers.cpp:127:1: warning: unused parameter 'type' [-Wunused-parameter] Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=76323 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "10.1" <mesa-stable@lists.freedesktop.org> Cc: github@socker.lepus.uberspace.de	2014-04-11 12:26:01 -07:00
Ian Romanick	943b2d52bf	linker: Fold set_uniform_binding into call site In the next patch, we'll see that using gl_shader_program::UniformStorage is not correct for uniform blocks. That means we can't use ::UniformStorage to select between the sampler path and the block path. Instead we want to just use the type of the variable. That's never passed to set_uniform_binding, and it's easier to just remove the function (especially for later patches in the series) than to add another parameter. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=76323 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "10.1" <mesa-stable@lists.freedesktop.org> Cc: github@socker.lepus.uberspace.de	2014-04-11 12:26:01 -07:00
Ian Romanick	881c52f13f	linker: Various trivial clean-ups in set_sampler_binding - Remove the spurious block left from the previous commit and re-indent. - Constify elements. - Make the spec reference in the code look like other spec references in the compiler. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=76323 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "10.1" <mesa-stable@lists.freedesktop.org> Cc: github@socker.lepus.uberspace.de	2014-04-11 12:26:01 -07:00
Ian Romanick	6e2f63b69e	linker: Split set_uniform_binding into separate functions for blocks and samplers The two code paths are quite different, and there are some problems in the handling of uniform blocks. Future changes will cause these paths to diverge further. Ultimately, selecting between the two functions will happen at the set_uniform_binding call site, and set_uniform_binding will be deleted. NOTE: This patch just moves code around. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=76323 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "10.1" <mesa-stable@lists.freedesktop.org> Cc: github@socker.lepus.uberspace.de	2014-04-11 12:26:01 -07:00
Eric Anholt	8c2bfbc6b9	glsl: Move tree grafting's debug output to stderr. The rest of our compiler dumps are there, now. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-08 00:59:48 -07:00
Eric Anholt	e9822f77a9	glsl: Skip making a temporary for assignments when we don't need one. While we wish our optimization passes could identify all the cases where we can coalesce our variables, we miss out on a lot of opportunities. total instructions in shared programs: 1673849 -> 1673166 (-0.04%) instructions in affected programs: 299521 -> 298838 (-0.23%) GAINED: 7 LOST: 0 Note that many programs are "hurt". The notable ones are where we produce unrolling in cases we didn't before (presumably just because of the lower instruction count). But there are also some cases where pushing things right into the variables prevents copy propagation and tree grafting, since we don't split our variable usage webs apart. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-08 00:59:47 -07:00
Kenneth Graunke	73f80c20f6	glsl: Pass ctx->Const.NativeIntegers to do_algebraic. The next patch will introduce an optimization that only works when integers are not represented as floating point values. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-04-08 00:02:06 -07:00
Kenneth Graunke	169c645f12	glsl: Pass ctx->Const.NativeIntegers to do_common_optimization(). The next few patches will introduce an optimization that only works when integers are not represented as floating point values. v2: Re-word-wrap a line, as requested by Ian Romanick. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-04-08 00:02:03 -07:00
Kenneth Graunke	40d9337406	glsl: Validate that base types match for a number of binops. The IR is not supposed to support implicit type conversions; we just failed to validate it. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-04-08 00:02:01 -07:00
Kenneth Graunke	e14b93371c	glsl: Fix lack of i2u in lower_ubo_reference. ir_binop_ubo_load takes unsigned integer operands. However, the array index used to compute these offsets may be a signed integer. (For example, see Piglit's spec/glsl-1.40/uniform_buffer/fs-bvec-array). For some reason, we were missing an ir_binop_i2u cast, and ir_validator was failing to catch that. Without this change, ir_builder's type inference code broke for me when writing a new optimization pass. Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-04-08 00:01:58 -07:00
Kenneth Graunke	7540be22d1	glsl: Make is_16bit_constant from i965 an ir_constant method. The i965 MUL instruction doesn't natively support 32-bit by 32-bit integer multiplication; additional instructions (MACH/MOV) are required. However, we can avoid those if we know one of the operands can be represented in 16 bits or less. The vector backend's is_16bit_constant static helper function checks for this. We want to be able to use it in the scalar backend as well, which means moving the function to a more generally-usable location. Since it isn't i965 specific, I decided to make it an ir_constant method, in case it ends up being useful to other people as well. v2: Rename from is_16bit_integer_constant to is_uint16_constant, as suggested by Ilia Mirkin. Update comments to clarify that it does apply to both int and uint types, as long as the value is non-negative and fits in 16-bits. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-04-08 00:01:53 -07:00
Matt Turner	34ec1a24d6	glsl: Optimize (x + y cmp 0) into (x cmp -y). Cuts a small handful of instructions in Serious Sam 3: instructions in affected programs: 4692 -> 4666 (-0.55%) Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-04-05 09:47:37 -07:00
Chia-I Wu	5d76e44643	glsl: remove UBO fields from _mesa_glsl_parse_state They are not needed since `514f8c7ec7`. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-04-01 13:41:20 +08:00
Ian Romanick	4047263cb1	glsl: Clean up "unused parameter" warnings ../../src/glsl/builtin_functions.cpp:72:1: warning: unused parameter 'state' [-Wunused-parameter] ../../src/glsl/ir_clone.cpp:31:1: warning: unused parameter 'ht' [-Wunused-parameter] ../../src/glsl/ir_equals.cpp:44:1: warning: unused parameter 'ir' [-Wunused-parameter] ../../src/glsl/ir_equals.cpp:50:1: warning: unused parameter 'ignore' [-Wunused-parameter] ../../src/glsl/ir_equals.cpp:68:1: warning: unused parameter 'ignore' [-Wunused-parameter] ../../src/glsl/ir_print_visitor.cpp:149:6: warning: unused parameter 'ir' [-Wunused-parameter] ../../src/glsl/ir_print_visitor.cpp:556:1: warning: unused parameter 'ir' [-Wunused-parameter] ../../src/glsl/ir_print_visitor.cpp:562:1: warning: unused parameter 'ir' [-Wunused-parameter] ../../src/glsl/link_uniforms.cpp:213:1: warning: unused parameter 'record_type' [-Wunused-parameter] ../../src/glsl/loop_analysis.cpp:225:1: warning: unused parameter 'ir' [-Wunused-parameter] ../../src/glsl/loop_unroll.cpp:73:30: warning: unused parameter 'ir' [-Wunused-parameter] ../../src/glsl/loop_unroll.cpp:79:30: warning: unused parameter 'ir' [-Wunused-parameter] ../../src/glsl/loop_unroll.cpp:85:30: warning: unused parameter 'ir' [-Wunused-parameter] ../../src/glsl/opt_copy_propagation_elements.cpp:189:1: warning: unused parameter 'ir' [-Wunused-parameter] ../../src/glsl/opt_cse.cpp:402:1: warning: unused parameter 'ir' [-Wunused-parameter] ../../src/glsl/opt_dead_code_local.cpp:117:30: warning: unused parameter 'ir' [-Wunused-parameter] ../../src/glsl/opt_redundant_jumps.cpp:53:1: warning: unused parameter 'ir' [-Wunused-parameter] ../../src/glsl/opt_vectorize.cpp:301:1: warning: unused parameter 'ir' [-Wunused-parameter] Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-03-28 10:57:58 -07:00
Ian Romanick	22128e30f3	glsl: Move Doxygen block closing to the correct place This is the closing for the "\defgroup IR Intermediate representation nodes" all the way at the top of the file. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-03-28 10:57:49 -07:00
Chia-I Wu	e7f7574598	glsl: remove {add,get}_type_ast from glsl_symbol_table They are not needed since `0da1a2cc36`. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-03-28 10:59:49 +08:00
Ian Romanick	c4cec40883	glsl: Clean up "unused parameter" warnings ../../src/glsl/ir_constant_expression.cpp:486:1: warning: unused parameter 'variable_context' [-Wunused-parameter] ../../src/glsl/ir_constant_expression.cpp:1633:1: warning: unused parameter 'variable_context' [-Wunused-parameter] ../../src/glsl/ir_constant_expression.cpp:1752:1: warning: unused parameter 'variable_context' [-Wunused-parameter] ../../src/glsl/ir_constant_expression.cpp:1761:1: warning: unused parameter 'variable_context' [-Wunused-parameter] ../../src/glsl/ir_constant_expression.cpp:1769:1: warning: unused parameter 'variable_context' [-Wunused-parameter] Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2014-03-25 12:09:36 -07:00
Ian Romanick	f3ab987b70	glsl: Minor clean ups in constant_referenced These could probably be squashed into one of the previous commits. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2014-03-25 12:09:36 -07:00
Ian Romanick	6429d6276d	glsl: Remove ir_dereference::constant_referenced All of the functionality is implemented in a private function in the one file where it is used. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2014-03-25 12:09:36 -07:00
Ian Romanick	bb0d6db974	glsl: Fold implementation of ir_dereference_array::constant_referenced into wrapper Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2014-03-25 12:09:36 -07:00
Ian Romanick	35bf94f901	glsl: Fold implementation of ir_dereference_record::constant_referenced into wrapper Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2014-03-25 12:09:36 -07:00
Ian Romanick	b66319b006	glsl: Fold implementation of ir_dereference_variable::constant_referenced into wrapper Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2014-03-25 12:09:36 -07:00
Ian Romanick	14f0faacb6	glsl: Add wrapper function that calls ir_dereference::constant_referenced Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2014-03-25 12:09:36 -07:00
Ian Romanick	c11c7e4f01	glsl: Group all of the constant_referenced functions together Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2014-03-25 12:09:36 -07:00
Matt Turner	c049dd4396	glsl: Allow dot() on scalars, and throw out dotlike(). In all uses of dotlike() we're writing generic code that operates on 1-4 component vectors. That our IR requires ir_binop_dot expressions' operands to be 2+ component vectors is an implementation detail that's not important when implementing built-in functions with dot(), which is defined for scalar floats in GLSL. Reviewed-by: Eric Anholt <eric@anholt.net>	2014-03-18 23:20:29 -07:00
Matt Turner	6cbc64c3cb	glsl: Optimize pow(x, 2) into x * x. Cuts two instructions out of SynMark's Gl32VSInstancing benchmark. Reviewed-by: Eric Anholt <eric@anholt.net>	2014-03-18 23:20:29 -07:00
Matt Turner	9a9eaaa79a	glsl: Match whitespace changes from previous patch. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-03-18 23:20:29 -07:00
Matt Turner	7988b4804f	glsl: Expose pack/unpack built-ins for ARB_gpu_shader5. ARB_gpu_shader5 and ES 3.0 expose different subsets of ARB_shading_language_packing. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-03-18 23:20:29 -07:00
Eric Anholt	2dbebbd37d	glsl: Improve debug output and variable names for opt_dead_code_local. I know this code has confused others, and it confused me 3 years later, too. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2014-03-14 13:02:05 -07:00
Ian Romanick	87c66a4ff7	glsl: Fix typo Remove extra "any" and re-word-wrap the comment. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-03-12 11:16:50 -07:00
Ian Romanick	6bdc1d96c3	glsl: Rewrite unrolled link_invalidate_variable_locations calls as a loop Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-03-12 11:16:50 -07:00
Jonathan Gray	0d6f573f6e	glsl: Link glsl_compiler with pthreads library. Fixes the following build error on OpenBSD: ./.libs/libglsl.a(builtin_functions.o)(.text+0x973): In function `mtx_lock': ../../include/c11/threads_posix.h:195: undefined reference to `pthread_mutex_lock' ./.libs/libglsl.a(builtin_functions.o)(.text+0x9a5): In function `mtx_unlock': ../../include/c11/threads_posix.h:248: undefined reference to `pthread_mutex_unlock' Signed-off-by: Jonathan Gray <jsg@jsg.id.au> Reviewed-by: Brian Paul <brianp@vmware.com>	2014-03-11 08:47:12 -06:00
Emil Velikov	90a4ffdea5	automake: use only the folder name if it's a subfolder of the present one v2: Resolve rebase conflicts. Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Jon TURNEY <jon.turney@dronecode.org.uk>	2014-03-11 12:50:41 +00:00
Emil Velikov	b15b1fbb51	automake: silence folder creation There is little gain in printing whenever a folder is created. v2: - Use $(AM_V_at) over @ to have control in verbose builds. Suggested by Erik Faye-Lund. Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Jon TURNEY <jon.turney@dronecode.org.uk>	2014-03-11 12:50:41 +00:00
Tapani Pälli	56b1be4399	mesa/glsl: introduce a remap table for uniform locations Patch adds a remap table for uniforms that is used to provide a mapping from application specified uniform location to actual location in the UniformStorage. Existing UniformLocationBaseScale usage is removed as table can be used to set sequential values for array uniform elements. This mapping helps to implement GL_ARB_explicit_uniform_location so that uniforms locations can be reorganized and handled in a more easy manner. v2: small fixes + rename parameters for merge and split functions (Ian) improve documentation, remove old check for location bounds (Eric) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-03-10 09:46:24 +02:00
Sir Anthony	6e39a8f6ec	glcpp: Do not remove spaces to preserve locations. After preprocessing by glcpp, all adjacent spaces were replaced by a single one and the glsl parser received column-shifted shader source. It negatively affected ast location set up and produced wrong error messages for heavily-spaced shaders. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-03-08 01:38:32 -08:00
Sir Anthony	da2275cd9b	glsl: Change locations from yylloc to appropriate tokens positions. Reviewed-by: Carl Worth <cworth@cworth.org>	2014-03-08 01:29:00 -08:00
Sir Anthony	5656775cf6	glsl: Add ast_node method to set location range. Reviewed-by: Carl Worth <cworth@cworth.org>	2014-03-08 01:29:00 -08:00
Sir Anthony	654ee41cd3	glsl: Make ast_node location comments more informative. Reviewed-by: Carl Worth <cworth@cworth.org>	2014-03-08 01:29:00 -08:00
Sir Anthony	433d562ac6	glsl: Extend ast location structure to handle end token position. Reviewed-by: Carl Worth <cworth@cworth.org>	2014-03-08 01:29:00 -08:00
Sir Anthony	6984aa4350	glsl: Update lexers in glsl and glcpp to handle end position of token. Reviewed-by: Carl Worth <cworth@cworth.org>	2014-03-08 01:29:00 -08:00
Brian Paul	ef8a19ed4f	glsl: fix compiler warnings in link_uniforms.cpp With a non-debug build, gcc has two complaints: 1. 'found' var not used. Silence with '(void) found;' 2. 'id' not initialized. It's assigned by the UniformHash->get() call, actually. But init it to zero to silence gcc. Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-03-06 07:45:36 -07:00
Brian Paul	f19000550d	glsl: switch to c11 mutex functions Reviewed-by: José Fonseca <jfonseca@vmware.com>	2014-03-03 13:08:58 -07:00
Kenneth Graunke	3f37dd913f	glsl: Fix broken LRP algebraic optimization. opt_algebraic was translating lrp(x, 0, a) into add(x, -mul(x, a)). Unfortunately, this references "x" twice, which is invalid in the IR, leading to assertion failures in the validator. Normally, cloning IR solves this. However, "x" could actually be an arbitrary expression tree, so copying it could result in huge piles of wasted computation. This is why we avoid reusing subexpressions. Instead, transform it into mul(x, add(1.0, -a)), which is equivalent but doesn't need two references to "x". Fixes a regression since `d5fa8a9562`, which isn't in any stable branches. Fixes 18 shaders in shader-db (bastion and yofrankie). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-03-02 13:35:03 -08:00
Matt Turner	4bd7f1d044	glsl: Don't vectorize horizontal expressions. Cc: "10.1" <mesa-stable@lists.freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=75224	2014-02-28 10:37:52 -08:00
Matt Turner	5eff8576ba	glsl: Add is_horizontal() method to ir_expression. Cc: "10.1" <mesa-stable@lists.freedesktop.org>	2014-02-28 10:37:46 -08:00
Matt Turner	d5fa8a9562	glsl: Optimize lrp(x, 0, a) into x - (x * a). Helps one program in shader-db: instructions in affected programs: 96 -> 92 (-4.17%) Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-02-28 10:36:12 -08:00
Matt Turner	ecc6c3d4ab	glsl: Optimize lrp(0, y, a) into y * a. Helps two programs in shader-db: instructions in affected programs: 254 -> 234 (-7.87%) Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-02-28 10:36:06 -08:00
Brian Paul	863a1f7757	glsl: add switch case for MESA_SHADER_COMPUTE To fix warning about unhandled enum value. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2014-02-26 13:29:16 -07:00
Kenneth Graunke	ac0a8b9540	glsl: Delete LRP_TO_ARITH lowering pass flag. It's kind of a trap---calling do_common_optimization() after lower_instructions() may cause opt_algebraic() to reintroduce ir_triop_lrp expressions that were lowered, effectively defeating the point. Because of this, nobody uses it. v2: Delete more code (caught by Ian Romanick). Cc: "10.1" <mesa-stable@lists.freedesktop.org> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Acked-by: Eric Anholt <eric@anholt.net>	2014-02-26 02:16:56 -08:00
Timothy Arceri	376a98d345	glsl: removed unused dimension_count variable This variable is no longer needed after the cleanup to the code prior to the first arrays of array series Signed-off-by: Timothy Arceri <t_arceri@yahoo.com.au> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-26 08:31:25 +11:00
Dave Airlie	122c3b9486	glsl/i965: move lower_offset_array up to GLSL compiler level. This lowering pass will be useful for gallium drivers as well, in order to support the GL TG4 oddity that is textureGatherOffsets. Reviewed-by: Chris Forbes <chrisf@ijw.co.nz> Signed-off-by: Dave Airlie <airlied@redhat.com>	2014-02-25 13:28:57 +10:00
Eric Anholt	1e3bd9f9a5	glsl: Add a file argument to the IR printer. While we want to be able to print to stdout for glsl_compiler, for debugging drivers we want to be able to dump to stderr because that's where other driver debug (like LIBGL_DEBUG) tends to go, and because some apps actually close stdout to shut up their own messages (such as the X Server, or NWN). Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-22 19:23:21 -08:00
Ian Romanick	4d14b190bb	glsl/sso: Add parser and AST-to-HIR support for separate shader object layouts GL_ARB_separate_shader_objects adds the ability to specify location layouts for interstage inputs and outputs. In addition, this extension makes 'in' and 'out' generally available for shader inputs and outputs. This mimics the behavior of GL_ARB_explicit_attrib_location. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-02-21 15:41:02 -08:00
Ian Romanick	f3b184590f	mesa/sso: Add extension tracking for ARB_separate_shader_objects This adds the necessary bits for both the API and the GLSL compiler. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-02-21 15:41:02 -08:00
Jordan Justen	d099019935	glsl: add gl_InvocationID variable for ARB_gpu_shader5 v2: * Make gl_InvocationID a system value Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Paul Berry <stereotype441@gmail.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2014-02-20 10:33:09 -08:00
Jordan Justen	313402048f	glsl/linker: produce gl_shader_program Geom.Invocations Grab the parsed invocation count, check for consistency during linking, and finally save the result in gl_shader_program Geom.Invocations. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Paul Berry <stereotype441@gmail.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2014-02-20 10:33:08 -08:00
Jordan Justen	02dc74fbd7	glsl: parse invocations layout qualifier for ARB_gpu_shader5 _mesa_glsl_parse_state in_qualifier->invocations will store the invocations count. v3: * Use in_qualifier to allow the primitive to be specified separately from the invocations count (merge_qualifiers) Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2014-02-20 10:33:08 -08:00
Jordan Justen	738c9c3c54	glsl: Generate error for invalid input layout declarations Fixes various piglit tests: spec/glsl-1.50/compiler/incorrect-in-layout-qualifier-*.geom Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2014-02-20 10:33:08 -08:00
Jordan Justen	0c558f9ee6	glsl: convert GS input primitive to use ast_type_qualifier We introduce a new merge_in_qualifier ast_type_qualifier which allows specialized handling of merging input layout qualifiers. By merging layout qualifiers into state->in_qualifier, we allow multiple input qualifiers. For example, the primitive type can be specified separately from the invocations count (ARB_gpu_shader5). state->gs_input_prim_type is moved into state->in_qualifier->prim_type state->gs_input_prim_type_specified is still processed separately so we can determine when the input primitive is specified. This is important since certain scenarios are not supported until after the primitive type has been specified in the shader code. v4: * Merge with compute shader input layout qualifiers Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2014-02-20 10:33:08 -08:00
Ian Romanick	7700c73cf4	glsl: Silence "type qualifiers ignored on function return type" warning The const in const unsigned foo(void); is meaningless. Removing it silences this warning: src/glsl/ast_to_hir.cpp:1802:56: warning: type qualifiers ignored on function return type [-Wignored-qualifiers] Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2014-02-19 15:08:50 -08:00
Ian Romanick	2c85fd5a96	glsl: Only warn for macro names containing __ From page 14 (page 20 of the PDF) of the GLSL 1.10 spec: "In addition, all identifiers containing two consecutive underscores (__) are reserved as possible future keywords." The intention is that names containing __ are reserved for internal use by the implementation, and names prefixed with GL_ are reserved for use by Khronos. Names simply containing __ are dangerous to use, but should be allowed. Per the Khronos bug mentioned below, a future version of the GLSL specification will clarify this. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Cc: "9.2 10.0 10.1" <mesa-stable@lists.freedesktop.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Tested-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Tested-by: Darius Spitznagel <d.spitznagel@goodbytez.de> Cc: Tapani Pälli <lemody@gmail.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=71870 Bugzilla: Khronos #11702	2014-02-19 15:08:50 -08:00
Ian Romanick	0bd7892630	glcpp: Only warn for macro names containing __ Section 3.3 (Preprocessor) of the GLSL 1.30 spec (and later) and the GLSL ES spec (all versions) say: "All macro names containing two consecutive underscores ( __ ) are reserved for future use as predefined macro names. All macro names prefixed with "GL_" ("GL" followed by a single underscore) are also reserved." The intention is that names containing __ are reserved for internal use by the implementation, and names prefixed with GL_ are reserved for use by Khronos. Since every extension adds a name prefixed with GL_ (i.e., the name of the extension), that should be an error. Names simply containing __ are dangerous to use, but should be allowed. In similar cases, the C++ preprocessor specification says, "no diagnostic is required." Per the Khronos bug mentioned below, a future version of the GLSL specification will clarify this. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Cc: "9.2 10.0 10.1" <mesa-stable@lists.freedesktop.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Tested-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Tested-by: Darius Spitznagel <d.spitznagel@goodbytez.de> Cc: Tapani Pälli <lemody@gmail.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=71870 Bugzilla: Khronos #11702	2014-02-19 15:08:50 -08:00
Anuj Phogat	03597cf802	glsl: Fix condition to generate shader link error GL_ARB_ES2_compatibility doesn't say anything about shader linking when one of the shaders (vertex or fragment shader) is absent. So, the extension shouldn't change the behavior specified in the GLSL specification. Tested the behavior on the proprietary Linux drivers of NVIDIA and AMD. Both of them allow linking a version 100 shader program in an OpenGL context when one of the shaders is absent. Makes the following Khronos CTS tests pass: successfulcompilevert_linkprogram.test successfulcompilefrag_linkprogram.test Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-02-18 11:07:09 -08:00
Brian Paul	248606a5f0	glsl: rename _restrict to restrict_flag To fix MSVC compile breakage. Evidently, _restrict is an MSVC keyword, though the docs only mention __restrict (with two underscores). Note: we may want to also rename _volatile to volatile_flag to be consistent. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=74900 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-02-12 13:37:09 -07:00
Francisco Jerez	212122543b	glsl/linker: Propagate image uniform access qualifiers to the driver. Reviewed-by: Paul Berry <stereotype441@gmail.com>	2014-02-12 18:44:06 +01:00
Francisco Jerez	c318a677dd	glsl/linker: Assign image uniform indices. Reviewed-by: Paul Berry <stereotype441@gmail.com>	2014-02-12 18:44:06 +01:00
Francisco Jerez	e51158f2e7	glsl/linker: Count and check image resources. v2: Add comment about the reason why image variables take up space from the default uniform block. Reviewed-by: Paul Berry <stereotype441@gmail.com>	2014-02-12 18:44:06 +01:00
Francisco Jerez	e8dbe430aa	glsl: Add image built-in function generator. Because of the combinatorial explosion of different image built-ins with different image dimensionalities and base data types, enumerating all the 242 possibilities would be annoying and a waste of .text space. Instead use a special path in the built-in builder that loops over all the known image types. v2: Generate built-ins on GLSL version 4.20 too. Rename '_has_float_data_type' to '_supports_float_data_type'. Avoid duplicating enumeration of image built-ins in create_intrinsics() and create_builtins(). v3: Use a more orthodox approach for passing image built-in generator parameters. v4: Cosmetic changes. Acked-by: Paul Berry <stereotype441@gmail.com>	2014-02-12 18:44:06 +01:00
Francisco Jerez	87acc7c650	glsl: Add built-in constants for ARB_shader_image_load_store. v2: Add them on GLSL version 4.20 too. Reviewed-by: Paul Berry <stereotype441@gmail.com>	2014-02-12 18:44:05 +01:00
Francisco Jerez	6057300ec6	glcpp: Add built-in define for ARB_shader_image_load_store. Reviewed-by: Paul Berry <stereotype441@gmail.com>	2014-02-12 18:44:05 +01:00
Francisco Jerez	60c89f8bff	glsl: Add built-in types defined by ARB_shader_image_load_store. Reviewed-by: Paul Berry <stereotype441@gmail.com>	2014-02-12 18:44:05 +01:00
Francisco Jerez	7af167d2be	glsl/ast: Generalize some sampler variable restrictions to all opaque types. No opaque types may be statically initialized in the shader, all opaque variables must be declared uniform or be part of an "in" function parameter declaration, no opaque types may be used as the return type of a function. v2: Add explicit check for opaque types in interface blocks. Check for opaque types in ir_dereference::is_lvalue(). Reviewed-by: Paul Berry <stereotype441@gmail.com>	2014-02-12 18:44:05 +01:00
Francisco Jerez	2158749e52	glsl/ast: Forbid declaration of image variables in structures and uniform blocks. Aggregating images inside uniform blocks is explicitly disallowed by the standard, aggregating them inside structures is not (as of GL 4.4), but there is a similar problem as with atomic counters: image uniform declarations require either a "writeonly" memory qualifier or an explicit format qualifier, which are explicitly forbidden in structure member declarations. In the resolution of Khronos bug #10903 the same wording applied to atomic counters was decided to mean that they're not allowed inside structures -- Rejecting image member declarations within structures seems the most reasonable option for now. Reviewed-by: Paul Berry <stereotype441@gmail.com>	2014-02-12 18:44:05 +01:00
Francisco Jerez	6b28528d1c	glsl/ast: Make sure that image argument qualifiers match the function prototype. Reviewed-by: Paul Berry <stereotype441@gmail.com>	2014-02-12 18:44:05 +01:00
Francisco Jerez	81c167ef1c	glsl/ast: Verify that function calls don't discard image format qualifiers. Reviewed-by: Paul Berry <stereotype441@gmail.com>	2014-02-12 18:44:05 +01:00
Francisco Jerez	94a95e03d9	glsl/ast: Validate and apply memory qualifiers to image variables. Reviewed-by: Paul Berry <stereotype441@gmail.com>	2014-02-12 18:44:05 +01:00
Francisco Jerez	910311c4a6	glsl/parser: Handle image built-in types. Reviewed-by: Paul Berry <stereotype441@gmail.com>	2014-02-12 18:44:05 +01:00
Francisco Jerez	f9cf61df3b	glsl/parser: Handle image memory qualifiers. v2: Make the "map" array static const. Reviewed-by: Paul Berry <stereotype441@gmail.com>	2014-02-12 18:44:05 +01:00
Francisco Jerez	fcd869ed56	glsl/parser: Handle the early_fragment_tests input layout qualifier. v2: Only allow the early_fragment_tests qualifier in fragment shaders. Reviewed-by: Paul Berry <stereotype441@gmail.com>	2014-02-12 18:44:05 +01:00
Francisco Jerez	b0b26faa25	glsl/lexer: Add new tokens for ARB_shader_image_load_store. Reviewed-by: Paul Berry <stereotype441@gmail.com>	2014-02-12 18:44:05 +01:00
Francisco Jerez	299e869d25	glsl/ast: Keep track of type qualifiers defined by ARB_shader_image_load_store. v2: Add comment next to the read_only and write_only qualifier flags. Change temporary copies of the type qualifier mask to use uint64_t too. Reviewed-by: Paul Berry <stereotype441@gmail.com>	2014-02-12 18:44:05 +01:00
Francisco Jerez	c116541b2c	glsl: Add gl_uniform_storage fields to keep track of image uniform indices. v2: Promote anonymous struct into named struct. Reviewed-by: Paul Berry <stereotype441@gmail.com>	2014-02-12 18:44:05 +01:00
Francisco Jerez	bb13691d1c	glsl: Add image memory and layout qualifiers to ir_variable. v2: Add comment next to the read_only and write_only qualifier flags. Reviewed-by: Paul Berry <stereotype441@gmail.com>	2014-02-12 18:44:04 +01:00
Francisco Jerez	107d03a6d5	glsl: Add helper methods to glsl_type for dealing with images. Add predicates to query if a GLSL type is or contains an image. Rename sampler_coordinate_components() to coordinate_components(). v2: Use assert instead of unreachable. v3: No need to use a separate code-path for images in coordinate_components() after merging image and sampler fields in the glsl_type structure. Reviewed-by: Paul Berry <stereotype441@gmail.com>	2014-02-12 18:43:37 +01:00
Francisco Jerez	8a2508ee07	glsl: Add image type to the GLSL IR. v2: Reuse the glsl_sampler_dim enum for images. Reuse the glsl_type::sampler_* fields instead of creating new ones specific to image types. Reuse the same constructor as for samplers adding a new 'base_type' argument. Reviewed-by: Paul Berry <stereotype441@gmail.com>	2014-02-12 18:39:48 +01:00
Francisco Jerez	9e611fc72d	glsl: Add ARB_shader_image_load_store extension enables. Reviewed-by: Paul Berry <stereotype441@gmail.com>	2014-02-12 18:39:48 +01:00
Matt Turner	025d99ce3c	glsl: Do not vectorize vector array dereferences. Array dereferences must have scalar indices, so we cannot vectorize them. Cc: "10.1" <mesa-stable@lists.freedesktop.org> Reported-by: Andrew Guertin <lists@dolphinling.net> Tested-by: Andrew Guertin <lists@dolphinling.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-02-11 16:05:55 -08:00
Daniel Kurtz	b47d231526	glsl: Add locking to builtin_builder singleton Consider a multithreaded program with two contexts A and B, and the following scenario: 1. Context A calls initialize(), which allocates mem_ctx and starts building built-ins. 2. Context B calls initialize(), which sees mem_ctx != NULL and assumes everything is already set up. It returns. 3. Context B calls find(), which fails to find the built-in since it hasn't been created yet. 4. Context A finally finishes initializing the built-ins. This will break at step 3. Adding a lock ensures that subsequent callers of initialize() will wait until initialization is actually complete. Similarly, if any thread calls release while another thread is still initializing, or calling find(), the mem_ctx/shader would get freed from under it, leading to corruption or use-after-free crashes. Fixes sporadic failures in Piglit's glx-multithread-shader-compile. Bugzilla: https://bugs.freedesktop.org/69200 Signed-off-by: Daniel Kurtz <djkurtz@chromium.org> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "10.1 10.0" <mesa-stable@lists.freedesktop.org>	2014-02-11 02:21:41 -08:00
Eric Anholt	1e12dafcac	glsl: Optimize triop_csel with all-true or all-false. Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-07 12:46:48 -08:00
Eric Anholt	de796b0ef0	glsl: Optimize various cases of fma (aka MAD). Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-07 12:46:48 -08:00
Eric Anholt	44577c4857	glsl: Optimize lrp(x, x, coefficient) --> x. total instructions in shared programs: 1627754 -> 1624534 (-0.20%) instructions in affected programs: 45748 -> 42528 (-7.04%) GAINED: 3 LOST: 0 (serious sam, humus domino demo) Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-07 12:46:48 -08:00
Eric Anholt	d72956790f	glsl: Optimize pow(x, 1) -> x. total instructions in shared programs: 1627826 -> 1627754 (-0.00%) instructions in affected programs: 6640 -> 6568 (-1.08%) GAINED: 0 LOST: 0 (HoN and savage2) Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-07 12:46:48 -08:00
Eric Anholt	6d7c123d6c	glsl: Optimize log(exp(x)) and exp(log(x)) into x. Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-07 12:46:47 -08:00
Eric Anholt	2c2aa35336	glsl: Optimize ~~x into x. v2: Fix pasteo of an extra abs being inserted (caught by many). Rewrite to drop the silly switch statement. Reviewed-by: Matt Turner <mattst88@gmail.com> (v1)	2014-02-07 12:46:47 -08:00
Kenneth Graunke	2062f40d81	glsl: Don't lose precision qualifiers when encountering "centroid". Mesa fails to retain the precision qualifier when parsing: #version 300 es centroid in mediump vec2 v; Consider how the parser's type_qualifier production is applied. First, the precision_qualifier rule creates a new ast_type_qualifier: <precision: mediump> Then the storage_qualifier rule creates a second one: <flags: in> and calls merge_qualifier() to fold in any previous qualifications, returning: <flags: in, precision: mediump> Finally, the auxiliary_storage_qualifier creates one for "centroid": <flags: centroid> it then does $$ = $1 and $$.flags |= $2.flags, resulting in: <flags: centroid, in> Since precision isn't stored in the flags bitfield, it is lost. We need to instead call merge_qualifier to combine all the fields. Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reported-by: Kevin Rogovin <kevin.rogovin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-02-07 12:36:38 -08:00
Juha-Pekka Heikkila	88cad8356e	glsl: Fix null access on file read error Signed-off-by: Juha-Pekka Heikkila <juhapekka.heikkila@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2014-02-07 08:14:04 -07:00
Matt Turner	e2ef93cf94	glsl: Initialize ubo_binding_mask flags to zero. Missed in commit `e63bb298`. Caused sporadic test failures, like incorrect-in-layout-qualifier-repeated-prim.geom. Cc: "10.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-02-06 10:36:54 -08:00
Paul Berry	1fe274b3d7	glsl/cs: Prohibit mixing of compute and non-compute shaders. Fixes piglit test: spec/ARB_compute_shader/linker/mix_compute_and_non_compute Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-02-05 09:05:01 -08:00
Paul Berry	5a79bdab30	glsl/cs: Prohibit user-defined ins/outs in compute shaders. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-02-05 09:04:58 -08:00
Paul Berry	28ce604b7f	mesa/cs: Handle compute shader local size during linking. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-02-05 09:04:20 -08:00
Paul Berry	0fa74e848f	glsl/cs: Handle compute shader local_size_{x,y,z} declaration. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-02-05 09:03:44 -08:00
Paul Berry	0398b69954	mesa/cs: Implement MAX_COMPUTE_WORK_GROUP_COUNT constant. v2: Document that the 3-element array MaxComputeWorkGroupCount is indexed by dimension. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-02-05 09:03:08 -08:00
Paul Berry	c85c50997f	mesa/cs: Implement MAX_COMPUTE_WORK_GROUP_INVOCATIONS constant. Reviewed-by: Matt Turner <mattst88@gmail.com> v2: Use CONTEXT_INT rather than CONTEXT_ENUM. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-02-05 09:02:30 -08:00
Paul Berry	347dde82e6	mesa/cs: Implement MAX_COMPUTE_WORK_GROUP_SIZE constant. v2: Document that the 3-element array MaxComputeWorkGroupSize is indexed by dimension. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-02-05 09:01:54 -08:00
Paul Berry	c15064c169	glsl/cs: update main.cpp to use the ".comp" extension for compute shaders. Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-05 09:01:13 -08:00
Paul Berry	d861c2963a	glsl/cs: Populate default values for ctx->Const.Program[MESA_SHADER_COMPUTE]. Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-05 09:01:10 -08:00
Paul Berry	c61ec8d8e3	mesa/cs: Add a MESA_SHADER_COMPUTE stage and update switch statements. This patch adds MESA_SHADER_COMPUTE to the gl_shader_stage enum. Also, where it is trivial to do so, it adds a compute shader case to switch statements that switch based on the type of shader. This avoids "unhandled switch case" compiler warnings. Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-05 09:00:34 -08:00
Paul Berry	28e526d558	glsl/cs: Change some linker loops to use MESA_SHADER_FRAGMENT as a bound. Linker loops that iterate through all the stages in the pipeline need to use MESA_SHADER_FRAGMENT as a bound, so that we can add an additional MESA_SHADER_COMPUTE stage, without it being erroneously included in the pipeline. Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-05 09:00:31 -08:00
Paul Berry	b7d05a58ae	mesa/cs: Add extension enable flags for ARB_compute_shader. Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-05 08:59:37 -08:00
Paul Berry	7f5740899f	glsl: Fix continue statements in do-while loops. From the GLSL 4.40 spec, section 6.4 (Jumps): The continue jump is used only in loops. It skips the remainder of the body of the inner most loop of which it is inside. For while and do-while loops, this jump is to the next evaluation of the loop condition-expression from which the loop continues as previously defined. Previously, we incorrectly treated a "continue" statement as jumping to the top of a do-while loop. This patch fixes the problem by replicating the loop condition when converting the "continue" statement to IR. (We already do a similar thing in "for" loops, to ensure that "continue" causes the loop expression to be executed). Fixes piglit tests: - glsl-fs-continue-inside-do-while.shader_test - glsl-vs-continue-inside-do-while.shader_test - glsl-fs-continue-in-switch-in-do-while.shader_test - glsl-vs-continue-in-switch-in-do-while.shader_test Cc: mesa-stable@lists.freedesktop.org Acked-by: Carl Worth <cworth@cworth.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-04 09:06:09 -08:00
Paul Berry	56790856b3	glsl: Make condition_to_hir() callable from outside ast_iteration_statement. In addition to making it public, we also need to change its first argument from an ir_loop * to an exec_list *, so that it can be used to insert the condition anywhere in the IR (rather than just in the body of the loop). This will be necessary in order to make continue statements work properly in do-while loops. Cc: mesa-stable@lists.freedesktop.org Acked-by: Carl Worth <cworth@cworth.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-04 09:06:09 -08:00
Matt Turner	606544214e	glsl: Expand non-expr & non-swizzle scalar rvalues in vectorizing.	2014-01-31 10:21:50 -08:00
Matt Turner	3f49a8c9a5	glcpp: Reject #version after the version has been resolved. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=74166 Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Carl Worth <cworth@cworth.org>	2014-01-31 10:21:50 -08:00
Carl Worth	9d4a6bd6bb	glcpp: Rename the variable used to enable debugging. The -p option we now use when calling bison means that this variable will be named glcpp_parser_debug not yydebug. This was not caught when the -p option was added because this variable isn't used in the code as committed. (I prefer the declaration to remain since it allows a developer to easily find this variable name to enable debugging.)	2014-01-31 10:02:58 -08:00
Carl Worth	2dc93bd5d1	glcpp: Add "make check" test for comment-parsing bug This is the innocent-looking but killer test case to verify the bug fixed in the preceding commit. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-01-31 10:02:54 -08:00
Carl Worth	71978cf66f	glcpp: Don't enter lexer's NEWLINE_CATCHUP start state for single-line comments In commit `6005e9cb28` a new start state of NEWLINE_CATCHUP was added to the lexer. This start state is used whenever the lexer is emitting a NEWLINE token to emit additional NEWLINE tokens for any newline characters that were skipped by an immediately preceding multi-line comment. However, that commit erroneously entered the NEWLINE_CATCHUP state for single-line comments. This is not desired since in the case of a single-line comment, the lexer is not emitting any NEWLINE token. The result is that the lexer will remain in the NEWLINE_CATCHUP state and proceed to fail to emit a NEWLINE token for the subsequent newline character, (since the case to match \n expects only the INITIAL start state). The fix is quite simple, remove the "BEGIN NEWLINE_CATCHUP" code from the single-line comment case, (preserving it only in exactly the cases where the lexer is actually emitting a NEWLINE token). Many thanks to Petri Latvala for reporting this bug and for providing the minimal test case to exercise it. The bug showed up only with a multi-line comment which was followed immediately by a single-line comment (without any intervening newline), such as: /* */ // Kablam! Since `6005e9cb28`, and before this commit, that very innocent-looking combination of comments would yield a parse failure in the compiler. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=72686 Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2014-01-31 10:02:36 -08:00
Emil Velikov	4c35e32594	glsl: s/_NDEBUG/NDEBUG/ The former symbol is never defined within mesa. Based on the code it seems that the original intent was to use NDEBUG. Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-01-29 19:52:35 +00:00
Matt Turner	37f1903e00	glsl: Avoid combining statements from different basic blocks. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=74113 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-01-27 21:15:35 -08:00
Matt Turner	8e2b8bd0e6	glsl: Set proper swizzle when a channel is missing in vectorizing. Previously, for example if the x channel was missing from a series of assignments we were attempting to vectorize, the wrong swizzle mask would be applied. a.y = b.y; a.z = b.z; a.w = b.w; would be incorrectly transformed into a.yzw = b.xyz; Fixes two transform feedback tests in the ES3 conformance suite. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=73978 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=73954 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-01-27 21:15:35 -08:00
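
The swizzle problem described in commit 8e2b8bd0e6 can be sketched in a few lines. This is an illustration only, not Mesa's vectorizer code: the helper names `swizzle_for_write_mask` and `naive_swizzle` are hypothetical, the write mask is assumed to be a 4-bit value (bit 0 = x through bit 3 = w), and each scalar assignment is assumed to read the same channel it writes, as in the `a.y = b.y; a.z = b.z; a.w = b.w;` example.

```cpp
#include <cassert>
#include <string>

// Hypothetical helper (not Mesa's actual code): derive the right-hand-side
// swizzle from the channels that are actually written.
// write_mask bits: 0 = x, 1 = y, 2 = z, 3 = w.
std::string swizzle_for_write_mask(unsigned write_mask)
{
    assert(write_mask != 0 && write_mask <= 0xFu);
    static const char channels[] = "xyzw";
    std::string swizzle;
    for (unsigned i = 0; i < 4; ++i) {
        // Emit channel i only if the combined assignment writes it.
        if (write_mask & (1u << i))
            swizzle += channels[i];
    }
    return swizzle;
}

// Hypothetical model of the faulty behaviour: take the first N channels
// regardless of which channels are written.
std::string naive_swizzle(unsigned write_mask)
{
    assert(write_mask != 0 && write_mask <= 0xFu);
    static const char channels[] = "xyzw";
    std::string swizzle;
    unsigned next = 0;
    for (unsigned i = 0; i < 4; ++i) {
        if (write_mask & (1u << i))
            swizzle += channels[next++];
    }
    return swizzle;
}
```

With the x channel missing (write mask `0xE`), the first helper yields `yzw`, matching `a.yzw = b.yzw`, while the naive version yields `xyz`, which corresponds to the incorrect `a.yzw = b.xyz` transformation quoted in the commit message.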

2876 Commits