mirrors/mesa - Frog Git

Commit Graph

Author	SHA1	Message	Date
Alexander von Gluck IV	255a225265	state_trackers/hgl: Fix missing include path	2015-09-28 13:58:24 -04:00
Francisco Jerez	b61292296b	i965/fs: Fix hang on IVB and VLV with image format mismatch. IVB and VLV hang sporadically when an untyped surface read or write message is used to access a surface of format other than RAW, as may happen when there is a mismatch between the format qualifier of the image uniform and the format of the actual image bound to the pipeline. According to the spec this condition gives undefined results but may not lead to program termination (which is one of the possible outcomes of the hang). Fix it by checking at runtime whether the surface is of the right type. Fixes the "arb_shader_image_load_store.invalid/format mismatch" piglit subtest. Reported-by: Mark Janes <mark.a.janes@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=91718 CC: mesa-stable@lists.freedesktop.org Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2015-09-28 18:10:39 +03:00
Serge Martin	2518645f63	clover: Implement clCreateImage?D w/ clCreateImage. Remplace clCreateImage2D and clCreateImage3D implementation with call to clCreateImage. Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2015-09-28 18:10:39 +03:00
Serge Martin	f2c52e392b	clover: Implement CL1.2 clCreateImage(). Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2015-09-28 18:10:39 +03:00
Francisco Jerez	92666b90c0	clover: Move down canonicalization of memory object flags into validate_flags(). This will be used to share the same logic between buffer and image creation. v2: Make memory flag set constants local to validate_flags. (Serge Martin)	2015-09-28 18:10:39 +03:00
Iago Toral Quiroga	e7ae6d9e14	glsl: revert "glsl: atomic counters can be declared as buffer-qualified variables" This reverts commit `586142658e`. The specs are not explicit about any restrictions related to the types allowed on buffer variables, however, the description of opaque types (like atomic counters) is in conclict with the purpose of buffer variables: "The opaque types declare variables that are effectively opaque handles to other objects. These objects are accessed through built-in functions, not through direct reading or writing of the declared variable. (...) Opaque variables cannot be treated as l-values;(...)" Also, Mesa is already disallowing opaque types in interface blocks anyway, so that commit was not really achieving anything. Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2015-09-28 14:23:26 +02:00
Ilia Mirkin	5bff12ecb4	gallium/util: avoid unreferencing random memory on buffer alloc failure Found by Coverity Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Albert Freeman <albertwdfreeman@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2015-09-28 02:38:58 -04:00
Ilia Mirkin	6dd059fefe	mesa: don't leak interface_name Found by Coverity Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Timothy Arceri <t_arceri@yahoo.com.au>	2015-09-28 02:38:58 -04:00
Timothy Arceri	e413d2fbc4	glsl: fix component size calculation for tessellation and geom shaders Broken in commit `abdab88b30` when adding arrays of arrays support Reviewed-by: Dave Airlie <airlied@redhat.com>	2015-09-28 11:31:50 +10:00
Kenneth Graunke	d6a41b5f70	i965/gs: Optimize away the EOT write on Gen8+ with static vertex count. With static vertex counts, the final EOT write doesn't actually write any data - it's just there to end the thread. Typically, the last thing before ending the thread will be an EmitVertex() call, resulting in a URB write. We can just set EOT on that. Note that this isn't always possible - there might be an intervening SSBO write/image store, or the URB write may have been in a loop. shader-db statistics for geometry shaders only: total instructions in shared programs: 3173 -> 3149 (-0.76%) instructions in affected programs: 176 -> 152 (-13.64%) helped: 8 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2015-09-26 12:02:34 -07:00
Kenneth Graunke	08fe5799e6	i965/gs: Allow src0 immediates in GS_OPCODE_SET_WRITE_OFFSET. GS_OPCODE_SET_WRITE_OFFSET is a MUL with a constant src[1] and special strides. We can easily make the generator handle constant src[0] arguments by instead generating a MOV with the product of both operands. This isn't necessarily a win in and of itself - instead of a MUL, we generate a MOV, which should be basically the same cost. However, we can probably avoid the earlier MOV to put src[0] into a register. shader-db statistics for geometry shaders only: total instructions in shared programs: 3207 -> 3173 (-1.06%) instructions in affected programs: 3207 -> 3173 (-1.06%) helped: 11 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2015-09-26 12:02:31 -07:00
Kenneth Graunke	f0a618ee7c	i965: Implement "Static Vertex Count" geometry shader optimization. Broadwell's 3DSTATE_GS contains new "Static Output" and "Static Vertex Count" fields, which control a new optimization. Normally, geometry shaders can output arbitrary numbers of vertices, which means that resource allocation has to be done on the fly. However, if the number of vertices is statically known, the hardware can pre-allocate resources up front, which is more efficient. Thanks to the new NIR GS intrinsics, this is easy. We just call the function introduced in the previous commit to get the vertex count. If it obtains a count, we stop emitting the extra 32-bit "Vertex Count" field in the VUE, and instead fill out the 3DSTATE_GS fields. Improves performance of Gl32GSCloth by 5.16347% +/- 0.12611% (n=91) on my Lenovo X250 laptop (Broadwell GT2) at 1024x768. shader-db statistics for geometry shaders only: total instructions in shared programs: 3227 -> 3207 (-0.62%) instructions in affected programs: 242 -> 222 (-8.26%) helped: 10 v2: Don't break non-NIR paths (just skip this optimization). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2015-09-26 12:01:58 -07:00
Kenneth Graunke	bcef2abad7	i965: Move GS_THREAD_END mlen calculations out of the generator. The visitor was setting a mlen that was wrong for Broadwell, but the generator was ignoring it and doing the right thing regardless. We may as well move the logic fully into the visitor. This will be useful in the next commit as well. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2015-09-26 12:01:57 -07:00
Kenneth Graunke	02530c5dc5	nir: Add a function to count the number of vertices a GS emits. Some hardware (such as Broadwell) can run geometry shaders more efficiently when the number of vertices emitted is statically known. This pass provides a way to obtain the constant vertex count, or -1 indicating that the vertex count is unknown/non-constant. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2015-09-26 12:01:53 -07:00
Kenneth Graunke	df221f65e2	i965: Simplify handling of VUE map changes. The old code was disasterously complex - spread across multiple atoms which may not even run, inspecting the dirty bits to try and decide whether it was necessary to do checks...storing VS information in brw_context...extra flagging... This code tripped me and Carl up very badly when working on the shader cache code. It's very fragile and hard to maintain. Now that geometry shaders only depend on their inputs and don't have to worry about the VS VUE map, we can dramatically simplify this: just compute the VUE map coming out of the geometry shader stage in brw_upload_programs. If it changes, flag it. Done. v2: Also check vue_map.separable. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2015-09-26 11:59:56 -07:00
Kenneth Graunke	6301af22bb	i965/gs: Remove the dependency on the VS VUE map. Because we only support geometry shaders in core profile, we can safely ignore any driver-extending of VS outputs. Those are: - Legacy userclipping (doesn't exist in core profile) - Edgeflag copying (Gen4-5 only, no GS support) - Point coord replacement (Gen4-5 only, no GS support) - front/back color hacks (Gen4-5 only, no GS support) v2: Rebase; leave a comment about why SSO works. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2015-09-26 11:59:56 -07:00
Kenneth Graunke	99df02ca26	i965: Don't re-layout varyings for separate shader programs. Previously, our VUE map code always assigned slots to varyings sequentially, in one contiguous block. This was a bad fit for separate shaders - the GS input layout depended or the VS output layout, so if we swapped out vertex shaders, we might have to recompile the GS on the fly - which rather defeats the point of using separate shader objects. (Tessellation would suffer from this as well - we could have to recompile the HS, DS, and GS.) Instead, this patch makes the VUE map for separate shaders use a fixed layout, based on the input/output variable's location field. (This is either specified by layout(location = ...) or assigned by the linker.) Corresponding inputs/outputs will match up by location; if there's a mismatch, we're allowed to have undefined behavior. This may be less efficient - depending what locations were chosen, we may have empty padding slots in the VUE. But applications presumably use small consecutive integers for locations, so it hopefully won't be much worse in practice. 3% of Dota 2 Reborn shaders are hurt, but only by 2 instructions. This seems like a small price to pay for avoiding recompiles. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2015-09-26 11:59:56 -07:00
Kenneth Graunke	1e5180316c	i965/vue: Make assign_vue_map() take an explicit slot. Our plan of assigning consecutive slots doesn't work properly for separate shader objects - at least, if we want to avoid recompiling them whenever the interface changes. As a first step, make assign_vue_map take an explicit slot parameter, rather than implicitly incrementing it. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2015-09-26 11:59:56 -07:00
Kenneth Graunke	268008f98c	i965: Initialize unused VUE map slots to BRW_VARYING_SLOT_PAD. Nothing actually relies on unused slots being initialized to BRW_VARYING_SLOT_COUNT. Soon, we're going to have VUE maps with holes in them, at which point pre-filling with BRW_VARYING_SLOT_PAD make a lot more sense. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2015-09-26 11:59:56 -07:00
Kenneth Graunke	39d4b553a8	i965: Fix BRW_VARYING_SLOT_PAD handling in the scalar VS backend. We can't just break for padding slots. Instead, treat them like unwritten output variables, so we handle flushing and incrementing urb_offset correctly. Paul introduced the concept of padding slots back in 2011, but we've never actually used them for anything. So it's unsurprising that the scalar VS backend didn't handle them quite right. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2015-09-26 11:59:56 -07:00
Samuel Iglesias Gonsalvez	511a86383b	main/tests: Enable glShaderStorageBlockBinding() check in dispatch_sanity test Signed-off-by: Samuel Iglesias Gonsalvez <siglesias@igalia.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2015-09-26 16:54:02 +02:00
Timothy Arceri	abdab88b30	glsl: calculate component size for arrays of arrays when varying packing disabled Signed-off-by: Timothy Arceri <t_arceri@yahoo.com.au> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2015-09-26 22:48:49 +10:00
Timothy Arceri	1d401f9ce4	glsl: validate binding qualifier for AoA Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2015-09-26 22:28:05 +10:00
Timothy Arceri	9bad7afbc2	glsl: add helper for calculating size of AoA V2: return 0 if not array rather than -1 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2015-09-26 22:27:47 +10:00
Timothy Arceri	776a3845d6	glsl: clean-up link uniform code These changes are also needed to allow linking of struct and interface arrays of arrays. Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2015-09-26 22:27:24 +10:00
Marek Olšák	9932142192	radeonsi: add scratch buffer to the buffer list when it's re-allocated Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Cc: mesa-stable@lists.freedesktop.org	2015-09-26 01:51:05 +02:00
Leo Liu	1e97b41893	radeon/vce: fix vui time_scale zero error if app pass 0 as frame_rate_num, it should not be encoded to the VUI. Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Cc: "10.6 11.0" <mesa-stable@lists.freedesktop.org>	2015-09-25 18:47:14 -04:00
Matt Turner	1dd943d7fb	mesa: Add locking to programs. Reviewed-by: Timothy Arceri <t_arceri@yahoo.com.au> Reviewed-by: Brian Paul <brianp@vmware.com>	2015-09-25 14:08:31 -07:00
Matt Turner	3c57a102eb	mesa: Add locking to sampler objects. Reviewed-by: Timothy Arceri <t_arceri@yahoo.com.au> Reviewed-by: Brian Paul <brianp@vmware.com>	2015-09-25 14:08:31 -07:00
Matt Turner	d4b0e0b717	mesa: Remove debugging code from _mesa_reference_*. Reviewed-by: Timothy Arceri <t_arceri@yahoo.com.au> Reviewed-by: Brian Paul <brianp@vmware.com>	2015-09-25 14:08:31 -07:00
Tapani Pälli	266d05a3a0	glsl: fix packed varyings interface type and add default case fixes Piglit test: arb_program_interface_query/linker/query-varyings.shader_test Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-09-25 12:19:36 +03:00
Antia Puentes	e92c35a872	glsl: Mark as active all elements of shared/std140 block arrays Commit `1ca25ab` (glsl: Do not eliminate 'shared' or 'std140' blocks or block members) considered as active 'shared' and 'std140' uniform blocks and uniform block arrays, but did not include the block array elements. Because of that, it was possible to have an active uniform block array without any elements marked as used, making the assertion ((b->num_array_elements > 0) == b->type->is_array()) in link_uniform_blocks() fail. Fixes the following 5 dEQP tests: * dEQP-GLES3.functional.ubo.random.nested_structs_instance_arrays.18 * dEQP-GLES3.functional.ubo.random.nested_structs_instance_arrays.24 * dEQP-GLES3.functional.ubo.random.nested_structs_arrays_instance_arrays.19 * dEQP-GLES3.functional.ubo.random.all_per_block_buffers.49 * dEQP-GLES3.functional.ubo.random.all_shared_buffer.36 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=83508 Tested-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Timothy Arceri <t_arceri@yahoo.com.au> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-25 08:39:23 +02:00
Samuel Iglesias Gonsalvez	614b5307fd	i965: Enable ARB_shader_storage_buffer_object extension for gen7+ Signed-off-by: Samuel Iglesias Gonsalvez <siglesias@igalia.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-25 08:39:23 +02:00
Samuel Iglesias Gonsalvez	5b080e3ddf	mesa: enable ARB_shader_storage_buffer_object extension for GLES 3.1 Signed-off-by: Samuel Iglesias Gonsalvez <siglesias@igalia.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-25 08:39:23 +02:00
Samuel Iglesias Gonsalvez	10b5c6491f	mesa: Add getters for the GL_ARB_shader_storage_buffer_object max constants v2: - Add tessellation shader constants support v3: - Add GLES 3.1 support. v4: - Move the getters to the proper place Signed-off-by: Samuel Iglesias Gonsalvez <siglesias@igalia.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-25 08:39:23 +02:00
Samuel Iglesias Gonsalvez	91191af6d6	glapi: add ARB_shader_storage_block_buffer_object Signed-off-by: Samuel Iglesias Gonsalvez <siglesias@igalia.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-25 08:39:23 +02:00
Samuel Iglesias Gonsalvez	26011fa22a	main/tests: add ARB_shader_storage_buffer_object tokens to enum_strings Signed-off-by: Samuel Iglesias Gonsalvez <siglesias@igalia.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-25 08:39:23 +02:00
Samuel Iglesias Gonsalvez	9b477ad49d	main: Add SHADER_STORAGE_BLOCK and BUFFER_VARIABLE support for ARB_program_interface_query Including TOP_LEVEL_ARRAY_SIZE and TOP_LEVEL_ARRAY_STRIDE queries. v2: - Use std430_array_stride() to get top level array stride following std430's rules. Signed-off-by: Samuel Iglesias Gonsalvez <siglesias@igalia.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-25 08:39:23 +02:00
Iago Toral Quiroga	0f18945cb6	glsl: Do not allow reads from write-only buffer variables The error location won't be right, but fixing that would require to check for this as we process each type of AST node that can involve a variable read. v2: - Limit the check to buffer variables, image variables have different semantics involved. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-25 08:39:23 +02:00
Iago Toral Quiroga	995a719499	glsl: Do not allow assignments to read-only buffer variables v2: - Merge the error check for the readonly qualifier with the already existing check for variables flagged as readonly (Timothy). - Limit the check to buffer variables, image variables have different semantics involved (Curro). Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-25 08:39:23 +02:00
Samuel Iglesias Gonsalvez	6ef82f039c	glsl: Allow memory qualifiers on shader storage buffer blocks v2: - Memory qualifiers on shader storage buffer objects do not come in the form of layout qualifiers, they are block-level qualifiers. Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-25 08:39:23 +02:00
Iago Toral Quiroga	f1b647fdd1	glsl: Apply memory qualifiers to buffer variables v2: - Save memory qualifier info in the top level members of a shader storage block. - Add a checks to record_compare() which is used when comparing shader storage buffer declarations in different shaders. - Always report an error for incompatible readonly/writeonly definitions, whether they are present at block or field level. Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-25 08:39:23 +02:00
Iago Toral Quiroga	f4c8c01a3d	glsl: Allow use of memory qualifiers with ARB_shader_storage_buffer_object. Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-25 08:39:23 +02:00
Samuel Iglesias Gonsalvez	3b2037f88c	glsl: fix UNIFORM_BUFFER_START or UNIFORM_BUFFER_SIZE query when no buffer object is bound According to ARB_uniform_buffer_object spec: "If the parameter (starting offset or size) was not specified when the buffer object was bound (e.g. if bound with BindBufferBase), or if no buffer object is bound to <index>, zero is returned." Signed-off-by: Samuel Iglesias Gonsalvez <siglesias@igalia.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-25 08:39:23 +02:00
Iago Toral Quiroga	2e16dd1350	mesa: Add queries for GL_SHADER_STORAGE_BUFFER These handle querying the buffer name attached to a giving binding point as well as the start offset and size of that buffer. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-25 08:39:23 +02:00
Samuel Iglesias Gonsalvez	4b7b1cf3c0	mesa: add glShaderStorageBlockBinding() Defined in ARB_shader_storage_buffer_object extension. Signed-off-by: Samuel Iglesias Gonsalvez <siglesias@igalia.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-25 08:39:23 +02:00
Iago Toral Quiroga	a07d0c2657	glsl: First argument to atomic functions must be a buffer variable v2: - Add ssbo_in the names of the static functions so it is clear that this is specific to SSBO atomics. v3: - Move the check after the loop (Kristian Høgsberg) Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-25 08:39:23 +02:00
Iago Toral Quiroga	5ef169034c	i965/nir/vec4: Implement nir_intrinsic_ssbo_atomic_* Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-25 08:39:23 +02:00
Iago Toral Quiroga	14af6f4698	i965/nir/fs: Implement nir_intrinsic_ssbo_atomic_* Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-25 08:39:23 +02:00
Iago Toral Quiroga	9d5c0be5d5	nir: Implement lowered SSBO atomic intrinsics The original GLSL IR intrinsics have been lowered to an internal version that accepts a block index and an offset instead of a SSBO reference. v2 (Connor): - Document the sources used by the atomic intrinsics. Reviewed-by: Connor Abbott <connor.w.abbott@intel.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-25 08:39:23 +02:00

1 2 3 4 5 ...

66407 Commits