KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Jordan Justen	1a428a5256	docs: Mark ARB_compute_shader as work in progress Signed-off-by: Jordan Justen <jordan.l.justen@intel.com>	2014-09-01 10:45:37 -07:00
Connor Abbott	d571f2b15d	i965/fs: don't use ir->shadow_comparitor in emit_texture_* Signed-off-by: Connor Abbott <connor.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-09-01 00:55:14 -07:00
Connor Abbott	cbfcb1b069	i965/fs: don't pass ir_variable * to emit_samplepos_setup() We were only using it to get at its type, which we already know because it's a builtin variable. Signed-off-by: Connor Abbott <connor.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-09-01 00:12:15 -07:00
Connor Abbott	ec3d06f591	i965/fs: don't pass ir_variable * to emit_frontfacing_interpolation() We were only using it to get at its type, which we already know because it's a builtin variable. v2 (Ken): Rebase on Matt's optimized gl_FrontFacing calculations. Signed-off-by: Connor Abbott <connor.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-09-01 00:11:16 -07:00
Kenneth Graunke	70691f0c28	i965: Fix GPU hangs when INTEL_DEBUG=no16 is set. The replicated data clear shader needs to be SIMD16, or else the GPU will hang. So, compile it even if INTEL_DEBUG=no16 is set. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-08-31 17:03:31 -07:00
Emil Velikov	88cbe3908f	mesa: fix make tarballs Current method of generating distribution tar-balls involves manually invoking make + target name in the appropriate places. This temporary solution is used until we get 'make dist' working. Currently it does not work, as in order to have the target (which is also a filename) available in the final Makefile we need to add a PHONY target + use the correct target name. Cc: "10.2 10.3" <mesa-stable@lists.freedesktop.org> Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com>	2014-09-01 00:22:20 +01:00
Abdiel Janulgue	5598458e69	i965/vec4: Remove try_emit_saturate Now that saturate is implemented natively as an instruction, we can cut down on unneeded functionality. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:09 +03:00
Abdiel Janulgue	cbd225057a	i965/fs: Refactor try_emit_saturate v3: Since the fs backend can emit saturate as a separate instruction, there is no need to detect for min/max instructions and to rewrite the instruction tree accordingly. On the other hand, we don't need to emit a separate saturated mov either when the expression generating src can do saturate directly. v4: Add can_do_saturate() check before enabling saturate modifer (Ken) Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:09 +03:00
Abdiel Janulgue	b2c0c35907	ir_to_mesa, glsl_to_tgsi: Remove try_emit_saturate Now that saturate is implemented natively as instruction, we can cut down on unneeded functionality. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:09 +03:00
Abdiel Janulgue	7841a246b9	i965/vec4: Allow propagation of instructions with saturate flag to sel When sel conditon is bounded within 0 and 1.0. This allows code as: mov.sat a b sel.ge dst a 0.25F To be propagated as: sel.ge.sat dst b 0.25F v3: - Syntax clarifications in inst->saturate assignment - Remove extra parenthesis when assigning src_reg value from copy_entry (Matt Turner) v4: - Take channels into consideration when propagating saturated instructions. Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:09 +03:00
Abdiel Janulgue	40aeb558ce	i965/fs: Allow propagation of instructions with saturate flag to sel When sel conditon is bounded within 0 and 1.0. This allows code as: mov.sat a b sel.ge dst a 0.25F To be propagated as: sel.ge.sat dst b 0.25F v3: Syntax clarifications in inst->saturate assignment (Matt Turner) Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:09 +03:00
Abdiel Janulgue	0e2ba3ee82	glsl: Optimize clamp(x, b, 1.0), where b > 0.0 as max(saturate(x),b) v2: - Output max(saturate(x),b) instead of saturate(max(x,b)) - Make sure we do component-wise comparison for vectors (Ian Romanick) v3: - Add missing condition where the outer constant value is > 0.0 and inner constant is 1.0. - Fix comments to show that the optimization is a commutative operation (Matt Turner) Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:08 +03:00
Abdiel Janulgue	d92394c5d8	glsl: Optimize clamp(x, 0.0, b), where b < 1.0 as min(saturate(x),b) v2: - Output min(saturate(x),b) instead of saturate(min(x,b)) suggested by Ilia Mirkin - Make sure we do component-wise comparison for vectors (Ian Romanick) v3: - Add missing condition where the outer constant value is zero and inner constant is < 1 - Fix comments to reflect we are doing a commutative operation (Matt Turner) Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:08 +03:00
Abdiel Janulgue	8f890b119e	glsl: Optimize clamp(x, 0, 1) as saturate(x) v2: - Check that the base type is float (Ian Romanick) v3: - Make sure comments reflect that we are doing a commutative operation - Add missing condition where the inner constant is 1.0 and outer constant is 0.0 - Make indexing of operands easier to read (Matt Turner) Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:08 +03:00
Abdiel Janulgue	cbd0d643a3	glsl: Implement saturate as ir_unop_saturate Now that we have the ir_unop_saturate implemented as a single instruction, generate the correct simplified expression. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:08 +03:00
Abdiel Janulgue	cb621166dc	yi965/vec4: Add support for ir_unop_saturate Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:08 +03:00
Abdiel Janulgue	4bfe8a1e61	i965/fs: Add support for ir_unop_saturate Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:08 +03:00
Abdiel Janulgue	909fa50f5b	ir_to_mesa, glsl_to_tgsi: Add support for ir_unop_saturate Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:08 +03:00
Abdiel Janulgue	cfa8c1cb39	ir_to_mesa, glsl_to_tgsi: lower ir_unop_saturate Needed when vertex programs doesn't allow saturate Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:08 +03:00
Abdiel Janulgue	8935c12937	glsl: Add a pass to lower ir_unop_saturate to clamp(x, 0, 1) Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-08-31 21:04:08 +03:00
Abdiel Janulgue	4c0ccfc5b3	glsl: Add constant evaluation of ir_unop_saturate v2: Use CLAMP macro (Ian Romanick) Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-08-31 21:04:08 +03:00
Abdiel Janulgue	a5f02b6696	glsl: Add ir_unop_saturate Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-08-31 21:04:08 +03:00
Abdiel Janulgue	f340145107	i965/vec4/fs: Count loops in shader debug Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:04:03 +03:00
Abdiel Janulgue	ddc1d297bc	i965/vec4: inline generate_vec4_instruction() within generate_code() Suggested by Matt. This patch combines and moves back the code-generation functions from generate_vec4_instruction() into generate_code(). Makes generate_code() a bit larger, but helps us to count loops in a straightforward manner. Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>	2014-08-31 21:03:49 +03:00
Kenneth Graunke	e34a363a78	i965: Add 2x MSAA support to Broadwell fast clear code. According to the cited documentation section (but in the newer docs), x_scaledown is the same for 2x and 4x MSAA. +47 piglits. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=83081 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Cc: "10.3" <mesa-stable@lists.freedesktop.org>	2014-08-31 01:48:10 -07:00
Matt Turner	8b5ac1df17	i965/vec4: Update register coalescing test. In commit `04895f5c` I added support for reswizzling writemasks. This test was checking that we didn't support this. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=82881	2014-08-30 21:00:28 -07:00
Matt Turner	0492275038	i965: Use unreachable() to silence warning. brw_meta_fast_clear.c:211:17: warning: 'x_scaledown' may be used uninitialized in this function [-Wmaybe-uninitialized] unsigned int x_scaledown, y_scaledown; Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-08-30 21:00:28 -07:00
Chia-I Wu	a14c23735e	ilo: set INTEL_RELOC_GGTT only on GEN6 We asked MI commands to use GGTT only on GEN6.	2014-08-31 10:34:39 +08:00
Chia-I Wu	255b274d75	ilo: fix bound check for 3DSTATE_URB_VS Fix max/min entries on GEN7.5 GT2/GT3.	2014-08-31 10:34:39 +08:00
Chia-I Wu	5f4b13f5fa	ilo: replace cmd by dw0 in GPE With `e3c251071b`, the magic values are gone. We no longer need "cmd" to hide them. Replace it by dw0.	2014-08-31 10:34:39 +08:00
Alexander von Gluck IV	7b6ea6ab8c	st/hgl: Move st_visual create/destroy into hgl state_tracker	2014-08-30 19:35:24 -04:00
Alexander von Gluck IV	15da8d0761	st/hgl: Move st_manager create/destroy into hgl state_tracker	2014-08-30 19:35:24 -04:00
Rob Clark	c06afcede2	freedreno/ir3: fix potential null ptr deref Fix potential segfault in debug code. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-08-30 18:02:51 -04:00
Rob Clark	c99f09f4be	freedreno/ir3: add TXB Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-08-30 18:02:51 -04:00
Rob Clark	b823abedf8	freedreno/ir3: detect scheduler fail There are some cases where the scheduler can get itself into impossible situations, by scheduling the wrong write to pred or addr register first. (Ie. it could end up being unable to schedule any instruction if some instruction which depends on the current addr/reg value also depends on another addr/reg value.) To solve this we'd need to be able to insert extra mov instructions (which would also help when register assignment gets into impossible situations). To do that, we'd need to move the nop padding from sched into legalize. But to start with, just detect when we get into an impossible situation and bail, rather than sitting forever in an infinite loop. This way it will at least fall back to the old compiler, which might even work if you are lucky. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-08-30 18:02:50 -04:00
Ian Romanick	932b0ef1ce	glsl: Use bit-flags image attributes and uint16_t for the image format All of the GL image enums fit in 16-bits. Also move the fields from the anonymous "image" structucture to the next higher structure. This will enable packing the bits with the other bitfield. Valgrind massif results for a trimmed apitrace of dota2: n time(i) total(B) useful-heap(B) extra-heap(B) stacks(B) Before (32-bit): 76 40,572,916,873 68,831,248 63,328,783 5,502,465 0 After (32-bit): 70 40,577,421,777 68,487,584 62,973,695 5,513,889 0 Before (64-bit): 60 36,822,640,058 96,526,824 88,735,296 7,791,528 0 After (64-bit): 74 37,124,603,758 95,891,808 88,466,712 7,425,096 0 A real savings of 346KiB on 32-bit and 262KiB on 64-bit. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-08-29 23:29:19 -07:00
Ian Romanick	8eeca7a56c	glsl: Use a single bit for the dual-source blend index The only values allowed are 0 and 1, and the value is checked before assigning. Valgrind massif results for a trimmed apitrace of dota2: n time(i) total(B) useful-heap(B) extra-heap(B) stacks(B) Before (32-bit): 74 40,580,119,657 69,186,544 63,506,327 5,680,217 0 After (32-bit): 76 40,572,916,873 68,831,248 63,328,783 5,502,465 0 Before (64-bit): 89 36,822,971,897 96,526,616 88,735,296 7,791,320 0 After (64-bit): 60 36,822,640,058 96,526,824 88,735,296 7,791,528 0 A real savings of 173KiB on 32-bit and no change on 64-bit. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-08-29 23:28:26 -07:00
Ian Romanick	c0cd5bedf6	glsl: Eliminate ir_variable::data.atomic.buffer_index Just use ir_variable::data.binding... because that's the where the binding is stored for everything else that can use layout(binding=). Valgrind massif results for a trimmed apitrace of dota2: n time(i) total(B) useful-heap(B) extra-heap(B) stacks(B) Before (32-bit): 50 40,564,927,443 69,185,408 63,683,871 5,501,537 0 After (32-bit): 74 40,580,119,657 69,186,544 63,506,327 5,680,217 0 Before (64-bit): 59 36,822,048,449 96,526,888 89,113,000 7,413,888 0 After (64-bit): 89 36,822,971,897 96,526,616 88,735,296 7,791,320 0 A real savings of 173KiB on 32-bit and 368KiB on 64-bit. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-08-29 23:27:59 -07:00
Kenneth Graunke	941269f89c	mesa: Delete ctx->GeometryProgram.Cache. The VertexProgram and FragmentProgram have a Cache member for dealing with fixed function programs. There are no fixed function geometry programs, so this should never have existed, and was just copy and pasted. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-08-29 22:13:37 -07:00
Roland Scheidegger	ca4f0baca2	gallivm: fix somewhat broken NaN behavior for exp2 I actually screwed that up in `754319490f`, mistakenly thinking the code actually wanted the non-nan result before. So, introduce that missing nan behavior case and use that instead. For sse, there's no actual change in the resulting code at all, the fallback code wouldn't have done the right thing though. Of course, the actual issue I saw with pow() was completely unrelated... Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2014-08-30 01:34:41 +02:00
Roland Scheidegger	3d29e75a5f	softpipe: handle vertex texture sampling when using llvm for draw Pretty trivial, just fill in the offsets and such. The implementation is near 100% copy and paste from llvmpipe. Should be useful for debugging. No piglit change when not using SOFTPIPE_USE_LLVM=1. Now that it can do the same tests with and without using llvm for vs/gs, with llvm more pass, the only things failing only with llvm seems to be edgeflags tests and vs/gs-pow-float-float (and for the latter I'm not convinced the zero tolerance it requires is somehow mandated by glsl). Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2014-08-30 01:34:16 +02:00
Roland Scheidegger	62fd871984	llvmpipe: (trivial) enable cube map arrays The code is all in place now so enable it. Seems to pass all relevant piglit tests (just like cube maps, some of the cube map array tests need GALLIVM_DEBUG=no_quad_lod,no_rho_approx) Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2014-08-30 01:33:40 +02:00
Roland Scheidegger	9da75f96bc	gallivm: handle cube map arrays for texture sampling Pretty easy, just make sure that all paths testing for PIPE_TEXTURE_CUBE also recognize PIPE_TEXTURE_CUBE_ARRAY, and add the layer * 6 calculation to the calculated face. Also handle it for texture size query, looks like OpenGL wants the number of cubes, not layers (so need division by 6). No piglit regressions. v2: fix up adding cube layer to face for seamless filtering (needs to happen after calculating per-sample face). Undetected by piglit unfortunately. Reviewed-by: Jose Fonseca <jfonseca@vmware.com> (v1)	2014-08-30 01:33:02 +02:00
Roland Scheidegger	26a5156de7	draw: kill off bogus assertion in tgsi_fetch_gs_outputs Not sure why it was there but it is definitely not an error if gs outputs are infs/nans. Besides, the outputs can be ints, in which case any small negative number asserted. This fixes piglit's texelFetch gs isamplerXX crashes with softpipe (down from 14 to 2). Bug https://bugs.freedesktop.org/show_bug.cgi?id=80012 Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2014-08-30 01:17:47 +02:00
Roland Scheidegger	c9ae5038d5	softpipe: don't assert on illegal wrap mode for rect textures piglit tex-miplevel-selection nowadays doesn't use repeat wrap mode due to sampler objects any longer, however at the time of the clear the wrap mode is still illegal and at this point we get to verify the state, including samplers (even though they won't get used), and because mesa doesn't treat it as an incomplete texture as the spec says it should, we hit the assertion. Just warn about this for now instead. Gets crashes down from 44 to 14 in a piglit run (all were in various tests of tex-miplevel-selection with texture rectangles). Though just about all tex-miplevel-selection tests fail anyway for other reasons. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2014-08-30 01:17:47 +02:00
Roland Scheidegger	032fe4ed23	tgsi: (trivial) fix handling msaa resources on TXF Just handle as ordinary 2d / 2d array resources. Prevents an assertion failure with softpipe and piglit glsl-resource-not-bound 2DMS/2DMSArray tests. While here also fix TXD shadowCube similarly, which fixes the crash with piglit tex-miplevel-selection textureGrad CubeShadow (the test will still fail due to softpipe being broken). This fixes https://bugs.freedesktop.org/show_bug.cgi?id=80011 Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2014-08-30 01:17:47 +02:00
Roland Scheidegger	99105454b0	draw: remove fishy num_samplers/num_sampler_views check in llvm path This was meant for softpipe to not crash at some point if vertex texturing was used. It is, however, fishy because it uses values from draw_set_samplers/draw_set_sampler_views and not from the shader key. Albeit we should still in all cases actually generate a new shader if this changes (because the samplers and views themselves are in the key) I don't want to think again wondering if that's really correct in the future. Besides, at least today, it does not actually work for softpipe, as this was relying on softpipe not actually calling draw_set_samplers/sampler_views at all - I've verified it crashes regardless (if there were a tex instruction in the vs, which normally should not happen anyway). For drivers which do indeed not call these functions because they don't support vertex texturing at all (r300), this should still not crash because the static texture data is all zero, which causes the sampling functions to take an early out (same as is done if no texture is bound at the slot used for sampling - verified with hacked up softpipe). Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2014-08-30 01:17:46 +02:00
Roland Scheidegger	85d4cc4790	mesa: fix fallback texture for cube map array mesa was creating a cube map array texture with just one layer, which is not legal. This caused an assertion failure when using that texture later in llvmpipe (when enabling cube map arrays) since it verifies the number of layers in the view is divisible by 6 (the sampling code might well crash randomly otherwise) with piglit glsl-resource-not-bound CubeArray -fbo -auto. v2: use appropriately sized texel array... Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> (v1)	2014-08-30 01:17:46 +02:00
Aaron Watry	7c73ee677f	r600/compute: Don't leak compute pool item_list/unallocated_list v3: Fix multi-line comment format v2: Change to C-style comments and fix indentation Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Bruno Jiménez <brunojimen@gmail.com>	2014-08-29 17:38:24 -05:00
Michel Dänzer	6cd0dbc415	u_vbuf: Make sure all caps are initialized Pointed out by valgrind. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=83148 Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2014-08-29 12:15:10 +09:00

1 2 3 4 5 ...

65029 Commits All Branches Search

65029 Commits

All Branches