KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Nicolai Hähnle	4b7961da77	radeonsi: extract DB->CB copy logic into its own function Also clean up some of the looping. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-07-06 10:43:51 +02:00
Nicolai Hähnle	18cc825fb9	radeonsi: sample from flushed depth texture when required Note that this has no effect yet. A case where can_sample_z/s can be false in radeonsi will be added in a later patch. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-07-06 10:43:51 +02:00
Nicolai Hähnle	f2eb34f82f	gallium/radeon: replace is_flushing_texture with db_compatible This is a left-over of when I considered generalizing the separate stencil support. I do prefer the new name since it emphasizes what flushing vs. non-flushing means from a functional point-of-view, namely special handling of the texture format. v2: adjust r600_init_color_surface as well Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-07-06 10:43:48 +02:00
Nicolai Hähnle	dd65126153	gallium/radeon: add can_sample_z/s flags for textures v2: adjust r600_init_color_surface as well Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-07-06 10:43:43 +02:00
Nicolai Hähnle	065eeb79f7	radeonsi: correctly mark levels of 3D textures as fully decompressed Account for the fact that max_layer is minified for higher levels. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-07-06 10:42:49 +02:00
Nicolai Hähnle	19f8d2a843	gallium/radeon/winsyses: remove unused stencil_offset Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-07-06 10:42:49 +02:00
Nicolai Hähnle	3a1da559c5	gallium/radeon: remove redundant null-pointer check v2: keep using r600_texture_reference Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-07-06 10:42:48 +02:00
Nicolai Hähnle	5b87eef031	gallium/radeon: print StencilLayout only once It is the same for all levels. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-07-06 10:42:48 +02:00
Nicolai Hähnle	bae066c3f0	gallium/radeon: flush stdout after printing texture information Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-07-06 10:42:48 +02:00
Ilia Mirkin	a37e46323c	glsl: don't try to lower non-gl builtins as if they were gl_FragData If a shader has an output array, it will get treated as though it were gl_FragData and rewritten into gl_out_FragData instances. We only want this to happen on the actual gl_FragData and not everything else. This is a small part of the problem pointed out by the below bug. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=96765 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "11.2 12.0" <mesa-stable@lists.freedesktop.org>	2016-07-05 21:22:01 -04:00
Ian Romanick	795d8dff89	glsl: Document and enforce restriction on type values Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2016-07-05 17:55:29 -07:00
Ian Romanick	3119871bd9	glsl: Pack integer and double varyings as flat even if interpolation mode is none v2: Also update varying_matches::compute_packing_class(). Suggested by Timothy Arceri. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=96358 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "12.0" <mesa-stable@lists.freedesktop.org> Cc: Gregory Hainaut <gregory.hainaut@gmail.com> Cc: Ilia Mirkin <imirkin@alum.mit.edu>	2016-07-05 16:58:27 -07:00
Ian Romanick	73a6a4ce49	mesa: Strip arrayness from interface block names in some IO validation Outputs from the vertex shader need to be able to match per-vertex-arrayed inputs of later stages. Acomplish this by stripping one level of arrayness from the names and types of outputs going to a per-vertex-arrayed stage. v2: Add missing checks for TESS_EVAL->GEOMETRY. Noticed by Timothy Arceri. v3: Use a slightly simpler stage check suggested by Ilia. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=96358 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "12.0" <mesa-stable@lists.freedesktop.org> Cc: Gregory Hainaut <gregory.hainaut@gmail.com> Cc: Ilia Mirkin <imirkin@alum.mit.edu>	2016-07-05 16:58:27 -07:00
Charmaine Lee	32651c67d1	svga: avoid emitting redundant DXSetRenderTargets command Tested with Lightsmark2008, MTT piglit, glretrace, conform. Reviewed-by: Sinclair Yeh <syeh@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2016-07-05 16:58:29 -06:00
Leo Liu	aa7d42a5f9	radeon/vce: update encRefPic addr and array mode to tiled Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2016-07-05 09:15:50 -04:00
Leo Liu	e560a11b87	radeon/vce: increase cpb height alignment Height should be aligned with 2 macroblocks, thus making safer for tiled mode Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2016-07-05 09:15:47 -04:00
Iago Toral Quiroga	fa0654fc3c	i965: Remove trailing whitespace Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-07-05 14:06:37 +02:00
Iago Toral Quiroga	d92ac67126	i965: Make inline function static Without this the i965 driver fails to load. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-07-05 14:05:58 +02:00
Emil Velikov	cbc37f72e3	anv: install the intel_icd.json to ${datarootdir} by default As mentioned by the spec (and used by Archlinux and Debian) default to ${datarootdir} as opposed to ${sysconfdir} for the default location. Cc: Jason Ekstrand <jason@jlekstrand.net> Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-07-05 12:17:34 +01:00
Emil Velikov	744d0d8f3b	swr: automake: don't ship LLVM version specific generated sources Otherwise things will fail to build, if the builder is using another version of LLVM. v2: annotate all the dependencies of builder_gen.h v3: clean the generated files as needed v4: comment cleanups (Tim) Cc: "12.0" <mesa-stable@lists.freedesktop.org> Tested-by: Tim Rowley <timothy.o.rowley@intel.com> Tested-by: Chuck Atkins <chuck.atkins@kitware.com> (v2) Reported-by: Chuck Atkins <chuck.atkins@kitware.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2016-07-05 12:17:05 +01:00
Emil Velikov	22e9357028	automake: don't mandate git_sha1.h/MESA_GIT_SHA1 It has proven subtle to get it right both from the build side POV (see commit list below) and builders due to their varying workflows. Furthermore it does not fully fulfil the reason why it was enforced - to detect uniqueness between different builds, in order to distinguish and invalidate Vulkan/GL caches. With that having a much better solution (previous commit) we can drop this solution. This effectively reverts the following commits: `359d9dfec3` ("mesa: automake: add directory prefix for git_sha1.h") `2c424e00c3` ("mesa: automake: ensure that git_sha1.h.tmp has the right attributes") `b7f7ec7843` ("mesa: automake: distclean git_sha1.h when building OOT") `8229fe68b5` ("automake: get in-tree `make distclean' working again.") Cc: Timo Aaltonen <tjaalton@debian.org> Cc: Haixia Shi <hshi@chromium.org> Cc: Jason Ekstrand <jason@jlekstrand.net> Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com>	2016-07-05 12:16:20 +01:00
Emil Velikov	e5c1229a9a	anv: automake: indent with tabs and not spaces Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2016-07-05 12:16:06 +01:00
Emil Velikov	addb099ce8	anv: use cache uuid based on the build timestamp. Do not rely on the git sha1: - its current truncated form makes it less unique - it does not attribute for local (Vulkand or otherwise) changes Use a timestamp produced at the time of build. It's perfectly unique, unless someone explicitly thinkers with their system clock. Even then chances of producing the exact same one are very small, if not zero. v2: Remove .tmp rule. Its not needed since we want for the header to be regenerated on each time we call make (Eric). v3: - Honour SOURCE_DATE_EPOCH, to make the build reproducible (Michel) - Replace the generated header with a define, to prevent needless builds on consecutive `make' and/or `make install' calls. (Dave) v4: - Keep the timestamp generation at make time. (Jason) v5: - Ensure that file is regenerated on incremental builds. Cc: Michel Dänzer <michel@daenzer.net> Cc: Dave Airlie <airlied@gmail.com> Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-07-05 12:15:23 +01:00
Emil Velikov	f98530b739	clover: conditionally use MESA_GIT_SHA1 Considering how hard/annoying it was for many peoples' workflow to properly generate the macro, it will be demoted to conditionally available with follow-up commits. v2: Kill off gracious blank line (Vedran). Cc: mesa-stable@lists.freedesktop.org Cc: Vedran Miletić <vedran@miletic.net> Cc: Francisco Jerez <currojerez@riseup.net> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com> (v1) Reviewed-by: Vedran Miletić <vedran@miletic.net>	2016-07-05 12:14:34 +01:00
Timothy Arceri	9c9e3e7ee1	mesa: stop copying SamplerUnits twice The call to _mesa_update_shader_textures_used() already takes care of copying for us. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2016-07-05 20:18:05 +10:00
Timothy Arceri	25a32c2cbf	mesa: make attribute binding message more useful Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-07-05 20:18:05 +10:00
Timothy Arceri	8f1ca0ee3f	i965: make more effective use of SamplersUsed Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2016-07-05 20:18:05 +10:00
Timothy Arceri	51f912786f	glsl: stop allocating memory for UBOs during linking This just stops counting and assigning a storage location for these uniforms, the count is only used to create the uniform storage. These uniform types don't use this storage. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2016-07-05 20:18:05 +10:00
Timothy Arceri	549b9b12fc	glsl: mark link_uniform_blocks_are_compatible() as static Missed this when doing `6d1a59d15b`. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2016-07-05 20:18:05 +10:00
Timothy Arceri	30812e90d1	mesa: fix build error Fix build error cased by `6a524c76f5`.	2016-07-05 18:42:06 +10:00
Gregory Hainaut	6a524c76f5	mesa: faster validation of sampler unit mapping for SSO Code was inspired from _mesa_update_shader_textures_used However unlike _mesa_update_shader_textures_used that only check for a single stage, it will check all stages. It avoids to loop on all uniforms, only active samplers are checked. For my use case: high FS frequency switches with few samplers. Perf event (relative to nouveau_dri.so) goes from 5.01% to 1.68% for the _mesa_sampler_uniforms_pipeline_are_valid function. Signed-off-by: Gregory Hainaut <gregory.hainaut@gmail.com> Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2016-07-05 16:44:31 +10:00
Dave Airlie	cb728df967	Revert "st/glsl_to_tgsi: don't increase immediate index by 1." This reverts commit `27d456cc87`. DOH, what seems right and what is right with fp64 are always two different things. This regressed: spec@arb_gpu_shader_fp64@shader_storage@layout-std140-fp64-mixed-shader on radeonsi Reported-by: Michel Dänzer <michel@daenzer.net> Cc: "11.2 12.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-07-05 10:25:29 +10:00
Samuel Pitoiset	c1fb3290a6	nvc0/ir: rename NVE4_SU_INFO_XXX to NVC0_SU_INFO_XXX While we are at it, fix a typo inside the comment which describes what those constants are for. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-07-05 01:44:15 +02:00
Samuel Pitoiset	f3b9fff3c3	nvc0/ir: reset the base offset for indirect images accesses In presence of an indirect image access, the base offset should be zeroed because the stride will be computed twice. This is a pretty rare situation but it can happen when tex.r > 0. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "11.2 12.0" <mesa-stable@lists.freedesktop.org>	2016-07-05 01:44:12 +02:00
Samuel Pitoiset	cb828b7b18	gm107/ir: fix sign bit emission for FADD32I When emitting OP_SUB, the sign bit for FADD and FADD32I is not at the same position. It's at position 45 for FADD but 51 for FADD32I. This fixes the following piglit test: tests/spec/arb_fragment_program/fdo30337b.shader_test Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: <mesa-stable@lists.freedesktop.org>	2016-07-05 01:44:08 +02:00
Eric Anholt	ac772b24a1	vc4: Regularize instruction emit macros ALU0 didn't have the _dest variant, and ALU2 didn't unset the def the way ALU1 did. This should make the ALU[012] macros much clearer, by moving most of their contents to vc4_qir.c	2016-07-04 16:33:22 -07:00
Eric Anholt	8a52f03f5d	vc4: Enable dead CF elimination. Now that we're about to start generating control flow in our NIR, we want this in place. It optimizes things frequently in the CS, when the GL VS has control flow that doesn't affect the vertex position.	2016-07-04 16:33:22 -07:00
Eric Anholt	8f2af4763a	vc4: Optimize out redundant SF updates. Tiny change on shader-db currently, but it will be important when we start emitting a lot of SFs from the same variable as part of control flow support. total instructions in shared programs: 89463 -> 89430 (-0.04%) instructions in affected programs: 1522 -> 1489 (-2.17%) total estimated cycles in shared programs: 250060 -> 250015 (-0.02%) estimated cycles in affected programs: 8568 -> 8523 (-0.53%)	2016-07-04 16:33:22 -07:00
Eric Anholt	200b4e4bd5	vc4: Move SF removal to a separate peephole pass. The DCE pass is going to change significantly to handle control flow, while we don't really need to change it for the SF handling. We also need to add some more SF peephole optimization for SF updates generated by control flow support. No change on shader-db.	2016-07-04 16:33:22 -07:00
Eric Anholt	aa76ba6f2f	vc4: DCE instructions with a NULL destination. I'm going to add an optimization for redundant SF update removal, which will just remove the SF and leave us (in many cases) with an instruction with a NULL destination and no side effects. Rather than teaching that pass whether the whole instruction can be removed, leave that responsibility to this pass.	2016-07-04 16:33:22 -07:00
Eric Anholt	2a8973fb78	vc4: Mark texturing setup instructions as having side effects. We need to not DCE them even though they don't have a destination in QIR. We also shouldn't relocate them in vc4_opt_vpm. Neither of these things happen, but I'm about to make DCE consider instructions with a NULL destination.	2016-07-04 16:33:22 -07:00
Eric Anholt	44df374a9c	vc4: Fix a pasteo in scheduling condition flag usage. Noticed by code inspection. This hasn't been too big of a deal, because our cond usages all start out as adder ops, either MOVs or the FTOI for Z writes. MOVs can get converted to mul ops during scheduling, but apparently we hadn't hit this.	2016-07-04 16:33:22 -07:00
Eric Anholt	eaa53f80d9	vc4: Drop the dead QIR_PACK() macro. This isn't used since we switched to using the dst.pack field instead of custom instructions.	2016-07-04 16:33:18 -07:00
Marek Olšák	5c92c21369	radeonsi: do compilation from si_create_shader_selector asynchronously Main shader parts and geometry shaders are compiled asynchronously by util_queue. si_create_shader_selector doesn't wait and returns. si_draw_vbo(si_shader_select) waits for completion. This has the best effect when shaders are compiled at app-loading time. It doesn't help much for shaders compiled on demand, even though VS+PS compilation should take as much as time as the bigger one of the two. If an app creates more shaders, at most 4 threads will be used to compile them. Debug output disables this for shader stats to be printed in the correct order. (We could go even further and build variants asynchronously too, then emit draw calls without waiting and emit incomplete shader states, then force IB chaining to give the compiler more time, then sync the compilation at the IB flush and patch the IB with correct shader states. This is great for compilation before draw calls, but there are some difficulties such as scratch and tess states requiring the compiler output, and an on-disk shader cache will likely be a much better and simpler solution.) Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-07-05 00:47:13 +02:00
Marek Olšák	84824935cf	radeonsi: don't lock shader cache mutex during compilation to allow multiple shaders to be compiled simultaneously. ALso, shader-db can again use all 4 cores. v2: Remove the pipe_mutex_unlock call in the error path. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> (v1)	2016-07-05 00:47:13 +02:00
Marek Olšák	850cd953b1	radeonsi: separate the compilation chunk of si_create_shader_selector The function interface is ready to be used by util_queue. Also, si_shader_select_with_key can no longer accept si_context. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-07-05 00:47:13 +02:00
Marek Olšák	6781a2a994	radeonsi: move LLVMTargetMachineRef creation to a separate function Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-07-05 00:47:13 +02:00
Marek Olšák	8a4ace4a47	gallium/radeon: add and use radeon_info::max_alloc_size (v2) v2: - squashed the patches - use INT_MAX - clamp max_const_buffer_size - check the DRM version in radeon Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Vedran Miletić <vedran@miletic.net>	2016-07-05 00:47:13 +02:00
Marek Olšák	027ad71b57	radeonsi: print LLVM IRs to ddebug logs Getting LLVM IRs of hanging shaders have never been easier. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-07-05 00:47:13 +02:00
Marek Olšák	28a03be06b	radeonsi: enable string markers and record apitrace call numbers Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-07-05 00:47:13 +02:00

1 2 3 4 5 ...

82981 Commits All Branches Search

82981 Commits

All Branches