KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Samuel Iglesias Gonsálvez	fc1708948b	spirv/nir: add (un)packDouble2x32() translation Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	c332432bae	spirv/nir: implement DF conversions SPIR-V does not have special opcodes for DF conversions. We need to identify them by checking the bit size of the operand and the result. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	27cf6a369f	nir: add nir_type_conversion_op() This function returns the nir_op corresponding to the conversion between the given nir_alu_type arguments. This function lacks support for integer-based types with bit_size != 32 and for float16 conversion ops. v2: - Improve readiness of the code and delete cases that don't happen now (Jason) Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	3a571fcc43	nir: add nir_get_nir_type_for_glsl_type() v2 (Jason): - Refactor nir_get_nir_type_for_glsl_type() to avoid using unneeded helpers (Jason) v3: - Use return directly (Jason) Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	59944a77ae	spirv: add support for doubles on OpComposite{Insert,Extract} Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	e6bebb9982	spirv: Enable double floating points when copying variables in _vtn_variable_copy() Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	9d71cfeff8	spirv: add double support to _vtn_block_load_store() Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	0cd0c32c06	spirv: add double support to _vtn_variable_load_store Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	8076c8b59f	spirv: add double support to SpvOpCompositeExtract v2 (Jason): - Add asserts. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	a966387883	spirv: fix SpvOpSpecConstantOp with SpvOpVectorShuffle working with double-based vecs We need to pick two 32-bit values per component to perform the right shuffle operation. v2 (Jason): - Add assert to check matching bit sizes (Jason) - Simplify the code to pick components (Jason) v3: - Switch on bit_size once (Jason) - Add comment to explain the constant value for unused components (Erik) Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	ec686ff62c	spirv: add DF support to SpvOp*ConstantComposite v2 (Jason): - Add assert. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	2bf4d0ba7a	spirv: add DF support to vtn_const_ssa_value() Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	d77ffc3d87	spirv: add support for loading DF constants v2 (Jason): - Add assert. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	9602c7c02f	spirv: add definition of double based data types Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	d1bbe2c94e	spirv: fix typo in spec_constant_decoration_cb() Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Dave Airlie	41969f0d06	radv: drop unused fields in physical device. Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-01-09 16:48:14 +10:00
Tapani Pälli	8b43f42011	i965: call intel_prepare_render always when reading pixels Currently we do this only in the fallback code (when tiled memcpy version failed) but it needs to be done always so that we have correct read and write buffer in place. No regressions seen in CI. Fixes: dEQP-EGL.functional.buffer_age.* Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=98330 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-01-09 07:44:53 +02:00
Timothy Arceri	953e4e4417	st/mesa: pass gl_program to st_bind_ubos() We no longer need anything from gl_linked_shader. Reviewed-by: Eric Anholt <eric@anholt.net>	2017-01-09 15:27:35 +11:00
Timothy Arceri	270e584a86	st/mesa: pass gl_program to st_bind_images() We no longer need anything from gl_linked_shader. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-09 15:27:35 +11:00
Timothy Arceri	59ac77b410	st/mesa: stop passing gl_linked_shader to set_affected_state_flags() We now get everything we need from the gl_program param. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-09 15:27:35 +11:00
Timothy Arceri	ae632afe4f	st/mesa/glsl: set num_images directly in shader_info This change also removes the now duplicate NumImages field. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-09 15:27:35 +11:00
Timothy Arceri	4b30011d34	st/mesa: pass gl_program to st_bind_ssbos() We no longer need to pass gl_shader_program. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-09 15:27:34 +11:00
Timothy Arceri	1130f82a88	nir: add another comparison simplification On BDW: total instructions in shared programs: 13061877 -> 13060965 (-0.01%) instructions in affected programs: 133569 -> 132657 (-0.68%) helped: 566 HURT: 0 total cycles in shared programs: 256611784 -> 256599536 (-0.00%) cycles in affected programs: 861016 -> 848768 (-1.42%) helped: 379 HURT: 73 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 12:32:16 +11:00
Kenneth Graunke	3371de38f2	nir: Turn bcsel of +/- 1.0 and 0.0 into b2f sequences. On BDW: total instructions in shared programs: 13074882 -> 13068703 (-0.05%) instructions in affected programs: 1823116 -> 1816937 (-0.34%) helped: 4187 HURT: 537 total cycles in shared programs: 256622718 -> 256425382 (-0.08%) cycles in affected programs: 123790120 -> 123592784 (-0.16%) helped: 3823 HURT: 2037 total spills in shared programs: 15276 -> 14929 (-2.27%) spills in affected programs: 9446 -> 9099 (-3.67%) helped: 352 HURT: 1 total fills in shared programs: 20496 -> 20144 (-1.72%) fills in affected programs: 13040 -> 12688 (-2.70%) helped: 352 HURT: 1 LOST: 2 GAINED: 21 v2: Rely on 'a' being a well-formed boolean (Connor, Eric). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 12:32:16 +11:00
Kenneth Graunke	1c50d31c26	nir: Convert ineg(b2i(a)) to a if it's a boolean. On BDW: total instructions in shared programs: 13071119 -> 13070371 (-0.01%) instructions in affected programs: 83424 -> 82676 (-0.90%) helped: 505 HURT: 45 (all TCS, all hurt by a single instruction) total cycles in shared programs: 256601322 -> 256588932 (-0.00%) cycles in affected programs: 819410 -> 807020 (-1.51%) helped: 450 HURT: 57 total loops in shared programs: 2950 -> 2942 (-0.27%) loops in affected programs: 8 -> 0 helped: 7 HURT: 0 v2: Drop unnecessary 'a@bool' annotation (Connor, Eric). Add a comment explaining the rule (Ian). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> [v1] Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-01-09 12:32:16 +11:00
Kenneth Graunke	86b9be777f	i965: Move TES input VUE map calculation out a layer. In Vulkan, we'll compile the TCS and TES at the same time, so I can just pass the TCS output VUE map to brw_compile_tes as the TES input VUE map. So, we only need to do this in GL. Move it to the GL-specific layer. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-07 22:24:10 -08:00
Kenneth Graunke	6e8ac0641f	i965: Pass NULL for gl_program when compiling TES. This isn't needed, and Vulkan doesn't have one. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-07 22:24:10 -08:00
Kenneth Graunke	08f8f1bcd5	i965: Move TES spacing/domain/topology setup to brw_compile_tes(). Moving this down a layer lets us share code between Vulkan and GL. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-07 22:24:10 -08:00
Kenneth Graunke	cc2df4bb81	i965: Access TES shader info via NIR. NIR exists in both GL and Vulkan, but gl_program is GL specific. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-07 22:24:10 -08:00
Kenneth Graunke	a4fd84ef5f	mesa: Introduce a compiler enum for tessellation spacing. It feels weird using GL_* enums in a Vulkan driver. v2: Fix the TESS_SPACING -> PIPE_TESS_SPACING conversion. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-07 22:22:28 -08:00
Kenneth Graunke	9bb89175e6	compiler: Change shader_info->tes.vertex_order into a ccw boolean. The vertex order is either clockwise or counterclockwise. We can just store a "ccw" boolean rather than GLenum values. I don't want to use GLenums in a Vulkan driver, and even in GL a simple boolean works fine. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-07 20:42:32 -08:00
Jason Ekstrand	faa1edeeb7	anv/pipeline: Call NIR passes using NIR_PASS_V This lets us get validation without having to do it manually. Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2017-01-07 15:45:09 -08:00
Jason Ekstrand	43e0b0d4b2	anv/pipeline: Only call remove_dead_variables once It can handle multiple modes at a time now so there's no reason to call it repeatedly. Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2017-01-07 15:45:09 -08:00
Kenneth Graunke	957ec00243	Revert recent GLSL slot counting fiasco. I apparently broke mark_whole_variable in ir_set_program_inouts. It was passing a type that wasn't var->type, so the wrapper didn't work out. It's all broken, revert it and start over. Fixes all kinds of things on other drivers. Revert "glsl: Make is_fixed_function_array actually check for varyings." This reverts commit `42699e1271`. Revert "glsl: Mark whole variable used for ClipDistance and TessLevel." This reverts commit `5c580e64cc`. Revert "glsl: Override the # of varying slots for ClipDistance and TessLevel." This reverts commit `8b5749f65a`. Revert "glsl: Create and use a new ir_variable::count_attribute_slots() wrapper." This reverts commit `6aa5cb34d0`.	2017-01-07 15:15:08 -08:00
Kenneth Graunke	42699e1271	glsl: Make is_fixed_function_array actually check for varyings. We can't check VARYING_SLOT_* locations until we've determined that the variable is actually a varying. Fixes assert failures in drivers which actually use this path, such as radeonsi and i915. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=99314 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2017-01-07 13:05:37 -08:00
Kai Wasserbäch	5a165b4086	drirc: Allow extension midshader for Divinity: Original Sin (EE) See also <https://bugs.freedesktop.org/show_bug.cgi?id=93551#c27> where this was first observed as a requirement. Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2017-01-07 15:47:35 +01:00
Timothy Arceri	1edc53a66b	glsl: fix opt_minmax redundancy checks against baserange Marking operations as redundant if they are equal to the base range is fine when the tree structure is something like this: max / \ max b / \ 3 max / \ 3 a But the opt falls apart with a tree like this: max / \ max max / \ / \ 3 a b 3 The problem is that both branches are treated the same: descending in the left branch will prune the constant, and then descending the right branch will prune the constant there as well, because limits[0] wasn't updated to take the change on the left branch into account, and so we still get [3,\infty) as baserange. In order to fix the bug we just disable the marking of redundant expressions when they match the baserange. NIR algebraic opt will clean up the first tree for anyway, hopefully other backends are smart enough to do this also. Cc: "13.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-07 21:46:36 +11:00
Jason Ekstrand	45912fb908	i965/compiler: Use the new nir_opt_copy_prop_vars pass We run this after nir_lower_vars_to_ssa so that as many load/store_var intrinsics as possible before copy_prop_vars executes. This is because the pass isn't particularly efficient (it does a lot of linear walks of a linked list) so we'd like as much of the work as possible to be done before copy_prop_vars runs. Shader DB results on Sky Lake: total instructions in shared programs: 12020290 -> 12013627 (-0.06%) instructions in affected programs: 26033 -> 19370 (-25.59%) helped: 16 HURT: 13 total cycles in shared programs: 137772848 -> 137549012 (-0.16%) cycles in affected programs: 6955660 -> 6731824 (-3.22%) helped: 217 HURT: 237 total loops in shared programs: 3208 -> 3208 (0.00%) loops in affected programs: 0 -> 0 helped: 0 HURT: 0 total spills in shared programs: 4112 -> 4057 (-1.34%) spills in affected programs: 483 -> 428 (-11.39%) helped: 2 HURT: 0 total fills in shared programs: 5519 -> 5102 (-7.56%) fills in affected programs: 993 -> 576 (-41.99%) helped: 2 HURT: 0 LOST: 0 GAINED: 0 Broadwell had similar results. On older hardware, the impact isn't as large because they don't advertise GL 4.5. Of the hurt programs, all but one are hurt by a single instruction and the one is hurt by 3 instructions. All of the helped programs, on the other hand, are helped by at least 3 instructions and one kerbal space program shader is helped by 44.59%. The real star of the show, however, is the Gl43CSDof synmark2 benchmark which has two shaders which are cut by 28% and 40% and the over-all runtime performance of the benchmark on my Sky Lake laptop is improved by around 25-30% (it's a bit hard to be exact due to thermal throttling). Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2017-01-06 16:44:29 -08:00
Jason Ekstrand	62332d139c	nir: Add a local variable-based copy propagation pass Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2017-01-06 16:44:28 -08:00
Jason Ekstrand	830dca74fe	nir/builder: Add a helper for getting the most recently added instruction Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2017-01-06 16:44:28 -08:00
Jason Ekstrand	75a6707984	nir/builder: Add a load_deref_var helper Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2017-01-06 16:44:28 -08:00
Jason Ekstrand	13a2f20740	nir/dead_variables: Remove shader-local variables that are only written Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2017-01-06 16:44:28 -08:00
Jason Ekstrand	58fe5c57cd	nir/dead_variables: Removed shared variables when requested Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2017-01-06 16:44:28 -08:00
Jason Ekstrand	2d7bed6158	anv/formats: Use the real format for B4G4R4A4_UNORM_PACK16 on gen8 Because border color is handled pre-swizzle, when we move the alpha channel around in the format, the OPAQUE_BLACK border colors don't work correctly on B4G4R4A4_UNORM_PACK16 with the hack. This fixes the following Vulkan CTS tests on Broadwell: dEQP-VK.pipeline.sampler.view_type.2d_array.format.b4g4r4a4_unorm_pack16.address_modes.all_mode_clamp_to_border_opaque_black dEQP-VK.pipeline.sampler.view_type.1d_array.format.b4g4r4a4_unorm_pack16.address_modes.all_mode_clamp_to_border_opaque_black dEQP-VK.pipeline.sampler.view_type.2d.format.b4g4r4a4_unorm_pack16.address_modes.all_mode_clamp_to_border_opaque_black dEQP-VK.pipeline.sampler.view_type.1d.format.b4g4r4a4_unorm_pack16.address_modes.all_mode_clamp_to_border_opaque_black dEQP-VK.pipeline.sampler.view_type.3d.format.b4g4r4a4_unorm_pack16.address_modes.all_mode_clamp_to_border_opaque_black Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "13.0" <mesa-stable@lists.freedesktop.org>	2017-01-06 16:44:15 -08:00
Jason Ekstrand	4e7958fb13	isl: Mark A4B4G4R4_UNORM as supported on gen8 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "13.0" <mesa-dev@lists.freedesktop.org>	2017-01-06 16:44:15 -08:00
Pierre-Loup A. Griffais	f6d3af2af6	radv: fix depth transitions with layerCount = VK_REMAINING_ARRAY_LAYERS Interpreting layerCount literally would try to create billions of image views in radv_process_depth_image_inplace(). Signed-off-by: Pierre-Loup A. Griffais <pgriffais@valvesoftware.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-01-07 01:26:08 +01:00
Kenneth Graunke	e6ae19944d	i965: Rework gl_TessLevel[] handling to use NIR compact arrays. Treating everything as scalar arrays allows us to drop a bunch of special case input/output munging all throughout the backend. Instead, we just need to remap the TessLevel components to the appropriate patch URB header locations in remap_patch_urb_offsets(). We also switch to treating the TES input versions of these as ordinary shader inputs rather than system values, as remap_patch_urb_offsets() just makes everything work out without special handling. This regresses one Piglit test: arb_tessellation_shader-large-uniforms/GL_TESS_CONTROL_SHADER-array-at-limit The compiler starts promoting the constant arrays assigned to gl_TessLevel to uniform arrays. Since the shader also has a uniform array that uses the maximum number of uniform components, this puts it over the uniform component limit enforced by the linker. This is arguably a bug in the constant array promotion code (it should avoid pushing us over limits), but is unlikely to penalize any real application. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-06 15:55:48 -08:00
Kenneth Graunke	31d9de58ab	i965: Inline store_output helper in quads workaround code. It's only used in one place, it ignores the offset parameter currently, and I want to add more parameters...at which point, passing in a bunch of integers seems less obvious than writing it out. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-06 15:55:47 -08:00
Kenneth Graunke	311b1f0a98	nir: Make glsl_to_nir compact scalar TessLevel arrays. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-06 15:55:46 -08:00
Kenneth Graunke	496693d466	i965: Make unify_interfaces not spread VARYING_BIT_TESS_LEVEL_*. This is harmless today because gl_TessLevelInner/Outer in the TES is currently treated as system values. However, when we move to treating them as inputs, this would cause a bug: with no TCS present, it would propagate TES reads of VARYING_SLOT_TESS_LEVEL into the VS output VUE map slots. This is totally bogus - those don't even exist in the VS. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-06 15:55:42 -08:00

1 2 3 4 5 ...

87951 Commits All Branches Search

87951 Commits

All Branches