mirrors/mesa - Frog Git

Commit Graph

Author	SHA1	Message	Date
Gert Wollny	57361d89fa	mesa/st: Tie depth_clamp code into other shaders (GS and TES) v2: Use file scope defined depth_range_state in common v3: - don't use the one_shader_variant property, as this is not correct (Marek) - also use tests on available shader stages to enable depth_clamp lowering v4: Don't use key.st, use st directly (Marek) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-01 05:58:53 +00:00
Gert Wollny	d81ba38b02	mesa/st: Tie depth_clamp lowering into the FS v1 implemented by Erik Faye-Lund <erik.faye-lund@collabora.com> v2: Use different call for FS v3: Use file scope defined depth_range_state Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-01 05:58:53 +00:00
Gert Wollny	fefb152067	mesa/st: Tie depth clamp lowering in to the VP code v1: implemented by Erik Faye-Lund <erik.faye-lund@collabora.com> v2: Add handling of the ARB_clip_control depth mode v3: Move depth_range_state to file scope and remove trailing zeros (Erik) v4: - don't use the one_shader_variant property, as this is not correct (Marek) - also use tests on available shader stages to enable depth_clamp lowering v5: Don't use key.st, use st directly (Marek) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-01 05:58:53 +00:00
Erik Faye-Lund	b048d8bf8f	mesa/st: add tgsi-lowering code for depth-clamp This is a TGSI pass that lowers depth-clamping into shader-operations, by replacing the depth-value with 0 (a z-coordinate of zero will always pass the OpenGL depth test conditions), and using a dedicated varying to interpolate the real depth-value instead. Finally we replace the depth-output in the fragment shader. v1 implemented by Erik Faye-Lund <erik.faye-lund@collabora.com> v2: Add support for handling depth clip mode, and refactor code v3: - Rename _vs functions to _last_vertex_stage (Erik) - Use 0.0 depth to avoid clipping (Erik) v4: Fix inversion of bool value for clip control property Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-01 05:58:53 +00:00
Gert Wollny	78ba12f40f	mesa/st: replace boolean declarations by bool Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-01 05:58:53 +00:00
Gert Wollny	7fb47195d8	Revert "softpipe: Don't draw when rasterizer_discard is set" This was too aggressive and breaks TF (Ilia) This reverts commit `4ee638cd78`. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2019-08-01 05:57:41 +00:00
Eric Engestrom	a563bb9e28	docs: reword meson instructions Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-08-01 00:42:02 +01:00
Eric Engestrom	8a1e803643	travis: drop unnecessary Meson option for MacOS Those are already their default values on MacOS. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com>	2019-08-01 00:25:20 +01:00
Jason Ekstrand	b539157504	intel/vec4: Drop all of the 64-bit varying code Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-31 18:14:09 -05:00
Jason Ekstrand	d03ec807a4	intel/fs: Drop all of the 64-bit varying code Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-31 18:14:09 -05:00
Jason Ekstrand	942c759059	intel: Use NIR to lower 64-bit varying access Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-31 18:14:09 -05:00
Jason Ekstrand	078dcb7ccd	nir/lower_io: Add an option to lower 64-bit varyings Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-31 18:14:09 -05:00
Jorge Natz	a63e82deb5	docs: Update Platforms and Drivers page with more comprehensive information. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-07-31 22:50:43 +00:00
Dave Airlie	7ad6ec80d9	nir: use common deref has indirect code in scratch lowering. This doesn't seem to need its own copy here. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-08-01 08:32:12 +10:00
Eric Engestrom	5d7bcac4e7	nir: remove explicit nir_intrinsic_index_flag values These were left after a rebase and happen to make NIR_INTRINSIC_SWIZZLE_MASK == NIR_INTRINSIC_SRC_ACCESS, which is how it was noticed. Fixes: `6f20643b47` ("nir: Allow qualifiers on copy_deref and image instructions") Cc: Connor Abbott <cwabbott0@gmail.com> Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-07-31 23:28:20 +01:00
Yevhenii Kolesnikov	830a8e6c47	state_tracker: Free Labels for query and transform_feedback Memory leaks were observed on iris with GL_KHR_debug. Signed-off-by: Yevhenii Kolesnikov <yevhenii.kolesnikov@globallogic.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-07-31 22:16:42 +00:00
Kenneth Graunke	b61f17d362	iris: Skip emitting 3DSTATE_INDEX_BUFFER if possible We were emitting 3DSTATE_INDEX_BUFFER on every indexed draw, even if back-to-back draws referred to the same index buffer. This improves drawoverhead scores in the DrawElements cases by about 10%, by giving us even more minimal batches.	2019-07-31 15:14:10 -07:00
Mike Blumenkrantz	8af1990ad7	st/dri: simplify dri_get_egl_image by reusing dri2_format_table this makes dri2_get_mapping_by_fourcc accessible from dri_helpers.h and does a direct lookup on the fourcc id to match the pipe format v2 (Ken): Allow map to be NULL, use img->texture->format. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-07-31 15:11:15 -07:00
Erico Nunes	82bf5a8aac	lima: enable lower_bitops in ppir The mali pp doesn't support integers and some nir_algebraic optimizations may result in ops that are not easily lowerable to floats, so disable optimizations resulting in bitops. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca>	2019-07-31 23:06:26 +02:00
Erico Nunes	b3676a6548	nir/algebraic: rename lower_bitshift to lower_bitops Optimizations that insert bitshift or bitwise operations should not be applied on GPUs that don't support integer operations. The .lower_bitshift could be used to control the bitshift related ones, but there was also one bitwise optimization uncovered. Since only lima and freedreno use this option and the use case is that no bit operations are wanted, let's rename it to .lower_bitops and use it to control all bitops related optimizations. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca>	2019-07-31 23:06:04 +02:00
Erico Nunes	99c956fb47	lima/ppir: lower fdot in nir_opt_algebraic Now that we have fsum in nir, we can move fdot lowering there. This helps reduce ppir complexity and enables the lowered ops to be part of other nir optimizations in the optimization loop. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com>	2019-07-31 21:35:58 +02:00
Erico Nunes	4a407df682	nir/algebraic: add new fsum ops and fdot lowering The Mali400 pp doesn't implement fdot but has fsum3 and fsum4, which can be used to optimize fdot lowering. fsum2 is not implemented and can be further lowered to an add with the vector components. Currently lima ppir handles this lowering internally, however this happens in a very late stage and requires a big chunk of code compared to a nir_opt_algebraic lowering. By having fsum in nir, we can reduce ppir complexity and enable the lowered ops to be part of other nir optimizations in the optimization loop. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-07-31 21:35:58 +02:00
Erico Nunes	7f8ff686b7	lima/ppir: refactor texture code to simplify scheduler The 'varying fetch' pp instruction deals only with coordinates, and 'texture fetch' deals only with the sampler index. Previously it was not possible to clearly map ppir_op_load_coords and ppir_op_load_texture to pp instructions as the source coordinates were kept in the ppir_op_load_texture node, making this harder to maintain. The refactor is made with the attempt to clearly map ppir_op_load_coords to the 'varying fetch' and ppir_op_load_texture to the 'texture fetch'. The coordinates are still temporarily kept in the ppir_op_load_texture node as nir has both sampler and coordinates in a single instruction and it is only possible to output one ppir node during emit. But now after lowering, the sources are transferred to the (always) created ppir_op_load_coords node, and it should be possible to directly map them to their pp instructions from there onwards. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com>	2019-07-31 21:22:41 +02:00
Erico Nunes	d2901de09e	lima/ppir: lower texture projection Lower texture projection in ppir using nir_lower_tex and nir_lower_tex. This will insert a mul with the coordinate division before the load varying. Even though the lima pp supports projection in the load varying instruction while loading the coordinates (from a register or a varying), it requires that both the coordinates and projector be components in a single register. nir currently handles them in separate ssa, and attempting to merge them manually may end up in worse code than just doing the coordinate division manually. So for now let's just lower the projection to add support for it in lima. In the future, an optimization pass may be implemented in lima to ensure that both coords and projector come in the same register, then this lowering may be disabled and in this case lima may use the built-in projection and save the mul instruction from lowering. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com>	2019-07-31 21:22:41 +02:00
Vinson Lee	412e1b51fe	scons: Fix random_r check. Fixes: `597bddad47` ("scons: Test for random_r()") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-31 18:23:55 +00:00
Kenneth Graunke	3f9012839e	Revert "st/dri: simplify dri_get_egl_image by reusing dri2_format_table" This reverts commit `c47af8b95f`. It causes dEQP-EGL regressions. (I think there is an easy fix, but we'll have it go through review again.)	2019-07-31 11:06:32 -07:00
Alyssa Rosenzweig	91c4acedaf	pan/midgard: Don't special case inline_constant Another constant source of bugs. Ain't that special. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-07-31 10:59:19 -07:00
Alyssa Rosenzweig	29416a8599	pan/midgard: De-special-case branching It's not that special. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-07-31 10:59:18 -07:00
Alyssa Rosenzweig	3e47a1181b	panfrost: Add MALI_SAMP_NORM_COORDS flag Corresponds to the normalized coordinates? flag on images in OpenCL and evidently also shows up in GL, so let's wire it in. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-07-31 10:56:11 -07:00
Alyssa Rosenzweig	cf6cad3922	panfrost: Simplify filter_mode definition It's just a bit field containing some flags; there's no need for all the macro magic. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-07-31 10:56:11 -07:00
Alyssa Rosenzweig	160795429d	pan/midgard: Shrink "compute FBD" We still don't know what it is, but from a newer trace we now know it's half the size we thought it was. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-07-31 10:56:11 -07:00
Alyssa Rosenzweig	194b49ee28	panfrost: Flip texture/sampler fields We had them backwards in both the command stream and the Midgard stack. In OpenGL ES 2.0, they're always the same, but in Vulkan/later-GL/CL they diverge so we can fix this. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-07-31 10:56:11 -07:00
Alyssa Rosenzweig	a692126c93	panfrost: Add MALI_ATTR_IMAGE value Images are implemented (in part) as special attributes, so include support for decoding this. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-07-31 10:56:11 -07:00
Mike Blumenkrantz	c47af8b95f	st/dri: simplify dri_get_egl_image by reusing dri2_format_table this makes dri2_get_mapping_by_fourcc accessible from dri_helpers.h and does a direct lookup on the fourcc id to match the pipe format Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-07-31 09:50:06 -07:00
Mike Blumenkrantz	7404833c2e	gallium: add handling for YUV planar surfaces st/dri: this adds a table (similar to the one in i965) which provides mappings for turning various planar formats into multiple sampler views. whereas only NV12 and IYUV were supported, now many more formats are supported here: * P0XX * YUV4XX * YVU4XX * AYUV * XYUV * YUYV * UYVY the table is used directly to handle image creation, simplifying a lot of code and resolving related TODO/FIXME items where workarounds were previously in place to manage NV12 and IYUV formats exclusively st/mesa: the changes here relate to setting up samplers for the planar formats. this requires: * checking for driver support for all the sampler formats * creating the samplers with the corresponding formats and swizzling * running nir_lower_tex with the appropriate options to trigger the lowering for each plane->sampler fixes kwg/mesa#36 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-07-31 09:50:06 -07:00
Mike Blumenkrantz	338a29b08f	gallium: add AYUV and XYUV formats this only adds the PIPE_FORMAT members, not any direct handling for them Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-07-31 09:50:06 -07:00
Alyssa Rosenzweig	7f75b2b5af	pan/midgard: Simplify discard logic The "branch offset" is, in fact, ignored. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-07-31 09:39:16 -07:00
Alyssa Rosenzweig	27524d1462	pan/midgard: Add units for more instructions For everything but freduce, we have some sense of what units the instruction takes. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-07-31 09:39:16 -07:00
Alyssa Rosenzweig	64235b1ecc	pan/midgard: Fix ball/bany opcode table These were seriously messed up beyond all recognition. How we're passing shaders.random.* is a mystery. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-07-31 09:39:16 -07:00
Alyssa Rosenzweig	13ee87c8b9	pan/midgard: Document branch combination LUT This took way longer to figure out than it should have. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-07-31 09:39:16 -07:00
Kenneth Graunke	2037478702	st/mesa: Skip scissor rect updates when scissor is entirely disabled. If any scissor rectangles are enabled, then we need to set proper scissor rectangles for all viewports. But if the scissor test is entirely disabled, then we can skip updating any scissor rectangles. Without this step, we were updating the scissor rectangles based on the current framebuffer size. So if an app rendered to a variety of render targets at different sizes, with scissor test disabled each time, we'd still be continually updating the scissor rectangles, even though it's not necessary. In Civilization VI, this drops us from 310-350 set_scissor_state calls per frame to 0, as it doesn't appear to use scissor testing. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-07-31 08:33:50 -07:00
Emil Velikov	72b97ad9b2	egl/drm: ensure the backing gbm is set before using it Currently, if we error out before gbm_dri is set (say due to a different name of the backing GBM implementation, or otherwise) the tear down will trigger a NULL ptr deref and crash out. Move the gbm_dri initialization as early as possible. v2: Drop check in dri2_teardown_drm (Eric) Reported-by: Christian Gmeiner <christian.gmeiner@gmail.com> Cc: Christian Gmeiner <christian.gmeiner@gmail.com> Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-07-31 14:18:12 +01:00
Eric Engestrom	4bf7e7b170	docs: update required meson version Fixes: `f7b6a8d12f` ("meson: bump required version to 0.46") Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-07-31 11:50:39 +01:00
Samuel Pitoiset	c66021069e	radv/gfx10: implement a GE bug workaround Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-31 12:14:29 +02:00
Samuel Pitoiset	9a3fc7b6fa	radv/gfx10: remove an obsolete VGT_REUSE_OFF workaround Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-31 12:14:29 +02:00
Samuel Pitoiset	bb8f25233a	radv/gfx10: disable LATE_ALLOC_GS on Navi14 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-31 12:14:29 +02:00
Samuel Pitoiset	e041a74588	radv/gfx10: implement a bug workaround for GE_PC_ALLOC Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-31 12:14:29 +02:00
Samuel Pitoiset	0e1724af61	radv/gfx10: implement a bug workaround for NGG -> legacy transitions Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-31 12:14:29 +02:00
Samuel Pitoiset	29cca5f381	radv: skip draw calls with 0-sized index buffers Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-31 12:14:29 +02:00
Eric Engestrom	fed6aa2fec	autotools: delete leftover script wrapper Randomly came across this file, which was likely only used by autotools to pass arguments to the test. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-07-31 10:16:30 +01:00
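
Note on commit 4a407df682 (nir/algebraic: add new fsum ops and fdot lowering): the message describes replacing a dot product with a component-wise multiply followed by a horizontal sum, and lowering the unsupported two-component sum to a plain add. The following is only a rough Python sketch of that arithmetic, not the NIR rule itself; the names `fsum` and `fdot_lowered` are hypothetical stand-ins chosen for illustration.

```python
def fsum(components):
    """Horizontal add of all components (stand-in for an fsum3/fsum4 op)."""
    total = 0.0
    for value in components:
        total += value
    return total


def fdot_lowered(a, b):
    """Dot product expressed as multiply + horizontal sum.

    Illustrative only: mirrors the idea in the commit message,
    not the actual nir_opt_algebraic pattern.
    """
    if len(a) != len(b):
        raise ValueError("vectors must have the same number of components")
    products = [x * y for x, y in zip(a, b)]
    if len(products) == 2:
        # The message says fsum2 is not implemented on the hardware,
        # so the two-component case becomes a single add.
        return products[0] + products[1]
    return fsum(products)
```

For example, `fdot_lowered([1.0, 2.0, 3.0], [4.0, 5.0, 6.0])` multiplies component-wise to `[4.0, 10.0, 18.0]` and then sums the three products.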

113961 Commits

All Branches