KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Samuel Pitoiset	d8db5986ce	radv: compute the number of subpass attachments correctly Only count color attachments twice if resolves are used, also account for the depth stencil attachment if present. Cc: 18.0 18.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-05-01 22:18:03 +02:00
Dave Airlie	e66f64c285	radv: set fmask_surf_index on fmask surfaces. This is needed for gfx9 and later for all fmask surface index. (Mentioned by Marek on irc) Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-05-02 06:01:42 +10:00
Brian Paul	f298ed93d9	gallium/i915: fix PIPE_CAPF_MIN_CONSERVATIVE_RASTER_DILATE typo Fixes: `fffe5e2d14` ("gallium: add initial support for conservative rasterization") Trivial.	2018-05-01 09:52:22 -06:00
Rhys Perry	07dac3e040	nvc0: add conservative rasterization support Subpixel precision bias, dilation and the post-snap mode are supported on GM200 and newer. The pre-snap mode is supported for triangle primitives on GP100. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2018-04-30 21:13:53 -06:00
Rhys Perry	97f5f399ef	st/mesa: add support for nvidia conservative rasterization extensions Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-04-30 21:13:53 -06:00
Rhys Perry	fffe5e2d14	gallium: add initial support for conservative rasterization Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-04-30 21:13:53 -06:00
Rhys Perry	4580617509	mesa: add support for nvidia conservative rasterization extensions Although the specs are written against compatibility GL 4.3 and allows core profile and GLES2+, it is exposed for GL 1.0+ and GLES1 and GLES2+. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2018-04-30 21:13:53 -06:00
Brian Paul	31ab0427a7	glsl/tests: add GLSL_TYPE_UINT8, GLSL_TYPE_INT8 cases to switch statements To silence warnings about unhandled switch values. Untested otherwise. v2: move the INT/UINT8 cases after the INT/UINT16 cases, per Eric. Reviewed-by: Eric Anholt <eric@anholt.net>	2018-04-30 21:13:53 -06:00
Brian Paul	efec712d51	tgsi: use enums instead of unsigned in ureg code Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-04-30 21:13:53 -06:00
Timothy Arceri	6487e7a30c	nir: move GL specific passes to src/compiler/glsl With this we should have no passes in src/compiler/nir with any dependencies on headers from core GL Mesa. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>	2018-05-01 12:39:33 +10:00
Andres Rodriguez	f56e22e496	radv/winsys: fix leaking resources from bo's imported by fd A bo's ref_count was not being initialized when imported from an fd. Therefore, we would fail to free the resource during VkFreeMemory(). This patch fixes applications like hifi VR in threaded mode, which perform frequent imports/releases of IPC shared memory. Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> CC: 18.0 18.1 <mesa-stable@lists.freedesktop.org>	2018-04-30 18:20:30 -04:00
Scott D Phillips	2a08ae3c7c	i965/tiled_memcpy: ytiled_to_linear a cache line at a time Similar to the transformation applied to linear_to_ytiled, also align each readback from the ytiled source to a cacheline (i.e. transfer a whole cacheline from the source before moving on to the next column). This will allow us to utilize movntqda (_mm_stream_si128) in a subsequent patch to obtain near WB readback performance when accessing the uncached ytiled memory, an order of magnitude improvement. Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-04-30 15:18:36 -07:00
Chris Wilson	682bdaa658	i965: Record mipmap resolver for unmapping When mapping a region of the mipmap_tree, record which complementary method to use to unmap it afterwards. By doing so we can avoid duplicating the decision tree used when mapping and thereby eliminate trivial errors that can be introduced if the two if-chains become out of sync. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk> Reviewed-by: Scott D Phillips <scott.d.phillips@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-04-30 14:06:23 -07:00
Chris Wilson	5367295e1a	i965: Move unmap_depthstencil before map_depthstencil Reorder code to avoid a forward declaration in the next patch. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk> Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2018-04-30 14:06:23 -07:00
Chris Wilson	ab2825c898	i965: Move unmap_etc before map_etc Reorder code to avoid a forward declaration in the next patch. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk> Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2018-04-30 14:06:23 -07:00
Chris Wilson	9e7e88049f	i965: Move unmap_s8 before map_s8 Reorder code to avoid a forward declaration in the next patch. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk> Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2018-04-30 14:06:23 -07:00
Chris Wilson	b3ad6f5ca6	i965: Move unmap_movntdqa before map_movntdqa Reorder code to avoid a forward declaration in the next patch. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk> Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2018-04-30 14:06:23 -07:00
Chris Wilson	f348d07a62	i965: Move unmap_blit before map_blit Reorder code to avoid a forward declaration in the next patch. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk> Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2018-04-30 14:06:23 -07:00
Chris Wilson	359624142d	i965: Move unmap_gtt before map_gtt Reorder code to avoid a forward declaration in the next patch. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2018-04-30 14:06:23 -07:00
Dave Airlie	8d3529872c	ac/nir: expand 64-bit vec3 loads to fix shuffling. If loading 64-bit vec3 values, a 4 component load would be followed by a 2 component load and the resulting shuffle would fail as it requires 2 4 components. This just expands the second results vector out to 4 components. This fixes 100 CTS tests: dEQP-VK.spirv_assembly.type.vec3.64 Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-05-01 05:58:14 +10:00
Kenneth Graunke	bde12f75e1	i965: Don't stomp initial kflags for program cache. We want to flag EXEC_OBJECT_CAPTURE, but we ought to preserve any existing kflags. Today, there are none (as the program cache doesn't support 48-bit addressing), but once we start using softpin, we'll need to preserve EXEC_OBJECT_PINNED. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-04-30 11:34:19 -07:00
Kenneth Graunke	0cc98522f9	i965: Let batchbuffers be placed anywhere in the 48-bit address space. We were trying to mark batch buffers with EXEC_OBJECT_CAPTURE, and accidentally stomped EXEC_OBJECT_SUPPORTS_48B_ADDRESS in the process. There's no reason to restrict batch buffers to the lower 4GB. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-04-30 11:34:19 -07:00
Scott D Phillips	8ffc6ee251	intel: fix check for 48b ppgtt support The previous logic of the supports_48b_addresses wasn't actually checking if i915.ko was running with full_48bit_ppgtt. The ENOENT it was checking for was actually coming from the invalid context id provided in the test execbuffer. There is no path in the kernel driver where the presence of EXEC_OBJECT_SUPPORTS_48B_ADDRESS leads to an error. Instead, check the default context's GTT_SIZE param for a value greater than 4 GiB v2 (Ken): Fix in i965 as well. v3 Check GTT_SIZE instead of HAS_ALIASING_PPGTT (Chris Wilson) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-04-30 11:34:19 -07:00
Leo Liu	1c5f4f4e17	st/omx/enc: fix blit setup for YUV LoadImage The blit here involves scaling since it's copying from I8 format to R8G8 format. Half of source will be filtered out with PIPE_TEX_FILTER_NEAREST instruction, it looks that GPU always uses the second half as source. Currently we use "1" as the start point of x for R, then causing 1 source pixel of U component shift to right. So "-1" should be the start point for U component. Cc: 18.0 18.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-04-30 11:55:36 -04:00
Juan A. Suarez Romero	4d449c94e4	autotools, meson: bump up required VA version Due using a new VP9 config we use, required VA API 0.39 Fixes: `413c5ca372` ("travis: update libva required version") CC: 18.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-04-30 13:59:37 +02:00
Juan A. Suarez Romero	96ed3714fc	docs: update calendar, add news and link release notes to 18.0.2 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2018-04-28 17:01:48 +00:00
Juan A. Suarez Romero	8f1159bf9a	docs: add sha256 checksums for 18.0.2 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit b3eed3ad03fd1eb61474cd0a8a173ad40fb8a876)	2018-04-28 16:58:39 +00:00
Juan A. Suarez Romero	14f85260de	docs: add release notes for 18.0.2 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit d38da7bd2d4387635fac8bc7f45e64f50dc43c43)	2018-04-28 16:58:36 +00:00
Marek Olšák	8b7358fe43	radeonsi: increase the number of compiler threads depending on the CPU The compiler queue was limited to 3 threads, so shader-db running on a 16-thread CPU would have a bottleneck on the 3-thread queue. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Tested-by: Benedikt Schemmer <ben at besd.de> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 17:56:04 -04:00
Marek Olšák	3f0eaaf6d9	radeonsi: avoid a crash in gallivm_dispose_target_library_info Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Tested-by: Benedikt Schemmer <ben at besd.de> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 17:56:04 -04:00
Marek Olšák	e75fc8d033	radeonsi: move data_layout into si_compiler Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Tested-by: Benedikt Schemmer <ben at besd.de> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 17:56:04 -04:00
Marek Olšák	797d673c9a	radeonsi: move passmgr into si_compiler Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Tested-by: Benedikt Schemmer <ben at besd.de> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 17:56:04 -04:00
Marek Olšák	c1823ff661	radeonsi: move target_library_info into si_compiler Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Tested-by: Benedikt Schemmer <ben at besd.de> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 17:56:04 -04:00
Marek Olšák	5a94f15aa7	radeonsi: use si_compiler::triple in si_llvm_optimize_module Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Tested-by: Benedikt Schemmer <ben at besd.de> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 17:56:04 -04:00
Marek Olšák	43f0a10051	radeonsi: add triple into si_compiler Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Tested-by: Benedikt Schemmer <ben at besd.de> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 17:56:04 -04:00
Marek Olšák	87eb597758	radeonsi: add struct si_compiler containing LLVMTargetMachineRef It will contain more variables. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Tested-by: Benedikt Schemmer <ben at besd.de> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 17:56:04 -04:00
Marek Olšák	788d66553a	radeonsi: rename r600_texture::resource to buffer r600_resource could be renamed to si_buffer. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 17:56:04 -04:00
Marek Olšák	6fadfc01c6	radeonsi: use r600_resource() typecast helper Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 17:56:04 -04:00
Marek Olšák	3160ee876a	radeonsi: remove unused atom parameter from si_atom::emit Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 17:56:04 -04:00
Marek Olšák	de344209ad	radeonsi: inline 2 trivial state structures Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 17:56:04 -04:00
Marek Olšák	e395475096	radeonsi: remove function si_init_atom Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 17:56:04 -04:00
Marek Olšák	ccebcba893	radeonsi: remove si_atom::id Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 17:56:04 -04:00
Marek Olšák	639b673fc3	radeonsi: don't use an indirect table for state atoms Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 17:56:04 -04:00
Marek Olšák	9054799b39	radeonsi: rename r600_atom -> si_atom Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 17:56:04 -04:00
Marek Olšák	a8abbbb172	radeonsi: remove r600_pipe_common.h Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 17:56:04 -04:00
Marek Olšák	6d19120da8	radeonsi/gfx9: workaround for INTERP with indirect indexing and clean up the conditions. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Cc: 18.0 18.1 <mesa-stable@lists.freedesktop.org>	2018-04-27 17:56:04 -04:00
Marek Olšák	2d69b485f5	radeonsi: rewrite DCC format compatibility checking code It might be better to use a slow compressed clear when clearing to 1. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 17:56:04 -04:00
Marek Olšák	c732d069b3	radeonsi: implement DCC fast clear swizzle constraints more accurately Reduce swizzle constraints to the ALPHA_IS_ON_MSB constraint and the clear value of 1. This significantly changes the DCC fast clear code, and fixes fast clear for RGB formats without alpha. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 17:56:04 -04:00
Marek Olšák	9ef423f720	radeonsi: rename variables and document stuff around DCC fast clear Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 17:56:04 -04:00
Marek Olšák	1cc2e0cc6b	radeonsi: fully enable 2x DCC MSAA for array and non-array textures The clear code is exactly the same as for 1 sample buffers - just clear the whole thing. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2018-04-27 17:56:04 -04:00

1 2 3 4 5 ...

101972 Commits All Branches Search

101972 Commits

All Branches