KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Dave Airlie	271fa3a684	intel/vec4/gs: reset nr_pull_param if DUAL_INSTANCED compile failed. If dual object compile fails (as seems to happen with virgl a fair bit, and does piglit even have any tests for it?), we end up not restarting the pull params, so we call vec4_visitor::move_uniform_array_access_to_pull_constant a second time and it runs over the ends of the alloc. Fixes: tests/spec/glsl-1.50/execution/geometry/max-input-components.shader_test running inside virgl on ivybridge. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-08-03 16:54:08 +10:00
Thomas Hellstrom	d5ba75f888	st/dri2 Plumb the flush_swapbuffer functionality through to dri3 Implement the state tracker manager drawable interface flush_swapbuffer method by plumbing it through to dri3 if available. Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Sinclair Yeh <syeh@vmware.com>	2017-08-03 08:01:31 +02:00
Thomas Hellstrom	91c93dec98	gallium/st: Add a method to flush outstanding swapbuffers Add a state tracker interface method to flush outstanding swapbuffers, and add a call to it from the mesa state tracker during glFinish(). This doesn't strictly mean the outstanding swapbuffers have actually finished executing but is sufficient for glFinish() to be able to be used as a replacement for glXWaitGL(). Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Sinclair Yeh <syeh@vmware.com>	2017-08-03 08:01:25 +02:00
Thomas Hellstrom	ad5136ac82	glx/dri3: Implement the flush_swapbuffers method Provide a dri3 implementation for the image loader extension method. Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Sinclair Yeh <syeh@vmware.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-08-03 08:00:25 +02:00
Thomas Hellstrom	ae93d534a8	dri: Add a flushSwapBuffers method to the image loader extension This method may be used by dri drivers to make sure all outstanding buffer swaps have been flushed to hardware. Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Sinclair Yeh <syeh@vmware.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-08-03 07:57:27 +02:00
Timothy Arceri	4e4042df6b	gallium: introduce PIPE_CAP_MEMOBJ This can be used to guard support for EXT_memory_object and related extensions. v2: update gallium docs v3 (Timothy Arceri): - add cap to nv50 Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-08-03 13:57:16 +10:00
Chris Wilson	fb63c43fd1	i965/blit: Remember to include miptree buffer offset in relocs Remember to add the offset to the start of the buffer in the relocation or else we write 0xff into random bytes elsewhere. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: mesa-stable@lists.freedesktop.org	2017-08-02 18:06:35 -07:00
Matt Turner	858f554078	i965: Fix indentation	2017-08-02 16:49:32 -07:00
Bas Nieuwenhuizen	c9d4b571ad	radv: Add suballocation for shaders. This reduces the number of BOs that we need for the BO lists during a submission. Currently uses a fairly simple linear search for finding free space, that could eventually be improved to a binary tree, which with some per-node info could make a check for space O(1) and finding it O(log n), in the number of buffers in that slab. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-08-03 00:45:13 +02:00
Jordan Justen	fe3d2559d9	docs: Add Vulkan to features.txt To get the extension list: $ git grep -hE "extension name=\"VK_KHR" src/vulkan/registry/vk.xml \| \ grep -v disabled \| awk '{print $2}' \| sed -E 's/(name=)?"//g' \| sort To find anv(il) and radv supported extensions: $ git grep -hE "'VK_([A-Z]+)_[a-z]" src/intel/ $ git grep -hE "'VK_([A-Z]+)_[a-z]" src/amd/ v2: * Add radv to Vulkan 1.0 list (Bas) * 'started' => 'in progress' * Drop KHX and EXT extensions (Jason) Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-08-02 14:49:47 -07:00
Kenneth Graunke	ebd2fd6ef3	i965: Set "Subslice Hashing Mode" to 16x16 on Apollolake. As of 4.11, the kernel isn't bothering to set the subslice hashing mode on Apollolake, leaving it at the default of 8x8. (It initializes it to 16x4 on most platforms.) Performance data for GPUTest Triangle on Apollolake at 1024x640: X-tiled RT: ----------- 8x8 -> 16x4: 2.4325% +/- 0.383683% (n=107) 8x8 -> 8x4: -3.75105% +/- 0.592491% (n=40) 8x8 -> 16x16: 6.17238% +/- 0.67157% (n=30) Y-tiled RT: ----------- 8x8 -> 16x4: 1.30307% +/- 0.297292% (n=205) 8x8 -> 8x4: -0.769282% +/- 0.729557% (n=35) 8x8 -> 16x16: 3.00254% +/- 0.715503% (n=40) 8x MSAA RT (INTEL_FORCE_MSAA=8): -------------------------------- 8x8 -> 16x4: 1.38889% +/- 0.93729% (n=7) 8x8 -> 8x4: -2.10643% +/- 1.15153% (n=3) 8x8 -> 16x16: 3.87183% +/- 1.08851% (n=5) Based on this, we choose 16x16 for Apollolake. Skylake GT2 with X-tiled buffers appears to be a toss-up between 16x4 and 16x16, and with Y-tiled buffers it doesn't seem to really matter. So we'll leave Skylake alone for now. The hashing mode doesn't seem to make a measurable impact on more complex benchmarks. Acked-by: Matt Turner <mattst88@gmail.com>	2017-08-02 13:31:56 -07:00
Dave Airlie	a60c584575	mesa/dri: drop unneeded mm.h include This isn't used in any of these drivers. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-08-03 06:19:45 +10:00
Dave Airlie	9e922bd78c	r300: drop u_mm.h include. This is not used in any of these files. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-08-03 06:19:42 +10:00
Emil Velikov	c9ec28b1c0	util: use cannonical form of ARRAY_SIZE Namely sizeof(foo)/sizeof((foo)[0]) Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com>	2017-08-02 20:43:33 +01:00
Emil Velikov	df83213702	i965: simplify intel_image_format_lookup() Drop the local variable and return directly. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-08-02 20:42:21 +01:00
Emil Velikov	69fa9e91cb	i965: annotate struct intel_image_format as const Already used as such througout the code. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-08-02 20:42:19 +01:00
Emil Velikov	31a6750988	st/dri: NULL check before deref DRI loader .getCapability One could have vX+1 which introduces another entrypoint without implementing older ones. v2: Rebase, while keeping loaderPrivate Fixes: `1bf703e4ea` ("dri_interface,egl,gallium: only expose RGBA visuals on Android") Cc: 17.2 <mesa-stable@lists.freedesktop.org> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-08-02 20:42:19 +01:00
Eric Engestrom	dd9eb8db13	egl: check the correct function pointer `.swap_interval` != `.SwapInterval`... Fixes: `991ec1b81a` "egl: make platform's SwapInterval() optional" Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102015 Cc: Cedric Sodhi <manday@openmail.cc> Signed-off-by: Eric Engestrom <eric.engestrom@imgtec.com> Tested-by: Cedric Sodhi <manday@openmail.cc>	2017-08-02 18:03:47 +01:00
Kenneth Graunke	595a47b829	i965: Delete pitch alignment assertion in get_blit_intratile_offset_el. The cacheline alignment restriction is on the base address; the pitch can be anything. Fixes assertion failures when using primus (say, on glxgears, which creates a 300x300 linear BGRX surface with a pitch of 1200): intel_blit.c:190: get_blit_intratile_offset_el: Assertion `mt->surf.row_pitch % 64 == 0' failed. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk>	2017-08-02 10:01:34 -07:00
Tim Rowley	7cd50b9e47	swr/rast: fix core / knights split of AVX512 intrinsics Move AVX512BW specific intrinics to be Core-only. Move some AVX512F intrinsics back to common implementation file. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-08-02 11:39:33 -05:00
Tim Rowley	c8fe4c13b2	swr/rast: simplify knob default value setup Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-08-02 11:39:33 -05:00
Tim Rowley	844be91e70	swr/rast: split gen_knobs templates into .h/.cpp Switch to a 1:1 mapping template:generated for future maintenance. Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-08-02 11:39:33 -05:00
Tim Rowley	4c5b4f3f78	swr/rast: gen_knobs template code style Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-08-02 11:39:33 -05:00
Tim Rowley	fb3e50a351	swr/rast: switch gen_knobs.cpp license Unintentionally added with an apache2 license; relicense to match the rest of the tree. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-08-02 11:39:33 -05:00
Tim Rowley	e4a6ae06cf	swr/rast: fix scons gen_knobs.h dependency Copy/paste error was duplicating a gen_knobs.cpp rule. Fixes: `5079c277b5` ("swr: [scons] Fix windows build") Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-08-02 11:39:33 -05:00
Tim Rowley	08e3c36955	swr/rast: constify swr rasterizer Add "const" as appropriate in method/function signatures. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-08-02 11:39:33 -05:00
Tim Rowley	a3f97ff28b	swr/rast: SIMD16 shaders - widen fetch and vertex shaders Work in progress, disabled by default. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-08-02 11:39:33 -05:00
Tim Rowley	39ed8e297c	swr/rast: vmask() implementations for KNL Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-08-02 11:39:33 -05:00
Tim Rowley	c18d91ca9a	swr/rast: rename frontend pVertexStore Rename to reflect global nature. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-08-02 11:39:33 -05:00
Tim Rowley	eddbd781af	swr/rast: fix movemask_ps / movemask_pd on AVX512 Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-08-02 11:39:33 -05:00
Tim Rowley	f253798205	swr/rast: stop using MSFT types in platform independent code Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-08-02 11:39:33 -05:00
Tim Rowley	030cfa8eed	swr/rast: enable USE_SIMD16_FRONTEND by default Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-08-02 11:39:33 -05:00
Tim Rowley	f8a572cdf0	swr/rast: disable AVX512 optimization of SSE / AVX code Disable an optimization which implemented sse/avx operations on avx512 using avx512 intrinsics (to avoid switching between lane widths). Compile with SIMD_OPT_128_AVX512 / SIMD_OPT_256_AVX512 defined to enable these optimizations. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-08-02 11:39:33 -05:00
Tim Rowley	d08493f9ce	swr/rast: fix USE_SIMD16_FRONTEND issues Fix problems found when enabling USE_SIMD16_FRONTEND, mostly related to vMask / movemask_ps(pd). Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-08-02 11:39:33 -05:00
Tim Rowley	07062daae9	swr/rast: simdlib better separation of core vs knights avx512 Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-08-02 11:39:33 -05:00
Tim Rowley	e1091b0861	swr/rast: threadID via portable std::this_thread::get_id() Replace use of Win32 GetCurrentThreadId() with portable std::this_thread::get_id(). Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-08-02 11:39:33 -05:00
Jason Ekstrand	95c6a97464	spirv: Fix SpvImageFormatR16ui Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: "17.1 17.2" <mesa-stable@lists.freedesktop.org>	2017-08-02 09:15:01 -07:00
Jason Ekstrand	277644221d	anv: Advertise VK_KHR_relaxed_block_layout There is literally no work for us to do here. It already just works in our driver. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-08-02 09:13:13 -07:00
Jason Ekstrand	600605e3fc	anv: Bump the advertised version to 1.0.57 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-08-02 09:13:13 -07:00
Jason Ekstrand	077b200096	anv: Pull the API version from anv_extensions.py This way everything stays in sync and we only have the one version number. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-08-02 09:13:13 -07:00
Jason Ekstrand	0ab04ba979	anv: Use python to generate ICD json files This is more lines of code but the python is far easier to read than the sed expressions we were using before. Also, this allows us to pull the API version from anv_entrypoints.py so it never gets out-of-sync. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-08-02 09:13:13 -07:00
Jason Ekstrand	7382d8a416	anv: Add MAX_API_VERSION to anv_extensions.py The VkVersion class is probably overkill but it makes it really easy to compare versions in a way that's safe without the caller having to think about patch vs. no patch. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-08-02 09:13:13 -07:00
Jason Ekstrand	a25267654b	anv: Make some bits of anv_extensions module-private This way we can use "from anv_extensions import *" in the entrypoint generator without worrying too much about pollution Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-08-02 09:13:13 -07:00
Eric Engestrom	aab0649487	git_sha1_gen: catch any error the same way Acked-by: Jose Fonseca <jfonseca@vmware.com> Signed-off-by: Eric Engestrom <eric.engestrom@imgtec.com>	2017-08-02 14:57:54 +01:00
Tobias Klausmann	44828e99f9	build: Don't bail on OSError in git_sha1_gen.py When building sandboxed, we may encounter additional errors. Ignore the errors, as we are in a constrained environment. This can be observed when building latest git with OBS. Signed-off-by: Tobias Klausmann <tobias.johannes.klausmann@mni.thm.de> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com>	2017-08-02 14:28:58 +01:00
Nicolai Hähnle	e749995326	st/mesa: replace st_shader_stage_to_ptarget Use pipe_shader_type_from_mesa instead. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-08-02 14:18:52 +02:00
Samuel Pitoiset	56e3b8b9e6	mesa: add GLSL 4.60 to shading_language_version() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-08-02 13:36:43 +02:00
Samuel Pitoiset	c245502918	mesa: add always-false enable for GL 4.6 I believe this should be enough for now. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-08-02 13:36:41 +02:00
Samuel Pitoiset	1f4ceb8be1	glsl: recognize GLSL 4.60 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-08-02 13:36:39 +02:00
Thomas Hellstrom	185ef06fd2	dri3: Wait for all pending swapbuffers to be scheduled before touching the front This implements a wait for glXWaitGL, glXCopySubBuffer, dri flush_front and creation of fake front until all pending SwapBuffers have been committed to hardware. Among other things this fixes piglit glx-copy-sub-buffers on dri3. Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Sinclair Yeh <syeh@vmware.com> Reviewed-by: Eric Anholt <eric@anholt.net> Cc: <mesa-stable@lists.freedesktop.org>	2017-08-02 13:29:20 +02:00

1 2 3 4 5 ...

94776 Commits All Branches Search

94776 Commits

All Branches