KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Tim Rowley	04ea03d99d	swr/rast: Fix indentation Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-10-19 13:10:55 -05:00
Tim Rowley	62e2d657c8	swr/rast: Miscellaneous viewport array code changes Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-10-19 13:10:55 -05:00
Tim Rowley	ed1db803fa	swr/rast: Minor changes for os-x Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-10-19 13:10:55 -05:00
Kenneth Graunke	82144b7392	i965: Don't disable aux buffers for non-overlapping miplevels. Meta's GenerateMipmap implementation binds the same image for both sampling and rendering - but it samples from one miplevel while rendering the next. This is a false self-dependency, and there's no need to disable auxiliary buffers in this case. In fact, we really want to leave it enabled so the new miplevels gain color compression. Thankfully, the texture object's _MaxLevel is always one shy of the miplevel being rendered. So we can simply check if irb->mt_level is overlaps with the texture's defined levels. If not, there's no self- dependency and we can leave the auxiliary buffers enabled. Fixes a performance regression in GFXBench4 Car Chase, which apparently calls glGenerateMipmap() on every frame. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=103247 Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by; Jason Ekstrand <jason@jlekstrand.net>	2017-10-19 11:10:00 -07:00
Kenneth Graunke	fa6ca6991b	i965: Remove the intel_miptree_prepare_fb_fetch wrapper. Now that intel_miptree_prepare_texture takes levels and layers, there's not much use in this anymore. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by; Jason Ekstrand <jason@jlekstrand.net>	2017-10-19 11:10:00 -07:00
Kenneth Graunke	e208d7f874	i965: Only resolve texture levels/layers that are accessed. This should avoid unnecessary resolves when working with texture views. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by; Jason Ekstrand <jason@jlekstrand.net>	2017-10-19 11:10:00 -07:00
Kenneth Graunke	0954ce1000	i965: Make intel_miptree_prepare_texture() take level/layer arguments. This effectively exports intel_miptree_prepare_texture_slices() as intel_miptree_prepare_texture(). The hope is to avoid resolves for when using texture views that access a subset of the levels/layers. For now, we pass the same arguments to separate the mechanical change from the one that actually modifies our behavior. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by; Jason Ekstrand <jason@jlekstrand.net>	2017-10-19 11:10:00 -07:00
Tim Rowley	33bdbc1db4	gallium: add more exceptions to tgsi_util_get_inst_usage_mask A number of double/int64 operations don't have matching read and write usage masks, which the fallthrough case of tgsi_util_get_inst_usage_mask assumes for componentwise tagged instructions. No regressions in llvmpipe piglit; fixes a large number of swr regressions. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-10-19 12:49:32 -05:00
Kenneth Graunke	113a6a639f	isl: Fix width check in isl_gen7_choose_msaa_layout. The restriction is supposed to apply if the width field is >= 8192, meaning the actual width value is >= 8193. The code also incorrectly used == for some reason. Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-10-19 10:21:45 -07:00
Kenneth Graunke	68f69ebdcc	i965: Use is_scheduling_barrier instead of schedule_node::is_barrier. Commit `a73116ecc6` tried to make add_barrier_deps() walk to the next barrier, and stop. To accomplish that, it added an is_barrier flag. Unfortunately, this only works half of the time. The issue is that add_barrier_deps() walks both backward (to the previous barrier), and forward (to the next barrier). It also sets is_barrier. Assuming that we're processing instructions in forward order, this means that is_barrier will be set for previous instructions, but not future ones. So we'll never see it, and walk further than we need to. dEQP-GLES31.functional.ssbo.layout.random.all_shared_buffer.23 now compiles its shaders in 3.6 seconds instead of 3.3 minutes. Reviewed-by: Matt Turner <mattst88@gmail.com> Tested-by: Pallavi G <pallavi.g@intel.com>	2017-10-19 10:19:20 -07:00
Kenneth Graunke	3d112a7cd4	i965: Move fs_inst::has_side_effects()'s eot check to the parent class. This eliminates a layer of wrapping, and makes a backend_instruction sufficient. The downside is that it exposes 'eot' to the vec4 backend, which it doesn't need, but can basically happily ignore. Reviewed-by: Matt Turner <mattst88@gmail.com> Tested-by: Pallavi G <pallavi.g@intel.com>	2017-10-19 10:19:20 -07:00
Roland Scheidegger	77b8392858	tgsi: fix tgsi_util_get_inst_usage_mask The logic for handling shadow coords was completely broken. Fixes `be3ab867bd`. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=103265 Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-10-19 16:33:39 +02:00
Emil Velikov	a6c55243b9	docs: update calendar, add news item and link release notes for 17.2.3 Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2017-10-19 13:31:59 +01:00
Emil Velikov	d5fdc37263	docs: add sha256 checksums for 17.2.3 Signed-off-by: Emil Velikov <emil.velikov@collabora.com> (cherry picked from commit facc85181883cb514b2b1a8106255be88fd54c6e)	2017-10-19 13:31:59 +01:00
Emil Velikov	b1605550a6	docs: add release notes for 17.2.3 Signed-off-by: Emil Velikov <emil.velikov@collabora.com> (cherry picked from commit 28dc4b64f2f75dc0a0a98e2b97f1dd3350f50e2d)	2017-10-19 13:31:59 +01:00
Iago Toral Quiroga	2d87caa279	glsl/linker: produce error when invalid explicit locations are used We only need to add a check to validate output locations here. For inputs with invalid locations we will fail to link when we can't find a matching output in the same (invalid) location. v2: compute location slots properly depending on shader stage and variable type / direction Fixes: KHR-GL45.enhanced_layouts.varying_location_limit Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-19 11:27:12 +02:00
Iago Toral Quiroga	16631ca30e	i965/sbe: fix active components for SSO programs with over 16 inputs When we have up to 16 FS inputs, the SF unit will reorder our inputs to be consecutive, however, when we have more than 16 we need to to read our inputs from the URB exactly as they have been output from the previous stage. This means that for SSO we have to consider if we have URB padding due to unused input locations. Specifically, this affects gen9 active components programming, since for things to work in scenarios with over 16 inputs that have padded regions we need to ensure that we program active components for the padded regions too. If we don't do this the hardware won't read the URB properly for inputs located after padded regions. Found empirically. Fixes (these also require a patch in CTS): KHR-GL45.enhanced_layouts.varying_locations KHR-GL45.enhanced_layouts.varying_array_locations Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-10-19 08:31:42 +02:00
Chris Wilson	b7c655f700	i965: Do not log a perf warning when mapping an idle bo We only want to scare the user away from causing a GPU stall for mapping a busy bo. The time taken to instantiate the set of pages for a buffer and their mmapping is unavoidable and flagging idle bo as being busy is "crying wolf". Reported-by: Tvrtko Ursulin <tvrtko.ursulin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-10-19 07:12:39 +01:00
Matt Turner	e9796ebca7	i965: Use a union to bitcast a float ... which does not break C's aliasing rules.	2017-10-18 22:16:46 -07:00
Darren Salt	5767ce7d0d	drirc: Group a few games in the glthread whitelist together. Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2017-10-19 03:28:34 +02:00
Darren Salt	80c20b29d8	drirc: Enable glthread for more games (Saints Row 4 & Gat out of Hell). “Saints Row: Gat out of Hell” benefits from this on slower CPUs in that usage spikes on individual cores are avoided, which in turn makes it harder to hit a bug which causes broken audio and the game to hang on exit. “Saints Row IV” appears to be fine either way, but also exhibits the audio breakage bug: glthread is therefore being enabled on the grounds that it should make it a little harder to hit that bug. Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2017-10-19 03:28:34 +02:00
Samuel Pitoiset	535aa43df0	radv: reset dirty flags after flushing all states Move it to radv_cmd_buffer_flush_state() because if rasterizerDiscardEnable is true, the flags are not cleared. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-18 21:21:48 +02:00
Samuel Pitoiset	966d66f28f	radv: do not re-emit the index buffer for every draw call It can only be changed when CmdBindIndexBuffer() is called or when a secondary buffer is used. Though not always, but let's re-emit the packets in this situation for now. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-18 21:21:43 +02:00
Samuel Pitoiset	e5480be0d1	radv: remove useless mask operation in radv_cs_emit_draw_indexed_packet() This saves few CPU cycles when CmdDrawIndexed() is used a lot. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-18 21:21:30 +02:00
Bas Nieuwenhuizen	fa226e9933	radv: Do not read from the disk cache with RADV_DEBUG=nocache. Otherwise the flag is borderline useless. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-18 20:37:10 +02:00
Alex Smith	2cccc74f56	radv: Set active_stages after getting cached shaders Fixes: `7d45d22fdd` ("radv: switch to using radv_create_shaders()") Signed-off-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-18 20:37:10 +02:00
Alex Smith	f557673237	radv: Don't free NIR shaders if tracing Fixes a crash while generating a hang report. Fixes: `7d45d22fdd` ("radv: switch to using radv_create_shaders()") Signed-off-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-18 20:37:10 +02:00
Marek Olšák	84f3afc2e1	Revert "egl: move alloc & init out of _eglBuiltInDriver{DRI2,Haiku}" This reverts commit `8cb84c8477`. This fixes crashing shader-db/run.	2017-10-18 20:23:42 +02:00
Marek Olšák	2cb9ab53dd	Revert "egl: drop EGL driver `name`" This reverts commit `6414d6bd8d`. This is needed to apply the next revert.	2017-10-18 20:23:24 +02:00
Miklós Máté	f37af5ec8d	st/mesa: set dimension for constants in ATI_fragment_shader This fixes an assertion failure introduced by `30a2f0dfd4`. Fixes: `30a2f0dfd4` ("radeonsi: add an assertion that only Signed-off-by: Miklós Máté <mtmkls@gmail.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2017-10-18 19:36:53 +02:00
Michel Dänzer	8c9e7c9638	st/osmesa: include u_inlines.h for pipe_resource_reference Fixes build failure due to unresolved symbol. Fixes: `7561da367b` "st/mesa: Initialize textures array in st_framebuffer_validate" Trivial.	2017-10-18 18:44:58 +02:00
Michel Dänzer	7561da367b	st/mesa: Initialize textures array in st_framebuffer_validate And just reference pipe_resources to it in the validate callbacks. Avoids pipe_resource leaks when st_framebuffer_validate ends up calling the validate callback multiple times, e.g. when a window is resized. v2: * Use generic stable tag instead of Fixes: tag, since the problem could already happen before the commit referenced in v1 (Thomas Hellstrom) * Use memset to initialize the array on the stack instead of allocating the array with os_calloc. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com>	2017-10-18 18:28:00 +02:00
Eric Engestrom	47273d7312	egl: set UseFallback if LIBGL_ALWAYS_SOFTWARE is set Suggested-by: Emil Velikov <emil.l.velikov@gmail.com> Signed-off-by: Eric Engestrom <eric.engestrom@imgtec.com>	2017-10-18 17:25:41 +01:00
Eric Engestrom	6414d6bd8d	egl: drop EGL driver `name` The "DRI2" name was reported as confusing when printing EGL infos (one user reported thinking DRI3 was not working on his X server), and the only alternative is Haiku, which can only be used on a Haiku machine. The name therefore doesn't add any information that the user wouldn't know already, so let's just drop it. Cc: Kai Wasserbäch <kai@dev.carbon-project.org> Suggested-by: Emil Velikov <emil.l.velikov@gmail.com> Related-to: `b174a1ae72` ("egl: Simplify the "driver" interface") Signed-off-by: Eric Engestrom <eric.engestrom@imgtec.com>	2017-10-18 17:25:41 +01:00
Eric Engestrom	d7e769abec	egl: drop always-false TestOnly option Signed-off-by: Eric Engestrom <eric.engestrom@imgtec.com>	2017-10-18 17:25:41 +01:00
Nicholas Miell	3012885b3f	Fix the xf86vm meson dependency The pkg-config file is called xxf86vm. Signed-off-by: Nicholas Miell <nmiell@gmail.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com>	2017-10-18 17:25:41 +01:00
Eric Engestrom	8cb84c8477	egl: move alloc & init out of _eglBuiltInDriver{DRI2,Haiku} Note: dropping the EGL_BAD_ALLOC in egl_haiku because it's overwritten by the EGL_NOT_INITIALIZED in eglInitialize(). Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-10-18 17:25:41 +01:00
Eric Engestrom	4893673b15	egl_dri2: drop dri2_egl_driver struct Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-10-18 17:25:41 +01:00
Eric Engestrom	7823cfe9fe	egl_dri2: move glFlush out of struct dri2_egl_driver There's no reason to store this there, it doesn't depend on the driver. Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-10-18 17:25:41 +01:00
Roland Scheidegger	3d0deed12a	llvmpipe: handle shader sample mask output This probably isn't all that useful for GL, but there are apis where sample_mask is a valid output even without msaa. Just discard the pixel if the sample_mask doesn't include the bit for sample 0. Reviewed-by: Brian Paul <brianp@vmware.com>	2017-10-18 18:16:44 +02:00
Vinson Lee	c5124fbc74	anv: Fix instance typos. Fix build error. CC vulkan/vulkan_libvulkan_common_la-anv_device.lo In file included from vulkan/anv_device.c:33:0: vulkan/anv_device.c: In function ‘anv_AllocateMemory’: vulkan/anv_device.c:1562:37: error: ‘struct anv_device’ has no member named ‘instace’; did you mean ‘instance’? result = vk_errorf(device->instace, device, ^ vulkan/anv_private.h:317:17: note: in definition of macro ‘vk_errorf’ __vk_errorf(instance, obj, REPORT_OBJECT_TYPE(obj), error,\ ^~~~~~~~ Fixes: `9775894f10` ("anv: Move size check from anv_bo_cache_import() to caller (v2)") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com>	2017-10-18 09:08:08 -07:00
Brian Paul	e17aa6cd9d	mesa: fix trivial typo in _mesa_PixelMapusv() error string Signed-off-by: Brian Paul <brianp@vmware.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=103323	2017-10-18 09:53:00 -06:00
Eric Engestrom	2515eb63f8	meson: move expat dependency where it's needed Suggested-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Eric Engestrom <eric.engestrom@imgtec.com>	2017-10-18 14:27:20 +01:00
Hongxu Jia	05fc62d89f	automake: intel: move expat handling where it's used Linking libvulkan_intel.so can fail, due to unresolved references to libexpat.so. EXPAT_CFLAGS should be moved as well. Signed-off-by: Hongxu Jia <hongxu.jia@windriver.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-10-18 14:27:20 +01:00
Timothy Arceri	e5e9e21e9f	radv: don't create dummy fs when compiling compute stage Fixes: `d1c9f30d7f` "radv: add radv_create_shaders() helper" Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-18 22:47:53 +11:00
Samuel Pitoiset	e6b9abf294	radv: use the dispatch initiator for indirect dispatches Missed that when I allowed waves to be launched out-of-order. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-18 11:22:41 +02:00
Samuel Pitoiset	095e709717	radv: remove XtoY_temps structs Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-18 11:22:39 +02:00
Tapani Pälli	6ef9bea734	anv: Install as Vulkan HAL module in Android.mk build Now that anvil fully implements the Vulkan HAL interface, we can install it as the vendor HAL module at /vendor/lib/hw/vulkan.${board}.so. To do so: - Rename LOCAL_MODULE to vulkan.$(TARGET_BOARD_PLATFORM). - Use LOCAL_PROPRIETARY_MODULE to install under vendor path. Tested by running different Sascha Williams demos on Android-IA. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> [chadv: Extract this hunk from Tapani's patch, and embed it as stand-alone patch in my arc-vulkan series]. Signed-off-by: Chad Versace <chadversary@chromium.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-10-18 00:23:38 -07:00
Chad Versace	053d4c328f	anv: Implement VK_ANDROID_native_buffer (v9) This implementation is correct (afaict), but takes two shortcuts regarding the import/export of Android sync fds. Shortcut 1. When Android calls vkAcquireImageANDROID to import a sync fd into a VkSemaphore or VkFence, the driver instead simply blocks on the sync fd, then puts the VkSemaphore or VkFence into the signalled state. Thanks to implicit sync, this produces correct behavior (with extra latency overhead, perhaps) despite its ugliness. Shortcut 2. When Android calls vkQueueSignalReleaseImageANDROID to export a collection of wait semaphores as a sync fd, the driver instead submits the semaphores to the queue, then returns sync fd -1, which informs the caller that no additional synchronization is needed. Again, thanks to implicit sync, this produces correct behavior (with extra batch submission overhead) despite its ugliness. I chose to take the shortcuts instead of properly importing/exporting the sync fds for two reasons: Reason 1. I've already tested this patch with dEQP and with demos apps. It works. I wanted to get the tested patches into the tree now, and polish the implementation afterwards. Reason 2. I want to run this on a 3.18 kernel (gasp!). In 3.18, i915 supports neither Android's sync_fence, nor upstream's sync_file, nor drm_syncobj. Again, I tested these patches on Android with a 3.18 kernel and they work. I plan to quickly follow-up with patches that remove the shortcuts and properly import/export the sync fds. Non-Testing =========== I did not test at all using the Android.mk buildsystem. I may have broke it. Please test and review that. Testing ======= I tested with 64-bit ARC++ on a Skylake Chromebook and a 3.18 kernel. The following pass (as of patchset v9): - a little spinning cube demo APK - several Sascha demos - dEQP-VK.info.* - dEQP-VK.api.wsi.android.* (except dEQP-VK.api.wsi.android.swapchain..image_usage, because dEQP wants to create swapchains with VK_IMAGE_USAGE_STORAGE_BIT) - dEQP-VK.api.smoke. - dEQP-VK.api.info.instance.* - dEQP-VK.api.info.device.* v2: - Reject VkNativeBufferANDROID if the dma-buf's size is too small for the VkImage. - Stop abusing VkNativeBufferANDROID by passing it to vkAllocateMemory during vkCreateImage. Instead, directly import its dma-buf during vkCreateImage with anv_bo_cache_import(). [for jekstrand] - Rebase onto Tapani's VK_EXT_debug_report changes. - Drop `CPPFLAGS += $(top_srcdir)/include/android`. The dir does not exist. v3: - Delete duplicate #include "anv_private.h". [per Tapani] - Try to fix the Android-IA build in Android.vulkan.mk by following Tapani's example. v4: - Unset EXEC_OBJECT_ASYNC and set EXEC_OBJECT_WRITE on the imported gralloc buffer, just as we do for all other winsys buffers in anv_wsi.c. [found by Tapani] v5: - Really fix the Android-IA build by ensuring that Android.vulkan.mk uses Mesa' vulkan.h and not Android's. Insert -I$(MESA_TOP)/include before -Iframeworks/native/vulkan/include. [for Tapani] - In vkAcquireImageANDROID, submit signal operations to the VkSemaphore and VkFence. [for zhou] v6: - Drop copy-paste duplication in vkGetSwapchainGrallocUsageANDROID(). [found by zhou] - Improve comments in vkGetSwapchainGrallocUsageANDROID(). v7: - Fix vkGetSwapchainGrallocUsageANDROID() to inspect its VkImageUsageFlags parameter. [for tfiga] - This fix regresses dEQP-VK.api.wsi.android.swapchain.*.image_usage because dEQP wants to create swapchains with VK_IMAGE_USAGE_STORAGE_BIT. v8: - Drop unneeded goto in vkAcquireImageANDROID. [for tfiga] v8.1: (minor changes) - Drop errant hunks added by rerere in anv_device.c. - Drop explicit mention of VK_ANDROID_native_buffer in anv_entrypoints_gen.py. [for jekstrand] v9: - Isolate as much Android code as possible, moving it from anv_image.c to anv_android.c. Connect the files with anv_image_from_gralloc(). Remove VkNativeBufferANDROID params from all anv_image.c funcs. [for krh] - Replace some intel_loge() with vk_errorf() in anv_android.c. - Use © in copyright line. [for krh] Reviewed-by: Tapani Pälli <tapani.palli@intel.com> (v5) Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> (v9) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (v9) Cc: zhoucm1 <david1.zhou@amd.com> Cc: Tomasz Figa <tfiga@chromium.org>	2017-10-18 00:23:38 -07:00
Chad Versace	9775894f10	anv: Move size check from anv_bo_cache_import() to caller (v2) This change prepares for VK_ANDROID_native_buffer. When the user imports a gralloc hande into a VkImage using VK_ANDROID_native_buffer, the user provides no size. The driver must infer the size from the internals of the gralloc buffer. The patch is essentially a refactor patch, but it does change behavior in some edge cases, described below. In what follows, the "nominal size" of the bo refers to anv_bo::size, which may not match the bo's "actual size" according to the kernel. Post-patch, the nominal size of the bo returned from anv_bo_cache_import() is always the size of imported dma-buf according to lseek(). Pre-patch, the bo's nominal size was difficult to predict. If the imported dma-buf's gem handle was not resident in the cache, then the bo's nominal size was align(VkMemoryAllocateInfo::allocationSize, 4096). If it was resident, then the bo's nominal size was whatever the cache returned. As a consequence, the first cache insert decided the bo's nominal size, which could be significantly smaller compared to the dma-buf's actual size, as the nominal size was determined by VkMemoryAllocationInfo::allocationSize and not lseek(). I believe this patch cleans up that messy behavior. For an imported or exported VkDeviceMemory, anv_bo::size should now be the true size of the bo, if I correctly understand the problem (which I possibly don't). v2: - Preserve behavior of aligning size to 4096 before checking. [for jekstrand] - Check size with < instead of <=, to match behavior of commit `c0a4f56` "anv: bo_cache: allow importing a BO larger than needed". [for chadv]	2017-10-17 23:46:06 -07:00

... 2 3 4 5 6 ...

96965 Commits All Branches Search

96965 Commits

All Branches