KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Samuel Pitoiset	becc87b84a	glsl: make sampler/image types as 64-bit The ARB_bindless_texture spec says: "Samplers are represented using 64-bit integer handles." and, "Images are represented using 64-bit integer handles." It seems simpler to always consider sampler and image types as 64-bit unsigned integer. This introduces a temporary workaround in _mesa_get_uniform() because at this point no flag are used to distinguish between bound and bindless samplers. This is going to be removed in a separate series. This avoids breaking arb_shader_image_load_store-state. v3: - update the comment slightly Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-05-06 16:40:19 +02:00
Samuel Pitoiset	042eee2067	glsl: add ARB_bindless_texture enable This also adds the extension to the standalone GLSL compiler. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-05-06 16:40:19 +02:00
Samuel Pitoiset	b08a9bf791	mesa: add ARB_bindless_texture to the extensions list This is required for the following GLSL bits. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-05-06 16:40:19 +02:00
Fredrik Höglund	5ff4858111	radv/meta: fix restoring a push descriptor set radv_bind_descriptor_set cannot be used to bind a push descriptor set since a push descriptor set does not have a buffer list. However, there is no need to add the buffers again when restoring a set, so this fix is also an optimization. Cc: "17.1" <mesa-stable@lists.freedesktop.org> Signed-off-by: Fredrik Höglund <fredrik@kde.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-05-06 01:46:18 +02:00
Nicolas Boichat	f6ac3d0db6	configure.ac: Also match -androideabi tuple On ARM Android platforms, the host_os tuple should be linux-androideabi, so let's match both -android and -androideabi (or any other -android* tuple) to determine if we should do an Android build. Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-05-05 15:39:38 -07:00
Jason Ekstrand	e05e3e07ab	anv/allocator: Only write to _vg_ptr if we have valgrind This fixes the build when not building against valgrind headers. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100945 Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-05-05 12:49:51 -07:00
Daniel Stone	d4342b1398	i915: Fix build break with empty unreachable() Actually put something in unreachable(), so as not to break the build on a Friday evening. Signed-off-by: Daniel Stone <daniels@collabora.com> Reported-by: Mark Janes <mark.a.janes@intel.com>	2017-05-05 18:24:44 +01:00
Marek Olšák	ee5908396e	radeonsi: apply the tess+GS hang workaround to Polaris12 as well Cc: 17.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-05-05 18:55:03 +02:00
Daniel Stone	8b8af19065	i965: Set modifier for imported and duplicated images When a buffer is being created from FD or GEM flink import, the current API makes no provision for passing modifier information along with this. Set the modifier for such images to DRM_FORMAT_MOD_INVALID. Also preserve the modifier when duplicating an image, as will be done by GBM when importing from a wl_buffer. This doubly tripped up Wayland, as the images would first have been created (as wl_buffers) with a 0 modifier, and then lost what modifier they would've had when being duplicated into gbm_bos. Fixes: `d78a36ea62` ("i965/dri: Handle the linear fb modifier") Signed-off-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-05-05 17:34:10 +01:00
Daniel Stone	467332a0ab	i965: Use helper function for modifier -> tiling Use a helper function and struct to convert between a modifier and tiling mode, so we can use it later for a tiling -> modifier lookup. Signed-off-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-05-05 17:34:10 +01:00
Samuel Pitoiset	485ece83ac	radeonsi: fix build with GCC 4.8 Fixes: `7088b655e8` ("radeonsi: constify a bunch of the perfcounter structs.") Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100937 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-05-05 18:29:30 +02:00
Samuel Pitoiset	92ab06e782	st/glsl_to_tgsi: fix renumber_registers() in presence of dead code The TGSI DCE pass doesn't eliminate dead assignments like MOV TEMP[0], TEMP[1] in presence of loops because it assumes that the visitor doesn't emit dead code. This assumption is actually wrong and this situation happens. However, it appears that the merge_registers() pass accidentally takes care of this for some weird reasons. But since this pass has been disabled for RadeonSI and Nouveau, the renumber_registers() pass which is called after, can't do its job correctly. This is because it assumes that no dead code is present. But if there is still a dead assignment, it might re-use the TEMP register id incorrectly and emits wrong code. This patches fixes the issue by recording writes instead of reads, and this has the advantage to be faster. This should fix Unigine Heaven on RadeonSI and Nouveau. shader-db results with RadeonSI: 47109 shaders in 29632 tests Totals: SGPRS: 1923308 -> 1923316 (0.00 %) VGPRS: 1133843 -> 1133847 (0.00 %) Spilled SGPRs: 2516 -> 2518 (0.08 %) Spilled VGPRs: 65 -> 65 (0.00 %) Private memory VGPRs: 1184 -> 1184 (0.00 %) Scratch size: 1308 -> 1308 (0.00 %) dwords per thread Code Size: 60095968 -> 60096256 (0.00 %) bytes LDS: 1077 -> 1077 (0.00 %) blocks Max Waves: 431889 -> 431889 (0.00 %) Wait states: 0 -> 0 (0.00 %) It's still interesting to disable the merge_registers() pass. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-05-05 09:48:01 +02:00
Iago Toral Quiroga	7761cf6d01	anv/query: handle more cases of 'out of host memory' Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2017-05-05 08:53:33 +02:00
Nicolas Boichat	63b12b0c77	egl/android: Set EGLSurface.Lost to EGL_TRUE/EGL_FALSE Lost is an EGLBoolean, so we should assign it to EGL_TRUE/EGL_FALSE, not true/false. Fixes: `e5eace5868` ("egl/android: Mark surface as lost when dequeueBuffer fails") Fixes: `0212db3504` ("egl/android: Cancel any outstanding ANativeBuffer in surface destructor") Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-05-04 20:09:10 -07:00
Jason Ekstrand	98cd512089	anv/allocator: Improve block pool growing asserts Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2017-05-04 19:07:54 -07:00
Jason Ekstrand	24827fdf50	anv: Drop the instruction pool block size Now that we can allocate states larger than the block size, we no longer need a block size of 1MB which can be rather wasteful. Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2017-05-04 19:07:54 -07:00
Jason Ekstrand	955127db93	anv/allocator: Add support for large stream allocations Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2017-05-04 19:07:54 -07:00
Jason Ekstrand	f82d3d38b6	anv/allocator: Allow state pools to allocate large states Previously, the maximum size of a state that could be allocated from a state pool was a block. However, this has caused us various issues particularly with shaders which are potentially very large. We've also hit issues with render passes with a large number of attachments when we go to allocate the block of surface state. This effectively removes the restriction on the maximum size of a single state. (There's still a limit of 1MB imposed by a fixed-length bucket array.) For states larger than the block size, we just grab a large block off of the block pool rather than sub-allocating. When we go to allocate some chunk of state and the current bucket does not have state, we try to pull a chunk from some larger bucket and split it up. This should improve memory usage if a client occasionally allocates a large block of state. This commit is inspired by some similar work done by Juan A. Suarez Romero <jasuarez@igalia.com>. Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2017-05-04 19:07:54 -07:00
Jason Ekstrand	8c079b566e	anv/allocator: Support pushing multiple blocks onto a free list at once Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2017-05-04 19:07:54 -07:00
Jason Ekstrand	8769fb48fb	anv/allocator: Add helpers for dealing with bucket sizes Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2017-05-04 19:07:54 -07:00
Jason Ekstrand	12043ca696	anv/allocator: Add the capability to allocate blocks of different sizes Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2017-05-04 19:07:54 -07:00
Jason Ekstrand	01170df262	anv/allocator: Rework a comment This commit just fixes up the English a bit and re-flows the comment. Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2017-05-04 19:07:54 -07:00
Jason Ekstrand	bcc5d0defb	anv/allocator: Tweak the block pool growing algorithm The old algorithm worked fine assuming a constant block size. We're about to break that assumption so we need an algorithm that's a bit more robust against suddenly growing by a huge amount compared to the currently allocated quantity of memory. Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2017-05-04 19:07:54 -07:00
Jason Ekstrand	d3ed72e2c2	anv/allocator: Embed the block_pool in the state_pool Now that the state stream is allocating off of the state pool, there's no reason why we need the block pool to be separate. Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2017-05-04 19:07:54 -07:00
Jason Ekstrand	bb2a3f0df8	anv/allocator: Get rid of the ability to free blocks Now that everything is going through the state pools, the block pool no longer needs to be able to handle re-use. Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2017-05-04 19:07:54 -07:00
Jason Ekstrand	08413a81b9	anv: Allocate binding table blocks through the state pool Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2017-05-04 19:07:54 -07:00
Jason Ekstrand	55f49e6b7e	anv/allocator: Add support for "back" allocations to state_pool Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2017-05-04 19:07:54 -07:00
Jason Ekstrand	49ecaf88d1	anv/allocator: Drop the block_size field from block_pool Since the state_stream is now pulling from a state_pool, the only thing pulling directly off the block pool is the state pool so we can just move the block_size there. The one exception is when we allocate binding tables but we can just reference the state pool there as well. The only functional change here is that we no longer grow the block pool immediately upon creation so no BO gets allocated until our first state allocation. Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2017-05-04 19:07:54 -07:00
Jason Ekstrand	30d63ffe26	anv/allocator: Pull the userptr part of block_pool_grow into a helper Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2017-05-04 19:07:54 -07:00
Jason Ekstrand	c73ce41a48	anv/allocator: Roll fixed_size_state_pool into state_pool The helper functions aren't really gaining us as much as they claim and are actually about to be in the way. Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2017-05-04 19:07:54 -07:00
Jason Ekstrand	6d02ef011e	anv/allocator: Remove the state_size field from fixed_size_state_pool Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2017-05-04 19:07:54 -07:00
Jason Ekstrand	367031a5c8	anv: Get rid of a bunch of uses of size_t We should only use size_t when referring to sizes of bits of CPU memory. Anything on the GPU or just a regular array length should be a type that has the same size on both 32 and 64-bit architectures. For state objects, we use a uint32_t because we'll never allocate a piece of driver-internal GPU state larger than 2GB (more like 16KB). Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2017-05-04 19:07:54 -07:00
Jason Ekstrand	e86aeecb6a	anv/allocator: Convert the state stream to pull from a state pool Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2017-05-04 19:07:54 -07:00
Jason Ekstrand	e049dea5b2	anv/allocator: Return a null state for zero-size allocations Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2017-05-04 19:07:54 -07:00
Jason Ekstrand	45e1829274	anv/allocator: Add no-valgrind versions of state_pool_alloc/free Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2017-05-04 19:07:54 -07:00
Dave Airlie	a096d8d3f7	radv: enable POLARIS12 support. This just adds the chip in the right places. We don't set the partial_vs_wave workaround, as radeonsi doesn't, but have to confirm it's not required. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Cc: "17.1" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-05-05 11:07:40 +10:00
Chad Versace	e5eace5868	egl/android: Mark surface as lost when dequeueBuffer fails This ensures that future calls to eglSwapBuffers and eglMakeCurrent emit an error. This patch is part of a series for fixing android.hardware.camera2.cts.RobustnessTest#testAbandonRepeatingRequestSurface on Chrome OS x86 devices. Cc: mesa-stable@lists.freedesktop.org Cc: Tomasz Figa <tfiga@chromium.org> Cc: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Nicolas Boichat <drinkcat@chromium.org> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-05-04 17:46:34 -07:00
Chad Versace	0212db3504	egl/android: Cancel any outstanding ANativeBuffer in surface destructor That is, call ANativeWindow::cancelBuffer in droid_destroy_surface(). This should prevent application deadlock when the app destroys the EGLSurface after EGL has acquired a buffer from SurfaceFlinger (ANativeWindow::dequeueBuffer) but before EGL has released it (ANativeWindow::enqueueBuffer). This patch is part of a series for fixing android.hardware.camera2.cts.RobustnessTest#testAbandonRepeatingRequestSurface on Chrome OS x86 devices. Cc: mesa-stable@lists.freedesktop.org Cc: Tomasz Figa <tfiga@chromium.org> Cc: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Nicolas Boichat <drinkcat@chromium.org> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-05-04 17:46:33 -07:00
Chad Versace	23c86c74cc	egl: Emit error when EGLSurface is lost Add a new bool, _EGLSurface::Lost, and check it in eglMakeCurrent and eglSwapBuffers. The EGL 1.5 spec says that those functions emit errors when the native surface is no longer valid. This patch just updates core EGL. No driver sets _EGLSurface::Lost yet. I discovered that Mesa failed to detect lost surfaces while debugging an Android CTS camera test, android.hardware.camera2.cts.RobustnessTest#testAbandonRepeatingRequestSurface. This patch doesn't fix the test though, though, because the test expects EGL_BAD_SURFACE when the surface becomes lost, and this patch actually complies with the EGL spec. If I interpreted the EGL spec correctly, EGL_BAD_NATIVE_WINDOW or EGL_BAD_CURRENT_SURFACE is the correct error. Cc: mesa-stable@lists.freedesktop.org Cc: Tomasz Figa <tfiga@chromium.org> Cc: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Nicolas Boichat <drinkcat@chromium.org> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-05-04 17:46:33 -07:00
Marek Olšák	69e6eab653	winsys/amdgpu: fix Polaris12 (RX 550) breakage reported by Greg White. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100892 Cc: 17.1 <mesa-stable@lists.freedesktop.org>	2017-05-05 01:21:32 +02:00
Kenneth Graunke	9377801fbd	anv: Simplify Cherryview line handling. We can just use the new CHVLineWidth field rather than an entirely different generation's packing function. v2: Inline the function (requested by Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-05-04 16:17:34 -07:00
Kenneth Graunke	31f094e691	i965: Fix line width on Cherryview. We just add another field to gen8.xml for the Cherryview line width, rather than trying to replicate the gymnastics done in the Vulkan driver to use gen9 SF pack functions. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-05-04 16:17:34 -07:00
Marek Olšák	194d9b27cc	radeonsi/gfx9: allow the scratch buffer in HS and GS It works now. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-05-05 00:23:44 +02:00
Marek Olšák	8ac4923a67	radeonsi: prevent race conditions when doing scratch patching Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-05-05 00:23:44 +02:00
Marek Olšák	9dfc030b48	radeonsi: separate scratch state patching code into its own function Picked from a different branch. When we stop using the scratch patching, this function will not be called. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-05-05 00:23:44 +02:00
Marek Olšák	1b01014cbf	radeonsi/gfx9: also apply scratch relocations to the 1st shader of merged shaders Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-05-05 00:23:44 +02:00
Marek Olšák	e107c5a426	radeonsi/gfx9: set correct LLVM calling conventions for merged shaders for scratch support Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-05-05 00:23:44 +02:00
Marek Olšák	a47289f8fc	radeonsi: remove unused parameters from si_shader_apply_scratch_relocs Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-05-05 00:23:44 +02:00
Marek Olšák	2d662c0cba	radeonsi: inline si_llvm_shader_type into si_llvm_create_func Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-05-05 00:23:44 +02:00
Marek Olšák	4c0e68dfe5	radeonsi: don't use util_memcpy_cpu_to_le32 for shader uploads at least I think this is correct. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-05-05 00:23:44 +02:00

1 2 3 4 5 ...

91819 Commits All Branches Search

91819 Commits

All Branches