mirrors/mesa - Frog Git

Commit Graph

Author	SHA1	Message	Date
Tim Rowley	d9de8f3122	swr/rast: Code style change (NFC) Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-11-20 13:50:29 -06:00
Tim Rowley	08512c52de	swr/rast: Widen fetch shader to SIMD16 Widen fetch shader to SIMD16, enable SIMD16 types in the jitter, and provide utility EXTRACT/INSERT SIMD8 <-> SIMD16 utility functions. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-11-20 13:50:23 -06:00
Tim Rowley	e612231f20	swr/rast: Support flexible vertex layout for DS output Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-11-20 13:49:59 -06:00
Nicolai Hähnle	3f17d3c017	gallium/u_threaded: avoid syncing in threaded_context_flush We could always do the flush asynchronously, but if we're going to wait for a fence anyway and the driver thread is currently idle, the additional communication overhead isn't worth it. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-11-20 18:16:15 +01:00
Nicolai Hähnle	bc65dcab3b	radeonsi: avoid syncing the driver thread in si_fence_finish It is really only required when we need to flush for deferred fences. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-11-20 18:16:11 +01:00
Nicolai Hähnle	3db1ce01b1	radeonsi: recompute the relative timeout after waiting for ready fence Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-11-20 18:16:06 +01:00
Nicolai Hähnle	f5ea8d18ff	ddebug: fix the hang detection timeout calculation Fixes: `c9fefa062b` ("ddebug: rewrite to always use a threaded approach") Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-11-20 18:16:03 +01:00
Nicolai Hähnle	16f8da2997	ddebug: fix use-after-free of streamout targets Fixes: `b47727a83a` ("ddebug: implement pipelined hang detection mode") Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-11-20 18:16:00 +01:00
Nicolai Hähnle	aaebf49eba	gallium/u_threaded: properly initialize fence unflushed tokens This got lost in a rebase but never hurt anything because we happened to always sync in fence_finish anyway... Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-11-20 18:15:56 +01:00
Nicolai Hähnle	81aabb20f3	util/u_queue: really use futex-based fences The relevant define changed in the final revision of the simple mutex patch. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-11-20 18:15:53 +01:00
Nicolai Hähnle	a6e8311723	util/u_queue: fix timeout handling in util_queue_fence_wait_timeout Fixes: `e3a8013de8` ("util/u_queue: add util_queue_fence_wait_timeout") Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-11-20 18:15:49 +01:00
Nicolai Hähnle	764bd6ef96	st/mesa: use asynchronous flushes in st_finish With threaded gallium, the driver may currently be running in another thread. In that case, we will execute all remaining commands in that thread instead of syncing, which should be better for cache locality. Reviewed-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-11-20 18:15:07 +01:00
Nicolai Hähnle	2d8b82baaa	st/mesa: implement st_server_wait_sync properly Asynchronous flushes require a proper implementation of st_server_wait_sync, because we could have the following with threaded Gallium: Context 1 app Context 1 driver Context 2 ------------- ---------------- --------- f = glFenceSync glFlush <-- app sync --> <-- app sync --> glWaitSync(f) .. draw calls .. pipe_context::flush for glFenceSync pipe_context::flush for glFlush Reviewed-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-11-20 18:15:07 +01:00
Nicolai Hähnle	ce470af0b1	u_threaded_gallium: remove synchronization in fence_server_sync The whole point of fence_server_sync is that it can be used to avoid waiting in the application thread. Reviewed-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-11-20 18:15:06 +01:00
Nicolai Hähnle	abeded1cac	amd: build addrlib with C++11 It is required for LLVM anyway. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=103658 Fixes: `7f33e94e43` ("amd/addrlib: update to latest version") Tested-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-11-20 16:26:28 +01:00
Nicolai Hähnle	df5ebe0c26	radeonsi/gfx9: fix VM fault with fetched instance divisors We need to account for SGPR locations in merged shaders. This case is exercised by KHR-GL45.enhanced_layouts.vertex_attrib_locations Fixes: `79c2e7388c` ("radeonsi/gfx9: use SPI_SHADER_USER_DATA_COMMON") Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-11-20 16:26:10 +01:00
Samuel Pitoiset	3a32858fc3	radv: use a 16 bytes array for the sampled/storage image descriptors This allows to update them with only one memcpy(). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-11-20 11:18:22 +01:00
Samuel Pitoiset	bc92ed04ac	radv: do not add the query pool BO to the list in vkCmdEndQuery() As per the spec, the query identified by queryPool and query must currently be active. Applications have to call vkCmdBeginQuery() before, and thus the query pool BO will already be in the list. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-11-20 11:18:20 +01:00
Samuel Pitoiset	cf54ea155e	radv: only load needed depth clear regs for fast depth clears Similar to how the driver sets the depth clear regs after a fast depth clear. Most of the time, this will copy a 32-bit reg instead of a 64-bit reg. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-11-20 10:45:27 +01:00
Samuel Pitoiset	e55b7609fa	radv: do not add the image BO in radv_set_depth_clear_regs() For the fast path, radv_fill_buffer() ensures that the BO is already in the list. For the slow path, the depth surface is part of the framebuffer which means the BO is added to the list when the framebuffer is emitted. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-11-20 10:45:23 +01:00
Samuel Pitoiset	3c6bba83f0	radv: remove useless assertion in emit_depthstencil_clear() Already checked in emit_clear(). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-11-20 10:45:21 +01:00
Samuel Pitoiset	403a3d8061	radv: remove useless check in radv_set_depth_clear_regs() aspects can't be zero and there is an assertion that ensures it's not in emit_clear(). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-11-20 10:45:19 +01:00
Dave Airlie	59ca0c4b44	docs/features: mark some r600 extensions supported These just looked to be missed when this file was updated. Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-11-20 10:22:25 +10:00
George Barrett	f09c2cefdd	glsl: Catch subscripted calls to undeclared subroutines generate_array_index fails to check whether the target of a subroutine call exists in the AST, potentially passing around null ir_rvalue pointers eventuating in abort/segfault. Fixes: `fd01840c0b` ("glsl: add AoA support to subroutines") Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100438	2017-11-20 11:04:04 +11:00
Eric Anholt	514db90448	broadcom/vc5: Fix up integer texture handling. The original spec I had didn't expose integer textures and suggested that you use unfiltered floats. Now there are proper formats for them. Fixes 16- and 32-bit texwrap integer tests in piglit, and dEQP-GLES3.functional.fbo.completeness.renderable.renderbuffer.color0.rgb10_a2ui.	2017-11-19 10:12:30 -08:00
Eric Anholt	65ae4527d9	broadcom/vc5: Fix simulator assertion failures about color RT clears. When we tried to clear color while storing depth, it assertion failed about basically not having enough information to decide which color RT to clear. It turns out the STORE_GENERAL picks the buffer according to the color buffer being stored, or all of them if NONE. If you're doing depth, it doesn't know which to pick.	2017-11-19 10:12:30 -08:00
Rob Clark	ae44845aff	freedreno/ir3: add texture gather support Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-11-18 13:39:39 -05:00
Lucas Stach	f5d477f447	etnaviv: enable full overwrite when no color buffer is present The OVERWRITE bit disables destination fetches, which is exactly what we want when there is no valid color buffer bound. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Wladimir J. van der Laan <laanwj@gmail.com>	2017-11-18 12:33:49 +01:00
Jason Ekstrand	1eab327ba7	i965: Stop including brw_cfg.h in brw_disasm_info.h The brw_disasm_info header is included by certain tools in order to get shader assembly from binaries so it's a semi-external header. Including brw_cfg.h also pulls in brw_shader.h so you end up getting quite a bit of our back-end compiler internals. Instead, make the couple of forward declarations we need and make the header more stand-alone. This fixes the meson build. Reviewed-by: Matt Turner <mattst88@gmail.com> Fixes: `4f82b17287`	2017-11-17 21:51:16 -08:00
Jason Ekstrand	0a6a137eb2	i965: Mark BOs as external when we export their handle Almost all of our BO export paths were already properly marked the BO as external and added it to the handle table. Most export use-cases go through a prime fd or flink where we have a brw_bo export helper that does the right thing. The one missing one happens when you call queryImage and ask for __DRI_IMAGE_ATTRIB_HANDLE. We just grabbed the gem handle out of the BO (because it's really easy to do that) and handed it off to the client; what could go wrong? As it turns out, this path is used by basically every compositor that wants to turn around and call drmModeAddFB2 on it so it can hand it off to display. The result, as of `4b1e70cc57`, is that we no longer set MOCS_PTE on those surfaces and the kernel's attempts to disable caching fail and we scanout gets corruption. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=103759 Fixes: `4b1e70cc57` Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: mesa-stable@lists.freedesktop.org	2017-11-17 17:16:44 -08:00
Jason Ekstrand	344252a27f	i965/bufmgr: Add a helper to mark a BO as external Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: mesa-stable@lists.freedesktop.org	2017-11-17 17:16:44 -08:00
Andres Gomez	1866f7aee5	i965: Correct disasm_info usage in eu_validate test Fixes: `4f82b17287` ("i965: Rewrite disassembly annotation code") Cc: Matt Turner <mattst88@gmail.com> Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-11-18 03:07:06 +02:00
Eric Anholt	8d5994098f	broadcom/vc5: Set up the padded height at surface creation time. This centralizes the calculation in the surface, instead of in each load/store.	2017-11-17 16:09:55 -08:00
Eric Anholt	87391e23cf	broadcom/vc5: Ensure that there is always a TLB write. This should fix some GPU hangs in our (currently always single-threaded) fragment shaders, and definitely fixes assertion failures in simulation.	2017-11-17 16:09:55 -08:00
Eric Anholt	c40ac132e4	broadcom/vc5: Fix clear color for swap_color_rb render targets. Fixes dEQP-GLES3.functional.depth_stencil_clear.depth.*	2017-11-17 16:09:55 -08:00
Eric Anholt	52f3e9e43c	broadcom/vc5: Fix pasteo in front stencil ref value setup. Fixes piglit masked-clear.	2017-11-17 16:09:55 -08:00
Eric Anholt	b63dd626b7	broadcom/vc5: Fix colormasking when we need to swap r/b colors. Fixes part of piglit masked-clear.	2017-11-17 16:09:55 -08:00
Eric Anholt	2daf941a58	broadcom/vc5: Enable the Z min/max clipping planes.	2017-11-17 16:09:55 -08:00
Eric Anholt	c259bf686c	broadcom/vc5: Fix driver for new PIPE_SHADER_CAP_MAX_HW_ATOMIC_*.	2017-11-17 16:09:54 -08:00
Brian Paul	2b5b6bebac	r300: add PIPE_SHADER_CAP_MAX_HW_ATOMIC_COUNTER* switch cases To silence compiler warnings. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-11-17 16:09:59 -07:00
Brian Paul	af322ed887	tgsi: s/uint/enum pipe_shader_type/ Roland Scheidegger <sroland@vmware.com>	2017-11-17 16:09:40 -07:00
Brian Paul	fdee3e1d82	tgsi: bump tgsi_opcode_info::output_mode size to 4 bits To avoid problems with MSVC. And verify size with ASSERT_BITFIELD_SIZE(). Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2017-11-17 16:09:39 -07:00
Kenneth Graunke	a01ba366e0	i965: Revert Gen8 aspect of VF PIPE_CONTROL workaround. This apparently causes hangs on Broadwell, so let's back it out for now. I think there are other PIPE_CONTROL workarounds that we're missing. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=103787	2017-11-17 14:28:22 -08:00
Adam Jackson	ddcd4b05a3	egl: Convert int to attrib in eglGetPlatformDisplay ... because converting attrib to int truncates, and that's bad. Signed-off-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-11-17 16:43:16 -05:00
Rob Clark	1831e3fb1d	docs: update features for freedreno Just comparing glxinfo and features.txt, and it seems features.txt is fairly out of date. The a5xx specific features (compute/images/atomics/ etc) are recent. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-11-17 15:19:38 -05:00
Matt Turner	821ec473a8	i965: Rename intel_asm_annotation -> brw_disasm_info It was the only file named intel_* in the compiler. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-11-17 12:14:38 -08:00
Matt Turner	4f82b17287	i965: Rewrite disassembly annotation code The old code used an array to store each "instruction group" (the new, better name than the old overloaded "annotation"), and required a memmove() to shift elements over in the array when we needed to split a group so that we could add an error message. This was confusing and difficult to get right, not the least of which was because the array has a tail sentinel not included in .ann_count. Instead use a linked list, a data structure made for efficient insertion. Acked-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-11-17 12:14:38 -08:00
Matt Turner	f80e97346b	i965: Simplify annotation_insert_error() Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-11-17 12:14:38 -08:00
Matt Turner	f4276ef7ef	i965: Move common code out of #ifdef I'm going to change the call in a later patch and with the difference in indentation level it wasn't immediately obvious that the calls were identical. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-11-17 12:14:38 -08:00
Anuj Phogat	822fd2341d	i965: Remove DWord length from MI_FLUSH_DW definition Fixes: `6165fda59b` ("i965: Program DWord Length in MI_FLUSH_DW") Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-11-17 11:51:28 -08:00

1 2 3 4 5 ...

97978 Commits All Branches Search

97978 Commits

All Branches