KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Kenneth Graunke	dd83ef0d1a	iris: Track per-stage bind history, reduce work accordingly We now track per-stage bind history for constant and shader buffers, shader images, and sampler views by adding an extra res->bind_stages field to go with res->bind_history. This lets us flag IRIS_DIRTY_CONSTANTS for only the specific stages involved, and also skip some CPU overhead in iris_rebind_buffer. Cuts 4% of 3DSTATE_CONSTANT_XS packets in a Shadow of Mordor trace on Icelake. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-09-18 15:44:22 -07:00
Kenneth Graunke	1e7daaa6c9	iris: Don't flag IRIS_DIRTY_BINDINGS for constant usage history The underlying buffer isn't changing - so we don't need to update any SURFACE_STATE descriptors - we just might have new constants, meaning we need to re-emit 3DSTATE_CONSTANT_XS. On Gen9, this means we need to update 3DSTATE_BINDING_TABLE_POINTERS_XS too, but that's now handled by the explicit check in the previous patch. On Gen9, this should cause us to re-emit the binding table /pointer/ on writing to a buffer with PIPE_BIND_CONSTANT_BUFFER, rather than emitting a whole new /table/. On Gen8 and Gen11, this avoids binding table churn altogether. Cuts 61% of 3DSTATE_BINDING_TABLE_POINTERS_XS packets in a Shadow of Mordor trace on Icelake. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-09-18 15:44:22 -07:00
Kenneth Graunke	e7db3577f8	iris: Explicitly emit 3DSTATE_BTP_XS on Gen9 with DIRTY_CONSTANTS_XS Right now, we usually flag both IRIS_DIRTY_{CONSTANTS,BINDINGS}_XS, because we have SURFACE_STATE for constant buffers in case the shaders access them via pull mode. But this flagging is overkill in many cases. Gen8 and Gen11 don't need it at all. Gen9 doesn't need that large of a hammer in all cases. Just handle it explicitly so the right thing happens. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-09-18 15:44:22 -07:00
Kenneth Graunke	caa0aebd01	iris: Flag IRIS_DIRTY_BINDINGS_XS on constant buffer rebinds We upload a new SURFACE_STATE for the UBO/SSBO in question, which means that we need new binding tables as well. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-09-18 15:44:22 -07:00
Bas Nieuwenhuizen	4b7e7956f0	radv: Add DFSM support. Apparently we already enabled it without having support ... Not sure if we also need to set disable_start_of_prim when the PS has memory writes, but this mirrors radeonsi. Doubles fillrate in my dual_quad_bench from ~16 pixels/cycles to ~32 pixels/cycle on a Raven. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-09-18 21:28:51 +00:00
Bas Nieuwenhuizen	0fa2740059	radv: Disable dfsm by default even on Raven. When actually implementing it, Talos on low is still 3% slower. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-09-18 21:28:51 +00:00
Bas Nieuwenhuizen	f2dffb395f	radv: Only break batch on framebuffer change with dfsm. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-09-18 21:28:51 +00:00
Connor Abbott	57e0bb8ccc	nir/opt_if: Fix undef handling in opt_split_alu_of_phi() The pass assumed that "Most ALU ops produce an undefined result if any source is undef" which is completely untrue. Due to how we lower if statements to selects and then optimize on those selects later, we simply cannot make that assumption. In particular this pass tried to replace an ior of undef and true, which had been generated by optimizing a select which itself came from flattening an if statement, to undef causing a miscompilation for a CTS test with radeonsi NIR. We fix this by always doing what the non-undef path did, i.e. duplicate the instruction twice. If there are cases where the instruction before the loop can be folded away due to having an undef source, we should add these to opt_undef instead. The comment above the pass says that if the phi source from before the loop is undef, and we can fold the instruction before the loop to undef, then we can ignore sources of the original instruction that don't dominate the block before the loop because we don't need them to create the instruction before the loop. This is incorrect, because the instruction at the bottom of the loop would get those sources from the wrong loop iteration. The code never actually did what the comment said, so we only have to update the comment to match what the pass actually does. We also update the example to more closely match what most actual loops look like after vtn and peephole_select. There are no shader-db changes with i965, radeonsi NIR, or radv. With anv and my vkpipeline-db there's only one change: total instructions in shared programs: 14125290 -> 14125300 (<.01%) instructions in affected programs: 2598 -> 2608 (0.38%) helped: 0 HURT: 1 total cycles in shared programs: 2051473437 -> 2051473397 (<.01%) cycles in affected programs: 36697 -> 36657 (-0.11%) helped: 1 HURT: 0 Fixes KHR-GL45.shader_subroutine.control_flow_and_returned_subroutine_values_used_as_subroutine_input with radeonsi NIR.	2019-09-18 17:18:34 -04:00
Eric Engestrom	a1de3011f3	gl: drop incorrect pkg-config file for glvnd Akin to `1a25980c46` ("egl: drop incorrect pkg-config file for glvnd") and `b01524fff0` ("meson: don't build libGLES*.so with GLVND") , removes a pkg-config file that shouldn't have been there in the first place, but was needed because of that GLVND bug. Now that the glvnd bug has been fixed, it was apparent that this gl.pc pkg-config file was forgotten to be removed, so let's do just that :) Suggested-by: Matt Turner <mattst88@gmail.com> Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-09-18 22:16:51 +01:00
Andres Gomez	66f2aa6ccd	docs: Add the maximum implemented Vulkan API version in 19.3 rel notes Currently, Vulkan 1.1. Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2019-09-19 00:03:55 +03:00
Andres Gomez	41b0e0d7e0	docs: Add the maximum implemented Vulkan API version in 19.2 rel notes Currently, Vulkan 1.1. Cc: 19.2 <mesa-stable@lists.freedesktop.org> Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2019-09-19 00:03:50 +03:00
Andres Gomez	d2db43fcad	docs: Add the maximum implemented Vulkan API version in 19.1 rel notes Currently, Vulkan 1.1. Cc: 19.1 <mesa-stable@lists.freedesktop.org> Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2019-09-19 00:03:39 +03:00
Andres Gomez	d9760f8935	nir/opcodes: Clear variable names confusion Having Python and C variables sharing name in the same block of code makes its understanding a bit confusing. Make it explicit that the Python bit_size variable refers to the destination bit size. Suggested-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-09-18 23:59:07 +03:00
Rhys Perry	b3f71685d9	radv: never kill a NGG GS shader Seems to fix a hang with excessive vertex emissions when NGG is used for GS. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-18 19:26:58 +00:00
Samuel Pitoiset	99c186fbbe	radv/gfx10: fix VK_KHR_pipeline_executable_properties with NGG GS No GS copy shader if a pipeline enables NGG GS. This fixes dEQP-VK.pipeline.executable_properties.graphics.geometry_stage. Fixes: `86864eedd2` ("radv: Implement radv_GetPipelineExecutablePropertiesKHR.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-18 21:19:28 +02:00
Marek Olšák	fe7aa271a9	radeonsi: include drm_fourcc.h to fix the build	2019-09-18 14:52:25 -04:00
Marek Olšák	00e29816e7	radeonsi: implement pipe_screen::resource_get_param v2: return DRM_FORMAT_MOD_INVALID from the function Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (v1)	2019-09-18 14:43:01 -04:00
Marek Olšák	d307aa56f9	gallium: extend resource_get_param to be as capable as resource_get_handle Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-09-18 14:41:30 -04:00
Marek Olšák	aae35fbd3a	ac: move ac_get_num_physical_vgprs into radeon_info Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-09-18 14:39:06 -04:00
Marek Olšák	0692ae34e9	ac: move ac_get_num_physical_sgprs into radeon_info Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-09-18 14:39:06 -04:00
Marek Olšák	ca43006fd2	ac: move ac_get_max_wave64_per_simd into radeon_info Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-09-18 14:39:06 -04:00
Marek Olšák	deab3a23f6	ac: move num_sdp_interfaces into radeon_info Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-09-18 14:39:06 -04:00
Marek Olšák	2c62b461e9	ac: move PBB MAX_ALLOC_COUNT into radeon_info Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-09-18 14:39:06 -04:00
Jonathan Marek	05da025f35	etnaviv: fix two-sided stencil * Set missing STENCIL_CONFIG_EXT2 bits * Swap stencil sides when rendering CCW Fixes following deqp tests (which were 99% failing): dEQP-GLES2.functional.fragment_ops.depth_stencil.* Note: deqp tests require --deqp-gl-config-name=rgba8888d24s8ms0 Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com>	2019-09-18 12:49:10 -04:00
Samuel Pitoiset	68820007fd	radv: fix loading 64-bit GS inputs We have to load 2 32-bit integer and to cast correctly. This fixes crashes with gs-double-interpolator.vk_shader_test. Cc: 19.2 <mesa-stable@lists.freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=111734 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-18 17:16:36 +02:00
Bas Nieuwenhuizen	7999e10cab	tu: Set up glsl types. Addresses this assert: deqp-vk: ../mesa-freedreno-9999/src/compiler/glsl_types.cpp:1244: static const glsl_type glsl_type::get_interface_instance(const glsl_struct_field , unsigned int, enum glsl_interface_packing, bool, const char *): Assertion `glsl_type_users > 0' failed. running dEQP-VK.api.smoke.triangle . Fixes: `624789e370` "compiler/glsl: handle case where we have multiple users for types" Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-09-18 16:51:18 +02:00
Andres Gomez	f833b4cada	docs: Update to OpenGL 4.6 in the release notes After `41549a18e6` ("i965: Enable OpenGL 4.6 for Gen8+"), Mesa implements the OpenGL 4.6 API. Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>	2019-09-18 12:28:05 +00:00
Erik Faye-Lund	ea74b1b9aa	.mailmap: add an alias for Eric Engestrom Reviewed-by: Eric Engestrom <eric@engestrom.ch>	2019-09-18 14:05:05 +02:00
Erik Faye-Lund	ed91eacf71	.mailmap: add an alias for Michel Dänzer Reviewed-by: Michel Dänzer <mdaenzer@redhat.com>	2019-09-18 14:04:40 +02:00
Samuel Pitoiset	46b7512b0a	radv: fix writing depth/stencil clear values to image Use the fastest way only if both aspects are used. Oops. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=111728 Fixes: `218ce34962` ("radv: add mipmap support for the clear depth/stencil values") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-18 13:27:46 +02:00
Michel Dänzer	88e5796daa	gitlab-ci: Merge scons-nollvm and scons-llvm jobs The new job tests scons without LLVM and with all LLVM versions >= 6.0. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-09-18 10:36:48 +00:00
Michel Dänzer	baa5024e24	gitlab-ci: Test scons with all LLVM versions Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-09-18 10:36:48 +00:00
Michel Dänzer	0374aacac0	gitlab-ci: Move scons build/test commands to a separate shell script Preparatory, no functional change intended. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-09-18 10:36:48 +00:00
Michel Dänzer	8a8388ca67	gitlab-ci: Use crossbuild-essential-* packages They are convenience packages which pull in everything needed for cross-building via dependencies. Acked-by: Eric Engestrom <eric.engestrom@intel.com>	2019-09-18 10:36:48 +00:00
Michel Dänzer	a01230e73a	gitlab-ci: Use newer packages from backports by default This is needed in particular to get a recent enough version of meson in the stretch image, but should be generally beneficial. Acked-by: Eric Engestrom <eric.engestrom@intel.com>	2019-09-18 10:36:48 +00:00
Michel Dänzer	8a19992869	gitlab-ci: Create separate docker images for Debian stretch & buster Pros: * Less fragile due to not mixing packages from stretch and buster * No longer need to use third-party LLVM packages * The buster image now uses GCC 8 for C++ as well (previously 6 for C++, 8 for C), allowing to drop some hacks Con: * The stretch image now only uses GCC 6 for C as well as C++ * Need separate jobs for testing old LLVM versions Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-09-18 10:36:48 +00:00
Michel Dänzer	26fcc8baba	gitlab-ci: Pass --no-remove to apt-get where possible If installing new packages would require removing previously installed ones, this flag causes apt-get to abort with an error instead, preventing later obscure failures due to the missing packages. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-09-18 10:36:48 +00:00
Michel Dänzer	2259b45174	gitlab-ci: Reference full ci-templates commit hash 8 digits might become ambiguous at some point. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-09-18 10:36:48 +00:00
Haihao Xiang	8a9b81ab9d	i965: support AYUV/XYUV for external import only Fixes: `89785e2d56` ("i965: add support for sampling from AYUV") Fixes: `7cab8d3661` ("i965: Add support for sampling from XYUV images") Cc: Vivek Kasireddy <vivek.kasireddy@intel.com> Cc: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Haihao Xiang <haihao.xiang@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2019-09-18 12:07:23 +03:00
Boris Brezillon	1e483a87bc	panfrost: Allocate tiler and scratchpad BOs per-batch If we want to execute several batches in parallel they need to have their own tiler and scratchpad BOs. Let move those objects to panfrost_batch and allocate them on a per-batch basis. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-18 10:40:17 +02:00
Boris Brezillon	0eec73a800	panfrost: Add FBO BOs to batch->bos earlier If we want the batch dependency tracking to work correctly we must make sure all BOs are added to the batch->bos set early enough. Adding FBO BOs when generating the fragment job is clearly to late. Add a panfrost_batch_add_fbo_bos helper and call it in the clear/draw path. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-18 10:37:56 +02:00
Boris Brezillon	5a4d095f9b	panfrost: Add the panfrost_batch_create_bo() helper This helper automates the panfrost_bo_create()+panfrost_batch_add_bo()+ panfrost_bo_unreference() sequence that's done for all per-batch BOs. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-18 10:37:31 +02:00
Boris Brezillon	9af4aeaaf7	panfrost: Don't return imported/exported BOs to the cache We don't know who else is using the BO in that case, and thus shouldn't re-use it for something else. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-18 10:35:52 +02:00
Boris Brezillon	90b8934547	panfrost: Add panfrost_bo_{alloc,free}() Thanks to that we avoid the recursive call into panfrost_bo_create() and we can get rid of panfrost_bo_release() by inlining the code in panfrost_bo_unreference(). Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-18 10:35:29 +02:00
Boris Brezillon	cb71ae5572	panfrost: Stop using panfrost_bo_release() outside of pan_bo.c panfrost_bo_unreference() should be used instead. The only difference caused by this change is that the scratchpad, tiler_heap and tiler_dummy BOs are now returned to the cache instead of being freed when a context is destroyed. This is only a problem if we care about context isolation, which apparently is not the case since transient BOs are already returned to the per-FD cache (and all contexts share the same address space anyway, so enforcing context isolation is almost impossible). Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-18 10:35:06 +02:00
Boris Brezillon	e15ab939fd	panfrost: Stop passing screen around for BO operations Store a screen pointer in panfrost_bo so we don't have to pass a screen object to all functions manipulating the BO. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-18 10:34:27 +02:00
Boris Brezillon	10ce751726	panfrost: Don't check if BO is mmaped before calling panfrost_bo_mmap() panfrost_bo_mmap() already takes care of that. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-18 10:34:08 +02:00
Boris Brezillon	a06e08def9	panfrost: Stop exposing panfrost_bo_cache_{fetch,put}() They are not expected to be called directly, users should use panfrost_bo_{create,release}() instead. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-18 10:33:51 +02:00
Boris Brezillon	154cb725d4	panfrost: Move the BO API to its own header Right now, the BO API is spread over pan_{allocate,resource,screen}.h. Let's move all BO related definitions to a separate header file. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-18 10:29:13 +02:00
Boris Brezillon	34efaafc93	panfrost: s/PAN_ALLOCATE_/PAN_BO_/ Change the prefix for BO allocation flags to make it consistent with the rest of the BO API. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-18 10:28:55 +02:00

1 2 3 4 5 ...

115505 Commits All Branches Search

115505 Commits

All Branches