KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Connor Abbott	73274c9ec2	Revert "ac/nir: handle negate modifier" This reverts commit `bfea7e4d29`.	2019-08-02 11:14:50 +02:00
Connor Abbott	4a382d66ee	Revert "ac/nir: handle abs modifier" This reverts commit `d3c80733cd`. These were only appearing due to memory corruption.	2019-08-02 11:14:08 +02:00
Timothy Arceri	06ec14d692	iris: bump compat profile support to 4.6 All of the current piglit compat profile tests pass. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-02 18:56:53 +10:00
Timothy Arceri	74f96b06d6	egl: fix OpenGL 3.1 context creation >From the EGL_KHR_create_context spec: "* If OpenGL 3.1 is requested, the context returned may implement any of the following versions: * Version 3.1. The GL_ARB_compatibility extension may or may not be implemented, as determined by the implementation. * The core profile of version 3.2 or greater." Fixes CTS tests: dEQP-EGL.functional.create_context_ext.gl_31.rgb888_depth_stencil dEQP-EGL.functional.create_context_ext.robust_gl_31.rgb888_depth_stencil dEQP-EGL.functional.create_context_ext.gl_31.rgb888_depth_no_stencil dEQP-EGL.functional.create_context_ext.robust_gl_31.rgb888_depth_no_stencil dEQP-EGL.functional.create_context_ext.gl_31.rgba8888_depth_no_stencil dEQP-EGL.functional.create_context_ext.gl_31.rgb888_no_depth_no_stencil dEQP-EGL.functional.create_context_ext.robust_gl_31.rgba8888_depth_no_stencil dEQP-EGL.functional.create_context_ext.robust_gl_31.rgb888_no_depth_no_stencil dEQP-EGL.functional.create_context_ext.gl_31.rgba8888_no_depth_no_stencil dEQP-EGL.functional.create_context_ext.robust_gl_31.rgba8888_no_depth_no_stencil dEQP-EGL.functional.create_context_ext.gl_31.rgba8888_depth_stencil dEQP-EGL.functional.create_context_ext.robust_gl_31.rgba8888_depth_stencil Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-02 18:56:53 +10:00
Connor Abbott	f41516bdb5	nir/find_array_copies: Reject copies with mismatched type When we detect a scalar/vector copy through load_deref/store_deref, we have to be careful since those can bitcast an int to a float and vice-versa even though copy_deref can't. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=111251 Fixes: `156306e5e6` ("nir/find_array_copies: Handle wildcards and overlapping copies") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-02 10:34:29 +02:00
Samuel Pitoiset	7368000868	radv: re-apply "Optimize rebinding the same descriptor set." This makes it cheaper to just change the dynamic offsets with the same descriptor sets. This optimization has been reverted a while back because of random GPU hangs on GFX9, no it looks fine, at least CTS no longer hangs on GFX9 and it doesn't hang on GFX10 as well. It fixes a performance problem with Wolfenstein Youngblood. Suggested-by: Philip Rebohle <philip.rebohle@tu-dortmund.de>	2019-08-02 09:56:55 +02:00
Samuel Pitoiset	96a5445559	radv/gfx10: use the correct target machine for Wave32 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-08-02 09:37:38 +02:00
Samuel Pitoiset	8a86908e9a	radv/gfx10: add Wave32 support for vertex, tessellation and geometry shaders It can be enabled with RADV_PERFTEST=gewave32. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-08-02 09:37:36 +02:00
Samuel Pitoiset	953bbacc23	radv/gfx10: add Wave32 support for fragment shaders It can be enabled with RADV_PERFTEST=pswave32. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-08-02 09:37:34 +02:00
Kenneth Graunke	18c2e09dc7	gallium: Implement GL_EXT_shader_samples_identical via a new capability This exposes the textureSamplesIdenticalEXT function in GLSL. We enable it for iris and radeonsi, because their compilers already have support for this. Tested on Intel Kabylake and AMD Vega 64. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-01 23:38:54 -07:00
Kenneth Graunke	adcc0a8fdc	intel/tools: Fix aubinator_viewer build. This functions was recently renamed and not all callers were updated. Fixes: `086c486a75` ("intel/device: rename gen_get_device_info")	2019-08-01 23:36:41 -07:00
Francisco Jerez	54fbc625ea	intel/ir: Fix CFG corruption in opt_predicated_break(). Specifically the optimization of a conditional BREAK + WHILE sequence into a conditional WHILE seems pretty broken. The list of successors of "earlier_block" (where the conditional BREAK was found) is emptied and then re-created with the same edges for no apparent reason. On top of that the list of predecessors of the block immediately after the WHILE loop is emptied, but only one of the original edges will be added back, which means that potentially several blocks that still have it on their list of successors won't be on its list of predecessors anymore, causing all sorts of hilarity due to the inconsistency in the control flow graph. The solution is to remove the code that's removing valid edges from the CFG. cfg_t::remove_block() will already clean up after itself. The assert in bblock_t::combine_with() also needs to be removed since we will be merging a block with multiple children into the first one of them. Found the issue on a hardware enabling branch originally, but apparently somebody reproduced the same problem independently on master in the meantime. Fixes: `d13bcdb3a9` ("i965/fs: Extend predicated break pass to predicate WHILE.") Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=111009 Cc: jiradet.jd@gmail.com Cc: Sergii Romantsov <sergii.romantsov@globallogic.com> Cc: Matt Turner <mattst88@gmail.com> Cc: mesa-stable@lists.freedesktop.org Tested-by: Paul Chelombitko <qamonstergl@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-08-01 16:56:48 -07:00
Mark Janes	ddb59cd20e	intel/device: make internal functions private The device info initializer makes several fuctions internal: - handling of device override - updating topology from kernel information The implementation file is slightly reordered due to the renamed functions being static. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-08-01 16:40:03 -07:00
Mark Janes	086c486a75	intel/device: rename gen_get_device_info Rename the original device info initialization routine so callers don't mistakenly call the wrong one: gen_get_device_info_from_fd: Queries kernel for full device info, including topology details. gen_get_device_info_from_pci_id: Partially initializes device info based on PCI ID lookup, when the kernel is not available. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-08-01 16:39:56 -07:00
Mark Janes	d594d2a052	intel/tools: use device info initializer Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-08-01 16:39:54 -07:00
Mark Janes	e4a0070db4	anv: use initialization routine for gen_device_info Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-08-01 16:39:51 -07:00
Mark Janes	49465f1330	iris/screen: use initialization routine for gen_device_info Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-08-01 16:39:48 -07:00
Mark Janes	96e1c945f2	i965: Move device info initialization to common code With perf queries, initializing the device info is much more complex than just getting a PCI ID and calling gen_get_device_info. This commit adds a new gen_get_device_info_from_fd helper in common code which does all of the requisite kernel queries to get device info including all of the topology information. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-08-01 16:39:44 -07:00
Mark Janes	1186f6ea69	i965/perf: verify kernel support before registering OA metrics When gen_device_info updates the topology in it's initializer, the kernel queries will fail silently. Iris and anv have minimum kernel requirements that support the queries. i965 must verify kernel support before reporting OA metrics. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-08-01 16:39:41 -07:00
Mark Janes	7852fe5415	intel/common: provide common ioctl routine i965 links against libdrm for drmIoctl, but anv and iris both re-implement this routine to avoid the dependency. intel/dev also needs an ioctl wrapper, so lets share the same implementation everywhere. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-08-01 16:38:40 -07:00
Alyssa Rosenzweig	b40ba2db6c	panfrost: Remove unused argument A relic from when we didn't have an online compiler, hah. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:03 -07:00
Alyssa Rosenzweig	ff345d4a01	panfrost: Handle MESA_SHADER_COMPUTE in compile callback Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:03 -07:00
Alyssa Rosenzweig	73c40d6bbb	pan/midgard: Use standard list traversal to find initial tag Fixes a hang (and abort) on empty shaders, which you shouldn't have anyway but better safe than sorry. DCE going on the fritz is no reason to freeze the system. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:03 -07:00
Alyssa Rosenzweig	4647999327	panfrost: Use gl_shader_stage directly for compiles No need to add a third set of enums to the mix. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:03 -07:00
Alyssa Rosenzweig	d9eb65c60c	panfrost: Emit "draw" info for compute jobs Important fields relating to shader state and UBOs are filled out from this (misnomer) function. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:03 -07:00
Alyssa Rosenzweig	22a8f6de61	panfrost: Feed compute shaders into the compiler The path for compute shader compiles resembles the graphic shader compile path, although it is substantially simpler as we don't need any shader keying. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:03 -07:00
Alyssa Rosenzweig	1b284628ef	panfrost: Expose compute shaders as panfrost_shader_variants Whether variants are packed by graphics or compute is irrelevant. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:03 -07:00
Alyssa Rosenzweig	8b53230d47	panfrost: Remove shader state *base It is now unused. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:03 -07:00
Alyssa Rosenzweig	c228046b4b	panfrost: Remove CSO dependency from shader_compile We want this routine to be generic across graphics and compute, so let the caller deal with the typing. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:03 -07:00
Alyssa Rosenzweig	428bed3bde	panfrost: Generalize UBO upload for other shader stages Now that everything is unified, this generalization is nice and easy. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:03 -07:00
Alyssa Rosenzweig	a34370e855	panfrost: Guard vertex upload by ctx->vertex != NULL This is irrelevant for graphics but matters for compute workloads. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:03 -07:00
Alyssa Rosenzweig	3bfdb878aa	panfrost: Generalize vertex shader upload This allows us to reuse the same code path for compute. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:03 -07:00
Alyssa Rosenzweig	3b7224190e	panfrost: Share gl_enables between VERTEX/COMPUTE Catch-all for magic bits. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:03 -07:00
Alyssa Rosenzweig	871c02b12e	panfrost: Invoke compute shader according to grid info We already have helpers for packing invocations (due to its role in instanced vertex shaders), so we can reuse this drop in for compute shaders. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:03 -07:00
Alyssa Rosenzweig	748ccbc808	panfrost: Explain and include compute FBD Squint at it hard enough and you realize it's the beginning of an SFBD... I guess... A compute shader with register spilling would be able to confirm this, but we would expect to see the first field \| 1 and an address splattered later, setting up TLS. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:03 -07:00
Alyssa Rosenzweig	3113be3127	panfrost: Unify-driven cleanup Again, now that stages are unified some logic goes away. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:03 -07:00
Alyssa Rosenzweig	ac6aa93f9e	panfrost: Unify ctx->vs and ctx->fs It's a little verbose, but this way we can support other shader stages without too much contortion. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:03 -07:00
Alyssa Rosenzweig	4b93152c29	panfrost: Flesh out launch_grid stub It's still incomplette, but we're able to hook into launch_grid to create a stub COMPUTE job. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:02 -07:00
Alyssa Rosenzweig	cd1be4605c	panfrost: Cleanup via payload unification Since these are now indexable, quite a bit of code cleans up. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:02 -07:00
Alyssa Rosenzweig	0da52015a1	panfrost: Unify payload_vertex/payload_tiler Rather than disparate variables, let's use an array of payloads indexed by the shader stage. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:02 -07:00
Alyssa Rosenzweig	902115f94f	panfrost: Only wallpaper if we drew something last_tiler.gpu may be NULL at flush time despite no clear and existing jobs -- if we executed a compute-only workload. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:02 -07:00
Alyssa Rosenzweig	2d86828243	panfrost: Adjust shader CAPs to expose dEQP compute Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:01 -07:00
Alyssa Rosenzweig	39fe9f5e2f	panfrost: Expose NIR as our PIPE_SHADER_CAP_SUPPORTED_IRS We could expose TGSI as well -- we pipe it through tgsi_to_nir for Gallium-internal shaders anyway -- but we'd rather not. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:01 -07:00
Alyssa Rosenzweig	1697760e05	panfrost: Copy freedreno's panfrost_get_compute_param Values reported here aren't remotely correct, but it's a start to just get the entrypoint stubbed out. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:01 -07:00
Alyssa Rosenzweig	c8bc664447	panfrost: Expose COMPUTE-related caps for GLES3.1 Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:01 -07:00
Alyssa Rosenzweig	5a8b83ca0b	panfrost: Stub out launch_grid Just dumps some information about the invocation for later debug. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:01 -07:00
Alyssa Rosenzweig	a8fc40aaf5	panfrost: Stub out compute CSO Doesn't do anything, just gets the functions there. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:01 -07:00
Alyssa Rosenzweig	e913986868	panfrost: Implement gl_FrontFacing Interestingly, this requires no compiler changes. It's just exposed as a special varying. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:15:03 -07:00
Alyssa Rosenzweig	f3e15122d4	panfrost: Add support for decoding gl_FrontFacing Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:15:03 -07:00
Alyssa Rosenzweig	9e66ff3ea9	pan/decode: Use max varying index as varying buffer count This allows us to decode asymmetric varyings correctly, which occurs with e.g. gl_FrontFacing. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:15:03 -07:00

... 3 4 5 6 7 ...

114073 Commits All Branches Search

114073 Commits

All Branches