KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Samuel Pitoiset	1cf508b731	radv: save/restore all viewports/scissors for meta operations This is needed since we don't update the number of viewports/scissors when they are set dynamically (according to the spec). In the following scenario: * vkCmdSetViewport() * vkCmdClearColorImage() (or any other meta operations) The viewports/scissors weren't saved correctly because no pipeline was bound before, and thus the number of viewports/scissors were 0. This fixes a regression with: dEQP-VK.draw.negative_viewport_height.front_ccw_cull_back Fixes: `60878dd00c` ("radv: do not update the number of viewports in vkCmdSetViewport()") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-09-25 20:31:55 +02:00
Bas Nieuwenhuizen	bf0397b6f5	Revert "Revert "radv: fallback to an in-memory cache when no pipline cache is provided"" I tested this 10 times with ./deqp-vk --deqp-case=dEQP-VK.texture.filtering.3d.formats.r4g4b4a4* and one full run of CTS, seems the issue is gone. Also reduces CTS runtime by 30% or so. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-09-25 15:36:19 +02:00
Samuel Pitoiset	6f8c40734b	radv: make radv_pipeline_init() static Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-09-25 10:46:59 +02:00
Samuel Pitoiset	45ea90ef1f	radv: make use of ATI_VENDOR_ID everywhere Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-09-25 10:46:55 +02:00
Bas Nieuwenhuizen	d398db2acb	radv: Add code to check if two formats can share DCC metadata. Ported from radeonsi. Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-09-16 11:55:42 +02:00
Samuel Pitoiset	49c72d84c2	radv: dump the list of enabled options when a hang occured Useful to know which debug/perftest options were enabled when a hang report is generated. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-09-14 10:37:57 +02:00
Samuel Pitoiset	ce218c31eb	radv: remove useless 'cmd_buffer' param from radv_buffer_view_init() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-09-13 09:47:41 +02:00
Dave Airlie	f2d0f587ca	radv: work out a base ia_multi_vgt_param. This just reduces the calculations a bit further. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-09-11 23:55:15 +01:00
Dave Airlie	ded1dbfd96	radv: calculate non-draw related ia_multi_vgt_param bits in pipeline This moves a bunch of non-draw dependent calcs into the pipeline code, to reduce CPU overheads in the draw path. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-09-11 23:55:15 +01:00
Dave Airlie	d2490eb2d1	radv: move calculating primgroup_size to pipeline. This moves this out of the draw paths. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-09-11 23:55:15 +01:00
Samuel Pitoiset	d4d777317b	radv: move shaders related code to radv_shader.c Reduce size of radv_pipeline.c and improve code isolation. More code can probably moved but it's a start. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-09-08 17:17:40 +02:00
Samuel Pitoiset	fefbcb090d	radv: add radv_vertex_elements_info data structure In my opinion, this improves code readability. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-09-08 16:04:51 +02:00
Samuel Pitoiset	86b99893eb	radv: do not use a bitfield when dirtying the vertex buffers Useless to track which one has been updated because we re-upload all the vertex buffers in one shot. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-09-07 10:01:21 +02:00
Dave Airlie	3cc620bf55	radv: reduce radv_image struct size. 1480->1472. Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-09-07 11:00:08 +10:00
Dave Airlie	66031d8925	radv: reduce radv_shader_variant struct size. 544->536 Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-09-07 11:00:08 +10:00
Dave Airlie	a2c2a76c9e	radv: reduce radv_cmd_state struct size. 1632->1624. Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-09-07 11:00:08 +10:00
Bas Nieuwenhuizen	1a72ca5667	radv: Put semaphore waits in preamble cs. The separate flush cs gets in the way of batchchain. Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-09-04 00:06:40 +02:00
Samuel Pitoiset	80177306d9	radv: report VM faults if detected It's fairly simple for now, but this might be quite useful. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-09-01 09:46:36 +02:00
Samuel Pitoiset	ad42e2abb8	radv: move RADV_TRACE_FILE functions to radv_debug.c At the moment, debugging radv is not really easy because the driver doesn't report enough information when it hangs. This new file will be the main location for all debug tools. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-09-01 09:41:54 +02:00
Samuel Pitoiset	2bc3d65690	radv: rename record_fail to record_result and use VkResult This will allow to propagate VK_ERROR_OUT_OF_HOST_MEMORY to vkEndCommandBuffer() when necessary. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-08-28 11:25:44 +02:00
Bas Nieuwenhuizen	e3265c10c8	radv: Implement multiview draws. v2: - Use for_each_bit. - split emitting the draw packets out to separate functions. Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-08-24 19:20:47 +02:00
Bas Nieuwenhuizen	2e86f6b259	radv: Add multiview clears. v2: Use for_each_bit. Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-08-24 19:20:47 +02:00
Bas Nieuwenhuizen	3907d63259	radv: Store multiview info in renderpass. Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-08-24 19:20:47 +02:00
Bas Nieuwenhuizen	eec5578158	ac/nir: Make shader key a struct. Some bits can be passed to almost every shader, and I don't like adding 5 variables. Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-08-24 19:20:47 +02:00
Dave Airlie	5378b5d071	radv: cleanup some image view descriptor setup. Avoid passing the vulkan image creation into the image view descriptor setup. This cleans up the usage of range inside the init, instead using the properly inited values in the image view. This is just a cleanup but some future vega changes will depend on it. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Cc: "17.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-08-24 01:14:14 +01:00
Dave Airlie	9c080100d3	radv/gfx9: emit sx_mrt_blend registers GFX9 needs the SX MRT blend registers programmed, port over the code from radeonsi to workout the values from the blend state, and program the registers on rbplus systems. This fixes lots of: dEQP-VK.pipeline.blend.* Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Cc: "17.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-08-24 01:14:14 +01:00
Alex Smith	2e9a13bf22	radv: Fix decompression on multisampled depth buffers Need to take the sample count into account in the depth decompress and resummarize pipelines and render pass. Fixes: `f4e499ec79` ("radv: add initial non-conformant radv vulkan driver") Signed-off-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Cc: "17.2" <mesa-stable@lists.freedesktop.org>	2017-08-07 23:47:49 +02:00
Dave Airlie	1e696b962b	radv: add separate fmask tile swizzle counter. This mirrors what Marek has done for radeonsi, and uses a separate counter to handle the fmask surface for MSAA MRTs. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-08-07 00:08:43 +01:00
Bas Nieuwenhuizen	15e5a7a683	radv: Only convert linear->srgb in compute resolves. It justs works with the fragment shader resolve, so no need to do a custom conversion. In fact with SRGB dest, it actually gives wrong results. Fixes: `69136f4e63` "radv/meta: add resolve pass using fragment/vertex shaders" Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-08-06 16:07:09 +02:00
Andres Rodriguez	14cad8786a	radv: generate the same driver UUID as radeonsi These need to match for interop compatibility queries. Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-08-06 12:42:07 +10:00
Bas Nieuwenhuizen	c9d4b571ad	radv: Add suballocation for shaders. This reduces the number of BOs that we need for the BO lists during a submission. Currently uses a fairly simple linear search for finding free space, that could eventually be improved to a binary tree, which with some per-node info could make a check for space O(1) and finding it O(log n), in the number of buffers in that slab. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-08-03 00:45:13 +02:00
Dave Airlie	df61a05019	radv: handle 10-bit format clamping workaround. This fixes: dEQP-VK.api.copy_and_blit.core.blit_image.all_formats.* for a2r10g10b10 formats as destination on SI/CIK hardware. This adds support to the meta program for emitting 10-bit outputs, and adds 10-bit support to the fragment shader key. It also only does the int8/10 on SI/CIK. Fixes: `f4e499ec7` (radv: add initial non-conformant radv vulkan driver) Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-08-01 00:10:23 +01:00
Andres Rodriguez	a973b9a9f8	radv: rename physical_device->uuid[] to cache_uuid[] We have a few UUIDs, so lets be more specific. Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-07-26 20:42:36 +10:00
Dave Airlie	eaa56eab6d	radv: initial support for shared semaphores (v2) This adds support for sharing semaphores using kernel syncobjects. Syncobj backed semaphores are used for any semaphore which is created with external flags, and when a semaphore is imported, otherwise we use the current non-kernel semaphores. Temporary imports from syncobj fd are also available, these just override the current user until the next wait, when the temp syncobj is dropped. v2: allocate more chunks upfront, fix off by one after previous refactor of syncobj setup, remove unnecessary null check. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-07-21 21:31:54 +01:00
Dave Airlie	9ee67467c9	radv: predicate cmask eliminate when using DCC. When using DCC some clear values don't require a cmask eliminate step. This patch adds support for black and black with alpha 1, there are other values, but I don't have access to a comprehensive list. This works by setting the cmask eliminate predicate when doing the fast clear, and later when doing the cmask elimination making sure the draws are predicated. This increases the fps on Sascha Willems deferred. Tonga: 580fps->670fps on a Tonga PRO card. Polaris 730->850fps Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-07-17 01:44:43 +01:00
Dave Airlie	f8d5b377c8	radv: set cb base tile swizzles for MRT speedups (v4) This patch uses addrlib to workout the tile swizzles according to the surface index. It seems to produce the same values as amdgpu-pro for the deferred test. v2: don't apply swizzle to CMASK. the eg docs don't mention it, and we clearly don't align cmask for that. v3: disable surf index for dedicated images, as these will most likely be shared, and I don't think the metadata has space for this info in it yet. v4: update for shareable images, rename combined_swizzle to tile_swizzle This gets the deferred demo from 730->950fps on my rx480. (dcc cmask elim predication patches get it further) Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-07-17 01:43:41 +01:00
Jason Ekstrand	b70829708a	radv: Implement VK_KHR_external_memory This effectively reverts commit 43a171878bb4b5aedb36a. Technically, VK_KHR_get_memory_requirements2 and VK_KHR_dedicated_allocation are required for the KHR version but this at least restores the removed functionality. This patch builds but has received zero testing. Acked-by: Dave Airlie <airlied@redhat.com>	2017-07-15 08:59:38 -07:00
Alex Smith	0e1886efb9	radv: Fix descriptors for cube images with VK_IMAGE_USAGE_STORAGE_BIT If a cube image has VK_IMAGE_USAGE_STORAGE_BIT set, the type in an image view's descriptor was set to a 2D array (and a few other fields adjusted accordingly). This is correct when the image view is actually bound as a storage image, but not when bound as a sampled image. In that case the type should be set as a cube. Fix by generating 2 sets of descriptors at view creation time for both storage and non-storage usage, and then choose between them based on descriptor type when writing descriptor sets. v2: Generate storage descriptors for images with TRANSFER_DST, since those may be used as storage images internally. Signed-off-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-07-13 00:21:20 +02:00
Dave Airlie	a6c2001ace	radv: add support for cmd predication. This doesn't get used yet, it just adds support to various PKT3 emissions to enable it later. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-07-06 02:06:49 +01:00
Bas Nieuwenhuizen	78bef01da2	radv: Remove unused args of radv_image_view_init. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-06-26 01:24:50 +02:00
Dave Airlie	6a68170c83	radv: handle primitive id input into fragment shader with no geom shader Fixes: dEQP-VK.pipeline.framebuffer_attachment.no_attachments dEQP-VK.pipeline.framebuffer_attachment.no_attachments_ms Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-26 08:45:30 +10:00
Dave Airlie	c2464271a0	radv: introduce perf test env var and allow to enable chaining We have some features that seem to slow things down or cause other possible undesireable side effects, but it would be nice to test games etc with them easily. I forsee multisample DCC and maybe some shader opt changes using this. For now use it for batch chaining. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-09 02:15:25 +01:00
Dave Airlie	00fe30f376	radv: move lots of index related things into the bind. This just moves lots of stuff to the bind stage rather than dealing with it in the draw stage. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-07 10:24:37 +10:00
Dave Airlie	734ea16bdb	radv: move calculating the vertex sgpr to the pipeline. There is no need to calculate this at draw time. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-07 10:24:36 +10:00
Dave Airlie	3f48021b86	radv: rename and make global some functions. I want to use these in the pipeline setup stage. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-07 10:24:36 +10:00
Bas Nieuwenhuizen	4ec89727b2	radv: Remove vertex_descriptors_dirty. Redundant. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-06-06 23:23:43 +02:00
Alex Smith	621b3410f5	util/vulkan: Move Vulkan utilities to src/vulkan/util We have Vulkan utilities in both src/util and src/vulkan/util. The latter seems a more appropriate place for Vulkan-specific things, so move them there. v2: Android build system changes (from Tapani Pälli) Signed-off-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2017-06-06 08:17:13 -07:00
Dave Airlie	67655cb24f	radv: add rb+ support for GFX9 This adds some rb+ support, as on GFX9 we have to disable it as per radeonsi. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-06 09:43:45 +10:00
Dave Airlie	c2fbeb7ca0	radv: add GFX9 cache flushing support. GFX9 needs to write event EOP to a fence buffer, allocate some space for this, and just write an ever increasing number to it, this isn't exactly what radeonsi does, but it seems to work. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-06 09:43:40 +10:00
Dave Airlie	41eba750ba	radv: add gfx9 depth/stencil surface support. This is ported from radeonsi. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-06 09:43:27 +10:00

171 Commits