KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Iago Toral Quiroga	f219795a26	v3dv: add a blit fallback path for vkCmdResolveImage This fallback is required when we have to do partial resolves. It works the same way as other blit fallbacks for copy operations: it will bind the source image as a source texture and blit the selected region to the destination image. The difference in this case is that the source image is multisampled and the blit shader needs to fetch and average individual samples for each texel. This gets us to pass all the remaining test cases in dEQP-VK.api.copy_and_blit.core.resolve_image.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	6c27dbdbc9	v3dv: setup texture shader state correctly for multisampled images Fixes multisampled cases in: dEQP-VK.pipeline.multisample.sampled_image.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	006178165c	v3dv: handle multisampled image copies in the TLB path vkCmdCopyImage can be used to copy multisampled images. We can easily support that on the TLB path, which copies full images. For partial copies we will need to amend our blit shader path to support multisampling resolve. Fixes: dEQP-VK.api.copy_and_blit.core.resolve_image.whole_copy_before_resolving.4_bit Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	8a1f6717e8	v3dv: implement vkCmdResolveImage for whole images For partial resolves we will need a shader blit & resolve fallback. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	b48e01e0cd	v3dv: implement subpass multisample rendering and resolve Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	d953eab5af	v3dv: process VkPipelineMultisampleStateCreateInfo properly Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	d87941cb3a	v3dv: consider MSAA when computing frame tiling Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	7c9b40effa	v3dv/pipeline: handle properly OUT_OF_HOST_MEMORY error when allocating p_stage So far we were just assuming that it would work (so we could try to access a NULL pointer), and not freeing it properly. Fixing that was somewhat messy due pipeline_compile_graphics using a temporary array and then update the internal pipeline stages. As we are here we just removed the array and simplified pipeline_compile_graphics code. Fixes following tests: dEQP-VK.api.object_management.alloc_callback_fail.graphics_pipeline dEQP-VK.api.object_management.alloc_callback_fail_multiple.graphics_pipeline Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	d64ff26563	v3dv/pipeline: pre-generate more that one shader variant In order to reduce the number of shader builds after pipeline creation (that ideally shouldn't happen) we pre-generate two shader variants at pipeline creation time. In addition to the default one, that set the return size for all texture to 16 bit, we build another variant setting the return size for all textures to 32-bit. cmd buffer selects the latter if any of the textures requires 32bit. So we are using an all 16-bit return size or an all 32-bit return size variants. This could be slightly improved by pre-generating return size combinations if the texture number is below a threshold. But that would require more space, and bigger pipeline creation time, so would need to be evaluated. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	a87d2c7806	v3dv/pipeline: when looking for a variant, check first current variant So far, when checking for a variant fulfilling a specific v3d key, we were checking the caches, and if that failed, we compiled a new variant, and update the current variant. But we could check first if the current variant fullfils that. This was not really problematic so far, as checking on the caches was fast, but now that we could be without any kind of shader cache using V3DV_ENABLE_PIPELINE_CACHE, it is far better to check first current variant. Without this vkQuake3 at 720p drops to 1fps when disabling the cache. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	62c32d6ca0	v3dv/pipeline: remove custom variant cache Now that we have a default pipeline cache, we can rely on it. This allows to remove some code, and avoid the need to have a cache per each pipeline stage. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	35381a4696	v3dv/pipeline_cache: add default pipeline cache That it would be used as fallback. Three advantages: * Having a cache for user operations even if the user doesn't provide it. * Having a cache for internal operations. v3dv_meta_copy creates pipelines for some copy path, so it is interesting to have them cached. * Testing: so now the pipeline cache is tested by more CTS tests. As any other pipeline cache, it can be disabled with the V3DV_ENABLE_PIPELINE_CACHE. It was suggested that would make sense to have a specific envvar for the default pipeline cache, but for now just one envvar is enough. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	48a64f28c2	v3dv/pipeline: provide a shader_sha1 to private ShaderModules So far for private pipelines we were creating dummy shader modules where we directly provided the nir shader. But for the pipeline cache we were using the SPIR-V to generate part of the cache key sha1. The main use case for private pipelines are meta_copy/clear. Those nir shaders depend on parameters like the format etc, so we use directly the serialized form of the NIR shader to generate the sha1. The other case are the no-op fragment shader that we need to provide if no fragment shader is defined by the user. For that case we can just use the default shader name, as the no-op shader is always the same. This is required as we plan to add a default pipeline cache, that would include our private shaders too. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	cead2a6db6	v3dv/pipeline_cache: MergePipelineCaches implementation Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	8e461b8d71	v3dv/pipeline_cache: support to serialize/deserialize cached NIRs Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	a00fe4cb0c	v3dv/pipeline_cache: cache v3dv_shader_variants This also includes being able to serialize them as part of GetPipelineCacheData and to deserialize it as part of CreatePipelineCache. So now we can also upload the assembly of the variant as part of the PipelineCache creation. Note that from all this the tricky part was the prog_data serialization. v3d_compile allocates and fill a new prog_data, with rzalloc. Among other things because it also allocates internally the uniform list. So we needed to replicate that when deserializating the prog_data. Ideally we would like to avoid that, and allocate as much resources as possible using vk_alloc, but that would mean a somewhat deep change on the v3d_compiler, that we want to avoid as much possible for now. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	63b6b633e9	v3dv/pipeline: add basic ref counting support for variants As soon as we start to cache variants on pipeline caches, the same variant could be used by different pipelines and pipeline caches. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	2326d5bc04	v3dv/pipeline_cache: cache nir shaders Heavily based on anv nir caching. One of the bigger difference is that we don't create the nir shader using a ralloc_context local to the main compile graphics method. On anv, after compiling the shader, they discard the nir shader. We need it as we could need it to build shader variants later. As anv, we introduce a environment variable to disable the cache: V3DV_ENABLE_PIPELINE_CACHE By default is enabled. The main purpose for this envvar is debugging, in order to provide a easy way to discard a bug on the cache. It is pending to serialize/deserialize the NIR shaders as part of GetPipelineCacheData and PipelineCacheCreate. We also plan is to cache too shader variants. We would do that on following patches. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	1d2ae8756f	v3dv/pipeline_cache: bare basic support for pipeline cache And this means providing a proper cache object, and being able to load/retrieve a cache data with a proper header. Not really caching anything yet. That would be tackle on following patches. Note that this no-op cache got all the specific pipeline_cache and pipeline.cache tests passing on the rpi4. The following tests are still crashing when using the simulator: dEQP-VK.synchronization.internally_synchronized_objects.pipeline_cache_compute dEQP-VK.synchronization.internally_synchronized_objects.pipeline_cache_graphics But those are an issue of synchronization tests on the simulator, and not related with the pipeline cache itself. In general synchronization tests should be tested on the rpi4. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	fd70e8e0b0	v3dv/device: proper pipeline cache uuid Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	8f46750f46	v3dv/device: get proper device ID under simulator Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	ffaab5593c	v3dv/device: add vendorID/deviceID get helpers As we would need them for the pipeline cache header. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	a2723154c5	v3dv: fix dynamic state after meta operation We should always save state on a push before starting a meta operation, even if we don't have a pipeline, since dynamic state can be set at any time directly on the command buffer. Similarly, we should always restore it if the pop after the meta operation signals that it has written any state, not only if we have a graphics pipeline to restore. Fixes a rendering artifact in VkQuake. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	33a6fdb53d	v3dv: enable fillModeNonSolid This, together with the shaderClipDistance changes, gets vkQuake3 to run without hacks. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	558987a465	v3dv: enable shaderClipDistance Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	47e02a2ef1	v3dv: add a fast path for vkCmdClearAttachments Since vkCmdClearAttachments executes inside a render pass, we would benefit from converting it to a draw within the current subpass job to improve batching and avoid expensive tile load/store operations. This can dramatically improve performance for applications using this command, however, we can only use this if we are clearing the base layers of framebuffer attachments, since otherwise we would need to use layered rendering, which we don't support yet. This improves vkQuake3 performance dramatically (almost 100% performance improvement at 1080p), which calls this twice per frame. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	c5d07802dc	v3dv: only clear depth/stencil attachments if any aspect needs clearing When we can't perform the clears using the TLB and fallback to vkCmdClearAttachments make sure we only emit the clear if any of the depth/stencil aspects actually needs it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	c4564f62aa	v3dv: emit new shader state if viewport is dirty If we have a dirty viewport then we need to re-emit our uniforms to update QUNIFORM_VIEWPORT_*. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	e7678f7c7f	v3dv: only care about barriers between GPU jobs We don't need to do anything special for host/gpu synchronization. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	08193cc56f	v3dv: avoid prime blit path when presenting WSI images This allows us to present WSI images directly rather than going through a blit path, improving performance. This improves vkQuake performance at 1080p by 10-20%. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	97891898c3	v3dv: only use per-buffer clear bit for cases were we are already storing This bit is helpful if we already need to store the buffer, but otherwise we should not emit the store only to get the clear, we can use the global clear packet for that and save us an expensive tile buffer store operation. Also, we have not been using the per-buffer clear bit for depth/stencil stores since "v3dv: fix depth/stencil clears on hardware", so we should never been considering clearing needs to flag stores. This improves vkQuake performance somewhere between 5%-15% by allowing us to skip the store of the depth attachment. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	add8778638	v3dv: ignore stencil load operation if attachment format doesn't have stencil This gets vkQuake to render correctly, which creates render passes with stencil load operations even when the depth/stencil attachment format doesn't have a stencil aspect. While this is a bit weird, it seems to be allowed by the spec: "If the format has depth and/or stencil components, loadOp and storeOp apply only to the depth data, while stencilLoadOp and stencilStoreOp define how the stencil data is handled." In our case we were not ignoring it and this was causing that we emitted a Z buffer load that seemed to clobber the Z clear, preventing all draw calls from passing the depth test. While we are at it, also change the depth/stencil store operation (which was already handling this scenario correctly) to use the format of the render pass attachment description rather than the underlying image format. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	1e1903fc1b	v3dv: use a binning sync for CL jobs waiting on a semaphore If we have a semaphore wait the job cannot be started before the semaphore has been signaled, so we need to wait before starting the binning stage. Fixes CTS failures in: dEQP-VK.synchronization.op.single_queue.binary_semaphore.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	0db95de577	v3dv: improve pipeline barrier handling So far we have been getting away with finishing the current job in the presence of a pipeline barrier and relying on the RCL serialization, but of course this is not always enough. This patch addresses synchronization across different GPU units (i.e. draw indirect after compute), as well as cases where we need to sync before binning. Fixes CTS failures in: dEQP-VK.synchronization.op.single_queue.barrier.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	adbce7723e	v3dv: don't support sRGB buffer formats Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	d08da4a54a	v3dv: handle VK_FORMAT_B8G8R8A8_UNORM vertex attributes This is the only mandatory vertex attribute format that requires a swizzle. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	5a2ef59963	v3d/compiler: support swapping R/B channels in vertex attributes. We will need this in Vulkan to support vertex format VK_FORMAT_B8G8R8A8_UNORM. The hardware doesn't allow to swizzle vertex attribute components, so we need to do it in the shader. v2: - Use nir_intrinsic_io_semantics() to retrieve the location instead of looping through the shader input variables (Eric). - Assert that we only have one component (Eric). Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> (v1) Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	a2538b2520	v3dv: make sure we emit vertex attributes in location order The order in which we emit the attributes is relevant, since GL_SHADER_STATE_ATTRIBUTE_RECORD packets don't include an explicit attribute index. This means that we need to emit them in driver location order, since the compiler uses that location to compute attribute offsets in the VPM. Fixes ~1300 CTS tests in: dEQP-VK.pipeline.vertex_input.multiple_attributes.out_of_order.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
n00b7	aebfdfa04c	v3dv/device: handle primary nodes for newer kernels For kernels rpi-5.X.y it changed from brcm,bcm2835-vc4 to brcm,bcm2711-vc5, so let's just check for both strings. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	3bd02a729a	v3dv: fix color border clamping with specific formats For some reason, CTS expects E5B9G9R9 and B10G11R11 with transparent black border clamping produce alpha 1 instead of 0. Since border color takes precedence over the texture state swizzle, the only way to fix this is to lower the texture swizzle in the shader to set alpha to 1. Fixes: dEQP-VK.pipeline.sampler.view_type.b10g11r11clamp_to_border_transparent_black dEQP-VK.pipeline.sampler.view_type.e5b9g9r9.clamp_to_border_transparent_black Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	f156c5dc4d	v3dv: fix regressions for cubemap array load/store It seems that we only want to set the texture state's depth to the number of 2D layers divided by 6 when sampling, not wen doing load/store. This means that we need to generate two different states and choose the one to use depending on the descriptor. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	9eb1b578ee	v3dv: use swizzle X001 with D/S formats This matches Vulkan CTS expectations. Fixes (for D/S formats): dEQP-VK.pipeline.sampler.view_type.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Alejandro Piñeiro	336d9e187c	v3dv/image: fix TEXTURE_SHADER_STATE depth for cube arrays For these we want to divide the number of layers by 6. Fixes: dEQP-VK.pipeline.image_view.view_type.cube_array.* dEQP-VK.pipeline.image.suballocation.sampling_type.combined.view_type.cube_array.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	cb1e492ee0	v3dv: handle unnormalized coordinates in samplers In OpenGL, unnormalized coordinates are implicit based on the sampler type (rectangle textures), so the compiler can set the flag when needed. In Vulkan, however, this is configured explicitly in the sampler object, so the compiler won't set it and we need to do it manually when we are writing the P1 uniform. Fixes: dEQP-VK.pipeline.sampler.exact_sampling.*.unnormalized_coords Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	0385da9e08	v3dv: fix blit_shader() to honor the region's aspect mask When translating combined depth/stencil blits to compatible color blits we should look at the requested region aspects to decide the color mask to apply. Fixes: dEQP-VK.api.copy_and_blit..buffer_to_depthstencil.buffer_offset_d24_unorm_s8_uint_D dEQP-VK.api.copy_and_blit..buffer_to_depthstencil.buffer_offset_d24_unorm_s8_uint_SD dEQP-VK.api.copy_and_blit.*.buffer_to_depthstencil.buffer_offset_d24_unorm_s8_uint_S_D Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	bf20a5e7b7	v3dv: fix srcSubresource description for image to buffer blits We are emitting a loop copying layer by layer here. Fixes: dEQP-VK.pipeline.render_to_image.core..huge. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	6053e85ee4	v3dv: fix textureSize() for cube arrays For these we want to divide the number of layers by 6. Fixes: dEQP-VK.glsl.texture_functions.query.texturesize.samplercubearray Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	b6c2fde4b9	v3dv: fix vkCmdCopyBuffer unaligned TLB access We need to make sure that our tile load/store operations are aligned to the selected item size for the copy. Fixes simulator assert in VkQuake. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	56fcc085fb	v3dv: handle draw after barrier Currently, we end the current job whenever the user emits a pipeline barrier, but we then expect to have a valid job when we emit a draw call. If by the time we have to emit a draw call we don't have a valid job, we need to create one by resuming execution of the current subpass. Fixes some tests in: dEQP-VK.renderpass.suballocation.attachment_allocation.input_output.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	8116d65fbc	v3dv: improve handling of too large image sizes Instead of asserting that users don't try to create images that would require 4GB+ of memory, error out with the corresponding OOM error when the user tries to actually allocate the memory for the image. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00

1 2 3 4 5 ...

129848 Commits All Branches Search

129848 Commits

All Branches