mirrors/mesa - Frog Git

Commit Graph

Author	SHA1	Message	Date
Iago Toral Quiroga	7aa04ad065	v3dv: handle D/S buffer to image copies with the texel buffer path We do this by converting them to a compatible color copy and using a destination color mask as well as a source component swizzle to handle D24 format semantics according to the V3D hardware requirements, similar to what we do with our blit shader interface. This path is faster than the terrible copy_buffer_to_image_blit, which requires to copy the source buffer to a tiled image first and should be avoided as much as possible, since it is slow and can also quickly increase device memory usage. This fixes occasional OOM errors when loading traces in renderdoc. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8875>	2021-02-05 13:31:25 +01:00
Jason Ekstrand	7fe36c1187	v3dv: Switch to the common VK_EXT_debug_report Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8676>	2021-02-01 18:54:24 +00:00
Alejandro Piñeiro	21f9a88673	v3dv: port to using common dispatch code. This moves v3dv over to using the new common dispatch layer code. v2 (Jason Ekstrand): - Remove some now dead function declarations Acked-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8676>	2021-02-01 18:54:24 +00:00
Alejandro Piñeiro	3e2bbf5d50	v3dv: remove reference to v3dv_instance on v3dv_physical_device As we already have a reference to vk_instance at vk_physical_device, that we are setting when calling vk_physical_device_init. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8676>	2021-02-01 18:54:24 +00:00
Jason Ekstrand	bde7e1c313	v3dv: Drop v3dv_instance::app_info There's an equivalent data structure in vk_instance. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8676>	2021-02-01 18:54:24 +00:00
Alejandro Piñeiro	9c0079e0ee	v3dv: move to subclassing instance/physical device This moves to using the common base structs for these two objects, but doesn't use any of the new features yet. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8676>	2021-02-01 18:54:24 +00:00
Jason Ekstrand	8ee88948e3	vulkan: Move vk_device to its own file Things are going to start getting more complicated so let's avoid the single mega-file approach. Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8676>	2021-02-01 18:54:24 +00:00
Alejandro Piñeiro	a346e1dea0	v3dv: properly handle two different binding points for cmd_buffers From vkCmdBindPipeline spec: "pipelineBindPoint is a VkPipelineBindPoint value specifying to which bind point the pipeline is bound. Binding one does not disturb the others." But internally we were only handling one pipeline per command buffer, so binding a pipeline of one type would override an alredy bound pipeline of other type. Note that for push constants, in the same way that we were keeping one client array and one bo for the values, for all stages, independently of the stageFlags specified by vkCmdPushConstants, we are keeping the same idea here, so such client array and bo is still tied to the command buffer, and used by the two pipeline bind points. That makes far easier tracking the push constants. We could revisit in the future if we want a more fine grained tracking. Fixes the following crashes: dEQP-VK.pipeline.push_constant.lifetime.pipeline_change_diff_range_bind_push_vert_and_comp dEQP-VK.pipeline.push_constant.lifetime.pipeline_change_same_range_bind_push_vert_and_comp v2 (from Iago review) * Move removal of v3dv_resource definition to a different commit. * Use the new v3dv_cmd_pipeline_state on the cmd buffer meta sub-struct, call it gfx for consistency Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8613>	2021-01-22 10:50:32 +00:00
Alejandro Piñeiro	dac20e100b	v3dv: drop v3dv_resource definition In the end it became a v3dv_bo+offset combination, and for that we already have v3dv_cl_reloc, so it became redundant. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8613>	2021-01-22 10:50:31 +00:00
Iago Toral Quiroga	cec2ed7c80	v3dv: fix disabling Early Z for the whole frame The documentation states that if we disable Early Z for the whole frame in the RCL Tile Rendering Mode packet, then we should not emit any draw calls with it enabled (which we can do by enabling it in the CFG_BITS packet). Since we emit our RCL after recording our draw calls in the BCL and we were not considering there if any condition for global disable would be met, it was possible that we end up with an incorrect configuration when we decide for a global disable in the RCL, which can cause rendering artifacts. This can be easily observed by simply forcing the RCL bit to disable early Z in applications that are known to enable it in CFG_BITS (such as the UE Shooter demo for example). With this change we keep track of this scenario when we record draw calls in the BCL and if decide that we need to disable EZ for the entire job, we make sure we never enable it for any draw calls in the frame. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8589>	2021-01-21 14:12:00 +01:00
Iago Toral Quiroga	9c97cc37b0	v3dv: enable early Z/S clears This is an optimization that should make Z/S clears faster. To enable this we can't have any Z/S loads or stores in the job. Also, it seems that enabling early Z/S clearing is independent of whether early Z/S testing is enabled. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8589>	2021-01-21 14:12:00 +01:00
Iago Toral Quiroga	15cf2ab642	v3dv: disable early Z writes if Z writes are disabled I saw this while inspecting CL dumps from the UE Shooter demo, where they disable Z writes for occlusion queries. The hardware is probably doing this internally, but it doesn't hurt to do this explicitly and make CL traces consistent with intended behavior. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8571>	2021-01-19 10:08:27 +00:00
Iago Toral Quiroga	a46547671b	v3dv: only update uniforms for dirty descriptors if stage has descriptors If we have dirty descriptor set state we have to update our uniform data to reference the new resources such as addresses for textures or UBOs. This is known to have a high CPU cost, so we want to limit this as much as we can. It is a common rendering pattern in applications to render many objects using the same pipeline, but modifying the descriptor sets bound to update textures, UBOs, etc. In this scenario, we would be incurring in unnecessary uniform stream updates for stages that don't access descriptor sets at all. This change makes it so we track which shader stages in a pipeline use descriptor set state and skips updating uniform streams for them when dirty descriptor set state is the only reason requiring us to generate new uniform streams for a draw call. v2: reuse shader stage information from the pipeline set layouts to track shader stages that use descriptor state. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8555>	2021-01-19 08:07:57 +00:00
Eric Anholt	547d11de54	etnaviv, v3d: Fix valgrind include paths. dep_valgrind gives you -I/usr/include/valgrind (or whatever) so if valgrind/ wasn't in the search path anyway, these includes would fail. Found in CI when adding valgrind to the build images. Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7936>	2020-12-15 19:39:29 +00:00
Iago Toral Quiroga	4f7d4871a6	v3dv: don't log out of pool memory errors for internal driver pools Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7848>	2020-12-01 12:10:11 +00:00
Iago Toral Quiroga	eb75a67bd6	v3dv: add a helper to choose a compatible TFU format We are now doing this in 3 different places so it makes sense. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7845>	2020-12-01 11:24:17 +00:00
James Park	f86668f487	vulkan/util: Consolidate typed_memcpy Collapse typed_memcpy definitions into one header. Use do/while(0) pattern to fix MSVC compilation. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7830>	2020-12-01 07:48:08 +00:00
Iago Toral Quiroga	94c00be49f	v3dv: only write new uniforms when needed Writing uniform streams is performance sensitive so we should try our best to avoid writing new uniforms if they have not changed. Particularly, if only the vertex buffers have changed, we should not write new uniforms. This improves performance in vkQuake2 by about 11.15%. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7683>	2020-11-20 08:00:00 +01:00
Iago Toral Quiroga	5c305c8e36	v3dv: use VkSurface to retrieve an authenticated display fd We still need a fallback for the case where the application makes WSI allocations without a surface (Zink), but for the general case, this is the right way to do this, as it would ensure that we use the same display connection that was used to create the surface. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7631>	2020-11-18 08:11:37 +00:00
Alejandro Piñeiro	30b6fbc496	v3dv: use the common base object type and struct Used as reference Hyujun's commit `5d3fdbc52b`, that does the same for turnip. This commit also replaces in several cases alloc for zalloc, and adds checks on more Destroy methods if the object to be free is NULL or not. Most of them were needed to avoid crashes/weird behaviour due trying to use un-initialized data. Note that now that vk_object_free iterates over a array, making it more against un-initialized or just NULL data. Additionally, using zalloc we can also remove some memset to 0. In fact we needed to remove them, as if not, they would override the vk_object_base object to 0 (the alternative would me doing a memset computing a pointer offset, but that's is not needed as we can just use zalloc). v2: * Call memset(0) on reused descriptor sets when calling ResetDescriptorPool, not when reallocating them (Iago) * Add null check when calling DestroyImageView (detected by a full CTS run) v3: Fixed rebase conflicts after last meta copy/clear changes Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7627>	2020-11-17 23:55:14 +00:00
Iago Toral Quiroga	249aed1ff0	v3dv: rename playout and dslayout fields to use underscores. Following a suggestion from Alejandro, since playout is a word on its own and can be confusing. It also makes it more consistent with other variable names that use an underscore. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7651>	2020-11-17 12:12:45 +01:00
Iago Toral Quiroga	ba69c36ada	v3dv: add a buffer to image copy path using a texel buffer This is much faster than the blit fallback (which requires to upload the linear buffer to a tiled image) and the CPU path. A simple stress test involving 100 buffer to image copies of a single layer image with 10 mipmap levels provides the following results: Path \| Recording Time \| Execution Time \| -------------------------------------------------\| Texel Buffer \| 2.954s \| 0.137s \| -------------------------------------------------\| Blit \| 10.732s \| 0.148s \| -------------------------------------------------\| CPU \| 0.002s \| 1.453s \| -------------------------------------------------\| So generally speaking, this texel buffer copy path is the fastest of the paths that can do partial copies, however, the CPU path might provide better results in cases where command buffer recording is important to overall performance. This is probably the reason why the CPU path seems to provide slightly better results for vkQuake2. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7651>	2020-11-17 12:04:42 +01:00
Alejandro Piñeiro	035e21e780	v3dv/pipeline: take into account precision for the output_type By default we are using 32bit output type for texture operations, 16bit for shadow. With this commit we also use the precision info from the sampler (that is assigned if SPIR-V uses RelaxedPrecision decorator), in order to use 16bit. This is a first step as only take into account the precision of the deref_vars used on the texture operation. But the decoration can be also applied to other cases, like the result of the operation. That means that there are ways to infer that the texture operation can operate at relaxed precision. Those cases would be handled on following patches. v2: * Add directly the return_size on the descriptor_map, instead of shadow/relaxed_precision. * Check relaxed precision for images too (Iago) * Handle the return size for the default sampler v3: * Handle different output size for the case of not having a sampler. * Comment fixes (Iago) Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7545>	2020-11-14 15:59:02 +00:00
Alejandro Piñeiro	7da854e186	v3dv: remove combined_idx support Now that the v3d compiler has support for separated texture and sampler indices, we can stop to combine them. Again, that's what Vulkan allows after all. As we are doing this we can't use anymore the texture format (coming from the texture) to chose the return size (that is a sampling parameter). We default for 32, and just go to 16 for shadow. We plan to use SPIR-V RelaxedPrecision to use in more cases 16 bit. We would do that on following patches. v2 (from Iago feedback): * Fix typos/bad grammar on comments. * Move tex/sampler number assert to before the loop that fills tex/sampler info. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7545>	2020-11-14 15:59:02 +00:00
Alejandro Piñeiro	1fe52af61a	v3dv: cleanup/remove support for pre-generated variants In preparation to the changes that would allow to not need them. It is worth to note that it is likely (we have some ideas in mind) that we would need to bring back pre-generate variants on the future. The approach is slightly different on v3dv_pipeline vs v3dv_cmd_buffer: * v3dv_pipeline: even after the clean-up, we had code for all the functions they have, even if they were doing less things (specifically, a second shader variant), so they still make sense on their own, and serve as template for adding support of multiple pre-generated shader variants in the future. * v3dv_cmd_buffer: as we really don't need to fill up the key with some after-pipeline data, we would end with some functions empty (specifically cmd_buffer_populate_v3d_key). Even as a placeholder, that would be odd. Additionally the current code has a lot of boilerplate code (functions to fill up vs, cs and fs keys are basically the same), and we already have in mind refactor them. So it would be better to remove all of them, instead of keeping around some code we would not be happy with. If in the future we pregenerate more that one variant, hopefully the new code to chose between them would be better. v2: clarify the commit message, and fix typos on the comments (Iago) Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7545>	2020-11-14 15:59:02 +00:00
Iago Toral Quiroga	87fb1ec352	v3dv: move authenticated display fd acquisition to swapchain creation time So far, we have only been supporting X11, so we assumed that we were running inside X11 and would always try to get an authenticated fd from Xorg during device initialization. While this works for desktop Raspbian, it is not really correct and it is not what we want to do when we start considering other WSIs. Initially, one could think we can still do this by guarding the WSI code under the proper instance extension check. This, however, doesn't work reliably, as the Vulkan loader can call vkEnumerateDevices without enabling surface extensions on the instance, which then can lead to us not initializing any display_fd and failing with VK_ERROR_INITIALIZATION_FAILED, which is not correct, so while we can try to acquire the display_fd here, it might not always work, and we should definitely not fail initialization of the physical device for that. Instead, with this change we move acquisition of display_fd to swapchain creation time where required extensions need to be enabled in the instance. This was also suggested by Daniel Stone during review of a work-in-progress implementation for the Wayland WSI. There is a special case to consider though: applications like Zink that don't use Vulkan's swapchains at all but still allocate images that they intend to use for WSI. We need to handle these by checking that we have indeed acquired a display_fd before doing any memory allocation for WSI, and acquiring one at that time if that's not the case. This change also removes the render_fd and display_fd fields from the logical device (which we were copying from the physical device), because now there is no guarantee that we have acquired a display_fd at the time we create a logical device. Instead, we now put a reference to the physical device on the logical device from which we can access these. Finally, this also fixes a regression introduced with VK_KHR_display, where if that extension is enabled but we are running inside a compositor, we would acquire a display_fd that is not authenticated and try to use that instead of acquiring an authenticated display_fd from the display server. Fixes: `b1188c9451` (v3dv: VK_KHR_display extension support) Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7546>	2020-11-12 10:02:20 +01:00
Alejandro Piñeiro	ca1969ca88	v3dv/util: log debug ignored stype only on debug builds Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7497>	2020-11-09 12:54:56 +00:00
Alejandro Piñeiro	1fa4a37256	v3dv/util: remove several logging functions We already have vk_error to report errors, they add little specific v3dv wrapping over a simple fprintf, and they are not used really often. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7497>	2020-11-09 12:54:56 +00:00
Iago Toral Quiroga	7d6609e70d	v3dv: fix occlusion query inheritance in secondary command buffers If a secondary command buffer has occlusion query inheritance then draw calls recorded in it should update an active occlusion query counter started in the primary command buffer. If executing the secondary in a primary required to emit jobs and not just a branch instruction, then we might need to create a new job for the primary as well, and in that case we would lose the occlusion query state, so we need to re-emit it at that point so any additional draw calls recorded into the secondary that is being executed continue to update the counter. Fixes: dEQP-VK.query_pool.concurrent_queries.secondary_command_buffer Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7373>	2020-11-02 12:02:17 +01:00
Iago Toral Quiroga	12f87b6e7c	v3dv: add support for timestamp queries V3D doesn't provide any means to acquire timestamps from the GPU so we have to implement these in the CPU. v2: enable timestampComputeAndGraphics and set timestampPeriod (Piñeiro) Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7373>	2020-11-02 12:02:17 +01:00
Iago Toral Quiroga	0a9b8077ad	v3dv: add image view debug checks for VK_KHR_maintenance1 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7364>	2020-11-02 09:33:32 +01:00
Iago Toral Quiroga	666817ce84	v3dv: grow meta descriptor pool dynamically Our blit shader path allocates a descriptor pool to create combined image sampler descriptors for blit source images. So far, we had sized this pool statically and the driver would fail if we ever need to allocate more descriptors than that. With this change, we switch to using a dynamic allocation mechanism instead where we allocate as many pools as we need to meet descriptor set allocation requirements for the command buffer. Also, every time a new pool needs to be created, we double its size (up to a limit), so we can start small and avoid wasting memory for command buffers that only have a small number of blits, while trying to keep allocation overhead low for command buffers that record a lot of blits. v2: use existing framework for automatic destruction of private driver objects to free allocated pools. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7311>	2020-10-27 10:15:28 +00:00
Iago Toral Quiroga	0a4fc19605	v3dv: skip unnecessary tile loads when blitting If we are blitting to tile boundaries we don't need to emit tile loads. The exception to this is the case where we are blitting only a subset of the pixel components in the image (which we do for single aspect blits of D24S8), since in that case we need to preserve the components we are not writing. There is a corner case where some times we create framebuffers that alias subregions of a larger image. In that case the edge tiles are not padded and we can't skip the loads. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7247>	2020-10-22 08:00:02 +02:00
Iago Toral Quiroga	43063d8de8	v3dv: generate proper UUIDs for device and driver Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	1e81bb05ae	v3dv: implement workaround for GFXH-1461 If a subpass clears one aspect of Depth/Stencil but loads the other the clear might get lost. Fix this by emitting the clear as a draw call instead of relying on the TLB clear. Fixes: dEQP-VK.renderpass.suballocation.attachment.3.307 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	49549f43c7	v3dv: always program a reasonable internal depth type for copies/clears This doesn't seem to fix anything, but it is the right thing to do. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Alejandro Piñeiro	1ed8252514	v3dv/pipeline_cache: extend pipeline cache envvar So far V3DV_ENABLE_DEFAULT_PIPELINE_CACHE allowed to configure pipeline cache to avoid any caching using a pipeline cache. With this change we can be more detailed. Then envvar is not anymore a boolean. Allowed values: * "off": no pipeline cache at all. PipelineCache objects behaves as no-op objects. * "no-default-cache": user PipelineCache caches nir/variants, but we don't provide a default cache in case the user doesn't provide a PipelineCache object, neither for internal pipelines. * "full" (default): we provide a default PipelineCache, used when the user doesn't provide one when creating a Pipeline, and for internal Pipelines. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	f7af9eb211	v3dv: free noop job if needed when finishing the queue Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	deb0dce1ee	v3dv: don't leak dumb BO handles allocated for swapchain images Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	4acf5985a4	v3dv: hook up robust buffer access Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	4823313587	v3dV: move meta init/finish to meta implementation files Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	db0bb36ace	v3dv: don't cache subpass color clear pipelines Subpass color clear pipelines are those used to emit partial attachment clears as draw calls inside the render pass currently bound by the application in the command buffer, leading to a huge performance improvement compared to the case where we emit them in their own render pass. Unfortunately, because the pipeline references the render pass object in which it is used and the render pass object is owned by the application (and can be destroyed at any point), we can't cache these pipelines (unless we implement a refcounting mechanism or other similar strategy). Performance impact looks negligible based on experiments with vkQuake3, probably because the underlying pipeline cache is preventing the redundant shader recompiles. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Alejandro Piñeiro	fa7fbdb088	v3dv/pipeline: set 16bit return_size for shadows always So far we were pre-generating two variants, an all 16 bit return_size and an all 32-bit return_size, as at pipeline creation time we don't know the texture format that it would be used finally used. But it is possible to override or at least refine the 32bit case, as we know in advance that all shadow textures can (and in fact should) use return_size 16bit. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Alejandro Piñeiro	229cce4056	v3dv/pipeline: track if texture is shadow To be used to decide the texture return size. We add it on the descriptor map because it is the easier place to do so. As we are lowering the texture accesses we can check instr->is_shadow at that point. It is true that it is somewhat odd, as so far the descriptor map was general-descriptor info, but is_shadow is only for textures. But it doesn't make sense to make an effort now, as it is possible that we would get more descriptor-specific info on the map on the future. We can revisit that later. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Alejandro Piñeiro	e8ceb8f56a	v3dv/meta: fix hash table insertion So far we were using directly the local variable key to do the insertion, when the hash table expects a permanent address. We add a key field on all the meta structures (that are already basically a wrapper over v3dv_pipeline). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	29ef4ddcf9	v3dv: handle multisample rasterization with empty framebuffers If the framebuffer has no attachments then multisample rasterization is enabled based on the rasterizationSamples multisample state of the pipelines. It should be noted that since we don't support the variableMultisampleRate feature, all pipelines in the same subpass must have matching number of samples. V3D requires that we specifically setup our frames to enable multisampling or not, and we do this when we create jobs inside a subpass. Since we create the first job for a subpass as soon as the subpas starts, this is problematic: if we don't have any attachments, we don't won't enable MSAA at this point, but later on we might bind an MSAA pipeline, since pipelines can be bound at any point in the lifespan of a command buffer. Here, we fix this by testing if the first draw call in a job uses an MSAA pipeline but the job the was setup to not use MSAA, and in that case we re-start the job with MSAA enabled. We also take care of a corner case that seems to be tested by CTS where a framebuffer with no attachments doesn't bind any pipelines with MSAA enabled (so according to the Vulkan spec, multisample rasterization must be disabled) but the fragment shader in use reads gl_SampleID (which enables per-sample shading). This would lead to enabling per-sample shading with single-sample rasterization, which doesn't make sense and makes the simulator complain, so we just disable per-sample shading in that case. Fixes: dEQP-VK.pipeline.multisample.mixed_count.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	5de282b50a	v3dv/descriptor: remove v3dv_descriptor_map_get_image_view Now that we added support for texel_buffers, on all the cases that we were checking for a image_view we end checking for a image_view or buffer_view, so we stopped to use it. Remove it as it become superfluous. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	4a63b1ae87	v3dv: handle multisample resolves for formats that don't support TLB resolves The TLB multisample resolve feature is only limited to specific format types. For everything else, including sfloat and integer formats, we need to fallback to a blit resolve. This needs to be handled both for in-pass resolves as well as for vkCmdResolveImage. Because these blits would happen after the tile store operations, we need to make sure we store the multisampled buffers so we can then read them for the blit resolve. Fixes the remaining test failures in: dEQP-VK.renderpass.suballocation.multisample_resolve.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	947fa7033d	v3dv: add v3dv_limits file There are several definitions for hw limits on v3dv_image that we want to share, but v3dv_private was already growing bigger and messier. So let's move them to a specific header. Note that there is already a broadcom/common/v3d_limits.h. We are not putting them there because right now they are only used by the Vulkan driver, but are candidates to be moved. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	81b713e341	v3dv/descriptor: support for UNIFORM/STORAGE_TEXEL_BUFFER This gets passing most uniform/storage_texel buffer tests. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	3310c1542c	v3dv: handle multisampled image copies with the blit path This should be able to handle partial copies of multisampled images. This change extends our blit shader interface to also handle multisampled destinations so that if the blit destination is a multisampled image, the blit will rely on sample rate shading to copy all samples from the source image (which must have a matching number of samples). I have not found any tests in CTS that do partial copies of multisampled images, so I tested this with a full multisampled image copy, using this test: dEQP-VK.api.copy_and_blit.core.resolve_image.whole_copy_before_resolving.4_bit Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	f219795a26	v3dv: add a blit fallback path for vkCmdResolveImage This fallback is required when we have to do partial resolves. It works the same way as other blit fallbacks for copy operations: it will bind the source image as a source texture and blit the selected region to the destination image. The difference in this case is that the source image is multisampled and the blit shader needs to fetch and average individual samples for each texel. This gets us to pass all the remaining test cases in dEQP-VK.api.copy_and_blit.core.resolve_image.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	d953eab5af	v3dv: process VkPipelineMultisampleStateCreateInfo properly Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	d87941cb3a	v3dv: consider MSAA when computing frame tiling Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	d64ff26563	v3dv/pipeline: pre-generate more that one shader variant In order to reduce the number of shader builds after pipeline creation (that ideally shouldn't happen) we pre-generate two shader variants at pipeline creation time. In addition to the default one, that set the return size for all texture to 16 bit, we build another variant setting the return size for all textures to 32-bit. cmd buffer selects the latter if any of the textures requires 32bit. So we are using an all 16-bit return size or an all 32-bit return size variants. This could be slightly improved by pre-generating return size combinations if the texture number is below a threshold. But that would require more space, and bigger pipeline creation time, so would need to be evaluated. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	a87d2c7806	v3dv/pipeline: when looking for a variant, check first current variant So far, when checking for a variant fulfilling a specific v3d key, we were checking the caches, and if that failed, we compiled a new variant, and update the current variant. But we could check first if the current variant fullfils that. This was not really problematic so far, as checking on the caches was fast, but now that we could be without any kind of shader cache using V3DV_ENABLE_PIPELINE_CACHE, it is far better to check first current variant. Without this vkQuake3 at 720p drops to 1fps when disabling the cache. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	62c32d6ca0	v3dv/pipeline: remove custom variant cache Now that we have a default pipeline cache, we can rely on it. This allows to remove some code, and avoid the need to have a cache per each pipeline stage. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	35381a4696	v3dv/pipeline_cache: add default pipeline cache That it would be used as fallback. Three advantages: * Having a cache for user operations even if the user doesn't provide it. * Having a cache for internal operations. v3dv_meta_copy creates pipelines for some copy path, so it is interesting to have them cached. * Testing: so now the pipeline cache is tested by more CTS tests. As any other pipeline cache, it can be disabled with the V3DV_ENABLE_PIPELINE_CACHE. It was suggested that would make sense to have a specific envvar for the default pipeline cache, but for now just one envvar is enough. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	48a64f28c2	v3dv/pipeline: provide a shader_sha1 to private ShaderModules So far for private pipelines we were creating dummy shader modules where we directly provided the nir shader. But for the pipeline cache we were using the SPIR-V to generate part of the cache key sha1. The main use case for private pipelines are meta_copy/clear. Those nir shaders depend on parameters like the format etc, so we use directly the serialized form of the NIR shader to generate the sha1. The other case are the no-op fragment shader that we need to provide if no fragment shader is defined by the user. For that case we can just use the default shader name, as the no-op shader is always the same. This is required as we plan to add a default pipeline cache, that would include our private shaders too. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	a00fe4cb0c	v3dv/pipeline_cache: cache v3dv_shader_variants This also includes being able to serialize them as part of GetPipelineCacheData and to deserialize it as part of CreatePipelineCache. So now we can also upload the assembly of the variant as part of the PipelineCache creation. Note that from all this the tricky part was the prog_data serialization. v3d_compile allocates and fill a new prog_data, with rzalloc. Among other things because it also allocates internally the uniform list. So we needed to replicate that when deserializating the prog_data. Ideally we would like to avoid that, and allocate as much resources as possible using vk_alloc, but that would mean a somewhat deep change on the v3d_compiler, that we want to avoid as much possible for now. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	63b6b633e9	v3dv/pipeline: add basic ref counting support for variants As soon as we start to cache variants on pipeline caches, the same variant could be used by different pipelines and pipeline caches. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	2326d5bc04	v3dv/pipeline_cache: cache nir shaders Heavily based on anv nir caching. One of the bigger difference is that we don't create the nir shader using a ralloc_context local to the main compile graphics method. On anv, after compiling the shader, they discard the nir shader. We need it as we could need it to build shader variants later. As anv, we introduce a environment variable to disable the cache: V3DV_ENABLE_PIPELINE_CACHE By default is enabled. The main purpose for this envvar is debugging, in order to provide a easy way to discard a bug on the cache. It is pending to serialize/deserialize the NIR shaders as part of GetPipelineCacheData and PipelineCacheCreate. We also plan is to cache too shader variants. We would do that on following patches. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	1d2ae8756f	v3dv/pipeline_cache: bare basic support for pipeline cache And this means providing a proper cache object, and being able to load/retrieve a cache data with a proper header. Not really caching anything yet. That would be tackle on following patches. Note that this no-op cache got all the specific pipeline_cache and pipeline.cache tests passing on the rpi4. The following tests are still crashing when using the simulator: dEQP-VK.synchronization.internally_synchronized_objects.pipeline_cache_compute dEQP-VK.synchronization.internally_synchronized_objects.pipeline_cache_graphics But those are an issue of synchronization tests on the simulator, and not related with the pipeline cache itself. In general synchronization tests should be tested on the rpi4. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Alejandro Piñeiro	ffaab5593c	v3dv/device: add vendorID/deviceID get helpers As we would need them for the pipeline cache header. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	47e02a2ef1	v3dv: add a fast path for vkCmdClearAttachments Since vkCmdClearAttachments executes inside a render pass, we would benefit from converting it to a draw within the current subpass job to improve batching and avoid expensive tile load/store operations. This can dramatically improve performance for applications using this command, however, we can only use this if we are clearing the base layers of framebuffer attachments, since otherwise we would need to use layered rendering, which we don't support yet. This improves vkQuake3 performance dramatically (almost 100% performance improvement at 1080p), which calls this twice per frame. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	add8778638	v3dv: ignore stencil load operation if attachment format doesn't have stencil This gets vkQuake to render correctly, which creates render passes with stencil load operations even when the depth/stencil attachment format doesn't have a stencil aspect. While this is a bit weird, it seems to be allowed by the spec: "If the format has depth and/or stencil components, loadOp and storeOp apply only to the depth data, while stencilLoadOp and stencilStoreOp define how the stencil data is handled." In our case we were not ignoring it and this was causing that we emitted a Z buffer load that seemed to clobber the Z clear, preventing all draw calls from passing the depth test. While we are at it, also change the depth/stencil store operation (which was already handling this scenario correctly) to use the format of the render pass attachment description rather than the underlying image format. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	0db95de577	v3dv: improve pipeline barrier handling So far we have been getting away with finishing the current job in the presence of a pipeline barrier and relying on the RCL serialization, but of course this is not always enough. This patch addresses synchronization across different GPU units (i.e. draw indirect after compute), as well as cases where we need to sync before binning. Fixes CTS failures in: dEQP-VK.synchronization.op.single_queue.barrier.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	a2538b2520	v3dv: make sure we emit vertex attributes in location order The order in which we emit the attributes is relevant, since GL_SHADER_STATE_ATTRIBUTE_RECORD packets don't include an explicit attribute index. This means that we need to emit them in driver location order, since the compiler uses that location to compute attribute offsets in the VPM. Fixes ~1300 CTS tests in: dEQP-VK.pipeline.vertex_input.multiple_attributes.out_of_order.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	3bd02a729a	v3dv: fix color border clamping with specific formats For some reason, CTS expects E5B9G9R9 and B10G11R11 with transparent black border clamping produce alpha 1 instead of 0. Since border color takes precedence over the texture state swizzle, the only way to fix this is to lower the texture swizzle in the shader to set alpha to 1. Fixes: dEQP-VK.pipeline.sampler.view_type.b10g11r11clamp_to_border_transparent_black dEQP-VK.pipeline.sampler.view_type.e5b9g9r9.clamp_to_border_transparent_black Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	f156c5dc4d	v3dv: fix regressions for cubemap array load/store It seems that we only want to set the texture state's depth to the number of 2D layers divided by 6 when sampling, not wen doing load/store. This means that we need to generate two different states and choose the one to use depending on the descriptor. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	cb1e492ee0	v3dv: handle unnormalized coordinates in samplers In OpenGL, unnormalized coordinates are implicit based on the sampler type (rectangle textures), so the compiler can set the flag when needed. In Vulkan, however, this is configured explicitly in the sampler object, so the compiler won't set it and we need to do it manually when we are writing the P1 uniform. Fixes: dEQP-VK.pipeline.sampler.exact_sampling.*.unnormalized_coords Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	6053e85ee4	v3dv: fix textureSize() for cube arrays For these we want to divide the number of layers by 6. Fixes: dEQP-VK.glsl.texture_functions.query.texturesize.samplercubearray Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	8116d65fbc	v3dv: improve handling of too large image sizes Instead of asserting that users don't try to create images that would require 4GB+ of memory, error out with the corresponding OOM error when the user tries to actually allocate the memory for the image. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	91907560d5	v3dv: implement support for shader spilling Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	e59e706928	v3dv: don't reset descriptor state after a meta operation If the meta operation did not change descriptor state then we should keep it, not reset it. Fixes: dEQP-VK.fragment_operations.early_fragment.early_fragment_tests_stencil dEQP-VK.fragment_operations.early_fragment.no_early_fragment_tests_stencil Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	d809d9f3f6	v3dv: don't free BOs from imported memory objects Only free the underlying BO when the exported memory object is freed to avoid multiple frees of the same memory. The only exception is winsys BOs where we import a BO created in the display device into the render device. In this case, we only have one memory object referencing the BO and we want to destroy it with that memory object. Fixes: dEQP-VK.api.external.memory.dma_buf.* dEQP-VK.api.external.memory.opaque_fd.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	b356d3de8c	v3dv: implement indirect compute dispatch The hardware can't do this, so we need to record a CPU job that will map the indirect buffer at queue submission time, read the dispatch parameters and then submit a regular dispatch. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	1d6edcc3e8	v3dv: always emit index buffer state for new jobs New jobs need to re-emit all state. Typically, this is achieved by resetting all dirty state flags when we start a new job, but for index buffers we were not using a dirty bit because we always emit them immediately. This patch adds the bit and only tries to skip index buffer state if the bit is not dirty, which will ensure that we will always emit it for new jobs. This fixes a regression in the shadowmapping demo from Sascha Willems introduced with "v3dv: try harder to skip emission of redundant state". Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	2f1c15116f	v3dv: handle unsized arrays in SSBOs CTS coverage for this was hiding behind compute shaders so we didn't notice this was not working properly until now. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	7e990683fd	v3dv: implement compute dispatch for now this only implements regular dispatches, not indirect. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	d0b1bb3032	v3dv: handle separate binding points for compute and graphics Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	67d5b0c91f	v3dv: support compute pipelines Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Alejandro Piñeiro	f78c99f357	v3dv/bo: add a maximum size for the bo_cache and a envvar to configure it V3DV_MAX_BO_CACHE_SIZE can be used to configure it. So one way to disable the bo cache is setting V3DV_MAX_BO_CACHE_SIZE to zero. This would still run all the bo_cache size checks, but having another envvar just to ensure that anything related to the cache is used seemed like an overkill. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Alejandro Piñeiro	2adea940f1	v3dv/bo: adding a BO cache Heavily based on the already existing for the v3d OpenGL driver, but without references, and with some extra OOM checks (Vulkan CTS has several OOM tests). With this commit v3dv_bo_alloc and v3dv_bo_free became frontends to the bo_cache. The former tries to get a BO from the cache if possible, and the latter stores the BO on the cache if possible. The former also adds a new parameter to point if the BO to allocate is private. As v3d we are only caching private BOs, those created by the driver for internal use (like CLs, tile_alloc, etc). They are the ones with the highest change of being reused (for example, CL BOs are always 4KB, so they can always be reused). User-created BOs can have any size, including some very large ones for buffers and images, which makes them far less likely to be reused and would add a lot of memory pressure if we decided to cache them. In any case, in practice, we found that we could get a performance improvement by caching also user-created BOs, but that would need more care and an analysis to decide which ones makes sense. Would also require to change how the cached BOs are stored by size. Right now there are an array of list_head, that doesn't work well with big BOs. If done, that would be handled on a separate commit. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	1f8343b875	v3dv: add a CPU path for buffer to image copies The blit shader path for buffer to image copies is pretty bad, since it needs to produce a tiled image from the linear buffer prior to emitting the blit copy. This patch adds a new preferential path where we implement the copy using the CPU, similar to what the GL driver does for texture uploads. This makes vkQuake2 at least 4x faster when dynamic lights are enabled (which triggers dynamic texture updates). We also tested a GPU path where we use a shader that takes the linear buffer as a UBO and copies directly from it. This also shows a clear performance gain, but still worse than the CPU implementation. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	e1c8041cde	v3dv: try harder to skip emission of redundant state Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Alejandro Piñeiro	9a894849d5	v3dv/descriptor: use descriptor pool bo for image/samplers This allows us to remove some individual bos for the image and sampler, used to store the SAMPLER_STATE and TEXTURE_SHADER_STATE. Now they are prepacked on static memory as part of the vulkan object struct. This commit introduces small descriptor structs, used to define what the bo subregion would contain. It is used mostly to compute offsets to that specific data, and define the size needed. Having said so, it would be possible to replace them with some kind of flag (like anv) or just compute the offset based on the context. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:30 +00:00
Alejandro Piñeiro	18553b50f9	v3dv/descriptor: add general bo on descriptor pool So far we were saving all the descriptor info on the host memory. With this commit we do the equivalent that other mesa vulkan drivers (Anvil and Turnip) and create a bo on the descriptor pool that would be suballocated for each descriptor. This would allow to clean up individual bos from some vulkan objects, reducing device memory fragmentation, and allowing to avoid to alloc bos for that info. After all, pre-allocating needed memory is one of the purposes of the descriptor pool. This commit introduces all the infrastructure, but doesn't use it for any descriptor yet, as if no descriptor needed data uploaded to a bo. The idea to decide which info goes to the descriptor pool bo is info that we would need to upload to a bo in any case, as it is referenced as an address by any packet. We could be more aggressive with that general rule, but that would be enough for now. If in the future we support VK_EXT_descriptor_indexing, we probably would need to store more info, as under that extension, descriptors can be updated after being bound. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:30 +00:00
Iago Toral Quiroga	509c8a60c4	v3dv: don't leak attachment state We were assuming that if the command buffer state doesn't have any attachments (as per the attachment count) the attachment state array should not be allocated, however, during meta operations it is possible that the attachment state grows (since meta operations can emit render passes of their own). In that case, we would grow the state for the meta operation but then pop the previous attachment count and we would leak the state. An example of that is a secondary command buffer which has no attachment state by default since it doesn't execute a render pass begin, but that executes one in a meta operation (for vkCmdClearAttachments for example). Fix this by making the attachment count an allocation count instead and not popping it once we finish a meta operation. Also, always free the state so long as there is a valid pointer, and assert that the allocated count is not zero in that case. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:30 +00:00
Iago Toral Quiroga	3092ddf4d5	v3dv: support vkCmdClearAttachments in secondary command buffers The main change we are introducing here is that now we allow secondary command buffers that execute in a render pass to have a job list with more than one job. The main issue with vkCmdClearAttachments is that we currently need this to spawn multiple jobs to clear multilayered framebuffers, as we need to setup a different 2D framebuffer for each layer to clear and therefore emit a different RCL for each. We could avoid this completely by used layered rendering with the "clear rect" path to redirect the clear rects to appropriate layers of the primary framebuffer, however, our hardware only supports layered rendering with geometry shaders, which we don't support at present. Because vkCmdClearAttachments relies on having framebuffer state available (something we would not need if we used the geometry shader implementation), if this is not available in the secondary we need to postpone emission of the command until the secondary is executed inside a primary. We do this by using a new CPU job V3DV_JOB_TYPE_CPU_CLEAR_ATTACHMENTS that is processed during vkCmdExecuteCommands by calling vkCmdClearAttachments directly in the primary. As a consequence of these changes, it is now possible that a secondary command buffer that runs inside a render pass have any kind of job in its job list, including partial CLs that need to be branched to and full CLs that need to be submitted to the GPU as is, so we introduced a new GPU job type V3DV_JOB_TYPE_GPU_CL_SECONDARY to identify partial CLs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:30 +00:00
Iago Toral Quiroga	ca4b42a51d	v3dv: implement vkCmdWaitEvents for secondary command buffers Event waits can be safely moved before a render pass start, since event setting and resetting commands cannot happen inside one. We don't need to go that far, but we can use this to record the wait in its own separate job and then execute this job before the binning commands recorded in the secondary command buffer when we execute the secondary into a primary. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:30 +00:00
Iago Toral Quiroga	6a34ef6565	v3dv: add basic support for secondary command buffers There are basically two types of scenarios to consider: - Secondary command buffers that run inside a render pass. - Secondary command buffers that run outside a render pass. For the former we want to record their commands into a binning command list that we can branch to when executed into a primary command buffer. This means this kind of command buffers don't spawn new jobs, just the default one where they record the binning commands which won't include the frame setup, which will be provided by the primary they will be executed in. For the latter we don't require anything special, we just record as many jobs as we need as usual and link that job list from the primary job list when executed. This handles most scenarios except: - vkCmdWaitForEvents - VkCmdClearAttachments Both of these can spawn new jobs inside a render pass, which is not what we want for secondary command buffers. We will address this is follow-up patches. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:30 +00:00
Iago Toral Quiroga	e048eba81a	v3dv: drop the extra BO handling from the command buffer Now that we have a framework to register objects allocated internally by the driver we can just use that. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:30 +00:00
Alejandro Piñeiro	03f5fae88f	v3dv/cmd_buffer: move variant checking to CmdDraw In order to properly check (and possibly compile) shader variants we need a pipeline and a compatible descriptor set. So far we were trying to do that check as early as possible, so we were trying to do it at CmdBindPipeline or CmdBindDescriptorSets, and a combination of dirty flags. This showed to not cover all the corners cases, and made the code complex, as needed to handle cases where the descriptors were not yet available, and return early. The latter also meant that we were running several checks that failed in the middle. This commit moves the variant check to CmdDraw, when we should have a pipeline and compatible descriptor sets, and simplifies and makes more strict the existing code. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:30 +00:00
Iago Toral Quiroga	57a254c48d	v3dv: implement events This reverts a previous half-attempt at an implementation of events using a BO to hold the event state, and provides a full implementation. V3D doesn't have any built-in GPU functionality to wait on any kind of events, so we need to implement this in the driver an therefore we no longer need to use a BO for the event state. Instead, we implement GPU waits by using a CPU job for the wait operation that spawns a wait thread if the wait operation doesn't have all its events signaled by the time it is processed. To implement the semantics of the wait correctly, any jobs in the same command buffer that come after the wait will not be emitted until the wait thread completes. If a submit spawns any wait threads for a command buffer we can't signal any semaphores for it until all the wait threads complete and we know that all the jobs for those command buffers have been submitted. The same applies to the submit fence, if present. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:30 +00:00
Iago Toral Quiroga	10dbb1e07d	v3dv: make the driver more robust against OOM This is generally very difficult to handle properly everywhere, but at least this is good enough to make the few CTS tests for this happy. Fixes (on Rpi4): dEQP-VK.wsi.xlib.swapchain.simulate_oom.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:30 +00:00
Iago Toral Quiroga	ce681aac70	v3dv: fix the command buffer private object framework for 32-bit We were declaring the destroy callback function as taking a pointer for the vulkan object handle and relying on an implicit conversion to the Vulkan handle type, however that would be incorrect on 32-bit platforms, where non-dispatchable Vulkan objects (the kind that we may allocate privately during command buffer recording), are defined as uint64_t, so the signature of the destry callback type doesn't match the signature of the actual Vulkan function, leading to bogus results. Fix that by using uint64_t instead. This fixes compilation warnings and also crashes in some tests when compiling and executing natively in Rpi4. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:30 +00:00
Iago Toral Quiroga	1957689249	v3dv: fix dynamic blend constants We were pre-packing the constants from the pipeline state and then always emitting that at draw time, ignoring dynamic state. This makes it so we don't prepack at pipeline creation time and we always emit the correct constants directly the command buffer dynamic state. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:30 +00:00
Iago Toral Quiroga	98ade03113	v3dv: implement wide lines Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:30 +00:00
Iago Toral Quiroga	9aaf07e5be	v3dv: implement depth bias This doesn't implement depth bias clamp, which requires to support the depth bias clamp feature, which we do not advertise as available at present. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:30 +00:00
Iago Toral Quiroga	88a59437d2	v3dv: drop blit path for depth/stencil formats We can now implement all depth/stencil blits as compatible color blits, so let's just have the blit shader interface simply convert any blit with a depth/stencil format to a compatible color blit (like we were already doing for s8d24) and get rid of the depth blitting path. This also allows us to ignore the blit aspect in the blit pipeline cache key. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:30 +00:00
Iago Toral Quiroga	b0af413edf	v3dv: implement partial image to buffer copies We implement this by blitting the requested region to a linear image setup to use the buffer memory store at the requested offset. Because we can't store linear depth/stencil images, we implement copies of depth/stencil aspects by using a compatible color blit. To do this, we also need to account for the fact that when we are copying depth from a d24 format we need to copy them from the MSB 24-bits of each word as provided by the hardware and store them in the LSB 24-bit of the buffer (as per Vulkan requirements). This is achieved by expanding our blit interface to also accept a swizzle to apply to the source texture. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:30 +00:00
Iago Toral Quiroga	e270d2fb14	v3dv: implement stencil aspect blits for combined depth/stencil format To do this we just implement the stencil blit as a masked color bit with uint8 format. This allows us to support blitting on combined depth/stencil formats, and therefore, also partial image copies for these formats. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:30 +00:00
Alejandro Piñeiro	1b4a9c7d45	v3dv: properly return OOM error during pipeline creation So far we were just asserting or aborting if any of the internal method used during the pipeline creation failed. We needed to change the return value of several methods, in order to bubble up the proper memory allocation error. Note that as the pipeline creation is complex and splitted in several methods, if an error happens in the middle, it returns back, and rely on the higher level to call PipelineDestroy. This method needs to take into account that some of the resources could have not been allocated when freeing it. Also note that v3dv_get_shader_variant is used during the pipeline bind, as with the new resources bound, we need to check if we need to recompile a new variant. At that moment we are not creating a new vulkan object so we can really return a OOM error. For now we just assert on that case. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:30 +00:00
Iago Toral Quiroga	f49d9d7c8b	v3dv: fix leaks during recording of meta blits This uses the framework to register private commmand buffer objects that get freed automatically when the command buffer is destroyed by the application. This change also moves the descriptor set pool that the meta blit path uses to allocate descriptors for the blit source textures, from the device to the command buffer, so we can have a descriptor pool per command buffer. This is necessary to ensure correct behavior when doing multi-threaded command buffer recording (alternatively, we would have to lock around the descriptor set allocation code, which would be undesirable). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:30 +00:00
Iago Toral Quiroga	bfadd70d6c	v3dv: add framework for private driver objects This allows the driver to register private Vulkan objects it creates as part of command buffer recording (usually for meta operations) in the command buffer, so they can be destroyed together with it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:30 +00:00
Iago Toral Quiroga	e18865b5dd	v3dv: support blits with 1D and 3D images Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:30 +00:00
Iago Toral Quiroga	831aa5d438	v3dv: implement shader draw fallback for vkCmdBlitImage For now this is limited to blits of 2D color images. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Iago Toral Quiroga	5fc55e3a04	v3dv: save and restore push constant state during meta operations Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Iago Toral Quiroga	441d3911b9	v3dv: save and restore descriptor state during meta operations if needed For now we have only been using meta operations for clears which don't need to bind descriptor sets, however meta blits will need to. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Iago Toral Quiroga	30f1fc25ce	v3dv: implement TFU blits While very limited in scope, this might be the most efficient way to blit when applicable. In fact, we might also want to use this for the image copy commands when possible instead of the TLB. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Iago Toral Quiroga	71ba6a1109	v3dv: simplify handling of no-op jobs Avoid creating (and destroying) no-op jobs more than once. Instead, cache the job and use it every time we need to submit one. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Iago Toral Quiroga	53657b0cb1	v3dv: implement occlusion queries The design for queries in Vulkan requires that some commands execute in the GPU as part of a command buffer. Unfortunately, V3D doesn't really have supprt for this, which means that we need to execute them in the CPU but we still need to make it look as if they happened inside the comamnd buffer from the point of view of the user, which adds certain hassle. The above means that in some cases we need to do CPU waits for certain parts of the command buffer to execute so we can then run the CPU code. For exmaple, we need to wait before executing a query resets just in case the GPU is using them, and we have to do a CPU wait wait for previous GPU jobs to complete before copying query results if the user has asked us to do that. In the future, we may want to have submission thread instead so we don't block the main thread in these scenarios. Because we now need to execute some tasks in the CPU as part of a command buffer, this introduces the concept of job types, there is one type for all GPU jobs, and then we have one type for each kind of job that needs to execute in the CPU. CPU jobs are executed by the queue in order just like GPU jobs, only that they are exclusively CPU tasks. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Alejandro Piñeiro	5c826568ab	v3dv/format: expose correctly if a texture format is filterable We were enabling VK_FORMAT_FEATURE_SAMPLED_IMAGE_FILTER_LINEAR_BIT for any format valid for texturing, but for example, right now we don't support linear filtering on any depth format. This is needed to get some hundreds of tests like this: dEQP-VK.pipeline.sampler.view_type.1d.format.r32g32_sfloat.mag_filter.linear properly skipped (those were all Crashes with the simulator, and almost all Fails with the real device). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Alejandro Piñeiro	9b98d36522	v3dv/descriptor: handle not having a sampler when combining texture and sampler id There are some texture operations (like mipmap query levels) that doesn't require a sampler. In fact, you should ignore it. So we need to take it into account when combining the indexes. nir_tex_instr_src_index is returning a negative value to identify that case, but as we are using a uint32_t to pack both values (for convenience, easy to pack/unpack the hash table key), we just use a uint value big enough to be a wrong sampler id. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Alejandro Piñeiro	ee917d2b78	v3dv/descriptor_set: combine texture and sampler indices OpenGL doesn't have the concept of individual texture and sampler, so texture and sampler indexes have the same value. v3d compiler uses this assumption, so for example, the texture info at the v3d key include values that you need to use the texture format and the sampler to fill (like the return_size). One option would be to adapt the v3d compiler to handle both, but then we would need to adapt to the lowerings it uses, like nir_lower_tex, that also take the same assumption. We deal with this on the Vulkan driver, by reassigning the texture and sampler index to a combined one. We add a hash table to map the combined texture idx and sampler idx to this combined idx, and a simple array to the opposite map. On the driver we work with the separate indices to fill up the data, while the v3d compiler works with the combined one. As mentioned, this is needed to properly fill up the texture return size, so as we are here, we fix that. This gets tests like the following working: dEQP-VK.glsl.texture_gather.basic.2d.depth32f.base_level.level_2 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Alejandro Piñeiro	9d8b1b01c3	v3dv/descriptor: move descriptor_map_get_sampler, add and use get_image_view First one as we plan to use get_sampler on more places, second one just to get cleaner code. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Iago Toral Quiroga	93fd897c28	v3dv: compute tile granularity for each subpass We must update our check for whether the render area is tile-aligned for each subpass, since the hardware will update tile sizes for each RCL. Fixes: dEQP-VK.renderpass.suballocation.attachment_allocation.roll.8 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Iago Toral Quiroga	ad64add4f5	v3dv: create a v3dv_cmd_buffer_subpass_resume helper This is the same as the subpass start version, only that it won't emit subpass clears. This is necessary when resuming a subpass from a partial clear to make sure we don't try to clear subpass attachments again. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Iago Toral Quiroga	debdea6cb5	v3dv: push/pop more state during meta operations Since a meta partial clear starts a new render pass, we need to store all state that can be changed with vkCmdBeginRenderPass. Also, since the meta clear pipeline sets dynamic state, we also have to restore that. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Alejandro Piñeiro	dedff7446a	v3dv/descriptor_set: support for immutable samplers They are bound at the set layout, and cannot be changed. From VkDescriptorSetLayoutBinding spec: "pImmutableSamplers affects initialization of samplers. If descriptorType specifies a VK_DESCRIPTOR_TYPE_SAMPLER or VK_DESCRIPTOR_TYPE_COMBINED_IMAGE_SAMPLER type descriptor, then pImmutableSamplers can be used to initialize a set of immutable samplers. Immutable samplers are permanently bound into the set layout and must not be changed; updating a VK_DESCRIPTOR_TYPE_SAMPLER descriptor with immutable samplers is not allowed and updates to a VK_DESCRIPTOR_TYPE_COMBINED_IMAGE_SAMPLER descriptor with immutable samplers does not modify the samplers (the image views are updated, but the sampler updates are ignored)" We stored them as part of the set layout. It also means that when we need the sampler (like for texture operations) we can't just ask for a descriptor, as it would not have the sampler. A new method is created. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Iago Toral Quiroga	7e96f9b219	v3dv: restrict render pass clears to the render area The problem with this is that TLB clears always clear and store full tiles, so if our render area is not perfectly aligned to tile boundaries we end up clearing all pixels in tiles that are only partially covered. In this scenario we have to avoid using TLB clears and instead fallback to clearing by rendering a scissored quad in the clear color, like we do for partial clears in vkCmdClearAttachments. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Iago Toral Quiroga	e5642a36a0	v3dv: store the clip window in the command buffer state We will need this so we can match a render area for a new render pass against the current clip rect and decide if we need to make adjustments. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Iago Toral Quiroga	d6d8bfbb4a	v3dv: implement proper caching for partial clear pipelines So far we have been caching the first pipeline we produced and always reusing that, which is obviously incorrect. This change implements a proper cache and also takes care of releasing the cached resources when the device is destroyed. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Iago Toral Quiroga	7a39e5e902	v3dv: implement partial color attachment clears This is achieved by rendering a quad in the clear color for each layer of each attachment being cleared. Right now we emit each clear in a separate job with a single attachment framebuffer, but in the future we may be able to extend the solution to using multiple render targets and clear multiple attachments with a single job. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Iago Toral Quiroga	c6dca4c529	v3dv: allow to create shader modules from NIR This will come in handy when the driver needs to generate its own shaders, such as for partial clears. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Alejandro Piñeiro	07addb4183	v3dv/cmd_buffer: update shader variants at CmdBindDescriptorSets/CmdBindPipeline Specially after CmdBindDescriptorSets, it is likely that we would need a new shader variant, like for example if sampler descriptor sets are bound. At that moment a new v3d key is populated, using as base the one used at pipeline creation, so only cmd_buffer depending values are changed. Then a new variant is requested. Note that internally it is handled with a cache, so no new compilation will be done if not needed. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Alejandro Piñeiro	f76dad8f89	v3dv/pipeline: add support for shader variants So far, we were doing the compilation to qpu when the pipeline was created (as part of vkCreateGraphicsPipeline). But this would not be correct when some specific descriptors are involved, like textures. For that case some nir lowerings depend on the texture format, and that info is not available until the specific descriptors are bound to the command buffer. In the same way, the same command buffer with a given pipeline could get their descriptor bound again. So it would be needed to support compilation variants of the same shader. So finally, the v3d_key would work as keys, as the variants would be tracked with a hash table. This commit introduces the new structures for that. What we were building as the final qpu shader would become the initial default variant for the pipeline. We are also saving the keys used at that point, to avoid needing to fully regenerate them when a new variant is created. Not just for performance, but also to avoid needing to track the graphics pipeline create info structure. The code to handle updating the current variant would be done on following commits. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Alejandro Piñeiro	272f0d9cfb	v3dv/descriptor_set: added support for samplers This include SAMPLER, COMBINED_IMAGE_SAMPLER and SAMPLED_IMAGE descriptors. In order to support them we do the pre-packing of TEXTURE_SHADER_STATE and SAMPLER_STATE when Images and Samplers (respectively) are created. Those packets doesn't need to be tweaked later, so we upload them to an bo. A possible improvement of this would be that the descriptor pool manages a bo for all descriptors, that suballocate for each descriptor allocated. This is what other drivers do (and as far as I understand, one of the reasons of having a descriptor pool). Immutable samplers are not supported, will be handled on a following patch. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Alejandro Piñeiro	34910532fd	v3dv/pipeline: sampler lowering Add a pass to lower "uniform sampler2d" and others to their texture/sampler idx. This commit basically imports the same lowering from turnip. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Alejandro Piñeiro	f2d16529d1	v3dv/cmd_buffer: push constants not using descriptor anymore v3dv_descriptor is going to be expanded with more data, so it doesn't make sense anymore to handle a fake descriptor for the push constants. Introducing a new struct, that is just a pair bo/offset. Initially named v3dv_resource, as it could be the base to reuse bos for different resources (like assembly bo) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:28 +00:00
Iago Toral Quiroga	1e54b62a61	v3dv: implement dynamic state for blend constants Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:28 +00:00
Iago Toral Quiroga	4c15131b1d	v3dv: always flush draw calls if we are doing sRGB blending Not quite sure why this is required though. Conversion from/to sRGB happens on tile loads and stores, with the tile buffer being always linear, so there should be no difference. Fixes all test failures in: dEQP-VK.pipeline.blend.format.r8g8b8a8_srgb.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:28 +00:00
Iago Toral Quiroga	761b0c02f8	v3dv: add an 'always flush' mode In this mode, which can be activated with V3D_DEBUG=always_flush like in the GL driver, we flush every draw call separately. For now this is useful for debugging, but we can also set the flag internally on specific jobs when we identify scenarios where we need the same behavior. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:28 +00:00
Iago Toral Quiroga	d73bb59113	v3dv: implement color blending Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:28 +00:00
Iago Toral Quiroga	4c8531a144	v3dv: fix depth/stencil clear color Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:28 +00:00
Iago Toral Quiroga	8ed2e53e0d	v3dv: support submits without a command buffer It is valid to submit with an empty list ofcommand buffers, however, we still need to wait on the pWaitSemaphores provided and only signal the pSignalSemaphores and fence once we have finished waiting on them to honor the semantics of the submission. Because waiting and signaling happens in the kernel, the easiest way to do this is to submit a trivial no-op job to the GPU. To do this, we need to refactor some of our code so that code that might have been operating on a command buffer starts operating on a job instead, so we can resuse most of our infrastructure to create the no-op job. Additionally, because no-op jobs are created internally by the driver, we are responsible for destroying them too. For this, we bind a fence to each no-op job we submit and we test for completion of in-flight no-op jobs (and destory them if completed) every time vkQueueSubmit is called. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:28 +00:00
Iago Toral Quiroga	5cfc3b5cc9	v3dv: handle the case where we fail to allocate a new job gracefully There are a handful of tests that simulate 'out of memory' situations during swapchain image creation, and these can lead to failed job allocations when the driver is running on the prime blit path, as that involves creating a command buffer. The tests expect us to handle this scenario gracefully and return an appropriate OOM error as a result. This make sure we don't try to dereference a job if we failed to allocate it so we don't crash and can return the OOM error gracefully in the process. Fixes: dEQP-VK.wsi.xlib.swapchain.simulate_oom.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:28 +00:00
Iago Toral Quiroga	521f86272b	v3dv: fix incorrect sizing of the vertex attribute state array Fixes crashes in: dEQP-VK.glsl.conversions.matrix_combine.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:28 +00:00
Alejandro Piñeiro	cb7f148020	v3dv/descriptor: take into account pPushConstantRanges Push constant tests were still working without taking this into account because we can't really fine graine how much we allocate for them. For now we only use them to know if we should allocate/fill the ubo for push constants or not. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:28 +00:00
Iago Toral Quiroga	313809487c	v3dv: more frame tiling refactors This puts all the information required to setup frame tiling into v3dv_frame_tiling so we no longer need a framebuffer to start a frame. This makes the code simpler, since frame tiling calculations happen automatically when we start a new frame and simplifies the implementation of copy and clear operations that used to requiere that we setup a fake framebuffer with no actual attachments, which was a bit of a kludge. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:28 +00:00
Iago Toral Quiroga	2a3fa914d4	v3dv: rewrite frame tiling setup So far we have been getting away with computing frame tiling information for the framebuffer object, but that is not correct, since different subpasses may access different subsets of the framebuffer, with each requiring a different configuration because the number of render targets and the maximum bpp can change for each subpass. This adds a v3dv_frame_tiling struct to keep the frame tiling information and rewrites the code to compute this for every new job we start. Fixes a bunch of tests in dEQP-VK.pipeline.render_to_image.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:28 +00:00
Iago Toral Quiroga	44cda4dbfe	v3dv: fix subpass tracking in the command buffer state When we create a new job for a new subpass, we might have to finish the current job for the previous subpass, so it is important that we we don't get ahead of ourselves and increment the current subpass index in the command buffer state until we are really done finishing the previous job. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:28 +00:00
Iago Toral Quiroga	6a909387c2	v3dv: take memory format from appropriate miplevel for image load/store We were storing the format of the base miplevel in the image view and we were typically using that instead of the taking the format from the appropriate image slice. This was a problem when loading or storing a miplevel other than the base which happened to have a different format. This also removed the tiling field from the image view to avoid repeating the same mistake in the future. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:28 +00:00
Alejandro Piñeiro	5d1089b88d	v3dv/descriptor: support for dynamic ubo/ssbo Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:28 +00:00
Iago Toral Quiroga	d6afbcb800	v3dv: implement host-side event handling functions I am not quite certain that this is the way to go though. Here, we are expecting that the GPU can set/reset the event inside a command buffer as a 1x1 pixel clear for example, however, there is still the question of how we get to implement the command buffer wait on an event, since reading the docs I haven't found any such functionality to be available. We could think of implementing this by splitting the command buffer into multiple jobs at the wait command, and then using a separate thread for job submissions that would poll the event UBO before sending it to the kernel, but that looks like a bit of a kludge. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:27 +00:00
Iago Toral Quiroga	aba2a66fb8	v3dv: implement vk{Create,Destroy}BufferView For now this is not particularly useful, since we can't bind this to a texel buffer descriptor yet. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:27 +00:00
Iago Toral Quiroga	83554840a5	v3dv: add support for primitive restarts on indexed draw calls Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:27 +00:00
Alejandro Piñeiro	e9085e56ed	v3dv/cmd_buffer: support for push constants By default they are trivially lowered to load_uniform. We still need to allocate an UBO for push constants, used for those that are accessed using a non-const index. This is automatically handled by the compiler, as it cames back as asking a QUNIFORM_UBO_ADDR. This is what already does for gallium. Note that if needing the UBO, we are uploading the full push constant data. An improvement would be to try to upload only the data that needs to rely on the UBO (so non-const accesses to uniforms). Also, the code is not handling getting out of space from the UBO bo. This would be tackled at a different commit. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:27 +00:00
Iago Toral Quiroga	b3f9a3beaf	v3dv: implement vkCmdUpdateBuffer Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:27 +00:00
Iago Toral Quiroga	c7404f4e2d	v3dv: Add more supported formats to our format table Some of these may need additional work to work for real, but we should be able to support them. We also include some formats that are not supported for images, but that we want to support for buffers, such as R32G32B32 for a vertex buffer. In the future we might want to expand the format table to specify which formats are supported for buffers. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:27 +00:00
Iago Toral Quiroga	0dc4416827	v3dv: ignore image view aspects for depth/stencil attachments From the Vulkan spec: "When an image view of a depth/stencil image is used as a depth/stencil framebuffer attachment, the aspectMask is ignored and both depth and stencil image subresources are used." So in that scenario, we ignore the aspect mask on the view and go check the actual format of the underlying image to decide if we have depth or depth+stencil aspects. This gets VkRunner's depth-buffer.shader_test to pass. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:27 +00:00
Iago Toral Quiroga	1c6938484e	v3dv: implement indexed draws Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:27 +00:00
Iago Toral Quiroga	f48788896e	v3dv: implement vkCmdClearAttachments For now this only implements a fast path using the tile buffer, so it can only be used when clearing full images, but this is good enough for VkRunner. The implementation is a bit tricky because this command executes inside a render pass, and yet, since we are using the tile buffer to clear, this needs to go in its own job. This means that with this, we need to be able to split a subpass into multiple jobs which creates some issues. For example, certain operations, such as the subpass load operation (particularly if it is a clear) should only happen on the first job of the subpass and subsequent jobs in the same subpass should always load. Similarly, we should not discard the last store on an attachment unless we know it is the last job for the last subpass that uses the attachment. To handle these cases we add two new flags to the job, one to know if the job is not the first in a subpass (is_subpass_continue) and another one to know if a job is the last in a subpass (is_subpass_finish). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:27 +00:00
Iago Toral Quiroga	255d4245fd	v3dv: add a helper to get the Z/S buffer from an aspect mask We will be using this when we need to implement other clearing commands. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:27 +00:00
Iago Toral Quiroga	68692297ba	v3dv: add a helper to compute the hardware clear color We will be using this when we implement other clearing commands (currently this is only used for the atatchment load clear operation). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:27 +00:00
Alejandro Piñeiro	34cf475102	v3dv/descriptor_set: support for array of ubo/ssbo For that we include the array_index when asking for a ubo/ssbo index from the descriptor_map. Until now, array_index was not included, but the descriptor_map took into account the array_size. This had the advantage that you only need a entry on the descriptor map, and the index was properly return. But this make it complex to get back the set, binding and array_index back from the ubo/ssbo binding. So it was more easy to just add array_index. Somehow now the "key" on the descriptor map is the combination of (set, binding, array_index). Note that this also make sense as the vulkan api identifies each array index as a descriptor, so for example, from spec, VkDescriptorSetLayoutBinding:descriptorCount "descriptorCount is the number of descriptors contained in the binding, accessed in a shader as an array" Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:27 +00:00
Alejandro Piñeiro	9afd24f89a	v3dv: initial descriptor set support Focused on getting the basic UBO and SSBO cases implemented. So no dynamic offset, push contanst, samplers, and so on. This include a initial implementation for CreatedescriptorPool, CreateDescriptorSetLayout, AllocateDescriptorSets, UpdateDescriptorSets, CreatePipelineLayout, and CmdBindDescriptorSets. Also introduces lowering vulkan intrinsics. For now just vulkan_resource_index. We also introduce a descriptor_map, in this case for the ubos and ssbos, used to assign a index for each set/binding combination, that would be used when filling back the details of the ubo or ssbo on other places (like QUNIFORM_UBO_ADDR or QUNIFORM_SSBO_OFFSET). Note that at this point we don't need a bo for the descriptor pool, so descriptor sets are not getting a piece of it. That would likely change as we start to support more descriptor set types. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:27 +00:00
Iago Toral Quiroga	29a7d99c4a	v3dv: fix the mess with dynamic state handling The general idea is that we always emit from our dynamic state, and when a particular piece of state is not dynamic, we just set our dynamic state from the pipeline state, however, the implementation was quite confusing: the mask of dynamic states flagged states that were not dynamic and some places woud mix dirty flags and dynamic state flags. We also were not updating the dynamic state mask in the command buffer, etc. This patch, hopefully, simplifies all this and makes it less confusing, starting by making the dynamic state mask flag dynamic states, fixing the places where we would confuse dirty state flags with dynamic state flags, making sure that our command buffer state is setup correctly and that we only emit state when it is actually dirty. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:27 +00:00
Iago Toral Quiroga	134afdfd32	v3dv: implement dynamic stencil states Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:27 +00:00
Iago Toral Quiroga	590f29349f	v3dv: implement early Z optimization Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:27 +00:00
Iago Toral Quiroga	72040f9182	v3dv: fix viewport state from pipeline We were not computing viewport transform for static viewports provided with the pipeline state. Also, we were not copying the transform into the command buffer state when we bound the pipeline. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:27 +00:00
Iago Toral Quiroga	55acd9f1ea	v3dv: implement stencil testing This works on combined depth/stencil formats only, separate stencil is not supported yet. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:27 +00:00
Iago Toral Quiroga	df34b8fac6	v3dv: compute subpass ranges for attachments at render pass creation time Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Iago Toral Quiroga	a7a402c8bd	v3dv: don't always skip tile buffer stores Otherwise we would lose updates relevant to subsequent subpasses in the same renderpass that read or partially write the attachment. The only scenario where we can safely do this is on the last subpass that uses the attachment, so long as we don't need to emit the store for the clear. This also fixes a bug in the computation of the first subpass that uses an attachment. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Iago Toral Quiroga	4d0e497a3e	v3dv: implement support for depth testing This ignores stencil for now and focuses on depth testing without support for early depth testing. To implement this we need to start considering how many of our framebuffer attachments are color attachments, since some of the computations we use to determine tile sizes and binning configuration depend on this. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Iago Toral Quiroga	9a72a79d1c	v3dv: rename drm device fields so they are more explict Also, remove references to the master_fd. This is only useful for VK_KHR_display which we are not currently implementing. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Iago Toral Quiroga	1d92c314f8	v3dv: implement device detection on actual hardware When running on the real hardware we have two devices: the v3d render node and the vc4 display node. We need the latter to allocate winsys BOs for v3d to render into. Since exporting these BOs is a privileged operation, we need to obtain the fd for this device through the display server. For now we only support doing this through the XCB DRI3 platform. Also, do not duplicate or re-open the DRM devices when creating logical devices. The simulator checks that the file descriptor is exactly the same we used to initialize it when we created the physical device and aborts if it sees a different fd number, even if it points to the same device. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Iago Toral Quiroga	6ab4c51d5c	v3dv: hook up WSI support Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Iago Toral Quiroga	0fd72b6ac4	v3dv: implement VK_KHR_get_physical_device_properties2 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Alejandro Piñeiro	85f1f0690e	v3dv: provide default values for input attributes Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Alejandro Piñeiro	aad44cc614	v3dv: vertex input support This includes: * Implementation for CmdBindVertexBuffers * Gather vertex input info during CreateGraphicsPipelines (pipeline_init) and SHADER_STATE_ATTRIBUTE_RECORD prepacking * Final emission of such packet during CmdDraw (cmd_buffer_emit_graphics_pipeline) Default attributes values will be handled on a following patch. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Iago Toral Quiroga	047ee913e8	v3dv: implement fences Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Iago Toral Quiroga	56b4eac8d6	v3dv: implement semaphore waits and signals on queue submissions Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Iago Toral Quiroga	d24dbd87ad	v3dv: implement vk{Create,Destroy}Semaphore Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Iago Toral Quiroga	0b08f83817	v3dv: implement vkDeviceWaitIdle This is really a hack, but it is better than not having anything. In the future we should have a syncobject that we could wait on. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Alejandro Piñeiro	4279145124	v3dv: prepack VCM_CACHE_SIZE Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Alejandro Piñeiro	5226928557	v3dv: partial prepack of the gl_shader_state_record We can't prepack all the record, as addresses need the job, and uniforms depend on dynamic value. Also due cl_emit_with_prepacked and v3dv_pack asserting correct values, we need to define two values twice, that lead to move vpm_config to the pipeline. In any case, the latter will be useful when we start to prepack more stuff. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Alejandro Piñeiro	0e49c6237f	v3dv/cmd_buffer: emit CFG_BITS Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Iago Toral Quiroga	a904b6f082	v3dv: create a helper to start a new frame Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Iago Toral Quiroga	22e431e626	v3dv: merge subpasses into the same job when possible Generally, we can do this when they render to the same collection of attachments and we only need to emit a single RCL for them. To implement this, we need to track the first subpass that is included in the job and rewrite our loads and stores in the RCL to refer to that subpass instead of the current subpass (which would be the last included in the RCL). When we merge jobs we also reuse the tile state/alloc BOs and we only emit the binning setup once. The environment variable V3DV_NO_MERGE_JOBS can be set to disable job merging and have each subpass be in a separate job. This can be useful for debugging issues spawning from incorrect subpass merges. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Iago Toral Quiroga	0b5df86c98	v3dv: only clear attachments on the first subpass that uses them Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Iago Toral Quiroga	2271489077	v3dv: rewrite attachment state tracking Keep track of attachment state for each attachment in the renderpass instead of tracking it only for the attachments in the current subpass. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Iago Toral Quiroga	5259175fe8	v3dv: add the concept of a job As we make progress towards more complex submissions we will need to split our command buffers into smaller executable units (jobs) that we can submit indepdently to the kernel. This will be required to implement pipeline barriers, split subpasses that have depedencies on previous subpasses, split render passes that use more than 4 render targets, etc. For now we keep things simple and we only keep one job as current recording target in the command buffer, and we generate a new one with every subpass or with any commands we see outside of a render pass (only vkCopyImageToBuffer for now). In the future we probably want to optimize this by merging subpasses into the same job when possible, etc. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Iago Toral Quiroga	94b4e1869c	v3dv: rename and make compute_tile_size_for_framebuffer() public We will use this when we implement copying images to buffers using the TLB, where we'll need to setup a framebuffer and tiling configuration for the TLB store to the destination buffer. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Alejandro Piñeiro	2b02117e64	v3dv: add v3dv_write_uniforms Initial port of the equivalent v3d_write_uniforms, to be used by the cmd_buffer when emitting the drawing packets. Initially doesn't include all the quniform types, only those needed by the initial basic vulkan tests. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Alejandro Piñeiro	a3b440cc2a	v3dv/cmd_buffer: cache viewport translate/scale Before that commit we were calling get_viewport_xform to get those values twice (to emit scissor and viewport), and we found that we would need that info even more times. So let's just compute that info when setting the viewport, and reuse the values. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Alejandro Piñeiro	8431196eb0	v3dv/cmd_buffer: start to emit draw packets Starting using only the packet VERTEX_ARRAY_PRIMS Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Alejandro Piñeiro	67aa8029a8	v3dv/cmd_buffer: emit shader_state packets Values still doesn't take into account having vertex elements data, but keeps some of that half-done code in comments. It would be better to do that when we get an example using it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Alejandro Piñeiro	f752cdbab1	v3dv/pipeline: start to track dynamic state Starting with Viewport/Scissor data from VkGraphicsPipelineCreateInfo. Note that initially this can be somewhat counter-intuitive. What we are really doing it is filling up the structs with the dynamic stuff from the pipeline, when such is not defined as dynamic. This is what anv/radv does, and basically means that we treat both in the same way, so easier after that. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Alejandro Piñeiro	5285d83974	v3dv: CmdSetViewport and CmdSetScissor implementation This commit also introduces adding dynamic state definitions, dirty flags, and setting them on such methods, although this commit still doesn't use all that info yet. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Alejandro Piñeiro	fca4dcee9f	v3dv: initial stub for CmdBindPipeline Right now it only sets the pipeline to the command buffer state. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Alejandro Piñeiro	60145629a2	v3dv: initial CreateGraphicsPipeline/DestroyPipeline implementation The basic to get the spirv built to nir, including calling some common nir passes. Pending deep review if all those are needed or if we miss some, but for that it would be better to be able to run existing tests. Enough to get assembly generated for simple tests. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Alejandro Piñeiro	6cb24a3b39	v3dv/format: add v3dv_get_format_swizzle Heavily based on the equivalent v3d v3d_get_formar_swizzle. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:25 +00:00
Iago Toral Quiroga	3f00638cff	v3dv: setup color clear values at subpass setup time Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:25 +00:00
Iago Toral Quiroga	bd3c28bf69	v3dv: keep track of the numbre of BOs in a command buffer Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:25 +00:00
Iago Toral Quiroga	6e4471c14c	v3dv: implement vkEndCommandBuffer Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:25 +00:00
Iago Toral Quiroga	61399b21c0	v3dv: emit scissor to render area and precompute hw color clear values Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:25 +00:00
Iago Toral Quiroga	98697e1fb5	v3dv: precompute more tiling info at framebuffer creation time Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:25 +00:00
Iago Toral Quiroga	e6a7d88b53	v3dv: make v3dv_layer_offset public Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:25 +00:00

... 2 3 4 5 6 ...

381 Commits