KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Gert Wollny	944f132578	r600/sfn: fix buffer offset for ssbo writes Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9584>	2021-03-14 13:28:44 +00:00
Gert Wollny	a47a12b37c	r600/sfn: remove unused code Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9584>	2021-03-14 13:28:44 +00:00
Gert Wollny	2e1cff0e85	r600/sfn: sort alu opcodes in switch statememt This makes it easier to figure out which ops are already handled. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9584>	2021-03-14 13:28:44 +00:00
Axel Davy	8283ed65cf	radeonsi: Limit the size of the in-memory shader cache The in-memory shader cache can get significantly huge in some rare cases. Limit its size to 64MB on 32 bits, and 1GB else. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9578>	2021-03-13 21:51:38 +00:00
Axel Davy	c23c3b7a77	st/nine: detect worker threads syncs for systemmem If we detect too many syncs, use the stream_uploader, which avoid syncs. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9451>	2021-03-13 21:23:24 +00:00
Axel Davy	205201c968	st/nine: Use stream_uploader for bad cases of systemmem Using stream_uploader in case we cannot use unsynchronized improves performance. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9451>	2021-03-13 21:23:24 +00:00
Axel Davy	05e6b0f200	st/nine: Always use DYNAMIC with SYSTEMMEM The disadvantages of the DYNAMIC path over the non-dynamic path are minor. The advantages are many. As we don't know if bad behaving apps use non-dynamic SYSTEMMEM in a dynamic fashion, let's be safe and always be dynamic. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9451>	2021-03-13 21:23:24 +00:00
Axel Davy	67c401b112	st/nine: Force DYNAMIC SYSTEMMEM for sw vertex processing SW vertex processing buffers are supposed to be sorted in RAM and to be immediately idle after use (thus you can write at the same location again immediately). DYNAMIC SYSTEMMEM is by far the best fit for now for these kind of buffers, though it can be improved further. Indeed the use pattern will cause a lot of syncs with csmt actived. Thus disable csmt when full sw vertex processing is requested. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9451>	2021-03-13 21:23:24 +00:00
Axel Davy	380c2bf887	st/nine: Optimize dynamic systemmem buffers Some apps use DYNAMIC SYSTEMMEM buffers and fill them in a dynamic fashion with discard and nooverwrite locking flags. To prevent uploading the whole buffer every draw call, track the region needed for the draw call, and upload only that region (or a bit more in order to ease valid region tracking). Try to aggressively upload with discard/unsynchronized. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9451>	2021-03-13 21:23:24 +00:00
Axel Davy	01c8071f93	st/nine: Track pending MANAGED buffer uploads The tracking enables to avoid flushing the csmt thread when locking repeatedly the same buffer, as long as the locks are non-overlapping. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9451>	2021-03-13 21:23:24 +00:00
Axel Davy	26a9e2bb15	gallium/util: Add new u_box helpers Signed-off-by: Axel Davy <davyaxel0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9451>	2021-03-13 21:23:24 +00:00
Axel Davy	a5819e1b49	st/nine: Use correct bind flag at buffer creation The buffer bind creation flag was incorrect for index buffers Signed-off-by: Axel Davy <davyaxel0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9451>	2021-03-13 21:23:24 +00:00
Axel Davy	a2c3db34d1	st/nine: Optimize DrawPrimitiveUp Try to keep the same vertex buffer state when having several consecutive DrawPrimitiveUp. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Nicer DrawPrimUp opt patch Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9451>	2021-03-13 21:23:24 +00:00
Axel Davy	9168a37692	st/nine: Refactor DrawPrimitiveUp . Use the same fonction as DrawPrimitive . Drop the user vbuf path . Avoid setting NULL vertexbuffer to the context Signed-off-by: Axel Davy <davyaxel0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9451>	2021-03-13 21:23:24 +00:00
Axel Davy	247d135f67	st/nine: Implement SYSTEMMEM buffers same as MANAGED Use the MANAGED path for SYSTEMMEM buffers. Tests point out the locking behaviour of SYSTEMEMM buffers is very close to MANAGED. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9451>	2021-03-13 21:23:24 +00:00
Axel Davy	634adfa253	st/nine: Optimize EndScene So far we did nothing on EndScene, but the API doc says it flushes the GPU command queue. The doc implies one can optimize CPU usage by calling EndScene long before Present() is called. Implementing the flush behaviour gives me +15-20% on the CPU limited Halo. On the other hand, do limit the flush to only once per frame. 3DMark03/3Mark05 get a 2% perf hit with the patch, but 5% if I allow more flushes. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9451>	2021-03-13 21:23:24 +00:00
Mike Blumenkrantz	7b2e35128c	lavapipe: set more resource bind flags using image/buffer usage bits we have this info available, so we can be more precise about it stop the printf spam! Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9573>	2021-03-13 04:14:17 +00:00
Ilia Mirkin	a04a797b89	mesa: only report INCOMPLETE_FORMATS for GLES1 / desktop The enum is defined in EXT_framebuffer_object (available on desktop), and was copied to OES_framebuffer_object (available in GLES1). The ES2 spec has no mention of such an enum. If the underlying implementation does not support this, it will set a generic incomplete error (as is done in st_cb_fbo.c if mixed_formats == false). This should fix the following dEQP tests on ES2 drivers: dEQP-GLES2.functional.fbo.completeness.attachment_combinations.rbo_tex_none_none dEQP-GLES2.functional.fbo.completeness.attachment_combinations.tex_rbo_none_none Fixes: `fd017458bc` (mesa: fix fbo attachment size check for RBs, make it trigger in ES2) Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4444 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9572>	2021-03-12 21:40:04 +00:00
Eric Anholt	87c85139ea	virgl: Update GLES expectations. These are only used in manual runs, and I noticed in a recent run that something on master had fixed them. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9509>	2021-03-12 21:14:27 +00:00
Eric Anholt	1cf8cf99c8	mesa/st: Fix precompile misses on compat GL VSes writing to color outputs. In compat GL, the gl_FrontColor/BackColor gets clamped unless you explicitly turn it off with ARB_color_buffer_float, and most apps doing floating point color rendering are going to be using non-compat varyings anyway instead of hoping that ARB_color_buffer_float exists. Thus, guess that the app will get clamping on the color outputs to reduce draw-time recompiles. Saves 60 VS recompiles on half-life-2.trace on freedreno (and similarly for many other compat GL apps). Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9509>	2021-03-12 21:14:27 +00:00
Eric Anholt	35247ac017	mesa/st: Add perf debug for draw-time variant compiles. This ARB_debug_output is particularly useful in that default apitrace will log it, so we can find when we're doing draw-time recompiles we shouldn't be. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9509>	2021-03-12 21:14:27 +00:00
Eric Anholt	6864ff35e9	mesa/st: Unify st_get_vp_variant() and st_get_common_variant(). Fixes the following issues I can see: - Non-VS NIR shaders not gathering info after ucp lowering - Non-VS NIR shaders not doing GL_CLAMP lowering - Non-VS TGSI shaders not setting up stream output state. - Non-VS TGSI shaders leaking lower_depth_clamp lowering across variant compiles. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9509>	2021-03-12 21:14:27 +00:00
Mike Blumenkrantz	536533f556	lavapipe: fix push descriptor set indexing the push set index isn't always 0, so the offsets need to be updated in order to avoid clobbering other sets Fixes: `6be19765cf` ("lavapipe: add support for VK_KHR_push_descriptor") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9558>	2021-03-12 21:02:23 +00:00
Lionel Landwerlin	8e36f7da85	anv: move L3 initialization to device init on Gen11+ On Gen11+ we tend to only use a single configuration, so just get rid of the stalls in the command buffers by moving the config to device initialization. v2: drop NULL config check Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9534>	2021-03-12 20:49:01 +00:00
Lionel Landwerlin	581e68bc99	anv: move L3 config emission to genX_state.c We're about to reuse this at device initialization. v2: Handle NULL configs on Gen12+ v3: Handle NULL config in emission helper (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9534>	2021-03-12 20:49:01 +00:00
Danylo Piliaiev	914e7a7f73	turnip: set zmode to A6XX_EARLY_Z if FS forces early fragment test Specifying "early_fragment_tests" in fragment shader takes precedence over our internal conditions. Fixes test: dEQP-VK.fragment_operations.early_fragment.early_fragment_tests_stencil Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9569>	2021-03-12 20:11:28 +00:00
Lionel Landwerlin	1d658c8598	intel/dev: switch over to mesa log infrastructure v2: Deal with line wrapping (Jordan) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Suggested-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9052>	2021-03-12 18:54:07 +00:00
Lionel Landwerlin	cef063826e	anv: stop using get_param for things queried by gen_device_info Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9052>	2021-03-12 18:54:07 +00:00
Lionel Landwerlin	5f057f6e91	i965: stop using get_param for things queried by gen_device_info All this code exists in gen_device_info. It will print out the same warning. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9052>	2021-03-12 18:54:07 +00:00
Lionel Landwerlin	f08f8eaa48	iris: use gen_device_info helper to get subslice total gen_device_info will try to use the most recent uAPI to get this number and will fallback to this get_param. So just use what was queries by gen_device_info_from_fd(). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9052>	2021-03-12 18:54:07 +00:00
Lionel Landwerlin	fa17bbe00c	intel/dev: add warning on missing kernel uAPI for Gen8+ We carry those warnings in i965/anv. Let's have them here. Next commit we remove some code from the drivers. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9052>	2021-03-12 18:54:07 +00:00
Lionel Landwerlin	196d0aa110	intel/dev: add helpers to compute subslice/eu total Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9052>	2021-03-12 18:54:07 +00:00
Mike Blumenkrantz	45e6fcef4b	zink: remove debug printf accidentally left this in while bisecting Fixes: `929a748401` ("zink: avoid hashing states without descriptors") Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9564>	2021-03-12 17:45:44 +00:00
Michel Zou	862671369d	vulkan: fix CreateRenderPass prototype Fixes compilation on win32 Fixes: `2523c477` " turnip: Move the CreateRenderPass wrapper..." Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9530>	2021-03-12 17:37:38 +00:00
Mike Blumenkrantz	bf0c5988b2	zink: ci updates Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9540>	2021-03-12 17:21:40 +00:00
Mike Blumenkrantz	980a8b3fc0	zink: implement an extremely dumb resource memory cache this stores a number (currently 5) of backing allocations for resources for later reuse when creating matching resources because this is on the screen object it requires locking, but this is still far faster than allocating new memory each time Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9540>	2021-03-12 17:21:40 +00:00
Axel Davy	6205400612	st/nine: Fix compilation issue in nine_debug Some compilers complain of the implicit conversion. Make it explicit. Reported by: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6121 https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9250 Signed-off-by: Axel Davy <davyaxel0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9563>	2021-03-12 17:12:58 +00:00
Mike Blumenkrantz	378bb07110	zink: optimize pipeline hashing we can reorder the pipeline state and avoid hashing the big arrays based on context states that we already have available to us also we can do incremental hashes for the shader modules to further reduce hashing Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9539>	2021-03-12 17:02:20 +00:00
Mike Blumenkrantz	da08d67364	zink: move vertex_buffers_enabled_mask to non-hashed part of pipeline state we can use this to optimize pipeline hashing Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9539>	2021-03-12 17:02:20 +00:00
Samuel Pitoiset	1aabfc77d3	radv: fix initialization of disable_compression when clearing color image Just make sure it's initialized properly. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9473>	2021-03-12 16:39:21 +00:00
Mike Blumenkrantz	0f7d32073d	zink: ci updates some flakes here that will be resolved in the very near future Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	a5984bdf78	zink: fix arrays of texel buffer descriptors these need to pass a constructed array of buffers, not a single buffer per descriptor Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	929a748401	zink: avoid hashing states without descriptors this is unnecessary hashing which decreases the accuracy of the states Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	72a06746bf	zink: remove image layouts from descriptor states these are always the same value, so there's no need to track them Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	5700cbfa1a	zink: always use VK_IMAGE_LAYOUT_SHADER_READ_ONLY_OPTIMAL for sampler bindings if a resource is used multiple times with different samplers then this can result in conflicting layouts, so it's better to just use the general case here Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	e4421c4b82	zink: don't create descriptor barrier hash tables for cached descriptor set this won't be used, so there's no point in allocating it Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	38e6e44262	zink: remove struct zink_descriptor_resource from descriptor updating this can be inferred from a resource's barrier usage, and then by merging the batch-tracking for resources into the barrier loop, this deduplicates calls for resources which are bound for multiple stages as well as removing another iteration during descriptor updating Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	4a3b344b23	zink: directly use resource count from pool instead of accumulating every time this is a surprisingly impactful speedup Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	201cd1a8c0	zink: track resource count on descriptor pool object we can do this once here instead of accumulating the count all over later also remove the debug value on the set which duplicated this Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	d9793a8a60	zink: reuse descriptor barriers across draws if we aren't invalidating the descriptor set then we can safely reuse its barriers to avoid doing any sort of hashing during descriptor updating Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	c55e2fb59c	zink: move descriptor binding out of the update codepath we won't always need to update sets that we bind Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	dc020db723	zink: pre-size descriptor transition hash table this avoids costly rehashing Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	80d9f3aa34	zink: simplify check for knowing whether descriptor updating is needed if a program has at least one pool object set then it will have descriptors Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	fe31566fbb	zink: remove intermediate func for descriptor set getting we can simplify this a bit by just getting the objects in the function Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	141ff0e091	zink: allow reuse of zink_descriptor_pools between programs now that we've split these into their own objects, we can reuse them across programs with matching binding layouts for a given type, giving even more value to the descriptor set cache by increasing reuse Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	ee40550b87	zink: move descriptor type to pool object from set this is a little less memory usage Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	58500979a8	zink: track number of sets currently allocated per descriptor pool if we're going to be sharing pools across programs, it's no longer sufficient to check the hash populations, so we need a separate counter to know when we have to flush Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	871022dc84	zink: change program pointer on struct zink_descriptor_set to pool pointer the program using the set isn't very relevant now that there's the intermediate pool struct which stores the sets Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	817de3c776	zink: break out all the descriptor pool/layout stuff into a new struct by pushing these into a separate struct, we can eventually reuse them between programs to make the descriptor set caches even more valuable Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	3d4e6c641b	zink: break out descriptor stuff into new files this is getting to be enough code that it's getting to be a hassle to keep with the program stuff also rename a couple of the moved functions Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	eee488abad	zink: deduplicate VkWriteDescriptorSet setup no functional changes Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	ce0524f009	zink: break out image descriptor updating all the descriptor updates are now broken out, so update_descriptors is now just a dispatch for calling the other update functions Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	eb375dc939	zink: break out sampler descriptor updating Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	e94c2d84b3	zink: break out ssbo descriptor updating by the power of grayskull, this is now readable Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	e4076d557f	zink: split out ubo descriptor updating now that we're doing incremental updates, we can refactor this to be much more readable by having separate functions for each type of descriptor update Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	2c83ea13d2	zink: add VkPipelineLayout to zink_program meta struct this lets us simplify a little more code Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	0248e6b969	zink: don't double iterate all the per-batch sets on reset we're already iterating once here, so we can definitely skip iterating again and just do the removals inlined Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	74a0badb91	zink: add flag for recycled descriptor sets this lets us skip a hash lookup in the last_set fastpath Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	3f561bbc83	zink: improve descriptor cache invalidation we can pass the offset of the 'invalid' flag directly to the resources to let them run through and invalidate their sets in time for us to detect it when we recycle the set during batch reset and throw it onto our allocation array additionally, by adding refs for the actual objects used in a descriptor set, we can ensure that our cache is as accurate as possible Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	1a9f3ba2d7	zink: only batch-reference the program in use once per descriptor update now that we're guaranteed to know what our batch is earlier, we can move this referencing around to reduce the number of hash lookups we'll perform here Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	9df3147264	zink: move descriptor set allocation near the top of update_descriptors no functional changes, but it moves the only point in the function where a flush can occur out of the way which lets us get the batch we'll be using immediately Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	514b5ced97	zink: add program pointer to desc set struct we generally want to avoid this, but it lets us skip a lot of hash lookups, which is a performance hit Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	b4105e8ac0	zink: actually flag all used resources as used during update_descriptors num descriptors != num bound resources, so this needs to be a separate count/index Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	40be7f6353	zink: store last-used descriptor set for each type of set for quick reuse we can avoid some hash lookups this way, and we can also avoid putting the null descriptor sets back into the array since we know they'll always be the last-used set this also helps our null set reuse be more explicit since we never have to put these sets back into an array or anything Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	8c908ee6a4	zink: pre-hash sampler views and states this greatly reduces the amount of on-demand hashing that we have to do, as now in worst case we'll be hashing 2x uint32_t per sampler descriptor vs 2x vulkan object pointer (sometimes a uint64_t) and a uint32_t Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	fa77cc2fbb	zink: add a null sampler view descriptor hash to the screen pre-hashing this saves us time later since we can just reuse it Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	67f9038b85	zink: introduce descriptor states this aims to track the states of descriptors so that we can do more incremental updating it also enables the descriptor cache to be more robust by providing the incremental data as the key Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	6d233e74ad	zink: use dynamic offsets for first ubo this lets us avoid invalidating the ubo descriptor state, which reduces our cache overhead Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	d4304ab0d2	zink: split descriptor sets based on usage this uses multiple descriptor sets so that we can perform more incremental updating and increase the value that we get from our cache by only invalidating one state at a time Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	c5ce20e97a	zink: move surface refs to the end of descriptor updating keeping descriptor stuff with descriptor stuff Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	d06c2ae21f	zink: reorder descriptor barrier applying during updating just putting this with the rest of the non-descriptor stuff Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	761a04a464	zink: move streamout to draw_vbo this isn't part of descriptor updating Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	b10a3be45e	zink: add second level cache for descriptor sets this lets us store sets that are valid but not currently used so that we can either prolongue a cache entry's lifetime or cannibalize a valid entry if necessary to avoid needing to allocate more sets Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:53 +00:00
Mike Blumenkrantz	6c4c995836	zink: add caching for descriptor sets this is a lot of churn that more or less amounts to hashing the descriptor state during draw and then performing lookups with this to determine whether we can reuse an existing descriptor set instead of allocating one Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:52 +00:00
Mike Blumenkrantz	840ea21fa1	zink: add scaling factor for descriptor set bucket allocations now descriptor sets allocate in increasingly large batches based on how many sets a program has allocated, multiplying by 10 any time the sets hit a power of 10, e.g., if 100 sets are allocated, we now allocate in batches of 100 Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:52 +00:00
Mike Blumenkrantz	bd78710d21	zink: add bucket allocating for descriptor sets now instead of allocating a single descriptorset at a time, we allocate a defined count of descriptorsets (currently 10) at once and keep a separate array of allocated-and-unused sets that we can pop sets off of Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:52 +00:00
Mike Blumenkrantz	4461a871b8	zink: remove flushes for batch descriptor use now that we do all our tracking and flushing per-program, we can throw out the batch-based flushing and let our descriptors free-range Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:52 +00:00
Mike Blumenkrantz	d21e88719a	zink: even better handling for descriptor oom in addition to ensuring that all our batches stay under the max size by cycling them whenever we get too many active descriptors going, we now do per-program descriptor pools, so we can do some limiting there as well to ensure that we aren't letting any one program hog all the resources Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:52 +00:00
Mike Blumenkrantz	4ec1d5e70c	zink: add helper function for cycling a batch this saves a bit of typing here and there Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:52 +00:00
Mike Blumenkrantz	7ae8509673	zink: use more precise sizing for descriptor pools now that these are on the program structs, we can size the pools optimally based on the set layouts Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:52 +00:00
Mike Blumenkrantz	9b3e2ba30c	zink: move descriptor set alloc function to zink_program.c try to keep things organized... Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:52 +00:00
Mike Blumenkrantz	0b1a356393	zink: store and reuse descriptorsets after batch completion since we know that the layout is going to match, we can store descriptorsets in the program and then overwrite them instead of needing to free sets or reset the pool Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:52 +00:00
Mike Blumenkrantz	0e6ef05878	zink: move descriptor sets/pools from batches to programs this lets us much more accurately create descriptor sets using the exact size required by a given program instead of creating gigantic monolithic sets it does (temporarily) incur a perf hit since sets are now freed after each use rather than being reset Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:52 +00:00
Mike Blumenkrantz	5ddfffcca1	zink: simplify some update_descriptor code by using the generic zink_program here we can shorten the code a little Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9348>	2021-03-12 16:32:52 +00:00
Samuel Pitoiset	15fc0c351a	radv: fix potential clears with non renderable images on GFX9+ Found by inspection. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9474>	2021-03-12 16:09:42 +00:00
Samuel Pitoiset	9863ed9bf3	radv: fix meta save/restore state with non renderable images For non renderable images, the driver performs some transfer operations with compute shaders on the gfx queue, but it was saving the gfx state instead of the compute state. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9474>	2021-03-12 16:09:42 +00:00
Samuel Pitoiset	b42204e4d0	radv: disable sampling with VK_FORMAT_R64_SFLOAT It's not supported. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4433 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9519>	2021-03-12 16:07:54 +01:00
Rohan Garg	44c38cae9a	virgl: Support the ETC1_RGB8 format as virglrenderer supports it Signed-off-by: Rohan Garg <rohan.garg@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9497>	2021-03-12 15:35:47 +01:00
Erico Nunes	c453921c91	lima: run nir dce after nir_lower_vec_to_movs Some of the 'vec*' nir instructions may hold references to dead code until the nir_lower_vec_to_movs pass runs. After nir_lower_vec_to_movs, that code can finally be cleaned by dce, so add an additional dce pass. This not only potentially further removes unneeded code from the nir representation but also prevents bugs with the compiler from special case unused code that is not expected (e.g. root undef type nodes). Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9507>	2021-03-12 11:46:54 +00:00
Juan A. Suarez Romero	3f1c375581	ci/broadcom: allow custom kernels So far, testing VC4 and V3D/V3DV requires the CI runners having access to a Raspberry Pi 3/4 kernel, and the correspondent modules and bootloader files. If a different kernel must be used, it means touching the runners to provide them. This commit adds the option to define an URL pointing to a (compressed) tarball containing such files, without requiring dealing with the runners. This link is provided through the `BM_BOOTFS` job variable. The tarball must contain two directories in the root: a `/boot` directory (containing the kernel, DTBs and bootloader files), and a `/lib/modules` (or `/usr/lib/modules`) with the kernel modules. Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9527>	2021-03-12 11:03:17 +00:00
Danylo Piliaiev	1a2f1e3f47	turnip: fill VkMemoryDedicatedRequirements We support VK_KHR_dedicated_allocation so we must fill VkMemoryDedicatedRequirements. Vulkan spec states: "[...] requiresDedicatedAllocation may be VK_TRUE under one of the following conditions: The pNext chain of VkImageCreateInfo for the call to vkCreateImage used to create the image being queried included a VkExternalMemoryImageCreateInfo structure, and any of the handle types specified in VkExternalMemoryImageCreateInfo::handleTypes requires dedicated allocation, as reported by vkGetPhysicalDeviceImageFormatProperties2 in VkExternalImageFormatProperties::externalMemoryProperties.externalMemoryFeatures, the requiresDedicatedAllocation field will be set to VK_TRUE." All handle types require dedicated allocation at the moment. Fixes: dEQP-VK.api.external.memory.opaque_fd.dedicated.image.info dEQP-VK.memory.requirements.dedicated_allocation.buffer.regular dEQP-VK.memory.requirements.dedicated_allocation.image.transient_tiling_optimal Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9086>	2021-03-12 11:56:47 +02:00
Tapani Pälli	d7b3454af3	anv: fix compilation due to missing vk_format_from_android Fixes: `4fb6c051c9` ("anv: Move vk_format helpers to common code") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4428 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9549>	2021-03-12 10:35:01 +02:00
Tapani Pälli	0759822f64	anv/android: fix compilation failure Fixes: `3e6d3bca1d` ("anv/android: Fix size check for imported gralloc bo") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9549>	2021-03-12 10:34:53 +02:00
Dave Airlie	49bb53ba43	lavapipe: add EXT_sampler_filter_minmax support Hook up the extension Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9423>	2021-03-12 16:02:30 +10:00
Dave Airlie	6adbf6c86c	llvmpipe: add reduction mode support Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9423>	2021-03-12 16:02:25 +10:00
Dave Airlie	1fb43ae9bf	lavapipe: enable KHR_multiview Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9399>	2021-03-12 05:05:51 +00:00
Dave Airlie	cbd01045bc	lavapipe: add render pass support for multiview Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9399>	2021-03-12 05:05:51 +00:00
Dave Airlie	3c08eee1bd	lavapipe: add input attachment support for multiview Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9399>	2021-03-12 05:05:51 +00:00
Dave Airlie	8c6d4d470e	lavapipe: add draw support for multiview Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9399>	2021-03-12 05:05:51 +00:00
Dave Airlie	4d72515e32	lavapipe: add clear support for multiview Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9399>	2021-03-12 05:05:51 +00:00
Dave Airlie	e81cd37363	llvmpipe: add view index support to rasterizer Reviewed-by: Roland Scheidegger <sroland@vmware.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9399>	2021-03-12 05:05:51 +00:00
Dave Airlie	b76242b9c8	llvmpipe: add the view index callback from draw This just stores the view index into setup Reviewed-by: Roland Scheidegger <sroland@vmware.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9399>	2021-03-12 05:05:51 +00:00
Dave Airlie	b5f686c93b	draw: add tess/gs support for multiview index Reviewed-by: Roland Scheidegger <sroland@vmware.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9399>	2021-03-12 05:05:51 +00:00
Dave Airlie	a2bee6df5f	draw/vs: pass the view index to the vertex shader Reviewed-by: Roland Scheidegger <sroland@vmware.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9399>	2021-03-12 05:05:51 +00:00
Dave Airlie	a417843a3c	draw: pass the view index to the render driver Reviewed-by: Roland Scheidegger <sroland@vmware.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9399>	2021-03-12 05:05:51 +00:00
Dave Airlie	03cbb7b104	draw: add view_mask rendering support This loops the draws per-view above the instance rendering Reviewed-by: Roland Scheidegger <sroland@vmware.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9399>	2021-03-12 05:05:51 +00:00
Dave Airlie	b10b55f3d3	draw: refactor out the instances drawing code This can be reused nice for multiview if refactored out Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9399>	2021-03-12 05:05:51 +00:00
Dave Airlie	267d216bcb	draw: add interface to notify renderer of the current view index Reviewed-by: Roland Scheidegger <sroland@vmware.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9399>	2021-03-12 05:05:51 +00:00
Dave Airlie	9f0fd85474	gallivm: add support for load_view_index intrinsic This just adds the system value Reviewed-by: Roland Scheidegger <sroland@vmware.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9399>	2021-03-12 05:05:51 +00:00
Dave Airlie	974f2e6c6a	gallivm: mark subpass input attachments as 2d arrays This matters when multiview is enabled. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9399>	2021-03-12 05:05:51 +00:00
Dave Airlie	e3b8f449e1	gallium: add a view mask to the draw command This allows the caller to specify the view mask for this draw in a multiview draw environment This has been packed into the upper nibble and 2 bits of the index size to retain the struct size as small as possible for tc. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9399>	2021-03-12 05:05:51 +00:00
Timothy Arceri	684f97de80	glsl: fix declarations of gl_MaxVaryingFloats gl_MaxVaryingFloats was not removed from core until 4.20 and is still available in compat shaders. Found while writing some new CTS to test the correct declarations of this constant. Fixes: 0ebf4257a385i ("glsl: define some GLES3 constants in GLSL 4.1") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9514>	2021-03-12 04:30:32 +00:00
Jason Ekstrand	6d16d929f3	iris: Add an iris_write_reg macro Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9537>	2021-03-12 04:17:39 +00:00
Jason Ekstrand	5b792d79a4	anv: Add an anv_batch_write_reg macro Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9537>	2021-03-12 04:17:39 +00:00
Jason Ekstrand	5f192b190f	anv,genxml: Handle L3SQCREG1_SQGHPCI in GenXML Technically, this is only one field on IVB but it's two on BYT and so it makes things easier if we split it for all Gen7. While we're here, make some of the other fields in L3SQCREG1 Booleans. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9537>	2021-03-12 04:17:39 +00:00
Ilia Mirkin	987fef5f0e	nvc0: enable minmax reductions on gm200+ Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9487>	2021-03-12 00:05:24 +00:00
Ilia Mirkin	41aad1c120	st/mesa: add EXT_texture_filter_minmax support This also trivially adds ARB_texture_filter_minmax, since the EXT variant is a strict superset. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9487>	2021-03-12 00:05:24 +00:00
Ilia Mirkin	6384dcaf7c	mesa: add tracking of reduction mode This is used to expose ARB/EXT_texture_filter_minmax. Note that only the EXT_* enable is provided since the ARB one would require proper handling of some formats not being supported. For now this is force-enabled for everything. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9487>	2021-03-12 00:05:24 +00:00
Dave Airlie	7c999249ef	gallium: add a sampler reduction cap + settings This is to allow for VK_EXT_sampler_filter_minmax GL_EXT_texture_filter_minmax support Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9487>	2021-03-12 00:05:24 +00:00
Michael Tang	8016a098fc	microsoft/spirv_to_dxil: Fix spirv2dxil I/O to use binary mode Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9513>	2021-03-11 23:43:47 +00:00
Michael Tang	d4a51160ad	util: Make os_read_file use O_BINARY on Windows Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9513>	2021-03-11 23:43:47 +00:00
Eric Anholt	5785fdac63	u_format: Mark the generated pack/unpack src/dst args as restrict. Calling code to pack/unpack with overlap would be already be undefined. Cuts 50k of text on x86_64 release builds from the compiler having more freedom in the src/dst loads knowing that they don't interfere with each other. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9500>	2021-03-11 23:26:34 +00:00
Anuj Phogat	d4b231bb80	intel/isl: Drop intel_ prefix in function names This change is in line with naming convention used in isl. We want to keep intel_ prefix reserved for common code. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9532>	2021-03-11 23:01:56 +00:00
Ian Romanick	da7389eced	nir/range_analysis: Simplify analysis of bcsel union_ranges was previously guarded by 'ifndef NDEBUG'. After removing that, I noticed that the two tables were identical. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9108>	2021-03-11 22:00:30 +00:00
Ian Romanick	7019cd84c0	nir/search: Use range analysis for is_finite There are only a couple patterns that use is_finite, so the changes aren't huge. Mostly shaders from Batman Arkham City and a few shaders from Shadow of the Tomb Raider were affected. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Tiger Lake Instructions in all programs: 160902591 -> 160902489 (-0.0%) SENDs in all programs: 6812270 -> 6812270 (+0.0%) Loops in all programs: 38225 -> 38225 (+0.0%) Cycles in all programs: 7429003266 -> 7428992369 (-0.0%) Spills in all programs: 192582 -> 192582 (+0.0%) Fills in all programs: 304539 -> 304539 (+0.0%) Ice Lake Instructions in all programs: 145301634 -> 145301460 (-0.0%) SENDs in all programs: 6863890 -> 6863890 (+0.0%) Loops in all programs: 38219 -> 38219 (+0.0%) Cycles in all programs: 8798589772 -> 8798575869 (-0.0%) Spills in all programs: 216880 -> 216880 (+0.0%) Fills in all programs: 334250 -> 334250 (+0.0%) Skylake Instructions in all programs: 135892010 -> 135891836 (-0.0%) SENDs in all programs: 6802916 -> 6802916 (+0.0%) Loops in all programs: 38216 -> 38216 (+0.0%) Cycles in all programs: 8442597324 -> 8442583202 (-0.0%) Spills in all programs: 194839 -> 194839 (+0.0%) Fills in all programs: 301116 -> 301116 (+0.0%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9108>	2021-03-11 22:00:30 +00:00
Ian Romanick	f4a7dbc58f	nir/range_analysis: Fix analysis of fmin, fmax, or fsat with NaN source Recall that when either value is NaN, fmax will pick the other value. This means the result range of the fmax will either be the "ideal" result range (calculated above) or the range of the non-NaN value. Previously, something like fmax({gt_zero}, {lt_zero, is_a_number}) would return a range of gt_zero. However, if the "gt_zero" parameter is NaN, the actual result will be the "lt_zero" parameter. This analysis depends on the is_a_number analysis also added in this MR. Assuming this doesn't cause any unforeseen problems, I believe we should wait a bit, then nominate a subset of the series for the stable branches. This fixes the piglit tests tests/spec/glsl-1.30/execution/range_analysis_fmax_of_nan.shader_test tests/spec/glsl-1.30/execution/range_analysis_fmin_of_nan.shader_test from https://gitlab.freedesktop.org/mesa/piglit/-/merge_requests/463. Even with the added fsat fixes, range_analysis_fsat_of_nan.shader_test still fails. There are some other issues there that will be addressed in later commits (in another MR). v2: Add fsat fixes. Suggested by Rhys. Fixes: `405de7ccb6` ("nir/range-analysis: Rudimentary value range analysis pass") Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Shader-db results: All Intel platforms had similar results. (Tiger Lake shown) total instructions in shared programs: 21049290 -> 21049314 (<.01%) instructions in affected programs: 3175 -> 3199 (0.76%) helped: 0 HURT: 17 HURT stats (abs) min: 1 max: 3 x̄: 1.41 x̃: 1 HURT stats (rel) min: 0.20% max: 1.89% x̄: 0.97% x̃: 0.92% 95% mean confidence interval for instructions value: 1.09 1.73 95% mean confidence interval for instructions %-change: 0.75% 1.19% Instructions are HURT. total cycles in shared programs: 855136176 -> 855136406 (<.01%) cycles in affected programs: 37579 -> 37809 (0.61%) helped: 0 HURT: 17 HURT stats (abs) min: 12 max: 20 x̄: 13.53 x̃: 14 HURT stats (rel) min: 0.17% max: 1.13% x̄: 0.79% x̃: 0.91% 95% mean confidence interval for cycles value: 12.53 14.53 95% mean confidence interval for cycles %-change: 0.63% 0.94% Cycles are HURT. Fossil-db results: Tiger Lake Instructions in all programs: 160901033 -> 160902591 (+0.0%) SENDs in all programs: 6812270 -> 6812270 (+0.0%) Loops in all programs: 38225 -> 38225 (+0.0%) Cycles in all programs: 7430016795 -> 7429003266 (-0.0%) Spills in all programs: 192582 -> 192582 (+0.0%) Fills in all programs: 304539 -> 304539 (+0.0%) Ice Lake Instructions in all programs: 145299102 -> 145301634 (+0.0%) SENDs in all programs: 6863890 -> 6863890 (+0.0%) Loops in all programs: 38219 -> 38219 (+0.0%) Cycles in all programs: 8798390846 -> 8798589772 (+0.0%) Spills in all programs: 216880 -> 216880 (+0.0%) Fills in all programs: 334250 -> 334250 (+0.0%) Skylake Instructions in all programs: 135889478 -> 135892010 (+0.0%) SENDs in all programs: 6802916 -> 6802916 (+0.0%) Loops in all programs: 38216 -> 38216 (+0.0%) Cycles in all programs: 8442624166 -> 8442597324 (-0.0%) Spills in all programs: 194839 -> 194839 (+0.0%) Fills in all programs: 301116 -> 301116 (+0.0%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9108>	2021-03-11 22:00:30 +00:00
Ian Romanick	aa5d38decd	nir/range_analysis: Add "is a number" range analysis tracking This commit is necessary to support "nir/range_analysis: Fix analysis of fmin and fmax with NaN". No shader-db or fossil-db changes on any Intel platform. v2: Pack and unpack is_a_number. v3: Don't set is_a_number of integer constants. The bit pattern might be NaN. v4: Update handling of b2i32. intBitsToFloat(int(true)) is 1.401298464324817e-45. Return a value consistent with that. Fixes: `405de7ccb6` ("nir/range-analysis: Rudimentary value range analysis pass") Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9108>	2021-03-11 22:00:30 +00:00
Ian Romanick	d4f21b53f2	nir/range_analysis: Add "is finite" range analysis tracking The obvious changes to nir_search_helpers.h are in a separate commit to limit the scope of this change. These additions are really only needed to support the next commit "nir/range_analysis: Add "is a number" range analysis tracking". This reduction in scope is intended to increase the suitability for stable branches. No shader-db or fossil-db changes on any Intel platform. v2: Pack and unpack is_finite. v3: Split nir_search_helpers.h changes into a separate commit. v4: Remove assertion intended for the next commit. Update is_finite comment for fsign. Both noticed by Rhys. Fix is_finite handling for load_const vectors. If any element is not finite, set the flag to false. This is the same way is_integral is already handled. v5: Update handling of b2i32. intBitsToFloat(int(true)) is 1.401298464324817e-45. Return a value consistent with that. Fixes: `405de7ccb6` ("nir/range-analysis: Rudimentary value range analysis pass") Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9108>	2021-03-11 22:00:30 +00:00
Ian Romanick	86fb53b1be	nir/range_analysis: Refactor fsat handling This will greatly simplify a later commit. The assert(r.is_integral) in the eq_zero case is dropped because I don't think it's useful anymore. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9108>	2021-03-11 22:00:30 +00:00
Axel Davy	767270e809	st/nine: Check memfd_create support glibc introduced memfd_create only in its 2.27 release. Check memfd_create support by verifying HAVE_MEMFD_CREATE is defined. Fixes: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9377 Reported by Roman Elshin in https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9451 Signed-off-by: Axel Davy <davyaxel0@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9483>	2021-03-11 21:29:51 +00:00
Danylo Piliaiev	ae3b95daa7	turnip: lower device index to zero Vulkan 1.1 has VK_KHR_device_group and VK_KHR_device_group_creation promoted to core, thus we should handle DeviceIndex built-in. While we are here, also add these extensions to the extensions list, even though they are not doing anything useful. Fixes test: dEQP-VK.compute.device_group.device_index Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9516>	2021-03-11 21:12:52 +00:00
Connor Abbott	ee1f140fd9	freedreno/a6xx: Cleanup SP_XS_CTRL_REG0 definitions The registers were actually different per-stage even though we used the same type, which resulted in a bunch of incorrectly programmed fields and confusion. Move the stage-specific values to the registers themselves, which makes things much less confusing and makes it possible to set "mergedregs" correctly. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9493>	2021-03-11 20:58:39 +00:00
Connor Abbott	9a5596d679	freedreno/registers: Handle typed registers with fields When a bitset is "inline" it should act as-if the its fields were inserted into the register itself. However when initializing the register's bitfield we weren't doing a deep copy of the inline bitfield, so if the register defined additional fields then they would get added to the original inline bitfield and any further registers with the same type would get them. Fix this. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9493>	2021-03-11 20:58:39 +00:00
Connor Abbott	8d55a1e112	freedreno/a6xx: Fix compute threadsize type And use the variable for the other threadsize field. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9493>	2021-03-11 20:58:39 +00:00
Connor Abbott	1d8bf2d0bf	freedreno/computerator: Fix thrsz type And use it for the other thread size field, too Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9493>	2021-03-11 20:58:39 +00:00
Lionel Landwerlin	f3cf70dc8d	intel/tools: fix meson warning Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4434 Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9524>	2021-03-11 20:52:20 +00:00
Pierre Moreau	4a408ff7ea	spirv: Ignore WorkgroupSize in non-compute stages If a SPIR-V module contains for example both a geometry and a compute shader, when processing the geometry shader its vertices out, input primitive and output primitive attributes would get overwritten by the value of the WorkgroupSize. ``` ; SPIR-V ; Version: 1.5 ; Generator: Khronos; 17 ; Bound: 12 ; Schema: 0 OpCapability Geometry OpCapability Shader %1 = OpExtInstImport "GLSL.std.450" OpMemoryModel Logical GLSL450 OpEntryPoint Geometry %main "main" OpEntryPoint GLCompute %main_0 "main" OpExecutionMode %main InputPoints OpExecutionMode %main Invocations 1 OpExecutionMode %main OutputTriangleStrip OpExecutionMode %main OutputVertices 4 OpExecutionMode %main_0 LocalSize 1 1 1 OpSource GLSL 460 OpSource GLSL 460 OpName %main "main" OpName %main_0 "main" OpModuleProcessed "Linked by SPIR-V Tools Linker" OpDecorate %gl_WorkGroupSize BuiltIn WorkgroupSize %void = OpTypeVoid %6 = OpTypeFunction %void %uint = OpTypeInt 32 0 %v3uint = OpTypeVector %uint 3 %uint_1 = OpConstant %uint 1 %gl_WorkGroupSize = OpConstantComposite %v3uint %uint_1 %uint_1 %uint_1 %main = OpFunction %void None %6 %10 = OpLabel OpReturn OpFunctionEnd %main_0 = OpFunction %void None %6 %11 = OpLabel OpReturn OpFunctionEnd ``` Running spirv_to_nir on the SPIR-V sample above and for the geometry entry point would say that (among others): * vertices out: 1 * input primitive: LINES * output primitive: LINES By removing any reference to `%gl_WorkGroupSize`, the output would change to (among others): * vertices out: 4 * input primitive: POINTS * output primitive: TRIANGLE_STRIP Fixes: `7d862ef530` ("spirv: Rework handling of spec constant workgroup size built-ins") v2: * Move the check from inside `handle_workgroup_size_decoration_cb()` to its caller (Caio Marcelo de Oliveira Filho ) * Add an assert on the shader stage before using `workgroup_size_builtin` (Caio Marcelo de Oliveira Filho ) Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Signed-off-by: Pierre Moreau <dev@pmoreau.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9418>	2021-03-11 20:30:38 +00:00
Anuj Phogat	9d95e1bd79	i965: Rename files with "intel_" prefix to "brw_" v2: Rename intel_batchbuffer.c to intel_batch.c and intel_batchbuffer.h to intel_batch.h Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9510>	2021-03-11 10:14:33 -08:00
Anuj Phogat	3096788e5c	i965: Remove blank line at EOF Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9510>	2021-03-11 09:43:03 -08:00
Rhys Perry	38b2e13766	aco: remove vmem/smem score statistics Replaced by the Latency statistic. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8994>	2021-03-11 16:31:19 +00:00
Rhys Perry	a0243f5c47	aco: add ACO_DEBUG=perfinfo This prints the program with each instruction's contribution to it's latency and various factors for the calculation of the Inverse Throughput statistic. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8994>	2021-03-11 16:31:19 +00:00
Rhys Perry	5d6a1095bf	aco: add print option to print program without temporary IDs Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8994>	2021-03-11 16:31:19 +00:00
Rhys Perry	23ecceb160	aco: add latency and inverse throughput statistics Latency is estimanted duration of a single wave, ignoring others in the CU. It is similar to the old cycles statistic except it it's more accurate and considers memory operations. The InvThroughput statistic is a combination of MaxWaves, Latency and the portion of the wave's execution which does not use various resources. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8994>	2021-03-11 16:31:19 +00:00
Rhys Perry	83ce9407f2	aco: add instruction classes These should mostly match LLVM. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8994>	2021-03-11 16:31:19 +00:00
Rhys Perry	0af7ff49fd	aco: lower p_constaddr into separate instructions earlier This allows them to be scheduled properly and simplifies the assembler a little. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8994>	2021-03-11 16:31:19 +00:00
Rhys Perry	ab957bb899	aco: move wait_imm to aco_ir.h Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8994>	2021-03-11 15:35:34 +00:00
Rhys Perry	7d5643c0fe	aco: track divergent and uniform branch depth Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8994>	2021-03-11 15:35:30 +00:00
Rhys Perry	8f71be0a7b	aco: simplify loop_nest_depth tracking in isel Keep track of the current loop depth in Program and set the depth inside Program::insert_block() instead of repeating it every time we insert one. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8994>	2021-03-11 15:35:24 +00:00
Boris Brezillon	442fbcdb47	panfrost: Expose panfrost_modifier_to_layout() Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9517>	2021-03-11 15:10:58 +00:00
Boris Brezillon	825b1f9446	panfrost: Split the sampler and texture count The texture and sampler descriptors are well separated in Vulkan, let's add a new field to allow mixing sampler and texture descs. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9517>	2021-03-11 15:10:58 +00:00
Boris Brezillon	b0f968cf5c	panfrost: Don't count the special vertex/instance ID attributes on Bifrost On Bifrost the vertex/instance ID are preloaded in special registers, no need to add special attribute entries. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9517>	2021-03-11 15:10:58 +00:00
Boris Brezillon	7b9dfc502a	panfrost: Print the correct UBO size when dumping UBO information There's a minus(1) modifier on the entries field. Take it into account. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9517>	2021-03-11 15:10:58 +00:00
Boris Brezillon	3559efb9bf	panfrost: Allow passing an explicit UBO index for the sysval UBO UBO index assignment is a bit special in Vulkan, it's based on the descriptor set layout, which doesn't know about shaders' internal UBOs (our sysval UBOs). Extend the backend compilers so we can place sysval UBOs where we want: after all explicit UBOs. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9517>	2021-03-11 15:10:58 +00:00
Boris Brezillon	92d9f090d9	panfrost: Add a knob to disable the UBO -> push constants optimization I'm just too lazy to implement the logic to prepare push constant buffers in the Vulkan driver. Besides, Vulkan has explicit push constants, which AFAIK is not handled in the compiler backends yet, and that will probably conflict with the UBO -> push constant promotion. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9517>	2021-03-11 15:10:57 +00:00
Lucas Stach	2229328cf9	renderonly: close the gpu fd when destroying renderonly Currently the screen destruction closes the dup'ed fd, but not the original renderonly gpu fd, which is kept around for the lifetime of the renderonly. Squashed revert of "vc4: Don't leak the GPU fd for renderonly usage." (commit `99ef66c325`) as requested by Eric. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Guido Günther <agx@sigxcpu.org> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6983>	2021-03-11 14:41:48 +00:00
Lucas Stach	187218395d	renderonly: remove layering violations The renderonly object is something the winsys creates, so the pipe driver has no business in memcpying or freeing it. Move those bits to the winsys. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Guido Günther <agx@sigxcpu.org> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6983>	2021-03-11 14:41:48 +00:00
Alyssa Rosenzweig	5487847d8c	pan/bi: Implement u{add, sub}_sat Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9520>	2021-03-11 14:30:19 +00:00
Boris Brezillon	3c7634f7d2	pan/bi: Extend the bi_builder to support type variants correctly Some opcodes come with both type and size variants. Right now, only the size is taken into account. Extend the builder to provide wrappers that take a nir_type in addition to the bitsize. While at it, fix wrappers taking a compare operator to use the proper .{i,s,u} variant based on the comparison (equal and non-equal should use .i, other comparisons should use .{u,s}). Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9520>	2021-03-11 14:30:19 +00:00
Boris Brezillon	0113a0a1ee	panfrost: Move pan_special_varying definition to pan_encoder.h Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9520>	2021-03-11 14:30:19 +00:00
Boris Brezillon	1f99bba06e	panfrost: Add a pan_section_offset() helper Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9520>	2021-03-11 14:30:19 +00:00
Boris Brezillon	1758da0a7e	panfrost: Allow passing an explicit global dependency when queuing a job We will have 2 compute jobs per indexed indirect draw, one doing the min-max index search and one patching the cmdstream. The second compute job needs to depend on the first one, as well as the previous indirect draw job to avoid corrupting the indirect draw context which is shared at the batch level (global dependency). Instead of handling that case in panfrost_add_job(), extend panfrost_add_job() to accept an explicit global dependency. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9520>	2021-03-11 14:30:19 +00:00
Boris Brezillon	0bb091fd7c	panfrost: Add a parameter to suppress next job prefetching This is needed for indirect draws so the compute job can patch the vertex/tiler jobs which are following in the chain. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9520>	2021-03-11 14:30:19 +00:00
Boris Brezillon	00b85a0aaf	panfrost: Split the direct and indirect draw logic Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9520>	2021-03-11 14:30:19 +00:00
Boris Brezillon	691c47dd6c	pan/bi: Move int64 lowering before idiv lowering Otherwise all 64 divisions will be skipped. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9520>	2021-03-11 14:30:19 +00:00
Boris Brezillon	f7bbfbaeb5	Revert "pan/bi: Optimize out redundant jumps to #0x0" A block that has all its successors empty is not necessarily a leaf block in the CFG, and removing the JUMP in that causes the shader to continue executing code from another block instead of exiting. This reverts commit `a496b41d50`. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9520>	2021-03-11 14:30:19 +00:00
Rhys Perry	35fe62dad1	radv/llvm: fix enabled_channels for compressed exports The old values seemed to work fine, but the ISA docs recommend 0x0,0x3,0xc and 0xf: COMPR==1: export half-dword enable. Valid values are: 0x0,3,c,f [0] enables VSRC0 : R,G from one VGPR (R in low bits, G high) [2] enables VSRC1 : B,A from one VGPR (B in low bits, A high) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9459>	2021-03-11 13:54:18 +00:00
Rhys Perry	341dd9d834	aco: set compr for fp16 exports Obviously this didn't affect correctness. Not sure about performance. It also changes enabled_channels to match radeonsi. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Fixes: `f29c81f863` ("aco: use VOP2 for v_cvt_pkrtz_f16_f32 if possible") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9459>	2021-03-11 13:54:18 +00:00
Marek Olšák	e6a0f243ea	radeonsi: update pipe_screen::num_contexts This allows skipping mutex locking. Don't take the aux context into account. Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9356>	2021-03-11 05:05:39 +00:00
Marek Olšák	981e55d530	gallium: add pipe_screen::num_contexts for skipping mutex locking in util_range Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9356>	2021-03-11 05:05:39 +00:00
Marek Olšák	728aa749ea	gallium/u_threaded: don't sync in create_stream_output_target Manhattan needs this. radeonsi can handle it since https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9028/diffs?commit_id=33ac9dec91d07ef353e110ac376842d84ec539b4. Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9356>	2021-03-11 05:05:39 +00:00
Rob Clark	c4e5beef07	freedreno: threaded_context async flush support Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9323>	2021-03-11 04:42:16 +00:00
Rob Clark	9dbe2405a3	freedreno: threaded_context support Currently only initialized for a6xx, mostly because that is the easiest setup for me to test and debug at the moment. But the couple a6xx changes should not require counterparts in older gens. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9323>	2021-03-11 04:42:16 +00:00
Rob Clark	1a12d682a6	freedreno: Check cb0 in rebind_resource() Previously we were expecting cb0 to be user_buffer. (We did in some cases upload it to a gpu buffer, but this was an internally allocated buffer and not something subject to rebind.) But with TC it becomes a gpu buffer. (Technically, with pctx->const_uploader, we shouldn't hit the rebind path for cb0, but better to not try to be overly clever.. sooner or later that would bite us.) Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9323>	2021-03-11 04:42:16 +00:00
Rob Clark	00eb60ee59	freedreno/a6xx: Move UBWC demotion to first sampler view bind With threaded_context, CSO creation happens in the frontend thread, which means it is no longer safe to do blits (if needed, for sampler views with format that cannot be UBWC). So move this to the first time that the sampler view is bound. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9323>	2021-03-11 04:42:16 +00:00
Rob Clark	acc2c015b3	freedreno: Add transfer_pool_unsync With threaded_context, in the TC_TRANSFER_MAP_UNSYNC case, we are getting called from the frontend thread, rather than driver thread. So we need a different slab_child_pool for that. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9323>	2021-03-11 04:42:16 +00:00
Rob Clark	0c163e0a45	freedreno: Add fd_replace_buffer_storage() This will be used by threaded_context to avoid stalls in the DISCARD_WHOLE_RESOURCE case (and DISCARD_RANGE cases that can be promoted to DISCARD_WHOLE_RESOURCE). Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9323>	2021-03-11 04:42:16 +00:00
Rob Clark	63649e4101	freedreno: Extract out helper for transfer-map flag munging Split out the usage simplification from main part of transfer_map and handle the threaded-context specific TC_TRANSFER_x flags. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9323>	2021-03-11 04:42:16 +00:00
Rob Clark	4f07a24e41	freedreno: Extend threaded_transfer Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9323>	2021-03-11 04:42:16 +00:00
Rob Clark	1017dc9f6e	freedreno: Extend threaded_resource No functional change, just big churny Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9323>	2021-03-11 04:42:16 +00:00
Rob Clark	5fbaa8033b	freedreno: Restructure transfer_map() Separate the parts that, with threaded_context, can be called from either driver or frontend thread. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9323>	2021-03-11 04:42:16 +00:00
Rob Clark	39d6343a3e	freedreno: Split out batch/resource tracking For threaded_context, to properly handle replace_buffer_storage, we'll need to handle multiple "iterations" of a resource using the same tracking in order to implement transfer_map() correctly. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9323>	2021-03-11 04:42:16 +00:00
Rob Clark	f74ccde2c7	freedreno: Factor out common fd_resource init Before adding new things that would need initialization in both paths, refactor out a shared helper. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9323>	2021-03-11 04:42:15 +00:00
Rob Clark	bcf4562528	freedreno: Fix u_blitter constant-buffer leak We didn't see this before without threaded_context because we (normally) wouldn't upload cb0 (the slot u_blitter uses). But with cb0 getting uploaded we could hit a leak due to constant state only being restored in the fd_blitter_clear() path. Move cb0 save to the one path that uses it. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9323>	2021-03-11 04:42:15 +00:00
Rob Clark	9425b1343e	gallium/u_threaded: use mesa_log for debug msgs On android, this will show up in logcat, rather than being lost into the ether. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9323>	2021-03-11 04:42:15 +00:00
Rob Clark	f2f72ec3fe	gallium/u_threaded: Add helper to assert driver thread Useful for drivers to add some sanity checks to avoid/detect threading issues caused by things that might be called (indirectly) from frontend thread. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9323>	2021-03-11 04:42:15 +00:00
Rob Clark	d2a920ee6e	util: Extract thread-id helpers from u_current Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9323>	2021-03-11 04:42:15 +00:00
Timothy Arceri	1772569449	Revert "glsl: default to compat shaders in compat profile" This reverts commit `6c8cc9be12`. A spec bug was resolved confirming the original behaviour. Also it seems the game Foundation no longer depends on the incorrect behaviour. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9486>	2021-03-11 04:09:49 +00:00
Douglas Anderson	217d6594de	gallium/indices: Use "__restrict" to help the compiler In a perf trace translate_quads_uint2uint_last2last_prdisable() was showing up as a huge hot spot. Digging through the assembly on arm64 found that the compiler wasn't doing any read caching. Specifically, the generated code looked roughly like this: out[j+0] = in[i+0]; out[j+1] = in[i+1]; out[j+2] = in[i+3]; out[j+3] = in[i+1]; out[j+4] = in[i+2]; out[j+5] = in[i+3]; ...and the compiler was loading "i+1" and "i+3" from memory twice for no reason (instead of caching it). If we sprinkle generous amounts of the `__restrict` keyword then the compiler is able to be much smarter. Not only does it avoid double-loading but it also generates better instructions. It uses two LDRD instructions instead of 6 LDR instructions and uses some STRD too. In one example test this increased FPS from ~25.7 to ~34.5. Change-Id: I88bf8bd9ac421fe48a7d6961e224425c3ae7beee Reported-by: Rob Clark <robdclark@chromium.org> Signed-off-by: Douglas Anderson <dianders@chromium.org> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9485>	2021-03-11 03:14:31 +00:00
Jason Ekstrand	e7e297732e	vulkan/alloc: Use char * for pointer arithmetic MSVC doesn't like arithmetic on void *. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9511>	2021-03-10 20:59:59 -06:00
Jason Ekstrand	492b5577f0	vulkan/util: Add a type parameter to vk_multialloc_add We also switch from using __alignof__ to alignof() in util/macros.h which works on MSVC with the one unfortunate downside of requiring an actual type and not a value. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9511>	2021-03-10 20:59:56 -06:00
Jason Ekstrand	c120edd8e8	vulkan/alloc: Add VK_MULTIALLOC_DECL macros These both declare the variable and add it to the allocator in one go. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9511>	2021-03-10 20:59:55 -06:00
Jason Ekstrand	5afdbfe0c8	vk/alloc: Handle zero sizes better in vk_multialloc_add Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9511>	2021-03-10 20:59:53 -06:00
Jason Ekstrand	c22267262e	vulkan: Use ALWAYS_INLINE for multialloc This way it properly compiles on Visual Studio. Fixes: `145444d265` "anv: Move multialloc to common code" Acked-by: Daniel Stone <daniels@collabora.com> Acked-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9506>	2021-03-10 23:15:17 +00:00
Anuj Phogat	96e251bde7	intel: Rename "GEN_" prefix used in common code to "INTEL_" This patch renames all macros with "GEN_" prefix defined in common code. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9413>	2021-03-10 22:23:51 +00:00
Anuj Phogat	65d7f52098	intel: Fix broken alignment due to gen_ prefix renaming Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9413>	2021-03-10 22:23:51 +00:00
Anuj Phogat	692472a376	intel: Rename "gen_" prefix used in common code to "intel_" This patch renames functions, structures, enums etc. with "gen_" prefix defined in common code. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9413>	2021-03-10 22:23:51 +00:00
Anuj Phogat	733b0ee8cb	intel: Rename files with gen_ prefix in common code to intel_ Changes in this patch include: - Rename all files in src/intel/common path - Update the filenames used in source and build files Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9413>	2021-03-10 22:23:51 +00:00
Jason Ekstrand	b9e9f92f73	intel/fs: Handle payload node interference in destinations Starting with `d0d039a4d3`, we emit writes to the push constant chunk of the payload to stomp out-of-bounds data to zero for Vulkan. Then, in `369eab9420`, we started emitting shader preamble code for emulated push constants on Gen12.5 parts. In either of these cases, we can run into issues if we don't have a proper live range for some of the payload registers where they get used for something and then smashed by our push handling code. We've not seen many issues with this yet because it only happens when you have dead push constants. Fixes: `d0d039a4d3` "anv: Emit pushed UBO bounds checking code..." Fixes: `369eab9420` "intel/fs: Emit code for Gen12-HP indirect..." Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9501>	2021-03-10 22:17:41 +00:00
Jason Ekstrand	8b7c2f1800	intel/fs: Use INTEL_MASK for pushish constant address masking It's easier to compare with the HW docs than a pile of hex. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9501>	2021-03-10 22:17:41 +00:00
Yannik Marek	369f9d225d	turnip: fix alpha to coverage in no color and unused attachment cases In cases where the alpha coverage is enabled but the color attachment is either unused or absent there should be a dummy mrt to make the draw behave correctly. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Yannik Marek <yannik@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8952>	2021-03-10 22:02:43 +00:00
Adam Jackson	ea27f2bf09	zink: Fix a thinko in instance setup It really does help to size these arrays correctly. Fixes: `2b4fcf0a06` zink: generate instance creation code with a python script Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9499>	2021-03-10 20:19:00 +00:00
Matt Turner	6ceb6b509e	turnip: Remove unused TU_DEBUG_IR3 flag Replaced by IR3_SHADER_DEBUG=disasm,{vs,...,cs} and unused since the commit referenced below. Fixes: `808992fc50` ("tu: Use the ir3 shader API") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8249>	2021-03-10 18:59:22 +00:00
Eric Anholt	eba1b2a1ba	ci/freedreno: Mark another a5xx TF flake. Showed up with an iommu fault preceding it each time it failed. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9488>	2021-03-10 18:44:16 +00:00
Marek Olšák	e39336a21e	radeonsi: enable RGP on gfx10.3 It seems to work on VanGogh. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9492>	2021-03-10 18:31:04 +00:00
Jason Ekstrand	5d8fa880d6	radv: Drop CreateRenderPass We can use the generic fall-back which calls CreateRenderPass2 instead. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8857>	2021-03-10 18:17:31 +00:00
Jason Ekstrand	8304b4eef7	radv/meta: Use CreateRenderPass2 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8857>	2021-03-10 18:17:31 +00:00
Jason Ekstrand	24414e7ec4	anv: Drop CreateRenderPass Fall back to the common implementation instead. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8857>	2021-03-10 18:17:31 +00:00
Jason Ekstrand	b302159b1c	vulkan: Preserve preserve attachments in CreateRenderPass This is trivial so I really don't know why it wasn't handled in the initial turnip code. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8857>	2021-03-10 18:17:31 +00:00
Jason Ekstrand	147187f754	vulkan: Add some asserts and checks for multiview in CreateRenderPass Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8857>	2021-03-10 18:17:31 +00:00
Jason Ekstrand	5de355b0f9	vulkan: Use correct aspectMask in CreateRenderPass If a VkRenderPassInputAttachmentAspectCreateInfo is provided, we use the aspects specified there. Otherwise, we default to every aspect in the format. For attachments which are not input attachments, aspectMask is left zero. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8857>	2021-03-10 18:17:31 +00:00
Jason Ekstrand	4fb6c051c9	anv: Move vk_format helpers to common code The Android ones we put in anv_android.c. Maybe one day we'll want a vk_android.h to put some common Android stuff but, for now, let's keep it contained to ANV's android code. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8857>	2021-03-10 18:17:31 +00:00
Jason Ekstrand	c7345bd1fb	vulkan: Use VK_MULTIALLOC in CreateRenderPass The variable-length stack allocations are causing issues with ubsan when the array size is zero. Also, a heap allocation is probably safer. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8857>	2021-03-10 18:17:31 +00:00
Jason Ekstrand	145444d265	anv: Move multialloc to common code Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8857>	2021-03-10 18:17:31 +00:00
Jason Ekstrand	2523c47720	turnip: Move the CreateRenderPass wrapper to common code Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8857>	2021-03-10 18:17:31 +00:00
Marek Olšák	3b7b2df509	ac: remove switch cases for pc_lines for compute-only chips Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9389>	2021-03-10 18:02:28 +00:00
Marek Olšák	975e5e262b	ac,radeonsi: use correct VGPR granularity on Aldebaran Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9389>	2021-03-10 18:02:28 +00:00
Marek Olšák	a9da3fc0d1	ac: handle bigger instruction prefetch for Aldebaran Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9389>	2021-03-10 18:02:27 +00:00
Marek Olšák	9fdf69e611	ac/llvm: unpack thread IDs on Aldebaran Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9389>	2021-03-10 18:02:27 +00:00
Marek Olšák	6edf1978d3	ac: set the TCC line size for Aldebaran Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9389>	2021-03-10 18:02:27 +00:00
Marek Olšák	230a6dc55d	ac,radeonsi: add sampler changes for Aldebaran - no 3D and cube textures - no mipmapping - no border color - image_sample is the only supported opcode with a sampler (behaves like _lz) Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9389>	2021-03-10 18:02:27 +00:00
James Zhu	381d3a5a38	amd: add Aldebaran chip enum Signed-off-by: James Zhu <James.Zhu@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9389>	2021-03-10 18:02:27 +00:00
Danylo Piliaiev	2764cf8d32	ir3: use OPC_GETBUF to get size of sampler buffers The maximum value which OPC_GETSIZE could return for one dimension is 0x007ff0, however sampler buffer could be much bigger. Blob uses OPC_GETBUF for them. Fixes tests: dEQP-VK.memory.pipeline_barrier.transfer_dst_uniform_texel_buffer.1048576 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9391>	2021-03-10 17:10:45 +00:00
Danylo Piliaiev	8e6ed9948e	freedreno/a5xx: port handling of PIPE_BUFFER textures from a6xx Otherwise, we won't be able to use OPC_GETBUF to get their size. After this change we also could get rid of the hack for OPC_GETSIZE which scaled the size for texture buffers. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9391>	2021-03-10 17:10:44 +00:00
Danylo Piliaiev	d968995c67	turnip: fix SP_HS_WAVE_INPUT_SIZE value It appears that storage for varyings in a wave has an upper limit of wavesize * max_a831 where max_a831 is 64. Exceeding the limit seam to force gpu to reduce primitives processed per wave, at least calculations make sense with such interpretation. With blob SP_HS_WAVE_INPUT_SIZE never exceeds 64 and setting it to 65 in freedreno leads to a hang. Copied from the commit to freedreno `e5499ca2` Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8187>	2021-03-10 16:50:11 +00:00
Connor Abbott	7b7532b806	freedreno/computerator: Add branching example Mainly to be able to test label resolution without having to replace a shader. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9463>	2021-03-10 16:23:04 +00:00
Connor Abbott	19c7b6f9d6	ir3/parser: Add ability to specify branchstack This lets you test branching with computerator. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9463>	2021-03-10 16:23:04 +00:00
Connor Abbott	a820eb537c	ir3/parser: Support labels This fixes the assembly for many scenarios where you want to use shader replacement. Note: unfortunately this leaks the identifier string created while lexing, but I couldn't find a way to avoid leaking it except for bringing in ralloc or something (which would be way more complicated). The only other place doing something similar in mesa is the glsl parser, which is using ralloc (actually a linear context). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9463>	2021-03-10 16:23:04 +00:00
Connor Abbott	534658f79b	freedreno/computerator: Fix example assembly Use the new bindless cat6 syntax for a6xx. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9463>	2021-03-10 16:23:04 +00:00
Connor Abbott	cd772d5687	ir3/parser: Fix parsing of "0.0" in @const line Trying to specify a floating-point value in a @const line would result in it getting interpreted as a FLUT value and failing parsing. Fix this by making the various FLUT tokens include the surrounding parentheses. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9463>	2021-03-10 16:23:04 +00:00
Marek Vasut	f7dc0520d9	etnaviv: Fix point sprite Z,W coordinate replacement Mesa fixed pipeline texture loading on programmable pipeline hardware emits a generic fragment shader program which contains gl_TexCoord.xyzw as a vec4 and then expects to configure the varying assignments to the shader in the pipeline command stream, to select what is wired to the XYZW fragment shader inputs. This gl_TexCoord.xyzw is turned into texture load with projection (TGSI TXP opcode, similar for NIR). Texture load with projection does not exist in the Vivante GPU as a dedicated opcode and is emulated. The shader program first divides texture coordinates XYZ by projector W and then applies regular TEX opcode to load the texture (i.e. TEX(gl_TexCoord.xyzw/gl_TexCoord.wwww)). For point sprites, XY are the point coordinates from VS, Z=0 and W=1, always. The Vivante GPU can only configure varying to be either of -- point coord X, point coord Y, used, unused -- which covers XYZ, but not W. Z is fine because unused means 0. W used to be 0 too before this patch and that led to division by 0 in shader. The only known way to solve this is to set Z=0, W=1 in the shader program itself if the point sprites are enabled. This means we have to generate a special shader variant which does extra SET to set the W=1 in case the point sprites are enabled. In case of TGSI, emitting the SET.TRUE opcode permits setting W=1 without allocating additional constants. With NIR, use nir_lower_texcoord_replace() to lower TEXn to PNTC, which sets Z=0, W=1, and let NIR optimize the shader. Note that nir_lower_texcoord_replace() must be called before input linking is set up, as it might add new FS input. Also note that it should be possible to simply drop PIPE_CAP_POINT_SPRITE in the long run, ST would then apply the same optimization pass, but that option is so far misbehaving. And for etnaviv TGSI this is not applicable yet. This fixes neverball point sprites (exit cylinder stars) and eglretrace of gl4es pointsprite test: https://github.com/ptitSeb/gl4es/blob/master/traces/pointsprite.tgz Signed-off-by: Marek Vasut <marex@denx.de> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8618>	2021-03-10 11:48:21 +00:00
Iago Toral Quiroga	8525cb1c53	v3dv: call util_cpu_detect() when initializing the instance Fixes this assert in debug builds: in __GI___assert_fail (assertion=0x7ffff731f66b "util_cpu_caps.nr_cpus >= 1", file=0x7ffff731f650 "../src/util/u_cpu_detect.h", line=116, function=0x7ffff7323280 <__PRETTY_FUNCTION__.11654> "util_get_cpu_caps") at assert.c:101 in util_get_cpu_caps () at ../src/util/u_cpu_detect.h:116 in _mesa_float_to_float16_rtz (val=0) at ../src/util/half_float.h:93 in util_format_r16g16b16a16_float_pack_rgba_float (dst_row=0x7fffffffbdc0 "", dst_stride=0, src_row=0x7fffffffbf90, src_stride=0, width=1, height=1) at src/util/format/u_format_table.c:13459 in util_format_pack_rgba (format=PIPE_FORMAT_R16G16B16A16_FLOAT, dst=0x7fffffffbdc0, src=0x7fffffffbf90, w=1) at ../src/util/format/u_format.h:1525 in util_pack_color (rgba=0x7fffffffbf90, format=PIPE_FORMAT_R16G16B16A16_FLOAT, uc=0x7fffffffbdc0) at ../src/gallium/auxiliary/util/u_pack_color.h:432 in v3dv_get_hw_clear_color (color=0x7fffffffbf90, internal_type=6, internal_size=8, hw_color=0x7fffffffbf10) at ../src/broadcom/vulkan/v3dv_cmd_buffer.c:1241 v2: move call from physical device to instance init. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9408>	2021-03-10 11:44:01 +01:00
Iago Toral Quiroga	c057a1211b	broadcom/compiler: disallow ldunif during ldvary sequences if possible This restores many of the hurt shaders from the previous patch at the expense of re-adding ldvary tracking in the scheduler. total instructions in shared programs: 13760415 -> 13755738 (-0.03%) instructions in affected programs: 1207560 -> 1202883 (-0.39%) helped: 5080 HURT: 1731 Instructions are helped. total max-temps in shared programs: 2322991 -> 2322828 (<.01%) max-temps in affected programs: 5063 -> 4900 (-3.22%) helped: 229 HURT: 108 Max-temps are helped. total sfu-stalls in shared programs: 31827 -> 31545 (-0.89%) sfu-stalls in affected programs: 478 -> 196 (-59.00%) helped: 304 HURT: 21 Sfu-stalls are helped. total inst-and-stalls in shared programs: 13792242 -> 13787283 (-0.04%) inst-and-stalls in affected programs: 1220856 -> 1215897 (-0.41%) helped: 5162 HURT: 1697 Inst-and-stalls are helped. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9471>	2021-03-10 07:52:22 +00:00
Iago Toral Quiroga	947e9e42cc	broadcom/compiler: simplify ldvary pipelining We get optimal ldvary pipelining by doing the following: 1) Carefully merge a paired ldvary into the previous instruction when possible. 2) When the above succeeds, flag the ldvary as scheduled immediately so we can merge one of its children into the current instruction. 3) When scheduling ldvary sequences, only pick up instructions that are part of the sequence to avoid picking up something that prevents successful pipelining. This patch skips 3) assuming some hurt shaders in exchange for better scheduling flexibility during ldvary sequences. Besides eliminating most of the code dedicated to special handling ldvary sequences, this also usually allows us to produce better code by merging instructions that are unrelated to ldvary sequences into the ldvary sequences, which is particularly effective to fill up the gaps produced when scheduling the first and last ldvary sequences as well as the gaps produced by flat and noperspective varyings sequences that don't have both mul and add instructions. Notice that there are some hurt shaders, because some times the extra scheduler flexibility can lead to picking up instructions that will break a sequence without compensating for that, typically an ldunif that prevents us from doing the fixup for a follow-up ldvary. We will try to correct some of these cases with the next patch. total instructions in shared programs: 13786037 -> 13760415 (-0.19%) instructions in affected programs: 3201387 -> 3175765 (-0.80%) helped: 16155 HURT: 4146 Instructions are helped. total max-temps in shared programs: 2324834 -> 2322991 (-0.08%) max-temps in affected programs: 22160 -> 20317 (-8.32%) helped: 1340 HURT: 103 Max-temps are helped. total sfu-stalls in shared programs: 30685 -> 31827 (3.72%) sfu-stalls in affected programs: 782 -> 1924 (146.04%) helped: 253 HURT: 1416 Inconclusive result. total inst-and-stalls in shared programs: 13816722 -> 13792242 (-0.18%) inst-and-stalls in affected programs: 3171642 -> 3147162 (-0.77%) helped: 15331 HURT: 4179 Inst-and-stalls are helped. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9471>	2021-03-10 07:52:22 +00:00
Iago Toral Quiroga	d37241bdc4	broadcom/compiler: move code block around These checks depend on prev_inst being set, so move them down below with all the other checks with the same requirement. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9471>	2021-03-10 07:52:22 +00:00
Iago Toral Quiroga	8bcda472a0	broadcom/compiler: add an additional sanity check assert to the ldvary fixup Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9471>	2021-03-10 07:52:22 +00:00
Samuel Pitoiset	077775f3ce	radv: check if dynamic line stipple state changed Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9458>	2021-03-10 07:21:46 +00:00
Samuel Pitoiset	892987e3a0	radv: check if dynamic VRS state changed Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9458>	2021-03-10 07:21:46 +00:00
Samuel Pitoiset	ed391a62f6	radv: do not declare push constants for DCC decompress on compute We don't use push constants at all. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9475>	2021-03-10 07:50:31 +01:00
Sagar Ghuge	0314c7503f	intel/blorp: Fix condition to figure out aux_address Fixes: `4dfabac4` ("blorp/gen12: Don't use aux address if implicit CCS") Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Mark Janes <markjanes@swizzler.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9491>	2021-03-09 22:39:43 -08:00
Sagar Ghuge	e3d221838a	Revert "Revert "blorp/gen12: Don't use aux address if implicit CCS"" This reverts commit `cbd5d82bae`. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9491>	2021-03-09 22:39:20 -08:00

... 3 4 5 6 7 ...

126030 Commits