KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Jesse Natalie	ec415a274e	microsoft/compiler: Emit DS PSV validation and entrypoint metadata Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	9aca56b137	microsoft/compiler: Handle domain location intrinsic Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	8524d04783	microsoft/compiler: Handle load_output in the HS stage as reading a previously written patch constant Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	a39881b788	microsoft/compiler: Handle load_per_vertex_output as LoadOutputControlPoint Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	a550c059c7	microsoft/compiler: For load_input from DS, use loadPatchConstant Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	cc6104dd3f	microsoft/compiler: For store_output from HS, use storePatchConstant In HS, store_per_vertex_output maps to storeOutput in DXIL. The data that isn't per-vertex is patch constants. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	97b6ea71a0	microsoft/compiler: Add a pass for hull and domain shaders to shrink tess level vars DXIL validation will complain if the tess factor signature entries have the wrong number of components for the shader's domain. Make sure that both hull and domain shaders have the right number, and drop loads and stores from the removed components. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	bd2a4fb1b8	microsoft/compiler: Add patch constant signature into PSV and as container blob Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	26247d506e	microsoft/compiler: Gather patch const signature and handle tess factor in it Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	8e42891f69	microsoft/compiler: When sorting patch varyings, adjust location to be in normal varying range This way, patch varyings come before the patch sysvals (tess levels). Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	4bb4d0454d	microsoft/compiler: Overlap patch and non-patch varyings so both are separately 0-indexed Also add tess factors to the list of sysvals that can cause vars to be sorted last. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	b7da3f8647	microsoft/compiler: Fix I/O signatures for tess shaders - Skip patch variables, those go into a separate patch constant signature - Use nir_is_arrayed_io and only strip one level of array when it's true Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	b346f28453	microsoft/compiler: Emit HS PSV validation and entrypoint metadata Note that this requires the shader info "tess" data to be correct. For GLSL tess control shaders, only the output primitive count is automatically available. The rest will need to be either guessed or filled in from a matching tess eval (domain) shader. This is handled by the d3d12 driver in a later patch. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	6c58e1f448	microsoft/compiler: Delete misleading TODO comments about semantic table Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> We've been writing a valid semantic table for a while now. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	f511354a64	microsoft/compiler: Split hull (tess ctrl) shaders into main and patch constant funcs Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	ad2233616c	microsoft/compiler: Handle store_per_vertex_output for HS outputs Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	41af962099	microsoft/compiler: Emit all NIR functions into the DXIL module Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	f6a333f010	microsoft/compiler: Emit functions with actual function names Once we start writing multiple functions, we can't keep calling all of them "main" Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	72812fe9b5	microsoft/compiler: Support emitting multiple functions into a DXIL module The instruction and block lists are moved into a new "function definition" struct, and the DXIL module tracks one at a time for adding instructions into. The NIR side still only emits the main function here though. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	e5f353f2f2	microsoft/compiler: Emit statically-indexed resource handles and scratch later The resource declarations are module-wide, but the resource handles are function-local. A future change will add multi-function support, but requires these handles to be potentially emitted multiple times. The alloca used for scratch is also function-local. This is the same pattern that the DXBC to DXIL converter uses. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	02f46b67cd	microsoft/compiler: Fix typo in enum entry Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	b5cb81f8c1	microsoft/compiler: Add mapping from MESA_SHADER_* to DXIL_*_SHADER for tessellation Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	8ad0393abe	microsoft/compiler: Getting a builtin function with an undeclared signature should be unreachable Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	4ad72b152c	microsoft/compiler: Multi-row output semantics need to write multiple never_writes_masks Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	5e3d64d067	microsoft/compiler: Semantic table should be de-duped for multi-row semantics too Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	218f8302d2	microsoft/compiler: Use driver_location instead of location for inter-stage varying index in GL In the case of two vars being packed into the same register / location, they'll still get unique driver_location, which is what we need. This does require some tweaks to stream output handling, which also needs to produce the varying index. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	14ed624ff3	microsoft/compiler: Force integer I/O vars to use flat/constant interpolation Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Dave Airlie	9b961b9d1d	mesa/st: refactor program translation into one file. This moves the notify callback into the file where it's all called from. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14700>	2022-01-26 00:42:59 +00:00
Dave Airlie	8dfe3c83b6	mesa/st: move program new/delete into mesa Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14700>	2022-01-26 00:42:59 +00:00
Dave Airlie	afce8654df	mesa/st: move st_vertex_program to gl_vertex_program in mesa Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14700>	2022-01-26 00:42:59 +00:00
Dave Airlie	5730772e36	mesa/st: move new ati fragment shader to mesa Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14700>	2022-01-26 00:42:59 +00:00
Dave Airlie	3faa21bda7	mesa/st: collapse st_program into gl_program object. Remove the subclass for this. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14700>	2022-01-26 00:42:59 +00:00
Jordan Justen	8db5937f94	intel/genxml: Extend length of 3DSTATE_DEPTH_BUFFER for gfx12.5 The two added dwords are MBZ. Ref: bspec 46935 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14702>	2022-01-26 00:22:54 +00:00
Jordan Justen	315d632977	intel/genxml: Extend length of 3DSTATE_WM_HZ_OP for gfx12.5 The added dword is MBZ. Ref: bspec 46981 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14702>	2022-01-26 00:22:54 +00:00
Mike Blumenkrantz	0ca6273713	zink: add anv (icl) fails mesa/mesa#5918 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14722>	2022-01-26 00:07:58 +00:00
Mike Blumenkrantz	5e748770b9	zink: never use SpvOpImageQuerySizeLod for texel buffers this is illegal cc: mesa-stable affects KHR-GL46.texture_buffer.texture_buffer_texture_buffer_range Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14696>	2022-01-25 23:56:36 +00:00
Mike Blumenkrantz	5dc28ccb79	zink: update radv fails list Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14717>	2022-01-25 23:43:34 +00:00
Mike Blumenkrantz	0d2f854795	zink: update nv fails more passes Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14716>	2022-01-25 23:01:24 +00:00
Caio Oliveira	448a840b39	intel/fs/xehp: Add unit test for handling of RaR deps across multiple pipelines. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Paulo Zanoni	d107a0bff8	intel/fs: Assert the GPU supports 64bit ops if present at lower_scoreboard time. On platforms where we don't support 64 bit instructions we shouldn't pass such instructions for the code generator to lower into supported instructions, because this makes their execution pipeline unpredictable to the scoreboard lowering pass on XeHP+ platforms. We really should be reducing all these 64 bit instructions before code generation, so here we add an assert to help us catch and fix these cases more easily. Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> [ Francisco Jerez: Also allow has_integer_dword_mul. ] Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	79fb7f9de8	intel/fs: Perform 64-bit CLUSTER_BROADCAST lowering in the lower_regioning pass. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	bdf8ac2466	intel/fs: Honor strided source regions specified by the IR for CLUSTER_BROADCAST. This fixes a bug in the CLUSTER_BROADCAST code generation that causes the original IR region to be ignored, this will be a problem when we start lowering 64-bit CLUSTER_BROADCAST instructions at the IR level, since it will lead to instructions with non-trivial regioning. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	6c8782c135	intel/fs: Perform 64-bit SEL_EXEC lowering in the lower_regioning pass. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	9449b71bdd	intel/fs: Perform 64-bit SHUFFLE lowering in the lower_regioning pass. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	d2d72fccf1	intel/fs: Fix destination suboffset calculations for non-trivial strides in SHUFFLE codegen. One of the two SHUFFLE implementations wasn't taking into account the destination stride at all, and the other (more commonly used) one was taking it into account incorrectly since brw_reg::hstride represents the stride logarithmically, so we need to use a left-shift operator instead of product. Found by inspection. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	d1038197f3	intel/fs: Take into account region strides during SIMD lowering decision of SHUFFLE. This fixes a bug in the handcrafted SIMD lowering done by the SHUFFLE code generation, which wasn't taking into account the source and destination region strides while deciding whether it needs to split an instruction. v2: Use new element_sz() helper instead of left shift. (Lionel) Fixes: `90c9f29518` ("i965/fs: Add support for nir_intrinsic_shuffle") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	44e48751d2	intel/fs: Teach the lower_regioning pass how to split instructions of unsuported exec type. This adds some generic infrastructure that allows splitting any instruction into a number of instructions of a smaller legal execution type. This is meant to replace several instances of handcrafted 64bit type lowering done manually in the code generator, which is rather error-prone, prevents scheduling of the lowered instructions, and makes them invisible to the SWSB pass on Gfx12+ platforms, which will become especially problematic on Gfx12.5+ since the EUs introduce multiple asynchronous execution pipelines which the SWSB pass needs to be able to synchronize to one another, so it's critical for the real execution type of the instruction to be visible to the SWSB pass. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	539c879a6b	intel/fs: Move legal exec type calculation into helper function in lower_regioning pass. Right now the execution type lowering functionality of this pass assumes that an integer type of the original bit size is always acceptable, however we'll want more complex behavior than that in order to leverage this pass to automate the lowering of unsupported 64-bit operations into multiple 32-bit operations. In order to do that calculate the closest legal execution type from a new helper function, and take advantage of that function from the has_invalid_exec_type() helper, along the lines of other lower_regioning() helpers structured as a pair of has_invalid_foo() + required_foo() functions. This shouldn't have any functional changes. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	3886e63033	intel/fs/xehp: Merge repeated in-order read dependencies instead of replacement. Previously the software scoreboard structure would drop previous dependencies for a given register and replace them with the most recent one for the same register when a new instruction (or set of instructions) is processed. This worked correctly on the Gfx12LP platforms this code was originally designed for, because a repeated dependency on the same register would either require the second instruction to synchronize against the first (so the first dependency could be disregarded from that point on) or require the dependency to be RaR and in-order, which allows the synchronization to be optimized out (the first dependency could still be disregarded as well, since the pipeline is in-order). However the latter assumption will break on upcoming Gfx12HP platforms, because they have multiple asynchronous FPU pipelines, so whenever we hit a RaR dependency we need to propagate forward both dependencies, since the order in which both reads will complete is not guaranteed by the hardware in cases where they occur from different asynchronous pipelines. Note that this dependency propagation change requires us to change the definition of dependency::done as well, since that constant is defined to discard any previous dependency information when used as argument for shadow(). This has been reported to fix the following conformance failures on DG2: KHR-GL46.shaders.uniform_block.random.all_per_block_buffers.19 dEQP-GLES3.functional.shaders.derivate.fwidth.* Reported-by: Tapani Pälli <tapani.palli@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5670 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Alejandro Piñeiro	4ab6631949	vc4/nir_lower_blend: update write mask when we update num components As explained at the header of the lowering: "Once this pass is done, the color write will either have one component (for single sample) with packed argb8888, or 4 components with the per-sample argb8888 result." So in several cases the lowering was updating the number of components, so we need to update the writemask too. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14708>	2022-01-25 21:24:03 +00:00

1 2 3 4 5 ...

149376 Commits All Branches Search

149376 Commits

All Branches