KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Jason Ekstrand	96ee78778b	intel/isl: Add support for scratch buffers XeHP adds support for a new surface type for scratch. It's similar to SURFTYPE_STRBUF in that it's a 2D array-of-struct format but the one key difference is that the U coordinate is computed automatically based on the thread ID and only the V coordinate is provided in the dataport message. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11582>	2021-06-25 00:18:29 +00:00
Jason Ekstrand	443627fcc0	intel/genxml: Add SURFTYPE_SCRATCH on GFX version 12.5 Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11582>	2021-06-25 00:18:29 +00:00
Jason Ekstrand	d31dd81292	anv: Claim to be a discrete GPU if has_lmem Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5599>	2021-06-24 16:14:38 +00:00
Jordan Justen	b6a1063c2e	intel/dev: Set has_local_mem for DG1 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5599>	2021-06-24 16:14:38 +00:00
Sagar Ghuge	e505c221fa	anv: Allocate scratch and workaround BO in local memory Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5599>	2021-06-24 16:14:38 +00:00
Sagar Ghuge	6352371ff6	anv: Allocate BO in appropriate region Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5599>	2021-06-24 16:14:38 +00:00
Sagar Ghuge	3f8eca7f82	anv: Wrapper around I915_GEM_CREATE_EXT_MEMORY_REGIONS v2 (Jordan Justin): - add anv_gem_stubs.c impl v3 (Jason Ekstrand): - Use the upstream uAPI - Rework the interface a bit Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5599>	2021-06-24 16:14:38 +00:00
Sagar Ghuge	65e8d72bc1	anv: Query memory region info Create additional memory type with DEVICE_LOCAL_BIT if we have local memory region aviable. v2 (Jason Ekstrand): - Don't leak mem_regions if the second ioctl fails Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5599>	2021-06-24 16:14:38 +00:00
Jordan Justen	cb6feae0b5	intel/devinfo: Add has_local_mem Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5599>	2021-06-24 16:14:38 +00:00
Jason Ekstrand	b8030ab1ea	isl,docs: Add a chapter on AUX state tracking We also update and improve the docs in isl.h which get pulled into this new chapter. Acked-by: Luis Strano <luis.strano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11479>	2021-06-24 13:57:40 +00:00
Jason Ekstrand	94a52bc85c	isl,iris: Move the extra_aux_surf logic into iris This gets rid of the awkward interface for isl_surf_get_ccs_surf where we passed it two aux surfaces and it was supposed to fill out the second one based on whether or not the first one already had stuff in it. Instead, we now pass it three well-labled surfaces: surf, hiz_or_mcs_surf, and ccs_surf which have obvious meanings. This does mean that iris has to carry a bit of logic and we have to flip parameters around in all the callers. But the resulting interface is much cleaner. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11479>	2021-06-24 13:57:40 +00:00
Jason Ekstrand	37f76aab1c	isl: Take a hiz_or_mcs_surf in isl_surf_supports_ccs Whether or not a surface supports CCS on Tigerlake and later is dependent not only on the main surface but also on the MCS or HiZ surface, if any. We were doing some of these checks in isl_get_ccs_surf based on the extra_aux parameter but not as many as we probably should. In particular, we were really only checking HiZ conditions and nothing for MCS. It also meant that, in spite of the symmetry in names, the checks in isl_surf_get_ccs_surf were more complete than in isl_surf_supports_ccs. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11479>	2021-06-24 13:57:40 +00:00
Jason Ekstrand	2d2590a879	isl: Assert some iris invariants in isl_surf_get_ccs_surf The only driver which calls isl_surf_get_ccs_surf with extra_aux != NULL is iris and it always calls it with two aux surfaces and never calls it for CCS twice. We can turn those checks into asserts. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11479>	2021-06-24 13:57:40 +00:00
Dave Airlie	0acd202858	intel/genxml: fix gfx6 GS SVB_INDEX encoding This seems to match what the docs + 965 traces say Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11567>	2021-06-24 06:50:49 +00:00
Tapani Pälli	55951ac28e	anv: fix emitting dynamic primitive topology Initial implementation missed various fields that derive from the primitive topology. This patch fixes 3DSTATE_RASTER/3DSTATE_SF, 3DSTATE_CLIP and 3DSTATE_WM (gen7.x) emission in the dynamic case. Fixes: `f6fa4a8000` ("anv: add support for dynamic primitive topology change") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4924 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11379>	2021-06-23 12:01:45 +00:00
Lionel Landwerlin	9b7cba7724	anv: bound checks buffer memory binding in debug builds Validation layers should warn you about this (VUID-VkBindBufferMemoryInfo-size-01037) but this would be useful for zink debugging. Requested by Zmike. v2: Also check memoryOffset (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11542>	2021-06-23 08:16:57 +00:00
Francisco Jerez	4dc4284342	intel/fs: Implement Wa_14013745556 on TGL+. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11433>	2021-06-23 07:34:22 +00:00
Francisco Jerez	c19cfa9dc2	intel/fs: Fix synchronization of accumulator-clearing W/A move on TGL+. Right now the accumulator-clearing move emitted by the generator for Wa_14010017096 inherits the SWSB field from the previous instruction. This can lead to redundant synchronization, or possibly more serious issues if the previous instruction had a TGL_SBID_SET SWSB synchronization mode. Take the SWSB synchronization information from the IR. Fixes: `a27542c5dd` ("intel/compiler: Clear accumulator register before EOT") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11433>	2021-06-23 07:34:22 +00:00
Francisco Jerez	63abc083ce	intel/fs: Teach IR about EOT instruction writing the accumulator implicitly on TGL+. This is unlikely to have had any negative side effect on the original TGL, but will lead to issues on XeHP+ if the software scoreboard pass isn't able to synchronize the accumulator writes. Fixes: `a27542c5dd` ("intel/compiler: Clear accumulator register before EOT") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11433>	2021-06-23 07:34:22 +00:00
Francisco Jerez	5e7f443de0	intel/fs: Add SWSB dependency annotations for cross-pipeline WaR data hazards on XeHP+. In cases where an in-order instruction is overwriting a register previously read by another in-order instruction, drop the dependency iff the previous read is guaranteed to have occurred from the same in-order pipeline. This should only have an effect on XeHP+ since previous Xe platforms only had one in-order FPU pipeline. The previous workaround we were using for this treated all ordered read dependencies as write dependencies to avoid noise from our simulation environment. Relative to our previous workaround this improves performance of GFXBench5 gl_tess by ~7% on a DG2 system among other single-digit percentual FPS improvements. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11433>	2021-06-23 07:34:22 +00:00
Francisco Jerez	d46bb14d14	intel/fs: Implement Wa_22012725308 for cross-pipe accumulator data hazard. The hardware fails to provide the expected data coherency guarantees for accumulator registers when accessed from multiple FPU pipelines. Fix this by tracking implicit accumulator accesses just like we do for regular GRF registers, but instead of adding synchronization annotations for any dependency we only do it for dependencies with a pipeline mismatch, since the hardware should be able to guarantee proper synchronization for matching pipelines. Note that this workaround handles RaW and WaW dependencies in addition to the WaR dependencies described in the hardware bug report even though cross-pipeline RaW accumulator dependencies should be extremely rare, since chances are the hardware will also hang if we ever hit such a condition. This only affects XeHP+, since all FPU instructions are executed as a single in-order pipeline on earlier Xe platforms. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11433>	2021-06-23 07:34:22 +00:00
Francisco Jerez	385da1fe36	intel/fs: Track single accumulator in scoreboard lowering pass. This change reduces the precision of the scoreboard data structure for accumulator registers, because the rules determining the aliasing of accumulator registers are non-trivial and poorly documented (e.g. acc0 overlaps the storage of acc1 when the former is accessed with an integer type). We could implement those rules but it wouldn't have any practical benefit since we currently only use acc0-1, and for the most part we can rely on the hardware's accumulator dependency tracking. Instead make our lives easier by representing it as a single register. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11433>	2021-06-23 07:34:22 +00:00
Francisco Jerez	231337a13a	intel/fs/xehp: Assert that the compiler is sending all 3 coords for cubemaps. As required by HSDES:14013363432. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11433>	2021-06-23 07:34:22 +00:00
Lionel Landwerlin	7ed0aaced7	nir: use a more fitting index for btd_stack_push_intel Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Lionel Landwerlin	423c47de99	nir: drop the btd_resume_intel intrinsic This is now 100% equivalent to the new rt_resume intrinsic. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Lionel Landwerlin	4d9fcf2799	intel/rt: switch to common pass for shader calls lowering v2: rename for new indices Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Lionel Landwerlin	1dacea10f3	anv: implement caching for ray tracing pipelines v2: Turn a bunch of pointer checks into checks against NULL (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Lionel Landwerlin	fed7ac932f	anv: move trivial return shader to device Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Lionel Landwerlin	7c852f78c4	anv: store more RT shader data in pipeline_stage object Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Lionel Landwerlin	045f4600b1	anv: cache raytracing trampoline shader Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Lionel Landwerlin	ab77aeb488	blorp: add blorp string in shader keys Upon looking at caching the raytracing shader (in particular the trampoline one) I kind of got afraid that some of the keys used for blorp would end up matching other keys. This is because blorp keys are fairly simple. There is no SPIRV module hash included. This change includes a "blorp" string at the beginning of the queue to ensure we don't collide with other keys. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	7479fe6ae0	anv: Implement vkCmdTraceRays and vkCmdTraceRaysIndirect v2: Fix anv_cmd_state::binding_tables array size (Lionel) v2: Fix anv_cmd_state::samplers array size (Lionel) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	ac6d7a1758	anv: Make anv_address::offset 64-bit This allows us to convert a 64-bit address to an anv_address which is useful for working with device addresses. v2: switch to int64_t to keep state pool relative relocation working on non-softpin platforms v3: Update assert to reflect relative offsets (Jason) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	a67d7c9dee	anv: Allow _anv_combine_address with a NULL batch This is required in order to be able to use GenXML pack functions for structs with addresses when you're not packing into a batch. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	f68d64dac0	anv: Add support for vkCmdSetRayTracingPipelineStackSizeKHR Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	02f7964a13	anv: Compute scratch sizes for ray-tracing pipelines and shader groups Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	c3ac9afca3	anv: Create and return ray-tracing pipeline SBT handles Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	b66d3e627a	intel/fs: Don't pull CS push constants if uses_inline_data Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	79dc25d867	anv: Compile trivial return and trampoline shaders These don't necessarily go in any group but are required for dispatch to work properly. The trampoline is a compute shader that is the initial start point for the trace. It's in charge of invoking the actual ray-gen shader. The trivial return shader is used whenever another shader is missing and it does no work except the minimum required to do a stack return. v2: Rebase on upstream changes (Lionel) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	e104555851	anv: Compile ray-tracing shaders This doesn't look too different from other compile functions we have in anv_pipeline.c. The primary difference is that ray-tracing pipelines have this weird two-stage thing where you have "stages" which are individual shaders and "groups" which are sort of mini pipelines that are used to handle hits. For any given ray intersection, only the hit and intersection shaders from the same group get used together. You can't have an intersection shader from group A used with an any-hit from group B. This results in a weird two-step compile. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	379b9bb7b0	anv: Support fetching descriptor addresses from push constants Bindless shaders don't have binding tables so they have to get at the descriptor sets via a different mechanism. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	937ffb1af0	nir/apply_pipeline_layout: Handle bindless shaders They don't have binding tables so they have to use A64 descriptor set access and everything has to be bindless all the time. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	770d331285	anv: Disallow UBO pushing for bindless shaders They don't really have push constants. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	c92fd35848	intel/rt: Use reloc constants for the resume SBT It's going to be attached to the end of the shader binary, not an arbitrary table somewhere in memory. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	705395344d	intel/fs: Add support for compiling bindless shaders with resume shaders Instead of depending on the driver to compile each resume shader separately, we compile them all in one go in the back-end and build an SBT as part of the shader program. Shader relocs are used to make the entries in the SBT point point to the correct resume shader. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	d055ac9bdf	intel/compiler: Add a U32 reloc type Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	55508bbe66	intel/compiler: Generalize shader relocations a bit This commit adds a delta to be added to the relocated value as well as the possibility of multiple types of relocations. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	f7668d6fe5	anv,iris: Move the SHADER_RELOC enums to brw_compiler.h They're common between the two drivers and we want to add a couple more that get emitted from code in src/intel/compiler. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	d95d5b1465	anv: Add support for binding ray-tracing pipelines This includes not only vkCmdBindPipeline but also descriptor sets and push constants. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	8cc5080580	anv: Add a ray-tracing pipeline object This just adds the core data structure which we'll build on going forward. v2: Add VK_EXT_pipeline_creation_cache_control handling (Lionel) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	dc05daf0e5	anv: Get ready for more pipeline stages This makes a bunch of loops use ARRAY_SIZE instead of MESA_SHADER_STAGES, extends a few arrays, and adds a bunch of array length asserts. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	f366f6a071	anv: Add minimal boilerplate for VK_KHR_ray_tracing_pipeline Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	d258b0bf0e	anv: Add support for binding acceleration structures Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	4664c92478	anv: Add stub support for acceleration structures This just adds a base struct and trivial implementations of all the create/destroy/bind functions. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	e97002ebcf	anv: Add minimal boilerplate for VK_KHR_acceleration_structure Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jan Beich	aefa40886b	anv: adjust headers for non-GNU after `e9e1e0362b` src/intel/vulkan/anv_device.c:27:10: fatal error: 'sys/sysmacros.h' file not found #include <sys/sysmacros.h> ^~~~~~~~~~~~~~~~~ Fixes: `e9e1e036` ("anv: implement VK_EXT_physical_device_drm") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11516>	2021-06-22 13:38:26 +00:00
Tapani Pälli	100fc3020a	anv: provide dummy vkCmdSetPatchControlPointsEXT This makes dEQP-VK.api.version_check.entry_points pass and matches how other drivers are handling this case. We do not support the feature but still need to provide a dummy entrypoint. v2: throw error if/when called (Jason) Fixes: `0d031d1da3` ("anv: toggle on VK_EXT_extended_dynamic_state2") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11503>	2021-06-22 09:14:46 +03:00
Jordan Justen	eb98171252	intel/gen125.xml: Drop GPGPU_WALKER Ref: `f4902bb189` ("intel/genxml,anv,iris: Drop the legacy compute path from gen125.xml") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11489>	2021-06-21 19:40:14 +00:00
Lionel Landwerlin	e9e1e0362b	anv: implement VK_EXT_physical_device_drm v2: add docs update error messages (Sagar) v3: Use fstat() (Jason) v4: Do fstat() on demand (Jason) v5: clear major/minor values if not present (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8649>	2021-06-21 19:18:07 +00:00
Rhys Perry	ea68d4a676	nir/propagate_invariant: add invariant_prim option Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11035>	2021-06-21 15:13:05 +00:00
Emma Anholt	990c232603	nir: Add an interface for logging shaders with mesa_log*. For debug on Android, it's useful to be able to print shaders to the android log interface, since you don't usually have stdout/stderr. Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9262>	2021-06-18 18:18:35 +00:00
Jason Ekstrand	6b15ce9fa2	docs/isl: Add detailed documentation about tiling on Intel GPUs Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11366>	2021-06-18 13:03:48 +00:00
Jason Ekstrand	0f6ebd2b73	docs/isl: Add detailed documentation about isl formats Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11366>	2021-06-18 13:03:48 +00:00
Jason Ekstrand	d07ec294bb	isl: Document more members of isl_surf Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11366>	2021-06-18 13:03:48 +00:00
Vinson Lee	5f771134ad	intel/vec4: Add missing break statement. Fix defect reported by Coverity Scan. Missing break in switch (MISSING_BREAK) unterminated_case: The case for value VEC4_OPCODE_ZERO_OOB_PUSH_REGS is not terminated by a break statement. Fixes: `89fd196f6b` ("intel/vec4: Add support for masking pushed data") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11347>	2021-06-18 05:02:23 +00:00
Yiwei Zhang	87f35032a6	anv: enable multi-planar support for drm format modifier This patch only enables the below VkFormat: - VK_FORMAT_G8_B8R8_2PLANE_420_UNORM This patch ensures the proper behavior of the below APIs: - vkGetPhysicalDeviceFormatProperties2 - vkGetPhysicalDeviceImageFormatProperties2 - vkCreateImage - vkGetImageSubresourceLayout - vkGetImageDrmFormatModifierPropertiesEXT - vkGetImageMemoryRequirements - vkGetImageMemoryRequirements2 Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chad Versace <chad@kiwitree.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11281>	2021-06-18 01:04:15 +00:00
Yiwei Zhang	ec70092d55	anv: support multi-planar format in add_all_surfaces_explicit_layout Add initial multi-planar format support on the images with modifiers: - With aux usage, - Format plane count must be 1. - Memory plane count must be 2. - Without aux usage, - Each format plane must map to a distinct memory plane. For the other cases, currently there is no way to properly map memory planes to format planes and aux planes due to the lack of defined ABI for external multi-planar images. This patch doesn't include some potentially supported cases like all format planes mapping to a single memory plane, additional refactoring is needed to workaround explicit base offset + ANV_OFFSET_IMPLICIT. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chad Versace <chad@kiwitree.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11281>	2021-06-18 01:04:15 +00:00
Yiwei Zhang	345a4e9871	anv: fix some log formats Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chad Versace <chad@kiwitree.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11281>	2021-06-18 01:04:15 +00:00
Kenneth Graunke	0510a947ba	anv: Fix dynamic primitive topology for tess on Gfx7.x too Commit `24342e499b` changed how primitive topology is handled on Gfx8+ but missed updating the Gfx7.x code. As a result, tests which previously used topologies like PATCHLIST_3 instead started using bogus ones like LINESTRIP_ADJ. This caused a GPU hangs in a bunch of Vulkan conformance tests involving tessellation. This fixes those hangs. Fixes: `24342e499b` ("anv: fix dynamic primitive topology for tess") Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11434>	2021-06-17 02:00:32 +00:00
Yiwei Zhang	357e7ae30b	anv: fix build errors after commit `8b7ff78` Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Roman Stratiienko <r.stratiienko@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11373>	2021-06-16 19:55:48 +00:00
Mike Blumenkrantz	24342e499b	anv: fix dynamic primitive topology for tess this needs to use the pre-converted topology using tess state patch control points Fixes: `f6fa4a8000` ("anv: add support for dynamic primitive topology change") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11332>	2021-06-16 13:45:15 +00:00
Jason Ekstrand	7bf0c762c3	anv: Agressively no-op Flush/InvalidateMappedMemoryRanges This has two steps. First, for each range we look at the memory object and see if it actually needs flushing before we start throwing CLFLUSH instructions. Second, we look at the whole list of types on device initialization and decide whether or not we need CLFLUSH at all. The first part should speed up atom chips a bit since we're currently CLFLUSHing everything even when we don't need to. The second isn't needed on most of today's parts because we base it on !has_llc but it is needed for discrete parts. It's also over-all cleaner. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Tested-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11364>	2021-06-15 21:00:37 +00:00
Felix DeGrood	a49b145e8d	anv: Replace DC Flush with HDC Pipeline Flush HDC Pipeline Flush is the correct method for flushing HDC pipeline on Gfx12+ HW. Continue using DC Flush for earlier HW. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9834>	2021-06-15 12:57:42 +00:00
Felix DeGrood	ef70388a3a	anv: Add ANV_PIPE_HDC_PIPELINE_FLUSH_BIT Gfx12+ PIPE_CONTROL bit for flushing HDC cache and memory transactions to L3 cache. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9834>	2021-06-15 12:57:42 +00:00
Felix DeGrood	82952deb8b	anv: Only flush Tile Cache on VK_ACCESS_HOST_R/W Tile Cache flush flushes all Color/Depth values from L3 cache to memory in Unified Cache mode. This is only required when CPU access is required. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9834>	2021-06-15 12:57:42 +00:00
Felix DeGrood	6f26a51f47	anv: remove unnecessary Tile Cache flushes On Gfx12+, flushing tile cache ensures color/depth values are globally visible, but that's expensive. Most operations only need values to be GT-visible which can be achieved with depth or rt flush. Remove a bunch of unnecessary Tile Cache flushes. Fast clears and slow depth clears still require Tile Cache flush. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9834>	2021-06-15 12:57:42 +00:00
Felix DeGrood	a7bb74db7b	anv: Remove Tile Cache flush from SBA, Pipe Select Tile Cache flushing not required for State Base Address or Pipe Select instructions. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9834>	2021-06-15 12:57:42 +00:00
Felix DeGrood	1da9ff047e	anv: Clear all pending stall after pipe flush Was only clearing CS stalls after emitting pending pipe controls. Need to clear all stalls. Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9834>	2021-06-15 12:57:42 +00:00
Felix DeGrood	fc5cb54008	anv: Add debug messages for DEBUG_PIPE_CONTROL Enable with INTEL_DEBUG=pc. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9834>	2021-06-15 12:57:42 +00:00
Felix DeGrood	6c345ddbe4	anv: Cache VB/IB in L3$ for Gfx12 Gfx12 enables caching of Vertex and Index Buffers in L3. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9834>	2021-06-15 12:57:42 +00:00
Dave Airlie	f3630548f1	crocus: initial gallium driver for Intel gfx 4-7 This is a gallium driver for the Intel gfx 4-7 GPUs. It was initially cloned from the iris driver by Ilia Mirkin, then I ported over large reams of code from i965 until it worked. Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11146>	2021-06-14 06:34:05 +10:00
Dave Airlie	8da92b5c0a	intel/compiler: add flag to indicate edge flags vertex input is last 965 and the mesa st disagree on how vertex elements are ordered when edgeflags are involved. 965 wants them in gl_vert_attrib order, but gallium supplies the edgeflag as the last vertex element regardless. This adds a flag which is enabled for gen4/5 to denote that the edgeflag is at the end. When we reap 965 later we can resolve this better. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11146>	2021-06-14 06:05:18 +10:00
Dave Airlie	5f03570eaa	intel: reorder base program key. This gets hashed a lot, this reduces the size of this, and the other keys by a small amount Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11146>	2021-06-14 06:05:17 +10:00
Dave Airlie	4de8ba15b4	intel/decode: handle gen4/5 WM state fragment shaders Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11146>	2021-06-14 06:05:17 +10:00
Yiwei Zhang	a0122385f0	anv: fix Android WSI VkFence Fix an obvious one-liner bug. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11344>	2021-06-13 06:12:31 +00:00
Jason Ekstrand	e23b55c3f0	i965: Use nir_lower_passthrough_edgeflags Now that there's a common NIR pass, there's no point in us doing this in the back-end anymore. In order to use this pass in i965, we do have to make one tiny change. Gallium runs the pass after assigning input and output locations and so needs the pass to respect those locations and num_inputs. i965, however, runs it before any location assignment or I/O lowering so we don't care. We do, however, need the pass to succeed with num_inputs == 0 because we set that later. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11313>	2021-06-11 21:19:06 +00:00
Jason Ekstrand	00dc4e0d83	intel/isl: Use a 4D physical total extent for size calculations With Yf and Ys tiling, everything is actually four dimensional because we can have multiple depth or multisampled array slices in the same tile. This commit just enhances the calculations so they can handle it. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11330>	2021-06-11 20:51:21 +00:00
Jason Ekstrand	dc764916d9	intel/isl: Make tile logical extents four dimensional Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11330>	2021-06-11 20:51:21 +00:00
Jason Ekstrand	a4dafe1fad	intel/isl: Make the offset helpers four dimensional We need to do this in order to handle Yf and Ys tiling because they use a four-dimensional tile instead of laying everything out in two dimensions. v2 (Jason Ekstrand): - Update functions added since v1: - isl_surf_get_image_range_B_tile - blorp_can_hiz_clear_depth - get_image_offset_el Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> (v1) Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11330>	2021-06-11 20:51:21 +00:00
Jason Ekstrand	3f7f6b878c	intel/blorp: Use isl_surf_get_image_offset_B_tile_el in ccs_ambiguate Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11330>	2021-06-11 20:51:21 +00:00
Jason Ekstrand	0eb3d0a8df	intel/isl: Add a isl_surf_get_image_offset_B_tile_el helper Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11330>	2021-06-11 20:51:21 +00:00
Jason Ekstrand	ddf970af88	anv/blorp: Optimize addresses/relocations when ANV_ALWAYS_SOFTPIN Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11236>	2021-06-08 22:53:22 +00:00
Jason Ekstrand	c7e1488037	anv: Optimize anv_address_physical when ANV_ALWAYS_SOFTPIN Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11236>	2021-06-08 22:53:22 +00:00
Jason Ekstrand	921bd2d1c7	anv: Fast-path surface relocs when we have softpin Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11236>	2021-06-08 22:53:22 +00:00
Jason Ekstrand	6afc3f97b6	anv: Make anv_batch_emit_reloc inline and optimize SKL+ This should drop the CPU overhead of processing buffers on SKL+ by dropping some of the logic contained in anv_reloc_list_add() whenever we have enough compile-time information to know we have softpin. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11236>	2021-06-08 22:53:22 +00:00
Jason Ekstrand	9802a0d7ca	anv: Add a helper to add a BO to the batch list without a reloc The relocation list currently serves two purposes. One is for relocations on older non-softpin platforms. The second is to keep track of driver-managed BOs which are used by the given command buffer. We going to need a mechanism to add BOs to the command buffer without doing a relocation into the batch. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11236>	2021-06-08 22:53:22 +00:00
Jason Ekstrand	7e8c28383c	anv: Handle OOM in the pinned path in anv_reloc_list_add Fixes: `b3c0b1b218` "anv: Use a bitset for tracking residency" Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11236>	2021-06-08 22:53:22 +00:00
Jason Ekstrand	a63e97e09a	anv: Make use_softpin compile-time in genX code Whenever we have the GFX_VERx10 macro available, we can make use_softpin a compile-time thing for everything but Broadwell and Cherryview. This should save us some CPU cycles especially on SKL+. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11236>	2021-06-08 22:53:22 +00:00
Jason Ekstrand	04ccfeae98	anv: Require softpin on Gen8+ Softpin was added to i915 in commit 506a8e87d8d2746b9e9d2433503fe237c54e4750 Author: Chris Wilson <chris@chris-wilson.co.uk> Date: Tue Dec 8 11:55:07 2015 +0000 drm/i915: Add soft-pinning API for execbuffer which was included in Linux 4.5. It's been over 5 years so it's probably reasonable to make it a hard requirement. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11236>	2021-06-08 22:53:22 +00:00
Caio Marcelo de Oliveira Filho	7c1c9e935e	anv: Support workgroup memory in other shaders Mesh and Task shaders can use workgroup memory, so generalize its handling in anv by moving it from anv_pipeline_compile_cs() to anv_pipeline_lower_nir(). Update Pipeline Statistics accordingly. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11230>	2021-06-08 11:30:39 -07:00
Caio Marcelo de Oliveira Filho	8af6766062	nir: Move workgroup_size and workgroup_variable_size into common shader_info Move it out the "cs" sub-struct, since these will be used for other shader stages in the future. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11225>	2021-06-08 09:23:55 -07:00
Caio Marcelo de Oliveira Filho	b5f6fc442c	nir: Move zero_initialize_shared_memory into common shader_info Move it out the "cs" sub-struct, since the bit will be used for other shader stages in the future. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11225>	2021-06-08 09:23:55 -07:00
Rhys Perry	1cbcfb8b38	nir, nir/algebraic: add byte/word insertion instructions Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3151>	2021-06-08 08:57:42 +00:00
Dave Airlie	c04f20e7e0	intel/isl: add blend enable flag to gen4/5 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10655>	2021-06-08 12:35:52 +10:00
Dave Airlie	c25a5a832a	intel/isl: add levels and minimum array element to null fill gen4/5 needs these to avoid gpu hangs around matching depth/null surfaces Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10655>	2021-06-08 12:35:43 +10:00
Dave Airlie	cb152e79f2	intel/isl: convert null surface fill to a struct. Suggested by Jason, pre-convert this to a struct so it can be expanded for gen4/5 crocus support Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10655>	2021-06-08 12:35:40 +10:00
Dave Airlie	6989afea58	intel/isl: decrease isl_format_layouts size by 36k This drops -0000000000011e90 R isl_format_layouts +0000000000008f48 R isl_format_layouts I think that's about 36k. Thanks to Jason for suggesting PACKED Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11232>	2021-06-08 11:52:25 +10:00
Caio Marcelo de Oliveira Filho	c8a7bd0dc8	nir: Rename WORK_GROUP (and similar) to WORKGROUP Be consistent with other usages in Vulkan and SPIR-V, and the recently added workgroup_size field. Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11190>	2021-06-07 22:34:42 +00:00
Caio Marcelo de Oliveira Filho	a71a780598	nir: Rename nir_intrinsic_load_local_group_size to nir_intrinsic_load_workgroup_size Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11190>	2021-06-07 22:34:42 +00:00
Caio Marcelo de Oliveira Filho	430d2206da	compiler: Rename local_size to workgroup_size Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11190>	2021-06-07 22:34:42 +00:00
Jason Ekstrand	9e0fd49858	intel/fs/ra: Fix payload node setup for SIMD16 on Gen4-5 Since `40e1d798c6`, we are now using physical register numbers for everything which makes it all simpler. In particular, we no longer need the special case for setting up the payload for SIMD16 on Gen4-5. This fixes a pile of piglit tests on ILK and similar. Fixes: `40e1d798c6` "intel/fs: Use ra_alloc_contig_reg_class()..." Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11221>	2021-06-07 16:52:19 +00:00
Marcin Ślusarz	2ebf4e984b	intel/disasm: remove useless space after "(" Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11070>	2021-06-07 08:46:11 +00:00
Marcin Ślusarz	daba2894ff	intel/disasm: decode/describe more send messages Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11070>	2021-06-07 08:46:11 +00:00
Jason Ekstrand	b742f73913	intel/isl: Fix isl_format_is_valid The format enum space isn't necessarily contiguous so we can't assume that if it's in the table it's valid. We need to check something. Fixes: `ed6e586562` "intel: properly constify isl_format_layouts" Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11191>	2021-06-04 16:16:44 -05:00
Eric Anholt	cf33316ec0	intel/vec4: Use ra_alloc_contig_reg_class() to reduce RA overhead. We go from 1672 RA regs to the real 128 HW regs. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9437>	2021-06-04 19:08:57 +00:00
Eric Anholt	40e1d798c6	intel/fs: Use ra_alloc_contig_reg_class() to speed up RA. By using the new class type, we don't need to make 1928 different registers to represent each contigous reg size starting from the actual 128 HW register, or have a mapping between RA regs and HW base regs. With the number of regs reduced, and the fast q computation when using the new classes, we no longer need to compute our own q. This drops the FS RA initialization time on my CFL system from about 1ms to 50us. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9437>	2021-06-04 19:08:57 +00:00
Eric Anholt	95d41a3525	ra: Use struct ra_class in the public API. All these unsigned ints are awful to keep track of. Use pointers so we get some type checking. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9437>	2021-06-04 19:08:57 +00:00
Adam Jackson	ed6e586562	intel: properly constify isl_format_layouts Putting a const char * in the struct means it's a pointer that has to be resolved at rtld time, which means it can be in .data.rel.ro but not .rodata like you'd hope. Fix this with the usual string table trick. Cuts about 20k (-80k read-write +60k read-only) and ~280 relocations from the gallium driver. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11168>	2021-06-04 18:32:33 +00:00
Tapani Pälli	0d031d1da3	anv: toggle on VK_EXT_extended_dynamic_state2 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10366>	2021-06-04 12:58:34 +00:00
Tapani Pälli	75ad0e4b08	anv: support blending logic op dynamic state Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10366>	2021-06-04 12:58:34 +00:00
Tapani Pälli	e0c6055351	anv: centralize vk_to_intel_logic_op array This avoids multiple copies as we will need this in multiple places. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10366>	2021-06-04 12:58:34 +00:00
Tapani Pälli	1c718952c8	anv: support primitive restart enable dynamic state Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10366>	2021-06-04 12:58:34 +00:00
Tapani Pälli	496b508403	anv: support depth bias enable dynamic state Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10366>	2021-06-04 12:58:34 +00:00
Tapani Pälli	4d531c67df	anv: support rasterizer discard dynamic state Implemented by emitting 3DSTATE_STREAMOUT packet. v2: logic fixes + merge and emit properly all contents (Lionel) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10366>	2021-06-04 12:58:34 +00:00
Tapani Pälli	284290a876	anv: introduce new dynamic states These will be used for VK_EXT_extended_dynamic_state2. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10366>	2021-06-04 12:58:34 +00:00
Anuj Phogat	8e10d54804	intel: Rename GFX 12.5 to XE_HP git grep -l "GFX 12\.5" \| xargs sed -ie "s/GFX 12\.5/XE_HP/g" Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10877>	2021-06-03 15:51:40 -07:00
Chia-I Wu	567a18dade	anv: use vk_default_allocator Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11117>	2021-06-03 08:13:27 +00:00
Jason Ekstrand	06ae2723d1	intel/vec4: Also use MOV_FOR_SCRATCH for swizzle resolves In `2db8867943`, we introduced a new meta-op MOV_FOR_SCRATCH which is identical to MOV except it lets us identify MOVs emitted during spilling so we know not to re-spill those instructions. We emit them from shuffle_for_64bit_data whenever the new for_scratch parameter is true. Unfortunately, we missed the one used for resolving swizzles. Fixes: `2db8867943` "intel/vec4: Don't spill fp64 registers more..." Tested-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11155>	2021-06-03 06:14:17 +00:00
Jason Ekstrand	f63410eee6	intel/nir,i965: Move HW generation check for UBO pushing to i965 Iris only runs on BDW+ and ANV already handles this by not even trying on anything older than HSW. The only driver benefiting from this common check is i965. Moving it out makes the pass more generic and if some driver comes along which can push UBOs on IVB, it should work for that. Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11145>	2021-06-03 05:12:33 +00:00
Dave Airlie	64fa67dd2f	intel/gfx6: move xfb_setup outside the gs compiler into the driver. This remove the use of a GL thing from the backend compiler Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11097>	2021-06-03 04:05:07 +00:00
Jason Ekstrand	f5e58838c2	intel/fs: Handle non-perspective-correct interpolation on gen4-5 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11125>	2021-06-03 02:36:17 +00:00
Mauro Rossi	076736222e	android: intel/compiler: add brw_compile_ff_gs.c to Makefile.sources Fixes the following building error: FAILED: out/target/product/x86_64/obj_x86/SHARED_LIBRARIES/i965_dri_intermediates/LINKED/i965_dri.so ... ld.lld: error: undefined symbol: brw_compile_ff_gs_prog >>> referenced by brw_ff_gs.c:56 (external/mesa/src/mesa/drivers/dri/i965/brw_ff_gs.c:56) Fixes: `52e426fd8b` ("intel/compiler: add support for compiling fixed function gs") Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10718>	2021-06-02 09:28:04 +00:00
Mike Blumenkrantz	5515d3aec5	anv: fix availability for copying timestamp query results idx here is the index of the value being written, so if it isn't used/incremented when the query result is written, the availability result will clobber it and be written to the same buffer offset Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11116>	2021-06-01 19:52:24 +00:00
Roman Stratiienko	e91445464f	anv_android: Add missing type Fixes: `2797c75426` ("anv: fix AHB leak upon exportable allocation") Signed-off-by: Roman Stratiienko <r.stratiienko@gmail.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11068>	2021-05-28 23:20:24 +00:00
Anuj Phogat	6bb66b78a9	intel/gfx12+: Add Wa_14013840143 Cc: mesa-stable Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10984>	2021-05-26 17:42:21 +00:00
Nanley Chery	19a8bd4c63	intel/isl: Fix HiZ+CCS comment about ambiguates Note that CCS isn't ambiguated during a HiZ ambiguate. Dumping the CCS surface after a HiZ ambiguate shows that the CCS is unchanged. Fixes: `98dc7f56b7` ("intel/isl: Add a separate ISL_AUX_USAGE_HIZ_CCS_WT") Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9112>	2021-05-25 22:24:36 +00:00
Nanley Chery	34dbbfdd14	anv,iris: Port the D16 workaround stalls to BLORP Commit `cd40110420` added stalls before register writes that occur when drivers emit depth stencil packets. However, it only did so for non-BLORP draw calls. Since those packets are sometimes emitted during BLORP calls, add stalls there too. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4574 Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10939>	2021-05-25 20:55:27 +00:00
Nanley Chery	afe3f373a4	intel: Limit the D16 workarounds to Gfx12.0 The workarounds introduced in `cd40110420` are no longer needed on Gfx12.5. Suggested-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10939>	2021-05-25 20:55:27 +00:00
Lionel Landwerlin	40eb8b7830	intel/perf: rename metric descriptions There is an effort to drop the "Gen" prefix from much of our codebase. This just applies this to the metrics. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10930>	2021-05-25 10:09:44 +00:00
Lionel Landwerlin	16bff57543	intel/perf: update Gen9/11 programming for AsyncCompute Adding a register, similar to what was done for RenderBasic. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10930>	2021-05-25 10:09:44 +00:00
Lionel Landwerlin	290f5a0156	intel/perf: add EHL availability condition to HDCAndSF counters The availability of those counters depends on the topology. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10930>	2021-05-25 10:09:44 +00:00
Lionel Landwerlin	6ed558f8ca	intel/perf: update Gen11 RenderBasic programming Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10930>	2021-05-25 10:09:44 +00:00
Lionel Landwerlin	6f9dcb05ff	intel/perf: update Gen11 RenderBasic programming Simple order change. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10930>	2021-05-25 10:09:44 +00:00
Lionel Landwerlin	16e344bc79	intel/perf: update gen9/11 TestOa configs Programming an additional FLEX register. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10930>	2021-05-25 10:09:44 +00:00
Yiwei Zhang	2797c75426	anv: fix AHB leak upon exportable allocation A successful AHardwareBuffer_allocate itself will increase a refcount on the newly allocated AHB. For the import case, the implementation must acquire a reference on the AHB. So if we layer the exportable allocation on top of AHB allocation and AHB import, we must release an AHB reference to avoid leak. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10940>	2021-05-25 06:31:25 +00:00
Lionel Landwerlin	474eaa25ad	intel/fs: make sure shuffle is lowered to supported types On XeHP there are restrictions on types of source and destinations with float types. As shuffle is implemented using MOV we need to make sure we lower it to supported types. This fixes tests like : dEQP-VK.subgroups.arithmetic.framebuffer.subgroupexclusivemax_vec4_vertex dEQP-VK.subgroups.arithmetic.framebuffer.subgroupexclusivemul_f16vec3_vertex Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Suggested-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10902>	2021-05-22 21:55:33 +00:00
Felix DeGrood	380fa050f2	intel/compiler: balanced tileY/linear friendly LID order for CS Fixes perf regression introduced from tileY LID order for CS shaders that access both textures and buffers. Walks LIDs in X-major fashion, but with blocks of height 4. This maps LIDs per HW thread for SIMD8/16/32 as (2x4/4x4/8x4), which is always good for tileY resources and usually good for linear resources. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10733>	2021-05-22 00:15:25 +00:00
Felix DeGrood	c23e2a662a	intel/compiler: tileY friendly LID order for CS Computer shaders that access tileY resources (textures) benefit from Y-locality accesses. Easiest way to implement this is walk local ids in Y-major fashion, instead of X-major fashion. Y-major local ids will reduce partial writes and increase cache locality for tileY accesses since tileY resources cachelines progress in Y direction. Improves performance on TGL: Borderlands3.dxvk-g2 +1.5% Y-major can introduce a performance drop on CS that use mixture of buffers and images. This should be fixed in next commit. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10733>	2021-05-22 00:15:25 +00:00
Felix DeGrood	bbd6ce6e9d	intel/compile: refactor DERIVATIVE_GROUP logic Minor changes to logic to make following changes easier. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10733>	2021-05-22 00:15:25 +00:00
Felix DeGrood	ca59db9900	intel/compiler: Use switch for DERIVATIVE_GROUP logic Switch statement is more readable. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10733>	2021-05-22 00:15:25 +00:00
Chia-I Wu	1b02577209	pps: fix a missing include in Intel pps driver In file included from ../src/intel/ds/intel_pps_perf.cc:8: ../src/intel/ds/intel_pps_perf.h:34:78: error: ‘string’ in namespace ‘std’ does not name a type Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10934>	2021-05-21 22:03:16 +00:00
Ian Romanick	3fa16b3025	intel/isl: Add mappings for PIPE_FORMAT_R8G8_R8B8_UNORM and PIPE_FORMAT_G8R8_B8R8_UNORM Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9610>	2021-05-21 01:40:22 +00:00
Kenneth Graunke	a3a4517f41	isl: Work around NVIDIA and AMD display pitch requirements In the case where we're rendering on the Intel GPU, but displaying on an AMD and NVIDIA GPU, we need to follow their pitch requirements for our linear scanout buffers. Based on a patch by Lionel Landwerlin. Closes: #4706 Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10895>	2021-05-20 23:48:01 +00:00
Timur Kristóf	5f1aa5ac38	anv: Fix unused function warnings for memory range checks. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10806>	2021-05-20 17:11:22 +00:00
Nanley Chery	88903ba64d	isl: Add isl_aux_usage_has_compression Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10872>	2021-05-19 23:59:17 +00:00
Jason Ekstrand	ebba3cad81	intel/vec4: Add support for UBO pushing Shader-db results on Haswell (vec4 only): total instructions in shared programs: 2853928 -> 2726576 (-4.46%) instructions in affected programs: 855840 -> 728488 (-14.88%) helped: 9500 HURT: 18 helped stats (abs) min: 1 max: 359 x̄: 13.54 x̃: 11 helped stats (rel) min: 0.44% max: 53.33% x̄: 19.13% x̃: 17.44% HURT stats (abs) min: 4 max: 124 x̄: 71.00 x̃: 92 HURT stats (rel) min: 3.64% max: 77.86% x̄: 46.43% x̃: 52.12% 95% mean confidence interval for instructions value: -13.78 -12.98 95% mean confidence interval for instructions %-change: -19.21% -18.81% Instructions are helped. total cycles in shared programs: 101822616 -> 60245580 (-40.83%) cycles in affected programs: 93312382 -> 51735346 (-44.56%) helped: 13292 HURT: 4506 helped stats (abs) min: 2 max: 1229260 x̄: 3370.82 x̃: 776 helped stats (rel) min: 0.04% max: 96.70% x̄: 47.56% x̃: 43.76% HURT stats (abs) min: 2 max: 17644 x̄: 716.37 x̃: 82 HURT stats (rel) min: 0.02% max: 491.80% x̄: 41.00% x̃: 11.11% 95% mean confidence interval for cycles value: -3037.07 -1635.03 95% mean confidence interval for cycles %-change: -26.03% -24.25% Cycles are helped. total spills in shared programs: 1080 -> 1314 (21.67%) spills in affected programs: 74 -> 308 (316.22%) helped: 0 HURT: 47 total fills in shared programs: 310 -> 497 (60.32%) fills in affected programs: 71 -> 258 (263.38%) helped: 0 HURT: 47 total sends in shared programs: 239884 -> 151799 (-36.72%) sends in affected programs: 129302 -> 41217 (-68.12%) helped: 9547 HURT: 0 helped stats (abs) min: 1 max: 226 x̄: 9.23 x̃: 8 helped stats (rel) min: 3.12% max: 98.15% x̄: 72.38% x̃: 80.00% 95% mean confidence interval for sends value: -9.48 -8.98 95% mean confidence interval for sends %-change: -72.80% -71.97% Sends are helped. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10571>	2021-05-19 14:38:13 +00:00
Jason Ekstrand	89fd196f6b	intel/vec4: Add support for masking pushed data This is the vec4 equivalent of `d0d039a4d3`, required for proper UBO pushing in vertex stages for Vulkan on HSW. Sadly, the implementation requires us to do everything in ALIGN1 mode and the vec4 instruction scheduler doesn't understand HW_GRF <-> UNIFORM interference so it's easier to do the whole thing in the generator. We add an instruction to the top of the program which just means "emit the blob" and all the magic happens in codegen. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10571>	2021-05-19 14:38:13 +00:00
Jason Ekstrand	a881f2295f	intel/vec4: Set up push ranges before we emit any code In order to avoid switching pull constants to push constants and then having to back to pull, compute the push ranges up-front. This way we know by the time we emit code exactly what ranges are pushable. This is a bit inefficient in the case where the "normal" push constants get compacted. However, most apps don't use giant piles of dead uniforms combined with substantial UBO use so this should be ok. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10571>	2021-05-19 14:38:13 +00:00
Jason Ekstrand	c35501ffe8	intel/vec4: Update nr_params in pack_uniform_registers This is where we re-arrange and re-pack the params. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10571>	2021-05-19 14:38:13 +00:00
Jason Ekstrand	3d1ac996d0	intel/vec4: Add some asserts to move_push_to_pull Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10571>	2021-05-19 14:38:13 +00:00
Jason Ekstrand	2db8867943	intel/vec4: Don't spill fp64 registers more than once The way we handle spilling for fp64 in vec4 is to emit a series of MOVs which swizzles the data around and then a pair of 32-bit spills. This works great except that the next time we go to pick a spill reg, the compiler isn't smart enough to figure out that the register has already been spilled. Normally we do this by looking at the sources of spill instructions (or destinations of fills) but, because it's separated from the actual value by a MOV, we can't see it. This commit adds a new opcode VEC4_OPCODE_MOV_FOR_SCRATCH which is identical to MOV in semantics except that it lets RA know not to spill again. Fixes: `82c69426a5` "i965/vec4: support basic spilling of 64-bit registers" Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10571>	2021-05-19 14:38:13 +00:00
Jason Ekstrand	7138249675	anv: Push at most 32 regs for vec4 shaders The vec4 back-end can't push UBOs just yet but it soon will be able. When it starts pushing UBOs, it will have a lower limit than scalar due to a crummy register allocator. Mirror that limit in ANV so we don't run into asserts due to ANV and the back-end making different choices. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10571>	2021-05-19 14:38:13 +00:00
Jason Ekstrand	c01354d5c4	anv: Support pushing shader constants Usually, nir_opt_constant_folding will get rid of any load_constant intrinsics which might possibly be pushed but there are rare cases where we can still end up with them. Better to handle them. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10571>	2021-05-19 14:38:13 +00:00
Jason Ekstrand	24b3e71fa9	anv: Plumb the shader into push constant helpers Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10571>	2021-05-19 14:38:13 +00:00
Lionel Landwerlin	790ff1ceac	anv: fixup physical device properties of fragment shading rate We've only enabled the extension on Gfx11+ so no need to care about prior values. Also fixup values of (min\|max)FragmentShadingRateAttachmentTexelSize. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `231651fd89` ("anv: implement VK_KHR_fragment_shading_rate") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10607>	2021-05-19 10:58:01 +03:00
Antonio Caggiano	92180a4376	pps: Intel pps driver Add the Intel pps driver using functionalities provided by libintel_perf. v2: Fix build with perfetto not enabled. v3: Open perf stream with no filtering. v4: Drop usage of inc/dec_n_users. v5: Isolate intel_perf in its own class. Signed-off-by: Antonio Caggiano <antonio.caggiano@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10216>	2021-05-18 14:28:48 +00:00
Lionel Landwerlin	5d95aa3964	intel/perf: allow metric sets to be loaded with on OA reports A bunch of performance counters rely on register snapshots on top of the OA reports. Those are already conditional to the query mode in the equations : availability="true $QueryMode &&" This change allows to disable counters that are only available with additional register snapshots. This will be useful if you only want to OA reports to extract performance counter values. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10216>	2021-05-18 14:28:48 +00:00
Lionel Landwerlin	fba189a349	intel/perf: allow opening perf stream with no context filtering Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10216>	2021-05-18 14:28:48 +00:00
Antonio Caggiano	b18a4bb8c6	intel/perf: Extern C Allow libintel_perf to be included as a dependency from a C++ project by wrapping some declaration within an extern "C" block, and then add a function to allow direct reading of the OA stream. v2: Don't expose internal helpers (Lionel) Signed-off-by: Antonio Caggiano <antonio.caggiano@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10216>	2021-05-18 14:28:48 +00:00
Tapani Pälli	72fd126070	anv: require rendering support for blit destination feature This fixes some new cts tests that exercise blitting between compressed and uncompressed formats. Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10830>	2021-05-17 10:24:28 +00:00
Connor Abbott	a40714abf7	nir/lower_phis_to_scalar: Add "lower_all" option We don't want to have to deal with vector phis in freedreno, because vectors are always split/unsplit around vectorized instructions anyways, and the stated reason for not scalarising them (it hurting coalescing) won't apply to us because we won't be using nir_from_ssa. Add this option so that we don't have to do the equivalent thing while translating from NIR. Reviewed-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10809>	2021-05-17 09:59:45 +00:00
Marcin Ślusarz	7e6a7dba1e	intel/isl: replace format_gen by verx10 Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10810>	2021-05-17 09:46:45 +00:00
Marcin Ślusarz	b1933d769f	intel: simplify is_haswell checks, part 2 Few cases that were not handled by sed. Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10810>	2021-05-17 09:46:45 +00:00
Marcin Ślusarz	3340d5ee02	intel: simplify is_haswell checks, part 1 Generated with: files=`git grep is_haswell \| cut -d: -f1 \| sort \| uniq` for file in $files; do cat $file \| \ sed "s/devinfo->ver <= 7 && !devinfo->is_haswell/devinfo->verx10 <= 70/g" \| \ sed "s/devinfo->ver >= 8 \|\| devinfo->is_haswell/devinfo->verx10 >= 75/g" \| \ sed "s/devinfo->is_haswell \|\| devinfo->ver >= 8/devinfo->verx10 >= 75/g" \| \ sed "s/devinfo.is_haswell \|\| devinfo.ver >= 8/devinfo.verx10 >= 75/g" \| \ sed "s/devinfo->ver > 7 \|\| devinfo->is_haswell/devinfo->verx10 >= 75/g" \| \ sed "s/devinfo->ver == 7 && !devinfo->is_haswell/devinfo->verx10 == 70/g" \| \ sed "s/devinfo.ver == 7 && !devinfo.is_haswell/devinfo.verx10 == 70/g" \| \ sed "s/devinfo->ver < 8 && !devinfo->is_haswell/devinfo->verx10 <= 70/g" \| \ sed "s/device->info.ver == 7 && !device->info.is_haswell/device->info.verx10 == 70/g" \ > tmpXXX mv tmpXXX $file done Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Acked-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10810>	2021-05-17 09:46:45 +00:00
Nanley Chery	eef4c708b3	anv: Avoid sampling some MCS surfaces with clear Supposedly avoids GPU hangs in BF4. See HSD 1707282275 and 14013111325. v2. Fix bug in WA implementation. (Sagar) Cc: mesa-stable Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8755>	2021-05-14 18:05:32 +00:00
Nanley Chery	608c131638	anv: Add clear_supported to anv_layout_to_aux_state This will be used for an MCS workaround. Cc: mesa-stable Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8755>	2021-05-14 18:05:32 +00:00
Lionel Landwerlin	938e52a6e8	anv: handle spirv parsing failure v2: don't leak spec_entries v3: Also switch to VK_ERROR_UNKNOWN when parsing fails Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10790>	2021-05-14 06:32:03 +00:00
Jordan Justen	e435511b58	intel/dev: Add device info for ADL GT2 Cc: mesa-stable Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9465>	2021-05-14 06:10:47 +00:00
Jordan Justen	89f3312625	intel/isl: Add Wa_22011186057 to disable CCS on ADL GT2 A0 Cc: mesa-stable Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9465>	2021-05-14 06:10:47 +00:00
Caio Marcelo de Oliveira Filho	c0dc6affdc	intel/compiler: Clarify why VUE is recomputed by FS FS will get the last geometry VUE, but it still needs to recompute in case the number of position slots assigned by geometry is larger than one -- this happens when Primitive Replication is used. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10653>	2021-05-13 12:10:26 -07:00
Lionel Landwerlin	2cebb1b5b3	anv: fix perf query pass with command buffer batching We've only considered the perf query pool change previously. But we also need to pay attention to the pass index. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `0a7224f3ff` ("anv: group as many command buffers into a single execbuf") Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10301>	2021-05-13 17:02:41 +00:00
Lionel Landwerlin	2c2de4d60e	intel/mi_builder: fix resolve call Giving NULL for anv_combine_address() triggers an assert in that function. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `8525ebe6e3` ("intel/mi_builder: Return an address from __gen_get_batch_address") Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10301>	2021-05-13 17:02:41 +00:00
Tapani Pälli	343d90b6ab	isl: require hiz for depth surface in isl_surf_get_ccs_surf Fixes: `752eefdb` ("intel/isl: Refactor isl_surf_get_ccs_surf") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10768>	2021-05-13 17:18:54 +03:00
Lionel Landwerlin	f46aa1b9d7	intel/fs: use the final destination type for regioning restrictions This is most likely a rebase mistake :( Fixes: `f3e5cd813a` ("intel/fs: Handle regioning restrictions of split FP/DP pipelines.") Ref: `aa53665fda` ("intel/fs/copy_prop: check stride constraints with actual final type") Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10764>	2021-05-12 21:19:11 +00:00
Jason Ekstrand	b13d0eea12	anv: Allow storage on all formats that support typed writes In particular, this gives us B8G8R8A8_UNORM storage support which is useful for writing WSI images from compute shaders. These formats can only be accessed in a spec-compliant way by decorating the variable NonReadable in the SPIR-V (writeonly in GLSL). If the client doesn't so decorate the variable, it'll get the null surface state where reads return 0 and writes are ignored. Tested-by: Simon Ser <contact@emersion.fr> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10624>	2021-05-05 12:20:09 +00:00
Lionel Landwerlin	df0580312a	isl: document format fields Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10624>	2021-05-05 12:20:09 +00:00
Jason Ekstrand	9301b637cf	anv: Check offset instead of alloc_size for freeing surface states Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10624>	2021-05-05 12:20:09 +00:00
Dave Airlie	922f71b819	intel/decoder: add gen4/5 geometry state decode Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10642>	2021-05-05 07:28:34 +00:00
Dave Airlie	d91d3613ad	intel/decoder: fixup batch decoder for binding tables on gen4/5 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10642>	2021-05-05 07:28:34 +00:00
Dave Airlie	4d80ec8fcf	intel/genxml: fix raster op fields on gen4/5 These should be unsigned integers Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10636>	2021-05-05 02:56:20 +00:00
Dave Airlie	e5169714ea	intel/gemxml: move blitter command to render on gen4/5 The blitter commands don't show up in INTEL_DEBUG=bat, but on gen4/5 they are emitted on the render engine ring so just change the XML to reflect that. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10636>	2021-05-05 02:56:20 +00:00
Dave Airlie	c6f34e9fda	intel/genxml: rewrite the prefilterop xml to be more consistent. This uses a prefix at Ken's suggestion and aligns it across gens Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10636>	2021-05-05 02:56:20 +00:00
Dave Airlie	04ed882046	intel/genxml: align gen4/5 xml for store data immediate Just align with the gen6 and later xml Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10636>	2021-05-05 02:56:20 +00:00
Ilia Mirkin	10abc09a04	intel: fix MI builder for pre-gen7 MI_LOAD_REGISTER_MEM is only available on gen7+, so avoid build errors on earlier generations. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10636>	2021-05-05 02:56:20 +00:00
Caio Marcelo de Oliveira Filho	dd48683cfd	nir: Move shared_memory_explicit_layout bit into common shader_info Move it out of the "cs" sub-struct, since the bit can be used for other shader stages in the future. This also removes a subtle issue in spirv_to_nir: info.cs.shared_memory_explicit_layout was used without checking for the CS shader stage. It ended up being "harmless" since the effects also depended on presence of shared variables. Fixes: `5de6c5973a` ("spirv: Implement SPV_KHR_workgroup_memory_explicit_layout") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10529>	2021-05-04 20:54:58 +00:00
Caio Marcelo de Oliveira Filho	caf9fb1a10	intel/compiler: Remove unused exported functions Now that all drivers are using brw_cs_get_dispatch_info() we can remove one function (which is now unused) and reduce the scope of the other. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10504>	2021-05-04 08:15:19 -07:00
Caio Marcelo de Oliveira Filho	279acf1031	anv: Use brw_cs_get_dispatch_info() And since right_mask is already provided as part of dispatch_info, just use that instead of storing it. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10504>	2021-05-04 08:15:19 -07:00
Caio Marcelo de Oliveira Filho	5cc758558d	intel/compiler: Add common function for CS dispatch info We have this small calculations repeated in each Intel driver, so move them to a single place to be reused. Also includes "right_mask" since is always used in the same context and depends on the dispatch info values. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10504>	2021-05-04 08:15:19 -07:00
Dave Airlie	52e426fd8b	intel/compiler: add support for compiling fixed function gs This is ported from i965, but the interface is cleaned up Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9721>	2021-05-04 03:39:45 +00:00
Dave Airlie	ac33e2b66b	intel: move brw_ff_gs_prog_key/data to compiler. Step one to moving the ff_gs emitter to compiler for sharing, move BRW_MAX_SOL_BINDINGS up so the keys are in same area Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9721>	2021-05-04 03:39:45 +00:00

... 2 3 4 5 6 ...

6947 Commits