KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Jason Ekstrand	9946120d2b	intel/isl: Add more cases to isl_surf_get_uncompressed_surf We can actually create array surfaces instead of requiring single-slice in a few cases. This does require us to be very careful about our checks, though. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11647>	2021-07-06 19:29:42 +00:00
Jason Ekstrand	2001a26309	intel/blorp: Use isl_surf_get_uncompressed_surf Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11647>	2021-07-06 19:29:42 +00:00
Jason Ekstrand	20f075c270	intel/blorp: Adjust the compressed copy rectangle before convert_to_single_slice It doesn't matter for the actual copy rectangle and this makes the asserts a bit nicer as we don't need to bother with the intratile offsets because there aren't any yet. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11647>	2021-07-06 19:29:42 +00:00
Jason Ekstrand	8f281be0ba	intel/isl: Pull the uncompressed surface view code from anv This adds a helper isl_surf_get_uncompressed_surf for creating a surface which provides an uncompressed view into a compressed surface. The code is basically a direct port of the uncompressed surface code from the Vulkan driver which, in turn, was a port from BLORP. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11647>	2021-07-06 19:29:42 +00:00
Emma Anholt	d8d7d475aa	anv: Fix unused var warning on release builds from an assertion. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11628>	2021-07-06 11:24:37 -07:00
Thomas H.P. Andersen	85de630273	anv: remove dead code The unused bo_flags here is a leftover from the past. A similar setup of bo_flags is now performed within anv_device_alloc_bo via a call to anv_bo_alloc_flags_to_bo_flags. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11645>	2021-06-30 22:34:42 +00:00
Yevhenii Kolesnikov	974c58b317	intel: fix leaking memory on shader creation ralloc_adopt takes care of all the shader's children, but shader itsel ends up orphaned and never gets free'd. Fixes: `ef5bce9253` ("intel: Drop the last uses of a mem_ctx in nir_builder_init_simple_shader().") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4951 Signed-off-by: Yevhenii Kolesnikov <yevhenii.kolesnikov@globallogic.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11651>	2021-06-30 19:34:56 +03:00
Jason Ekstrand	f5876dfdb9	intel/fs: Lower uniform pull constant load message to LSC dataport Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11600>	2021-06-30 16:17:18 +00:00
Sagar Ghuge	6362059b6b	intel/fs: Lower varying pull constant load message to LSC dataport Co-authored-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11600>	2021-06-30 16:17:18 +00:00
Sagar Ghuge	4fca64ad4d	intel/fs: Lower A64 atomic messages to LSC dataport Co-authored-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11600>	2021-06-30 16:17:18 +00:00
Sagar Ghuge	07a4bdf1e8	intel/fs: Lower A64 byte scattered r/w messages to LSC dataport Co-authored-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11600>	2021-06-30 16:17:18 +00:00
Mark Janes	22d20dbb02	intel/fs: Lower A64 untyped r/w messages to LSC when available We set the ex_desc to 0, since the address surface type is FLAT. v2 (Sagar Ghuge): - Fix message descriptor encoding v2 (Jason Ekstrand): - Drop support for block messages Co-authored-by: Sagar Ghuge <sagar.ghuge@intel.com> Co-authored-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11600>	2021-06-30 16:17:18 +00:00
Sagar Ghuge	621cf9b1df	intel/fs: Lower Byte scattered r/w messages to LSC when available v2 (Jason Ekstrand): - Squash in brw_scheduler changes - Update brw_ir_performance Co-authored-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11600>	2021-06-30 16:17:18 +00:00
Sagar Ghuge	8f82c8aa1a	intel/fs: Lower untyped float atomic messages to LSC when available Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11600>	2021-06-30 16:17:18 +00:00
Mark Janes	bd40a1e8c9	intel/fs: Lower untyped atomic messages to LSC when available Bspec programming note metions that "Atomic messages are always forced to "un-cacheable" in the L1 cache". We can make the L1 cache un-cacheable and L3 with write-back policy. v2: (Sagar Ghuge): - Fix caching policy for atomic messages - Fix simd exec size v3: (Sagar Ghuge): - Add atomic messages to brw_schedule_instructions v4: (Jason Ekstrand): - Rebase on lsc_msg_desc reworks Co-authored-by: Sagar Ghuge <sagar.ghuge@intel.com> Co-authored-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11600>	2021-06-30 16:17:18 +00:00
Mark Janes	4f86a70599	intel/fs: Lower DW untyped r/w messages to LSC when available This puts the basic infrastructure in place for lowering logical dataport messages to LSC messages. We start with the two most obvious opcodes and add more in later patches. v2 (Sagar Ghuge): - Pass required params to message desc - Remove duplicate mlen calculation - Change commit message. v3 (Jason Ekstrand): - Drop TGM support Co-authored-by: Jason Ekstrand <mark.a.janes@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11600>	2021-06-30 16:17:18 +00:00
Mark Janes	32ec0662fd	intel/compiler: Add LSC messages to brw_schedule_instructions v2 (Jason Ekstrand): - Use lsc_msg_desc_opcode() - Drop all opcodes for now and add them in later patches. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11600>	2021-06-30 16:17:18 +00:00
Jason Ekstrand	8d3468ad5b	intel/compiler: Add LSC to messages brw_ir_performance This adds framework only. No opcodes. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11600>	2021-06-30 16:17:18 +00:00
Sagar Ghuge	634925694d	intel/disasm: Disassemble LSC message extended descriptors v2 (Mark Janes): - changed to lsc convention v3 (Jason Ekstrand): - Use lsc_msg_desc_addr_type Co-authored-by: Mark Janes <mark.a.janes@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11600>	2021-06-30 16:17:18 +00:00
Sagar Ghuge	2605727a80	intel/disasm: Disassmeble LSC messages v2 (Jordan Justen): - Use PRIu64 v3 (Jason Ekstrand): - Drop ranged fence ops, Jason v4: (Mark Janes) - fixed missing parameter to brw_message_desc_cmask_or_vector - changed to use lsc methods to extract fields v5 (Jason Ekstrand): - Squash original disassembler patch and fixes togetherk - Use lsc_opcode_has_cmask - Prefix atomic ops with "atomic_" Co-authored-by: Mark Janes <mark.a.janes@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11600>	2021-06-30 16:17:18 +00:00
Mark Janes	f5541cd4e9	intel/compiler: Add getter helpers for LSC message descriptor fields v2: (Sagar Ghuge): - rename addr_reg_size to src0_len to match with bspec v3 (Jason Ekstrand): - Re-arrange things in increasing bit order Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11600>	2021-06-30 16:17:18 +00:00
Sagar Ghuge	4ff00194b7	intel/compiler: Add helpers for LSC message descriptors v2 (Jason Ekstrand): - Squash all the similar patches together - Re-arrange and rename some things to be more consistent - Add a lsc_opcode_has_cmask helper - Drop is_one_addr_reg v3 (Jason Ekstrand): - Add transpose - Re-order arguments to make more logical sense - Switch from `write` to `has_dest` Co-authored-by: Mark Janes <mark.a.janes@intel.com> Co-authored-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11600>	2021-06-30 16:17:18 +00:00
Sagar Ghuge	b67f1ff465	intel/compiler: Add support for LSC fence operations v2 (Jason Ekstrand): - Squash SLM and global fence ops together v3 (Jason Ekstrand): - Rework to use message descriptors instead of instruction fields v4 (Jason Ekstrand): - Don't pass BTI into back-end emit function. Always use FLAT. Co-authored-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11600>	2021-06-30 16:17:18 +00:00
Sagar Ghuge	cf612e4dc1	intel/compiler: Define new LSC data port encodings Xe-HPG comes with a massively reworked dataport. The new thing, called Load/Store Cache or LSC, has a significantly improved interface. Instead of bespoke messages for every case, there's basically one or two messages with different bits to control things like address size, how much data is read/written, etc. It's way nicer but also means we get to rewrite all our dataport encoding/decoding code. This patch kicks off the party with all of the new enums. v2 (Jason Ekstrand, Mark Janes): - Rename to LSC v3 (Jason Ekstrand): - Add numbers to all enums Co-authored-by: Mark Janes <mark.a.janes@intel.com> Co-authored-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11600>	2021-06-30 16:17:18 +00:00
Jason Ekstrand	4a85a5e09f	intel/devinfo: Add a has_lsc bit Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11600>	2021-06-30 16:17:18 +00:00
Marcin Ślusarz	7048a15485	anv: drop unused argument of anv_descriptor_set_address Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11577>	2021-06-30 08:41:41 +00:00
Marcin Ślusarz	904bd8f358	anv: handle push descriptor sets when they are sent with push constants When vkCmdPushDescriptorSetKHR is used, the descriptor set is allocated internally without belonging to any pool. Such descriptor set will be visible on the GPU side because it's a part of the dynamic state stream, but we still have to store its address in the array of descriptor sets. Complements: `379b9bb7b0` ("anv: Support fetching descriptor addresses from push constants") Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11577>	2021-06-30 08:41:41 +00:00
Marcin Ślusarz	79a50c6f54	anv: keep descriptor set's address directly in anv_descriptor_set Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11577>	2021-06-30 08:41:41 +00:00
Emma Anholt	b18cf54f0d	intel: Early exit from inst_is_in_block(). Surely the compiler would sort that out, you would think. But no, my debugoptimized build improves dEQP-GLES31.functional.ubo.random.all_per_block_buffers.13 runtime by 25% on my SKL from this change. This was the slowest test in the GLES31 tests on APL in CI, at 22s. And yes, we were spending around half of our runtime in this function. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11631>	2021-06-29 16:48:40 +00:00
Lionel Landwerlin	71b7c7b0dc	intel/perf: use the right popcount for 64bits We're currently using the 32bit version which is dropping half the bits of the 64bits values. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11607>	2021-06-29 06:29:53 +00:00
Eleni Maria Stea	e094f75bd0	intel: PAGE_SIZE used in allocators shouldn't be defined on FreeBSD iris, i965, and anv define the PAGE_SIZE in anv_allocator and bufmgr files. As on FreeBSD the page size is defined in machine/param.h that is indirectly included by those files, we'd rather define it only when the system is not FreeBSD to avoid compile errors. v2: Changed the path in the comment to make clear that machine/params.h is a FreeBSD system file. Signed-off-by: Eleni Maria Stea <elene.mst@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11203>	2021-06-28 21:12:24 +03:00
Ella-0	86fe8db4aa	anv: expose primary node to VK_EXT_physical_device_drm even when VK_KHR_display is not enabled Fixes: `e9e1e0362b` ("anv: implement VK_EXT_physical_device_drm") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11616>	2021-06-28 14:03:22 +00:00
Marcin Ślusarz	2cf189cc88	intel/fs: use stack for temporary array "regs" is an array of 2 -> "m" must be <= 2 -> "components" array can be allocated on the stack Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11575>	2021-06-28 09:44:40 +00:00
Marcin Ślusarz	5c66fb7e6e	intel/blorp: initialize BLEND_STATE using braced initializer list Coverity complains that memset has no effect, because of size 0. Size of BLEND_STATE struct is 0 on gfx [6, 7.5], so memset has nothing to do there. This is of course harmless, but we can make code simpler by replacing memset with an empty initializer list and at the same time avoid a warning from Coverity. CID: 1486015 Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11574>	2021-06-28 09:31:41 +00:00
Marcin Ślusarz	5c5423e6ee	intel/tools: fix potential memory leaks CID: 1444041 CID: 1444104 Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11574>	2021-06-28 09:31:41 +00:00
Marcin Ślusarz	14227b3364	intel/tools: fix invalid type in argument to printf format specifier Also mark parse_error as printf-like to catch such errors with gcc. CID: 1473100 CID: 1473101 Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11574>	2021-06-28 09:31:41 +00:00
Marcin Ślusarz	f2a3c9de67	intel/tools: fix int-to-pointer/pointer-to-int cast warnings on 32-bit gcc complains: ../src/intel/tools/aub_write.c: In function ‘populate_ppgtt_table’: ../src/intel/tools/aub_write.c:254:16: warning: cast to pointer from integer of different size [-Wint-to-pointer-cast] 254 \| (void *)(aub->phys_addrs_allocator++ << 12); \| ^ ../src/intel/tools/aub_write.c:258:27: warning: cast from pointer to integer of different size [-Wpointer-to-int-cast] 258 \| i, (uint64_t)table->subtables[i]); \| ^ ../src/intel/tools/aub_write.c:273:24: warning: cast from pointer to integer of different size [-Wpointer-to-int-cast] 273 \| (level == 1 ? (uint64_t)table->subtables[i] : \| ^ ../src/intel/tools/aub_write.c: In function ‘ppgtt_lookup’: ../src/intel/tools/aub_write.c:346:11: warning: cast from pointer to integer of different size [-Wpointer-to-int-cast] 346 \| return (uint64_t)L1_table(ppgtt_addr)->subtables[L1_index(ppgtt_addr)]; \| ^ ../src/intel/tools/intel_sanitize_gpu.c: In function ‘bo_size’: ../src/intel/tools/intel_sanitize_gpu.c:99:15: warning: cast from pointer to integer of different size [-Wpointer-to-int-cast] 99 \| return e ? (uint64_t)e->data : UINT64_MAX; \| ^ Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11574>	2021-06-28 09:31:41 +00:00
Marcin Ślusarz	53c49ce41f	intel/tools: fix left shift overflow on 32-bit gcc complains: ../src/intel/tools/aub_write.c:394:53: warning: left shift count >= width of type [-Wshift-count-overflow] 394 \| assert((aub->phys_addrs_allocator + size) < (1UL << 32)); \| ^~ Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11574>	2021-06-28 09:31:41 +00:00
Marcin Ślusarz	d19cf7d572	anv: fix potential integer overflows In all cases both variables has a type of uint32_t, so multiplying them will also generate uint32_t. The results of those multiplications are used as uint64_t's, so Coverity thinks there might be integer overflows here. I don't think it's possible to hit them (query BOs should be relatively small), but let's avoid those overflows. CID: 1472820 CID: 1472821 CID: 1472822 CID: 1472824 CID: 1475934 CID: 1475927 Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11574>	2021-06-28 09:31:41 +00:00
Marcin Ślusarz	365c7cc87c	intel/decoder: add assert for register size Coverity complains about out-of-bounds access in intel_field_iterator_init, because it doesn't know that the GT_MODE register has a size of 4 bytes. Add an assertion to verify that. CID: 1474552 Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11574>	2021-06-28 09:31:41 +00:00
Lionel Landwerlin	40b67a2922	anv: allocate bigger batches as we grow command buffers This is the first time we see an application running out of mmap(). We essentially allocate too many batches (+65k) and end up not being able to mmap them, at which point we can't mmap anything anymore and things go sideways. This change allocates bigger batch BOs as we grow an existing command buffer. This drastically reduces the number of BOs we need to allocate (the benchmark that reported the issue now reaches a max of ~630 BOs, instead of reaching 65k and failing previously). v2: Track the total batch size of command buffers (Jason) Just give 0 for batch_len to i915 (Jason) v3: Fix indentation (Jason) v4: Drop uncessary reshuffling of error labels (Jason) v5: Remove empty lines (Marcin) v6: Limit BO growing to chunks of 16Mb (Jason) v7: Add assert on initial size (Jason) v8: Add define for max size (Jason) v9: Fixup v7 assert for non softpin platforms (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4956 Cc: mesa-stable Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11482>	2021-06-26 12:23:54 +03:00
Caio Marcelo de Oliveira Filho	5853f17c30	anv: Advertise VK_KHR_shader_subgroup_uniform_control_flow This is always valid for Intel, so we don't need to do anything different if the Execution Mode is being used. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11476>	2021-06-25 22:41:32 +00:00
Mike Blumenkrantz	6a79ee97f4	anv: VK_EXT_multi_draw implementation Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11531>	2021-06-25 17:16:14 +00:00
Mike Blumenkrantz	1e39f2c199	anv: unify some draw state vertex constant emission Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11531>	2021-06-25 17:16:14 +00:00
Samuel Iglesias Gonsálvez	e791b2045a	anv: do not dereference VkPipelineMultisampleStateCreateInfo always VkGraphicsPipelineCreateInfo.pMultisampleState is a pointer to a VkPipelineMultisampleStateCreateInfo structure, and is ignored if the pipeline has rasterization disabled. Fixes a crash in one CTS tests that checks this. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11601>	2021-06-25 14:42:42 +00:00
Jason Ekstrand	63e80d441a	intel/genxml: Remove old scratch fields on GFX version 12.5 Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11582>	2021-06-25 00:18:29 +00:00
Jason Ekstrand	eeeea5cb87	anv: Add support for scratch on XeHP Rework: * Jordan: Handle per_thread_scratch==0 in anv_scratch_pool_get_surf * Jordan: Update subslices in anv_scratch_pool_alloc * Jason: Clean up the patch a bit Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11582>	2021-06-25 00:18:29 +00:00
Jason Ekstrand	8ca0513eca	intel/genxml: Add new ScratchSpaceBuffer fields on GFX version 12.5 Rework: * Jordan: Fix MEMZONE_BINDER_START detection * Jordan: Bump the IRIS_BINDLESS_SIZE to 8M Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11582>	2021-06-25 00:18:29 +00:00
Jason Ekstrand	1e242785c3	intel/fs: Implement load/store_scratch on XeHP Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11582>	2021-06-25 00:18:29 +00:00
Jason Ekstrand	c38812be1d	intel/fs: Implement spilling on XeHP Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11582>	2021-06-25 00:18:29 +00:00
Jason Ekstrand	96ee78778b	intel/isl: Add support for scratch buffers XeHP adds support for a new surface type for scratch. It's similar to SURFTYPE_STRBUF in that it's a 2D array-of-struct format but the one key difference is that the U coordinate is computed automatically based on the thread ID and only the V coordinate is provided in the dataport message. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11582>	2021-06-25 00:18:29 +00:00
Jason Ekstrand	443627fcc0	intel/genxml: Add SURFTYPE_SCRATCH on GFX version 12.5 Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11582>	2021-06-25 00:18:29 +00:00
Jason Ekstrand	d31dd81292	anv: Claim to be a discrete GPU if has_lmem Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5599>	2021-06-24 16:14:38 +00:00
Jordan Justen	b6a1063c2e	intel/dev: Set has_local_mem for DG1 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5599>	2021-06-24 16:14:38 +00:00
Sagar Ghuge	e505c221fa	anv: Allocate scratch and workaround BO in local memory Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5599>	2021-06-24 16:14:38 +00:00
Sagar Ghuge	6352371ff6	anv: Allocate BO in appropriate region Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5599>	2021-06-24 16:14:38 +00:00
Sagar Ghuge	3f8eca7f82	anv: Wrapper around I915_GEM_CREATE_EXT_MEMORY_REGIONS v2 (Jordan Justin): - add anv_gem_stubs.c impl v3 (Jason Ekstrand): - Use the upstream uAPI - Rework the interface a bit Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5599>	2021-06-24 16:14:38 +00:00
Sagar Ghuge	65e8d72bc1	anv: Query memory region info Create additional memory type with DEVICE_LOCAL_BIT if we have local memory region aviable. v2 (Jason Ekstrand): - Don't leak mem_regions if the second ioctl fails Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5599>	2021-06-24 16:14:38 +00:00
Jordan Justen	cb6feae0b5	intel/devinfo: Add has_local_mem Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5599>	2021-06-24 16:14:38 +00:00
Jason Ekstrand	b8030ab1ea	isl,docs: Add a chapter on AUX state tracking We also update and improve the docs in isl.h which get pulled into this new chapter. Acked-by: Luis Strano <luis.strano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11479>	2021-06-24 13:57:40 +00:00
Jason Ekstrand	94a52bc85c	isl,iris: Move the extra_aux_surf logic into iris This gets rid of the awkward interface for isl_surf_get_ccs_surf where we passed it two aux surfaces and it was supposed to fill out the second one based on whether or not the first one already had stuff in it. Instead, we now pass it three well-labled surfaces: surf, hiz_or_mcs_surf, and ccs_surf which have obvious meanings. This does mean that iris has to carry a bit of logic and we have to flip parameters around in all the callers. But the resulting interface is much cleaner. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11479>	2021-06-24 13:57:40 +00:00
Jason Ekstrand	37f76aab1c	isl: Take a hiz_or_mcs_surf in isl_surf_supports_ccs Whether or not a surface supports CCS on Tigerlake and later is dependent not only on the main surface but also on the MCS or HiZ surface, if any. We were doing some of these checks in isl_get_ccs_surf based on the extra_aux parameter but not as many as we probably should. In particular, we were really only checking HiZ conditions and nothing for MCS. It also meant that, in spite of the symmetry in names, the checks in isl_surf_get_ccs_surf were more complete than in isl_surf_supports_ccs. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11479>	2021-06-24 13:57:40 +00:00
Jason Ekstrand	2d2590a879	isl: Assert some iris invariants in isl_surf_get_ccs_surf The only driver which calls isl_surf_get_ccs_surf with extra_aux != NULL is iris and it always calls it with two aux surfaces and never calls it for CCS twice. We can turn those checks into asserts. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11479>	2021-06-24 13:57:40 +00:00
Dave Airlie	0acd202858	intel/genxml: fix gfx6 GS SVB_INDEX encoding This seems to match what the docs + 965 traces say Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11567>	2021-06-24 06:50:49 +00:00
Tapani Pälli	55951ac28e	anv: fix emitting dynamic primitive topology Initial implementation missed various fields that derive from the primitive topology. This patch fixes 3DSTATE_RASTER/3DSTATE_SF, 3DSTATE_CLIP and 3DSTATE_WM (gen7.x) emission in the dynamic case. Fixes: `f6fa4a8000` ("anv: add support for dynamic primitive topology change") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4924 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11379>	2021-06-23 12:01:45 +00:00
Lionel Landwerlin	9b7cba7724	anv: bound checks buffer memory binding in debug builds Validation layers should warn you about this (VUID-VkBindBufferMemoryInfo-size-01037) but this would be useful for zink debugging. Requested by Zmike. v2: Also check memoryOffset (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11542>	2021-06-23 08:16:57 +00:00
Francisco Jerez	4dc4284342	intel/fs: Implement Wa_14013745556 on TGL+. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11433>	2021-06-23 07:34:22 +00:00
Francisco Jerez	c19cfa9dc2	intel/fs: Fix synchronization of accumulator-clearing W/A move on TGL+. Right now the accumulator-clearing move emitted by the generator for Wa_14010017096 inherits the SWSB field from the previous instruction. This can lead to redundant synchronization, or possibly more serious issues if the previous instruction had a TGL_SBID_SET SWSB synchronization mode. Take the SWSB synchronization information from the IR. Fixes: `a27542c5dd` ("intel/compiler: Clear accumulator register before EOT") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11433>	2021-06-23 07:34:22 +00:00
Francisco Jerez	63abc083ce	intel/fs: Teach IR about EOT instruction writing the accumulator implicitly on TGL+. This is unlikely to have had any negative side effect on the original TGL, but will lead to issues on XeHP+ if the software scoreboard pass isn't able to synchronize the accumulator writes. Fixes: `a27542c5dd` ("intel/compiler: Clear accumulator register before EOT") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11433>	2021-06-23 07:34:22 +00:00
Francisco Jerez	5e7f443de0	intel/fs: Add SWSB dependency annotations for cross-pipeline WaR data hazards on XeHP+. In cases where an in-order instruction is overwriting a register previously read by another in-order instruction, drop the dependency iff the previous read is guaranteed to have occurred from the same in-order pipeline. This should only have an effect on XeHP+ since previous Xe platforms only had one in-order FPU pipeline. The previous workaround we were using for this treated all ordered read dependencies as write dependencies to avoid noise from our simulation environment. Relative to our previous workaround this improves performance of GFXBench5 gl_tess by ~7% on a DG2 system among other single-digit percentual FPS improvements. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11433>	2021-06-23 07:34:22 +00:00
Francisco Jerez	d46bb14d14	intel/fs: Implement Wa_22012725308 for cross-pipe accumulator data hazard. The hardware fails to provide the expected data coherency guarantees for accumulator registers when accessed from multiple FPU pipelines. Fix this by tracking implicit accumulator accesses just like we do for regular GRF registers, but instead of adding synchronization annotations for any dependency we only do it for dependencies with a pipeline mismatch, since the hardware should be able to guarantee proper synchronization for matching pipelines. Note that this workaround handles RaW and WaW dependencies in addition to the WaR dependencies described in the hardware bug report even though cross-pipeline RaW accumulator dependencies should be extremely rare, since chances are the hardware will also hang if we ever hit such a condition. This only affects XeHP+, since all FPU instructions are executed as a single in-order pipeline on earlier Xe platforms. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11433>	2021-06-23 07:34:22 +00:00
Francisco Jerez	385da1fe36	intel/fs: Track single accumulator in scoreboard lowering pass. This change reduces the precision of the scoreboard data structure for accumulator registers, because the rules determining the aliasing of accumulator registers are non-trivial and poorly documented (e.g. acc0 overlaps the storage of acc1 when the former is accessed with an integer type). We could implement those rules but it wouldn't have any practical benefit since we currently only use acc0-1, and for the most part we can rely on the hardware's accumulator dependency tracking. Instead make our lives easier by representing it as a single register. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11433>	2021-06-23 07:34:22 +00:00
Francisco Jerez	231337a13a	intel/fs/xehp: Assert that the compiler is sending all 3 coords for cubemaps. As required by HSDES:14013363432. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11433>	2021-06-23 07:34:22 +00:00
Lionel Landwerlin	7ed0aaced7	nir: use a more fitting index for btd_stack_push_intel Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Lionel Landwerlin	423c47de99	nir: drop the btd_resume_intel intrinsic This is now 100% equivalent to the new rt_resume intrinsic. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Lionel Landwerlin	4d9fcf2799	intel/rt: switch to common pass for shader calls lowering v2: rename for new indices Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Lionel Landwerlin	1dacea10f3	anv: implement caching for ray tracing pipelines v2: Turn a bunch of pointer checks into checks against NULL (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Lionel Landwerlin	fed7ac932f	anv: move trivial return shader to device Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Lionel Landwerlin	7c852f78c4	anv: store more RT shader data in pipeline_stage object Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Lionel Landwerlin	045f4600b1	anv: cache raytracing trampoline shader Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Lionel Landwerlin	ab77aeb488	blorp: add blorp string in shader keys Upon looking at caching the raytracing shader (in particular the trampoline one) I kind of got afraid that some of the keys used for blorp would end up matching other keys. This is because blorp keys are fairly simple. There is no SPIRV module hash included. This change includes a "blorp" string at the beginning of the queue to ensure we don't collide with other keys. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	7479fe6ae0	anv: Implement vkCmdTraceRays and vkCmdTraceRaysIndirect v2: Fix anv_cmd_state::binding_tables array size (Lionel) v2: Fix anv_cmd_state::samplers array size (Lionel) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	ac6d7a1758	anv: Make anv_address::offset 64-bit This allows us to convert a 64-bit address to an anv_address which is useful for working with device addresses. v2: switch to int64_t to keep state pool relative relocation working on non-softpin platforms v3: Update assert to reflect relative offsets (Jason) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	a67d7c9dee	anv: Allow _anv_combine_address with a NULL batch This is required in order to be able to use GenXML pack functions for structs with addresses when you're not packing into a batch. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	f68d64dac0	anv: Add support for vkCmdSetRayTracingPipelineStackSizeKHR Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	02f7964a13	anv: Compute scratch sizes for ray-tracing pipelines and shader groups Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	c3ac9afca3	anv: Create and return ray-tracing pipeline SBT handles Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	b66d3e627a	intel/fs: Don't pull CS push constants if uses_inline_data Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	79dc25d867	anv: Compile trivial return and trampoline shaders These don't necessarily go in any group but are required for dispatch to work properly. The trampoline is a compute shader that is the initial start point for the trace. It's in charge of invoking the actual ray-gen shader. The trivial return shader is used whenever another shader is missing and it does no work except the minimum required to do a stack return. v2: Rebase on upstream changes (Lionel) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	e104555851	anv: Compile ray-tracing shaders This doesn't look too different from other compile functions we have in anv_pipeline.c. The primary difference is that ray-tracing pipelines have this weird two-stage thing where you have "stages" which are individual shaders and "groups" which are sort of mini pipelines that are used to handle hits. For any given ray intersection, only the hit and intersection shaders from the same group get used together. You can't have an intersection shader from group A used with an any-hit from group B. This results in a weird two-step compile. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	379b9bb7b0	anv: Support fetching descriptor addresses from push constants Bindless shaders don't have binding tables so they have to get at the descriptor sets via a different mechanism. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	937ffb1af0	nir/apply_pipeline_layout: Handle bindless shaders They don't have binding tables so they have to use A64 descriptor set access and everything has to be bindless all the time. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	770d331285	anv: Disallow UBO pushing for bindless shaders They don't really have push constants. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	c92fd35848	intel/rt: Use reloc constants for the resume SBT It's going to be attached to the end of the shader binary, not an arbitrary table somewhere in memory. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	705395344d	intel/fs: Add support for compiling bindless shaders with resume shaders Instead of depending on the driver to compile each resume shader separately, we compile them all in one go in the back-end and build an SBT as part of the shader program. Shader relocs are used to make the entries in the SBT point point to the correct resume shader. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	d055ac9bdf	intel/compiler: Add a U32 reloc type Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	55508bbe66	intel/compiler: Generalize shader relocations a bit This commit adds a delta to be added to the relocated value as well as the possibility of multiple types of relocations. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	f7668d6fe5	anv,iris: Move the SHADER_RELOC enums to brw_compiler.h They're common between the two drivers and we want to add a couple more that get emitted from code in src/intel/compiler. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	d95d5b1465	anv: Add support for binding ray-tracing pipelines This includes not only vkCmdBindPipeline but also descriptor sets and push constants. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	8cc5080580	anv: Add a ray-tracing pipeline object This just adds the core data structure which we'll build on going forward. v2: Add VK_EXT_pipeline_creation_cache_control handling (Lionel) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	dc05daf0e5	anv: Get ready for more pipeline stages This makes a bunch of loops use ARRAY_SIZE instead of MESA_SHADER_STAGES, extends a few arrays, and adds a bunch of array length asserts. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	f366f6a071	anv: Add minimal boilerplate for VK_KHR_ray_tracing_pipeline Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	d258b0bf0e	anv: Add support for binding acceleration structures Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	4664c92478	anv: Add stub support for acceleration structures This just adds a base struct and trivial implementations of all the create/destroy/bind functions. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jason Ekstrand	e97002ebcf	anv: Add minimal boilerplate for VK_KHR_acceleration_structure Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8637>	2021-06-22 21:09:25 +00:00
Jan Beich	aefa40886b	anv: adjust headers for non-GNU after `e9e1e0362b` src/intel/vulkan/anv_device.c:27:10: fatal error: 'sys/sysmacros.h' file not found #include <sys/sysmacros.h> ^~~~~~~~~~~~~~~~~ Fixes: `e9e1e036` ("anv: implement VK_EXT_physical_device_drm") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11516>	2021-06-22 13:38:26 +00:00
Tapani Pälli	100fc3020a	anv: provide dummy vkCmdSetPatchControlPointsEXT This makes dEQP-VK.api.version_check.entry_points pass and matches how other drivers are handling this case. We do not support the feature but still need to provide a dummy entrypoint. v2: throw error if/when called (Jason) Fixes: `0d031d1da3` ("anv: toggle on VK_EXT_extended_dynamic_state2") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11503>	2021-06-22 09:14:46 +03:00
Jordan Justen	eb98171252	intel/gen125.xml: Drop GPGPU_WALKER Ref: `f4902bb189` ("intel/genxml,anv,iris: Drop the legacy compute path from gen125.xml") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11489>	2021-06-21 19:40:14 +00:00
Lionel Landwerlin	e9e1e0362b	anv: implement VK_EXT_physical_device_drm v2: add docs update error messages (Sagar) v3: Use fstat() (Jason) v4: Do fstat() on demand (Jason) v5: clear major/minor values if not present (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8649>	2021-06-21 19:18:07 +00:00
Rhys Perry	ea68d4a676	nir/propagate_invariant: add invariant_prim option Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11035>	2021-06-21 15:13:05 +00:00
Emma Anholt	990c232603	nir: Add an interface for logging shaders with mesa_log*. For debug on Android, it's useful to be able to print shaders to the android log interface, since you don't usually have stdout/stderr. Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9262>	2021-06-18 18:18:35 +00:00
Jason Ekstrand	6b15ce9fa2	docs/isl: Add detailed documentation about tiling on Intel GPUs Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11366>	2021-06-18 13:03:48 +00:00
Jason Ekstrand	0f6ebd2b73	docs/isl: Add detailed documentation about isl formats Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11366>	2021-06-18 13:03:48 +00:00
Jason Ekstrand	d07ec294bb	isl: Document more members of isl_surf Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11366>	2021-06-18 13:03:48 +00:00
Vinson Lee	5f771134ad	intel/vec4: Add missing break statement. Fix defect reported by Coverity Scan. Missing break in switch (MISSING_BREAK) unterminated_case: The case for value VEC4_OPCODE_ZERO_OOB_PUSH_REGS is not terminated by a break statement. Fixes: `89fd196f6b` ("intel/vec4: Add support for masking pushed data") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11347>	2021-06-18 05:02:23 +00:00
Yiwei Zhang	87f35032a6	anv: enable multi-planar support for drm format modifier This patch only enables the below VkFormat: - VK_FORMAT_G8_B8R8_2PLANE_420_UNORM This patch ensures the proper behavior of the below APIs: - vkGetPhysicalDeviceFormatProperties2 - vkGetPhysicalDeviceImageFormatProperties2 - vkCreateImage - vkGetImageSubresourceLayout - vkGetImageDrmFormatModifierPropertiesEXT - vkGetImageMemoryRequirements - vkGetImageMemoryRequirements2 Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chad Versace <chad@kiwitree.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11281>	2021-06-18 01:04:15 +00:00
Yiwei Zhang	ec70092d55	anv: support multi-planar format in add_all_surfaces_explicit_layout Add initial multi-planar format support on the images with modifiers: - With aux usage, - Format plane count must be 1. - Memory plane count must be 2. - Without aux usage, - Each format plane must map to a distinct memory plane. For the other cases, currently there is no way to properly map memory planes to format planes and aux planes due to the lack of defined ABI for external multi-planar images. This patch doesn't include some potentially supported cases like all format planes mapping to a single memory plane, additional refactoring is needed to workaround explicit base offset + ANV_OFFSET_IMPLICIT. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chad Versace <chad@kiwitree.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11281>	2021-06-18 01:04:15 +00:00
Yiwei Zhang	345a4e9871	anv: fix some log formats Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chad Versace <chad@kiwitree.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11281>	2021-06-18 01:04:15 +00:00
Kenneth Graunke	0510a947ba	anv: Fix dynamic primitive topology for tess on Gfx7.x too Commit `24342e499b` changed how primitive topology is handled on Gfx8+ but missed updating the Gfx7.x code. As a result, tests which previously used topologies like PATCHLIST_3 instead started using bogus ones like LINESTRIP_ADJ. This caused a GPU hangs in a bunch of Vulkan conformance tests involving tessellation. This fixes those hangs. Fixes: `24342e499b` ("anv: fix dynamic primitive topology for tess") Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11434>	2021-06-17 02:00:32 +00:00
Yiwei Zhang	357e7ae30b	anv: fix build errors after commit `8b7ff78` Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Roman Stratiienko <r.stratiienko@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11373>	2021-06-16 19:55:48 +00:00
Mike Blumenkrantz	24342e499b	anv: fix dynamic primitive topology for tess this needs to use the pre-converted topology using tess state patch control points Fixes: `f6fa4a8000` ("anv: add support for dynamic primitive topology change") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11332>	2021-06-16 13:45:15 +00:00
Jason Ekstrand	7bf0c762c3	anv: Agressively no-op Flush/InvalidateMappedMemoryRanges This has two steps. First, for each range we look at the memory object and see if it actually needs flushing before we start throwing CLFLUSH instructions. Second, we look at the whole list of types on device initialization and decide whether or not we need CLFLUSH at all. The first part should speed up atom chips a bit since we're currently CLFLUSHing everything even when we don't need to. The second isn't needed on most of today's parts because we base it on !has_llc but it is needed for discrete parts. It's also over-all cleaner. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Tested-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11364>	2021-06-15 21:00:37 +00:00
Felix DeGrood	a49b145e8d	anv: Replace DC Flush with HDC Pipeline Flush HDC Pipeline Flush is the correct method for flushing HDC pipeline on Gfx12+ HW. Continue using DC Flush for earlier HW. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9834>	2021-06-15 12:57:42 +00:00
Felix DeGrood	ef70388a3a	anv: Add ANV_PIPE_HDC_PIPELINE_FLUSH_BIT Gfx12+ PIPE_CONTROL bit for flushing HDC cache and memory transactions to L3 cache. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9834>	2021-06-15 12:57:42 +00:00
Felix DeGrood	82952deb8b	anv: Only flush Tile Cache on VK_ACCESS_HOST_R/W Tile Cache flush flushes all Color/Depth values from L3 cache to memory in Unified Cache mode. This is only required when CPU access is required. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9834>	2021-06-15 12:57:42 +00:00
Felix DeGrood	6f26a51f47	anv: remove unnecessary Tile Cache flushes On Gfx12+, flushing tile cache ensures color/depth values are globally visible, but that's expensive. Most operations only need values to be GT-visible which can be achieved with depth or rt flush. Remove a bunch of unnecessary Tile Cache flushes. Fast clears and slow depth clears still require Tile Cache flush. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9834>	2021-06-15 12:57:42 +00:00
Felix DeGrood	a7bb74db7b	anv: Remove Tile Cache flush from SBA, Pipe Select Tile Cache flushing not required for State Base Address or Pipe Select instructions. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9834>	2021-06-15 12:57:42 +00:00
Felix DeGrood	1da9ff047e	anv: Clear all pending stall after pipe flush Was only clearing CS stalls after emitting pending pipe controls. Need to clear all stalls. Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9834>	2021-06-15 12:57:42 +00:00
Felix DeGrood	fc5cb54008	anv: Add debug messages for DEBUG_PIPE_CONTROL Enable with INTEL_DEBUG=pc. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9834>	2021-06-15 12:57:42 +00:00
Felix DeGrood	6c345ddbe4	anv: Cache VB/IB in L3$ for Gfx12 Gfx12 enables caching of Vertex and Index Buffers in L3. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9834>	2021-06-15 12:57:42 +00:00
Dave Airlie	f3630548f1	crocus: initial gallium driver for Intel gfx 4-7 This is a gallium driver for the Intel gfx 4-7 GPUs. It was initially cloned from the iris driver by Ilia Mirkin, then I ported over large reams of code from i965 until it worked. Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11146>	2021-06-14 06:34:05 +10:00
Dave Airlie	8da92b5c0a	intel/compiler: add flag to indicate edge flags vertex input is last 965 and the mesa st disagree on how vertex elements are ordered when edgeflags are involved. 965 wants them in gl_vert_attrib order, but gallium supplies the edgeflag as the last vertex element regardless. This adds a flag which is enabled for gen4/5 to denote that the edgeflag is at the end. When we reap 965 later we can resolve this better. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11146>	2021-06-14 06:05:18 +10:00
Dave Airlie	5f03570eaa	intel: reorder base program key. This gets hashed a lot, this reduces the size of this, and the other keys by a small amount Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11146>	2021-06-14 06:05:17 +10:00
Dave Airlie	4de8ba15b4	intel/decode: handle gen4/5 WM state fragment shaders Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11146>	2021-06-14 06:05:17 +10:00
Yiwei Zhang	a0122385f0	anv: fix Android WSI VkFence Fix an obvious one-liner bug. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11344>	2021-06-13 06:12:31 +00:00
Jason Ekstrand	e23b55c3f0	i965: Use nir_lower_passthrough_edgeflags Now that there's a common NIR pass, there's no point in us doing this in the back-end anymore. In order to use this pass in i965, we do have to make one tiny change. Gallium runs the pass after assigning input and output locations and so needs the pass to respect those locations and num_inputs. i965, however, runs it before any location assignment or I/O lowering so we don't care. We do, however, need the pass to succeed with num_inputs == 0 because we set that later. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11313>	2021-06-11 21:19:06 +00:00
Jason Ekstrand	00dc4e0d83	intel/isl: Use a 4D physical total extent for size calculations With Yf and Ys tiling, everything is actually four dimensional because we can have multiple depth or multisampled array slices in the same tile. This commit just enhances the calculations so they can handle it. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11330>	2021-06-11 20:51:21 +00:00
Jason Ekstrand	dc764916d9	intel/isl: Make tile logical extents four dimensional Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11330>	2021-06-11 20:51:21 +00:00
Jason Ekstrand	a4dafe1fad	intel/isl: Make the offset helpers four dimensional We need to do this in order to handle Yf and Ys tiling because they use a four-dimensional tile instead of laying everything out in two dimensions. v2 (Jason Ekstrand): - Update functions added since v1: - isl_surf_get_image_range_B_tile - blorp_can_hiz_clear_depth - get_image_offset_el Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> (v1) Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11330>	2021-06-11 20:51:21 +00:00
Jason Ekstrand	3f7f6b878c	intel/blorp: Use isl_surf_get_image_offset_B_tile_el in ccs_ambiguate Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11330>	2021-06-11 20:51:21 +00:00
Jason Ekstrand	0eb3d0a8df	intel/isl: Add a isl_surf_get_image_offset_B_tile_el helper Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11330>	2021-06-11 20:51:21 +00:00
Jason Ekstrand	ddf970af88	anv/blorp: Optimize addresses/relocations when ANV_ALWAYS_SOFTPIN Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11236>	2021-06-08 22:53:22 +00:00
Jason Ekstrand	c7e1488037	anv: Optimize anv_address_physical when ANV_ALWAYS_SOFTPIN Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11236>	2021-06-08 22:53:22 +00:00
Jason Ekstrand	921bd2d1c7	anv: Fast-path surface relocs when we have softpin Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11236>	2021-06-08 22:53:22 +00:00
Jason Ekstrand	6afc3f97b6	anv: Make anv_batch_emit_reloc inline and optimize SKL+ This should drop the CPU overhead of processing buffers on SKL+ by dropping some of the logic contained in anv_reloc_list_add() whenever we have enough compile-time information to know we have softpin. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11236>	2021-06-08 22:53:22 +00:00
Jason Ekstrand	9802a0d7ca	anv: Add a helper to add a BO to the batch list without a reloc The relocation list currently serves two purposes. One is for relocations on older non-softpin platforms. The second is to keep track of driver-managed BOs which are used by the given command buffer. We going to need a mechanism to add BOs to the command buffer without doing a relocation into the batch. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11236>	2021-06-08 22:53:22 +00:00
Jason Ekstrand	7e8c28383c	anv: Handle OOM in the pinned path in anv_reloc_list_add Fixes: `b3c0b1b218` "anv: Use a bitset for tracking residency" Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11236>	2021-06-08 22:53:22 +00:00
Jason Ekstrand	a63e97e09a	anv: Make use_softpin compile-time in genX code Whenever we have the GFX_VERx10 macro available, we can make use_softpin a compile-time thing for everything but Broadwell and Cherryview. This should save us some CPU cycles especially on SKL+. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11236>	2021-06-08 22:53:22 +00:00
Jason Ekstrand	04ccfeae98	anv: Require softpin on Gen8+ Softpin was added to i915 in commit 506a8e87d8d2746b9e9d2433503fe237c54e4750 Author: Chris Wilson <chris@chris-wilson.co.uk> Date: Tue Dec 8 11:55:07 2015 +0000 drm/i915: Add soft-pinning API for execbuffer which was included in Linux 4.5. It's been over 5 years so it's probably reasonable to make it a hard requirement. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11236>	2021-06-08 22:53:22 +00:00
Caio Marcelo de Oliveira Filho	7c1c9e935e	anv: Support workgroup memory in other shaders Mesh and Task shaders can use workgroup memory, so generalize its handling in anv by moving it from anv_pipeline_compile_cs() to anv_pipeline_lower_nir(). Update Pipeline Statistics accordingly. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11230>	2021-06-08 11:30:39 -07:00

1 2 3 4 5 ...

6947 Commits