KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Jason Ekstrand	b1311a48e0	intel/isl: Allow multisampling with ISL_FORMAT_HiZ HiZ buffers can be multisampled and, on Broadwell and earlier, simply using interleaved multisampling with a compression block size of 8x4 samples yields the correct HiZ surface size calculations. Unfortunately, choose_msaa_layout was rejecting multisampled HiZ buffers because of format checks. Now that we have a simple helper for determining if a format supports multisampling, that's an easy enough issue to fix. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Chad Versace <chadversary@chromium.org> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2016-10-03 14:53:01 -07:00
Jason Ekstrand	baade41a5c	intel/isl: Allow creation of 1-D compressed textures Compressed 1-D textures are not well-defined thing in either GL or Vulkan. However, auxiliary surfaces are treated as compressed textures in ISL and we can do HiZ and CCS with 1-D so we need to be able to create them. In order to prevent actually using them (the docs say no), we assert in the state setup code. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2016-10-03 14:53:01 -07:00
Jason Ekstrand	f82166578f	intel/isl: Fix up asserts in calc_phys_level0_extent_sa The assertion that a format is uncompressed in the multisample layouts isn't quite right. What we really want to assert is that the format supports multisampling which is a bit more complicated query. We also want to assert that it has a block size of 1x1 since we do nothing with the block size in the phys_level0_sa assignment. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Chad Versace <chadversary@chromium.org> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2016-10-03 14:53:01 -07:00
Jason Ekstrand	5637f3f120	intel/isl: Add a format_supports_multisampling helper Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Chad Versace <chadversary@chromium.org> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2016-10-03 14:53:01 -07:00
Ville Syrjälä	2fef0d108a	anv/formats: Fix build on gcc-4 and earlier gcc-4 and earlier don't allow compound literals where a constant is required in -std=c99/gnu99 mode, so we can't use ISL_SWIZZLE() when populating the anv_formats[] array. There are a few ways around it: First one would be -std=c89/gnu89, but the rest of the code depends on c99 so it's not really an option. The second option would be to upgrade to gcc-5+ where the compiler behaviour was relaxed a bit [1]. And the third option is just to avoid using compound literals. I chose the last option since it keeps gcc-4 and earlier working. [1] https://gcc.gnu.org/gcc-5/porting_to.html Cc: Jason Ekstrand <jason@jlekstrand.net> Cc: Topi Pohjolainen <topi.pohjolainen@intel.com> Fixes: `7ddb21708c` ("intel/isl: Add an isl_swizzle structure and use it for isl_view swizzles") Signed-off-by: Ville Syrjälä <ville.syrjala@linux.intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-10-03 15:45:28 +03:00
Timothy Arceri	eaf147cb46	i965: rename max_ds_* variable to max_tes_* Using consistent naming allows us to create macros more easily. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-10-03 15:29:58 +11:00
Timothy Arceri	b67633ce5e	i965: rename max_hs_* variables to max_tcs_* Using consistent naming allows us to create macros more easily. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-10-03 15:29:51 +11:00
Sirisha Gandikota	8e3e9d74b5	aubinator: Fix the decoding of values that span two Dwords Fixed the way the values that span two Dwords are decoded. Based on the start and end indices of the field, the Dwords are fetched and decoded accordingly. v2: rename dw to qw in gen_field_iterator_next and remove extra white space (Anuj) v3: change all instances of dw to qw (Anuj) Earlier, 64-bit fields (such as most pointers on Gen8+) weren't decoded correctly. gen_field_iterator_next seemed to walk one DWord at a time, sets v.dw, and then passes it to field(). So, even though field() takes a uint64_t, we're passing it a uint32_t (which gets promoted, so the top 32 bits will always be zero). This seems pretty bogus... (Ken) Signed-off-by: Sirisha Gandikota <Sirisha.Gandikota@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-09-26 11:18:52 -07:00
Nayan Deshmukh	b3827819aa	aubinator: fix resource leak CovID: 1373370 Signed-off-by: Nayan Deshmukh <nayan26deshmukh@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-09-25 12:32:48 -07:00
Nicolas Koch	f17948a30a	anv: Check for VK_WHOLE_SIZE in anv_CmdFillBuffer From the Vulkan spec: Size is the number of bytes to fill, and must be either a multiple of 4, or VK_WHOLE_SIZE to fill the range from offset to the end of the buffer. If VK_WHOLE_SIZE is used and the remaining size of the buffer is not a multiple of 4, then the nearest smaller multiple is used. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-09-23 00:20:16 -07:00
Lionel Landwerlin	6b21728c4a	anv: get rid of duplicated values from gen_device_info Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-09-23 10:12:06 +03:00
Lionel Landwerlin	bc24590f0c	intel/i965: make gen_device_info mutable Make gen_device_info a mutable structure so we can update the fields that can be refined by querying the kernel (like subslices and EU numbers). This patch does not make any functional change, it just makes gen_get_device_info() fill a structure rather than returning a const pointer. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-09-23 10:11:59 +03:00
Lionel Landwerlin	b8162d6b6e	anv: pipeline: use correct number of thread for compute Reproduces this commit : commit `0fb85ac08d` Author: Kenneth Graunke <kenneth@whitecape.org> Date: Mon Jun 6 21:37:34 2016 -0700 i965: Use the correct number of threads for compute shaders. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-09-21 12:01:06 +03:00
Lionel Landwerlin	f2d43b44d7	anv: allocator: correct scratch space for haswell This reproduces this commit : commit `2213ffdb4b` Author: Kenneth Graunke <kenneth@whitecape.org> Date: Mon Jun 6 21:37:34 2016 -0700 i965: Allocate scratch space for the maximum number of compute threads. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-09-21 12:01:06 +03:00
Lionel Landwerlin	09394ee6cf	anv: device: calculate compute thread numbers using subslices numbers Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-09-21 12:01:06 +03:00
Lionel Landwerlin	792d77165b	aubinator: add a custom handler for immediate register load Transforming this : 0x00c77084: 0x11000001: MI_LOAD_REGISTER_IMM 0x00c77088: 0x0000b020 : Dword 1 Register Offset: 0x0000b020 0x00c7708c: 0x00880038 : Dword 2 Data DWord: 8912952 Into this: 0x007880f0: 0x11000001: MI_LOAD_REGISTER_IMM 0x007880f4: 0x0000b020 : Dword 1 Register Offset: 0x0000b020 0x007880f8: 0x00080040 : Dword 2 Data DWord: 524352 register L3CNTLREG2 (0xb020) : 0x80040 SLM Enable: 0 URB Allocation: 32 URB Low Bandwidth: 0 RO Allocation: 32 RO Low Bandwidth: 0 DC Allocation: 0 DC Low Bandwidth: 0 v2: Drop unused arguments (Sirisha) Print out register name Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2016-09-20 10:47:21 +01:00
Kenneth Graunke	081f21f29b	isl: Finish tiling filtering for Gen6. Gen6 only has one additional restriction over Gen7+, so we just add it to the existing gen7 function (which actually covers later gens too). This should stop FINISHME spew when running GL on Sandybridge. v2: Fix bytes per block vs. bits per block confusion (Jason) and rename function to gen6_filter_tiling (Jason and Chad). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-09-15 21:21:50 -07:00
Jason Ekstrand	ed65e6ef49	nir: Add a flag to lower_io to force "sample" interpolation Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-09-15 13:31:43 -07:00
Jason Ekstrand	89a96c8f43	anv/cmd_buffer: Set the L3 atomic disable mask bit in CHICKEN3 on HSW Without this bit set, the value in "L3 Atomic Disable" won't get applied by the hardware so we won't properly get L3 atomic caching. Fixes dEQP-VK.spirv_assembly.instruction.compute.opatomic.compex and 198 of the dEQP-VK.image.atomic_operations.* tests on HSW Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2016-09-14 17:53:16 -07:00
Jason Ekstrand	a814e18c96	intel/blorp: Stop setting 3DSTATE_DRAWING_RECTANGLE The Vulkan driver sets 3DSTATE_DRAWING_RECTANGLE once to MAX_INT x MAX_INT at the GPU initialization time and never sets it again. The GL driver sets it every time the framebuffer changes. Originally, blorp set it to the size of the drawing area but meant we had to set it back in the Vulkan driver. Instead, we can easily just do that in the GL driver's blorp_exec implementation and not set it in blorp core. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2016-09-14 17:51:16 -07:00
Jason Ekstrand	b56f509ee0	intel/blorp: Emit 3DSTATE_MULTISAMPLE directly Previously, we relied on a driver hook for 3DSTATE_MULTISAMPLE. However, now that Vulkan and GL use the same sample positions, we can set up 3DSTATE_MULTISAMPLE directly in blorp and delete the driver hook. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2016-09-14 17:51:16 -07:00
Jason Ekstrand	c779ad3e66	intel: Move Vulkan sample positions to common code Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2016-09-14 17:51:16 -07:00
Sirisha Gandikota	aa7b410592	aubinator: Remove bogus "end" parameter in gen_disasm_disassemble() Earlier, the loop pretends to loop over instructions from "start" to "end", but the callers always pass 8192 for end, which is some huge bogus value. The real loop termination condition is send-with-EOT or 0. (Ken) Signed-off-by: Sirisha Gandikota <Sirisha.Gandikota@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-09-13 16:32:42 -07:00
Sirisha Gandikota	1ab92d80a8	aubinator: Make gen_disasm_disassemble handle split sends Skylake adds new SENDS and SENDSC opcodes, which should be handled in the send-with-EOT check. Make an is_send() helper that checks if the opcode is SEND/SENDC/SENDS/SENDSC (Ken) v2: Make is_send() much more crispier, Mix declaration and code to make the code compact (Ken) Signed-off-by: Sirisha Gandikota <Sirisha.Gandikota@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-09-13 16:32:39 -07:00
Sirisha Gandikota	5d2440532f	aubinator: Simplify print_dword_val() method Remove the float/dword union and use the iter->p[f->start / 32] directly as printf formatter %08x expects uint32_t (Ken) v2: Make the cleanup much more crispier (Ken) Signed-off-by: Sirisha Gandikota <Sirisha.Gandikota@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-09-13 16:32:24 -07:00
Jason Ekstrand	1eebb60917	anv/image: Set correct base_array_layer and array_len for storage images Since Vulkan doesn't allow single-slice 3D storage images, we need to just set the base_array_layer and array_len to the full size of the 3-D LOD. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net>	2016-09-13 14:45:49 -07:00
Jason Ekstrand	106709db7b	Revert "intel/isl: Ignore base_array_layer and array_len for 3D storage..." This reverts commit `3943888c94`. It turns out that commit was pretty-much bogus since it breaks binding a 3-D texture as a 2-D storage image. The correct fix for the Vulkan CTS tests needs to be in the Vulkan driver itself rather than ISL. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net>	2016-09-13 14:45:15 -07:00
Jason Ekstrand	330104464f	anv: Use blorp for doing MSAA resolves Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2016-09-13 12:40:13 -07:00
Jason Ekstrand	6bcb1f753e	anv: Use blorp for ClearColorImage Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2016-09-13 12:40:13 -07:00
Jason Ekstrand	57e87862eb	anv: Delete meta_blit2d Everything that we were once using the blit2d framework for is now done with blorp. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2016-09-13 12:40:13 -07:00
Jason Ekstrand	36286ccb96	anv/blorp: Add a gcd_pow2_u64 helper and use it for buffer alignments This is a lot cleaner and easier to read than the old piles of if statements. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2016-09-13 12:40:13 -07:00
Jason Ekstrand	af5d30de55	anv: Use blorp for CopyBuffer and UpdateBuffer Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2016-09-13 12:40:13 -07:00
Jason Ekstrand	0f1ca5407a	anv: Use blorp for CopyImage Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2016-09-13 12:40:12 -07:00
Jason Ekstrand	58593f24cb	anv: Use blorp for CopyBufferToImage Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2016-09-13 12:40:12 -07:00
Jason Ekstrand	f07f44a5bc	anv: Use blorp for CopyImageToBuffer Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2016-09-13 12:40:12 -07:00
Jason Ekstrand	9f44745eca	anv: Use blorp to implement VkBlitImage Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2016-09-13 12:40:12 -07:00
Jason Ekstrand	52fa3e8347	anv: Make image_get_surface_for_aspect_mask const Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2016-09-13 12:40:12 -07:00
Jason Ekstrand	8f780af968	anv: Add initial blorp support Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-13 12:40:12 -07:00
Jason Ekstrand	1fe8bf82b2	intel/anv: Use #defines for all __gen_ helpers This allows us to #undef them later if we don't want them to persist Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-13 12:40:12 -07:00
Jason Ekstrand	4a6c9e20b8	anv: Generalize emit_urb_setup Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-13 12:40:12 -07:00
Jason Ekstrand	8cb144bd93	anv/pipeline: Roll compute_urb_partition into emit_urb_setup Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-13 12:40:12 -07:00
Jason Ekstrand	823ab83432	intel/blorp: Use #defines for all __gen_ helpers This allows us to #undef them later if we don't want them to persist Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-13 12:40:12 -07:00
Jason Ekstrand	c0b9776cd6	intel/isl: Divide QPitch by 2 for 3-D stencil textures on SKL+ Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Chad Versace <chadversary@chromium.org>	2016-09-13 12:40:12 -07:00
Jason Ekstrand	00e79cec99	isl/state: Don't set QPitch for GEN4_3D surfaces Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Chad Versace <chadversary@chromium.org>	2016-09-13 12:40:12 -07:00
Jason Ekstrand	cb780c9ccf	intel/blorp: Rework alloc_binding_table The original blorp_alloc_binding_table helper was supposed to return the binding table offset and map along with the surface state maps. This isn't quite what we want, however. What we really want is the binding table offsets, surface state offsets, and surface state maps. In the GL driver, the binding table map is an array of surface state offsets. However, in Vulkan, this isn't quite true as the entries in the binding table are surface state offsets combined with another binding table block offset. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-13 12:40:11 -07:00
Jason Ekstrand	6ac469a6c3	anv/allocator: Use VG_NOACCESS_WRITE in anv_bo_pool_free Previously, we were relying on the fact that VALGRIND_MEMPOOL_FREE came later on in the function to prevent "link->bo = bo" from causing an invalid write. However, in the case where the size requested by the user is very small (less than sizeof(struct anv_bo)), this isn't sufficient. Instead, we should call VALGRIND_MEMPOOL_FREE early and then use VG_NOACCESS_WRITE. We do, however, have to call VALGRIND_MEMPOOL_FREE after reading bo_in because it may be stored in the bo itself. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net>	2016-09-13 10:44:03 -07:00
Jason Ekstrand	3943888c94	intel/isl: Ignore base_array_layer and array_len for 3D storage surfaces The time we want to restrict the Z range of a 3-D surface is when rendering to it. For storage surfaces, we always want he full range. However, we still need to set MinimumArrayElement and RenderTargetViewExtent to sensible values so we'll just set them to the reasonable defaults we used before we started respecting the base_array_layer and array_len. This fixes a bunch of Vulkan CTS regressions caused by `48f195d7c6`. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=97790 Reviewed-by: Chad Versace <chadversary@chromium.org>	2016-09-13 10:43:21 -07:00
Jason Ekstrand	d038adca0e	intel/isl: Add support for RGB formats in X and Y-tiled memory Normally, using a non-linear tiling format helps improve cache locality by ensuring that neighboring pixels are usually close-by in memory. For RGB formats, this still sort-of holds, but it can also lead to rather terrible memory access patterns where a single RGB pixel value crosses a tile boundary and gets split into two pieces in different 4K pages. It also makes for some rather awkward calculations because your tile size is no longer an even multiple of surface element size. For these reasons, we chose to simply never create tiled RGB images in the Vulkan driver. The GL driver, however, is not so kind so we need to support it somehow. I briefly toyed with a couple of different schemes but this is the best one I could come up with. The fundamental problem is that a tile no longer contains an integer number of surface elements. I briefly considered a couple other options but found them wanting: 1) Using floats for the logical tile size. This leads to potential rounding error problems. 2) When presented with a RGB format, just make the tile 3-times as wide. This isn't so nice because now our tiles are no longer power-of-two size. Also, it can force the row_pitch to be larger than needed which, while not strictly a problem for ISL, causes incompatibility problems with the way the GL driver chooses surface pitches. The chosen method requires that you pay attention and not just assume that your tile_info is in the units you think it is. However, it's nice because it provides a nice "these are the units" declaration in isl_tile_info itself. Previously, the tile_info wasn't usable as a stand-alone structure because you had to also know the format. It also forces figuring out how to deal with inconsistencies between tiling and format back to the caller which is good because the two different consumers of isl_tile_info really want to deal with it differently: Computation of the surface size wants the fewest number of horizontal tiles possible while get_intratile_offset is far more concerned with things aligning nicely. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Chad Versace <chadversary@chromium.org>	2016-09-12 19:44:05 -07:00
Jason Ekstrand	883086500b	intel/isl: Allow valign2 for texture-only Y-tiled surfaces on gen7 The restriction that Y-tiled surfaces must have valign == 4 only aplies to render targets but we were applying it universally. This causes problems if ISL_FORMAT_R32G32B32_FLOAT is used because it requires valign == 2; this should be okay because you can't render to that format. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Chad Versace <chadversary@chromium.org>	2016-09-12 19:44:05 -07:00
Jason Ekstrand	54db5afd2c	intel/blorp: Work in terms of logical array layers When Ivy Bridge introduced array multisampling, someone made the decision to do lots of stuff throughout the driver in terms of physical array layers rather than logical array layers. In ISL, we use logical array layers most of the time and it really makes no sense to use physical array layers in the blorp API. Every time someone passes physical array layers into blorp for an array multisampled surface, they're always divisible by the number of samples and we divide right away. Eventually, I'd like to rework most of the GL driver internals to use logical array layers but that's going to be a big project and will probably happen as part of the ISL conversion. For now, we'll do the conversion in brw_blorp and let blorp just use the logical layers. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-12 19:42:57 -07:00
Jason Ekstrand	fa4627149d	intel/blorp: Increase the presision of coordinate transform calculations The result of this calculation goes into an fma() in the shader and we would like it to be as precise as possible. The division in particular was a source of imprecision whenever dst1 - dst0 was not a power of two. This prevents regressions in some of the new Vulkan CTS tests for blitting using a filtering of NEAREST. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-09-12 19:42:57 -07:00
Jason Ekstrand	c70be1ead5	intel/blorp: Add a swizzle parameter to blorp_clear While we're here, we also re-arrange the parameters to better match the parameter order of blorp_blit. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-12 19:42:57 -07:00
Jason Ekstrand	ea1399aba0	intel/blorp: Make color_write_disable const and optional Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-12 19:42:57 -07:00
Jason Ekstrand	9286f62f11	intel/blorp: Add support for clearing R9G9B9E5 surfaces Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-12 19:42:57 -07:00
Jason Ekstrand	ab03e59867	intel/blorp: Add support for RGB destinations in copies Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-12 19:42:57 -07:00
Jason Ekstrand	5ae8043fed	intel/blorp: Add an entrypoint for doing bit-for-bit copies Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-12 19:42:57 -07:00
Jason Ekstrand	941b4d063a	intel/blorp: Pull the guts of blorp_blit into a helper Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-12 19:42:57 -07:00
Jason Ekstrand	4e03edf189	intel/blorp: Stop using the X/YOffset field of RENDER_SURFACE_STATE While it can be useful, the field has substantial limtations. In particular, the bittom 2 or 3 bits is missing so your offset always has to be a multiple of 4 or 8. While surface alignments usually work out to make this ok, when you start trying to fake compressed surfaces as uncompressed (which we will want to do) this falls apart. The easiest solution is to simply align all offsets to a tile boundary and munge the regions we're copying to account for the intratile offset. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-12 19:42:57 -07:00
Jason Ekstrand	c170606fc6	intel/blorp: Use fake_interleaved_msaa in retile_w_to_y Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-12 19:42:57 -07:00
Jason Ekstrand	a613449f71	intel/blorp: Use isl_get_interleaved_msaa_px_size_sa Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-12 19:42:57 -07:00
Jason Ekstrand	8ac99eabb6	intel/isl: Add a helper for getting the size of an interleaved pixel Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-12 19:42:57 -07:00
Jason Ekstrand	3cc15ba5bb	intel/blorp: Handle 3D surfaces in convert_to_single_slice Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-12 19:42:57 -07:00
Jason Ekstrand	43d25edf78	intel/isl: Fix an assert in get_intratile_offset_sa Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-12 19:42:57 -07:00
Jason Ekstrand	6da968b651	intel/blorp: Fix the early return condition in convert_to_single_slice The convert_to_single_slice operation is mostly idempotent. The only non-repeatable thing it does is that, when it sets the intratile offset fields, it just overwrites them instead of doing a += operation. This is supposed to be ok because we have an early return at the top that should make it bail of the surface is already a single slice. Unfortunately, the if condition has been broken ever since it was first added in `96fa98c18`. This commit fixes the condition and adds an assert to ensure we don't stomp any non-zero intratile offsets. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-12 19:42:57 -07:00
Jason Ekstrand	ec7e0d62c5	intel/blorp: Use the surface format for computing offsets If we use the view format, it may be an uncompressed view of a compressed image which throws things off. Since we're computing offsets of images, we want the actual surface offset anyway. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-12 19:42:57 -07:00
Jason Ekstrand	7f2fecd114	intel/blorp: Don't assume R8_UINT in convert_to_single_slice We're going to use it for more than just stencil textures Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-12 19:42:57 -07:00
Jason Ekstrand	2fc9c7e3d9	intel/blorp: Take a destination swizzle in blorp_blit Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-12 19:42:57 -07:00
Jason Ekstrand	2dba5489ae	intel/blorp: Take an isl_swizzle instead of a SWIZZLE Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-12 19:42:57 -07:00
Jason Ekstrand	7ddb21708c	intel/isl: Add an isl_swizzle structure and use it for isl_view swizzles This should be more compact than the enum isl_channel_select[4] that we were using before. It's also very convenient because we already had such a structure in the Vulkan driver we just needed to pull it over. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-09-12 19:42:57 -07:00
Jason Ekstrand	2519237c24	intel/blorp: Handle the 512 layers restriction on Sandy Bridge Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-09-12 16:48:56 -07:00
Jason Ekstrand	48f195d7c6	intel/isl: Treat 3-D textures as 2-D arrays for rendering In particular, this means that isl_view::base_array_layer and isl_view::array_len get applied to 3-D textures but only when rendering. We were already applying isl_view::base_array_layer for rendering into 3-D textures so this isn't a huge deviation. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-09-12 16:48:56 -07:00
Sirisha Gandikota	63fe9ab894	aubinator: Simplify gen_disasm_create()'s devinfo handling Copy the whole devinfo structure instead of just few fields (Ken) Earlier, copied only couple of fields which added more code. So, simplify code by copying the whole structure. Signed-off-by: Sirisha Gandikota <Sirisha.Gandikota@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-09-12 16:20:04 -07:00
Sirisha Gandikota	d2869c95fb	aubinator: Fix compiler warning Add 'const' qualifier to gen_field_iterator::p pointer (Ken) Signed-off-by: Sirisha Gandikota <Sirisha.Gandikota@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-09-12 16:19:56 -07:00
Jason Ekstrand	e2fb044115	intel/blorp: Add a TODO file This provides a nice little place to share notes on what still needs to be done and/or would be nice to have in BLORP. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net>	2016-09-12 10:14:49 -07:00
Emil Velikov	ceaa2e1738	aubinator: rework print_help() Rather than using platform specific methods to retrieve the program name pass it explicitly. The function is called directly from main(). Similarly - basename comes in two versions POSIX (can modify string, always pass a copy) and GNU (never modifies the string). Just printout the complete program name, esp. since the program is not meant to be installed. Thus using $basename is unlikely to work, not to mention it is misleading. Reported-by: Timothy Arceri <timothy.arceri@collabora.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Jonathan Gray <jsg@jsg.id.au>	2016-09-12 16:49:59 +01:00
Topi Pohjolainen	a1c7de09dc	intel/blorp: Add plumbing for setting color clear layer count Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-09-12 11:48:29 +03:00
Topi Pohjolainen	514afdce95	intel/blorp: Allow multiple layers Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-09-12 11:48:29 +03:00
Topi Pohjolainen	e597821ef2	i965/blorp: Instruct vertex fetcher to provide prim instance id This will indicate target layer (Render Target Array Index) needed for layered clears. v2: Use 3DSTATE_VF_SGVS for gen8+ Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-09-12 11:48:29 +03:00
Topi Pohjolainen	024a39511f	isl/gen8+: Allow 1D and 3D auxiliary surfaces Otherwise once mcs buffer gets allocated without delay for lossless compression (same as we do for msaa), assert starts to fire in piglit case: tex3d. The test uses depth of one which is in fact supported even now. v2 (Jason): Allow also 1D case as there is nothing in the specs constraining it either. Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-09-12 11:48:29 +03:00
Jonathan Gray	d50c56f868	aubinator: only use program_invocation_short_name with glibc/cygwin program_invocation_short_name is a gnu extension. Limit use of it to glibc and cygwin and otherwise use getprogname() which is available on BSD and OS X. Signed-off-by: Jonathan Gray <jsg@jsg.id.au> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2016-09-08 18:37:02 +01:00
Jonathan Gray	2d3ebb474c	aubinator: include libgen.h for basename(3) Include libgen.h for basename as required by posix. The definition is not found on at least OpenBSD otherwise. Signed-off-by: Jonathan Gray <jsg@jsg.id.au> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2016-09-08 18:37:02 +01:00
Jonathan Gray	0ba9e281fc	aubinator: stop using non portable error() function error() is a gnu extension and is not present on OpenBSD and likely other systems. Convert use of error to fprintf/strerror/exit. Signed-off-by: Jonathan Gray <jsg@jsg.id.au> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2016-09-08 18:37:02 +01:00
Chad Versace	bad80c26e7	anv: Link to libX11-xcb only when unneeded The Makefile unconditionally linked libX11-xcb into libvulkan_intel.so. But it's needed only if HAVE_PLATFORM_X11. Fixes build of libvulkan_intel.so on Chromium OS, which has no X11 libraries. Fixes: `71258e9462` ("anv/x11: Add support for Xlib platform") Cc: Kevin Strasser <kevin.strasser@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2016-09-08 09:24:30 -07:00
Topi Pohjolainen	b863f4a39a	intel/blorp: Allow single slice converter to suppress number of layers Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-09-08 08:53:45 +03:00
Jason Ekstrand	20b2f1ecb9	anv/pipeline: Lower indirect outputs when EmitNoIndirectOutput is set Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reported-by: Timothy Arceri <timothy.arceri@collabora.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-09-06 08:27:23 -07:00
Mauro Rossi	98f734e758	android: intel: fix include paths in new "common" library Fixes building error in libmesa_intel_common static library Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-09-03 20:03:16 -07:00
Jason Ekstrand	7e891f90c7	anv: Move cmd_buffer_config_l3 into anv_cmd_buffer.c This is the only remaining part of genX_l3.c and there's really no good reason for it to be in its own file. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-09-03 08:23:07 -07:00
Jason Ekstrand	17968e2dfd	anv/cmd_buffer: Move emit_lri and emit_lrm higher up Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-09-03 08:23:07 -07:00
Jason Ekstrand	42d03c204c	anv: Refactor pipeline l3 config setup Now that we're using gen_l3_config.c, we no longer have one set of l3 config functions per gen and we can simplify a bit. Also, we know that only compute uses SLM so we don't need to look for it in all of the stages. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-09-03 08:23:07 -07:00
Jason Ekstrand	6448c0e324	anv: Leverage the shared L3$ config code When Jordan first implement L3$ configuration for Vulkan, he copied+pasted from the GL driver because we had no good place to share it. Now that we have src/intel/common, we should be sharing these tables. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-09-03 08:23:07 -07:00
Jason Ekstrand	49981891f7	intel: Pull the guts of gen7_l3_state.c into a shared helper Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-09-03 08:23:07 -07:00
Jason Ekstrand	979d0aca62	intel: Rename brw_get_device_name/info to gen_get_device_name/info Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-09-03 08:23:07 -07:00
Jason Ekstrand	527f371999	intel: s/brw_device_info/gen_device_info/ Generated by: sed -i -e 's/brw_device_info/gen_device_info/g' src/intel/*/.c sed -i -e 's/brw_device_info/gen_device_info/g' src/intel/*/.h sed -i -e 's/brw_device_info/gen_device_info/g' */i965/.c sed -i -e 's/brw_device_info/gen_device_info/g' */i965/.cpp sed -i -e 's/brw_device_info/gen_device_info/g' */i965/.h Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-09-03 08:23:06 -07:00
Jason Ekstrand	55364ab5b7	intel: Add a new "common" library for more code sharing The first thing to go in this new library is brw_device_info. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-09-03 08:23:06 -07:00
Mauro Rossi	4218c32166	intel/blorp: fix typo in android makefile Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2016-09-03 08:22:53 -07:00
Lionel Landwerlin	2dc6930a5a	isl: round format alignment to nearest power of 2 A few inline asserts in anv assume alignments are power of 2, but with formats like R8G8B8 we have odd alignments. v2: round up to power of 2 (Ilia) v3: reuse util_next_power_of_two() from gallium/aux/util/u_math.h (Ilia) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-09-01 11:36:09 +01:00
Jason Ekstrand	b8bff0823b	mesa: Add some more .gitignore	2016-08-31 13:45:27 -07:00
Timothy Arceri	64a48efb9e	aubinator: fix if indentation and add brackets to multiline body Fixes misleading indentation warning in gcc. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-08-31 10:19:45 +10:00
Jason Ekstrand	10f9901bce	anv: Rework pipeline caching The original pipeline cache the Kristian wrote was based on a now-false premise that the shaders can be stored in the pipeline cache. The Vulkan 1.0 spec explicitly states that the pipeline cache object is transiant and you are allowed to delete it after using it to create a pipeline with no ill effects. As nice as Kristian's design was, it doesn't jive with the expectation provided by the Vulkan spec. The new pipeline cache uses reference-counted anv_shader_bin objects that are backed by a large state pool. The cache itself is just a hash table mapping keys hashes to anv_shader_bin objects. This has the added advantage of removing one more hand-rolled hash table from mesa. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Cc: "12.0" <mesa-stable@lists.freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=97476 Acked-by: Kristian Høgsberg Kristensen <krh@bitplanet.net>	2016-08-30 15:08:23 -07:00
Jason Ekstrand	6899718470	anv: Add a struct for storing a compiled shader This new anv_shader_bin struct stores the compiled kernel (as an anv_state) as well as all of the metadata that is generated at shader compile time. The struct is very similar to the old cache_entry struct except that it is reference counted and stores the actual pipeline_bind_map. Similarly to cache_entry, much of the actual data is floating-size and stored after the main struct. Unlike cache_entry, which was storred in GPU-accessable memory, the storage for anv_shader_bin kernels comes from a state pool. The struct itself is reference-counted so that it can be used by multiple pipelines at a time without fear of allocation issues. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Cc: "12.0" <mesa-stable@lists.freedesktop.org> Acked-by: Kristian Høgsberg Kristensen <krh@bitplanet.net>	2016-08-30 15:08:23 -07:00
Jason Ekstrand	13c09fdd0c	anv: Add pipeline_has_stage guards a few places All of these worked before because they were depending on prog_data to be null. Soon, we won't be able to depend on a nice prog_data pointer and it's nice to be more explicit anyway. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-08-30 15:08:23 -07:00
Jason Ekstrand	b259d86ad6	anv: Remove unused fields from anv_pipeline_bind_map Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-08-30 15:08:23 -07:00
Jason Ekstrand	d5945bec12	anv/pipeline: Properly handle OOM during shader compilation Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-08-30 15:08:23 -07:00
Jason Ekstrand	a0f5c496e3	anv/allocator: Correctly set the number of buckets The range from ANV_MIN_STATE_SIZE_LOG2 to ANV_MAX_STATE_SIZE_LOG2 should be inclusive and we have asserts that ensure that you never try to allocate a state larger than (1 << ANV_MAX_STATE_SIZE_LOG2). However, without adding 1 to the difference, we allocate 1 too few bucckts and so, even though we have an assert, anything landing in the last bucket will fail to allocate properly.. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-08-30 15:08:23 -07:00
Jason Ekstrand	4200c2266e	anv/pipeline: Fix bind maps for fragment output arrays Found by inspection. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-08-30 15:08:23 -07:00
Jason Ekstrand	d316cec1c1	anv/descriptor_set: memset anv_descriptor_set_layout We hash this data structure so we can't afford to have uninitialized data even if it is just structure padding. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-08-30 15:08:23 -07:00
Eric Engestrom	aac91fffae	anv/wayland: fix assert typo Signed-off-by: Eric Engestrom <eric.engestrom@imgtec.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2016-08-30 13:47:51 -07:00
Eric Engestrom	4e68bb620f	anv/meta: fix unreachable() typo Signed-off-by: Eric Engestrom <eric.engestrom@imgtec.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2016-08-30 13:47:51 -07:00
Jason Ekstrand	9514c5a30f	intel/blorp: Inline get_vs_entry_size into emit_urb_config Topi asked to have the prefix removed because there's nothing gen7 about it. However, now that everything is in a single file, there is no good reason to have it split out into a helper function anyway. Let's just put the contents in emit_urb_config and call it a day. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-08-30 09:24:50 -07:00
Jason Ekstrand	f4314d06e8	isl/state: Add some asserts about format capabilities This keeps invalid surface states from leaking through and potentially hanging the GPU. We shouldn't actually be hitting this on a regular basis, but a helpful assert is better than a hang. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2016-08-29 12:17:34 -07:00
Jason Ekstrand	87214414fd	intel/blorp: Add a format parameter to blorp_fast_clear This allows us to use the actual render format as opposed to the texture format. I don't know that the hardware actually cares in the case of fast clears, but it certainly seems more correct. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-08-29 12:17:34 -07:00
Jason Ekstrand	348509269e	i965: Move blorp into src/intel/blorp At this point, blorp is completely driver agnostic and can be safely moved into its own folder. Soon, we hope to start using it for doing blits in the Vulkan driver. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-08-29 12:17:34 -07:00
Mauro Rossi	cd18bbeef3	android: intel: Flatten the makefile structure Android porting of commit `bebc1a1` "intel: Flatten the makefile structure" Automake approach was followed, by moving makefiles a level up, naming them Android.genxml.mk and Android.isl.mk, performing the necessary adjustments to the paths, adding src/intel/Android.mk and fixing mesa top level makefile. Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2016-08-29 12:17:34 -07:00
Jason Ekstrand	fb89551047	isl: Allow multisampled array textures This probably isn't the only thing that needs to be done to get multisampled array textures working in Vulkan but I think this is all that ISL really needs and it does fix 8 of the new CTS tests. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Cc: "12.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Chad Versace <chadversary@chromium.org>	2016-08-26 19:00:02 -07:00
Kenneth Graunke	93bfa1d7a2	nir: Change nir_shader_get_entrypoint to return an impl. Jason suggested adding an assert(function->impl) here. All callers of this function actually want ->impl, so I decided just to change the API. We also change the nir_lower_io_to_temporaries API here. All but one caller passed nir_shader_get_entrypoint(), and with the previous commit, it now uses a nir_function_impl internally. Folding this change in avoids the need to change it and change it back. v2: Fix one call I missed in ir3_compiler (caught by Eric). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2016-08-25 19:18:24 -07:00
Jason Ekstrand	bebc1a1d99	intel: Flatten the makefile structure This pulls isl and genxml into a single make file so that they can properly build in parallel. This isn't terribly important now as genxml just generates sources which happens serially first anyway but it will be more important as we add more stuff to src/intel. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-08-25 15:29:48 -07:00
Jason Ekstrand	c19fc5e019	isl/tests: Use a longer path for isl.h The tests assumed that isl would be in the include path but that usually isn't the case. Instead, we usually have src/intel and you need to add an "isl/" prefix. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-08-25 15:29:47 -07:00
Jason Ekstrand	8bdf605214	intel/isl/gen9: Only use the magic 1D alignment for GEN9_1D surfaces If the surface has a layout of GEN4_2D then we need to compute a normal 2D alignment and not use the magic linewar 1D alignment. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Chad Versace <chadversary@chromium.org>	2016-08-25 14:11:15 -07:00
Jason Ekstrand	cda1a5dc0e	intel/isl: Pass the dim_layout into choose_alignment_el Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Chad Versace <chadversary@chromium.org>	2016-08-25 14:10:43 -07:00
Jason Ekstrand	f68cfb05fa	intel/isl: Use DIM_LAYOUT_GEN4_2D for tiled 1-D surfaces on SKL The Sky Lake 1D layout is only used if the surface is linear. For tiled surfaces such as depth and stencil the old gen4 2D layout is used. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Chad Versace <chadversary@chromium.org>	2016-08-25 14:09:44 -07:00
Jason Ekstrand	2301705dee	anv: Include the pipeline layout in the shader hash The pipeline layout affects shader compilation because it is what determines binding table locations as well as whether or not a particular buffer has dynamic offsets. Since this affects the generated shader, it needs to be in the hash. This fixes a bunch of CTS tests now that the CTS is using a pipeline cache. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net> Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-08-24 20:42:05 -07:00
Jason Ekstrand	05f36435ef	anv: Add a --disable-vulkan-icd-full-driver-path option This option makes installed Vulkan ICD files contain only a driver library name and not a path. This is intended for distros to help them work around multi-arch issues. Reviewed-by: Dave Airlie <airlied@redhat.com>	2016-08-25 10:32:31 +10:00
Lionel Landwerlin	91987c51e3	anv: meta_blit2d: adapt texel fetch pitch for fake w-tiled We need to compute detiling coordinates using the physical size of W tiling (128x32) rather than the logical size (64x64). v2: Correct comment (Jason) Fixes dEQP-VK.api.copy_and_blit.image_to_image_stencil Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=97448 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-08-24 11:29:23 -07:00
Kenneth Graunke	e7530bfcd6	aubinator: Style fixes. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2016-08-23 21:19:58 -07:00
Sirisha Gandikota	56ba9656bb	aubinator: Fix the tool to correctly decode the DWords Several fixes have been added as part of this as listed below: 1) Fix the mask and add disassembler handling for STATE_DS, STATE_HS as the mask returned wrong values of the fields. 2) Fix the GEN_TYPE_ADDRESS/GEN_TYPE_OFFSET decoding - the address/ offset were handled the same way as the other fields and that gives the wrong values for the address/offset. 3) Decode nested/recurssive structures - Many packets contain nested structures, ex: 3DSATE_SO_BUFFER, STATE_BASE_ADDRESS, etc contain MOC structures. Previously, the aubinator printed 1 if there was a MOC structure. Now we decode the entire structure and print out its fields. 4) Print out the DWord address along with its hex value - For a better clarity of information, it is helpful to print both the address and hex value of the DWord along with the DWord count. Since the DWord0 contains the instruction code and the instruction length, it is unnecessary to print the decoded values for DWord0. This information is already available from the DWord hex value. 5) Decode the <group> and the corresponding fields in the group- The <group> tag can have fields of several types including structures. A group can contain one or more number of fields and this has be correctly decoded. Previously, aubinator did not decode the groups or the fields/structures inside them. Now we decode the <group> in the instructions and structures where the fields in it repeat for any number of times specified. v2: Fix the formatting (per Matt) Make the start and end pos calculation to extract fields from a DWord more appropriate by moving %32 away from mask() method Signed-off-by: Sirisha Gandikota <Sirisha.Gandikota@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Ben Widawsky <ben@bwidawsk.net>	2016-08-23 21:19:55 -07:00
Kristian Høgsberg Kristensen	3e218ad7f8	aubinator: Add a new tool called Aubinator to the src/intel/tools folder. The Aubinator tool is designed to help the driver developers in debugging the driver functionality by decoding the data in the .aub files. Primary Authors of this tool are Damien Lespiau <damien.lespiau at intel.com> and Kristian Høgsberg Kristensen <krh at bitplanet.net>. v2: Review comments are incorporated by Sirisha Gandikota as below: 1) Make Makefile.am more crisp, reuse intel_aub.h from libdrm (per Emil) 2) Aubinator will use platform name instead of GEN number (per Matt) 3) Disassmebler gets created based on pciid rather then GEN number (per Matt) 4) Other formatting comments (per Ken, Matt and Emil) Signed-off-by: Sirisha Gandikota <Sirisha.Gandikota@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Ben Widawsky <ben@bwidawsk.net>	2016-08-23 21:19:33 -07:00
Jason Ekstrand	70bc891c42	isl/formats: Integer formats are not filterable In `ca2a8e5628`, we updated the format table to add more formats (most of which are new on SKL) but accidentally marked some integer formats as filterable. You can't filter an integer format. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2016-08-23 16:51:34 -07:00
Jason Ekstrand	7bdccd104b	anv/clear: Clear E5B9G9R9 images as R32_UINT We can't actually clear these images normally because we can't render to them. Instead, we have to manually unpack the rgb9e5 color value on the CPU and clear it as R32_UINT. We still have a bit of work to do to clear non-power-of-two images, but this should get all of the power-of-two clears working on at least Haswell. This fixes three of the new Vulkan CTS tests in the dEQP-VK.api.image_clearing.clear_color_image.* group. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-08-23 11:45:25 -07:00
Jason Ekstrand	afa7ca0f77	anv/clear: Make cmd_clear_image take an actual VkClearValue Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-08-23 11:45:24 -07:00
Jason Ekstrand	cf3cf2ecfc	anv/blit2d: Add support for RGB destinations This fixes 104 of the new image_clearing and copy_and_blit Vulkan CTS tests. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-08-23 11:45:24 -07:00
Jason Ekstrand	16ddda8452	anv/blit2d: Add a format parameter to bind_dst and create_iview Signed-off-by: Jasosn Ekstrand <jason@jlekstrand.net> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-08-23 11:45:24 -07:00
Jason Ekstrand	954c0bfb20	anv/image: Don't create invalid render target surfaces Signed-off-by: Jason Ekstrand <jason@jlekstrand.net>	2016-08-23 11:45:24 -07:00
Jason Ekstrand	ca2a8e5628	isl/formats: Update the table with more samplable formats There were a lot of formats where support was added on Haswell or later but we never updated the format table. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2016-08-23 11:45:24 -07:00
Jason Ekstrand	aba9e25b70	isl/formats: Report ETC as being samplable on Bay Trail Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2016-08-23 11:45:24 -07:00
Jason Ekstrand	fb90291dd5	anv/formats: Don't use an RGBX format if it isn't renderable The whole point of using RGBX is so that we can render to it so if it isn't renderable, that kind-of defeats the purpose. Some formats (one example is R32G32B32X32_SFLOAT) exist in the format table but aren't actually renderable. Eventually, we'd like to get away from RGBX entirely, but this fixes hangs on BDW today. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net>	2016-08-23 11:45:24 -07:00
Jason Ekstrand	34ff4fbba6	anv: Throw INCOMPATIBLE_DRIVER for non-fatal initialization errors The only reason we should throw INITIALIZATION_FAILED is if we have found useable intel hardware but have failed to bring it up for some reason. Otherwise, we should just throw INCOMPATIBLE_DRIVER which will turn into successfully advertising 0 physical devices Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-08-22 18:49:49 -07:00
Lionel Landwerlin	475ce61d1a	anv: GetDeviceImageFormatProperties: fix TRANSFER formats We let the user believe we support some transfer formats which we don't. This can lead to crashes when actually trying to use those formats for example on dEQP-VK.api.copy_and_blit.image_to_image.* tests. Let all formats we can render to or sample from as meta implements transfers using attachments. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-08-22 10:41:30 -07:00
Jason Ekstrand	a2ae67aa47	anv: Give the installed intel_icd.json file an absolute path Not providing a path allows the ICD to work on multi-arch systems but breaks it if you install anywhere other than /usr/lib. Given that users may be installing locally in .local or similar, we probably do want to provide a filename. Distros can carry a revert of this commit if they want an intel_icd.json file without the path. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com> Reviewed-by: Chad Versace <chad@kiwitree.net>	2016-08-20 00:50:03 -07:00
Jason Ekstrand	04f3594cd5	genxml/gen9: Make 3DSTATE_SBE::AttributeActiveComponentFormat an array Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-08-19 03:11:29 -07:00
Jason Ekstrand	bfdff28d68	genxml: Add a uint MOCS field to VERTEX_BUFFER_STATE Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-08-19 03:11:29 -07:00
Jason Ekstrand	373613fa4b	genxml: Make a couple of VERTEX_BUFFER_STATE fields boolean Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-08-19 03:11:29 -07:00
Jason Ekstrand	29f1f945a6	genxml: Make VERTEX_ELEMENT_STATE::Valid a bool Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-08-19 03:11:29 -07:00
Jason Ekstrand	eb2589cba6	genxml/gen6: Make SAMPLER_STATE look a bit more like gen7 Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-08-19 03:11:29 -07:00
Jason Ekstrand	2a84e40dae	genxml: Add a uint MOCS field to DEPTH_BUFFER packets This is easier than dealing with structs all the time Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-08-19 03:11:29 -07:00
Jason Ekstrand	3f1022b029	genxml/gen6: Make "Depth Clear Value" a uint The actual data storred is in float, UNORM24, or UNORM16 depending on the actual depth format. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-08-19 03:11:29 -07:00
Jason Ekstrand	be62e7645e	genxml/gen6: Add the 3D_Prim_Topo_Type enum Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-08-19 03:11:29 -07:00
Jason Ekstrand	cca95a7bd6	genxml/gen6: Fix the length of 3DSTATE_WM Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-08-19 03:11:29 -07:00
Jason Ekstrand	3ddb6f6e2a	genxml/gen6: Add a Surface Base Address field to HIER_DEPTH_BUFFER Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-08-19 03:11:29 -07:00
Jason Ekstrand	be52e16dbc	genxml/gen6: Add uint MOCS fields for most things Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-08-19 03:11:29 -07:00
Emil Velikov	d61d259518	isl: automake: use VISIBILITY_CFLAGS to restrict symbol visibility v2: Add VISIBILITY_CFLAGS to AM_CFLAGS (Ken) Cc: "12.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (v1) Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2016-08-18 15:06:19 +01:00

1 2 3 4 5 ...

848 Commits