KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Caio Marcelo de Oliveira Filho	63f0259aeb	iris: Guard GEN9-only function in Iris state to avoid warning Acked-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-08-23 13:25:27 -07:00
Caio Marcelo de Oliveira Filho	412ed1338f	intel/decoders: Avoid uninitialized variable warnings Initialize `next_batch_addr` and `second_level`. If the batch is well formed, those values will be overriden, if not, they are as good as uninitialized garbage. Acked-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-08-23 13:25:27 -07:00
Caio Marcelo de Oliveira Filho	0661480029	compiler/glsl: Fix warning about unused function The helper check_node_type() is only used when DEBUG is set (in the function below), but ASSERTED macro uses NDEBUG. So just guard the helper with #ifdef. If we see more such cases we might consider a ASSERTED-like macro for the DEBUG case. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-08-23 13:25:27 -07:00
Caio Marcelo de Oliveira Filho	eac8a3b9af	anv: Drop unused local variable Leftover from `021fa28163` ("xintel/nir: Add a helper for getting BRW_AOP from an intrinsic"). Acked-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-08-23 13:25:27 -07:00
Caio Marcelo de Oliveira Filho	f7d90c67c7	intel/compiler: Silence maybe-uninitialized warning in GCC 9.1.1 Compiler can't see that d is initialized. ../src/intel/compiler/brw_vec4_nir.cpp: In function ‘int brw::try_immediate_source(const nir_alu_instr, brw::src_reg, bool, const gen_device_info*)’: ../src/intel/compiler/brw_vec4_nir.cpp:984:12: warning: ‘d’ may be used uninitialized in this function [-Wmaybe-uninitialized] 984 \| d = MAX2(-d, d); Assert that we expect at least one component -- hence d going to be set. That by itself is not enough, so also zero initialize the variable. Acked-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-08-23 13:25:27 -07:00
Andres Rodriguez	a410823b3e	radv: additional query fixes Make sure we read the updated data from the gpu in cases where WAIT_BIT is not set. Cc: 19.1 19.2 <mesa-stable@lists.freedesktop.org Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-08-17 05:53:51 -04:00
Kenneth Graunke	7ee7b0ecbc	iris: Fix large timeout handling in rel2abs() ...by copying the implementation of anv_get_absolute_timeout(). Appears to fix a CTS test with 32-bit builds: GTF-GL46.gtf32.GL3Tests.sync.sync_functionality_clientwaitsync_flush Fixes: `f459c56be6` ("iris: Add fence support using drm_syncobj") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch>	2019-08-23 10:32:01 -07:00
Kenneth Graunke	9310ae6f68	iris: Set MOCS in all STATE_BASE_ADDRESS commands Rafael Antognolli tracked down a performance gap between i965 and iris in Synmark2's OglCSDof microbenchmark, noting that iris was performing substantially more memory reads and writes, with substantially fewer L3 hits. He suggested that something might be wrong with MOCS, or L3 configs, at which point I came up with a theory... It would appear that the STATE_BASE_ADDRESS command updates the MOCS settings for various base addresses even if you don't specify the "Modify Enable" bit for that address. Until now, we had been setting only the MOCS for bases we intended to change, leaving the others "blank" which is MOCS table entry 0, which is uncached. Most data access has a more specific MOCS (e.g. in SURFACE_STATE), but scratch access uses the Stateless Data Port Access MOCS from STATE_BASE_ADDRESS. So this meant all scratch access was uncached. Improves performance in Synmark2's OglCSDof by 2x, bringing iris on par with the existing i965 driver. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-08-23 10:21:48 -07:00
Vinson Lee	b05166e3d2	glx: Fix up glXQueryGLXPbufferSGIX on macOS. Fix this build error on macOS. ../src/glx/apple/glx_empty.c:158:4: error: void function 'glXQueryGLXPbufferSGIX' should not return a value [-Wreturn-type] return 0; ^ ~ Fixes: `3dd299c3d5` ("glx: Sync <GL/glxext.h> with Khronos") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Adam Jackson <ajax@redhat.com>	2019-08-23 11:05:23 -04:00
Juan A. Suarez Romero	6f137ed901	docs: update calendar, add news item and link release notes for 19.1.5 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2019-08-23 12:40:40 +02:00
Juan A. Suarez Romero	23f1741996	docs: add sha256 checksums for 19.1.5 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit ae2a676cd1748c850f579863003c92f2b137f44a)	2019-08-23 12:38:28 +02:00
Juan A. Suarez Romero	152dd6ed19	docs: add release notes for 19.1.5 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> (cherry picked from commit a384fe0cebf1fcd6671c51c749fcc981e01b5505)	2019-08-23 12:38:27 +02:00
Connor Abbott	f59076f8a7	radeonsi/nir: Rewrite output scanning Similarly to before, this didn't properly handle varying structs with doubles in them. This doesn't fix any tests, but was noticed while looking at the code. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-23 11:05:31 +02:00
Connor Abbott	9395277972	radeonsi/nir: Rewrite store intrinsic gathering The old version wasn't as accurate as it could be, and didn't handle double variables inside structs correctly. Walk the path to compute the actual components affected. In combination with the previous commit fixes KHR-GL45.enhanced_layouts.varying_structure_locations. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-23 11:05:31 +02:00
Connor Abbott	87cca891c3	radeonsi/nir: Add const_index when loading GS inputs This fixes loading GS inputs in structures or arrays. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-23 11:05:31 +02:00
Connor Abbott	82589d3ffd	radeonsi/nir: Don't add const offset to indirect This is already done in get_deref_offset() in the common code. We were adding it twice accidentally. Fixes KHR-GL45.enhanced_layouts.varying_array_locations. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-23 11:05:31 +02:00
Connor Abbott	400db1852b	ac/nir: Assert GS input index is constant If it's not we silently ignore indir_index which is definitely a bug. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-23 11:05:31 +02:00
Connor Abbott	bb42c896fe	ac/nir: Handle const array offsets in get_deref_offset() Some users of this function (e.g. GS inputs) currently only work with constant offsets. We got lucky since all the tests used an array index of 0, so the non-constant part was always 0. But we still need to handle this. This doesn't fix any CTS test, but was noticed while debugging one. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-23 11:05:31 +02:00
Connor Abbott	97d592c855	radeonsi/nir: Don't recompute num_inputs and num_outputs Don't repeat what mesa/st already does. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-23 11:05:31 +02:00
Connor Abbott	3eb4aeed60	st/nir: Fix num_inputs for VS inputs Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-23 11:05:31 +02:00
Samuel Pitoiset	a4e6e59db8	radv/gfx10: do not use NGG with NAVI14 Cc: 19.2 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-08-23 09:54:08 +02:00
Samuel Pitoiset	0813c27d8d	radv/gfx10: don't initialize VGT_INSTANCE_STEP_RATE_0 Only gfx9 and older use it to get InstanceID in VGPR1. Ported from RadeonSI. Cc: 19.2 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-08-23 09:54:06 +02:00
Samuel Pitoiset	7d1c091143	gitlab-ci: bump LLVM to 8 for meson-vulkan and meson-clover To fix pipeline builds. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-08-23 08:13:31 +02:00
Samuel Pitoiset	1fd60db4a1	ac,radv,radeonsi: remove LLVM 7 support Now that LLVM 9 will be released soon, we will only support LLVM 8, 9 and master (10). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-23 08:12:34 +02:00
Tapani Pälli	3e03a3fc53	egl: reset blob cache set/get functions on terminate Fixes errors seen with eglSetBlobCacheFuncsANDROID on Android when running dEQP that terminates and reinitializes a display. Fixes: `6f5b57093b` "egl: add support for EGL_ANDROID_blob_cache" Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-08-23 08:14:08 +03:00
Kenneth Graunke	2d79925034	iris: Avoid unnecessary resolves on transfer maps We were always resolving the buffer as if we were accessing it via CPU maps, which don't understand any auxiliary surfaces. But we often copy to a temporary using BLORP, which understands compression just fine. So we can avoid the resolve, and accelerate the copy as well. Fixes: `9d1334d2a0` ("iris: Use copy_region and staging resources to avoid transfer stalls") Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2019-08-22 18:31:17 -07:00
Kenneth Graunke	136629a1e3	iris: Drop copy format hacks from copy region based transfer path. This doesn't work for compressed formats, as the source texture and temporary texture would have different block sizes. (Forcing the driver to always take the GPU path would expose the bug.) Instead, just use the source format for the temporary, and let blorp_copy deal with overrides. The one case where we can't do this is ASTC, because isl won't let us create a linear ASTC surface. Fall back to the CPU paths there for now. Fixes: `9d1334d2a0` ("iris: Use copy_region and staging resources to avoid transfer stalls") Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2019-08-22 18:31:17 -07:00
Kenneth Graunke	1cd13ccee7	iris: Update fast clear colors on Gen9 with direct immediate writes. Gen11 stores the fast clear color in an "indirect clear buffer", as a packed pixel value. Gen9 hardware stores it as a float or integer value, which is interpreted via the format. We were trying to store that in a buffer, for similarity with Icelake, and MI_COPY_MEM_MEM it from there to the actual SURFACE_STATE bytes where it's stored. This unfortunately doesn't work for blorp_copy(), which does bit-for-bit copies, and overrides the format to a CCS-compatible UINT format. This causes the clear color to be interpreted in the overridden format. Normally, we provide the clear color on the CPU, and blorp_blit.c:2611 converts it to a packed pixel value in the original format, then unpacks it in the overridden format, so the clear color we use expands to the bits we originally desired. However, BLORP doesn't support this pack/unpack with an indirect clear buffer, as it would need to do the math on the GPU. On Gen11+, it isn't necessary, as the hardware does the right thing. This patch changes Gen9 to stop using an indirect clear buffer and simply do PIPE_CONTROLs with post-sync write immediate operations to store the new color over the surface states for regular drawing. BLORP continues streaming out surface states, and handles fast clear colors on the CPU. Fixes: `53c484ba8a` ("iris: blorp using resolve hooks") Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2019-08-22 18:31:14 -07:00
Kenneth Graunke	117a0368b0	iris: Fix broken aux.possible/sampler_usages bitmask handling For renderable surfaces, we allocate SURFACE_STATEs for each bit in res->aux.possible_usages. Sampler views use res->aux.sampler_usages. When pinning buffers, we call surf_state_offset_for_aux() to calculate the offset to the desired surface state. surf_state_offset_for_aux() took an aux_modes parameter, which should be one of those two fields. However...it was not using that parameter. It always used the broader res->aux.possible_usages field directly. One of the callers, update_clear_value(), was passing incorrect masks for this parameter. It iterated through the bits in order, using u_bit_scan(), which destructively modifies the mask. So each time we called it, the count of bits before our selected mode was 0, which would cause us to always update the SURFACE_STATE for ISL_AUX_USAGE_NONE, rather than updating each in turn. This was hidden by the earlier bug where surf_state_offset_for_aux() ignored the parameter. Fixes: `7339660e80` ("iris: Add aux.sampler_usages.") Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2019-08-22 18:31:14 -07:00
Kenneth Graunke	f6c44549ee	iris: Replace devinfo->gen with GEN_GEN This is genxml, we can compile out this code. Fixes: `2660667284` ("iris/gen8: Re-emit the SURFACE_STATE if the clear color changed.") Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2019-08-22 18:31:14 -07:00
Alyssa Rosenzweig	272ce6f5a7	pan/midgard: Fix writeout combining shader-db regression in the scheduler. Fixes: `dff4986b1a` ("pan/midgard: Emit store_output branch just-in-time") total bundles in shared programs: 2055 -> 2019 (-1.75%) bundles in affected programs: 1055 -> 1019 (-3.41%) helped: 36 HURT: 0 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 0.35% max: 20.00% x̄: 6.71% x̃: 5.16% 95% mean confidence interval for bundles value: -1.00 -1.00 95% mean confidence interval for bundles %-change: -8.45% -4.97% Bundles are helped. total quadwords in shared programs: 3444 -> 3408 (-1.05%) quadwords in affected programs: 1897 -> 1861 (-1.90%) helped: 36 HURT: 0 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 0.19% max: 14.29% x̄: 3.97% x̃: 2.99% 95% mean confidence interval for quadwords value: -1.00 -1.00 95% mean confidence interval for quadwords %-change: -5.08% -2.86% Quadwords are helped. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 14:03:23 -07:00
Alyssa Rosenzweig	2c5ba2ee6e	panfrost: Implement gl_FragCoord correctly Rather than passing through the transformed gl_Position, we can use the hardware-level varying for this, which will correctly handle gl_FragCoord.w Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 13:31:39 -07:00
Alyssa Rosenzweig	eeebf5c2df	panfrost: Remove vertex buffer offset from its size The offset is added to the base address, so we need to subtract it from the size to maintain the same end address and thus prevent a buffer overflow: end_address = start_address + size start_address' = start_address + offset size' = size - offset end_address' = start_address' + size' = (start_address + offset) + (size - offset) = (start_address + size) + (offset - offset) = start_address + size = end_address QED. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 13:31:39 -07:00
Alyssa Rosenzweig	f4678f3c62	pan/decode: Handle special varyings We need a special path for special varyings so we parse them correctly instead of throwing an error when they inevitably point to bad memory. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 13:31:39 -07:00
Alyssa Rosenzweig	caec0b3232	pan/decode: Remove size/stride divisibility check The hardware doesn't care, and a lot of Panfrost code relies on an oversized buffer. The important part is that (stride * padded_num_vertices) is no greater than size, which we'll need to check once we validate instancing. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 13:31:39 -07:00
Alyssa Rosenzweig	ed464e05c8	pan/decode: Decouple attribute/meta printing They are independent fields, so the parser should reflect that. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 13:31:39 -07:00
Alyssa Rosenzweig	ae84f16786	pan/decode: Print stub for uniforms We don't need to dump the contents necessary, but having the stub with the address is useful. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 13:31:06 -07:00
Alyssa Rosenzweig	26ed431ea9	pan/decode: Decode actual varying_meta address I don't know who thought this mask was a good idea but unfortunately it must have been me. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:56:49 -07:00
Alyssa Rosenzweig	f48136e9c5	pan/decode: Downgrade shader property mismatch to warning If we permit more $whatever through than the shader needs, that's a bit of a waste, but it isn't an error. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:56:35 -07:00
Alyssa Rosenzweig	f38ce6ea8c	pan/decode: Validate, but do not print, index buffer We don't actually care about the contents of the index buffer, but we would rather like to ensure it is present and of the correct size. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:56:04 -07:00
Alyssa Rosenzweig	cbbf75424a	pan/decode: Validate mali_shader_meta stats We can infer these stats in many cases from the disassembly, so we should try to sanity check where we can. We may need to be fuzzy about analysis, since analysis gives us a bound but we don't mind if it's not used fully by the shader. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:55:49 -07:00
Alyssa Rosenzweig	9b067d96f7	pan/decode: Disassemble before printing shader descriptor This allows the shader descriptor to access the disassembled stats. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:55:27 -07:00
Alyssa Rosenzweig	5f9a1c74ae	pan/decode: Promote <no shader> to an error There is no reason this should happen to an in-spec program, as far as I know. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:55:00 -07:00
Alyssa Rosenzweig	d7473e2e01	pan/decode: Fix uniform printing Lazypasting from UBOs. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:54:35 -07:00
Alyssa Rosenzweig	139708bbab	pan/decode: Validate blend shaders don't access I/O We could do better by forcing the checks to equal zero (right now, an indeterminate answer will pass the checks), but this is a start to guard against some egregious cases. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:54:16 -07:00
Alyssa Rosenzweig	ded9a68d8f	pan/decode: Validate and simplify FRAGMENT payloads There are a number of conditions we need to test for to statically check for TILE_RANGE_FAULTs, but once these checks are in order, we can print as-is. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:53:44 -07:00
Alyssa Rosenzweig	f06e8f7fe9	pan/decode: Validate MFBD tags These tags need to match up with what's actually described by the MFBD, so check this. Once this is checked, since the type and contents of the FBD are obvious from printing above, there's no need to explicitly mark off the framebuffer line. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:53:10 -07:00
Alyssa Rosenzweig	0c313419a0	pan/decode: Eliminate non-FBD dumped case We don't need more cases to deal with. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:52:52 -07:00
Alyssa Rosenzweig	6ec33b4f34	pan/decode: Removing uniform buffer framing We can do single line prints: ubuf_0[192] = memory_161f5000 + 896; Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:52:37 -07:00
Alyssa Rosenzweig	a68fe4baec	pan/decode: Remove mali_attr(_meta) framing It doesn't give any real added value. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:52:18 -07:00

1 2 3 4 5 ...

114763 Commits All Branches Search

114763 Commits

All Branches