mirrors/mesa - Frog Git

Commit Graph

Author	SHA1	Message	Date
Samuel Pitoiset	1fd60db4a1	ac,radv,radeonsi: remove LLVM 7 support Now that LLVM 9 will be released soon, we will only support LLVM 8, 9 and master (10). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-23 08:12:34 +02:00
Tapani Pälli	3e03a3fc53	egl: reset blob cache set/get functions on terminate Fixes errors seen with eglSetBlobCacheFuncsANDROID on Android when running dEQP that terminates and reinitializes a display. Fixes: `6f5b57093b` "egl: add support for EGL_ANDROID_blob_cache" Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-08-23 08:14:08 +03:00
Kenneth Graunke	2d79925034	iris: Avoid unnecessary resolves on transfer maps We were always resolving the buffer as if we were accessing it via CPU maps, which don't understand any auxiliary surfaces. But we often copy to a temporary using BLORP, which understands compression just fine. So we can avoid the resolve, and accelerate the copy as well. Fixes: `9d1334d2a0` ("iris: Use copy_region and staging resources to avoid transfer stalls") Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2019-08-22 18:31:17 -07:00
Kenneth Graunke	136629a1e3	iris: Drop copy format hacks from copy region based transfer path. This doesn't work for compressed formats, as the source texture and temporary texture would have different block sizes. (Forcing the driver to always take the GPU path would expose the bug.) Instead, just use the source format for the temporary, and let blorp_copy deal with overrides. The one case where we can't do this is ASTC, because isl won't let us create a linear ASTC surface. Fall back to the CPU paths there for now. Fixes: `9d1334d2a0` ("iris: Use copy_region and staging resources to avoid transfer stalls") Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2019-08-22 18:31:17 -07:00
Kenneth Graunke	1cd13ccee7	iris: Update fast clear colors on Gen9 with direct immediate writes. Gen11 stores the fast clear color in an "indirect clear buffer", as a packed pixel value. Gen9 hardware stores it as a float or integer value, which is interpreted via the format. We were trying to store that in a buffer, for similarity with Icelake, and MI_COPY_MEM_MEM it from there to the actual SURFACE_STATE bytes where it's stored. This unfortunately doesn't work for blorp_copy(), which does bit-for-bit copies, and overrides the format to a CCS-compatible UINT format. This causes the clear color to be interpreted in the overridden format. Normally, we provide the clear color on the CPU, and blorp_blit.c:2611 converts it to a packed pixel value in the original format, then unpacks it in the overridden format, so the clear color we use expands to the bits we originally desired. However, BLORP doesn't support this pack/unpack with an indirect clear buffer, as it would need to do the math on the GPU. On Gen11+, it isn't necessary, as the hardware does the right thing. This patch changes Gen9 to stop using an indirect clear buffer and simply do PIPE_CONTROLs with post-sync write immediate operations to store the new color over the surface states for regular drawing. BLORP continues streaming out surface states, and handles fast clear colors on the CPU. Fixes: `53c484ba8a` ("iris: blorp using resolve hooks") Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2019-08-22 18:31:14 -07:00
Kenneth Graunke	117a0368b0	iris: Fix broken aux.possible/sampler_usages bitmask handling For renderable surfaces, we allocate SURFACE_STATEs for each bit in res->aux.possible_usages. Sampler views use res->aux.sampler_usages. When pinning buffers, we call surf_state_offset_for_aux() to calculate the offset to the desired surface state. surf_state_offset_for_aux() took an aux_modes parameter, which should be one of those two fields. However...it was not using that parameter. It always used the broader res->aux.possible_usages field directly. One of the callers, update_clear_value(), was passing incorrect masks for this parameter. It iterated through the bits in order, using u_bit_scan(), which destructively modifies the mask. So each time we called it, the count of bits before our selected mode was 0, which would cause us to always update the SURFACE_STATE for ISL_AUX_USAGE_NONE, rather than updating each in turn. This was hidden by the earlier bug where surf_state_offset_for_aux() ignored the parameter. Fixes: `7339660e80` ("iris: Add aux.sampler_usages.") Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2019-08-22 18:31:14 -07:00
Kenneth Graunke	f6c44549ee	iris: Replace devinfo->gen with GEN_GEN This is genxml, we can compile out this code. Fixes: `2660667284` ("iris/gen8: Re-emit the SURFACE_STATE if the clear color changed.") Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2019-08-22 18:31:14 -07:00
Alyssa Rosenzweig	272ce6f5a7	pan/midgard: Fix writeout combining shader-db regression in the scheduler. Fixes: `dff4986b1a` ("pan/midgard: Emit store_output branch just-in-time") total bundles in shared programs: 2055 -> 2019 (-1.75%) bundles in affected programs: 1055 -> 1019 (-3.41%) helped: 36 HURT: 0 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 0.35% max: 20.00% x̄: 6.71% x̃: 5.16% 95% mean confidence interval for bundles value: -1.00 -1.00 95% mean confidence interval for bundles %-change: -8.45% -4.97% Bundles are helped. total quadwords in shared programs: 3444 -> 3408 (-1.05%) quadwords in affected programs: 1897 -> 1861 (-1.90%) helped: 36 HURT: 0 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 0.19% max: 14.29% x̄: 3.97% x̃: 2.99% 95% mean confidence interval for quadwords value: -1.00 -1.00 95% mean confidence interval for quadwords %-change: -5.08% -2.86% Quadwords are helped. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 14:03:23 -07:00
Alyssa Rosenzweig	2c5ba2ee6e	panfrost: Implement gl_FragCoord correctly Rather than passing through the transformed gl_Position, we can use the hardware-level varying for this, which will correctly handle gl_FragCoord.w Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 13:31:39 -07:00
Alyssa Rosenzweig	eeebf5c2df	panfrost: Remove vertex buffer offset from its size The offset is added to the base address, so we need to subtract it from the size to maintain the same end address and thus prevent a buffer overflow: end_address = start_address + size start_address' = start_address + offset size' = size - offset end_address' = start_address' + size' = (start_address + offset) + (size - offset) = (start_address + size) + (offset - offset) = start_address + size = end_address QED. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 13:31:39 -07:00
Alyssa Rosenzweig	f4678f3c62	pan/decode: Handle special varyings We need a special path for special varyings so we parse them correctly instead of throwing an error when they inevitably point to bad memory. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 13:31:39 -07:00
Alyssa Rosenzweig	caec0b3232	pan/decode: Remove size/stride divisibility check The hardware doesn't care, and a lot of Panfrost code relies on an oversized buffer. The important part is that (stride * padded_num_vertices) is no greater than size, which we'll need to check once we validate instancing. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 13:31:39 -07:00
Alyssa Rosenzweig	ed464e05c8	pan/decode: Decouple attribute/meta printing They are independent fields, so the parser should reflect that. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 13:31:39 -07:00
Alyssa Rosenzweig	ae84f16786	pan/decode: Print stub for uniforms We don't need to dump the contents necessary, but having the stub with the address is useful. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 13:31:06 -07:00
Alyssa Rosenzweig	26ed431ea9	pan/decode: Decode actual varying_meta address I don't know who thought this mask was a good idea but unfortunately it must have been me. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:56:49 -07:00
Alyssa Rosenzweig	f48136e9c5	pan/decode: Downgrade shader property mismatch to warning If we permit more $whatever through than the shader needs, that's a bit of a waste, but it isn't an error. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:56:35 -07:00
Alyssa Rosenzweig	f38ce6ea8c	pan/decode: Validate, but do not print, index buffer We don't actually care about the contents of the index buffer, but we would rather like to ensure it is present and of the correct size. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:56:04 -07:00
Alyssa Rosenzweig	cbbf75424a	pan/decode: Validate mali_shader_meta stats We can infer these stats in many cases from the disassembly, so we should try to sanity check where we can. We may need to be fuzzy about analysis, since analysis gives us a bound but we don't mind if it's not used fully by the shader. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:55:49 -07:00
Alyssa Rosenzweig	9b067d96f7	pan/decode: Disassemble before printing shader descriptor This allows the shader descriptor to access the disassembled stats. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:55:27 -07:00
Alyssa Rosenzweig	5f9a1c74ae	pan/decode: Promote <no shader> to an error There is no reason this should happen to an in-spec program, as far as I know. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:55:00 -07:00
Alyssa Rosenzweig	d7473e2e01	pan/decode: Fix uniform printing Lazypasting from UBOs. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:54:35 -07:00
Alyssa Rosenzweig	139708bbab	pan/decode: Validate blend shaders don't access I/O We could do better by forcing the checks to equal zero (right now, an indeterminate answer will pass the checks), but this is a start to guard against some egregious cases. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:54:16 -07:00
Alyssa Rosenzweig	ded9a68d8f	pan/decode: Validate and simplify FRAGMENT payloads There are a number of conditions we need to test for to statically check for TILE_RANGE_FAULTs, but once these checks are in order, we can print as-is. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:53:44 -07:00
Alyssa Rosenzweig	f06e8f7fe9	pan/decode: Validate MFBD tags These tags need to match up with what's actually described by the MFBD, so check this. Once this is checked, since the type and contents of the FBD are obvious from printing above, there's no need to explicitly mark off the framebuffer line. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:53:10 -07:00
Alyssa Rosenzweig	0c313419a0	pan/decode: Eliminate non-FBD dumped case We don't need more cases to deal with. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:52:52 -07:00
Alyssa Rosenzweig	6ec33b4f34	pan/decode: Removing uniform buffer framing We can do single line prints: ubuf_0[192] = memory_161f5000 + 896; Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:52:37 -07:00
Alyssa Rosenzweig	a68fe4baec	pan/decode: Remove mali_attr(_meta) framing It doesn't give any real added value. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:52:18 -07:00
Alyssa Rosenzweig	f162adc32b	pan/midgard: Disassemble integer constants in hex It's usually easier to parse mentally. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:51:55 -07:00
Alyssa Rosenzweig	b89cb0dba6	pan/midgard: Explain ffma Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:51:39 -07:00
Alyssa Rosenzweig	19d58a299b	pan/midgard: Analyze simple loads/store For shaders using exclusively direct attribute/varyings, we can work this out statically. For shaders with indirect access, we just set an upper bound of 16 (the max attributes/varyings we support) and the actual count will be reported regardless. We proceed similarly for textures/samplers, as well as for UBOs. While UBOs can be indexed indirectly, the UBO itself -- which is what we count in the shader descriptor (rather than the UBO descriptors) -- is statically determinable. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:51:21 -07:00
Alyssa Rosenzweig	a89e368c7f	pan/midgard: Compute work_count via writes This is exact. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:50:57 -07:00
Alyssa Rosenzweig	b9fb63859e	pan/midgard: Sketch static analysis to uniform count This one is a little tricky, but the idea is that: r16-r23 are always uniforms r8-r15 are sometimes work, sometimes uniforms... ...but as work, they are always written before use ...and as uniforms, they are never written before use So we use that heuristic to determine the count to feed the machine. We'll record work register use in the next commit. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:50:40 -07:00
Alyssa Rosenzweig	58fc260312	pan/decode: Hoist shader-db stats to shared decode We'll want all this information to validate the shader descriptor. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-22 12:50:14 -07:00
Alyssa Rosenzweig	a8f86fcb51	nir: Remove nir_const_load_to_arr There are no remaining users in-tree. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-08-22 12:24:13 -07:00
Alyssa Rosenzweig	3c01a6928a	pan/midgard,bifrost: Expand nir_const_load_to_arr Panfrost is the only user of the macro; we are better off expanding than having random stuff in nir.h. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com>	2019-08-22 12:24:13 -07:00
Adam Jackson	a7a9d958bc	glx: Make __glXGetDrawableAttribute return true sometimes Right now it always returns zero, but as of: commit `a48a6b8a40` Author: Adam Jackson <ajax@redhat.com> Date: Tue Nov 14 15:13:05 2017 -0500 glx: Prepare driFetchDrawable for no-config contexts We were hoping it would return true if the drawable could actually be looked up. It wasn't, so that didn't go very well. With the most recent update to <GL/glxext.h> glXQueryGLXPbufferSGIX (correctly) returns void, so there's no longer anything else besides driFetchDrawable that depends on the return value from __glXGetDrawableAttribute. Acked-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-08-22 13:29:06 -04:00
Adam Jackson	3dd299c3d5	glx: Sync <GL/glxext.h> with Khronos Minor fixups required to keep the prototypes matching and to remove mention of retired enums. Acked-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-08-22 13:29:04 -04:00
Adam Jackson	5ebd333c6c	glx: Whitespace cleanups Acked-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-08-22 13:28:39 -04:00
Eric Engestrom	6db1dfe347	swr: use LLVM version string instead of re-computing it Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-08-22 16:08:09 +01:00
Eric Engestrom	7f5ef97a07	llvmpipe: use LLVM version string instead of re-computing it Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-08-22 16:08:09 +01:00
Eric Engestrom	3ea83f4c9b	scons: define MESA_LLVM_VERSION_STRING like the other build systems do Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-08-22 16:08:09 +01:00
Bas Nieuwenhuizen	c037fe5ad1	radv: Disable NGG for geometry shaders. A bunch of remaining issues including some that affect users. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=111248 Fixes: `ee21bd7440` "radv/gfx10: implement NGG support (VS only)" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-08-22 12:47:32 +02:00
Lionel Landwerlin	5833f43305	util/timespec: use unsigned 64 bit integers for nsec values We added this utility for vulkan where all timeouts are given as uint64_t values. We can switch from signed to unsigned as this is the only user and if we ever deal with signed integers somewhere else we'll have to be careful to use the corresponding timespec_(add\|sub)_msec and always pass absolute values. v2: Forgot to drop the test calling add_nsec() with a negative number Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reported-by: Juan A. Suarez Romero <jasuarez@igalia.com> Fixes: `d2d70c3bb5` ("util: add a timespec helper") Acked-by: Daniel Stone <daniels@collabora.com>	2019-08-22 09:35:57 +02:00
Tapani Pälli	728ebcdec2	iris/android: fix build and link with libmesa_intel_perf Fixes: `0fd4359733` "iris/perf: implement routines to return counter info" Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-22 10:01:14 +03:00
Samuel Pitoiset	2d9f401a83	ac: fix exclusive scans on GFX8-GFX9 This fixes a regression introduced with scan&reduce operations on GFX10. Note that some subgroups CTS still fail on GFX10 but I assume it's a different issue. This fixes dEQP-VK.subgroups.arithmetic..subgroupexclusive. Fixes: `227c29a80d` "amd/common/gfx10: implement scan & reduce operations" Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-08-22 08:43:15 +02:00
Tapani Pälli	ce8fd042a5	util: fix os_create_anonymous_file on android Commit fixes current crashes with Vulkan applications on Android. Fixes: `c0376a1234` "util: add anon_file.h for all memfd/temp file usage" Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch>	2019-08-22 08:27:43 +03:00
Lionel Landwerlin	ac5bda374a	i965: honor scanout requirement from DRI Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2019-08-21 23:52:07 +00:00
Kenneth Graunke	bc844d92ce	gallium/noop: Implement resource_get_param v2: Pass through to oscreen rather than faking it (review from Marek). Fixes: `0346b70083` ("gallium/screen: Add pipe_screen::resource_get_param") Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-21 22:18:22 +00:00
Kenneth Graunke	f02d1a0b75	gallium/rbug: Wrap resource_get_param if available Fixes: `0346b70083` ("gallium/screen: Add pipe_screen::resource_get_param") Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-21 22:18:22 +00:00
Kenneth Graunke	c43a44791b	gallium/trace: Wrap resource_get_param if available Fixes: `0346b70083` ("gallium/screen: Add pipe_screen::resource_get_param") Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-21 22:18:22 +00:00

1 2 3 4 5 ...

114740 Commits All Branches Search

114740 Commits

All Branches