KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Alyssa Rosenzweig	b660953733	panfrost: Use polygon list header size computation Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-17 07:59:14 -07:00
Alyssa Rosenzweig	edfba9bee2	panfrost: Calculate polygon list header size As per the notes at the beginning of pan_tiler.c, we implement a routine to calculate the size of the polygon list header given the framebuffer dimensions and the provided hierarchy mask. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-17 07:59:14 -07:00
Alyssa Rosenzweig	e88ff9ad85	panfrost: Add pan_tiler.h header Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-17 07:47:49 -07:00
Alyssa Rosenzweig	21eb411d2f	panfrost: Document tile size heuristic I'm not sure how the blob does it, but this seems to be a dead simple test and roughly corresponds to what I've noticed from the blob, so maybe it's good enough. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-17 07:47:49 -07:00
Alyssa Rosenzweig	7f26bb3553	panfrost: Rename tiler fields per tiler research Following the research into Midgard's hierarchical tiling infrastructure, we now understand (in broad strokes) the purpose of each tiler field in the MFBD. Additionally, we understand more of the tiling fields in the SFBD and in Bifrost's structures, although this knowledge is still incomplete. Update the names, decoder, and comments to reflect this new understanding. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-17 07:47:49 -07:00
Alyssa Rosenzweig	8d6fb66e3a	panfrost: Add notes about the tiler allocations This explains how the polygon list is allocated, updating the headers appropriately to sync the terminology. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-17 07:47:49 -07:00
Alyssa Rosenzweig	85e745f2b4	panfrost: Integrate kernel names for tiler FBD These names are from the replay workaround in kbase; they begin to shine some light on the meaning of these fields. In particular, we now understand why the "tiler_meta" field has the effect it does on performance in certain scenes (controlling tile granularity). Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-17 07:47:49 -07:00
Bas Nieuwenhuizen	1a7caac9e9	radv: Add asserts that buffer descriptors are created with valid buffer formats. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-06-17 10:56:50 +00:00
Bas Nieuwenhuizen	4107590911	radv: Decompress DCC when the image format is not allowed for buffers. Otherwise the buffer loads/stores in the bufimage meta operations fail. If we decompress DCC then we can use the "canonical" format compatible with the not-supported format. CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-06-17 10:56:50 +00:00
Samuel Pitoiset	e9875fc0b6	radv: make sure to init the DCC decompress compute path state This fixes a segfault when forcing DCC decompressions on compute because internal meta objects are not created since the on-demand stuff. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-06-17 11:30:49 +02:00
Samuel Pitoiset	4c7ef1b02e	ac: make ac_compute_cmask() a static function Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-By: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-06-17 11:30:47 +02:00
Samuel Pitoiset	cf77d3abf1	radv: rely on ac_compute_cmask() for CMASK info Instead of re-computing in the driver. The 3d and cube flags are correctly set, so the same values should be returned by ac_compute_surface(). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-06-17 11:30:44 +02:00
Samuel Pitoiset	6880b42cfc	radv: silent a compiler warning in radv_CmdPushDescriptorSetKHR() Trivial. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-06-17 09:53:26 +02:00
Tomeu Vizoso	e655d63644	panfrost: ci: Speed things up a bit by skipping a git clone Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com>	2019-06-17 09:17:53 +02:00
Tomeu Vizoso	f1efb0f254	panfrost: ci: Exclude all blend tests from results As they randomly fail on T760. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com>	2019-06-17 09:17:53 +02:00
Samuel Pitoiset	b5012a0518	ac: update llvm.amdgcn.icmp intrinsic name for LLVM 9+ LLVM r363339 changed llvm.amdgcn.icmp.i* to llvm.amdgcn.icmp.i64.i*. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-By: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-06-17 08:58:33 +02:00
Erico Nunes	d72bbb2c89	lima: lower fmod in ppir and gpir Since commit `4f3c82c72c` fmod is no longer being lowered in nir, and ends up crashing lima programs with "unsupported nir_op: fmod" in both ppir and gpir. There seems to be no mod operation in hardware in utgard and there is an optimization in nir to lower fmod to instructions that lima already implements, so let's use that. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com>	2019-06-16 10:11:59 +00:00
Rob Clark	a417c323ad	freedreno/a6xx: re-enable UBWC for depth/stencil Now that we can blit depth/stencil in a way that plays nicely with UBWC, re-enable it. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com>	2019-06-15 07:33:04 -07:00
Rob Clark	363a9ed614	freedreno/a6xx: handle z24s8/z24x8 blits with u_blitter Now that it can turn these blits into rendering to RB6_Z24_UNORM_S8_UINT it can properly handle cases where only one of depth+stencil is being blit. And this avoids lying about the format, which completely doesn't work when UBWC is used. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com>	2019-06-15 07:33:04 -07:00
Rob Clark	a96ae18de6	freedreno/a6xx: handle fallback for rewritten blits ourself For re-written z/s blits, we want to use the re-written `pipe_blit_info` even if we have to fallback to 3d pipe (`u_blitter`). So handle that fallback ourself. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com>	2019-06-15 07:33:04 -07:00
Rob Clark	94c36a8554	freedreno/a6xx: rename variable The name 'separate' doesn't make a whole lot of sense, as in only one of the cases is the blit actually split. But split out from previous patch in an attempt to reduce the noise. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com>	2019-06-15 07:33:04 -07:00
Rob Clark	5fe7b627eb	freedreno/a6xx: consolidate z/s blit handling This will get even simpler with the next patch Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com>	2019-06-15 07:33:04 -07:00
Rob Clark	4c75d62ce8	gallium: add z24s8_as_r8g8b8a8 format This maps to a special format that recent generations of adreno have, for blitting z24s8. Conceptually it is similar to doing Z and/or S blits by pretending it is r8g8b8a8 (with appropriate writemask). But it differs when bandwidth compression is used, as z24 is a different type from r8g8b8. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com>	2019-06-15 07:33:04 -07:00
Kenneth Graunke	1d75f52589	st/mesa: Respect GL_TEXTURE_SRGB_DECODE_EXT in GenerateMipmaps() Apparently, we're supposed to look at the texture object's built-in sampler object's sRGB decode setting in order to decide whether to decode/downsample/re-encode, or simply downsample as-is. Previously, we had just respected the pipe_resource's format. Fixes SKQP's Skia_Unit_Tests.SRGBMipMaps test. (This ports commit `337a808062` from i965 to st/mesa for Gallium drivers.) Reviewed-by: Eric Anholt <eric@anholt.net>	2019-06-14 20:13:46 +00:00
Erico Nunes	3ddea5e8c5	lima: fix dynarray usage in lima_submit_add_bo Commit `de8a919702` refactored dynarray usage and changed the size of the allocation in lima_submit_add_bo. That causes a segfault in programs running with lima. This commit restores the allocation size back to the previous size. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-06-14 20:47:35 +02:00
Alyssa Rosenzweig	9ab8d31f32	panfrost: Fix variant selection Fixes 1acffb ("panfrost: Unify...") Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-14 10:35:07 -07:00
Marek Olšák	abe9a51d27	ac: add radeon_info::is_amdgpu instead of checking drm_major == 3 and clean up Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-06-14 13:31:18 -04:00
Mauro Rossi	bbbbea243a	android: amd/common: fix missing include path Fixes the following building error in Android: In file included from external/mesa/src/amd/common/ac_llvm_helper.cpp:34: In file included from external/mesa/src/amd/common/ac_llvm_build.h:30: In file included from external/mesa/src/compiler/nir/nir.h:40: In file included from external/mesa/src/compiler/nir_types.h:36: external/mesa/src/compiler/glsl_types.h:37:10: fatal error: 'main/config.h' file not found ^~~~~~~~~~~~~~~ 1 error generated. Fixes: `bd4c661` ("ac,ac/nir: use a better sync scope for shared atomics") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2019-06-14 18:36:10 +02:00
Mauro Rossi	51e24af8fd	android: radv: fix necessary dependecies Fixes building errors due to libmesa_util and libexpat dependencies: In file included from external/mesa/src/amd/vulkan/radv_device.c:52: external/mesa/src/util/xmlpool.h:115:10: fatal error: 'xmlpool/options.h' file not found ^~~~~~~~~~~~~~~~~~~ 1 error generated. FAILED: out/target/product/x86_64/obj_x86/SHARED_LIBRARIES/vulkan.radv_intermediates/LINKED/vulkan.radv.so ... external/mesa/src/util/xmlconfig.c:670: error: undefined reference to 'XML_ParserCreate' ... clang.real: error: linker command failed with exit code 1 (use -v to see invocation) Fixes: `3c2e826` ("radv: Add support for driconf.") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2019-06-14 18:35:10 +02:00
Alejandro Piñeiro	d317944c24	docs: document three NIR_ envvars Initially I was only interested on documenting NIR_PRINT, as today I needed to check the code to find this envvar, that at the moment I vaguely remembered that existed. As we are here, though, let's just document all of them (assuming that makes sense). Reviewed-by: Eric Anholt <eric@anholt.net>	2019-06-14 16:18:43 +02:00
Alexandros Frantzis	83829abe03	virgl: Return immediately when finding a compatible resource in the cache When searching for resources in the cache, we previously released all expired resources even after having found a compatible resource. This commit changes this behavior to return immediately when finding a compatible resource, so that the operation finishes more quickly. This moves more of the burden of releasing expired resources to cache addition, which, since it happens at resource destruction time, is less time critical. Signed-off-by: Alexandros Frantzis <alexandros.frantzis@collabora.com> Reviewed-by: Chia-I Wu <olvaffe@gmail.com>	2019-06-14 12:59:51 +03:00
Alexandros Frantzis	801753d4b3	virgl: Use virgl_resource_cache in the vtest winsys Replace the cache implementation in the vtest winsys with virgl_resource_cache. Signed-off-by: Alexandros Frantzis <alexandros.frantzis@collabora.com> Reviewed-by: Chia-I Wu <olvaffe@gmail.com>	2019-06-14 12:59:49 +03:00
Alexandros Frantzis	13f70d3668	virgl: Use virgl_resource_cache in the drm winsys Replace the cache implementation in the drm winsys with virgl_resource_cache. Signed-off-by: Alexandros Frantzis <alexandros.frantzis@collabora.com> Reviewed-by: Chia-I Wu <olvaffe@gmail.com>	2019-06-14 12:59:43 +03:00
Alexandros Frantzis	b18f09a509	virgl: Introduce virgl_resource_cache Introduce a resource cache implementation that can be used by any virgl winsys backend. Signed-off-by: Alexandros Frantzis <alexandros.frantzis@collabora.com> Reviewed-by: Chia-I Wu <olvaffe@gmail.com>	2019-06-14 12:58:51 +03:00
Haihao Xiang	8ead5bebdb	i965: support UYVY for external import only It is similar with YUYV Fixes: `165e704719` ("i965/i915: Add UYVY as the supported format") Signed-off-by: Haihao Xiang <haihao.xiang@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-06-14 15:45:56 +08:00
Neil Roberts	34d4b3e367	glsl: Set default precision on record members Record types have their own slot to store the precision for each member in glsl_struct_field. Previously if the member didn’t have an explicit precision qualifier this was being left as GLSL_PRECISION_NONE. This patch makes it take into account the type’s default precision qualifier like it does for regular variables in apply_type_qualifier_to_variable. This has the additional benefit of correctly reporting an error when a float type is used in a struct without declaring the default type. Reviewed-by: Eric Anholt <eric@anholt.net>	2019-06-14 09:29:53 +02:00
Neil Roberts	235425771c	glsl/linker: Make precision matching optional in intrastage_match This function is confusingly also used to match interstage interfaces as well as intrastage. In the interstage case it needs to avoid comparing the precisions. This patch adds a parameter to specify whether to take the precision into account or not so that it can be used for both cases. Reviewed-by: Eric Anholt <eric@anholt.net>	2019-06-14 09:29:53 +02:00
Neil Roberts	19b27a8569	glsl/linker: Don’t check precision for shader interface On GLES, the interface between vertex and fragment shaders doesn’t need to have matching precision. Section 4.3.10 of the GLSL ES 3.00 spec: “The type of vertex outputs and fragment inputs with the same name must match, otherwise the link command will fail. The precision does not need to match.” Reviewed-by: Eric Anholt <eric@anholt.net>	2019-06-14 09:29:53 +02:00
Neil Roberts	230d1e8d86	compiler/types: Making comparing record precision optional On GLES, the interface between vertex and fragment shaders doesn’t need to have matching precision. This adds an extra argument to glsl_types::record_compare to disable the precision comparison. This will later be used for the shader interface check. In order to make this work this patch also adds a helper function to recursively compare types while ignoring the precision. v2: Call record_compare from within compare_no_precision to avoid duplicating code (Eric Anholt). Reviewed-by: Eric Anholt <eric@anholt.net>	2019-06-14 09:29:53 +02:00
Lucas Stach	ab74699190	etnaviv: fix some pm query issues The offsets to read the query results were off-by-one, which causes the counters to report bogus increasing values. Also the counter result is u32, so we need to initialize the query type to reflect that. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com>	2019-06-14 09:06:28 +02:00
Iago Toral Quiroga	360b832c58	v3d: do not setup execute flags for else block in uniform control flow Either all channels executed the 'then' block, in which case all channels will directly jump to the 'endif' block at the end of the 'then' block, or all channels execute the 'else' block (so no execution masking is necessary). Shader-db results: total instructions in shared programs: 9119238 -> 9117550 (-0.02%) instructions in affected programs: 401252 -> 399564 (-0.42%) helped: 855 HURT: 77 total uniforms in shared programs: 3022622 -> 3022605 (<.01%) uniforms in affected programs: 3566 -> 3549 (-0.48%) helped: 17 HURT: 0 total max-temps in shared programs: 1327762 -> 1327774 (<.01%) max-temps in affected programs: 619 -> 631 (1.94%) helped: 2 HURT: 15 Reviewed-by: Eric Anholt <eric@anholt.net>	2019-06-14 08:00:52 +02:00
Iago Toral Quiroga	2a2501247b	nir: detect more dynamically uniform expressions Shader-db results for v3d: total instructions in shared programs: 9132728 -> 9119238 (-0.15%) instructions in affected programs: 596886 -> 583396 (-2.26%) helped: 1118 HURT: 224 total threads in shared programs: 234298 -> 234308 (<.01%) threads in affected programs: 10 -> 20 (100.00%) helped: 5 HURT: 0 total uniforms in shared programs: 3022949 -> 3022622 (-0.01%) uniforms in affected programs: 29163 -> 28836 (-1.12%) helped: 108 HURT: 37 total max-temps in shared programs: 1328030 -> 1327762 (-0.02%) max-temps in affected programs: 10097 -> 9829 (-2.65%) helped: 263 HURT: 15 total spills in shared programs: 3793 -> 3777 (-0.42%) spills in affected programs: 432 -> 416 (-3.70%) helped: 16 HURT: 0 total fills in shared programs: 4380 -> 4266 (-2.60%) fills in affected programs: 828 -> 714 (-13.77%) helped: 16 HURT: 0 Reviewed-by: Eric Anholt <eric@anholt.net>	2019-06-14 08:00:52 +02:00
Tapani Pälli	287b58f827	ir3: initialize progress false before ir3_nir_lower_imul Removes a compiler warning about uninitialized variable. Fixes: `c02ffd2700` "ir3: Use the new NIR lowering pass for integer multiplication" Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Rob Clark <robclark@gmail.com> Reviewed-by: Eduardo Lima <elima@igalia.com>	2019-06-14 08:21:42 +03:00
Boris Brezillon	749c544b84	panfrost: Fix general purpose varying handling When both the fragment and vertex shaders point to the same varying location they expect to share the same varying slot. Make sure vertex and fragment varyings pointing to the same loc have ->src_offset set to the same value. [Alyssa: In addition to a patch implementing txs, this fixes GALLIUM_HUD on Panfrost] Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-13 10:54:18 -07:00
Marek Olšák	7566a9a58a	ac/registers: use better names for disambiguated definitions Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-06-13 13:52:06 -04:00
Marek Olšák	08ab9b70ce	ac/registers: remove deprecated/inapplicable definitions Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-06-13 13:52:06 -04:00
Caio Marcelo de Oliveira Filho	5bd48ff252	iris: Enable INTEL_shader_atomic_float_minmax Supported only for gen >= 9. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2019-06-13 09:03:58 -07:00
Caio Marcelo de Oliveira Filho	81835f87a4	gallium: Add PIPE_CAP_ATOMIC_FLOAT_MINMAX Used to enable INTEL_shader_atomic_float_minmax. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2019-06-13 09:03:58 -07:00
Rob Clark	9f10e40cde	freedreno/a6xx: fix MAX_INDICES Signed-off-by: Rob Clark <robdclark@chromium.org>	2019-06-13 08:56:27 -07:00
Rob Clark	ce12ac8c2b	freedreno/blitter: remove dead code The src/dst format is overridden from the pipe_blit_info, so this logic just serves to confuse the reader. Signed-off-by: Rob Clark <robdclark@chromium.org>	2019-06-13 08:56:27 -07:00

111997 Commits