KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Marek Olšák	77d2a98353	Revert "radeonsi: use uint32_t to declare si_shader_key.opt.kill_outputs" This reverts commit `7b2240ac9c`. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-06-27 18:45:07 +02:00
Marek Olšák	dbe45e1180	Revert "radeonsi: remove 8 bytes from si_shader_key with uint32_t ff_tcs_inputs_to_copy" This reverts commit `6b6fed3a3c`. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-06-27 18:45:07 +02:00
Marek Olšák	984f7feeb4	mesa: optimize GL_PRIMITIVE_RESTART_NV more And other client state changes don't have to call update_derived_primitive_restart_state. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-06-27 18:45:07 +02:00
Marek Olšák	bcf5d5ce40	mesa: fix clip plane enable breakage Broken by: commit `00173d91b7` Author: Marek Olšák <marek.olsak@amd.com> Date: Sat Jun 10 12:09:43 2017 +0200 mesa: don't flag _NEW_TRANSFORM for st/mesa if possible It also optimizes the case slightly for GL core. It doesn't try to fix that glEnable might be a bad place to do the clip plane transformation. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2017-06-27 18:45:07 +02:00
Leo Liu	fad0b47219	radeon/vcn: enable h264 decode entension support It's enabled through message buffer for UVD Signed-off-by: Leo Liu <leo.liu@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2017-06-27 10:59:44 -04:00
Charmaine Lee	b2e78e79d7	svga: clean up format_cap_table Per Jose's suggestion, this patch cleans up format_cap_table to remove the unnecessary default cap value for vgpu10 formats since those devcap values can be retrieved from the device. Tested with MTT conform, glretrace, piglit in HWv13 and HWv8. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2017-06-27 07:49:03 -06:00
Charmaine Lee	122ca27a48	svga: fix the default devcap for SVGA3D_Z_D24S8_INT The default devcap for format SVGA3D_Z_D24S8_INT in HWv8 when its devcap is not explicitly advertised should be set to zero to match the default value in the device. Tested with MTT piglit in HW version 8. Reviewed-by: Neha Bhende <bhenden@vmware.com>	2017-06-27 07:49:02 -06:00
Charmaine Lee	eea6223184	svga: create buffer surfaces for incompatible bind flags In cases where certain bind flags cannot be enabled together, such as CONSTANT_BUFFER cannot be combined with any other flags, a separate host surface will be created. For example, if a stream output buffer is reused as a constant buffer, two host surfaces will be created, one for stream output, and another one for constant buffer. Data will be copied from the stream output surface to the constant buffer surface. Fixes piglit test ext_transform_feedback-immediate-reuse-index-buffer, ext_transform_feedback-immediate-reuse-uniform-buffer Tested with MTT piglit, MTT glretrace, Nature, NobelClinician Viewer, Tropics. v2: Fix bind flags compatibility check as suggested by Brian. v3: Use the list utility to maintain the buffer surface list. v4: Use the SAFE rev of LIST_FOR_EACH_ENTRY Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2017-06-27 07:49:02 -06:00
Charmaine Lee	7abfb0b0d5	svga: do not unconditionally enable streamout bind flag Currently we unconditionally enable streamout bind flag at buffer resource creation time. This is not necessary if the buffer is never used as a streamout buffer. With this patch, we enable streamout bind flag as indicated by the state tracker. If the buffer is later bound to streamout and does not already have streamout bind flag enabled, we will recreate the buffer with the new set of bind flags. Buffer content will be copied from the old buffer to the new one. Tested with MTT piglit, Nature, Tropics, Lightsmark. v2: Fix bind flags check as suggested by Brian. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2017-06-27 07:49:02 -06:00
Charmaine Lee	b549f5e6b1	svga: pass tobind_flags to svga_buffer_handle This is to prepare for more bind_flags optimization in subsequent patches. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2017-06-27 07:49:02 -06:00
Charmaine Lee	4a79b508a4	svga: pass bind_flags to surface create functions This is to prepare for other bind_flags optimization in subsequent patches. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2017-06-27 07:49:02 -06:00
Brian Paul	ce608784d0	pipe_loader_sw: fix compilation warning Add the new 'flags' parameter to pipe_loader_sw_create_screen(). Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-06-27 07:49:02 -06:00
Eric Engestrom	b3eda74acf	mesa: add missing include src/mesa/drivers/x11/xm_dd.c:688:7: warning: implicit declaration of function ‘_mesa_update_draw_buffer_bounds’; did you mean ‘_mesa_has_ARB_draw_buffers_blend’? [-Wimplicit-function-declaration] _mesa_update_draw_buffer_bounds(ctx, ctx->DrawBuffer); ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Cc: Marek Olšák <marek.olsak@amd.com> Fixes: `585c5cf8a5` ("mesa: don't update draw buffer bounds in _mesa_update_state") Signed-off-by: Eric Engestrom <eric.engestrom@imgtec.com>	2017-06-27 14:33:49 +01:00
Lionel Landwerlin	3e0d54d270	i965: perf: add support for Geminilake Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2017-06-27 14:10:30 +03:00
Lionel Landwerlin	9a50fc7cfc	i965: perf: add support for Kabylake Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2017-06-27 14:10:30 +03:00
Lionel Landwerlin	8ff086fa68	i965: perf: use gen_device_info rather then brw_context Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-06-27 14:10:30 +03:00
Robert Bragg	e277ff41c0	i965: perf: ensure isolated timer reports while idle don't confuse filtering From experimentation in IGT, we found that the OA unit might label some report as "idle" (using an invalid context ID), right after a report for a given context. Deltas generated by those reports actually belong to the previous context, even though they're not labelled as such. This change ensures that, while reading OA reports, we only consider the GPU actually idle after 2 reports with an invalid context ID. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-06-27 14:10:29 +03:00
Lionel Landwerlin	31b11f69f7	i965: perf: keep on reading reports until delimiting timestamp Due to an underlying hardware race condition, we have no guarantee that all the reports coming from the OA buffer related to the workload we're trying to measure have landed in memory by the time all the work submitted has completed. That means we need to keep on reading the OA stream until we read a report with a timestamp more recent than the timestamp recorded by the MI_REPORT_PERF_COUNT at the end of the performance query. v2: fix uninitialized offset variable to 0 (Lionel) v3: rework the reading to avoid blocking the user of the API unless requested (Rob) v4: fix a bug that makes the i965 driver reading the perf stream when not necessary, leading to very long counter accumulation times (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-06-27 14:10:29 +03:00
Robert Bragg	1fc7b95127	i965: Add Gen8+ INTEL_performance_query support Enables access to OA unit metrics on Gen8+ via INTEL_performance_query. v2: make use of new parameters coming from gen_device_info (Lionel) Signed-off-by: Robert Bragg <robert@sixbynine.org> Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-06-27 14:10:29 +03:00
Robert Bragg	243909d41e	i965: Add XML OA metric sets for Gen8+ Also updates Makefile.am to generate corresponding normalization code. Signed-off-by: Robert Bragg <robert@sixbynine.org> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-06-27 14:10:29 +03:00
Robert Bragg	e74972a3a6	i965: Add Gen8+ sys_vars for generated OA code In preparation for adding XML OA metric set descriptions for Gen 8 and 9 which will result in auto generated code that depends on a number of new system variables ($EuSubslicesTotalCount, $EuThreadsCount and $SliceMask) this adds corresponding members to brw->perf.sys_vars. Signed-off-by: Robert Bragg <robert@sixbynine.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-06-27 14:10:29 +03:00
Lionel Landwerlin	7dd20bc3ee	anv/i965: drop libdrm_intel dependency completely With Ken's work to drop the library dependency on libdrm_intel, we now only depend on libdrm for the kernel uapi headers it provides. It seems like we're better off just embedding those headers ourselves, making the lives of people developing new features tightly integrated with the kernel a tiny bit easier. This change also makes it a bit more obvious what cflags/libs are required by the i915 drivers vs i965, by renaming INTEL_CFLAGS/LIBS into I915_CFLAGS/LIBS. Headers were generated from drm-tip on the following commit : commit 6d61e70ccc21606ffb8a0a03bd3aba24f659502b Merge: 338ffbf7cb5e c0bc126f97fb Author: Dave Airlie <airlied@redhat.com> Date: Tue Jun 27 07:24:49 2017 +1000 Backmerge tag 'v4.12-rc7' into drm-next v2: Use installed files from the kernel (Daniel Vetter) v3: Use headers from drm-next rather than drm-tip (Dave/Daniel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-06-27 14:10:29 +03:00
Lionel Landwerlin	3c50ebce25	i915: use different CFLAGS/LIBS variables than i965/anv Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-06-27 14:10:29 +03:00
Lionel Landwerlin	230691b8e5	aubinator: import intel_aub.h from libdrm This enables us to compile aubinator without the libdrm dependency. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-06-27 14:10:28 +03:00
Lionel Landwerlin	adafe4b733	i965: perf: minimize the chances to spread queries across batchbuffers Counter related to timings will be sensitive to any delay introduced by the software. In particular if our begin & end of performance queries end up in different batches, time related counters will exhibit biffer values caused by the time it takes for the kernel driver to load new requests into the hardware. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2017-06-27 14:10:25 +03:00
Juan A. Suarez Romero	7ee409dd4e	nir: implement GLSL.std.450 NMax, NMIn and NClamp operations v2: NIR fmax/fmin already handles NaN (Connor). Reviewed by: Elie Tournier <elie.tournier@collabora.com>	2017-06-27 12:01:11 +02:00
Juan A. Suarez Romero	b5ae17fe59	nir: add support for 64-bit in SmoothStep function According to GLSL.std.450 spec, SmoothStep expects input to be a floating-point type, but it does not restrict the bitsize. Current implementation relies on inputs to be 32-bit. This commit extends the support to 64-bit size inputs. Reviewed by: Elie Tournier <elie.tournier@collabora.com>	2017-06-27 12:01:11 +02:00
Juan A. Suarez Romero	4195a9450b	nir: sge operation is defined for floating-point types According to GLSL.std.450 spec, the operand for step() function must be a floating-point. It does not restrict the value to 32-bit floats. Reviewed by: Elie Tournier <elie.tournier@collabora.com>	2017-06-27 12:01:11 +02:00
Topi Pohjolainen	b3bf453686	i965: Separate gen < 8 and gen >= 8 paths explicitly in wrap_mode() Makes coverity happier. Fix indentation in gen >= 8 block while at it. CID: 1413020 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-06-27 10:20:35 +03:00
Topi Pohjolainen	fbcc9555c5	intel/anv: Add missing break in anv_CreateDevice() CID: 1413018 Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-06-27 10:19:55 +03:00
Nicolai Hähnle	2ce126df3a	ac/nir: convert emit helpers to ac_llvm_context Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:30 +10:00
Nicolai Hähnle	58d496c8e2	ac/nir: remove unused nir_to_llvm_context::has_ddxy Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:30 +10:00
Nicolai Hähnle	6ecef25545	ac/nir: implement nir_op_f2b Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:30 +10:00
Nicolai Hähnle	dacf73e527	ac/nir: implement nir_op_{b2i,i2b} Booleans in NIR are ~0 for true, b2i returns 0/1. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:30 +10:00
Nicolai Hähnle	77d7764d5e	ac/nir: convert type helpers to ac_llvm_context Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:30 +10:00
Nicolai Hähnle	b7bd49158e	ac/llvm: fix type of second llvm.cttz.* parameter LLVM has required an i1 here for a long time. llvm.ctlz.* was fixed in commit `edd23e0606` ("ac/llvm: fix various findMSB bugs"). Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:30 +10:00
Nicolai Hähnle	e8ba03d32a	ac/shader_info: fix a comment Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:29 +10:00
Nicolai Hähnle	edfd3be77e	ac: add ac_llvm_context::v8i32 Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:29 +10:00
Nicolai Hähnle	331a574732	ac: add ac_llvm_context::{i,f}32_{0,1} Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:29 +10:00
Nicolai Hähnle	7bf8c944dc	ac: add ac_llvm_context::{i16, i64, f16, f64} Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:29 +10:00
Ilia Mirkin	4a79f2be33	nv50/ir: fix combineLd/St to update existing records as necessary Previously the logic would decide that the record is kept, which translates into keep = false in the caller, which meant that these passes did not run. While it's right that keep = false which means that a new record does not need to be added, we do still have to perform the usual list maintenance. It's easiest to do this pre-merge rather than post. The lowering that clip/cull distance passes produce triggers this bug in TCS (since reading outputs is done differently in other stages), but it should be possible to achieve it with the right sequence of regular reads/writes. Fixes: KHR-GL45.cull_distance.functional Fixes: generated_tests/spec/arb_tessellation_shader/execution/tes-input/tes-input-gl_ClipDistance.shader_test Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Cc: mesa-stable@lists.freedesktop.org	2017-06-26 20:24:19 -04:00
Ilia Mirkin	7d56ae5eb2	nv50/ir: adjust overlapping logic to take fileIndex-relative offsets If the fileIndex is different, that means they are in logically different spaces. However if there's also a relative offset, then they could end up pointing at the same spot again. Also add a note about potential for multiple buffers to overlap even if they're at different file indexes. However that's potentially lowered away by the point that this logic hits. Not known to fix any specific application or test. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-06-26 20:24:19 -04:00
Ilia Mirkin	55a8c11705	nv50/ir: VFETCH is also considered a load for MemoryOpt This has no effect since in practice this will only play for memory-backed files, for which VFETCH will never happen. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-06-26 20:24:19 -04:00
Ilia Mirkin	c12f8305a8	nv50,nvc0: remove IDX from bufctx immediately, to avoid conflicts with clear The idxbuf could linger, and when a clear happened, which also uses the 3d bufctx, we could get an error trying to access it. This fixes spurious crashes/errors in CTS tests. Fixes: `61d8f3387d` ("nv50,nvc0: clear index buffer bufctx bin unconditionally") Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Cc: mesa-stable@lists.freedesktop.org	2017-06-26 20:23:04 -04:00
Ilia Mirkin	8c02ee4a8b	nv50/ir: fetch indirect sources BEFORE the op that uses them All the BuildUtil helpers just insert the operation into the current BB. So we have to take care that any fetchSrc() operations happen before the operation whose setIndirect() it goes into. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Cc: mesa-stable@lists.freedesktop.org	2017-06-26 20:22:46 -04:00
Timothy Arceri	9545139ce5	mesa: skip FLUSH_VERTICES() if no samplers were changed Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-06-27 09:20:29 +10:00
Timothy Arceri	191ff86d53	mesa: don't set _NEW_PROGRAM_CONSTANTS for non-bindless opaque uniforms v2: rebase on new _mesa_flush_vertices_for_uniforms() helper Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-06-27 09:17:16 +10:00
Rob Herring	c4291a3283	Android: add renderonly files to libmesa_gallium vc4 now depends on renderonly functions, but these weren't added to the Android build resulting in the following errors: src/gallium/drivers/vc4/vc4_resource.c:380: error: undefined reference to 'renderonly_scanout_destroy' src/gallium/drivers/vc4/vc4_resource.c:681: error: undefined reference to 'renderonly_create_gpu_import_for_resource' src/gallium/drivers/vc4/vc4_screen.c:625: error: undefined reference to 'renderonly_dup' src/gallium/winsys/pl111/drm/pl111_drm_winsys.c:37: error: undefined reference to 'renderonly_create_gpu_import_for_resource' src/gallium/winsys/pl111/drm/pl111_drm_winsys.c:37: error: undefined reference to 'renderonly_create_gpu_import_for_resource' Fixes: `7029ec05e2` ("gallium: Add renderonly-based support for pl111+vc4.") Signed-off-by: Rob Herring <robh@kernel.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2017-06-26 16:10:42 -07:00
Timothy Arceri	a00a277da9	mesa: add KHR_no_error support for glCopyTexImage*D() Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-06-27 08:27:11 +10:00
Timothy Arceri	8bf02efed3	mesa: add no error support to copyteximage() Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-06-27 08:27:11 +10:00

93533 Commits, All Branches