KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Marek Olšák	59c5da40ed	radeonsi: preload PS inputs only if KILL is used so that most shaders can get lower VGPR usage thanks to lazy input loading. I think this is a more accurate constraint that prevents the black transitions in Witcher 2. Affected shaders (7758): Max Waves: 57437 -> 58231 (1.38 %) Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-23 23:43:38 +01:00
Marek Olšák	7b32ae4df5	gallium/radeon: adjust the rule for using the LINEAR_ALIGNED layout Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-23 23:43:38 +01:00
Marek Olšák	e248390e93	winsys/amdgpu: drop all IBs if at least one was rejected within the context The corruption is inevitable and hangs are possible too. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-23 23:43:38 +01:00
Marek Olšák	1840800860	winsys/amdgpu: report a rejected IB as a lost context Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-23 23:43:38 +01:00
Dave Airlie	dcfcb3047c	vulkan: import latest registry for 1.0.39 extensions. Acked-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-01-24 08:13:37 +10:00
Dave Airlie	e38bee34bf	vulkan: bump vulkan.h to 1.0.39 version This introduces a bunch of new extension defines. Acked-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-01-24 08:13:23 +10:00
Grazvydas Ignotas	f65b3641c3	radv: don't resubmit the same cs over and over while tracing Fixes: `97dfff54` ("radv: Dump command buffer on hang.") Signed-off-by: Grazvydas Ignotas <notasas@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> CC: <mesa-stable@lists.freedesktop.org>	2017-01-23 22:27:05 +01:00
Samuel Pitoiset	aa2ace8e49	gallium/radeon: add HUD queries for monitoring some hw blocks It's also possible to monitor them via performance counters but the hardware can only use two counters simultaneously. It seems easier to re-use the existing code which reads from MMIO instead of writing a multi-pass approach. v2: - add new lines after ':' Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-01-23 21:19:49 +01:00
Samuel Pitoiset	a704f19247	gallium/radeon: refactor the GRBM counters path This will allow to expose more queries in order to know which blocks are busy/idle. v2: - add new lines after ':' Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-01-23 21:19:49 +01:00
George Kyriazis	00847e4f14	swr: Align query results allocation Some query results struct contents are declared as cache line aligned. Use aligned malloc, and align the whole struct, to be safe. Fixes crash when compiling with clang. CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-01-23 14:15:54 -06:00
Bruce Cherniak	b829206b07	swr: Prune empty nodes in CalculateProcessorTopology. CalculateProcessorTopology tries to figure out system topology by parsing /proc/cpuinfo to determine the number of threads, cores, and NUMA nodes. There are some architectures where the "physical id" begins with 1 rather than 0, which was creating and empty "0" node and causing a crash in CreateThreadPool. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=97102 Reviewed-By: George Kyriazis <george.kyriazis@intel.com> CC: <mesa-stable@lists.freedesktop.org>	2017-01-23 13:52:26 -06:00
Matt Turner	d349449a16	i965: Use UNUSED to silence unused variable (used in assert).	2017-01-23 10:50:20 -08:00
Rainer Hochecker	09b140abb5	dri: allow 16bit R/GR images to be exported via drm buffers This allows eglCreateImageKHR to access P010 surfaces created by vaapi Signed-off-by: Rainer Hochecker <fernetmenta@online.de> Acked-by: Ben Widawky <ben@bwidawsk.net>	2017-01-23 08:47:15 -08:00
Christian König	1338d912f5	st/va: make sure that we call begin_frame() only once v2 This fixes "st/va: delay calling begin_frame until we have all parameters". v2: call begin frame after decoder (re)creation as well. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Nayan Deshmukh <nayan26deshmukh@gmail.com> Tested-by: Andy Furniss <adf.lists@gmail.com>	2017-01-23 17:00:04 +01:00
Eric Engestrom	50141e131a	drirc: remove spurious tabs Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-23 16:34:58 +01:00
Nicolai Hähnle	cfabbbcfd7	st/glsl_to_tgsi: use DDIV instead of DRCP + DMUL Fixes GL45-CTS.gpu_shader_fp64.built_in_functions. v2: use DDIV unconditionally (Roland) Reviewed-by: Roland Scheidegger <sroland@vmware.com> (v1) Reviewed-by: Marek Olšák <marek.olsak@amd.com> (v1) Tested-by: Glenn Kennard <glenn.kennard@gmail.com> Tested-by: James Harvey <lothmordor@gmail.com> Cc: 17.0 <mesa-stable@lists.freedesktop.org>	2017-01-23 16:17:26 +01:00
Nicolai Hähnle	b71c415c3d	glsl: split DIV_TO_MUL_RCP into single- and double-precision flags Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Tested-by: Glenn Kennard <glenn.kennard@gmail.com> Tested-by: James Harvey <lothmordor@gmail.com> Cc: 17.0 <mesa-stable@lists.freedesktop.org>	2017-01-23 16:17:19 +01:00
Nicolai Hähnle	e4f8f9a638	r600: implement DDIV Tested-by: Glenn Kennard <glenn.kennard@gmail.com> Tested-by: James Harvey <lothmordor@gmail.com> Cc: 17.0 <mesa-stable@lists.freedesktop.org>	2017-01-23 16:17:15 +01:00
Nicolai Hähnle	488560cfe6	r600: factor out cayman_emit_unary_double_raw We will use it for DDIV. Tested-by: Glenn Kennard <glenn.kennard@gmail.com> Tested-by: James Harvey <lothmordor@gmail.com> Cc: 17.0 <mesa-stable@lists.freedesktop.org>	2017-01-23 16:17:12 +01:00
Nicolai Hähnle	76b02d2fe1	r600: double multiply can handle only one multiply at a time It seems clear that trying to multiply two pairs of doubles would result in the temporary register getting overwritten by the second pair. So make the code more explicit. Tested-by: Glenn Kennard <glenn.kennard@gmail.com> Tested-by: James Harvey <lothmordor@gmail.com> Cc: 17.0 <mesa-stable@lists.freedesktop.org>	2017-01-23 16:15:45 +01:00
Timothy Arceri	f3f9207786	glsl: fix tes linking regression Fixes regression caused by `cbeba6bd48`. I accidentally pushed the wrong version of the patch.	2017-01-23 19:07:22 +11:00
Timothy Arceri	38a67f020d	mesa: remove unused gl_shader_info field from gl_linked_shader Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-23 14:48:04 +11:00
Timothy Arceri	79f07e87c9	mesa/glsl: set and get cs layouts to and from shader_info Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-23 14:48:04 +11:00
Timothy Arceri	b96bddae67	mesa/glsl: set and get gs layouts directly to and from shader_info Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-23 14:48:04 +11:00
Timothy Arceri	cbeba6bd48	mesa/glsl/i965: set and get tes layouts directly to and from shader_info Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-23 14:48:04 +11:00
Timothy Arceri	64e201ab8f	glsl: use last_vert_prog to get last {clip,cull}_distance_array_size Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-23 14:48:04 +11:00
Timothy Arceri	fc707f570f	mesa/glsl: set {clip,cull}_distance_array_size directly in gl_program There are some line wrapping violations here but those lines will get deleted in the following patch. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-23 14:48:04 +11:00
Timothy Arceri	f86d15ed94	st/mesa/glsl: change xfb_program field to last_vert_prog Now that the i965 backend doesn't depend on this field we can make it more generic and short circuit a bunch of code paths. The new field will be used in a following patch for another clean-up. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-23 14:48:04 +11:00
Timothy Arceri	c505d6d852	mesa: use gl_program for CurrentProgram rather than gl_shader_program This makes much more sense and should be more performant in some critical paths such as SSO validation which is called at draw time. Previously the CurrentProgram array could have contained multiple pointers to the same struct which was confusing and we would often need to fish out the information we were really after from the gl_program anyway. Also it was error prone to depend on the _LinkedShader array for programs in current use because a failed linking attempt will lose the infomation about the current program in use which is still valid. V2: fix validate_io() to compare linked_stages rather than the consumer and producer to decide if we are looking at inward facing shader interfaces which don't need validation. Acked-by: Edward O'Callaghan <funfunctor@folklore1984.net> To avoid build regressions the following 2 patches were squashed in to this commit: mesa/meta: rewrite _mesa_shader_program_use() and _mesa_program_use() These are rewritten to do what the function name suggests, that is _mesa_shader_program_use() sets the use of all stage and _mesa_program_use() sets the use of a single stage. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Edward O'Callaghan <funfunctor@folklore1984.net> mesa: update active relinked program This likely fixes a subroutine bug were _mesa_shader_program_init_subroutine_defaults() would never have been called for the relinked program as we previously just set _NEW_PROGRAM as dirty and never called the _mesa_use* functions when linking. Acked-by: Edward O'Callaghan <funfunctor@folklore1984.net>	2017-01-23 14:48:04 +11:00
Rob Clark	31daeb5bf1	freedreno/a5xx: set frag shader threadsize Signed-off-by: Rob Clark <robdclark@gmail.com> Cc: "17.0" <mesa-stable@lists.freedesktop.org>	2017-01-22 14:12:05 -05:00
Rob Clark	8d6af93e76	freedreno/a5xx: set fragcoordxy properly What a3xx docs call IJPERSPCENTERREGID.. the xy coord passed into bary.f. We were incorrectly setting both this and gl_FragCoord.xy to the same register resulting in all sorts of hilarity. Fixes stk, vdrift, 0ad, probably a bunch others. Signed-off-by: Rob Clark <robdclark@gmail.com> Cc: "17.0" <mesa-stable@lists.freedesktop.org>	2017-01-22 14:11:43 -05:00
Rob Clark	278b97946f	freedreno/ir3: setup var locations in standalone compiler Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-01-22 14:11:26 -05:00
Rob Clark	6cc93bedc1	freedreno/a5xx: fix psize Note spritelist (POINTLIST_PSIZE) seems not to be a thing anymore on a5xx. Signed-off-by: Rob Clark <robdclark@gmail.com> Cc: "17.0" <mesa-stable@lists.freedesktop.org>	2017-01-22 14:11:15 -05:00
Rob Clark	141a4f86d6	freedreno/a5xx: srgb fix Signed-off-by: Rob Clark <robdclark@gmail.com> Cc: "17.0" <mesa-stable@lists.freedesktop.org>	2017-01-22 14:11:04 -05:00
Rob Clark	69fbb458cf	freedreno/a5xx: fix int vbos Signed-off-by: Rob Clark <robdclark@gmail.com> Cc: "17.0" <mesa-stable@lists.freedesktop.org>	2017-01-22 14:10:54 -05:00
Rob Clark	16671e9704	freedreno/a5xx: fix clear for uint/sint formats Signed-off-by: Rob Clark <robdclark@gmail.com> Cc: "17.0" <mesa-stable@lists.freedesktop.org>	2017-01-22 14:10:42 -05:00
Rob Clark	4d9aa4f67d	freedreno/a5xx: fix cull state Signed-off-by: Rob Clark <robdclark@gmail.com> Cc: "17.0" <mesa-stable@lists.freedesktop.org>	2017-01-22 14:10:28 -05:00
Rob Clark	4c39458460	freedreno: update generated headers Signed-off-by: Rob Clark <robdclark@gmail.com> Cc: "17.0" <mesa-stable@lists.freedesktop.org>	2017-01-22 14:09:45 -05:00
Lionel Landwerlin	494b63f525	anv: descriptors: don't update immutables samplers with anything but their immutable value Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-21 19:22:27 +00:00
Jason Ekstrand	bb96b03461	nir/search: Use the correct bit size for integer comparisons The previous code always compared integers as 64-bit. Due to variations in sign-extension in the code generated by nir_opt_algebraic.py, this meant that nir_search doesn't always do what you want. Instead, 32-bit values should be matched as 32-bit and 64-bit values should be matched as 64-bit. While we're here we unify the unsigned and signed paths. Now that we're using the right bit size, they should be the same since the only difference we had before was sign extension. This gets the UE4 bitfield_extract optimization working again. It had stopped working due to the constant 0xff00ff00 getting sign-extended when it shouldn't have. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Cc: "17.0 13.0" <mesa-stable@lists.freedesktop.org>	2017-01-21 10:34:21 -08:00
Jason Ekstrand	817f9e3b17	intel/blorp/copy: Properly handle clear colors for CCS_E images In order to handle CCS_E, we stomp the image format to a UINT format and then do some bitcasting logic in the shader. This works fine since SKL render compression only considers the channel layout of the format and not the format itself. In order for this to work on images that have been fast-cleared, we need to also convert the clear color so that, when interpreted as UINT, it provides the same bit value as it would have in the original format. This fixes a bunch of OpenGL ES CTS tests for copy_image when we start using CCS more aggressively. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Cc: "17.0" <mesa-stable@lists.freedesktop.org>	2017-01-21 10:34:09 -08:00
Kenneth Graunke	bb5db5564f	glsl: Rename [u]int64_t tokens. basetsd.h on Windows defines INT64 and UINT64 typedefs which conflict with these. Append "_TOK" to avoid conflicts. Should fix the Windows build. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-01-20 19:39:20 -08:00
Matt Turner	892781d6c7	Revert "i965: Really don't emit Q or UQ moves on Gen < 8" This reverts commit `c95380c404`. Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2017-01-20 19:12:31 -08:00
Matt Turner	d871f8e820	i965: Select DF type for 64-bit integers on Gen < 8. Gen8 adds Q/UQ types. We attempted to change the types back to DF in the generator (commit `c95380c40`), but an assertion added in the FP64 series (commit `e481dcc3`) triggers before that code has a chance to execute. In fact, using Q/UQ in the IR and then changing to DF in the generator would not work in the presence of source modifiers, etc. Fixes: `d6fcede6` ("i965: Return Q and UQ types for int64 and uint64") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-01-20 19:12:24 -08:00
Ian Romanick	db6d23cfd2	i965: Enable ARB_gpu_shader_int64 on Gen8+ Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-01-20 15:41:23 -08:00
Ian Romanick	fc16bf125f	i965: Split SIMD16 CMP of Q and UQ instructions This is basically the same as happens for doubles. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-01-20 15:41:23 -08:00
Ian Romanick	51807c6493	i965: Enable 64-bit integer support for almost all unary and binary operations Integer comparison functions (e.g., nir_op_ilt) are handled in the next commit. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-01-20 15:41:23 -08:00
Ian Romanick	821d7cece8	i965: Enable uploading 64-bit integer uniforms Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-01-20 15:41:23 -08:00
Ian Romanick	e0579c5017	i965: Add 64-bit integer support for conversions and bitcasts v2 (idr): Make the "from" type in a cast unsized. This reduces the number of required cast operations at the expensive slightly more complex code. However, this will be a dramatic improvement when other sized integer types are added. Suggested by Connor. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-01-20 15:41:23 -08:00
Ian Romanick	f2fa510594	i965: Enable emitting Q and UQ instructions in the fs backend v2: Fixup assertion in brw_reg_type_to_hw_type to allow BRW_REGISTER_TYPE_{UQ,Q} on Gen8+. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-01-20 15:41:23 -08:00

1 2 3 4 5 ...

88421 Commits All Branches Search

88421 Commits

All Branches