KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Timothy Arceri	c86baf71fb	st/glsl_to_nir: use nir_lower_io_arrays_to_elements() to lower arrays This pass is more fully featured, it supports geom and tess shaders. It also supports interpolation intrinsics. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-12-04 12:52:18 +11:00
Timothy Arceri	d99c7e0ff1	nir: allow builtin arrays to be lowered Gallium's nir drivers expect this to be done. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-12-04 12:52:18 +11:00
Timothy Arceri	2bc49ac3e6	nir: add array lowering function that assumes there are no indirects The gallium glsl->nir pass currently lowers away all indirects on both inputs and outputs. This function allows us to lower vs inputs and fs outputs and also lower things one stage at a time as we don't need to worry about indirects on the other side of the shader interface. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-12-04 12:52:18 +11:00
Timothy Arceri	f13790c92f	radv: enable nir varying array splitting Acked-by: Dave Airlie <airlied@redhat.com>	2017-12-04 12:52:18 +11:00
Timothy Arceri	6648bd68fd	st/glsl_to_nir: enable NIR link time opts Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-12-04 09:10:30 +11:00
Timothy Arceri	c16a0e11d3	radeonsi/nir: add support for packed inputs Because NIR can create non vec4 variables when implementing component packing we need to make sure not to reprocess the same slot again. Also we can drop the fs_attr_idx counter and just use driver_location. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-12-04 09:10:30 +11:00
Timothy Arceri	c3a5d74377	st/glsl_to_nir: move some calls out of st_glsl_to_nir_post_opts() NIR component packing will be inserted between these calls and the calling of st_glsl_to_nir_post_opts(). Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-12-04 09:10:30 +11:00
Timothy Arceri	90abaf8a21	st/glsl_to_nir: call some lowering passes earlier This is required so that we can enable NIR linking optimisations. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-12-04 09:10:30 +11:00
Timothy Arceri	bd98b8c74e	st/glsl_to_nir: add basic NIR opt loop helper We need to be able to do these NIR opts in the state tracker rather than the driver in order for the NIR linking opts to be useful. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-12-04 09:10:30 +11:00
Timothy Arceri	a9ac01b96f	st/glsl_to_nir: make st_glsl_to_nir() static Here we also move the extern C functions to the bottom of the file. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-12-04 09:10:30 +11:00
Timothy Arceri	d586f39cb0	st/glsl_to_nir: split the st_glsl_to_nir() function in two We want to be able to generate NIR then apply NIR optimisations. Once the optimisations are done we can then apply the new post opt function which assigns uniforms etc based on the optimised IR. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-12-04 09:10:30 +11:00
Timothy Arceri	d38f99baec	st/glsl_to_nir: create set_st_program() helper Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-12-04 09:10:30 +11:00
Timothy Arceri	da953b641d	st/glsl: move nir linking loop to new function st_link_nir() This will allow us to refactor linking and include some nir link time optimisations. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-12-04 09:10:30 +11:00
Timothy Arceri	2a35021bc6	nir: fix support for scalar arrays in nir_lower_io_types() This was just recreating the same vector type we already had and hitting an assert for scalars. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-12-04 09:10:30 +11:00
Timothy Arceri	9530b786d2	st/glsl_to_nir: add st_nir_assign_var_locations() helper This avoids packed varyings being assigned different driver locations. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-12-04 09:10:30 +11:00
Timothy Arceri	aecb9bec87	radv: enable nir component packing SaschaWillems Vulkan demo tessellation: ~4000fps -> ~4600fps Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-12-04 09:10:30 +11:00
Timothy Arceri	1c9c42d16b	nir: add varying component packing helpers v2: update shader info input/output masks when packing components v3: make sure interpolation loc matches, this is required for the radeonsi NIR backend. v4: `33dca36f4f` fixed nir_gather_info to update outputs_read correctly, make sure we also adjust this correctly when packing components. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (v1) Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> (v3)	2017-12-04 09:10:30 +11:00
Timothy Arceri	c797bc6aa7	nir: add varying array splitting pass V2: - fix matrix support, non-array matrices were being skipped in v1 v3: - handle lowering of tcs output loads correctly - correctly mark indirect locations for either in or out not both when processing a stage. - use nir_src_copy() when lowering stores. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-12-04 09:10:30 +11:00
Rob Clark	11efe42a73	freedreno/ir3: relax barriers Instructions with no barrier_class can move wrt. an EVERYTHING barrier. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-12-03 14:17:41 -05:00
Rob Clark	48eef0c182	freedreno/ir3: all mem instructions have WAR hazard It isn't just load instructions that have write-after-read hazard. Fixes stk gaussian blur compute shaders. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-12-03 14:17:41 -05:00
Rob Clark	e6c6495d3a	freedreno: add debug option to force emulated indirect Useful mostly for debugging indirect draw. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-12-03 14:17:41 -05:00
Rob Clark	f93f2f7b1e	freedreno: also mark draw-indirect buffer as read Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-12-03 14:17:41 -05:00
Rob Clark	4b1d0d2844	freedreno: small cleanups Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-12-03 14:17:41 -05:00
Rob Clark	91730fb0ff	freedreno: avoid unnecessary batch flush In some cases we can end up trying to add a write dependency on ourself, which shouldn't trigger a flush. Avoids an extra couple flushes per frame in stk. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-12-03 14:17:41 -05:00
Rob Clark	4ab6ab8036	freedreno: avoid mem2gmem for invalidated buffers Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-12-03 14:17:41 -05:00
Rob Clark	2fcf6faa06	freedreno: deferred flush support Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-12-03 14:17:41 -05:00
Rob Clark	15ebf387fc	freedreno: rework fence tracking ctx->last_fence isn't such a terribly clever idea, if batches can be flushed out of order. Instead, each batch now holds a fence, which is created before the batch is flushed (useful for next patch), that later gets populated after the batch is actually flushed. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-12-03 14:17:40 -05:00
Rob Clark	deb57fb237	freedreno: proper locking for iterating dependent batches In transfer_map(), when we need to flush batches that read from a resource, we should be holding screen->lock to guard against race conditions. Somehow deferred flush seems to make this existing race more obvious. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-12-03 14:17:40 -05:00
Rob Clark	ef6313ffd3	freedreno/a5xx: correct max_indicies for indirect draws Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-12-03 14:17:40 -05:00
Jason Ekstrand	e19c623128	spirv: Convert the supported_extensions struct to spirv_options This is a bit more general and lets us pass additional options into the spirv_to_nir pass beyond what capabilities we support. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2017-12-02 08:09:11 -08:00
Jason Ekstrand	6bd876dcaa	spirv: Only emit functions which are actually used Instead of emitting absolutely everything, just emit the few functions that are actually referenced in some way by the entrypoint. This should save us quite a bit of time when handed large shader modules containing many entrypoints. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2017-12-02 08:07:35 -08:00
Jason Ekstrand	f5aad36d2e	spirv: Drop the impl field from vtn_builder We have a nir_builder and it has an impl field. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2017-12-02 08:07:35 -08:00
Jordan Justen	fc033742d2	i965: Serialize nir later in the linking process Fixes MESA_GLSL=cache_fb with piglit tests/spec/glsl-1.50/execution/geometry/clip-distance-vs-gs-out.shader_test Fixes: `0610a624a1` i965/link: Serialize program to nir after linking for shader cache Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=103988 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-12-01 23:17:44 -08:00
Marc Dietrich	d93fabb013	configure: avoid testing for negative compiler options gcc seems to always accept unsupported negative compiler warning options: echo "int i;" | gcc -c -xc -Wno-bob - # no error echo "int i;" | gcc -c -xc -Walice - # unsupported compiler option Inverting the options fixes the tests. V2: fix options in meson build Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Signed-off-by: Marc Dietrich <marvin24@gmx.de>	2017-12-01 17:09:42 -08:00
Eric Anholt	0ed952c7e9	broadcom/vc4: Use a single-entry cached last_hindex value. Since almost all BOs will be in one CL at a time, this cache will almost always hit except for the first usage of the BO in each CL. This didn't show up as statistically significant on the minetest trace (n=340), but if I lop off the throttled lobe of the bimodal distribution, it very clearly does (0.74731% +/- 0.162093%, n=269).	2017-12-01 15:37:28 -08:00
Eric Anholt	230e646a40	broadcom/vc4: Decompose single QUADs to a TRIANGLE_FAN. No significant difference in the minetest replay, but it should reduce overhead by not requiring that we write quad indices to index buffers that we repeatedly re-upload (and making the draw packet smaller, as well). Over the course of the series the actual game seems to be up by 1-2 fps.	2017-12-01 15:37:28 -08:00
Eric Anholt	fefff74b0d	broadcom/vc4: Use the new enum functionality of the XML to decode better.	2017-12-01 15:37:28 -08:00
Eric Anholt	5167367050	broadcom/vc4: Skip emitting redundant VC4_PACKET_GEM_HANDLES. Now that there's only one user of it, it's pretty obvious how to avoid emitting redundant ones. This should save a bunch of kernel validation overhead. No statistically significant difference on the minetest trace I was looking at (n=169), but the maximum FPS is up by .3%	2017-12-01 15:37:28 -08:00
Eric Anholt	842b05d6ad	broadcom/vc4: Simplify the relocation handling for index buffers. Originally there was CL code for handling various relocations back when I had relocs for the TSDA/TA buffers. Now that the kernel handles those entirely on its own, I can inline that code into the one place using it.	2017-12-01 15:37:28 -08:00
Eric Anholt	84ab48c15c	broadcom/vc4: Fix handling of GFXH-515 workaround with a start vertex count. We failed to take the start into account for how many vertices to draw in this round, so we would end up decrementing count below 0, which as an unsigned number meant we would loop until the CLs soon ran out of space. When I wrote the code I was thinking about how to use the previously emitted shader state (no index bias baked into the elements) by emitting up to 65535 and then only re-emitting with bias for the second round, but that doesn't work if the start is over 65535. Instead, just delay emitting shader state until we get into the drawarrays GFXH-515 loop and always bake the bias in when we're doing the workaround.	2017-12-01 15:37:28 -08:00
Eric Anholt	bcb6ebe91a	broadcom/vc4: Fix the scaling factor for the GFXH-515 workaround. For triangle strips, we step by max_verts - 2.	2017-12-01 15:37:28 -08:00
Dylan Baker	f56e964e01	meson: use dep_thread instead of dependency('threads') in freedreno They are the same thing, but this is more consistent with the rest of the project. Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com>	2017-12-01 15:31:43 -08:00
Dylan Baker	5e71efef44	meson: Add lmsensors support v2: - Make -Dlmsensors=false work - Simplify auto and true cases Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com>	2017-12-01 15:31:43 -08:00
Dylan Baker	7309207432	meson: Add support for gallium extra hud Signed-off-by: Dylan Baker <dylanx.c.baker@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com>	2017-12-01 15:31:43 -08:00
Adam Jackson	a48a6b8a40	glx: Prepare driFetchDrawable for no-config contexts When we look up the DRI drawable state we need to associate an fbconfig with the drawable. With GLX_EXT_no_config_context we can no longer infer that from the context and must instead query the server. Signed-off-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2017-12-01 15:53:52 -05:00
Adam Jackson	75d5d22fb7	glx: Use __glXSendError instead of open-coding it This also fixes a bug, the error path through MakeCurrent didn't translate the error code by the extension's error base. Signed-off-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2017-12-01 15:46:46 -05:00
Adam Jackson	bcb15bee52	glx: Simplify some dummy vtable interactions The dummy vtable has these slots as NULL already, no need to check for the dummy context explicitly. Signed-off-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-12-01 15:46:46 -05:00
Emil Velikov	8893418e99	docs/release-calendar: update and extend v2: Missing td tag, add Andres + Juan for 17.2.8 and 17.3.3 Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Acked-by: Nicolai Hähnle <nicolai.haehnle@amd.com> (v1) Reviewed-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com>	2017-12-01 19:30:23 +00:00
Emil Velikov	8d58e9b2cf	docs/specs: annotate MESA_set_3dfx_mode as obsolete Aimed to work with Glide, which hasn't been a thing in over 10 years. There are no drivers that implement it, so annotate it as obsolete v2: Move the extension to OLD/ Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Brian Paul <brianp@vmware.com> (v1) Reviewed-by: Adam Jackson <ajax@redhat.com> (v1) Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2017-12-01 19:30:23 +00:00
Emil Velikov	f8aea0ce47	xlib: remove dummy GLX_MESA_set_3dfx_mode implementation The implementation is a simple 'return EGL_FALSE'. Stop pretending and simply remove it. Note: the removal of XMesa API is fine, since there haven't been any users for it in years. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2017-12-01 19:30:23 +00:00

98285 Commits

All Branches