KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Ilia Mirkin	37d0becfd9	freedreno/a3xx: use NUM_USER_CLIP_PLANES helper instead of magic number Use the helper from the newly-updated generated header file. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-09-16 15:42:55 -04:00
Ilia Mirkin	545a3cbb01	freedreno/a3xx: fix blending of L8 format Even though luminance formats don't have alpha, we still want the alpha output to go to the blender. This fixes the luminance blending tests. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "11.0" <mesa-stable@lists.freedesktop.org>	2015-09-16 15:42:55 -04:00
Ilia Mirkin	ee6b95c82c	freedreno/a3xx: add support for dual-source blending Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-09-16 15:42:54 -04:00
Eric Anholt	cfa980f493	vc4: convert from tgsi semantic/index to varying-slot (originally part of previous patch, split out to separate patch by Rob) v2: squash in some fixes from Eric v3: Another fix from Eric for point coords. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-09-16 15:07:08 -04:00
Eric Anholt	8fd3e53f3d	gallium/ttn: Convert to using VARYING_SLOT_* / FRAG_RESULT_*. This avoids exceeding the size of the .index bitfield since it got truncated, and should make our NIR look more like the NIR that the rest of the NIR developers are working on. v2: split out vc4 updates, first patch uses varying_slot_to_tgsi_semantic() helper, and second patch does the actual conversion. v3: add frag_result_to_tgsi_semantic() helper and don't try to map frag_results to semantic name/index as if they were varying_slot's v4: use VERT_ATTRIB_ for VS inputs v5: Fix vc4 build. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-09-16 15:03:53 -04:00
Ilia Mirkin	7a275fcda8	nv50, nvc0: fix max texture buffer size to 128M elements This is what the hardware supports, there never was any sort of 64K limit. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.6 11.0" <mesa-stable@lists.freedesktop.org>	2015-09-16 12:51:58 -04:00
Ilia Mirkin	eb081681df	st/mesa: avoid integer overflows with buffers >= 512MB This fixes failures with the newly-submitted max-size texture buffer piglit test for GPUs exposing >= 128M max texels. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.6 11.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Glenn Kennard <glenn.kennard@gmail.com>	2015-09-16 12:51:58 -04:00
Brian Paul	1aff899a87	mesa: move GL_APPLE_object_purgeable functions to new file Move this code out of bufferobj.c since it's not strongly connected to buffer objects. Acked-by: Matt Turner <mattst88@gmail.com>	2015-09-16 09:02:40 -06:00
Brian Paul	8faed71830	mesa: remove trailing whitespace in bufferobj.c Trivial.	2015-09-16 08:53:21 -06:00
Brian Paul	edc01c6704	mesa: whitespace, line wrap fixes in varray.c Trivial.	2015-09-16 08:53:21 -06:00
Rob Clark	aecbc93f2d	nir/print: print symbolic names from shader-enum v2: split out moving of FILE *fp into state structure into it's own (more complete patch) to reduce the noise in this one Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2015-09-16 10:15:35 -04:00
Rob Clark	840df72f93	nir/print: bit of state refactoring Rename print_var_state to print_state, and stuff FILE ptr into the state object. This avoids passing around an extra parameter everywhere. v2: even more extensive conversion.. use state everywhere instead of FILE ptr, and convert nir_print_instr() to use state as well Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-09-16 10:15:17 -04:00
Rob Clark	f2533f2f8c	glsl: shader-enum to name debug fxns Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2015-09-16 10:04:13 -04:00
Rob Clark	5bb41d9094	freedreno: one screen to rule them all Similar to `fee0686c21`, but in this case to ensure that drm_gralloc and libGLES_mesa are sharing a single screen. Bumps libdrm_freedreno version dependency, as it requires the new fd_device_fd() API. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-09-16 09:14:39 -04:00
Rob Clark	b3958f9f83	freedreno/ir3: use NIR to lower ffract instead of tgsi_lowering Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-09-16 08:28:18 -04:00
Rob Clark	d9efe40dc9	nir: add lowering for ffract Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2015-09-16 08:27:36 -04:00
Jordan Justen	47e18a5957	i965/fs: The barrier send uses only 1 payload register When preparing the barrier payload, the instructions should operate in simd8 mode since we only use 1 payload register. fs_inst::regs_read is also updated to indicate that it only reads one register for SHADER_OPCODE_BARRIER. These issues were flagged by: commit `cadd7dd384` Author: Jason Ekstrand <jason.ekstrand@intel.com> Date: Thu Jul 2 15:41:02 2015 -0700 i965/fs: Add a very basic validation pass Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-09-15 15:41:07 -07:00
Jason Ekstrand	cb503c3227	nir/builder: Use a normal temporary array in nir_channel C++ gets cranky if we take references of temporaries. This isn't a problem yet in master because nir_builder is never used from C++. However, it will be in the future so we should fix it now. Reviewed-by: Rob Clark <robclark@freedesktop.org>	2015-09-15 14:51:05 -07:00
Rob Clark	18385bc3ac	freedreno/a4xx: more texture formats Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-09-15 17:29:01 -04:00
Rob Clark	d85267c4bb	freedreno/a4xx: border-color support Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-09-15 17:29:01 -04:00
Rob Clark	f8222724f5	freedreno/a4xx: wire up texture clamp lowering Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-09-15 17:29:01 -04:00
Rob Clark	9124a49d54	freedreno: helper for a3xx/a4xx border-colors Both use the same layout for the buffer containing border-color values, so rather than duplicating the logic in a4xx, split it out into a helper. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-09-15 17:29:01 -04:00
Rob Clark	76977222af	freedreno: update generated headers Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-09-15 17:29:00 -04:00
Jason Ekstrand	29348631fe	nir/lower_vec_to_movs: Coalesce into destinations of fdot instructions Now that we have a replicating fdot instruction, we can actually coalesce into the destinations of vec4 instructions. We couldn't really do this before because, if the destination had to end up in .z, we couldn't reswizzle the instruction. With a replicated destination, the result ends up in all channels so we can just set the writemask and we're done. Shader-db results for vec4 programs on Haswell: total instructions in shared programs: 1747753 -> 1746280 (-0.08%) instructions in affected programs: 143274 -> 141801 (-1.03%) helped: 667 HURT: 0 It turns out that dot-products matter... Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2015-09-15 12:38:48 -07:00
Jason Ekstrand	a88ce0c1c4	i965/vec4: Use the replicated fdot instruction in NIR Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2015-09-15 12:38:48 -07:00
Jason Ekstrand	47739c7df4	nir: Add a fdot instruction that replicates the result to a vec4 Fortunately, nir_constant_expr already auto-splats if "dst" never shows up in the constant expression field so we don't need to do anything there. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2015-09-15 12:38:48 -07:00
Jason Ekstrand	2458ea95c5	nir/lower_vec_to_movs: Coalesce movs on-the-fly when possible The old pass blindly inserted a bunch of moves into the shader with no concern for whether or not it was really needed. This adds code to try and coalesce into the destination of the instruction providing the value. Shader-db results for vec4 shaders on Haswell: total instructions in shared programs: 1754420 -> 1747753 (-0.38%) instructions in affected programs: 231230 -> 224563 (-2.88%) helped: 1017 HURT: 2 This approach is heavily based on a different patch by Eduardo Lima Mitev <elima@igalia.com>. Eduardo's patch did this in a separate pass as opposed to integrating it into nir_lower_vec_to_movs. Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2015-09-15 12:38:07 -07:00
Jason Ekstrand	2b2f1f16a0	nir/lower_vec_to_movs: Get rid of start_idx and swizzle compacting Previously, we did this thing with keeping track of a separate start_idx which was different from the iteration variable. I think this was a relic of the way that GLSL IR implements writemasks. In NIR, if a given bit in the writemask is unset then that channel is just "unused", not missing. In particular, a vec4 operation with a writemask of 0xd will use sources 0, 2, and 3 and leave source 1 alone. We can simplify things a good deal (and make them correct) by removing this "compacting" step. Reviewed-by: Eduardo Lima Mitev <elima@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2015-09-15 11:13:48 -07:00
Jason Ekstrand	c951bb8305	i965/vec4_nir: Use partial SSA form rather than full non-SSA We made this switch in the FS backend some time ago and it seems to make a number of things a bit easier. In particular, supporting SSA values takes very little work in the backend and allows us to take advantage of the majority of the SSA information even after we've gotten rid of Phi nodes. Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2015-09-15 11:13:48 -07:00
Jason Ekstrand	c3f8cde964	nir/lower_vec_to_movs: Handle partially SSA shaders v2 (Jason Ekstrand): - Use nir_instr_rewrite_dest - Pass the impl directly into lower_vec_to_movs_block Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2015-09-15 11:13:45 -07:00
Jason Ekstrand	b7eeced3c7	nir/lower_vec_to_movs: Pass the shader around directly Previously, we were passing the shader around, we were just calling it "mem_ctx". However, the nir_shader is (and must be for the purposes of mark-and-sweep) the mem_ctx so we might as well pass it around explicitly. Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2015-09-15 11:13:40 -07:00
Jason Ekstrand	cadd7dd384	i965/fs: Add a very basic validation pass Currently the validation pass only validates that regs_read and regs_written are consistent with the sizes of VGRF's. We can add more as we find it to be useful. Reviewed-by: Matt Turner <mattst88@gmail.com>	2015-09-15 11:11:50 -07:00
Jason Ekstrand	0c6df7a1cb	i965/fs_surface_builder: Only apply predicate to components that exist In certain conditions, we have to do bounds-checking in the shader for image_load_store. The way this works for image loads is that we do a predicated load and then emit a series of selects, one per component, that gives us 0 or the loaded value depending on whether or not you're in bounds. However, we were hard-coding 4 components which may not be correct. Instead, we should be using size which is the number of components read. Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2015-09-15 11:09:48 -07:00
Jason Ekstrand	5182400054	i965/fs: Only read output_components many components when writing an output Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-15 11:08:12 -07:00
Jason Ekstrand	f55836f567	i965/fs: Set output_components for lowered clip distance outputs Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-15 11:07:54 -07:00
Nanley Chery	8200793649	mesa/teximage: restrict GL_ETC1_RGB8_OES support to GLES According to the extensions table and our glext headers, OES_compressed_ETC1_RGB8_texture is only supported in GLES1 and GLES2. Since we may give users a GLES3 context when a GLES2 context is requested, we also allow this extension for GLES3 as well. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Nanley Chery <nanley.g.chery@intel.com>	2015-09-15 10:11:14 -07:00
Nanley Chery	48961fa3ba	mesa/extensions: restrict GL_OES_EGL_image to GLES Driver vendors do this as well. The extension specification lists GLES 1.1 or 2.0 as requirements. Reviewed-by: Chad Versace <chad.versace@intel.com> Signed-off-by: Nanley Chery <nanley.g.chery@intel.com>	2015-09-15 10:00:00 -07:00
Nanley Chery	fe796a1831	mesa/extensions: restrict luminance alpha formats to API_OPENGL_COMPAT According the GL 3.1 spec, luminance alpha formats are deprecated. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Nanley Chery <nanley.g.chery@intel.com>	2015-09-15 10:00:00 -07:00
Thomas Hellstrom	edfb7ed109	gallium/svga: Enable PIPE_FORMAT_L8_UNORM for vgpu10 It's extensively used by XA for a8- and planar yuv component surfaces. This fixes broken XA yuv blits using vgpu10 contexts. Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2015-09-15 09:25:02 -07:00
Emil Velikov	a1ac742f70	egl/dri2: don't leak the fd on dri2_terminate Currently the check was incorrect as it did not consider the (unlikely) case of fd == 0. In order to fix this we should first correctly initialize it to -1, as the swrast implementations leave it set to zero (props to calloc()). Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Boyan Ding <boyan.j.ding@gmail.com>	2015-09-15 12:39:02 +01:00
Emil Velikov	bd5bcb5b8c	egl/dri2/drm: compact existing device mgmt Move the fcntl(dupfd_cloexec) to the else branch where it belongs. Otherwise it's not immediately obvious that the code is hit, only when an existing device is used. Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Boyan Ding <boyan.j.ding@gmail.com>	2015-09-15 12:37:27 +01:00
Matt Turner	e4f0d26c8c	egl/dri2: Close file descriptor on error. v2: [Emil Velikov] Rework the error path to a common goto, close only if we own the fd. v3; [Emil Velikov] Always close the fd (we either opened the device or dup'd) (Boyan, Ian) Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Boyan Ding <boyan.j.ding@gmail.com>	2015-09-15 12:37:26 +01:00
Ray Strode	4bf151e662	gbm: convert gbm bo format to fourcc format on dma-buf import At the moment if a gbm buffer is imported and the gbm buffer has an old-style GBM_BO_FORMAT format, the import will crash, since it's passed directly to DRI functions that expect a fourcc format (as provided by the newer GBM_FORMAT definitions) This commit addresses the problem in two ways: 1) it prevents invalid formats from leading to a crash by returning EINVAL if the image couldn't be created 2) it translates GBM_BO_FORMAT formats into the comparable GBM_FORMAT formats. Reference: https://bugzilla.gnome.org/show_bug.cgi?id=753531 CC: "10.6 11.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-09-15 12:27:45 +01:00
Alejandro Piñeiro	a26e82b81d	docs: document INTEL_DEBUG 'optimizer' envvar Reviewed-by: Matt Turner <mattst88@gmail.com>	2015-09-15 08:33:35 +02:00
Kristian Høgsberg Kristensen	a548c75e31	i965: Move perf_debug code to brw_codegen__prog() We're trying to avoid a libdrm dependency in the core compiler, so let's move the perf_debug code one level up from the brw__emit() helpers to the brw_codegen_*_prog() helpers. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Signed-off-by: Kristian Høgsberg Kristensen <krh@bitplanet.net>	2015-09-14 16:56:59 -07:00
Kristian Høgsberg Kristensen	84f2ed2cfd	i965: Move brw_fs_precompile() to brw_wm.c All other precompile functions live in the brw_<stage>.c files, make fs follow the convention. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Signed-off-by: Kristian Høgsberg Kristensen <krh@bitplanet.net>	2015-09-14 16:55:49 -07:00
Kristian Høgsberg Kristensen	dc70c86b9b	i965: Move compute shader code around This moves the compute shader code around in order to make the way the code is split up more consistent. There should be no functional changes. Typically we have a few files per stage: brw_vs.c, brw_wm.c brw_gs.c: code to drive code generation and implement precompiling and cache search. genX_<stage>_state.c gen specific implementation of the state emission for the shader stage. The brw_*_emit() functions are all in the same files as the visitor classes they use (with the exception of VS, which may use either vec4 or fs). To make compute follow this convention, we move the brw_cs_emit() function into brw_fs.cpp. We can then rename brw_cs.cpp to brw_cs.c and do this in C like the other similar files. Finally, move state setup and atoms to gen7_cs_state.c. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Signed-off-by: Kristian Høgsberg Kristensen <krh@bitplanet.net>	2015-09-14 16:52:42 -07:00
Anuj Phogat	64e25167ed	meta: Abort meta pbo path if TexSubImage need signed unsigned conversion See similar fix for Readpixels in mesa commit `0d20790`. Jason suggested we need that for TexSubImage as well. Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-09-14 15:22:37 -07:00
Ilia Mirkin	5877a594d5	nvc0/ir: start offset at texBindBase for txq, like regular texturing Curiously this has no actual effect. I think it's because the first 8 textures are bound in multiple slots for some reason. However seems prudent to use these the same way as regular texturing, esp in the case where there are more than 8 textures bound. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-09-14 17:26:25 -04:00
Eric Anholt	64aee8fe9f	vc4: Fix build from recent NIR cleanups.	2015-09-14 11:21:07 -04:00

1 2 3 4 5 ...

72833 Commits All Branches Search

72833 Commits

All Branches