KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Rob Clark	e523f69b1d	freedreno/ir3: switch to shader_enums.h interp constants A small step towards un-TGSI'ifying ir3. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-09-17 19:55:43 -04:00
Ilia Mirkin	e844e1007d	nv50,nvc0: flush texture cache in presence of coherent bufs This fixes the newly-added arb_texture_buffer_object-bufferstorage piglit test. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "11.0" <mesa-stable@lists.freedesktop.org>	2015-09-17 19:50:47 -04:00
Ilia Mirkin	323c912506	nv50,nvc0: detect underlying resource changes and update tic When updating texture buffers, we might end up replacing the whole buffer. Check that the tic address matches the resource address, and if not, update the tic and reupload it. This fixes: arb_direct_state_access-texture-buffer arb_texture_buffer_object-data-sync Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "11.0" <mesa-stable@lists.freedesktop.org>	2015-09-17 19:50:47 -04:00
Boyan Ding	8d3b92af21	vc4: Try to pair up instructions when only one of them has PM bit Instructions with difference in PM field can actually be paired up if the one without PM doesn't do packing/unpacking and non-NOP packing/unpacking operations from PM instruction aren't added to the other without PM. total instructions in shared programs: 48209 -> 47460 (-1.55%) instructions in affected programs: 11688 -> 10939 (-6.41%) Signed-off-by: Boyan Ding <boyan.j.ding@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2015-09-17 14:57:46 -04:00
Jason Ekstrand	fc11dbe13f	i965/vec4: Use nir_move_vec_src_uses_to_dest The idea here is not that it gives register coalescing a little bit of a helping hand. It doesn't actually fix the coalescing problems, but it seems to help a good bit. Shader-db results for vec4 programs on Haswell: total instructions in shared programs: 1746280 -> 1683959 (-3.57%) instructions in affected programs: 1259166 -> 1196845 (-4.95%) helped: 11363 HURT: 148 v2 (Jason Ekstrand): - Run nir_move_vec_src_uses_to_dest after going out of SSA - New shader-db numbers Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2015-09-17 08:21:31 -07:00
Jason Ekstrand	a6c467d6c5	nir: Add a pass to rewrite uses of vecN sources to the vecN destination v2 (Jason Ekstrand): - Handle non-SSA sources and destinations Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2015-09-17 08:19:48 -07:00
Jason Ekstrand	ddffe30f40	nir: Add comments to nir_index_instrs and nir_index_ssa_defs The provided indices have the very nice property that if A dominates B then A->index <= B->index. We should document that somewhere. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-09-17 08:16:01 -07:00
Jason Ekstrand	8ecaef967d	nir: Add a generic instruction index Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-09-17 08:16:01 -07:00
Ulrich Weigand	bd016a2601	mesa: Fix texture compression on big-endian systems Various pieces of code to create compressed textures will first generate an uncompressed RGBA texture into a temporary buffer, and then read from that buffer while creating the final compressed texture in the requested format. The code reading from the temporary buffer assumes the buffer is formatted as an array of bytes in RGBA order. However, the buffer is filled using a _mesa_texstore call with MESA_FORMAT_R8G8B8A8_UNORM format -- this is defined as an array of integers holding the RGBA values in packed format (least-significant to most-significant). This means incorrect bytes are accessed on big-endian systems. This patch fixes this by using the MESA_FORMAT_A8B8G8R8_UNORM format instead on big-endian systems when filling the buffer. This fixes about 100 piglit test case failures on s390x for me. Signed-off-by: Ulrich Weigand <ulrich.weigand@de.ibm.com> Tested-by: Oded Gabbay <oded.gabbay@gmail.com> Cc: "10.6" "11.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@gmail.com>	2015-09-17 21:23:45 +10:00
Thomas Hellstrom	7e28650649	st/xa: Use PIPE_FORMAT_R8_UNORM when available XA has been using L8_UNORM for a8 and yuv component surfaces. This commit instead makes XA prefer R8_UNORM since it's assumed to have a higher availability. Also neither of these formats are suitable as destination formats using destination alpha blending, so reject those operations. Signed-off-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2015-09-17 00:03:00 -07:00
Tapani Pälli	ba02f7a3b6	mesa: return initial value for VALIDATE_STATUS if pipe not bound From OpenGL 4.5 Core spec (7.13): "If pipeline is a name that has been generated (without subsequent deletion) by GenProgramPipelines, but refers to a program pipeline object that has not been previously bound, the GL first creates a new state vector in the same manner as when BindProgramPipeline creates a new program pipeline object." I interpret this as "If GetProgramPipelineiv gets called without a bound (but valid) pipeline object, the state should reflect initial state of a new pipeline object." This is also expected behaviour by ES31-CTS.sepshaderobjs.PipelineApi conformance test. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Marta Lofstedt <marta.lofstedt@intel.com>	2015-09-17 08:26:33 +03:00
Tapani Pälli	d9689be5c6	mesa: return initial value for PROGRAM_SEPARABLE when not linked From OpenGL ES 3.1 spec (7.12): "Most properties set within program objects are specified not to take effect until the next call to LinkProgram or ProgramBinary. Some properties further require a successful call to either of these commands before taking effect. GetProgramiv returns the properties currently in effect for program, which may differ from the properties set within program since the most recent call to LinkProgram or ProgramBinary, which have not yet taken effect. If there has been no such call putting changes to pname into effect, initial values are returned." Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Marta Lofstedt <marta.lofstedt@intel.com>	2015-09-17 08:26:33 +03:00
Tapani Pälli	8f1ae9abeb	mesa: enable query of PROGRAM_PIPELINE_BINDING for ES 3.1 Specified in OpenGL ES 3.1 spec, Table 23.32: Program Object State. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Marta Lofstedt <marta.lofstedt@intel.com>	2015-09-17 08:26:33 +03:00
Timothy Arceri	ef8eebc6ad	nir: support indirect indexing samplers in struct arrays As a bonus we get indirect support for arrays of arrays for free. V5: couple of small clean-ups suggested by Jason. V4: fix struct member location caclulation, use nir_ssa_def rather than nir_src for the indirect as suggested by Jason V3: Use nir_instr_rewrite_src() with empty src rather then clearing the use_link list directly for the old indirects as suggested by Jason V2: Fixed validation error in debug build Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-09-17 11:28:34 +10:00
Timothy	0ad44ce373	glsl: add helper for calculating offsets for struct members V2: update comments Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-09-17 11:28:27 +10:00
Timothy Arceri	12af915e27	glsl: make variables private Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-09-17 11:28:21 +10:00
Timothy Arceri	dcd9cd0383	glsl: store uniform slot id in var location field This will allow us to access the uniform later on without resorting to building a name string and looking it up in UniformHash. V3: remove line wrap change from this patch V2: store slot number for all non-UBO uniforms to make code more consitent, renamed explicit_binding to explicit_location and added comment about what it does. Store the location at every shader stage. Updated data.location comments in ir/nir.h. Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-09-17 11:28:14 +10:00
Timothy Arceri	9788700caf	glsl: assign hidden uniforms their slot id earlier This is required so that the next patch can safely assign the slot id to the var. The ids are now assigned in the order we want before allocating storage so there is no need to sort the storage array and move things around. V2: rename variable to make code easier to follow as suggested by Jason Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-09-17 11:26:45 +10:00
Timothy Arceri	874a0217fd	glsl: order indices for samplers inside a struct array This allows the correct offset to be easily calculated for indirect indexing when a struct array contains multiple samplers, or any crazy nesting. The indices for the folling struct will now look like this: Sampler index: 0 Name: s[0].tex Sampler index: 1 Name: s[1].tex Sampler index: 2 Name: s[0].si.tex Sampler index: 3 Name: s[1].si.tex Sampler index: 4 Name: s[0].si.tex2 Sampler index: 5 Name: s[1].si.tex2 Before this change it looked like this: Sampler index: 0 Name: s[0].tex Sampler index: 3 Name: s[1].tex Sampler index: 1 Name: s[0].si.tex Sampler index: 4 Name: s[1].si.tex Sampler index: 2 Name: s[0].si.tex2 Sampler index: 5 Name: s[1].si.tex2 struct S_inner { sampler2D tex; sampler2D tex2; }; struct S { sampler2D tex; S_inner si; }; uniform S s[2]; V3: Update comments with suggestions from Jason V2: rename struct array counter to have better name Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-09-17 11:26:39 +10:00
Dave Airlie	b5df52b112	Revert "mesa/extensions: restrict GL_OES_EGL_image to GLES" This reverts commit `48961fa3ba`. glamor/Xwayland use this, the spec saying something when it was written, and the fact that the comment says Mesa relies on it hasn't changed. I also don't have a copy of this patch in my mail archive, which seems wierd, did it get posted to mesa-dev? Signed-off-by: Dave Airlie <airlied@redhat.com>	2015-09-17 06:58:51 +10:00
Eric Anholt	f5b26b4744	vc4: Only build in simulator mode if we find pkg-config for it. This will let other developers build it x86 for build-testing purposes.	2015-09-16 15:54:00 -04:00
Ilia Mirkin	37d0becfd9	freedreno/a3xx: use NUM_USER_CLIP_PLANES helper instead of magic number Use the helper from the newly-updated generated header file. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-09-16 15:42:55 -04:00
Ilia Mirkin	545a3cbb01	freedreno/a3xx: fix blending of L8 format Even though luminance formats don't have alpha, we still want the alpha output to go to the blender. This fixes the luminance blending tests. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "11.0" <mesa-stable@lists.freedesktop.org>	2015-09-16 15:42:55 -04:00
Ilia Mirkin	ee6b95c82c	freedreno/a3xx: add support for dual-source blending Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-09-16 15:42:54 -04:00
Eric Anholt	cfa980f493	vc4: convert from tgsi semantic/index to varying-slot (originally part of previous patch, split out to separate patch by Rob) v2: squash in some fixes from Eric v3: Another fix from Eric for point coords. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-09-16 15:07:08 -04:00
Eric Anholt	8fd3e53f3d	gallium/ttn: Convert to using VARYING_SLOT_* / FRAG_RESULT_*. This avoids exceeding the size of the .index bitfield since it got truncated, and should make our NIR look more like the NIR that the rest of the NIR developers are working on. v2: split out vc4 updates, first patch uses varying_slot_to_tgsi_semantic() helper, and second patch does the actual conversion. v3: add frag_result_to_tgsi_semantic() helper and don't try to map frag_results to semantic name/index as if they were varying_slot's v4: use VERT_ATTRIB_ for VS inputs v5: Fix vc4 build. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-09-16 15:03:53 -04:00
Ilia Mirkin	7a275fcda8	nv50, nvc0: fix max texture buffer size to 128M elements This is what the hardware supports, there never was any sort of 64K limit. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.6 11.0" <mesa-stable@lists.freedesktop.org>	2015-09-16 12:51:58 -04:00
Ilia Mirkin	eb081681df	st/mesa: avoid integer overflows with buffers >= 512MB This fixes failures with the newly-submitted max-size texture buffer piglit test for GPUs exposing >= 128M max texels. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.6 11.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Glenn Kennard <glenn.kennard@gmail.com>	2015-09-16 12:51:58 -04:00
Brian Paul	1aff899a87	mesa: move GL_APPLE_object_purgeable functions to new file Move this code out of bufferobj.c since it's not strongly connected to buffer objects. Acked-by: Matt Turner <mattst88@gmail.com>	2015-09-16 09:02:40 -06:00
Brian Paul	8faed71830	mesa: remove trailing whitespace in bufferobj.c Trivial.	2015-09-16 08:53:21 -06:00
Brian Paul	edc01c6704	mesa: whitespace, line wrap fixes in varray.c Trivial.	2015-09-16 08:53:21 -06:00
Rob Clark	aecbc93f2d	nir/print: print symbolic names from shader-enum v2: split out moving of FILE *fp into state structure into it's own (more complete patch) to reduce the noise in this one Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2015-09-16 10:15:35 -04:00
Rob Clark	840df72f93	nir/print: bit of state refactoring Rename print_var_state to print_state, and stuff FILE ptr into the state object. This avoids passing around an extra parameter everywhere. v2: even more extensive conversion.. use state everywhere instead of FILE ptr, and convert nir_print_instr() to use state as well Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-09-16 10:15:17 -04:00
Rob Clark	f2533f2f8c	glsl: shader-enum to name debug fxns Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2015-09-16 10:04:13 -04:00
Rob Clark	5bb41d9094	freedreno: one screen to rule them all Similar to `fee0686c21`, but in this case to ensure that drm_gralloc and libGLES_mesa are sharing a single screen. Bumps libdrm_freedreno version dependency, as it requires the new fd_device_fd() API. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-09-16 09:14:39 -04:00
Rob Clark	b3958f9f83	freedreno/ir3: use NIR to lower ffract instead of tgsi_lowering Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-09-16 08:28:18 -04:00
Rob Clark	d9efe40dc9	nir: add lowering for ffract Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2015-09-16 08:27:36 -04:00
Jordan Justen	47e18a5957	i965/fs: The barrier send uses only 1 payload register When preparing the barrier payload, the instructions should operate in simd8 mode since we only use 1 payload register. fs_inst::regs_read is also updated to indicate that it only reads one register for SHADER_OPCODE_BARRIER. These issues were flagged by: commit `cadd7dd384` Author: Jason Ekstrand <jason.ekstrand@intel.com> Date: Thu Jul 2 15:41:02 2015 -0700 i965/fs: Add a very basic validation pass Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-09-15 15:41:07 -07:00
Jason Ekstrand	cb503c3227	nir/builder: Use a normal temporary array in nir_channel C++ gets cranky if we take references of temporaries. This isn't a problem yet in master because nir_builder is never used from C++. However, it will be in the future so we should fix it now. Reviewed-by: Rob Clark <robclark@freedesktop.org>	2015-09-15 14:51:05 -07:00
Rob Clark	18385bc3ac	freedreno/a4xx: more texture formats Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-09-15 17:29:01 -04:00
Rob Clark	d85267c4bb	freedreno/a4xx: border-color support Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-09-15 17:29:01 -04:00
Rob Clark	f8222724f5	freedreno/a4xx: wire up texture clamp lowering Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-09-15 17:29:01 -04:00
Rob Clark	9124a49d54	freedreno: helper for a3xx/a4xx border-colors Both use the same layout for the buffer containing border-color values, so rather than duplicating the logic in a4xx, split it out into a helper. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-09-15 17:29:01 -04:00
Rob Clark	76977222af	freedreno: update generated headers Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-09-15 17:29:00 -04:00
Jason Ekstrand	29348631fe	nir/lower_vec_to_movs: Coalesce into destinations of fdot instructions Now that we have a replicating fdot instruction, we can actually coalesce into the destinations of vec4 instructions. We couldn't really do this before because, if the destination had to end up in .z, we couldn't reswizzle the instruction. With a replicated destination, the result ends up in all channels so we can just set the writemask and we're done. Shader-db results for vec4 programs on Haswell: total instructions in shared programs: 1747753 -> 1746280 (-0.08%) instructions in affected programs: 143274 -> 141801 (-1.03%) helped: 667 HURT: 0 It turns out that dot-products matter... Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2015-09-15 12:38:48 -07:00
Jason Ekstrand	a88ce0c1c4	i965/vec4: Use the replicated fdot instruction in NIR Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2015-09-15 12:38:48 -07:00
Jason Ekstrand	47739c7df4	nir: Add a fdot instruction that replicates the result to a vec4 Fortunately, nir_constant_expr already auto-splats if "dst" never shows up in the constant expression field so we don't need to do anything there. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2015-09-15 12:38:48 -07:00
Jason Ekstrand	2458ea95c5	nir/lower_vec_to_movs: Coalesce movs on-the-fly when possible The old pass blindly inserted a bunch of moves into the shader with no concern for whether or not it was really needed. This adds code to try and coalesce into the destination of the instruction providing the value. Shader-db results for vec4 shaders on Haswell: total instructions in shared programs: 1754420 -> 1747753 (-0.38%) instructions in affected programs: 231230 -> 224563 (-2.88%) helped: 1017 HURT: 2 This approach is heavily based on a different patch by Eduardo Lima Mitev <elima@igalia.com>. Eduardo's patch did this in a separate pass as opposed to integrating it into nir_lower_vec_to_movs. Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2015-09-15 12:38:07 -07:00
Jason Ekstrand	2b2f1f16a0	nir/lower_vec_to_movs: Get rid of start_idx and swizzle compacting Previously, we did this thing with keeping track of a separate start_idx which was different from the iteration variable. I think this was a relic of the way that GLSL IR implements writemasks. In NIR, if a given bit in the writemask is unset then that channel is just "unused", not missing. In particular, a vec4 operation with a writemask of 0xd will use sources 0, 2, and 3 and leave source 1 alone. We can simplify things a good deal (and make them correct) by removing this "compacting" step. Reviewed-by: Eduardo Lima Mitev <elima@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2015-09-15 11:13:48 -07:00
Jason Ekstrand	c951bb8305	i965/vec4_nir: Use partial SSA form rather than full non-SSA We made this switch in the FS backend some time ago and it seems to make a number of things a bit easier. In particular, supporting SSA values takes very little work in the backend and allows us to take advantage of the majority of the SSA information even after we've gotten rid of Phi nodes. Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2015-09-15 11:13:48 -07:00

1 2 3 4 5 ...

72854 Commits All Branches Search

72854 Commits

All Branches