KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Samuel Pitoiset	0cfc1304be	nv50: allow using inline vertex data submit when gl_VertexID is used The hardware can actually generates vertexid when vertices come from a client-side buffer like when glDrawElements is used. This doesn't fix (or break) any piglit tests but it improves the previous attempt of Ilia (`c830d19` "nv50: avoid using inline vertex data submit when gl_VertexID is used") The only disadvantage is that only works on G84+, but we don't really care of that weird and old NV50 chipset. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-19 21:11:38 +01:00
Samuel Pitoiset	9e40a621c1	nv50: add NV84_3D macro Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-19 21:11:27 +01:00
Matt Turner	a5b3115f0a	i965: Drop IMM fs_reg/src_reg -> brw_reg conversions. The previous two commits make this unnecessary. Reviewed-by: Emil Velikov <emil.velikov@collabora.co.uk> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-11-19 11:12:24 -08:00
Matt Turner	f9a9ba5eac	i965/vec4: Replace src_reg(imm) constructors with brw_imm_*(). Cuts 1.5k of .text. Reviewed-by: Emil Velikov <emil.velikov@collabora.co.uk> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-11-19 11:12:24 -08:00
Matt Turner	9b978046eb	i965/fs: Use brw_imm_uw(). W/UW immediates are 16-bits, but those 16-bits must be replicated in the high 16-bits of the 32-bit field. Remove the useless W/UW immediate saturating code, since we'll now be using the appropriate immediate (and W/UW immediates in the IR can now no longer be larger than 16-bits). Reviewed-by: Emil Velikov <emil.velikov@collabora.co.uk> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-11-19 11:12:24 -08:00
Matt Turner	3ccc41ecfc	i965/fs: Replace fs_reg(imm) constructors with brw_imm_*(). Cuts 10k of .text, of which only 776 bytes are the fs_reg constructor implementations themselves. text data bss dec hex filename 5204535 214112 27784 5446431 531b1f i965_dri.so before 5193977 214112 27784 5435873 52f1e1 i965_dri.so after Reviewed-by: Emil Velikov <emil.velikov@collabora.co.uk> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-11-19 11:12:24 -08:00
Matt Turner	c15a407eb4	i965: Make brw_imm_vf4() take 8-bit restricted floats. This partially reverts commit `bbf8239f92`. I didn't like that commit to begin with -- computing things at compile time is fine -- but for purposes of verifying that the resulting values are correct, looking up 0x00 and 0x30 in a table is a lot better than evaluating a recursive function. Anyway, by making brw_imm_vf4() take the actual 8-bit restricted floats directly (instead of only integral values that would be converted to restricted float), we can use this function as a replacement for the vector float src_reg/fs_reg constructors. brw_float_to_vf() is not currently an inline function, so it will not be evaluated at compile time. I'll address that in a follow-up patch. Reviewed-by: Emil Velikov <emil.velikov@collabora.co.uk> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-11-19 11:12:24 -08:00
Nanley Chery	e8c5ef3eca	mesa: Add test for sorted extension table Enable developers to know if the table's alphabetical sorting is maintained or lost. v2: Move "" next to pointer name (Matt) Include extensions_table.h instead of extensions.h (Ian) Remove extra " " in comment (Ian) Signed-off-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2015-11-19 11:12:45 -08:00
Nanley Chery	f030227f46	mesa/extensions: Sort the extension table alphabetically Make it easier to determine where to add new extensions. Performed with the vim sort command. v2: Insert newline after last #define (Matt) Signed-off-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2015-11-19 10:35:20 -08:00
Ilia Mirkin	bcda79676a	docs: GL3.1 for a3xx and a4xx Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-19 12:26:28 -05:00
Ryan Houdek	0ec218d167	mesa: enable EXT_blend_func_extended if the driver supports the ARB version Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-19 11:39:51 -05:00
Ryan Houdek	f7c23f225f	mesa: allow MAX_DUAL_SOURCE_DRAW_BUFFERS to be available to ES Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-19 11:39:51 -05:00
Ryan Houdek	4b549f0d8c	mesa: enable usage of blend_func_extended blend factors in GLES2 Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-19 11:39:51 -05:00
Ryan Houdek	33ddc8e865	glsl: add a parse check to check for the index layout qualifier This can only be used if EXT_blend_func_extended is enabled Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-19 11:39:51 -05:00
Ryan Houdek	ef9e6d1ec8	glsl: add GL_EXT_blend_func_extended preprocessor define Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-19 11:39:51 -05:00
Ryan Houdek	1d1d02f2ac	glsl: add support for EXT_blend_func_extended builtins gl_MaxDualSourceDrawBuffersEXT - Maximum dual-source draw buffers supported For ESSL 1.0, it provides two builtins since you can't have user-defined color output variables: gl_SecondaryFragColorEXT gl_SecondaryFragDataEXT[MaxDSDrawBuffers] Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-19 11:39:51 -05:00
Ryan Houdek	ceecb0876f	glsl: add EXT_blend_func_extended parser enables This adds a state for the maximum dual source draw variables available and the variable for determining if the extension has been enabled in the program shaders. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-19 11:39:51 -05:00
Ryan Houdek	625414f78c	glapi: add EXT_blend_func_extended XML definitions Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-19 11:39:51 -05:00
Brian Paul	15f8dc7b23	os: check for GALLIUM_PROCESS_NAME to override os_get_process_name() Useful for debugging and for glretrace. Reviewed-by: Charmaine Lee <charmainel@vmware.com> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2015-11-19 09:23:04 -07:00
Connor Abbott	f1ba0a5ea0	glsl: fix ir_constant::equals() for doubles Reviewed-by: Timothy Arceri <t_arceri@yahoo.com.au> Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com>	2015-11-19 09:16:18 +01:00
Connor Abbott	84ed3819a4	glsl: fix isinf() for doubles Reviewed-by: Timothy Arceri <t_arceri@yahoo.com.au> Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com>	2015-11-19 09:16:18 +01:00
Connor Abbott	7820b2c071	nir: fix constant folding of bfi Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com>	2015-11-19 09:16:18 +01:00
Brian Paul	1cfffb95eb	hud: fix Windows build break Protect signal-related code with PIPE_OS_UNIX test. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2015-11-19 07:57:09 +00:00
Ian Romanick	2f55476153	glsl: Fix off-by-one error in array size check assertion Apparently, this has been a bug since 2010 (`c30f6e5d`). Also use ARRAY_SIZE instead of open coding it. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: mesa-stable@lists.freedesktop.org	2015-11-18 18:35:56 -08:00
Ian Romanick	0aded03046	mesa: Don't expose GL_EXT_shader_integer_mix in GLES 1.x There are no shaders, so it doesn't even make sense to expose the extension. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Cc: Nanley Chery <nanley.g.chery@intel.com>	2015-11-18 18:35:56 -08:00
Ian Romanick	37c2cfa6bc	glsl: Silence unused parameter warnings builtin_functions.cpp:5289:52: warning: unused parameter 'num_arguments' [-Wunused-parameter] unsigned num_arguments, ^ builtin_functions.cpp:5290:52: warning: unused parameter 'flags' [-Wunused-parameter] unsigned flags) ^ Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-18 18:35:56 -08:00
Ian Romanick	c82498c4da	glsl: Silence ignored qualifier warning I think the intention was to mark the "this" parameter as const, but const goes on the other end to do that. In file included from glsl_symbol_table.cpp:26:0: ast.h:339:35: warning: type qualifiers ignored on function return type [-Wignored-qualifiers] const bool is_single_dimension() ^ Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2015-11-18 18:35:56 -08:00
Kenneth Graunke	fc19a0d2e4	i965: Allow indirect GS input indexing in the scalar backend. This allows arbitrary non-constant indices on GS input arrays, both for the vertex index, and any array offsets beyond that. All indirects are handled via the pull model. We could potentially handle indirect addressing of pushed data as well, but it would add additional code complexity, and we usually have to pull inputs anyway due to the sheer volume of input data. Plus, marking pushed inputs as live due to indirect addressing could exacerbate register pressure problems pretty badly. We'd need to be careful. v2: Use updated MOV_INDIRECT opcode. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-11-18 15:42:36 -08:00
Jimmy Berry	09d610796c	gallium/hud: document GALLIUM_HUD_PERIOD in envvars.html. Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2015-11-19 00:02:34 +01:00
Jimmy Berry	56a1c10bb8	gallium/hud: control visibility at startup and runtime. - env GALLIUM_HUD_VISIBLE: control default visibility - env GALLIUM_HUD_SIGNAL_TOGGLE: toggle visibility via signal Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2015-11-19 00:02:33 +01:00
Jason Ekstrand	fa8db0dfcc	anv: Put all of the descriptor set stuff together in one file The stuff to take descriptor sets and turn them into binding tables and sampler tables is still in anv_cmd_buffer.c. We may want to consider putting it in anv_descriptor_set.c eventually.	2015-11-18 14:58:43 -08:00
Jason Ekstrand	828b1a6eb6	anv/device: Update the right sampler in UpdateDescriptorSets	2015-11-18 14:48:28 -08:00
Jason Ekstrand	0bee3acc2a	i965/nir: Add hooks for testing nir_shader_clone This commit adds code for testing nir_shader_clone by running it after each and every optimization pass and throwing away the old shader. Testing nir_shader_clone is hidden behind a new INTEL_CLONE_NIR environment variable. Reviewed-by: Rob Clark <robclark@freedesktop.org> Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2015-11-18 12:28:55 -08:00
Jason Ekstrand	9fbd390dd4	nir: Add support for cloning shaders This commit is heavily based on one by Rob Clark <robdclark@gmail.com> but reworked to re-use nir_create functions and do less hashing. Signed-off-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 12:28:32 -08:00
Kenneth Graunke	9ff71b649b	i965/nir: Validate that NIR passes call nir_metadata_preserve(). Failing to call nir_metadata_preserve() can have nasty consequences: some pass breaks dominance information, but leaves it marked as valid, causing some subsequent pass to go haywire and probably crash. This pass adds a simple validation mechanism to ensure passes handle this properly. We add a new bogus metadata flag that isn't used for anything in particular, set it before each pass, and ensure it isn't still set after the pass. nir_metadata_preserve will reset the flag, so correct passes will work, and bad passes will assert fail. (I would have made these functions static inline, but nir.h is included in C++, so we can't bit-or enums without lots of casting...) Thanks to Dylan Baker for the idea. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2015-11-18 12:28:32 -08:00
Kenneth Graunke	7bc0978999	i965/nir: Add OPT() and OPT_V() macros for invoking NIR passes. OPT() is the normal macro for passes that return booleans, while OPT_V() is a variant that works for passes that don't properly report progress. (Such passes should be fixed to return a boolean, eventually.) These macros take care of calling nir_validate_shader() and setting progress appropriately. In the future, it would be easy to add shader dumping similar to INTEL_DEBUG=optimizer by extending the macro. v2 (Jason Ekstrand): - Fix an unused variable warning Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2015-11-18 12:28:32 -08:00
Rob Clark	d27ae2cf8c	nir: add array length field This will simplify things somewhat in clone. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2015-11-18 12:28:32 -08:00
Rob Clark	624ec66653	nir: remove nir_variable::max_ifc_array_access No users. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2015-11-18 12:28:32 -08:00
Jason Ekstrand	6f613abc2b	anv/cmd_buffer: Add a new genX_cmd_buffer file for shared code This file contains code that can be shared across gens modulo recompiling. In particular, we can share STATE_BASE_ADDRESS setup and handling of the vkPipelineBarrier call. Not sharing STATE_BASE_ADDRESS setup has already been a source of bugs and the gen7 and gen8 implementations of PipelineBarrier were line-for-line identical. Incidentally, this should fix MOCS settings for dynamic and surface state on Haswell.	2015-11-18 12:26:57 -08:00
Jason Ekstrand	fb8b2f5f9e	anv/gen7: A bunch of depth-stencil fixes There are various bits which move around between Haswell and Ivy Bridge that we weren't taking into account. This also makes us actually set the StencilWriteEnable in a sane way.	2015-11-18 11:43:52 -08:00
Rob Clark	4671c13852	freedreno/a4xx: add fake RGTC support (required for GL3) The a4xx bits corresponding to 'freedreno/a3xx: add fake RGTC support (required for GL3)' TODO some more r/e.. maybe we get lucky and hw supports some of this directly? For now this will help us enable gl3. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 14:31:13 -05:00
Rob Clark	2379cc9fe0	freedreno/a4xx: add compressed texture formats Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 14:31:13 -05:00
Rob Clark	fadd39442b	freedreno: update generated headers Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 14:31:13 -05:00
Ilia Mirkin	4607b2b9b6	freedreno: expose GLSL 140 and fake MSAA for GL3.0/3.1 support Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 14:31:13 -05:00
Ilia Mirkin	9c409c8df3	freedreno/a3xx: fix texture buffers, enable offsets The main issue is that the current logic looked into cso->u.tex, which is the wrong side of the union to look into for texture buffers. While I was at it, it was easy enough to add the logic to handle offsets (first_element). - reduce texture buffer size limit (determined experimentally) - don't look at first/last levels, instead look at first/last element - include the first element offset - set offset alignment to 16 (determined experimentally) Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 14:31:13 -05:00
Ilia Mirkin	d69e557f2a	freedreno: add support for conditional rendering, required for GL3.0 A smarter implementation would make it possible to attach this to emit state for the BY_REGION versions to avoid breaking the tiling. But this is a start. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 14:31:13 -05:00
Ilia Mirkin	059da344ec	freedreno/a3xx: add fake RGTC support (required for GL3) Also throw in LATC while we're at it (same exact format). This could be made more efficient by keeping a shadow compressed texture to use for returning at map time. However... it's not worth it for now... presumably compressed textures are not updated often. Lastly fix up Z32S8 transfers to non-0 layers. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 14:31:13 -05:00
Ilia Mirkin	84d087aea2	freedreno/a3xx: add missing formats to enable ARB_vertex_type_2_10_10_10_rev The previously RE'd formats were from an ES driver implementing OES_vertex_type_10_10_10_2 and thus backwards. A future change could add the 2_10_10_10 support. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 14:31:13 -05:00
Rob Clark	8106fec74c	freedreno/a3xx+a4xx: fix for stk binning pass hang We'd end up in a state where shader uses no inputs, yet num_elements is greater than zero. Triggered by a TF vertex shader which did: gl_Position = vec4(0.0, 0.0, 0.0, 0.0); resulting in a binning pass variant with no inputs. Includes equiv fix in a4xx, even though we don't have binning-pass enabled yet on a4xx. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 14:31:13 -05:00
Rob Clark	b24c9a8aee	freedreno/a3xx+a4xx: fix GL_POINTS lockup w/ GLES point_size_per_vertex is always TRUE for GLES, causing us to configure the hw as if gl_PointSize was written, even if it was not. Which makes for grumpy hw. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 14:31:13 -05:00

... 4 5 6 7 8 ...

75879 Commits All Branches Search

75879 Commits

All Branches