KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Frank Henigman	2c73102dc3	gallivm: One code memory pool with deferred free. Provide a JITMemoryManager derivative which puts all generated code into one memory pool instead of creating a new one each time code is generated. This saves significant memory per shader as the pool size is 512K and a small shader occupies just several K. This memory manager also defers freeing generated code until you tell it to do so, making it possible to destroy the LLVM engine while keeping the code, thus enabling future memory savings. v2: Fix compilation errors with LLVM 3.4 (Jose) Signed-off-by: José Fonseca <jfonseca@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2014-05-14 11:05:00 +01:00
José Fonseca	2ea923cf57	gallivm: Run passes per module, not per function. This is how it is meant to be done nowadays. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2014-05-14 11:05:00 +01:00
José Fonseca	920933e09e	gallivm: Use LLVM global context. I saw that LLVM internally uses its global context for some things, even when we use our own. Given ours is also global, might as well use LLVM's. However, separate contexts can still be enabled with a simple source code modification, for when the need/benefit arises. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2014-05-14 11:05:00 +01:00
José Fonseca	69f0835ff1	gallivm: Stop using module providers. Nowadays LLVMModuleProviderRef is just an alias for LLVMModuleRef, so its use just causes unnecessary confusion. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2014-05-14 11:05:00 +01:00
José Fonseca	9cf67e51b0	gallivm,draw,llvmpipe: Remove support for versions of LLVM prior to 3.1. Older versions haven't been tested and probably don't work anyway. But more importantly, code supporting it is hindering further work. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2014-05-14 11:04:59 +01:00
José Fonseca	ecef2da0b2	configure: Require LLVM 3.1. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2014-05-14 11:04:59 +01:00
José Fonseca	c0ef9a67d3	scons: Require LLVM 3.1 Support for prior versions will be removed in the following change. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2014-05-14 11:04:59 +01:00
Matt Turner	2012599abb	i965: Reformat brw_set_src1 so it can be easily found with grep.	2014-05-13 22:40:01 -07:00
Samuel Iglesias Gonsalvez	e0dc018fd5	i965: fix size assert for gen7 in brw_init_compaction_tables() It should compare with its own size. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Samuel Iglesias Gonsalvez <siglesias@igalia.com>	2014-05-13 22:35:42 -07:00
Iago Toral Quiroga	520dfa4b5c	i965: Relax accumulator dependency scheduling on Gen < 6 Many instructions implicitly update the accumulator on Gen < 6. The instruction scheduling code just calls add_barrier_deps() for each accumulator access on these platforms, but a large class of operations don't actually update the accumulator -- mostly move and logical instructions. Teaching the scheduling code about this would allow more flexibility to schedule instructions. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=77740 Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-05-13 22:33:59 -07:00
Jonathan Gray	0c0bbe77d0	glsl: simplify the M_PIf macros, fixes build on OpenBSD The M_PIf macros used a preprocessor paste to append 'f' to M_PI defines, which works if the values are only numbers but breaks on OpenBSD where M_PI definitions have casts and brackets to meet requirements of a future version of POSIX, http://austingroupbugs.net/view.php?id=801 http://austingroupbugs.net/view.php?id=828 Simplify the M_PI*f macros by using casts directly in the defines as suggested by Kenneth Graunke. Cc: "10.2" <mesa-stable@lists.freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=78665 Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Jonathan Gray <jsg@jsg.id.au>	2014-05-13 22:30:22 -07:00
Carl Worth	a5769ad373	docs: Really add the 10.1.3 release nots this time Commit `a96c3bccf6` intended to add these, but I forgot to add the file.	2014-05-13 17:30:17 -07:00
Rob Clark	f999c13176	freedreno/a3xx: occlusion query support Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-05-13 18:33:19 -04:00
Rob Clark	b8f78e1890	freedreno: add support for hw queries Real GPU queries need some infrastructure to track samples per tile and accumulate the results. But fortunately this can be shared across GPU generation. See: https://github.com/freedreno/freedreno/wiki/Queries#hardware-queries Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-05-13 18:33:19 -04:00
Rob Clark	13a0cf4480	freedreno/query: allow multiple query implementations Split out fd_query into an abstract base class, to allow multiple implementations. The current sw based queries are moved into fd_sw_query. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-05-13 18:33:19 -04:00
Kenneth Graunke	2265bda513	mesa: Dump ARB_vp/fp source and IR when MESA_GLSL=dump. As far as I can tell, Mesa hasn't had a convenient way to dump ARB_vp/fp source until now. Using MESA_GLSL=dump is convenient, since it means you can use a single environment variable to dump a program's shaders, no matter which language they're written in. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-05-13 15:32:16 -07:00
Kenneth Graunke	bd44ac8b5c	i965: Don't _swrast_BlitFramebuffer when doing CopyTexSubImage. The point of copytexsubimage_using_blit_framebuffer is to use a hardware accelerated BlitFramebuffer path. If that fails, we shouldn't do a swrast blit---we should try our CTSI fallback code. This is especially important for i965 and GLES, where we don't even create a swrast context. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=77705 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz> Cc: "10.2" <mesa-stable@lists.freedesktop.org>	2014-05-13 15:32:16 -07:00
Jordan Justen	c51c192891	i965/gen8: Set depth extent field The depth extent field is used to limit the allowed slice range that can be rendered to. With the previous setting, only slice 0 could be rendered. This fixes piglit amd_vertex_shader_layer-layered-depth-texture-render. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2014-05-13 14:26:41 -07:00
Jordan Justen	294ada2fef	i965/gen8 depth: Set depth size based on LOD0 for 3D textures Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2014-05-13 14:25:58 -07:00
Jordan Justen	e6d6ed55ab	i965/gen7 depth: Set depth size based on LOD0 for 3D textures Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2014-05-13 14:25:58 -07:00
Jordan Justen	e47d08adef	i965/gen8 renderbuffer: Set depth size based on LOD0 for 3D textures Fixes piglit's 'gl-3.2-layered-rendering-clear-color-all-types 3d mipmapped' Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2014-05-13 14:25:58 -07:00
Jordan Justen	b875f39e29	i965/gen7 renderbuffer: Set depth size based on LOD0 for 3D textures If blorp is disabled for color clears, then piglit's 'gl-3.2-layered-rendering-clear-color-all-types 3d mipmapped' will fail. Currently, gen8 fails similarly on this test because gen8 does not use blorp. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2014-05-13 14:25:57 -07:00
Rob Clark	521ee86db7	freedreno/a3xx: add point-size Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-05-13 16:54:37 -04:00
Rob Clark	a13a798926	freedreno: update generated headers Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-05-13 16:54:20 -04:00
Bryan Cain	4e974a9cf3	glsl_to_tgsi: remove unnecessary dead code elimination pass With the more advanced dead code elimination pass already being run, eliminate_dead_code was making no difference in instruction count, and had an undesirable O(n^2) runtime. So remove it and rename eliminate_dead_code_advanced to eliminate_dead_code. Reviewed-by: Marek Olšák <marek.olsak at amd.com>	2014-05-13 14:57:55 -05:00
José Fonseca	1646f4d0fb	ralloc: Omit detailed license information about talloc. That information misleads source code auditing tools to think that ralloc itself is released under LGPL v3. Instead, simply state talloc is not licensed under a permissive license. v2: Use wording suggested by Kenneth. Reviewed-by: Brian Paul <brianp@vmware.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2014-05-13 12:48:38 +01:00
Iago Toral Quiroga	5421617325	i965: Avoid redundant call to brw_merge_inputs() in brw_try_draw_prims() We always call brw_merge_inputs() right before looping over the primitives but this can be called inside the loop for each primitive too. In the case we do it for the first primitive the call is redundant and can be skipped. Reviewed-by: Eric Anholt <eric@anholt.net>	2014-05-13 10:09:35 +02:00
Iago Toral Quiroga	a143fbb322	glsl: Do not call lhs->variable_referenced() multiple times Instead take the result from the first call and use it where needed. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-05-13 10:01:02 +02:00
Topi Pohjolainen	2a549c43a8	meta: Refactor state save/restore for framebuffer texture blits Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-05-13 10:04:25 +03:00
Kristian Høgsberg	06842d436e	wayland: Move version 2 request to end of interface specification We're moving towards requiring interface additions to be appended to the end of the interface block. No functional change, opcodes are assigned as before, but version 2 additions are now grouped together, which prevents a scanner warning. Cc: "10.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Kristian Høgsberg <krh@bitplanet.net>	2014-05-12 15:55:21 -07:00
Timothy Arceri	9c9dd8ca93	glsl: the number of samplers is already calculated so use it Signed-off-by: Timothy Arceri <t_arceri@yahoo.com.au> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-05-13 07:40:08 +10:00
Eric Anholt	afe3d1556f	i965: Stop doing remapping of "special" regs. Now that we aren't using pixel_[xy] in live variables, nothing is looking at these regs after the visitor stage. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-05-12 09:50:32 -07:00
Eric Anholt	66f5c8df06	i965: Generalize the pixel_x/y workaround for all UW types. This is the only case where a fs_reg in brw_fs_visitor is used during optimization/code generation, and it meant that optimizations had to be careful to not move pixel_x/y's register number without updating it. Additionally, it turns out we had a couple of other UW values that weren't getting this treatment (like gl_SampleID), so this more general fix is probably a good idea (though I wasn't able to replicate problems with either pixel_[xy]'s values or gl_SampleID, even when telling the register allocator to reuse registers immediately) Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-05-12 09:49:27 -07:00
Eric Anholt	11bef60d09	i965: Move has_hiz from the slice to the level. The value depends only on the level, so no need to store the bool per slice. Shrinks intel_mipmap_slice from 24 bytes to 16, while slotting into an existing hole in intel_mipmap_level. Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2014-05-12 09:49:18 -07:00
Topi Pohjolainen	4dc9c314c8	meta: Refactor configuration of renderbuffer sampling Cc: "10.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-05-12 17:48:45 +03:00
Topi Pohjolainen	a2952315ac	meta: Refactor binding of renderbuffer as texture image Cc: "10.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-05-12 17:48:45 +03:00
Topi Pohjolainen	ac4db0aa55	meta: Merge compiling and linking of blit program Cc: "10.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-05-12 17:48:45 +03:00
Topi Pohjolainen	3a43cd0c3e	i965/blorp: Expose coordinate scissoring and mirroring Cc: "10.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-05-12 17:48:45 +03:00
Topi Pohjolainen	4a92ad5531	i965/gen8: Use helper variables for surface parameters Cc: "10.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-05-12 17:48:45 +03:00
Ilia Mirkin	8baed87212	nv50,nvc0: fix blit 3d path for 1d array textures Need to adjust coordinates since the shader receives the array index as depth in z, but the TEX instruction expects it to be the second coordinate for a 1D array texture. This fixes fbo-generatemipmap-array. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Ben Skeggs <bskeggs@redhat.com> Cc: "10.2" <mesa-stable@lists.freedesktop.org>	2014-05-11 19:26:31 -04:00
Ilia Mirkin	4467c0c9fb	nv50,nvc0: leave queries on during blit, turn them on for 2d engine Fixes the new logic of the conditional rendering piglit test. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Ben Skeggs <bskeggs@redhat.com> Cc: "10.2" <mesa-stable@lists.freedesktop.org>	2014-05-11 19:26:31 -04:00
Ilia Mirkin	64a7ddf40d	mesa/st: leave current query enabled during glBlitFramebuffer Also make sure that pipe_blit_info gets zero'd out so that query isn't accidentally left enabled. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.2" <mesa-stable@lists.freedesktop.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-05-11 19:26:31 -04:00
Ilia Mirkin	752ce0affb	gallium: add bit to pipe_blit_info to leave current query enabled Previously the implication was that queries should be disabled during blits. However glBlitFramebuffer() is supposed to obey the current query, and this new bit will indicate that to the driver. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.2" <mesa-stable@lists.freedesktop.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-05-11 19:26:31 -04:00
Ilia Mirkin	863573b9cb	nv50: fix setting of texture ms info to be per-stage Different textures may be bound to each slot for each stage. So we need to be able to upload ms parameters for each one without stages overwriting each other. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Ben Skeggs <bskeggs@redhat.com> Cc: "10.1 10.2" <mesa-stable@lists.freedesktop.org>	2014-05-11 19:26:31 -04:00
Ilia Mirkin	68f47cad0d	nv50/ir: make sure to reverse cond codes on all the OP_SET variants Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Ben Skeggs <bskeggs@redhat.com> Cc: "10.2 10.1" <mesa-stable@lists.freedesktop.org>	2014-05-11 19:26:31 -04:00
Rob Clark	83b4ec03e7	freedreno/a2xx: fix compiler warning Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-05-11 08:58:20 -04:00
Marek Olšák	d9e102b220	radeonsi: prepare depth export registers at compile time Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-05-10 13:58:46 +02:00
Marek Olšák	9baaa5dd4f	radeonsi: simplify depth/stencil export code Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-05-10 13:58:46 +02:00
Marek Olšák	bd2df40a84	radeon/llvm: add support for non-scalar system values The sample position is one of them. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-05-10 13:58:46 +02:00
Marek Olšák	250aa93e23	radeonsi: add and use a helper function for loading constants Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-05-10 13:58:46 +02:00


62902 Commits
