KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Chris Forbes	ca1d1b2fc1	i965: Don't try to use fast upload path for nontrivial views This will eventually be relaxed, but we'll get the fallback path working first. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net>	2014-04-10 18:27:41 +12:00
Chris Forbes	c9c08867ed	i965: Adjust surface_state emission to account for view parameters V4: Comment style, remove magic shift. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net>	2014-04-10 18:27:41 +12:00
Chris Forbes	771c2ae0af	i965: Add _Format to intel_texobj. This is the actual mesa_format to use. In non-view cases this is always the same as the mt's format. V4: Comment style Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net>	2014-04-10 18:27:41 +12:00
Chris Forbes	b7f011fdc9	i965: Add driver hook for TextureView We need to wire the original texture's mt into the view. All the hard work of setting up an appropriate tree of gl_texture_image structures has already been done by core mesa. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net>	2014-04-10 18:27:41 +12:00
Chris Forbes	93fa16bdd1	i965: Ensure that texture validation is skipped for immutable textures. If we were to relayout the miptree, we'd break any views that are sharing it. (Simplified based on suggestions from Eric) Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net>	2014-04-10 18:27:41 +12:00
Chris Forbes	a98b675945	i965: refactor format selection for unsupported ETC* formats We will need to call this to munge view formats. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net>	2014-04-10 18:27:40 +12:00
Chris Forbes	14c116433d	i965: refactor format munging for separate stencil We will need this for munging the view's format. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net>	2014-04-10 18:27:40 +12:00
Chris Forbes	215c9432b9	i965: Include #slices in miptree debug Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net>	2014-04-10 18:27:40 +12:00
Chris Forbes	c1b017472b	mesa: Adjust _MaxLevel computation to account for views Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net>	2014-04-10 18:27:40 +12:00
Chris Forbes	61e264f4fc	mesa: Prefer non-swizzled formats for most sized internalformats These formats can be cast to others (with different component types or sizes) via ARB_texture_view or ARB_shader_image_load_store. We want them to be laid out consistently so that we can just reinterpret the memory with a different format. In V1, this was done conditionally on a 'prefer_no_swizzle' flag which was set in TexStorage/TextureView paths, but we need the same behavior for ARB_shader_image_load_store (which also works with images created via TexImage, so we don't want it to be conditional. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net>	2014-04-10 18:27:40 +12:00
Chris Forbes	58790043bb	i965: Render R8G8B8X8 as R8G8B8A8 The sampler can handle R8G8B8X8 (and substitute 1.0 for the fourth component) but we can't use it as a render target. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net>	2014-04-10 18:27:40 +12:00
Chris Forbes	50eed4eed5	i965: Pretend we don't support BRW_SURFACEFORMAT_R16G16B16_FLOAT for textures. None of the other 3-component 16bpc formats are directly supported, so they get promoted to XRGB equivalents. Not promoting RGB16F the same way makes texture views much more fiddly -- we don't want to have to do crazy copying behind the scenes. (with my other master + my experimental ARB_texture_view support) fixes the piglit test: `spec/ARB_texture_view/view compare 48bit formats` No regressions in gpu.tests on Haswell. V4: Don't alter the formats table -- just don't match it to a mesa_format. [Kenneth] Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net>	2014-04-10 18:27:40 +12:00
Chris Forbes	66b0554fa6	i965: Enable R10G10B10A2_UNORM format This is supported by all generations, and is required for memory layout consistency for texture_view. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net>	2014-04-10 18:27:40 +12:00
Chris Forbes	932a1eeac8	i965: Enable R8G8B8A8_UNORM_SRGB format Now this is the preferred format for GL_SRGB8_ALPHA8. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net>	2014-04-10 18:27:40 +12:00
Chris Forbes	6ef7205613	swrast: Add support for fetching from MESA_FORMAT_R10G10B10A2_UNORM V4: Fix rebase conflicts with Brian's renaming of the texfetch functions. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Brian Paul <brianp@vmware.com> Acked-by: Eric Anholt <eric@anholt.net>	2014-04-10 18:27:40 +12:00
Chris Forbes	a421be1dcb	mesa: fix packing of float texels to GL_SHORT/GL_BYTE Previously, we would unpack the texels to floats using _TO_FLOAT_TEX, and then pack them into the desired format using FLOAT_TO_. Unfortunately, this isn't quite the inverse operation, and so some texel values would end up off-by-one. This fixes the GL_RGB8_SNORM and GL_RGB16_SNORM subcases in piglit's arb_texture_view-format-consistency-get test on i965. The similar 1-, 2- and 4-component cases already worked because they took the memcpy path rather than repacking. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Eric Anholt <eric@anholt.net>	2014-04-10 18:27:40 +12:00
Michel Dänzer	ee2bcf38a4	r600g: Don't leak bytecode on shader compile failure Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=74868 Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-04-10 14:00:43 +09:00
Emil Velikov	55f9bbd46c	build: force .so extension for the gallium dri modules While linux uses .so as a default extension for shared libraries that is not the case for other platforms. The loader in libGL (and others) assumes that the dri module will always have a .so extension, thus it will fail to load on the affected platforms. Spotted-by: Jon TURNEY <jon.turney@dronecode.org.uk> Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com>	2014-04-09 22:12:36 +01:00
Jon TURNEY	92d0786f88	Partially revert `bba9c28` "configure: use LIB_EXT rather than hardcoded .so" Filenames passed to dlopen() don't need to use the platform's default extension for shared libraries. Using the '.so' extension when dlopen()ing DRI drivers is hardcoded into mesa and the X server, so it should be hardcoded here in the Makefile as well. A similar fix is probably also needed for gallium DRI drivers. (Consider that if we were starting from scratch, perhaps we would use a custom extension like .dri instead) Cc: Emil Velikov <emil.l.velikov@gmail.com> Signed-off-by: Jon TURNEY <jon.turney@dronecode.org.uk> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>	2014-04-09 22:12:36 +01:00
Emil Velikov	56f531657c	Partially revert "st/xa: Fix advertized version number and try to avoid future discrepancies" This reverts commit `61bedc3d6b`. As the header is the one defining the API/ABI and is distributed during installation, we should be using it rather than re-defining the XA version in configure.ac. Bump the version in the header to 2.2.0, to reflect what was the original intent of commit `42158926c6`. Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com>	2014-04-09 22:12:35 +01:00
Emil Velikov	f9832f960f	glx: drop obsolete _XUnlock_Mutex in __glXInitialize error path With commit 1f1928db001(glx: Drop _Xglobal_lock while we create and initialize glx display) we've split the big _Xglobal_lock handling in a more fine grained manner. Unfortunatelly we forgot to drop the unlock_mutex on the error paths, leading to undefined behaviour as the mutex is already unlocked. Cc: Kristian Høgsberg <krh@bitplanet.net> Cc: "9.2 10.0 10.1" <mesa-stable@lists.freedesktop.org> Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-04-09 22:12:35 +01:00
Rob Clark	6afd7be132	freedreno/a3xx: assert() -> debug_assert() We hit this assert with some piglit tests. Which appears to be a bug outside of freedreno. Previously we were relying on assert() being redefined to debug_assert() so that we didn't crash in release builds. Somehow that stopped working. So just use debug_assert() directly. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-04-09 16:37:04 -04:00
Brian Paul	e853ade544	svga: move LIST_INITHEAD(dirty_buffers) earlier in svga_context_create() Fixes a crash in svga_context_flush_buffers() if we use the 'draw' module for AA lines (when the device doesn't support that feature). We need to initialize this list before we setup the swtnl pieces. Found/fixed by Charmaine Lee. Cc: "10.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com> Reviewed-by: Jakob Bornecrantz <jakob@vmware.com>	2014-04-09 12:02:03 -06:00
Kenneth Graunke	26ae030fcc	i965: Stop advertising GL_MESA_ycbcr_texture. The "new" fragment shader backend has never supported the necessary color conversion code for this to work. We began using the new backend in Mesa 7.10 for GLSL (commit `a81d423d93`, October 2010), and for ARB_fragment_program in Mesa 9.1 (commit `97615b2d8c`, August 2012). I haven't heard any complaints, so I don't think anyone will miss this feature. I believe mplayer used it at one point, but these days defaults to other paths anyway. Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <idr@freedesktop.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-04-09 08:28:25 -07:00
Rob Clark	4a92c12232	freedreno/a3xx/compiler: add CEIL fixes piglit glsl-fs-ceil Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-04-09 10:59:18 -04:00
Rob Clark	9604e31dc9	freedreno/a3xx/compiler: fix neg mov's create_mov() was fixed up to handle neg/abs properly for interal mov's, using absneg.f, but forgot to fix it for TGSI MOV's. The problem with using add.f to handle negated mov's is that we can only take a single const reg src. So: MOV TEMP[n], -CONST[m] would turn into: add.f Rdst, (neg)CONST[m], 0.0 which would not work. Anyways, just remove the extra code and always use create_mov() which DTRT. This fixes piglit vs-op-neg-int test. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-04-09 10:59:18 -04:00
Marek Olšák	4d641803e8	radeonsi: allow fast color clear and Hyper-Z with 1D-tiled surfaces on CIK This depends on my kernel fix. Hyper-Z is still disabled by default.	2014-04-09 01:45:16 +02:00
Marek Olšák	fb5cf3490e	r600g,radeonsi: add a bunch of useful queries for the HUD	2014-04-09 01:45:16 +02:00
Marek Olšák	4a5519f1e0	r600g,radeonsi: set correct initial domain for shared resources	2014-04-09 01:45:16 +02:00
Marek Olšák	5f7faff61b	gallium/radeon: fix warnings	2014-04-09 01:45:16 +02:00
Iago Toral Quiroga	1a92637c68	tnl: Merge _tnl_vbo_draw_prims() into _tnl_draw_prims(). This should help prevent situations where we render without proper index bounds. For example: https://bugs.freedesktop.org/show_bug.cgi?id=59455 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2014-04-08 15:10:10 -07:00
Topi Pohjolainen	2ffb50d77b	i965: Remove unused sampler key fields Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-08 13:34:59 -07:00
Brian Paul	6f059725fa	mesa: move declaration before code in etc2_unpack_rgb8() To fix MSVC build since `cb4ad13685`.	2014-04-08 14:17:40 -06:00
Kenneth Graunke	ec1baea95a	i965: Delete "fast color clear unsupported" performance warning. Applications frequently clear to colors other than 0.0 or 1.0, which prevents us from doing fast color clears. In that case, we issue this performance warning on basically every glClear call, resulting in so much spam that it's nearly impossible to see any other messages. Plus, I don't think it's useful. We aren't suggesting a better way to do what the application developers want---we're just telling them it would be faster to do something they don't want. Driver developers have no control over the clear color, so this message is totally useless to them. A better alternative to get this sort of information is to use INTEL_DEBUG=blorp, which tells you whether color clears were fast, simd16 repdata, or slow. v2: Rebase on has_color_component changes. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-04-08 13:09:46 -07:00
Rob Clark	ee839cc6ef	freedreno/a3xx: deal with optimized tex instructions Keep track of whether we actually have any sam instructions in the resulting shader, rather than using TGSI SAMP declarations. If the sam instruction is optimized out, because the result is not used, we don't want to emit texture state, etc. In fact emitting sampler state and/or setting PIXLODENABLE bit when there are no texture fetches seems to cause lockup. In theory this should never happen for a "normal" shader, unless the state tracker is wonky. But it is a very real possibility for binning pass shaders. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-04-08 16:06:49 -04:00
Courtney Goeltzenleuchter	cb4ad13685	mesa: add bounds checking to eliminate buffer overrun Decompressing ETC2 textures was causing intermitent segfault by copying resulting 4x4 texel block to the destination texture regardless of the size of the destination texture. Issue found via application crash in GLBenchmark 3.0's Manhattan test. v2: add more detail comment. Compute limit outside inner loops. v3: add bugzilla reference v4: Correct cc syntax in commit log v5: really grab the right patch Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=74988 Cc: "9.2 10.0 10.1" <mesa-stable@lists.freedesktop.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> [v1, suggested v2-3]	2014-04-08 12:55:25 -07:00
Leo Liu	a22d944fdb	st/omx/enc: cleanup omx/vid_enc.c cleanup by moving each step into a separate function Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2014-04-08 17:51:19 +02:00
Christian König	5f374826f8	st/omx/enc: allocate input buffer private on demand v2: move allocation to a function as first step to clean vid_enc_EncodeFrame Signed-off-by: Christian König <christian.koenig@amd.com> Signed-off-by: Leo Liu <leo.liu@amd.com>	2014-04-08 17:51:15 +02:00
Brian Paul	9bb2ec6fd1	svga: replace sampler assertion with conditional For TEX instructions, the set of samplers and sampler views should be consistent. The XA state tracker sometimes passes an inconsistent set of samplers and sampler views. Rather than assert and die, issue a warning. v2: add debugging code to detect inconsistent state. v3: also check for null sampler in svga_state_tss.c Cc: "10.0" "10.1" <mesa-stable@lists.freedesktop.org> Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com>	2014-04-08 08:45:18 -06:00
Chia-I Wu	4ddf51db6a	i965/vec4: fix record clearing in copy propagation Given mov vgrf7, vgrf9.xyxz add vgrf9.xyz, vgrf4.xyzw, vgrf5.xyzw add vgrf10.x, vgrf6.xyzw, vgrf7.wwww the last instruction would be wrongly changed to add vgrf10.x, vgrf6.xyzw, vgrf9.zzzz during copy propagation. The issue is that when deciding if a record should be cleared, the old code checked for inst->dst.writemask & (1 << ch) instead of inst->dst.writemask & (1 << BRW_GET_SWZ(src->swizzle, ch)) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=76749 Signed-off-by: Chia-I Wu <olv@lunarg.com> Cc: Jordan Justen <jljusten@gmail.com> Cc: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romainck <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Cc: "10.1" <mesa-stable@freedesktop.org>	2014-04-08 21:04:22 +08:00
Eric Anholt	57d6e7b7ee	i965/vec4: Add a test for copy propagation behavior. I thought I was seeing a bug in the code while reviewing, but it's not there. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-04-08 00:59:48 -07:00
Eric Anholt	6230b646a5	i965/fs: Track whether we're doing dual source in a more obvious way. I'm going to be turning dual_src_output into an array in a moment. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-08 00:59:48 -07:00
Eric Anholt	14b85e3a47	i965/fs: Add a couple more global special regs to special[] Nothing bad came of this because they weren't used after visitor running, but leaving them in a bad state seems like a recipe for pain later. Suggested-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-08 00:59:48 -07:00
Eric Anholt	4303d26f93	i965/fs: Handle arrays of special regs more cleanly. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-08 00:59:48 -07:00
Eric Anholt	72b845e640	i965/fs: Fix dump_instructions() on uniforms. All of a vec4 uniform was being printed as "u0" Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-08 00:59:48 -07:00
Eric Anholt	caa2605db5	i965/fs: Fix vgrf0 live interval when no interpolation was done. When you've got a simple solid-color shader that doesn't generate pixel_x/y interpolation, we were deciding that the first vgrf was both the undefined pixel_x and pixel_y, and extending its live interval to avoid the stride problem. That tricked other optimization that tries to see if a particular instruction is the last use of a variable. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-08 00:59:48 -07:00
Eric Anholt	cf40ebacb1	i965: Drop pointless check for variable declarations in splitting. We're walking the whole instruction stream, so we know the declaration will be found. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-08 00:59:48 -07:00
Eric Anholt	66b15ad9db	i965: Remove stale comment. We stopped doing variable index lowering for uniforms in `a64c1eb9b1`, 5 months after the comment was added. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-08 00:59:48 -07:00
Eric Anholt	8c2bfbc6b9	glsl: Move tree grafting's debug output to stderr. The rest of our compiler dumps are there, now. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-08 00:59:48 -07:00
Eric Anholt	e9822f77a9	glsl: Skip making a temporary for assignments when we don't need one. While we wish our optimization passes could identify all the cases where we can coalesce our variables, we miss out on a lot of opportunities. total instructions in shared programs: 1673849 -> 1673166 (-0.04%) instructions in affected programs: 299521 -> 298838 (-0.23%) GAINED: 7 LOST: 0 Note that many programs are "hurt". The notable ones are where we produce unrolling in cases we didn't before (presumably just because of the lower instruction count). But there are also some cases where pushing things right into the variables prevents copy propagation and tree grafting, since we don't split our variable usage webs apart. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2014-04-08 00:59:47 -07:00

... 2 3 4 5 6 ...

62451 Commits All Branches Search

62451 Commits

All Branches