KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Kenneth Graunke	eee8a53906	meta: Make BlitFramebuffer() do sRGB encoding in ES 3.x. According to the ES 3.0 and GL 4.4 specifications, glBlitFramebuffer is supposed to perform sRGB decoding and encoding whenever sRGB formats are in use. The ES 3.0 specification is completely clear, and has always stated this. However, the GL specification has changed behavior in 4.1, 4.2, and 4.4. The original behavior stated that no sRGB encoding should occur. The 4.4 behavior matches ES 3.0's wording. However, implementing the new behavior appears to break applications such as Left 4 Dead 2. This patch changes Meta to apply the ES 3.x rules in ES 3.x, but leaves OpenGL alone for now, to avoid breaking applications. Meta implements several other functions in terms of BlitFramebuffer, and many of those explicitly do not perform sRGB encoding. So, this patch explicitly disables sRGB encoding in those other functions, preserving the existing (correct) behavior. If you're from the future and are reading this, hi! Welcome to the "fun" of debugging sRGB problems! Best of luck! Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-03-21 13:53:44 -07:00
Nicolai Hähnle	b74784638d	docs: mark GL_ARB_shader_image_load_store/_size as done for radeonsi Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-21 15:34:26 -05:00
Edward O'Callaghan	5219eb15e1	radeonsi: Set PIPE_SHADER_CAP_MAX_SHADER_IMAGES This enables ARB_shader_image_load_store and ARB_shader_image_size. Signed-off-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> [allow the same number of images for all shader stages and require LLVM 3.9] Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-21 15:34:26 -05:00
Nicolai Hähnle	6f942ac5ee	radeonsi: disable early Z if the fragment shader writes to memory Empirically, both the EXEC_ON_* flags and LATE_Z are necessary. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-21 15:34:25 -05:00
Nicolai Hähnle	79762e877c	tgsi/scan: add writes_memory to flag presence of stores or atomics Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-21 15:34:25 -05:00
Nicolai Hähnle	e9d935ed0e	radeonsi: force the DCC enable bit off in image descriptors for writing (v2) This avoids a lockup at least on Tonga. v2: only force DCC off on VI+ (Marek) Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-21 15:34:25 -05:00
Nicolai Hähnle	43f5ce1d20	radeonsi: implement MemoryBarrier (v2) v2: invalidate both constant and VMEM/TC L1 for constant buffers (Marek) Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-21 15:34:25 -05:00
Nicolai Hähnle	97352aa50a	radeonsi: implement volatile memory access Prevent loads from being re-ordered or coalesced. Atomics don't need special handling by definition, and stores don't need special handling because LLVM is unable to detect dead image or buffer stores. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-21 15:34:25 -05:00
Nicolai Hähnle	5a61b428f4	radeonsi: implement coherent memory access (v2) v2: set glc=1 for volatile also on buffers Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-21 15:34:25 -05:00
Nicolai Hähnle	d6fa650454	radeonsi: Lower TGSI_OPCODE_MEMBAR down to LLVM op Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-21 15:34:25 -05:00
Nicolai Hähnle	f7a85a8a0a	radeonsi: Lower TGSI_OPCODE_ATOM* down to LLVM op Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-21 15:34:24 -05:00
Nicolai Hähnle	bfcefcb3c7	radeonsi: Lower TGSI_OPCODE_STORE down to LLVM op Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-21 15:34:24 -05:00
Nicolai Hähnle	1e82dedeca	radeonsi: Lower TGSI_OPCODE_LOAD down to LLVM op (v3) v2: new signature style for buffer intrinsics (offsets) v3: new signature style for llvm.amdgcn.buffer.load.format (overloaded return) Reviewed-by: Marek Olšák <marek.olsak@amd.com> (v2)	2016-03-21 15:34:24 -05:00
Nicolai Hähnle	136686a51d	radeonsi: extract the LLVM type name construction into its own function Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-21 15:34:23 -05:00
Nicolai Hähnle	02bd0cd7b1	radeonsi: Lower TGSI_OPCODE_RESQ down to LLVM op Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-21 15:34:23 -05:00
Nicolai Hähnle	75539197c7	radeonsi: extract TXQ buffer size computation into its own function This will allow it to be reused for RESQ. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-21 15:34:23 -05:00
Nicolai Hähnle	515fb2c09c	radeonsi: decompress shader images Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-21 15:34:23 -05:00
Nicolai Hähnle	f61566b77a	radeonsi: update shader image descriptor for invalidated buffer Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-21 15:34:23 -05:00
Nicolai Hähnle	e85cf35a65	radeonsi: implement set_shader_images (v2) Whether DCC is disabled depends on the access flags with which the image is bound: image_load supports DCC, but store and atomic don't. v2: remove an unnecessary masking of images->desc.enabled_mask Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-21 15:34:23 -05:00
Nicolai Hähnle	b1b7268f01	gallium/radeon: make r600_texture_disable_dcc externally accessible We will need it in radeonsi for shader images. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-21 15:34:22 -05:00
Nicolai Hähnle	457f9c6b25	tgsi/scan: track which shader images are really buffers Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-21 15:34:22 -05:00
Nicolai Hähnle	fa096a14af	tgsi/scan: add images_writemask Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-21 15:34:22 -05:00
Nicolai Hähnle	1379544081	st/mesa: translate additional flags in MemoryBarrier Re-order flags in the order in which they appear in the OpenGL spec in the description of MemoryBarrier(). Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-21 15:34:22 -05:00
Nicolai Hähnle	96cd908fd3	gallium: add additional PIPE_BARRIER_* bits Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-21 15:34:22 -05:00
Brian Paul	86caa67aef	svga: add svga_winsys_context::pipe_debug_callback pointer The svga winsys modules can use this to send debug messages to the state tracker and Mesa. Reviewed-by: Charmaine Lee <charmainel@vmware.com> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2016-03-21 13:37:40 -06:00
Charmaine Lee	f8aaf0094d	svga: Fix the index buffer rebind regression The index buffer handle saved in the hw_state structure could be invalid after the index buffer is destroyed. Instead of rebinding the index buffer using the saved index buffer handle, we will reset the index buffer handle in the hw_state structure to force resending of the index buffer. Fixes bug 1593320 Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2016-03-21 13:37:40 -06:00
Charmaine Lee	47856e5945	svga: rebind stream output targets To ensure stream output target surfaces are available for the draw commands, we need to rebind the current stream output targets at the first draw in the command buffer. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2016-03-21 13:37:40 -06:00
Charmaine Lee	47cfc83440	svga: rebind index buffer Similar to other resources, current index buffer needs to be rebound at the first draw of the current command buffer to make sure the buffer is available for the draw command. Fixes bug 1587263. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2016-03-21 13:37:40 -06:00
Brian Paul	299f8ca0a7	svga: minor formatting fix, comment addition To sync with our internal tree. Signed-off-by: Brian Paul <brianp@vmware.com>	2016-03-21 13:37:25 -06:00
Charmaine Lee	b45b47c5c9	svga: optimize constant buffer uploads When a constant buffer slot is allocated in the upload buffer, the allocated slot size is always in multiple of 256. But the actual buffer size might not be in multiple of 256. This causes a gap between the ending offset of a slot and the starting offset of the next slot. The gap will prevent the two slots to be updated in a single update command. In order to maximize the chance of merging the contiguous dirty ranges, when a slot is to be allocated in the constant upload buffer, specify a buffer size in multiple of 256. There is about 10% performance improvement with Lightsmark2008 and 30% with Cinebench R11. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com>	2016-03-21 12:58:25 -06:00
Charmaine Lee	0a1d91ef97	svga: add a few more resource updates HUD query This patch adds the following HUD queries: .num-resource-updates -- number of resource update. Commands include UPDATE_SUBRESOURCE, UPDATE_GB_IMAGE. .num-buffer-uploads -- number of buffer uploads. .num-const-buf-updates -- number of set constant buffer. .num-const-updates -- number of set shader constant. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com>	2016-03-21 12:58:25 -06:00
Charmaine Lee	79e343b36a	svga: add new num-readbacks HUD query To find out how many image readback command is issued. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com>	2016-03-21 12:58:25 -06:00
Brian Paul	dc9ecf58c0	svga: use shader sampler view declarations Previously, we looked at the bound textures (via the pipe_sampler_views) to determine texture dimensions (1D/2D/3D/etc) and datatype (float vs. int). But this could fail in out of memory conditions. If we failed to allocate a texture and didn't create a pipe_sampler_view, we'd default to using 0 (PIPE_BUFFER) as the texture type. This led to device errors because of inconsistent shader code. This change relies on all TGSI shaders having an SVIEW declaration for each SAMP declaration. The previous patch series does that. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-03-21 11:59:25 -06:00
Brian Paul	b56b853ab3	gallium/tests: declare sampler views in shaders Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-03-21 11:59:25 -06:00
Brian Paul	38e831ca3d	gallium/util: declare sampler view in util_make_fs_blit_msaa_depthstencil() Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-03-21 11:59:25 -06:00
Brian Paul	e7b5a844e3	postprocess: declare sampler views in shaders Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-03-21 11:59:25 -06:00
Brian Paul	5a9f2a2d89	hud: add sampler view declaration in text fragment shader Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-03-21 11:59:25 -06:00
Brian Paul	b3daaefadb	st/mesa: emit sampler view decls in drawpixels code v2: support both TGSI_TEXTURE_2D and _RECT Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-03-21 11:59:25 -06:00
Brian Paul	0f0a23d4d8	st/mesa: emit sampler view declaration in bitmap shader In June 2015, Rob Clark started updating the tgsi utility code to emit SVIEW declarations in various shaders (for polygon stipple, blitting, etc). These patches do the same for the Mesa state tracker. The VMware driver will use this. v2: support both TGSI_TEXTURE_2D and _RECT Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-03-21 11:59:25 -06:00
Brian Paul	72eb5a3cfe	st/mesa: emit sampler view declarations for ARB vert/frag programs Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-03-21 11:59:25 -06:00
Brian Paul	eda81fa357	st/mesa: use correct TGSI texture target in drawpix fragment shader Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-03-21 11:59:25 -06:00
Brian Paul	83b5b3d66e	st/mesa: use correct TGSI texture target in bitmap fragment shader Depending on the driver's support for NPOT textures, we might use a RECT texture instead of 2D texture. We should propogate that info to the fragment shader's TEX instruction. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-03-21 11:59:25 -06:00
Brian Paul	63e020d734	gallium/tgsi: pass TGSI tex target to tgsi_transform_tex_inst() Instead of hard-coded 2D tex target in tgsi_transform_tex_2d_inst() Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-03-21 11:59:25 -06:00
Nicolai Hähnle	a8b315b827	st/mesa: use the texture view's format for render-to-texture Aside from the bug below, it fixes a simplistic test I've written locally, and I see no regression in Piglit for radeonsi. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=94595 Cc: "11.0 11.1 11.2" <mesa-stable@lists.freedesktop.org> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-03-21 11:28:38 -05:00
Hans de Goede	dcf8a4d281	gallium: Remove unused TGSI_RESOURCE_ defines These magic file-index defines where only ever used in the nouveau code and that no longer uses them. Signed-off-by: Hans de Goede <hdegoede@redhat.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (v2) Reviewed-by: Marek Olšák <marek.olsak@amd.com> (v2)	2016-03-21 12:20:58 +01:00
Hans de Goede	9b4c8f6629	nouveau: codegen: Do not silently fail in handeLOAD / handleSTORE / handleATOM handeLOAD / handleSTORE / handleATOM can only handle TGSI_FILE_BUFFER and TGSI_FILE_MEMORY. Make things fail explictly when another register-file is used in these functions. Signed-off-by: Hans de Goede <hdegoede@redhat.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (v2)	2016-03-21 12:20:48 +01:00
Hans de Goede	86e4440361	nouveau: codegen: Disable more old resource handling code Commit `c3083c7082` ("nv50/ir: add support for BUFFER accesses") disabled / commented out some of the old resource handling code, but not all of it. Effectively all of it is dead already, if we ever enter the old code paths in handeLOAD / handleSTORE / handleATOM we will get an exception due to trying to access the now always zero-sized resources vector. Disable all the dead code. Signed-off-by: Hans de Goede <hdegoede@redhat.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (v2)	2016-03-21 12:20:40 +01:00
Hans de Goede	71e315475c	nouveau: codegen: gk110: Make emitSTORE offset handling identical to emitLOAD Make the store offset handling in CodeEmitterGK110::emitSTORE identical to the one in CodeEmitterGK110::emitLOAD handling. This is just a cleanup, it does not cause any functional changes. Signed-off-by: Hans de Goede <hdegoede@redhat.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2016-03-21 12:20:38 +01:00
Hans de Goede	c783ad0e24	nouveau: codegen: Slightly refactor Source::scanInstruction() dst handling Use the dst temp variable which was used in the TGSI_FILE_OUTPUT case everywhere. This makes the code somewhat easier to reads and helps avoiding going over 80 chars with upcoming changes. This also brings the dst handling more in line with the src handling. Signed-off-by: Hans de Goede <hdegoede@redhat.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2016-03-21 12:20:32 +01:00
Hans de Goede	54cdde5eff	nouveau: codegen: Add support for clover / OpenCL kernel input parameters Add support for clover / OpenCL kernel input parameters. Signed-off-by: Hans de Goede <hdegoede@redhat.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> (v1) Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (v2)	2016-03-21 12:20:28 +01:00

1 2 3 4 5 ...

77155 Commits All Branches Search

77155 Commits

All Branches