KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Chia-I Wu	98bc4c62a6	ilo: add pipe-based copy method to ilo_blitter It enables accelerated resource_copy_region() when blt-based method fails.	2013-06-17 18:28:58 +08:00
Chia-I Wu	ebfd7a61c0	ilo: add BLT-based blitting methods to ilo_blitter Port BLT code in ilo_blit.c to BLT-based blitting methods of ilo_blitter. Add BLT-based clears. The latter is verifed with util_clear(), but it is not in use yet.	2013-06-17 16:36:53 +08:00
Chia-I Wu	b4b3a5c6dc	ilo: replace util_blitter by ilo_blitter ilo_blitter is just a wrapper for util_blitter for now. We will port BLT code to ilo_blitter shortly.	2013-06-17 14:37:10 +08:00
Kenneth Graunke	6d7abafdc8	i965: Assume flexible hardware primitive restart exists in the future. Primitive restart with an arbitrary cut index was first supported as of Haswell. It's very doubtful that they'd take that away in future hardware, so we may as well alter the check now.	2013-06-14 22:58:18 -07:00
Chris Forbes	def84d8014	i965: Shrink Gen5 VUE map layout to be the same as Gen4. The PRM suggests a larger layout, mostly to support having gl_ClipDistance[] somewhere predictable for the fixed-function clipper -- but it didn't actually arrive in Gen5. Just use the same layout for both Gen4 and Gen5. No Piglit regressions. Improves performance in CS:S Video Stress Test by ~3%. V2: - Remove now-useless function for determining the SF URB read offset - Remove now-unused BRW_VARYING_SLOT_POS_DUPLICATE Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Paul Berry <stereotype441@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-06-16 01:05:41 +12:00
Kenneth Graunke	1b77d2133c	i965: Implement 16-wide math on G45 and Ironlake. [chrisf:] Improves performance in CS:S video stress test by about 2%. No piglit regressions on Ironlake. Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2013-06-16 00:47:50 +12:00
Matt Turner	fcaa48d9cc	glsl: Disallow return with a void argument from void functions. NOTE: This is a candidate for the stable branches. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-06-14 11:25:49 -07:00
Matt Turner	1a1b03e6bc	glsl: Allow implicit conversion of return values. Required by ARB_shading_language_420pack. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-06-14 11:25:49 -07:00
Matt Turner	876e16562b	glsl: Add gl_{Max,Min}ProgramTexelOffset built-in constants. Required by ARB_shading_language_420pack. Note that the 420pack spec incorrectly specifies their values as (Min, Max) = (-7, 8) when they should be (-8, 7) as listed in the GLSL 4.30 and ESSL 3.0 specs. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-06-14 11:25:49 -07:00
Matt Turner	ed455cdb0b	glsl: Allow swizzles on scalars. Required by ARB_shading_language_420pack. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-06-14 11:25:49 -07:00
Matt Turner	a8492e8fe7	glsl: Allow .length() method on vectors and matrices. Required by ARB_shading_language_420pack. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-06-14 11:25:49 -07:00
Todd Previte	cf7f424e18	mesa: Add infrastructure for ARB_shading_language_420pack. v2 [mattst88] - Split infrastructure into separate patch. - Add preprocessor #define. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-06-14 11:25:48 -07:00
Chia-I Wu	bfa8d21759	ilo: fix for half-float vertex arrays Commit `6fe0453c33` broke half-float vertex arrays. This reverts a part of that commit, and explains why.	2013-06-15 01:00:03 +08:00
Chia-I Wu	36ffd08706	ilo: add some assertions to help debugging Assert that we do not support user vertex/index/constant buffers. Issue a warning when a sampler view is created for a resource without PIPE_BIND_SAMPLER_VIEW.	2013-06-14 16:02:31 +08:00
Chia-I Wu	0d9afaad35	ilo: silence a compiler warning The path should never be hit.	2013-06-14 15:36:30 +08:00
Vinson Lee	93534873b0	glsl: Fix null check in read_dereference. Fixes "Logically dead code" defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2013-06-13 22:13:34 -07:00
Chia-I Wu	399548b17f	st/mesa: fix temp texture bindings in st_CopyPixels() The temporary texture should have either PIPE_BIND_RENDER_TARGET or PIPE_BIND_DEPTH_STENCIL set in addition to PIPE_BIND_SAMPLER_VIEW. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Marek Olšák <maraeo@gmail.com>	2013-06-14 08:46:04 +08:00
Zack Rusin	5507c11f85	gallium/draw: add limits to the clip and cull distances There are strict limits on those registers. Define the maximums and use them instead of magic numbers. Also allows us to add some extra sanity checks. Suggested by Brian. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-06-13 12:13:11 -04:00
Zack Rusin	b63eeaf7b7	draw: cleanup the distance culling code a bit We don't need the clamped variable, because we can just return early. We should also do the regular culling after the distance culling passes. All spotted by Brian. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-06-13 12:13:01 -04:00
Chia-I Wu	c7e9b15010	ilo: mapping a resource may make some states dirty When a resource is busy and is mapped with PIPE_TRANSFER_DISCARD_WHOLE_RESOURCE, the underlying bo is replaced. We need to mark states affected by the resource dirty. With this change, we no longer have to emit vertex buffers and index buffer unconditionally.	2013-06-13 23:47:18 +08:00
Chia-I Wu	5f15050dc9	ilo: bump up PIPE_CAP_GLSL_FEATURE_LEVEL to 140 With UBO and TBO support, we are supposedly good to claim GLSL 1.40.	2013-06-13 23:47:18 +08:00
Chia-I Wu	4df85dbc06	ilo: initialize dirty flags in ilo_init_states() Now that we have a function to initialize states, initialize dirty flags there too.	2013-06-13 23:47:18 +08:00
Chia-I Wu	6057d7b7b5	ilo: re-emit states that involve resources Even with hardware contexts, since we do not pin resources, we have to re-emit the states so that the resources are referenced (by cp->bo) and their offsets are updated in case they are moved. This also allows us to elimiate cp flush in is_bo_busy().	2013-06-13 12:58:47 +08:00
Chia-I Wu	b65bdc61bd	ilo: fix for util_blitter_clear() changes It has been broken since `17350ea979`.	2013-06-13 12:58:47 +08:00
Manfred Ernst	bf2c074a2f	mesa: Fix bug in unclamped float to ubyte conversion. Problem: The IEEE float optimized version of UNCLAMPED_FLOAT_TO_UBYTE in macros.h computed incorrect results for inputs in the range 0x3f7f0000 (=0.99609375) to 0x3f7f7f80 (=0.99803924560546875) inclusive. 0x3f7f7f80 is the IEEE float value that results in 254.5 when multiplied by 255. With rounding mode "round to closest even integer", this is the largest float in the range 0.0-1.0 that is converted to 254 by the generic implementation of UNCLAMPED_FLOAT_TO_UBYTE. The IEEE float optimized version incorrectly defined the cut-off for mapping to 255 as 0x3f7f0000 (=255.0/256.0). The same bug was present in the function float_to_ubyte in u_math.h. Fix: The proposed fix replaces the incorrect cut-off value by 0x3f800000, which is the IEEE float representation of 1.0f. 0x3f7f7f81 (or any value in between) would also work, but 1.0f is probably cleaner. The patch does not regress piglit on llvmpipe and on i965 on sandy bridge. Tested-by Stéphane Marchesin <marcheu@chromium.org> Reviewed-by Stéphane Marchesin <marcheu@chromium.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2013-06-12 20:24:48 -07:00
Marek Olšák	3475b22133	st/dri: if flushing a drawable, don't set reason=SWAPBUFFERS 0 means SWAPBUFFERS. Reviewed-by: Brian Paul <brianp@vmware.com>	2013-06-13 03:54:14 +02:00
Marek Olšák	a713d7b1b9	st/dri: resolve the back buffer only in SwapBuffers Reviewed-by: Brian Paul <brianp@vmware.com>	2013-06-13 03:54:14 +02:00
Marek Olšák	3b525036b9	st/dri: manually swap MSAA front and back buffers in SwapBuffers Reviewed-by: Brian Paul <brianp@vmware.com>	2013-06-13 03:54:14 +02:00
Marek Olšák	b77316ad75	st/dri: always copy new DRI front and back buffers to corresponding MSAA buffers This commit fixes these piglit tests with an MSAA visual forced on: - read-front - glx-copy-sub-buffer Reviewed-by: Brian Paul <brianp@vmware.com>	2013-06-13 03:54:14 +02:00
Marek Olšák	fdf9d234e2	st/dri: refactor dri_msaa_resolve The generic blit will be used by the following commit. Reviewed-by: Brian Paul <brianp@vmware.com>	2013-06-13 03:54:14 +02:00
Marek Olšák	6c6cfc02c9	st/dri: reuse depth-stencil and MSAA resources after DRI2 invalidate event Page flipping generates an invalidate event every frame, causing reallocations of all private resources (MSAA and depth-stencil). Reusing the resources may improve performance (especially under memory pressure). Reviewed-by: Brian Paul <brianp@vmware.com>	2013-06-13 03:54:14 +02:00
Marek Olšák	683b065320	st/dri: fix MSAA resolving of buffers with height > width Reviewed-by: Brian Paul <brianp@vmware.com>	2013-06-13 03:54:14 +02:00
Marek Olšák	526ebfa278	st/mesa: make generic CopyPixels path work with MSAA visuals We have to use pipe->blit, not resource_copy_region, so that the read buffer is resolved if it's multisampled. I also removed the CPU-based copying, which just did format conversion (obsoleted by the blit). Also, the layer/slice/face of the read buffer is taken into account (this was ignored). Last but not least, the format choosing is improved to take float and integer read buffers into account. Reviewed-by: Brian Paul <brianp@vmware.com>	2013-06-13 03:54:14 +02:00
Marek Olšák	9ef44e6eb7	st/mesa: don't use blit_copy_pixels if an occlusion query is active CopyPixels, just as DrawPixels, should count the samples that passed depth test. Reviewed-by: Brian Paul <brianp@vmware.com>	2013-06-13 03:54:13 +02:00
Marek Olšák	79e421260a	st/mesa: rework blit_copy_pixels to use pipe->blit There were 2 issues with it: - resource_copy_region doesn't allow different sample counts of both src and dst, which can occur if we blit between a window and a FBO, and the window has an MSAA colorbuffer and the FBO doesn't. (this was the main motivation for using pipe->blit) - blitting from or to a non-zero layer/slice/face was broken, because rtt_face and rtt_slice were ignored. blit_copy_pixels is now used even if the formats and orientation of framebuffers don't match. Reviewed-by: Brian Paul <brianp@vmware.com>	2013-06-13 03:54:13 +02:00
Marek Olšák	4d59258856	r600g: upsample and downsample MSAA resources for transfers We did downsample (=resolve) MSAA resources to make ReadPixels work with MSAA GLX visuals, which was enough for read-only color-only transfers. This commit makes write color transfers and depth-stencil transfers work in a similar manner. It does downsampling in transfer_map and upsampling in transfer_unmap. Reviewed-by: Brian Paul <brianp@vmware.com>	2013-06-13 03:54:13 +02:00
Marek Olšák	72a086b8b2	gallium/u_format: add a new helper for initializing pipe_blit_info::mask Reviewed-by: Brian Paul <brianp@vmware.com>	2013-06-13 03:54:13 +02:00
Marek Olšák	d6d4a9a2e8	gallium/u_blitter: make clearing independent of the colorbuffer format There isn't any difference between 32_FLOAT and 32_*INT in vertex fetching. Both of them don't do any format conversion. Reviewed-by: Brian Paul <brianp@vmware.com>	2013-06-13 03:54:13 +02:00
Marek Olšák	17350ea979	gallium/u_blitter: make clearing independent of the number of bound colorbuffers We can use the fragment shader TGSI property WRITES_ALL_CBUFS. Reviewed-by: Brian Paul <brianp@vmware.com>	2013-06-13 03:54:13 +02:00
Marek Olšák	de1c38299c	gallium/util: make WRITES_ALL_CBUFS optional in the passthrough fragment shader Reviewed-by: Brian Paul <brianp@vmware.com>	2013-06-13 03:54:13 +02:00
Marek Olšák	45595d5066	mesa: fix OES_EGL_image_external being partially allowed in the core profile Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2013-06-13 03:54:13 +02:00
Ian Romanick	cfa3c5ad82	glsl: Generate smaller values for uniform locations Previously we would generate uniform locations as (slot << 16) + array_index. We do this to handle applications that assume the location of a[2] will be +1 from the location of a[1]. This resulted in every uniform location being at least 0x10000. The OpenGL 4.3 spec was amended to require this behavior, but previous versions did not require locations of array (or structure) members be sequential. We've now encountered two applications that assume uniform values will be "small." As far as we can tell, these applications store the GLint returned by glGetUniformLocation in a int16_t or possibly an int8_t. THIS BEHAVIOR IS NOT GUARANTEED OR IMPLIED BY ANY VERSION OF OpenGL. Other implementations happen to have both these behaviors (sequential array elements and small values) since OpenGL 2.0, so let's just match their behavior. Fixes "3D Bowling" on Android. NOTE: This is a candidate for stable release branches. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-and-tested-by: Chad Versace <chad.versace@linux.intel.com>	2013-06-12 16:30:29 -07:00
Ian Romanick	26d86d26f9	glsl: Add gl_shader_program::UniformLocationBaseScale This is used by _mesa_uniform_merge_location_offset and _mesa_uniform_split_location_offset to determine how the base and offset are packed. Previously, this value was hard coded as (1U<<16) in those functions via the shift and mask contained therein. The value is still (1U<<16), but it can be changed in the future. The next patch dynamically generates this value. NOTE: This is a candidate for stable release branches. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-and-tested-by: Chad Versace <chad.versace@linux.intel.com>	2013-06-12 16:30:18 -07:00
Ian Romanick	5097f35841	glsl: Add a gl_shader_program parameter to _mesa_uniform_{merge,split}_location_offset This will be used in the next commit. NOTE: This is a candidate for stable release branches. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-and-tested-by: Chad Versace <chad.versace@linux.intel.com>	2013-06-12 16:30:06 -07:00
Roland Scheidegger	4cce4efaa3	util: new util_fill_box helper Use new util_fill_box helper for util_clear_render_target. (Also fix off-by-one map error.) v2: handle non-zero z correctly in new helper Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-06-13 00:41:43 +02:00
Roland Scheidegger	957c040eb8	gallivm: (trivial) remove duplicated code block (including comment)	2013-06-13 00:41:43 +02:00
Paul Berry	b09a754078	i965/gen7: Enable support for fast color clears. This patch adds code to place mcs_state into INTEL_MCS_STATE_RESOLVED for miptrees that are capable of supporting fast color clears. This will have no effect on buffers that don't undergo a fast color clear; however, for buffers that do undergo a fast color clear, an MCS miptree will be allocated (at the time of the first fast clear), and will be used thereafter. Reviewed-by: Eric Anholt <eric@anholt.net>	2013-06-12 11:10:07 -07:00
Paul Berry	ef9142d4a3	i965/gen7+: Disable fast color clears on shared regions. In certain circumstances the memory region underlying a miptree is shared with other miptrees, or with other code outside Mesa's control. This happens, for instance, when an extension like GL_OES_EGL_image or GLX_EXT_texture_from_pixmap extension is used to associate a miptree with an image existing outside of Mesa. When this happens, we need to disable fast color clears on the miptree in question, since there's no good synchronization mechanism to ensure that deferred clear writes get performed by the time the buffer is examined from the other miptree, or from outside of Mesa. Fortunately, this should not be a performance hit for most applications, since most applications that use these extensions use them for importing textures into Mesa, rather than for exporting rendered images out of Mesa. So most of the time the miptrees involved will never experience a clear. v2: Rework based on the fact that we have decided not to use an accessor function to protect access to the region. Reviewed-by: Eric Anholt <eric@anholt.net>	2013-06-12 11:10:07 -07:00
Paul Berry	67cd0f9703	i965/gen7+: Resolve color buffers when necessary. Resolve color buffers that have been fast-color cleared: 1. before texturing from the buffer (brw_predraw_resolve_buffers()) 2. before using the buffer as the source in a blorp blit (brw_blorp_blit_miptrees()) 3. before mapping the buffer's miptree (intel_miptree_map_raw(), intel_texsubimage_tiled_memcpy()) 4. before accessing the buffer using the hardware blitter (intel_miptree_blit(), do_blit_bitmap()) v2: Rework based on the fact that we have decided not to use an accessor function to protect access to the region. Reviewed-by: Eric Anholt <eric@anholt.net>	2013-06-12 11:10:07 -07:00
Paul Berry	e9dfcb38e9	i965/gen7+: Ensure that front/back buffers are fast-clear resolved. We already had code in intel_downsample_for_dri2_flush() for downsampling front and back buffers when multisampling was in use. This patch extends that function to perform fast color clear resolves when necessary. To account for the additional functionality, the function is renamed to simply intel_resolve_for_dri2_flush(). Reviewed-by: Eric Anholt <eric@anholt.net>	2013-06-12 11:10:07 -07:00

1 2 3 4 5 ...

57088 Commits All Branches Search

57088 Commits

All Branches