KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Christian König	1c10018925	radeonsi: add preloading for all samplers Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2013-03-26 12:57:43 +01:00
Christian König	0f6cf2bc79	radeonsi: add preloading of all constants Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2013-03-26 12:57:40 +01:00
Christian König	44e3224554	radeonsi: mark most intrinsics as readnone/nounwind Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2013-03-26 12:57:36 +01:00
Christian König	206f059e1f	radeonsi: mark all loads as constant Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2013-03-26 12:57:33 +01:00
Christian König	86f6fc2f1d	radeonsi: remove wqm intrinsic Now the backend handles that itself. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2013-03-26 12:57:30 +01:00
Christian König	6249db73ea	radeon/llvm: remove uneeded inclusion The include isn't needed and the file has moved with LLVM master. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2013-03-26 12:57:23 +01:00
Christian König	0f001fbff1	glsl_to_tgsi: avoid creating arrays if driver doesn't support them Avoid creating arrays if we replace indirect addressing anyway. Signed-off-by: Christian König <christian.koenig@amd.com>	2013-03-26 10:22:27 +01:00
Christian König	462de2e65f	glsl_to_tgsi: make simplify_cmp work with arrays Even when we have arrays it is possible for simplify_cmp to work on temps, just not on arrays. Fixes: https://bugs.freedesktop.org/show_bug.cgi?id=62696 Signed-off-by: Christian König <christian.koenig@amd.com>	2013-03-26 10:22:27 +01:00
Marek Olšák	98a8e5b87e	gallium/docs: document get_driver_query_info	2013-03-26 01:37:40 +01:00
Marek Olšák	8ddae684af	r600g: add a driver query returning the amount of requested VRAM and GTT memory	2013-03-26 01:28:19 +01:00
Marek Olšák	2504380aaf	r600g: add a driver query returning the number of draw_vbo calls between begin_query and end_query	2013-03-26 01:28:19 +01:00
Marek Olšák	e40c634bd2	st/dri: integrate the HUD Reviewed-by: Brian Paul <brianp@vmware.com>	2013-03-26 01:28:19 +01:00
Marek Olšák	c91cf7d7d2	gallium: implement a heads-up display module Reviewed-by: Brian Paul <brianp@vmware.com> v2: lots of cosmetic changes	2013-03-26 01:28:19 +01:00
Marek Olšák	8ddcd715b7	gallium: add interface for driver queries like performance counters, etc. The pipe query interface is reused. The list of available queries can be obtained using pipe_screen::get_driver_query_info. Reviewed-by: Brian Paul <brianp@vmware.com>	2013-03-26 01:28:19 +01:00
Marek Olšák	9cec5edea7	gallium/tgsi: fix valgrind warning "Conditional jump or move depends on uninitialised value(s)" Reviewed-by: Brian Paul <brianp@vmware.com>	2013-03-26 01:28:19 +01:00
Marek Olšák	17003b44b7	st/mesa: fix crash with blit-based GetTexImage https://bugs.freedesktop.org/show_bug.cgi?id=62573 Tested-by: Andreas Boll <andreas.boll.dev@gmail.com>	2013-03-26 01:28:19 +01:00
Marek Olšák	d1b91e309b	cso: add constant buffer save/restore feature for postprocessing Postprocessing is an internal meta op and should restore the states it changes.	2013-03-26 01:28:18 +01:00
Marek Olšák	35c522dce4	radeonsi: fix crash while binding a NULL constant buffer Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2013-03-26 01:28:18 +01:00
Marek Olšák	a2378daf83	r600g: fix crash while binding a NULL constant buffer	2013-03-26 01:28:18 +01:00
Marek Olšák	53228fe2a8	r300g: fix crash while binding a NULL constant buffer	2013-03-26 01:28:18 +01:00
Martin Andersson	92855bcc95	r600g: Use virtual address for PIPE_QUERY_SO* in r600_emit_query_end Virtual address is used for PIPE_QUERY_SO* queries in r600_emit_query_begin, but not in r600_emit_query_end. This will trigger a GPU fault when one of those queries is made and virtual address is enabled. Note: this is a candidate for the 9.1 branch Signed-off-by: Alex Deucher <alexander.deucher@amd.com>	2013-03-25 18:18:23 -04:00
Rob Clark	634fb837ef	freedreno: use u_debug for debug env vars Signed-off-by: Rob Clark <robdclark@gmail.com>	2013-03-25 15:05:44 -04:00
Jordan Justen	e207c33020	glsl ir: add as_dereference_record Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-03-25 11:35:56 -07:00
Brian Paul	eb92f89587	gallium: undef PACKAGE_* macros to silence warnings Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-03-25 12:24:11 -06:00
Brian Paul	c0f16df938	gallivm: init vars to silence warnings Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-03-25 12:24:11 -06:00
Brian Paul	35aefe9226	swrast: init vars to silence warnings Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-03-25 12:24:11 -06:00
Rob Clark	980f1cf8a1	freedreno: prefer sw upload for textures Since we are UMA, in most cases the GPU blit doesn't make much sense for texture upload. Signed-off-by: Rob Clark <robdclark@gmail.com>	2013-03-25 13:05:44 -04:00
Rob Clark	732b0b5ebc	freedreno: track maximal scissor bounds Optimize out parts of the render target that are scissored out by taking into account maximal scissor bounds in fd_gmem_render_tiles(). This is a big win on things like gnome-shell which frequently do partial screen updates. Signed-off-by: Rob Clark <robdclark@gmail.com>	2013-03-25 13:05:44 -04:00
Adrian Marius Negreanu	8a4750fe5e	android: fix Android.mk bug in mesa/drivers/dri/common target-specific variables are undefined when used as pre-requisites. instead, use secondary-expansion. I noticed this when building the patch: i965: Add a driconf option to disable flush throttling Signed-off-by: Adrian Marius Negreanu <adrian.m.negreanu@intel.com> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2013-03-25 09:52:19 -07:00
Eric Anholt	712bac1f41	mesa: Disable validate_ir_tree() on release builds. Since half of ir_validate uses asserts() (the other using printf() then abort()), there's not much use to calling it in a release build. Cuts 6.3% of the startup time of TF2. NOTE: This is a candidate for the stable branches. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-03-25 08:50:38 -07:00
Roland Scheidegger	92b8a37fdf	gallivm: move code for dealing with rgb9e5 and r11g11b10 formats to own file This is really not generic conversion stuff and the code very particular to these formats.	2013-03-24 22:54:45 +01:00
Vinson Lee	7d0c1f2437	llvmpipe: Fix assertions with assignment instead of comparison. Fixes assign instead of compare defects reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2013-03-24 14:49:22 -07:00
Paul Berry	a593a1b276	i965: Shrink brw_vue_map struct. This patch changes the arrays in brw_vue_map (which only ever contain values from -1 to 58) from ints to signed chars. This reduces the size of the struct from 488 bytes to 136 bytes. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> v2: fix STATIC_ASSERT to use 127 instead of 128. Reviewed-by: Eric Anholt <eric@anholt.net>	2013-03-24 10:55:28 -07:00
Paul Berry	0a0deb92d9	i965/fs: Rename vp_outputs_written to input_slots_valid. With the introduction of geometry shaders, fragment inputs will no longer come exclusively from the vertex shader; sometimes they come from the geometry shader. So the name "vp_outputs_written" will become a misnomer. This patch renames vp_outputs_written to input_slots_valid, to reflect the true meaning of the bitfield from the fragment shader's point of view: it indicates which of the possible input slots contain valid data that was written by the previous shader stage. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-03-24 10:55:28 -07:00
Paul Berry	bf9bfe838e	i965: Use brw.vue_map_geom_out instead of VS output VUE map where appropriate. This patch modifies post-GS pipeline stages (transform feedback, clip, sf, fs) to refer to the VUE map through brw->vue_map_geom_out rather than brw->vs.prog_data->vue_map. This ensures that when geometry shader support is added, these pipeline stages will consult the geometry shader output VUE map when appropriate, rather than the vertex shader output VUE map. v2: Fixed some stale "CACHE_NEW_VS_PROG" comments. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-03-24 10:55:27 -07:00
Paul Berry	463ef47b16	i965: Store the geometry output VUE map in brw_context. Currently, the GPU pipeline has one active VUE map in effect at any given time--the one representing the layout of vertex data coming from the vertex shader. However, when geometry shaders are added, they will have their own independent VUE map. Later pipeline stages (clip, sf, fs) will need to consult the geometry shader VUE map if a geometry shader is in use, and the vertex shader VUE map otherwise. This patch adds a new field to brw_context, vue_map_geom_out, which contains the VUE map that should be used by later pipeline stages. It also adds a new state flag, BRW_NEW_VUE_MAP_GEOM_OUT, which is signalled whenever the contents of the VUE map changes. Since we don't support geometry shaders yet, vue_map_geom_out is currently set only by the brw_vs_prog state atom. v2: Don't set vue_map_geom_out in do_vs_prog--that's redundant and possibly problematic for precompiles. Only set it in brw_upload_vs_prog. Also, make a copy instead of using a pointer--this makes it possible to detect when the VUE map hasn't changed, so we can avoid redundant state uploads. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-03-24 10:55:27 -07:00
Paul Berry	8fbc22e880	i965: Move brw_vs_prog_data::outputs_written into VUE map. Future patches will allow for there to be separate VUE maps when both a geometry shader and a vertex shader are in use. When this happens, we will want to have correspondingly separate outputs_written bitfields. Moving outputs_written into the VUE map will make this easy. For consistency with the terminology used in the VUE map, the bitfield is renamed to "slots_valid" in the process. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-03-24 10:55:27 -07:00
Paul Berry	76ba30800d	i965/gen7: Use WE_all mode when enabling channel masks for URB write. Gen7 adds mask bits to the message header for a URB write which allow the write to apply only to certain channels. We don't use this functionality, so to ensure that the entire write always occurs, we emit an OR instruction to set the mask bits. With the advent of geometry shaders, URB writes won't just happen at the end of a thread; they will happen in mid-thread too. Thus, we can no longer rely on channel 0 being enabled, so we need to emit the OR instruction in WE_all mode to ensure that it is executed. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-03-24 10:55:27 -07:00
Paul Berry	8371c68a4b	i965: Rename BRW_VARYING_SLOT_MAX -> BRW_VARYING_SLOT_COUNT. The new name clarifies that it represents one more than the maximum possible brw_varying_slot value. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-03-24 10:55:27 -07:00
Paul Berry	ec9c3882d9	i965: Clarify nomenclature: vert_result -> varying This patch removes the terminology "vert_result" from the i965 driver, replacing it with "varying". The old terminology, "vert_result", was confusing because (a) it referred to the enum gl_vert_result, which no longer exists (it was replaced with gl_varying_slot), and (b) it implied a vertex output, but with the advent of geometry shaders, it could be either a vertex or a geometry output, depending what shaders are in use. The generic term "varying" is less confusing. No functional change. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> v2: Whitespace fixes.	2013-03-23 22:47:54 -07:00
Chris Forbes	f56fb9d248	i965: bump MAX_DEPTH_TEXTURE_SAMPLES to 4/8 Bump MAX_DEPTH_TEXTURE_SAMPLES to match what GetInternalformativ is claiming. Since that limit is what is actually enforced now, this doesn't actually change anything except the queried value. There's still no piglits verifying that multisample depth textures work, but this works in the Unigine demos. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-03-24 16:38:18 +13:00
Chris Forbes	2405da174e	mesa: use _mesa_check_sample_count() for multisample textures Extends _mesa_check_sample_count() to properly support the TEXTURE_2D_MULTISAMPLE and TEXTURE_2D_MULTISAMPLE_ARRAY targets, which have subtly different limits than renderbuffers. This resolves the remaining TODO in the implementation of TexImage*DMultisample. V2: - Don't introduce spurious block. - Do this in multisample.c instead. - Fix typo in error message. - Inline spec quotes Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-03-24 16:38:18 +13:00
Chris Forbes	90b5a2425a	mesa: helper for checking renderbuffer sample count Pulls the checking of the sample count into a helper function, and extends the existing logic to include the interactions with both ARB_texture_multisample and ARB_internalformat_query. _mesa_check_sample_count() checks a desired sample count against a a combination of target/internalformat, and returns the error enum to be produced, if any. Unfortunately the conditions are messy and the errors vary. V2: - Tidy up spurious block. - Move _mesa_check_sample_count() to multisample.c instead; It doesn't really belong in fbobject.c or teximage.c. - Inlined spec quotes Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-03-24 16:38:18 +13:00
Chris Forbes	86b8380600	mesa: allow internalformat_query with multisample texture targets Now that we support ARB_texture_multisample, there are multiple targets accepted for this query, and they may have target-dependent limits, so pass the target to the driverfunc. For example, the sampling hardware may not be able to do general texelFetch() for some format/sample count combination, but the driver may still be able to implement a reasonable resolve operation, so it can be supported for renderbuffers. V2: - Don't break Gallium compile. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-03-24 16:38:18 +13:00
Dmitry Cherkassov	3cc2629b3b	clover: add dynamic_cast results checking down in clSetKernelArgument() code path. Signed-off-by: Dmitry Cherkassov <dcherkassov@gmail.com> Signed-off-by: Francisco Jerez <currojerez@riseup.net>	2013-03-24 02:43:34 +01:00
Roland Scheidegger	b50e362dbb	gallivm: Add code for rgb9e5 shared exponent format to float conversion And use this (and the code for r11g11b10 packed float to float conversion) in the soa texturing code (the generated code looks quite good). Should be an order of magnitude faster probably than using the fallback (not measured). Tested with piglit texwrap GL_EXT_packed_float and GL_EXT_texture_shared_exponent respectively (didn't find much else using it). Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-03-24 02:09:02 +01:00
Marek Olšák	3e10ab6b22	gallium,st/mesa: don't use blit-based transfers with software rasterizers The blit-based paths for TexImage, GetTexImage, and ReadPixels aren't very fast with software rasterizer. Now Gallium drivers have the ability to turn them off. Reviewed-by: Brian Paul <brianp@vmware.com> Tested-by: Brian Paul <brianp@vmware.com>	2013-03-23 13:19:16 +01:00
Marek Olšák	25e3094058	st/mesa: implement blit-based ReadPixels Initial version contributed by: Martin Andersson <g02maran@gmail.com> This is only used if the memcpy path cannot be used and if no transfer ops are needed. It's pretty similar to our TexImage and GetTexImage implementations. The motivation behind this is to be able to use ReadPixels every frame and still have at least 20 fps (or 60 fps with a powerful GPU and CPU) instead of 0.5 fps. Reviewed-by: Brian Paul <brianp@vmware.com> Tested-by: Brian Paul <brianp@vmware.com>	2013-03-23 13:17:05 +01:00
Marek Olšák	d702c67ba5	mesa: add common format-independent memcpy-based ReadPixels path I'll need the _mesa_readpixels_needs_slow_path function for the blit-based version, but it's also useful to have this memcpy-based path in one place and not scattered across several functions. v2: add "const" to function parameters Reviewed-by: Brian Paul <brianp@vmware.com> Tested-by: Brian Paul <brianp@vmware.com>	2013-03-23 13:17:05 +01:00
Marek Olšák	f8855a4214	mesa: add helper func for checking combined depthstencil buffers from st/mesa Reviewed-by: Brian Paul <brianp@vmware.com> Tested-by: Brian Paul <brianp@vmware.com>	2013-03-23 13:17:05 +01:00

... 3 4 5 6 7 ...

55924 Commits All Branches Search

55924 Commits

All Branches