KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Eric Anholt	0cadd32b6d	i965: Fix gen6 pixel_[xy] setup to avoid mixing int and float src operands. Pre-gen6, you could mix int and float just fine. Now, you get goofy results. Fixes: glsl-arb-fragment-coord-conventions glsl-fs-fragcoord glsl-fs-if-greater glsl-fs-if-greater-equal glsl-fs-if-less glsl-fs-if-less-equal	2010-10-11 15:26:59 -07:00
Eric Anholt	17306c60ad	i965: Don't compute-to-MRF in gen6 VS math. There was code to do this for pre-gen6 already, this just enables it for gen6 as well.	2010-10-11 15:26:59 -07:00
Eric Anholt	720ed3c906	i965: Expand uniform args to gen6 math to full registers to get hstride == 1. This is a hw requirement in math args. This also is inefficient, as we're calculating the same result 8 times, but then we've been doing that on pre-gen6 as well. If we're doing math on uniforms, though, we'd probably be better served by having some sort of mechanism for precalculating those results into another uniform value to use. Fixes 7 piglit math tests.	2010-10-11 15:26:58 -07:00
Eric Anholt	317dbf4613	i965: Don't compute-to-MRF in gen6 math instructions.	2010-10-11 15:26:58 -07:00
Eric Anholt	7b5bc38c44	i965: Add a couple of checks for gen6 math instruction limits.	2010-10-11 15:26:58 -07:00
Eric Anholt	25cf241540	i965: Don't consider gen6 math instructions to write to MRFs. This was leftover from the pre-gen6 cleanups. One tests regresses where compute-to-MRF now occurs.	2010-10-11 15:26:58 -07:00
Chad Versace	41c2079855	glsl: Changes in generated file glsl_lexer.cpp Signed-off-by: Ian Romanick <ian.d.romanick@intel.com>	2010-10-11 14:25:53 -07:00
Chad Versace	0c9fef6111	glsl: Add lexer rules for uint and uvecN (N=2..4) Commit for generated file glsl_lexer.cpp follows this commit. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2010-10-11 14:25:48 -07:00
Chad Versace	fc99a3beb9	glsl: Add glsl_type::uvecN_type for N=2,3 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2010-10-11 14:25:44 -07:00
Chad Versace	a34817917b	intel_extensions: Add ability to set GLSL version via environment Add ability to set the GLSL version used by the GLcontext by setting the environment variable INTEL_GLSL_VERSION. For example, env INTEL_GLSL_VERSION=130 prog args If the environment variable is missing, the GLSL versions defaults to 120. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2010-10-11 14:25:30 -07:00
Daniel Vetter	603741a86d	r200: revalidate after radeon_update_renderbuffers By calling radeon_draw_buffers (which sets the necessary flags in radeon->NewGLState) and revalidating if NewGLState is non-zero in r200TclPrimitive. This fixes an assert in libdrm (the color-/ depthbuffer was changed but not yet validated) and and stops the kernel cs checker from complaining about them (when they're too small). Thanks to Mario Kleiner for the hint to call radeon_draw_buffer (instead of my half-broken hack). v2: Also fix the swtcl r200 path. Cc: Mario Kleiner <mario.kleiner@tuebingen.mpg.de> Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>	2010-10-11 15:16:44 -04:00
Eric Anholt	c6dbf253d2	i965: Compute to MRF in the new FS backend. This didn't produce a statistically significant performance difference in my demo (n=4) or nexuiz (n=3), but it still seems like a good idea and is recommended by the HW team.	2010-10-11 12:08:13 -07:00
Eric Anholt	06fd639c51	i965: Give the FB write and texture opcodes the info on base MRF, like math.	2010-10-11 12:07:33 -07:00
Eric Anholt	0cd6cea8a3	i965: Give the math opcodes information on base mrf/mrf len. This is progress towards enabling a compute-to-MRF pass.	2010-10-11 12:03:34 -07:00
Eric Anholt	37758fb1cb	i965: Move FS backend structures to a header. It's time to start splitting some of this up.	2010-10-11 11:52:02 -07:00
Eric Anholt	251fe27854	i965: Reduce register interference checks for changed FS_OPCODE_DISCARD. While I don't know of any performance changes from this (once extra reg available out of 128), it makes the generated asm a lot cleaner looking.	2010-10-11 11:52:01 -07:00
Eric Anholt	90c4022040	i965: Split FS_OPCODE_DISCARD into two steps. Having the single opcode write then read the reg meant that single instruction opcodes had to consider their source regs to interfere with their dest regs.	2010-10-11 11:52:01 -07:00
José Fonseca	986cb9d5cf	llvmpipe: Use lp_tgsi_info.	2010-10-11 13:06:25 +01:00
José Fonseca	7c1b5772a8	gallivm: More detailed analysis of tgsi shaders. To allow more optimizations, in particular for direct textures.	2010-10-11 13:05:32 +01:00
José Fonseca	11dad21718	tgsi: Export some names for some tgsi enums. Useful to give human legible names in other cases.	2010-10-11 13:05:31 +01:00
José Fonseca	6c1aa4fd49	gallium: Define C99 restrict keyword where absent.	2010-10-11 13:05:31 +01:00
José Fonseca	e1003336f0	gallivm: Eliminate unsigned integer arithmetic from texture coordinates. SSE support for 32bit and 16bit unsigned arithmetic is not complete, and can easily result in inefficient code. In most cases signed/unsigned doesn't make a difference, such as for integer texture coordinates. So remove uint_coord_type and uint_coord_bld to avoid inefficient operations to sneak in the future.	2010-10-11 08:14:09 +01:00
José Fonseca	b18fecbd0e	llvmpipe: Remove outdated comment about stencil testing.	2010-10-11 08:14:09 +01:00
Dave Airlie	3322416de4	r600g: don't run with scissors. This could probably be done much nicer, I've spent a day chasing a coherency problem in the kernel, that turned out to be incorrect scissor setup.	2010-10-11 16:23:23 +10:00
Dave Airlie	ef2702fb20	r600g: add TXL opcode support. fixes glsl1-2D Texture lookup with explicit lod (Vertex shader)	2010-10-11 12:18:05 +10:00
Dave Airlie	ea1d818b58	r600g: enable vertex samplers. We need to move the texture sampler resources out of the range of the vertex attribs. We could probably improve this using an allocator but this is the simple answer for now. makes mesa-demos/src/glsl/vert-tex work.	2010-10-11 11:59:53 +10:00
Dave Airlie	2c47f302af	r600g: evergreen has no request size bit in texture word4	2010-10-11 11:59:53 +10:00
Dave Airlie	bd89da79a1	r600g: fix input/output Z export mixup for evergreen.	2010-10-11 11:59:53 +10:00
José Fonseca	17dbd41cf2	gallivm: Pass texture coords derivates as scalars. We end up treating them as scalars in the end, and it saves some instructions.	2010-10-10 19:51:35 +01:00
José Fonseca	693667bf88	gallivm: Use variables instead of Phis in loops. With this commit all explicit Phi emission is now gone.	2010-10-10 19:05:05 +01:00
José Fonseca	48003f3567	gallivm: Allow to disable bri-linear filtering with GALLIVM_DEBUG=no_brilinear runtime option	2010-10-10 18:48:02 +01:00
José Fonseca	124adf253c	gallivm: Fix a long standing bug with nested if-then-else emission. We can't patch true-block at end-if time, as there is no guarantee that the block at the beginning of the true stanza is the same at the end of the true stanza -- other control flow elements may have been emitted half way the true stanza. Although this bug surfaced recently with the commit to skip mip filtering when lod is an integer the bug was always there, although probably it was avoided until now: e.g., cubemap selection nests if-then-else on the else stanza, which does not suffer from the same problem.	2010-10-10 18:48:02 +01:00
delphi	08f890d4c3	draw: some changes to allow for runtime changes to userclip planes	2010-10-10 08:40:11 +01:00
Francisco Jerez	e2acc7be26	dri/nv10: Fake fast Z clears for pre-nv17 cards.	2010-10-10 04:14:34 +02:00
Francisco Jerez	35a1893fd1	dri/nouveau: Minor cleanup.	2010-10-10 01:48:01 +02:00
José Fonseca	307df6a858	gallivm: Cleanup the rest of the flow module.	2010-10-09 21:39:14 +01:00
José Fonseca	d0ea464159	gallivm: Simplify if/then/else implementation. No need for for a flow stack anymore.	2010-10-09 21:14:05 +01:00
José Fonseca	1949f8c315	gallivm: Factor out the SI->FP texture size conversion for SoA path too	2010-10-09 20:26:11 +01:00
José Fonseca	d45c379027	gallivm: Remove support for Phi generation. Simply rely on mem2reg pass. It's easier and more reliable.	2010-10-09 20:14:03 +01:00
José Fonseca	ea7b49028b	gallivm: Use varilables instead of Phis for cubemap selection.	2010-10-09 19:53:21 +01:00
José Fonseca	cc40abad51	gallivm: Don't generate Phis for execution mask.	2010-10-09 12:55:31 +01:00
José Fonseca	679dd26623	gallivm: Special bri-linear computation path for unmodified rho.	2010-10-09 12:13:00 +01:00
José Fonseca	81a09c8a97	gallivm: Less code duplication in log computation.	2010-10-09 12:12:59 +01:00
José Fonseca	52427f0ba7	util: Defined M_SQRT2 when not available.	2010-10-09 12:12:59 +01:00
José Fonseca	53d7f5e107	gallivm: Handle code have ret correctly. Stop disassembling on unconditional backwards jumps.	2010-10-09 12:12:59 +01:00
José Fonseca	edba53024f	llvmpipe: Fix MSVC build. Enable the new SSE2 code on non SSE3 systems.	2010-10-09 12:12:58 +01:00
Keith Whitwell	2de720dc8f	llvmpipe: simplified SSE2 swz/unswz routines We've been using these in the linear path for a while now. Based on Chris's SSSE3 code, but using only sse2 opcodes. Speed seems to be identical, but code is simpler & removes dependency on SSE3. Should be easier to extend to other rgba8 formats.	2010-10-09 12:12:58 +01:00
Keith Whitwell	5b7eb868fd	llvmpipe: clean up shader pre/postamble, try to catch more early-z Specifically, can do early-depth-test even when alpahtest or kill-pixel are active, providing we defer the actual z write until the final mask is avaialable. Improves demos/fire.c especially in the case where you get close to the trees.	2010-10-09 11:44:45 +01:00
Keith Whitwell	aa4cb5e2d8	llvmpipe: try to be sensible about whether to branch after mask updates Don't branch more than once in quick succession. Don't branch at the end of the shader.	2010-10-09 11:44:45 +01:00
Keith Whitwell	2ef6f75ab4	gallivm: simpler uint8->float conversions LLVM seems to finds it easier to reason about these than our mantissa-manipulation code.	2010-10-09 11:44:45 +01:00

... 5 6 7 8 9 ...

40202 Commits All Branches Search

40202 Commits

All Branches