mirrors/mesa - Frog Git

Commit Graph

Author	SHA1	Message	Date
Ilia Mirkin	018aa27953	gallium: add new semantics for tessellation Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2015-05-16 14:48:54 +02:00
Marek Olšák	216543ea54	gallium: add FMA and DFMA opcodes (v3) Needed by ARB_gpu_shader5. v2: select DMAD for FMA with double precision v3: add and select DFMA Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-03-16 12:54:18 +01:00
Ilia Mirkin	12dedca523	gallium: add some more double opcodes to avoid unnecessary lowering Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Dave Airlie <airlied@redhat.com>	2015-02-19 19:32:35 -05:00
Dave Airlie	3cd1338534	gallium: add double opcodes and TGSI execution (v4.2) This patch adds support for a set of double opcodes to TGSI. It is an update of work done originally by Michal Krol on the gallium-double-opcodes branch. The opcodes have a hint where they came from in the header file. v2: add unsigned/int <-> double v2.1: update docs. v3: add DRSQ (Glenn), fix review comments (Glenn). v4: drop DDIV v4.1: cleanups, fix some docs bugs, (Ilia) rework store_dest and fetch_source fns. (Ilia) 4.2: fixup float comparisons (Ilia) This is based on code by Michal Krol <michal@vmware.com> Roland and Glenn also reviewed earlier versions. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Dave Airlie <airlied@redhat.com>	2015-02-20 08:49:12 +10:00
Roland Scheidegger	f01e8d3ba5	gallium/docs: fix docs wrt ARL/ARR/FLR since the address reg holds integer values, ARL/ARR do an implicit float-to-int conversion, so clarify that. Thus it is also incorrect to say that FLR really does the same as ARL. Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2015-01-29 22:08:12 +01:00
Roland Scheidegger	ade8b26bf5	gallium: add TGSI_SEMANTIC_VERTEXID_NOBASE and TGSI_SEMANTIC_BASEVERTEX Plus a new PIPE_CAP_VERTEXID_NOBASE query. The idea is that drivers not supporting vertex ids with base vertex offset applied (so, only support d3d10-style vertex ids) will get such a d3d10-style vertex id instead - with the caveat they'll also need to handle the basevertex system value too (this follows what core mesa already does). Additionally, this is also useful for other state trackers (for instance llvmpipe / draw right now implement the d3d10 behavior on purpose, but with different semantics it can just do both). Doesn't do anything yet. And fix up the docs wrt similar values. v2: incorporate feedback from Brian and others, better names, better docs. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-12-16 04:23:00 +01:00
Roland Scheidegger	fe7e6b248f	gallium/docs: clarify fragment shader position input w component. The previous language was a bit misleading, since it sounded like w was interpolated then the reciprocal calculated which isn't what should be happening. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2014-12-10 22:11:16 +01:00
Roland Scheidegger	ec30c66b46	gallium/docs: (trivial) remove STR opcode description. The opcode was removed alongside SFL by commit `ecfe9e2ad2`.	2014-12-06 17:56:46 +01:00
Eric Anholt	365a4a3f9a	gallium: Drop the unused CND opcode. Nothing in the tree generates it. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2014-11-24 14:56:22 -08:00
Eric Anholt	00f7002c5c	gallium: Drop unused BRA opcode. Never generated, and implemented in only nvfx vertprog. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2014-11-24 14:56:22 -08:00
Eric Anholt	ecfe9e2ad2	gallium: Drop the unused SFL/STR opcodes. Nothing generated them. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2014-11-24 14:56:22 -08:00
Eric Anholt	dc00b382b5	gallium: Drop the unused RFL opcode. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2014-11-24 14:56:22 -08:00
Eric Anholt	8c822b1e91	gallium: Drop unused X2D opcode. Nothing in the tree generates it. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2014-11-24 14:56:22 -08:00
Eric Anholt	ff886c4955	gallium: Drop the unused ARA opcode. Nothing in the tree generated it. v2: Only drop ARA, not ARR as well. Reviewed-by: Jose Fonseca <jfonseca@vmware.com> (v2)	2014-11-24 14:56:22 -08:00
Eric Anholt	de2f8d75db	gallium: Drop the unused RCC opcode. Nothing in the tree generated it. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2014-11-24 14:56:22 -08:00
Eric Anholt	d4864cdf15	gallium: Drop the NRM and NRM4 opcodes. They weren't generated in tree, and as far as I know all hardware had to lower it to a DP, RSQ, MUL. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2014-11-24 14:56:22 -08:00
Brian Paul	52576dcb88	gallium/docs: fix NRM, NRM4 docs Need to do a sqrt(). FWIW, the html that Sphinx 1.1.3 generates for the math expressions looks completely broken. Reviewed-by: José Fonseca <jfonseca@vmware.com>	2014-11-01 09:00:07 -06:00
Ilia Mirkin	8ee74ce50f	gallium: add opcodes/cap for fine derivative support Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com> (v1) Reviewed-by: Roland Scheidegger <sroland@vmware.com> (v1) v2: Reuse opcode gaps as suggested by Marek	2014-08-14 20:25:32 -04:00
Roland Scheidegger	9042e8863a	gallium/docs: Document TEX2/TXL2/TXB2 instructions and fix up other tex doc Add documentation for TEX2/TXL2/TXB2 tgsi opcodes. Also, the texture opcode documentation wasn't very accurate so fix this up a bit. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-08-05 04:13:17 +02:00
Brian Paul	da46b9de9f	gallium/docs: minor clarification for TXQ instruction Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2014-07-10 11:30:04 -06:00
Ilia Mirkin	7c9161521a	gallium: add INTERP_* opcodes to support interpolateAt* Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2014-07-09 22:32:13 -04:00
Ilia Mirkin	4c97ed4411	gallium: switch dedicated centroid field to interpolation location The new location field can be either center, centroid, or sample, which indicates the location that the shader should interpolate at. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2014-07-09 19:26:32 -04:00
Ilia Mirkin	0cbefc1bea	gallium: add vertex stream argument to EMIT/ENDPRIM Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2014-07-01 11:34:24 -04:00
Marek Olšák	1df7199fc9	gallium: implement ARB_texture_query_levels The extension is always supported if GLSL 1.30 is supported. Softpipe and llvmpipe support is also added (trivial). Radeon and nouveau support is already done. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2014-06-19 00:17:36 +02:00
Christoph Bumiller	4b586a26c8	gallium: create TGSI_PROPERTY to disable viewport and clipping Marek v2: add a cap Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2014-06-02 12:49:03 +02:00
Ilia Mirkin	a52eaba787	gallium: add new opcodes for ARB_gs5 bit manipulation support Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2014-04-28 19:04:46 -04:00
Ilia Mirkin	4be146b108	gallium: add GS_INVOCATIONS property Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-04-26 11:57:09 -04:00
Ilia Mirkin	76db20fc67	gallium: add INVOCATIONID semantic Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-04-26 11:56:39 -04:00
Ilia Mirkin	88d8d88d8c	gallium: add basic support for ARB_sample_shading Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2014-04-26 11:52:01 -04:00
Dave Airlie	be5276ae7d	gallium: add support for LODQ opcodes. This opcode provides support for GL_ARB_texture_query_lod. Signed-off-by: Dave Airlie <airlied@redhat.com> [imirkin: rebase, docs update] Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-04-07 01:06:18 -04:00
Ilia Mirkin	c13ff5a763	gallium/docs: fix silent math failures due to ~ and & Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Brian Paul <brianp@vmware.com>	2014-04-01 10:17:13 -04:00
Ilia Mirkin	b4cf180695	gallium/docs: line up some of the equations Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Brian Paul <brianp@vmware.com>	2014-04-01 10:17:13 -04:00
Ilia Mirkin	05d0223da3	gallium/docs: fix incorrect/missing references Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Brian Paul <brianp@vmware.com>	2014-04-01 10:17:13 -04:00
Ilia Mirkin	45e383bfae	gallium/docs: fix use of _ in math sections Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Brian Paul <brianp@vmware.com>	2014-04-01 10:17:13 -04:00
Ilia Mirkin	4ca110a7b9	gallium/docs: fix a lot of bad formatting Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Brian Paul <brianp@vmware.com>	2014-04-01 10:17:13 -04:00
Brian Paul	63e7b51912	gallium/docs: update SLT, SGE, SFL, STR opcode docs To emphasize that the result is floating point 1.0 or 0.0, to match other opcodes like SLE and SEQ. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2014-03-18 08:03:27 -06:00
Dave Airlie	2fcbec48d7	gallium: add texture gather support to gallium (v3) This adds support to gallium for a TG4 instruction, and two CAPs. The first CAP is required for GL_ARB_texture_gather. The second CAP is required to expose GL_ARB_gpu_shader5. However so far we haven't found any hardware that natively exposes the textureGatherOffsets feature from GL, so just lower it for now. If hardware appears for this we can add another CAP to allow TG4 to take 4 offsets. v2: add component selection src and a cap to say hw can do it. (st can use to help control GL_ARB_gpu_shader5/GLSL 4.00). Add docs. v3: rename to SM5, add docs. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2014-02-25 13:29:17 +10:00
Erik Faye-Lund	5125165dde	gallium/tgsi: correct typo propagated from NV_vertex_program1_1 In the specification text of NV_vertex_program1_1, the upper limit of the RCC instruction is written as 1.884467e+19 in scientific notation, but as 0x5F800000 in binary. But the binary version translates to 1.84467e+19 rather than 1.884467e+19 in scientific notation. Since the lower-limit equals 2^-64 and the binary version equals 2^+64, let's assume the value in scientific notation is a typo and implement this using the value from the binary version instead. Signed-off-by: Erik Faye-Lund <kusmabite@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2014-02-07 08:22:23 -07:00
José Fonseca	2d5f21ba65	gallium: Make TGSI_SEMANTIC_FOG register four-component wide. D3D9 Shader Model 2 restricted the fog register to one component, http://msdn.microsoft.com/en-us/library/windows/desktop/bb172945.aspx , but that restriction no longer exists in Shader Model 3, and several WHCK tests enforce that. So this change: - lifts the single-component restriction TGSI_SEMANTIC_FOG from Gallium interface - updates the Mesa state tracker to enforce output fog has (f, 0, 0, 1) - draw module was updated to leave TGSI_SEMANTIC_FOG output registers alone Several gallium drivers that are going out of their way to clear TGSI_SEMANTIC_FOG components could be simplified in the future. Thanks to Si Chen and Michal Krol for identifying the problem. Testing done: piglit fogcoord-*.vpfp tests Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2013-11-21 14:00:05 +00:00
Zack Rusin	6905698fc2	gallium: Add support for 32x32 muls with 64 bit results The code introduces two new 32bit integer multiplication opcodes which can be used to produce correct 64 bit results. GLSL, OpenCL and D3D10+ require them. We use two separate opcodes, because they match the behavior of GLSL and OpenCL, are a lot easier to add than a single opcode with multiple destinations and because there's not much (any) difference wrt code-generation. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: José Fonseca <jfonseca@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2013-10-09 18:30:20 -04:00
Roland Scheidegger	e7a5bf7a34	gallium: add new float comparison instructions returning integer masks Newer graphic languages don't want messy float mask results but instead true "boolean" mask results for float comparisons. Otherwise just need to convert the floats back to integers. Need to keep the old opcodes however due to both legacy (gl and d3d9) needing them and because older hw can't really deal with integers. These new FSEQ/FSGE/FSLT/FSNE opcodes are part of integer API and hence must be supported if a driver claims to support glsl 1.30 (or PIPE_SHADER_CAP_INTEGERS). Reviewed-by: Zack Rusin <zackr@vmware.com>	2013-08-13 19:09:17 +02:00
Roland Scheidegger	c8572a9457	gallium: clarify SVIEWINFO opcode This opcode is quite problematic in tgsi, while it tries to mirror d3d10 resinfo it can't really do what's stated there due to missing the crazy return type modifiers. Hence specify this is ignored along with the swizzle. (Other options would be to have multiple opcodes or specify the ret type modifier maybe in dst_reg as there's padding bits left there but it is the only instruction allowing this.) Reviewed-by: Zack Rusin <zackr@vmware.com>	2013-08-08 18:55:57 +02:00
Roland Scheidegger	606132b4de	gallium: clarify shift behavior with shift count >= 32 Previously, nothing was said what happens with shift counts exceeding bit width of the values to shift. In theory 3 behaviors are possible: 1) undefined (classic C definition) 2) just shift out all bits (so result is zero, or -1 potentially for ashr) 3) mask the shift count to bit width - 1 APIs either require 3) or are ok with 1). In particular, GLSL (as well as a couple uninteresting legacy GL extensions) is happy with undefined, whereas both OpenCL and d3d10 require 3). Consequently, most hw also implements 3). So, for simplicity we just specify that 3) is required rather than saying undefined and then needing state trackers to work around it. Also while here specify shift count as a vector, not scalar. As far as I can tell this was a doc bug, neither state trackers nor drivers used scalar shift count. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-08-02 03:49:57 +02:00
Roland Scheidegger	abcc40e7f0	gallivm: handle texel swizzles correctly for d3d10-style sample opcodes unlike OpenGL, the texel swizzle is embedded in the instruction, so honor that. (Technically we now execute both the sampler_view swizzle and the per-instruction swizzle but this should be quite ok.) v2: add documentation note as it's not obvious. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-07-27 16:41:29 +02:00
Zack Rusin	00cd455bd5	gallium: fixup definitions of the rsq and sqrt GLSL spec says that rsq is undefined for src<=0, but the D3D10 spec says it needs to be a NaN, so let's stop taking an absolute value of the source which completely breaks that behavior. For the gl program we can simply insert an extra abs instruction which produces the desired behavior there. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2013-07-11 20:19:04 -04:00
Brian Paul	46205ab8cc	tgsi: rename the TGSI fragment kill opcodes TGSI_OPCODE_KIL and KILP had confusing names. The former was conditional kill (if any src component < 0). The latter was unconditional kill. At one time KILP was supposed to work with NV-style condition codes/predicates but we never had that in TGSI. This patch renames both opcodes: TGSI_OPCODE_KIL -> KILL_IF (kill if src.xyzw < 0) TGSI_OPCODE_KILP -> KILL (unconditional kill) Note: I didn't just transpose the opcode names to help ensure that I didn't miss updating any code anywhere. I believe I've updated all the relevant code and comments but I'm not 100% sure that some drivers had this right in the first place. For example, the radeon driver might have llvm.AMDGPU.kill and llvm.AMDGPU.kilp mixed up. Driver authors should review their code. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-07-12 08:32:51 -06:00
Brian Paul	f501baabdb	tgsi: fix-up KILP comments KILP is really unconditional fragment kill. We've had KIL and KILP transposed forever. I'll fix that next. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-07-12 08:32:51 -06:00
Zack Rusin	5507c11f85	gallium/draw: add limits to the clip and cull distances There are strict limits on those registers. Define the maximums and use them instead of magic numbers. Also allows us to add some extra sanity checks. Suggested by Brian. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-06-13 12:13:11 -04:00
Zack Rusin	3d08eada34	gallium: add a cull distance semantic cull distance is analogous to clip distance. If a register is given this semantic, then the values in it are assumed to be a float32 distance to a plane. Primitives will be completely discarded if the plane distance for all of the vertices in the primitive are < 0. Signed-off-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2013-06-10 22:04:28 -04:00
Roland Scheidegger	6b53e2b038	gallium: add support for layered rendering Since pipe_surface already has all the necessary fields no interface changes are necessary except adding a new shader semantic value (TGSI_SEMANTIC_LAYER). (Note that what GL knows as "gl_Layer" variable d3d10 is naming "RENDER_TARGET_ARRAY_INDEX".) v2: drop cap bit (just tied to geometry shader), add docs.	2013-06-01 20:03:59 +02:00
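
Several of the commits above describe integer and float semantics that are easy to misread in prose. The following is a minimal C sketch of four of them, not code from the Mesa tree: masking the shift count to bit width - 1 (606132b4de), splitting a 32x32 multiply into low and high 32-bit halves (6905698fc2), float comparisons that return an integer mask instead of 1.0/0.0 (e7a5bf7a34), and the bit pattern 0x5F800000 decoding to 2^64 (5125165dde). All function names are hypothetical, and the all-ones encoding of "true" is an assumption that the log itself does not spell out.

```c
#include <stdint.h>
#include <string.h>

/* Shift left with the count masked to (bit width - 1), i.e. behavior 3)
 * from commit 606132b4de. A count of 33 therefore acts like a count of 1. */
static uint32_t shl_masked(uint32_t value, uint32_t count)
{
    return value << (count & 31u);
}

/* Logical shift right with the same masking rule. */
static uint32_t ushr_masked(uint32_t value, uint32_t count)
{
    return value >> (count & 31u);
}

/* Low 32 bits of an unsigned 32x32 multiply. */
static uint32_t umul_lo(uint32_t a, uint32_t b)
{
    return (uint32_t)((uint64_t)a * b);
}

/* High 32 bits of an unsigned 32x32 multiply. Together with umul_lo this
 * yields the full 64-bit product using two separate single-result
 * operations, the split described in commit 6905698fc2. */
static uint32_t umul_hi(uint32_t a, uint32_t b)
{
    return (uint32_t)(((uint64_t)a * b) >> 32);
}

/* Legacy-style comparison: the result is a float, 1.0 or 0.0. */
static float slt_float(float a, float b)
{
    return a < b ? 1.0f : 0.0f;
}

/* Integer-mask comparison in the spirit of commit e7a5bf7a34.
 * Assumption: "true" is encoded as all bits set, "false" as zero. */
static uint32_t fslt_mask(float a, float b)
{
    return a < b ? 0xFFFFFFFFu : 0u;
}

/* Reinterpret a 32-bit pattern as an IEEE 754 single-precision float.
 * memcpy avoids the aliasing problems of a pointer cast. */
static float bits_to_float(uint32_t bits)
{
    float f;
    memcpy(&f, &bits, sizeof f);
    return f;
}
```

Under these definitions, `shl_masked(1u, 33u)` equals `shl_masked(1u, 1u)`, and `bits_to_float(0x5F800000u)` compares equal to 2^64 (about 1.84467e+19), which is the value commit 5125165dde adopts in place of the 1.884467e+19 printed in the NV_vertex_program1_1 text.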

128 Commits