KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Jonathan Gray	78fbb41fe3	configure.ac: add OpenBSD Signed-off-by: Jonathan Gray <jsg@jsg.id.au> Reviewed-by: Vinson Lee <vlee@freedesktop.org> Signed-off-by: Vinson Lee <vlee@freedesktop.org>	2013-07-17 21:06:46 -07:00
Vinson Lee	21f97446f4	glsl: Remove comma at end of enumerator list. Fixes this build error on OpenBSD 5.3. In file included from ../../src/mesa/main/ff_fragment_shader.cpp:53: ./../glsl/ir_optimization.h:64: error: comma at end of enumerator list Signed-off-by: Vinson Lee <vlee@freedesktop.org>	2013-07-17 20:57:54 -07:00
Vinson Lee	77311dab3a	mesa: Remove commas at end of enumerator lists. Fixes these build errors on OpenBSD 5.3. In file included from ../../src/mesa/main/errors.h:47, from ../../src/mesa/main/imports.h:41, from ../../src/mesa/main/ff_fragment_shader.cpp:32: ../../src/mesa/main/mtypes.h:3286: error: comma at end of enumerator list ../../src/mesa/main/mtypes.h:3296: error: comma at end of enumerator list ../../src/mesa/main/mtypes.h:3303: error: comma at end of enumerator list ../../src/mesa/main/mtypes.h:3356: error: comma at end of enumerator list Signed-off-by: Vinson Lee <vlee@freedesktop.org>	2013-07-17 20:57:53 -07:00
Carl Worth	ceaf1a74cb	docs: Import 9.1.5 release notes And add news item for the release.	2013-07-17 20:11:02 -07:00
Roland Scheidegger	7fd30a8621	gallivm: (trivial) simplify lp_build_cos/lp_build_sin a tiny bit Use "or" instead of "add" (this is a classic select sequence, which at least newer llvm versions can actually recognize (3.2+?), and the "add" might prevent that - and we really don't want an add instead of an or with avx if it isn't recognized (even without avx logic ops might be cheaper)). Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-07-17 18:16:34 +02:00
Roland Scheidegger	f0f9fb59c3	util/u_format_s3tc: handle srgb formats correctly. Instead of just ignoring the srgb/linear conversions, simply call the corresponding conversion functions, for all of pack/unpack/fetch, both for float and unorm8 versions (though some don't make a whole lot of sense, i.e. unorm8/unorm8 srgb/linear combinations). Refactored some functions a bit so don't have to duplicate all the code (there's a slight change for packing dxt1_rgb, as there will now be always 4 components initialized and sent to the external compression function so the same code can be used for all, the quite horrid and ad-hoc interface (by now) should always have worked with that). Fixes llvmpipe/softpipe piglit texwrap GL_EXT_texture_sRGB-s3tc. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-07-17 18:16:27 +02:00
Vadim Girlin	07baf9cfd1	r600g/sb: improve alu packing on cayman Scheduler/register allocator in r600-sb was developed and optimized on evergreen (VLIW-5) hardware, so currently it's not optimal for VLIW-4 chips. This patch should improve performance on cayman gpus due to better alu packing, but also it tends to increase register usage, so overall positive effect on performance has to be proven by real benchmarks yet. Some results with bfgminer kernel on cayman: source bytecode: 60 gprs, 3905 alu groups, sbcl before the patch: 45 gprs, 4088 alu groups, sbcl with this patch: 55 gprs, 3474 alu groups. Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-07-17 18:29:56 +04:00
Vadim Girlin	ba7fa4c4c9	r600g/sb: fix handling of new multislot instructions on cayman Ex-scalar instructions that became multislot on cayman do replicate result to all channels - handle them similar to DOT4. Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-07-17 18:27:31 +04:00
Vadim Girlin	033eec4145	r600g/sb: fix debug dump code in scheduler Update the stale debug code for other changes related to debug output. Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-07-17 18:27:31 +04:00
Vadim Girlin	44ebe7291c	r600g/sb: fix initial register allocation Mark values that are members of the 'same register' constraint as preallocated in ra_init pass, this will prevent incorrect reallocation in scheduler in some cases. Should fix https://bugs.freedesktop.org/show_bug.cgi?id=66713 Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-07-17 18:27:30 +04:00
Vadim Girlin	f0d881106a	r600g/sb: move chip & class name functions to sb_context Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-07-17 18:27:30 +04:00
Vadim Girlin	96efa4cdf4	r600g/sb: fix handling of PS in source bytecode on cayman Actually PS doesn't make sense for cayman and isn't even mentioned in cayman docs, but llvm backend currently uses it in bytecode and, assuming that hw seems to be mostly ok with it, this will allow sb to parse such source bytecode correctly. Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com>	2013-07-17 18:27:30 +04:00
Vinson Lee	81d3881367	r600g/sb: Initialize ra_checker member variables. Fixes "Uninitialized scalar field" defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org>	2013-07-17 18:27:30 +04:00
Emil Velikov	b20e0fb520	gallium/util: use explicily sized types for {un, }pack_rgba_{s, u}int Every function but the above four uses explicitly sized types for their src and dst arguments. Even fetch_rgba_{s,u}int follows the convention. Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Signed-off-by: Marek Olšák <maraeo@gmail.com>	2013-07-17 13:01:46 +02:00
Kyle McMartin	87c3440567	llvmpipe: use MCJIT on ARM and AArch64 MCJIT is the only supported LLVM JIT on AArch64 and ARM (the regular JIT has bit-rotted badly on ARM and doesn't exist on AArch64.) Signed-off-by: Kyle McMartin <kyle@redhat.com> Signed-off-by: Dave Airlie <airlied@gmail.com>	2013-07-17 17:29:01 +10:00
Kenneth Graunke	00d32cd5b4	glsl: Fix absurd whitespace conventions in the parser. Historically, we indented grammar production rules with a single 8-space tab, but code inside of blocks used Mesa's 3-space indents. This meant when editing code, you had to use an 8-space tab for the first level of indentation, and 3-spaces after that. Unless you specifically configure your editor to understand this, it will get the indentation wrong on every single line you touch, which quickly devolves into a colossal waste of time. It's also inconsistent with every other file in the entire project. This patch removes all tabs and moves to a consistent 3-space indent. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2013-07-16 11:31:58 -07:00
Kenneth Graunke	4ab7fc9ec3	glsl: Fail the build if the grammar contains shift/reduce errors. When working on a parser, it's very easy to accidentally introduce new shift/reduce conflicts. Failing the build guarantees they'll be noticed and fixed. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2013-07-16 11:31:58 -07:00
Kenneth Graunke	73620709c9	glsl: Silence the last shift/reduce conflict warning in the grammar. The single remaining shift/reduce conflict was the classic ELSE problem: 292 selection_rest_statement: statement . ELSE statement 293 \| statement . ELSE shift, and go to state 479 ELSE [reduce using rule 293 (selection_rest_statement)] $default reduce using rule 293 (selection_rest_statement) The correct behavior here is to shift, which is what happens by default. However, resolving it explicitly will make it possible to fail the build on new errors, making them much easier to detect. The classic way to solve this is to use right associativity: http://www.gnu.org/software/bison/manual/html_node/Non-Operators.html Since there is no THEN token in GLSL, we need to fake one. %right THEN creates a new terminal symbol; the %prec directive says to use the precedence of that terminal. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2013-07-16 11:31:58 -07:00
Vinson Lee	fa7829c36b	glsl: Initialize ast_jump_statement::opt_return_value. opt_return_value was not initialized if mode != ast_return. Fixes "Uninitialized pointer field" defect reported by Coverity. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2013-07-16 09:03:02 -07:00
Vinson Lee	f74acb9835	glapi: Do not use backtrace on OpenBSD. execinfo.h is not available on OpenBSD. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Chad Versace <chad.versace@linux.intel.com>	2013-07-16 09:00:38 -07:00
Maarten Lankhorst	b20b2b6dc8	osmesa: link against static libglapi library too to get the gl exports This should fix missing symbols in a osmesa built against shared glapi osmesa build. All opengl exports were missing that are defined in the static glapi, so link against both to fix this. This is a candidate for the stable series. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=47824 Signed-off-by: Maarten Lankhorst <maarten.lankhorst@canonical.com>	2013-07-16 10:18:40 +02:00
Chris Forbes	121ea0b38b	i965/Gen4: Zero extra coordinates for ir_tex We always emit U,V,R coordinates for this message, but the sampler gets very angry if we pass garbage in the R coordinate for at least some texture formats. Fill the remaining coordinates with zero instead. Fixes broken rendering on GM45 in Source games, and in VDrift. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=65236 NOTE: This is a candidate for stable branches. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-07-16 19:08:41 +12:00
Kenneth Graunke	e4fdf1b008	i965: Cite the Ivybridge PRM for 3DSTATE_CLEAR_PARAMS notes. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2013-07-15 19:40:53 -07:00
Kenneth Graunke	b72a298751	i965: Refer people to brw_tex_layout.c rather than the BSpec. brw_tex_layout.c sets up the align_w/h fields, and has all the appropriate spec references already. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2013-07-15 19:40:53 -07:00
Kenneth Graunke	4b704424e0	i965: Remove old BSpec reference from BLORP's 3DSTATE_WM/PS packets. The Sandybridge code had a citation for the range of the "Maximum Number of Threads" field, and the Ivybridge code just mentioned the "BSpec" in general. That's documented in the obvious place, so people can find it without a spec reference. The real value of the comment is to say "we tried zero, and it exploded, so program it to a valid number even if pixel shading is off." Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2013-07-15 19:40:52 -07:00
Kenneth Graunke	ada110716a	i965: Cite the Ivybridge PRM for 3DSTATE_URB_* programming. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2013-07-15 19:40:52 -07:00
Kenneth Graunke	90b5a03581	i965: Update workaround flush comments for Gen6 3DSTATE_VS. Unfortunately, the workaround text never made it into the Sandybridge PRM, so we still have to refer to the BSpec. It also wasn't obvious why we needed this workaround at all, since we don't currently do VS passthrough - but BLORP can turn off the VS. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2013-07-15 19:40:52 -07:00
Kenneth Graunke	3b3a440d2b	i965: Cite the Ivybridge PRM for VS PIPE_CONTROL workarounds. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2013-07-15 19:40:52 -07:00
Kenneth Graunke	9a86875c6b	i965: Cite the Sandybridge PRM for Gen7 stencil pitch requirements. Sadly, the Ivybridge PRM can't be cited, as it is missing the relevant text for some reason. However, the Sandybridge PRM has the text Chad originally quoted, and the modern BSpec has the same text. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2013-07-15 19:40:52 -07:00
Kenneth Graunke	2e928e2a3f	i965: Cite the Ivybridge PRM for multisample surface format notes. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2013-07-15 19:40:52 -07:00
Kenneth Graunke	43ea434225	i965: Delete "the data cache is the sampler cache" comments on Gen7+. I cut and pasted these comments from the Gen4 code during Ivybridge enabling, and didn't understand what they meant at the time. The data cache is NOT the same as the sampler cache on Ivybridge. The sampler cache has L1 and L2 caches in addition to the L3 cache, while data port messages to the "data cache" hit L3 directly. This means that the sampler domain is technically wrong, but we stopped caring about read/write domains quite a while ago. The kernel just flushes all the caches at the end of each batchbuffer, and our render to texture code flushes the sampler caches when necessary. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2013-07-15 19:40:52 -07:00
Kenneth Graunke	3f64cfabfc	i965: Cite the 965 PRM for "the data cache is the sampler cache". Presumably, this comment exists to justify the usage of I915_GEM_DOMAIN_SAMPLER for this relocation. At one point, this was necessary to ensure that the right flushing was done to keep caches coherent. These days, the kernel just flushes everything, so I don't think it matters. Still, the comment is interesting, so leave it in place. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2013-07-15 19:40:51 -07:00
Kenneth Graunke	f254c94204	i965: Cite the Ivybridge PRM for DP message descriptor fields. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2013-07-15 19:40:51 -07:00
Kenneth Graunke	a0c8e76202	i965: Cite the Ivybridge PRM for why the fake MRF range is what it is. The exact text is in the public docs, so we should cite those. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2013-07-15 19:40:51 -07:00
Kenneth Graunke	3090d39dde	i965: Cite the Ivybridge PRM for SFID enum values. The Ivybridge PRM adds new SFIDs and lists them in a different volume than Sandybridge, so it's worth adding a reference. I also removed the BSpec reference, as the section it referred to was moved somewhere, and I couldn't find it. This leaves one Haswell SFID without a citation, but we can add one once the PRMs are out. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2013-07-15 19:40:51 -07:00
Roland Scheidegger	dc1cc928ed	llvmpipe: support sRGB framebuffers Just use the new conversion functions to do the work. The way it's plugged in into the blend code is quite hacktastic but follows all the same hacks as used by packed float format already. Only support 4x8bit srgb formats (rgba/rgbx plus swizzle), 24bit formats never worked anyway in the blend code and are thus disabled, and I don't think anyone is interested in L8/L8A8. Would need even more hacks otherwise. Unless I'm missing something, this is the last feature except MSAA needed for OpenGL 3.0, and for OpenGL 3.1 as well I believe. v2: prettify a bit, use separate function for packing. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-07-16 01:54:51 +02:00
Marek Olšák	a882067d74	Revert "r300g: allow HiZ with a 16-bit zbuffer" This reverts commit `631c631cbf`. https://bugs.freedesktop.org/show_bug.cgi?id=66921 Cc: mesa-stable@lists.freedesktop.org	2013-07-15 23:46:01 +02:00
Marek Olšák	7969b567bd	r300g/swtcl: fix a lockup in MSAA resolve Cc: mesa-stable@lists.freedesktop.org	2013-07-15 23:45:22 +02:00
Marek Olšák	22427640b2	r300g/swtcl: fix geometry corruption by uploading indices to a buffer The splitting of a draw call into several draw commands was broken, because the split sometimes took place in the middle of a primitive. The splitting was supposed to be dealing with the case when there are more indices than the maximum size of a CS. This commit throws that code away and uses a real index buffer instead. https://bugs.freedesktop.org/show_bug.cgi?id=66558 Cc: mesa-stable@lists.freedesktop.org	2013-07-15 23:45:16 +02:00
Matt Turner	c889df3fbe	glsl: Reject C-style initializers with unknown types. _mesa_ast_set_aggregate_type walks through declarations initialized with C-style aggregate initializers and stops when it runs out of LHS declarations or RHS expressions. In the example vec4 v = {{{1, 2, 3, 4}}}; _mesa_ast_set_aggregate_type would not recurse into the subexpressions (since vec4s do not contain types that can be initialized with an aggregate initializer) to set their <constructor_type>s. Later in ::hir we would dereference the NULL pointer and segfault. If <constructor_type> is NULL in ::hir we know that the LHS and RHS were unbalanced and the code is illegal. Arrays, structs, and matrices were unaffected. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2013-07-15 13:02:36 -07:00
Paul Berry	7706e52b25	glsl: Rework builtin_variables.cpp to reduce code duplication. Previously, we had a separate function for setting up the built-in variables for each combination of shader stage and GLSL version (e.g. generate_110_vs_variables to generate the built-in variables for GLSL 1.10 vertex shaders). The functions called each other in ad-hoc ways, leading to unexpected inconsistencies (for example, generate_120_fs_variables was called for GLSL versions 1.20 and above, but generate_130_fs_variables was called only for GLSL version 1.30). In addition, it led to a lot of code duplication, since many varyings had to be duplicated in both the FS and VS code paths. With the advent of geometry shaders (and later, tessellation control and tessellation evaluation shaders), this code duplication was going to get a lot worse. So this patch reworks things so that instead of having a separate function for each shader type and GLSL version, we have a function for constants, one for uniforms, one for varyings, and one for the special variables that are specific to each shader type. In addition, we use a class, builtin_variable_generator, to keep track of the instruction exec_list, the GLSL parse state, commonly-used types, and a few other variables, so that we don't have to pass them around as function arguments. This makes the code a lot more compact. Where it was feasible to do so without introducing compilation errors, I've also gone ahead and introduced the variables needed for {ARB,EXT}_geometry_shader4 style geometry shaders. This patch takes care of everything except the GS variable gl_VerticesIn, the FS variable gl_PrimitiveID, and GLSL 1.50 style geometry shader inputs (using the gl_in interface block). Those remaining features will be added later. I've also made a slight nomenclature change: previously we used the word "deprecated" to refer to variables which are marked in GLSL 1.40 as requiring the ARB_compatibility extension, and are marked in GLSL 1.50 onward as requiring the compatibilty profile. This was misleading, since not all deprecated variables require the compatibility profile (for example gl_FragData and gl_FragColor, which have been deprecated since GLSL 1.30, but do not require the compatibility profile until GLSL 4.20). We now consistently use the word "compatibility" to refer to these variables. This patch doesn't introduce any functional changes (since geometry shaders haven't been enabled yet). Reviewed-by: Matt Turner <mattst88@gmail.com> v2: Rename "typ" -> "type". Add blank line between inline functions and declarations in builtin_variable_generator class. Use the standard comment "/* FALLTHROUGH */" for compatibility with static code analysis tools. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-07-15 09:35:28 -07:00
Paul Berry	428e030210	glsl: Fix lower_named_interface_blocks to account for dereferences of consts. In certain rare cases (such as those involving dereference of a literal constant array of structs), flatten_named_interface_blocks_declarations's rvalue visitor may be invoked on an ir_dereference_record whose variable_referenced() method returns NULL. Check for this case to avoid a segfault. Prevents crashes in piglit tests {vs,fs}-deref-literal-array-of-structs. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2013-07-15 07:59:52 -07:00
Paul Berry	b2265db8e7	glsl: Don't allow vertex shader input arrays until GLSL 1.50. Vertex shader inputs are not allowed to be arrays until GLSL 1.50. We were accidentally enabling them for GLSL 1.40 (although we haven't written any tests for them, so it's not clear whether they actually work). NOTE: although this is a simple bug fix, it probably isn't sensible to cherry-pick it to stable release branches, since its only effect is to cause incorrectly-written shaders to fail to compile. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-07-15 07:50:47 -07:00
Chris Forbes	b616d01661	i965: Gen4/5: use IEEE floating point mode for GLSL shaders. Fixes isinf(), isnan() from GLSL 1.30 Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-07-14 19:58:25 +12:00
Chris Forbes	1ec66f2fb2	i965/vs: Gen4/5: enable front colors if back colors are written Fixes undefined results if a back color is written, but the corresponding front color is not, and only backfacing primitives are drawn. Results are still undefined if a frontfacing primitive is drawn, but that's OK. The other reasonable way to fix this would have been to just pick the one color slot that was populated, but that dilutes the value of the tests. On Gen6+, the fixed function clipper and triangle setup already take care of this. Fixes 11 piglits: spec/glsl-1.10/execution/interpolation/interpolation-none-gl_BackColor- NOTE: This is a candidate for stable branches. Signed-off-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-07-14 19:58:11 +12:00
Roland Scheidegger	796b73d1fe	gallivm: (trivial) use constant instead of exp2f() function Some lame compilers can't do exp2f() and as far as I can tell they can't do exp2() (with doubles) neither so instead of providing some workaround for that (wouldn't actually be too bad just replace with pow) and since it is used with a constant only just use the precalculated constant.	2013-07-14 02:39:33 +02:00
Chia-I Wu	62c546bbf8	ilo: skip 3DSTATE_INDEX_BUFFER when possible When only the offset to the index buffer is changed, we can skip the 3DSTATE_INDEX_BUFFER if we always use 0 for the offset, and add (offset / index_size) to Start Vertex Location in 3DPRIMITIVE.	2013-07-14 05:59:52 +08:00
Roland Scheidegger	6bcbb0dc82	gallivm: handle srgb-to-linear and linear-to-srgb conversions srgb-to-linear is using 3rd degree polynomial for now which should be _just_ good enough. Reverse is using some rational polynomials and is quite accurate, though not hooked into llvmpipe's blend code yet and hence unused (untested). Using a table might also be an option (for srgb-to-linear especially). This does not enable any new features yet because EXT_texture_srgb was already supported via util_format fallbacks, but performance was lacking probably due to the external function call (the table used by the util_format_srgb code may not be all that much slower on its own). Some performance figures (taken from modified gloss, replaced both base and sphere texture to use GL_SRGB instead of GL_RGB, measured on 1Ghz Sandy Bridge, the numbers aren't terribly accurate): normal gloss, aos, 8-wide: 47 fps normal gloss, aos, 4-wide: 48 fps normal gloss, forced to soa, 8-wide: 48 fps normal gloss, forced to soa, 4-wide: 47 fps patched gloss, old code, soa, 8-wide: 21 fps patched gloss, old code, soa, 4-wide: 24 fps patched gloss, new code, soa, 8-wide: 41 fps patched gloss, new code, soa, 4-wide: 38 fps So there's a performance hit but it seems acceptable, certainly better than using the fallback. Note the new code only works for 4x8bit srgb formats, others (L8/L8A8) will continue to use the old util_format fallback, because I can't be bothered to write code for formats noone uses anyway (as decoding is done as part of lp_build_unpack_rgba_soa which can only handle block type width of 32). Compressed srgb formats should get their own path though eventually (it is going to be expensive in any case, first decompress, then convert). No piglit regressions. v2: use lp_build_polynomial instead of ad-hoc polynomial construction, also since keeping both linear to srgb functions for now make sure both are compiled (since they share quite some code just integrate into the same function). v3: formatting fixes and bugfix in the complicated (disabled) linear-to-srgb path. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-07-13 18:42:17 +02:00
Roland Scheidegger	9b8d97e5bf	gallivm: better support for fast rsqrt We had to disable fast rsqrt before because it wasn't precise enough etc. However in situations when we know we're not going to need more precision we can still use a fast rsqrt (which can be several times faster than the quite expensive sqrt). Hence introduce a new helper which does exactly that - it is probably not useful calling it in some situations if there's no fast rsqrt available so make it queryable if it's available too. v2: use fast_rsqrt consistently instead of rsqrt_fast, fix indentation, let rsqrt use fast_rsqrt. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2013-07-13 18:42:17 +02:00
Klemens Baum	45574ab2e9	configure.ac: better detection of LLVM version Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2013-07-12 21:20:59 -07:00

1 2 3 4 5 ...

57598 Commits All Branches Search

57598 Commits

All Branches