i965: Unroll SIMD16 DDY_FINE on Sandybridge.
This fixes 10 dEQP-GLES3 subtests: dEQP-GLES3.functional.shaders.derivate.dfdy.texture.float_nicest.*. Matt noticed that our Piglit tests for this use even numbered registers, while the failing dEQP tests use odd numbered registers. We believe that it works for even numbered registers, but not otherwise. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>
This commit is contained in:
parent
e915903c10
commit
21b4bcdd05
|
@ -1138,12 +1138,16 @@ fs_generator::generate_ddy(enum opcode opcode,
|
|||
*
|
||||
* Similar text exists in the g45 PRM.
|
||||
*
|
||||
* Empirically, compressed align16 instructions using odd register
|
||||
* numbers don't appear to work on Sandybridge either.
|
||||
*
|
||||
* On these platforms, if we're building a SIMD16 shader, we need to
|
||||
* manually unroll to a pair of SIMD8 instructions.
|
||||
*/
|
||||
bool unroll_to_simd8 =
|
||||
(dispatch_width == 16 &&
|
||||
(devinfo->gen == 4 || (devinfo->gen == 7 && !devinfo->is_haswell)));
|
||||
(devinfo->gen == 4 || devinfo->gen == 6 ||
|
||||
(devinfo->gen == 7 && !devinfo->is_haswell)));
|
||||
|
||||
/* produce accurate derivatives */
|
||||
struct brw_reg src0 = brw_reg(src.file, src.nr, 0,
|
||||
|
|
Loading…
Reference in New Issue