i965/fs: Use byte scattered read for 16-bit load_ssbo

Used to enable 16-bit reads at do_untyped_vector_read, that is used on
the following intrinsics:

   * nir_intrinsic_load_shared
   * nir_intrinsic_load_ssbo

v2: Removed use of stride = 2 on 16-bit sources (Jason Ekstrand)

v3: - Add bitsize to scattered read operation (Jason Ekstrand)
    - Remove implementation of 16-bit UBO read from this patch.
    - Avoid assertion at opt_algebraic caused by ADD of two IMM with
      offset with BRW_REGISTER_TYPE_UD type found on matrix tests.
      (Jose Maria Casanova)
v4: (Jason Ekstrand)
    - Put if case for 16-bits at the beginning of the if ladder.
    - Use type_sz(dest.type) * 8 as bit_size parameter for scattered read.

Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>
This commit is contained in:
Jose Maria Casanova Crespo 2017-07-01 08:20:07 +02:00
parent c57a3f200d
commit fa4a9d63bb
1 changed files with 13 additions and 1 deletions

View File

@ -2302,7 +2302,19 @@ do_untyped_vector_read(const fs_builder &bld,
const fs_reg offset_reg,
unsigned num_components)
{
if (type_sz(dest.type) == 4) {
if (type_sz(dest.type) <= 2) {
fs_reg read_offset = bld.vgrf(BRW_REGISTER_TYPE_UD);
bld.MOV(read_offset, offset_reg);
for (unsigned i = 0; i < num_components; i++) {
fs_reg read_reg =
emit_byte_scattered_read(bld, surf_index, read_offset,
1 /* dims */, 1,
type_sz(dest.type) * 8 /* bit_size */,
BRW_PREDICATE_NONE);
bld.MOV(offset(dest, bld, i), subscript(read_reg, dest.type, 0));
bld.ADD(read_offset, read_offset, brw_imm_ud(type_sz(dest.type)));
}
} else if (type_sz(dest.type) == 4) {
fs_reg read_result = emit_untyped_read(bld, surf_index, offset_reg,
1 /* dims */,
num_components,