mesa/src/compiler/nir
Emma Anholt b024102d7c freedreno/ir3: Use nir_opt_offset for removing constant adds for shared vars.
Saves some work in carchase and manhattan31:

instructions in affected programs: 2842 -> 2818 (-0.84%)
nops in affected programs: 1131 -> 1105 (-2.30%)
non-nops in affected programs: 1236 -> 1238 (0.16%)
mov in affected programs: 57 -> 61 (7.02%)
dwords in affected programs: 2144 -> 2150 (0.28%)
cat0 in affected programs: 1195 -> 1169 (-2.18%)
cat1 in affected programs: 151 -> 155 (2.65%)
cat2 in affected programs: 142 -> 140 (-1.41%)
sstall in affected programs: 190 -> 178 (-6.32%)
(ss) in affected programs: 63 -> 63 (0.00%)
systall in affected programs: 532 -> 511 (-3.95%)

Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14023>
2022-01-16 19:11:29 +00:00
..
tests nir/algebraic: Move relocations for expression conds to a table. 2021-12-07 07:09:00 +00:00
README
meson.build Revert "nir: disable a NIR test due to undebuggable & locally unreproducible CI failures" 2021-12-15 23:28:09 +00:00
nir.c nir: add load_mesh_view_count and load_mesh_view_indices intrinsics 2022-01-11 22:45:23 +00:00
nir.h nir/algebraic: Separate has_dot_4x8 into has_sdot_4x8 and has_udot_4x8 2022-01-10 13:20:39 +02:00
nir_algebraic.py nir/algebraic: Move all the individual transforms to a common table. 2021-12-07 07:09:00 +00:00
nir_builder.c nir: Make nir_build_alu() variants per 1-4 arg count. 2021-12-01 22:12:19 +00:00
nir_builder.h nir: Make nir_build_alu() variants per 1-4 arg count. 2021-12-01 22:12:19 +00:00
nir_builder_opcodes_h.py nir_to_tgsi: Enable fdot_replicates flag. 2022-01-07 09:58:24 +00:00
nir_builtin_builder.c nir: Drop the unused instr arg for src/dest copy functions. 2021-09-14 17:53:06 +00:00
nir_builtin_builder.h nir/builder: Move clamp helpers to nir_builder.h 2021-05-04 22:51:34 +00:00
nir_clone.c nir: Switch from ralloc to malloc for NIR instructions. 2021-09-14 17:53:06 +00:00
nir_constant_expressions.h
nir_constant_expressions.py python: drop python2 support 2021-08-14 21:44:32 +00:00
nir_control_flow.c nir/cf: fix insertion of loops/ifs after jumps 2021-11-29 22:22:24 +00:00
nir_control_flow.h nir/lower_returns: Append missing phis' sources after "break" insertion 2020-11-02 14:12:21 +00:00
nir_control_flow_private.h
nir_conversion_builder.h nir: Update saturated float->int/uint conversion algorithm 2021-01-05 19:46:25 +00:00
nir_convert_ycbcr.c nir,glsl_to_nir: use nir_fdot() 2021-08-16 17:19:45 +00:00
nir_deref.c nir/opt_deref: don't try to cast empty structures 2021-12-01 08:24:39 +00:00
nir_deref.h nir/deref: add helpers to lazily create paths 2020-11-20 13:57:34 +00:00
nir_divergence_analysis.c nir: Add a new sample_pos_or_center system value 2021-12-17 16:02:16 +00:00
nir_dominance.c nir/dominance: Use _mesa_set_clear instead ofhand-rolling it 2020-09-30 16:46:11 +00:00
nir_format_convert.h nir/format_convert: add ssa version of uint packing 2021-07-07 13:41:37 +00:00
nir_from_ssa.c nir: Fix read depth for predecessors 2021-11-30 00:12:48 +00:00
nir_gather_info.c nir: remove invalid assert affecting per-view variables 2022-01-11 22:45:23 +00:00
nir_gather_ssa_types.c nir: Add a nir_src_is_undef() helper, like nir_src_is_const(). 2021-03-03 00:51:44 +00:00
nir_gather_xfb_info.c
nir_group_loads.c nir: add new SSA instruction scheduler grouping loads into indirection groups 2021-11-08 21:20:11 +00:00
nir_gs_count_vertices.c nir: Add ability to count primitives per stream. 2020-10-09 15:26:14 +02:00
nir_inline_functions.c nir/inline_functions: Handle halting functions. 2021-08-13 21:18:13 +00:00
nir_inline_helpers.h nir: fix build at -O1 2021-02-26 21:54:53 +00:00
nir_inline_uniforms.c nir/inline_uniforms: support loop 2021-08-19 02:17:35 +00:00
nir_instr_set.c nir: use a single set during CSE 2021-06-15 17:57:07 +00:00
nir_instr_set.h nir: use a single set during CSE 2021-06-15 17:57:07 +00:00
nir_intrinsics.py nir: add load_mesh_view_count and load_mesh_view_indices intrinsics 2022-01-11 22:45:23 +00:00
nir_intrinsics_c.py python: drop explicit output_encoding='utf-8' in mako templates 2021-08-14 21:44:32 +00:00
nir_intrinsics_h.py python: drop explicit output_encoding='utf-8' in mako templates 2021-08-14 21:44:32 +00:00
nir_intrinsics_indices_h.py python: drop explicit output_encoding='utf-8' in mako templates 2021-08-14 21:44:32 +00:00
nir_linking_helpers.c nir: Fix sorting per-primitive outputs. 2021-12-03 17:06:47 +00:00
nir_liveness.c nir: Add a nir_src_is_undef() helper, like nir_src_is_const(). 2021-03-03 00:51:44 +00:00
nir_loop_analyze.c nir/loop_analyze: consider instruction cost of nir_op_flrp 2021-08-24 16:10:30 +00:00
nir_loop_analyze.h nir: return false for loops in contains_other_jump() 2021-08-19 13:51:17 +00:00
nir_lower_alpha_test.c nir: use intrinsic builders 2021-01-06 14:34:41 +00:00
nir_lower_alu.c nir: Make nir_ssa_def_rewrite_uses take an SSA value 2021-03-08 16:59:55 +00:00
nir_lower_alu_to_scalar.c nir: Drop the unused instr arg for src/dest copy functions. 2021-09-14 17:53:06 +00:00
nir_lower_amul.c nir/lower_amul: do not lower 64bit amul to imul24 2021-10-21 18:59:57 +00:00
nir_lower_array_deref_of_vec.c nir: Make nir_ssa_def_rewrite_uses_after take an SSA value 2021-03-08 16:59:55 +00:00
nir_lower_atomics_to_ssbo.c nir: Drop the unused instr arg for src/dest copy functions. 2021-09-14 17:53:06 +00:00
nir_lower_bit_size.c nir: Drop the unused instr arg for src/dest copy functions. 2021-09-14 17:53:06 +00:00
nir_lower_bitmap.c nir: Use sized types for nir_tex_instr::dest_type 2021-01-25 11:21:48 +01:00
nir_lower_blend.c nir/lower_blend: Use correct clamp for SNORM 2021-10-26 19:16:36 +00:00
nir_lower_blend.h Convert a few files to UTF-8 2021-07-12 23:45:34 +00:00
nir_lower_bool_to_bitsize.c nir/lower_bool: Rewrite dest_type for boolean destinations 2021-01-25 11:21:42 +01:00
nir_lower_bool_to_float.c nir: Make nir_ssa_def_rewrite_uses take an SSA value 2021-03-08 16:59:55 +00:00
nir_lower_bool_to_int32.c nir: add 32-bit bool of fisfinite 2021-08-06 12:06:21 +10:00
nir_lower_clamp_color_outputs.c
nir_lower_clip.c nir/lower_clip: support clipdist array + no vars 2021-11-28 04:44:56 +00:00
nir_lower_clip_cull_distance_arrays.c nir: handle per-view clip/cull distances 2022-01-11 22:45:23 +00:00
nir_lower_clip_disable.c nir/lower_clip_disable: Fix store writemask 2021-04-26 17:07:02 +00:00
nir_lower_clip_halfz.c
nir_lower_convert_alu_types.c nir: Make nir_ssa_def_rewrite_uses take an SSA value 2021-03-08 16:59:55 +00:00
nir_lower_discard_or_demote.c nir/lower_discard_or_demote: Fix metadata 2021-10-08 23:24:49 +00:00
nir_lower_double_ops.c
nir_lower_drawpixels.c nir: Make nir_ssa_def_rewrite_uses take an SSA value 2021-03-08 16:59:55 +00:00
nir_lower_fb_read.c nir: Make nir_ssa_def_rewrite_uses take an SSA value 2021-03-08 16:59:55 +00:00
nir_lower_flatshade.c
nir_lower_flrp.c util/vector: make util_vector_init harder to misuse 2021-10-08 00:15:11 +00:00
nir_lower_fp16_conv.c nir: lower 64-bit floats to 32-bit first. 2021-03-22 12:17:14 +10:00
nir_lower_fragcolor.c nir/lower_fragcolor: Avoid redundant load_output 2021-06-09 02:58:08 +00:00
nir_lower_fragcoord_wtrans.c compiler/nir: check whether var is an input in lower_fragcoord_wtrans 2021-05-14 13:26:13 +00:00
nir_lower_frexp.c nir: Make nir_ssa_def_rewrite_uses take an SSA value 2021-03-08 16:59:55 +00:00
nir_lower_global_vars_to_local.c
nir_lower_goto_ifs.c Revert "nir/lower_goto_if: Add a route::outside set" 2020-09-30 16:46:11 +00:00
nir_lower_gs_intrinsics.c nir/lower_gs_intrinsics: Make nir_lower_gs_intrinsics be idempotent 2021-09-14 09:13:07 -07:00
nir_lower_idiv.c nir/lower_idiv: make lowered divisions exact 2021-04-12 16:19:46 +00:00
nir_lower_image.c nir: Switch from ralloc to malloc for NIR instructions. 2021-09-14 17:53:06 +00:00
nir_lower_indirect_derefs.c nir: Drop the unused instr arg for src/dest copy functions. 2021-09-14 17:53:06 +00:00
nir_lower_input_attachments.c nir: add _amd suffix to fragment_mask_fetch and fragment_fetch texops 2021-10-07 15:36:39 +00:00
nir_lower_int64.c Convert most remaining free-form fall-through comments to FALLTHROUGH 2021-04-15 16:01:22 +00:00
nir_lower_int_to_float.c nir: preserve all metadata when nir_lower_int_to_float doesn't make progress 2021-10-05 10:02:54 +00:00
nir_lower_interpolation.c nir: Make nir_ssa_def_rewrite_uses take an SSA value 2021-03-08 16:59:55 +00:00
nir_lower_io.c nir_lower_io: propagate the "invariant" flag to outputs 2022-01-07 16:35:43 +00:00
nir_lower_io_arrays_to_elements.c nir: Drop the unused instr arg for src/dest copy functions. 2021-09-14 17:53:06 +00:00
nir_lower_io_to_scalar.c nir/lower_io_to_scalar: add support for bo and shared io 2021-10-27 16:46:01 +00:00
nir_lower_io_to_temporaries.c nir: Don't lower Task/Mesh I/O to temporaries 2021-08-28 03:56:43 +00:00
nir_lower_io_to_vector.c nir/lower_io_to_vector: Allow Task/Mesh to load from outputs 2021-09-24 14:35:15 +00:00
nir_lower_is_helper_invocation.c nir: add lowering pass for helperInvocationEXT() 2021-04-19 17:11:36 +00:00
nir_lower_load_const_to_scalar.c nir: Make nir_ssa_def_rewrite_uses take an SSA value 2021-03-08 16:59:55 +00:00
nir_lower_locals_to_regs.c nir: Drop the unused instr arg for src/dest copy functions. 2021-09-14 17:53:06 +00:00
nir_lower_mediump.c nir/fold_16bit_sampler_conversions: skip sparse residency tex instructions 2021-11-15 18:28:20 +00:00
nir_lower_memcpy.c nir: Make nir_deref_instr::mode a bitfield 2020-11-03 22:18:28 +00:00
nir_lower_memory_model.c nir: handle float atomics in nir_lower_memory_model 2021-05-12 11:09:07 +00:00
nir_lower_multiview.c nir: Add image atomic_fmin/fmax intrinsics 2021-03-18 00:13:40 +00:00
nir_lower_non_uniform_access.c nir/lower_non_uniform: allow lowering with vec2 handles 2021-04-27 15:56:07 +00:00
nir_lower_packing.c nir: intel/compiler: Add and use nir_op_pack_32_4x8_split 2021-08-18 22:03:37 +00:00
nir_lower_passthrough_edgeflags.c nir/edgeflags: Add a flag to indicate the edge flag input is needed 2021-09-17 16:36:08 -07:00
nir_lower_patch_vertices.c nir: Make nir_ssa_def_rewrite_uses take an SSA value 2021-03-08 16:59:55 +00:00
nir_lower_phis_to_scalar.c nir: Drop the unused instr arg for src/dest copy functions. 2021-09-14 17:53:06 +00:00
nir_lower_pntc_ytransform.c nir/lower_pntc_ytransform: Support PointCoordIsSysval 2021-11-12 12:34:14 +00:00
nir_lower_point_size.c
nir_lower_point_size_mov.c nir/lower_point_size_mov: zero nir_state_slot::swizzle in new variable 2021-07-20 16:34:51 +00:00
nir_lower_printf.c nir: Make nir_ssa_def_rewrite_uses take an SSA value 2021-03-08 16:59:55 +00:00
nir_lower_readonly_images_to_tex.c spirv: Use texture types for sampled images 2021-10-16 05:49:34 +00:00
nir_lower_regs_to_ssa.c nir: Drop nir_ssa_def::name and nir_register::name 2021-07-08 17:34:41 +00:00
nir_lower_returns.c nir/lower_returns: Deal with single-arg phis after if. 2021-06-08 11:29:53 +00:00
nir_lower_samplers.c
nir_lower_scratch.c nir/lower_scratch: Ensure we don't lower vars with unsupported usage. 2021-08-13 20:56:30 +00:00
nir_lower_shader_calls.c nir/lower_shader_calls: fix store_scratch write_mask 2022-01-10 19:01:04 +00:00
nir_lower_ssbo.c nir: Drop the unused instr arg for src/dest copy functions. 2021-09-14 17:53:06 +00:00
nir_lower_subgroups.c nir/lower_subgroups: fix left shift of -1 2021-11-24 16:45:05 +00:00
nir_lower_system_values.c nir: add load_mesh_view_count and load_mesh_view_indices intrinsics 2022-01-11 22:45:23 +00:00
nir_lower_sysvals_to_varyings.c nir: Add a nir_sysvals_to_varyings() helper 2021-10-07 19:45:35 +00:00
nir_lower_tex.c nir/lower_tex: Add filter for tex offset lowering 2021-12-13 16:56:23 -08:00
nir_lower_texcoord_replace.c compiler/nir: Increment shader input count and mark as used when adding new gl_PointCoord 2021-03-09 21:24:35 +00:00
nir_lower_to_source_mods.c nir: add nir_ssa_def_is_unused() 2021-03-01 17:38:10 +00:00
nir_lower_two_sided_color.c nir: Make nir_ssa_def_rewrite_uses take an SSA value 2021-03-08 16:59:55 +00:00
nir_lower_ubo_vec4.c nir/lower_ubo_vec4: Fix align_mul=8 special case 2021-10-12 11:30:52 +00:00
nir_lower_undef_to_zero.c nir: Make nir_ssa_def_rewrite_uses take an SSA value 2021-03-08 16:59:55 +00:00
nir_lower_uniforms_to_ubo.c nir/nir_lower_uniforms_to_ubo: Set the explicit stride of the UBO 0 uniform. 2021-08-31 20:12:16 +00:00
nir_lower_var_copies.c nir: Add a nir_instr_free() to replace ralloc_free(instr). 2021-09-14 17:53:05 +00:00
nir_lower_variable_initializers.c nir: Move workgroup_size and workgroup_variable_size into common shader_info 2021-06-08 09:23:55 -07:00
nir_lower_vars_to_ssa.c nir: Make nir_ssa_def_rewrite_uses take an SSA value 2021-03-08 16:59:55 +00:00
nir_lower_vec3_to_vec4.c nir: Make nir_ssa_def_rewrite_uses_after take an SSA value 2021-03-08 16:59:55 +00:00
nir_lower_vec_to_movs.c nir: Drop the unused instr arg for src/dest copy functions. 2021-09-14 17:53:06 +00:00
nir_lower_viewport_transform.c nir/lower_viewport_transform: Allow geom/tess 2021-03-07 17:57:04 +00:00
nir_lower_wpos_center.c anv,nir: Use sample_pos_or_center in lower_wpos_center 2021-12-17 16:02:16 +00:00
nir_lower_wpos_ytransform.c nir: Make nir_ssa_def_rewrite_uses_after take an SSA value 2021-03-08 16:59:55 +00:00
nir_lower_wrmasks.c
nir_metadata.c nir: Introduce nir_metadata_instr_index for nir_index_instr() being current. 2020-10-20 08:53:36 -07:00
nir_move_vec_src_uses_to_dest.c
nir_normalize_cubemap_coords.c
nir_opcodes.py nir: fix constant expression of ibitfield_extract 2021-11-16 17:32:21 +00:00
nir_opcodes_c.py python: drop python2 support 2021-08-14 21:44:32 +00:00
nir_opcodes_h.py python: drop python2 support 2021-08-14 21:44:32 +00:00
nir_opt_access.c nir/opt_access: infer CAN_REORDER for global access 2021-12-17 18:51:24 +00:00
nir_opt_algebraic.py nir/algebraic: Separate has_dot_4x8 into has_sdot_4x8 and has_udot_4x8 2022-01-10 13:20:39 +02:00
nir_opt_barriers.c
nir_opt_combine_stores.c nir: Add lowered vendor independent raytracing intrinsics. 2021-06-21 21:23:51 +00:00
nir_opt_comparison_pre.c util/vector: make util_vector_init harder to misuse 2021-10-08 00:15:11 +00:00
nir_opt_conditional_discard.c nir: Add nir_intrinsic_terminate and nir_intrinsic_terminate_if 2020-10-15 21:40:09 +00:00
nir_opt_constant_folding.c nir/constant_folding: Optimize txb with bias of constant zero to tex 2021-12-06 19:50:42 +00:00
nir_opt_copy_prop_vars.c nir: Add lowered vendor independent raytracing intrinsics. 2021-06-21 21:23:51 +00:00
nir_opt_copy_propagate.c spirv: run nir_copy_prop before nir_rematerialize_derefs_in_use_blocks_impl 2021-11-24 15:43:51 +00:00
nir_opt_cse.c nir/cse: resize the instruction set 2021-06-15 17:57:07 +00:00
nir_opt_dce.c nir/dce: fix DCE of loops with a halt or return instruction in the pre-header 2021-11-29 22:22:24 +00:00
nir_opt_dead_cf.c nir_opt_dead_cf: Remove dead ifs 2022-01-07 05:15:48 +00:00
nir_opt_dead_write_vars.c nir: Add lowered vendor independent raytracing intrinsics. 2021-06-21 21:23:51 +00:00
nir_opt_find_array_copies.c nir/find_array_copies: Don't assume all children exist 2020-11-04 05:57:07 +00:00
nir_opt_fragdepth.c nir: add a pass to optimize "gl_FragDepth = gl_FragCoord.z" away 2021-08-11 11:00:11 +02:00
nir_opt_gcm.c nir/gcm: pin some instructions which require uniform sources 2021-08-24 16:52:31 +00:00
nir_opt_idiv_const.c nir/idiv_const: optimize imod/irem 2021-08-09 11:00:39 +00:00
nir_opt_if.c nir/opt_if: also merge break statements with ones after the branch 2022-01-13 02:30:32 +00:00
nir_opt_intrinsics.c nir: Make nir_ssa_def_rewrite_uses take an SSA value 2021-03-08 16:59:55 +00:00
nir_opt_large_constants.c nir: Make nir_ssa_def_rewrite_uses take an SSA value 2021-03-08 16:59:55 +00:00
nir_opt_load_store_vectorize.c nir/opt_load_store_vectorize: improve ssbo/global alias analysis 2021-12-17 18:51:24 +00:00
nir_opt_loop_unroll.c nir/loop_unroll: Always unroll loops that iterate at most once 2021-10-13 20:11:13 -07:00
nir_opt_memcpy.c nir: fix opt_memcpy src/dst mixup 2021-09-28 16:36:08 +00:00
nir_opt_move.c nir: refactor nir_opt_move 2022-01-12 13:41:54 +00:00
nir_opt_move_discards_to_top.c nir: Add a discard optimization pass 2021-05-19 18:04:44 +00:00
nir_opt_offsets.c freedreno/ir3: Use nir_opt_offset for removing constant adds for shared vars. 2022-01-16 19:11:29 +00:00
nir_opt_peephole_select.c nir: Add a new sample_pos_or_center system value 2021-12-17 16:02:16 +00:00
nir_opt_phi_precision.c nir: Move phi src setup to a helper. 2021-08-13 16:11:57 +00:00
nir_opt_ray_queries.c nir: add a ray query optimization pass 2021-12-04 20:46:35 +00:00
nir_opt_rematerialize_compares.c
nir_opt_remove_phis.c nir: Make nir_ssa_def_rewrite_uses take an SSA value 2021-03-08 16:59:55 +00:00
nir_opt_shrink_vectors.c nir/shrink_vectors: shrink vecN properly 2021-07-26 09:24:37 +00:00
nir_opt_sink.c nir/nir_opt_move,sink: Include load_ubo_vec4 as a load_ubo instr. 2021-12-11 02:12:27 +00:00
nir_opt_trivial_continues.c
nir_opt_undef.c nir: Drop the unused instr arg for src/dest copy functions. 2021-09-14 17:53:06 +00:00
nir_opt_uniform_atomics.c nir/uniform_atomics: fix is_atomic_already_optimized without workgroups 2022-01-10 19:57:38 +00:00
nir_opt_vectorize.c nir: preserve all metadata when nir_opt_vectorize doesn't make progress 2021-10-05 10:02:54 +00:00
nir_phi_builder.c nir: Move phi src setup to a helper. 2021-08-13 16:11:57 +00:00
nir_phi_builder.h
nir_print.c nir/print: print const value near each use of const ssa variable 2021-12-17 10:04:50 +00:00
nir_propagate_invariant.c nir: preserve all metadata when nir_propagate_invariant doesn't make progress 2021-10-05 10:02:54 +00:00
nir_range_analysis.c nir: Fix local_invocation_index upper bound for non-compute-like stages. 2021-08-30 14:05:33 +00:00
nir_range_analysis.h nir/range_analysis: Add "is a number" range analysis tracking 2021-03-11 22:00:30 +00:00
nir_remove_dead_variables.c nir: Two shared memory *blocks* may alias each other 2021-01-27 22:20:53 +00:00
nir_repair_ssa.c nir: Make nir_deref_instr::mode a bitfield 2020-11-03 22:18:28 +00:00
nir_schedule.c util/dag: Make edge data a uintptr_t 2021-11-17 13:41:47 +00:00
nir_schedule.h
nir_search.c nir/algebraic: Move all the individual transforms to a common table. 2021-12-07 07:09:00 +00:00
nir_search.h nir/algebraic: Move all the individual transforms to a common table. 2021-12-07 07:09:00 +00:00
nir_search_helpers.h nir/algebraic: optimize more 64-bit imul with constant source 2021-12-17 18:51:24 +00:00
nir_serialize.c nir: serialize divergent fields 2021-12-11 20:07:35 +00:00
nir_serialize.h
nir_split_per_member_structs.c nir: Make nir_ssa_def_rewrite_uses take an SSA value 2021-03-08 16:59:55 +00:00
nir_split_var_copies.c
nir_split_vars.c nir: track variables representing ray queries 2021-12-04 20:46:35 +00:00
nir_sweep.c nir: Stop sweeping indirects 2021-09-16 11:28:36 +00:00
nir_to_lcssa.c nir: Move phi src setup to a helper. 2021-08-13 16:11:57 +00:00
nir_validate.c nir: handle per-view clip/cull distances 2022-01-11 22:45:23 +00:00
nir_vla.h
nir_vulkan.h
nir_worklist.c nir: Add a nir_instr_remove that recursively removes dead code. 2021-07-06 11:24:43 -07:00
nir_worklist.h util/vector: make util_vector_init harder to misuse 2021-10-08 00:15:11 +00:00
nir_xfb_info.h

README

New IR, or NIR, is an IR for Mesa intended to sit below GLSL IR and Mesa IR.
Its design inherits from the various IRs that Mesa has used in the past, as
well as Direct3D assembly, and it includes a few new ideas as well. It is a
flat (in terms of using instructions instead of expressions), typeless IR,
similar to TGSI and Mesa IR.  It also supports SSA (although it doesn't require
it).

Variables
=========

NIR includes support for source-level GLSL variables through a structure mostly
copied from GLSL IR. These will be used for linking and conversion from GLSL IR
(and later, from an AST), but for the most part, they will be lowered to
registers (see below) and loads/stores.

Registers
=========

Registers are light-weight; they consist of a structure that only contains its
size, its index for liveness analysis, and an optional name for debugging. In
addition, registers can be local to a function or global to the entire shader;
the latter will be used in ARB_shader_subroutine for passing parameters and
getting return values from subroutines. Registers can also be an array, in which
case they can be accessed indirectly. Each ALU instruction (add, subtract, etc.)
works directly with registers or SSA values (see below).

SSA
========

Everywhere a register can be loaded/stored, an SSA value can be used instead.
The only exception is that arrays/indirect addressing are not supported with
SSA; although research has been done on extensions of SSA to arrays before, it's
usually for the purpose of parallelization (which we're not interested in), and
adds some overhead in the form of adding copies or extra arrays (which is much
more expensive than introducing copies between non-array registers). SSA uses
point directly to their corresponding definition, which in turn points to the
instruction it is part of. This creates an implicit use-def chain and avoids the
need for an external structure for each SSA register.

Functions
=========

Support for function calls is mostly similar to GLSL IR. Each shader contains a
list of functions, and each function has a list of overloads. Each overload
contains a list of parameters, and may contain an implementation which specifies
the variables that correspond to the parameters and return value. Inlining a
function, assuming it has a single return point, is as simple as copying its
instructions, registers, and local variables into the target function and then
inserting copies to and from the new parameters as appropriate. After functions
are inlined and any non-subroutine functions are deleted, parameters and return
variables will be converted to global variables and then global registers. We
don't do this lowering earlier (i.e. the fortranizer idea) for a few reasons:

- If we want to do optimizations before link time, we need to have the function
signature available during link-time.

- If we do any inlining before link time, then we might wind up with the
inlined function and the non-inlined function using the same global
variables/registers which would preclude optimization.

Intrinsics
=========

Any operation (other than function calls and textures) which touches a variable
or is not referentially transparent is represented by an intrinsic. Intrinsics
are similar to the idea of a "builtin function," i.e. a function declaration
whose implementation is provided by the backend, except they are more powerful
in the following ways:

- They can also load and store registers when appropriate, which limits the
number of variables needed in later stages of the IR while obviating the need
for a separate load/store variable instruction.

- Intrinsics can be marked as side-effect free, which permits them to be
treated like any other instruction when it comes to optimizations. This allows
load intrinsics to be represented as intrinsics while still being optimized
away by dead code elimination, common subexpression elimination, etc.

Intrinsics are used for:

- Atomic operations
- Memory barriers
- Subroutine calls
- Geometry shader emitVertex and endPrimitive
- Loading and storing variables (before lowering)
- Loading and storing uniforms, shader inputs and outputs, etc (after lowering)
- Copying variables (cases where in GLSL the destination is a structure or
array)
- The kitchen sink
- ...

Textures
=========

Unfortunately, there are far too many texture operations to represent each one
of them with an intrinsic, so there's a special texture instruction similar to
the GLSL IR one. The biggest difference is that, while the texture instruction
has a sampler dereference field used just like in GLSL IR, this gets lowered to
a texture unit index (with a possible indirect offset) while the type
information of the original sampler is kept around for backends. Also, all the
non-constant sources are stored in a single array to make it easier for
optimization passes to iterate over all the sources.

Control Flow
=========

Like in GLSL IR, control flow consists of a tree of "control flow nodes", which
include if statements and loops, and jump instructions (break, continue, and
return). Unlike GLSL IR, though, the leaves of the tree aren't statements but
basic blocks. Each basic block also keeps track of its successors and
predecessors, and function implementations keep track of the beginning basic
block (the first basic block of the function) and the ending basic block (a fake
basic block that every return statement points to). Together, these elements
make up the control flow graph, in this case a redundant piece of information on
top of the control flow tree that will be used by almost all the optimizations.
There are helper functions to add and remove control flow nodes that also update
the control flow graph, and so usually it doesn't need to be touched by passes
that modify control flow nodes.