KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Samuel Pitoiset	2dcd12f38b	radv: fix finding shaders by PC Shaders are allocated contiguously in memory for a pipeline and the freelist.next pointer is a pointer to the pipeline now. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14950>	2022-02-14 08:31:14 +01:00
Samuel Pitoiset	a224b7a057	radv: fix allocating/uploading the trap handler shader Since shaders are allocated per pipeline, the trap handler shader was not uploaded at all. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14950>	2022-02-14 08:31:10 +01:00
Bas Nieuwenhuizen	7adb3c0f7f	radv: Use larger arena sizes. For some games that take like 400 MiB of shader binaries, the number of shader arenas ends up going >1500. Cut that down a bit by using larger arenas. 8 MiB should still be decent with small BAR and should still cut things down from ~1500 to ~50 buffers. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14591>	2022-02-11 23:20:21 +00:00
Daniel Schürmann	af4b26c53a	radv: move nir_opt_shrink_stores from radv_optimize_nir() No need to call this pass in a loop. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14480>	2022-02-11 11:50:52 +01:00
Daniel Schürmann	2a92452a0e	nir/opt_shrink_vectors: Remove shrinking of store intrinsics data source This is done via nir_opt_shrink_stores. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14480>	2022-02-11 11:50:47 +01:00
Samuel Pitoiset	2451290bc4	radv: rewrite RADV_FORCE_VRS directly in NIR This introduces a small NIR pass that exports VARYING_SLOT_PRIMITIVE_SHADING_RATE if RADV_FORCE_VRS is used, instead of doing this in both backend compilers. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14907>	2022-02-09 17:40:34 +01:00
Connor Abbott	913bec10c4	nir/lower_subgroups: Rename lower_shuffle to lower_relative_shuffle This option only applies to relative shuffles (up/down/xor), and in a moment we're going to add an option to lower normal shuffles, so rename it. While we're here, rename lower_shuffle() to lower_to_shuffle() for similar reasons. Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14412>	2022-02-01 16:27:45 +00:00
Rhys Perry	8951608f08	radv: avoid providing an align_offset to intrinsic builders Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14455>	2022-01-21 13:45:33 +00:00
Rhys Perry	552e59aee3	radv: avoid providing an align_mul to intrinsic builders Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14455>	2022-01-21 13:45:33 +00:00
Rhys Perry	e7f91b194a	radv,aco,ac/llvm: implement fmulz and ffmaz Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13436>	2022-01-20 22:54:42 +00:00
Dave Airlie	1352e0ba0c	mesa/*: add a shader primitive type to get away from GL types. This creates an internal shader_prim enum, I've fixed up most users to use it instead of GL types. don't store the enum in shader_info as it changes size, and confuses other things. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14605>	2022-01-19 21:54:58 +00:00
Dave Airlie	d54c07b4c4	mesa/*: use an internal enum for tessellation primitive types. To avoid dragging gl.h into places it has no business being, defined tessellation primitive mode to an enum. This has a lot of fallout all over the place. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14605>	2022-01-19 21:54:58 +00:00
Emma Anholt	f6ffefba3e	nir: Apply nir_opt_offsets to nir_intrinsic_load_uniform as well. Doing this for ir3 required adding a struct for limits of how much base to fold in (which NTT wants as well for its case of shared vars), otherwise the later work to lower to the 1<<9 word limit would emit more instructions. The shader-db results are that sometimes the reduction in NIR instruction count results in the fewer sampler prefetches due to the shader being estimated to be shorter (dota2, nexuiz): total instructions in shared programs: 8996651 -> 8996776 (<.01%) total cat5 in shared programs: 86561 -> 86577 (0.02%) Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14023>	2022-01-16 19:11:29 +00:00
Samuel Pitoiset	e6173ed1d2	radv: allow to disable anisotropic filtering for single level image with drirc Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14471>	2022-01-13 16:17:48 +00:00
Rhys Perry	cc802cab7c	radv: add RADV_DEBUG=splitfma This splits application-provided FMA in vertex/geometry/tessellation/mesh shaders. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14458>	2022-01-12 23:50:35 +00:00
Danylo Piliaiev	b8d486f298	nir/algebraic: Separate has_dot_4x8 into has_sdot_4x8 and has_udot_4x8 Adreno GPUs have native instructions for unsigned and mixed dot_4x8 but not for signed dot product. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13986>	2022-01-10 13:20:39 +02:00
Marek Olšák	116a05c721	ac: move ac_exp_param.h to ac_nir.h Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:31 +00:00
Marek Olšák	12b942bd16	radeonsi: pass sample_coverage VGPR index to the PS prolog instead of guessing The code was correct, but a little confusing. This is cleaner. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:30 +00:00
Timur Kristóf	811c001049	radv: Lower primitive shading rate for mesh shaders. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14193>	2022-01-04 17:46:02 +00:00
Samuel Pitoiset	12ac44378d	radv: add UMR markers for the vertex prolog Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13376>	2022-01-04 07:50:07 +00:00
Samuel Pitoiset	2bf25e6f6e	radv,aco: keep track of the prolog disassembly if necessary Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13376>	2022-01-04 07:50:07 +00:00
Timur Kristóf	11501fa686	radv: Compile mesh shaders and apply the necessary NIR lowerings. Mesh shaders use NGG, but the API allows many compute shader features such as workgroups and shared memory. Use the appropriate NIR lowerings for these, then call ac_nir_lower_ngg_ms. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13580>	2021-12-31 13:05:09 +00:00
Rhys Perry	15a375b4c8	radv,aco: don't lower some ffma instructions GFX10.3 has no v_mad_f32 and we can't recombine exact ffma into a v_fma_f32 if they're split. GFX9+ only has v_fma_f16 and no generation has a 64-bit MAD. fossil-db (GFX10.3): Totals from 84040 (57.46% of 146267) affected shaders: VGPRs: 3717256 -> 3688064 (-0.79%); split: -0.87%, +0.08% SpillSGPRs: 10419 -> 10403 (-0.15%) CodeSize: 263064884 -> 262442820 (-0.24%); split: -0.31%, +0.07% MaxWaves: 2036908 -> 2038374 (+0.07%); split: +0.10%, -0.03% Instrs: 49849448 -> 49572182 (-0.56%); split: -0.60%, +0.04% Latency: 908130602 -> 907764246 (-0.04%); split: -0.18%, +0.14% InvThroughput: 207051300 -> 206762704 (-0.14%); split: -0.24%, +0.10% fossil-db (GFX10): Totals from 2 (0.00% of 146267) affected shaders: Latency: 8123 -> 8107 (-0.20%) fossil-db (GFX9): Totals from 2 (0.00% of 146401) affected shaders: (no statistics affected) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9805>	2021-12-13 11:22:33 +00:00
Samuel Pitoiset	43022ecc3a	radv: enable lower_lod_zero_width This fixes dEQP-VK.glsl.texture_functions.query.texturequerylod.*. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14147>	2021-12-13 10:00:07 +00:00
Rhys Perry	85161fb8ac	radv: clone shader in radv_shader_compile_to_nir This way, radv_shader_compile_to_nir doesn't alter the NIR. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14087>	2021-12-08 11:07:40 +00:00
Rhys Perry	2020a1799b	radv: include RT shaders in RADV_DEBUG=shaders,shaderstats Instead of using module->nir or nir->info->name to determine if it's a meta shader, use nir->info->internal. This also has an effect of disabling printing of meta shaders with NIR_DEBUG=print. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14087>	2021-12-08 11:07:40 +00:00
Rhys Perry	d74498e617	radv: add radv_meta_init_shader Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14087>	2021-12-08 11:07:40 +00:00
Samuel Pitoiset	3fa2220838	radv: upload shader binaries of a pipeline contiguously in memory RGP expects shaders to be contiguous in memory, otherwise it explodes because we have to generate huge captures with lot of holes. This reduces capture sizes of Cyberpunk 2077 from ~3.5GiB to ~180MiB. This should also help for future pipeline libraries. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13690>	2021-12-02 07:17:04 +00:00
Samuel Pitoiset	13143b3c11	radv: upload shader binaries after they are all compiled Instead of mixing compilation and upload. This will allow us to upload all shader binaries contiguously in memory and also for future pipeline libraries work. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13690>	2021-12-02 07:17:04 +00:00
Samuel Pitoiset	ff61b36ba2	radv: add a helper function to upload a shader binary Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13690>	2021-12-02 07:17:04 +00:00
Samuel Pitoiset	dd66de6017	radv: remove never reached free() when compiling shaders binary_out is never NULL and binaries are freed from the pipeline after they are added to the cache. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13690>	2021-12-02 07:17:04 +00:00
Samuel Pitoiset	8eae431720	radv/llvm: constify radv_shader_info Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13542>	2021-11-29 10:10:07 +00:00
Samuel Pitoiset	096c02bcf5	radv: copy the user SGPRs locations outside of radv_declare_shader_args() The shader locations are now directly stored in radv_shader_args which makes sense because they are tied to the arguments. The locations are then copied to radv_shader_info but they will be moved into a new radv_shader_binary_info with upcoming changes. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13542>	2021-11-29 10:10:06 +00:00
Samuel Pitoiset	3bbc226d7a	radv: configure the number of SGPRs/VGPRs directly from the arguments Instead of copying the values to radv_shader_info. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13542>	2021-11-29 10:10:06 +00:00
Rhys Perry	26d2e22eea	radv: stop running copy-propagation before nir_opt_deref spirv_to_nir() now does this. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13924>	2021-11-24 15:43:51 +00:00
Rhys Perry	2d07bcad66	radv: lower load_local_invocation_index with 1D workgroups For 1D workgroups, we can just load from an input VGPR. fossil-db (Sienna Cichlid): Totals from 226 (0.18% of 128647) affected shaders: CodeSize: 1200476 -> 1195696 (-0.40%); split: -0.49%, +0.09% Instrs: 223817 -> 223328 (-0.22%); split: -0.29%, +0.07% Latency: 2552394 -> 2549606 (-0.11%); split: -0.15%, +0.04% InvThroughput: 533989 -> 532670 (-0.25%); split: -0.27%, +0.02% VClause: 5191 -> 5188 (-0.06%) SClause: 7637 -> 7636 (-0.01%) Copies: 18165 -> 18182 (+0.09%); split: -0.22%, +0.31% Branches: 10446 -> 10442 (-0.04%) PreSGPRs: 8049 -> 8041 (-0.10%); split: -0.17%, +0.07% PreVGPRs: 7785 -> 7767 (-0.23%); split: -0.32%, +0.09% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel-schuermann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13757>	2021-11-12 18:59:51 +00:00
James Park	0aaaee09a4	radv: Match function definitions to declarations Fixes compiler errors for 32-bit Windows. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13749>	2021-11-12 09:46:10 +00:00
Samuel Pitoiset	9b80f4d5f2	radv: rename radv_shader_variant to radv_shader Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13548>	2021-11-01 20:04:45 +00:00
Samuel Pitoiset	b6a69dbb40	radv: re-emit prolog inputs when the nontrivial divisors state changed If the application first uses nontrivial divisors, the driver emits the vertex shader VA to the upload BO rather than directly via the user SGPRs locations. But, if the vertex input dynamic state changes, the driver might select a different VS prolog that no longer needs nontrivial divisors. In this case, the driver needs to re-emit the prolog inputs because otherwise the VS prolog will jump to the PC that is emitted via the user SGPR locations, and the previous one was somewhere in the upload BO... This fixes a GPU hang with Bioshock and Zink. Fixes: `d9c7a17542` ("radv: enable VK_EXT_vertex_input_dynamic_state") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13377>	2021-10-22 09:47:50 +02:00
Samuel Pitoiset	8ec6824335	radv,aco: decouple shader_info/options from radv_shader_args Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13287>	2021-10-22 07:10:40 +00:00
Rhys Perry	80841196b2	radv: implement dynamic vertex input state using vertex shader prologs This doesn't actually use the functionality or implement prolog compilation yet. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11717>	2021-10-13 05:13:10 +00:00
Rhys Perry	a1069b8bd4	radv: add segregated fit shader memory allocator Way faster than the previous one, especially with a large number of shaders. This doesn't have much of an effect right now, but the previous allocator was expensive compared to the cost of compiling vertex shader prologs. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11717>	2021-10-13 05:13:10 +00:00
Rhys Perry	404752bfb2	radv: add radv_shader_variant_get_va and radv_find_shader_variant helpers Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11717>	2021-10-13 05:13:10 +00:00
Samuel Pitoiset	ced950e42f	radv: store the post-processed shader binary config to the cache This will allow us to reduce the size of radv_shader_info which is stored in the cache entry. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12992>	2021-10-11 12:25:41 +00:00
Samuel Pitoiset	1585629db2	radv: move ac_shader_config to radv_shader_binary instead of legacy This will allow us to avoid postprocessing binaries when they are loaded from the shaders cache. LLVM binaries already contain the shader config as part of the ELF, so it's duplicated and increase the cache entry by 48 bytes. Though, I don't think that should matter for LLVM. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12992>	2021-10-11 12:25:41 +00:00
Timur Kristóf	52413a93af	radv: Enable NGG culling by default on GFX10.3, add nonggc debug flag. This commit enables NGG culling on all GFX10.3 GPUs by default. A new debug flag environment variable RADV_DEBUG=nonggc is added to disable this feature on GPUs where it is enabled by default. The previous perf test flag RADV_PERFTEST=nggc will not be needed on GFX10.3 anymore but it can still be used to enable the feature on GPUs where it isn't on by default. Totals from 58239 (45.27% of 128647) affected shaders: VGPRs: 1989752 -> 2049408 (+3.00%); split: -3.21%, +6.21% SpillSGPRs: 675 -> 883 (+30.81%); split: -78.07%, +108.89% CodeSize: 72205968 -> 153572764 (+112.69%) LDS: 0 -> 227125248 (+inf%) MaxWaves: 1614598 -> 1646934 (+2.00%); split: +3.08%, -1.08% Instrs: 14202239 -> 29654042 (+108.80%) Latency: 87986508 -> 136960419 (+55.66%); split: -0.23%, +55.89% InvThroughput: 14444832 -> 21141875 (+46.36%); split: -0.01%, +46.37% VClause: 340794 -> 493067 (+44.68%); split: -1.33%, +46.01% SClause: 520983 -> 738636 (+41.78%); split: -0.25%, +42.03% Copies: 775639 -> 2787382 (+259.37%) Branches: 296911 -> 1225431 (+312.73%) PreSGPRs: 1316896 -> 2057270 (+56.22%); split: -0.14%, +56.36% PreVGPRs: 1473558 -> 1658432 (+12.55%); split: -1.44%, +13.99% Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13086>	2021-10-08 14:18:54 +00:00
Samuel Pitoiset	64030e79c5	radv,aco: remap PS inputs when declaring shader arguments LLVM seems to require all inputs for PS. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13192>	2021-10-08 14:14:09 +02:00
Samuel Pitoiset	4e07fe5f78	radv,aco: compute and store the SPI PS input in radv_shader_info Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13192>	2021-10-08 14:14:08 +02:00
Samuel Pitoiset	a07bc0c37a	radv: do not set TRAP_PRESENT(1) for fragment shaders It shouldn't be always set. Found by inspection. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13244>	2021-10-08 09:00:56 +02:00
Jason Ekstrand	7a2516568d	radv: Switch to the new common vk_error helpers Also, change every vk_error to use the closest object instead of fetching all the way back to the instance. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13045>	2021-10-07 20:51:36 +00:00

556 Commits