KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Georg Lehmann	c1cf0688c9	aco: Add G16 opcodes. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16979>	2022-07-26 16:54:08 +00:00
Samuel Pitoiset	8bdcc20815	aco: add new pseudo instruction p_jump_to_epilog The first operand of this new pseudo-instruction is a 64-bit SGPR for the continue PC, followed by a variable list of fixed VGPRS for the color exports which are the PS epilog inputs. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Rhys Perry	d2d94b62f2	aco: initialize scratch base registers on GFX9-GFX10.3 fossil-db (navi21): Totals from 1142 (0.70% of 162293) affected shaders: Instrs: 271636 -> 271974 (+0.12%) CodeSize: 1532020 -> 1533792 (+0.12%) Latency: 7484066 -> 7485698 (+0.02%) InvThroughput: 4048824 -> 4049579 (+0.02%) SClause: 4171 -> 4212 (+0.98%) PreSGPRs: 11203 -> 12276 (+9.58%) fossil-db (vega10): Totals from 3327 (2.06% of 161355) affected shaders: Instrs: 257413 -> 257601 (+0.07%) CodeSize: 1424244 -> 1425372 (+0.08%) Latency: 8598402 -> 8600466 (+0.02%) InvThroughput: 7906335 -> 7908234 (+0.02%) SClause: 4932 -> 4973 (+0.83%) PreSGPRs: 22010 -> 25405 (+15.42%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17079>	2022-07-08 14:49:03 +00:00
Rhys Perry	cbeb25ce91	aco: make FLAT_instruction::offset signed Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17079>	2022-07-08 14:49:03 +00:00
Georg Lehmann	dcdd31ae96	aco: Remove r128_a16 MIMG builder option. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16969>	2022-06-10 15:51:26 +00:00
Timur Kristóf	abcd0aa9e5	aco: Remove trailing whitespace. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Martin Roukala <martin.roukala@mupuf.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16702>	2022-05-25 12:29:30 +00:00
Georg Lehmann	a8b29094c2	aco: Remove some old comments in aco_opcodes.py. s_cmovk_i32 isn't GFX8_GFX9 only and s_version doesn't need a comment to say it's GFX10+ exclusive. The encoding list is enough to provide this information, as for other GFX10+ instructions. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16006>	2022-04-18 15:59:38 +00:00
Rhys Perry	2135c88d9c	aco: fix signedness of DS_instruction::offset0/1 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13778>	2022-04-13 23:08:07 +00:00
Rhys Perry	f68797ead7	aco: create v_mac_legacy_f32/v_fmac_legacy_f32 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13436>	2022-01-20 22:54:42 +00:00
Tatsuyuki Ishi	da0412e55b	aco: support DPP8 Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13971>	2021-12-31 20:56:39 +00:00
Rhys Perry	2a7fa132be	aco: implement udot_4x8/sdot_4x8/udot_2x16/sdot_2x16 opcodes Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12617>	2021-09-03 13:21:28 +00:00
Daniel Schürmann	0988f7b9ba	aco: remove explicit dst_preserve flag Instead, we can rely on the fact that subdword definitions must preserve the unused bits while dword definitions either pad or sign-extend. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12640>	2021-09-02 20:39:17 +02:00
Daniel Schürmann	9e3ff06c38	aco: rewrite SDWA selector This commit introduces a new struct SubdwordSel in order to ease and clean up the usage of SDWA selections. This includes removing the distinction between register-allocated and fixed SDWA selections. Instead, SDWA selections can now also access the high bits of subdword variables. Alignment and sizes are validated accordingly. Size, offset and sign_extend can be evaluated via helper methods. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12640>	2021-09-02 20:39:17 +02:00
Daniel Schürmann	077776a866	aco/opcodes: remove definition_size[] Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12364>	2021-08-23 10:31:40 +00:00
Timur Kristóf	e66f54e5c8	aco: Allow elect to take advantage of knowing when all lanes are active. Implement elect using a pseudo-op which is lowered during the insert_exec_mask pass. This makes it possible to emit a more optimal sequence when the exec mask is constant. Fossil DB results on Sienna Cichlid: Totals from 211 (0.16% of 128647) affected shaders: CodeSize: 2254356 -> 2240468 (-0.62%); split: -0.62%, +0.00% Instrs: 438471 -> 434996 (-0.79%); split: -0.80%, +0.01% Latency: 2717082 -> 2709400 (-0.28%); split: -0.28%, +0.00% InvThroughput: 566987 -> 566342 (-0.11%); split: -0.11%, +0.00% Copies: 40058 -> 40162 (+0.26%) Branches: 31209 -> 31211 (+0.01%) PreSGPRs: 9927 -> 10125 (+1.99%) Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11458>	2021-07-16 14:31:54 +00:00
Daniel Schürmann	3f9e986d33	aco: add missing Licenses and remove Authors from files Reviewed-by: Tony Wasserka <tony.wasserka@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11271>	2021-07-12 12:09:31 +00:00
Timur Kristóf	fd6605367d	aco: Implement nir_op_sad_u8x4. Fix up the operand size for v_sad instructions, and implement the new NIR horizontal add. There is no viable way to do this in SALU, so let's always use a VGPR destination. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Tony Wasserka <tony.wasserka@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11072>	2021-06-09 16:48:51 +00:00
Rhys Perry	c129ede523	aco: use ds_read_{u8,u16}_d16 This allows partial writes and writes to the upper half of the destination. fossil-db (Sienna Cichlid): Totals from 135 (0.09% of 149839) affected shaders: Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11113>	2021-06-09 12:06:50 +00:00
Rhys Perry	4870d7d829	aco: use v1b/v2b for ds_read_u8/ds_read_u16 The p_extract_vector isn't necessary. For ds_read_u8 and ds_read_u16, we used a 32-bit regclass, but did't load 32 bits, and used dst_hint for vector loads when we shouldn't have. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4863 Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11113>	2021-06-09 12:06:50 +00:00
Rhys Perry	c768d7d8f2	aco/tests: add SDWA tests Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3151>	2021-06-08 08:57:43 +00:00
Rhys Perry	2f94353735	aco: add p_extract/p_insert These will let us make the SDWA optimizer much simpler than if we were to recognize combinations of shift/and/bfe. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3151>	2021-06-08 08:57:42 +00:00
Bas Nieuwenhuizen	c7904b5b9b	aco: Implement bvh64_intersect_ray_amd intrinsic. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10818>	2021-05-18 23:02:25 +02:00
Rhys Perry	83ce9407f2	aco: add instruction classes These should mostly match LLVM. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8994>	2021-03-11 16:31:19 +00:00
Rhys Perry	0af7ff49fd	aco: lower p_constaddr into separate instructions earlier This allows them to be scheduled properly and simplifies the assembler a little. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8994>	2021-03-11 16:31:19 +00:00
Daniel Schürmann	29b866fef6	aco: remove special handling of load_helper_invocation These should now behave the same as is_helper_invocation. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9058>	2021-02-17 21:53:52 +00:00
Rhys Perry	441ead5fb3	aco: remove Format::{VOP3A,VOP3B} These are really the same as Format::VOP3. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8595>	2021-01-22 14:12:32 +00:00
Daniel Schürmann	5ad52ac906	aco: create helpers to emit vop3p instructions Also make get_alu_src() capable to return unswizzled multi-component SGPR sources. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6680>	2021-01-13 17:46:56 +00:00
Daniel Schürmann	2caba08c1a	aco: fix VOP3P assembly, VN and validation aco/opcodes: rename v_pk_fma_mix* -> v_fma_mix* and add modifier capabilities for VOP3P. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6680>	2021-01-13 17:46:55 +00:00
Daniel Schürmann	7b669ff789	aco: simplify and fix operand/definition sizes These are mainly needed for constant propagation and subdword register allocation. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8425>	2021-01-12 16:14:00 +00:00
Rhys Perry	867323379e	aco: don't use SMEM for SSBO stores fossil-db (Navi): Totals from 70 (0.05% of 138791) affected shaders: SGPRs: 2324 -> 2097 (-9.77%) VGPRs: 1344 -> 1480 (+10.12%) CodeSize: 157872 -> 154836 (-1.92%); split: -1.93%, +0.01% MaxWaves: 1288 -> 1260 (-2.17%) Instrs: 29730 -> 29108 (-2.09%); split: -2.13%, +0.04% Cycles: 394944 -> 391280 (-0.93%); split: -0.94%, +0.01% VMEM: 5288 -> 5695 (+7.70%); split: +11.97%, -4.27% SMEM: 2680 -> 2444 (-8.81%); split: +1.34%, -10.15% VClause: 291 -> 502 (+72.51%) SClause: 1176 -> 918 (-21.94%) Copies: 3549 -> 3517 (-0.90%); split: -1.80%, +0.90% Branches: 1230 -> 1228 (-0.16%) PreSGPRs: 1675 -> 1491 (-10.99%) PreVGPRs: 1101 -> 1223 (+11.08%) Totals from 70 (0.05% of 139517) affected shaders (RAVEN): SGPRs: 2368 -> 2121 (-10.43%) VGPRs: 1344 -> 1480 (+10.12%) CodeSize: 156664 -> 153252 (-2.18%) MaxWaves: 636 -> 622 (-2.20%) Instrs: 29968 -> 29226 (-2.48%) Cycles: 398284 -> 393492 (-1.20%) VMEM: 5544 -> 5930 (+6.96%); split: +11.72%, -4.76% SMEM: 2752 -> 2502 (-9.08%); split: +1.20%, -10.28% VClause: 292 -> 504 (+72.60%) SClause: 1236 -> 940 (-23.95%) Copies: 3907 -> 3852 (-1.41%); split: -2.20%, +0.79% Branches: 1230 -> 1228 (-0.16%) PreSGPRs: 1671 -> 1487 (-11.01%) PreVGPRs: 1102 -> 1225 (+11.16%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6143>	2020-11-16 15:52:22 +00:00
Samuel Pitoiset	2f5b3ac2f8	aco: remove v_{add,sub,subrev}_u32 on GFX8 These opcodes are never used and they always write the carry-out according to the GCN3 ISA documentation. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7569>	2020-11-16 10:56:13 +00:00
Rhys Perry	41839d38cf	aco: default to a definition size of 32 For non-arithmetic opcodes such as buffer_load_dword and buffer_load_short, default to a definition size of 32. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7276>	2020-10-28 10:56:27 +00:00
Daniel Schürmann	f29c81f863	aco: use VOP2 for v_cvt_pkrtz_f16_f32 if possible This patch also does a slight rework of export_fs_mrt_color() to avoid setting of enabled channels which are not used. Totals from 52404 (38.38% of 136546) affected shaders (NAVI): SGPRs: 3097443 -> 3097435 (-0.00%) CodeSize: 189151600 -> 188546200 (-0.32%) Instrs: 36445061 -> 36445104 (+0.00%); split: -0.00%, +0.00% Cycles: 1739388020 -> 1739388192 (+0.00%); split: -0.00%, +0.00% VMEM: 21071501 -> 21071665 (+0.00%); split: +0.00%, -0.00% SMEM: 3470983 -> 3470982 (-0.00%); split: +0.00%, -0.00% PreSGPRs: 2058965 -> 2058962 (-0.00%) PreVGPRs: 1860294 -> 1860295 (+0.00%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6777>	2020-10-14 15:31:38 +00:00
Daniel Schürmann	7240edec2a	aco: use VOP2 version of v_cvt_pkrtz_f16_f32 on GFX_6_7_10 Totals from 767 (0.56% of 136546) affected shaders (NAVI): CodeSize: 2862208 -> 2850036 (-0.43%) Instrs: 561572 -> 561574 (+0.00%) Cycles: 6455420 -> 6455428 (+0.00%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6777>	2020-10-14 15:31:38 +00:00
Rhys Perry	b811b1d083	aco: update aco_opcodes.py for GFX10.3 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5546>	2020-08-04 20:39:33 +01:00
Rhys Perry	e6366f9094	aco: add framework for unit testing And add some "tests" to test and document currently unused features of the framework. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Daniel Schürmann <daniel@schuermann.dev> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3521>	2020-07-30 16:13:08 +00:00
Rhys Perry	d1f992f3c2	aco: rework barriers and replace can_reorder fossil-db (Navi): Totals from 273 (0.21% of 132058) affected shaders: CodeSize: 937472 -> 936556 (-0.10%) Instrs: 158874 -> 158648 (-0.14%) Cycles: 13563516 -> 13562612 (-0.01%) VMEM: 85246 -> 85244 (-0.00%) SMEM: 21407 -> 21310 (-0.45%); split: +0.05%, -0.50% VClause: 9321 -> 9317 (-0.04%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4905>	2020-07-28 16:56:34 +00:00
Rhys Perry	d169f09e37	aco: be more careful combining additions that could wrap into loads/stores SMEM does the addition with 64-bits, not 32. So if the original code relied on wrapping around (for example, for subtraction), it would break. Apparently swizzled MUBUF accesses also have issues with combining additions that could overflow. Normal MUBUF accesses seem fine. fossil-db (Navi): Totals from 27219 (20.02% of 135946) affected shaders: CodeSize: 128303256 -> 131062756 (+2.15%); split: -0.00%, +2.15% Instrs: 24818911 -> 25280558 (+1.86%); split: -0.01%, +1.87% VMEM: 162311926 -> 177226874 (+9.19%); split: +9.36%, -0.17% SMEM: 18182559 -> 20218734 (+11.20%); split: +11.53%, -0.34% VClause: 423635 -> 424398 (+0.18%); split: -0.02%, +0.20% SClause: 865384 -> 1104986 (+27.69%); split: -0.00%, +27.69% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2748 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/2720>	2020-07-21 18:25:35 +00:00
Rhys Perry	b36950ad2c	aco: fix nir_op_f2f16_rtne with non-default rounding modes Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5773>	2020-07-17 16:40:47 +00:00
Rhys Perry	09f48de582	aco: read 0 from inactive lanes when using dpp Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5695>	2020-07-13 14:11:50 +00:00
Rhys Perry	ec4d3def16	aco: use VOP2 version of v_mbcnt_hi_u32_b32 on GFX6/7 fossil-db (Pitcairn): Totals from 2172 (1.58% of 137414) affected shaders: CodeSize: 7109080 -> 7100100 (-0.13%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5623>	2020-07-07 18:48:15 +00:00
Rhys Perry	4784111abc	aco: use 32-bit inline constants for 16-bit integer instructions See https://reviews.llvm.org/D81841 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5245>	2020-06-15 18:24:22 +00:00
Rhys Perry	207c35cbe8	aco: add Info::{operand_size,definition_size} No shader-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5040>	2020-06-10 15:05:11 +00:00
Timur Kristóf	14a5021aff	aco/gfx10: Refactor of GFX10 wave64 bpermute. The emulated GFX10 wave64 bpermute no longer needs a linear_vgpr, so we don't consider it a reduction anymore. Additionally, the code is slightly reorganized in preparation for the GFX6 emulated bpermute. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5223>	2020-06-02 21:12:12 +00:00
Rhys Perry	8e02de4d7f	aco: remove use of f-strings f-strings require Python 3.6 but 3.5 is still maintained and used. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2839 Fixes: `2ab45f41` ("aco: implement sub-dword swaps") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4850>	2020-05-02 13:59:05 +00:00
Rhys Perry	4ed83e2f94	aco: add various GFX10 int16 opcodes Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4772>	2020-04-28 23:16:55 +00:00
Rhys Perry	2ab45f41e0	aco: implement sub-dword swaps Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4469>	2020-04-22 13:25:17 +00:00
Rhys Perry	83fdb1ed3d	aco: add VOP3P_instruction The optimizer isn't yet updated to handle this, since lower_to_hw_instr will be the only user for now. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4469>	2020-04-22 13:25:17 +00:00
Timur Kristóf	e73f604b21	aco: Fix the meaning of is_atomic. Previously, is_atomic really meant "is not atomic", contrary to its name. This commit fixes it to mean what one would think it means. Fixes: `69bed1c918` Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3618> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3618>	2020-01-29 20:32:31 +00:00
Rhys Perry	525b107347	aco: rework vertex fetching a bit This will make it easier to skip unused channels at the start and to split unaligned loads on GFX10. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3086>	2020-01-28 11:39:57 +00:00

1 2

65 Commits