KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Connor Abbott	acba08b58f	ir3: Implement and document ldc.k Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	944f4e6f8a	ir3: Better assemble/disassemble stc Add in the type, even though it turns out to not be that useful. Add in support for assembling it. Add some notes based on computerator experiments. And add support for the indirect a1.x mode that's needed for storing c64.x and later. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Ilia Mirkin	65c4b6a4c6	freedreno/ir3: document GETINFO's x/y results The zw were already known, but throw them in here too. I'm not extremely happy with the description of "y", feels like there's a simpler explanation there, but I couldn't find it. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14672>	2022-02-21 02:09:19 +00:00
Ilia Mirkin	3d41414d26	freedreno/ir3: split up load/store/atomic by generation Some bits are slightly different on a4xx. Use the encodings that work. Perhaps these can be combined at some point if we get a proper understanding of what they mean. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14789>	2022-02-12 13:46:11 -05:00
Ilia Mirkin	b91b036322	isaspec: add gen-based leaf bitset separation This is necessary for some ops which have slightly different encoding on a4xx/a5xx, but are otherwise identical. This helps keeping the compiler from having to worry about these details and creating separate ops. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14789>	2022-02-12 13:46:07 -05:00
Emma Anholt	3d5ee08c15	freedreno/isaspec: Add missing dep of encode.py/decode.py calls on isa.py Fixes: #5921 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14725>	2022-02-02 11:21:56 -08:00
Danylo Piliaiev	c1d5c318bc	ir3: New cat3 instructions * shrm - (src2 >> src1) & src3 * shlm - (src2 << src1) & src3 * shrg - (src2 >> src1) \| src3 * shlg - (src2 << src1) \| src3 * andg - (src2 & src1) \| src3 * dp2acc - dot product of two {i,u}8vec2 packed into SRC1 and SRC2, added to 32b SRC3 * dp4acc - dot product of two {i,u}8vec4 packed into SRC1 and SRC2, added to 32b SRC3 * wmm - vec4(x_1, x_2, x_3, x_4) * (y_1 + y_2 + y_3 + y_4), which is duplicated (1 << (SRC3 / 32)) times starting from DST register * wmm.accu - same as wmm but result is added to DST registers, however the first reg in each vec4 result is overwritten instead of accumulating. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13986>	2022-01-10 13:20:39 +02:00
Vinson Lee	1d6f6f9102	ir3: Make shift operand 64-bit. Fix defect reported by Coverity Scan. Unintentional integer overflow (OVERFLOW_BEFORE_WIDEN) overflow_before_widen: Potentially overflowing expression 2 << W with type int (32 bits, signed) is evaluated using 32-bit arithmetic, and then used in a context that expects an expression of type uint64_t (64 bits, unsigned). Signed-off-by: Vinson Lee <vlee@freedesktop.org> Acked-by: Rob Clark <robclark@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14258>	2021-12-22 01:19:46 +00:00
Danylo Piliaiev	d1c49901df	ir3: Add gen4 new subgroup instructions * getlast.w8 #4 - Perform jump for the first (CLUSTER_SIZE-1) fibers in a subgroup * brcst.active.w8 - necessary to implement arithmetic subgroup operations with prefix sum. * quad_shuffle.brcst - subgroupQuadBroadcast * quad_shuffle.horiz - subgroupQuadSwapHorizontal * quad_shuffle.vert - subgroupQuadSwapVertical * quad_shuffle.diag - subgroupQuadSwapDiagonal * getfiberid - gl_SubgroupID Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13817>	2021-12-07 20:45:53 +00:00
Danylo Piliaiev	5d5b1fc472	freedreno/ir3: add a6xx global atomics and separate atomic opcodes Separating atomic opcodes makes possible to express a6xx global atomics which take iova in SRC1. They would be needed by VK_KHR_buffer_device_address. The change also makes easier to distiguish atomics in conditions. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8717>	2021-11-23 18:26:37 +00:00
Danylo Piliaiev	ed16eedb2d	ir3: print half-dst/src for ldib.b/stib.b So it would print: ldib.b.untyped.1d.u16.1.imm.base0 hr0.z, r0.x, 0 instead of: ldib.b.untyped.1d.u16.1.imm.base0 r0.z, r0.x, 0 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13876>	2021-11-22 12:32:15 +00:00
Emma Anholt	9e04f97d8e	freedreno: Fix the uniform/nonuniform handling for cat5 bindful modes. We can see from the dynamically_uniform (compiler doesn't know if you're uniform or not) vs uniform (compiler can see it's uniform) case in the blob which is which. Now that we have the right names, also use the nonunif flag for encoding the actual non-uniform mode (previously, we were always setting it always in a way that meant uniform). I verified this behavior back to a418 with samplers. The a3xx blob I have only does GLES3, so we don't have the opaque_type_indexing tests to see. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13601>	2021-11-10 17:48:59 +00:00
Matt Turner	a150e31910	ir3: Add support for (dis)assembling flat.b flat.b is a variant of the bary.f instruction that does not perform interpolation of the varying input. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13558>	2021-11-04 02:59:28 +00:00
Rob Clark	22a203aa4c	freedreno/isa: Fix ldg/stg "halfness" Whether the load dst or store src is a half reg is determined by the type field, similar to cat5. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13426>	2021-10-19 16:04:42 +00:00
Rob Clark	0480595d03	freedreno/isa: Add immed reg accessors This way we can assert that a src that we expect to be an immediate actually is. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13353>	2021-10-15 15:52:33 +00:00
Rob Clark	8b0550f09f	freedreno/isa: Fixes for validation Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13353>	2021-10-15 15:52:33 +00:00
Rob Clark	9516d8ce98	freedreno/ir3+isa: Cleanup bindless cat5 samp/tex encoding Don't let the way they are encoded at the isa level leak thru to the ir3 level. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13353>	2021-10-15 15:52:33 +00:00
Danylo Piliaiev	4218596671	ir3/freedreno: handle non-uniform a1en instructions Fixes vkd3d test "test_bindless_samplers_sm51" Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13311>	2021-10-12 17:43:52 +00:00
Connor Abbott	470bf75ff8	ir3: Fix handling cat6 immediates We were treating them the same as regular cat2/cat3/cat4 immediates, but that's not right because cat6 sources are only 8 bits. Our bindless code was handling this before for bindless resources, and it was disabled for most other things, so this was mostly harmless, but fixing it will be necessary for handling ldc offsets. In addition enable tests for this that were just commented out, and add a custom test making sure that the immediate source is treated as unsigned. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13142>	2021-10-12 11:30:52 +00:00
Danylo Piliaiev	d590515112	ir3: support source modes for resinfo.b IBO/SSBO may have dynamic index, previously we just silently ignored this fact. However resinfo supports different modes. Fixes vkd3d test "test_null_uav" Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13224>	2021-10-07 08:19:13 +00:00
Christian Gmeiner	bc2e806b0f	freedreno/isa: move isaspec to a new home This commit moves isaspec out of freedreno into a more generic new home - src/compiler/isaspec. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11321>	2021-09-21 20:25:31 +00:00
Christian Gmeiner	6801c300a8	freedreno/isa: add shbang and make executable In a later change we will use mesons find_program(..) and this only works if python files are executable and have a shbang. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11321>	2021-09-21 20:25:31 +00:00
Christian Gmeiner	dbb9d3d0e3	freedreno/isa: encode: switch bitmask_t to BITSET_WORD's This commit changes the underlying basetype of bitmask_t to a BITSET_WORD based one. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11321>	2021-09-21 20:25:31 +00:00
Christian Gmeiner	9dc2ef7200	freedreno/isa: decode: switch bitmask_t to BITSET_WORD's This commit changes the underlying basetype of bitmask_t to a BITSET_WORD based one. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11321>	2021-09-21 20:25:31 +00:00
Christian Gmeiner	430dc08755	freedreno/isa: add split_bits(..) methods Will be used to split a value into needed number of 32 bit words. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11321>	2021-09-21 20:25:31 +00:00
Christian Gmeiner	1a8048954d	freedreno/isa: generate marcos used for printf(..) Generate correct BITSET_FORMAT and BITSET_VALUE macros based on the maximum needed ISA bits. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11321>	2021-09-21 20:25:31 +00:00
Christian Gmeiner	a6b82d1c8c	freedreno/isa: add store_instruction(..) Makes it possible to store an encoded instruction in a generic ISA specific way. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11321>	2021-09-21 20:25:31 +00:00
Christian Gmeiner	a5bbd08775	freedreno/isa: add BITMASK_WORDS define This new define will be used by a more generic deocde(..) and encode(..) functions. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11321>	2021-09-21 20:25:31 +00:00
Christian Gmeiner	ea42a3bee5	freedreno/isa: add bitmask to/from uint64_t helper Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11321>	2021-09-21 20:25:31 +00:00
Christian Gmeiner	35877cd7b8	freedreno/isa: add bitmask_t to encode.py Prep work for the next change. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11321>	2021-09-21 20:25:31 +00:00
Christian Gmeiner	3c634e956b	freedreno/isa: generate isaspec-decode.h Generate a 'glue' header file to be able to support different ISAs. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11321>	2021-09-21 20:25:31 +00:00
Christian Gmeiner	59e6258e4f	freedreno/isa: generate ir3-isa.h This header contains the bitmask_t struct typedef that will be used by the isaspec. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11321>	2021-09-21 20:25:31 +00:00
Christian Gmeiner	3d6a3b3c7c	freedreno/isa: store max size for needed bitset We will use this information later to create a correctly sized BITSET. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11321>	2021-09-21 20:25:31 +00:00
Christian Gmeiner	d29a6e2db5	freedreno/isa: add defines for fprintf(..) usage In the long run they will be replaces with some generated defines. If we do this early it keeps the diffs of the next changes small. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11321>	2021-09-21 20:25:31 +00:00
Christian Gmeiner	6c3befdd70	freedreno/isa: add next_instruction(..) In during the next commits we will change to a generated version of next_instruction(..) based on the actual isa. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11321>	2021-09-21 20:25:31 +00:00
Christian Gmeiner	c3b14ade55	freedreno/isa: simplify custom_target Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11321>	2021-09-21 20:25:31 +00:00
Christian Gmeiner	3b52d64474	freedreno/isa: add leading zero's Changes the output format slightly but its needed when we want to switch to more generic version of isaspec. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11321>	2021-09-21 20:25:31 +00:00
Emma Anholt	2b6729883a	freedreno/ir3: Add encode/decode support for a5xx's LDIB. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12704>	2021-09-03 18:17:07 +00:00
Emma Anholt	127e845d1d	freedreno/ir3: Refactor a3xx ibo/ssbo load/store instruction XML. There are fields common to both loads and stores, but not resinfo. Move them to a common bitset to reduce duplication. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12704>	2021-09-03 18:17:07 +00:00
Eric Engestrom	4d9acfa533	python: drop explicit output_encoding='utf-8' in mako templates Python 3 handles unicode strings by default, so we can drop all that. Suggested-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3674>	2021-08-14 21:44:32 +00:00
Rob Clark	cc72eeb077	freedreno/ir3: Reduce use of compiler->gpu_id For the same reason as previous patch. Mostly we only care about the generation, so convert things to use compiler->gen instead. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12159>	2021-08-06 18:51:50 +00:00
Danylo Piliaiev	1c6c200c0d	ir3: add newly found shlg.b16 instruction Example of blob's output: (nop3) shlg.b16 hr8.x, (r)8, (r)hr8.x, 12 It does: (src2 << src1) \| src2 src1 and src2 could be GPRs, relative GPRs, relative consts, or immidiates. However, they could not be plain const registers. Blob does use it in conjuncture with "samgq" instruction. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11760>	2021-07-09 13:00:29 +00:00
Connor Abbott	2ff3ab0aed	ir3: Make MOVMSK use repeat MOVMSK is a bit of a special case, because it takes multiple cycles (and therefore reduces the nops needed if it's between some other assigner and consumer) however weird things happen if you try to start reading the first component while it isn't finished yet. On balance making it use repeat seems to result in a fewer special cases. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6752>	2021-07-08 16:02:41 +00:00
Connor Abbott	92bb37cb59	ir3: Add min gen for multi-mov instructions swz works on a5xx/a6xx but not a3xx according to CI. I don't have any access to a4xx HW so I can't tell whether it works there. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11565>	2021-06-29 08:08:12 +00:00
Danylo Piliaiev	fdc0f489e0	ir3: add ldg.a,stg.a which allow complex in-place offset calculation The full form for ldg.a/stg.a offset is: g[reg_address + reg_offset << (imm_shift + 2) + imm_offset << 2] where imm_shift is in [0, 3] and imm_offset is in [0, 3] a6xx blob was found to produce a bit simplier offset calculations for TES/TCS shaders in GTA V: [c002000a_03c14215] ldg.a.f32 r2.z, g[r1.y+((r2.z+1)<<2)], 3; [c0020004_01c14609] ldg.a.f32 r1.x, g[r1.y+((r1.x+3)<<2)], 1; Our new syntax: stg.a.u32 g[r2.x+(r1.x+1)<<2], r5.x, 1 stg.a.u32 g[r2.x+r1.x<<4+3<<2], r5.x, 1 ldg.a.f32 r1.w, g[r1.y+(r1.w+1)<<2], 3 ldg.a.f32 r1.w, g[r1.y+r1.w<<5+2<<2], 3 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11431>	2021-06-25 15:39:51 +00:00
Danylo Piliaiev	4b06db0548	freedreno/isa: add uoffset type to print positive-only offsets Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11431>	2021-06-25 15:39:51 +00:00
Connor Abbott	bff83fc42b	freedreno/isa: Convert to srcs/dsts Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11469>	2021-06-23 17:20:29 +00:00
Italo Nicola	8074a040e7	util: add util_sign_extend This code is taken from src/freedreno/isa/decode.c. Since we need a similar function in panfrost, it's probably good to move it to utils. Signed-off-by: Italo Nicola <italonicola@collabora.com> Acked-by: Rob Clark <robclark@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9461>	2021-04-27 07:04:07 +00:00
Danylo Piliaiev	9dd9424a85	turnip: implement VK_EXT_shader_demote_to_helper_invocation The "demote" intrinsic has the semantics of D3D discard, which means it doesn't change the control flow, allowing derivatives to work. On A6xx there is no known way to check whether invocation was demoted, thus we use nir_lower_is_helper_invocation. Add "logical" OPC_DEMOTE which is later translated to "kill". Such separation is necessary to run "kill" specific optimizations which are invalid for "demote". Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9460>	2021-04-19 17:11:36 +00:00
Connor Abbott	08499369d0	ir3: Assemble and disassemble swz/gat/sct Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10291>	2021-04-19 16:10:44 +00:00

1 2

62 Commits