KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Danylo Piliaiev	1e33b6a32b	turnip: enable shaderInt16 We should have everything to enable it. 16b integer division is lowered by nir_lower_idiv. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10054>	2021-04-20 20:32:20 +00:00
Danylo Piliaiev	d918bbfa1c	ir3: treat 16b imul as mul.s24 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10054>	2021-04-20 20:32:20 +00:00
Rob Clark	5bf7475460	ir3: handle 16b op_i2b1 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10054>	2021-04-20 20:32:20 +00:00
Samuel Iglesias Gonsálvez	b2a60c157e	turnip: add LRZ early-z support Imported the logic from Freedreno driver. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7186>	2021-04-20 10:01:58 +00:00
Samuel Iglesias Gonsálvez	af049b6668	turnip: fix setting dynamic state mask for VK_DYNAMIC_STATE_STENCIL_OP_EXT case Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7186>	2021-04-20 10:01:58 +00:00
Samuel Iglesias Gonsálvez	88c7aa0b3e	turnip: group all geometry constant draw states in one Thus, we can free some draw state slots for future use. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7186>	2021-04-20 10:01:58 +00:00
Samuel Iglesias Gonsálvez	2c0c696f16	turnip: update LRZ state based on stencil test state Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7186>	2021-04-20 10:01:58 +00:00
Samuel Iglesias Gonsálvez	ff8e3547b3	turnip: implement LRZ direction There are some LRZ compare op switches that are not supported by the HW, like GREATER* <-> LESS* ones. This patch tracks the direction of the switch and disables LRZ if needed. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7186>	2021-04-20 10:01:58 +00:00
Eric Anholt	8a8e55d6a8	ci/freedreno: Test dEQP-EGL against Xorg. This should help us be able to refactor core EGL code with more confidence, and increase our confidence uprevving Mesa in ChromeOS. Part of #1884 Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10240>	2021-04-19 20:53:27 +00:00
Danylo Piliaiev	64367f2359	turnip: implement VK_KHR_shader_terminate_invocation OpTerminateInvocation provides the behavior required by the GLSL discard statement, which we already implement. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9460>	2021-04-19 17:11:36 +00:00
Danylo Piliaiev	9dd9424a85	turnip: implement VK_EXT_shader_demote_to_helper_invocation The "demote" intrinsic has the semantics of D3D discard, which means it doesn't change the control flow, allowing derivatives to work. On A6xx there is no known way to check whether invocation was demoted, thus we use nir_lower_is_helper_invocation. Add "logical" OPC_DEMOTE which is later translated to "kill". Such separation is necessary to run "kill" specific optimizations which are invalid for "demote". Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9460>	2021-04-19 17:11:36 +00:00
Connor Abbott	08499369d0	ir3: Assemble and disassemble swz/gat/sct Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10291>	2021-04-19 16:10:44 +00:00
Connor Abbott	d48d43039a	ir3: Improve cat1 modifier disassembly Remove bit that shouldn't be part of (rptN), and rewrite the handling of (even) and (pos_infinity) to uncover a missing (neg_infinity) modifier. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10291>	2021-04-19 16:10:44 +00:00
Connor Abbott	4c5b696cc3	ir3/parser: Fix oob write with immediates array immediates_count and immediates_size are supposed to have the same units, but it was only incrementing immediates_count by 1. While we're here, also fix the case where constants are specified out-of-order. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10291>	2021-04-19 16:10:44 +00:00
Rob Clark	c74d93cf01	freedreno/fdl: Re-indent Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10293>	2021-04-17 15:38:56 +00:00
Rob Clark	6050976232	freedreno/perfcntrs: Re-indent Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10293>	2021-04-17 15:38:56 +00:00
Rob Clark	d26a224ca9	freedreno/ir2: Re-indent clang-format -fallback-style=none --style=file -i src/freedreno/ir2/*.[ch] Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10293>	2021-04-17 15:38:56 +00:00
Rob Clark	2dbf09c2b4	freedreno/drm-shim: Re-indent clang-format -fallback-style=none --style=file -i src/freedreno/drm-shim/*.[ch] Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10293>	2021-04-17 15:38:56 +00:00
Rob Clark	45856c5fbc	freedreno/decode: Re-indent clang-format -fallback-style=none --style=file -i src/freedreno/decode/*.[ch] Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10293>	2021-04-17 15:38:56 +00:00
Rob Clark	3894bc9664	freedreno/computerator: Re-indent clang-format -fallback-style=none --style=file -i src/freedreno/computerator/*.[ch] Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10293>	2021-04-17 15:38:56 +00:00
Rob Clark	ccd68b672a	freedreno/common: Re-indent clang-format -fallback-style=none --style=file -i src/freedreno/common/*.[ch] Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10293>	2021-04-17 15:38:56 +00:00
Rob Clark	f5918f750f	freedreno/afuc: Re-indent clang-format -fallback-style=none --style=file -i src/freedreno/afuc/*.[ch] Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10293>	2021-04-17 15:38:56 +00:00
Rob Clark	b94db11708	freedreno/drm: Re-indent clang-format -fallback-style=none --style=file -i src/freedreno/drm/*.[ch] Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10293>	2021-04-17 15:38:56 +00:00
Eric Anholt	23159f1a7a	ci/freedreno: Skip some precision tests on a530. These have flaked as Timeouts in CI in the last month. .precision.* is generally very slow (some in the 15s-30s range), but it's unclear to me why they sometimes spike up to 60 seconds (thermal throttling?). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10274>	2021-04-16 04:34:14 +00:00
Eric Anholt	7d234da6ee	freedreno: Fix YUV sampler regression. We have to keep sampler uniforms around for later YUV lowering, and we only need to remove uniforms that take up storage space. Code comes from radeonsi. Closes: #4644. Fixes: `de17b4aab5` ("freedreno: Remove uniform variables after finalizing NIR.") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10246>	2021-04-15 16:20:15 +00:00
Michel Dänzer	d200f45875	Use explicit break instead of fall-through to break-only case clang generates a warning if there's no explicit break or fall-through annotation. The latter would be kind of silly in this case, and not robust against any future changes turning the fall-through invalid. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10220>	2021-04-15 16:01:22 +00:00
Michel Dänzer	2928c21eb7	Convert most remaining free-form fall-through comments to FALLTHROUGH One exception is src/amd/addrlib/, for which -Wimplicit-fallthrough is explicitly disabled. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10220>	2021-04-15 16:01:22 +00:00
Connor Abbott	cf727e6ba4	tu: Expose VK_EXT_robustness2 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7573>	2021-04-15 16:05:13 +02:00
Connor Abbott	0fb14420da	tu: Handle null descriptors Writing all 0's, including for the format, seems to work. Actually setting the format seems to break textureSize() (getsize returns 1 for some reason). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7573>	2021-04-15 16:05:13 +02:00
Connor Abbott	f58ece08da	tu: Handle robust UBO behavior for pushed UBO ranges If we push a UBO range but then find out at draw-time that part of the pushed range is out of range of the UBO descriptor, then we have to fill in the rest of the range with 0's to mimic the bounds-checking that ldc would've done. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7573>	2021-04-15 16:05:13 +02:00
Connor Abbott	cb02a48f83	tu: Correctly preserve old push descriptor contents We were never setting set->size, so we were always copying 0 bytes. But as we only copy the contents when the layout and therefore the size is the same, we don't have to take the old size into account anyway. This fixes some VK_EXT_robustness2 tests that use push descriptors. Fixes: `6d4f33e` ("turnip: initial implementation of VK_KHR_push_descriptor") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7573>	2021-04-15 16:05:13 +02:00
Connor Abbott	c68ea960a7	ir3, tu: Add compiler flag for robust UBO behavior This needs to be part of the compiler because it's the only piece that we always have access to in all the places ir3_optimize_loop() is called, and it's only enabled for the whole Vulkan device. Right now it's just used for constraining vectorization, but the next commit adds another use. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7573>	2021-04-15 16:05:11 +02:00
Connor Abbott	8f54028479	ir3: Reduce max const file indirect offset base to 9 bits This fixes dEQP-VK.robustness.robustness2.bind.notemplate.r32i.dontunroll.nonvolatile.uniform_buffer.no_fmt_qual.len_260.samples_1.1d.frag, which accesses the shader UBO with c<a0.x + 512> due to the constant data UBO coming before it in the const file. The len_256 variant has a smaller constant data UBO, so it uses c<a0.x + 256> instead, and that works, so 512 seems to be the real limit. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7573>	2021-04-15 16:03:54 +02:00
Connor Abbott	8e11f0560e	ir3: Fix list corruption in legalize_block() We forgot to remove the instruction under consideration from instr_list before inserting it into the block's list, which caused instr_list to become corrupted. This happened to work but caused further corruption in some rare scenarios. Fixes: `adf1659` ("freedreno/ir3: use standard list implementation") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7573>	2021-04-15 16:03:54 +02:00
Eric Anholt	6d510fd473	ci/freedreno: Merge a630 piglit to a single job. piglit_gl clocked in at 6:12 end-to-end runtime, and piglit_shader spent 2:53 in deqp-runner, so merging them together should be about 9 minutes. Removing a boot should save us a minute or two of runner time per pipeline. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10243>	2021-04-15 10:06:14 +00:00
Samuel Iglesias Gonsálvez	029bc53be6	turnip: fix typo in tu_CmdBeginRenderPass2() Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8615>	2021-04-15 09:51:25 +02:00
Samuel Iglesias Gonsálvez	d52917f858	turnip/lrz: added support for depth bounds test enable Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8615>	2021-04-15 09:51:25 +02:00
Samuel Iglesias Gonsálvez	2161aebf8d	turnip: document GRAS_LRZ_CNTL's UNK5 bitfield It is used by the blob to enable depth bounds test for LRZ. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8615>	2021-04-15 09:51:25 +02:00
Samuel Iglesias Gonsálvez	54cf12774a	turnip/lrz: add support for VK_EXT_extended_dynamic_state When the depth or stencil state changes dynamically, that might affect LRZ state and we need to recalculate it and emit it again. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8615>	2021-04-15 09:51:20 +02:00
Samuel Iglesias Gonsálvez	6d6cbb7361	turnip: refactor how LRZ state is calculated Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8615>	2021-04-15 09:50:51 +02:00
Samuel Iglesias Gonsálvez	43ebba4e88	turnip: initialize pipeline->rb_{stencil,depth}_cntl always This change will simplify further changes on LRZ state management. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8615>	2021-04-15 09:50:51 +02:00
Samuel Iglesias Gonsálvez	1f9fb7677b	turnip: move pipeline gras_su and rb{stencil,depth}_cntl_mask initialization Move them up, so they are initialized even when the dynamic state is not used. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8615>	2021-04-15 09:50:51 +02:00
Rob Clark	31782330da	freedreno: Add missing foreach macros and update indentation Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10247>	2021-04-14 16:53:26 -07:00
Rob Clark	2fb3984805	freedreno: Add .clang-format Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8883>	2021-04-14 19:52:21 +00:00
Connor Abbott	2deead184c	ir3/sched: Don't schedule too many tex/SFU instructions Consider a simple loop that does a series of texture instructions and then reduces the results: vec4 sum = vec4(0); for (int i = 0; i < N; i++) { sum += texture(...); } Assume that the loop is unrolled and we schedule the resulting basic block. Right now, after we schedule the first texture instruction, the only instructions available to schedule that don't incur a sync are the instructions to setup the second texture instruction. So we keep picking the texture instructions, no matter how large N is, resulting in a pathological schedule for register pressure when N is very large: sum1 = texture(...); sum2 = texture(...); sum3 = texture(...); ... sum = sum1 + sum2 + sum3 + ...; In particular this happens with some CTS tests for VK_EXT_robustness2, where a loop like that with many iterations is marked as [[unroll]], forcing NIR to unroll it. This solution is a balance between the current approach and always scheduling for register pressure (and ignoring sync's). We only allow a certain number of texture fetches to be in flight before considering textures to "sync", even though they don't really, both because they likely will sync in reality (overflowing the internal queue of waiting texture instructions) and because at some point we need the normal algorithm to kick in and start lowering register pressure. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7571>	2021-04-14 17:33:58 +00:00
Connor Abbott	7821e5a3f8	ir3/sched: Don't penalize uses of already-waited tex/SFU Once we insert a use of a given tex or SFU instruction, then we must wait for that tex/SFU instruction (as well as all earlier ones) to complete, so we shouldn't penalize further uses, even if a subsequent tex/SFU instruction gets scheduled after the first use. This especially matters after the next commit when we start forcibly breaking up long sequences of texture instructions, since if we schedule a group of 8 texture instructions then we want to schedule the uses of those instructions in parallel with the next 8 texture instructions to reduce register pressure. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7571>	2021-04-14 17:33:58 +00:00
Michel Dänzer	af0fde955c	ci: Move docker images from Debian buster to bullseye Among other things, this gets us GCC 10 (was 6). Requires some changes to third party components we use: * Install apitrace (& waffle) from Debian; was hitting issues with the local build, and it's the same version 9.0 anyway. * Update Fossilize to a newer commit which builds with GCC 10. * apt.llvm.org repositories are no longer needed. * Use an SPIRV-LLVM-Translator commit which builds with LLVM 11.0.1. * Install XCB packages from Debian, 1.13 fails to build with Python 3.9. * Install wayland-protocols from Debian, 1.12 is too old for libgtk-3-dev in bullseye. LLVM 7/8 packages are no longer available. Also adapt expected test results to Xvfb now exposing multi-samle GLXFBConfigs. v2: * Install clang instead of clang-11. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3124 Reviewed-by: Eric Anholt <eric@anholt.net> # v1 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9833>	2021-04-14 13:05:08 +00:00
Connor Abbott	271c18f48e	tu: Expose VK_KHR_relaxed_block_layout This was absorbed into Vulkan 1.1, but we forgot to expose it separately. It's a subset of what's allowed by VK_EXT_scalar_block_layout. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8695>	2021-04-14 11:48:38 +00:00
Connor Abbott	765c3b85a5	tu: Expose VK_KHR_spirv_1_4 and VK_EXT_scalar_block_layout VK_KHR_spirv_1_4 is trivial because vtn already supports all the added SPIR-V features that aren't gated behind Vulkan extensions. I've observed some robustness2 CTS tests requiring this. However there are a few tests currently failing due to lacking spilling. VK_EXT_scalar_block_layout should also be trivial, since support for "straddling" UBO loads was added recently for other reasons. This is used by every robustness2 CTS test. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8695>	2021-04-14 11:48:38 +00:00
Juan A. Suarez Romero	9e5762c387	ci: Update VK-GL-CTS to 1.2.6.0 v2: - Bump up MESA_ROOTFS_TAG instead of arm_build (Michel) Acked-by: Michel Dänzer <mdaenzer@redhat.com> Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10136>	2021-04-14 08:06:55 +00:00
Marek Olšák	fb29cef8dd	nir: add many passes that lower and optimize 16-bit input/outputs and samplers Added: * a pass that renumbers bases of IO intrinsics * a pass that converts mediump IO to 16 bits, optionally using the new packed varying slots * a pass that sets (forces) mediump in IO intrinsics (for testing) * a pass that remaps VARYING_SLOT_VAR[0..15]_16BIT to VARYING_SLOT_VAR[0..31] (if some shader stages don't want packed varyings) * a pass that folds type conversions around texture opcodes into those opcodes (e.g. tex(f2f32(coord), ..) is changed into tex accepting f16) * a pass that changes (legalizes) sampler src and dst types based on specified hw constraints (e.g. derivatives must be the same type as coordinates) Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9050>	2021-04-13 05:07:42 +00:00
Rhys Perry	a2619b97f5	nir/lower_idiv: add options to use fp32 for 8-bit division lowering Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10081>	2021-04-12 16:19:46 +00:00
Danylo Piliaiev	16fd5bd996	turnip: support copying both aspects of D32_SFLOAT_S8_UINT We cannot copy both aspects at the same time, so copy them one by one. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10140>	2021-04-12 14:36:30 +00:00
Connor Abbott	ba796d5115	ir3/postsched: Make sure to schedule inputs before kill Before, we would prefer to schedule inputs before kills, which works assuming that the live range of the bary_ij system value don't get split and therefore all bary.f are ready at the start of the block. However live range splitting can mess up that assumption and cause a kill to get scheduled before a move that leads to a bary.f. This fixes even e.g. dEQP-GLES2.functional.shaders.discard.basic_always on a3xx before introducing CSE of collect instructions, but even after that it could be a problem theoretically as the register allocator doesn't guarantee that any live ranges aren't split. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10143>	2021-04-09 16:31:29 +00:00
Matt Turner	4251e9cddf	ir3: Don't count (nopX) towards the wrong category Prior to this commit (nop3) mad.f32 r0.y, c0.x, r1.w, c0.y was counted as 4 cat3 instructions (and still 3 cat0/nops) in shader-db results. With this change, it is counted as only 1 cat3 instruction. Probably never going to have better shader-db results than this in my career: total cat2 in shared programs: 1214667 -> 732058 (-39.73%) cat2 in affected programs: 1194729 -> 712120 (-40.39%) helped: 8551 HURT: 0 total cat3 in shared programs: 376448 -> 274745 (-27.02%) cat3 in affected programs: 344918 -> 243215 (-29.49%) helped: 7222 HURT: 0 Reviewed-by: Rob Clark <robdclark@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10116>	2021-04-09 14:26:35 +00:00
Bas Nieuwenhuizen	4ca4de50f7	nir: Remove nir_shader->shared_size. The same info is in shader_info. Dedupe. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10094>	2021-04-08 14:39:28 +00:00
Chad Versace	5e6db19168	anv: Remove vkCreateDmaBufINTEL (v4) Superceded by VK_EXT_image_drm_format_modifier. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (v1) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (v4) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/1466>	2021-04-08 14:15:55 +00:00
Chad Versace	0845cabc72	vulkan: Track dependencies of Python imports The meson.build was unaware of transitive dependencies introduced by Python imports. Android still needs fixing. But I did not update the Android files lest I break the build. Ideally, we would fix this by using a Python runner that generates a depfile, similar to how meson creates depfiles for C files by passing flags -MD -MQ -MF to gcc. But this patch gets the job done, without stalling on the ideal general solution, by manually tracking the Python imports in new 'foo_depend_files' variables. CC: mesa-stable@lists.freedesktop.org Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/1466>	2021-04-08 14:15:54 +00:00
Connor Abbott	5a70c4d4a0	ir3: Don't copy propagate arrays in ir3_cp We don't check whether there's an intervening write in this pass, which makes it incorrect. ir3_cp_postsched does check correctly, but we were accidentally doing it here anyway for some sources. While we're here, delete some code that was only used in the array case. Fixes: `f370e954` ("freedreno/ir3: handle const/immed/abs/neg in cp") Reviewed-by: Rob Clark <robdclark@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10076>	2021-04-07 14:35:13 +00:00
Connor Abbott	1ad5ee5a04	ir3/cp_postsched: Set address of uses for relative mov's Fixes: `680ca5b` ("freedreno/ir3: add post-scheduler cp pass") Reviewed-by: Rob Clark <robdclark@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10076>	2021-04-07 14:35:13 +00:00
Connor Abbott	dcc26a3945	ir3: Fix valid flags for STIB Disallow immediates for the source. This was hidden by the fact that we didn't copy-propagate trivial collect instructions. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Rob Clark <robdclark@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10076>	2021-04-07 14:35:13 +00:00
Connor Abbott	94beaa1d92	ir3/legalize: Fix last input (ss) insertion If there was a mix of ldlv and bary.f and we inserted an (ss) after the last input which was a bary.f, then last_input_needs_ss would get unset, even though it shouldn't. For figuring out whether we need the (ss), we need to know whether there are any pending ldlv's when last_input gets executed, not at the end of the block, which means that the existing code's strategy of inserting it after the whole block has been processed won't work. Rework it to do the last_input processing in the main loop instead. Reviewed-by: Rob Clark <robdclark@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10076>	2021-04-07 14:35:13 +00:00
Connor Abbott	35ffe4fec1	freedreno/a3xx: Fix SP_FS_CTRL_REG1_INITIALOUTSTANDING Unfortunately this didn't fix anything, but I thought I might as well include it. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Rob Clark <robdclark@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10076>	2021-04-07 14:35:13 +00:00
Danylo Piliaiev	519eb735a3	turnip: implement variableMultisampleRate If subpass doesn't have depth/color attachments - samples count is devised from VkPipelineMultisampleStateCreateInfo::rasterizationSamples. Without variableMultisampleRate enabled all pipelines in such subpass should have the same samples count; variableMultisampleRate allows to have pipelines with different number of samples in one subpass, given that it doesn't have depth/color attachments. Blob doesn't have it enabled but there is no known reason for this. Passes: dEQP-VK.pipeline.multisample.variable_rate.* Fixes test: dEQP-VK.pipeline.framebuffer_attachment.no_attachments_ms Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9556>	2021-04-07 12:04:45 +00:00
Alejandro Piñeiro	1e0a69afa7	vulkan: track number of bindings instead of max binding for CreateDescriptorSetLayout As that handles better, and more clear, the case of bindingCount being zero. For the case of Anvil and Turnip, this avoids allocating a non-needed binding when bindingCount is zero. Inspired on radv, that was what it was doing so far. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4526 Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9905>	2021-04-05 20:17:53 +00:00
Danylo Piliaiev	0709a6b363	turnip: fix alignment of non-32b types in workgroup memory Fixes tests: dEQP-VK.spirv_assembly.instruction.compute.workgroup_memory.float16 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10039>	2021-04-05 17:31:11 +00:00
Alyssa Rosenzweig	06ebbde630	vulkan: Deduplicate mesa stage conversion Across every driver... v2: Add casts to appease -fpermissive used on CI. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9477>	2021-04-03 17:34:39 +00:00
Danylo Piliaiev	0ec495e3c9	turnip: handle format list for compressed formats Compressed formats may have compatible formats, however they could only be sampled, so we should not call tu6_format_color with them. tu6_format_texture should have the same behaviour for checking swap so use it for all cases. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10009>	2021-04-02 21:52:05 +00:00
Eric Anholt	f67b6f9c47	ci/freedreno: Fix up the a5xx border color flake annotation. Looks like I put it in the wrong file back when I first caught it. It's a one-or-twice-a-week back flake that seems to happen. The upcoming deqp-runner uprev would have caught this mistake. Fixes: `957132294f` ("ci/a5xx: Increase the gles3/31 coverage.") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9806>	2021-04-02 18:42:04 +00:00
Eric Anholt	adf04d1af4	ci/freedreno: Switch to the trimmed glxgears trace. The old one had a ton of frames and took ~5 minutes on a306. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9957>	2021-04-01 21:04:11 +00:00
Eric Anholt	fe5349f70c	freedreno/a6xx: Fix alpha tests. Apparently I inverted the sense of this flag back when we didn't have piglit testing. Fixes terrible rendering in minetest, HL2, CS:Source, and CS. Fixes: `0369dd9077` ("freedreno/a6xx: Add ARB_depth_clamp and separate clamp support.") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9957>	2021-04-01 21:04:11 +00:00
Eric Anholt	3043940183	freedreno/a5xx: Fix alpha test vs early Z bugs. Just like with discards, we have to disable early Z writes when alpha test is enabled. Fixes rendering on HL2, CS: Source, counter-strike, and minetest. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9957>	2021-04-01 21:04:11 +00:00
Eric Anholt	c9fd8c2570	ci/freedreno: Add trace testing on a3xx, a5xx. Having compared rendering between a6xx and these, I found several bugs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9957>	2021-04-01 21:04:11 +00:00
Eric Anholt	8e3a1d0dd2	ci/freedreno: Rename a306-test and a530-test to drop "arm64" from the name. We don't have an armhf variant, and probably won't. Now matches a630. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9957>	2021-04-01 21:04:11 +00:00
Eric Anholt	ec54546b2a	ci/freedreno: Add more new traces for a630 (minetest, TDM, pioneer, glyphy). These are all recent traces that have been added. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9957>	2021-04-01 21:04:11 +00:00
Danylo Piliaiev	ce1a381e57	turnip: enable VK_KHR_16bit_storage on A650 A650 can use the same SSBO descriptor for both 32-bit and 16-bit access, which makes it easy to enable this extension. Passes tests that run under: dEQP-VK.spirv_assembly.instruction..16bit_storage. Rebased and modified commit from Jonathan Marek. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9840>	2021-04-01 17:51:07 +00:00
Jonathan Marek	14acc64c3b	turnip: enable VK_KHR_shader_float16_int8 ir3 supports 16-bit floats, so we can enable this. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9840>	2021-04-01 17:51:07 +00:00
Danylo Piliaiev	64aaa4afc3	turnip: enable infinities for f16 math and document the register When float16 is enabled this will allow to pass a number of float16 tests. When A6XX_SP_FLOAT_CNTL_F16_NO_INF is set - all operations which generate +-infinity generate +-MAX_HALF_FLOAT. Fixes some tests from: dEQP-VK.spirv_assembly.instruction..float16. dEQP-VK.spirv_assembly.instruction..float_controls.fp16. E.g.: dEQP-VK.spirv_assembly.instruction.graphics.float16.arithmetic_1.sinh_vert dEQP-VK.spirv_assembly.instruction.compute.float16.arithmetic_4.length dEQP-VK.spirv_assembly.instruction.compute.float_controls.fp16.input_args.log_denorm_flush_to_zero_nostorage dEQP-VK.spirv_assembly.instruction.compute.float_controls.fp16.input_args.log2_denorm_flush_to_zero_nostorage dEQP-VK.spirv_assembly.instruction.compute.float_controls.fp16.input_args.inv_sqrt_denorm_flush_to_zero_nostorage Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9840>	2021-04-01 17:51:07 +00:00
Danylo Piliaiev	14460faa64	ir3: convert shift amount to 16b for 16b shifts NIR has shifts defined as: opcode("*shr", 0, tuint, [0, 0], [tuint, tuint32], False, ... However, in ir3 we have to ensure that both operators of shift instruction have the same bitness. Let's hope that in future the additional COV for constants would be optimized away. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9840>	2021-04-01 17:51:07 +00:00
Jonathan Marek	3777ecdf11	turnip: implement VK_KHR_shader_float_controls This matches the blob and doesn't require actually implementing controls since the supported modes are just what the HW does. Passes tests under: dEQP-VK.spirv_assembly..float_controls. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9840>	2021-04-01 17:51:07 +00:00
Danylo Piliaiev	de195671bd	ir3: nir_op_f2f16 should round to even cat1 instructions round to zero by default. When fp16 is enabled this will fix: dEQP-VK.spirv_assembly.instruction.graphics.float_controls.fp16.input_args.rounding_rte_conv_from_fp32_nostorage_frag dEQP-VK.spirv_assembly.instruction.graphics.float_controls.fp16.input_args.rounding_rte_conv_from_fp32_nostorage_vert dEQP-VK.spirv_assembly.instruction.compute.float_controls.fp16.input_args.rounding_rte_conv_from_fp32_nostorage Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9840>	2021-04-01 17:51:07 +00:00
Michel Dänzer	6652c5018c	ci: Merge ARM testing docker images to a single arm_test one The merged image contains kernels & rootfs for both arm64 & armhf baremetal test jobs, and is smaller than either arm{64,hf}_test image before. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9955>	2021-04-01 16:35:26 +00:00
Michel Dänzer	4b20bd7425	ci: Build ARM baremetal rootfs in native container Doing so in an x86 container via qemu was slow, and started failing recently after updating to a newer qemu version. This also results in smaller arm_test docker images, since we need to install fewer Debian packages in them. As a bonus, this turns some piglit tests from fail to pass (Or maybe they'll turn out to be flakes? They've passed at least 3 times in a row). Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9955>	2021-04-01 16:35:26 +00:00
Eric Anholt	0be9a40225	ci/freedreno: Demote a630-asan to a manual test for now. It's flaky in producing Missing results. I've got an uprev that should avoid the issue (and possibly a followon actual fix), but it's blocked on being able to rebuild the arm containers. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9932>	2021-03-31 17:15:27 +00:00
Danylo Piliaiev	00d6ccebf9	ir3/isa: account for randomly set by blob lowest bit of ibo atomics As far as I could see - blob randomly sets the lowest bit of atomic.b.* instructions. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9841>	2021-03-31 15:03:35 +00:00
Bas Nieuwenhuizen	83c92a48b7	vulkan: Fix descriptor set creation with zero bindings. MAX2(count * struct size, 1) results in 1 for count=0, not the size of a struct. Since this MAX only seems to exist so we can keep using NULL for error reporting, just refactor to return a VkResult. Fixes: `ad241b15a9` ("vk: consolidate dynamic descriptor binding sorting") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4522 Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9880>	2021-03-29 23:32:50 +00:00
Matt Turner	0b35987895	tu: Skip tu_tiling_config_update_tile_layout() if not using gmem Otherwise pass->tile_align_w will be 0, leading to a divide by zero and undefined behavior. In practice, I saw this lead to an infinite loop in tests like dEQP-VK.draw.instanced.draw_indexed_indirect_vk_primitive_topology_line_list_attrib_divisor_0_multiview Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9606>	2021-03-29 21:58:24 +00:00
Eric Anholt	99838513ae	freedreno/a5xx: Add support for clip distances and use them for userclip. A little low-stakes RE effort as I unwind from fighting CI all day. Comes from diffing dEQP-VK.clipping.user_defined.clip_distance.vert.* on the blob and comparing to a6xx behavior. (My blob doesn't do tess, so if there are equivalent tess fields for some of these, I didn't find them) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9870>	2021-03-29 21:24:16 +00:00
Vinson Lee	a5d5cbdf08	freedreno: Fix file descriptor leak. Fix defect reported by Coverity Scan. Resource leak (RESOURCE_LEAK) leaked_handle: Handle variable fd going out of scope leaks the handle. Fixes: `5a13507164` ("freedreno/perfcntrs: add fdperf") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9889>	2021-03-29 17:08:56 +00:00
Danylo Piliaiev	2087168a30	turnip,ir3: account for dispatch group offsets Fixes tests: dEQP-VK.compute.device_group.dispatch_base Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9339>	2021-03-29 14:31:44 +03:00
Eric Anholt	1e8792ea5f	freedreno/a6xx: Use the frontend userclip lowering. This ends up being way more piglit-conformant than our backend lowering. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9815>	2021-03-26 20:51:18 +00:00
Danylo Piliaiev	243724031b	turnip: clamp to zero negative upper left corner of viewport We cannot send negative viewport coordinates to the hardware, so clamp them since negative min.x/y is valid per spec. The negative origin still counts in calculations of guardband. Fixes crash in 3DMark's "Sling Shot Extreme" test. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9629>	2021-03-25 17:57:59 +00:00
Danylo Piliaiev	56909868cd	turnip: implement VK_KHR_pipeline_executable_properties Loosely based on ANV implementation. For executable's internal representation we output: - Initial NIR after spirv_to_nir - Final optimized NIR - IR3 disassembly Note, that vkGetPipelineExecutablePropertiesKHR is required to return executable properties even if pipeline was not created with CAPTURE_STATISTICS or CAPTURE_INTERNAL_REPRESENTATIONS bits set. So the executables array is unconditionally populated, however NIR and IR3 disassemlies are filled only when CAPTURE_INTERNAL_REPRESENTATIONS is set. Passes dEQP-VK.pipeline.executable_properties.* Works with RenderDoc. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8877>	2021-03-25 13:53:33 +00:00
Tomeu Vizoso	f64ef064de	ci/fdo: Use trimmed traces for Valve games Now that we have trimmed versions of them, we can run them in a more reasonable amount of time. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9771>	2021-03-24 20:54:58 +00:00
Eric Anholt	6eee6769e9	turnip: Fix KGSL build since common dispatch rework. Fixes: `59d70c47c7` ("turnip: Use the common dispatch framework") Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9764>	2021-03-24 17:25:07 +00:00
Samuel Pitoiset	2c2ea54020	turnip: use common entrypoints for VK_KHR_create_renderpass2 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9601>	2021-03-24 11:21:53 +00:00
Rob Clark	e7202e889b	freedreno: Split out devicetree helpers The freedreno pps datasource is going to need the same, so split out helpers that can be re-used. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9758>	2021-03-22 20:46:17 +00:00
Rob Clark	9479ae9761	freedreno/fdperf: Use os_read_file() Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9758>	2021-03-22 20:46:17 +00:00
Rob Clark	5871f4177c	freedreno: Make headers C++ happy We'll need a few of these for the C++ based gfx-pps performance counter collector datasource. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9758>	2021-03-22 20:46:17 +00:00
Eric Anholt	431b0ef9ee	freedreno/a6xx: Rename the RB_BLIT_INFO.INTEGER field to SAMPLE_0. As @samuelig found, this is the field for disabling sample averaging and using sample 0 instead. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9661>	2021-03-22 19:07:08 +00:00
Danylo Piliaiev	a5b37c64d1	turnip: expose several already implemented extensions They were promoted to Vulkan 1.1 and we already support them. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9757>	2021-03-22 18:20:57 +00:00
Connor Abbott	d8a2abe348	freedreno/computerator: Add script for finding reg file size This helps with finding the various parameters introduced in the last commit. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9498>	2021-03-22 18:03:16 +00:00
Connor Abbott	d274649799	freedreno/computerator: Use threadsize calculated by ir3 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9498>	2021-03-22 18:03:16 +00:00
Connor Abbott	7ecc70b31c	turnip: Use threadsize calculated by ir3 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9498>	2021-03-22 18:03:16 +00:00
Connor Abbott	fd7960e191	ir3: Calcuate max_waves and threadsize max_waves is just for shader-db stats for now, but threadsize will replace the various mechanisms used to determine threadsize across the different gen's. Calculating these correctly entails adding a bunch of details about the sizes of various things to ir3. In the future we will use the guts of the max_waves calculation to inform RA decisions as well, which is why the max_waves calculation is broken up into register dependent/independent pieces. Something should be said about the units of reg_size_vec4. These units were chosen for two reasons: 1. As said in the comment, it makes some calculations easier. 2. For a4xx/a5xx, where we don't know as much because we haven't done the same sorts of experiments to probe for the HW configuration, it corresponds more directly to things that are known. The existing code switches to the smaller threadsize when r24.x or higher is used, which translates directly to a reg_size_vec4 of 48. If we chose different units (e.g. multiplying by wave_granularity and/or threadsize_base), then to match the same behavior we'd have to set reg_size_vec4 based on some other parameters that aren't 100% known. If someone comes along and updates them, they might inadvertantly break it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9498>	2021-03-22 18:03:16 +00:00
Connor Abbott	cbc68c79a5	freedreno: Add local_size to ir3_shader_variant We want to use the local_size when available to calculate the threadsize in ir3, and we need it to work with e.g. computerator where we don't have a nir shader. Add a local_size field and use that in computerator instead of of a separate structure that's inaccessable to core ir3. Also set a dummy local_size in the tests to avoid a divide-by-zero. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9498>	2021-03-22 18:03:16 +00:00
Mike Blumenkrantz	ad241b15a9	vk: consolidate dynamic descriptor binding sorting this code was duplicated across several drivers Reviewed-by: Adam Jackson <ajax@redhat.com> turnip changes Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9480>	2021-03-22 16:51:55 +00:00
Danylo Piliaiev	208250b376	ir3: update info about applicability of saturation modifier On a6xx saturation doesn't work on cat4 and on bary.f Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9751>	2021-03-22 15:02:14 +00:00
Rob Clark	9aef029635	freedreno/ir3: Precompute whether we need driver-params To save a bit of extra math in the draw-path. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9581>	2021-03-20 12:13:09 -07:00
Rob Clark	b5e1e99da1	freedreno/drm: Inline iova calculation The shift/or are frequently zero, so this lets the compiler optimize out some draw-overhead hotpath. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9581>	2021-03-20 12:13:09 -07:00
Rob Clark	93d5349fa5	freedreno/drm: Move emit_reloc_tail to head Get this out of the way first to avoid some register push/pop. Only reloc->bo is needed after writing the address into cmdstream, so this turns msm_submit_append_bo() into a tail call. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9581>	2021-03-20 12:13:09 -07:00
Rob Clark	684586b96e	freedreno/drm: Split 64b vs 32b paths No need to 'if (gpu_id >= 500)' on every reloc Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9581>	2021-03-20 12:13:09 -07:00
Rob Clark	9168d9cbfb	freedreno/drm: Split softpin "reloc" functions "OBJECT" rb's are long lived, and generating them is not a hotpath, but relocs to "STREAMING" rb's are a hot path. But we can decouple these. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9581>	2021-03-20 12:13:08 -07:00
Eric Anholt	4eb7c4d60c	ci/freedreno: Mark all of dEQP TF as flaky. I keep working on stabilizing it, but no luck yet. Stop blocking CI on our flakes. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9715>	2021-03-19 22:07:57 +00:00
Danylo Piliaiev	9efec45b0c	ir3: disallow .sat on SEL instructions Saturation is unsupported on SEL instructions. Fixes main menu rendering in Genshin Impact. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9666>	2021-03-19 17:09:07 +00:00
Eric Anholt	5da520cf3d	freedreno/ir3: Demote centroid usage to pixel on non-msaa. Like with the sample qualifier on all GPUs, use pixel on older HW when MSAA rasterization is disabled to get reliable results. Since I ran many CI jobs on this, this updates the A530 TF flakes list, though I don't think that this MR necessarily made it flakier (we were already struggling on a5xx TF, which was what was motivating me to look at this!) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9641>	2021-03-18 10:46:09 -07:00
Danylo Piliaiev	b804abd61d	freedreno/isa: assert if field's range is out of bitset's range Also, update outdated comment along the way. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9628>	2021-03-17 12:07:54 +00:00
Danylo Piliaiev	42c81e1901	ir3: match mova1 mnemonic when writing to A1 For MOV to A1 blob uses "mova1" mnemonic, which is mov.u16u16; change s16 to u16 when creating MOV to A1 in order to match the blob. Before, couldn't be parsed back: mov.s16s16 ha0.y, 0 After, could be parsed back and matches blob behaviour: mova1 a1.x, 0 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9628>	2021-03-17 12:07:54 +00:00
Danylo Piliaiev	c0a62b203e	ir3/isa,parser: fix encoding and parsing of bindless s2en SAM Before, decoding showed that there is an error: sam.base0 (f32)(xyzw)r0.x, r0.z, a1.x ; no field 'HAS_SAMP', WARNING: unexpected bits[0:7] in #cat5-samp-s2en-bindless-a1: 0x1 vs 0x0 After: sam.base0 (f32)(xyzw)r0.x, r0.z, s#1, a1.x Fixes textures on the ground in TauCeti Vulkan Technology Benchmark Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9628>	2021-03-17 12:07:54 +00:00
Samuel Iglesias Gonsálvez	0acd7df67b	turnip: set depth plane control zmode to A6XX_LATE_Z when sample mask is written Otherwise, gl_SampleMask[] writes are ignored and the stencil test works like if all samples were enabled. Fixes: dEQP-VK.renderpass.suballocation.multisample.s8 Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9478>	2021-03-17 09:05:33 +00:00
Iago Toral Quiroga	1e4abf1fe3	vulkan/util: call glsl_type_singleton_init_or_ref from vk_instance_init v2: link libvulkan_util with libglsl so it can find the glsl singleton symbols. v3: link with libcompiler instead of libglsl (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Dylan Baker <dylan.c.baker@intel.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> for the v3dv bits. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> for the turnip bits. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> for the radv bits. Acked-by: Dave Airlie <airlied@redhat.com> for the lvp bits. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9457>	2021-03-17 08:15:36 +01:00
Hyunjun Ko	d9fcf5de55	turnip: Enable nonuniform descriptor indexing Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9125>	2021-03-17 01:09:30 +00:00
Hyunjun Ko	e9fd2a2a58	ir3: Add nonuniform encodings to ir3 encoder and parser By keeping track of nonuniform access from nir and storing it to ir3. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9125>	2021-03-17 01:09:30 +00:00
Hyunjun Ko	433cdd1cff	ir3: fix has_src() to return correctly in ir3_nir_lower_tex_prefetch This seems to be originally introduced from `2a0d45ae6c`, and `562aaea07c` misused the method. Fixes: `2a0d45ae6c` "freedreno/ir3: Add a NIR pass to select tex instructions eligible for pre-fetch" Fixes: `562aaea07c` "freedreno/ir3: respect tex prefetch limits" Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9125>	2021-03-17 01:09:30 +00:00
Hyunjun Ko	e0e55b181f	turnip: Return correct value of tu6_load_state_size The state of active_desc_sets in pipeline should be set before allocation of the pipeline so we get correct size of descriptor sets and reserve enough space upfront. Otherwise we might hit assert(pipeline->cs.bo_count == 1). Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9125>	2021-03-17 01:09:30 +00:00
Danylo Piliaiev	e767208069	ir3: fix oob access to regs array for getbuf,getinfo,rgetinfo Since they have zero source registers, src->regs[1] is out of bounds. It probably wasn't able to cause any harm, but it's always better be safe. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4209 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9609>	2021-03-16 22:36:12 +00:00
Eric Anholt	f3a7a8a4dc	ci/freedreno: Switch the piglit testing to the new piglit runner. Getting piglit to fit onto our test devices was proving difficult, and we need the ability to handle flakes, so switch to the rust piglit runner that @pepp wrote as part of the deqp-runner repo which gives us flake detection, sharding across boards, fractional runs, and almost half the runtime. It doesn't handle piglit subtests yet, but if you can't run piglit's python on your devices because it's too bloated and unstable, this is a way forward. Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9468>	2021-03-16 22:19:30 +00:00
Eric Anholt	739486de2f	freedreno/a5xx: Fix the max texture buffer size. The GLES minmax is 65536. The blob vulkan exposes 65536 on both a5xx and a6xx, but try just doing the same as we do for a6xx. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9617>	2021-03-16 16:15:48 +00:00
Eric Anholt	b93d21810a	freedreno/a5xx: Fix the texel buffer alignment requirement. Info comes from the a540 vulkan blob driver minTexelBufferOffsetAlignment. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9617>	2021-03-16 16:15:48 +00:00
Danylo Piliaiev	b8ca39a80d	turnip: implement intrinsic_vulkan_resource_reindex Descriptor arrays are continuous, so it's just an addition of offset. Fixes test: dEQP-VK.spirv_assembly.instruction.compute.variable_pointers.dynamic_offset.select_descriptor_array Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9495>	2021-03-15 23:56:26 +00:00
Eric Anholt	3dc8102420	ci/freedreno: Add three more a5xx flakes from the last day. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9575>	2021-03-15 22:45:13 +00:00
Mike Blumenkrantz	71b17149e8	tu: use common interfaces for shader modules Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9508>	2021-03-15 21:47:44 +00:00
Danylo Piliaiev	914e7a7f73	turnip: set zmode to A6XX_EARLY_Z if FS forces early fragment test Specifying "early_fragment_tests" in fragment shader takes precedence over our internal conditions. Fixes test: dEQP-VK.fragment_operations.early_fragment.early_fragment_tests_stencil Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9569>	2021-03-12 20:11:28 +00:00
Danylo Piliaiev	1a2f1e3f47	turnip: fill VkMemoryDedicatedRequirements We support VK_KHR_dedicated_allocation so we must fill VkMemoryDedicatedRequirements. Vulkan spec states: "[...] requiresDedicatedAllocation may be VK_TRUE under one of the following conditions: The pNext chain of VkImageCreateInfo for the call to vkCreateImage used to create the image being queried included a VkExternalMemoryImageCreateInfo structure, and any of the handle types specified in VkExternalMemoryImageCreateInfo::handleTypes requires dedicated allocation, as reported by vkGetPhysicalDeviceImageFormatProperties2 in VkExternalImageFormatProperties::externalMemoryProperties.externalMemoryFeatures, the requiresDedicatedAllocation field will be set to VK_TRUE." All handle types require dedicated allocation at the moment. Fixes: dEQP-VK.api.external.memory.opaque_fd.dedicated.image.info dEQP-VK.memory.requirements.dedicated_allocation.buffer.regular dEQP-VK.memory.requirements.dedicated_allocation.image.transient_tiling_optimal Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9086>	2021-03-12 11:56:47 +02:00
Danylo Piliaiev	ae3b95daa7	turnip: lower device index to zero Vulkan 1.1 has VK_KHR_device_group and VK_KHR_device_group_creation promoted to core, thus we should handle DeviceIndex built-in. While we are here, also add these extensions to the extensions list, even though they are not doing anything useful. Fixes test: dEQP-VK.compute.device_group.device_index Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9516>	2021-03-11 21:12:52 +00:00
Connor Abbott	ee1f140fd9	freedreno/a6xx: Cleanup SP_XS_CTRL_REG0 definitions The registers were actually different per-stage even though we used the same type, which resulted in a bunch of incorrectly programmed fields and confusion. Move the stage-specific values to the registers themselves, which makes things much less confusing and makes it possible to set "mergedregs" correctly. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9493>	2021-03-11 20:58:39 +00:00
Connor Abbott	9a5596d679	freedreno/registers: Handle typed registers with fields When a bitset is "inline" it should act as-if the its fields were inserted into the register itself. However when initializing the register's bitfield we weren't doing a deep copy of the inline bitfield, so if the register defined additional fields then they would get added to the original inline bitfield and any further registers with the same type would get them. Fix this. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9493>	2021-03-11 20:58:39 +00:00
Connor Abbott	1d8bf2d0bf	freedreno/computerator: Fix thrsz type And use it for the other thread size field, too Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9493>	2021-03-11 20:58:39 +00:00
Yannik Marek	369f9d225d	turnip: fix alpha to coverage in no color and unused attachment cases In cases where the alpha coverage is enabled but the color attachment is either unused or absent there should be a dummy mrt to make the draw behave correctly. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Yannik Marek <yannik@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8952>	2021-03-10 22:02:43 +00:00
Matt Turner	6ceb6b509e	turnip: Remove unused TU_DEBUG_IR3 flag Replaced by IR3_SHADER_DEBUG=disasm,{vs,...,cs} and unused since the commit referenced below. Fixes: `808992fc50` ("tu: Use the ir3 shader API") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8249>	2021-03-10 18:59:22 +00:00
Eric Anholt	eba1b2a1ba	ci/freedreno: Mark another a5xx TF flake. Showed up with an iommu fault preceding it each time it failed. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9488>	2021-03-10 18:44:16 +00:00
Jason Ekstrand	4fb6c051c9	anv: Move vk_format helpers to common code The Android ones we put in anv_android.c. Maybe one day we'll want a vk_android.h to put some common Android stuff but, for now, let's keep it contained to ANV's android code. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8857>	2021-03-10 18:17:31 +00:00
Jason Ekstrand	2523c47720	turnip: Move the CreateRenderPass wrapper to common code Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8857>	2021-03-10 18:17:31 +00:00
Danylo Piliaiev	2764cf8d32	ir3: use OPC_GETBUF to get size of sampler buffers The maximum value which OPC_GETSIZE could return for one dimension is 0x007ff0, however sampler buffer could be much bigger. Blob uses OPC_GETBUF for them. Fixes tests: dEQP-VK.memory.pipeline_barrier.transfer_dst_uniform_texel_buffer.1048576 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9391>	2021-03-10 17:10:45 +00:00
Danylo Piliaiev	8e6ed9948e	freedreno/a5xx: port handling of PIPE_BUFFER textures from a6xx Otherwise, we won't be able to use OPC_GETBUF to get their size. After this change we also could get rid of the hack for OPC_GETSIZE which scaled the size for texture buffers. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9391>	2021-03-10 17:10:44 +00:00
Danylo Piliaiev	d968995c67	turnip: fix SP_HS_WAVE_INPUT_SIZE value It appears that storage for varyings in a wave has an upper limit of wavesize * max_a831 where max_a831 is 64. Exceeding the limit seam to force gpu to reduce primitives processed per wave, at least calculations make sense with such interpretation. With blob SP_HS_WAVE_INPUT_SIZE never exceeds 64 and setting it to 65 in freedreno leads to a hang. Copied from the commit to freedreno `e5499ca2` Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8187>	2021-03-10 16:50:11 +00:00
Connor Abbott	7b7532b806	freedreno/computerator: Add branching example Mainly to be able to test label resolution without having to replace a shader. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9463>	2021-03-10 16:23:04 +00:00
Connor Abbott	19c7b6f9d6	ir3/parser: Add ability to specify branchstack This lets you test branching with computerator. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9463>	2021-03-10 16:23:04 +00:00
Connor Abbott	a820eb537c	ir3/parser: Support labels This fixes the assembly for many scenarios where you want to use shader replacement. Note: unfortunately this leaks the identifier string created while lexing, but I couldn't find a way to avoid leaking it except for bringing in ralloc or something (which would be way more complicated). The only other place doing something similar in mesa is the glsl parser, which is using ralloc (actually a linear context). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9463>	2021-03-10 16:23:04 +00:00
Connor Abbott	534658f79b	freedreno/computerator: Fix example assembly Use the new bindless cat6 syntax for a6xx. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9463>	2021-03-10 16:23:04 +00:00
Connor Abbott	cd772d5687	ir3/parser: Fix parsing of "0.0" in @const line Trying to specify a floating-point value in a @const line would result in it getting interpreted as a FLUT value and failing parsing. Fix this by making the various FLUT tokens include the surrounding parentheses. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9463>	2021-03-10 16:23:04 +00:00
Dave Airlie	8027a7ba8a	shader_info: convert textures_used to a bitset. For now keep it a bitset of 1 32-bit dword. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9456>	2021-03-10 06:16:09 +10:00
Danylo Piliaiev	1d70863c12	freedreno/hw: fix populating branch targets in isa_decode pre-pass pre-pass ran with branch_labels being false which made it no-op. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9476>	2021-03-09 18:17:48 +00:00
Jason Ekstrand	e20e85f01e	nir: Make nir_ssa_def_rewrite_uses_after take an SSA value This replaces the new_src parameter of nir_ssa_def_rewrite_uses_after() with an SSA def, and rewrites all the users as needed. Acked-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9383>	2021-03-08 16:59:55 +00:00
Jason Ekstrand	117668b811	nir: Make nir_ssa_def_rewrite_uses take an SSA value This commit replaces the new_src parameter of nir_ssa_def_rewrite_uses() with an SSA def, removes nir_ssa_def_rewrite_uses_ssa(), and rewrites all the users as needed. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@collabora.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9383>	2021-03-08 16:59:55 +00:00
Jason Ekstrand	13a0ee8a51	nir: Add and use a new nir_ssa_def_rewrite_uses_src helper This is currently an alias for nir_ssa_def_rewrite_uses but we move all the instances which used it to write a non-SSA source to the newly named helper. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@collabora.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9383>	2021-03-08 16:59:55 +00:00
Connor Abbott	ccd7986f59	freedreno/cffdec: Use rb trees for tracking buffers Gets rid of the arbitrary size limitation, and should make decoding faster with many buffers. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8838>	2021-03-08 15:18:47 +00:00
Danylo Piliaiev	7e25e5b56f	ir3: disallow moving memory writes over discard Writes to global memory should not be moved over discard, otherwise we could have unintended side-effects or lack of side-effects where they should be observed. Fixes tests: dEQP-VK.rasterization.frag_side_effects.color_at_beginning.kill dEQP-VK.rasterization.frag_side_effects.color_at_end.kill Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9365>	2021-03-04 11:40:58 +00:00
Juan A. Suarez Romero	7b3b8524ef	ci: Bump deqp to vk-gl-cts 1.2.5.2 Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9369>	2021-03-04 11:09:35 +00:00
Danylo Piliaiev	72a9f315db	ir3: make mark_kill_path exit early if instr is already seen Would bring down its complexity in pathological cases. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9386>	2021-03-04 10:52:06 +00:00
Danylo Piliaiev	9dbb678f5a	ir3: prevent duplication of instruction's dependencies Otherwise mark_kill_path() is happy to take exponential time to finish. It was possible to have such chains: ... stib.base0 imm[0.000000,0,0x0], ssa_233, ssa_234, false-deps:ssa_231, ssa_231 stib.base0 imm[0.000000,0,0x0], ssa_237, ssa_238, false-deps:ssa_235, ssa_235 stib.base0 imm[0.000000,0,0x0], ssa_241, ssa_242, false-deps:ssa_239, ssa_239 stib.base0 imm[0.000000,0,0x0], ssa_245, ssa_246, false-deps:ssa_243, ssa_243 stib.base0 imm[0.000000,0,0x0], ssa_249, ssa_250, false-deps:ssa_247, ssa_247 stib.base0 imm[0.000000,0,0x0], ssa_105, ssa_253, false-deps:ssa_251, ssa_251 stib.base0 imm[0.000000,0,0x0], ssa_109, ssa_256, false-deps:ssa_254, ssa_254 stib.base0 imm[0.000000,0,0x0], ssa_113, ssa_259, false-deps:ssa_257, ssa_257 stib.base0 imm[0.000000,0,0x0], ssa_117, ssa_262, false-deps:ssa_260, ssa_260 stib.base0 imm[0.000000,0,0x0], ssa_265, ssa_266, false-deps:ssa_263, ssa_263 stib.base0 imm[0.000000,0,0x0], ssa_269, ssa_270, false-deps:ssa_267, ssa_267 stib.base0 imm[0.000000,0,0x0], ssa_273, ssa_274, false-deps:ssa_271, ssa_271 ... Fixes tests: dEQP-VK.geometry.layered.cube_array.36_36_12.secondary_cmd_buffer_inherit_framebuffer dEQP-VK.geometry.layered.3d.64_64_8.secondary_cmd_buffer_inherit_framebuffer dEQP-VK.geometry.layered.cube_array.64_64_12.secondary_cmd_buffer_inherit_framebuffer Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9386>	2021-03-04 10:52:06 +00:00
Eric Anholt	a8423eb732	ci/turnip: Mark a flaky WSI test. This one has flaked many times at this point, and I've even seen it flake locally. No luck debugging it yet. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9397>	2021-03-03 23:03:48 +00:00
Rob Clark	1611693977	freedreno/ir3: Add comments about shader key/gen I had forgotton on which gens these where used on (which is important if you need to know which shader stages use these).. expand the comments a bit. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9394>	2021-03-03 22:09:22 +00:00
Eric Anholt	957132294f	ci/a5xx: Increase the gles3/31 coverage. Now that there's more time available in our budget per board, we can run all of gles31, and half of gles3, instead of 10%. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9314>	2021-03-03 21:05:39 +00:00
Eric Anholt	1087bf16af	ci/a3xx: Run all of GLES3 dEQP. We're not spending half our time booting any more, so run the other half. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9314>	2021-03-03 21:05:39 +00:00
Eric Anholt	bb82efa792	ci/a5xx: Run all of gles2 in one job. Now that we're not spending so much time on boot overhead, no need to parallelize. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9314>	2021-03-03 21:05:39 +00:00
Eric Anholt	bcdfee3bcd	ci/freedreno: Switch the fastboot boards to using nfsroot. This saves time in packing the rootfs, allows for larger rootfses, and avoids the need for webdav. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9314>	2021-03-03 21:05:39 +00:00
Danylo Piliaiev	4600dbc6cc	turnip: fix leak of tu_shader object during compute pipeline creation tu_shader should be freed after pipeline is successfully created. Fixes tests: dEQP-VK.api.object_management.alloc_callback_fail.compute_pipeline dEQP-VK.api.object_management.alloc_callback_fail_multiple.compute_pipeline Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9364>	2021-03-03 10:41:29 +00:00
Danylo Piliaiev	d06c1e4554	turnip/ir3: check for bindless IBOs in atomic dests fixup Otherwise destinations may remain unfixed because ir3_shader_nibo doesn't count bindless IBOs. Fixes tests: dEQP-VK.image.atomic_operations.*intermediate_values Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9309>	2021-02-26 21:13:04 +00:00
Rob Clark	a9618e7c42	util: Add accessor for util_cpu_caps In release builds, there should be no change, but in debug builds the assert will help us catch undefined behavior resulting from using util_cpu_caps before it is initialized. With fix for u_half_test for MSVC from Jesse Natalie squashed in. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9266>	2021-02-26 18:31:19 +00:00
Eric Anholt	f65a7a8aa3	freedreno/a5xx: Fix cube image load/stores. This is the same thing we do on a6xx for cubes. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9270>	2021-02-25 19:11:19 +00:00
Eric Anholt	c93fd1046a	freedreno: Use the mesa/st frontend lowering of GL_CLAMP. 350 lines of code for this stupid feature, and we weren't even doing it right for CS/GS/tess. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9267>	2021-02-25 00:38:11 +00:00
Eric Anholt	5fa27e6670	freedreno: Drop custom driver lowering of GL's color clamping. The mesa/st frontend can do it for us now that we don't need to worry about breaking precompiles. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8997>	2021-02-24 21:48:54 +00:00
Eric Anholt	3b9f6af1a9	freedreno: Drop custom driver lowering of two-sided color. The GL frontend can do it for us now, so just use their code instead of our own shader variants. In the past we had to do hide the GL shader variants in the driver to get precompiles from st, but no longer as of !8601. I tested with drawoverhead -test 6 (shader program change, n=30) and -test 1 (no statechanges, n=43) and saw no change in driver overhead. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8997>	2021-02-24 21:48:54 +00:00
Eric Anholt	de17b4aab5	freedreno: Remove uniform variables after finalizing NIR. mesa/st optimizes the uniform storage if you have the finalize hook in place, causing the uniforms declared to potentially not have storage in the ParameterValues list any more. If you leave your uniforms around in the NIR, then a later finalization after variant creation will re-add the uniforms to parameters, defeating the optimization and likely reallocating the uniform storage (causing use-after-free). So, we have to do this before we can start using variants in mesa/st. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8997>	2021-02-24 21:48:54 +00:00
Rob Clark	e5a64e34d8	freedreno/ir3: Drop foreach_bit() macro Now that there is a global one in util/bitscan.h Note this version had an extra assert which is not really suitable to a generic foreach_bit().. just move the assert to the two usages of the iterator macro. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9191>	2021-02-24 17:11:44 +00:00
Mike Blumenkrantz	77cba4b9f2	freedreno/vulkan: for_each_bit -> foreach_bit Reviewed-by: Rob Clark <robclark@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9191>	2021-02-24 17:11:44 +00:00
Juan A. Suarez Romero	e814e23f59	ci/piglit: allow parallel piglit jobs This allows to split a piglit job in several parallel jobs, to speed up the execution. Due piglit restrictions, this only works for single profiles. Otherwise an error will be shown in the runner. Also, a new gitlab job variable `PIGLIT_TESTS` is introduced that contains the excluded/included tests with `-x` or `-n`. The rest of the piglit options go to `PIGLIT_OPTIONS` (like `--timeout n`). v2 (Andres): - Replay profile is supported in parallel jobs. - Bail out inmediately if parallel jobs is tried with multiple profiles. - Use testlist only when doing parallel jobs. - Do not drop pass tests when filtering executed tests. - Get rid of PIGLIT_FRACTION. v4: - uncommit unrelated change (Andres). Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Andres Gomez <agomez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9022>	2021-02-24 09:41:33 +01:00
Eric Anholt	ad77170b85	ci: Move the dEQP and traces expectations to the per-driver CI dirs. This means less custom test-source-dep stuff for these drivers, though it means that touching the CI expects files will cause a bit more retesting: - broadcom drivers retest as a group (but Igalia requested that organization of CI files) - radv+radeonsi retest as a group - lvp+llvmpipe retest as a group Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9161>	2021-02-22 23:02:42 +00:00
Eric Anholt	419758abc8	ci/a5xx: Increase our dEQP GLES3 fraction by 4x. Now that we've got SMP, we can get a lot more of this test suite covered in our 10-minute job window. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9144>	2021-02-22 19:31:46 +00:00
Eric Anholt	fcc2ed6299	ci/bare-metal: Use an upstream kernel for db820c. On top of the last kernel tree I added a couple of DT changes for db820c from the qcom landing tree necessary for bringing up the GPU, and a fix to my OOB cleanups fro cheza. I also enabled the CPU clock driver for db820c so we can turn on SMP and not leave jobs stranded on a 19Mhz CPU or whatever. This causes us to need a bit of updating of our TF expectations since the order of jobs changes a bit. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9144>	2021-02-22 19:31:46 +00:00
Eric Anholt	8c539275d9	ci/freedreno: Remove stray BM_DTB definition. It's unused -- cheza uses an image with kernel+dtb glued together, and this var does nothing (which is good, given that it was pointing to db820c. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9144>	2021-02-22 19:31:46 +00:00
Dave Airlie	7b1568b7a3	tu: reset object base on recycled command buffers The loader_set_dispatch overwrites the magic with the dispatch pointer, however when cmd buffers get recycled, and the loader is in debug mode, it asserts that the magic isn't set anymore. When recycling command buffers, reset the object base. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9145>	2021-02-22 09:32:49 +10:00
Rob Clark	a983a87a5f	freedreno/ir3/print: Improve branch printing Handle the instruction suffix better, and don't try to print src regs in a generic way, since that doesn't really work out. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9142>	2021-02-19 22:56:56 +00:00
Rob Clark	03762a956e	freedreno/ir3/print: More sane ssa src/dst display Give src/dst a "ssa_%u" name generated from the instruction's unique serialno. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9142>	2021-02-19 22:56:56 +00:00
Danylo Piliaiev	0fa7ec1473	turnip,freedreno/a6xx: tell hw the size of shared mem used by CS Before, we only used 2k of shared memory. It was found that 5 lower bits of SP_CS_UNKNOWN_A9B1 do control the available size of shared memory for compute shaders, with AVAILABLE_SIZE = (SP_CS_UNKNOWN_A9B1_SHARED_SIZE + 1) * 1k up to 32k. And SP_CS_UNKNOWN_A9B1_SHARED_SIZE being zero enables all 32k of shared memory. Fixes tests: dEQP-VK.rasterization.line_continuity.line-strip dEQP-VK.memory_model.message_passing.core11.u32.coherent.fence_fence.atomicwrite.workgroup.payload_local.buffer.guard_nonlocal.workgroup.comp dEQP-VK.memory_model.message_passing.core11.u32.coherent.fence_fence.atomicwrite.workgroup.payload_nonlocal.workgroup.guard_local.buffer.comp dEQP-VK.memory_model.write_after_read.core11.u32.coherent.fence_fence.atomicwrite.workgroup.payload_local.image.guard_nonlocal.workgroup.comp Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9157>	2021-02-19 20:28:44 +02:00
Eric Anholt	dab845d457	ci: Move specific driver testing to separate files in separate dirs. The top-level gitlab-ci.yml is big and unwieldy when one wants to work on CI for a single driver. Move the drivers to separate include files for ease of finding all your driver's tests, and also to pave the way for work on a single driver's CI to not retest all other drivers. Reviewed-by: Andres Gomez <agomez@igalia.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9139>	2021-02-19 17:30:36 +00:00
Danylo Piliaiev	14a0004232	turnip: consider tile_max_h when calculating tiling config Otherwise we may get a tile height exceeding the maximum. Fixes tests: dEQP-VK.pipeline.render_to_image.core.2d.huge.height.r8g8b8a8_unorm dEQP-VK.pipeline.render_to_image.core.2d.huge.height.r8g8b8a8_unorm_d16_unorm dEQP-VK.pipeline.render_to_image.core.2d.huge.height.r8g8b8a8_unorm_s8_uint Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9159>	2021-02-19 15:24:30 +00:00
Danylo Piliaiev	b6b3b38434	turnip: consider HW limit on number of views when apply multipos opt Blob doesn't apply multipos optimization starting from 11 views even on a650, however in practice, with the limit of 16 views, tests pass on a640/a650 and fail on a630. Fixes tests: dEQP-VK.multiview.draw_indexed.max_multi_view_view_count dEQP-VK.multiview.input_attachments.max_multi_view_view_count dEQP-VK.multiview.masks.max_multi_view_view_count dEQP-VK.multiview.multisample.max_multi_view_view_count dEQP-VK.multiview.queries.max_multi_view_view_count dEQP-VK.multiview.renderpass2.index.fragment_shader.max_multi_view_view_count dEQP-VK.multiview.secondary_cmd_buffer.max_multi_view_view_count Fixes: `8d275778` ("tu: Enable multi-position output") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9135>	2021-02-19 09:16:00 +00:00
Jonathan Marek	ec54166a2b	freedreno/a6xx: set SP_PERFCTR_ENABLE in computerator Set this register to have properly working SP perfcntrs in computerator. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8423>	2021-02-19 04:04:03 +00:00
Jonathan Marek	46f64aa3be	freedreno/a6xx: update some registers Some sorting, adding unknown fields, documenting some fields, etc. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8423>	2021-02-19 04:04:03 +00:00
Jonathan Marek	b94c652afe	freedreno/a6xx: always use reg64 for address registers (no LO/HI) Reduce noise in a6xx.xml by removing LO/HI versions of address registers. Also fix type="address" registers in register packing (use bit size instead of checking for "waddress" to use qword) Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8423>	2021-02-19 04:04:02 +00:00
Jonathan Marek	b15d4484f8	freedreno/a6xx: update perfcntr registers (declare as arrays) Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8423>	2021-02-19 04:04:02 +00:00
Jonathan Marek	72f00fe72e	freedreno/registers: use macro instead of inline function for array regs This is to allow use in places where an inline function isn't allowed, such as a static initializer. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8423>	2021-02-19 04:04:02 +00:00
Connor Abbott	79921b81bc	freedreno/a6xx: Document threadsize-related fields We'll need to use if we want to start playing around with thread sizes. At least now we know what the actual threadsize is. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8423>	2021-02-19 04:04:02 +00:00
Samuel Iglesias Gonsálvez	8dd54778fa	turnip: VK_EXT_memory_budget implementation Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8524>	2021-02-17 08:07:33 +01:00
Samuel Iglesias Gonsálvez	4342dec09a	turnip: keep track of memory heap usage, size and flags Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8524>	2021-02-17 08:07:19 +01:00
Caio Marcelo de Oliveira Filho	e4e962cbe0	freedreno/ir3: Use gl_varying_slot_name_for_stage() Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8998>	2021-02-13 00:44:53 +00:00
Danylo Piliaiev	f0a76b2067	turnip: enable inheritedQueries Passes relevant CTS tests. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8824>	2021-02-10 12:38:44 +00:00
Jason Ekstrand	0260b4a7e7	vulkan: Add a common helper for enumerating instance extension properties Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8792>	2021-02-04 20:02:12 +00:00
Rob Clark	ff61e9b54d	freedreno/decode: Fix overflow CP_SET_DRAW_STATE state-groups count as a 4th level of IB. Fixes a crash seen on 32b/arm builds of crashdec. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8842>	2021-02-03 18:35:38 +00:00
Samuel Iglesias Gonsálvez	5723887676	turnip: fix resolve MSAA D32_SFLOAT_S8_UINT image to S8_UINT According to VK_KHR_depth_stencil_resolve spec (see VUID-VkSubpassDescriptionDepthStencilResolve-pDepthStencilResolveAttachment-03182): "If the VkFormat of pDepthStencilResolveAttachment has a stencil component, then the VkFormat of pDepthStencilAttachment must have a stencil component with the same number of bits and numerical type" The issue with D32_SFLOAT_S8_UINT format is that it is implemented as two planes, so we need to execute the separate_stencil path in tu_emit_blit() to resolve its stencil component into S8_UINT image. Fixes the following tests: dEQP-VK.renderpass2.depth_stencil_resolve.image_2d_32_32.samples_2.d32_sfloat_s8_uint.compatibility_depth_zero_stencil_zero_testing_stencil dEQP-VK.renderpass2.depth_stencil_resolve.image_2d_32_32.samples_2.d32_sfloat_s8_uint_separate_layouts.compatibility_depth_zero_stencil_zero_testing_stencil Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8527>	2021-02-03 16:51:02 +00:00
Samuel Iglesias Gonsálvez	09e9be3d8f	turnip: fix resolve MSAA D24_UNORM_S8_UINT image to S8_UINT According to VK_KHR_depth_stencil_resolve spec (see VUID-VkSubpassDescriptionDepthStencilResolve-pDepthStencilResolveAttachment-03182) "If the VkFormat of pDepthStencilResolveAttachment has a stencil component, then the VkFormat of pDepthStencilAttachment must have a stencil component with the same number of bits and numerical type" That means that we can resolve MSAA depth/stencil to a stencil only image only if the stencil component matches with same number of bits and type. Although the driver only supports VK_RESOLVE_MODE_SAMPLE_ZERO_BIT resolve mode, it was doing a sample average when resolving a MSAA D24_UNORM_S8_UINT image to S8_UINT. Fixes the following tests: dEQP-VK.renderpass2.depth_stencil_resolve.image_2d_32_32.samples_2.d24_unorm_s8_uint.compatibility_depth_zero_stencil_zero_testing_s tencil dEQP-VK.renderpass2.depth_stencil_resolve.image_2d_32_32.samples_2.d24_unorm_s8_uint_separate_layouts.compatibility_depth_zero_stenc il_zero_testing_stencil Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8527>	2021-02-03 16:51:02 +00:00
Samuel Iglesias Gonsálvez	5fc5d18aac	turnip: fix UINT64_MAX size wrapping in tu_GetBufferMemoryRequirements() tu_GetBufferMemoryRequirements() ends up wrapping the UINT64_MAX size to 0 when aligning. Fixes: dEQP-VK.api.buffer.basic.size_max_uint64 Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4493>	2021-02-03 16:01:41 +01:00
Samuel Iglesias Gonsálvez	ea42632ba7	turnip: set sparseAddressSpaceSize to zero According to Vulkan spec, "Table 46. Required Limits", as sparse binding is unsupported, we need to return unsupported limit for sparseAddressSpaceSize, which is zero. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4493>	2021-02-03 16:01:21 +01:00
Jonathan Marek	dd388b14c8	turnip: add missing register write to disable dithering This was causing rendering issues with low precision formats because GL driver can enable it. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8707>	2021-02-03 13:45:19 +00:00
Jonathan Marek	bdaa4d1ee0	turnip: don't always use 3d ops for blit_image Revert this accidentally committed testing change. Fixes: `872c4bcd27` ("turnip: implement z-scaling and z-mirroring BlitImage") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8707>	2021-02-03 13:45:19 +00:00
Jonathan Marek	b37bd5f89b	turnip: IMAGE_FILTER_{LINEAR,CUBIC}_BIT only for non-integer formats Avoid CTS trying to use linear filtering for integer formats. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8707>	2021-02-03 13:45:19 +00:00
Jonathan Marek	b4653c1033	turnip: use vk_format_is_int to disable COLOR_ATTACHMENT_BLEND_BIT This is simpler and easier to understand. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8707>	2021-02-03 13:45:19 +00:00
Jonathan Marek	de44e700b1	turnip: delete unused vk_format_parse.py file Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8707>	2021-02-03 13:45:19 +00:00
Jonathan Marek	596e82510d	turnip: fix logicOp Don't ignore logic op for integer formats. Blend also doesn't need this path, because it isn't valid for blendEnable to be true for integer formats. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8707>	2021-02-03 13:45:19 +00:00
Jason Ekstrand	f2545f22f4	vulkan: Drop the type_prefix parameter from gen_extensions Now that all the drivers are converted, it's set to 'vk' by everyone so there's no point in having the parameter. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8676>	2021-02-01 18:54:25 +00:00
Jason Ekstrand	bafd0c680d	vulkan: Rework vk_device_init and friends Now that all drivers are converted over, we can make a few changes. First off, vk_device_init no longer takes two separate allocators because we can assume that the parent instance is non-null and it can pull the instance allocator from that. Second, dispatch tables and the instance extension table are no longer optional. We leave the device extension table optional for now because we don't do any verification at vk_init_physical_device time and some drivers find it more convenient to set the extensions later in their own physical_device_init for various reasons. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8676>	2021-02-01 18:54:25 +00:00
Jason Ekstrand	394708b3cb	turnip: Switch to the common VK_EXT_debug_report Acked-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8676>	2021-02-01 18:54:24 +00:00
Jason Ekstrand	3a8060271c	turnip: Drop some legacy wrappers in favor of common code Acked-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8676>	2021-02-01 18:54:24 +00:00
Jason Ekstrand	0870cf4c06	turnip: Use common entrypoints for VK_EXT_private_data Acked-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8676>	2021-02-01 18:54:24 +00:00
Jason Ekstrand	59d70c47c7	turnip: Use the common dispatch framework Acked-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8676>	2021-02-01 18:54:24 +00:00
Jason Ekstrand	d360a996f9	vulkan: Add common instance and physical device structs Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8676>	2021-02-01 18:54:24 +00:00
Jason Ekstrand	5d6ac87d61	vulkan: Add a return code to vk_device_init Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8676>	2021-02-01 18:54:24 +00:00
Jason Ekstrand	8ee88948e3	vulkan: Move vk_device to its own file Things are going to start getting more complicated so let's avoid the single mega-file approach. Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8676>	2021-02-01 18:54:24 +00:00
Jason Ekstrand	ce0e5cd35b	turnip: Properly clean up vk_device Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8676>	2021-02-01 18:54:24 +00:00
Jason Ekstrand	8d6cf9e1c2	vulkan/meson: Add missing dependencise on vk_extensions_gen.py Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8676>	2021-02-01 18:54:23 +00:00
Connor Abbott	ae7a9d0585	ir3: Assume that nir_tex_instr::dest_type is sized Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7989>	2021-01-25 11:22:07 +01:00
Connor Abbott	23beffadea	freedreno/ir3: Handle sized tex destination types Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7989>	2021-01-25 11:21:42 +01:00
Danylo Piliaiev	fa74389485	turnip: don't emit tess consts if they are not used If tess consts aren't used they don't get included in constlen, and we risk overrunning consts of the next stage. Fixes: dEQP-VK.tessellation.invariance.outer_edge_index_independence.quads_fractional_even_spacing_ccw dEQP-VK.tessellation.invariance.outer_triangle_set.quads_fractional_odd_spacing dEQP-VK.tessellation.invariance.primitive_set.isolines_fractional_odd_spacing_ccw dEQP-VK.tessellation.invariance.primitive_set.quads_fractional_odd_spacing_cw Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4117 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8578>	2021-01-20 13:10:10 +00:00
Samuel Iglesias Gonsálvez	b50b28cd33	turnip: disable UBWC on Z24_S8 MSAA images on A630 Fixes GPU hangs in dEQP-VK.renderpass2.depth_stencil_resolve.* tests on A630. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8381>	2021-01-18 17:32:21 +01:00
Mauro Rossi	b53d404aa7	android: freedreno/ir3: Switch over to new encoder/decoder Fixes the following building error: FAILED: out/target/product/x86_64/obj/SHARED_LIBRARIES/gallium_dri_intermediates/LINKED/gallium_dri.so ... ld.lld: error: undefined symbol: isa_assemble >>> referenced by ir3_shader.c:151 (external/mesa/src/freedreno/ir3/ir3_shader.c:151) ... ld.lld: error: undefined symbol: isa_decode >>> referenced by ir3_shader.c:668 (external/mesa/src/freedreno/ir3/ir3_shader.c:668) Fixes: `5cae4779c` ("freedreno/ir3: Switch over to new encoder/decoder") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Acked-by: Rob Clark <robclark@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8538>	2021-01-17 21:57:05 +01:00
Mauro Rossi	7c0298e2fe	android: freedreno/hw/isa: Add description of ir3 ISA Necessary to build libir3decode and libir3encode for Android Fixes: `6d94f575d` ("freedreno/hw/isa: Add description of ir3 ISA") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Acked-by: Rob Clark <robclark@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8538>	2021-01-17 21:57:05 +01:00
Joel Linn	5939a64b15	freedreno/a2xx: add RB perfcounter 1-3 Xenos driver reads four perf counters in total. v2: fix register names Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7666>	2021-01-16 19:10:22 +00:00
Joel Linn	040ffee71f	freedreno/a2xx: fix/add RBBM perfcounter Xenos driver read two perf counters and their order is also different. v2: fix typo in register address Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7666>	2021-01-16 19:10:22 +00:00
Rob Clark	bfe5ac89b2	freedreno/isa: Fix branch/jump offset encoding When cross compiling with clang, `1ul` would end up 32b instead of 64b, resulting in 32b fields (like branch/jump offsets) being encoded as zero. Which results in infinite loops. Fixes: `e7630ec278` ("freedreno/hw: Add isaspec mechanism for documenting/defining an ISA") Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8528>	2021-01-15 17:36:30 +00:00
Danylo Piliaiev	5e2cee57c5	freedreno/ir3/parser: add cat7 support Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8420>	2021-01-15 10:08:38 +00:00
Danylo Piliaiev	39a2da738d	ir3: add debug option to override shader assembly IR3_SHADER_DEBUG=vs,tcs,tes... now also prints shader's sha1. When there is a file named %sha1%.asm in IR3_SHADER_OVERRIDE_PATH directory - ir3 assembly from file would be parsed, assembled, and will override the shader with corresponding sha1 hash. Parsing failure is considered unrecoverable error. Upon successful override shader's assembly is printed with: "Native code (overridden) for unnamed ..." This debug option allows easier testing of small changes in assembly without modifying the compiler or using computerator. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8388>	2021-01-14 21:51:16 +00:00
Danylo Piliaiev	cea4d85093	turnip: make GS use correct varyings size from previous stage Fixes: dEQP-VK.tessellation.invariance.primitive_set.triangles_fractional_even_spacing_ccw dEQP-VK.tessellation.invariance.outer_edge_division.triangles_fractional_even_spacing dEQP-VK.tessellation.invariance.outer_edge_symmetry.triangles_fractional_odd_spacing_cw dEQP-VK.tessellation.invariance.outer_edge_symmetry.quads_fractional_odd_spacing_ccw dEQP-VK.tessellation.invariance.outer_edge_symmetry.isolines_equal_spacing_cw dEQP-VK.tessellation.invariance.outer_edge_index_independence.triangles_equal_spacing_ccw dEQP-VK.tessellation.invariance.outer_edge_index_independence.triangles_fractional_even_spacing_cw dEQP-VK.tessellation.invariance.inner_triangle_set.triangles_equal_spacing Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8497>	2021-01-14 19:06:07 +00:00
Danylo Piliaiev	ad098553ee	turnip/ir3: handle image load/stores produced by AtomicLoad/Store SpvOpAtomicLoad and SpvOpAtomicStore are translated into nir_intrinsic_image_deref_store/load instead of some separate atomic intrinsics, however they don't have src or dest type specified. Turnip doesn't support shaderImageFloat32Atomics so type is just integer. Fixes: dEQP-VK.memory_model.message_passing.core11.u32.coherent.fence_fence.atomicwrite.device.payload_local.image.guard_local.image.frag dEQP-VK.memory_model.message_passing.core11.u32.coherent.fence_fence.atomicwrite.workgroup.payload_local.buffer.guard_local.image.comp dEQP-VK.memory_model.write_after_read.core11.u32.coherent.fence_fence.atomicwrite.device.payload_local.buffer.guard_local.image.comp dEQP-VK.memory_model.write_after_read.core11.u32.coherent.fence_fence.atomicwrite.workgroup.payload_local.image.guard_local.image.comp dEQP-VK.memory_model.write_after_read.core11.u32.coherent.fence_fence.atomicwrite.workgroup.payload_nonlocal.workgroup.guard_local.image.comp Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8476>	2021-01-14 05:43:56 +00:00
Rob Clark	74748f16c9	freedreno/ir3: Remove legacy packed-struct encoding Note that we can't actually remove the packed structs themselves yet, because tu still uses them in some hand-coded blit shaders. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7997>	2021-01-13 18:32:48 +00:00
Rob Clark	1a8113fdee	freedreno/ir3/decode: Switch over to new disasm Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7997>	2021-01-13 18:32:48 +00:00
Rob Clark	668943e9f7	freedreno/ir3: Realign disasm shader stats To better match up with what mesa shader-db stats look like, for easier comparision. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7997>	2021-01-13 18:32:47 +00:00
Rob Clark	3e15ba5ccc	freedreno/ir3: Better sstall estimation 1) Take into account repeat/nop cycles 2) Clear sfu_delay after an (ss) sync Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7997>	2021-01-13 18:32:47 +00:00
Rob Clark	11cba228fd	freedreno/ir3: Small resinfo disasm tweak Add the 'type' field. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7997>	2021-01-13 18:32:47 +00:00
Rob Clark	5cae4779c2	freedreno/ir3: Switch over to new encoder/decoder Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7997>	2021-01-13 18:32:47 +00:00
Rob Clark	01e8bd55de	freedreno/ir3/tests: Switch disasm test over to new decoder Also, uncomment the `stc` test vectors (since the new decoder decodes these properly) and comment out an instruction which looks suspiciously like -6.0 in hex. This also switches the parser back to `atomic.b.op` from `atomic.op.b` which was a short-term workaround to make it easier for the legacy disassembler. Also switch the binary encoding for ldib to clear b0, because the new disassembler warns about unexpected dontcare bits (which cases the disasm to not match). Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7997>	2021-01-13 18:32:47 +00:00
Rob Clark	704e49bae0	freedreno/hw/isa: Add expression caching Drops decoding an ~850KB collection of instructions from ~4min to ~1sec. Granted for normal sized shaders, this probably doesn't matter.. but it at reduces my cycle time for fixing things to match existing disasm syntax using this massive collection of unique instructions. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7997>	2021-01-13 18:32:47 +00:00
Rob Clark	6d94f575d2	freedreno/hw/isa: Add description of ir3 ISA Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7997>	2021-01-13 18:32:47 +00:00
Rob Clark	e7630ec278	freedreno/hw: Add isaspec mechanism for documenting/defining an ISA Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7997>	2021-01-13 18:32:47 +00:00
Rob Clark	6309c9313b	freedreno/ir3: Add some new "logical" opcodes Once we switch over to the xml based ir3 ISA definition, the opcodes will be decoupled from instruction encoding. Which will let us better handle cases where a single "opcode" (from instruction encoding stand- point) means different things on different generations. And also cases like the different variations of `b`ranch instructions, which share a single hw "opcode" plus a separate "brtype" field. When we start using these in ir3, we'd like to treat them as separate instructions and not have to care about the details of how they are encoded. For now, these are only used internally within the new xml generated instruction encoding, but once the existing "packed struct" encoding/ decoding is replace, we'll update ir3 to start using the new opcode enums directly (except for the `mov` variants). Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7997>	2021-01-13 18:32:47 +00:00
Rob Clark	cd31bface8	freedreno/ir3: Decouple ir3_info collection from assembler We'll want to re-use this when cutting over to the new XML based instruction encoding. So untangle it from instruction packing. Also, move handling of the appended constant data out of the assembler, since this isn't much related to instruction encoding. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7997>	2021-01-13 18:32:47 +00:00
Rob Clark	e1f8aaf9d2	freedreno/ir3: Fix ldg decoding/parsing Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7997>	2021-01-13 18:32:47 +00:00
Danylo Piliaiev	5331b1d945	turnip: implement indirect dispatch Vulkan guarantees only 4 byte alignment of offset for vkCmdDrawIndirect, while CP_LOAD_STATE.EXT_SRC_ADDR requires 16 byte alignment which makes us copy indirect parameters to a correctly aligned buffer. Blob does essentially the same but emits indirect CP_LOAD_STATE with src = SS6_UBO and EXT_SRC_ADDR = 0xe0000, and only for a first dispatch. Fixes: dEQP-VK.compute.indirect_dispatch.* Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8444>	2021-01-13 09:55:47 +00:00
Danylo Piliaiev	a6ae7b2421	turnip: remove unused IR3_DP_LOCAL_GROUP_SIZE_* from cs params In Turnip local group size is lowered in NIR via nir_lower_compute_system_values. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8444>	2021-01-13 09:55:47 +00:00
Daniel Schürmann	bd8e84eb8d	nir: replace .lower_sub with .has_fsub and .has_isub This allows a more fine-grained control about whether a backend supports one of these instructions. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6597>	2021-01-11 19:13:51 +00:00
Rhys Perry	f199b7188b	nir/load_store_vectorize: add data as callback args Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4202>	2021-01-07 16:34:53 +00:00
Rhys Perry	00c8bec47b	nir: add nir_load_store_vectorize_options Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4202>	2021-01-07 16:34:53 +00:00
Michel Dänzer	1de2fd0cf2	wsi/x11: Always link against xcb-xrandr The next commit will make use of it even without VK_USE_PLATFORM_XLIB_XRANDR_EXT. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8197>	2021-01-07 14:57:45 +01:00
Eric Anholt	3efbc47c83	freedreno: Mark a615/a618 as also lacking Z24_UINT_S8_UINT support. Rob says it's also the case on 618, and presumably 615 as well then, so make it take the same path as a630. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8319>	2021-01-06 22:54:14 +00:00
Eric Anholt	1c4613f5d4	turnip: Move the limited_z24s8 flag to the shared device info. I want to do the same logic in freedreno, so use the same flag. On suggestion by robclark, rename it to what it specifically means. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8319>	2021-01-06 22:54:14 +00:00
Vinson Lee	03999595e7	freedreno/afuc: Replace readfile with os_read_file. Tested afuc-disasm produced same output. $ ./builddir/src/freedreno/afuc/afuc-disasm -g 6 src/freedreno/.gitlab-ci/reference/afuc_test.fw > /tmp/afuc_test.asm $ diff ./src/freedreno/.gitlab-ci/reference/afuc_test.asm /tmp/afuc_test.asm $ echo $? 0 Suggested-by: Rob Clark <robdclark@chromium.org> Signed-off-by: Vinson Lee <vlee@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8290>	2021-01-06 18:12:34 +00:00
Rob Clark	32a6a13052	freedreno/ir3/parser: Fix pre-a6xx stib parsing Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:53 +00:00
Rob Clark	859c92d7ee	freedreno/ir3/parser: a6xx ldib/stib parsing Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:53 +00:00
Rob Clark	b7ea6ec178	freedreno/ir3: Fix pre-a6xx ldgb/stib parsing Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:53 +00:00
Rob Clark	050a449dbb	freedreno/ir3: Explicitly flag disasm test vectors that don't parse Mark the test cases which aren't supported by ir3_parser.y explicitly, so we notice future regressions. And likewise, fail when we see an unexpected pass, so we don't forget to update the test vectors in the future as ir3_parser improves. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:53 +00:00
Rob Clark	b073dae5f0	freedreno/ir3: Fix ldg decoding/parsing Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:53 +00:00
Rob Clark	a7e88787f6	freedreno/ir3/parser: Fixup stg parsing and add more tests The offset can also be a register, in which case we need to shuffle around the src order. Add a few more test vectors to cover each permutation (no offset, immed offset, gpr offset). Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	d6fa130dda	freedreno/ir3/parser: Add stgb support Note that this conflicts with `stc` on a6xx+, so a good test that the (new) disasm can handle both cases properly. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	eddfafae6a	freedreno/ir3/parser: Add ldgb support Gives us at least better coverage of pre-a6xx-bindless-ibo instructions. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	1746c4d211	freedreno/ir3/parser: Fix pre-a6xx resinfo Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	32539c1afc	freedreno/ir3/parser: Fix atomic support 1) Handle a6xx bindless form 2) Fix shared vs global encoding Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	c5479d1d8d	freedreno/ir3/parser: Add ldc support Note that this shows up a slight encoding difference compared to test vector extracted from blob deqp runs. We think these should be dontcare bits. For now, add a note and replace the encoded value in the disasm test. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	d7f141bb35	freedreno/ir3: Add cat5/cat6 nonuniform flag Not yet used by the compiler, but needed so we don't loose information between ir3 parser and instruction encoding. Currently ignored for cat5, because the uniform vs non-uniform default is swapped there. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	101bf686ee	freedreno/ir3: Disambiguate a6xx+ "bindless" instructions Add a `.b`.. for the atomic instructions it should be `atomic.b.op` but for now put the `.b` at the end to simplify life for the existing disasm Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	c55737902c	freedreno/ir3: Don't leak disk_cache Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	71f902bab9	freedreno/ir3: Add parsing and assembler testing In theory we should be able to round-trip from disasm->asm and get a bitwise match. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	b91319d952	freedreno/ir3: Tweak ldib/resinfo encoding The blob is using '0' for the low bit in these (except for ldib where it seems to randomly use either '0' or '1'). The upcoming xml based ISA spec maps this bit to 'dontcare' in the ldib case. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	99908c8d6d	freedreno/ir3/parser: Add initial cat6 IBO instructions Well, really just resinfo.. dealing with the different ldib/stib syntax for a6xx+ vs earlier seems a bit too painful to deal with. But resinfo at least gives us some encoding test coverage of this group of instrs. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	f9c76fba9d	freedreno/ir3/parser: Relative gpr/const can have modifiers too Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	594b004e00	freedreno/ir3/parser: Add missing (sat) modifier Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	77552cbdda	freedreno/ir3: Don't set bit for dest conversion for p0.c This appears to be ignored when writing to predicate registers (which I guess makes sense, since they are boolean). So no real harm in setting it, other than it makes some of the ir3_parser test vectors not match the expected result for encoding. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	1cdff35361	freedreno/ir3/parser: Fixup cat5 s2en instructions Currently ir3 (incl emit_cat5()) expects the samp/tex src register to be first.. which requires some fixup for the parser to match. TODO we might want to revisit the src reg order when adding new instr packing/encoding. For now, lets just make the parser match the rest of ir3. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	d35c79614e	freedreno/ir3/parser: Fix dsxpp/dsypp encoding Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	e9b3234915	freedreno/ir3/parser: Fix cat6 store encoding Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	b90a1cf747	freedreno/ir3: Cleanup cat6 load instructions There was some src2 vs src3 confusion, but since the syntax is like: ldl.f32 rDst, l[rBase+off], ncomp it makes more sense to call the offset src2 and ncomp src3, than the way we had it. This is also easier to deal with for the ir3 assembly parser. Also, src_offset was only ever used by the assembly parser, and was handled incorrectly in emit_cat6(), resulting that cat6 load instrs would not work properly in (for ex) computerator. Since we are cleaning things up, drop src_offset and make the asm parser work in the same way as the nir->ir3 frontend. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	4e272003b1	freedreno/ir3: Clean up instruction creation Convert everything remaining over to the version which takes # of register (src + dst) and drop the ir3_instr_create2() version. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	d968f46997	freedreno/ir3/parser: Handle half-immed Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	68be24dd6c	freedreno/ir3/parser: cat1 updates (mova1, movmsk) Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	647d7fc36d	freedreno/ir3/parser: cat1 instructions can write relative GPR Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	0b36044d4f	freedreno/ir3/parser: Add new cat0 instructions Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	2dc6458563	freedreno/ir3: Various cat0 updates Update the IR and packer to handle the additional cat0 fields, in prep for adding support in the assembler (in prep for adding round trip parsing/packing test coverage). We don't actually use these yet from the ir3 compiler, but at least this is one less thing to worry about when we start trying to use them. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	eec183c159	freedreno/ir3/parser: Reset lexer when input changes Otherwise, in case of parse errors, the lexer state can still contain buffered input from the previous parse. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	7b2d2bafe4	freedreno/ir3: Move assembler error handling Move out of ir3_parse_asm() so we can re-use it in disasm test for round-tripping asm/disasm. We don't want failures to be fatal (yet) as there are still some things missing from the assembler. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	a928d0ab46	freedreno/ir3: Add some more disasm test vectors Various things that I noticed which were initially wrong with the xml based disasm. These were extracted from a collection of unique instructions extracted from deqp traces, which unfortunately looses the link back to the original test case. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	2933d54992	freedreno/ir3: Fix mova1 disasm Yet another mnemonic for mov Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Rob Clark	e3bd9aaf6b	freedreno/ir3: Fix half-immed decoding issues For mov, half-float immeds are packed in 16b. In other cases, the syntax for a half-immed is a bit different (ie. `h(1)`) Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Connor Abbott	6f35ebd8a5	ir3: Support MOVMSK Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Connor Abbott	5d36f36454	ir3: Better rules for shared src copy propagation It turns out that the actual rule for when a source/dest can be shared is that it has to be cat1, cat2, or cat3. Allow this and silence warnings. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Connor Abbott	f9804673fb	ir3: Rename high registers to shared registers This more accurately reflects what they are. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8175>	2021-01-06 16:46:52 +00:00
Christian Gmeiner	32bd47f6fa	tu: use intrinsic builders Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8295>	2021-01-06 14:34:41 +00:00
Christian Gmeiner	d46a761e9e	ir3: use intrinsic builders Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8295>	2021-01-06 14:34:41 +00:00
Eric Anholt	7e1e227694	freedreno/ir3: Deduplicate link_stream_out. All 3 copies were the same other than style tweaks. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8336>	2021-01-05 18:23:37 +00:00
Danylo Piliaiev	122da9bd2d	freedreno/ir3: remap FRAG_RESULT_COLOR to _DATA* for dual-src blending gl_SecondaryFragColorEXT is mapped to FRAG_RESULT_COLOR and just have a different io.dual_source_blend_index. We don't need to replicate the color to other render targets in case of dual source blending, so we could just remap it to FRAG_RESULT_DATA0 + index. Fixes piglit test: arb_blend_func_extended-fbo-extended-blend-pattern_gles2 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8245>	2020-12-28 17:33:17 +00:00
Vinson Lee	7d8d99ea12	turnip: Remove unsigned nonnegative check. index is of type uint32_t. Fix defect reported by Coverity Scan. Macro compares unsigned to 0 (NO_EFFECT) unsigned_compare: This greater-than-or-equal-to-zero comparison of an unsigned value is always true. index >= 0U. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8231>	2020-12-24 23:08:56 +00:00
Hyunjun Ko	ec1464077b	turnip: use ir3_compiler_destroy instead of ralloc_free Fixes: `c0f22c3d94` "freedreno/ir3: add ir3_compiler_destroy()" Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6808>	2020-12-22 04:57:22 +00:00
Hyunjun Ko	19a7a915ca	turnip/kgsl: support VK_KHR_performance_query Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6808>	2020-12-22 04:57:22 +00:00
Hyunjun Ko	3d90909837	turnip: enable VK_KHR_performance_query with new debug flag Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6808>	2020-12-22 04:57:22 +00:00
Hyunjun Ko	c921a6e98d	turnip: support multipass for performance query. To support multipass, querying perf counters happens in several steps below. 0) There's a scratch reg to set pass indices for perf counters query. Prepare cmd streams to set each pass index to the reg at device creation time. See tu_CreateDevice in tu_device.c 1) Emit command streams to read all requested perf counters at all passes in begin/end query with CP_REG_TEST/CP_COND_REG_EXEC, which reads the scratch reg where pass index is set. 2) Pick the right cs setting proper pass index to the reg and prepend it to the command buffer at each submit time. 3) If the pass index in the reg is true, then executes the command stream below CP_COND_REG_EXEC. Would need to implement for kgsl in the future. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6808>	2020-12-22 04:57:22 +00:00
Hyunjun Ko	937dd76426	turnip: Implement VK_KHR_performance_query There are still some commands unimplemented yet. - vkGetPhysicalDeviceQueueFamilyPerformanceQueryPassesKHR: The following patch supports this. - vkAcquireProfilingLockKHR / vkReleaseProfilingLock This patch supports only monitoring perf counters for each submit. To reserve/configure counters across submits we would need a kernel interface to be able to do that. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6808>	2020-12-22 04:57:22 +00:00
Danylo Piliaiev	e5499ca2bf	freedreno/a6xx: Fix SP_HS_UNKNOWN_A831 value and document it It appears that storage for varyings in a wave has an upper limit of wavesize * max_a831 where max_a831 is 64. Exceeding the limit seam to force gpu to reduce primitives processed per wave, at least calculations make sense with such interpretation. With blob SP_HS_UNKNOWN_A831 never exceeds 64 and setting it to 65 in freedreno leads to a hang. On A630 tests (patch_size=3 + gl_Position + array of vec4) have shown such relation: \| Num of vec4 \| A831 \| PC_HS_INPUT_SIZE \| \|-------------\|------\|------------------\| \| 1 \| 0x10 \| 0xc \| \| 2 \| 0x14 \| 0xf \| \| 3 \| 0x18 \| 0x12 \| \| 4 \| 0x1c \| 0x15 \| \| 5 \| 0x20 \| 0x18 \| \| 6 \| 0x24 \| 0x1b \| \| 7 \| 0x28 \| 0x1e \| \| 8 \| 0x2c \| 0x21 \| \| 9 \| 0x30 \| 0x24 \| \| 10 \| 0x34 \| 0x27 \| \| 11 \| 0x38 \| 0x2a \| \| 12 \| 0x3c \| 0x2d \| \| 13 \| 0x3f \| 0x30 \| \| 14 \| 0x40 \| 0x33 \| \| 15 \| 0x3d \| 0x36 \| \| 16 \| 0x3d \| 0x39 \| \| 17 \| 0x40 \| 0x3c \| \| 18 \| 0x3f \| 0x3f \| \| 19 \| 0x3e \| 0x42 \| \| 20 \| 0x3d \| 0x45 \| \| 21 \| 0x3f \| 0x48 \| \| 22 \| 0x3d \| 0x4b \| \| 23 \| 0x40 \| 0x4e \| \| 24 \| 0x3d \| 0x51 \| \| 25 \| 0x3f \| 0x54 \| \| 26 \| 0x3c \| 0x57 \| \| 27 \| 0x3e \| 0x5a \| \| 28 \| 0x40 \| 0x5d \| \| 29 \| 0x3c \| 0x60 \| \| 30 \| 0x3e \| 0x63 \| \| 31 \| 0x40 \| 0x66 \| \|-------------\|------\|------------------\| Brief tests with high patch sizes also confirm that formula matches blob behaviour. A831 is not a limit for storage available for one thread, so naming it as SP_HS_WAVE_INPUT_SIZE would make more sense. Fixes: `47e2c195` "freedreno/a6xx: Program state for tessellation stages" Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7917>	2020-12-21 16:25:34 +02:00
Danylo Piliaiev	22180137e9	ir3: Allow tesselation to use all 32 varying slots POS, PSIZE, CLIP_DIST0, and CLIP_DIST1 have their own predefined indices, map's size should take this into account. Fixes: `9e063b01` "ir3: Switch tess lowering to use location" Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7917>	2020-12-21 16:06:20 +02:00
Samuel Iglesias Gonsálvez	84136d78e6	turnip: fix cube map array image size calculation imageSize() expects the last component of the return value to be the number of layers in the texture array. In the case of cube map array, it will return a ivec3, with the third component being the number of layer-faces. Fixes: dEQP-VK.image.image_size.cube_array.* Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8087>	2020-12-18 06:43:07 +01:00
Danylo Piliaiev	b34bc3db67	tu: pCounterBuffers can be NULL in vkCmd*TransformFeedbackEXT() According to the spec: "pCounterBuffers is an optional array of buffer handles [...] If pCounterBuffers is NULL, then transform feedback will start capturing vertex data to byte offset zero in all bound transform feedback buffers." "If counterBufferCount is not 0, and pCounterBuffers is not NULL, pCounterBuffers must be a valid pointer to an array [...]" So counterBufferCount could be non-zero with pCounterBuffers being NULL. Fixes crash in RenderDoc when inspecting draw call with tesselation or geometry shader present. Fixes: `98b0d900` "turnip: rework streamout state and add missing counter buffer read/writes" Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8140>	2020-12-17 16:33:33 +00:00
Danylo Piliaiev	6aec3c9a23	tu: Ignore pTessellationState if there is no tesselation shaders According to the spec: "pTessellationState [...] is ignored if the pipeline does not include a tessellation control shader stage and tessellation evaluation shader stage." Fixes crash in RenderDoc when inspecting draw call with geometry shader but without tesselation shaders. Fixes: `eefdca2e` "turnip: Parse tess state and support PATCH primtype" Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8140>	2020-12-17 16:33:33 +00:00
Michael Forney	434da21a7c	meson: add missing dependency on generated git_sha1.h Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8121>	2020-12-17 00:03:22 +00:00
Eric Anholt	f6665eb053	freedreno/ir3: Free the compiler at the end of the unit tests. Needed for meson test with asan enabled. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7936>	2020-12-15 19:39:29 +00:00
Samuel Iglesias Gonsálvez	e8bf15d107	turnip: pCounterBufferOffsets can be NULL on vkCmd*TransformFeedbackEXT() According to the spec for both vkCmd{Begin,End}TransformFeedbackEXT(), if pCounterBufferOffsets is NULL, then it is assumed the offsets are zero. Fixes crash on dEQP-VK.transform_feedback.simple.backward_dependency_no_offset_array Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8057>	2020-12-11 16:30:51 +00:00
Jonathan Marek	fa16e66a3f	turnip: always set LRZ registers to zero for 3d clear/blit Apparently LRZ will be read/written regardless of depth being enabled or not, so we have to make sure these registers are zero. Fixes: `1d83f5ae84` ("turnip: disable LRZ on vkCmdClearattachments() 3D fallback path") Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7899>	2020-12-08 13:26:16 -05:00
Jonathan Marek	f24358e002	turnip: move up LRZ invalidate in CmdClearAttachments There is an early return if cmd->state.predication_active is true, so do the LRZ invalidate before that. Fixes: `2f79e00664` ("turnip: disable LRZ on vkCmdClearAttachments()") Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7899>	2020-12-08 13:26:16 -05:00
Jonathan Marek	aed7c5aa31	turnip: do not emit draw states in draw_cs outside of renderpass This avoids a possible issue with MSAA sysmem clears, which use a 3D clear path which assumes draw states are disabled, and are emitted in draw_cs in BeginRenderPass. (checking for TU_CMD_DIRTY_DRAW_STATE also allows not emitting the draw states if they will be re-emitted on the next draw anyway. the previous patch makes it so TU_CMD_DIRTY_DRAW_STATE is always set outside of renderpasses) Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7899>	2020-12-08 13:26:11 -05:00
Jonathan Marek	3f58d80823	turnip: correctly disable draw states outside of renderpasses * do the disable in EndRenderPass2 to fix the missing disable for sysmem * we don't need a disable at the end of every tile, or between binning pass and gmem pass (the first draw in draw_cs emits all the draw states) Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7899>	2020-12-08 13:16:11 -05:00
Jonathan Marek	af6e74bca8	turnip: always emit LRZ draw state in DIRTY_DRAW_STATE path The packet size is constant and assumes all states, except for the 2 input attachment states. (this means we get an invalid packet if DIRTY_LRZ isn't set when DIRTY_DRAW_STATE is set). Fixes: `3c07a14998` ("turnip: enable LRZ") Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7899>	2020-12-08 13:16:11 -05:00
Jonathan Marek	2d886fb436	turnip: do not include compute stage in pipeline_builder This avoids emitting compute-related state in the graphics pipeline (tu6_emit_xs_config was being called for compute stage). Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7899>	2020-12-08 13:16:11 -05:00
Jonathan Marek	d7ea266e6f	turnip: no linear_to_srgb for alpha channel for gmem clear value packing Alpha channel is always linear (oops). Fixes: `ddac5933f8` ("turnip: call packing functions directly for pack_gmem_clear_value") Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7899>	2020-12-08 13:16:09 -05:00
Mauro Rossi	2c16c209b5	android: freedreno/ir3: use python3 in gen rules Completes freedreno gen rules migration to python3 as per meson.build With this change all freedreno gen rules use $(MESA_PYTHON3) Acked-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7942>	2020-12-07 13:10:32 +00:00
Jonathan Marek	872c4bcd27	turnip: implement z-scaling and z-mirroring BlitImage Z scaling case without nearest filter needs a 3D texture, so add a 3D texture path and use it to cover all scaling/mirroring cases. The "rotation" argument for the clear/blit "setup" function is replaced with a more generic "blit_param", which has a different meaning for the 3D blit path. (to avoid having too many arguments) Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7781>	2020-12-03 15:30:06 +00:00
Eric Anholt	06f2516696	freedreno/afuc: Fix up some sprintf format security warnings. Showed up when I tried enabling asan. Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7695>	2020-12-02 20:43:33 +00:00
Daniel Stone	9eee405484	freedreno: Add missing dependency to build computerator depends on ir3_parser.h, which is a generated file, but this dependency is not expressed in the build. Fixes: `1e8808a4a0` ("freedreno/ir3: refactor out helper to compile shader from asm") Signed-off-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7870>	2020-12-02 16:26:29 +00:00
Danylo Piliaiev	a569ffeb83	freedreno/a6xx: Fix typo in height alignment calculation in a6xx layout Fixes KHR-GL31.texture_size_promotion.functional Fixes: `e49748521e` Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7792>	2020-11-26 17:37:37 +00:00
Erik Faye-Lund	5461e21245	Revert "freedreno/ir3: Use get_once() for one-time init" This reverts commit `b4ad27a986`. Acked-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7760>	2020-11-25 09:44:11 +00:00
Rob Clark	b4ad27a986	freedreno/ir3: Use get_once() for one-time init Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7644>	2020-11-24 21:03:34 +00:00
Rob Clark	53f7d539cd	util: Add helgrind support for simple_mtx Annoyingly mtypes.h pulls in simple_mtx, which means we end up needing to sprinkle a lot of idep_mesautil around. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3773 Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7644>	2020-11-24 21:03:34 +00:00
Rob Clark	9de6a601ce	freedreno/drm: Quiet timedout error msg This isn't terribly interesting, but got more chatty when we converted to mesa_loge() vs debug_printf() Fixes: `156d7e45f7` ("freedreno: Convert to mesa_log*()") Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7717>	2020-11-23 16:04:52 +00:00
Connor Abbott	76ade57fa6	ir3/ra: Fix array reg liveness in scalar pass Assigning an array reg removes IR3_REG_ARRAY, which means that definitions and uses can't be tracked back to the array register's name and liveness for the components of the array aren't correctly calculated. To fix this we delay assigning array registers until the scalar pass. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7711>	2020-11-23 11:33:13 +00:00
Connor Abbott	bac6cc586f	ir3: Enable nir_lower_vars_to_scratch on a6xx Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7386>	2020-11-19 17:55:58 +01:00
Connor Abbott	4d44461dd5	tu: Support private memory Allocate enough space and then program the registers correctly. We currently allocate scratch memory as part of the pipeline, because the alternative of trying to share it across pipelines is a bit trickier due to the need for the configs to exactly match whenever we reuse the same buffer for different shaders. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7386>	2020-11-19 17:55:03 +01:00
Connor Abbott	b525934f26	freedreno: Add per-device parameters for private memory We have to allocate backing storage big enough to hold all the private memory for all threads that can possibly be in flight, which means that we have to start filling in some more model-specific information as the sizes will be different for models with different core counts/ALU counts. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7386>	2020-11-19 17:55:03 +01:00
Connor Abbott	ae109ca83c	ir3: Properly validate cat6 half-ness Apparently this is all that's required to get loads & stores to work with half registers. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7386>	2020-11-19 17:55:03 +01:00
Connor Abbott	4970aa5577	ir3: Initial support for private memory Add information that the driver will need to setup registers, and implement support for load_scratch/store_scratch using private memory. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7386>	2020-11-19 17:55:03 +01:00
Connor Abbott	32cb01a418	ir3/parser: Fix st{l,lw,g,p} and ld{l,lw,g,p} assembly It seems the src_offset and dst_offset are unused for these, and the offset is expected to be an immediate register. Also we forgot to add a dummy dst for the store instructions. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7386>	2020-11-19 17:55:03 +01:00
Connor Abbott	504142ff75	ir3: Fix STP/LDP assembly Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7386>	2020-11-19 17:55:03 +01:00
Connor Abbott	e7471ce776	ir3: Support assembling & disassembling getspid/getwid These aren't useful yet in the driver, but were useful for reverse-engineering how private memory works. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7386>	2020-11-19 17:55:03 +01:00
Connor Abbott	2cee8642ca	ir3: Add more a6xx-specific cat6 opcodes Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7386>	2020-11-19 17:55:03 +01:00
Connor Abbott	c82d7be193	ir3: Expand cat6 a6xx opcode field Turns out the low bit of pad3 is actually the high bit of the opcode. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7386>	2020-11-19 17:55:03 +01:00
Connor Abbott	92fe6fa0cc	freedreno/a6xx: Document private memory registers They seem to be broadly similar to the a3xx ones, albeit with some things shuffled around and with different units, and the extra layout mode bits. We also document the FIRST_EXEC_OFFSET registers, so that we can start properly setting them all to 0 in freedreno and turnip in later commits. I discovered the compute one when playing with function support in the blob CL driver, and added the other registers via analogy (the blob Vulkan driver sets FIRST_EXEC_OFFSET and the shader VA together in one packet for all stages, so it seems to really be in the same place for all stages). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7386>	2020-11-19 17:55:03 +01:00
Connor Abbott	3d5bed03e1	freedreno/ci: Strip location from asserts Let's not force everyone touching ir3.h to make random changes to the reference output. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7386>	2020-11-19 17:55:03 +01:00
Samuel Iglesias Gonsálvez	1200f6da0b	turnip: implement VK_KHR_depth_stencil_resolve support Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6884>	2020-11-19 09:43:11 +00:00
Eric Anholt	8ae38885d6	freedreno: Fix uninitialized var warning in afuc using unreachable(). Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7664>	2020-11-18 18:15:02 +00:00
Alejandro Piñeiro	c77409a87e	turnip: minor tu_queue fixes related to vk_base_object Include: * Missing call to tu_queue_finish * Use the proper free method for device->queues Fixes `5d3fdbc52b` Tested-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7663>	2020-11-18 00:58:29 +00:00
Eric Anholt	008872aa30	turnip: Assert about the storage buffer offset alignment. Giving us an unaligned pointer is invalid, and this helps switch a CTS bug from being a flake to a consistent crash. https://gitlab.khronos.org/Tracker/vk-gl-cts/-/issues/2661 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7662>	2020-11-18 00:38:02 +00:00
Vinson Lee	69cad1f96e	turnip: Close sync_fd only if it is a valid file descriptor. Fix defects reported by Coverity Scan. Argument cannot be negative (NEGATIVE_RETURNS) negative_returns: sync_fd is passed to a parameter that cannot be negative. Fixes: `cec0bc73e5` ("turnip: rework fences to use syncobjs") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7647>	2020-11-17 01:05:44 +00:00
Eric Anholt	1f44053301	freedreno+turnip: Upload large shader constants as a UBO. Right now if the shader indirects on some large constant array, we see NIR load_consts (usually from the const file) of its contents into general registers, then indirection on the GPRs. This often results in register allocation failures, as it's easy to go beyond the ~256 dwords of registers per invocation. By moving the large constants to a UBO, we can load an arbitrary number of them. They also can be theoretically moved to the constant reg file (~2k dwords), though you're unlikely to hit this path without an indirect load on your large constant, and we don't yet let UBO indirect loads get moved to constant regs. This possibly won't work out right if we have 16-bit load_constants, but without other MRs in flight we won't see 16-bit temps to be lowered to this. This allows 2 kerbal-space-program shaders to compile that previously would fail, and fixes the new dEQP-VK and -GLES2 tests I wrote that dynamically index a 40-element temporary array of float/vec2/vec3/vec4 with constant element initializers. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2789 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5810>	2020-11-16 13:55:41 -08:00
Eric Anholt	17db969f7a	freedreno/ir3: Fix incorrect optimization of usage of 16-bit constbuf vals. If you're loading a 32b word from the const file and doing a cov.u32u16 split to two 16bit values, we can't turn that into a reference of a 16-bit float value directly from the constbuf, because the CONSTANT_DEMOTION_ENABLE results in a f2f16 operation on the 32-bit value that we didn't want. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5810>	2020-11-16 13:54:22 -08:00
Eric Anholt	a9b37e5dad	freedreno/ir3: Include at least 4 NOPs so that cffdump doesn't disasm junk. cffdump looks at the following 4 instructions to decide if the shader has really ended, so if we pack data after that (such as turnip's next stage's shader), it might decode instructions that aren't really part of the shader. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5810>	2020-11-16 13:54:22 -08:00

... 5 6 7 8 9 ...

2409 Commits