KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Rob Clark	1d54fb5b2b	freedreno/ir3: update obsolete comment Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5458>	2020-06-18 02:46:28 +00:00
Rob Clark	5baf430261	freedreno/computerator: MERGEDREGS update Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5458>	2020-06-18 02:46:28 +00:00
Rob Clark	0e0d4daa5b	turnip: set .MERGEDREGS based on variant Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5458>	2020-06-18 02:46:28 +00:00
Rob Clark	1cc4cf141a	freedreno/ir3: make mergedregs a property of the variant Rather than assuming a6xx+ means mergedregs. We can actually (mostly?) do splitregs on a6xx as well. And GS/DS/HS currently require it, which might be papering over a bug, or might be something to do with how chaining shaders works. At any rate, we should at least be consistent, and not have the compiler thinking we are doing mergedregs when we are actually doing splitregs. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5458>	2020-06-18 02:46:28 +00:00
Rob Clark	c052087038	freedreno/ir3: re-work assembler API Just pass thru the variant, since it has everything we need. And will be needed in the next patch. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5458>	2020-06-18 02:46:28 +00:00
Rob Clark	ffe62e1b6c	freedreno/ir3: pass variant to postsched Prep for the next patch. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5458>	2020-06-18 02:46:28 +00:00
Rob Clark	38df3f899d	freedreno/ir3: decouple regset from gpu gen Allow different regset's to coexist, so we can make mergedregs vs split reg file a variant property. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5458>	2020-06-18 02:46:28 +00:00
Rob Clark	47decc88c2	freedreno/ir3: move mergedreg state out of reg It is only needed one place, let's move it there. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5458>	2020-06-18 02:46:28 +00:00
Rob Clark	46cdcf590b	freedreno/ir3: convert regmask_t to struct Prep to make merged/split register file mode a property of the regmask, rather than the ir3_register. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5458>	2020-06-18 02:46:28 +00:00
Jonathan Marek	d53dc6c376	freedreno/fdl6: rework layout code a bit (reduce linear align to 64 bytes) Reduce linear alignment, and rework the layout code a bit. This rework has a side effect of also increasing the alignment on linear levels of tiled (non-ubwc) cpp=1 and cpp=2 layouts. Since we should be UBWC for those cases anyway, its not a big loss. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5013>	2020-06-18 02:26:43 +00:00
Jonathan Marek	3a9ab3b6e9	freedreno/a6xx: FETCHSIZE is PITCHALIGN "FETCHSIZE" is actually a "minimum pitch" or "pitchalign" value that's relevant for mipmaps. The 0 value means 64-bytes. Understanding this allows some simplifications and will make it possible to use less alignment on linear formats. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5013>	2020-06-18 02:26:43 +00:00
Eric Engestrom	9ca2a4e6fc	freedreno: replace all dup() with os_dupfd_cloexec() Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5369>	2020-06-18 02:09:56 +00:00
Eric Engestrom	4a8085d67c	replace all F_DUPFD_CLOEXEC with os_dupfd_cloexec() All squashed into a single commit because it shouldn't have any behaviour change, except that it might work now on platforms where it was broken because F_DUPFD_CLOEXEC is not supported but FD_CLOEXEC is. Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5369>	2020-06-18 02:09:56 +00:00
Jonathan Marek	9f24909b0b	turnip: use u_format for packing gmem clear values Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5485>	2020-06-17 19:42:32 +00:00
Jonathan Marek	f745ceecee	turnip: use draw states for input attachments Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5446>	2020-06-17 15:32:30 +00:00
Jonathan Marek	159a1300ce	turnip: input attachment descriptor set rework Implement GMEM input attachments by using non-bindless texture state which is emitted at the start of every subpass. This achieves two things: * More vulkan-like CmdBindDescriptorSets * Fixing secondary command buffer input attachments with GMEM Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5446>	2020-06-17 15:32:30 +00:00
Jonathan Marek	233610f8cf	turnip: refactor draw states and dynamic states This reworks dynamic states to use draw states, and reworks draw states. This moves towards doing as little as possible in bind_draw_states. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5446>	2020-06-17 15:32:30 +00:00
Jonathan Marek	62a4db4c0f	turnip: delete dead dynamic state code Remove unused code, split this out to reduce the diff in the next patch. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5446>	2020-06-17 15:32:30 +00:00
Jonathan Marek	aab3398b33	turnip: improve dirty bit handling a bit This moves some logic out of bind_draw_states, moving towards the eventual goal of doing very little in bind_draw_states. Split this out as a separate patch to make the DIRTY_INPUT_ATTACHMENTS more visible: it can be safely removed because pipelines are subpass specific, so there will always be a pipeline change to go with the CmdBeginRenderPass and CmdNextSubpass (the CmdBindPipeline may not be in the subpass, but the draw that flushes the pipeline update will be). Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5446>	2020-06-17 15:32:30 +00:00
Jonathan Marek	edb8c581db	turnip: move descriptor set BO tracking to CmdBindDescriptorSets This avoids the duplicated code. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5446>	2020-06-17 15:32:30 +00:00
Jonathan Marek	5ef0f9f622	turnip: compute and graphics have completely separate state The comment about fragment shader state overwriting compute shader state is wrong, if either path is overwriting the other's state then it is a mistake. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5446>	2020-06-17 15:32:30 +00:00
Connor Abbott	a3464c567c	tu: Actually remove dead variables after io lowering I forgot that their derefs would still be lying around, so we need to eliminate them first. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5519>	2020-06-17 14:36:50 +00:00
Connor Abbott	168c42290f	ir3: Don't calculate num_samp ourselves In addition to duplicating what core NIR does better, this was wrong for Vulkan, where it should be 0 as there are no non-bindless samplers. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5519>	2020-06-17 14:36:50 +00:00
Connor Abbott	568e06b3a6	tu: Set num_components to 0 when building bindless intrinsics Fixes: `167fa288` (" nir/validate: validate intr->num_components") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5520>	2020-06-17 15:49:40 +02:00
Connor Abbott	6fcbce3b99	tu: Remove tu_shader_compile_options The only two fields were always true, and I don't think we'd ever have use for them. If we want to disable optimizations then we'd need a different approach, and I don't even know what include_binning_pass was for. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5500>	2020-06-17 13:13:05 +00:00
Connor Abbott	808992fc50	tu: Use the ir3 shader API This will be necessary once we start compiling multiple variants due to different const size limits, and it will also be necessary for properly implementing the pipeline cache. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5500>	2020-06-17 13:13:05 +00:00
Connor Abbott	b1700698a5	tu: Remove num_samp hack Delete the variables so that ir3 thinks there are no samplers and images instead. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5500>	2020-06-17 13:13:05 +00:00
Connor Abbott	6f2981176d	ir3: Pass reserved_user_consts to ir3_shader_from_nir() ir3_shader_from_nir() calls ir3_optimize_nir(), which currently sets up the const state. However, we need to know the number of user consts reserved by the driver before setting up the const state, which means that this information needs to be passed into ir3_shader_from_nir() somehow rather than being set in the shader. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5500>	2020-06-17 13:13:05 +00:00
Jonathan Marek	d37deebde5	turnip: fix cubic filtering with CmdBlitImage This fixes the newly added cubic blit_image tests for A650, by falling back to the 3D path and setting the filter correctly. Note: there are still failures with the texture filtering tests. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5509>	2020-06-17 08:50:42 +00:00
Jonathan Marek	198b13974a	turnip: fix 3D path always being used for CmdBlitImage This change accidentally made it into `72d7df40a5`, and started causing blit_image flakes (because of the issue fixed in the previous patch) Fixes: `72d7df40a5` ("turnip: add layered 3D path clear for CmdClearAttachments") Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5509>	2020-06-17 08:50:42 +00:00
Jonathan Marek	1622787ee4	turnip: set VFD_INDEX_OFFSET in 3D clear/blit path This was missing an causing flakes when used after a test that set it to a non-zero value. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5509>	2020-06-17 08:50:42 +00:00
Rob Clark	680ca5b393	freedreno/ir3: add post-scheduler cp pass A pass to eliminate extra mov's from an array. We need to do this after scheduling so we know that there are not any potentially conflicting array writes between the original `mov` and it's use(s). Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2124 Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5280>	2020-06-16 20:56:15 +00:00
Rob Clark	a60d48a863	freedreno/ir3/cp: extract valid_flags We'll also need this in the postsched-cp pass. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5280>	2020-06-16 20:56:15 +00:00
Rob Clark	5f1f8f7b17	freedreno/ir3: delay test support for vectorish instructions Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5280>	2020-06-16 20:56:15 +00:00
Rob Clark	92d6eb4dd5	freedreno/ir3: add helpers to move instructions A bit cleaner than open coding the list manipulation. Plus I want to use it in the next patch, rather than adding more open coded list futzing. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5280>	2020-06-16 20:56:15 +00:00
Rob Clark	9eed0c6011	freedreno/ir3/delay: calculate delay properly for (rptN)'d instructions When a sequence of same instruction is encoded with repeat flag, destination registers are written on successive cycles. Teach the delay calculation about this. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5280>	2020-06-16 20:56:15 +00:00
Rob Clark	c3b30963dd	freedreno/ir3: add test for delay slot calculation Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5280>	2020-06-16 20:56:15 +00:00
Rob Clark	a69d28769a	freedreno/ir3/print: print (r) flag Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5280>	2020-06-16 20:56:15 +00:00
Rob Clark	cd376a1434	freedreno/ir3/legalize: don't allow (nopN) if (rptN) These two encodings are mutually exclusive. If the instruction is a vector(ish) `(rptN)` instruction, then we can't fold a `(nopN)` post- delay into it. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5280>	2020-06-16 20:56:15 +00:00
Rob Clark	f35f711c71	freedreno/ir3/cp: properly handle already-folded RELATIV In the `try_swap_mad_two_srcs()` case, valid_flags() gets called both for the src that we want to try to fold, and for the other src that we are trying to swap to make that possible. It can happen in the 2nd case that a RELATIV src has already been folded. Since `ssa()` returns non- null in both the `IR3_REG_SSA` and `IR3_REG_ARRAY` cases (in the later case, it is the dependent array access that the current instruction cannot be moved ahead of), we need to explicitly check that the src reg we are looking at is still an SSA src. Reported-by: Jonathan Marek <jonathan@marek.ca> Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5280>	2020-06-16 20:56:15 +00:00
Rob Clark	1bee79996b	freedreno/ir3/validate: also check instr->address Verify that instructions which have a relative src and/or dest, have `instr->address`. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5280>	2020-06-16 20:56:15 +00:00
Rob Clark	f598786775	freedreno/sched: reset delay counters at start of block Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5280>	2020-06-16 20:56:15 +00:00
Rob Clark	28a14787c0	freedreno/ir3: don't rely on intr->num_components It is better to use `nir_intrinsic_dest_components()` which also handles the case of intrinsics with a fixed number of dest components. Somehow this starts showing up with a nir_serialize round-trip with shader-cache. But we really shouldn't have been relying on `intr->num_components` directly. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5371>	2020-06-16 02:48:18 +00:00
Jonathan Marek	c1e1b13bfe	turnip: simplify stage2 helpers Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5455>	2020-06-15 15:35:13 -04:00
Jonathan Marek	067370fe87	turnip: remove duplicated stage2opcode and stage2shaderdb Reduce 3 copies of this same logic into a single one. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5455>	2020-06-15 15:34:52 -04:00
Rob Clark	b5c810d68b	turnip: drop linking libfreedreno_drm Now that it is no longer required. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5476>	2020-06-15 15:46:37 +00:00
Rob Clark	1a33faea8c	freedreno/ir3: move the libdrm dependency out of shared code The only reason for this dependency was the fd_bo used for the uploaded shader. But this isn't used by turnip. Now that we've unified the cleanup path from gallium, it isn't hard to pull the fd_bo upload/free parts into ir3_gallium. This cleanup has the added benefit that the shader disk-cache will not have to deal with it. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5476>	2020-06-15 15:46:37 +00:00
Jonathan Marek	1d9e6e456a	freedreno/ir3: fix ir3_nir_move_varying_inputs ir3_nir_move_varying_inputs is broken when there a load input outside of the first block which depends on the result of a previous load input. This simplification/rework avoids the problem, and should also be faster. Fixes this dEQP-VK test: dEQP-VK.pipeline.multisample_interpolation.offset_interpolate_at_pixel_center.128_128_1.samples_2 Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5465>	2020-06-14 17:53:47 +00:00
Jonathan Marek	8c152a5e2a	turnip: remove some dead/redundant code A bit of cleanup to reduce noise in the codebase. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5447>	2020-06-13 00:11:47 +00:00
Jonathan Marek	c93753e618	turnip: add emit renderpass cache flushes for sysmem 3D CmdClearAttachments This clear path behaves like a draw, it needs the same flush as tu_draw. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5426>	2020-06-12 15:09:07 +00:00
Jonathan Marek	72d7df40a5	turnip: add layered 3D path clear for CmdClearAttachments This fixes cases where the 3D path is used with layered rendering. Fixes dEQP-VK.renderpass.suballocation.multisample_resolve.layers* failures Note the blob's 3D fallback path behaves differently, and uses the framebuffer information to clear each layer individually (changing the MRT state each time). But that's not possible in all cases, and the blob fails to clear properly in dEQP-VK.geometry.layered.*.secondary_cmd_buffer cases. So this clear path is not based on the blob's behavior. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5426>	2020-06-12 15:09:07 +00:00
Jonathan Marek	093c413722	turnip: share code between 3D blit/clear path and tu_pipeline Instead of filling out registers manually, fill out ir3 structs and re-use code from tu_pipeline. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5426>	2020-06-12 15:09:06 +00:00
Jonathan Marek	13525a9c70	turnip: pipeline program state refactor This refactor simplifies things a bit, and will make it easier to share some logic with tu_clear_blit (see next patches). This changes the order in which some things are emitted, and emits less for disabled shader stages. There's also as extra write to SP_GS_PRIM_SIZE that is removed. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5426>	2020-06-12 15:09:06 +00:00
Rob Clark	ee29c682fe	freedreno/ir3: limit pre-fetched tex dest Teach RA to setup additional interference to prevent textures fetched before the FS starts from ending up in a register that is too high to encode. Fixes mis-rendering in multiple playcanv.as webgl apps. Note that the regression was not actually 733bee57eb8's fault, but that was the commit that exposed the problem. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3108 Fixes: `733bee57eb` ("glsl: lower samplers with highp coordinates correctly") Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5431>	2020-06-11 21:59:54 +00:00
Rob Clark	f80092dad2	freedreno/ir3: remove RA "q-values" optimization This is mainly the "piglit optimization" (ie, since piglit launches an separate process for for each test). It was never wired up for a6xx, and makes register class setup unnecessarily complicated. Remove it to simplify the next patch. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5431>	2020-06-11 21:59:54 +00:00
Rob Clark	562aaea07c	freedreno/ir3: respect tex prefetch limits Refactor a bit the limit checking in the bindless case, and add tex/samp limit checking for the non-bindless case, to ensure we do not try to prefetch textures which cannot be encoded in the # of bits available. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5431>	2020-06-11 21:59:54 +00:00
Rob Clark	4cabc25fa4	freedreno/ir3: add debug code to print conflicting half-regs I keep re-typing this from time to time when debugging various things. Which is dumb. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5431>	2020-06-11 21:59:54 +00:00
Eric Anholt	3e11f04d4e	turnip: Expose robustBufferAccess. It is a required device feature, and all enabled tests in dEQP-VK.robustness.* pass. Reviewed-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5266>	2020-06-09 18:28:18 +00:00
Eric Anholt	58dd904c59	turnip: Fix crashes in compute with no descriptors to load. Found when trying to rebase cheza VK CI on top of this change. Fixes: `334204823e` ("tu: Fix context faults loading unused descriptor sets") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5266>	2020-06-09 18:28:17 +00:00
Connor Abbott	334204823e	tu: Fix context faults loading unused descriptor sets The app is allowed to never bind descriptor sets that are statically unused by the pipeline, which would've caused a context fault since CP_LOAD_STATE6 would try to load the descriptors that don't exist. Fix this by not preloading descriptors from unused descriptor sets. We could do more fine-grained accounting of which descriptors are used, but this is enough to fix the problem. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5400>	2020-06-09 15:35:29 +00:00
Connor Abbott	487aa807bd	tu: Rewrite flushing to use barriers Replace the various ad-hoc flushes that we've inserted, copied from freedreno, etc. with a unified system that uses the user-supplied information via vkCmdPipelineBarrier() and subpass dependencies. There are a few notable differences in behavior: - We now move setting RB_CCU_CNTL up a little in the gmem case, but hopefully that won't matter too much. This matches what the Vulkan blob does. - We properly implement delayed setting of events, completing our implementaton of events. - Finally, of course, we should be a lot less flush-happy. We won't emit useless CCU/cache flushes with multiple copies, renderpasses, etc. that don't depend on each other, and also won't flush/invalidate the cache around renderpasses unless we actually need to. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4964>	2020-06-09 14:40:52 +00:00
Connor Abbott	29abf49886	tu: Remove useless event_write helpers tu6_emit_cache_flush() was wrongly named, and with the removal of the last parameter both are useless. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4964>	2020-06-09 14:40:52 +00:00
Connor Abbott	f4f6a9be9f	tu: Don't actually track seqno's for events We just dropped the last user which actually cared about the seqno. This never worked anyway, since the seqno was never reset between multiple executions of the same command buffer. Turn the part of the control buffer which used to track the seqno into a dummy dword, and figure out automatically whether we need to include it. We will implement seqnos again eventually, with timline semaphores, but that will likely be totally different. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4964>	2020-06-09 14:40:52 +00:00
Connor Abbott	dfb176a0ac	tu: Remove useless post-binning flushes The Vulkan blob doesn't do this, and based on my understanding of how the blob works this is unnecessary. CACHE_FLUSH is already serialized against all 3d commands so you don't need to wait for rendering commands to finish before issuing it, and the subsequent wfi + WAIT_FOR_ME will cause the CP to wait for the CACHE_FLUSH to finish, so there's also no need to wait for it to complete. The CACHE_INVALIDATE also seems unnecessary, and also isn't done by the blob. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4964>	2020-06-09 14:40:51 +00:00
Jonathan Marek	7b4f0eadc1	turnip: fix VFD_CONTROL for binning pass Fixes some cases with TU_DEBUG=forcebin, specifically the failures in: dEQP-VK.glsl.*_vertex Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5370>	2020-06-08 16:28:48 -04:00
Jonathan Marek	ab72c07aef	turnip: use common emit_xs_cntl to fill a6xx_sp_xs_ctrl_reg0 Note this changes the value of SP_GS_CTRL_REG0, by using FOUR_QUADS and setting MERGEDREGS. ir3 expects MERGEDREGS, and using FOUR_QUADS instead of TWO_QUADS doesn't seem to hurt. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5370>	2020-06-08 16:28:43 -04:00
Jonathan Marek	e16608e233	turnip: fix HW binning with geometry shader Fixes failures with TU_DEBUG=forcebin and geometry shaders, for example: dEQP-VK.binding_model.geometry dEQP-VK.transform_feedback.simple.query* Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5370>	2020-06-08 16:23:05 -04:00
Jonathan Marek	6ac4d778fa	turnip: correctly emit non-binning vs in transform feedback case The offset given to tu6_emit_shader_object was wrong, binning_vs_offset should only be used when using the binning pass vs. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5370>	2020-06-08 16:23:05 -04:00
Jonathan Marek	88d5917cc1	turnip: clamp sampler minLod/maxLod Otherwise A6XX_TEX_SAMP_1_{MIN,MAX}_LOD silently overflows. This fixes these tests: dEQP-VK.texture.explicit_lod.2d.derivatives.* Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5375>	2020-06-08 19:52:41 +00:00
Jonathan Marek	fecd83a0e8	turnip: update some properties based on blob driver subTexelPrecisionBits/mipmapPrecisionBits change fixes some failures in: dEQP-VK.texture.explicit_lod.* Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5375>	2020-06-08 19:52:41 +00:00
Jonathan Marek	8c26c9eed8	turnip: move HLSQ_UPDATE_CNTL write to before xs config writes This matches the blob and gallium driver more closely, and fixes a rendering issue observed on a650. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5376>	2020-06-08 18:08:23 +00:00
Vinson Lee	faa339e666	Switch from cElementTree to ElementTree. The xml.etree.cElementTree module will be removed in Python 3.9. Since Python 3.3 the xml.etree.cElementTree module has been deprecated, the xml.etree.ElementTree module uses a fast implementation whenever available. Builds using Python 2.7 can still work but with the slower implementation. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Acked-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5349>	2020-06-05 23:42:54 -07:00
Eric Anholt	0bacb280a8	freedreno/ir3: Handle cases where we decide not to lower UBO 0 loads. We advertize 4096 vec4s of GL uniform storage, but the HW can only store 512 vec4s in the const buffer. Closes: #3049 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5273>	2020-06-05 13:43:30 -07:00
Eric Anholt	e349f50279	freedreno/ir3: Drop the max_const on a6xx to 512. The GLES blob on the p3a limits constlen to 512 between VS and FS across a6xx gpu ids (615, 630, 640, and 650). Experimentally, exceeding that limit in any one stage results in rendering corruption or GPU hangs (though my most detailed testing had a loop limit in a uniform, so that may the cause of the hang). Clamp the limit we use inside of a shader so we don't exceed it within a stage. This commit doesn't resovle limiting inter-stage. Experimentally, I've found that I can push up to a total of ~768 vec4s between VS and FS on a630, with or without uniform updates between each draw. We'll need to do some shader key-based limiting of constlen at draw time to respect that limit, but that's left for future work, and this commit is enough for the google earth case that initiated this work. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5273>	2020-06-05 13:36:29 -07:00
Eric Anholt	486b894307	freedreno/ir3: Account for driver params in UBO max const upload. The const state setup needs to be able to push its driver params, so account for them in the analyze_ubo_ranges. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5273>	2020-06-05 13:36:29 -07:00
Eric Anholt	a25347ab92	freedreno/ir3: Stop shifting UBO 1 down to be UBO 0. It turns out the GL uniforms file is larger than the hardware constant file, so we need to limit how many UBOs we lower to constbuf loads. To do actual UBO loads, we'll need to be able to upload UBO 0's pointer or descriptor. No difference on nohw 1 UBO update drawoverhead case (n=35). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5273>	2020-06-05 13:36:29 -07:00
Eric Anholt	9e58ab09ff	freedreno/ir3: Drop unnecessary alignment of pushed UBO size. The analysis pass gives us vec4-aligned size, and all of our other constbuf allocations here are in vec4 units, so we can just divide by 16. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5273>	2020-06-05 13:36:29 -07:00
Eric Anholt	07ec745014	freedreno/ir3: Stop pushing immediates once we've filled the constbuf. If we filled the constbuf up with UBOs, we may need to avoid generating more immediate push constants. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5273>	2020-06-05 13:36:29 -07:00
Eric Anholt	ab29f2da42	freedreno/ir3: Refactor ir3_cp's lower_immed(). There was duplicated handling in the callers that we can just move inside. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5273>	2020-06-05 13:36:29 -07:00
Eric Anholt	ac1ab9294a	turnip: Add support for alphaToOne. Comparing a blob trace using the feature to one not, the difference was pretty obvious and in the spot you'd expect compared to alphaToCoverage. The SP_ reg didn't have a corresponding bit set, though it also has an alphaToCoverage. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5343>	2020-06-05 18:11:22 +00:00
Eric Anholt	79f3003445	turnip: Use tu_cs_emit_regs() for BLEND_CONTROL. Just a cleanup since I was in the area. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5343>	2020-06-05 18:11:22 +00:00
Rob Clark	ef5b8bbc5e	freedreno/computerator: fix missing dependency on generated header Fixes: ``` ../mesa-freedreno-20.2.0_pre/src/freedreno/computerator/ir3_asm.c:25:10: fatal error: 'ir3/ir3_parser.h' file not found #include "ir3/ir3_parser.h" ^~~~~~~~~~~~~~~~~~ 1 error generated. ``` Fixes: `da467817e3` ("freedreno/ir3: Move ir3 assembler to backend compiler") Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5340>	2020-06-05 09:48:47 +00:00
Eric Anholt	ec98cff6a9	turnip: Simplify vertex buffer bindings. We were remapping the bindings so the HW binding points were consecutive, which there's no need for. Now that we don't shuffle, we can mostly drop the dependency on the pipeline for this SDS. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5321>	2020-06-04 19:42:54 +00:00
Eric Anholt	5c9728d960	turnip: Don't bother clamping VB size. From the VK spec: "All elements of pOffsets must be less than the size of the corresponding element in pBuffers" Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5321>	2020-06-04 19:42:54 +00:00
Eric Anholt	52942f18c6	turnip: Move vertex buffer bindings to SET_DRAW_STATE. This means that the HW can skip over the vertex buffer state when it's not used in a bin. The blob also has this behavior. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5321>	2020-06-04 19:42:54 +00:00
Rob Clark	27e501bcfc	freedreno/a6xx: update depth-plane control regs And document the early-lrz-late-z mode. Initially I thought this would be two bits to control early-lrz vs early-z. But having early-z without early-lrz does not make sense, and the way the values line up makes an enum fit better. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5298>	2020-06-04 02:34:54 +00:00
Rob Clark	f6307426ed	freedreno/a6xx: sync registers from envytools Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5298>	2020-06-04 02:34:54 +00:00
Rob Clark	ebcf3545db	freedreno/ir3: split kill from no_earlyz Unlike other conditions which prevent early-discard of fragments, kill does not prevent early LRZ test. Split `has_kill` from `no_earlyz` so we can take advantage of this. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5298>	2020-06-04 02:34:54 +00:00
Timothy Arceri	04dbf709ed	nir: add callback to nir_remove_dead_variables() This allows us to do API specific checks before removing variable without filling nir_remove_dead_variables() with API specific code. In the following patches we will use this to support the removal of dead uniforms in GLSL. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4797>	2020-06-03 02:22:23 +00:00
Jonathan Marek	a2903dd767	turnip: fix RENDER_COMPONENTS value This fixes render_components being 0 when mrt_count=8, because shift by 32 is UB and in arm64 it ends up shifting by 0. This fixes tests with 8 MRTs. Fixes the 3d path sysmem CmdClearAttachments to set RENDER_COMPONENTS, as it was previously relying on tu6_emit_mrt setting it, but it is now part of the pipeline state. Also switch back to the previous behavior of not setting render components for VK_ATTACHMENT_UNUSED attachments: we don't update the MRT state for such attachments so we definitely don't want to be trying writing to those. Fixes: `078aa9df8d` ("tu: Move RENDER_COMPONENTS setting to pipeline state") Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5292>	2020-06-02 18:42:09 +00:00
Dylan Baker	a8e2d79e02	meson: use gnu_symbol_visibility argument This uses a meson builtin to handle -fvisibility=hidden. This is nice because we don't need to track which languages are used, if C++ is suddenly added meson just does the right thing. Acked-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4740>	2020-06-01 18:59:18 +00:00
Rob Clark	6f39126200	freedreno/a6xx: document LRZ flag buffer Doesn't seem to be a big win, although I could still be missing something in my implementation. But might as well add the documentation. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5217>	2020-05-29 00:38:28 +00:00
Jonathan Marek	ddfd2e626a	turnip: support VkImageDrmFormatModifierExplicitCreateInfoEXT This will be used to import images which have different layout from what turnip uses by default. For example non-UBWC (linear) images from the video decoder on some hardware have a 512 pitch alignment. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4596>	2020-05-27 04:02:58 +00:00
Jonathan Marek	da409fb7b8	freedreno/layout: add explicit offset/pitch argument to fdl6_layout fdl6_layout will return false when the explicit pitch is not valid. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4596>	2020-05-27 04:02:58 +00:00
Kristian H. Kristensen	f4e64e9f53	freedreno/ir3: Avoid {0} initializer for struct reginfo First element is not a scalar. Just initialize the struct like we do elsewhere. src/freedreno/ir3/disasm-a3xx.c:958:33: warning: suggest braces around initialization of subobject [-Wmissing-braces] Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5174>	2020-05-26 12:46:18 -07:00
Kristian H. Kristensen	06ab93d694	turnip: Use {} initializer to silence warning We're already using the {} syntax elsewhere in turnip. src/freedreno/vulkan/tu_formats.c:828:71: warning: suggest braces around initialization of subobject [-Wmissing-braces] Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5174>	2020-05-26 12:46:18 -07:00
Kristian H. Kristensen	697fe1c801	turnip: Use tu6_reduction_mode() to avoid warning This makes it a little more explicit that the values line up. src/freedreno/vulkan/tu_device.c:2209:75: warning: implicit conversion from enumeration type 'const VkSamplerReductionMode' (aka 'const enum VkSamplerReductionMode') to different enumeration type 'enum a6xx_reduction_mode' [-Wenum-conversion] Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5174>	2020-05-26 12:46:18 -07:00
Kristian H. Kristensen	fff17707ea	turnip: Use hw enum when emitting A6XX_RB_STENCIL_CONTROL We're hard-coding this value, so let's use the hw enum and avoid a warning. src/freedreno/vulkan/tu_clear_blit.c:2091:19: warning: implicit conversion from enumeration type 'enum VkStencilOp' to different enumeration type 'enum adreno_stencil_op' [-Wenum-conversion] Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5174>	2020-05-26 12:46:18 -07:00
Kristian H. Kristensen	e369b8931c	freedreno: Use explicit *_NONE enum for undefined formats This adds RB, VFMT and TFMT NONE values for a3xx-a5xx and FMT6_NONE for a6xx. Use those values instead of open coded (enum xxx) ~0 or sometimes even ~0, which triggers out-of-enum range warnings. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5173>	2020-05-26 18:35:03 +00:00
Eric Anholt	5ec3747fbe	freedreno/ir3: Use RESINFO for a6xx image size queries. The closed GL driver uses resinfo on images with the writeonly flag (using the texture-path's getsize only for readonly images). The closed vulkan driver seems to use resinfo regardless. Using resinfo doesn't need any fixups after the instruction. It also avoids one of the needs for the TEX_CONST state for the image, which is awkward to set up in the GL driver. The new handler goes into ir3_a6xx to be next to the other current image code, but the a4xx version is left in place because it wants a bunch of sampler helpers. Fixes assertion failure in dEQP-VK.image.image_size.buffer.readonly_32. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3501>	2020-05-26 18:17:46 +00:00

1 2 3 4 5 ...

1317 Commits