KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Rob Clark	18bc5a81a7	freedreno: deduplicate a2xx disasm Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6070>	2020-07-28 09:45:08 +00:00
Rob Clark	f39afda1a7	freedreno: move a2xx disasm out of gallium So that it can be reused by the decode tools. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6070>	2020-07-28 09:45:08 +00:00
Rob Clark	f7bd3456d7	freedreno: deduplicate a3xx+ disasm Merge the extra tracking that is useful for generating stats from asm (as opposed to ir), and for guestimating things like inputs and outputs (mostly useful for r/e) into ir3's version and drop cffdec's version. There is a small change in disasm output for the decode tools, in that it no longer prints the used consts, but rather just the max accessed const. This is the more useful piece of information, and avoids making the shared regmask type big enough to deal with the const reg file. Additional error checking for invalid regids causes crashdec to bail out sooner when decoding memory that might hold valid instructions. Also, crashdec no longer prints stats, because stats aren't very useful when trying to decode random instruction memory (which might or might not be valid instructions). Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6070>	2020-07-28 09:45:08 +00:00
Rob Clark	6b379a4cb4	freedreno: drop shader_t When this code was outside of the mesa tree, we needed our own enum. Now we can use a common one, to simplify deduplicating the disasm code. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6070>	2020-07-28 09:45:08 +00:00
Rob Clark	bb98b71893	freedreno/ir3: split out regmask To unify the ir3 disasm code, we need to add in the regmask based register tracking from cffdec's version of the disassembler. Split out regmask (or at least the part that doesn't depend on ir3) so it can be shared. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6070>	2020-07-28 09:45:08 +00:00
Rob Clark	ddcee248ad	freedreno: add CI for envytools tools This also tunes `.freedreno-rules` a bit so that it isn't triggered by various tools that don't affect the driver build. The .gitlab-ci directory is kept separate from the toplevel one so that updates to (for example) reference decode output do not trigger all the other-driver jobs to run. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6070>	2020-07-28 09:45:08 +00:00
Rob Clark	b62e4a8e9e	freedreno/afuc: warnings cleanup Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6070>	2020-07-28 09:45:08 +00:00
Rob Clark	5125b4bc69	freedreno/decode: warnings cleanup Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6070>	2020-07-28 09:45:08 +00:00
Rob Clark	cbbaafdf72	freedreno/rnn: warnings cleanup Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6070>	2020-07-28 09:45:08 +00:00
Rob Clark	536f43cb96	freedreno: slurp in afuc Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6070>	2020-07-28 09:45:08 +00:00
Rob Clark	1ea4ef0d3b	freedreno: slurp in decode tools cffdump, crashdec, etc At this point there is some duplication with other files in-tree (ie. a2xx and a3xx+ disassembly), which will be cleaned up in a later commit. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6070>	2020-07-28 09:45:08 +00:00
Rob Clark	7c0bd8429f	freedreno: slurp in rnn Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6070>	2020-07-28 09:45:08 +00:00
Rob Clark	b721d336da	freedreno: slurp in rnndb Pull in all of $envytools/rnndb (including display, etc) from envytools commit 6ccdda33ac4d88e19d2a70e1b4edaaab5ec4b026 This changes the directory structure to match the organization in the envytools tree. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6070>	2020-07-28 09:45:08 +00:00
Rob Clark	7de0842d42	freedreno: make gen_header.py check parent directory With the next commit, the xml files will no longer all be in the same directory. But checking up a single directory level to resolve import will be sufficient. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6070>	2020-07-28 09:45:08 +00:00
Connor Abbott	8e8baecd6a	tu: Enable resource dynamic indexing This has actually worked since bindless support was merged. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6086>	2020-07-27 16:38:17 +00:00
Connor Abbott	8bc060ab81	ir3: Fix incorrect src flags for samp_tex Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6086>	2020-07-27 16:38:17 +00:00
Connor Abbott	e73a8a2b39	ir3: Remove redundant samp_tex validation It's already checked in ir3_validate. This way we don't have to fix it up for bindless. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6086>	2020-07-27 16:38:17 +00:00
Connor Abbott	3adc23f667	ir3: Validate bindless samp_tex correctly It's full instead of half precision, because the maximum number of textures/samplers is much larger. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6086>	2020-07-27 16:38:17 +00:00
Connor Abbott	d542bfc306	tu: Fix descriptor update templates with input attachments Found via dEQP-VK.binding_model.descriptorset_random.sets4.noarray.ubolimitlow.sbolimitlow.sampledimglow.outimgonly.noiub.nouab.frag.ialimitlow.0 Fixes: `159a1300ce` ("turnip: input attachment descriptor set rework") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6087>	2020-07-27 12:36:36 +00:00
Jonathan Marek	9ece61269d	turnip: fix SP_HS_UNKNOWN_A831 value for A650 Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5765>	2020-07-27 12:17:38 +00:00
Jonathan Marek	e646e77e18	turnip: use patchControlPoints for HS_INPUT_SIZE value It should be calculated from patchControlPoints, not tcs_vertices_out. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5765>	2020-07-27 12:17:38 +00:00
Jonathan Marek	da49a45351	turnip: move WFI out of draw state to fix a650 hangs Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5765>	2020-07-27 12:17:38 +00:00
Jonathan Marek	e5f4527f20	freedreno/ir3: fix wrong local_primitive_id_start type When changing the patch to use an offset instead of a bool, the type was accidentally left as bool. Fixes: `f472c98443` ("freedreno/ir3: add support for a650 tess shared storage") Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5765>	2020-07-27 12:17:38 +00:00
Connor Abbott	9e596cc2c2	tu: Enable vertex & fragment stores & atomics Note that there are some extra tess fails, but they're probably unrelated to the actual feature. There were also some xfails that were created as part of an earlier attempt to enable the feature which were fixed in the meantime, so remove them. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5738>	2020-07-24 18:43:40 +00:00
Connor Abbott	f7f29a04b4	tu: Detect invalid-for-binning renderpass dependencies This is all that was missing for stores & atomics. Closes: #3196 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5738>	2020-07-24 18:43:40 +00:00
Connor Abbott	d6d75fcd91	tu: Fix hangs for DS with no output Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5738>	2020-07-24 18:43:40 +00:00
Connor Abbott	7ad962bf89	tu: Fix empty blit scissor case With vertexPipelineStoresAndAtomics enabled, fixes: dEQP-VK.tessellation.invariance.one_minus_tess_coord_component.quads_fractional_even_spacing_cw_point_mode dEQP-VK.tessellation.invariance.tess_coord_component_range.triangles_fractional_even_spacing_ccw_point_mode Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5738>	2020-07-24 18:43:40 +00:00
Connor Abbott	6cbdffd79c	tu: Implement VK_KHR_draw_indirect_count Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6007>	2020-07-24 19:21:07 +02:00
Connor Abbott	52ec35f5a6	tu: Add missing wfi to tu6_emit_hw() It needs to be there before changing CCU state. This was accidentally deleted in `f494799a7f` when it should've been moved. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6007>	2020-07-24 19:20:44 +02:00
Connor Abbott	a0ca688a67	tu: Integrate WFI/WAIT_FOR_ME/WAIT_MEM_WRITES with cache tracking Track them via pending_flush_bits. Previously WFI was only tracked in flush_bits and WAIT_FOR_ME was emitted directly. This means that we don't emit WAIT_FOR_ME or WAIT_FOR_IDLE if there wasn't a cache flush or other write by the GPU. Also split up host writes from sysmem writes, as only the former require WFI/WAIT_FOR_ME. Along the way, I also realized that we were missing proper handling of transform feedback counter writes which require WAIT_MEM_WRITES. Plumb that through as well. And CmdDrawIndirectByteCountEXT needs a WAIT_FOR_ME as it does not wait for WFI internally. As an example of what this does, a typical barrier for transform feedback with srcAccess = VK_TRANSFORM_FEEDBACK_WRITE_COUNTER_BIT_EXT and dstAccess = VK_ACCESS_INDIRECT_COMMAND_READ_BIT used to emit on A650: - WAIT_FOR_IDLE and now we emit: - WAIT_MEM_WRITES - WAIT_FOR_ME So we've eliminated a useless WFI and added some necessary waits. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6007>	2020-07-24 19:20:44 +02:00
Connor Abbott	cd78a7a5ff	freedreno: Add INDIRECT_COUNT CP_DRAW_INDIRECT_MULTI variants These have an indirect count which is loaded from an iova, and the minimum is taken between the indirect and direct counts. Note, I also had to fix gen_header.py to deal with the extra-long names we get. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6007>	2020-07-24 19:20:44 +02:00
Connor Abbott	8da31ee15f	freedreno: Clean up CP_DRAW_MULTI_INDIRECT definition Depends on the envytools changes to make the "addvariant" magic work in order to decode this correctly, and to be able to print the register names directly. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6007>	2020-07-24 19:20:44 +02:00
Jonathan Marek	1747f9fdd0	turnip: remove extra gmem alignment Now that we clear the "PITCHALIGN" field when filling GMEM input attachment descriptors, we can get rid of the extra tile width alignment on a630/a640. With the "block_align_shift" value change, this brings down the default gmem_align from 16k to 4k on a630/a640 and down from 24k to 12k on a650, to match the gallium driver. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5528>	2020-07-24 13:44:42 +00:00
Jason Ekstrand	196db51fc2	anv,turnip,radv,clover,glspirv: Run nir_copy_prop before nir_opt_deref We're about to make the SPIR-V -> NIR path generate a bit more complex SSA chains for certain derefs. This will ensure we don't regress anyone when we start making vec2's of derefs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5278>	2020-07-23 22:43:21 -05:00
Rob Clark	d35b54c705	freedreno: sync registers from envytools Pull in a bunch of fixes and updates.. mostly using varset correctly, and fixes for implicit bools. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6052>	2020-07-23 17:11:16 -07:00
Connor Abbott	1610c69f34	tu: Enable VK_EXT_depth_clip_enable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6047>	2020-07-23 18:55:56 +00:00
Jonathan Marek	8fff8afb13	turnip: disable tiling for NV12/IYUV formats The last change to my previous MR to disable UBWC for the formats ended up breaking a few tests for A640 at least, because tiled-but-not-UBWC can be broken in some cases. Fixes: `1a83279da5` ("turnip: enable 420_UNORM formats") Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5817>	2020-07-21 20:08:07 +00:00
Connor Abbott	b559d26c74	freedreno/ir3: Fix SSBO size for bindless SSBO's We theoretically could push these sizes to the const file opportunistically, which appears to be what the blob does, but the maximum number of SSBO's is way too big to do that unconditionally. Just use resinfo to get the size for now. Fixes on turnip: dEQP-VK.ssbo.unsized_array_length.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6012>	2020-07-21 19:53:32 +00:00
Connor Abbott	c9c848dede	tu: Use common guardband helper Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5950>	2020-07-21 14:26:18 +00:00
Connor Abbott	19895dde90	freedreno: Add a helper for computing guardband sizes This should be much better than the previous method that was more guesswork-based than anything else. It returns a value within 1 of the blob for every input value I've tested, and it seems like it returns slightly better (but still legal) answers when it differs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5950>	2020-07-21 14:26:18 +00:00
Eric Anholt	d973e50f69	freedreno/ir3: Add missing ld_args_build_id to the ir3_delay unit test. It triggers the disk cache for me, and asserts about not getting the build id right. Fixes: `f97acb4bb4` ("freedreno/ir3: disk-cache support") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5989>	2020-07-20 22:11:51 +00:00
Eric Anholt	af92348b1c	freedreno/ir3: Fix disasm of register offsets in ldp/stp. I had a stp testcase that was getting its offset wrong, and by twiddling bits and feeding it to qc disasm, I found that the comment was sort of right: some of the cat6a bits implicated in the old comment do get used, as the high bits of the cat6c offset. Reallocating those bits also fixes how we were getting r960.y for r0.y. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5815>	2020-07-20 19:42:45 +00:00
Eric Anholt	d6d8dc133e	freedreno/ir3: Refactor cat6 general dst printing. We didn't need the extra branch and temp, we can move it inside of the dst handling by just duplicating the print of the dst reg. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5815>	2020-07-20 19:42:45 +00:00
Eric Anholt	62dcf75432	freedreno/ir3: Add a bunch more tests for cat6 opcodes. This started with making note of some ldp/stp instructions from the blob and how we differ from them. In the process of fixing it, I accidentally modified behavior of other opcodes, and the other instructions listed will keep us from doing that. I also dropped an old stl test that looks like I took from after a shader 'end' instruction. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5815>	2020-07-20 19:42:45 +00:00
Eric Anholt	ed3338f581	freedreno/ir3: Add a note about the instructions in the disasm test. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5815>	2020-07-20 19:42:45 +00:00
Rob Clark	912ad09112	freedreno/ir3/ra: fix array conflicts for split/merged Properly handle the difference between split and merged register file when determining where arrays can fit without conflicting with other arrays or pre-colored instructions. 1) if not mergedregs, only consider other things with same precision as potentially conflicting 2) if mergedregs, calculate everything in terms of half-regs and convert back to fullregs in the end Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5957>	2020-07-18 09:21:09 -07:00
Rob Clark	b1465c382b	freedreno/ir3/ra: assign vreg names to all array elements We shouldn't divide-by-two for half-reg arrays. We set the proper node interference class, based on `arr->half`. Fixes a RA fail with 16b arrays: src/freedreno/ir3/ir3_ra.c:633: name_to_array: Assertion `!"invalid array name"' failed. Caused by use/def iterators returning `arr->length` vreg names, but only assigning the array half that many names. Also, since we are assigning unique vreg names to each array element, there is no need to try and convert from half-reg to its conflicting full reg when pre-coloring the array elements. Getting us closer to having half-arrays work sanely with split-register-file (a5xx and earlier). Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5957>	2020-07-18 09:21:09 -07:00
Rob Clark	6317f7d574	freedreno/ir3/ra: debug msgs tweak Print out the assigned vreg names earlier. Also print the few special nodes. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5957>	2020-07-18 09:14:13 -07:00
Rob Clark	c2d94aa365	freedreno/ir3: fix half-reg array stores Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5957>	2020-07-18 09:14:13 -07:00
Rob Clark	5be171b888	freedreno/ir3: set array precision on creation Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5957>	2020-07-18 09:14:13 -07:00
Rob Clark	0472ca2aa5	freedreno/ir3/parser: half-precision relative regs Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5957>	2020-07-18 09:14:13 -07:00
Eric Anholt	5b38048347	freedreno/ir3: Add unit tests for derivatives disasm. Since I was going back to look at fine derivs again, add some tests of instruction encoding. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5699>	2020-07-18 00:43:44 +00:00
Eric Anholt	3d7d5d220b	freedreno/ir3: Fix duplicated fine derivatives instructions. legalize_block() can get run multiple times, which I didn't notice when adding fine derivs support. Other instruction clones change things such that the legalization won't trigger again, but that didn't apply to the DS.PP legalization. To keep someone else from tripping over this, split the one-shot legalization out of the iterative sync flag application. Fixes failures in dEQP-VK.glsl.derivate.dfdxfine.* Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3198 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5699>	2020-07-18 00:43:44 +00:00
Icecream95	314ba5e174	nir: Add a face_sysval argument to nir_lower_two_sided_color This is needed for handling drivers that use an input for loading the face, for example Panfrost with Midgard GPUs. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Tested-by: Urja Rannikko <urjaman@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5915>	2020-07-17 14:50:26 +00:00
Connor Abbott	b5a48a948a	tu: Enable VK_EXT_shader_stencil_export This passes the grand total of 3 CTS tests (2 actually enabled due to missing D32_SFLOAT_S8_UINT support) under dEQP-VK.pipeline.shader_stencil_export.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5936>	2020-07-16 20:49:20 +00:00
Connor Abbott	aeca92ed79	ir3: Handle gl_FragStencilRefARB Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5936>	2020-07-16 20:49:20 +00:00
Connor Abbott	981608ad04	freedreno/a6xx: Add stencilref register info Found by guessing. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5936>	2020-07-16 20:49:20 +00:00
Roman Stratiienko	29849aca0f	Android: Fixes for Q and R Fix Android-Q build: - Use AOSP prebuilt bison by specifying $(BISON) variable - Use AOSP prebuilt flex by specifying $(LEX) variable Fix Android-R build: - Add M4 environment variable for Android R and higher (See [1]) [1] - `2bfffb9f48`:Changes.md;dlc=997661002af1282d938e88c3c723037e42e5d283 Signed-off-by: Roman Stratiienko <r.stratiienko@gmail.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Tested-by: Mauro Rossi <issor.oruam@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5894>	2020-07-15 20:49:24 +00:00
Rob Clark	7f9039f0a8	freedreno/ir3: DCE unused arrays Letting unused arrays stick around confuses RA, which assigns vreg names to the unused arrays, but then does not precolor them (because they are unused). This leads to an assert in ra_select_reg_merged(): skqp: ../src/freedreno/ir3/ir3_ra.c:589: name_to_instr: Assertion '!name_is_array(ctx, name)' failed. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3262 Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5907>	2020-07-14 23:26:15 +00:00
Rob Clark	37e0e0791f	freedreno/ir3/ra: be better at failing It doesn't happen much. But it's annoying when we hit an impossible condition deep in RA 90% thru a long test run. Add some ra_assert()/ ra_unreachable() helper macros so we can bail cleanly and fail RA. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5907>	2020-07-14 23:26:15 +00:00
Rob Clark	b3ca55f5aa	freedreno/ir3: make compile fails more visible Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5907>	2020-07-14 23:26:15 +00:00
Jonathan Marek	d00487dd42	freedreno/regs: update a6xx PC regs Update some registers in the 0x9800-0xa000 range. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5870>	2020-07-14 18:00:06 +00:00
Jonathan Marek	2e32a20f7c	freedreno/regs: update a6xx VPC regs Update some registers in the 0x9000-0x95ff range. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5870>	2020-07-14 18:00:06 +00:00
Jonathan Marek	e883aa2585	freedreno/regs: update a6xx RB regs Update some registers in the 0x8c00-0x8dff range. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5870>	2020-07-14 18:00:06 +00:00
Jonathan Marek	a5c668518a	freedreno/regs: update a6xx GRAS registers Update some registers in the 0x8000-0x87ff range. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5870>	2020-07-14 18:00:06 +00:00
Jonathan Marek	f37f1a1a64	turnip: remove use of tu_cs_entry for draw states The tu_cs_entry struct doesn't match well what we want for SET_DRAW_STATE and CP_INDIRECT_BUFFER (requires extra steps to get iova and size), so start phasing it out. Additionally, use newly added tu_cs_draw_state where it doesn't require any effort (it requires a fixed size, but gets rid of the extra end_sub_stream) Note this also changes the behavior of CmdBindDescriptorSets for compute to emit directly in cmd->cs instead of doing through a CP_INDIRECT. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5558>	2020-07-14 17:00:08 +00:00
Jonathan Marek	7f24a69ace	turnip: fix inconsistencies with tu6_load_state_size The next patch assumes the correct size is returned in tu6_emit_load_state. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5558>	2020-07-14 17:00:07 +00:00
Jonathan Marek	bf997ca306	turnip: emit compute pipeline directly in CmdBindPipeline There's no need to defer it, and we can get rid of DIRTY_COMPUTE_PIPELINE. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5558>	2020-07-14 17:00:07 +00:00
Jonathan Marek	dce6cb1196	turnip: use DIRTY SDS bit to avoid making copies of pipeline load state ib Some testing showed that the DIRTY bit has the desired behavior, so use it to make things a bit simpler. Note in CmdBindPipeline, having the TU_CMD_DIRTY_DESCRIPTOR_SETS behind a if(pipeline->layout->dynamic_offset_count) was wrong. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5558>	2020-07-14 17:00:07 +00:00
Eric Engestrom	f95637f01a	meson: fix android vulkan build Android doesn't have `pthread_cancel()` and is unlikely to ever implement it [1], but `wsi_common_display.c` needs it (or an alternative). Let's just disable the platform on Android (as it used to be before `448eb19158`). [1] https://android-review.googlesource.com/c/platform/bionic/+/1215779/1/docs/status.md Fixes: `448eb19158` ("vulkan: automatically compile the `display` platform when available") Signed-off-by: Eric Engestrom <eric@engestrom.ch> Acked-by: Nataraj Deshpande <nataraj.deshpande@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5860>	2020-07-14 09:34:54 +00:00
Connor Abbott	bf1376aba0	tu: Don't invalidate irrelevant state when changing pipeline At least in the future this could let us avoid re-emitting gfx/cs constants when the other changes. This also matches what the blob does. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5877>	2020-07-14 10:23:58 +02:00
Connor Abbott	a16136796f	freedreno/a6xx: Add some documentation for shared consts I'm not convinced we'll actually want to use this, and there may be another enable bit in SP_UNKNOWN_AB00, but it's nice to at least write this down in case we want to try using it in the future. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5877>	2020-07-14 10:23:58 +02:00
Connor Abbott	e1fa740c4c	freedreno/a6xx: Rename and document HLSQ_UPDATE_CNTL It turns out that this clears CP_LOAD_STATE6 packets, including disabling any pending loads for SS6_INDIRECT/SS6_BINDLESS (these loads don't actually happen until the draw itself, and I'm not sure if they happen if the state is unused by the shader) and marking constants and UBO descriptors loaded with SS6_DIRECT as invalid. It's used very differently from HLSQ_UPDATE_CNTL on a4xx from whence the name came, and unlike on a4xx it's not readable, so this probably doesn't line up with HLSQ_UPDATE_CNTL on a4xx. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5877>	2020-07-14 10:23:58 +02:00
Kristian H. Kristensen	684cfca748	freedreno/registers: Rename SP_2D_SRC_FORMAT This register contains information about the destination format, so let's rename to SP_2D_DST_FORMAT. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5717>	2020-07-14 06:12:22 +00:00
Jonathan Marek	53e36cf062	turnip: drop GS clear path We didn't know how to write layer id without GS, since that's the only way to do it through VK/GL, and the blob didn't implement this clear case (and failed cases where it was absolutely necessary). However now we know how to set it after some educated guesses and looking at tess/geom traces, so the GS path can be dropped. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5790>	2020-07-14 04:05:24 +00:00
Jonathan Marek	a1a80c38ea	turnip: clean up primitive output state We only need to emit one set of primitive output registers. This may differ from the blob, because it seems to try to allow using the same pipeline with tess/geom enabled/disabled. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5790>	2020-07-14 04:05:24 +00:00
Jonathan Marek	7748afbb1e	freedreno/regs: update primitive output related registers Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5790>	2020-07-14 04:05:24 +00:00
Eric Anholt	5c1afd1ce4	freedreno/ir3: Fix uninit var warning. It's a decent bit of analysis to see that the initialization will always happen, and my compiler isn't doing so in at least one configuration. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5834>	2020-07-14 03:38:53 +00:00
Hyunjun Ko	d941c6b74f	turnip: implement VK_EXT_private_data Which is using base class's implementation. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5539>	2020-07-14 02:48:30 +00:00
Hyunjun Ko	5d3fdbc52b	turnip: Use the common base object type and struct. v2. Define new helper function to avoid duplicating a pair of function calls. v3. Move new helper functions to vk_object.h and call them. v4. Merge 2 commits to use common base object type and struct into one. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5539>	2020-07-14 02:48:30 +00:00
Hyunjun Ko	cd85315dcb	tu: Fix wrong copies of sampler descriptor. Found this with the following patch but it exists since adding ycbcr sampler to the struct. Fixes: `d070a7ba0c` Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5539>	2020-07-14 02:48:30 +00:00
Eric Engestrom	448eb19158	vulkan: automatically compile the `display` platform when available Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3161>	2020-07-10 13:48:24 +00:00
Jonathan Marek	ffb6eb6d5d	freedreno/ir3: run nir_opt_loop_unroll in optimization loop GL driver was relying on this being done by gallium, but there might be new loops to unroll during optimizations and turnip needs it. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5818>	2020-07-09 23:30:33 +00:00
Jonathan Marek	9c23afebbe	freedreno/ir3: fix setup_input for sparse vertex inputs With turnip we can have sparse input variables like: decl_var shader_in INTERP_MODE_NONE float @1 (VERT_ATTRIB_GENERIC1.x, 1, 0) decl_var shader_in INTERP_MODE_NONE float @2 (VERT_ATTRIB_GENERIC1.y, 1, 0) decl_var shader_in INTERP_MODE_NONE float @3 (VERT_ATTRIB_GENERIC1.w, 1, 0) Example of a test fixed: dEQP-VK.glsl.440.linkage.varying.component.vert_in.vec2.as_float_float_unused Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5818>	2020-07-09 23:30:33 +00:00
Jonathan Marek	26b75daef5	turnip: fix active_desc_sets not being set for compute pipeline This resulted in the load state being always empty. It's an optimization, so it didn't result in any failures. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5816>	2020-07-09 10:27:35 +00:00
Jonathan Marek	979e7e3680	freedreno/layout: layout simplifications and pitch from level 0 pitch This updates a3xx/a4xx/a5xx to fix the fetchsize to "PITCHALIGN" (called "MINLINEOFFSET" by the a3xx docs), and some simplifications to make things more like a6xx. Also similar simplifications for a2xx layout code. The pitch can always be determined using a simple calculation from the base level pitch, so don't pre-calculate a pitch for each mipmap level. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5796>	2020-07-08 20:46:08 +00:00
Jonathan Marek	fcac0b4fc9	freedreno/regs: document CS shared storage size bit Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5797>	2020-07-08 11:33:42 +00:00
Jonathan Marek	f472c98443	freedreno/ir3: add support for a650 tess shared storage A650 uses LDL/STL, and the "local_primitive_id" in tess ctrl shader comes from bits 16-21 in the header instead of 0-5. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5764>	2020-07-08 02:30:23 +00:00
Jonathan Marek	14c554a391	turnip: use global bo for clear blit shaders Fill the global bo with all possible shaders for 3D clear/blit. Note the global bo size is still <4k (so this doesn't cost any extra memory); this saves having to allocate shaders in sub_cs every time the 3D path is used. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5776>	2020-07-07 16:40:45 +00:00
Connor Abbott	6ff66942d2	freedreno: Sync registers with envytools Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5557>	2020-07-07 09:51:40 +00:00
Connor Abbott	c1ba7612fb	freedreno: Include adreno_pm4.xml.h before adreno_a6xx.xml.h This matches the XML, and soon adreno_a6xx.xml will start including types from adreno_pm4.xml. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5557>	2020-07-07 09:51:40 +00:00
Connor Abbott	f69c3849b8	tu: Force gl_Layer to 0 when necessary In particular this will help us implement input attachments correctly with layered rendering. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5732>	2020-07-07 08:10:47 +00:00
Connor Abbott	4f91345f49	ir3: Add layer_zero variant bit Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5732>	2020-07-07 08:10:47 +00:00
Ilia Mirkin	836d41d772	ir3: use empirical size for params as used by the shader For example only some UCPs may be used by the shader, triggering asserts that too many consts are being uploaded. While we're at it, also fix the const size when loading UCPs, since otherwise it doesn't correspond to what the shader is actually using. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5752>	2020-07-06 23:57:51 +00:00
Connor Abbott	7682c887b3	tu: Enable KHR_variable_pointers Passes dEQP-VK.spirv_assembly.instruction.graphics.variable_pointers.* and dEQP-VK.spirv_assembly.instruction.compute.variable_pointers.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5684>	2020-07-06 22:48:57 +00:00
Connor Abbott	9aec89ead3	tu: Rewrite variable lowering Don't lower to offsets, instead use nir_lower_explicit_io here and use actual pointers for UBO's and SSBO's. This makes KHR_variable_pointers trivial. This also fixes asserts with shared variables, which are now supposed to be lowered with nir_lower_explicit_io. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5684>	2020-07-06 22:48:57 +00:00
Jason Ekstrand	36a9046848	freedreno: Only call nir_lower_io on shader_in/out Gallium drivers should never see nir_var_uniform because gallium lowers regular uniforms to a UBO. No GL driver should ever see either nir_var_mem_shared because that's lowered in GLSL IR. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5418>	2020-07-06 19:54:30 +00:00
Ilia Mirkin	fc944428bf	ir3: mark ucp_enables as allowed values on all keys Both vertex and fragment shaders need to have the lowering. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5751>	2020-07-06 18:37:22 +00:00
Ilia Mirkin	00f9d4b1fd	a4xx: add noperspective interpolation support Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5753>	2020-07-06 17:35:56 +00:00
Jonathan Marek	6d8e2cec81	freedreno/regs: document SS6_UBO state src Document this new a6xx_state_src value seen in A640/A650 tess traces. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5760>	2020-07-06 15:46:48 +00:00
Rob Clark	0a7b1f9167	freedreno/fdperf: prefer render node Avoid inadvertently becoming master if fdperf happens to be the first thing to open the device. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5762>	2020-07-06 15:08:15 +00:00
Rob Clark	385d036f58	freedreno/fdperf: better compatible string matching Previously we would match the start of the compatible string, in a couple of cases, in order to match compatible strings like "qcom,adreno-630.2". But these cases would always list a more generic compatible (ie. "qcom,adreno") as a later choice. So if we parse all the compatible strings, we can do a more precise exact match. This avoids us accidentally matching on "qcom,adreno-smmu" and the hilarity that ensues. Fixes: `5a13507164` ("freedreno/perfcntrs: add fdperf") Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5762>	2020-07-06 15:08:15 +00:00
Rob Clark	9c34a3322d	freedreno/fdperf: fix print of base address Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5762>	2020-07-06 15:08:15 +00:00
Jonathan Marek	b76c6dcbc5	freedreno/ir3: fix/rework tess levels The previous version assumes tess level outputs will only be written once in the shader, however it's not possible to guarantee that. It also assumes all invocations will write all the levels, which is also not guaranteed. This is required to fix the "tesselation" and "terraintessellation" demos with turnip. The comment about nir_lower_io_to_temporaries in lower_tess_ctrl_block is removed because nir_lower_io_to_temporaries specifically skips TESS_CTRL shaders so the comment doesn't make sense. The split load for tess levels workaround is removed, the new version only has scalar access unless it ever gets vectorized. This sets NIR_COMPACT_ARRAYS cap to avoid the glsl tess vec lowering with gallium. It seems this will also disable "LowerCombinedClipCullDistance", which I'm not sure was needed or not. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5744>	2020-07-06 08:48:06 -04:00
Jonathan Marek	3c5512ce50	freedreno/layout: fix explicit layout offset not added to slice offset Accidentally broke this when rebasing the offending commit. My use case with non-zero explicit offset is UV plane of UBWC NV12, and only the UBWC slice offset is used for the UBWC sampler, so I didn't catch it immediately. Fixes: `d53dc6c376` ("freedreno/fdl6: rework layout code a bit (reduce linear align to 64 bytes)") Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5761>	2020-07-06 11:24:59 +00:00
Jonathan Marek	1a83279da5	turnip: enable 420_UNORM formats Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4600>	2020-07-05 15:25:17 +00:00
Jonathan Marek	7af2a0b9bc	turnip: support multi-image layouts Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4600>	2020-07-05 15:25:17 +00:00
Jonathan Marek	37cd3c256a	turnip: clear_blit: pass aspect mask to setup function Avoids having to duplicate logic to figure out the write mask on D24S8 Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4600>	2020-07-05 15:25:17 +00:00
Jonathan Marek	19f3c79c7e	turnip: fix tess param bo size calculation ir3 already calculates the stride in the tess param bo, so use that instead of an incorrect calculation. The calculation of per_vertex_output_size / per_patch_output_size is wrong because it counts dwords instead of bytes, and what it counts for per_vertex_output_size is a per-patch size because the glsl type is already an array of # vertex/patch elements. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5743>	2020-07-04 03:33:43 +00:00
Jonathan Marek	0e7b7c3087	turnip: vsc improvements * Remove scratch_bo from cmdbuffer, use a device-global bo instead, which also includes border color (and eventually shaders for 3D blit path) * Use CP_SET_BIN_DATA5_OFFSET to allow setting VSC buffer addresses only once at the start of the cmdstream * Use scratch bo mechanism for a resizable VSC buffer * Use feedback from "vsc_draw_overflow" and "vsc_prim_overflow" values to increase the size of VSC buffer when beginning to record a new cmdbuffer Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5570>	2020-07-03 14:49:10 +00:00
Jonathan Marek	4ac851ea25	turnip: rework render_tiles loop Loop through pipes and then loop over the tiles in that pipe instead of looping over all tiles then having to calculate the pipe # and slot #. Mainly this avoids the hard to follow "config_get_tile" logic, but should also be a gain due to better use of cache with the VSC data. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5570>	2020-07-03 14:49:10 +00:00
Jonathan Marek	8898ebce1a	turnip: make tiling config part of framebuffer state Compute the tiling config at framebuffer creation time. A framebuffer will be re-used multiple times, so this will avoid having to re-calculate the tiling config every time a command buffer is recorded. The tiling config already couldn't use the render area's x1/y1 because of hw binning, this move makes it so the render area isn't used at all for the tiling config. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5570>	2020-07-03 14:49:10 +00:00
Hyunjun Ko	9190cc9b15	tu,radv: fix potentially wrong offset of flexible array. v2. Remove redundant memset and make the expression simpler. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5703>	2020-07-03 00:45:16 +00:00
Jonathan Marek	9bebbf5867	freedreno/ir3: add support for INTERP_MODE_NOPERSPECTIVE Check the interp mode and use SYSTEM_VALUE_BARYCENTRIC_LINEAR_* instead when it is INTERP_MODE_NOPERSPECTIVE. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5582>	2020-07-01 13:52:59 +00:00
Jonathan Marek	0f5c9f9713	turnip: set missing bary sysvals Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5582>	2020-07-01 13:52:59 +00:00
Jonathan Marek	33457fc705	freedreno/ir3: add generic get_barycentric() This will be useful to support the missing barycentric sysvals. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5582>	2020-07-01 13:52:59 +00:00
Jonathan Marek	2e9ded21d1	freedreno/registers: update varying-related registers Note: * a3xx change based on available register documentation * a4xx guesses (RB_RENDER_CONTROL2 bits especially) * a5xx change based on a6xx, these registers seem identical Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5582>	2020-07-01 13:52:59 +00:00
Jonathan Marek	622c548967	turnip: enable depthBiasClamp Passes at least dEQP-VK.dynamic_state.rs_state.depth_bias_clamp Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5678>	2020-06-29 13:08:51 +00:00
Jonathan Marek	0ed100ea49	turnip: enable largePoints Passes dEQP-VK.rasterization.primitive_size.points.point_size_* Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5678>	2020-06-29 13:08:51 +00:00
Jonathan Marek	cb10edd544	freedreno/regs: add extra bits for UBWC array pitch This is not completely tested, but matches the max array pitch allowed by A6XX_TEX_CONST_9_FLAG_BUFFER_ARRAY_PITCH. Note this still doesn't allow all image sizes, but it allows 16384x16384 cpp=4 images to work. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5678>	2020-06-29 13:08:51 +00:00
Jonathan Marek	7d31bc9a34	freedreno/ir3: fix resinfo wrmask resinfo always writes 3 components, which was not being taken into account Fixes these tests: dEQP-VK.renderpass.suballocation.attachment_sparse_filling.input_attachment_3 dEQP-VK.renderpass.suballocation.attachment_sparse_filling.input_attachment_7 Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5674>	2020-06-28 16:32:08 +00:00
Kristian H. Kristensen	4fccbd0ea6	turnip: Put VK_KHR_external_fence_fd stubs back tu_ImportFenceFdKHR is used by tu_AcquireImageANDROID, which may or may not work, but let's at least keep things compiling until somebody has time to tie up the loose ends on the Android side. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5670>	2020-06-26 16:29:15 -07:00
Eric Anholt	34630fe081	turnip: Properly return VK_DEVICE_LOST on queuesubmit failures. The device lost support closely matches the anv code for the same. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/2769>	2020-06-26 19:34:17 +00:00
Eric Anholt	487576e3cf	turnip: Fix error handling of DRM_MSM_GEM_INFO ioctls. drmCommandWriteRead gives us a -errno, and we only checked for -1 (-EPERM, incidentally). All the callers wanted 0 for errors, which they were getting by the fact that req.value was 0-initialized in our stack allocation (though this only works as long as the kernel doesn't return an error after setting req.value to something), and -EPERM not really being an answer we would expect from an ioctl at this stage in the driver. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/2769>	2020-06-26 19:34:17 +00:00
Eric Anholt	e67c2e1c96	turnip: Do better TU_DEBUG=startup logging of drmGetDevices2() failure. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/2769>	2020-06-26 19:34:17 +00:00
Bas Nieuwenhuizen	aba8c579a9	turnip: semaphore support. There is only one queue for now, so for non-shared semaphores, the implementation is basically a no-op. For shared semaphores, this always uses syncobjs. This depends on syncobj support in the msm kernel driver. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/2769>	2020-06-26 19:34:17 +00:00
Rob Clark	189a0fecf5	freedreno/ir3: move nir finalization to after cache miss In cases where every variant is a shader-cache-hit, we never need the post-finalize round of nir opt/lowering passes. So defer this until the first shader-cache-miss to avoid doing pointless work. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5372>	2020-06-26 08:55:21 -07:00
Rob Clark	f97acb4bb4	freedreno/ir3: disk-cache support Adds a shader disk-cache for ir3 shader variants. Note that builds with `-Dshader-cache=false` have no-op stubs with `disk_cache_create()` that returns NULL. Binning pass variants are serialized together with their draw-pass counterparts, due to shared const-state. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5372>	2020-06-26 08:55:19 -07:00
Rob Clark	6aadb00e60	freedreno/ir3: build binning variant at same time as draw variant For shader-cache, we are going to want to serialize them together. Which is awkward if the two related variants are not compiled together. This also decouples allocation and compile, which will simplify adding shader-cache (which still needs to allocate, but can skip compile). Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5372>	2020-06-26 08:53:02 -07:00
Rob Clark	83b97bf161	freedreno/a6xx+ir3: stop generating pointless binning shaders Currently we always do sysmem if there is tess. And for GS, the binning pass VS ends up identical to the draw pass VS, so no point in compiling it twice. (For GS what we should do someday is generate a binning pass GS, and possibly if we can do cross-stage linking opts, an optimized binning pass VS, but the required outputs would somehow have to end up in the shader variant key.) Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5372>	2020-06-26 08:53:00 -07:00
Rob Clark	fdbe1ffaf7	freedreno/ir3: shuffle some variant fields Just to group together the parts that will get serialized when we have shader disk-cache. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5372>	2020-06-26 08:43:23 -07:00
Rob Clark	c0f22c3d94	freedreno/ir3: add ir3_compiler_destroy() Use ir3_compiler_destroy() rather than open-coding ralloc_free(). This will give us a place to add more compiler related cleanup code in the following patches. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5372>	2020-06-26 08:43:23 -07:00
Rob Clark	f1ab57359c	freedreno/ir3: move finalize_nir to pscreen hook Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5372>	2020-06-26 08:43:22 -07:00
Rob Clark	d3ae559378	freedreno/ir3: add ir3_finalize_nir() The next step is to hook this into pscreen->finalize_nir() so it can come before the state tracker's shader-caching. Unfortunately we still need to do lower_io after mesa/st, so that is split out into a post-finalize pass. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5372>	2020-06-26 08:43:22 -07:00
Jonathan Marek	2fbc12a0ac	turnip: fix huge scissor min/max case Now that tu_cs_emit_regs is used for the scissor, it hits an assert when the scissor is too large. Fixes this dEQP test: dEQP-VK.draw.scissor.static_scissor_max_int32 Fixes: `9c0ae5704d` ("turnip: fix empty scissor case") Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5655>	2020-06-26 11:34:49 +00:00
Jonathan Marek	1854eeefde	turnip: fix VK_STRUCTURE_TYPE_PHYSICAL_DEVICE_VULKAN_1_1_FEATURES My attempt to be clever here backfired, it overwrites the pNext and stops the loop (causing deqp to fail to query extension features after that). Fixes: `62de79ac44` ("turnip: implement VK_KHR_shader_draw_parameters") Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5654>	2020-06-26 11:11:29 +00:00
Connor Abbott	ba5e1c5310	tu: Pass firstIndex directly to CP_DRAW_INDX_OFFSET Saves some minor overhead, cleans things up a bit, and removes one more unknown. We now program the internal registers in the same way between direct/indirect draws. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5644>	2020-06-26 10:05:24 +00:00
Connor Abbott	259d07a2ff	freedreno/registers: Label firstIndex field in CP_DRAW_INDX_OFFSET Based on comparing the implementations of CP_DRAW_INDX_OFFSET and CP_DRAW_INDIRECT, this is what this field is for. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5644>	2020-06-26 10:05:24 +00:00
Connor Abbott	a32fb2f9d0	freedreno: On a5xx+ INDX_SIZE is MAX_INDICES This was already done correctly for the indirect variants, and turnip was setting the correct value, but it seems freedreno missed the change in the non-indirect variant. Also, fix a misspelling of "indices" and add a type to INDX_SIZE. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5644>	2020-06-26 10:05:24 +00:00
Connor Abbott	8ad65609da	tu: Share constlen between different stages properly Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5607>	2020-06-26 09:34:33 +00:00
Connor Abbott	48b1602b50	ir3: Add ir3_trim_constlen() This provides the policy for how to handle reducing constlen for some stages. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5607>	2020-06-26 09:34:33 +00:00
Connor Abbott	9edff0cfd4	ir3: Support variants with different constlen's This provides the mechanism for compiling variants with a reduced constlen. The next patch provides the policy for choosing which to reduce. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5607>	2020-06-26 09:34:33 +00:00
Connor Abbott	4554b946c3	ir3: Include ir3_compiler from ir3_shader I wanted to access the ir3_compiler from a small helper inside ir3_shader.h, which currently isn't possible. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5607>	2020-06-26 09:34:33 +00:00
Connor Abbott	2841bb1fac	ir3, freedreno: Round up constlen earlier Prevents problems when calculating whether we overflow the shared limit. Note that on a6xx, the macros handle the assert for us. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5607>	2020-06-26 09:34:33 +00:00
Eric Anholt	72c0522db2	turnip: Add support for polygon fill modes. Passes the new tests in dEQP-VK.rasterization.culling.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5650>	2020-06-25 13:46:30 -07:00
Eric Anholt	daee177ca0	freedreno/a6xx: Define the register fields for polygon fill mode. Produced by comparing the traces of: dEQP-VK.rasterization.culling.front_triangles dEQP-VK.rasterization.culling.front_triangles_point dEQP-VK.rasterization.culling.front_triangles_line Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5650>	2020-06-25 13:46:28 -07:00
Jonathan Marek	62de79ac44	turnip: implement VK_KHR_shader_draw_parameters Note: going by the blob, VFD_INDEX_OFFSET/FD_INSTANCE_START_OFFSET seem completely unused by indirect draws, so this changes them to only be set for non-indirect draws (and moves them to the vs_params draw state). Passes dEQP-VK.draw.shader_draw_parameters.* Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5635>	2020-06-25 15:57:45 +00:00
Jonathan Marek	16a9e233da	freedreno/ir3: add support for load_draw_id This is part of adding VK_KHR_shader_draw_parameters for turnip. IR3_DP_VTXID_BASE/IR3_DP_VTXCNT_MAX offsets are changed to match what CP_DRAW_INDIRECT_MULTI requires. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5635>	2020-06-25 15:57:45 +00:00
Jonathan Marek	01799b3448	freedreno/registers: add CP_DRAW_INDIRECT_MULTI Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5635>	2020-06-25 15:57:45 +00:00
Rob Clark	6da0647987	freedreno/ir3/ra: fix pre-color edge case Fixes a case where you have something like: aVecOutput.z = aScalarInput; In particular, skipping over things that are not the first component is wrong.. in the above case the input we need to precolor is the 3rd component. But we need to adjust the target register according to the offset. Fixes android.hardware.nativehardware.cts.AHardwareBufferNativeTests Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5601>	2020-06-25 04:40:40 +00:00

1571 Commits