KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Connor Abbott	cb45120556	ir3: Use (ss) for instructions writing shared regs The blob uses both nops and (ss). It turns out that in some rare cases the hardware does take more than 6 cycles, at least for movmsk, but adding nops is unnecessary. I believe the extra nops are only there due to the immaturity of the blob's implementation of subgroup ops, so we don't have to copy them - just handle shared reg producers the same as SFU instructions. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14246>	2022-01-07 14:26:08 +00:00
Connor Abbott	d45678cac4	ir3/postsched: Rename tex/sfu to sy/ss Analogous to the previous commit. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14246>	2022-01-07 14:26:08 +00:00
Connor Abbott	e6b35d606d	ir3/sched: Rename tex/sfu to sy/ss This now covers e.g. cat6 instructions as well, and ss will cover instructions writing shared regs as well. This is split out from the previous change to avoid too much churn and shouldn't cause any functional changes. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14246>	2022-01-07 14:26:08 +00:00
Connor Abbott	0cc4aca345	ir3: Use new (sy)/(ss) stall helpers in the compiler This fixes a few bad assumptions in the pre-RA and post-RA scheduler, for example that (sy) is only for texture instructions and (ss) is only for SFU instructions and (sy) and (ss) producers will always take the same number of cycles. This means we now start doing latency hiding for cat6 instructions like ldib and ldc. It also should make us hide latency more aggressively, since the number used for (sy) stall cycles was way lower than the real numbers for everything except ldc. Finally it unifies the various places (ss) soft nops were calculated. selected shader-db results: total nops in shared programs: 345278 -> 358959 (3.96%) nops in affected programs: 215622 -> 229303 (6.34%) helped: 690 HURT: 2430 helped stats (abs) min: 1 max: 125 x̄: 11.40 x̃: 5 helped stats (rel) min: 0.53% max: 100.00% x̄: 24.19% x̃: 18.52% HURT stats (abs) min: 1 max: 501 x̄: 8.87 x̃: 5 HURT stats (rel) min: 0.00% max: 9900.00% x̄: 52.36% x̃: 14.29% 95% mean confidence interval for nops value: 3.78 4.99 95% mean confidence interval for nops %-change: 28.21% 42.66% Nops are HURT. total mov in shared programs: 75049 -> 74110 (-1.25%) mov in affected programs: 15754 -> 14815 (-5.96%) helped: 566 HURT: 455 helped stats (abs) min: 1 max: 36 x̄: 4.52 x̃: 3 helped stats (rel) min: 0.83% max: 100.00% x̄: 35.85% x̃: 30.00% HURT stats (abs) min: 1 max: 35 x̄: 3.55 x̃: 3 HURT stats (rel) min: 0.00% max: 1100.00% x̄: 63.60% x̃: 25.00% 95% mean confidence interval for mov value: -1.25 -0.58 95% mean confidence interval for mov %-change: 2.92% 14.02% Inconclusive result (value mean confidence interval and %-change mean confidence interval disagree). 
total last-baryf in shared programs: 80468 -> 67670 (-15.90%) last-baryf in affected programs: 63676 -> 50878 (-20.10%) helped: 309 HURT: 147 helped stats (abs) min: 1 max: 260 x̄: 49.20 x̃: 24 helped stats (rel) min: 0.60% max: 98.81% x̄: 37.92% x̃: 40.91% HURT stats (abs) min: 1 max: 115 x̄: 16.35 x̃: 12 HURT stats (rel) min: 0.96% max: 1933.33% x̄: 45.55% x̃: 7.89% 95% mean confidence interval for last-baryf value: -33.03 -23.10 95% mean confidence interval for last-baryf %-change: -21.52% -0.50% Last-baryf are helped. total sstall in shared programs: 133997 -> 126398 (-5.67%) sstall in affected programs: 86866 -> 79267 (-8.75%) helped: 1893 HURT: 598 helped stats (abs) min: 1 max: 77 x̄: 6.06 x̃: 4 helped stats (rel) min: 0.71% max: 100.00% x̄: 32.82% x̃: 16.67% HURT stats (abs) min: 1 max: 65 x̄: 6.47 x̃: 6 HURT stats (rel) min: 0.00% max: 900.00% x̄: 65.51% x̃: 25.00% 95% mean confidence interval for sstall value: -3.39 -2.71 95% mean confidence interval for sstall %-change: -12.19% -6.24% Sstall are helped. total systall in shared programs: 350304 -> 288234 (-17.72%) systall in affected programs: 234855 -> 172785 (-26.43%) helped: 1456 HURT: 260 helped stats (abs) min: 1 max: 574 x̄: 46.42 x̃: 27 helped stats (rel) min: 0.19% max: 100.00% x̄: 39.43% x̃: 36.06% HURT stats (abs) min: 1 max: 757 x̄: 21.20 x̃: 8 HURT stats (rel) min: 0.00% max: 180.95% x̄: 24.82% x̃: 12.50% 95% mean confidence interval for systall value: -39.31 -33.03 95% mean confidence interval for systall %-change: -31.49% -27.90% Systall are helped. 
total waves in shared programs: 236732 -> 235142 (-0.67%) waves in affected programs: 6142 -> 4552 (-25.89%) helped: 535 HURT: 17 helped stats (abs) min: 2 max: 8 x̄: 3.08 x̃: 2 helped stats (rel) min: 12.50% max: 75.00% x̄: 28.78% x̃: 25.00% HURT stats (abs) min: 2 max: 6 x̄: 3.53 x̃: 4 HURT stats (rel) min: 16.67% max: 75.00% x̄: 37.35% x̃: 33.33% 95% mean confidence interval for waves value: -3.04 -2.72 95% mean confidence interval for waves %-change: -28.10% -25.39% Waves are helped. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14246>	2022-01-07 14:26:08 +00:00
Connor Abbott	7e60978d30	ir3: Introduce systall metric and new helper functions Add new centralized functions which will replace the various places we hardcode 10 for the number of (ss) nops, add numbers for soft (sy) nops based on similar computerator experiments with ldc, sam, and ldib (the most common (sy) producers), and add a "systall" metric which is analogous to sstall. This also fixes some cases where we'd erroneously count ldl* as (sy) producers instead of (ss) producers when calculating sstall. This only switches over the metric reporting to the new functions, so there is no behavior change. The following commit will switch over the rest of the compiler. While we're at it, remove max_sun as it's never set. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14246>	2022-01-07 14:26:08 +00:00
Connor Abbott	603791bdeb	ir3: Bump type mismatch penalty to 3 After some experimentation with computerator, it seems on a618 that writing a full register and then reading half of it as a half register requires a delay of 6, the same as the delay for cat5/cat6 sources. The other direction only has a delay of 5, but just bump it unconditionally out of an abundance of caution. Fixes: `890de1a436` ("ir3/delay: Fix full->half and half->full delay") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14246>	2022-01-07 14:26:08 +00:00
Connor Abbott	d371d807eb	ir3/ra: Fix logic bug in compress_regs_left If we're allocating a source then we force is_killed to false, not to true. Fixes a regression in dEQP-GLES31.functional.synchronization.in_invocation.image_atomic_write_read later. Fixes: `0ffcb19b9d` ("ir3: Rewrite register allocation") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14246>	2022-01-07 14:26:08 +00:00
Guilherme Gallo	a6d05e6863	ci: Add a630_skqp jobs Start Xorg during skqp job, since it is needed to make rendered tests work. There is 1 new job, namely `a630_skqp`, which runs GL and GLES backends and then the skqp GPU unittests. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5580 Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14146>	2022-01-05 20:15:04 +00:00
Guilherme Gallo	8992cf5ab8	ci: Build skqp on ARM64 images This commit makes the `kernel+rootfs_arm64` job build and install skqp on the ARM64 devices' rootfs. The Skia repository has a tool to prepare skqp models located at `tools/skqp/cut-release`, which gets files from [Skia Gold](https://skia.org/docs/dev/testing/skiagold/) and generates files.checksum, rendertests.txt and unittests.txt. One gives a range of commits to let `cut-release` find the right resources to prepare skqp for the user. However, it fails when trying to get image packages from a range of commits via HTTPS from the host https://public-gold.skia.org, which responds with error 404 every time. I tried a range of a thousand commits, yet it still does not give results. The workaround employed was to recover the most recent `files.checksum` and `rendertests.txt` files from the git history and generate `unittests.txt` from the `list_gpu_unit_tests` binary. `skqp` runs two lists of tests, `rendertests.txt` and `unittests.txt`. Both must be located inside the `skqp` assets folder. The first list uses GL and GLES to test rendering scenarios. The second runs some unit tests that do not render an image per se. In order to make the first `a630_skqp` run green, the crashing tests were removed from the test lists and the expectations of the failing ones were updated. It is worth noting that `rendertests.txt` can carry some detail about each test expectation, so each test can have a max pixel error count, to tell `skqp` that it is OK to have at most that number of errors for that test. See also: https://github.com/google/skia/blob/main/tools/skqp/README_ALGORITHM.md As each render backend has a different error count, two different `rendertests.txt` files were created, `src/freedreno/ci/freedreno-a630-skqp-gl_rendertests.txt` and `src/freedreno/ci/freedreno-a630-skqp-gles_rendertests.txt`, which refer to GL and GLES tests respectively. 
The unit tests file for a630 is located at `src/freedreno/ci/freedreno-a630-skqp_unittests.txt` ``` aaclip domain formats highcontrastfilter rectangle_texture yuv_make_color_space ``` ``` ProcessorOptimizationValidationTest VkProtectedContext_CreateNonprotectedContext VkYCbcrSampler_DrawImageWithYcbcrSampler VkYCbcrSampler_NoYcbcrSurface ``` Each test was updated with the max_error count equal to the first run result. ``` analytic_antialias_inverse async_rescale_and_read_dog_down async_rescale_and_read_dog_up async_rescale_and_read_rose async_rescale_and_read_text_down async_rescale_and_read_text_up async_rescale_and_read_text_up_large async_rescale_and_read_yuv420_rose complexclip2_path_bw encode-platform imageblur_large lcdtextsize onebadarc onefailarc scale-pixels surfaceprops textfilter_color textfilter_image ``` Considering all the following tests results as wrong. ``` async_rescale_and_read_no_bleed backdrop_imagefilter_croprect_persp complexclip2 imageblurrepeatmode mixerCF overdrawcolorfilter patch_alpha patch_primitive rrect_clip_bw scaledemoji_rendering yuv_splitter ``` v2: a) add link to HTML report on job log b) remove extraneous spaces diff c) remove unnecessary conditions from build-skqp.sh d) use fixed skqp source commit SHA v3: a) Use only main skia repository to fetch models and build skqp b) Use list_gpu_unit_tests binary to create a base unittests.txt file c) Remove crashing tests d) Set failing tests expectations for the first skqp run v4: a) Remove clang dependency b) Separate each skqp backend result into its folder c) Regroup a630_skqp in one job v5: a) Separate tests files per driver Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5580 Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14146>	2022-01-05 20:15:04 +00:00
Thomas H.P. Andersen	ff7aee2ac9	tu/clear_blit: use || when working with bools Fixes a warning with clang Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14315>	2021-12-28 03:13:38 +00:00
Vinson Lee	1d6f6f9102	ir3: Make shift operand 64-bit. Fix defect reported by Coverity Scan. Unintentional integer overflow (OVERFLOW_BEFORE_WIDEN) overflow_before_widen: Potentially overflowing expression 2 << W with type int (32 bits, signed) is evaluated using 32-bit arithmetic, and then used in a context that expects an expression of type uint64_t (64 bits, unsigned). Signed-off-by: Vinson Lee <vlee@freedesktop.org> Acked-by: Rob Clark <robclark@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14258>	2021-12-22 01:19:46 +00:00
Rob Clark	8a21b2fda0	freedreno/ir3: Dump const state with shader disasm Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14231>	2021-12-20 19:47:35 +00:00
Rob Clark	9766a5721d	freedreno/computerator: Mark shader bo for dumping Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14231>	2021-12-20 19:47:35 +00:00
Rob Clark	d1edc6d9a1	freedreno/computerator: Fix @buf header Order is important in the grammar, the more specific match needs to go first. Fixes: `ba1c989348` ("freedreno/computerator: pass iova of buffer to const register") Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14231>	2021-12-20 19:47:35 +00:00
Rob Clark	78c53f4888	freedreno/ir3: Handle instr->address when cloning Without this, a cloned instruction that takes full regs will trigger an ir3_validate assert. This can happen, for ex, if an instruction that writes p0.x and has a relative src gets cloned in ir3_sched. Fixes an assert in Genshin Impact with a debug build. Fixes: `9af795d9b9` ("ir3: Make ir3_instruction::address a normal register") Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14231>	2021-12-20 19:47:35 +00:00
Emma Anholt	9c722a06ed	ci/freedreno: Add known flakes from the last month. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14239>	2021-12-16 22:37:53 +00:00
Pierre-Eric Pelloux-Prayer	1cb5c1775b	glx: fix querying GLX_FBCONFIG_ID for Window This commit fixes apps using the following sequence: 1. XCreateWindow(dpy) -> win 2. glXCreateContextAttribsARB(dpy, ...) -> ctx 3. glXMakeCurrent(dpy, win, ctx) 4. glXQueryDrawable(dpy, win, GLX_FBCONFIG_ID, ...) glXQueryDrawable returned 0 (while correctly returning a valid GLX_FBCONFIG_ID for other types of drawables). This commit adds the same dance as driInferDrawableConfig to get the GLX visual from the Window, and then the GLX_FBCONFIG_ID of this visual. This fixes: * piglit: glx-query-drawable --attr=GLX_FBCONFIG_ID --type=WINDOW * Maya which uses the config ID from step 4 as an input to glXChooseFBConfig. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14174>	2021-12-16 01:21:36 +00:00
Danylo Piliaiev	c82d7e3617	turnip: Fix operator precedence in address calculation macros for queries Fixes crash in Oblivion, Skyrim, Crysis running through DXVK on 32b systems. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5723 Fixes: `937dd76426` "turnip: Implement VK_KHR_performance_query" Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14148>	2021-12-10 17:45:02 +00:00
Ilia Mirkin	0db2e78788	freedreno/ci/a306: increase concurrency No harm from using more threads, but not enough benefit to reduce parallelism unfortunately. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14067>	2021-12-08 00:50:25 +00:00
Ilia Mirkin	3db30ea877	freedreno/ci/a306: add more skips These come up with increased concurrency. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14067>	2021-12-08 00:50:25 +00:00
Danylo Piliaiev	c749da6135	ir3,turnip: Add support for GL_KHR_shader_subgroup_quad Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13817>	2021-12-07 20:45:53 +00:00
Danylo Piliaiev	3dfd4230bb	ir3,turnip: Enable subgroup ops support in all stages on gen4 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13817>	2021-12-07 20:45:53 +00:00
Danylo Piliaiev	ded51fd39e	ir3: Use getfiberid for SubgroupInvocationID on gen4 Since it requires (ss) categorize it as is_sfu() and not is_mem(). Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13817>	2021-12-07 20:45:53 +00:00
Danylo Piliaiev	d1c49901df	ir3: Add gen4 new subgroup instructions * getlast.w8 #4 - Perform jump for the first (CLUSTER_SIZE-1) fibers in a subgroup * brcst.active.w8 - necessary to implement arithmetic subgroup operations with prefix sum. * quad_shuffle.brcst - subgroupQuadBroadcast * quad_shuffle.horiz - subgroupQuadSwapHorizontal * quad_shuffle.vert - subgroupQuadSwapVertical * quad_shuffle.diag - subgroupQuadSwapDiagonal * getfiberid - gl_SubgroupID Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13817>	2021-12-07 20:45:53 +00:00
Danylo Piliaiev	e63ffc2f04	freedreno,tu: Limit the amount of instructions preloaded into icache Inferring from blob's cmdstream the size of shader instruction cache for: - a630 is 64 - a650 is 128 - a660 is 128 On a650 and a660 the GPU could hang if we exceed the limit, though it is not reproducible with computerator or a single amber test. Also, while blob limits the size to 128, Turnip still hangs with it but does not hang with the limit of 127. On a630 there seems to be no hang when the limit is exceeded. Fixes the hang of compute shader in Alien Isolation on a650/a660. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14044>	2021-12-07 13:48:35 +00:00
Ilia Mirkin	a7180bd4a6	freedreno/a5xx: enable OES_gpu_shader5 This extension is controlled by the ESSL feature level. Bump it up since all parts of OES_gpu_shader5 should be supported. This also avoids lowering all of the "advanced" functions (which should probably not be lowered in the first place since they're part of ES 3.1...) Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14035>	2021-12-03 20:04:17 +00:00
Guilherme Gallo	dabc068e6c	ci: Use ci-fairy minio login via token file For every CI job, put JWT content into a file and unset CI_JOB_JWT environment var ======= * virgl jobs: - Share JWT token file to crosvm instance - Keep using `export -p` due to high complexity in the scripts of these jobs. At least, the CI_JOB_JWT will not be leaked, since it is being unset at the `before_script` phase of each Mesa CI job. * iris jobs: Update lava_job_submitter to take token file as argument - generate-env with CI_JOB_JWT_TOKEN_FILE - create token file during baremetal init stage * baremetal jobs: Copy token file to bare-metal NFS Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14004>	2021-12-02 18:01:29 +00:00
Guilherme Gallo	cdf8a14bff	ci: Uprev piglit Bring up the piglit replay jwt-file argument feature. Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14004>	2021-12-02 18:01:29 +00:00
Ilia Mirkin	fc2cc39a0f	freedreno/ci/a306: split off snorm blending failures The hardware doesn't support this. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13990>	2021-12-02 03:39:28 +00:00
Ilia Mirkin	bbe5b745dc	freedreno/ci/a306: split off the f32 blend / texturing failures The hardware doesn't support this. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13990>	2021-12-02 03:39:28 +00:00
Ilia Mirkin	1f79c36dae	freedreno/ci/a306: separate msaa fails The driver does not implement MSAA. When that happens these can be split up further. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13990>	2021-12-02 03:39:28 +00:00
Ilia Mirkin	58aad3f403	freedreno/a3xx: add some legacy formats These can be used in "legacy" buffer textures. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13989>	2021-12-02 02:29:50 +00:00
Ilia Mirkin	41aa583edf	freedreno/ci/a306: add additional skip which hangchecks I was having trouble getting a run to complete without this. Was working earlier, not sure what changed. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13989>	2021-12-02 02:29:50 +00:00
Emma Anholt	d7226e9a9e	freedreno/a6xx: Allocate a fixed-size tess factor BO. Saves per-batch allocations, avoids reallocation for various vertex counts, and avoids needing the indirect tess addrs constobj so that we could emit the relocs to the tess BO after we'd emitted all the draws. Also apparently it fixes one of our CTS fails. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13851>	2021-12-02 01:47:38 +00:00
Rob Clark	145b0711fc	freedreno/crashdec: Basic GMU log decoding Looks like each entry is four dwords, with the second dword being a timestamp. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13937>	2021-12-01 17:53:21 +00:00
Rob Clark	8c654d02a3	freedreno/crashdec: Fallback to chip_id for GPU id Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13937>	2021-12-01 17:53:21 +00:00
Rob Clark	f33d5256dd	freedreno/crashdec: HFI queue decoding Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13937>	2021-12-01 17:53:21 +00:00
Rob Clark	2133d34b11	freedreno/crashdec: Split out mempool decoding Before we start adding GMU HFI decoding, let's split the other big section specific decoding (mempool) out into its own file. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13937>	2021-12-01 17:53:21 +00:00
Emma Anholt	b234c538e8	turnip: Move CP_SET_SUBDRAW_SIZE to vkCmdBindPipeline() time. Now that the subdraw size is constant for a pipeline, this lets tess draws avoid the slow path in vkCmdDraw*(). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6089>	2021-12-01 16:57:30 +00:00
Jonathan Marek	fd11d99254	turnip: use SUBDRAW_SIZE and constant sized tess bos This fixes the problem of large indirect draws, and at the same time avoids allocating too large buffers for tessellation. Reworked by @anholt to use a separate tess factor BO so we can skip the WFIs to set the TESSFACTOR_ADDR. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6089>	2021-12-01 16:57:30 +00:00
Emma Anholt	3748b8afce	freedreno/ir3: Make a shared helper for the tess factor stride. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6089>	2021-12-01 16:57:30 +00:00
Lionel Landwerlin	698343edc5	util/u_trace/perfetto: add new env variable to enable perfetto When using the Vulkan API, command buffers can be recorded way before perfetto is enabled. This can be problematic if you want already recorded command buffers to produce traces. This new environment variable makes perfetto enabled internally so that command buffers are recorded with timestamps, even though no perfetto recording happens. v2: rename to GPU_TRACE_INSTRUMENT (Rob) v3: Move instrumentation check to generated headers (Danylo) Decouple instrumentation enabling from tracing (Danylo) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13911>	2021-12-01 15:14:05 +00:00
Lionel Landwerlin	65697d6141	util/u_trace: add end_of_pipe property to tracepoints In order to capture the timestamp when things actually end on Intel GPU HW, we need to know whether the timestamp should be captured at the top or end of pipeline. v2: use one line python if/else (Danylo) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13911>	2021-12-01 15:14:05 +00:00
Ilia Mirkin	c868bff36a	freedreno/ci: add piglit runs for a306 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13920>	2021-11-30 20:06:07 -05:00
Qiang Yu	fcc062235c	ci: remove egl-copy-buffers from fail list egl-copy-buffers test has been fixed for dri3. So remove it from broadcom and freedreno ci fail list to prevent the gitlab ci test fail: spec@egl 1.4@egl-copy-buffers,UnexpectedPass Also remove it from radeonsi ci fail list since I verified on radeonsi. Acked-by: Daniel Stone <daniels@collabora.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13868>	2021-11-30 01:58:42 +00:00
Ilia Mirkin	e31d08d307	ci: move windowoverlap exclusion to all-skips The test is just plain not built by our containers. Skip it everywhere. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13919>	2021-11-29 18:08:49 -05:00
Ilia Mirkin	f533d7a446	freedreno/ir3: get the post-lowering clip/cull mask The variant may include a lowered gl_Clip/CullDistance array. So we have to use the variant's info (which is not available). However we save off the clip/cull masks already, so just reuse those. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13891>	2021-11-28 02:55:58 -05:00
Ilia Mirkin	13fb587b8a	freedreno/ir3: indicate that clipdist arrays are in use We expose the compact array cap, which means that we get compact clipdist arrays. Indicate this to the lowering pass so that it works for gl_ClipDistance from fs, among others. Fixes, among others, on a420, tests/spec/glsl-1.30/execution/clipping/fs-clip-distance-interpolated.shader_test Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13891>	2021-11-28 02:55:58 -05:00
Danylo Piliaiev	a78c36ecc6	ir3/cp: Prevent setting an address on subgroup macros These macros expand to a mov in an if statement, which breaks the assumption that the instruction producing an address and the instruction consuming it are in the same block. Fixes test: dEQP-VK.subgroups.ballot_broadcast.framebuffer.subgroupbroadcast_intvertex Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13931>	2021-11-25 12:18:48 +00:00
Connor Abbott	969369e962	ir3/lower_subgroups: Fix potential infinite loop I was trying to be clever here, skipping ahead to the newly-created block and processing the remaining instructions after the split in the same loop. But if the last instruction in a block was lowered, the saved next instruction would be the head of the block before the split, not the new block, and we would compare it to the new block so we wouldn't stop like we were supposed to. Stop being so clever, and just restart processing with the new block after lowering an instruction. Because we're wrapping the actual transform in yet another loop, and the restarting logic is a bit tricky, refactor the actual lowering into a separate lower_instr function. Otherwise we'd be mixing the two and indenting the actual logic even more. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13928>	2021-11-25 10:16:48 +00:00
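
Note on 1d6f6f9102 ("ir3: Make shift operand 64-bit") in the list above: the Coverity report describes `2 << W` being evaluated in 32-bit arithmetic and only afterwards used as a `uint64_t`. The sketch below is an illustration of that class of defect, not the Mesa code; the helper names `shift_narrow` and `shift_wide` are hypothetical. The narrow variant uses `uint32_t` instead of `int` so that the lost high bit is well-defined truncation rather than signed-overflow undefined behavior.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper (not from Mesa): the shift happens at 32 bits and
 * the result is widened afterwards, so any bit shifted past bit 31 is
 * already gone by the time the value becomes a uint64_t.
 * Requires w < 32. */
static inline uint64_t shift_narrow(unsigned w)
{
   uint32_t narrow = (uint32_t)2 << w;
   return narrow; /* widening happens here, too late */
}

/* Hypothetical helper (not from Mesa): the operand is made 64-bit before
 * the shift, so the whole expression is evaluated in 64-bit arithmetic.
 * Requires w < 63. */
static inline uint64_t shift_wide(unsigned w)
{
   return UINT64_C(2) << w;
}

/* Small self-check showing where the two variants agree and diverge. */
static inline void shift_self_check(void)
{
   assert(shift_narrow(4) == shift_wide(4)); /* small shifts agree */
   assert(shift_narrow(31) == 0);            /* high bit was lost */
   assert(shift_wide(31) == UINT64_C(4294967296));
}
```

For small shift counts both variants return the same value, which is why this kind of defect tends to be found by static analysis rather than by tests.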


2925 Commits