KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Tomeu Vizoso	20507f8b17	gitlab-ci: Download traces from MinIO Downloading the traces directly from git causes very high egress from GCE, which is expensive. So we can expand trace testing further, we are going to keep a cache in freedesktop.org's MinIO instance. This commit implements downloading from it. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-By: Rohan Garg <rohan.garg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5472>	2020-07-21 09:22:18 +00:00
Rohan Garg	087be7e322	gitlab-ci: Replay traces on lava devices Submit lava jobs to replay traces on Veyron (Mali T760) and Kevin (Mali T860) boards. Signed-off-by: Rohan Garg <rohan.garg@collabora.com> Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-By: Rohan Garg <rohan.garg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5472>	2020-07-21 09:22:18 +00:00
Kenneth Graunke	576c53dadf	iris: Fix CCS check in iris_texture_subdata(). The intention here was to check "Would the GPU be able to compress this if we used the PBO-based texture upload path?" Prior to Gen12, that meant checking for CCS_E. On Gen12, there are a lot more types of compression, and basic CCS_E was replaced by GEN12_CCS_E, making this check simply not work, so we'd take the CPU path instead. Instead, check if it has CCS, and isn't the basic "fast clear" CCS_D. Fixes: `39f06e2848` ("iris: Implement pipe->texture_subdata directly") Tested-by: Mark Janes <mark.a.janes@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6005>	2020-07-21 09:10:37 +00:00
Rhys Perry	0868638aed	nir/lower_int64: lower 64-bit amul Fixes an issue with Renderdoc's shader debugging with ACO. If nir_opt_algebraic isn't called in-between nir_lower_explicit_io and nir_lower_int64, we can end up with 64-bit multiplications. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Fixes: `6320e37d4b` ('nir: add amul instruction') Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5709>	2020-07-21 06:47:10 +00:00
Jason Ekstrand	4d44848c47	anv: Advertise support for VK_EXT_shader_atomic_float We already have all of the shader code for load/store/exchange. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5992>	2020-07-21 05:01:34 +00:00
Jason Ekstrand	675d7b19a9	intel/fs: Use the correct logical op for global float atomics Fixes: `e644ed468f` "intel/fs: Implement nir_intrinsic_global_atomic_*" Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5992>	2020-07-21 05:01:34 +00:00
Jason Ekstrand	84086b620e	spirv: Add support for SPV_EXT_shader_atomic_float Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5992>	2020-07-21 05:01:34 +00:00
Jason Ekstrand	2a568c595b	spirv: Update headers and grammar json This pulls in commit 63cb1fc131573fa from KhronosGroup/SPIRV-Headers Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5992>	2020-07-21 05:01:34 +00:00
Eric Engestrom	cc03448008	egl: inline _EGLAPI into _EGLDriver _EGLDriver was an empty wrapper around _EGLAPI, so let's only keep one of them. "driver" represents better what's being accessed, so that's the one we're keeping. Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5987>	2020-07-21 00:59:43 +00:00
Bas Nieuwenhuizen	7b7917a424	radeonsi: Inhibit clock-gating for perf counters. Otherwise most counters return 0. Should be much more user friendly than having to totally disable clock-gating on the kernel cmdline. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5972>	2020-07-20 23:56:26 +00:00
Bas Nieuwenhuizen	794ba3efd7	amd/registers: add RLC_PERFMON_CLK_CNTL for pre-GFX10 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5972>	2020-07-20 23:56:26 +00:00
Jason Ekstrand	36e6ac65c5	anv: Advertise VK_EXT_image_robustness We already support a superset of VK_EXT_image_robustness via VK_EXT_robustness2. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5985>	2020-07-20 22:30:18 +00:00
Eric Anholt	d973e50f69	freedreno/ir3: Add missing ld_args_build_id to the ir3_delay unit test. It triggers the disk cache for me, and asserts about not getting the build id right. Fixes: `f97acb4bb4` ("freedreno/ir3: disk-cache support") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5989>	2020-07-20 22:11:51 +00:00
Samuel Pitoiset	3688da2192	radv: advertise VK_EXT_image_robustness All new dEQP-VK.robustness.image_robustness.* pass. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5979>	2020-07-20 21:18:27 +00:00
Christian Gmeiner	096adbe369	ci: bare-metal: use nginx to get results from DUT Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2655 Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5661>	2020-07-20 20:21:12 +00:00
Yevhenii Kolesnikov	101400d449	mesa: change error code of TextureSubImage for incorrect target According to the "Errors" list of the OpenGL 4.6 spec, section 8.6 "Alternate Texture Image Specification Commands": An INVALID_OPERATION error is generated by TextureSubImage if the effective target of texture does not match the command, as shown in table 8.15. Signed-off-by: Yevhenii Kolesnikov <yevhenii.kolesnikov@globallogic.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5934>	2020-07-20 19:58:14 +00:00
Eric Anholt	af92348b1c	freedreno/ir3: Fix disasm of register offsets in ldp/stp. I had a stp testcase that was getting its offset wrong, and by twiddling bits and feeding it to qc disasm, I found that the comment was sort of right: some of the cat6a bits implicated in the old comment do get used, as the high bits of the cat6c offset. Reallocating those bits also fixes how we were getting r960.y for r0.y. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5815>	2020-07-20 19:42:45 +00:00
Eric Anholt	d6d8dc133e	freedreno/ir3: Refactor cat6 general dst printing. We didn't need the extra branch and temp, we can move it inside of the dst handling by just duplicating the print of the dst reg. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5815>	2020-07-20 19:42:45 +00:00
Eric Anholt	62dcf75432	freedreno/ir3: Add a bunch more tests for cat6 opcodes. This started with making note of some ldp/stp instructions from the blob and how we differ from them. In the process of fixing it, I accidentally modified behavior of other opcodes, and the other instructions listed will keep us from doing that. I also dropped an old stl test that looks like I took from after a shader 'end' instruction. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5815>	2020-07-20 19:42:45 +00:00
Eric Anholt	ed3338f581	freedreno/ir3: Add a note about the instructions in the disasm test. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5815>	2020-07-20 19:42:45 +00:00
Jason Ekstrand	4ab3a219cc	vulkan: Update Vulkan XML and headers to 1.2.148 Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5983>	2020-07-20 18:28:10 +00:00
Eric Anholt	fd24a95995	ci: Use FDO_CI_CONCURRENT as our -j flags when present in the runner env. fd.o has retuned the x86 runners on packet for -j8. Rather than having to tweak our CI every time fd.o decides to rebalance job concurrency, respect what the runner admin has chosen for their builds (this will also be convenient for people with large local runners). Reviewed-by: Michel Dänzer <michel@daenzer.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5669>	2020-07-20 17:22:17 +00:00
Daniel Schürmann	5f79e4e69a	nir/algebraic: fold some nested bcsel Totals from 14266 (10.62% of 134368) affected shaders (Polaris): SGPRs: 761756 -> 762732 (+0.13%); split: -0.00%, +0.13% VGPRs: 430392 -> 430924 (+0.12%); split: -0.05%, +0.17% SpillSGPRs: 4652 -> 4628 (-0.52%); split: -0.60%, +0.09% CodeSize: 30133000 -> 29949780 (-0.61%); split: -0.66%, +0.05% MaxWaves: 102122 -> 102111 (-0.01%); split: +0.00%, -0.01% Instrs: 5845085 -> 5841668 (-0.06%); split: -0.08%, +0.03% Cycles: 69033140 -> 68889188 (-0.21%); split: -0.22%, +0.01% VMEM: 8479021 -> 8474978 (-0.05%); split: +0.03%, -0.08% SMEM: 831437 -> 830464 (-0.12%); split: +0.06%, -0.18% VClause: 105411 -> 105410 (-0.00%); split: -0.01%, +0.01% SClause: 327727 -> 327780 (+0.02%); split: -0.00%, +0.02% Copies: 372704 -> 373306 (+0.16%); split: -0.16%, +0.32% Branches: 112260 -> 112269 (+0.01%); split: -0.00%, +0.01% PreSGPRs: 433308 -> 433631 (+0.07%); split: -0.01%, +0.09% PreVGPRs: 397888 -> 397905 (+0.00%); split: -0.01%, +0.01% Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4830>	2020-07-20 15:56:46 +00:00
Daniel Schürmann	27244662f2	nir/algebraic: propagate b2i out of ior/iand Totals from 761 (0.57% of 134368) affected shaders (Polaris): SGPRs: 29496 -> 29488 (-0.03%) SpillSGPRs: 41 -> 43 (+4.88%) CodeSize: 1922036 -> 1882408 (-2.06%); split: -2.08%, +0.02% Instrs: 366051 -> 360362 (-1.55%); split: -1.57%, +0.02% Cycles: 7692516 -> 7661216 (-0.41%); split: -0.41%, +0.01% VMEM: 365175 -> 365172 (-0.00%) VClause: 15324 -> 15322 (-0.01%) SClause: 9825 -> 9824 (-0.01%); split: -0.02%, +0.01% Copies: 41216 -> 41294 (+0.19%); split: -0.01%, +0.20% Branches: 7020 -> 7033 (+0.19%) PreSGPRs: 22103 -> 22106 (+0.01%) PreVGPRs: 26518 -> 26515 (-0.01%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4830>	2020-07-20 15:56:46 +00:00
Daniel Schürmann	baee5a9812	nir/algebraic: add distributive rules for ior/iand Totals from 581 (0.43% of 134368) affected shaders (Polaris): CodeSize: 1389560 -> 1386488 (-0.22%) Instrs: 264488 -> 263984 (-0.19%) Cycles: 1057952 -> 1055936 (-0.19%) VMEM: 296016 -> 291613 (-1.49%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4830>	2020-07-20 15:56:46 +00:00
Daniel Schürmann	70d3efeb88	nir/algebraic: optimize (a < 0.0) ? -a : a -> fabs(a) Totals from affected shaders: (VEGA) SGPRS: 13920 -> 13920 (0.00 %) VGPRS: 10252 -> 10252 (0.00 %) Spilled SGPRs: 62 -> 62 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 587648 -> 587224 (-0.07 %) bytes LDS: 5 -> 5 (0.00 %) blocks Max Waves: 1489 -> 1489 (0.00 %) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4830>	2020-07-20 15:56:46 +00:00
Daniel Schürmann	9d22c5ed71	nir/algebraic: optimize fmul(x, bcsel(c, -1.0, 1.0)) -> bcsel(c, -x, x) Totals from affected shaders: (VEGA) SGPRS: 545712 -> 545712 (0.00 %) VGPRS: 413092 -> 413116 (0.01 %) Spilled SGPRs: 10616 -> 10616 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 37031684 -> 36984248 (-0.13 %) bytes LDS: 427 -> 427 (0.00 %) blocks Max Waves: 54350 -> 54340 (-0.02 %) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4830>	2020-07-20 15:56:46 +00:00
Daniel Schürmann	56ec814b56	nir/algebraic: add some more unop + bcsel optimizations Totals from affected shaders: (VEGA) SGPRS: 284392 -> 284400 (0.00 %) VGPRS: 261080 -> 261076 (-0.00 %) Spilled SGPRs: 105 -> 105 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 24698596 -> 24277788 (-1.70 %) bytes LDS: 196 -> 196 (0.00 %) blocks Max Waves: 10101 -> 10105 (0.04 %) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4830>	2020-07-20 15:56:45 +00:00
Daniel Schürmann	2fca183910	nir/algebraic: add optimizations for fsign/isign This just reverts fsign/isign lowering. Totals from affected shaders: SGPRS: 257496 -> 256672 (-0.32 %) VGPRS: 181800 -> 178864 (-1.61 %) Spilled SGPRs: 105 -> 105 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 11355852 -> 11141840 (-1.88 %) bytes LDS: 3789 -> 3789 (0.00 %) blocks Max Waves: 30453 -> 30951 (1.64 %) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4830>	2020-07-20 15:56:45 +00:00
Daniel Schürmann	8e1b75b330	nir/algebraic: optimize iand/ior of (n)eq zero Found in some Detroit: Become Human shaders. Totals from affected shaders: SGPRS: 700256 -> 700256 (0.00 %) VGPRS: 507208 -> 507212 (0.00 %) Spilled SGPRs: 142531 -> 142531 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 76404616 -> 76301768 (-0.13 %) bytes LDS: 43 -> 43 (0.00 %) blocks Max Waves: 21438 -> 21438 (0.00 %) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4830>	2020-07-20 15:56:45 +00:00
Daniel Schürmann	e4281dbecc	nir: also move b2i in case of nir_move_copies Booleans are often more efficient with register usage. This also allows moving comparisons further. Totals from affected shaders: (VEGA) SGPRS: 451608 -> 450320 (-0.29 %) VGPRS: 351448 -> 351256 (-0.05 %) Spilled SGPRs: 105 -> 105 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 1008 -> 1008 (0.00 %) dwords per thread Code Size: 26555596 -> 26551080 (-0.02 %) bytes LDS: 10323 -> 10323 (0.00 %) blocks Max Waves: 42850 -> 42934 (0.20 %) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4830>	2020-07-20 15:56:45 +00:00
Daniel Schürmann	de0ebaf09d	nir/algebraic: optimize bcsel(a, 0, 1) to b2i This avoids combination with other bcsel operations, and as b2i is often a no-op (when used for iadd and such), the resulting pattern is preferable. Totals from affected shaders: (VEGA) SGPRS: 598448 -> 598448 (0.00 %) VGPRS: 457940 -> 457352 (-0.13 %) Spilled SGPRs: 127154 -> 127154 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 0 -> 0 (0.00 %) dwords per thread Code Size: 64836352 -> 64802728 (-0.05 %) bytes LDS: 781 -> 781 (0.00 %) blocks Max Waves: 22931 -> 22931 (0.00 %) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4830>	2020-07-20 15:56:45 +00:00
Icecream95	e764192f40	pan/mdg: Use the blend RT for blend shader framebuffer fetches Fixes piglit test fbo-drawbuffers-blend-add when fixed-function blending is disabled in panfrost_get_blend_for_context. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5892>	2020-07-20 14:15:49 +00:00
Icecream95	3ec252a3b2	panfrost: 8x MRT support Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5892>	2020-07-20 14:15:49 +00:00
Icecream95	978f963ea4	panfrost: Use more tilebuffer sizes This will be needed for 8x MRT with 128-bit framebuffer formats. Adds support for 256-bit, 1024-bit, and 2048-bit tilebuffer allocations, depending on the amount of data required. v2: Squash commits (Alyssa) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5892>	2020-07-20 14:15:49 +00:00
Icecream95	c1d3d39e97	panfrost: Fake RGTC support For most GPUs RGTC is disabled, so it needs to be emulated, using the fake_rgtc option of u_transfer_helper. Passes the rgtc-teximage tests in piglit. v2: Update docs/features.txt (Alyssa) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5975>	2020-07-20 09:54:49 -04:00
Rhys Perry	fac813dc61	spirv: don't split memory barriers If the SPIR-V had a shared+image memory barrier, we would emit two NIR barriers: a shared barrier and an image barrier. Unlike a single barrier, two barriers allow transformations such as: intrinsic image_deref_store (ssa_27, ssa_33, ssa_34, ssa_32, ssa_25) (1) intrinsic memory_barrier_shared () () intrinsic memory_barrier_image () () intrinsic store_shared (ssa_35, ssa_24) (0, 1, 4, 0) -> intrinsic memory_barrier_shared () () intrinsic store_shared (ssa_35, ssa_24) (0, 1, 4, 0) intrinsic image_deref_store (ssa_27, ssa_33, ssa_34, ssa_32, ssa_25) (1) intrinsic memory_barrier_image () () This commit fixes two dEQP-VK.memory_model.* CTS tests with ACO. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5951>	2020-07-20 12:05:16 +00:00
Samuel Pitoiset	28c227c7ca	radv/winsys: always allow GTT placements on APUs When the VRAM size is small and the preferred heap is only VRAM, the kernel tries to always honor the requested heap and it does a ton of evictions which is a disaster for performance. On APUs, VRAM and GTT have similar performance, so allow the kernel to choose the best placement (GTT or VRAM) itself. This gives a huge performance boost with Doom Eternal on RAVEN. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5665>	2020-07-20 11:41:07 +00:00
Samuel Pitoiset	d1bba2eee7	radv: disable CPU caching for IBS to reduce fetch latency AMDGPU_GEM_CREATE_CPU_GTT_USWC should be faster when CPU reads are unexpected (because they aren't cached). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5959>	2020-07-20 11:23:19 +00:00
Pierre-Eric Pelloux-Prayer	d2a3ca289f	radeonsi: adjust epitch for PIPE_FORMAT_R8G8_R8B8_UNORM This fixes si_compute_copy_image for yuyv images (so using PIPE_FORMAT_R8G8_R8B8_UNORM). With this change, the following gst pipeline produces the expected results for various image sizes (with or without AMD_DEBUG=nodma): gst-launch-1.0 filesrc location=input.jpg ! jpegparse ! vaapijpegdec ! filesink location=output.yuv Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5841>	2020-07-20 10:32:44 +00:00
Pierre-Eric Pelloux-Prayer	87ecfdfbf0	ac/surface: adapt surf_size when modifying surf_pitch Otherwise we might get VM_L2_PROTECTION_FAULT_STATUS errors. Fixes: `8275dc1ed5` ("ac/surface: fix epitch when modifying surf_pitch") Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5841>	2020-07-20 10:32:44 +00:00
Gert Wollny	1fa36c1d3d	r600/sfn: write stream outputs to correct mem ring Fixes: arb_gpu_shader5-xfb-streams Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5963>	2020-07-20 09:32:51 +00:00
Gert Wollny	21d296a481	r600/sfn: Make the pin_to_channel generic Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5963>	2020-07-20 09:32:51 +00:00
Gert Wollny	3ea847e6d1	r600/sfn: Only use sample mask if the according shader key is set This fixes all the piglits from arb_sample_shading "samplemask * *" with the nir backend. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5963>	2020-07-20 09:32:51 +00:00
Gert Wollny	c91979c634	r600: Add shader key item to identify when the sample mask should be used The sample mask must be applied when more than one sample is available or multisampling is not enabled, so add a shader key to track this. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5963>	2020-07-20 09:32:51 +00:00
Gert Wollny	05df4bfbca	r600/sfn: Fix default z swizzle for GDS instructions Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5963>	2020-07-20 09:32:51 +00:00
Gert Wollny	2779aa360e	r600/sfn: Fix Ring output swizzle masks Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5963>	2020-07-20 09:32:51 +00:00
Gert Wollny	c18b1c6df5	r600/sfn: Add a forced output swizzle for depth write This makes sure no components are written that shouldn't be written. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5963>	2020-07-20 09:32:51 +00:00
Gert Wollny	d31ef0b7a4	r600/sfn: correct handling of loading vec4 with fetching constants Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5963>	2020-07-20 09:32:51 +00:00
Gert Wollny	aca99e6fc9	r600/sfn: Add option to get a temp value for a specific channel Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5963>	2020-07-20 09:32:51 +00:00

126569 Commits

All Branches