KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Marek Olšák	345f04ed92	radeonsi: remove r600_emit_reloc Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-09 16:27:02 +02:00
Marek Olšák	da61946cb1	radeonsi: merge si_set_streamout_targets with si_common_set_streamout_targets Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-09 16:27:00 +02:00
Marek Olšák	a86c9328ce	radeonsi: add si_so_target_reference The src type is different on purpose. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-09 16:26:58 +02:00
Marek Olšák	65f2e33500	radeonsi: import r600_streamout from drivers/radeon Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-09 16:26:55 +02:00
Marek Olšák	ed7f27ded8	radeonsi: add performance thresholds for CP DMA, decrease it for clears The first one isn't used yet. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-09 16:24:21 +02:00
Marek Olšák	8e969cce38	radeonsi: disable primitive binning on Vega10 (v2) Our driver implementation is known to decrease performance for some tests, but we don't know if any apps and benchmarks (e.g. those tested by Phoronix) are affected. This disables the feature just to be safe. Set this to enable partial primitive binning: R600_DEBUG=dpbb Set this to enable full primitive binning: R600_DEBUG=dpbb,dfsm v2: add new debug options Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-09 16:20:18 +02:00
Marek Olšák	3784ce9782	radeonsi: enumerize DBG flags Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-09 16:20:16 +02:00
Marek Olšák	5a47abb63e	radeonsi: don't change viewport for blits, use window-space positions The viewport state was an identity anyway. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-07 18:26:35 +02:00
Marek Olšák	76ef08f6ee	radeonsi: set correct PA_SC_VPORT_ZMIN/ZMAX when viewport is disabled Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-07 18:26:35 +02:00
Marek Olšák	13b6c1c031	radeonsi: minor cleanup of si_update_vs_writes_viewport_index Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-07 18:26:35 +02:00
Marek Olšák	5f566faa46	radeonsi: don't save and restore vertex buffers and elements for u_blitter Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-07 18:26:35 +02:00
Marek Olšák	69ccb9dae7	radeonsi: use new VS blit shaders (VS inputs in SGPRs) Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-07 18:26:35 +02:00
Marek Olšák	6a8401a94e	radeonsi: add VS blit shader creation no users yet Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-07 18:26:35 +02:00
Marek Olšák	f3fe6afba8	radeonsi: split declare_default_desc_pointers Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-07 18:26:35 +02:00
Marek Olšák	0a3b5a0232	gallium/u_blitter: let drivers decide which VS to use for draw_rectangle This approach allows drivers to set their own vertex shader and skip compilation of u_blitter vertex shaders. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-07 18:26:35 +02:00
Marek Olšák	a46bcf0a77	gallium/u_blitter: let drivers set the vertex elements state radeonsi won't set it. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-07 18:26:35 +02:00
Marek Olšák	f84a63bc00	radeonsi: don't use util_draw_arrays_instanced in si_draw_rectangle Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-07 18:26:35 +02:00
Marek Olšák	387590accb	radeonsi: move si_draw_rectangle into si_state_draw.c Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-07 18:26:35 +02:00
Marek Olšák	de810f8b84	radeonsi: remove wrappers si_decompress_xx_textures Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-07 18:26:35 +02:00
Marek Olšák	efd72b31cb	gallium/radeon: remove r600_atom::num_dw Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-07 18:26:35 +02:00
Marek Olšák	c4d1a199f8	radeonsi: add a drirc workaround for HTILE corruption in ARK: Survival Evolved v2: use DB_META \| PS_PARTIAL_FLUSH Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102955 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (v1) Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> (v1)	2017-10-06 02:56:11 +02:00
Marek Olšák	15d918e46f	radeonsi: inline struct si_sampler_views Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-06 02:56:11 +02:00
Marek Olšák	23cdde5138	radeonsi: rename si_textures_info -> si_samplers, si_images_info -> si_images Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-06 02:56:11 +02:00
Marek Olšák	3dfb375446	radeonsi: fold needs_*_decompress_mask update into si_set_sampler_view Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-06 02:56:11 +02:00
Marek Olšák	bd5509d0a8	radeonsi: simplify a loop in si_update_fb_dirtiness_after_rendering Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-06 02:56:11 +02:00
Marek Olšák	cceb916456	radeonsi: use f32_0 and f32_1 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-06 02:56:11 +02:00
Marek Olšák	1516059ab1	radeonsi: fold *gallivm Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-06 02:56:11 +02:00
Marek Olšák	e1b83c67da	radeonsi: lp_type::length is always 1 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-06 02:56:11 +02:00
Marek Olšák	906ee3a3ba	radeonsi: don't use bld.elem_type Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-06 02:56:11 +02:00
Marek Olšák	723a23905f	radeonsi: don't use lp_build_const_* Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-06 02:56:11 +02:00
Marek Olšák	b4600b4740	radeonsi: use ctx->ac.context and ctx->types Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-06 02:56:11 +02:00
Marek Olšák	d0751f6c1f	radeonsi: use ctx->ac.builder Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-06 02:56:11 +02:00
Marek Olšák	82dc72c8bd	radeonsi: use ctx->i/f32 types more Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-06 02:56:11 +02:00
Marek Olšák	dcbd3d470c	radeonsi: use i32_0 and i32_1 more Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-06 02:56:11 +02:00
Marek Olšák	bacdf5a928	radeonsi: use bitcast in a few places Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-06 02:56:11 +02:00
Marek Olšák	ad7305aa96	radeonsi: use ac helpers for bitcasts Reviewed-by: Nicolai Hähnle <nicolai.haehnle at amd.com>	2017-10-06 02:56:11 +02:00
Marek Olšák	dbe16d7537	radeonsi: implement PIPE_CAP_TGSI_ANY_REG_AS_ADDRESS Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-06 02:56:11 +02:00
Marek Olšák	44993bd26f	radeonsi: use si_get_indirect_index for TEMP indexing Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-06 02:56:11 +02:00
Marek Olšák	e986a16c16	radeonsi: use si_get_indirect_index for CONST indexing Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-06 02:56:11 +02:00
Marek Olšák	41b85158ab	gallium: add PIPE_CAP_TGSI_ANY_REG_AS_ADDRESS Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-06 02:56:11 +02:00
Marek Olšák	be3ab867bd	tgsi: implement tgsi_util_get_inst_usage_mask properly All opcodes are handled. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-10-06 02:56:11 +02:00
Matt Turner	3a8a5e77e8	gallium: Remove util_format_s3tc_enabled Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-10-02 19:41:22 -07:00
Nicolai Hähnle	146c2b7c28	radeonsi: adjust clip discard based on line width / point size Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-10-02 15:07:45 +02:00
Nicolai Hähnle	63680471f9	radeonsi: remove si_context::{scissor_enabled,clip_halfz} They are just copies of the rasterizer state. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-10-02 15:07:45 +02:00
Nicolai Hähnle	12f3155e28	radeonsi: simplify the signature of si_update_vs_writes_viewport_index Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-10-02 15:07:45 +02:00
Nicolai Hähnle	7bbcb6ac6c	radeonsi: move current_rast_prim into si_context v2: rebase fixes Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-10-02 15:07:45 +02:00
Nicolai Hähnle	6b416ec3d6	radeonsi: move and rename scissor and viewport state and functions v2: change GET_MAX_SCISSOR to SI_MAX_SCISSOR Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-10-02 15:07:45 +02:00
Nicolai Hähnle	449ac258d1	radeonsi: remove si_apply_scissor_bug_workaround It only affects pre-SI chips. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-10-02 15:07:44 +02:00
Nicolai Hähnle	c955f45946	radeonsi: move r600_viewport.c to si_viewport.c This is purely a file-move + #include fixup + build system changes. Other cleanups will follow in subsequent commits. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-10-02 15:07:44 +02:00
Nicolai Hähnle	30e37289ea	radeonsi: fix maximum advertised point size / line width The hardware registers store the half-size/width in 12.4 fixed point format, so 8192 is the maximum. Fixes dEQP-GLES3.functional.rasterization.* Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-10-02 15:07:44 +02:00
Nicolai Hähnle	a3fa3b2e02	radeonsi: deduce rast_prim correctly for tessellation point mode Together with the previous patches, this fixes dEQP-GLES31.functional.primitive_bounding_box.wide_points.* Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-10-02 15:07:44 +02:00
Nicolai Hähnle	4d74432dd3	radeonsi: don't discard points and lines This is a bit conservative, but a more precise solution requires access to the rasterizer state. This is something to tackle after the fork between r600 and radeonsi. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-10-02 15:07:44 +02:00
Nicolai Hähnle	f86a112b07	radeonsi: move current_rast_prim to r600_common_context We'll use it in the scissors / clip / guardband state. v2: avoid a performance regression on r600 when applied to (pre-fork) stable branches Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-10-02 15:07:43 +02:00
Nicolai Hähnle	6d23f7c65d	radeonsi: fix a regression in integer cube map handling A recent commit fixed the case of 8888 integer cube maps, which need the workaround of replacing the data format with USCALED/SSCALED. However, this broke the case of non-8888 integer cube maps; those still need the fix of shifting the texture coordinates. Fixes KHR-GL45.texture_gather.plain-gather-int-cube-array and similar. Fixes: `6fb0c1013b` ("radeonsi: workaround for gather4 on integer cube maps") Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-10-02 12:17:15 +02:00
Nicolai Hähnle	052b974fed	amd/common: move ac_build_phi from radeonsi Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-10-02 12:17:15 +02:00
Marek Olšák	da3cf0e206	radeonsi: don't use the template keyword for C++ editors Reviewed-by: Brian Paul <brianp@vmware.com>	2017-09-30 19:03:07 +02:00
Benedikt Schemmer	3797a82e78	radeonsi/uvd: clean up si_video_buffer_create V2: remove code duplication and one unnecessary variable, minor whitespace fix Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2017-09-30 19:03:07 +02:00
Marek Olšák	e9cf64a67c	radeonsi/uvd: fix planar formats broken since `f70f6baaa3` Tested-by: Benedikt Schemmer <ben@besd.de> Reviewed-by: Christian König <christian.koenig@amd.com>	2017-09-30 19:03:07 +02:00
Nicolai Hähnle	d190bfc1ad	radeonsi: emit DLDEXP and DFRACEXP TGSI opcodes Note: this causes spurious regressions in some current piglit tests, because the tests incorrectly assume that there is no denorm support for doubles. I'm going to send out a fix for those tests as well. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2017-09-29 12:08:07 +02:00
Nicolai Hähnle	061303e4fd	radeonsi: emit LDEXP opcode The LLVM intrinsic has existed for a long time. The current name was established in LLVM 3.9. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2017-09-29 12:08:04 +02:00
Nicolai Hähnle	cad959d901	gallium: add LDEXP TGSI instruction and corresponding cap Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2017-09-29 12:08:01 +02:00
Nicolai Hähnle	2b0bfc51de	tgsi: infer that dst[1] of DFRACEXP is an integer Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2017-09-29 12:07:59 +02:00
Nicolai Hähnle	7af64b4d4a	gallivm: add dst register index to lp_build_tgsi_context::emit_store Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2017-09-29 12:07:55 +02:00
Nicolai Hähnle	797dd12c7b	radeonsi: fix border color translation for integer textures This fixes the extremely unlikely case that an application uses 0x80000000 or 0x3f800000 as border color for an integer texture and helps in the also, but perhaps slightly less, unlikely case that 1 is used as a border color. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2017-09-29 11:45:08 +02:00
Nicolai Hähnle	6eb9483912	radeonsi: clamp border colors for upgraded depth textures The hardware does this automatically for unorm formats, but we need to do it manually for unorm depth formats that have been upgraded to Z32_FLOAT. Fixes dEQP-GLES31.functional.texture.border_clamp.range_clamp.nearest_unorm_depth and others. Fixes: `d4d9ec55c5` ("radeonsi: implement TC-compatible HTILE") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2017-09-29 11:45:05 +02:00
Nicolai Hähnle	4c56e07029	radeonsi: clamp depth comparison value only for fixed point formats The hardware usually does this automatically. However, we upgrade depth to Z32_FLOAT to enable TC-compatible HTILE, which means the hardware no longer clamps the comparison value for us. The only way to tell in the shader whether a clamp is required seems to be to communicate an additional bit in the descriptor table. While VI has some unused bits in the resource descriptor, those bits have unfortunately all been used in gfx9. So we use an unused bit in the sampler state instead. Fixes dEQP-GLES3.functional.texture.shadow.2d.linear.equal_depth_component32f and many other tests in dEQP-GLES3.functional.texture.shadow.* Fixes: `d4d9ec55c5` ("radeonsi: implement TC-compatible HTILE") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2017-09-29 11:44:50 +02:00
Nicolai Hähnle	7dfa891f32	radeonsi/gfx9: fix geometry shaders without output vertices Not that those are super common or useful, but hey! Fun corner cases of the API... Fixes dEQP-GLES31.functional.geometry_shading.emit.* Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2017-09-29 11:43:09 +02:00
Nicolai Hähnle	4ed419328d	radeonsi: move descriptor logs to after corresponding draw/compute packet It has to happen after descriptor uploads since otherwise we'll print out the wrong GPU list / incorrectly claim descriptor corruption. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-09-29 11:37:06 +02:00
Nicolai Hähnle	9ddc6e16a9	amd/common: remove ac_shader_abi::chip_class Redundant with the recently added ac_llvm_context::chip_class. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-09-29 11:37:03 +02:00
Samuel Pitoiset	3ab0cff32c	radeonsi: remove useless check in si_blit_decompress_color() That's unnecessary to double-check that dcc_offset is not 0 because all callers already check that. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-09-27 09:31:24 +02:00
Marek Olšák	06bfb2d28f	r600: fork and import gallium/radeon This marks the end of code sharing between r600 and radeonsi. It's getting difficult to work on radeonsi without breaking r600. A lot of functions had to be renamed to prevent linker conflicts. There are also minor cleanups. Acked-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-09-26 04:21:14 +02:00
Jan Vesely	9c87150618	gallium: Add PIPE_SHADER_CAP_INT64_ATOMICS Denotes availability of 64bit int atomic instructions Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-09-21 11:18:17 -04:00
Nicolai Hähnle	704ddbcdf6	radeonsi: set MIP_POINT_PRECLAMP to 0 This fixes a bug with nearest ("point") mip selection when the fractional part of max_lod is in (0.5,1). In this case, the spec mandates that we still select the mip level ceil(max_lod) in the clamping case. However, MIP_POINT_PRECLAMP will clamp before the mip selection, which is wrong. Supposedly this setting was originally copied from the closed Vulkan driver, but as far as I can tell, closed Vulkan was actually changed back recently :) Fixes dEQP-GLES3.functional.texture.mipmap.2d.max_lod.{nearest,linear}_nearest Fixes: `f7420ef5b4` ("radeonsi: enable some sampler fields to match the closed driver") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-09-20 15:43:13 +02:00
Nicolai Hähnle	87f7c7bd65	radeonsi: fix array textures layer coordinate Like for cube map (array) gather, we need to round to nearest on <= VI. Fixes tests in dEQP-GLES3.functional.shaders.texture_functions.texture.* Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-09-20 15:42:58 +02:00
Jan Vesely	7b2c5547c3	gallium: Add PIPE_SHADER_CAP_FP16 Denotes native half precision float operations capability v2: PIPE_CAP_HALFS -> PIPE_SHADER_CAP_FP16 fix indentation Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-09-18 10:45:02 -04:00
Nicolai Hähnle	7a62f8621a	radeonsi: allow out-of-order rasterization in commutative blending cases We do not enable this by default for additive blending, since it slightly breaks OpenGL invariance guarantees due to non-determinism. Still, there may be some applications that can benefit from white-listing via the radeonsi_commutative_blend_add drirc setting without any real visible artifacts. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2017-09-18 11:25:20 +02:00
Nicolai Hähnle	8c56c45cd4	radeonsi: add drirc option "radeonsi_assume_no_z_fights" This option enables a performance optimization where typical non-blending draws with depth buffer may be rasterized out-of-order (on VI+, multi-SE chips). This optimization can lead to incorrect results when an application renders multiple objects with the same Z value at the same pixel, so we will never enable it by default. But there may be applications that could benefit from white-listing. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2017-09-18 11:25:19 +02:00
Nicolai Hähnle	aab134cfa5	radeonsi: enable out-of-order rasterization when possible on VI and GFX9 dGPUs This does not take commutative blending into account yet. R600_DEBUG=nooutoforder disables it. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2017-09-18 11:25:19 +02:00
Nicolai Hähnle	66d03d0e3e	gallium/radeon: pass old_(perfect_)enable to set_occlusion_query_state The callee can derive the current enable state itself. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2017-09-18 11:25:19 +02:00
Nicolai Hähnle	6772452e4c	amd/common: remove has_ds_bpermute argument from ac_build_ddxy Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-09-18 11:25:18 +02:00
Nicolai Hähnle	3db86d86ed	amd/common: add chip_class to ac_llvm_context Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-09-18 11:25:18 +02:00
Nicolai Hähnle	e0af3bed2c	amd/common: round cube array slice in ac_prepare_cube_coords The NIR-to-LLVM pass already does this; now the same fix covers radeonsi as well. Fixes various tests of dEQP-GLES31.functional.texture.filtering.cube_array.combinations.* Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-09-18 11:25:18 +02:00
Nicolai Hähnle	6fb0c1013b	radeonsi: workaround for gather4 on integer cube maps This is the same workaround that radv already applied in commit `3ece76f03d` ("radv/ac: gather4 cube workaround integer"). Fixes dEQP-GLES31.functional.texture.gather.basic.cube.rgba8i/ui.* Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-09-18 11:25:17 +02:00
Timothy Arceri	a70a401f52	radeonsi: enable STD430 packing of UBOs by default Before this change we were defaulting to STD140 which is slightly less efficient at packing arrays. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-09-15 11:42:55 +10:00
Timothy Arceri	c96e45ebf0	gallium: introduce PIPE_CAP_LOAD_CONSTBUF Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-09-15 11:42:55 +10:00
Timothy Arceri	b4401cc104	radeonsi: make use of LOAD for UBOs v2: always set can_speculate and allow_smem to true Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-09-15 11:42:55 +10:00
Samuel Pitoiset	f0d09d9012	radeonsi: move si_get_wave_info() to AMD common code This will allow us to use it from radv. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-09-14 10:37:57 +02:00
Denis Pauk	74d2456491	gallium/{r600, radeonsi}: Fix segfault with color format (v2) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102552 v2: Patch cleanup proposed by Nicolai Hähnle. * deleted changes in si_translate_texformat. Cc: Nicolai Hähnle <nhaehnle@gmail.com> Cc: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2017-09-14 00:59:24 +02:00
Nicolai Hähnle	e4af4433fc	radeonsi: hard-code pixel center for interpolateAtSample without multisample buffers The GLSL rules for interpolateAtSample are unfortunate: "Returns the value of the input interpolant variable at the location of sample number sample. If multisample buffers are not available, the input variable will be evaluated at the center of the pixel. If sample sample does not exist, the position used to interpolate the input variable is undefined." This fix will fallback to monolithic shader compilation when interpolateAtSample is used without multisampling. One alternative would be to always upload 16 sample positions, filling the buffer up with repetition when the actual number of samples is less, and then ANDing the sample ID with 0xf. However, that punishes all well-behaving users of interpolateAtSample, when in reality, only conformance tests should be affected by the issue. Fixes dEQP-GLES31.functional.shaders.multisample_interpolation.interpolate_at_sample.non_multisample_buffer.* Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-09-13 18:25:45 +02:00
Nicolai Hähnle	92c4277990	radeonsi: apply a mask to gl_SampleMaskIn in the PS prolog gl_SampleMaskIn is supposed to contain set bits only for the samples that are covered by the current fragment shader invocation, but the VGPR initialization hardware loads the set of all bits that are covered at the current pixel. Fixes various tests in dEQP-GLES31.functional.shaders.sample_variables.sample_mask_in.* Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-09-13 18:25:41 +02:00
Nicolai Hähnle	8d8f1ef573	radeonsi: rename variable to clarify its meaning Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-09-13 18:24:18 +02:00
Nicolai Hähnle	48b3364b5b	radeonsi: make si_init_shader_selector_async static Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-09-13 18:24:18 +02:00
Nicolai Hähnle	7e4344151f	radeonsi: fix segfault in descriptor dumping Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-09-13 18:24:18 +02:00
Marek Olšák	6eade342eb	radeonsi: optimize TCS epilog when invocation 0 writes tess factors This removes the barrier and LDS stores and loads for tess factors when it's possible. The removal of the barrier seems more important to me though. In one shader, it removes 17 * 4 bytes from the shader binary. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-09-11 19:02:02 +02:00
Connor Abbott	b8a51c8c4b	radeonsi: move the guts of ARB_shader_group_vote emission to ac Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-09-08 04:12:49 +01:00
Connor Abbott	bd73b89792	radeonsi: move si_emit_ballot() to ac Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-09-08 04:12:42 +01:00
Connor Abbott	ac27fa7294	radeonsi: move emit_optimization_barrier() to ac Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-09-08 04:06:47 +01:00
Connor Abbott	c181d4f2b7	radeonsi: move llvm_get_type_size() to ac Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-09-08 04:04:16 +01:00
Marek Olšák	4bd2bdbb3c	ac/surface: add radeon_surf::has_stencil for convenience Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-09-07 17:59:37 +02:00
Marek Olšák	7ec64bd88c	radeonsi: don't read tcs_out_lds_layout.patch_stride from an SGPR Same as before, writing TCS outputs to LDS is rare. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-09-07 13:00:07 +02:00
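
Note on 30e37289ea ("fix maximum advertised point size / line width"): the message says the hardware registers store the half-size/width in 12.4 fixed point, so 8192 is the maximum. The arithmetic behind that bound can be sketched as below. This is only an illustration, not driver code: it assumes an unsigned 16-bit field (12 integer bits plus 4 fractional bits) with saturation, and the helper names `to_fixed_12_4` and `from_fixed_12_4` are hypothetical.

```python
# Sketch of the 12.4 fixed-point bound, assuming an unsigned 16-bit field.
# The field layout and saturating behaviour are assumptions for illustration.

FRAC_BITS = 4    # fractional bits in a 12.4 value
TOTAL_BITS = 16  # 12 integer bits + 4 fractional bits
MAX_RAW = (1 << TOTAL_BITS) - 1


def to_fixed_12_4(value: float) -> int:
    """Encode a non-negative value as 12.4 fixed point, saturating at the field limit."""
    raw = int(round(value * (1 << FRAC_BITS)))
    return max(0, min(raw, MAX_RAW))


def from_fixed_12_4(raw: int) -> float:
    """Decode a raw 12.4 fixed-point value back to a float."""
    return raw / (1 << FRAC_BITS)


# Largest half-size the field can hold, and the full size it corresponds to.
max_half_size = from_fixed_12_4(MAX_RAW)
max_full_size = 2 * max_half_size

if __name__ == "__main__":
    print("max half-size:", max_half_size)
    print("max full size:", max_full_size)
```

Under these assumptions the largest encodable half-size is just below 4096, so the full point size or line width is bounded by 2 * 4096 = 8192; a requested half-size of 4096 or more would no longer fit in the field.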


2960 Commits