KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Qiang Yu	3ab9c42b43	radeonsi: add si_create_passthrough_tcs For replacing si_create_fixed_func_tcs. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16705>	2022-06-27 02:38:21 +00:00
Qiang Yu	74350cf057	radeonsi: support multi stage shader state creation in nir shaderlib For creating tcs passthrough shader. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16705>	2022-06-27 02:38:21 +00:00
Qiang Yu	a599576654	radeonsi: use si_shader as parameter in si_get_nir_shader Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16705>	2022-06-27 02:38:21 +00:00
Qiang Yu	05b829cd0c	radeonsi: deserialize nir binary in si_check_blend_dst_sampler_noop We can do this parse with original nir instead of shader key pass applied nir in si_get_nir_shader. This can free si_get_nir_shader to just use si_shader as parameter. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16705>	2022-06-27 02:38:21 +00:00
Jason Volk	e1488d9374	radeon: Support shared memory user pointers. The RADEON_GEM_USERPTR_ANONONLY flag is hardcoded here which excludes shared memory pages. DRM is actually capable of supporting shared file- backed memory, but only if it's read-only. This mutability intent has to be conveyed through the stack, so a flags argument is added to the winsys regime to pass RADEON_FLAG_READ_ONLY. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16115>	2022-06-22 12:23:02 +00:00
Ruijing Dong	365bf2a3b0	radeonsi/vcn: support unified queue in vcn4 - use unified queue only in vcn4 - implement signature and engine-info ib headers in vcn4 - implemented unified queue functions Reviewed-by: Boyuan Zhang <Boyuan.Zhang@amd.com> Reviewed-by: Mihai Preda <mhpreda@gmail.com> Signed-off-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16911>	2022-06-16 03:30:47 +00:00
Ruijing Dong	515112eabd	radeonsi/vcn: prepare for unified queue in vcn4 - apply unified queue ib headers to vcn4 - re-use encoding queue as unified queue - define unified queue functions and structures Reviewed-by: Boyuan Zhang <Boyuan.Zhang@amd.com> Reviewed-by: Mihai Preda <mhpreda@gmail.com> Signed-off-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16911>	2022-06-16 03:30:47 +00:00
Marek Olšák	e24354c1b2	radeonsi/gfx11: rework GDS streamout code to single-lane and enable streamout GDS is basically scalar in gfx11. This is not exactly how it's supposed to be done (we should be using the GDS_STRMOUT registers), but it works. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16990>	2022-06-15 20:52:42 +00:00
Marek Olšák	44e4d42c23	radeonsi/gfx11: add missing register shadowing code it doesn't work yet Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16990>	2022-06-15 20:52:42 +00:00
Marek Olšák	fbd68a3839	radeonsi/gfx11: drop the ES vertex count requirement Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16990>	2022-06-15 20:52:42 +00:00
Marek Olšák	99fd408946	radeonsi/gfx11: don't allocate unused wait_mem_scratch We sync using PWS instead of memory. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16990>	2022-06-15 20:52:42 +00:00
Marek Olšák	98d6a3d6c6	radeonsi/gfx11: don't use memory for waiting for cache flushes There is a new flush/wait mechanism called PixelWaitSync that uses an internal counter. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16990>	2022-06-15 20:52:42 +00:00
Marek Olšák	56d4e0be86	radeonsi/gfx11: synchronize correctly before setting SPI_ATTRIBUTE_RING_* Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16990>	2022-06-15 20:52:42 +00:00
Marek Olšák	fa25eba744	radeonsi/gfx11: allocate more space for pipeline statistics Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16990>	2022-06-15 20:52:42 +00:00
Marek Olšák	0e8beb1eed	radeonsi/gfx11: compile monolithic PS if it writes memory Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16990>	2022-06-15 20:52:42 +00:00
Yonggang Luo	a9e2c699aa	util/c11: Update function u_thread_create to be c11 conformance Do not assume thrd_t to be a pointer or integer, as the C11 standard tells us: thrd_t: implementation-defined complete object type identifying a thread At https://en.cppreference.com/w/c/thread So we always return the thread creation return code instead of thrd_t value, and judge the return code properly. Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15087>	2022-06-15 17:37:17 +00:00
Pierre-Eric Pelloux-Prayer	aa58ff191f	Revert "winsys/amdgpu: use AMDGPU_IB_FLAG_PREAMBLE for the CS preamble on gfx10+" This reverts commit `8edafaa25c`. This fixes hangs on Navi21. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17032>	2022-06-15 10:38:04 +00:00
Pierre-Eric Pelloux-Prayer	b75ef3815f	radeonsi: use helpers to access si_screen::aux_context This avoids to mistakenly use the context without locking it first. The aux_context_lock needs to become a recursive one now, since si_texture_get_handle can call si_reallocate_texture_inplace which uses resource_create which may use the aux_context too. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6666 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17032>	2022-06-15 10:38:04 +00:00
Pierre-Eric Pelloux-Prayer	bda1c081bd	radeonsi: add helper to use si_screen::aux_context This context needs to be locked before usage, and flushed after. If it's forgotten, radeonsi may crash (eg #6666). To avoid this kind of error, introduce 2 helpers. cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17032>	2022-06-15 10:38:03 +00:00
Marek Olšák	bdf3797aeb	ac,radeonsi: don't export null from PS if it has no effect on gfx10+ We just need to pass the uses_discard flag to the epilog. The hw skips the export anyway. This will hang if SPI registers declare an output format or KILL_ENABLE is set because those cases require an export with done=1. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	e4b7088779	radeonsi: allocate only 1 GDS OA counter for gfx10 NGG streamout It works with just one. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	0f48c581f9	radeonsi: allocate GDS only once per process Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	dfa8dcf80e	radeonsi: remove streamout code from shaders if no streamout buffers are bound This is an optimization using asynchronous shader compilation. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	dbbbe73d05	radeonsi: fix NGG streamout hang by allocating GDS in the right place Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	3f900df071	radeonsi: inline gfx10_emit_streamout_begin/end Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	002e34d860	radeonsi: unconditionally enable the streamout overflow query with NGG It fails some tests, but we need it for gfx11. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	0f4f98ea50	radeonsi: fix a crash in gfx10_sh_query_get_result_resource If tmp_buffer (in ssbo[1]) is NULL, setting the writable bit causes the called function to access the NULL buffer. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	fc392ff104	radeonsi: fix an NGG streamout hang with monolithic shaders ac_llvm_add_target_dep_function_attr has no effect if the function is inlined. amdgpu-gds-size determines m0 for ds_sub_u32 gds, which hangs if it's 0. This helps both gfx10 and gfx11, though it will only be used by gfx11 after we enable streamout. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	a9f7744cfe	radeonsi: rework how vs_state_bits is set and unpacked Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	c2342e6770	radeonsi: move GS_STATE bits to the end to make space at the beginning Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	c9c7dcb619	radeonsi: rename and regroup VS_STATE definitions Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	091617002f	radeonsi: rework how VS_STATE_BITS are set for VS, TES, and GS We need more GS/NGG bits, so we need to add current_gs_state for that. This simplifies the logic in the draw code. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	928e5f240d	radeonsi: simplify how pipeline statistic offsets are computed Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	57b7dcd9db	radeonsi: add BREAK_BATCH at the beginning of IBs to fix possible issues if the previous IB comes from a different app Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	eea46094ff	radeonsi: set INTERPOLATE_COMP_Z to 0 to work around an EQAA bug Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	4f3c74ddfb	radeonsi: determine DB_SHADER_CONTROL in si_shader_ps This is cleaner and more flexible. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	8e879dcedd	radeonsi: restructure PS no-export fixups Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	7cbea71aab	radeonsi: fix polygon stippling without color and Z outputs (v2) We need to handle the fact that it kills pixels. v2: also update si_update_ps_inputs_read_or_disabled Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	56359e9f6e	radeonsi: remove unused dword from wait_mem_scratch Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	8e0d34ce98	radeonsi: fix uninitialized wait_mem_scratch_tmz The initialization was dead code because it's allocated later. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	705e9af29a	radeonsi: don't use info.gs.invocations if it's not GS It's a union, which makes gs.invocations undefined for VS and TES. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	3b9cd2469e	radeonsi: print LDS size in bytes Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Marek Olšák	8edafaa25c	winsys/amdgpu: use AMDGPU_IB_FLAG_PREAMBLE for the CS preamble on gfx10+ This skips the preamble for following IBs if the queue receives IBs from the same context back-to-back. This eliminates VGT_FLUSH (for tess and legacy GS) and PS_PARTIAL_FLUSH (for gfx11) in those cases if the preamble contains them. v2: only use this on gfx10+ due to stability issues on Stoney and limited testing Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16885>	2022-06-11 11:14:16 +00:00
Pierre-Eric Pelloux-Prayer	3d37291e1c	radeonsi: prevent recursion in si_decompress_dcc This avoids u_blitter recursion: #0 util_blitter_set_running_flag #1 util_blitter_custom_color #2 si_blit_decompress_color #3 si_decompress_dcc #4 si_texture_disable_dcc #5 si_update_ps_colorbuf0_slot #6 si_bind_ps_shader #7 util_blitter_restore_fragment_states #8 util_blitter_custom_color #9 si_blit_decompress_color #10 si_decompress_dcc #11 si_sdma_copy_image #12 si_blit cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16962>	2022-06-10 17:40:18 +00:00
Pierre-Eric Pelloux-Prayer	813e60f1ea	tradeonsi: fix preamble state producing incorrect packets If the first time the preamble is written, one of the rings isn't allocated, we wouldn't write the RING_SIZE to the preamble. Later, when the preamble gets updated after the ring allocation, the new RING_SIZE packet would overwrite other packets. To prevent this, always write the RING_SIZE (the alternative would be to write NOP packets). This fix "ERROR Illegal register access in command stream" hangs I observed on GFX8. Fixes: `32c7805ccc` ("radeonsi: merge all preamble states into one") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16962>	2022-06-10 17:40:18 +00:00
Emma Anholt	cf265c6606	nir: Rename is_arb_asm to use_legacy_math_rules and document its meaning. On iris and crocus, this flag is used to set "alt mode" math on the shader as a whole. Some other drivers have a similar mode for DX9/ARB-program behavior, so document what it does so we can start using it. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16176>	2022-06-10 03:26:32 +00:00
Marek Olšák	b7cb4d4f6f	radeonsi: set the max UBO size same as the max SSBO size Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16901>	2022-06-08 10:23:20 +00:00
Marek Olšák	b750844319	radeonsi: compute PIPE_CAP_MAX_TEXEL_BUFFER_ELEMENTS_UINT correctly Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16901>	2022-06-08 10:23:20 +00:00
Marek Olšák	aee8ee17a5	radeonsi: change max TBO/SSBO sizes again and rework max alloc size Allow 1/4 of the max heap size, but maximum of 512 MB on 32-bit architectures. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16901>	2022-06-08 10:23:20 +00:00
Marek Olšák	c1adb33a93	radeonsi: clamp against MAX_TEXEL_BUFFER_ELEMENTS correctly Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16901>	2022-06-08 10:23:20 +00:00
Marek Olšák	91e533c6aa	radeonsi: report correct maximum compute grid sizes Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16901>	2022-06-08 10:23:20 +00:00
Marek Olšák	ecda7be628	radeonsi: increase the max compute LDS size to 64KB for gfx7+ Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16901>	2022-06-08 10:23:19 +00:00
Pierre-Eric Pelloux-Prayer	b81f05e94d	radeonsi: set size in si_texture_get_handle Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6507 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6491 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16813>	2022-06-08 09:42:47 +02:00
Marek Olšák	ad8f9d5d58	gallium: rename PIPE_CAP_MAX_SHADER_BUFFER_SIZE -> *_UINT to imply the maximum of 4GB - 1. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16881>	2022-06-07 00:17:58 -04:00
Marek Olšák	fd6b8999d7	gallium: rename PIPE_CAP_MAX_TEXTURE_BUFFER_SIZE->MAX_TEXEL_BUFFER_ELEMENTS_UINT to allow exposing 4G - 1. The "SIZE" was also a misnomer because it meant elements. This no longer clamps the size to INT_MAX in st/mesa. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16881>	2022-06-07 00:17:58 -04:00
Marek Olšák	406cf871b2	gallium: rename PIPE_SHADER_CAP_MAX_CONST_BUFFER_SIZE to _BUFFER0_ UBOs will use a larger limit. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16881>	2022-06-07 00:17:57 -04:00
Emma Anholt	8c4b88ee48	gallium+glsl: Remove EmitNoSat/PIPE_CAP_VERTEX_SHADER_SATURATE The drivers not setting it were: - nv30, which gets lowering using NIR's lower_fsat flag. - r300, which gets lowering using NIR's lower_fsat flag. - a2xx, which has was getting it optimized back to fsat anyway. This drops the check for the cap from gallium nine. While nine does have a non-nir path, I think it's safe to assume that if you have SM3 texturing, you can do fsat. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16823>	2022-06-07 02:38:42 +00:00
Qiang Yu	61c500ee9b	radeonsi: replace llvm ls/hs interface lds ops with nir lowered ones Use ac nir lower pass to generate these lds load/store ops explicitly. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16418>	2022-06-07 01:40:14 +00:00
Qiang Yu	87dfff3e6b	radeonsi: add tcs_vgpr_only_inputs parameter to si_get_nir_shader Will be used later. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16418>	2022-06-07 01:40:14 +00:00
Qiang Yu	47dd3525fb	radeonsi: implement load_lshs_vertex_stride abi Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16418>	2022-06-07 01:40:14 +00:00
Timothy Arceri	26ff49038c	gallium: remove PIPE_SHADER_CAP_MAX_UNROLL_ITERATIONS_HINT CAP This is used for the old, buggy and slow GLSL IR loop unrolling code. All drivers have now switched to the NIR unrolling code so here we remove the CAP. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16366>	2022-06-04 16:11:49 +00:00
SureshGuttula	5d0f922834	Revert "radeon: hardcode uvd/vce encoder not_referenced value to false" This reverts commit `96b276b327`. This patch enable SVC encoding support on VCE/UVD. Signed-off-by: Suresh Guttula <suresh.guttula@amd.com> Reviewed-by: Thong Thai <thong.thai@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16768>	2022-06-03 17:46:28 +00:00
Erik Faye-Lund	69d55f42b6	radeonsi: port amdgcn_glslc build to meson Seems nice to reduce the number of old-fashioned build systems we have in-tree. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16789>	2022-06-02 08:54:08 +00:00
Daniel Schürmann	bd151a256e	nir/opt_vectorize: add callback for max vectorization width The callback allows to request different vectorization factors per instruction depending on e.g. bitsize or opcode. This patch also removes using the vectorize_vec2_16bit option from nir_opt_vectorize(). Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13080>	2022-06-01 11:41:44 +00:00
Emma Anholt	7472bb4bad	glsl,nir: Move i/umulExtended lowering to NIR. NIR already has the necessary lowering, and the GLSL lowering violates GLSL IR validation rules. Once quadop lowering was turned off, the IR validation at the end of the compile path on DEBUG builds caught the problem. In order to move the lowering to NIR, though, we need to make sure that drivers supporting these functions actually have the lowering flag set. xfails added for t860, where apparently this tickles a variety of existing 64-bit bugs in the backend. Fixes: #6461 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Mykhailo Skorokhodov <mykhailo.skorokhodov@globallogic.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16437>	2022-06-01 10:56:35 +00:00
Timothy Arceri	abe4536c51	ci: uprev piglit 2022-05-31 Also document additional piglit failures and passes. Multiple changes, mostly notable: - few new tests - fixed test for upcoming mesa MR Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16785>	2022-06-01 03:14:29 +00:00
Jason Ekstrand	2a22885a45	st,nir: Use nir_shader::xfb_info in nir_lower_io_passes Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Pierre-Eric Pelloux-Prayer	dad36b5f12	radeonsi: enable use_waterfall_for_divergent_tex_samplers And run the nir_divergence_analysis pass in si_get_nir_shader to make sure it's up to date. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2253 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16709>	2022-05-31 13:08:07 +00:00
SureshGuttula	f2e3646321	Revert "radeonsi: Set display_remote for non-refernced frames" This reverts commit `ef76b83633`. Reason for revert: This only helps in using I MBs.To further fix in dpb , reverting this. Fix added : https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16745 Signed-off-by: SureshGuttula <suresh.guttula@amd.com> Reviewed-by: Thong Thai <thong.thai@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16744>	2022-05-29 00:07:47 +00:00
SureshGuttula	77a6feff89	radeonsi/vcn : update enc->dpb ref_use for index 0 Currently dpb_enc referneces not updated properly when index 0, as we are skipping clearing that ref. This patch will fix this for index 0. So that when ever we set non_referenced flag, that is not used as ref and not pushed to DPB. This is helping in SVC encoding. Signed-off-by: SureshGuttula <suresh.guttula@amd.com> Reviewed-by: Thong Thai <thong.thai@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16745>	2022-05-28 15:52:53 +00:00
David Heidelberg	0a9461caf5	ci/radeonsi: add RoR and Nheko traces Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16633>	2022-05-27 06:51:38 +00:00
SureshGuttula	ef76b83633	radeonsi: Set display_remote for non-refernced frames When we do SVC temporal encoding, we see output bitsream is not proper. To fix this , incase of SVC passing session init varaible display_remote as enable. Signed-off-by: SureshGuttula <suresh.guttula@amd.com> Reviewed-by: Thong Thai <thong.thai@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16468>	2022-05-26 12:26:53 +00:00
Vlad Zahorodnii	e20718e8fa	radeonsi: Add support for EGL_IMG_context_priority This allows creating high priority contexts when using radeonsi. It's primarily intended for apps whose rendering commands must be processed as soon as possible, e.g. wayland compositors. Signed-off-by: Vlad Zahorodnii <vlad.zahorodnii@kde.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16594>	2022-05-25 14:15:30 +00:00
Vlad Zahorodnii	f4de4453cf	winsys/amdgpu-radeon: Allow specifying context priority This is needed to implement EGL_IMG_context_priority in radeonsi. Signed-off-by: Vlad Zahorodnii <vlad.zahorodnii@kde.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16594>	2022-05-25 14:15:30 +00:00
Pierre-Eric Pelloux-Prayer	ef950d370a	radeonsi: don't use sel->nir in si_check_blend_dst_sampler_noop We don't want to modify sel->nir so force the use of the serialized version of the shader. Waiting on sel->ready guarantees that sel->nir will be NULL and that si_get_nir_shader will use sel->nir_binary. Fixes: `b78a38bd02` ("radeonsi: use si_nir_is_output_const_if_tex_is_const") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6415 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16587>	2022-05-25 12:03:34 +00:00
Pierre-Eric Pelloux-Prayer	e87135c552	radeonsi/tests: use a smaller tests-per-group value Faster glcts runs (44 -> 34 sec). Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16580>	2022-05-20 09:57:14 +02:00
Pierre-Eric Pelloux-Prayer	c2892b811a	radeonsi/tests: add a --slow option Some glcts tests implement tons of tests because they verify every possible combination of format/swizzle/target/... They take a long time to execute and aren't possible to run using multiple processes. The proper way to fix it would be to split them in vk-gl-cts, as is already done for some of them (eg es31fTextureGatherTests.cpp). In the meantime, not running them makes glcts run almost 10 times faster. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16580>	2022-05-20 09:57:05 +02:00
Qiang Yu	cc4d5b1666	radeonsi: lower nir_intrinsic_sparse_residency_code_and This is required by lower_tg4_offsets which split one sparseTextureGatherOffsetsARB call to four sparseTextureGatherOffsetARB calls and merge their resisident results into one. Fixes: `ee040a6b63` ("radeonsi: enable ARB_sparse_texture2") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16599>	2022-05-20 01:45:12 +00:00
Pierre-Eric Pelloux-Prayer	cf9ee6d432	radeonsi: wait for PS idle in si_set_framebuffer_state This is needed to avoid write-after-read hazards in texture -> render transitions. This fixes fbo-depth tests that were flaky on GPUs (at least sienna_cichlid and vega20). Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16561>	2022-05-19 12:22:11 +00:00
Marek Olšák	2443054932	amd: rename fishes to Navi21, Navi22, Navi23, Navi24, and Rembrandt Reviewed-by: Mihai Preda <mhpreda@gmail.com> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Martin Roukala <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16604>	2022-05-19 11:55:50 +00:00
Indrajit Kumar Das	03bc7503d4	radeonsi: save the fs constant buffer to the util blitter context Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15318>	2022-05-19 11:18:30 +00:00
Pierre-Eric Pelloux-Prayer	74a172a448	radeonsi: fix glTexBuffer max size handling The spec says the number of texels must be clamped to the value of GL_MAX_TEXTURE_BUFFER_SIZE. Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16480>	2022-05-18 10:36:01 +02:00
Sil Vilerino	d2871e40e0	gallium radeon/r600/omx/va: Adds support for multiple reference encoding gallium: pipe_h264_enc_picture_desc: ref_idx_lx to ref_idx_lx_list[32], add num_ref_idx_lx_active_minus1 gallium radeon/r600: Change usage of ref_idx_lx to ref_idx_lx_list gallium omx: Fill out ref_idx_lx_list, num_ref_idx_lx_active_minus1 gallium va: Add support for multiple references encoding Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Boyuan Zhang <Boyuan.Zhang@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16286>	2022-05-17 21:02:25 +00:00
Sil Vilerino	15540abf22	gallium/va/radeonsi: Using private as a parameter name conflicts with C++ keywords Reviewed-by: Leo Liu <leo.liu@amd.com> Acked-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16286>	2022-05-17 21:02:25 +00:00
Marek Olšák	ad50daa982	radeonsi: fix resource_copy_region with ETC formats (e.g. for Stoney) Only Stoney, Vega10, Raven, and Raven2 support ETC. Fixed tests: dEQP-GLES31.functional.copy_image.mixed.viewclass_64_bits_mixed.r11_eac_rgba16i.texture2d_to_texture2d dEQP-GLES31.functional.copy_image.mixed.viewclass_64_bits_mixed.r11_eac_rgba16ui.texture2d_to_texture2d dEQP-GLES31.functional.copy_image.mixed.viewclass_64_bits_mixed.signed_r11_eac_rgba16i.texture2d_to_texture2d dEQP-GLES31.functional.copy_image.mixed.viewclass_64_bits_mixed.signed_r11_eac_rgba16ui.texture2d_to_texture2d Fixes: `cf1e562fdd` - radeonsi: remove compressed and subsampled gfx copy from resource_copy_region Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6431 Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16491>	2022-05-17 11:26:25 +00:00
Marek Olšák	1fdc3b0fde	radeonsi: move CS preamble emission into the winsys The preamble will be skipped by the kernel if there is no context switch. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16509>	2022-05-17 10:27:04 +00:00
Marek Olšák	32c7805ccc	radeonsi: merge all preamble states into one Tess registers are appended. GS registers are appended or overwritten if they are already set. There are separate TMZ and non-TMZ preambles. The preamble will be passed to the kernel as an IB to execute on a context switch only. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16509>	2022-05-17 10:27:04 +00:00
Marek Olšák	f46cd73e29	radeonsi/gfx11: optimize attribute stores Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16509>	2022-05-17 10:27:04 +00:00
Marek Olšák	9b20120d57	radeonsi/gfx11: fix VM faults due to the attribute ring Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16509>	2022-05-17 10:27:04 +00:00
Marek Olšák	c74d854348	Revert "radeonsi/gfx11: limit MSAA color buffers to the RGBA channel order" This reverts commit `54d85700a1`. It's an LLVM bug. If you disable AMDGPUImageIntrinsicOptimizer in LLVM, MSAA is fixed. There is no LLVM command line option to disable it from Mesa. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16509>	2022-05-17 10:27:04 +00:00
Marek Olšák	a529e4f7ad	radeonsi/gfx11: fix the value of VGT_GS_OUT_PRIM_TYPE at the beginning of IBs Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16509>	2022-05-17 10:27:04 +00:00
Marek Olšák	a8d2ef8bd6	radeonsi/gfx11: don't insert shader code for GS_PIPELINE_STATS_EMU GS_PIPELINE_STATS_EMU is always false, so the branches were never entered. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16509>	2022-05-17 10:27:04 +00:00
Marek Olšák	fcaa9f5096	radeonsi/gfx11: fix alpha-to-coverage with stencil or samplemask export We can't use UINT16_ABGR for the alpha channel. Always use 32_ABGR. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16509>	2022-05-17 10:27:04 +00:00
Marek Olšák	af880e591e	radeonsi: remove GFX9_MERGED_NUM_USER_SGPR definition Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16509>	2022-05-17 10:27:04 +00:00
Dave Airlie	8198900071	ac/radv: drop info pointer from the ac and radv shader structs This was being used for one bool, just pass the bool. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16521>	2022-05-17 06:15:25 +00:00
Marek Olšák	3382af7f6a	radeonsi/gfx11: set BIG_PAGE for the attribute ring Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16466>	2022-05-16 07:03:41 -04:00
Marek Olšák	8a2f151ef8	radeonsi: print an error when failing to create a context Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16466>	2022-05-16 07:03:41 -04:00
Marek Olšák	6515b3b2dc	radeonsi: fix a crash when failing to create a context When shader_query_buffers is NULL, the code treated as as non-empty. Fixes: `792a638b03` "radeonsi/gfx10: implement streamout-related queries" Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16466>	2022-05-16 07:03:40 -04:00
Marek Olšák	0755d02456	radeonsi: use AMDGPU_VM_PAGE_NOALLOC to disable MALL (infinity cache) Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16466>	2022-05-16 07:03:39 -04:00
Marek Olšák	e9e9086b66	radeonsi: use the new flag AMDGPU_GEM_CREATE_DISCARDABLE It forces the best placement (usually VRAM) and evictions discard the contents instead of copying. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16466>	2022-05-16 07:03:39 -04:00

1 2 3 4 5 ...

6122 Commits