KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Marek Olšák	9bd7928a35	radeonsi: add an option for debugging VM faults Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-10-03 22:06:07 +02:00
Marek Olšák	a9971e85d9	radeonsi: rework uploading border colors The border colors are uploaded only once when the state is created. This brings truly immutable sampler descriptors, because they don't have to be updated every time a sampler state is re-bound. It also moves the TA_BC_BASE_ADDR registers to init_config, removing one more state. The catch is there is now a limit: only 4096 border colors can be used by one context. I don't think that will be a problem. Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2015-09-01 21:51:15 +02:00
Marek Olšák	228e80123a	radeonsi: reorder si_context variables Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2015-09-01 21:51:15 +02:00
Marek Olšák	28b34b474e	radeonsi: don't send IB dword usage to si_need_cs_space Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2015-09-01 21:51:15 +02:00
Marek Olšák	ec9d5e181e	radeonsi: don't count IB space for states, just use an upper bound Since we don't put any resource descriptors in IBs, the space used by draw calls is quite small. Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2015-09-01 21:51:15 +02:00
Marek Olšák	fc95058add	radeonsi: convert SPI state to an atom Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2015-09-01 21:51:15 +02:00
Marek Olšák	45e549fcbc	radeonsi: convert CB_TARGET_MASK setup to an atom Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2015-09-01 21:51:14 +02:00
Marek Olšák	e21418f221	radeonsi: convert stencil ref state into an atom Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2015-09-01 21:51:14 +02:00
Marek Olšák	c44de30979	radeonsi: convert blend color state into an atom Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2015-09-01 21:51:14 +02:00
Marek Olšák	74aa64876b	radeonsi: convert sample mask state into an atom Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2015-09-01 21:51:14 +02:00
Marek Olšák	12b205341a	radeonsi: convert clip state into an atom Reducing calloc overhead. Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2015-09-01 21:51:14 +02:00
Marek Olšák	0c2eed0ede	radeonsi: avoid redundant CB and DB register updates The main idea is to avoid setting CB_COLORi_INFO = 0 for i>0 repeatedly when those colorbuffers aren't used. This is mainly for glamor. Same for DB. Z_INFO and STENCIL_INFO need to be cleared only once. Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2015-09-01 21:51:14 +02:00
Marek Olšák	c2a42d1f9f	radeonsi: don't rebind GSVS ring buffers every draw call using GS Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2015-09-01 21:51:14 +02:00
Marek Olšák	a2c6ae07b4	radeonsi: remove the tf_ring state, add the registers to init_config One less state to worry about. Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2015-09-01 21:51:14 +02:00
Marek Olšák	0d46c3bc9d	radeonsi: remove the gs_rings state, add the registers to init_config Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2015-09-01 21:51:14 +02:00
Marek Olšák	87c1e9e19c	radeonsi: use a bitmask for tracking dirty atoms This mainly removes the cache misses when checking the dirty flags. Not much else though. Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2015-09-01 21:51:14 +02:00
Marek Olšák	ba7a6cf626	radeonsi: define the state atom array separately Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2015-09-01 21:51:13 +02:00
Marek Olšák	8a97528b3a	radeonsi: optimize viewport states same as scissors Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2015-09-01 21:51:13 +02:00
Marek Olšák	f6a10f60b7	radeonsi: optimize scissor states - convert 16 states to 1 atom - only emit 1 scissor if VIEWPORT_INDEX isn't written - use only one packet when emitting consecutive scissors Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com>	2015-09-01 21:51:13 +02:00
Marek Olšák	2c14a6d3b1	radeonsi: add IB tracing support for debug contexts This adds trace points to all IBs and the parser prints them and also prints which trace points were reached (executed) by the CP. This can help pinpoint a problematic packet, draw call, etc. Acked-by: Christian König <christian.koenig@amd.com> Acked-by: Alex Deucher <alexander.deucher@amd.com>	2015-08-26 19:25:19 +02:00
Marek Olšák	189953ee13	radeonsi: remove old CS tracing code Some of it is left there and it will be re-used in the next commit. Acked-by: Christian König <christian.koenig@amd.com> Acked-by: Alex Deucher <alexander.deucher@amd.com>	2015-08-26 19:25:19 +02:00
Marek Olšák	be6dc87776	radeonsi: save the contents of indirect buffers for debug contexts This will be used by the IB parser. Acked-by: Christian König <christian.koenig@amd.com> Acked-by: Alex Deucher <alexander.deucher@amd.com>	2015-08-26 19:25:19 +02:00
Marek Olšák	110873ed11	radeonsi: add an initial dump_debug_state implementation dumping shaders This is usually called after a draw call. Acked-by: Christian König <christian.koenig@amd.com> Acked-by: Alex Deucher <alexander.deucher@amd.com>	2015-08-26 19:25:18 +02:00
Grazvydas Ignotas	3206d4ed44	gallium/radeon: use helper functions to mark atoms dirty This is analogous to r300_mark_atom_dirty() used by r300, and will be used by later patches. For common radeon code, appropriate helper is called through a function pointer. No functional changes. Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2015-08-11 14:46:53 +02:00
Marek Olšák	2d3ae154ba	radeonsi: move CP DMA functions to their own file Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-07-31 16:49:17 +02:00
Marek Olšák	b0528118df	radeonsi: completely rework updating descriptors without CP DMA The patch has a better explanation. Just a summary here: - The CPU always uploads a whole descriptor array to previously-unused memory. - CP DMA isn't used. - No caches need to be flushed. - All descriptors are always up-to-date in memory even after a hang, because CP DMA doesn't serve as a middle man to update them. This should bring: - better hang recovery (descriptors are always up-to-date) - better GPU performance (no KCACHE and TC flushes) - worse CPU performance for partial updates (only whole arrays are uploaded) - less used IB space (no CP_DMA and WRITE_DATA packets) - simpler code - hopefully, some of the corruption issues with SI cards will go away. If not, we'll know the issue is not here. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-07-31 16:49:16 +02:00
Marek Olšák	3344699243	radeonsi: set VGT_LS_HS_CONFIG for tessellation Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-07-23 00:59:33 +02:00
Marek Olšák	74c1001d13	radeonsi: add derived tessellation state Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-07-23 00:59:33 +02:00
Marek Olšák	db267a04ce	radeonsi: implement a fixed-function tessellation control shader and its state Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-07-23 00:59:32 +02:00
Marek Olšák	b6f4fdf6a9	radeonsi: set up a ring buffer for tessellation factors Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-07-23 00:59:32 +02:00
Marek Olšák	59b3556f4c	radeonsi: program VGT_SHADER_STAGES_EN for tessellation Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-07-23 00:59:32 +02:00
Marek Olšák	d1f43a7e5b	radeonsi: add code for creating, binding and destroying tessellation shaders This doesn't do anything yet. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-07-23 00:59:31 +02:00
Marek Olšák	3ce91c727f	radeonsi: rework how shader pointers to descriptors are set This is mainly needed for tessellation where a VS can be bound as VS, ES, or LS, and TES (tess. evaluationshader) can be bound as VS or ES or neither. Therefore we need the ability to move pointers to descriptors between shaders arbitrarily. The idea is that the context has a mapping from PIPE_SHADER_x to SPI_SHADER_USER_DATA_x. After a shader is enabled or disabled, si_shader_change_notify should be called to update this mapping accordingly. There is a dirty flag for each shader pointer, but only one emit function for all pointers in the whole context, whose code and logic is separated from descriptors. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-07-23 00:59:31 +02:00
Ilia Mirkin	a2a1a5805f	gallium: replace INLINE with inline Generated by running: git grep -l INLINE src/gallium/ \| xargs sed -i 's/\bINLINE\b/inline/g' git grep -l INLINE src/mesa/state_tracker/ \| xargs sed -i 's/\bINLINE\b/inline/g' git checkout src/gallium/state_trackers/clover/Doxyfile and manual edits to src/gallium/include/pipe/p_compiler.h src/gallium/README.portability to remove mentions of the inline define. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Acked-by: Marek Olšák <marek.olsak@amd.com>	2015-07-21 17:52:16 -04:00
Marek Olšák	f1be3d8cdd	radeonsi: don't flush an empty IB if the only thing we need is a fence Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2015-07-05 15:08:59 +02:00
Michel Dänzer	56e38edc96	radeonsi: Add CIK SDMA support Based on the corresponding SI support. Same as that, this is currently only enabled for one-dimensional buffer copies due to issues with multi-dimensional SDMA copies. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2015-06-08 18:13:22 +09:00
Michel Dänzer	d64adc3a79	radeonsi: Cache LLVMTargetMachineRef in context instead of in screen Fixes a crash in genymotion with several threads compiling shaders concurrently. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=89746 Cc: 10.5 <mesa-stable@lists.freedesktop.org> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2015-03-30 15:15:10 +09:00
Marek Olšák	dc39413640	radeonsi: move scratch reloc state setup - move it to its own function - do it after all states are emitted - bump SI_MAX_DRAW_CS_DWORDS Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-03-16 12:54:19 +01:00
Marek Olšák	1f4bb38264	radeonsi: don't emit PA_SC_LINE_STIPPLE after every rasterizer state change Do it only when the line stipple state is changed. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-03-16 12:54:19 +01:00
Marek Olšák	f5832f3f9d	radeonsi: move PA_SU_SC_MODE_CNTL to rasterizer state This requires enabling the optional GL provoking vertex behavior for quads. + some cosmetic changes, so that the register is set exactly the same as on r600. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-03-16 12:54:19 +01:00
Marek Olšák	98a2398222	radeonsi: implement line and polygon smoothing Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-03-16 12:54:19 +01:00
Marek Olšák	303d23e10d	radeonsi: add shader code for smoothing The fragment shader multiplies the alpha channel with gl_SampleMaskIn. If blending is enabled, it looks like MSAA. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-03-16 12:54:19 +01:00
Marek Olšák	4f20a8f278	radeonsi: split sample locations into its own state atom Sample locations are not updated as often as framebuffers. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-03-16 12:54:18 +01:00
Marek Olšák	6c5af1dc4e	radeonsi: implement polygon stippling Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-02-04 14:34:13 +01:00
Marek Olšák	1fe7ba8c69	radeonsi: deduce rasterizer primitive type at the beginning of draw_vbo I will need this for polygon stippling. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-02-04 14:34:13 +01:00
Marek Olšák	b142dd2f24	radeonsi: move the buffer descriptor to the end of the image descriptor This will allow supporting NULL textures. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-02-04 14:34:13 +01:00
Tom Stellard	2397a72129	radeonsi: Enable VGPR spilling for all shader types v5 v2: - Only emit write SPI_TMPRING_SIZE once per packet. - Use context global scratch buffer. v3: - Patch shaders using WRITE_DATA packet instead of map/unmap. - Emit ICACHE_FLUSH, CS_PARTIAL_FLUSH, PS_PARTIAL_FLUSH, and VS_PARTIAL_FLUSH when patching shaders. v4: - Code cleanups. - Remove unnecessary multiplies. v5: - Patch shaders in system memory and re-upload to vram. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-01-28 21:03:47 +00:00
Michel Dänzer	82b7ee62fc	Revert "radeonsi: only set BC_OPTIMIZE_DISABLE when necessary" This reverts commit `0543630d0b`. It caused flickering artifacts in Steam games such as Team Fortress 2 or Left 4 Dead 2. We could probably only enable this optimization by also making sure the shader code only uses either SI_PARAM_LINEAR_CENTROID or SI_PARAM_LINEAR_CENTER, not both. This would probably require a shader variant. Sorry I didn't remember this when reviewing the reverted change. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2015-01-15 15:09:48 +09:00
Marek Olšák	ca9c5b2be5	radeonsi: improve and fix streamout flushing - we don't usually need to flush TC L2 - we should flush KCACHE (not really an issue now since we always flush KCACHE when updating descriptors, but it could be a problem if we used CE, which doesn't require flushing KCACHE) - add an explicit VS_PARTIAL_FLUSH flag Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-01-07 12:06:43 +01:00
Marek Olšák	0aecf9e2d1	radeonsi: add a combined flag for flushing a framebuffer Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-01-07 12:06:43 +01:00

1 2 3

110 Commits