KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Pierre-Eric Pelloux-Prayer	8b6d19413f	mesa: add ARB_vertex_attrib_binding glVertexArray* functions We can't simply alias ARB_direct_state_access functions because those fail if the vao has never been bound before. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-11-19 08:49:45 +01:00
Pierre-Eric Pelloux-Prayer	657396aa10	mesa: extend vertex_array_attrib_format to support EXT_dsa Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-11-19 08:49:45 +01:00
Pierre-Eric Pelloux-Prayer	bb2241bf06	mesa: implement ARB_texture_storage_multisample + EXT_dsa functions Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-11-19 08:49:45 +01:00
Pierre-Eric Pelloux-Prayer	a0d667036d	mesa: add ARB_texture_buffer_range glTextureBufferRangeEXT function Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-11-19 08:49:45 +01:00
Pierre-Eric Pelloux-Prayer	b78e2a197a	mesa: add ARB_instanced_arrays EXT_dsa function Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-11-19 08:49:45 +01:00
Pierre-Eric Pelloux-Prayer	a807b8c0a8	mesa: add ARB_gpu_shader_fp64 selector-less functions Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-11-19 08:49:45 +01:00
Pierre-Eric Pelloux-Prayer	e3385eb0c1	mesa: add ARB_clear_buffer_object named functions Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-11-19 08:49:45 +01:00
Pierre-Eric Pelloux-Prayer	442fd3d007	mesa: add ARB_vertex_attrib_64bit VertexArrayVertexAttribLOffsetEXT Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-11-19 08:49:44 +01:00
Pierre-Eric Pelloux-Prayer	8cfb3e4ee5	mesa: add ARB_framebuffer_no_attachments named functions The wording in ARB_framebuffer_no_attachments and EXT_direct_state_access is different. In the former framebuffer names must have been generated using glGenFramebuffers before using the named functions. In the latter framebuffer names have no such constraints, so we can't use the _mesa_lookup_framebuffer_dsa function. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-11-19 08:49:44 +01:00
Pierre-Eric Pelloux-Prayer	dc057f638c	mesa: update features.txt to reflect EXT_dsa status All features from the EXT_dsa spec are implemented. Interactions with other specs: - GL_AMD_gpu_shader_int64: not needed, since it's not enabled in compatibility profile. - GL_ARB_bindless_texture is DONE "INVALID_OPERATION is generated when calling various functions to modify the state of a texture object from which handles have been extracted" - GL_ARB_buffer_storage/GL_EXT_buffer_storage is DONE (NamedBufferStorageEXT function) - GL_ARB_texture_storage is DONE (3 TextureStorageDEXT functions) - GL_ARB_vertex_attrib_binding is DONE (6 VertexArray functions) - GL_EXT_external_buffer is not supported by Mesa Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-11-19 08:49:44 +01:00
Alyssa Rosenzweig	8b1548a12f	panfrost: Set PIPE_COMPUTE_CAP_ADDRESS_BITS to 64 Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-19 06:22:31 +00:00
Alyssa Rosenzweig	9c28700aaf	panfrost: Disable tiling for GLOBAL resources It doesn't make sense to have nonlinear layouts for a buffer that can be accessed as direct memory for a compute kernel. Turn that off so things work as expected. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-19 06:22:31 +00:00
Alyssa Rosenzweig	21dd7574a8	panfrost: Pass kernel inputs as uniforms We can take the OpenCL kernel inputs and interpret them as uniforms by simply reusing the Gallium callback. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-19 06:22:31 +00:00
Alyssa Rosenzweig	a7b5dd1290	panfrost: Stub out clover callbacks We don't implement these yet but let's not crash. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-19 06:22:31 +00:00
Miguel Casas-Sanchez	b196958574	i965: Ensure that all 2101010 image imports can pass framebuffer completeness. Chrome OS would like to import and render to any supported format that has a corresponding display plane format, and this prevents throwing framebuffer incomplete for FBOs using these textures. See: crbug.com/949260 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-11-19 02:21:12 +00:00
Dave Airlie	1468a4f1f3	nir/serialize: fix serializing functions with no implementations. Store a flag stating if there was an implementation, and use fxn->impl as a temporary flag between deserialization stages. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-11-19 09:30:32 +10:00
Dave Airlie	0fd6b8aa98	nir/serialize: pack function has name and entry point into flags. Suggested by Jason. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-11-19 09:30:12 +10:00
Jason Ekstrand	fc72df1d93	iris: Re-enable param compaction In `d1c4e64a69`, we added a parameter to tell the back-end compiler to ignore the param array and just push however many constants you ask it to push. I enabled it for iris because this is really what iris wants but it seems to have caused a number of regressions. Revert to the old behavior for now. Fixes: `d1c4e64a69` "intel/compiler: Add a flag to avoid compacting..."	2019-11-18 16:54:07 -06:00
Marek Olšák	189c0cc45b	mesa: enable glthread for 7 Days To Die Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-11-18 17:25:57 -05:00
Iván Briano	ca94717035	intel/compiler: Don't change hstride if not needed Alignment requirements may have changed the horizontal stride already, so don't set it if not required to avoid breaking said requirements. Fixes several tests such as dEQP-VK.subgroups.vote.graphics.subgroupallequal_int8_t Signed-off-by: Iván Briano <ivan.briano@intel.com> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-11-18 14:19:41 -08:00
Jonathan Marek	3cd44839fa	turnip: add x11 wsi Copied from radv Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-11-18 22:18:05 +00:00
Jonathan Marek	df9f2adfa3	turnip: add display wsi Copied from radv (minus the fence change) Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-11-18 22:18:05 +00:00
Jason Ekstrand	7260df5894	nir: Validate that variables are in the right lists Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-11-18 16:15:30 -06:00
Jonathan Marek	e2b9d6277e	etnaviv: blt: set TS dirty after clear RS engine does this already, it is missing for BLT engine. This fixes cases where a clear isn't immediately at the start of the frame. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com>	2019-11-18 20:59:02 +01:00
Jonathan Marek	d819d4b344	etnaviv: separate PE and RS formats, use only RS only for tiling There are PE formats not supported by RS, so we can't have a single to translate both. Use RS only for same formats until we have a translate_rs_format and test the possible different format blits. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com>	2019-11-18 20:58:14 +01:00
Jonathan Marek	e1a86bd634	etnaviv: blt: use only for tiling, and add missing formats * Removes the incorrect usage of translate_rs_format * Disables use of BLT engine for different src/dst format We only really need the BLT engine for tiling/detiling right now, but it would be nice to support as many blit cases as possible to avoid using PE for that. To deal with different formats we need to: * Have a translate_blt_format which has all supported formats * Fix the swizzle translation from gallium (current version was wrong) * Set the src/dst sRGB bits as needed * Find which type conversions the BLT engine can actually do Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com>	2019-11-18 20:57:40 +01:00
Brian Paul	02c3dad0f3	Call shmget() with permission 0600 instead of 0777 A security advisory (TALOS-2019-0857/CVE-2019-5068) found that creating shared memory regions with permission mode 0777 could allow any user to access that memory. Several Mesa drivers use shared-memory XImages to implement back buffers for improved performance. This patch changes the shmget() calls to use 0600 (user r/w). Tested with legacy Xlib driver and llvmpipe. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2019-11-18 12:28:59 -07:00
Jason Ekstrand	fdaf8144a8	anv: Emit a NULL vertex for zero base_vertex/instance If both are zero (the common case), we can emit a null vertex buffer rather than emitting a vertex buffer with zeros in it. The packing of the VERTEX_BUFFER_STATE is faster because no relocation is emitted and we can avoid creating the vertex buffer which means one less anv_state_stream_alloc. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-11-18 18:35:14 +00:00
Jason Ekstrand	bc9d7836bc	anv: Use an anv_state for the next binding table This is a bit more natural because we're already getting an anv_state most places in the pipeline. The important part here, however, is that we're no longer calling anv_block_pool_map on every alloc_binding_table call. While it's probably pretty cheap, it is potentially a linear walk over the list of BOs and it was showing up in profiles. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-11-18 18:35:14 +00:00
Jason Ekstrand	98dc179c1e	anv: More carefully dirty state in BindPipeline Instead of blindly dirtying descriptors and push constants the moment we see a pipeline change, check to see if it actually changes the bind layout or push constant layout. This doubles the runtime performance of one CPU-limited example running with the Dawn WebGPU implementation when running on my laptop. NOTE: This effectively reverts `beca63c6c0`. While it was a nice optimization, it was based on prog_data and we can't do that anymore once we start allowing the same binding table to be used with multiple different pipelines. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-11-18 18:35:14 +00:00
Jason Ekstrand	22f16ff54a	anv: More carefully dirty state in BindDescriptorSets Instead of dirtying all graphics or all compute based on binding point, we're now much more careful. We first check to see if the actual descriptor set changed and then only dirty the stages used by that descriptor set. For dynamic offsets, we keep a bitfield per-stage of which offsets are actually used in that stage and we only dirty push constants and descriptors if that stage has dynamic offsets AND those offsets actually change. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-11-18 18:35:14 +00:00
Jason Ekstrand	ca8117b5d5	anv: Use a switch statement for binding table setup It theoretically could be more efficient but the real point here is that it's no longer really a matter of dealing with special cases and then the "real" thing. The way we're handling binding tables, it's more of a multi-step process and a switch is more natural. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-11-18 18:35:14 +00:00
Jason Ekstrand	9baa33cef0	anv: Rework push constant handling This substantially reworks both the state setup side of push constant handling and the pipeline compile side. The fundamental change here is that we're no longer respecting the prog_data::param array and instead are just instructing the back-end compiler to leave the array alone. This makes the state setup side substantially simpler because we can now just memcpy the whole block of push constants and don't have to upload one DWORD at a time. This also means that we can compute the full push constant layout up-front and just trust the back-end compiler to not mess with it. Maybe one day we'll decide that the back-end compiler can do useful things there again but for now, this is functionally no different from what we had before this commit and makes the NIR handling cleaner. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-11-18 18:35:14 +00:00
Jason Ekstrand	ca91ab8015	anv: Re-arrange push constant data a bit This moves the compute stuff into an anv_push_constants::cs sub-struct. It also moves dynamic offsets into the push constants. This means we have to duplicate the data per-stage but that doesn't seem like the end of the world and one day we may wish to make dynamic offsets per-stage anyway. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-11-18 18:35:14 +00:00
Jason Ekstrand	d1c4e64a69	intel/compiler: Add a flag to avoid compacting push constants In vec4, we can just not run the pass. In fs, things are a bit more deeply intertwined. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-11-18 18:35:14 +00:00
Jason Ekstrand	aecde23519	anv: Pre-compute push ranges for graphics pipelines It turns out that emitting push constants is one of the hottest paths in the driver and ANY work we do there costs us. By pre-computing things a bit ahead of time, we shave 5% off the runtime of a CPU-limited example running with the Dawn WebGPU implementation. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-11-18 18:35:14 +00:00
Jason Ekstrand	4b392ced2d	anv: Stop bounds-checking pushed UBOs The bounds checking is actually less safe than just pushing the data. If the bounds checking actually ever kicks in and it's not on the last UBO push range, then the shrinking will cause all subsequent ranges to be pushed to the wrong place in the GRF. One of the behaviors we definitely don't want is for OOB UBO access to result in completely unrelated UBOs returning garbage values. It's safer to just push the UBOs as-requested. If we're really concerned about robustness, we can emit shader code to do bounds checking which should be stupid cheap (a CMP followed by SEL). Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-11-18 18:35:14 +00:00
Jason Ekstrand	ebad00d9e7	anv: Delete dead shader constant pushing code As of `2d78e55a8c`, nir_intrinsic_load_constant with a constant offset is constant-folded so we should never end up with any that trigger brw_nir_analyze_ubo_ranges. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-11-18 18:35:14 +00:00
Jason Ekstrand	0709c0f6b4	anv: Flatten descriptor bindings in anv_nir_apply_pipeline_layout This lets us stop tracking the pipeline layout. It also means less indirection on a very hot path. As an extra bonus, we can make some of our data structures smaller. No measurable CPU overhead improvement. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-11-18 18:35:14 +00:00
Jason Ekstrand	fa120cb31c	anv: Input attachments are always single-plane Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-11-18 18:35:14 +00:00
Jason Ekstrand	0a02f2a278	genxml: Mark everything in genX_pack.h always_inline Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-11-18 18:35:14 +00:00
Jason Ekstrand	abfd4651ed	anv/pipeline: Assume layout != NULL In the early days of the driver we allowed layout to be VK_NULL_HANDLE and used that for some internal pipelines when we wanted to be lazy. Vulkan doesn't actually allow NULL layouts, however, so there's no reason to have this check. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-11-18 18:35:14 +00:00
Italo Nicola	59623f211b	intel/compiler: remove old comment This comment was correct some time ago, but since commit `d3c10ad427`, it isn't true anymore. Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com>	2019-11-18 10:20:34 -08:00
Alyssa Rosenzweig	3663340049	pan/midgard: Use shader stage in mir_op_computes_derivative A 'normal' texture op may be emitted in a vertex shader on T720 but it still doesn't take any derivatives. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-18 08:48:54 -05:00
Danylo Piliaiev	6f17fe0606	i965: Unify CC_STATE and BLEND_STATE atoms on Haswell as a workaround Re-emitting 3DSTATE_CC_STATE_POINTERS after emitting 3DSTATE_BLEND_STATE_POINTERS fixes the shadow flickering in SuperTuxCart and Tropico 6 which was seen only on Haswell. The reason for this is unknown and fix was found empirically. The closest mention in PRM is that it should improve performance. From the HSW PRM, volume 2b, page 823 (3DSTATE_BLEND_STATE_POINTERS): "When the BLEND_STATE pointer changes but not the CC_STATE pointer, driver needs to force a CC_STATE pointer change to improve blend performance in pixel backend." Closes: https://gitlab.freedesktop.org/mesa/mesa/issues/1834 Fixes: `eca4a654` ("i965: Disable dual source blending when shader doesn't support it on gen8+") Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-11-18 11:00:23 +02:00
Samuel Pitoiset	1ebd9459e7	radv: implement VK_AMD_device_coherent_memory This extension adds the device coherent and device uncached memory types. It's known to be slower than non-device coherent memory but it might be useful for debugging. This is only exposed for chips that support L2 uncached. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-18 08:20:19 +00:00
Samuel Pitoiset	2af7511ed2	ac: add radeon_info::has_l2_uncached For chips that have uncached device memory (ie. MTYPE_UC). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-18 08:20:19 +00:00
Pierre-Eric Pelloux-Prayer	3c9ea6bdfd	radeonsi: enable mesa_glthread for GfxBench It improves offscreen tests performance. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-11-18 09:16:18 +01:00
Alyssa Rosenzweig	bc9a7d0699	pan/midgard: Represent ld/st offset unpacked This simplifies manipulation of the offsets dramatically, fixing some UBO access related bugs. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-17 22:19:31 -05:00
Alyssa Rosenzweig	1798f6bfc3	pan/midgard: Fix masks/alignment for 64-bit loads These need to be handled with special care. Oh, Midgard, you're extra special. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-17 22:19:31 -05:00
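
Commit 4b392ced2d above notes that, if robustness matters, bounds checking can be done in shader code with "a CMP followed by SEL". The following is only a CPU-side sketch of that compare-then-select pattern, not Mesa or anv code: the function name `load_ubo_dword_checked`, its parameters, and the choice to return 0 for an out-of-bounds read are all assumptions made for illustration.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (not part of Mesa): emulates on the CPU the
 * compare-then-select guard a shader could place around a UBO read.
 * `ubo` holds `size_dwords` 32-bit values; `index` is the dword to read. */
uint32_t
load_ubo_dword_checked(const uint32_t *ubo, uint32_t size_dwords,
                       uint32_t index)
{
   /* CMP: is the requested dword inside the buffer? */
   const uint32_t in_bounds = index < size_dwords;

   /* Clamp the index so the load itself never leaves the buffer;
    * skip the load entirely for an empty buffer. */
   const uint32_t safe_index = in_bounds ? index : 0;
   const uint32_t loaded = size_dwords ? ubo[safe_index] : 0;

   /* SEL: keep the loaded value when in bounds, otherwise return 0.
    * The zero fallback is an assumption of this sketch. */
   return in_bounds ? loaded : 0;
}
```

Because the guard is applied per access, an out-of-bounds read only affects its own result and cannot shift where other ranges land, which is the failure mode the commit message describes for the old push-time shrinking.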


117708 Commits
