KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Marek Olšák	9322974ec7	radeonsi: always put persistent buffers into GTT on radeon This improves performance for certain games. Cc: 18.1 <mesa-stable@lists.freedesktop.org> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-06-19 12:52:28 -04:00
Marek Olšák	ffbbc008be	radeonsi: fix si_get_num_queries for radeon Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-06-19 12:52:28 -04:00
Marek Olšák	94b29763a4	radeonsi: don't expose performance counters for non-existent blocks Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-06-19 12:52:28 -04:00
Marek Olšák	a2451a4c23	ac/gpu_info: add radeon_info::num_tcc_blocks The values for the radeon winsys were copied from the kernel driver. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-06-19 12:52:28 -04:00
Marek Olšák	166c00e28e	radeonsi: set a better NUM_PATCHES hard limit AMDVLK uses 64 (distributed) and 16 (non-distributed). radeonsi will use 63 and 16. * This might improve tessellation performance on Hawaii, Bonaire, Tahiti, Pitcairn. (they will use 16) * I'm not sure if this matters for 1 SE configs. Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-06-19 12:52:28 -04:00
Marek Olšák	0d685ba290	radeonsi: make sure LS-HS vector lanes are reasonably occupied Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-06-19 12:52:28 -04:00
Marek Olšák	e93fe403bc	radeonsi: properly compute an LS-HS thread group size limit "64 / max * 4" is less than "64 * 4 / max". Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-06-19 12:52:28 -04:00
Eric Anholt	da0115b1c3	v3d: Fix blitting from a linear winsys BO. This is the case for the simulator environment, and broke many blitter tests by trying to texture from linear while the HW can only actually do UIF/UBLINEAR/LT. Just make a temporary and copy into it with the CPU, then blit from that. This is the kind of path that should use the TFU, but I haven't exposed that hardware yet. Fixes dEQP-GLES3.functional.fbo.blit.default_framebuffer.*	2018-06-19 09:42:20 -07:00
Eric Anholt	07b243674f	v3d: Add missing always_flush debug flag. The #define existed and was checked in the driver.	2018-06-19 09:42:20 -07:00
Tomeu Vizoso	9b1cb50ba4	virgl: Remove debugging left-overs Some fprintfs were probably left unintentionally a few years ago and are a bit of a nuisance. Fixes: `2d3301e4d5` ("virgl: fix reference counting of prime handles") Cc: Rob Herring <robh@kernel.org> Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-06-19 13:35:13 +02:00
Timothy Arceri	6c243ac2dd	glsl: fix desktop glsl linking regression The prog->Shaders[i]->IsES check was accidentally removed causing ES linking rules to be applied to desktop GLSL. Fixes: `725b1a406d` ("mesa/util: add allow_glsl_relaxed_es driconfig override")	2018-06-19 17:58:05 +10:00
Timothy Arceri	a9114b5e3e	util: add allow_glsl_relaxed_es to drirc for Google Earth VR Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-06-19 12:09:56 +10:00
Timothy Arceri	725b1a406d	mesa/util: add allow_glsl_relaxed_es driconfig override This relaxes a number of ES shader restrictions allowing shaders to follow more desktop GLSL like rules. This initial implementation relaxes the following: - allows linking ES shaders with desktop shaders - allows mismatching precision qualifiers - always enables standard derivative builtins These relaxations allow Google Earth VR shaders to compile. Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-06-19 12:09:56 +10:00
Timothy Arceri	781c23ece6	util: add allow_glsl_builtin_const_expression to drirc for Google Earth VR Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-06-19 12:09:56 +10:00
Timothy Arceri	90dbab0f9a	mesa/util: add allow_glsl_builtin_const_expression driconf override Google Earth VR shaders use builtins in constant expressions with GLSL 1.10. That feature wasn't allowed until GLSL 1.20. Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-06-19 12:09:56 +10:00
Timothy Arceri	de93f546a7	util: manually extract the program name from program_invocation_name Glibc has the same code to get program_invocation_short_name. However for some reason the short name gets mangled for some wine apps. For example with Google Earth VR I get: program_invocation_name: "/home/tarceri/.local/share/Steam/steamapps/common/EarthVR/Earth.exe" program_invocation_short_name: "e" Acked-by: Eric Engestrom <eric.engestrom@intel.com>	2018-06-19 12:09:56 +10:00
Bas Nieuwenhuizen	1a8501a9dd	ac/surface: Set compressZ for stencil-only surfaces. We HTILE compress stencil-only surfaces too. CC: 18.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-06-19 02:52:01 +02:00
Jason Ekstrand	0146d79636	anv: Use a single global API patch version The Vulkan API has only one patch version shared among all of the major.minor versions. We should also advertise the same patch version regardless of major.minor. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106941 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-06-18 17:11:52 -07:00
Timothy Arceri	68bf94a8b0	radeonsi: enable OpenGL 3.3 compat profile Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-06-19 09:21:33 +10:00
Timothy Arceri	89a5d6f715	mesa: add ff fragment shader support for geom and tess shaders This is required for compatibility profile support. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2018-06-19 09:21:33 +10:00
Eric Anholt	e636199c1c	v3d: Set the SO offsets correctly if we have to re-emit. This should fix TF across a glFlush() or TF pause/restart. Fixes dEQP-GLES3.functional.transform_feedback.array.interleaved.lines.highp_float and many, many others.	2018-06-18 14:54:16 -07:00
Marek Olšák	94178044d5	gallium/hud: = should rename the last added data source Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-06-18 17:53:15 -04:00
Rafael Antognolli	ba2c18763b	anv: Disable constant buffer 0 being relative. If we are on gen8+ and have context isolation support, just make that constant buffer address be absolute, so we can use it for push UBOs too. v2: Do not duplicate constant_buffer_0_is_relative flag (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-18 14:41:38 -07:00
Rafael Antognolli	be18d5a0ce	anv/device: Check for kernel support of context isolation. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-06-18 14:41:38 -07:00
Rafael Antognolli	056214ebfc	intel/genxml: Add bitmasks for CS_DEBUG_MODE2/INSTPM. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-06-18 14:41:38 -07:00
Alok Hota	a678f40e46	swr/rast: Clang-Format most rasterizer source code Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-06-18 13:57:38 -05:00
Eric Engestrom	d85fef1e34	radv: fix reported number of available VGPRs It's a bit late to round up after an integer division. Fixes: `de88979413` "radv: Implement VK_AMD_shader_info" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Alex Smith <asmith@feralinteractive.com>	2018-06-18 17:08:22 +01:00
Eric Engestrom	9a4bd6b45f	mesa: add missing return in error path Fixes: `67f40dadaa` "mesa: add support for ARB_sample_locations" Cc: Rhys Perry <pendingchaos02@gmail.com> Cc: Brian Paul <brianp@vmware.com> Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2018-06-18 16:19:48 +01:00
Bas Nieuwenhuizen	a3d93eec7c	radv: Use less conservative approximation for context rolls. Drops the number of times we set the scissor by 4x for F1 2017, which results in a consistent performance improvement of about 4%. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-06-18 16:21:10 +02:00
Eric Engestrom	4d08c1e7d1	radv: fix bitwise check Fixes: `922cd38172` "radv: implement out-of-order rasterization when it's safe on VI+" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-06-18 12:15:18 +01:00
Eric Engestrom	e8eb84826e	meson: fix i965/anv/isl genX static lib names Shouldn't make any functional difference, just that `liblibanv_gen90.a` will now be called `libanv_gen90.a`. Fixes: `3218056e0e` "meson: Build i965 and dri stack" Fixes: `d1992255bb` "meson: Add build Intel "anv" vulkan driver" Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2018-06-18 12:03:24 +01:00
Timothy Arceri	66673bef94	mesa: Unconditionally enable floating-point textures ARB_texture_float references US Patent #6,650,327 [1] which has a filing date of June 16 1998. According to [2], patents filed after 1995 expire 20 years from the filing date, giving an expiration of June 17 2018. [1] https://www.google.com/patents/US6650327 [2] https://en.wikipedia.org/wiki/Term_of_patent_in_the_United_States Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-06-18 09:29:38 +10:00
Jose Maria Casanova Crespo	b8e099e7d5	intel/fs: shuffle_64bit_data_for_32bit_write is not used anymore Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-16 22:39:08 +02:00
Jose Maria Casanova Crespo	a4965842d6	intel/fs: Use new shuffle_32bit_write for all 64-bit storage writes Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-16 22:39:08 +02:00
Jose Maria Casanova Crespo	a4d445b93c	intel/fs: shuffle_32bit_load_result_to_64bit_data is not used anymore Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-16 22:39:08 +02:00
Jose Maria Casanova Crespo	71b319a285	intel/fs: Use shuffle_from_32bit_read for 64-bit FS load_input As the previous use of shuffle_32bit_load_result_to_64bit_data had a source/destination overlap for 64-bit. Now a temporary destination is used for 64-bit cases to use shuffle_from_32bit_read that doesn't handle src/dst overlaps. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-16 22:39:08 +02:00
Jose Maria Casanova Crespo	8003ae87f4	intel/fs: shuffle_from_32bit_read at load_per_vertex_input at TCS/TES Previously, the shuffle function had a source/destination overlap that needs to be avoided to use shuffle_from_32bit_read. As we can use for the shuffle destination the destination of removed MOVs. This change also avoids the internal MOVs done by the previous shuffle to deal with possible overlaps. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-16 22:39:08 +02:00
Jose Maria Casanova Crespo	5565630f85	intel/fs: Use shuffle_from_32bit_read at VS load_input shuffle_from_32bit_read manages 32-bit reads to 32-bit destination in the same way that the previous loop so now we just call the new function for all bitsizes, simplifying also the 64-bit load_input. v2: Add comment about future 16-bit support (Jason Ekstrand) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-16 22:39:08 +02:00
Jose Maria Casanova Crespo	152bffb69b	intel/fs: Use shuffle_from_32bit_read for 64-bit gs_input_load This implementation avoids two unneeded MOVs for each 64-bit component. One was done in the old shuffle, to avoid cases of src/dst overlap but this is not the case. And the removed MOV was already being done in the shuffle. Copy propagation wasn't able to remove them because shuffle destination values are defined with partial writes because they have stride == 2. v2: Reword commit log summary (Jason Ekstrand) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-16 22:39:08 +02:00
Jose Maria Casanova Crespo	8b26a2d96d	intel/fs: shuffle_from_32bit_read for 64-bit do_untyped_vector_read do_untyped_vector_read is used at load_ssbo and load_shared. The previous MOVs are removed because shuffle_from_32bit_read can handle storing the shuffle results in the expected destination just using the proper offset. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-16 22:39:08 +02:00
Jose Maria Casanova Crespo	c2297bdf19	intel/fs: Remove old 16-bit shuffle/unshuffle functions Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-16 22:39:08 +02:00
Jose Maria Casanova Crespo	fd3d8a8f79	intel/fs: Use shuffle_for_32bit_write for 16-bits store_ssbo Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-16 22:39:08 +02:00
Jose Maria Casanova Crespo	20e4732f7d	intel/fs: Use shuffle_from_32bit_read to read 16-bit SSBO Using shuffle_from_32bit_read instead of 16-bit shuffle functions avoids the need of retype. At the same time new function are ready for 8-bit type SSBO reads. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-16 22:39:08 +02:00
Jose Maria Casanova Crespo	a0891eabca	intel/fs: Use shuffle_from_32bit_read at VARYING_PULL_CONSTANT_LOAD shuffle_from_32bit_read can manage the shuffle/unshuffle needed for different 8/16/32/64 bit-sizes at VARYING PULL CONSTANT LOAD. To get the specific component the first_component parameter is used. In the case of the previous 16-bit shuffle, the shuffle operation was generating unneeded MOVs whose results were never used. This behaviour passed unnoticed on SIMD16 because the dead_code_eliminate pass removed the generated instructions, but for SIMD8 they couldn't be removed because they were partial writes. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-16 22:39:08 +02:00
Jose Maria Casanova Crespo	22c654941b	intel/fs: New shuffle_for_32bit_write and shuffle_from_32bit_read These new shuffle functions deal with the shuffle/unshuffle operations needed for read/write operations using 32-bit components when the read/written components have a different bit-size (8, 16, 64-bits). Shuffle from 32-bit to 32-bit becomes a simple MOV. shuffle_src_to_dst takes care of doing a shuffle when source type is smaller than destination type and an unshuffle when source type is bigger than destination. So this new read/write functions just need to call shuffle_src_to_dst assuming that writes use a 32-bit destination and reads use a 32-bit source. As shuffle_for_32bit_write/from_32bit_read components take components in unit of source/destination types and shuffle_src_to_dst takes units of the smallest type component, we adjust components and first_component parameters. To enable this new functions it is needed than there is no source/destination overlap in the case of shuffle_from_32bit_read. That never happens on shuffle_for_32bit_write as it allocates a new destination register as it was at shuffle_64bit_data_for_32bit_write. v2: Reword commit log and add comments to explain why first_component and components parameters are adjusted. (Jason Ekstrand) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-16 22:39:08 +02:00
Jose Maria Casanova Crespo	a5665056e5	intel/fs: general 8/16/32/64-bit shuffle_src_to_dst function This new function takes care of shuffle/unshuffle components of a particular bit-size in components with a different bit-size. If source type size is smaller than destination type size the operation needed is a component shuffle. The opposite case would be an unshuffle. Component units are measured in terms of the smaller type between source and destination. As we are un/shuffling the smaller components from/into a bigger one. The operation allows to skip first_component number of components from the source. Shuffle MOVs are retyped using integer types avoiding problems with denorms and float types if source and destination bitsize is different. This allows to simplify uses of shuffle functions that are dealing with these retypes individually. Now there is a new restriction so source and destination can not overlap anymore when calling this shuffle function. Following patches that migrate to use this new function will take care individually of avoiding source and destination overlaps. v2: (Jason Ekstrand) - Rewrite overlap asserts. - Manage type_sz(src.type) == type_sz(dst.type) case using MOVs from source to dest. This works for 64-bit to 64-bits operation that on Gen7 as it doesn't support Q registers. - Explain that components units are based in the smallest type. v3: - Fix unshuffle overlap assert (Jason Ekstrand) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-16 22:39:08 +02:00
Jose Fonseca	d882331f7a	appveyor: Consume LLVM 5.0.1. https://ci.appveyor.com/project/jrfonseca/mesa/build/47 Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2018-06-16 18:09:20 +01:00
Bas Nieuwenhuizen	c4714f698b	ac: Clear meminfo to avoid valgrind warning. Somehow valgrind misses that the value is initialized by the ioctl. Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-06-16 19:03:47 +02:00
Samuel Pitoiset	5917761e3d	radv: fix emitting the TCS regs on GFX9 The primitive ID is NULL and this generates an invalid select instruction which crashes because one operand is NULL. This fixes crashes in The Long Journey Home, Quantum Break and Just Cause 3 with DXVK. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106756 CC: <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-06-16 10:18:51 +02:00
Ian Romanick	355868dbfc	nir: Document a couple instances of parent_instr nir_ssa_def::parent_instr and nir_src::parent_instr have the same name, but they mean really different things. I choose to save the next person the hour+ that I just spent figuring that out. Even now that I know, I doubt I'd notice in code review that someone typed foo->parent_instr when they actually meant foo->ssa->parent_instr. v2: Minor wording tweak in nir_ssa_def::parent_instr. Suggested by Jason. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-06-15 17:36:51 -07:00
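
Two of the commits listed above (e93fe403bc and d85fef1e34) turn on where truncation happens in integer arithmetic. The following is a minimal C sketch of that pitfall, not the code from either patch: the helper names and the sample operand are hypothetical, and only the two expressions "64 / max * 4" and "64 * 4 / max" come from the commit message.

```c
#include <assert.h>

/* Hypothetical helpers for illustration only; not Mesa's actual code. */

/* Divides first, so the quotient is truncated before it is scaled. */
static unsigned scale_divide_first(unsigned max)
{
    assert(max > 0);
    return 64 / max * 4;
}

/* Multiplies first, so only the final result is truncated. */
static unsigned scale_multiply_first(unsigned max)
{
    assert(max > 0);
    return 64 * 4 / max;
}

/* Rounds the quotient up as part of the division itself.
 * Assumes d > 0 and that n + d - 1 does not overflow. */
static unsigned div_round_up(unsigned n, unsigned d)
{
    assert(d > 0);
    return (n + d - 1) / d;
}
```

With an illustrative max of 24, scale_divide_first gives 8 while scale_multiply_first gives 10. Likewise 10 / 4 is already 2 by the time any later rounding could run, whereas div_round_up(10, 4) gives 3.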
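
Commit de93f546a7 above describes deriving the short program name by hand from program_invocation_name instead of trusting program_invocation_short_name. A rough C sketch of that idea follows, assuming the short name is simply whatever comes after the last '/'. The helper name is hypothetical and this is not the code from the patch.

```c
#include <assert.h>
#include <string.h>

/* Hypothetical helper: returns the component after the last '/',
 * or the whole string when it contains no '/'. The result points
 * into the caller's buffer, so nothing is allocated. */
static const char *short_name_from_path(const char *path)
{
    assert(path != NULL);
    const char *slash = strrchr(path, '/');
    return slash ? slash + 1 : path;
}
```

Applied to the path quoted in the commit message, this yields "Earth.exe" rather than the mangled "e".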

102983 Commits All Branches Search