mirrors/mesa - Frog Git

Commit Graph

Author	SHA1	Message	Date
Brian Paul	f39840f866	gallium/util: whitespace fixes in u_debug_memory.c Trivial.	2018-07-27 21:21:24 -06:00
Brian Paul	2261d6a403	mesa: whitespace clean-up in texstore.c Trivial.	2018-07-27 21:21:24 -06:00
Brian Paul	a67b629193	mesa: move var decls in texstore_rgba() Move them closer to where they're first used. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-07-27 21:21:24 -06:00
Brian Paul	5e2582b381	mesa: remove unneeded free() call in texstore_rgba() The pointer will always be NULL since that's what we just tested for. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2018-07-27 21:21:24 -06:00
Eric Anholt	942456f646	v3d: Skip printing sub-id or pad fields in CLIF dumping. The parser doesn't expect them, so our fields would end up mismatched. They're not really useful in console output, either.	2018-07-27 18:00:48 -07:00
Eric Anholt	3ee0ab599e	v3d: Emit commands to switch CLIF parser to CL/shader/attr input mode. By default after saying you are emitting a buffer, it'll expect a buffer size. Once you set a format, it'll keep parsing that format until you announce something else.	2018-07-27 18:00:46 -07:00
Eric Anholt	a57770aa37	v3d: Dump fields in CLIF output in increasing offset order. Previously, we emitted in XML order, which I happen to type in the decreasing offset order of the specifications. However, the CLIF parser wants increasing offsets.	2018-07-27 17:56:55 -07:00
Eric Anholt	95bafeeabf	v3d: Print addresses in CLIFs as references to buffers. With CLIFs, the parser will choose an address for the buffer being created, so we need to use effectively relocations to buffers instead of the addresses that the driver uses. This is also a whole lot more intelligible for console output than raw addresses!	2018-07-27 17:56:36 -07:00
Eric Anholt	3c02838d29	v3d: Stop doing pretty-printed colorful booleans in CLIF output. The parser wants to see a 1 or 0. We can put "true" and "false" in a comment to clarify that it's a boolean and the parser will skip it.	2018-07-27 17:55:57 -07:00
Eric Anholt	422910d2e7	v3d: Move clif dumping to a separate step from noting where the CLs are. Now all the printing happens from the same worklist processing.	2018-07-27 17:08:35 -07:00
Eric Anholt	01b4952773	v3d: Move clif dump BO lookup into the clif dumper. The clif dumper is going to need information about all of our BOs if we're going to dump them for replay purposes.	2018-07-27 17:08:35 -07:00
Eric Anholt	e92959c4e0	v3d: Pass the whole clif_dump structure to v3d_print_group(). To generate CLIF files that the v3dv3 simulator can parse, we're going to need to decode addresses, and for that we'll need the vaddr lookup function from the clif structure from within v3d_decoder.	2018-07-27 17:08:35 -07:00
Timothy Arceri	77207e5380	ac: pass write param to get_sampler_desc() from get_image_descriptor() Looks like a mistake from when the deref stuff landed. Fixes: `506a07e4e3` ("ac/nir: Add deref support to image intrinsics.") Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-07-28 08:57:03 +10:00
Marek Olšák	d89a123dfd	gallium/u_vbuf: split u_vbuf_get_minmax_index function (v2) This will be used by indirect multidraws. v2: clean up the function further, change return types to unsigned Reviewed-by: Eric Anholt <eric@anholt.net> (v1)	2018-07-27 17:50:40 -04:00
Alexander von Gluck IV	da8de6b757	gallium/auxiliary: Extern "c" fixes. Used by C++ code such as Haiku's renderer. Reviewed-by: Brian Paul <brianp@vmware.com>	2018-07-27 16:19:12 -05:00
Marek Olšák	5fe943aaee	gallium/noop: implement invalidate_resource	2018-07-27 16:31:56 -04:00
Dave Airlie	5040319331	radv: fix cdw check vs tracing emit If we have tracing enabled we could do all the tracing emits and overflow the precalculated cdw_max. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-07-28 06:20:27 +10:00
Dave Airlie	b88468f15c	radv: return binary code_size not variant code size to cache The code sizes return here get passed to the cache shader insert function, which then memcpy from the code ptr, and causes all sorts of valgrind errors like: ==6755== Invalid read of size 8 ==6755== at 0x4C32FEE: memcpy@GLIBC_2.2.5 (vg_replace_strmem.c:1021) ==6755== by 0x2305D4C7: radv_pipeline_cache_insert_shaders (radv_pipeline_cache.c:416) ==6755== by 0x2305791D: radv_create_shaders (radv_pipeline.c:2158) ==6755== by 0x2305C523: radv_pipeline_init (radv_pipeline.c:3404) ==6755== by 0x2305C890: radv_graphics_pipeline_create (radv_pipeline.c:3515) ==6755== by 0x230188AB: radv_device_init_meta_blit_color (radv_meta_blit.c:871) ==6755== by 0x2301D50E: radv_device_init_meta_blit_state (radv_meta_blit.c:1278) ==6755== by 0x23011893: radv_device_init_meta (radv_meta.c:352) ==6755== by 0x2300744B: radv_CreateDevice (radv_device.c:1576) ==6755== by 0x5187D0F: ??? (in /usr/lib64/libvulkan.so.1.1.77) ==6755== by 0x518F6A3: ??? (in /usr/lib64/libvulkan.so.1.1.77) ==6755== by 0x5192A42: vkCreateDevice (in /usr/lib64/libvulkan.so.1.1.77) ==6755== Address 0x22a58548 is 4 bytes after a block of size 116 alloc'd ==6755== at 0x4C2EBAB: malloc (vg_replace_malloc.c:299) ==6755== by 0x23089DC4: ac_elf_read (ac_binary.c:144) ==6755== by 0x23090A60: ac_compile_module_to_binary (ac_llvm_helper.cpp:162) ==6755== by 0x23053F06: compile_to_memory_buffer (radv_llvm_helper.cpp:58) ==6755== by 0x23053F06: radv_compile_to_binary (radv_llvm_helper.cpp:98) ==6755== by 0x23052769: ac_llvm_compile (radv_nir_to_llvm.c:3394) ==6755== by 0x23052823: ac_compile_llvm_module (radv_nir_to_llvm.c:3418) ==6755== by 0x23053C05: radv_compile_nir_shader (radv_nir_to_llvm.c:3542) ==6755== by 0x23061B4E: shader_variant_create (radv_shader.c:580) ==6755== by 0x23061CFD: radv_shader_variant_create (radv_shader.c:634) ==6755== by 0x23057765: radv_create_shaders (radv_pipeline.c:2123) ==6755== by 0x2305C523: radv_pipeline_init (radv_pipeline.c:3404) ==6755== by 0x2305C890: radv_graphics_pipeline_create (radv_pipeline.c:3515) Since we are just inserting the code into the cache, we can avoid these bad reads and data in the cache by just using the binary code size here. Fixes: `939e5a382` (radv: add padding for the UMR disassembler) Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-07-28 06:20:20 +10:00
Eric Anholt	22a1ba0403	v3d: Drop the use of the semaphores. The kernel's scheduler doesn't rely on our emitting them, and in fact we'd get in trouble if the kernel decided to schedule too many bins in a row before getting around to scheduling the corresponding render.	2018-07-27 12:56:36 -07:00
Eric Anholt	9bf9a6d6a1	v3d: Drop the VG support from the XML. This reflects a change on the HW/closed SW side to drop this unused HW. With it dropped on their side, the CLIF parser no longer expects to find VG fields.	2018-07-27 12:56:36 -07:00
Eric Anholt	5a1cc3861c	v3d: Use /* */ instead of () for enum names in CLIF output. This lets the comments be ignored by the CLIF parser.	2018-07-27 12:56:36 -07:00
Eric Anholt	95a0f99825	v3d: CLIF-dump the "Vec size" field as 0 == maximum value. That's what a user should want to see, and what the CLIF parser wants. This should maybe be generalized.	2018-07-27 12:56:36 -07:00
Eric Anholt	1c8e4632a7	v3d: Stop using spaces in the names of our buffers. For CLIF dumping, we need names to not have spaces. Rather than rewriting them after the fact, just change the two cases where I had put a space in.	2018-07-27 12:56:36 -07:00
Fritz Koenig	ab05dd183c	i965: implement GL_MESA_framebuffer_flip_y [v3] Instead of using _mesa_is_winsys_fbo or _mesa_is_user_fbo to infer if an fbo is flipped use the FlipY flag. v2: * additional window-system framebuffer checks [for jason] v3: * s/inverted_y/flip_y/g [for chadv] * s/InvertedY/FlipY/g [for chadv] Reviewed-by: Chad Versace <chadversary@chromium.org>	2018-07-27 12:33:32 -07:00
Fritz Koenig	318c265160	mesa: GL_MESA_framebuffer_flip_y extension [v4] Adds an extension to glFramebufferParameteri that will specify if the framebuffer is vertically flipped. Historically system framebuffers are vertically flipped and user framebuffers are not. Checking to see the state was done by looking at the name field. This adds an explicit field. v2: * updated spec language [for chadv] * correctly specifying ES 3.1 [for chadv] * refactor access to rb->Name [for jason] * handle GetFramebufferParameteriv [for chadv] v3: * correct _mesa_GetMultisamplefv [for kusmabite] v4: * update spec language [for chadv] * s/GLboolean/bool/g [for chadv] * s/InvertedY/FlipY/g [for chadv] * s/inverted_y/flip_y/g [for chadv] * assert changes [for chadv] Reviewed-by: Chad Versace <chadversary@chromium.org>	2018-07-27 12:32:25 -07:00
Chad Versace	7953399e59	gallium/auxiliary: Fix Autotools on Android (v2) Problem 1: u_debug_stack_android.cpp transitively included "pipe/p_compiler.h", but src/gallium/include was missing from the C++ include path. Problem 2: Add -std=c++11 to AM_CXXFLAGS. Android's libbacktrace headers require C++11, but the Android toolchain (at least in the Chrome OS SDK) does not enable C++11 by default. v2: Add -std=c++11. Reviewed-by: Gurchetan Singh <gurchetansingh@chromium.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Tomasz Figa <tfiga@chromium.org> Cc: Eric Engestrom <eric.engestrom@intel.com>	2018-07-27 11:35:56 -07:00
Topi Pohjolainen	a5889d70f2	i965/icl: Disable binding table prefetching Gen 11 workarounds table #2056 WABTPPrefetchDisable suggests to disable prefetching of binding tables for ICLLP A0 and B0 steppings. It fixes multiple gpu hangs in ext_framebuffer_multisample* tests on ICLLP B0 h/w. Anuj: Add comments and commit message. Add gen 11 checks in the code. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2018-07-27 11:05:04 -07:00
Caio Marcelo de Oliveira Filho	1d71981b27	glsl: use only copy_propagation_elements Now that the elements version handles both cases, remove the non-elements version. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-07-27 10:51:25 -07:00
Caio Marcelo de Oliveira Filho	134b5a7047	glsl: teach copy_propagation_elements to deal with whole variables Keep information in acp_entry whether the entry is full or not, and use the ACP in more nodes when visiting the instructions: - add_copy: write whole variables to the ACP state (regardless the type). - visit(ir_dereference_variable ): perform the propagation here if we have a full candidate. Element-wise here doesn't apply because the mask isn't available at this point. - visit_leave(ir_assignment ): process beyond scalar and vector, as the full variables might have other types. Also import an improvement from opt_copy_propagation.cpp: if ir_call is an intrinsic, we know the variables affected, so keep going. v2: (all from Eric Anholt) Describe how acp_entry attributes are used. Don't do book-keeping to avoid adding repeated element to the dsts in write_elements(). v3: Use _mesa_set_remove_key. (Thomas Helland) Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Thomas Helland <thomashelland90@gmail.com>	2018-07-27 10:51:25 -07:00
vadym.shovkoplias	399228ecad	i965: Disable guardband clipping on SandyBridge for odd dimensions Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=104388 Signed-off-by: Andriy Khulap <andriy.khulap@globallogic.com> Acked-by: Rafael Antognolli <rafael.antognolli@intel.com>	2018-07-27 10:07:44 -07:00
Dylan Baker	665fc9cf55	docs: Update release calendar, add news item, and add release notes for 18.1.5	2018-07-27 07:08:59 -07:00
Dylan Baker	2b7b5d3100	docs: Add sha-256 sums for 18.1.5	2018-07-27 07:06:55 -07:00
Dylan Baker	5cc4ee3e17	docs: add 18.1.5 release notes	2018-07-27 07:06:53 -07:00
Iago Toral Quiroga	615aaedb93	intel/compiler: fix lower conversions to account for predication The pass can create a temporary result for the instruction and then moves from it to the original destination, however, if the original instruction was predicated, the mov has to be predicated as well. Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com>	2018-07-27 14:48:29 +02:00
Samuel Pitoiset	df679b1643	radv: allocate enough space in radv_cmd_buffer_after_draw() The driver might emit up to 4 dwords when RADV_TRACE_FILE is used. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-07-27 14:31:29 +02:00
Samuel Pitoiset	c08ae911d9	radv: check CS space in radv_emit_write_data_packet() This wasn't wrong but it looks better to me like this. It's only used for debugging purposes (ie. RADV_TRACE_FILE). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-07-27 14:31:27 +02:00
Samuel Pitoiset	434630f57c	radv: do not emit pipeline stats flushes on compute queue Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-07-27 14:31:26 +02:00
Samuel Pitoiset	c118c8938c	radv: reduce CB/DB meta flushes in radv_dst_access_flush() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-07-27 14:31:24 +02:00
Kenneth Graunke	0c4e0471f5	radv: Fix build I renamed this pass and forgot to update radv. Fixes: `488972222c` ("i965: Combine both gl_PatchVerticesIn lowering passes.")	2018-07-26 23:57:13 -07:00
Kenneth Graunke	488972222c	i965: Combine both gl_PatchVerticesIn lowering passes. Until now, we had separate passes for lowering gl_PatchVerticesIn to a statically known constant (for TES inputs when linked against a TCS), and a uniform in the other cases. Annoyingly, one had to be run before nir_lower_system_values, and the other afterward. This simplified the passes, but made life painful for the callers. This patch combines both into a single pass. If you give it a non-zero static count, it uses that. If you give it Mesa state slots, it turns it back into a built-in uniform. Otherwise, it does nothing. This also moves the i965 uniform lowering out to shared code. v2: Make token arrays const. Reviewed-by: Eric Anholt <eric@anholt.net>	2018-07-26 21:51:36 -07:00
Sagar Ghuge	29dd5dda9d	i965: Expose EXT_base_instance extension in OpenGLES 3.0 The extension requires at least OpenGL 3.0 and OpenGL ES 3.0. Fixes two ext_base_instance tests: arb_base_instance-baseinstance-doesnt-affect-gl-instance-id_gles3 arb_base_instance-drawarrays_gles3 Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2018-07-26 17:25:35 -07:00
Bas Nieuwenhuizen	3665f66ef2	radv: Add support for ETC2 textures. Was surprised that is even supported by Vega. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-07-27 01:31:32 +02:00
Jan Vesely	1e8b8e0878	clover: Reduce wait_count in abort path. Trigger waiter condition variable. Passes 'events' CTS on carrizo and turks. v2: reduce to 0 Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2018-07-26 15:38:22 -04:00
Jan Vesely	c2942141ae	clover: Don't extend illegal integer types. It's OK to pass them in memory, which is what kernel invocation needs. Fixes regressions since llvm r337535 ("Reapply "AMDGPU: Fix handling of alignment padding in DAG argument lowering"): scalar-arithmetic-char scalar-arithmetic-uchar scalar-arithemtic-short scalar-arithmetic-ushort scalar-comparison-char scalar-comparison-uchar scalar-comparison-short scalar-comparison-ushort Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2018-07-26 15:38:22 -04:00
Kenneth Graunke	8794fe3e30	intel/compiler: Delete dead VS intrinsic handling. These are lowered by brw_nir_lower_vs_inputs(). If they weren't, we would have already hit the unreachable() in emit_system_values_block(). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-07-26 11:45:34 -07:00
Eric Anholt	deecc1ef86	v3d: Avoid the GFXH-1461 workaround if we have only Z or only S. This seems like a sensible precaution to avoid extra draws. It doesn't deal with the case of a Z24S8 buffer created by the window system for an application that happens to never use S.	2018-07-26 11:02:25 -07:00
Eric Anholt	301c32caf4	v3d: Rework the ordering of how we clear things. First, figure out if we can just sneak the clear into the TLB clear, even if drawing has already happened (since we have job->load and job->clear to tell us), taking into account GFXH-1461. For any pieces we can't TLB clear, fall back to drawing a quad without flushing the scene. Fixes extra scene flushes in glmark2 due to GFXH-1461.	2018-07-26 11:02:25 -07:00
Eric Anholt	ceecddfe77	v3d: Only store buffers that have been written to. I've seen cases where a color buffer is bound, but only Z is written, and we end up storing color.	2018-07-26 11:02:25 -07:00
Eric Anholt	d29435e7cb	v3d: Track the buffers being loaded separately. We were computing this at RCL generation time, but that means you can't unflag the store for an invalidate_resource, or not flag the store if writmasking is disabled.	2018-07-26 11:02:20 -07:00
Eric Anholt	47f5d158ae	v3d: Rename cleared/resolve to clear/store. These describe what the fields mean in RCL generation. "resolve" is left over from VC4, and sounds like MSAA resolves (which may or may not be involved in the store we generate).	2018-07-26 11:00:34 -07:00

1 2 3 4 5 ...

103853 Commits All Branches Search

103853 Commits

All Branches