mirrors/mesa - Frog Git

Commit Graph

Author	SHA1	Message	Date
Matt Turner	7638e75cf9	i965: Use brw_reg's nr field to store register number. In addition to combining another field, we get to replace silliness like "reg.reg" with something that actually makes sense, "reg.nr"; and no one will ever wonder again why dst.reg isn't a dst_reg. Moving the now 16-bit nr field to a 16-bit boundary decreases code size by about 3k. Reviewed-by: Emil Velikov <emil.velikov@collabora.co.uk> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-11-13 11:27:50 -08:00
Matt Turner	3048053908	i965: Unwrap some lines. Reviewed-by: Emil Velikov <emil.velikov@collabora.co.uk> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-11-13 11:27:50 -08:00
Matt Turner	58fa9d47b5	i965/vec4: Remove swizzle/writemask fields from src/dst_reg. Also allows us to handle HW_REGs in the swizzle() and writemask() functions. Reviewed-by: Emil Velikov <emil.velikov@collabora.co.uk> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-11-13 11:27:50 -08:00
Matt Turner	94b1031703	i965: Remove fixed_hw_reg field from backend_reg. Since backend_reg now inherits brw_reg, we can use it in place of the fixed_hw_reg field. Reviewed-by: Emil Velikov <emil.velikov@collabora.co.uk> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-11-13 11:27:50 -08:00
Matt Turner	1392e45bfb	i965: Use immediate storage in inherited brw_reg. Reviewed-by: Emil Velikov <emil.velikov@collabora.co.uk> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-11-13 11:27:50 -08:00
Matt Turner	d74dd703f8	i965: Add and use enum brw_reg_file. Reviewed-by: Emil Velikov <emil.velikov@collabora.co.uk> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-11-13 11:27:50 -08:00
Matt Turner	977df90d65	i965: Reorganize brw_reg fields. Put fields that are meaningless with an immediate in the same storage with the immediate. This leaves fields type, file, nr, subnr in the first dword where there's now extra room for expansion. Reviewed-by: Emil Velikov <emil.velikov@collabora.co.uk> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-11-13 11:27:50 -08:00
Matt Turner	e42fb0c2a6	i965: Make 'dw1' and 'bits' unnamed structures in brw_reg. Generated by sed -i -e 's/\.bits\././g' *.c *.h *.cpp sed -i -e 's/dw1\.//g' *.c *.h *.cpp and then reverting changes to comments in gen7_blorp.cpp and brw_fs_generator.cpp. There wasn't any utility offered by forcing the programmer to list these to access their fields. Removing them will reduce churn in future commits. This is C11 (and gcc has apparently supported it for some time "compatibility with other compilers"). See https://gcc.gnu.org/onlinedocs/gcc/Unnamed-Fields.html Reviewed-by: Emil Velikov <emil.velikov@collabora.co.uk> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-11-13 11:27:50 -08:00
Matt Turner	182f137521	i965: Delete type field from backend_reg. Switching from an implicitly-sized type field to a field with an explicit bit width is safe because we have fewer than 2^4 types, and gcc will warn if you attempt to set a value that will not fit. Reviewed-by: Emil Velikov <emil.velikov@collabora.co.uk> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-11-13 11:27:50 -08:00
Matt Turner	433df2e03c	i965: Delete abs/negate fields from backend_reg. Instead use the ones provided by brw_reg. Also allows us to handle HW_REGs in the negate() functions. Reviewed-by: Emil Velikov <emil.velikov@collabora.co.uk> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-11-13 11:27:50 -08:00
Matt Turner	c7ed5d1d1c	i965: Make backend_reg inherit from brw_reg. Some fields (file, type, abs, negate) in brw_reg are shadowed by backend_reg. Reviewed-by: Emil Velikov <emil.velikov@collabora.co.uk> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-11-13 11:27:50 -08:00
Matt Turner	88f349c4e1	i965/fs: Replace nested ternary with if ladder. Since the types of the expression were bool ? src_reg : (bool ? brw_reg : brw_reg) the result of the second (nested) ternary would be implicitly converted to a src_reg by the src_reg(struct brw_reg) constructor. I.e., bool ? src_reg : src_reg(bool ? brw_reg : brw_reg) In the next patch, I make backend_reg (the parent of src_reg) inherit from brw_reg, which changes this expression to return brw_reg, which throws away any fields that exist in the classes derived from brw_reg. I.e., src_reg(bool ? brw_reg(src_reg) : bool ? brw_reg : brw_reg) Generally this code was gross, and wasn't actually shorter or easier to read than an if ladder. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2015-11-13 11:27:50 -08:00
Marek Olšák	3694d58e6c	radeonsi: remove dead code after ES-GS linkage change Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-11-13 19:54:42 +01:00
Marek Olšák	d79a3449a7	radeonsi: link ES-GS just like LS-HS This reduces the shader key for ES. Use a fixed attrib location based on (semantic name, index). The ESGS item size is determined by the physical index of the highest ES output, so it's almost always larger than before, but I think that shouldn't matter as long as the ESGS ring buffer is large enough. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-11-13 19:54:42 +01:00
Marek Olšák	b1c5f3faa9	radeonsi: calculate optimal GS ring sizes to fix GS hangs on Tonga I discovered that increasing the ESGS ring size fixes GS hangs on Tonga, so let's do it properly. There is now a separate init_config_gs_rings state that is not immutable, because GS rings are resized when needed. This also saves some memory. Most apps won't need more than 1MB per ring per shader engine. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:42 +01:00
Marek Olšák	2f5d911ba2	radeonsi: rename si_update_gs_rings Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:42 +01:00
Marek Olšák	4acd856088	radeonsi: calculate ESGS_RING_ITEMSIZE in create_shader Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:42 +01:00
Marek Olšák	a0cf589961	radeonsi: move maximum gs stream calculation into create_shader Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:42 +01:00
Marek Olšák	3ab0c49f04	radeonsi: clean up small duplication in si_shader_gs Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:42 +01:00
Marek Olšák	eb0d3e8a90	gallium/radeon: shorten render_cond variable names and ..._cond -> ..._invert Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:42 +01:00
Marek Olšák	70c40cc989	gallium/radeon: remove predicate_drawing flag Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:42 +01:00
Marek Olšák	12596cfd4c	gallium/radeon: atomize render condition (SET_PREDICATION) Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:42 +01:00
Marek Olšák	3521907622	gallium/radeon: simplify restoring render condition after flush Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:42 +01:00
Marek Olšák	600e212d87	gallium/radeon: don't use PREDICATION_OP_CLEAR Not setting the predication bit is sufficient. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:41 +01:00
Marek Olšák	6eff5415e4	gallium/radeon: simplify disabling render condition for u_blitter just disable it by not setting the predication bit Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:41 +01:00
Marek Olšák	8dd1ee6ff3	r600g: don't set predication on non-draw packets This has no effect. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:41 +01:00
Marek Olšák	6cc8f6c6a7	gallium/radeon: inline the r600_rings structure Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:41 +01:00
Marek Olšák	3d963abc81	radeonsi: prevent recursion in si_context_gfx_flush The recursion can only occur if you modify need_cs_space to always flush. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:41 +01:00
Marek Olšák	8569f9a87e	gallium/radeon: remove the IB flushing flag Not needed anymore. A similar flag will be introduced in the next commit, which will be private in radeonsi. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:41 +01:00
Marek Olšák	81d412e02c	gallium/radeon: move GFX/DMA flushing from add_to_buffer_list to need_cs_space need_cs_space isn't invoked so often and is called before all commands too. This is a lot cleaner. The code in radeon_add_to_buffer_list always seemed dodgy to me. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:41 +01:00
Marek Olšák	c6012a6650	radeonsi: rename cache flushing flags once more KCACHE, TC L1 and TC L2 are renamed to: - SMEM L1 - VMEM L1 - GLOBAL L2 You can easily tell what they are used for now. Shaders must deal with coherency issues between both L1s manually, e.g. by setting GLC=1 or by using s_dcache_*. BOTH_ICACHE_KCACHE was an unused definition. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:41 +01:00
Marek Olšák	10130ccd8c	radeonsi: set the DISABLE_WR_CONFIRM flag on CI-VI as well I missed this in commit `c3e527f93d` radeonsi: only enable write confirmation on the last CP DMA packet Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:41 +01:00
Marek Olšák	40912dd91e	radeonsi: initialize SX_PS_DOWNCONVERT to 0 on Stoney otherwise the SX or CB blocks can go bananas Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Cc: mesa-stable@lists.freedesktop.org	2015-11-13 19:54:41 +01:00
Marek Olšák	f7757100f2	radeonsi: add glClearBufferSubData acceleration 8-bit and 16-bit clears which are not aligned to dwords are done in software. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:41 +01:00
Marek Olšák	19773f9805	radeonsi: add SI_SAVE_FRAGMENT_STATE blitter flag Buffer clears via transform feedback won't set this. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:41 +01:00
Marek Olšák	19a9c1ecc7	gallium/u_blitter: add support for multi-dword clear values in clear_buffer Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:41 +01:00
Marek Olšák	e15c5c7a06	radeonsi: fix a future crash in emit_cb_target_mask This can't crash currently, but it would crash if clear_buffer from u_blitter were used with a clean context. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:41 +01:00
Marek Olšák	65d0c558d5	radeonsi: fix unaligned clear_buffer fallback This is unreachable currently, but it will be used by unaligned 8-bit and 16-bit fills. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:40 +01:00
Marek Olšák	7f1e34e6c8	r600g: fix clear_buffer fallback with offset != 0 Discovered by luck. This code path hasn't been exercised since transform feedback was implemented. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2015-11-13 19:54:40 +01:00
Marek Olšák	01526136ba	gallium/radeon: fix PIPE_QUERY_GPU_FINISHED Broken by the addition of r600_multi_fence in `3b37155a68` Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=89014 Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-11-13 19:54:40 +01:00
Brian Paul	40663864d2	mesa: minor comment fix in blend.c	2015-11-13 08:02:19 -07:00
Brian Paul	5a5efbf804	docs: add link to Coverity on developer utilities page Signed-off-by: Brian Paul <brianp@vmware.com>	2015-11-13 08:02:19 -07:00
Brian Paul	00046393f8	docs: update VMware driver instructions Use a LIBDIR variable, set per-platform. Update the Mesa configuration flags. Run update-initramfs or dracut, update /etc/modules Signed-off-by: Brian Paul <brianp@vmware.com>	2015-11-13 08:02:19 -07:00
Daniel Stone	d1314de293	egl/wayland: Ignore rects from SwapBuffersWithDamage eglSwapBuffersWithDamage accepts damage-region rectangles to hint the compositor that it only needs to redraw certain areas, which was passed through the wl_surface_damage request, as designed. Wayland also offers a buffer transformation interface, e.g. to allow users to render pre-rotated buffers. Unfortunately, there is no way to query buffer transforms, and the damage region was provided in surface, rather than buffer, co-ordinate space. Users could in theory account for this themselves, but EGL also requires co-ordinates to be passed in GL/mathematical co-ordinate space, with an inversion to Wayland's natural/scanout co-ordinate space, so transformations other than a 180-degree rotation will fail as EGL attempts to subtract the region from (its view of the) surface height. Pending creation and acceptance of a wl_surface.buffer_damage request, which will accept co-ordinates in buffer co-ordinate space, pessimise to always sending full-surface damage. `bce64c6c` provides the explanation for why we send maximum-range damage, rather than the full size of the surface: in the presence of buffer transformations, full-surface damage may not actually cover the entire surface. Signed-off-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Pekka Paalanen <pekka.paalanen@collabora.co.uk> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-11-13 10:09:23 +00:00
Iago Toral Quiroga	a29d922c1a	Revert "nir/copy_propagate: do not copy-propagate MOV srcs with source modifiers" The change proposed in the review leads to piglit regressions because is_move() is used in other places and relies on the checks for source modifiers to be there. Revert this until we agree on a better solution.	2015-11-13 08:53:10 +01:00
Samuel Iglesias Gonsálvez	5f004fd197	glsl: fix 'shared' layout qualifier related regressions Commit `8b28b35` added 'shared' as a keyword for compute shaders but it broke the existing 'shared' layout qualifier support for uniform and shader storage blocks. This patch fixes 578 dEQP-GLES31.functional.ssbo.* tests. v2: - Move SHARED to interface_block_layout_qualifier (Timothy) - Don't remove "shared" case insensitive check (Timothy) - Remove the clearing of shared_storage flag (Timothy) Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2015-11-13 08:04:49 +01:00
Iago Toral Quiroga	8610cd6b8c	nir/copy_propagate: do not copy-propagate MOV srcs with source modifiers If a source operand in a MOV has source modifiers, then we cannot copy-propagate it from the parent instruction and remove the MOV. v2: remove the check for source modifiers from is_move() (Jason) Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-11-13 07:54:33 +01:00
Jason Ekstrand	5f43e074d4	nir/vars_to_ssa: Delete dead output set code This was a remnant of an early attempt to handle output reads in vars_to_ssa. That attempt was abandoned a long time ago but these few lines were apparently left in the pass and managed to evade review. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-11-12 22:08:43 -08:00
Jason Ekstrand	226ba889a0	nir/vars_to_ssa: Rework copy set handling in lower_copies_to_load_store Previously, we walked through a given deref_node's copies and, after lowering the copy away, removed it from both the source and destination copy sets. This commit changes this to only remove it from the other node's copy set (not the one we're lowering). At the end of the loop, we just throw away the copy set for the node we're lowering since that node no longer has any copies. This has two advantages: 1) It's more efficient because we're doing potentially half as many set search operations. 2) It now properly handles copies from a node to itself. Previously, it would delete the copy from the set when processing the destination and then assert-fail when we couldn't find it for the source. Cc: "11.0" <mesa-stable@lists.freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=92588 Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-11-12 22:08:43 -08:00
Jason Ekstrand	4bbf2ac06e	nir/validate: Allow subroutine types for the tails of derefs The shader-subroutine code creates uniforms of type SUBROUTINE for subroutines that are then read as integers in the backends. If we ever want to do any optimizations on these, we'll need to come up with a better plan where they are actual scalars or something, but this works for now. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=92859 Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-11-12 22:08:43 -08:00
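
Commit 88f349c4e1 describes how a conditional expression that mixes a derived register class with its base class yields the base type, dropping the derived fields. The following is a minimal sketch of that effect and of the if-ladder replacement. `Reg` and `SrcReg` are hypothetical stand-ins, not the Mesa definitions of brw_reg and src_reg, and `extra` is an invented field used only to make the slicing visible.

```cpp
#include <cassert>

// Hypothetical base class standing in for brw_reg.
struct Reg {
    int nr = 0;
};

// Hypothetical derived class standing in for src_reg.
struct SrcReg : Reg {
    int extra = 0;  // exists only in the derived class

    SrcReg() = default;
    // Implicit converting constructor, in the spirit of src_reg(struct brw_reg).
    SrcReg(const Reg &r) : Reg(r) {}
};

// The operands have types SrcReg and Reg, so the conditional expression has
// the base type Reg. The SrcReg operand is sliced, and the result is then
// converted back to SrcReg with `extra` reset to its default.
inline SrcReg pick_with_ternary(bool use_src, const SrcReg &s, const Reg &r)
{
    return use_src ? s : r;
}

// Each branch returns its own operand, so no common type is computed and
// the derived fields survive.
inline SrcReg pick_with_ladder(bool use_src, const SrcReg &s, const Reg &r)
{
    if (use_src)
        return s;
    return r;
}
```

With `s.extra` set to 7, `pick_with_ternary(true, s, r)` returns a register whose `extra` is 0, while `pick_with_ladder(true, s, r)` keeps the 7.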
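
Commit 182f137521 argues that an explicit 4-bit type field is safe because there are fewer than 2^4 register types. A sketch of how that assumption could be guarded at compile time is shown here. The enumerators, the field widths other than the 4-bit type and 16-bit nr, and the helper `make_reg` are all hypothetical and do not reproduce the Mesa layout.

```cpp
#include <cassert>

// Hypothetical type list; the real enum has different members.
enum RegType {
    TYPE_UD,
    TYPE_D,
    TYPE_UW,
    TYPE_W,
    TYPE_F,
    NUM_REG_TYPES  // keep last: counts the enumerators above
};

constexpr unsigned kTypeBits = 4;

// Compile-time guard: every type value must fit in kTypeBits bits.
static_assert(static_cast<unsigned>(NUM_REG_TYPES) <= (1u << kTypeBits),
              "register type does not fit in the bit-field");

struct Reg {
    unsigned type : kTypeBits;  // explicit width instead of a full-size field
    unsigned file : 3;          // width chosen for illustration only
    unsigned nr   : 16;
};

// Build a zero-initialized register with the given type and number.
inline Reg make_reg(RegType type, unsigned nr)
{
    Reg r = {};
    r.type = type;
    r.nr = nr;
    return r;
}
```

Adding a seventeenth enumerator before `NUM_REG_TYPES` would make the `static_assert` fail instead of silently truncating values.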
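
Commit e42fb0c2a6 removes the names 'dw1' and 'bits' so that their members are accessed directly. The sketch below contrasts the two access styles on a hypothetical register struct; the member list is an assumption, not the actual brw_reg layout. It uses only an anonymous union, which is standard C++. Anonymous structs, which the commit also relies on, are standard in C11 but a compiler extension in C++.

```cpp
#include <cassert>
#include <cstdint>

// Before: the immediate storage is reached through a named member.
struct NamedReg {
    std::uint32_t nr;
    union {
        float f;
        std::int32_t d;
        std::uint32_t ud;
    } dw1;
};

// After: the union is anonymous, so its members belong to the enclosing scope.
struct UnnamedReg {
    std::uint32_t nr;
    union {
        float f;
        std::int32_t d;
        std::uint32_t ud;
    };
};

inline NamedReg named_imm_ud(std::uint32_t value)
{
    NamedReg r = {};
    r.dw1.ud = value;  // extra level of naming
    return r;
}

inline UnnamedReg unnamed_imm_ud(std::uint32_t value)
{
    UnnamedReg r = {};
    r.ud = value;      // same storage, shorter access path
    return r;
}
```

The two structs have the same members and the same storage; only the spelling of the access changes, which is why the commit could be generated mechanically with sed.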
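
Commit d1314de293 explains that EGL damage rectangles use a bottom-left origin and must be inverted against the surface height for Wayland, which breaks once a buffer transform is in play. The sketch below shows that inversion and the full-range fallback the commit switches to. `Rect`, `egl_to_surface_rect` and `full_damage` are hypothetical names, and the use of INT32_MAX for "maximum-range damage" is an assumption about what the message means.

```cpp
#include <cassert>
#include <cstdint>

struct Rect {
    std::int32_t x, y, width, height;
};

// Convert a rectangle from EGL's bottom-left origin to a top-left origin.
// This is only valid when `surface_height` really is the height of the
// space the rectangle lives in, which a buffer transform can invalidate.
inline Rect egl_to_surface_rect(const Rect &egl, std::int32_t surface_height)
{
    return Rect{egl.x, surface_height - egl.y - egl.height,
                egl.width, egl.height};
}

// Pessimistic fallback: ignore the caller's rectangles and damage the
// largest representable area (assumed here to be INT32_MAX in each direction).
inline Rect full_damage()
{
    return Rect{0, 0, INT32_MAX, INT32_MAX};
}
```

For a 100-pixel-high surface, the EGL rectangle (10, 20, 30, 40) maps to (10, 40, 30, 40); after a 90-degree buffer transform the same subtraction would use the wrong dimension, which is the failure the commit avoids by always sending full damage.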
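
Commit 226ba889a0 changes copy-set bookkeeping so that a lowered copy is removed only from the other node's set, and the current node's set is discarded at the end. The sketch below models that scheme with standard containers. `Node`, `Copy` and `lower_copies` are hypothetical and much simpler than the NIR data structures; the explicit self-copy check is an assumption made here to keep the iteration valid, not a detail taken from the commit.

```cpp
#include <cassert>
#include <cstddef>
#include <unordered_set>
#include <vector>

// Each node tracks the ids of the copies that read from or write to it.
struct Node {
    std::unordered_set<int> copies;
};

// A copy is identified by its index in a vector and links two nodes.
struct Copy {
    Node *src;
    Node *dst;
};

// Lower every copy that touches `node`. Each copy is erased only from the
// other endpoint's set; `node`'s own set is cleared once at the end.
// A self-copy (src == dst) has no other endpoint, so nothing is erased for it.
inline std::size_t lower_copies(Node &node, const std::vector<Copy> &all_copies)
{
    std::size_t lowered = 0;
    for (int id : node.copies) {
        const Copy &c = all_copies[static_cast<std::size_t>(id)];
        Node *other = (c.src == &node) ? c.dst : c.src;
        if (other != &node)
            other->copies.erase(id);
        ++lowered;  // the actual lowering work would happen here
    }
    node.copies.clear();
    return lowered;
}
```

Because the loop never erases from the set it is iterating over, a copy from a node to itself is handled without a failed lookup, and each copy costs one set search instead of two.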

74421 Commits, All Branches