mirrors/mesa - Frog Git

Commit Graph

Author	SHA1	Message	Date
Rob Clark	6320e37d4b	nir: add amul instruction Used for address/offset calculation (ie. array derefs), where we can potentially use less than 32b for the multiply of array idx by element size. For backends that support `imul24`, this gives a lowering pass an easy way to find multiplies that potentially can be converted to `imul24`. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2019-10-18 15:08:54 -07:00
Rob Clark	0568761f8e	nir: Add a new ALU nir_op_imul24 Some hardware can do 24b multiply in a single instruction, but not 32b. However in most cases 24b is sufficient for address/offset calculation. Signed-off-by: Rob Clark <robdclark@chromium.org> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2019-10-18 15:08:54 -07:00
Eduardo Lima Mitev	bc2ccdc45a	freedreno/ir3: Handle newly added opcode nir_op_imad24_ir3 Simply emit an ir3_MAD_S24 instruction in the backend. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2019-10-18 15:08:54 -07:00
Eduardo Lima Mitev	32e5fbf47c	nir: Add a new ALU nir_op_imad24_ir3 ir3 compiler has a signed integer multiply-add instruction (MAD_S24) that is used for different offset calculations in the backend. Since we intend to move some of these calculations to NIR, we need a new ALU op that can directly represent it. Signed-off-by: Rob Clark <robdclark@chromium.org> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2019-10-18 15:08:54 -07:00
Rob Clark	6ad442acae	freedreno/ir3: rename mul.s/mul.u to mul.s24/mul.u24, to better reflect that these are 24b multiply. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2019-10-18 15:08:54 -07:00
Rob Clark	ad8167c1e0	nir/search: fix the PoT helpers Otherwise, if the base type is (for example) uint32, we would incorrectly think that PoT optimizations could not apply. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Jason Ekstsrand <jason@jleksrand.net> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2019-10-18 15:08:54 -07:00
Rob Clark	f30c256ec0	freedreno/ir3: enable pre-fs texture fetch for a6xx Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2019-10-18 21:11:54 +00:00
Rob Clark	72048dd799	turnip: add support for pre-fs texture fetch Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2019-10-18 21:11:54 +00:00
Rob Clark	a5afcc76d5	freedreno/a6xx: add support for pre-fs texture fetch Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2019-10-18 21:11:54 +00:00
Hyunjun Ko	e9450ad27d	freedreno/ir3: Add support for texture sampling pre-dispatch Signed-off-by: Eduardo Lima Mitev <elima@igalia.com> Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2019-10-18 21:11:54 +00:00
Eduardo Lima Mitev	2a0d45ae6c	freedreno/ir3: Add a NIR pass to select tex instructions eligible for pre-fetch The pass should run once at the end of shader compilation, for a4xx onwards. It iterates texture sampling instructions and mark those eligibile for pre-dispatch by changing the tex op from 'tex' to 'tex_prefetch'. An instruction is eligibile if: * The coordinate is a vector where all its components come from a shader input. * The order of the components match exactly that of the input (no swizzles). * The instruction is in the 'main' function, and in the outer most-block. The first two restrictions were arrived to empirically, so more testing could tighten or loosen it. The 3rd restriction is there to allow moving the instructions eligible for pre-dispatch to the beginning of the shader, so that we don't block the registers holding the result for too long. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2019-10-18 21:11:54 +00:00
Rob Clark	7d4213fe88	freedreno/ir3: force i/j pixel to r0.x It seems that pre-fs texture fetch only works if ij_pix ends up in r0.x. I've tried unknown zero bits, to no avail, and blob also seems to force r0.x when this feature is used. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2019-10-18 21:11:54 +00:00
Rob Clark	07e9bf564f	freedreno/ir3: add pre-dispatch tex fetch to disasm Useful to see in disassembly listing texture fetches that were moved to pre-dispatch. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2019-10-18 21:11:54 +00:00
Rob Clark	2b93eb9c76	freedreno/ir3: add dummy bary.f(ei) for pre-fs-fetch If the only use of varyings is a pre-shader texture-fetch, we still need to issue a bary.f with the end-input flag, otherwise we'll block further VS invocations, as the hw will think varying storage is still busy. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2019-10-18 21:11:54 +00:00
Rob Clark	392a309a55	freedreno/ir3: fixup register footprint to account for prefetch It is possible that the result of a pre-fs texture fetch is an output (or partially an output) of the FS. Sine the meta:tex_prefetch instructions are dropped before the assembler, we need to account for this when we fixup the register footprint. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2019-10-18 21:11:54 +00:00
Rob Clark	482e1b9955	freedreno/ir3: add meta instruction for pre-fs texture fetch Add a placeholder instruction to track texture fetches made prior to FS shader dispatch. These, like meta:input instructions are scheduled before any real instructions, so that RA realizes their result values are live before the first real instruction. And to give legalize a way to track usage of fetched sample requiring (sy) sync flags. There is some related special handling for varying texcoord inputs used for pre-fs-fetch, so that they are not DCE'd and remain in linkage between FS and previous stage. Note that we could almost avoid this special handling by giving meta:tex_prefetch real src arguments, except that in the FS stage, inputs are actual bary.f/ldlv instructions. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2019-10-18 21:11:54 +00:00
Rob Clark	11e467c378	freedreno/ir3: don't DCE ij_pix if used for pre-fs-texture-fetch When we enable pre-dispatch texture fetch, we could have a scenario where the barycentric i/j coord sysval is not used in the shader, but only used for the varying fetch for the pre-dispatch texture fetch. In this case we need to take care not to DCE this sysval. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2019-10-18 21:11:54 +00:00
Rob Clark	af817a44c1	freedreno/ir3: track sysval slot for inputs Will be needed for special handling of SYSTEM_VALUE_BARYCENTRIC_PIXEL (ij_pix) when pre-fs texture fetch is enabled. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2019-10-18 21:11:54 +00:00
Rob Clark	35692fab86	freedreno/ir3: remove unused ir3_instruction::inout Not sure I remember how long this has been unused for. But it's unused now. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2019-10-18 21:11:54 +00:00
Hyunjun Ko	fd14788e1f	freedreno/ir3: Add data structures to support texture pre-fetch Signed-off-by: Eduardo Lima Mitev <elima@igalia.com> Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2019-10-18 21:11:54 +00:00
Rob Clark	766a68cdb9	freedreno: update registers Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2019-10-18 21:11:54 +00:00
Eduardo Lima Mitev	f1d4fadf1b	nir: Add new texop nir_texop_tex_prefetch This is like nir_texop_tex, but signals that the sampling coordinates are immutable during the shader stage, in a way that allows the HW that supports pre-dispatching sampling operations to pre-fetch the result prior to scheduling the shader stage. This is introduced to support the feature in Freedreno. Adreno HW from a4xx supports it. A NIR pass introduced later in this series will detect sampling operations that are eligible for pre-dispatch, and replace nir_texop_tex by this new op, to tell the backend to enable pre-fetch. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2019-10-18 21:11:54 +00:00
Eric Engestrom	27df3e015b	osmesa: add missing #include <stdint.h> Fixes: `281466332b` ("gallium/osmesa: Introduce a test.") Closes: https://gitlab.freedesktop.org/mesa/mesa/issues/1947 Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Acked-by: Dylan Baker <dylan@pnwbakers.com>	2019-10-18 22:07:21 +01:00
Dylan Baker	1ce23b5653	docs: Add new feature for compiling for windows with meson Reviewed-by: Adam Jackson <ajax@redhat.com>	2019-10-18 13:02:58 -07:00
Dylan Baker	0b6b7ff3ca	appveyor: Move appveyor script into .appveyor directory This clears out the scripts directory completely Reviewed-by: Adam Jackson <ajax@redhat.com>	2019-10-18 13:02:58 -07:00
Dylan Baker	fbb969b98a	appveyor: Add support for building llvmpipe with meson Reviewed-by: Adam Jackson <ajax@redhat.com>	2019-10-18 13:02:58 -07:00
Dylan Baker	41b3eb08d9	docs: update meson docs for windows Reviewed-by: Adam Jackson <ajax@redhat.com>	2019-10-18 13:02:58 -07:00
Dylan Baker	821cf6942a	meson: Use cmake to find LLVM when building for windows We don't use cmake normally because it always results in static linking. This is very problematic for *nix OSes which expect shared linking by default, but for windows this isn't a problem as LLVM doesn't support shared linking on windows anyway. Reviewed-by: Adam Jackson <ajax@redhat.com>	2019-10-18 13:02:58 -07:00
Dylan Baker	b962c7c971	meson: Add support for wrapping llvm For building on Windows (when not using cygwin), users may want to use a binary wrap of LLVM, this provides a fallback to the LLVM dependency which may be used in this case Reviewed-by: Adam Jackson <ajax@redhat.com>	2019-10-18 13:02:58 -07:00
Dylan Baker	dbd554ba05	meson/llvmpipe: Add dep_llvm to driver_swrast This fixes build errors in gl-gdi on windows when using llvmpipe Reviewed-by: Adam Jackson <ajax@redhat.com>	2019-10-18 13:02:58 -07:00
Hal Gentz	fa611b07f1	Revert "egl: Add EGL_CONFIG_SELECT_GROUP_MESA ext." This reverts commit `173bc9d684`.	2019-10-18 18:41:51 +00:00
Hal Gentz	94386d476c	Revert "egl: Fixes transparency with EGL and X11." This reverts commit `90a19074b4`.	2019-10-18 18:41:51 +00:00
Hal Gentz	9997693960	Revert "egl: Puts RGBA visuals in the second config selection group." This reverts commit `a800d16e4f`.	2019-10-18 18:41:51 +00:00
Hal Gentz	4ef2c53755	Revert "egl: Configs w/o double buffering support have no `EGL_WINDOW_BIT`." This reverts commit `075a96aa92`.	2019-10-18 18:41:51 +00:00
Jonathan Marek	9a7a92c1ec	etnaviv: check NO_ASTC feature bit Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Lucas Stach <l.stach@pengutronix.de>	2019-10-18 19:30:41 +02:00
Jonathan Marek	15c5ec0024	etnaviv: fix TS samplers on GC7000L Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Lucas Stach <l.stach@pengutronix.de>	2019-10-18 19:23:59 +02:00
Jonathan Marek	ad48411d72	etnaviv: fix linear_nearest / nearest_linear filters on GC7000Lite MIN filter is only used when LOD MAX is at least 4 (I guess the 2 LSB don't actually exist). Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Lucas Stach <l.stach@pengutronix.de>	2019-10-18 19:06:44 +02:00
Lucas Stach	95adc393eb	etnaviv: GC7000: flush TX descriptor and instruction cache The etnaviv kernel driver will only ever flush write caches. As both the TX descriptor and instruction cache are read caches they must be flushed from the user cmdstream at an appropriate time. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Jonathan Marek <jonathan@marek.ca>	2019-10-18 19:06:39 +02:00
Lucas Stach	54dd288317	etnaviv: add linear texture support on GC7000 It's just a matter of writing the addressing mode into the texture descriptor. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Jonathan Marek <jonathan@marek.ca>	2019-10-18 19:06:35 +02:00
Wladimir J. van der Laan	eda73d7127	etnaviv: GC7000: Texture descriptors Create a separate implementation file with texture-descriptor-based sampler views and sampler states. Initialize the one or the other based on the GPU. There is so little in common that this seemed more appropriate that keeping them as one type of state object would only be confusing. This commit is actually a combiation of the original commit by Wladimir, fixes and TS implementation from Jonathan and changed to use softpin by Lucas. Signed-off-by: Wladimir J. van der Laan <laanwj@gmail.com> Signed-off-by: Jonathan Marek <jonathan@marek.ca> Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Guido Günther <agx@sigxcpu.org> Reviewed-by: Jonathan Marek <jonathan@marek.ca>	2019-10-18 19:06:20 +02:00
Lucas Stach	5bc3fcf620	etnaviv: check for softpin availability on Halti5 devices Halti5 uses texture descriptors to control the samplers, and thus needs to know the GPU virtual address for the texture buffers to fill into the descriptor buffer. Without softpin userspace has no control over the GPU VM and also no way to fix up the texture descriptor buffer, so there is no point in creating a screen on a Halti5 device without softpin being available. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Jonathan Marek <jonathan@marek.ca>	2019-10-18 19:05:25 +02:00
Lucas Stach	0bdf5420f1	etnaviv: drm: add softpin interface If softpin is available on the kernel side, we transparently replace the relocs with self-managed GPU virtual addresses. This allows to skip some work at the kernel side, as it doesn't need to touch the command stream anymore before submitting it to the hardware. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Jonathan Marek <jonathan@marek.ca>	2019-10-18 19:05:21 +02:00
Marek Vasut	e5cc66dfad	etnaviv: Rework locking Replace the per-screen locking of flushing with per-context one and add per-context lock around command stream buffer accesses, to prevent cross-context flushing from corrupting these command stream buffers. Signed-off-by: Marek Vasut <marex@denx.de>	2019-10-18 17:03:25 +00:00
Marek Vasut	0c38c5454b	etnaviv: Command buffer realloc Reallocate the command stream buffer in case it is too small. The older kernel versions are limited to 64 kiB buffer, so limit the size to avoid oversized buffers. Signed-off-by: Marek Vasut <marex@denx.de>	2019-10-18 17:03:25 +00:00
Marek Vasut	1456aa61cc	etnaviv: Rework resource status tracking Have each context track which resources it marked as pending read and pending write. Have each resource track in which context it is pending. This way, it is possible to identify when a resource is both pending read and pending write at the same time. Moreover, the status field can be correctly calculated and updated when necessary. Signed-off-by: Marek Vasut <marex@denx.de>	2019-10-18 17:03:25 +00:00
Lucas Stach	1194afdfe3	etnaviv: rework the stream flush to always go through the context flush This way we can ensure that the pipe driver tracking of pending resources stays in sync with the actual command buffer state, even if a space reservation triggers a forced flush. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Jonathan Marek <jonathan@marek.ca>	2019-10-18 17:03:25 +00:00
Lucas Stach	1864fcd8c7	etnaviv: drm: remove unused etna_cmd_stream_finish It's not used by anything and gets in the way for the refactoring of the flush handling. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca>	2019-10-18 17:03:25 +00:00
Lucas Stach	9e672e4d20	etnaviv: keep references to pending resources As long as a resource is pending in any context we must not destroy it, otherwise we'll hit a classical use-after-free with fireworks. To avoid this take a reference when the resource is first added to the pending set and put the reference when no longer pending. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Jonathan Marek <jonathan@marek.ca>	2019-10-18 17:03:25 +00:00
Marek Vasut	90e223646b	etnaviv: Make contexts track resources Currently, the screen tracks all resources for all contexts, but this is not correct. Each context should track the resources it uses. This also allows a context to detect whether a resource is used by another context and to notify another context using a resource that the current context is done using the resource. Signed-off-by: Marek Vasut <marex@denx.de> Cc: Christian Gmeiner <christian.gmeiner@gmail.com> Cc: Guido Günther <guido.gunther@puri.sm> Cc: Lucas Stach <l.stach@pengutronix.de>	2019-10-18 17:03:25 +00:00
Brian Paul	2946bd6628	REVIEWERS: add VMware reviewers	2019-10-18 16:42:40 +00:00

... 2 3 4 5 6 ...

116681 Commits All Branches Search

116681 Commits

All Branches