KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Rob Clark	d71a50f831	freedreno: combine fd_resource_layer_offset()/fd_resource_offset() We really only need this logic in one place. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-22 15:27:37 -05:00
Rob Clark	4ec2f6129b	freedreno: move fd_resource_copy_region() Code-motion prep for next patch. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-13 15:51:01 -05:00
Rob Clark	4cd016b5d6	freedreno: debug GEM obj names With a recent enough kernel, set debug names for GEM BOs, which will show up in $debugfs/gem Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-13 15:51:01 -05:00
Rob Clark	913eb7fa58	freedreno/a6xx: MSAA Reviewed-by: Kristian H. Kristensen <hoegsberg@chromium.org> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-12-06 16:55:59 -08:00
Hyunjun Ko	76945e4140	freedreno: implements get_sample_position Since `1285f71d3e` landed, it needs to provide apps with proper sample position for MSAA. Currently no way to query this to hw, these are taken from blob driver. Fixes: dEQP-GLES31.functional.texture.multisample.samples_#.sample_position Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-11-27 15:44:03 -05:00
Jonathan Marek	e68cd91251	freedreno: use MSM_BO_SCANOUT with scanout buffers Signed-off-by: Jonathan Marek <jonathan@marek.ca>	2018-11-27 15:44:03 -05:00
Rob Clark	ddb7fadaf8	freedreno: avoid no-op flushes by re-using last-fence Noticed that with webgl (in chromium, at least) we end up generating a lot of no-op submits just to get a fence. Tracking the last fence and returning that if there is no rendering since last flush avoids this. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-26 18:10:00 -04:00
Rob Clark	e8606b11dd	freedreno: add resource seqno Intended to be something more compact than a 64b pointer, which could be used as a key into hashtables. Prep work for texture state objects. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Neil Roberts	ee61790daf	freedreno: Remove the Emacs mode lines These are not necessary because the corresponding settings are set via the .dir-locals.el file anyway. Most of them were missing a ‘:’ after “tab-width” which was making Emacs display an annoying warning whenever you open the file. This patch was made with: sed -ri '/-\- mode:/,/^$/d' \ $(find src/gallium/{drivers,winsys} -name \.\[ch\] \ -exec grep -l -- '-\*- mode:' {} \+) Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-17 12:44:48 -04:00
Kenneth Graunke	38a23517fd	gallium/u_transfer_helper: Add support for separate Z24/S8 as well. u_transfer_helper already had code to handle treating packed Z32_S8 as separate Z32_FLOAT and S8_UINT resources, since some drivers can't handle that interleaved format natively. Other hardware needs depth and stencil as separate resources for all formats. For example, V3D3 needs this for 24-bit depth as well. This patch adds a new flag to lower all depth/stencils formats, and implements support for Z24_UNORM_S8_UINT. (S8_UINT_Z24_UNORM is left as an exercise to the reader, preferably someone who has access to a machine that uses that format.) Reviewed-by: Eric Anholt <eric@anholt.net>	2018-10-14 23:36:28 -07:00
Rob Clark	fa52ff856d	freedreno/a5xx+a6xx: fix LRZ pitch alignment Both RB_2D_DST_SIZE.PITCH (a6xx) and RB_MRT[n].PITCH (a5xx) need alignment to 64. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-10-08 19:05:14 -04:00
Rob Clark	ef6d15f8a8	freedreno: fix corrupted fb state In `c3d9f29b` we allowed ctx->batch to be null, and started tracking the current framebuffer state in fd_context. But the existing logic in fd_blitter_pipe_begin() would, if !ctx->batch, set null fb state to be restored after blit. Which broke the world of deqp (and probably other things) Fixes: `c3d9f29b78` freedreno: allocate ctx's batch on demand Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-09-27 15:27:38 -04:00
Kristian H. Kristensen	de3b34df97	freedreno: Add a6xx backend This adds a freedreno backend for the a6xx generation GPUs, which at the time of this commit is about 98% GLES2 conformant. Much remains to be done - both performance work and feature work towards more recent GLES versions, but this is a good start. Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-08-16 19:13:36 -04:00
Kristian H. Kristensen	e89683d5a2	freedreno: Fix warnings Signed-off-by: Kristian H. Kristensen <hoegsberg@chromium.org> Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-08-16 19:11:08 -04:00
Marek Olšák	966f155623	gallium: add storage_sample_count parameter into is_format_supported Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2018-07-31 18:28:41 -04:00
Rob Clark	cf0c7258ee	freedreno/a5xx: MSAA Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-06-21 08:54:47 -04:00
Rob Clark	39b63c18f1	freedreno/a5xx: texture tiling Overall a nice 5-10% gain for most games. And more for things like glmark2 texture benchmark. There are some rough edges. In particular, the hardware seems to only support tiling or component swap. (Ie. from hw PoV, ARGB/ABGR/RGBA/ BGRA are all the same format but with different component swap.) For tiled formats, only ARGB is possible. This isn't a big problem for sampling since we also have swizzle state there (and since util_format_compose_swizzles() already takes into account the component order, we didn't use COLOR_SWAP for sampling). But it is a problem if you try to render to a tiled BGRA (for example) surface. The next patch introduces a workaround for blitter, so we can generate tiled textures in ABGR/RGBA/BGRA, but that doesn't help the render- target case. To handle that, I think we'd need to keep track that the tiled format is different from the linear format, which seems like it would get extra fun with sampler views/etc. So for now, disabled by default, enable with FD_MESA_DEBUG=ttile. In practice it works fine for all the games I've tried, but makes piglit grumpy. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-01-14 16:13:39 -05:00
Rob Clark	16b91c2254	freedreno: add screen->setup_slices() for tex layout The rules are sufficiently different for a5xx with tiled textures, so split this out into something that can be implemented per-generation. The a5xx specific implementation will come in a later patch. Signed-off-by: Rob Clark <robdclark@gmail.com>	2018-01-14 16:10:06 -05:00
Ilia Mirkin	0dbdb07070	freedreno: set missing internal_format when importing texture Fixes running piglits without -fbo. Probably lots of other stuff too. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Rob Clark <robdclark@gmail.com>	2017-12-22 09:56:02 -05:00
Rob Clark	37464efa3f	freedreno: add generic blitter Basically a clone of util_blitter_blit() but with special handling to blit PIPE_BUFFER as a PIPE_TEXTURE_1D. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-12-17 12:41:32 -05:00
Rob Clark	2697480c92	freedreno: track staging and shadow perf ctrs for the HUD Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-12-17 12:41:32 -05:00
Rob Clark	d848bee50f	freedreno: staging upload transfers In the busy && !needs_flush case, we can support a DISCARD_RANGE upload using a staging buffer. This is a bit different from the case of mid- batch uploads which require us to shadow the whole resource (because later draws in an earlier tile happen before earlier draws in a later tile). Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-12-17 12:41:32 -05:00
Rob Clark	d1465b3aee	freedreno: use u_transfer_helper Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-12-15 08:09:44 -05:00
Rob Clark	e90f1a26c3	freedreno: remove use of u_transfer Freedreno doesn't treat buffers and images differently, so it's use was kind of pointless. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-12-04 11:50:45 -05:00
Rob Clark	4b1d0d2844	freedreno: small cleanups Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-12-03 14:17:41 -05:00
Rob Clark	4ab6ab8036	freedreno: avoid mem2gmem for invalidated buffers Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-12-03 14:17:41 -05:00
Rob Clark	15ebf387fc	freedreno: rework fence tracking ctx->last_fence isn't such a terribly clever idea, if batches can be flushed out of order. Instead, each batch now holds a fence, which is created before the batch is flushed (useful for next patch), that later gets populated after the batch is actually flushed. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-12-03 14:17:40 -05:00
Rob Clark	deb57fb237	freedreno: proper locking for iterating dependent batches In transfer_map(), when we need to flush batches that read from a resource, we should be holding screen->lock to guard against race conditions. Somehow deferred flush seems to make this existing race more obvious. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-12-03 14:17:40 -05:00
Rob Clark	4f0f80776f	freedreno: implement pipe->invalidate_resource() Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-10-26 08:39:32 -04:00
Rob Clark	a6bd23e43b	freedreno/a5xx: rename invalidate_resource() This is different from pipe->invalidate_resource().. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-10-26 08:39:32 -04:00
Rob Clark	eed9685dd6	freedreno: per-context fd_pipe To enable per-context priorities, we need to have per-context pipe's. Unfortunately we still need to keep the global screen pipe, mostly just for screen->get_timestamp(). Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-10-24 12:56:51 -04:00
Rob Clark	16ac70bdcf	freedreno/a5xx: align height to GMEM Similar to the way width/pitch alignment works, it seems like we need to do similar for height. Otherwise the BLIT from system memory to GMEM can over-fetch beyond the end of the buffer, triggering a fault. I'm not sure if there is a better solution yet. Possibly we could fall back to pre-a5xx style DRAW packets for cases where BLIT might over- fetch. (We in theory have that problem already with rendering to higher mipmap levels, although fortunately those tend to use GMEM bypass.) This fixes issues reported with glamor. Reported-by: don.harbin@linaro.org Cc: 17.2 <mesa-stable@lists.freedesktop.org> Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-10-02 09:25:57 -04:00
Rob Clark	5b60004525	freedreno/a5xx: LRZ support Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-06-07 12:32:00 -04:00
Rob Clark	313f6360aa	freedreno: drop timestamp field unused. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-06-07 12:32:00 -04:00
Marek Olšák	330d0607ed	gallium: remove pipe_index_buffer and set_index_buffer pipe_draw_info::indexed is replaced with index_size. index_size == 0 means non-indexed. Instead of pipe_index_buffer::offset, pipe_draw_info::start is used. For indexed indirect draws, pipe_draw_info::start is added to the indirect start. This is the only case when "start" affects indirect draws. pipe_draw_info::index is a union. Use either index::resource or index::user depending on the value of pipe_draw_info::has_user_indices. v2: fixes for nine, svga	2017-05-10 19:00:16 +02:00
Marek Olšák	c24c3b94ed	gallium: decrease the size of pipe_vertex_buffer - 24 -> 16 bytes	2017-05-10 19:00:16 +02:00
Rob Clark	4d841fbaae	freedreno: core SSBO support The generation-independent support for binding shader buffer objects. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-05-04 13:48:06 -04:00
Rob Clark	52d2fa37f5	freedreno: drop ring arg from _set_stage() It is always the draw ring. Except for a5xx queries like time-elapsed, where we will eventually want to emit cmds into both binning and draw rings. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-04-22 10:03:02 -04:00
Rob Clark	df63ff4d82	freedreno: make hw-query a helper For a5xx (and actually some queries on a4xx) we can accumulate results in the cmdstream, so we don't need this elaborate mechanism of tracking per-tile query results. So make it into vfuncs so generation specific backend can use it when it makes sense. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-04-22 10:03:01 -04:00
Rob Clark	4299849ec7	freedreno: refactor dirty state handling In particular, move per-shader-stage info out to a seperate array of enum's indexed by shader stage. This will make it easier to add more shader stages as well as new per-stage state (like SSBOs). Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-04-18 16:32:00 -04:00
Rob Clark	0cc23ae779	freedreno: make texture state an array Make this an array indexed by shader stage, as is done elsewhere for other per-shader-stage state. This will simplify things as more shader stages are eventually added. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-04-18 16:32:00 -04:00
Timothy Arceri	628e84a58f	gallium/util: replace pipe_mutex_unlock() with mtx_unlock() pipe_mutex_unlock() was made unnecessary with `fd33a6bcd7`. Replaced using: find ./src -type f -exec sed -i -- \ 's:pipe_mutex_unlock($[^)]*$):mtx_unlock(\&\1):g' {} \; Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-03-07 08:53:05 +11:00
Timothy Arceri	ba72554f3e	gallium/util: replace pipe_mutex_lock() with mtx_lock() replace pipe_mutex_lock() was made unnecessary with `fd33a6bcd7`. Replaced using: find ./src -type f -exec sed -i -- \ 's:pipe_mutex_lock($[^)]*$):mtx_lock(\&\1):g' {} \; Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-03-07 08:52:38 +11:00
Rob Clark	f043904080	freedreno/a5xx: texture layout Seems to be imilar to a4xx, and sampler state "array-pitch" needs to be aligned to page size. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-27 16:54:01 -05:00
Rob Clark	c1e9cca696	freedreno: pitch alignment should match gmem alignment Deal w/ differing gmem tile size alignment between generations, and make sure texture pitch matches. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-06 18:01:31 -05:00
Rob Clark	8cb965b112	freedreno: fix slice size for imported buffers Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-11-27 17:26:05 -05:00
Rob Clark	3ebfc44b42	freedreno: don't try to shadow layered textures We will only hit this with multi-planar YUV external images, so we would probably never hit this code path in the first place. But if we did, it wouldn't do the right thing so just bail. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-10-07 15:50:46 -04:00
Nicolai Hähnle	0334ba150f	freedreno: use the new parent/child pools for transfers Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-10-05 15:42:17 +02:00
Rob Clark	32c061b110	freedreno: reject imports with bogus pitch Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-09-07 11:41:38 -04:00
Marek Olšák	e7a73b75a0	gallium: switch drivers to the slab allocator in src/util	2016-09-06 14:24:04 +02:00
Rob Clark	a8e6734a83	freedreno: support for using generic clear path Since clears are more or less just normal draws, there isn't that much benefit in having hand-rolled clear path. Add support to use u_blitter instead if gen specific backend doesn't implement ctx->clear(). Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-08-16 09:21:13 -04:00
Rob Clark	e684c32d2f	freedreno: some locking Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	00bed8a794	freedreno: threaded batch flush With the state accessed from GMEM+submit factored out of fd_context and into fd_batch, now it is possible to punt this off to a helper thread. And more importantly, since there are cases where one context might force the batch-cache to flush another context's batches (ie. when there are too many in-flight batches), using a per-context helper thread keeps various different flushes for a given context serialized. TODO as with batch-cache, there are a few places where we'll need a mutex to protect critical sections, which is completely missing at the moment. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	c44163876a	freedreno: track batch/blit types Add a bit of extra book-keeping about blits and back-blits (from resource shadowing). If the app uploads all mipmap levels, as opposed to uploading the first level and then glGenerateMipmap(), we can discard the back-blit (as opposed to being naive and shadowing the resource for each mipmap level). Also, after a normal blit, we might as well flush the batch immediately, since there is not likely to be further rendering to the surface. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	7f8fd02dc7	freedreno: re-order support for hw queries Push query state down to batch, and use the resource tracking to figure out which batch(es) need to be flushed to get the query result. This means we actually need to allocate the prsc up front, before we know the size. So we have to add a special way to allocate an un- backed resource, and then later allocate the backing storage. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	ba30096888	freedreno: support discarding previous rendering in special cases Basically, to "DCE" blits triggered by resource shadowing, in cases where the levels are immediately completely overwritten. For example, mid-frame texture upload to level zero triggers shadowing and back-blits to the remaining levels, which are immediately overwritten by glGenerateMipmap(). Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	7105774bab	freedreno: shadow textures if possible to avoid stall/flush To make batch re-ordering useful, we need to be able to create shadow resources to avoid a flush/stall in transfer_map(). For example, uploading new texture contents or updating a UBO mid-batch. In these cases, we want to clone the buffer, and update the new buffer, leaving the old buffer (whose reference is held by cmdstream) as a shadow. This is done by blitting the remaining other levels (and whatever part of current level that is not discarded) from the old/shadow buffer to the new one. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	dcde4cd114	freedreno: spiff up some debug traces Make it easier to track batches, to ensure things happen properly when they are reordered. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	9f219c7047	freedreno: add batch-cache and batch reordering Note that I originally also had a entry-point that would construct a key and do lookup from a pipe_surface. I ended up not needing that (yet?) but it is easy-enough to re-introduce later if we need it for the blit path. For now, not enabled by default, but can be enabled (on a3xx/a4xx) with FD_MESA_DEBUG=reorder. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	f02a64dbdd	freedreno: move more batch related tracking to fd_batch To flush batches out of order, the gmem code needs to not depend on state from fd_context (since that may apply to a more recent batch). So this all moves into batch. The one exception is the gmem/pipe/tile state itself. But this is only used from gmem code (and batches are flushed serially). The alternative would be having to re-calculate GMEM layout on every batch, even if the dimensions of the render targets are the same. Note: This opens up the possibility of pushing gmem/submit into a helper thread. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	9e4561d3c4	freedreno: push resource tracking down into batch Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	9bbd239a40	freedreno: introduce fd_batch Introduce the batch object, to track a batch/submit's worth of ringbuffers and other bookkeeping. In this first step, just move the ringbuffers into batch, since that is mostly uninteresting churn. For now there is just a single batch at a time. Note that one outcome of this change is that rb's are allocated/freed on each use. But the expectation is that the bo pool in libdrm_freedreno will save us the GEM bo alloc/free which was the initial reason to implement a rb pool in gallium. The purpose of the batch is to eventually facilitate out-of-order rendering, with batches associated to framebuffer state, and tracking the dependencies on other batches. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Marek Olšák	1ffe77e7bb	gallium: split transfer_inline_write into buffer and texture callbacks to reduce the call indirections with u_resource_vtbl. The worst call tree you could get was: - u_transfer_inline_write_vtbl - u_default_transfer_inline_write - u_transfer_map_vtbl - driver_transfer_map - u_transfer_unmap_vtbl - driver_transfer_unmap That's 6 indirect calls. Some drivers only had 5. The goal is to have 1 indirect call for drivers that care. The resource type can be determined statically at most call sites. The new interface is: pipe_context::buffer_subdata(ctx, resource, usage, offset, size, data) pipe_context::texture_subdata(ctx, resource, level, usage, box, data, stride, layer_stride) v2: fix whitespace, correct ilo's behavior Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Acked-by: Roland Scheidegger <sroland@vmware.com>	2016-07-23 13:33:42 +02:00
Rob Clark	4500d17245	freedreno: fix multi-layer transfer_map's The use of transfer_inline_write() in TexSubImage path (see `fb9fe352ea`) exposed a bug for "layer_first" resources (ie. a4xx) not setting correct layer_stride. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-05-11 12:03:21 -04:00
Thomas Hindoe Paaboel Andersen	3a6763f0a0	freedreno: remove null check before free Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2016-05-05 09:34:01 +02:00
Rob Clark	2c8674f5a9	freedreno: honor handle->offset Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-04-25 16:16:22 -04:00
Marek Olšák	82db518f15	gallium: add external usage flags to resource_from(get)_handle (v2) This will allow drivers to make better decisions about texture sharing for DRI2, DRI3, Wayland, and OpenCL. v2: add read/write flags, take advantage of __DRI_IMAGE_USE_BACKBUFFER Reviewed-by: Axel Davy <axel.davy@ens.fr>	2016-03-09 15:02:25 +01:00
Serge Martin	0149e7a944	freedreno: change to goto fail in fd_resource_transfer_map, like the others error cases Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-12-09 17:31:16 -05:00
Ilia Mirkin	93905a8df1	freedreno/a4xx: add astc formats Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-23 11:17:15 -05:00
Ilia Mirkin	ecb0dcd34c	freedreno/a4xx: only align slices in non-layer_first textures When layer is the container, slices are tightly packed inside of each layer. We don't need any additional alignment. On a3xx, each slice contains all the layers, so having alignment makes sense. This fixes a whole slew of array-related piglits, including texelFetch and tex-miplevel-selection varieties. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: mesa-stable@lists.freedesktop.org	2015-11-21 09:08:16 -05:00
Ilia Mirkin	fe29330406	freedreno/a4xx: use hardware RGTC texture samplers a4xx hardware has real support for RGTC so there's no need to fake it like we do on a3xx. Undo the hacks, and keep track of an "internal format" of a resource, which on a3xx will be different, triggering the transfer-time conversions to take place. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-20 19:46:21 -05:00
Ilia Mirkin	d69e557f2a	freedreno: add support for conditional rendering, required for GL3.0 A smarter implementation would make it possible to attach this to emit state for the BY_REGION versions to avoid breaking the tiling. But this is a start. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 14:31:13 -05:00
Ilia Mirkin	059da344ec	freedreno/a3xx: add fake RGTC support (required for GL3) Also throw in LATC while we're at it (same exact format). This could be made more efficient by keeping a shadow compressed texture to use for returning at map time. However... it's not worth it for now... presumably compressed textures are not updated often. Lastly fix up Z32S8 transfers to non-0 layers. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 14:31:13 -05:00
Ilia Mirkin	581cbfdec1	freedreno/a3xx: fix up logic for handling block formats This only appears in cubemaps which have have packed layers, so are very sensitive to any layout disagreement between sw and hw. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-08-17 11:38:38 -04:00
Ilia Mirkin	b4ace13eea	freedreno/a4xx: add cube map array support Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-08-15 14:05:37 -04:00
Rob Clark	5ca032a9a8	freedreno: simplify/cleanup resource status tracking Collapse dirty/reading bools into status bitmask (and drop writing which should really be the same as dirty). And use 'used_resources' list for all tracking, including zsbuf/cbufs, rather than special casing the color and depth/stencil buffers. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-08-04 16:03:45 -04:00
Rob Clark	be8a8ebe57	freedreno: add transform-feedback state Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-07-27 13:51:06 -04:00
Rob Clark	bda1354aac	freedreno: add resource tracking support for written buffers With stream-out (transform-feedback) we have the case where resources are written by the gpu, which needs basically the same tracking to figure out when rendering must be flushed. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-07-27 13:51:06 -04:00
Rob Clark	422296e38d	freedreno: fix crash in fd_invalidate_resource() Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-07-10 11:57:30 -04:00
Ilia Mirkin	9fc3f47278	freedreno/a3xx: add support for S8 and Z32F_S8 Enables ARB_depth_buffer_float. There is no sampling support for interleaved Z32F_S8, so we store the two textures separately, one as Z32F, the other as S8. As a result, we need a lot of additional logic for restores and transfers. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-04-27 20:17:07 -04:00
Ilia Mirkin	0a4cb00c77	freedreno: add fd_transfer to wrap around pipe_transfer Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-04-27 20:17:07 -04:00
Ilia Mirkin	14dfd8cc43	freedreno: dirty context when reallocating a bound bo Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-04-05 16:36:35 -04:00
Ilia Mirkin	bde2045fa2	freedreno: keep track of buffer valid ranges Copies nouveau_buffer and radeon_buffer. This allows a write to proceed to an uninitialized part of a buffer even when the GPU is using the previously-initialized portions. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-04-05 16:36:35 -04:00
Ilia Mirkin	dacf22e0a3	freedreno: mark resources as being read so that writes flush the queue Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-04-05 16:36:34 -04:00
Ilia Mirkin	1fee3061d5	freedreno: add a reading flag to indicate gpu is reading rsc Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-04-05 16:36:34 -04:00
Ilia Mirkin	ea0952a9db	freedreno: fix resource flushing confusion A resource flush is an upload of a hypothetically-staging texture to the GPU. For a UMA system, this will largely be a no-op or cache-maintenance. Move the render flush logic into transfer_map where it belongs, and clear out the transfer_flush function. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-04-05 16:36:34 -04:00
Ilia Mirkin	738c8319ac	freedreno/a3xx: fix 3d texture layout The SZ2 field contains the layer size of a lower miplevel. It only contains 4 bits, which limits the maximum layer size it can describe. In situations where the next miplevel would be too big, the hardware appears to keep minifying the size until it hits one of that size. Unfortunately the hardware's ideas about sizes can differ from freedreno's which can still lead to issues. Minimize those by stopping to minify as soon as possible. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.4 10.5" <mesa-stable@lists.freedesktop.org>	2015-03-28 14:54:41 -04:00
Ilia Mirkin	620e29b748	freedreno: fix slice pitch calculations For example if width were 65, the first slice would get 96 while the second would get 32. However the hardware appears to expect the second pitch to be 64, based on halving the 96 (and aligning up to 32). This fixes texelFetch piglit tests on a3xx below a certain size. Going higher they break again, but most likely due to unrelated reasons. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.4 10.5" <mesa-stable@lists.freedesktop.org> Reviewed-by: Rob Clark <robclark@freedesktop.org>	2015-03-13 16:05:16 -04:00
Ilia Mirkin	89b26d5a36	freedreno/a3xx: use the same layer size for all slices We only program in one layer size per texture, so that means that all levels must share one size. This makes the piglit test bin/texelFetch fs sampler2DArray have the same breakage as its non-array version instead of being completely off, and makes bin/ext_texture_array-gen-mipmap start passing. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.4 10.5" <mesa-stable@lists.freedesktop.org> Reviewed-by: Rob Clark <robclark@freedesktop.org>	2015-03-13 16:05:16 -04:00
Rob Clark	0ebd623f60	freedreno/a4xx: mipmaps Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-12-13 15:09:37 -05:00
Rob Clark	1e3a732603	freedreno/a4xx: texture fixes Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-12-09 18:01:49 -05:00
Rob Clark	5d7c9c9160	freedreno: cleanup slice alignment/setup Collapse things back into a setup_slices() which takes the desired alignment as a param. This gets things ready for a4xx which has some slightly different requirements. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-12-09 18:01:21 -05:00
Rob Clark	6eabc11936	freedreno: fix PIPE_TRANSFER_DISCARD_WHOLE_RESOURCE fd_bo_cpu_prep() doesn't realize the bo is already referenced in unflushed cmdstream. It could be made to do so (but would have to be implemented twice, ie. both for msm and kgsl). But we still can't do the expected thing if the caller isn't using _NOSYNC. Because of the way the tiling works, we need to build quite a bit of cmdstream at flush time, which is not possible to do at the libdrm level. So rather than trying to make fd_bo_cpu_prep() smarter than it can possibly be, just always discard and reallocate if the PIPE_TRANSFER_DISCARD_WHOLE_RESOURCE flag is set. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-10-23 10:46:51 -04:00
Rob Clark	74069e324e	freedreno/a3xx: more layer/level fixes Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-10-20 21:42:44 -04:00
Rob Clark	dd332fe641	freedreno: fix layer_stride Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-10-15 15:49:48 -04:00
Rob Clark	d5d80b3739	freedreno/a3xx: refactor vertex state emit Get rid of fd3_vertex_buf and use fd_vertex_state directly for all draws. Removes a tiny bit of CPU overhead for munging around the vertex state every time it is emitted, but more importantly it cleans things up for later optimizations, so the emit paths don't have to special case internal draws (gmem<->mem, clears, etc) with regular draws. Instead of constructing fd3_vertex_buf array each time for internal draws, and context init time pre-create solid_vbuf_state and blit_vbuf_state. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-10-15 15:49:48 -04:00
Rob Clark	ca29c4c3b0	freedreno/a3xx: 3d/array textures Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-09-13 15:31:58 -04:00
Rob Clark	b40a6c2b17	freedreno: implement pipe_flush_resource() Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-08-24 13:09:00 -04:00
Rob Clark	478a08ebd2	freedreno: don't ignore src/dst level Don't ignore src/dst_level in pipe_copy_region. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-08-24 13:08:14 -04:00
Rob Clark	286863939f	freedreno: few caps fixes Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-05-26 08:56:27 -04:00
Rob Clark	b8f78e1890	freedreno: add support for hw queries Real GPU queries need some infrastructure to track samples per tile and accumulate the results. But fortunately this can be shared across GPU generation. See: https://github.com/freedreno/freedreno/wiki/Queries#hardware-queries Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-05-13 18:33:19 -04:00
Rob Clark	0a1e4361e8	freedreno/resource: fail more gracefully Fail more gracefully when buffer allocation/import fails. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-10-24 20:21:08 -04:00
Marek Olšák	419cd5f2a2	gallium: add flush_resource context function r600g needs explicit flushing before DRI2 buffers are presented on the screen. v2: add (stub) implementations for all drivers, fix frontbuffer flushing v3: fix galahad Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2013-09-20 20:35:55 +02:00
Rob Clark	575a6e7ec5	freedreno: fix glReadPixels duh, we still need to flush if there are pending draws and it isn't an unsynchronized case. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-09-19 11:45:01 -04:00
Rob Clark	ffa3244534	freedreno: PIPE_TRANSFER_DISCARD_WHOLE_RESOURCE When the old contents do not need to be preserved, it is faster to create a new backing bo rather than stall. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-09-14 13:31:58 -04:00
Rob Clark	cb9e07aa84	freedreno: multi-slice resources (cubemap, mipmap, etc) Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-09-14 13:31:58 -04:00
Rob Clark	e95b7d89b9	freedreno: updates for msm drm/kms driver There where some small API tweaks in libdrm_freedreno to enable support for msm drm/kms driver. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-08-29 17:35:05 -04:00
Rob Clark	18c317b21d	freedreno: prepare for a3xx Split the parts that are specific to adreno a2xx series GPUs from the parts that will be in common with a3xx, so that a3xx support can be added more cleanly. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-06-08 13:15:51 -04:00
Rob Clark	97fa811d14	freedreno: implement pipe->resource_copy_region() Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-05-23 14:35:21 -04:00
Rob Clark	73de07cbbc	freedreno: use writecombine buffers Better than uncached for writes, which are common for vertex buffer upload, etc. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-04-25 15:10:56 -04:00
Rob Clark	eec37f1cdc	freedreno: use u_math macros/helpers more Get rid of a few self-defined macros: ALIGN() -> align() min() -> MIN2() max() -> MAX2() Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-04-24 21:09:46 -04:00
Rob Clark	732b0b5ebc	freedreno: track maximal scissor bounds Optimize out parts of the render target that are scissored out by taking into account maximal scissor bounds in fd_gmem_render_tiles(). This is a big win on things like gnome-shell which frequently do partial screen updates. Signed-off-by: Rob Clark <robdclark@gmail.com>	2013-03-25 13:05:44 -04:00
Rob Clark	eab8d6cbdb	freedreno: add pipe->blit Signed-off-by: Rob Clark <robdclark@gmail.com>	2013-03-21 17:33:51 -04:00
Rob Clark	6173cc19c4	freedreno: gallium driver for adreno Currently works on a220. Others in the a2xx family look pretty similar and should be pretty straightforward to support with the same driver. The a3xx has a new shader ISA, and while many registers appear similar, the register addresses have been completely shuffled around. I am not sure yet whether it is best to support with the same driver, but different compiler, or whether it should be split into a different driver. v1: original v2: build file updates from review comments, and remove GPL licensed header files from msm kernel v3: smarter temp/pred register assignment, fix clear and depth/stencil format issues, resource_transfer fixes, scissor fixes Signed-off-by: Rob Clark <robdclark@gmail.com>	2013-03-11 21:53:24 -04:00

... 2 3 4 5 6

264 Commits