KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Marek Olšák	4a883966c1	gallium: remove PIPE_CAP_USER_INDEX_BUFFERS all drivers support it Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Brian Paul <brianp@vmware.com> Tested-by: Brian Paul <brianp@vmware.com> (VMware driver only)	2017-02-25 00:03:09 +01:00
Marek Olšák	7fff5b77f1	freedreno: add support for user index buffers Reviewed-by: Brian Paul <brianp@vmware.com>	2017-02-25 00:03:09 +01:00
Marek Olšák	24847dd1b5	gallium/u_queue: isolate util_queue_fence implementation it's cleaner this way. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-02-22 20:26:39 +01:00
Marek Olšák	55ad59d2b7	gallium: set pipe_context uploaders in drivers (v3) Notes: - make sure the default size is large enough to handle all state trackers - pipe wrappers don't receive transfer calls from stream_uploader, because pipe_context::stream_uploader points directly to the underlying driver's stream_uploader (to keep it simple for now) v2: add error handling to nv50, nvc0, noop v3: set const_uploader Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Tested-by: Edmondo Tommasina <edmondo.tommasina@gmail.com> (v1) Tested-by: Charmaine Lee <charmainel@vmware.com>	2017-02-14 21:46:16 +01:00
Ilia Mirkin	b090033087	gallium: add separate PIPE_CAP_INT64_DIVMOD Nouveau does not currently have logic to implement this as a library function. Even though such a library could be written, there's no big advantage to do it that way for now given that int64 is a very uncommon use-case. Allow a driver to expose INT64 without supporting division and modulo operations. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-02-09 12:57:21 -05:00
Nicolai Hähnle	a020cb3a72	gallium: turn PIPE_SHADER_CAP_DOUBLES into a screen capability Make the cap consistent with PIPE_CAP_INT64. Aside from the hypothetical case of using draw for vertex shaders (and actually caring about doubles...), every implementation supports doubles either nowhere or everywhere. Also, st/mesa didn't even check the cap correctly in all supported shader stages. While at it, add a missing LLVM version check for 64-bit integers in radeonsi. This is conservative: judging by the log, LLVM 3.8 might be sufficient, but there are probably bugs that have been fixed since then. v2: fix clover (Marek) Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-02-02 16:53:42 +01:00
Emil Velikov	a922c82125	freedreno: automake: correctly set MKDIR_GEN Analogous to previous commit. Fixes: `4610e5ef28` "freedreno/ir3: fix sin/cos" Cc: "12.0 13.0" <mesa-dev@lists.freedesktop.org> Cc: Rob Clark <robclark@freedesktop.org> Cc: Nicolas Dechesne <nicolas.dechesne@linaro.org> Reported-by: Nicolas Dechesne <nicolas.dechesne@linaro.org> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Tested-by: Nicolas Dechesne <nicolas.dechesne@linaro.org>	2017-01-27 17:56:55 +00:00
Dave Airlie	f804506d4d	gallium: Add integer 64 capability v1.1: move to using a normal CAP. (Marek) v2: fill in the cap everywhere Signed-off-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-01-27 10:19:25 +01:00
Ilia Mirkin	6e40938fbc	gallium: add PIPE_CAP_TGSI_MUL_ZERO_WINS Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Axel Davy <axel.davy@ens.fr>	2017-01-23 20:36:47 -05:00
Rob Clark	31daeb5bf1	freedreno/a5xx: set frag shader threadsize Signed-off-by: Rob Clark <robdclark@gmail.com> Cc: "17.0" <mesa-stable@lists.freedesktop.org>	2017-01-22 14:12:05 -05:00
Rob Clark	8d6af93e76	freedreno/a5xx: set fragcoordxy properly What a3xx docs call IJPERSPCENTERREGID.. the xy coord passed into bary.f. We were incorrectly setting both this and gl_FragCoord.xy to the same register resulting in all sorts of hilarity. Fixes stk, vdrift, 0ad, probably a bunch others. Signed-off-by: Rob Clark <robdclark@gmail.com> Cc: "17.0" <mesa-stable@lists.freedesktop.org>	2017-01-22 14:11:43 -05:00
Rob Clark	278b97946f	freedreno/ir3: setup var locations in standalone compiler Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-01-22 14:11:26 -05:00
Rob Clark	6cc93bedc1	freedreno/a5xx: fix psize Note spritelist (POINTLIST_PSIZE) seems not to be a thing anymore on a5xx. Signed-off-by: Rob Clark <robdclark@gmail.com> Cc: "17.0" <mesa-stable@lists.freedesktop.org>	2017-01-22 14:11:15 -05:00
Rob Clark	141a4f86d6	freedreno/a5xx: srgb fix Signed-off-by: Rob Clark <robdclark@gmail.com> Cc: "17.0" <mesa-stable@lists.freedesktop.org>	2017-01-22 14:11:04 -05:00
Rob Clark	69fbb458cf	freedreno/a5xx: fix int vbos Signed-off-by: Rob Clark <robdclark@gmail.com> Cc: "17.0" <mesa-stable@lists.freedesktop.org>	2017-01-22 14:10:54 -05:00
Rob Clark	16671e9704	freedreno/a5xx: fix clear for uint/sint formats Signed-off-by: Rob Clark <robdclark@gmail.com> Cc: "17.0" <mesa-stable@lists.freedesktop.org>	2017-01-22 14:10:42 -05:00
Rob Clark	4d9aa4f67d	freedreno/a5xx: fix cull state Signed-off-by: Rob Clark <robdclark@gmail.com> Cc: "17.0" <mesa-stable@lists.freedesktop.org>	2017-01-22 14:10:28 -05:00
Rob Clark	4c39458460	freedreno: update generated headers Signed-off-by: Rob Clark <robdclark@gmail.com> Cc: "17.0" <mesa-stable@lists.freedesktop.org>	2017-01-22 14:09:45 -05:00
Ilia Mirkin	ee3ebe68f9	gallium: add PIPE_CAP_TGSI_FS_FBFETCH Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-16 21:13:09 -05:00
Rob Clark	99e9dca149	freedreno: add "nogrow" debug param Sometimes it is useful to disable the "growable" cmdstream buffers for debugging. (See 419a154d in libdrm) Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-01-10 19:40:00 -05:00
Rob Clark	a43f3b895c	freedreno/a5xx: remove hack for glamor Now that issues glamor was hitting w/ glsl>=130 (aka missing INSTANCED bit in vertex attribute state) is fixed, remove hack. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-01-10 19:40:00 -05:00
Rob Clark	3c71853c9a	freedreno/a5xx: fixed instanced Add missing bit, now that we know where it is. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-01-10 19:40:00 -05:00
Rob Clark	b48fde1576	freedreno/a5xx: use the non-_ZERO_BASE for vertexid Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-01-10 19:40:00 -05:00
Rob Clark	730c3047f0	freedreno/a5xx: add texture MIPLVLS Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-01-10 19:40:00 -05:00
Rob Clark	1a5d0818df	freedreno/a5xx: fix fragcoord related hangs Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-01-10 19:40:00 -05:00
Rob Clark	ff81c3c9fd	freedreno: update generated headers Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-01-10 19:40:00 -05:00
Marek Olšák	a4ace98a97	gallium: remove TGSI_OPCODE_ABS It's redundant with the source modifier. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-05 18:30:00 +01:00
Marek Olšák	e51baeb6c1	gallium: add PIPE_CAP_GLSL_OPTIMIZE_CONSERVATIVELY Drivers with good compilers don't need aggressive optimizations before TGSI. Reviewed-by: Eric Anholt <eric@anholt.net>	2017-01-05 13:07:12 +01:00
Rob Clark	832dddcf91	freedreno/ir3: rework varying slots (maybe??) See: dEQP-GLES2.functional.shaders.swizzles.vector_swizzles.mediump_vec2_yyyy_fragment if we only access (in FS) varying.y then it ends up in slot zero.. I'm not sure the hw likes that.. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-30 13:49:57 -05:00
Jason Ekstrand	fb181196de	nir: Rename convert_to_ssa lower_regs_to_ssa This matches the naming of nir_lower_vars_to_ssa, the other to-SSA pass.	2016-12-29 16:02:44 -08:00
Rob Clark	ec01ef2db1	freedreno/ir3: fix linkage::var size It should actually be 32 for a4xx/a5xx.. we still only advertise 16 but for a5xx the linkage map includes position/psize. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-27 16:54:01 -05:00
Rob Clark	c416ea31cf	freedreno/ir3: treat clipvertex like a normal varying We need this in case it is streamed out. Not sure why we were treating it specially before. Having it as a VS out is harmless if FS doesn't have a matching input. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-27 16:54:01 -05:00
Rob Clark	d10c5a2481	freedreno/a5xx: transform-feedback support We'll need to revisit when adding hw binning pass support, whether we can still do this in main draw step, as we do w/ a3xx/a4xx, or if we needed to move it to the binning stage. Still some failing piglits but most tests pass and the common cases seem to work. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-27 16:54:01 -05:00
Rob Clark	928e9bd602	freedreno: update generated headers Pull in a5xx streamout related regs. Also fixes a couple incorrect register definitions. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-27 16:54:01 -05:00
Rob Clark	6d77ceb701	freedreno/ir3: UBO support for 64b GPUs (a5xx) Update address calculation to support 64b addresses. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-27 16:54:01 -05:00
Rob Clark	fc10dc9fde	freedreno/ir3: rework location of driver constants Rework how we lay out driver constants (driver-params, UBO/TFBO buffer addresses, immediates) for more flexibility. For a5xx+ we need to deal with the fact that gpu ptrs are 64b instead of 32b, which makes the fixed offset scheme not work so well. While we are dealing with that we might also make the layout more dynamic to account for varying # of UBOs, etc. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-27 16:54:01 -05:00
Rob Clark	09202cde7e	freedreno/a5xx: fix emit for bo addresses Reloc for the buffer address is two dwords on 64b devices (a5xx+) Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-27 16:54:01 -05:00
Rob Clark	f043904080	freedreno/a5xx: texture layout Seems to be imilar to a4xx, and sampler state "array-pitch" needs to be aligned to page size. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-27 16:54:01 -05:00
Rob Clark	2c0dfd48f0	freedreno/a5xx: border color support Not 100% sure it works if you have border color in VS.. but it might be right. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-18 13:49:45 -05:00
Rob Clark	939486d3d3	freedreno/a5xx: use MRT0 to import linear zs A bit of a hack, but we need to do this until we can do tiled zs in sysmem (and associated tile/until blits for transfer_map). Fixes xonotic and glmark2 "refract", when reorder wasn't enabled. (reorder would paper over the issue by avoiding the extra round- trip to system memory and back to gmem. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-18 13:48:10 -05:00
Rob Clark	bea8602e5b	freedreno: fdN_gmem_restore_format() is not gen specific Refactor out into a common helper, since this is the same across generations when we need equiv z/s gmem restore format. Next patch needs this in a5xx, rather than creating yet another helper push this into core. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-18 13:48:03 -05:00
Rob Clark	6f93c75a47	freedreno/a5xx: cargo-cult end-batch sequence more faithfully Fixes some issues at least with GMEM bypass mode, where we'd sometimes end up with some FS quads not hitting memory. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-18 13:47:54 -05:00
Rob Clark	d35022f24d	freedreno/a5xx: misc fix Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-18 13:47:47 -05:00
Rob Clark	651f2655a8	freedreno/a5xx: fix (at least some) vtx formats Swap/component-order doesn't seem to be quite what that is. At least blob was always setting it to XYZW ('11') but we weren't. Causing problems w/ formats like sint16.. Hard-coding this instead at least seems to get glamor working. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-18 13:47:38 -05:00
Rob Clark	2540226f66	freedreno/a5xx: more formats Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-18 13:47:31 -05:00
Rob Clark	c768461c1f	freedreno/a5xx: fixup caps Might not be 100% accurate, mostly just copy from a4xx to get started. We are defn lying about occlusion query at this point (not implemented yet) but need it to expose anything higher than gl1.4 (glamor needs gl2.1) Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-18 13:47:18 -05:00
Rob Clark	abcf8f5b58	freedreno/a5xx: fix random faults on first sysmem draw Not sure what this event is, but blob writes it.. and it seems to solve random write faults at mystery address that would sometimes happen on first BYPASS draw. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-18 13:47:08 -05:00
Rob Clark	54537fa1dc	freedreno: update generated headers Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-18 13:47:00 -05:00
Rob Clark	5e632b3a83	freedreno/a5xx: fix stride/size for mem->gmem blits <brownpaperbag>these should be the in-GMEM dimensions</brownpaperbag> Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-18 13:46:48 -05:00
Ilia Mirkin	fd249c803e	treewide: s/comparitor/comparator/ git grep -l comparitor \| xargs sed -i 's/comparitor/comparator/g' Just happened to notice this in a patch that was sent and included one of the tokens in question. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Acked-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-12-12 22:13:07 -05:00
Rob Clark	a9383ae6d6	freedreno/a5xx: fix draw packet size with index buffer gpuaddr of idx buffer is now two dwords (64b). Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-06 18:01:31 -05:00
Rob Clark	ec24f009ca	freedreno/a5xx: gmem bypass mode Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-06 18:01:31 -05:00
Rob Clark	85a3057f65	freedreno/a5xx: fix emit_string_marker() Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-06 18:01:31 -05:00
Rob Clark	c1e9cca696	freedreno: pitch alignment should match gmem alignment Deal w/ differing gmem tile size alignment between generations, and make sure texture pitch matches. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-06 18:01:31 -05:00
Rob Clark	8f4da2ff63	freedreno/a5xx: more formats Bunch of stuff we can at least turn on for vbo formats. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-06 18:01:31 -05:00
Rob Clark	b337099849	freedreno/a5xx: fix fragface Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-06 18:01:31 -05:00
Rob Clark	f143eeaffa	freedreno/a5xx: fix fragcoord Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-06 18:01:31 -05:00
Rob Clark	f5c5f76255	freedreno: update generated headers Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-06 18:01:31 -05:00
Rob Clark	3ec4d1f809	freedreno/a5xx: fix alpha test GRAS_SU_DEPTH_PLANE_CNTL doesn't in fact seem to be anything to do with alpha test. This fixes xonotic and (other than some iommu faults) gets gnome-shell working. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-06 18:01:31 -05:00
Rob Clark	2b305725e2	freedreno/a5xx: fix VPC_VAR[n].DISABLE bits We don't need varying interpolators enabled for pos/psize out of the VS (despite the fact that they show up in VS_OUT map), so emit these before we append pos/psize to the linkage. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-06 18:01:31 -05:00
Rob Clark	534917495d	freedreno: no-op render when we need a fence If app tries to create a fence but there is no rendering to submit, we need a dummy/no-op submit. Use a string-marker for the purpose.. mostly since it avoids needing to realize that the packet format changes in later gen's (so one less place to fixup for a5xx). Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-01 20:24:59 -05:00
Rob Clark	0b98e84e9b	freedreno: native fence fd support Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-01 20:24:46 -05:00
Rob Clark	16f6ceaca9	freedreno: some fence cleanup Prep-work for next patch, mostly move to tracking last_fence as a pipe_fence_handle (created now only in fd_gmem_render_tiles()), and a bit of superficial renaming. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-01 20:16:31 -05:00
Rob Clark	026a7223a6	gallium: support for native fence fd's This enables gallium support for EGL_ANDROID_native_fence_sync, for drivers which support PIPE_CAP_NATIVE_FENCE_FD. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-12-01 20:16:31 -05:00
Rob Clark	45eef9af03	freedreno/a5xx: fix negative branches Looks like immed branch offset size increased again.. making what we think is a small negative number look to hw like a huge positive number. And things go badly when shader tries to jump to hyperspace. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-11-30 17:32:54 -05:00
Rob Clark	ef30e91fe6	freedreno: fix android build with a5xx Android doesn't build all the files that normal linux/autotools build does (mainly standalond ir3_compiler).. but possibly we should pull C_SOURCES + aNxx_SOURCES into a single variable picked up by both Android.mk and Makefile.am? (Suggested by Rob H.) Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-11-30 17:32:54 -05:00
Rob Clark	8b6f8f2576	freedreno/a5xx: fix discard Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-11-30 17:32:54 -05:00
Rob Clark	946cf4eb68	freedreno/a5xx: initial support Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-11-30 12:35:49 -05:00
Rob Clark	fcba3046e1	freedreno: update generated headers Pull in a5xx Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-11-30 12:25:48 -05:00
Rob Clark	8c56789f60	freedreno: make gmem tile size alignment configurable a5xx seems to prefer 64 pixel alignment, in at least some cases. Make this configurable per generation. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-11-30 12:25:48 -05:00
Rob Clark	728e2c4d38	freedreno/ir3: don't offset inloc by 8 On a3xx/a4xx, the SP_VS_VPC_DST_REG.OUTLOCn is offset by 8, so we used to add this offset into fs->inputs[n].inloc. But a5xx drops this extra offset-by-8. So instead make inloc zero based and add the offset when we emit OUTLOCn values (for the gen's that need the offset). Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-11-30 12:25:48 -05:00
Rob Clark	7a59157287	freedreno/a3xx: use new shader linkage helper Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-11-30 12:25:48 -05:00
Rob Clark	98c83b5d1c	freedreno/a4xx: use new shader linkage helper Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-11-30 12:25:48 -05:00
Rob Clark	1be5670c8d	freedreno/ir3: add new helper for shader linkage Helps simplify things on a5xx, where pos/psize get added to the vs-out map. And anyways, simplifies a3xx and a4xx. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-11-30 12:25:48 -05:00
Nicolai Hähnle	611166b8ed	gallium: add PIPE_CAP_TGSI_CAN_READ_OUTPUTS Drivers that support this benefit by saving one lowering pass in the GLSL-to-TGSI conversion. radeonsi already supports this because all outputs are stored in temporary variables before the export (except for TCS outputs, which have always been readable in TGSI anyway due to their special semantics). Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-11-30 09:09:50 +01:00
Rob Clark	8cb965b112	freedreno: fix slice size for imported buffers Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-11-27 17:26:05 -05:00
Rob Clark	f4ffe2786b	freedreno/a3xx: make _emit_const() static Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-11-27 17:26:05 -05:00
Rob Clark	b8b800d18a	freedreno/a4xx: make _emit_const() static Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-11-27 17:26:05 -05:00
Marek Olšák	72217d4335	gallium: add PIPE_SHADER_CAP_LOWER_IF_THRESHOLD Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-11-15 20:23:40 +01:00
Rob Clark	dfc001dccc	freedreno/ir3: fixup ralloc fallout Fixes fallout from `acc23b04` ("ralloc: remove memset from ralloc_size"). We were still depending on zero'd allocations in a couple of places. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-11-12 08:57:03 -05:00
Marek Olšák	52d2b28f7f	ralloc: use rzalloc where it's necessary No change in behavior. ralloc_size is equivalent to rzalloc_size. That will change though. Calls not switched to rzalloc_size: - ralloc_vasprintf - glsl_type::name allocation (it's filled with snprintf) - C++ classes where valgrind didn't show uninitialized values I switched most of non-glsl stuff to rzalloc without checking whether it's really needed. Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net> Tested-by: Edmondo Tommasina <edmondo.tommasina@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-31 11:53:38 +01:00
Timothy Arceri	e1af20f18a	nir/i965/anv/radv/gallium: make shader info a pointer When restoring something from shader cache we won't have and don't want to create a nir_shader this change detaches the two. There are other advantages such as being able to reuse the shader info populated by GLSL IR. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-10-26 14:29:36 +11:00
Ilia Mirkin	3fdeb7c983	gallium: add PIPE_CAP_STREAM_OUTPUT_INTERLEAVE_BUFFERS This allows the driver to signal that it can't handle random interleaving of attributes across buffers. This is required for ARB_transform_feedback3, and it's initialized to whatever the previous value of PIPE_CAP_STREAM_OUTPUT_PAUSE_RESUME was except for nv50 where it is disabled. Note that the proprietary drivers never expose ARB_transform_feedback3 on any GT21x's (where nouveau previously did), and after some effort I was unable to get it to work. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-10-22 12:02:35 -04:00
Nicolai Hähnle	700a571f89	gallium: add PIPE_CAP_TGSI_ARRAY_COMPONENTS This is a screen cap because drivers are expected to support it either for all shader types or for none of them. Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net> Reviewed-by: Dave Airlie <airlied@redhat.com>	2016-10-12 18:50:10 +02:00
Rob Clark	3ebfc44b42	freedreno: don't try to shadow layered textures We will only hit this with multi-planar YUV external images, so we would probably never hit this code path in the first place. But if we did, it wouldn't do the right thing so just bail. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-10-07 15:50:46 -04:00
Rob Clark	f88f025e8c	freedreno/a3xx+a4xx: fix clip-plane lowering state If enabled clip-planes have changed, we need to mark program state dirty. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-10-07 15:50:46 -04:00
Jason Ekstrand	2ed17d46de	nir: Make nir_foo_first/last_cf_node return a block instead One of NIR's invariants is that control flow lists always start and end with blocks. There's no good reason why we should return a cf_node from these functions since we know that it's always a block. Making it a block lets us remove a bunch of code. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2016-10-06 09:16:37 -07:00
Nicolai Hähnle	0334ba150f	freedreno: use the new parent/child pools for transfers Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-10-05 15:42:17 +02:00
Jason Ekstrand	ed65e6ef49	nir: Add a flag to lower_io to force "sample" interpolation Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-09-15 13:31:43 -07:00
Kenneth Graunke	2d8a3fa7ea	nir: Report progress from nir_lower_phis_to_scalar. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2016-09-14 12:01:51 -07:00
Kenneth Graunke	32630e211e	nir: Report progress from nir_lower_alu_to_scalar. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2016-09-14 12:01:49 -07:00
Ilia Mirkin	148fbf32a8	freedreno/a3xx: disable filtering for texture buffers and int textures Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-09-11 13:14:06 -04:00
Marek Olšák	5981ab5445	gallium: remove PIPE_BIND_TRANSFER_READ/WRITE not used in any useful way Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2016-09-08 22:51:33 +02:00
Rob Clark	32c061b110	freedreno: reject imports with bogus pitch Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-09-07 11:41:38 -04:00
Marek Olšák	e7a73b75a0	gallium: switch drivers to the slab allocator in src/util	2016-09-06 14:24:04 +02:00
Ilia Mirkin	ca313e00b6	a3xx: use window scissor to simulate viewport xy clip Unfortunately a3xx does not have a separate disable for depth clipping, so when depth clamp is enabled, we disable the whole 3d clipper logic. This in turn also gets rid of the xy clip that it would normally do. When we detect this would happen, instead we integrate the viewport into the window scissor. This may have slightly different behavior around wide points, but it's unlikely that anything depends on this. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=97231 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: mesa-stable@lists.freedesktop.org	2016-09-03 19:58:42 -04:00
Ilia Mirkin	83d7230fd5	a3xx: make use of software clipping when hw can't handle it The hw clipper only handles up to 6 UCPs. If there are more than 6 UCPs, or a clip vertex, or clip distances are in use, then we must use the fallback discard-based clipping from the frag shader. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: mesa-stable@lists.freedesktop.org	2016-09-03 19:58:42 -04:00
Ilia Mirkin	dac72234c7	a3xx: make sure to actually clamp depth as requested We were previously ... not clamping. I guess this meant that everything got clamped to 1/0, which was enough to pass the existing tests. Or perhaps the clamping would only happen to the rasterized depth value and not the frag shader's output depth value. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=97231 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: mesa-stable@lists.freedesktop.org	2016-09-03 19:58:42 -04:00
Eric Engestrom	3bd885d09c	Introduce .editorconfig A few weeks ago, Jose Fonseca suggested [0] we use .editorconfig files to try and enforce the formatting of the code, to which Michel Dänzer suggested [1] we start by importing the existing .dir-locals.el settings. The first draft was discussed in the RFC [2]. These .editorconfig are a first step, one that has the advantage of requiring little to no intervention from the devs once the settings files are in place, but the settings are very limited. This does have the advantage of applying while the code is being written. This doesn't replace the need for more comprehensive formatting tools such as clang-format & clang-tidy, but those reformat the code after the fact. [0] https://lists.freedesktop.org/archives/mesa-dev/2016-June/121545.html [1] https://lists.freedesktop.org/archives/mesa-dev/2016-June/121639.html [2] https://lists.freedesktop.org/archives/mesa-dev/2016-July/123431.html Acked-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Acked-by: Eric Anholt <eric@anholt.net> Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2016-08-31 17:06:54 -07:00
Kai Wasserbäch	532db3b788	gallium: Use enum pipe_shader_type in set_sampler_views() Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2016-08-29 09:07:25 -06:00
Kai Wasserbäch	7413625ad3	gallium: Use enum pipe_shader_type in bind_sampler_states() (v2) v1 → v2: - Fixed indentation (noted by Brian Paul) - Removed second assert from nouveau's switch statements (suggested by Brian Paul) Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2016-08-29 08:45:48 -06:00
Kenneth Graunke	93bfa1d7a2	nir: Change nir_shader_get_entrypoint to return an impl. Jason suggested adding an assert(function->impl) here. All callers of this function actually want ->impl, so I decided just to change the API. We also change the nir_lower_io_to_temporaries API here. All but one caller passed nir_shader_get_entrypoint(), and with the previous commit, it now uses a nir_function_impl internally. Folding this change in avoids the need to change it and change it back. v2: Fix one call I missed in ir3_compiler (caught by Eric). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2016-08-25 19:18:24 -07:00
Ilia Mirkin	9515d651f9	gallium: add a cap to expose whether driver supports mixed color/zs bits Some hardware can't render to color/depth buffers of mixed bitness. When that happens a fallback has to happen, but this allows the driver to express that this isn't an optimal scenario. The purpose of this is to remove such fbconfigs from the GLX/EGL config list. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-08-23 18:30:49 -04:00
Ilia Mirkin	89f00f749f	a4xx: make sure to actually clamp depth as requested We were previously ... not clamping. I guess this meant that everything got clamped to 1/0, which was enough to pass the existing tests. Or perhaps the clamping would only happen to the rasterized depth value and not the frag shader's output depth value. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=97231 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: mesa-stable@lists.freedesktop.org	2016-08-19 19:40:04 -04:00
Ilia Mirkin	cd8e30452f	a4xx: only disable depth clipping, not all clipping, when requested The previous bit disables the whole clipper, including the regular viewport-related clipping that would go on. The two new bits disable near and far clipping (separately, as verified with the depth-clamp-range piglit). Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: mesa-stable@lists.freedesktop.org	2016-08-19 19:40:04 -04:00
Eric Anholt	c078c41520	ttn: Use nir_load_front_face instead of the TGSI-style input. This reduces the diff between GLSL-to-NIR and TGSI-to-NIR, and gives NIR more optimization to work on. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-08-19 13:11:36 -07:00
Eric Anholt	ed92241d78	ttn: Make FRAG_RESULT_DEPTH be a float variable to match gtn and ptn. This lets TTN-using drivers handle FRAG_RESULT_DEPTH the same between all their source paths. Reviewed-by: Rob Clark <robdclark@gmail.com>	2016-08-19 13:11:36 -07:00
Marek Olšák	7cd256ce7e	gallium: change pipe_sampler_view::first_element/last_element -> offset/size This is required by OpenGL. Our hardware supports this. Example: Bind RGBA32F with offset = 4 bytes. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=97305 Acked-by: Ilia Mirkin <imirkin@alum.mit.edu> Acked-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-17 14:15:33 +02:00
Rob Clark	5def00875d	freedreno/a3xx: fix generic clear path Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-08-16 19:26:03 -04:00
Rob Clark	27f12dd8fd	freedreno/a4xx: use generic clear path Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-08-16 09:21:13 -04:00
Rob Clark	f77e59e76c	freedreno/a3xx: use generic clear path Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-08-16 09:21:13 -04:00
Rob Clark	a8e6734a83	freedreno: support for using generic clear path Since clears are more or less just normal draws, there isn't that much benefit in having hand-rolled clear path. Add support to use u_blitter instead if gen specific backend doesn't implement ctx->clear(). Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-08-16 09:21:13 -04:00
Rob Clark	561fd226d4	freedreno/a3xx+a4xx: move common VBOs to fd_context These are the same for a3xx and later. (a2xx could probably use them too, but due to limited hw support and ancient downstream kernels, it isn't so easy to test.) Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-08-13 13:59:03 -04:00
francians@gmail.com	a49fb4ab2d	freedreno/a2xx: add missing casts to silence notices Signed-off-by: Francesco Ansanelli <francians@gmail.com> Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-08-13 09:37:41 -04:00
Rob Clark	78ba262d00	freedreno/ir3: fix issue with emit_tex() For various tex fetch instructions, coord's get fixed up in different ways. But modifying the array returned from get_src() has side-effects if the same SSA src is used again.. the later instruction will see the previous fixups. Fix this, and const'ify things to prevent this sort of mistake in the future. Noticed by Varad when adding support for txf_ms. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-08-13 09:33:47 -04:00
Marek Olšák	54272e18a6	gallium: add a pipe_context parameter to fence_finish required by glClientWaitSync (GL 4.5 Core spec) that can optionally flush the context Reviewed-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-10 01:11:10 +02:00
Marek Olšák	a909210131	gallium: add render_condition_enable param to clear_render_target/depth_stencil Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-08-10 01:10:21 +02:00
francians@gmail.com	e713a9e613	freedreno/a4xx: fix comparison out of range warnings Signed-off-by: Francesco Ansanelli <francians@gmail.com> Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:25:42 -04:00
francians@gmail.com	43492c7f2c	freedreno/a3xx: fix comparison out of range warnings Signed-off-by: Francesco Ansanelli <francians@gmail.com> Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:25:31 -04:00
francians@gmail.com	089cc74b6a	freedreno/a2xx: fix comparison out of range warnings Signed-off-by: Francesco Ansanelli <francians@gmail.com> Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:25:16 -04:00
francians@gmail.com	3fa68fdc90	freedreno/ir3: init ir3_shader_key with memset() To silence missing initializers warning Signed-off-by: Francesco Ansanelli <francians@gmail.com> Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:24:59 -04:00
Eric Engestrom	a63bac9271	gallium/freedreno: move cast to avoid integer overflow Previously, the bitshift would be performed on a simple int (32 bits on most systems), overflow, and then be cast to 64 bits. CovID: 1362461 Signed-off-by: Eric Engestrom <eric@engestrom.ch> Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Eric Engestrom	3563c4d161	freedreno/a2xx: remove duplicate assignment CovID: 1362445, 1362446 Signed-off-by: Eric Engestrom <eric@engestrom.ch> Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	2d64a003c5	freedreno: defer flush_queue allocation Some apps, like warsow, create a bazillion contexts but don't render on most of them. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	4175606474	freedreno: add some hw query traces Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	e684c32d2f	freedreno: some locking Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	9f0eb69527	freedreno: drop needs_rb_fbd We need to emit RB_FRAME_BUFFER_DIMENSION once per batch.. tracking this in fd_context is wrong when the gmem code executes asynchronously from the flush_queue worker. But in fact we don't really need to track it at all. We cannot assume previous value at the beginning of the batch (because of other processes potentially using the GPU), so just drop the tracking and emit it in _tile_init(). Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	e6bfe1c773	freedreno: move needs_wfi into batch This is also used in gmem code, which executes from the "bottom half" (ie. from the flush_queue worker thread), so it cannot be in fd_context. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	0739bbceec	freedreno: a bit of micro-optimization Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	e1b1052700	freedreno: drop mem2gmem/gmem2mem query stages They weren't really used, and it gets somewhat more complicated to deal with if batches are flushed asynchronously (on another thread). So just drop them, and move _query_set_state(NULL) call into batch (so it is not happening on background thread). Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	00bed8a794	freedreno: threaded batch flush With the state accessed from GMEM+submit factored out of fd_context and into fd_batch, now it is possible to punt this off to a helper thread. And more importantly, since there are cases where one context might force the batch-cache to flush another context's batches (ie. when there are too many in-flight batches), using a per-context helper thread keeps various different flushes for a given context serialized. TODO as with batch-cache, there are a few places where we'll need a mutex to protect critical sections, which is completely missing at the moment. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	c44163876a	freedreno: track batch/blit types Add a bit of extra book-keeping about blits and back-blits (from resource shadowing). If the app uploads all mipmap levels, as opposed to uploading the first level and then glGenerateMipmap(), we can discard the back-blit (as opposed to being naive and shadowing the resource for each mipmap level). Also, after a normal blit, we might as well flush the batch immediately, since there is not likely to be further rendering to the surface. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	7f8fd02dc7	freedreno: re-order support for hw queries Push query state down to batch, and use the resource tracking to figure out which batch(es) need to be flushed to get the query result. This means we actually need to allocate the prsc up front, before we know the size. So we have to add a special way to allocate an un- backed resource, and then later allocate the backing storage. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	10baf05b2c	freedreno: use prsc for hw queries Switch to using a pipe_resource (rather than an fd_bo directly) for hw query result buffers. This is first step towards making queries work properly with reordered batches, since we'll need the additional dependency tracking to know which batches to flush. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	ba30096888	freedreno: support discarding previous rendering in special cases Basically, to "DCE" blits triggered by resource shadowing, in cases where the levels are immediately completely overwritten. For example, mid-frame texture upload to level zero triggers shadowing and back-blits to the remaining levels, which are immediately overwritten by glGenerateMipmap(). Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	7105774bab	freedreno: shadow textures if possible to avoid stall/flush To make batch re-ordering useful, we need to be able to create shadow resources to avoid a flush/stall in transfer_map(). For example, uploading new texture contents or updating a UBO mid-batch. In these cases, we want to clone the buffer, and update the new buffer, leaving the old buffer (whose reference is held by cmdstream) as a shadow. This is done by blitting the remaining other levels (and whatever part of current level that is not discarded) from the old/shadow buffer to the new one. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	dcde4cd114	freedreno: spiff up some debug traces Make it easier to track batches, to ensure things happen properly when they are reordered. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	9f219c7047	freedreno: add batch-cache and batch reordering Note that I originally also had a entry-point that would construct a key and do lookup from a pipe_surface. I ended up not needing that (yet?) but it is easy-enough to re-introduce later if we need it for the blit path. For now, not enabled by default, but can be enabled (on a3xx/a4xx) with FD_MESA_DEBUG=reorder. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	f02a64dbdd	freedreno: move more batch related tracking to fd_batch To flush batches out of order, the gmem code needs to not depend on state from fd_context (since that may apply to a more recent batch). So this all moves into batch. The one exception is the gmem/pipe/tile state itself. But this is only used from gmem code (and batches are flushed serially). The alternative would be having to re-calculate GMEM layout on every batch, even if the dimensions of the render targets are the same. Note: This opens up the possibility of pushing gmem/submit into a helper thread. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	eeafaf2d37	freedreno: dynamically sized/growable cmd buffers Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	9e4561d3c4	freedreno: push resource tracking down into batch Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	9bbd239a40	freedreno: introduce fd_batch Introduce the batch object, to track a batch/submit's worth of ringbuffers and other bookkeeping. In this first step, just move the ringbuffers into batch, since that is mostly uninteresting churn. For now there is just a single batch at a time. Note that one outcome of this change is that rb's are allocated/freed on each use. But the expectation is that the bo pool in libdrm_freedreno will save us the GEM bo alloc/free which was the initial reason to implement a rb pool in gallium. The purpose of the batch is to eventually facilitate out-of-order rendering, with batches associated to framebuffer state, and tracking the dependencies on other batches. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-30 09:23:42 -04:00
Rob Clark	591eeb7d1c	freedreno: limit non-user constant buffers to a4xx Seems to mostly work on a3xx. Except when it doesn't and kills gpu quite badly. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-29 14:58:39 -04:00
Rob Clark	2f57e57881	freedreno/a4xx: time-elapsed query should be active for clears Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-24 09:33:05 -04:00
Rob Clark	9253dcde58	freedreno/a4xx: timestamp queries Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-23 13:39:30 -04:00
Rob Clark	b888d8e937	freedreno: hw timestamp support If the kernel supports it, use hw counter for timestamps. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-23 13:39:30 -04:00
Rob Clark	6a4b052820	freedreno: prep work for timestamp queries We need "NULL" state to be a valid bit in the bitmask, because timestamp queries are not restricted to draw/etc stages (ie. the only commands to submit may just be to read the timestamp). And just because there are no draws, isn't a reason to skip the flush and return zero. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-23 13:39:30 -04:00
francians@gmail.com	abb2a865a4	freedreno/ir3: Add missing braces in initializer Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-23 09:14:55 -04:00
francians@gmail.com	c99cdd2175	freedreno/a2xx: silence missing case 'SHADER_COMPUTE' warning (v2) v2: no need for break after an unreachable (Matt Turner) Signed-off-by: Francesco Ansanelli <francians@gmail.com> Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-23 09:14:18 -04:00
Marek Olšák	1ffe77e7bb	gallium: split transfer_inline_write into buffer and texture callbacks to reduce the call indirections with u_resource_vtbl. The worst call tree you could get was: - u_transfer_inline_write_vtbl - u_default_transfer_inline_write - u_transfer_map_vtbl - driver_transfer_map - u_transfer_unmap_vtbl - driver_transfer_unmap That's 6 indirect calls. Some drivers only had 5. The goal is to have 1 indirect call for drivers that care. The resource type can be determined statically at most call sites. The new interface is: pipe_context::buffer_subdata(ctx, resource, usage, offset, size, data) pipe_context::texture_subdata(ctx, resource, level, usage, box, data, stride, layer_stride) v2: fix whitespace, correct ilo's behavior Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Acked-by: Roland Scheidegger <sroland@vmware.com>	2016-07-23 13:33:42 +02:00
Józef Kucia	3cd28fe3de	gallium: add a cap for VIEWPORT_SUBPIXEL_BITS (v2) This allows Gallium drivers to advertise the subpixel precision for floating point viewports bounds. v2: - Set ViewportSubpixelBits in st_init_limits. Signed-off-by: Józef Kucia <joseph.kucia@gmail.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-07-20 12:45:31 +02:00
Kenneth Graunke	ac1181ffbe	compiler: Rename INTERP_QUALIFIER_* to INTERP_MODE_. Likewise, rename the enum type to glsl_interp_mode. Beyond the GLSL front-end, talking about "interpolation modes" seems more natural than "interpolation qualifiers" - in the IR, we're removed from how exactly the source language specifies how to interpolate an input. Also, SPIR-V calls these "decorations" rather than "qualifiers". Generated by: $ find . -regextype egrep -regex '.\.(c\|cpp\|h)' -type f -exec sed -i \ -e 's/INTERP_QUALIFIER_/INTERP_MODE_/g' \ -e 's/glsl_interp_qualifier/glsl_interp_mode/g' {} \; Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Dave Airlie <airlied@redhat.com>	2016-07-17 19:26:48 -07:00
francians@gmail.com	3db7f3458f	freedreno/a4xx: Fix sign compare warnings Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-14 09:55:02 -04:00
francians@gmail.com	948822018f	freedreno/a3xx: Fix sign compare warnings Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-14 09:55:02 -04:00
francians@gmail.com	cf2f345356	freedreno/a2xx: Fix sign compare warnings Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-14 09:55:02 -04:00
Rob Clark	7295428e41	freedreno: fix crash on smaller gpus and higher resolutions Devices with smaller GMEM size need more tiles. On db410c at 2048x1152, glmark2 shadow needed ~330 tiles for fullscreen. Lets bump it up to 512. (Maybe with MRT you could end up needing more, but at that point things are probably going to be painfully slow.) Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-03 11:16:28 -04:00
Rob Clark	202710d110	freedreno/ir3: support glsl linking for cmdline compiler For .vert/.frag, now multiple can be specified on the cmdline for purposes of linking, and the last one specified is the one that is fed into the ir3 backend (and dumped along the way if --verbose is specified) Without this, varyings in frag shaders would appear as undefined. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-07-02 09:00:19 -04:00
Rob Clark	1759eb1d19	freedreno: update valid_buffer_range for SO buffers Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-02 08:58:50 -04:00
Rob Clark	da39ac9c51	freedreno/ir3: support non-user_buffer consts Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-02 08:58:50 -04:00
Rob Clark	2081c1ecc0	freedreno/a2xx: move setup/restore cmds into binning pass Rather than doing a separate submit at context create, move these cmds to before first tile, as is done on a3xx/a4xx. Otherwise state can be overwritten by other contexts. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-02 08:58:50 -04:00
Rob Clark	2c3b54c278	freedreno: pass index buffer as a pipe_resource This will be useful in a following patch. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-02 08:58:50 -04:00
Rob Clark	88cc11e971	freedreno: switch emit_const_bo() to take prsc's We can push the unwrap of pipe_resource down. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-07-02 08:58:50 -04:00
Axel Davy	59a692916c	gallium: Add a cap for offset_units_unscaled D3D9 has a different behaviour for depth bias. For OGL/D3D1X, the depth bias unit is the minimal resolvable value for the depth buffer, which depends on the format (and has different behaviour for float depth buffers). For D3D9, the depth bias unit is 1.0f. Signed-off-by: Axel Davy <axel.davy@ens.fr> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-25 10:16:15 +02:00
Giuseppe Bilotta	60a27ad122	Remove wrongly repeated words in comments Clean up misrepetitions ('if if', 'the the' etc) found throughout the comments. This has been done manually, after grepping case-insensitively for duplicate if, is, the, then, do, for, an, plus a few other typos corrected in fly-by v2: * proper commit message and non-joke title; * replace two 'as is' followed by 'is' to 'as-is'. v3: * 'a integer' => 'an integer' and similar (originally spotted by Jason Ekstrand, I fixed a few other similar ones while at it) Signed-off-by: Giuseppe Bilotta <giuseppe.bilotta@gmail.com> Reviewed-by: Chad Versace <chad.versace@intel.com>	2016-06-23 13:55:03 -07:00
Rob Clark	ef534b9389	gallium: make constant_buffer const Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-20 12:36:20 -04:00
Ilia Mirkin	07fcb06fe0	gallium: add PIPE_CAP_MAX_WINDOW_RECTANGLES to all drivers This says how many window rectangles are supported by the implementation, although it may not exceed PIPE_MAX_WINDOW_RECTANGLES. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Brian Paul <brianp@vmware.com>	2016-06-18 13:38:29 -04:00
Rob Clark	243417810b	freedreno: support start param for sampler views/states Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-06-14 11:00:59 -04:00
Rob Clark	b8eb1493a9	freedreno: only do extra vertex-buffer state logic on a2xx Possibly this should move into an fd2 wrapper fxn, similar to the texture state tracking done for fd3/fd4 (clamp emulation, etc) Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-06-14 11:00:59 -04:00
Rob Clark	26d0efa9ce	freedreno: use util_copy_constant_buffer() helper Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-06-14 11:00:59 -04:00
Rob Herring	112e988329	Android: move libdrm settings to top-level Android.common.mk Fix warnings like these due to HAVE_LIBDRM being inconsistently defined: external/libdrm/include/drm/drm.h:839:30: warning: redefinition of typedef 'drm_clip_rect_t' is a C11 feature [-Wtypedef-redefinition] typedef struct drm_clip_rect drm_clip_rect_t; HAVE_LIBDRM needs to be set project wide to fix this. This change also harmlessly links libdrm with everything, but simplifies the makefiles a bit. Signed-off-by: Rob Herring <robh@kernel.org> Acked-by: Emil Velikov <emil.velikov@collabora.com>	2016-06-13 15:31:29 +01:00
Ilia Mirkin	edfa7a4b25	gallium: add PIPE_CAP_TGSI_VOTE for when the VOTE ops are allowed Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Dave Airlie <airlied@redhat.com>	2016-06-06 20:49:29 -04:00
Rob Clark	1535519e51	freedreno/ir3: do idiv lowering after main opt loop Give algebraic-opt pass a chance to catch udiv by const power-of-two, before running lower-idiv pass. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-06-03 16:05:03 -04:00
Rob Clark	94d8fbd217	freedreno: fix bad bitshift warnings Coverity doesn't realize idx will never be negative. Throw in some assert()s to help it out. (Hopefully assert() isn't getting compiled out for coverity build.. but there seems to be just one way to find out. We might have to change these to assume()) Fixes CID 1362442, 1362443 Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-06-02 16:29:32 -04:00
Rob Clark	676c77a923	freedreno: assume builtin shaders do compile Maybe we should switch to ureg to build the builtin shaders. But at any rate, if they fail to compile it is because someone messed them up (or changed TGSI syntax?). CID 1362444 Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-06-02 16:29:32 -04:00
Rob Clark	80c2886033	freedreno/a4xx: silence coverity warning CID 1362451 Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-06-02 15:44:07 -04:00
Rob Clark	9b854ce53c	freedreno/a3xx+a4xx: fix potential null ptr deref Coverity spotted the a3xx case (not sure why not the a4xx). CID 1362452 Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-06-02 15:44:07 -04:00
Rob Clark	27a97097e1	freedreno/ir3: fix coverity warning CID 1362453 Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-06-02 15:44:07 -04:00
Rob Clark	374ad2e2bd	freedreno/ir3: use nir_shader_get_entrypoint() helper Should also fix coverity warning: CID 1362454 Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-06-02 15:44:07 -04:00
Rob Clark	df64cd6814	freedreno/a4xx: fix incorrect enum type a4xx has it's own enum, different from a2xx/a3xx. Spotted by coverity: CID 1362458, 1362459 Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-06-02 15:44:07 -04:00
Rob Clark	1632b0eac0	freedreno: fix coverity negative array index warning Never can happen, since query would not have been created in the first place if pidx(query_type) return negative. Lets let coverity realize this. CID 1362460 Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-06-02 15:44:07 -04:00
Rob Clark	ba452d43e0	freedreno: fix dereference before null check ptr can actually never be null so just drop the check. CID 1362464 (#1 of 1): Dereference before null check (REVERSE_INULL) check_after_deref: Null-checking ptr suggests that it may be null, but it has already been dereferenced on all paths leading to the check. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-06-02 15:44:07 -04:00
Rob Clark	18fb922faa	freedreno/a3xx: only update/emit bordercolor state when needed Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-06-02 15:44:07 -04:00
Rob Clark	11f0652404	freedreno/a4xx: only update/emit bordercolor state when needed I noticed in stk that it was contributing to a lot of overhead. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-06-02 15:44:07 -04:00
Emil Velikov	2f43908395	freedreno: make sure we pick up ir3_nir_trig.py in the release tarball Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2016-05-30 10:28:53 +01:00
Jason Ekstrand	32210dea8e	compiler: Move glsl_to_nir to libglsl.la Right now libglsl.la depends on libnir.la so putting it in libnir.la adds a dependency on libglsl.la that goes the wrong direction. Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2016-05-26 14:13:38 -07:00
Rob Clark	231dcb19f9	freedreno/ir3: cmdline compiler for glsl Use glsl/libstandalone.la to add support for taking glsl src files (in addition to .tgsi) as input. Then glsl->nir and feed the result into the ir3 backend as normal. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-05-25 16:31:15 -04:00
Kenneth Graunke	70048eb1e3	gallium: Add a pipe cap for whether primitive restart works for patches. Some hardware supports primitive restart on patch primitives, and other hardware does not. Modern GL and ES include a query for this feature; adding a capability bit will allow us to answer it. As far as I know, AMD hardware does not support this feature, while NVIDIA and Intel hardware does. However, most Gallium drivers do not appear to support tessellation shaders yet. So, I've enabled it for nvc0 and disabled it everywhere else. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-05-23 16:44:11 -07:00
Rob Clark	46ff17559b	freedreno/ir3: disable cp for indirect src's The variable-indexing tests always had a few random fails, which I usually couldn't reproduce when running tests manually. Somehow recently this got a lot worse. I ported a couple of the shaders to GLES to see what blob does, and it also seems to be avoiding to cp indirect srcs. So I guess indirect w/ instructions other than cat1 (mov) are not totally reliable. Let's just switch that off until this is better understood. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-05-23 15:57:13 -04:00
Rob Clark	3a1bbd6a0a	freedreno/ir3: need to lower fmod too Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-05-20 11:13:50 -04:00
Rob Clark	b65bd3dee5	freedreno/ir3: fix compiler warning Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-05-17 10:05:20 -04:00
Rob Clark	277818ecfb	freedreno/ir3: small standalone compiler cleanup Don't hard-code the gpu-id anymore. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-05-15 17:25:48 -04:00
Rob Clark	f8840f471d	freedreno/ir3: lower fdiv Not sure how we didn't hit this already, but since we want fdiv converted into mul + rcp, we should set this. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-05-15 17:25:48 -04:00
Rob Clark	53cde5e295	freedreno/ir3: handle VARYING_SLOT_PNTC In the glsl->tgsi path, this already gets translated to VAR8, which matches up with rasterizer->sprite_coord_enable. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-05-15 17:25:48 -04:00
Rob Clark	2f1581059b	freedreno/ir3: disable TGSI specific hacks in nir case When we got NIR directly from state tracker (vs using tgsi_to_nir) we need to realize this and skip some TGSI specific hacks. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-05-15 17:25:48 -04:00
Rob Clark	784086f3c1	freedreno/ir3: add support for NIR as preferred IR For now under debug flag, since only suitable for debugging/testing. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-05-15 17:25:47 -04:00
Tobias Klausmann	2be258ea18	gallium: Add a pipe cap for arb_cull_distance This lets us safely enable or disable the extension as needed Signed-off-by: Tobias Klausmann <tobias.johannes.klausmann@mni.thm.de> Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-05-14 08:28:17 +10:00
Jason Ekstrand	1b72c31e1f	nir/algebraic: Separate ffma lowering from fusing The i965 driver has its own pass for fusing mul+add combinations that's much smarter than what nir_opt_algebraic can do so we don't want to get the nir_opt_algebraic one just because we didn't set lower_ffma. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-05-11 11:44:35 -07:00
Rob Clark	4500d17245	freedreno: fix multi-layer transfer_map's The use of transfer_inline_write() in TexSubImage path (see `fb9fe352ea`) exposed a bug for "layer_first" resources (ie. a4xx) not setting correct layer_stride. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-05-11 12:03:21 -04:00
Rob Clark	8623e599fc	freedreno/ir3: size input/output arrays properly We index into these based on var->data.driver_location, which might have gaps (ie. two inputs, one w/ drvloc 0 and other 2). This shows up in (for example) 'bin/copyteximage 1D', but was only noticed recently due to additional asserts. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-05-10 13:17:27 -04:00
Samuel Iglesias Gonsálvez	d00a239b28	freedreno/ir3: lower lrp when operating with double operands Lower lrp when operating with double operands because float version of lrp is also lowered. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-05-10 11:25:01 +02:00

... 2 3 4 5 6 ...

1261 Commits