mirror of https://gitlab.freedesktop.org/mesa/mesa
4456 lines
204 KiB
ReStructuredText
4456 lines
204 KiB
ReStructuredText
Mesa 24.0.0 Release Notes / 2024-02-01
|
||
======================================
|
||
|
||
Mesa 24.0.0 is a new development release. People who are concerned
|
||
with stability and reliability should stick with a previous release or
|
||
wait for Mesa 24.0.1.
|
||
|
||
Mesa 24.0.0 implements the OpenGL 4.6 API, but the version reported by
|
||
glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
|
||
glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.
|
||
Some drivers don't support all the features required in OpenGL 4.6. OpenGL
|
||
4.6 is **only** available if requested at context creation.
|
||
Compatibility contexts may report a lower version depending on each driver.
|
||
|
||
Mesa 24.0.0 implements the Vulkan 1.3 API, but the version reported by
|
||
the apiVersion property of the VkPhysicalDeviceProperties struct
|
||
depends on the particular driver being used.
|
||
|
||
SHA256 checksum
|
||
---------------
|
||
|
||
::
|
||
|
||
dc7e8c077bc5884df95478263b34bdebb7e88e600689cb56fb07be2b8c304c36 mesa-24.0.0.tar.xz
|
||
|
||
|
||
New features
|
||
------------
|
||
|
||
- VK_EXT_image_compression_control on RADV
|
||
- VK_EXT_device_fault on RADV
|
||
- OpenGL 3.3 on Asahi
|
||
- Geometry shaders on Asahi
|
||
- GL_ARB_texture_cube_map_array on Asahi
|
||
- GL_ARB_clip_control on Asahi
|
||
- GL_ARB_timer_query on Asahi
|
||
- GL_EXT_disjoint_timer_query on Asahi
|
||
- GL_ARB_base_instance on Asahi
|
||
- OpenGL 4.6 (up from 4.2) on d3d12
|
||
- VK_EXT_depth_clamp_zero_one on RADV
|
||
- GL_ARB_shader_texture_image_samples on Asahi
|
||
- GL_ARB_indirect_parameters on Asahi
|
||
- GL_ARB_viewport_array on Asahi
|
||
- GL_ARB_fragment_layer_viewport on Asahi
|
||
- GL_ARB_cull_distance on Asahi
|
||
- GL_ARB_transform_feedback_overflow_query on Asahi
|
||
- VK_KHR_calibrated_timestamps on RADV
|
||
- VK_KHR_vertex_attribute_divisor on RADV
|
||
- VK_KHR_maintenance6 on RADV
|
||
- VK_KHR_ray_tracing_position_fetch on RADV
|
||
- EGL_EXT_query_reset_notification_strategy
|
||
|
||
|
||
Bug fixes
|
||
---------
|
||
|
||
- vlc crashes when playing 1920x1080 video with Radeon RX6600 hardware acceleration and deinterlacing enabled.
|
||
- [radeonsi] Regression: graphical artifacting on water texture in OpenGOAL
|
||
- Assertion when creating dmabuf-compatible VkImage on Tigerlake
|
||
- VAAPI: EFC on VCN2 produces broken H264 video and crashes the HEVC encoder
|
||
- [AMDGPU RDNA3] Antialiasing is broken in Blender
|
||
- MTL: vulkan cooperative matrix tests gpu hang on MTL
|
||
- Assassin's Creed Odyssey wrong colors on Arc A770
|
||
- The Finals fails to launch with DX12 on Intel Arc unless "force_vk_vendor" is set to -1.
|
||
- VA-API CI tests freeze
|
||
- radv: games render with garbage output on RX5600M through PRIME with DCC
|
||
- radv: RGP reports for mesh shaders are confusing
|
||
- zink crashes on nvidia
|
||
- d3d10umd: Build failure regression with MSVC during 23.3 development cycle
|
||
- Error during SPIR-V parsing of OpCopyLogical
|
||
- rusticl: fails to find SPIRV-Tools headers via pkg-config under non-default prefix
|
||
- Conservative depth output doesn't work with RADV
|
||
- RADV: DOA-X3 (yuzu) missing hair, eyes and skybox
|
||
- intel: Require 64KB alignment when using CCS and multiple engines
|
||
- radv: Atlas Fallen corrupted rendering
|
||
- r300: nir pass to lower indirect regression
|
||
- r300: LRP present even with .lower_flrp32=true
|
||
- 23.3.2 regression: kms_swrast_dri.so segfaults
|
||
- Radeon: YUYV DMA BUF eglCreateImageKHR fails
|
||
- No support for a644
|
||
- anv: importing memory for a compressed image using modifier is hitting an assert
|
||
- anv: importing memory for a compressed image using modifier is hitting an assert
|
||
- anv: importing memory for a compressed image using modifier is hitting an assert
|
||
- Large regression in \`glbench --tests context` on Intel
|
||
- Android 14 depends on Vulkan EXT_swapchain_maintenance1, which breaks radv
|
||
- nvk,nak: Implement shaderFloat64
|
||
- Mesa is not compatible with Python 3.12 due to use of distutils
|
||
- anv: glcts regression on zink
|
||
- nir: Trivial loop not unrolling
|
||
- Possible regression with AMD GPU with flatpak apps
|
||
- nvk,nak: Implement VK_KHR_vulkan_memory_model
|
||
- Compiling Mesa with X in custom prefix fails in Intel Vulkan driver
|
||
- anv: implement recommended AUX-TT invalidation on compute/transfer queues
|
||
- anv: implement recommended AUX-TT invalidation on compute/transfer queues
|
||
- !26307 broke some piglit tests with rusticl on radeonsi on Navi 14
|
||
- Compute shader with imageStore() to a swapchain image (from a display surface) produces incorrect results (Raspberry, Vulkan).
|
||
- nvk: Implement VK_EXT_multi_draw
|
||
- radv/aco: Crysis 2 Remastered RT reflections are blocky around the edges with ACO, renders normally with LLVM
|
||
- radv: Major regression in main branch causing all Vulkan apps to crash on 6600M (Navi 23)
|
||
- [23.3.0] Parallel build failure - fatal error: vtn_generator_ids.h: No such file or directory
|
||
- crocus: Assertion failures in NIR divergence analysis
|
||
- nak: Implement nir_op_fmulz
|
||
- nvk,nak: Implement VK_KHR_shader_float_controls
|
||
- 748b7f80ef1cf6a3fed9991d70230e69fef51a0e - Regression on Doom Eternal w/ RT Reflections
|
||
- glFlush() blocks until close to GPU completion on Radeon R9 270
|
||
- nvk: Implement VK_EXT_texel_buffer_alignment
|
||
- rusticl: fails to find X11 headers via pkg-config under non-default prefix
|
||
- nvk,nak: Implement VK_EXT_shader_image_atomic_int64
|
||
- nvk,nak: Implement VK_KHR_shader_atomic_int64
|
||
- nvk,nak: Implement VK_KHR_shader_subgroup_extended_types
|
||
- nvk,nak: Implement shaderInt64
|
||
- nvk: Implement VK_EXT_subgroup_size_control
|
||
- mesa:freedreno / afuc-disasm unit test failure
|
||
- anv: Resident Evil 2 hang
|
||
- Mesa 23.3.0 release build fails on 22.04 LTS
|
||
- Segfault in SDL2 game when using environment variables: \`SDL_VIDEODRIVER=wayland DRI_PRIME=1`
|
||
- Mesa 22.3.0 SEGFAULT in nir shader creation for r600 cards on FreeBSD
|
||
- radeonsi: merge request 26055 causes thousands of piglit failures
|
||
- iris: INTEL_COMPUTE_CLASS causes gpu hangs on MTL platforms
|
||
- anv: piglit tests regressed for zink
|
||
- aco,radeonsi: GFX11 dEQP-GLES31.functional.separate_shader.random.0 fail when AMD_DEBUG=useaco
|
||
- crash in si_update_tess_io_layout_state during _mesa_ReadPixels (radeonsi_dri, mesa 23.2.1)
|
||
- Compilation error with current LLVM git (createLoopSinkPass)
|
||
- [RADV] War Thunder has some grass flickering.
|
||
- radv: satisfactory broken shader
|
||
- RADV problem with R7 M440 in some games
|
||
- nvk,nak: Weird fog effect in old GTA games with DXVK
|
||
- gpu driver crashes when opening ingame map playing dead space 2023
|
||
- [anv] Valheim water misrendering
|
||
- radv, zink: dEQP-GLES3.functional.fbo.msaa.4_samples.depth_component16 fails on gfx9
|
||
- Armored Core 6 (1888160) fake_sparse support
|
||
- radv: fix sparseResidencyImage3D on GFX8
|
||
- build still broken on Slackware 15.0 i586
|
||
- mesa fails to build on arch
|
||
- EGL/v3d: EGL applications under a X compositor doesn't work
|
||
- nvk,nak: Implement VK_KHR_fragment_shader_barycentric
|
||
- RADV: trunc_coord breaks ambient occlusion in Dirt Rally and other games
|
||
- radv: Mass Effect Legendary Edition: a line going across the screen is visible in some areas with Ambient Occlusion enabled
|
||
- LTO-related build failures
|
||
- anv: DIRT5 gfx11_generated_draws_spv_source triggers "assert(!copy_value_is_divergent(src) || copy_value_is_divergent(dest));"
|
||
- nvk: Implement VK_KHR_synchronization2
|
||
- nvk: Implement bufferDeviceAddressCaptureReplay
|
||
- nvk,nak,codegen: Implement VK_KHR_pipeline_executable_properties
|
||
- panfrost: gbm_bo_get_offset() wrongly returns 0 for second plane of NV12 buffers
|
||
- Sastisfactory since Update 8 needs force_vk_vendor set
|
||
- [RADV][TONGA] - BeamNG.drive (284160) - Artifacts are present when looking at the skybox.
|
||
- LEGO Star Wars: The Skywalker Saga graphical glitches (DXVK) on R9 380
|
||
- [radv] Crypt not rendering properly
|
||
- Leaks of DescriptorSet debug names
|
||
- [Tracing flake] Missing geometry in trace\@freedreno-a630\@freedoom\@freedoom-phase2-gl-high.trace
|
||
- Unreal Engine 5.2 virtual shadow maps have glitchy/lazy tile updates
|
||
- RADV: Visual glitches in Unreal Engine 5.2.1 when using material with anisotropy and light channel 2
|
||
- radv: Regression with UE5 test
|
||
- SIGSEGV with MESA_VK_TRACE=rgp and compute only queue
|
||
- mesa: vertex attrib regression
|
||
- [ANV] Corruptions in Battlefield 4
|
||
- anv regression w/ commit e488773b29d97 ("anv: Fast clear depth/stencil surface in vkCmdClearAttachments")
|
||
- freedreno uses wrong patch size
|
||
- ir3: dEQP-GLES31.functional.synchronization.inter_invocation.image_atomic_read_write crash on a6xx gen4
|
||
- a630: antichamber crashes with pack_A6XX_GRAS_CL_GUARDBAND_CLIP_ADJ: Assertion
|
||
- mesa:amd+compiler / aco_tests assembler.gfx11.vop12c_v128/gfx11 failure with llvm-17
|
||
- ci_run_n_monitor crash because of incorrect parsing of dag
|
||
- Zink + Venus: driver can't handle INVALID<->LINEAR!
|
||
- anv not initializing engine correctly with INTEL_COPY_CLASS=1
|
||
- Anv: Particles have black square artifacts on Counter Strike 2 on Skylake
|
||
- Lords of the Fallen 2023 Red Eye mode crashing game and desktop
|
||
- [radeonsi] [vulkan] [23.3-rc1 regression] Video output corrupted in QMplay2 with Vulkan renderer
|
||
- [BISECTED] ac/radeon commit somehow breaks nv12 surface from HEVC decode
|
||
- radv: Chrome crashes when ANGLE uses GPL
|
||
- Parsec displays completely green screen with hardware decoder selected while using Mesa 23.3 and Mesa 24
|
||
- H264 to H264 transcode output corruption with gst-vaapi
|
||
- opencl-jpeg-encoder does not work with nouveau/rusticl, works with nouveau/clover
|
||
- [rusticl] [radeonsi] [darktable4] [ppc64le] Darktable always renders black images despite not throwing any error
|
||
- [R600] X-plane 11 demo (Linux Native) crashes upon launch on HD5870 and HD6970
|
||
- [CI] .gitlab-ci/setup-test-env.sh date -d parsing fails on Alpine Linux containers
|
||
- ANV not handling VkMutableDescriptorTypeCreateInfoEXT::pMutableDescriptorTypeLists[i] being out of range
|
||
- Ubuntu 23.10 build error with rusticl_opencl_bindings.rs
|
||
- Rusticl fails to build
|
||
- tu: Wolfenstein: The New Order misrenders on a740
|
||
- DRI_PRIME fails with ACO only radeonsi
|
||
- ci_run_n_monitor: undetected sanity dep breaks the pipeline
|
||
|
||
|
||
Changes
|
||
-------
|
||
|
||
Alejandro Piñeiro (10):
|
||
|
||
- broadcom/qpu: use back BITFIELD64_RANGE for ANYOPMASK
|
||
- broadcom/compiler: add v3d_pack_unnormalized_coordinates helper
|
||
- broadcom: only support v42 and v71
|
||
- broadcom/compiler: set properly lod query
|
||
- broadcom/cle: remove v33 and v41 from xml definition
|
||
- broadcom/cle: rename xml files
|
||
- docs/v3d: update v3d documentation
|
||
- nir: add new opcodes to map new v71 packing/conversion instructions
|
||
- broadcom/compiler: update image store lowering to use v71 new packing/conversion instructions
|
||
- broadcom/compiler: remove one superfluous call to nir_opt_undef
|
||
|
||
Alessandro Astone (2):
|
||
|
||
- asahi: Use the compat version of qsort_r
|
||
- zink: Fix resizable BAR detection logic
|
||
|
||
Alexander von Gluck IV (3):
|
||
|
||
- egl/haiku: Cleanup includes; minor build fix
|
||
- hgl: Redefine visual options in hgl_context.h
|
||
- egl/haiku: Remove some dead cleanup code
|
||
|
||
Alyssa Rosenzweig (286):
|
||
|
||
- hasvk: Support builiding on non-Intel
|
||
- crocus: Support building on non-Intel
|
||
- meson: Add vulkan-drivers=all option
|
||
- meson: Add gallium-drivers=all option
|
||
- gitlab: Highlight .cl as C
|
||
- nir,vtn: Add exported bool to nir_function
|
||
- nir: Add nir_remove_non_exported
|
||
- nir/builder: Add nir_call helper
|
||
- meson: Simplify clc expression
|
||
- meson: Require clc for asahi
|
||
- vtn: Add spirv_library_to_nir_builder feature
|
||
- clc: Add missing idep_vtn
|
||
- agx: Fix lower regular texture metadata
|
||
- agx: Vectorize load/stores
|
||
- agx: Fuse (unmasked) extr_agx
|
||
- agx: Fuse ubitfield_extract
|
||
- asahi: Fix agx_pack unrolling
|
||
- asahi: Make GenXML compatible with OpenCL
|
||
- asahi: Unpack at 32-bit granularity
|
||
- asahi: Reexpress genxml pack macro
|
||
- asahi: Add folder for internal shaders
|
||
- asahi: Add asahi_clc infrastructure
|
||
- asahi: Pass valid memctx to open_device
|
||
- asahi: Deserialize libagx when opening device
|
||
- asahi,agx: Plumb libagx
|
||
- asahi: Add software-defined field to texture desc
|
||
- agx: Use CL for texture lowerings
|
||
- asahi: Remove placeholder shader
|
||
- asahi: Fix tools=all builds
|
||
- ci: Opt out asahi from clang-format
|
||
- ttn: Set sample shading for sample ID reads
|
||
- compiler: Make shader_enums.h CL-safe
|
||
- compiler: Inline mesa_vertices_per_prim
|
||
- compiler: Make u_decomposed_prims_for_vertices available to CL
|
||
- nir/lower_gs_intrinsics: Include primitive counts
|
||
- nir/lower_gs_intrinsics: Append EndPrimitive
|
||
- nir/lower_gs_intrinsics: Count decomposed primitives too
|
||
- nir: Also gather decomposed primitive count
|
||
- nir: Add intrinsics for lowering GS
|
||
- nir: Add intrinsics for lowering bindless textures/samplers
|
||
- nir/print: handle adjacency
|
||
- asahi: Clamp 8-bit integer RTs
|
||
- agx: Legalize image MS index
|
||
- agx: Fix fragment side effects scheduling
|
||
- agx: Check for spilling in release builds
|
||
- docs/features: Mark ARB_mdi done on asahi
|
||
- agx: Cleanup 8-bit math before lowering
|
||
- agx: Require 32-bit alignment for EOT offset
|
||
- agx: Add scaffolding for subgroup ops
|
||
- agx: Translate simple subgroup ops
|
||
- asahi: Pack non-border colour sampler desc
|
||
- agx: Allow drivers to lower texture handles
|
||
- asahi: Lower samplers to bindless if needed
|
||
- agx: Lower LOD bias earlier
|
||
- agx: Handle bindless samplers
|
||
- asahi: Handle load_sampler_handle
|
||
- asahi: Add sampler heap data structure
|
||
- asahi: Use the sampler heap
|
||
- asahi: Upload tex/samplers properly with merged shaders
|
||
- asahi: Don't hazard track fake resources
|
||
- asahi: Refactor encoder data structure
|
||
- asahi: Factor out agx_launch
|
||
- asahi: Make encoder_allocate public
|
||
- asahi: Add data structures for geometry shaders
|
||
- asahi: Add helpers for lowering GS
|
||
- asahi: Add GS lowering pass
|
||
- asahi: Wire up geometry shaders
|
||
- asahi: Advertise geometry shaders
|
||
- asahi: rm unused deqp debug flag
|
||
- asahi: Don't use OpenGL clip bit
|
||
- asahi: Plumb clip_halfz bit from RS
|
||
- asahi: Advertise ARB_clip_control
|
||
- asahi: Implement timer queries
|
||
- docs: Mark timer queries as done on asahi
|
||
- asahi: Implement ARB_base_instance
|
||
- nir: Simplify nir_alu_instr_channel_used definition
|
||
- nir/validate: Optimize ssa_srcs set
|
||
- nir/validate: Don't spam nir_alu_instr_channels
|
||
- nir/validate: Don't validate out-of-bounds channels
|
||
- nir/validate: Use unlikely for validate_assert
|
||
- nir/validate: Don't check dimensions in validate_def
|
||
- nir/validate: Drop stale todo
|
||
- nir/validate: Inline validate_ssa_src
|
||
- nir/validate: Split out validate_sized_src
|
||
- nir/validate: Specialize if source validation
|
||
- panfrost: Add an allow_rotating_primitives() helper
|
||
- panfrost: Factor out vertex attribute stride calculation
|
||
- panfrost: Add panfrost_get_{position,varying}_shader() helpers
|
||
- gallium: add pipe_shader_from_nir helper
|
||
- radeonsi: use pipe_shader_from_nir
|
||
- v3d: use pipe_shader_from_nir
|
||
- asahi: use pipe_shader_from_nir
|
||
- vc4: use pipe_shader_from_nir
|
||
- zink: use pipe_shader_from_nir
|
||
- nouveau: use pipe_shader_from_nir
|
||
- panfrost: use pipe_shader_from_nir
|
||
- gallium: drop pipe_shader_state_from_nir
|
||
- mesa/st: collapse tgsi deadcode
|
||
- mesa/st: use pipe_shader_from_nir
|
||
- nir/lower_tex: Add 1D lowering
|
||
- agx: fix 1D texture sampling
|
||
- ac,radv,radeonsi: use common 1D texture lowering
|
||
- nir/format_convert: handle clamping smaller bit sizes
|
||
- nir/lower_idiv: Optimize idiv sign calculation
|
||
- agx: Hotfix for stack_adjust in GS
|
||
- asahi/decode: Decode multiple macOS commands
|
||
- asahi: Quiet clang warning
|
||
- asahi: Add half float type to genxml
|
||
- asahi: Add XML for hw tessellation
|
||
- asahi: Identify Primitive ID frag input
|
||
- asahi: Identify bicubic filtering mode
|
||
- asahi: fix index bias with GS/XFB
|
||
- asahi: Sync heap size
|
||
- asahi: init clear colour between batches
|
||
- asahi: clamp clear colours
|
||
- asahi: handle self blits
|
||
- asahi: bump limits
|
||
- asahi: remove bogus assertion
|
||
- asahi: be robust about null xfb
|
||
- asahi: fix dirty tracking fail with point sprites
|
||
- asahi: handle null PBE
|
||
- asahi: Be robust with arrays of images
|
||
- asahi: fix imageSize of null image
|
||
- asahi: rm compact image atomic descriptors
|
||
- asahi: use 2D descriptors for cubes
|
||
- asahi: defer texture packing to draw-time
|
||
- ail: handle >4GiB textures
|
||
- asahi: return GL_OOM for excessive image sizes
|
||
- asahi: fix meta usc builder allocation
|
||
- asahi: implement xfb stream queries
|
||
- asahi: fix output to non-rast streams
|
||
- asahi: bump glsl version
|
||
- asahi: minify when blitting for transition
|
||
- asahi: blit with the old format when transitioning
|
||
- asahi: flush before resource transition
|
||
- agx: Fix flatshading of matrices
|
||
- asahi: fix xfb of pointsize when not drawing points
|
||
- asahi: defeature quads
|
||
- asahi: Rotate tri fans based on provoking vtx
|
||
- asahi: use GS for first-provoking fans
|
||
- asahi: Early out for GS + rast discard
|
||
- asahi: Implement draw parameters
|
||
- agx: wire up texture_samples/image_samplers
|
||
- asahi: advertise ARB_shader_texture_image_samples
|
||
- asahi: fix layout transitions with arrays
|
||
- asahi: use correct target packing PBE
|
||
- asahi: choose staging bind better
|
||
- asahi: fix destroy_query leaving dangling references
|
||
- asahi: add agx_push macro
|
||
- asahi: collapse unreachable condition
|
||
- asahi: use agx_push
|
||
- asahi: remove dead declarations
|
||
- asahi: rm unnecessary uniform upload for GS
|
||
- asahi: make UB easier to see
|
||
- asahi: force GS for indirect prim gen query
|
||
- asahi: rework GS input assembly
|
||
- asahi: Implement multidraw indirect
|
||
- asahi: move heap alloc to first use
|
||
- asahi: double depth bias
|
||
- asahi: add static assert
|
||
- agxdecode: fix stack smash with border colour
|
||
- asahi: Support L/A/I formats for texture buffers
|
||
- asahi: fix tri fan enum
|
||
- asahi: rework cf binding xml
|
||
- asahi: add xml for flatshading fans
|
||
- agx: fix VARYING_SLOT_COL0 getting flatshaded
|
||
- agx: Avoid scratch mem with tri strip w/ adjacency
|
||
- agx: rework libagx linking a bit
|
||
- asahi: Unroll GS/XFB primitive restart on the GPU
|
||
- asahi: Lower edge flags
|
||
- asahi: assert hw invariant
|
||
- asahi: rewrite pointsize handling
|
||
- agx: remove spurious z/s writes in force early-z shaders
|
||
- agx: handle force early-z + discard
|
||
- agx: note that sample_mask runs occlusion queries
|
||
- agx: allocate varying slot if writing viewport only
|
||
- agx: report if we have a nonzero viewport
|
||
- asahi: allow empty scissor box
|
||
- asahi: add XML for multiple viewports
|
||
- asahi: Implement ARB_viewport_array
|
||
- asahi: handle some components/offsets in GS lowering
|
||
- asahi: prepare gs copy shaders for compact clip/cull
|
||
- asahi: handle compact clip/cull in gs component gather
|
||
- asahi: Implement ARB_cull_distance
|
||
- asahi: add more BGR formats
|
||
- asahi: fix dupe rgb65 formats
|
||
- asahi: fix pbe swizzling
|
||
- asahi: fix integer RT clamping
|
||
- agx: fix fp64 lowering options
|
||
- agx: Lower 64-bit I/O to 32-bit
|
||
- agx: don't produce split of immediate
|
||
- asahi: fix size calculation for 2d msaa arrays
|
||
- asahi: allow more format reinterpretation
|
||
- asahi: respect render condition for compute
|
||
- asahi: wire up hardware gl_PrimitiveID
|
||
- asahi: clamp draw count for mdi
|
||
- gallium: fix util_clamp_color type confusion
|
||
- gallium: add PIPE_IMAGE_ACCESS_DRIVER_INTERNAL
|
||
- nir/validate: allow bias on nir_texop_lod
|
||
- asahi: Implement lod queries
|
||
- vtn: fuse OpenCL mad if we can can
|
||
- asahi: fix eMRT + background load interaction
|
||
- ail: add is_level_compressed query
|
||
- ail: use is_level_compressed
|
||
- ail: add ail_is_level_twiddled_uncompressed
|
||
- asahi: do not use compression blits for uncompressed levels
|
||
- agx: allow bindful arrays if not clamping
|
||
- asahi: don't format convert with staging blits
|
||
- asahi: implement arrays as 2d for internal images
|
||
- asahi: respect last_block
|
||
- asahi: allow compressed image stores in blits
|
||
- asahi: fix image_mask with unbind num trailing
|
||
- asahi: add compute blitter
|
||
- asahi: add and use batch_is_compute helper
|
||
- asahi: fix get_batch with compute batches
|
||
- asahi: allow multiple compute dispatches in a batch
|
||
- asahi: drop custom mipmap generate
|
||
- asahi: set data_valid on first draw
|
||
- asahi: fix data valid tracking
|
||
- asahi: reduce transfer map flushing with staging blits
|
||
- asahi: do not stall for writers with invalid mips
|
||
- asahi: implement blit-based resource_copy_region
|
||
- asahi: fix snorm staging blits
|
||
- asahi: use copy region for decompression
|
||
- asahi: fix scissor arrays
|
||
- asahi: disable compute-based blitter for now
|
||
- agx: use more mem->tex barriers even on g13g
|
||
- agx: fix early-z + discard together
|
||
- asahi: fix set_sampler_views
|
||
- asahi: fix max tex sizes
|
||
- agx: optimize fcmp like fcmpsel
|
||
- agx: wire up some ballots
|
||
- agx: lower votes to ballots
|
||
- agx: implement query_levels
|
||
- agx: skip scoreboard bit in builder for !wait
|
||
- agx: make vec widths explicit in IR
|
||
- agx: validate post-RA
|
||
- agx: rm silly todo
|
||
- agx: rm outdated comment
|
||
- agx: add index size helper
|
||
- agx: trust in agx_index size
|
||
- agx: mv agx_read/write_regs to validator
|
||
- agx: use custom assert when packing
|
||
- agx: use mov imm for pcopies
|
||
- agx: allow phis with 16bit imms
|
||
- agx: prepare for immediates in phis
|
||
- agx: handle imm inlining into phis
|
||
- asahi: rework compute emptiness tracking
|
||
- asahi: stub qbo on the cpu
|
||
- asahi: implement xfb overflow queries
|
||
- agx: const fold after discard lowering
|
||
- agx: fix xfb of invalid comp
|
||
- agx: fix xfb of invalid var
|
||
- asahi: bump vertex shader outputs
|
||
- asahi: rm pointless multisample key bit
|
||
- asahi: rm layered bit from shader key
|
||
- asahi: implement point sprites w/o shader key
|
||
- asahi: rm unused blend enable bit
|
||
- asahi: rm logicop enable bit
|
||
- asahi: rm nr_cbufs from key
|
||
- asahi: rm blend->store from shader key
|
||
- asahi: rm vbuf.count from key
|
||
- asahi: rm agx_vbufs wrapper
|
||
- asahi: invert program_point_size
|
||
- asahi: divide by xfb stride for xfb draws
|
||
- asahi: disable fp16 cbuf cap
|
||
- asahi: add missing GS line strip (+adj) handling
|
||
- asahi: link libagx before lowering mem access widths
|
||
- asahi: cl-ify some xfb logic
|
||
- asahi: factor out libagx_map_vertex_in_tri_strip
|
||
- asahi: rotate xfb'd tri strips
|
||
- asahi: inline something silly
|
||
- asahi: plumb get_ubo_size
|
||
- asahi: make txf robust properly
|
||
- asahi: fix passthrough GS with poly modes
|
||
- asahi: add missing tib alignment check
|
||
- agx: optimize split(64-bit uniform)
|
||
- agx: expand agx_index
|
||
- agx: fix 64-bit phis with inlined immediates
|
||
- agx: add unit test for pcopy lowering bug
|
||
- agx: require min alignment for load/store vectorize
|
||
- asahi: fallback some resource copies
|
||
- asahi: don't canonicalize nans/flush denorms when copying
|
||
- agx: unit test split uniform opt
|
||
- agx: clang-fmt
|
||
- nir,zink: Redefine flat_mask in terms of I/O locations
|
||
|
||
Andrew Gazizov (4):
|
||
|
||
- venus: Add use_guest_vram capset to enable guest-based blob alloc
|
||
- venus: Use vk_object_id as blob_id for guest_vram device memory alloc
|
||
- venus: Tighten the conditions for guest_vram device memory alloc
|
||
- venus: Make sure that guest allocated blobs from hostmem are mappable
|
||
|
||
Anthony Roberts (1):
|
||
|
||
- glsl: Use unsigned instead of enum type in ir_variable_data
|
||
|
||
Antoine Coutant (1):
|
||
|
||
- clc: retrieve libclang path at runtime.
|
||
|
||
Antonio Gomes (14):
|
||
|
||
- rusticl, meson: Move libc functions to their own crate
|
||
- rusticl, meson: Add gl/egl/glx bindings
|
||
- iris: Fixups in resource_get_handle and resource_from_handle
|
||
- mesa/st: Add new data to mesa_glinterop
|
||
- mesa/st, dri2, wgl, glx: Modify flush_objects interop func to export a fence_fd
|
||
- rusticl: Add xplat helpers to dynamic link interop functions
|
||
- rusticl/device: Function to check for gl interop support
|
||
- rusticl/device: Enable gl_sharing only if create_fence_fd is implemented
|
||
- rusticl: Add functions to create CL ctxs from GL, and also to query them
|
||
- rusticl/format: Add conversion table for GL->CL
|
||
- rusticl: Create CL mem objects from GL
|
||
- rusticl: Add support for cube maps
|
||
- rusticl: Flush objects just before importing them
|
||
- rusticl: Advertise cl_khr_gl_sharing extension
|
||
|
||
Anuj Phogat (1):
|
||
|
||
- intel/l3: Adjust URB weight calculation for gfx12.5+.
|
||
|
||
Asahi Lina (12):
|
||
|
||
- asahi: Fix CDM Launch/Barrier naming
|
||
- asahi: Add extra CDM barrier bit for G13X
|
||
- asahi: Move USC cache flush to agx_batch_init_state
|
||
- asahi: Add more memory barrier opcodes
|
||
- asahi: Add extra barrier for texture atomics on G13X
|
||
- ail: Fix miptree offset generation for compressed textures
|
||
- ail: Add explicit specification of mip level strides
|
||
- ail: Fix tile size & strides for compressed textures
|
||
- asahi: Add .editorconfig for CL files
|
||
- asahi: Implement BO alignment
|
||
- agx: Fix packing of stack map/unmap
|
||
- agx: Add scoreboarding to stack instructions
|
||
|
||
Bas Nieuwenhuizen (11):
|
||
|
||
- radv: Add DGC preprocessing barrier support.
|
||
- radv: Add compute DGC preprocessing support.
|
||
- radv: Add some initial graphics DGC preprocessing support.
|
||
- radv: Add implementation of cmd buffers for a sparse binding queue.
|
||
- radv: Remove the sparse binding queue from coherent images.
|
||
- radv: Move sparse binding into a dedicated queue.
|
||
- nir: Add nir_static_workgroup_size helper.
|
||
- nir: Add pass for clearing memory at the end of a shader.
|
||
- radv: Add option to clear LDS at the end of a shader.
|
||
- radeonsi: Add support to clear LDS at the end of a shader.
|
||
- radv: Use correct writemask for cooperative matrix ordering.
|
||
|
||
Benjamin Lee (14):
|
||
|
||
- nak: make sm available in builders
|
||
- nak: Legalize a bunch of instructions for SM50
|
||
- nak: add IADD instruction for SM50
|
||
- nak: implement ST* and LD* on SM50
|
||
- nak: add ATOM{G,S} encoding for SM50
|
||
- nak: add carry register file
|
||
- nak: move iadd64 construction to a builder method
|
||
- nak: use carry register file for IADD2
|
||
- nak: make as_imm_not_{i,f}20 helper methods public
|
||
- nak: implement SHL and SHR on SM50
|
||
- nak: implement IMUL for SM50
|
||
- nak: encode Dst::None as RZ on SM50
|
||
- nak: implement SHFL on SM50
|
||
- nak: implement VOTE on SM50
|
||
|
||
Boris Brezillon (74):
|
||
|
||
- pan/genxml: Fix "{Last,First} Heap Chunk" field position
|
||
- panfrost: Fix format_minimum_alignment() for v6-
|
||
- pan/bo: Make sure we catch refcnt underflows
|
||
- pan/genxml: Fix 'Shader Program' descriptor definition on v9 and v10
|
||
- pan/decode: Print the resource table label
|
||
- pan/decode: Make CSF decoding more robust to NULL pointers
|
||
- pan/decode: Fix the pan_unpack() call for JUMP instruction unpacking
|
||
- panfrost: Flag the right shader when updating images
|
||
- panfrost: Kill unused panfrost_batch::polygon_list field
|
||
- panfrost: Emit attribs in panfrost_update_state_3d() on bifrost/midgard
|
||
- panfrost: Emit image attribs for compute in panfrost_update_shader_state()
|
||
- panfrost: Rename panfrost_vtable::context_init
|
||
- panfrost: Inline pan_emit_tiler_heap()
|
||
- panfrost: Inline pan_emit_tiler_ctx()
|
||
- panfrost: Count draws at the batch level
|
||
- panfrost: Express the per-batch limit in term of draws
|
||
- panfrost: Count the number of compute jobs at the batch level
|
||
- panfrost: Make panfrost_has_fragment_job() public
|
||
- panfrost: Stop using the scoreboard to check the presence of draws/compute
|
||
- panfrost: Store the fragment job descriptor address in the batch
|
||
- panfrost: Emit the fragment job from panfrost_batch_submit()
|
||
- panfrost: Move the panfrost_emit_tile_map() call around
|
||
- panfrost: Get rid of unused in_sync parameter in panfrost_batch_submit[_ioctl]()
|
||
- panfrost: Get rid of the out_sync parameter in panfrost_batch_submit_jobs()
|
||
- panfrost: Get rid of unused fb parameter passed to panfrost_batch_submit_jobs()
|
||
- panfrost: Add a submit_batch() hook to panfrost_vtable
|
||
- panfrost: Store the index pointer in panfrost_batch
|
||
- panfrost: Stop passing vertex attribute arrays around
|
||
- panfrost: Store varying related fields in panfrost_batch
|
||
- panfrost: Use u_reduced_prim() to do the is_line check
|
||
- panfrost: Move JM specific fields to their own struct
|
||
- panfrost: s/panfrost_emit_vertex_tiler_jobs/jm_push_vertex_tiler_jobs/
|
||
- panfrost: Move the JM-specific bits out of emit_fragment_job()
|
||
- panfrost: Rename several job emission helpers
|
||
- panfrost: Factor out the point-sprite shader update logic
|
||
- panfrost: Factor out the vertex count logic
|
||
- panfrost: Re-order things in panfrost_direct_draw()
|
||
- panfrost: Move all JM-specific bits out of panfrost_direct_draw()
|
||
- panfrost: Use batch->tls.gpu to store the compute TLS descriptor
|
||
- panfrost: Move JM-specific bits out of panfrost_launch_grid_on_batch()
|
||
- panfrost: Move JM specific bits out of panfrost_launch_xfb()
|
||
- panfrost: Drop the vertex_count argument passed to panfrost_batch_get_bifrost_tiler()
|
||
- panfrost: Rename panfrost_batch_get_bifrost_tiler()
|
||
- panfrost: s/panfrost_emit_shader/jm_emit_shader_env/
|
||
- panfrost: s/panfrost_emit_primitive/jm_emit_primitive/
|
||
- panfrost: Rename JM-specific batch submission helpers
|
||
- panfrost: s/preload/jm_preload_fb/
|
||
- panfrost: s/init_batch/jm_init_batch/
|
||
- panfrost: Prepare things for the common/JM cmdstream split
|
||
- panfrost: Move JM helpers to their own source file
|
||
- panfrost: Add a JOBX() macro to simplify job-frontend selection
|
||
- panfrost: Fix multiplanar YUV texture descriptor emission on v9+
|
||
- panfrost: Don't leak NIR compute shaders
|
||
- panfrost: s/pan_scoreboard/pan_jc/
|
||
- panfrost: Rename pan_cs.{c,h} into pan_desc.{c,h}
|
||
- panfrost: Make pan_afbc_compression_mode() per-gen
|
||
- panfrost: Restrict job chain helpers to JM hardware
|
||
- panfrost: Restrict job descriptor emission to JM hardware
|
||
- util/hash_table: Use FREE() to be consistent with the CALLOC_STRUCT() call
|
||
- util/hash_table: Don't leak hash_u64_key objects when the entry exists
|
||
- util/hash_table: Don't leak hash_key_u64 objects when the u64 hash table is destroyed
|
||
- panfrost: Abstract kernel driver operations
|
||
- pan/kmod: Add a backend for the panfrost kernel driver
|
||
- panfrost: Avoid direct accesses to some panfrost_device fields
|
||
- panfrost: Avoid direct accesses to some panfrost_bo fields
|
||
- panfrost: Back panfrost_device with pan_kmod_dev object
|
||
- panfrost: Add a VM to panfrost_device
|
||
- panfrost: Back panfrost_bo with pan_kmod_bo object
|
||
- panfrost: Introduce a PAN_BO_SHAREABLE flag
|
||
- panvk: Pass PAN_BO_SHAREABLE when relevant
|
||
- panfrost: Flag BO shareable when appropriate
|
||
- panvk: Fix tracing
|
||
- panvk: Fix access to unitialized panvk_pipeline_layout::num_sets field
|
||
- panfrost: Clamp the render area to the damage region
|
||
|
||
Boyuan Zhang (4):
|
||
|
||
- gallium/pipe: define hevc max slices number
|
||
- frontend/va: add support for multi slices reflist
|
||
- radeonsi: add new interface to handle multi slice reflist
|
||
- radeonsi/vcn: add new logic for hevc multi slices reflist
|
||
|
||
Brian King ((MEDIA)) (1):
|
||
|
||
- d3d12: Add constraint_set1_flag support
|
||
|
||
Caio Oliveira (90):
|
||
|
||
- anv: Fix leak when compiling internal kernels
|
||
- intel/compiler: Remove unused parameter from brw_nir_adjust_payload()
|
||
- intel/compiler: Take more precise params in brw_nir_optimize()
|
||
- intel/compiler: Remove unused parameter from brw_nir_analyze_ubo_ranges()
|
||
- intel/compiler: Clarify the asserts in nir_load_workgroup_id lowering
|
||
- intel/compiler: Rework opt_split_sends to not rely/modify LOAD_PAYLOAD
|
||
- intel/compiler: Re-enable opt_zero_samples() for Gfx7+
|
||
- intel/compiler: Re-enable opt_zero_samples() in many cases for Gfx12.5
|
||
- intel/compiler: Remove is_tex()
|
||
- intel/compiler: Use linear allocator in parts of brw_schedule_instructions
|
||
- intel/compiler: Remove reference to brw_isa_info from schedule_node
|
||
- intel/compiler: Allocate all schedule_nodes at once
|
||
- intel/compiler: Use array to iterate the scheduler nodes
|
||
- intel/compiler: Add only available instructions to scheduling list
|
||
- intel/compiler: Extract scheduling related basic functions
|
||
- intel/compiler: Cache issue_time information
|
||
- intel/compiler: Remove virtual calls from scheduler
|
||
- intel/compiler: Move FS specific fields to fs_instruction_scheduler
|
||
- intel/compiler: Merge child/latency arrays in schedule_node
|
||
- intel/compiler: Tidy up code in scheduler related to reads_remaining
|
||
- intel/compiler: Move earlier scheduler code that is not mode-specific
|
||
- intel/compiler: Separate schedule_node temporary data
|
||
- intel/compiler: Make scheduler classes take an external mem_ctx
|
||
- intel/compiler: Reuse same scheduler for all pre-RA scheduling modes
|
||
- intel/compiler: Clear up block instructions before re-adding them
|
||
- intel/compiler: Simplify allocation of NIR related arrays
|
||
- intel/compiler: Prefer ctor/dtors in some Google Tests
|
||
- intel/compiler: Don't use fs_visitor::bld in tests
|
||
- intel/compiler: Don't use fs_visitor::bld in fs_reg_alloc
|
||
- intel/compiler: Don't use fs_visitor::bld in thread payload classes
|
||
- intel/compiler: Add a few more helpers to fs_builder
|
||
- intel/compiler: Allow dumping CFG to a specific FILE*
|
||
- intel/compiler: Sort lists of succs and preds in CFG dump output
|
||
- intel/compiler: Add a few tests to opt_predicated_break
|
||
- anv/xe2+: Use Region-based Tessellation redistribution
|
||
- iris/xe2+: Use Region-based Tessellation redistribution
|
||
- intel/compiler: Refactor program exit in intel_clc
|
||
- intel/compiler: Use single variable instead of dynarray
|
||
- intel/compiler: Fix memory leaks in intel_clc
|
||
- intel/compiler: Remove the linking step in intel_clc
|
||
- intel/compiler: Remove unused headers
|
||
- intel/compiler: Move NIR emission code to brw_fs_nir.cpp
|
||
- intel/compiler: Make a NIR intrinsic emission functions static
|
||
- intel/compiler: Make more functions in NIR conversion static
|
||
- intel/compiler: Make functions for NIR control flow conversion static
|
||
- intel/compiler: Make setup functions of NIR emission static
|
||
- intel/compiler: Make non-intrinsic NIR conversion functions static
|
||
- intel/compiler: Make NIR atomic conversion functions static
|
||
- intel/compiler: Make NIR resources helpers static
|
||
- intel/compiler: Move nir_ssa_value into a local structure
|
||
- intel/compiler: Move remaining NIR conversion fields to nir_to_brw_state
|
||
- intel/compiler: Stop using fs_visitor::bld field in NIR conversion
|
||
- intel/compiler: Annotate and use nir_to_brw_state::bld
|
||
- intel/compiler: Don't use fs_visitor::bld in remaining places
|
||
- intel/compiler: Remove fs_visitor::bld
|
||
- intel/compiler: Make fs_visitor not depend on fs_builder
|
||
- intel/compiler: Make fs_builder include fs_visitor and not the other way
|
||
- intel/compiler: Add ctor to fs_builder that just takes the shader
|
||
- intel/compiler: Create and use nir_to_brw() function
|
||
- intel/compiler: Use reference instead of pointer for nir_to_brw_state
|
||
- intel/compiler: Use reference instead of pointer for fs_visitor
|
||
- compiler/glsl: Reduce scope of is_anonymous
|
||
- clover: Remove usage of glsl_type C++ helpers
|
||
- compiler/types: Add a few more helpers to get builtin types
|
||
- intel/compiler: Use C helpers to access builtin types
|
||
- compiler: Remove C++ static member pointers to builtin types
|
||
- intel/compiler: Use glsl_type C helpers
|
||
- r600/sfn: Use glsl_type C helpers
|
||
- nouveau: Use glsl_type C helpers
|
||
- nir: Use glsl_type C helpers
|
||
- mesa: Use glsl_type C helpers
|
||
- lima: Use glsl_type C helpers
|
||
- compiler/types: Add a few more glsl_type C helpers
|
||
- glsl: Use glsl_type C helpers
|
||
- compiler/types: Remove glsl_type C++ helpers
|
||
- compiler/types: Use a typedef for glsl_type
|
||
- intel/cmat: Add pass to lower cooperative matrix to subgroup operations
|
||
- intel/dev: Add cooperative matrix configuration information
|
||
- anv: Implement VK_KHR_cooperative_matrix
|
||
- util: Add a way to set the min_buffer_size in linear_alloc
|
||
- spirv: Use linear_alloc for parsing-only data
|
||
- spirv: Use value_id_bound to set initial memory allocated
|
||
- intel/fs: Only allocate acp_entry if we are adding one
|
||
- intel/fs: Use linear allocator in opt_copy_propagation
|
||
- intel/fs: Use linear allocator in fs_live_variables
|
||
- anv: Don't print warnings for GRL kernel compilations
|
||
- intel/compiler: Use INTEL_DEBUG=cs to ask for brw_compiler output
|
||
- nir: Disable -Wmisleading-indentation when compiling with GCC
|
||
- ci: Add Werror=misleading-indentation to debian-clang
|
||
- intel/compiler: Fix rebuilding the CFG in fs_combine_constants
|
||
|
||
Casey Bowman (1):
|
||
|
||
- anv: Override vendorID for Diablo IV
|
||
|
||
Chia-I Wu (14):
|
||
|
||
- radv: fix vkCmdCopyImage2 for emulated etc2/astc
|
||
- radv: stop using vk_render_pass_state::render_pass
|
||
- vulkan, tu, pvr: remove vk_render_pass_state::render_pass
|
||
- radv: fix image view extent override for astc
|
||
- radv: minor clean up to image view extent override
|
||
- ac: be careful with stencil_offset override
|
||
- radv: disable TC-compat htile on GFX9 in some cases
|
||
- radv: fix VkDrmFormatModifierProperties2EXT for multi-planar formats
|
||
- radv: fix VkSubresourceLayout2KHR for multi-planar formats with modifiers
|
||
- radv: fix a typo in radv_image_view_make_descriptor
|
||
- radv: fix asserts for radv_init_metadata
|
||
- radv: convert a check in radv_get_memory_fd to assert
|
||
- vk/util: ignore unsupported feature structs
|
||
- Revert "vk/util: ignore unsupported feature structs"
|
||
|
||
Chris Spencer (7):
|
||
|
||
- meson: Add option to ignore artificial Android limitations
|
||
- android.mk: Add option to pass arbitrary parameters to meson
|
||
- anv/android: Only limit advertised Vulkan version in strict mode
|
||
- radv/android: Only limit advertised Vulkan version in strict mode
|
||
- v3dv/android: Only limit advertised Vulkan version in strict mode
|
||
- vn/android: Only limit advertised Vulkan version in strict mode
|
||
- vulkan/android: Only limit advertised extensions in strict mode
|
||
|
||
Christian Gmeiner (13):
|
||
|
||
- agx: Re-index nir defs to reduce memory usage
|
||
- ci/etnaviv: Update ci expectation
|
||
- etnaviv: rs: Call etna_rs_gen_clear_surface(..) when needed
|
||
- etnaviv: Mark etna_rs_gen_clear_surface(..) private
|
||
- docs: Update etnaviv extensions
|
||
- etnaviv: Update headers from rnndb
|
||
- etnaviv: Add static_assert(..) to catch memory corruption
|
||
- isaspec: Add bool_inv type to print inverted bools
|
||
- etnaviv: Add isaspec support
|
||
- etnaviv: disassembler: Switch to isaspec
|
||
- mesa: Drop not used program_written_to_cache
|
||
- nir/opt_peephole_select: handle speculative ubo loads
|
||
- pan/mdg: Use nir_builder for load_sampler_lod_parameters_pan
|
||
|
||
Colin Marc (1):
|
||
|
||
- vulkan video: correctly set SPS VUI bits
|
||
|
||
Connor Abbott (32):
|
||
|
||
- util/rb_tree: Fix editorconfig
|
||
- util/rb_tree: Add augmented trees and interval trees
|
||
- freedreno/ci: Remove minetest trace
|
||
- v3d/ci: Remove minetest trace
|
||
- vk,lvp,tu,radv,anv: Add common vk_*_pipeline_create_flags() helper
|
||
- vk/graphics_state: Support VK_KHR_maintenance5
|
||
- vk/graphics_state, tu: Rewrite renderpass flags handling
|
||
- vk/graphics_state: Support VK_EXT_attachment_feedback_loop_dynamic_state
|
||
- vk/graphics_state: Add vk_pipeline_flags_feedback_loops helper
|
||
- tu: Assume no raster-order attachment access with NULL DS/blend state
|
||
- tu: Fix order of rasterizer_discard check
|
||
- tu: Make sure copies to half-float formats are bit exact
|
||
- tu: Fix getting VkDescriptorSetVariableDescriptorCountLayoutSupport
|
||
- ir3/ra: Don't swap killed sources for early-clobber destination
|
||
- nir: Add quad vote intrinsics
|
||
- amd: Implement quad_vote intrinsics
|
||
- nir/subgroups: Add option to lower Boolean subgroup reductions
|
||
- amd: Enable boolean subgroup lowering
|
||
- tu: Fix re-emitting VS param state after it is re-enabled
|
||
- tu: Don't use pipeline layout to emit shared const enable
|
||
- tu: Rework dynamic offset handling
|
||
- tu: Make filling out tu_program_state not depend on the pipeline
|
||
- tu: Move shader linking to tu_shader.cc
|
||
- freedreno/afuc: Handle store instruction on a5xx
|
||
- freedreno/afuc: Add separate "SQE registers"
|
||
- freedreno/afuc: Use SQE registers for call stack
|
||
- freedreno/afuc: Add syntax for pre-increment addressing
|
||
- freedreno/afuc: Decode (sdsN) modifier
|
||
- freedreno: Update more control/pipe registers for a7xx
|
||
- freedreno/afuc: README updates for a7xx
|
||
- freedreno/afuc: Fix gen autodetection for a7xx
|
||
- ir3/legalize: Fix helper propagation with b.any/b.all/getone
|
||
|
||
Corentin Noël (10):
|
||
|
||
- mesa/bufferobj: ensure that very large width+offset are always rejected
|
||
- virgl: fill the array_size value when using PIPE_TEXTURE_CUBE
|
||
- virgl/texture: Align destination box to block depth
|
||
- mesa/ffvs: Use gl_state_index16 in helpers directly
|
||
- gallivm: Initialize indir_index to NULL before use
|
||
- gallivm/lp_bld_nir_aos: Use TGSI instead of PIPE enum
|
||
- mesa: Use a switch for state_iter and be more precise about its type
|
||
- frontends/va: Remove wrong use of ProfileToPipe
|
||
- virgl: Only send the same amount of data than declared in pipe_sampler_state
|
||
- virgl: Assert build_id_note before dereferencing it
|
||
|
||
Daniel Almeida (33):
|
||
|
||
- nak: derive From<OpFoo> for Op through a proc macro
|
||
- nak: make Instr::new() generic
|
||
- nak: compiler: add From<T:Into<Op>> for Instr
|
||
- nak: compiler: replace Instr::new(..) with OpFoo {}.into()
|
||
- nak: Heap-allocate Instrs
|
||
- nak: Do not allocate vectors needlessly in optimization passes
|
||
- nak: add support for floor, ceil and trunc
|
||
- nak: run nir_lower_frexp and nir_opt_algebraic_late
|
||
- nak: more lowerings
|
||
- nak: change ishl data type to I32
|
||
- nak: add support for nir_op_isign
|
||
- nak: Add support for nir_op_bitcount
|
||
- nak: add support for nir_op_bitfield_reverse
|
||
- nak: add support for findmsb,findlsb
|
||
- nak: add support for packhalf2x16_split
|
||
- nak: add support for nir_op_unpack_half_2x16_split_{x|y}
|
||
- nak: add support for atomic cmpxcgh on images
|
||
- nak/sm50: rewrite encode_iadd2 to not use encode_alu()
|
||
- nak: sm50: rewrite fsetp to not use encode_alu
|
||
- nak: sm50: Rewrite fmnmx to not use encode_alu
|
||
- nak: sm50: rewrite fmul to not use encode_alu
|
||
- nak: sm50: rewrite fset to not use encode_alu
|
||
- nak: sm50: rewrite iabs to not use encode_alu
|
||
- nak: sm50: convert sel to not use encode_alu()
|
||
- nak: sm50: convert i2f to not use encode_alu()
|
||
- nak: sm50: rewrite encode_f2f to not use encode_alu()
|
||
- nak: convert encode_imad to not use encode_alu()
|
||
- nak: sm50: rewrite encode_popc to not use encode_alu()
|
||
- nak: sm50: rewrite encode_prmt to not use encode_alu()
|
||
- nak: sm50: remove encode_alu() and friends
|
||
- nak/sm50: remove ALUSrc and friends
|
||
- nak/sm50: remove \*fmod* calls from iabs
|
||
- nak: sm50: fix ineg legalization
|
||
|
||
Daniel Schürmann (24):
|
||
|
||
- nir/lower_subgroups: optimize reductions with cluster_size == 1
|
||
- nir: optimize open-coded quadVote* directly to new nir_quad intrinsics
|
||
- aco: delete instruction selection for boolean subgroup operations
|
||
- nir: remove info.fs.needs_all_helper_invocations
|
||
- nir/gather_info: add missing wide subgroup operations
|
||
- nir: add info.fs.require_full_quads
|
||
- aco: enable helper lanes if shader->info.fs.require_full_quads
|
||
- amd: rename max_wave64_per_simd -> max_waves_per_simd
|
||
- aco: rename max_wave64_per_simd -> max_waves_per_simd
|
||
- radv: fix number of physical SGPRs on GFX10+
|
||
- aco: remove VCCZ and EXECZ register handling
|
||
- nir/opt_loop: move loop control-flow optimizations into separate pass
|
||
- treewide: replace calls to nir_opt_trivial_continues() with nir_opt_loop()
|
||
- nir: remove nir_opt_trivial_continues()
|
||
- nir: remove redundant passes from nir_opt_if()
|
||
- nir/opt_loop_cf: generalize removal of "trivial" continues
|
||
- aco: fix should_form_clause() for memory instructions without operands
|
||
- aco: form clauses for LDS instructions
|
||
- aco: add new post-RA scheduler for ILP
|
||
- aco: refactor and speed-up dead code analysis
|
||
- nir/opt_move_discards_to_top: don't schedule discard/demote across subgroup operations
|
||
- nir/gather_info: fix enumeration of wide subgroup intrinsics
|
||
- aco: give spiller more room to assign spilled SGPRs to VGPRs
|
||
- aco/insert_exec_mask: Fix unconditional demote at top-level control flow.
|
||
|
||
Daniel Stone (7):
|
||
|
||
- ci: Try really hard to print final result string
|
||
- ci/radeonsi: Occlusion queries are flaky on stoney
|
||
- ci: Fix trivial typo in ARTIFACTS_BASE_URL
|
||
- panfrost/ci: Remove Vulkan expectations from G57
|
||
- panfrost/ci: Add environment variable to suppress warnings
|
||
- panfrost/ci: Skip broken image copy tests
|
||
- ci: Re-enable Collabora farm
|
||
|
||
Danylo Piliaiev (15):
|
||
|
||
- tu: Fix reading of stale (V)PC_PRIMITIVE_CNTL_0
|
||
- tu/a7xx: Zero out A7XX_VPC_PRIMITIVE_CNTL_0 in 3d blits
|
||
- tu/a6xx: Exclude REG_A6XX_TPL1_UNKNOWN_B602 from reg stomping
|
||
- tu/a7xx: Fix occlusion queries on pre-A740 GPUs
|
||
- tu: Always print startup failure messages
|
||
- tu: Return error when GPU is unsupported
|
||
- freedreno/devices: Support Adreno 725
|
||
- tu: Add a725 workaround dispatch at the start of each cmdbuf
|
||
- freedreno/devices: Separate device definition into base + gen features
|
||
- freedreno,tu,ir3: Pass fd_dev_info into ir3_compiler_create
|
||
- freedreno,tu: Add env vars to modify fd_dev_info
|
||
- freedreno: Add a644 support
|
||
- freedreno/devices: Update a690 magic regs from WSL blob
|
||
- turnip: Disable UBWC for D/S images on A690
|
||
- freedreno: Disable UBWC for D/S images on A690
|
||
|
||
Dave Airlie (38):
|
||
|
||
- vulkan: update video headers
|
||
- vulkan/video: add support for h264 encode to common code
|
||
- vulkan/video: add h265 encode support
|
||
- vulkan/video: add h264 nal enum
|
||
- vulkan/video: add a nal_unit lookup for hevc
|
||
- util: add a bitstream encoder for video stream headers.
|
||
- vulkan/video: add h264 level idc convertor utility
|
||
- vulkan/video: add a h265 level translator.
|
||
- vulkan/video: add h264 headers encode
|
||
- vulkan/video: add h265 header encoders.
|
||
- nak: fix backtrace crash running computeheadless
|
||
- nak: make ipa encoding match the order in codegen gv100
|
||
- nak: do perspective divide for interp none as well
|
||
- nvk/xfb: set correct counter buffer for writing stream out counters.
|
||
- nvk/nil: allow storage on VK_FORMAT_A2B10G10R10_UINT_PACK32
|
||
- nvk: fix transform feedback with multiple saved counters.
|
||
- nvk/nak/xfb: handle skipping properly when setting xfb_attr.
|
||
- nvk: drop unneeded shader type conversion function
|
||
- nvk/nak: fix regression with shf changes on sm70
|
||
- intel/compiler: move gen5 final pass to actually be final pass
|
||
- vulkan/video: drop encode beta checks and rename EXT->KHR
|
||
- gallivm: handle llvm 16 atexit ordering problems.
|
||
- intel/compiler: fix release build unused variable.
|
||
- intel/compiler: revert part of "Move earlier scheduler code that is not mode-specific"
|
||
- llvmpipe: fix caching for texture shaders.
|
||
- gallivm/sample: refactor first/last level handling and use level_zero_only.
|
||
- gallivm/sample: add some num_samples vs level zero only support
|
||
- gallivm/sample: make the load_mip helper useful outside this file.
|
||
- gallivm/lp: reduce size of lp_jit_texture.
|
||
- gallivm/lp: reduce image descriptor size.
|
||
- gallivm/lp: merge sample info into normal info
|
||
- gallivm/lp: move sampler index around to reduce struct
|
||
- lavapipe: bump .maxResourceDescriptorBufferRange
|
||
- intel/compiler: reemit boolean resolve for inverted if on gen5
|
||
- radv: don't emit cp dma packets on video rings.
|
||
- radv/video: refactor sq start/end code to avoid decode hangs.
|
||
- radv: don't submit empty command buffers on encoder ring.
|
||
- gallivm: passing fp16_split_fp64 to fp16 lowering.
|
||
|
||
Dave Stevenson (2):
|
||
|
||
- gallium: Add more TinyDRM drivers to the list of kmsro drivers
|
||
- gallium: Add udl (DisplayLink) to the list of kmsro drivers
|
||
|
||
David Heidelberg (53):
|
||
|
||
- ci/docs: add coreutils
|
||
- ci: bump tags
|
||
- ci/zink: reduce premerge testing on a618 to ~ 12 minutes
|
||
- ci: hide Mesa install phase
|
||
- ci: drop clover from release builds and remove rusticl build
|
||
- ci: simplify debian-rusticl-testing definition
|
||
- ci: drop mingw and wine from the x86_64 build container
|
||
- ci: always cleanup pip and cargo leftovers
|
||
- ci: bashify scripts, use arrays
|
||
- ci: drop debootstrap, unused
|
||
- ci/panfrost: run T860 traces as intended (nightly job)
|
||
- ci/venus: reduce pre-merge to fit under 15 min
|
||
- ci/alpine: do not store apk cache
|
||
- ci/wine: move wine configuration into rootfs where is wine available
|
||
- Revert "ci/wine: move wine configuration into rootfs where is wine available"
|
||
- ci/lava: add wine into the amd64 ephemeral container packages
|
||
- ci/zink: restore full premerge testing on Adreno 618
|
||
- ci: fixup section names
|
||
- ci/nouveau: define a kernel and dtb, so we can fetch it from external sources
|
||
- ci: inject gfx-ci/linux S3 artifacts without rebuilding containers
|
||
- ci/zink: disable nheko trace, as it sometimes crashes
|
||
- gitlab: make commit more commit-like formatted
|
||
- ci: tag sanity, rustfmt and clang-format job as a "placeholder" job
|
||
- ci/traces: drop the freedoom-phase2-gl-high.trace
|
||
- ci: disable Anholt farm
|
||
- ci/freedreno: disable a660 as it's down now
|
||
- Revert "ci/freedreno: disable a660 as it's down now"
|
||
- ci: bump kernel to 6.6.4
|
||
- docs: drop unused manual optimizations override
|
||
- ci/freedreno: mark unvanquished-lowest trace as flaky and skip
|
||
- ci/freedreno: switch Adreno 630 boards back to 6.4 kernel
|
||
- ci/freedreno: increase fraction for Vulkan testing
|
||
- ci/tu: add another failing pipeline strip draw
|
||
- ci/freedreno: extend timeout for full runs
|
||
- ci/freedreno: re-enable two Adreno 618 tests
|
||
- ci/freedreno: timestamp-get no longer fails on Adreno
|
||
- ci/freedreno: downgrade a618_piglit to 6.4 kernel
|
||
- ci/freedreno: fail introduced by ARB_post_depth_coverage
|
||
- rusticl: add freedreno alias for RUSTICL_ENABLE
|
||
- ci/freedreno: more issues showed up on a618, let's use 6.4
|
||
- ci/austriancoder: separate HW definition from SW
|
||
- ci/freedreno: downgrade whole Adreno 6xx series, incl. zink-a618 jobs
|
||
- ci/broadcom: separate HW definition from SW
|
||
- ci: skip EGL functional color_clears tests for Wayland
|
||
- ci/lava: separate HW definitions from SW
|
||
- ci/google: re-enable farm
|
||
- ci/zink: update piano trace
|
||
- ci/radeonsi: disable VA-API testing on raven
|
||
- ci: enable ci-deb-repo for libdrm 2.4.119 (and others in the future)
|
||
- ci/alpine: update to latest to get libdrm 2.4.119
|
||
- ci: bump Fedora and Android libdrm2 to 2.4.119
|
||
- ci/rootfs: add libdrm also inside the rootfs
|
||
- ci/deqp: uprev deqp-runner for Linux too to 0.18.0
|
||
|
||
David Rosca (19):
|
||
|
||
- frontends/va: Map decoder and postproc surfaces for reading
|
||
- radeonsi/vce: Implement destroy_fence vfunc
|
||
- radeonsi/uvd: Implement destroy_fence vfunc
|
||
- radeonsi/uvd_enc: Implement destroy_fence vfunc
|
||
- radeonsi/uvd_enc: Fix leaking session info buffer
|
||
- Revert "radeon/radeon_vce: fix out of target bitrate in CBR mode (H.264)"
|
||
- radeonsi/vce: Tweak motion estimation params for better quality
|
||
- radeonsi/vce: Add VUI parameters in output bitstream
|
||
- radeonsi/uvd_enc: Add VUI parameters in output bitstream
|
||
- radeonsi: Fix offset for linear surfaces on GFX < 9
|
||
- gallium/auxiliary/vl: Fix coordinates clamp in compute shaders
|
||
- gallium/auxiliary: Fix coordinates clamp in util_compute_blit
|
||
- gallium/auxiliary/vl: Scale dst_rect x0/y0 when rendering chroma plane
|
||
- gallium/auxiliary/vl: Support interleaved input in deinterlace filter
|
||
- Revert "frontends/va: Alloc interlaced surface for interlaced pics"
|
||
- gallium/auxiliary: NIR blit_compute_shader
|
||
- gallium/auxiliary/vl: NIR compute shaders
|
||
- util/rbsp: Fill bits twice if reading more than 16 bits
|
||
- radeonsi/vcn: Fix H264 slice header when encoding I frames
|
||
|
||
Dennis Bonke (1):
|
||
|
||
- mesa: add managarm support
|
||
|
||
Dmitry Baryshkov (9):
|
||
|
||
- freedreno/regs/mdp_common: change BPC1 -> BPC4
|
||
- freedreno/regs/mdp_common: fix BPC comments
|
||
- freedreno/regs: add mdp_fetch_mode enum
|
||
- freedreno/drm: fallback to default BO allocation if heap alloc fails
|
||
- ir3: fix shift amount for 8-bit shifts
|
||
- ir3/a6xx: fix ldg/stg of ulong2 and ulong4 data
|
||
- freedreno/drm: notify valgrind about FD_BO_NOMAP maps
|
||
- freedreno/drm: don't crash in heap allocator when run under valgrind
|
||
- freedreno/drm: don't crash for unsupported devices
|
||
|
||
Dudemanguy (1):
|
||
|
||
- vulkan/wsi/wayland: fix wl_event_queue memory leak
|
||
|
||
Dylan Baker (3):
|
||
|
||
- docs: add release notes for 23.2.1
|
||
- docs: Add sha256 sum for 23.2.1
|
||
- meson: add wrap for libdrm
|
||
|
||
Echo J (2):
|
||
|
||
- nvk: Set HOST_CACHED_BIT for the GTT type
|
||
- vulkan: Remove nonexistent output in vk_synchronization_helpers target
|
||
|
||
Eric Engestrom (236):
|
||
|
||
- VERSION: bump to 24.0
|
||
- docs: reset new_features.txt
|
||
- docs: update calendar for 23.3.0-rc1
|
||
- ci/rpi4: group all spec\@ext_image_dma_buf_import\@ext_image_dma_buf_import-sample_* together
|
||
- ci/rpi4: add spec\@ext_image_dma_buf_import\@ext_image_dma_buf_import-sample_yvyu to the list of known failures
|
||
- ci/zink+radv: add another flake on polaris
|
||
- ci: drop confusing fake \`rules`, \`if` and \`when` on the list of rules strings
|
||
- docs/ci: allow sanity job to be missing
|
||
- ci: don't run sanity in Marge pipelines
|
||
- ci: add \`.never-post-merge-rules` to avoid re-running pre-merge jobs after merging
|
||
- broadcom: use \`.never-post-merge-rules` for all rpi tests
|
||
- ci/radeonsi: add another flake
|
||
- rpi4/ci: add more known dEQP-EGL.functional.*.*_context.gles*.other failures
|
||
- rpi4/ci: move \`spec\@!opengl 1.1\@depthstencil-default_fb-drawpixels-24_8 samples=2` from fails for flakes after an UnexpectedPass
|
||
- rpi4/ci: remove \`spec\@!opengl 1.1\@depthstencil-default_fb-drawpixels-32f_24_8_rev samples=2` from fails as it's a flaky test and already marked as such
|
||
- Revert "ci: backport two mesh/task query fixes for VKCTS"
|
||
- ci/build-deqp: stop ignoring failures while fetching patches
|
||
- ci/build-deqp: split deqp version into a variable
|
||
- ci/build-deqp: move mkdir earlier
|
||
- ci/build-deqp: print more detailed information about what deqp version is running
|
||
- ci: bump image tags to rebuild deqp
|
||
- ci/rules: add missing clang-format files to what needs containers to build
|
||
- broadcom/ci: merge gl test lists to use a single deqp instance
|
||
- broadcom/ci: fix list indentation
|
||
- broadcom/ci: split broadcom-common manual rules to .broadcom-common-manual-rules
|
||
- vc4/ci: add manual variant of .vc4-rules
|
||
- v3dv/ci: add manual variant of .v3dv-rules
|
||
- v3d/ci: add "full run" variant of v3d-rpi4-gl:arm64 as a manual job
|
||
- v3dv/ci: add "full run" variant of v3dv-rpi4-vk:arm64 as a manual job
|
||
- vc4/ci: add piglit "full run" variant of vc4-rpi3-gl:arm32 as a manual job
|
||
- rpi4/ci: skip more timing out tests in the dEQP-VK.ssbo.layout.* group
|
||
- zink+radv/ci: simplify deqp config
|
||
- zink+radv/ci: ensure renderer is "zink on radv"
|
||
- ci: restore sanity (aka. Revert "ci: don't run sanity in Marge pipelines")
|
||
- gitlab_gql: strip newline at the end of the token file
|
||
- ci_run_n_monitor: compile target_jobs_regex only once
|
||
- ci/gitlab_gql: stop re-compiling regex now that all users pre-compile it
|
||
- v3d/ci: run manual jobs in daily pipeline
|
||
- radeonsi/ci: document new failures and flakes
|
||
- ci: disable lima farm as it appears to be down
|
||
- radv/ci: add navi21 flakes
|
||
- radv/ci: add vega10 flakes
|
||
- radv/ci: add polaris10 flakes
|
||
- radv+zink/ci: add polaris10 flakes
|
||
- radv+zink/ci: add navi10 flakes
|
||
- bin/gitlab_gql: resolve sha locally to be able to use things like \`HEAD`
|
||
- gitlab_gql: make \`--rev` optional, defaulting to \`HEAD`
|
||
- bin/gitlab_gql: fix command in example
|
||
- bin/gitlab_gql: only get the pipeline when a pipeline is needed
|
||
- v3d/ci: add new failures
|
||
- bin/gitlab_gql: only allow a single \`--print-\*` argument per invocation
|
||
- bin/gitlab_gql: rename get_job_final_definition() to print\_...() since that's what it actually does
|
||
- bin/gitlab_gql: deduplicate fetch_merged_yaml() logic between print branches
|
||
- bin/gitlab_gql: give a better name to the --print-job-manifest argument value than PRINT_JOB_MANIFEST
|
||
- ci/valve-infra: ensure the correct farm picks up the job
|
||
- docs: update calendar for 23.3.0-rc{2,3,4} and add another release candidate
|
||
- util/xmlconfig: drop default SYSCONFDIR & DATADIR values
|
||
- lima: drop unused lima_get_absolute_timeout()
|
||
- intel/ci: fix gl/vk dependencies in hsw jobs
|
||
- intel/dev: use libdrm.h wrapper to support builds without libdrm
|
||
- ci_run_n_monitor: require user to add an explicit \`.*` at the end if jobs like \`*-full` are wanted
|
||
- amd/ci: avoid re-running all the test jobs when changing the expectations for only one of them
|
||
- egl/dri2: increase NUM_ATTRIBS to fit all the attributes
|
||
- asahi: use util_resource_num() instead of open-coding it
|
||
- ci/piglit: specify only the traces file in the job config
|
||
- amd/ci: track changes to the traces config file as well
|
||
- ci: fix kdl commit fetch
|
||
- ci: uprev deqp-runner from 0.16.1 to 0.18.0
|
||
- ci/deqp-runner: turn paths in errors into links
|
||
- docs: update calendar for 23.0.0-rc5
|
||
- docs: add another -rc
|
||
- ci: use released version of meson
|
||
- lp: make sure 0xff is unsigned before shifting it past signed int range
|
||
- intel/perf: fix regex escaping
|
||
- intel/ci: fix .hasvk-manual-rules
|
||
- docs: update calendar for 23.3.0
|
||
- docs/calendar: add 23.3.x releases
|
||
- bin/python-venv: detect python version change
|
||
- ci: disable opengl & gles in debian-vulkan build
|
||
- radv/ci: add navi21-aco flake
|
||
- bin/gen_release_notes: fix regex raw string
|
||
- bin/python-venv: fix venv folder check
|
||
- bin/gen_release_notes: include removed 'new_features.txt' in commit
|
||
- docs: add release notes for 23.3.0
|
||
- docs: add sha256sum for 23.3.0
|
||
- docs: fix release date for 23.3.0
|
||
- turnip: fix typo in comment
|
||
- ci_run_n_monitor: allow picking a pipeline by its MR
|
||
- amd/ci: radeonsi is gl, not vk
|
||
- v3dv: update symbols that have become aliases for newer ones
|
||
- v3dv: drop duplicate flag
|
||
- radv: update symbols that have become aliases for newer ones
|
||
- pvr: update symbols that have become aliases for newer ones
|
||
- anv: update symbols that have become aliases for newer ones
|
||
- hasvk: update symbols that have become aliases for newer ones
|
||
- amd/ci: fix yaml indentation
|
||
- amd/ci: split common amd files list from radeonsi files list
|
||
- amd/ci: limit radv jobs to radv + aco files changes
|
||
- nvk: update symbols that have become aliases for newer ones
|
||
- vk/runtime: update symbols that have become aliases for newer ones
|
||
- vk/wsi: update symbols that have become aliases for newer ones
|
||
- vk/util: update symbols that have become aliases for newer ones
|
||
- vk/overlay-layer: update symbols that have become aliases for newer ones
|
||
- venus: update symbols that have become aliases for newer ones
|
||
- venus: fix typo in comment
|
||
- amd/ci: reuse .radeonsi-rules in .radeonsi-vaapi-rules
|
||
- nvk: use \`||` instead of \`|` between bools
|
||
- radeonsi/ci: update vangogh piglit expectations
|
||
- freedreno/ci: add flake seen on a630
|
||
- freedreno/ci: add more flakes seen on a630
|
||
- freedreno/ci: add more a630 flakes
|
||
- v3d: drop leftover from "move v3d_tiling to common"
|
||
- radeonsi/ci: track changes to \`vpelib`
|
||
- turnip: update symbols that have become aliases for newer ones
|
||
- util/blob: fix trivial typo
|
||
- ci: explain what we mean by the various types of pipelines
|
||
- ci: turn comment into code in \`sanity` job rules
|
||
- ci: identify merge request pipelines using \`$CI_PIPELINE_SOURCE == merge_request_event` instead of \`$CI_COMMIT_BRANCH` being missing
|
||
- ci: rename is-pre-merge-for-marge to is-merge-attempt to be clearer
|
||
- ci: drop containers, builds, and tests from post-merge pipeline
|
||
- ci: add pipeline for direct pushes to main
|
||
- ci: give an explicit priority to the scheduled nightly pipelines
|
||
- ci: clean up pre-merge and fork pipelines rules
|
||
- ci: make sure pre-merge pipelines have the same jobs as merge pipelines
|
||
- ci: improve comments
|
||
- ci: take microsoft farm offline
|
||
- ci: fix rules for formatting checks
|
||
- zink/ci: fix yaml indentation
|
||
- zink/ci: use variable to avoid repeating the list
|
||
- zink/ci: expand first (and only) level of folders in the list of files
|
||
- zink/ci: run only the relevant jobs when changing the ci expectations
|
||
- panfrost/ci: fix yaml indendation
|
||
- panfrost/ci: run only the relevant jobs when changing the ci expectations
|
||
- freedreno/ci: fix yaml indentation
|
||
- freedreno/ci: run only the relevant jobs when changing the ci expectations
|
||
- intel/ci: fix yaml indentation
|
||
- intel/ci: deduplicate common intel files rules
|
||
- intel/ci: expand first level of common intel files
|
||
- intel/ci: anv changes should only trigger anv jobs
|
||
- intel/ci: hasvk changes should only trigger hasvk jobs
|
||
- intel/ci: run only the relevant jobs when changing the ci expectations
|
||
- docs/calendar: add 24.0 branchpoint and release schedule
|
||
- etnaviv/ci: fix yaml indentation
|
||
- etnaviv/ci: expand first level of files in src/etnaviv/
|
||
- etnaviv/ci: run only the relevant jobs when changing the ci expectations
|
||
- broadcom/ci: avoid running the rpi4 jobs when changing the rpi3 expectations, and vice-versa
|
||
- vk/update-aliases.py: drop dead --check-only
|
||
- vk/update-aliases.py: allow specifying the files we want to update
|
||
- vk/update-aliases.py: handle "no match" grep call
|
||
- vk/update-aliases.py: sort files when informing the user of the matches
|
||
- vk/update-aliases.py: simplify addition of other concatenated prefixes
|
||
- vk/update-aliases.py: handle more concatenated prefixes
|
||
- vk/update-aliases.py: enforce correct list order
|
||
- vk/update-aliases.py: only apply renames for the vulkan api (not vulkansc)
|
||
- v3dv/ci: only trigger on relevant changes
|
||
- a630/ci: add another flake
|
||
- freedreno/ci: move hang-y a630 jobs from pre-merge to nightly
|
||
- spirv: add missing build dependency
|
||
- ci/b2c: drop passthrough of unset CI_JOB_JWT
|
||
- ci/b2c: stop ignoring errors in before_script
|
||
- ci/b2c: fix indentation of comment and after_script: list
|
||
- ci/b2c: drop unused B2C_EXTRA_VOLUME_ARGS
|
||
- ci/b2c: tags are mandatory
|
||
- ci/b2c: drop support for harbor.freedesktop.org
|
||
- ci/b2c: drop unused --volume and --mount-volume
|
||
- ci/b2c: always define job_volume_exclusions
|
||
- ci/b2c: always define cmdline_extras
|
||
- ci/b2c: use with:write instead of manually doing open;write;close
|
||
- ci/b2c: export B2C_TEST_SCRIPT
|
||
- ci/b2c: use envvars directly instead of converting them back and forth into cli args
|
||
- ci/b2c: import all variables starting with \`B2C_`
|
||
- ci/b2c: rename B2C_TEST_SCRIPT to B2C_CONTAINER_CMD to match the automatic import
|
||
- ci/b2c: identify dut by its id instead of its tags
|
||
- docs: add release notes for 23.3.1
|
||
- docs: add sha256sum for 23.3.1
|
||
- docs: update calendar for 23.3.1
|
||
- ci: deduplicate constructing the ARTIFACTS_BASE_URL
|
||
- bin/gitlab_gql: fix --print-merged-yaml when --rev != HEAD
|
||
- bin/gitlab_gql: print merged yaml as yaml instead of a python dict
|
||
- v3d/ci: add flake
|
||
- ci: fix indentation
|
||
- ci: run every test when changing the build
|
||
- docs: drop \`:` in title
|
||
- radv/ci: add flake
|
||
- docs: document how to build the docs
|
||
- vulkan/wsi: fix build when platform headers are installed in non-standard locations
|
||
- ci/build: drop redundant meson/build.sh from jobs that already inherit from .meson-build
|
||
- radv/ci: add flake on raven
|
||
- ci: add nvk to the clang build
|
||
- ci: disable collabora farm as it is currently offline
|
||
- ci: fix farm restore pipelines
|
||
- meson: always define {,DRAW_}LLVM_AVAILABLE one way or the other
|
||
- docs: add release notes for 23.3.2
|
||
- docs: add sha256sum for 23.3.2
|
||
- docs: update calendar for 23.3.2
|
||
- meson: update expat wrap
|
||
- meson: update libarchive wrap
|
||
- meson: update libxml2 wrap
|
||
- meson: update zlib wrap
|
||
- meson: use \`allow_fallback` instead of manually listing the deps and what they provide
|
||
- ci/containers: use build-libdrm.sh in debian/android
|
||
- Revert "meson: add wrap for libdrm"
|
||
- zink: update symbols that have become aliases for newer ones
|
||
- zink/requirements: update feature and property names that have been promoted
|
||
- docs/backport-mr: fix invalid nested formatting
|
||
- docs: fix list whitespace
|
||
- docs: mention that python package \`packaging` is required on python 3.12+
|
||
- lvp: update symbols that have become aliases for newer ones
|
||
- egl: only accept APIs that are compiled in
|
||
- ci: split & reuse debian version identifier
|
||
- ci: convert several \`find | xargs` to \`find -exec`
|
||
- ci/deqp: set default platform to \`default` instead of glx, to also support wayland
|
||
- docs: add release notes for 23.3.3
|
||
- docs: add sha256sum for 23.3.3
|
||
- docs: update calendar for 23.3.3
|
||
- docs: close the 23.2 cycle
|
||
- VERSION: bump for 24.0.0-rc1
|
||
- .pick_status.json: Update to 4fe5f06d400a7310ffc280761c27b036aec86646
|
||
- .pick_status.json: Mark 0557f0d59c5b22a8a934900ddc91f7a6057e146f as denominated
|
||
- ci: make sure we evaluate the python-test rules first
|
||
- .pick_status.json: Update to ff84aef116f9d0d13440fd13edf2ac0b69a8c132
|
||
- .pick_status.json: Update to 10e2dbb63b9d1f8f35c4fc3f570cd19b3fc03b43
|
||
- ci: fix job dependency error in MRs for bin/ci/* scripts
|
||
- VERSION: bump for 24.0.0-rc2
|
||
- ci/deqp: ensure that in \`default` builds, wayland + x11 + xcb are all built
|
||
- .pick_status.json: Update to d2b08f9437f692f6ff4be2512967973f18796cb2
|
||
- .pick_status.json: Update to d0a3bac163ca803eda03feb3afea80e516568caf
|
||
- .pick_status.json: Update to 90939e93f6657e1334a9c5edd05e80344b17ff66
|
||
- .pick_status.json: Update to eca4f0f632b1e3e6e24bd12ee5f00522eb7d0fdb
|
||
- VERSION: bump for 24.0.0-rc3
|
||
- .pick_status.json: Update to b75ee1a0670a3207dfd99917e4f47d064a44197f
|
||
- .pick_status.json: Update to 4cd5b2b5426e8d670fc3657eee040a79e3f9df1e
|
||
- util: rename __check_suid() to __normal_user()
|
||
- tree-wide: use __normal_user() everywhere instead of writing the check manually
|
||
- util: simplify logic in __normal_user()
|
||
- util: check for setgid() as well in __normal_user()
|
||
|
||
Eric R. Smith (1):
|
||
|
||
- panfrost: fix panfrost drm-shim
|
||
|
||
Erico Nunes (6):
|
||
|
||
- v3dv: Rework to remove drm authentication for wsi
|
||
- lima/ci: update piglit ci expectations
|
||
- Revert "ci: disable lima farm as it appears to be down"
|
||
- panvk: Support modifiers for Wayland WSI
|
||
- ci: lima farm is down
|
||
- Revert "ci: lima farm is down"
|
||
|
||
Erik Faye-Lund (34):
|
||
|
||
- docs: prepare for hawkmoth
|
||
- docs: remove breathe/doxygen stuff
|
||
- docs: improve readability of c-signatures
|
||
- util: remove unused lut
|
||
- panfrost: allow packing formats outside of pan_format.c
|
||
- panfrost: bypass format-table for null-textures
|
||
- panfrost: pass blendable formats to pan_pack_color
|
||
- panfrost: store blendable_formats in panfrost_device
|
||
- panfrost: look at correct blendable format version
|
||
- panfrost: use perf_debug instead of open-coding
|
||
- mesa/ffvs: use unreachable instead of assert
|
||
- docs: apply permanent redirect
|
||
- panfrost: do not open-code panfrost_has_fragment_job()
|
||
- ci: opt-out panfrost from clang-format
|
||
- panfrost: minify dimensions when converting modifiers
|
||
- util/format: document NONE swizzle
|
||
- lavapipe: do not use NONE-swizzle
|
||
- panfrost: do not handle NONE-swizzle
|
||
- d3d12: do not handle PIPE_SWIZZLE_NONE from sampler-view
|
||
- zink: do not handle PIPE_SWIZZLE_NONE
|
||
- meson: work around meson 0.62 issue
|
||
- mesa/main: remove unused Log2 variants of width/height/depth
|
||
- mesa/main: remove unused ClassID
|
||
- mesa/main: use _mesa_is_zero_size_texture-helper
|
||
- mesa/main: remove unused function
|
||
- mesa/st: use _mesa_is_zero_size_texture-helper
|
||
- zink: update profile schema
|
||
- zink: use KHR version of maint5 features
|
||
- panfrost: document ci failure
|
||
- mesa/st: do not require render-target support for texture-only exts
|
||
- mesa/st: do not check for emulated format
|
||
- mesa: actually check for EXT_color_buffer_float support
|
||
- mesa/main: require EXT_color_buffer_float for ES 3.2
|
||
- mesa: check for float-format support
|
||
|
||
Etaash Mathamsetty (1):
|
||
|
||
- driconf: add a workaround for Rainbow Six Siege
|
||
|
||
Faith Ekstrand (663):
|
||
|
||
- nir: Add a lower_first_invocation_to_ballot option to lower_subgroups
|
||
- nir: Add a lower_read_first_invocation option to lower_subgroups
|
||
- nir/lower_bit_size: Fix subgroup lowering for floats
|
||
- nir/lower_bit_size: Handle vote_feq/ieq separately
|
||
- nir/lower_bit_size: Use u_intN_min/max()
|
||
- nir: Split nir_lower_subgroup_options::lower_vote_eq into two bits
|
||
- nir: Return b2b ops from nir_type_conversion_op()
|
||
- nir/lower_bit_size: Use b2b for boolean subgroup ops
|
||
- nir: add deref follower builder for casts.
|
||
- nir: Handle wildcards with casts in copy_prop_vars
|
||
- nir: Use nir_builder to insert movs
|
||
- nir: Add asserts to nir_phi_builder_value_set_block_def
|
||
- vc4: Stop assuming glsl_get_length() returns 0 for vectors
|
||
- v3d: Stop assuming glsl_get_length() returns 0 for vectors
|
||
- nir/lower_io_to_vector: Only call glsl_get_length() on arrays
|
||
- nir/types: Support vectors in glsl_get_length()
|
||
- nir: Handle array-deref-of-vec in vars_to_ssa
|
||
- nir: Handle array-deref-of-vec in var split passes
|
||
- nir/validate: Allow array derefs on vectors on function/shader_temp
|
||
- nvk: Force all mappable BOs into GART pre-Maxwell
|
||
- nvk: Fix nvk_heap_free() for contiguous heaps
|
||
- nvk: Drop a bogus assert
|
||
- nvk: Assert no storage images on Kepler
|
||
- nir: Optimize boolean ieq/ine with an immediate
|
||
- nouveau: Add initial headers and meson for the new compoiler
|
||
- nak: Copy the optimization loop from Intel
|
||
- nak: Add a bunch of shader lowering code in NIR
|
||
- nak: Add initial stubs for rust code
|
||
- nvk: Run shaders through NAK
|
||
- nak: Add the core IR
|
||
- nak: Add Rust bindings for NIR
|
||
- nak: Add initial translation from NIR
|
||
- nak: Add a copy-prop pass
|
||
- nak: Add a dead-code pass
|
||
- nak: Add a util library
|
||
- nak: Add a trivial register allocator
|
||
- nak: Add a lowering pass for VEC and SPLIT instructions
|
||
- nak: Add a lowering pass for ZERO sources and destinations
|
||
- nak: Add bitset infrastructure
|
||
- nak: Add encoding for a few instructions
|
||
- nak: Encode program headers
|
||
- nak: Header stuff
|
||
- nak: Lower system values to a new load_sysval_nak intrinsic
|
||
- nak: Implement load_sysval_nv as S2R
|
||
- nak: Implement load_ubo
|
||
- nak: Implement load/store_global
|
||
- nak: Zero out the .w component of descriptors
|
||
- nak: Add an instruction fuzzing tool
|
||
- nak: Implement iadd and ishl
|
||
- nak: Add a pass for computing instruction dependencies
|
||
- nak: Implement 32-bit logic ops
|
||
- nak: Add support for instruction predicates
|
||
- nak: Implement integer comparisons
|
||
- nak: Implement bcsel
|
||
- nak: Rework ALU instruction encode
|
||
- nak/meson: Use bindgen dependencies
|
||
- nak: Add nak_compiler_create/destroy
|
||
- nvk: Pass an actual nak_compiler to nak_compile_shader()
|
||
- nak: Plumb the SM through to nak::Shader
|
||
- nak: Encode load/store correctly on SM80
|
||
- nak: Rework instruction encoding
|
||
- nak: Implement boolean logic ops
|
||
- nak: Lower 8 and 16-bit types
|
||
- HACK: Support old meson
|
||
- nak: Use Instr::num_srcs/dsts() less
|
||
- nak: Get rid of meta instructions
|
||
- meson: Pull in syn from crates.io
|
||
- nak: Add SrcAsSlice and DstAsSlice traits
|
||
- nak: Add a SrcModsAsSlice trait
|
||
- nak: Use a different inner struct type for each opcode
|
||
- nak: Use Src::Zero for load_const(0)
|
||
- nak: Handle zeroes at emit time
|
||
- nak: Implement i2f
|
||
- nak: Implement fadd
|
||
- nak: Rework integer compare ops
|
||
- nak: Implement float comparisons
|
||
- nak: Implement nir_op_b2f32
|
||
- nak: Implement unary float and integer ops
|
||
- nak: Allow iadd3 to take an immediate in srcs[2]
|
||
- nak: Implement fsign
|
||
- nak: Rework ALUSrc in emit code
|
||
- nak: Rework source modifiers
|
||
- nak: One of the predicates in IADD3 is a destination
|
||
- nak: Implement Display for SSAValue
|
||
- nak: Make Dst its own type
|
||
- nak: Add modifier propagation
|
||
- nak: Implement basic control-flow
|
||
- nak: Move nak_compiler to nak_private.h
|
||
- nak: Add a nir_shader_compiler_options to nak_compiler
|
||
- nvk: Pull the NIR options from NAK
|
||
- nak: Implement b2i32
|
||
- nak: Implement iadd64
|
||
- nak: Implement phis
|
||
- nak: Add a union-find implementation
|
||
- nak: Lower global access to scalars as needed
|
||
- nak: Print names of missing instructions
|
||
- nak: Implement unpack_64_2x32_split_*
|
||
- WIP: nak: Rework the barrier assignment pass
|
||
- nak: Add an SSAValueAllocator struct
|
||
- nak: Pass an SSAValueAllocator through to map methods
|
||
- nak: Handle fadd funnyness in the emit code
|
||
- WIP: nak: Add a legalization pass
|
||
- nak: Rename Imm to Imm32
|
||
- nak: Add separate True and False source types
|
||
- nak: Handle phis with non-SSA sources
|
||
- nak: Support both destinations in PLOP3
|
||
- nak: Drop the special cases for single-component vec/split
|
||
- nak: Don't emit MOVs for overlapping vec and split src/dst
|
||
- HACK: nak: Lower iadd64 again
|
||
- nak: Add a parallel copy in struction with lowering
|
||
- nak: Use OpParCopy for OpVec and OpSplit lowering
|
||
- nak: Get rid of the BitSet and BitSetMut traits
|
||
- nak: Rename BitSetView to BitView
|
||
- nak: Add a BitSet struct
|
||
- nak: Add an SSAComp struct
|
||
- nak: Rework dead-code
|
||
- nak: Rework phis
|
||
- nak: Add a space to the end of vec and split arg lists
|
||
- nak: Add a liveness analysis pass
|
||
- nak: Add a non-trivial register allocator
|
||
- nak: Improve the dependency tracker
|
||
- nak: Handle token re-use in dep tracking
|
||
- nak: Implement nir_op_i(eq|ne) for booleans
|
||
- nak: Fold [P]Lop3 sources
|
||
- nak: Predicates default to true
|
||
- nak: Implement nir_op_[iu](min|max)
|
||
- nak: Implement nir_op_fmul
|
||
- nak: Implement nir_op_(fmin|fmax)
|
||
- nak: Implement nir_op_u2f
|
||
- nak: Implement nir_op_vecN
|
||
- nak: Implement MuFu and a bunch of float unops
|
||
- nak: Move nak_sysval_attr_addr/sysval_idx higher in the file
|
||
- nak: Implement input interpolation
|
||
- nak: Handle multiple vector destinations in RA
|
||
- nak: Use immediage offsets for load/store_global
|
||
- nak: Implement OpFSOut with an OpParCopy
|
||
- nak: Implement f2[iu]32
|
||
- nak: Wire up ffma
|
||
- nak: Add more legalization
|
||
- nak: Implement right-shifts
|
||
- nak: Implement nir_op_[iu]mul[_high]
|
||
- nak: Enable nir_lower_idiv
|
||
- nak: Add a NIR texture lowering pass
|
||
- nak: Use more core NIR texture lowering
|
||
- nak: Wire up texture ops
|
||
- nak: Simplify the FromVariants proc macro
|
||
- nak: Simplify the (Srcs|Dsts)AsSlice proc macro
|
||
- HACK: spirv: Add a MESA_SPIRV_DUMP_PATH environment variable
|
||
- nak: Add a NAK_DEBUG environment variable
|
||
- nvk: Drop printing of NAK shaders
|
||
- nvk: Pass NAK flags through to shader cache UUIDs
|
||
- nak: Add a debug flag to assign worst-case instruction deps
|
||
- nak: Rework vector handling
|
||
- nak: Legalize vector sources
|
||
- nak: Add a use tracker to RA
|
||
- nak: Much more believable try_find_unused_reg_range()
|
||
- nak: Implement nir_op[iu]mul_2x32_64
|
||
- Revert "HACK: nak: Lower iadd64 again"
|
||
- nak: Implement nir_op_ixor
|
||
- nak: Implement undef instructions
|
||
- nak: Implement image load/store
|
||
- nak: Wire up OpLd and OpSt for local and shared
|
||
- nak: Implement nir_intrinsic_load/store_scratch
|
||
- nak: Add a smarter new_lop2 helper
|
||
- nak: Improve RA failure messages
|
||
- nak: Legalize OpShf
|
||
- nak: Only put actually live SSA values in the ra.live_in sets
|
||
- nak: Legalize more stuff
|
||
- nak/nir: Lower image size and samples to txq
|
||
- nak: Improve [FI]SETP encoding
|
||
- nak: Legalize Op[FI]Setp
|
||
- nak: Don't allow r255 in texture or surface ops
|
||
- nak: sin() and cos() require we divide by 2pi
|
||
- nak: Add F2F and implement fquantize16
|
||
- nak: Implement barriers
|
||
- nvk: Plumb num_barriers through from NAK
|
||
- nak: Implement load/store_shared
|
||
- nak: Integers don't have abs() source modifiers
|
||
- nak: Add a mechanism for decorating sources with types
|
||
- nak: Decorate sources with types
|
||
- nak: Only divide FS inputs by .w for smooth interpolation
|
||
- nak: Rework source modifiers a bit
|
||
- nak: Add a Src::supports_src_type() helper
|
||
- nak: Rework copy-prop to use soruce type decorations
|
||
- nak: Implement nir_intrinsic_global_atomic_*
|
||
- nak: Implement nir_intrinsic_shared_atomic_*
|
||
- nak: Implement global/shared_atomic_comp_swap
|
||
- nak: Implement image atomics
|
||
- nak: Fix the 2nd predicate on LOP3
|
||
- nak: Optimize OpLop3 and OpPLop3
|
||
- nak: DCE things with constant false predicates
|
||
- nak: Rework source modifiers instructions a bit
|
||
- nak: Fold fsat into FAdd/FFma/FMul
|
||
- nak: Delete unused imports and dead code
|
||
- nak: Add accum predicates to Op[FI]Setp
|
||
- nak: Add a Pred struct move the enum to PredRef
|
||
- nak: Fix multisampled textureing
|
||
- nak: Legalize everything
|
||
- nak: Rework cbufs a bit
|
||
- nak: Implement indirect UBO loads
|
||
- nak: Implement nir_op_b2b1 and nir_op_b2b32
|
||
- nak: Follow memcpy semantics with OpParCopy
|
||
- nak: Work in terms of bits for type sizes
|
||
- nak: Add a builder
|
||
- nak: Use the builder in some lowering passes
|
||
- nak: Compute liveness in reverse block order
|
||
- nak: Rework liveness to add next-use information
|
||
- nak: Add a PerRegFile helper struct
|
||
- nak: Record register pressure in liveness
|
||
- nak: Initialize RA with only live registers
|
||
- nak: Use num_regs instead of max_reg in RA
|
||
- nak: Use pcopy.push() in RA
|
||
- nak: Rework RA a bit
|
||
- nak: Add some documentation for SSA values
|
||
- nak: Print to stderr
|
||
- nak/ra: Pass a PerRegFile num_regs into the allocator
|
||
- nak: Allocate the minimum number of GPRs.
|
||
- nak: Separate the CFG from liveness
|
||
- nak: Break guts of liveness into traits
|
||
- nak: Require Rust 1.70.0
|
||
- nak: Handle dead destinations in RA
|
||
- nak: Make calc_max_live a function of the Liveness trait
|
||
- nak: Bring back bitset-based liveness
|
||
- nak: Add mum_gprs and tls_size to Shader
|
||
- nak: Accurately set num_gprs
|
||
- nak: Add a RegFileSet struct
|
||
- nak: Add more SSA iterator options
|
||
- nak: Add a new VecPair type
|
||
- nak/nir: Add more helpers
|
||
- nak: Emit if branches in the predecessor block
|
||
- nak: Add a more awesome CFG data structure
|
||
- nak: Store the blocks in the CFG
|
||
- nak: Base liveness on CFG indices
|
||
- nak: Add loop detection to the CFG
|
||
- nak: Add a phi allocator
|
||
- nak: Refactor nak_assign_regs a bit
|
||
- nak: Use u32 for register indices
|
||
- nak: Rework map_instrs()
|
||
- nak: Add a new OpCopy instruction for parallel copy lowering
|
||
- nak: Use the builder for the legalize pass
|
||
- nak: Use OpCopy in legalize
|
||
- nak: Use more OpCopy
|
||
- nak: Add a Mem register file
|
||
- nak: Handle RegFile::Mem in parallel copy lowering
|
||
- nak: Allow DCE on functions
|
||
- nak: Restructure liveness construction
|
||
- nak: Add interference helpers
|
||
- nak: Add a dominance check to CFG
|
||
- nak: Add helpers to BasicBlock to get phis
|
||
- nak: Add a to-CSSA pass
|
||
- nak: Add an SSA repair pass
|
||
- nak: Union find
|
||
- nak/ra: Drop the pointless AssignRegs struct
|
||
- nak/ra: Handle parallel copies as a special case
|
||
- nak/ra: Don't free killed for OpPhiSrcs
|
||
- nak: Expose LiveSet for incremental liveness tracking
|
||
- nak: Add a RegFileSet filter to NextUseLiveness::for_function()
|
||
- nak: Add more NextUseLiveness helpers
|
||
- nak: Add a spilling pass
|
||
- nak: Use the correct number of GPRs on Turing+
|
||
- nak: Spill registers before RA
|
||
- nak: Add a debug flag to test spilling
|
||
- nak: Implement shader clock
|
||
- nak/ra: Improve coalescing
|
||
- nak/spill: Tweak the construction of S sets
|
||
- nak: Document spilling and RA
|
||
- nak: Add an alloc_vec() to SSAValueAllocator
|
||
- nak: Move all the IADD3 insanity to a new OpIAdd3X opcode
|
||
- nak/legalize: Fix too many IADD3 source modifiers
|
||
- nak: Disable lower_image_size_to_txs for NAK
|
||
- nak: IMAD also has a destination predicate
|
||
- nak: Remap GLSL_SAMPLER_DIM_SUBPASS and SUBPASS_MS to 2D and MS
|
||
- nak: Fix instruction ordering in nak_ir.rs
|
||
- nak: Rename OpBFind to OpFlo
|
||
- nak: Implement Index[Mut] for RegTracker
|
||
- nak: Use the right number of predicates in RegTracker
|
||
- nak: Rework the barrier insert pass
|
||
- nak: Rework calc_delay.rs
|
||
- nak: Re-work Instr::get_latency()
|
||
- nak: Emit FS_OUT before EXIT
|
||
- nvk: Use sysvals for fragcoord etc. with NAK
|
||
- nak: Handle flat FS inputs
|
||
- nak: Add support for centroid and sample interp modes
|
||
- nak: Use load_interpolated_input for frag_coord
|
||
- nak: Properly handle OpFSOut in RA and liveness
|
||
- nak: Handle empty OpFSOut
|
||
- nak/nir: Several FS output fixes
|
||
- nak: Implement load_sample_id and load_sample_mask_in
|
||
- nak: Implement discard and demote
|
||
- nak: Set TLS size properly in the shader header
|
||
- nvk,nak: Plumb through the zs_self_dep key bit
|
||
- nak: Use count_attribute_slots for FS input var sizes
|
||
- nak: Pull sm, num_gprs, and tls_size into a ShaderInfo struct
|
||
- nak: Stash a ShaderInfo in ShaderFromNir
|
||
- nak: Rework FS outputs again
|
||
- nak: Re-plumb compute shader info
|
||
- nak: Plumb more FS info through to the C API
|
||
- nvk/nak: Translate our new FS flags from NAK to nvk_shader
|
||
- nak: Saturate depth writes
|
||
- nak: Add support for gl_FrontFace
|
||
- nak/nir: Fix helper invocations
|
||
- nak/nir: Use nir_shader_intrinsics_pass for FS inputs
|
||
- nak: Handle interpolate_at_offset
|
||
- nak: Take components into account in load_*input
|
||
- nak: Plumb uses_kill through from nak_from_nir
|
||
- nak/nir: Plumb the FS key into lower_fs_input_intrin
|
||
- nak/nir: Move frag_coord/sample_pos lowering to FS input lowering
|
||
- nak/nir: Fix sample vs. pixel input interpolation
|
||
- nak/nir: Add a load_frag_w helper
|
||
- nak/nir: Interpolate gl_PointCoord
|
||
- nak/nir: Return one sample for gl_SampleMaskIn[0] when sample shading
|
||
- nak: Fold source modifiers in legalize
|
||
- nak: Provide more detail when printing IR after passes
|
||
- nak: Handle modifiers in dedup_srcs() in opt_lop()
|
||
- nvk: Add a helper for lowering system values to root table loads
|
||
- nvk: Lower more draw system values
|
||
- nak: Take component into account in store_output
|
||
- nak: Fix printing of OpASt
|
||
- nak: Move NIR enum translation out of nak_sph.rs
|
||
- nak: rustfmt fixes
|
||
- nak: Simplify I/O gathering
|
||
- nvk: Set clip/cull_enable for NAK shaders
|
||
- nak: Run simple liveness data-flow bottom-up
|
||
- nak/bitset: Add a helper for modifying in-place
|
||
- nak: Don't allocate bitsets in liveness data-flow
|
||
- nak: Handle non-constant I/O offsets
|
||
- nouveau/parser: Dump SET_STREAM_OUT_CONTROL_* properly
|
||
- nak: Translate XFB info
|
||
- nvk: Plumb through XFB info from NAK
|
||
- nak: Add a Label struct for branch targets
|
||
- nak: Add OpNop which can have a label
|
||
- nak: Break indirect offset encoding into a helper
|
||
- nak: Allow encoding Dst::None
|
||
- nak: Add barrier instructions
|
||
- nak/builder: Return the instruction from push_*()
|
||
- nak: Implement NIR control barriers
|
||
- nak: Implement From for SrcRef for more types
|
||
- nak: Add enums for sysvals and attributes
|
||
- nak: Plumb clip/cull enables through nak
|
||
- nak/nir: Lower tessellation and geometry I/O
|
||
- spirv: Fix locations for per-patch varyings
|
||
- nak: NVIDIA calls them tessellation init shaders
|
||
- nak: Rework OpALd and OpASt a bit
|
||
- nak: Set per patch attribute count both places in the SPH
|
||
- nak: Handle location_frac for FS outputs in nak_from_nir.rs
|
||
- nak: Add lowering for per-vertex I/O
|
||
- nak: Implement more attribute I/O
|
||
- nak/nir: Lower load_primitive_id
|
||
- nak,nvk: Plumb through tessellation info
|
||
- nak: Implement load_tess_coord
|
||
- nak: Fix lowering for patch_vertices_in
|
||
- HACK: Only emit OpBar in compute shaders
|
||
- nak/nir: Use count_vec4_slots instead of count_attribute_slots
|
||
- nak: Add NIR lowering for attribute I/O
|
||
- nak/nir: Lower systm values before lowering I/O
|
||
- nak: Use nak_nir_lower_vtg_io
|
||
- nak: Fix a bunch of warnings
|
||
- nak: Fix opt_out
|
||
- nak/bitset: Improve set_words()
|
||
- nak/bitset: Add an is_empty() helepr
|
||
- nak/bitset: Fix next_set()
|
||
- nak/sph: Round tls_size up to a multiple of 16
|
||
- nak: Fix repair_ssa() for back-edges
|
||
- nak: Fix parallel copy handling in spilling
|
||
- nak: Fix to_cssa()
|
||
- nak/nir: Don't lower 1-bit phis
|
||
- nak: Support encoding -Zero
|
||
- nak: Fix fneg to do fadd(-0, x)
|
||
- nak: Rename lower_vec_split() to lower_ineg()
|
||
- nak: Use Src::From<u32> and Src::From<bool>
|
||
- nak: A quick rustfmt fix
|
||
- nak: Upgrade to more modern meson
|
||
- nak: Add some #[allow(dead_code)]
|
||
- nak: Drop some unused helpers
|
||
- nak: Get rid of dead code warnings in RegFileSet
|
||
- nak: Get rid of warnings in nak_sph.rs
|
||
- nak: Drop the final calc_max_live() after GPR spilling
|
||
- nak: Don't print a range for one register
|
||
- nir: Add nvidia barrier intrinsics
|
||
- nak/nir: Add a pass for adding convergence barriers
|
||
- nak: Add OpBreak
|
||
- nak: Handle control-flow barriers
|
||
- nak: Use barriers for re-convergence
|
||
- nak: Remove unnecessary control barriers
|
||
- nak: Call nir_lower_subgroups()
|
||
- nak: Use nir_shader_intrinsics_pass for system values
|
||
- nak: Lower subgroup_id and num_subgroups
|
||
- nak/nir: Allow boolean vote_ieq
|
||
- nak/nir: Zero-pad subgroup masks
|
||
- nak: Implement vote and ballot
|
||
- nak: Fix the encoding of OpShfl
|
||
- nak: Implement read_invocation and shuffle_*
|
||
- nak: Allow 1-component image load/store
|
||
- nak: Emit CCtl in barriers with acq/rel semantics
|
||
- nak: Use strong ordering for Image load/store
|
||
- nak: Use the simplified BAR.SYNC encoding
|
||
- nak: Emit MemBar before Bar
|
||
- nak: Insert an OpNop after OpBar
|
||
- nak: Document a bit in encode_lds()
|
||
- nvk: Enable subgroups features
|
||
- nak: Rely on Rust 1.73 for next_multiple_of() and div_ceil()
|
||
- nak: Require meson 1.3.0 and clean up a couple bits
|
||
- meson: Set build.rust_std
|
||
- ci: Bump container images for NAK dependencies
|
||
- ci: Add syn to --force-fallback-for
|
||
- ci: Update the python env for ci_run_n_monitor.py
|
||
- nvk: Default to NAK on Turing+
|
||
- nvk: Stop asserting 11-bit storage image handles
|
||
- nvk: Free NAK shaders
|
||
- nak: Fix copy-prop for OpPLop3 sources
|
||
- nak: Drop OpAtomCas in favor of OpAtom with atom_op == CmpExch
|
||
- nak: Make ALD/AST.PHYS a boolean
|
||
- nak: Make encode_sm75 a method of Shader
|
||
- nak: Plumb the nak_compiler through to lower_fs_input_intrin
|
||
- nak: Rework FS input interpolation
|
||
- nvk: Only advertise VK_KHR_shader_terminate_invocation if using NAK
|
||
- nvk: Handle load_first_vertex in nvk_nir_lower_descriptors()
|
||
- nak/nir: Lower indirect FS inputs
|
||
- nvk: Only lower outputs to temporaries
|
||
- nvk: Add a codegen helper for nir_shader_compiler_options
|
||
- nvk: Move a bunch of codegen-specific lowering to helpers
|
||
- nvk: Move the optimization loop to the nvk_codegen.c
|
||
- nvk: Move the guts of nvk_compile_nir() to nvk_codegen.c
|
||
- nvk: Move even more lowering into nvk_codegen.c
|
||
- nvk: Use nak_fs_key instead of rolling our own
|
||
- nak: Rename TLS to SLM
|
||
- nak: Properly prefix nak_xfb_info
|
||
- nak: Move clip, cull, and XFB into a nak_shader_info.vtg
|
||
- nak: Add a writes_layer bit to nak_shader_info::vtg
|
||
- nak: Handle the num_gpr offsetting inside nak
|
||
- nvk: Use nak_shader_info natively
|
||
- nak: Enable SM70 for Volta
|
||
- nak: Stop passing undefs to ipa_nv
|
||
- nak: Support dumping shader assembly as part of compile
|
||
- nvk: Don't set pipeline->base.type manually
|
||
- nvk: Implement VK_KHR_pipeline_executable_properties
|
||
- nvk: Drop nouveau_ws_bo_new_tiled()
|
||
- nvk: Rework error handling in nouveau_ws_bo_new() and from_dma_buf()
|
||
- nvk: Handle VMA allocation failure
|
||
- nvk: Add a separate VMA heap for BDA capture/replay
|
||
- nvk: Implement bufferDeviceAddressCaptureReplay
|
||
- nvk: Advertise VK_KHR_synchronization2
|
||
- nvk: Set the right API version in the ICD json files
|
||
- nak: Add the predicate destination to OpShfl
|
||
- nak: Add builder helpers for a few ops
|
||
- nak: Use c == 0x0 for shuffle_up
|
||
- nak: Lower scan/reduce in NIR
|
||
- nak: Implement quad ops
|
||
- nvk: Advertise the rest of the subgroup ops
|
||
- nak: Rework reg and SSA value printing
|
||
- nak: Make most Display stuff lower-case
|
||
- nak: Rework opcode printing to use a new trait
|
||
- nak: Implement DisplayOp on Op instead of Display
|
||
- nak: Default InstrDeps::delay to 0
|
||
- nak: Only write deps.delay when set
|
||
- nak: Align instructions when printing
|
||
- nak: Display memory access bits with the "." prefix
|
||
- nak: Make MemAddrType a part of MemSpace
|
||
- nak: Display memory type at the end for load/store ops
|
||
- nak: Rework printing of texture and image dims
|
||
- nak: Two more print fixes
|
||
- nak: gl_FragCoord and gl_PointCoord are screen-space interpolated
|
||
- nvk/codegen: Fragment shader builtins are noperspective
|
||
- nvk: Wire up MESA_VK_VERSION_OVERRIDE
|
||
- nvk: Limit shader stages to supported stages
|
||
- nak: Run rustfmt
|
||
- nak: Only insert barriers around ifs if they actually re-converge
|
||
- vulkan: Default override patch version to VK_HEADER_VERSION
|
||
- nvk: Advertise Vulkan 1.1 on Turing+
|
||
- nak: Drop the PrmtSelection stuff
|
||
- nak: Add a builder helper for OpPrmt
|
||
- nak: Rework OpPrmt a bit
|
||
- nak: Implement nir_op_extract_*
|
||
- nak: Fix int8/16 lowering
|
||
- nak: Add base support for 8 and 16-bit types
|
||
- nak: Implement more int/float conversions
|
||
- nak: Implement integer conversions
|
||
- nak: Handle non-DW-aligned UBO loads
|
||
- nvk: Enable 8 and 16-bit integer types
|
||
- nak: Implement scan/reduce on booleans
|
||
- nak/nir: Handle CBuf alignment rules
|
||
- nak: Revert "nak: Handle non-DW-aligned UBO loads"
|
||
- nvk: Use the copy engine for CmdFillBuffer
|
||
- nvk: Use the copy engine for NVK_DEBUG=zero_memory
|
||
- nvk: Stop initializing the 2D engine
|
||
- vulkan: Move vk_synchronization2 to vk_synchronization
|
||
- vulkan: Add some auto-generated synchronization helpers
|
||
- vulkan: Add helpers for pipeline stage flags
|
||
- vulkan: Add helpers for access flags
|
||
- nvk: Move Begin/EndTransformFeedback to nvk_cmd_draw.c
|
||
- nvk: Rework transform feedback stalling
|
||
- nvk: Implement vkCmdPipelineBarrier2 for real
|
||
- nvk: Drop unnecessary per-draw/dispatch cache maintenance
|
||
- nvk: Drop MME_DMA_SYSMEMBAR before indirect draw/dispatch
|
||
- nak: Drop a bunch of SET_REFERENCE from the pre-Turing paths
|
||
- nvk: Advertise VK_EXT_subgroup_size_control
|
||
- nil: Add support for filling out linear texture headers
|
||
- nouveau: Rename nvidia-headers to headers
|
||
- nouveau: Move headers/classes to headers/nvidia/classes
|
||
- nak: Run rustfmt again
|
||
- nak: Fix integer roll-over when we have a u64vec4
|
||
- nak: Set .64/.32 on CSSR as needed
|
||
- nak/nir: Don't use nir_lower_bit_size on 64-bit values
|
||
- nak: Implement 64-bit ineg
|
||
- nak: Natively implement 64-bit shifts
|
||
- nak: Lower isign in NIR
|
||
- nak: Rework printing of comparisons
|
||
- nak: Implement 64-bit comparisons
|
||
- nak: Don't ask NIR to lower [iu]mul64_2x32
|
||
- nak: Use the right source types for I2F, F2I, and F2F
|
||
- nak: Fix encoding of 64-bit F2I, I2F, and F2F
|
||
- nak: Implement b2i64
|
||
- nak/nir: Don't lower 64-bit conversions
|
||
- nvk: Advertise shaderInt64
|
||
- nvk: Advertise VK_EXT_shader_subgroup_ballot/vote
|
||
- nak/nir: Handle non-32-bit data in lower_scan_reduce
|
||
- nvk: Advertise KHR_shader_subgroup_extended_types
|
||
- nvk: Advertise VK_KHR_shader_atomic_int64
|
||
- nak/nir: Trim image load/stores based on format
|
||
- nak: Lower 64-bit image load/store
|
||
- nak: Handle 64-bit image atomics
|
||
- nil: Add R64_SINT and R64_UINT formats
|
||
- nvk: Don't disable non-texturable formats
|
||
- nvk: Implement VK_EXT_shader_image_atomic_int64
|
||
- nak: Simplify Src::is_predicate()
|
||
- nak: Replace OpBMov with OpBClear
|
||
- nak: Fix scheduling for control barriers
|
||
- nak: Add a barrier register file
|
||
- nak: Add back OpBMov with better semantics
|
||
- nak: Add support for spilling barriers
|
||
- nak: Take num_barriers from RA
|
||
- nak: Make barriers SSA-friendly
|
||
- nak: Force RA to allocate bar_in/out to the same register
|
||
- nak: Add a barrier propagation pass
|
||
- dxil: Use mesa_prim consistently
|
||
- glsl: Properly remap GL_* to MESA_PRIM
|
||
- intel/vec4: Use MESA_PRIM_* instead of GL_*
|
||
- nir: Return a mesa_prim from gs_in_prim_for_topology
|
||
- compiler: Fix a comment
|
||
- radeonsi: Drop an unnecessary cast
|
||
- nvk: Advertise VK_EXT_scalar_block_layout
|
||
- nak: Advertise subgroupBroadcastDynamicId
|
||
- nak: Add a B32 source type
|
||
- nak: Rework the OpIAdd3/OpIAdd3X split
|
||
- nak/legalize: Handle the src0/1 source mod condition for OpIAdd3X
|
||
- nak: Legalize immediates with source modifiers
|
||
- nak: Implement uadd_sat
|
||
- nak: Implement usub_sat
|
||
- nvk: Implement VK_EXT_texel_buffer_alignment
|
||
- spirv: Plumb variable alignments through to NIR
|
||
- nir: Respect variable alignments in lower_vars_to_explicit_types
|
||
- nak: rustfmt
|
||
- nak: Restructure for better module separation
|
||
- ci: Also rustfmt binaries
|
||
- nir: Split has_[su]dot_4x8 bits into regular and _sat versions
|
||
- nir: Lower [su]dot_4x8_[ui]add_sat to [su]dot_4x8_[ui]add
|
||
- microsoft: Stop claiming dot_4x8_sat support
|
||
- nak: Rework printing of int/float types and rounding modes
|
||
- nak: Wire up DP4
|
||
- nvk: Advertise KHR_shader_integer_dot_product
|
||
- nak: Split legalize into per-SM functions
|
||
- nak: Initial WIP SM50 backend
|
||
- nak: Rework set_src_imm20 in nak_encode_sm50
|
||
- nak: Rewrite SM50 encode_fadd to not use encode_alu
|
||
- nak: Rename LogicOp to LogicOp3
|
||
- nak: Use OpLop2 and OpPSetP pre-SM70
|
||
- nak: Rework the SM50 encoding of isetp
|
||
- nak: Add SM50 encodings for ALD and AST
|
||
- nak: Only split texture destinations on Volta+
|
||
- nak: Rework nvfuzz for SM50
|
||
- nak/nv50: Rewrite the encoding of OpShf
|
||
- nak/sm50: Wire up tex ops
|
||
- nak: Rewrite the SM50 encoding of OpF2I
|
||
- nak/sm50: Rewrite the encoding for OpIMnMx
|
||
- nak: Implement FS input interpolation on SM50
|
||
- nak/sm50: Rewrite the encoding for OpMov
|
||
- nak: Drop the SM50 encoding of BREV
|
||
- nak/sm50: Add better helpers for encoding sources with modifiers
|
||
- nak/sm50: Stop using ALUSrc for IADD2
|
||
- nak/sm50: Drop src_mod_has* in favor of core helpers
|
||
- nak: Clean up compiler warnings
|
||
- nak: Add barriers on Volta
|
||
- nak/nvfuzz: Add an SM parameter
|
||
- nak: Drop the fmnmx from Builder
|
||
- nak: Add an ftz bit to a bunch of float ops
|
||
- nak: Plumb through float controls
|
||
- nvk: Advertise VK_KHR_shader_float_controls
|
||
- nak: Plumb through float controls for fset[p]
|
||
- nak: Plumb through float controls for frnd[p]
|
||
- nak: Add dnz bits to OpFMul and OpFFma
|
||
- nak: Audit remaining FTZ/DNZ bits on sm70+
|
||
- nak: Audit sm50 for FTZ/DNZ bits
|
||
- nak: Clean up instruction printing a bit
|
||
- nak: Rework barrier handling a bit
|
||
- nvk: Make NVK_DEBUG=push an alias for push_dump
|
||
- nvk: s/device/dev in nvk_descriptor_set_layout.c
|
||
- nvk: Plumb a physical device into descriptor_stride_align_for_type
|
||
- nvk: Add a nvk_min_cbuf_alignment() helper and use it
|
||
- nvk: Add an NVK_MIN_TEXEL_BUFFER_ALIGNMENT #define
|
||
- nak: Reduce minStorageBufferAlignment
|
||
- nvk: Simplify alignment limit plumbing
|
||
- nvk: CBuf alignment reduces to 64B on Turing
|
||
- nvk: Throw Tegra behind NVK_I_WANT_A_BROKEN_VULKAN_DRIVER
|
||
- nvk: Rework the way we set up memory heaps/types
|
||
- nir: Add a new has_fmulz_no_denorms flag
|
||
- nak: Set .ftz on f32 ops by default
|
||
- nak: Implement fmulz and ffmaz
|
||
- nvk: Enable NAK by default for Volta
|
||
- nak: Don't set both FTZ and DNZ at the same time
|
||
- nvk: Implement VK_EXT_multi_draw
|
||
- nak: Add a delay of 2 cycles for barriers
|
||
- nak: Rework the dependency pass
|
||
- nak: Handle negative cbuf offset immediates
|
||
- nak/sm50: Fix immediate encodings
|
||
- nak/sm50: Fix legalization of OpIAdd
|
||
- nak/sm50: Add legalization and encoding for OpLdc
|
||
- nvk/nir: Add cbuf analysis to nvi_nir_lower_descriptors()
|
||
- nvk/nir: Lower UBO loads to load_ubo when we have a cbuf
|
||
- nvk: Add a cbuf_bind_map to nvk_shader
|
||
- nvk: Stash descriptor set sizes
|
||
- nvk: Rework push_indirect to take an address
|
||
- nvk: Set MME_DATA_FIFO_CONFIG on device init
|
||
- nvk: Don't flush descriptors in BeginConditionalRendering
|
||
- nvk: Upload cbufs based on the cbuf_map
|
||
- nvk: Add debug flags to the physical device
|
||
- nvk: Enable cbufs
|
||
- nvk: Use ENUM_PACKED for enums instead of PACKED
|
||
- nir: Scalarize bounds checked loads and stores
|
||
- nak: Switch to //-style comments
|
||
- nak: Plumb shader model into instruction latency queries
|
||
- nak: Handle minimum execution latencies in the dep tracker
|
||
- nvk: Advertise VK_KHR_vulkan_memory_model
|
||
- nvk: Use render->color_att_count for color write enables
|
||
- nvk: Support extendedDynamicState3ColorWriteMask
|
||
- nak: Move the copy detection part of opt_copy_prop to a helper
|
||
- nak: Fix copy-prop for fp64
|
||
- nak: Copy propagate and constant fold OpPrmt
|
||
- nak: Make OpAtom::cmpr a GPR source
|
||
- nak: Pass SrcTypes around instead of RegFile in legalize
|
||
- nak/sm70: Allow src2 of 3src ops to be an immediate
|
||
- nak: OpDAdd doesn't have saturate
|
||
- nak: Rework encoding of ALU instructions on SM70+
|
||
- nak: Add the rest of the double-precision ops
|
||
- nak: Split fmul/ffma handling from fmulz/ffmaz
|
||
- nak: Wire up 64-bit nir_op_fadd/ffma/fmul and comparisons
|
||
- nak: Fix nir_op_f2f64
|
||
- nak: Implement b2f64
|
||
- nak/nir: Set nir_lower_io_lower_64bit_to_32 for varyings
|
||
- meson: Update our rust dependencies
|
||
- nak: Fix encoding of dsetp with RZ on SM70+
|
||
- nak: Implement 64-bit nir_op_fsign
|
||
- nak/sm50: Add encoding and legalization for dadd/dfma/dmul/dsetp
|
||
- nak/sm50: Fix encoding of f20 immediates
|
||
- nak/sm50: Fix encoding of iadd with imm32
|
||
- nak/sm50: Properly legalize OpSel and drop an assert
|
||
- nak/sm50: Add DMnMx and use it for fp64 fmin/fmax
|
||
- nir/lower_doubles: Add lowering for fmin/fmax/fsat
|
||
- nak/nir: Lower a bunch of fp64
|
||
- nvk: Advertise shaderFloat64
|
||
- nvk: Free shaders created by codegen
|
||
- nvk: Unref shaders on pipeline free
|
||
- nvk: Don't exnore ExternalImageFormatInfo
|
||
- nak: Fix TCS output reads
|
||
|
||
Felix DeGrood (3):
|
||
|
||
- anv: remove CS_FLUSH from query regression
|
||
- driconf: add Dying Light 2 to Intel XeSS workaround
|
||
- driconf: add Witcher3 to Intel XeSS workaround
|
||
|
||
Felix bridault (1):
|
||
|
||
- radv: use 32bit va range for sparse descriptor buffers
|
||
|
||
Florian Weimer (1):
|
||
|
||
- meson: C type error in strtod_l/strtof_l probe
|
||
|
||
Francisco Jerez (70):
|
||
|
||
- intel/l3/gfx11+: Add tile cache partition to intel_l3_config struct.
|
||
- intel/l3: Define helper for obtaining the size of an L3 partition in KB.
|
||
- intel/l3: Set up L3FullWayAllocationEnable config if ALL partition has over 126 ways.
|
||
- intel/dg2: Import L3 cache configurations.
|
||
- intel/mtl: Import L3 cache configurations.
|
||
- intel/xehp+: Add TBIMR-related genxml definitions.
|
||
- intel/xehp+: Import algorithm for TBIMR tiling parameter calculation.
|
||
- intel/xehp+: Add dynamic state flags controlling whether TBIMR is enabled during 3D primitives.
|
||
- intel/xehp+: Define driconf option for selectively disabling TBIMR.
|
||
- iris/xehp: Implement TBIMR tile pass setup and pipeline bandwidth estimation.
|
||
- anv/xehp: Implement TBIMR tile pass setup and pipeline bandwidth estimation.
|
||
- anv/xehp+: Enable TBIMR in generated draw calls.
|
||
- intel/xehp: Adjust TBIMR performance chicken bits.
|
||
- intel/xehp+: Adjust TBIMR batch size based on slice count.
|
||
- intel/xehp+: Use TBIMR tile box check in order to avoid performance regressions.
|
||
- intel/xehp: Enable TBIMR by default.
|
||
- intel/eu/xe2+: Add support for 10-bit SWSB representation on Xe2+ platforms.
|
||
- intel/fs/xe2+: Add comment reminding us to take advantage of the 32 SBID tokens.
|
||
- intel/fs/xe2+: Teach SWSB pass about the behavior of double precision instructions.
|
||
- intel/fs/xe2+: Handle extended math instructions as in-order in SWSB pass.
|
||
- intel/eu/xe2+: Add definition for size of GRF space on Xe2.
|
||
- intel/fs/xe2+: Don't special case SEL_EXEC in inferred_exec_pipe().
|
||
- intel: Improve N-way pixel hashing computation to handle pixel pipes with asymmetric processing power.
|
||
- intel/compiler: Add max_polygons FS compilation parameter.
|
||
- intel/compiler: Add multipolygon dispatch fields to brw_wm_prog_data.
|
||
- intel/compiler: Add polygon count statistic to brw_compile_stats.
|
||
- intel/fs: Add separate constructor of fs_visitor for fragment shaders.
|
||
- intel/fs: Map all GS input attributes to ATTR register number 0.
|
||
- intel/fs: Map all VS input attributes to ATTR register number 0.
|
||
- intel/fs: Map all TES input attributes to ATTR register number 0.
|
||
- intel/fs: Assert fs_reg::nr is always zero for ATTR registers in geometry stages.
|
||
- intel/fs: Consider ATTR registers with different fs_reg::nr as belonging to disjoint register spaces.
|
||
- intel/fs: Provide component index explicitly to interp_reg().
|
||
- intel/fs: Pass builder to per_primitive_reg().
|
||
- intel/fs: Fix fs_reg::component_size() to handle two-dimensional register regions.
|
||
- intel/fs: Rework layout of FS vertex setup data in ATTR file to support multi-polygon dispatch.
|
||
- intel/fs: Don't copy-propagate ATTR registers in multi-polygon FS shaders when invalid.
|
||
- intel/compiler: Don't change types for copies from ATTR file.
|
||
- intel/fs/gfx12+: Don't set nir_divergence_single_prim_per_subgroup option for fragment shaders.
|
||
- intel/fs/gfx12: Don't consider multipolygon PS to have packed dispatch.
|
||
- intel/fs: No need to copy null destinations in lower_simd_width.
|
||
- intel/fs: Fix PS thread payload setup for depth_w_coef_reg.
|
||
- intel/fs/gfx12: Implement multi-polygon format of back/front-facing flag in PS payload.
|
||
- intel/fs/gfx12: Implement multi-polygon format of render target array index in PS payload.
|
||
- intel: Add debug flag for enabling dual-SIMD8 fragment shader dispatch.
|
||
- intel/compiler: Attempt to build dual-SIMD8 variant of fragment shaders on gfx12+ platforms.
|
||
- intel/genxml: Add 3DSTATE_PS definitions needed for dual-SIMD8 dispatch on Gfx12+.
|
||
- intel/gfx12: Enable SIMD8 dispatch in 3DSTATE_PS for FS multipolygon dispatch.
|
||
- iris/gfx12: Hook up dual-SIMD8 fragment shader dispatch.
|
||
- anv/gfx12: Hook up dual-SIMD8 fragment shader dispatch.
|
||
- intel/fs/xe2+: Stop building SIMD8 compute-like shaders (CS/BS/TS/MS).
|
||
- intel/fs/xe2+: Stop building SIMD8 fragment shaders.
|
||
- intel/fs/xe2+: Stop building SIMD8 shaders for geometry stages (VS/TCS/TES/GS).
|
||
- intel/eu/xe2+: Add helpers for constructing registers in 512b units.
|
||
- intel/fs/xe2+: Implement PS thread payload register offset setup.
|
||
- intel/fs/xe2+: Fix for new layout of X/Y pixel coordinates in PS payload.
|
||
- intel/fs/xe2+: Update uses of pixel/sample mask from PS thread payload.
|
||
- intel/fs/xe2+: Update location of sample ID fields in PS payload.
|
||
- intel/fs/xe2+: Update poly info PS payload for new multi-polygon dispatch format.
|
||
- intel/fs: Add support for vector payload values to fetch_payload_reg().
|
||
- intel/fs/xe2+: Enable new format of barycentrics in PS payload.
|
||
- intel/fs/xe2+: Update for new layout of vertex setup data in PS payload.
|
||
- intel/fs/xe2+: Implement support for multi-polygon vertex setup data in PS payload.
|
||
- intel/fs/xe2+: Implement layout of mesh shading per-primitive inputs in PS thread payloads.
|
||
- intel/fs: Plumb shader instead of compiler to get_lowered_simd_width() and friends.
|
||
- intel/fs/xe2+: Lower SIMD width of instructions that access ATTR file from SIMD2x8/4x8 FS.
|
||
- intel: Add debug flags for enabling Xe2+ multipolygon fragment shader dispatch modes.
|
||
- intel/fs/xe2+: Attempt to build quad-SIMD8 and dual-SIMD16 FS variants on Xe2+ platforms.
|
||
- intel/xe2+: Implement fragment shader dispatch state setup.
|
||
- intel/compiler/xe2: Don't disassemble non-existent fields.
|
||
|
||
Frank Binns (4):
|
||
|
||
- pvr: rename some more instances of 'reserved' to 'carveout' for consistency
|
||
- include/drm-uapi: add pvr_drm.h
|
||
- pvr: Add powervr winsys implementation
|
||
- pvr: alloc WSI memory via GPU when there isn't a valid display FD
|
||
|
||
Friedrich Vock (24):
|
||
|
||
- aco: Update printed block kinds
|
||
- vulkan: Don't use set_foreach_remove when destroying pipeline caches
|
||
- radv/ci: Update skips comments
|
||
- ac/gpu_info: Manually compute L3 size for Navi33
|
||
- radv: Enable compute dispatch tunneling
|
||
- radv,vtn,driconf: Add and use radv_rt_ssbo_non_uniform workaround for Crysis 2/3 Remastered
|
||
- radv/rt: Initialize unused children in PLOC early-exit
|
||
- radv/rt: bsearch inlined shaders
|
||
- radv/rt: Free traversal NIR after compilation
|
||
- radv,aco: Convert 1D ray launches to 2D
|
||
- radv/rt: Move per-geometry build info into a geometry_data struct
|
||
- radv/rt: Acceleration structure updates
|
||
- radv/rt: Add workaround to make leaves always active
|
||
- radv: Fix shader replay allocation condition
|
||
- nir: Make is_trivial_deref_cast public
|
||
- nir: Handle casts in nir_opt_copy_prop_vars
|
||
- util: Provide a secure_getenv fallback for platforms without it
|
||
- vulkan: Use secure_getenv for trigger files
|
||
- aux/trace: Guard triggers behind __normal_user
|
||
- vtn: Use secure_getenv for shader dumping
|
||
- mesa/main: Use secure_getenv for shader dumping
|
||
- radv: Use secure_getenv in radv_builtin_cache_path
|
||
- radv: Use secure_getenv for RADV_THREAD_TRACE_TRIGGER
|
||
- util/disk_cache: Use secure_getenv to determine cache directories
|
||
|
||
GKraats (1):
|
||
|
||
- i915G: show correct number of needed ALU instructions at errmess
|
||
|
||
Ganesh Belgur Ramachandra (9):
|
||
|
||
- radeonsi: Fix clear-render-target shader for 1darrays in NIR
|
||
- radeonsi: "create_dma_compute" shader in nir
|
||
- radeonsi: "create_fmask_expand_cs" shader in nir
|
||
- radeonsi: "get_blitter_vs" shader in nir
|
||
- asahi: fixes prevailing '-Werror=maybe-uninitialized' issue
|
||
- radeonsi: enable nir pass for 64 bit operations
|
||
- radeonsi: add comments for unpack_2x16* utility functions
|
||
- radeonsi: convert "create_query_result_cs" shader to nir
|
||
- radeonsi: convert "gfx11_create_sh_query_result_cs" shader to nir
|
||
|
||
Georg Lehmann (28):
|
||
|
||
- aco, radv: vectorize f2f16 if rounding mode is rtz
|
||
- aco: force uniform result for LDS load with uniform address if it can be non uniform
|
||
- aco: stop using cstdint
|
||
- aco: namespace aco_opcode
|
||
- aco: deduplicate instr_class definition
|
||
- aco: deduplicate Format definition
|
||
- aco: don't CSE v_permlane across exec
|
||
- aco: use null operand for SOPK s_waitcnt
|
||
- aco: fix detecting sgprs read by SMEM hazard
|
||
- aco/tests: add some missing scc defs
|
||
- aco/tests: use correct operand size for some 64bit ops
|
||
- aco: use lm for carry out in vsub32
|
||
- aco: add missing scc def for SALU quad broadcast
|
||
- aco/gfx10+: don't use v_cmpx with VCC def
|
||
- aco: use correct operand size for int tg4 wa
|
||
- aco: add src/def count and size for all ALU opcodes
|
||
- aco: validate ALU operands and defs
|
||
- aco/sched: treat p_dual_src_export_gfx11 like export
|
||
- aco: don't optimize DPP across more than one block
|
||
- aco: add test for post-ra DPP clobbered in linear cfg
|
||
- aco: optimize 32bit fsign by using fmulz with Inf
|
||
- aco: shrink buffer stores with undef/zero components
|
||
- aco/gfx12: implement broadcast dmask shrink behavior
|
||
- aco: apply packed fneg commutatively
|
||
- aco: fix applying input modifiers to DPP8
|
||
- aco: clean up fneg/fabs combining
|
||
- aco: apply fneg/fabs to VOP3P
|
||
- aco: stop scheduling at p_logical_end
|
||
|
||
George Ouzounoudis (9):
|
||
|
||
- nvk: Move SET_BLEND_STATE_PER_TARGET to graphics state initialization
|
||
- nvk: Support extendedDynamicState3ColorBlendEnable
|
||
- nvk: Support extendedDynamicState3ColorBlendEquation
|
||
- nvk: Support extendedDynamicState3SampleMask
|
||
- nvk: Support extended dynamic state for alpha to coverage/one
|
||
- vulkan: Fix dynamic graphics state enum usage
|
||
- nvk: Support extended dynamic state for rasterization stream
|
||
- nvk: Remove pipeline state setting functions
|
||
- nvk: Support extended dynamic state for tessellation domain origin
|
||
|
||
Gert Wollny (15):
|
||
|
||
- virgl: Use host reported limits for max outputs
|
||
- r600: Add callbacks for get_driver_uuid and get_device_uuid
|
||
- r600: Add experimental get_compute_state_info
|
||
- r600: Link with libgalliumvl, when enabling rusticl this is needed
|
||
- r600/sfn: Fixup component count only if intrinsic has it
|
||
- r600/sfn: Allow skipping backend shader optimization for a subset of shaders
|
||
- r600/sfn: keep workgroup and invocation ID registers for whole shader
|
||
- r600/sfn: Fix usage of std::string constructor
|
||
- r600/sfn: Don't try to re-use iterators when the set is made empty
|
||
- zink: Don't pass a blend state when we have full ds3 support
|
||
- r600: lower dround_even also on hardware that supports fp64
|
||
- virgl: Use better reporting for mirror_clamp features
|
||
- radv: Fix compilation with gcc-13 and tsan enabled
|
||
- nir/lower_int64: Fix compilation with gcc-13 and tsan enabled
|
||
- nir/builder: Fix compilation with gcc-13 when tsan is enabled
|
||
|
||
Giancarlo Devich (1):
|
||
|
||
- nir: Workaround MSVC internal compiler error in ARM64 build
|
||
|
||
Guilherme Gallo (19):
|
||
|
||
- ci/bin: Use iid instead of SHA in gitlab_gql
|
||
- ci/bin: Do not forget to add early-stage dependencies
|
||
- ci/bin: Refactor create_job_needs_dag
|
||
- ci/lava: Use project_name instead of hardcoded \`mesa`
|
||
- ci/lava: Fix imports formatting
|
||
- ci/lava: Refactor UART definition building blocks
|
||
- ci/lava: Create LAVAJobDefinition
|
||
- ci/lava: Make SSH definition wrap the UART one
|
||
- ci/lava: Enable SSH by default in fastboot devices
|
||
- ci/lava: Add unit tests covering job definition
|
||
- ci/bin: Fix find_dependency function calls
|
||
- ci/bin: Replace AIOHTTPTransport with RequestsHTTPTransport
|
||
- ci/bin: gql: make the query cache optional
|
||
- ci/bin: gql: Log the caching errors
|
||
- ci/bin: gql: Implement pagination
|
||
- ci/bin: gql: Improve queries for jobs/stages retrieval
|
||
- ci/bin: Fix gitlab_gql methods that uses needs DAG
|
||
- ci/bin: Fix mypy errors in gitlab_gql.py
|
||
- ci/bin: Print a summary list of dependency and target jobs
|
||
|
||
Haihao Xiang (1):
|
||
|
||
- anv: Fix typo in transition_color_buffer
|
||
|
||
Hans-Kristian Arntzen (2):
|
||
|
||
- radv/radeonsi: Forward correct GPU instance to umr.
|
||
- wsi/x11: Add workaround for Detroit Become Human.
|
||
|
||
Helen Koike (3):
|
||
|
||
- ci/zink: add spec\@ext_timer_query\@time-elapsed to flakes
|
||
- ci/ci_run_n_monitor: abort when target gets skipped
|
||
- ci: fix python-test dependency error on merge requests
|
||
|
||
Hyunjun Ko (2):
|
||
|
||
- vulkan/video: fix a typo
|
||
- anv/video: fix out-of-bounds read
|
||
|
||
Iago Toral Quiroga (13):
|
||
|
||
- v3d,v3dv: fix MMU error from hardware prefetch after ldunifa
|
||
- v3d: implement support for PIPE_CAP_NATIVE_FENCE_FD
|
||
- broadcom: fix scheduling dependencies for SETMSF instruction
|
||
- v3dv: disallow image stores on VK_KHR_DISPLAY surfaces
|
||
- v3dv: switch timestamp queries to using BO memory
|
||
- broadcom: disable perquad tmu loads after discards
|
||
- broadcom: lower null pointers
|
||
- v3dv: implement VK_KHR_shader_terminate_invocation
|
||
- v3dv: implement VK_EXT_shader_demote_to_helper_invocation
|
||
- v3dv: expose VK_EXT_subgroup_size_control
|
||
- broadcom/compiler: fix incorrect flags setup in non-uniform if path
|
||
- broadcom/compiler: fix incorrect flags update for subgroup elect
|
||
- broadcom/compiler: be more careful with unifa in non-uniform control flow
|
||
|
||
Ian Romanick (39):
|
||
|
||
- nir/split_vars: Don't split arrays of cooperative matrix types
|
||
- nir/lower_packing: Don't generate nir_pack_32_4x8_split on drivers that can't handle it
|
||
- nir/lower_packing: Add lowering for nir_op_unpack_32_4x8
|
||
- nir/builder: Teach nir_pack_bits and nir_unpack_bits about 32_4x8
|
||
- intel/vec4: Don't emit an empty ELSE
|
||
- intel/compiler: Add basic CFG validation
|
||
- intel/compiler: Limit scope of cur_endif variable
|
||
- intel/compiler: Delete bidirectional block links in opt_predicated_break
|
||
- intel/compiler: Don't create extra CFG links in opt_predicated_break
|
||
- intel/compiler: Don't create extra CFG links when deleting a block
|
||
- intel/compiler: Don't promote CFG link types when removing a block
|
||
- intel/fs: Don't add MOV instructions to DO blocks in combine constants
|
||
- intel/compiler: Verify that DO is alone in the block
|
||
- nir: Handle divergence for decl_reg
|
||
- intel/fs/xe2+: Pass correct dispatch_width to fs_generator for geometry-processing stages.
|
||
- intel/cmat: Update get_slice_type for packed slices
|
||
- intel/cmat: Add lowering for cmat_insert and cmat_extract
|
||
- intel/cmat: Enable packed formats for unary, length, and construct
|
||
- intel/cmat: Enable packed formats for binary ops
|
||
- intel/cmat: Enable packed formats for scalar ops
|
||
- intel/cmat: Add lowering for cmat_bitcast
|
||
- intel/cmat: Lower cmat_load and cmat_store
|
||
- intel/compiler: Initial bits for DPAS instruction
|
||
- intel/disasm: Disassembly support for DPAS
|
||
- intel/compiler: Validation for DPAS instructions
|
||
- intel/fs: Fix scoreboarding for DPAS
|
||
- intel/fs: DPAS lowering
|
||
- intel/fs: nir: Add nir_intrinsic_dpas_intel
|
||
- anv: Add anv_physical_device::has_cooperative_matrix
|
||
- anv: Set COMPUTE_WALKER systolic mode enable flag
|
||
- anv: Set PIPELINE_SELECT systolic mode enable flag
|
||
- anv: Lower indirect derefs again after lowering cooperative matrices
|
||
- anv: Select the SIMD mode very early when cooperative matrices are used
|
||
- intel/dev: Advertise integer configs with saturatingAccumulation too
|
||
- intel/dev: Enable VK_KHR_cooperative_matrix on all Gfx9+ GPUs
|
||
- intel/cmat: Generate better code for nir_intrinsic_cmat_insert
|
||
- intel/compiler: Disable DPAS instructions on MTL
|
||
- intel/compiler: Track lower_dpas flag in brw_get_compiler_config_value
|
||
- intel/compiler: Track mue_compaction and mue_header_packing flags in brw_get_compiler_config_value
|
||
|
||
Italo Nicola (4):
|
||
|
||
- panfrost: fix untracked dependency when converting resource modifier
|
||
- gallium: stop calling resource_copy_region for multisampled copy_image
|
||
- panfrost: legalize afbc before blitting
|
||
- panfrost: expose support for EXT_copy_image
|
||
|
||
Iván Briano (8):
|
||
|
||
- anv: use the right vertexOffset on CmdDrawMultiIndexed
|
||
- hasvk: ensure we reapply always pipeline dynamic state in runtime state
|
||
- anv: allow NULL index buffers
|
||
- anv: remove no longer valid assert
|
||
- anv: handle VkBindMemoryStatusKHR on buffer/image memory bind
|
||
- anv: add support for Cmd*DescriptorSet*2KHR
|
||
- anv: move astc_emu to use descriptors2 calls
|
||
- anv: enable VK_KHR_maintenance6
|
||
|
||
Jan Beich (2):
|
||
|
||
- intel: make CLOCK_TAI optional for non-Linux
|
||
- intel: make CLOCK_BOOTTIME optional for non-Linux
|
||
|
||
Jani Nikula (7):
|
||
|
||
- nir: add names to some typedef'd structs/enums
|
||
- nir: drop \**< style documentation comments
|
||
- isl: drop \**< style documentation comments
|
||
- docs: Add docs/header-stubs/README.rst
|
||
- docs/vulkan: use hawkmoth instead of doxygen
|
||
- docs/nir: use hawkmoth instead of doxygen
|
||
- docs/isl: use hawkmoth instead of doxygen
|
||
|
||
Janne Grunau (4):
|
||
|
||
- gallium: Avoid empty version scripts in pipe-loader
|
||
- gallium: Fix i915 pipe-loader build
|
||
- gallium: Do not create pipe-loader version scripts for disabled drivers
|
||
- asahi: Fix typo in arch check in agx_get_gpu_timestamp
|
||
|
||
Jesse Natalie (64):
|
||
|
||
- microsoft: Disable post-merge CI for Windows
|
||
- d3d12: Only set draw params root parameter index for actual draw params
|
||
- dzn: Implement VK_MSFT_layered_driver
|
||
- wgl: Take pixelformat color channels into account for choosing a PFD
|
||
- winsys/gdi: Handle 4444 and 1010102 texture formats
|
||
- winsys/gdi: Update is_displaytarget_format_supported to reflect reality
|
||
- d3d12: Don't support displaytargets that can't be supported by GDI/DXGI
|
||
- dzn: Use vk_properties helper
|
||
- vulkan: Remove no-longer-needed prototypes for ICD entrypoints
|
||
- vulkan: Consolidate common ICD methods
|
||
- vulkan: Support loader interface v7
|
||
- dzn: Fix memory type sorting
|
||
- microsoft/compiler: Set src/dest nir types on image intrinsics when deducing format
|
||
- d3d12: Disable common state promotion for non-simultaneous-access textures
|
||
- d3d12: Initialize shader key swizzle for non-int textures
|
||
- d3d12: Add a fallback for int clears where value can't be cast to float
|
||
- d3d12: Binding buffers as SSBO/storage image needs to add buffer ranges
|
||
- d3d12: Change memory barrier implementation
|
||
- d3d12: Support ARB_texture_view
|
||
- d3d12: Use format casting for shader images
|
||
- d3d12: GL4.3
|
||
- microsoft/compiler: Bump signature limits for 32 rows of 4 components
|
||
- microsoft/compiler: Don't declare PS output registers split across variables
|
||
- microsoft/compiler: Don't use 64-bit types for signature entries
|
||
- microsoft/compiler: When packing fractional inputs, find a row with space for it
|
||
- microsoft/compiler: Stop lowering all I/O to temps
|
||
- d3d12: Fix location_frac_mask bitfield size
|
||
- d3d12: Split dvec3 interpolatns into devc2 and double
|
||
- d3d12: Support enhanced layouts for VS inputs
|
||
- d3d12: Fix GS variant I/O slot counts
|
||
- d3d12: Enable ARB_enhanced_layouts and ARB_texture_mirror_clamp_to_edge
|
||
- d3d12: Reference count queries in a batch
|
||
- d3d12: ARB_query_buffer_object and GL4.4
|
||
- d3d12: PRIMITIVES_GENERATED for stream > 0 should only be an SO query
|
||
- d3d12: Handle cull distance as an XFB target
|
||
- d3d12: Fix MSAA-disabling pass; sample mask should be 0 for helper lanes
|
||
- d3d12: GL4.5
|
||
- nir_lower_mem_access_bit_sizes: Fix write-mask-constrained 3-byte stores as atomics
|
||
- nir: Add a flag to opt_if to prevent fighting with splitting 64bit phis
|
||
- d3d12: Fixes for QBO shaders
|
||
- d3d12: Enable some 4.6 extensions that were already implemented
|
||
- d3d12: GL4.6
|
||
- nir_lower_mem_access_bit_sizes: Fix assert (bit -> byte size)
|
||
- microsoft/compiler: Fix lower_mem_access_bit_size callback result
|
||
- d3d12/driconf: Force on ARB_texture_view for Blender
|
||
- d3d12: Fix multidimensional array ordering
|
||
- d3d12: Fix h264 encoder 32-bit build (uint64_t -> size_t)
|
||
- d3d12: Fix hevc encoder 32-bit build (uint64_t -> size_t)
|
||
- microsoft/clc: Fix image lowering pass to only erase variables at the end
|
||
- microsoft/clc: Fix images with multiple derefs for real
|
||
- microsoft/clc: Add a test which sinks image derefs
|
||
- microsoft/clc: One more image lowering fix
|
||
- compiler/clc: Don't fail to parse SPIR-V if there's no kernels
|
||
- microsoft/clc: Flip on capabilities to prevent warning spew
|
||
- microsoft: Whitespace change to trigger CI
|
||
- vulkan/wsi: Convert bit tests to bool with != 0
|
||
- util: Re-implement getenv for Windows
|
||
- d3d12: Add a debug flag to opt out of singleton behavior
|
||
- d3d12: Only destroy the winsys during screen destruction, not reset
|
||
- libgl-gdi: Update wgl test to use a 32bit framebuffer
|
||
- libgl-gdi: Update wgl test to set debug flags needed for tests
|
||
- dzn: Fix 3D to 2D image copies
|
||
- zink: Add ASSERTED to vars that are only used for asserts
|
||
- mesa: Consider mesa format in addition to internal format for mip/cube completeness
|
||
|
||
Jianxun Zhang (12):
|
||
|
||
- intel/isl: Add a debug option to override modifer list
|
||
- intel: Move mod_plane_is_clear_color() into isl
|
||
- intel/vulkan: Report clear color in subresource layout
|
||
- intel/vulkan: Allow modifiers supporting fast clear
|
||
- intel/vulkan: Specify offset when creating aux state tracker
|
||
- intel/vulkan: Import aux state tracking buffer
|
||
- intel/vulkan: Remove private binding on fast clear region
|
||
- intel/vulkan: Use the last 2 dwords of clear color struct
|
||
- intel/vulkan: Correct a comment about an offset in fast clear
|
||
- intel/vulkan: Update comment of a workaround of modifiers
|
||
- intel/vulkan: Add COMPRESSED_CLEAR state in layout translation
|
||
- intel/isl: Add Gfx 12.x RC_CCS_CC into modifier scores
|
||
|
||
Job Noorman (5):
|
||
|
||
- ir3: correctly set bit size for 64b constant \@load_ubo
|
||
- nir: add _safe variants of nir_foreach_reg_load/store
|
||
- ir3: lower 64b registers
|
||
- nir: add helper to create cursor after all \@decl_regs
|
||
- ir3: lower 64b registers before creating preamble
|
||
|
||
Jonathan Gray (2):
|
||
|
||
- intel/common: add directory prefix to intel_gem.h include
|
||
- zink: put sysmacros.h include under #ifdef MAJOR_IN_SYSMACROS
|
||
|
||
Jordan Justen (25):
|
||
|
||
- intel/l3: Use devinfo->urb.size when cfg urb-size is 0.
|
||
- anv: Add more space for init_render_queue_state() batch (MTL regression)
|
||
- intel/dev/wa: Raise error if mesa_defs.json contains unknown platforms
|
||
- intel/dev: Rename mtl-m to mtl-u
|
||
- intel/dev: Rename mtl-p to mtl-h
|
||
- intel/compiler: Define XE2 compiler enum
|
||
- intel/genxml: Update COMPUTE_WALKER for xe2
|
||
- iris: Set COMPUTE_WALKER Message SIMD field
|
||
- anv: Set COMPUTE_WALKER Message SIMD field
|
||
- intel/genxml: Update INTERFACE_DESCRIPTOR_DATA for xe2
|
||
- anv, iris: Update INTERFACE_DESCRIPTOR_DATA programming for xe2
|
||
- iris: xe2 doesn't have INTERFACE_DESCRIPTOR_DATA::BarrierEnable
|
||
- intel/genxml: Update 3DSTATE_TE for xe2
|
||
- isl: Add mocs for xe2
|
||
- intel/genxml: Add UNIFIED_COMPRESSION_FORMAT enum for xe2
|
||
- anv, blorp, iris: Update 3DSTATE_PS programming for xe2
|
||
- anv, blorp, iris, intel/genxml: Update 3DSTATE_VS for xe2
|
||
- anv, blorp, iris, intel/genxml: Update 3DSTATE_PS_EXTRA for xe2
|
||
- intel/batch_decoder: Update 3DSTATE_PS decoding for xe2
|
||
- anv, iris, intel/genxml: Update 3DSTATE_GS for xe2
|
||
- anv, iris, intel/genxml: Update 3DSTATE_HS for xe2
|
||
- intel/compiler: Pass max_polygons to copy-prop from fs_visitor.
|
||
- intel/xe2+: Implement brw_wm_state_simd_width_for_ksp() on Xe2+.
|
||
- intel/genxml/gfx125: Move L1_CACHE_CONTROL to enum
|
||
- intel/genxml/gfx125: Move STATE_SURFACE_TYPE to enum
|
||
|
||
Jordan Petridis (1):
|
||
|
||
- Revert "ci: take microsoft farm offline"
|
||
|
||
Joshua Ashton (2):
|
||
|
||
- nvk: Hook up driconf for nvk_instance
|
||
- nvk: Enable KHR_present_id and KHR_present_wait
|
||
|
||
José Expósito (5):
|
||
|
||
- zink: Fix crash on zink_create_screen error path
|
||
- zink: fix dereference before NULL check
|
||
- zink: allow software rendering only if selected
|
||
- zink: initialize drm_fd to -1
|
||
- egl/glx: fallback to software when Zink is forced and fails
|
||
|
||
José Roberto de Souza (56):
|
||
|
||
- anv: Add missing ANV_BO_ALLOC_EXTERNAL flags when calling anv_device_import_bo()
|
||
- intel: Add more information about the PAT entry used
|
||
- intel: Update MTL scanout PAT entry
|
||
- intel: Add a write combining PAT entry
|
||
- anv: Honor memory coherency of the memory type selected
|
||
- anv: Move PAT entry selection to common code
|
||
- anv: Change default PAT entry to WC
|
||
- anv: Calculate mmap mode based on alloc_flags
|
||
- anv: Remove anv_bo flags that can be inferred from alloc_flags
|
||
- iris: Add iris_bufmgr_get_pat_entry_for_bo_flags()
|
||
- intel/common: Add intel_gem_read_correlate_cpu_gpu_timestamp()
|
||
- anv: Reduce ifdefs in anv_GetCalibratedTimestampsEXT()
|
||
- anv: Make use of intel_gem_read_correlate_cpu_gpu_timestamp()
|
||
- intel/common/xe: Re implement xe_gem_read_render_timestamp() with xe_gem_read_correlate_cpu_gpu_timestamp()
|
||
- anv: Bring back the non optimized version of build_load_render_surface_state_address()
|
||
- intel: Sync xe_drm.h
|
||
- intel: Sync xe_drm.h
|
||
- iris: Change default PAT entry to WC
|
||
- intel: Rename PAT entries
|
||
- intel: Share function to do device query in Xe KMD
|
||
- iris: Check for maximum allowed priority in Xe KMD
|
||
- anv: Rename ANV_BO_ALLOC_SNOOPED to ANV_BO_ALLOC_HOST_CACHED_COHERENT
|
||
- anv: Add support all possible cached and coherent memory types
|
||
- intel: Add PAT entries for gfx12 and newer
|
||
- intel: Sync xe_drm.h
|
||
- intel: Enable has_set_pat_uapi for Xe
|
||
- iris: Prepare iris_heap_to_pat_entry() for discrete GPUs
|
||
- iris: Fill PAT fields in Xe KMD gem_create and vm_bind uAPIs
|
||
- anv: Prepare anv_device_get_pat_entry() for discrete GPUs
|
||
- anv: Fill PAT fields in Xe KMD gem_create and vm_bind uAPIs
|
||
- anv: Add heaps for Xe KMD in platforms without LLC
|
||
- intel/dev: Adjust prefetch_size values for Xe2 engines
|
||
- anv: Fix vm bind of DRM_XE_VM_BIND_FLAG_NULL
|
||
- iris: Fix the mmap mode for IRIS_HEAP_DEVICE_LOCAL_PREFERRED
|
||
- intel: Sync xe_drm.h take 2 part 3
|
||
- intel/isl: Set mocs.blitter_dst/src for MTL
|
||
- anv: Fix handling of host_cached_coherent bos in gen9 lp in older kernels
|
||
- anv: Split ANV_BO_ALLOC_HOST_CACHED_COHERENT into two actual flags
|
||
- anv: Promote bos to host_cached+host_coherent in platforms with LLC
|
||
- anv: Avoid unnecessary intel_flush calls
|
||
- intel/genxml/xe2: Update PIPE_CONTROL
|
||
- intel/genxml/xe2: Update PIPELINE_SELECT
|
||
- intel: Sync xe_drm.h final part
|
||
- anv: Remove libdrm usage from Xe KMD backend
|
||
- anv: Add ANV_BO_ALLOC_IMPORTED
|
||
- anv: Replace anv_bo.vram_only by anv_bo.alloc_flags check
|
||
- anv: Assume that imported bos already have flat CCS requirements satisfied
|
||
- intel/isl/xe2: Enable route of Sampler LD message to LSC
|
||
- utils/u_debug: Fix parse of "all,<something else>
|
||
- anv: Increase ANV_MAX_QUEUE_FAMILIES
|
||
- anv: Drop useless STATIC_ASSERT in anv_physical_device_init_queue_families()
|
||
- anv: Simply companion_rcs handling
|
||
- anv: Add missing anv_measure_submit() calls in Xe KMD backend
|
||
- anv: Fix anv_measure_start/stop_snapshot() over copy or video engine
|
||
- anv: Call anv_measure_submit() before anv_cmd_buffer_chain_command_buffers()
|
||
- anv: Fix PAT entry for userptr in integrated GPUs
|
||
|
||
Juan A. Suarez Romero (12):
|
||
|
||
- v3d/ci: run V3D GL tests in 64-bits
|
||
- v3d: use kmsro to create drm screen on real hw
|
||
- vc4/ci: comment why piglit is disabled
|
||
- broadcom/ci: separate hiden jobs to -inc.yml files
|
||
- v3d: include the revision in the device name
|
||
- ci/baremetal: make BM_BOOTCONFIG optional
|
||
- ci: do not mount already mounted directories
|
||
- ci/v3d/vc4: remove explicit modules to load
|
||
- ci/v3dv: add new failures
|
||
- ci/v3dv: update results
|
||
- ci/vc4/v3d: remove some flakes
|
||
- ci/v3d: add support for rpi5
|
||
|
||
Julia Zhang (1):
|
||
|
||
- radeonsi: modify binning settings to improve performance
|
||
|
||
Juston Li (17):
|
||
|
||
- venus: add helper function to get cmd handle
|
||
- venus: refactor out common cmd feedback functions
|
||
- venus: support deferred query feedback recording
|
||
- venus: track/recycle appended query feedback cmds
|
||
- venus: append query feedback at submission time
|
||
- venus: switch to unconditionally deferred query feedback
|
||
- venus: sync protocol for VK_EXT_extended_dynamic_state3
|
||
- venus: pipeline fixes for VK_EXT_extended_dynamic_state3
|
||
- venus: enable VK_EXT_extended_dynamic_state3
|
||
- venus: disable unsupported ExtendedDynamicState3Features
|
||
- venus: implement vkGet[Device]ImageSparseMemoryRequirements
|
||
- radv: enable stippledBresenhamLines on GFX9 chips
|
||
- venus: fix query feedback copy sanitize off by 1
|
||
- venus: rename buffer cache to buffer reqs cache
|
||
- venus: use vk_format helper for plane count
|
||
- venus: support caching image memory requirements
|
||
- venus: add LRU cache eviction for image mem reqs cache
|
||
|
||
Kai Wasserbäch (1):
|
||
|
||
- fix: ac/llvm: LLVM 18: remove useless passes, partially removed upstream
|
||
|
||
Karol Herbst (74):
|
||
|
||
- vtn/opencl: always lower to libclc fmod
|
||
- rusticl/device: restrict image_buffer_size
|
||
- rusticl/device: restrict param_max_size further
|
||
- rusticl/mem: properly set pipe_image_view::access
|
||
- zink: support CLAMP_TO_BORDER with unnormalized coords
|
||
- zink: alias nir scratch memory by lowering to common bit_size
|
||
- zink: emit float controls
|
||
- zink: lower fisnormal as it requires the Kernel Cap
|
||
- radv: fix buffers in vkGetDescriptorEXT with size not aligned to 4
|
||
- rusticl/queue: Only take a weak ref to the last Event
|
||
- rusticl/device: restrict const max size to 1 << 26 bytes
|
||
- rusticl/mesa: pass PIPE_BIND_LINEAR in resource_create_texture_from_user
|
||
- rusticl: handle failed maps gracefully
|
||
- zink: validate pointer alignment in resource_from_user_memory
|
||
- zink: handle denorm preserve execution modes
|
||
- zink: deallocate global_bindings array
|
||
- zink: emit MemoryAccess flags for coherent global load/stores
|
||
- rusticl/mesa/screen: do not derefence the entire pipe_screen struct
|
||
- nir: Stop assuming glsl_get_length() returns 0 for vectors
|
||
- ir2: Stop assuming glsl_get_length() returns 0 for vectors
|
||
- nvc0: implement PIPE_CAP_TIMER_RESOLUTION
|
||
- radeonsi: support importing arbitrary resources
|
||
- radeonsi: hack for importing 3D textures
|
||
- rusticl/context: fix importing gl cube maps
|
||
- docs/features: mark rusticl gl_sharing as done
|
||
- rusticl/queue: do not send empty lists of event to worker queue
|
||
- rusticl/queue: fix implicit flushing of queue dependencies
|
||
- rusticl: only support the matching device for gl_sharing
|
||
- rusticl/memory: fix new clippy::needless-borrow warning
|
||
- nir: allow vec derefs on system values
|
||
- vtn: add hack for system values placed in CrossWorkgroup memory
|
||
- rusticl/api: workaround DPCPP fetching clSetProgramSpecializationConstant
|
||
- rusticl: add x11 dependency
|
||
- rusticl/gl: make GLX support optional
|
||
- clc: allow debug flag to be read from other files
|
||
- clc: add dump_llvm debug options
|
||
- nir/opt_preamble: make load_workgroup_size handling optional
|
||
- radeonsi: lower relative shuffle subgroup ops
|
||
- radeonsi: lower 64bit subgroup shuffle to 32 bit
|
||
- clc: add support for cl_khr_subgroup_shuffle and shuffle_relative
|
||
- rusticl: implement cl_khr_subgroup_shuffle and shuffle_relative
|
||
- ci/fedora: bump to meson 1.3.0
|
||
- rusticl: bump meson req
|
||
- rusticl: use rust.proc_macro for proc macros
|
||
- clc: use addMacroDef/Undef instead of -D/-U flags
|
||
- nak: fix some sm checks for volta
|
||
- nir/algebraic: add support for custom arguments
|
||
- nak: add algebraic lowering pass
|
||
- nak: move nir_lower_subgroups into nak_postprocess_nir
|
||
- rusticl/kernel: explicitly set rounding modes
|
||
- radeonsi: fix reg_saved_mask for non graphics contexts
|
||
- clc: add workaround for clang always defining __IMAGE_SUPPORT_ and __opencl_c_int64
|
||
- rusticl: do not warn on empty RUSTICL_DEBUG or RUSTICL_FEATURES
|
||
- rusticl: silence clippy::arc-with-non-send-sync for now
|
||
- rusticl: fix constant and printf buffer size
|
||
- rusticl/nir: add missing nir include
|
||
- rusticl: check rustc version for flags requiring newer rustc/clippy
|
||
- ci: merge debian-rusticl-testing into debian-testing
|
||
- zink: lock screen queue on context_destroy and CreateSwapchain
|
||
- clc: remove code supporting pre llvm-10
|
||
- zink: fix heap-use-after-free on batch_state with sub-allocated pipe_resources
|
||
- rusticl: specify buffer bindings explicitly
|
||
- rusticl: add QueueContext to track GPU state
|
||
- rusticl/queue: release bound constant buffer
|
||
- rusticl: use real buffer for cb0 for drivers prefering
|
||
- ci,rusticl: bump meson req to 1.3.1
|
||
- rusticl/meson: generate bindings for LLVM
|
||
- rusticl/program: add LLVM functions to cache timestamp
|
||
- rusticl/llvm: do not include spirv-tools/linker.hpp
|
||
- rusticl/kernel: run opt/lower_memcpy later to fix a crash
|
||
- nir: rework and fix rotate lowering
|
||
- nak/opt_out: fix comparison in try_combine_outs
|
||
- rusticl/kernel: check that local size on dispatch doesn't exceed limits
|
||
- clc: force fPIC for every user when using shared LLVM
|
||
|
||
Kenneth Graunke (21):
|
||
|
||
- intel/compiler: Delete unused emit_dummy_fs()
|
||
- intel/compiler: Delete unused repclear shader uniform handling
|
||
- intel/compiler: Delete repclear shader's special case for 1 color target
|
||
- intel/compiler: Drop unused saturate handling in repclear shader
|
||
- intel/compiler: Convert the repclear shader to use send-from-GRF
|
||
- intel/compiler: Assert that FS_OPCODE_[REP\_]FB_WRITE is for pre-Gfx7
|
||
- iris: Make an iris_bucket_cache structure and array per heap
|
||
- iris: Make an iris_heap_is_device_local() helper
|
||
- iris: Rename heap_flags -> heap in i915_gem_create
|
||
- iris: Split system memory heap into cached-coherent and uncached heaps
|
||
- iris: Use 64K BOs for the shader uploader
|
||
- iris: Align fresh BO allocations to 2MB in size
|
||
- iris: Ensure virtual addresses are aligned to 2MB for 2MB+ blocks
|
||
- anv: Implement rudimentary VK_AMD_buffer_marker support
|
||
- anv: Drop 3/4 of PPGTT size restriction for sys heap size calculation
|
||
- anv: Don't report more memory available than the heap size
|
||
- intel/fs: Allow omitting the destination of A64 untyped atomics
|
||
- intel/fs: Drop opt_register_renaming()
|
||
- iris: Initialize bo->index to -1 when importing buffers
|
||
- iris: Don't search the exec list if BOs have never been added to one
|
||
- iris: Skip mi_builder init for indirect draws
|
||
|
||
Konstantin Seurer (40):
|
||
|
||
- radv: Add RADV_MAX_HIT_ATTRIB_DWORDS
|
||
- radv/nir: Add radv_nir_lower_hit_attrib_derefs
|
||
- radv/nir: Handle boolean hit attribs
|
||
- radv/clang-format: Do not indent C++ modifiers
|
||
- radv: Add radv_nir_lower_hit_attrib_derefs_tests
|
||
- radv/sqtt: Fix tracing acceleration structure commands
|
||
- radv/sqtt: Handle monolithic RT pipelines
|
||
- radv/rt: Use a helper for inlining non-recursive stages
|
||
- radv/rt: Skip null checks for small case counts
|
||
- nir/lower_vars_to_scratch: Remove all unused derefs
|
||
- drm-shim/nouveau: Set nv_device_info_v0::platform
|
||
- drm-shim/nouveau: Expose the 2D engine on NV50+
|
||
- drm-shim/nouveau: Stub mitting ioctls
|
||
- nvk: Do not preserve metadata after lower_load_global_constant_offset_instr
|
||
- radv: Add more offsets acceleration_structure_layout
|
||
- radv/bvh: Stop emitting leaf nodes inside the encoder
|
||
- nir: Optimize fpow with small constant exponents
|
||
- radv: Implement VK_KHR_ray_tracing_position_fetch
|
||
- radv: Make pipeline cache object data generic
|
||
- radv: Don't store library stack sizes
|
||
- radv: Add more ray tracing data to the cache
|
||
- radv/rt: Skip compiling a traversal shader
|
||
- radv: Skip compiling chit and miss shaders
|
||
- radv/rt: Remove useless assert
|
||
- radv/rt: Use radv_shader for compiled shaders
|
||
- radv/sqtt: Avoid duplicate stage check
|
||
- radv/rt: Repurpose radv_ray_tracing_stage_is_compiled
|
||
- vtn: Remove transpose(m0)*m1 fast path
|
||
- ac/nir: Export clip distances according to clip_cull_mask
|
||
- vtn: Handle DepthReplacing correctly
|
||
- radv/rmv: Fix tracing ray tracing pipelines
|
||
- radv/rt/rmv: Log pipeline library creation
|
||
- radv: Use PLOC for TLAS builds
|
||
- radv: Remove the BVH depth heuristics
|
||
- radv/rt: Lower ray payloads to registers
|
||
- vtn: Allow for OpCopyLogical with different but compatible types
|
||
- ac/llvm: Enable helper invocations for quad OPs
|
||
- lavapipe: Fix DGC vertex buffer handling
|
||
- lavapipe: Mark vertex elements dirty if the stride changed
|
||
- lavapipe: Report the correct preprocess buffer size
|
||
|
||
Lang Yu (1):
|
||
|
||
- radeonsi: emit SQ_NON_EVENT for GFX11_5
|
||
|
||
Leo Liu (2):
|
||
|
||
- gallium/vl: match YUYV/UYVY swizzle with change of color channels
|
||
- radeonsi: fix video processing path without VPE enabled
|
||
|
||
LingMan (9):
|
||
|
||
- rusticl: Show an error message if the build is attempted with an outdated bindgen version
|
||
- rusticl: Show an error message if the version of bindgen can't be detected
|
||
- rusticl: Directly pass a \`&Device` to \`Mem::map_image` and \`Mem::map_buffer`
|
||
- rusticl: Only put an Arc around PipeScreen where needed
|
||
- rusticl: Avoid repeatedly creating Vecs during Platform initialization
|
||
- rusticl: Turn pointers in enqueue_svm_mem_fill_impl into proper Rust types
|
||
- rusticl: Turn pointers in enqueue_svm_memcpy_impl into slices
|
||
- rusticl/api: Add checking wrappers around \`slice::from_raw_parts{_mut}`
|
||
- rusticl: Use the \`from_raw_parts` wrappers
|
||
|
||
Lionel Landwerlin (88):
|
||
|
||
- intel/fs: fix dynamic interpolation mode selection
|
||
- anv/meson: add missing dependency on the interface header
|
||
- anv: ensure we reapply always pipeline dynamic state in runtime state
|
||
- intel/fs: Xe2 fix for ExBSO on UGM
|
||
- blorp: handle binding table & surface state allocation failures
|
||
- anv: rename internal heaps
|
||
- anv: deal with state stream allocation failures
|
||
- anv: add max_size argument for block & state pools
|
||
- anv: make sure pools can handle more than 2Gb
|
||
- anv: fail pool allocation when over the maximal size
|
||
- anv: use anv_state_pool_state_address for blorp vertex buffer address
|
||
- anv: fix corner case of mutable descriptor pool creation
|
||
- anv: dynamically allocate utrace batch buffers
|
||
- perfetto/pps-producer: add optimized cpu/gpu timestamp correlation support
|
||
- intel/ds: use improved timestamp correlation if available
|
||
- isl: disable MCS compression on R9G9B9E5
|
||
- intel: fix PXP status check
|
||
- anv: handle protected memory allocation
|
||
- anv: allow creation of protected queues
|
||
- anv: Emit protection + session ID on protected command buffers
|
||
- anv: allow protected GEM context creation
|
||
- anv: enable protected memory
|
||
- intel/fs: fix residency handling on Xe2
|
||
- anv: workaround XeSS for Satisfactory
|
||
- intel/fs: rerun divergence analysis prior to convert_from_ssa
|
||
- intel/nir/rt: fix reportIntersection() hitT handling
|
||
- anv: fix source_hash propagation with libraries
|
||
- anv: fix missing naming for dirty bit
|
||
- anv: fix CC_VIEWPORT pointer dirty after blorp/simple-shaders
|
||
- anv: fix dirty state tracking for 3DSTATE_PUSH_CONSTANT_ALLOC
|
||
- intel/decoder: handle 3DPRIMITIVE_EXTENDED in accumulated prints
|
||
- intel/blorp: move Wa_18019816803 out of blorp code
|
||
- anv: get rid of the duplicate pipeline fields in command buffer state
|
||
- anv/blorp: move helper function about BTI changes to blorp
|
||
- intel/perf: fix querying of configurations
|
||
- intel/fs: fix incorrect register flag interaction with dynamic interpolator mode
|
||
- intel/fs: reuse set_predicate()
|
||
- intel/aux_map: introduce ref count of L1 entries
|
||
- anv: use main image address to determine ccs compatibility
|
||
- anv: track & unbind image aux-tt binding
|
||
- anv: remove heuristic preferring dedicated allocations
|
||
- intel/ds: add trace of buffer markers
|
||
- intel/tools: add hang_replay tool
|
||
- intel/hang_replay: add the ability to pass the context image to sim-drm
|
||
- intel: add error2hangdump tool
|
||
- intel/aubinator_error_decode: bump max buffers to 1024
|
||
- intel/error_decode: map i915 gfx12.5 register names to our names
|
||
- intel/tools: hang viewer/editor
|
||
- anv: add a sampler state pool
|
||
- anv: move descriptor set type selection to earlier
|
||
- anv: make a couple of descriptor function private
|
||
- anv: add missing push descriptor flush on ray tracing pipelines
|
||
- anv: set layout printer
|
||
- anv: use 2 different buffers for surfaces/samplers in descriptor sets
|
||
- intel/hang_replay: fix compile race with generated files
|
||
- intel/tools: 32bit compile fixes
|
||
- vulkan/runtime: retain video session creation flags
|
||
- anv/video: only report matching memory types for protected sessions
|
||
- util/u_printf: add a u_printf_ptr() variant
|
||
- nir: make printf_info (de)serializer available
|
||
- nir/clone: fix missing printf_info clone
|
||
- nir: include printfs from linked shaders
|
||
- nir/divergence: handle printf intrinsic
|
||
- nir/serialize: untangle printf serialization from a particular stage
|
||
- nir: fixup nir_printf intrinsic description
|
||
- anv: fix incorrect queue_family access on command buffer
|
||
- isl: constify isl_device_get_sample_counts()
|
||
- anv: get features after initializing drm
|
||
- anv: switch to use runtime physical device properties infrastructure
|
||
- anv: promote EXT_vertex_attribute_divisor to KHR
|
||
- anv: promote EXT_calibrated_timestamps to KHR
|
||
- isl: drop AUX-TT CCS alignment with INTEL_DEBUG=noccs
|
||
- anv: wait for CS write completion before executing secondary
|
||
- isl: further restrict alignment constraints
|
||
- isl: implement Wa_22015614752
|
||
- intel/fs: fix depth compute state for unchanged depth layout
|
||
- anv: remove ANV_ENABLE_GENERATED_INDIRECT_DRAWS variable
|
||
- anv: fix disabled Wa_14017076903/18022508906
|
||
- intel/aux_map: fix fallback unmapping range on failure
|
||
- anv: hide vendor ID for The Finals
|
||
- anv: fix pipeline executable properties with graphics libraries
|
||
- anv: implement undocumented tile cache flush requirements
|
||
- anv: don't prevent L1 untyped cache flush in 3D mode
|
||
- anv: add missing alignment for AUX-TT mapping
|
||
- anv: factor out aux-tt binding logic for future reuse
|
||
- anv: rename aux_tt image field
|
||
- anv: retain ccs image binding address
|
||
- anv: fix transfer barriers flushes with compute queue
|
||
|
||
Louis-Francis Ratté-Boulianne (4):
|
||
|
||
- panfrost: factor out method to check whether we can discard resource
|
||
- panfrost: add copy_resource flag to pan_resource_modifier_convert
|
||
- panfrost: add can_discard flag to pan_legalize_afbc_format
|
||
- panfrost: Legalize before updating part of a AFBC-packed texture
|
||
|
||
Luc Ma (1):
|
||
|
||
- loader: Remove a line of unused include
|
||
|
||
Luca Weiss (1):
|
||
|
||
- freedreno: Enable A305B
|
||
|
||
Lucas Fryzek (2):
|
||
|
||
- freedreno/drm: Add more APIs to per backend API
|
||
- gallivm/nir: Load all inputs into indirect inputs array
|
||
|
||
Lucas Stach (2):
|
||
|
||
- etnaviv: drm: don't update cmdstream timestamp when skipping submit
|
||
- etnaviv: disable 64bpp render/sampler formats
|
||
|
||
Lynne (1):
|
||
|
||
- radv: change queue family order in radv_get_physical_device_queue_family_properties
|
||
|
||
M Henning (21):
|
||
|
||
- nak: Fix a warn(unused_must_use) by calling drop
|
||
- nak: Remove MemScope::Cluster
|
||
- nak: Memory order/scope encodings for Ampere
|
||
- nak: Specify MemScope on MemOrder::Strong
|
||
- nak: Bind nir_intrinsic_access
|
||
- nak: Add MemOrder::Constant
|
||
- nvk: Use load_global_constant for ubo loads
|
||
- nak: Add encodings for cache eviction priorities
|
||
- nak: Set "evict first" from ACCESS_NON_TEMPORAL
|
||
- nak: Request alignment that matches the load width
|
||
- nak: Use nir_combined_align
|
||
- nvk: Fix descriptor alignment offset
|
||
- nak: Provide robustness info to postprocess_nir
|
||
- nak: Call nir_opt_load_store_vectorize
|
||
- nak: Call nir_opt_combine_barriers
|
||
- nak: Call nir_opt_shrink_vectors
|
||
- nak: Clamp negative texture array indices to zero
|
||
- nak: Enable loop unrolling.
|
||
- nak: Print out an instruction count
|
||
- nak: Add a jump threading pass
|
||
- nak: Optimize jumps to fall-through if possible
|
||
|
||
Marcin Ślusarz (1):
|
||
|
||
- anv: fix minSubgroupSize for xe2
|
||
|
||
Marek Olšák (199):
|
||
|
||
- radeonsi: initialize perfetto in the right place
|
||
- ac: add missing gfx11.5 bits
|
||
- ac/gpu_info: adjust attribute ring size for gfx11
|
||
- ac/surface: cosmetic changes
|
||
- ac/surface/tests: cosmetic changes
|
||
- radeonsi: don't use nir_optimization_barrier_vgpr_amd with ACO
|
||
- radeonsi: inline si_allocate_gds and si_add_gds_to_buffer_list
|
||
- radeonsi: inline si_screen_clear_buffer
|
||
- radeonsi: remove redundant VS_PARTIAL_FLUSH for streamout
|
||
- radeonsi: remove AMD_DEBUG=nogfx
|
||
- radeonsi: rename ctx -> sctx in si_emit_guardband
|
||
- radeonsi: remove and inline si_shader::ngg::prim_amp_factor
|
||
- radeonsi: decrease PIPE_CAP_MAX_GEOMETRY_TOTAL_OUTPUT_COMPONENTS to 1024
|
||
- radeonsi: cosmetic changes in si_pm4.c
|
||
- radeonsi: split setting num_threads in si_emit_dispatch_packets
|
||
- radeonsi: use si_shader_uses_streamout properly
|
||
- radeonsi: adjust setting PA_SC_EDGERULE once more
|
||
- radeonsi: various isolated cosmetic changes
|
||
- radeonsi: move max_dist for MSAA into si_state_msaa.c
|
||
- radeonsi: cosmetic changes in si_state_viewport.c
|
||
- radeonsi: cosmetic changes in si_state_binning.c, si_state_msaa.c
|
||
- radeonsi: move setting registers at the end of si_emit_cb_render_state
|
||
- ac/gpu_info: split has_set_pairs_packets into context and sh flags
|
||
- ac/gpu_info,llvm: trivial cosmetic changes
|
||
- radeonsi: clean up si_set_streamout_targets
|
||
- radeonsi: upload shaders using a compute queue instead of gfx
|
||
- radeonsi: rewrite PM4 packet building helpers with less duplication
|
||
- radeonsi: move buffered_xx_regs into a substructure
|
||
- radeonsi: rename HAS_PAIRS -> HAS_SH_PAIRS_PACKED
|
||
- radeonsi: rename radeon_*push_*_sh_reg -> gfx11_*push_*_sh_reg
|
||
- radeonsi: rewrite gfx11_*push*_sh_reg helpers
|
||
- radeonsi: restructure blocks in si_setup_nir_user_data
|
||
- radeonsi: restructure blocks in si_emit_graphics_{shader,compute}_pointers
|
||
- radeonsi/gfx11: use PKT3_SET_CONTEXT_REG_PAIRS_PACKED for PM4 states
|
||
- radeonsi: don't call nir_lower_compute_system_values too many times
|
||
- radeonsi: don't check DCC compatibility on chips where it's no-op
|
||
- radeonsi: cosmetic changes in si_emit_db_render_state
|
||
- radeonsi: prettify code around PA_SC_LINE_STIPPLE
|
||
- radeonsi: move emitting VGT_TF_PARAM into gfx10_emit_shader_ngg
|
||
- radeonsi: remove num_params variable from gfx10_shader_ngg
|
||
- radeonsi: move SPI_SHADER_IDX_FORMAT into the preamble (it's immutable)
|
||
- radeonsi: adjust the total viewport area
|
||
- radeonsi/gfx11: use SET_CONTEXT_REG_PAIRS_PACKED for other states
|
||
- radeonsi/gfx11: don't set OREO_MODE to fix rare corruption
|
||
- radeonsi: don't dma-upload shaders on APUs
|
||
- radeonsi/ci: update failures for gfx103
|
||
- st/mesa: disable light_twoside if back faces are culled
|
||
- glsl/nir: return failure from link_varyings if there is a linker error
|
||
- nir: add lowering from FS LAYER input to LAYER_ID sysval
|
||
- nir: return progress from nir_remove_sysval_output
|
||
- ac/nir: add kill_layer flag to VS/GS/NGG lowering
|
||
- st/mesa: set pipe_framebuffer_state::layers for PBO blits
|
||
- radeonsi: clean up si_nir_kill_outputs
|
||
- radeonsi: don't allocate output space for LAYER/VIEWPORT before TES and GS
|
||
- radeonsi: implement gl_Layer in FS as a system value
|
||
- radeonsi: remove the LAYER output if the framebuffer state has only 1 layer
|
||
- nir: fix gathering TESS_LEVEL_INNER/OUTER usage with lowered IO
|
||
- nir: don't declare illegal varyings in nir_create_passthrough_tcs
|
||
- nir/print: print PATCH0 and VARn_16BIT names instead of numbers for TCS and TES
|
||
- gallium/docs: make CAP doc order match definition order
|
||
- gallium: add PIPE_CAP_PERFORMANCE_MONITOR for GL_AMD_performance_monitor
|
||
- radeonsi: group equal CAP cases
|
||
- radeonsi: only expose GL_AMD_performance_monitor on gfx7-10.3
|
||
- ac: rename ac_parse_ib.c -> ac_ib_parser.c
|
||
- ac: move the IB parsers into ac_parse_ib.c
|
||
- ac: add an IB parser that gathers context rolls
|
||
- mesa: optimize _mesa_matrix_is_identity
|
||
- mesa: skip checking for identity matrix in glMultMatrixf with glthread
|
||
- mesa: optimize setting the identity matrix
|
||
- glthread: add a marker at the end of batches indicating the end
|
||
- glthread: eliminate push/pop calls in PushMatrix+Draw/MultMatrixf+PopMatrix
|
||
- glthread: add option to put autogenerated marshal structures in the header file
|
||
- glapi: rename primcount -> instance_count in a few Draw functions
|
||
- glthread: use autogenerated marshal structures for custom functions
|
||
- glthread: rework type reduction and reduce vertex stride params to 16 bits
|
||
- glapi: only expose GL_EXT_direct_state_access functions to GL compatibility
|
||
- glthread: don't do "if (COMPAT)" if the function is not in the GL core profile
|
||
- glapi: only allow deprecated="" on non-aliased functions
|
||
- glthread: pass struct marshal_cmd_DrawElementsUserBuf into Draw directly
|
||
- mesa: deduplicate glVertexPointer and glNormalPointer vs DSA error checking
|
||
- glthread: add a string table of function names
|
||
- radeonsi/gfx11: fix unaligned SET_CONTEXT_PAIRS_PACKED
|
||
- radeonsi: don't set non-existent VGT_GS_MAX_PRIMS_PER_SUBGROUP on gfx10
|
||
- radeonsi: change the low-priority compiler queue to normal priority
|
||
- radeonsi: update shaders for blend state only if the shader key changed
|
||
- radeonsi: update shaders for rasterizer state only if the shader key changed
|
||
- radeonsi: clean up setting poly/line/stipple shader key bits
|
||
- radeonsi: rewrite how shader key bits dependent on current_rast_prim are updated
|
||
- radeonsi: rewrite si_get_total_colormask as si_any_colorbuffer_written
|
||
- radeonsi: in bind_{blend,rs}_state, only call 1 update function per if
|
||
- radeonsi/gfx11: skip si_set_streamout_enable because it has no effect
|
||
- radeonsi: execute streamout_begin after cache flushes
|
||
- radeonsi: don't print the preamble state separately for GALLIUM_DDEBUG
|
||
- radeonsi: replace gl_FrontFacing with a constant if one side is always culled
|
||
- radeonsi: set OOB_SELECT for VBOs in si_create_vertex_elements
|
||
- radeonsi: group most vertex element fields
|
||
- radeonsi/gfx11: prefer Wave64 for PS without inputs for better VALU perf
|
||
- radeonsi/gfx11: disable the shader profile for Medical that forces Wave64
|
||
- radeonsi/gfx11: disable the shader profile for Medical that disables binning
|
||
- radeonsi: clean up how debug flags and shader profiles determine the wave size
|
||
- radeonsi/gfx11: prefer Wave64 for VS/TCS/TES/GS because it's slightly faster
|
||
- winsys/amdgpu: bypass GL2 for command buffers
|
||
- radeonsi: track NIR progress properly for optimizations in si_get_nir_shader
|
||
- ac,radeonsi: rename pos_inputs -> fragcoord_components
|
||
- nir,radeonsi: add FLAGS into load_vector_arg_amd to record color input usage
|
||
- radeonsi: change the signature of si_nir_lower_ps_color_input
|
||
- radeonsi: gather lowered color inputs for monolithic PS
|
||
- radeonsi: add PS input info into si_shader_binary_info
|
||
- radeonsi: don't include the PARAM_GEN input in si_shader_info
|
||
- radeonsi: decrease NUM_INTERP if uniform inlining eliminated PS inputs
|
||
- radeonsi: update comments about uniform inlining
|
||
- radeonsi: decrease NUM_INTERP if export formats/colormask eliminated PS inputs
|
||
- util: make BITSET_TEST_RANGE_INSIDE_WORD take a value to compare with
|
||
- radeonsi: merge context_reg_saved_mask and other_reg_saved_mask into a BITSET
|
||
- radeonsi: convert depth-stencil-alpha state to tracked registers
|
||
- radeonsi: convert rasterizer state to tracked registers
|
||
- ac/gpu_info: fix printing radeon_info after adding VPE
|
||
- radeonsi: rework how guardband registers are updated to decrease overhead
|
||
- mesa: fix _mesa_matrix_is_identity
|
||
- mesa: remove some DrawTransformFeedback duplication
|
||
- mesa: remove some DrawElementsInstanced duplication
|
||
- mesa: remove more DrawArrays/Elements duplication
|
||
- mesa: remove non-relevant 16-year-old comment
|
||
- st/mesa: make prepare_(indexed\_)draw non-static
|
||
- mesa: inline st_draw_transform_feedback
|
||
- mesa: call st_prepare_(indexed\_)draw before Driver.DrawGallium(MultiMode)
|
||
- st/mesa: no need to check index_size in st_prepare_indexed_draw anymore
|
||
- mesa: move index bounds code (st_prepare_indexed_draw) into draw.c
|
||
- cso: do cso_context inheritance how we do it elsewhere
|
||
- cso: inline cso_get_pipe_context
|
||
- mesa: execute an error path sooner in _mesa_validated_drawrangeelements
|
||
- gallium: add typedef pipe_draw_func matching the draw_vbo signature and use it
|
||
- ac/llvm: remove code for converting txd from 1D to 2D because NIR does it
|
||
- ac,radeonsi: require DRM 3.27+ (kernel 4.20+) same as RADV
|
||
- winsys/amdgpu: don't return a value from cs_add_buffer
|
||
- winsys/amdgpu: cosmetic changes in amdgpu_cs_add_buffer
|
||
- winsys/amdgpu: inline amdgpu_add_fence_dependencies_bo_lists
|
||
- winsys/amdgpu: use inheritance for the cache_entry BO field
|
||
- winsys/amdgpu: use inheritance for the real BO
|
||
- winsys/amdgpu: use inheritance for the sparse BO
|
||
- winsys/amdgpu: use inheritance for the slab BO
|
||
- winsys/amdgpu: move lock from amdgpu_winsys_bo into sparse and real BOs
|
||
- winsys/amdgpu: don't count memory usage because it's unused
|
||
- winsys/amdgpu: change real/slab/sparse_buffers to buffer_lists[3]
|
||
- winsys/amdgpu: change amdgpu_lookup_buffer to take struct amdgpu_buffer_list
|
||
- winsys/amdgpu: clean up duplicated code around amdgpu_lookup/add_buffer
|
||
- winsys/amdgpu: return amdgpu_cs_buffer* from add/lookup_buffer instead of index
|
||
- winsys/amdgpu: pass amdgpu_buffer_list* to amdgpu_add_bo_fences_to_dependencies
|
||
- winsys/amdgpu: clean up the rest of the code for cs->buffer_lists
|
||
- winsys/amdgpu: fix amdgpu_cs_has_user_fence for VPE
|
||
- winsys/amdgpu: document BO structures
|
||
- ci: disable the google/freedreno farm because it's down
|
||
- glthread: add a missing end-of-batch marker
|
||
- mesa: micro-improvements in draw.c
|
||
- st/mesa: restore pipe_draw_info::mode at the end of st_hw_select_draw_gallium
|
||
- mesa: add a pipe_draw_indirect_info* parameter into the DrawGallium callback
|
||
- mesa: enable GL_SELECT and GL_FEEDBACK modes for indirect draws
|
||
- winsys/amdgpu: reduce wasted memory due to the size tolerance in pb_cache
|
||
- gallium/pb_slab: move group_index and entry_size from pb_slab_entry to pb_slab
|
||
- iris,zink,winsys/amdgpu: remove unused/redundant slab->entry_size
|
||
- winsys/amdgpu: rename to amdgpu_bo_slab to amdgpu_bo_slab_entry
|
||
- winsys/amdgpu: stop using pb_buffer::vtbl
|
||
- gallium/pb_cache: remove pb_cache_entry::end to save space
|
||
- gallium/pb_cache: switch time variables to milliseconds and 32-bit type
|
||
- radeon_winsys: add struct radeon_winsys* parameter into fence_reference
|
||
- r300,r600,radeon/winsys: always pass the winsys to radeon_bo_reference
|
||
- winsys/amdgpu: don't layer slabs, use only 1 level of slabs, it improves perf
|
||
- winsys/amdgpu: add amdgpu_bo_real_reusable slab for the backing buffer
|
||
- winsys/amdgpu: remove now-redundant amdgpu_bo_slab_entry::real
|
||
- winsys/amdgpu: remove va (gpu_address) from amdgpu_bo_slab_entry
|
||
- winsys/amdgpu: don't use gpu_address to compute slab entry offset in bo_map
|
||
- gallium/pb_buffer: define pb_buffer_lean without vtbl, inherit it by pb_buffer
|
||
- gallium/pb_cache: switch to pb_buffer_lean
|
||
- gallium/pb_cache: remove pb_cache_entry::mgr
|
||
- gallium/pb_cache: remove pb_cache_entry::buffer
|
||
- winsys/radeon: stop using pb_buffer::vtbl
|
||
- r300,r600,radeonsi: switch to pb_buffer_lean
|
||
- winsys/amdgpu: allocate 1 amdgpu_bo_slab_entry per cache line
|
||
- winsys/amdgpu: compute bo->unique_id at pb_slab_alloc, not at memory allocation
|
||
- winsys/amdgpu: rewrite BO fence tracking by adding a new queue fence system
|
||
- winsys/amdgpu: rename amdgpu_winsys_bo::bo -> bo_handle
|
||
- winsys/amdgpu: rename amdgpu_bo_sparse::lock -> commit_lock
|
||
- winsys/amdgpu: rename amdgpu_bo_real::lock to map_lock
|
||
- winsys/amdgpu: remove dependency_flags parameter from cs_add_fence_dependency
|
||
- winsys/amdgpu: implement explicit fence dependencies as sequence numbers
|
||
- winsys/amdgpu: use pipe_reference for amdgpu_ctx refcounting
|
||
- winsys/amdgpu: don't use amdgpu_fence::ctx for fence dependencies
|
||
- winsys/amdgpu: simplify code using amdgpu_cs_context::chunk_ib
|
||
- radeonsi/ci: add gfx11 flakes
|
||
- glthread: don't unroll draws using user VBOs with GLES
|
||
- glthread: add proper helpers for call fences
|
||
- gallium/u_threaded_context: use function table to jump to different draw impls
|
||
- mesa,u_threaded_context: add a fast path for glDrawElements calling TC directly
|
||
- gallium/u_threaded: use a dummy end call to indicate the end of the batch
|
||
- gallium/u_threaded: remove unused param from tc_bind_buffer/add_to_buffer_list
|
||
- gallium/u_threaded: keep it enabled even if the CPU count is 1
|
||
- meson: require libdrm_amdgpu 2.4.119
|
||
- winsys/amdgpu: remove amdgpu_bo_real::gpu_address, use amdgpu_va_get_start_addr
|
||
- winsys/amdgpu: remove amdgpu_bo_sparse::gpu_address, use amdgpu_va_get_start_addr
|
||
|
||
Mario Kleiner (1):
|
||
|
||
- v3d: add B10G10R10[X2/A2]_UNORM to format table.
|
||
|
||
Mark Collins (8):
|
||
|
||
- meson: Only include virtio when DRM available
|
||
- meson: Only link libvdrm to Turnip with virtio KMD
|
||
- meson: Update lua wrap to 5.4.6-4
|
||
- freedreno/rddecompiler: Emit explicit scope for CP_COND_REG_EXEC
|
||
- freedreno/rddecompiler: Decode ELSE branches using NOPs
|
||
- freedreno/rddecompiler: Reset buffers after RD_CMDSTREAM_ADDR
|
||
- freedreno/rddecompiler: Print pkt values in hex
|
||
- freedreno/rddecompiler: Add ability to read GPU buffer into file
|
||
|
||
Mark Janes (7):
|
||
|
||
- iris: make shader cache content deterministic
|
||
- anv: make shader cache content deterministic
|
||
- intel: remove workaround for preproduction DG2 steppings
|
||
- intel/dev: improve descriptions of workaround macros.
|
||
- intel/dev: poison macros for workarounds fixed at a stepping
|
||
- intel: remove MTL a0 workarounds
|
||
- intel/dev: update workaround definitions to latest defect status
|
||
|
||
Mart Raudsepp (1):
|
||
|
||
- docs: Fix typo in OpenGL 3.3 support on Asahi
|
||
|
||
Martin Roukala (né Peres) (12):
|
||
|
||
- zink/ci: drop the concurrency of the zink-radv-vangogh-valve job
|
||
- ci/b2c: fix artifact collection
|
||
- radv/ci: fix \`vkcts-navi21-valve` execution
|
||
- Revert "ci/deqp-runner: turn paths in errors into links"
|
||
- radv: disable meshShaderQueries on gfx10.3
|
||
- amd/ci: reduce Renoir's concurrency to 16
|
||
- ci/b2c: fix the \`cmdline_extra` variable name
|
||
- ci: disable the valve-kws farm until it can be rebooted
|
||
- Revert "ci: disable the valve-kws farm until it can be rebooted"
|
||
- ci: disable mupuf's farm
|
||
- ci: disable collabora's farm which appears to be down
|
||
- Revert "ci: disable mupuf's farm"
|
||
|
||
Mary Guillemard (37):
|
||
|
||
- venus: skip bind sparse info when checking for feedback query
|
||
- nir: Add AGX-specific doorbell and stack mapping opcodes
|
||
- agx: Add doorbell and stack mapping opcodes
|
||
- agx: Handle doorbell and stack mapping intrinsics
|
||
- asahi: clc: Handle doorbell and stack mapping intrinsics
|
||
- agx: Add stack load and store opcodes
|
||
- agx: Implement scratch load/store
|
||
- agx: Add stack adjust opcode
|
||
- agx: Emit stack_adjust in the entrypoint
|
||
- zink: Check for VK_EXT_extended_dynamic_state3 before setting A2C
|
||
- nak: sm75: Fix panic when encoding MUFU with SQRT and TANH
|
||
- nak: Make PRMT selection a Src
|
||
- nak: Add support for fddx and fddy
|
||
- nak: Add for_each_instr in Shader
|
||
- nak: Gather global memory usage for ShaderInfo
|
||
- nak: Fix ALD/AST encoding for vtx and offset
|
||
- nak: Add a complete wrapper around SPH
|
||
- nak: Collect information to create SPH
|
||
- nak: Remove encode_hdr_for_nir
|
||
- nak: Restructure ShaderInfo
|
||
- nak: Add geometry shader support
|
||
- nak: Ensure we allocate one barrier when using BAR.SYNC
|
||
- nak: Implement VK_KHR_shader_terminate_invocation
|
||
- nak: Move nir_lower_int64 after I/O lowering
|
||
- nak: Pass offset to load_frag_w
|
||
- nak: Rewrite nir_intrinsic_load_sample_pos and implement nir_intrinsic_load_barycentric_at_sample
|
||
- nir: Add a ldtram_nv intrinsic
|
||
- nak: Add more bits discovered in SPH
|
||
- nvk: Implement VK_KHR_fragment_shader_barycentric
|
||
- nvk: Disable flush on each queries and flush at the end
|
||
- nvk: Implement VK_EXT_primitives_generated_query
|
||
- venus: Do not submit batch manually when no feedback is required
|
||
- nak: Fix NAK_ATTR_CLIP_CULL_DIST_7 wrong value
|
||
- nak: sm50: Implement FFMA
|
||
- zink: Force 128 fs input components under Venus for Intel
|
||
- zink: Initialize pQueueFamilyIndices for image query / create
|
||
- zink: Always fill external_only in zink_query_dmabuf_modifiers
|
||
|
||
Matt Turner (11):
|
||
|
||
- r600: Add missing dep on git_sha1.h
|
||
- util: Include stdint.h in libdrm.h
|
||
- util: Provide DRM_DEVICE_GET_PCI_REVISION definition
|
||
- ci/lava: Add firmware-misc-nonfree on amd64
|
||
- intel: Only validate inst compaction if debugging a shader stage
|
||
- iris: Only initialize batch decoder if necessary
|
||
- symbols-check: Add _GLOBAL_OFFSET_TABLE_
|
||
- nir: Fix cast
|
||
- nir/tests: Reenable tests that failed on big-endian
|
||
- util: Add DETECT_ARCH_HPPA macro
|
||
- util/tests: Disable half-float NaN test on hppa/old-mips
|
||
|
||
Mauro Rossi (3):
|
||
|
||
- Android.mk: filter out cflags to build with Android 14 bundled clang
|
||
- Android.mk: disable android-libbacktrace to build with Android 14
|
||
- Android.mk: be able to build radeonsi without llvm
|
||
|
||
Max R (3):
|
||
|
||
- virgl: Implement clear_render_target and clear_depth_stencil
|
||
- ci: Uprev virglrenderer
|
||
- d3d10umd: Fix compilation
|
||
|
||
Maíra Canal (22):
|
||
|
||
- v3dv: implement VK_EXT_multi_draw
|
||
- v3dv: move multisync functions to the beginning of the file
|
||
- v3dv: allow different in/out sync queues
|
||
- v3dv: allow set_multisync() to accept more wait syncobjs
|
||
- drm-uapi: extend interface for indirect CSD CPU job
|
||
- v3dv: check CPU queue availability
|
||
- v3dv: create a CPU queue type
|
||
- v3dv: use the indirect CSD user extension
|
||
- v3dv: occlusion queries aren't handled with a CPU job
|
||
- drm-uapi: extend interface for timestamp query CPU job
|
||
- v3dv: use the timestamp query user extension
|
||
- drm-uapi: extend interface for reset timestamp CPU job
|
||
- v3dv: use the reset timestamp user extension
|
||
- drm-uapi: extend interface for copy timestamp results CPU job
|
||
- v3dv: use the copy timestamp query results user extension
|
||
- drm-uapi: extend interface for the reset performance query CPU job
|
||
- v3dv: don't start iterating performance queries at zero
|
||
- v3dv: use the reset performance query user extension
|
||
- drm-uapi: extend interface for copy performance query CPU job
|
||
- v3dv: use the copy performance query results user extension
|
||
- v3d/v3dv: move V3D_CSD definitions to a separate file
|
||
- v3dv: enable CPU jobs in the simulator
|
||
|
||
Michael Catanzaro (1):
|
||
|
||
- util: create parents of disk cache directory if needed
|
||
|
||
Michael Tretter (1):
|
||
|
||
- egl/wayland: fix formatting and add trailing comma
|
||
|
||
Michel Dänzer (2):
|
||
|
||
- gallium/dri: Return __DRI_ATTRIB_SWAP_UNDEFINED for _SWAP_METHOD
|
||
- glx: Handle IGNORE_GLX_SWAP_METHOD_OML regardless of GLX_USE_APPLEGL
|
||
|
||
Mike Blumenkrantz (48):
|
||
|
||
- zink: don't block large vram allocations
|
||
- vulkan/wsi: unify all the image usage flag caps
|
||
- draw: fix uninit variable false positive
|
||
- zink: add copy box locking
|
||
- tc: add non-definitive tracking for batch completion
|
||
- tc: always track fb attachments
|
||
- tc: add batch usage tagging to threaded_resource
|
||
- tc: use strong refs for fb attachment tracking
|
||
- tc: allow unsynchronized texture_subdata calls where possible
|
||
- zink: handle unsynchronized image maps from tc
|
||
- zink: barrier_cmdbuf -> reordered_cmdbuf
|
||
- zink: assert that transfer_dst is available before doing buf2img
|
||
- zink: rework cmdbuf submission to be more extensible
|
||
- zink: add a third cmdbuf for unsynchronized (not reordered) ops
|
||
- zink: add flag to restrict unsynchronized texture access
|
||
- zink: add locking for batch refs
|
||
- zink: enable unsynchronized texture uploads using staging buffers
|
||
- ci: skip zink vram test
|
||
- ci: bump VVL to 1.3.269
|
||
- zink: emit SpvCapabilitySampleRateShading with SampleId
|
||
- zink: always set VK_EXTERNAL_MEMORY_HANDLE_TYPE_HOST_ALLOCATION_BIT_EXT for usermem
|
||
- zink: clamp resolve extents to src/dst geometry
|
||
- zink: only emit xfb execution mode for last vertex stage
|
||
- aux/u_transfer_helper: set rendertarget bind for msaa staging resource
|
||
- zink: unset explicit_xfb_buffer for non-xfb shaders
|
||
- mesa/st/texture: match width+height for texture downloads of cube textures
|
||
- zink: add more locking for compute pipelines
|
||
- radv: correctly return oom from the device when failing to create a cs
|
||
- zink: make (some) vk allocation commands more robust against vram depletion
|
||
- zink: check for cbuf0 writes before setting A2C
|
||
- vk/cmd_queue: exempt more descriptor functions from autogeneration
|
||
- vulkan: add wrappers for descriptor '2' functions
|
||
- zink: enforce maxTexelBufferElements for texel buffer sizing
|
||
- zink: always force flushes when originating from api frontend
|
||
- vk/cmd_queue: stop using explicit casts
|
||
- vk/cmd_queue: generate maint6 functions
|
||
- vk/cmd_queue: fix up indentation a little
|
||
- lavapipe: maint6 descriptor stuff
|
||
- lavapipe: maint6
|
||
- zink: fix buffer rebind early-out check
|
||
- zink: ignore tc buffer replacement info
|
||
- vk/cmdbuf: add back deleted maint6 workgraph bits
|
||
- lavapipe: use pushconstants2 for dgc
|
||
- lavapipe: fix devenv icd filename
|
||
- zink: fix separate shader patch variable location adjustment
|
||
- zink: set more dynamic states when using shader objects
|
||
- zink: always map descriptor buffers as COHERENT
|
||
- zink: fix descriptor buffer unmaps on screen destroy
|
||
|
||
Mohamed Ahmed (4):
|
||
|
||
- nvk: Fix GetImageSubResourceLayout for non-disjoint images
|
||
- nil: Add support for linear images
|
||
- nvk: Wire up rendering to linear
|
||
- nvk: Enable linear images for texturing
|
||
|
||
Molly Sophia (1):
|
||
|
||
- tu: Fix KHR_present_id and KHR_present_wait being used without initialization
|
||
|
||
Nanley Chery (11):
|
||
|
||
- iris: Optimize BO_ALLOC_ZEROED for suballocations
|
||
- iris: Zero the clear color before FCV_CCS_E rendering
|
||
- iris: Don't memset the clear color BO during aux init
|
||
- iris: Simplify get_main_plane_for_plane
|
||
- iris: Simplify a plane count check in from_handle
|
||
- iris: Use helpers for generic aux plane importing
|
||
- iris: Inline import_aux_info
|
||
- iris: Use common res fields for imported planes
|
||
- iris: Delay main and aux resource creation on import
|
||
- isl: Handle MOD_INVALID in clear color plane check
|
||
- iris: Fix lowered images in get_main_plane_for_plane
|
||
|
||
Neha Bhende (1):
|
||
|
||
- ntt: lower indirect tesslevels in ntt
|
||
|
||
Patrick Lerda (1):
|
||
|
||
- glsl/nir: fix gl_nir_cross_validate_outputs_to_inputs() memory leak
|
||
|
||
Paulo Zanoni (34):
|
||
|
||
- anv: don't forget to destroy device->vma_mutex
|
||
- anv: alloc client visible addresses at the bottom of vma_hi
|
||
- anv/sparse: join multiple bind operations when possible
|
||
- anv/sparse: join multiple NULL binds when possible
|
||
- anv/sparse: also print bind->address at dump_anv_vm_bind
|
||
- intel/genxml: add the Gen12+ TR-TT registers
|
||
- anv/sparse: extract anv_sparse_bind()
|
||
- anv: setup the TR-TT vma heap
|
||
- vulkan: fix potential memory leak in create_rect_list_pipeline()
|
||
- anv/sparse: allow sparse resouces to use TR-TT as its backend
|
||
- anv/sparse: fix limits.sparseAddressSpaceSize when using vm_bind
|
||
- anv/trtt: join L1 writes into a single MI_STORE_DATA_IMM when possible
|
||
- anv/trtt: also join the L3/L2 writes into a single MI_STORE_DATA_IMM
|
||
- anv/sparse: drop anv_sparse_binding_data from dump_anv_vm_bind()
|
||
- anv/sparse: join all submissions into a single anv_sparse_bind() call
|
||
- anv/sparse: pass anv_sparse_submission to the backend functions
|
||
- anv/sparse: add 'queue' to anv_sparse_submission
|
||
- anv/trtt: use 'queue' from anv_sparse_submission in the backend
|
||
- anv/sparse: move waiting/signaling syncobjs to the backends
|
||
- anv/sparse: process image binds before opaque image binds
|
||
- anv/i915: extract setup_execbuf_fence_params()
|
||
- anv/xe: allow passing extra syncs to xe_exec_process_syncs()
|
||
- anv/trtt: don't wait/signal syncobjs using the CPU anymore
|
||
- anv/trtt: add struct anv_trtt_batch_bo and pass it around
|
||
- anv/trtt: add support for queue->sync to the TR-TT batches
|
||
- anv/trtt: properly handle the lifetime of TR-TT batch BOs
|
||
- anv: enable sparse by default on i915.ko
|
||
- anv/sparse: don't support YCBCR 2x1 compressed formats
|
||
- anv+zink/ci: document new sparse failures
|
||
- anv/sparse: reject binds that are not a multiple of the granularity
|
||
- anv/tr-tt: assert the bind size is a multiple of the granularity
|
||
- anv/sparse: check if the non-sparse version is supported first
|
||
- anv/sparse: document USAGE_2D_3D_COMPATIBLE as non-standard too
|
||
- intel/tools: fix compilation of intel_hang_viewer on 32 bits
|
||
|
||
Pavel Asyutchenko (1):
|
||
|
||
- mesa/main: allow S3TC for 3D textures
|
||
|
||
Pavel Ondračka (17):
|
||
|
||
- r300: add late vectorization after nir_move_vec_src_uses_to_dest
|
||
- r300: small adress register load optimization
|
||
- r300: nir fcsel/CMP lowering pass for R500
|
||
- r300: add some more early bool lowering
|
||
- r300: lower flrp in NIR
|
||
- r300: fcsel_ge lowering from lowered ftrunc
|
||
- r300: lower ftrunc in NIR
|
||
- r300: remove backend CMP lowering
|
||
- r300: remove backend LRP lowering
|
||
- r300: mark load_ubo_vec4 with ACCESS_CAN_SPECULATE
|
||
- r300: fix memory leaks in compiler tests
|
||
- ci: uprev mesa-trigger container
|
||
- ci: add r300 RV530 dEQP gles2 CI job
|
||
- r300/ci: add missing kernel url quotes
|
||
- r300/ci: switch to b2c v0.9.11
|
||
- r300/ci: add piglit job
|
||
- r300: fix reusing of color varying slots for generic ones
|
||
|
||
Peyton Lee (6):
|
||
|
||
- frontends, va: add new parameters of post processor
|
||
- amd,radeonsi: add libvpe
|
||
- amd: add new hardware ip for vpe
|
||
- amd, radeonsi: add si_vpe.c with helper functions of VPE lib
|
||
- amd, radeonsi: supports post processing entrypoint
|
||
- winsys, amdgpu, drm: add VPE submission handle
|
||
|
||
Phillip Pearson (1):
|
||
|
||
- radeonsi: use PRIu64 instead of %lu for uint64_t formatting
|
||
|
||
Pierre-Eric Pelloux-Prayer (23):
|
||
|
||
- mesa: restore call to _mesa_set_varying_vp_inputs from set_vertex_processing_mode
|
||
- radeonsi/ci: update failures
|
||
- radeonsi: check sctx->tess_rings is valid before using it
|
||
- Revert "radeonsi: decrease PIPE_CAP_MAX_GEOMETRY_TOTAL_OUTPUT_COMPONENTS to 1024"
|
||
- egl/wayland: set the correct modifier for the linear_copy image
|
||
- radeonsi: use a compute shader to convert unsupported indices format
|
||
- radeonsi: update guardband if vs_disables_clipping_viewport changes
|
||
- radeonsi/sqtt: fix RGP pm4 state emit function
|
||
- radeonsi/sqtt: clear record_counts variable
|
||
- radeonsi/sqtt: rework pm4.reg_va_low_idx
|
||
- radeonsi/sqtt: use calloc instead of malloc
|
||
- radeonsi/sqtt: reformat with clang-format
|
||
- radeonsi/sqtt: fix capturing indirect dispatches with SQTT
|
||
- radeonsi/winsys: add cs_get_ip_type function
|
||
- radeonsi/sqtt: fix emitting SQTT userdata when CAM is needed
|
||
- radeonsi/sqtt: fix capturing RGP on RDNA3 with more than one Shader Engine
|
||
- radeonsi/sqtt: handle COMPUTE queues as well
|
||
- radeonsi: fix extra_md handling with fmask
|
||
- ac/surface: don't oversize surf_size
|
||
- radeonsi: compute epitch when modifying surf_pitch
|
||
- Revert "ci/radeonsi: disable VA-API testing on raven"
|
||
- radeonsi: emit cache flushes before draw registers
|
||
- radeonsi: adjust flags for si_compute_shorten_ubyte_buffer
|
||
|
||
Qiang Yu (35):
|
||
|
||
- aco: do not fix_exports when separately compiled ngg vs or es
|
||
- aco: add create_end_for_merged_shader
|
||
- aco: extend max operands in a instruction to 128
|
||
- aco: move end program handling to select_shader
|
||
- aco: stop emit s_endpgm for first stage of merged shader
|
||
- aco: add aco_is_gpu_supported
|
||
- radeonsi: add vs prolog args needed by aco ls vgpr fix
|
||
- radeonsi: fill aco shader info for part mode merged shader
|
||
- radeonsi: enable aco compilation for merged shader parts
|
||
- radeonsi: move use_aco to si_screen
|
||
- radeonsi: move llvm compiler alloc/free into create/destroy funcntion
|
||
- radeonsi: stop llvm context creation when use aco
|
||
- radeonsi: move llvm internal header to si_shader_llvm.h
|
||
- radeonsi: selectively build si llvm compiler create/destroy
|
||
- radeonsi: selectively build llvm compile
|
||
- radeonsi: set use_aco when no llvm available
|
||
- radeonsi: include ac_llvm_util.h when llvm available
|
||
- radeonsi: disk cache remove llvm dependancy when use aco
|
||
- radeonsi: does not call llvm init when no llvm available
|
||
- radeonsi: change compiler name for aco
|
||
- radeonsi: selectively build llvm files
|
||
- meson: be able to build radeonsi without llvm
|
||
- radeonsi: fix piglit image coherency test when use aco
|
||
- aco,radv: add aco_is_nir_op_support_packed_math_16bit
|
||
- radeonsi: only vectorize nir ops that aco support
|
||
- ac/llvm: remove nir_op_*2*mp ops handling
|
||
- nir: add force_f2f16_rtz option to lower f2f16 to f2f16_rtz
|
||
- aco,ac/llvm,radeonsi: lower f2f16 to f2f16_rtz in nir
|
||
- aco: set MIMG unrm for GL_TEXTURE_RECTANGLE
|
||
- aco: handle GL_TEXTURE_RECTANGLE in tg4_integer_workarounds
|
||
- radeonsi: add missing args in spi_ps_input_ena when fbfetch output
|
||
- nir: fix load layer id system_values_read info gather
|
||
- aco: fix set_wqm segfault when ps prolog
|
||
- radeonsi: fix legacy merged LS/ES workgroup size for aco compilation
|
||
- radeonsi: unify elf and raw shader binary upload
|
||
|
||
Raphaël Gallais-Pou (1):
|
||
|
||
- gallium: add sti DRM entry point
|
||
|
||
Rhys Perry (55):
|
||
|
||
- nir: add helpers to skip idempotent passes
|
||
- radv: use NIR_LOOP_PASS helpers
|
||
- aco: add VALU/SALU/VMEM/SMEM statistics
|
||
- aco: collect Pre-Sched SGPRs/VGPRs before spilling
|
||
- radv: call lower_array_deref_of_vec before lower_io_arrays_to_elements
|
||
- radv: skip radv_remove_varyings for mesh shaders
|
||
- radv: disable gs_fast_launch=2 by default
|
||
- aco/tests: fix tests with LLVM 17
|
||
- aco/tests: fix tests with LLVM 18
|
||
- aco: workaround LS VGPR initialization bug in RADV prologs
|
||
- aco: skip LS VGPR initialization bug workaround if the prolog exists
|
||
- radv: set prolog as_ls if has_ls_vgpr_init_bug=true
|
||
- docs: fix RADV_THREAD_TRACE_CACHE_COUNTERS default
|
||
- nir/lower_fp16_casts: correctly round RTNE f64->f16 casts
|
||
- nir/lower_fp16_casts: add option to split fp64 casts
|
||
- radeonsi: use nir_lower_fp16_casts
|
||
- radv: use nir_lower_fp16_casts
|
||
- aco: remove f16<->f64 conversions
|
||
- intel/compiler: use nir_lower_fp16_casts
|
||
- radv: add radv_disable_trunc_coord option
|
||
- radv: enable radv_disable_trunc_coord for vkd3d-proton/DXVK
|
||
- ac/gpu_info: update conformant_trunc_coord comment
|
||
- ac/nir: fix partial mesh shader output writes on GFX11
|
||
- ac/nir: ignore 8/16-bit global access offset
|
||
- ac/nir: fix 32-bit offset global access optimization
|
||
- aco: flush denormals for 16-bit fmin/fmax on GFX8
|
||
- aco: implement 16-bit fsign on GFX8
|
||
- aco: implement 16-bit derivatives
|
||
- aco: implement 16-bit fsat on GFX8
|
||
- aco: simplify v_mul_* labelling slightly
|
||
- aco: insert p_end_wqm before p_jump_to_epilog
|
||
- nir/loop_analyze: skip if basis/limit/comparison is vector
|
||
- nir/loop_analyze: scalarize try_eval_const_alu
|
||
- nir/loop_analyze: fix vector basis/limit/comparison
|
||
- nir/loop_analyze: check min compatibility with comparison
|
||
- nir/loop_analyze: support umin and {u,i,f}max
|
||
- nir/loop_analyze: support loops with min/max and non-add incrementation
|
||
- vulkan/wsi: don't support present with queues where blit is unsupported
|
||
- vulkan/wsi: fix win32 compilation
|
||
- vulkan/wsi: always create command buffer for special blit queues
|
||
- nir/loop_analyze: remove invariance analysis
|
||
- aco/tests: use more raw strings
|
||
- aco: correctly set min/max_subgroup_size for wave32-as-wave64
|
||
- radv: use CS wave selection for task shaders
|
||
- radv: remove radv_shader_info's cs.subgroup_size
|
||
- nir: add msad_4x8
|
||
- nir/algebraic: optimize vkd3d-proton's MSAD
|
||
- aco: implement msad_4x8
|
||
- ac/llvm: implement msad_4x8
|
||
- radv: enable msad_4x8
|
||
- nir: remove sad_u8x4
|
||
- radv: do nir_shader_gather_info after radv_nir_lower_rt_abi
|
||
- nir/lower_non_uniform: set non_uniform=false when lowering is not needed
|
||
- nir/lower_shader_calls: remove CF before nir_opt_if
|
||
- aco: fix labelling of s_not with constant
|
||
|
||
Rob Clark (34):
|
||
|
||
- ci: Only strip debug symbols
|
||
- tu/msm: Fix timeline semaphore support
|
||
- tu/virtio: Fix timeline semaphore support
|
||
- freedreno/drm: Fix race in zombie import
|
||
- freedreno: Fix modifier determination
|
||
- freedreno: Handle DRM_FORMAT_MOD_QCOM_TILED3 import
|
||
- virtio/drm: Split out common virtgpu drm structs
|
||
- freedreno/drm: Simplify backend mmap impl
|
||
- virtio: Add vdrm native-context helper
|
||
- freedreno/drm/virtio: Switch to vdrm helper
|
||
- tu/drm/virtio: Switch to vdrm helper
|
||
- freedreno/a6xx: Assume MOD_INVALID imports are linear
|
||
- freedreno/a6xx: Fix antichamber trace replay assert
|
||
- Revert "ci/freedreno: disable antichambers trace"
|
||
- freedreno/a6xx: Don't set patch_vertices if no tess
|
||
- freedreno/a6xx: Rework wave input size
|
||
- freedreno/drm: Fix mmap leak
|
||
- freedreno: Always attach bo to submit
|
||
- isaspec: Sort labels with same output
|
||
- freedreno/drm: Fix zombie BO import harder
|
||
- freedreno/a6xx: Fix NV12+UBWC import
|
||
- freedreno: De-duplicate 19.2MHz RBBM tick conversion
|
||
- freedreno: Fix timestamp conversion
|
||
- freedreno: Implement PIPE_CAP_TIMER_RESOLUTION
|
||
- drm-uapi: Sync drm-uapi
|
||
- freedreno/layout: Add layout metadata
|
||
- tu: Add metadata support for dedicated allocations
|
||
- freedreno/drm: Add BO metadata support
|
||
- freedreno: Add layout metadata support
|
||
- ci: More context for color_clear skips for Wayland
|
||
- ci: List specific color_clears skips
|
||
- ci: Add wayland-dEQP-EGL.functional.render.* skips
|
||
- ci: Remove per-driver wayland-dEQP-EGL xfails
|
||
- freedreno/drm/virtio: Fix typo
|
||
|
||
Robert Foss (3):
|
||
|
||
- egl/surfaceless: Fix EGL_DEVICE_EXT implementation
|
||
- egl: Add _eglHasAttrib() function
|
||
- egl/surfaceless: Don't overwrire disp->Device if using EGL_DEVICE_EXT
|
||
|
||
Robert Mader (4):
|
||
|
||
- util: Add new helpers for pipe resources
|
||
- panfrost: Support parameter queries for main planes
|
||
- vc4/resource: Support offset query for multi-planar planes
|
||
- v3d/resource: Support offset query for multi-planar planes
|
||
|
||
Rohan Garg (31):
|
||
|
||
- intel/compiler: migrate WA 14013672992 to use WA framework
|
||
- blorp,anv,iris: refactor blorp functions into something more generic
|
||
- iris: Wa 16014538804 for DG2, MTL A0
|
||
- iris: pull WA 22014412737 into emit_3dprimitive_was
|
||
- anv: WA 16014538804 for DG2, MTL A0
|
||
- blorp: WA 16014538804 for DG2, MTL A0
|
||
- anv: Refactor loading indirect parameters and filling IDD
|
||
- anv: refactor kernel dispatch to use new common functions
|
||
- intel/dev: Add a bit for when the HW can do a indirect draw/dispatch unroll
|
||
- genxml/12.5: Add the EXECUTE_INDIRECT_DRAW instruction
|
||
- genxml/12.5: Add the EXECUTE_INDIRECT_DISPATCH instruction
|
||
- anv: Emit EXECUTE_INDIRECT_DRAW when available
|
||
- anv: Emit a EXECUTE_INDIRECT_DISPATCH when available
|
||
- iris: Emit a EXECUTE_INDIRECT_DISPATCH when available
|
||
- anv: memcpy the thread dimentions only when they're on the CPU
|
||
- anv: introduce ANV_TIMESTAMP_REWRITE_INDIRECT_DISPATCH
|
||
- intel/genxml: Add the preferred slm size enum for xe2
|
||
- intel: Set a preferred SLM size for LNL
|
||
- intel/genxml: Update COMPUTE_WALKER_BODY for xe2
|
||
- intel/genxml: Update IDD for new fields
|
||
- blorp: set min/max viewport depths to -FLT_MAX/FLT_MAX when EXT_depth_range_unrestricted is enabled
|
||
- anv: ensure that we clamp only when EXT_depth_range_unrestricted is not enabled
|
||
- anv: enable VK_EXT_depth_range_unrestricted
|
||
- iris: Emit EXECUTE_INDIRECT_DRAW when available
|
||
- intel/compiler: use the proper enum type to store the op
|
||
- intel/compiler: infer the number of operands using lsc_op_num_data_values
|
||
- anv: rename anv_create_companion_rcs_command_buffer to anv_cmd_buffer_ensure_rcs_companion
|
||
- iris,isl: Adjust driver for several commands of clear color (xe2)
|
||
- intel/fs/xe2+: Lift CPS dispatch width restrictions on Xe2+.
|
||
- intel/compiler: Update disassembly for new LSC cache enums
|
||
- anv: untyped data port flush required when a pipeline sets the VK_ACCESS_2_SHADER_STORAGE_READ_BIT
|
||
|
||
Roland Scheidegger (1):
|
||
|
||
- lavapipe: bump image alignment up to 64 bytes
|
||
|
||
Roman Stratiienko (5):
|
||
|
||
- v3d: Don't implicitly clear the content of the imported buffer
|
||
- u_gralloc: Extract common code from fallback gralloc
|
||
- u_gralloc: Add QCOM gralloc support
|
||
- egl/android: Switch to generic buffer-info code
|
||
- u_gralloc: Add support for gbm_gralloc
|
||
|
||
Ruijing Dong (12):
|
||
|
||
- radeonsi/vcn: vcn4 encoding interface dummy update
|
||
- radeonsi/vcn: preparation for enc intra-refresh
|
||
- radeonsi/vcn: change intra-ref name
|
||
- radonesi/vcn: enable intra-refresh in vcn encoders
|
||
- frontends/va: add intra-refresh in VAAPI interface
|
||
- radesonsi/vcn add qp_map definition
|
||
- frontends/va: add ROI feature
|
||
- radeonsi/vcn: ROI feature implementation
|
||
- radeonsi/vcn: enable ROI feature in vcn.
|
||
- radeonsi/vcn: ROI capability value initialization.
|
||
- frontends/va: remove some TODOs in hevc encoding
|
||
- radeonsi/vcn: update session_info from vcn3 and up.
|
||
|
||
Ryan Neph (6):
|
||
|
||
- virgl: implemement resource_get_param() for modifier query
|
||
- venus: add VN_PERF=no_tiled_wsi_image
|
||
- venus: strip ALIAS_BIT for WSI image creation on ANV
|
||
- venus: reject multi-plane modifiers for tiled wsi images
|
||
- venus: add dri option to enable multi-plane wsi modifiers
|
||
- venus: fix shmem leak on vn_ring_destroy
|
||
|
||
Sagar Ghuge (24):
|
||
|
||
- iris: Disable auxiliary buffer if MSRT is bound as texture
|
||
- iris: Disable CCS compression on top of MSAA compression on ACM
|
||
- isl: Enable MCS compression on ACM platform
|
||
- anv: Write timestamp using MI_FLUSH_DW on blitter
|
||
- anv: Avoid emitting PIPE_CONTROL command for copy/video queue
|
||
- anv: Flush data cache while clearing depth using HIZ_CCS_WT
|
||
- anv: Add comment to copy image code block
|
||
- iris: Init aux map state for compute engine
|
||
- anv,hasvk: Use uint32_t for queue family indices
|
||
- blorp: Handle stencil buffer compression on blitter engine
|
||
- anv: Use RCS cmd buffer if blit src/dest has 3 components
|
||
- intel/compiler: Adjust assertion in lower_get_buffer_size() for Xe2
|
||
- intel/fs: Adjust destination size for image size intrinsic
|
||
- intel/fs: Adjust destination size for global load constant on Xe2+
|
||
- intel/fs: Adjust destination size for load ubo on Xe2+
|
||
- intel/genxml: Add BCS/VD0 aux table base address register
|
||
- anv: Handle video/copy engine queue initialization
|
||
- anv: Invalidate aux map for copy/video engine
|
||
- iris: Handle aux map init for copy engine
|
||
- docs: Document INTEL_COPY_CLASS
|
||
- anv: Enable blitter engine unconditionally on ACM+
|
||
- iris: No need to emit PIPELINE_SELECT on Xe2+
|
||
- anv: No need to emit PIPELINE_SELECT on Xe2+
|
||
- intel/fs: Check fs_visitor instance before using it
|
||
|
||
Samuel Pitoiset (169):
|
||
|
||
- radv: move RADV_DEBUG_NO_HIZ check in radv_use_htile_for_image()
|
||
- radv: implement VK_EXT_image_compression_control
|
||
- radv: advertise VK_EXT_image_compression_control
|
||
- ac/gpu_info: remove bogus assertion about number of COMPUTE/SDMA queues
|
||
- radv: dump the pipeline hash to the gpu hang report
|
||
- radv: fix a synchronization issue with primitives generated query on RDNA1-2
|
||
- ac/registers: allow to parse GCVM_L2_PROTECTION_FAULT_STATUS
|
||
- ac/debug: add a helper to print GPUVM fault protection status
|
||
- radv: use the GPUVM fault protection status helper
|
||
- radv: remove NGG streamout support for RDNA1-2
|
||
- radv: remove unnecessary VS_PARTIAL_FLUSH for NGG streamout
|
||
- ac/nir: remove dead code in nir_intrinsic_xfb_counter_{add,sub}_amd
|
||
- aco: remove dead code in nir_intrinsic_xfb_counter_{add,sub}_amd
|
||
- radv/ci: update list of expected failures/flakes for NAVI31
|
||
- radv: add RADV_DEBUG=nomeshshader
|
||
- radv/ci: enable RADV_DEBUG=nomeshshader for vkcts-navi31-valve
|
||
- radv: bind the non-dynamic graphics state from the pipeline unconditionally
|
||
- radv: adjust binning settings to improve performance on GFX9
|
||
- radv: fix compute shader invocations query on compute queue on GFX6
|
||
- radv: emit COMPUTE_PIPELINESTAT_ENABLE for CS invocations on ACE
|
||
- ci: backport two mesh/task query fixes for VKCTS
|
||
- radv/ci: document one more flake test
|
||
- nir: fix inserting the break instruction for partial loop unrolling
|
||
- radv: add initial VK_EXT_device_fault support
|
||
- radv: advertise VK_EXT_device_fault
|
||
- ci: re-apply two mesh/task query fixes for VKCTS
|
||
- radv: add a helper to determine if it's possible to preprocess DGC
|
||
- radv: emit individual SET_SH_REG for inlined push constants with DGC
|
||
- radv: optimize emitting inlined push constants with DGC
|
||
- radv: enable DGC preprocessing when all push constants are inlined
|
||
- radv: restore sampling CPU/GPU clocks before starting SQTT trace
|
||
- ac/rgp: update dumping queue event records to the capture
|
||
- radv: add radv_write_timestamp() helper
|
||
- radv: add support for RGP queue events
|
||
- radv: add drirc options to force re-compilation of shaders when needed
|
||
- radv: fix VRS subpass attachment when HTILE can't be enabled on GFX10.3
|
||
- radv: fix registering queues for RGP with compute only
|
||
- radv: set radv_zero_vram=true for Unreal Engine 4/5
|
||
- radv: fix a descriptor leak with debug names and host base descriptor set
|
||
- radv: add a missing async compute workaround for Tonga/Iceland
|
||
- zink/ci: add a manual job on radv-navi31
|
||
- aco: remove useless nir_intrinsic_load_force_vrs_rates_amd
|
||
- radv: remove redundant check when forcing VRS rates
|
||
- radv: check earlier if a graphics pipeline can force VRS per vertex
|
||
- ac/surface: change tile mode for 3D PRT surfaces with bpp < 64 on GFX6-8
|
||
- radv: re-enable sparseResidencyImage3D on POLARIS10+
|
||
- aco: rename color_exports to exports in create_fs_jump_to_epilog()
|
||
- radv: rename ps_epilog_inputs to colors for PS epilogs
|
||
- radv: add radv_physical_device::emulate_mesh_shader_queries for GFX10.3
|
||
- radv: add support for mesh primitives queries on GFX10.3
|
||
- radv: define new pipeline statistics indices for mesh/task on GFX11
|
||
- radv: bump the pipeline state query size to 14 on GFX10.3
|
||
- radv: do not harcode the pipeline stats mask for query resolves
|
||
- radv: add support for mesh shader invocations queries on GFX10.3
|
||
- radv: rework gfx10_copy_gds_query() slightly
|
||
- radv: make some gang functions non-static
|
||
- radv: add support for task shader invocations queries on GFX10.3
|
||
- radv: enable meshShaderQueries on GFX10.3
|
||
- radv/ci: add missing expected failures for mesh queries on VANGOGH
|
||
- radv: disable TC-compatible HTILE on Tonga and Iceland
|
||
- radv: add missing FDCC_CONTROL bits for GFX1103 R2
|
||
- radv: set radv_invariant_geom=true for War Thunder
|
||
- radv: do not set OREO_MODE to fix rare corruption on GFX11
|
||
- ci: uprev vkd3d-proton to 2.11
|
||
- radv/ci: add new flakes for VEGA10
|
||
- radv: remove useless NIR instructions when emitting IBO with DGC
|
||
- radv: set the stream VA for DGC graphics
|
||
- radv: use an indirect draw when IBO isn't updated as part of DGC
|
||
- radv: enable DGC preprocessing for IBO
|
||
- radv: fix bogus interaction between DGC and RT with descriptor bindings
|
||
- radv: make sure to prefetch the compute shader for DGC
|
||
- radv: remove radv_pipeline_key::dynamic_color_write_mask
|
||
- radv: simplify creating image views for src resolve images
|
||
- radv: stop performing redundant resolves with the HW resolve path
|
||
- radv: remove unused layers support for the HW/FS resolve paths
|
||
- radv: only re-initialize DCC for one level for the HW resolve path
|
||
- radv: adjust assertions for multi-layer resolves with the HW/FS paths
|
||
- radv: remove never used binds_state for DGC
|
||
- radv: only initialize the VBO reg if VBOs are bound with DGC
|
||
- radv: only initialize the VTX base SGPR if non-zero with DGC
|
||
- radv: add DGC support for mesh shader only
|
||
- radv: advertise VK_EXT_depth_clamp_zero_one
|
||
- radv: update the reset stipple pattern mode
|
||
- radv: change the reset stipple pattern mode for adjacent lines
|
||
- radv: make sure to reset the stipple line state when it's disabled
|
||
- radv: set combinedImageSamplerDescriptorCount to 1 for multi-planar formats
|
||
- radv: switch to on-demand PS epilogs for GPL
|
||
- radv: remove unused code for compiling PS epilogs as part of pipelines
|
||
- aco: export depth/stencil/samplemask in create_fs_jump_to_epilog()
|
||
- ac/nir: add an option to skip MRTZ exports in ac_nir_lower_ps()
|
||
- radv: determine if MRTZ needs to be exported via PS epilogs
|
||
- radv: prepare the PS epilog key for exporting MRTZ on RDNA3
|
||
- radv,aco: declare PS epilog VGPR arguments for depth/stencil/samplemask
|
||
- radv: determine and emit SPI_SHADER_Z_FORMAT for PS epilogs
|
||
- zink/ci: remove skipped tests from the list of expected failures for NAVI31
|
||
- radv: export MRTZ via PS epilogs when alpha to coverage is dynamic on GFX11
|
||
- radv: enable extendedDynamicState3AlphaToCoverageEnable on GFX11
|
||
- zink/ci: skip more tests that run OOM on NAVI31
|
||
- zink/ci: update list of failures for NAVI31
|
||
- zink/ci: stop running zink-radv-navi31-valve sequentially
|
||
- ci: uprev vkd3d-proton to a0ccc383937903f4ca0997ce53e41ccce7f2f2ec
|
||
- radv: simplify disabling MRT compaction for PS epilogs
|
||
- vulkan: bump headers/registry to 1.3.273
|
||
- radv: promote EXT_calibrated_timestamps to KHR
|
||
- docs: update features.txt for RADV
|
||
- radv: remove useless check for TC-compat CMASK images during fb emission
|
||
- radv: stop clearing FMASK_COMPRESS_1FRAG_ONLY for TC-compat CMASK images
|
||
- vulkan/runtime: promote VK_EXT_vertex_attribute_divisor to KHR
|
||
- radv: advertise VK_KHR_vertex_attribute_divisor
|
||
- radv/ci: remove dEQP-VK.mesh_shader.ext.query.* from the lists
|
||
- radv: emit the task shader in radv_emit_graphics_pipeline()
|
||
- radv: cleanup ac_nir_lower_ps options
|
||
- radv: cleanup gathering PS info with/without PS epilogs
|
||
- radv: cleanup radv_pipeline_generate_ps_epilog_key()
|
||
- radv: add support for MRT compaction with PS epilogs
|
||
- radv: fix binding partial depth/stencil views with dynamic rendering
|
||
- radv: stop asserting some image create info fields
|
||
- radv: remove some declared but unused functions/macros
|
||
- radv: add missing HTILE support for fb mip tail workaround
|
||
- radv: stop checking FMASK for the fb mip tail workaround
|
||
- radv: move emitting the fb mip tail workaround when rendering begins
|
||
- radv: remove radv_get_tess_output_topology() declaration
|
||
- radv: move meta declarations to radv_meta.h
|
||
- radv: move RADV_HASH_SHADER_xxx flags to radv_pipeline.c
|
||
- radv: move radv_image_is_renderable() to radv_image.c
|
||
- radv: move more descriptor related declarations to radv_descriptor_set.h
|
||
- radv: move radv_depth_clamp_mode to radv_cmd_buffer.c
|
||
- radv: move more shader related declarations to radv_shader.h
|
||
- radv: move SI_GS_PER_ES to radv_constants.h
|
||
- radv: move buffer view related code to radv_buffer_view.c
|
||
- radv: move image view related code to radv_image_view.c
|
||
- vulkan: bump headers/registry to 1.3.274
|
||
- vulkan: drop VK_ENABLE_BETA_EXTENSIONS for video encode layouts
|
||
- radv/ci: update CI lists for NAVI10,NAVI31 and RENOIR
|
||
- ci: apply two bugfixes for VKCTS
|
||
- radv: move radv_{emulate,enable}_rt() to radv_physical_device.c
|
||
- radv: make a couple of NIR RT functions as static
|
||
- radv: move radv_rt_{common,shader} files to nir/
|
||
- radv: move radv_BindImageMemory2() to radv_image.c
|
||
- radv: add support for VkBindMemoryStatusKHR
|
||
- radv: rename RADV_GRAPHICS_STAGES to RADV_GRAPHICS_STAGE_BITS
|
||
- radv: add support for version 2 of all descriptor binding commands
|
||
- radv: add support for NULL index buffer
|
||
- radv: advertise VK_KHR_maintenance6
|
||
- radv: disable FMASK for MSAA images with layers on GFX9
|
||
- radv: stop clearing CMASK to 0xcc when FMASK is present on GFX9
|
||
- radv: disable stencil test without a stencil attachment
|
||
- radv: constify a variable in radv_emit_depth_control()
|
||
- radv: remove duplicated si_tile_mode_index() function
|
||
- radv: rename si_make_texture_descriptor() to gfx6_make_texture_descriptor()
|
||
- radv: remove radv_write_scissors()
|
||
- radv: drop si\_ prefix from all functions
|
||
- Revert "radv: disable DCC with signedness reinterpretation on GFX11"
|
||
- radv: stop disabling DCC for mutable with 0 formats on GFX11
|
||
- radv: do not program COMPUTE_MAX_WAVE_ID (GDS register) on GFX6
|
||
- radv/winsys: replace '<= GFX6' by '== GFX6'
|
||
- radv: query drirc options in only one place
|
||
- radv: move dri options to radv_instance::drirc
|
||
- radv: rework declaring color arguments for PS epilogs
|
||
- Revert "radv/rt: Lower ray payloads to registers"
|
||
- radv: do not issue SQTT marker with DISPATCH_MESH_INDIRECT_MULTI
|
||
- radv: add missing disable_shrink_image_store to the pipeline key
|
||
- radv: move RADV_HASH_SHADER_KEEP_STATISTICS to radv_pipeline_key
|
||
- radv: initialize radv_device::disable_trunc_coord earlier
|
||
- radv: introduce radv_device_cache_key for per-device cache compiler options
|
||
- radv: move all per-device keys from radv_pipeline_key to radv_device_cache_key
|
||
- radv: fix indirect dispatches on the compute queue on GFX7
|
||
- radv: fix indirect draws with NULL index buffer on GFX10
|
||
- radv: fix segfault when getting device vm fault info
|
||
|
||
Sarah Walker (3):
|
||
|
||
- pvr: Update AM62 DSS compatible string to match upstream
|
||
- pvr: csbgen: Add dummy implementation of stream type
|
||
- pvr: Add command stream and static context state layout to rogue_kmd_stream.xml
|
||
|
||
Sathishkumar S (1):
|
||
|
||
- frontends/va: use va interface for jpeg partial decode
|
||
|
||
Sebastian Wick (1):
|
||
|
||
- radeonsi: Destroy queues before the aux contexts
|
||
|
||
Sergi Blanch Torne (8):
|
||
|
||
- ci: disable Collabora's LAVA lab for maintance
|
||
- Revert "ci: disable Collabora's LAVA lab for maintance"
|
||
- ci: disable Collabora's LAVA lab for maintance
|
||
- Revert "ci: disable Collabora's LAVA lab for maintance"
|
||
- Revert "ci: disable collabora farm as it is currently offline"
|
||
- ci: disable Collabora's LAVA lab for maintance
|
||
- Revert "ac/nir: Export clip distances according to clip_cull_mask"
|
||
- Revert "ci: disable Collabora's LAVA lab for maintance"
|
||
|
||
Shuicheng Lin (1):
|
||
|
||
- intel/xe: Correct DRM_XE_EXEC_QUEUE_SET_PROPERTY's ioctl
|
||
|
||
Sil Vilerino (76):
|
||
|
||
- d3d12: d3d12_video_buffer_create_impl - Fix resource importing
|
||
- d3d12: Allow creating d3d12_dxcore_screen from existing ID3D12Device
|
||
- vl/win32: Add vl_win32_screen_create_from_d3d12_device
|
||
- gallium/auxiliary: Fix pb_bufmgr_slab.c leak
|
||
- pipe: Extend get_feedback with additional metadata
|
||
- pipe: Add PIPE_VIDEO_CAP_ENC_H264_DISABLE_DBK_FILTER_MODES_SUPPORTED
|
||
- pipe: Add PIPE_VIDEO_CAP_ENC_INTRA_REFRESH_MAX_DURATION
|
||
- pipe: Add H264 VUI encode params
|
||
- pipe: Add HEVC VUI encode params
|
||
- pipe: Add max_slice_bytes for H264, HEVC encoding
|
||
- frontend/va: Add log2_max_frame_num_minus4 and log2_max_pic_order_cnt_lsb_minus4 for h264enc
|
||
- frontend/va: Parse VUI H264 parameters
|
||
- frontend/va: Parse VUI HEVC parameters
|
||
- frontend/va: Support VAEncMiscParameterMaxSliceSize
|
||
- meson: add vp9 and av1 codec support options
|
||
- gallium/vl: Check for VP9 and AV1 meson option support flags
|
||
- d3d12: Plumb pipe_h264_enc_picture_desc.dbk.disable_deblocking_filter_idc
|
||
- d3d12: Use log2_max_frame_num_minus4 and log2_max_pic_order_cnt_lsb_minus4 from pipe_pic_params_h264
|
||
- d3d12: Video Encode - Remove PIPE_VIDEO_PROFILE_MPEG4_AVC_BASELINE as not supported
|
||
- d3d12: Disable codecs according to meson video-codecs option
|
||
- d3d12: Implement H264 VUI Writer
|
||
- d3d12: Implement HEVC VUI Writer
|
||
- d3d12: Implement Intra Refresh for H264, HEVC, AV1
|
||
- d3d12: Support PIPE_VIDEO_CAP_ENC_H264_DISABLE_DBK_FILTER_MODES_SUPPORTED
|
||
- d3d12: Implement get_feedback with additional metadata
|
||
- d3d12: fix usage of GetAdapterLuid() in mingw/GCC using ABI helper
|
||
- ci: Build d3d12 gallium driver in debian-x86_32
|
||
- pipe: Support inserting new headers on each H264/HEVC IDR frame
|
||
- pipe: Add get_feedback_fence for encode async waiting on pipe_feedback_fence
|
||
- pipe: Add fence_get_win32_handle to get HANDLE from pipe_fence_handle
|
||
- pipe: Add p_video_codec.get_encode_headers for out of band VPS, SPS, PPS
|
||
- pipe: Add PIPE_VIDEO_FEEDBACK_METADATA_TYPE_AVERAGE_FRAME_QP
|
||
- pipe: Add PIPE_VIDEO_CAP_ENC_H264_SUPPORTS_CABAC_ENCODE
|
||
- pipe: Add PIPE_H264_MAX_REFERENCES
|
||
- frontend/va: Add h264 encode ip_period param
|
||
- frontend/va: Add VACodedBufferSegment Average QP metadata
|
||
- frontend/va: Use p_video_codec.get_feedback_fence to report errors on frame submission
|
||
- vl_winsys_win32: call winsys->destroy(winsys) in error conditions
|
||
- d3d12: Implement inserting optional new headers on each H264/HEVC IDR frame
|
||
- d3d12: Do not increase active_seq_parameter_set_id on new SPS. Force PPS on new SPS
|
||
- d3d12: H264 encode - Allow CONSTRAINED_BASELINE profile to be written in headers
|
||
- d3d12: Implement get_feedback_fence for encode async waiting on pipe_feedback_fence
|
||
- d3d12: Implement fence_get_win32_handle to get HANDLE from d3d12_fence
|
||
- d3d12: Only pass texture dimensions to d3d12_video_encoder_update_current_encoder_config_state
|
||
- d3d12: Implement d3d12_video_encoder_get_encode_headers for out of band VPS, SPS, PPS
|
||
- d3d12: Use new pipe h264 encode ip_period param
|
||
- d3d12: max_frame_poc workaround for infinite GOPs
|
||
- d3d12: Fix max slice size and max frame size metadata reporting
|
||
- d3d12: Implement PIPE_VIDEO_FEEDBACK_METADATA_TYPE_AVERAGE_FRAME_QP
|
||
- d3d12: Autodetect d3d12_video_buffer imported handle/resource format and dimensions when not passed
|
||
- d3d12: Implement PIPE_VIDEO_CAP_ENC_H264_SUPPORTS_CABAC_ENCODE
|
||
- d3d12: Detect imported resource buffer unknown format
|
||
- d3d12: Improve error detection and reporting for video encoder
|
||
- d3d12: Fix d3d12_tcs_variant_cache_destroy leak in d3d12_context
|
||
- d3d12: Fix screen->winsys leak in d3d12_screen
|
||
- d3d12: d3d12_create_fence_win32 - Fix double refcount bump
|
||
- d3d12: Fix max reference frames reporting when HW does not support B frame
|
||
- d3d12: Video Encoder - When setting rate control dirty flags take into account rolled back optional configs
|
||
- d3d12: Video Encoder: Support reporting non contiguous NALU, offsets for frontend extraction
|
||
- meson: Add all, all_free (default) options for video-codecs option.
|
||
- d3d12: Fix usage of H264/HEVC specific classes when VIDEO_CODEC_H26XENC not set
|
||
- d3d12: Fix AV1 video encode 32 bits build
|
||
- d3d12: Fix typos in d3d12_video_encoder_bitstream_builder_h264
|
||
- d3d12: Use enc_constraint_set_flags for H264 NALU writing
|
||
- frontends/va: Parse enc_constraint_set_flags from packed SPS
|
||
- d3d12: Check video encode codec cap before checking encode profile/level cap
|
||
- meson: Only build WGL for Windows platform when opengl option is active
|
||
- d3d12: Bump directx-headers dependency to v611.0 for latest video codecs and features
|
||
- d3d12: Remove D3D12_SDK_VERSION checks after bumping directx-headers dependency to v611
|
||
- d3d12: Fix warning C4065 switch statement contains default but no case labels
|
||
- d3d12: Implement Delta QP ROI In h264, hevc and av1 video encode
|
||
- d3d12: Report support for PIPE_VIDEO_CAP_ENC_ROI for Delta QP
|
||
- Revert "d3d12: Only destroy the winsys during screen destruction, not reset"
|
||
- Revert "d3d12: Fix screen->winsys leak in d3d12_screen"
|
||
- d3d12: Fix AV1 Encode - log2 rounding for tile_info section
|
||
- d3d12: Implement cap for PIPE_VIDEO_CAP_ENC_INTRA_REFRESH
|
||
|
||
Simon Ser (3):
|
||
|
||
- egl: extract EGLDevice setup in dedicated function
|
||
- egl: move dri2_setup_device() after dri2_setup_extensions()
|
||
- egl: ensure a render node is passed to _eglFindDevice()
|
||
|
||
Simon Zeni (2):
|
||
|
||
- EGL: sync files with Khronos
|
||
- egl: implement EGL_EXT_query_reset_notification_strategy
|
||
|
||
Sviatoslav Peleshko (23):
|
||
|
||
- nir/loop_analyze: Fix inverted condition handling in iterations calculation
|
||
- anv: Fix MI_ARB_CHECK calls in generated indirect draws optimization
|
||
- nir/loop_analyze: Don't test non-positive iterations count
|
||
- intel/fs: Don't optimize DW*1 MUL if it stores value to the accumulator
|
||
- intel/compiler: Add variable to dump binaries of all compiled shaders
|
||
- intel/disasm: Print half-float values instead of placeholder
|
||
- intel/compiler: Set flag reg to 0 when disabling predication
|
||
- intel/disasm: Print src1_len correctly depending on ExDesc type
|
||
- intel/fs: Set group 0 for Wa_14010017096 MOV instruction
|
||
- intel/eu/validate: Validate that the ExecSize is a factor of chosen ChanOff
|
||
- intel/tools/i965_asm: Add SWSB handling
|
||
- intel/tools/i965_asm: Handle HF immediates
|
||
- intel/tools/i965_asm: Handle sync instruction
|
||
- intel/tools/i965_asm: Allow neg and abs modifiers on accumulator register
|
||
- intel/tools/i965_asm: Don't override flag reg from cond modifier
|
||
- intel/tools/i965_asm: Allow src0 and src2 of ternary instructions to be imm
|
||
- intel/tools/i965_asm: Implement gfx12 and gfx12.5 send/sendc
|
||
- intel/tools/i965_asm: Add dp4a and add3 instructions
|
||
- intel/tools/i965_asm: Don't set src0 for break and while on gfx12
|
||
- intel/tools/tests: Fix sends indirect argument in gfx9 test
|
||
- intel/tools/tests: Unbreak i965_asm tests
|
||
- intel/tools/tests: Add i965_asm tests for gfx12 and gfx12.5
|
||
- nir: Use alu source components count in nir_alu_srcs_negative_equal
|
||
|
||
Sylvain Munaut (1):
|
||
|
||
- mesa/st, dri2, wgl, glx: Restore flush_objects interop backward compat
|
||
|
||
Tapani Pälli (34):
|
||
|
||
- intel/dev: provide intel_device_info_is_adln helper
|
||
- iris: add required PC for Wa_14014966230
|
||
- anv: add current_pipeline for batch_emit_pipe_control
|
||
- anv: add required PC for Wa_14014966230
|
||
- intel/dev: fix intel_device_info_is_adln check
|
||
- iris: handle tile case where cso width, height is zero
|
||
- anv: skip engine initialization if vm control not supported
|
||
- iris: add data cache flush for pre hiz op
|
||
- anv/drirc: add option to disable FCV optimization
|
||
- drirc: use fake_sparse for Armored Core 6
|
||
- drirc: Set limit_trig_input_range option for Valheim
|
||
- iris: implement Wa_18020335297
|
||
- anv: refactor state emission
|
||
- anv: implement Wa_18020335297
|
||
- iris: implement dummy blit for Wa_16018063123
|
||
- anv: implement dummy blit for Wa_16018063123
|
||
- mesa: lower EXT_render_snorm version requirement
|
||
- anv: use slow clear for small surfaces with Wa_18020603990
|
||
- iris: use slow clear for small surfaces with Wa_18020603990
|
||
- anv/hasvk/drirc: change anv_assume_full_subgroups to have subgroup size
|
||
- drirc: setup anv_assume_full_subgroups=16 for UnrealEngine5.1
|
||
- anv: cleanup, use intel_needs_workaround instead of is_dg2
|
||
- iris: cleanup, use intel_needs_workaround instead of is_dg2
|
||
- iris: use intel_needs_workaround with 14015055625
|
||
- mesa: fix enum support for EXT_clip_cull_distance
|
||
- drirc/anv: disable FCV optimization for Baldur's Gate 3
|
||
- isl: implement Wa_14018471104
|
||
- iris: use workaround framework for Wa_22018402687
|
||
- anv: use workaround framework for Wa_22018402687
|
||
- anv: check for wa 16013994831 in emit_so_memcpy_end
|
||
- iris: expand pre-hiz data cache flush to gfx >= 125
|
||
- anv: expand pre-hiz data cache flush to gfx >= 125
|
||
- iris: replace constant cache invalidate with hdc flush
|
||
- anv: move \*bits_for_access_flags to genX_cmd_buffer
|
||
|
||
Tatsuyuki Ishi (25):
|
||
|
||
- fast_urem_by_const: #ifdef DEBUG an assertion.
|
||
- radv: Fix mis-sizing of pipeline_flags in radv_hash_rt_shaders.
|
||
- radv: Use sizeof(flags) instead of hardcoded size in radv_hash_shaders.
|
||
- aco: Replace aco_vs_input_state.divisors with bitfields.
|
||
- radv: Remove last VS prolog reuse logic.
|
||
- radv, aco: Rework VS prolog key handling.
|
||
- radv, aco: Inline struct aco_vs_input_state.
|
||
- radv: Pre-mask misaligned_mask for VS prolog.
|
||
- radv: Implement helpers for shader part caching.
|
||
- radv: Use shader part caching helpers for VS prolog and PS/TCS epilog.
|
||
- zink: Fix missing sparse buffer bind synchronization.
|
||
- zink: Defer freeing sparse backing buffers.
|
||
- zink: Fix waiting for texture commit semaphores.
|
||
- zink: Remove now unused dead_framebuffers.
|
||
- radv: Remove aspect mask "expansion" for copy_image.
|
||
- radv: Add workaround to allow sparse binding on gfx queues.
|
||
- radv: Enable radv_legacy_sparse_binding for DOOM Eternal.
|
||
- radv/amdgpu: Remove virtual bo dump logic.
|
||
- radv/amdgpu: Separate the concept of residency from use_global_list.
|
||
- radv: Simplify shader config assignment.
|
||
- radv: Move up radv_get_max_waves, radv_get_max_scratch_waves.
|
||
- radv: Precompute shader max_waves.
|
||
- radv: Add layer to skip UnmapMemory for Quantic Dream Engine
|
||
- radv: Recompute max_waves after postprocessing RT config
|
||
- radv: never set DISABLE_WR_CONFIRM for CP DMA clears and copies
|
||
|
||
Tele42 (1):
|
||
|
||
- drirc: enable \`vk_wsi_force_swapchain_to_current_extent` for "The Talos Principle VR"
|
||
|
||
Teng, Jin Chung (1):
|
||
|
||
- d3d12: Decode - Adding more supported resolution
|
||
|
||
Thomas Devoogdt (1):
|
||
|
||
- util: os_same_file_description: fix unknown linux < 3.5 syscall SYS_kcmp
|
||
|
||
Thomas H.P. Andersen (13):
|
||
|
||
- docs: update nvk extensions
|
||
- nvk: use nvk_pipeline_zalloc
|
||
- nouveau: drop unused #includes of tgsi_parse.h
|
||
- nvk: VK_EXT_color_write_enable
|
||
- docs: update features.txt for nvk
|
||
- nvk: loop over stages in MESA order
|
||
- nvk: add hashing for shaders
|
||
- nvk: allocatable nvk_shaders
|
||
- nvk: pipeline shader cache
|
||
- nvk: VK_EXT_pipeline_creation_feedback
|
||
- nvk: VK_EXT_pipeline_creation_cache_control
|
||
- nvk: VK_EXT_shader_module_identifier
|
||
- docs: update features.txt for nvk
|
||
|
||
Thong Thai (1):
|
||
|
||
- radeonsi/vcn: remove EFC support for renoir
|
||
|
||
Timothy Arceri (24):
|
||
|
||
- nir: move build_write_masked_stores() to nir builder
|
||
- glsl/nir: implement a nir based lower distance pass
|
||
- glsl: switch to NIR distance lowering pass
|
||
- glsl: remove now unused lower distance pass
|
||
- nir: simplify nir_build_write_masked_store()
|
||
- glsl: drop ir_binop_ubo_load
|
||
- glsl: add nir based lower_named_interface_blocks()
|
||
- glsl: use the nir based lower_named_interface_blocks()
|
||
- glsl: remove GLSL IR lower_named_interface_blocks()
|
||
- nir: add nir_fixup_deref_types()
|
||
- glsl: support glsl linking in nir block linker
|
||
- glsl: use new nir based block linker
|
||
- glsl: remove now unused GLSL IR block linker
|
||
- glsl/st: move has_half_float_packing flag to consts struct
|
||
- glsl/st: move remaining glsl ir lowering to linker
|
||
- mesa/st: drop additional validate_ir_tree() call
|
||
- glsl: combine shader stage loops in linker
|
||
- radeonsi: fix divide by zero in si_get_small_prim_cull_info()
|
||
- glsl: tidy up validation loop in linker
|
||
- glsl: remove some unused linker code
|
||
- glsl: copy precision val of function output params
|
||
- glsl: add additional lower mediump test
|
||
- glsl: move glsl ir lowering out of glsl_to_nir()
|
||
- glsl: add support for inout params to glsl_to_nir()
|
||
|
||
Timur Kristóf (32):
|
||
|
||
- radv: Remove always false tmz variables from SDMA functions.
|
||
- radv: Expose radv_get_dcc_max_uncompressed_block_size function.
|
||
- radv: Implement buffer/image copies on transfer queues.
|
||
- radv: Add temporary BO for transfer queues.
|
||
- radv: Implement workaround for unaligned buffer/image copies.
|
||
- ac: Rename SDMA max copy size macros to reflect SDMA version.
|
||
- ac: Remove CIK prefix from SDMA opcodes.
|
||
- ac: Add sdma_version enum and use it for SDMA features.
|
||
- radv: Use GPU info for determining SDMA metadata support.
|
||
- radv: Use SDMA version instead of gfx_level where possible.
|
||
- radv: disable HTILE/DCC for concurrent images with transfer queue if unsupported.
|
||
- radv: Disable DCC on exclusive images with transfer queue when SDMA doesn't support it.
|
||
- radv: Disable HTILE on exclusive images with transfer queues when SDMA doesn't support it.
|
||
- radv: Don't retile DCC on transfer queues.
|
||
- radv: Implement barriers for transfer queues.
|
||
- radv: Implement vkCmdFillBuffer on transfer queues.
|
||
- radv: Implement vkCmdWriteTimestamp2 on transfer queues.
|
||
- radv: Implement vkCmdWriteBufferMarker2AMD on transfer queues.
|
||
- radv: Implement buffer copies on transfer queues.
|
||
- radv: Implement vkCmdUpdateBuffer on transfer queues.
|
||
- radv: Move SDMA function and struct declarations to a new header.
|
||
- radv: Unify SDMA surface struct for linear and tiled images.
|
||
- radv: Refactor and simplify SDMA surface info functions.
|
||
- radv: Pass radv_sdma_surf from copy functions to SDMA.
|
||
- radv: Use SDMA surface structs for determining unaligned buffer copies.
|
||
- radv: Clean up SDMA chunked copy info struct.
|
||
- radv: Use correct plane and binding index with SDMA.
|
||
- radv: Correct binding index for transfer buffer-image copies.
|
||
- radv: Implement image copies on transfer queues.
|
||
- radv: Implement T2T scanline copy workaround.
|
||
- radv: Expose transfer queues, hidden behind a perftest flag.
|
||
- radv: Correctly select SDMA support for PRIME blit.
|
||
|
||
Vignesh Raman (5):
|
||
|
||
- ci: Add CustomLogger class and CLI tool
|
||
- ci: copy logging script to install
|
||
- ci: bare-metal: poe: Create strutured logs
|
||
- ci: bare-metal: cros-servo: Create strutured logs for a630
|
||
- ci/freedreno: add FARM variable
|
||
|
||
Vinson Lee (6):
|
||
|
||
- ac/surface/tests: Remove duplicate variable block_size_bits
|
||
- nir: Fix decomposed_prmcnt copy-paste error
|
||
- nvk: Fix tautological-overlap-compare warning
|
||
- etnaviv: Remove duplicate initializers
|
||
- ac/rgp: Fix single-bit-bitfield-constant-conversion warning
|
||
- intel/disasm: Remove duplicate variable reg_file
|
||
|
||
Violet Purcell (1):
|
||
|
||
- gallium: Fix undefined symbols in version scripts
|
||
|
||
Vitaliy Triang3l Kuzmin (13):
|
||
|
||
- r600: Move r600_create_vertex_fetch_shader to r600_shader.c
|
||
- r600: Remove Gallium dependencies in r600_isa
|
||
- r600: Replace R600_ERR with R600_ASM_ERR in shader code
|
||
- r600: Remove Gallium dependencies in r600_asm
|
||
- r600: Split r600_shader.h into common and Gallium parts
|
||
- r600/sfn: Make r600 header include paths relative
|
||
- r600/sfn: Split r600_shader_from_nir into common and Gallium parts
|
||
- r600: Fix outputs typo in print_pipe_info
|
||
- r600: Replace TGSI I/O semantics with shader_enums
|
||
- r600/sfn: Change sampler_index to texture_index in buffer txs
|
||
- r600/sfn: Remove unused sampler reference in emit_tex_lod
|
||
- nir: Don't skip lower_alu if only bit_count needs lowering
|
||
- vulkan: Fix pipeline layout allocation scope
|
||
|
||
Vlad Schiller (1):
|
||
|
||
- pvr: Fix VK_EXT_texel_buffer_alignment
|
||
|
||
VladimirTechMan (1):
|
||
|
||
- venus/android: Switch to using u_gralloc
|
||
|
||
Yiwei Zhang (57):
|
||
|
||
- venus: use common vk_image_format_to_ahb_format helper
|
||
- venus: use common vk_image_usage_to_ahb_usage helper
|
||
- venus: tiny refactor of device memory report interface
|
||
- venus: avoid modifier prop query in vn_android_get_image_builder
|
||
- venus: use common vk_image as vn_image base
|
||
- venus: use common vk_device_memory as vn_device_memory base
|
||
- venus: use common AHB management and export impl
|
||
- venus: use vk_device_memory tracked export and import handle types
|
||
- venus: use vk_device_memory tracked size
|
||
- venus: use vk_device_memory tracked memory_type_index
|
||
- venus: fix query feedback batch leak and race upon submission
|
||
- zink: apply can_do_invalid_linear_modifier to Venus
|
||
- venus: scrub msaa sample mask only with valid msaa state
|
||
- venus: fix async compute pipeline creation
|
||
- venus: properly initialize ring monitor initial alive status
|
||
- venus: add missing shmem pool fini for cs_shmem pool
|
||
- venus: reduce ring idle timeout from 50ms to 5ms
|
||
- venus: use STACK_ARRAY to prepare for indirect submission
|
||
- venus: enable renderer shmem cache dump for cache debug
|
||
- venus: add ring helper to avoid redundant ring wait requests
|
||
- venus: use instance allocator for ring allocs
|
||
- venus: use instance allocator for indirect cs storage alloc
|
||
- venus: add vn_instance_fini_ring helper
|
||
- venus: refactor instance creation failure path
|
||
- venus: move ring monitor to instance for sharing across rings
|
||
- venus: refactor to add vn_watchdog
|
||
- venus: further cleanup vn_relax_init to take instance instead of ring
|
||
- venus: always set reply command stream to avoid seek
|
||
- venus: make vn_renderer_shmem_pool thread-safe
|
||
- venus: remove command_dropped tracking
|
||
- venus: relax ring mutex
|
||
- venus: move ring shmem into vn_ring
|
||
- venus: move the rest ring belongings into ring
|
||
- venus: move ring submission into ring
|
||
- venus: move the actual ring creation into ring as well
|
||
- venus: add vn_ring_get_id and hide vn_ring internals entirely
|
||
- venus: switch to vn_ring as the protocol interface - part 1
|
||
- venus: switch to vn_ring as the protocol interface - part 2
|
||
- venus: switch to vn_ring as the protocol interface - part 3
|
||
- venus: add vn_gettid helper
|
||
- venus: dispatch background shader tasks to secondary ring
|
||
- driconfig: add a workaround for Hades (Vulkan backend)
|
||
- vulkan/wsi/wayland: ensure drm modifiers stored in chain are immutable
|
||
- venus: clang format fixes
|
||
- venus: split up the pipeline fix description into self and pnext
|
||
- venus: refactor to add pipeline info fixes helpers
|
||
- venus: properly ignore formats in VkPipelineRenderingCreateInfo
|
||
- meson/vulkan/util: allow venus to drop compiler deps
|
||
- venus: make tls hint specific to pipeline creation
|
||
- venus: TLS ring
|
||
- venus: clean up secondary ring
|
||
- venus: allow to retrieve pipeline cache on TLS ring
|
||
- venus: populate oom from ring submit alloc failures
|
||
- vulkan/wsi/wayland: fix returns and avoid leaks for failed swapchain
|
||
- venus: fix pipeline layout lifetime
|
||
- venus: fix pipeline derivatives
|
||
- venus: fix to respect the final pipeline layout
|
||
|
||
Yogesh Mohan Marimuthu (10):
|
||
|
||
- winsys/amdgpu: add _dw to max_ib_size variable for code readability
|
||
- winsys/amdgpu: remove ib_type variable from struct amdgpu_ib
|
||
- winsys/amdgpu: rename struct amdgpu_ib main variable as main_ib everywhere
|
||
- winsys/amdgpu: rename ib variable name to chunk_ib
|
||
- winsys/amdgpu: remove rcs variable from struct amdgpu_ib
|
||
- winsys/amdgpu: move 125% comment to correct line of code
|
||
- winsys/amdgpu: rename requested_size_dw to projected_size_dw
|
||
- winsys/amdgpu: rename ptr_ib_size_inside_ib to is_chained_ib
|
||
- winsys/amdgpu: rename big_ib_buffer,ib_mapped variables in struct amdgpu_ib
|
||
- winsys/radeon: remove unused gpu_address variable from struct radeon_cmdbuf
|
||
|
||
Yonggang Luo (61):
|
||
|
||
- compiler: Implement num_mesh_vertices_per_primitive to match u_vertices_per_prim
|
||
- treewide: Merge num_mesh_vertices_per_primitive and u_vertices_per_prim into mesa_vertices_per_prim
|
||
- nir: remove redundant include of gallium headers
|
||
- nir: #include "util/macros.h" for BITFIELD64_MASK in nir.c
|
||
- compiler,vulkan,drm-shim: Remove unused include directories from meson.build
|
||
- nvk: Should use alignment instead of align
|
||
- microsoft/clc: Using sampler_id instead PIPE_MAX_SHADER_SAMPLER_VIEWS for dxil_lower_sample_to_txf_for_integer_tex
|
||
- microsoft/clc: Use 128 instead of PIPE_MAX_SHADER_SAMPLER_VIEWS
|
||
- micosoft: define enum dxil_tex_wrap to avoid the usage of enum pipe_tex_wrap
|
||
- micosoft: decouple microsoft vulkan driver and compiler from gallium
|
||
- dzn: Fixes -Werror=incompatible-pointer-type
|
||
- d3d12,dzn: Simplify the usage of #include <wsl/winadapter.h>
|
||
- util: Fixes note: the alignment of ‘_Atomic long long int’ fields changed in GCC 11.
|
||
- glsl: move glsl_get_gl_type into glsl/linker_util.h
|
||
- meson/win32: There is no need install OpenGL headers on win32
|
||
- intel: Remove unused ALIGN macro
|
||
- clover: Rename function align to align_vector to avoid conflict with global align
|
||
- treewide: Avoid use align as variable, replace it with other names
|
||
- util,vulkan,mesa,compiler: Generate source files with utf8 encoding from mako template
|
||
- intel: Generate source file with utf-8 encoding from mako template
|
||
- zink: Generate source file with utf-8 encoding from mako template
|
||
- docs: Generate document with utf8 encoding
|
||
- v3dv: Use correct type VkStencilOp in function translate_stencil_op
|
||
- broadcom/compiler: Use correct type pipe_logicop for logicop_func in struct v3d_fs_key
|
||
- broadcom/compiler: remove unused blend in v3d_fs_key
|
||
- broadcom: remove unused headers include
|
||
- osmesa: Make osmesa.h compatible with Windows SDK's GL.h
|
||
- broadcom/(compiler,common): avoid include of gallium headers in header files
|
||
- broadcom/compiler: remove include of gallium headers from meson.build
|
||
- osmesa: Fixes building osmesa.c on windows
|
||
- meson: Support for both packaging and distutils
|
||
- dzn: Remove #if D3D12_SDK_VERSION blocks now that 611 is required
|
||
- ci/msvc: update flex and bison to winflexbison3
|
||
- ci/msvc: Install graphics tools(DirectX debug layer) easy to stuck, place it at the beginning
|
||
- ci/msvc: Split install vulkan sdk out of choco
|
||
- ci/msvc: Rename vs2019 to msvc
|
||
- ci/msvc: Rename vs to msvc for consistence
|
||
- ci/msvc: Improve msvc init
|
||
- ci/msvc: Remove &windows_msvc_image_tag
|
||
- ci/msvc: Upgrade to vs2022 build tools
|
||
- ci/msvc: Install msvc2019 only from vs2022
|
||
- ci/msvc: Install both msvc2019 and msvc2022
|
||
- ci/msvc: Stick deqp-runner to version v0.16.1
|
||
- ci/msvc: Stick VK-GL-CTS to specific version 56114106d860c121cd6ff0c3b926ddc50c4c11fd
|
||
- ci/msvc: Split the install of rust and d3d out of mesa_deps_test.ps1
|
||
- ci/microsoft: Update the image-tag and image-path for msvc2019/msvc2022
|
||
- treewide: Replace the include of nir_types.h with glsl_types.h
|
||
- compiler/glsl: Move glsl specific _mesa_glsl_initialize_types out and glsl_symbol_table of glsl_types.h
|
||
- intel: Avoid use align as variable, replace it with other names
|
||
- intel: Use ALIGN_POT instead of ALIGN inside macro define
|
||
- intel: Cleanup duplicate ALIGN macro defines
|
||
- intel,crocus,iris: Use align64 instead of ALIGN for 64 bit value parameter
|
||
- amd: Use align64 instead of ALIGN for 64 bit value parameter
|
||
- util,compiler: Avoid use align as variable, replace it with other names
|
||
- panfrost: Avoid use align as variable, replace it with other names
|
||
- glsl: Fixes glcpp/tests with mingw/gcc
|
||
- util: Add align_uintptr and use it treewide to replace ALIGN that works on size_t and uintptr_t
|
||
- nvk: Avoid use align as variable, replace it with alignment
|
||
- nouveau: Use align64 instead of ALIGN for 64 bit value parameter
|
||
- etnaviv/drm: Remove redundant ALIGN macro by #include "util/u_math.h"
|
||
- compiler/spirv: The spirv shader is binary, should write in binary mode
|
||
|
||
Zhang Ning (2):
|
||
|
||
- iris: use helper util_resource_at_index
|
||
- lima: Support parameter queries for PIPE_RESOURCE_PARAM_NPLANES
|
||
|
||
Zhang, Jianxun (5):
|
||
|
||
- intel/genxml: Remove 3DSTATE_CLEAR_PARAMS instruction (xe2)
|
||
- intel/genxml: update 3DSTATE_WM_HZ_OP instruction (xe2)
|
||
- intel/genxml: update 3DSTATE_DEPTH_BUFFER instruction (xe2)
|
||
- intel/isl: update 3DSTATE_STENCIL_BUFFER (xe2)
|
||
- intel/genxml: Add RENDER_SURFACE_STATE for xe2
|
||
|
||
antonino (4):
|
||
|
||
- nir: don't take the derivative of the array index in \`nir_lower_tex`
|
||
- vulkan: use instance allocator for \`object_name` in some objects
|
||
- nir/zink: drop NIH helper in favor of \`mesa_vertices_per_prim`
|
||
- egl: only check dri3 on X11
|
||
|
||
daoxianggong (1):
|
||
|
||
- zink - Fix for blend color change without blend state change
|
||
|
||
duncan.hopkins (4):
|
||
|
||
- util: Update util/libdrm.h stubs to allow loader.c to compile on MacOS.
|
||
- dri: added build dependencies for systems using non-standard prefixed X11 libs.
|
||
- glx: fix automatic zink fallback loading between hw and sw drivers on MacOS
|
||
- vulkan: added build dependencies for systems using non-standard prefixed X11 libs.
|
||
|
||
i509VCB (3):
|
||
|
||
- asahi,docs: add PBE to hardware glossary
|
||
- asahi: create queue for screen
|
||
- agx: remove internal agx_device queue
|
||
|
||
jphuang (1):
|
||
|
||
- dzn: Change dst image layout according to aspect
|
||
|
||
llyyr (1):
|
||
|
||
- docs: document AMD_DEBUG=noefc and useaco
|
||
|
||
ratatouillegamer (2):
|
||
|
||
- hasvk: Add Vulkan API version override
|
||
- hasvk: Enable hasvk override Vulkan API Version for Brawlhalla
|