KonstantinSeurer/mesa

Commit Graph

Author	SHA1	Message	Date
Eric Engestrom	f1eae2f8bb	python: drop python2 support Signed-off-by: Eric Engestrom <eric@engestrom.ch> Acked-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3674>	2021-08-14 21:44:32 +00:00
jzielins	639605e5ba	gallium/swr: Fix compilation with LLVM 12 Update to reflect changes in VectorType and ElementCount APIs Closes: #3457 Reviewed-by: Krzysztof Raszkowski <krzysztof.raszkowski@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6554>	2020-09-02 11:12:21 +02:00
Jan Zielinski	4a523baa00	gallium/swr: Fix LLVM 11 compilation issues Changes needed to adapt to LLVM API changes in vector and pointer types. Reviewed-by: Krzysztof Raszkowski <krzysztof.raszkowski@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4769>	2020-04-27 22:29:52 +00:00
Krzysztof Raszkowski	ad66b25415	gallium/swr: Fix vcvtph2ps llvm intrinsic compile error Reviewed-by: Jan Zielinski <jan.zielinski@intel.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4090> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4090>	2020-03-09 09:21:00 +00:00
Krzysztof Raszkowski	ff8265b64f	gallium/swr: Fix llvm11 compilation issues Reviewed-by: Jan Zielinski <jan.zielinski@intel.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3747> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3747>	2020-02-07 15:03:55 +00:00
Jan Zielinski	5dd9ad1570	swr/rasterizer: Better implementation of scatter Added support for avx512 scatter instruction. Non-avx512 will now call into a C function to do the scatter emulation. This has better jit compile performance than the previous approach of jitting scalar loops. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2019-07-30 13:39:19 +00:00
Alok Hota	b8adb540a0	swr/rast: Add string handling to AR event framework For use by an internal tool Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2019-05-01 15:11:30 +00:00
Alok Hota	2729d847ce	swr/rast: add more llvm intrinsics Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2019-04-30 19:48:12 +00:00
Eric Engestrom	fc82ea1350	Revert "swr/rast: Archrast codegen updates" This reverts the following commits: `71a76a47cc` "swr/codegen: fix autotools build" `7763e664ce` "meson/swr: replace hard-coded path with current_build_dir()" `773b3ceaca` "swr/rast: Fix autotools and scons codegen" `16e10b8c30` "swr/rast: Add general SWTag statistics" `b45a15a39f` "swr/rast: Add string handling to AR event framework" `8608a747aa` "swr/rast: Add initial SWTag proto definitions" `93cd9905c8` "swr/rast: Cleanup and generalize gen_archrast" The last one in this list broke all the build systems that can build this (meson, autotools & scons). See MR !304 for more details: https://gitlab.freedesktop.org/mesa/mesa/merge_requests/304 Signed-off-by: Eric Engestrom <eric.engestrom@intel.com>	2019-03-01 16:46:32 +00:00
Alok Hota	b45a15a39f	swr/rast: Add string handling to AR event framework For use by an internal tool Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2019-02-25 13:05:31 -06:00
Alok Hota	f695e43354	swr/rast: Add translation support to streamout Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2019-02-15 14:54:29 -06:00
Alok Hota	edf38019a0	swr/rast: ignore CreateElementUnorderedAtomicMemCpy This function's API changed between LLVM 5 and 6. Compile errors occur when building with LLVM 6+ if LLVM 5 was used for a dist tarball CC: <mesa-stable@lists.freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107865 Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-10-25 11:05:59 -05:00
Alok Hota	8c872ac2e3	swr/rast: fix intrinsic/function for LLVM 7 compatibility Converted from x86 VFMADDPS intrinsic to generic LLVM intrinsic, and removed createInstructionSimplifierPass, which were both removed in LLVM 7.0.0 These changes combine patches we received from the community and our own internal patches Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com> Tested-by: Chuck Atkins <chuck.atkins@kitware.com>	2018-10-25 10:32:27 -05:00
George Kyriazis	9a72d4c03e	swr/rast: Fix return type of VCVTPS2PH. expecting <8xi16> return. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-04-27 14:36:41 -05:00
George Kyriazis	99fe90722d	swr/rast: Replace x86 VMOVMSK with llvm-only implementation Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-04-18 10:51:38 -05:00
George Kyriazis	94ca1c018f	swr/rast: minimize codegen redundant work Move filtering of redundant codegen operations into gen scripts themselves Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-04-18 10:51:38 -05:00
George Kyriazis	670a99c233	swr/rast: add cvt instructions in x86 lowering pass Support generic VCVTPD2PS and VCVTPH2PS in x86 lowering pass. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-04-18 10:51:38 -05:00
George Kyriazis	81371a5909	swr/rast: Change gfx pointers to gfxptr_t Changing type to gfxptr for indices and related changes to fetch and mem builder code. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-04-18 10:51:38 -05:00
George Kyriazis	8f848ada8a	swr/rast: Start refactoring of builder/packetizer. Move x86 intrinsic lowering to a separate pass. Builder now instantiates generic intrinsics for features not supported by llvm. The separate x86 lowering pass is responsible for lowering to valid x86 for the target SIMD architecture. Currently it's a port of existing code to get it up and running quickly. Will eventually support optimized x86 for AVX, AVX2 and AVX512. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-04-18 10:51:38 -05:00
George Kyriazis	67c8bb4db7	swr/rast: Fix name mangling for LLVM pow intrinsic Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-04-18 10:51:38 -05:00
George Kyriazis	4c69823d15	swr/rast: WIP builder rewrite (2) Finish up the remaining explicit intrinsic uses. At this point all explicit Intrinsic::getDeclaration() usage has been replaced with auto generated macros generated with gen_llvm_ir_macros.py. Going forward, make sure to only use the intrinsics here, adding new ones as needed. Next step is to remove all references to x86 intrinsics to keep the builder target-independent. Any x86 lowering will be handled by a separate pass. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-04-18 10:51:38 -05:00
George Kyriazis	c2163dc56a	swr/rast: Add autogen of helper llvm intrinsics. Replace sqrt, maskload, fp min/max, cttz, ctlz with llvm equivalent. Replace AVX maskedstore intrinsic with LLVM intrinsic. Add helper llvm macros for stacksave, stackrestore, popcnt. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-04-18 10:51:38 -05:00
George Kyriazis	6427315e43	swr/rast: WIP builder rewrite. Start removing avx2 macros for functionality that exists in llvm. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-04-18 10:51:38 -05:00
George Kyriazis	d34edffe48	swr/rast: Add some instructions to jitter VPHADDD, PMAXUD, PMINUD Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-04-18 10:51:38 -05:00
George Kyriazis	1c73f42e6e	swr/rast: Consolidate TRANSLATE_ADDRESS Translate is now part of an overloaded LOAD call which required a change to the code gen to skip the load functions in order to handle them manually to make them virtual. Reviewed-By: Bruce Cherniak <bruce.cherniak@intel.com>	2018-02-28 11:42:41 -06:00
George Kyriazis	3140e714d2	swr/rast: x86 autogenerated macro work Add name argument to x86 autogenerated macros. Add useful variable names for DCL_inputVec implementation. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2018-01-19 16:52:39 -06:00
Tim Rowley	c68b2d5c79	swr/rast: Replace VPSRL with LSHR Replace use of x86 intrinsic with general llvm IR instruction. Generates the same final assembly. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-12-15 10:56:54 -06:00
Tim Rowley	01a57c11cb	swr/rast: SIMD16 Fetch - Fully widen 32-bit integer vertex components Also widen the 16-bit a 8-bit integer vertex component gathers to SIMD16. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-12-15 10:56:30 -06:00
Tim Rowley	df54678ba0	swr/rast: SIMD16 Fetch - Fully widen 32-bit float vertex components Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-12-15 10:56:03 -06:00
Tim Rowley	005d937e15	swr/rast: Implement AVX-512 GATHERPS in SIMD16 fetch shader Disabled for now. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-11-20 13:51:06 -06:00
Tim Rowley	066d1dc951	swr/rast: remove llvm fence/atomics from generated files We currently don't use these instructions, and since their API changed in llvm-5.0 having them in the autogen files broke the mesa release tarballs which ship with generated autogen files. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102847 CC: mesa-stable@lists.freedesktop.org Tested-by: Laurent Carlier <lordheavym@gmail.com> Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-09-22 11:38:57 -05:00
Tim Rowley	4475583f5e	swr/rast: Allow gather of floats from fetch shader with 2-4GB offsets Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-09-06 11:01:39 -05:00
Tim Rowley	79d92a72d5	swr: [rasterizer jitter] fix llvm-5.0.0 build bustage Add CreateAlignmentAssumptionHelper to gen_llvm_ir_macros.py ignore list. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-03-28 11:46:58 -05:00
Tim Rowley	fee3fc018b	swr: [rasterizer codegen] Refactor codegen Move common codegen functions into gen_common.py. v2: change gen_knobs.py to find the template file internally, like the rest of the gen scripts. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-03-28 11:23:04 -05:00
Tim Rowley	fe325e6423	swr: [rasterizer] Cleanup naming of codegen files All template files and generated files are prefixed with gen_. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-03-20 18:05:54 -05:00
Tim Rowley	5d0b3b05a2	swr: [rasterizer codegen] Fix generation of knobs Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-03-20 18:05:54 -05:00
Tim Rowley	2776d94545	swr: [rasterizer codegen] Rewrite gen_llvm_ir_macros.py to use mako Don't create/use cpp files, header only now. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-03-20 18:05:54 -05:00
Tim Rowley	97cbabc8fb	swr: [rasterizer scripts] Put codegen scripts into a separate directory Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-03-20 18:05:54 -05:00

38 Commits