[pull] main from llvm:main #5547

pull · 2025-02-07T01:14:21Z

See Commits and Changes for more details.

Created by pull[bot] (v2.0.0-alpha.1)

Can you help keep this open source service alive? 💖 Please sponsor : )

Fix clang-with-thin-lto-ubuntu - failed build

The current check in writeFileDefinition() is incorrect, and prevents us from ever emitting the URL from the clang-doc tool. The unit tests do test this, but call the API directly circumventing the check. This is the first step towards addressing #59814.

This follows suite with disabling float printing.

Tested call-graph matching on some of Meta's large services, it works to reuse some renamed function profiles, no negative perf or significant build speed regression observed. Turned it on by default for CSSPGO mode.

Suboptimally handled by visitInstruction: llvm.aarch64.neon. - fcvtas, fcvtau - fcvtms, fcvtmu - fcvtns, fcvtnu - fcvtps, fcvtpu - fcvtzs, fcvtzu - fcvtxn - vcvtfp2fxs, vcvtfp2fxu - vcvtfxs2fp, vcvtfxu2fp Forked from llvm/test/CodeGen/AArch64/arm64-{cvt,vcvt}.ll

Summary: This is supposed to be `__llvm_rpc_client` but I screwed it up and didn't notice at the time. Will need to be backported.

…ng new utility. (#125925) 1. Our static functions are a bit spread out in this file. I am gathering them in an anonymous namespace 2. Moving the code to get the `target` attribute on a `fir.global` into its own utility.

This patch fixes a bug in the dependency node iterators that would incorrectly not skip nodes that are not in the current DAG. This resulted in iterators returning nullptr when dereferenced. The fix is to update the existing "skip" function to not only skip non-instruction values but also to skip instructions not in the DAG.

This commit upgrades our npm dependencies to the latest available version. I was prompted to this change because `npm run package` failed for me with an error. The error disappeared after upgrading `@vscode/vsce`. I also upgraded the other dependencies because I think it's generally preferable to stay up-to-date. I did not bump the `@types/vscode` and `@types/node` versions, since this would effectively make older VS-Code versions unsupported. I also changed `@types/vscode` to be a precise version match, since we are claiming compatibility with that version via the `enginges.vscode` property.

7347870 introduced a textual header but did not update clang's module map. This PR adds the header to the module map.

…ts' (#125933) Add initial parsing/sema support for new assumption clause so clause can be specified. For now, it's ignored, just like the others. Added support for 'no_openmp_construct' to release notes. Testing - Updated appropriate LIT tests. - Testing: check-all

These tests are for frame handling code with push/pop. To increase coverage of CFI/Unwind info, this removes the `nounwind` annotations and regenerates the checks for this test.

The non-GTest library will be shared by unittests of Flang and Flang-RT. Promote it as a regular library for use by both projects. In the long term, we may want to convert these to regular GTest checks to avoid having multiple testing frameworks.

…126109) The file descriptor of the first opened file is not necessarily 3, so we change the assertion so that it's >= 0 (i.e. not an error.) Fixes #126106

Remove some indirection when matching recipe and matcher operands by directly using fold over parameter pack.

Add support for expanding `%b` in `LLVM_PROFILE_FILE` to the binary ID (build ID). It can be used with `%m` to avoid its signature collisions. This is supported on all platforms where writing binary IDs into profiles is implemented, as the `__llvm_write_binary_ids` function is used. Fixes #51560.

Turns out there are users who use gcc to compile compiler-rt. Using the clang-specific builtin function `__builtin_readcyclecounter()` does not work in this case. Solution is to use inline assembly using the stckf instruction in case the compiler is not clang.

This patch implements generic associative container benchmarks for containers with unique keys. In doing so, it replaces the existing std::map benchmarks which were based on the cartesian product infrastructure and were too slow to execute. These new benchmarks aim to strike a balance between exhaustive coverage of all operations in the most interesting case, while executing fairly rapidly (~40s on my machine). This bumps the requirement for the map benchmarks from C++17 to C++20 because the common header that provides associative container benchmarks requires support for C++20 concepts.

Add interface for `sinpi`, `cospi` and `sincospi` and also expose `sincosf`

Fixes: #125102

We were previously checking this after recursing on all callers, but if we already have a single allocation type there is no need to even look at any callers. Didn't show a significant improvement overall, but it does reduce the count of times we enter the identifyClones and do other checks.

I missed a few places to tidy up from before using the tablengen files directly for the builtins. I didn't remove all of the modulemap entries and there were two small `.def` files left lingering. This should clean all of that up. I went through to cross check the list of files and it looks correct now.

) This patch is a followup of the previous one: #115922, It adds an option to turn on emitting non-atomic rmw code sequence instead of atomic rmw.

Export all symbols from both EC and native symbol tables. If an explicit export is present in either symbol table, auto-export is disabled for both.

… set. (#125962)

The DAG will now receive a callback whenever a new instruction is created and will update itself accordingly.

Summary: This patch cleans up how we query the offloading toolchain. We create a single that is more similar to the existing `getToolChain` driver function and make all the offloading handlers use it.

Summary: Currently the `-Xarch` argument needs to re-parse the option, which goes through every single registered argument. This causes errors when trying to pass `-O1` through it because it thinks it's a DXC option. This patch changes the behavior to only allow `clang` options. Concievably we could detect the driver mode to make this more robust, but I don't know if there are other users for this. Fixes: #110325

Implement HLSLElementwiseCast excluding support for splat cases Do not support casting types that contain bitfields. Partly closes #100609 and partly closes #100619

For 0d vector type the rewrite crashes.

When a test depends on a new debugserver feature/fix, the API test must be marked @skipIfOutOfTreeDebugserver because the macOS CI bots test using the latest Xcode release debugserver. But over time all of these fixes & new features are picked up in the Xcode debugserver and these skips can be removed. We may see unexpected test failures from removing all of these 1+ year old skips, but that's likely a separate reason the test is failing that is being papered over by this skip.

…he base class of {Function, GlobalVariable, IFunc} (#125757) This is a split of #125756

…ld (#126145) Adding 'no_openmp_constructs' assumption clause to clang broke the flang build. Adding to flang so it builds. Testing - Build - Testing: check-all

…126149) Reverts #118842

@ftynse

cc @ftynse @wsmoses

…e lookups (#123391) **Summary** Add support for filtering line table entries based on `DW_AT_LLVM_stmt_sequence` attribute when looking up address ranges. This ensures that line entries are correctly attributed to their corresponding functions, even when multiple functions share the same address range due to optimizations. **Background** In #110192 we added support to clang to generate the `DW_AT_LLVM_stmt_sequence` attribute for `DW_TAG_subprogram`'s. Corresponding RFC: [New DWARF Attribute for Symbolication of Merged Functions](https://discourse.llvm.org/t/rfc-new-dwarf-attribute-for-symbolication-of-merged-functions/79434) The `DW_AT_LLVM_stmt_sequence` attribute allows accurate attribution of line number information to their corresponding functions, even in scenarios where functions are merged or share the same address space due to optimizations like Identical Code Folding (ICF) in the linker. **Implementation Details** The patch modifies `DWARFDebugLine::lookupAddressRange` to accept an optional DWARFDie parameter. When provided, the function checks if the `DIE` has a `DW_AT_LLVM_stmt_sequence` attribute. This attribute contains an offset into the line table that marks where the line entries for this DIE's function begin. If the attribute is present, the function filters the results to only include line entries from the sequence that starts at the specified offset. This ensures that even when multiple functions share the same address range, we return only the line entries that actually belong to the function represented by the DIE. The implementation: - Adds an optional DWARFDie parameter to lookupAddressRange - Extracts the `DW_AT_LLVM_stmt_sequence` offset if present - Modifies the address range lookup logic to filter sequences based on their offset - Returns only line entries from the matching sequence

…126141) This patch implements the vectorizer's callback for getting notified about new instructions being created. This updates the scheduler state, which may involve removing dependent instructions from the ready list and update the "scheduled" flag. Since we need to remove elements from the ready list, this patch also implements the `remove()` operation.

These prevented ThreadMemory from correctly returning the Name/Queue/Info of the backing thread. Note about testing: this test only finds regressions if the system sets a name or queue for the backing thread. While this may not be true everywhere, it still provides coverage in some systems, e.g. in Apple platforms.

It seems that depending on the platform, gcc acceptts or does not accept `-mvx` without specifying an architecture actually having vector instructions. The solution which seems to work across different versions of gcc and clang is to specify the least architecture which has vector instructions. In addition, initialization of the unused variable CPU prevents a compiler warning from gcc.

Co-authored-by: Nikita Popov <[email protected]>

…125481) This is extracted from #118638 After c7ebe4f we will crash in fixNonInductionPHIs if we use a VPWidenPHIRecipe with the vector preheader as an incoming block, because the phi will reference the old non-IRBB vector preheader. This fixes this by updating VPBlockUtils::reassociateBlocks to update any VPWidenPHIRecipes's incoming blocks. This assumes that if the VPWidenPHIRecipe is in a VPRegionBlock, it's in the entry block, and that we are replacing a VPBasicBlock with another VPBasicBlock.

Add an optional flag for the secondary allocator called `EnableGuardPages` to enable/disable the use of guard pages. By default, this option is enabled.

hokein and others added 30 commits February 6, 2025 20:15

[bazel] Port for f497fe4

e40610d

[NVPTX] Require asserts in unrecognized-sm1x.ll (#126105)

b2bd3a4

Fix clang-with-thin-lto-ubuntu - failed build

Update test for symbolizer fix

14d6e1e

[libc] Disable fixed point printing for baremetal (#126115)

ec7167b

This follows suite with disabling float printing.

[CSSPGO] Turn on call-graph matching by default for CSSPGO (#125938)

068d0c0

Tested call-graph matching on some of Meta's large services, it works to reuse some renamed function profiles, no negative perf or significant build speed regression observed. Turned it on by default for CSSPGO mode.

[OpenMP] Fix misspelled symbol name (#126120)

b357495

Summary: This is supposed to be `__llvm_rpc_client` but I screwed it up and didn't notice at the time. Will need to be backported.

[Modules] Fix missing module dependency introduced by 7347870 (#126007)

b93f8b8

7347870 introduced a textual header but did not update clang's module map. This PR adds the header to the module map.

[RISCV][NFC] Remove nounwind from push/pop tests (#125939)

624dc00

These tests are for frame handling code with push/pop. To increase coverage of CFI/Unwind info, this removes the `nounwind` annotations and regenerates the checks for this test.

[libc][minor] Fix assertion in LlvmLibcFILETest.SimpleFileOperations (#…

4c8acbd

…126109) The file descriptor of the first opened file is not necessarily 3, so we change the assertion so that it's >= 0 (i.e. not an error.) Fixes #126106

[VPlan] Simplify operand tuple matching in VPlanPatternMatch (NFC).

049aa17

Remove some indirection when matching recipe and matcher operands by directly using fold over parameter pack.

[flang][cuda] Add interface for sinpi, cospi and sincospi (#126123)

98752ef

Add interface for `sinpi`, `cospi` and `sincospi` and also expose `sincosf`

[libc] generate sys/wait.h for aarch64 (#125171)

74c2e9a

Fixes: #125102

[MLIR] Support non-atomic RMW option for emulated vector stores (#124887

f0e1857

) This patch is a followup of the previous one: #115922, It adds an option to turn on emitting non-atomic rmw code sequence instead of atomic rmw.

[LLD][COFF] Add support for MinGW auto-export on ARM64X (#125862)

f729477

Export all symbols from both EC and native symbol tables. If an explicit export is present in either symbol table, auto-export is disabled for both.

[RISCV][Disassemble] Ensure the comment stream of the disassembler is…

b40ef2c

… set. (#125962)

[SandboxVec][DAG] Update DAG when a new instruction is created (#126124)

166b2e8

The DAG will now receive a callback whenever a new instruction is created and will update itself accordingly.

[compiler-rt] Fix binary-id-path.c after da05341

8d925a1

[Clang][NFC] Clean up fetching the offloading toolchain (#125095)

6e2f08b

Summary: This patch cleans up how we query the offloading toolchain. We create a single that is more similar to the existing `getToolChain` driver function and make all the offloading handlers use it.

jhuber6 and others added 19 commits February 6, 2025 16:36

[HLSL] Implement HLSL Flat casting (excluding splat cases) (#118842)

01072e5

Implement HLSLElementwiseCast excluding support for splat cases Do not support casting types that contain bitfields. Partly closes #100609 and partly closes #100619

[mlir][amdgpu] Support for 8bit extf for 0d vector type (#126102)

97b08b8

For 0d vector type the rewrite crashes.

[IR] Generalize Function's {set,get}SectionPrefix to GlobalObjects, t…

5399782

…he base class of {Function, GlobalVariable, IFunc} (#125757) This is a split of #125756

[flang][OpenMP] 'no_openmp_constructs' added to clang broke flang bui…

8fb1b3f

…ld (#126145) Adding 'no_openmp_constructs' assumption clause to clang broke the flang build. Adding to flang so it builds. Testing - Build - Testing: check-all

[lldb][NFC] whitespace reflow

163ccfa

Revert "[HLSL] Implement HLSL Flat casting (excluding splat cases)" (#…

14716f2

…126149) Reverts #118842

[mlir] feat: add mlirFuncSetResultAttr (#125972)

a15618f

cc @ftynse @wsmoses

[libc++][NFC] Replace typedefs with using aliases in <string> (#126070)

7788617

[SystemZ] Avoid repeated hash lookups (NFC) (#126005)

5a056f9

Co-authored-by: Nikita Popov <[email protected]>

[Analysis] Avoid repeated hash lookups (NFC) (#126011)

4590f75

[gn build] Manually port f7b3559

7f7605d

[scudo] Make guard pages optional in the secondary (#125960)

3d35246

Add an optional flag for the secondary allocator called `EnableGuardPages` to enable/disable the use of guard pages. By default, this option is enabled.

pull bot added the ⤵️ pull label Feb 7, 2025

pull bot merged commit 3d35246 into Ericsson:main Feb 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[pull] main from llvm:main #5547

[pull] main from llvm:main #5547

pull bot commented Feb 7, 2025 •

edited

Loading

[pull] main from llvm:main #5547

[pull] main from llvm:main #5547

Conversation

pull bot commented Feb 7, 2025 • edited Loading

pull bot commented Feb 7, 2025 •

edited

Loading