Develop upstream sync 240521 #2548
Conversation
PiperOrigin-RevId: 634161496
…KindInfo()` Checking `GetEnableMemories()` inside `PyDeviceList::PopulateMemoryKindInfo()` causes the memory kind to be built and cached based on the value of `enable_memories` at the time the method was first called. This is problematic when users later change the value of `enable_memories` but reuse the same device list. This CL fixes the problem by making `PyDeviceList::MemoryKinds()` and `PyDeviceList::DefaultMemoryKind()` always check `GetEnableMemories()` outside the cached method, so they behave correctly even if `enable_memories` changes. PiperOrigin-RevId: 634191502
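A minimal C++ sketch of the caching pitfall this commit describes. All names below (`DeviceList`, `QueryMemoryKinds`, `enable_memories_`) are illustrative stand-ins, not the actual XLA `PyDeviceList` API:

```cpp
#include <optional>
#include <string>
#include <vector>

// Hypothetical stand-in for the real device list. The point: a cached
// value must not depend on a flag that can change after the first call.
class DeviceList {
 public:
  // Before the fix (buggy): the result of the first call is cached, so a
  // later change to enable_memories_ is silently ignored.
  const std::vector<std::string>& MemoryKindsBuggy() {
    if (!cached_kinds_.has_value()) {
      cached_kinds_ = enable_memories_ ? QueryMemoryKinds()
                                       : std::vector<std::string>{};
    }
    return *cached_kinds_;
  }

  // After the fix: check the flag outside the cached computation, so the
  // cache only ever holds flag-independent data.
  std::vector<std::string> MemoryKinds() {
    if (!enable_memories_) return {};
    if (!cached_kinds_.has_value()) cached_kinds_ = QueryMemoryKinds();
    return *cached_kinds_;
  }

  void set_enable_memories(bool v) { enable_memories_ = v; }

 private:
  std::vector<std::string> QueryMemoryKinds() { return {"device", "host"}; }

  bool enable_memories_ = false;
  std::optional<std::vector<std::string>> cached_kinds_;
};
```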
PiperOrigin-RevId: 634193569
PiperOrigin-RevId: 634195071
`xla::ifrt::RemapPlan`'s `mappings` is frequently reused across different `xla::ifrt::RemapPlan` instances. It can be costly to keep a copy of `mappings` for each plan if the mappings are not trivially small. This change wraps `mappings` in a `std::shared_ptr` so that it can be shared cheaply across multiple plans. PiperOrigin-RevId: 634214931
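A short sketch of the `std::shared_ptr` sharing pattern the commit describes; `Mapping` and `RemapPlan` below are simplified stand-ins, not the real `xla::ifrt` types:

```cpp
#include <memory>
#include <vector>

// Illustrative stand-ins for the xla::ifrt types.
struct Mapping {
  int in_shard;
  int out_shard;
};

struct RemapPlan {
  // Shared rather than copied: many plans can point at the same mappings,
  // so constructing a new plan no longer copies a potentially large vector.
  std::shared_ptr<const std::vector<Mapping>> mappings;
};

int main() {
  auto mappings = std::make_shared<const std::vector<Mapping>>(
      std::vector<Mapping>{{0, 1}, {1, 0}});
  RemapPlan plan_a{mappings};  // bumps the refcount; no deep copy
  RemapPlan plan_b{mappings};  // same storage shared by both plans
  return plan_a.mappings->size() == plan_b.mappings->size() ? 0 : 1;
}
```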
PiperOrigin-RevId: 634216731
PiperOrigin-RevId: 634224863
… tests. Imported from GitHub PR openxla/xla#12538 Copybara import of the project: -- 78d74154a1693426014eef2940e759d1e36902ed by Ilia Sergachev <[email protected]>: [GPU] Fix OSS compilation of previously disabled tests. Merging this change closes tensorflow#12538 PiperOrigin-RevId: 634248275
PiperOrigin-RevId: 634257201
The previous logic for filtering by module wasn't correct: instructions can be modified after autotuning, so not all relevant information was serialized. The fix may serialize excessive data, but autotuning results are very small compared to the module size and the compiled artifact. PiperOrigin-RevId: 634259538
…mRewriter pass. PiperOrigin-RevId: 634263367
…eVars PiperOrigin-RevId: 634264428
PiperOrigin-RevId: 634268437
PiperOrigin-RevId: 634268459
XLA should not include any platform specific headers in XLA header files. This change moves us one step closer to this goal by getting rid of type aliases in `gpu_types.h`. PiperOrigin-RevId: 634278649
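A hypothetical before/after illustration of the kind of platform-leaking alias this goal targets; it is a sketch, not the actual contents of `gpu_types.h`:

```cpp
// Before (hypothetical): a public header conditionally aliases a platform
// type, so every includer transitively pulls in CUDA or ROCm headers.
//
//   #if GOOGLE_CUDA
//   #include "cuda.h"
//   using GpuStreamHandle = CUstream;
//   #elif TENSORFLOW_USE_ROCM
//   #include "hip/hip_runtime.h"
//   using GpuStreamHandle = hipStream_t;
//   #endif

// After: the public header stays platform-neutral with an opaque handle;
// only platform-specific .cc files cast it back to the concrete type.
using GpuStreamHandle = void*;

// In a CUDA-only source file (illustrative):
//   CUstream AsCuStream(GpuStreamHandle h) {
//     return reinterpret_cast<CUstream>(h);
//   }
```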
PiperOrigin-RevId: 634279450
PiperOrigin-RevId: 634285627
Imported from GitHub PR openxla/xla#12501 Copybara import of the project: -- 97b50d47877dd3dc46535c6fa34f0e449ee57bfe by Ilia Sergachev <[email protected]>: [NFC] Fix mistypes in scatter expander. Merging this change closes tensorflow#12501 PiperOrigin-RevId: 634285954
PiperOrigin-RevId: 634288224
PiperOrigin-RevId: 634291787
…fusion_analysis. This is a pure HLO pass that does not require a GPU. PiperOrigin-RevId: 634303515
This is used as an input to mlir_replay. Note that GPU-specific ops are not implemented in the interpreter yet (at HEAD). PiperOrigin-RevId: 634304231
…keep track of the cache path. PiperOrigin-RevId: 634308967
…NN_3.4.1 PiperOrigin-RevId: 634320860
For this to work, we need a folder, so add that too. We kind of rely on the folder working most of the time (for caching of instructions). There may be some dragons here; I'll probably rewrite that stuff at some point. This works around the remaining known miscompiles. The issue must be somewhere in LLVM and be related to GEPs with subtractions, but I lack the skills and resources to actually track it down. PiperOrigin-RevId: 634326550
PiperOrigin-RevId: 634330969
…he module. PiperOrigin-RevId: 634349913
…m_algorithm_picker. These defines are not needed since the BUILD target is already guarded by `if_gpu_is_configured`. PiperOrigin-RevId: 634353818
https://clang.llvm.org/docs/ThreadSafetyAnalysis.html --config=warnings?
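For context, the linked Clang Thread Safety Analysis is driven by attributes like those below. A minimal sketch, compiled with `clang++ -Wthread-safety`; the annotated `Mutex` wrapper is a toy modeled on the doc's `mutex.h` example, not a real library class:

```cpp
#include <mutex>

// Toy annotated mutex, following the Clang Thread Safety Analysis docs.
class __attribute__((capability("mutex"))) Mutex {
 public:
  void Lock() __attribute__((acquire_capability())) { mu_.lock(); }
  void Unlock() __attribute__((release_capability())) { mu_.unlock(); }

 private:
  std::mutex mu_;
};

class Counter {
 public:
  void Increment() {
    mu_.Lock();
    value_++;  // OK: mu_ is held here.
    mu_.Unlock();
  }

  int UnsafeRead() {
    return value_;  // -Wthread-safety warns: value_ read without holding mu_.
  }

 private:
  Mutex mu_;
  int value_ __attribute__((guarded_by(mu_))) = 0;
};
```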
@draganmladjenovic I think you will upstream this profiler later as well?
We could try to enable this cudnn_fused_conv_rewriter_test this week, as openxla/xla#9666 is merged.
retest Ubuntu-GPU-multi please
Nope. Still fails.
Force-pushed from 4e1e7ee to f1d0cc8 (Compare)
@draganmladjenovic let's disable this one as well so this weekly sync should be ok. We should enable some of the disabled XLA ones in the next weekly sync.
Force-pushed from bae8034 to 161cea2 (Compare)
retest Ubuntu-GPU-single please
To avoid loading both the system and local_rocm_config versions of the library. WORKAROUND!
Force-pushed from 161cea2 to 4e0d9b8 (Compare)
@Ruturaj4 @rahulbatra85 please check this on the JAX side.
Force-pushed from 4e0d9b8 to cc4a257 (Compare)
retest Ubuntu-GPU-single please
retest Ubuntu-CPU please
…t/cost_analysis.h
retest Ubuntu-GPU-single please
retest Ubuntu-GPU-single please
@i-chaochen Please review.
@draganmladjenovic could you upstream this profiler change? 4736848