[TF] Update TF v2.16.1 (without libfft) #9388
Closed
22 commits
9b9a8c4 Update TF v2.16.1 (smuzaffar)
e7f25dc update abseil-cpp 20230802.2 (smuzaffar)
a5ab9c5 update bazel-absl patch (smuzaffar)
85957cb TF2.16: Apply abseil aarch64 patch (iarspider)
3e11df4 Merge branch 'IB/CMSSW_14_1_X/tf' into tf2.16.1 (iarspider)
f3b3548 Update py3-scipy (iarspider)
a819169 Update cython as well (iarspider)
81be620 Try updating py3-blosc2 (iarspider)
f03065f Update py3-tables (iarspider)
fd4d07d Add symlink for cudnn_frontend_archive (iarspider)
49ca37a Update tensorflow.spec (iarspider)
9abe3d2 Remove more files with broken symbols (iarspider)
d7f70e3 Update keras (iarspider)
f38a194 Add missing keras dependencies (iarspider)
302fd5e Update numpy.file (iarspider)
84be832 Update blosc2.file (iarspider)
8df8a95 Update keras.file (iarspider)
0509280 Update keras.file (iarspider)
fdfef33 Update keras.file (iarspider)
1ba91fc Force Keras to use TensorFlow backend (iarspider)
3a27d69 Set KERAS_BACKEND in toolfile (iarspider)
34311da KERAS_BACKEND should be runtime (iarspider)
@@ -1,5 +1,2 @@
Requires: py3-scikit-build py3-cython py3-numpy py3-msgpack py3-ndindex py3-py-cpuinfo py3-rich
BuildRequires: cmake ninja

%define PipPostPost \
%{relocateConfig}lib64/pkgconfig/blosc2.pc
@@ -1,3 +1,5 @@
## INITENV SET KERAS_BACKEND tensorflow
%define PipDownloadSourceType none
Requires: py3-PyYAML py3-six
Requires: py3-h5py py3-keras-applications py3-keras-preprocessing
Requires: py3-absl-py py3-ml_dtypes py3-namex py3-optree py3-rich
@@ -0,0 +1 @@
Requires: py3-typing-extensions
It should be a <runtime ..../> type variable (see ROOTSYS as an example). Did you run the tests locally to see if the GPU unit tests passed after setting this?
I tested it by setting the environment variable manually (not via toolfile).
For me, all the unit tests still fail with an error.
Some tests that were failing previously worked after setting KERAS_BACKEND. Yes, I saw these failures as well - I thought I missed some setup step to make them work (in a container started with the --nv flag).
E.g. testTFConstSession was failing with "ValueError: Unable to import backend : theano", but after setting KERAS_BACKEND it passed.
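To illustrate the manual setting described above, here is a minimal Python sketch that forces the backend from inside a test script. It assumes Keras picks up the KERAS_BACKEND environment variable when it is first imported, so the call has to come before any `import keras`. The helper name `force_keras_backend` is hypothetical and is not part of this PR.

```python
import os


def force_keras_backend(backend="tensorflow", env=None):
    """Set KERAS_BACKEND and return the value it had before (None if unset).

    `env` defaults to os.environ; a plain dict can be passed for testing.
    Hypothetical helper for illustration only.
    """
    env = os.environ if env is None else env
    previous = env.get("KERAS_BACKEND")
    # Overwrite unconditionally: a stale value such as "theano" is replaced.
    env["KERAS_BACKEND"] = backend
    return previous


if __name__ == "__main__":
    old = force_keras_backend()
    print("KERAS_BACKEND was", old, "and is now", os.environ["KERAS_BACKEND"])
    # Only import keras after this point, so that it sees the new value.
```

Setting the variable in the toolfile, as this PR does, avoids having to repeat this in every script.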
Could the failure be due to CUDA 12.4 not being an officially tested version for TF 2.16.1 (or even 2.17)? The link lists 12.3 as the officially tested version.
Running
python -c "import tensorflow as tf; print(tf.config.list_physical_devices('GPU'))"
prints this message:
and returns an empty list []. I googled this message, and there are basically three solutions:
- conda and prebuilt wheels
- sudo echo 0 | sudo tee -a /sys/bus/pci/devices/0000\:06\:10.0/numa_node after each reboot. But that requires sudo rights (and, I would imagine, not in the container, but on the host).
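The second workaround above writes 0 into a PCI device's numa_node file. Before trying it, the current values can be inspected without sudo. The sketch below assumes the standard Linux sysfs layout (/sys/bus/pci/devices/<address>/numa_node) and that a negative value means the kernel reports no NUMA affinity for that device. The function name `read_numa_nodes` is hypothetical.

```python
from pathlib import Path


def read_numa_nodes(sysfs_root="/sys/bus/pci/devices"):
    """Map each PCI device directory name to its numa_node value.

    Devices whose numa_node file is missing or unreadable map to None.
    Returns an empty dict if sysfs_root does not exist (e.g. not on Linux).
    """
    nodes = {}
    root = Path(sysfs_root)
    if not root.is_dir():
        return nodes
    for dev in sorted(root.iterdir()):
        try:
            nodes[dev.name] = int((dev / "numa_node").read_text().strip())
        except (OSError, ValueError):
            nodes[dev.name] = None
    return nodes


if __name__ == "__main__":
    # List only the devices with a negative numa_node value.
    for address, node in read_numa_nodes().items():
        if node is not None and node < 0:
            print(f"{address}: numa_node={node}")
```

This only reads sysfs, so it is safe to run inside the container. Whether a negative value explains the empty GPU list is not established in this thread.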
Are you sure you started cmssw-el8 with the --nv option? For me the following command runs fine (both for this PR and the TF_X IBs) and returns
Now it works for me as well, weird.