-
Notifications
You must be signed in to change notification settings - Fork 244
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MIOpen Error: /home/dlowell/MIOpenPrivate/src/include/miopen/hipoc_kernel.hpp:200: Failed to get function: Im2Col from /home/username/.cache/miopen/1.7.0/4d8ca72dd191276335b17fe1d88f6f76/MIOpenUtilKernels.cl.o hipErrorNotFound #67
Comments
@greatken999 Please make sure you have upgraded rocm-clang-ocl as well AND delete your program caches if they exist. Typically the cache is located in ~/.cache/miopen https://rocmsoftwareplatform.github.io/MIOpen/doc/html/cache.html |
@daniellowell thanks . |
Uninstall your old version and do |
it's ok now use rocm-clang-ocl-0.3.0-688fe5d |
No problem! |
I can't install rocm-clang-ocl
What is it? Please help |
Looks like you previously had mivisionx installed but you do not now? Did you try purging mivisionx, or running |
I removed mivisionx and after that all are OK. Thanks! But...
Why "no registered kernels" ? |
e05dcb421 perf db validation fix (#68) 260d9465d Add INT8 as a data_type v2 (#67) b6a5b2a77 sync with fin folder in miopen (#62) 0e03399ec prep for Palamida scan (#63) e6bd05c33 Performance db testing (#61) 30d699b9e Perf Eval Update (#60) 3535b948c PerfCompile and PerfEval changes (#59) de79468d2 remove unneccessary solution check, add check for previously modified kernel names (#56) 6924286a2 miopen hash update (#55) 530399575 Refactor googletest infra to align with MIOpen (#53) 71c50d146 Datatype fix for BN (#57) 8abe2f5c6 Perf Eval updates, Add find info (#51) e1c1ef0f5 filter find compile by solver input (#54) 722feea66 sp/chk precomp kernel 264 (#41) b9aba2034 Batch norm find compile (#50) 359f3da80 Fix missing link directives in fin binary (#48) a4020c1ba Cache Miss Fixes (#46) 2ec7ef44d Enable google test and compiling fin in the CI (#47) 8b6b453bc Applicability support for batch norm (#45) 44323aae9 Perf compile/eval for fin (#42) ebd9aa6bd update member name (#43) d6d798efe add cu count (#39) 8e1989a9f Add find option for selecting only dynamic solvers (#38) 0e164bf66 setting json version (#37) f3f7fed18 Remove function redefinition (#36) e1de51a58 Performance DB de-serialize test (#34) 043cdcdaa Layout support in Fin (#33) 3a1d58236 Hotfix (#32) ee3f0d543 4.4 Tuning Bugfixes (#31) 832dbe234 Tunability Reporting (#27) a564a229f include gfx90a_110 (#28) git-subtree-dir: fin git-subtree-split: e05dcb42187f05fe0d0d1b05b822dc4b750f199e
* remove datatype 0,1 from perf_db * rm invalid fp16 entries from pdb * Squashed 'fin/' changes from 53d2563fe..e05dcb421 e05dcb421 perf db validation fix (#68) 260d9465d Add INT8 as a data_type v2 (#67) b6a5b2a77 sync with fin folder in miopen (#62) 0e03399ec prep for Palamida scan (#63) e6bd05c33 Performance db testing (#61) 30d699b9e Perf Eval Update (#60) 3535b948c PerfCompile and PerfEval changes (#59) de79468d2 remove unneccessary solution check, add check for previously modified kernel names (#56) 6924286a2 miopen hash update (#55) 530399575 Refactor googletest infra to align with MIOpen (#53) 71c50d146 Datatype fix for BN (#57) 8abe2f5c6 Perf Eval updates, Add find info (#51) e1c1ef0f5 filter find compile by solver input (#54) 722feea66 sp/chk precomp kernel 264 (#41) b9aba2034 Batch norm find compile (#50) 359f3da80 Fix missing link directives in fin binary (#48) a4020c1ba Cache Miss Fixes (#46) 2ec7ef44d Enable google test and compiling fin in the CI (#47) 8b6b453bc Applicability support for batch norm (#45) 44323aae9 Perf compile/eval for fin (#42) ebd9aa6bd update member name (#43) d6d798efe add cu count (#39) 8e1989a9f Add find option for selecting only dynamic solvers (#38) 0e164bf66 setting json version (#37) f3f7fed18 Remove function redefinition (#36) e1de51a58 Performance DB de-serialize test (#34) 043cdcdaa Layout support in Fin (#33) 3a1d58236 Hotfix (#32) ee3f0d543 4.4 Tuning Bugfixes (#31) 832dbe234 Tunability Reporting (#27) a564a229f include gfx90a_110 (#28) git-subtree-dir: fin git-subtree-split: e05dcb42187f05fe0d0d1b05b822dc4b750f199e * fix clang-format issue Co-authored-by: Jun Liu <[email protected]>
49e3e3a62 clang format db80b1777 update to using TestPerfCfgParams for pdb validity checks e48a4fd3a format a4f85842c exception for non-tunable solvers in params check d58c42bbd Check params at end of perf tuning (#70) 1a3b47c7b Return status for failed compile commands (#69) d59962752 out_layout -> in_layout 6ba7a8f3f Rename conv_mode to mode (#64) 513a3da1b [bg/LWPTUNA-173] (#65) e05dcb421 perf db validation fix (#68) 260d9465d Add INT8 as a data_type v2 (#67) b6a5b2a77 sync with fin folder in miopen (#62) 0e03399ec prep for Palamida scan (#63) e6bd05c33 Performance db testing (#61) 30d699b9e Perf Eval Update (#60) 3535b948c PerfCompile and PerfEval changes (#59) de79468d2 remove unneccessary solution check, add check for previously modified kernel names (#56) 6924286a2 miopen hash update (#55) 530399575 Refactor googletest infra to align with MIOpen (#53) 71c50d146 Datatype fix for BN (#57) 8abe2f5c6 Perf Eval updates, Add find info (#51) e1c1ef0f5 filter find compile by solver input (#54) 722feea66 sp/chk precomp kernel 264 (#41) b9aba2034 Batch norm find compile (#50) 359f3da80 Fix missing link directives in fin binary (#48) a4020c1ba Cache Miss Fixes (#46) 2ec7ef44d Enable google test and compiling fin in the CI (#47) 8b6b453bc Applicability support for batch norm (#45) 44323aae9 Perf compile/eval for fin (#42) ebd9aa6bd update member name (#43) d6d798efe add cu count (#39) 8e1989a9f Add find option for selecting only dynamic solvers (#38) 0e164bf66 setting json version (#37) f3f7fed18 Remove function redefinition (#36) e1de51a58 Performance DB de-serialize test (#34) 043cdcdaa Layout support in Fin (#33) 3a1d58236 Hotfix (#32) ee3f0d543 4.4 Tuning Bugfixes (#31) 832dbe234 Tunability Reporting (#27) a564a229f include gfx90a_110 (#28) git-subtree-dir: fin git-subtree-split: 49e3e3a62a7cc54adacbeea95680d35f9a4685de
* [Windows] Debug vector add Windows CI failure. (#3090) * fixed missing <CR> in the kernel * replaced uint with size_t to fix Windows CI * clang format fix * Fixed duplicating info, narrowing conversion. * [MHA] Implement MIOPEN_BACKEND_OPERATION_RESHAPE_DESCRIPTOR (#3082) * Implement MIOPEN_BACKEND_OPERATION_RESHAPE_DESCRIPTOR * Introduce an enum to indicate transpose op * Fix possible throws in the test * Bump rocm-docs-core[api_reference] from 1.4.1 to 1.5.0 in /docs/sphinx (#3101) Bumps [rocm-docs-core[api_reference]](https://github.com/ROCm/rocm-docs-core) from 1.4.1 to 1.5.0. - [Release notes](https://github.com/ROCm/rocm-docs-core/releases) - [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md) - [Commits](ROCm/rocm-docs-core@v1.4.1...v1.5.0) --- updated-dependencies: - dependency-name: rocm-docs-core[api_reference] dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Bump certifi from 2024.2.2 to 2024.7.4 in /docs/sphinx (#3102) Bumps [certifi](https://github.com/certifi/python-certifi) from 2024.2.2 to 2024.7.4. - [Commits](certifi/python-certifi@2024.02.02...2024.07.04) --- updated-dependencies: - dependency-name: certifi dependency-type: indirect ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Fix #3076 - issues related to in_offset overflow in Col2Im (2D and 3D) and Im2Col (2D only) (#3099) * fix-overflow-col2im2d(01) Switch to unsigned math in 2D and 3D col2im kernels. * fix-overflow-col2im2d(02) Add "dry-run" mode to Col2ImGPU() and use it. * fix-overflow-col2im2d(03) Revert "fix-overflow-col2im2d(02) Add "dry-run" mode to Col2ImGPU() and use it." This reverts commit c8f9d40. * fix-overflow-col2im2d(04) Add 64-bit index mode * fix-overflow-col2im2d(06) Tidy fix * fix-overflow-col2im2d(07) Minimal changes to avoid trimming of im_offset in 2D Im2Col * Update CK commit hash in requirements.txt (#3087) * Update CK commit hash in requirements.txt * update CK commit to include fix * use CK version with cmake fixes * update CK commit hash to include the fix --------- Co-authored-by: illsilin <[email protected]> * Enable hipBLASLt backend for GEMM, and make it the default option for RNN's under specific conditions (#3030) * Unblock CI by regenerating gfx908.kdb (#3092) * Support for invoker caching and find modes to fusion find (#3095) * implemented support for invoker caching and find modes to fusion find * Fixed some errors * [NFC] Move convolution solvers to solver/conv directory (part 2) (#3042) * [Fixes CI] Fixed tidy hang (#3114) * Fixed tidy hang by disabling a check on bugged versions * Added a comment on why this is necessary * Find: Optimize benchmarking for long execution times. (#3103) * [ROCm 6.2] add change log (#3113) * [WA] Remove exception checking tests for unsupported hardware (#3117) * Remove exception checking tests for unsupported hardware * Add workaround define and re-add tests * Add tests for ConvBiasResAddActivation forward using Graph API (#3112) * Add tests for ConvBiasResAddActivation forward using Graph API * Apply formatting * [gfx12] add support of gfx12 platforms (#3109) * [Staging] Update CK commit hash in requirements.txt (#3122) * Revert "[Staging] Update CK commit hash in requirements.txt (#3122)" (#3125) This reverts commit 8449363. * Update GraphAPI Tensor to inherit from TensorDescriptor (#3119) * [COMgr] Prepare for COMgr 3.x (#3107) * amdcomgr-3(01) [cmake] Do not allow different values of MIOPEN_USE_COMGR and MIOPEN_USE_HIPRTC. Remove leftover of support of ROCm older than 5.0. * amdcomgr-3(02) [importance_normal] Fix precompiled binary cache miss for Winograd Fury. Resolves #2778 (comment) * amdcomgr-3(03) [winograd fury] Add comment. * amdcomgr-3(04) [comgr] Removed comgr::BuildHip() * amdcomgr-3(05) [comgr] Added support for AMD COMgr 3.0. Removed support for AMD COMgr older than 1.7. * amdcomgr-3(06) [comgr] Removed support of unused enum members from to_string(). * amdcomgr-3(08) [comgr] Fix tidy error --------- Co-authored-by: Evgenii Averin <[email protected]> * [debugging] Make naive conv solvers obey MIOPEN_DEBUG_HIP_KERNELS (#3111) * Fix GCC8 linking issues with stdc++fs (#3126) * Added missing weight sorting as WTI calculation in fusion (#3105) * GTests renaming: filename starting letters A-B (#3124) * Tests with starting letters of the filenames "a"-"b" are renamed * "b" test change added --------- Co-authored-by: Jun Liu <[email protected]> * [bug][tests] Fix broken tests (#3123) * Fix broken gtests * Fix formatting * Fix ctests --------- Co-authored-by: Jun Liu <[email protected]> * Changed Fusion to have a separate find mode env variable and default to fast (#3121) * Added logging of paths to HIP, HIPRTC, COMgr, rocBlas, rocMLIR, rocRand, frugally-deep, Eigen3. Added logging of versions of frugally-deep, Eigen3. (#3110) * Implement getitem backward (#2883) * Fix bnorm args log info (#3127) Co-authored-by: Alex Eremin <[email protected]> * Fixed redundant find on failed fallback (#3131) Co-authored-by: Jun Liu <[email protected]> * [gtest] Fix log test error (#3150) * Bump rocm-docs-core[api_reference] from 1.5.0 to 1.6.1 in /docs/sphinx (#3160) Bumps [rocm-docs-core[api_reference]](https://github.com/ROCm/rocm-docs-core) from 1.5.0 to 1.6.1. - [Release notes](https://github.com/ROCm/rocm-docs-core/releases) - [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md) - [Commits](ROCm/rocm-docs-core@v1.5.0...v1.6.1) --- updated-dependencies: - dependency-name: rocm-docs-core[api_reference] dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * [Windows] Temporary workaround on rocMLIR issue (#3164) * [RNN] BWD concurrent imp (#3169) * GTests renaming: filenames which start with letter C (#3153) * [MHA] Implement MHA backward pattern (#3128) * Reduce calculation prod enhancement in case of inner dim (#2970) * Deduplicated ValidateGroupCount and fixed incorrect usage (#3177) * Deduplicated ValidateGroupCount * Changed static to namespace {} * format * Fixed find 2.0 ValidateGroupCount incorrect usage when transposed * windows fix, build without GEMM (#3179) * windows fix * Copyright * Refactor BnCKFwdInference::GetSolution for NHWC (#3120) * [WORKAROUND] Allow to use boost:filesystem instead of std::filesystem, [BUGFIX] check_cxx_linker_flag (#3154) * GTests renaming: filenames which start with letters D-G (#3178) * RNN wall clock timer update (#3180) * Add pattern and executor for ConvBiasResAddActiv GraphAPI (#3161) * [RNN] WA for tensors with more than 3 dims. (#3186) * [Fix][TransformTensor] Ignore output buffer when BETA=0 (#3184) * MI300 TunaNet Update: CK FWD and WRW Solvers Updated (#3130) * Bump rocm-docs-core[api_reference] from 1.6.1 to 1.6.2 in /docs/sphinx (#3185) Bumps [rocm-docs-core[api_reference]](https://github.com/ROCm/rocm-docs-core) from 1.6.1 to 1.6.2. - [Release notes](https://github.com/ROCm/rocm-docs-core/releases) - [Changelog](https://github.com/ROCm/rocm-docs-core/blob/v1.6.2/CHANGELOG.md) - [Commits](ROCm/rocm-docs-core@v1.6.1...v1.6.2) --- updated-dependencies: - dependency-name: rocm-docs-core[api_reference] dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Dropout kernel OpenCL to HIP + gtest (#3088) * Enhancement rotary position embedding (#3009) * [RNN] MS heuristic update (#3187) * Update Conv3DTestCase to use structs (#3162) * Backward MHA test using C++ API (#3097) * Revert "[Fix][TransformTensor] Ignore output buffer when BETA=0 (#3184)" (#3194) This reverts commit a89a850. * Update CK without new base ROCm renew (#3191) * [gtest] Unit test for ConvWinoFuryRxS (#3175) * Update CK commit hash to include fix for gfx12 (#3195) * [Perf] replace blocking hipMemset with async hip Memset (#3196) * Bump rocm-docs-core[api_reference] from 1.6.2 to 1.7.0 in /docs/sphinx (#3201) Bumps [rocm-docs-core[api_reference]](https://github.com/ROCm/rocm-docs-core) from 1.6.2 to 1.7.0. - [Release notes](https://github.com/ROCm/rocm-docs-core/releases) - [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md) - [Commits](ROCm/rocm-docs-core@v1.6.2...v1.7.0) --- updated-dependencies: - dependency-name: rocm-docs-core[api_reference] dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Enable Graph execution in Backward MHA C++ API Test (#3199) * WIP: add reshape nodes and enalbe graph execution * fixed a bug due to polluted variant pack * rename for consistency * [WA] Disable MLIR when building debug to workaround sanitizer issue (#3200) * Update docker ROCm version for CI and MIOpen version (#3181) * Implement PReLU backward (#3152) * Bump rocm-docs-core[api_reference] from 1.7.0 to 1.7.1 in /docs/sphinx (#3205) Bumps [rocm-docs-core[api_reference]](https://github.com/ROCm/rocm-docs-core) from 1.7.0 to 1.7.1. - [Release notes](https://github.com/ROCm/rocm-docs-core/releases) - [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md) - [Commits](ROCm/rocm-docs-core@v1.7.0...v1.7.1) --- updated-dependencies: - dependency-name: rocm-docs-core[api_reference] dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * [gtest] Unit tests for GEMM solvers (#3197) * Fix bn layout network config caching issue (#3203) * Reapplied and fixed #3194 (#3211) * Reapply "[Fix][TransformTensor] Ignore output buffer when BETA=0 (#3184)" (#3194) This reverts commit 326d734. * Fixed incorrect network config * Removed redundant boost::thread include (#3210) * Fixed Find 2.0 fusion ignoring miopenSetFindOptionTuning (#3157) * [Tests] All tests in the same test suite must use the same test fixture class (#3202) class --------- Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: sgundabo <[email protected]> Co-authored-by: Sergei Grigoriev <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Artem Tamazov <[email protected]> Co-authored-by: Jun Liu <[email protected]> Co-authored-by: illsilin <[email protected]> Co-authored-by: BrianHarrisonAMD <[email protected]> Co-authored-by: Vasilii Filippov <[email protected]> Co-authored-by: Evgenii Averin <[email protected]> Co-authored-by: mentat <[email protected]> Co-authored-by: Vsevolod Golovko <[email protected]> Co-authored-by: Seungman Han <[email protected]> Co-authored-by: who who who <[email protected]> Co-authored-by: Alex Eremin <[email protected]> Co-authored-by: Artur Wojcik <[email protected]> Co-authored-by: Kamil Nasyrov <[email protected]> Co-authored-by: Vasilii Filippov <[email protected]> Co-authored-by: xinlipn <[email protected]> Co-authored-by: M. Saud Ul Hassan <[email protected]> Co-authored-by: amberhassaan <[email protected]> Co-authored-by: Daming Feng <[email protected]> Co-authored-by: Long Luong <[email protected]>
I get this error info after apt upgrade:
MIOpen Error: /home/dlowell/MIOpenPrivate/src/include/miopen/hipoc_kernel.hpp:200: Failed to get function: Im2Col from /home/ken/.cache/miopen/1.7.0/4d8ca72dd191276335b17fe1d88f6f76/MIOpenUtilKernels.cl.o hipErrorNotFound
The text was updated successfully, but these errors were encountered: