-
Notifications
You must be signed in to change notification settings - Fork 202
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
arm64 and x86_64 linux: TF java full native builds are failing to find the native headers #544
Comments
The issue is not specific to arm64, I see the same missing headers issue even on the other platforms, at least I have reproduced it on linx-x86_64 as well, with Ubuntu 22.04 OS. From the code it looks like it happens on every platform.
The backtrace:
|
We modified where it's looking for the headers just before the rc1 release to fix this kind of issue. I tested it on macOS, and I thought I had tested it on a few Linuxes as well. I'll rerun the Linux build to see what's going on. |
So it looks like the problem is that we used to get the absl headers from Bazel, but something has changed in the TF build process so it's not putting the absl repo in the bazel-tensorflow-core-native folder like it used to. We'd missed this because the clean is inconsistent between bazel & non-bazel builds. |
Hi @Craigacp , it's not just the |
I can replicate this, but we couldn't replicate it on Karl's machine, even after a clean of bazel. Both machines are running macOS 14.5 with the latest XCode, and the same version of bazel so I'm pretty confused as to what's causing the issue. |
I'm surprised in the working case where it is getting the all absl/Eigen/ml_dtype headers from. Probably checking the include paths for libjnitensorflow.cpp compilation might give some clue? |
No, in some cases the external folder in |
I see the same problem on Ubuntu 24.04 with rc2
|
Please make sure that this is a build/installation issue. As per our GitHub Policy, we only address code/doc bugs, performance issues, feature requests and build/installation issues on GitHub. tag:build_template
System information
java -version
): openjdk 11.0.23 2024-04-16Describe the problem
TensorFlow java source builds are failing on aarch64 linux system with the missing native headers. please let me know how it's built for x86_64 linux platform.
based on my debugging so far it looks like the dependency comes from this commit which added C API extension for custom gradient functions, and introduced these headers and .cc which requires several third_party libraries from tensorflow native but none of those bazel workspaces are cloned.
I tried to manually clone the missing workspaces into bazel cache, but the cycle is never ending, it's missing tsl, eigen, ml_dtype, absl, protobuf, and now compiled headers for protobuf....
Provide the exact sequence of commands / steps that you executed before running into the problem
Any other info / logs
Include any logs or source code that would be helpful to diagnose the problem. If including tracebacks, please include the full traceback. Large logs and files should be attached.
The text was updated successfully, but these errors were encountered: