Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Spark:read iceberg table data error #11336

Open
3 tasks
beyond-up opened this issue Oct 17, 2024 · 4 comments
Open
3 tasks

Spark:read iceberg table data error #11336

beyond-up opened this issue Oct 17, 2024 · 4 comments
Labels
bug Something isn't working

Comments

@beyond-up
Copy link

beyond-up commented Oct 17, 2024

Apache Iceberg version

1.5.2

Query engine

Spark

Please describe the bug 🐞

When I used iceberg-spark-runtime-3.3_2.12-1.5.2.jar to query the iceberg table data, an error was reported. The error message showed that there were null values, but there was no null value data in the table.
image

image

Willingness to contribute

  • I can contribute a fix for this bug independently
  • I would be willing to contribute a fix for this bug with guidance from the Iceberg community
  • I cannot contribute a fix for this bug at this time
@beyond-up beyond-up added the bug Something isn't working label Oct 17, 2024
@nastra
Copy link
Contributor

nastra commented Oct 17, 2024

@beyond-up can you share the full stack trace please? Usually there's some more info in other parts of the stack trace that show what went wrong

@beyond-up
Copy link
Author

beyond-up commented Oct 17, 2024

@beyond-up can you share the full stack trace please? Usually there's some more info in other parts of the stack trace that show what went wrong

I have found the cause of this problem. It is because there is an '' in the data field in the table. However, I am surprised that '' in a String type field can cause an NLP error! @nastra

@nastra
Copy link
Contributor

nastra commented Oct 17, 2024

@beyond-up so far the NPE seems to be coming from Spark itself, not from Iceberg. Do you have a small reproducible example?

@nastra
Copy link
Contributor

nastra commented Oct 17, 2024

Which exact Spark version are you using? A similar issue was reported in https://issues.apache.org/jira/browse/SPARK-39061 and was already fixed in Spark 3.3.1

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants