You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
val df = spark.read.parquet("/tmp/tpch-parquet/nation.parquet")
org.apache.spark.sql.AnalysisException: Unable to infer schema for Parquet. It must be specified manually.
However, if I ask Spark to read the one partition file directly, and not the directory, then it works, which confuses me,
Describe the bug
I generated TPC-H data and converted to Parquet using DataFusion. Here is the
nation
table.I can read the schema fine from bdt (which uses DataFusion)
Spark fails with:
However, if I ask Spark to read the one partition file directly, and not the directory, then it works, which confuses me,
To Reproduce
Expected behavior
Additional context
The text was updated successfully, but these errors were encountered: