Re-partioning data frame and saving to parquet loses index and divisions
|
|
2
|
21
|
February 20, 2025
|
Speeding up hive partitioned queries
|
|
4
|
75
|
October 25, 2024
|
The Apache Parquet project is looking for real-world samples and feedback
|
|
1
|
40
|
October 18, 2024
|
Ideal way to create parquet part files limited for size?
|
|
4
|
129
|
August 16, 2024
|
Quick Q on dask parquet append
|
|
5
|
29
|
August 16, 2024
|
Error when calling to_parquet() "TypeError: argument of type 'int' is not iterable"
|
|
2
|
130
|
March 29, 2024
|
Reading Parquet directory from HDFS
|
|
4
|
418
|
February 12, 2024
|
Error when creating pyarrow schema from dask dataframe
|
|
2
|
1672
|
June 1, 2023
|
Option to batch read_parquet
|
|
1
|
175
|
April 11, 2023
|
Dask not distributing reading of parquet file?
|
|
1
|
1675
|
April 6, 2023
|
Dask on AWS Sagemaker Exception: 'FSTimeoutError()'
|
|
1
|
1095
|
April 28, 2022
|