How to improve Dask read_parquet performance while reading 20000 parquet files (very few are corrupted)?
|
|
0
|
32
|
October 17, 2022
|
Slow processing of parquet dataset using the distributed client
|
|
1
|
53
|
October 11, 2022
|
Best practices for asserting data in dask.dataframe?
|
|
0
|
46
|
September 30, 2022
|
Map_partitions just to execute and save per partition
|
|
0
|
58
|
September 28, 2022
|
Windows jobs failing on my Pull Request
|
|
0
|
39
|
September 17, 2022
|
Dask tests randomly fail with "Heartbeat to scheduler failed" message
|
|
3
|
76
|
September 15, 2022
|
Dask dataframe groupby on already existing index column
|
|
0
|
38
|
September 8, 2022
|
Turn an array column in a dask dataframe into multiple columns
|
|
0
|
65
|
August 31, 2022
|
Create an numpy array from dask dataframe
|
|
1
|
58
|
August 31, 2022
|
Sequential reading of CSVs?
|
|
2
|
49
|
August 31, 2022
|
Map_partition function to apply a plotting function on partitions
|
|
0
|
41
|
August 30, 2022
|
Dask very slow with simple processing of large parquet file
|
|
2
|
225
|
August 29, 2022
|
Dataframe.compute hangs during iterating a dataset
|
|
1
|
84
|
August 20, 2022
|
Dataframe from sparse array
|
|
0
|
123
|
August 18, 2022
|
Dask DataFrames - Object/Byte Streams?
|
|
1
|
51
|
August 9, 2022
|
Issue in Parallel row preprocessing with Dask
|
|
2
|
94
|
August 6, 2022
|
Using DataFrame apply in a loop
|
|
2
|
143
|
August 5, 2022
|
How to sort within groups?
|
|
1
|
84
|
August 5, 2022
|
How to shuffle full data elegantly and efficiently?
|
|
1
|
74
|
July 31, 2022
|
Divisions Lost When Writing as Parquet
|
|
1
|
40
|
July 27, 2022
|
Actual and meta columns mismatch
|
|
2
|
119
|
July 22, 2022
|
Compression Levels while storing Dask DataFrame & Dask Array
|
|
2
|
86
|
July 22, 2022
|
Run Dask script from command line without warning messages
|
|
1
|
49
|
July 22, 2022
|
How does dataframe column projection optimization work?
|
|
2
|
74
|
July 14, 2022
|
Gradually build up a Dataframe
|
|
2
|
145
|
July 14, 2022
|
S3://nyc-tlc seems to have disappeared ...?
|
|
2
|
156
|
July 11, 2022
|
DataFrameIOLayer becomes MaterializedLayer when pickled
|
|
6
|
162
|
July 6, 2022
|
Dealing with the hack for dta
|
|
0
|
62
|
June 30, 2022
|
Astype giving error while type casting to object type
|
|
1
|
45
|
June 29, 2022
|
Dataframe indexes
|
|
3
|
174
|
June 16, 2022
|