I have a Parquet file with 35 columns, and I need to check whether a specific value is present in one of the columns using pyarrow. Does anyone know how to do that?
1 Answer
The example from the dataset docs might be useful:
Something like:
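import pyarrow.dataset as ds
# base is assumed to be a pathlib.Path pointing at the data directory
dataset = ds.dataset(base / "parquet_dataset", format="parquet")
# Read only column 'a' and keep the rows where a >= 7
dataset.to_table(columns=['a'], filter=ds.field('a') >= 7)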

Micah Kornfield