I have a Parquet file with 35 columns, and I need to check whether a specific value is present in a given column using pyarrow. Does anyone know how to do that?

Barkha C

1 Answer

The example from the dataset docs might be useful:

Something like:

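import pyarrow.dataset as ds
# `base` is a pathlib.Path to the directory holding the dataset, as in the docs
dataset = ds.dataset(base / "parquet_dataset", format="parquet")
dataset.to_table(columns=['a'], filter=ds.field('a') >= 7)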
Micah Kornfield