I need to transform data from JSON to parquet as a part of an ETL pipeline. I'm currently doing it with the from_pandas
method of a pyarrow.Table. However building a dataframe first feels like a unnecessary step, plus I'd like to avoid having pandas as a dependency.
Is there a way how to write parquet files without the need to load it in a dataframe first?